You are on page 1of 2

B.Tech.

in Electronics and Communication Engineering, V/VI/VII/VIII Semester

SPEECH PROCESSING
(PROFESSIONAL ELECTIVE- V)

Course Code:19EC1165 L T P C

3 0 0 3

Prerequisites: Signals and Systems, Digital Signal Processing

Course Outcomes: At the end of the course the student will be able to
CO1: Understand production of speech.
CO2: Analyze time domain models for speech signals.
CO3: Analyze linear predictive coding techniques.
CO4: Illustrate the homomorphic speech processing.
CO5: Analyze the speech enhancement techniques.

UNIT-I 12 Lectures
Fundamentals of Digital Speech Processing
Anatomy and Physiology of Speech Organs, The process of Speech Production, Acoustic
Phonetics, Articulatory Phonetics, The Acoustic Theory of Speech Production- Uniform lossless
tube model, effect of losses in vocal tract, effect of radiation at lips, Digital models for speech
signals.

Learning outcomes: At the end of this unit, the student will be able to
1. understand speech production mechanisms in humans (L2)
2. differentiate between acoustic phonetics and articulatory phonetics (L2)
3. describe the vocal tract system in human beings (L2)

UNIT-II 08 Lectures
Time Domain Models for Speech Processing
Introduction- Window considerations, Short time energy and average magnitude Short time
average zero crossing rate, Speech Vs Silence discrimination using energy and zero crossing, Pitch
period estimation using a parallel processing approach, The short time autocorrelation function,
The short time average magnitude difference function, Pitch period estimation using the
autocorrelation function.

Learning outcomes: At the end of this unit, the student will be able to
1. analyze different time domain features of speech signals (L4)
2. interpret speech and silence in a speech sample using energy and zero crossing (L2)
3. illustrate pitch period, short time autocorrelation and short time average magnitude
difference functions (L3)

UNIT-III 12 Lectures
Linear Predictive Coding (LPC) Analysis
Basic principles of Linear Predictive Analysis: The Autocorrelation Method, The Covariance
Method, Solution of LPC Equations: Cholesky Decomposition Solution for Covariance Method,

74 (R-2019) 
B.Tech. in Electronics and Communication Engineering, V/VI/VII/VIII Semester

Durbin’s Recursive Solution for the Autocorrelation Equations, Comparison between the Methods
of Solution of the LPC Analysis Equations, Applications of LPC Parameters: Pitch Detection using
LPC Parameters, Formant Analysis using LPC Parameters.

Learning outcomes: At the end of this unit, the student will be able to
1. analyze autocorrelation and covariance methods (L4)
2. describe different methods for solving LPC equations (L2)
3. summarize the methods of solution of LPC analysis equations (L2)

UNIT-IV 8 Lectures
Homomorphic Speech Processing
Introduction, Homomorphic Systems for Convolution: Properties of the Complex Cepstrum,
Computational Considerations, The Complex Cepstrum of Speech, Pitch Detection, Formant
Estimation, The Homomorphic Vocoder.

Learning outcomes: At the end of this unit, the student will be able to
1. illustrate the Homomorphic systems for convolution (L3)
2. interpret the homomorphic system for pitch detection and formant estimation(L2)
3. demonstrate homomorphic vocoder (L3)

UNIT-V 10 Lectures
Speech Enhancement
Nature of interfering sounds, Speech enhancement techniques: Single Microphone Approach:
spectral subtraction, Enhancement by re-synthesis, Comb filter, Wiener filter, Multi microphone
Approach.

Applications of Speech Processing:


Speech recognition systems, Text-to-Speech system and Speaker recognition systems: speaker
verification systems, speaker identification systems.

Learning outcomes: At the end of this unit, the student will be able to
1. understand the interfering sounds (L2)
2. demonstrate different speech enhancement systems (L3)
3. Illustrate the applications of speech processing (L4)

Text Books:
1. L.R. Rabiner and S. W. Schafer, Digital Processing of Speech Signals, Pearson Education,
2008.
2. Human & Machine, Douglas O'Shaughnessy, Speech Communications, 2nd Edition., Wiley
India, 2000.
References:
1. Thomas F. Quateri, Discrete Time Speech Signal Processing: Principles and Practice, 1st
Edition, PE, 2001.
2. Ben Gold & Nelson Morgan, Speech & Audio Signal Processing, 1st Edition, Wiley, 2011.

***

75 (R-2019) 

You might also like