Professional Documents
Culture Documents
Techniques
Mr. L.Ramesh
AP/ECE
Introduction
Speech
Air pushed from the lungs past the vocal
cords and along the vocal tract
The basic vibrations – vocal cords
The sound is altered by the disposition of the
vocal tract ( tongue and mouth)
Model the vocal tract as a filter
The shape changes relatively slowly
The vibrations at the vocal cords
The excitation signal
Speech sounds
Voiced sound
The vocal cords vibrate open and close
Quasi-periodic pulses of air
The rate of the opening and closing – the pitch
Unvoiced sounds
Forcing air at high velocities through a constriction
Noise-like turbulence
Show little long-term periodicity
Short-term correlations still present
Plosive sounds
A complete closure in the vocal tract
Air pressure is built up and released suddenly
Voice Sampling
Discrete Time LTI Systems: The Convolution
Sum
x[n] x[k ] [n k ]
k
y[n] x[k ]h[n k ]
k
1
h[n]
0 1 2 n
2.5
2 2
x[n] y[n]
0.5 0.5
0 1 n 0 1 2 3 n
Nyquist sampling theorem
X c ( j )
s (t ) (t nT )
n
N N
xs (t ) xc (t ) s (t )
xc (t ) (t nT )
S 0 X c ( j ) S n
2
S ( j )
T
( k )
k
s
S N N S
( S N )
Quantization (Scalar
Quantization)
v1 v2 vk+1 vL
m0 = -A m1 m2 …… 0 mL=A
x[n] ^
x[n]
F(x) Uniform Uniform F1(x)
Quantization Decoder
A-law log 1 μ x
F ( x) ,0 x 1
log( 1 μ)
Ax 1
,0 x
F ( x ) 1 lnA A
1 ln[ A x ] , 1 x 1
1 lnA A
formant structure of
speech signals
A good approximation,
though not precise enough
LPC Vocoder(Voice Coder)
x[n] { ak }
LPC Encoder
Analysis N,G
…11011…
v/u
N by pitch detection
v/u by voicing detection
receiver
{ ak } x[n]
Decoder Ex g[n]
N,G G(z)
…11011…
v/u
law
65 kbps DS0 rate
North America
A-law
Other countries, a little friendlier to
lower signal levels
An MOS of about 4.3
ADPCM(adaptive differential
PCM)
DPCM and ADPCM.
ADPCM : Adaptive Prediction in DPCM
Adaptive Quantization
Adaptive Quantization
Quantization level varies with local signal level
[n] = ax[n]
x[n] : locally estimated standard deviation of x[n]