Professional Documents
Culture Documents
where 0(t) =
echo suppression or cancellation is required to
attenuate further the echo level.
[50(t)...
x(t) [x(t). . .x(tp+l)]T
The echo suppressor provides a satisfactory
solution to the echo silencing problem for are the canceller coefficient vector and the
moderately delayed (say less than 100 ms) signal vector, respectively.
echoes. But in circuits with long round—trip
delay, e.g., a long-distance call through a
1398
x(t)
ro(t) = rx(o)O
x(0)
8. For n = O,...,p•-l update X(t) =
RC(t) = R(t—1) + (1 —
n(tfl e(t) Lx(t) ... x(t-p+1)
= XAn(t_l) + (1 —
(t}) r(t-l) e(t) The term xT(t) (XT(t_l) X(t-1)) x(t) converges
to zero at a rate proportional to II(t-p).
Therefore the echo residual variance converges to
9. For n = l,...,p update the measurement noise level a2 at a rather fast
rate. As long as there is no near—end speech, the
echo residual remains at the measurement noise
K(t-1) = L(t-1)[R(t-2)i1 level after convergence. When the near-end
y(t) = n_i(t) + K_1(t—1) rni(tl) speaker starts to interrupt, the residual level
increases substantially. This abrupt increase of
- the residual amplitude is a clear indication of
en(t) = y(t) y(t) the on-set of the near-end speech and a likelihood
detector can thus be formed. Once it is detected,
= x4(t-1) + (1 -
cn(t)) r(t-l) en(t) a procedure should be taken to "freeze" or slow
10. Go to 2 down the adaptation to prevent the canceller
coefficients from being diverged by large error
*ISE FLAT DELAY ESTIMATION signals. The likelihood detector is based on the
ratio between the echo residual amplitude and the
Depending on the length of the "tail" of the estimated measurement noise level a. The
4-wire system, the length of the echo return path measurement noise variance can be estimated by a
response varies. The dispersive portion of the one-pole window as
response is usually around 2 ms and the flat—delay
portion can be rather long. For a 15 ms response =x 2(._) 4 (1-x) e2(t) (13)
the adaptive echo canceller should be of 120 taps
(at a 8 kHz sampling frequency) to cover the whole The threshold used to separate the outliers from
range. However, only 16—20 taps are really the good data points is adjustable.
necessary to simulate the dispersive portion of
the response, the rest taps corresponding to the COMPUTER SIMULATION RESULTS
flat delay should all be set to zero not only to
reduce the circuit complexities but to lower the Due to the limit of space we present only
so—called "misadjustment" noise which is due to some of our simulation •results.A mixed—phase
the fluctuation of filter coefficients around ARMA(2,2) model is used to simulate the echo
their true values. However, if we want to turn return path transfer function where y(t) is the
off the "flat-delay" taps to match the generated echo and x(t) is the input signal. An
corresponding flat delay and maintain the rest adjustable delay used to introduce the flat
"dispersive taps" to be adaptive, the flat delay delay. The impulse response of the ARMA(2,2)
has to be estimated first. We propse here a model is depicted in Fig. 7 with 40 samples of
minimum-mean—squared—error (MMSE) based flat-delay flat delay. The input signal x(t) to the echo
estimator. It is of a parallel structure as shown return path is generated by driving a zero mean,
in F1QS. 5 and 6 for the corresoondina LS lattice unit variance Gaussian random sequence through an
1399
The canceller coefficients are updated by the A(t) = A(t-1) - K(t) E(t)
instantaneous gradient of the error square
0(t) = KT(t) x(t)
e2(t) as A
V0e2(t) = -2e(t) x(t) (3)
ae (t) = (1 0(t)) c(t)
-
The LMS gradient algorithm is known as the r(t) x(t-p) + BT(t1) x(t+1)
stochastic approximation realization of the
recursive LS (RLS) algorithm where the norm K(t+1) = L(t) - B(t-1) q(t) (1 - q(t) r(t)Y4
[
P(t) = 4 [P(t) ] (8) its filter configuration as it is shown in Fig. 3,
t-1) x(t) but with a rather different filter gain updating
where P(t) (9) mechanism.
1400
8th order elliptic bandpass filter. At an 8 kHz Digitized Speech as Input
sampling rate the simulated input signal is 300
Hz—3400 Hz bandpassed, which is typical in a One segment of digitized speech was used as
telephone channel. An adjustable amount of random the input and the LMS, the Fast Kalman and the LS
Gaussian noise can be added to the simulated echo lattice algorithms were all tested. The results
y(t) to take into account the channel noise, A—D are depicted in Figs. 11.
quantization noise, and the background noise. A
separate 8th order elliptic bandpass filter driven Outlier Near—End Speech Detector
by a separately generated random Gaussian sequence
is used to generate a near-end 2nd speaker's Two segments of digitized speech were used as
signal. the far—end and near—end speech signals to test
the near—end speech detector and the "freezing" of
Though different in configurations, adaptation. The results are depicted in Fig.
computational complexities as well as numerical 12. Note that the echo was cancelled while the
characteristics, all LS type echo cancellers such 2nd speaker's signal, even the low amplitude
as the RLS, the fast Kalman or the LS lattice are, unvoiced portion was very well preserved.
in principle, equivalent. Therefore, sometimes
the simulation results are presented in only one REFERENCES
type of LS filters. However, the simulations were
done in a 32 bit floating—point arithmetic. In a Duttweiler, D.L. [1978] "A Twelve-Channel Digital
finite arithmetic and digital hardware level , it Echo Canceller", IEEE Trans. Comm. Vol. Corn—
is expected that different LS algorithms will 26, May, pp 647-53.
behave differently.
Kailath, T. [1980] Linear System,Prentice—Hall.
Comparisons Between the LS and the LMS Algorithms Ljurig, L., Morf, M., and Falconer, D. [1978] "Fast
Calculation of Gain Matrices for Recursive
The first experiment is to compare the Estimation Schemes", mt. J. Control, Vol. 27,
convergence rate between the LS type algorithms No. 1, Jan., pp. 1—19.
and the most widely used LMS gradient algorithm in Morf, M. and Lee, D.T.L., [1978] "Fast Algorithms
their echo cancellation performance. for Speech Modelling", Tech. Rept. M303-1,
Information System Lab., Stanford Univ.,
The echo to measurement noise ratio (ENR) is Stanford, CA, December.
40 dB. No flat delay is introduced. 25 Soong, F.K. [1981] "Fast Least— Squares Estimation
statistically independent runs with 1000 samples and Its Applications," Ph. D dissertation,
in each run were conducted and a 30th order filter Dept. of E.E., Stanford University
Weinstein, S.B. [1977] "Echo Cancellation in the
whose coefficients were initiated at zero values
telephone Network", IEEE Comm. Magazine, Jan.,
was used. A power normalized step size was used
pp. 9—15.
for the LMS gradient filter, see Duttweiler [1978)
Widrow, B. [1970] "Adaptive Filters", From Aspects
for details of this choice of . A comparable
forgetting factor Xwhich is 1 — was chosed for of Network and System Theory, N., Holt,
Rinehart and Winston.
the LS lattice. Scatter plots and the ensamble
average of the 25 runs are depicted in Figs. 8 and
9, respectively. The LS lattice converges to the
measurement noise level (-40 dB) around the 200th
samples while the echo residual of the LMS
gradient filter is still at the -30 dB level at
the end of each run.
1401
y)t) • e(t)
x)t)__
LMS or
FAST KALMAN
It)
1.00
60
.60
.40
.20
.00
—.20
—.40
— .60
20 40 60 60 100 120
Scoipi es
x(t)H I
F'— x( t) —H
___. x(t-1)___-.
x(t-L) I
-0000
-000
.
.
. ... .
.. .
-9000 -
x(t) —so -— .
400 600 1000
1
1110 IN SNIPLES
e0(t) NB
10 OO[-
o AR PLOTS OF IZATIONS
-00 ( LS Lorotce )
e1(t)
-00 O0
I
e(t)
1402
BB
—10.00 ( kB MOnIEioT
-20 00
-30.00
-00.00
—00 00
90 E
000 400 500 500 10(01,
----—
10.OC, (LSLATTICE)
20 CC
AW\
-S0 00
B
-50.00
BNIDPASSED (spur (110 55 55I441(IC WOVE
-0
000 400 600 000 1000 (b)
lIMO IN SN.IPLES
Fig. 10 Flat-delay estimator performance
Fig. 9 Ensemble average of 25 runs
2.2
Woo RESIDUAL LoNCE MOFUT
- 1.00
(LB [ERECT Fc> (LS Wuico)
- I
jff.;Nf —1.00
i
[EVE RESIDUAL
5
CASCE
(LS Lorrico) (FAST [Eu) I. 0
0,04 0.0 A L I I I)
1 I
0.30 t.0I
Ii
[coo RESIDUAL
(FAST K500WI)
2.0:
[coo [coo * [coo Pom WISE
1.00
I
-1.00
1403