You are on page 1of 7

Mobile crosstalk control—Enhancing speech

quality in digital cellular networks


Anders Eriksson, Maria Eriksson, Tõnu Trump and Teresa Vallon Hulth

Although today’s mobile phones satisfy recommendations for echo attenua- match in the 4-wire to 2-wire conversion in
tion, under certain conditions echo that originates from the acoustic PSTN subscriber interfaces. Today, it is
crosstalk in mobile handsets can still be noticed by users. Acoustic common for international and mobile
crosstalk echo and network echo demand different approaches. Hence, switching centers to employ echo cancellers
Ericsson has developed mobile crosstalk control, a special algorithm that for controlling echo generated in the local
loop on the PSTN end of the connection.
eliminates acoustic crosstalk echo and enhances speech quality in digital
Echo can also occur in cellular networks
cellular networks. as the result of acoustic crosstalk inside mo-
The authors briefly describe echo in the PSTN and how network echo can- bile handsets due to the acoustic coupling
cellers work. They explain the fundamental differences between network between the microphone and loudspeaker of
echo and echo that originates from mobile digital handsets, the particular the digital handset. Echo of this kind is best
requirements for eliminating acoustic crosstalk echo in digital cellular net- controlled within the handset, as is recog-
works, and the methods that mobile crosstalk control uses to eliminate nized in several international recommenda-
echo. The authors also discuss the challenge of evaluating the speech quali- tions. For example, the GSM specification
ty that results from a specific algorithm and the variety of situations that an requires that echo attenuation (measured in
algorithm must be able to handle. the switching system as the path loss from
the input to the speech coder to the output
from the speech decoder) should measure at
least 46 dB.1 Because transmission in digi-
tal cellular systems is “4-wire” throughout,
mobile telephones are usually not regarded
Echo in digital cellular as a potential source of echo in the system.
networks Although handsets satisfy requirements
for echo attenuation, under certain condi-
Cellular operators have identified speech tions users can still notice echo that origi-
quality as being a significant competitive nates from acoustic crosstalk. This is main-
factor in today’s market, which has spurred ly due to two factors:
developments aimed at removing distur- • The test specification does not take into
bances from the voice channel in cellular account all possible variations in the po-
systems. sition of the handset during a normal con-
A major cause of disturbance in telepho- versation.
ny systems is echo, which occurs when part • The line levels within the telephone sys -
of a speaking party’s voice signal energy is tem may deviate from nominal levels.
reflected back to him or her (Figure 1). Echo Echo from acoustic crosstalk can be annoy-
with a substantial delay (physical or pro- ing to users. If the echo can be eliminated,
cessing delay in the transmission path) is the overall speech quality of the system will
irritating in telephone conversations. A improve. However, because acoustic-
typical source of echo is the impedance mis- crosstalk echo differs greatly from conven-

Figure 1
Sources of echo that affect digital cellular
networks.
Figure 2
Location of network echo cancellers in digital
cellular networks.
MSC Mobile services switching center
LE Local exchange
H Hybrid

tional network echo, cancelling it requires


Box A different solutions.
Abbreviations and definitions

CNG ERL
Over view of network
Comfort noise generator. Echo return loss. The attenuation of the echo cancellers
Comfort noise signal as it passes the echo path. Today, international and mobile switching
A signal—whose characteristics are sim- ERLE centers employ echo cancellers to control
ilar to the background sound—that is Echo return loss enhancement. Attenua- echo generated in the PSTN local loop. The
added to the output of an echo canceller tion obtained by linear filtering tech- means of cancelling this type of echo have
in order to mitigate the undesired effects niques in an echo canceller. been the subject of extensive study.2
of the non-linear processor. FIR Ordinarily, the PSTN echo path can be
Conversation test Finite impulse response. accurately modeled using a linear finite-
Test in which test persons carry on a con- GSM impulse-response (FIR) filter with coeffi-
versation, simultaneously evaluating the Global system for mobile communication. cients that are constant or that vary slowly
speech quality of the conversation. Handover over time. The echo canceller is usually po-
D-AMPS The process of a mobile phone changing sitioned in the system so that the duration
Digital advanced mobile phone service. cells while a call is in process. Handover
of the echo path is less than 64 ms (Figure
Downlink is the term used in GSM. In D-AMPS
2). An adaptive FIR filter with up to 512
Transmission from a base station to and PDC, the corresponding term is hand-
coefficients is used for modeling the echo
a mobile handset in a cellular network. off.
DSP MCC
path. The echo path might change sudden-
Digital signal processor. Mobile crosstalk control. An algorithm ly if the PSTN subscriber changes tele-
DTX developed by Ericsson for deployment phone sets or if a third party or additional
Discontinuous transmission. To save in the switching system. The algo- equipment is connected to the call.
power, the transmitter turns off when no rithm solves the problem of acoustic The attenuation of echo, which is mea-
one is speaking into the handset. This func- crosstalk in hand-held mobile tele- sured in decibels, is called echo return loss
tion is called DTX in GSM and D-AMPS and phones. (ERL). Echo return loss from the PSTN,
voice-operated transmitter (VOX) in PDC. NLP which depends on the hybrid used in the
EC Non-linear processor. A function in an network, can vary significantly depending
Echo canceller. echo canceller which further attenuates on the local loop. According to K. Shenoi3,
ECP 303 echo; for example, by completely or par- the ERL can generally be regarded as a ran-
Ericsson echo canceller in pool model tially blocking the transmission signal. dom variable picked from a Gaussian dis-
303. PCM tribution. For the US network, the distrib-
ECP 323 Pulse code modulation. ution has a mean value of 13.6 dB and stan-
Ericsson echo canceller in pool model PDC dard deviation of 2.8 dB in a segregated
323 (model 303 featuring MCC). Personal digital cellular. loop-balancing scheme. Recordings taken
ECP 404 PSTN from the Swedish PSTN show a similar dis-
Ericsson echo canceller in pool model 404. Public switched telephony network. tribution but a higher mean value (Figure 3).
ECP 424 Uplink The adaptive FIR filter is used to model
Ericsson echo canceller in pool model Transmission from a mobile handset to a the impulse response of the echo path (Fig-
424 (model 404 featuring MCC). base station in a cellular network. ure 4). The filter yields echo return loss en-
hancement (ERLE), which reduces echo by
about 20 to 40 dB. The reduction is limit-
ed by non-linearities; that is, the part of the
echo that cannot be modeled using a linear
filter. One source of non-linearities is the
quantization of signals in the analog-to-dig-
ital converter. Other sources are possible
non-linear effects (such as saturation) in hy-
brid or telephone sets. Figure 5 shows the
optimum ERLE achievable using linear fil-
tering techniques for different echo paths in
the Swedish PSTN.
The adaptive algorithm used in the FIR Figure 3
filter can accurately estimate the coeffi- Histogram of echo return loss values mea-
cients, provided the signal from the PSTN sured for hybrids in the Swedish PSTN.
consists mainly of echo. However, if speech
or background sound from the PSTN dom-
inates the echo—that is, if “double talk” ex-
ists—the algorithm returns a poor estimate
of the echo path. What is more, the power
of voice signals varies greatly from one mo-
ment to the next, meaning that the control
part of the echo canceller must determine at
each instant whether or not conditions are
favorable for filter adaptation.
The echo that remains after linear pro-
cessing, called residual echo, is often still au-
dible. A non-linear processor (NLP) is used
to remove it. When residual echo is expect-
ed, the NLP blocks the signal, either in part
or completely. However, because the level
of residual echo is not known in advance, the
NLP must be adaptive, taking into account
all possible variations in echo attenuation
(the sum of echo return loss and echo return-
loss enhancement). Consequently, the de-
sign of the NLP function is critical to user Figure 4
perceptions of the performance of echo can- Echo canceller principles. A replica of the
echo is obtained via a linear filter and sub-
cellers. A well-designed NLP, which acti- tracted from the PSTN subscriber signal. The
vates only if needed, should preserve speech residual echo signal is further suppressed
and background sound from the PSTN end using a non-linear processor.
as faithfully as possible.
Finally, an echo canceller includes a
comfort-noise generator (CNG) that gener-
ates a signal whose characteristics are simi-
lar to the background sound on the PSTN.
When the NLP is active, the comfort-noise
generator adds a comfort-noise signal to the
output, which reduces any undesirable ef-
fects of the background sound modulation
perceived by users.

Digital handset echo vs.


network echo
From the viewpoint of the switching sys-
tem, there are many characteristics that dis- Figure 5
tinguish the echo that originates from the Histogram of achievable echo return loss
acoustic crosstalk in a mobile digital hand- enhancement (ERLE) for PSTN hybrids as
set from network echo. Only minor signal measured in the Swedish PSTN.
ties talk simultaneously, the DTX func-
tion is not activated, in which case echo
might be noticed. Owing to double talk,
this echo cannot be modeled successfully
by an adaptive algorithm.
• Non-linear effects are introduced by
speech coding and bit errors in the radio
transmission. For typical communication,
speech coding and radio transmission re-
duce the ERLE achievable by a linear fil-
ter to less than 10 dB. The acheived ERLE,
which to a large extent depends on the
input signal, can vary significantly over a
Figure 6
Differences in the echo paths of acoustic short period. This contrasts with the
crosstalk and PSTN echo. much more consistent ERLE achievable
MSC Mobile services switching center in a network echo canceller, which is lim-
LE Local exchange ited to 38 dB because of PCM coding in
H Hybrid
the network echo path.
These problems make it significantly more
complicated for the switching system to
disturbances occur in the echo path of the control acoustic crosstalk echo from a mo-
network echo canceller (Figure 6). Howev- bile handset than to control network echo.
er, the echo path of the echo canceller that At the same time, some other factors sim-
controls acoustic crosstalk includes radio plify the design of the acoustic crosstalk al-
transmission. This gives rise to fundamen- gorithm. First, the length of the acoustic
tal differences in the characteristics of the crosstalk echo path (the dispersion), which
mobile echo path, compared with network is usually less than 5 to 10 ms, is relatively
echo cancellation. short compared with that of network echo.
• The delay in the handset echo is long, Second, the overall level of echo from the
since radio transmission requires coding mobile handset is significantly lower than
and interleaving. The lower limit of this that of network echo, thanks to the design
delay is specified by the delays in each of the handsets. In most cases, the level of
necessary processing block in the echo echo from the mobile handset is compara-
path. In a GSM system, the delay is ap- ble to or lower than the level of residual echo
proximately 180 ms. The actual echo after linear processing in a network echo can-
delay for a call can vary, depending on the celler.
handset, system hardware, extra signal Taken together, the factors discussed
processing equipment, and the routing of above clearly show that the linear echo-path
the call. model used for designing a network echo
• The characteristics of the mobile echo canceller is not adequate for treating the
path are apt to vary rapidly and frequently problem of echo from digital handsets. Con-
due to changes in the position of the hand- sequently, an ordinary network echo can-
set. The characteristics may also be af- celler will not produce good results for the
fected by bit errors in the radio transmis- latter type of echo, which demands a differ-
sion, handover, or discontinuous trans- ent algorithm.
mission—in order to save power, the mo-
bile unit might interrupt transmission; Basics of the MCC
this function is called discontinuous
transmission (DTX)—in this case, on the algorithm
uplink. Discontinuous transmission in The basic requirement for any device used
the uplink causes large variations in the in the switching system for cancelling echo
characteristics of the echo path. The DTX that originates from acoustic crosstalk in
function is activated for the uplink when mobile terminals is that the device must
only the downlink carries speech signals. never introduce any artifacts in calls when
Thus the handset produces no echo. Spu- audible acoustic crosstalk is not present.
rious echo bursts might be transmitted Ericsson has developed an algorithm that
anyway, although the duration of the can successfully operate under the condi-
bursts is generally too short for them to tions described in the previous section and
be modeled successfully. When both par- that also meets this basic requirement. The
Figure 7
MCC principles. The delay and level of the
echo is estimated. The estimates are then
used for controlling an attenuator in the
transmission path.

new function is called mobile crosstalk con- unit is even more difficult. Different meth-
trol (MCC). ods are used for making subjective evalua-
The MCC algorithm (Figure 7) detects the tions. Today, no methods for making ob-
presence of echo in the uplink signal and de- jective measurements exist. Traditional
termines its delay and power. In order for methods of subjectively evaluating speech
the echo canceller to react properly to quality on transmission channels, such as
changes caused by hand-over, handset posi- methods used for evaluating speech coders,
tioning, or the amplification setting on the are not appropriate. Due to the duplex na-
handset, the delay and power of the echo ture of the MCC algorithm, evaluations
must be estimated continuously during the must involve simultaneous speech from the
call. mobile handset end and the PSTN end, as
If echo is the dominant component of the is done when evaluating network echo can-
uplink signal, and is regarded as annoying cellers. Different types of speech-quality-
to the PSTN subscriber, the uplink signal impairment must be defined for evaluating
is attenuated—but only for short periods, the echo-cancellation algorithm. At pre-
determined by an estimate of echo power. sent, four classes have been defined:
This ensures that the algorithm preserves • Echo—residual echo not handled by the
the duplex quality of the mobile connection. algorithm.
To prevent line noise or background • Clipping—loss of speech and back-
sound from the digital handset from being ground sound from the mobile handset
modulated, carefully designed comfort end. In addition to attenuating the echo,
noise is added to the uplink whenever the the algorithm may unintentionally at-
attenuator is activated. tenuate the speech and background
sound from the mobile subscriber end.
Demands put on mobile • Distortion—unintentional modification
(other than clipping) of speech from the
crosstalk control mobile handset end.
The task of mobile crosstalk control is ex- • Faulty insertion of comfort noise—the
acting: insertion of comfort noise that is percep-
• It must cancel the acoustic crosstalk echo tually different from actual background
from the cellular phone. sound.
• It must never degrade speech quality if The speech quality produced by the algo-
echo is not present. rithm for cancelling echo from the mobile
• It must never introduce clipping into or handset end must be evaluated in listening
distort the incoming speech signal. and conversation tests. In a listening test,
Evaluating the speech quality produced a group of people listens to a recording of
by network echo cancellers is difficult; eval- an unmodified conversation and compares
uating the speech quality produced by an al- it with a simulation of the same conversa-
gorithm for cancelling echo from a mobile tion modified by the algorithm. This
method can also be used to compare differ- faithful in coding; superior speech-
ent algorithms. Any impairments caused by coding algorithms make the echo easier
an algorithm are clearly audible, which to hear. For example, the GSM enhanced-
makes listening tests a controlled means of full-rate offers better speech coding than
evaluating algorithms. GSM full-rate. Consequently, the former
Listening tests are particularly helpful should produce a clearer echo than the
during development, but must be aug- latter.
mented by conversation tests, in order to ac- • Different background sound—because
curately evaluate impairment factors. User the handset is portable, it is used in en-
perception of clipping in a listening test may vironments with varying background
be more irritating than it would be in a real sound, such as music or ambient noise in
conversation, since the side tone in a gen- cars or buildings. When the algorithm
uine conversation masks short speech gaps. activates attenuation, the comfort noise
Moreover, a listening test is inadequate for generator must be able to handle any of
evaluating residual echo: listeners may have these situations.
difficulty perceiving recorded echo as gen- • Different line-level settings—the level of
uine echo, because they do not hear their echo depends on the line level. For ex-
own voices and the echo is not coordinated ample, if the line levels in the transcoder
with their own speech. are changed, the level of the echo will also
The evaluation of algorithm performance change; if the line level on the downlink
is indeed quite complex, and an algorithm is increased, the level of the echo will also
for cancelling echo from the mobile hand- increase.
set end must satisfy many requirements. For • Different handset loudspeaker vol-
example, the tests must evaluate the perfor- umes—the level of the echo depends on
mance of an algorithm under several differ- the volume setting of the handset. If the
ent situations (Figure 8): loudspeaker volume is increased, the
• Different mobile handsets—different level of the echo increases.
handsets produce different amounts of • Various bit error rates—bit errors in the
echo. Some mobile handsets have built-in radio interface make echo non-linear, and
echo cancellers, which also differ in per- the complete loss of frames causes time
formance. An algorithm for cancelling variations in the echo path.
echo from the mobile handset end must • Handover—time variations are intro-
never allow deterioration of speech qual- duced at handover, owing to the loss
ity, especially for a handset that does not of speech frames. The echo delay might
produce echo. also change when the handover is be-
• Different speech-coding algorithms—all tween base stations that use different
speech-coding algorithms are not equally hardware.
Figure 8
• Discontinuous transmission—when the
System characteristics that influence MCC DTX function is activated in the uplink,
performance. GSM shown as an example. spurious echo bursts may be transmitted.
TRA Transcoder The MCC algorithm has been tested for
FR Full-rate each of the above situations. Mobile
HR Half-rate
EFR Enhanced full-rate crosstalk control cancels echo, if present,
+?–? Line levels differ without clipping or distorting speech from
the mobile handset end, and without in-
serting faulty comfort noise when
background sound is present. Hence, the
MCC algorithm successfully enhances
speech quality in digital cellular networks.

Echo cancellers with


mobile crosstalk control
Ericsson’s family of echo cancellers in pool
(ECP)2 products has proven effective in a va-
riety of systems. Now, mobile crosstalk
control is available in a digital signal
processor (DSP) software module designed
to be loaded on existing ECPs. Operators
Figure 9
ECP 424: MCC implemented on the ECP 404
hardware platform. A single board contains 32
echo canceller devices featuring MCC.

already using echo cancellers in pool can add cellent speech quality as demonstrated in in-
mobile crosstalk control without altering dependent evaluations.
their current hardware platform or changing
the number of echo cancellers in their ex- Conclusion
change cabinets. Ericsson also offers new
products that include mobile crosstalk con- Fundamental differences in echo that origi-
trol (Figure 9): nate in the PSTN local loop (network echo)
• ECP 323 (the ECP 303 with MCC); and mobile digital handsets (acoustic
• ECP 424 (the ECP 404 with MCC). crosstalk echo) require different echo can-
The MCC function in the ECP 323 and ECP cellation methods. Ericsson’s MCC software
424 echo cancellers is always active in traf- estimates the delay and level of echo in mo-
fic cases that require a network echo can- bile handsets, takes into account the diverse
celler, thereby providing extra enhancement situations that can cause the occurrence and
of speech quality. Also, because of the high variation of echo, and successfully attenu-
degree of variation in the path delay of the ates it, yielding improved speech quality.
echo from the mobile handset end, MCC The MCC software augments existing
adapts to any delay between 144 and 320 software in Ericsson’s ECP products with-
ms, with no parameter adjustment needed. out affecting the ECP’s algorithms for can-
Finally, the algorithm of the network echo celling network echo. The new ECP combi-
canceller part is unchanged in the new units nations thus provide enhanced speech qual-
and thus continues to provide the same ex- ity in digital cellular networks.

References
1 GSM Technical Specification 11.10,
version 4.9.0, European Telecommuni-
cations Standards Institute, 1994.
2 Eriksson, A., Eriksson, G., Karlsen, J.,
Roxström, A. and Vallon Hulth, T.:
“Ericsson echo cancellers—A key to
improved speech quality,” Ericsson
Review, No. 1, 1996, pp. 25–33.
3 Shenoi, K.: Digital Signal Processing
in Telecommunications, Prentice Hall,
1995.

You might also like