Professional Documents
Culture Documents
Appendix 1
The AMR codec dynamically adapts its error protection level to the channel
error conditions. For instance, lower speech coding bit rate and more error
protection schemes are used in bad channel conditions. This principle is illustrated
in Figure A1.1 where AMR strives to change to the best curve associated to a given
AMR mode. It has been shown that the degradation on the audio quality caused by a
lower speech coding rate is compensated by increased robustness with the channel
coding. Note, however, that this channel robustness is more beneficial in GSM than
in UMTS due to the embedded fast power control used in WCDMA systems. Using
a variable-rate transmission scheme also makes it possible to control the
transmission power of the UE, a fact that is particularly useful when the UE
376 UMTS
suddenly attains its maximum transmit power: in CDMA: lower bit rates generally
need lower transmit power and vice versa.
Figure A1.2 depicts the generic structure of the AMR frame. As observed in the
figure, the frame is divided into a header, auxiliary information and core frame. The
header contains the Frame Types and Frame Quality Indicator fields. The Frame
Type can indicate the use of one of the eight AMR codec modes for that frame, a
noise frame, or an empty frame. The Frame Quality Indicator indicates if the frame
is good or bad. The auxiliary information part includes the Mode Indication, Mode
Request and Codec CRC fields. The CRC field is used for the purpose of error-
detection calculated over all the Class A bits in the AMR Core frame. The Core
frame part is used to carry the encoded bits divided into A, B and C classes. In case
of a comfort noise frame, comfort noise parameters, i.e. a SID frame, replace “class
A” bits of the core frame while “class B” and “class C” bits are omitted.
AMR frame
Class A bits
Class B bits Core frame
(speech or comfort noise)
Class C bits
Table A1.1. AMR modes and relationship with AMR frame structure
378 UMTS
Based on the fact that voice activity in a normal conversation is about 40%, all
AMR modes implement a Voice Activity Detection (VAD) algorithm that detects if
each 20 ms-frame contains speech or not on the transmitting side. VAD works
together with the Discontinuous Transmission (DTX) or Source Controlled Rate
(SCR) [TS 26.093] techniques where RF transmission is cut during speech pauses.
When the transmission is cut, “comfort noise” parameters are sent at a regular rate in
AMR frames during discontinuous activity. These frames are known as SID (SIlence
Descriptor) frames. The receiver decodes these parameters and generates locally a
“comfort noise”. Without this background noise the participants in a conversation,
might think that their connection is broken during silence periods. The SCR
technique for AMR in UMTS is mandatory and aims at prolonging the battery life
(UE side) and reducing the interference.
The AMR mode adaptation in UMTS networks means using different AMR
coding for the data stream. Mode adaptation can independently be applied in the
uplink and the downlink. At any point in time, a different AMR mode can be used in
each direction and this can be dynamically changed during a voice conversation.
from radio quality measurements and from traffic volume measurements. The
command is sent to the encoder inside the TC via the Iu interface.
Control
of AMR modes
UE Node B RNC TC
AMR speech Uu Iub Iu-CS AMR speech
codec codec
AMR encoded speech data in DL with new AMR mode (ongoing call)
Change Request to modify
AMR mode the AMR mode in UL
AMR encoded speech data in UL with new AMR mode (ongoing call)
Figure A1.3. Overview of AMR codec mode control during an ongoing voice call
Figure A1.4. Implementation of AMR 12.2 kbps mode in the radio interface
The AMR wideband (AMR-WB) codec has been standardized in 3GPP and is
part of the specification in Release 5. The codec is based on the same adaptive
AMR Codec in UMTS 381
principles as the AMR narrowband. The AMR-WB comprises nine codec modes:
6.6 kbps, 8.85 kbps, 12.65 kbps, 14.25 kbps, 15.85 kbps, 18.25 kbps, 19.85 kbps,
23.05 kbps and 23.85 kbps. The encoder of the AMR-WB is able to code an audio
signal with bandwidth between 50 and 7,000 Hz. A higher sampling rate is thus
needed compared with the narrowband approach (16 kHz instead of 8 kHz) leading
to a 14 bit samples with 16,000 samples/s. Wideband coding provides improved
voice quality especially in terms of increased voice naturalness since it covers twice
the audio bandwidth compared to the classical telephone voice bandwidth of 4 kHz
[TS 26.190, R5].