Professional Documents
Culture Documents
2 Audio compression
• Unlike text and images, both audio and most video 4.2.1 Speech coding
signals are continuously varying analog signals.
• Differential pulse code modulation (DPCM) is a
• Compression algorithms associated with digitized audio derivative of standard PCM and exploits the fact that, for
and video are different from those associated with text most audio signals, the range of the differences in
and images. amplitude between successive samples of the audio
waveform is less than the range of the actual sample
amplitudes. (G.711)
CYH/MMT/CmpAV/p.1 CYH/MMT/CmpAV/p.2
• Summary of speech compression standards and their 4.2.2 Perceptual coding
applications:
• Audio signal is coded based on a psychoacoustic model
Standard Compression Compressed Quality Example
technique bit rate (kbps) applications which describes the limitations of the human ear.
G.711 PCM+ 64 Good PSTN/ISDN
companding telephony • Ear is more sensitive to some signals than others.
G.721 ADPCM 32 Good Telephony at
16 Fair reduced bit • Frequency masking: A strong signal may reduce the
rates level of sensitivity of the ear to other signals which are
G.722 ADPCM 64 Excellent Audio near to it in frequency.
with 56/48 Good conferencing
subband
• Temporal masking: When the ear hears a loud sound, it
coding
G.726 ADPCM 40/32 Good General takes a short but finite time before it can hear a quieter
with 24/16 Fair telephony at sound.
subband reduced bit
coding rates
LPC-10 LPC 2.4/1.2 Poor Telephony in
military
networks
G.728 Code-excited 16 Good Low delay/low
LPC (CELP) bit rate
telephony
G.729 CELP 8 Good Telephony in
cellular
networks
G.729(A) CELP 8 Good Simultaneous
telephony and
data (fax)
G.723.1 CELP 6.3 Good Video and
5.3 Fair internet
telephony
CYH/MMT/CmpAV/p.3 CYH/MMT/CmpAV/p.4
CYH/MMT/CmpAV/p.5 CYH/MMT/CmpAV/p.6
MPEG audio coders
CYH/MMT/CmpAV/p.7 CYH/MMT/CmpAV/p.8
Dolby audio coders
CYH/MMT/CmpAV/p.11 CYH/MMT/CmpAV/p.12
• I-frames:
• P-frames:
CYH/MMT/CmpAV/p.13 CYH/MMT/CmpAV/p.14
• A prediction of the target frame is made with the
reference frame based on the motion vectors obtained.
• B-frames:
CYH/MMT/CmpAV/p.15 CYH/MMT/CmpAV/p.16
• The number of frames between successive I-frames is
known as a group of pictures (GOP).
CYH/MMT/CmpAV/p.17 CYH/MMT/CmpAV/p.18
• Basic bitstream format:
• Type : type of frame , I, P or B
• Address : identifies the location of the macroblock in
the frame
• Quantization value: the threshold value used to
quantize all DCT coefficients in the macroblock.
• Motion vector: encoded vector
• Block present: indicates which block in the
macroblock are present
CYH/MMT/CmpAV/p.19 CYH/MMT/CmpAV/p.20
4.3.2 H.261
• Encoding format:
• H.261 has been defined by the ITU-T for the provision
• Type: indicates if the macroblock is intracoded or
of video telephony and videoconferencing services over
intercoded
an ISDN.
• Address: identifies the location of the macroblock in
• Supports I- and P-frames only. the frame
• Quantization value: the threshold value used to
quantize all DCT coefficients in the macroblock.
• Motion vector: encoded vector
• Coded block pattern: indicates which block in the
macroblock are present
• Picture start code: indicates the start of a new frame.
• Temporal reference: a timestamp for the decoder to
synchronize the video information with the audio
information.
• Picture type: indicates if the frame is encoded as I- or
P-frame.
• GOB start code: is a resynchronization marker which
is used for resynchronization in case of error.
CYH/MMT/CmpAV/p.21 CYH/MMT/CmpAV/p.22
4.3.3 H.263
• H.263 has been defined by the ITU-T for use in a range
of real-time video applications over wireless and PSTNs.
CYH/MMT/CmpAV/p.23 CYH/MMT/CmpAV/p.24
4.3.4 MPEG • Typical figures of the compression ratios
• I-frames: 10:1
• The Motion Pictures Expert Group (MPEG) was formed
by the ISO to formulate a set of standards relating to a • P-frames: 20:1
range of multimedia applications that involve the use of • B-frames: 50:1
video with sound.
CYH/MMT/CmpAV/p.25 CYH/MMT/CmpAV/p.26
• Bitstream format:
CYH/MMT/CmpAV/p.27 CYH/MMT/CmpAV/p.28
• Video parameters: specify the screen size and aspect MPEG2 : ISO Recommendation 13818
ratio
• Bitstream parameters: indicate the bit rate and the size • It supports four levels - low, main, high 1440 and high -
of the memory/ frame buffers that are required each targeted at a particular application domain.
• Quantization parameters: contain the contents of the
quantization tables that are to be used. • There are 5 profiles associated with each level: simple,
- main, spatial resolution, quantization accuracy and high.
• GOP start code: indicates the start of a GOP
• The different combinations of levels and profiles form a
• Time stamp: used for synchronization purposes framework for all standards activities associated with
• Parameters: defines the particular sequence of frame MPEG-2.
types that are used in each GOP (e.g. IPPBPP)
- • One of the most popular setting is the MP@ML standard
• Picture start code: indicates the start of a frame which is for digital television broadcasting.
• Type: indicates if it's a I-, P- or B-frame • There are 3 standards associated with HDTV: advanced
• Buffer parameters: indicate how full the buffer should television (ATV) in North America, digital video
be before the decoding operation should start broadcast (DVB) in Europe, and multiple sub-Nyquist
• Encode parameters: indicate the resolution of a motion sampling encoding (MUSE) in Japan.
vector.
ATV DVB MUSE
-
Aspect ratio 16/9 4/3 16/9
• Slice start code: indicates the start of a slice Resolution 1280x720 1440x1152 1920x1035
• Vertical position: indicates the scan line in which the Compression MP@HL of SSP@H1440 Similar to
slice is (video) MPEG2 of MPEG2 MP@HL
Compression Dolby AC-3 MP2
• Quantization parameters: indicates the scaling factor
(Audio)
that applies to this slice.
CYH/MMT/CmpAV/p.29 CYH/MMT/CmpAV/p.30
• Summary of video compression standards
Standard Digitization Compressed Example applications
format bit rate
H.261 CIF/QCIF x64kbps Video telephony/
conferencing over ISDN
and LANs
H.263 S-QCIF/ <64kbps Video telephony/
QCIF conferencing and security
surveillance over low bit
rate channels
MPEG-1/ SIF <1.5Mbps Storage of VHS-quality
ISO11172 video on CD-ROMs
MPEG-2/
ISO13818
Low SIF <4Mbps Recording of VHS-quality
video
Main 4:2:0 <15Mbps Digital video broadcasting
4:2:2 <20Mbps
High 1440 4:2:0 <60Mbps HDTV (4/3 aspect ratio)
4:2:2 <80Mbps
High 4:2:0 <80Mbps HDTV (16/9 aspect ratio)
4:2:2 <100Mbps
MPEG-4 Various 5kbps- Versatile multimedia
tens Mbps coding standard
CYH/MMT/CmpAV/p.31