You are on page 1of 7

UNDERSTANDING MPEG-2

DIGITAL VIDEO COMPRESSION

By Mark Long

First published at the World-wide Web Site of SHOWTIME/Gulf DTH

¨Ï copyright 1996 MLE INC. All Rights Reserved.

( All graphics presented below are from The New Age of Digital Video Compression,
Produced by Shelburne Films ¨Ï 1994. All Rights Reserved. )

Compressed Digital Video (CDV) is the driving force behind the world-wide revolution in
satellite delivered Direct to Home (DTH) TV program distribution. CDV television signals are
transmitted in an abbreviated format that dramatically reduces the amount of frequency
bandwidth required without substantially degrading the quality of the received pictures and
sound. The introduction of CDV technology is causing a dramatic decline in the operational
costs for TV service providers. The result has been a global explosion in the number of new
satellite delivered DTH TV services, including, news, sports, movies, Pay Per View (PPV)
special events, educational programming, and narrowcast offerings that can target the needs
of small segments within any potential viewing audience.

Personal computers have long used digital compression techniques to reduce the amount of
data storage capacity needed to save large computer files. For the pat decade, the global
telephone industry also has been using compression techniques to reduce the bandwidth, and
consequently the cost, required for establishing narrow-band telephone circuits. During the
early 1990s, communications engineers began developing high-capacity very large scale
integrated circuits (VLSICs) and sophisticated software routines that could compress broad-
band telecommunications signals such as video.

From Analogue to Digital Television

A conventional PAL or SECAM video signal contains 625 lines per individual image or frame.
Flashed at a rate of 25 frames per second, each frame has two interlaced fields, each
consisting of 312 and 1/2 lines, with field '1' displaying the even numbered lines and field '2'
displaying the odd numbered lines. The interlaced scanning of the two fields is so rapid that
the eye perceives each complete image or frame rather than the two separate fields within
any one frame.

A single frame of conventional analogue PAL or SECAM video is composed of more than
174,000 picture elements or pixels. Since video operates at a rate of 25 frames per second,
more than 4 million pixels are being sent to the TV screen each second. Digital TV
transmission systems convert the visual and audio information into streams of binary digits or
bits: strings of zeros and ones that correspond to the 'off' and 'on' logic states of computer
circuitry. A maximum of 32 bits per pixel, 8 bits per luminance element and 24 bits per
chrominance element, is needed to digitize a single color video signal. Four million pixels times
32 bits per pixel equals a transmission rate of 128 Megabits per second.

A 128 Mb/s digital TV signal would fill an entire satellite transponder to capacity! To allow
multiple digital TV signals to occupy the same transponder, some form of signal compression
is needed to dramatically reduce the number of bits used by multiple TV services occupying
the same satellite transponder.

The Moving Pictures Experts Group


In 1988, the International Standards Organization (ISO) of the International
Telecommunication Union established the Moving Pictures Experts Group (MPEG) to agree on
an internationally recognized world-wide standard for the compressed representation of video,
film, graphic and text materials. The committee's goals were to develop a relatively simple,
inexpensive, and flexible standard that put most of the complex functions at the transmitter
rather than the receiver. Representatives from more than fifty corporations and governmental
organizations world-wide took part in the MPEG committee's deliberations.

In 1991, the MPEG 1 standard was introduced to handle the compressed digital representation
of non video sources of multimedia. However, some manufacturers soon discovered that MPEG
1 could be adapted for the transmission of video signals as long as the video material was first
converted from the original interlaced mode to a progressively scanned format. A few TV
programmers elected to use MPEG 1 to transmit via satellite while the MPEG committee
developed a standard specifically for interlaced scanning applications known as MPEG 2. The
MPEG committee selected its criteria for MPEG 2 in 1994.

MPEG Compression Techniques

MPEG compression is accomplished by four basic techniques: pre-processing, temporal


prediction, motion compensation, and quantisation coding. Pre-processing filters out non
essential visual information from the video signal, information that is difficult to encode but
not an important component of human visual perception. Motion compensation takes
advantage of the fact that video sequences are most often highly correlated in time: each
frame in any given sequence may be similar to the preceding AND following frames.

The compressed digital video encoder scans subsections within each frame, called macro
blocks, and identifies which ones will not change position from one frame to the next. The
encoder also identifies predictor macro blocks while noting their position and direction of
motion. Only the relatively small difference, called the motion compensated residual, between
each predictor block and the affected current block is transmitted to the receiver. The
receiver/decoder stores the information that does not change from frame to frame it's buffer
memory and uses it periodically to fill in the blanks, so to speak.
A mathematical algorithm called the Discrete Cosine Transform (DCT) reorganizes the residual
difference between frames from a spatial domain into an equivalent series of coefficient
numbers in a frequency domain that can be more quickly transmitted. Quantization coding
converts these sets of coefficient numbers into even more compact representative numbers.
The encoder refers to an internal index or code book of possible representative numbers from
which it selects the code word that best matches each set of coefficients. Quantization coding
also rounds off all coefficient values, within a certain range of limits, to the same value.
Although this results in an approximation of the original signal, it is close enough to the
original to be acceptable for most viewing applications.

I, P, and B Frames

MPEG 2 provides for up to three types of frames called the I, P and B frames. The intra-frame,
or 'I' frame, serves as a reference for predicting subsequent frames. 'I' frames, which occur on
an average of one out of every ten to fifteen frames, only contains information presented
within itself. 'P' Frames are predicted from information presented in the nearest preceding 'I'
or 'P' frame. The bi-directional 'B' frames are coded using prediction data from the nearest
preceding 'I' or 'P' frame AND the nearest following 'I' or 'P' frame.

Not all MPEG 2 systems use 'B' frames. Although a more efficient level of compression is
achieved by 'B' frames, compatible receiver/decoders must have an additional memory buffer,
which increases the cost of the decoder.
MPEG 2 Satellite Transmission Rates

The transmission speed required for any MPEG 2 broadcast varies according to the nature of
the video material. The MPEG 2 encoder located at the satellite uplink has a finite time to
make encoding decisions. Pre-recorded movies and other taped material do not push the time
constraints of the encoder to the limit; the encoder can select at its leisure the most efficient
method for encoding at the lowest possible data rate. Live sports and other live action
materials require a higher data rate because the encoder is forced to make immediate coding
decisions and must also transmit complex, rapid motion changes without introducing high
levels of distortion.

* Representative MPEG-2 Transmission Rates

Type of Video Service Data Rate

▪ Movies (VHS quality) : 1.152 Mb/s


▪ News/Entertainment : 3.456 Mb/s
▪ Live Sports Event : 4.608 Mb/s
▪ 16:9 Wide Screen TV : 5.760 Mb/s
▪ Studio Quality Broadcast TV : 8.064 Mb/s
▪ High Definition Television : 14.00 Mb/s

Audio or Data Services Data Rate

▪ Monaural sound : 0.128 Mb/s


▪ Stereo sound (L + R) : 0.512 Mb/s
▪ Digital Data : 9.6 kb/s

MPEG DTH Transmissions


DTH satellite TV programmers use a transmission format called Multiple Channel Per Carrier
(MCPC) to multiplex two or more program services into a single unified digital bit stream. With
MCPC, a package of program services can use the same conditional access and forward error
correction systems, thereby economizing on the overall bandwidth and transmission speed
requirements. What's more, programmers can dynamically assign capacity within the digital
bit stream of any multiplexed transmission, so that more bits are available to a live sport
broadcast and fewer bits to a news report or interview program consisting of 'talking heads.'
At the conclusion of a live basketball game, for example, the digital capacity previously used
to relay a single sport event even could be used to transmit four separate movies.

Digital Satellite TV Installations

Satellite installers usually tune in to conventional satellite TV signals while viewing a TV


monitor connected to the receiver and adjusting the antenna pointing. During the fine tuning
process, extremely weak signals are detected first by the appearance of a sync bar across the
TV screen, then black and white pictures, and finally, a color picture that is totally devoid of
impulse noise or sparklies.

Tuning into an MPEG 2 compressed digital video signal, however, is totally different. Weak
signals virtually appear to be random noise; the receiver will not display any picture at all until
sufficient signal is reaching the antenna. Then once the digital threshold of the
receiver/decoder has been exceed, a perfect picture will appear on the TV screen. MPEG 2
reception is like a light switch: it's either on or off. Furthermore, if the installer moves past the
antenna's peak performance position, the receiver/decoder will 'freeze frame' on the last
picture in its buffer memory and will not receive any further video until the antenna is brought
back to it optimum position.

Rain fades, which occur whenever these is moisture in the atmosphere between the satellite
and the receiving system on the ground, can degrade the incoming signal level. DTH installers
are advised to use antenna tuning meters or portable spectrum analyzers to peak antenna
performance, so that the DTH systems they install will have some signal margin above
receiver/decoder threshold to compensate for any Ku-band signal degradation caused by rain
fades.

The ABCS of Digital Receiver/Decoders


Coping in the new digital age will require a basic understanding of a few new terms that have
grown up surrounding this new technology. The following is a brief explanation of some of the
new digital jargon that you may come across.

* Bit Rate : This is the amount of data information being transmitted in one second of time.
The total bit stream passing through a single satellite transponder consists of as many as eight
TV services with associated audio, auxiliary audio services, conditional access data, and
auxiliary data services such as teletext. The informational bit rate for this transmission may be
as high as 49 Mega (million) bits per second (Mb/s) over a 36-MHz-wide satellite transponder.
A single video signal within this bit stream will have a lower bit rate. For example, a VHS
quality movie can be transmitted at a bit rate of 1.152 Mb/s; a news or general entertainment
TV program at 3.456 Mb/s; live sports at 4.608 Mb/s or studio-quality broadcasts at a rate of
more than 8 Mb/s.

* Bit Error Rate (BER) : Measured in exponential notation, the Bit Error Rate (BER)
expresses the performance level of the digital receiver. For example, a higher BER of 5.0 x 10-
5 is superior to a lower BER of 9.0 x 10-4. The higher the BER, the greater the
receiver/decoder's ability to perform well during marginal reception conditions, such as when
there is heavy rainfall, snow or wind gusts.

* Digital Receiver/Decoder Threshold : Unlike traditional analogue Receiver/decoders,


where the unit continues to deliver a picture even when it is operating below the
receiver/decoder's threshold rating, digital receiver/decoders stop dead in their tracks when
the signal level falls below the digital threshold level of the unit. It's like a light switch: either
on or off. Manufacturers define the threshold as a BER such as 3.0 x 10-3. Reception stops
whenever the BER falls below receiver threshold.

* Digital Video Broadcasting (DVB) : A 'broadcast specific' version of MPEG 2 developed in


Europe for DTH, cable and terrestrial TV applications which numerous satellite programmers
world-wide have since adopted for their systems. However, just because a satellite
receiver/decoder is MPEG 2 DVB compliant doesn't mean that it can pick up any MPEG 2 DVB
compliant satellite TV signal. Major differences in encryption and conditional access methods,
use or non-use of 'B' frames, and other technical parameters have prevented MPEG 2 from
becoming a truly world-wide video standard. A few programmers have formed alliances so
that their DTH subscribers will have access to the maximum number of services available from
a given satellite. Other programmers, however, have elected to use unique systems that can
only access a single DTH program package.

* Forward Error Correction (FEC) : The digital bit stream contains special codes that the
receiver/decoder can use to check to ensure that all bits of information sent have been
received. If bit errors due to thermal noise, for example, are detected, the receiver/decoder
can use the convolutional encoding information that has been forwarded to it by the
transmitting station to either correct each detected error or conceal it. Burst noise
interference, which can be caused by nearby automobile ignition noise or microwave ovens,
can be detected by the Reed Solomon block decoder.
.

You might also like