Professional Documents
Culture Documents
MultiMedia1 - Overview - Chapter 1,2,3-Doc Den Trang 4-133
MultiMedia1 - Overview - Chapter 1,2,3-Doc Den Trang 4-133
Introduction (2)
Overview
q
q
q
n
n
n
n
n
n
n
q
4/2/2003
JPEG
MPEG-1/MPEG -2 Audio & Video
MPEG-4
MPEG-7 (brief introduction)
HDTV (brief introduction)
H261/H263 (brief introduction)
Model base coding (MBC) (brief introduction)
CATV systems
DVB systems
Introduction
n
4/2/2003
Multimedia network
Introduction
Chapter 1: Background of compression techniques
Chapter 2: Multimedia technologies
4/2/2003
The Internet was designed in the 60s for low-speed internetworks with boring textual applications High delay,
high jitter.
Multimedia applications require drastic modifications
of the INTERNET infrastructure.
Many frameworks have been being investigated and
deployed to support the next generation multimedia
Internet. (e.g. IntServ, DiffServ)
In the future, all TVs (and PCs) will be connected to the
Internet and freely tuned to any of millions broadcast
stations all over the World.
At present, multimedia networks run over ATM (almost
obsolete), IPv4, and in the future IPv6 should
guarantee QoS (Quality of Service) !!
Why compression ?
q
2 types of compression:
q
q
4/2/2003
Lossless compression
Lossy compression
Nguyen Chan Hung Hanoi University of Technology
Lossy Compression
Information rate
q Entropy is the measure of information content.
n
4/2/2003
q
q
Lossless Compression
n
4/2/2003
4/2/2003
Process of Compression
4/2/2003
Why sampling?
q
PCM
q
Quantization
q
4/2/2003
Predictive coding
n
4/2/2003
4/2/2003
11
Drawbacks of compression
Prediction
q
10
Artifacts
q
4/2/2003
12
4/2/2003
13
Frame-Differential Coding
n
n
q
q
15
4/2/2003
4/2/2003
Unpredictable Information
14
4/2/2003
16
Scene change
q
4/2/2003
17
Transform Coding
q
q
q
q
q
4/2/2003
19
4/2/2003
Uncovered information
q
18
4/2/2003
20
Masking
n
Run-Level coding
21
Variable quantization
q
q
4/2/2003
4/2/2003
23
Key points:
4/2/2003
n
n
n
Compression process
Quantization & Sampling
Coding:
q
q
q
q
q
22
Masking
4/2/2003
24
Roadmap
n
JPEG
MPEG-1/MPEG-2 Video
MPEG-1 Layer 3 Audio (mp3)
MPEG-4
MPEG-7 (brief introduction)
HDTV (brief introduction)
n
n
n
n
n
n
4/2/2003
25
q
q
q
q
q
JPEG decoder
q
Input image A:
q
q
26
Output image B:
q
27
4/2/2003
JPEG - DCT
n
JPEG encoder
q
4/2/2003
4/2/2003
28
n
n
n
n
n
q
q
q
q
q
29
4/2/2003
58
-12
-4
-6
78
-1
-1
-73 -27
-1
-5
-5
-4
-1
-4
-5
-3
q
q
11
-65
80
-49
37
-87
12
10
27
-50
29
13
13
-6
-16
21
-11 -10
10
-21
-6
-1
-14
14
-14
16
-8
-4
-1
-13
12
-9
-1
-4
-2
-7
-1
30
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 EOB
31
DCT Coefficients
Quantization result
Zigzag scan result: 78 -1 1 -4 -5 4 4 6 3 2 -1 -3 -5 -4 -1 0 -1 0 1 1 0 0 0 0 0 0 0
4/2/2003
MPEG-1 (Obsolete)
43
MPEG standards
n
1255 -15
MPEG standards:
q
4/2/2003
4/2/2003
32
Block
q
q
4/2/2003
33
4/2/2003
q
q
q
q
q
q
q
q
4/2/2003
= 8 x 8 array of pixels.
A block is the fundamental unit for the DCT coding
(discrete cosine transform).
35
Macroblock
34
4/2/2003
36
Slice
n
n
I, P, B Pictures
Encoded pictures are classified into 3 types: I, P, and B.
n I Pictures = Intra Coded Pictures
q
q
q
Nguyen Chan Hung Hanoi University of Technology
37
Picture
n
n
n
n
q
q
q
4/2/2003
n
n
39
n
n
4/2/2003
B B
B P
B
Time
4/2/2003
38
4/2/2003
40
Sequence
n
n
q
q
q
q
q
q
Transport stream
q
q
picture size
aspect ratio
frame rate and bit rate
optional quantizer matrices
required decoder buffer size
chroma pixel structure
optional user data
q
q
4/2/2003
41
n
n
n
n
4/2/2003
42
4/2/2003
43
4/2/2003
Video filter
Discrete cosine transform (DCT)
DCT coefficient quantizer
Run-length amplitude/variable length coder (VLC)
44
Video Filter
n
q
q
n
n
n
4/2/2003
n
n
45
4:2:2
(8 blocks)
YYYYCbCrCbCr
Studio production
environments
Professional editing
equipment,
4:4:4
(12 blocks)
YYYYCbCrCbCrCbCrCbCr
Computer graphics
4/2/2003
Upper bounds:
1152 x 1920, 60Hz progressive
80 Mbits/s
Nguyen Chan Hung Hanoi University of Technology
47
Application
YYYYCbCr
4:2:0
(6 blocks)
4/2/2003
46
4/2/2003
48
Prediction
4/2/2003
n
n
n
49
q
q
q
q
q
4/2/2003
4/2/2003
F(u,v) = two-dimensional
NxN DCT.
u,v,x,y = 0,1,2,...N-1
x,y are spatial coordinates in
the sample domain.
u,v are frequency coordinates
in the transform domain.
C(u), C(v) = 1/(square root
(2)) for u, v = 0.
C(u), C(v) = 1 otherwise.
51
I(1) B(2) B(3) P(4) B(5) B(6) P(7) B(8) B(9) P(10) B(11)
B(12) I(13)
I(1) B(2) B(3) P(4) B(5) B(6) P(7) B(8) B(9) P(10) B(11)
B(12) I(13)
50
I(1) P(4) B(2) B(3) P(7) B(5) B(6) P(10) B(8) B(9) I(13) B(11)
B(12)
Eq 3 Normal form
Eq 4 Matrix form
Where:
q
IDCT:
q
Eq 1 Normal form
Eq 2 Matrix form
DCT:
q
I P B Picture Reordering
n
4/2/2003
52
Quantization matrix
MPEG scanning
n
Note DCT
coefficients are:
53
4/2/2003
After adaptive
quantization, the
result is a matrix
containing many
zeros.
55
4/2/2003
Why ?
q
4/2/2003
54
4/2/2003
56
Amplitude
N/A
8 (DC Value)
110 1000
00001100
00001100
01000
01000
01000
110
110
110
110
12
0010 0010 0
EOB
EOB
10
4/2/2003
MPEG
Code Value
n
n
q
q
q
q
57
4/2/2003
4/2/2003
59
Adaptation Field:
q
q
q
q
q
58
MPEG packages all data into fixed-size 188-byte packets for transport.
Video or audio payload data placed in PES packets before is broken up
into fixed length transport packet payloads.
A PES packet may be much longer than a transport packet Require
segmentation:
4/2/2003
q
q
q
q
q
q
PCR_flag
OPCR_flag
splicing_point_flag
transport_private_data_flag
adaptation_field_extension_flag
The optionalfields are present if
indicated by one of the preceding flags.
The remainder of the adaptation field is
filled with stuffing bytes (0xFF, all
ones).
60
1.
2.
3.
4.
Timing - Synchronization
n
4/2/2003
n
n
4/2/2003
61
Video stream
Audio stream
Point A:
Encoder input
Constant/specifi
edrate
Point B:
Encoder
output
Variable rate
Point C:
Encoderbuffer
output
Constant rate
Point D:
Communication
channel +
decoderbuffer
Constant
rate
Point E:
Decoder input
Variable rate
Point F:
Decoderoutput
Constant/specifi
edrate
62
4/2/2003
63
Decode Time Stamp (DTS) can optionally combined into the bit
stream represents the time at which the data should be taken
instantaneously from the decoder buffer and decoded.
q
q
q
q
n
n
n
DTS and PTS are identical except in the case of picture reordering for B
pictures.
The DTS is only used where it is needed because of reordering.
Whenever DTS is used, PTS is also coded.
PTS (or DTS) inserted interval = 700 m S.
In ATSC PTS (or DTS) must be inserted at the beginning of each
coded picture (access unit ).
4/2/2003
64
n
n
n
HDTV (2)
All video and audio streams included in a program must get their
time stamps from a common STC so that synchronization of the
video and audio decoders with each other may be accomplished.
The data rate and packet rate on the channel (at the multiplexer
output) can be completely asynchronous with the System Time
Clock (STC)
PCR time stamps allows synchronizations of different
multiplexed programs having different STCs while allowing STC
recovery for each program.
If there is no buffer underflow or overflow delays in the buffers
and transmission channel for both video and audio are
constant.
The encoder input and decoder output run at equal and constant
rates.
Fixedend-to-end delay from encoder input to decoder output
If exact synchronization is not required, the decoder clock can be
free running video frames can be repeated / skipped as
necessary to prevent buffer underflow / overflow , respectively.
4/2/2003
65
4/2/2003
HDTV proposals are for a screen which is wider than the conventional
TV image by about 33%. It is generally agreed that the HDTV aspect
ratio will be 16:9, as opposed to the 4:3 ratio of conventional TV
systems. This ratio has been chosen because psychological tests have
shown that it best matches the human visual field.
It also enables use of existing cinema film formats as additional source
material, since this is the same aspect ratio used in normal 35 mm film.
Figure 16.6(a) shows how the aspect ratio of HDTV compares with that
of conventional television, using the same resolution, or the same
surface area as the comparison metric.
To achieve the improved resolution the video image used in HDTV
must contain over 1000 lines, as opposed to the 525 and 625 provided
by the existing NTSC and PAL systems. This gives a much improved
vertical resolution. The exact value is chosen to be a simple multiple of
one or both of the vertical resolutions used in conventional TV.
However, due to the higher scan rates the bandwidth requirement for
analogue HDTV is approximately 12 MHz, compared to the nominal 6
MHz of conventional TV
4/2/2003
67
HDTV (3)
66
4/2/2003
68
H261- H263
n
n
H261-H263 (3)
4/2/2003
69
H261-H263 (2)
n
n
n
n
n
4/2/2003
71
4/2/2003
70
At the very low bit rates (20 kbit/s or less) associated with video
telephony, the requirements for image transmission stretch the
compression techniques described earlier to their limits.
In order to achieve the necessary degree of compression they
often require reduction in spatial resolution or even the
elimination of frames from the sequence.
Model based coding (MBC) attempts to exploit a greater degree
of redundancy in images than current techniques, in order to
achieve significant image compression but without adversely
degrading the image content information.
It relies upon the fact that the image quality is largely subjective.
Providing that the appearance of scenes within an observed
image is kept at a visually acceptable level, it may not matter that
the observed image is not a precise reproduction of reality.
4/2/2003
72
n
n
4/2/2003
73
n
n
n
n
n
n
n
4/2/2003
75
Key points:
n
n
n
n
Prediction
Motion compensation
Scanning
YCbCr formats (4:4:4, 4:2:0, etc)
Profiles @ Level
I,P,B pictures & reordering
Encoder/ Decoder process & Block diagram
74
Pixel, Block, Macroblock, Field DCT Coding / Frame DCT Coding, Slice,
Picture, Group of Pictures (GOP), Sequence, Packetized Elementary Stream
(PES)
4/2/2003
STC/SCR/DTS
PCR/PTS
76
Technical terms
n
n
n
n
n
n
n
n
n
n
n
n
n
n
A Brief History:
Macro blocks
HVS = Human Visual System
GOP = Group of Pictures
VLC = Variable Length Coding/Coder
IDCT/DCT = (Inverse) Discrete Cosine Transform
PES = Packetized ElementaryStream
MP@ML = Main profile @ Main Level
PCR = Program Clock Reference
SCR = System Clock Reference
STC = System Time Clock
PTS = Presentation Time Stamp
DTS = Decode Time Stamp
PAT = Program Association Table
PMT = Program Map Table
4/2/2003
77
4/2/2003
Coaxial only
Tree-and-Branch only
TV only
No return path ( high-pass filters are installed in
customers houses to block return low frequency noise)
79
Key elements:
q
Overview:
A brief history
q Modern CATV networks
q CATV systems and equipments
q
q
q
q
q
4/2/2003
78
4/2/2003
CO or
Master
Headend
Headends/
Hub
Server
complex
CMTS
TV content
provider
Optical
Nodes
Taps
Amplifiers
(GNA/TNA/L
E)
80
n
n
4/2/2003
FDM
Nguyen Chan Hung Hanoi University of Technology
81
4/2/2003
83
Vocabulary
n
4/2/2003
82
4/2/2003
84