CHAPTER 7
Multimedia Data Compression
By: Moti B.
What is Compression?
Compression
• A process of deriving more compact (i.e., smaller) representations of data.
• A coding process that effectively reduces the total number of bits needed to represent
certain information.
• Representation of a file in the fewest possible bits, as accurately as possible
Goal of Compression
Remove redundancy
Reduce irrelevance
Constraints on Compression
• Perfect or near-perfect reconstruction (lossless/lossy)
Cont'd…
• Strategies for Compression
Reducing redundancies
Exploiting (manipulating) the characteristics of human vision
• If compression and decompression processes induce no information loss, then the
compression scheme is lossless; otherwise, it is lossy.
• Lossy does not necessarily mean loss of quality. In fact, the output can sometimes be "better" than
the input.
• Drop random noise in images (dust on the lens)
• Drop background noise in music
• Fix spelling errors in text and put it into better form
• Writing is the art of lossy text compression
Why Compression?
• To reduce the volume of data to be transmitted (text, fax, images)
• To reduce the bandwidth required for transmission and to reduce storage
requirements (speech, audio, video)
Compression
• How is compression possible?
• Redundancy in digital audio, image, and video data
• Properties of human perception
• Digital audio is a series of sample values; image is a rectangular array of pixel
values; video is a sequence of images played out at a certain rate
Data Compression
• Data compression is the reduction or elimination of redundancy in data representation in
order to save storage space, speed up file transfer, and decrease costs for storage
hardware and network bandwidth.
• Compression is used to reduce the size of one or more files. When a file is compressed, it
takes up less disk space than the uncompressed version and can be transferred to other
systems more quickly. There are two types of data compression.
Types of Compression
• Lossless data compression
• Lossless data compression uses algorithms that allow the exact original data to be
reconstructed from the compressed data
• The original can be recovered exactly. Higher quality, bigger files.
• Lossless compression is used for legal and medical documents, computer programs
• Exploits only data redundancy
• Error-free compression
• Lossy data compression
• Lossy data compression does not allow the exact original data to be reconstructed
from the compressed data.
• Only an approximation of the original can be recovered. Lower quality, smaller files.
• Used for digital audio, images and video, where some errors or loss can be tolerated
• Exploits both data redundancy and the properties of human perception
• Error-containing compression
Cont'd…
Summary
Lossless:
• Original data and the data after compression and decompression are exactly the same.
• Redundant data is removed during compression and added back during decompression.
• Used when we can't afford to lose any data: legal and medical documents, computer programs.
• Used to compress text, images, sound and programs.
Lossy:
• Original data and the data after compression and decompression are not exactly the same.
• A lossy algorithm removes information that it cannot later restore.
• Used for compressing images and video files (our eyes cannot distinguish subtle changes, so some loss is acceptable).
• Used to compress still images, video and audio.
Lossless Compression
Common methods to remove redundancy
• Basics of Information Theory
• Run Length coding
• Huffman coding, etc.
Information Theory
Information theory is a branch of science that is concerned with quantifying
information.
• Information theory is defined to be the study of efficient coding and its
consequences.
• An interface between modeling and coding
• It is the field of study concerned with the storage and transmission of data.
• It is concerned with source coding and channel coding.
Source coding: involves compression
Channel coding: how to transmit data, how to overcome noise, etc.
• Data compression may be viewed as a branch of information theory in which the
primary objective is to minimize the amount of data to be transmitted.
Entropy
• The measure of information of a set is known as the Shannon entropy or entropy.
• Entropy is a measure of the number of specific ways in which a system may be
arranged, commonly understood as a measure of the disorder of a system.
• The change in information before and after the split is known as the information gain.
• The entropy η of an information source with alphabet S = {s1, s2, ..., sn} is defined as
η = H(S) = Σi pi log2(1/pi) = −Σi pi log2 pi
• where pi is the probability that symbol si in S will occur.
• The term log2(1/pi) indicates the amount of information (the so-called self-information)
contained in si.
Cont'd…
• If all four outcomes have an equal probability of 1/4, then the average number of bits needed
is 4 × (1/4) × log2(1/(1/4)) = 2 bits. To communicate (transmit) the results of our two binary
decisions, we would need to transmit 2 bits.
• Example: Calculate the entropy of an image whose histogram shows a uniform distribution of
gray-level intensities, that is, pi = 1/256 for all i. The entropy of this image is:
η = 256 × (1/256) × log2 256 = 8 bits
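As a concrete illustration of the formula, the following short Python sketch (the helper function entropy is just for this example) reproduces both results above:

```python
import math

def entropy(probs):
    """Shannon entropy in bits: sum of p * log2(1/p) over the non-zero probabilities."""
    return sum(p * math.log2(1.0 / p) for p in probs if p > 0)

# Four equally likely outcomes -> 2 bits per symbol.
print(entropy([0.25, 0.25, 0.25, 0.25]))   # 2.0

# Uniform histogram over 256 gray levels -> 8 bits per pixel.
print(entropy([1.0 / 256] * 256))          # 8.0
```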
Run Length coding (RLC)
• Run-length coding is one of the simplest forms of data compression.
• It can be used to compress data made of any combination of symbols.
• It does not need to know the frequency of occurrence of symbols and can be very
efficient if data is represented as 0s and 1s.
• The general idea behind this method is to replace consecutive repeating occurrences
of a symbol by one occurrence of the symbol followed by the number of
occurrences.
• The method can be even more efficient if the data uses only two symbols (for
example 0 and 1) in its bit pattern and one symbol is more frequent than the other.
Cont'd…
Compress repeated 'runs' of the same character by storing the length of that run, and
provide a function to reverse the compression.
Input: WWWWWWWWWWWWBWWWWWWWWWWWWBBBWWWWWW
WWWWWWWWWWWWWWWWWWBWWWWWWWWWWWWWW
Output: 12W1B12W3B24W1B14W
Run-length encoding for two symbols.
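A minimal Python sketch of this character-run scheme (the function names rle_encode and rle_decode are illustrative, not from any particular library):

```python
import re
from itertools import groupby

def rle_encode(text):
    """Replace each run of a repeated character by '<count><char>'."""
    return "".join(f"{len(list(group))}{char}" for char, group in groupby(text))

def rle_decode(encoded):
    """Expand each '<count><char>' pair back into a run of characters."""
    return "".join(char * int(count) for count, char in re.findall(r"(\d+)(\D)", encoded))

# The input from the example above: 12 W, 1 B, 12 W, 3 B, 24 W, 1 B, 14 W.
data = "W" * 12 + "B" + "W" * 12 + "B" * 3 + "W" * 24 + "B" + "W" * 14
encoded = rle_encode(data)
print(encoded)                      # 12W1B12W3B24W1B14W
assert rle_decode(encoded) == data  # the round trip recovers the original exactly
```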
Example
Original bit stream :
• 000000111111111111110000000000000111111111
• Size: 42 bits, because we have 6 zeros, 14 ones, 13 zeros and 9 ones (6 + 14 + 13 + 9 = 42)
• The compressed bit stream is:
• 0:6, 1:14, 0:13, 1:9
• Each run is encoded in 5 bits: one bit for the run's symbol followed by a 4-bit run length
• 00110 11110 01101 11001 (compressed bits)
• Size: 20 bits
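The fixed-width variant used here can be sketched the same way, under the assumption that each run is stored as one symbol bit plus a 4-bit length (so runs longer than 15 would have to be split):

```python
def rle_encode_bits(bits):
    """Encode each run as: 1 symbol bit + 4-bit run length (runs assumed to be <= 15)."""
    out, i = [], 0
    while i < len(bits):
        j = i
        while j < len(bits) and bits[j] == bits[i]:
            j += 1                          # extend the current run
        out.append(bits[i] + format(j - i, "04b"))
        i = j
    return " ".join(out)

# The 42-bit stream from the example: 6 zeros, 14 ones, 13 zeros, 9 ones.
stream = "0" * 6 + "1" * 14 + "0" * 13 + "1" * 9
print(rle_encode_bits(stream))  # 00110 11110 01101 11001  -> 20 bits instead of 42
```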
Huffman Coding
• Huffman coding is a lossless data compression algorithm, developed by David
Huffman in the early 1950s while he was a PhD student at MIT.
• The algorithm is based on a binary-tree, frequency-sorting method that allows any
message sequence to be encoded into a shorter bit string, together with a method to
reassemble the original message without losing any data.
• The algorithm is based on the frequency of occurrence of each data item (byte).
• The most frequent data items are encoded with a smaller number of bits.
• The main idea of the algorithm is to create a binary tree, called the Huffman tree, based
on the byte frequencies in the data, where the leaves are the byte symbols, and the
path from the root to a leaf determines the new representation (code word) of that leaf's byte.
Building the Huffman Tree
• Each node of the tree is labelled with a byte symbol and the frequency of that byte
in the data.
• The creation of the Huffman tree has the following steps:
1. Scan the data and calculate the frequency of occurrence of each byte;
2. Insert those nodes into a reverse priority queue based on the frequencies (the lowest
frequency is given the highest priority);
3. Repeat the following until only one node remains in the queue:
4. Remove the two lowest-frequency nodes from the queue and combine them into an internal
node whose frequency is the sum of the two nodes' frequencies;
5. Make the two removed nodes the children of the new internal node;
6. Insert the new internal node into the queue;
7. The last node remaining in the queue is the root of the tree.
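The steps above can be sketched in a few lines of Python, using heapq as the priority queue (the function name huffman_codes and the tuple-based tree representation are illustrative choices, not part of the original slides):

```python
import heapq
from collections import Counter

def huffman_codes(data):
    """Build a Huffman tree for the symbols in `data` and return {symbol: code word}."""
    freq = Counter(data)                                  # step 1: symbol frequencies
    # Heap entries: (frequency, tie-breaker, tree). A leaf is just the symbol;
    # an internal node is a (left, right) pair.
    heap = [(f, i, sym) for i, (sym, f) in enumerate(freq.items())]
    heapq.heapify(heap)                                   # step 2: priority queue
    count = len(heap)
    while len(heap) > 1:                                  # steps 3-6: repeat until one node is left
        f1, _, left = heapq.heappop(heap)
        f2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (f1 + f2, count, (left, right)))
        count += 1
    _, _, root = heap[0]                                  # step 7: root of the Huffman tree

    codes = {}
    def walk(node, prefix):
        if isinstance(node, tuple):                       # internal node
            walk(node[0], prefix + "0")                   # left edge labelled 0
            walk(node[1], prefix + "1")                   # right edge labelled 1
        else:                                             # leaf: record its code word
            codes[node] = prefix or "0"
        return codes
    return walk(root, "")
```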
Cont'd…
• Using the text HELLO as an example (frequencies H:1, E:1, L:2, O:1) and applying these
steps, we obtain the following tree:
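To see a concrete result, one can run the sketch above on the same text; ties between equal frequencies can be broken in different ways, so the codes below are one valid assignment and may differ from the tree drawn on the slide:

```python
print(huffman_codes("HELLO"))
# {'H': '00', 'E': '01', 'O': '10', 'L': '11'}  (one of several equally good trees)
```

Either way, the five letters of HELLO need only about 10 bits in total, compared with 5 × 8 = 40 bits in plain 8-bit ASCII.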
Cont'd…
• Huffman coding assigns shorter codes to symbols that occur more frequently
and longer codes to those that occur less frequently.
• For example, imagine we have a text file that uses only five characters (A, B, C, D, E).
• Before we can assign bit patterns to each character, we assign each character a
weight based on its frequency of use.
• In this example, assume that the frequency of the characters is as shown in Table
below.
Cont'd…
Character:  A    B    C    D    E
Code:       00   010  011  10   11
Example 2
Cont'd…
• Step 1: List the symbols with their frequencies, in ascending order:
A:1  G:1  M:1  T:1  E:2  H:2  _:3  I:3  S:5
Cont'd…
• Step 2: Combine the two lowest-frequency nodes, A(1) and G(1), into an internal node of
weight 2, and likewise combine M(1) and T(1) into a second internal node of weight 2.
Cont'd…
• Steps 3–4: Combine two of the weight-2 nodes into an internal node of weight 4.
Cont'd…
• Step 5: Combine _(3) and I(3) into an internal node of weight 6.
Cont'd…
• Step 6: After the remaining weight-2 nodes are combined, the partial trees are
4 {A, G, M, T}, 4 {E, H} and 6 {_, I}, plus the leaf S(5).
Cont'd…
• Step 7: Combine the two weight-4 nodes into a node of weight 8, and combine the
weight-6 node with S(5) into a node of weight 11.
Cont'd…
• Step 8: Combine the nodes of weight 8 and 11 into the root node of weight 19; only one
node remains, so the Huffman tree is complete.
Cont'd…
• Label each left edge of the tree with 0 and each right edge with 1:
Root 19: left = 8, right = 11
Node 8: left = 4 {A, G, M, T}, right = 4 {E, H}
Node 11: left = 6 {_, I}, right = S(5)
Node 4 {A, G, M, T}: left = 2 {A, G}, right = 2 {M, T}
Node 2 {A, G}: left = A, right = G; Node 2 {M, T}: left = M, right = T
Node 4 {E, H}: left = E, right = H; Node 6 {_, I}: left = _, right = I
• The code word of a symbol is the sequence of edge labels (0 for left, 1 for right) on the
path from the root to that symbol's leaf.
Cont'd…
• Huffman code & encoded message
• Reading the paths from the labelled tree gives the code words:
S: 11   I: 101   _: 100   E: 010   H: 011   A: 0000   G: 0001   M: 0010   T: 0011
• The encoded message is obtained by replacing each character of the original text with its code word.
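As a quick check (assuming the code words reconstructed above), encoding the 19-symbol example takes: A, G, M, T at 4 bits × 1 occurrence each = 16 bits; E and H at 3 bits × 2 occurrences each = 12 bits; _ and I at 3 bits × 3 occurrences each = 18 bits; and S at 2 bits × 5 occurrences = 10 bits, for a total of 16 + 12 + 18 + 10 = 56 bits. A fixed-length code for 9 distinct symbols needs 4 bits per symbol (19 × 4 = 76 bits), and plain 8-bit ASCII needs 19 × 8 = 152 bits.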
Lossy Compression
Our eyes and ears cannot distinguish subtle changes. In such cases, we can use a
lossy data compression method.
These methods are cheaper—they take less time and space when it comes to
sending millions of bits per second for images and video.
Several methods have been developed using lossy compression techniques. JPEG
(Joint Photographic Experts Group) encoding is used to compress pictures and
graphics, MPEG (Moving Picture Experts Group) encoding is used to compress
video, and MP3 (MPEG audio layer 3) for audio compression.
Image Compression Standards
• JPEG (Joint Photographic Experts Group)
• An image compression standard
• Accepted as an international standard in 1992.
• A lossy image compression method based on the DCT (Discrete Cosine Transform)
• Useful when the image content changes relatively slowly
• Humans are less likely to notice the loss of very high spatial-frequency components
• Visual acuity is much greater for gray (luminance) than for color (chrominance).
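As a rough illustration of why the DCT helps (a toy sketch only, not the real JPEG pipeline, which also performs color-space conversion, quantization tables, zig-zag scanning and entropy coding), the 2-D DCT of a smooth 8×8 block concentrates almost all of its energy in a few low-frequency coefficients, which is what lets JPEG coarsely quantize or drop the rest:

```python
import numpy as np

def dct2(block):
    """Orthonormal 2-D type-II DCT of a square block, computed from its definition."""
    n = block.shape[0]
    k = np.arange(n)
    # 1-D DCT-II basis: C[u, x] = alpha(u) * cos((2x + 1) * u * pi / (2n))
    c = np.cos(np.pi * (2 * k[None, :] + 1) * k[:, None] / (2 * n))
    c *= np.sqrt(2.0 / n)
    c[0, :] = np.sqrt(1.0 / n)
    return c @ block @ c.T

# A smooth 8x8 block: intensity changes slowly across the block.
x = np.arange(8)
block = 100 + 5 * x[None, :] + 3 * x[:, None]

coeffs = dct2(block.astype(float))
kept = np.abs(coeffs) > 1.0   # for this smooth block, only a few low-frequency terms are significant
print(int(kept.sum()), "of 64 DCT coefficients are larger than 1 in magnitude")
```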
Exercise
1. Compress the following bit stream using Run-Length Encoding:
11111111111100000000000111110000000000000011111111
2. Based on the following data given to you:
A. Build the Huffman tree
B. Find the code words
C. Compare the number of bits needed with and without Huffman coding
THE END!