Data Compression Techniques Explained

Data compression algorithms reduce the size of data representations. Lossless compression, like Lempel-Ziv-Storer-Szymanski (LZSS), identifies and eliminates statistical redundancies without losing information. LZSS works by replacing repeated strings with references to a dictionary. Huffman coding assigns variable length codes to characters based on frequency, with more common characters having shorter codes. The Burrows-Wheeler transform permutes the characters in a string to group common substrings together, making run-length encoding more effective.

Uploaded by

shashank jakka

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

168 views12 pages

Data Compression Techniques Explained

Uploaded by

shashank jakka

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

DATA

COMPRESS
TEXT

DATA COMPRESSION
In Data Compression or bit-rate reduction involves encoding information
using fewer bits than the original representation.

Compression can be either lossy or lossless. Lossless compression reduces

bits by identifying and eliminating statistical redundancy.

No information is lost in lossless compression.

Lossy compression reduces bits by identifying marginally important

information and removing it.
TEXT

LEMPEL-ZIV-STORER-SZYMANSKI(LZSS)
LZSS is a dictionary encoding technique.

Unlike Huffman coding, which attempts to reduce the

average amount of bits required to represent a symbol,
LZSS attempts to replace a string of symbols with a
reference to a dictionary location for the same string.

It is intended that the dictionary reference should be shorter

than the string it replaces.
TEXT

LEMPEL-ZIV-STORER-SZYMANSKI(LZSS)
Step 1. Initialise the dictionary to a known value.
Step 2. Read an original string that is the length of the maximum allowable match.
Step 3. Search for the longest matching string in the dictionary.
Step 4. If a match is found greater than or equal to the minimum allowable match
length:

Write the encoded flag, then the offset and length to the encoded output.

Otherwise, write the original flag and the first original symbol to the encoded output.
Step 5. Shift a copy of the symbols written to the encoded output from the original
string to the dictionary.
Step 6. Read a number of symbols from the original input equal to the number of
symbols written in Step 4.
Step 7. Repeat from Step 3, until all the entire input has been encoded.
LEMPEL-ZIV-STORER-SZYMANSKI(LZSS)
Lets take an string : thisisthe
This would take 72 bits ! We can do better.
For this tiny string we would need 9 bits allowing up
to 512 unique characters.

Current Next Output Add to the

dictionary
t h t 116 th
256
h i h 104 hi
257
LEMPEL-ZIV-STORER-SZYMANSKI(LZSS)
Current Next Output Add to the dictionary
i s i 105 is
258
s i s 115 si
259
i s is is in the dictionary
is t is 258 ist
260
t h th is in the dictionary
th e th 256 the
261
e - e 101 -
TEXT

RUN LENGTH ALGORITHM

Run-length encoding (RLE) is a very simple form of data
compression in which runs of data (that is, sequences in which
the same data value occurs in many consecutive data elements)
are stored as a single data value and count, rather than as the
original run.
Let us take a hypothetical single scan line, with B representing a
black pixel and W representing white:
WWWWWWWWWWWWBWWWWWWWWWWWWBBBWWWWWW
WWWWWWWWWWWWWWWWWWBWWWWWWWWWWWWWW
If we apply the run-length encoding (RLE) data compression
algorithm to the above hypothetical scan line, we get the
following: 12W1B12W3B24W1B14W
TEXT

HUFFMAN CODING
Huffman coding is a lossless data compression
algorithm. The idea is to assign variable length codes
to input characters, lengths of the assigned codes
are based on the frequencies of corresponding
characters.
The most frequent character gets the smallest code
and the least frequent character gets the largest
code.
TEXT

HUFFMAN CODING
1. Create a leaf node for each unique character and build a min
heap of all leaf nodes (Min Heap is used as a priority queue. The
value of frequency field is used to compare two nodes in min heap.
Initially, the least frequent character is at root)
2. Extract two nodes with the minimum frequency from the min
heap.
3. Create a new internal node with frequency equal to the sum of
the two nodes frequencies. Make the first extracted node as its left
child and the other extracted node as its right child. Add this node
to the min heap.
4. Repeat steps#2 and #3 until the heap contains only one node.
The remaining node is the root node and the tree is complete.
TEXT

BURROW-WHEELER ALGORITHM
When a character string is transformed by the BWT, none
of its characters change value.
The transformation permutes the order of the characters.
If the original string had several substrings that occurred
often, then the transformed string will have several places
where a single character is repeated multiple times in a
row.
This is useful for compression, since it tends to be easy to
compress a string that has runs of repeated characters by
techniques such as move-to-front transform and run-
length encoding.
TEXT
TEXT

Data Compression Techniques Explained
No ratings yet
Data Compression Techniques Explained
32 pages
DC CH05
No ratings yet
DC CH05
37 pages
Unit 1 Data Compression
No ratings yet
Unit 1 Data Compression
30 pages
LZW Encoding and Decoding
No ratings yet
LZW Encoding and Decoding
18 pages
Chapter 5 - Dictionary Techniques
No ratings yet
Chapter 5 - Dictionary Techniques
25 pages
LZW Compression Algorithm Overview
No ratings yet
LZW Compression Algorithm Overview
20 pages
LZW Algorithm: Lossless Data Compression
No ratings yet
LZW Algorithm: Lossless Data Compression
9 pages
Testing - Document - Sourjyendra - Data Compression Techniques - Lecture 7 - Dictionary Compression (DCT2015-Lecture7Web)
No ratings yet
Testing - Document - Sourjyendra - Data Compression Techniques - Lecture 7 - Dictionary Compression (DCT2015-Lecture7Web)
40 pages
Dictionary Techniques in Data Compression
No ratings yet
Dictionary Techniques in Data Compression
50 pages
Compression: Author: Paul Penfield, Jr. Url: Toc
No ratings yet
Compression: Author: Paul Penfield, Jr. Url: Toc
5 pages
MMC Module 3
No ratings yet
MMC Module 3
65 pages
Fast Lempel-Ziv Encoding with Hashing
No ratings yet
Fast Lempel-Ziv Encoding with Hashing
4 pages
Lempel-Ziv Lossless Compression Techniques
No ratings yet
Lempel-Ziv Lossless Compression Techniques
11 pages
Lempel-Ziv Compression Techniques Explained
No ratings yet
Lempel-Ziv Compression Techniques Explained
26 pages
Chapter 7
No ratings yet
Chapter 7
70 pages
Dictionary-Based Coding Techniques Explained
No ratings yet
Dictionary-Based Coding Techniques Explained
31 pages
Lossless Compression Algorithms Explained
No ratings yet
Lossless Compression Algorithms Explained
21 pages
LZW Compression Scheme Lab Manual
No ratings yet
LZW Compression Scheme Lab Manual
7 pages
A New Approach For Compression On Textual Data
No ratings yet
A New Approach For Compression On Textual Data
4 pages
Compression: Author: Paul Penfield, Jr. 2004 Massachusetts Institute of Technology Url: Start: Back: Next
No ratings yet
Compression: Author: Paul Penfield, Jr. 2004 Massachusetts Institute of Technology Url: Start: Back: Next
8 pages
Range Coder
No ratings yet
Range Coder
11 pages
Forouzan6e ch11 PPTs Accessible
No ratings yet
Forouzan6e ch11 PPTs Accessible
119 pages
Dictionary Coding Explained
No ratings yet
Dictionary Coding Explained
56 pages
Understanding Data Compression Techniques
No ratings yet
Understanding Data Compression Techniques
33 pages
Understanding Data Compression Techniques
No ratings yet
Understanding Data Compression Techniques
11 pages
Multimedia Data Compression Techniques
No ratings yet
Multimedia Data Compression Techniques
29 pages
Understanding Data Compression Techniques
No ratings yet
Understanding Data Compression Techniques
21 pages
Information Theory and Compression Techniques
No ratings yet
Information Theory and Compression Techniques
21 pages
Enhancing Tunstall Code Efficiency
No ratings yet
Enhancing Tunstall Code Efficiency
16 pages
Module-3: Text and Image Compression
No ratings yet
Module-3: Text and Image Compression
66 pages
Huffman and LZW Coding Techniques
No ratings yet
Huffman and LZW Coding Techniques
41 pages
Lec5 - LZW Compression
No ratings yet
Lec5 - LZW Compression
29 pages
Implementation of Lempel-Ziv Algorithm For Lossless Compression Using VHDL
No ratings yet
Implementation of Lempel-Ziv Algorithm For Lossless Compression Using VHDL
2 pages
LZW Encoding Class Notes
No ratings yet
LZW Encoding Class Notes
5 pages
3 Chapter Text and Image Compression
No ratings yet
3 Chapter Text and Image Compression
132 pages
Lempel-Ziv Compression Algorithms Explained
No ratings yet
Lempel-Ziv Compression Algorithms Explained
6 pages
SSE Compression Method for Text Data
No ratings yet
SSE Compression Method for Text Data
13 pages
Data Compression Techniques Overview
No ratings yet
Data Compression Techniques Overview
193 pages
Notes Module2
No ratings yet
Notes Module2
16 pages
In This Lecture: Recap of Huffman LZW Compression
No ratings yet
In This Lecture: Recap of Huffman LZW Compression
59 pages
Multimedia Communication Module 3
No ratings yet
Multimedia Communication Module 3
68 pages
Lossy vs Lossless Image Compression
No ratings yet
Lossy vs Lossless Image Compression
4 pages
Data Compression
No ratings yet
Data Compression
35 pages
DC 1
No ratings yet
DC 1
3 pages
File Compression Techniques Explained
No ratings yet
File Compression Techniques Explained
8 pages
LZW Compression Algorithm Overview
No ratings yet
LZW Compression Algorithm Overview
5 pages
Image and Video Compression: Lecture 12, April 27, 2009 Lexing Xie
No ratings yet
Image and Video Compression: Lecture 12, April 27, 2009 Lexing Xie
77 pages
Compression: Some Slides Courtesy James Allan@umass
No ratings yet
Compression: Some Slides Courtesy James Allan@umass
47 pages
Chapter 2-Compression Techniques
No ratings yet
Chapter 2-Compression Techniques
63 pages
Data Compression Techniques
No ratings yet
Data Compression Techniques
11 pages
CS 300 Data Structures: Sabancı University Faculty of Engineering and Natural Sciences
No ratings yet
CS 300 Data Structures: Sabancı University Faculty of Engineering and Natural Sciences
6 pages
Multimedia Data Compression Guide
No ratings yet
Multimedia Data Compression Guide
21 pages
7 String Matching: 7.1 Brute Force
No ratings yet
7 String Matching: 7.1 Brute Force
15 pages
Playfair Cipher Programming Project
No ratings yet
Playfair Cipher Programming Project
5 pages
LZW Compression Algorithm Overview
No ratings yet
LZW Compression Algorithm Overview
12 pages
Asdba
No ratings yet
Asdba
2 pages
Management Accounting Course Overview
No ratings yet
Management Accounting Course Overview
8 pages
Fractals Part2
No ratings yet
Fractals Part2
21 pages
Key Concepts in Research Methodology
33% (3)
Key Concepts in Research Methodology
5 pages
Employee Satisfaction and Research Methods
No ratings yet
Employee Satisfaction and Research Methods
2 pages
IMF Policies and Legal Considerations
No ratings yet
IMF Policies and Legal Considerations
9 pages
2024 Fspen An Ultra-Lightweight Network For Real Time Speech Enahncment
No ratings yet
2024 Fspen An Ultra-Lightweight Network For Real Time Speech Enahncment
5 pages
22CAE12 Artificial Intelligence and Machine Learning
No ratings yet
22CAE12 Artificial Intelligence and Machine Learning
2 pages
Interview Questions in Neural Network
No ratings yet
Interview Questions in Neural Network
9 pages
Frequency Analysis of Signals and Systems
No ratings yet
Frequency Analysis of Signals and Systems
40 pages
DLRL Notes
No ratings yet
DLRL Notes
88 pages
Trading Algo
No ratings yet
Trading Algo
3 pages
4 - Protocol Building Blocks
No ratings yet
4 - Protocol Building Blocks
22 pages
Lec 5
No ratings yet
Lec 5
23 pages
Analog to Discrete Signal Conversion
No ratings yet
Analog to Discrete Signal Conversion
9 pages
DSA Lab 2 Fall 2025
No ratings yet
DSA Lab 2 Fall 2025
4 pages
Decision Tree Assignment
0% (2)
Decision Tree Assignment
5 pages
COE4TL4 Final Qs
No ratings yet
COE4TL4 Final Qs
11 pages
Quadratic Programming in Engineering
No ratings yet
Quadratic Programming in Engineering
16 pages
BCS 465 Neural Network - 2020
No ratings yet
BCS 465 Neural Network - 2020
5 pages
Chapter 10
No ratings yet
Chapter 10
8 pages
MA1006 A-SNM Unit-4 QB New PDF
No ratings yet
MA1006 A-SNM Unit-4 QB New PDF
24 pages
EEG Ocular Artifact Removal Algorithm
No ratings yet
EEG Ocular Artifact Removal Algorithm
6 pages
Optimization Techniques in Engineering
No ratings yet
Optimization Techniques in Engineering
15 pages
CE315 Numerical Solution Syllabus 1
No ratings yet
CE315 Numerical Solution Syllabus 1
6 pages
Digital Signal Processing Notes
100% (1)
Digital Signal Processing Notes
99 pages
AE 6310 - Optimization For The Design of Engineered Systems
No ratings yet
AE 6310 - Optimization For The Design of Engineered Systems
2 pages
Machine Learning Notes Btech
No ratings yet
Machine Learning Notes Btech
3 pages
Eee 513 - Lecture III
No ratings yet
Eee 513 - Lecture III
31 pages
Dynamic Programming Concepts Explained
No ratings yet
Dynamic Programming Concepts Explained
40 pages
Aifile
No ratings yet
Aifile
30 pages
Coordinate-wise Optimization in α-CL
No ratings yet
Coordinate-wise Optimization in α-CL
25 pages
Complete PDF File At: 5 Semester BE (CBCS) EC Syllabus
No ratings yet
Complete PDF File At: 5 Semester BE (CBCS) EC Syllabus
7 pages
CPU Scheduling: Priority Algorithms Simulation
No ratings yet
CPU Scheduling: Priority Algorithms Simulation
5 pages
Introduction to Graph Theory Concepts
No ratings yet
Introduction to Graph Theory Concepts
6 pages
EC8553 Discrete Time Signal Processing MCQ Padeepz
No ratings yet
EC8553 Discrete Time Signal Processing MCQ Padeepz
17 pages

Data Compression Techniques Explained

Uploaded by

Data Compression Techniques Explained

Uploaded by

DATA

Compression can be either lossy or lossless. Lossless compression reduces

No information is lost in lossless compression.

Lossy compression reduces bits by identifying marginally important

Unlike Huffman coding, which attempts to reduce the

It is intended that the dictionary reference should be shorter

Current Next Output Add to the

RUN LENGTH ALGORITHM

You might also like