
Huffman Encoding

(CREDIT: WWW.CIS.UPENN.EDU/~MATUSZEK/CIT594-2002/SLIDES/HUFFMAN.PPT)
Fixed and variable bit widths

To encode English text, we need 26 lower case letters, 26 upper case letters, and a handful of punctuation
We can get by with 64 characters (6 bits) in all
Each character is therefore 6 bits wide
We can do better, provided:
 Some characters are more frequent than others
 Characters may be different bit widths, so that, for example, e uses only one or two bits, while x uses several
 We have a way of decoding the bit stream
  Must be able to tell where each character begins and ends
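A rough sketch of the fixed-width idea in Python (the 64-symbol alphabet below is my own illustrative choice, not a standard table):

```python
import math
import string

# An illustrative 64-symbol alphabet: 52 letters, space, and some punctuation
ALPHABET = string.ascii_lowercase + string.ascii_uppercase + " .,;:'\"?!-()"
assert len(ALPHABET) == 64

WIDTH = math.ceil(math.log2(len(ALPHABET)))   # 64 symbols -> 6 bits each
FIXED = {ch: format(i, f"0{WIDTH}b") for i, ch in enumerate(ALPHABET)}

print(WIDTH)                    # 6
print(FIXED["e"], FIXED["x"])   # both 6 bits: frequency is ignored
```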
Example Huffman encoding

A = 0
B = 100
C = 1010
D = 1011
R = 11
 ABRACADABRA = 01001101010010110100110
This is eleven letters in 23 bits
A fixed-width encoding would require 3 bits for five different letters, or 33 bits for 11 letters
Notice that the encoded bit string can be decoded!
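A minimal decoding sketch in Python (mine, not from the slides), using the table above; bits can be read greedily until the buffer matches a complete code word:

```python
CODES = {"A": "0", "B": "100", "C": "1010", "D": "1011", "R": "11"}
DECODE = {bits: letter for letter, bits in CODES.items()}

def decode(bitstring):
    result, buffer = [], ""
    for bit in bitstring:
        buffer += bit
        if buffer in DECODE:              # a complete code word has been read
            result.append(DECODE[buffer])
            buffer = ""
    return "".join(result)

print(decode("01001101010010110100110"))  # -> ABRACADABRA, from 23 bits
```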
Why it works

In this example, A was the most common letter


In ABRACADABRA:
 5 As: the code for A is 1 bit long
 2 Rs: the code for R is 2 bits long
 2 Bs: the code for B is 3 bits long
 1 C: the code for C is 4 bits long
 1 D: the code for D is 4 bits long
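The 23-bit total is just the sum of (count x code length) over the letters; a quick check in Python:

```python
counts  = {"A": 5, "R": 2, "B": 2, "C": 1, "D": 1}   # letter counts in ABRACADABRA
lengths = {"A": 1, "R": 2, "B": 3, "C": 4, "D": 4}   # code lengths from the table above
total = sum(counts[ch] * lengths[ch] for ch in counts)
print(total)   # 5*1 + 2*2 + 2*3 + 1*4 + 1*4 = 23 bits
```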
Creating a Huffman encoding

For each encoding unit (letter, in this example), associate a frequency (number of times it occurs)
 You can also use a percentage or a probability
Create a binary tree whose two children are the encoding units with the smallest frequencies
 The frequency of the root is the sum of the frequencies of the leaves
Repeat this procedure, treating each new tree as a single encoding unit, until all the encoding units are in one binary tree (sketched in code below)
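One way to sketch this greedy merge in Python (the heap-based structure and nested-tuple tree are my own choices, not from the slides; ties may be broken differently here than in the worked example, which just changes which equally good tree you get):

```python
import heapq
from itertools import count

def build_huffman_tree(freqs):
    """Repeatedly merge the two lowest-frequency nodes into one tree."""
    tiebreak = count()   # keeps the heap from ever comparing trees directly
    # Heap entries are (frequency, tiebreak, tree); a leaf is just its symbol.
    heap = [(f, next(tiebreak), symbol) for symbol, f in freqs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)
        f2, _, right = heapq.heappop(heap)
        # The new node's frequency is the sum of its children's frequencies
        heapq.heappush(heap, (f1 + f2, next(tiebreak), (left, right)))
    return heap[0][2]

tree = build_huffman_tree({"A": 40, "B": 20, "C": 10, "D": 10, "R": 20})
```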
Example, step I
 Assume that relative frequencies are:
 A: 40
 B: 20
 C: 10
 D: 10
 R: 20

 (I chose simpler numbers than the real frequencies)


 The smallest numbers are 10 and 10 (C and D), so connect those
Example, step II

C and D have already been used, and the new node above them (call it C+D) has value 20
The smallest values are B, C+D, and R, all of which have value 20
 Connect any two of these
Example, step III

The smallest value is R (20), while A and B+C+D both have value 40
Connect R to either of the others
Example, step IV

Connect the final two nodes


Example, step V
 Assign 0 to left branches, 1 to right branches
 Each encoding is a path from the root
A = 0
B = 100
C = 1010
D = 1011
R = 11
 Each path terminates at a leaf
 Do you see why encoded strings are decodable?
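Continuing the earlier sketch, a recursive walk assigns 0 for a left branch and 1 for a right branch; applied to the particular tree the slides build (written out by hand here as nested tuples), it reproduces the table above:

```python
def assign_codes(tree, prefix="", table=None):
    """0 for the left branch, 1 for the right; a leaf's code is the path to it."""
    if table is None:
        table = {}
    if isinstance(tree, str):       # leaf: record the accumulated path
        table[tree] = prefix
    else:
        left, right = tree
        assign_codes(left, prefix + "0", table)
        assign_codes(right, prefix + "1", table)
    return table

# The tree from steps I-IV: A joins last, R next, then B, with C and D deepest
slide_tree = ("A", (("B", ("C", "D")), "R"))
print(assign_codes(slide_tree))
# {'A': '0', 'B': '100', 'C': '1010', 'D': '1011', 'R': '11'}
```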
Unique prefix property
A = 0
B = 100
C = 1010
D = 1011
R = 11
No bit string is a prefix of any other bit string
For example, if we added E=01, then A (0) would
be a prefix of E
Similarly, if we added F=10, then it would be a
prefix of three other encodings (B=100, C=1010,
and D=1011)
The unique prefix property holds because, in a
binary tree, a leaf is not on a path to any other
node
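A mechanical way to check the property (a small sketch, reusing the code table from the slides):

```python
from itertools import permutations

CODES = {"A": "0", "B": "100", "C": "1010", "D": "1011", "R": "11"}

def has_unique_prefix_property(codes):
    """True if no code word is a prefix of a different code word."""
    return not any(a.startswith(b) for a, b in permutations(codes.values(), 2))

print(has_unique_prefix_property(CODES))                  # True
print(has_unique_prefix_property({**CODES, "E": "01"}))   # False: A (0) is a prefix of E (01)
```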
Practical considerations

It is not practical to create a Huffman encoding for a single short string, such as ABRACADABRA
 To decode it, you would need the code table
 If you include the code table with the message, the whole thing is bigger than just the ASCII message
Huffman encoding is practical if:
 The encoded string is large relative to the code table, OR
 We agree on the code table beforehand
  For example, it’s easy to find a table of letter frequencies for English (or any other alphabet-based language)
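A back-of-envelope check of the code-table overhead for ABRACADABRA, assuming one naive, purely illustrative table format (each entry stores an 8-bit symbol, an 8-bit code length, and the code bits):

```python
CODES = {"A": "0", "B": "100", "C": "1010", "D": "1011", "R": "11"}

encoded_bits = 23                                               # Huffman-encoded ABRACADABRA
table_bits = sum(8 + 8 + len(code) for code in CODES.values())  # naively serialized table
ascii_bits = 8 * len("ABRACADABRA")

print(encoded_bits + table_bits, "vs", ascii_bits)   # 117 vs 88: bigger than plain ASCII
```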
Data compression

Huffman encoding is a simple example of data compression: representing data in fewer bits than it would otherwise need
A more sophisticated method is GIF (Graphics Interchange Format) compression, for .gif files
Another is JPEG (Joint Photographic Experts Group), for .jpg files
 Unlike the others, JPEG is lossy: it loses information
 Generally OK for photographs (if you don’t compress them too much), because decompression adds “fake” data very similar to the original
