Efficient Data Compression with Huffman Codes

Huffman coding is an efficient method for data compression that uses a greedy algorithm to create optimal prefix codes based on character frequency. It allows for variable-length codes, significantly reducing the number of bits needed to represent data compared to fixed-length codes. The process involves building a Huffman tree from character frequencies and assigning binary codes to each character, ensuring no code is a prefix of another for unambiguous decoding.

Uploaded by

luckymlcvl

Huffman Tree and Coding

Thursday, December 1, 2022 10:58 PM

Huffman Codes
• (i) Data can be encoded efficiently using Huffman Codes.
• (ii) It is a widely used and beneficial technique for compressing data.
• (iii) Huffman's greedy algorithm uses a table of how often each character occurs
to build an optimal way of representing each character as a binary string.
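Suppose we have 10^5 characters in a data file. Normal storage uses 8 bits per character (ASCII), so the
file takes 8 x 10^5 bits. We want to compress the file and store it compactly. Suppose only six characters
appear in the file, with the following frequencies (in thousands, as used in the bit count further down):
a 45
b 13
c 12
d 16
e 9
f 5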

How can we represent the data in a Compact way?

(i) Fixed-length code: Each letter is represented by the same number of bits. With six characters, a
fixed-length code needs at least 3 bits per character (2 bits give only 4 distinct codewords):
For example:
a 000
b 001
c 010
d 011
e 100
f 101
For a file with 10^5 characters, we need 3 x 10^5 bits.

(ii) Variable-length code: It can do considerably better than a fixed-length code by giving frequent
characters short codewords and infrequent characters long codewords.
For example:
a 0
b 101
c 100
d 111
e 1101
f 1100
Number of bits = (45 x 1 + 13 x 3 + 12 x 3 + 16 x 3 + 9 x 4 + 5 x 4) x 1000 = 224,000 = 2.24 x 10^5 bits
Thus, the file takes 224,000 bits, a saving of approximately 25% over the 300,000 bits of the fixed-length
code. This is an optimal character code for this file.
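
The arithmetic above can be checked with a short Python sketch. The frequency values are the ones that appear in the bit-count formula (in thousands of occurrences); the variable names are illustrative only.

```python
import math

# Frequencies in thousands of occurrences, taken from the bit-count formula.
freq = {"a": 45, "b": 13, "c": 12, "d": 16, "e": 9, "f": 5}
# The variable-length code from the example.
var_code = {"a": "0", "b": "101", "c": "100", "d": "111", "e": "1101", "f": "1100"}

# Smallest fixed width that can distinguish six characters.
fixed_bits_per_char = math.ceil(math.log2(len(freq)))

total_chars = sum(freq.values()) * 1000
fixed_total = fixed_bits_per_char * total_chars
var_total = sum(freq[ch] * len(var_code[ch]) for ch in freq) * 1000
saving = 1 - var_total / fixed_total

print(fixed_total, var_total, round(saving * 100, 1))  # 300000 224000 25.3
```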

Prefix Codes:
• A prefix code is a code in which no codeword is a prefix of another codeword. For example, 1100 and
11001 cannot both be codewords, because 1100 is a prefix of 11001.
• Prefix codes are desirable because they simplify decoding. Encoding is simple for any binary
character code: we concatenate the codewords representing each character of the file. Decoding is
also straightforward with a prefix code: since no codeword is a prefix of any other, the codeword
that begins an encoded file is unambiguous. We identify it, translate it back to the original
character, and repeat on the remainder of the file.
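
As a minimal sketch of these two points, the Python below checks the prefix property and decodes a bit string one codeword at a time. The function names `is_prefix_free` and `decode` are illustrative, not part of the original notes, and the code table is the variable-length example given earlier.

```python
def is_prefix_free(codewords):
    """Return True if no codeword is a prefix of a different codeword."""
    words = list(codewords)
    return not any(
        i != j and w.startswith(v)
        for i, v in enumerate(words)
        for j, w in enumerate(words)
    )


def decode(bits, code):
    """Decode a bit string by repeatedly matching the codeword at its front."""
    inverse = {word: ch for ch, word in code.items()}
    out, current = [], ""
    for bit in bits:
        current += bit
        # With a prefix code, the first match is the only possible match.
        if current in inverse:
            out.append(inverse[current])
            current = ""
    if current:
        raise ValueError("trailing bits do not form a codeword")
    return "".join(out)


code = {"a": "0", "b": "101", "c": "100", "d": "111", "e": "1101", "f": "1100"}
print(is_prefix_free(code.values()))       # True
print(is_prefix_free(["1100", "11001"]))   # False
print(decode("001011101", code))           # aabe
```

The decoder never needs to look ahead or backtrack, which is exactly what the prefix property guarantees.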

Greedy Algorithm for constructing a Huffman Code:

Huffman invented a greedy algorithm that creates an optimal prefix code called a Huffman Code.

• The algorithm builds the tree T corresponding to the optimal code in a bottom-up manner. It starts
with a set of |C| leaves (C is the set of characters) and performs |C| - 1 'merging' operations to
create the final tree. In the algorithm below, n denotes the number of characters, Q is a min-priority
queue keyed on the frequency f, z is a newly created parent node, and x and y are the left and right
children of z respectively.

Huffman (C)
    n = |C|
    Q ← C
    for i = 1 to n - 1
        z = Allocate-Node()
        x = left[z] = Extract-Min(Q)
        y = right[z] = Extract-Min(Q)
        f[z] = f[x] + f[y]
        Insert(Q, z)
    return Extract-Min(Q)
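
One possible Python transcription of this pseudocode is sketched below. It uses the standard `heapq` module as the priority queue Q; the `Node` class and its field names are assumptions made for illustration, and the input is assumed to be a non-empty mapping from character to frequency.

```python
import heapq
from dataclasses import dataclass, field
from typing import Optional


@dataclass(order=True)
class Node:
    # Only freq takes part in comparisons, so the heap orders nodes by frequency.
    freq: int
    symbol: Optional[str] = field(default=None, compare=False)
    left: Optional["Node"] = field(default=None, compare=False)
    right: Optional["Node"] = field(default=None, compare=False)


def huffman(freqs):
    """Build a Huffman tree from a non-empty {character: frequency} mapping."""
    queue = [Node(f, ch) for ch, f in freqs.items()]  # Q <- C
    heapq.heapify(queue)
    for _ in range(len(freqs) - 1):                   # n - 1 merges
        x = heapq.heappop(queue)                      # Extract-Min
        y = heapq.heappop(queue)                      # Extract-Min
        z = Node(x.freq + y.freq, left=x, right=y)    # f[z] = f[x] + f[y]
        heapq.heappush(queue, z)                      # Insert(Q, z)
    return heapq.heappop(queue)                       # root of the tree


root = huffman({"a": 45, "b": 13, "c": 12, "d": 16, "e": 9, "f": 5})
print(root.freq)         # 100
print(root.left.symbol)  # a
```

With a binary heap, each of the n - 1 iterations costs O(log n), so the whole construction runs in O(n log n) time.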

There are two major parts in Huffman coding:

1. Build a Huffman tree from the input characters.
2. Traverse the Huffman tree and assign codes to characters.

Steps to build Huffman Tree

The input is an array of unique characters along with their frequencies of occurrence, and the output is
the Huffman tree.
1. Create a leaf node for each unique character and build a min heap of all leaf nodes. (The min heap
is used as a priority queue, and the frequency field is used to compare two nodes. Initially, the
least frequent character is at the root.)
2. Extract the two nodes with the minimum frequency from the min heap.
3. Create a new internal node with a frequency equal to the sum of the two nodes' frequencies. Make
the first extracted node its left child and the other extracted node its right child. Add this
node to the min heap.
4. Repeat steps 2 and 3 until the heap contains only one node. The remaining node is the root node,
and the tree is complete.
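
Both parts, building the tree and traversing it to assign codes, can be sketched compactly in Python. This is an illustration under stated assumptions, not a definitive implementation: leaves are plain one-character strings, internal nodes are (left, right) tuples, a counter breaks frequency ties in the heap, and the convention of 0 for a left edge and 1 for a right edge is assumed rather than taken from the notes.

```python
import heapq
from itertools import count


def build_tree(freqs):
    """Steps 1-4: repeatedly merge the two least frequent nodes."""
    tie = count()  # unique tie-breaker so the heap never compares subtrees
    heap = [(f, next(tie), ch) for ch, f in freqs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)    # first extracted: left child
        f2, _, right = heapq.heappop(heap)   # second extracted: right child
        heapq.heappush(heap, (f1 + f2, next(tie), (left, right)))
    return heap[0][2]


def assign_codes(tree, prefix=""):
    """Walk the tree, appending 0 for a left edge and 1 for a right edge."""
    if isinstance(tree, str):
        # A single-character alphabet still needs a one-bit codeword.
        return {tree: prefix or "0"}
    left, right = tree
    codes = assign_codes(left, prefix + "0")
    codes.update(assign_codes(right, prefix + "1"))
    return codes


freq = {"a": 45, "b": 13, "c": 12, "d": 16, "e": 9, "f": 5}
codes = assign_codes(build_tree(freq))
print(codes)
# {'a': '0', 'c': '100', 'b': '101', 'f': '1100', 'e': '1101', 'd': '111'}
print(sum(freq[ch] * len(codes[ch]) for ch in freq))  # 224
```

For these frequencies the result matches the variable-length code table given earlier, and the weighted length of 224 (thousand bits) agrees with the bit count computed there.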
