Greedy Algorithms
Lecture 22
29th November, 2013
Huffman Codes
Suppose we have a 100,000-character data file that we wish to store
compactly
The frequencies of the characters are given in the table below
A fixed-length code requires 3 bits per character, so 300,000 bits are
required to encode the entire file
Character                  a    b    c    d    e    f
Frequency (in thousands)   45   13   12   16   9    5
Fixed-length code          000  001  010  011  100  101
A widely used and very efficient technique for compressing data
Savings of 20% to 90% are typical, depending on the characteristics
of the data being compressed
Huffman coding is an entropy encoding algorithm used for
lossless data compression
We consider data to be a sequence of characters
The algorithm uses a greedy strategy
Consider a variable-length code
If we give frequent characters short codewords and infrequent characters
long codewords, we can do better
For example, using the variable-length code in the table below, the number
of bits required is (45·1 + 13·3 + 12·3 + 16·3 + 9·4 + 5·4) · 1000 = 224,000
bits (a quick check follows the table)
Character                  a    b    c    d    e    f
Frequency (in thousands)   45   13   12   16   9    5
Fixed-length code          000  001  010  011  100  101
Variable-length code       0    101  100  111  1101 1100
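As a sanity check of this arithmetic, a short Python snippet (my own, not from the slides) computes both totals directly from the table:

freq  = {'a': 45, 'b': 13, 'c': 12, 'd': 16, 'e': 9, 'f': 5}   # in thousands
fixed = {'a': '000', 'b': '001', 'c': '010', 'd': '011', 'e': '100', 'f': '101'}
var   = {'a': '0', 'b': '101', 'c': '100', 'd': '111', 'e': '1101', 'f': '1100'}

# Total bits = sum over all characters of frequency x codeword length
print(sum(freq[c] * len(fixed[c]) for c in freq) * 1000)   # 300000
print(sum(freq[c] * len(var[c]) for c in freq) * 1000)     # 224000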
A code in which no codeword is a prefix of any other codeword is called
a prefix code
Encoding is simple: a 3-character file abc is coded as 0·101·100, which
is 0101100
Prefix codes simplify decoding
The codeword that begins an encoded file is unambiguous: we can
simply identify the initial codeword, translate it back to the original
character, and repeat the decoding process on the remainder of the
encoded file
e.g., 001011101 parses uniquely as 0·0·101·1101, which decodes to aabe
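This scan-and-emit loop is easy to write down; the sketch below (my own transcription, using the variable-length code from the table above) decodes a bit string one codeword at a time:

# Decode a prefix code by identifying the initial codeword, emitting its
# character, and repeating on the remainder of the bit string.
CODE = {'0': 'a', '101': 'b', '100': 'c', '111': 'd', '1101': 'e', '1100': 'f'}

def decode(bits):
    out, buf = [], ''
    for bit in bits:
        buf += bit
        if buf in CODE:        # prefix property: the first match is unambiguous
            out.append(CODE[buf])
            buf = ''
    return ''.join(out)

print(decode('001011101'))     # -> 'aabe'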
Decoding requires an appropriate representation of the prefix
code
A binary tree whose leaves are the given characters provides one
such representation
[Figure: two binary code trees over the characters a-f, one for the
fixed-length code and one for the variable-length (Huffman) code; each leaf
is labeled with its character and frequency, each internal node with the sum
of its subtree's frequencies (e.g., 100 at the root), and each edge with
0 (left) or 1 (right)]
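Given such a tree, decoding is a walk from the root: follow the left child on 0 and the right child on 1, and emit a character each time a leaf is reached. A minimal sketch (the nested-tuple tree encoding is my own choice, built to match the variable-length code above):

# Tree for the variable-length code: left = 0, right = 1; leaves are characters.
TREE = ('a', (('c', 'b'), (('f', 'e'), 'd')))

def tree_decode(bits, root):
    out, node = [], root
    for bit in bits:
        node = node[int(bit)]        # 0 -> left child, 1 -> right child
        if isinstance(node, str):    # reached a leaf: emit and restart at root
            out.append(node)
            node = root
    return ''.join(out)

print(tree_decode('001011101', TREE))   # -> 'aabe'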
Huffman Pseudocode
Huffman invented a greedy algorithm that constructs an optimal prefix
code called a Huffman code
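A minimal Python sketch of the standard construction (function names and tree representation are my own, not from the slides): repeatedly merge the two least-frequent subtrees, using a binary heap as the priority queue.

import heapq

def huffman(freq):
    # Heap entries are (frequency, tie-breaker, subtree); the counter avoids
    # comparing subtrees when two frequencies are equal.
    heap = [(f, i, ch) for i, (ch, f) in enumerate(freq.items())]
    heapq.heapify(heap)
    count = len(heap)
    while len(heap) > 1:
        f1, _, t1 = heapq.heappop(heap)   # the two least-frequent subtrees
        f2, _, t2 = heapq.heappop(heap)
        heapq.heappush(heap, (f1 + f2, count, (t1, t2)))
        count += 1
    return heap[0][2]                     # root of the code tree

def codewords(tree, prefix=''):
    # Walk the tree: a left edge contributes '0', a right edge contributes '1'.
    if isinstance(tree, str):
        return {tree: prefix or '0'}
    left, right = tree
    return {**codewords(left, prefix + '0'), **codewords(right, prefix + '1')}

print(codewords(huffman({'a': 45, 'b': 13, 'c': 12, 'd': 16, 'e': 9, 'f': 5})))

The exact codewords depend on how ties are broken and on which subtree becomes the left child, but the codeword lengths, and hence the 224,000-bit total, come out optimal either way.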
Analysis
The Huffman code tree is generated using a priority queue (a heap data
structure).
Initially, all n characters are stored in the heap. A data element can be
extracted from, or inserted into, a heap in O(lg n) time.
Each iteration extracts the two lowest-frequency nodes from the queue and
adds one merged node (with the combined frequency) back to the queue.
This operation is repeated n-1 times, so altogether about 3n
enqueue/dequeue operations are performed.
The total time for the extractions and insertions is thus about 3n · lg n.
Therefore, the running time for the creation of the code tree is O(n lg n)
String Matching
Finding all occurrences of a pattern in a text
More formally, assume the text is an array T[1..n] of length n, and the
pattern is an array P[1..m] of length m <= n. We further assume that the
elements of P and T are characters drawn from a finite alphabet Σ. For
example, we may have Σ = {0, 1} or Σ = {a, b, ..., z}
We say that P occurs with shift s in text T if 0 <= s <= n-m and
T[s+1..s+m] = P[1..m]. If P occurs with shift s in T, then we call s a
valid shift; otherwise we call s an invalid shift (a small check appears below)
The string-matching problem is the problem of finding all valid shifts
with which a given pattern P occurs in a given text T
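As a concrete check of the definition (the strings are my own example; note that Python slices are 0-indexed, so T[s+1..s+m] becomes T[s:s+m]):

T = 'abcabaabcabac'
P = 'abaa'
s = 3
print(T[s:s+len(P)] == P)   # True: s = 3 is a valid shift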
Naïve String Matching Algorithm
The brute-force algorithm makes a character-by-character
comparison between the pattern and the given text block
The pattern is moved along the text to examine all possible
alignments
The figure shows the initial and terminal placements of the
pattern
The naïve algorithm finds all valid shifts using a loop that checks
the condition P[1..m] = T[s+1..s+m] for each of the n-m+1
possible values of s
NAIVE-STRING-MATCHER(T, P)
  n ← length[T]
  m ← length[P]
  for s ← 0 to n-m
      do if P[1..m] = T[s+1..s+m]
             then print "pattern occurs with shift" s
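The same algorithm in runnable form: a direct Python transcription of the pseudocode above (0-indexed, so the shifts run over range(n - m + 1)):

def naive_string_matcher(T, P):
    n, m = len(T), len(P)
    for s in range(n - m + 1):       # all n-m+1 candidate shifts
        if T[s:s+m] == P:            # character-by-character comparison
            print('pattern occurs with shift', s)

naive_string_matcher('abcabaabcabac', 'abaa')   # pattern occurs with shift 3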
Analysis
The code consists of two nested loops:
for i = 1 to n-m+1 do
    for j = 1 to m do
The outer loop iterates n-m+1 times and the inner loop iterates m
times. Thus, in the worst case, altogether m(n-m+1) iterations are
performed.
Thus, the running time of the algorithm is O(m(n-m+1)), which is
simplified to O(mn) (the worst case is illustrated below)
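The bound is tight: with T = aa...a and P = aa...a, every shift matches all m characters. A quick count of comparisons (a hypothetical helper, not from the slides):

def comparisons(T, P):
    n, m, total = len(T), len(P), 0
    for s in range(n - m + 1):
        for j in range(m):           # compare until mismatch or full match
            total += 1
            if T[s + j] != P[j]:
                break
    return total

print(comparisons('a' * 10, 'a' * 3))   # m(n-m+1) = 3 * 8 = 24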