
In 1951, David Huffman developed an algorithm for efficiently encoding the output of a source that produces a sequence of symbols, each of which has a probability of occurrence. This algorithm essentially achieved the theoretical limits of performance presented in Claude Shannon's classic paper of 1948. The simplicity of the Huffman technique, described in his paper reproduced here, makes it extremely popular for use in compression tools.

Priti Shankar

A Method for the Construction of


Minimum-Redundancy Codes*
David A Huffman, Associate, IRE
Massachusetts Institute of Technology, Cambridge, Mass.

Summary - An optimum method of coding an ensemble of messages consisting of a finite number of members is developed. A minimum-redundancy code is one constructed in such a way that the average number of coding digits per message is minimized.

Introduction

One important method of transmitting messages is to transmit in their place sequences of symbols. If there are more messages which might be sent than there are kinds of symbols available, then some of the messages must use more than one symbol. If it is assumed that each symbol requires the same time for transmission, then the time for transmission (length) of a message is directly proportional to the number of symbols associated with it. In this paper, the symbol or sequence of symbols associated with a given message will be called the "message code". The entire number of messages which might be transmitted will be called the "message ensemble". The mutual agreement between the transmitter and the receiver about the meaning of the code for each message of the ensemble will be called the "ensemble code".

*Decimal classification: R531.1. Original manuscript received by the Institute, December 6, 1951. Reproduced from Proceedings of the IRE, Vol. 40(9), p. 1098, September 1952.

RESONANCE | February 2006

Probably the most familiar ensemble code was stated in the phrase "one if by land and two if by sea". In this case, the message ensemble consisted of the two individual messages "by land" and "by sea", and the message codes were "one" and "two".

In order to formalize the requirements of an ensemble code, the coding symbols will be represented by numbers. Thus, if there are D different types of symbols to be used in coding, they will be represented by the digits 0, 1, 2, ..., (D - 1). For example, a ternary code will be constructed using the three digits 0, 1, and 2 as coding symbols.

The number of messages in the ensemble will be called N. Let P(i) be the probability of the ith message. Then

\sum_{i=1}^{N} P(i) = 1.   (1)

The length of a message, L(i), is the number of coding digits assigned to it. Therefore, the average message length is

L_{av} = \sum_{i=1}^{N} P(i) L(i).   (2)

The term "redundancy" has been defined by Shannon [1] as a property of codes. A "minimum-redundancy code" will be defined here as an ensemble code which, for a message ensemble consisting of a finite number of members, N, and for a given number of coding digits, D, yields the lowest possible average message length. In order to avoid the use of the lengthy term "minimum-redundancy", this term will be replaced here by "optimum". It will be understood then that, in this paper, "optimum code" means "minimum-redundancy code".

The following basic restrictions will be imposed on an ensemble code:

(a) No two messages will consist of identical arrangements of coding digits.

(b) The message codes will be constructed in such a way that no additional
indication is necessary to specify where a message code begins and
ends once the starting point of a sequence of messages is known.

Restriction (b) necessitates that no message be coded in such a way that its code appears, digit for digit, as the first part of any message code of greater length. Thus, 01, 102, 111, and 202 are valid message codes for an ensemble of four members. For instance, a sequence of these messages 1111022020101111102 can be broken up into the individual messages 111-102-202-01-01-111-102. All the receiver need know is the ensemble code. However, if the ensemble has individual message codes including 11, 111, 102, and 02, then when a message sequence starts with the digits 11, it is not immediately certain whether the message 11 has been received or whether it is only the first two digits of the message 111. Moreover, even if the sequence turns out to be 11102, it is still not certain whether 111-02 or 11-102 was transmitted. In this example, a change of one of the two message codes 111 or 11 is indicated.
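Restriction (b) is what lets the receiver split a digit stream without any separators: scanning left to right, the first code that matches is the only one that can match. A minimal sketch of such a decoder (the function name and data representation are illustrative, not from the paper):

```python
def decode(stream, codebook):
    """Split a digit stream into message codes, relying on the prefix
    property: no code in the codebook is the first part of another."""
    messages, current = [], ""
    for digit in stream:
        current += digit
        if current in codebook:  # first match is the only possible match
            messages.append(current)
            current = ""
    if current:
        raise ValueError("stream ended in the middle of a code")
    return messages

# The paper's ternary example: codes 01, 102, 111, 202.
print(decode("1111022020101111102", {"01", "102", "111", "202"}))
# -> ['111', '102', '202', '01', '01', '111', '102']
```

With the non-prefix set 11, 111, 102, 02 from the same paragraph, this greedy scan would commit to 11 as soon as it appears, which is exactly the ambiguity the paper describes.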
C E Shannon [1] and R M Fano [2] have developed ensemble coding procedures for the purpose of proving that the average number of binary digits required per message approaches from above the average amount of information per message. Their coding procedures are not optimum, but approach the optimum behaviour when N approaches infinity. Some work has been done by Kraft [3] toward deriving a coding method which gives an average code length as close as possible to the ideal when the ensemble contains a finite number of members. However, up to the present time, no definite procedure has been suggested for the construction of such a code to the knowledge of the author. It is the purpose of this paper to derive such a procedure.
Derived Coding Requirements

For an optimum code, the length of a given message code can never be less than the length of a more probable message code. If this requirement were not met, then a reduction in average message length could be obtained by interchanging the codes for the two messages in question in such a way that the shorter code becomes associated with the more probable message. Also, if there are several messages with the same probability, then it is possible that the codes for these messages may differ in length. However, the codes for these messages may be interchanged in any way without affecting the average code length for the message ensemble. Therefore, it may be assumed that the messages in the ensemble have been ordered in a fashion such that

P(1) \ge P(2) \ge \cdots \ge P(N-1) \ge P(N)   (3)

and that, in addition, for an optimum code, the condition

L(1) \le L(2) \le \cdots \le L(N-1) \le L(N)   (4)

holds. This requirement is assumed to be satisfied throughout the following discussion.

It might be imagined that an ensemble code could assign q more digits to the Nth message than to the (N - 1)st message. However, the first L(N - 1) digits of the Nth message must not be used as the code for any other message. Thus the additional q digits would serve no useful purpose and would unnecessarily increase Lav. Therefore, for an optimum code it is necessary that L(N) be equal to L(N - 1).

The kth prefix of a message code will be defined as the first k digits of that
message code. Basic restriction (b) could then be restated as: No message
shall be coded in such a way that its code is a prefix of any other message,
or that any of its prefixes are used elsewhere as a message code.

Imagine an optimum code in which no two of the messages coded with length L(N) have identical prefixes of order L(N) - 1. Since an optimum code has been assumed, then none of these messages of length L(N) can have codes or prefixes of any order which correspond to other codes. It would then be possible to drop the last digit of all of this group of messages and thereby reduce the value of Lav. Therefore, in an optimum code, it is necessary that at least two (and no more than D) of the codes with length L(N) have identical prefixes of order L(N) - 1.
One additional requirement can be made for an optimum code. Assume that there exists a combination of the D different types of coding digits which is less than L(N) digits in length and which is not used as a message code or which is not a prefix of a message code. Then this combination of digits could be used to replace the code for the Nth message with a consequent reduction of Lav. Therefore, all possible sequences of L(N) - 1 digits must be used either as message codes, or must have one of their prefixes used as message codes.

The derived restrictions for an optimum code are summarized in condensed form below and considered in addition to restrictions (a) and (b) given in the first part of this paper:

(c) L(1) \le L(2) \le \cdots \le L(N-1) = L(N).

(d) At least two and not more than D of the messages with code length L(N) have codes which are alike except for their final digits.

(e) Each possible sequence of L(N) - 1 digits must be used either as a message code or must have one of its prefixes used as a message code.

Optimum Binary Code

For ease of development of the optimum coding procedure, let us now re-
strict ourselves to the problem of binary coding. Later this procedure will
be extended to the general case of D digits.

Restriction (c) makes it necessary that the two least probable messages have codes of equal length. Restriction (d) places the requirement that, for D equal to two, there be only two of the messages with coded length L(N) which are identical except for their last digits. The final digits of these two codes will be one of the two binary digits, 0 and 1. It will be necessary to assign these two message codes to the Nth and the (N - 1)st messages since at this point it is not known whether or not other codes of length L(N) exist. Once this has been done, these two messages are equivalent to a single composite message. Its code (as yet undetermined) will be the common prefix of order L(N) - 1 of these two messages. Its probability will be the sum of the probabilities of the two messages from which it was created. The ensemble containing this composite message in the place of its two component messages will be called the first auxiliary message ensemble.

This newly created ensemble contains one less message than the original. Its members should be rearranged if necessary so that the messages are again ordered according to their probabilities. It may be considered exactly as the original ensemble was. The codes for each of the two least probable messages in this new ensemble are required to be identical except in their final digits; 0 and 1 are assigned as these digits, one for each of the two messages. Each new auxiliary ensemble contains one less message than the preceding ensemble. Each auxiliary ensemble represents the original ensemble with full use made of the accumulated necessary coding requirements.

The procedure is applied again and again until the number of members in the most recently formed auxiliary message ensemble is reduced to two. One of each of the binary digits is assigned to each of these two composite messages. These messages are then combined to form a single composite message with probability unity, and the coding is complete.
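The combining procedure just described is easy to mechanize. The sketch below is a modern restatement using a priority queue (not code from the paper): it repeatedly brackets the two least probable (possibly composite) messages, and a message's code length is simply the number of brackets it takes part in.

```python
import heapq
from itertools import count

def huffman_code_lengths(probs):
    """Binary version of the paper's procedure: repeatedly combine the two
    least probable (possibly composite) messages; each original message's
    code length equals the number of combinations that contain it."""
    tie = count()  # tie-breaker so equal probabilities never compare the member lists
    heap = [(p, next(tie), [i]) for i, p in enumerate(probs)]
    heapq.heapify(heap)
    lengths = [0] * len(probs)
    while len(heap) > 1:
        p1, _, members1 = heapq.heappop(heap)
        p2, _, members2 = heapq.heappop(heap)
        for i in members1 + members2:  # every member of the new composite gains one digit
            lengths[i] += 1
        heapq.heappush(heap, (p1 + p2, next(tie), members1 + members2))
    return lengths

# The 13-message ensemble of Table 1:
probs = [0.20, 0.18, 0.10, 0.10, 0.10, 0.06, 0.06,
         0.04, 0.04, 0.04, 0.04, 0.03, 0.01]
lengths = huffman_code_lengths(probs)
print(round(sum(p * l for p, l in zip(probs, lengths)), 2))  # -> 3.42
```

Ties may be broken differently from Table 1, but the average length is unaffected, as the paper argues below.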

Now let us examine Table 1. The left-hand column contains the ordered message probabilities of the ensemble to be coded. N is equal to 13. Since each combination of two messages (indicated by a bracket) is accompanied by the assigning of a new digit to each, then the total number of digits which should be assigned to each original message is the same as the number of combinations indicated for that message. For example, the message marked *, or a composite of which it is a part, is combined with others five times, and therefore should be assigned a code length of five digits.

When there is no alternative in choosing the two least probable messages, then it is clear that the requirements, established as necessary, are also sufficient for deriving an optimum code. There may arise situations in which a choice may be made between two or more groupings of least likely messages. Such a case arises, for example, in the fourth auxiliary ensemble of Table 1. Either of the messages of probability 0.08 could have been combined with that of probability 0.06. However, it is possible to rearrange codes in any manner among equally likely messages without affecting the average code length, and so a choice of either of the alternatives could have been made. Therefore, the procedure given is always sufficient to establish an optimum binary code.

[Table 1. Optimum binary coding procedure. The left-hand column lists the ordered probabilities of the original message ensemble: 0.20, 0.18, 0.10, 0.10, 0.10, 0.06, 0.06, 0.04, 0.04, 0.04, 0.04, 0.03, 0.01. The remaining columns show auxiliary message ensembles 1 through 12, each formed by bracketing the two least probable entries of the preceding column and re-ordering by probability; the bracket layout itself is not recoverable in this reproduction. The message marked * takes part in five combinations and so receives a five-digit code.]

The lengths of all the encoded messages derived from Table 1 are given in Table 2.

Having now determined proper lengths of code for each message, the prob-
lem of specifying the actual digits remains. Many alternatives exist. Since
the combining of messages into their composites is similar to the succes-
sive confluences of trickles, rivulets, brooks, and creeks into a final large
river, the procedure thus far described might be considered analogous to
the placing of signs by a water-borne insect at each of these junctions as
he journeys downstream. It should be remembered that the code which we

desire is that one which the insect must remember in order to work his way back upstream. Since the placing of the signs need not follow the same rule, such as "zero-right-returning", at each junction, it can be seen that there are at least 2^12 different ways of assigning code digits for our example.

Table 2. Results of optimum binary coding procedure.

 i   P(i)  L(i)  P(i)L(i)  Code
 1   0.20   2     0.40     10
 2   0.18   3     0.54     000
 3   0.10   3     0.30     011
 4   0.10   3     0.30     110
 5   0.10   3     0.30     111
 6   0.06   4     0.24     0101
 7   0.06   5     0.30     00100
 8   0.04   5     0.20     00101
 9   0.04   5     0.20     01000
10   0.04   5     0.20     01001
11   0.04   5     0.20     00110
12   0.03   6     0.18     001110
13   0.01   6     0.06     001111

L_{av} = 3.42
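The code of Table 2 can be checked mechanically: restriction (b) says no codeword is the prefix of another, and restriction (e) forces the code tree to be complete, so the binary Kraft sum \sum_i 2^{-L(i)} comes out exactly 1. A quick verification sketch (not part of the paper; the codeword list is transcribed from Table 2):

```python
# Codewords from Table 2, messages 1 through 13.
codes = ["10", "000", "011", "110", "111", "0101", "00100",
         "00101", "01000", "01001", "00110", "001110", "001111"]

# Restriction (b): no codeword is a prefix of any other.
assert not any(a != b and b.startswith(a) for a in codes for b in codes)

# Completeness (restriction (e)): the binary Kraft sum is exactly 1.
print(sum(2.0 ** -len(c) for c in codes))  # -> 1.0
```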

The code in Table 2 was obtained by using the digit 0 for the upper message
and the digit 1 for the lower message of any bracket. It is important to note
in Table 1 that coding restriction (e) is automatically met as long as two
messages (and not one) are placed in each bracket.
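The "0 for the upper, 1 for the lower" rule can be carried out by keeping the bracket structure as a binary tree and reading the digits from the root down. The sketch below is one of the many valid assignments the paper alludes to, treating the more probable member of each bracket as "upper"; it is a modern restatement, not the paper's own notation:

```python
import heapq
from itertools import count

def huffman_codes(probs):
    """Bracket the two least probable messages repeatedly, keeping the tree;
    then assign 0 to the upper (more probable) branch of each bracket and 1
    to the lower one, reading digits from the root down."""
    tie = count()  # tie-breaker; also keeps heapq from comparing tree tuples
    heap = [(p, next(tie), i) for i, p in enumerate(probs)]  # leaves are message indices
    heapq.heapify(heap)
    while len(heap) > 1:
        p1, _, lower = heapq.heappop(heap)   # less probable member of the bracket
        p2, _, upper = heapq.heappop(heap)   # more probable member
        heapq.heappush(heap, (p1 + p2, next(tie), (upper, lower)))
    codes = {}
    def walk(node, prefix):
        if isinstance(node, tuple):
            walk(node[0], prefix + "0")  # upper message of the bracket gets 0
            walk(node[1], prefix + "1")  # lower message gets 1
        else:
            codes[node] = prefix
    walk(heap[0][2], "")
    return codes

probs = [0.20, 0.18, 0.10, 0.10, 0.10, 0.06, 0.06,
         0.04, 0.04, 0.04, 0.04, 0.03, 0.01]
codes = huffman_codes(probs)
print(round(sum(probs[i] * len(c) for i, c in codes.items()), 2))  # -> 3.42
```

Because ties can be broken either way, the individual codewords may differ from Table 2, but the code is still prefix-free and the average length is the same.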

Generalization of the Method

Optimum coding of an ensemble of messages using three or more types of digits is similar to the binary coding procedure. A table of auxiliary message ensembles similar to Table 1 will be used. Brackets indicating messages combined to form composite messages will be used in the same way as was done in Table 1. However, in order to satisfy restriction (e), it will be required that all these brackets, with the possible exception of one combining the least probable messages of the original ensemble, always combine a number of messages equal to D.

It will be noted that the terminating auxiliary ensemble always has one unity probability message. Each preceding ensemble is increased in number by D - 1 until the first auxiliary ensemble is reached. Therefore, if N_1 is the number of messages in the first auxiliary ensemble, then (N_1 - 1)/(D - 1) must be an integer. However N_1 = N - n_0 + 1, where n_0 is the number of the least probable messages combined in a bracket in the original ensemble. Therefore, n_0 (which, of course, is at least two and no more than D) must be of such a value that (N - n_0)/(D - 1) is an integer.

Table 3. Optimum coding procedure for D = 4.

Original ensemble   L(i)  Code
0.22                 1     1
0.20                 1     2
0.18                 1     3
0.15                 2     00
0.10                 2     01
0.08                 2     02
0.05                 3     030
0.02                 3     031

[In the original figure, the two least probable messages (0.05 and 0.02) are bracketed first into a composite of probability 0.07; the second bracket combines 0.15, 0.10, 0.08, and 0.07 into 0.40; the final four entries 0.40, 0.22, 0.20, and 0.18 receive the digits 0, 1, 2, and 3.]
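The size n_0 of the first bracket follows directly from this divisibility condition. A small helper (not from the paper; the function name is illustrative):

```python
def first_bracket_size(N, D):
    """Smallest n0 with 2 <= n0 <= D such that (N - n0) is divisible
    by (D - 1): the number of least probable messages to combine first."""
    assert N >= 2 and D >= 2
    return 2 + (N - 2) % (D - 1)

print(first_bracket_size(8, 4))  # the Table 3 example -> 2
```

For binary coding (D = 2) this always gives n_0 = 2, which is why no such adjustment was needed in the binary procedure.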

In Table 3 an example is considered using an ensemble of eight messages which is to be coded with four digits; n_0 is found to be 2. The code listed in the table is obtained by assigning the four digits 0, 1, 2, and 3, in order, to each of the brackets.

Acknowledgements

The author is indebted to Dr. W K Linvill and Dr. R M Fano, both of the Massachusetts Institute of Technology, for their helpful criticism of this paper.

Suggested Reading

[1] C E Shannon, A mathematical theory of communication, Bell Sys. Tech. J., Vol. 27, pp. 398-403, July 1948.

[2] R M Fano, The transmission of information, Technical Report No. 65, Research Laboratory of Electronics, M.I.T., Cambridge, Mass., 1949.

[3] L G Kraft, A device for quantizing, grouping, and coding amplitude-modulated pulses, Electrical Engineering Thesis, M.I.T., Cambridge, Mass., 1949.
