Unit 2
Uncertainty and Information
1. Introduction
In computer science, mathematics, and information theory, uncertainty
refers to the lack of complete knowledge about a system, event, or
outcome, while information represents the amount of knowledge
gained when uncertainty is reduced.
These concepts are foundational in probability theory, data
compression, cryptography, and decision-making systems.
Importance in Computer Science
1. Data Compression:
• Files are compressed by removing predictable (low-uncertainty) parts.
• Example: Huffman coding assigns shorter codes to more probable symbols (see the sketch after this list).
2. Cryptography:
• Secure keys have high uncertainty (entropy), making them hard to guess.
3. Machine Learning:
• Models reduce uncertainty about unseen data by learning patterns.
4. Communication Systems:
• Efficient transmission depends on quantifying uncertainty and minimizing errors.
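As a rough illustration of the data-compression point above, here is a minimal sketch using Python's standard zlib module (the byte strings are arbitrary choices, not from the notes): highly predictable, low-uncertainty data compresses far better than unpredictable, high-uncertainty data.

import os
import zlib

# Highly predictable (low-uncertainty) data: one byte repeated 10,000 times.
predictable = b"A" * 10_000
# Unpredictable (high-uncertainty) data: 10,000 random bytes.
unpredictable = os.urandom(10_000)

print(len(zlib.compress(predictable)))    # a few dozen bytes
print(len(zlib.compress(unpredictable)))  # close to 10,000 bytes (almost no compression)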
2. Uncertainty
Definition
Uncertainty is the degree to which the outcome of an event is
unpredictable.
If we already know the outcome of an event, there is no uncertainty. If
multiple outcomes are possible and we cannot predict with certainty
which will occur, there is uncertainty.
• Mathematically, uncertainty is linked with probability:
• Certain event: Probability = 1 (no uncertainty)
• Impossible event: Probability = 0 (no uncertainty)
• Uncertain event: 0 < Probability < 1
3. Measuring Uncertainty
Shannon Entropy
1. Introduction
• Shannon entropy, introduced by Claude E. Shannon in his landmark
1948 paper “A Mathematical Theory of Communication”, is a measure
of uncertainty in a random variable or the average information
content per outcome.
• It tells us how many bits, on average, are needed to represent the
outcome of a random process.
Definition:
For a discrete random variable X that takes value x with probability p(x), the Shannon entropy is
H(X) = − Σ p(x) log₂ p(x)
measured in bits (logarithm base 2).
4. Properties of Uncertainty
• Higher entropy → more uncertainty; outcomes are more unpredictable.
• Example: Tossing a fair coin (p(H)=0.5, p(T)=0.5) gives maximum uncertainty.
• Lower entropy → less uncertainty; outcomes are more predictable.
• Example: A biased coin (p(H)=0.99, p(T)=0.01) has less uncertainty.
• Zero entropy → no uncertainty; the outcome is known in advance.
• Example: Tossing a coin with both sides heads.
5. Properties of Shannon Entropy
• Non-negativity: H(X) ≥ 0 for every distribution.
• H(X) = 0 exactly when one outcome has probability 1.
• Maximum for the uniform distribution: with n equally likely outcomes, H(X) = log₂ n, the largest possible value.
• Additivity: H(X, Y) = H(X) + H(Y) when X and Y are independent.
6. Examples
6.1 Fair Coin Toss
p(H) = p(T) = 0.5, so H = −(0.5 log₂ 0.5 + 0.5 log₂ 0.5) = 1 bit.
6.2 Biased Coin Toss
p(H) = 0.99, p(T) = 0.01, so H = −(0.99 log₂ 0.99 + 0.01 log₂ 0.01) ≈ 0.08 bits.
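A minimal Python sketch of these two examples; the entropy helper below simply implements the definition H(X) = − Σ p(x) log₂ p(x).

import math

def entropy(probs):
    # Shannon entropy in bits; zero-probability outcomes contribute nothing.
    return -sum(p * math.log2(p) for p in probs if p > 0)

print(entropy([0.5, 0.5]))    # fair coin       -> 1.0 bit (maximum uncertainty)
print(entropy([0.99, 0.01]))  # biased coin     -> about 0.08 bits
print(entropy([1.0]))         # certain outcome -> 0.0 bits (no uncertainty)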
INFORMATION
Definition
Information is the reduction of uncertainty.
When we learn the outcome of an event, we gain information.
More uncertainty resolved → more information gained.
Shannon’s View of Information:
The information gained by observing an outcome x of probability p(x) is
I(x) = log₂(1 / p(x)) = − log₂ p(x) bits,
so rare (low-probability) outcomes carry more information than common ones.
Relationship Between Uncertainty and Information
• Before knowing the outcome: We have uncertainty.
• After knowing the outcome: Uncertainty decreases, and the reduction is information.
• Example:
• Before tossing a fair coin: H=1 bit (uncertainty).
• After knowing "Heads": Uncertainty = 0 → Information gained = 1 bit.
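A short sketch of this idea using self-information I(x) = − log₂ p(x); the probabilities below are illustrative.

import math

def self_information(p):
    # Information gained (in bits) from observing an outcome of probability p.
    return -math.log2(p)

print(self_information(0.5))   # fair-coin outcome      -> 1.0 bit
print(self_information(1/6))   # one face of a fair die -> about 2.585 bits
print(self_information(0.99))  # near-certain outcome   -> about 0.014 bits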
Joint Entropy, Conditional Entropy, and Mutual Information
• Joint Entropy
Definition:
For two discrete random variables X and Y with joint distribution p(x, y), the joint entropy is
H(X, Y) = − Σ Σ p(x, y) log₂ p(x, y),
the average uncertainty of the pair (X, Y) taken together.
Meaning of p(x,y)
• p(x,y) is the joint probability that two events happen at the same
time:
• Event X=x
• Event Y=y
• It’s the probability of both conditions being true simultaneously.
Formula:
p(x, y) = p(x) · p(y | x) = p(y) · p(x | y); if X and Y are independent, p(x, y) = p(x) · p(y).
Steps to Calculate p(x,y)
1. List all possible pairs (x, y) of outcomes.
2. For each pair, find the probability that X = x and Y = y occur together (from a joint probability table, or by dividing the joint frequency by the total count).
3. Check that the probabilities over all pairs sum to 1.
Example: Weather and Umbrella
Example: Two Independent Fair Coins
Example: Perfectly Correlated Dice Rolls
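A sketch computing H(X, Y) directly from a joint probability table for the last two examples above; the joint_entropy helper simply applies the definition.

import math

def joint_entropy(joint):
    # H(X, Y) = -sum over (x, y) of p(x, y) * log2 p(x, y)
    return -sum(p * math.log2(p) for p in joint.values() if p > 0)

# Two independent fair coins: p(x, y) = 0.25 for every pair.
coins = {(x, y): 0.25 for x in "HT" for y in "HT"}
print(joint_entropy(coins))  # 2.0 bits = H(X) + H(Y)

# Perfectly correlated dice: Y always equals X, so only the six diagonal pairs occur.
dice = {(i, i): 1 / 6 for i in range(1, 7)}
print(joint_entropy(dice))   # about 2.585 bits = H(X); Y adds no extra uncertainty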
Conditional Entropy
Relation between joint entropy and conditional entropy
Definitions:
The conditional entropy of Y given X is
H(Y | X) = − Σ Σ p(x, y) log₂ p(y | x),
the average uncertainty that remains about Y once X is known.
Relation (chain rule): H(X, Y) = H(X) + H(Y | X).
PROOF:
Since p(x, y) = p(x) · p(y | x),
H(X, Y) = − Σ p(x, y) log₂ p(x, y)
= − Σ p(x, y) [log₂ p(x) + log₂ p(y | x)]
= − Σ p(x, y) log₂ p(x) − Σ p(x, y) log₂ p(y | x)
= H(X) + H(Y | X),
where the first sum collapses to H(X) because Σ over y of p(x, y) equals p(x).
Numerical Example:
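A sketch that checks the chain rule H(X, Y) = H(X) + H(Y | X) numerically; the weather/umbrella joint table below is an illustrative assumption, not taken from the notes.

import math
from collections import defaultdict

# Illustrative joint distribution p(x, y); X = weather, Y = umbrella.
joint = {("rain", "yes"): 0.30, ("rain", "no"): 0.10,
         ("sun", "yes"): 0.05, ("sun", "no"): 0.55}

def H(probs):
    return -sum(p * math.log2(p) for p in probs if p > 0)

px = defaultdict(float)              # marginal p(x)
for (x, _), p in joint.items():
    px[x] += p

H_XY = H(joint.values())             # joint entropy H(X, Y)
H_X = H(px.values())                 # marginal entropy H(X)
# Conditional entropy H(Y | X) = -sum p(x, y) * log2( p(x, y) / p(x) )
H_Y_given_X = -sum(p * math.log2(p / px[x]) for (x, _), p in joint.items() if p > 0)

print(H_XY)               # joint entropy
print(H_X + H_Y_given_X)  # same value: the chain rule holds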
Properties of Conditional Entropy:
• H(Y | X) ≥ 0.
• H(Y | X) ≤ H(Y): knowing X can never increase the uncertainty about Y; equality holds when X and Y are independent.
• H(X | X) = 0: a variable leaves no uncertainty about itself.
• Chain rule: H(X, Y) = H(X) + H(Y | X) = H(Y) + H(X | Y).
Mutual Information (MI)
Definition
• Mutual information measures the amount of information that one random
variable contains about another.
It quantifies the reduction in uncertainty of one variable due to the knowledge of
the other.
Properties of MI:
• Non-negativity: I(X; Y) ≥ 0, with equality if and only if X and Y are independent.
• Symmetry: I(X; Y) = I(Y; X).
• Relation to entropies: I(X; Y) = H(X) − H(X | Y) = H(Y) − H(Y | X) = H(X) + H(Y) − H(X, Y).
• I(X; X) = H(X): a variable shares all of its information with itself.
Interpretation
• High MI: Knowing X tells a lot about Y (strong relationship).
• Low MI: Knowing X tells little about Y.
• Zero MI: X and Y are independent.
Formula from Probabilities
I(X; Y) = Σ Σ p(x, y) log₂ [ p(x, y) / (p(x) p(y)) ].
Numerical Example
Case where MI > 0:
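A sketch computing I(X; Y) from a joint table: MI > 0 for the dependent pair below and MI = 0 for independent coins; both joint tables are illustrative.

import math
from collections import defaultdict

def mutual_information(joint):
    # I(X; Y) in bits, computed directly from a joint distribution p(x, y).
    px, py = defaultdict(float), defaultdict(float)
    for (x, y), p in joint.items():
        px[x] += p
        py[y] += p
    return sum(p * math.log2(p / (px[x] * py[y]))
               for (x, y), p in joint.items() if p > 0)

# Dependent variables (the same table as in the chain-rule sketch): MI > 0.
dependent = {("rain", "yes"): 0.30, ("rain", "no"): 0.10,
             ("sun", "yes"): 0.05, ("sun", "no"): 0.55}
print(mutual_information(dependent))    # about 0.36 bits

# Two independent fair coins: MI = 0.
independent = {(x, y): 0.25 for x in "HT" for y in "HT"}
print(mutual_information(independent))  # 0.0 bits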
Kullback–Leibler divergence
The Kullback–Leibler (KL) divergence, or relative entropy, between two distributions P and Q over the same outcomes is
D(P ∥ Q) = Σ p(x) log₂ [ p(x) / q(x) ].
It measures how different P is from Q: it is always non-negative, equals 0 only when P = Q, and is not symmetric.
Mutual information is the KL divergence between the joint distribution and the product of the marginals:
I(X; Y) = D( p(x, y) ∥ p(x) p(y) ).
Proof that Mutual Information is Always Non-negative
Since I(X; Y) = D( p(x, y) ∥ p(x) p(y) ) and the KL divergence is never negative (a consequence of Jensen's inequality applied to the concave logarithm), it follows that I(X; Y) ≥ 0, with equality exactly when p(x, y) = p(x) p(y) for all x, y, i.e. when X and Y are independent.
Example:
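A sketch of the KL divergence between two hand-picked distributions, illustrating the non-negativity used above; the distributions are arbitrary examples.

import math

def kl_divergence(p, q):
    # D(P || Q) in bits; assumes q[i] > 0 wherever p[i] > 0.
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

fair   = [0.5, 0.5]
biased = [0.9, 0.1]

print(kl_divergence(biased, fair))  # about 0.53 bits (> 0)
print(kl_divergence(fair, biased))  # about 0.74 bits (note: not symmetric)
print(kl_divergence(fair, fair))    # 0.0 bits (identical distributions)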
Definition of a Code
• A code is a mapping (assignment) of a sequence of bits (codewords)
to a set of source symbols.
• Purpose: Represent information (symbols, messages) in a binary form
suitable for storage or transmission.
Definition of Prefix Codes
• A prefix code is a variable-length code in which no codeword is a
prefix of another codeword.
• This ensures instantaneous decoding: as soon as a codeword is read,
it can be uniquely identified without looking ahead.
Example:
• Prefix code: {0, 10, 110, 111} (no codeword starts with another).
• Not a prefix code: {0, 01, 011} ✘ (because "0" is a prefix of "01" and "011").
Uniquely Decipherable (UD) Codes
Definition
• A code is uniquely decipherable if every encoded sequence can be
decoded into exactly one sequence of source symbols.
• This ensures that no two different symbol sequences produce the
same encoded string.
• UD codes may or may not be prefix-free.
Instantaneous Codes (Prefix-free Codes)
Definition
• An instantaneous code (or prefix-free code) is one where no
codeword is the prefix of another codeword.
• Guarantees immediate (instantaneous) decoding — as soon as a
codeword is read, you know where it ends.
• All instantaneous codes are uniquely decipherable.
Classes of Codes
• Source coding: Process of representing data (symbols from a source)
with as few bits as possible (compression). Example: Huffman coding,
Shannon-Fano coding.
• Entropy coding: A specific type of source coding where the code
length is based on the probability of symbols, approaching the
entropy limit. Examples: Arithmetic coding, Huffman coding.
Difference: Source coding is the broader concept; entropy coding
is a subset of it that uses symbol probabilities to achieve optimal
compression.
Kraft Inequality I
1. Introduction
• In information theory and coding, Kraft’s inequality gives a
fundamental condition that must be satisfied by the lengths of
codewords in a prefix code (also called instantaneous code).
• It tells us whether it is possible to construct a prefix code with given codeword lengths: for a binary alphabet, a prefix code with codeword lengths l1, l2, …, ln exists if and only if 2^(−l1) + 2^(−l2) + … + 2^(−ln) ≤ 1.
• It is the mathematical foundation for designing efficient codes like
Huffman codes.
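A small sketch that checks the Kraft sum 2^(−l1) + … + 2^(−ln) for proposed codeword lengths; the length sets below are illustrative.

def kraft_sum(lengths, D=2):
    # Kraft sum: total of D**(-l) over the proposed codeword lengths.
    return sum(D ** (-l) for l in lengths)

# Lengths of the prefix code {0, 10, 110, 111}: the sum is exactly 1 (a complete code).
print(kraft_sum([1, 2, 3, 3]))  # 1.0  -> a binary prefix code with these lengths exists
# Lengths 1, 1, 2 violate the inequality: no binary prefix code can have them.
print(kraft_sum([1, 1, 2]))     # 1.25 -> impossible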
Source Coding Theorem (Shannon’s Noiseless Coding Theorem)
No uniquely decipherable code can have an average codeword length smaller than the source entropy H(X); conversely, there always exists a prefix code whose average length is less than H(X) + 1.
Optimal Codes
Why Optimal Codes?
• Some symbols occur more frequently → they should get shorter
codewords.
• Rare symbols should get longer codewords.
• This reduces redundancy and saves storage/bandwidth.
The principle is the same as in Morse code, Huffman code,
Arithmetic coding, etc.
Bounds on the Optimal Codelength:
If L* denotes the average codeword length of an optimal prefix code for a source X, then
H(X) ≤ L* < H(X) + 1.
Block Coding
Definition
• Block coding is a source coding technique in which a sequence of k
source symbols is grouped into a block, and then encoded into a
fixed-length or variable-length codeword.
• Instead of encoding one symbol at a time, we encode multiple
symbols together.
• This improves efficiency and allows the average codeword length to
approach entropy more closely.
If Ln denotes the optimal average codeword length for blocks of n symbols, then n·H(X) ≤ Ln < n·H(X) + 1, i.e. H(X) ≤ Ln / n < H(X) + 1/n.
This means that as we group larger and larger blocks of symbols, the average length per symbol gets closer and closer to the entropy.
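A small numerical sketch of this bound for a biased binary source with probabilities 0.9 and 0.1 (an illustrative choice): the per-symbol upper bound H(X) + 1/n shrinks toward H(X) as the block length n grows.

import math

def entropy(probs):
    return -sum(p * math.log2(p) for p in probs if p > 0)

H = entropy([0.9, 0.1])  # about 0.469 bits/symbol
for n in (1, 2, 4, 8, 16):
    # The optimal per-symbol length for blocks of n symbols lies in [H, H + 1/n).
    print(f"n={n}: between {H:.3f} and {H + 1/n:.3f} bits/symbol")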
Shannon–Fano Coding
• Shannon-Fano coding, named after Claude Shannon and Robert Fano, was one of the first algorithms for constructing variable-length codes from symbol probabilities, although the codes it produces are not always optimal. We start with a set of n symbols with known probabilities (or frequencies) of occurrence.
• Idea: Construct a variable-length prefix code based on symbol
probabilities.
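A sketch of the Shannon-Fano construction: sort the symbols by decreasing probability, split the list into two groups of nearly equal total probability, assign 0 to one group and 1 to the other, and recurse on each group. The symbol probabilities below are illustrative.

def shannon_fano(probs):
    # probs: dict symbol -> probability. Returns dict symbol -> codeword (bit string).
    codes = {sym: "" for sym in probs}

    def split(group):
        # group: list of (symbol, probability) sorted by decreasing probability
        if len(group) <= 1:
            return
        total = sum(p for _, p in group)
        running, cut, best_diff = 0.0, 1, float("inf")
        for i in range(1, len(group)):
            running += group[i - 1][1]
            diff = abs(2 * running - total)   # |prob(left part) - prob(right part)|
            if diff < best_diff:
                best_diff, cut = diff, i
        left, right = group[:cut], group[cut:]
        for sym, _ in left:
            codes[sym] += "0"
        for sym, _ in right:
            codes[sym] += "1"
        split(left)
        split(right)

    split(sorted(probs.items(), key=lambda item: -item[1]))
    return codes

print(shannon_fano({"a": 0.4, "b": 0.3, "c": 0.15, "d": 0.15}))
# {'a': '0', 'b': '10', 'c': '110', 'd': '111'}: more probable symbols get shorter codewords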
Huffman Coding
Idea: Construct an optimal prefix code by building a binary tree based on symbol probabilities.
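A compact Huffman sketch built with Python's heapq, using the same illustrative probabilities as the Shannon-Fano sketch; it also compares the average codeword length with the source entropy. In this particular example the two methods happen to produce the same code; in general Huffman coding is optimal, while Shannon-Fano can fall slightly short.

import heapq
import math

def huffman(probs):
    # probs: dict symbol -> probability. Returns dict symbol -> codeword.
    # Each heap entry: (subtree probability, tie-breaker, {symbol: partial codeword}).
    heap = [(p, i, {sym: ""}) for i, (sym, p) in enumerate(probs.items())]
    heapq.heapify(heap)
    if len(heap) == 1:
        return {sym: "0" for sym in probs}  # degenerate single-symbol source
    counter = len(heap)
    while len(heap) > 1:
        p1, _, codes1 = heapq.heappop(heap)  # the two least probable subtrees
        p2, _, codes2 = heapq.heappop(heap)
        merged = {sym: "0" + cw for sym, cw in codes1.items()}
        merged.update({sym: "1" + cw for sym, cw in codes2.items()})
        heapq.heappush(heap, (p1 + p2, counter, merged))
        counter += 1
    return heap[0][2]

probs = {"a": 0.4, "b": 0.3, "c": 0.15, "d": 0.15}
codes = huffman(probs)
print(codes)  # {'a': '0', 'b': '10', 'c': '110', 'd': '111'}

avg_len = sum(probs[s] * len(codes[s]) for s in probs)
H = -sum(p * math.log2(p) for p in probs.values())
print(avg_len, H)  # average length 1.9 bits/symbol vs entropy about 1.87 bits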