Notes For Data Compression

This document covers data compression techniques, distinguishing between lossless and lossy methods, and explaining various encoding techniques such as run-length encoding, Huffman coding, and Lempel Ziv encoding. It discusses the principles of data redundancy and human perception in relation to compression, as well as standards for compressing images (JPEG), video (MPEG), and audio (MP3). The document also outlines the importance of removing redundancy to manage storage requirements and improve transmission efficiency.

Uploaded by

shashanksingh884050

Data Compression
Objectives
After studying this chapter, the student should be able to:
• Distinguish between lossless and lossy compression.
• Describe run-length encoding and how it achieves compression.
• Describe Huffman coding and how it achieves compression.
• Describe Lempel Ziv encoding and the role of the dictionary in encoding and decoding.
• Describe the main idea behind the JPEG standard for compressing still images.
• Describe the main idea behind the MPEG standard for compressing video and its relation to JPEG.
• Describe the main idea behind the MP3 standard for compressing audio.

COMPRESSION PRINCIPLES

• It is desirable to compress digital audio, image, and video so that their bit rates or storage requirements become manageable.
• We achieve data compression by exploiting two major factors:
  o the redundancy existing in digital audio, image, and video data
  o the properties of human perception.

Data Redundancy
• Digital audio is a series of sample values.
• An image is a rectangular array of sample values (pixel values).
• Video is a sequence of images played out at a certain rate.
• These values are correlated with neighboring sample values. This correlation is called redundancy.
• Removal of redundancy does not change the meaning of the data.

Redundancy in Digital Audio
• In most cases, adjacent audio samples are similar.
• The next sample value can be predicted to some extent based on the current sample value.
  o Compression techniques that use this feature are called predictive coding.
• During a normal conversation, we talk for only a very small percentage of time. Between talking spurts, there is silence. Samples corresponding to this silence can be removed without affecting the meaning of the speech.
  o Compression techniques that use this feature are

Redundancy in Digital Images
• In a digital image, neighboring samples on a scanning line are normally similar.
• Neighboring samples on adjacent lines are also similar. These similarities are called spatial redundancy.
• Spatial redundancies can be removed using predictive coding techniques and other techniques (such as transform coding).

Redundancy in Digital Video
• Digital video is a sequence of images, thus it also has spatial redundancies.
• Neighboring images in a video sequence are normally similar.
• This similarity is called temporal redundancy and can be removed by applying predictive coding between images.

Human Perception Properties
• The end users of digital audio, image, and video are humans.
• Humans can tolerate some information error or loss without affecting the effectiveness of communication.
  o This means that the compressed version does not need to represent the original information samples exactly.
  o This is in contrast to conventional alphanumeric data, where any data loss or error is normally not allowed, especially for computer programs.
• The above feature indicates that human perception is generally not sensitive to small

Classifications of Compression Techniques
• Compression techniques can be classified in many ways according to different criteria.
• Here we classify them based on the results of the compression techniques.
• The two classifications we will consider are whether the original data can be reconstructed exactly after using a compression technique, and whether the output of a compression system is of constant bit rate.

Data compression techniques (methods) can be divided into two broad categories: lossless and lossy methods.
LOSSLESS COMPRESSION

• The integrity of the data is preserved.
• The original data and the data after compression and decompression are exactly the same because, in these methods, the compression and decompression algorithms are exact inverses of each other: no part of the data is lost in the process.
• Redundant data is removed in compression and added back during decompression.
• These methods are normally used when we cannot afford to lose any data.
Run-length encoding
o The simplest method of compression.
o The general idea behind this method is to replace
consecutive repeating occurrences of a symbol by one
occurrence of the symbol followed by the number of
occurrences.

Run-length encoding example
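The figure for this slide is not reproduced here. As a stand-in, the following is a minimal sketch of run-length encoding, assuming the input is a plain string and each run is stored as a (symbol, count) pair; the function names `rle_encode` and `rle_decode` and the sample string are illustrative, not taken from the slides.

```python
from itertools import groupby


def rle_encode(text: str) -> list[tuple[str, int]]:
    """Replace each run of a repeated symbol with one (symbol, run length) pair."""
    return [(symbol, len(list(run))) for symbol, run in groupby(text)]


def rle_decode(pairs: list[tuple[str, int]]) -> str:
    """Expand every (symbol, run length) pair back into the original run."""
    return "".join(symbol * count for symbol, count in pairs)


if __name__ == "__main__":
    sample = "BBBBBBBBBAAAAAAAAAAAAAAAANMMMMMMMMMM"  # hypothetical input
    encoded = rle_encode(sample)
    print(encoded)
    print(rle_decode(encoded) == sample)
```

Note that a string with few repeated symbols can grow under this scheme, since every run of length one still costs a symbol plus a count.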
Huffman coding
• Assigns shorter codes to symbols that occur more frequently.
• For example, we have a text file that uses only five characters (A, B, C, D, E).
• Before we can assign bit patterns to each character, we assign each character a weight based on its frequency of use. In this example, assume that the frequency of the characters is as shown in Table 15.1.

Figure 15.4 Huffman coding
A character’s code is found by starting at the root and
following the branches that lead to that character. The code
itself is the bit value of each branch on the path, taken in
sequence.

Figure 15.5 Final tree and code

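The tree construction can be sketched as follows. Table 15.1 is not reproduced in these notes, so the weights below are assumed values chosen only for illustration; `huffman_codes` is a hypothetical helper name. The sketch repeatedly merges the two lightest nodes and then reads each code off the path from the root, using 0 for a left branch and 1 for a right branch.

```python
import heapq
from itertools import count


def huffman_codes(weights: dict[str, int]) -> dict[str, str]:
    """Build a Huffman tree from symbol weights and return symbol -> bit string."""
    tie = count()  # unique tie-breaker so the heap never compares tree nodes
    heap = [(w, next(tie), sym) for sym, w in weights.items()]
    heapq.heapify(heap)
    if len(heap) == 1:  # a single symbol still needs a one-bit code
        return {heap[0][2]: "0"}
    while len(heap) > 1:
        w1, _, left = heapq.heappop(heap)   # lightest node
        w2, _, right = heapq.heappop(heap)  # second lightest node
        heapq.heappush(heap, (w1 + w2, next(tie), (left, right)))

    codes: dict[str, str] = {}

    def walk(node, prefix: str) -> None:
        if isinstance(node, tuple):  # internal node: (left, right)
            walk(node[0], prefix + "0")
            walk(node[1], prefix + "1")
        else:  # leaf: a symbol
            codes[node] = prefix

    walk(heap[0][2], "")
    return codes


if __name__ == "__main__":
    # Assumed weights; the real Table 15.1 is not available in this extract.
    assumed_weights = {"A": 17, "B": 12, "C": 12, "D": 27, "E": 32}
    for symbol, code in sorted(huffman_codes(assumed_weights).items()):
        print(symbol, code)
```

With these assumed weights, the frequent characters D and E receive 2-bit codes while the rarer B and C receive 3-bit codes, which is the effect the slide describes.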
Encoding
Let us see how to encode text using the code for our five
characters. Figure 15.6 shows the original and the encoded
text.

Figure 15.6 Huffman encoding

Decoding
The recipient has a very easy job in decoding the data it
receives. Figure 15.7 shows how decoding takes place.

Figure 15.7 Huffman decoding

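Encoding and decoding with a prefix code can be sketched as below. The code table `CODES` is an assumption made for illustration (the table from Figure 15.5 is not reproduced in these notes); any prefix-free table works the same way. Decoding is easy because no code is the prefix of another, so the receiver can emit a character as soon as the bits read so far match an entry.

```python
# Assumed prefix-free code table; not taken from the original figures.
CODES = {"A": "00", "B": "010", "C": "011", "D": "10", "E": "11"}


def encode(text: str) -> str:
    """Concatenate the code of every character."""
    return "".join(CODES[ch] for ch in text)


def decode(bits: str) -> str:
    """Read bits left to right and emit a character whenever a full code is matched."""
    reverse = {code: symbol for symbol, code in CODES.items()}
    decoded = []
    current = ""
    for bit in bits:
        current += bit
        if current in reverse:
            decoded.append(reverse[current])
            current = ""
    if current:
        raise ValueError("bit stream ends in the middle of a code")
    return "".join(decoded)


if __name__ == "__main__":
    message = "EAEBAECDEA"  # hypothetical message
    bits = encode(message)
    print(bits, len(bits), "bits")
    print(decode(bits))
```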
EXERCISE:

• Determine the Huffman code of each character in the following text:
  o before we can assign bit patterns to each character
• Solution:
  o Count the frequency of occurrence of each character in the text.
  o Build the Huffman tree.
  o Determine the code of each character.
Lempel Ziv encoding
• Also called dictionary-based encoding.
• It creates a dictionary (a table) of strings used during the communication session.
• If both the sender and the receiver have a copy of the dictionary, then previously encountered strings can be substituted by their index in the dictionary to reduce the amount of information transmitted.

Compression
• There are two concurrent events:
• Building an indexed dictionary
• Compressing a string of symbols.
• The algorithm extracts the smallest substring that cannot be
found in the dictionary from the remaining uncompressed
string.
• It then stores a copy of this substring in the dictionary as a
new entry and assigns it an index value.
• Compression occurs when the substring, except for the last
character, is replaced with the index found in the dictionary.
• The process then inserts the index and the last character of
the substring into the compressed string.

Figure 15.8 An example of Lempel Ziv encoding
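The compression steps described above can be sketched as follows. This is a minimal illustration in the style of LZ78, assuming the output is a list of (index, character) pairs where index 0 means "no dictionary entry"; the function name `lz_compress` and the sample string are illustrative, and the original figure is not reproduced here.

```python
def lz_compress(text: str) -> list[tuple[int, str]]:
    """Emit (dictionary index, next character) pairs while building the dictionary."""
    dictionary: dict[str, int] = {}  # substring -> 1-based index
    output: list[tuple[int, str]] = []
    current = ""
    for ch in text:
        candidate = current + ch
        if candidate in dictionary:
            # Keep extending until the substring is no longer in the dictionary.
            current = candidate
        else:
            # Smallest substring not yet in the dictionary: store it and
            # replace everything except its last character with an index.
            output.append((dictionary.get(current, 0), ch))
            dictionary[candidate] = len(dictionary) + 1
            current = ""
    if current:
        # Input ended on a substring that is already in the dictionary.
        output.append((dictionary[current], ""))
    return output


if __name__ == "__main__":
    print(lz_compress("BAABABBBAABBBBAA"))  # hypothetical input string
```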
Decompression
o Decompression is the inverse of the compression process.
o The process extracts the substrings from the compressed
string and tries to replace the indexes with the corresponding
entry in the dictionary, which is empty at first and built up
gradually.
o The idea is that when an index is received, there is already an entry in the dictionary corresponding to that index.

Figure 15.9 An example of Lempel Ziv decoding
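Decompression can be sketched in the same spirit. The sketch assumes the compressed string is a list of (index, character) pairs with index 0 meaning "empty prefix", which is an illustrative format rather than the one drawn in the figure; `lz_decompress` is a hypothetical name. The receiver rebuilds the same dictionary entry by entry, so every index it receives already refers to an existing entry.

```python
def lz_decompress(pairs: list[tuple[int, str]]) -> str:
    """Rebuild the text and the dictionary from (index, character) pairs."""
    dictionary = [""]  # position 0 is the empty prefix; real entries start at 1
    pieces = []
    for index, ch in pairs:
        entry = dictionary[index] + ch  # the index always points to an earlier entry
        pieces.append(entry)
        dictionary.append(entry)
    return "".join(pieces)


if __name__ == "__main__":
    compressed = [(0, "B"), (0, "A"), (2, "B"), (3, "B"), (1, "A"), (4, "B"), (5, "A")]
    print(lz_decompress(compressed))
```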
LOSSY COMPRESSION METHODS

Our eyes and ears cannot distinguish subtle changes. In such cases, we can use a lossy data compression method.
o These methods are cheaper: they take less time and space when it comes to sending millions of bits per second for images and video.
o Several methods have been developed using lossy compression techniques:
o JPEG (Joint Photographic Experts Group) encoding is used to compress pictures and graphics.
o MPEG (Moving Picture Experts Group) encoding is used to compress video.
o MP3 (MPEG audio layer 3) encoding is used to compress audio.
Image compression – JPEG encoding
• An image can be represented by a two-dimensional array (table) of picture elements (pixels).
• A grayscale picture of 307,200 pixels is represented by 2,457,600 bits (8 bits per pixel), and
• a color picture of the same size is represented by 7,372,800 bits (24 bits per pixel).

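The bit counts above can be checked with a short calculation. The sketch assumes a 640 × 480 picture, 8 bits per grayscale pixel, and 24 bits per color pixel, which are the values consistent with the figures quoted; `raw_size_bits` is an illustrative helper name.

```python
def raw_size_bits(width: int, height: int, bits_per_pixel: int) -> int:
    """Uncompressed size of an image in bits."""
    return width * height * bits_per_pixel


if __name__ == "__main__":
    print(640 * 480)                    # number of pixels
    print(raw_size_bits(640, 480, 8))   # grayscale, assuming 8 bits per pixel
    print(raw_size_bits(640, 480, 24))  # color, assuming 24 bits per pixel
```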
In JPEG:
• A grayscale picture is divided into blocks of 8 × 8 pixels to reduce the number of calculations.
• The number of mathematical operations for each picture is the square of the number of units.

Figure: JPEG grayscale example, 640 × 480 pixels
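To see why blocking reduces the work, here is a rough count under the slide's rule that the number of operations is the square of the number of units. The sketch assumes a 640 × 480 picture and treats "operations" as an abstract count, not a measured cost; both function names are illustrative.

```python
def ops_whole_picture(width: int, height: int) -> int:
    """Operations if the whole picture is transformed as one unit set."""
    return (width * height) ** 2


def ops_blocked(width: int, height: int, block: int = 8) -> int:
    """Operations if each block x block tile is transformed separately."""
    blocks = (width // block) * (height // block)
    return blocks * (block * block) ** 2


if __name__ == "__main__":
    whole = ops_whole_picture(640, 480)
    tiled = ops_blocked(640, 480)
    print(whole, tiled, whole // tiled)
```

Under these assumptions the blocked version needs 4,800 times fewer operations, which equals the number of 8 × 8 blocks in the picture.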
• JPEG changes the picture into a linear set (vector) of numbers that reveals the redundancies.
• The redundancies can then be removed using one of the lossless compression methods we studied previously.

Figure 15.11 The JPEG compression process

Discrete cosine transform (DCT)
• Each block of 64 pixels is transformed by the discrete cosine transform (DCT).
• The transformation changes the 64 values so that the relative relationships between pixels are kept but the redundancies are revealed.

The transform coding process using DCT
We apply a two-dimensional DCT to an image block of 8-by-8 elements.
For the forward transform, from definition [6] we have
$C = B A B^{T}$

where:
• $C$ is the transformed coefficient matrix of 8-by-8 elements.
• $B$ is the DCT transform matrix, whose entries are defined below.
• $A$ is the image data matrix of 8-by-8 elements to be coded.
• $B^{T}$ denotes the transpose of $B$.

For the inverse DCT (IDCT), we have
$A = B^{T} C B$

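The forward and inverse transforms can be tried out with a small pure-Python sketch. It assumes the common orthonormal DCT matrix (scale factor sqrt(1/8) for row 0 and sqrt(2/8) for the other rows), which may differ from the exact definition used in the slides; all function names are illustrative. Because this matrix is orthogonal, applying the inverse to the forward result recovers the original block up to floating-point error.

```python
import math


def dct_matrix(n: int = 8) -> list[list[float]]:
    """Orthonormal DCT matrix B (an assumed, commonly used definition)."""
    rows = []
    for i in range(n):
        scale = math.sqrt(1 / n) if i == 0 else math.sqrt(2 / n)
        rows.append([scale * math.cos((2 * j + 1) * i * math.pi / (2 * n))
                     for j in range(n)])
    return rows


def matmul(x, y):
    """Plain matrix product of two lists of lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]


def transpose(m):
    return [list(row) for row in zip(*m)]


def forward_dct(a):
    """C = B A B^T"""
    b = dct_matrix(len(a))
    return matmul(matmul(b, a), transpose(b))


def inverse_dct(c):
    """A = B^T C B"""
    b = dct_matrix(len(c))
    return matmul(matmul(transpose(b), c), b)


if __name__ == "__main__":
    block = [[20] * 8 for _ in range(8)]  # hypothetical uniform gray block
    coeffs = forward_dct(block)
    print(round(coeffs[0][0], 6))         # all the energy sits in the top-left entry
    restored = inverse_dct(coeffs)
    print(all(abs(restored[r][c] - 20) < 1e-9 for r in range(8) for c in range(8)))
```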
For the two-dimensional DCT with a block size of 8-by-8, we denote the entries of $B$ by $B(i, j)$, where $i$ is the row index from 0 to 7 and $j$ is the column index from 0 to 7.
According to the definition, we have

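The formula itself did not survive in these notes. As an assumption, one commonly used orthonormal definition of the 8-by-8 DCT matrix, consistent with $C = B A B^{T}$ and $A = B^{T} C B$, is the following; the source's definition [6] may use a different but equivalent normalization.

```latex
B(i, j) = c(i)\,\cos\!\left(\frac{(2j + 1)\, i\, \pi}{16}\right),
\qquad
c(i) =
\begin{cases}
\dfrac{1}{2\sqrt{2}}, & i = 0,\\[6pt]
\dfrac{1}{2}, & i = 1, \dots, 7.
\end{cases}
```

With this choice the rows of $B$ are orthonormal, so $B^{T} B = I$, which is what makes the inverse transform a simple transpose.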
Notice that after the transformation, most
energy is packed at the top left corner of C.
This means that image data are decorrelated
after the transform.
After quantization of the coefficients (here
each entry is divided by 30 and results are
rounded off to the nearest integer), we obtain
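The quantized matrix is not reproduced in these notes. The quantization step itself can be sketched as below, assuming a single step size of 30 for every coefficient as stated above; the coefficient values in the example are made up for illustration, and `quantize` and `dequantize` are hypothetical names.

```python
def quantize(coeffs: list[list[float]], step: int = 30) -> list[list[int]]:
    """Divide every coefficient by the step size and round to the nearest integer.

    Python's round() sends exact .5 ties to the nearest even integer.
    """
    return [[round(value / step) for value in row] for row in coeffs]


def dequantize(levels: list[list[int]], step: int = 30) -> list[list[int]]:
    """Approximate reconstruction: the rounding error is lost for good."""
    return [[level * step for level in row] for row in levels]


if __name__ == "__main__":
    sample = [[160, 14], [-46, 0]]  # made-up coefficients, not from the slides
    levels = quantize(sample)
    print(levels)
    print(dequantize(levels))
```

Small coefficients collapse to zero, which is where the loss in lossy compression comes from and why the result compresses well afterwards.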

To understand the nature of this transformation, let us show
the result of the transformations for three cases.

Figure 15.12 Case 1: uniform grayscale
Figure 15.13 Case 2: two sections
Figure 15.14 Case 3: gradient grayscale
Video compression – MPEG encoding
• The Moving Picture Experts Group (MPEG) method is used to compress video.
• In principle, a motion picture is a rapid sequence of a set of frames in which each frame is a picture.
• In other words, a frame is a spatial combination of pixels, and a video is a temporal combination of frames that are sent one after another.
• Compressing video, then, means spatially compressing each frame and temporally compressing a set of frames.

Spatial compression
The spatial compression of each frame is done with JPEG, or
a modification of it. Each frame is a picture that can be
independently compressed.

Temporal compression
In temporal compression, redundant frames are removed.
When we watch television, for example, we receive 30
frames per second. However, most of the consecutive frames
are almost the same. For example, in a static scene in which
someone is talking, most frames are the same except for the
segment around the speaker’s lips, which changes from one
frame to the next.
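The idea of exploiting similarity between consecutive frames can be illustrated with a toy frame-difference sketch. This is a deliberate simplification and not the MPEG algorithm: it assumes frames are small lists of pixel rows and stores only the pixels that changed, and the names `frame_delta` and `apply_delta` are illustrative.

```python
def frame_delta(previous, current):
    """List (row, column, new value) for every pixel that differs between frames."""
    return [(r, c, new)
            for r, (prev_row, cur_row) in enumerate(zip(previous, current))
            for c, (old, new) in enumerate(zip(prev_row, cur_row))
            if old != new]


def apply_delta(previous, delta):
    """Rebuild the current frame from the previous frame plus the changes."""
    frame = [row[:] for row in previous]
    for r, c, new in delta:
        frame[r][c] = new
    return frame


if __name__ == "__main__":
    frame1 = [[10, 10, 10], [10, 10, 10], [10, 10, 10]]
    frame2 = [[10, 10, 10], [10, 99, 10], [10, 10, 10]]  # only one pixel moved
    delta = frame_delta(frame1, frame2)
    print(delta)
    print(apply_delta(frame1, delta) == frame2)
```

In a mostly static scene the list of changes is far shorter than a full frame, which is the saving temporal compression relies on.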

Figure 15.16 MPEG frames
Audio compression
Audio compression can be used for speech or music. For speech we need to compress a 64 kbps digitized signal, while for music we need to compress a 1.411 Mbps signal. Two categories of techniques are used for audio compression: predictive encoding and perceptual encoding.

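The two rates quoted above correspond to bit rates that follow from standard sampling parameters. The sketch assumes telephone-quality speech sampled at 8,000 samples per second with 8 bits per sample, and CD-quality stereo music sampled at 44,100 samples per second with 16 bits per sample on two channels; `pcm_bit_rate` is an illustrative helper name.

```python
def pcm_bit_rate(sample_rate_hz: int, bits_per_sample: int, channels: int = 1) -> int:
    """Uncompressed bit rate in bits per second."""
    return sample_rate_hz * bits_per_sample * channels


if __name__ == "__main__":
    speech = pcm_bit_rate(8_000, 8)               # assumed telephone-quality speech
    music = pcm_bit_rate(44_100, 16, channels=2)  # assumed CD-quality stereo
    print(speech, "bps")
    print(music, "bps")
```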
Predictive encoding
In predictive encoding, the differences between samples are encoded instead of encoding all the sampled values. This type of compression is normally used for speech. Several standards have been defined, such as GSM (13 kbps), G.729 (8 kbps), and G.723.1 (6.3 or 5.3 kbps). Detailed discussions of these techniques are beyond the scope of this book.
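The simplest form of this idea, encoding each sample as its difference from the previous one, can be sketched as follows. It is only an illustration of the principle: the speech standards named above use far more elaborate predictors, and the function names and sample values here are made up.

```python
def delta_encode(samples: list[int]) -> list[int]:
    """Store the first sample, then only the difference from the previous sample."""
    previous = 0
    deltas = []
    for sample in samples:
        deltas.append(sample - previous)
        previous = sample
    return deltas


def delta_decode(deltas: list[int]) -> list[int]:
    """Rebuild the samples by accumulating the differences."""
    samples = []
    current = 0
    for delta in deltas:
        current += delta
        samples.append(current)
    return samples


if __name__ == "__main__":
    samples = [100, 102, 103, 103, 101, 98]  # hypothetical audio samples
    deltas = delta_encode(samples)
    print(deltas)
    print(delta_decode(deltas) == samples)
```

Because adjacent samples are similar, the differences are small numbers that need fewer bits than the samples themselves.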

Perceptual encoding: MP3

The most common compression technique used to create
CD-quality audio is based on the perceptual encoding
technique. This type of audio needs at least 1.411 Mbps,
which cannot be sent over the Internet without compression.
MP3 (MPEG audio layer 3) uses this technique.
Relations for relationship sets
For each relationship set in the E-R diagram, we create a
relation (table). This relation has one column for the key of
each entity set involved in this relationship and also one
column for each attribute of the relationship itself if the
relationship has attributes (not in our case).

ASSIGNMENT
• Explain the difference between lossless compression and lossy compression.
• Explain the following coding techniques and how each one achieves compression:
  o run-length encoding
  o Huffman coding
  o Lempel Ziv encoding
• Explain the idea underlying:
  o the JPEG standard for compressing still images
  o the MPEG standard for compressing video
  o the MP3 standard for compressing audio
