0% found this document useful (0 votes)

75 views6 pages

Lecture 12 - Quantization

This document provides an overview of number representation and quantization effects in digital signal processing. It discusses fixed-point representation using sign-magnitude, one's-complement, and two's-complement formats. Floating-point representation stores numbers as a mantissa and exponent. Quantization is the process of mapping a continuous set of values to a discrete set, and can cause truncation or rounding errors. Fixed-point uses truncation or rounding, while floating-point quantizes the mantissa only.

Uploaded by

Vasco Rodrigues

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

75 views6 pages

Lecture 12 - Quantization

Uploaded by

Vasco Rodrigues

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Lecture 12: Number representation and Quantization effects

Instructor: Dr. Gleb V. Tcheslavski Contact: [email protected] Office Hours: Room 2030 Class web site: http://ee.lamar.edu/gleb/ds p/index.htm p/index htm

ELEN 5346/4304

DSP and Filter Design

Fall 2008

Representation of numbers
Up to this point, we were considering implementations of discrete-time systems without any considerations of finite-word-length effects that are inherent in any digital realization, whether in hardware or software. realization software Let us consider first two different representations of numbers.

1. Fixed-point representation.
A real number X is represented as:

X = ( b A ,..., b1 , b0 , b1 ,...bB )r =

i = A

br
i

, 0 bi (r 1)

(12.2.1)

Where bi represents the digit, r is the radix or base, A is the number of integer digits, and B is the number of fractional digits. For example:

(1223.45)10 = 1102 + 2 101 + 3 100 + 4 101 + 5 102 (101.01)2 = 1 22 + 0 21 + 1 20 + 0 21 + 1 22

Representation of numbers
We will focus our attention on the binary representation as most important for DSP. In this case r = 2 and the digits {bi} are called binary digits or bits. They take the values {0 1} The binary digit b-A is called the most significant bit {0, 1}. (MSB) of the number, and the binary digit bB is called the least significant bit (LSB) of the number. The binary point between the digits b0 and b1 does not exist explicitly and the logics assumes location of this point. By using an n-bit integer format (A = n-1, B = 0), we can represent unsigned integer numbers from 0 to 2n-1. More frequently, the fractional format (A = 0, B = n-1) is used with a binary point between b0 and b1 that can represent numbers from 0 to 1-2-n. Any integer or mixed number can be represented in a fraction format by factoring out the term r A. There are three formats to represent negative numbers. The format for the positive numbers is the same: the MSB is set to zero

X = 0.b1b2 ...bB = bi 2i , X 0
i =1
ELEN 5346/4304

(12.3.1)

DSP and Filter Design

Fall 2008

Representation of numbers
The negative numbers can be represented by: 1) Sign-Magnitude format: MSB is set to 1 to represent -

X SM = 1 b1b2 ...bB 1.
2) Ones-Complement Format:

for f X 0

(12.4.1)

X 1C = 1.b1b2 ...bB

for X 0

(12.4.2)

Where bi = 1 bi is the complement of bi (i.e., we replace ones by zeros and zeros by ones for all bits). B

X 1C = 1 20 + (1 bi ) 21 = 2 X 2 B
i= i 1

(12.4.3)

3) Twos-Complement Format:

X 2C = 1.b1b2 ...bB 0 0...0 1 for X < 0

(12.4.4)

Where is modulo-2 addition. For example, -3/8 is obtained by complementing 0011 (3/8) to obtain 1100 and then adding 0001, which yields 1101 to represent -3/8 in the twos-complement format.

Representation of numbers
The basic operations of addition and multiplication depend on the format used. Most fixed-point digital signal processors use twos-complement arithmetic, therefore, the range for (B + 1) bit number ranges f th f th f b from -1 t 1-2-B. 1 to 1 2 B In general, the multiplication of two fixed-point numbers each of b bits in length results in a product of 2b bits of length. The product is either truncated or rounded back to b bits resulting either in truncation or rounding errors. A fixed-point representation allows to cover a range of numbers, say, xmax xmin with a fixed resolution:

xmax xmin m 1

(12.5.1)

where m = 2b is the number of levels and b is the number of bits.

ELEN 5346/4304

DSP and Filter Design

Fall 2008

Representation of numbers
2. Floating-point representation.
Covers a larger dynamic range by representing the number X as

X = 2E M
where M is a mantissa the fractional part of the number: 0.5 M 1, E (exponent) is either negative or positive number. Both mantissa and exponent require additional sign bits for representing negative numbers. For example:

(12.6.1)

X 1 = 5 M 1 = 0.101000; E1 = 011; X 2 = 3 8 M 2 = 0.111000; E2 = 101 ;

Multiplication of two floating-point numbers is done by multiplying their mantissas and adding their exponents. Addition of two floating-point numbers requires that the exponents must be equal, which can be achieved by shifting the mantissa of the smaller number to the right and compensating by increasing the corresponding exponent. This, in general, may lead to loss of precision.

Representation of numbers
Overflow occurs in the multiplication of two floating-point numbers when the sum of the exponents exceeds the dynamic range of the fixed-point representation of the th exponent. t The floating-point representation allows us to cover a larger dynamic range than the fixed-point representation by varying the resolution across the range. The distance between two successive floating-point numbers increases as the numbers increase in size. Also, the floating-point representation provides finer resolution for small numbers but coarser resolution for large numbers.

ELEN 5346/4304

DSP and Filter Design

Fall 2008

Quantization
1. Fixed-point: truncation
To truncate a fixed-point number from (+1) bits to (b+1) bits, we just discard the least significant (-b) bits. The truncation error is denoted by

t = Q( X ) X

(12.8.1)

Here Q(X) is the truncated version of the number X. For a positive X, the error is equal t zero if all bit b i di h l to ll bits being discharged are zeros and i l d d is largest if all di h t ll discharged d bits are ones.

(2 b 2 ) t 0

(12.8.2)

Quantization
For a negative X, the truncation error will be different for three different formats: 1) Sign-Magnitude:

0 t 2 b 2
2) Ones-complement:

(12.9.1)

0 t 2 b 2
3) Twos-complement:

(12.9.2)

( 2 b 2 ) t 0

(12.9.3)

ELEN 5346/4304

DSP and Filter Design

Fall 2008

Quantization
2. Fixed-point: rounding
In case of rounding, the number is quantized to the nearest quantization level. The rounding error does not depend on the format used to represent negative numbers:

1 b ( 2 2 ) < r 1 ( 2 b 2 ) 2 2

(12.10.1)

In practice, >> b, therefore, 2- 0 in all expressions considered.

Quantization
3. Floating-point
Considering a floating-point representation floating point

Q ( X ) = 2E Q ( M )
X = 2E M

(12.11.1)

of a number

(12.11.2)

Quantization is carried out on the mantissa only in case of floating-point numbers. Therefore, it is more reasonable to consider the relative error.

Q( X ) X Q(M ) M = X M

(12.11.3)

In practice, a rounding quantizer can be modeled as follows:

Q ( X ) = 2 B round ( X 2+ B )
ELEN 5346/4304

(12.11.4)

DSP and Filter Design

Fall 2008

Signal Quantization in MATLAB Lab
No ratings yet
Signal Quantization in MATLAB Lab
14 pages
Finite Word Length Effects
No ratings yet
Finite Word Length Effects
31 pages
Unit 5 - Share
No ratings yet
Unit 5 - Share
38 pages
Cacc
No ratings yet
Cacc
106 pages
Module 1 DSPA Chapter 2
No ratings yet
Module 1 DSPA Chapter 2
8 pages
Finite Word Length Effects in Digital Filter
No ratings yet
Finite Word Length Effects in Digital Filter
26 pages
01 DigitalNumericalFormats
No ratings yet
01 DigitalNumericalFormats
27 pages
COA Unit 2
No ratings yet
COA Unit 2
23 pages
Chapter 5 Part 1
No ratings yet
Chapter 5 Part 1
17 pages
DSP Arithmetic
No ratings yet
DSP Arithmetic
33 pages
Fixed vs. Floating Point in Computing
No ratings yet
Fixed vs. Floating Point in Computing
24 pages
SW Lab 3 Fixed Point Simulation EE 462
No ratings yet
SW Lab 3 Fixed Point Simulation EE 462
7 pages
Module 1 Data Rep
No ratings yet
Module 1 Data Rep
14 pages
DSP Finite Word Length Effects
No ratings yet
DSP Finite Word Length Effects
26 pages
Unit 2
No ratings yet
Unit 2
16 pages
Rohini 98229548802
No ratings yet
Rohini 98229548802
5 pages
Computer Arithmetic (5 Hours)
No ratings yet
Computer Arithmetic (5 Hours)
27 pages
Fixed Point and Floating Point Number Representations
No ratings yet
Fixed Point and Floating Point Number Representations
7 pages
Floating-Point to Fixed-Point in Audio
No ratings yet
Floating-Point to Fixed-Point in Audio
10 pages
Finite Word Length
No ratings yet
Finite Word Length
13 pages
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
No ratings yet
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
32 pages
Finite Word Length Effects in DSP
No ratings yet
Finite Word Length Effects in DSP
28 pages
Computer Arithmetic Representations
No ratings yet
Computer Arithmetic Representations
24 pages
Digital Signal Processing Unit IV Guide
No ratings yet
Digital Signal Processing Unit IV Guide
44 pages
Finite Word Length Effects in DSP
No ratings yet
Finite Word Length Effects in DSP
11 pages
Coa Unit 2
No ratings yet
Coa Unit 2
35 pages
Number Representation Explained
No ratings yet
Number Representation Explained
5 pages
Computer Arithmetic Representations
No ratings yet
Computer Arithmetic Representations
24 pages
Lecture 4
No ratings yet
Lecture 4
154 pages
Number Representation
No ratings yet
Number Representation
7 pages
Fixed vs Floating Point Representation
No ratings yet
Fixed vs Floating Point Representation
5 pages
3-EED220 Lecture 3
No ratings yet
3-EED220 Lecture 3
22 pages
Unit 2
No ratings yet
Unit 2
85 pages
Lab # 06 PDF
No ratings yet
Lab # 06 PDF
12 pages
Finite Precision
No ratings yet
Finite Precision
50 pages
Understanding Floating Point Numbers
No ratings yet
Understanding Floating Point Numbers
22 pages
CO III SEM UNIT V (1) Anu Degree Notes For Co
No ratings yet
CO III SEM UNIT V (1) Anu Degree Notes For Co
32 pages
Binary Number Systems Explained
No ratings yet
Binary Number Systems Explained
34 pages
Mailam Engineering College Mailam (Po), Villupuram (DT) - Pin: 604 304
No ratings yet
Mailam Engineering College Mailam (Po), Villupuram (DT) - Pin: 604 304
43 pages
Digital Signal Processors & Architecture
No ratings yet
Digital Signal Processors & Architecture
190 pages
Floating Point & Fixed Point Representation - BCA II
No ratings yet
Floating Point & Fixed Point Representation - BCA II
24 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
5 pages
Unit 1 Part C
No ratings yet
Unit 1 Part C
10 pages
Computer Arithmetic: Part II: Integer Arithmetic & Floating Point
No ratings yet
Computer Arithmetic: Part II: Integer Arithmetic & Floating Point
30 pages
Floating Point 6up
No ratings yet
Floating Point 6up
7 pages
Data Representation & Number Systems
No ratings yet
Data Representation & Number Systems
29 pages
Computer Architecture: Data Types
No ratings yet
Computer Architecture: Data Types
25 pages
DSP Arithmetic for Academics
No ratings yet
DSP Arithmetic for Academics
96 pages
Signal Quantization in MATLAB Lab
No ratings yet
Signal Quantization in MATLAB Lab
10 pages
Unit V Finite Word Length Effects in Digital Filters
75% (4)
Unit V Finite Word Length Effects in Digital Filters
3 pages
Signed Number Representation in Binary
No ratings yet
Signed Number Representation in Binary
48 pages
Binary Data Representation Guide
No ratings yet
Binary Data Representation Guide
27 pages
L4
No ratings yet
L4
29 pages
COA - Unit 2 Data Representation 1
No ratings yet
COA - Unit 2 Data Representation 1
59 pages
DLCO Unit-1
100% (3)
DLCO Unit-1
38 pages
R T D S P: EAL IME Igital Ignal Rocessing
No ratings yet
R T D S P: EAL IME Igital Ignal Rocessing
56 pages
L-5 Floating Point Representation of Numbers
No ratings yet
L-5 Floating Point Representation of Numbers
21 pages
Data Representation
No ratings yet
Data Representation
28 pages
Discrete Mathematics, 1ma462, Spring 2021
No ratings yet
Discrete Mathematics, 1ma462, Spring 2021
2 pages
Coin Row Problem: Optimal Selection Algorithm
No ratings yet
Coin Row Problem: Optimal Selection Algorithm
8 pages
Ch15 Student
No ratings yet
Ch15 Student
14 pages
(BFS) Breadth First Search Is A Traversal Technique in Which We Traverse All The Nodes of The Graph in A Breadth-Wise Motion. in BFS, We Traverse
No ratings yet
(BFS) Breadth First Search Is A Traversal Technique in Which We Traverse All The Nodes of The Graph in A Breadth-Wise Motion. in BFS, We Traverse
10 pages
CS502 Midterm Exam - Spring 2010
No ratings yet
CS502 Midterm Exam - Spring 2010
5 pages
TOC Assignment No-1
No ratings yet
TOC Assignment No-1
5 pages
Binomial Theorem
No ratings yet
Binomial Theorem
8 pages
AIME Mock Exam 4 PDF
No ratings yet
AIME Mock Exam 4 PDF
3 pages
Java Programming Exercises
No ratings yet
Java Programming Exercises
3 pages
An Introduction To Floating Point Arithmetic by Example: Pat Quillen
No ratings yet
An Introduction To Floating Point Arithmetic by Example: Pat Quillen
33 pages
FRM 2 Maths Scheme Term 1
No ratings yet
FRM 2 Maths Scheme Term 1
20 pages
HKIMO 2024 Secondary 3 and 4 (Algebra)
100% (1)
HKIMO 2024 Secondary 3 and 4 (Algebra)
4 pages
XI CS Practical List 2023-24
No ratings yet
XI CS Practical List 2023-24
17 pages
Finite Math 1 Introduction
No ratings yet
Finite Math 1 Introduction
14 pages
11 Activity 5
No ratings yet
11 Activity 5
3 pages
Optimizing Job-Machine Assignments
No ratings yet
Optimizing Job-Machine Assignments
2 pages
2019 MTAP-DepEd Saturday Program in Mathematics Grade 6 Session 1
No ratings yet
2019 MTAP-DepEd Saturday Program in Mathematics Grade 6 Session 1
14 pages
Input Source Encoder Channel Encoder Binary Interface
No ratings yet
Input Source Encoder Channel Encoder Binary Interface
29 pages
Synthetic Division and Remainder Theorem PDF
No ratings yet
Synthetic Division and Remainder Theorem PDF
4 pages
Fungsi Gamma Dan Tabel Nilainya
No ratings yet
Fungsi Gamma Dan Tabel Nilainya
10 pages
P2 Fractions
No ratings yet
P2 Fractions
12 pages
AMSP Algebra Summer Courses 2024
No ratings yet
AMSP Algebra Summer Courses 2024
4 pages
Discrete Mathematics Syllabus
No ratings yet
Discrete Mathematics Syllabus
2 pages
Turing Machine Notes
No ratings yet
Turing Machine Notes
11 pages
Lovász, L. Et Schrijver, A. - Cones of Matrices and Set-Functions and 0-1 Optimization (1991)
No ratings yet
Lovász, L. Et Schrijver, A. - Cones of Matrices and Set-Functions and 0-1 Optimization (1991)
25 pages
Recurring Decimal
No ratings yet
Recurring Decimal
11 pages
Grade 11 Term 4 Test - Memo Final 1
No ratings yet
Grade 11 Term 4 Test - Memo Final 1
8 pages
StudentGradeHistory 20BCI0184
No ratings yet
StudentGradeHistory 20BCI0184
3 pages
EX of RSA Algorithm
No ratings yet
EX of RSA Algorithm
10 pages
Chapter 3 Quadratic Functions
No ratings yet
Chapter 3 Quadratic Functions
5 pages