0% found this document useful (0 votes)

40 views19 pages

Fixed Point vs Floating Point Arithmetic

Fixed point arithmetic

Uploaded by

lukasnoller

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

40 views19 pages

Fixed Point vs Floating Point Arithmetic

Fixed point arithmetic

Uploaded by

lukasnoller

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

M7 Electronic System Level Design

Fixed Point Arithemtic

Carsten Gremzow
Fixed Point Arithmetic

Fixed Point versus Floating Point

Floating Point Arithmetic
§ After each arithmetic operation numbers are normalised
§ Used where precision and dynamic range are important
§ Most algorithms are developed in FP
§ Ease of coding
§ More Cost (Area, Speed, Power)
Fixed Point Arithmetic
§ Place of decimal is fixed
§ Simpler HW, low power, less silicon
§ Converting FP simulation to Fixed point simulation is time consuming
§ Multiplication doubles the number of bits: NxN multiplier produces 2N
bits
§ The code is less readable, need to worry about overflow and scaling
issues

M7 ESLD 2/19
Fixed Point Arithmetic
Floating Point

M7 ESLD 3/19
Fixed Point Arithmetic
Floating Point

M7 ESLD 4/19
Fixed Point Arithmetic

Floating Point
Add - typically 4 clocks:
compare, shift, add, normalize
Multiply - typically 8 clocks:
add, fixed point multiply, normalize, add
Divide - typically 20-40 clocks

M7 ESLD 5/19
Fixed Point Arithmetic
Typical System Level Design Flow

M7 ESLD 6/19
Fixed Point Arithmetic

Fixed Point versus Floating Point

Algorithms are developed in floating point format using tools like
Matlab
Floating point processors and HW are expensive
Fixed-point processors and HW are often used in embedded
systems
After algorithms are designed and tested then they are converted
into fixed- point implementation
The algorithms are ported on Fixed-point processor or application
specific hardware

M7 ESLD 7/19
Fixed Point Arithmetic

Qn.m Fixed Point Format

Qn.m format is a fixed positional number system for representing
fixed-point numbers
A Qn.m format N-bit binary number assumes n bits to the left and m
bits to the right of the binary point

M7 ESLD 8/19
Fixed Point Arithmetic

Qn.m Key Idea

in Qn.m format n entirely depends upon the range of the integer
m defines the precision of the fractional part

M7 ESLD 9/19
Fixed Point Arithmetic
Qn.m Positve Numbers
the MSB is the sign bit
for a positive fixed-point-number, MSB is ’0’:

b “ 0bn´2 . . . b1 b0 .b´1 b´2 . . . b´m

equivalent floating point value of the positive number is

b “ bn´2 2n´1 ` bn´2 2n´2 ` ¨ ¨ ¨ ` b1 21 ` b0 ` b´1 2´1 ` ¨ ¨ ¨ ` b´m 2´m

for negative numbers, MSB has neative weight and the equivalent
value is

b “ ´bn´1 2n´1 ` bn´2 2n´1 `¨ ¨ ¨` b1 21 ` b0 ` b´1 2´1 `¨ ¨ ¨` b´m 2´m

M7 ESLD 10/19
Fixed Point Arithmetic
Conversion to Qn.m
1. define total number of bits to reresent a Qn.m number
§ assume ten bits in the example
2. fix location of the decimal based on the value of the number
§ assume two bits for the integer part
§ the decimal point is implied

M7 ESLD 11/19
Fixed Point Arithmetic
Example

two bits for the integer and remaining eight bit keeps fractional part
a ten bit Q2.8 signed number covers -2 to +1.9922
increasing the fractional bits increases the precision

M7 ESLD 12/19
Fixed Point Arithmetic

Qn.m Range Determination

M7 ESLD 13/19
Fixed Point Arithmetic
The Software Side
using 16, 32 and 64 Bit Integer Types for fixed point arithmetic
§ Ñ short, int and long int in C
Converting a floating point number to fixed point:
§ Multiply the float by a power of 2 represented by a floating point
value, and cast the result to an integer:
fp_pi = (int)(3.141593f * 65536.0f); // 16 bits
fractional
§ After calculations, cast the result to int by discarding the fractional
bits. E.g.:
int result = fp_pi » 16; // divide by 65536
§ Or, get the original float back by casting to float and dividing by
2fractionalbits :
float result = (float)fp_pi / 65536.0f;
§ Note that this last option has significant overhead, which should be
outweighed by the gains.

M7 ESLD 14/19
Fixed Point Arithmetic

The Software Side

Addition and Subtraction
Adding two fixed point numbers is straightforward:
fp_a = ... ;
fp_b = ... ;
fp_sum = fp_a + fp_b;
Subtraction is done in the same way.
Note that this does require that fp_a and fp_b have the same
number of fractional bits. Also don’t mix signed and unsigned
carelessly.
fp_a = ... ; // 8:24
fp_b = ... ; // 16:16
fp_sum = (fp_a >> 8) + fp_b; // result is 16:16

M7 ESLD 15/19
Fixed Point Arithmetic

The Software Side

Multiplication
Multiplying fixed point numbers:
fp_a = ... ; // 10:22
fp_b = ... ; // 10:22
fp_sum = fp_a * fp_b; // 20:44
Situation 1: fp_sum is a 64 bit value.
§ Divide fp_sum by 222 to reduce it to 20:22 fixed point. (shift right by
22 bits)
Situation 2: fp_sum is a 32 bit value.
§ Ensure that intermediate results never exceed 32 bits.

M7 ESLD 16/19
Fixed Point Arithmetic

The Software Side

Division
Dividing fixed point numbers:
fp_a = ... ; // 10:22
fp_b = ... ; // 10:22
fp_sum = fp_a / fp_b; // 10:0
Situation 1: we can use a 64-bit intermediate value.
§ Multiply fp_a by 222 before the division (shift left by 22 bits)
Situation 2: we need to respect the 32-bit limit.

M7 ESLD 17/19
Fixed Point Arithmetic
The Hardware Side
as in software addition and subtraction operation remain the same
§ it’s the sofware’s task to perform operand conversion / scaling
§ in VHDL:
s_sum <= s_a + s_b; – beware of the carry bit..
multiplication is harder yet simpler at the same time
§ you will need the full resulting width of the multiplication operation
§ perform shifting and truncation of leading bits of the result in a
separate assignment
§ cannot be performed in a single statement
§ concurrent example in VHDL
signal s_a16, s_b16, s_prod16 : std_logic_vector(15 downto 0); -- 16 bit signed
signal s_prod32 : std_logic_vector(31 downto 0);

s_prod32 <= s_a16 * s_b16; -- generate 32 bit result

-- decimal point now between bit 15 and 16
s_prod16 <= s_prod32(23 downto 8); -- skip eight leading and trailing bits

M7 ESLD 18/19
Fixed Point Arithmetic

The Hardware Side - Pitfalls

truncating leading bits in multiplication result might jeopardize sign
information
fixed point multiplication is fast than floating point, but . . .
. . . check propagation delay of multiplication network
32x32 Bit Multiplication bound to break timing constraints with AXI
bus clock
revert to pipelined multiplication

M7 ESLD 19/19

SW Lab 3 Fixed Point Simulation EE 462
No ratings yet
SW Lab 3 Fixed Point Simulation EE 462
7 pages
Fixed Point Math Optimization Guide
No ratings yet
Fixed Point Math Optimization Guide
36 pages
13.a - Fixed Point Arithmetics
No ratings yet
13.a - Fixed Point Arithmetics
8 pages
Floating Point & Fixed Point Representation - BCA II
No ratings yet
Floating Point & Fixed Point Representation - BCA II
24 pages
Implementing Fixed-Point Math in C
No ratings yet
Implementing Fixed-Point Math in C
5 pages
C++ Fixed-Point Class for Embedded Systems
No ratings yet
C++ Fixed-Point Class for Embedded Systems
3 pages
Floating Point To Fixed
No ratings yet
Floating Point To Fixed
15 pages
Module 2
No ratings yet
Module 2
19 pages
Fixed Point
No ratings yet
Fixed Point
3 pages
Fixed Point Arithmatic
No ratings yet
Fixed Point Arithmatic
43 pages
Fixed vs Floating Point Representation
No ratings yet
Fixed vs Floating Point Representation
5 pages
Add04 Numbers
No ratings yet
Add04 Numbers
28 pages
DSP Arithmetic
No ratings yet
DSP Arithmetic
33 pages
Lecture 06 - MIPS Floating Point Arithmetic
No ratings yet
Lecture 06 - MIPS Floating Point Arithmetic
23 pages
Fixed-Point Algorithm Basics
No ratings yet
Fixed-Point Algorithm Basics
6 pages
Fixed Point Arm
No ratings yet
Fixed Point Arm
14 pages
Introduction To Fixed Point Math
No ratings yet
Introduction To Fixed Point Math
8 pages
Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic 33333
No ratings yet
Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic Floating-Point Arithmetic 33333
18 pages
White Paper One VHDL Maths 2008
No ratings yet
White Paper One VHDL Maths 2008
5 pages
IEEE Floating-Point Representation
No ratings yet
IEEE Floating-Point Representation
42 pages
ENSC254 - Floating Point Computation
No ratings yet
ENSC254 - Floating Point Computation
29 pages
Floating-Point to Fixed-Point in Audio
No ratings yet
Floating-Point to Fixed-Point in Audio
10 pages
DSP Finite Word Length Effects
No ratings yet
DSP Finite Word Length Effects
26 pages
01 DigitalNumericalFormats
No ratings yet
01 DigitalNumericalFormats
27 pages
Computer Arithmetic Basics
No ratings yet
Computer Arithmetic Basics
10 pages
Floatpoint Intro
No ratings yet
Floatpoint Intro
20 pages
DSP Arithmetic: Ece 450:digital Signal Processors and Applications Processors and Applications
No ratings yet
DSP Arithmetic: Ece 450:digital Signal Processors and Applications Processors and Applications
23 pages
ADSD Fall2011 09 Fixed Point Representation
No ratings yet
ADSD Fall2011 09 Fixed Point Representation
41 pages
Lecture 5 - Programming Issues
No ratings yet
Lecture 5 - Programming Issues
12 pages
Manage-Implementation of Floating - Bhagyashree Hardiya
No ratings yet
Manage-Implementation of Floating - Bhagyashree Hardiya
6 pages
Finite Word Length Effects in Digital Filter
No ratings yet
Finite Word Length Effects in Digital Filter
26 pages
An Fpga Based 64-Bit Ieee - 754 Double Precision Floating Point Adder/Subtractor and Multiplier Using VHDL
No ratings yet
An Fpga Based 64-Bit Ieee - 754 Double Precision Floating Point Adder/Subtractor and Multiplier Using VHDL
11 pages
Design of Single Precision Floating Point Multiplication Algorithm With Vector Support
No ratings yet
Design of Single Precision Floating Point Multiplication Algorithm With Vector Support
8 pages
Arithmetic & Logic Unit
No ratings yet
Arithmetic & Logic Unit
58 pages
Design and Implementation of Fast Floating Point Multiplier Unit
No ratings yet
Design and Implementation of Fast Floating Point Multiplier Unit
5 pages
Fixed-Point Conversion Guide
No ratings yet
Fixed-Point Conversion Guide
19 pages
Computer Architecture: Nguyễn Trí Thành
No ratings yet
Computer Architecture: Nguyễn Trí Thành
55 pages
An617 Fixed Point Mult PDF
No ratings yet
An617 Fixed Point Mult PDF
383 pages
Fixed Point Routines
No ratings yet
Fixed Point Routines
383 pages
Fixed Point Routines - 00617b
100% (1)
Fixed Point Routines - 00617b
383 pages
Shi Wal 95 A
No ratings yet
Shi Wal 95 A
8 pages
Fixed vs Floating Point Numbers
No ratings yet
Fixed vs Floating Point Numbers
20 pages
DSP Numeric Representation Techniques
No ratings yet
DSP Numeric Representation Techniques
27 pages
IQMath Fixed Vs Floating PDF
No ratings yet
IQMath Fixed Vs Floating PDF
30 pages
Math Library-TMS320F2812
No ratings yet
Math Library-TMS320F2812
30 pages
Computer Arithmetic Essentials
No ratings yet
Computer Arithmetic Essentials
39 pages
Computer Arithmetic: Part II: Integer Arithmetic & Floating Point
No ratings yet
Computer Arithmetic: Part II: Integer Arithmetic & Floating Point
30 pages
DSP Unit 4 2marks
No ratings yet
DSP Unit 4 2marks
3 pages
Finite Word Length Effects
No ratings yet
Finite Word Length Effects
31 pages
FPGA Floating Point Analysis
100% (1)
FPGA Floating Point Analysis
8 pages
COA - Unit 2 Data Representation 1
No ratings yet
COA - Unit 2 Data Representation 1
59 pages
Verilog Project Report
No ratings yet
Verilog Project Report
13 pages
Floating-Point Multiplication Unit With 16-Bit Significant and 8-Bit Exponent
No ratings yet
Floating-Point Multiplication Unit With 16-Bit Significant and 8-Bit Exponent
6 pages
ALU and Number Representation in Computing
No ratings yet
ALU and Number Representation in Computing
93 pages
Floating-Point Representation in Computing
No ratings yet
Floating-Point Representation in Computing
6 pages
William Stallings Computer Organization and Architecture
No ratings yet
William Stallings Computer Organization and Architecture
37 pages
Fixed vs Floating Point Arithmetic
100% (1)
Fixed vs Floating Point Arithmetic
56 pages
Oriental Mindoro Government Resolutions and Projects 2014
No ratings yet
Oriental Mindoro Government Resolutions and Projects 2014
288 pages
IIM Kozhikode: Final Placements Report 2018
No ratings yet
IIM Kozhikode: Final Placements Report 2018
8 pages
Demarcus Mckinstry
No ratings yet
Demarcus Mckinstry
2 pages
Timeline
No ratings yet
Timeline
1 page
Thunderbolt Kids Science Comic Books Grade 5
No ratings yet
Thunderbolt Kids Science Comic Books Grade 5
176 pages
Paperback 8.500x11.000 64 BW White en Us
No ratings yet
Paperback 8.500x11.000 64 BW White en Us
1 page
Faculty Civil Engineering 2020 Session 1 - Degree Ecm442 416
No ratings yet
Faculty Civil Engineering 2020 Session 1 - Degree Ecm442 416
7 pages
Stylistics Final
No ratings yet
Stylistics Final
17 pages
S-70I Variable Direct Operating Cost Project Status: Conklin and Dedecker (C&D) Update June 10, 2017
No ratings yet
S-70I Variable Direct Operating Cost Project Status: Conklin and Dedecker (C&D) Update June 10, 2017
15 pages
Mechanical Engg Exam Guide
No ratings yet
Mechanical Engg Exam Guide
1 page
Nonlinear Analysis of Stress-Strain of Reinforced
No ratings yet
Nonlinear Analysis of Stress-Strain of Reinforced
6 pages
How To Install Kubernetes Cluster On Ubuntu 24.04 LTS (Step-by-Step Guide)
No ratings yet
How To Install Kubernetes Cluster On Ubuntu 24.04 LTS (Step-by-Step Guide)
4 pages
Prosodic Words and Phonological Structure
No ratings yet
Prosodic Words and Phonological Structure
6 pages
Advanced Linear Algebra Problems
No ratings yet
Advanced Linear Algebra Problems
3 pages
DSP Project
No ratings yet
DSP Project
15 pages
Belbin's Team Roles
No ratings yet
Belbin's Team Roles
16 pages
Colloquial English Phrases
No ratings yet
Colloquial English Phrases
2 pages
U 4
No ratings yet
U 4
19 pages
Tybms Sem - Vi (Apr 2023)
No ratings yet
Tybms Sem - Vi (Apr 2023)
39 pages
Ottoman Tradition in Modern Bosnian and PDF
No ratings yet
Ottoman Tradition in Modern Bosnian and PDF
337 pages
Math Students' Project Report
No ratings yet
Math Students' Project Report
7 pages
Bob Heilig - Legacy Leadership - FB Groups Guide
No ratings yet
Bob Heilig - Legacy Leadership - FB Groups Guide
15 pages
Topic A: Module 2: Addition and Subtraction Relationships
No ratings yet
Topic A: Module 2: Addition and Subtraction Relationships
18 pages
English Worksheets For Playgroup-By Activity Wallet
No ratings yet
English Worksheets For Playgroup-By Activity Wallet
20 pages
Cat Hand Pallet Truck Replacement Parts A4
100% (2)
Cat Hand Pallet Truck Replacement Parts A4
3 pages
MBA Project: Suruchi Spices Survey
No ratings yet
MBA Project: Suruchi Spices Survey
72 pages
Intro:: Continuation of Lists of Honorable Guests
No ratings yet
Intro:: Continuation of Lists of Honorable Guests
3 pages
Types of Chemical Reaction
No ratings yet
Types of Chemical Reaction
2 pages
Borides
No ratings yet
Borides
4 pages
Load Sensing Steering Units TI BC152886483962en-001003 April2021
No ratings yet
Load Sensing Steering Units TI BC152886483962en-001003 April2021
90 pages

Fixed Point vs Floating Point Arithmetic

Uploaded by

Fixed Point vs Floating Point Arithmetic

Uploaded by

M7 Electronic System Level Design

Fixed Point Arithemtic

Fixed Point versus Floating Point

Fixed Point versus Floating Point

Qn.m Fixed Point Format

Qn.m Key Idea

b “ 0bn´2 . . . b1 b0 .b´1 b´2 . . . b´m

b “ bn´2 2n´1 ` bn´2 2n´2 ` ¨ ¨ ¨ ` b1 21 ` b0 ` b´1 2´1 ` ¨ ¨ ¨ ` b´m 2´m

b “ ´bn´1 2n´1 ` bn´2 2n´1 `¨ ¨ ¨` b1 21 ` b0 ` b´1 2´1 `¨ ¨ ¨` b´m 2´m

Qn.m Range Determination

The Software Side

The Software Side

The Software Side

s_prod32 <= s_a16 * s_b16; -- generate 32 bit result

The Hardware Side - Pitfalls

You might also like