0% found this document useful (0 votes)

93 views7 pages

Floating Point 6up

1) Floating point numbers represent fractions in computers using scientific notation, with a mantissa and exponent. The IEEE 754 standard defines common floating point representations. 2) Floating point numbers use a sign bit, exponent field, and mantissa field. The exponent is stored using bias to represent both positive and negative exponents. 3) The IEEE 754 standard defines single and double precision floating point number formats. It allows for consistent representation of floating point values across systems.

Uploaded by

edemkv

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

93 views7 pages

Floating Point 6up

Uploaded by

edemkv

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Outline

  Fractional numbers
  Floating point scientific notation
Floating Point Representation
  Floating point in binary
  IEEE Floating Point Standard
DCS111 Computer Architecture   Behaviour of Floating Point Numbers

Recap: fractions
  Decimal 5.6710 is
  5 x 100 plus
Fractional Numbers   6 x 10-1 plus
  7 x 10–2
… not whole numbers   Binary 11.0112 is
  1 x 21 plus
  1 x 20 plus
  0 x 2-1 plus Quiz: what is
  1 x 2–2 plus 11.0112 in decimal?
  1 x 2–3

Recap: fractions Recap: fractions

Quiz: what is a third as a Quiz: what is a third as a
decimal: N.NNNNN? decimal: N.NNNNN?

  Third is 0.33333…
  Not all numbers can be represented exactly
(with limited digits)

1
Problem Solution 1 – Fixed Point
  How to hold fractions in computers?   Divide bits between whole and fractional parts

0 0 1 1 1 1 0 1

integer bits fractional bits integer bits fractional bits

Point always Quiz: what is this in

in the same decimal?
place

Solution 1 – Fixed Point Evaluation of Fix Point

  Divide bits between whole and fractional parts   Range versus Accuracy
  High accuracy means low range
  High range means low accuracy
  Has uses

integer bits fractional bits

Quiz:
•  What is maximum number?
  Really just scaled integers
range
•  What is difference between   Software library for fixed point numbers
successive numbers? accuracy   No need for special hardware

Scientific (Exponent) Notation Scientific (Exponent) Notation

3.21 x 105 6.54 x 10-5 3.21 x 105 6.54 x 10-5

Mantissa   321,000 and 0.0000654

Exponent
5 -5
  Same accuracy
  Mantissa is a fraction
  Different magnitude   Exponent is an integer
  Both mantissa and exponent can be negative
Quiz: Write these number as decimal, without exponents

2
Normalisation
Advantage of Scientific Notation

}
  Large range   0.002 x 100
  Constant proportional accuracy (… with   0.2 x 10-2
exceptions)   2.0 x 10-3 all the same value
  20 x 10-4

  Normalised number has 1 digit before the point

Binary Floating Point

  1.01 x 22
  1.1 x 2-2
Floating Point in Binary
  Exponent: positive or negative
  Mantissa: positive or negative

Quiz:
•  Effect of negative mantissa?
•  Effect of negative exponent?

Normalised Binary FP Representation (32 bits)‫‏‬

  Sign bit S
  In normalised binary scientific notation
  Exponent E
  1.mmmm…mmm x 2E
  Mantissa M
  unless the number is 0
  1.mmm…mmm is the mantissa
  E is the exponent

exponent fraction (mantissa)‫‏‬

sign

First digit
always 1

3
Representation (32 bits)‫‏‬ Negative exponents - how?
  Sign bit S – 1 bit
  Aim: ALU (Arithmetic Logic Unit) can reuse
  Exponent E – 8 bits integer machinery
  Mantissa M – 23 bits BUT   Eg, comparison with zero: x > 0
  Easy because of sign bit
  Floating point numbers can be easily classified as
negative or positive
exponent fraction (mantissa)‫‏‬
sign
  Comparison of two floating point numbers x<y
not so straightforward...
  (-1)S x 1.M x 2E   choose exponent representation to help
First digit always 1, so
not included

Exponent in 2's Comp ?? Representation of Exponents

  Consider: 1/2 < 1   We want:
  half: 0.1 = 1.0 x 2-1 (normalised)‫‏‬   FP number order to follow (unsigned) bit order
  one: 1.0 = 1.0 x 20 (normalised)‫‏‬   11111111 to represent the highest positive exponent

0 11111111 000 …   Use biased representation

0 00000000 000 …

Bad Design

Bias by N (Excess N)‫‏‬ Bias by N (Excess N)‫‏‬

  Representation of negative numbers used in   Excess 7
floating point numbers
  Numbers in ‘correct’ order 0000 -7 1000 1
0001 -6 1001 2
0010 -5 1010 3
excess-N-rep(X) = unsigned-rep(X + N) 0011 -4 1011 4
0100 -3 1100 5
  Excess 7 0101 -2 1101 6
0110 -1 1110 7
excess-7-rep(-3) = unsigned-rep(-3 + 7)‫‏‬ 0111 0 1111 8
= 0100
excess-7-rep(-7) = 0000 E.g –2 is represented as unsigned(7-2)
excess-7-rep(4) = unsigned-rep(4 + 7)‫‏‬ = unsigned(5)‫‏‬
= 1011 = 0101

4
IEEE 754-1985
  What is IEEE?
  Standard important for
IEEE Standard   exchange of data
  portability of code

  Representation for FP numbers in

  32-bit (single precision)‫‏‬
  64-bit (double precision)‫‏‬

IEEE 32-bit FP IEEE 32-bit FP

  Sign bit S – 1 bit   Sign bit S – 1 bit
  Mantissa M – 23 bits   Mantissa M – 23 bits
  Exponent E – 8 bits
S E M
exponent fraction (mantissa)‫‏‬
sign
  Exponent E – 8 bits
  Bias is 127 (-1)S x (1.M) x 2E-127
  Exponents –126 (00000001) to +127 (11111110)‫‏‬
  Exponents 00000000 and 11111111 special

Example 1 – Convert to FP Example 2 – Convert from FP

  Represent 0.312510 = 5/16   What number is represented by:
  5/16 = 1/4 + 1/16 = 0.01012= 1.01*2-2
0 01111101 010000 ... 000
 S = 0
 S = 0
  E = -2 + bias = -2 + 127 = 12510=01111101
  E = 0111 1101 = 12510
  M = 010....000
  Real exponent = E-bias = 125-127 = -2
  M = 1/4
  (-1)S x (1+M) x 2E-bias
0 01111101 010000 ... 000 = (1 + 1/4) x (1/4)
= 5/16

5
Quiz IEEE FP Extra’s
  What are   Zero
  Both E and M = zero
0 10000001 111000 ... 000   Can be positive or negative

1 01111001 011000 ... 000   +/- Infinity (exponent all 1's)‫‏‬

  De-normalised numbers
  E=0
  Convert to 32 FP using IEEE
  close to zero, exponent is -126
  4.125
  -7.625

Overflow and Underflow

  Overflow
Behaviour of Floating Point   Results too large (positive or negative) to be
Numbers represented
  Underflow
  Result too close to zero (positive or negative) to be
represented

Range – 32 bit FP Range – 32 bit FP

negative zero positive negative zero positive

smallest smallest positive (>0) largest smallest smallest positive (>0) largest
largest negative largest negative

  Quiz: find the largest and smallest FP in IEEE   Largest/smallest +/- (2 – 223) x 2127 ≈ 1038
32-bit   Near zero (normalised numbers)‫‏‬
  +/- 1.0 x 2-126

6
How do they behave? Summary
  If x, y are positive is:   FP scientific notation
  x+y>x ?   Normalised representation in binary
  If x and y are different can:   Bias to represent -ve to +ve range in exponent
  x–y=0?   Notice how a 32-bit binary number can
  Do these rules hold: represent many different entities in memory
  (x + y) + z = x + (y + z) ?   Underflow as well as overflow
  (x * y) * z = x * (y * z) ?
  x * (y + z) = x*y + x*z ?

Different evaluation orders have different rounding errors

Solution - Manual For Numerical Analysis
64% (45)
Solution - Manual For Numerical Analysis
41 pages
Floating Point
No ratings yet
Floating Point
33 pages
L-5 Floating Point Representation of Numbers
No ratings yet
L-5 Floating Point Representation of Numbers
21 pages
Cacc
No ratings yet
Cacc
106 pages
CA Notes 01
No ratings yet
CA Notes 01
14 pages
Floating Point Numbers: CS031 September 12, 2011
No ratings yet
Floating Point Numbers: CS031 September 12, 2011
22 pages
COA - Unit2 Floating Point Arithmetic 3
No ratings yet
COA - Unit2 Floating Point Arithmetic 3
19 pages
3. Floating_Point_Number
No ratings yet
3. Floating_Point_Number
36 pages
Floating Point Arithmetic
100% (1)
Floating Point Arithmetic
30 pages
COA UNIT-III PPTs Dr.G.Bhaskar ECE
No ratings yet
COA UNIT-III PPTs Dr.G.Bhaskar ECE
64 pages
HW_4_sol
No ratings yet
HW_4_sol
10 pages
COMP0068 Lecture10 High Level Data Types
No ratings yet
COMP0068 Lecture10 High Level Data Types
25 pages
ML System Optimization Lecture 11 Quantization
No ratings yet
ML System Optimization Lecture 11 Quantization
150 pages
Computer Arithmetic Representations
No ratings yet
Computer Arithmetic Representations
24 pages
"The Course That Gives CMU Its Zip!": Topics
No ratings yet
"The Course That Gives CMU Its Zip!": Topics
30 pages
01 DigitalNumericalFormats
No ratings yet
01 DigitalNumericalFormats
27 pages
9-Algorithms For Floating Point Arithmetic Operations-22-01-2024
No ratings yet
9-Algorithms For Floating Point Arithmetic Operations-22-01-2024
49 pages
4-Floating-Point-inclass
No ratings yet
4-Floating-Point-inclass
33 pages
16-Algorithms For Floating Point Arithmetic Operations and Numericals-01-02-2024
No ratings yet
16-Algorithms For Floating Point Arithmetic Operations and Numericals-01-02-2024
21 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
27 pages
ENSC254 - Floating Point Computation
No ratings yet
ENSC254 - Floating Point Computation
29 pages
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
No ratings yet
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
32 pages
Floating Point & fixed point Representation_BCA II
No ratings yet
Floating Point & fixed point Representation_BCA II
24 pages
Lec07 - Computer Arithmetic - Floating-Point Representation and Arithmetic
No ratings yet
Lec07 - Computer Arithmetic - Floating-Point Representation and Arithmetic
42 pages
2.4 Floating Points
No ratings yet
2.4 Floating Points
36 pages
Unit 2
No ratings yet
Unit 2
16 pages
Mootz Contracts Fall09
No ratings yet
Mootz Contracts Fall09
15 pages
Reparation For Injuries Suffered in The Service of The United Nations
0% (1)
Reparation For Injuries Suffered in The Service of The United Nations
50 pages
Floating Point Numbers: CS101 Introduction To Computing
No ratings yet
Floating Point Numbers: CS101 Introduction To Computing
41 pages
CH03-Data-II(2) (2)
No ratings yet
CH03-Data-II(2) (2)
31 pages
Shona
No ratings yet
Shona
20 pages
Floating Point Representation of Numbers: Wide Range
No ratings yet
Floating Point Representation of Numbers: Wide Range
11 pages
IEEE Standard 754
No ratings yet
IEEE Standard 754
10 pages
Computer Organisation
No ratings yet
Computer Organisation
4 pages
EC-502 - Aritra Dutta
No ratings yet
EC-502 - Aritra Dutta
6 pages
REPUBLIC V SANDIGANBAYAN (Puno Concurring)
No ratings yet
REPUBLIC V SANDIGANBAYAN (Puno Concurring)
36 pages
Fixed and Floating Point Numbers: Dr. Ashish GUPTA Sense, Vit-Ap Ashish - Gupta@vitap - Ac.in
No ratings yet
Fixed and Floating Point Numbers: Dr. Ashish GUPTA Sense, Vit-Ap Ashish - Gupta@vitap - Ac.in
34 pages
Fixed & Floating Point
No ratings yet
Fixed & Floating Point
31 pages
IEEE Standard 754 Floating Point Numbers
No ratings yet
IEEE Standard 754 Floating Point Numbers
7 pages
Floating Point Representation: Reading: B&O 2.4
No ratings yet
Floating Point Representation: Reading: B&O 2.4
44 pages
3.1 Data Representation: 3.1.3 Real Numebrs and Normalized Floating-Point Representation
No ratings yet
3.1 Data Representation: 3.1.3 Real Numebrs and Normalized Floating-Point Representation
14 pages
The IEEE Standard For Floating Point Arithmetic
No ratings yet
The IEEE Standard For Floating Point Arithmetic
9 pages
Machine Level Representation of Data Part 3
100% (1)
Machine Level Representation of Data Part 3
32 pages
Houle 2016
No ratings yet
Houle 2016
29 pages
EE 109 Unit 20: IEEE 754 Floating Point Representation Floating Point Arithmetic
No ratings yet
EE 109 Unit 20: IEEE 754 Floating Point Representation Floating Point Arithmetic
31 pages
Module 2 - PART D Floating
No ratings yet
Module 2 - PART D Floating
30 pages
Lect4 Floats
No ratings yet
Lect4 Floats
64 pages
MPSC
No ratings yet
MPSC
56 pages
The Polynomial Toolbox For MATLAB
No ratings yet
The Polynomial Toolbox For MATLAB
56 pages
chapter3_3
No ratings yet
chapter3_3
13 pages
FIXED and FLOAT
No ratings yet
FIXED and FLOAT
8 pages
Data Representation
No ratings yet
Data Representation
28 pages
M. Tech. Semester - IX: Highway Materials (IBMCETE 903)
No ratings yet
M. Tech. Semester - IX: Highway Materials (IBMCETE 903)
20 pages
#3 - Floating Point
No ratings yet
#3 - Floating Point
38 pages
Fixed Point and Floating Point Number Representations
No ratings yet
Fixed Point and Floating Point Number Representations
5 pages
Floating Point Sept 6, 2006 15-213: "The Course That Gives CMU Its Zip!"
No ratings yet
Floating Point Sept 6, 2006 15-213: "The Course That Gives CMU Its Zip!"
34 pages
Floating Points
No ratings yet
Floating Points
31 pages
Room Service78 PDF
No ratings yet
Room Service78 PDF
3 pages
Medical English Exercises
100% (3)
Medical English Exercises
54 pages
Computer Arithmetic Representations
No ratings yet
Computer Arithmetic Representations
24 pages
KGN SONDU 10 2013 Tender For Design, Manufacture, Assembly, Test, Supply and Delivery of Generator Transformer For Sondu-Miriu Power Station
No ratings yet
KGN SONDU 10 2013 Tender For Design, Manufacture, Assembly, Test, Supply and Delivery of Generator Transformer For Sondu-Miriu Power Station
54 pages
Floating-Point Numbers and Operations Representation
No ratings yet
Floating-Point Numbers and Operations Representation
8 pages
4.4_1 New Floating Point.pptx
No ratings yet
4.4_1 New Floating Point.pptx
22 pages
John A. Keel - UFO Kidnappers (Saga - February 1967) (Files - Afu.se)
No ratings yet
John A. Keel - UFO Kidnappers (Saga - February 1967) (Files - Afu.se)
15 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
8 pages
Difference Between The Accounts of
No ratings yet
Difference Between The Accounts of
30 pages
Riftborne Demo GigaMimic V5-1
No ratings yet
Riftborne Demo GigaMimic V5-1
5 pages
ORAL COMMUNICATION IN CONTEXT 1st qrtr exam
No ratings yet
ORAL COMMUNICATION IN CONTEXT 1st qrtr exam
2 pages
Computer Architecture & Organization Unit 2
No ratings yet
Computer Architecture & Organization Unit 2
24 pages
Week 5: IEEE Floating Point Revision Guide For Phase Test
No ratings yet
Week 5: IEEE Floating Point Revision Guide For Phase Test
23 pages
Object-Oriented Design Objectives
No ratings yet
Object-Oriented Design Objectives
13 pages
Unit-4
No ratings yet
Unit-4
61 pages
Fixed Point and Floating Point Number Representations
No ratings yet
Fixed Point and Floating Point Number Representations
7 pages
What Are Floating Point Numbers?
No ratings yet
What Are Floating Point Numbers?
7 pages
The World Is Not Just Integers: Programming Languages Support Numbers With Fraction
No ratings yet
The World Is Not Just Integers: Programming Languages Support Numbers With Fraction
51 pages
"The Course That Gives CMU Its Zip!": Topics
No ratings yet
"The Course That Gives CMU Its Zip!": Topics
31 pages
'The Tempest' Revision Guide (Themes)
No ratings yet
'The Tempest' Revision Guide (Themes)
8 pages
Bahasa Inggris PTS Dan PAT Genap 2023-2024
No ratings yet
Bahasa Inggris PTS Dan PAT Genap 2023-2024
8 pages
2023091538
No ratings yet
2023091538
15 pages
English L - 4 & 6
No ratings yet
English L - 4 & 6
2 pages
monologue ideas
No ratings yet
monologue ideas
6 pages
ACR - Career Guidance - G6
100% (1)
ACR - Career Guidance - G6
5 pages
Finite Word Length Effects
No ratings yet
Finite Word Length Effects
31 pages
Vijay Tendulkar'S Kamala: Jibe at Value System of Yellow Journalism
No ratings yet
Vijay Tendulkar'S Kamala: Jibe at Value System of Yellow Journalism
6 pages
Birbal's Lesson in Humility
No ratings yet
Birbal's Lesson in Humility
2 pages
Lesson C: Questions With Be and Short Answers: The Students Are Young
No ratings yet
Lesson C: Questions With Be and Short Answers: The Students Are Young
1 page
Marketing Advantages and Disadvantages of E
No ratings yet
Marketing Advantages and Disadvantages of E
1 page
Animation NC Ii: Technical Vocational and Livelihood (TVL) Information and Communication Technologies
100% (2)
Animation NC Ii: Technical Vocational and Livelihood (TVL) Information and Communication Technologies
18 pages
DSM-5 Insanely Simplified
98% (43)
DSM-5 Insanely Simplified
140 pages
Fundamental Math
From Everand
Fundamental Math
Russell Pead
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Floating Point 6up

Uploaded by

Floating Point 6up

Uploaded by

Outline

Recap: fractions Recap: fractions

integer bits fractional bits integer bits fractional bits

Point always Quiz: what is this in

Solution 1 – Fixed Point Evaluation of Fix Point

integer bits fractional bits

Scientific (Exponent) Notation Scientific (Exponent) Notation

Mantissa   321,000 and 0.0000654

Normalised number has 1 digit before the point

Binary Floating Point

Normalised Binary FP Representation (32 bits)‫‏‬

exponent fraction (mantissa)‫‏‬

Exponent in 2's Comp ?? Representation of Exponents

0 11111111 000 …   Use biased representation

Bias by N (Excess N)‫‏‬ Bias by N (Excess N)‫‏‬

Representation for FP numbers in

IEEE 32-bit FP IEEE 32-bit FP

Example 1 – Convert to FP Example 2 – Convert from FP

1 01111001 011000 ... 000   +/- Infinity (exponent all 1's)‫‏‬

Overflow and Underflow

Range – 32 bit FP Range – 32 bit FP

Different evaluation orders have different rounding errors

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Floating Point 6up

Uploaded by

Floating Point 6up

Uploaded by

Outline

Recap: fractions Recap: fractions

integer bits fractional bits integer bits fractional bits

Point always Quiz: what is this in

Solution 1 – Fixed Point Evaluation of Fix Point

integer bits fractional bits

Scientific (Exponent) Notation Scientific (Exponent) Notation

Mantissa 321,000 and 0.0000654

Normalised number has 1 digit before the point

Binary Floating Point

Normalised Binary FP Representation (32 bits)‫‏‬

exponent fraction (mantissa)‫‏‬

Exponent in 2's Comp ?? Representation of Exponents

0 11111111 000 … Use biased representation

Bias by N (Excess N)‫‏‬ Bias by N (Excess N)‫‏‬

Representation for FP numbers in

IEEE 32-bit FP IEEE 32-bit FP

Example 1 – Convert to FP Example 2 – Convert from FP

1 01111001 011000 ... 000 +/- Infinity (exponent all 1's)‫‏‬

Overflow and Underflow

Range – 32 bit FP Range – 32 bit FP

Different evaluation orders have different rounding errors

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Mantissa   321,000 and 0.0000654

  Normalised number has 1 digit before the point

0 11111111 000 …   Use biased representation

  Representation for FP numbers in

1 01111001 011000 ... 000   +/- Infinity (exponent all 1's)‫‏‬