Number Representations

Prabhu TL
2 Min Read
Disclosure: This website may contain affiliate links, which means I may earn a commission if you click on the link and make a purchase. I only recommend products or services that I personally use and believe will add value to my readers. Your support is appreciated!


Introduction

In this chapter we consider how numbers are represented on a computer largely with respect to the errors that occur when basic arithmetical operations are performed on them. We are most interested here in so-called rounding errors (also called roundoff errors). Floating-point computation is emphasized. This is due to the fact that most numerical computation is performed with floating-point numbers, especially when numerical methods are implemented in high-level programming languages such as C, Pascal, FORTRAN, and C++. However, an understanding of floating-point requires some understanding of fixed-point schemes first, and so this case will be considered initially. In addition, fixed-point schemes are used to represent integer data (i.e., subsets of Z), and so the fixed-point representation is important in its own right. For example, the exponent in a floating-point number is an integer.

Fixed-Point Representations

We now consider fixed-point fractions. We must do so because the mantissa in a floating-point number is a fixed-point fraction.

We assume that fractions are t + 1 digits long. If the number is in binary, then we usually say “t + 1 bits” long instead. Suppose, then, that x is a (t + 1)-bit fraction. We shall write it in the form

Share This Article
Prabhu TL is a SenseCentral contributor covering digital products, entrepreneurship, and scalable online business systems. He focuses on turning ideas into repeatable processes—validation, positioning, marketing, and execution. His writing is known for simple frameworks, clear checklists, and real-world examples. When he’s not writing, he’s usually building new digital assets and experimenting with growth channels.
Leave a review