02 Numeral Systems and Data Storage

Session 2: Numeral Systems and
Data Storage
fit@hcmus
Content
£ Bits and their Storage
p Bits, Gates
£ Main Memory
£ Representing Information as Bit Patterns
p Text, number, images, sound
£ Binary System
£ Storing Integers
£ Storing Fractions
£ Mass Storage
Question
£ How do computers store data?
p Numbers, text, images, sound, video
£ How do computers map data to the real world?
Bits and their storage
£ Bits
£ Gates
Bits
£ Binary uses two digits: 0 and 1.

£ Bit (binary digit): the smallest unit of stored information.
£ Can be stored in a memory cell or a register.
£ A register holds 1 byte (8 bits), 1 word (16 bits), etc.
Bits
£ Why use the two digits 0 and 1 to encode data?
£ Electronic implementation
[Figure: a signal encoding the bits 0 1 0, with voltage levels marked at 0.0 V, 0.2 V, 0.9 V, and 1.1 V]
Source: Computer Systems: A Programmer’s Perspective, 3e
Bits
£ Easy to encode:
p Numeric value : 1 & 0
p Boolean value : true & false
p Voltage : high & low
p Punched card : punched & not punched
£ Data → encoded using the binary system for storage in computers
Bits – Boolean Operations
£ An operation that manipulates one or more
true/false values
p Bit 0 ~ False
p Bit 1 ~ True
£ Specific operations : AND, OR, XOR, NOT
£ Why Boolean operations?

p Computers are built from small components
p These components can perform Boolean operations very quickly
Bits – Boolean Operations
Source: Computer Science - An Overview, 12e
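The four operations named above can be tabulated with a short Python sketch. The function names (`bit_and`, `bit_or`, `bit_xor`, `bit_not`, `truth_table`) are illustrative choices, not part of the slides; bit 0 stands for false and bit 1 for true.

```python
from itertools import product


def bit_and(a: int, b: int) -> int:
    """1 only when both inputs are 1."""
    return a & b


def bit_or(a: int, b: int) -> int:
    """1 when at least one input is 1."""
    return a | b


def bit_xor(a: int, b: int) -> int:
    """1 when exactly one input is 1."""
    return a ^ b


def bit_not(a: int) -> int:
    """Flip a single bit."""
    return 1 - a


def truth_table() -> list:
    """Rows of (a, b, a AND b, a OR b, a XOR b) for every input pair."""
    return [
        (a, b, bit_and(a, b), bit_or(a, b), bit_xor(a, b))
        for a, b in product((0, 1), repeat=2)
    ]


print("a b AND OR XOR")
for row in truth_table():
    print(*row)
```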
Gates
£ A device that computes a Boolean operation
£ Often implemented as (small) electronic circuits:
p Including: resistors, transistors, capacitors, diodes, …
p 0 & 1 ~ voltage
Source: Wikipedia
A pictorial representation of gates
Source: Computer Science - An Overview, 12e
Example – Simple circuit
Source: Chun-Jen Tsai, ics12, National Chiao Tung University
Quiz
£ What are the names of these gates?

Quiz
£ What input bit patterns will cause the following circuit to produce an output of 1?
Hexadecimal Notation
£ Hexadecimal notation: a shorthand notation for long bit patterns
p Divides a pattern into groups of four bits each
p Represents each group by a single symbol
£ Example: 10100011 becomes A3
The hexadecimal coding system
Bit pattern    Hexadecimal representation
0000 0
0001 1
0010 2
0011 3
0100 4
0101 5
0110 6
0111 7
1000 8
1001 9
1010 A
1011 B
1100 C
1101 D
1110 E
1111 F
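The grouping rule can be sketched in Python as follows. The helper names `bits_to_hex` and `hex_to_bits` are hypothetical, and the sketch assumes the bit pattern length is a multiple of four.

```python
HEX_DIGITS = "0123456789ABCDEF"


def bits_to_hex(bits: str) -> str:
    """Split the pattern into 4-bit groups and map each group to one symbol."""
    if len(bits) % 4 != 0 or set(bits) - {"0", "1"}:
        raise ValueError("expected a bit string whose length is a multiple of 4")
    groups = [bits[i:i + 4] for i in range(0, len(bits), 4)]
    return "".join(HEX_DIGITS[int(group, 2)] for group in groups)


def hex_to_bits(hex_text: str) -> str:
    """Expand each hexadecimal symbol back into its 4-bit group."""
    return "".join(format(int(symbol, 16), "04b") for symbol in hex_text)


print(bits_to_hex("10100011"))  # the slide's example: A3
print(hex_to_bits("A3"))
```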
Quiz
£ What bit patterns are represented by the following hexadecimal patterns?
p 5FD97
p 610A
p ABCD
p 0100
MAIN MEMORY
Introduction
£ We know
p How machines encode information as strings of bits
p Basic storage devices
£ So
p To store data, a machine needs millions of circuits (each circuit stores 1 bit)
→ The place that holds these bits is called main memory
Introduction
£ In addition to flip-flops, machines have other storage devices (called external memory)
p Magnetic, optical, and flash devices
£ Storage devices
p Volatile memory
¡ Requires power to maintain the stored information
p Non-volatile memory
¡ Retains stored information even after being power cycled
Main memory cells
£ Cell: A unit of main memory (typically 8 bits
which is one byte)
p Most significant bit: the bit at the left (high-order) end
of the conceptual row of bits in a memory cell
p Least significant bit: the bit at the right (low-order)
end of the conceptual row of bits in a memory cell
£ Organization of a byte-size memory cell
p Size 8 bits (1 byte)
p Sequence of bits
Main memory address
£ Address: a “name” that uniquely identifies one cell in the computer’s main memory
p “Names” are actually numbers
p These numbers are assigned consecutively starting at zero
p Numbering the cells in this manner associates an order with the memory cells
Source: Computer Science - An Overview, 12e
Terminology
£ Random Access Memory (RAM)

p Memory in which individual cells can be
easily accessed in any order
£ Dynamic Memory (DRAM)

p RAM composed of volatile memory
Measuring memory capacity
Number of bits   Bit positions        Number of patterns
1 bit            0                    2^1 = 2
2 bits           1 0                  2^2
3 bits           2 1 0                2^3
n bits           n-1 … 5 4 3 2 1 0    2^n
Largest value in n bits: 0…000 → 1…111 = 2^n – 1
Measuring memory capacity
Unit       Symbol   Value
Byte       B        8 bits
KiloByte   KB       2^10 B = 1024 Bytes
MegaByte   MB       2^10 KB = 2^20 Bytes
GigaByte   GB       2^10 MB = 2^30 Bytes
TeraByte   TB       2^10 GB = 2^40 Bytes
PetaByte   PB       2^10 TB = 2^50 Bytes
ExaByte    EB       2^10 PB = 2^60 Bytes
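The two capacity slides reduce to powers of two, which a small Python sketch can reproduce. The names `pattern_count`, `largest_unsigned`, and `unit_in_bytes` are illustrative, and the units follow the binary convention used in the table (each unit is 2^10 times the previous one).

```python
UNITS = ["B", "KB", "MB", "GB", "TB", "PB", "EB"]


def pattern_count(n_bits: int) -> int:
    """Number of distinct bit patterns that n bits can hold."""
    return 2 ** n_bits


def largest_unsigned(n_bits: int) -> int:
    """Value of the all-ones pattern 1...111."""
    return 2 ** n_bits - 1


def unit_in_bytes(symbol: str) -> int:
    """Size of a unit in bytes; each step up the table multiplies by 2^10."""
    return 2 ** (10 * UNITS.index(symbol))


for n in (1, 2, 3, 8):
    print(n, "bits:", pattern_count(n), "patterns, largest value", largest_unsigned(n))
for symbol in UNITS:
    print(symbol, "=", unit_in_bytes(symbol), "bytes")
```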
REPRESENTING INFORMATION
AS BIT PATTERNS
Representing Text
£ Each character (letter, punctuation, etc.) is assigned a unique bit pattern
p ASCII: uses 7-bit patterns to represent most symbols used in written English text
p ISO developed a number of 8-bit extensions to ASCII, each designed to accommodate a major language group
p Unicode: assigns code points of up to 21 bits to the major symbols used in languages worldwide; code points are stored using encodings such as UTF-8, UTF-16, …
The message “Hello.” in ASCII
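As a minimal sketch of this encoding, the Python snippet below prints one byte per character, assuming each 7-bit ASCII code is stored in an 8-bit cell with a leading 0. The helper name `ascii_bits` is hypothetical.

```python
def ascii_bits(message: str) -> list:
    """Return the 8-bit pattern of each character's ASCII code."""
    return [format(code, "08b") for code in message.encode("ascii")]


for character, bits in zip("Hello.", ascii_bits("Hello.")):
    print(repr(character), ord(character), bits)
```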
Storage - Represent
£ Storage and processing: bits
£ Display/representation: characters
⇒ A mapping table is needed to translate between numeric values and characters.
£ ASCII and Unicode are such tables.
ASCII
£ American Standard Code for Information
Interchange.
£ First edition was published in 1963.
£ Based on the English alphabet (‘a’– ‘z’, ‘A’ – ‘Z’).
£ ASCII encodes 128 specified characters as seven-bit integers (digits 0 to 9, lowercase letters a to z, uppercase letters A to Z, punctuation symbols, and control characters).
ASCII – Characters
£ Printing characters
p ‘ ’ (blank): 32 (0x20)
p ‘0’ -> ‘9’: 48 (0x30) -> 57 (0x39)
p ‘A’ -> ‘Z’: 65 (0x41) -> 90 (0x5A)
p ‘a’ -> ‘z’: 97 (0x61) -> 122 (0x7A)
£ Non-printing/control characters
p null: 0
p ‘ ’ (tab): 9
p enter/ line feed: 10
p carriage return: 13
ASCII
£ Extended ASCII: 256 characters.
p The first 128 are the same as in the first edition.
p The additional 128 cover characters such as Greek letters (‘α’, ‘β’, ‘π’, …), currency symbols (‘£’, ‘¥’, …), …
£ ASCII cannot represent the characters of other languages such as Vietnamese, Russian, Japanese, Arabic, etc.
Unicode
£ Unicode is a computing industry standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems.
£ It contains 1,114,112 code points, divided into 17 planes, each with 65,536 (2^16) code points.
Unicode
£ Unicode can be stored in different ways, depending on the storage size used for each code point
p UTF-8: 1 to 4 bytes per code point.
p UTF-16: 2 or 4 bytes per code point.
p UTF-32: 4 bytes per code point.
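The storage sizes can be checked with Python's built-in codecs. This is a sketch: the helper name `encoded_sizes` and the sample characters are illustrative choices, and the little-endian codec variants are used only so that no byte-order mark is counted.

```python
def encoded_sizes(text: str) -> dict:
    """Bytes needed to store the text in each Unicode encoding form."""
    return {
        "UTF-8": len(text.encode("utf-8")),
        "UTF-16": len(text.encode("utf-16-le")),  # "-le": no byte-order mark
        "UTF-32": len(text.encode("utf-32-le")),
    }


samples = ["A", "\u1ebf", "\U0001F600"]  # Latin A, Vietnamese e with circumflex and acute, an emoji
for character in samples:
    print(f"U+{ord(character):04X}", encoded_sizes(character))
```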
Unicode and Vietnamese
£ The Unicode code points for Vietnamese are listed at:
£ http://vietunicode.sourceforge.net/charset/v3.htm
Unicode - Font
£ Each Unicode character can be displayed in different ways, depending on the font.
£ A font is implemented as a digital data file containing a set of graphically related glyphs, characters, or symbols.
£ Fonts that support Unicode (including Vietnamese):
p Times New Roman,
p Arial,
p Tahoma,
p …
Representing Numeric Values
£ Binary notation: uses bits to represent a

number in base two
£ Limitations of computer representations of
numeric values
p Overflow: occurs when a value is too big to
be represented
p Underflow: occurs when a value is too small
to be represented
p Truncation: occurs when a value cannot be
represented accurately
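The three limitations can be observed with floating-point values in Python. This is a sketch that assumes Python floats are 64-bit IEEE 754 doubles, which holds on mainstream platforms; the variable names are illustrative.

```python
import math

# Overflow: the product is too big for a 64-bit float and becomes infinity.
overflowed = 1e308 * 10

# Underflow: half of the smallest positive double is too small and becomes 0.
underflowed = 5e-324 / 2

# Truncation: 0.1 and 0.2 have no exact binary representation,
# so their sum is not exactly 0.3.
truncated = 0.1 + 0.2

print(overflowed, underflowed, truncated)
print(math.isinf(overflowed), underflowed == 0.0, truncated == 0.3)
```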
Representing Images
£ Bit map techniques

p Pixel: short for “picture element”
p Encoding
¡ RGB
¡ Luminance (brightness) and chrominance (color)
£ Vector techniques: images can be zoomed in or out without loss of quality
p Scalable
p TrueType and PostScript
Representing Sound
£ Sampling techniques: measure the amplitude of the sound wave at regular intervals and store the series of values
p Used for high quality recordings
p Records actual audio
£ Sampling rate (Hertz – Hz)
p Telephone: 8,000 samples/second – 8 kHz
p Music: 44,100 samples/second – 44.1 kHz
¡ High definition: 16 bits/sample
¡ Stereo: 32 bits/sample
Quiz
£ Suppose a stereo recording of one hour of music is encoded using a sample rate of 44,100 samples per second as discussed.
£ What is the size of the encoded version?
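One way to set up the calculation is sketched below in Python. It assumes the 32 bits/sample figure given for stereo on the previous slide and no compression; the function name `recording_size_bytes` is hypothetical.

```python
def recording_size_bytes(seconds: int, sample_rate: int, bits_per_sample: int) -> int:
    """Uncompressed size: samples per second x bits per sample x duration."""
    total_bits = seconds * sample_rate * bits_per_sample
    return total_bits // 8  # 8 bits per byte


size = recording_size_bytes(seconds=3600, sample_rate=44_100, bits_per_sample=32)
print(size, "bytes")
print(round(size / 2 ** 20, 1), "MB (using 1 MB = 2^20 bytes)")
```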
BINARY SYSTEM
Binary system
£ The traditional decimal system is based on powers of ten.
£ The binary system is based on powers of two.
£ The base 10 and the binary system
Decoding the binary representation 100101
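Decoding means adding up the power of two at every position that holds a 1. A minimal Python sketch, with the hypothetical helper `decode_binary`:

```python
def decode_binary(bits: str) -> int:
    """Sum bit x 2^position, counting positions from the right starting at 0."""
    value = 0
    for position, bit in enumerate(reversed(bits)):
        value += int(bit) * 2 ** position
    return value


# 100101 = 1x32 + 0x16 + 0x8 + 1x4 + 0x2 + 1x1
print(decode_binary("100101"))
```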
Base 10 to binary
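A common procedure for this conversion, sketched here as an assumption about what the slide illustrates, is to divide by two repeatedly and read the remainders from last to first. The function name `to_binary` is illustrative.

```python
def to_binary(n: int) -> str:
    """Convert a non-negative integer to binary by repeated division by 2."""
    if n < 0:
        raise ValueError("only non-negative integers are handled")
    if n == 0:
        return "0"
    remainders = []
    while n > 0:
        n, remainder = divmod(n, 2)  # quotient continues, remainder is the next bit
        remainders.append(str(remainder))
    return "".join(reversed(remainders))  # last remainder is the leftmost bit


print(to_binary(37))
print(to_binary(13))
```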
Binary addition facts
£ Addition table
+ 0 1
0 0 1
1 1 10
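The table is enough to add numbers of any length, column by column with a carry. A Python sketch, where `add_binary` is a hypothetical helper working on bit strings:

```python
def add_binary(a: str, b: str) -> str:
    """Add two bit strings from right to left, carrying as in decimal addition."""
    width = max(len(a), len(b))
    a, b = a.zfill(width), b.zfill(width)  # pad the shorter operand with 0s
    carry = 0
    result = []
    for bit_a, bit_b in zip(reversed(a), reversed(b)):
        total = int(bit_a) + int(bit_b) + carry
        result.append(str(total % 2))  # bit written in this column
        carry = total // 2             # bit carried to the next column
    if carry:
        result.append("1")
    return "".join(reversed(result))


print(add_binary("1", "1"))           # the table's last entry
print(add_binary("111010", "11011"))  # 58 + 27
```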
Quiz
£ Convert each of the following base 10 representations to its equivalent binary form:
p 32
p 15
p 27
p 53
STORING INTEGERS
Unsigned Integers
£ Represent quantities that are never negative
£ Ex: height, weight, ASCII code, etc.
£ All bits are used for the value
£ Max value of 1 unsigned byte:
p 1111 1111(2) = 2^8 – 1 = 255(10)
£ Max value of 1 unsigned word (2 bytes):
p 1111 1111 1111 1111(2) = 2^16 – 1 = 65,535(10)
Signed Integers
£ Store/represent both negative and positive integers
£ The leftmost bit is used to represent the sign
£ Ex:
p 0 positive: 0101 0011
p 1 negative: 1101 0011
£ Negative integers are stored in the machine using two’s complement or excess notation
Two’s complement notation systems
£ Leftmost bit: sign bit
£ One’s complement:
p Change all the 0s to 1s and all the 1s to 0s.
p Ex: 0110 and 1001 are complements
£ Two’s complement: add 1 to the one’s complement
p Ex: 1001 + 0001 = 1010
One’s and two’s complements
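5 (1 byte)                   0 0 0 0 0 1 0 1
One’s complement of 5        1 1 1 1 1 0 1 0
+ 1                                        1
Two’s complement of 5        1 1 1 1 1 0 1 1
+ 5                          0 0 0 0 0 1 0 1
Result                     1 0 0 0 0 0 0 0 0
(The carry out of the leftmost position does not fit in 8 bits and is discarded, so 5 + (-5) = 0.)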
Coding the value -6 in two’s complement notation using 4 bits
Two’s complement notation systems
Addition problems converted to two’s complement notation
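The invert-and-add-one rule and the reverse decoding can be sketched in Python as follows. The helper names `negate` and `from_twos_complement` are hypothetical, and patterns are handled as bit strings of fixed length.

```python
def negate(pattern: str) -> str:
    """Two's complement of a pattern: invert every bit, then add 1."""
    inverted = "".join("1" if bit == "0" else "0" for bit in pattern)
    # Keep only len(pattern) bits, so any carry out of the leftmost bit is dropped.
    total = (int(inverted, 2) + 1) % (2 ** len(pattern))
    return format(total, f"0{len(pattern)}b")


def from_twos_complement(pattern: str) -> int:
    """Decode a pattern: a leading 1 means the value is negative."""
    value = int(pattern, 2)
    if pattern[0] == "1":
        value -= 2 ** len(pattern)
    return value


print(negate("0110"))                # 6 -> -6 in 4 bits
print(negate("00000101"))            # 5 -> -5 in 8 bits
print(from_twos_complement("1010"))
```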
Quiz
£ Convert each of the following two’s complement representations (4-bit signed integers) to its equivalent base 10 form:
p 0011
p 1111
p 1100
p 1010
p 0000
p 1000
Quiz
£ Convert to two’s complement form using patterns of 8 bits:
p 6
p -6
p -17
p 13
p -1
p 0
Quiz
£ What are the largest and smallest numbers that can be stored if the machine uses bit patterns of the following lengths?
p A. four
p B. eight
p C. sixteen
Min and max of signed integers
N bits   Minimum                               Maximum
8        -2^7 = -128                           2^7 - 1 = +127
16       -2^15 = -32,768                       2^15 - 1 = +32,767
32       -2^31 = -2,147,483,648                2^31 - 1 = +2,147,483,647
64       -2^63 = -9,223,372,036,854,775,808    2^63 - 1 = +9,223,372,036,854,775,807
Min and max of unsigned integers
N bits   Minimum   Maximum
8        0         2^8 - 1 = 255
16       0         2^16 - 1 = 65,535
32       0         2^32 - 1 = 4,294,967,295
64       0         2^64 - 1 = 18,446,744,073,709,551,615
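Both tables follow from two formulas, sketched below in Python. The function names `signed_range` and `unsigned_range` are illustrative, and the signed range assumes two's complement.

```python
def signed_range(n_bits: int) -> tuple:
    """(minimum, maximum) of an n-bit two's complement integer."""
    return -(2 ** (n_bits - 1)), 2 ** (n_bits - 1) - 1


def unsigned_range(n_bits: int) -> tuple:
    """(minimum, maximum) of an n-bit unsigned integer."""
    return 0, 2 ** n_bits - 1


for n in (8, 16, 32, 64):
    print(n, "bits: signed", signed_range(n), "unsigned", unsigned_range(n))
```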
Excess (K-bias)
£ Choose K (a positive integer) so that operations on the biased numbers are the same as for unsigned integers, while both positive and negative values can be represented
£ Each value is stored as value + K
£ The n bits represent the biased value (n is the length of the bit pattern)
£ Excess representations are now primarily used for the exponent of floating-point numbers
Excess (K-bias)
£ Example: representing the value 2(10) in K-bias:
£ Value to represent: 2(10)
£ K = 5
→ Value stored in 5-bias: 2 + 5 = 7(10)
Original decimal   5-bias decimal   Binary
−5 0 0000
−4 1 0001
−3 2 0010
−2 3 0011
−1 4 0100
0 5 0101
1 6 0110
2 7 0111
3 8 1000
4 9 1001
5 10 1010
6 11 1011
7 12 1100
8 13 1101
9 14 1110
10 15 1111
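The table can be reproduced with a short Python sketch. The helper names `to_excess` and `from_excess` are hypothetical; the bias of 5 and the 4-bit width are taken from the example.

```python
def to_excess(value: int, bias: int, n_bits: int) -> str:
    """Store value + bias as an unsigned n-bit pattern."""
    stored = value + bias
    if not 0 <= stored < 2 ** n_bits:
        raise ValueError("value does not fit in the given number of bits")
    return format(stored, f"0{n_bits}b")


def from_excess(pattern: str, bias: int) -> int:
    """Read the pattern as unsigned, then subtract the bias."""
    return int(pattern, 2) - bias


for value in range(-5, 11):
    print(value, value + 5, to_excess(value, bias=5, n_bits=4))
```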
STORING FRACTIONS
Fractions
£ Floating-point numbers:
p Consist of a sign bit, a mantissa field, and an exponent field.
p Radix point: separates the integer part from the fractional part of a number
£ Mantissa: also called the fraction or significand
Floating points
£ Single precision floating point (32 bits)
p 1 bit (sign) + 8 bits (exponent) + 23 bits (mantissa)
p Value: from about 10^-37 to 10^38
p Precision: about 7 decimal digits
£ Double precision floating point (64 bits)
p Precision: about 15 decimal digits
Fractions
£ Single precision: 32 bits
s       exp       frac
1 bit   8 bits    23 bits
£ Double precision: 64 bits
s       exp       frac
1 bit   11 bits   52 bits
£ Extended precision: 80 bits (Intel only)
s       exp       frac
1 bit   15 bits   63 or 64 bits
IEEE-754 32-bit Single-Precision Floating-Point Numbers
N = (-1)^S x 1.F x 2^(E-127)
£ S: Sign bit
£ F: Fraction part
£ E: Exponent part
£ 127 = 2^(8-1) - 1 (excess notation, 8-bit exponent)
Example
0 10000000 110 0000 0000 0000 0000 0000
Sign bit S = 0 ⇒ positive number

E = 1000 0000B = 128D
Fraction is 1.11B (with an implicit leading
1) = 1 + 1×2^-1 + 1×2^-2 = 1.75D
The number is +1.75 × 2^(128-127) = +3.5D
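The same decoding can be sketched in Python and cross-checked against the standard `struct` module. The helper name `decode_single` is hypothetical, and the formula as written covers only normalized numbers (exponent fields of all 0s or all 1s have special meanings that are not handled here).

```python
import struct


def decode_single(bits: str) -> float:
    """Apply N = (-1)^S x 1.F x 2^(E-127) to a 32-bit pattern."""
    bits = bits.replace(" ", "")
    if len(bits) != 32:
        raise ValueError("expected 32 bits")
    sign = int(bits[0])
    exponent = int(bits[1:9], 2)
    fraction = int(bits[9:], 2) / 2 ** 23  # the 23 stored bits after "1."
    return (-1) ** sign * (1 + fraction) * 2.0 ** (exponent - 127)


pattern = "0 10000000 110 0000 0000 0000 0000 0000"
by_formula = decode_single(pattern)

# Cross-check: let struct interpret the same 4 bytes as a big-endian float.
raw = int(pattern.replace(" ", ""), 2).to_bytes(4, "big")
by_struct = struct.unpack(">f", raw)[0]

print(by_formula, by_struct)
```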

IEEE-754 64-bit Double-Precision Floating-Point Numbers
N = (-1)^S x 1.F x 2^(E-1023)
£ S: Sign bit
£ F: Fraction part
£ E: Exponent part
£ 1023 = 2^(11-1) - 1 (excess notation, 11-bit exponent)
Min and Max of Floating-Point Numbers
Precision   Min                 Max
Single      1.1754 x 10^-38     3.40282 x 10^38
Double      2.2250 x 10^-308    1.7976 x 10^308
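These limits can be derived from the field widths. The Python sketch below assumes that Min means the smallest positive normalized value, and that Python's `float` is an IEEE 754 double (true on mainstream platforms); the variable names are illustrative.

```python
import sys

# Smallest normalized value: fraction all 0s, smallest normal exponent.
single_min = 2.0 ** -126
double_min = 2.0 ** -1022

# Largest value: fraction all 1s (just under 2), largest normal exponent.
single_max = (2 - 2.0 ** -23) * 2.0 ** 127
double_max = (2 - 2.0 ** -52) * 2.0 ** 1023

print("single:", single_min, single_max)
print("double:", double_min, double_max)
print("sys.float_info:", sys.float_info.min, sys.float_info.max)
```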
MASS STORAGE SYSTEM
Mass Storage
£ On-line versus off-line

£ Typically larger than main memory
£ Typically less volatile than main memory
£ Typically slower than main memory
£ Typically lower cost than main memory
Mass Storage System
£ Magnetic
p Disk
p Tape
£ Optical
p CD
p DVD
£ Flash Technology
p Flash Drives
p Secure Digital (SD) Memory Card
£ RAID system
£ ...
Magnetic disk storage system
RAID system
£ RAID system: Redundant Array of Inexpensive Disks
p Combines multiple (physical) drives into a single (logical) disk system using hardware (a RAID controller) or software
p Data is distributed across the physical disks
p Transparent to the user
£ Purpose:
p Ensure data redundancy
p Improve performance
