2 - 03 Bits Ints Part2

Carnegie Mellon
Bits, Bytes, and Integers – Part 2

Computer Organization and Architecture
3rd Lecture
Instructors:
Xiaofei Nan
1
Carnegie Mellon
Today: Bits, Bytes, and Integers

◆ Representing information as bits
◆ Integers
▪ Representation: unsigned and signed
▪Bit-level manipulations
▪ Conversion, casting
▪ Expanding, truncating
▪ Addition, negation, multiplication, shifting
▪ Summary
◆ Representations in memory, pointers, strings
2
Carnegie Mellon
Encoding Integers
Unsigned Two’s Complement
Sign Bit
Two’s Complement Examples (w = 5)

-16 8 4 2 1
10 = 0 1 0 1 0 8+2 = 10
-16 8 4 2 1
-10 = 1 0 1 1 0 -16+4+2 = -10
6
Carnegie Mellon
Two-complement Encoding Example (Cont.)

x = 15213: 00111011 01101101
y = -15213: 11000100 10010011
7
Carnegie Mellon
Numeric Ranges
⬛Unsigned Values
⬛Two’s Complement Values
▪ UMin = 0 ▪TMin = –2w–1
000…0
100…0
▪ UMax = 2 – w
▪ TMax 2w–1 – 1
1
=
111…1
011…1
▪ Minus 1
111…1
Values for W = 16
8
填空题 4分
Unsigned integer conversions:

1. The unsigned binary representation of 11 is [ 填
空 1] .
2. The decimal value of unsigned binary number
10012 is [ 填空 2] .
Signed integer conversions (8-bit representation):

3. The signed (two-complement) binary
representation of -128 is [ 填空 3] .
4. The decimal value of signed binary 110000002 is
[ 填空 4] .
正常使用填空题需 3.0 以上版本雨课堂
submit
Carnegie Mellon
Values for Different Word Sizes
◆ C Programming
⬛ Observations
▪ |TMin | = TMax ▪ #include <limits.h>
+1 ▪ Declares constants, e.g.,
▪Asymmetric
▪ UMax = 2range
* TMax ▪ ULONG_MAX
+1 ▪ LONG_MAX
▪ LONG_MIN
▪ Values platform specific
9
Carnegie Mellon
Negation: Complement & Increment

⬛ Negate through complement and increase
~x + 1 == -x
⬛ Why this works

▪ Observation: ~x + x == 1111…111 == -1
x 1001110 1
+ ~x 0110001 0
-1 1111111 1
10
Carnegie Mellon
Complement & Increment Examples

x=0
x = TMin Canonical counter example
11
Carnegie Mellon

◆ Integers
▪ Summary
13
Carnegie Mellon
Boolean Algebra
◆ Developed by George Boole in 19th Century
▪Algebraic representation of logic
▪Encode “True” as 1 and “False” as 0
And Or
■A&B = 1 when both A=1 and B=1 ■A|B = 1 when either A=1 or B=1
Not Exclusive-Or (Xor)

■~A = 1 when A=0 ■A^B = 1 when either A=1 or B=1, but not both
14
Carnegie Mellon
General Boolean Algebras

◆ Operate on Bit Vectors
▪Operations applied bitwise
011 01 001 011 01 001 011 01 001
& | 0 1010 10 1 ^ 0 1010 10 1
~
010 101 01 011 11 101 001 11 100 010 101 01
◆ All of010
the00Properties
0
of Boolean Algebra Apply
01
101 010 10
15
Carnegie Mellon
Example: Representing & Manipulating Sets

◆ Representation
▪Width w bit vector represents subsets of {0, …, w-1}
▪aj = 1 if j ∈ A
▪ 01101001 { 0, 3, 5, 6 }
▪ 76543210
▪ 01010101 { 0, 2, 4, 6 }
▪ 76543210
◆ Operations
▪& 01000001 { 0, 6 }
▪ |Intersection
Union 01111101 { 0, 2, 3, 4, 5,
6}
▪ ^ Symmetric difference 00111100
▪ ~ Complement { 2, 3, 4, 5 }
10101010 { 1, 3, 5, 7 } 16
Carnegie Mellon
Bit-Level Operations in C al y
im ar
e x De c in
◆ Operations &, |, ~, ^ Available H
0 0
B
0 000
in C 1 1 0 001
2 2 0 010
▪Apply to any “integral” data type 3 3 0 011
4 4 0 100
▪long, int, short, char, unsigned 5 5 0 101
6 6 0 110
▪ View arguments as bit vectors 7 7 0 111
▪ Arguments applied bit-wise 8 8 1 000
9 9 1 001
◆ Examples (char data type) A 10 1 010
B 11 1 011
▪~0x41 → C 12 1 100
D 13 1 101
E 14 1 110
▪ ~0x00 → F 15 1 111
▪0x69 & 0x55 →
▪ 17
Carnegie Mellon
Bit-Level Operations in C al y
im ar
e x De c in
◆ Operations &, |, ~, ^ Available H
0 0
B
0 000
in C 1 1 0 001
2 2 0 010
▪Apply to any “integral” data type 3 3 0 011
4 4 0 100
▪long, int, short, char, unsigned 5 5 0 101
6 6 0 110
▪ View arguments as bit vectors 7 7 0 111
▪ Arguments applied bit-wise 8 8 1 000
9 9 1 001
◆ Examples (char data type) A 10 1 010
B 11 1 011
▪~0x41 → 0xBE C 12 1 100
D 13 1 101
▪~0100 00012 → 1011 11102 E 14 1 110
▪ ~0x00 → 0xFF F 15 1 111
▪~0000 00002 → 1111 11112
▪0x69 & 0x55 → 0x41

▪0110 10012 & 0101 01012 → 0100 00012
18
Carnegie Mellon
Contrast: Logic Operations in C

◆ Contrast to Bit-Level Operators
▪Logic Operations: &&, ||, !
▪View 0 as “False”
▪Anything nonzero as “True”
▪Always return 0 or 1
▪Early termination
◆Examples Watch out for && vs. & (and || vs. |)
▪!0x41 → common programmer mistake!
0x00
▪!0x00 →
▪ 0x69 && 0x55 →
0x01
0x01
▪p0x69
▪
▪ || 0x55(avoids
&& *p
!!0x41→ → null pointer
0x01
access)
0x01
19
Carnegie Mellon

◆ Integers
▪ Summary
20
Carnegie Mellon
Mapping Between Signed & Unsigned
Two’s Complement Unsigned

T2U
x T2B B2U ux
X
Maintain Same Bit Pattern
Unsigned U2T Two’s Complement

ux U2B B2T x
X
⬛ Mappings between unsigned and two’s complement numbers:

Keep bit representations and reinterpret
21
Carnegie Mellon
Mapping Signed ↔ Unsigned

Bits Signed Unsigned
0000 0 0
0001 1 1
0010 2 2
0011 3 3
0100 4 4
0101 5 5
0110 6 T2U 6
0111 7 U2T 7
1000 -8 8
1001 -7 9
1010 -6 10
1011 -5 11
1100 -4 12
1101 -3 13
1110 -2 14
1111 -1 15
22
Carnegie Mellon
Mapping Signed ↔ Unsigned

Bits Signed Unsigned
0000 0 0
0001 1 1
0010 2 2
0011 3 3
0100 4 = 4
0101 5 5
0110 6 6
0111 7 7
1000 -8 8
1001 -7 9
1010 -6 10
1011 -5 ±16 11
1100 -4 12
1101 -3 13
1110 -2 14
1111 -1 15
23
Carnegie Mellon
Relation between Signed & Unsigned
Two’s Complement Unsigned

T2U
x T2B B2U ux
X
w–1 0
ux + + + •• • +++
x - ++ •• • +++
Large negative weight

becomes
Large positive weight
24
Carnegie Mellon
Conversion Visualized
⬛ 2’s Comp. → Unsigned
UMax
▪ Ordering Inversion UMax – 1
▪ Negative → Big Positive
TMax + 1 Unsigned
TMax TMax Range
2’s Complement 0 0
Range –1
–2
TMin
25
Carnegie Mellon
Signed vs. Unsigned in C

◆ Constants
▪ By default are considered to be signed integers
▪ Unsigned if have “U” as suffix
0U, 4294967259U
◆ Casting
▪ Explicit casting between signed & unsigned same as U2T and T2U
int tx, ty;
unsigned ux, uy;
tx = (int) ux;
uy = (unsigned)
ty;
▪ Implicit
tx = conversion
ux; int fun(unsigned u);
occurs
uy via
= assignments uy = fun(tx);
and procedure calls!
ty;
26
Carnegie Mellon
Conversion Surprises
◆ Expression Evaluation
▪If there is a mix of unsigned and signed in single
expression,
signed values implicitly converted to unsigned
▪Including comparison operations <, >, ==, <=, >=
▪Examples for W = 32: TMIN = -2,147,483,648 , TMAX = 2,147,483,647
◆ Constant1 Relation Evaluation
Constant2 == unsigned
0 0U < signed
-1 0 > unsigned
2147483647 -2147483647-1 > signed
-1 0U unsigned
2147483647U -2147483647-1 <
-1 -2 > signed
(unsigned) -1 -2 > unsigned
2147483647 2147483648U < unsigned
2147483647 (int) 2147483648U > signed
27
单选题 1分
1. What is the comparison result between -1

and 1U?
A <
B >
C =
submit
单选题 1分
2. What is the comparison result between

(unsigned)-2 and -1?
A <
B >
C =
提交
Carnegie Mellon

◆ Integers
▪ Summary
29
Carnegie Mellon
Sign Extension
⬛ Task:
▪ Given w-bit signed integer x
▪ Convert it to w+k-bit integer with same value
⬛ Rule:
▪ Make k copies of sign bit:
▪ X ′ = xw–1 ,…, xw–1 , xw–1 , xw–2 ,…, x0
k copies of MSB w
X •• •
••
•
X •• • •• •
′ k w
ersp 30
Carnegie Mellon
Sign Extension: Simple Example
Positive number Negative number
-16 8 4 2 1 -16 8 4 2 1
10 = 0 1 0 1 0 -10 = 1 1 1 1 0
-32 16 8 4 2 1 -32 16 8 4 2 1
10 = 0 1 0 1 0 -10 = 1 1 0 1 0
0 1
31
Carnegie Mellon
Larger Sign Extension Example

short int x = 15213;
int = (int) x;
ix
short int y = -15213;
int iy = (int) y;
Decimal Hex Binary

x 15213 3B 6D 00111011 01101101
ix 15213 00 00 3B 6D 00000000 00000000 00111011 01101101
y -15213 C4 93 11000100 10010011
iy -15213 FF FF C4 93 11111111 11111111 11000100 10010011
⬛ Converting from smaller to larger integer data type
⬛ C automatically performs sign extension
32
Carnegie Mellon
Truncatio
n
⬛ Task:
▪ Given k+w-bit signed or unsigned integer X
▪ Convert it to w-bit integer X’ with same value for “small enough” X
⬛ Rule:
▪ Drop top k bits:
▪ X ′ = xw–1 , xw–2 ,…, x0
k w
X •• • •• •
•• •
X′ •• •
w 33
Carnegie Mellon
Truncation: Simple Example

No sign change Sign change
-16 8 4 2 1 -16 8 4 2 1
2 = 0 0 0 1 0 10 = 0 1 0 1 0
-8 4 2 1 -8 4 2 1
2 = 0 0 1 0 -6 = 1 0 1 0
2 mod 16 = 2 10 mod 16 = 10U mod 16 = 10U = -6
-16 8 4 2 1 -16 8 4 2 1
-6 = 1 1 0 1 0 -10 = 1 0 1 1 0
-8 4 2 1 -8 4 2 1
-6 = 1 0 1 0 6 = 0 1 1 0
-6 mod 16 = 26U mod 16 = 10U = -6 -10 mod 16 = 22U mod 16 = 6U = 6
34
Carnegie Mellon
Summary:
Expanding, Truncating: Basic Rules
◆ Expanding (e.g., short int to int)
▪ Unsigned: zeros added
▪ Signed: sign extension
▪ Both yield expected result
◆ Truncating (e.g., unsigned to unsigned short)

▪ Unsigned/signed: bits are truncated
▪ Result reinterpreted
▪ Unsigned: mod operation
▪Signed: similar to mod
▪For small numbers yields expected behavior
35
Carnegie Mellon

◆ Integers
▪ Summary
46
Carnegie Mellon
Shift Operations
◆ Left Shift: x << y Argument x 0 11 000 10
▪Shift bit-vector x left y positions

<< 3 0 0010 0 00
Log. > > 2 00 011 000

– Throw away extra bits on left
▪Fill with 0’s on right Arith. > > 2 00 011 000
◆ Right Shift: x >> y

▪ Shift bit-vector x right y positions Argument x 1 01 000 10
▪Throw away extra bits on right << 3 0 0010 0 00
▪ Logical shift
Log. > > 2 00 101 000
▪Fill with 0’s on left
▪ Arithmetic shift Arith. > > 2 11 101 000
▪Replicate most significant bit on left
◆ Undefined Behavior
▪ Shift amount < 0 or ≥ word size
47
填空题 3分
1. The result of 01100101 << 2 is [ 填空 1].

2. The result of 01100101 >> 2 (logical
shift) is [ 填空 2] .
3. The result of 11100101 >>2 (arithmetic
shift) is [ 填空 3] .
Submit
Carnegie Mellon
Unsigned Addition
u •• •
Operands: w bits
•• •
+ v
True Sum: w+1 bits u+v •• •
Discard Carry: w bits UAdd w (u,v) •• •
⬛ Standard Addition Function

▪ Ignores carry output
⬛ Implements
s Modular
= w UAdd Arithmetic
u + v mod 2w
(u , v) =
48
Carnegie Mellon
Visualizing (Mathematical) Integer Addition

Add4(u , v)
⬛Integer Addition
▪4-bit integers u, v
▪Compute true
sum Add4(u , v)
▪Values increase linearly
with u and v
▪Forms planar surface
v
u
49
Carnegie Mellon
Visualizing Unsigned Addition
◆ Wraps Around Overflow
▪ If true sum ≥ 2w
▪ At most once UAdd4(u , v)
True Sum
2w+1 Overflow
2w
v
0
Modular Sum
u
50
Carnegie Mellon
Two’s Complement Addition

Operands: w bits u •• •
+ v •• •
True Sum: w+1 bits •• •
u+ v
Discard Carry: w bits TAddw(u , v) •• •
◆ TAdd and UAdd have Identical Bit-Level Behavior

▪ Signed vs. unsigned addition in C:
int s, t, u, v;
s = (int) ((unsigned) u + (unsigned) v);
t = u + v
▪Will give s == t
51
Carnegie Mellon
Characterizing TAdd
Positive Overflow
⬛ Functionality TAdd(u , v)
▪ True sum requires w+1 bits >0
▪ Drop off MSB v
▪ Treat remaining bits as 2’s <0
comp. integer
< 0 u> 0
Negative Overflow
2w (NegOver)
2w (PosOver)
53
Carnegie Mellon
Visualizing 2’s Complement Addition

NegOver
◆ Values
▪ 4-bit two’s comp. TAdd4(u , v)
▪ Range from -8 to +7
◆ Wraps Around
▪ If sum ≥ 2w–1
▪Becomes negative
▪At most once
▪ If sum < –2w–1
▪Becomes positive
▪At most once v
PosOver
u
54
Carnegie Mellon
Unsigned Multiplication in C
u •• •
Operands: w bits
* v •• •
True Product: 2*w bits u· •• • •• •
v UMult w (u,v) •• •
Discard w bits: w bits
⬛ Standard Multiplication Function

▪ Ignores high order w bits
⬛ Implements Modular Arithmetic

UMultw(u , v) = u · v mod 2w
2 21 55
Carnegie Mellon
Signed Multiplication in C
u •• •
Operands: w bits
* v •• •
True Product: 2*w bits u· •• • •• •
v TMult w (u,v) •• •
Discard w bits: w bits
◆ Standard Multiplication Function

▪ Ignores high order w bits
▪ Some of which are different for signed
vs. unsigned multiplication
▪ Lower bits are the same
1 1 1 0 100 1
* 1101 0101
110 0001 1 0 1 0010
1 1 0 1 1 101 -35 56
Carnegie Mellon
Power-of-2 Multiply with Shift

◆ Operation
▪ u << k gives u
2k
▪*Both signed and unsigned k
•••
u
Operands: w bits k 0 ••• 0 1 0 ••• 0 0
* 2
True Product: w+k bits u · ••• 0 ••• 0 0
Discard k bits: w bits 2k UMult ( u ,2 ) •••

w
k 0 ••• 0 0
TMult w (u,2 k )
◆ Examples
▪u << 3 ==
▪ (u << 5) – (u << 3) == u * 24
▪u * 8
Most machines shift and add faster than multiply
▪Compiler generates this code automatically
57
Carnegie Mellon
Unsigned Power-of-2 Divide with Shift

⬛ Quotient of Unsigned by Power of 2
▪ u >> k gives ⎣ u / 2k ⎦
▪ Uses logical shift
u ••• k ••• Binary

Operands: Point0 ••• 0 1 0 ••• 0 0
Division: / 2k 00 ••• 0 0
••• .
u / 2k
00 ••• 0 0 •••
Result: ⎣
u/2k ⎦ •••
58
Carnegie Mellon
Signed Power-of-2 Divide with Shift

⬛ Quotient of Signed by Power of 2
▪ x >> k gives ⎣ x / 2k ⎦
▪ Uses arithmetic shift
▪ Rounds wrong direction when x < 0 k
x ••• ••• Binary Point
Operands:
0 ••• 0 1 0 ••• 0 0
/ 2k
Division: x/2k 0 •• •• . •
Result: RoundDown(x/2k) 0 •
••
•
••
•
•
• •
59
Carnegie Mellon
Arithmetic: Basic Rules

◆ Addition:
▪Unsigned/signed: Normal addition followed by truncate, same
operation on bit level
▪ Unsigned: addition mod 2w
▪Mathematical addition + possible subtraction of 2w
▪ Signed: modified addition mod 2w (result in proper
range)
▪Mathematical addition + possible addition or
subtraction of 2w
◆ Multiplication:
▪ Unsigned/signed: Normal multiplication followed by truncate,
same operation on bit level
▪ Unsigned: multiplication mod 2w
▪ Signed: modified multiplication mod 2w (result in proper
81
Carnegie Mellon
Multiplication
⬛ Goal: Computing Product of w-bit numbers x, y
▪ Either signed or unsigned
⬛ But, exact results can be bigger than w bits

▪ Unsigned: up to 2w bits
▪Result range: 0 ≤ x * y ≤ (2w – 1) 2 = 22w – 2w+1 + 1
▪ Two’s complement min (negative): Up to 2w-1 bits
▪Result range: x * y ≥ (–2w–1)*(2w–1–1) = –22w–2 + 2w–1
▪ Two’s complement
▪Result range: x max
* y ≤(positive):
(–2w–1) 2 =Up to 2w bits, but only for (TMinw)2
2 2w–2
⬛ So, maintaining exact results…

▪ would need to keep expanding word size with each product computed
▪ is done in software, if needed
▪e.g., by “arbitrary precision” arithmetic packages
82
Carnegie Mellon

◆ Integers
▪ Summary
60
Carnegie Mellon
Summary
Casting Signed ↔ Unsigned: Basic Rules
◆ Bit pattern is maintained
◆ But reinterpreted
◆ Can have unexpected effects: adding or subtracting 2w
◆ Expression containing signed and unsigned int

▪int is cast to unsigned!!
61
Carnegie Mellon
Why Should I Use Unsigned?

⬛ Don’t use without understanding implications
▪ Easy to make mistakes
unsigned i;
for (i = cnt-2; i >= 0; i--)
a[i] += a[i+1];
▪ Can be very subtle

#define DELTA sizeof(int)
int i;
for (i = CNT; i-DELTA >=
0; i-= DELTA)
. . .
62
Carnegie Mellon
Counting Down with Unsigned

⬛ Proper way to use unsigned as loop index
unsigned i;
for (i = cnt-2; i < cnt; i--)
a[i] += a[i+1];
⬛ See Robert Seacord, Secure Coding in C and C++

▪ C Standard guarantees that unsigned addition will behave like modular
arithmetic
▪0 – 1 → UMax
⬛ Even better
size_t i;
for (i = cnt-2; i < cnt; i--)
a[i] += a[i+1];
▪Data type size_t defined as unsigned value with length = word size 63
Carnegie Mellon
Why Should I Use Unsigned? (cont.)

⬛ Do Use When Performing Modular Arithmetic
▪ Multiprecision arithmetic
⬛ Do Use When Using Bits to Represent Sets

▪ Logical right shift, no sign extension
⬛ Do Use In System Programming

▪ Bit masks, device commands,…
64
Carnegie Mellon

◆ Bit-level manipulations
◆ Integers
▪Representation: unsigned and signed
▪Expanding, truncating
▪Addition, negation, multiplication, shifting
▪ Summary
66
Carnegie Mellon
Byte-Addressable Memory
◆ Programs refer to data by address

▪For now, imagine memory as a large array of bytes
▪ An address is like an index into that array
▪A pointer variable stores an address
⬛Note: system provides private address spaces to each “process”

▪A process is a running instance of a program
▪ So, a program can clobber its own data, but not that of others
67
Carnegie Mellon
Machine Words
◆ Any given computer has a “Word Size”
▪Nominal size of integer-valued data
▪and of addresses
▪ Until recently, most machines used 32 bits (4 bytes) as word size

▪Limits addresses to 4GB (232 bytes)
▪ Increasingly, machines have 64-bit word size

▪Potentially, could have 18 EB (exabytes) of addressable memory
▪That’s 18.4 X 1018
▪ Machines still support multiple data formats

▪Fractions or multiples of word size
▪Always integral number of bytes
68
Carnegie Mellon
Example Data Representations

C Data Type Typical 32-bit Typical 64-bit x86-64
char 1 1 1
short 2 2 2
int 4 4 4
long 4 8 8
float 4 4 4
double 8 8 8
pointer 4 8 8
69
Carnegie Mellon
Word-Oriented Memory Organization

32-bit 64-bit Bytes Addr.
◆ Addresses Specify Byte Words Words
Locations 0000
Add 0001
▪Address of first byte in word r
=
0000 0002
▪ Addresses of successive words differ ?? Add 0003
r
by 4 (32-bit) or 8 (64-bit) =
0000 0004
Add ?? 0005
r
=
0004 0006
?? 0007
0008
Add 0009
r
=
0008 0010
Add
?? r 0011
0008
= 0012
Add ?? 0013
r
=
0012
0014
?? 0015
70
Carnegie Mellon
Byte Ordering
◆ So, how are the bytes within a multi-byte word ordered in
memory?
◆ Conventions
▪ Big Endian: Sun, PPC Mac, Internet
▪Least significant byte has highest address
▪ Little Endian: x86, ARM processors running Android, iOS, and
Windows
▪Least significant byte has lowest address
71
Carnegie Mellon
Byte Ordering Example

◆ Example
▪ Variable x has 4-byte value of 0x01234567
▪ Address given by &x is 0x100
Big Endian 0 x 1 0 0 0 x 1 0 1 0x 102 0x 103

01 23 45 67
Little Endian 0 x 1 0 0 0 x 1 0 1 0x 102 0x 103

67 45 23 01
72
Carnegie Mellon
Decimal:
Decimal: 15213
15213
Representing Integers Binary:
Binary: 00001111 11001111 00111100 110 1
1101
Hex: 3 B
Hex: 3 B 6 6 D D
int A = 15213;
IA32, x86-64 long int C = 15213;
Increasing addresses
Sun6D 00 IA32 x86-64

6D 6D 00
3B 00 Sun
3B 3B 00
00 3B
00 00 3B
00 6D
00 00 6D
00
int B = -15213; 00
00
IA32, x86-64
00
Sun9 3 FF
C4 FF
FF C4
FF 93 Two’s complement
representation
85
填空题 8分
Given a 4-byte value of 0x76543210 and its

location starting from address 0x100, please
denote the contents in each address.
(1) Big Endian:
Address: 0x100 0x101 0x102 0x103
Content: [ 填空 1] [ 填空 2] [ 填空 3] [ 填空 4]
(2) Little Endian
Address: 0x100 0x101 0x102 0x103
Content: [ 填空 5] [ 填空 6] [ 填空 7] [ 填空 8]
Submit
Carnegie Mellon
Examining Data Representations

◆ Code to Print Byte Representation of Data
▪Casting pointer to unsigned char * allows use as a byte array
typedef
typedef unsigned
unsigned char
char *pointer;
*pointer;
void
void show_bytes(pointer
show_bytes(pointer start,
start, size_t
size_t len){
len){
size_t
size_t i;i;
for
for (i
(i == 0;
0; ii << len;
len; i++)
i++) printf(”%p\t0x
printf(”%p\t0x
%.2x\n", start+i, start[i]);
%.2x\n", start+i, start[i]);
printf("\n");
printf("\n");
}}
Printf directives:
Non-portable C code! %p: Print pointer
%x: Print Hexadecimal
73
Carnegie Mellon
show_bytesExecution Example
int
int aa == 15213;
15213; //
// 0x3b6d
0x3b6d
printf("int
printf("int aa == 15213;\n");
15213;\n");
show_bytes((pointer)
show_bytes((pointer) &a,
&a, sizeof(a));
sizeof(a));
Result (Linux x86-64):
int
int aa == 15213;
15213;
0x7fffb7f71dbc
0x7fffb7f71dbc 6d6d
0x7fffb7f71dbd 3b
0x7fffb7f71dbd
0x7fffb7f71dbe
3b
00
0x7fffb7f71dbe 00
0x7fffb7f71dbf
0x7fffb7f71dbf 0000
74
Carnegie Mellon
Representing Pointers
int
int BB == -15213;
-15213;
int
int *P
*P == &B;
&B;
Sun IA3 x86-64
2
EF AC 3C
FF 28 1B
FB F5 FE
2C FF 82
FD
7F
00
00
Different compilers & machines assign different locations to objects

Even get different results each time run program
75
Carnegie Mellon
Representing Strings
char
char S[6]
S[6] == "18213";
"18213";
◆ Strings in C
▪ Represented by array of characters
▪ Each character encoded in ASCII format IA3 Sun
2
▪Standard 7-bit encoding of character set 31 31
▪Character “0” has code 0x30 38 38
– Digit i has code 0x30+i 32 32
▪ String should be NUL-terminated 31 31
33 33
▪Final character = 0
◆ Compatibility 00 00
▪Byte ordering not an issue
76
Carnegie Mellon
Reading Byte-Reversed Listings

◆ Disassembly
▪Text representation of binary machine code
▪ Generated by program that reads the machine code
◆ Example Fragment
Address Instruction Code Assembly
8 0 4 8 3 6 5 : 5b pop %ebx
Rendition
8 0 4 8 3 6 6 : 8 1 c 3 a b 1 2 0 0 00 add $0x12ab,%ebx
8 0 4 8 3 6 c : 8 3 b b 2 8 0 0 0 0 0 0 00 cmpl $0x0,0x28(%ebx)
◆ Deciphering Numbers
▪ Value: 0x12ab
▪ Pad to 32 bits: 0x000012ab
▪ Split into bytes: 00 00 12 ab
▪ Reverse: ab 12 00 00
77
Carnegie Mellon
Summary
◆ Integers
▪ Summary
78
Carnegie Mellon
Integer C Puzzles
x < 0 ⇒ ((x*2) < 0)
ux >= 0
x & 7 == 7 (x<<30) < 0
⇒
ux
x >>y-1
⇒ -x < -
y x * x >= 0
x > 0 && y > 0 x + y > 0
Initialization
⇒ >= 0
x ⇒ -x <= 0
int x = foo(); x <= 0 ⇒ -x >= 0
int y = bar(); (x|-x)>>31 == -1
unsigned ux = x; ux >> 3 == ux/8
x >> 3 == x/8
unsigned uy =
x & (x-1) != 0
y;
79
Carnegie Mellon
Appendix
80
Carnegie Mellon
Correct Power-of-2 Divide

⬛ Quotient of Negative Number by Power of 2
▪ Want ⎡ x / 2k ⎤ (Round Toward 0)
▪ Compute as ⎣ (x+2k-1)/ 2k ⎦
▪In C: (x + (1<<k)-1) >> k
▪Biases dividend toward 0
Case 1: No rounding k
1 ••• 0 ••• 0 0
Dividend: u
0 ••• 0 0 1 ••• 1 1
+2k
–1 1 ••• 1 ••• 1 1 Binary Point
0 ••• 0 1 0 ••• 0 0
Divisor: / 2k
⎡ 10 ••• 1 1 1 ••• . 1 ••• 1 1
u/2 k ⎤
Biasing has no effect
83
Carnegie Mellon
Correct Power-of-2 Divide (Cont.)

Case 2: Rounding
k
1 ••• •••
Dividend: x
0 ••• 0 0 1 ••• 1 1
+2k
–1 1 ••• •••
Incremented by 1 Binary Point

0 ••• 0 1 0 ••• 0 0
Divisor: / 2k
⎡ 10 ••• 1 1 1 ••• . •••
x/2 k ⎤
Incremented by 1
Biasing adds 1 to final result
84

2 - 03 Bits Ints Part2

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

2 - 03 Bits Ints Part2

Uploaded by

Copyright:

Available Formats

Carnegie Mellon

Bits, Bytes, and Integers – Part 2

Today: Bits, Bytes, and Integers

Two’s Complement Examples (w = 5)

-10 = 1 0 1 1 0 -16+4+2 = -10

Two-complement Encoding Example (Cont.)

Unsigned integer conversions:

Signed integer conversions (8-bit representation):

正常使用填空题需 3.0 以上版本雨课堂

Values for Different Word Sizes

Negation: Complement & Increment

⬛ Why this works

Complement & Increment Examples

x = TMin Canonical counter example

Today: Bits, Bytes, and Integers

Not Exclusive-Or (Xor)

General Boolean Algebras

Example: Representing & Manipulating Sets

▪0x69 & 0x55 →

▪0x69 & 0x55 → 0x41

Contrast: Logic Operations in C

Today: Bits, Bytes, and Integers

Mapping Between Signed & Unsigned

Two’s Complement Unsigned

Unsigned U2T Two’s Complement

⬛ Mappings between unsigned and two’s complement numbers:

Mapping Signed ↔ Unsigned

Mapping Signed ↔ Unsigned

Relation between Signed & Unsigned

Two’s Complement Unsigned

Large negative weight

Signed vs. Unsigned in C

1. What is the comparison result between -1

2. What is the comparison result between

Today: Bits, Bytes, and Integers

Sign Extension: Simple Example

Positive number Negative number

Larger Sign Extension Example

Decimal Hex Binary

⬛ Converting from smaller to larger integer data type

⬛ C automatically performs sign extension

Truncation: Simple Example

◆ Truncating (e.g., unsigned to unsigned short)

Today: Bits, Bytes, and Integers

▪Shift bit-vector x left y positions

Log. > > 2 00 011 000

◆ Right Shift: x >> y

1. The result of 01100101 << 2 is [ 填空 1].

正常使用填空题需 3.0 以上版本雨课堂

⬛ Standard Addition Function

Visualizing (Mathematical) Integer Addition

Visualizing Unsigned Addition

◆ Wraps Around Overflow

Two’s Complement Addition

◆ TAdd and UAdd have Identical Bit-Level Behavior

Visualizing 2’s Complement Addition

⬛ Standard Multiplication Function

⬛ Implements Modular Arithmetic

◆ Standard Multiplication Function

Power-of-2 Multiply with Shift

Discard k bits: w bits 2k UMult ( u ,2 ) •••

Unsigned Power-of-2 Divide with Shift

u ••• k ••• Binary