Professional Documents
Culture Documents
EEE415-Week04-Machine Language
EEE415-Week04-Machine Language
Week 4
Design Principle 4
Good design demands good compromises
• Multiple instruction formats allow flexibility
- ADD, SUB: use 3 register operands
- LDR, STR: use 2 register operands and a constant
• Number of instruction formats kept small
- toadhere to design principles 1 and 3 (regularity supports
design simplicity and smaller is faster)
Machine Language
• Binary representation of instructions
• Computers only understand 1’s and 0’s
ARM v8 (Aarch64)
• So,the instruction set and rules are not universal for all architectures
(even other ARM architectures are different)
• Without
compiling, a binary code for ARMv7A will not run in ARMv8 or
ARMv7M computers. Dr. Sajid Muhaimin Choudhury 11
EEE 415 - Department of EEE, BUET Digital Design and Computer Architecture: ARM® Edition © 2015
11
Instruction Formats
• Data-processing
• Memory
• Branch
13
Cmd Format
instr(11:4)=0
means immediate
na hole
instr(11:4)=0 and EEE 415 - Department of EEE, BUET Dr. Sajid Muhaimin Choudhury 15
baki 3:0 te Digital Design and Computer Architecture: ARM® Edition © 2015
register denoted
15 notation dekhabe(R0-R15).
Cmd Format
17
Immediate Src2
• Immediate encoded as:
imm8: 8-bit unsigned immediate
rot: 4-bit rotation value
• 32-bit constant is: imm8 ROR (rot × 2)
19
Immediate Src2
• Immediate encoded as:
imm8: 8-bit unsigned immediate ROR by X = ROL by (32-X)
rot: 4-bit rotation value Ex: ROR by 30 = ROL by 2
• 32-bit constant is: imm8 ROR (rot × 2)
• Example: imm8 = abcdefgh
rot 32-bit constant
0000 0000 0000 0000 0000 0000 0000 abcd efgh
0001 gh00 0000 0000 0000 0000 0000 00ab cdef
… …
1111 0000 0000 0000 0000 0000 00ab cdef gh00
21
0xE2432EFF
Dr. Sajid Muhaimin Choudhury 22
EEE 415 - Department of EEE, BUET Digital Design and Computer Architecture: ARM® Edition © 2015
11
22
DRAFT 6/20/2023
23
0xE0865007
Dr. Sajid Muhaimin Choudhury 25
EEE 415 - Department of EEE, BUET Digital Design and Computer Architecture: ARM® Edition © 2015
25
27
29
Shift Type sh
LSL 002
LSR 012
ASR 102
ROR 112
31
33
Week 4
Week 4
35
Instruction Formats
• Data-processing
• Memory
• Branch
ARMv7-A
Memory Instruction Format
Encodes: LDR, STR, LDRB, STRB
• op = 012
• Rn = base register
• Rd = destination (load), source (store)
• Src2 = offset
• funct = 6 control bits
37
ARMv7-A
Offset Options
Recall: Address = Base Address + Offset
Example: LDR R1, [R2, #4]
Base Address = R2, Offset = 4
Address = (R2 + 4)
• Base address always in a register
• The offset can be:
an immediate
a register
or a scaled (shifted) register
ARMv7-A
Offset Examples
ARM Assembly Memory Address
LDR R0, [R3, #4] R3 + 4
LDR R0, [R5, #-16] R5 – 16
LDR R1, [R6, R7] R6 + R7
LDR R2, [R8, -R9] R8 – R9
LDR R3, [R10, R11, LSL #2] R10 + (R11 << 2)
but not register shifted register LDR R4, [R1, -R12, ASR #4] R1 – (R12 >>> 4)
LDR R0, [R9] R9
39
ARMv7-A
Memory Instruction Format
Encodes: LDR, STR, LDRB, STRB
• op = 012
• Rn = base register
• Rd = destination (load), source (store)
• Src2 = offset: register (optionally shifted) or immediate but not register shifted register
• funct = 6 control bits
ARMv7-A
Memory Instruction Format
41
ARMv7-A
Indexing Modes
Mode Address Base Reg. Update
Offset Base register ± Offset No change
Preindex Base register ± Offset Base register ± Offset
Postindex Base register Base register ± Offset
Examples
• Offset: LDR R1, [R2, #4] ; R1 = mem[R2+4]
• Preindex: LDR R3, [R5, #16]! ; R3 = mem[R5+16]
; R5 = R5 + 16
• Postindex: LDR R8, [R1], #8 ; R8 = mem[R1]
; R1 = R1 + 8
ARMv7-A
Memory Instruction Format
• funct:
I: Immediate bar
P: Preindex
U: Add
B: Byte
W: Writeback
L: Load
43
ARMv7-A
Memory Format funct Encodings
Type of Operation
L B Instruction
0 0 STR
0 1 STRB
1 0 LDR
1 1 LDRB
ARMv7-A
Memory Instructions
45
ARMv7-A
Memory Format funct Encodings
Type of Operation Indexing Mode
L B Instruction P W Indexing Mode
0 0 STR 0 1 Not supported
0 1 STRB 0 0 Postindex
1 0 LDR 1 0 Offset
1 1 LDRB 1 1 Preindex
ARMv7-A
Memory Format funct Encodings
Type of Operation Indexing Mode
L B Instruction P W Indexing Mode
0 0 STR 0 1 Not supported
0 1 STRB 0 0 Postindex
1 0 LDR 1 0 Offset
1 1 LDRB 1 1 Preindex
47
ARMv7-A
Memory Instruction Format
Encodes: LDR, STR, LDRB, STRB
• op = 012
• Rn = base register
• Rd = destination (load), source (store)
• Src2 = offset: immediate or register (optionally shifted)
• funct = I (immediate bar), P (preindex), U (add),
B (byte), W (writeback), L (load)
ARMv7-A
Memory Instr. with Immediate Src2
STR R11, [R5], #-26
• Operation: mem[R5] <= R11; R5 = R5 - 26
• cond = 11102 (14) for unconditional execution
• op = 012 (1) for memory instruction
• funct = 00000002 (0)
I = 0 (immediate offset), P = 0 (postindex),
U = 0 (subtract), B = 0 (store word), W = 0 (postindex),
L = 0 (store)
• Rd = 11, Rn = 5, imm12 = 26
49
ARMv7-A
Memory Instr. with Immediate Src2
STR R11, [R5], #-26
• Operation: mem[R5] <= R11; R5 = R5 - 26
• cond = 11102 (14) for unconditional execution
• op = 012 (1) for memory instruction
• funct = 00000002 (0)
I = 0 (immediate offset), P = 0 (postindex),
U = 0 (subtract), B = 0 (store word), W = 0 (postindex),
L = 0 (store)
• Rd = 11, Rn = 5, imm12 = 26
ARMv7-A
Memory Instr. with Register Src2
LDR R3, [R4, R5]
• Operation: R3 <= mem[R4 + R5]
• cond = 11102 (14) for unconditional execution
• op = 012 (1) for memory instruction
• funct = 1110012 (57)
I = 1 (register offset), P = 1 (offset indexing),
U = 1 (add), B = 0 (load word), W = 0 (offset indexing),
L = 1 (load)
• Rd = 3, Rn = 4, Rm = 5 (shamt5 = 0, sh = 0)
1110 01 111001 0100 0011 00000 00 0 0101 = 0xE7943005
51
ARMv7-A
Memory Instr. with Scaled Reg. Src2
STR R9, [R1, R3, LSL #2]
• Operation: mem[R1 + (R3 << 2)] <= R9
• cond = 11102 (14) for unconditional execution
• op = 012 (1) for memory instruction
• funct = 1110002 (0)
I = 1 (register offset), P = 1 (offset indexing),
U = 1 (add), B = 0 (store word), W = 0 (offset indexing),
L = 0 (store)
• Rd = 9, Rn = 1, Rm = 3, shamt = 2, sh = 002 (LSL)
1110 01 111000 0001 1001 00010 00 0 0011 = 0xE7819103
ARMv7-A
Review: Memory Instruction Format
Encodes: LDR, STR, LDRB, STRB
• op = 012
• Rn = base register
• Rd = destination (load), source (store)
• Src2 = offset: register (optionally shifted) or immediate
• funct = I (immediate bar), P (preindex), U (add),
B (byte), W (writeback), L (load)
53
Week 4
Instruction Formats
• Data-processing
• Memory
• Branch
55
ARMv7-A
Branch Instruction Format
Encodes B and BL
• op = 102
• imm24: 24-bit immediate
• funct = 1L2: L = 1 for BL, L = 0 for B
ARMv7-A
Encoding Branch Target Address
• Branch Target Address (BTA): Next PC when
branch taken
• BTA is relative to current PC + 8 that means 2 instruction pore, ejonno 2 add kore
dite hoy
• imm24 encodes BTA
• imm24 = # of words BTA is away from PC+8
57
ARMv7-A
Branch Instruction: Example 1
ARM assembly code
0xA0 BLT THERE PC • PC = 0xA0
0xA4 ADD R0, R1, R2 • PC + 8 = 0xA8
0xA8 SUB R0, R0, R9 PC+8 • THERE label is 3
0xAC ADD SP, SP, #8
0xB0 MOV PC, LR instructions past
0xB4 THERE SUB R0, R0, #1 BTA PC+8
0xB8 BL TEST • So, imm24 = 3
ARMv7-A
Branch Instruction: Example 1
ARM assembly code
0xA0 BLT THERE PC • PC = 0xA0
0xA4 ADD R0, R1, R2 • PC + 8 = 0xA8
0xA8 SUB R0, R0, R9 PC+8 • THERE label is 3
0xAC ADD SP, SP, #8
0xB0 MOV PC, LR instructions past
0xB4 THERE SUB R0, R0, #1 BTA PC+8
0xB8 BL TEST • So, imm24 = 3
59
ARMv7-A
Branch Instruction: Example 1
ARM assembly code
0xA0 BLT THERE PC • PC = 0xA0
0xA4 ADD R0, R1, R2 • PC + 8 = 0xA8
0xA8 SUB R0, R0, R9 PC+8 • THERE label is 3
0xAC ADD SP, SP, #8
0xB0 MOV PC, LR instructions past
0xB4 THERE SUB R0, R0, #1 BTA PC+8
0xB8 BL TEST • So, imm24 = 3
0xBA000003
ARMv7-A
Branch Instruction: Example 2
ARM assembly code
0x8040 TEST LDRB R5, [R0, R3] BTA
0x8044 STRB R5, [R1, R3] • PC = 0x8050
0x8048 ADD R3, R3, #1 • PC + 8 = 0x8058
0x8044 MOV PC, LR • TEST label is 6
0x8050 BL TEST PC instructions before
0x8054 LDR R3, [R1], #4
0x8058 SUB R4, R3, #9 PC+8 PC+8
• So, imm24 = -6
61
ARMv7-A
Branch Instruction: Example 2
ARM assembly code
0x8040 TEST LDRB R5, [R0, R3] BTA
0x8044 STRB R5, [R1, R3] • PC = 0x8050
0x8048 ADD R3, R3, #1 • PC + 8 = 0x8058
0x8044 MOV PC, LR • TEST label is 6
0x8050 BL TEST PC instructions before
0x8054 LDR R3, [R1], #4
0x8058 SUB R4, R3, #9 PC+8 PC+8
• So, imm24 = -6
ARMv7-A
Branch Instruction: Example 2
ARM assembly code
0x8040 TEST LDRB R5, [R0, R3] BTA
0x8044 STRB R5, [R1, R3] • PC = 0x8050
0x8048 ADD R3, R3, #1 • PC + 8 = 0x8058
0x8044 MOV PC, LR • TEST label is 6
0x8050 BL TEST PC instructions before
0x8054 LDR R3, [R1], #4
0x8058 SUB R4, R3, #9 PC+8 PC+8
• So, imm24 = -6
0xEBFFFFFA
63
ARMv7-A
Review: Instruction Formats
ARMv7-A
Conditional Execution
Encode in cond bits of machine instruction
For example,
ANDEQ R1, R2, R3 (cond = 0000)
ORRMI R4, R5, #0xF (cond = 0100)
SUBLT R9, R3, R8 (cond = 1011)
65
ARMv7-A
Review: Condition Mnemonics
cond Mnemonic Name CondEx
0000 EQ Equal 𝑍
0001 NE Not equal 𝑍̅
0010 CS / HS Carry set / Unsigned higher or same 𝐶
0011 CC / LO Carry clear / Unsigned lower 𝐶̅
0100 MI Minus / Negative 𝑁
0101 PL Plus / Positive of zero 𝑁
0110 VS Overflow / Overflow set 𝑉
0111 VC No overflow / Overflow clear 𝑉
1000 HI Unsigned higher 𝑍̅𝐶
1001 LS Unsigned lower or same 𝑍 𝑂𝑅 𝐶̅
1010 GE Signed greater than or equal 𝑁⊕𝑉
1011 LT Signed less than 𝑁⊕𝑉
1100 GT Signed greater than 𝑍̅(𝑁 ⊕ 𝑉)
1101 LE Signed less than or equal 𝑍 𝑂𝑅 (𝑁 ⊕ 𝑉)
1110 AL (or none) Always / unconditional ignored
Dr. Sajid Muhaimin Choudhury 66
EEE 415 - Department of EEE, BUET Digital Design and Computer Architecture: ARM® Edition © 2015
33
66
DRAFT 6/20/2023
ARMv7-A
Conditional Execution: Machine Code
Assembly Code Field Values
31:28 27:26 25 24:21 20 19:16 15:12 11:7 6:5 4 3:0
Machine Code
31:28 27:26 25 24:21 20 19:16 15:12 11:7 6:5 4 3:0
67
ARMv7-A
Interpreting Machine Code
• Start with op: tells how to parse rest
op = 00 (Data-processing)
op = 01 (Memory)
op = 10 (Branch)
• I-bit: tells how to parse Src2
• Data-processing instructions:
If I-bit is 0, bit 4 determines if Src2 is a register (bit 4 =
0) or a register-shifted register (bit 4 = 1)
• Memory instructions:
Examine funct bits for indexing mode, instruction, and
add or subtract offset
ARMv7-A
Interpreting Machine Code: Example 1
0xE0475001
• Start with op: 002, so data-processing instruction
• I-bit: 0, so Src2 is a register
• bit 4: 0, so Src2 is a register (optionally shifted by shamt5)
69
ARMv7-A
Interpreting Machine Code: Example 1
0xE0475001
• Start with op: 002, so data-processing instruction
• I-bit: 0, so Src2 is a register
• bit 4: 0, so Src2 is a register (optionally shifted by shamt5)
• cmd: 00102 (2), so SUB
• Rn=7, Rd=5, Rm=1, shamt5 = 0, sh = 0
• So, instruction is: SUB R5,R7,R1
71
Week 3
Addressing Modes
Addressing Mode define how the machine language instructions in that
architecture identify the operand(s) of each instruction. An addressing mode
specifies how to calculate the effective memory address of an operand by using
information held in registers and/or constants contained within a machine
instruction or elsewhere.
73
Addressing Modes
How do we address operands?
• Register Only
• Immediate
• Base
• PC-Relative
Register Addressing
• Source and destination operands found in registers
• Used by data-processing instructions
• Three submodes:
• Register-only
• Immediate-shifted register
• Register-shifted register
75
Addressing Modes
How do we address operands?
• Register Only
• Immediate
• Base
• PC-Relative
77
Immediate Addressing
• Operands found in registers and immediates
Example: ADD R9, R1, #14
• Uses data-processing format with I=1
• Immediate is encoded as
– 8-bit immediate (imm8)
– 4-bit rotation (rot)
• 32-bit immediate = imm8 ROR (rot x 2)
Addressing Modes
How do we address operands?
• Register Only
• Immediate
• Base
• PC-Relative
79
Base Addressing
• Address of operand is:
base register + offset
• Offset can be a:
– 12-bit Immediate
– Register
– Immediate-shifted Register
81
Addressing Modes
How do we address operands?
• Register Only
• Immediate
• Base
• PC-Relative
PC-Relative Addressing
• Used for branches
• Branch instruction format:
– Operands are PC and a signed 24-bit immediate (imm24)
– Changes the PC
– New PC is relative to the old PC
– imm24 indicates the number of words away from PC+8
• PC = (PC+8) + (SignExtended(imm24) x 4)
83
Stored Program
Address Instructions Program Counter
(PC): keeps track of
current instruction
0000000C E5 9 1 3 0 0 0
00000008 E2 8 1 3 0 0 2
00000004 E3 A 0 2 0 4 5
00000000 E3 A 0 1 0 6 4 PC
Main Memory
85
87
NOP
• NOP is a mnemonic for “no operation” and is pronounced “no op.” It is a
pseudoinstruction that does nothing. The assembler translates it to MOV R0, R0
(0xE1A00000). NOPs are useful to, among other things, achieve some delay
or align instructions.
Exceptions
Lecture 4.3
Week 4
Dr. Sajid Muhaimin Choudhury, Assistant Professor
Department of Electrical and Electronics Engineering
Bangladesh University of Engineering and Technology
89
Exceptions
• An exception is like an unscheduled function call that branches to a new
address.
Exception Handling
• Like any other function call, an exception must save the return address, jump to
some address, do its work, clean up after itself, and return to the program where
it left off. Exceptions use a vector table to determine where to jump to the
exception handler and use banked registers to maintain extra copies of key
registers so that they will not corrupt the registers in the active program.
91
Privilege levels are important so that buggy or malicious user code cannot corrupt other programs or crash or infect
the system.
93
Banked Registers
• Before an exception changes the PC, it must save the return address in the LR
so that the exception handler knows where to return. However, it must take care
not to disturb the value already in the LR, which the program will need later.
Therefore, the processor maintains a bank of registers to use as separate LR
during each of the execution modes.
• Bank of Link Registers
• Bank of Saved Program Status Registers because CPSR of main program will be loaded in SPSR of
• Banked copy of Stack Pointer different addressing modes
Exception Handling
At the start of Exception Handling…
1. Stores the CPSR into the banked SPSR
2. Sets the execution mode and privilege level based on the type of exception
3. Sets interrupt mask bits in the CPSR so that the exception handler will not be
interrupted
4. Stores the return address into the banked LR
5. Branches to the exception vector table based on the type of exception
95
• The processor then executes the instruction in the exception vector table,
typically a branch to the exception handler.
• The handler usually pushes other registers onto its stack, takes care of the
exception, and pops the registers back off the stack.
• The exception handler returns using the MOVS PC, LR
Exception Handling
MOVS PC,LR does the following clean up:
1. Copies the banked SPSR to the CPSR to restore the status register
2. Copies the banked LR (possibly adjusted for certain exceptions) to the PC to
return to the program where the exception occurred
3. Restores the execution mode and privilege level
97
Exception-Related Instruction
• Supervisor call (SVC): To transition between levels in a controlled way, the
program places arguments in registers and issues a SVC instruction, which
generates an exception and raises the privilege level.
• The OS and other code operating at PL1 can access the banked registers for
the various execution modes using the MRS (move to register from special
register) and MSR (move to special register from register) instructions. (At Boot,
initializing the stack pointers
Start-up of Computer
• On start-up, the processor jumps to the reset vector
• Execute boot loader code in supervisor mode. The boot loader typically
configures:
• Configures the memory system
• Initializes the stack pointer
• Reads the OS from disk;
• Begins a much longer boot process in the Operating System.
• The OS eventually will load a program, change to unprivileged user mode, and
Jump to the start of the program.
99
Up Next
How to implement the ARM Instruction Set
Architecture in Hardware (Chapter 7)
Microarchitecture
101
51