You are on page 1of 43

Thumb Instructions

1 of 37
Objectives

• To understand 16-bit Thumb mode operation of ARM


Processor.
• To understand the features of Thumb mode operation and how
Thumb instructions decompress to ARM Mode.
• To know the technique of switching between ARM and Thumb
mode of operations.
• To know the similarities and differences between ARM and
Thumb mode of operation
• To understand exception handling and branching in Thumb
mode.
• To understand operation of data processing instructions and data
transfer instructions in Thumb mode.

2 of 37
CPU Instruction Set

ARM7TDMI processor has two instruction sets:


• the standard 32-bit ARM instruction set
• a 16-bit THUMB instruction set.

3 of 37
Processor Operating States

ARM state
which executes 32-bit, word-aligned ARM instructions.

THUMB state
which operates with 16-bit, halfword-aligned THUMB
instructions.

4 of 37
Thumb Instruction Set

• ARM architecture versions v4T and above define a 16-bit


instruction set called the Thumb instruction set. The
functionality of the Thumb instruction set is a subset of the
functionality of the 32-bit ARM instruction set.
• A processor that is executing Thumb instructions is operating in
Thumb state. A processor that is executing ARM instructions is
operating in ARM state.

5 of 37
Thumb Instruction Set

• A processor in ARM state cannot execute Thumb instructions,


and a processor in Thumb state cannot execute ARM
instructions. You must ensure that the processor never receives
instructions of the wrong instruction set for the current state.
• Each instruction set includes instructions to change processor
state.

Note: ARM processors always start executing code in ARM state.

6 of 37
Thumb Instruction Set

• The processor in Thumb mode uses same eight general-purpose


integer registers that are available ARM mode.Some Thumb
instructions also access the PC(ARM register 15),the Link
Register(ARM register 14) and Stack Pointer(ARM register 13).
• When R15 is read,bit[0] is zero and bits[31:1]contain the
PC.when R15 is written,bit[0] is IGNORED and bits[31:1] are
written to the PC.
• Thumb does not provide direct access to the CPSR or any
SPSR.
• Thumb execution is flagged by the T bit(bit[5]) in the CPSR.
T==0 32-bit instructions are fetched(ARM instruction)
T==1 16-bit instructions are fetched(Thumb instruction)

7 of 37
Thumb applications

In a typical embedded system:


• use ARM code in 32-bit on-chip memory for small speed-
critical routines
• use Thumb code in 16-bit off-chip memory for large non-
critical control routines

Note:
Switching between ARM and Thumb States of Execution
Using BX Instruction
8 of 37
Thumb applications

For Most Instruction Generated by the Compiler


• Condition Execution is not used.
• Source and Destination Registers are identical
• Only low registers used
• Constants are limited size
• Inline barrel shifter not used

9 of 37
DATA TYPES

Byte (8-bit):
placed on any byte boundary.
Half-word (16-bit):
aligned to two-byte boundaries.
Word (32-bit):
aligned to four- byte boundaries.

10 of 37
Features

• Not a complete architecture


• Dynamically decompressed to ARM Instruction
• Fully supported by ARM development tools
• Both entry and exit are done using corresponding BX
Instruction

11 of 37
ARM7TDMI core

12 of 37
Mode Switching

• Default entry to exception mode is always ARM


• Explicit entry to Thumb is done using ARM mode BX
Instruction
• Explicit entry back to ARM mode is done using Thumb mode
BX Instruction

13 of 37
Thumb Programmers Model

• Registers r0 to r7 are accessible (Lo)


• Few instructions require r8 to r15 to be specified
• r13 is used as the stack pointer
• r14 is used as the link register
• r15 is used as the program counter

14 of 37
THUMB Register Organisation
Thumb General registers and Program Counter

User / System FIQ Supervisor Abort IRQ Undefined


r0 r0 r0 r0 r0 r0
r1 r1 r1 r1 r1 r1
r2 r2 r2 r2 r2 r2
r3 r3 r3 r3 r3 r3
r4 r4 r4 r4 r4 r4
r5 r5 r5 r5 r5 r5
r6 r6 r6 r6 r6 r6
r7 r7 r7 r7 r7 r7
SP SP_FIQ SP_SVC SP_ABT SP_IRQ SP_UND

LR LR_ FIQ LR_ SVC LR_ ABT LR_ IRQ LR_ UND
PC_ FIQ PC_ SVC PC_ ABT PC_ IRQ PC_ UND
PC

Thumb Program Status Registers

CPSR CPSR CPSR CPSR CPSR CPSR

sprsr_fiq
SPSR_FIQ SPSR_SVC SPSR_ABT sprsr_fiq
SPSR_IRQ SPSR _UND
sprsr_fiq

15 of 37
ARM-Thumb Similarities

• Load-store architecture
• Support 8-bit byte, 16-bit half-word and 32 bit word data types
with aligned boundaries
• 32 bit unsegmented memory.

16 of 37
ARM-Thumb differences

Most Thumb instructions are executed unconditionally.


(All ARM instructions are executed conditionally.)

• Many Thumb data processing instructions use a 2-address format


(the destination register is the same as one of the source registers).
(ARM data processing instructions, with the exception of the 64-
bit multiplies, use a 3-address format.)

• Thumb instruction formats are less regular than ARM instruction


formats, as a result of the dense encoding.

17 of 37
Thumb exception

• With exception processor is returned to ARM mode.

• While returning previous mode is restored as SPSR is


transferred to CPSR

18 of 37
Thumb Branching

• Short conditional branches

• Medium range unconditional branches

• Long range Subroutine calls

• Branch to change to ARM Mode

19 of 37
Branch Instruction Formats

B <cond> <label>
15 14 13 12 11 8 7 0

1 1 0 1 Condition 8-Bit Offset

B <label>
1 1 1 0 0 11 – Bit Offset

BL <label>
1 1 1 H 11 – Bit Offset

BX Rm
0 1 0 0 0 1 1 1 0 H Rm 0 0 0

20 of 37
Features

• Different format for each case


• Offset is reduced to 11bit and 8 bit
• Offset is shifted left by 1-bit (to give half-word alignment)
and sign-extended to 32 bits.
• BL is more subtle to give 22-bit offset using link register
for temporary storage
• No direct mapping to ARM instructions as Thumb require
half-word aligned offsets.

21 of 37
BL Instruction

To allow for a reasonably large offset to the target subroutine


each of these two instructions is automatically translated
by the assembler into a sequence of two 16 bit thumb
instructions
1. H = 10
LR := PC + (sign-extended offset shifted left 12 places);

2. H = 11
PC := LR + (offset shifted left 1 place)

3. LR := address of next instruction

22 of 37
Data Processing

• Fairly complex instruction formats


• No conditional execution
• Separate shift operations provided, no shifting of second
operand
• All data processing instruction set condition code bits (no need
of ‘S’)

23 of 37
Instruction formats

• <op> Rd, Rn, Rm


• <op> Rd, Rn, # <imm3>
• <op> Rd|Rn, Rm|Rs
• <op> Rd, Rn, #<sh 5>
• <op> Rd, #<imm 8>

24 of 37
Instructions

• MOV Rd, #<imm8>


• MVN Rd, Rm
• CMP Rn, #<imm8>
• CMP Rn, Rm
• CMN Rn, Rm
• TST Rn, Rm

25 of 37
26 of 37
Instruction

• ADD Rd, Rn, #<imm3>


• ADD Rd, #< imm8>
• ADD Rd, Rn, Rm
• ADC Rd, Rm
• SUB Rd, Rn, #<imm3>
• SUB Rd, #< imm8>
• SUB Rd, Rn, Rm
• SBC Rd, Rm
• NEG Rd, Rn

27 of 37
Instruction

• LSL Rd, Rm, #<#sh>


• LSL Rd, Rs
• LSR Rd, Rm, #<#sh>
• LSR Rd, Rs
• ASR Rd, Rm, #<#sh>
• ASR Rd, Rs
• ROR Rd, Rs

28 of 37
Instruction

• AND Rd, Rm
• EOR Rd, Rm
• ORR Rd, Rm
• BIC Rd, Rm
• MUL Rd, Rm

29 of 37
Instruction (using Hi registers)

• ADD Rd, Rm (1 or 2 Hi registers)


• CMP Rn, Rm (1 or 2 Hi registers)
• MOV Rd, Rm (1 or 2 Hi registers)
• ADD Rd, PC, #<imm8>
• ADD Rd, SP, #<imm8>
• ADD SP, SP, #<imm7>
• SUB SP, SP, #<imm7>
Except others donot set condition code bits

30 of 37
Data Transfer Instruction

• LDR|STR Rd, [Rn, #off5]


• LDR|STR Rd, [Rn, Rm]
• LDRB|STRB Rd, [Rn, #off5]
• LDRB|STRB Rd, [Rn, Rm]
• LDRH|STRH Rd, [Rn, #off5]
• LDRH|STRH Rd, [Rn, Rm]
* Signed operands:
• LDR|STR {S} {H|B} Rd, [Rn, Rm]

31 of 37
Multiple register transfers

• LDMIA|STMIA Rn!, { <reg list> }

• Rn may be any register among Ro – R7

• Register set can be any subset of R0 – R7 but not base


register ‘Rn’

• Write back to base register is always selected.

32 of 37
Stack Mode

• POP|PUSH { <reg list> {, R}}

• R13 (sp) is used as base register

• Uses Full Descending Stack

• In addition any subset of Ro-R7 registers LR (lr) may be


included in PUSH instruction and PC (pc)may be included in
POP instruction

33 of 37
Thumb-ARM Decompression

• Translation from 16-bit Thumb instruction to 32-bit ARM


instruction
• Condition bits changed to ‘always’
• Lookup to translate major and minor opcodes
• Zero extending 3-bit register specifiers to give 4-bit
specifiers
• Zero extending immediate values
• Implicit ‘S’(affecting condition codes) should be explicitly
specified.
• Thumb 2-address format must be mapped to ARM 3-
address format

34 of 37
35 of 37
36 of 37
37 of 37
Thumb Instruction Decompressor
Organization

38 of 37
Thumb to ARM Instruction Mapping

39 of 37
Example of Thumb code

AREA ToThumb, CODE, READONLY


ENTRY ; Mark first instruction to execute
ARM ; Subsequent instructions are ARM
start ADR r0, into_thumb + 1 ; Processor starts in ARM state
BX r0 ; Inline switch to Thumb state
THUMB ; Subsequent instructions are Thumb into_thumb
MOVS r0, #10 ; New-style Thumb instructions

40 of 37
Properties

• Thumb code requires 70% of space of ARM code


• Thumb code uses 40% more instructions than the ARM
code
• With 32-bit memory ARM code is 40% faster
• With 16-bit memory Thumb code is 45% faster than ARM
code
• Thumb code uses 30% less external memory power than
ARM code.

41 of 37
Summary

• Thumb ARM mode switching


• Similarities and differences
• Thumb exception handling
• Data processing instructions
• Data transfer instructions
• Separate shift instructions
• Restricted stack operations.

42 of 37
Thank You, Any Questions ?

43 of 37

You might also like