Professional Documents
Culture Documents
Microprocessor Cores
Stephen Hill
ARM Austin Design Center
3. Power
o Dynamic power
o Power down modes
4. VFP10
ETM10
Summary
MIPS/MHz: 1.25
500 MIPS @ 400MHz
Dhrystone 2.1
Area
ARM1022E (2x16KB): 6.9mm2
ARM1020E (2x32KB): 10.3mm2
WriteBuffer
16
16or
or32k
32kData
Data 16 or 32k Data
16 or 32kCache
Cache
Instruction Cache
Cache
ARM1020E
64-bit data
THE ARCHITECTURE FOR THE DIGITAL WORLD
TM
Hot Chips 13 4
ARM10E Microarchitecture
64-bit instruction and data interfaces
Static branch prediction with branch folding
Parallel load/store pipeline
Dedicated machine for LDM/STM execution; all but
the first cycle of these instructions are hidden if no
dependencies are encountered
Parallel execution of multi-cycle
coprocessor operations
Multiply 16 bits per cycle
1-3 cycle throughput and 2-4 cycle latency
No data-dependent MUL cycle counts
ARM10
Branch ARM or Data + Branch Data Cache
Register
Predictor Thumb Address Interface
Instruction Read
Generator
Decode +
Instruction Coprocessor Reg
Address Result Write
Forward Shift Data
Generator Interface
Coprocessor + + ALU
Instruction Scoreboard
Instruction Issue Multiply
Fetch Multiply
Add
ARM10
Branch ARM or Data + Branch
Register Address Data Cache
Predictor Thumb Interface
Instruction Read Generator
Decode +
Instruction
Address Result Shift Coprocessor Reg
Generator Forward + Data Write
Coprocessor + ALU Interface
Instruction Scoreboard
Instruction Issue Multiply
Fetch Multiply Add
48 9 9 9 9
Instruction MMU
5%
Data MMU
Instruction Cache 5%
22%
BIU
5%
Clocks
6%
Buffers
1%
ARM10E Core
32%
(PowerMill simulation)
PM Layer PM Layer
Power
Management
Controller
PM Layer
ARM High Performance Bus DSP Core
Interrupt
Controller
Clocks
CORE and SCC CORE and SCC
PM Layer
Cache Cache
Data
ARM1020E
Control
EmbeddedICE
Logic Address
ETM10
TAP
JTAG
Multi-ICE Port
SOC Trace
Port
Trace Port
Analyzer
(ARM10200r1)