Professional Documents
Culture Documents
ch1 010902
ch1 010902
Hardware/Software Introduction
Chapter 1: Introduction
Outline
Embedded systems overview
What are they?
PCs
Laptops
Mainframes
Servers
Modems
MPEG decoders
Network cards
Network switches/routers
On-board navigation
Pagers
Photocopiers
Point-of-sale systems
Portable video games
Printers
Satellite phones
Scanners
Smart ovens/dishwashers
Speech recognizers
Stereo systems
Teleconferencing systems
Televisions
Temperature controllers
Theft tracking systems
TV set-top boxes
VCRs, DVD players
Video game consoles
Video phones
Washers and dryers
Tightly-constrained
Low cost, low power, small, fast, etc.
CCD preprocessor
Pixel coprocessor
D2A
lens
JPEG codec
Microcontroller
Multiplier/Accum
DMA controller
Memory controller
Display ctrl
UART
LCD ctrl
Design metric
A measurable feature of a systems implementation
Optimizing design metrics is a key challenge
Maintainability: the ability to modify the system after its initial release
Correctness, safety, many more
10
Power
Performance
Size
NRE cost
CCD
CCD preprocessor
Pixel coprocessor
D2A
lens
JPEG codec
Microcontroller
Multiplier/Accum
DMA controller
Memory controller
Display ctrl
UART
LCD ctrl
11
Revenues ($)
Time (months)
Average time-to-market
constraint is about 8 months
Delays can be costly
12
Revenues ($)
Peak revenue
Peak revenue from
delayed entry
On-time
Market fall
Market rise
Delayed
Loss
D
On-time
entry
Delayed
entry
2W
Time
13
Peak revenue
Peak revenue from
delayed entry
On-time
Market fall
Market rise
Delayed
On-time
entry
Delayed
entry
2W
Time
On-time = 1/2 * 2W * W
Delayed = 1/2 * (W-D+W)*(W-D)
14
Example
NRE=$2000, unit=$100
For 10 units
total cost = $2000 + 10*$100 = $3000
per-product cost = $2000/10 + $100 = $300
Amortizing NRE cost over the units results in an
additional $200 per unit
Embedded Systems Design: A Unified
Hardware/Software Introduction, (c) 2000 Vahid/Givargis
15
$160,000
to a l c o st (x 1 0 0 )
$120,000
$200
A
B
$160
p e r p ro d u c t c o st
$200,000
$120
$80
$80,000
$40,000
$40
$0
$0
0
800
1600
2400
800
1600
2400
16
Throughput
Tasks per second, e.g. Camera A processes 4 images per second
Throughput can be more than latency seems to imply due to concurrency, e.g.
Camera B may process 8 images per second (by capturing a new image while
previous image is being stored).
17
18
Processor technology
The architecture of the computation engine used to implement a
systems desired functionality
Processor does not have to be programmable
Processor not equal to general-purpose processor
Controller
Datapath
Controller
Datapath
Controller
Datapath
Control
logic and
State register
Register
file
Control logic
and State
register
Registers
Control
logic
index
State
register
IR
PC
General
ALU
IR
Custom
ALU
PC
Data
memory
Program
memory
Assembly code
for:
Data
memory
total = 0
for i =1 to
General-purpose (software)
total
Data
memory
Program memory
Assembly code
for:
total = 0
for i =1 to
Application-specific
Single-purpose (hardware)
19
Processor technology
Desired
functionality
General-purpose
processor
Embedded Systems Design: A Unified
Hardware/Software Introduction, (c) 2000 Vahid/Givargis
total = 0
for i = 1 to N loop
total += M[i]
end loop
Application-specific
processor
Single-purpose
processor
20
General-purpose processors
Programmable device used in a variety of
applications
Also known as microprocessor
Features
Program memory
General datapath with large register file and
general ALU
User benefits
Low time-to-market and NRE costs
High flexibility
Controller
Datapath
Control
logic and
State
register
Register
file
IR
PC
Program
memory
General
ALU
Data
memory
Assembly code
for:
total = 0
for i =1 to
21
Single-purpose processors
Digital circuit designed to execute exactly
one program
a.k.a. coprocessor, accelerator or peripheral
Features
Contains only the components needed to
execute a single program
No program memory
Controller
Datapath
Control
logic
index
total
State
register
Data
memory
Benefits
Fast
Low power
Small size
Embedded Systems Design: A Unified
Hardware/Software Introduction, (c) 2000 Vahid/Givargis
22
Application-specific processors
Programmable processor optimized for a
particular class of applications having
common characteristics
Compromise between general-purpose and
single-purpose processors
Controller
Datapath
Control
logic and
State
register
Registers
IR
PC
Features
Program memory
Optimized datapath
Special functional units
Benefits
Program
memory
Custom
ALU
Data
memory
Assembly code
for:
total = 0
for i =1 to
23
IC technology
The manner in which a digital (gate-level)
implementation is mapped onto an IC
IC: Integrated circuit, or chip
IC technologies differ in their customization to a design
ICs consist of numerous layers (perhaps 10 or more)
IC technologies differ with respect to who builds each layer and
when
IC package
IC
source
gate
oxide
channel
drain
Silicon substrate
24
IC technology
Three types of IC technologies
Full-custom/VLSI
Semi-custom ASIC (gate array and standard cell)
PLD (Programmable Logic Device)
25
Full-custom/VLSI
All layers are optimized for an embedded systems
particular digital implementation
Placing transistors
Sizing transistors
Routing wires
Benefits
Excellent performance, small size, low power
Drawbacks
High NRE cost (e.g., $300k), long time-to-market
Embedded Systems Design: A Unified
Hardware/Software Introduction, (c) 2000 Vahid/Givargis
26
Semi-custom
Lower layers are fully or partially built
Designers are left with routing of wires and maybe placing
some blocks
Benefits
Good performance, good size, less NRE cost than a fullcustom implementation (perhaps $10k to $100k)
Drawbacks
Still require weeks to months to develop
27
Benefits
Low NRE costs, almost instant IC availability
Drawbacks
Bigger, expensive (perhaps $30 per unit), power hungry,
slower
Embedded Systems Design: A Unified
Hardware/Software Introduction, (c) 2000 Vahid/Givargis
28
Moores law
The most important trend in embedded systems
Predicted in 1965 by Intel co-founder Gordon Moore
IC transistor capacity has doubled roughly every 18 months for the past several decades
Logic transistors
per chip
(in millions)
10,000
Note:
logarithmic scale
1,000
100
10
1
0.1
0.01
0.001
2009 2007 2005 2003 2001 1999 1997 1995 1993 1991 1989 1987 1985 1983 1981
29
Moores law
Wow
This growth rate is hard to imagine, most people
underestimate
How many ancestors do you have from 20 generations ago
i.e., roughly how many people alive in the 1500s did it take to make
you?
220 = more than 1 million people
30
1984
1987
1990
1993
1996
1999
2002
10,000
transistors
150,000,000
transistors
Leading edge
chip in 1981
Leading edge
chip in 2002
31
Design Technology
The manner in which we convert our concept of
desired system functionality into an implementation
Compilation/
Synthesis
Compilation/Synthesis:
Automates exploration and
insertion of implementation
details for lower level.
Libraries/
IP
Test/
Verification
System
specification
System
synthesis
Hw/Sw/
OS
Model simulat./
checkers
Behavioral
specification
Behavior
synthesis
Cores
Hw-Sw
cosimulators
RT
specification
RT
synthesis
RT
components
HDL simulators
Logic
specification
Logic
synthesis
Gates/
Cells
Gate
simulators
To final implementation
32
Productivity
(K) Trans./Staff Mo.
10
1
0.1
0.01
2009
2007
2005
2003
2001
1999
1997
1995
1993
1991
1989
1987
1985
1983
1981
33
Hardware/software
codesign
Compilers
(1960's,1970's)
Register transfers
Assembly instructions
RT synthesis
(1980's, 1990's)
Assemblers, linkers
(1950's, 1960's)
Machine instructions
Logic synthesis
(1970's, 1980's)
Logic gates
Microprocessor plus
program bits: software
Implementation
The choice of hardware versus software for a particular function is simply a tradeoff among various
design metrics, like performance, power, size, NRE cost, and especially flexibility; there is no
fundamental difference between what hardware or software can implement.
Embedded Systems Design: A Unified
Hardware/Software Introduction, (c) 2000 Vahid/Givargis
34
General,
providing improved:
Generalpurpose
processor
ASIP
Singlepurpose
processor
Flexibility
Maintainability
NRE cost
Time- to-prototype
Time-to-market
Cost (low volume)
Customized,
providing improved:
Power efficiency
Performance
Size
Cost (high volume)
PLD
Embedded Systems Design: A Unified
Hardware/Software Introduction, (c) 2000 Vahid/Givargis
Semi-custom
Full-custom
35
10,000
100,000
1,000
10,000
100
10
1
1000
Gap
IC capacity
0.1
0.01
100
10
productivity
0.001
1
0.1
Productivity
(K) Trans./Staff-Mo.
Logic transistors
per chip
(in millions)
0.01
2009 2007 2005 2003 2001 1999 1997 1995 1993 1991 1989 1987 1985 1983 1981
36
10,000
100,000
1,000
10,000
100
10
1
0.1
Gap
IC capacity
0.01
productivity
0.001
1000
100
10
1
0.1
0.01
Productivity
(K) Trans./Staff-Mo.
Logic transistors
per chip
(in millions)
2009 2007 2005 2003 2001 1999 1997 1995 1993 1991 1989 1987 1985 1983 1981
37
1M transistors, 1
designer=5000 trans/month
Each additional designer
reduces for 100 trans/month
So 2 designers produce 4900
trans/month each
60000
50000
40000
30000
20000
10000
16
16
19
18
23
24
43
Team
15
20
30
Number of designers
40
38
Summary
Embedded systems are everywhere
Key challenge: optimization of design metrics
Design metrics compete with one another
39