Professional Documents
Culture Documents
MEMORIES
L1-I
L2-Cache
System bus
Video
RAM
AGP
GPU bus
USB bus
Ch 1
Memory
Controller Ch 2
PCI bus
I/O
Controller
Other
buses
Disk
Adapter
DRAM
DIMM
DRAM
DIMM
Ethernet
Adapter
volatile
non-volatile
Non-Volatile
Volatile
(requires power to
hold data)
ROM
cache
off-chip
cache
memory
L2
64k to 4M
L3
L4
Relative
Performance
100,000
10,000
1,000
Processor Memory
Performance Gap
(grew about 50%/year)
CPU
100
10
1
1980
Memory
1985
1990
1995
Year
2000
2005
2010
Access Time
Cycle Time
Low
2
2
1*
1*
3
3
4
4
2
2
2
2
Cache/PDAs
*1 = Best
Access Time (tAC) time required to read data from a single memory cell
Cycle Time (tC) time required to perform a read or write operation plus any
recovery time before the next read/write operation can begin
(measure of overall data rate)
TYPICALRANDOM
RANDOMACCESS
ACCESSMEMORY
MEMORYARRAY
ARRAY
TYPICAL
ORGANIZATION
ORGANIZATION (ONE
(1) OF 2M-BIT WORD PER ROW)
G
Data
S,D
Chip
Control
Signals
SENSE AMPLIFIERS/DRIVERS
Address
(N + M)
Practical Issues:
1. N >> M
a. Long, thin layout => awkward to fit into system chip floor-plan.
b. Long bit lines slow memory access, i.e. more parasitic capacitance.
2. 2N*2M very large, say 1010 to 1012 cells
a. Long bit lines slow memory access, i.e. more parasitic capacitance.
Remedies:
1. Reorganize memory by reducing the number of rows to 2 N-k and increasing the
number of rows to 2M+k, i.e. make N k M &k (complicates column decoder)
2. Construct large memories from smaller modular blocks
Kenneth R. Laker, University of Pennsylvania, updated 02Apr15
2
addr
(to CPU)
Memory
Controller
8
Cols
1
0
1
Rows
2
3
Supercell (2, 1)
8 bits wide
data
10
11
12
13
Pseudo-nMOS
NAND gate
14
N=2
Purpose of ROW
DECODER ->
reduce number of
external signals (or
bits) needed to
select a word or row
from memory.
Kenneth R. Laker, University of Pennsylvania, updated 02Apr15
L = 2N = 4
MEMORY
Storing
L = 2N = 4
words, each
2M bits wide
L = 2N = 4 rows
(N*L) nMOS + (L) pMOS
16
Row
decoder
selects
oneof
(1)
Let
there
be
one
(1)
22MM bit
bitlines.
word per Row.
17
C2M
CA1
CA1
CA2
CA2
CAM
CAM
18
1.5
3.5 fF/m2
1.8 fF
Crow = Cox (LnMOSWnMOS) = 3.5 fF (2 x 1.5) = 10.5 fF per cell
Rrow = Rsheet-poly (Lpoly/Wpoly) = 20 (6/1.5) = 80 per cell
Kenneth R. Laker, University of Pennsylvania, updated 02Apr15
19
R512
C64
64
63
63
64
VG64
64
* row
VG64
G256
20
REVIEW
21
*column
512
R512
R512
=512 x 1.8 fF pF
= 0.9
C column 512C dbn=5120.0118
6pF
pF
22
Other Parameters:
VOH = VDD = 5V
0.9 pF
V OL 0V
VT0n = - VT0p = 1 V
nCox = 20 A/V2
.18
11 ns
20.9 ns&11
ns + 18ns=12.2
ns = 38.9
*access =* row &*column==1.2
nsns
Kenneth R. Laker, University of Pennsylvania, updated 02Apr15
23
24
of data
25
26
27
Rk
Rk = 1
a. WRITE 1 OP
b. READ 1 OP
c. WRITE 0 OP
d. READ 0 OP
Kenneth R. Laker, University of Pennsylvania, updated 02Apr15
28
Rk -> 1
Rk -> 1
29
30
Rk -> 1
31
32
Rk
Rk
33
Rk
34
CMOS SRAM
WRITE CKT
from ROW
DECODER
Rk
(0)
(1)
(1)
(1)
(0)
(0)
DATA
WB
WB
35
CMOS SRAM
SRAM READ CIRCUIT
Differential Sense Amplifier
(one per column)
MP2
MA4
Rk
WB
MA5
VNOT-C
M2
WB
Read
Select
VC
MA1
MA2
MA3
36
Rk
6T 1-bit CMOS
SRAM Cell
37
38
6
3
M1
M2
M4
M3
M2
M3
M1
39
M1
NOTE:
Two-poly
capacitors
have very
low
dissipation
40
VC
VB
VNOT-C
write select
(WS)
41
Precharge PC = 1
VC
VB
M2
I=0
CNOT-C
WS = 1; RS = 0
WS
DATA = 0
(WRITE)
(READ)
CC >> C
Kenneth R. Laker, University of Pennsylvania, updated 02Apr15
42
VB
CC
When WS = 1
Since CC >> C
C C V C &C V B
V R=
C C &C
V R V C
independent of VB
43
CC
Vdata-out
WS = 0; RS = 1
WS
(Read 1 is non-destructive)
Falling Vdata-out is
interpreted as 1
VC
VB
3-T DRAM
cell is
Inverting
CNOT-C
WS = 1; RS = 0
WS
MDATA
(and C) DISCHARGED
CC >> C
DATA = 1
Kenneth R. Laker, University of Pennsylvania, updated 02Apr15
DATA = 1
44
VB keeps
M2 OFF
CC
Vdata-out
WS = 0;
WS
High Vdata-out is
interpreted as 0
Read 0 is nondestructive
45
Din
Dout
Din
(WRITE)
(READ)
CC >> C
write
1
write
0
WS
Din
Dout
Kenneth R. Laker, University of Pennsylvania, updated 02Apr15
46
CC >> C
STEP 1 READ OP: Pre-charge column capacitance Cc to HIGH (VBL0 = VPRE = VDD/2)
STEP 2 READ OP: Set R = 1 and detect VBL on Cc + C due to charge sharing
Kenneth R. Laker, University of Pennsylvania, updated 02Apr15
47
48
Dynamic DRAM
++ Small cells (1-T to 3-T per cell) more bits per unit chip area
+ 4x to 10x higher density than SRAM with same chip area
- Periodic refresh required if DATA stored for > 1 msec
- Volatile
- Slow due to very complex interface
- row/column access multiplexed
o Used where speed in less important than high capacity, e.g. main memory
Kenneth R. Laker, University of Pennsylvania, updated 02Apr15
49