You are on page 1of 70

SEMICONDUCTOR

MEMORIES

Digital Integrated Circuits Memory © Prentice Hall 1995


Chapter Overview

• Memory Classification

• Memory Architectures

• The Memory Core

• Periphery

• Reliability

Digital Integrated Circuits Memory © Prentice Hall 1995


Semiconductor Memory
Classification

RWM NVRWM ROM

Random Non-Random EPROM Mask-Programmed


Access Access
E2PROM Programmable (PROM)

SRAM FIFO FLASH

DRAM LIFO
Shift Register
CAM

Digital Integrated Circuits Memory © Prentice Hall 1995


Memory Architecture: Decoders
M bits M bits

S0 S0
Word 0 Word 0
S1
Word 1 A0 Word 1
S2 Storage Storage
Word 2 Word 2
N Words

Cell A1 Cell

Decoder
A K-1
SN-2
Word N-2 Word N-2
SN_1
Word N-1 Word N-1

Input-Output Input-Output
(M bits) (M bits)

N words => N select signals Decoder reduces # of select signals


Too many select signals K = log2N

Digital Integrated Circuits Memory © Prentice Hall 1995


Array-Structured Memory Architecture
Problem: ASPECT RATIO or HEIGHT >> WIDTH

L-K Bit Line


2
Storage Cell

AK
Row Decoder

AK+1 Word Line

AL-1

K
M.2

Sense Amplifiers / Drivers Amplify swing to


rail-to-rail amplitude

A0
Column Decoder Selects appropriate
A K-1 word

Input-Output
(M bits)

Digital Integrated Circuits Memory © Prentice Hall 1995


Hierarchical Memory Architecture

Row
Address

Column
Address

Block
Address

Global Data Bus


Control Block Selector Global
Circuitry Amplifier/Driver

I/O

Advantages:
1. Shorter wires within blocks
2. Block address activates only 1 block => power savings

Digital Integrated Circuits Memory © Prentice Hall 1995


Memory Timing: Definitions

Read Cycle

READ

Read Access Read Access Write Cycle

WRITE

Write Access
Data Valid

DATA

Data Written

Digital Integrated Circuits Memory © Prentice Hall 1995


Memory Timing: Approaches
MSB LSB

Address
Row Address Colum n Address
Bus

RAS Address
Address
Bus

Address transition
CAS initiates memory operation

RAS-CAS timing

DRAM Timing SRAM Timing


Multiplexed Adressing Self-timed

Digital Integrated Circuits Memory © Prentice Hall 1995


MOS NOR ROM
VDD
Pull-up devices

WL[0]
GND
WL[1]

WL[2]
GND
WL[3]

BL[0] BL[1] BL[2] BL[3]

Digital Integrated Circuits Memory © Prentice Hall 1995


MOS NOR ROM Layout
Metal1 on top of diffusion

WL[0]
GND (diffusion)
WL[1]
Polysilicon
Basic cell
10 λ x 7 λ Metal1

WL[2] 2λ

WL[3]

Only 1 layer (contact mask) is used to program memory array


Programming of the memory can be delayed to one of
last process steps

Digital Integrated Circuits Memory © Prentice Hall 1995


MOS NOR ROM Layout
BL[0] BL[1] BL[2] BL[3]
Threshold raising
implant

WL[0]
GND (diffusion)
Basic Cell
8.5 λ x 7 λ
Metal1 over diffusion
WL[1]

Polysilicon
WL[2]

WL[3]

Threshold raising implants disable transistors

Digital Integrated Circuits Memory © Prentice Hall 1995


MOS NAND ROM
VDD

Pull-up devices

BL[0] BL [1] BL[2] BL[3]

WL[0]

WL[1]

WL[2]

WL[3]

All word lines high by default with exception of selected row

Digital Integrated Circuits Memory © Prentice Hall 1995


MOS NAND ROM Layout
Diffusion

Polysilicon

Basic cell

5 λx 6 λ

Threshold

lowering

implant

No contact to VDD or GND necessary;


drastically reduced cell size
Loss in performance compared to NOR ROM

Digital Integrated Circuits Memory © Prentice Hall 1995


Equivalent Transient Model for MOS NOR
ROM

VDD

BL
rword
Model for NOR ROM WL
Cbit

cword

Word line parasitics


Resistance/cell: (7/2) x 10 Ω/q = 35 Ω
Wire capacitance/cell: (7λ × 2λ) (0.6)2 0.058 + 2 × (7λ × 0.6) × 0.043 = 0.65 fF
Gate Capacitance/cell: (4λ × 2λ) (0.6)2 1.76 = 5.1 fF.
Bit line parasitics:
Resistance/cell: (8.5/4) x 0.07 Ω/q = 0.15 Ω (which is negligible)
Wire capacitance/cell: (8.5λ × 4λ) (0.6)2 0.031 + 2 × (8.5λ × 0.6) × 0.044 = 0.83 fF
Drain capacitance/cell: ((3λ × 4λ) (0.6)2 × 0.3 + 2 × 3λ × 0.6 ×0.8) × 0.375 +
4λ × 0.6 × 0.43 = 2.6 fF

Digital Integrated Circuits Memory © Prentice Hall 1995


Equivalent Transient Model for MOS NAND
ROM
VDD

BL
CL
rbit
Model for NAND ROM
rword cbit
WL

cword

Word line parasitics:


Resistance/cell: (6/2) x 10 Ω /q = 30 Ω
Wire capacitance/cell: (6λ × 2λ) (0.6)2 0.058 + 2 × (6λ × 0.6) × 0.043 = 0.56 fF
Gate Capacitance/cell: (3λ × 2λ) (0.6)2 1.76 = 3.8 fF.
Bit line parasitics:
Resistance/cell: ∼ 10 k Ω , the average transistor resistance over the range of interest.
Wire capacitance/cell: Included in diffusion capacitance
Source/Drain capacitance/cell: ((3λ×3 λ) (0.6)2 × 0.3 + 2 × 3λ× 0.6 × 0.8) × 0.375 + ( 3λ × 2λ) (0.6)2
× 1.76 = 5.2 fF

Digital Integrated Circuits Memory © Prentice Hall 1995


Propagation Delay of NOR ROM
Word line delay
Consider the 512×512 case. The delay of the distributed rc-line containing M
cells can be approximated using the expressions derived in Chapter 8.
2 2
tword = 0.38 (rword × cword ) M = 0.38 (35 Ω × (0.65 + 5.1) fF) 512 = 20 nsec

Bit line delay


Assume a (2.4/1.2) pull-down device and a (8/1.2) pull-up transistor. The bit
line switches between 5 V and 2.5 V.

Cbit = 512 × (2.6 + 0.8) fF = 1.7 pF

IavHL = 1/2 (2.4/0.9) (19.6 10 -6)((4.25)2/2 + (4.25 × 3.75 - (3.75) 2/2)) -


1/2 (8/0.9) (5.3 10-6 ) (4.25 × 1.25 - (1.25)2/2) = 0.36 mA
tHL = (1.7 pF × 1.25 V) / 0.36 mA = 5.9 nsec

The low-to-high response time can be computed using a similar approach.

tLH = (1.7 pF × 1.25 V) / 0.36 mA = 5.9 nsec

Digital Integrated Circuits Memory © Prentice Hall 1995


Decreasing Word Line Delay
Driver
WL Polysilicon word line

Metal word line

(a) Driving the word line from both sides

Metal bypass

WL K cells Polysilicon word line

(b) Using a metal bypass

(c) Use silicides


Digital Integrated Circuits Memory © Prentice Hall 1995
Precharged MOS NOR ROM
VDD
φpre
Precharge devices

WL[0]
GND
WL[1]

WL[2]
GND
WL[3]

BL[0] BL[1] BL[2] BL[3]

PMOS precharge device can be made as large as necessary,


but clock driver becomes harder to design.

Digital Integrated Circuits Memory © Prentice Hall 1995


Floating-gate transistor (FAMOS)

Floating gate Gate


D
Source Drain

tox G

tox
S
+ p +
n n
Substrate

(a) Device cross-section (b) Schematic symbol

Digital Integrated Circuits Memory © Prentice Hall 1995


Floating-Gate Transistor Programming

20 V 0V 5V

20 V 0V 5V
10 V→ 5 V − 5V − 2.5 V

S D S D S D

Avalanche injection. Removing programming voltage Programming results in


leaves charge trapped. higher V T.

Digital Integrated Circuits Memory © Prentice Hall 1995


FLOTOX EEPROM
Floating gate Gate I
Source Drain

20-30 nm − 10 V VGD
10 V
n+ Substrate n+
p
10 nm

(a) Flotox transistor (b) Fowler-Nordheim I-V characteristic


BL

WL

VDD

(c) EEPROM cell during a read operation

Digital Integrated Circuits Memory © Prentice Hall 1995


Flash EEPROM

Control gate

Floating gate

erasure Thin tunneling oxide

n+ source n+ drain
programming

p-substrate

Digital Integrated Circuits Memory © Prentice Hall 1995


Cross-sections of NVM cells

Flash EPROM
Courtesy Intel
Digital Integrated Circuits Memory © Prentice Hall 1995
Characteristics of State-of-the-art
NVM

Digital Integrated Circuits Memory © Prentice Hall 1995


Read-Write Memories (RAM)

• STATIC (SRAM)
Data stored as long as supply is applied
Large (6 transistors/cell)
Fast
Differential

• DYNAMIC (DRAM)
Periodic refresh required
Small (1-3 transistors/cell)
Slower
Single Ended
Digital Integrated Circuits Memory © Prentice Hall 1995
6-transistor CMOS SRAM Cell
WL

VDD
M2 M4

Q
Q M6
M5

M1 M3

BL BL

Digital Integrated Circuits Memory © Prentice Hall 1995


CMOS SRAM Analysis (Write)
WL

VDD
M4

Q=0 M6
M5 Q=1

M1
VDD
BL = 1 BL = 0

VDD VDD 2 VDD VDD 2


kn V
,M6 (
– VTn )----------- – -----------= k ( V – V )----------- – -----------  (W/L)n,M6 ≥ 0.33 (W/L)p,M4
DD 2 8  p, M4  DD Tp 2 8 

k n M5 V VDD 2 V DD V 2
, -----------
-------------- DD
– VTn ----------- = k n ,M1 ( VDD – VTn )----------- – -----------
DD 
2  2  2   2 8  (W/L)n,M5 ≥ 10 (W/L)n,M1

Digital Integrated Circuits Memory © Prentice Hall 1995


CMOS SRAM Analysis (Read)
WL

VDD

BL M4
BL

Q=0 M6
M5 Q=1

M1 V DD
VDD VDD

Cbit C bit

k n,M5 VDD V DD 2 VDD V DD 2


 
--------------- ------------ – VTn ------------  = k n,M1 ( VDD – V Tn )------------ – ------------ 

2  2  2   2 8 

(W/L)n,M5 ≤ 10 (W/L)n,M1 (supercedes read constraint)

Digital Integrated Circuits Memory © Prentice Hall 1995


6T-SRAM — Layout

VDD
M2 M4

Q Q
M1 M3

GND
M5 M6 WL

BL BL

Digital Integrated Circuits Memory © Prentice Hall 1995


Resistance-load SRAM Cell
WL

VDD

RL RL

Q Q
M3 M4

BL M1 M2 BL

Static power dissipation -- Want RL large


Bit lines precharged to VDD to address t p problem

Digital Integrated Circuits Memory © Prentice Hall 1995


3-Transistor DRAM Cell
BL1 BL2

WWL WWL

RWL
RWL

X VDD -VT
X M3
M2
M1 VDD
BL1
CS

BL2 VDD -VT ∆V

No constraints on device ratios


Reads are non-destructive
Value stored at node X when writing a “1” = VWWL -VTn

Digital Integrated Circuits Memory © Prentice Hall 1995


3T-DRAM — Layout

BL2 BL1 GND

RWL
M3

M2

WWL
M1

Digital Integrated Circuits Memory © Prentice Hall 1995


1-Transistor DRAM Cell
BL
WL Write "1" Read "1"
WL

M1 X
CS GND VDD − VT

VDD
BL
VDD /2 VDD /2
CBL sensing

Write: CS is charged or discharged by asserting WL and BL.


Read: Charge redistribution takes places between bit line and storage capacitance
CS
∆ V = VBL – V PRE = ( V BIT – V PRE )------------------------
C S + CBL

Voltage swing is small; typically around 250 mV.


Digital Integrated Circuits Memory © Prentice Hall 1995
DRAM Cell Observations

1T DRAM requires a sense amplifier for each bit line, due to


charge redistribution read-out.
DRAM memory cells are single ended in contrast to SRAM cells.
The read-out of the 1T DRAM cell is destructive; read and
refresh operations are necessary for correct operation.
Unlike 3T cell, 1T cell requires presence of an extra capacitance
that must be explicitly included in the design.
When writing a “1” into a DRAM cell, a threshold voltage is lost.
This charge loss can be circumvented by bootstrapping the
word lines to a higher value than VDD .

Digital Integrated Circuits Memory © Prentice Hall 1995


1-T DRAM Cell
Capacitor

Metal word line


M1 word
SiO2 line
poly
Field Oxide
n+ n+
Inversion layer Diffused
poly induced by bit line
plate bias
Polysilicon
Polysilicon
plate
(a) Cross-section gate

(b) Layout

Used Polysilicon-Diffusion Capacitance


Expensive in Area
Digital Integrated Circuits Memory © Prentice Hall 1995
SEM of poly-diffusion capacitor 1T-DRAM

Digital Integrated Circuits Memory © Prentice Hall 1995


Advanced 1T DRAM Cells
Word line
Cell plate Capacitor dielectric layer
Insulating Layer

Cell Plate Si

Capacitor Insulator Transfer gate Isolation


Refilling Poly Storage electrode

Storage Node Poly

Si Substrate
2nd Field Oxide

Trench Cell Stacked-capacitor Cell

Digital Integrated Circuits Memory © Prentice Hall 1995


Periphery

•Decoders
•Sense Amplifiers

•Input/Output Buffers
•Control / Timing Circuitry
Digital Integrated Circuits Memory © Prentice Hall 1995
Row Decoders
Collection of 2M complex logic gates
Organized in regular and dense fashion

(N)AND Decoder

NOR Decoder

Digital Integrated Circuits Memory © Prentice Hall 1995


Dynamic Decoders
Precharge devices GND GND
VDD

WL3
WL3 VDD

WL2
WL2 VDD

WL1 WL1
VDD

WL0 WL0

V DD φ A0 A0 A1 A1 A0 A0 A1 A1 φ

Dynamic 2-to-4 NOR decoder 2-to-4 MOS dynamic NAND Decoder

Propagation delay is primary concern


Digital Integrated Circuits Memory © Prentice Hall 1995
A NAND decoder using 2-input pre-
decoders

WL 1

WL 0

A0A1 A0 A1 A0A1 A0A 1 A 2A3 A2 A3 A2A3 A2A3

A1 A 0 A0 A1 A3 A2 A2 A3

Splitting decoder into two or more logic layers


produces a faster and cheaper implementation
Digital Integrated Circuits Memory © Prentice Hall 1995
4 input pass-transistor based column
decoder

BL 0 BL1 BL 2 BL3

S0
A0 2 input NOR decoder
S1

S2

A1
S3

D
Advantage: speed (t pd does not add to overall memory access time)
only 1 extra transistor in signal path
Disadvantage: large transistor count
Digital Integrated Circuits Memory © Prentice Hall 1995
4-to-1 tree based column decoder
BL0 BL 1 BL2 BL 3

A0

A0

A1

A1

D
Number of devices drastically reduced
Delay increases quadratically with # of sections; prohibitive for large decoders
Solutions: buffers
progressive sizing
combination of tree and pass transistor approaches

Digital Integrated Circuits Memory © Prentice Hall 1995


Decoder for circular shift-register

VDD VDD VDD VDD VDD VDD

WL0 WL 1 WL 2
φ φ φ φ φ φ
...
R φ φ R φ φ R φ φ
VDD

Digital Integrated Circuits Memory © Prentice Hall 1995


Sense Amplifiers
make ∆V as small
C ⋅∆ V as possible
t = ----------------
p I av

large small

Idea: Use Sense Amplifer

small
transition s.a.

input output

Digital Integrated Circuits Memory © Prentice Hall 1995


Differential Sensing - SRAM
VDD VDD
VDD PC VDD
y M3 M4 y

x M1 M2 x x x
BL BL
EQ
SE M5 SE

WLi
(b) Doubled-ended Current Mirror Amplifier

VDD
SRAM cell i
y y
Diff.
x Sense x x x
Amp
y y
D D SE

(a) SRAM sensing scheme.


(c) Cross-Coupled Amplifier

Digital Integrated Circuits Memory © Prentice Hall 1995


Latch-Based Sense Amplifier

EQ
BL BL
VDD

SE

SE

Initialized in its meta-stable point with EQ


Once adequate voltage gap created, sense amp enabled with SE
Positive feedback quickly forces output to a stable operating point.
Digital Integrated Circuits Memory © Prentice Hall 1995
Single-to-Differential Conversion

WL
BL
x Diff. x

+
S.A. _ Vref
cell

y y

How to make good V ref?

Digital Integrated Circuits Memory © Prentice Hall 1995


Open bitline architecture

EQ

R L1 L0 R0 R1 L
VDD
SE

BLL BLR

... ...
CS CS CS SE CS CS CS

dummy dummy
cell cell

Digital Integrated Circuits Memory © Prentice Hall 1995


DRAM Read Process with Dummy Cell
6.0

4.0
V (Volt)

BL
2.0 BL
5.0
0.00 4.0 WL
1 2 3 4 5

V (Volt)
t (nsec) 3.0 SE
(a) reading a zero
2.0 EQ
6.0 1.0
0.00 1 2 3 4 5
4.0
V (Volt)

(c) control signals


BL

2.0 BL

0.00 1 2 3 4 5
t (nsec)
(b) reading a one

Digital Integrated Circuits Memory © Prentice Hall 1995


Single-Ended Cascode Amplifier
VDD

Vcasc

WLC

WL

Digital Integrated Circuits Memory © Prentice Hall 1995


DRAM Timing

Digital Integrated Circuits Memory © Prentice Hall 1995


Address Transition Detection
VDD

DELAY
A0 td
ATD ATD

DELAY
A1 td

...
DELAY
AN-1 td

Digital Integrated Circuits Memory © Prentice Hall 1995


Reliability and Yield

Digital Integrated Circuits Memory © Prentice Hall 1995


Open Bit-line Architecture — Cross Coupling

EQ

WL1 WL0 WL D WLD WL 0 WL1


CWBL CWBL
BL BL
Sense
CBL CBL
Amplifier
C C C C C C

Digital Integrated Circuits Memory © Prentice Hall 1995


Folded-Bitline Architecture

WL1 WL1 WL0 WL0 WLD WL D


CWBL

BL CBL x y

... Sense
C C C C C
C EQ Amplifier

BL CBL x y
CWBL

Digital Integrated Circuits Memory © Prentice Hall 1995


Transposed-Bitline Architecture

BL’
Ccross
BL
SA
BL
BL"

(a) Straightforward bitline routing.

BL’
Ccross
BL
SA
BL
BL"

(b) Transposed bitline architecture.


Digital Integrated Circuits Memory © Prentice Hall 1995
Alpha-particles

α-particle

WL
VDD

BL
SiO2
n+

1 particle ~ 1 million carriers


Digital Integrated Circuits Memory © Prentice Hall 1995
Yield

Yield curves at different stages of process maturity


(from [Veendrick92])
Digital Integrated Circuits Memory © Prentice Hall 1995
Redundancy

Row
Address
Redundant
rows
Fuse
:
Bank
Redundant
columns

Row Decoder
Memory
Array

Column Decoder Column


Address

Digital Integrated Circuits Memory © Prentice Hall 1995


Redundancy and Error Correction

Digital Integrated Circuits Memory © Prentice Hall 1995


Programmable Logic Array

Product Terms

x0x1
AND x2 OR
PLANE PLANE

f0 f1

x0 x1 x2

Digital Integrated Circuits Memory © Prentice Hall 1995


Pseudo-Static PLA

VDD
GND GND GND GND
GND

GND

GND

VDD f0 f1
x0 x0 x1 x1 x2 x2

AND-PLANE OR-PLANE

Digital Integrated Circuits Memory © Prentice Hall 1995


Dynamic PLA
φAND

GND VDD
φOR

φOR
φAND
V DD f0 f1 GND
x0 x0 x1 x1 x2 x2

AND-PLANE OR-PLANE

Digital Integrated Circuits Memory © Prentice Hall 1995


Clock Signal Generation
for self-timed dynamic PLA

φ φ φAND

Dummy AND Row

φAND

φAND φOR
Dummy AND Row

φOR

(a) Clock signals (b) Timing generation circuitry.


Digital Integrated Circuits Memory © Prentice Hall 1995
PLA Layout
And-Plane Or-Plane
VDD φ GND

x0 x0 x1 x1 x2 x2 f0 f1
Pull-up devices Pull-up devices
Digital Integrated Circuits Memory © Prentice Hall 1995
PLA versus ROM

Programmable Logic Array


structured approach to random logic
“two level logic implementation”
NOR-NOR (product of sums)
NAND-NAND (sum of products)

IDENTICAL TO ROM!

Main difference
ROM: fully populated
PLA: one element per minterm

Note: Importance of PLA’s has drastically reduced


1. slow
2. better software techniques (mutli-level logic
synthesis)

Digital Integrated Circuits Memory © Prentice Hall 1995


Semiconductor Memory Trends

Memory Size as a function of time: x 4 every three years

Digital Integrated Circuits Memory © Prentice Hall 1995


Semiconductor Memory Trends

Increasing die size


factor 1.5 per generation
Combined with reducing cell size
factor 2.6 per generation

Digital Integrated Circuits Memory © Prentice Hall 1995


Semiconductor Memory Trends

Technology feature size for different SRAM generations

Digital Integrated Circuits Memory © Prentice Hall 1995

You might also like