Professional Documents
Culture Documents
discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/234819774
CITATIONS READS
236 221
1 author:
Yuan Taur
University of California, San Diego
108 PUBLICATIONS 6,105 CITATIONS
SEE PROFILE
All content following this page was uploaded by Yuan Taur on 06 February 2015.
Beginning with a brief review of CMOS scaling performance, dramatically reduced cost per function,
trends from 1 m to 0.1 m, this paper and much-reduced physical size compared to their
examines the fundamental factors that will predecessors.
ultimately limit CMOS scaling and considers The prevailing VLSI technology today comprises CMOS
the design issues near the limit of scaling. devices because of their unique characteristic of negligible
The fundamental limiting factors are electron standby power, which allows the integration of tens of
thermal energy, tunneling leakage through millions of transistors on a processor chip with only a very
gate oxide, and 2D electrostatic scale length. small fraction (⬍1%) of them switching at any given
Both the standby power and the active power instant. As the CMOS dimension, in particular the channel
of a processor chip will increase precipitously length, is scaled to the nanometer regime (⬍100 nm),
below the 0.1-m or 100-nm technology however, the electrical barriers in the device begin to
generation. To extend CMOS scaling to the lose their insulating properties because of thermal
shortest channel length possible while still injection and quantum-mechanical tunneling [2]. This
gaining significant performance benefit, an results in a rapid rise of the standby power of the chip,
optimized, vertically and laterally nonuniform placing a limit on the integration level as well as on the
doping design (superhalo) is presented. It is switching speed.
projected that room-temperature CMOS will This paper considers bulk CMOS designs that extend
be scaled to 20-nm channel length with the the limit of scaling to the shortest channel length possible.
superhalo profile. Low-temperature CMOS Section 2 is an overview of the perspectives of CMOS
allows additional design space to further scaling which examines the fundamental factors that will
extend CMOS scaling to near 10 nm. ultimately limit scaling: electron thermal voltage and oxide
tunneling. Section 3 discusses a 2D MOSFET scale length
model and presents a feasible design for 25-nm CMOS,
1. Introduction likely to be near the limit of CMOS scaling. Section 4
The steady downscaling of transistor dimensions over explores the possibility and design issues of extending
the past two decades has been the main stimulus to CMOS scaling to 10-nm channel length through low-
the growth of silicon integrated circuits (ICs) and the temperature operation. Section 5 concludes the paper.
information industry. The more an IC is scaled, the higher
becomes its packing density, the higher its circuit speed, 2. CMOS scaling trends and limiting factors
and the lower its power dissipation [1]. These have been When the dimensions of a MOSFET are scaled down,
key in the evolutionary progress leading to today’s both the voltage level and the gate-oxide thickness must
computers and communication systems that offer superior also be reduced [1]. Since the electron thermal voltage,
娀Copyright 2002 by International Business Machines Corporation. Copying in printed form for private use is permitted without payment of royalty provided that (1) each
reproduction is done without alteration and (2) the Journal reference and IBM copyright notice are included on the first page. The title and abstract, but no other portions, of this
paper may be copied or distributed royalty free without further permission by computer-based and other information-service systems. Permission to republish any other portion of
this paper must be obtained from the Editor. 213
0018-8646/02/$5.00 © 2002 IBM
IBM J. RES. & DEV. VOL. 46 NO. 2/3 MARCH/MAY 2002 Y. TAUR
10 Vg 0
Gate-controlled barrier
Vt
5
Source
Vdd
Power supply and threshold voltage (V)
2 Drain
103
107 0.2
Vt
L 2
108 0
0 0.2 0.4 0.6 0.8 1 1.2
1 Gate voltage (V)
0.02 0.05 0.1 0.2 0.5 1
MOSFET channel length ( m)
Figure 2
Figure 1 MOSFET current in both logarithmic (left) and linear (right)
scales vs. gate voltage. The slope of the dotted line represents the
History and trends of power-supply voltage (Vdd ), threshold large-signal transconductance for a digital circuit. Inset shows the
voltage (Vt ), and gate-oxide thickness (tox ) vs. channel length for band diagram of an n-MOSFET. The barrier height at Vg 0 is
CMOS logic technologies. proportional to Vt.
kT/q, is a constant for room-temperature electronics, the MOSFET threshold voltage is defined as the gate voltage
ratio between the operating voltage and the thermal at which significant current begins to flow from the source
voltage inevitably shrinks. This leads to higher source- to the drain (Figure 2). Below the threshold voltage, the
to-drain leakage currents stemming from the thermal current does not drop immediately to zero. Rather, it
diffusion of electrons. At the same time, the gate oxide decreases exponentially, with a slope on the logarithmic
has been scaled to a thickness of only a few atomic layers, scale inversely proportional to the thermal energy kT. This
where quantum-mechanical tunneling gives rise to a sharp is because some of the thermally distributed electrons at
increase in gate leakage currents [3]. The effects of these the source of the transistor have high enough energy to
fundamental factors on CMOS scaling are quantified below. overcome the potential barrier controlled by the gate
voltage and flow to the drain (Figure 2 inset). Such a
Power supply and threshold voltage subthreshold behavior follows directly from fundamental
Figure 1 shows the scaling trend of power-supply voltage thermodynamics and is independent of power-supply
(V dd ), threshold voltage (V t ), and gate-oxide thickness (t ox ) voltage and channel length.
as a function of CMOS channel length [2]. It is seen that The standby power of a CMOS chip due to source-to-
the power-supply voltage has not been decreasing at a drain subthreshold leakage is given [4] by
冉 冊
rate proportional to the channel length. This means that qVt
the field has been gradually rising over the generations Poff Wtot Vdd Ioff WtotVdd I0 exp ⫺ , (1)
mkT
between 1-m and 0.1-m channel lengths. Fortunately,
thinner oxides are more reliable at high fields, thus where W tot is the total turned-off device width with V dd
allowing operation at the reduced but nonscaled supply across the source and drain, I off is the average off-current
voltages. per device width at 100⬚C (worst-case temperature), I 0 is
Below 0.1 m, threshold voltage deviates even further the extrapolated current per width at threshold voltage
214 from the past scaling behavior, as depicted in Figure 1. (of the order of 1–10 A/m for 0.1-m devices),
Y. TAUR IBM J. RES. & DEV. VOL. 46 NO. 2/3 MARCH/MAY 2002
m is a dimensionless ideality factor typically ⬇1.2 (to be
discussed later), and V t is the threshold voltage at 100⬚C.
Even if V t is kept constant, the leakage current of turned- e
Tunneling
off devices will increase in proportion to 1/t ox and
(W tot /L) because the current at threshold condition I 0 is
proportional to the inversion charge density at threshold,
Q i ⬃ (1–2)(kT/q)C ox [4], where C ox ⫽ ox /t ox is the gate- 106
Ion
oxide capacitance per unit area. The off-state leakage Data tox (nm)
104 Model
IBM J. RES. & DEV. VOL. 46 NO. 2/3 MARCH/MAY 2002 Y. TAUR
the silicon surface, which effectively reduces the gate
Gate capacitance and therefore the inversion charge to those
of an equivalent oxide ⬃0.4 nm thicker than the physical
Source tox n poly Drain
oxide [4]. Similarly, depletion effects occur in polysilicon
0 H G L y
in the form of a thin space-charge layer near the oxide
A F
Na n
n interface which acts to reduce the gate capacitance and
Wd B C D E inversion-charge density for a given gate drive. The
percentage of gate-capacitance attenuation becomes more
significant as the oxide thickness is scaled down. For a
20 ⫺3
x polysilicon doping of 10 cm , a 2-nm oxide loses about
Substrate 20% of the inversion charge at 1.5-V gate voltage because
of the combined effects of polysilicon gate depletion and
inversion-layer quantization [2]. Taking these two effects
into account, the scaling limit of the electrical oxide
thickness (t inv ), i.e., the effective oxide thickness for
Figure 4 inversion charge calculations, is likely to be 1.5–2.0 nm.
insensitive to the applied voltage or field across the oxide, 2D scale length theory
so reduced-voltage operation will not buy much relief. Figure 4 shows the essential 2D aspects of a short-channel
Although the gate leakage current may be at a level that MOSFET [7]. A key parameter is the gate depletion
is negligible compared with the on-state current of a width, W d , within which the mobile carriers (holes in the
device, it will first have an effect on the chip standby case of n-MOSFETs) are swept away by the applied gate
power. Note that the leakage power will be dominated by field. The gate depletion width reaches a maximum, W dm ,
turned-on n-MOSFETs, in which electrons tunnel from at the onset of strong inversion (threshold voltage) when
the silicon inversion layer to the positively biased gate the surface potential (s ) or band bending is such that
(Figure 3 inset). Edge tunneling in the gate-to-drain the electron concentration at the surface equals the hole
overlap region of turned-off devices should not be a concentration in the bulk substrate. This is the standard
fundamental issue, since one can always build up the s ⫽ 2B condition, with B ⫽ (kT/q)ln(N a /n i ), where N a
corner oxide thickness by additional oxidation of is the substrate doping concentration and n i the intrinsic
polysilicon after gate patterning. p-MOSFETs have a carrier concentration of silicon. For uniformly doped
much lower leakage than n-MOSFETs because there cases,
冑
⫹
are very few electrons in the p polysilicon (“poly”) gate
4si kT ln共Na/ni兲
available for tunneling to the substrate, and hole tunneling Wdm . (3)
has a much lower probability. If one assumes that the total q 2 Na
2
active gate area per chip is of the order of 0.1 cm , the A rectangle is formed by the boundary of the gate
maximum tolerable gate leakage current will be of the depletion region, the gate electrode, and the source
2
order of 10 A/cm . This sets a lower limit of 1.0 –1.5 nm and drain regions, as depicted in Figure 4 [7]. Two-
for the gate-oxide thickness. Dynamic memory devices dimensional effects can be characterized by the aspect
have a more stringent leakage requirement and therefore ratio of this rectangle. When the horizontal dimension,
must impose a higher limit on gate-oxide thickness [6]. i.e., the channel length, is at least twice as long as the
Another issue with the thin gate oxide is the loss of vertical dimension, the device behaves like a long-channel
inversion charge and therefore transconductance due to MOSFET, with its threshold voltage insensitive to channel
inversion-layer quantization and polysilicon-gate depletion length and drain bias. For channel lengths shorter than
effects [2]. Quantum mechanics dictates that the density of that, the 2D effect becomes significant, and the minimum
216 inversion electrons peaks at approximately 1 nm below surface potential (s ) which determines the threshold
Y. TAUR IBM J. RES. & DEV. VOL. 46 NO. 2/3 MARCH/MAY 2002
voltage is increasingly controlled more by the drain than
by the gate. 10
20 nm SiO2
The rectangular box consists of a silicon region of
condition, si x,si ⫽ ox x,ox , where si , ox are the
6
5 nm
permittivities of silicon and oxide, respectively. For
lateral fields ( y ) tangential to the interface, the
4
Wdm 10tox
boundary condition is y,si ⫽ y,ox , independent of m 1.3
2
the dielectric constants. By properly matching the
boundary conditions of both components of electric
0
fields at the silicon–insulator interface, one can derive 0 5 10 15 20
a scale length that is a solution to the following Depletion depth in Si (nm)
equation [8]:
si tan共
tox/ 兲 ox tan共
Wdm/ 兲 0. (4) Figure 5
Constant scale length contours (solid lines) in a tox–Wdm design
The scale length also goes into the length-dependent
plane, assuming SiO2 or Si /ox 3. The dotted line marks the
term of the maximum potential barrier in a MOSFET of boundary on which the ideality factor m equals 1.3. The intercepts
channel length L, i.e., ⌬s (SCE) ⬀ exp(⫺
L/ 2 ) [7]. The represent design points which satisfy both the scale length and the
ratio L/ is a good measure of the strength of the 2D subthreshold slope requirements.
effect. For the short-channel V t roll-off and the drain-
induced barrier lowering (DIBL) to be acceptable, the
above exponential factor must be much less than 1. This 25-nm CMOS design
means that the minimum useful channel length is about The discussions in Section 2 make it clear that CMOS
1.5–2.0 times [4]. Note that the above scale-length devices below 0.1 m will have increasing leakage
equation is valid for the high-k gate insulator as well. currents, declining performance gains, and higher chip
One simply replaces ox and t ox with i and t i , where i power. Nevertheless, with proper control of the doping
is the permittivity of that insulator and t i its thickness. profile, the limit of CMOS scaling can be extended to
The lowest-order solution to the above equation is 20-nm channel length without strict scaling of oxide
plotted in Figure 5 in the form of constant- contours in a thickness and power-supply voltage (Figure 1).
t ox –W dm design plane. In addition to the 2D scale-length An optimum design for 20-nm MOSFET calls for a
requirement, the ratio between t ox and W dm must also be vertically and laterally nonuniform doping profile, the
kept small in order for the inverse (log) subthreshold superhalo [9], to control the short-channel effect. Figure 6
current slope [Equation (1)] [4], shows such a doping profile, along with simulated
冉 冊
potential contours for a 25-nm MOSFET [9]. Halo doping,
kT si tox kT
S m(ln 10) 1 共ln 10兲 , (5) or nonuniform channel profile in the lateral direction, can
q oxWdm q
be realized by angled ion implantation self-aligned to the
to be close to the ideal (ln 10)kT/q value, or 60 mV/decade. gate, with a very restricted amount of diffusion. The highly
Here m is usually referred to as the ideality factor which nonuniform profile sets up a higher effective doping
measures the gate-voltage swing required per unit of change concentration toward shorter devices, which counteracts
in the electron potential (or band bending) at the silicon short-channel effects. With this design, the simulated
surface. A reasonable upper limit is t ox /W dm ⫽ 0.1, or off-currents are insensitive to channel-length variations
m ⫽ 1.3, as indicated by the dotted curve in Figure 5. between 20 and 30 nm, as shown in Figure 7. In terms of
This gives a long-channel inverse subthreshold slope of the threshold-voltage sensitivity to channel-length variations,
⬇80 mV/decade. The intercepts of the dotted curve with the superhalo profile extends the scaling limit by a factor
the constant- contours lie in a region where the vertical of nearly 2. For example, considering a criterion of
fields dominate, and ⬇ W dm ⫹ (si /ox )t ox , obtained by ⌬V t 50 mV for ⌬L/L ⫽ 30%, the superhalo profile
replacing the oxide region with an equivalent silicon region of in Figure 6 ( ⬇ 17 nm) allows for L min ⫽ 20 nm ⬇ 1.2 ,
thickness (si /ox )tox [7]. The design points, or the intercepts, while a non-halo MOSFET can be scaled to only
can then be solved as tox ⫽ (1 ⫺ 1/m)(ox /si ) . This means L min ⬇ 2 against the same criterion. From another
that for L min ⬇ (1.5–2.0) and m ⬇ 1.3, the oxide perspective, because of the flat V t dependence on channel
thickness required is t ox ⬇ L min /20 to L min /25 [4]. length, the superhalo profile permits a nominal device to 217
IBM J. RES. & DEV. VOL. 46 NO. 2/3 MARCH/MAY 2002 Y. TAUR
This points to a way out of the high-resistance problem
Gate associated with very shallow extensions [10]. The lateral
tox source– drain gradient, however, is much more critical.
25 nm
1.5 nm The short-channel effect degrades rapidly once the profile
Source is more graded than 4 –5 nm/decade. This is because the
Drain
channel length is largely determined by the points of
n-type: 25 nm current injection from the surface layer into the bulk,
2.0 1019 which takes place at a source– drain doping concentration
1.5 1019
1.0 1019
p-type:
1.2 V of about 2 ⫻ 10 19 cm ⫺3 [11]. Any source– drain doping
5 1018 0.8 V that extends beyond this point into the channel tends to
0 1.0 1019 0.4 V compensate or counterdope the channel region and
0 aggravate the short-channel effect. The abruptness
5 1018 0.4 V
requirements of both the source– drain and halo doping
profiles dictate absolutely minimum thermal cycles after
the implants. Note that a raised source– drain structure
Figure 6 may help in making contacts, but does not by itself satisfy
Source, drain, and superhalo doping contours in a 25-nm n- the abruptness requirement discussed here.
MOSFET design. The channel length is defined by the points at One of the concerns with the high p-type doping level
which the source–drain doping concentration falls to 2 1019 cm3.
and narrow depletion regions in Figure 6 is the band-to-
The dashed curves show the potential contours for zero gate
voltage and a drain bias of 1.0 V. 0 refers to the midgap energy band tunneling through the high-field region between the
level of the substrate. Reprinted with permission from [9]; ©1998 p-halo and the reverse-biased drain. For a drain voltage of
IEEE. 1 V, the highest field at the heavily doped halo region is
estimated to be 1.7 MV/cm. According to the published
data, the band-to-band tunneling current density of the
drain-to-substrate junction for this field magnitude is of
0.5
the order of 1 A/cm 2 [9]. This should not constitute a
0.4 Retrograde
Vds 1.0 V major component of the device leakage current, given
0.3
Threshold voltage (V)
Y. TAUR IBM J. RES. & DEV. VOL. 46 NO. 2/3 MARCH/MAY 2002
charge needed for the next stage. These two effects tend
to cancel each other. For the heavily loaded case in which 101
the devices drive a large fixed capacitance, the delay L 25 nm
IBM J. RES. & DEV. VOL. 46 NO. 2/3 MARCH/MAY 2002 Y. TAUR
101
Vt Vfb 2m B 共m 1兲Vbs ⬇ 共m 1兲共Eg Vbs兲. (6)
10
Y. TAUR IBM J. RES. & DEV. VOL. 46 NO. 2/3 MARCH/MAY 2002
Acknowledgment on VLSI Technology, Systems, and Applications, Taipei,
The author would like to thank David Frank, Bob Taiwan, June 1999, pp. 6 –9.
18. Y. Naveh and K. K. Likharev, “Modeling of 10-nm-Scale
Dennard, and Paul Solomon for many stimulating Ballistic MOSFETs,” IEEE Electron Device Lett. 21, 242
discussions. (2000).
IBM J. RES. & DEV. VOL. 46 NO. 2/3 MARCH/MAY 2002 Y. TAUR
Yuan Taur Department of Electrical and Computer
Engineering, University of California at San Diego, La Jolla,
California 92093 (taur@ece.ucsd.edu). Dr. Taur received the
B.S. degree in physics from the National Taiwan University,
Taipei, Taiwan, in 1967 and the Ph.D. degree in physics from
the University of California, Berkeley, in 1974. From 1975
to 1979, he worked at NASA, Goddard Institute for Space
Studies, New York, on low-noise Josephson junction mixers
for millimeter-wave detection. From 1979 to 1981, he worked
at Rockwell International Science Center, Thousand Oaks,
California, on II–VI semiconductor devices for infrared
sensor applications. From 1981 to 2001, Dr. Taur was with
the Silicon Technology Department of the IBM Thomas
J. Watson Research Center, Yorktown Heights, New York,
where he was Manager of Exploratory Devices and Processes.
Areas in which he has worked and published include latchup-
free 1-m CMOS, self-aligned TiSi2 , 0.5-m CMOS and
BiCMOS, shallow-trench isolation, 0.25-m CMOS with n ⫹ /p ⫹
polysilicon gates, SOI, low-temperature CMOS, and 0.1-m
CMOS. Since October 2001, he has been a professor in the
Department of Electrical and Computer Engineering of the
University of California at San Diego. Dr. Taur was elected
a Fellow of the IEEE in 1998. He is currently the Editor-in-
Chief of the IEEE Electron Device Letters. He has served on
the technical program committees and as a panelist for the
Device Research Conference and the International Electron
Devices Meeting, and as Program Chairman for the
Symposium on VLSI Technology. Dr. Taur has authored or
co-authored more than 120 technical papers; he holds 11 U.S.
patents. He is profiled in the 2001 edition of Who’s Who in
the World. Dr. Taur received four IBM Outstanding Technical
Achievement Awards and six IBM Invention Achievement
Awards during his IBM career. He co-authored the book,
Fundamentals of Modern VLSI Devices, published by
Cambridge University Press in 1998.
222
Y. TAUR IBM J. RES. & DEV. VOL. 46 NO. 2/3 MARCH/MAY 2002