Professional Documents
Culture Documents
chp3A10.10072F978 3 540 30205 6 - 5
chp3A10.10072F978 3 540 30205 6 - 5
net/publication/220799687
CITATIONS READS
33 9,472
3 authors, including:
All content following this page was uploaded by Domenik Helms on 01 July 2016.
1 Introduction
The three sources of power dissipation in digital CMOS circuits can be summa-
rized in the following equation:
2
Pavg = Pshort + Pswitch + Pstatic = Isc Vdd + αCL Vdd f + Ileak Vdd (1)
Between 1970 and 1980, the power consumption of a high-end processor rose
from 0.2W to 2W . Power was mainly consumed in the NMOS technology of
those days, since half of all transistors short-circuited their output to pull it
down to logic 0. In the early 1980’s, the first power-aware technology, the CMOS
technology was introduced. It was able to reduce the power requirements to levels
that much less complex systems some 10 years ago consumed. Static power Pshort
as the major contributor to the systems power consumption seemed to have been
removed for good.
A small remaining Pshort in CMOS logic is due to the direct path short
circuit current Isc , which occurs when both the NMOS and PMOS transistors
are simultaneously active, conducting current directly from supply to ground.
The second term of Equation 1, Pswitch , refers to the dynamic component of
power, where CL is the load capacitance, f the clock frequency and α the node
transition activity factor. Pswitch , which is dominating power consumption in
today’s circuits has been reduced by various techniques like voltage and threshold
voltage scaling since then.
It is exactly these techniques which lead to another change in power consump-
tion dominance. Prognoses [1] assume that in the middle of this decade static
effects will again become the most dominant source of power consumption.
E. Macii et al. (Eds.): PATMOS 2004, LNCS 3254, pp. 17–35, 2004.
c Springer-Verlag Berlin Heidelberg 2004
18 D. Helms, E. Schmidt, and W. Nebel
As analyzed in [2, 3, 4], leakage currents can be divided into 6 major contrib-
utors which will be reviewed in Section 2. In Section 3, we discuss the impact of
PTV1 variations on leakage currents. Section 4 presents existing work on high-
level leakage modeling under PTV variations and Section 5 introduces leakage
optimization techniques. Finally, Section 6 concludes this work.
1
Process, thermal and voltage variations
Leakage in CMOS Circuits – An Introduction 19
In Figure 2 (left) the drain to source current, which is shown mainly depends
on the gate voltage. The curves can be separated at the threshold voltage into a
linear rising and a saturating, almost constant region. Even if the gate voltage
is low - far below this threshold voltage - the current through the channel is not
exactly 0. This is because a transistor is not a simple switch but in contrast to a
high-level view it is a complex analog component. A potential difference between
source and drain will thus result in a current through the channel which is called
subthreshold current. This is the best-known kind of leakage and it also has
the highest impact on the overall static power consumption at least for recent
technologies [5].
Fig. 2. Left: Drain to source current versus gate voltage for different drain voltages
taken from [4]. The drain voltage’s effect on drain-source current can be explained by
second order effects like DIBL, reducing the threshold voltage and GIDL, decreasing the
junction’s barrier. Right: Plot of Equation 3: subthreshold current versus temperature
for different threshold voltages.
20 D. Helms, E. Schmidt, and W. Nebel
where
Cdm qsi Nch
n=1+ and K = µ0 (3)
Cox 2Φs
are technology parameters, W and L are the channel’s width and length, and Vds
is the drain-source, Vgs is the gate source voltage. VT = kB T /q is the thermal
voltage, which is a direct proportional measure to the absolute temperature, as T
is the temperature, kB is the Stefan-Boltzmann-constant and q is the elementary
charge. µ0 is the zero bias mobility, Nch is the effective channel doping, ΦS is
the potential of the surface and Cox and Cdm are the capacitances of the oxide
layer and the depletion layer. From Equation 3 we can immediately derive the
subthreshold current’s dependencies to different technological and dynamical
parameters. As can be seen in Figure 2 (right) where Equation 3 is evaluated for
different temperatures and threshold voltages, the subthreshold current varies by
several orders of magnitude as threshold voltage and temperature just differ by a
few ten percents. For upcoming technologies Figure 2 determines the following:
as the threshold voltage continues to be scaled down, the temperature’s influence
is decreased. In order to analyze this behavior in detail we evaluate the deviations
∂/∂Vth ln (Isub ) and ∂/∂T ln (Isub ), assuming independence between Vth and VT
and a high drain-source voltage Vds .
−1
∂ ln (Isub ) nkB T
=− (4)
∂Vth q
−1
∂ ln (Isub ) nkB T 2 1
= (5)
∂T 2 q (Vth − Vgs ) + nkB T
Using Equations 4 and 5 and assuming Cdm = 2Cox ⇒ n = 3, the importance
of temperature and threshold voltage can be evaluated as presented in Table 2.
Since subthreshold leakage is the most important leakage mechanism in re-
cent technologies, it is worthwhile analyzing effects that influence subthreshold
π Nsub Xd2
Vth = Vf b + ΦS + q
2 Cox W
λXd
+ γ ΦS − Vs 1 −
Lef f
+ VDS e−L/2lc + 2e−L/lc .
Narrow Width Effect and Vth Roll-Off. In this simplified equation (ref.
[10] for more details), the first line describes the static component of threshold
voltage, with Nsub being the substrate doping and Xd being the width of the
depletion zone, Vf b is the flat-band voltage, the voltage at which there is no
electrical charge in the semiconductor. Xd is the thickness of the depletion layer,
γ is the body factor, ΦS is the surface potential and λ ≈ 1 is a fitting parameter.
In addition, Vth can be reduced by Vth roll-off due to an effective length reduction
of the channel. In short channel devices this simple relation is changed by a high
influence of the source and drain depletion regions to the channel’s potential
profile [6].
Body Effect. The body effect is described by the second line of the threshold
equation. Without going into detail [4], it can be linearized to a source-bulk
voltage dependent threshold voltage shift
We can see that via the body effect, the source-bulk voltage can decrease the
subthreshold current by increasing the subthreshold voltage. As will be described
in Section 3.5, this is the main reason for data dependent leakage in CMOS
circuits, called stacking effect.
Drain Induced Barrier Lowering DIBL. The last line shows a drain-source
voltage dependent shift ∆DIBL (Vth ) where
Si Tox Xd
lc = (7)
ox η
Table 3. Prediction of the development of the subthreshold current Isub , the BTBT-
current Ipn−jun and the gate tunneling current Igate as an approximation derived
from [1, 10]
Gate Tunneling. Gate tunneling is the effect of carriers having a finite proba-
bility tunneling in or through the SiO2 layer, isolating the gate from the channel.
When the gate voltage is high, the carriers inside the gate will see an energy band
potential as presented in part a) of Figure 3 which is easy to handle quantum
mechanically. Due to the gate bias, the conduction band of the oxide layer is bent
to a trapezoidal shape4 as presented in b) of Figure 3. Depending on the barrier
height Φox of the oxide layer and the gate bias Vox , only one of two possible gate
tunneling mechanisms will occur:
2
Except for ultra-low power devices like hearing aids and devices with a very low
average switching activity like memories
3
The dominance of the gate leakage may be avoided due to the introduction of high-k
materials (ref. Section5.1).
4
Of course also the energy bands of the gate and the channel are bent, but this can
be ignored for the sake of simplicity
Leakage in CMOS Circuits – An Introduction 23
Fowler-Nordheim Tunneling. In the case of Vox > Φox the carriers need
not have to tunnel through the entire oxide layer as shown in part c) of Figure
3. Instead, the carriers just have to pass a triangular energy potential of the
reduced width TF N < Tox . Fowler-Nordheim tunneling carriers pass directly
from the semiconductors conduction band to the insulators conduction band.
Knowing Φox and the field across the insulator Eox , the total FN-tunneling
current Igate−f n can be calculated as [6]
√ 3/2
Lg Wg q 3 Eox
2
4 2m Φox
β
Igate−f n = αe with α = , β = − , (8)
16π 2 h̄Φox 3h̄qEox
where Lg and Wg are the effective gate’s length and width subjected to FN-
tunneling, q is the carriers charge, h̄ is Planck’s constant over 2π and m is the
effective mass of a carrier in the conduction band of silicon. But as for SiO2
Φox = 3.1eV , FN-tunneling is nearly negligible as usually Vox < Φox .
Hot Carrier Injection HCI. In the pinch-off region of the channel, large
voltage drops result in high electric fields. Mobile carriers in high electric fields
may lead to leakage losses by a variety of mechanisms [13] as shown in Table 4.
These high-energy carriers are called hot-carriers. Carriers with energies above
the ionization threshold of 1.3eV are able to strike out electrons from their
orbit, thus producing electron hole pairs with arbitrary directions, which is called
impact ionization. The energy of some of the carriers moving towards the gate-
insulation is high enough to pass the insulator barrier. Since the barrier height
of SiO2 for electrons (≈ 3.1eV ) is lower than that for holes (≈ 4.8eV ), hot-
electron injection is more likely to occur than hot-hole injection. In addition, as
the mobility of holes inside the SiO2 barrier is lower than that of the electrons,
more holes get trapped in the insulator before reaching the gate.
CarrierEnergy M echanism
E > 1.12eV Light emission
1.3eV < E < 1.8eV Impact ionization
E > 3.1eV Hot-electron injection
E > 4.8eV Hot-hole injection
are constants. In order to handle the complex geometry of this problem, average
effective electric fields for the junction below the source or drain Ebtm and for the
junction between source and drain Eside as effective height Yj−ef f and length
Xj−ef f of the junction was introduced (refer [8] for the details).
Gate induced drain leakage GIDL describes the influence of the gate-field on
the depletion layer of the drain. The higher the gate voltage, the thinner the
depletion layer becomes in the gate-drain overlap area. According to [15], GIDL
currents are carried by an increased band-to-band tunneling probability which
is given by
26 D. Helms, E. Schmidt, and W. Nebel
IGIDL = A · Es (Vgs ) exp − Es (V
B
gs )
(14)
1/2 1/2
2πq 2 mef f πmef f Σg
where A = 3/2 and B = √ , (15)
9h̄2 ΣG 8qh̄
where Es is the field of the surface, which is dependent on the gate-source voltage,
approximately given by
Vg − 1.2V
Es ≈ . (16)
3Tox
2.5 Punchthrough
3 Impacts on Leakage
In the last section, we reviewed what leakage is from a transistor level point of
view. We have seen that all these leakage sources depend on parameters - static
ones like gate length L as well as dynamic ones like temperature. From the
system level point of view, we can identify several processes having a high im-
pact on leakage currents. In this section, we will review the most prominent ones
which are process-temperature-voltage variations (PTV variations) and data de-
pendency, starting with leakage dependence on process variations. Then we will
discuss variations of the two most important dynamical parameters, temperature
and power supply. In the end we will discuss aspects of data-dependent leakage.
Inter-die variations, also called die-to-die variations, are the collection of effects
on leakage due to variation of circuit parameters that vary from die to die. Vari-
ations in the coating, doping and etching processes will lead to slight variations
in structure sizes and doping profiles. In [16], the leakage power due to these
variations was reported to vary by a factor of 20 between the best and the worst
die on a waver.
As inter-die variations describe the variation of the total leakage power of
different dies, they cannot be used to forecast a single die’s leakage power. But
statistical approaches [17], predicting average leakage current and variation, at
least allow a prediction of the yield-loss due to too high leakage power.
Leakage in CMOS Circuits – An Introduction 27
The same approach can be used (starting with another equation) for other
sources of leakage, always leading to a prediction of the influence of parameter
variations to the power consumption distribution.
Table 5. Example of the data dependency of the leakage currents of the NAND and
the NOR gate.
Inputs Leakage
power [pA]
AB NAND NOR
00 10 308
01 173 540
10 304 168
11 544 112
Table 6. Example of the minimal and maximal leakage current of various circuits
dependent on the input vector.
But as reported in [23], the data dependency of bigger systems seems to even
out this effect as presented in Table 6.
As can be seen, the relative standard deviation σ is below 6.1% for all an-
alyzed circuits. Thus, for high-level leakage analysis and optimization, data-
dependencies are not the most necessary topic. Nevertheless, there are approa-
ches that try to handle the components leakage dependency (ref. Section 4.1).
authors tried to separate every physical effect in order to make the remainder
as invariant as possible. Ileak is defined as
−Vdd
−|V |−V
W b(Vdd −Vdd0 ) th of f
Ileak = µ0 · COX · ·e · VT2 · 1 − e VT · e n·VT , (22)
L
which is the subthreshold leakage equation. The low level parameters can
be derived using transistor level simulations. Vdd is the current supply voltage,
Vdd0 is the nominal default supply voltage which is technology dependent. The
parameters µ0 , COX , W/L and Vdd0 are absolutely invariant with all parameters
in this approach and only depend on the technology used. In contrast, the DIBL
factor b, the subthreshold swing coefficient n and Vof f are derived from a curve
fitting on measurements based on transistor level simulations. Vdd , Vth and Vt
are calculated dynamically during the estimation [26].
While scaling down devices, the parameters of a transistor have to obey the rules
of constant field scaling in order to control the performance, the reliability and
the power consumption of a device: starting at a gate length scaling factor of
k, Tox and junction thickness have to be scaled the same way in order to keep
short channel effects (SCE) under control. Down-scaling VDD is then necessary
to keep fields constant in order to prevent high field effects like hot-electron
effects. Since VDD reduction would immediately reduce performance, Vth has to
be scaled as well too to compensate.
This way, nearly every degree of freedom of choice of parameters is fixed. The
remainder of leakage optimization on transistor level is mainly due to modifying
the doping profile and the dielectricity of the oxide [4].
Well Engineering. There are two doping techniques, mainly influencing leak-
age power. Halo doping means increasing the doping level of the p-substrate
close to the inversion layer. It was introduced in order to limit the influence of
the channel length on the threshold voltage. Thus, halo doping also limits drain
induced barrier lowering by ’guarding’ the ends of the channel, lowering drain’s
and source’s influence.
Retrograde doping means changing the doping profile of the bulk in vertical
direction: Directly under the oxide layer the doping concentration is lowered and
deeper towards the bottom it is increased. Retrograde doping was introduced in
order to shift the channel away from the impurities in the oxide interface thus
reducing scattering at impurities. This technique increases threshold voltage and
thus reduces subthreshold currents, but the high-p doped layer also acts as a
punchthrough protection ’guarding’ the bottom of the channel.
32 D. Helms, E. Schmidt, and W. Nebel
As seen in Section 3.5, the subthreshold leakage of a gate differs markedly for
different input combinations due to the stacking effect. Without the need for a
special low leakage technology, this can be used in two different ways:
At first, idle components can be set to a low leakage state. Finding low
leakage states can be done by testing all 2n states for an n input component or
by applying heuristic approaches [21]. Using state assignment, 5-30% of leakage
power can be saved during idle time, where the overhead for generating this state
as for switching to this state is not taken into account.
Another way of using stack effect is called power gating and is done by
inserting a sleep-transistor in series to the CMOS gate [28]. By doing power
gating, 35-90% of the leakage can be saved in idle phases but at the cost of a
5-60% performance loss.
Assuming that all possible techniques presented in Section 5.1 are performed
resulting in transistors with the lowest possible leakage under performance con-
straints, we have to increase the level of abstraction in order to further reduce
leakage power.
There are several techniques reported which all work by identifying avail-
able slack in the gate net-list and then assigning lower performance components
with better power behavior to the gates, having sufficient slack that the timing
constraints of the circuit are not violated.
These approaches differ in the way the performance-power tradeoff are biased,
the way they performed and the way the gates are replaced.
Multiple Vdd . Vdd reduction was proposed quite early because it is also a very
powerful dynamical power countermeasure. Since the total leakage power results
as a product of supply voltage and leakage current Pleak = Ileak Vdd , leakage
depends at least linearly on the supply voltage. But also potential dependent
leakage effects like DIBL are reduced. [29] reports a reduction in the order of
Psubth ∝ Vdd
3
and Pgate ∝ Vdd
4
.
Multiple voltage techniques have highest impact on the power consumption,
but the overhead for supplying several voltages to the gates and the need of
introducing level shifters make it more inconvenient to use.
As we have seen in Section 5.1, by using high-k insulators gate leakage can be
sufficiently reduced by a design modification that is very simple to understand,
even though it is not that easy to really perform. Gate leakage is one of the two
most important leakage sources - at least for the next 5 years. The other one, the
subthreshold current, can also be reduced by orders of magnitude by a technique
that again is simple to understand but maybe not that easy to perform:
Since small temperature differences lead to huge leakage variations, aggressive
cooling techniques up to cryogenic cooling can be applied.
Cooling techniques can strongly control subthreshold current by reducing it
by several orders of magnitude.
6 Conclusion
Leakage power will become a major issue for upcoming technologies. Leakage
analysis will become more complex as different leakage sources like gate leak-
age and band-to-band tunneling gain influence on the total power. A complete
framework for high-level leakage estimation may be available soon as this is one
major recent research concern. High-k materials and aggressive cooling may be
7
This can easily be done for silicon on insulator SOI techniques. For conventional
techniques multiple well techniques have to be applied in order to allow varying
bulk voltages
34 D. Helms, E. Schmidt, and W. Nebel
the solution to leakage problems for future technologies, but the recent and up-
coming technologies have to be controlled by more conventional, less invasive
optimizations. As on lowest levels a lot of optimization was proposed, higher
level approaches addressing leakage are missing so far.
References