Professional Documents
Culture Documents
723-729
Manuscript received February 14, 2019; revised September, 2019. DOI: 10.24425/ijet.2019.130253
Abstract—A novel dual mode logic (DML) model has a superior design [13], multi-threshold voltages [14] and dual supply
energy-performance compare to CMOS logic. The DML model has voltages [15].
unique feature that allows switching between both modes of
operation as per the real-time system requirements. The DML
functions in two dissimilar modes (static and dynamic) of operation
with its specific features, to selectively obtain either low-energy or
high-performance. The sub-threshold region DML achieves
minimum-energy. However, sub-threshold region consequence in
performance is enormous. In this paper, the working of DML
model in the moderate inversion region has been explored. The
near-threshold region holds much of the energy saving of sub-
threshold designs, along with improved performance.
Furthermore, robustness to supply voltage and sensitivity to the
process temperature variations are presented. Monte carol
analysis shows that the projected near-threshold region has
minimum energy along with the moderate performance.
Fig 1. Optimized operating region of a pareto-optimal design curve
Keyword—CMOS logic, dual mode logic, dynamic mode, high
performance, minimum energy point, near-threshold The DML models that have been proposed recently can
switch between dynamic and static modes through an
I. INTRODUCTION asymmetrical clock based on the real-time system requirement.
As a result, DML model achieves enhanced trade-off between
O VER the last four decades, enrichments in research are
being used to obtain a design optimization in the logic
circuit to achieve minimum-energy point (MEP). However,
power dissipation and performance. In dynamic mode of
operation, DML model attains very high speed though increased
power consumption. In static mode, low supply voltage DML
circuit performance becomes an issue, it needs to come out of model achieves lower power dissipation along with modest
minimum-energy point in order to achieve a significant performance [16], [17].
improvement in high performance [1]. Nowadays, researchers In this paper, we explore the dual mode design in near-
are targeting a unique design under the lower technologies to threshold operation to accomplish optimized power dissipation
optimize energy and the speed, a pareto-optimal design curve and propagation delay. Various benchmark circuits have been
for optimized operating point (OOP) is as shown in Fig. 1. It taken, such as standard logic gates, multiplexer, chain of
exclusively describes the minimum delay required for a given inverters and full adder carry cell. These circuits are realized
energy (and vice versa) over a set of design parameters such as with cadence virtuoso simulator, at super-threshold and near-
the threshold, supply voltage and size of a transistors [2]–[6]. threshold regions. Furthermore, Monte carol analysis has been
Since last 30 years, the device parameters have been scaled executed to find out the standard deviation σ (sigma) for
down to achieve lower power consumption. However, scaled threshold voltage variations.
technologies reduces the device parameters like supply voltage, This paper has been divided into five sections. The brief
threshold voltage and gate oxide thickness. As a result, the introduction part of the paper is followed by an elaborate
scaled technology suffers from severe problems. Firstly, discussion of near-threshold design in section 2. The DML
performance is reduced due to threshold variations at low power design and principle of operation are presented in Section III. In
supplies. Secondly, a low-Vth device increases the sub-threshold Section IV, near-threshold DML circuit simulation set-up,
leakage current exponential. It increases by ten times for every results are discussed. Finally, the conclusions are presented in
0.1-volt reduction of VTH [7], [8]. As device parameters scale Section V.
down, leakage current in a sub-micron region turn out to be
significant and comparable with dynamic power dissipation.
II. NEAR-THRESHOLD MODEL4
Further, scaling of supply voltage leads to performance
degradation, less robustness to temperature variations and Energy efficiency can be obtained when CMOS designs
process deviation [9]–[11]. High performance can be operate at near the threshold voltage. Based on the transistor
accomplished only at higher supply voltages. However higher ON-current, a propagation delay model has been obtained. This
supply voltages leads to increased power consumption [12]. model has been applied in DML designs to determine the
These problems can be solved through a novel mixed-mode minimum-energy point [18], [19].
In modern CMOS logic, energy consumption mainly analysis is homogeneous to the Calhoun [28] and Zhai [29]
depends on charge and discharge of inner node capacitances, which is a superior model for OFF-current. The total energy is
which can be minimized by scaling of supply voltage. However, 𝐸𝑇𝑜𝑡 = 𝐸𝑑𝑦𝑛 + 𝐸𝑙𝑒𝑎𝑘 (7)
the lower limit of the supply voltage in commercial applications The dynamic energy is
is usually around 70% of the nominal supply voltage. This lower 2
𝐸𝑑𝑦𝑛 = 𝐶𝑑𝑦𝑛 𝑉𝐷𝐷 (8)
limit is dictated by concerns about performance and robustness where 𝐶𝑑𝑦𝑛 is actual switching capacitance of the complete
loss [20], [21]. In a wide range of voltage scaling techniques, circuit. The leakage energy is
determining an optimal region in the energy per operation is 𝐸𝑙𝑒𝑎𝑘 = 𝐼𝑜𝑓𝑓 𝑉𝐷𝐷 𝑇𝐶 (9)
crucial for achieving desired optimized energy and delay. In Fig.
𝐼𝑜𝑓𝑓 is the leakage current when Vgs = 0 and Vds = VDD. Calhoun
2, the energy-delay optimized curve has been shown. The super-
threshold region (VDD > VTH), energy depends highly on the [28] approximated Ioff as Ion at VDD = 0.
ɳ𝑉𝐷𝑇
supply voltage and it rises exponentially due to switching 𝐼𝑜𝑓𝑓 = 𝐼1 𝑊𝑒𝑓𝑓 𝑒 𝑛𝑉𝑇 (10)
energy. Therefore, voltage scaled down to the near-threshold Finally, the total energy of the circuit is
region (VDD ~ VTH) produces energy saving in the order of 10X ɳ𝑉𝐷𝑇(1−ɳ) −𝑎𝑉2
𝐷𝑇
at the cost of around 10X performance loss. However, scaling 𝐸𝑇𝑜𝑡 = 2
𝐶𝑑𝑦𝑛 𝑉𝐷𝐷 [1 + 𝑅𝑒 𝑛𝑉𝑇 ] (11)
down further to the sub-threshold region (VDD < VTH) leads to
an exponential increase in propagation delay, source more With 𝑅 = 𝐶𝑙𝑒𝑎𝑘 ⁄𝐶𝑑𝑦𝑛 it can be shown that the minimum energy
leakage current. The increased leakage current dominates any point can be obtained by differentiating energy with respect to
reduction in switching energy and turn out to be more significant VDD. The sub-threshold region, leakage and delay have more
[22]–[24]. A simple experimental trans-regional expression for sensitivity to the threshold (Vt), so the minimum-energy region
ON-current and delay in the NTC region is as follows. In the is autonomous of Vt. Nevertheless, in near-threshold operation,
saturation region with VDT > 0, Ion is estimated with the α-power equation (11) shows that increasing Vt reduces the leakage more
law model [25]. than it rises. Therefore, the minimum-energy region ensues at
∝
𝐼𝑜𝑛 = 𝑘1 𝑤𝑉𝐷𝑇 (1) the lower supply voltage.
where Ion is the current at Vgs = Vds = VDD. ON-current depends
on (VDD – |Vt|), which can be denoted by VDT, k1 is process
parameter, W is width of a transistor, α is velocity saturation
index (approximately 1.3 – 1.5 nm). In the sub-threshold region
with VDT < 0, Ion is estimated with an exponential model [26].
𝑉𝐷𝑇
𝐼𝑜𝑛 = 𝐼0 𝑤𝑒 𝑛𝑉𝑇 (2)
where I0 is the current per unit width at VDT =0, VT is the thermal
voltage (kT/q, 29.6 mV at 70 ºC), and n is the sub-threshold
slope factor. Markovic proposes for a near-threshold current
expression derived from EKV [18] is
(1+𝜂)(𝑉𝐷𝐷 +𝑉𝑡 ) 2
2𝑛𝐶𝑜𝑥 𝑊
𝐼𝑜𝑛 = 𝑉𝑇2 [𝑙𝑛 (𝑒 2𝑛𝑉𝑇 + 1)] (3)
𝑘𝑓𝑖𝑡 𝐿
the DML gate is to operate in two efficient functional modes by Hence, the sub-threshold design has been degraded to niche
gaining the advantage of both power dissipation and markets due to its key performance consequences. Therefore,
performance. The DML design is divided into two types, the DML designs are analyzed in the near-threshold region in
namely DMLA model and DMLB model. DMLA model has an order to optimize the energy and performance.
extra PMOS transistor associated in parallel to pull-up network,
A. Benchmark Circuits
in between power supply and output node, which precharge the
output node to high voltage (logic one). DMLB model has an Functioning of DML circuits in near-threshold region has
extra NMOS transistor connected in parallel to pull-down been discussed. Various logic benchmark circuits have been
network, during the precharge period, which precharge output verified such as basic NAND and NOR gates with 2 and 3
node to a logic low [16]. inputs, 2 to1 multiplexers, and sequence of four inverters. A
sequence of four inverters is designated as an inverter is
VDD
commonly used in digital circuits. The inverter transistors size
In Standard
CMOS
is chosen to have equal fall and rise times in each stage. To
Design Out implement these designs, cadence virtuoso simulator has been
Clk
M1 used with gpdk 45 nm. The near threshold design function has
M1 the supply voltage at 600 mV, which is close to the threshold
Clk
voltage of a transistor. Basic CMOS and DML designs were
In Standard compared in terms of the power dissipation and the propagation
CMOS
Design Out delay times, which are shown in Fig. 4. In all the designs, DML
dynamic model has lower propagation delay and the near-
(a) (b)
threshold design increases the delay time as compared to
conventional design. However, near-threshold DML has
Fig 3. DML basic block diagram a) DMLA model and b) DMLB model reduced power as equated to conventional design.
DML gate operates in dynamic mode when the asymmetric
clock signal is given to gate terminal of the extra M1 transistor, 35 CMOS DML CMOS_nvt DML_nvt
providing two dissimilar modes: precharge and evaluation 30
mode. In precharge mode, a circuit output node is precharge to 25
logic high in DMLA model and to a logic low in DMLB model.
20
During the evaluation mode, the circuit output node is measured
power (nw)
IV. PROPOSED NEAR-THRESHOLD DML CIRCUITS A full adder is one of the examples of a classic complex
The DML circuits have been proposed in order to exploit the CMOS gate. As carry cell drives the sum output in a full adder,
advantages of the static and dynamic mode operations. DMLA and DMLB carry cells have been analysed in terms of
However, the DML design incurs certain propagation delay in leakage current, propagation delay, and power dissipation which
static mode and moderate power dissipation in dynamic mode. are shown in Fig. 5. DMLA design has low leakage and reduced
Unfortunately, the DML design operating in sub-threshold delay times with moderate power consumption as compared to
region has increased propagation delay and leakage power. DMLB carry cell design.
726 P. BIKKI
450 TABLE I
400 CDMLA CDMLB VARIOUS PARAMETERS OF THE PTM HIGH PERFORMANCE
350
16 nm 22 nm 32 nm 45 nm
300
250 Vtn (V) 0.47965 0.50308 0.49396 0.46893
Delay (ps)
200
Vtp (V) -0.43121 -0.4606 -0.49155 -0.49158
150
100 VDD (V) 0.7 0.8 0.9 1.0
50 0.95 1.05
tox_nmos (nm) 1.15 1.25
0
5 5,5 6 6,5 7 7,5 8 8,5 9 tox_pmos (nm) 1.0 1.1 1.20 1.30
VDD
(a)
15 60
Sigma (p)
10 40
20
5
0
0 45nm 32nm 22nm 16nm
5 5,5 6 6,5 7 7,5 8 8,5 9
VDD Technology
(b) (a)
1,8
CDMLA CDMLB 3 CMOS DML Dynamic DML Static
1,6
1,4 2,5
1,2
2
1
1,5
Power (nw)
0,8
Sigma (n)
0,6 1
0,4
0,5
0,2
0 0
5 5,5 6 6,5 7 7,5 8 8,5 9 45nm 32nm 22nm 16nm
VDD Technology
(c) (b)
Fig. 5. Carry cell, VDD vs (a) Propagation delay (b) Leakage power (c) Power
dissipation 140 CMOS DML Dynamic DML Static
120
minimum delay (ps)
increases the performance [33]. The main advantage of the 1200 CMOS
multi-threshold design is having the high-Vth and low-Vth in the
1000 DML dynamic
same chip [34]. In DML design additional transistor play very
important role in high to low output transition in DMLB model 800 DML static
and low to high transition in DMLA model. We have proposed
Delay (ps)
and examined the additional transistor by applying the multi- 600
threshold voltages. Low-Vth transistor achieves very low 400
propagation delay due to fast transition between high-to-low
200
logic of output node. Furthermore, other threshold voltage
combinations of DMLB model have been observed and which 0
are given in Table 2. Simulation results shows that the dynamic 0,5 0,6 0,7 0,8 0,9 1
mode operation attains low PDP up to 36-57% as compared to VDD
static DML model and list numbers are highlighted with a bold (a)
letter.
1800 CMOS
TABLE II 1600
VARIOUS THRESHOLD VOLTAGE COMBINATIONS DML dynamic
1400
1200 DML static
Combinations Power( nW) Delay (pS) PDP(aJ) 1000
Delay (ps)
800
N3, VtL Dynamic 199.6 34.9 6.98 600
N3, VtL Static 193.8 83.6 16.2 400
200
N3, VtH Dynamic 142.9 52.3 7.47 0
0,5 0,6 0,7 0,8 0,9 1
N3, VtH Static 138.1 83.9 11.6
VDD
All N,VtL Dynamic 232.3 33.4 7.75 (b)
All N, VtL Static 222 61.3 13.6
140 CMOS
All N, VtH Dynamic 139.5 57.3 7.98
120 DML dynamic
All N, VtH Static 134.2 122.1 16.4 100
DML static
All P, VtL Dynamic 286.4 3.29 9.41 80
Delay (ps)
60
All P, VtL Static 262.4 76.4 19.5
40
All P, VtH Dynamic 141.4 57.5 8.12
20
All P, VtH Static 136.5 100 13.7 0
Note: VtL- low threshold voltage, VtH -high threshold voltage 0,5 0,6 0,7 0,8 0,9 1
VDD
The supply voltage plays a major role in high performance 250 CMOS
designs, in order to analyze variation in propagation delay times
DML dynamic
with respect to region of operation. The supply voltage is varied 200
from 0.5 to 1 V. The basic NAND and NOR gates have been DML static
150
implemented with three and four inputs. Detailed graphs for
Delay (ps)
42
[7] S. W. Sun and P. G. Y. Tsui, “Limitation of CMOS supply-voltage scaling
40 by MOSFET threshold-voltage variation,” IEEE J. Solid-State Circuits,
vol. 30, no. 8, pp. 947–949, 1995.
38 [8] V. De and S. Borkar, “Technology and Design Challenges for Low Power
36 and High Performance,” Proceeding Int. Symp. Low Power Electron. Des.,
27 34 41 48 55 62 69 76 83 90 97 pp. 163–168, 1999.
[9] M. A. Al-Absi, “Low Voltage and Low Power Current-Mode Divider and
Temp
1/X Circuit Using MOS Transistor in Subthreshold,” Arab. J. Sci. Eng.,
(d) vol. 38, no. 9, pp. 2411–2414, 2013.
[10] M. Alioto, G. Palumbo, and M. Pennisi, “Understanding the effect of
52 CMOS DML dynamic DML static process variations on the delay of static and domino logic,” IEEE Trans.
Very Large Scale Integr. Syst., vol. 18, no. 5, pp. 697–710, 2010.
50 [11] A. P. Chandrakasan, S. Sheng, and R. W. Brodersen, “Low-power CMOS
48 digital design,” IEICE Trans. Electron., vol. 75, no. 4, pp. 371–382, 1992.
[12] F. Moradi, T. Vu Cao, E. I. Vatajelu, A. Peiravi, H. Mahmoodi, and D. T.
Delay (ps)
[16] I. Levi, S. Member, and A. Fish, “Dual Mode Logic — Design for Energy [26] B. J. Sheu, D. L. Scharfetter, P. K. Ko, and M. C. Jeng, “BSIM: Berkeley
Efficiency and High Performance,” vol. 1, 2013. Short-Channel IGFET Model for MOS Transistors,” IEEE J. Solid-State
[17] I. Levi, A. Belenky, and A. Fish, “Logical effort for CMOS-based dual Circuits, vol. 22, no. 4, pp. 558–566, 1987.
mode logic gates,” IEEE Trans. Very Large Scale Integr. Syst., vol. 22, [27] M. H. Na, E. J. Nowak, W. Haensch, and J. Cai, “The effective drive
no. 5, pp. 1042–1053, 2014. current in CMOS inverters,” Dig. Int. Electron Devices Meet., pp. 121–
[18] D. Markovic, C. C. Wang, L. P. Alarcon, T. Te Liu, and J. M. Rabaey, 124, 2002.
“Ultralow-power design in near-threshold region,” Proc. IEEE, vol. 98, [28] B. H. Calhoun, A. Wang, and A. Chandrakasan, “Modeling and sizing for
no. 2, pp. 237–252, 2010. Minimum Energy Operation in Subthreshold Circuits,” IEEE J. Solid-
[19] V. De, “Near-Threshold Voltage Design in Nanoscale CMOS,” p. 2012, State Circuits, vol. 40, no. 9, pp. 1778–1785, 2005.
2012. [29] B. Zhai, D. Blaauw, D. Sylvester, and K. Flautner, “The limit of dynamic
[20] J. G. Delgado-Frias, Z. Zhang, and M. A. Turi, “Near-threshold CNTFET voltage scaling and insomniac dynamic voltage scaling,” IEEE Trans.
SRAM cell design with removed metallic CNT tolerance,” Proc. - IEEE Very Large Scale Integr. Syst., vol. 13, no. 11, pp. 1239–1252, 2005.
Int. Symp. Circuits Syst., vol. 2015–July, no. 2, pp. 2928–2931, 2015. [30] I. Levi, A. Kaizerman, and A. Fish, “Low voltage dual mode logic: Model
[21] H. Kaul, M. Anders, S. Hsu, A. Agarwal, and R. Krishnamurthy, “Near- analysis and parameter extraction,” Microelectronics J., vol. 44, no. 6, pp.
Threshold Voltage ( NTV ) Design — Opportunities and Challenges.” 553–560, 2013.
[22] R. G. Dreslinski, M. Wieckowski, D. Blaauw, D. Sylvester, and T. Mudge, [31] M. Asyaei, “A new leakage-tolerant domino circuit using voltage-
“Near-threshold computing: Reclaiming moore’s law through energy comparison for wide fan-in gates in deep sub-micron technology,” Integr.
efficient integrated circuits,” Proc. IEEE, vol. 98, no. 2, pp. 253–266, VLSI J., vol. 51, pp. 61–71, 2015.
2010. [32] J. C. Park, V. J. Mooney, and P. Pfeiffenberger, “Sleepy stack reduction
[23] D. M. Harris, B. Keller, J. Karl, and S. Keller, “A transregional model for of leakage power,” Lect. notes Comput. Sci., vol. 14, no. 11, pp. 148–158,
near-threshold circuits with application to minimum-energy operation,” 2004.
Proc. Int. Conf. Microelectron. ICM, pp. 64–67, 2010. [33] M. C. Johnson, D. Somasekhar, L. Y. Chiou, and K. Roy, “Leakage
[24] F. Crupi, D. Albano, M. Alioto, J. Franco, L. Selmi, J. Mitard, and G. control with efficient use of transistor stacks in single threshold CMOS,”
Groeseneken, “Impact of high-mobility materials on the performance of IEEE Trans. Very Large Scale Integr. Syst., vol. 10, no. 1, pp. 1–5, 2002.
near- and sub-threshold CMOS logic circuits,” IEEE Trans. Electron [34] B. H. Calhoun, F. A. Honoré, and A. P. Chandrakasan, “A leakage
Devices, vol. 60, no. 3, pp. 972–977, 2013. reduction methodology for distributed MTCMOS,” IEEE J. Solid-State
[25] T. Sakurai and a. R. Newton, “Alpha-power law MOSFET model and its Circuits, vol. 39, no. 5, pp. 818–826, 2004.
applications to CMOS inverter\ndelay and other formulas,” IEEE J. Solid-
State Circuits, vol. 25, no. 2, pp. 584–594, 1990.