Professional Documents
Culture Documents
Low-Power Design of CML Drivers For On-Chip Transmission-Lines
Low-Power Design of CML Drivers For On-Chip Transmission-Lines
Akira Tsuchiya
Takeshi Kuboki
Hidetoshi Onodera
I. Introduction
According to the continuous improvement of the LSI fabrication technologies, the performance of LSIs is rapidly accelerating. One of the big challenges in high-performance LSIs is the
interconnect bottleneck problem. The on-chip interconnection
is becoming a limitation of the whole chip performance because
the performance of on-chip wires does not improve by technology scaling. At the same time, the multi-core architecture is a
trend of microprocessor design [1]. In such LSIs, on-chip buses
have strong impact on the chip performance and on-chip interconnection is required to achieve higher bandwidth [2]. Thus
on-chip high-speed communication is one of the hot topics and
several methods are developed [37].
In high-speed communication, the primal elements of the signaling system are the driver, the receiver and the interconnect.
The on-chip interconnects can transmit 10Gbps or higher bit
rate [8] and the bottleneck is the driver circuit. To realize highspeed circuits, current-mode-logic (CML) is one of the solutions
[912]. However CML buers have more design parameters
comparing static CMOS inverter [13]. CML buers can operate in higher frequency and have tolerance to the common mode
noise, but CML buers require the static current ow and the
power dissipation is larger than that of the static CMOS buers.
Thus low-power design of CML buers is discussed [14].
This paper proposes a low-power design of CML buers
for on-chip transmission-lines. Conventionally, impedancematched driver is a common practice for transmission-line
drivers [13].
The proposed method use an impedanceunmatched driver to reduce the power dissipation. Impedanceunmatched driver cause reection of the propagating wave,
however the reected wave rapidly attenuates on on-chip
transmission-lines. Therefore the impedance-mismatch is not a
serious problem. We use a resistance larger than the characteristic impedance as the pull-up resistance. By using the larger resistance, we can reduce the tail current. From the pole frequency
analysis, we can reduce the tail current without degrading the
VDD
RD
RD
Vout1
Vout2
Vin1
M1
M2
Vin2
CL
CL
Itail
VDD
Vout1
N stages
Vout2
RD Itail = Vout
1st
2 nd
N th
differential
transmission-line
VDD - RD Itail
Vin1 - Vin2
Vin, min
u N-1=X
2LItail
.
2
Cox Vout
(3)
Equation (2) is the lower limit of the output voltage. The upper limit of the output voltage depends on the threshold voltage
Vth . For high-speed operation, nMOS transistors should operate in the saturation. From this constraint, the maximum output
voltage is derived as [13]
Vout = RD Itail Vth .
(4)
RD = RD /u
k+1
,
(6)
Wk+1 = uWk
I
tailk+1 = uItailk
the subscript k + 1 and k denote the (k + 1)-st stage and the k-th
stage respectively. The parameter X is the ratio between the rst
stage and the last stage and is equal to uN1 .
The latency of tapered CML buer is discussed in Ref. [13]
and Ref. [13] concludes that the delay of CML buer has similarity with static CMOS buer and the number of stages becomes
optimal when the number of stages N satises
(
)
N ln uN1 .
(7)
In other word, the optimal taper factor is Napiers constant e.
On the other hand, the bandwidth is also an important metric of CML driver. From the viewpoint of the bandwidth, the
taper factor and the number of stages change the gain curve
of the CML buer. As the taper factor u becomes small, the
pole frequency shifts toward high frequency because the load
capacitance becomes small. However as the number of stages
increases, the gain drops rapidly in high frequency region. An
20
VDD
5 stages
15
4 stages
10
5
2 stages
Vout1
3 stages
-5
Vout2
-10
-15
-20
CL
-25
0.1
1.0
Frequency [GHz]
Vin1
Vin2
CL
10
Itail
1
,
RD C L
(8)
if there is no parasitics. Without considering the parasitic capacitances, the load capacitance CL is the sum of the drain-backgate
capacitance and the gate capacitance of the next stage. Then the
pole frequency of tapered CML buers is expressed as
p =
1
(
),
RDN CDBN + uCGN
(9)
where RDN , CDBN and CGN is the pull-up resistance, the drainbackgate capacitance and the gate capacitance of the nal stage
of tapered buer respectively.
Eye-diagram
evaluation
N stages
random NRZ
1st
2 nd
N th
bridge termination
(100 )
u N-1=X
5.6
100
5 stages
(pole: 21.6GHz)
0.4
0.5
3 stages
(pole: 14.5GHz)
0.3
0.2
4 stages
(pole: 18.9GHz)
0.1
2 stages
(pole: 5.6GHz)
0
10
20
30
40
Frequency [GHz]
50
60
70
10
Parasitics of Tr
Parasitics of wire
are dominant
are dominant
0.1
10
100
1000
Pull-up resistance [Ohm]
From Eq. (3), the transistor size is proportional to the tail current. As explained in Section II, the product of RD and Itail is determined by the output voltage swing. Thus the pole frequency
can be rewritten as
p =
1
1
1
= const.
RDCL RD W
RD Itail
(10)
Therefore if the load capacitance CL is proportional to the transistor size, the pole frequency does not change by tuning the
pull-up resistance RD . As a result, the impedance-unmatched
driver can reduce the tail current without degrading the bandwidth. We experimentally verify the impedance-unmatched
driver in the next section.
B. Relationship between the pole frequency and the pull-up resistance
If the load capacitance CL is proportional to the gate width,
we can decrease the tail current by increasing the pull-up resistance. However the load capacitance includes the parasitic capacitance of wire and the pull-up resistance. The wire parasitics
is not depends on the gate width. If we realize the pull-up resistor
by polysilicon, the parasitic capacitance of the resistance is proportional to the resistance value. When the gate width is large
and the gate capacitance and the drain-backgate capacitance is
dominant, we can reduce the tail current without degrading the
bandwidth of the CML buer. As the size of switching transistor
decreases, the parasitic capacitance of wires and the resistance
becomes signicant. In this case, the pole frequency shifts to the
lower frequency as increasing the pull-up resistance.
Figure 8 shows the relationship between the pull-up resistance
and the pole frequency. The parasitic capacitances are extracted
from a CML buer designed in a 90nm CMOS process. When
the resistance is small, the switching transistor is large and the
gate capacitance and the drain-backgate capacitance is dominant. In this region, the pole frequency is almost the constant
even the pull-up resistance is changes. As the resistance becomes larger, the parasitic capacitance of wire and that of the
resistance becomes dominant. Thus the pole frequency becomes
lower when the pull-up resistance is large.
Therefore the proposed method is limited by the parasitic capacitance of wires and that of the resistance.
Random
input
(differential)
u N-2
1 u
0.5
Eye-diagram
evaluation
terminator
cascade
N=4
eye opening voltage [V]
driver
X=uN-1
0.4
0.3
RD=50 Ohm
RD=100 Ohm
0.2
16
A. Simulation setup
22
24
26
frequency [GHz]
28
30
32
N=5
0.4
0.3
RD=50 Ohm
RD=100 Ohm
0.2
20
0.5
V. Experimental Results
In this section, we show some experimental results of the
impedance-unmatched driver. First, we explain the simulation setup. Next, the bandwidth and the power dissipation of
impedance-unmatched drivers are evaluated by circuit simulation.
18
16
18
20
22
24
26
frequency [GHz]
28
30
32
TABLE I
Total tail current of the driver.
#stages matched-driver
proposed
(N)
(RD = 50)
(RD = 100) ratio
4
13.9mA
10.4mA
0.748
5
16.5mA
12.4mA
0.752
6
19.2mA
14.4mA
0.750
8
24.6mA
18.5mA
0.752
: (ratio)=(proposed)/(matched-driver)
References
[1] D. Pham, S.Asano, M. Bolliger, M. Day, H. Hofstee,
C. Johns, J. Kahle, A. Kameyama, J. Keaty, Y. Masubuchi,
M. Riley, D. Shippy, D. Stasiak, M.Wang, J. Warnock,
S. Weitzel, D. Wendel, T. Yamazaki, and K. Yazawa, The
Design and Implementation of a First-Generation CELL
processor, IEEE International Solid-State Circuits Conference Digest of Technical Papers, Feb 2005.
[2] Tao Lin and Lawrence T. Pileggi, Throughput-Driven IC
Communication Fabric Synthesis, Proc. ICCAD, pp.274
279, 2002.
[3] Naresh Shanbhag, Pushing the Limits of Interconnect Performance: A Communication-Centric Approach, Tutorial of IEEE Workshop on Signal Propagation on Interconnects, May 2005.
[14] Massimo Alioto and Gaetano Palumbo, Design Techniques for Low-Power Cascaded CML Gates, Proceedings of IEEE International Symposium on Circuits and Systems, pp.46854688, May 2005.
[4] Anup P. Jose, George Patounakis, and K. L. Shepard, Near-Speed-of-Light On-Chip Interconnects Using
Pulsed Current-Mode Signaling, Symposium on VLSI Circuits Digest of Technical Papers, pp.108111, Jun 2005.
[15] Behzad Razavi, Design of Analog CMOS Integrated Circuits, McGraw-Hill, Aug 2000.
[16] Akira Tsuchiya, Masanori Hashimoto, and Hidetoshi Onodera, Driver Sizing for High-Performance Interconnects
Considering Transmission-Line Eects, Proceedings of
SASIMI2001, pp.377381, Oct 2001.