Professional Documents
Culture Documents
BR SSCM 1 2022
BR SSCM 1 2022
Behzad Razavi
A
As explained in “The Design of an
Equalizer—Part One” [1], we wish to
develop an equalizer meeting the fol-
lowing performance:
■■ data format: nonreturn-to-zero
This time-domain characterization
reveals that each bit generates a tail
that interferes with the next bit(s).
We, thus, seek a method of canceling
this effect, recognizing that
a scaling factor of h 1 and is called
the “first tap” or “first coefficient.”
We expect that the tail cancellation
accords the “summing junction”
signal D sum, a greater
■■ data rate = 56 Gb/s a fixed channel exhibits a eye opening than that
■■ channel loss at 28 GHz = 20 dB certain ratio between the In this simplest at the DEF input.
■■ input differential swing = 800 mVpp amplitudes of, for example, form, a DFE senses It is instructive to
■■ bit error rate (BER) 1 10
-12
the first postcursor and the data, makes examine the DFE wave-
a binary decision,
■■ power consumption = 10 mW the main cursor. Denot- forms in a favorable
delays the result
■■ V DD = 1V. ing this ratio by h 1, we condition, i.e., with little
by a 1-b period Tb ,
The simulations are carried out articulate the necessary and feeds a scaled channel loss. As illus-
with VDD = 1V - 5% in the slow–slow operation as follows: copy of the result trated in Figure 2(b), the
corner of the process and at T = 75c C. the present bit should back to the input. first bit B 1 is delayed and
We surmised that a continuous- be “memorized,” scaled scaled by a factor of h 1 as
time linear equalizer (CTLE) and a by a factor of h 1, and it appears in DF. Upon sub-
decision-feedback equalizer (DFE) subtracted from the next bit so as to traction from the next bit B 2, this
would prove necessary for this pur- remove the interfering tail. This func- scaled replica increases the voltage
pose. The two-stage CTLE designed in tion is afforded by a DFE. swing to -1 - h 1 . Similarly, B 2 boosts
the first part provides a boost factor of In this simplest form, a DFE the amplitude of B 3 to +1 + h 1 . We
about 13 dB at the Nyquist frequency senses the data, makes a binar y conclude that a 1010 data sequence
fNyq = 28 GHz. The circuit delivers an decision, delays the result by a 1-b experiences greater swings as a
eye height of 220 mV and an eye width period Tb, and feeds a scaled copy of result of the DFE action, a highly
of 13.6 ps while drawing 5 mW. In this the result back to the input. Shown desirable effect, as such a sequence
part, we develop a DFE that achieves a in Figure 2(a) is a possible realiza- displays the most severe eye closure
greater eye opening and, hence, allows tion, where the flip-flop (FF) per- in a lossy medium [1].
robust sampling of the data with a low forms both the decision action (also Interestingly, the DFE introduces
BER. The reader is referred to various referred to as “slicing”) and the its own eye closure in the presence
articles written on the subject [2]–[8]. delay function. The return path has of consecutive ones (or zeros), also
General Considerations
Suppose that we apply an impulse
to a lossy channel (Figure 1). Due Main Cursor
Postcursors
to the medium’s limited bandwidth,
the output exhibits a long tail. The
h1
peak value is called the “main cur-
Channel h2
sor,” that at t = Tb is referred to as
the “first postcursor,” and so on.
0 t 0 Tb 2Tb t
c m
1 Q TV - 4VOS 1 10 - 12, (2)
2 Vn, rms
FF1
Authorized licensed use limited to: UCLA Library. Downloaded on February 17,2022 at 10:45:39 UTC from IEEE Xplore. Restrictions apply.
third postcursors at relative levels of t y p i cally chosen in the range of 400 erate gate-source voltage, e.g., around
60%, 41%, and 30%, respectively. The to 500 mV. With complete switch- 700 mV, so that these transistors
latter indicates that the CTLE reduces ing of M 1 – M 4, these swings are eq operate in or near saturation. Third,
these to 22%, –3%, and –6%, respec- ual to 1mA # R D, suggesting the current mirror as
tively. The slightly underdamped R D . 500 X. We must also well as C 1 and C 2 can
response at the CTLE output origi- ensure that the small-signal To compute the be shared among all of
relative strengths
nates from a few decibels of peaking loop gain around M 3 and the DFE’s latches; the
of the DFE taps,
that appear in the cascade frequency M 4 is greater than unity capacitor values are then
we must examine
response (s h o w n in the first arti- so that the circuit properly the impulse chosen according to the
cle in this series [1]). regenerates when CK falls response of the total gate capacitance that
The voltage swings in Figure 4 char- and CK rises. channel–CTLE they must drive.
acterize the CTLE under small-signal The clock path in cascade. When characterizing
conditions. This is necessary because we Figure 5 merits some re the speed of FFs for a DFE
can predict the step or pulse response marks. First, at 56 GHz, environment, we should
from the impulse response only if the CK and CK are close to sinusoids, espe- examine their input sensitivity Vsen,
system remains linear. Also, the worst- cially if they are provided by resonant defined as the minimum difference
case scenario of a 1010 sequence does buffers (stages with LC tank loads). that guarantees correct decisions
lead to relatively small swings at the Compared to square waves, sinusoidal at the desired clock frequency. Spe-
channel output. Nevertheless, as the clocks exhibit longer transition times, cifically, Vsen must be studied in the
channel and CTLE “recover” from a long thus elongating the FF response. Sec- context of the waveforms delivered
run, e.g., between t 1 and t 2 in Figure 3, ond, if nearly rail-to-rail clock swings by the channel–CTLE cascade, e.g., a
dynamic nonlinearities in the CTLE can are available, then C 1 and C 2 need 1111 run followed by a 0101 pattern
cause departure from our foregoing not be much greater than the input (Figure 3). The swing received by the
results. In other words, some iteration capacitance of M 5 and M 6 . This is FF after the long run is small and of
in the DFE tap values may be necessary because we prefer to maintain a mod- opposite value, requiring that the
if the greatest eye opening is desired.
FF Design
The performance of the DFE shown
1.2 Channel Output
in Figure 2 hinges primarily upon
CTLE Output
the FF design. While the simplicity 1
and efficiency of the StrongARM
0.8
Voltage (mV)
0.9 +
Din VX VX
0.85 – 0.9 0.9
Din VY VY
0.8 0.8
0.8
Voltage (V)
Voltage (V)
Voltage (V)
0.75
0.7 0.7 0.7
0.65 0.6
0.6
0.6
0.55 0.5
0.5
0.5 0.4
4.8 4.85 4.9 4.95 5 4.8 4.85 4.9 4.95 5 4.8 4.85 4.9 4.95 5
Time (ns) Time (ns) Time (ns)
(a) (b)
FIGURE 7: The (a) input and output waveforms of a CML latch in an overdrive recovery test with V0 = 70 mV and (b) latch output for the case
of V0 = 100 mV.
Authorized licensed use limited to: UCLA Library. Downloaded on February 17,2022 at 10:45:39 UTC from IEEE Xplore. Restrictions apply.
Figure 12 and slightly improves the
eye opening.
× 10–15 In the last step of our design,
1.4 we ponder higher-order taps. Our
impulse response analysis has yielded
1.2
Noise Spectrum (V2/Hz)
0.1 0.1
Vsum (V)
FF1
0 0
From M1 M2
–0.1 –0.1
CTLE
–0.2 –0.2 100 Ω
–0.3 –0.3
–0.4 –0.4
–0.5 –0.5 0.5 mA 200 fF 0.5 mA
0 5 10 15 20 25 30 35 0 5 10 15 20 25 30 35
Time (ps) Time (ps)
(a) (b)
0
–0.1
–0.2 References
–0.3 [1] B. Razavi, “The design of an equalizer —
Part one,” IEEE Solid State Circuits Mag.,
–0.4 vol. 13, no. 4, pp. 7–160, Fall 2021, doi:
10.1109/MSSC.2021.3111426.
–0.5 [2] J. Im et al., “A 40-to-56 Gb/s PAM-4 receiv-
0 5 10 15 20 25 30 35 er with ten-tap direct decision-feedback
Time (ps) equalization in 16-nm FinFET,” IEEE J. Sol-
id-State Circuits, vol. 52, no. 12, pp. 3486–
3502, Dec. 2017, doi: 10.1109/JSSC.2017.
FIGURE 12: The DFE summer eye diagram in the presence of degeneration. 2749432.
[3] T. Shibasaki et al., “A 56-Gb/s receiver
front-end with a CTLE and 1-tap DFE in
20-nm CMOS,” in Proc. VLSI Circuits Symp.
Dig., Jun. 2014, pp. 1–2, doi: 10.1109/VL-
VDD SIC.2014.6858400.
[4] A. Roshan-Zamir et al., “A 56-Gb/s PAM4
RD RD receiver with low-overhead techniques
for threshold and edge-based DFE FIR-
and IIR-tap adaptation in 65-nm CMOS,”
FF1 FF2 FF3
IEEE J. Solid-State Circuits, vol. 54, no.
3, pp. 672–684, Mar. 2019, doi: 10.1109/
JSSC.2018.2881278.
[5] A. Cevrero et al., “A 100Gb/s 1.1pJ/b
PAM-4 RX with dual-mode 1-tap PAM-
4 / 3-tap NRZ speculative DFE in 14nm
CMOS FinFET,” in Proc. IEEE Int. Solid-
ISS1 250 µA ISS4 60 µA State Circuits Conf. (ISSCC), Feb. 2019,
pp. 112–113, doi: 10.1109/ISSCC.2019.
8662495.
[6] S. Ibrahim and B. Razavi, “Low-power CMOS
equalizer design for 20-Gb/s systems,”
FIGURE 13: The addition of the third tap to the DFE. IEEE J. Solid-State Circuits, vol. 46, no. 6,
pp. 1321–1336, Jun. 2011, doi: 10.1109/
JSSC.2011.2134450.
[7] B. Razavi, “The decision-feedback equal-
izer [A Circuit for All Seasons],” IEEE
0.5 Solid-State Circuits Mag., vol. 9, no. 4, pp.
13–16, Fall 2017, doi: 10.1109/MSSC.2017.
0.4 2745939.
[8] P. Peng, J. Li, L. Chen, and J. Lee, “A 56Gb/s
0.3 PAM-4/NRZ transceiver in 40nm CMOS,”
0.2 in Proc. IEEE Int. Solid-State C ircuits Conf.
(ISSCC), Feb. 2017, pp. 110–111, doi:
0.1 10.1109/ISSCC.2017.7870285.
Vsum (V)
Authorized licensed use limited to: UCLA Library. Downloaded on February 17,2022 at 10:45:39 UTC from IEEE Xplore. Restrictions apply.