You are on page 1of 4

470 IEEE SOLID-STATE CIRCUITS LETTERS, VOL.

3, 2020

A 6.4–11 Gb/s Wide-Range Referenceless Single-Loop


CDR With Adaptive JTOL
Hye-Ran Kim , Jun-Yeol Lee, Jeong-Su Lee, Dong-Seok Kang, and Jung-Hoon Chun , Member, IEEE

Abstract—A referenceless single-loop clock and data recovery (CDR)


with a 6.4–11-Gb/s capture range is presented. A dynamic bandwidth
control (DBWC) technique for reducing the frequency-acquisition time
and adaptive loop gain control (ALGC) for optimizing the jitter toler-
ance are described in detail. In addition, a cycle-slip detector (CSDET)
with a consecutive-edge selector (CES) is proposed to improve the accu-
racy of the frequency detection. Fabricated in a 28-nm CMOS process,
the proposed CDR has a frequency-acquisition time that is reduced by
approximately 30% with the DBWC technique enabled. Also, the cap-
ture range is expanded by 1.1 Gb/s compared to the prior work due to
improvement of the CSDET. The proposed ALGC technique based on
jitter estimation is demonstrated with JTOL test results using PRBS15
input data with 0.28-UIPP random jitter and 1–100 MHz sinusoidal jitter.
Index Terms—Adaptive loop gain control (ALGC), cycle-slip detec-
tion, frequency acquisition time, half-rate linear phase detector (PD),
referenceless clock and data recovery (CDR), single-loop CDR.

Fig. 1. Proposed referenceless single-loop CDR.


I. I NTRODUCTION
Clock and data recovery (CDR) circuits are responsible for both Given these requirements, this letter introduces a dynamic band-
frequency tracking to find the input data rate and phase tracking width control (DBWC) technique that is only activated during the
to maximize the timing margin between the recovered clock and frequency-tracking operation and an adaptive loop gain control
the input data. General, dual-loop CDRs, which implement these (ALGC) technique to improve the jitter tolerance of a referenceless
two functions in two different loops, are widely used. However, single-loop CDR.
despite the absence of a dedicated frequency-tracking loop, single- This letter is organized as follows. Section II presents the overall
loop CDRs, which have a relatively small active area and simple loop architecture of the presented CDR with a detailed explanation of the
control, can still perform frequency acquisition with a single phase- implemented circuits. Section III shows the chip measurement results.
tracking loop. Furthermore, referenceless CDRs have been studied
rigorously because they can be simply implemented without an exter- II. P ROPOSED A RCHITECTURE
nal clock. A variety of circuit techniques to expand the frequency-
tracking range, such as a rotational frequency detector (FD) or digital The overall structure of the proposed CDR is shown in Fig. 1.
quadri-correlator FD (DQFD), have been introduced [1]–[4]. The cycle-slip detector (CSDET), the DBWC block, and the ALGC
However, the frequency-tracking and phase-tracking performances block are added to the conventional phase-tracking loop based on the
of a referenceless single-loop CDR have a tradeoff relationship. For half-rate linear phase detector (PD). The CSDET controls the output
example, increasing the loop gain helps to reduce the frequency- of the PD to perform a frequency-tracking operation. To expand the
acquisition time, whereas it causes the degradation of the timing capture range, an additional circuitry named consecutive-edge selec-
margin according to the input jitter [5]–[6]. Therefore, we need tor (CES) is added to the CSDET introduced in a prior work [7].
to eliminate or mitigate the interference between the frequency- The auxiliary CP (AUX-CP) in the DBWC block is activated only
tracking and phase-tracking operations and optimize them separately during frequency tracking to reduce the frequency-acquisition time.
to achieve both fast locking and high jitter tolerance characteristics. After the frequency-tracking operation is completed, the ALGC block
estimates the frequency and amplitude of the input jitter and adjusts
Manuscript received August 1, 2020; revised September 18, 2020; accepted the loop gain accordingly to maximize the timing margin during the
September 23, 2020. Date of publication September 28, 2020; date of cur- phase-tracking operation.
rent version October 21, 2020. This article was approved by Associate
Editor C. Kim. This work was supported in part by the Future Interconnect
Technology Cluster Program of Samsung Electronics and in part by IITP A. Dynamic Bandwidth Control Technique
grant funded by the Korea Government (MSIT) under Grant 2020-0-00960. In the CSDET, the DATA edges sample CK0 and CK90 as
(Corresponding author: Jung-Hoon Chun.) illustrated in Fig. 1. According to the sampling results, the phase
Hye-Ran Kim is with the DRAM Design Team, Samsung Electronics,
Hwasung 18448, South Korea, and also with the College of Information and relationship between the input data and clock can be defined as four
Communication Engineering, Sungkyunkwan University, Suwon 16419, South states: (A), (B), (C), and (D), as shown in Fig. 2(a)–(d). For example,
Korea (e-mail: hr.kim@skku.edu). the state (A) means that the sampled CK0 and CK90 are high and
Jun-Yeol Lee, Jeong-Su Lee, Dong-Seok Kang, and Jung-Hoon Chun low (CK0 = 1, CK90 = 0), respectively. The PD outputs, UP_PRE
are with the College of Information and Communication Engineering,
Sungkyunkwan University, Suwon 16419, South Korea (e-mail:
and DN_PRE, are set according to the determined state and affect the
jhchun@skku.edu). corresponding charge pump (CP) current. When there is a frequency
Digital Object Identifier 10.1109/LSSC.2020.3027021 error, the state determined by the phase relationship between the input
2573-9603 
c 2020 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See https://www.ieee.org/publications/rights/index.html for more information.

Authorized licensed use limited to: Carleton University. Downloaded on November 05,2020 at 03:22:05 UTC from IEEE Xplore. Restrictions apply.
KIM et al.: 6.4–11-Gb/s WIDE-RANGE REFERENCELESS SINGLE-LOOP CDR WITH ADAPTIVE JTOL 471

(a) (b)
(a)

(c) (d)

Fig. 2. Frequency acquisition based on ICP when (a) 2 fck < fdata , (b) 2 (b) (c)
fck > fdata and ICP,AUX , (c) 2 fck < fdata , and (d) 2 fck > fdata .
Fig. 4. (a) Frequency detection with clock pattern DATA versus nonclock
pattern DATA. (b) Operation of the CSDET with CES. (c) Normalized PFD
gain comparison between w/o and with CES.

between the frequency-tracking loop and the phase-tracking loop, as


is needed in the dual-loop CDR.

B. Consecutive-Edge Selector in CSDET


To illustrate the shortcomings of the conventional CSDET, Fig. 4(a)
(a) (b) shows the phase relationship between CK0/CK90 and DATA with two
positive ferr ( ferr = fdata - 2 fck > 0) cases: 1) close to maximum or
Fig. 3. Timing diagram of overall frequency-acquisition process in the
proposed CDR when (a) 2 fck < fdata and (b) 2 fck > fdata . 2) nearly zero. It is assumed that during the first half of the depicted
period, DATA toggles like a clock, however, during the second half,
DATA has random transitions. With a large ferr , the state determined
data and clock continues to change. When 2 fck < fdata , as shown in at every rising edge of the clock-pattern DATA changes in order of
Fig. 2(a), the state changes in order of “A→D→C→B→ · · · ” When “A→C→B→ · · · ” Therefore, the transition from (C) to (B), which
2 fck > fdata , the state changes the opposite direction as shown in is “valid” for correct frequency detection, can be monitored by the
Fig. 2(b). Therefore, the PD outputs keep changing accordingly, and two successive rising edges. However, if DATA does not toggle like
the CP current, ICP , ramps and flips repeatedly as shown in Fig. 2(a) a clock, some of state detection can be skipped. As a result, “valid”
and (b). The abrupt change of ICP is called a cycle slip, and ICP after state transitions from (C) to (B) may not occur and instead “false”
each cycle-slip occurrence, which is marked in dark gray in Fig. 2(a) transitions from (B) to (C) may occur. On the other hand, with a
and (b), interferes with the frequency acquisition [7]. small ferr the state changes slowly (e.g., C→C→C→B→B→ · · · ):
When there are consecutive transitions of DATA, it is possible to therefore, the possibility of the wrong state transition is low even if
determine both the sign of the frequency error and whether a cycle the data transition density is low.
slip occurs, based on the state-movement direction. For example, as In Fig. 4(a), we can observe that the transition from (C) to (B)
shown in Fig. 2(a), DN_BLKB is generated by the CSDET when detected by two consecutive rising edges of DATA always passes
the state changes from (A) to (D) or from (C) to (B). If 2 fck < through the intermediate state (A) at the “falling” edge in the mid-
fdata , DN_BLKB is asserted high, blocking the pull-down current dle of the two rising edges. The proposed CES, shown in Fig. 1,
of CP to achieve the frequency acquisition. At the same time, as detects this intermediate state by sampling CK0 and CK90 at DATA’s
depicted in Fig. 2(c), an additional push-up current from AUX-CP falling edges. Using the intermediate state information from the CES
is generated by DN_BLKB, and it speeds up the increasing rate of to determine the validity of detected transitions, we can efficiently
fck . The push-up current of AUX-CP is generated only when both exclude false transitions described above. Fig. 4(b) shows the opera-
DN_BLKB and DN are activated. Similarly, UP_BLKB is generated tion of the proposed CSDET with CES. If 2 fck < fdata , the frequency
by the CSDET when the state moves in the opposite direction, as acquisition with DN_BLKB activation starts only when the state tran-
shown in Fig. 2(b). When 2 fck > fdata , UP_BLKB blocks the push- sition due to two consecutive rising edges of DATA is from (C) to
up current of CP in Fig. 2(b) and generates the pull-down current of (B) and the output of CES, the intermediate state, is (A). Similarly, if
AUX-CP in Fig. 2(d). Finally, fck decreases until the frequency acqui- 2 fck > fdata , the transition from (B) to (C) is considered valid when
sition is obtained. Fig. 3(a) and (b) shows that the timing diagram of the intermediate state from CES is (D).
the overall frequency-acquisition process in the proposed CDR. Fig. 4(c) shows the normalized PFD gain according to the
As explained above, the proposed DBWC block is activated only frequency offset between the clock and data simulated with 10-Gb/s
when the frequency acquisition is in progress. The proposed CDR PRBS15 data. Because the CES can exclude false state transitions,
contains two CPs, but there is no interference between the two CP the PFD gain increases especially when there is a large frequency off-
operations. Adding a DBWC block is effective in terms of acquisition set. A higher PFD gain with a large frequency offset helps to perform
time and complexity because it does not require any additional control frequency acquisition, thus extending the capture range.

Authorized licensed use limited to: Carleton University. Downloaded on November 05,2020 at 03:22:05 UTC from IEEE Xplore. Restrictions apply.
472 IEEE SOLID-STATE CIRCUITS LETTERS, VOL. 3, 2020

(a) (b)
(a)
Fig. 7. (a) Die-photo with block description. (b) Recovered clock jitter at
fck = 4 GHz.

(b) (c)

Fig. 5. (a) ALGC overall block diagram and timing diagram with
(b) high-frequency input jitter and (c) low-frequency input jitter.

(a)

(b)
Fig. 6. Simulation result of V and INC for various jitter frequencies and
amplitudes in ALGC block. Fig. 8. Measured frequency-acquisition behavior with and without DBWC
when (a) fdata = 11 Gb/s and (b) fdata = 6.4 Gb/s.

C. Adaptive Loop Gain Control


The jitter estimation technique in a BBPD-based phase-tracking or VPRE−α thresholds. However, when low-frequency jitter is
loop [5] is applied to the proposed CDR, and an improved present, as shown in Fig. 5(c), V exceeds the threshold because
methodology is obtained by the characteristics of a linear PD. ICP,ALG maintains the same polarity during the integration period.
Fig. 5(a) shows the configuration of the ALGC block. The adaptive Finally, INC = 1 is provided when DLKC2 is activated. While
loop gain CP (ALG CP), switched capacitor (SC), and comparator INC remains high, the counter (CNT) inside the digital deci-
(COMP) estimate the input jitter, and the following synthesized digi- sion block is activated to generate 8-bit INC_CNT as illustrated
tal decision block determines the loop gain according to the estimated in Fig. 5(a). If INC_CNT exceeds the threshold within a prede-
jitter. ICP,ALG generated by UP and DN, outputs of the linear PD, is termined time, it is determined that considerable low-frequency
provided to the SC. The SC is synchronized to DCLK1 which is on jitter is present, and the loop gain is increased for better jitter
the order of tens of MHz. The SC performs integration and precharge tracking. The determined loop gain is encoded to the number of
operations for the first and second half cycles of DCLK1, respectively. enabled CPs.
V of the SC deviates from the precharged level, VPRE, depending On the other hand, since the linear PD is used in the proposed CDR,
on the PD output determined by the input jitter. The COMP deter- V and INC_CNT increase proportionally as the amount of jitter
mines whether V is between the two threshold voltages, VPRE+α increases. Depending on INC_CNT, the incremental step for the loop-
and VPRE-α. The frequency of DCLK2 for the COMP is the same gain control is adjusted with the two different threshold values, NTH1
as that of DCLK1 with a certain phase offset. The final output signal, and NTH2 (NTH1 < NTH2), as described in Fig. 5(a). In Fig. 6,
INC, is provided by merging the two COMP outputs through a 2:1 the simulation results of V and INC are shown for three different
mux. cases of input jitter: 1) 1 MHz/1.2× input jitter; 2) 1 MHz/1.0×
Fig. 5(b) and (c) shows the timing diagrams when high- and low- input jitter; and 3) 100 MHz/0.15× input jitter. Fig. 6 shows that the
frequency sinusoidal jitter are applied, respectively. In Fig. 5(b), proposed jitter estimation circuit with VPRE = 0.5 V and α = 50 mV
the high-frequency input jitter repeats a number of cycles while generates V and INC according to the input-jitter frequency and
ICP,ALG is accumulated in SC: thus, V cannot exceed the VPRE+α amplitude.

Authorized licensed use limited to: Carleton University. Downloaded on November 05,2020 at 03:22:05 UTC from IEEE Xplore. Restrictions apply.
KIM et al.: 6.4–11-Gb/s WIDE-RANGE REFERENCELESS SINGLE-LOOP CDR WITH ADAPTIVE JTOL 473

That is, the proposed DBWC reduces the frequency-acquisition time


by approximately 30%.
As shown in Fig. 9, when ALCG is off and five CPs are always
active, JTOL is relatively high at low jitter frequencies, but could
not satisfy the BER criteria at high jitter frequencies. Conversely,
as indicated by the blue dash-dotted line, a relatively high JTOL is
obtained at high jitter frequencies when one CP is activated. When
ALGC is finally activated, the number of activated CPs adaptively
changes depending on the jitter frequency and amplitude. In other
words, up to five CPs can be activated for low-frequency input jitter,
thus maximizing JTOL, and for high-frequency input jitter, only one
CP remains active to maximize JTOL. In conclusion, the proposed
Fig. 9. Measured JTOL with ALGC. CDR with ALGC achieves high JTOL without increasing BER over
a wide range of input-jitter frequencies.
TABLE I The overall performance of the proposed CDR is compared with
P ERFORMANCE C OMPARISON W ITH P REVIOUS W ORKS that of previous works in Table I. The proposed CDR is the only
referenceless single-loop CDR with both a wide capture range and
adaptive JTOL. The proposed CDR, including the DBWC and ALGC
was implemented with only 0.11-mm2 active area and achieved an
energy efficiency of 2.7 pJ/bit. The relatively high power consumption
of the proposed CDR is due to its 8-phase VCO operating under a
1.5-V power supply.

ACKNOWLEDGMENT
The EDA tool was supported by the IC Design Education Center
(IDEC), Korea.

R EFERENCES
[1] J. Jin, X. Jin, J. Jung, K. Kwon, J. Kim, and J.-H. Chun, “A 0.75–3.0-Gb/s
dual-mode temperature-tolerant referenceless CDR with a deadzone-
compensated frequency detector,” IEEE J. Solid-State Circuits, vol. 53,
no. 10, pp. 2994–3003, Oct. 2018.
[2] K. S. Park, W. Bae, J. Lee, J. Hwang, and D.-K. Jeong, “A
III. E XPERIMENT R ESULTS 6.7–11.2 Gb/s, 2.25 pJ/bit, Single-Loop Referenceless CDR With Multi-
The proposed CDR is fabricated in a 28-nm CMOS process. The Phase, Oversampling PFD in 65-nm CMOS,” IEEE J. Solid-State Circuits,
Die photograph is shown in Fig. 7(a), and the active area of the CDR vol. 53, no. 10, pp. 2982–2993, Oct. 2018.
[3] M. S. Jalali, A. Sheikholeslami, M. Kibune, and H. Tamura, “A reference-
is approximately 0.111 mm2 . The jitters of the recovered clock are less single-loop half-rate binary CDR,” IEEE J. Solid-State Circuits,
monitored by a Tektronics oscilloscope MSO71604C, and the results vol. 50, no. 9, pp. 2037–2047, Sep. 2015.
are shown in Fig. 7(b). With a 4-GHz recovered clock, the RMS and [4] N. Kocaman et al., “An 8.5–11.5-Gbps SONET transceiver with reference-
peak-to-peak jitter were 1.66 ps and 12.5 ps, respectively. less frequency acquisition,” IEEE J. Solid-State Circuits, vol. 48, no. 8,
pp. 1875–1884, Aug. 2013.
Fig. 8(a) and (b) shows the measured frequency-acquisition behav- [5] H.-J. Jeon, R. Kulkarni, Y.-C. Lo, J. Kim, and J. Silva-Martinez, “A
ior of the proposed CDR. With various initial frequencies of the bang-bang clock and data recovery using mixed mode adaptive loop gain
VCO, the capture range was measured while sweeping the data strategy,” IEEE J. Solid-State Circuits, vol. 48, no. 6, pp. 1398–1415,
rate. A maximum capture range of 6.4–11 Gb/s can be achieved Jun. 2013.
[6] L.-H. Chiueh and T.-C. Lee, “A 6-Gb/s adaptive-loop-bandwidth clock
by setting the initial frequency to 4.2 GHz. Fig. 8(a) shows that the
and data recovery (CDR) circuit,” in Proc. IEEE Asian Solid-State Circuits
frequency-acquisition time without DBWC is approximately 14 μs Conf. (A-SSCC), KaoHsiung, Taiwan, Nov. 2014, pp. 289–292.
when fdata is 11 Gb/s and the initial fck is 4.2 GHz. With DBWC, the [7] H. R. Kim and J.-H. Chun, “Referenceless single-loop CDR with a
frequency-acquisition time is reduced from 14 to 9 μs. Also, Fig. 8(b) half-rate linear PD and frequency acquisition technique,” Electron. Lett.,
shows the frequency acquisition when fdata is 6.4 Gb/s and the initial vol. 56, no. 5, pp. 237–239, Mar. 2020.
[8] K. Park et al., “A 4-to-20Gb/s 1.87pJ/b referenceless digital CDR with
fck is 4.2 GHz. Without DBWC, the frequency acquisition time is unlimited frequency detection capability in 65nm CMOS,” in Proc. IEEE
less than 25 μs and with DBWC, it is reduced from 25 to 15 μs. Symp. VLSI Circuits, Kyoto, Japan, Jun. 2019, pp. 194–195.

Authorized licensed use limited to: Carleton University. Downloaded on November 05,2020 at 03:22:05 UTC from IEEE Xplore. Restrictions apply.

You might also like