Professional Documents
Culture Documents
net/publication/224386378
CITATIONS READS
12 465
1 author:
Liming Xiu
Institute of Electrical and Electronics Engineers
88 PUBLICATIONS 775 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Liming Xiu on 23 February 2014.
Abstract—The Flying-Adder frequency synthesis architecture is PLL is used as a digitally controlled oscillator to function as
a novel approach of generating frequencies on chip. Since its in- the VCO (DCO) in an all-digital PLL for a graphic digitizer
vention, it has been used in many commercial products to cope chip. The work of Xiu et al. [10] is the case where FAPLL can
with difficult frequency generation challenges. Along the course
of this architecture’s evolution, various circuit- and system-level be used to reduce the size of on-chip memory. The study of
problems have been resolved. In this paper, one remaining problem Xiu [11] is the summary of FAPLL applications in commercial
related to circuit implementation, namely, the construction of the products with several additional examples.
accumulator, is studied. A new scheme is proposed to achieve the One of the key circuit components used in the Flying-Adder
Flying-Adder accumulation function that not only has speed ad-
vantage but also is power and area efficient. The issue related to
architecture is the adder/accumulator. It is the fixed-VCO-
time–average frequency and jitter is also discussed. Flying-Adder-architecture speed bottleneck for two reasons:
1) The adder/accumulator needs to operate at the speed of the
Index Terms—Accumulator, adder, clock generation, Flying-
Adder, frequency synthesis, phase-locked loop (PLL) . output frequency, and 2) the size of the adder/accumulator
needs to be large for fine frequency resolution. These two
requirements, namely, high speed and large size, make the im-
I. INTRODUCTION plementation of the adder/accumulator a nontrivial task. It has
HE FLYING-ADDER frequency synthesis architecture been the most influencing limiting factor for achieving higher
T is a new way of generating frequency on chip. Mair and
Xiu [1] is the proof of concept of this architecture, and [2]
output frequency for the fixed-VCO-Flying-Adder architecture.
This issue has been discussed intensively in [2].
and [3] are the circuit-level improvements. The Flying-Adder Historically, the VLSI implementation of an adder has re-
technique has two working modes: integer flying adder and ceived substantial attention from researchers since addition is
fixed-voltage-controlled-oscillator (VCO) flying adder. The by far the most frequently used operation in circuit systems. For
integer-Flying-Adder architecture is a powerful extension of example, a digital signal processor relies heavily on the efficient
the integer- phase-locked-loop (PLL) architecture. It en- implementation of arithmetic circuits to execute dedicated algo-
hances the integer- PLL solution space since it utilizes a rithms, such as convolution, correlation, and digital filtering. As
phase divider that can reach a finer frequency resolution than the processor’s bus width enlarges and the processor’s speed in-
a frequency divider can [4]. The fixed-VCO flying adder, on creases, the VLSI implementation of an adder, which is the core
the other hand, is based on the rigorously formed concept of element of complex arithmetic circuits, becomes increasingly
time average frequency [5], [6]. It can virtually achieve any difficult.
frequency desired. Furthermore, with fixed-VCO-Flying-Adder There are several types of parallel adder architectures avail-
PLL, the output frequency can be switched instantly from one able to meet various performance and resource requirements,
frequency to another. The frequency response is instantaneous such as ripple-carry adder (RCA), Manchester carry-chain
since the frequency generation circuit is outside the PLL. adder, carry-skip adder, carry-select adder, carry-look-ahead
Since its invention, the Flying-Adder-architecture-based PLL adder, and carry-save adder [12]. The differences among these
(FAPLL) has been used in many commercial projects to solve architectures lie in the ways of how the carry bits are gen-
problems that cannot be dealt with easily by conventional erated and propagated. There are also several logic styles to
PLLs. Xiu [7] demonstrates how the Flying-Adder-based implement the basic full-adder cell: complementary CMOS,
phase divider works harmonically with the frequency divider complementary pass-transistor logic, transmission-function
to generate many frequencies on chip to support the various full adder, transmission-gate adder, 14-transistor adder, and
functions of a complex system-on-a-chip. The work of Xiu [8] 10-transistor adder [13]–[18]. All these different logic styles
is the detailed example of how the Flying-Adder-based PLL have their cost–performance tradeoffs. The selection criteria
can function as a digitally controlled crystal oscillator (DCXO) concerned include supply-voltage range, voltage swing, speed
to save cost for an MPEG2 decoder chip. In [9], a Flying-Adder or delay, power–delay product, output skew, driving capability,
and area.
All the aforementioned adder-related techniques can be used
Manuscript received September 08, 2008; revised November 01, 2008 and
December 14, 2008. First published February 20, 2009; current version pub- for implementing the accumulation function required in the
lished November 04, 2009. This paper was recommended by Associate Editor Flying-Adder architecture. However, a recent study on the
J. P. de Gyvez. time-average-frequency and Flying-Adder architecture has
The author is an independent consultant, Dallas, TX 75024 USA (e-mail:
lmngx96@gmail.com). revealed an interesting feature that can lead to a fast yet power-
Digital Object Identifier 10.1109/TCSI.2009.2015732 and area-efficient implementation of the accumulation function
1549-8328/$26.00 © 2009 IEEE
Authorized licensed use limited to: TEXAS INSTRUMENTS VIRTUAL LIBRARY. Downloaded on November 10, 2009 at 10:16 from IEEE Xplore. Restrictions apply.
2440 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 56, NO. 11, NOVEMBER 2009
for the Flying-Adder PLL [5], [6]. This paper will focus its Fig. 2. Principal idea of the Flying-Adder architecture.
attention on this issue.
The rest of this paper is organized as follows. Section II
will discuss the newly observed feature associated with the
Flying-Adder architecture and its beneficial consequence on the
circuit implementation. Section III is the mathematical proof of
this method. Section IV shows the behavior-level simulation.
Section V is the transistor-level circuit simulation. Section VI
discusses the advantage of this new scheme. Section VII
presents a related interesting problem for future research.
Section VIII discusses the issue of time-average-frequency and
jitter. Fig. 3. Conceptual operation of an adder.
II. ACCUMULATOR IN THE FLYING-ADDER ARCHITECTURE Now, if we incorporate a fractional number into the control
word as ( is a fractional number), the output
A. Brief Review of Time–Average Frequency
CLKOUT will have a slightly different structure. When is
The concept of time-average frequency has been formally in- presented in , the fractional adds to itself repeatedly
troduced in [5]. The key difference between the convention- as well during the accumulation. Eventually, an overflow will
ally defined frequency and time-average frequency is graphi- happen, and a carry-in signal will propagate to the integer part.
cally shown in Fig. 1. As shown, for conventionally defined fre- In such a case, instead of , the resulting waveform is
quency, all the cycles must have the same length in time. How- . Therefore, the signal CLKOUT contains two types of cy-
ever, for time–average frequency, this not necessarily has to be cles: and . The rate of occurrence of
true. Its cycles could have different lengths. However, for a given depends on the magnitude of . Overall, after a certain number
time frame, such as 1 s, the number of cycles existing in both of cycles, the desired time–average frequency can be achieved
schemes has to be the same. The key advantage of time–average [5]. It is worth to mention that Fig. 2 is the principal circuit, not
frequency is that it can make the difficult task of clock genera- the working circuit used in real chips. Unlike Fig. 2 where the
tion (frequency synthesis) less stressful. transfer function is , in the real circuit, the
transfer function is [2]. In the following sec-
B. Accumulator in the Flying-Adder Architecture tions of simulation, we will use the real circuit.
The Flying-Adder architecture is a frequency synthesis tech- From the previous brief description of the Flying-Adder
nique that implements the time–average-frequency concept in working scheme, it is clear that the key operation is accumula-
real circuits. Fig. 2 shows the principal idea of this architec- tion, which is realized by the VLSI circuit component adder.
ture. As demonstrated in the numerical examples presented in Fig. 3 is the conceptual circuit structure of an -bit adder
[5, Section II.E], the waveform of CLKOUT is composed by (RCA). The augend and addend are all represented on base- .
utilizing the selected outputs from an -output VCO/PLL to The basic cell of this adder is the 1-bit full adder, with A, B,
trigger the toggle flip-flop. Assume that the time difference be- and CI as inputs and S and CO as outputs. All the 1-bit full
tween any two adjacent VCO outputs is (in picoseconds) adders are connected serially to form the adder. The CO of the
and the frequency control word is , where is previous stage is fed to the CI of the next stage. All the CIs
an integer. Also, assume that the initial address value of the have to be propagated to the final stage before the operation can
MUX is (an integer). Then, due to the operation of be considered complete. The adder output is stored in a register
accumulation, the address value will progress in the pattern of that is controlled by a clock.
. Each of these address values will Fig. 4 is the accumulator used in the Flying-Adder ar-
select a corresponding output from equally spaced VCO out- chitecture. As shown, its output is fed to the input for the
puts, and the rising edge of the currently selected signal triggers next cycle of addition. The number stored in the register,
the flip-flop, which, in turn, generates an edge for the output , is the result for the
signal CLKOUT. The period (frequency) of the CLKOUT is current clock cycle. One special feature of the Flying-Adder
. accumulator is that its input and result are both real numbers,
Authorized licensed use limited to: TEXAS INSTRUMENTS VIRTUAL LIBRARY. Downloaded on November 10, 2009 at 10:16 from IEEE Xplore. Restrictions apply.
XIU: FAST AND POWER–AREA-EFFICIENT ACCUMULATOR FOR FLYING-ADDER FREQUENCY SYNTHESIZER 2441
(1)
(2)
(3)
Define
From the aforementioned discussion, it is understandable that Equation (6) proves that the sums of the two approaches are
we are only interested in the number of carry-ins within a given equal for every accumulations. For time–average-frequency
time frame. We do not care when the carry-ins actually occur and definition and Flying-Adder implementation, this is precisely
Authorized licensed use limited to: TEXAS INSTRUMENTS VIRTUAL LIBRARY. Downloaded on November 10, 2009 at 10:16 from IEEE Xplore. Restrictions apply.
2442 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 56, NO. 11, NOVEMBER 2009
A. Study of Accumulators
A full adder, with inputs A, B, and CI and outputs S and CO, is
constructed by using the logic operator of the Simulink package.
A conventional 6-bit RCA is then built upon it, as shown in
Fig. 6. The adder of the new approach is constructed according
to Fig. 7. As shown, there are memory units between the CO and
CI of adjacent full adders, which mimic the registers in the real
circuit. The sum bits in all the full adders are fed back to one of
the inputs (b pin) of the corresponding units, also through em-
ploying the memory units, to achieve the accumulation. How-
ever, these memory units are instantiated in the next hierarchy
level (not shown here). Both adders are used to do the accumu-
lation in the fractional part. The decimal point is immediately
before the MSB. The carry-in of the full adder at the MSB posi-
tion (leftmost) is the focus. Using these adders, (1) becomes (7)
since base- is two and
(7)
Fig. 9. Accumulation result of r = :001001b.
Fig. 8 is the accumulation result when (
and ). As expected, the addi-
tion result from the conventional adder is a linearly increased well. However, after every 64 cycles, there is one carry-in gen-
straight line since it adds one to itself at every clock cycle. After erated for both cases, just as proved in Section III.
every cycles, a carry-in is generated at the MSB Fig. 9 is the case of ( and
position. For the case of the new approach, the result is different. ). As expected, the accumulation result
Most of the time, the result at each clock cycle is not correct increases monotonically for the conventional case. Furthermore,
since it is different from that of the linear curve. In addition, the for the new approach, the result is different. However, for any
locations of the carry-ins of the two approaches are different as window of 64 cycles, the numbers of carry-ins generated are
Authorized licensed use limited to: TEXAS INSTRUMENTS VIRTUAL LIBRARY. Downloaded on November 10, 2009 at 10:16 from IEEE Xplore. Restrictions apply.
XIU: FAST AND POWER–AREA-EFFICIENT ACCUMULATOR FOR FLYING-ADDER FREQUENCY SYNTHESIZER 2443
Fig. 11. First 130 cycles for the case of F REQ = 62:125.
the same for both approaches, which are nine in this setting.
Fig. 10 shows the total number of carry-ins after a long time of
simulation (700 clock cycles). The first and fourth rows (from
the top) are the adder outputs at each clock cycle. The third and
sixth rows are the carry-ins generated. The second and fifth rows
are the total numbers of carry-ins accumulated at the current
time. For the Flying-Adder operation, the information presented
in the second and fifth rows is the careabouts. It is clear that both
approaches produce the same output.
Authorized licensed use limited to: TEXAS INSTRUMENTS VIRTUAL LIBRARY. Downloaded on November 10, 2009 at 10:16 from IEEE Xplore. Restrictions apply.
2444 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 56, NO. 11, NOVEMBER 2009
Authorized licensed use limited to: TEXAS INSTRUMENTS VIRTUAL LIBRARY. Downloaded on November 10, 2009 at 10:16 from IEEE Xplore. Restrictions apply.
XIU: FAST AND POWER–AREA-EFFICIENT ACCUMULATOR FOR FLYING-ADDER FREQUENCY SYNTHESIZER 2445
TABLE I
SPEED COMPARISON OF VARIOUS ADDER SIZES
TABLE II
AREA COMPARISON OF VARIOUS ADDER SIZES
much smaller. For the VCO of 32 outputs, the adder size is 5 bits.
It will be 4 or 3 bits for the VCOs of 16 or 8 outputs, respectively.
Table I lists the logic synthesis results of the circuit speeds of
various adder sizes, using the 90-nm process. The integer part is
3 bits. As expected, the circuit speeds vary with the circuit sizes.
For the case of the new scheme, the speed is fixed at 0.43 ns. This
is due to the fact that the integer part is always 3 bits, regardless
of the whole adder size.
Authorized licensed use limited to: TEXAS INSTRUMENTS VIRTUAL LIBRARY. Downloaded on November 10, 2009 at 10:16 from IEEE Xplore. Restrictions apply.
2446 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 56, NO. 11, NOVEMBER 2009
Fig. 19. Random dithering result for F REQ = 62:125 (Left) For the original. Fig. 20. Random dithering result for F REQ = 62:015625. (Left) For the
original. (Right) For dithering.
(Right) For dithering.
bit are toggling at their maximum rates. In a real working envi- carry-ins produced by the new scheme is unpredictable (i.e.,
ronment, this hardly happens, and consequently, the power-con- the mechanism has not been understood yet). However, the pat-
sumption number will be lower. The supply voltage is 1.1 V. tern still seems periodic, which produces spurious components.
This table shows that the new accumulator uses roughly one-half Thus, the question is:
of the power consumed by the conventional one for all three op- Is it possible to build an accumulator which can achieve
erating frequencies. Also evident is that power consumption is the same average effect (i.e., generate the same number of
linearly proportional to operating speed. carry-ins in a given time frame) but break the periodical
characteristic?
D. Design-Effort Comparison
The data collected in Tables I–III demonstrate the advantages VIII. TIME–AVERAGE-FREQUENCY AND JITTER
of the new scheme in terms of area, speed, and power consump- Clock signal is one of the most important signals in any
tion. Aside from these benefits, another interesting comparison VLSI chip. When clock signal is used in systems, other than
is the design effort. For the case of the conventional accumu- frequency, the most crucial characteristic is its edge uncer-
lator, the design has to be carried out in the fashion of HDL tainty. The source of clock edge uncertainty can be investigated
coding, logic simulation, and logic synthesis. This requires an from two directions: the uncertainty caused by the generator
HDL simulation tool and a synthesis tool (and associated li- (PLL, crystal oscillator, etc.) and the uncertainty caused by
cense fees). Moreover, the designer is required to have the HDL the clock distribution network (clock tree). The uncertainty
coding skill and the knowledge of using the aforementioned of the latter type is also termed as clock skew. The issue of
tools. This is often not the case for analog-circuit designers. On clock skew involves more than one clock sink, often hundreds
the other hand, using the new scheme, the designer can easily or thousands of sinks. This type of edge uncertainty will not
build the accumulator by a schematic capture tool, which analog be discussed in this paper since it has already been intensively
designers use every day. studied [19]–[21].
The edge uncertainty associated with the clock generator can
VII. PROBLEM FOR FUTURE WORK be caused by the phase noise induced by white noise, flicker
As discussed in [5] and shown in Figs. 12 and 13, there are noise, and thermal noise [22]–[27] or by modulations activated
spurious components in the output clock’s spectrum when a by certain deterministic signals, such as the input reference and
fractional number is used. These spurs can be converted into the noise in power supplies. This type of edge uncertainty is
noise by adding a modulation signal to , as shown in [5, usually called clock jitter. Time–average frequency, by its defi-
Fig. 23]. The modulation function could be a random number nition, introduces clock edge movement [5]. However, this edge
generator. For example, for the case of movement is NOT uncertain but precisely controlled. Therefore,
(Fig. 12), the spurs can be cleared out, as shown in the right it is fundamentally different than the jitter and skew.
plot of Fig. 19. As another example, Fig. 20 shows the result of As addressed before, when the control word takes
(Fig. 13). These plots demonstrate that , , the resulting clock waveform is composed
random wandering can break the periodical characteristic asso- of two types of cycles: and . The rate of
ciated with a fractional number and consequently eliminate the cycle occurrence depends on . For the case of
spurs. This brings up an interesting question: Can the random (Figs. 14 and 15), there are 15 cycles of and one
mechanism be directly built into the accumulator to save the cycle of for every 16 cycles. Fig. 21 is the histogram of
silicon area of extra circuitry? instant periods from SPICE simulation. The instant periods are
Referring back to Figs. 8 and 9, it is clear that the occur- distributed as two separate branches, i.e., left of and right
rences of carry-ins are totally different between the two types of . The fraction results in 6.25% of the total
of accumulator. The carry-ins generated by the conventional ac- cycles on the right and 93.75% of the total cycles on the left.
cumulator is periodical and completely predictable, while the The average frequency is somewhere in between the left and
Authorized licensed use limited to: TEXAS INSTRUMENTS VIRTUAL LIBRARY. Downloaded on November 10, 2009 at 10:16 from IEEE Xplore. Restrictions apply.
XIU: FAST AND POWER–AREA-EFFICIENT ACCUMULATOR FOR FLYING-ADDER FREQUENCY SYNTHESIZER 2447
Fig. 24. LVDS data and clock when driven by frequency and time–average
frequency. (a) Eye diagram of F REQ = 12
. (b) Eye diagram of F REQ =
11 36842
: (time–average-frequency).
Fig. 22. Histogram of instant period when F REQ = 12:5.
When the time–average-frequency-based clock signal is used
to drive a digital system, the timing closure (setup check) should
be carried out by using the constraint of the left branch ( in
Figs. 21 and 22 and in Fig. 23). As long as the system can
be timing closured, the size of should not be included in
the jitter calculation. Jitter is the edge uncertainty of the left
branch (Gaussian distribution). Time–average-frequency does
not have any impact on hold check since hold check only con-
cerns one clock edge.
When the time–average-frequency-based clock signal is used
to drive systems that have embedded analog components (ADC/
Fig. 23. Histogram of instant period when F REQ = 11:75. DAC), the spurious signals introduced by fractional number cer-
tainly need attention. In the case of fractional- PLL, the spurs
are usually close to the carrier since the carrier is modulated
right branches. The mission of time–average frequency can be
by the frequency of fractional number times the input reference
appreciated from the following calculation:
frequency. These close-in spurs usually have harmful impacts
on system operation. However, in the Flying-Adder PLL case,
the spurs are far away from the carrier. The reason is that the
In other words, if we have difficulty in creating a PLL that spurs are calculated from fractional number times the carrier fre-
shall generate a frequency of 716.33 MHz, we can use an quency. This is evident in Figs. 14 and 15 where the spurs are
FAPLL of to mimic this frequency. From 44 MHz away. When fractional number is extremely small,
the application point of view, the end result is the same: Within the spurs do move closer to the carrier. However, the magnitude
22.34 ns, there are 16 operations. Figs. 22 and 23 are two of the spurs in these cases decreases significantly (Fig. 20 and
additional examples to illustrate this point. Using the evidence [5]). Therefore, the spurious signal in FAPLL is not as severe as
of Figs. 14, 15, 18, and many others, we can also opine that in the case of fractional- PLL.
fractional number is the “controlled noise,” which can result in The time–average-frequency-based clock signal has been
the desired frequency shift precisely. This statement is more used in commercial products for almost ten years. Not a
persuasive when is small, such as in the case of DCXO [8]. single case of problem, related to time–average-frequency,
A fractional- PLL also uses a fractional number to achieve has been reported. Fig. 24 is the case where the time–av-
some average effect. However, the resulting clock signal cannot erage-frequency-based clock signal is used to drive the LVDS
be qualified as time–average-frequency. Its clock edge distribu- transmitter. In Fig. 24(a), an FAPLL is used to generate
tion is not in the shape of these shown in Figs. 21–23. Its edge 85.5 MHz for graphic WXGA mode (1360 768 at 60 Hz).
movement is not directly controlled. This clock is used to drive a 7 PLL to serialize the data. The
Authorized licensed use limited to: TEXAS INSTRUMENTS VIRTUAL LIBRARY. Downloaded on November 10, 2009 at 10:16 from IEEE Xplore. Restrictions apply.
2448 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 56, NO. 11, NOVEMBER 2009
top waveform is the data, the bottom is the clock. The setting of [11] L. Xiu, “A Flying-Adder PLL technique enabling novel approaches for
FAPLL is as follows: MHz, , , , video/graphic applications,” IEEE Trans. Consum. Electron., vol. 54,
no. 2, pp. 591–599, May 2008.
, and . This results in MHz. [12] C. Nagendra, M. J. Irwin, and R. M. Owens, “Area–time–power trade-
is the prescalar divider ratio, is the PLL divider ratio, is offs in parallel adders,” IEEE Trans. Circuits Syst. II, Analog Digit.
the number of VCO outputs, is the postdivider ratio [please Signal Process., vol. 43, no. 10, pp. 689–702, Oct. 1996.
[13] H. T. Bui, Y. Wang, and Y. Jiang, “Design and analysis of low-power
see [7, Fig. 2 and eq. (4)]]. In this case, there is no fractional 10-transistor full adders using novel XOR-XNOR gates,” IEEE Trans.
number used in (i.e., it is not time–average-frequency). Circuits Syst. II, Analog Digit. Signal Process., vol. 49, no. 1, pp.
In Fig. 24(b), the FAPLL setting is MHz, , 25–30, Jan. 2002.
[14] A. M. Shams, T. K. Darwish, and M. A. Bayoumi, “Performance anal-
, , , and . This ysis of low-power 1-bit CMOS full adder cells,” IEEE Trans. Very
also results in MHz. is about 103 ps in this case. Large Scale Integr. (VLSI) Syst., vol. 10, no. 1, pp. 20–29, Feb. 2002.
The peak-to-peak jitter in case (a) is about 64 ps. It is about [15] M. Alioto and G. Palumbo, “Analysis and comparison on full adder
block in submicron technology,” IEEE Trans. Very Large Scale Integr.
70 ps in case (b) (left branch). There is hardly any visually (VLSI) Syst., vol. 10, no. 6, pp. 806–823, Dec. 2002.
noticeable difference between (a) and (b). [16] S. Goel, A. Kumar, and M. A. Bayoumi, “Design of robust, energy-ef-
ficient full adders for deep-submicrometer design using hybrid-CMOS
IX. CONCLUSION logic style,” IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 14,
no. 12, pp. 1309–1321, Dec. 2006.
This paper has demonstrated a new scheme for constructing [17] A. M. Shams, T. K. Darwish, and M. A. Bayoumi, “Performance anal-
the accumulator used in the Flying-Adder frequency synthesis ysis of low-power 1-bit CMOS full adder cells,” IEEE Trans. Very
Large Scale Integr. (VLSI) Syst., vol. 10, no. 1, pp. 20–29, Feb. 2002.
architecture. The validity of the new scheme has been proved [18] J. F. Lin, Y. T. Hwang, M. H. Sheu, and C. C. Ho, “A novel high-
from several directions. The advantages of this new scheme are speed and energy efficient 10-transistor full adder design,” IEEE Trans.
faster circuit speed, smaller circuit area, and lower power con- Circuits Syst. I, Reg. Papers, vol. 54, no. 5, pp. 1050–1059, May 2007.
[19] D. Harris and S. Naffziger, “Statistical clock skew modeling with data
sumption. This scheme solves a long-lasting problem in the im- delay variations,” IEEE Trans. Very Large Scale Integr. (VLSI) Syst.,
plementation of the Flying-Adder circuitry and will have pro- vol. 9, no. 6, pp. 888–898, Dec. 2001.
found impacts on all future Flying-Adder applications. We be- [20] X. Jiang and S. Horiguchi, “Statistical skew modeling for general clock
distribution networks in presence of process variations,” IEEE Trans.
lieve that this new approach can also be used in the accumulator Very Large Scale Integr. (VLSI) Syst., vol. 9, no. 5, pp. 704–717, Oct.
of fractional- PLL since the carry-in is the only information 2001.
needed in that case as well. The important issue of time–average [21] D. Harris, M. Horowitz, and D. Liu, “Timing analysis including clock
skew,” IEEE Trans. Comput.-Aided Design Integr. Circuits Syst., vol.
frequency and jitter has also been addressed in this paper. 18, no. 11, pp. 1608–1618, Nov. 1999.
[22] A. A. Abidi, “Phase noise and jitter in CMOS ring oscillators,” IEEE
REFERENCES J. Solid-State Circuits, vol. 41, no. 8, pp. 1803–1816, Aug. 2006.
[1] H. Mair and L. Xiu, “An architecture of high-performance frequency [23] V. F. Kroupa, “Jitter and phase noise in frequency divider,” IEEE Trans.
and phase synthesis,” IEEE J. Solid-State Circuits, vol. 35, no. 6, pp. Instrum. Meas., vol. 50, no. 5, pp. 1241–1243, Oct. 2001.
835–846, Jun. 2000. [24] A. Demir, “Phase noise and timing jitter in oscillators with colored-
[2] L. Xiu and Z. You, “A Flying-Adder architecture of frequency and noise sources,” IEEE Trans. Circuits Syst. I, Fundam. Theory Appl.,
phase synthesis with scalability,” IEEE Trans. Very Large Scale Integr. vol. 49, no. 12, pp. 1782–1791, Dec. 2002.
(VLSI) Syst., vol. 10, no. 5, pp. 637–649, Oct. 2002. [25] B. H. Leung and D. Mcleish, “Investigation of phase noise of ring os-
[3] L. Xiu and Z. You, “A Flying-Adder frequency synthesis architecture cillators with time-varying current and noise sources by time-scaling
of reducing VCO stages,” IEEE Trans. Very Large Scale Integr. (VLSI) thermal noise,” IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 51, no.
Syst., vol. 13, no. 2, pp. 201–210, Feb. 2005. 10, pp. 1926–1939, Oct. 2004.
[4] L. Xiu and Z. You, “A new frequency synthesis method based on [26] B. H. Leung, “A novel model on phase noise of ring oscillator based
Flying-Adder architecture,” IEEE Trans. Circuits Syst. II, Analog on last passage time,” IEEE Trans. Circuits Syst. I, Reg. Papers, vol.
Digit. Signal Process., vol. 50, no. 3, pp. 130–134, Mar. 2003. 51, no. 3, pp. 471–482, Mar. 2004.
[5] L. Xiu, “The concept of time–average-frequency and mathematical [27] T. C. Weigandt, B. Kim, and P. R. Gray, “Analysis of timing jitter in
analysis of Flying-Adder frequency synthesis architecture,” IEEE CMOS ring oscillators,” in Proc. ISCAS, May 1994, vol. 4, pp. 27–30.
Circuits Syst. Mag., vol. 8, no. 3, pp. 27–51, 3rd Quarter 2008.
[6] L. Xiu, “Some open issues associated with a new type component: Dig-
ital-to-frequency converter,” IEEE Circuits Syst. Mag., vol. 8, no. 3, pp.
90–94, 3rd Quarter 2008. Liming Xiu (M’95–SM’03) was a Senior Member
[7] L. Xiu, “A Flying-Adder on-chip frequency generator for complex SoC of the Technical Staff of Texas Instruments Incor-
environment,” IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 54, no. 12, porated, Dallas, TX. He has done significant amount
pp. 1067–1071, Dec. 2007. of work/research in the area of frequency synthesis.
[8] L. Xiu, “A novel DCXO module for clock synchronization in MPEG2 He is the Principal Inventor of the Flying-Adder
transport system,” IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 55, frequency and phase synthesis architecture, which
no. 8, pp. 2226–2237, Sep. 2008. has been used in many commercial products. He is a
[9] L. Xiu, W. Li, J. Meiners, and R. Padakanti, “A novel all digital phase holder of 12 granted and pending U.S. patents in this
lock loop with software adaptive filter,” IEEE J. Solid-State Circuits, area. He is also an expert in VLSI system-on-a-chip
vol. 39, no. 3, pp. 476–483, Mar. 2004. integration, with battle-proven integration experi-
[10] L. Xiu, S. Clynes, S. Gurrapu, T. Haider, F. Ying, and W. Mohammad, ence on several very large chips. In this area, he
“Flying-Adder PLL based synchronization mechanism for data packet has one book entitled “VLSI Circuit Design Methodology Demystified: A
transport,” in Proc. 6th IEEE DCAS Workshop, System-on-Chip, Nov. Conceptual Taxonomy” (Wiley, 2007). He is currently the IEEE Circuits and
15–16, 2007, pp. 1–5. Systems Society Vice President of Regions 1–7 for 2009–2010.
Authorized licensed
View publication stats use limited to: TEXAS INSTRUMENTS VIRTUAL LIBRARY. Downloaded on November 10, 2009 at 10:16 from IEEE Xplore. Restrictions apply.