You are on page 1of 6

IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 20, NO.

7, JULY 2012 1161

A Low-Power Low-Cost Design of Primary


Synchronization Signal Detection
Chixiang Ma, Student Member, IEEE, Hao Cao, and Ping Lin, Member, IEEE

Abstract—Synchronization is an important component of a the performance of search time. Consequently, it was decided
practical communication system. Furthermore, network entry to adopt Zadoff-Chu (ZC) sequences as the downlink primary
including synchronization is important. Since the detection of synchronization signal (PSS) and the uplink random access pre-
primary synchronization signal (PSS) is the first step of network
entry in long term evolution (LTE) systems, thus it may be a amble. The ZC sequences are a group of general-chirp-like se-
critical path for practical systems. Therefore, tradeoff between quences with good correlation properties [1]. To identify the cell
performance and low power consumption and low cost of PSS and obtain synchronization, PSS is detected while cell search
detection needs to be made carefully. This paper presents a takes place. Currently used matched filters [2]–[4] are compu-
new synchronization method for low power and low cost design. tation-intensive since they require a large number of constant
The approach of a 1-bit analog-to-digital converter (ADC) with
down-sampling is compared with that of a 10-bit ADC without complex multiplications.
down-sampling under multi-path fading conditions defined in The main objective of this paper is to propose an efficient
LTE standard for user equipment (UE) performance test [5]. The and accurate PSS detection method with low power and low
simulation results of PSS are obtained on several kinds of chan- cost. The system model, channel model, and PSS definition
nels. The simulation results explicitly show that the performance are introduced in Section II. A brief review of the matched
of the method with down-sampling for 1-bit ADC does not degrade
even if frequency offset exists. Based on the simulation results, filter approach is presented in Section III. Afterwards, both the
different implementation architectures and their synthesis report method of 1-bit ADC with down-sampling and that of 10-bit
and analysis are present. A low-power low-cost design with high ADC without down-sampling for PSS detection are discussed
performance to detect PSS is derived in this paper. in Section IV whereas their simulation results are shown in
Index Terms—Low cost, low power, matched filter, primary syn- Section V. Section VI addresses different implementation
chronization signal (PSS). architectures of PSS detection. Finally, conclusion remarks are
given in Section VII.
I. INTRODUCTION II. SYSTEM MODEL AND PROBLEM DEFINITION

T HE explosive growth of cell phone users and the in-


creasing demand for broadband wireless access has led
to the development of long term evolution (LTE) to replace the
A. OFDM System Model With Carrier Frequency Offset (CFO)
3GPP adopt OFDM to improve spectrum efficiency. In
wideband code division multiple access (WCDMA)-based air OFDM systems, a sequence of complex data symbols is
interface by the 3rd Generation Partnership Project (3GPP). considered as orthogonal subcarriers during the th OFDM
Several minimum requirements of LTE include packet data block, the sequence of data symbols is defined as follows:
support with peak data rates of 300 Mbps in the downlink and (1)
75 Mbps in the uplink, a low maximum latency of 10 ms MAC
layer round trip delay, and flexible bandwidth scalability. These The sequence of data symbols is modulated using an -point
requirements result to the adoption of orthogonal frequency inverse discrete Fourier transform (IDFT) process that produces
division multiplexing (OFDM)-based modulation and mul- the sequence
tiple access, multiple-input-multiple-output (MIMO) antenna
schemes, and adaptive modulation and coding with advanced (2)
channel coding, space time coding and hybrid automatic repeat where is the normalized -by- IDFT matrix and is
request (ARQ) protocols.
Synchronization sequence is more important because its de- (3)
tection affects not only search time but also performance of
demodulation. The 3GPP working group undertakes plenty of Consequently, the th sample in the sequence can be ex-
rigorous evaluation of different kinds of sequence to enhance pressed as

Manuscript received August 26, 2010; revised January 20, 2011; accepted (4)
April 20, 2011. Date of publication May 31, 2011; date of current version June
01, 2012.
The authors are with Beijing Embedded System Key Lab, Beijing Univer- In fading channels, a time-domain guard interval, which is
sity of Technology, Beijing 100124, China (e-mail: machixiang@emails.bjut. named as cyclic prefix (CP), is created by copying the last
edu.cn; jackycaohao@emails.bjut.edu.cn; pinglin@bjut.edu.cn). samples of the IDFT output and appending them at the begin-
Color versions of one or more of the figures in this paper are available online
at http://ieeexplore.ieee.org. ning of the OFDM symbol to be transmitted. So the transmitted
Digital Object Identifier 10.1109/TVLSI.2011.2152866 OFDM block consists of samples.
1063-8210/$26.00 © 2011 IEEE
1162 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 20, NO. 7, JULY 2012

TABLE I TABLE II
DELAY PROFILES FOR E-UTRA CHANNEL MODELS EXTENDED PEDESTRIAN MODEL

TABLE III
EXTENDED VEHICULAR MODEL

At the receiver side, after removing the first CP samples,


the received sequence

(5)

is obtained [9]

(6)

where represents the normalized CFO, and represents


the effect of the accumulated phase rotation caused by the CFO
TABLE IV
on the time domain samples EXTENDED TYPICAL URBAN MODEL

(7)
(8)

denotes the channel frequency response during the th


OFDM block

(9)

represents a zero-mean complex white Gaussian noise


sample with variance .
Assuming that the receiver sampling clock is aligned to that TABLE V
of the transmitter, then the th element of can be expressed CHANNEL MODEL PARAMETERS
as

(10)

B. Channel Propagation Model


The evolved universal terrestrial radio access (E-UTRA)
channel model is recommended, since PSS is a component of
LTE. It can evaluate the proposed method more reasonable and The Doppler spectra are selected to be representative of low,
practical to use of E-UTRA channel model. medium and high Doppler spread environments. Then, the max
There are delay profiles, Doppler spectra and channel corre- Doppler frequency is 5, 70, and 300 Hz, respectively.
lation matrices in E-UTRA channel model. Then there are too The combinations of delay profiles and Doppler spectra are
many combinations of these components. First of all, delay pro- also defined in [5]. Only five combinations can be used to eval-
files are introduced. uate the performance measurements of receiver in multi-path
The delay profiles are selected to be representative of low, fading environment. Table V shows all the combinations.
medium and high delay spread environments. The resulting Finally, the channel correlation matrices are also selected
model parameters are defined in Table I and the tapped delay to be representative of low, medium and high channel correla-
line models are defined in Tables II–IV [5]. tion environments. Since 4-by-4 simulations will be given in
MA et al.: A LOW-POWER LOW-COST DESIGN OF PRIMARY SYNCHRONIZATION SIGNAL DETECTION 1163

TABLE VI The sequence in (12) is mapped to the subcarriers around DC


DIFFERENT CORRELATION CONSTANT and transformed to time domain by 64-point IDFT. To detect
this signal at the receiver, the correlation with the time-
domain signal of the ZC sequence is calculated [2]–[4]

(13)

where is the successive 64-by-1 received signal vector, is


TABLE VII
ROOT INDICES FOR THE PSS the DFT matrix, and is 64-by-1 vector composed of
punctured at DC.
Then, from (13), the coefficients of the matched filter can be
obtained

(14)

where

next section, thus 4-by-4 correlation matrices is provided as


follows [5]: (15)

and the matched filter can be expressed

(16)

(11) where is the received signal.

IV. PRACTICAL DETECTION METHOD


where and is define in Table VI [5], and denotes Kro-
necker product. From the power consumption perspective, a 10-bit analog-to-
digital converter (ADC) uses more power than a 1-bit ADC since
C. PSS the 10-bit pipelined ADC has several power amplifiers in it.
Typically, the power consumption of a 1-bit 122.88 MHz ADC
A synchronization channel (SCH) is specified in LTE system composed of one comparator is about 200 W, while the power
to transmit PSS and secondary synchronization signal (SSS) [1]. consumption of a 10-bit 122.88 MHz pipelined ADC is about
The sequence used for the PSS is generated from a 50 mW. To come up with a low-power solution, a method of
frequency-domain ZC sequence [1] according to PSS detection using 1-bit ADC is proposed.
PSS is transmitted periodically, twice per frame which lasts
(12) 10 ms. The sampling rate of the receiver is 122.88 MHz; how-
ever, the date rate of input data to the matched filter is 1.92 MHz.
Thus, 9600 samples at the output of the matched filter need
where the ZC root sequence index is given by Table VII [1]. to be buffered during the 5 ms period, which is not area and
The three different ZC sequences are orthogonal to each cost efficient. To come up with a low cost solution, a method of
other, and each sequence corresponds to a sector identity which down-sampling by 8 is used at the output of matched filter.
is in the range of 0 to 2. The ZC sequence is chosen for its
good periodic autocorrelation and cross-correlation properties. A. Method Without Down-Sampling by 8 for 10-Bit ADC
In particular, these sequences have a low frequency offset
sensitivity, which is described in [8]. Thus, it is easy to detect From the last section, the matched filter as expressed in (16)
PSS during the initial synchronization because the ZC sequence can be reformulated when using a 10-bit, 122.88 MHz pipelined
has the flat frequency domain autocorrelation property and the ADC
low frequency offset sensitivity.
(17)
III. FUNDAMENTAL DETECTION METHOD
The main function of PSS is to detect the boundary of a frame where is the received signal sampled by a 10-bit, 122.88
where non-coherent detection method has to be used at the re- MHz pipelined ADC, and is obtained in (14) and (15).
ceiver since there is no known reference information initially. Every output of the matched filter is buffered since
Matched filter is a basic non-coherent detection method that there is no down-sampling module, and it needs a large area
can be used to detect PSS efficiently. buffer which is very costly.
1164 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 20, NO. 7, JULY 2012

TABLE VIII
SIMULATION ASSUMPTIONS

B. Method With Down-Sampling by 8 for 1-Bit ADC


Equation (16) can be reformulated when using a 1-bit, 122.88
MHz ADC

(18)

where is the received signal sampled by a 1-bit, 122.88


Fig. 1. Performance of both methods using low correlation channel matrix and
MHz ADC, and is obtained in (14) and (15). EPA 5 Hz channel model.
Every output of the matched filter is down-sampled by
8
proposed method under three typical channel combinations,
which is EPA 5 Hz model with low correlation channel matrix,
(19) EVA 70 Hz model with medium correlation channel matrix and
ETU 300 Hz model with high correlation channel matrix.
Fig. 1 shows the search time of both methods under EPA 5
where is the output of the down-sampling module. Hz model with low correlation channel matrix, which indicates
Now, only 1200 outputs need to be buffered during 5 ms with that their performance is very close to each other. The results
an additional comparator of 1 out of 8 implementing the down- for EVA 70 Hz and ETU 300 Hz are shown in Figs. 2 and 3,
sampling module. This results in less area which translates to respectively. From Figs. 2 and 3, we can see that the method
lower cost in a practical system. using a 1-bit 122.88 MHz ADC with down-sampling by 8 does
With the practical method introduced above, its implementa- not degrade the performance much even if the signal-to-noise
tion architecture is discussed in Section VI after the simulation ratio (SNR) is very low. When SNR is larger than 10 dB, their
performance is discussed in Section V. performance are almost identical. As a result, the method of
1-bit 122.88 MHz ADC with down-sampling by 8 is proposed
V. SIMULATION RESULTS as the low power and low cost design for PSS detection with
Primary synchronous signal is designed for cell search and good search performance.
handover in 3GPP LTE systems, which is transmitted every 5
ms. Search time of PSS detection is an important criterion when VI. HARDWARE IMPLEMENTATION
measuring its performance.
To compare the performance using a 10-bit 122.88 MHz ADC As discussed in the previous section, the performance of the
without down-sampling and that using a 1-bit 122.88 MHz ADC proposed method for PSS detection is acceptable in a practical
with down-sampling by 8, the parameters listed in Table VIII are LTE system; thus, its implementation detail is described in this
used in the simulation. section where the matched filter is considered first followed by
We assume that there are four receive antennas and four the architecture of our proposed PSS detector.
transmit antennas in the simulated LTE MIMO system.
A. Architecture of Matched Filter
Replica-based method is very useful for symbol timing de-
tection since a diversity gain of 3 dB can be obtained when The matched filter is an important component in the PSS
two PSSs are received in different time slot. Higher diversity detection, as denoted in Section III. We use 64-tap time do-
gain can be achieved when more than two PSSs are used in the main matched filter; hence 64 complex multiplication units per
detection. At most 16 PSSs are transmitted in the simulation, matched filter are used in the calculation in (16).
that is, the detection gives up after 16 PSS correlations are Since 84 matched filters are required in the system, a total
calculated at the receiver. From Section II, we know that there of 5376 units of complex multiplication is needed, which is not
are different combinations of delay profiles, Doppler spectra reasonable for a practical implementation due to the high cost
and channel correlation matrices defined in E-UTRA channel of multiplication unit in the receiver. In practice, the sampling
model. To demonstrate different simulation assumptions in the rate of input data to the matched filter is 1.92 MHz while the
channel model, we simulate both the original method and the system clock is 122.88 MHz, which implies that we can use
MA et al.: A LOW-POWER LOW-COST DESIGN OF PRIMARY SYNCHRONIZATION SIGNAL DETECTION 1165

Fig. 4. Matched filter architecture with one complex multiply unit.

Fig. 2. Performance of both methods using medium correlation channel matrix


and EVA 70 Hz channel model.

Fig. 5. Original architecture of the whole PSS detection.

Fig. 6. Area-efficient architecture of the whole PSS detection.

84 such hardware units are involved in the architecture of the


Fig. 3. Performance of both methods using high correlation channel matrix and PSS detection.
ETU 300 Hz channel model. As the sampling rate of the input data to PSS detection is 1.92
MHz and the PSS signal is repetitively transmitted every 5 ms,
there are 9600 samples during 5 ms and thus a single port RAM
only one complex multiplication unit during 64 cycles instead of with 9600 addresses is needed.
using 64 units of complex multiplication. Thus we propose the As described above, there are 84 such RAMs in the system,
structure of matched filter shown in Fig. 4. As a result, 84 units and the area is too large for the UE chip; therefore an area-
of complex multiplication are enough for the whole system. efficient architecture is proposed as shown in Fig. 6. Compared
to the architecture in Fig. 5, a small RAM with 8 addresses is
B. Architecture of PSS Detection added whose function is to find the maximum value of every
A mismatch of up to 14 part per million (ppm) can exist be- eight correlations. As a result, only 1200 correlation values need
tween the oscillators at the base station (eNodeB) and at the UE, to be stored in RAM with 1200 addresses, which reduce the
so seven groups of matched filters are used to cover the range RAM size of the whole system by a factor of almost 8.
of [ 14 ppm, 14 ppm] with each group responsible for 4 ppm As the implementation is targeted for application-specific in-
corresponding to 2/3 subcarrier spacing. Each group contains tegrated circuit (ASIC) chip, the two different architectures are
three matched filters to detect three different physical-layer IDs synthesized using SMIC 65-nm technology at 1.2 V voltage.
of value 0, 1, or 2. Therefore, there are 21 hardware units as The area and power reports are listed in Table IX. We can ob-
shown in Fig. 6 for each receiver antenna. Since the system is serve that the area of the area-efficient architecture is much
MIMO 4-by-4 and there are 4 receiver antennas at the UE end, smaller than that of the original architecture, which reduces the
1166 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 20, NO. 7, JULY 2012

TABLE IX [4] P-SCH Sequences, Huawei, Kobe, Japan, “3GPP TSG RAN WG1
COMPARISONS OF TWO ARCHITECTURES Tdoc R1-072321,” 2007.
[5] 3GPP TS 36.101 v8.9.0 3rd Generation Partnership Project; Technical
Specification Group Radio Access Network; Evolved Universal Ter-
restrial Radio Access (E-UTRA); User Equipment (UE) Radio Trans-
mission and Reception (Release 8), 3rd Generation Partnership Project,
Tech. Rep., Dec. 2009, 3GPP.
[6] G. Colavolpe and R. Raheli, “Noncoherent sequence detection,” IEEE
Trans. Commun., vol. 47, no. 9, pp. 1376–1385, Sep. 1999.
[7] G. L. Stuiber, Principles of Mobile Communication, 2nd ed. Norwell,
cost of the chip significantly. From the power perspective, not MA: Kluwer, 2001.
only the 1-bit ADC reduces the power consumption, but the [8] S. Sesia, I. Toufik, and M. Baker, LTE-The UMTS Long Term Evolu-
hardware of digital logic also does. tion: From Theory to Practice. New York: Wiley, 2009.
[9] Y. Yao and G. B. Giannakis, “Blind carrier frequency offset estima-
tion in SISO, MIMO and multiuser OFDM systems,” IEEE Trans.
VII. CONCLUSION Commun., vol. 53, no. 1, pp. 173–183, Jan. 2005.
In this paper, we address the problem of detecting primary
synchronization signal in 3GPP LTE system. Down-sampling
block, 10-bit 122.88 MHz ADC and 1-bit 122.88 MHz ADC are Chixiang Ma (S’10) received the B.S. degree in
basic components of PSS detection methods. Theoretically, de- electrical engineering from Zhejiang University,
Hangzhou, China, in 2005. He is currently pursuing
tection with 1-bit ADC and with down-sampling would degrade the Ph.D. degree from Beijing Embedded System
the performance and prolong the detection time. However, due Key Lab of Beijing University of Technology,
to the inherent advantage of the ZC sequence, our simulation re- Beijing, China.
His research interests include MIMO and OFDM
sults show that the performance of the proposed method using of wireless communication systems and VLSI
a 1-bit ADC with down-sampling by 8 does not degrade much design.
compared with that using a 10-bit ADC without down-sampling
in the presence of frequency offset under several typical LTE
propagation channels. Subsequently, two different implementa-
tion architectures of the PSS detection are presented. As the area Hao Cao received the B.S. degree in electrical and
and the power consumption of the original implementation ar- information from Huazhong University of Science
chitecture are too large to be acceptable, based on simulation and Technology, Wuhan, China, in 2008. He is
currently pursuing the M.S. degree from Beijing
results and ASIC synthesis results, a more practical implemen- Embedded System Key Lab, Beijing University of
tation architecture is proposed where the PSS can be detected Technology, Beijing, China.
efficiently and accurately at a much lower power and lower cost His research interests include synchronization and
VLSI design.
which renders it feasible in the implementation of a UE chip.

REFERENCES
[1] 3rd Generation Partnership Project (3GPP), Sophia-Antipolis Cedex,
France, 3GPP TS 36.211 v8.9.0 3rd Generation Partnership Project;
Technical Specification Group Radio Access Network; Evolved
Universal Terrestrial Radio Access (E-UTRA); Physical Channels Ping Lin (M’10) received the M.S. degree in elec-
and Modulation (Release 8), 3rd Generation Partnership Project, Dec. trical engineering from University of Rhode Island,
2009, 3GPP. Kingston.
[2] K. Manolakis, D. M. Gutierrez Estevez, V. Jungnickel, X. Wen, and She is the Director of Beijing Embedded System
C. Drewes, “A closed concept for synchronization and cell search in Key Lab, Beijing University of Technology, Beijing,
3GPP LTE systems,” in Proc. IEEE Wirel. Commun. Network. Conf., China. Her research interests include DSP algo-
2009, pp. 1–6. rithms, VLSI design, wireless communications, and
[3] B. M. Popovic and F. Berggren, “Primary synchronization signal in embedded SOC.
E-UTRA,” in Proc. IEEE 10th Int. Symp. Spread Spectrum Techn. Appl.
(ISSSTA), 2008, pp. 426–430.

You might also like