Pilot Conatamination

2640 IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 10, NO.
8, AUGUST 2011
Pilot Contamination and Precoding in

Multi-Cell TDD Systems
Jubin Jose, Student Member, IEEE, Alexei Ashikhmin, Senior Member, IEEE,
Thomas L. Marzetta, Fellow, IEEE, and Sriram Vishwanath, Senior Member, IEEE
Abstract—This paper considers a multi-cell multiple antenna throughput in MIMO downlink [3]–[8]. Time division duplex
system with precoding used at the base stations for downlink (TDD) systems, however, have a fundamentally different ar-
transmission. Channel state information (CSI) is essential for chitecture from the ones studied in FDD systems [9], [10].
precoding at the base stations. An effective technique for ob-
taining this CSI is time-division duplex (TDD) operation where The goal of this paper is to develop a clear understanding
uplink training in conjunction with reciprocity simultaneously of mechanisms for acquiring CSI and subsequently designing
provides the base stations with downlink as well as uplink channel precoding strategies for multi-cell MIMO TDD systems.
estimates. This paper mathematically characterizes the impact Although most current cellular systems are frequency-
that uplink training has on the performance of such multi-
cell multiple antenna systems. When non-orthogonal training division duplex (FDD), there is a compelling case to be made
sequences are used for uplink training, the paper shows that the for time-division duplex (TDD). Arguably the central problem
precoding matrix used by the base station in one cell becomes in advanced wireless systems is the acquisition of CSI in a
corrupted by the channel between that base station and the users timely manner. A major point of this paper is the desirability of
in other cells in an undesirable manner. This paper analyzes having a large excess of base station antennas compared with
this fundamental problem of pilot contamination in multi-cell
systems. Furthermore, it develops a new multi-cell MMSE-based terminals. For fast-changing channels TDD offers the only way
precoding method that mitigates this problem. In addition to to acquire timely CSI since the training burden for reverse-
being linear, this precoding method has a simple closed-form link pilots in a TDD system is independent of the number
expression that results from an intuitive optimization. Numerical of base station antennas, while conversely the training burden
results show significant performance gains compared to certain for forward-link pilots in an FDD system is proportional to
popular single-cell precoding methods.
the number of antennas. Use of FDD would forever impose a
Index Terms—Time-division duplex systems, uplink training, severe limit on the number of base station antennas.
pilot contamination, MMSE precoding.
An important distinguishing feature of TDD systems is the
notion of reciprocity, where the reverse channel is used as an
I. I NTRODUCTION estimate of the forward channel. While utilizing reciprocity,
M ULTIPLE antennas, especially at the base-station, have

now become an accepted and a central feature of
cellular networks. These networks have been studied exten-
the differences in the transfer characteristics of the amplifiers
and the filters in the two directions must be accounted for. This
can be handled by measuring the different scalar gains in the
sively over the past one and a half decades (see [2] and two directions. Arguably, this reverse channel estimation one
references therein). It is now well understood that channel of the best advantages of a TDD architecture, as it eliminates
state information (CSI) at the base station is an essential the need for feedback, and uplink training together with the
component when trying to maximize network throughput. reciprocity of the wireless medium [11], [12] is sufficient to
Systems with varying degrees of CSI have been studied in provide us with the desired CSI. In [13], channel reciprocity
great detail in literature. The primary framework under which has been validated through experiments. However, as we see
these have been studied is frequency division duplex (FDD) next, this channel estimate is not without issues that must be
systems, where the CSI is typically obtained through (limited) addressed before it proves useful.
feedback. There is a rich body of work in jointly designing this In this paper, we consider uplink training and transmit
feedback mechanism with (pre)coding strategies to maximize precoding in a multi-cell scenario with 𝐿 cells, where each
cell consists of a base station with 𝑀 antennas and 𝐾 users
Manuscript received June 29, 2010; revised January 27, 2011; accepted
March 10, 2011. The associate editor coordinating the review of this paper with single antenna each. The impact of uplink training on the
and approving it for publication was A. Hjørungnes. resulting channel estimate (and thus system performance) in
Results in this paper were presented in part at the IEEE International the multi-cell scenario is significantly different from that in a
Symposium on Information Theory (ISIT) [1].
J. Jose and S. Vishwanath are with the Department of Electrical and single-cell scenario. In the multi-cell scenario, non-orthogonal
Computer Engineering, The University of Texas at Austin, Austin, TX training sequences (pilots) must be utilized, as orthogonal
78712 USA (e-mail: {jubin, sriram}@austin.utexas.edu). The authors were pilots would need to be least 𝐾 × 𝐿 symbols long which is
supported in part by the DoD and the ARO Young Investigator Program
(YIP). infeasible for large 𝐿. In particular, short channel coherence
A. Ashikhmin and T. L. Marzetta are with Bell Laboratories, Alcatel-Lucent times due to mobility do not allow for such long training
Inc., Murray Hill, NJ 07974 USA (e-mail: aea@research.bell-labs.com; sequences.
tom.marzetta@alcatel-lucent.com).
Digital Object Identifier 10.1109/TWC.2011.060711.101155 This non-orthogonal nature causes pilot contamination [1],
1536-1276/11$25.00 ⃝
c 2011 IEEE
JOSE et al.: PILOT CONTAMINATION AND PRECODING IN MULTI-CELL TDD SYSTEMS 2641
Cell-1 Cell-2
in [9], [10]. For the setting with one user in every cell, we
derive closed-form expressions for achievable rates. These
closed-form expressions allow us to determine the extent to
which pilot contamination impacts system performance. In
BS-1 BS-2 particular, we show that the achievable rates can saturate with
h12 the number of antennas at the base station 𝑀 . This analysis
will allow system designers to determine the appropriate fre-
h22
quency/time/pilot reuse factor to maximize system throughput
User-1 User-2 in the presence of pilot contamination.
In the multi-cell scenario, there has been significant work
on utilizing coordination among base stations [15]–[18] when
Fig. 1. A two-cell example with one user in each cell. Both users CSI is available. This existing body of work focuses on
transmit non-orthogonal pilots during uplink training, which leads to pilot the gain that can be obtained through coordination of the
contamination at both the base stations. base stations. Dirty paper coding based approaches and joint
beamforming/precoding approaches are considered in [18].
Linear precoding methods for clustered networks with full
[14], which is encountered only when analyzing a multi- intra-cluster coordination and limited inter-cluster coordina-
cell MIMO system with training, and is lost when narrowing tion are proposed in [19]. These approaches generally require
focus to a single-cell setting or to a multi-cell setting where “good” channel estimates at the base stations. Due to non-
channel information is assumed available at no cost. In this orthogonal training sequences, the resulting channel estimate
paper, we perform a more detailed study of this problem (of the channel between a base station and all users) can be
and consider precoding in its presence. First, we study the shown to be rank deficient. We develop a multi-cell MMSE-
impact of pilot contamination (and thus achievable rates), and based precoding method that depends on the set of training
then, we develop methods that mitigate this contamination. In sequences assigned to the users. Note that this MMSE-based
older generation cellular systems, multiple factors including precoding is for the general setting with multiple users in every
large reuse distance of any training signal and randomization cell. Our approach does not need coordination between base
in the selection of pilots would have helped in keeping stations required by the joint precoding techniques1 . When
the impact of pilot contamination reasonable. However, with coordination is present, this approach can be applied at the
newer generation systems designed for more aggressive reuse inter-cluster level. The MMSE-based precoding derived in this
of spectrum, this impact is very crucial to understand and paper has several advantages. In addition to being a linear
mitigate. We note that pilot contamination must also figure in precoding method, it has a simple closed-form expression that
Cooperative MIMO (also called Network MIMO [15], [16]) results from an intuitive optimization problem formulation. For
where clusters of base stations are wired together to create many training sequence allocations, numerical results show
distributed arrays, and where pilots must be re-used over that our approach gives significant gains over certain popular
multiple clusters. single-cell precoding methods including zero-forcing.
The fundamental problem associated with pilot contamina-
tion is evident even in the simple multi-cell scenario shown in A. Related Work
Figure 1. Consider two cells 𝑖 ∈ {1, 2}, each consisting of one
Over the past decade, a variety of aspects of downlink
base station and one user. Let h𝑖𝑗 denote the channel between
and uplink transmission problems in a single cell setting
the base station in the 𝑖-th cell and the user in the 𝑗-th cell. Let
have been studied. In information theoretic literature, these
the training sequences used by both the users be same. In this
problems are studied as the broadcast channel (BC) and the
case, the MMSE channel estimate of h22 at the base station
multiple access channel (MAC) respectively. For Gaussian BC
in the 2-nd cell is ĥ22 = 𝑐1 h12 + 𝑐2 h22 + 𝑐w. Here 𝑐1 , 𝑐2 and
and general MAC, the problems have been studied for both
𝑐 are constants that depend on the propagation factors and the
single and multiple antenna cases. The sum capacity of the
transmit powers of mobiles, and w is 𝒞𝒩 (0, I) additive noise.
multi-antenna Gaussian BC has been shown to be achieved by
The base station in the 2-nd cell uses this channel estimate
dirty paper coding (DPC) in [20]–[23]. It was shown in [24]
to form a precoding vector a2 = 𝑓 (ĥ22 ), which is usually
that DPC characterizes the full capacity region of the multi-
aligned with the channel estimate, that is a2 = 𝑐𝑜𝑛𝑠𝑡 ⋅ ĥ22 .
antenna Gaussian BC. These results assume perfect CSI at the
However, by doing this, the base station (partially) aligns the
base station and the users. In addition, the DPC technique is
transmitted signal with both h22 (which is desirable) and h12
computationally challenging to implement in practice. There
(which is undesirable). Both signal (h22 a†2 ) and interference
has been significant research focus on reducing the compu-
(h12 a†2 ) statistically behave similarly. Therefore, the general
tational complexity at the base station and the users. In this
assumption that the precoding vector used by a base station
regard, different precoding schemes with low complexity have
in one cell is uncorrelated with the channel to users in other
been proposed. This body of work [25]–[29] demonstratess
cells is not valid with uplink training using non-orthogonal
that sum rates close to sum capacity can be achieved with
training sequences. This fundamental problem is studied in
much lower computational complexity. However, these results
further detail.
assume perfect CSI at the base station and the users.
To perform pilot contamination analysis, we first develop
analytical expressions using techniques similar to those used 1 There is no exchange of channel state information among base stations.
2642 IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 10, NO. 8, AUGUST 2011
Cell-𝑙 Cell-𝑗
The problem of lack of channel CSI is usually studied by
considering one of the following two settings. As discussed
before, in the first setting, CSI at users is assumed to be
available and a limited feedback link is assumed to exist from
the users to the base station. In [3], [5]–[8], [30] such a √
𝑚 𝛽𝑗𝑙𝑘 ℎ𝑗𝑙𝑘𝑚
setting is considered. In [5], the authors show that at high BS
Power 𝜌𝑓 Power 𝜌𝑟
signal to noise ratios (SNRs), the feedback rate required per
User-𝑘
user must grow linearly with the SNR (in dB) in order to
obtain the full MIMO BC multiplexing gain. The main result
in [6] is that the extent of CSI feedback can be reduced
by exploiting multi-user diversity. In [7] it is shown that
Fig. 2. System model showing the base station in 𝑙-th cell and the 𝑘-th user
nonrandom vector quantizers can significantly increase the in 𝑗-th cell.
MIMO downlink throughput. In [8], the authors design a joint
CSI quantization, beamforming and scheduling algorithm to
attain optimal throughput scaling. In the next setting, time antennas and 𝐾(≤ 𝑀 ) single-antenna users2 . Let the average
division duplex systems are considered and channel training power (during transmission) at the base station be 𝑝𝑓 and the
and estimation error are accounted for in the net achievable average power (during transmission) at each user be 𝑝𝑟 . The
rate. This approach is used in [9], [10], [31], [32]. In [9], the propagation factor between the 𝑚-th base station
authors give a lower bound on sum capacity and demonstrate √antenna of
the 𝑙-th cell and the 𝑘-th user of the 𝑗-th cell is 𝛽𝑗𝑙𝑘 ℎ𝑗𝑙𝑘𝑚 ,
that it is always beneficial to increase the number of antennas where {𝛽𝑗𝑙𝑘 } are non-negative constants and assumed to
at the base station. In [10], the authors study a heterogeneous be known to everybody, and {ℎ𝑗𝑙𝑘𝑚 } are independent and
user setting and present scheduling and precoding methods for identically distributed (i.i.d.) zero-mean, circularly-symmetric
this setting. In [31], the authors consider two-way training and complex Gaussian 𝒞𝒩 (0, 1) random variables and known
propose two variants of linear MMSE precoders as alternatives to nobody3. This system model is shown in Figure 2. The
to linear zero-forcing precoder used in [9]. Single-cell analysis above assumptions are fairly accurate and justified due to
of TDD systems are also provided in [32]. the following reason. The {𝛽𝑗𝑙𝑘 } values model path-loss and
Given this extensive body of literature in single-cell sys- shadowing that change slowly and can be learned over long
tems, the main contribution of this paper is in understanding period of time, while the {ℎ𝑗𝑙𝑘𝑚 } values model fading that
multi-cell systems with channel training. Its emphasis is on change relatively fast and must be learned and used very
TDD systems, which are arguably poorly studied compared quickly. Since the cell layout and shadowing are captured
to FDD systems. Specifically, the main contributions are to using the constant {𝛽𝑗𝑙𝑘 } values, for the purpose of this paper,
demonstrate the pilot contamination problem associated with the specific details of the cell layout and shadowing model are
uplink training, understand its impact on the operation of irrelevant. In other words, any cell layout and any shadowing
multi-cell MIMO TDD cellular systems, and develop a new model can be incorporated with the above abstraction.
precoding method to mitigate this problem.
We assume channel reciprocity√for the forward and reverse
links, i.e., the propagation factor 𝛽𝑗𝑙𝑘 ℎ𝑗𝑙𝑘𝑚 is same for both
B. Notation forward and reverse links, and block fading, i.e., {ℎ𝑗𝑙𝑘𝑚 }
We use bold font variables to denote matrices and vectors. remains constant for a duration of 𝑇 symbols. Note that we
(⋅)𝑇 denotes the transpose and (⋅)† denotes the Hermitian allow for a constant factor variation in forward and reverse
transpose, tr{⋅} denotes the trace operation, (⋅)−1 denotes the propagation factors through the different average power con-
inverse operation, and ∥ ⋅ ∥ denotes the two-norm. diag{a} straints at the base stations and the users. The additive noises at
denotes a diagonal matrix with diagonal entries equal to the all terminals are i.i.d. 𝒞𝒩 (0, 1) random variables. The system
components of a. 𝔼[⋅] and var{⋅} stand for expectation and equations describing the signals received at the base station
variance operations, respectively. and the users are given in the next section.
C. Organization
The rest of this paper is organized as follows. In Section II, III. C OMMUNICATION S CHEME
we describe the multi-cell system model. In Section III, we
The communication scheme consists of two phases: uplink
explain the communication scheme and the technique to obtain
training and data transmission. Uplink training phase consists
achievable rates. We analyze the effect of pilot contamination
of users transmitting training pilots, and base stations obtain-
in Section IV, and give the details of the new precoding
ing channel estimates. Data transmission phase consists of
method in Section V. We present numerical results in Section
base stations transmitting data to the users through transmit
VI. Finally, we provide our concluding remarks in Section
precoding. Next, we describe these phases briefly and provide
VII.
a set of achievable data rates using a given precoding method.
II. M ULTI -C ELL TDD S YSTEM M ODEL
2 These are the users of interest for applying the precoding framework.
We consider a cellular system with 𝐿 cells numbered 3 For compact notation, we do not separate the subscript or superscript
1, 2, . . . , 𝐿. Each cell consists of one base station with 𝑀 indices using commas throughout the paper.
A. Uplink Training where a𝑙𝑖 is the 𝑖-th column of the precoding matrix A𝑙 and
At the beginning of every coherence interval, all users (in all 𝑧𝑗𝑘 is the 𝑘-th element of z𝑗 .
cells) transmit√training sequences, which are 𝜏 length column
vectors4 . Let 𝜏 𝝍 𝑗𝑘 (normalized such that 𝝍 †𝑗𝑘 𝝍 𝑗𝑘 = 1) be C. Achievable Rates
the training vector transmitted by the 𝑘-th user in the 𝑗-th cell.
Consider the base station of the 𝑙-th cell. The 𝜏 length column Next, we provide a set of achievable rates using the method
vector received at the 𝑚-th antenna of this base station is suggested in [9]. With the above communication scheme, the
𝐿 ∑ 𝐾 base stations have channel estimates while the users do not
∑ √
y𝑙𝑚 = 𝑝𝑟 𝜏 𝛽𝑗𝑙𝑘 ℎ𝑗𝑙𝑘𝑚 𝝍 𝑗𝑘 + w𝑙𝑚 , (1) have any channel estimate. Therefore, the achievable rates
𝑗=1 𝑘=1 we derive have a different structure compared to typical rate
expressions. In particular, the effective noise term has channel
where w𝑙𝑚 is the additive noise. Let Y𝑙 = [y𝑙1 y𝑙2 . . . y𝑙𝑀 ]
variations around
√ the mean in addition to typical terms.
(𝜏 × 𝑀 matrix), W𝑙 = [w𝑙1 w𝑙2 . . . w𝑙𝑀 ] (𝜏 × 𝑀 ma- 𝑗𝑘
Let 𝑔𝑙𝑖 = 𝑝𝑓 𝛽𝑗𝑙𝑘 [ℎ𝑗𝑙𝑘1 . . . ℎ𝑗𝑙𝑘𝑀 ] a𝑙𝑖 . Now, (5) can be
trix), Ψ𝑗 = [𝝍 𝑗1 𝝍 𝑗2 . . . 𝝍 𝑗𝐾 ] (𝜏 × 𝐾 matrix), D𝑗𝑙 =
written in the form
diag{[𝛽𝑗𝑙1 𝛽𝑗𝑙2 . . . 𝛽𝑗𝑙𝐾 ]}, and
⎡ ⎤ 𝑀 ∑
∑ 𝐾
ℎ𝑗𝑙11 . . . ℎ𝑗𝑙1𝑀 𝑥𝑗𝑘 = 𝑔𝑙𝑖𝑗𝑘 𝑞𝑙𝑖 + 𝑧𝑗𝑘 ,
⎢ .. .. .. ⎥
H𝑗𝑙 = ⎣ . . . ⎦. 𝑙=1 𝑖=1
[ ] ( [ ])
𝑗𝑘 𝑗𝑘 𝑗𝑘
ℎ𝑗𝑙𝐾1 . . . ℎ𝑗𝑙𝐾𝑀 = 𝔼 𝑔𝑗𝑘 𝑞𝑗𝑘 + 𝑔𝑗𝑘 − 𝔼 𝑔𝑗𝑘 𝑞𝑗𝑘
∑
From (1), the signal received at this base station can be + 𝑔𝑙𝑖𝑗𝑘 𝑞𝑙𝑖 + 𝑧𝑗𝑘 . (6)
expressed as (𝑙,𝑖)∕=(𝑗,𝑘)
∑ 𝐿
√ 1
In (6), the effective noise is defined as
Y𝑙 = 𝑝𝑟 𝜏 Ψ𝑗 D𝑗𝑙2 H𝑗𝑙 + W𝑙 . (2)
𝑗=1 ′
( [ ]) ∑
𝑗𝑘 𝑗𝑘
𝑧𝑗𝑘 = 𝑔𝑗𝑘 − 𝔼 𝑔𝑗𝑘 𝑞𝑗𝑘 + 𝑔𝑙𝑖𝑗𝑘 𝑞𝑙𝑖 + 𝑧𝑗𝑘 . (7)
The MMSE estimate of the channel H𝑖𝑙 given Y𝑙 in (2) is (𝑙,𝑖)∕=(𝑗,𝑘)
( 𝐿
)−1
√ 1
†
∑ † Now, the effective point-to-point channel described by (6) can
Ĥ𝑗𝑙 = 𝑝𝑟 𝜏 D𝑗𝑙2 Ψ𝑗 I + 𝑝𝑟 𝜏 Ψ𝑖 D𝑖𝑙 Ψ𝑖 Y𝑙 . (3)
be written in the familiar form
𝑖=1 [ ]
𝑗𝑘 ′
This MMSE estimate in (3) follows from standard results 𝑥𝑗𝑘 = 𝔼 𝑔𝑗𝑘 𝑞𝑗𝑘 + 𝑧𝑗𝑘 , (8)
in estimation theory (for example see [33]). We denote the
𝑗𝑘
MMSE estimate of the channel between this base station and where 𝑞𝑗𝑘 is the input, 𝑥𝑗𝑘 is the output, 𝔼[𝑔𝑗𝑘 ] is the known
all users by Ĥ𝑙 = [Ĥ1𝑙 Ĥ2𝑙 . . . Ĥ𝐿𝑙 ]. This notation is used ′
channel and 𝑧𝑗𝑘 is the additive noise. This expectation is
later in Section IV. known as it only depends on the channel distribution and
not the instantaneous channel. However, the additive noise
B. Downlink Transmission is neither independent nor Gaussian. We use the result in
Consider the base station of the 𝑙-th cell. Let the infor- [34] that shows that worst-case uncorrelated additive noise
mation symbols to be transmitted to users in the 𝑙-th cell be is independent Gaussian noise of same variance to derive the
q𝑙 = [𝑞𝑙1 𝑞𝑙2 . . . 𝑞𝑙𝐾 ]𝑇 and the 𝑀 ×𝐾 linear precoding matrix following achievable rates.
be A𝑙 = 𝑓 (Ĥ𝑙 ). The function 𝑓 (⋅) corresponds to the specific Theorem 1: Consider the point-to-point communication
(linear) precoding method performed at the base station. The channels given by (8). Then, the following set of rates are
signal vector transmitted by this base station is A𝑙 q𝑙 . We achievable:
consider transmission symbols and precoding methods such ⎛ [ ]2 ⎞
𝑗𝑘
that 𝔼[q𝑙 ] = 0, 𝔼[q𝑙 q†𝑙 ] = I, and tr{A†𝑙 A𝑙 } = 1. These ⎜ 𝔼 𝑔𝑗𝑘 ⎟
(sufficient) conditions imply that the average power constraint 𝑅𝑗𝑘 = 𝐶 ⎜⎝ { } ∑ [ ] ⎟ , (9)
𝑗𝑘 𝑗𝑘 ⎠
2
at the base station is satisfied. 1 + var 𝑔𝑗𝑘 + (𝑙,𝑖)∕=(𝑗,𝑘) 𝔼 𝑔𝑙𝑖
Now, consider the users in the 𝑗-th cell. The noisy signal
vector received by these users is where 𝐶(𝜃) = log2 (1 + 𝜃).
𝐿
∑ Proof: Consider complex Gaussian 𝒞𝒩 (0, 1) distribu-
√ 1
x𝑗 = 𝑝𝑓 D𝑗𝑙2 H𝑗𝑙 A𝑙 q𝑙 + z𝑗 , (𝐾 × 1 vector) (4) tions for all inputs 𝑄𝑗𝑘 5 . Then, the resulting mutual informa-
𝑙=1 tion with this (not necessarily maximizing) input distribution
where z𝑗 is the additive noise. From (4), the signal received 𝐼(𝑄𝑗𝑘 ; 𝑋𝑗𝑘 ) is an achievable rate. However, this does not
by the 𝑘-th user can be expressed as result in a computable expression. To obtain a computable
𝐾
𝐿 ∑
lower bound to this rate, we observe that input random variable
∑ √ ′
𝑥𝑗𝑘 = 𝑝𝑓 𝛽𝑗𝑙𝑘 [ℎ𝑗𝑙𝑘1 . . . ℎ𝑗𝑙𝑘𝑀 ]a𝑙𝑖 𝑞𝑙𝑖 + 𝑧𝑗𝑘 , (5) 𝑄𝑗𝑘 and effective noise 𝑍𝑗𝑘 given by (7) are uncorrelated
𝑙=1 𝑖=1
based on the following: The input distributions are such that
4 We assume that there is time synchronization present in the system for 5 Upper case symbols are used in this proof to emphasize that these are
coherent uplink transmission. random variables.
𝑄𝑗𝑘 is clearly independent of 𝑄𝑙𝑖 for all (𝑙, 𝑖) ∕= (𝑗, 𝑘) and obtain a simple expression for the achievable rate given by
𝑍𝑗𝑘 . Furthermore, 𝑄𝑗𝑘 is independent of 𝐺𝑗𝑘𝑗𝑘 . Therefore, (9).
[( [ ]) ] In the setting considered here, the MMSE estimate of h𝑗𝑙
𝔼 𝐺𝑗𝑘 𝑗𝑘 − 𝔼 𝐺𝑗𝑘
𝑗𝑘
∣𝑄𝑗𝑘 ∣2 = based on Y𝑙 given by (3) can be simplified using matrix
[( [ ])] [ ] inversion lemma and the fact that 𝝍 † 𝝍 = 1 as follows:
𝔼 𝐺𝑗𝑘 𝑗𝑘 − 𝔼 𝐺 𝑗𝑘
𝑗𝑘 𝔼 ∣𝑄𝑗𝑘 ∣2 = 0. ( ( ) )−1
∑𝐿
√ †
The variance of the effective noise is ĥ𝑗𝑙 = 𝑝𝑟 𝜏 𝛽𝑗𝑙 𝝍 I + 𝝍 𝑝𝑟 𝜏 𝛽𝑖𝑙 𝝍 † Y𝑙 ,
[ ′ ] { } ∑ [ ] 𝑖=1
𝔼 ∣𝑍𝑗𝑘 ∣2 = var 𝐺𝑗𝑘 + 𝔼 ∣𝐺𝑗𝑘 2
∣ + 1. (⎛ ∑𝐿 ) ⎞
𝑗𝑘 𝑙𝑖
√ 𝝍 𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑙 𝝍 †
(𝑙,𝑖)∕=(𝑗,𝑘)
= 𝑝𝑟 𝜏 𝛽𝑗𝑙 𝝍 † ⎝I − ∑𝐿 ⎠ Y𝑙 ,
Now, from [34] (Theorem 1), we know the result that the 1 + 𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑙
⎛ ( ) ⎞
channel with independent Gaussian noise 𝑍ˆ𝑗𝑘 with same ∑
√ 𝑝𝑟 𝜏 𝐿𝑖=1 𝛽 𝑖𝑙
variance given by = 𝑝𝑟 𝜏 𝛽𝑗𝑙 ⎝𝝍 − †
𝝍 ⎠ Y𝑙 ,
†
[ ] ∑𝐿
1 + 𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑙
𝑋ˆ 𝑗𝑘 = 𝔼 𝐺𝑗𝑘 𝑄𝑗𝑘 + 𝑍ˆ𝑗𝑘 √
𝑗𝑘
𝑝𝑟 𝜏 𝛽𝑗𝑙 †
is worse, i.e., = 𝝍 Y𝑙 ,
𝜅𝑙
𝐼(𝑄𝑗𝑘 ; 𝑋𝑗𝑘 ) ≥ 𝐼(𝑄𝑗𝑘 ; 𝑋ˆ 𝑗𝑘 ) ∑𝐿
where 𝜅𝑙 = 1 + 𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑙 .
ˆ 𝑗𝑘 ) − ℎ(𝑋
= ℎ(𝑋 ˆ 𝑗𝑘 ∣𝑄𝑗𝑘 ) Remark 1: The above channel estimates clearly suggest
ˆ 𝑗𝑘 ) − ℎ(𝑍ˆ𝑗𝑘 ).
= ℎ(𝑋 the graveness of the pilot contamination impairment. For a
given base station, its estimate of every channel is simply
This completes the proof. a scaled version of the same vector 𝝍 † Y𝑙 . Thus, it cannot
The set of achievable rates given by (9) is valid for any lin- distinguish between the channel to its user and other users,
ear precoding method, and depends on the precoding method which makes pilot contamination a fundamental problem in
through the expectation and variance terms appearing in (9). multi-cell systems.
Similar achievable rates are used in the single-cell setting as Since 𝝍 † Y𝑙 is proportional to the MMSE estimate of h𝑗𝑙
well to study and/or compare precoding methods. Next, we for any 𝑗, we have
perform pilot contamination analysis using these achievable
rates. ĥ†𝑗𝑙 Y𝑙† 𝝍
= , ∀𝑗. (11)
∥ĥ𝑗𝑙 ∥ ∥𝝍† Y𝑙 ∥
IV. P ILOT C ONTAMINATION A NALYSIS Using (11), we obtain
We analyze the pilot contamination problem in the fol-
ĥ†𝑗𝑙
lowing setting: one user per cell (𝐾 = 1), same training h𝑗𝑙 a𝑙 = h𝑗𝑙
sequence used by all users (𝝍𝑗1 = 𝝍, ∀𝑗) and matched-filter † ,
ĥ𝑗𝑙
(MF) precoding. We consider this setting as it captures the
primary effect of pilot contamination which is the correlation † ĥ†𝑗𝑙
= ĥ𝑗𝑙 + h̃𝑗𝑙
, (12)
between the precoding matrix (vector in this setting) used †
ĥ𝑗𝑙
by the base station in a cell and channel to users in other
cells. We provide simple and insightful analytical results in where h̃𝑗𝑙 = h𝑗𝑙 − ĥ𝑗𝑙 . From the properties of MMSE
this setting. As mentioned earlier, we emphasize that the pilot we )know that ĥ𝑗𝑙 is (independent
estimation,
( ∑ of h̃𝑗𝑙 ), ĥ𝑗𝑙 is
contamination problem results from uplink training with non- 𝑝𝑟 𝜏 𝛽𝑗𝑙 1+𝑝𝑟 𝜏 𝑖∕=𝑗 𝛽𝑖𝑙
𝒞𝒩 0, 𝜅𝑙 I , and h̃𝑗𝑙 is 𝒞𝒩 0, 𝜅𝑙 I . These
orthogonal training sequences, and hence, it is not specific to
results are used next.
the setting considered here. However, the level of its impact
on the achievable rates would vary depending on the system From (12), we get
[ ]
settings.
𝔼 [h𝑗𝑙 a𝑙 ] = 𝔼 ĥ†𝑗𝑙 ,
In order to simplify notation, we drop the subscripts asso- √
ciated with the users in every cell. In this section, H𝑗𝑙 , Ĥ𝑗𝑙 𝑝𝑟 𝜏 𝛽𝑗𝑙
= 𝔼 [𝜃] , (13)
and A𝑙 are vectors and we denote these using h𝑗𝑙 , ĥ𝑗𝑙 and 𝜅𝑙
a𝑙 , respectively. The matched-filter used at the base station in √∑
𝑀
the 𝑙-th cell is given by a𝑙 = ĥ†𝑙𝑙 /∥ĥ𝑙𝑙 ∥. The user in the 𝑗-th where 𝜃 = 2
𝑚=1 ∣𝑢𝑚 ∣ and {𝑢𝑚 } is i.i.d. 𝒞𝒩 (0, 1). From
cell receives signal from its base station and from other base (12), we also have
stations. From (4), this received signal is ⎡ ⎤
[ ] [ ] †
√ ∑√ 2 ĥ 𝑗𝑙 ĥ
𝔼 ∥h𝑗𝑙 a𝑙 ∥2 = 𝔼 ĥ†𝑗𝑙 + 𝔼 ⎣ † 𝑗𝑙 ⎦
𝑥𝑗𝑘 = 𝑝𝑓 𝛽𝑗𝑗 h𝑗𝑗 a𝑗 𝑞𝑗 + 𝑝𝑓 𝛽𝑗𝑙 h𝑗𝑙 a𝑙 𝑞𝑙 + 𝑧𝑗 . (10) † h̃𝑗𝑙 h̃𝑗𝑙 † ,
𝑙∕=𝑗
ĥ𝑗𝑙 ĥ𝑗𝑙
∑
We compute first and second order moments of the effective 𝑝𝑟 𝜏 𝛽𝑗𝑙 1 + 𝑝𝑟 𝜏 𝑖∕=𝑗 𝛽𝑖𝑙
= 𝔼[𝜃2 ] + . (14)
channel gain and the inter-cell interference and use these to 𝜅𝑙 𝜅𝑙
Γ(𝑀+ 12 )
Next, we state two lemmas required to obtain a closed-form freedom, it is straightforward to see that 𝔼[𝜃] = Γ(𝑀) ,
expression for the achievable rate. 𝔼[𝜃2 ] = 𝑀 and var{𝜃} = 𝑀 − 𝔼2 [𝜃].
Lemma 2: The effective channel gain in (10) has expecta- Using the duplication formula
tion ( )
( )1/2 1 √
[√ ] Γ(𝑧)Γ 𝑧 + = 2(1−2𝑧) 𝜋Γ(2𝑧)
𝑝𝑟 𝜏 𝛽𝑗𝑗 2
𝔼 𝑝𝑓 𝛽𝑗𝑗 h𝑗𝑗 a𝑗 = 𝑝𝑓 𝛽𝑗𝑗 ∑ 𝔼[𝜃]
1 + 𝑝𝑟 𝜏 𝐿 𝑖=1 𝛽𝑖𝑗 and Stirling’s formula
{√ }
and variance var 𝑝𝑓 𝛽𝑗𝑗 h𝑗𝑗 a𝑗 = 𝑛!
lim √ = 1,
( ∑ ) 𝑛→∞ 2𝜋𝑛𝑛𝑛 𝑒−𝑛
𝑝𝑟 𝜏 𝛽𝑗𝑗 1 + 𝑝𝑟 𝜏 𝑖∕=𝑗 𝛽𝑖𝑗
𝑝𝑓 𝛽𝑗𝑗 ∑𝐿 var{𝜃} + ∑𝐿 . we obtain
1 + 𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑗 1 + 𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑗
( )
Proof: The proof follows from (13) and (14). Note that 1 Γ 𝑀 + 12
var{𝜃} = 𝔼[𝜃2 ] − (𝔼[𝜃])2 by definition. lim √
𝑀→∞ 𝑀 Γ(𝑀 )
Lemma 3: For both signal and interference terms in (10), √
𝜋 (1−2𝑀) (2𝑀 − 1)!
the first and second order moments are as follows: = lim 2 ,
[√ ] 𝑀→∞ 𝑀 (𝑀 − 1)!(𝑀 − 1)!
√ √
𝔼 𝑝𝑓 𝛽𝑗𝑙 h𝑗𝑙 a𝑙 𝑞𝑙 = 0, 𝜋 (1−2𝑀) 2𝜋(2𝑀 − 1)(2𝑀 − 1)(2𝑀−1)
= lim 2 ,
𝑀→∞ 𝑀 2𝜋(𝑀 − 1)(𝑀 − 1)2(𝑀−1) 𝑒
[ 2 ] √ ( )2𝑀−1
√ 2𝑀 − 1 1
𝔼 𝑝𝑓 𝛽𝑗𝑙 h𝑗𝑙 a𝑙 𝑞𝑙 = = lim 1+ 𝑒−1 ,
( ∑ ) 𝑀→∞ 2𝑀 2(𝑀 − 1)
𝑝𝑟 𝜏 𝛽𝑗𝑙 2
1 + 𝑝𝑟 𝜏 𝑖∕=𝑗 𝛽𝑖𝑙 = 1.
𝑝𝑓 𝛽𝑗𝑙 ∑𝐿 𝔼[𝜃 ] + ∑𝐿 .
1 + 𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑙 1 + 𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑙 2
Therefore, lim𝑀→∞ 𝔼𝑀[𝜃] = 1 and lim𝑀→∞ var{𝜃} 𝑀 = 0. This
Proof: Since 𝔼[𝑞𝑙 ] = 0 and 𝑞𝑙 is independent of h𝑗𝑙 and completes the proof of (16).
a𝑙 , it is clear that For large 𝑀 , the value of var{𝜃} (≈ 1/4) is insignificant
[√ ] compared to 𝑀 . The results of the above theorem show
𝔼 𝑝𝑓 𝛽𝑗𝑙 h𝑗𝑙 a𝑙 𝑞𝑙 = 0. that the performance does saturate with 𝑀 . Typically, the
∑𝐿
The proof of the second order moment follows directly from reverse link is interference-limited, i.e., 𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑙 ≫ 1, ∀𝑗.
∑𝐿
(14). The term 𝑖=1 𝛽𝑖𝑙 is the expected sum of squares of the
The main result of this section is given in the next theo- propagation coefficients between ∑𝐿the base station in the 𝑗-th
rem. This theorem provides a closed-form expression for the cell and all users. Therefore, 𝑖=1 𝛽𝑖𝑙 is generally constant
achievable rates under the setting considered in this section, with respect to 𝑗. Using these approximations in (16), we get
( )
i.e., one user per cell (𝐾 = 1), same training sequence used 2
𝛽𝑗𝑗
by all users (𝝍𝑗1 = 𝝍, ∀𝑗) and matched-filter (MF) precoding. 𝑅𝑗 ≈ 𝐶 ∑ 2 .
Theorem 4: For the setting considered, the achievable rate 𝑙∕=𝑗 𝛽𝑗𝑙
of the user in the 𝑗-th cell during downlink transmission in This clearly show that the impact of pilot contamination can
(9) is given by be very significant if cross gains (between cells) are of the
⎛ ⎞ same order of direct gains (within the same cell). It suggests
𝑝 𝜏𝛽
𝑝𝑓 𝛽𝑗𝑗 𝑟 𝜅𝑗 𝑗𝑗 𝔼2 [𝜃]
𝐶 ⎝∑ ⎠, (15) frequency/time reuse and pilot reuse techniques to reduce the
𝑝𝑟 𝜏 𝛽𝑗𝑙
𝑙∕=𝑗 𝑝𝑓 𝛽𝑗𝑙 𝜅𝑙 𝔼[𝜃2 ] + 𝜁 cross gains (in the same frequency/time) relative to the direct
gains. The benefits of frequency reuse in the limit of an infinite
where number of antennas were demonstrated in [14].
𝐿 ∑ Remark 2: Our result in Theorem 4 is not an asymptotic
𝑝𝑟 𝜏 𝛽𝑗𝑗 ∑ 1 + 𝑝𝑟 𝜏 𝑖∕=𝑗 𝛽𝑖𝑙
𝜁 = 1 + 𝑝𝑓 𝛽𝑗𝑗 var{𝜃} + 𝑝𝑓 𝛽𝑗𝑙 , result. The expression in (15) is exact for any value of the
𝜅𝑗 𝜅𝑙
𝑙=1 number of antennas 𝑀 at the base stations. Hence, this
∑𝐿 Γ(𝑀+ 1 ) expression can be used to find the appropriate frequency/time
𝜅𝑗 = 1 + 𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑗 , 𝔼[𝜃] = Γ(𝑀)2 , 𝔼[𝜃2 ] = 𝑀 and
reuse scheme for any given value of 𝑀 and other system
var{𝜃} = 𝑀 − 𝔼2 [𝜃]. Here, Γ(⋅) is the Gamma function. For parameters. We do not focus on this in this paper, as this would
large M, the following limiting expression for achievable rate depend largely on the actual system parameters including the
can be obtained: cell layout and the shadowing model.
⎛ ⎞
2
𝛽𝑗𝑗
∑𝐿
Remark 3: The result in Theorem 4 is for the setting with
⎜ 1+𝑝𝑟 𝜏 𝑖=1 𝛽𝑖𝑗 ⎟ one user per cell. In the general setting with 𝐾 users per
lim 𝑅𝑗 = 𝐶 ⎝ ∑ 2
𝛽𝑗𝑙
⎠. (16)
𝑀→∞
∑𝐿 cell, a similar analysis can be performed, however, it need not
𝑙∕=𝑗 1+𝑝𝑟 𝜏 𝛽𝑖𝑙
𝑖=1
simplify to a simple closed-form expression. The achievable
Proof: The proof of (15) follows by substituting the rate in (9) can be numerically evaluated in the general setting,
results of Lemma 2√and Lemma 3 in (9). Since 𝜃 has a scaled and this can be used to numerically study the impact of pilot
(by a factor of 1/ 2) chi distribution with 2𝑀 degrees of contamination.
To summarize, the impact of uplink training with non- does not extend (directly) to this setting. Let us consider
orthogonal pilots can be serious when the cross-gains are not the signal and interference terms corresponding to the base
small compared to the direct gains. This pilot contamination station in the 𝑙-th cell. Based on these terms, we formulate
problem is often neglected in theory and even in many large- the following optimization problem to obtain the precoding
scale simulations. The analysis in this section shows the need √ 1
matrix A𝑙 . We use the following notation: F𝑗𝑙 = 𝑝𝑓 D𝑗𝑙2 H𝑗𝑙 ,
to account for this impact especially in systems with high √ 1
F̂𝑗𝑙 = 𝑝𝑓 D𝑗𝑙2 Ĥ𝑗𝑙 and F̃𝑗𝑙 = F𝑗𝑙 − F̂𝑗𝑙 for all 𝑗 and 𝑙. The
reuse of training sequences. In addition to uplink training
optimization problem is:
in TDD systems, which is the focus of this paper, the pilot
contamination problem would appear in other scenarios as [
well as it is fundamental to training with non-orthogonal 2
min 𝔼F̃𝑗𝑙 ,z𝑙 ,q𝑙 ∥𝛼𝑙 (F𝑙𝑙 A𝑙 q𝑙 + z𝑙 ) − q𝑙 ∥ +
pilots. A𝑙 ,𝛼𝑙
]
Next, we proceed to develop a new precoding method ∑
2
referred to as the multi-cell MMSE-based precoding. ∥𝛼𝑙 𝛾(F𝑗𝑙 A𝑙 q𝑙 )∥ F̂𝑗𝑙 (18)

𝑗∕=𝑙
V. M ULTI -C ELL MMSE-BASED P RECODING subject to tr{A†𝑙 A𝑙 } = 1.
In the previous section, we show that pilot contamination
severely impacts the system performance by increasing the This objective function is very intuitive. The objective function
inter-cell interference. In particular, we show that the inter-cell of the problem (18) consists of two parts: (i) the sum of
interference grows like the intended signal with the number squares of “errors” seen by the users in the 𝑙-th cell, and (ii)
of antennas 𝑀 at the base stations while using zero-forcing the sum of squares of interference seen by the users in all other
precoding. Therefore, in the presence of pilot contamination, cells. The parameter 𝛾 of the optimization problem “controls”
in addition to frequency/time/pilot reuse schemes, it is cru- the relative weights associated with these two parts. The real
cial to account for inter-cell interference while designing a scalar parameter 𝛼𝑙 is important as it “virtually” corresponds
precoding method. Furthermore, since pilot contamination is to the potential scaling that can be performed at the users. The
originating from the non-orthogonal training sequences, it is optimal solution to the problem (18) denoted by A𝑜𝑝𝑡 𝑙 is the
important to account for the training sequence allocation while multi-cell MMSE-based precoding matrix.
designing a precoding method. The approach of accounting for Next, we obtain a closed-form expression for A𝑜𝑝𝑡 𝑙 . The
inter-cell interference while designing a precoding method is following lemma is required later for obtaining the optimal
common, while the approach of accounting for the training solution to the problem (18).
sequence allocation is not. Again, the usual approach is to Lemma 5: Consider the optimization problem (18). For all
decouple the channel estimation and precoding completely. 𝑗 and 𝑙,
However, while using non-orthogonal pilots, this is not the [ ]

right approach. These observations follow from our pilot 𝔼 F̃†𝑗𝑙 F̃𝑗𝑙 F̂𝑗𝑙 = 𝛿𝑗𝑙 I𝑀 ,
contamination analysis in the previous section.
The precoding problem cannot be directly formulated as where
a joint optimization problem as different base stations have { ( )−1 }
1 1
different received training signals. In other words, the problem †
𝛿𝑗𝑙 = 𝑝𝑓 tr D𝑗𝑙 I𝐾 + 𝑝𝑟 𝜏 D𝑗𝑙 Ψ𝑗 Λ𝑗𝑙 Ψ𝑗 D𝑗𝑙
2 2
, (19)
is decentralized in nature. Therefore, one approach is to apply
single-cell precoding methods. For example, since we assume
and
orthogonal training sequences in every cell, we can perform ⎛ ⎞−1
zero-forcing on the users in every cell. The precoding matrix ∑
corresponding to this zero-forcing approach is given by Λ𝑗𝑙 = ⎝I + 𝑝𝑟 𝜏 Ψ𝑖 D𝑖𝑙 Ψ†𝑖 ⎠ .
( )−1 𝑖∕=𝑗
Ĝ†𝑙𝑙 Ĝ𝑙𝑙 Ĝ†𝑙𝑙
A𝑙 = √ [ , (17)
( )−1 ] Proof: Let f̃𝑗𝑙𝑚 denote the 𝑚-th column of F̃𝑗𝑙 . Similarly,
†
tr Ĝ𝑙𝑙 Ĝ𝑙𝑙 we define h𝑗𝑙𝑚 and ĥ𝑗𝑙𝑚 . From (3), we have
√ 1 √ 1
where Ĝ𝑙𝑙 = 𝑝𝑓 D𝑙𝑙2 Ĥ𝑙𝑙 . However, this zero-forcing pre- f̃𝑗𝑙𝑚 = 𝑝𝑓 D𝑗𝑙2 (h𝑗𝑙𝑚 − ĥ𝑗𝑙𝑚 ),
√ ( √ )
coding or other single-cell precoding methods do not account =
1 1
𝑝𝑓 D𝑗𝑙2 h𝑗𝑙𝑚 − 𝑝𝑟 𝜏 D𝑗𝑙2 Ψ†𝑗 Δ𝑙 y𝑙𝑚 ,
for the training sequence allocation, which is potentially the
right approach to mitigate the pilot contamination problem.
where
We explore this next. ( )−1
𝐿
∑
In order to determine the precoding matrices, we formulate
an optimization problem for each precoding matrix. Consider Δ𝑙 = I + 𝑝𝑟 𝜏 Ψ𝑖 D𝑖𝑙 Ψ†𝑖 ,
the 𝑗-th cell. The signal received by the users in this cell 𝑖=1
given by (4) is a function of all the precoding matrices y𝑙𝑚 is

(used at all the base stations). Therefore, the MMSE-based
and { }𝑀given by (1). For given 𝑗 and 𝑙, it is clear
precoding methods for single-cell setting considered in [31] that f̃𝑗𝑙𝑚 is i.i.d. zero-mean 𝒞𝒩 distributed. Hence,
𝑚=1
[ ]

𝔼 F̃†𝑗𝑙 F̃𝑗𝑙 F̂𝑗𝑙 = 𝛿𝑗𝑙 I𝑀 where U = 𝛼𝑙 R 2 A𝑙 and V = R− 2 F̂†𝑙𝑙 . We have
1 1
{ }
[ ] 2
𝐿 (A𝑙 , 𝛼𝑙 , 𝜆) = ∥U − V∥ − tr F̂𝑙𝑙 R−1 F̂†𝑙𝑙 +
†
𝛿𝑗𝑙 = 𝔼 f̃𝑗𝑙𝑚 f̃𝑗𝑙𝑚 F̂𝑗𝑙 , ( 2 )
{ 1( [ ]) 1 } 𝛼𝑙 + 1 𝐾 − 𝜆. (21)

= 𝑝𝑓 tr D𝑗𝑙2 I𝐾 − 𝔼 ĥ𝑗𝑙𝑚 ĥ†𝑗𝑙𝑚 F̂𝑗𝑙 D𝑗𝑙2 ,
{ 1( ) 1} This can be easily verified by expanding the right hand side.
1 1
= 𝑝𝑓 tr D𝑗𝑙2 I𝐾 − 𝑝𝑟 𝜏 D𝑗𝑙2 Ψ†𝑗 Δ𝑙 Ψ𝑗 D𝑗𝑙2 D𝑗𝑙2 , It is clear from (21) that, for any given 𝛼𝑙 and 𝜆, 𝐿(A𝑙 , 𝛼𝑙 , 𝜆)
{ ( )−1 1 } is minimized if and only if U = V. Hence, we obtain
1 1 1
†
= 𝑝𝑓 tr D𝑗𝑙 I𝐾 + 𝑝𝑟 𝜏 D𝑗𝑙 Ψ𝑗 Λ𝑗𝑙 Ψ𝑗 D𝑗𝑙
2 2 2
D𝑗𝑙2 . 1
A𝑜𝑝𝑡
𝑙 = R−1 F̂†𝑙𝑙 . (22)
𝛼𝑙
The last step follows from matrix inversion lemma. Let 𝐿(𝛼𝑙 , 𝜆) = 𝐿(A𝑜𝑝𝑡 𝑙 , 𝛼𝑙 , 𝜆). Now, we have
The main result of this section is given by the following { } ( )
theorem. This theorem provides a closed-form expression for 𝐿(𝛼𝑙 , 𝜆) = − tr F̂𝑙𝑙 R−1 F̂†𝑙𝑙 + 𝛼2𝑙 + 1 𝐾 − 𝜆. (23)
the multi-cell MMSE-based precoding matrix. ∑
Theorem 6: The optimal solution to the problem (18) is Note that F̂†𝑙𝑙 F̂𝑙𝑙 + 𝛾 2 𝑗∕=𝑙 F̂†𝑗𝑙 F̂𝑗𝑙 can be factorized in the
⎛ ⎞−1 form S† ∑ diag{[𝑐1 𝑐2 . . . 𝑐𝑀 ]}S where S† S = I𝑀 . Let 𝛿 =
2
∑ 𝛿𝑙𝑙 + 𝛾 𝑗∕=𝑙 𝛿𝑗𝑙 . Therefore,
1
A𝑜𝑝𝑡
𝑙 = 𝑜𝑝𝑡 ⎝F̂†𝑙𝑙 F̂𝑙𝑙 + 𝛾 2 F̂†𝑗𝑙 F̂𝑗𝑙 + 𝜂I𝑀 ⎠ F̂†𝑙𝑙 ,
𝛼𝑙 𝑗∕=𝑙
R−1
(20) ( (
where 𝜆 ) )−1
∑ = S† diag{[𝑐1 𝑐2 . . . 𝑐𝑀 ]}S + 𝛿 + 2 I𝑀 ,
𝜂 = 𝛿𝑙𝑙 + 𝛾 2 𝛿𝑗𝑙 + 𝐾, 𝛼𝑙
( {[ 𝜆 𝜆 ]} )−1
𝑗∕=𝑙 = S† diag 𝑐1 + 𝛿 + 2 . . . 𝑐𝑀 + 𝛿 + 2 S ,
𝛼𝑙 𝛼𝑙
𝛿𝑗𝑙 is given by (19) and 𝛼𝑜𝑝𝑡
𝑙 satisfies tr{(A𝑜𝑝𝑡 † 𝑜𝑝𝑡
𝑙 ) A𝑙 } = 1.
{[( 𝜆 )−1 ( 𝜆 )−1 ]}
Proof: First, we simplify the objective function 𝐽(A𝑙 , 𝛼𝑙 ) = S† diag 𝑐1 + 𝛿 + 2 . . . 𝑐𝑀 + 𝛿 + 2 S.
𝛼𝑙 𝛼𝑙
of the problem (18) as follows: (24)
𝐽(A𝑙 , 𝛼𝑙 ) Substituting (24) in (23), we get
[
2 𝑀
∑
= 𝔼 ∥𝛼𝑙 (F𝑙𝑙 A𝑙 q𝑙 + z𝑙 ) − q𝑙 ∥ + 𝑑𝑚
∑ ] 𝐿(𝛼𝑙 , 𝜆) = − 𝜆
+ (𝛼2𝑙 + 1)𝐾 − 𝜆, (25)
2 𝑐 +𝛿+
𝑚=1 𝑚
∥𝛼𝑙 𝛾F𝑗𝑙 A𝑙 q𝑙 ∥ F̂𝑗𝑙 , 𝛼2𝑙
𝑗∕=𝑙
[ where 𝑑𝑚 is the (𝑚, 𝑚)-th entry of SF̂†𝑙𝑙 F̂𝑙𝑙 S† . Consider the
2 equations obtained by differentiating (25) w.r.t. 𝛼𝑙 and 𝜆 and
= 𝔼 ∥(𝛼𝑙 F𝑙𝑙 A𝑙 − I𝐾 ) q𝑙 ∥ +
∑ ] equating to zero:
2
∥𝛼𝑙 𝛾F𝑗𝑙 A𝑙 q𝑙 ∥ F̂𝑗𝑙 + 𝛼2𝑙 𝐾, 𝑀
∑ 𝑑𝑚 1
𝑗∕=𝑙
{ [ ( )2 = 1, (26)
† 𝑚=1 𝑐𝑚 + 𝛿 + 𝜆 𝛼2𝑙
= tr 𝔼 (𝛼𝑙 F𝑙𝑙 A𝑙 − I𝐾 ) (𝛼𝑙 F𝑙𝑙 A𝑙 − I𝐾 ) + 𝛼2𝑙
∑ ]}
𝑀
∑
𝛼2𝑙 𝛾 2 A†𝑙 F†𝑗𝑙 F𝑗𝑙 A𝑙 F̂𝑗𝑙 + 𝛼2𝑙 𝐾, 𝑑𝑚 2𝜆
− ( )2 + 2𝛼𝑙 𝐾 = 0. (27)
{
𝑗∕=𝑙
[ ] 𝑚=1 𝑐𝑚 + 𝛿 + 𝜆 𝛼3𝑙
𝛼2𝑙
= tr 𝛼2𝑙 A†𝑙 𝔼 F†𝑙𝑙 F𝑙𝑙 F̂𝑗𝑙 A𝑙 − 𝛼𝑙 A†𝑙 F̂†𝑙𝑙 − 𝛼𝑙 F̂𝑙𝑙 A𝑙 +
∑ [ ] } ( ) Substituting (26) in (27), we get

𝛼2𝑙 𝛾 2 A†𝑙 𝔼 F†𝑗𝑙 F𝑗𝑙 F̂𝑗𝑙 A𝑙 + 𝛼2𝑙 + 1 𝐾, 𝜆
= 𝐾. (28)
{
𝑗∕=𝑙
( ∑ † 𝛼2𝑙
= tr 𝛼2𝑙 A†𝑙 F̂†𝑙𝑙 F̂𝑙𝑙 + 𝛾 2 F̂𝑗𝑙 F̂𝑗𝑙 − 𝛼𝑙 A†𝑙 F̂†𝑙𝑙 − Combining the results in (22), (24), and (28), we get (20),
(
𝑗∕=𝑙 where 𝛼𝑜𝑝𝑡 is such that ∥A𝑜𝑝𝑡 2
𝑙 ∥𝐹 = 1.
∑ ) ) } ( ) 𝑙
𝛼𝑙 F̂𝑙𝑙 A𝑙 + 𝛿𝑙𝑙 + 𝛾 2 𝛿𝑗𝑙 I𝑀 A𝑙 + 𝛼2𝑙 + 1 𝐾. The precoding described above is primarily suited for
𝑗∕=𝑙
maximizing the minimum of the rates achieved by all the
users. This is because all users are treated equally without
The last step follows from Lemma 5. differentiating them based on the channels. Therefore, when
Now, consider the Lagrangian formulation the performance metric of interest is sum rate, this precoding
( { } ) can be combined with power control, scheduling, and other
𝐿 (A𝑙 , 𝛼𝑙 , 𝜆) = 𝐽 (A𝑙 , 𝛼𝑙 ) + 𝜆 tr A†𝑙 A𝑙 − 1 similar techniques to enhance the net performance. Since our
main concern is the inter-cell interference resulting from pilot
for the problem (18). Let
⎛ ⎞ contamination, and to avoid too complicated systems, we do
∑ ∑ not use that possibility in this paper. In the next section,
𝜆⎠
R = F̂†𝑙𝑙 F̂𝑙𝑙 + 𝛾 2 F̂†𝑗𝑙 F̂𝑗𝑙 + ⎝𝛿𝑙𝑙 + 𝛾 2 𝛿𝑗𝑙 + 2 I𝑀 , all numerical results and comparisons are performed without
𝛼𝑙 power control.
𝑗∕=𝑙 𝑗∕=𝑙
30 5
Contamination, ZF Zero−Forcing (ZF), b = 0.1a
No Contamination, ZF 4.5 Multi−Cell MMSE, b = 0.1a
25 Zero−Forcing (ZF), b = 0.3a
4
Sum Rate (bits/symbol)
Multi−Cell MMSE, b = 0.3a
20 3.5
R (bits/symbol)
3
15
2.5
10
2
5 1.5
1
0
5 10 15 20 25 30 0.5
Number of Antennas M
0
0 0.2 0.4 0.6 0.8 1
Fig. 3. Comparison of sum rate with and without pilot contamination for Cross−gain a
the two-cell system; sum rate saturates in the presence of contamination.
Fig. 5. Comparison of zero-forcing and multi-cell MMSE precoding
60 methods; 𝑎 and 𝑏 correspond to different cross-gains and 𝑅 denotes the
Zero−Forcing minimum rate achieved by all users.
GPS
50 Multi−Cell MMSE
is reused in the 2-nd cell. With 𝜏 = 8, all users in the system
40
are given orthogonal pilots. In Figure 3, we can clearly observe
the saturation of total throughput in the presence of pilot
30
contamination. Note that both scenarios deal with interference.
20
In many practical systems, we cannot necessarily keep 𝜏 large
(𝜏 = 8 in this example) as the coherence interval is typically
10 very short, which requires using small 𝜏 .
In the presence of pilot contamination (𝜏 = 4), neither
0 GPS nor multi-cell MMSE provide noticeable improvement
5 10 15 20 25 30
Number of Antennas M in total throughput (except for small values of 𝑀 ). This is
not surprising as pilot contamination is a very fundamental
Fig. 4. Comparison of different schemes for the two-cell system; Multi-cell problem in this example.7 For 𝑀 = 4 to 𝑀 = 12, the
MMSE performs inter-cell interference mitigation leading to improved sum
rate. improvement using GPS and multi-cell MMSE is reasonable.
This improvement results from using MMSE instead of zero-
VI. N UMERICAL R ESULTS forcing, and hence both GPS and multi-cell MMSE provide
Multi-Cell MMSE precoding denotes the new precoding very similar performance.8
method developed given in (20) with parameter 𝛾 set to unity.6 In the absence of pilot contamination (𝜏 = 8), the story is
ZF precoding denotes the popular zero-forcing precoding different as shown in Figure 4. Multi-cell MMSE outperforms
given in (17). GPS denotes the single-cell precoding method both the single-cell schemes (GPS and zero-forcing) by a
suggested in [31], which is a special case of the precoding huge margin. This is possible as multi-cell MMSE is capable
given in (20) with parameter 𝛾 set to zero. In all the plots, of performing efficient inter-cell interference mitigation. Note
we average the performance metric over 103 i.i.d. channel that this inter-cell interference mitigation is achieved using
realizations. the channel estimates obtained in a distributed manner. This
clearly shows the advantage of multi-cell MMSE. However,
this example only focuses on the scenario without pilot
A. Two-cell System contamination. Therefore, a natural question is whether multi-
We consider a basic two-cell example to understand the cell MMSE can provide throughput gains in a mixed scenario,
impact of pilot contamination on the total system throughput which is addressed next.
(sum rate). In particular, we consider a two-cell system with
𝑝𝑓 = 20 dB, 𝑝𝑟 = 10 dB and 𝐾 = 4 users in both cells. We
B. Multi-cell System
set all the direct gains to 1 and all cross gains to 𝑎, i.e., for
all 𝑘, 𝛽𝑗𝑙𝑘 = 1 if 𝑗 = 𝑙 and 𝛽𝑗𝑙𝑘 = 𝑎 if 𝑗 ∕= 𝑙. We capture the We consider a multi-cell system with 𝐿 = 4 cells, 𝑀 = 8
main observations using two plots. antennas at all base stations, 𝐾 = 2 users in every cell and
First, in Figure 3, we use cross gain value of 𝑎 = 0.8 training length of 𝜏 = 4. We consider 𝑝𝑓 = 20 dB and 𝑝𝑟 = 10
and plot sum rate versus number of antennas 𝑀 with zero- dB. Orthogonal training sequences are collectively used within
forcing for training lengths of 𝜏 = 4 (scenario with pilot the 1-st and 2-nd cells. The training sequences used in the 1-st
contamination) and 𝜏 = 8 (scenario without contamination). (2-nd) cell are reused in the 3-rd (4-th) cell. Thus, we model
With 𝜏 = 4, the orthogonal training sequences used in the 1-st a scenario where training sequences are reused. We keep the
6 In our simulations, we observe that the performance is not very sensitive 7 We can still overcome pilot contamination using frequency/time reuse.
to the value of this parameter. 8 This plot is not provided as it is not the main focus of this paper.
5
Multi−Cell MMSE Zero−Forcing
60
GPS GPS
Multi−Cell MMSE
4
50

40
R (bits/symbol)
30
2
20
1
10
0 0
12 16 20 24 28 32 8 12 16 20 24 28 32
Fig. 6. Comparison of GPS and multi-cell MMSE precoding methods with Fig. 8. Comparison of different precoding methods with 𝑎 = 0.8, 𝑏 = 0.1𝑎,
𝑎 = 0.8 and 𝑏 = 0.1𝑎; 𝑅 denotes the minimum rate achieved by all users. 𝜏 = 8; Each cell has 𝐾 = 4 users; Sum rate corresponds to total sum
throughput of all 𝐿 = 4 cells.
5
GPS ZF and GPS are almost indistinguishable, and hence we have
4.5 Multi−Cell MMSE omitted ZF in these plots.
4 Finally, we consider a system with reused pilots, 𝐾 = 4
users in every cell and training length of 𝜏 = 8. In Figure 8, we
3.5
plot the total sum throughput of different precoding methods
R (bits/symbol)
3 as a function of the number of antennas 𝑀 . In summary,

2.5 all the numerical results show that the new multi-cell MMSE
precoding offers significant performance gains.
2
1.5
VII. C ONCLUSION
1 In this paper, we characterize the impact of corrupted chan-
0.5 nel estimates caused by pilot contamination in TDD systems.
When non-orthogonal training sequences are assigned to users,
0
12 16 20 24 28 32 the precoding matrix used at a (multiple antenna) base station
becomes correlated with the channel to users in other cells (re-
Fig. 7. Comparison of GPS and multi-cell MMSE precoding methods with ferred to as pilot contamination). For a special setting that cap-
𝑎 = 0.8 and 𝑏 = 0.1𝑎; Pilots in cells 1 and 3 (2 and 4) are not reused but tures pilot contamination, we obtain a closed-form expressions
rotated by 45 degrees. for the achievable rates. Using these analytical expressions,
we show that, in the presence of pilot contamination, rates
propagation factors as follows: for all 𝑘, 𝛽𝑗𝑙𝑘 = 1 if 𝑗 = 𝑙, achieved by users saturate with the number of base station
𝛽𝑗𝑙𝑘 = 𝑎 if (𝑗, 𝑙) ∈ {(1, 2), (2, 1), (3, 4), (4, 3)}, and 𝛽𝑗𝑙𝑘 = 𝑏 antennas. We conclude that appropriate frequency/time reuse
for all other values of 𝑗 and 𝑙. “Frequency reuse” is handled techniques have to be employed to overcome this saturation
semi-quantitatively by adjusting the cross-gains. effect. The fact that pilot contamination hasn’t surfaced in
Another performance metric of interest is the minimum FDD studies suggests that researchers are assuming partial
rate achieved by all users denoted by 𝑅 = min𝑗𝑘 𝑅𝑗𝑘 . In CSI with independently corrupted noise, and are not fully
Figure 5, we plot the performance of ZF and multi-cell MMSE incorporating the impact of channel estimation.
precoding methods for different values of 𝑎 and 𝑏. We observe Next, we develop a multi-cell MMSE-based precoding that
significant advantage of using multi-cell MMSE precoding depends on the set of training sequences assigned to the users.
for wide range of values of 𝑎 and 𝑏. In Figure 6, we plot We obtain this precoding as the solution to an optimization
the performance of GPS and multi-cell MMSE precoding problem whose objective function consists of two parts: (i) the
methods as a function of the number of antennas 𝑀 . We also mean-square error of signals received at the users in the same
consider the scenario when pilots in cells 1 and 3 (2 and 4) cell, and (ii) the mean-square interference caused at the users
are not reused but rotated by 45 degrees. This comparison in other cells. We show that this precoding method reduce both
is given in Figure 7. In both cases, we observe significant intra-cell interference and inter-cell interference, and thus is
advantages in using our multi-cell MMSE precoding. Thus, similar in spirit to existing joint-cell precoding techniques.
the multi-cell MMSE scheme is capable of handling different The primary differences between joint-cell precoding and our
scenarios as it utilizes training sequences for precoder design approach are that a.) our approach is distributed in nature and,
to mitigate inter-cell interference even in the presence of pilot b.) we explicitly take into account the set of training sequences
contamination. Note that in both cases the performance of assigned to the users. Through numerical results, we show
that our method outperforms popular single-cell precoding [25] F. Boccardi, F. Tosato, and G. Caire, “Precoding schemes for the MIMO-
methods. GBC,” in Int. Zurich Seminar on Communications, Feb. 2006.
[26] M. Airy, S. Bhadra, R. W. Heath, Jr., and S. Shakkottai, “Transmit pre-
coding for the multiple antenna broadcast channel,” in Proc. Vehicular
R EFERENCES Technology Conference, vol. 3, 2006, pp. 1396–1400.
[1] J. Jose, A. Ashikhmin, T. Marzetta, and S. Vishwanath, “Pilot contami- [27] B. M. Hochwald, C. B. Peel, and A. L. Swindlehurst, “A vector-
nation problem in multi-cell TDD systems,” in Proc. IEEE International perturbation technique for near-capacity multiantenna multiuser com-
Symposium on Information Theory, July 2009, pp. 2184–2188. munication part II: perturbation,” IEEE Trans. Commun., vol. 53, pp.
[2] E. Biglieri, R. Calderbank, A. Constantinides, A. Goldsmith, A. Paulraj, 537–544, Jan. 2005.
and H. V. Poor, MIMO Wireless Communications. Cambridge University [28] M. Stojnic, H. Vikalo, and B. Hassibi, “Rate maximization in multi-
Press, 2010. antenna broadcast channels with linear preprocessing,” IEEE Trans.
[3] M. Sharif and B. Hassibi, “On the capacity of MIMO broadcast channels Wireless Commun., vol. 5, pp. 2338–2342, Sep. 2006.
with partial side information,” IEEE Trans. Inf. Theory, vol. 51, pp. 506– [29] Z. Shen, R. Chen, J. G. Andrews, R. W. Heath, Jr., and B. L. Evans,
522, Feb. 2005. “Low complexity user selection algorithms for multiuser MIMO systems
[4] P. Ding, D. J. Love, and M. D. Zoltowski, “On the sum rate of channel with block diagonalization,” IEEE Trans. Signal Process., vol. 54, pp.
subspace feedback for multi-antenna broadcast channels,” in Proc. IEEE 3658–3663, Sep. 2006.
Global Telecommunications Conference, Nov. 2005, pp. 2699–2703. [30] M. Joham, P. M. Castro, L. Castedo, and W. Utschick, “MMSE optimal
[5] N. Jindal, “MIMO broadcast channels with finite-rate feedback,” IEEE feedback of correlated CSI for multi-user precoding,” in Proc. IEEE
Trans. Inf. Theory, vol. 52, pp. 5045–5060, Nov. 2006. International Conference on Acoustics, Speech and Signal Processing,
[6] T. Yoo, N. Jindal, and A. Goldsmith, “Multi-antenna broadcast channels Mar. 2008, pp. 3129–3132.
with limited feedback and user selection,” IEEE J. Sel. Areas Commun., [31] K. S. Gomadam, H. C. Papadopoulos, and C.-E. W. Sundberg, “Tech-
vol. 25, pp. 1478–1491, 2007. niques for multi-user MIMO with two-way training,” in Proc. IEEE
[7] A. Ashikhmin and R. Gopalan, “Grassmannian packings for efficient International Conference on Communications, May 2008, pp. 3360–
quantization in MIMO broadcast systems,” in Proc. IEEE International 3366.
Symposium on Information Theory, June 2007, pp. 1811–1815. [32] G. Caire, N. Jindal, M. Kobayashi, and N. Ravindran, “Multiuser MIMO
[8] K. Huang, R. W. Heath, Jr., and J. G. Andrews, “Space division achievable rates with downlink training and channel state feedback,”
multiple access with a sum feedback rate constraint,” IEEE Trans. Signal IEEE Trans. Inf. Theory, vol. 56, no. 6, pp. 2845–2866, June 2010.
Process., vol. 55, no. 7, pp. 3879–3891, 2007. [33] T. Kailath, A. H. Sayed, and B. Hassibi, Linear Estimation. Prentice
[9] T. L. Marzetta, “How much training is required for multiuser MIMO?” Hall, 2000.
in Proc. Asilomar Conference on Signals, Systems and Computers, [34] B. Hassibi and B. M. Hochwald, “How much training is needed in
Oct./Nov. 2006, pp. 359–363. multiple-antenna wireless links?” IEEE Trans. Inf. Theory, vol. 49, pp.
[10] J. Jose, A. Ashikhmin, P. Whiting, and S. Vishwanath, “Scheduling and 951–963, Apr. 2003.
pre-conditioning in multi-user MIMO TDD systems,” in Proc. IEEE
International Conference on Communications, May 2008, pp. 4100–
4105. Jubin Jose is currently a Ph.D. candidate in Elec-
[11] B. Vojčić and W. M. Jang, “Transmitter precoding in synchronous trical and Computer Engineering at The University
multiuser communications,” IEEE Trans. Commun., vol. 46, no. 10, pp. of Texas at Austin (UT Austin). He received his
1346–1355, Oct. 1998. B.Tech. degree in Electrical Engineering from the
[12] T. L. Marzetta and B. M. Hochwald, “Fast transfer of channel state Indian Institute of Technology (IIT) Madras in 2006
information in wireless systems,” IEEE Trans. Signal Process., vol. 54, and M.S. degree in Electrical and Computer En-
pp. 1268–1278, 2006. gineering from UT Austin in 2008. His industry
[13] M. Guillaud, D. T. M. Slock, and R. Knopp, “A practical method for experience includes positions at NEC labs, Prince-
wireless channel reciprocity exploitation through relative calibration,” in ton, NJ, Qualcomm Corporate R&D, San Diego,
Proc. ISSPA, 2005. CA and Alcatel-Lucent Bell Labs, Murray Hill,
[14] T. L. Marzetta, “Noncooperative cellular wireless with unlimited num- NJ. His current research interests lie in the broad
bers of base station antennas,” IEEE Trans. Wireless Commun., vol. 9, area of communication networks with emphasis on wireless communication,
no. 11, pp. 3590–3600, 2010. information theory and network optimization. He received the WNCG student
[15] G. J. Foschini, K. Karakayali, and R. A. Valenzuela, “Coordinating mul- leadership award in 2011.
tiple antenna cellular networks to achieve enormous spectral efficiency,”
IEE Proc. Commun., vol. 153, pp. 548–555, Aug. 2006.
[16] S. Venkatesan, A. Lozano, and R. Valenzuela, “Network MIMO: over-
Alexei Ashikhmin is a member of technical staff
coming intercell interference in indoor wireless systems,” in Proc.
in the Communications and Statistical Sciences
Asilomar Conference on Signals, Systems and Computers, Nov. 2007,
Department, Bell Laboratories, Murray Hill, New
pp. 83–87.
Jersey. He received his Ph.D. degree in Electrical
[17] S. Shamai (Shitz), O. Somekh, and B. M. Zaidel, “Multi-cell commu-
Engineering from the Institute of Information Trans-
nications: an information thoeretic perspective,” in Joint Workshop on
mission Problems, Russian Academy of Science,
Communicaitons and Coding, Oct. 2004.
Moscow, Russia, in 1994. In 1995 and 1996, he
[18] H. Zhang and H. Dai, “Co-channel interference mitigation and coop-
was with the Mathematics and Computer Science
erative processing in downlink multicell multiuser MIMO networks,”
Department, Delft University of Technology, The
European J. Wireless Commun. and Networking, pp. 222–235, 4th
Netherlands. From 1997 to 1999, he was a Post-
quarter 2004.
doctoral Fellow at Modeling, Algorithms, and In-
[19] J. Zhang, R. Chen, J. G. Andrews, A. Ghosh, and R. W. Heath, “Net-
formatics Group of Los Alamos National Laboratory. Since 1999, he has
worked MIMO with clustered linear precoding,” IEEE Trans. Wireless
been with Bell Laboratories. In 2005-2007, Alexei Ashikhmin was an adjunct
Commun., vol. 8, no. 4, pp. 1910–1921, 2009.
professor at Columbia University. He taught courses “Error Correcting Codes”
[20] G. Caire and S. Shamai (Shitz), “On the achievable throughput of a
and “Communications Theory.”
multiantenna Gaussian broadcast channel,” IEEE Trans. Inf. Theory,
Alexei Ashikhmin’s research interests include communications theory, the
vol. 49, pp. 1691–1707, July 2003.
theory of error correcting codes, and classical and quantum information
[21] P. Viswanath and D. N. C. Tse, “Sum capacity of the vector Gaus-
theory. From 2003 to 2006, Dr. Ashikhmin served as an Associate Editor for
sian broadcast channel and uplink-downlink duality,” IEEE Trans. Inf.
IEEE Transactions on Information Theory, and is serving again, beginning
Theory, vol. 49, pp. 1912–1921, Aug. 2003.
this year. In 2002, Dr. Ashikhmin received Bell Laboratories President’s
[22] S. Vishwanath, N. Jindal, and A. J. Goldsmith, “Duality, achievable
Gold Award for breakthrough research resulting in the ability to deliver
rates, and sum-rate capacity of Gaussian MIMO broadcast channels,”
unprecedented wireless bit-rates. In 2005, Dr. Ashikhmin was honored by the
IEEE Trans. Inf. Theory, vol. 49, pp. 2658–2668, Oct. 2003.
IEEE Communications S.O. Rice Prize Paper Award for work on LDPC codes
[23] W. Yu and J. M. Cioffi, “Sum capacity of Gaussian vector broadcast
for information transmission with multiple antennas. In 2010, Dr. Ashikhmin
channels,” IEEE Trans. Inf. Theory, vol. 50, no. 9, pp. 1875–1892, 2004.
received Bell Labs Team Award for Crosstalk Cancellation in DSL systems.
[24] H. Weingarten, Y. Steinberg, and S. Shamai (Shitz), “The capacity region
Dr. Ashikhmin is a senior member of IEEE Information Theory Society.
of the Gaussian multiple-input multiple-output broadcast channel,” IEEE
Trans. Inf. Theory, vol. 52, pp. 3936–3964, Sep. 2006.
Thomas L. Marzetta was born in Washington, Special Issue on Space-Time Transmission, Reception, Coding, and Signal
D.C. He received the PhD in electrical engineer- Design (Oct. 2003). Dr. Marzetta was the recipient of the 1981 ASSP Paper
ing from the Massachusetts Institute of Technology Award from the IEEE Signal Processing Society. He was elected a Fellow of
in 1978. His dissertation extended, to two dimen- the IEEE in Jan. 2003.
sions, the three-way equivalence of autocorrelation
sequences, minimum-phase prediction error filters, Sriram Vishwanath received the B.Tech. degree
and reflection coefficient sequences. He worked for in electrical engineering from the Indian Institute
Schlumberger-Doll Research (1978 - 1987) to mod- of Technology (IIT), Madras, India, in 1998, the
ernize geophysical signal processing for petroleum M.S. degree in electrical engineering from Califor-
exploration. He headed a group at Nichols Research nia Institute of Technology (Caltech), Pasadena, in
Corporation (1987 - 1995) which improved auto- 1999, and the Ph.D. degree in electrical engineering
matic target recognition, radar signal processing, and video motion detec- from Stanford University, Stanford, CA, in 2003.
tion. He joined Bell Laboratories in 1995 (formerly part of AT&T, then His industry experience includes positions at Lucent
Lucent Technologies, now Alcatel-Lucent). He has had research supervisory Bell Labs and National Semiconductor Corporation.
responsibilities in communication theory, statistics, and signal processing. He He is currently an Associate Professor in the De-
specializes in multiple-antenna wireless, with a particular emphasis on the partment of Electrical and Computer Engineering,
acquisition and exploitation of channel-state information. He is the originator University of Texas, Austin. His research interests include network informa-
of Large-Scale Antenna Systems which can provide huge improvements in tion theory, wireless systems, and mobile systems. Dr. Vishwanath received
wireless energy efficiency and spectral efficiency. the NSF CAREER award in 2005 and the ARO Young Investigator Award
Dr. Marzetta was a member of the IEEE Signal Processing Society in 2008. He is the co-recipient of the 2005 IEEE Joint Information Theory
Technical Committee on Multidimensional Signal Processing, a member of the Society and Communications Society best paper award. He has served as the
Sensor Array and Multichannel Technical Committee, an associate editor for general chair of the IEEE Wireless Network Coding Conference (WiNC) in
the IEEE T RANSACTIONS ON S IGNAL P ROCESSING, an associate editor for 2010, the local arrangements chair of ISIT 2010, and the Guest Editor-in-
the IEEE T RANSACTIONS ON I MAGE P ROCESSING, and a guest associate Chief of IEEE T RANSACTIONS ON I NFORMATION T HEORY Special Issue
editor for the IEEE T RANSACTIONS ON I NFORMATION T HEORY Special on Interference Networks.
Issue on Signal Processing Techniques for Space-Time Coded Transmissions
(Oct. 2002) and for the IEEE T RANSACTIONS ON I NFORMATION T HEORY

Pilot Conatamination

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Pilot Conatamination

Uploaded by

Copyright:

Available Formats

2640 IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 10, NO.

Pilot Contamination and Precoding in

M ULTIPLE antennas, especially at the base-station, have

given by (4) is a function of all the precoding matrices y𝑙𝑚 is

Sum Rate (bits/symbol)

3 as a function of the number of antennas 𝑀 . In summary,

You might also like