You are on page 1of 12

998 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 34, NO.

4, APRIL 2016

Energy-Efficient Hybrid Analog and Digital


Precoding for MmWave MIMO Systems
With Large Antenna Arrays
Xinyu Gao, Student Member, IEEE, Linglong Dai, Senior Member, IEEE, Shuangfeng Han, Member, IEEE,
Chih-Lin I, Senior Member, IEEE, and Robert W. Heath Jr., Fellow, IEEE

AbstractMillimeter wave (mmWave) MIMO will likely use I. I NTRODUCTION


hybrid analog and digital precoding, which uses a small num-
ber of RF chains to reduce the energy consumption associated
with mixed signal components like analog-to-digital components
not to mention baseband processing complexity. However, most
T HE integration of millimeter-wave (mmWave) and mas-
sive multiple-input multiple-output (MIMO) technique
can achieve orders of magnitude increase in system through-
hybrid precoding techniques consider a fully connected architec- put due to larger bandwidth [1] and higher spectral efficiency
ture requiring a large number of phase shifters, which is also
[2], which makes it promising for future 5G wireless com-
energy-intensive. In this paper, we focus on the more energy-
efficient hybrid precoding with subconnected architecture, and munication systems [3]. On one hand, massive MIMO with a
propose a successive interference cancelation (SIC)-based hybrid very large antenna array (e.g., 256 antennas) at the base station
precoding with near-optimal performance and low complexity. (BS) can simultaneously serve a set of users through the use
Inspired by the idea of SIC for multiuser signal detection, we of precoding [4]. It has been theoretically proved that massive
first propose to decompose the total achievable rate optimiza-
tion problem with nonconvex constraints into a series of simple
MIMO can achieve orders of magnitude increase in spectral
subrate optimization problems, each of which only considers one efficiency, since it can provide more multi-user gain [2]. On the
subantenna array. Then, we prove that maximizing the achievable other hand, mmWave with high frequencies enables such large
subrate of each subantenna array is equivalent to simply seeking antenna array in massive MIMO to be packed in small physical
a precoding vector sufficiently close (in terms of Euclidean dis- dimension [5]. Furthermore, the large antenna array can also
tance) to the unconstrained optimal solution. Finally, we propose
provide sufficient array gain by precoding [6], [7] to overcome
a low-complexity algorithm to realize SIC-based hybrid precod-
ing, which can avoid the need for the singular value decomposition the free-space pathloss of mmWave signals and establish links
(SVD) and matrix inversion. Complexity evaluation shows that the with satisfying signal-to-noise ratio (SNR) [8].
complexity of SIC-based hybrid precoding is only about 10% as For MIMO in conventional cellular frequency band (e.g.,
complex as that of the recently proposed spatially sparse precoding 23 GHz), precoding is entirely realized in the digital domain
in typical mmWave MIMO systems. Simulation results verify that
to cancel interference between different data streams. For a
SIC-based hybrid precoding is near-optimal and enjoys higher
energy efficiency than the spatially sparse precoding and the fully conventional digital precoding, each antenna requires a dedi-
digital precoding. cated energy-intensive radio frequency (RF) chain (including
digital-to-analog converter, up converter, etc.), whose energy
Index TermsMIMO, mmWave communications, hybrid pre- consumption is a large part (about 250 mW per RF chain [9])
coding, energy-efficient, 5G. of the total energy consumption at mmWave frequencies due
to the wide bandwidth. If the conventional digital precoding is
Manuscript received July 27, 2015; accepted December 8, 2015. Date of pub- applied in mmWave massive MIMO system with a large num-
lication March 31, 2016; date of current version May 11, 2016. This work was ber of antennas, the corresponding large number of RF chains
supported in part by the International Science and Technology Cooperation will bring high energy consumption, e.g., 16 W is required by
Program of China (Grant 2015DFG12760), in part by the National Natural
a mmWave massive MIMO system with 64 antennas. To solve
Science Foundation of China (Grant 61571270 and Grant 61271266), in part
by the Beijing Natural Science Foundation (Grant 4142027), and in part by this problem, the hybrid analog and digital precoding has been
the Foundation of Shenzhen Government. This work was presented in part at proposed [10]. The key idea is to divide the conventional digital
the IEEE International Conference on Communications (ICC), London, U.K., precoder into a small-size digital precoder (realized by a small
June 2015.
X. Gao and L. Dai are with the Tsinghua National Laboratory for
number of RF chains) to cancel interference and a large-size
Information Science and Technology (TNList), Department of Electronic analog precoder (realized by a large number of analog phase
Engineering, Beijing 100084, China (e-mail: xy-gao14@mails.tsinghua.edu. shifters (PSs)) to increase the antenna array gain. In this way,
cn; daill@tsinghua.edu.cn). hybrid precoding can reduce the number of required RF chains
S. Han and C.-L. I are with the Green Communication Research
Center, China Mobile Research Institute, Beijing 100053, China (e-mail:
without obvious performance loss, which makes it enjoy a much
hanshuangfeng@chinamobile.com; icl@chinamobile.com). higher energy efficiency than digital precoding [10].
R. W. Heath is with the University of Texas at Austin, Austin, TX 78712 The existing hybrid precoding schemes can be divided into
USA (e-mail: rheath@utexas.edu). two categories. The first category of hybrid precoding based
Color versions of one or more of the figures in this paper are available online
at http://ieeexplore.ieee.org. on spatially sparse precoding was proposed in [11][13], which
Digital Object Identifier 10.1109/JSAC.2016.2549418 formulated the achievable rate optimization problem as a sparse
0733-8716 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
GAO et al.: ENERGY-EFFICIENT HYBRID ANALOG AND DIGITAL PRECODING FOR MmWAVE MIMO SYSTEMS 999

approximation problem and solved it by the orthogonal match-


ing pursuit (OMP) algorithm [14] to achieve the near-optimal
performance. The second category of hybrid precoding based
on codebooks was proposed in [15][17], which involved an
iterative searching procedure among the predefined codebooks
to find the optimal hybrid precoding matrix. However, these
algorithms are all designed for the hybrid precoding with the
fully-connected architecture, where each RF chain is connected
to all BS antennas via PSs. As the number of BS antennas is
very large (e.g., 256 as considered in [11]), the fully-connected
architecture requires thousands of PSs, which may bring three
additional limitations: 1) it consumes more energy for exci-
tation like the giant phased array radar [18]; 2) it requires
more energy to compensate for the insertion loss of PS [18];
3) it involves higher computational complexity, which will also
bring more energy consumption [19]. In contrast, the hybrid
precoding with the sub-connected architecture, where each RF
chain is connected to only a subset of BS antennas, can sig-
nificantly reduce the number of required PSs. Therefore, the
sub-connected architecture is expected to be more energy-
efficient and easier to be implemented for mmWave MIMO
systems. Unfortunately, designing the optimal hybrid precod- Fig. 1. Two typical architectures of the hybrid precoding in mmWave MIMO
ing with the sub-connected architecture is still a challenging systems: (a) Fully-connected architecture, where each RF chain is connected
problem [10], [20], since such architecture changes the con- to all BS antennas; (b) Sub-connected architecture, where each RF chain is
straints on the original problem of hybrid precoding with the connected to only a subset of BS antennas.
fully-connected architecture.
In this paper, we propose a successive interference cancela- Section III specifies the proposed SIC-based hybrid precod-
tion (SIC)-based hybrid precoding with sub-connected archi- ing, together with the complexity evaluation. The simulation
tecture. The contributions of this paper can be summarized as results of the achievable rate and energy efficiency are shown
follows. in Section IV. Finally, conclusions are drawn in Section V.
1) Inspired by the idea of SIC derived for multi-user sig- Notation: Lower-case and upper-case boldface letters denote
nal detection [21], we propose to decompose the total vectors and matrices, respectively; ()T , () H , ()1 , and ||
achievable rate optimization problem with non-convex denote the transpose, conjugate transpose, inversion, and deter-
constraints into a series of simple sub-rate optimization minant of a matrix, respectively; 1 and 2 denote the
problems, each of which only considers one sub-antenna l1 - and l2 -norm of a vector, respectively;  F denotes the
array. Then, we maximize the achievable sub-rate of each Frobenius norm of a matrix; Re{} and Im{} denote the real
sub-antenna array one by one until the last sub-antenna part and imaginary part of a complex number, respectively;
array is considered. E() denotes the expectation; Finally, I N is the N N identity
2) We prove that maximizing the achievable sub-rate of each matrix.
sub-antenna array is equivalent to seeking a precoding
vector which has the smallest Euclidean distance to the
II. S YSTEM M ODEL
unconstrained optimal solution. Based on this fact, we
can easily obtain the optimal precoding vector for each Fig. 1 illustrates two typical architectures for hybrid pre-
sub-antenna array. coding in mmWave MIMO systems, i.e., the fully-connected
3) We further propose a low-complexity algorithm to realize architecture as shown in Fig. 1 (a) and the sub-connected archi-
the SIC-based precoding, which avoids the need for sin- tecture as shown in Fig. 1 (b). In both cases the BS has N M
gular value decomposition (SVD) and matrix inversion. antennas but only N RF chains. From Fig. 1, we observe that the
Complexity evaluation shows that the complexity of SIC- sub-connected architecture will likely be more energy-efficient,
based precoding is only about 10% as complex as that since it only requires N M PSs, while the fully-connected
of the spatially sparse precoding [11] in typical mmWave architecture requires N 2 M PSs. To fully achieve the spatial
MIMO systems. Simulation results verify that the pro- multiplexing gain, the BS usually transmits N independent data
posed SIC-based hybrid precoding is near-optimal and streams to users employing K receive antennas [10].
enjoys higher energy efficiency than the spatially sparse In the sub-connected architecture as shown in Fig. 1 (b),
precoding [11] and the fully digital precoding. N data streams in the baseband are precoded by the dig-
It is worth pointing out that to the best of the authors knowl- ital precoder D. In cases where complexity is a concern,
edge, our work in this paper is the first one that considers the D can be further specialized to be a diagonal matrix as
hybrid precoding design with sub-connected architecture. D = diag [d1 , d2 , . . . , d N ], where dn R for n = 1, 2, . . . , N
The rest of the paper is organized as follows. Section II [10]. Then the role of D essentially performs some power allo-
briefly introduces the system model of mmWave MIMO. cation. After passing through the corresponding RF chain, the
1000 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 34, NO. 4, APRIL 2016

digital-domain signal from each RF chain is delivered to only array structures at the BS and the user, respectively. For the uni-
M PSs [22] to perform the analog precoding, which can be form linear array (ULA) with U elements, the array response
a n C
denoted by the analog weighting vector M1 , whose ele- vector can be presented as [18]
ments have the same amplitude 1/ M but different phases
1 2 2
T
[22]. After the analog precoding, each data stream is finally fULA () = 1, e j d sin() , . . . , e j (U 1) d sin() , (4)
transmitted by a sub-antenna array with only M antennas asso- U
ciated with the corresponding RF chain. Then, the received
where denotes the wavelength of the signal, and d is the
signal vector y = [y1 , y2 , . . . , y K ]T at the user in a narrowband
antenna spacing. Note that here we abandon the subscripts {t, r }
system1 can be presented as
in (3) and we also do not include since the ULA response vec-
tor is independent of the elevation angle. Additionally, when we
y = HADs + n = HPs + n, (1)
consider the uniform planar array (UPA) with W1 and W2 ele-
where is the average received power; H C K N M denotes ments (W1 W2 = U ) on horizon and vertical, respectively, the
the channel matrix, A is the N M N analog precoding matrix array response vector can be given by [18]
comprising N analog weighting vectors {am }m=1
N
as 1 2
fUPA (, ) = 1, . . . , e j d(x sin() sin()+y cos()) , . . . ,
a 1 0 . . . 0 U
0 a 2 T
0 j 2
e d((W1 1) sin() sin()+(W2 1) cos()) , (5)
A= . ,
. . . ...
(2)
..
0 0 . . . a N N MN where 0 x (W1 1) and 0 y (W2 1).

s = [s1 , s2 , . . . , s N ]T represents the transmitted signal vector III. SIC-BASED H YBRID P RECODING FOR MM WAVE
in the baseband. In this paper, we assume the widely used MIMO S YSTEMS
Gaussian signals [10][13], [15][17] with normalized signal In this section, we propose a low-complexity SIC-based
power E(ss H ) = N1 I N , while the practical system with finite- hybrid precoding to achieve the near-optimal performance. The
alphabet inputs [23], [24] will be also briefly discussed in evaluation of computational complexity is also provided to
Section IV. P = AD presents the hybrid precoding matrix of show its advantages over current solutions.
size N M N , which satisfies P F N to meet the total
transmit power constraint [11]. Finally, n = [n 1 , n 2 , . . . , n N ]T A. Structure of SIC-based hybrid precoding
is an additive white Gaussian noise (AWGN) vector, whose
In this paper, we aim to maximize the total achievable rate R
entries follow the independent and identical distribution (i.i.d.)
of mmWave MIMO systems2 , while other criteria such as the
CN(0, 2 ).
max-min fairness criterion [27] are also of interest. Specifically,
It is known that mmWave channel H will not likely fol-
R can be expressed as [11]
low the rich-scattering model assumed at low frequencies due
 
to the limited number of scatters in the mmWave proroga-  H H
R = log2 I K + HPP H  . (6)
tion environment [3]. In this paper, we adopt the geometric N2
Saleh-Valenzuela channel model to embody the low rank and
spatial correlation characteristics of mmWave communications According to the system model (1) in Section II, since
[10][13], [15][17], [25] as the hybrid precoding matrix P can be represented as
P = AD = diag {a1 , . . . , a N } diag {d1 , . . . , d N }, there are

L




three constraints for the design of P:
H= l r lr , lr t lt , lt fr lr , lr ftH lt , lt , Constraint 1: P should be a block diagonal matrix similar
l=1 to the form of A as shown in (2), i.e., P = diag {p 1 , . . . , p N },
(3) where p n = dn a n is the M 1 non-zero vector of the nth
 T
where = NMK
L is a normalization factor, L is the number column pn of P, i.e., pn = 01M(n1) , p n , 01M(N n) ;
of effective channel paths corresponding to the limited number Constraint 2: The non-zero elements of each column of P
of scatters, and we usually have L N for mmWave communi- should have the same amplitude, since the digital precoding
cation systems. l C is the gain of the lth path. lt (lt ) and lr matrix D is a diagonal matrix, and the amplitude of non-zero
(lr ) are the azimuth (elevation) angles
of departure
and
arrival elements of the analog precoding matrix A is fixed to 1/ M;
(AoDs/AoAs), respectively. t lt , lt and r lr , lr denote Constraint 3: The Frobenius norm of P should satisfy
the transmit and receive antenna array gain at a specific AoD P F N to meet the total transmit power constraint, where N
and AoA, respectively.

For simplicity

but without loss of gen- is the number of RF chains equal to the number of transmitted
erality, t lt , lt and r lr , lr can be set
as one within
the
data streams.
range of AoDs/AoAs [26]. Finally, ft lt , lt and fr lr , lr Unfortunately, these non-convex constraints on P make max-
are the antenna array response vectors depending on the antenna imizing the total achievable rate (6) very difficult to be solved.
1 While mmWave systems are expected to be broadband as in prior work [3], 2 The maximization sum-rate criterion can also suppress the interference as
the narrowband system can be regarded as a reasonable first step. The extension much as possible, and the mathematical quantification of such interference will
to broadband system is an interesting topic. be an important topic for future work.
GAO et al.: ENERGY-EFFICIENT HYBRID ANALOG AND DIGITAL PRECODING FOR MmWAVE MIMO SYSTEMS 1001

However, based on the special block diagonal structure of the


hybrid precoding matrix P, we observe that the precoding on
different sub-antenna arrays are independent. This inspires us
to decompose the total achievable rate (6) into a series of sub-
rate optimization problems, each of which only considers one
sub-antenna array.
In particular, we can divide the hybrid precoding matrix P as
P = P N 1 p N , where p N is the N th column of P, and P N 1 is
an N M (N 1) matrix containing the first (N 1) columns
of P. Then, the total achievable rate R in (6) can be rewritten as
 Fig. 2. Diagram of the proposed SIC-based hybrid precoding.

 H H
R = log2 I K + HPP H 
N2

    H H  B. Solution to the Subrate Optimization Problem
= log2 I K + H P N 1 p N P N 1 p N H 
 N2 In this subsection, we focus on the sub-rate optimization
 problem of the nth sub-antenna array, which can be directly
= log2 I K + HP N 1 P NH
1 H
H
N2  applied to other sub-antenna arrays. According to (8), the sub-
H H rate optimization problem of the nth sub-antenna array by
+ Hp p
N N H 
N2 designing the nth precoding vector pn can be stated as
 
= log2 (|T N 1 |) + log2 I K + H H
(a) 1
T Hp p H   H 
N 2 N 1
N N opt
  pn = arg max log2 1 + p n G n1 pn , (9)
(b) H H 1 pn F N2
= log2 (|T N 1 |) + log2 1+ p N H T N 1 Hp N , (7)
N 2
where Gn1 is defined as Gn1 = H H T1 n1 H, F is the set of
where (a) is obtained by defining the auxiliary matrix
all feasible vectors satisfying the three constraints described in
T N 1 = I K + N 2 HP N 1 P NH HH ,
1 and (b) is true
Section III-A. Note that the nth precoding vector pn only has
due to the fact that |I + XY| = |I + YX| by defining
M non-zero elements from the (M(n 1) + 1)th one to the
X = T1N 1Hp N and Y = p N H H H . Note that the second
 (Mn)th one. Therefore, the sub-rate optimization problem (9)
term log2 1 + N 2 p N H H H T1 Hp
N 1 N on the right side of can be equivalently written as
(7) is the achievable sub-rate of the N th sub-antenna array,  
H
while the first term log2 (|T N 1 |) shares the same form as opt
p n = arg max log2 1 + p n Gn1 p n , (10)
(6). This observation implies that we can further decompose
p n F N 2

log2 (|T N 1 |) using the similar method in (7) as


 H  where F includes all possible M 1 vectors satisfying
H 1 n1 of size M M is the cor-
log2 (|T N 2 |)+log2 1+ p N 1 H T N 2 Hp N 1 . Constraint 2 and Constraint 3, G
N2
responding sub-matrix of Gn1 by only keeping the rows and
Then, after N such decompositions, the total achievable rate R columns of Gn1 from the (M(n 1) + 1)th one to the (Mn)th
in (6) can be presented as one, which can be presented as

N  H H 1  n1 = RGn1 R H = RH H T1 HR H ,
G (11)
R= log2 1 + p H T Hpn , (8) n1
N2 n n1
 
n=1
where R = 0 MM(n1) I M 0 MM(N n) is the correspond-
where we have Tn = I K + N 2 HPn PnH H H and T0 = I N . ing selection matrix.
From (8), we observe that the total achievable rate optimization Define the singular value decomposition (SVD) of the
problem can be transformed into a series of sub-rate optimiza- Hermitian matrix G n1 = V V H , where is an
n1 as G
tion problems of sub-antenna arrays, which can be optimized M M diagonal matrix containing the singular values of G n1
one by one3 . After that, inspired by the idea of SIC for multi- in a decreasing order, and V is an M M unitary matrix. It is
user signal detection [21], we can optimize the achievable known that the optimal unconstrained precoding vector of (10)
sub-rate of the first sub-antenna array and update the matrix T1 . is the first column v1 of V, i.e., the first right singular vector
Then, the similar method can be utilized to optimize the achiev- of G n1 [11]. However, according to the constraints mentioned
opt
able sub-rate of the second sub-antenna array. Such procedure in Section III-A, we cannot directly choose p n as v1 since the
will be executed until the last sub-antenna array is considered. elements of v1 do not obey the constraint of same amplitude
Fig. 2 shows the diagram of the proposed SIC-based hybrid pre- (i.e., Constraint 2). To find a feasible solution to the sub-rate
coding. Next, we will discuss how to optimize the achievable optimization problem (10), we need to further convert (10) into
sub-rate of each sub-antenna array. another form, which is given by the following Proposition 1.
3 Note that different precoding orders of sub-antenna arrays will lead to the
Proposition 1: The optimization problem (10)
same performance, since the total achievable rate R in (8) can be exactly repre-  H 
opt
sented by the summation of the sub-rate of each sub-antenna array without any p n = arg max log2 1 + p G n1
p n
performance loss.
p n F N2 n
1002 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 34, NO. 4, APRIL 2016

is equivalent to the following problem opt


Note that for all sub-antenna arrays, the optimal solution p n
opt for n = 1, 2, . . . , N have a similar form. Thus, we can conclude
p n = arg min v1 p n 22 , (12)
that
p n F 
 opt 2  
P  = 
opt
, . . . ,
opt 2
where v1 is the first right singular vector of G n1 . F  diag p 1 p
FN  N, (17)

Proof: See Appendix A.  which demonstrates that the total transmit power constraint
Proposition 1 indicates that we can find a feasible precod- (Constraint 3) is satisfied.
opt
ing vector p n , which is sufficiently close (in terms of Euclidean After we have acquired p n for the nth sub-antenna
distance) to the optimal but unpractical precoding vector v1 , to array, the matrices Tn = I K + N 2 HPn PnH H H (8) and
maximize the achievable sub-rate of the nth sub-antenna array. G n = RH H T1 H
n HR (11) can be updated. Then, the method
Since p n = dn a n according to (1), the target v1 p n 22 in (12) described above for the nth sub-antenna array can be reused
can be rewritten as again to optimize the achievable sub-rate of the (n + 1)th sub-
antenna array. To sum up, solving the sub-rate optimization
v1 p n 22
problem of the nth sub-antenna array consists of the following
= (v1 dn a n ) H (v1 dn a n ) three steps.
  Step 1: Execute the SVD of G n1 to obtain v1 ;
= v1H v1 + dn2 a nH a n 2dn Re v1H a n
Step 2: Let p n = M1 v1 1 e jangle(v1 ) as the optimal solution
opt

(a)
  to the current nth sub-antenna array;
= 1 + dn2 2dn Re v1H a n
Step 3: Update matrices Tn = I K + N 2 HPn PnH H H and
  2    2  n = RH H T1 H for the next (n + 1)th sub-
= dn Re v1H a n + 1 Re v1H a n , (13) G n HR
antenna array.
opt
Note that although we can obtain the optimal solution p n by
where (a) is obtained based on the facts that v1H v1 = 1 and the method above, we need to compute the SVD of Gn1 (Step
a nH a n = 1, since v1 is the first column of the unitary n (Step 3) involving the matrix inversion
matrix V 1) and the matrix G
and each element of a n has the same amplitude 1/ M. of large size, which leads to high computational complexity as
From (13), we observe that the distance between p n and v1 well as high energy consumption for computation [19]. To this

2
consists of two parts. The first one is dn Re v1H a n , which end, next we will propose a low-complexity algorithm to obtain
H
opt
can be minimized by choosing dn = Re v1 a n . The p n without the complicated SVD and matrix inversion.
 to zero H
2 
second one is 1 Re v1 a n , which can be minimized
 H
 C. Low-Complexity Algorithm to Obtain the Optimal Solution
by maximizing Re v a n . Note that both a n and v1 have a
1
fixed power of one, i.e., v1H v1 = 1 and a nH a n = 1. Therefore, We start by considering how to avoid the SVD involving high


the optimal a n to maximize Re v1H a n  is computational complexity as well as a large number of divi-
opt
sions, which are difficult to be implemented in hardware. We
1 observe from Step 1 that the SVD of G n1 does not need to
a n = e jangle(v1 ) ,
opt
(14) be computed to acquire  and V, as only the first column v1
M opt
of V is enough to obtain p n . This observation inspires us to
where angle(v1 ) denotes the phase vector of v1 , i.e., each exploit the simple power iteration algorithm [28], which is used
opt
element of a n shares the same phase as the corresponding to compute the largest eigenvalue and the corresponding eigen-
opt
element of v1 4 . Accordingly, the optimal choice of dn is vector of a diagonalizable matrix. Since G n1 is a Hermitian
    v 
matrix, it follows that: 1) Gn1 is also a diagonalizable matrix;
1 n1 are the
dn = Re v1H a n = Re v1H e jangle(v1 ) = 1 . (15)
opt 1
2) The singular values (right singular vectors) of G
M M same as the eigenvalues (eigenvectors). Therefore, the power
opt iteration algorithm can be also utilized to compute v1 as well as
Based on (14) and (15), the optimal solution p n to the opti- n1 with low complexity.
the largest singular value 1 of G
mization problem (12) (or equivalently (10)) can be obtained by
More specifically, as shown by the pseudo-code in
1 Algorithm 1, the power iteration algorithm starts with an ini-
v1 1 e jangle(v1 ) .
opt opt opt
p n = dn a n = (16) tial solution u(0) C M1 , which is usually set as [1, 1, . . . , 1]T
M
without loss of generality [28]. In each iteration, it first com-
It is worth pointing out that v1 is the first column of the uni- putes the auxiliary vector z(s) = G n1 u(s1) (s is the number
tary matrix V, each element vi of v1 (for i = 1, . . . , M) has of iterations) and then extracts the element of z(s) having
 
 opt 2 the largest amplitude as m (s) . After that, u(s) is updated as
the amplitude less than one. Therefore, we have p n  1.
2 (s)
u(s) = mz (s) for the next iteration. The power iteration algorithm
4 It is worth pointing out that the analog precoding vector a can be also
n will stop until the number of iterations
 reaches the predefined
restricted to a DFT vector to save the overhead of quantization for limited feed-
back systems [11]. However, since more constraints are set on the design of
number S. Finally, m (S) and u(S) /u(S) 2 will be output as the
analog precoding, such scheme may lead to some performance loss compared largest singular value 1 and the first right singular vector v1
to the proposed one (14). of G n1 , respectively.
GAO et al.: ENERGY-EFFICIENT HYBRID ANALOG AND DIGITAL PRECODING FOR MmWAVE MIMO SYSTEMS 1003

Algorithm 1. Power iteration algorithm Proof: See Appendix B. 


Proposition 2 implies that we can simply exploit 1 and v1
Input: (1) G n1 ; that have been obtained by Algorithm 1 as described above to
(2) Initial solution u(0) ; update G n , which only involves one vector-to-vector multipli-
(3) Maximum number of iteration S cation instead of the complicated matrix-to-matrix multiplica-
for 1 s S tion and matrix inversion. Note that the evaluation of computa-
1) z(s) = G n1 u(s1)
  tional complexity will be discussed in detail in Section III-E.
(s)  (s) 
2) m = arg max z i 
(s) D. Summary of the Proposed SIC-Based Hybrid Precoding
zi
3) if 1 s 2 Based on the discussion so far, the pseudo-code of the
n (s) = m (s) proposed SIC-based hybrid precoding can be summarized in
else
2 Algorithm 2, which can be explained as follows. The proposed
m (s) m (s2) m (s1)
n (s) = m (s) 2m (s1) +m (s2) SIC-based hybrid precoding starts by computing the largest sin-
end if gular value 1 and first right singular vector v1 of G n1 , which
(s)
4) u(s) = nz (s) is achieved by Algorithm 1. After that, according to Section III-
end for B, the optimal precoding vector for the nth sub-antenna array
Output: (1) The largest singular value 1 = n (S) can be obtained by utilizing v1 . Finally, based on Proposition 2,
(S)
(2) The first singular vector v1 = uu(S)  G n can be updated with low complexity for the next iteration.
2 This procedure will be executed until the last (N th) sub-antenna
array is considered. Finally, after N iterations, the optimal dig-
According to [28], we know that ital, analog, and hybrid precoding matrices D, A, and P can be
  s  obtained, respectively.
(s) 2
m = 1 1 + O , (18)
1 Algorithm 2. SIC-based hybrid precoding

where 2 is the second largest singular value of G n1 . From Input: G 0


(s)
(18), we conclude that m will converges to 1 as long as for 1 n N
1 = 2 . Similarly, when 1 = 2 , u(s) /u(s) 2 will also 1) Compute v1 and 1 of G n1 by Algorithm 1
opt ) opt v1 1
converge to v1 , i.e., 2) a n = e
1 jangle(v 1 , d
n = ,
M M
opt
p n = 1
v  jangle(v1 ) (14)-(16)
u(s) 1 1 e
lim m (s) = 1 ,
M
lim  (s)  = v1 . (19)
2 v1 v1H
u 
s s
2 3) Gn = Gn1 N1+
2 1


(Proposition 2)
1
N2
Although the power iteration algorithm is convergent, its con- end for  
opt opt
vergence rate may be slow if 1 2 based on (18). To solve Output: (1) D = diag d1 , . . . , d N
this problem, we propose to utilize the Aitken acceleration  
opt opt
method [29] to further increase the convergence rate of the (2) A = diag a 1 , . . . , a N
power iteration algorithm. Specifically, we can compute (3) P = AD
 (s)
n = m (s) , for 1 s 2,

2 (20)
m (s) m (s2) m (s1) It is worth pointing out that the idea of SIC-based hybrid
n (s) = m (s) 2m (s1) +m (s2) , for 2 < s S. precoding can be also extended to the combining at the user
following the similar logic in [11]. When the number of RF
Then, u(s) and 1 will be correspondingly changed to chains at the BS is smaller than that at the user, we first
(s)
u(s) = nz (s) and 1 = n (S) , respectively. compute the optimal hybrid precoding matrix P according to
Next, we will focus on how to reduce the complex- Algorithm 2, where we assume that the combining matrix
ity to compute the matrices Tn = I K + N 2 HPn PnH H H and Q = I. Then, given the effective channel matrix HP, we can
G n = RH H T1 H
n HR , which involve the complicated matrix- similarly obtain the optimal hybrid combining matrix Q by
to-matrix multiplication and matrix inversion of large size. In referring to Algorithm 2, where the input G 0 and the optimal
particular, with some standard mathematical manipulations, the unconstrained solution v1 should be correspondingly replaced.
computation of G n can be significantly simplified as shown by Conversely, when the number of RF chains at the BS is larger
the following Proposition 2. than that at the user, we can assume P = I and obtain the
Proposition 2: The matrix G n = RH H T1 H
n HR , where optimal hybrid combining matrix Q. After that, the optimal pre-
Tn = I K + N 2 HPn PnH H H , can be simplified as coding matrix P can be acquired given the effective channel
matrix QH. Additionally, to further improve the performance,

2v vH
N2 1 1 1 we can combine the above method with the Ping-pong algo-
G n1
n G , (21)
1 + N 2 1 rithm [22], which involves an iteration procedure between the
BS and the user, to jointly seek the optimal hybrid precoding
where 1 and v1 are the largest singular value and first right and combining matrices pair. Further discussion about hybrid
n1 , respectively.
singular vector of G combining will be left for future work.
1004 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 34, NO. 4, APRIL 2016

TABLE I
C OMPLEXITY C OMPARISON

E. Complexity Evaluation
In this subsection, we provide the complexity evaluation
of the proposed SIC-based hybrid precoding in terms of the
required numbers of complex multiplications and divisions.
From Algorithm 2, we observe that the complexity of SIC-
based hybrid precoding comes from the following four parts: Fig. 3. Achievable rate comparison for an N M K = 64 16 (N = 8)
1) The first one originates from the computation of mmWave MIMO system.
G 0 = RH H HR H according to (11). Note that R is a
selection matrix and H has the size K N M. Therefore,
this part involves K M 2 times of multiplications without IV. S IMULATION R ESULTS
any division. In this section, we provide the simulation results of
2) The second one is from executing Algorithm 1. It is the achievable rate and energy efficiency to evaluate the
observed that in each iteration we need to compute performance of the proposed SIC-based hybrid precoding. We
a matrix-to-vector multiplication z(s) = G n1 u(s1)
compare the performance of SIC-based hybrid precoding with
together with the Aitken acceleration method

(20). the recently proposed spatially sparse precoding [11] and the
Therefore, we totally require S M 2 + 2 4 and optimal unconstrained precoding based on the SVD of the
(2S 2) times of multiplications and divisions, channel matrix, which are both with fully-connected archi-
respectively. tecture. Additionally, we also include the conventional analog
3) The third one stems from acquiring the optimal solution precoding [30] and the optimal unconstrained precoding (i.e.,
opt
p n in step 2 of Algorithm 2. We find that this part is opt
p n = v1 ) which are both with sub-connected architecture as
quite simple, which only needs 2 times of multiplications benchmarks for comparison.
without any division, since v1 has been obtained and 1 The simulation parameters are described as follows. We gen-
M
is a fixed constant. erate the channel matrix according to the channel model [31]
4) The last one comes from the update of G n . According described in Section II. The number of effective channel paths
to Proposition 2, we know that this part mainly involves is L = 3 [11]. The carrier frequency is set as 28GHz [15]. Both
a outer product v1 v1H . Thus, it requires M 2 times of the transmit and receive antenna arrays are ULAs with antenna
multiplications with only one division. spacing d = /2. Since the BS usually employs the directional
To sum up, the proposed SIC-based
hybrid precoding antennas to eliminate interference and increase antenna gain
approximately requires O M 2 (N S + K ) times of multiplica- [3], theAoDs are assumed to follow the uniform distribution
tions and O (2N S) times of divisions. Table I provides the com- within 6 , 6 . Meanwhile, due to the random position of
plexity comparison between SIC-based hybrid precoding and users, we assume that the AoAs follow the uniform distribution
the recently proposed spatially sparse
precoding [11], which within [, ], which means the omni-directional antennas are
requires O N 4 M + N 2 L 2 + N 2 M 2 L times of multiplications adopted by users. Furthermore, we set the maximum number of

and O 2N 3 times of divisions. Here, L is the number of effec- iterations S = 5 to run Algorithm 2. Finally, SNR is defined
tive channel paths as defined in (3). Considering the typical as 2 .
mmWave MIMO system with N = 8, M = 8, K = 16, L = 3 Firstly, we consider the perfect channel state information
[11], we observe that the complexity of SIC-based hybrid pre- (CSI) scenario. Fig. 3 shows the achievable rate comparison in
coding is about 4 103 times of multiplications and 102 times mmWave MIMO system, where N M K = 64 16 and the
of divisions, where we set S = 5 (note that S 5 is usually suf- number of RF chains is N = 8. We observe from Fig. 3 that
ficient to guarantee the performance, which is verified through the proposed SIC-based hybrid precoding outperforms the con-
intensive simulations). By contrast, the complexity of the spa- ventional analog precoding with sub-connected architecture in
tially sparse precoding is about 5 104 times of multiplications whole simulated SNR range. Meanwhile, Fig. 3 also verifies the
and 103 times of divisions. Therefore, the proposed SIC-based near-optimal performance of SIC-based hybrid precoding, since
hybrid precoding enjoys much lower complexity, which is only it can achieve about 99% of the rate achieved by the optimal
about 10% as complex as that of the spatially sparse precoding. unconstrained precoding with sub-connected architecture.
GAO et al.: ENERGY-EFFICIENT HYBRID ANALOG AND DIGITAL PRECODING FOR MmWAVE MIMO SYSTEMS 1005

Fig. 4. Achievable rate comparison for an N M K = 128 32 (N = 16) Fig. 5. Achievable rate comparison against the numbers of BS and user
mmWave MIMO system. antennas (N M = K ), where N = 8 and SNR = 0 dB.

Fig. 4 compares the achievable rate in mmWave MIMO sys-


tem with N M K = 128 32 and N = 16, where we observe
similar trends as those from Fig. 3. More importantly, Fig. 3
and Fig. 4 show that the performance of SIC-based hybrid pre-
coding is also close to the spatially sparse precoding and the
optimal unconstrained precoding with fully-connected archi-
tecture. For example, when SNR = 0 dB, our method can
achieve more than 90% of the rate achieved by the near-optimal
spatially sparse precoding in both simulated mmWave MIMO
configurations. Considering the low computational complex-
ity of the proposed SIC-based hybrid precoding as analyzed
before, we further conclude that SIC-based hybrid precoding
can achieve much better trade-off between the performance and
computational complexity.
Fig. 5 provides a achievable rate comparison in mmWave
MIMO systems against the numbers of BS and user anten-
Fig. 6. Achievable rate comparison against the number of user antennas K ,
nas, where N M = K , the number of RF chains is fixed to where N M = 64, N = 8, and SNR = 0 dB.
N = 8, and SNR = 0 dB. We find that the performance of
the proposed SIC-based hybrid precoding can be improved by Next we evaluate the impact of imperfect CSI on the pro-
increasing the number of BS and user antennas, which involves posed SIC-based hybrid precoding. The estimated channel
much lower energy consumption than increasing the number of can be modeled as [4]
matrix (imperfect CSI) H
energy-intensive RF chains [18].
Fig. 6 shows the achievable rate comparison against the num-
H = H+ 1 2 E, (22)
bers of user antennas K , where N M = 64, N = 8, and SNR =
0 dB. We imply from Fig. 6 that the performance loss of SIC- where H is the actual channel matrix, [0, 1] presents the
based hybrid precoding due to the sub-connected architecture CSI accuracy, and E is the error matrix with entries follow-
can be compensated by increasing the number of user antennas ing the distribution i.i.d. CN (0, 1). Fig. 7 shows the achievable
K . For example, the achievable rate of SIC-based hybrid pre- rate comparison for an N M K = 64 16 (N = 8) mmWave
coding when K = 30 is the same as that of the spatially sparse MIMO system, where the perfect CSI and the imperfect CSI
precoding when K = 20. Note that in this case, the required with different scenarios are considered. We observe that the
number of PSs of SIC-based hybrid precoding is N M = 64, proposed SIC-based hybrid precoding is not sensitive to the CSI
while for the spatially sparse precoding, the number of required accuracy. For example, the achievable rate of SIC-based hybrid
PSs is N 2 M = 512. That means much energy can saved by precoding when = 0.9 is quite close to that in the perfect CSI
SIC-based hybrid precoding, which will be also verified by scenario, where the SNR gap is about 1 dB. Even when the CSI
simulation results later. In contrast, the cost of increasing the accuracy is quite poor (i.e., = 0.5), SIC-based hybrid pre-
number of user antennas K will be negligible since the energy coding with imperfect CSI can still achieve more than 88% of
consumption of user antenna is usually small [18]. the rate in the perfect CSI scenario. Additionally, Fig. 8 shows
1006 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 34, NO. 4, APRIL 2016

Fig. 7. Impact of imperfect CSI on SIC-based hybrid precoding for an Fig. 9. Achievable rate of SIC-based hybrid precoding with finite-alphabet
N M K = 64 16 (N = 8) mmWave MIMO system. inputs, where N = 8, N M = K = 64, and BSPK is adopted.

u m,k is defined as
expectation with respect to the noise vector n,

  
HP sm sk + n 2 n 2
u m,k = 2 2
, (24)
2
H and P are the channel matrix and precoding matrix, respec-
tively, 2 is the power of noise, and sm is a possible signal
vector with N elements taking values from Q.
From (23) we know that the achievable rate of practical sig-
naling is quite different from ideal Gaussian signaling, where
the upper bound is determined by N log2 |Q| [24], [32]. Fig. 9
shows the achievable rate of SIC-based hybrid precoding with
finite-alphabet inputs, where N M K = 64 16, N = 8, and
the simple BPSK modulation (|Q| = 2) is considered as an
example. We can observe that as the SNR becomes large, the
proposed SIC-based hybrid precoding can also achieve the
Fig. 8. Impact of imperfect CSI on SIC-based hybrid precoding for an performance quite close to the upper bound N log2 |Q| = 8bits.
N M K = 128 32 (N = 16) mmWave MIMO system. In the end, we evaluate the energy efficiency of the proposed
SIC-based hybrid precoding. Based on the energy consumption
model in [33], [34], the energy efficiency can be defined as
the achievable rate comparison for an N M K = 128 32
(N = 16) mmWave MIMO system, where similar conclusions R R
as those from Fig. 7 can be derived. = = (bps/Hz/W) , (25)
Ptotal Pt + NRF PRF + NPS PPS
After that, we will also evaluate the proposed SIC-based
hybrid precoding in practical systems with finite-alphabet sig- 
where Ptotal = Pt + NRF PRF + NPS PPS is the total energy con-
nals instead of ideal Gaussian signals. Here, we also aim to
sumption, Pt is the transmitted energy, PRF is the energy
maximize the achievable sum-rate, since it has been shown in
consumed by RF chain, PPS is the energy consumed by PS
[23, Section IV] that maximizing the achievable sum-rate is an
(including the energy for the excitation and the energy for the
excellent criterion for precoding, and it also has direct impact
compensation of insertion loss [18]), NRF and NPS are the
on the coded bit error rate performance. For the finite-alphabet
numbers of required RF chains and PSs, respectively.
signals s, whose values are taken from a practical constellation
In this paper, we use the practical values PRF = 250 mW [9],
Q, the achievable rate R can be presented as [23]
PPS = 1 mW [18], and Pt = 1 W (about 30 dBm) in a small
cell transmission scenario [35], since mmWave is more likely
1
|Q|

N
|Q|

N
to be applied in small cells. Fig. 10 shows the energy effi-
R = N log2 |Q| E
n log e u m,k
, (23) ciency comparison against the number of RF chains N , where
|Q| N m=1 2

k=1 SNR = 0 dB, N M = K = 64 (N = 1, 2, 4, . . . , 64 to ensure
that M is an integer). We observe that both the conventional
where N is the number of RF chains (also the number of trans- spatially sparse precoding and the proposed SIC-based pre-
mitted data streams), |Q| is the cardinality of Q, En denotes the coding can achieve higher energy efficiency than the optimal
GAO et al.: ENERGY-EFFICIENT HYBRID ANALOG AND DIGITAL PRECODING FOR MmWAVE MIMO SYSTEMS 1007

Rn in (26) can be rewritten as


 H 
Rn = log2 1 + p n G n1
p n
 N2 
H
= log2 1 + p n VV H p n
 N 2

= log2 1 +
N2
  
1 0
p n [v1 V2 ]
H
[v1 V2 ] p n
H
0 2
 H
= log2 1 + p v1 1 v1H p n
N2 n 
H
+
p n V 2  2 V H
2
p n . (28)
N2
Since we aim to find a vector p n sufficiently close to v1 , it is
reasonable to assume that p n is approximately orthogonal to the
matrix V2 , i.e., p nH V2 0 [11]. Then, (28) can be simplified as
 
1 H
Rn log2 1 +
p v v H
1 1 n p
Fig. 10. Energy efficiency comparison against the numbers of RF chains N , N2 n
where N M = K = 64.  
(a) 1
= log2 1 +
unconstrained precoding (also known as the fully digital pre- N2
!   "
coding), especially when the number of RF chains N is limited 1 1 1  
(e.g., N < 30). Besides, we also observe that the proposed SIC- + log2 1 1 + 1 p n v1 v1 p n
H H
N2 N2
based precoding is more energy efficient than the conventional  
(b) 1  
spatially sparse precoding.
log2 1 + + log 2 p H
n v 1 v H
1
p n (29)
N2
V. C ONCLUSIONS where (a) is obtained by using
the formula
In this paper, we proposed a SIC-based hybrid precoding I + XY = (I + X) I (I + X)1 X (I Y) [11], where
1
with sub-connected architecture for mmWave MIMO systems. we define X = N 2
and Y =
p n 1 1
H v vH p
n ; (b) is valid by
We first showed that the total achievable rate optimization employing the high SNR approximation [36], i.e.,
problem with non-convex constraints can be decomposed into  
1 1 1
a series of sub-rate optimization problems, each of which 1+ 1. (30)
only considers one sub-antenna array. Then, we proved that N2 N2
the sub-rate optimization problem of each sub-antenna array From (29), we observe that maximizing Rn is equivalent to
 2
can be solved by simply seeking a precoding vector suffi- maximizing p nH v1 v1H p n = p nH v1 2 , the square of inner prod-
ciently close to the unconstrained optimal solution. Finally, a uct between two vectors p n and v1 . Note that v1 is a fixed vector.
low-complexity algorithm was proposed to realize SIC-based Therefore, exploring a vector p n , which has the largest projec-
precoding without the complicated SVD and matrix inver- tion on v1 , will lead to the smallest Euclidean distance to v1
sion. Complexity evaluation showed that the complexity of the as well. Based on this fact, we conclude that the optimization
proposed SIC-based hybrid precoding is only about 10% of problem (10) is equivalent to the following problem
that of the recently proposed spatially sparse precoding with opt
fully-connected architecture in typical mmWave MIMO sys- p n = arg min v1 p n 22 . (31)

p n F
tem. Simulation results verified the near-optimal performance
and high energy efficiency of the proposed SIC-based hybrid
precoding. Our further work will focus on the limited feedback
A PPENDIX B
scenario, where the angles of PSs are quantified.
P ROOF OF P ROPOSITION 2
A PPENDIX A We first consider the matrix Tn = I K + N 2 HPn PnH H H ,
P ROOF OF P ROPOSITION 1 which should n (11). By partitioning
 be inversed
 to compute G
Define the target of the optimization problem (10) as Pn as Pn = Pn1 pn , Tn can be rewritten as
 
H Tn = I K + HPn PnH H H
Rn = log2 1 + p Gn1
p n , (26) N2
N2 n
  H
= IK + H Pn1 pn Pn1 pn H H
and the SVD of G n1 as G n1 = VV H . Then, by separating N 2

the matrices  and V into two parts: = IK + H
HPn1 Pn1 HH + Hpn pnH H H
  N 2 N2
1 0
= , V = [v1 V2 ] , (27) = Tn1 + Hpn pnH H H . (32)
0 2 N2
1008 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 34, NO. 4, APRIL 2016

Then, by utilizing the Sherman-Morrison formula [[28], [5] L. Wei, R. Q. Hu, Y. Qian, and G. Wu, Key elements to enable mil-
Eq 2.1.4] limeter wave communications for 5G wireless systems, IEEE Wireless
Commun., vol. 21, no. 6, pp. 136143, Dec. 2014.
 1 A1 uvT A1
[6] M. Samimi et al., 28 GHz angle of arrival and angle of departure analysis
A + uvT = A1 , (33) for outdoor cellular communications using steerable beam antennas in
1 + vT A1 u New York City, in Proc. IEEE Veh. Technol. Conf. (VTC Spring13),
May 2013, pp. 16.
[7] A. Alkhateeb, J. Mo, N. Gonzlez-Prelcic, and R. Heath, MIMO pre-
T1
n can be presented as coding and combining solutions for millimeter-wave systems, IEEE
 1 Commun. Mag., vol. 52, no. 12, pp. 122131, Dec. 2014.
T1
n = Tn1 + Hp p
n n
H H
H [8] B. Yin et al., High-throughput beamforming receiver for millimeter
N2 wave mobile communication, in Proc. IEEE Global Commun. Conf.
(GLOBECOM13), Dec. 2013, pp. 36973702.
T Hpn pnH H H T1
1
= T1 N 2 n1 n1 [9] P. Amadori and C. Masouros, Low RF-complexity millimeter-wave
n1
. (34)
p H H H T1
beamspace-MIMO systems by beam selection, IEEE Trans. Commun.,
1+ N2 n n1 Hpn vol. 63, no. 6, pp. 22122222, Jun. 2015.
[10] S. Han, C.-L. I, Z. Xu, and C. Rowell, Large-scale antenna systems with
Substituting (34) into Gn = H H T1
n H, we have
hybrid precoding analog and digital beamforming for millimeter wave
5G, IEEE Commun. Mag., vol. 53, no. 1, pp. 186194, Jan. 2015.
[11] O. El Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. Heath, Spatially
Gn = H H T1
n H sparse precoding in millimeter wave MIMO systems, IEEE Trans.
! 1
"
T Hpn pnH H H T1
Wireless Commun., vol. 13, no. 3, pp. 14991513, Mar. 2014.
1 2 n1 n1 [12] Y. Lee, C.-H. Wang, and Y.-H. Huang, A hybrid RF/baseband precod-
= H H
Tn1 H H 1
H ing processor based on parallel-index-selection matrix-inversion-bypass
1 + 2 pn H Tn1 Hpn simultaneous orthogonal matching pursuit for millimeter wave MIMO
H systems, IEEE Trans. Signal Process., vol. 63, no. 2, pp. 305317, Jan.
2 Gn1 pn pn Gn1
= Gn1 . (35) 2015.
1 + 2 pnH Gn1 pn [13] C.-E. Chen, An iterative hybrid transceiver design algorithm for millime-
ter wave MIMO systems, IEEE Wireless Commun. Lett., vol. 4, no. 3,
pp. 285288, Jun. 2015.
n can be obtained by
Then, according to (11), G [14] J. A. Tropp and A. C. Gilbert, Signal recovery from random measure-
ments via orthogonal matching pursuit, IEEE Trans. Inf. Theory, vol. 53,
n = RGn R H
G no. 12, pp. 46554666, Dec. 2007.
! HG
" [15] W. Roh et al., Millimeter-wave beamforming as an enabling technol-
G n1 p n p n1 ogy for 5G cellular communications: Theoretical feasibility and prototype
R Gn1 N H
2 n
= RH results, IEEE Commun. Mag., vol. 52, no. 2, pp. 106113, Feb. 2014.
1 + N 2 pn Gn1 pn [16] T. Kim, J. Park, J.-Y. Seol, S. Jeong, J. Cho, and W. Roh, Tens of Gbps

support with mmWave beamforming systems for next generation com-
G p p H G munications, in Proc. IEEE Global Commun. Conf. (GLOBECOM13),
= n1 N 2 n1 n n n1 .
G (36) Dec. 2013, pp. 36853690.
1 + N 2 p nH G
n1 p n [17] C. Kim, J. S. Son, T. Kim, and J.-Y. Seol, On the hybrid beamforming
with shared array antenna for mmWave MIMO-OFDM systems, in Proc.
Note that in Section III-B, we have obtained the precoding vec- IEEE WCNC14), Apr. 2014.
[18] C. A. Balanis, Antenna Theory: Analysis and Design. Hoboken, NJ, USA:
tor p n sufficiently close to v1 , i.e., p n v1 . Thus, (36) can be Wiley, 2012.
well approximated by replacing p n with v1 as [19] D. Schneider, Could supercomputing turn to signal processors (again)?
IEEE Spectr., vol. 49, no. 10, pp. 1314, Oct. 2012.

G p p H G
N 2 n1 n n n1
[20] S. Han, C.-L. I, Z. Xu, and S. Wang, Reference signals design for hybrid
G n1
n =G analog and digital beamforming, IEEE Commun. Lett., vol. 18, no. 7,
1 + N 2 p nH G
n1 p n pp. 11911193, Jul. 2014.
[21] Y.-C. Liang, E. Y. Cheu, L. Bai, and G. Pan, On the relationship

G v vH G
N 2 n1 1 1 n1
between MMSE-SIC and BI-GDFE receivers for large multiple-input
n1
G multiple-output channels, IEEE Trans. Signal Process., vol. 56, no. 8,
n1 v1
1 + N 2 v1 G
H
pp. 36273637, Aug. 2008.
[22] S. Hur, T. Kim, D. Love, J. Krogmeier, T. Thomas, and A. Ghosh,
(a) 2v vH
N2 1 1 1 Millimeter wave beamforming for wireless backhaul and access in small
n1
=G , (37)
1 + N 2 1 cell networks, IEEE Trans. Commun., vol. 61, no. 10, pp. 43914403,
Oct. 2013.
[23] C. Xiao, Y. R. Zheng, and Z. Ding, Globally optimal linear precoders
where (a) is true due to fact that v1H G n1
n1 = 1 v H , since G for finite alphabet signals over complex vector Gaussian channels, IEEE
1 Trans. Signal Process., vol. 59, no. 7, pp. 33013314, Apr. 2011.
is an Hermitian matrix. [24] Y. Wu, C. Xiao, X. Gao, J. D. Matyjas, and Z. Ding, Linear precoder
design for MIMO interference channels with finite-alphabet signaling,
IEEE Trans. Commun., vol. 61, no. 9, pp. 37663780, Sep. 2013.
R EFERENCES [25] F. Zhu, F. Gao, M. Yao, and H. Zou, Joint information- and jamming-
beamforming for physical layer security with full duplex base station,
[1] T. Bai, A. Alkhateeb, and R. Heath, Coverage and capacity of IEEE Trans. Signal Process., vol. 62, no. 24, pp. 63916401, Dec. 2014.
millimeter-wave cellular networks, IEEE Commun. Mag., vol. 52, no. 9, [26] A. Alkhateeb, O. El Ayach, G. Leus, and R. W. Heath, Hybrid precoding
pp. 7077, Sep. 2014. for millimeter wave cellular systems with partial channel knowledge, in
[2] T. L. Marzetta, Noncooperative cellular wireless with unlimited numbers Proc. IEEE Inf. Theory Appl. Workshop (ITA13), 2013, pp. 15.
of base station antennas, IEEE Trans. Wireless Commun., vol. 9, no. 11, [27] M. Tao and R. Wang, Linear precoding for multi-pair two-way MIMO
pp. 35903600, Nov. 2010. relay systems with max-min fairness, IEEE Trans. Signal Process.,
[3] Z. Pi and F. Khan, An introduction to millimeter-wave mobile broadband vol. 60, no. 10, pp. 53615370, Oct. 2012.
systems, IEEE Commun. Mag., vol. 49, no. 6, pp. 101107, Jun. 2011. [28] G. H. Golub and C. F. Van Loan, Matrix Computations. Baltimore, USA:
[4] F. Rusek et al., Scaling up MIMO: Opportunities and challenges with JHU Press, 2012.
very large arrays, IEEE Signal Process. Mag., vol. 30, no. 1, pp. 4060, [29] . Bjrck, Numerical methods in matrix computations, Berlin,
Jan. 2013. German: Springer, 2015.
GAO et al.: ENERGY-EFFICIENT HYBRID ANALOG AND DIGITAL PRECODING FOR MmWAVE MIMO SYSTEMS 1009

[30] O. El Ayach, R. W. Heath, S. Rajagopal, and Z. Pi, Multimode pre- Chih-Lin I (SM03) received the Ph.D. degree
coding in millimeter wave MIMO transmitters with multiple antenna in electrical engineering from Stanford University,
sub-arrays, in Proc. IEEE Global Commun. Conf. (GLOBECOM13), Stanford, CA, USA. She has been working at mul-
Dec. 2013, pp. 34763480. tiple world-class companies and research institutes
[31] A. Alkhateeb, O. El Ayach, G. Leus, and R. Heath, Channel estimation leading the R&D, including AT&T Bell Labs; AT&T
and hybrid precoding for millimeter wave cellular systems, IEEE J. Sel. HQ, ITRI of Taiwan, and ASTRI of Hong Kong.
Topics Signal Process., vol. 8, no. 5, pp. 831846, Oct. 2014. Currently, she is a China Mobiles Chief Scientist
[32] Y. Wu, C. Xiao, Z. Ding, X. Gao, and S. Jin, Linear precoding for finite- of wireless technologies and has established the
alphabet signaling over MIMOME wiretap channels, IEEE Trans. Veh. Green Communications Research Center, spearhead-
Technol., vol. 61, no. 6, pp. 25992612, Jul. 2012. ing major initiatives including key 5G technology
[33] S. Cui, A. J. Goldsmith, and A. Bahai, Energy-constrained modulation R&D; high energy efficiency system architectures,
optimization, IEEE Trans. Wireless Commun., vol. 4, no. 5, pp. 2349 technologies and devices; green energy; and C-RAN and soft base stations.
2360, Sep. 2005. Her research interests include green communications, C-RAN, network con-
[34] C. Masouros, M. Sellathurai, and T. Rantarajah, Computationally effi- vergence, bandwidth refarming, EE-SE co-design, massive MIMO, and active
cient vector perturbation using thresholded optimization, IEEE Trans. antenna arrays. She was an elected Board Member of the IEEE ComSoc, the
Commun., vol. 61, no. 5, pp. 18801890, May 2013. Chair of the ComSoc Meetings and Conferences Board, and the Founding
[35] T. S. Rappaport, J. N. Murdock, and F. Gutierrez, State of the art in 60- Chair of the IEEE WCNC Steering Committee. She is currently an Executive
GHz integrated circuits and systems for wireless communications, Proc. Board Member of GreenTouch and a Network Operator Council Member
IEEE, vol. 99, no. 8, pp. 13901436, Aug. 2011. of ETSI NFV. She was the recipient of the IEEE T RANSACTIONS ON
[36] D. Tse and P. Viswanath, Fundamentals of Wireless Communication. C OMMUNICATIONS Stephen Rice Best Paper Award and is a winner of the
Cambridge, U.K.: Cambridge Univ. Press, 2005. CCCP National 1000 Talent program.

Xinyu Gao (S14) received the B.E. degree in Robert W. Heath, Jr. (S96M01SM06
communication engineering from Harbin Institute F11) received the B.S. and M.S. degrees from the
of Technology, Heilongjiang, China, in 2014. He University of Virginia, Charlottesville, VA, USA, in
is currently pursuing the Ph.D. degree in elec- 1996 and 1997, respectively, and the Ph.D. degree
tronic engineering at Tsinghua University, Beijing, from Stanford University, Stanford, CA, USA, in
China. His research interests include massive MIMO 2002, all in electrical engineering. From 1998 to
and mmWave communications, with the empha- 2001, he was a Senior Member of the Technical Staff
sis on signal detection and precoding. He has then a Senior Consultant with Iospan Wireless Inc.,
authored several journal and conference papers pub- San Jose, CA, USA, where he worked on the design
lished in IEEE J OURNAL ON S ELECTED A REAS and implementation of the physical and link layers of
IN C OMMUNICATIONS , IEEE T RANSACTIONS ON the first commercial MIMO-OFDM communication
V EHICULAR T ECHNOLOGY, IEEE ICC, IEEE GLOBECOM, etc. He was system. Since January 2002, he has been with the Department of Electrical
the recipient of the National Scholarship in 2015. and Computer Engineering, University of Texas at Austin, Austin, TX, USA,
where he is a Cullen Trust for Higher Education Endowed Professor, and is the
Director of the Wireless Networking and Communications Group. He is also
Linglong Dai (M11SM14) received the B.S.
the President and the CEO of MIMO Wireless Inc. and the Chief Innovation
degree from Zhejiang University, Hangzhou, China,
Officer at Kuma Signals LLC. His research interests include several aspects of
in 2003, the M.S. degree (with the highest Hons.)
wireless communication and signal processing: limited feedback techniques,
from the China Academy of Telecommunications multihop networking, multiuser and multicell MIMO, interference alignment,
Technology (CATT), Beijing, China, in 2006, and
adaptive video transmission, manifold signal processing, and millimeter wave
the Ph.D. degree (with the highest Hons.) from
communication techniques.
Tsinghua University, Beijing, China, in 2011. From
Dr. Heath has been an Editor of the IEEE T RANSACTIONS ON
2011 to 2013, he was a Postdoctoral Fellow with C OMMUNICATION, an Associate Editor of the IEEE T RANSACTIONS ON
the Department of Electronic Engineering, Tsinghua
V EHICULAR T ECHNOLOGY, a Lead Guest Editor of the IEEE J OURNAL
University, where he has been an Assistant Professor
ON S ELECTED A REAS IN C OMMUNICATIONS special issue on Limited
since July 2013 and now an Associate Professor. His
Feedback Communication, and a Lead Guest Editor of the IEEE J OURNAL ON
research interests include wireless communications, with a focus on multi- S ELECTED T OPICS IN S IGNAL P ROCESSING special issue on Heterogenous
carrier techniques, multiantenna techniques, and multiuser techniques. He has
Networks. He currently serves on the steering committee for the IEEE
authored more than 50 IEEE journal papers and more than 30 IEEE conference
T RANSACTIONS ON W IRELESS C OMMUNICATIONS. He was a member of
papers. He currently serves as a Co-Chair of the IEEE Special Interest Group
the Signal Processing for Communications Technical Committee in the IEEE
(SIG) on Signal Processing Techniques in 5G Communication Systems. He was Signal Processing Society and is a former Chair of the IEEE COMSOC
the recipient of the Outstanding Ph.D. Graduate of Tsinghua University Award
Communications Technical Theory Committee. He was a technical Co-Chair
in 2011, the Excellent Doctoral Dissertation of Beijing Award in 2012, the IEEE
for the 2007 Fall Vehicular Technology Conference, a General Chair of the
ICC Best Paper Award in 2013, the National Excellent Doctoral Dissertation
2008 Communication Theory Workshop, a General Co-Chair, a Technical
Nomination Award in 2013, the IEEE ICC Best Paper Award in 2014, the URSI
Co-Chair and a Co-Organizer of the 2009 IEEE Signal Processing for
Young Scientists Award in 2014, the IEEE Transactions on Broadcasting Best
Wireless Communications Workshop, a Local Co-Organizer for the 2009 IEEE
Paper Award in 2015, the IEEE RADIO Young Scientists Award in 2015.
CAMSAP Conference, a Technical Co-Chair for the 2010 IEEE International
Symposium on Information Theory, the Technical Chair for the 2011 Asilomar
Shuangfeng Han (M06) received the M.S. and Conference on Signals, Systems, and Computers, the General Chair for the
Ph.D. degrees in electrical engineering from 2013 Asilomar Conference on Signals, Systems, and Computers, a founding
Tsinghua University, Beijing, China, in 2002 and General Co-Chair for the 2013 IEEE GlobalSIP conference, and is a Technical
2006, respectively. He joined Samsung Electronics Co-Chair for the 2014 IEEE GLOBECOM conference. He was a 2003 Frontiers
as a Senior Engineer, in 2006, working on MIMO, in Education New Faculty Fellow. He is also a licensed amateur radio operator
multi-BS MIMO, etc. Since 2012, he has been a and is a registered Professional Engineer in Texas. He was the recipient of the
Senior Project Manager in the Green Communication of Best Student Paper Awards at the IEEE VTC 2006 Spring, WPMC 2006,
Research Center, China Mobile Research Institute. IEEE GLOBECOM 2006, IEEE VTC 2007 Spring, and IEEE RWS 2009, as
His research interests include green 5G, mas- well as the corecipient of the Grand Prize in the 2008 WinTech WinCool Demo
sive MIMO, full duplex, NOMA and EE-SE Contest. He was also the corecipient of the 2010 and 2013 EURASIP Journal
co-design. Currently, he is the Vice Chair of Wireless on Wireless Communications and Networking Best Paper Awards and the 2012
Technology Work Group of Chinas IMT-2020 (5G) promotion group. Signal Processing Magazine Best Paper Award.

You might also like