You are on page 1of 15

1

Sum-Rate Maximization for Linearly Precoded


Downlink Multiuser MISO Systems with Partial
CSIT: A Rate-Splitting Approach
Hamdi Joudeh and Bruno Clerckx
arXiv:1602.09028v2 [cs.IT] 24 Aug 2016

Abstract—This paper considers the Sum-Rate (SR) maximiza- questionable. Therefore, considerable effort has been devoted
tion problem in downlink MU-MISO systems under imperfect to the characterization and improvement of the performance
Channel State Information at the Transmitter (CSIT). Contrary in the presence of CSIT uncertainties.
to existing works, we consider a rather unorthodox transmission
scheme. In particular, the message intended to one of the users is It is well established that linear transmission strategies, e.g.
split into two parts: a common part which can be recovered by Zero Forcing Beamforming (ZF-BF), achieve the optimum
all users, and a private part recovered by the corresponding user. Degrees of Freedom (DoF) of the MISO Broadcast Channel
On the other hand, the rest of users receive their information (BC) under perfect CSIT [2], [3]. It is also possible to maintain
through private messages. This Rate-Splitting (RS) approach the full DoF in the presence of imperfect CSIT given that the
was shown to boost the achievable Degrees of Freedom (DoF)
when CSIT errors decay with increased SNR. In this work, errors decay with increased Signal to Noise Ratio (SNR) as
the RS strategy is married with linear precoder design and O(SNR−1 ) [4]–[6]. For example, it has been shown that em-
optimization techniques to achieve a maximized Ergodic SR ploying ZF-BF in limited channel feedback systems achieves
(ESR) performance over the entire range of SNRs. Precoders are the maximum DoF as long as the feedback rate increases
designed based on partial CSIT knowledge by solving a stochastic linearly with the SNR in dB, satisfying an error decay rate
rate optimization problem using means of Sample Average Ap-
proximation (SAA) coupled with the Weighted Minimum Mean of O(SNR−1 ) [5]. However, maintaining such quality can be
Square Error (WMMSE) approach. Numerical results show that exhausting in terms of resources required for CSIT acquisition.
in addition to the ESR gains, the benefits of RS also include Furthermore, the CSIT accuracy is expected to drop due
relaxed CSIT quality requirements and enhanced achievable rate to the Doppler effect, feedback delays and/or mismatches
regions compared to conventional transmission with NoRS. across multiple uses of the channel. While understanding
Index Terms—MISO-BC, ergodic sum-rate, degrees of free- the MISO-BC capacity region under CSIT imperfections is
dom, sample average approximation, WMMSE approach. still far from complete, considerable progress has been made
towards characterizing the achievable and optimum DoF – as
I. I NTRODUCTION an asymptotic tractable alternative – for a variety of imperfect
CSIT scenarios [7]–[10]. In this work, we focus on the case

T HE utilization of multiple antennas at the Base Sta-


tion (BS) combined with multiple single-antenna mobile
devices tremendously increases the spectral efficiencies of
where imperfect instantaneous CSIT is available, with errors
decaying as O(SNR−α ) for some constant α ∈ [0, 1]. Such
errors are detrimental to the DoF achieved using conven-
wireless networks. High multiplexing gains are realized with tional transmission schemes initially designed assuming per-
far less restrictions on the scattering environment compared fect CSIT. For example, ZF-BF achieves a fraction α of the full
to point-to-point Multiple Input Multiple Output (MIMO) DoF [5], [6]. Therefore we consider a rather unconventional
systems [1]. However, this comes with a price of higher restric- Rate-Splitting (RS) transmission strategy.
tions imposed on the quality of the Channel State Information The message intended to one of the users (labeled as the
(CSI) required at the BS, specifically in the Downlink (DL) RS-user) is split into two parts: a common part decoded by all
mode. This stems from the necessity to deal with the inter- users, and a private part decoded by the corresponding user.
ference through preprocessing at the BS, as receivers cannot On the other hand, the information intended to the remaining
coordinate. While precise CSI at the Receivers (CSIR) can users is sent through private messages. At the receivers, the
be obtained through DL training, the ability to provide highly common message is decoded first by treating all private signals
accurate and up-to-date CSI at the Transmitter (CSIT) remains as noise. Then, each receiver decodes its private message
after removing the common message from the received signal.
This work is partially supported by the U.K. Engineering and Physical Sci-
ences Research Council (EPSRC) under grant EP/N015312/1. A preliminary This RS strategy, inspired by similar techniques used for the
version of this paper was presented at the IEEE International Conference on Interference Channel (IC) [11], was recently shown to achieve
Communications (ICC), London, U.K., June 2015. a DoF gain of 1 − α over conventional transmission (relying
H. Joudeh is with the Communications and Signal Processing group, De-
partment of Electrical and Electronic Engineering, Imperial College London, solely on the transmission of private messages and termed
London SW7 2AZ, U.K. (email: hamdi.joudeh10@imperial.ac.uk) NoRS here) in the MISO-BC [7], [8]. An interpretation of this
B. Clerckx is with the Communications and Signal Processing group, De- result in the context of limited feedback and Random Vector
partment of Electrical and Electronic Engineering, Imperial College London,
London SW7 2AZ, U.K., and also with the School of Electrical Engineering, Quantization (RVQ) of the channel vectors is given in [12],
Korea University, Seoul 136-701, Korea (e-mail: b.clerckx@imperial.ac.uk). where the RS rate performance is analysed and compared to
2

its NoRS counterpart. that a conditional Average SR (ASR) metric is maximized.


While RS in the MISO-BC is fairly understood from an This metric corresponds to the average performance w.r.t CSIT
information theoretic point of view (at least in the DoF errors for a given channel estimate, and can be computed by
sense), it is considerably less treated and analysed in the the BS using imperfect instantaneous knowledge. To solve the
transceiver/precoder design and optimization literature. For stochastic ASR problem, it is first converted into a determin-
example, none of the aforementioned works considers an istic counterpart using the Sampling Average Approximation
optimized design of precoders. One reason is that theoretical (SAA) method [26]. The deterministic problem is then solved
analysis is made possible by simpler designs, e.g. random pre- using the Weighted Minimum MSE (WMMSE) approach [14].
coding for the common message and ZF-BF for private mes- A simplified method to approximate the stochastic problem
sages [12], as it is easier to derive closed-form expressions of based on the conservative method in [19], [20] is also pro-
performance metrics. Such closed-forms are usually sufficient posed, and it limitations are discussed. The benefits of em-
for DoF analysis, where the asymptotically high SNR regime ploying the RS strategy are demonstrated through simulations.
is considered. However, this is not the case for finite SNRs These include enhanced rate performances and relaxed CSIT
where the design of precoders is highly influential. Precoder quality requirements. Moreover, the two-user RS rate region is
optimization for Multi-User (MU) MISO/MIMO systems has numerically obtained by solving a sequence of Weighted ASR
been considered in a number of works under a variety of met- (WASR) problems, where the splitting procedure is extended
rics, design objectives and constraints, and CSI assumptions to both users. Examining such achievable rate region brings us
[13]–[22]. In general, problems that involve Sum-Rate (SR) one step closer to understanding the behaviour of the MISO-
expressions in such setups are known to be very challenging BC’s capacity region under partial CSIT.
to solve in their raw forms due to the non-convexity arising It should be noted that part of the results presented here were
from decentralized receivers with coupled rate expressions1. reported in a preliminary version of this paper [27]. Since then,
Nevertheless, the novel relationship proposed in [14] enables a number of RS design problems have been addressed, namely:
the reformulation of SR problems into equivalent special achieving max-min fairness [28], robust transmission under
forms of Weighted Mean Square Error (WMSE) problems. bounded CSIT errors [29], and the application to massive
This unveils a block-wise convexity property which can be MIMO [30]. In a recent work, a topological RS strategy is
exploited using Alternating Optimization (AO). proposed for MISO networked systems [31].
Contributions: With the aim of maximizing the SR perfor- Organization: The rest of the paper is organized as follows.
mance in MU-MISO systems with partial CSIT, this paper The system model and CSIT assumptions are described in
marries the RS strategy with precoder design and optimization Section II. In Section III, the problem is formulated and DoF
techniques. In doing so, we generalize previous works on SR analysis is carried out. The SAA based WMMSE algorithm
maximization under imperfect CSIT in several directions: is proposed in Section IV, then the conservative WMMSE
• Firs, we consider a general channel fading model where algorithm is describe in Section V. Simulation results are
the channel state, and its corresponding estimate at the presented in Section VI, and Section VII concludes the paper.
BS, are allowed to change throughout the transmission Notation: Boldface uppercase letters denote matrices, bold-
according to some stationary process. This model con- face lowercase letters denote column vectors and standard
tains as special cases the Rician-fading scenario in [19]– letters denote scalars. The superscrips (·)T and (·)H denote
[21] where a fixed channel estimate (e.g. line-of-sight) transpose and conjugate-transpose (Hermitian) operators, re-
throughout the transmission was considered, and the fast- spectively. tr(·) and diag(·) are the trace and diagonal entries
fading scenario in [24] where the instantaneous channel respectively. k · k is the Euclidian norm. EX {·} denotes the
estimate is completely suppressed. A feasible solution is expectation w.r.t the random variable X.
defined as a sequence of precoders, with one for each
incoming channel estimate. II. S YSTEM M ODEL
• Second, the adoption of the RS strategy ensures that the
proposed design performs at least as well as the corre- Consider a MU-MISO system operating in DL, where a
sponding NoRS-based design across the entire range of BS equipped with Nt antennas serves a set of single-antenna
SNRs. This follows from the fact that NoRS can be seen users K , {1, . . . , K}, where K ≤ Nt . The signal received
as a special (or restricted) case of RS. We demonstrate by the kth user in a given channel use (time or frequency) is
that RS designs outperform conventional NoRS designs described as
at high SNRs through DoF analysis. y k = hH
k x + nk (1)
In order to average out the effects of CSIT errors, we where hk ∈ CNt is the channel vector between the BS
consider the Ergodic SR (ESR) achieved over a long sequence and the kth user, x ∈ CNt is the transmit signal, and
of fading states as the main performance metric [25]. Given nk ∼ CN (0, σn,k2
) is the Additive White Gaussian Noise
partial CSIT, the ESR is maximized by updating the precoding (AWGN). The input signal is subject to the power constraint
matrix according to the incoming channel state estimate such E{xH x} ≤ Pt . Without loss of generality, we assume equal
2
1 This can be resolved by resorting to asymptotic regimes, e.g. high SNR
noise variances across users, i.e. σn,k = σn2 , ∀k ∈ K. The
or large antenna arrays [23]. From a design and optimization point of view, transmit SNR writes as SNR , Pt /σn . Moreover, σn2 is non-
2

this work is mainly concerned with finite SNRs and small antenna arrays. zero and fixed. Hence, SNR → ∞ is equivalent to Pt → ∞.
3

A. Channel State Information design of Pp . Unfortunately, this is not possible when the
We assume a fading model where the channel state, given by BS experiences high CSIT uncertainty. Hence, we resort to
H , [h1 , . . . , hK ], varies during the transmission according to the unconventional RS transmission model, where part of
 the interference is broadcasted such that it is decoded and
an ergodic stationary process with probability density fH H .
Receivers are assumed to estimate and track their channel vec- cancelled by all users before decoding their own streams [12].
tors with high accuracy, i.e. perfect CSIR. The BS on the other The RS scheme is described as follows. The message
hand has an imperfect instantaneous channel estimate given by intended to the kRS th user (the RS-user) is split into a common
Hb , [hb1, . . . , h
b K ]. This is typically obtained through Uplink part Wc ∈ Wc and a private part WkRS ∈ WkRS , where
(UL) training in Time Division Duplex (TDD) systems [32] Wc × WkRS = Wt,kRS . Wc is encoded into the stream
or quantized feedback in Frequency Division Duplex (FDD) sc using a common (or public) codebook, and hence is
systems [33]. Hence, the joint decoded by all users, while Wt,1 , . . . , WkRS , . . . , Wt,K are
 fading process is characterized encoded into the private stream s1 , . . . , sK in the conventional
by the joint distribution of H, H b , assumed to be stationary
and ergodic [25]. For a given estimate, the estimation error manner. The K + 1 streams are linearly precoded using
matrix is denoted by H e , [h e1, . . . , h
e K ], from which we write P , pc , p1 , . . . , pK , where pc ∈ CNt is the common
b e
the relationship H = H + H. The CSIT error is characterized precoder. The resulting transmit signal writes as
by the conditional density fH|H b H |H .
b K
X
Taking each user separately, the marginal density of the kth  x = Ps = pc sc + pi s i (2)
channel conditioned on its estimate writes as fhk |bhk hk | h bk . i=1
The mean of the distribution is assumed to be given by the
bk } = h b k , and E b {hk hH | where s , [sc , s1 , . . . , sK ]T ∈ CK+1 groups the symbols in a
estimate, i.e. Ehk |bhk {hk | h k H
hk |h k given channel use. Assuming that E{ss  } = I, the transmit
bk } = h
h bk hb H + Re,k , where Re,k is the kth user’s CSIT H
k power constraint reduces to tr PP ≤ Pt . When |Wc | = 0,
error covariance matrix assumed to be independent of h bk. i.e. no splitting is carried out, it is natural to allocate zero
The CSIT quality is allowed to scale with SNR [5], [7]. power to the common precoder, and the signal in (2) simply
In particular, the maximum entry of diag (Re,k ) scales as reduces to the conventional transmit signal. While the model
O(Pt−α ), where α ∈ [0, ∞) is some constant.  Equivalently, we in (2) can be seen as a super-position of MU beamforming
2
write σe,k = O(Pt−α ), where σe,k 2
, Eehk kh e k k2 = tr(Re,k ) [34] and multicast beamforming [35]2 , the transmission of
is the corresponding average CSIT error power. The constant a common message has a fundamentally different purpose
log(σ2 )
α , limPt →∞ − log(Pe,k t)
is known as the quality scaling here. In particular, multicast transmission sends information
factor (or exponent), which quantifies the CSIT quality as requested by, and intended to, all users in the system. On the
SNR grows large. For example, α → ∞ corresponds to other hand, the common message in RS encapsulates part of
2 2
perfect CSIT as σe,1 , . . . , σe,K → 0. The opposite extreme of the RS-user’s private message, which is decoded by all users
α = 0 represents a fixed quality w.r.t SNR, e.g. a constant in the system for interference mitigation and performance
number of quantization (and hence feedback) bits in FDD enhancement purposes as we see later in this paper3 .
systems. A finite non-zero α corresponds to CSIT quality
that improves with increased SNR, e.g. by increasing the
C. SINRs and Rates
number of feedback bits. The exponent is truncated such that
α ∈ [0, 1], which is customary in DoF analysis as α = 1 At the kth receiver, the average receive power for a given
corresponds to perfect CSIT in the DoF sense [7]. It is worth channel state is written as
highlighting that α assumes various practical interpretations, c,k S Sk Ik
e.g. in addition to the aforementioned channel quantization  z }| { z }| { zX }| {
and feedback interpretation [5], [12], it can also correspond to Tc,k , E |yk | = |hk pc | + |hk pk |2 +
2 H 2 H
|hH p
k i | 2
+ σ 2
n.
i6=k
the Doppler process in delayed/outdated CSIT [6], [7]. | {z }
Ic,k =Tk
(3)
B. Rate-Splitting and Transmit Signal Model Each receiver decodes two streams, the common stream and
The BS wishes to communicate the independent mes- its corresponding private stream. The common stream is first
sages Wt,1 , . . . , Wt,K , drawn from the message sets decoded by treating interference from all private signals as
Wt,1 , . . . , Wt,K , to users 1, . . . , K respectively. In conven- noise. Successive Interference Cancellation (SIC) is then used
tional linearly precoded systems, Wt,1 , . . . , Wt,K are indepen- to remove the common signal from yk , in order to improve
dently encoded into the private symbol streams s1 , . . . , sK the detectability of the private stream. This is followed by
respectively. Symbols are mapped to the transmit antennas decoding the private stream in the presence of the remaining
 
through a precoding matrix denoted by Pp , p1 , . . . , pK , interference. The instantaneous (for a given channel state)
where pk ∈ CNt is the kth precoding vector. This yields the Signal to Interference plus Noise Ratios (SINRs) of the
PK
linear transmit signal model described as x = k=1 pk sk . 2 A BC with both private messages and a common message from an
At each receiver, the desired stream is decoded while treating
information theoretic point of view [11].
interference from all other streams as noise. It is, therefore, 3 Note that decoding other user(s) data at the physical layer does not
the BS’s duty to minimize MU interference through a proper necessarily breach secrecy as encryption may be implemented at higher layers.
4

common stream and the private stream at the output of the disappears as precoders for different channel states are no
kth receiver are given by longer coupled. As a result, the maximization in (6) moves
−1 inside the expectation, and the optimum solution is obtained
γc,k , Sc,k Ic,k and γk , Sk Ik−1 . (4)
by optimizing Pp separately for each H, i.e.
Assuming Gaussian codebooks, the instantaneous achievable K
information rates for the common stream and the kth private X
max Rk (7)
stream, from the point of view of the kth user, write as tr Pp PH
p ≤Pt k=1

Rc,k = log2 (1 + γc,k ) and Rk = log2 (1 + γk ). (5)


which corresponds to the problem in [14]. Due to its tractabil-
We assume that the transmission is delay-unlimited, and hence ity, we adopt the short-term power constraint.
channel coding (from messages to symbols) can be preformed
over a long sequence of channel states [36]. This is considered
A. Average Sum Rate Maximization
to achieve an average CSIT error performance as we see in
the next section. Precoders on the other hand are adapted Now what if the BS has partial instantaneous CSIT as
throughout the transmission depending on the available esti- described in Section II-A? The channel state is characterized
mate of the instantaneous channel state [6], [25]. As far as the by H b from the BS’s perspective, and it is natural to define

codewords (symbol streams) are concerned, adaptive precoders a precoding scheme as Pp (Pt , H) b b . A naive approach
Pt ,H
are observed as part of the fading channel. It follows that would be to design each precoding matrix by solving problem
sending the common message and the kth private message at (7) assuming that H b is perfect. The resulting design is not
the Ergodic Rates (ERs) given by EH {Rc,k } and EH {Rk } only unable to cope with the MU interference, it is also
respectively guarantees successful decoding by the kth user. unaware of it. In addition, this may lead to an overestimation
To guarantee that Wc is successfully decoded (and hence of the instantaneous and ergodic rates by the BS, yielding
cancelled) by all users, transmission at undecodable rates. A robust approach on
 it shouldKbe transmitted at an ER
not exceeding minj EH {Rc,j } j=1 . Finally, the total ER the other hand employs the available CSIT knowledge to:
achieved by the kth user writes as EH {Rk } for k 6= kRS , 1) design an informed precoding scheme that enhances the
 K instantaneous channel condition experienced by the receivers
and EH {Rk } + minj EH {Rc,j } j=1 for k = kRS .
in each channel state, 2) perform transmission at reliable
III. M OTIVATION , P ROBLEM F ORMULATION AND DOF rates and hence guarantee decodability. In the following, we
A NALYSIS describe how this can be achieved.
While the BS is unable to predict the instantaneous rates,
In this section, we formulate the precoder design prob-
it has access to the the Average Rates (ARs) defined as:
lem for ESR maximization in the RS transmission scheme b
R̄k , EH|H b {Rk | H} for all k ∈ K, determined by the
described in the previous section. In doing so, we propose
imperfect channel state. The AR and the ER should not be
the ASR optimization framework which exploits partial in-
confused: while the latter describes the long-term performance
stantaneous CSIT to achieve a robust maximized ergodic
over all channel states, the former is a short-term (instanta-
performance. Moreover, we derive the DoF performances of
neous) measure that captures the expected performance over
the ASR-optimized NoRS and RS designs. To gain some
the CSIT error distribution for a given channel state estimate.
insight, we start by looking at the conventional (NoRS) ESR
It turns out that the ERs can be characterized by averaging
maximization problem in the presence of perfect instantaneous b In particular, for a given
the ARs over the variation in H.
CSIT. This is formulated as
(K ) fading process and adaptive
 precoding strategy, it is evident

X b and R̄k H
that Pp (H) b depend on H, b while Rk H, H b is
 max  EH Rk . (6) 
determined by H, H b , as Pp (H)b is selected by the BS based
EH tr Pp PH
p ≤Pt k=1
on the estimate. From the law of total expectation, the ER
The availability of perfect CSIT enables the BS to adapt experienced by the user is expressed as
the precoding matrix Pp according to H such that the in- n o n   o
stantaneous SR (inside the expectation) is maximized. The E{H,H}
b Rk H, H b = EHb EH|H
b
b Rk H, H | H
b
ergodic nature of the transmission allows the long-term power n o
constraint in (6), yielding a form of inter-state (temporal = EHb R̄k H
b (8)
or frequency) power allocation. A feasible solution for such
problem consists of a set of precoding matrices, each for a fromn which theoESR for a given precoding strategy writes as
PK
channel state. Moreover, we define a precoding scheme as EHb k=1 R̄k . Following the same logic in (6) and (7),
a family
 of feasible solutions for all possible power levels, maximizing the ESR under partial CSIT and a short-term
i.e. Pp (Pt , H) Pt ,H . It is common to replace the long- power constraint is achieved by optimizing Pp H b such that
PK b For a class
term power constraint in (6) with a short-term one given by the ASR, i.e. k=1 R̄k , is maximized for each H.
tr Pp PHp ≤ Pt [6], [33], which corresponds to a precoder of information theoretic channels under imperfect CSIT with a
peak power constraint in practice. While this may result in Markov property, it has been shown that the ergodic capacity is
a smaller ESR, it has a significant impact on the tractability found in a similar manner, while replacing the ASR with the
of the problem. In particular, the inter-state power allocation average mutual-information, and maximizing over the input
5

distribution rather than the precoding matrix [25]. This gives B. DoF Performance
rise to the ASR optimization problem formulated as The DoFs for the NoRS and RS strategies are defined as
 PK b {R(Pt )}
EH b {RRS (Pt )}
EH
max R̄k lim and lim . (12)
R(Pt ) : Pp k=1
(9) Pt →∞ log2 (Pt ) Pt →∞ log2 (Pt )
s.t. 
tr Pp PH
p ≤ Pt . The DoF can be roughly interpreted as the total number of
interference-free streams that can be simultaneously supported
It follows that the maximized ESR given by EH b {R(Pt )} is
in a single channel use. This follows by noting that the rate
achievable. The fixed-coding/adaptive-precoding strategy de- of a single interference-free data stream scales as log2 (Pt ) +
scribed in Section II-C can be employed, where the knowledge O(1). The significance of the DoF in such setups comes from
of the long-term properties of CSIT is leveraged to predict the the detrimental effects MU interference may have, and the
ERs and adjust the code rates, while precoders are adapted role of CSIT in dealing with such effect. For the proof of
each time a new channel estimate is revealed. It is evident our next result, we assume isotropically distributed CSIT error
σ2
that for perfect CSIT, (9) reduces to the SR problem in (7). A vectors, i.e. Re,k = Ne,kt I for all k ∈ K. Moreover, we assume
discussion on the influence of CSIT uncertainty on the ESR that Hb is of full column rank with probability one. Note that
performance follows in the next subsection. It is worth noting these assumptions are not necessary for the optimization in
that the considered CSIT model reduces to the one in [19], the following sections.
[20] when H b is fixed over all channel states, and the fast-
fading model in [24] when H b is completely suppressed. Both Theorem 1. The DoF of an optimum NoRS scheme is
lead to special cases of the problem formulation, where the b {R(Pt )}
EH
ESR problem and the ASR problem coincide, and a precoding lim = max {1, Kα} (13)
Pt →∞ log2 (Pt )
scheme is given by a single precoding matrix for each Pt .
while the DoF of an optimum RS scheme is given by
 Now we turn to designing a precoding scheme
b
P(Pt , H) b that maximizes the ESR performance b {RRS (Pt )}
EH
Pt ,H lim = 1 + (K − 1)α. (14)
of the RS strategy by formulating the corresponding Pt →∞ log2 (Pt )
ASR problem. The the kth common AR is defined as The proof is relegated to Appendix A. It is evident that
R̄c,k , EH|H b
b Rc,k | H , from which the common AR when α = 1, the maximum attainable DoF is achieved, i.e.
is given by R̄c , minj {R̄c,j }K j=1 . The ASR is expressed K. Although CSIT may still be erroneous, it is good enough
PK
as R̄c + k=1 R̄k , and the ASR maximization problem is to design private precoders that reduce the MU interference to
formulated as the level of additive noise. However, this is not possible when
 α drops below 1, as private streams start leaking substantial
PK

 max R̄c + k=1 R̄k interference. The NoRS scheme exhibits loss in DoF until
R̄c ,P
α = 1/K. As α drops below 1/K, the CSIT quality is not
RRS (Pt ) : s.t. R̄c,k ≥ R̄c , ∀k ∈ K (10)

  good enough to support multi-stream transmission, and it is
 H
tr PP ≤ Pt preferred from a DoF perspective to switch to SU transmission
by allocating all power to one stream, hence achieving a DoF
where the inequality constraints involving R̄c are equivalent of 1. In naive designs with fixed and uniform power allocation
to the pointwise minimization in the definition of R̄c . Next, across users [5], [6], the DoF reduces to zero for α = 0, and
the gain from switching to SU transmission is not realized.
we show that the ESR given by EH b {RRS (Pt )} is achievable.
The reliability of the private ERs follows from (8). As for the Now looking at RS, as soon as α drops below 1, the scheme
common ER, we write carefully allocates powers to the private streams such that MU
interference is always received at the level of additive noise.
n  o n  o n o This is achieved by scaling down private powers to O(P α ),
t
min E{H,H} b Rc,j = min EH b R̄c,j ≥ EHb min R̄c,j hence maintaining a DoF of Kα. The rest of the power, which
j∈K j∈K j∈K
(11) scales as O(Pt ), is allocated to the common stream which is
where the equality follows from the law of total expectation broadcasted to all users. This achieves a DoF gain of 1 − α,
as in (8), and the inequality follows from the fact that moving as interference from private messages is treated as noise. It
the minimization inside the expectation does not increase the should be noted that the RS DoF is strictly greater than the
value. Leaving the minimization outside the expectation cou- NoRS DoF for all α ∈ (0, 1).
ples the common ARs, and hence P H b , across the different The RS DoF in (14) is inline with the results in [7],
channel states, leading to an intractable formulation. After [8]. Although optimizing the precoders does not improve the
formulating the NoRS and RS instantaneous ASR problems, achievable DoF in these works, simulation results in Section
the next two questions that come to mind are: 1) how does VI show that optimized RS schemes are superior from a rate
the instantaneous CSIT quality influence the long-term per- performance perspective. It is worth highlighting that the con-
formances given by EH b {R(Pt )} and EH b {RRS (Pt )}? and 2) verse result recently reported in [37] proves the optimality of
How do the NoRS and RS performances compare? To answer the DoF in (14) for the Gaussian MISO-BC when one user has
these question, we resort to DoF analysis. perfect CSIT. Since improving the CSIT quality of one user
6

cannot decrease the DoF, the optimality of the result in (14) precoders. (16) suggests that the optimum solution of problem
naturally follows. We conclude this section by highlighting (15) converges to its counterpart of problem (10) as M → ∞.
that further to the high SNR analysis, RRS (Pt ) ≥ R(Pt ) is In the following, we show that this is indeed the case.
guaranteed over the entire range of SNRs. This is seen by First, we note that P is compact and the rate functions
noting that solving (9) is equivalent to solving (10) over a are bounded and continuously differentiable in P. It follows
subset of its domain characterized by restricting kpc k2 to zero. that the convergence in (16) is uniform in P ∈ P. The ARs
In the following, we focus on solving the RS design problem, are also continuouslydifferentiable with gradients given by
achieved by solving (10). ∇P R̄c,k (P) = EH|H b ∇P Rc,k (P) | H
b and ∇P R̄k (P) =

EH|H b
b ∇P Rk (P) | H , which follows from the bounded
IV. S AMPLE AVERAGE A PPROXIMATED WMMSE convergence theorem [38]. On the other hand, the objective
A LGORITHM functions of (10) and (15) can be equivalently reformulated
The RS ASR problem in (10) is of a stochastic nature and by incorporating the common rates as
appears to be challenging to solve in its current form. In fact, k
X
 K
deterministic subproblems of (10) are known to be non-convex R̄s (P) , min R̄c,j (P) + R̄k (P) (17a)
j j=1
and non-trivial [14], [35]. We propose a two-step approach to k=1
obtain a tractable form of (10). In the first step, a deterministic  (M) Xk
K (M)
approximation is obtained using the SAA method [26]. In the R̄s(M) (P) , min R̄c,j (P) j=1
+ R̄k (P) (17b)
j
second step, the deterministic ASR problem is transformed k=1
into an equivalent, and solvable, augmented Average Weighted (M)
Sum MSE (AWSMSE) problem. Before we proceed, it should where R̄s (P) and R̄s (P) are the equivalent ASRs. Hence,
be highlighted that in the following analysis, the dependencies from (16), (17) and the continuity of the pointwise minimiza-
of some functions on certain relevant variables is occasionally tion, we have
emphasized in the notation. lim R̄s(M) (P) = R̄s (P), a.s. ∀P ∈ P. (18)
M→∞

A. Sample Average Approximation From the continuity of the ARs, we observe that R̄s (P) is
b and index set M , {1, . . . , M }, let
For a given estimate H continuous in P although not necessarily differentiable at all
 points due to the embedded pointwise minimization. Hence,
H(M) , H(m) = H b +H e (m) | H,
b m∈M it can be shown that the convergence in (18) is also uniform.
Combining these observations with [26, Theorem 5.3], it is
be a sample of M i.i.d realizations drawn from
 the conditional
b concluded that the set of global optimum solutions of the SAA
distribution with density fH|H b H | H . This sample is
problem in (15) converges to that of the stochastic problem
used to approximate the ARs through their Sample Average
(M) 1 PM (m) in (10) a.s. as M → ∞. Extending this result to the set of
Functions (SAFs) defined as: R̄c,k , M m=1 Rc,k and
(M) P M (m) (m)  points that satisfy the first-order optimality conditions (i.e.
1 (m)
R̄k , M m=1 Rk , where Rc,k , Rc,k H and KKT points) can be found in [39, Section 4.3]. Now, we turn
(m) (m)

Rk , Rk (H are rates associated with the mth realiza- to solving the sampled problem in (15).
tion. This leads to the SAA of problem (10) formulated as
 PK (M)

 max R̄c + k=1 R̄k B. Augmented AWSMSE Minimization
R̄c ,P
(M) Although (15) is deterministic, it is non-convex and very
RRS (Pt ) : s.t. (M)
R̄c,k ≥ R̄c , ∀k ∈ K (15)

  challenging to solve. Therefore, the approach proposed in [14]
 H
tr PP ≤ Pt . is employed to reformulate (15) into an equivalent augmented
It should be noted that P is fixed over the M realizations, WMSE form. Augmented WMSE problems differ from their
which follows from the definition of the ARs. Before we conventional WMSE counterparts (e.g. [16]) in two ways: 1)
proceed, we make the following finite SNR/SINR assumption. the weights are considered as optimization variables, 2) the
cost function is extended to incorporate the logarithms of the
Assumption 1. In the following we assume that SNR < ∞, weights. A Rate-WMMSE relationship is built upon the two
i.e. σn2 > 0 and Pt < ∞. Moreover, we assume that channel aforementioned features.
realizations are bounded. Hence, γc,k , γk < ∞ for all k ∈ K. Let sbc,k = gc,k yk be kth user’s estimate of sc , where gc,k
It follows that all rates are bounded for any Pt and all is a scalar equalizer. After successfully removing the common
channel realizations. Therefore, from the strong Law of Large stream, the estimate of sk is obtained as sbk = gk (yk −
Numbers (LLN), we have hHk pc sc,k ), where gk is the corresponding equalizer. At the
(M)
output of the kth receiver, the common and private MSEs are
lim R̄c,k (P) = R̄c,k (P), a.s. ∀P ∈ P (16a) defined as εc,k , E{|b sc,k − sc |2 } and εk , E{|b
s k − s k |2 }
M→∞
(M) respectively, which write as:
lim R̄k (P) = R̄k (P), a.s. ∀P ∈ P (16b)
M→∞ 
εc,k = |gc,k |2 Tc,k − 2ℜ gc,k hHk pc + 1 (19a)
where the dependency
 on P is highlighted,
 a.s. denotes almost 2
 H
surely, and P , P | tr PPH ≤ Pt is the feasible set of εk = |gk | Tk − 2ℜ gk hk pk + 1 (19b)
7

where Tc,k and Tk are defined in (3). Optimum Minimum To formulate a deterministic equivalent of (25), the Average
MSE (MMSE) equalizers are given as WMSEs (AWMSEs) are approximated by their SAFs. The
MMSE −1 −1
sampling extends to the equalizers and weights such that
gc,k = pH
c hk Tc,k and gkMMSE = pH
k hk T k (20)
M M
¯(M) 1 X (m) (M) 1 X (m)
obtained by solving
∂εc,k
= 0 and ∂εk
= 0. Substituting (20) ξc,k , ξ and ξ̄k , ξ (26)
∂gc,k ∂gk M m=1 c,k M m=1 k
into (19), the MMSEs write as
(m) (m) (m) (m)  (m)
−1 where ξc,k , ξc,k hk , gc,k , uc,k and ξk ,
εMMSE
c,k , min εc,k = Tc,k Ic,k (21a) (m) (m) (m)  (m) (m)  (m)
gc,k
ξk hk , gk , uk , gc,k , gc,k hk and gk ,
εMMSE , min εk = Tk−1 Ik (21b) (m)  (m) (m)  (m) (m) 
k gk gk hk , and uc,k , uc,k hk and uk , uk hk ,
 are all associated with the mth realization in H(M) . For
from which the SINRsare rewritten as γc,k = 1/εMMSE −1 compactness, we define the set of sampled (m)equalizers as:
c,k
and γk = 1/εMMSE − 1, and the rates write as R = 
k c,k G , gc,k , gk | k ∈ K , where gc,k , gc,k | m ∈ M
− log2 (εMMSE
c,k ) and Rk = − log 2 (ε MMSE
k ). Now, the aug-  (m)
and gk , gk | m ∈ M . In a similar manner, we define:
mented WMSEs are given by   (m)
U , uc,k , uk | k ∈ K , where uc,k , uc,k | m ∈ M
ξc,k = uc,k εc,k −log2 (uc,k ) and ξk = uk εk −log2 (uk ) (22)  (m)
and uk , uk | m ∈ M . The same approach used to prove
where uc,k , uk > 0 are weights associated with the kth user’s (23) is employed to demonstrate the following relationship
MSEs. In the following, ξc,k and ξk are referred to as the MMSE(M)
ξ̄c,k , min ξ¯
(M) (M)
= 1 − R̄c,k (27a)
WMSEs, where ”augmented” is dropped for brevity. By taking uc,k ,gc,k c,k
MMSE(M) (M) (M)
the equalizers and weights as optimization variables, the Rate- ξ¯k , min ξ¯k = 1 − R̄k (27b)
uk ,gk
WMMSE relationship is established as
MMSE
where optimality conditions are checked separately for each
ξc,k , min ξc,k = 1 − Rc,k (23a) conditional realization. The sets of optimum MMSE equalizers
uc,k ,gc,k
MMSE
 MMSE(m)
ξkMMSE , min ξk = 1 − Rk (23b) associated with (27) are defined as gc,k , gc,k |
 MMSE(m)
uk ,gk
m ∈ M and gkMMSE , gk | m ∈ M . In
which is obtained as follows. From ∂gc,k
∂ξ ∂ξk
= 0 and ∂g = 0, the same manner, thesets of optimum MMSE weights are
MMSE(m)

c,k
MMSE
k
defined as uMMSE , uc,k | m ∈ M and uMMSE ,
the optimum equalizers are given as gc,k = gc,k and gk∗ =  MMSE(m) c,k k
MMSE
gk . Substituting this back into (22) yields uk | m ∈ M . For the K users, the MMSE solution
 is composed as GMMSE , gc,k MMSE
, gkMMSE | k ∈ K and
MMSE 
ξc,k gc,k = uc,k εMMSE
c,k − log2 (uc,k ) (24a) UMMSE , uMMSE , uMMSE | k ∈ K . Motivated by the
MMSE
 MMSE
c,k k
ξk gk = u k εk − log2 (uk ). (24b) relationship in (27), the deterministic augmented AWSMSE
minimization problem is formulated as
∂ξc,k (gMMSE ) ∂ξ (gMMSE ) 
Furthermore, from c,k
= 0 and k ∂uk k = 0, PK (M)
∂uc,k 
 min ξ̄ + k=1 ξ¯k
we obtain the optimum MMSE weights given by u∗c,k = ξ̄c ,P,U,G c
−1 −1 (M)
ARS (Pt ) : (M) (28)
uMMSE
c,k , εMMSE
c,k and u∗k = uMMSE , εMMSE ,  s.t. ξ̄c,k ≤ ξ¯c , ∀k ∈ K
−1 k k 
 H

where a scaling factor of ln(2) has been omitted as it tr PP ≤ Pt
has no effect on the solution. Substituting this back into (24)
where ξ̄c represents the common AWMSE.
yields the relationship in (23).
By taking the expectation over the conditional distribution
of H given H, b an average version of the Rate-WMMSE C. Equivalence
relationship, denoted by AR-AWMMSE, is expressed as The fact that the AWMSEs are decoupled in their corre-
  sponding equalizers and weights suggests that optimizing (28)
ξ¯c,k
MMSE
, EH|Hb min ξc,k | b
H = 1 − R̄c,k (25a) w.r.t (U, G) is achieved by minimizing each of the AWMSEs
uc,k ,gc,k
  individually as shown in (27). This can be confirmed  by
ξ¯kMMSE , EH|Hb min ξk | b
H = 1 − R̄k (25b) showing that the MMSE solution UMMSE , GMMSE satisfies
uk ,gk
the KKT optimality conditions of (28) for a given P. As
where the expectations are taken outside the minimizations a result, it is easily shown that under the MMSE solution,
to account for the dependencies of the optimum equalizers (28) boils down to (15) with affine transformations applied to
(M)
and weights on H. The rational behind employing the Rate- the objective function and the common rate, i.e. ARS (Pt ) =
(M)
WMMSE relationship is explained as follows. The incorpo- K + 1 − RRS (Pt ) and ξ¯c = 1 − R̄c . This Rate-WMMSE
ration of the equalizers and weights unveils a block-wise equivalence is not restricted to the global optimum solutions,
convexity property, which can be checked by noting that the and in fact can be extended to the whole set of stationary
WMSEs are convex in each of their corresponding variables points. In particular, for any point ξ̄c∗ , P∗ , U∗ , G∗ satisfying
when all other variables are fixed. This will prove crucial for the KKT optimality  conditions of (28), the solution given by
solving the problem. R̄c∗ = 1 − ξ¯c∗ , P∗ satisfies the KKT optimality conditions
8

of (15). This can be demonstrated by following the steps in Algorithm 1 Alternating Optimization
the proof of [40, Proposition 1], where the max-min fairness [n]
1: Initialize: n ← 0, ARS ← 0, P
problem in a MIMO Interfering BC is considered. Despite 2: repeat
the different setup, the absence of RS and the assumption of 3: n ← n + 1, P[n−1] ← P
perfect CSIT in [40], the proofs are directly extendable to the 4: G ← GMMSE (P[n−1] ), U ← UMMSE (P[n−1] )
problem considered here due to a key similarity: both objective 5: update Ψ̄c,k , Ψ̄k , F̄c,k , F̄k , t̄c,k , t̄k , ūc,k , ūk , ῡc,k , ῡk
functions are non-smooth and can be expressed as a max-min for all k ∈ K
of sum-rate expressions. At this point, it is concluded that [n]
6: P ← arg ARS
solving (28) yields a solution for (15), which in turn, converges [n] [n−1]
7: until ARS − ARS < ǫR
to a solution for the ASR problem in (10) a.s. as M → ∞.

D. Alternating Optimization Algorithm


weights from the previous step, which are embedded in the
Although problem (28) is non-convex in the joint set of SAFs listed above. Problem (29) is a convex Quadratically
optimization variables, it is convex in each of the blocks Constrained Quadratic Program (QCQP) which can be solved
P, U and G, while fixing the other two. Moreover, (G, U) using interior-point methods [41].
assume the closed-form MMSE solution for a given P. These 3) Algorithm and Convergence: The two steps are repeated
properties are exploited using an AO algorithm. Each iteration in an alternating manner until convergence as summarized in
of the proposed algorithm consists of two steps: 1) updating Algorithm 1, where ǫR is some arbitrarily small constant4 .
(G, U) for a given P, 2) updating P (and ξ̄c ) for given
(G, U). Next, each of the steps is described in detail. Proposition 1. For a given H(M) , the iterates generated by
1) Updating the Equalizers and Weights: In nth iteration of Algorithm 1 converge to the set of KKT solutions of the
the AO algorithm, the equalizers and weights are updated corresponding sampled ASR problem in (15). As M → ∞,
 such
that (G, U) = GMMSE (P[n−1] ), UMMSE (P[n−1] ) , where the iterates converge, a.s., to the set of KKT solutions of the
the dependency of the MMSE solution on a given precoder is ASR problem in (10).
highlighted in the notation, and P[n−1] is the precoding matrix  [n] ∞
Proof: The sequence ARS n=1 is monotonically de-
obtained in the n − 1th iteration. To facilitate the formulation creasing, as each optimization step decreases the objective
of the precoder optimization problem in the following step, (M)
function. Since RRS (Pt ) is bounded above for a given power
we introduce the SAFs listed as: t̄c,k , t̄k , Ψ̄c,k , Ψ̄k , f̄c,k , f̄k , (M)
constraint, it follows that ARS (Pt ) is bounded below, and
ūc,k , ūk , ῡc,k and ῡk , which are obtained using the updated [n]
(G, U). In particular, ūc,k and ūk are calculated by taking the hence ARS is guaranteed to converge. Next, we show that the

(m) (m) corresponding sequence P[n] n=1 converges to the set of
ensemble averages over the M realizations of uc,k and uk ,
KKT points of problem (15).
respectively. The rest are calculated in a similar manner by
It can be seen from the relationship in (27) that for the
averaging over their corresponding realizations given by: [n]
nth iteration, problem ARS (Pt ) in (29) is in fact a convex
2 2 (M)
(m) (m)
tc,k = uc,k gc,k
(m)
and
(m)
tk = uk
(m) (m)
gk approximation of the sampled ASR problem RRS (Pt ) in (15)
[n−1]
around the point P , obtained from the previous iteration.
(m) (m) (m) (m) H (m) (m) (m) (m) H
Ψc,k = tc,k hk hk and Ψk = tk hk hk Hence, the AO procedure in Algorithm 1 in an instance
(m) (m) (m) (m) H (m) (m) (m) (m) H
of the Successive Convex Approximation (SCA) method in
fc,k = uc,k hk gc,k and fk = u k hk g k [42, Section 2.1]. Combining this with the conditions in [42,
     ∞
(m) (m)
υc,k = log2 uc,k and υk
(m) (m)
= log2 uk . Assumption 1] and the fact that the iterates P[n] n=1 lie in
the compact feasible set P, the convergence to the set of KKT
2) Updating the Precoders: Following the previous step, points of problem (15) follows from [42, Theorem 1] and [43,
the problem of updating P is formulated by plugging Corollary 1]. The a.s. convergence to the set of KKT points
GMMSE (P[n−1] ) and UMMSE (P[n−1] ) into (28). This yields of problem (10) as M → ∞ follows from Section IV-A.
[n]
It should be highlighted that due to the non-convexity of
ARS (Pt ) : the problem, the global optimality of the solution cannot
 P
 min ¯c + PK
ξ
K H 2 be guaranteed in general, and different initializations of P

 k=1 i=1 pi Ψ̄k pi + σn t̄k

 ξ̄c ,P
  may lead to different limit points. This is examined through


 −2ℜ f̄kH pk + ūk − ῡk simulations in Section VI-A where it is shown that appropriate
PK H 2 (29) initialization yields good convergence and rate performances.

 s.t. pHc Ψ̄c,k pc + i=1 pi Ψ̄c,k pi + σn t̄c,k

  Moreover, the influence of varying the finite sample size M



H
−2ℜ f̄c,k pc + ūc,k − ῡc,k ≤ ξ¯c , ∀k ∈ K on the performance is investigated in Section VI-B3.
 P
kpc k2 + K 2
k=1 kpk k ≤ Pt .
4 Contrary to the algorithm in [24], Algorithm 1 applies the entire Monte-
The expressions of the AWMSEs in (29) are obtained as Carlo sample in each iteration. This comes at the cost of updating more
follows. The MSEs in (19) are substituted into (22), followed equalizers and wights, yet liberates the convergence (and hence the number
iterates in which P is updated) from the Monte-Carlo sample size. This is
by substituting the result into the M realizations in (26). This more suitable here as equalizers and wights are scalars obtained in closed-
is carried out while considering the updated equalizers and form, while updating P is the most expensive step in each iteration.
9

 
V. C ONSERVATIVELY A PPROXIMATED WMMSE where ub , u bk | k ∈ K and g
bc,k , u b , gbc,k , b gk |
A LGORITHM k ∈ K . Using the same argument in Section IV-B, it
In this section, we consider an alternative deterministic can be shown that the aforementioned MMSE solution is
approximation of the AR-AWMMSE relationship in (25). This optimal for (34), and the cost function at any optimum point
approximation is based on relaxing the dependencies of the is an affine transformation of the conservative ASR given
P
by minj bc,j K + K R
R b
equalizers and weights in (25) on the channel state H. As a j=1 k=1 k . Problem (34) is solved
result, the minimizations are taken outside the expectations, by slightly modifying Algorithm 1. Specifically, (G, U) are
and the new AWMSEs assume closed-form expressions [19], reduced to (b b ), and step 5 is eliminated as (30) is directly
g, u
[20]. This eliminates the SAA and the need for a possibly employed to formulate the QCQP.
large Monte-Carlo sample of conditional channel realizations.
Moreover, the sets of equalizers and weights to be updated in B. Conservative Performance Limitations
the AO procedure reduce to two pairs for each user. However, It can be seen that (34) is a restricted version of the
this simplification comes at the expense of the achievable AWSMSE problem in (28), obtained by restricting the domain
performance, as the approximation is in fact conservative. of (28) such that the sets of equalizers and weights remain
unchanged across the M realizations, and driving M → ∞.
A. Conservative Approximation By definition, solving (34) is equivalent to maximizing a

The the AWMSEs are approximated by ξbc,k , EH|H b ξc,k | lower-bound on the ASR. Hence, the conservative ESR given
 b
b and ξbk , E b ξk | H
H b , which are expressed as by K + 1 − EH b ARS (Pt ) is achievable. As the equalizers
H|H
   are updated using partial CSIT, the availability of highly
ξbc,k = u
bc,k |b b H pc + 1 − log (b
gc,k |2 T̄c,k − 2ℜ gbc,k h k 2 uc,k ) accurate CSIR is completely ignored by the BS in the design
(30a) (also called an ignorant approach [44]), which may lead to
   a loss in performance compared to the SAA method. This
ξbk = u
bk |bgk |2 T̄k − 2ℜ b b H pk + 1 − log (b
gk h k 2 uk ) (30b) leaves us wondering about the effectiveness of the conservative
approximation, i.e. the looseness of the lower-bound. A similar
where (b
gc,k , gbk ) and (b
uc,k , u
bk ) are the relaxed equalizers and
ignorant approach was used for worst-case robust designs
weights respectively, and T̄c,k and T̄k are given by
 under bounded errors, where it was observed that the rate
T̄c,k = pH b bH
c hk hk + Re,k pc + T̄k (31a) lower-bound fails to achieve the anticipated DoF [45]. This
K
X was analysed in [29], where it was shown that the loss in

T̄k = pH b bH 2 performance is due to self-interference terms induced by the
i hk hk + Re,k pi + σn . (31b)
i=1 approximation. The same argument applies here, where it can
The optimum relaxed equalizers are derived from (30) as: be seen that the equivalent
 MMSE conservative SINRs, defined
 MMSE as
MMSE
gbc,k = pH b −1 bkMMSE = pH b −1 bc,k = 1 − εbMMSE
γ /b
ε and γ
b = 1 − ε
b MMSE
/bε ,
c hk T̄c,k and g k hk T̄k . Plugging c,k c,k k k k
them back into (30), the have the self-interference terms, given by pH R p and
 optimumMMSE relaxed weights are
 obtained c e,k c
as ubMMSE = ε
b MMSE −1
and u
b = ε
b MMSE −1
, where pHk Re,k pk respectively, within their denominators. Such terms
c,k c,k k k
MMSE −1 b H 2 MMSE −1 b H may scale significantly with increased power, depending on
εbc,k , 1 − T̄c,k |hk pc | and εbk , 1 − T̄k |hk pk |2 . It
the CSIT quality, hence limiting the rate performance as we
is clear that the relaxed equalizers and weights are functions of
b Incorporating this approximation into the AR-AWMMSE observe in Section VI-B4.
H.
It is worth noting that while relaxing the dependencies of
relationship in (25), the expectations are taken inside the
equalizers on H is carried out by the BS to simplify the design
minimizations such that
 algorithm, is not necessarily followed by receivers which are
ξbc,k
MMSE
, min EH|H b
b ξc,k | H = 1 − Rc,k
b (32a) still expected to apply perfect CSIR to update their equalizers.
bc,k ,b
u gc,k
 However, predicting the possibly higher ERs (achievable by
ξbkMMSE , min EH|H b
b ξk | H = 1 − Rk
b (32b) the users) at the BS, and hence adjusting the transmission
bk ,b
u gk
rates, requires the evaluation of the expressions in (8) and
where Rbc,k , − log2 (b
εMMSE bk , − log2 (b
) and R εMMSE ). The (11), which in turn requires numerical calculations that bring
c,k k
approximations in (32) are upper-bounds for the expressions us back to Monte-Carlo samplings. Hence for this approach,
in (25), i.e. ξbc,k
MMSE
≥ ξ¯c,k
MMSE
and ξbkMMSE ≥ ξ¯kMMSE . This we assume that the BS transmit at the conservative ERs.
follows from the fact that moving the expectation inside the
minimization does not decrease the value. Hence, we have VI. N UMERICAL R ESULTS AND A NALYSIS
bc,k ≤ R̄c,k
R bk ≤ R̄k .
and R (33) In this section, the proposed algorithms are evaluated
through simulations. The channel H has i.i.d. complex Gaus-
By employing (30), a conservative version of the AWSMSE sian entries with unit variance, i.e. CN (0, 1). The noise
problem is formulated as variance is fixed as σn2 = 1, from which the long-term SNR
 PK
 min ξbc + k=1 ξbk e are also i.i.d complex Gaussian
is given as Pt . Entries of H

ξbc ,P,bu,bg
drawn from CN 0, σe , where σe2 = Nt−1 σe,k
2 2
, ∀k ∈ K.
AbRS (Pt ) : s.t. ξbc,k ≤ ξbc , ∀k ∈ K (34) 2 −α

  The error variance is given as σe = βPt , where β ≥ 0 and
 H
tr PP ≤ Pt α ∈ [0, 1] are varied to represent different CSIT accuracies and
10

18
18 NoRS-Opt
RS-Opt
16 NoRS-ZF
16 RS-ZF-SVD
14
14 SNR 35 dB
12
ASR (bps/Hz)

ESR (bps/Hz)
12 SNR 20 dB
10
10
8 8
ZF-SVD
ZF-e 6
6 MRC-SVD
SNR 5 dB MRC-e
4 4

2 2

0 0
2 4 6 8 10 12 14 16 5 10 15 20 25 30 35
AO iteration SNR (dB)

Fig. 1. ASR convergence of Algorithm 1 using four different initializations Fig. 2. Comparison between the ESR performances of NoRS and RS
of P for 1 randomly generated channel estimate, K, Nt = 2, σe2 = Pt−0.6 , transmission schemes for K, Nt = 2 and σe2 = Pt−0.6 .
and SNR = 5, 20 and 35 dB.

20
b = H− H e NoRS-Opt
SNR scalings. It follows that the channel estimate H 18 RS-Opt
NoRS-ZF
is also Gaussian. The channel realization H should not be 16
RS-ZF-SVD
confused with a conditional realization H(m) . While the
14
former represents the actual channel experienced by the users
ESR (bps/Hz)
and unknown to the BS, the latter is part of a sample H(M) 12
available at the BS and used to calculate the SAFs. The size 10
of the sample is set to M = 1000 throughout the simulations, 8
unless otherwise stated. For a given estimate H, b the mth
6
conditional realization is obtained as H (m)
= H b +H e (m) ,
e (m) 4
where H is drawn from the error distribution. More on
generating the channel realizations in the simulations is given 2

in Appendix B. Convex optimization problems are solved 0


5 10 15 20 25 30 35
using the CVX toolbox [46]. SNR (dB)

Fig. 3. Comparison between the ESR performances of NoRS and RS


A. Convergence transmission schemes for K, Nt = 2 and σe2 = Pt−0.9 .
First, we examine the convergence of Algorithm 1 under
four different initializations of P. The first initialization,
denoted by ZF-e, is taken as the DoF-motivated design in [7], wide range of SNRs.
√ √
[8]. To recall, we have pc = qc p b c and pk = qk p b k , where
α α
qc = Pt −Pt and qk = Pt /K, p b c = e1 and [b b K ] are
p1 , . . . , p
b The second B. Ergodic Sum Rate Performance
normalized ZF-BF vectors constructed using H.
initialization, labeled as ZF-SVD, is a modification of ZF-e, Here we evaluate the achievable ESRs obtained by averag-
where p b c is slightly optimized by choosing it as the dominant ing the ASRs over 100 channel realizations. For each channel
left singular vector of H.b The third and fourth initializations, realization, Algorithm 1 is used to obtain the RS ASR, and a
labeled as MRC-e and MRC-SVD respectively, maintain the modification of the algorithm where the common precoder is
same common precoders as ZF-e and ZF-SVD respectively. switched off is used for NoRS.
However, Maximum Ratio Combining (MRC) is employed 1) RS vs. NoRS: The performance of the optimized RS
instead of ZF-BF, i.e. p bk = hb k \khb k k. The ASR convergence scheme (RS-Opt) is compared to the optimized NoRS scheme
of Algorithm 1 for σe2 = Pt−0.6 and SNRs 5, 20 and 35 dB (NoRS-Opt). ZF-BF with Water-Filling (WF), where power
is shown in Fig. 1. It is evident that the algorithm converges allocation is carried out assuming that the estimate H b is
to a limit point regardless of the initialization. However, the perfect, is considered as a baseline for NoRS transmission.
speed of convergence is influenced by the initial state, which This scheme is termed as NoRS-ZF. On the other hand, we
may also influence the limit point as the problem is non- consider a modified version of the ZF-SVD initialization in
convex. The initialization effect becomes more visible as SNR the previous subsection as a baseline for RS transmission. In
grows large. For example, initializing the common precoder particular, the power splitting between the common message
using SVD enhances the convergence at high SNRs. In the and the private messages is maintained, while WF is used to
following results, MRC-SVD is adopted as it provides good allocate the power among the private messages. This scheme
overall performance over various channel realizations and a is termed RS-ZF-SVD.
11

16 45
NoRS-Opt NoRS-Opt K =8
RS-Opt RS-Opt
14 NoRS-ZF 40 NoRS-Opt(α 2 )
RS-ZF-SVD
35
12 K =6
30
ESR (bps/Hz)

ESR (bps/Hz)
10
K =4
25
8
20
6
15
4
10
2 5
K =2
0 0
5 10 15 20 25 30 35 5 10 15 20 25 30 35
SNR (dB) SNR (dB)

Fig. 4. Comparison between the ESR performances of NoRS and RS Fig. 5. ESR performances of NoRS-Opt and RS-Opt with σe2 = Pt−α , and
transmission schemes for K, Nt = 2 and σe2 = 0.063. NoRS-Opt with σe2 = Pt−α2 , where α2 = 1+(K−1)α , for α = 0.6, and
K
K, Nt = 2, 4,6 and 8.

The 2-user ESRs obtained using the different optimization 20


schemes for σe2 = Pt−0.6 and Pt−0.9 are shown in Fig. M
M
=1
= 10
α = 0.9
18 M = 100
2 and Fig. 3, respectively. Comparing RS-Opt and NoRS- M = 1000 α = 0.6
Opt, the two schemes perform similarly at low SNRs. In 16
α = 0.3
particular, both reduce to SU transmission by switching off 14
α = 0.1
ESR (bps/Hz)
the weaker user. As SNR grows, MU transmission starts to 12
take over. However, reducing MU interference to the level 10
of noise (or eliminating it) is not possible due to imperfect
8
CSIT. Therefore, the overall performance does not necessarily
benefit from additional power inaccurately directed towards a 6

given user, as it may cause more damage by interfering with 4


other users. At this stage, RS-Opt starts to part from NoRS- 2
Opt by switching on the common message. The contribution 0
5 10 15 20 25 30 35
of the common message primarily manifests at high SNRs, SNR (dB)
where the gap between RS-Opt and NoRS-Opt exceeds 4
dB for α = 0.6. For α = 0.9, which is almost perfect Fig. 6. The effect of changing M on the ESR performance of the RS SAA
from a DoF perspective, RS is not as instrumental as it is algorithm for K, Nt = 2, σe2 = Pt−α , and α = 0.1, 0.3, 0.6 and 0.9.
for α = 0.6. However, rate gains can still be observed at
high SNRs. Regarding the baseline schemes, their inferiority
compared to optimized schemes is evident over the entire SNR NoRS-Opt and RS-Opt schemes achieve significantly better
range. In the low SNR regime, baselines fall behind due to performances compared to their corresponding baselines, over
the strict application of ZF-BF, which is not ideal in this case. the entire range of SNRs. Moreover, although RS-Opt is not
However, the RS-ZF-SVD scheme slightly compensates for expected to achieve DoF gains over NoRS-Opt as Pt → ∞,
this through the common message. However, it fails to match the former still manages to deliver superior rate performance
the optimized transmission in RS-Opt and NoRS-Opt. In the at medium and high SNRs.
high SNR regime, each baseline scheme achieves the same 2) Increased Number of Users: Theorem 1 implies that the
DoF (slope of the curve) as its optimized counterpart with a relative DoF gain achieved by RS over NoRS decreases as K
rate gap, which is particularly noticeable for α = 0.6. increases. Fig. 5 shows the ESRs achieved by NoRS-Opt and
The performance under CSIT errors that do not scale with RS-Opt for K = 2, 4, 6 and 8 users, assuming α = 0.6. Fig.
20 −0.6
SNR (α = 0) is shown in Fig. 4, where σe2 = 10 10 = 5 also shows the NoRS-Opt performance for a CSIT error
0.063, which corresponds to the CSIT quality obtained at 20 exponent given as: α2 = 1+(K−1)α K . This corresponds to a
dB SNR w.r.t Fig. 2. First, it can be seen that the rate of conventional system that achieves the same sum DoF as the
NoRS-ZF saturates. This is due to the naive employment of RS system, but at the cost of higher CSIT quality requirements
Hb to design the ZF-BF vectors and allocate power as if it since α2 > α, ∀α ∈ (0, 1). It is evident that the two perform
was a perfect estimate. At high SNRs, residual interference closely, which highlights another benefit of RS, i.e. relaxed
dominates and caps the performance. RS-ZF-SVD employs CSIT requirements compared to NoRS.
the DoF-motivated power splitting (i.e. qc = Pt and qk = 0) 3) SAA Sample Size: Next, we look at the influence of
and hence reduces to multicast transmission (sending only the changing the sample size M on the SAA algorithm. The 2-user
common message) maintaining a DoF of 1. On the other hand, ESRs achieved by solving Algorithm 1 for M = 1, 10, 100 and
12

20 12
SAA α = 0.9 NoRS
18 Conservative SNR 30 dB RS
α = 0.6 10
16
14
8
ESR (bps/Hz)

α = 0.3
12 SNR 20 dB

ER User-2
10 6
α = 0.1
8
6 4 SNR 10 dB
4
2
2
0
5 10 15 20 25 30 35 0
SNR (dB) 0 2 4 6 8 10 12
ER User-1
Fig. 7. ESR performances of the SAA and conservative approaches for
K, Nt = 2, σe2 = Pt−α , and α = 0.1, 0.3, 0.6 and 0.9. Fig. 8. 2-user ER regions for σe2 = Pt−0.6 , and SNR = 10, 20 and 30 dB.

1000 are given in Fig. 6. Different scaling CSIT qualities are message allocated to one user the entire time, e.g. kRS = 1. A
considered where σe2 = Pt−α , and α = 0.1, 0.3, 0.6 and 0.9. second region is obtained by allocating the common message
Using a sample of one realization significantly degrades the to the other user, and the full region is given by the convex-
performance, as the resulting design lacks statistical knowl- hull enclosing the two regions by time-sharing of the extremity
edge of the CSIT uncertainty. On the other hand, the SAA points. Alternatively, all boundary points for the RS region
algorithm performs very well with a sample size as small as can be obtained by assuming that R̄c is shared between users
M = 10 under the specified settings. Higher sample sizes PK that C̄k is the kth user’s portion of the common AR with
such
of M = 100 and 1000 perform almost identically. Note that k=1 C̄k = R̄c . The WASR problem is formulated as
the achievable ESRs in this part are obtained by numerically  PK
evaluating the right-most expressions in (8) and (11), as taking 
 max k=1 wk (R̄k + C̄k )

 P,{ C̄ } K
k k=1
the expectations of the sampled ASR objective functions for  PK
RWRS (Pt ) : s.t. R̄c,k ≥ j=1 C̄j , ∀k ∈ K (35)
low M s is not reflective of the achievable performance. 

 C̄k ≥ 0, ∀k ∈ K
4) Conservative Algorithm: In this part, we look at the 
 
performance of the conservative design in Section V. For the tr PPH ≤ Pt
Gaussian CSIT error described earlier in this section, we have
where the weights {wk | k ∈ K} are fixed parameters.
Re,k = σe2 I, ∀k ∈ K. The conservative ESR is obtained by
(35) is solved by formulating its equivalent AWSMSE prob-
averaging the conservative ASRs obtained by solving (34).
lem, and modifying Algorithm 1 accordingly. To obtain the
The ESRs predicted by the BS using the SAA approach and
two-user regions shown in Fig. 8, the corresponding prob-
the conservative approach are given in Fig. 7 for K = 2 and
lems are solved for 43 different pairs of weights. The first
σe2 = Pt−α , with α = 0.1, 0.3, 0.6 and 0.9. The SAA scheme
weight is fixed as w1 = 1, and the second weight changes
achieves significant gains over the conservative scheme, which
as w2 ∈ {10−3, 10−1 , 10−0.95 , . . . , 100.95 , 10, 103}. What is
increase with decreased CSIT qualities. This is explained by
meaningful for each pair is the ratio between the weights rather
the discussion in Section V-B. In particular, self-interference
than their absolute levels [14]. Each point on the ER region
terms become dominant and the ASR lower-bound becomes
is characterized
 by the tuple E H (R̄1 , R̄2 ) for NoRS, and
b
looser. This effect is extremely detrimental for low CSIT
EHb (R̄1 + C̄1 , R̄2 + C̄2 ) for RS, where each pair of ARs is
qualities, as the achievable ESRs are highly undermined by
obtained from solving the corresponding WASR problem5.
the BS.
The ER regions shown in Fig. 8 are obtained for SNRs:
10, 20 and 30 dB, and σe2 = Pt−0.6 . As expected, the gap
C. Ergodic Rate Region
between RS and NoRS grows with SNR. The performance
In the final part, we investigate the NoRS and RS achievable for 30 dB SNR is particularly interesting, as it is evident that
ER regions in a two-user scenario. ER regions are obtained RS enlarges the whole ER region significantly. For example,
by solving Weighted ESR (WESR) problems, where different while guaranteeing an ER of 10 bps/Hz for user-1, an ER
boundary points are realized by varying the weights. For of almost 4 bps/Hz can be achieved by user-2. On the other
NoRS, this is achieved by incorporating the weights into the hand, guaranteeing the same user-1 ER using NoRS restricts
ASR problem in (9). For a fixed pair of weights, the problem is the ER of user-2 to just over 1.5 bps/Hz. This observation
solved for several channel realizations. A boundary ER region is of special interest for designs exploiting different points of
point is obtained by averaging the resulting AR realizations the rate regions, e.g. max-min fairness and Quality of Service
for a given pair of weights. For the RS problem, applying the
weights directly to (10) results in a region with the common 5 More accurately, we take the convex hulls enclosing the resulting points.
13

(QoS) based designs as seen in [28], [29]. 1) Proof of (13): For an arbitrary precoding scheme, let us
first find an upper-bound for
VII. C ONCLUSION 
R̄k = EH|H b
b log2 (Tk ) − log2 (Ik ) | H (37)
The problem of ESR maximization in MU-MISO systems
with linear precoding and partial CSIT was addressed by by upper-bounding and lower-bounding the first and second
employing the RS transmission strategy, where part of the right-hand-side terms respectively. From Jensen’s inequality,
MU interference is broadcasted to (and decoded by) all users. b ≤ log (T̄k ), where T̄k is
we write EH|H b log2 (Tk ) | H 2
The precoding matrix is optimized based on the available defined in (31b). From the Cauchy-Schwarz inequality and
channel estimate to maximized a conditional ASR metric, the isotropic property of the CSIT errors, we have T̄k ≤
computed using partial CSIT knowledge. The stochastic ASR khb k k2 + σ 2 PK qi + 1, where σ 2 = N −1 σ 2 . Since the
e i=1 e t e,k
problem was transformed into a deterministic counterpart actual channel state H does not depend on the SNR, we have
using the SAA method. This was followed by applying the 
b k k2 = O(1). It follows that T̄k ≤ O P max{ak ,āk } ,
khk k2 , kh t
Rate-WMMSE transformation and an AO algorithm, which from which we write
converges to a stationary solution. An alternative deterministic 
approximation of the ASR problem was developed based on EH|H b
b log2 (Tk ) | H ≤ max{ak , āk } log2 (Pt ) + O(1). (38)
the conservative method in [19], [20]. While this approach
Fromthe isotropic property and [7, Lemma 1], we write
avoids the sampling process, its guaranteed achievable per- b ≥ log (2κ σ 2 λ + 1) + O(1), where
b log2 (Ik ) | H
EH|H
formance is shown to be inferior. The effectiveness of the n  Te 2 2 o e 1
|e1 hk | b is bounded, and λ1 is the
proposed algorithms and the benefits of adopting the RS κ , EH|H b log2 σe2 |H
P
strategy were demonstrated through simulations. In particular, dominant eigenvalue of i6=k pi pH i . Since the maximum is
it was shown that the gains anticipated by the DoF analysis can −1 P
lower-bounded
 by the average, we have λ1 ≥ N t i6=k qi =
be realized at finitely high SNRs where RS designs outperform O Ptāk from which we obtain
their NoRS counterparts. Moreover, additional benefits of RS 
including relaxed CSIT quality requirements were highlighted. EH|H b +
b log2 (Ik ) | H ≥ (āk − α) log2 (Pt ) + O(1). (39)
Finally, the two-user ER region was numerically obtained
by solving a sequence of WASR problems. RS proved to Form (36), (37), (38) and (39), we have d¯k ≤ max{ak , āk } −
enlarge the entire achievable ER region, drawing attention to max{āk − α, 0}. When ak ≥ āk , the upper-bound is given by
the potentials of employing RS in QoS constrained designs. min{α + ak − āk , ak }, otherwise we have 0. Combining the
two cases, we write d¯k ≤ min {(α + ak − āk )+ , ak }, from
A PPENDIX A which the sum DoF is upper-bounded by
P ROOF OF T HEOREM 1 K K
b we focus on X X 
Since P is separately designed for each H, d¯k ≤ min (α + ak − āk )+ , ak . (40)
b
a precoding scheme defined for a given H as {P}Pt . The k=1 k=1
associated powers allocated to the common precoder and the
To obtain the maximum upper-bound for (40), we define J ⊆
kth private precoder are given as: qc , kpc k2 and qk , kpk k2
K as the subset composed of all users with non-zero DoF.
respectively, which scale with increased Pt as O(Ptac ) and
Due to the symmetry in the CSIT qualities, it is sufficient
O(Ptak ) respectively, where ac , ak ∈ [0, 1]6 . The streams
interfering with the kth user are dominated by a power scaling ¯ PJ that J , {1, . . . , J}
to assume, without loss of generality,
with J , |J |. We define d(J) , j=1 d¯j . It is evident that
factor of āk , maxj {aj }j6=k . We define the corresponding kth ¯ ≤ 1. On the other hand, we have d(J)¯
d(1) ≤ Jα for J > 1.
user’s conditional average DoFs as
This is shown from
R̄c,k (Pt ) R̄k (Pt )
d¯c,k , lim and d¯k , lim . (36) J
X J
X
Pt →∞ log2 (Pt ) Pt →∞ log2 (Pt ) d¯k ≤ Jα + (ak − āk ) (41a)
For a given precoding scheme defined over all H, b k=1 k=1
J−1
X
the kth user’s long-term  achievable DoFs are given by
EH
b R̄c,k (Pt )
≤ Jα + (aJ − a1 ) + (ak − ak+1 ) (41b)
dc,k , limPt →∞ = EH ¯
b {dc,k } and dk , k=1
 log2 (Pt )
E b R̄k (Pt )
limPt →∞ Hlog (Pt ) = EH ¯
b {dk }, which follows from the
where (41a) follows from min{x, y} ≤ x, y, and the strict
2
bounded convergence theorem. The same definitions extend to positivity of the DoF in J . (41b) follows from āk ≥ aj , ∀j 6=
NoRS precoding schemes while discarding the common part. k, and is equal to Jα. We conclude that the maximum upper-
Without loss of generality, σn2 = 1 is assumed throughout the bound is obtained P when J = K for α ≥ K −1 , and J = 1
otherwise. Hence, K ¯
proof. Each of the results in (13) and (14) is obtained through k=1 dk ≤  max{1, Kα}. This is also an
two steps. First, we show that the DoF is upper-bounded by its Eb R(Pt )
upper-bound for limPt →∞ Hlog (Pt ) , which follows from the
2
corresponding result. Then, we show that the upper-bounds are bounded convergence theorem.
achievable via feasible precoding schemes. Before proceeding, This part of the proof is completed by showing that the
we define the function (x)+ , max{x, 0}. upper-bound can be achieved by a feasible precoding scheme.
6 Note that a , a < 0 can be replaced with a , a = 0 without influencing
c k c k
In particular, a DoF of 1 is achieved by Single User (SU)
the DoF results derived next. transmission in a Time Division Multiple Access (TDMA)
14

fashion. On the other hand, Kα is achieved using ZF-BF [7] S. Yang, M. Kobayashi, D. Gesbert, and X. Yi, “Degrees of freedom
vectors designed using H,b and allocating the powers such that of time correlated MISO broadcast channel with delayed CSIT,” IEEE
α Trans. Inf. Theory, vol. 59, no. 1, pp. 315–328, 2013.
qk = Pt /K, for all k ∈ K as shown in [7]. [8] C. Hao and B. Clerckx, “MISO BC with imperfect and (un)matched
2) Proof of (14): From the definitions of the ARs, we write CSIT in the frequency domain: DoF region and transmission strategies,”
 in Proc. IEEE PIMRC, Sep. 2013, pp. 1–6.
b log2 (Tc,1 ) − log2 (I1 ) | H .
R̄c + R̄1 ≤ R̄c,1 + R̄1 = EH|H b [9] R. Tandon, S. Jafar, S. Shamai Shitz, and H. Poor, “On the synergistic
benefits of alternating CSIT for the MISO broadcast channel,” IEEE
Following the same approach used in the previous part, we Trans. Inf. Theory, vol. 59, no. 7, pp. 4106–4128, Jul. 2013.
b [10] J. Chen and P. Elia, “Optimal DoF region of the two-user MISO-BC with
obtain EH|H b log2 (Tc,1 ) | H ≤ log2 (Pt ) + O(1), and general alternating CSIT,” in Proc. Asilomar, Nov. 2013, pp. 1860–1864.

EH|H log (I ) | b
H ≥ (ā − α)+
log2 (Pt ) + O(1), from [11] A. El Gamal and Y.-H. Kim, Network information theory. Cambridge
b 2 1 1
university press, 2011.
which we obtain the following upper-bound [12] C. Hao, Y. Wu, and B. Clerckx, “Rate analysis of two-receiver MISO
broadcast channel with finite rate feedback: A rate-splitting approach,”
d¯c + d¯1 ≤ d¯c,1 + d¯1 ≤ min{1 + α − ā1 , 1}. (42) IEEE Trans. Commun., vol. 63, no. 9, pp. 3232–3246, Sep. 2015.
PJ [13] M. Stojnic, H. Vikalo, and B. Hassibi, “Rate maximization in multi-
Next, we define d¯RS (J) , d¯c + k=1 d¯k for J users with antenna broadcast channels with linear preprocessing,” IEEE Trans.
strictly positive DoFs. From (42), we have d¯RS (1) ≤ 1. For Wireless Commun., vol. 5, no. 9, pp. 2338–2342, Sep. 2006.
[14] S. Christensen, R. Agarwal, E. Carvalho, and J. Cioffi, “Weighted sum-
J = 2, we have d¯RS (2) ≤ 1 + α − ā1 + a2 ≤ 1 + α, obtained rate maximization using weighted MMSE for MIMO-BC beamforming
from (42), d¯2 ≤ a2 , and ā1 ≥ a2 . For J > 2, we have design,” IEEE Trans. Wireless Commun., vol. 7, no. 12, pp. 4792–4799,
d¯RS (J) ≤ 1 + (J − 1)α obtained by combining (42) and (41). Dec. 2008.
[15] S. Shi, M. Schubert, and H. Boche, “Rate optimization for multiuser
Hence, the maximum upper-bound is 1 + (K − 1)α obtained MIMO systems with linear processing,” IEEE Trans. Signal Process.,
when J = K regardless of α. It follows that this is also an vol. 56, no. 8, pp. 4020–4030, Aug. 2008.
E b RRS (Pt ) [16] M. Shenouda and T. Davidson, “On the design of linear transceivers
upper-bound for limPt →∞ H log (Pt ) . for multiuser systems with channel uncertainty,” IEEE J. Sel. Areas
2
Finally, this is achieved using the DoF-motivated design in Commun., vol. 26, no. 6, pp. 1015–1024, Aug. 2008.
[17] N. Vucic, H. Boche, and S. Shi, “Robust transceiver optimization in
[7]. The private precoders are given as shown in the previous
√ downlink multiuser MIMO systems,” IEEE Trans. Signal Process.,
part, while qc = Pt − Ptα and pc = qc e1 . vol. 57, no. 9, pp. 3576–3587, Sep. 2009.
[18] T. Bogale, B. Chalise, and L. Vandendorpe, “Robust transceiver opti-
mization for downlink multiuser MIMO systems,” IEEE Trans. Signal
A PPENDIX B Process., vol. 59, no. 1, pp. 446–453, Jan. 2011.
G ENERATING C HANNEL S TATES [19] M. Bashar, Y. Lejosne, D. Slock, and Y. Yuan-Wu, “MIMO broadcast
channels with Gaussian CSIT and application to location based CSIT,”
For a given channel realization, a normalized channel in Proc. ITA, Feb. 2014, pp. 1–7.
estimate denoted by H b n is generating with entries drawn [20] F. Negro, I. Ghauri, and D. T. M. Slock, “Sum Rate maximization in
the noisy MIMO interfering broadcast channel with partial CSIT via the
from CN (0, 1). Moreover, a  set of M normalized estimation expected weighted MSE,” in Proc. ISWCS, Aug. 2012, pp. 576–580.
errors defined as H e (M)
n , H e (m)
n | m ∈ M is gener- [21] R. Fritzsche and G. P. Fettweis, “Robust sum rate maximization in the
ated in the same manner. This is followed by constructing multi-cell MU-MIMO downlink,” in Proc. IEEE WCNC, Apr. 2013, pp.
3180–3184.
H(M) , where
p the mth conditional
p realization is calculated as [22] Y. Wu, S. Jin, X. Gao, M. R. McKay, and C. Xiao, “Transmit designs for
H(m) = b n + σ2 H
1 − σe2 H e (m)
n . It is evident that the the MIMO broadcast channel with statistical CSI,” IEEE Trans. Signal
e
fading assumptions made at the beginning of Section VI are Process., vol. 62, no. 17, pp. 4451–4466, Sep. 2014.
[23] E. Björnson, M. Bengtsson, and B. Ottersten, “Optimal multiuser trans-
preserved. For α > 0, i.e. σe2 relies on SNR, the set H(M) mit beamforming: A difficult problem with a simple solution structure
varies with SNR. To produce the ESR curves, averaging is [lecture notes],” IEEE Signal Process. Magazine, vol. 31, no. 4, pp.
carried out over 100 realizations of H e (M)
b n , where H n is reused 142–148, Jul. 2014.
[24] M. Razaviyayn, M. Boroujeni, and Z.-Q. Luo, “A stochastic weighted
across realizations. Reusing the same normalized channels MMSE approach to sum rate maximization for a MIMO interference
across SNRs in this manner produces the smooth ESR curves channel,” in Proc. IEEE SPAWC, Jun. 2013, pp. 325–329.
shown in the figures. [25] G. Caire and K. Kumar, “Information theoretic foundations of adaptive
coded modulation,” Proc. IEEE, vol. 95, no. 12, pp. 2274–2298, Dec.
2007.
R EFERENCES [26] A. Shapiro, D. Dentcheva et al., Lectures on stochastic programming:
modeling and theory. SIAM, 2009, vol. 9.
[1] B. Clerckx and C. Oestges, MIMO Wireless Networks: Channels, [27] H. Joudeh and B. Clerckx, “Sum rate maximization for MU-MISO with
Techniques and Standards for Multi-antenna, Multi-user and Multi-cell partial CSIT using joint multicasting and broadcasting,” in Proc. IEEE
Systems. Academic Press, 2013. ICC, Jun. 2015, pp. 6349–6354.
[2] G. Caire and S. Shamai, “On the achievable throughput of a multiantenna [28] ——, “Achieving max-min fairness for MU-MISO with partial CSIT:
gaussian broadcast channel,” IEEE Trans. Inf. Theory, vol. 49, no. 7, pp. A multicast assisted transmission,” in Proc. IEEE ICC, Jun. 2015, pp.
1691–1706, Jul. 2003. 6355–6360.
[3] N. Jindal and A. Goldsmith, “Dirty-paper coding versus TDMA for [29] ——, “Robust transmission in downlink multiuser MISO systems: A
MIMO broadcast channels,” IEEE Trans. Inf. Theory, vol. 51, no. 5, pp. rate-splitting approach,” IEEE Trans. Signal Process., Accepted 2016.
1783–1794, May 2005. [30] M. Dai, B. Clerckx, D. Gesbert, and G. Caire, “A rate splitting
[4] G. Caire, N. Jindal, and S. Shamai, “On the required accuracy of strategy for massive MIMO with imperfect CSIT,” IEEE Trans. Wireless
transmitter channel state information in multiple antenna broadcast Commun., vol. 15, no. 7, pp. 4611–4624, Jul. 2016.
channels,” in Proc. Asilomar, Nov. 2007, pp. 287–291. [31] C. Hao and B. Clerckx, “MISO networks with imperfect CSIT: A
[5] N. Jindal, “MIMO broadcast channels with finite-rate feedback,” IEEE topological rate-splitting approach,” arXiv preprint arXiv:1602.03768,
Trans. Inf. Theory, vol. 52, no. 11, pp. 5045–5060, Nov. 2006. 2016.
[6] G. Caire, N. Jindal, M. Kobayashi, and N. Ravindran, “Multiuser MIMO [32] B. Hassibi and B. Hochwald, “How much training is needed in multiple-
achievable rates with downlink training and channel state feedback,” antenna wireless links?” IEEE Trans. Inf. Theory, vol. 49, no. 4, pp.
IEEE Trans. Inf. Theory, vol. 56, no. 6, pp. 2845–2866, Jun. 2010. 951–963, Apr. 2003.
15

[33] D. Love, R. Heath, V. Lau, D. Gesbert, B. Rao, and M. Andrews, “An for a MIMO interfering broadcast channel achieving max–min fairness,”
overview of limited feedback in wireless communication systems,” IEEE Signal Processing, vol. 93, no. 12, pp. 3327 – 3340, 2013.
J. Sel. Areas Commun., vol. 26, no. 8, pp. 1341–1365, Oct. 2008. [41] S. P. Boyd and L. Vandenberghe, Convex Optimization. Cambridge
[34] M. Schubert and H. Boche, “Solution of the multiuser downlink beam- university press, 2004.
forming problem with individual SINR constraints,” IEEE Trans. Veh. [42] M. Razaviyayn, “Successive convex approximation: Analysis and appli-
Technol., vol. 53, no. 1, pp. 18–28, Jan. 2004. cations,” Ph.D. dissertation, UNIVERSITY OF MINNESOTA, 2014.
[35] N. Sidiropoulos, T. Davidson, and Z.-Q. Luo, “Transmit beamforming [43] M. Razaviyayn, M. Hong, and Z.-Q. Luo, “A unified convergence
for physical-layer multicasting,” IEEE Trans. Signal Process., vol. 54, analysis of block successive minimization methods for nonsmooth
no. 6, pp. 2239–2251, Jun. 2006. optimization,” SIAM Journal on Optimization, vol. 23, no. 2, pp. 1126–
[36] D. Tse and P. Viswanath, Fundamentals of wireless communication. 1153, 2013.
Cambridge university press, 2005. [44] H. Joudeh and B. Clerckx, “AMMSE optimization for multiuser MISO
[37] A. G. Davoodi and S. A. Jafar, “Aligned image sets under channel systems with imperfect CSIT and perfect CSIR,” in Proc. IEEE GLOBE-
uncertainty: Settling conjectures on the collapse of degrees of freedom COM, Dec. 2014, pp. 3308–3313.
under finite precision CSIT,” IEEE Trans. Inf. Theory, Accepted 2016. [45] A. Tajer, N. Prasad, and X. Wang, “Robust linear precoder design for
[38] B. E. Fristedt and L. F. Gray, A modern approach to probability theory. multi-cell downlink transmission,” IEEE Trans. Signal Process., vol. 59,
Springer, 1997. no. 1, pp. 235–251, Jan. 2011.
[39] S. Kim, R. Pasupathy, and S. G. Henderson, “A guide to sample average [46] M. Grant, S. Boyd, and Y. Ye, “CVX: MATLAB software
approximation,” in Handbook of Simulation Optimization. Springer, for disciplined convex programming [Online],” Available:
2015, pp. 207–243. http://www.stanford.edu/ boyd/cvx, 2008.
[40] M. Razaviyayn, M. Hong, and Z.-Q. Luo, “Linear transceiver design

You might also like