Professional Documents
Culture Documents
MIMO Wireless Communication - MIT PDF
MIMO Wireless Communication - MIT PDF
MIMO Wireless
Communication
Daniel W. Bliss, Keith W. Forsythe, and Amanda M. Chan
■ Wireless communication using multiple-input multiple-output (MIMO)
systems enables increased spectral efficiency for a given total transmit power.
Increased capacity is achieved by introducing additional spatial channels that are
exploited by using space-time coding. In this article, we survey the environmental
factors that affect MIMO capacity. These factors include channel complexity,
external interference, and channel estimation error. We discuss examples of
space-time codes, including space-time low-density parity-check codes and space-
time turbo codes, and we investigate receiver approaches, including multichannel
multiuser detection (MCMUD). The ‘multichannel’ term indicates that the
receiver incorporates multiple antennas by using space-time-frequency adaptive
processing. The article reports the experimental performance of these codes and
receivers.
M
- multiple-output (MIMO) sys- After an introductory section, we describe the con-
tems are a natural extension of developments cept of MIMO information-theoretic capacity bounds.
in antenna array communication. While the Because the phenomenology of the channel is impor-
advantages of multiple receive antennas, such as gain tant for capacity, we discuss this phenomenology and
and spatial diversity, have been known and exploited for associated parameterization techniques, followed by ex-
some time [1, 2, 3], the use of transmit diversity has amples of space-time codes and their respective receivers
only been investigated recently [4, 5]. The advantages and decoders. We performed experiments to investigate
of MIMO communication, which exploits the physi- channel phenomenology and to test coding and receiver
cal channel between many transmit and receive anten- techniques.
nas, are currently receiving significant attention [6–9].
While the channel can be so nonstationary that it can- Capacity
not be estimated in any useful sense [10], in this article We discuss MIMO information-theoretic performance
we assume the channel is quasistatic. bounds in more detail in the next section. Capacity in-
MIMO systems provide a number of advantages creases linearly with signal-to-noise ratio (SNR) at low
over single-antenna-to-single-antenna communication. SNR, but increases logarithmically with SNR at high
Sensitivity to fading is reduced by the spatial diversity SNR. In a MIMO system, a given total transmit power
provided by multiple spatial paths. Under certain envi- can be divided among multiple spatial paths (or modes),
ronmental conditions, the power requirements associ- driving the capacity closer to the linear regime for each
ated with high spectral-efficiency communication can mode, thus increasing the aggregate spectral efficiency.
be significantly reduced by avoiding the compressive re- As seen in Figure 1, which assumes an optimal high
gion of the information-theoretic capacity bound. Here, spectral-efficiency MIMO channel (a channel matrix
spectral efficiency is defined as the total number of in- with a flat singular-value distribution), MIMO systems
formation bits per second per Hertz transmitted from enable high spectral efficiency at much lower required
one array to the other. energy per information bit.
WA T E R F I L L I N G
Power
to be x, then the total capacity is maximized by
solving
max
{ xk :∑ xk = x } k
∑ log(1 + x k /nk )
Frequency
k
FIGURE A. Notional water-filling example.
= max ∑ log(n
{ xk :∑ xk = x } k
k + xk ) − ∑ log(n ).k
k
k in each frequency cell. The volume of the water is
Use Lagrange multipliers and evaluate the total received power of the signal. Note that cells
with high levels of interference are not used at all.
∂
∂x k ∑ log(n j + x j ) − µ
∑ x j − x
A similar solution results when the capacity is ex-
pressed by
j j
to find a solution. The solution satisfies xk + nk = ∑ log(1+ g x ) k k
–1 k
µ for all nonzero xk . Figure A illustrates the solu-
tion graphically as an example of water filling. The for gains gk . One can write the gains as g k = nk−1
difference between the water level (blue) and the and use the water-filling argument above. In this
noise level (red) is the power allocated to the signal context, cells with low gains may not be used at all.
TOY 2 × 2 C H A N N E L MODE L
(1 ± ),
Eigenvalue/a 2 (dB)
2
µ1,2 = 2a v 1† v 2
–10
†
FIGURE A. Eigenvalues of HH for a 2 × 2 line-of-
For small angular separations, this definition of sight channel as a function of antenna separation.
(
= log 2 1 + 2a 2 1 + v 1† v 2 ) P ;
o
2
1
otherwise, both modes are used and the capacity is 0.5
given by
–10 –5 0 5 10 15 20
Po + µ1 + µ1 µ1 0 a 2P
C IT = log 2 1 2
, 0 (dB)
2 0 µ
2
FIGURE B. The informed transmitter capacity of a
µ µ P + µ + µ 1 2 2 × 2 line-of-sight channel, assuming antenna beam-
= log 2 1 2 o 1 2 , width separations of 0.1 (solid line) and 0.9 (dashed
2 µ1µ 2 line).
{
= 2 log 2 a 2 1 − v 1† v 2
2
Po
+1} 2 v1† v 2
2
2 2
− log 2 1 − v 1† v 2 . 2a Po + ,
1 − v † v 2
1 2
Figure B displays the resulting capacity as a func-
tion of a 2Po (mean single-input single-output SNR) when using two modes, where Po is the total noise-
for two beamwidth separations, 0.1 and 0.9. At low normalized power. In both cases, the total received
values of a 2Po the capacity associated with small power is much larger than a 2Po .
beamwidth separation performs best. In this regime, In complicated multipath environments, small
capacity is linear with receive power, and small arrays employ scatterers to create virtual arrays of
beamwidth separation increases the coherent gain. a much larger effective aperture. The effect of the
At high values of a 2Po large beamwidth separation scatterers upon capacity depends on their number
produces a higher capacity as the optimal MIMO and distribution in the environment. The individual
system distributes the energy between modes. antenna elements can be resolved by the larger ef-
The total received power is given by fective aperture produced by the scatterers. As dem-
onstrated in Figure A, the ability to resolve antenna
2 v1† v 2 a 2 Po
elements is related to the number of large singular
when using one mode, and values of the channel matrix and thus the capacity.
C UT,FS =
∫ df C UT ( Po ; H( f ))
≈
1 P
log 2 I + o HH† ,
∫ df nf nT (16)
∑
nf P
∆f log 2 I + n o H( f n ) H† ( f n ) where the distance between frequency samples is given
n =1
≈ T
∑
nf
∆f by ∆f and the nf -bin frequency-partitioned channel
n =1 matrix is given by
tr{HPH† }. The uninformed transmitter rate is maxi- trix. The maximum-likelihood estimate of H is given
mized by sending equal power to all transmit antennas by
so that tr{HPH† } becomes Po /nT tr{HH† } = nR a 2Po . Hˆ = Z T† (TT† )−1 , (25)
It is worth noting that P is not in general optimized by
the informed transmitter to maximize received power assuming that the reference signals in T are known and
but to maximize capacity. TT† is nonsingular.
The total received power for the capacity-optimized The previous channel-estimation discussion explicit-
informed transmitter, given an arbitrary channel ma- ly assumed flat fading. However, the frequency-selective
trix, is channels can be estimated by first estimating a finite
−1 impulse-response MIMO channel, which can be trans-
P + tr(D )
tr{HPIT H† } = tr o D − I formed to the frequency domain.
n+
n+
(21) A finite impulse-response extension of Equation 1
tr{D} tr{D−1 }tr{D} − n+2 is given by introducing delayed copies of T at delays
= Po + . δ1, δ 2 , , δntaps ,
n+ n+
T(δ1 )
The first term in Equation 21 is bounded from below T(δ )
by 2
≡
T , (26)
†
tr{D} tr{HH } T(δ )
Po ≥ Po (22) ntaps
n+ min{nT , nR }
≥ max{nT , nR } a 2 Po . so that the transmit matrix has dimension (nT ⋅ ntaps ) × ns .
The resulting wideband channel matrix has the dimen-
The second term in Equation 22 is bounded from be- sion (nT ⋅ ntaps ) × ns ,
low by zero. Consequently, the total received power is ˆ (δ ) H
ˆ (δ ) H
ˆ (δ )]
[H
greater than or equal to max{nT , nR } a 2Po . 1 2 ntaps
(27)
For very small a 2Po , far from the nonlinear regime of † )−1.
† (TT
= ZT
the Shannon limit, the optimal solution is to maximize
received power. This is done by transmitting the best Using this form, an effective channel filter is associated
mode only, setting n + = 1. In this regime the total re- with each transmit-to-receive antenna link. By assum-
ceived power is given by ing regular delay sampling, we can use a discrete Fou-
rier transform to construct the explicit frequency-selec-
tr{HPIT H† } → Po maxeig{HH† } . (23) tive form,
2
This result is bounded from above by nT nR a Po , which ˆ(f ) H
ˆ(f ) H
ˆ ( f )]
[H 1 2 ntaps
is achieved if there is only a single nontrivial mode in (28)
the channel. ˆ (δ ) H
= [H ˆ (δ ) H
ˆ (δ )]( ⊗ I ) ,
1 2 ntaps ntaps nT
B AY E SI A N BE L I E F N E T WOR K S
Variable dependencies
For loopless graphs, the order of
x1 Loopless directed acyclic graph (DAG) activation does not matter and the
Directed Markov field process converges. Unfortunately,
Bayesian belief network
x2 x3 for decoding applications, interest-
p(x1, x2, x3, x4) = p(x4|x3)p(x3|x1)p(x2|x1)p(x1) ing graphs have loops, so order of
x4 activation matters and convergence
Parent nodes (alphabets uk ∈ Bk ) Messages passed Received Sent is not guaranteed. Typically, nodes
are activated in a repetitive pattern
likelihoods λC λP for a certain number of iterations
Update node
(alphabet x ∈ A)
until a stopping criterion is met.
Symbol decisions are based on the
Child nodes priors πP πC
belief function.
Node calculations and messages
s
References
1. J. Pearl, Probabilistic Reasoning in In-
π (x) = Σ p(x|u1,…,us ) Π πkP(uk )
u k=1 telligent Systems: Networks of Plausible
Inference (Morgan Kaufmann, San
λ (x) = Π λkC (x) Belief Mateo, Calif., 1988).
k
λ(x) π (x) 2. R.J. McEliece, D.J.C. MacKay, and
π jC (x) = π (x) Π λ kC(x) J.-F. Cheng, “Turbo Decoding as an
k≠j Instance of Pearl’s ‘Belief Propagation’
λjP(uj) = Σ
x,uk :k≠j
λ (x)p(x|u1, …, us ) Π
k≠j
πkP(uk ) Algorithm,” IEEE J. Sel. Areas Com-
mun. 16 (2), 1998, pp. 140–152.
FIGURE A. Bayesian belief networks provide a framework for representing conditional probabilities in a graphi-
cal manner. Each node has a symbol alphabet on which it maintains a belief function that factors as a product of a
prior-like function and a likelihood-like function. Beliefs are updated by passing messages among nodes in a man-
ner suggested by the terminology. Initial states and a node update order must be chosen. Only in special cases do
the iterations converge to a Bayesian decision, but for many interesting applications, the iterative technique is both
practical and effective. Turbo codes and low density parity-check codes have decoders based on this paradigm.
works” provides more information). However, beyond Decoding occurs by treating the graph as a Bayes-
connecting the decoding algorithm of LDPC codes to ian belief network using the conditional probabilites
Bayesian belief networks, a thorough explanation of the p( z k | c k ), which express the likelihood ratios, and
(∑ c ),
steps in this algorithm is outside the scope of this ar-
ticle; we present only a concise summary. p(0 | c i1 ,…, c il ) = δ
k ik
For LDPC codes, Figure 2 shows a graph illustrat-
ing data and parity-check dependencies for the code- which expresses the parity-check relation. The resulting
words. In general, each nonzero entry in the parity- algorithm can be viewed as sweeping through the rows
check matrix indicates the edge of a graph connecting a and columns of the parity-check matrix, updating like-
parity-check node (row index) and a codeword symbol lihood ratios lk for each nonzero entry in the matrix.
(column index). The example in Figure 2 is a single par- The notation below denotes l ij as the likelihood ratio
ity-check code on four symbols. The graph shows the stored with the ij-th (nonzero) entry in a fixed (for the
symbol nodes c 1,…, c 4, the data nodes z 1,…, z 4, and given step) row or column of the parity-check matrix. In
the parity-check nodes, labeled by zeroes. Each edge this form, the iterative steps of the algorithm are sum-
between a parity-check node and a symbol node corre- marized for the simple case of a binary symbol alphabet
sponds to a nonzero entry in the parity-check matrix. by the equations:
1 1 0 0
Parity-check matrix 0 1 1 0
0 0 1 1
c1 c2 c3 c4 Codeword
p(z|c) component
p(0|ci , . . . ,ci ) = δ (
1 s
Σk cik) 0 0 0 Parity checks
Some initialization
Flat priors for codeword nodes: π (ci ) Node firing order: z 0 c z 0 c ···
For the code used in the experiments, each row sweep p(inew ) ∝ j ≠ k p(iold ) p(cLF )
k j
involves eight l ij per row and each column sweep four l ij
per column. 3. Symbol decisions
Each of the row (column) operations is independent
of any other row (column operation) and hence can be ( p ) p
m im
( LF )
c .
The components of the vector pk express probabilities GF(16). The remaining two transmitters send the oth-
for the values of the kth symbol, the permutation πk in- er two bits of the same symbol. Decoding is based on
dicates the effect of a particular nonbinary coefficient likelihood functions built over GF(16) using estimates
in the parity-check equation, Un is a Walsh-Hadamard of the channel matrices. Again, differential frequencies
matrix, and the notation denotes the Hadamard among transmitters enable spatial diversity.
(component by component) product.
Space-Time Turbo Code and Multichannel
Space-Time Extension LDPC Multiuser Detectors
There are a variety of extensions of LDPC codes to While the theoretical performance is determined by
space-time codes, which are introduced and explained the channel phenomenology, practical MIMO perfor-
in the sidebar entitled “Space-Time Codes.” For the ex- mance requires the selection of a space-time code and
periments described below, only one type of extension an appropriate matched receiver. In this section we dis-
was considered. cuss the space-time turbo code used in this example.
Each space-time channel transmits one of several We develop a maximum-likelihood formulation of a
possible quadrature phase-shift keying (QPSK) wave- multiple-antenna multiuser receiver, and we discuss
forms with slightly offset carrier frequencies. The dif- suboptimal implementations of the receiver. We also
ferential frequencies are sufficiently large to effectively introduce minimum-mean-squared-error extensions of
decorrelate the transmitted waveforms over the length the receiver, and we discuss the value and use of train-
of a codeword (1024 bits) even if the data sequences in ing data.
each channel are identical. These differential frequen-
cies are also large compared to the expected Doppler Space-Time Turbo Code
spreads and small compared to the signal bandwidth. Turbo codes, introduced elsewhere [35], illustrate that
In the simplest example of such a code, the I and codes constructed with simple components, such as
Q components of a transmitter represent, respectively, with interleavers and convolutional encoders, combined
two different LDPC codewords. Each transmitter sends with an iterative decoding process can achieve near-
the same complex baseband sequence (QPSK) shifted Shannon capacity performance. The iterative decod-
in frequency. The transmitter outputs, viewed collec- ing process, taking advantage of information exchange
tively as a vector at any instant, vary in time and thus among component decoders, provides a feasible way to
effectively probe the environment characterized by the approach optimal performance. For each component
channel matrix. Since the transmitted vector varies sig- decoder, the best decoding algorithm is the maximum
nificantly over the duration of a codeword, the coding a posteriori (MAP) algorithm or the BCJR algorithm
provides spatial diversity. Decoding occurs by forming [36], which is derived from the MAP principle. Modi-
likelihood ratios based on channel-matrix estimates and fications of the MAP algorithm include log-MAP and
then using the iterative decoder described above. Note max-log-MAP [37]. Recently, implementation of turbo
that the channel matrix can change during the code- decoders has been carried out and high data-rate decod-
word, in which case channel-matrix estimates can vary ing is possible [38].
sample to sample. A number of space-time extensions of turbo coding
The LDPC space-time code just described exhibits have been suggested [21, 22]. The approach used here,
full spatial redundancy among all transmitters. Less which was introduced elsewhere [39], provides a 2-bit/
redundancy, and therefore higher data rates, can be sec/Hz link for a 4 × 4 MIMO system with independent
achieved by dividing the transmitters into subsets, each QPSK waveforms from each transmitter. A single data
of which is fully redundant yet different from any other stream is turbo encoded and the encoded data stream is
subset. For example, the space-time code discussed lat- distributed redundantly amongst the transmitters. The
er, in the section on experiments, uses four transmitters. turbo encoder employs a rate-1/3, 16-state convolution-
The first two transmitters send two bits (redundant al encoder twice with two different 4096-bit random
in I and Q) of a symbol of an LDPC codeword over interleavers. The distribution of systematic bits is such
S PA C E -T I M E C O D E S
|
of the matrix difference C1 – C2 .
|
k=1 4N0 4
Constrained searches over the code
coefficients are commonly used to
Example of space-time trellis code (ak , bk ∈ {0, 1, 2, 3})
kth transmitter find codes with the smallest pos-
v1 v2 sible error between closest code-
(It1, It2 ) Σ
p=0
It1– p apk + Σ It2– q bqk mod4
q=0
xtk
k
i xt words under either criterion. When
data: bit pair trellis coding codeword QPSK
4 ≥ rnR , it is important to ensure
symbol modulation that the rank of the matrix differ-
ence is not too small. When 4 < rnR ,
FIGURE A. Space-time trellis codes introduce spatial as well as temporal maximizing the Euclidean distance
redundancy in the transmitted data. Code design often involves a pruned between the two codewords Ck be-
search over a class of codes based on a simple figure of merit. For exam-
ple, the minimum least-squares distance between codewords (represent-
comes important.
ed by space-time matrices Ck) can be maximized. In the example shown,
an alphabet consisting of the integers modulo 4 is used for convolutional Reference
1. S.M. Alamouti, “A Simple Transmit Di-
encoding at each transmitter. The resulting output symbols are mapped to versity Technique for Wireless Com-
a QPSK alphabet. The coefficients ak, bk determine the code. Note that the munications,” IEEE J. Sel. Areas Com-
spectral efficiency is 2 bits/sec/Hz. mun., 16 (8), 1998, pp. 1451–1458.
We present here the results of the maximum likeli- where R indicates the spatial covariance matrix of the
hood (ML) formulation of MCMUD, employing a interference plus noise, | | indicates the determinant
quasistatic narrowband MIMO-channel model. The of a matrix, † indicates the Hermitian conjugate, and tr
number of receive antennas nR by number of samples, ns indicates the trace of a matrix. Maximizing the prob-
data matrix, Z ∈ nR ×ns , is given by ability density with respect to H is equivalent to mini-
Z = HT + N , (44) mizing the tr{} in Equation 41,
row space spanned by T, and PT⊥ = Ins − PT projects ond term can be simplified and interpreted in terms of
onto the orthogonal complement of the row space of T. a beamformer
Maximizing with respect to an internal parameter of R
Ins − PTX PZ X = 1 − (TX TX† )−1 TX PZ X TX† ,
gives
tr{R −1 ZPT⊥ Z † R −1 R } − ns tr{R −1 R } = 0 , (45) w †A Z X TX†
= 1− ,
ns (50)
where R indicates the derivative of R with respect to
some internal parameter. This relationship is satisfied w †A ZPT⊥B TA†
= 1− ,
when ns
ˆ ZPT⊥ Z † where
R= , (46)
ns w A = Rˆ −X1 H
ˆ ,
A
1 1
assuming that R is not rank deficient. Using these re- Rˆ X ≡ Z X Z †X = ZPT⊥B Z † ,
sults, the ML statistic for estimating T is given by ns ns
(51)
Hˆ ≡ Z T † (T T † )−1
− ns nR A X X X X
πe
max p(Z | R , H, T ) = | ZPT⊥ Z † |−ns . (47) = ZPT⊥B TA† (TA PT⊥B TA† )−1 .
R ,H ns
The nR × 1 vector wA contains the receive beamforming
The determinant of ZPT⊥ Z †
is minimized to demodu- weights, R̂ X is the interference-mitigated signal-plus-
late the signals for all transmitters jointly. noise covariance matrix estimate, and Ĥ A is the chan-
Although it is theoretically possible to use the statis- nel estimate associated with TA . It is worth noting that
tic ZPT⊥ Z † directly for demodulation, an iterative ap- the form for Ĥ A is simply the column of Ĥ , given in
proach is much more practical. We define T ≡ (TA† TB† )† Equation 43, associated with TA .
to be a partitioned form of T, where the nA × ns ma-
trix TA contains the signals associated with a particular ˆ = ZT † (TT † )−1
H
subset of nA transmit antennas and the (nT – nA) × ns
−1
matrix TB contains the signals associated with all other † † A A
T T† TA TB†
= Z TA TB
transmit antennas. By factoring PT⊥B = X † X , the rows T T
B A
†
TBTB†
of X form an orthonormal basis for the complement −1
M M1,2
of the row space of TB such that X X † = I , where the ≡ † †
Z TA TB †
11,
(52)
symmetric identity matrix has a dimension of ns minus M1,,2 M2,2
the number of rows in TB. By defining Z X ≡ ZX † and
TX ≡ TA X † , we can show that = Z TA† TB†
which is the same form found in Equation 51. codes require relatively long block lengths to be effec-
Demodulation is performed by maximizing the tive, they are particularly sensitive to Doppler offsets.
magnitude of the inner product of the beamformer out- Extending the beamformer to include delay and Dop-
put w †A Z and the interference-mitigated reference sig- pler corrects this deficiency. With this approach, the
nal TA PT⊥B . spatial-beamformer interpretation presented in Equa-
tion 50 is formally the same, but all projectors are ex-
Suboptimal Implementation tended to include delay and Doppler spread. The data
A variety of suboptimal but computationally more ef- matrix is replaced with
ficient variants are possible. In general, these approxi-
Z STF ≡ [Z †X (δ t1, δ f1 ) Z †X (δ t1, δ f 2 ) (57)
mations become increasingly valid as the number of
samples in the block increases. Z †X (δ t1, δ f nδ f )Z †X (δ t nδt , δ f nδ f †
)] ,
The first computational simplification is found by
noting that the normalization term of the channel esti- which is a (nR ⋅ nδ f ⋅ nδt ) × ns matrix that includes pos-
mate in Equation 51 can be approximated by sible signal distortions. The new channel estimate has
dimension (nR ⋅ nδ f ⋅ nδt ) × nT , but T remains the same.
ˆ = Z P ⊥ T † (T P ⊥ T † )−1
H A TB A A TB A The MMSE beamformer is given by
= Z PT⊥B TA† (TA TA† )−1// 2 (54) w STF = argmin w †STF Z STF − TX
2
× (I − [TA TA† ]−1/ 2 TA PTB TA† [TA TA† ]−1/ 2 )−1 (58)
= (Z STF Z †STF )−1 Z STF TX† .
× (TA TA† )−1/ 2
Figure 3 shows a diagram for this demodulator (MC-
≈ Z PT⊥B TA† (TA TA† )−1 . MUD).
Space-time-
beamformer
Temporal
frequency
adaptive
demultiplexer
ects on the orthogonal complement of the row space of Info
Space-time
decoder
bits
Turbo
M is given by
⊥
PM = I − M† (MM† )−1 M . (55)
This operator can be approximated by
Channel
∏ I − M
⊥ † −1 estimation
PM ≈ †
m ( Mm Mm ) Mm , (56)
Space-time
multiplexer
m
encoder
Turbo
20
Training Data
10
In principle, there is no need for training data, because
a 2 (dB)
0
the channel and information can be estimated jointly.
Furthermore, the use of training data competes directly –10
ten times. All four or eight transmitters emitted nearly the number of transmit antennas, and the estimated α
orthogonal signals simultaneously. for the transmit site. Uncertainty in α is determined by
using the bootstrap technique [42]. The CDF values re-
Attenuation ported here are evaluated over appropriate entries from
Figure 4 displays the peak-normalized mean-squared Table 1. The systematic uncertainty in the estimation
SISO attenuation averaged over transmit and receive of α caused by estimation bias, given the model, is less
antenna pairs for a given transmit site for the outdoor than 0.02.
environment. The uncertainty in the estimate is evalu- Figure 5 displays CDFs of a 2 = tr{HH† }/(nT nR )
ated by using a bootstrap technique. estimates normalized by mean a 2 for each transmit site.
CDFs are displayed for narrowband SISO, 4 × 4, and
Channel Complexity 8 × 8 MIMO systems. Because of the spatial diversi-
We present channel complexity by using three differ- ty, the variation in mean antenna-pair received power
ent approaches: variation in a2 estimates, eigenvalue cu- decreases dramatically as the number of antenna pairs
mulative distribution functions (CDF), and α estimate increases, as we would expect. This reduction in varia-
CDFs. Table 1 is a list of transmit sites used for these tion demonstrates one of the most important statistical
results. The table includes the distance (range) between effects that MIMO links exploit to improve commu-
transmitter and receiver, the velocity of the transmitter, nication link robustness. For example, if we wanted to
0.4 0.4
0.2 0.2
0 0
–15 –10 –5 0 5 –40 –30 –20 –10 0 10 20
a 2 (dB) Eigenvalue (dB)
FIGURE 5. Cumulative distribution function (CDF) of chan- FIGURE 6. CDF of narrowband channel eigenvalue distribu-
nel a 2 estimates, normalized by the mean a 2 for each site, for tions for 4 × 4 MIMO systems.
SISO, 4 × 4, and 8 × 8 MIMO systems.
0.8
operate with a probability of 0.9 to close the link, we
would have to operate the SISO link with an excess 0.6
SISO SNR (a 2Po ) margin of over 15 dB. The MIMO 0.4
systems received the added benefit of array gain, which
0.2
is not accounted for in the figure.
Figures 6 and 7 present CDFs of eigenvalues for 4 × 4 0
and 8 × 8 mean-squared-channel-matrix-element-nor- –40 –30 –20 –10 0 10 20
CDFs are evaluated over all site lists. Some care must be FIGURE 7. CDF of narrowband channel eigenvalue distribu-
taken in interpreting these figures because eigenvalues tions for 8 × 8 MIMO systems.
are not independent. Nonetheless, the steepness of the
CDFs is remarkable. We might interpret this to indicate eigenvalues of the stationary transmitter do vary some-
that optimized space-time codes should operate with a what. While the transmitters and receivers are physi-
relatively high probability of success. cally stationary, the environment does move. This effect
Figure 8 shows the CDFs for α estimates. The mean is particularly noticeable near busy roads. Furthermore,
values of α for each environment are 0.76 for 4 × 4 sites while the multiple antennas are driven with the same lo-
and 0.79 for 8 × 8 sites, where the form x ± y indicates cal oscillator, given the commercial grade transmitters,
the estimated value x with statistical uncertainty y es- there are always some small relative-frequency offsets.
timated by using a bootstrap uncertainty estimation The example variation is given for transmit sites 7 and
technique. While we might expect smaller variation in 14 from Table 1.
the 8 × 8 systems because of the much larger number of
1.0
paths, this effect may have been exaggerated in Figure 8
4×4
because of the limited number of 8 × 8 sites available in 0.8
8×8
CDF of α
0.4
Channel Stationarity
Figure 9 displays the temporal variation of eigenvalues 0.2
0
cated by using the metric presented in Equation 36. In
–10 the example, the stationary transmitter is located at site
–20 1.0
0 0.05 0.10 0.15
γ{H(t0), H(t )}
0.8
0.3
0.8 Stationary 0.1
γ{H(t 0 ), H(t)}
Moving 0.6
0.6
0.4
0.4
0.2
0.2
0 0.02 0.04 0.06
0 0.02 0.04 0.06 Time (sec)
Time (sec)
FIGURE 12. CDF of time variation of power-weighted mean
FIGURE 10. Example time variation of power-weighted mean cos 2 θ, γ {H(t0 ),H(t)} , for a moving 4 × 4 MIMO system. Con-
cos 2 θ, γ {H(t0 ),H(t)} , for stationary and moving 4 × 4 MIMO tours of CDF probabilities of 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8,
systems. and 0.9 are displayed.
0.8
ment consists predominantly of two- and three-story
0.6 residential buildings and some commercial buildings of
0.4 similar heights in an urban setting. Propagation delay
spreads are typically several microseconds and Dop-
0.2
pler spreads are at most a few hundred hertz. There is
typically no identifiable line-of-sight component in
–600 –400 –200 0 200 400 600
the propagation. The signal has a pulse-shaped QPSK
Frequency (kHz)
modulation and bandwidth of about 100 kHz. Coding
FIGURE 13. Example of frequency-selective variation of the provides a spectral efficiency of 2 bits/sec/Hz.
power-weighted mean cos 2 θ , γ {H(f0 ),H(f )} .
The receiver consists of sixteen channels fed by low-
gain elements with wide azimuth beamshapes. The
elements are oriented in various directions, not neces-
1.0
sarily pointing at the sources. For the example below,
γ{H(f0 ), H(f)}
• Spectral efficiency
FIGURE 14. CDF of frequency-selective variation of the 0.15
2
2 bits/sec/Hz
power-weighted mean cos θ , γ {H(f0 ),H(f )} . The graph dis-
plays contours of CDF probabilities of 0.1, 0.2, 0.3, 0.4, 0.5,
0.6, 0.7, 0.8, and 0.9. 0.10
MIMO links can be evaluated. The channel transfer ent transmit locations. In both examples the link does
matrices have a random structure that varies from sub- not have line of sight. Different four-antenna subsets of
array to subarray. the sixteen-channel receiver were used to improve statis-
Figure 15 shows symbol-error probability as a func- tical significance.
tion of excess Eb /N0, which is related to the excess spec- Example 1. In this example, the transmitter was lo-
tral efficiency (beyond 2 bits/sec/Hz) predicted by a cated in the parking lot at Boston University (Universi-
capacity bound, given the measured channel transfer ty Road and Storrow Drive) with about a half-kilometer
matrix. For this example, in a comparatively station- separation between the transmitter and receiver. Figure
ary environment, the channel transfer matrices are used 16 shows the geometry of the experiment. Traffic on
both for the simulated results and for the computation Storrow Drive is typically heavy and the posted speed
of excess Eb /N0. As the figure shows, about 4.5 dB ex- limit of 45 mph is generally misinterpreted as the mini-
cess Eb /N0 is required to complete the link at 2 bits/sec/ mum allowed speed. While the transmitter is stationary,
Hz. Simulations agree to within about 1 dB. the environment is nonstationary because of the traffic.
Example 2. The transmitter was moving at 10 m/sec
Space-Time Turbo-Code Experiments at a range of 500 m. Figure 17 shows the geometry of
In this section we present the experimental performance the experiment. To simulate the effects of local oscilla-
of a space-time turbo code. We begin by discussing the tor errors, we introduced artificial frequency offsets at
experimental parameters, and then we summarize the the transmitters. These errors were within ±80 Hz.
performance of the MIMO system with stationary Two wideband jammers were transmitting at a range
transmitter and receiver in a dynamic environment. Ad- of 100 m. Each jammer was received at a jammer-to-
ditionally, for an even more complicated environment, noise ratio (JNR) of approximately 25 dB. Figure 18
we describe performance results with a mobile transmit- shows the eigenvalues of the noise-normalized interfer-
ter and multiple strong interferers. ence-plus-noise spatial covariance matrix. The “noise”
eigenvalues of the jammer spatial eigenvalue distribution
Experimental Parameters
Outdoor experiments were performed in a frequency
allocation near the PCS band, using a sixteen-channel
receiver. A variety of coding and interference regimes
Cambridge
were explored for both moving and stationary transmit-
ters. Channel-probing sequences and four- and eight-
transmitter space-time codes were transmitted. This Receive MIT
section reports on the outdoor performance results of array
space-time turbo codes for 4 × 4 MIMO systems. The
outdoor experiments were performed during July and
August 2002 on and near the MIT campus. The receive
antenna array was placed on top of a one-story building
(at Brookline Street and Henry Street) surrounded by
two- and three-story buildings.
Transmit
For the examples discussed in this article, quadra- array Boston
ture-phase-shift-key (QPSK) signals were transmitted University
on four antennas at 123 × 103 chips per second for a
total data rate of 246 kb/sec, using the space-time code
discussed earlier. A 160-kHz spectral limit was enforced
by using a root-raised-cosine pulse shaping. Total trans- FIGURE 16. Example 1: map of MIMO communication exper-
mit power was approximately 100 mW, radiated from iment near the MIT campus, including the locations of the
0-dBi antennas. We discuss two examples with differ- transmitter and receiver.
40
20
Receive
array Jammers 10
25 dB JNR
100 m
0
1 2 3 4
Eigenvalue number
FIGURE 17. Example 2: map of MIMO communication ex-
periment near the MIT campus, including the locations of FIGURE 18. Eigenvalue distribution of the noise-normalized
the transmitter, receiver, and jammers at a fixed jammer-to- interference-plus-noise spatial covariance matrix.
noise ratio (JNR).
are slightly higher than we would naively expect, given ing (three turbo iterations; Doppler taps: {–4/3, –2/3,
the 0-dB noise normalization. This behavior is probably 0, 2/3, 4/3}; temporal taps: {–1/2, 0, 1/2}), and where
an indication of either delay spread or nonstationarity in Doppler taps are represented in resolution cells (60
the received jammer signal. Either of these explanations Hz).
presents additional challenges to the receiver. Performance improves with receiver complexity; the
Both the delay and the Doppler spread affect the algorithm, however, must bootstrap up in complexity
design and performance of the receiver. Here a space- iteratively. Starting with the highest complexity on the
time-frequency adaptive processor is employed. The first iteration increases the probability of converging to
number of delay and frequency taps in the adaptive pro- the wrong solution. Because the channel contains sig-
cessor depends upon the phenomenology. Delay spread nificant Doppler spread, the spatial beamformer per-
was found to be less than ±4 µsec. For the stationary forms poorly. With the relatively long block lengths of
environment, in quiet regions (no nearby traffic), no the turbo code, Doppler beamforming is required in
Doppler spread was detected. For the stationary trans- this environment. We note that experimental perfor-
mitter near heavy traffic in experimental example 1, the mance is essentially the same as was found in simula-
Doppler spread was found to be within ±150 Hz. For tions. Furthermore, the experimental performance is
the moving transmitter in experimental example 2, the
Doppler spread was found to be within ±180 Hz.
10 –1
Experimental Example 1 Spatial beamforming
Bit error rate
Training-data-based
Bit error rates for various detector alternatives are re- 10 –2 SFAP
10 –2
jammed environments. A channel phenomenology pa-
rameterization was introduced. Experimental phenom-
10 –3
enological results were reported, the results indicating
that the observed channels can be typically character-
10 –4
–21 –20 –19 –18 –17 ized by high degrees of complexity. Furthermore, for
SISO SINR (dB) environments with transmitters on moving vehicles,
FIGURE 20. Bit error rate of 4 × 4, 2-bit/sec/Hz space-time
the channel varies significantly on a time scale less than
turbo code using the MCMUD receivver with space-time- 10 msec. Two space-time coding techniques were in-
frequency beamforming as a function of mean SISO signal- troduced, one based on LDPC and the other on turbo
to-interference-plus-noise ratio (SINR). codes. Experimental demodulation performance results
were presented for a variety of environments, including
similar to the simulated performance of the best space- those with wideband jammers. In the presence of the
time codes. jammer, the MIMO system (using the MCMUD re-
ceiver) operated dramatically better than SISO systems.
Experimental Example 2
The experimental data includes the effects of two high- Acknowledgments
power wideband jammers, a moving transmitter, and The authors would like to thank Peter Wu of Lincoln
local oscillator errors. Experimental performance of this Laboratory for his help developing the space-time turbo
space-time turbo code for a stationary transmitter in the code, and Naveen Sunkavally of MIT and Nick Chang
absence of interference is discussed elsewhere [39]. of the University of Michigan for their help with the ex-
Figure 20 shows the bit error rate of the space-time periment. The authors would also like to thank the ex-
turbo code using the MCMUD receiver. The bit error cellent Lincoln Laboratory staff involved in the MIMO
rate is displayed in terms of the mean SISO signal-to- experiment, in particular Sean Tobin, Jeff Nowak, Lee
interference-plus-noise ratio (SINR). This is the aver- Duter, John Mann, Bob Downing, Peter Priestner, Bob
age SINR at a given receive antenna, assuming that Devine, Tony Tavilla, and Andy McKellips. We also
all power of the transmit array is transmitted from a thank Ali Yegulalp of Lincoln Laboratory and Vahid
single transmit antenna. We note that this experimen- Tarokh of Harvard University for their thoughtful com-
tal system in this difficult environment operates at an ments, and Dorothy Ryan of Lincoln Laboratory for
SINR that is 25 dB better than the information-theo- her helpful comments. Finally, the authors would like
retic SISO bound, and operates probably at least 35 dB to thank the MIT New Technology Initiative Commit-
better than a practical SISO system. Furthermore, there tee for their support.
is only approximately a 3-dB loss in a 2Po performance
compared to the performance in an environment with-
out jammers. The effectiveness of the receiver is due in
part to its ability to compensate for delay and frequency
spread. The MCMUD employs a space-time-frequen-
cy adaptive processor that uses a four-antenna receiver
with temporal and frequency taps that cover a range of
±4 microseconds and ±200 Hz.
Summary
In this article we addressed information-theoretic, phe-
nomenological, coding, and receiver issues for MIMO
pp. 2041–2052.
19. K.W. Forsythe, “Capacity of Flat-Fading Channels Associat-
R EFER ENCE S ed with a Subspace-Invariant Detector,” 34th Asilomar Conf.
on Signals, Systems and Computers 1, Pacific Grove, Calif., 29
1. W.C. Jakes, Microwave Mobile Communications (Wiley, New Oct.–1 Nov. 2000, pp. 411–416.
York, 1974). 20. K.W. Forsythe, “Performance of Space-Time Codes over a
2. R.A. Monzingo and T.W. Miller, Introduction to Adaptive Flat-Fading Channel Using a Subspace-Invariant Detector,”
Arrays (Wiley, New York, 1980). 36th Asilomar Conf. on Signals, Systems and Computers 1, Pa-
3. K.W. Forsythe, D.W. Bliss, and C.M. Keller, “Multichan- cific Grove, Calif., 3–6 Nov. 2002, pp. 750–755.
nel Adaptive Beamforming and Interference Mitigation in 21. A. Stefanov and T.M. Duman, “Turbo Coded Modulation
Multiuser CDMA Systems,” Thirty-Third Asilomar Conf. on for Wireless Communications with Antenna Diversity,” Proc.
Signals, Systems & Computers 1, Pacific Grove, Calif., 24–27 IEEE Vehicular Technology Conf. 3, Amsterdam, 19–22 Sept.
Oct. 1999, pp. 506–510. 1999, pp. 1565–1569.
4. A. Wittneben, “Basestation Modulation Diversity for Digital 22. Y. Liu, M.P. Fitz, and O.Y. Takeshita, “Full Rate Space–Time
SIMULCAST,” Proc. IEEE Vehicular Technology Conf., St. Turbo Codes,” IEEE J. Sel. Areas Commun. 19 (5), 2001, pp.
Louis, Mo., 19–22 May 1991, pp. 848–853. 969–980.
5. V. Weerackody, “Diversity for Direct-Sequence Spread Spec- 23. H. Sampath and A.J. Paulraj, “Joint Transmit and Receive
trum Using Multiple Transmit Antennas,” Proc. IEEE Int. Optimization for High Data Rate Wireless Communication
Communications Conf. 3, Geneva, 23–26 May, 1993, pp. Using Multiple Antennas,” Conf. Record Thirty-Third Asilo-
1775–1779. mar Conf. on Signals, Systems & Computers 1, Pacific Grove,
6. G.J. Foschini, “Layered Space-Time Architecture for Wire- Calif., 24–27 Oct. 1999, pp. 215–219.
less Communication in a Fading Environment When Using 24. N. Sharma and E. Geraniotis, “Analyzing the Performance
Multi-Element Antennas,” Bell Labs Tech. J. 1 (2), 1996, pp. of the Space-Time Block Codes with Partial Channel State
41–59. Feedback,” Proc. Wireless Communications and Networking
7. I.E. Telatar, “Capacity of Multi-Antenna Gaussian Chan- Conf. 3, Chicago, 23–28 Sept., 2000, pp. 1362–1366.
nels,” Eur. Trans. Telecommun. 10 (6), 1999, pp. 585–595. 25. D.W. Bliss, K.W. Forsythe, A.O. Hero, and A.F. Yegulalp,
8. D.W. Bliss, K.W. Forsythe, A.O. Hero, and A.L. Swindle- “Environmental Issues for MIMO Capacity,” IEEE Trans.
hurst, “MIMO Environmental Capacity Sensitivity,” Thir- Signal Process. 50 (9), 2002, pp. 2128–2142.
ty-Fourth Asilomar Conf. on Signals, Systems & Computers 1, 26. T.M. Cover and J.A. Thomas, Elements of Information Theory
Pacific Grove, Calif., 29 Oct.–1 Nov. 2000, pp. 764–768. (Wiley, New York, 1991).
9. D.W. Bliss, K.W. Forsythe, and A.F. Yegulalp, “MIMO 27. F. R. Farrokhi, G. J. Foschini, A. Lozano, and R.A. Valen-
Communication Capacity Using Infinite Dimension Ran- zuela, “Link-Optimal Space–Time Processing with Multiple
dom Matrix Eigenvalue Distributions,” Thirty-Fifth Asilo- Transmit and Receive Antennas,” IEEE Comm. Lett. 5 (3),
mar Conf. on Signals, Systems & Computers 2, Pacific Grove, 2001, pp. 85–87.
Calif., 4–7 Nov. 2001, pp. 969–974. 28. D.W. Bliss, A.M. Chan, and N.B. Chang, “MIMO Wireless
10. T.L. Marzetta and B.M. Hochwald, “Capacity of a Mobile Communication Channel Phenomenology,” IEEE Trans. An-
Multiple-Antenna Communication Link in Rayleigh Flat tennas Propag., 52 (8), 2004, pp. 2073–2082.
Fading,” IEEE Trans. Inf. Theory 45 (1),1999, pp. 139–157. 29. D. Gesbert, H. Bölcskei, D.A. Gore, and A.J. Paulraj, “Perfor-
11. L. Zheng and D.N.C. Tse, “Diversity and Freedom: A Fun- mance Evaluation for Scattering MIMO Channel Models,”
damental Tradeoff in Multiple-Antenna Channels,” IEEE Thirty-Fourth Asilomar Conf. on Signals, Systems & Computers
Trans. Inf. Theory 49 (9), 2003, pp. 1076–1093. 1, Pacific Grove, Calif., 29 Oct.–1 Nov. 2001, pp. 748–752.
12. S.M. Alamouti, “A Simple Transmit Diversity Technique for 30. H. Bölcskei and A.J. Paulraj, “Performance of Space-Time
Wireless Communications,” IEEE J. Sel. Areas Commun. 16 Codes in the Presence of Spatial Fading Correlation,” Thirty-
(8), 1998, pp. 1451–1458. Fourth Asilomar Conference on Signals, Systems & Computers
13. V. Tarokh, H. Jafarkhani, and A.R. Calderbank, “Space– 1, Pacific Grove, Calif., 29 Oct.–1 Nov. 2000, pp. 687–693.
Time Block Codes from Orthogonal Designs,” IEEE Trans. 31. R.G. Gallager, Low-Density Parity-Check Codes (MIT Press,
Inf. Theory 45 (5), 1999, pp. 1456–1467. Cambridge, Mass., 1963).
14. G. Ganesan and P. Stoica, “Space–Time Block Codes: A 32. S.-Y. Chung, G.D. Forney, Jr., T.J. Richardson, and R. Ur-
Maximum SNR Approach,” IEEE Trans. Inf. Theory 47 (4), banke, “On the Design of Low-Density Parity-Check Codes
2001, pp. 1650–1656. within 0.0045 dB of the Shannon Limit,” IEEE Commun.
15. B. Hassibi and B. Hochwald, “High-Rate Linear Space-Time Lett. 5 (2), 2001, pp. 58–60.
Codes,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Sig- 33. T.J. Richardson, M.A. Shokrollahi, and R.L. Urbanke, “De-
nal Processing 4, Salt Lake City, Utah, 7–11 May 2001, pp. sign of Capacity-Approaching Irregular Low-Density Pari-
2461–2464. ty-Check Codes,” IEEE Trans. Inf. Theory 47 (2), 2001, pp.
16. V. Tarokh, N. Seshadri, and A.R. Calderbank, “Space–Time 619–637.
Codes for High Data Rate Wireless Communication: Perfor- 34. R.J. McEliece, D.J.C. MacKay, and J.-F. Cheng, “Turbo De-
mance Criterion and Code Construction,” IEEE Trans. Inf. coding as an Instance of Pearl’s ‘Belief Propagation’ Algo-
Theory 44 (2), 1998, pp. 744–765. rithm,” IEEE J. Sel. Areas Commun. 16 (2), 1998, pp. 140–
17. B.M. Hochwald and T.L. Marzetta, “Unitary Space–Time 152.
Modulation for Multiple-Antenna Communications in Ray- 35. C. Berrou, A. Glavieux, and P. Thitimajshima, “Near Shan-
leigh Flat Fading,” IEEE Trans. Inf. Theory 46 (2), 2000, pp. non Limit Error-Correcting Coding and Decoding: Turbo-
543–564. Codes,” Proc. IEEE Int. Communications Conf. 2, Geneva,
18. B.M. Hochwald and W. Sweldens, “Differential Unitary 23–26 May 1993, pp. 1064–1070.
Space-Time Modulation,” IEEE Trans. Com. 48 (12) 2000, 36. L.R. Bahl, J. Cocke, F. Jelinek, and J. Raviv, “Optimal De-