Professional Documents
Culture Documents
Abstract—Sequential Monte Carlo (SMC) methods are pow- Carlo samples of the state variables or some other latency vari-
erful techniques for online filtering of nonlinear and non-Gaussian ables, these methods can flexibly adapt to the dynamics of the
dynamic systems. Typically dynamic systems exhibit strong underlying stochastic systems. A complete theoretical frame-
memory effects, i.e., future observations can reveal substantial
information about the current state. Recently, the delayed-sample work for the SMC has appeared in [23]. Many successful appli-
sampling method has been proposed in the context of mixture cations of SMC in diverse areas of science and engineering can
Kalman filter (MKF), which makes use of future observations in be found in the recent books [9], [20].
generating samples of the current state. Although this method is Recently, we have applied the SMC technique, and in
highly effective in producing accurate filtering results, its compu- particular, the mixture Kalman filter (MKF) for conditional
tational complexity is exponential in terms of the delay, due to the
need to marginalize the future states. In this paper, we address this dynamic linear models [3], to the problem of adaptive detection
difficulty by developing two new sampling schemes for delayed and decoding in flat-fading channels [4] in the presence of
estimation, namely, delayed-pilot sampling and hybrid-pilot sam- Gaussian or non-Gaussian ambient noise. This problem is
pling. The basic idea of delayed-pilot sampling is that instead of of fundamental importance in communication theory, and it
exploring the entire space of future states, we generate a number is known that the optimal solution has a prohibitively high
of random pilot streams, each of which indicates what would
happen in the future if the current state takes a particular value. computational complexity [13], [25], [26]. Various suboptimal
The sampling distribution of the current state is then determined approaches have been proposed in the literature to bear on this
by the incremental importance weight associated with each pilot problem [2], [5]–[7], [10], [11], [14], [16], [17], [19], [24],
stream. The delayed-pilot sampling can be used in conjunction [27], [29], [30]. Our approach in [4] is based on a Bayesian
with the delayed-sample method, resulting in a hybrid scheme. formulation of the problem of on-line detection/decoding in
This new sampling technique is then applied to solve the problem
of adaptive detection and decoding in flat-fading communication flat-fading channels and the SMC methodology. The basic idea
channels. Simulation results are provided to demonstrate the is to sequentially impute multiple samples of the transmitted
performance of the new low-complexity sampling techniques symbols based on the current observation. Associated with
for delayed estimation and to compare with the delayed-sample each sequence of imputed symbols is its important weight.
method. The imputed symbol sequences, together with their importance
Index Terms—Adaptive decoding, delayed-pilot, delayed- weights, are used to compute the Bayesian estimates of the
sample, fading channel, mixture Kalman filter, sequential Monte transmitted information. It is shown in [4] through simulations
Carlo. that the performance of the SMC-based receivers can be
remarkably close to the so-called genie-aided bound in fading
I. INTRODUCTION channels for both uncoded and coded systems without the use
of any training/pilot symbols or decision feedback.
the receiver performance when the signals are channel-coded In particular, if is sampled from a trial distribution that
and/or when the channel ambient noise is non-Gaussian. has the same support as , and let , then
However, the computational complexity of the de- is properly weighted with respect to .
layed-sample method developed in [4] is quite high, due Consider the following conditional dynamic linear model
to the need of marginalizing out the future states. For ex- (CDLM) of the form
ample, in a Gaussian noise flat-fading channel, for a -step
delayed-sample method, the algorithmic complexity is (1)
(where is the finite set of signal constellations, (2)
and is the number of Monte Carlo samples) for an uncoded
where and are the state and ob-
system, and it is for a coded system with a rate
servation noise, respectively, and is a sequence of random
convolutional code. The complexity is even much higher
indicator variables that may form a Markov chain but are inde-
if the ambient noise is non-Gaussian. Hence, it becomes im-
pendent of and and the past and , . The matrices
portant to develop computationally efficient delayed estimation
, , , and are known given . The indicator
techniques.
takes values from a finite set .
In this paper, new low-complexity techniques are developed The MKF in [3] is a novel sequential Monte Carlo method
for delayed estimation under the MKF framework. In order to for online filtering and prediction of CDLMs; it exploits
draw a new sample of the current state, instead of marginal- the conditional Gaussian property and utilizes a marginal-
izing the future states of steps, we generate random pilot
ization operation to improve the algorithmic efficiency. Let
streams of these future states. Each pilot stream is obtained by and . By
starting with one of the possible value and propagating recursively generating a set of properly weighted random
steps of SMC. The incremental importance weight of this
samples to represent , the MKF
pilot stream is then given to the value and with probability
approximates the target distribution by a random
proportional to which a sample of the current state is drawn. Fi-
mixture of Gaussian distributions
nally, to correct the bias introduced by the pilot approximation,
the usual incremental weight is multiplied by the inverse of the
pilot weight. We term this method of delayed estimation the de-
layed-pilot method. Moreover, such an idea can be combined
with the delayed-sample method developed in [4] to yield a de- where
layed-hybrid method. That is, in order to achieve a -step
and
delayed estimation, we implement the combination of a -step
delayed-sample sampling and a -step delayed-pilot sampling. are obtained with a Kalman filter on the system (1), (2) for the
The delayed-pilot method and the delayed-hybrid method are given indicator trajectory . Denote
then applied to the problem of adaptive detection and decoding
in flat-fading channels treated in [4].
The rest of this paper is organized as follows. In Section II,
we briefly summarize the MKF algorithm, as well as the de- Thus, a key step in the MKF is the production at time of
layed-weight method and delayed-sample method for MKF. In the weighted samples of the indicators and the Kalman filter
Section III, we develop the delayed-pilot method and the de- outputs , based on the set of samples
layed-hybrid method for MKF. In Section IV, we treat the prob- at the previous time [3], [4].
lems of adaptive detection and adaptive decoding in flat-fading The model (1) and (2) is often observed to be highly cor-
channels using the proposed new sampling methods. Simulation related. As a result, future observations often contain informa-
results are provided in Section V. Finally, Section VI contains tion about the current state. Hence, a delayed estimate is usually
the conclusion. more accurate than the concurrent estimate. In delayed estima-
tion, instead of making inference on or instantaneously
with the posterior distribution or , we delay
II. BACKGROUND this inference to a later time , with the distribution
First, we define an important concept that is used throughout and , respectively. In [4], two delayed
the paper. A set of random samples and the corresponding estimation techniques are proposed, namely, the delayed-weight
weights is said to be properly weighted with method and the delayed-sample method.
respect to distribution if, for any measurable function , The idea behind the simple delayed-weight method is as fol-
we have lows. If the set is properly weighted with
respect to , then when we focus our attention on
at time , we have that is properly
weighted with respect to . Then, any inference about
the indicator , can be approximated by
as
(3)
WANG et al.: DELAYED-PILOT SAMPLING FOR MIXTURE KALMAN FILTER 243
with
(4)
Using
(12)
(13)
(9)
(14)
(11)
WANG et al.: DELAYED-PILOT SAMPLING FOR MIXTURE KALMAN FILTER 245
This is the original weight multiplied by the incremental This is because is properly weighted with
weight of the sampled pilot propagated from to . respect to .
Let Alternatively, if one does not maintain two sets of weights
and is willing to wait additional ( ) steps, then the
weight generated at time from the above algo-
rithm can be used for inference. At that time, is a
be the selected pilot. Then, is properly weighted
properly weighted sample with respect to . Then,
with respect to . This can be easily seen from the
following: is properly weighted with respect to .
Since , we still take full advantage of the delay .
Remark 4 (Resampling): Resampling is an important step in
SMC. It rejuvenates the sampler when the effective sample size
becomes small, hence providing more efficient samples for the
future states. For more details, see [23]. Specifically, suppose at
time we have a set of random samples prop-
erly weighted with respect to . By treating the samples
as a discrete approximation of the posterior distribution, we can
generate another discrete representation as follows.
• Select a set of positive numbers , .
• For
— Randomly select from the set with
probability proportional to .
— If , then set the new weight associated
with this sample to be .
(16) • Return the new representation .
where the numerator of (16) is the target distribution, and the de- The new weight is also properly weighted with respect to
nominator is the sampling distribution of , the th stream .
up to time , and the selected th pilot path. The choice of the resampling probability directly affects
Hence, is properly weighted with respect to the efficiency of the algorithm. As in [23], one choice is
. Therefore, we have , which makes the resulting resampled set have
equal weights. In the delay-pilot algorithm, the auxiliary weight
is a better choice. As shown above, gives small
weights to samples that are close to the future distribution
with but away from the current distribution .
Using as the resampling probability is shortsighted. It tends
(17) to remove the samples that are less important at time but may
be important in the future (e.g., at time ). On the other
Note that no additional computation is needed to obtain the aux- hand, the auxiliary weight is based on the future distribution.
iliary weights; hence, it is recommended to keep two sets of This resampling procedure does not reduce the variance of the
weights in the propagation. However, the auxiliary weight are current weight distribution as much as resampling according
computed at every time step and cannot be used for further prop- to . However, since it keeps better samples (in the eyes of
agation (i.e., to replace ). It is solely used for inference of future), the weight distribution at latter time will be better.
at time . Note that with , the new weight is proportional to
246 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 50, NO. 2, FEBRUARY 2002
B. Delayed-Hybrid Sampling
The delayed-hybrid sampling method is a combination of the
delayed-sample method and the delayed-pilot method. Here, we
fully explore the state space of the immediate future and gen-
erate pilots to explore the space beyond that.
Let be the total delay time. A delayed-hybrid sam-
Fig. 3. Illustration of the delayed-hybrid sampling.
pling method generates pilots, where each takes one of
the possible paths of and makes addi-
tional concurrent MKF steps to time . Then, the For :
probability of selecting is proportional to the sum of the • For each , and for each ,
incremental importance weights of all pilots, starting with perform a one-step update on the corresponding Kalman
. Since we have explored the entire space of , filter , i.e.,
the weight calculation will be based on the target distribution
. Fig. 3 illustrates the delayed-hybrid sampling op-
eration, with , , and . (21)
Specifically, for each possible “future” (relative to time )
symbol sequence at time , i.e., Compute
(22)
(19)
• For every path of , do the following:
Denote For , repeat:
— Denote
(20)
(25)
(29)
Draw a sample according to the above • Do resampling if the effective sample size is below certain
sampling density. If , set threshold, as discussed in Remark 4 (Section III-B).
Note that at each time the above delayed-hybrid MKF
algorithm involves calculating
one-step Kalman filter updates, whereas the full delayed-sample
— Compute the incremental importance weight MKF algorithm requires Kalman updates at each
time.
Similar to the proof of (14), it can be easily shown that the
above algorithm generates, at time , a weighted
(26) sample , which is properly weighted with respect
to but not to .
Remark 6 (Auxilary Weight): Similar to the delay-pilot algo-
• For each , compute
rithm, we also need to resort to the auxiliary weight to make
better inferences on at time . However, it is
slightly more complicated for the delay-hybrid algorithm.
Suppose . Define the concurrent weight
(27)
Using
(28) (30)
where is the sampling probability (28). Then,
is properly weighted with respect to , which is the con-
current posterior distribution.
as the trial sampling distribution, draw a sample . Ap- Let the th path in be , and , where
pend to to obtain . Based on this sample, is the pilot path started from in the
form based on the results from the previous step. th sample stream. Define for
• Compute the importance weight. Suppose and and
; then for .
For every possible value of , define the single path
weight as
(31)
248 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 50, NO. 2, FEBRUARY 2002
.. .. .. .. ..
. . . . .
and
(35)
Hence .. ..
. .
and (46)
(43) Then, run steps of Kalman filter update to obtain
(44)
for some function determined by the structure of the en- For each , compute the corre-
coder. We also assume the information bits are independent. sponding transmitted symbols , and run steps of
Let be the received signals at Kalman filter update to obtain
time , and let be the channel states
corresponding to and . Recall that .
In addition, denote ; ;
(49)
. The Monte Carlo samples recorded at time
are , where — For each , compute the sampling density
contains the mean and covariance matrix of the channel state (50)
vector conditioned on and . That is
Using
(57)
(53)
and the corresponding transmitted symbols
(54)
(59)
• Do resampling if the effective sample size is below certain
threshold. — For each , compute the sampling density
The adaptive decoding algorithms based on the delayed-hy-
brid MKF is as follows.
Algorithm 4 [Delayed-Hybrid Adaptive Decoder]: Initial-
ization: Same as in Algorithm 4.
Suppose that at time , a set of properly
weighted samples with respect to
and the corresponding transmitted signals (60)
are available; then, at time , as the
new data become available, the following steps are Draw a sample according to the above sam-
implemented to update each weighted sample. pling density. If , set
For :
• For each , and for each
, compute the corresponding transmitted sym-
bols . Perform steps update on the corresponding — Compute the increment importance weight
Kalman filter , i.e.,
(55) (61)
and compute
• For each , compute
(56) (62)
WANG et al.: DELAYED-PILOT SAMPLING FOR MIXTURE KALMAN FILTER 251
Using
(63)
(65)
Fig. 5. BER performance of the simple pilot-delayed MKF receiver with Fig. 7. BER performance of the delayed-hybrid MKF receiver with different
different delays in a coded system. delays in a coded system.
Xiaodong Wang (M’98) received the B.S. degree in Rong Chen (M’92) received the B.S. degree in
electrical engineering and applied mathematics (with mathematics from Peking University, Beijing, China,
the highest honor) from Shanghai Jiao Tong Uni- in 1985 and the M.S. and Ph.D. degrees in statistics
versity, Shanghai, China, in 1992, the M.S. degree from Carnegie Mellon University, Pittsburgh, PA, in
in electrical engineering from Purdue University, 1987 and 1990, respectively.
West Lafayette, IN, in 1995, and the Ph.D. degree He is a Professor at the Department of Informa-
in electrical engineering from Princeton University, tion and Decision Sciences, College of Business Ad-
Princeton, NJ, in 1998. ministration, University of Illinois at Chicago (UIC).
From July 1998 to December 2001, he was an Before joining UIC in 1999, he was with the Depart-
Assistant Professor with the Department of Elec- ment of Statistics, Texas A&M University, College
trical Engineering, Texas A&M University, College Station. His main research interests are in time series
Station. In January 2002, he joined the Department of Electrical Engineering, analysis, statistical computing and Monte Carlo methods in dynamic systems,
Columbia University, New York, NY, as an Assistant Professor. His research and statistical applications in engineering and business. He is an Associate Ed-
interests fall in the general areas of computing, signal processing, and commu- itor for the Journal of American Statistical Association, the Journal of Business
nications. He has worked in the areas of digital communications, digital signal and Economic Statistics, Statistica Sinica, and Computational Statistics.
processing, parallel and distributed computing, nanoelectronics, and quantum
computing. His current research interests include multiuser communications
theory and advanced signal processing for wireless communications. He wwas
with AT&T Labs—Research, Red Bank, NJ, during the summer of 1997. Dong Guo received the B.S. degree in geophysics
Dr. Wang is a member of the American Association for the Advancement and computer science (minor) in 1993 from the
of Science. He received the 1999 NSF CAREER Award and the 2001 IEEE Chinese University of Mining and Technology
Communications Society and Information Theory Society Joint Paper Award. (CUMT), Xuzhou, China, and the M.S. degree in
He currently serves as an Associate Editor for the IEEE TRANSACTIONS ON geophysics in 1998 from the Graduate School of
COMMUNICATIONS and for the IEEE TRANSACTIONS ON SIGNAL PROCESSING. Research Institute of Petroleum Exploration and
Development (RIPED), Beijing, China, in 1996. He
received the Ph.D. degree in applied mathematics
from Peking University, Beijing, China, in 1999.
He is pursing a second Ph.D. degree with the
Department of Electrical Engineering, Texas A&M
University, College Station.
His research interests are in the area of statistical signal processing and its
applications, primarily in digital communications.