Hawes Et Al. - 2017 - Bayesian Compressive Sensing Approaches For Direction of Arrival Estimation With Mutual Coupling Effects

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, VOL. 65, NO.
3, MARCH 2017 1357
Bayesian Compressive Sensing Approaches for

Direction of Arrival Estimation With
Mutual Coupling Effects
Matthew Hawes, Lyudmila Mihaylova, François Septier, and Simon Godsill
Abstract— The problem of estimating the dynamic direction Compressive sensing (CS) theory says that when cer-
of arrival (DOA) of far-field signals impinging on a uniform tain conditions are met it is possible to recover sig-
linear array, with mutual coupling effects, is addressed. This nals from fewer measurements than used by traditional
paper proposes two novel approaches able to provide accurate
solutions, including at the endfire regions of the array. First, methods [10], [11]. Hence, CS can be applied to the problem
a Bayesian compressive sensing Kalman filter is developed, which of DOA estimation [12]–[15] by splitting the angular region
accounts for the predicted estimated signals rather than using into N potential DOAs, where only L N of the DOAs
the traditional sparse prior. The posterior probability density have an impinging signal (alternatively N − L of the angular
function of the received source signals and the expression for directions have a zero-valued impinging signal present). These
the related marginal likelihood function are derived theoretically.
Next, a Gibbs sampling-based approach with indicator variables DOAs are then estimated by finding the minimum number of
in the sparsity prior is developed. This allows sparsity to be DOAs with a non-zero valued impinging signal that still give
explicitly enforced in different ways, including when an angle an acceptable estimate of the array output.
is too far from the previous estimate. The proposed approaches The problem can also be converted into a probabilistic
are validated and evaluated over different test scenarios and form and solved via Bayesian CS (BCS) [16], implemented
compared to the traditional relevance vector machine (RVM)-
based method. An improved accuracy in terms of average with a relevance vector machine (RVM) [17]–[19]. Such a
root-mean-square error values is achieved (up to 73.39% for method has been used to solve the problem of static DOA
the modified RVM-based approach and 86.36% for the Gibbs estimation [20], [21], where a belief of having a sparse
sampling-based approach). The proposed approaches prove to received signal is made and the most likely values found.
be particularly useful for DOA estimation when the angle of The Kalman filter (KF) can be used to track dynamic DOAs,
arrival moves into the endfire region of the array.
with the angular range narrowed to focus in more closely on
Index Terms— Bayesian compressive sensing (BCS), dynamic the DOA estimate from the previous iteration [22]. However,
direction of arrival (DOA) estimation, Gibbs sampling, Kalman
filter (KF), relevance vector machine (RVM). this prevents directly working with the measured array signals
and introduces an additional stage of having to reevaluate
I. I NTRODUCTION the steering vector of the array at each iteration of the KF.
Hierarchical KFs have been used to track dynamic sparse
D IRECTION of arrival (DOA) estimation is the process
of determining which direction a signal impinging on an
array has arrived from. Commonly used methods of solving
signals [23], [24], where the predicted mean of the signals
at each iteration is taken as the estimate from the previous
this problem are: MUSIC [1], [2], ESPRIT [3]–[6], and the iteration and the hyperparameters are estimated using BCS,
maximum likelihood DOA estimator [7]–[9]. However, these hence the term BCSKF.
methods have some disadvantages, in particular they require However, a problem remains when a BCSKF is applied to
knowledge of the number of signals present beforehand and dynamic DOA estimation with a uniform linear array (ULA).
evaluation of a covariance matrix of the array output (adding The estimation accuracy can be reduced when the DOA
computational complexity). approaches the endfire region of the array, i.e., when the
impinging signal arrives parallel to or almost parallel to the
Manuscript received November 5, 2016; accepted January 3, 2017. Date of array. This can be particularly problematic when there is a lot
publication February 2, 2017; date of current version March 1, 2017. This
work was supported by the UK Engineering and Physical Sciences Research of noise present.
Council through the project Bayesian Tracking and Reasoning over Time An additional challenge to address when considering the
under Grant EP/K021516/1. DOA estimation problem is that of mutual coupling. One way
M. Hawes and L. Mihaylova are with the Department of Automatic Control
and Systems Engineering, University of Sheffield, Sheffield S1 3JD, U.K. of modeling the mutual coupling effects is to use a mutual
(e-mail: m.hawes@sheffield.ac.uk; l.s.mihaylova@sheffield.ac.uk). coupling matrix [25], [26]. In [25], the mutual coupling matrix
F. Septier is with the IMT Lille, Univ. Lille, CNRS, UMR 9198-CRIStAL- is found using two methods: minimum mean-square matching
Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000
Lille, France (e-mail: francois.septier@telecom-lille.fr). and the mutual impedance method. The method in [26] applies
S. Godsill is with the Engineering Department, University of Cambridge, a symmetric Toeplitz matrix, where only antennas within a set
Cambridge CB2 1TN, U.K. (e-mail: sjg@eng.cam.ac.uk). separation of each other can cause mutual coupling effects.
Color versions of one or more of the figures in this paper are available
online at http://ieeexplore.ieee.org. In this paper, the method in [26] is used to ensure mutual
Digital Object Identifier 10.1109/TAP.2017.2655013 coupling effects are included in the signal model.
This work is licensed under a Creative Commons Attribution 3.0 License. For more information, see http://creativecommons.org/licenses/by/3.0/
1358 IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, VOL. 65, NO. 3, MARCH 2017
The steering vector of the array is given by

a(, θ ) = [1, e− j μ2 cos θ , . . . , e− j μ M cos θ ]T (1)
where = ωTs is the normalized frequency with Ts being the
sampling period, μm = (dm /cTs ) for m = 1, 2, . . . , M, c gives
the wave propagation speed and {·}T denotes the transpose
operation.
The array output, yk , at time snapshot k is then given by
yk = Ast xk + nk (2)
where xk = [x k,1 , x k,2 , . . . , x k,N ]T ∈ C N×1 gives the received
source signals, nk = [n k,1 , n k,2 , . . . , n k,M ]T ∈ C M×1 is a
noise term, given by a zero mean multivariate Gaussian ran-
dom variable and Ast = [a(, θ1), a(, θ2 ), . . . , a(, θ N )] ∈
Fig. 1. Linear array structure being considered, consisting of M antennas C M×N is the matrix containing the steering vectors for each
with a uniform adjacent antenna separation of d. angle of interest. Note, N is the number of points in the
grid of potential DOAs the angular region has been split into.
The contributions of this paper are as follows. However, only L N of these angular directions will have
1) A BCSKF with a modified RVM, where the traditional an impinging signal present.
sparsity prior is replaced with a belief that the estimated In practice there will also be mutual coupling effects
signals will instead match predicted signal values, is pro- present, which alter the pattern of an individual antenna as
posed. The result of this new prior is that a new posterior compared to if it was being used on its own. As a result (2)
distribution and marginal likelihood have been derived. has to be altered to account for this fact. A mutual coupling
Initial results for this method using a signal model matrix is used to achieve this [26], by giving the true steering
without mutual coupling have been reported in [27]. vector matrix as
2) A Gibbs sampling approach is proposed. In this
approach zero valued signals can be explicitly enforced A = M MC Ast . (3)
when there is too large a change in DOA in order to Here M MC ∈ C M×M is the mutual coupling matrix given by
alleviate the estimation accuracy problem for the endfire ⎛ ⎞
region of the array. 1 m2 ... m D−1 . . . mM
⎜ .. .. ⎟
3) A comprehensive performance evaluation is provided, ⎜ m2 1 m2 ... . . ⎟
⎜ ⎟
with the proposed methods being compared to a ⎜ .. ⎟
⎜ . m2 1 m2 . . . m D−1 ⎟
BCSKF using the traditional RVM approach. Significant M MC = ⎜
⎜ .. ⎟
⎟.
⎜ m D−1 . . . .. .. ..
improvements in terms of the average root-mean-square
⎜ . . . . ⎟⎟
error (RMSE) values are observed (up to 73.39% for the ⎜ . .. ⎟
⎝ .. . ... m2 1 m2 ⎠
BCSKF with modified RVM and up to 86.36% for the
Gibbs sampling approach). mM ... m D−1 ... m2 1
The remainder of this paper is structured in the following (4)
manner. Section II gives details of the proposed estimation
In (4), the mutual coupling coefficients are given by
methods, including the array model with mutual coupling
m i = ρi exp{ j φi } for i = 2, . . . , D − 1, D, . . . , M, where
effects (Section II-A), the modified RVM framework for
ρi and φi give the amplitude and phase, respectively. The
BCS (Section II-B), the BCSKF (Section II-C), and the
variable D places a limit on the separation between antennas
Gibbs sampling implementation (Section II-D). In Section III,
above which there will be no mutual coupling effects. In other
an evaluation of the effectiveness of the proposed approaches
words when i > D, then ρi = 0. This then gives the following:
is presented and conclusions are drawn in Section IV.
yk = M MC Ast xk + nk
II. P ROPOSED E STIMATION M ETHODS = Axk + nk . (5)
A. Array Model Equation (5) can then be split into real and imaginary
A narrowband ULA structure consisting of M omnidirec- components (given by R(·) and I(·), respectively) as follows:
tional antennas, with identical responses is shown in Fig. 1. ỹk = Ãx̃k + ñk
Here, a plane-wave signal mode is assumed, i.e., the signal
R(yk ) R(A) −I(A) R(xk ) R(nk )
impinges upon the array from the far-field and the angle of = + . (6)
I(yk ) I(A) R(A) I(xk ) I(nk )
arrival is limited to 0° ≤ θ ≤ 180°. The distance from the
first antenna to subsequent antennas is denoted as dm for The difference between yk and ỹk is that yk has been split
m = 1, 2, . . . , M, with d1 = 0, i.e., the distance from the into its real and imaginary components in ỹk . As a result the
first antenna to itself. Note, these values are multiples of a dimensions of ỹk are increased. A similar relationship exists
uniform adjacent antenna separation of d. between A and Ã, xk and x̃k and nk and ñk .
HAWES et al.: BCS APPROACHES FOR DOA ESTIMATION WITH MUTUAL COUPLING EFFECTS 1359
B. Modified Relevance Vector Machine for DOA Estimation where the covariance matrix and the mean are given by

T −1
The aim is to now find a solution for x̃k which gives the = σk−2 Ã Ã + Pk (14)
closest possible match to a predicted set of signal values.
To achieve this one can follow a modified RVM frame- and

T
work [27], by evaluating the following: μ = σk−2 Ã ỹk + Pk x̃ p (15)

respectively. Note, the maximum of (13) is the posterior
x̃k,opt = max P x̃k , σk2 , pk |ỹk , x̃ p (7) mean μ. For a derivation of (13), see Appendix A.
where σk2 is the variance of the Gaussian noise nk , Similar to [17], the probability P(σk2 , pk |ỹk , x̃ p ) can be
pk = [ pk,1 , pk,2 , . . . , pk,2N ]T contains the hyperparameters represented in the following form:
that are to be estimated and x̃ p = [R(x p )T , I(x p )T ]T =

P σk2 , pk |ỹk , x̃ p ≈ P ỹk |σ 2 , p, x̃ p P σk2 P(pk )P(x̃ p ) (16)
[R(x p,1 ), . . . , R(x p,N ), I(x p,1 ), . . . , I(x p,N )]T holds the
predicted values of x̃k . where P(x̃ p ) is constant as fixed values are used and the
From (6), it is possible to find second two terms on the right hand side of (16) are con-
stant if β1 = β2 = β3 = β4 = 1 × 10−4 as in [17].
2 −M 1
2
P(ỹk |x̃k , σk ) = (2πσk ) 2
exp − 2 ||ỹk − Ãx̃k ||2 . (8) Therefore, maximizing P(σk2 , pk |ỹk , x̃ p ) is roughly equivalent
2σ to maximizing P(ỹk |σk2 , pk , x̃ p ). This can be achieved by a
The traditional RVM would now apply a belief that x̃k is type 2 maximization of its logarithm, which is given by (see
sparse. However, here this is changed to a belief that x̃k will Appendix B)
match the predicted signals x̃ p

L σk2 , pk

P(x̃k |pk , x̃ p ) = (2π)−N |Pk | 2
1
−M 1 1
= log 2πσk2 || 2 |Pk | 2
1
T
× exp − (x̃k − x̃ p ) Pk (x̃k − x̃ p ) . (9) 1 T T 2 T
2 × exp − × (ỹk Bỹk + x̃ p Cx̃ p − 2σk ỹk ÃPk x̃ p )
1
2
Note, when x̃ p = [0, 0, . . . , 0]T then (9) reverts to the = − 2M log(2π) + 2M log σk2 − log || − log |Pk |
2
hierarchical prior used in the traditional RVM [16], [17] and + σk−2 ||ỹk − Ãμ||22 + μT Pk μ + x̃Tp Pk x̃ p − x̃Tp Pk μ
|Pk | indicates the determinant of Pk , where Pk = diag(pk ). (17)
It is also necessary to define the hyperparameters over T
pk and σk2 . There are various possibilities for the structuring where B = (σk2 I + ÃP−1
k Ã )
−1 and C = P − PT P .
k k k
This is now differentiated with respect to pk,n and σk−2 to
of the priors on pk , which represent mixing parameters in
obtain the update expressions
a scale mixture of normal representation of the marginal γn
distribution of xk , which will here be in the Student-t family
new
pk,n = 2 (18)
μn + x̃ p,n − x̃ p,n μn
2
(see [28]). One possibility would be to treat the complex
components of xk as complex Student-t distributed, as detailed where γn = 1 − pk,n nn , nn is the nth diagonal element
in [29] and [30]. However, this paper treats the real and of and
imaginary components of xk as independent Student-t dis- ||ỹk − Ãμ||22
σk,new
2
= . (19)
tributed random variables, and hence there are independent 2M − n γn
Gamma priors for the mixing variables pk,n over all real and For the derivation of (18) and (19), see Appendix C.
imaginary components of xk The maximization is then achieved by iteratively finding

2N and μ, followed by pk,nnew for n = 1, . . . , N and σ 2
k,new until a
P(pk ) = G( pk,n |β1 , β2 ). (10) convergence criterion is met [16], [17]. In other words, the new
n=1 estimates for the noise variance and precision hyperparameters
found from (19) and (18) are then used in (14) and (15) to
A Gamma prior can also be used for σk2

find new estimates of the covariance matrix and mean of the
P(σk2 ) = G σk−2 |β3 , β4 (11) distribution in (13). Note that when x̃ p = [0, 0, . . . , 0]T the
update expressions match those used by the traditional RVM.
where β1 , β2 , β3 , and β4 are scale and shape priors. The final estimate of the received signals is then given by
It is known that T −1 T

Ã Ã Ã ỹk
P x̃k , σk2 , pk |ỹk , x̃ p x̃k,opt = + Pk,opt + Pk,opt x̃ p (20)

σk,opt
2 σk,opt
2
= P x̃k |ỹk , σk2 , pk , x̃ p P σk2 , pk |ỹk , x̃ p (12)
where σk,opt
2 and Pk,opt =
and diag( pk,opt,1 , pk,opt,2 , . . . , pk,opt,2N ) are the result of

P x̃k |ỹk , σk2 , pk , x̃ p optimizing the noise variance estimate and hyperparameters,

respectively. Now x̃k,opt can be used to reconstruct the
P ỹk |x̃k , σk2 P x̃k |pk , x̃ p
=
estimated signals as
P ỹk |σk2 , pk , x̃ p
x k,opt,n = x̃ k,opt,n + j x̃ k,opt,N+n (21)
1
= (2π)−N ||−1/2 exp − (x̃k − μ)T −1 (x̃k − μ) (13)
2 where n = 1, 2, . . . , N.
The thresholding scheme in [20] can then be applied to keep subsequent estimates are likely to not be sparse. As a result,
the L̃ most significant signals. To do this find the total energy care should be taken when choosing the initial parameter
content of the estimated received signals and then sort them. values and determining the likely DOA change.
A threshold value, η, is then defined as a percentage of the
energy content that is to be retained. Starting with the most
D. Gibbs Sampling for DOA Estimation
significant estimated signal, the estimated signals are summed
until the threshold is reached and the remaining signals are The method described in the previous sections based on a
then set to be equal to 0. The remaining nonzero valued signals BCSKF with a modified RVM required the use of prior knowl-
then give the DOA estimates and L̃ is an estimate of the edge of the predicted change in DOA. However, in practice,
number of far-field signals impinging on the array. this may not always be known, making it important to have
an alternative method that can still give improved accuracy for
C. Bayesian Compressive Sensing Kalman Filter the endfire region.
This paper proposes using a sparsity prior which is given
In order to track the changes in the DOA estimates at as a combination of a point mass concentrated at zero
each time snapshot the modified RVM-based DOA estimation (Dirac delta function) and a zero mean Gaussian distribu-
procedure detailed above is combined with a Bayesian KF, tion [31]–[33], giving
giving a BCSKF for DOA estimation [27]. The signal model
described above is again used along with the prediction
2N
P(x̃k |pk , z̃k ) = (1 − z̃ k,n )δ0 + z̃ k,n N (x̃ k,n |0, pk,n ) (25)
x̃k|k−1 = x̃k−1|k−1 + x k|k−1 = k−1 + P−1
k n=1
ỹk|k−1 = Ãx̃k|k−1 ỹe,k = ỹk − ỹk|k−1 (22)
where z̃k = [zkT , zkT ]T and zk = [z k,1 , z k,2 , . . . , z k,N ]T .
and update steps Note, z̃ k,n is the indicator variable for x̃ k,n and determines
which of the two components in (25) is selected. When
x̃k = x̃k|k−1 + Kk ỹe,k k|k = (I − Kk Ã) k|k−1 z̃ k,n = 0, the value of x̃ k,n is determined solely by the point
T
T −1
Kk = k|k−1 Ã σk2 I + Ã k|k−1 Ã (23) mass concentrated at zero. As a result, x̃ k,n = 0 and sparsity is
explicitly introduced. Alternatively, when z̃ k,n = 1 the value
of the BCSKF. Here, k|k − 1 indicates prediction at time of x̃ k,n is determined by the Gaussian distribution allowing
instance k given the previous measurements and x is deter- a nonzero valued estimate. The repetition of zk in z̃k means
mined by the assumed DOA change. Note, x is fixed by the that the same indicator variable is used for both the real and
predetermined constant motion rather than being a random imaginary parts of each entry in xk .
noise term. For example, if the angular range is sampled This indicator value can also be used to address the endfire
every 1° and the DOA is assumed to increase by 2° then accuracy problem by selecting the value of z k,n = 0 if
x will be selected to increase the index of the nonzero valued |n − i | > j . Here i is the index of the closet nonzero
entries in x̃k−1|k−1 by two to give the index of the nonzero valued estimate from the previous time snapshot and j defines
valued entries in x̃k|k−1 . a maximum allowed change in the DOA estimate. Only
At each time snapshot it is necessary to estimate the n = 1, 2, . . . , N is considered to get the entries for zk , with z̃k
noise variance and hyperparameters in order to evaluate the then being found as previously stated.
prediction and update steps of the BCSKF. This is done by This leaves the following:
considering the log likelihood function given by

1
z 1 if |n − i | ≤ j
L σk2 , pk = − 2M log(2π) + 2M log σk2 − log || z k,n = k,n (26)
2
2
z k,n if |n − i | > j
− log |Pk | + σk−2 ||ỹe,k − Ãμ||22 + μT Pk μ 1
T T where z k,n and z n2 are defined by the following Beta
+ x̃k|k−1 Pk x̃k|k−1 − x̃k|k−1 Pμ (24) distributions:
which can be optimized by following the procedure described
1
1
z k,n = B z k,n |β51 , β61
in Section II-B. In other words, we apply the modified
2
RVM framework to ỹe,k , using the KF prediction x̃k|k−1 as
2
z k,n = B z k,n |β52 , β62 . (27)
the expected estimate values x̃ p .
In order to enforce zero-valued estimates when |n − i | > j ,
It is worth noting that the continued accuracy of the pro-
it is necessary to select β52 and β62 to ensure a zero-valued
posed BCSKF relies on the accuracy of the initial estimate and
z k,n is preferred. However, when |n − i | ≤ j it is necessary to
the parameter values selected. If the initial estimate (made
choose β51 and β61 so that the chances of z k,n = 0 and z k,n = 1
using the framework described in Section II-B and x̃ p =
are equal. This can be achieved by
[0, 0, . . . , 0]T ) of the received signals is accurate and sparse,

1
1
then the priors that are enforced will ensure this continues P z k,n |β5 , β6 = B z k,n |β5 , β6
to be the case. However, an inaccurate initial DOA esti-

2 1 1
mate or poorly matched expected DOA change can lead to the P z k,n |β5 , β6 = B z k,n
2
|β5 − , β6 + (28)
j j
introduction of inaccuracies in subsequent estimates. Similarly,
if the initial estimate of the received signals is not sparse then where β5 = β6 = 1.
The posterior distribution of x̃k can be written as [33] The proposed Gibbs sampling-based method can then be used

at the subsequent time snapshots to get the next DOA estimate.
P x̃k |ỹk , σk2 , pk , z̃k
2N III. P ERFORMANCE E VALUATION

∝ [(1 − z̃ k,n )δ0 + z̃ k,n × N (x̃ k,n |0, pk,n )] In this section, a comparison in performance of the proposed
n=1 methods and the traditional RVM-based BCSKF method will

×N ỹk |Ãx̃k , σk−2 . (29) be made over five example scenarios, under the same test
conditions. First, an example is considered where the initial
Now also define Ãn as being the entries in Ã relating to DOA starts outside of the endfire region and then moves into it.
the index n and Ã−n are the entries of Ã excluding the Second, an example is given where the DOA remains out of the
entries relating to index n (and similarly for x̃k ). Then as per endfire region. In the third scenario, the initial DOAs and the
Appendix D this gives signal values are randomly generated. Then the evaluation will

also consider the scenario where there is a mismatch between
P x̃ k,n |ỹk , x̃k,−n , σk2 , pk , z̃k = (1 − ẑ k,n )δ0 + ẑ k,n
the actual and assumed change in DOA. Finally, the evaluation
×N (x̃ k,n |μ̂k,n , p̂k,n ) (30) will consider a random change in DOA at each time snapshot.
p̂k,n = pk,n + p0 Ãn Ãn
T
(31) This means that x, which is selected for the modified
−1 T RVM-based BCSKF, will not be a true reflection of how the
μ̂k,n = p̂k,n p0 Ãn ỹk,n (32) DOA actually changes for the last two examples.
ẑ k,n z̃ k,n Note, the term traditional RVM-based BCSKF method
=
1 − ẑ k,n 1 − z̃ k,n means the entries of Pk in the prediction step of the BCSKF
N (0|0, pk,n ) are found using the RVM optimization method as detailed
× (33) in [16] and [17]. In other words, this is the method detailed in
N (0|μ̃k,n , p̂k,n )
Section II-B with x̃ p = 0. All of the examples are implemented
where p0 = 1/σk2 and ỹk,n = ỹk − Ã−n x̃k,−n . in MATLAB on a computer with an Intel Xeon CPU E3-1271
There are two further posterior distributions that have to be (3.60 GHz) and 16 GB of RAM.
considered. That is the distributions for pk,n and p0 which are The performance of each method will be measured using the
given by root mean square error (RMSE) in the DOA estimate. This is

given by
P( pk,n |xk−1 ) = G β1 + ||xk−1,n j ||0 , β2 + ||xk−1,n j ||22 (34)
Q L̃

q=1 l=1 |θl − θ̂l |
2
and
RMSE = (36)

1 Q L̃
P( p0 |ỹk , x̃k ) = G β3 + M, β4 + ||ỹk − Ãx̃k ||22 (35)
2
where θl is the actual DOA, θ̂l is the estimated DOA, and
respectively. Note, in (34), xk−1,n j gives the entries within Q is the number of Monte Carlo simulations carried out, with
xk−1 that have an index within the distance j of index n. Q = 100 being used in each case. This gives a measure of the
By using x rather than x̃ to find xk−1,n j it guarantees the estimation accuracy and the computation time will be used as
same value of ||xk−1,n j ||22 and ||xk−1,n j ||0 for both the real a measure of the complexity of each method.
and imaginary components. For the Gibbs sampling method a burn-in period
As a result the Gibbs sampling steps are detailed as follows. of 250 iterations is used and then 50 further iterations used to
1) Sample x̃ k,n from P(x̃ k,n |ỹk , x̃k,−n , σk2 , pk , z̃k ). find the final estimate of the received array signals. When a
2) Sample pk,n from P( pk,n |xk−1 ). distance of j = 5° is exceeded a zero-valued estimate of the
3) if n ≤ N then Sample z k,n 1 from P(z k,n 1 |β , β ),
5 6 received signals is enforced in order to alleviate the endfire
else z k,n = z k,n−N .
1 1
accuracy problem.
4) if n ≤ N then Sample z k,n 2 from P(z k,n 2 |β , β ),
5 6 For all the design examples considered the selection of
2 = z2
else z k,n σk2 = 0.4 as the noise variance is used, with an initial estimate
k,n−N .
5) Sample p0 from P( p0 |ỹk , x̃k ). of the noise variance given by σk,0
2 = 0.1. The array geometry
These steps are done for each of the T iterations of the being used is that of a ULA with M = 20 antennas and an
Gibbs sampler, where the first TB I iterations are the burn- adjacent antenna separation of λ/2, where λ is the wavelength
in iterations. The final estimate of the received array signals of the signal of interest. This gives an array aperture of 9.5λ.
is then given by the mean values of the final T − TB I For the mutual coupling matrix a value of D = 3 is selected,
iterations [32]. The DOA estimate can then be found using meaning that a separation of 1.5λ or greater gives negligible
the previously described thresholding scheme (see II-B), with mutual coupling effects. The values ρ1 = 0.65, ρ2 = 0.25,
the remaining non-zero valued estimates corresponding to the φ1 = π/7, and φ2 = π/10 are then also used. Finally, in each
DOA estimates. example a single narrowband signal impinging on the array is
Note, the performance of this method will again heavily considered, meaning L = 1.
depend on the accuracy of the first estimate. As a result,
it is possible to use the traditional BCS DOA estimation A. Endfire Region
method (Section II-B with x̃ p = [0, 0, . . . , 0]T ) to ensure an as For this example, the initial DOA of the signal is θ = 20°,
accurate as possible initial estimate at the first time snapshot. which then decreases by 1° at each time snapshot. The signal
TABLE I TABLE II
P ERFORMANCE S UMMARY FOR THE E NDFIRE R EGION E XAMPLE P ERFORMANCE S UMMARY FOR THE E NDFIRE R EGION E XAMPLE
W ITH R EDUCED A DJACENT A NTENNA S EPARATIONS
Fig. 2. RMSE values for the endfire region example.

Fig. 3. RMSE values for the endfire region example with reduced antenna
separation.
value at each snapshot is set to be 1. Table I summarizes the
performance of the three methods for this example, with the in accuracy (at the cost of computation time) using the Gibbs
RMSE values at each time snapshot being shown in Fig. 2. sampling-based method.
Here, it can be seen that there has been a significant In the previous simulation, an adjacent antenna separation of
decrease in the average RMSE values for both the modified λ/2 is used as it is known that this is the largest separation that
RVM method (67.51% improvement) and the Gibbs sampling- can be used while still avoiding a degraded performance due to
based method (80.78% improvement). Overall this suggests the introduction of grating globes [34]. However, an example
that a more accurate estimate of the DOA is possible. It is of what the relative performance of the methods is when a
worth noting that there has still been an increase in the smaller adjacent antenna separation will now be considered.
RMSE for the modified RVM-based approach in the endfire In this instance, an adjacent antenna separation of λ/4 is
region of the angular range. However, this has come much later selected. As the array aperture is kept constant (to allow a fair
on the than for the traditional RVM-based approach (indicating comparison between adjacent antenna separation sizes) this
a degradation in performance for a smaller angular range) and means the number of antennas is given by M = 39. This also
the maximum RMSE value reached is lower (indicating the means a value of D = 9 is required to keep the same distance
degradation is less severe). limits on mutual coupling occurring. The values of ρi and
These improvements have come at the cost of an increased φi are then selected to be uniformly spread over the range
computation time in both instances. For the modified of 0.65 to 0.25 and π/7 to π/10, respectively. The remaining
RVM method this increase is insignificant as the average parameters are kept constant and the same test scenario as for
computation time is still less than one second. The increase the previous example is used.
for the Gibbs sampling-based method is larger, illustrating an The performances of each of the methods in this instance are
increase in computational complexity. However, it is worth summarized in Table II and Fig. 3, respectively. Here, it can
remembering that this increase has resulted in a more accurate be seen that the larger number of antennas used has resulted
DOA estimate being achieved without prior knowledge about in a lower average RMSE values for all three of the methods.
what the change in DOA will be. In this instance only the modified RVM method has performed
In this instance, the results suggest that one of the better that the traditional RVM-based method when comparing
two proposed methods should be used when the estimated average RMSE values (decrease in average RMSE of 61.09%).
DOA approaches the endfire region of the array. If the change However, by looking at the maximum RMSE values it can
in DOA is known in advance and computational complexity be seen that the largest estimation error possible with the
is a primary concern then the modified RVM-based method is traditional RVM-based method is larger than that for the Gibbs
the most suitable (a more accurate estimate can be achieved sampling-based method (16.42° as compared to 3.71°).
without a large increase in computation time). However, when It is worth noting that such an array configuration is unlikely
this information is not available, or computational complexity to be used in practice. This is due to the costs associated with
is not a concern, it is possible to get a significant improvement the number of antennas required. As a result, the remaining
TABLE III TABLE IV

P ERFORMANCE S UMMARY FOR THE N ONENDFIRE R EGION E XAMPLE P ERFORMANCE S UMMARY FOR THE R ANDOM I NITIAL DOA E XAMPLE
Fig. 4. RMSE values for the nonendfire region example. Fig. 5. RMSE values for the random initial DOA example.
examples will stick to the adjacent antenna separation of λ/2 the modified RVM-based method has offered improvements
and associated parameters previously defined. in terms of RMSE(73.39%), without a significant increase in
computation time. The Gibbs sampling-based method has also
B. Nonendfire Region given an estimation accuracy improvement (76.12%) and has
For this example the initial DOA is θ = 100° with the even outperformed the modified RVM-based method, without
DOA increasing by 1° at each time snapshot, with the signal prior knowledge of how the DOA was going to change. How-
value remaining constant at −1. The performance of the three ever, this has come at the expense of an increased computation
methods is summarized in Table III, with the RMSE values time.
illustrated in Fig. 4.
In this instance it can be seen that there has not been as D. Mismatched Actual and Assumed DOA Change
large an increase in RMSE for the traditional RVM method, This section compares the performances of the estimation
as the DOA does not enter the endfire region. However, both methods for two situations where the actual change in DOA is
the modified RVM and Gibbs sampling-based methods have not known. First, consider the case where there is an ini-
managed to achieve improvements in average RMSE values tial DOA of θ = 100° which increases by 1° for 9 time
of 36.42% and 50.00%, respectively. For the Gibbs sampling- snapshots before decreasing by 1° for the remaining time
based method this comes at the expenses of an increase snapshots. In this instance, assume a constant signal value
in computation time, whereas the time for the modified of 1 throughout.
RVM-based method is comparable to the traditional The performance comparison is now made between the tra-
RVM-based method. As with the previous test scenario this ditional RVM-based method, the modified RVM-based method
would suggest that the modified RVM-based method should with the assumed DOA change set to a constant increase
be used when the expected DOA change information is avail- of 1°, the modified RVM-based method with the assumed
able and the Gibbs sampling-based method when this is not DOA change set to a constant decrease of 1° and the Gibbs
the case, or when computational complexity is not a major sampling-based method. The performances for each are sum-
concern. marized in Fig. 6 and Table V, respectively.
In this instance, the average RMSE values suggest a com-
C. Random Initial DOA parable performance in terms of estimation accuracy between
Next consider the case where the initial DOA is randomly the traditional RVM-based methods and the two modified
chosen from the entire angular range and increased by 1° at RVM-based examples. This can be explained by the fact that
each time snapshot. The signal value is randomly selected for both of the modified RVM-based examples, the assumed
as ±1 for each simulation and remains constant as the DOA change does not match the actual DOA changes for
DOA changes. the entire time range which means the same improvements
Table IV and Fig. 5 summarize the performance of the as for the previous scenario can no longer be guaranteed.
various methods in this instance. Again, it can be seen that Fig. 6 highlights this in the results for the two modified
Fig. 6. RMSE values for the increasing DOA followed by decreasing

DOA example.
Fig. 7. RMSE values for the increasing DOA with an assumed decrease
TABLE V in DOA of 3° example.
P ERFORMANCE S UMMARY FOR THE I NCREASING DOA
F OLLOWED BY D ECREASING DOA E XAMPLE
TABLE VII
P ERFORMANCE S UMMARY FOR THE R ANDOM C HANGES IN DOA W ITH
AN A SSUMED I NCREASE IN DOA OF 3° E XAMPLE
TABLE VI
Here, it can be seen that the Gibbs sampling-based
P ERFORMANCE S UMMARY FOR THE I NCREASING DOA W ITH
AN A SSUMED D ECREASE IN DOA OF 3° E XAMPLE method has offered an 86.36% improvement in average
RMSE compared to the traditional RVM-based method. There
has again been a significant increase in the computational
complexity.
For the modified RVM-based method the average RMSE
values suggests that there has been an improvement in esti-
mation accuracy. However, this is smaller than when the actual
and assumed DOA changes match. It is also unlikely that this
improvement would be obtained in every scenario the method
RVM examples. It demonstrates that with an assumed increas- could be applied to. From looking at Fig. 7, we can see that
ing DOA the modified RVM offers some initial improvements, the traditional and modified RVM-based methods are showing
while the performance is significantly degraded when the comparable performance for the just over half of the time
DOA starts to decrease again. On the other hand, the exam- frame considered. This is the relative performance that would
ple with an assumed decreasing DOA performs worse than be expected in the majority of cases.
the traditional RVM-based method initially and then offers
significant improvements when the actual DOA also starts to
decrease. E. Random Changes in Direction of Arrival
It can also be seen that for the Gibbs sampling-based method Finally, consider the example where the initial signal value
there has been an improvement in DOA estimation accuracy. is assumed to be equal to 1 and the initial DOA is chosen
In terms of average RMSE values this is a decrease of 68.60%, to be 100°. The actual DOA is then allowed to randomly
which has been achieved without any knowledge of how the change by up to ±3° for each time snapshot. For the modified
DOA was going to change. However, there is again an increase RVM method assume that the actual DOA change is
in the computation time. an increase of 3°. This gives the results as summarized
To illustrate how a larger mismatch between actual and in Table VII and Fig. 8.
assumed DOA changes effects the performance of the modified It can be seen that the Gibbs sampling-based method has
RVM-based method now consider an example where the actual again outperformed the modified RVM-based method in terms
DOA is increasing by 1° in each snapshot, while the assumed of estimation accuracy, due to the fact that no prior knowledge
change is a decrease of 3°. Here, the initial DOA is 100°, of how the DOA will change is required. As compared to the
with a constant signal value of −1. The RMSE values for traditional RVM-based method there has been an improvement
the methods are shown in Fig. 7 and summarized in Table VI in RMSE of 78.81%. However, as is expected this is at the
along with the computation times. cost of computation time.
Now following the method suggested in [17] carry out the

multiplication on the right-hand side of (37), collect terms
in x̃k in the exponential and complete the square
1 −2
− σk (ỹk − Ãx̃k )T (ỹk − Ãx̃k ) + (x̃k − x̃ p )T Pk (x̃k − x̃ p )
2
1 T
= − σ −2 ỹkT ỹk − σk−2 ỹkT Ãx̃k − σ −2 x̃kT Ã ỹk
2
T
+ σk−2 x̃kT Ã Ãx̃k + x̃kT Pk x̃k − x̃kT Pk x̃ p

− x̃Tp Pk x̃k + x̃Tp Pk x̃ p
1
= − (x̃k − μ)T −1 (x̃k − μ) − μT −1 μ
2
+ σ −2 ỹkT ỹk + x̃Tp Pk x̃ p (38)
Fig. 8. RMSE values for the random DOA change example.
where and μ are given by (14) and (15), respectively. This
then gives the posterior distribution as (13), with the remaining
It is also worth noting that the average RMSE values suggest exponential terms
that the modified RVM and traditional RVM have offered 1 −2 T
a comparable performance. This is due to the fact that the − σk ỹk ỹk + x̃Tp Pk x̃ p − μT −1 μ . (39)
assumption of how the DOA will change is not valid, mean- 2
ing the modified RVM no longer offers any improvements.
Therefore, in this situation, the Gibbs sampling-based method B. Derivation of Marginal Likelihood
would be the best to use, assuming computational complexity From (37), the following is known:
is not the main motivating factor.

P ỹk |x̃k , σk2 , P(x̃k |pk , x̃ p )
P ỹk |σk2 , pk , x̃ p =
(40)
IV. C ONCLUSION P x̃k |ỹk , σk2 , pk , x̃ p
This paper has proposed two novel approaches for the
estimation of a dynamic DOA using ULAs with mutual meaning the term in the exponential will be (39) where
coupling. The first approach is a Bayesian compressed KF with
T T
μT −1 μ = σk−2 Ã ỹk + Pk x̃ p T −1
a modified RVM, where the traditional sparsity assumption

T
is replaced by an assumption that the estimated signals will × σk−2 Ã ỹk + Pk x̃ p
instead match predicted signal values. This results in the
T T
T
= σk−2 Ã ỹk + Pk x̃ p σk−2 Ã ỹk + Pk x̃ p
derivation of a new posterior probability density function
T
of the received signals and the expression for the related = σk−4 ỹkT Ã Ã ỹk + σ −2 ỹkT ÃPk x̃ p
marginal likelihood function. The second proposed approach T
+σk−2 x̃Tp PkT Ã ỹk + x̃Tp PkT Pk x̃ p . (41)
is a Gibbs sampling approach, where sparsity is explicitly
enforced if there is a large difference between the previous Therefore the exponential term is given by
DOA estimate and the angle currently being considered. The
proposed approaches will be particularly useful when applied 1 −2 T T
− σk ỹk ỹk + x̃Tp Pk x̃ p − σk−4 ỹkT Ã Ã ỹk
to the problem of dynamic DOA estimation in the endfire 2
T
region of antenna arrays. Such problems can arise in numerous − σk−2 ỹkT ÃPk x̃ p − σk−2 x̃Tp PkT Ã ỹk − x̃Tp PkT Pk x̃ p
application areas such as in communications and surveillance.
1 T
An extensive performance evaluation is provided and shows = − ỹkT σk−2 − σk−4 Ã Ã ỹk + x̃Tp Pk − PkT Pk x̃ p
that both of the proposed approaches outperform the traditional 2
T
RVM-based BCS KF in terms of mean RMSE values, by up − σk−2 ỹkT ÃPk x̃ p − σk−2 x̃Tp PkT Ã ỹk . (42)
to 73.39% for the modified RVM-based method and 86.36%
for the Gibbs sampling-based method. The term outside of the exponential is given by

−M
2πσk2 (2π)−N |Pk |1/2
−M 1 1
A PPENDIX 1
= 2πσk2 || 2 |Pk | 2 . (43)
−
A. Derivation of Posterior Distribution (2π)−N || 2
Bayes’ rule gives This gives the marginal likelihood as

P x̃k |ỹk , σk2 , pk , x̃ p P ỹk |σk2 , pk , x̃ p P ỹk |σk2 , pk , x̃ p

−M
= P ỹk |x̃k , σk2 P(x̃k |pk , x̃ p ) (37) = 2πσk2
1 1
|| 2 |Pk | 2

where P(ỹk |x̃k , σk2 )
and P(x̃k |pk , x̃ p ) are known from (8) 1
× exp − ỹkT Bỹk + x̃Tp Cx̃ p − 2σk2 ỹkT ÃPk x̃ p (44)
and (9), respectively. 2
where B and C are defined as in Section II-B. The log Now differentiate (52) with respect to τ and equate to zero to
likelihood is then given by give

L σk2 , pk 2M T
− + tr( Ã Ã) + ||ỹk − Ãμ||22 = 0 (53)

−M 1 1 τ
= log 2πσk2 || 2 |Pk | 2
T
where tr(·)
1 T indicates the trace. As tr( Ã Ã) can be written
T 2 T
× exp − ỹk Bỹk + x̃ p Cx̃ p − 2σk ỹk ÃPk x̃ p as τ −1 n γn giving
2

1 1
= −M log(2π) − M log σk2 + log || log |Pk | τ −1
2M − γn = ||ỹk − Ãμ||22 (54)
2 2
1 T T 2 T
n
+ − ỹk Bỹk + x̃ p Cx̃ p − 2σk ỹk ÃPk x̃ p . (45)
2 which in turn gives (19).
Using the Woodbury matrix inversion identity gives

T −1 T
B = σk−2 I − σk−2 Ã Pk + σk−2 Ã Ã Ã σk−2 (46) D. Derivation of (30)–(33)
which means From (29), it is known that

ỹkT Bỹk = ỹkT σk−2 ỹk − ỹkT σk−2 I − σk−2 Ã P x̃ k,n | ỹk,n , σk2 , pk,n , z̃ k,n ∝ (1 − z̃ k,n )δ0

T −1 T
× P + σk−2 Ã Ã Ã σk−2 ỹk ×N (ỹk,n |Ãn x̃ k,n , σk2 )
T
= ỹkT σk−2 ỹk − ỹkT σk−2 Ã Ã σk−2 ỹk + z̃ k,n N (x̃ k,n |0, pk,n )N (ỹk,n |Ãn x̃ k,n , σk2 ). (55)
= σk−2 ỹkT (ỹk − Ãμ) + σk−2 ỹkT ÃPk x̃ p If we then combine the exponential terms in the second term
2
= σk−2 ỹkT − Ãμ2 + μT Pk μ + σ −2 ỹkT ÃPk x̃ p . in (55) we get
(47) 1 T
− x̃ k,n pk,n x̃ k,n + p0 (ỹk,n − Ãn x̃ k,n )T × (ỹk,n − Ãn x̃ k,n )
Also, we know that PkT = Pk as Pk is a real-valued diagonal 2
matrix. This means 1 T T T
= − x̃ k,n pk,n x̃ k,n + p0 (ỹk,n ỹk,n − ỹk,n Ãn x̃ k,n
2
x̃Tp Cx̃ p = x̃Tp [Pk − Pk Pk ]x̃ p T T
− x̃ k,n
T
Ãn ỹk,n + x̃ k,n
T
Ãn Ãn x̃ k,n )
= x̃Tp Pk x̃ p − x̃Tp Pk Pk x̃ p
1 T T T
= x̃Tp Pk x̃ p − x̃Tp Pk μ + σk−2 ỹkT ÃPk x̃ p (48) =− x̃ [ pk,n + p0 Ãn Ãn ]x̃ k,n + p0 ỹk,n ỹk,n
2 k,n
T T
which then gives the log likelihood function in (17). − p0 ỹk,n Ãn x̃ k,n − p0 x̃ k,n
T
Ãn ỹk,n . (56)
C. Derivation of Update Expressions for Modified RVM Completing the square gives
First, differentiating (17) with respect to pk,n gives 1 T

− (x̃ k,n − μ̂n )T p̂k,n (x̃ k,n − μ̂n ) − μ̂n p̂k,n μ̂n + p0 ỹk,n ỹk,n
1 1 2
2 2
− nn − + μn + x̃ e,n − x̃ e,n μn (49) (57)
2 pk,n
and equating to zero gives where p̂k,n and μ̂k,n are given by (31) and (32),
respectively.
1 In order to complete the expression given in (30), it is now
nn − + μ2n + x̃ e,n
2
− x̃ e,n μn = 0
pk,n necessary to get a new indicator variable, ẑ k,n for the new
1 − pk,n nn − pk,n μ2n − pk,n x̃ e,n
2
+ pk,n x̃ e,n μn = 0 posterior distribution for x̃ k,n . To do this, assume that
γn − pk,n [μ2n + x̃ e,n
2
− x̃ e,n μn ] = 0 (50) z̃ k,n ẑ k,n
N (0|0, pk,n ) = N (0|μ̂n , p̂k,n ). (58)
which leads to (18). 1 − z̃ k,n 1 − ẑ k,n
Now collect the terms with σk2 in to give
Thus giving us (33), allowing us to write the posterior distri-
1 bution for x k,n in the form given in (30).
− 2M log σk2 − log || + σk−2 ||ỹk − Ãμ||22 (51)
2
and then define τ = σk−2 giving ACKNOWLEDGMENT
1 The authors would like to thank the anonymous reviewers
− 2M log τ −1 − log || + τ ||ỹk − Ãμ||22
2 for their suggestions that have helped improve this paper
1 and the Associate Editor for handling the review of this
= − − 2M log τ − log || + τ ||ỹk − Ãμ||22 . (52)
2 paper.
R EFERENCES [26] B. Liao, Z.-G. Zhang, and S.-C. Chan, “DOA estimation and tracking
of ULAs with mutual coupling,” IEEE Trans. Aerosp. Electron. Syst.,
[1] R. O. Schmidt, “Multiple emitter location and signal parameter esti- vol. 48, no. 1, pp. 891–905, Jan. 2012.
mation,” IEEE Trans. Antennas Propag., vol. 34, no. 3, pp. 276–280, [27] M. Hawes, L. Mihaylova, F. Septier, and S. Godsill, “A Bayesian
Mar. 1986. compressed sensing Kalman filter for direction of arrival estimation,”
[2] A. L. Swindlehurst and T. Kailath, “A performance analysis of subspace- in Proc. Int. Conf. Inf. Fusion, 2015, pp. 969–975.
based methods in the presence of model errors. I. The MUSIC algo- [28] D. F. Andrews and C. L. Mallows, “Scale mixtures of normal distrib-
utions,” J. Roy. Statist. Soc. B, Methodol., vol. 36, no. 1, pp. 99–102,
rithm,” IEEE Trans. Signal Process., vol. 40, no. 7, pp. 1758–1774,
Jul. 1992. 1974.
[29] P. J. Wolfe, M. Dorfler, and S. J. Godsill, “Bayesian modelling of time-
[3] R. Roy and T. Kailath, “Esprit-estimation of signal parameters via
frequency coefficients for audio signal enhancement,” in Advances in
rotational invariance techniques,” IEEE Trans. Acoust., Speech, Signal
Neural Information Processing Systems, vol. 15. Cambridge, MA, USA:
Process., vol. 37, no. 7, pp. 984–995, Jul. 1989.
MIT Press, 2002.
[4] M. D. Zoltowski, M. Haardt, and C. P. Mathews, “Closed-form 2-D [30] P. J. Wolfe, S. J. Godsill, and W.-J. Ng, “Bayesian variable selection
angle estimation with rectangular arrays in element space or beamspace and regularization for time–frequency surface estimation,” J. Roy. Statist.
via unitary ESPRIT,” IEEE Trans. Signal Process., vol. 44, no. 2, Soc. B, Statist. Methodol., vol. 66, no. 3, pp. 575–589, 2004.
pp. 316–328, Feb. 1996. [31] C. Fevotte and S. J. Godsill, “Sparse linear regression in unions of bases
[5] N. Tayem and H. M. Kwon, “Conjugate ESPRIT (C-SPRIT),” IEEE via Bayesian variable selection,” IEEE Signal Process. Lett., vol. 13,
Trans. Antennas Propag., vol. 52, no. 10, pp. 2618–2624, Oct. 2004. no. 7, pp. 441–444, Jul. 2006.
[6] F. Gao and A. B. Gershman, “A generalized ESPRIT approach to [32] L. He and L. Carin, “Exploiting structure in wavelet-based Bayesian
direction-of-arrival estimation,” IEEE Signal Process. Lett., vol. 12, compressive sensing,” IEEE Trans. Signal Process., vol. 57, no. 9,
no. 3, pp. 254–257, Mar. 2005. pp. 3488–3497, Sep. 2009.
[7] I. Ziskind and M. Wax, “Maximum likelihood localization of multiple [33] L. Yu, H. Sun, J. P. Barbot, and G. Zheng, “Bayesian compressive
sources by alternating projection,” IEEE Trans. Acoust., Speech Signal sensing for cluster structured sparse signals,” Signal Process., vol. 92,
Process., vol. 36, no. 10, pp. 1553–1560, Oct. 1988. no. 1, pp. 259–269, 2012.
[8] Y.-Y. Huang and M. Barkat, “A dynamic programming algorithm for [34] H. L. Van Trees, Detection, Estimation, and Modulation Theory: Opti-
the maximum likelihood localization of multiple sources,” IEEE Trans. mum Array Processing. New York, NY, USA: Wiley, 2002.
Antennas Propag., vol. 40, no. 9, pp. 1023–1030, Sep. 1992.
[9] P. Stoica and A. B. Gershman, “Maximum-likelihood DOA estimation
by data-supported grid search,” IEEE Signal Process. Lett., vol. 6, no. 10,
pp. 273–275, Oct. 1999.
[10] E. J. Candès, J. Romberg, and T. Tao, “Robust uncertainty principles:
Exact signal reconstruction from highly incomplete frequency informa-
tion,” IEEE Trans. Inf. Theory, vol. 52, no. 2, pp. 489–509, Feb. 2006.
[11] D. L. Donoho, “Compressed sensing,” IEEE Trans. Inf. Theory, vol. 52, Matthew Hawes received the M.Eng. and Ph.D.
no. 4, pp. 1289–1306, Apr. 2006. degrees from the Department of Electronic and Elec-
[12] D. Malioutov, M. Çetin, and A. S. Willsky, “A sparse signal reconstruc- trical Engineering, University of Sheffield, Sheffield,
tion perspective for source localization with sensor arrays,” IEEE Trans. U.K., in 2010 and 2014, respectively.
Signal Process., vol. 53, no. 8, pp. 3010–3022, Aug. 2005. Since then he has been a Research Associate with
[13] M. M. Hyder and K. Mahata, “A robust algorithm for joint-sparse the Department of Automatic Control and Systems
recovery,” IEEE Signal Process. Lett., vol. 16, no. 12, pp. 1091–1094, Engineering, University of Sheffield. He is currently
Dec. 2009. involved in the EU funded SETA project, the main
[14] I. Bilik, T. Northardt, and Y. Abramovich, “Expected likelihood for scope of which is the development of models,
compressive sensing-based DOA estimation,” in Proc. IET Int. Conf. methods and a platform for mobility prediction,
Radar Syst., 2012, pp. 1–4. congestion avoidance, and sensor data fusion for
[15] Q. Shen, W. Liu, W. Cui, S. Wu, Y. D. Zhang, and M. G. Amin, smart cities. His current research interests include array signal processing,
“Group sparsity based wideband DOA estimation for co-prime arrays,” localization and tracking, big data, modeling complex systems, intelligent
in Proc. IEEE China Summit Int. Conf. Signal Inf. Process., Jul. 2014, transportation systems, mobility, data fusion, sequential Monte Carlo methods,
pp. 252–256. and Markov chain Monte Carlo methods.
[16] S. Ji, Y. Xue, and L. Carin, “Bayesian compressive sensing,” IEEE Trans.
Signal Process., vol. 56, no. 6, pp. 2346–2356, Jun. 2008.
[17] M. E. Tipping, “Sparse Bayesian learning and the relevance vector
machine,” J. Mach. Learn. Res., vol. 1, pp. 211–244, Sep. 2001.
[18] M. E. Tipping and A. C. Faul, “Fast marginal likelihood maximisation
for sparse Bayesian models,” in Proc. Int. Workshop Artif. Intell. Statist.,
2003, pp. 3–6.
[19] C. M. Bishop, Pattern Recognition and Machine Learning. New York, Lyudmila Mihaylova (M’98–SM’08) is a Professor
NY, USA: Springer, 2006. of Signal Processing and Control at the Department
[20] M. Carlin, P. Rocca, G. Oliveri, F. Viani, and A. Massa, “Directions- of Automatic Control and Systems Engineering,
of-arrival estimation through Bayesian compressive sensing strategies,” University of Sheffield, Sheffield, U.K. Her research
IEEE Trans. Antennas Propag., vol. 61, no. 7, pp. 3828–3838, Jul. 2013. is in the areas of machine learning and autonomous
[21] Z. Yang, L. Xie, and C. Zhang, “Off-grid direction of arrival estimation systems with various applications such as navigation,
using sparse Bayesian inference,” IEEE Trans. Signal Process., vol. 61, surveillance, and sensor network systems. She has
no. 1, pp. 38–43, Jan. 2013. given a number of talks and tutorials, including
[22] P. Khomchuk and I. Bilik, “Dynamic direction-of-arrival estimation via the plenary talk for the IEEE Sensor Data Fusion
spatial compressive sensing,” in Proc. IEEE Radar Conf., May 2010, 2015 (Germany), invited talks at the University of
pp. 1191–1196. California, Los Angeles, CA, USA, IPAMI Traffic
[23] E. Karseras, K. Leung, and W. Dai, “Tracking dynamic sparse signals Workshop 2016 (USA), IET ICWMMN 2013 in Beijing, China.
using hierarchical Bayesian Kalman filters,” in Proc. IEEE Int. Conf. Dr. Mihaylova is an Associate Editor of the IEEE T RANSACTIONS ON
Acoust., Speech, Signal Process., May 2013, pp. 6546–6550. A EROSPACE AND E LECTRONIC S YSTEMS and the Elsevier Signal Processing
[24] J. Filos, E. Karseras, W. Dai, and S. Yan, “Tracking dynamic sparse Journal. She was elected in 2016 as the President of the International Society
signals with hierarchical Kalman filters: A case study,” in Proc. Int. of Information Fusion (ISIF). She is on the Board of Directors of ISIF. She was
Conf. Digit. Signal Process., 2013, pp. 1–6. the general Co-Chair of the IET Data Fusion & Target Tracking 2014 and 2012
[25] T. Su and H. Ling, “On modeling mutual coupling in antenna arrays Conferences, the Program Co-Chair for the 19th International Conference on
using the coupling matrix,” Microw. Opt. Technol. Lett., vol. 28, no. 4, Information Fusion, Heidelberg, Germany, 2016, and an Academic Chair of
pp. 231–237, 2001. the Fusion 2010 conference.
François Septier received the engineering degree Simon Godsill is a Professor of Statistical Sig-
in electrical engineering and signal processing from nal Processing with the Engineering Department,
Télécom Lille, Villeneuve-d’Ascq, France, in 2004, University of Cambridge, Cambridge, U.K. He is
and the Ph.D. degree in electrical engineering also a Professorial Fellow and a Tutor at Cor-
from the University of Valenciennes, Valenciennes, pus Christi College, Cambridge. He coordinates an
France, in 2008. active research group in signal inference and its
From 2008 to 2009, he was a Research Associate applications with the Signal Processing and Com-
with the Signal Processing and Communications munications Laboratory at Cambridge, specializing
Laboratory, Engineering Department, University of in Bayesian computational methodology, multiple
Cambridge, Cambridge, U.K. Since 2009 he has object tracking, audio and music processing, and
been an Associate Professor with the IMT Lille financial time series modeling. A particular method-
Douai, Univ. Lille, CNRS, UMR 9189-CRIStAL-Centre de Recherche en ological theme over recent years has been the development of novel techniques
Informatique Signal et Automatique de Lille, France. His current research for optimal Bayesian filtering, using Sequential Monte Carlo or Particle
interests include Bayesian computational methodology with a particular Filtering methods. He has authored extensively in journals, books, and
emphasis on the development of Monte Carlo based approaches for complex international conference proceedings, and has given a number of high profile
and high-dimensional problems. invited and plenary addresses at conferences such as the Valencia conference
on Bayesian Statistics and the IEEE Statistical Signal Processing Workshop.
Dr. Godsill was a Technical Chair of the successful IEEE NSSPW Workshop
in 2006 on sequential and nonlinear filtering methods, and has been on the
conference panel for numerous other conferences/workshops. He has served as
an Associate Editor of the IEEE T RANSACTIONS ON S IGNAL P ROCESSING
and the journal Bayesian Analysis. He was a Theme Leader in Tracking and
Reasoning over Time for the U.K.’s Data and Information Fusion Defense
Technology Center and Principal Investigator on many grants funded by the
EU, EPSRC, QinetiQ, MOD, Microsoft U.K., Citibank, and Mastercard.

Hawes Et Al. - 2017 - Bayesian Compressive Sensing Approaches For Direction of Arrival Estimation With Mutual Coupling Effects

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Hawes Et Al. - 2017 - Bayesian Compressive Sensing Approaches For Direction of Arrival Estimation With Mutual Coupling Effects

Uploaded by

Copyright:

Available Formats

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, VOL. 65, NO.

3, MARCH 2017 1357

Bayesian Compressive Sensing Approaches for

The steering vector of the array is given by

Fig. 2. RMSE values for the endfire region example.

TABLE III TABLE IV

Fig. 6. RMSE values for the increasing DOA followed by decreasing

Now following the method suggested in [17] carry out the

First, differentiating (17) with respect to pk,n gives 1 T

You might also like

Hawes Et Al. - 2017 - Bayesian Compressive Sensing Approaches For Direction of Arrival Estimation With Mutual Coupling Effects

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Hawes Et Al. - 2017 - Bayesian Compressive Sensing Approaches For Direction of Arrival Estimation With Mutual Coupling Effects

Uploaded by

Copyright:

Available Formats

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, VOL. 65, NO.

3, MARCH 2017 1357

Bayesian Compressive Sensing Approaches for

The steering vector of the array is given by

Fig. 2. RMSE values for the endfire region example.

TABLE III TABLE IV

Fig. 6. RMSE values for the increasing DOA followed by decreasing

Now following the method suggested in [17] carry out the

First, differentiating (17) with respect to pk,n gives 1 T 

You might also like

First, differentiating (17) with respect to pk,n gives 1 T