You are on page 1of 4

HANDLING SPATIAL ALIASING IN SPHERICAL ARRAY APPLICATIONS

Jens Meyer and Gary W Elko
mh acoustics
25A Summit Ave.
Summit, NJ 07901, USA
ABSTRACT Another characteristic that determines the number of re-
quired sensors is the upper frequency limit of the operating
Spherical microphone arrays are an attractive solution for bandwidth of the array. As with conventional arrays this up-
many audio applications where flexible beamforming in all per frequency limit is determined by spatial aliasing. Spatial
directions is desired. However, as with all discretely sampled aliasing occurs when the spacing between adjacent sensors
array systems, spatial aliasing puts a constraint on the upper becomes too large. The topic has been treated by other au-
operating frequency range and therefore limits some poten- thors before, e.g. [4], [6]. However, no practical solution
tial applications. Simply increasing the number of sensors has been suggested for non-spatially-bandlimited soundfields.
to overcome the effect of spatial aliasing is effective but can This work takes a closer look at the spatial aliasing and de-
be expensive, especially if very high quality microphones scribe several solutions to reduce it.
elements are used. This paper reviews the origin of spatial
aliasing for spherical arrays based on modal beamforming
and investigates alternative approaches to mitigate the spatial 2. MODAL BEAMFORMING
aliasing problem. The alternative approaches are based on:
spatial anti-aliasing filters, exploiting the natural diffraction Spherical array beamforming is motivated by the fact that ev-
of the spherical baffle and compromising the directivity at ery soundfield on a spherical surface can be expressed as a
higher frequencies. All approaches are practical and can be series of orthonormal spatial spherical harmonic modes
implemented without increasing the system cost significantly.
00 n
Index Terms- Microphone array, spherical microphone P(ka, i9, (o) = bn(ka) E3 AnmymYA(t, ()), (1)
array, spatial aliasing
where bn describes the modal response,
Ynm
are the spherical
1. INTRODUCTION harmonics of order
n
and degree m (see [7] for more details),
k is the wavenumber and a is the radius of the sphere. bn
Spherical microphone arrays
have
gained
attention over the
can be determined
analytically
once the
array setup
is known
past
few
years ([1], [2], [3], [4], [5]). Due to their inherent (surface impedance, radial microphone position, nearfield
vs.
characteristics spherical arrays are an attractive solution for farfield sources. etc.). The coefficients bn are discussed later
applications
such as
conferencing systems
or room acoustic
in more detail. All soundfield
spatial
information is contained
measurement tools, to namejust a few. Among others the ma- in the Anm coefficients. The idea of the modal beamforming
jor
characteristics are (a) a
coverage
of the three-dimensional
is to
decompose
the soundfield into its modal
components
and
(3D) space independent of direction and (b) the beamformer
therefore gain access to the the soundfield information Anmm
associated with the array can be implemented in an efficient For an ideal continuous microphone aperture the isolation
manner as modular blocks allowing for independent control of modes can be done by applying a pressure sensitive mate-
of the beam pattern and steering direction. rial on the spherical surface where the sensitivity M(i', 0) is
A potential drawback for spherical arrays is the relatively
equivalent to the complex conjugate spherical harmonic:
large number of microphones that are required. It can be
shown that at least (N + 1)2 sensors are required where N Fnm
in ] P(ka, V,
~o)M(i(,
o) dQ
is the highest order mode that the array uses for beamform- X
ing. The higher the order N the higher the spatial resolution f m ,
of the array. For real-time arrays a maximum order of N =4 = jyP(kat,t0,c,)Ynm, (t$O,co) dQ
is a reasonable compromise between good spatial resolution X
and cost for the array (hardware as well as processing power). = bn' (ka)An',m' (2)
978-1-4244-2338-5/08/$25.00 ©C2008 IEEE 1 HSCMA 2008
Since bn are known, the soundfield information
Anm,Tn'
can be +
easily computed.
M stands for the
aperture sensitivity
func- n=1
tion and F is the
array
factor (note that at this
point
the
aper-
x n=2
n=3
ture is continuous and the term
array
is
technically
not cor-
-*
rect). The result given in Equ. 2 is derived simply by applying
the orthonormality characteristic of spherical harmonics: 5 -20 0 n=6
E A 9
Q5 -35/------------>/- -yv)1yvrJ42........
-30
3. THE ORIGIN OF THE SPATIAL ALIASING 0.5 1 2 5 10
ka
Although solutions exist that allow one to realize a continu-
ous spherical aperture, it is not practical since one would only Fig.
2. Modal
response
of a
spherical array
with sensors
have access to one mode at a time. To overcome this limita- mounted flush in an
acoustically rigid sphere.
tion one can sample the aperture at positions [a, Is, (s] and
then apply weights to the sampled positions to extract multi-
ple
modes in
parallel.
This
approach basically
leads to mul-
However.tsearnements
t
a
ve.
tiple weight-and-add beamformers, one for each mode. The performance outside the designed range.
tiple~~~~ wegtan,d
bafresonfrechmd.
Te
Up to this point the modal response bn have been ne-
array factor for mode in, m' for the sampled aperture iS ex- U oti on h oa epneb aebe e
press fas:or
for mode
n',m'
glected. For a
spherical array
mounted on the surface of a
rigid sphere bn becomes [7]:
S-1
Fn',m'
S P(ka, t, Wo)M(i.s, (
bn(ka)
=
jn(ka)
h(
)(ka) (5)
s=O
h$P2/(k
a)n
00
5
bn (ka)
x and is
plotted
in
Fig.
2.
Combining
the modal
response
with
n=O the aliasing error one can see that the 6th-order aliasing only
n S-I starts for about ka > 4. Below this the 6th-order mode is
E AnmT Ynm(,s, os)Yn7 (,s, fos) (4) virtually not present. For an array with a radius of 4cm this
m=-n s=O implies that the theoretical upper frequency limit for 4th-order
an_ni,m/±e(in,m,in',m')
modes is about 5.5kHz. Above this
frequency
the 4th-order
modes will be seriously aliased by 6th order modes (and more
where F is the array factor of the discretely sampled aper- higher order modes as the frequency continues to increase).
ture. In cases where the last sum in Equ. 4 fulfills the or-
thonormality constraint (e = 0), the array output agrees with 4. APPROACHES TO OVERCOME SPATIAL
the continuous solution: F = F. However, due to the typi- ALIASING
cally non-spatially-bandlimited nature of soundfields aliasing
(e :t 0) will occur for all but very simple soundfields. From The most obvious solution to spatial aliasing is to employ a
Equ. 4 it is also obvious that aliasing depends to a great ex- more dense sensor arrangement. This would either mean an
tend on the sensor positions. This makes it difficult to present increase in the number of sensors or a reduction in radius. The
a general closed-form mathematical description. In the fol- first approach can become prohibitively expensive while the
lowing we focus on the characteristics of a 32-element array second one is not desirable due to poorer low frequency noise
geometry where the microphones are placed at the center of performance [8]. Next, three alternative approaches are pro-
the faces of a truncated icosahedron. posed to reduce spatial aliasing in spherical arrays. The solu-
Figure 1 shows the level of the aliasing error e(n, m, n', tions require little or no additional resources and are therefore
m') as a function of the order and degree of the spherical of practical relevance.
harmonics. For easier visualization the order n and degree m
are transformed to an index nm =
(n + 1)2 - n + m. White
4.1. Limit spatial resolution at high frequencies
lines are added to help separate the different orders n visually.
The figure shows that the array is capable of extracting modes The first approach is motivated by a closer view at Fig. 1. It is
up to 4th-order with little or no aliasing. Significant aliasing obvious that the higher order modes are more prone to spatial
starts to with 6th-order soundfield modes leaking into the 4th- aliasing than the lower order modes. For example the 1st-
order beam. Note that other configurations with 32 sensors order mode experiences its first aliasing from the 9th-order
exist (e.g. [4]) that have no aliasing error up to 4th-order. mode. FromFig. 2 it canbe seenthat the 9th-order mode only
2
20
25
10 20 30 40 50 60 70 80 90 100
nm
Fig. 1. Aliasing for the 32 element truncated icosahedron geometry.
contributes significant energy from about ka > 7 onwards.
Therefore, by scaling back in directivity from a forth order 10 + n=0
pattern to a first order pattern at high ka one can extend the 5 ° n=1
* n=2
frequency range from about ka = 4 to about ka = 7. o
n-3
Note that the ka values and mode orders in this section x n=4
are specific to the truncated icosahedron sampling. However, | n=6
the general trend should hold for other sampling schemes as LA
well.
-20-
4.2. Use of spatial anti-aliasing filter -25-
Time domain processing commonly employs anti-aliasing 10 20 30 40 60 80
low-pass filters. Unfortunately, spatial anti-aliasing filter are ° [ ]
not as easy to deploy. The following development of an
extended sensor covering an area of the spherical array sur- Fig. 3. Sensitivity of a conformal patch sensor to modes 0 to
face gives a qualitative analysis that can be used to compute 6 depending on the size of the sensor.
the spatial filter operation of a larger microphone. Spatial
anti-aliasing filters are also explored in [6].
To gain some insight in the modal low pass filter we start
by looking at a continuous circular conformal sensor aperture
Tmoa2 w(1-cosido)
,n =
o
that covers the pole of a sphere. To keep the analysis simple M 2 / x
we assume a constant sensitivity mo. The sensor output can 1 2n+±1
then be computed according to:
(Pn_2(costo)- Pn+±(costo))
n>O
'Of (7)
F mo
]
P(w, r, io)dQC The result from Equ. 7 is plotted in Fig. 3. Note that the
modal
sensitivity
is shown relative to the mode 0. The ordi-
nate indicates the size of the sensor represented by the open-
S
n
b1(w, r)AAn (i, 0) x ing angle 00. To the left side the sensor is small, to the ex-
n=O treme it is a point source or a spatial Dirac function. Com-
2-r do paring the values towards the left of Fig. 3 one finds that
moa2
f
n Y(ji,cc)sin(c)ddfdc
(6) they agree
with the
spherical
harmonic transform of the
spa-
JJ
tial Dirac function. Towards the right side of the figure the
> ' ~~~~~~sensor increases in size and its sensitivity to the higher or-
Mn (Wo mo ,a) der modes starts to drop. The uniform weighing of the sen-
Equation 6 introduces the new measure Mn2. This measure sor aperture results in rather large sidelobes. These can be
can be seen as the sensitivity of the sensor to a mode of order reduced by optimizing the aperture weighting function. The
3
result shows that a rather large sensor (do
= 30) will be re-
quired to attenuate modes of order 6 and higher. ka = 0.5
90
To apply 32 sensors of the required size onto a sphere - - ka=8.0
is not practical since the sensors would overlap significantly. 1soD
ka =
30
A solution is to sample the continuous aperture with small
sensors, making sure to place small sensors dense enough to
avoid spatial aliasing up to the desired frequency limit. Small 180 O
sensors in the
overlapping region
can be used
by
all channels
covering this space. Sampling the large sensor area with small V
sensors also allows for easier implementation of an aperture
21 330
weighting function.
At a first glance this solution seems to increase the sys-
24 0
tem cost significantly since the number of sensors required
becomes much larger. However, (a) the number of input chan-
nels stays the same (in our example 32) and (b) since each
input channel is now made up of many small microphones Fig. 4. Directivity pattern for an omnidirectional sensor
one can use inexpensive capsules and still obtain good noise mounted flush on the surface of an acoustically rigid sphere.
performance. One can also expect that the sensor tolerances
- typically a major concern for array beamforming - are less
6 REFERENCES
important since they will tend to average out among the many
sensors that make
up
each
input
channel.
[1] Meyer J. and Elko G. W., "A highly scalable spherical
microphone array based on an orthonormal decomposi-
4.3. Using the natural diffraction of the sphere tion of the soundfield," Proc ofIEEE ICASSP, vol. II, pp.
1781-1784,2002.
Utilizing the diffraction of the rigid spherical baffle is suit-
able for spherical arrays that are mounted on the surface of an [2] Abhayapala T. D. and Ward D. B., "Theory and design
acoustically rigid sphere. The sphere acts as an acoustic scat- of higher order sound field microphones using spherical
terer which results in a direction dependent sensitivity for an microphone arrays," Proc of IEEE ICASSP, vol. II, pp.
omnidirectional microphone that is positioned at or close to 19491952,2002.
the surface of the sphere. This means that an omnidirectional
[3] Daniel J. and Moreau S., "Further
study
of sound field
sensor actually becomes a directional sensor. This effect if
coding
with
higher
order ambisonics," Proc. 116th AES
more prominent towards higher frequencies. Fig. 4 shows the
Convention,
2004.
resulting directivity pattern for an omnidirectional point sen-
sor positioned flush on the surface of a rigid sphere for several [4] Li Z., Duraiswami R., Grassi E., and Davies L. S., "Flex-
frequencies ka. It can be seen that for ka = 2 and above the ible layout and optimal cancelation of the orthonormal-
sensor exhibits directional properties. The maximum Direc- ity error for spherical microphone arrays," Proc ofIEEE
tivity Index that is achieved towards higher ka is about 3.2dB. ICASSP, pp. 41-44, 2004.
The Directivity Index will increase further if one uses mico-
phone with larger membranes. [5] Rafaely B., "Analysis and design of spherical microphone
To overcome the spatial aliasing one can exploit the scat- arrays," IEEE Trans. Speech Audio Process., vol. 13, no.
tering and diffraction and use the output of the sensor closest 1, pp. 135-143,2005.
to the current look-direction at high frequencies. Since only a [6] Rafaely B., Weiss B., and Bachmat
E.,
"Spatial alias-
single sensor is used no spatial aliasing will occur. ing in spherical microphone arrays," IEEE Trans. Signal
Process., vol. 55, no. 3, pp. 1003-1010,2007.
5. CONCLUSION [7] E.G. Williams, Fourier Acoustics, Academic Press, San
Diego, 1999.
This paper presented three practical approaches to reduce the
effect of spatial aliasing. The solutions emphasize the practi- [8] Meyer
J. and Elko G. W., "Spherical microphone ar-
cality of an implementation. The first (sec. 4.1) and last (sec. rays for 3d sound recording," in Audio Signal Process-
4.3) approach can be used in most current systems since it re- ingfor Next-Generation Multimedia Communication Sys-
quires only a modification in the processing algorithm while tems, J. Benesty Y. (A.) Huang, Ed. Kluwer Academic
the second approach (sec. 4.2) would require a new physical Publishers, Boston, 2004.
array setup.
4