38 views

Uploaded by Vinay Kothapally

save

You are on page 1of 4

**Jens Meyer and Gary W Elko
**

mh acoustics

25A Summit Ave.

Summit, NJ 07901, USA

ABSTRACT Another characteristic that determines the number of re-

quired sensors is the upper frequency limit of the operating

Spherical microphone arrays are an attractive solution for bandwidth of the array. As with conventional arrays this up-

many audio applications where flexible beamforming in all per frequency limit is determined by spatial aliasing. Spatial

directions is desired. However, as with all discretely sampled aliasing occurs when the spacing between adjacent sensors

array systems, spatial aliasing puts a constraint on the upper becomes too large. The topic has been treated by other au-

operating frequency range and therefore limits some poten- thors before, e.g. [4], [6]. However, no practical solution

tial applications. Simply increasing the number of sensors has been suggested for non-spatially-bandlimited soundfields.

to overcome the effect of spatial aliasing is effective but can This work takes a closer look at the spatial aliasing and de-

be expensive, especially if very high quality microphones scribe several solutions to reduce it.

elements are used. This paper reviews the origin of spatial

aliasing for spherical arrays based on modal beamforming

and investigates alternative approaches to mitigate the spatial 2. MODAL BEAMFORMING

aliasing problem. The alternative approaches are based on:

spatial anti-aliasing filters, exploiting the natural diffraction Spherical array beamforming is motivated by the fact that ev-

of the spherical baffle and compromising the directivity at ery soundfield on a spherical surface can be expressed as a

higher frequencies. All approaches are practical and can be series of orthonormal spatial spherical harmonic modes

implemented without increasing the system cost significantly.

00 n

Index Terms- Microphone array, spherical microphone P(ka, i9, (o) = bn(ka) E3 AnmymYA(t, ()), (1)

array, spatial aliasing

where bn describes the modal response,

Ynm

are the spherical

1. INTRODUCTION harmonics of order

n

and degree m (see [7] for more details),

k is the wavenumber and a is the radius of the sphere. bn

Spherical microphone arrays

have

gained

attention over the

can be determined

analytically

once the

array setup

is known

past

few

years ([1], [2], [3], [4], [5]). Due to their inherent (surface impedance, radial microphone position, nearfield

vs.

characteristics spherical arrays are an attractive solution for farfield sources. etc.). The coefficients bn are discussed later

applications

such as

conferencing systems

or room acoustic

in more detail. All soundfield

spatial

information is contained

measurement tools, to namejust a few. Among others the ma- in the Anm coefficients. The idea of the modal beamforming

jor

characteristics are (a) a

coverage

of the three-dimensional

is to

decompose

the soundfield into its modal

components

and

(3D) space independent of direction and (b) the beamformer

therefore gain access to the the soundfield information Anmm

associated with the array can be implemented in an efficient For an ideal continuous microphone aperture the isolation

manner as modular blocks allowing for independent control of modes can be done by applying a pressure sensitive mate-

of the beam pattern and steering direction. rial on the spherical surface where the sensitivity M(i', 0) is

A potential drawback for spherical arrays is the relatively

equivalent to the complex conjugate spherical harmonic:

large number of microphones that are required. It can be

shown that at least (N + 1)2 sensors are required where N Fnm

in ] P(ka, V,

~o)M(i(,

o) dQ

is the highest order mode that the array uses for beamform- X

ing. The higher the order N the higher the spatial resolution f m ,

of the array. For real-time arrays a maximum order of N =4 = jyP(kat,t0,c,)Ynm, (t$O,co) dQ

is a reasonable compromise between good spatial resolution X

and cost for the array (hardware as well as processing power). = bn' (ka)An',m' (2)

978-1-4244-2338-5/08/$25.00 ©C2008 IEEE 1 HSCMA 2008

Since bn are known, the soundfield information

Anm,Tn'

can be +

easily computed.

M stands for the

aperture sensitivity

func- n=1

tion and F is the

array

factor (note that at this

point

the

aper-

x n=2

n=3

ture is continuous and the term

array

is

technically

not cor-

-*

rect). The result given in Equ. 2 is derived simply by applying

the orthonormality characteristic of spherical harmonics: 5 -20 0 n=6

E A 9

Q5 -35/------------>/- -yv)1yvrJ42........

-30

3. THE ORIGIN OF THE SPATIAL ALIASING 0.5 1 2 5 10

ka

Although solutions exist that allow one to realize a continu-

ous spherical aperture, it is not practical since one would only Fig.

2. Modal

response

of a

spherical array

with sensors

have access to one mode at a time. To overcome this limita- mounted flush in an

acoustically rigid sphere.

tion one can sample the aperture at positions [a, Is, (s] and

then apply weights to the sampled positions to extract multi-

ple

modes in

parallel.

This

approach basically

leads to mul-

However.tsearnements

t

a

ve.

tiple weight-and-add beamformers, one for each mode. The performance outside the designed range.

tiple~~~~ wegtan,d

bafresonfrechmd.

Te

Up to this point the modal response bn have been ne-

array factor for mode in, m' for the sampled aperture iS ex- U oti on h oa epneb aebe e

press fas:or

for mode

n',m'

glected. For a

spherical array

mounted on the surface of a

rigid sphere bn becomes [7]:

S-1

Fn',m'

S P(ka, t, Wo)M(i.s, (

bn(ka)

=

jn(ka)

h(

)(ka) (5)

s=O

h$P2/(k

a)n

00

5

bn (ka)

x and is

plotted

in

Fig.

2.

Combining

the modal

response

with

n=O the aliasing error one can see that the 6th-order aliasing only

n S-I starts for about ka > 4. Below this the 6th-order mode is

E AnmT Ynm(,s, os)Yn7 (,s, fos) (4) virtually not present. For an array with a radius of 4cm this

m=-n s=O implies that the theoretical upper frequency limit for 4th-order

an_ni,m/±e(in,m,in',m')

modes is about 5.5kHz. Above this

frequency

the 4th-order

modes will be seriously aliased by 6th order modes (and more

where F is the array factor of the discretely sampled aper- higher order modes as the frequency continues to increase).

ture. In cases where the last sum in Equ. 4 fulfills the or-

thonormality constraint (e = 0), the array output agrees with 4. APPROACHES TO OVERCOME SPATIAL

the continuous solution: F = F. However, due to the typi- ALIASING

cally non-spatially-bandlimited nature of soundfields aliasing

(e :t 0) will occur for all but very simple soundfields. From The most obvious solution to spatial aliasing is to employ a

Equ. 4 it is also obvious that aliasing depends to a great ex- more dense sensor arrangement. This would either mean an

tend on the sensor positions. This makes it difficult to present increase in the number of sensors or a reduction in radius. The

a general closed-form mathematical description. In the fol- first approach can become prohibitively expensive while the

lowing we focus on the characteristics of a 32-element array second one is not desirable due to poorer low frequency noise

geometry where the microphones are placed at the center of performance [8]. Next, three alternative approaches are pro-

the faces of a truncated icosahedron. posed to reduce spatial aliasing in spherical arrays. The solu-

Figure 1 shows the level of the aliasing error e(n, m, n', tions require little or no additional resources and are therefore

m') as a function of the order and degree of the spherical of practical relevance.

harmonics. For easier visualization the order n and degree m

are transformed to an index nm =

(n + 1)2 - n + m. White

4.1. Limit spatial resolution at high frequencies

lines are added to help separate the different orders n visually.

The figure shows that the array is capable of extracting modes The first approach is motivated by a closer view at Fig. 1. It is

up to 4th-order with little or no aliasing. Significant aliasing obvious that the higher order modes are more prone to spatial

starts to with 6th-order soundfield modes leaking into the 4th- aliasing than the lower order modes. For example the 1st-

order beam. Note that other configurations with 32 sensors order mode experiences its first aliasing from the 9th-order

exist (e.g. [4]) that have no aliasing error up to 4th-order. mode. FromFig. 2 it canbe seenthat the 9th-order mode only

2

20

25

10 20 30 40 50 60 70 80 90 100

nm

Fig. 1. Aliasing for the 32 element truncated icosahedron geometry.

contributes significant energy from about ka > 7 onwards.

Therefore, by scaling back in directivity from a forth order 10 + n=0

pattern to a first order pattern at high ka one can extend the 5 ° n=1

* n=2

frequency range from about ka = 4 to about ka = 7. o

n-3

Note that the ka values and mode orders in this section x n=4

are specific to the truncated icosahedron sampling. However, | n=6

the general trend should hold for other sampling schemes as LA

well.

-20-

4.2. Use of spatial anti-aliasing filter -25-

Time domain processing commonly employs anti-aliasing 10 20 30 40 60 80

low-pass filters. Unfortunately, spatial anti-aliasing filter are ° [ ]

not as easy to deploy. The following development of an

extended sensor covering an area of the spherical array sur- Fig. 3. Sensitivity of a conformal patch sensor to modes 0 to

face gives a qualitative analysis that can be used to compute 6 depending on the size of the sensor.

the spatial filter operation of a larger microphone. Spatial

anti-aliasing filters are also explored in [6].

To gain some insight in the modal low pass filter we start

by looking at a continuous circular conformal sensor aperture

Tmoa2 w(1-cosido)

,n =

o

that covers the pole of a sphere. To keep the analysis simple M 2 / x

we assume a constant sensitivity mo. The sensor output can 1 2n+±1

then be computed according to:

(Pn_2(costo)- Pn+±(costo))

n>O

'Of (7)

F mo

]

P(w, r, io)dQC The result from Equ. 7 is plotted in Fig. 3. Note that the

modal

sensitivity

is shown relative to the mode 0. The ordi-

nate indicates the size of the sensor represented by the open-

S

n

b1(w, r)AAn (i, 0) x ing angle 00. To the left side the sensor is small, to the ex-

n=O treme it is a point source or a spatial Dirac function. Com-

2-r do paring the values towards the left of Fig. 3 one finds that

moa2

f

n Y(ji,cc)sin(c)ddfdc

(6) they agree

with the

spherical

harmonic transform of the

spa-

JJ

tial Dirac function. Towards the right side of the figure the

> ' ~~~~~~sensor increases in size and its sensitivity to the higher or-

Mn (Wo mo ,a) der modes starts to drop. The uniform weighing of the sen-

Equation 6 introduces the new measure Mn2. This measure sor aperture results in rather large sidelobes. These can be

can be seen as the sensitivity of the sensor to a mode of order reduced by optimizing the aperture weighting function. The

3

result shows that a rather large sensor (do

= 30) will be re-

quired to attenuate modes of order 6 and higher. ka = 0.5

90

To apply 32 sensors of the required size onto a sphere - - ka=8.0

is not practical since the sensors would overlap significantly. 1soD

ka =

30

A solution is to sample the continuous aperture with small

sensors, making sure to place small sensors dense enough to

avoid spatial aliasing up to the desired frequency limit. Small 180 O

sensors in the

overlapping region

can be used

by

all channels

covering this space. Sampling the large sensor area with small V

sensors also allows for easier implementation of an aperture

21 330

weighting function.

At a first glance this solution seems to increase the sys-

24 0

tem cost significantly since the number of sensors required

becomes much larger. However, (a) the number of input chan-

nels stays the same (in our example 32) and (b) since each

input channel is now made up of many small microphones Fig. 4. Directivity pattern for an omnidirectional sensor

one can use inexpensive capsules and still obtain good noise mounted flush on the surface of an acoustically rigid sphere.

performance. One can also expect that the sensor tolerances

- typically a major concern for array beamforming - are less

6 REFERENCES

important since they will tend to average out among the many

sensors that make

up

each

input

channel.

[1] Meyer J. and Elko G. W., "A highly scalable spherical

microphone array based on an orthonormal decomposi-

4.3. Using the natural diffraction of the sphere tion of the soundfield," Proc ofIEEE ICASSP, vol. II, pp.

1781-1784,2002.

Utilizing the diffraction of the rigid spherical baffle is suit-

able for spherical arrays that are mounted on the surface of an [2] Abhayapala T. D. and Ward D. B., "Theory and design

acoustically rigid sphere. The sphere acts as an acoustic scat- of higher order sound field microphones using spherical

terer which results in a direction dependent sensitivity for an microphone arrays," Proc of IEEE ICASSP, vol. II, pp.

omnidirectional microphone that is positioned at or close to 19491952,2002.

the surface of the sphere. This means that an omnidirectional

[3] Daniel J. and Moreau S., "Further

study

of sound field

sensor actually becomes a directional sensor. This effect if

coding

with

higher

order ambisonics," Proc. 116th AES

more prominent towards higher frequencies. Fig. 4 shows the

Convention,

2004.

resulting directivity pattern for an omnidirectional point sen-

sor positioned flush on the surface of a rigid sphere for several [4] Li Z., Duraiswami R., Grassi E., and Davies L. S., "Flex-

frequencies ka. It can be seen that for ka = 2 and above the ible layout and optimal cancelation of the orthonormal-

sensor exhibits directional properties. The maximum Direc- ity error for spherical microphone arrays," Proc ofIEEE

tivity Index that is achieved towards higher ka is about 3.2dB. ICASSP, pp. 41-44, 2004.

The Directivity Index will increase further if one uses mico-

phone with larger membranes. [5] Rafaely B., "Analysis and design of spherical microphone

To overcome the spatial aliasing one can exploit the scat- arrays," IEEE Trans. Speech Audio Process., vol. 13, no.

tering and diffraction and use the output of the sensor closest 1, pp. 135-143,2005.

to the current look-direction at high frequencies. Since only a [6] Rafaely B., Weiss B., and Bachmat

E.,

"Spatial alias-

single sensor is used no spatial aliasing will occur. ing in spherical microphone arrays," IEEE Trans. Signal

Process., vol. 55, no. 3, pp. 1003-1010,2007.

5. CONCLUSION [7] E.G. Williams, Fourier Acoustics, Academic Press, San

Diego, 1999.

This paper presented three practical approaches to reduce the

effect of spatial aliasing. The solutions emphasize the practi- [8] Meyer

J. and Elko G. W., "Spherical microphone ar-

cality of an implementation. The first (sec. 4.1) and last (sec. rays for 3d sound recording," in Audio Signal Process-

4.3) approach can be used in most current systems since it re- ingfor Next-Generation Multimedia Communication Sys-

quires only a modification in the processing algorithm while tems, J. Benesty Y. (A.) Huang, Ed. Kluwer Academic

the second approach (sec. 4.2) would require a new physical Publishers, Boston, 2004.

array setup.

4

- Guardsense-wireless Wall EnUploaded bypaulmazziotta
- Miking SymphonyOrchestraUploaded bySara Sarita
- mics_for_music_studioUploaded byMatheus Albuquerque
- DM3xxx Accessory List LeafletUploaded byAdvantec Srl
- Pro Audio ReviewUploaded byaragon1974
- Lab_Manual_S07.pdf p spiceUploaded byAnonymous eWMnRr70q
- Chapter 1Uploaded byapi-3733590
- Homework 2Uploaded byDr. Ir. R. Didin Kusdian, MT.
- raspberry pi developing ideasUploaded byapi-292954778
- sony_hvr-a1_n_hc1_colorimetric_n_res_setsUploaded bycrapuloni
- Does Capsule Quality MatterUploaded byDavid Kane
- SMM8 InstructionsUploaded byShashika Sameera
- Laney LA30C ManualUploaded byneiltton_ferreira
- Speech Trainer Kit Using Laryngeal VibrationsUploaded bywazeembasheerk
- MIT6_003S10_lec21Uploaded byMasoud Hoseinzade
- IYPT Problems 2013Uploaded byWarmed_Tuberosum
- V8i UpdateUploaded byndorsa
- Korg DTR2000_1000_EFG2Uploaded byPeter Paul Zammit
- Automatic Speech Recognition of Human-Symbiotic Robot EMIEWUploaded byhardik_patel182000
- History Electronic MusicUploaded bymigueldelaquila5540
- SensorUploaded byatmegamicro
- TransducersUploaded byAstrosca
- Speech and Channel CodingUploaded byTri Nguyen
- Airport HiViz UKUploaded byRuben Pinaya Perez
- A Four-Antenna Receiver in 90-Nm CMOS for Beam Forming and Spatial DiversityUploaded by陳俊宏
- Communication Radar Systems Design and SimulationUploaded bydimitar55555
- Acoustics and Broadcasting MITRC RefresherUploaded byLorenz Ardiente
- Wiltgen TimothyE Masters Thesis ME DeptUploaded byFernando de Sá
- Microphone types and specific microphone techniquesUploaded byDimitris Mylis
- Introducing DAQUploaded byds_srinivas

- PP Paper GoldUploaded byVinay Kothapally
- FASST_UserGuide_v1Uploaded byVinay Kothapally
- Spiral PsoUploaded byVinay Kothapally
- Spherical Harmonic Modal Beamforming for an Augmented Circular Microphone ArrayUploaded byVinay Kothapally
- Multiple sensorUploaded byVinay Kothapally
- A Steerable and Variable First-Order Differential Microphone ArrayUploaded byVinay Kothapally
- MainUploaded byVinay Kothapally
- ECE-R07Uploaded byVinay Kothapally
- DAY Wise ProgramsUploaded byVinay Kothapally
- adcUploaded byVinay Kothapally