Professional Documents
Culture Documents
CHAPTER 1
AUDITORY SPACE
Simon Carlile
aThis is a point sometimes missed in the design of high fidelity audio systems
where the drive for system linearity can result in over-engineering when
compared to resolution of the final receiver, the human ear.
Auditory Space 3
Fig. 1.1. (a) When we listen to sounds over headphones the source of the sound is
generally perceived to be inside the head. If we vary the signals at each headphone so
that, as in the case illustrated in the figure, the signal is of greater amplitude in the left
ear and arrives earlier in the left ear, the apparent source of the sound will appear closer
to the left ear. (b) If a sound source is located off the midline in free space, in this case
close to the left ear of the listener, the sound will be of greater amplitude in the left ear
and arrive earlier in the left ear. In contrast to the figure above, under normal listening
conditions, the sound is also filtered by the outer ear before it is encoded by the auditory
nervous system. These effects are illustrated by the time/pressure graphs on each side
of the head and represent the pressure waves generated by a particular sound source.
In this case we perceive the sound to be located in free space away from the head. (c)
To generate the illusion of a sound in free space, the pattern of sound waves that would
have been produced by a sound in free space is generated over headphones. This is
achieved by taking into account the normal filtering effects of the outer ear. In this case,
the illusion is generated of a sound source at a particular location outside the head.
Reprinted with permission from Carlile S and King AJ, Cur Biol 1993; 3:446-448.
4 Virtual Auditory Space: Generation and Applications
b
A highly reverberant environment results in an increase in the incoherence
of the sounds waves at each ear, thereby degrading the acoustic cues used by the
auditory system in determining spatial position (see chapter 2, section 1).
c The emotional impact of music in such situations may be related to our
global location on the planet. Other coordinate systems are sometimes employed
to describe sound location and these are described in greater detail in
chapter 2, section 1.4.2.
Auditory Space 5
ing with a hand held gun, pointing the face towards the target and
tracking the position of the head or simply having the subject call out
the coordinates of the apparent location of the source. We have found
that, with appropriate training, pointing the head towards the target
and tracking the head location is a highly efficient and reliable method
for indicating perceived target location10 (Fig. 1.2).
Fig. 1.2. The figure shows a subject inside the anechoic chamber at the Auditory Neuro-
science Laboratory (University of Sydney). The subject stands on a raised platform in the
center of the chamber. The robot arm, which carries the sound source, is suspended from the
ceiling such that rotation of the vertical frame varies the azimuth location of the source. The
inner hoop, which actually carries the speaker, is driven by small stepper motors on either
side of the main frame; one such motor and its gearing can be seen to the left of the picture.
The task of the subject in a localization experiment is to turn and point her nose at the speaker
at each test location (the experiments are carried out in complete darkness). The small cap
on the head of the subject carries a 6 degrees of freedom tracking receiver which indicates
the location of the head and the direction towards which the subject is pointing. The subject
indicates completion of each task by pressing the button seen in this subjects left hand.
14 Virtual Auditory Space: Generation and Applications
Fig. 1.3. The mean localization accuracy from 9 subjects is shown together with an
estimate of the variance of the location estimates. Each sphere represents the hemi-
sphere of space surrounding the subject as indicated on each plot. The filled circle
indicates the location directly in front of the subject (Azimuth 0 Elevation 0). The
actual location of the target is indicated by the small cross at the origin of each ray. The
center of each ellipse indicates the mean location (azimuth and elevation) of six
localization trials for each subject. The variance of the azimuth and elevation
components estimate is indicated by the extent of the ellipse. The distributions of the
localization estimates for each target position are described by a Kent distribution.72
Data from Carlile et al.10
16 Virtual Auditory Space: Generation and Applications
Fig. 1.4. A simple line drawing showing the main features of the complexly convoluted
structure of the outer ear. The main functional components are (a) the pinna flange
comprising helix, anti helix and lobule, (b) the concha including the cymba and the
cavum and (c) the ear canal connecting to the floor of the concha (not shown). Adapted
with permission from Shaw EAG. In: Keidel WD et al, Handbook of Sensory physiol-
ogy. Berlin: Springer-Verlag, 1974:455-490.
range of velocities that could be examined (e.g., ref. 67) or were re-
stricted to pure tone stimuli (e.g., refs. 68, 69). Pure tones are not a
particularly ecological stimulus as most natural sounds are rich in spectral
features. More importantly, their use may have negated an important
potential cue for auditory motion, namely the location-dependent varia-
tions in the peaks and notches in the filter functions of the outer ear.
These spectral features are only available with complex, broadband stimuli.
Only recently have broadband sounds been employed with actual moving
sources70 or simulated movements using multiple sources.64,65 It is note-
worthy that, where comparison between studies is possible, the mini-
mum audible movement angle (MAMA) is considerably less with broad-
band stimuli than with pure tones.
In this chapter we have seen that our perception of auditory space
is dependent on a range of auditory and non auditory factors. The
generation of virtual auditory space promises to provide a very power-
ful research tool for the study of this important perceptual ability. A
key determinant of the utility of VAS is its fidelity. While the genera-
tion of VAS is simple in conception, its implementation involves a
number of acoustic compromises. It has been proposed here that a
behavioral measurement of the localization ability of subjects listening
to short duration noise stimuli presented in VAS represents an appro-
priate measure of that fidelity. A review of the literature examining
such localization behavior reveals that there are individual differences
in ability and that nonauditory factors can play an important role in
localization performance. Therefore, adequate tests of VAS fidelity us-
ing auditory localization tasks need to take these factors into account.
In the next chapter we will consider the physical cues to a sounds
location that are available to the auditory system. However, demon-
strating the presence of a particular physical cue does not necessarily
imply that the auditory system utilizes this cue. Some physiological
models of the encoding of these cues will be described and psycho-
physical tests examining the sensitivity of subjects to particular physi-
cal cues will also be considered. Such studies also provide insights into
the limits of sensory coding of these cues and provide important bench-
marks for the acoustic precision with which VAS needs to be gener-
ated.
ACKNOWLEDGMENTS
I would like to acknowledge Drs. Martin, Morey, Parker, Pralong
and Professor Irvine for comments on a previous version of this chap-
ter. The recent auditory localization work from the Auditory Neuro-
science Laboratory reported in this chapter was supported by the Na-
tional Health and Medical Research Council (Australia), the Australian
Research Council and the University of Sydney. The Auditory Neuro-
science Laboratory maintains a Web page outlining the laboratory facilities
and current research work at http://www.physiol.usyd.edu.au/simonc.
22 Virtual Auditory Space: Generation and Applications
REFERENCES
1. Gazzaniga MS. The congitive neurosciences. Cambridge, Mass.: MIT Press,
1994.
2. Yost WA. Auditory image perception and analysis: the basis for hearing.
Hear Res 1991; 56:8-18.
3. Masterton RB. Role of the central auditory system in hearing: the new
direction. TINS 1992; 15:280-285.
4. Blauert J. Spatial Hearing: The psychophysics of human sound localiza-
tion. Cambridge, Mass.: MIT Press, 1983.
5. Blauert J, Lindermann W. Auditory spaciousness: Some further psycho-
acoustic analyses. J Acoust Soc Am 1986; 80:533-542.
6. Dye RH, Yost WA, Stellmack MA et al. Stimulus classification procedure
for assessing the extent to which binaural processing is spectrally analytic
or synthetic. J Acoust Soc Am 1994; 96:2720-2730.
7. Wightman FL, Kistler DJ. Headphone simulation of free field listening.
II: Psychophysical validation. J Acoust Soc Am 1989; 85:868-878.
8. Wenzel EM, Arruda M, Kistler DJ et al. Localization using non-
individualized head-related transfer functions. J Acoust Soc Am 1993;
94:111-123.
9. Makous J, Middlebrooks JC. Two-dimensional sound localization by hu-
man listeners. J Acoust Soc Am 1990; 87:2188-2200.
10. Carlile S, Leong P, Hyams S et al. Distribution of errors in auditory
localization. Proceedings of the Australian Neuroscience Society 1996;
7:225.
11. Erulkar SD. Comparitive aspects of spatial localization of sound. Physiol
Rev 1972; 52:238-360.
12. Oldfield SR, Parker SPA. Acuity of sound localization: a topography of
auditory space I. Normal hearing conditions. Percept 1984; 13:581-600.
13. Mills AW. On the minimum audible angle. J Acoust Soc Am 1958;
30:237-246.
14. Hartman WM, Rakerd B. On the minimum audible angleA decision
theory approach. J Acoust Soc Am 1989; 85:2031-2041.
15. Middlebrooks JC, Green DM. Sound localization by human listeners. Annu
Rev Psychol 1991; 42:135-159.
16. Wightman FL, Kistler DJ. Sound localization. In: Yost WA, Popper AN,
Fay RR, ed. Human psychophysics. New York: Springer-Verlag,
1993:155-192.
17. Yost WA, Gourevitch G. Directional hearing. New York: Springer-Verlag,
1987.
18. Blauert J. Binaural localization. Scand Audiol 1982; Suppl.15:7-26.
19. Oldfield SR, Parker SPA. Acuity of sound localization: a topography of
auditory space II: Pinna cues absent. Percep 1984; 13:601-617.
20. Oldfield SR, Parker SPA. Acuity of sound localization: a topography of
auditory space. III Monaural hearing conditions. Percep 1986; 15:67-81.
21. Wightman FL, Kistler DJ, Perkins ME. A new approach to the study of
human sound localization. In: Yost WA, Gourevitch G, ed. Directional
Hearing. New York: Academic, 1987:26-48.
Auditory Space 23