Professional Documents
Culture Documents
47:461–84
#15933 in IAP Ref DB
Copyright © 1996 by Annual Reviews Inc. All rights reserved
Charles S. Watson
Departments of Speech and Hearing Sciences and of Psychology, Indiana University,
Bloomington, Indiana 47405
ABSTRACT
In this review of auditory psychophysics and perception, we cite some important
books, research monographs, and research summaries from the past decade.
Within auditory psychophysics, we have singled out some topics of current
importance: Cross-Spectral Processing, Timbre and Pitch, and Methodological
Developments.
Complex sounds and complex listening tasks have been the subject of new
studies in auditory perception. We review especially work that concerns auditory
pattern perception, with emphasis on temporal aspects of the patterns and on
patterns that do not depend on the cognitive structures often involved in the
perception of speech and music.
Finally, we comment on some aspects of individual differences that are
sufficiently important to question the goal of characterizing auditory properties
of the typical, average, adult listener. Among the important factors that give rise
to these individual differences are those involved in selective processing and
attention.
CONTENTS
INTRODUCTION..................................................................................................................... 461
Contemporary Psychophysics ............................................................................................. 463
AUDITORY PSYCHOPHYSICS—SELECTED TOPICS...................................................... 465
0066-4170/96/0101-0461$08.00 461
462 HIRSH & WATSON
INTRODUCTION
Fechner’s so-called inner and outer psychophysics are today represented in
audition by (a) the relations between auditory behavior and correlated physi-
ological observations, mostly in the cochlea and auditory nerve, and (b) the
relations between auditory responses and properties of stimulus sounds. In this
review, we deal primarily with the latter because the former are richly repre-
sented in the biological and neuroscience literature.
Auditory psychophysics or psychoacoustics has been concerned with meas-
ures of absolute sensitivity, masking, and discrimination between sounds that
differ in frequency content, intensity, duration, and spatial location. After a
century of work, psychophysical methods are more precise (mostly through
the application of signal-detection theory), and attention has turned toward
complex sounds—sounds that contain many components simultaneously and
sequentially and that are more representative of sounds in a natural or speech-
filled acoustic environment.
The past two decades have also seen a growing interest in auditory percep-
tion, a development that, in contrast with the history in visual perception, has
been long delayed. From the 1930s, of course, there were considerable ad-
vances in our understanding of the auditory perception of speech (see Miller &
Eimas 1995) and, to a lesser degree, of music. Now, however, we find several
books and many articles on this more complex aspect of hearing, the scope of
which extends beyond the auditory perception of speech and music.
We note three important trends in psychoacoustic research in these recent
decades. First, after a century-old preoccupation with pitch and other fre-
quency-related dimensions as the principal features of auditory processes, the
important role of time and time-varying properties in the stimulus has become
clear. Temporal relations have been clear enough in their application to speech
perception, but now it is becoming evident that such temporal relations are a
necessary component of auditory perception in general.
Second, in spite of a traditional goal of psychophysics to reveal charac-
teristics of the average adult listener, experimenters have in recent years
shown results for individual listeners. These data on individual listeners chal-
AUDITORY PSYCHOPHYSICS AND PERCEPTION 463
lenge the notion of the typical observer. The dependence of these differences
on properties of the stimulus, the psychophysical methods used, and the
amount of training on a task may predict the form of an account of individual
differences in sensory processes or capabilities, and also in listeners’ proclivi-
ties in responding in a variety of auditory tasks.
Third, the important research on speech perception (see Miller & Eimas
1995) and on the psychology of music (see McAdams & Bigand 1993) has
been developed in light of the characteristics of speech (or music) as a part of a
general language system, including remembered rules and structures. Such
cognitive processes have not always been clearly distinguished from auditory
perceptual processes.
In previous recent volumes of the Annual Review of Psychology, hearing
has been described as follows:
In Volume 30, Trahiotis & Robinson (1979) wrote on auditory psychophysics,
emphasizing pitch perception, intensity perception, auditory nonlinearities,
and binaural hearing.
In Volume 34, McFadden & Wightman (1983) wrote on relations between
normal and pathological hearing.
In Volume 38, de Boer & Dreschler (1987) wrote on spectrotemporal repre-
sentation of signals.
In Volume 40, Teas (1989) wrote on auditory physiology.
In Volume 42, Middlebrooks & Green (1991) wrote on sound localization, and
Krumhansl (1991) wrote on music psychology.
In Volume 46, Miller & Eimas (1995) wrote on speech perception.
In the present chapter, we bring the reader up to date on some of these
topics, largely through books and other secondary sources. In somewhat
greater detail, we focus on three psychoacoustic topics: profile analysis, “co-
modulation masking release” (CMR), and pitch and timbre.
In addition, we address more general areas such as auditory perception and
individual differences. Our literature sources are principally from the past
decade.
Contemporary Psychophysics
Several good summaries of contemporary work in auditory psychophysics are
available (Green 1988a, Handel 1989, Moore 1989). We note especially the
book edited by Yost, Popper, and Fay (1993), in which the field is addressed in
chapters, each written by an expert or two. In the chapter “Auditory Intensity
Discrimination” (Green 1993a) are discussions of classical intensity discrimi-
nation, loudness, signal-detection tasks, profile analysis, and comodulation
masking release (see “Cross-Spectral Processing” in this review). In the chap-
ter “Frequency Analysis and Pitch Perception” (Moore 1993) are discussions
464 HIRSH & WATSON
Cross-Spectral Processing
The latest general review of psychoacoustic research in the Annual Review of
Psychology (de Boer & Dreschler 1987) briefly introduced two new classes of
experiment: profile analysis (Spiegel et al 1981) and what later was termed
comodulation masking release (CMR) (Hall et al 1984). Such experiments
have in common the listener’s use of information distributed over many criti-
cal bands (CB) in detecting a narrow-band signal. The large number of sub-
sequent experiments on profiles and CMR have not led to a rejection of the
critical-band theory, which has been the foundation of most auditory theories
since Fletcher’s (1940) germinal paper on that topic. But they have made it
clear that listeners can monitor the output of more than one CB simultaneously
and thereby can achieve better performance than they would by listening only
to the CB containing the signal. Although de Boer & Dreschler (1987) ob-
served that, “Appealing as the concept of profile analysis might be for auditory
theory, it probably does not play an important part in everyday sound analy-
sis,” the recognition and discrimination of vowels on the basis of spectral
peaks (formants) do not accord with that conclusion.
most of the other profile effects discussed earlier in this section. An alternative
theory would be to estimate the pitch of the profiles, on the basis of the
“envelope weighted average instantaneous frequency” (EWAIF) proposed by
Feth (1974). However, that model does not seem applicable to stimuli as broad
in their bandwidth as are most profiles.
below the band of noise containing the signal (e.g. Wright & McFadden 1990).
In some experiments either (a) these flanking bands and the signal band are
modulated with a common envelope, the “correlated” or “comodulated” condi-
tion; (b) all of the flanking bands share an envelope but the signal band is
incoherently modulated from those flanking bands, the “uncorrelated condi-
tion”; or (c) each band is independently modulated, the “all-uncorrelated”
condition. Wright & McFadden intermixed these conditions within trial
blocks. They concluded that the absence of any large effect of this form of
trial-to-trial uncertainty implies that the listeners do not have to choose among
several condition-specific listening strategies.
Several theories have been proposed to explain the CMR, and some experi-
mental support has been found for most of them. No theory is yet able to
account for all CMR effects. Perhaps earliest was the theory that the envelope
fluctuations remote from the signal frequency served to identify regions (dips
or troughs in the envelope) where the signal-to-noise ratio was highest and to
which the listener therefore attended (Buus 1985, Hall 1986). This explanation
was supported by the demonstration that brief signals are more detectable
when they are positioned in the valleys of sinusoidally amplitude-modulated
(SAM) noise in a CMR task than when they are at the peaks, and also that
phase-incoherent placement of the signals yields still higher thresholds (Moore
et al 1990).
Another theoretical explanation of CMR is that the listener detects the
reduction in the correlation between the envelopes of the signal band and those
of the flanking bands, resulting from the introduction of the signal (Cohen &
Schubert 1987, Green 1992, Richards 1987). Hall & Grose (1988) provided a
partial test of both models by comparing the detection of a single sinusoidal
signal presented in the center of three bands to conditions in which signals
were present in two or all three of the bands. The results were more consistent
with the correlation-decorrelation hypothesis than they were with valley-lis-
tening. Actual measures, however, of the correlation between samples of the
noises with and without signals did not provide a convincing fit to the correla-
tion-detection hypothesis. The data were somewhat more consistent with a
version of the Durlach-Braida (1963) equalization-cancellation model for the
detection of signals presented in dichotic listening conditions.
Timbre vs Pitch
The auditory attributes of pitch and loudness appear to be unidimensional and
extend from low to high or from soft to loud. In contrast, timbre is a more
qualitative attribute and includes several dimensions (Plomp 1970). Useful
summaries of work on timbre are found in Moore (l989, pp. 229–33), Handel
(1989, pp. 226–63), and Rasch & Plomp (1982, pp. 12–24).
470 HIRSH & WATSON
The classical view of timbre holds that different timbres result from differ-
ent distributions of amplitudes of the harmonic components of a complex tone
in a steady state. That quality that permits a listener to identify the instruments
of an orchestra, however, involves also dynamic features of the sound (Moore
1989, pp. 232–33), as had been shown earlier by Grey (1977), by Grey &
Moorer (1977), and by Risset & Wessel (1982), particularly for onset charac-
teristics. The tones of different instruments do not begin in the same manner,
and further, not all of the harmonics of different instruments rise at the same
rate.
Extending the work of Kendall (1986), Iverson & Krumhansl (1993) have
assessed the relative importance of the onset and the “remainder,” or steady
state, by asking listeners to judge the similarity of different orchestral-instru-
ment tones. Applying a multidimensional-scaling technique to results for com-
plete tones, for 80-ms onsets, and for remainders (tones without the onset),
they found that both the temporal and spectral information was available in all
three types of tone samples. These results were not consistent with earlier
work that relied on identification of instruments as opposed to judgments of
similarity.
Beyond the domain of musical tones, timbre has become almost a synonym
for “quality.” One problem is that one of the many adjectives used by listeners
to describe different timbres is “pitchlike.” Singh (1987) demonstrated that
there is an intimate relation, even a tradeoff, between pitch and timbre in
certain tasks involving tone sequences. Pitt (1994) reported that listeners are
less accurate in judging the similarity of pitches of tones with different timbres
than they are in judging the similarity of pitches of tones with the same timbre.
Further, although Bregman’s (1990) early studies of stream segregation were
concerned with the pitch relations among the notes in a sequence, Singh &
Hirsh (1992) found that streaming could also be observed across same-pitch
notes with different timbres. It is widely accepted these days (Moore 1993)
that pitch can be mediated both by different parts of the spectrum and by the
temporal structure of the waveform. No wonder then that timbral differences
in spectral patterns are confused with spectral pitch.
Methodological Developments
Although no effort has been made in recent years to catalog the broad range of
modifications of standard psychophysical procedures used to study particular
auditory abilities, we mention two methodological variants, both directed at
increasing psychophysical efficiency. Green (1993b) has developed a maxi-
mum-likelihood technique by which a stimulus dimension can be sampled
with high efficiency and that yields valid threshold estimates in as few as a
dozen trials, using a yes-no (single-interval) procedure. This method appears
to have promise for studies of large stimulus domains in which stimuli yield-
AUDITORY PSYCHOPHYSICS AND PERCEPTION 471
AUDITORY PERCEPTION
While perceptual studies in the field of vision have been numerous and fruit-
ful, auditory perception has been a strong area of research only during the past
few decades (see Yost & Watson 1987). To be sure, interest in the perception
of speech and music has been strong for much of this century, but, owing to
the unique aspects of speech and music, a major emphasis has been placed on
the cognitive facets of linguistic and musical structures without an accompa-
nying concern for perception that is uniquely auditory (Julesz & Hirsh 1972).
This observation is curious in view of the importance accorded by the early
Gestalt psychologists to auditory as well as visual examples of so-called
organized perceptual processes—for example, von Ehrenfels’s demonstration
of the recognizability of a melody under transposition (when the “elements”
are changed but the Gestaltqualität or melodic structure remains). Interest has
now shifted from sensory processes to perception of auditory patterns and
patterns of patterns.
472 HIRSH & WATSON
Auditory Patterns
Auditory events or entities, whether brief or prolonged, may be said to com-
prise auditory patterns. On the acoustical or stimulus side, such patterns range
from simple to complex. Hirsh (1988) suggested a hierarchy from single
sounds in a steady state, to single sounds with a changing state, to sound
sequences, to concurrent sequences. CHABA (1989, pp. 23–27) made similar
suggestions. Hartmann (1988), Bregman (1990), and Yost (1992), seeking
single dimensions that controlled segregation and integration of single sounds
into larger entities, listed variables like onset and offset times, temporal modu-
lation, duration, spectral content, level and location in space, in addition to
more general features like context, musical training, and selective attention.
AUDITORY PSYCHOPHYSICS AND PERCEPTION 473
Temporal Perception
The auditory system can respond quickly to changes in the acoustic stimulus.
The literature on temporal resolution is well reviewed and interpreted by
Viemeister & Plack (1993). The chapter addresses minimal detectable separa-
tion between brief sounds, mostly through gap-detection studies; the temporal
modulation transfer function; the influence of spectrum on temporal resolu-
tion; the influence of temporal relations in masking (see the section in this
review on “Comodulation Masking Release”); and temporal integration.
Time also serves as an important dimension in which auditory patterns are
formed. Judgments of the temporal order of events are critical. Warren (1993)
has reviewed his own studies and related work by others. When the listener’s
task is to discriminate the order AB from BA, thresholds can be in the 1–2 ms
range for pairs of brief events. To make it possible for the listener to report
whether A or B came first in such pairs, 20 ms is sufficient (Rosen & Howell
1987). If the listener must give verbal responses to name the order of events in
longer sequences or if the order is continuously recycled, longer onset separa-
tions are required.
Judged order for a single sequence can be disrupted by the presence of a
second sequence. Order relations in either of two such streams depend upon
the rate of events and frequencies involved (Bregman 1990, 1993). The basic
observation out of which grew auditory stream segregation and scene analysis,
summarized in these recent Bregman references, concerns sequences of
pitches. At moderate rates with a relatively narrow range of pitches, a listener
reports hearing a single line of pitches. When, however, some of the pitches
are in a high range and others are in a low range and the rate of alternation is
higher, listeners report hearing two concurrent pitch lines.
rhythmic pattern, but such changes elaborate the patterns with markings or
accents. Listeners’ sensitivity to the time of occurrence of elements within
rhythmic and arrhythmic sequences has been demonstrated by Hirsh et al (1990)
and by Monahan & Hirsh (1990).
Paul Fraisse has summarized his large body of work on time and rhythm
(Deutsch 1982). His reliance on Gestalt principles of grouping and of
common fate to explain the various structures in rhythmic patterns, with their
meters and beats, is supplemented by notions of expectancy and anticipation
(Fraisse 1984). Similar lines are found in the beat-based theory of Povel &
Essens (1985) and in the emphasis on expectancy of Jones (see Jones & Yee
1993).
noise and those for tonal sequences is sufficiently strong to make it appear that
the proportional-duration rule may be general for the discrimination of a wide
range of unfamiliar sounds.
Lutfi (1993) has developed a model of pattern discrimination that is appli-
cable to tonal sequences and to a variety of other complex stimuli: “Compo-
nent discriminability in an unfamiliar tone pattern (profile or sequence) is a
linearly increasing function of the component’s relative entropy (CoRE) in the
pattern. Ancillary: The threshold for detection of a change decreases by one
order of magnitude (factor of 10) for each one bit increase in CoRE.” This
model successfully describes discrimination performance in a number of to-
nal-pattern studies by Watson and various coworkers in which the dimension
subject to variation was the frequency, duration, or intensity of single target
tones. The success with which this model fits a wide range of discrimination
experiments is strongly dependent on the assumption that the weight assigned
to the target tone (as well as the weights for all other tones) varies with the
square of the amplitude-duration product for that tone. Given this assumption,
the predictions of the CoRE model should be in general agreement with the
proportional-target-tone-duration rule described in the preceding section (Kidd
& Watson 1992). In fact, it fits a broad range of experimental data somewhat
more accurately than does that empirical rule.
A much more restricted model of temporal sequence discrimination has
been proposed by Sorkin (1990). It successfully describes a variety of cases in
which arrhythmic temporal sequences of tones differ only in the durations of
the intercomponent gaps. If listeners are presented with two arrhythmic se-
quences of tones, frequency-pattern discrimination is dependent on the com-
plexity of the sequence of intervals (Sorkin 1987). Further, such temporally
jittered patterns can be discriminated in the temporal domain (Sorkin 1990).
But when the overall time of the sequence is compressed or expanded, such a
model is not invariant with overall time (Sorkin & Montgomery 1991). Two
facts are noteworthy in this line of research. One is that random variation in the
temporal spacing of components of a tonal sequence when the frequencies of
the components remain fixed has a surprisingly strong degrading effect on
pattern discriminability. The other fact is that Sorkin’s model, based simply on
the temporal correlation between the patterns to be discriminated (and limited
by an assumed internal temporal noise with a standard deviation of about 15
ms), does an excellent job of predicting the variation in discrimination for a
wide range of gap-sequence correlations.
INDIVIDUAL DIFFERENCES
The study of individual differences has received considerably less attention
than the performance of average normal adult listeners, with subject-to-subject
476 HIRSH & WATSON
ures may reflect in part the range of intellectual demands of different psycho-
physical methods. In support of a central-processing-efficiency explanation of
individual differences in speech processing is a significant correlation (r >
0.5–0.6) between the ability to recognize speech under degraded listening
conditions and the ability to read lips in the absence of any auditory input
(Brink 1974, Watson et al 1992).
In an earlier chapter in this series, McFadden & Wightman (1983) wrote of
the need for systematic test-battery studies of the “interrelations among the
various auditory abilities in both normal and hearing-impaired listeners.” Few
such studies have been conducted since they made that recommendation, while
reasons to do them have accumulated. For example, controversies concerning
typical listeners’ abilities to discriminate profile stimuli (see section on “Pro-
file Analysis” in this review) have largely involved the range of individual
abilities to perform in those experiments. When small numbers of subjects are
used in a profile experiment, it appears that the only way to collect reasonably
homogeneous data is to pretest larger groups of listeners and reject those who
cannot, without lengthy training, discriminate on the basis of small differences
in spectral shape (Green 1988b, p. 94; Henn & Turner 1990; Kidd et al 1991).
Various studies of the auditory performance of young children and even of
infants in a wide variety of listening tasks have been made available in a
conference proceedings edited by Werner & Rubel (1992). As these editors
note in their introduction, “even in the simplest psychoacoustic tasks, such as
the detection of a sound in quiet, infants and children exhibit what might be
interpreted as auditory deficits compared to young adults.” The results of
many recent investigations suggest a continuous reduction in psychophysical
thresholds starting around age 2 and continuing to age 5, or even to age 10 or
11 in some cases, after which adult-like performance is achieved. Specific
examples include a 12–13 dB reduction in the masked threshold for tones,
with an almost linear decrease between six months and ten years of age
(Schneider et al 1989); the minimum audible angle in spatial localization
decreases from about 27° at two months of age to 5–6° at 20 months and is
nearly at adult levels of 2–3° for five-year-old subjects (Clifton 1992). The
majority of recent developmental studies of auditory capabilities, however,
have not traced changing performance from infancy to young adulthood but
instead have simply compared infant performance with that of adults, which
leaves considerable uncertainty about the time course of the developmental
changes.
Two contributions to the volume edited by Werner & Rubel describe inter-
nal-noise models that assume a central or decision-process noise, the power of
which diminishes over the course of childhood (Viemeister & Schlauch 1992,
Wightman & Allen 1992). In the absence of strong neurophysiological evi-
dence of auditory-system development over this same period, there appears to
478 HIRSH & WATSON
SELECTIVE PROCESSING
Auditory Attention
The concept of selective auditory attention to particular sounds or to selected
spectral or temporal portions of complex sounds has been the subject of many
investigations over the past decade or so. Considered in subjective terms,
learning to direct attention to some particular aspect of a sound can lead the
listener to report that a spectral-temporal region becomes more salient or
stands out from its acoustic background. In some cases, such as those studied
by Bregman (1990) and others concerned with auditory-stream segregation,
the perceptual experience is quite similar to certain experiences with visual
displays, in which one part is seen as “figure” and the remainder as “ground.”
Psychophysical correlates of selective auditory attention include evidence
from “streaming” experiments that certain (within-figure) portions of complex
sounds are processed more accurately than others. If some spectral-temporal
portions of a complex sound can be shown to have a stronger influence on
psychophysical judgments than others, to which the listener is also capable of
responding, this might be considered objective evidence of the “distribution of
auditory attention.” When the listener is either trained or “cued” to respond
preferentially to one sound out of a catalog or to one spectral-temporal part of
a complex sound (Darwin & Carlyon 1995), evidence of enhanced processing
of the trained, or cued, stimuli is an objective correlate of what is commonly
meant by “selective attention.” It is not identical to phenomenological atten-
tion, of course, because psychophysical evidence of selective processing does
not presuppose claims by the listener that one sound, or part of a sound, seems
more salient than others. In this section, experimental evidence relevant to
psychophysically defined selective auditory attention is discussed; pheno-
AUDITORY PSYCHOPHYSICS AND PERCEPTION 479
CONCLUSION
The areas of psychoacoustics and auditory perception are well represented in
recent books and monographs. A bridge is being formed between the psycho-
physics of simple sounds and the processing of complex sounds. The latter is
enabling further relations to be made to more cognitive aspects of auditory
perception, especially attention and individual differences. We can anticipate a
fruitful expansion in the next decade of auditory sensory and perceptual
theory.
ACKNOWLEDGMENTS
Preparation of this chapter was supported by the Research Department, Central
Institute for the Deaf; and by grants NIH/NINCD DC00250 and AFSOR
F49620-92-J-0506 to Indiana University. The authors thank William Yost for
helpful suggestions.
Any Annual Review chapter, as well as any article cited in an Annual Review chapter,
may be purchased from the Annual Reviews Preprints and Reprints service.
1–800–347–8007; 415–259–5017; email: arpr@class. org
Literature Cited
Aiello R, ed. 1994. Musical Perceptions. New observation tasks. J. Acoust. Soc. Am. 86:
York: Oxford. 290 pp. 1743–46
Atkinson RC, Herrnstein RJ, Lindzey G, Luce Berg B, Green DM. 1990. Spectral weights in
RD, eds. 1988. Stevens’ Handbook of Ex- profile listening. J. Acoust. Soc. Am. 88:
perimental Psychology, Vol. 1. New York: 758–66
Wiley. 905 pp. Bernstein LB, Green DM. 1988. Detection of
Berg B. 1989. Analysis of weights in multiple changes in spectral shape: uniform vs. non-
AUDITORY PSYCHOPHYSICS AND PERCEPTION 481
uniform background spectra. Hear. Res. model for discrimination of broadband sig-
32:157–65 nals. J. Acoust. Soc. Am. 80:63–72
Bilger RC, Nuetzel JM, Rabinowitz WM, Era P, Jokela J, Qvarnberg Y, Heikkinen E.
Rzeczkowski C. 1984. Standardization of a 1986. Pure-tone thresholds, speech under-
test of speech perception in noise. J. standing, and their correlates in samples of
Speech Hear. Res. 27:32–48 men of different ages. Audiology 25:
Bregman AS. 1990. Auditory Scene Analysis. 338–52
Cambridge, MA: MIT Press. 783 pp. Espinoza-Varas B, Watson CS. 1988. Low
Bregman AS. 1993. Auditory scene analysis: commonality between tests of auditory dis-
hearing in complex environments. See crimination and of speech perception. J.
McAdams & Bigand 1993, pp. 10–36 Acoust. Soc. Am. 84(Suppl. 1):S143
Brink B. 1974. Experience concerning audio- Fallon-Coble S, Robinson DE. 1992. Dis-
visual tests utilized at a school for the deaf. criminability of bursts of reproducible
In Scandinavian Audiology, Suppl. 4, Vis- noise. J. Acoust. Soc. Am. 92:2630–35
ual and Audio-Visual Perception of Feth LL. 1974. Frequency discrimination of
Speech, ed. H Birk Nielsen, E Kampp, pp. complex periodic tones. Percept. Psycho-
210–21. Stockholm, Sweden: Almquist & phys. 15:375–78
Wiksell Fletcher H. 1940. Auditory patterns. Rev. Mod.
Buus S. 1985. Release from masking caused by Phys. 12:47–65
envelope fluctuations. J. Acoust. Soc. Am. Fraisse P. 1984. Perception and estimation of
78:1958–65 time. Annu. Rev. Psychol. 35:1–36
Cazals Y, Demany K, Horner K, eds. 1992. Gaver WW. 1993. What in the world do we
Auditory Physiology and Perception. Ad- hear? An ecological approach to auditory
vanced Bioscience, Vol. 83, Oxford: Per- event perception. Ecol. Psychol. 5:1–29
gamon Gelfand SA. 1990. Hearing: An Introduction
CHABA (Committee on Hearing, Bioacous- to Psychological and Physiological Acous-
tics, and Biomechanics). 1989. Classifica- tics. New York: Dekker. 2nd ed. 535 pp.
tion of Complex Nonspeech Sounds. Wash- Green DM. 1964. Consistency of auditory de-
ington, DC: Natl. Acad. Press. 88 pp. tection judgments. Psychol. Rev. 71:
Clifton R. 1992. The development of spatial 392–407
hearing in human infants. See Werner & Green DM. 1988a. Audition: psychophysics
Rubel 1992, pp. 135–57 and perception. See Atkinson et al 1988,
Cohen MF, Schubert ED. 1987. The effect of pp. 327–76
cross-spectrum correlation on the detecta- Green DM. 1988b. Profile Analysis: Auditory
bility of a noise signal. J. Acoust. Soc. Am. Intensity Discrimination. New York: Ox-
81:721–23 ford
Darwin CJ, Carlyon RP. 1995. Auditory group- Green DM. 1992. On the similarity of two
ing. In BCJ Moore(Ed) Hearing, ed. BCJ theories of comodulation masking release.
Moore, pp. 387–424. San Diego: Academic J. Acoust. Soc. Am. 91:1769
Dai H, Scharf B, Buus S. 1991. Effective at- Green DM. 1993a. Auditory intensity discrimi-
tenuation of signals in noise under focussed nation. See Yost et al 1993, pp. 13–55
attention. J. Acoust. Soc. Am. 89:2837–42 Green DM. 1993b. A maximum-likelihood
de Boer E, Dreschler WA. 1987. Auditory psy- method for estimating thresholds in a yes-
chophysics: spectrotemporal representation n o t a s k. J . A c o u s t . S o c . A m. 93 :
of signals. Annu. Rev. Psychol. 38:181–202 2096–2105
Deutsch D, ed. 1982. The Psychology of Music. Green DM, Kidd G Jr, Picardi M. 1983. Suc-
New York: Academic. 542 pp. cessive versus simultaneous comparison in
Dowling WJ, Harwood DL. 1986. Music Cog- auditory intensity discrimination. J. Acoust.
nition. Orlando, FL: Academic. 258 pp. Soc. Am. 73:639–43
Duda R, Lyon RF, Slaney M. 1990. Correlo- Green DM, Mason CR. 1985. Auditory profile
grams and the separation of sounds. In analysis: frequency, phase and Weber’s
IEEE Conference on Signals, Systems and law. J. Acoust. Soc. Am. 77:1155–61
Computers, 1:457–61. San Jose: Maple Green DM, Onson ZA, Forrest TG. 1987. Fre-
Press quency effects in profile analysis and de-
Durlach NI, Braida LD. 1963. Equalization and tecting complex spectra changes. J. Acoust.
cancellation theory of binaural masking Soc. Am. 81:692–99
level differences. J. Acoust. Soc. Am. 35: Greenberg GZ, Larkin WD. 1968. Frequency-
1206–18 response characteristic of auditory ob-
Durlach NI, Braida LD. 1969. Intensity percep- servers detecting signals of a single fre-
tion. I. Preliminary theory of intensity reso- quency in noise: the probe-signal method.
lution. J. Acoust. Soc. Am. 46:372–83 J. Acoust. Soc. Am. 44:1513–23
Durlach NI, Braida LD, Ito Y. 1986. Toward a Grey JM. 1977. Multidimensional perceptual
482 HIRSH & WATSON
scaling of musical timbres. J. Acoust. Soc. Julesz B, Hirsh IJ. 1972. Visual and auditory
Am. 61:1270–77 perception: an essay of comparison. In Hu-
Grey JM, Moorer JA. 1977. Perceptual evalu- man Communication: A Unified View, ed.
ations of synthesized musical instrument EE David, P Denes, pp. 283–340. New
tones. J. Acoust. Soc. Am. 62:454–62 York: McGraw-Hill
Hafter ER, Buell TN, Richards VM. 1988. On- Katz J, Stecker NA, Henderson D, eds. 1992.
set coding in lateralization: its form, site Central Auditory Processing: A Transdis-
and function. In Auditory Function: Neuro- ciplinary View. St. Louis, MO: Mosby
biological Bases of Hearing, ed. GW Edel- Kendall RA. 1986. The role of acoustic signal
man, WE Gall, WM Cowan, pp. 647–76. partitions in listener categorization of musi-
New York: Wiley cal phrases. Music Percept. 4:185–214
Hafter ER, Schlauch RS, Tang J. 1993. Attend- Kewley-Port D, Watson CS. 1994. Formant-
ing to auditory filters that were not stimu- frequency discrimination for isolated Eng-
lated directly. J. Acoust. Soc. Am. 94: lish vowels. J. Acoust. Soc. Am. 9 5 :
743–47 485–96
Hall JW III. 1986. The effect of across-fre- Kidd G Jr, Mason CR, Uchanski RM, Brantley
quency differences in masking level on MA, Shah P. 1991. Evaluation of simple
spectro-temporal pattern analysis. J. models of auditory profile analysis using
Acoust. Soc. Am. 79:781–87 random reference spectra. J. Acoust. Soc.
Hall JW III, Grose JH. 1988. Comodulation Am. 90:1340–54
masking release: evidence for multiple Kidd GR, Watson CS. 1992. The “proportion-
cues. J. Acoust. Soc. Am. 84:1669–75 of-the-total-duration rule” for the discrimi-
Hall JW III, Grose JH. 1990. Comodulation nation of auditory patterns. J. Acoust. Soc.
masking release and auditory grouping. J. Am. 92:3109–18
Acoust. Soc. Am. 88:119–25 Krumhansl CL. 1991. Music psychology: tonal
Hall JW III, Haggard MP, Fernandes MA. structures in perception and memory.
1984. Detection in noise by spectro-tempo- Annu. Rev. Psychol. 42:277–303
ral pattern analysis. J. Acoust. Soc. Am. 76: Luce RD. 1993. Sound & Hearing: A Concep-
50–56 tual Introduction. Hillsdale, NJ: Erlbaum.
Handel S. 1989. Listening: An Introduction to 322 pp., Demonstration CD
the Perception of Auditory Events. Cam- Lutfi RA. 1993. A model of auditory pattern
bridge, MA: MIT Press. 597 pp. analysis based on component-relative-en-
Hartmann WM. 1988. Pitch perception and the tropy. J. Acoust. Soc. Am. 94:748–58
organization and integration of auditory en- McAdams S. 1993. Recognition of sound
tities. In Auditory Function: Neurobiologi- sources and events. See McAdams &
cal Bases of Hearing, ed. GW Edelman, Bigand 1993, pp. 146–98
WE Gall, WM Cowan, pp. 623–45. New McAdams S, Bigand E, eds. 1993. Thinking in
York: Wiley Sound: The Cognitive Psychology of Hu-
Henn CC, Turner CW. 1990. Pure-tone incre- man Audition. New York: Oxford. 354 pp.
ment detection in harmonic and inharmonic McFadden D, Wightman FL. 1983. Audition:
backgrounds. J. Acoust. Soc. Am. 88: some relations between normal and patho-
126–31 logical hearing. Annu. Rev. Psychol. 34:
Hirsh IJ. 1988. Auditory perception and 95–128
speech. See Atkinson et al 1988, pp. McFadden D, Wright BA. 1990. Temporal de-
377–408 cline of masking and comodulation detec-
Hirsh IJ, Monahan CB, Grant KW, Singh PG. tion differences. J. Acoust. Soc. Am. 88:
1990. Studies in auditory timing. I. Simple 711–24
patterns. Percept. Psychophys. 47:215–26 Meddis R, Hewitt MJ. 1992. Modeling the
Humes L, Christopherson L, Cokely C. 1992. identification of concurrent vowels with
Central auditory processing disorders in the different fundamental frequencies. J.
elderly: fact or fiction? See Katz et al 1992, Acoust. Soc. Am. 91:233–45
pp. 141–49 Middlebrooks JC, Green DM. 1991. Sound lo-
Iverson P, Krumhansl CL. 1993. Isolating the calization by human listeners. Annu. Rev.
dynamic attributes of musical timbre. J. Psychol. 42:135–59
Acoust. Soc. Am. 94:2595–603 Miller JL, Eimas PD. 1995. Speech perception:
Johnson DM, Watson CS, Jensen JK. 1987. from signal to word. Annu. Rev. Psychol.
Individual differences in auditory capabili- 46:467–92
ties. J. Acoust. Soc. Am. 81:427–38 Monahan CB, Hirsh IJ. 1990. Studies in audi-
Jones MR, Holleran S, eds. 1992. Cognitive tory timing. II. Rhythmic patterns. Percept.
Bases of Musical Communication. Wash- Psychophys. 47:227–42
ington, DC: Am. Psychol. Assoc. 284 pp. Moore BCJ. 1973. Frequency difference li-
Jones MR, Yee W. 1993. Attending to auditory mens for short tones. J. Acoust. Soc. Am.
events: the role of temporal organization. 54:610–19
See McAdams & Bigand 1993, pp. 69–112 Moore BCJ. 1989. An Introduction to the Psy-
AUDITORY PSYCHOPHYSICS AND PERCEPTION 483