Professional Documents
Culture Documents
WS 2015/16
Tutorial
© Fraunhofer IDMT
Block Diagram of a Perceptual Audio Encoder
• loudness
• critical bands
• masking:
• frequency domain
• time domain
• binaural cues (overview)
© Fraunhofer IDMT
Structure of the Human Ear
© Fraunhofer IDMT
Structure of the Human Ear
Ossicles archways
concentrates the
sound waves
pinna
cochlea with
ear canal organ of Corti
© Fraunhofer IDMT
Human Ear Functions (1/3)
Outer Ear
Functions:
• sound energy collection
• sound energy transition to the Ear Drum
Advantages:
• Ear drum protection
• Enables the inner ear to be positioned close to the brain
Middle Ear
Functions:
• eardrum – transforms sound wave into vibrations
• ossicles - transfer the mechanical vibrations to the cochlea
© Fraunhofer IDMT
Human Ear Functions (2/3)
Inner Ear
• cochlear structure - induces travelling waves along the length of the
basilar membrane
• neural receptors - connected along the length of the basilar
membrane
• convert these travelling into chemical and electrical signals
Source:
http://bio1152.nicerweb.com/Locked/media/ch50/ 6
© Fraunhofer IDMT
Human Ear Functions (3/3)
Example
Source:
http://147.162.36.50/cochlea/cochleapages/overview/history.htm 7
© Fraunhofer IDMT
Sound Preprocessing in the Peripheral System
Source: http://cochlearimplanthelp.com/journey/choosing-a-cochlear- Source: Yuli You “Audio Coding Theory and Applications ”
implant/electrodes-and-channels/
8
© Fraunhofer IDMT
Sound Perception
© Fraunhofer IDMT
Frequency and Level Range of Human Hearing
© Fraunhofer IDMT
Loudness and Loudness Levels
• Loudness level LN: the sound pressure of a 1 kHz tone which is as loud
as the sound (measured in ‘phon’)
[1] American National Standards Institute, "American national psychoacoustical terminology" S3.20, 1973, American Standards Association. 11
© Fraunhofer IDMT
Loudness (1/2)
© Fraunhofer IDMT
Loudness (2/2)
• Loudness Scale:
© Fraunhofer IDMT
Critical Bands
© Fraunhofer IDMT
Critical Band
Band - a range of frequencies.
• The middle of the band is called the center frequency
Source: http://cochlearimplanthelp.com/journey/choosing-a-cochlear-implant/electrodes-and-
channels/
15
© Fraunhofer IDMT
Frequency Grouping Bandwidth The Critical Bands
© Fraunhofer IDMT
Critical Bands: Bark Scale
- A non-linear frequency scale modeling the resolution of the
human hearing system.
• scale ranges from 1 to 24, unit “Bark” (after Zwicker)
• one Bark corresponds to one critical band
...
Source: https://en.wikipedia.org/wiki/Bark_scale
17
© Fraunhofer IDMT
Frequency Grouping in Human Hearing
• Adding Loudness
– Tones in a critical band: their energy adds up, but the perceived
combined loudness is only slightly greater than either sound alone
o By using tones under the threshold in quiet, their intensity add up in a
critical band and are now audible
– 2 tones of equal loudness which are widely separated in pitch: total
sound is about twice as loud
Source: http://hyperphysics.phy-
astr.gsu.edu/hbase/sound/loud.html 18
© Fraunhofer IDMT
Masking
© Fraunhofer IDMT
Masking
• Basic principle:
- a test signal, called a maskee is placed at the center frequency of
the critical bandwidth
- one masking signal, called masker (equal power and distance from
maskee)
- If the Pmaskee is weak relative to the total power of the maskers
the test signal is not audible test signal is masked
Source: http://www.nptel.ac.in/courses/117105083/pdf/ssg_m9l28.pdf 20
© Fraunhofer IDMT
Masking of Pure Tones by Noise - Broad-Band Noise
•broad-band noise:
•white noise from 20 Hz
- 20 kHz
•figure:
•masking thresholds for
pure tones masked by
broad band noise of
different density levels
lWN
© Fraunhofer IDMT
Masking of Pure Tones by Noise - Narrow-Band Noise
•narrow-band noise:
•noise with a bandwidth
equal or smaller than critical
bandwidth
•figure:
•Threshold s of pure tones
masked by narrow-band
noise for different centre
frequencies
•difference between
maximum of masked
threshold and test tone level
Fig: Zwicker, Fastl “Psychoacoustics - Facts and Models”, 2nd Edition,
1999.
© Fraunhofer IDMT
Temporal Masking Effects
© Fraunhofer IDMT