Professional Documents
Culture Documents
Professional singers have an exciting "ringing" extra formant improved the ability of the vocal tract
voice quality in the singing performance. The pres- to transfer sound (4). In spite of many investigations
ence of this quality, which corresponds to the so- (1-6), no uniform agreement of the definition of the
called singer's formant in the spectrum envelope, singer's formant exists.
enhances the singer's ability to be heard without This paper attempts to provide a new parameter
amplification over an orchestra (1). Seidner et al. of spectrum analysis for quantitative evaluation of
reported that the center frequency of the singer's singing voice quality rather than the presence or
formant varies with pitch between 2.3 and 3.0 kHz absence of the singer's formant. From previous
in basses and between 3.0 and 3.8 kHz in tenors (2). studies (1-6), the center frequency of the singer's
Sundberg reported that the center frequency of the formant lies roughly between 2 and 4 kHz in power
singer's formant varies, depending on the voice spectrum. In the first part of the present study, re-
type, and was approximately 2.2 kHz in basses, 2.7 gardless of the existence of the extra peak, the
kHz in baritones, 2.8 kHz in tenors, and 3.2 kHz in greatest harmonics peak between 2 and 4 kHz was
altos (3). He also reported that there was an extra identified in spectrum display, and its power was
formant in sung vowels between the third and measured in contrast to the power of the greatest
fourth formants of the spoken vowels and that this harmonics peak between 0 and 2 kHz in trained
singers (professional and nonprofessional) and
nonsingers. In the second part of the study, to as-
Accepted November 28, 1995.
Address correspondence and reprint requests to Dr. Koichi certain whether the greatest harmonics peak be-
Omori, Department of Otolaryngology, Kyoto University Hos- tween 2 and 4 kHz affects singing voice quality,
pital, 54 Kawahara-cho, Shogoin, Sakyo-ku, Kyoto 606 Japan. power of the peaks was varied by digital signal pro-
This paper was presented at the Voice Foundations' 24th an-
nual symposium, Care of the Professional Voice, Philadelphia, cessing, and the processed sounds were perceptu-
1995. ally analyzed.
228
SINGING t~OWER RATIO 229
OdB]
-10 SPP
P1 P2
-20 1
-30
FIG. 1. Power spectrum display -40
of a sung voice sample / a/. SPP,
singing power peak; SPR, singing
power ratio; Pl, P2, harmonics -50
peaks between 0 and 2 kHz; P3,
P4, harmonics peaks between 2 -60
and 4 kHz.
-70
-80
-90
-100
0 1 2 3 4 5 kHz
reduced for 6, 12, 18, or 24 dB from the level of the quality (dull voice). Degree of " r i c h n e s s " was
original voice sample. scored 1, 2, 3, 4, 5, 6, or 7, with 7 the richest and 1
All the original sung samples and processed the least rich quality (thin voice).
sound samples of singers were played back through
a 16-bit D/A converter (Digidesign) and a Room-
mate II speaker (Bose). Frequency response of the RESULTS
speaker ranged from 50 Hz to 15 kHz. Each sample
was judged by five experienced voice teachers on Spectrum analysis
two semantic bipolar scales (ringing - dull, rich - In the spectrum envelope between 2 and 4 kHz,
thin). Degree of "ringing" was scored I, 2, 3, 4, 5, two harmonics peaks were identified in 24 cases (15
6, or 7, with 7 the best and 1 the worst ringing males, 9 females) and only one peak was identified
OdB
-10
-20
-30
-40 FIG. 2. Power spectrum display
of the processed sound sample.
-50 P4, harmonics peak (original
voice sample); P4', harmonics
-60 peak with 12-dB reduction (pro-
cessed sample).
-70
-80
-90
-100
0 1 2 3 4 5 kHz
of the sung sample and in SPR of the spoken sam- Prof. Non-prof.
ple. However, SPR of the sung sample in soprano
singers was significantly lower than that in other Singer Non-singer
voice type singers (ANOVA, p < 0.01). Data of F I G . 3. Singing p o w e r r a t i o ( S P R ) o f s u n g s a m p l e / a / in s i n g e r s
SPR of sung sample for each voice type were plot- a n d n o n s i n g e r s . Prof.: p r o f e s s i o n a l s i n g e r s ; N o n - p r o f . : n o n p r o -
fessional singers; O , male; © , female.
ted in Fig. 5.
Relationships between SPR and the age and pe- years and the singers who had voice training 1>4
riod of voice training were statistically analyzed in years. Relationships between SPR of singer's sung
sung samples of the 37 singers. SPR had no rela- sample and the acoustic parameters (fundamental
tionship with the singer's age by Pearson's correla- frequency, frequency of SPP) were statistically an-
tion coefficients. Figure 6 shows a relationship be- alyzed by Pearson's correlation coefficients. SPR
tween SPR and the period of voice training. By had no significant relationships with fundamental
ANOVA, there was a significant difference in SPR frequency and with frequency of SPP.
between the singers who had voice training <4 Relationship between voice type and frequency
of SPP in sung samples of trained singers is shown
TABLE 2. SPR o f sung and s p o k e n / a / i n singers in Fig. 7. By ANOVA, frequency of SPP in soprano
and nonsingers singers was significantly higher than that in other
Sung Spoken voice type singers (p < 0.01). Relationship between
(mean -+ SD) (mean -+ SD) fundamental frequency and frequency of SPP in
Nonsinger Male (n = 10) -21.1 -+ 2.8 - 2 2 . 4 -- 8.7 sung samples of trained singers is shown in Fig. 8.
Female (n = 10) - 2 4 . 2 -+ 6.4 - 2 2 . 9 - 6.1
Total (n = 20) - 2 2 . 7 -+ 5.1 - 2 2 . 7 -- 7.3
By Pearson's correlation coefficients, frequency of
Nonprofessional Male (n = 8) - 11.5 - 8.2 - 19.9 -+ 9.1 SPP had a significant relationship with fundamental
singer Female (n = 8) - 1 6 . 9 --- 3.3 - 2 5 . 4 -- 7.0
Total(n= 16) -14.2-+6.7 -22.6-+8.3
frequency (p < 0.01).
Professional singer Male (n = 8) - 11.8 -+ 2.2 - 18.1 -+ 2.9
Female (n = 13) - 14.0 -+ 4.4 - 2 0 . 6 -+ 4.8 Perceptual analysis
Total (n = 21) -13.1 -+3.8 - 1 9 . 7 -+ 4.2
In Test l, perceptual scores of five listeners were
SPR, singing power ratio. averaged for each sample of original sung vowel
differences of perceptual scores in the two semantic Baritone Tenor Mezzo Soprano
scales between the original voice samples and the soprano
processed sound samples of the smaller harmonics FIG. 5. Voice type and singing power ratio (SPR) of sung sample
peaks (ANOVA). in singers. Horizontal line, mean of SPR.
SPR was no greater peak >4 kHz than the greatest peak
between 2 and 4 kHz. As reported earlier (1-4), the
10 (dB) center frequency of the singer's formant varies from
2.2 to 3.8 kHz. The present study demonstrated that
SPR separated the singers' group from the nonsing-
0 ers' group and also separated sung from spoken
voices. Therefore, the frequency range of our cur-
rent study between 2 and 4 kHz was appropriate to
0
identify the greatest peak in power spectrum that
represents singing voice quality of trained singers.
0 Sundberg (4) reported that the main acoustical
0
contribution to the generation of the singer's for-
mant stems from a clustering of the third, fourth,
-10 and fifth formants. Burns (9) demonstrated that op-
era singers lowered their fourth formant, creating a
o wide-band resonance area. In clustering of these
formants, identification of the exact peak of singer's
§ o o formant is difficult and meaningless. In the percep-
-20 O0 0
0
tual analysis of our current study, power of the
greatest harmonics peak between 2 and 4 kHz (SPP)
had a significant relationship to singing voice qual-
Freq (SPP)
-30 u u I I
j o o 8 0,'0
• 0
O," O
• 0
I oo oo : ,'OO
,~'888 8
t •
¢•0
•• 0 8
0
0 0
I o •"
,.• O o
0 0
O
8o° O
O
2 0 0 0 ~ 2
0 100 200 300 400 500 600(Hz)
! I I ! I
Fundamental frequency 2 3 4 5 6 7
FIG. 8. Relationship between fundamental frequency and fre-
quency of singing power peak (SPP) in sung samples of singers. Score (Original)
FIG. 10. Perceptual scores of ringing for the original voice sam-
ity; the smaller harmonics peak did not. In cases ple and the processed sound sample of singing power peak (SPP)
in Test 2.
that had only one harmonics peak in the spectrum
envelope between 2 and 4 kHz, the one peak also
affected singing voice quality without an extra for- for singing voice quality regardless of the existence
mant. Power of the greatest harmonics peak in the of the extra formant and the exact center frequency
frequency range between 2 and 4 kHz is important of the singer's formant.
Hollien reported that the singer's formant has a
lower amplitude in female voices, particularly so-
Score (Ringing) pranos, than in male voices (8). From our current
study, SPR in soprano singers was significantly
lower than that in other voice type singers, although
o
6 o o
there was no significant difference in SPR between
ooo O o ° / male and female singers. Hollien also reported that
the power of the singer's formant appears to be
5
4
~ ~ooOO° o
closely related to variations in fundamental fre-
quency. Our current study, however, showed that
SPR had no significant correlation to fundamental
0 00
o frequency. The reason may be that SPR is not di-
rectly influenced by vocal fold vibration but by the
3
O O
T A B L E 3. Perceptual scores o f ringing quality in Test 3
2
Power of SPP A B C D E
1 Original 7 7 7 7 6
-30 -20 -10 () 1() (dB) 6-dB reduction
12-dB reduction
6
6
6
5
5
3
5
4
6
4
18-dB reduction 4 3 2 3 3
Singing power ratio 24-dB reduction 4 I 2 2 2
FIG. 9. Relationship between singing power ratio (SPR) and per- SPP, singing power peak.
ceptual scores of ringing in the original sung samples in Test I. A, B, C, D, E: listeners.