Professional Documents
Culture Documents
the first group discriminated two glides if the F0 interval was -20000
between 1.5 and 2 st, the second group had a discrimination
threshold of at least 4 st, and the third group only responded to 5kHz
4kHz
the tonal end points of the glides. A detailed overview on pitch 3kHz
perception can be found in ’t Hart, Collier & Cohen 1990 [13]. 2kHz
Phonetic form and linguistic function 1kHz
Remijsen & van Heuven 1999 [10] did the same for Dutch.
Both observed falling pitch associated with statements and ris- 50
falls with larger F0 intervals rather than slightly falling con- Fig. 1: Signal “Ulf stickt” [ lf th kh th] (engl.“Ulf stitches”) with
tours. a final voicing duration of 70 ms and an F0 interval of +6.5 st.
95 95 95 95 95 95
−2.5st −2.5st −2.5st −2.5st −2.5st −2.5st
85 85 85 85 85 85
80 80 80 80 80 80
0 10 20 0 10 20 30 0 10 20 30 40 0 10 20 30 40 50 0 10 20 30 40 50 60 0 10 20 30 40 50 60 70
time [ms] time [ms] time [ms] time [ms] time [ms] time [ms]
Fig. 2: All F0 contours for the final voiced stretch of the utterance in discrete steps (dark) and interpolated (gray). F0 interval values
are between -4.0 and +6.5 semitones (st). Glottal excitation impulses are located at 0 ms, each visible step, and at the end of a contour.
66
100
20 ms df F p Variance explained
30 ms
90
40 ms
Duration 2.304 47.943 0.00 17.8%
80 50 ms F0 interval 3.025 207.109 0.00 76.9%
60 ms
70 ms
Interaction 9.811 14.391 0.00 5.3%
70
judged as question [%]
100 100
20 ms 20 ms
90 30 ms 90 30 ms
40 ms 40 ms
80 50 ms 80 50 ms
60 ms 60 ms
70 ms 70 ms
70 70
judged as question [%]
60 60
50 50
40 40
30 30
20 20
10 10
0 0
−4 −2.5 −1 0.5 2 3.5 5 6.5 −4 −2.5 −1 0.5 2 3.5 5 6.5
F0 interval [semitones] F0 interval [semitones]
Fig. 4: Mean results of 7 subjects capable of differentiating be- Fig. 5: Mean results of 7 subjects not differentiating between
tween statements and questions for short voicing duration. statements and questions for short voicing duration.
67
900
20 ms
quency of occurence of statements in natural speech and/or the
850
30 ms concept of markedness of interrogatives.
40 ms
The experimental results can be related to categorical per-
mean reaction time [ms]
50 ms
800 60 ms ception effects of sentence modality (see e.g. Schneider & Lint-
70 ms
fert 2003 [11]) but this will be discussed when further experi-
750 ments are carried out.
700 4. Conclusions
650
Our findings permit the conclusion that a voicing duration 50
ms is long enough to carry communicatively relevant intonation
600 contours. An utterance-final rising pitch contour with a constant
−4 −2.5 −1 0.5 2 3.5 5 6.5
F0 interval [semitones]
F0 interval of 2 semitones or more and a voicing duration of at
least 50 ms leads to unanimously identified interrogative modal-
Fig. 6: Mean reaction times as a function of F0 interval (from ity. Even at durations of 20 ms and 30 ms a significant number
-4 to 6.5 semitones) and voicing duration (from 20 ms to 70 ms). of listeners (approx. 25% of all subjects) is able to consistently
identify sentence modality.
for the 20 ms stimuli with rising F0 contours the reaction time Whether declarative or interrogative modality of an utter-
values remain high indicating that subjects had severe problems ance with short voicing is perceived mainly depends on the F0
in judging these stimuli. interval between onset and offset of the last syllable. This inves-
tigation provides first indication that the slope of the F0 contour
3. Discussion (e.g. in st/s) is probably not the crucial acoustic property.
68