Professional Documents
Culture Documents
This paper has a tripartite focus: (1) to establish the best segmentation for two
American English texts according to inter-rater agreement measurements. By
doing this, we differentiate the behavior of experts and non-experts annotators.
The experts’ annotation constitute the basis for the analysis; (2) to capture, meas-
ure, and analyze the phonetic features that correlate with boundaries, as they
are marked by the expert annotators; (3) to informationally annotate prosodic
units according to the Language into Act Theory, and analyze their correspond-
ing information structure: in order to do this, we make and justify decisions in
marking the reference units and assigning informational value to prosodic units;
additionally we further discuss some cases of major disagreements.
1. Introduction
We assume, in line with many other scholars, that speech is prosodically segmented
into intonation units (IU). By IUs we mean prosodic organizations that encapsulate
a certain amount of segmental material. IUs are separated by boundaries that are
generally clearly perceivable. Their perception is cued by combinations of acoustic
parameters, but these combinations are yet to be determined (Barbosa & Raso,
2018), although some parameters are commonly considered in the literature as
playing a strong role in cueing boundary perception. A full list should include, at
least, pause, f0 parameters, duration parameters, intensity parameters, rhythmic
parameters, and voice quality change.
https://doi.org/10.1075/scl.94.10ras
© 2020 John Benjamins Publishing Company
310 Tommaso Raso et al.
Prosodic boundaries, however, are not always of the same type. We must dis-
tinguish at least between conclusive and continuative boundaries. Different kinds
of continuative boundaries can probably be found; for example, some presenting
a specific continuative tone, and some lacking such a clear tone, but nonetheless
perceived as non-conclusive (Teixeira, Barbosa, & Raso, 2018).1
Despite the rich literature on the prosodic segmentation of speech (see Barth-
Weingarten, 2016, for a survey), it is clear that we still need to improve our method-
ology in order to completely capture this phenomenon and its physical correlates. In
this paper, we try to give a contribution in a very interesting exercise together with
other scholars from different traditions and native languages, all of us concentrating
on the same excerpts of two English texts, one dialogic and one monologic (see
Introduction to Part II). We will proceed as follows:
Our attempt was primarily to segment the texts without any theoretical influence
(as far as this is possible), and only then to analyze the functional effects of the seg-
mentation. This means that we take IUs as having a mainly functional motivation
in the organization of speech.
To decide how to segment the excerpts of the texts Hearts and Navy extracted from
the Santa Barbara Corpus (Du Bois, Chafe, Meyer, & Thompson, 2000–2005), we
asked 16 annotators to perform the task. Among them, there were six expert an-
notators with good or very good knowledge of English; the remaining ones were
either less or not skilled in speech segmentation, or they had less competence in
1. The possibility that we should distinguish among different types of non-conclusive bound-
aries is supported by the first results of a project in which two authors of this paper participate.
The project aims at creating an automatic tool for spontaneous speech segmentation trained
through human segmentation (see Mittmann & Barbosa, 2016; Teixeira, 2018; Teixeira et al.,
2018). The partial results presently point that it is much easier to produce one unique model for
detecting conclusive boundaries (so far, the model reaches an agreement higher than 80% with
human annotators using less than 10 measurements) than for continuative ones (for which the
agreement is much lower – around 45% – if we use just one model, but reaches almost 100% with
the use of three different models, with 8 to 10 measurements).
The general Kappa for the monologic text was 0.71 for all the annotators and 0.76
for the six experts; in both cases the results were very good. The agreement for
terminal boundaries was 0.72 for all the annotators and 0.91 for the six experts; the
agreement for non-terminal boundaries was 0.59 for all the annotators and 0.71
for the six experts. Also, for this text we did a test challenging only the difference
between presence versus absence of boundary, and we reached the same result, that
is, 0.76, for both groups. Again, we can conclude that the decision regarding the
presence versus absence of a boundary does not seem to require a special expertise,
whereas expertise does seem to be important to decide the nature of a boundary.
3. Phonetic measurements
Our final segmentation was done according to those of the six expert annotators
(Appendices A and B). We will make some comments in case of strong disagree-
ment across annotations. To better understand the possible acoustic cues that
guides an annotator’s perception, we took the following measurements, reported
in Appendices A and B:
Concerning the difference in articulation rate (8), for the units with more than
four syllables, we also calculated the difference in articulation rate considering
just the last four syllables at the left and/or the first four syllables at the right of the
boundary. In no case did these differences turn out statistically significant, but this
may be due to insufficient number of measurements; in fact, according to Quené
(2007) and other studies, the JND for articulation rate is 5%. As Appendices A and
B show, all the boundaries in the two texts feature differences higher than 5% in
articulation rate between the unit at the left and the unit at the right (or between the
last four syllables at the left and the first four at the right). This allows for us to say
that articulation rate can often be considered an important feature in determining
perception of boundaries.
In addition, stress groups were automatically located using the SG_Detector
script (Barbosa, 2013), which provides the normalization (smoothed z-scores) of the
durations of VV units. The reference durations (means and standard deviations) for
the English segments, based on British English, were taken from Campbell (1992).
Figure 1 shows the relation among intonation unit boundaries (breaks), du-
ration peaks (stress group boundaries), and pauses for the text Navy. Intonation
unit boundaries are signaled in red; the highest peaks indicate terminal bounda-
ries, while the intermediate and lowest ones indicate non-terminal boundaries and
boundaries produced by fragmentation phenomena respectively. The normalized
duration curve is indicated in yellow; the different heights of the peaks reflects
the normalized durations of the VV units. When the boundaries include a pause,
a blue vertical line is added; the size of the blue bar is proportional to the pause
duration. The numbering on the x-axis refers to the VV units into which the text
was segmented to calculate stress groups and normalized duration. Figure 2 shows
the same measurements for the text Hearts.
Pause
smoothed z
Break
1 5 9 13 17 21 25 29 33 37 41 45 49 53 57 61 65 69 73 77 81 85 89
Pause
smoothed z
Break
Turn
1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39
For the functional analysis, we follow the principles of the Language into Act Theory
(L-AcT; Cresti, 2000; Moneglia, 2005; Moneglia & Raso, 2014; see also Bossaglia,
Mello, & Raso, this volume; Cresti, this volume; Cresti & Moneglia, this volume).
We consider as the communicative reference unit for speech what we can call
terminated sequence (TS), defined as the smallest stretch of speech that is both
pragmatically and prosodically interpretable in isolation. The pragmatic interpret-
ability is conveyed by the presence of at least one illocutionary unit; the prosodic
interpretability is conveyed by a boundary that yields the perception of conclusion.
The Kappa scores for the segmentations showed that the very strong agreement
concerning the presence versus absence of a boundary becomes less evident when
the annotators must decide about the nature of the boundary. This could be inter-
preted as a weakening of the concept of reference unit based on the perception of
terminality. We will come back to this point.
TSs can coincide with one IU or with a sequence of more than one IU. They
coincide with one IU when the IU carries the illocutionary force (and therefore the
pragmatic interpretability) and its boundary conveys conclusion. We can call these
kind of TS “simple utterances”. We found five such cases in the dialogic text, Hearts:
1. Wait //; 4. <you’ve never> played hearts //; 7. oh //; 8. okay //; 9. I’ll teach you //.
All the TSs of the monological text, Navy, and the other TS of Hearts should
be considered complex sequences, since they are formed by more than one IU, ac-
cording to most of our annotators. We can distinguish between two different kinds
of complex TS: complex utterances and stanzas. Complex utterances show only one
pattern, which can vary in complexity. In this case, we find one illocution (or two or
more patterned illocutions, as we will show) with other non-illocutionary units that
form a pattern around the illocution. Stanzas are formed by more than one pattern:
Each pattern is made up of one illocution (mandatory) and non-illocutionary op-
tional units that prosodically depend on a specific illocution; the different patterns
are juxtaposed and separated by a continuation prosodic tone at the boundary (or by
a prosodic break that does not convey the perception of terminality) (Cresti, 2009).
Most of the disagreement on the nature of the boundary (conclusive/terminal
vs. continuative/non-terminal) coincide with prosodic boundaries that do not fea-
ture a clear continuation tone, which is conveyed by at least a rising movement and
a final lengthening.2 These boundaries are positioned after the accomplishment of
an illocutionary unit, but they exhibit prosodic cues that, although lacking a clear
2. See Silber-Varod (2011) for an interesting description of five different continuative boundaries.
continuation tone, nevertheless do not convey the perception that the unit is fully
concluded. Once the illocution is accomplished, this kind of boundary can easily
lead to disagreement in the assessment of the boundary. In fact, the annotator,
especially a non-trained one, can either consider the sequence as a concluded one,
since no continuation tone is perceived, or place a non-terminal boundary, since,
despite the lack of a continuation tone, conclusion is not fully perceived. These cases
are those with less agreement among the annotators. Therefore, we can say that,
to perceive a terminated sequence, it is mandatory to have the accomplishment
of the illocution; but when we have neither a clear continuation tone nor a strong
reason to perceive terminality (end of a turn or a clear final profile), the perception
of complete conclusion can easily be subjective. In our excerpts, a good example of
this kind of potentially ambiguous break is the boundary after unit 7 in Navy (naval
headquarters). Here, no explicit continuation tone is present, but at the same time,
the falling profile ends higher than the lowest f0 level of the unit. Moreover, the
prominence on the first syllable of the last word (HEADquarters) can interfere with
the perception of the boundary. It is not surprising that this is the break that exhib-
its the highest disagreement (either terminal or non-terminal). No other position
presents such a strong uncertainty between terminal and non-terminal boundary.
Interestingly, the disagreement is not strong among the six experts (5 non-terminal
vs. 1 terminal), but is high among the other 10 (6 vs. 4), with a preference for a
solution different from that chosen by the experts. A different, though interesting,
case is unit 9 (this is terribly awkward). Here, f0 ends at a very low level, but no
syllabic lengthening is featured; additionally, this may lead to uncertainty with
respect to the perception of the terminality or non-terminality of the boundary.
Nevertheless, in this case, only three annotators did not chose to mark terminality.
These observations are intended to explain disagreements related to the nature of
specific boundaries, and can suggest that the distinction between terminal and
non-terminal boundaries be insufficient. We probably need to distinguish at least
among different kinds of non-terminal boundaries (for a discussion, see Barbosa
& Raso, 2018; for a first attempt to model terminal and non-terminal boundaries,
see Teixeira & Mittmann, 2018, and Teixeira et al., 2018).
We find the following complex patterns in Hearts: 2–3. play novice / I’ve never
played hearts before <in my life> //; 5–6. No / I don’t know how to play it //; 10–11.
passing disabled / <that’s you> //. As we can observe, their complexity is limited to
just two IUs. None of them comprises a stanza.
The situation in the monological text Navy is much different. Not only do we
have mostly complex sequences, but, what is more, these sequences are usually
more complex than those in Hearts. We can divide them into complex utterances
and stanzas (we will come back to the difference later). The only complex utterance
seems to be the last one: 25–26. you / are in the ues Navy now //. All the other se-
quences are stanzas: 1–9. so I went to this / &m / what I thought was my friend /
&he / &he / this navy captain down at the / naval headquarters / I said / this is terribly
awkward //; 10–17. I’ve just been promoted from / third mate / to second mate / and /
and / could we / possibly postpone these orders for a little bit //; 18–24. my friend /
stood up / behind his desk / in his full &f / four stripes / and said / Lieutenant //
These differences between the two texts reflect well the common structural
distinction between interactive dialogic texts and monologues.
cognitive domain for the interpretation of the illocutionary force. When no Topic is
present, the illocution is “unloaded” onto the context. We have found three different
forms of Topic; all of them are found in Italian, Brazilian and European Portuguese,
and in English (Cavalcante, 2016, 2018; Raso et al., 2017). In this text, there are two
Topics with two different forms, the other one being my friend (18).
We have come now to the analysis of the three stanzas: (1–9), (10–17), and
(18–24) respectively.
The first one is made up of seven intonation units (1–9) (plus two very small
retracted units with one incomplete word), one of which is due to a fragmentation
phenomenon (the): so I went to this / &m [/1] what I thought was my friend / &he /
&he[/1]this navy captain down at the / naval headquarters / I said / this is terribly
awkward //. This stanza features two patterns, the first encompassing units 1–7
and the second encompassing just the last two units. The illocutions of the two
patterns are respectively so I went to this / … this navy captain down at the / naval
headquarters and this is terribly awkward, in both cases the functional focus being
on the last word. Notice that the first illocution is made up of three intonation units;
in fact, after the first one, the illocution is interrupted by a parenthetic, and then a
fragmentation phenomenon yields the perception of a boundary in the second part
of the illocution. When an information unit is formed by two or more units, we call
the units before the last one scanning (SCA) units, thus making it clear that they are
part of a larger information unit. The parenthetic, a well-known unit in the litera-
ture (Schneider, 2007; Tucci, 2004), can be identified by its prosodic and functional
features: different f0 profile (usually lower) from what precedes and what follows it,
lower intensity and different articulation rate; functionally, the parenthetic provides
comments about the locutive content of the pattern or part of it; distributionally, it
can occupy almost any position, even inside another information unit, like in this
case; the only position it cannot occupy is the absolute initial position of a TS. The
second pattern, besides the illocution, features a Locutive Introducer (INT) (Giani,
2004; Maia Rocha & Raso, 2011). This unit basically has the function of introduc-
ing a meta-illocution, for example, as in this case, a reported speech. Its prosodic
features are falling profile and higher articulation rate, as well as clear contrasts in
f0, intensity and range with the units in reported speech.
The second pattern is made up of units 10–17, according to our segmentation:
I’ve just been promoted from / third mate / to second mate / and [/1] and / could we /
possibly postpone these orders for a little bit //. We can analyze this segmentation as the
two following functional patterns: the first would be organized around two patterned
illocutions (the second and the third units) preceded by a long INT. The second one
would be realized by units 14–17. Here a fragmentation phenomenon occurs (and
[/1] and /). The second and can be analyzed in two ways: either as a scanning unit
or a unit called Discourse Connector (DCT). This unit has been described as a unit
that connects different sub patterns in stanza or that begins utterances by marking
continuity with the previous one (Frosali, 2008; Raso, 2014); nevertheless, no specific
prosodic features have been found so far for DCTs, apart from their tendency to
exhibit long duration (Raso & Ferrari, forthcoming). Therefore, we still need more
research to understand whether DCT really is a specific IU or should be treated as a
SCA. But this is a minor issue for our purpose. If we do not distinguish between SCA
and DCT, the whole second pattern would be made up of three SCA IUs composing
one single illocutionary unit.
There are different possible segmentations that deserve to be mentioned. The
first pattern could be I’ve just been promoted from third mate / … This alternative
was chosen by three of the six expert annotators; as for the other three experts, two
put a boundary after from and the other put a boundary between promoted and from
(showing that he perceives a boundary approximately in the same position). With
this latter segmentation, we could interpret the first unit (I’ve just been promoted
from) as a Locutive Introducer that announces an illocutionary pattern (CMM),
which is sort of isolated so as to be emphasized. Our measurements support the
choice for this second alternative.
The second pattern could be could we / possibly postpone these orders / for a little
bit //. This was the preferred solution among the 16 annotators (11 vs. 5), but not
among the six experts (2 vs. 4). Segmenting as the majority did, we would have a
unit of Appendix of Comment (for a little bit), that is, a unit that integrates the text
of the illocution. The prosodic parameters of this unit are basically a falling profile
and low intensity. Our measurements do not actually support the placement of a
boundary after orders; moreover, for a little bit can easily be interpreted as a coda
(i.e., a post-nuclear part) of the illocution. From a functional point of view, the two
interpretations are not significantly different.
The third stanza (18–24) features a Topic in the first unit (my friend), three
illocutionary juxtaposed units (stood up / behind his desk and in his full &f[/1] four
stripes), the third one being made up of two units (there is a SCA boundary proba-
bly due to a fragmentation phenomenon), and a last pattern formed by a Locutive
Introducer (and said) and a strong illocution of recall (Lieutenant) in reported
speech (which continues in the last utterance).
5. Final remarks
We have conducted three different tasks on the two English texts. (1) we asked 16
annotators to segment the texts and observed the inter-rater agreement. Doing so,
we differentiated within the 16 annotators and additionally considered the agree-
ment among the six higher trained subjects. This allowed us to observe that, while
the perception of boundaries is a very natural task, the distinction of their nature,
whether terminal or non-terminal, is significantly improved by training; (2) we
phonetically analyzed the boundary region, following what the literature considers
as the major features responsible for conveying boundary perception. This allowed
us to confirm the presence of the phonetic features mentioned in different phonetic
studies where the annotators marked the boundaries, and at the same time to make
a decision in a few cases of clear disagreement. This decision was made following
phonetic findings, trying therefore to avoid theoretical motivations; (3) a func-
tional analysis of the intonation units was implemented only after the segmentation,
which was based on (1) and (2). Therefore, we tried to maintain the segmentation
task independent from the task represented by the assignment of a functional value
to the units that resulted from the segmentation process.
Acknowledgements
The authors would like to acknowledge CAPES, CNPq and FAPEMIG for the research grants
that have made writing this chapter possible.
References
Barbosa, P. A. (2013). Semi-automatic and automatic tools for generating prosodic descriptors
for prosody research. In B. Bigi & D. Hirst (Eds.), TRASP 2013 Proceedings (Vol. 13, pp.
86–89). Aix-en-Provence: Laboratoire Parole et Langage.
Barbosa, P. A., & Raso, T. (2018). Spontaneous speech segmentation: Functional and prosodic
aspects with applications for automatic segmentation. Revista de Estudos da Linguagem,
26(4), 1361–1396.
Barth-Weingarten, D. (2016). Intonation units revisited. Cesura in talk-in-interaction. Amsterdam:
John Benjamins. https://doi.org/10.1075/slsi.29
Bossaglia, G., Mello, H., & Raso, T. (this volume). Illocution as a unit of reference for spontane-
ous speech: An account for insubordinated adverbial clauses in Brazilian Portuguese. In S.
Izre’el, H. Mello, A. Panunzi, & T. Raso (Eds.), In search of basic units of spoken language: A
corpus-driven approach. Amsterdam: John Benjamins.
Campbell, W. N. (1992). Syllable-based segmental duration. In G. Bailly, C. Benoıt, & T. R.
Sawallis (Eds.), Talking machines: Theories, models, and designs (pp. 221–224). Amsterdam:
North-Holland.
Cavalcante, F. (2016). The topic unit in spontaneous American English (Unpublished Master’s
thesis). Universidade Federal de Minas Gerais, Brazil.
Cavalcante, F. (2018). The Information Unit of Topic: A Crosslinguistic, Statistical Study Based
on Spontaneous Speech Corpora (Unpublished PhD Dissertation). Universidade Federal
de Minas Gerais, Brazil.
Cresti, E. (2000). Corpus di italiano parlato. Firenze: Accademia della Crusca.
Cresti, E. (2011). The definition of focus in Language into Act Theory (L-AcT). In H. Mello, A.
Panunzi, & T. Raso, Pragmatics and prosody, illocution, modality, attitude, information pat-
terning and speech annotation (pp. 39–82). Firenze: Firenze University Press.
Cresti, E. (2018). The illocution-prosody relationship and the information pattern in spontane-
ous speech according to the Language into Act Theory. In M. Heinz & M. C. Moroni (Eds.),
Prosody: Grammar, information structure, interaction. Linguistik Online, 88(1), 33–62.
https://doi.org/10.13092/lo.88.4187
Cresti, E. (this volume). The pragmatic analysis of speech and its illocutionary classification
according to Language into Act Theory. In S. Izre’el, H. Mello, A. Panunzi, & T. Raso (Eds.),
In search of basic units of spoken language: A corpus-driven approach. Amsterdam: John
Benjamins.
Cresti, E., & Moneglia, M. (this volume). Some notes on the excerpts according to L-AcT. In S.
Izre’el, H. Mello, A. Panunzi, & T. Raso (Eds.), In search of basic units of spoken language: A
corpus-driven approach. Amsterdam: John Benjamins.
Du Bois, J. W., Chafe, W. L., Meyer, C., & Thompson, S. A. (2000–2005). Santa Barbara corpus of
spoken American English, Part 1–4. Philadelphia, PA: Linguistic Data Consortium.
Fleiss, J. L. (1971). Measuring nominal scale agreement among many raters. Psychological Bul
letin, 76, 378–382. https://doi.org/10.1037/h0031619
Frosali, F. F. (2008). L’unità di informazione di ausilio dialogico: Valori percentuali, caratteri
intonativi, lessicali e morfo-sintattici in un corpus di italiano parlato (C-ORAL-ROM). In
E. Cresti (Ed.), Prospettive nello studio del lessico italiano (pp. 417–424). Firenze: Firenze
University Press.
Giani, D. (2004). Una strategia di costruzione del testo parlato: L’introduttore locutivo. In F. A.
Leoni, F. Cutugno, M. Pettorino, & R. Savy (Eds.), Atti del convegno “Il parlato italiano”
Napoli, 13–15.02 2003 (pp. 84–97). Napoli: M. D‟Auria.
t’Hart, J. (1981). Differential sensitivity to pitch distance, particularly in speech. Journal of the
Acoustical Society of America, 69(3), 811–821. https://doi.org/10.1121/1.385592
Maia Rocha, B., & Raso, T. (2011). A unidade informacional de introdutor locutivo no portu-
guês do Brasil: Uma primeira descrição baseada em corpus. Domínios de Linguagem, 5(1),
327–343.
Mittmann, M., & Barbosa, P. A. (2016). An automatic speech segmentation tool based on multiple
acoustic parameters. CHIMERA: Romance Corpora and Linguistic Studies, 3(2), 133–147.
Moneglia, M. (2005). The C-ORAL-ROM resource. In E. Cresti & M. Moneglia (Eds.), C-ORAL-
ROM: Integrated reference corpora for spoken Romance languages (pp. 1–70). Amsterdam:
John Benjamins. https://doi.org/10.1075/scl.15.03mon
Moneglia, M., & Raso, T. (2014). Notes on Language into act theory. In T. Raso & H. Mello (Eds.),
Spoken corpora and linguistic studies (pp. 468–495). Amsterdam: John Benjamins.
https://doi.org/10.1075/scl.61.15mon
Quené, H. (2007). On the just noticeable difference for tempo in speech. Journal of Phonetics,
35(3), 353–362. https://doi.org/10.1016/j.wocn.2006.09.001
Raso, T. (2014). Prosodic constraints for discourse markers. In T. Raso & H. Mello (Eds.), Spoken
corpora and linguistic studies (pp. 412–467). Amsterdam: John Benjamins.
https://doi.org/10.1075/scl.61.14ras
Raso, T., Cavalcante, F., & Mittmann, M. (2017). Prosodic forms of the topic information unit in
a cross-linguistic perspective: A first survey. In A. De Meo & F. M. Dovetto, La comunicazi-
one parlata/Spoken communication (pp. 473–498). Roma: Aracne.
Raso, T., Ferrari, L. Uso dei Segnali Discorsivi in corpora di parlato spontaneo italiano e brasi-
liano. In: Ferroni, R., Birello, M. (forthcoming). La competenza discorsiva a lezione di lingua
straniera. Roma: Aracne.
Raso, T., Mittmann, M., & Oliveira Mendes, A. C. (2015). O papel da pausa na segmentação
prosódica de corpora de fala. Revista de Estudos da Linguagem, 23(3), 883–922.
https://doi.org/10.17851/2237-2083.23.3.883-922
Raso, T., & Rocha, B. (2017). Illocution and attitude: On the complex interaction between pros-
ody and pragmatic parameters. Journal of Speech Science, 5, 5–27.
Rocha, B., & Raso, T. (2016). The interaction between illocution and attitude, and its conse-
quences for the empirical study of illocutions. In C. Bardel & A. De Meo (Eds.), Parler les
langues romanes: Proceedings of the international GSCP conference (Stockholm, 2014) (pp.
69–88). Napoli: Università degli Studi L’Orientale.
Schneider, S. (2007). Reduced parenthetical clauses as mitigators: A corpus study of spoken French,
Italian and Spanish. Amsterdam: John Benjamins. https://doi.org/10.1075/scl.27
Senn, P., Kompis, M., Vischer, M., & Haeusler, R. (2005). Minimum audible angle, just notice-
able interaural differences and speech intelligibility with bilateral cochlear implants using
clinical speech processors. Audiology and Neurotology, 10, 342–352.
https://doi.org/10.1159/000087351
Silber-Varod, V. (2011). The SpeeCHain perspective: Prosody-syntax interface in spontaneous
spoken Hebrew (Unpublished doctoral dissertation). Tel- Aviv University, Israel.
Teixeira, B. H. (2018). Correlatos fonético-acústicos de fronteiras prosódicas na fala espontânea
(Unpublished Master’s thesis). Universidade Federal de Minas Gerais, Belo Horizonte,
Brazil.
Teixeira, B. H., Barbosa, P., & Raso, T. (2018). Automatic detection of prosodic boundaries in Bra
zilian Portuguese spontaneous speech. In A. Villavicencio, V. Moreira, A. Abad, H. Caseli, P.
Gamallo, C. Ramisch, H. G. Oliveira, & G. H. Paetzold (Eds.), Computational processing of
the Portuguese language (pp. 429–437). New York, NY: Springer.
Teixeira, B. H., & Mittmann, M. M. (2018). Acoustic models for the automatic identification
of prosodic boundaries in spontaneous speech. Revista de Estudos da Linguagem, 26(4),
1455–1488.
Tucci, I. (2004). L’inciso: Caratteristiche morfosintattiche e intonative in un corpus di riferi
mento. In F. A. Leoni, F. Cutugno, M. Pettorino, & R. Savy (Eds.), Atti del convegno “Il par
lato italiano” (CD-ROM -14 p.). Napoli: M. D’Auria.
Rank Speaker IU and boundary Function Dur Art. rate Pause F0 Movement Intensity
Rank IU and boundary Function Dur Art. rate Pause F0 Mov. Intensity
type
(smoothed z) (syll./s) (s) (st) (dB)
Rank IU and boundary Function Dur Art. rate Pause F0 Mov. Intensity