Professional Documents
Culture Documents
To cite this article: Hailey C. Kopera & Maria I. Grigos (2019): Lexical stress in childhood apraxia
of speech: acoustic and kinematic findings, International Journal of Speech-Language Pathology,
DOI: 10.1080/17549507.2019.1568571
Article views: 19
Department of Communicative Sciences and Disorders, New York University, New York, NY, USA
Abstract
Purpose: This study investigated the acoustic and articulatory movement parameters underlying lexical stress production
in children with apraxia of speech (CAS), children with articulation/phonological delay (i.e. speech delay, SD), and chil-
dren with typical speech-language development (TD). We examined whether there were group differences in these instru-
mental measures of stress production.
Method: Participants were 24 children (seven CAS, eight SD, nine TD) between three and seven years of age. Acoustic
and kinematic measures, including acoustic duration, peak and average fundamental frequency, and jaw movement dur-
ation and displacement, were taken from perceptually accurate productions of a strong-weak form. Relative stress analyses
were conducted using the Pairwise Variability Index (PVI).
Result: There was a significant difference between the CAS and TD groups in the PVI for movement duration, with the
CAS group showing a smaller movement duration contrast between stressed and unstressed syllables. There were no sig-
nificant group differences for displacement or any of the acoustic variables.
Conclusion: The kinematic findings suggest reduced temporal control for lexical stress production in children with CAS.
This finding surfaced during analyses of perceptually accurate productions but suggests a possible basis for lexical stress
errors in CAS that could be explored in future studies.
Keywords: Childhood apraxia of speech (CAS); lexical stress; acoustic measures; kinematic measures
Correspondence: Hailey C. Kopera, Department of Communicative Sciences and Disorders, New York University, 665 Broadway, 9th Floor, New York,
NY 10012, USA. E-mail: hcs280@nyu.edu
ISSN 1754-9507 print/ISSN 1754-9515 online ß 2019 The Speech Pathology Association of Australia Limited
Published by Taylor & Francis
DOI: 10.1080/17549507.2019.1568571
2 H. C. Kopera and M. I. Grigos
used in the present study (e.g. Grigos, Moss, & Lu, forms. Interestingly, movements for WS forms were
2015; Smith, Sadagopan, Walsh, & Weber- less variable than for SW forms in both groups, even
Fox, 2010). though the former is often considered to be more
challenging. The authors suggested that children may
Correlates of stress in typical speakers be able to use established rhythmic behaviours, such
as those used during babbling, to produce SW forms
Acoustic correlates but need to implement more specified (i.e. less vari-
Stressed and unstressed syllables are associated with able) movement sequences to produce highly-modu-
differences in duration, fundamental frequency (F0), lated WS forms.
and intensity (Fry, 1958; Liberman, 1960), as well as Taken together, previous kinematic findings sug-
vowel formant structure (Fry, 1965; Howell, 1993). gest that lexical stress production is associated with
While attempts have been made to rank these acous- various changes in articulator movement. In an
tic cues according to their importance in both stress articulatory model of stress production, de Jong
production (e.g. Liberman, 1960) and perception (1995) describes stress as a “local shift toward hyper-
(e.g. Fry, 1958), large individual differences in cue articulate speech” (p. 493), which encompasses a var-
use have been reported (Howell, 1993). Additionally, iety of articulatory changes that increase a syllable’s
studies have provided evidence for trading relation- distinctness. This includes actions such as increased
ships among acoustic cues (Howell, 1993; Kehoe, oral opening in stressed vowels, increased oral closure
Stoel-Gammon, & Buder, 1995; Liberman, 1960), in stressed consonants, and stress-related changes
where the absence of a given stress cue may be offset along a horizontal dimension, such as increased
by use of another. These findings support the notion retraction and protrusion (de Jong, 1995). An
that stress is not realised by a single, consistent acous- important component of the hyperarticulation model
tic parameter (e.g. Hayes, 1995) but rather a combin- is that stress is viewed in terms of “abstract articula-
ation of cues which may vary between and within tory goals” (p. 501) rather than being articulator-
individuals. Although children as young as 18 months specific. Thus, a variety of different articulators and
of age have been shown to use F0, intensity, and dur- strategies (i.e. movement amplitude, timing, etc.)
ation to mark stress (Kehoe et al., 1995), develop- may be used to achieve the speaking goal. The pre-
mental differences in lexical stress production have sent study examines the acoustic and articulatory cor-
been reported. For instance, Ballard, Djaja, Arciuli, relates of stress reviewed above in relation to CAS.
James, and van Doorn (2012) found that adults and
children as young as three did not differ in their con- Stress in CAS
trol of relative duration, intensity, or F0 for strong-
Prosodic deficits, including stress errors, have been
weak (SW; i.e. trochaic) forms, but differences were
noted as a characteristic of CAS since some of the
observed between adults and seven-year-old children
earliest descriptions of this disorder (e.g. Rosenbek &
for weak-strong (WS; i.e. iambic) forms.
Wertz, 1972; Yoss & Darley, 1974), and there has
been much interest in atypical prosody as a potential
Articulatory correlates
indicator of CAS. For instance, a series of studies by
Stress production is thought to involve the actions of Shriberg et al. (1997a, 1997b) compared the speech
multiple subsystems (pulmonary, laryngeal, articula- and prosody-voice profiles of three separate samples
tory), resulting in a range of physiological correlates of children with suspected CAS to children with
(Kent & Netsell, 1971). In terms of articulation, speech delay and found that only stress significantly
Kent and Netsell (1971) found that vowels in differentiated a portion of children in these groups.
stressed, compared to unstressed, syllables tended to The most common stress error pattern was described
be produced with lower jaw positions (i.e. greater as excessive/equal phrasal stress, but lexical stress
opening) and greater lingual displacement toward the errors were also noted. While the present study fur-
target. Similarly, Kelso, Vatikiotis-Bateson, Saltzman, ther investigates stress in CAS, we note that recent
and Kay (1985) demonstrated that articulatory work within the prosodic domain has also focussed
movements associated with stressed syllables were on phrasing errors (i.e. inappropriate pauses) to
generally larger in displacement, longer in duration, develop a Pause Marker for differentiating CAS from
and higher in velocity compared to unstressed sylla- speech delay (e.g. Shriberg et al., 2017).
bles. In a study with developmental implications, A number of studies have examined the acoustic
Goffman and Malin (1999) demonstrated that adults correlates of stress in children with CAS. A common
and children (three- to four-year-olds) produced dis- focus throughout this research has been on determin-
tinct lip movement patterns for SW and WS forms. ing whether acoustic measures of stress differentiate
For adults, both forms showed evidence of amplitude children with CAS from those without this disorder
modulation (i.e. larger movement displacement for (Munson et al., 2003; Shriberg et al., 2003; Skinder
stressed compared to unstressed syllables), but this et al., 1999), but there are numerous methodological
was greater for WS forms. Children also showed evi- differences that limit the ability to directly compare
dence of amplitude modulation but only in WS these works. These include differences in stimuli,
3
participant group studied in comparison to children children with CAS from other children clearly
with CAS, and type of lexical stress analysis (e.g. sep- remains unresolved.
arate analyses of each acoustic variable versus use of a
composite measure).
Purpose and research questions
In Skinder et al.’s (1999) study, children with CAS
and typically-developing peers imitated di- and multi- Previous research has provided perceptual as well
syllabic words and interrogative and emotive senten- as some acoustic support that stress production
ces, and listeners judged the stress accuracy of the differs in children with CAS compared to children
children’s productions. As a group, the children with without this disorder. The conflicting acoustic
CAS were judged to be less accurate than the typic- findings from past work and lack of articulatory/
ally-developing children for all types of stimuli, but kinematic studies of stress production in CAS
acoustic analysis (conducted for two of the sentences) motivate the need to develop a clearer under-
showed that children in both groups used F0, ampli- standing of the patterns of speech motor control
tude, and duration similarly to mark sentential stress. that underlie lexical stress production in children
Subsequently, Munson et al. (2003) investigated the with and without CAS. Therefore, the purpose of
acoustic correlates of stress in children with CAS in this study was to examine whether acoustic and/or
comparison to children with phonological disorder. kinematic parameters associated with accurate lex-
The participants in this study imitated SW and WS ical stress production differentiate children with
nonwords, and listeners judged whether the produc- CAS from children with SD and TD children.
tions had initial or final stress. Similar to Skinder Specifically, the following research questions were
et al.’s (1999) findings, the listeners judged the pro- addressed: (1) Do children with CAS, SD, and
ductions of the CAS group to match the target stress TD differ in their modification of acoustic cues to
pattern less often than those of the phonological dis- mark lexical stress? (2) Do children with CAS,
order group, but no significant differences were noted SD, and TD produce different oral articulator
in the groups’ use of acoustic cues (vowel duration, movement patterns to mark lexical stress? Based
F0 and amplitude at vowel midpoint, or timing of the on perceptual descriptions of equalised stress in
pitch peak in stressed syllables) to mark stress. Both CAS (e.g. Rosenbek & Wertz, 1972; Shriberg
Skinder et al. (1999) and Munson et al. (2003) pos- et al., 1997a; Yoss & Darley, 1974), we predicted
ited that reduced segmental accuracy of the children that children with CAS would demonstrate
with CAS compared to their comparison groups may reduced acoustic and kinematic contrasts between
have contributed to the lower stress accuracy ratings strong and weak syllables compared to children
for the CAS group, despite the lack of group differen- with SD and TD.
ces in the acoustic analyses. Munson et al. (2003)
also raised the possibility that other acoustic corre- Method
lates of stress which were not assessed in their study,
such as measures of vowel and consonant articula- Participants
tion, may have differed between the groups and con- Twenty-four children between the ages of three and
tributed to the perceptual findings. seven years participated in this study: seven with
The results of the Skinder et al. (1999) and CAS, eight with SD, and nine TD controls. These
Munson et al. (2003) studies differ from those of children were part of a larger study (n ¼ 33; 11 CAS,
Shriberg et al. (2003), which demonstrated differences 11 SD, 11 TD) examining articulatory control in
in acoustic measures of lexical stress between children childhood apraxia of speech (Grigos et al., 2015).
with CAS and those with speech delay. Shriberg et al. From this larger group of children, nine children
(2003) studied imitated productions of SW words in (four CAS, three SD, two TD) were excluded from
11 participants with CAS and 24 with speech delay. the present study because they did not produce a
Composite Lexical Stress Ratios were derived from required minimum of three tokens that met the ana-
participants’ average ratios of strong to weak syllable lysis criteria, described below. This minimum of three
measures of vowel duration, frequency area (frequency analysable productions was set to allow intra-partici-
average vowel duration), and amplitude area (ampli- pant patterns and/or variability to be observed. The
tude average vowel duration). Five of the six Lexical groups were similar in age (CAS: M ¼ 63.6 months,
Stress Ratios that fell at the upper and lower extremes St Dev ¼ 13.3; SD: M ¼ 65.0 months, St Dev ¼
of the distribution were obtained by children with 10.7; TD: M ¼ 66.6 months, St Dev ¼ 12.5). There
CAS, which was significantly more than expected by was one female participant in the SD group, one in
chance. The authors interpreted their findings as sup- the TD group, and the remaining participants were
port for the Lexical Stress Ratio as a possible diagnos- male. All participants were monolingual speakers of
tic marker for CAS. However, in light of conflicting American English.
acoustic findings from prior studies (Munson et al., All of the children in this study completed a stand-
2003; Skinder et al., 1999), the question of whether ardised speech, language, cognitive, and oral motor
acoustic measures of stress are able to differentiate testing protocol and a hearing screening as part of
4 H. C. Kopera and M. I. Grigos
additions, and distortions consistent with an articula- effects on the unstressed syllable (Ballard et al.,
tion impairment (e.g. interdental lisp, r-distortion) 2012). Thus, lexical stress analyses were conducted
and/or systematic use of phonological processes (e.g. on the SW form puppy within the puppypop
velar fronting, gliding) consistent with a phonological productions.
impairment. These error patterns had to be observed Participants were introduced to the character pup-
in more than one speaking context. Critically, chil- pypop through a prop and verbal models of the char-
dren in the SD group did not exhibit a combination of acter’s name. As the experimenter told a story, the
the three features highlighted in ASHA’s (2007) tech- prop was used to prompt the children to produce the
nical report or motor speech/sequencing deficits on target in response to a question or cloze sentence.
the VMPAC. There was not an expressive language Approximately 10–15 productions of puppypop were
requirement for children in the CAS and SD groups elicited from each participant in a randomised order
since speech production deficits may impact expres- with the other target words. Participants were period-
sive language skills. Children were assigned to the ically exposed to additional models of the target
TD group if they had negative histories of speech, throughout the story, but direct imitations were
language, hearing, or neurological problems and avoided whenever possible. This elicitation paradigm
demonstrated the following: normal structure and was selected over tasks such as repetition or picture
function of the oral mechanism based on the naming in order to encourage naturalistic produc-
VMPAC, age-appropriate expressive language skills tions of the targets.
based on the TELD-3 (not lower than one standard
deviation below the mean), and age-appropriate Data selection
articulation skills in single-word productions and con-
Participants’ productions of puppypop were narrowly
nected speech. The latter was determined using a
transcribed by the first author. Each participant’s first
GFTA-2 cut-off score of 85 and normative data pro-
five accurate and appropriate productions were selected
vided in Smit, Hand, Freilinger, Bernthal, and Bird
for acoustic and kinematic analysis, resulting in a
(1990) and Hodson and Paden (1981).
maximum data set of 35 productions for the CAS
group (five seven participants), 40 for the SD group
Data collection
(five eight participants), and 45 for the TD group
Data collection took place in a sound-attenuated (five nine participants), or 120 total. The target
booth in the Department of Communicative Sciences number was set at five per participant since this was
and Disorders at New York University. A motion cap- attainable for most of the children. However, six par-
ture system (Vicon 460; Vicon Motion Systems, ticipants did not reach five analysable productions,
2001) was used to track articulator movement in including three participants that had four (all CAS)
three dimensions during a word production task at a and three participants that had three (one CAS, one
sampling rate of 120 frames per second. Participants SD, one TD). Thus, the actual data set was reduced
were seated in a chair that faced three motion capture to 111 productions (30 CAS, 38 SD, 43 TD).
cameras. Seven 3-mm reflective markers were placed Accurate tokens were those that were judged to be per-
on the lips (midline of the vermilion border of the ceptually correct on both segmental and supraseg-
upper lip, midline of the vermilion border of the lower mental (i.e. lexical stress) levels. Analyses were
lip, right and left corners of the mouth) and jaw (mid- limited to correct productions for several reasons.
line, right and left sides) to track movement of these First, since the interest in this study was potential
articulators. In addition, five reference markers were group differences in acoustic and kinematic parame-
placed on the forehead (midline, right and left sides) ters of lexical stress, this decreased the likelihood that
and nose (nasion and tip). Data collection sessions any observed differences would be due to segmental
were recorded using a Sony digital video camera. A effects caused by articulation errors. Additionally,
series of Audio Video Interleave files were created underlying deficits in speech motor control have been
during each session, which were converted into observed in perceptually correct productions (e.g.
Waveform Audio (WAV) files with a 22,050 Hz sam- Smith et al., 2010), further justifying the analysis of
pling rate. accurate productions. Fully accurate productions of
Children participated in a task that elicited mul- puppypop were always selected first. However, given
tiple productions of single-word targets in the context that this multisyllabic token was challenging for some
of a story. Productions of several different target of the participants to produce, certain exceptions
words (pop, puppet, puppypop) were elicited from each were made if five fully accurate productions of puppy-
participant as part of the larger project, but only the pop were not available. The criteria for acceptable seg-
procedures and analyses from puppypop (/’pˆ.pi."pap/) mental errors were those that did not involve, or were
will be reported here. This “word” represented not directly adjacent to, the portion of the target to be
a character’s name in the story and was chosen analysed (puppy). Therefore, acceptable errors could
for the present study because its trisyllabic affect the vowel or final consonant in the third,
structure enabled the analysis of a SW stress pattern unanalysed, syllable of the original three-syllable tar-
(puppy, /’pˆ.pi/) while avoiding final lengthening get (puppypop), such as vowel substitution/distortion
6 H. C. Kopera and M. I. Grigos
and final consonant substitution/deletion. A total of change in its periodicity (e.g. elimination of period-
nine productions with acceptable segmental errors icity, change in complexity).
were analysed, including one from the CAS group,
three from the SD group, and five from the TD Acoustic duration
group. Appropriate tokens were those that were free of
Acoustic duration was measured from vowel onset
factors that could prohibit or compromise valid
(i.e. the point of consonant release) to vowel offset,
acoustic measures of lexical stress parameters, such as
which included voice onset time and any aspiration
background noise, rising intonation (as in questioning that occurred during that time. While the onset of
or guessing), and abnormal vocal characteristics that voicing is sometimes used as the starting point for
are considered unrelated to stress (Kehoe acoustic measures of vowel duration (e.g. Munson
et al., 1995). et al., 2003, Peterson & Lehiste, 1960), we selected
the present approach because it is more similar to the
Reliability movement duration measure used in this study. In
Two additional transcribers experienced in narrow support of this decision, we note that previous studies
transcription independently listened to and tran- that have taken both acoustic and kinematic measures
scribed a subset of the puppypop productions to of vowel duration have measured acoustic duration
ensure that those selected for acoustic and kinematic from the release of the preceding consonant (e.g.
analysis were perceived as accurate by more than one Edwards, Beckman, & Fletcher, 1991).
listener. One participant from each group was ran-
domly selected for this process, and transcribers were Fundamental frequency
blind to group membership. Each transcriber listened For F0 measures, the pitch range was set from 100 to
to the participant’s full inventory of puppypop produc- 550 Hz based on the range typically used by the par-
tions (i.e. the five productions selected for analysis ticipants. In order to decrease consonantal effects on
and the remainder of their puppypop productions, the F0 measures, the initial and final 15 ms of the
including any with segmental and/or suprasegmental voiced region of each vowel was eliminated; if the
errors) so that the transcribers were not only exposed voiced region was less than 40 ms long, then one
to correct productions. The transcribers were quarter of its duration was eliminated (Levi, 2005).
instructed to judge if each production was accurate or On several occasions, tokens selected for analysis
inaccurate, with accurate productions being those showed obvious pitch track errors that could not be
perceived as correct on both segmental and supraseg- resolved by adjusting pitch settings. These were
mental levels. Then, judgments on the productions replaced with the next accurate/appropriate token
selected for analysis were compared and percentage from the inventory, if available. On the four occasions
of agreement was calculated. The average agreement where replacement tokens were not available, the
with the other transcribers was 90% for puppypop vowel was cropped using the procedure described
judgments, rising to 97% when limiting judgments to above, followed by additional manual cropping until
the analysed portion of the word (puppy). the analysis region only contained a normal pitch
line. Maximum (“peak”) and average F0 were meas-
Acoustic measures ured for each cropped vowel using Praat’s automatic,
Acoustic analyses were carried out using Praat autocorrelation method.
(Boersma & Weenink, 2011). Acoustic duration and
F0 measures were taken from the first and second Kinematic measures
vowels in each disyllabic form puppy (V1:/ˆ/, V2:/i/). Jaw movement was tracked from the right jaw marker
While intensity was also an acoustic parameter of to reduce error associated with chin surface tracking
interest, the absence of a head-mounted microphone (Green, Wilson, Wang, & Moore, 2007). Reference
during data collection limited the validity of our markers on the forehead and nose determined orien-
intensity measures, precluding their use. Vowel onsets tation and accounted for head movement/rotation.
and offsets were identified using a broadband spectro- The jaw signal was derived by subtracting its y coord-
gram with a window length of 0.005 seconds to deter- inate from the stationary forehead markers (jaw -
mine general boundary regions and the acoustic forehead). The kinematic traces were analysed using
waveform for final decisions (Turk, Nakai, & MATLAB, version 7.5 (MathWorks, 2007). As
Sugahara, 2006). Vowel onset was marked at the described in Grigos et al. (2015), the onset and offset
release of the preceding consonant (Peterson & of the movement sequence for each puppypop produc-
Lehiste, 1960), as indicated by the release burst on tion was determined using the jaw displacement tra-
the spectrogram and waveform. Vowel offset was jectory. Movement onset was marked as 10 frames
determined by a combination of cues (e.g. Jacewicz, (0.083 seconds) prior to the maximum (“peak”) jaw
Fox, & Salmons, 2011; Turk et al., 2006), including closing displacement for the initial consonant (puppy-
the offset of second formant energy in the spectro- pop), and movement offset was marked as 10 frames
gram and a decrease in waveform amplitude and/or a (0.083 seconds) after the peak jaw opening
7
Limitations and future directions release. Our results suggest difficulty controlling
temporal aspects of speech movements as a possible
This study was retrospective in nature since we con-
basis for lexical stress errors which are prevalent in
ducted lexical stress analyses on data that had been
previously collected as part of a larger study (Grigos CAS and should continue to be explored in
et al., 2015). A future, prospective study would allow future studies.
greater control over the design of the stimuli and the
experimental conditions. Across groups and depend- Acknowledgements
ent measures, negative PVIs were occasionally
observed for individual, trial-level responses. Recall The authors acknowledge Julie Case, Panagiota
that negative PVIs indicate larger values of the meas- Keough, Penelope Elias, and Jessica Storer for assist-
ured parameter (e.g. duration, F0) for the second syl- ance with data collection and processing. Portions of
lable compared to the first, which was not the this work were presented at the 2013 American
expected pattern for the SW form analysed in this Speech-Language-Hearing Association Convention in
study, puppy. Since productions were elicited using a Chicago, Illinois and the 2014 Conference on Motor
multisyllabic stimulus (as in Ballard et al., 2012) and Speech in Sarasota, Florida.
analyses were limited to productions that were per-
ceived to have correct stress, the occurrence of nega-
tive PVIs may relate more to the intrinsic
characteristics of the vowels in puppy rather than Disclosure statement
suprasegmental effects such as final lengthening or
stress. For instance, V1 (/ˆ/) and V2 (/i/) differ along No potential conflict of interest was reported by
at least two dimensions, vowel tension and height, the authors.
that have been associated with duration and F0,
respectively (Small, 1999; Whalen & Levitt, 1995). Funding
The vowel /ˆ/ is lax and low-mid in height, whereas /i/
This research was supported by National Institute on
is typically classified as tense and high, which may
Deafness and Other Communicative Disorders
explain why children sometimes produced an unex-
Grant [R03DC009079] awarded to Maria Grigos.
pected duration pattern (i.e. longer V2 than V1) and/
or F0 pattern (i.e. higher F0 for V2 compared to V1)
in perceptually accurate productions of the SW tar- References
get. In light of this stimulus consideration, along with American Speech, Language, Hearing Association. (2007).
factors such as small sample size and a limited num- Childhood Apraxia of Speech [Technical Report]. Retrieved from
ber of accurate/appropriate productions available for http://www.ASHA.org/policy
analysis, the results of this study should be confirmed Bahr, R.H. (2005). Differential diagnosis of severe speech disor-
ders using speech gestures. Topics in Language Disorders, 25,
in a prospective research study with (a) larger groups
254–265. doi:10.1097/00011363-200507000-00008
of participants, and (b) a wider variety of stimulus Ballard, K.J., Djaja, D., Arciuli, J., James, D.G.H., & van
items. If future work continues to find differences in Doorn, J. (2012). Developmental trajectory for production of
the degree of lexical stress contrasts between children prosody: Lexical stress contrastivity in children ages 3 to 7
with and without CAS, it will be important to explore years and in adults. Journal of Speech, Language, and Hearing
the specific factors that may contribute to this (i.e. is Research, 55, 1822–1835. doi:10.1044/1092-4388(2012/11-
a smaller degree of contrast due to reduced stress on 0257)
the stressed syllable or to excess stress on the Ballard, K.J., Robin, D.A., McCabe, P., & McDonald, J. (2010).
A treatment for dysprosody in childhood apraxia of speech.
unstressed syllable?). Finally, future studies should
Journal of Speech, Language, and Hearing Research, 53,
also directly explore a potential contribution of stop
1227–1245. doi:10.1044/1092-4388(2010/09-0130)
closure duration to the present study’s findings. Bates, D., Maechler, M., Bolker, B., & Walker, S. (2015).
Fitting linear mixed effects models using lme4. Journal of
Conclusion Statistical Software, 67, 1–48. doi:10.18637/jss.v067.i01
Boersma, P., & Weenink, D. (2011). Pratt 5.3.03 [Computer
The purpose of this study was to examine whether Software]. Amsterdam: Institute of Phonetic Sciences.
acoustic and/or articulatory correlates of stress differ- Burgmeister, B., Blum, L., & Lorge, I. (1972). Columbia Mental
entiate children with CAS from those with SD and Maturity Scale. San Antonio, TX: The Psychological
TD. Children with CAS were found to produce a Corporation.
smaller durational contrast between stressed and Davis, B.L., Jakielski, K.J., & Marquardt, T.P. (1998).
Developmental apraxia of speech: Determiners of differential
unstressed syllables in perceptually accurate produc-
diagnosis. Clinical Linguistics & Phonetics, 12, 25–45. doi:
tions compared to children with TD. This difference
10.3109/02699209808985211
was only revealed through kinematic measures of jaw de Jong, K.J. (1995). The supraglottal articulation of promin-
movement duration that may have included portions ence in English: Linguistic stress as localized hyperarticula-
of consonantal closure and not through acoustic tion. Journal of the Acoustical Society of America, 97, 491–504.
measures of vowels as measured from consonantal doi:10.1121/1.412275
11
Edwards, E., Beckman, M.E., & Fletcher, J. (1991). The articu- Liberman, P. (1960). Some acoustic correlates of word stress in
latory kinematics of final lengthening. Journal of the Acoustical American English. The Journal of the Acoustical Society of
Society of America, 89, 369–382. doi:10.1121/1.400674 America, 32, 451–454. doi:10.1121/1.1908095
Fry, D.B. (1958). Experiments in the perception of stress. Low, E.L., Grabe, E., & Nolan, F. (2000). Quantitative charac-
Language and Speech, 1, 126–152. doi:10.1177/ terizations of speech rhythm: Syllable-timing in Singapore
002383095800100207 English. Language and Speech, 43, 377–401. doi:10.1177/
Fry, D.B. (1965). The dependence of stress judgments on vowel 00238309000430040301
formant structure. In E. Zwirner & W. Bethge (Eds.), MathWorks. (2007). MATLAB Version 7.2 [Computer software].
Proceedings of the 5th international congress of phonetic sciences Natick, MA: MathWorks.
(pp. 306–311). Basel: Karger. Munson, B., Bjorum, E.M., & Windsor, J. (2003). Acoustic and
Goffman, L. (1999). Prosodic influences on speech production perceptual correlates of stress in nonwords produced by chil-
in children with specific language impairment and speech def- dren with suspected developmental apraxia of speech and
icits: Kinematic, acoustic, and transcription evidence. Journal children with phonological disorder. Journal of Speech,
of Speech, Language, and Hearing Research, 42, 1499–1517. Language, and Hearing Research, 46, 189–202. doi:10.1044/
doi:10.1044/jslhr.4206.1499 1092-4388(2003/015)
Goffman, L., & Malin, C. (1999). Metrical effects on speech Nijland, L., Maassen, B., van der Meulen, S., Gabreels, F.,
movements in children and adults. Journal of Speech, Kraaimaat, F.W., & Schreuder, R. (2003). Planning of sylla-
Language, and Hearing Research, 42, 1003–1015. doi:10.1044/ bles in children with developmental apraxia of speech.
jslhr.4204.1003 Clinical Linguistics and Phonetics, 17, 1–24. doi:10.1080/
Goldman, R., & Fristoe, M. (2000). Goldman-Fristoe Test of 0269920021000050662
Articulation – 2. Circle Pines, MN: American Guidance Odell, K.H., & Shriberg, L.D. (2001). Prosody-voice character-
Services, Inc. istics of children and adults with apraxia of speech. Clinical
Green, J.R., Wilson, E.M., Wang, Y., & Moore, C.A. (2007). Linguistics & Phonetics, 15, 275–307. doi:10.1080/
Estimating mandibular motion based on chin surface targets 02699200010021800
during speech. Journal of Speech, Language, and Hearing Peterson, G.E., & Lehiste, I. (1960). Duration of syllable nuclei
Research, 50, 928–939. doi:10.1044/1092-4388(2007/066) in English. The Journal of the Acoustical Society of America, 32,
Grigos, M.I., Moss, A., & Lu, Y. (2015). Oral articulatory con- 693–703. doi:10.1121/1.1908183
trol in childhood apraxia of speech. Journal of Speech, R Core Team. (2018). R: A Language and Environment for
Language, and Hearing Research, 58, 1103–1118. doi:10.1044/ Statistical Computing. Vienna, Austria: R Foundation for
2015_JSLHR-S-13-0221 Statistical Computing.
Hall, P.K., Jordan, L.S., & Robin, D.A. (1993). Developmental Rosenbek, J.C., & Wertz, R.T. (1972). A review of fifty cases of
apraxia of speech: Theory and clinical practice. Austin, TX: Pro- developmental apraxia of speech. Language, Speech, and
Ed. Hearing Services in the Schools, 3, 23–33. doi:10.1044/0161-
Hayden, D.A., & Square, P.A. (1999). VMPAC: Verbal Motor 1461.0301.23
Production Assessment for Children. San Antonio, TX: Schmitt, L.S., Howard, B.H., & Schmitt, J.F. (1983).
Psychological Association. Conversational speech sampling in the assessment of articula-
Hayes, B. (1995). Metrical stress theory: Principles and case studies. tion proficiency. Language, Speech, and Hearing Services in
Chicago, IL: The University of Chicago Press. Schools, 14, 210–214. doi:10.1044/0161-1461.1404.210
Hodson, B.W., & Paden, E.P. (1981). Phonological processes Shriberg, L.D., Aram, D.M., & Kwiatkowski, J. (1997a).
which characterize intelligble and unintelligible speech in Developmental apraxia of speech: II. Toward a diagnostic
early childhood. Journal of Speech and Hearing Disorders, 46, marker. Journal of Speech, Language, and Hearing Research,
369–373. doi:10.1044/jshd.4604.369 40, 286–312. doi:10.1044/jslhr.4002.286
Howell, P. (1993). Cue trading in the production and perception Shriberg, L.D., Aram, D.M., & Kwiatkowski, J. (1997b).
of vowel stress. Journal of the Acoustical Society of America, 94, Developmental apraxia of speech III: A subtype marked by
2063–2073. doi:10.1121/1.407479 inappropriate stress. Journal of Speech, Language, and Hearing
Hresko, W., Reid, D., & Hammill, D. (2007). Test of Early Research, 40, 313–337. doi:10.1044/jslhr.4002.313
Language Development. Austin, TX: Pearson Assessment. Shriberg, L.D., Austin, D., Lewis, B.A., McSweeny, J.L., &
Jacewicz, E., Fox, R.A., & Salmons, J. (2011). Regional dialect Wilson, D.L. (1997c). The percentage of consonants correct
variation in the vowel systems of typically developing chil- (PCC) metric: Extensions and reliability data. Journal of
dren. Journal of Speech, Language, and Hearing Research, 54, Speech, Language, and Hearing Research, 40, 708–722. doi:
448–470. doi:10.1044/1092-4388(2010/10-0161) 10.1044/jslhr.4004.708
Kehoe, M., Stoel-Gammon, C., & Buder, E.H. (1995). Acoustic Shriberg, L.D., Campbell, T.F., Karlsson, H.B., Brown, R.L.,
correlates of stress in young children’s speech. Journal of McSweeny, J.L., & Nadler, C.J. (2003). A diagnostic marker
Speech and Hearing Research, 38, 338–350. doi:10.1044/ for childhood apraxia of speech: The lexical stress ratio.
jshr.3802.338 Clinical Linguistics & Phonetics, 17, 549–574. doi:10.1080/
Kelso, J.A.S., Vatikiotis-Bateson, E., Saltzman, E.L., & Kay, B. 0269920031000138123
(1985). A qualitative dynamic analysis of reiterant speech Shriberg, L.D., Strand, E.A., Fourakis, M., Jakielski, K.J., Hall,
production: Phase portraits, kinematics, and dynamic model- S.D., Karlsson, H.B., … Wilson, D.L. (2017). A diagnostic
ing. Journal of the Acoustical Society of America, 77, 266–280. marker to discriminate childhood apraxia of speech from
doi:10.1121/1.392268 speech delay: I. Development and description of the pause
Kent, R.D., & Netsell, R. (1971). Effects of stress contrasts on marker. Journal of Speech, Language, and Hearing Research,
certain articulatory parameters. Phonetica, 24, 23–44. doi: 60, S1096–S1117. doi:10.1044/2016_JSLHR-S-15-0296
10.1159/000259350 Skinder, A., Strand, E.A., & Mignerey, M. (1999). Perceptual
Kuznetsova, A., Brockhoff, P.B., & Christensen, R.H.B. (2017). and acoustic analysis of lexical and sentential stress in chil-
lmerTest Package: Tests in linear mixed effects models. dren with developmental apraxia of speech. Journal of Medical
Journal of Statistical Software, 82, 1–26 .doi:10.18637/ Speech-Language Pathology, 7, 133–144.
jss.v082.i13 Small, L.H. (1999). Fundamentals of phonetics: A practical guide
Levi, S.V. (2005). Acoustic correlates of lexical accent in for students. Needham Heights, MA: Allyn & Bacon.
Turkish. Journal of the International Phonetic Association, 35, Smit, A.B., Hand, L., Freilinger, J.J., Bernthal, J.E., & Bird, A.
73–97. doi:10.1017/S0025100305001921 (1990). The Iowa articulation norms project and its
12 H. C. Kopera and M. I. Grigos
Nebraska replication. Journal of Speech & Hearing Disorders, Velleman, S.L., & Strand, K. (1994). Developmental verbal
55, 779–798. doi:10.1044/jshd.5504.779 dyspraxia. In J. E. Bernthal & N. W. Bankson (Eds.),
Smith, A., Sadagopan, N., Walsh, B., & Weber-Fox, C. (2010). Child phonology: Characteristics, assessment, and intervention
Increasing phonological complexity reveals heightened with special populations (pp. 110–139). New York: Thieme
instability in inter-articulatory coordination in adults who Medical.
stutter. Journal of Fluency Disorders, 35, 1–18. doi:10.1016/ Vicon Motion Systems. (2001). Vicon 460 [Computer software].
j.jfludis.2009.12.001
Los Angeles, CA: Author.
Stathopoulos, E.T., & Weismer, G. (1983). Closure duration of
Whalen, D.H., & Levitt, A.G. (1995). The universality of intrin-
stop consonants. Journal of Phonetics, 11, 395–400.
sic F0 of vowels. Journal of Phonetics, 23, 349–366. doi:
Turk, A., Nakai, S., & Sugahara, M. (2006). Acoustic segment
10.1016/S0095-4470(95)80165-0
durations in prosodic research: A practical guide. In S.
Winter, B. (2013). Linear models and linear mixed effects mod-
Sudhoff, D. Lenertova, R. Meyer, S. Pappert, P. Augurzky, I.
Mleinek, N. Richter & J. Schlieber (Eds.), Methods in empir- els in R with linguistic applications. arXiv:1308.5499.
ical prosody research (pp. 1–28). Berlin: Mouton de Gruyter. Retrieved from http://arxiv.org/pdf/1308.5499.pdf
Velleman, S.L., & Shriberg, L.D. (1999). Metrical analysis of Yoss, K.A., & Darley, F.L. (1974). Developmental apraxia of
the speech of children with suspected developmental apraxia speech in children with defective articulation. Journal of
of speech. Journal of Speech, Language, and Hearing Research, Speech and Hearing Research, 17, 399–416. doi:10.1044/
42, 1444–1460. doi:10.1044/jslhr.4206.1444 jshr.1703.399