Professional Documents
Culture Documents
Research Article
Purpose: First, we sought to extend our knowledge of Results: Standard scores were significantly lower for
second language (L2) receptive compared to expressive bilingual children with PLI than those without PLI. An L2
narrative skills in bilingual children with and without primary receptive–expressive gap existed for bilingual children with
language impairment (PLI). Second, we sought to explore PLI at kindergarten but dissipated by first grade. Using
whether narrative receptive and expressive performance in single pictures during narrative generation compared to
bilingual children’s L2 differed based on the type of multiple pictures during narrative generation or no pictures
contextual support. during narrative retell appeared to minimize the presence
Method: In a longitudinal group study, 20 Spanish–English of a receptive–expressive gap.
bilingual children with PLI were matched by sex, age, Conclusions: In early stages of L2 learning, bilingual children
nonverbal IQ score, and language exposure to 20 bilingual with PLI have an L2 receptive–expressive gap, but their
peers with typical development and administered the Test typical development peers do not. Using a single picture
of Narrative Language (Gillam & Pearson, 2004) in English during narrative generation might be advantageous for this
(their L2) at kindergarten and first grade. population because it minimizes a receptive–expressive gap.
N
arrative tasks are often used to assess the lan- processes (Bates, 1993). A significant discrepancy between
guage of school-age children because they tap a receptive and expressive ability has been a hallmark of
range of functional language skills (as opposed language impairment (Gibson, Jarmulowicz, & Oller,
to decontextualized language assessments that, e.g., ask 2018). Recent research indicates that a receptive–expressive
children to name pictures, point to pictures, and finish gap occurs in vocabulary (Gibson, Oller, Jarmulowicz, &
sentences). Language skills engaged during narratives in- Ethington, 2012) and semantic (Gibson, Peña, & Bedore,
clude but are not limited to children’s ability to remember 2014a) testing for bilingual children with typical develop-
a complete story, link ideas within that story, and orga- ment (TD) and is exacerbated for bilingual children with
nize those ideas around a common theme (Gillam & PLI (Gibson, Peña, & Bedore, 2014b). The current study
Pearson, 2004). Difficulty in performing these tasks is seeks to expand our understanding of the receptive–
associated with language impairment (Tsimpli, Peristeri, expressive gap by investigating the trajectory and develop-
& Andreou, 2016). Furthermore, narrative ability develops ment of narrative receptive and expressive abilities of
both receptively (comprehension) and expressively (pro- bilingual children with and without PLI.
duction; Bishop, 1997), which are similar but dissociable
Narrative Competence
a
Louisiana State University, Baton Rouge
Narratives develop early in life and follow a similar
b
University of California, Irvine trajectory across languages (Berman & Slobin, 1994). In
c
The University of Texas at Austin early development, children identify and link basic elements
Correspondence to Todd A. Gibson: toddandrewgibson@gmail.com of stories, which Stein and Glenn (1979) have identified
Editor: Sean Redmond as settings, characters, actions, and events. By the age of
Associate Editor: Elizabeth Kay-Raining Bird 3 years, children produce true narratives (Applebee, 1978).
Received November 21, 2016 By ages 5–6 years, children produce complex narratives
Revision received May 9, 2017
Accepted February 2, 2018 Disclosure: The authors have declared that no competing interests existed at the time
https://doi.org/10.1044/2018_JSLHR-L-16-0432 of publication.
Journal of Speech, Language, and Hearing Research • Vol. 61 • 1381–1392 • June 2018 • Copyright © 2018 American Speech-Language-Hearing Association 1381
(Peterson & McCabe, 1983; Stein & Glenn, 1979). Similar (1989) tested English-speaking children with and without
trajectories have been observed cross-linguistically, with PLI between ages 9 and 11 years. Participants retold two
well-developed narratives being produced across a variety stories and generated three. To generate the stories, chil-
of languages by the age of 5 years (Berman & Slobin, 1994). dren were given story stems and asked to complete them
The development of narrative competence corre- (Merritt & Liles, 1989, p. 446). For the retell task, children
sponds with and is supported by the development of short- watched a video of the story being told and were then
term memory (Barrouillet, Gavens, Vergauwe, Gaillard, asked to retell the same story. The authors found that nar-
& Camos, 2009; Gathercole & Baddeley, 1993; Gathercole, ratives were longer and contained more story elements
Willis, Emslie, & Baddeley, 1991; Swanson, 2008). Dodwell when they were retold compared to when they were gener-
and Bavin (2008) tested short-term memory as well as ated based on a prompt. Similarly, Westerveld and Gillon
narrative comprehension and production abilities of 6-year- (2010) tested children ages 7;3 to 9;3 on story retell and
olds with and without primary language impairment (PLI) story generation. In the story retell task, children looked
and identified statistically significant correlations between at pictures in a book while listening to a corresponding au-
measures of short-term memory and measures of narrative dio recording. To generate stories, children were asked to
performance. Duinmeijer, de Jong, and Scheper (2012) make up a story based on a single picture of a scene. Story
found similar correlations for Dutch speaking 6- to 9-year- retelling produced the most linguistically complex and
olds with and without PLI. longest stories. However, in the Dutch context, Duinmeijer
Memory is important for remembering narrative et al. (2012) found greater complexity (e.g., embedding)
events and tying those events together coherently, which is associated with retelling, but length and fluency (absence
an important competency in narrative performance. Com- of fillers and repetitions) were enhanced when generating
pare “He ate breakfast. He put on his clothes,” to “He a new narrative.
put on his clothes after he ate breakfast.” The two short
utterances include the same number of elements, but the Influence of Visual Support on Narrative Performance
latter version connects those elements whereas the former Even within these elicitation paradigms (retell vs.
does not. This cohesion is accomplished by cohesive de- generation), task differences affect outcomes. Westerveld
vices (Halliday & Hasan, 1976), such as the word “after” and Vidler (2015) asked typically developing children
in the above example. A large body of research has shown ages 5;3 to 8;9 to follow along with the pictures of a word-
that children with language impairment have difficulty less picture book as the examiner read the story aloud.
with narrative cohesion (Boudreau & Chapman, 2000). Children retold the story both with and without visual aids.
Accomplishing cohesion requires the choice of appropriate Retelling a story with accompanying visual aids elicited
vocabulary and grammar at the utterance level and the longer and more complex narratives. However, as highlighted
connection between distant aspects of a narrative, such as by Duinmeijer et al. (2012), comparisons across these elici-
the initiating event and conclusion. The ability to effec- tation techniques are misleading because they differ in
tively accomplish cohesion at both the utterance and story more than the number of pictures provided. For example,
levels is what constitutes narrative competence. W. M. Pearce (2003) found that children’s narratives
were of higher quality when based on a series of pictures
compared to a single picture. Spinillo and Pinto (1994)
Task Influence on Narrative Performance
found the opposite pattern. Duinmeijer et al. (2012) explained
Performance on narrative assessments varies as a that this discrepancy was likely due to children in the Spinillo
function of the way narratives are elicited (Morris-Friehe & and Pinto (1994) study drawing their own pictures, which
Sanger, 1992). Elicitation techniques range from minimal was a lived experience and likely enhanced significantly the
support (e.g., asking a child simply to tell a story with no children’s narrative performance. This demonstrates that
contextual support; Spinillo & Pinto, 1994) to the highly not only the quantity but also the quality of contextual sup-
supported task of asking a child to retell a story while port influences narrative outcomes.
viewing picture stimuli (Botting, 2002). The task demands
vary considerably under these conditions. For example, Narrative Comprehension Versus Production
because children’s short-term memory was associated with Narrative comprehension and production both re-
scores on self-generated stories but not story retells (i.e., quire the understanding of words and the sentences in
repeating a story that one hears), Dodwell and Bavin (2008) which they are embedded. However, production addition-
proposed that generating one’s own narrative helped to ally requires strong representations that are sufficiently
create strong representations that were easier to remember precise to express narrative meaning. Therefore, although
compared to the representations created by merely listen- children can often understand complex narratives, they may
ing to a story. not be able to produce them at their same level of com-
prehension (Bates, 1993). The result might be a receptive–
Narrative Retell Versus Narrative Generation expressive gap.
Several studies have demonstrated that outcomes It is unclear how differences in narrative tasks might
on narrative retell and narrative generation tasks differ differentially impact a receptive–expressive gap, if at all.
(Botting, 2002; W. M. Pearce, 2003). Merritt and Liles More visual aids (vs. fewer), narrative retell (vs. narrative
1382 Journal of Speech, Language, and Hearing Research • Vol. 61 • 1381–1392 • June 2018
generation), and narrative comprehension (vs. narrative with TD. Bohnacker (2016) administered receptive and
production) all result in fewer memory demands (Baddeley, expressive measures of narratives to 5- and 6-year-old
1999; Mayer and Moreno, 2003; Paivio, 1986). Because Swedish-dominant Swedish–English bilinguals in both of
short-term memory has a limited capacity, we might antici- their languages. Receptive measures focused on narrative
pate greater receptive–expressive gaps when narrative elements by asking children how and why questions to
comprehension is compared to narrative generation (with determine if children could understand the goals and inter-
greater memory demands) than when compared to narrative nal states of the characters. Expressive measures were
retell (with fewer memory demands). based on narratives elicited through a four-picture series.
Results were similar in both languages and showed a
large gap across receptive and expressive modalities, with
Receptive–Expressive Discrepancies in Narratives children more likely to understand than produce these
Tests of language comprehension are often consid- narrative elements. This discrepancy was present for both
ered easier than tests of language production (Gibson the 5- and 6-year-old age groups, despite having TD.
et al., 2014a), but direct comparisons between the two can However, because the degrees of difficulty for receptive
be made by converting raw scores to standard scores compared to expressive tasks were not controlled, inter-
(assuming a co-normed sample). This conversion mathe- preting these results is difficult.
matically controls for the differences in difficulty level.
There is no reason to assume, therefore, that there would
Narrative Gap in Bilingual Children
be discrepancies between receptive and expressive stan-
dard scores for groups of typically developing individuals With and Without PLI
(Gibson et al., 2014a). Indeed, such a discrepancy histori- Knowledge of story elements, such as characters,
cally has been treated as a hallmark of expressive language settings, and events, has been referred to as macrostructure.
disorder (Gibson et al., in press). For example, many stan- Macrostructure appears early in narrative development
dardized tests of omnibus language skills (e.g., Clinical and follows a similar trajectory across languages (Berman
Evaluation of Language Fundamentals–Fifth Edition; Wiig, & Slobin, 1994). Fully formed narratives characterized
Semel, & Secord, 2013; Preschool Language Scales–Fifth by a problem, plot, character development, causal relation-
Edition; Zimmerman, Steiner, & Pond, 2012) provide not ships, and resolution typically have developed by the age
only separate standard scores for receptive and expressive of 5 years (Stein & Glenn, 1979). Similar narrative devel-
performance but also discrepancy comparisons that allow opmental patterns have been identified across a variety of
clinicians to determine if these differences are clinically languages (Berman & Slobin, 1994). A distinct aspect of
meaningful. When receptive skills exceed expressive skills narrative competence is the ability to understand and pro-
by some threshold, results are often interpreted as an ex- duce internal narrative structures (Justice et al., 2006; Liles,
pressive language disorder (American Psychiatric Associa- Duffy, Merritt, & Purcell, 1995), and this is referred to as
tion, 2000, p. 58). Indeed, the World Health Organization microstructure. At this level, speakers process the narra-
(2005) has codified the distinction by providing separate tive within sentences by monitoring lexical and grammatical
International Classification of Diseases–Tenth Edition codes constructions and process across sentences by monitoring
for receptive and expressive language disorders. how sentences cohere to preceding and subsequent sen-
To our knowledge, only the Test of Narrative Lan- tences (Liles et al., 1995). These elements are often mea-
guage (TNL; Gillam & Pearson, 2004) provides separate sured in terms of productivity (e.g., how many words are
comprehension and production standard scores for narrative produced per utterance; Paul & Smith, 1993) or complex-
skills, with a mean of 10 and a standard deviation of 3. ity (e.g., the complexity of grammatical constructions;
However, the magnitude of discrepancies between receptive Norbury & Bishop, 2003). Indeed, researchers have found
and expressive standard scores has been inconsistent across that microstructure is a multidimensional construct (Justice
studies. Colozzo, Gillam, Wood, Schnell, and Johnston (2011) et al., 2006) made up of complexity and productivity
reported TNL scores for Canadian and American groups of (Westerveld & Gillon, 2010). Narrative competence, there-
children with and without PLI and age-matched peers with fore, relies on individuals’ understanding and production
TD. Discrepancies between comprehension and production of both macro- and microstructure. Indeed, guidelines for
were less than 1 SD for both groups. Redmond (2011) found best practices in the assessment of narrative ability indi-
a similar discrepancy for children with PLI, but the compari- cate that both levels should be assessed (Hughes, McGillivray,
son with TD had a larger discrepancy, with comprehension & Schmidek, 1997).
exceeding production by 1.05 SDs. Domsch et al. (2012) re- Few studies have investigated the narratives of
ported TNL comprehension and production scores for a bilingual children with PLI, but it appears that these chil-
group of late talkers and an age-matched control group of dren’s errors are similar to those of monolingual children
children who were not late talkers. For the late talkers, com- with PLI (Gutiérrez-Clellen, Simon-Cereijido, & Sweet,
prehension exceeded production by 1.03 SDs, but for the 2012). Most of these comparisons, however, have been
control group, this discrepancy was only 0.63 SD. made at the level of vocabulary and syntax, for which there
Another group that might present with a receptive– is general consensus that bilingual children with PLI have
expressive gap in narrative performance is bilingual speakers impoverished narratives compared to their peers with TD
1384 Journal of Speech, Language, and Hearing Research • Vol. 61 • 1381–1392 • June 2018
within 20% English and Spanish). Language exposure Receptive testing included who, what, where, how,
was calculated based on parent and teacher questionnaires and why questions about the stories that they heard. Chil-
(Bohman, Bedore, Pena, Mendez-Perez, & Gillam, 2010; dren were awarded 1, 2, or 3 points for each question cor-
Gutiérrez-Clellen & Kreiter, 2003; Restrepo, 1998) that re- rectly answered (some questions could be answered with a
sulted in an hour-by-hour report of language exposure. Chil- single response, and others required up to three responses;
dren were also matched as closely as possible to age of e.g., children were asked what one of the characters ordered,
first English exposure, which averaged 2.2 years (see Table 1 which required three responses). The receptive version of
for demographic information). One girl from the PLI group the McDonald’s Story had a total possible 15 points, the
was not administered the Dragon Story, which was a nar- Shipwreck Story had a possible 11 points, and the Dragon
rative generation task based on a single picture; therefore, Story had a possible 14 points. Each expressive subtest tar-
she and her matched peer from the TD group were not in- geted aspects of narrative. In the expressive version of the
cluded in the final analysis. Eight children (40%) from each McDonald’s Story, children earned points by how faith-
group received dual language instruction with 16%–83% of fully they recalled the elements of the story that they heard.
the school day taught in Spanish according to teacher inter- For example, they earned points for including the names
views. The other 12 children (60%) were in English-only of participants, including the word McDonald’s, and includ-
classrooms. ing what they ate and what they did, for a total of 26 pos-
sible points awarded. In the Late for School Story, children
received either 0, 1, or 2 points for how thoroughly they
Materials included information about the events in the story, grammar
(the number of grammatical errors, whether they main-
A battery of standardized language assessments was tained the same tense throughout the story), and global
administered to participants in kindergarten and again when story organization (i.e., did it make sense, was it complete?),
those children reached first grade. Of interest to the cur- for a total of 30 possible points. In the Aliens Story, chil-
rent study was the TNL (Gillam & Pearson, 2004), which dren earned 0, 1, or 2 points for how thoroughly they in-
was used to test narrative abilities in English. This English cluded information about the setting, characters, story
language test provides standardized receptive, expressive, elements (events and temporal relationships), vocabulary
and overall scores with a mean of 10 and a standard devi- and grammar, and global story organization, for a total of
ation of 3. The TNL includes six subtests.1 In the first, 34 possible points earned. A total raw score for compre-
children listen to a story with 155 words about going to a hension and a separate total raw score for production were
McDonald’s restaurant. Children then answer questions calculated by adding the total points across the three com-
about what they heard (e.g., they are asked the name of prehension subtests and three production subtests, respec-
the girl in the story). No visual support is provided. In the tively. Separate standard scores for comprehension and
second subtest, children retell the McDonald’s Story without production were calculated by referencing tables in the
visual support. For the third subtest, children listen to the Examiner’s Manual. In addition, an overall standard score
Shipwreck Story, a story about a child who ruins his model can be calculated by matching the sum of standard scores
boat on the way to school. The story includes 175 words to tables provided in the Examiner’s Manual.
and is accompanied by five wordless pictures illustrating the The norming sample of the TNL was chosen to
events in the story. This story serves as a model for the match the demographic characteristics of the United States
fourth subtest, the Late for School Story, in which children as reported by the United States Census Bureau (2001).
generate an original story based on a series of five word- Twelve percent of the sample was Hispanic, but L2 status
less pictures illustrating a boy rising from bed and ulti- was not reported. Thirteen percent of the sample included
mately arriving late for school. In the fifth subtest, children children with exceptional status, such as language/learning
listen to a story with 390 words about a dragon. The Dragon disorder, articulation disorder, attention-deficit disorder, or
Story corresponds to a single picture in which appear a other. Because TNL norming samples did not adequately
fire-breathing dragon, a box of treasure, a large rock, and represent bilingual participants, results were not used to de-
two children. During the instructions for the Dragon Story, termine the presence or absence of language disorder.
children are told that the next task will require them to gen- Licensed speech-language pathologists administered
erate their own stories. In this way, the dragon story serves the TNL and scored the tests. As per the TNL Examiner’s
as an explicit model for the subsequent subtest. In the sixth Manual, scoring took place by listening to the digital au-
subtest, the Aliens Story, children generate their own story dio recording of the testing sessions. There were some
based on a single picture that includes a family of space occasions when children code-switched and responded in
aliens leaving a space ship with their pet and arriving at a Spanish. In the current study, we scored these occurrences
park where they appear to terrify one human child and fas- as inaccurate, reflecting how the average English-speaking
cinate another. speech-language pathologist would likely treat these re-
sponses. To determine reliability, research assistants trained
1
Descriptions of test items and scoring procedures from the Test of by licensed speech-language pathologists rescored a pro-
Narrative Language (Gillam & Pearson, 2004) used with permission. portion of the tests. The percentage of agreement between
Copyright © 2004 Pro-Ed. scorers was calculated for each subtest. For 20% of the
Note. K = kindergarten; TD = typical development; PLI = primary language impairment. PLI severity level
based on a scale of 0 to 5, with 0 = severely impaired, 4 = typical, and 5 = above average. TOLD = Test of
Language Development (standard score with mean of 100 and SD of 15). BESA = Bilingual English–Spanish
Assessment. BESA scores represent percent correct.
*p < .01 (difference between the two groups on that measure based on one-way ANOVA).
tests, the points awarded by each examiner for every item Although the TNL provides multiple narrative tasks,
administered to a child were compared. For each subtest, the test does not provide separate standardized scores for
the total number of scorer agreements were divided by the each subtest. However, we were able to examine each sub-
total number of comparisons and multiplied by 100 to form test by calculating standard scores based on raw scores
a percentage. Interrater reliability was 99.2% for narrative and standard deviations shared with us by the test devel-
comprehension and 100% for oral narration. opers. The means and standard deviations for raw scores
in the normative sample for 5-, 6-, and 7-year olds were
calculated. These raw scores and standard deviations were
Procedure used to create z scores for each of the children in the cur-
Testing occurred in quiet spaces at children’s schools. rent study. The z scores were then transformed to standard
All samples were recorded using a digital audio recorder scores based on a mean of 10 and standard deviation of 3,
(Sony MS-515 or ICD-P320) with an external microphone which is the standard score format used by the TNL.
(ECM115) and then transcribed using Sony digital voice
editor version 2.4.04. To ensure transcription reliability, all
transcripts were transcribed by a trained research assis- Results
tant and checked by a second research assistant (usually the For the first analysis, we were interested in whether
individual who had collected the language sample data). there were differences between receptive and expressive
scores as tested by the TNL in kindergarten children with
and without language impairment and whether there were
Statistical Analyses changes over time. Toward that end, we performed a
In order to determine differences in the receptive– 2 × 2 × 2 repeated-measures ANOVA with time (kinder-
expressive gap in narrative performance, we performed garten vs. first grade) and modality (receptive vs. expres-
repeated-measures analyses of variance (ANOVAs). Be- sive) as within-subject variables and ability (PLI vs. TD) as
cause partial eta-squared includes variance only from the the between-subjects variable (see Figure 1). TNL stan-
target variable (Pierce, Block, & Aguinis, 2004), we re- dard scores were the dependent variable. There were
port it. As published previously (Gibson et al., 2014a), we main effects for time, F(1, 38) = 60.61, p < .01, ηp2 = .62,
developed guidelines for the interpretation of partial eta- a large effect size; modality, F(1, 38) = 4.82, p = .03,
squared because none exists. Because of its similarity with ηp2 = .11, a small effect size; and ability, F(1, 38) = 46.59,
the general linear model, we interpreted the effect sizes p < .01, ηp2 = .55, a large effect size. Children increased
of 0–.10, .10–.25, .25–.50, .50–.80, and .80–1.00 as negligi- their scores from kindergarten (M = 4.66, SD = 0.28) to
ble, small, moderate, large, and very large, respectively. We first grade (M = 6.71, SD = 0.31). They scored higher in
used t tests to follow up with the identification of differ- narrative receptive performance (M = 5.97, SD = 0.28)
ences between individual variables. To control for multiple compared to narrative expressive performance (M = 5.40,
comparisons, we used a Bonferroni correction of .02. In or- SD = 0.31), and children with TD scored higher (M =
der to correct for dependence among means, we used a 7.52, SD = 0.38) than those with PLI (M = 3.85, SD =
within-group Cohen’s d as the effect size measure (Morris 0.38). There was a significant interaction between time,
& DeShon, 2002). modality, and ability, F(1, 38) = 6.09, p = .02, ηp2 = .14,
1386 Journal of Speech, Language, and Hearing Research • Vol. 61 • 1381–1392 • June 2018
Figure 1. Typical development (TD) and primary language impairment Figure 3. Typical development (TD) and primary language impairment
(PLI) total narrative performance: Modality × Time. Kinder = (PLI) narrative performance in the multiple pictures + generation
kindergarten. condition: Modality × Time. Kinder = kindergarten.
a small effect size. Follow-up analyses at the univariate sphericity for the interaction between contextual support
level showed a receptive–expressive gap at kindergarten and time, χ2(2) = 8.22, p = .02; therefore, we used a
for children with PLI, t(19) = 3.81, p = .001, d = 0.85, a Greenhouse–Geisser correction for this interaction, ε = .83.
large effect size, but not for children with TD, t(19) = .87, The comparisons of interest to answer this question
p = .39, d = 0.19. There was no statistically significant were contextual support as well as its interactions. Results
receptive–expressive gap for either group at first grade. showed main effects for contextual support, F(2, 76) =
We also asked whether there were discrepancies 11.42, p < .001, ηp2 = .23, a small effect size. Follow-up paired
between receptive and expressive performance related samples t tests with Bonferonni corrected p value of .02
to the degree of contextual support for children with and showed statistically significant differences when comparing
without PLI. To answer this question, we performed a 2 × the no picture + retell condition (M = 6.87, SD = 2.35)
2 × 3 × 2 repeated-measures ANOVA with time (kinder- with multiple pictures + generation (M = 5.82, SD = 2.52),
garten vs. first grade), modality (receptive vs. expressive), t(39) = 4.11, p < .001, d = 0.65, a large effect size, and
and contextual support (no picture + retell vs. multiple single pictures + generation (M = 5.90, SD = 2.47), t(39) =
pictures + generation vs. single picture + generation) as 4.11, p < .001, d = 0.65, a large effect size. There was no
within-subject variables and ability (PLI vs. TD) as the statistically significant difference between the single picture +
between-subjects variable (see Figures 2, 3, and 4). Mauchly’s generation and multiple picture + generation conditions,
Test of Sphericity (Mauchly, 1940) indicated a violation of t(39) = .29, p = .76, d = 0.04. There was a statistically
Figure 2. Typical development (TD) and primary language Figure 4. Typical development (TD) and primary language
impairment (PLI) narrative performance in the no picture + retell impairment (PLI) narrative performance in the single picture +
condition: Modality × Time. Kinder = kindergarten. generation condition: Modality × Time. Kinder = kindergarten.
1388 Journal of Speech, Language, and Hearing Research • Vol. 61 • 1381–1392 • June 2018
English or a combination of additional exposure and addi- Our previous research found receptive–expressive
tional practice may have contributed to the diminution in gaps for bilingual children with and without PLI during
the receptive–expressive gap. To explore this possibility, we semantic tasks (Gibson et al., 2014a). We argued that a
asked whether language experience correlated with recep- possible reason for this discrepancy was the quality of chil-
tive and expressive TNL performance at kindergarten and dren’s phonological representations. Perhaps because of
first grade for bilingual children with and without PLI. We limited experience with the language, L2 representations
found that there were no statistically significant correla- were underspecified such that they were sufficient to be
tions between language experience and TNL performance successful with the easier receptive task but not sufficient
for the children with PLI. It does not appear, therefore, to be successful with the more difficult expressive task
that additional experience with the language (in the current (Gibson et al., 2014b). A similar phenomenon may have
case, a year) plays a significant role in the diminution of been engaged in the current generative narrative tasks. If
the receptive–expressive gap. children had difficulty in accessing individual words related
We speculate that maturational development, espe- to generating narratives, it would have had detrimental
cially in short-term memory capacity, might play a prom- effects on scoring.
inent role in the dissipation of the receptive–expressive It might also be the case that the receptive–expressive
gap over time. As mentioned above, both Dodwell and gap for the children with PLI diminished over time because
Bavin (2008) and Duinmeijer et al. (2012) found that mea- of the childhood culture in which these children lived.
sures of short-term memory correlated significantly with Corsaro and Eder (1990) identified a childhood culture in
narrative performance. There is a substantial body of which stories play an important role. Narratives produced
literature that has demonstrated that children’s short-term during pretend play help children earn peer acceptance and
memory improves with age (Barrouillet et al., 2009; become a part of the social group (Hoyle, 1998). During
Swanson, 2008), and this improvement aids in the devel- these activities, children learn about characters and events
opment of language (Gathercole & Baddeley, 1993; and their organization. For example, if children pretend to
Gathercole et al., 1991). Because narratives require chil- take the role of mother and daughter during play, they
dren to remember large amounts of information and con- take on the relationship characteristics of real mothers and
nect that information coherently, it might be the case that daughters in a pretend reality, providing insight into char-
the more difficult production tasks are enhanced to a greater acters, settings, and events (Goodwin, 1993; Sacks, 1992).
degree than the less difficult comprehension tasks, contribut- In addition, this is a period when stories are used in the ed-
ing to the diminution of the narrative receptive–expressive ucational setting. For example, the Common Core State
gap over time. Furthermore, because narrative retellings as Standards for education in the United States provide direc-
opposed to narrative generation likely would be more af- tion on curricula from kindergarten through 12th grade
fected by memory, the diminution in a receptive–expressive (Common Core State Standards Initiative, 2010). The ma-
gap might be greatest in narrative retellings. jority of the goals related to reading standards in kinder-
Additional results also suggest a role for memory. garten and the first grade are based on understanding and
Although there was no statistically significant difference generating stories. Although we do not have information
overall between the single picture + generation and multiple about these classroom activities for the participants in the
picture + generation conditions, their patterns of perfor- current study, perhaps these children benefited from such
mance with respect to receptive and expressive modalities educative practice. Expressive narrative abilities might
differed. In the multiple pictures + generation context, re- have been boosted more than receptive because receptive
ceptive was better than expressive performance at both abilities may have already been near these participants’ ceiling
kindergarten and first grade. The relationship between re- performance. Future studies should explore this possibility.
ceptive and expressive performance also held at both kinder- We additionally sought to identify differences between
garten and first grade for the single picture + generation groups based on different levels of contextual support
condition, with no statistically significant difference be- used to elicit the narratives. Results showed that levels of
tween receptive and expressive performance at either mo- contextual support had a similar impact on bilingual chil-
ment in time. One possible interpretation of these patterns dren with TD and bilingual children with PLI. Single
is that the multiple pictures + generation context provides picture + generation and multiple pictures + generation
too many elements for these children to contend with ex- had a statistically similar impact on narrative performance,
pressively; although overall performance improves across but performance was best in the no picture + retell context.
the year for this condition, the relationship between expres- The no picture + retell context was based on a McDonald’s
sive and receptive performance remains because of the narrative. We suspect that, because of the popularity of
heavy demands on memory to successfully perform this McDonald’s across the world, these children already had
task. On the other hand, a single picture provides all of the a mental model of the event of visiting a McDonald’s
elements of the story in a single frame, which perhaps restaurant, which they could use to support their perfor-
taxes memory to a lesser degree and thus does not provoke mance. Future studies might compare performance when
a gap between receptive and expressive performance. Fu- the narrative is based on a less frequent activity and set-
ture studies could test this possibility by including memory ting. On the other hand, the McDonald’s narrative was a
measures in addition to the TNL. story retell task. Because children did not have to generate
1390 Journal of Speech, Language, and Hearing Research • Vol. 61 • 1381–1392 • June 2018
American Psychiatric Association. (2000). Diagnostic and Statistical and vocabulary acquisition? European Journal of Psychology
Manual of Mental Disorders–Fourth Edition (Text Revision). of Education, 8, 259–272.
Washington, DC: American Psychiatric Association. Gathercole, S. E., Willis, C., Emslie, H., & Baddeley, A. D.
Applebee, A. N. (1978). The child’s concept of story: Ages two to (1991). The influences of number of syllables and wordlikeness
seventeen. Chicago, IL: The University of Chicago Press. on children’s repetition of nonwords. Applied Psycholinguistics,
Baddeley, A. (1999). Essentials of human memory. Essex, UK: 12, 349–367.
Psychology Press. Gibson, T. A., Jarmulowicz, L., & Oller, D. K. (2018). Difficulties
Barrouillet, P., Gavens, N., Vergauwe, E., Gaillard, V., & Camos, V. using standardized tests to identify the receptive–expressive gap
(2009). Working memory span development: A time-based in bilingual children’s vocabularies. Bilingualism: Language and
resource-sharing model account. Developmental Psychology, Cognition, 21, 328–339.
45, 477–490. Gibson, T. A., Oller, D., Jarmulowicz, L., & Ethington, C. (2012).
Bates, E. (1993). Comprehension and production in early lan- The receptive–expressive gap in the vocabulary of young
guage development. Monographs of the Society for Research second-language learners: Robustness and possible mechanisms.
in Child Development, 58, 222–242. Bilingualism: Language and Cognition, 15, 102–116.
Berman, R. A., & Slobin, D. I. (1994). Relating events in narrative: Gibson, T. A., Peña, E. D., & Bedore, L. M. (2014a). The relation
A crosslinguistic developmental study. Hillsdale, NJ: Erlbaum. between language experience and receptive–expressive se-
Bishop, R. (1997). Interviewing as collaborative storying. Education mantic gaps in bilingual children. International Journal of
Research and Perspectives, 24, 28–47. Bilingual Education and Bilingualism, 17, 90–110.
Blom, E., & Boerma, T. (2016). Why do children with language Gibson, T. A., Peña, E. D., & Bedore, L. M. (2014b). The recep-
impairment have difficulties with narrative macrostructure? tive–expressive gap in bilingual children with and without
Research in Developmental Disabilities, 55, 301–311. primary language impairment. American Journal of Speech-
Bohman, T. M., Bedore, L. M., Pena, E. D., Mendez-Perez, A., & Language, 23, 655–667.
Gillam, R. B. (2010). What you hear and what you say: Lan- Gillam, R. B., & Pearson, N. (2004). Test of Narrative Language.
guage performance in Spanish–English bilinguals. International Austin, TX: Pro-Ed.
Journal of Bilingual Education and Bilingualism, 13, 325–344. Gillam, R. B., Pena, E. D., Bedore, L. M., Bohman, T. M., &
Bohnacker, U. (2016). Tell me a story in English or Swedish: Nar- Mendez-Perez, A. (2013). Identification of specific language
rative production and comprehension in bilingual preschoolers impairment in bilingual children: I. Assessment in English.
and first graders. Applied Psycholinguistics, 37, 19–48. Journal of Speech, Language, and Hearing Research, 56,
Botting, N. (2002). Narrative as a tool for the assessment of lin- 1813–1823.
guistic and pragmatic impairments. Child Language Teaching Goodwin, M. H. (1993). Accomplishing social organization in
and Therapy, 18, 1–22. girls’ play: Patterns of competition and cooperation in an
Boudreau, D. M., & Chapman, R. S. (2000). The relationship African American working-class girls’ group. In S. T. Hollis,
between event representation and linguistic skill in narratives L. Pershing, & M. J. Young (Eds.), Feminist theory and the
of children and adolescents with Down syndrome. Journal of study of folklore (pp. 149–165). Urbana: University of Illinois
Speech, Language, and Hearing Research, 43, 1146–1159. Publishing.
Bracken, B., & McCallum, S. (1998). Universal Nonverbal Intelli- Gutiérrez-Clellen, V. F. (2002). Narratives in two languages:
gence Test. Itasca, IL: Riverside. Assessing performance of bilingual children. Linguistics and
Colozzo, P., Gillam, R., Wood, M., Schnell, R., & Johnston, J. Education, 13, 175–197.
(2011). Content and form in the narratives of children with Gutiérrez-Clellen, V. F., & Kreiter, J. (2003). Understanding child
specific language impairment. Journal of Speech, Language, bilingual acquisition using parent and teacher reports. Applied
and Hearing Research, 54, 1609–1627. Psycholinguistics, 24, 267–288.
Common Core State Standards Initiative. (2010). Common core Gutiérrez-Clellen, V. F., Simon-Cereijido, G., & Sweet, M. (2012).
state standards for English language arts and literacy in history/ Predictors of second language acquisition in Latino children
social studies, science, and technical subjects. Washington, DC: with specific language impairment. American Journal of Speech-
Council of Chief State School Officers and National Governors Language Pathology, 21, 64–77.
Association. Retrieved from http://www.corestandards.org/ Gwet, K. L. (2008). Computing inter-rater reliability and its
Corsaro, W. A., & Eder, D. (1990). Children’s peer cultures. Annual variance in the presence of high agreement. British Journal of
Review of Sociology, 16, 197–220. Mathematical and Statistical Psychology, 61, 29–48.
Dodwell, K., & Bavin, E. L. (2008). Children with specific language Halliday, M. A., & Hasan, R. (1976). Cohesion in English. London,
impairment: An investigation of their narratives and memory. England: Longman.
International Journal of Language & Communication Disorders, Hoyle, S.M. (1998). Register and footing in role play. In M. S.
43, 201–218. Hoyle & C. Adger (Eds.), Kids talk (pp. 47–68). New York, NY:
Domsch, C., Richels, C., Saldana, M., Coleman, C., Wimberly, C., Oxford University Press.
& Maxwell, L. (2012). Narrative skill and syntactic complexity Hughes, D. L., McGillivray, L., & Schmidek, M. (1997). Guide to
in school-age children with and without late language emergence. narrative language: Procedures for assessment. Eau Claire, WI:
International Journal of Language & Communication Disorders, Thinking Publications.
47, 197–207. Iluz-Cohen, P., & Walters, J. (2012). Telling stories in two lan-
Duinmeijer, I., de Jong, J., & Scheper, A. (2012). Narrative abili- guages: Narratives of bilingual preschool children with typical
ties, memory and attention in children with a specific language and impaired language. Bilingualism: Language and Cognition,
impairment. International Journal of Language & Communica- 15, 58–74.
tion Disorders, 47, 542–555. Justice, L., Bowles, R., Kaderavek, J., Ukrainetz, T., Eisenberg, S.,
Gathercole, S. E., & Baddeley, A. D. (1993). Phonological work- & Gillam, R. (2006). The index of narrative microstructure:
ing memory: A critical building block for reading development A clinical tool for analyzing school-age children’s narrative
1392 Journal of Speech, Language, and Hearing Research • Vol. 61 • 1381–1392 • June 2018