Professional Documents
Culture Documents
130
5.1 Memory systems and their role in L2 learning
131
SECOND LANGUAGE LEARNING THEORIES
132
5.1 Memory systems and their role in L2 learning
133
SECOND LANGUAGE LEARNING THEORIES
been put forward. Swain (1995) has argued that this is due to tensions between
semantic and syntactic processing; other researchers (for example, Gass, 1997;
VanPatten, 2004) say that L2 learners process language for communication,
for which many grammatical forms are redundant (see Chapter 4). Recent
studies using methodologies such as self-paced reading (Jiang, 2004, 2007),
ERPs (Chen et al., 2007) and eye-tracking (Grüter et al., 2012; Lew-Williams
and Fernald, 2010) suggest that L2 learners are less sensitive to inflectional
morphology than L1 speakers, and that the L1 and L2 processing of inflectional
morphology are based on different mechanisms (see Clahsen et al., 2010, for a
review). The Shallow Structure Hypothesis (SSH), developed by Clahsen and
Felser (2006a, 2006b), formalizes some of these differences between L1 and L2
morphosyntactic processing, and makes the following claims:
1. Children rely on syntax when processing grammar, as do adult L1 speakers
(known as the ‘Continuity Hypothesis’), whereas late (adult) language
learners rely to a greater extent on lexical–semantic and pragmatic
information and do not compute syntactic structures in the same way as L1
learners or adults.
2. The procedural memory system is partially available to late L2 learners,
though primarily for ‘local’ morphology and not (or much less) for
morphosyntactic processes which link more distant elements. For example,
verb–subject agreement may be proceduralized if they are close together as
in the woman runs fast, but not if they are far apart as in the woman who won
first prize in last year’s race runs fast. (Note: this is a more conservative view of
eventual L1-like attainment than Ullman’s DP model.)
3. The L1 does not significantly impact on the L2 of the learners, so that shallow
syntactic processing occurs regardless of the relationship between L1 and L2.
Felser et al. (2003) tested the SSH by investigating how learners work out who
the referent is in relative clauses in sentences such as:
The dean liked the secretary of the professor who was reading a letter.
This sentence is ambiguous as either the professor or the secretary can be the
subject of was reading a letter. Speakers of different languages have different
preferences for choosing either the professor or the secretary as subject. Felser
et al. found that highly proficient L2 English speakers (German and Greek L1)
exhibited no preference for either subject, that is, they did not transfer their L1
preference for secretary nor did they adopt the English preference for professor.
Felser et al. concluded that this is because L2 speakers, in contrast to native
speakers, tend to opt for semantic and pragmatic cues rather than computing
the syntactic structure of a sentence.
However, there are numerous challenges for research into the SSH, online
processing and memory systems. We mention five of these here.
134
5.1 Memory systems and their role in L2 learning
First, some researchers suggest that factors such as learners’ L1, their working
memory capacity or the verb semantics can overcome shallow processing and
so allow some advanced learners to behave like adult L1 speakers (Jiang et al.,
2011; Havik et al., 2009; Jackson, 2008). (In contrast to the SSH, Ullman’s model
does account for some learners being able to attain native-like procedural
knowledge.)
Second, the SSH has been criticized for claiming a qualitative difference
between L1 and L2 processing, whereas the difference is merely one of
frequency; that is, L1 and L2 speakers all use shallow processing but L2 speakers
use it more often (Sekerina and Brooks, 2006; Indefrey, 2006). A similar idea is
found in Bley-Vroman’s (2009) reformulated Fundamental Difference Hypothesis
mentioned above.
Third, neurolinguistic evidence for L1 and L2 storage and morphosyntactic
processing provides a mixed picture in terms of informing learning theory
(DeKeyser, 2012a, pp. 448 and 453). Some evidence suggests that L1 and L2
morphosyntactic processing rely on the same neurolinguistic architecture, and
that dissimilarities only occur when the features being processed are dissimilar in
the L1 and L2 (Tokowicz and MacWhinney, 2005; Tolentino and Tokowicz, 2011).
Fourth, researchers need to address the extent to which millisecond differences
in online processing or differences in brain activation reflect real differences
in how the L2 is used in comprehension and production, combining online
measures of processing and language use (Chen et al., 2007; Roberts et al., 2008;
Grüter et al., 2012).
Finally, it is not yet clear how improved understanding of online processing
and L1/L2 memory systems actually improves our understanding of learning
mechanisms themselves. For example, it remains somewhat unclear exactly
‘how certain learners gain [L1-like] proficiency and begin to use their procedural
networks more than other learners’ (Herschensohn, 2007, p. 220). It is also
unclear when, exactly, knowledge stored in the procedural memory can become
accessible to conscious control: this is crucial if explicit awareness is needed for
learning, as argued by some researchers (see below).
The focus of our review in this area has been morphosyntax, neglecting
memory-based accounts of L1/L2 differences in other areas. In terms of
vocabulary storage, the interplay between L1 and L2 is an active research area
(Linck et al., 2008; Sunderman and Kroll, 2006). Ullman’s model of memory
suggests that lexical items are learned and stored primarily using declarative
memory in both L1 and L2, consistent with the observation that vocabulary
learning in L2 is not prone to age effects (DeKeyser, 2012a). As the declarative
memory system works with both implicit and explicit information, Ullman’s
model can account for incidental and, possibly, implicit vocabulary learning
(Chen and Truscott, 2010).
135
SECOND LANGUAGE LEARNING THEORIES
Overall, the notion that there is a change in the way humans learn as they move
from childhood to adulthood receives considerable support as an explanation
for L1/L2 differences in grammatical processing. As noted in Chapter 3, for
some theorists this is due to reduced access to certain features or interfaces
within UG. Others suggest that it is due to a change in more general cognitive
capabilities. For example, DeKeyser et al. (2010) argue that age influences
morphosyntactic ability, and that different aptitudes have a role in this: ‘younger
learners learn more while relying less on aptitude; older learners learn less, and
to the extent they do learn, must rely more heavily on their verbal aptitude’
(p. 433). Similarly, DeKeyser et al. (forthcoming, cited in DeKeyser, 2012a)
found that salience interacts with age, that is, older learners are more affected
by salience of morphological form than younger learners. Both findings are
compatible with the argument that child learners rely more heavily on implicit
learning mechanisms, whereas adults rely more on explicit learning initially
(see DeKeyser, 2000, 2003, 2012a and b; and Skill Acquisition Theory in Section
5.2.2). We now turn to research into defining and identifying explicit knowledge
more closely, and then we explore the extent and nature of explicit learning in
SLA.
136
5.2 Explicit knowledge, information processing and skill acquisition
implicit knowledge. R. Ellis (2005b) lays out some criteria for defining implicit
and explicit knowledge, summarized in Table 5.1.
Table 5.1 Operationalizing the constructs of L2 implicit and explicit knowledge (source: R. Ellis,
2005b, p. 152)
137
SECOND LANGUAGE LEARNING THEORIES
parts
138
5.2 Explicit knowledge, information processing and skill acquisition
139
SECOND LANGUAGE LEARNING THEORIES
140
5.2 Explicit knowledge, information processing and skill acquisition
141
SECOND LANGUAGE LEARNING THEORIES
(c) Why learning is incremental (step by step). Recall that during the shift from
declarative through to automatic processing, controlled processes are freed
up gradually, to establish new declarative knowledge or deal with more
complex processing.
(d) Why there are differences between individual learners. The establishment
of declarative knowledge is likely to be affected by individual cognitive
differences, such as working memory capacity, analytical ability, general
intelligence, or learning style (DeKeyser, 2003, 2005; Roehr, 2008; Ullman,
2001a).
(e) Why there is fossilization. Fossilization, that is, non-native-like structures in
the L2 in spite of abundant linguistic input and years of practice, is a result
of declarative knowledge that is non-native-like becoming automatized.
As automatized knowledge is difficult to modify, it is likely to remain in
the learner’s interlanguage, giving rise to a stable, though non-native-like,
construction.
(f) Why some structures are more likely to fossilize than others. Non-native-like
declarative knowledge is more likely for some structures than others due to
many factors, including: the nature of the feature (for example, complexity,
frequency, saliency, relation to the L1); the number of communicative events
which would actually push the learner to restructure their initial declarative
knowledge (for example, failed parsing or corrective feedback, see Chapter
6). In addition, communicative contexts for particular forms are very
frequent (for example, contexts that require third person –s). If a learner had
non-native-like declarative knowledge about a highly frequent structure (for
example, ‘subject–verb agreement is never necessary in English’), then there
is ample opportunity for the learner to proceduralize and automatize this
frequently ‘required’, but incorrect, knowledge.
142
5.2 Explicit knowledge, information processing and skill acquisition
2008 for reviews), and Roehr (2007) found that such correlations were stronger
among more proficient learners. However, as Roehr notes, correlation research
cannot determine whether increasing proficiency causes improvement in
metalinguistic ability, or vice versa. For that, we need studies that compare
learners with high and low metalinguistic ability and examine the effects on
increasing proficiency.
Researchers using think-aloud protocols, and others analysing learner–learner
or learner–expert interactions, have demonstrated that learners with greater
metalinguistic awareness show superior learning gains over those with less
(see Section 5.3, and Chapter 8). Recall, however, that in Chapter 4 we reviewed
research that set out to determine whether explicit awareness is essential.
Results indicated that learning was observed even when participants did not
seem to have any initial declarative knowledge of any sort. Such studies have
enabled researchers to identify the situations in which skill acquisition does not
seem to be applicable.
143
SECOND LANGUAGE LEARNING THEORIES
agenda, see Doughty and Williams (1998), Loewen (2011), Norris and Ortega
(2000), Spada and Tomita (2010). Meta-analyses are increasingly frequent,
although the diversity of research designs makes comparisons difficult. Overall,
the consensus seems to be that explicit learning conditions result in higher
scores on post tests than less explicit conditions, at least on tests that give
instructed learners time to access explicit knowledge. The evidence is more
mixed as to whether gains can be made, within the timeframes investigated, on
tests that require free, unplanned communicative production (see R. Ellis, 2002;
Marsden and Chen, 2011).
One area that remains to be fully addressed is the relative merits of whether
the initial, declarative knowledge is picked out by the learner (induced) or
provided by some external source such as a teacher or textbook (deduced).
The advantages of deduced knowledge are its accuracy and generalizability. On
the other hand, induced explicit knowledge may have greater relevance to the
moment and so perhaps be more easily established in memory. For studies in
which learners seemed to induce declarative knowledge, see Marsden and Chen
(2011), Robinson (1997a), Sanz and Morgan-Short (2004).
Some researchers are interested in whether declarative knowledge is more likely
to be proceduralized when target language forms are simple, or complex. Spada
and Tomita (2010) carried out a meta-analysis of 30 ESL studies that compared
metalinguistic teaching activities to implicit teaching approaches such as
input floods and implicit error correction. They found that for both simple and
complex structures explicit teaching was more effective, regardless of the type of
test used to elicit language knowledge. However, they acknowledge difficulties
with their definition of complexity. It is also difficult to know whether the
explicit knowledge provided by the teachers in these classroom studies had
become proceduralized or automatized by the time it was tested. So, we have to
be careful in interpreting these results in terms of skill acquisition theory.
Other researchers are interested in whether explicit information about
constructions is useful for classroom learners (Chapter 4, and Robinson
and Ellis, 2008). For example, it may be useful to tell learners about particular
prototypes, that is, best exemplars of a particular category, structure, or feature,
and tell them how to break these prototypes into their constituent parts.
However, research must first establish which prototypes are valid and useful
(Wu, 2011) and whether constructions are accessible to conscious reflection so
that learners can ‘break them down’.
144
5.2 Explicit knowledge, information processing and skill acquisition
145
SECOND LANGUAGE LEARNING THEORIES
146
5.3 Awareness and attention in second language acquisition
147
SECOND LANGUAGE LEARNING THEORIES
provide indirect support for the benefits of conscious awareness. For example,
Williams’ (2005) study discussed in Chapter 4 found that all aware participants
performed significantly better than the ‘unaware’ participants, as did Marsden
and Chen (2011).
The debate as to whether awareness is necessary was described in Chapter 4, in
the accounts of the work on implicit learning by Williams and colleagues. The
evidence we reviewed suggested that it is possible to learn a generalizable form-
meaning connection without awareness of it. Though recall that Hama and
Leow (2010), along with other work by Leow, argued that none of their unaware
participants learned and that noticing is essential.
As we saw in Chapter 4, in their study of learning of a semi-artificial language
with novel articles, Leung and Williams (2011) find evidence for learning
without awareness. They explain exactly what their ‘unaware’ learners must
have noticed, and what they did not:
the participants were fully aware of (that is, they noticed) the critical phrase (for
example, ul Mary) and its association to the entity (for example, a girl) within the
event … of a girl kissing a boy. In this sense, the participants were aware of a form-
meaning relationship … What they were not aware of was the specific relationship
between [the article ul] and [its function] that … was implicit within their
understanding of the event. (p. 51)
148
5.3 Awareness and attention in second language acquisition
As Mackey points out, learners that were identified as ‘aware’ did not necessarily
demonstrate development. And, conversely, in the control group, learning was
not necessarily accompanied by noticing (Table 5.2). Noticing was associated
with learning for question development but not for the other two forms tracked
in this study.
Table 5.2 Learners’ reports about noticing and learning (source: Mackey, 2006, p. 421)
149
SECOND LANGUAGE LEARNING THEORIES
language, that is, instances of noticing, are used to document the frequency and
focus of incidental, or spontaneous, attention (Park and Han, 2008; Bell, 2012).
Evidence suggests that morphosyntax is attended to, though the lexicon is
much more frequently attended (Williams 1999).
A long line of studies has documented the effects of ‘forcing’ learners to attend
to a form-meaning (or form-function) connection while listening to or reading
sentences (known as Processing Instruction, VanPatten, 2004). VanPatten’s
claim is that attending to form-meaning connections is essential for L2 learning.
To test this, comparison must be made to a condition in which form-meaning
connections are not essential. Evidence from classroom studies investigating
this issue found that when learners’ attention was not drawn to the form-
meaning connection, no significant learning was observed, broadly compatible
with VanPatten’s claims (Marsden, 2006; Marsden and Chen, 2011). However,
other, more implicit measures of learning may have obtained different results.
The proposal that attending to morphosyntax and other cues in the input is in
tension with attending to other things is based on the notion that attentional
resources are limited or constrained in some way. We briefly examine this idea
next.
150
5.4 Working Memory and second language acquisition
151
SECOND LANGUAGE LEARNING THEORIES
152
5.4 Working Memory and second language acquisition
153
SECOND LANGUAGE LEARNING THEORIES
the idea that when comprehension remains effortful, WM will not be available
to allocate attention to redundant morphosyntactic cues (see VanPatten, 2007,
pp. 116–19).
There is considerable evidence of strong associations between WM (specifically,
phonological memory) and L2 vocabulary learning (Speciale et al., 2004).
Williams and Lovatt (2003) demonstrated the role of phonological short-term
memory in learning both new vocabulary and morphosyntax. However, recent
cross-linguistic evidence suggests that non-alphabetic scripts in learners’ L1
may reduce the reliance on the phonological loop in word learning (Hamada
and Koda, 2011).
Other studies explore how WM may influence whether novel forms are learned
by investigating the role of WM in interaction (see Chapter 6). In a series of
studies Mackey and colleagues explore whether learners with higher WM
capacity are likely to benefit more from corrective feedback during interaction.
Mackey et al. (2002) found that higher WM capacity learners took longer than
low WM participants to reflect on their output, during which they compared it
with existing knowledge. They observed significant relationships between their
outcome measures and WM capacity, though interestingly only in delayed test
results. Mackey et al. (2010) found that WM capacity (measured by listening
span score) was associated with just under a fifth of the amount of modified
output produced by learners, showing some role for WM capacity in the uptake
of feedback.
Some researchers have raised methodological issues related to WM. For
example, Goo (2010) found that the think-aloud technique used to elicit data
about awareness actually causes increased awareness and more rule learning
among participants with high WM capacity. Révész (2012) shows that different
subcomponents of WM were associated with different language sub-skills:
learners with high phonological short-term memory made more gains in oral
performance, whereas learners with a high reading span score were more likely
to improve on a written test. She speculates, based on Skill Acquisition Theory,
that different WM tests can tap into learners’ ability to learn declarative
or procedural knowledge (see also Wen, 2012 who calls for skill-specific
WM tests). Gass and Lee (2011) present a serious methodological challenge
for L2 research as they found that scores on L2 tests of WM increased
with proficiency; this makes it difficult to unpick causal relations between
proficiency and WM.
What is clear from this short review is that L2 researchers typically use WM
in correlational designs, which limits the claims that can be made about
underlying causal relationships. Although the research on attention (reviewed
in the previous section) goes some way to explaining a link between WM and
L2 learning, the exact mechanisms that link WM to long-term memory (that
is, why and how some representations move from WM to long-term memory,
154
5.4 Working Memory and second language acquisition
and not others) remain to be explored (see Jones et al., 2007 for one model in L1
learning).
155
SECOND LANGUAGE LEARNING THEORIES
156
5.5 Evaluation of cognitive approaches (2)
157
SECOND LANGUAGE LEARNING THEORIES
158
5.5 Evaluation of cognitive approaches (2)
159