You are on page 1of 40

Psychological Review Copyright 2000 by the American Psychological Association, Inc.

2000, VoL 107, No. 3, 460-499 0033-295X/00/$5.00 DOI: 10.1037//0033-295X.107.3.460

Discreteness and Interactivity in Spoken Word Production


Brenda Rapp and Matthew Goldrick
Johns Hopkins University

Five theories of spoken word production that differ along the discreteness-interactivity dimension are
evaluated. Specifically examined is the role that cascading activation, feedback, seriality, and interaction
domains play in accounting for a set of fundamental observations derived from patterns of speech errors
produced by normal and brain-damaged individuals. After reviewing the evidence from normal speech
errors, case studies of 3 brain-damaged individuals with acquired naming deficits are presented. The
patterns these individuals exhibit provide important constraints on theories of spoken naming. With the
help of computer simulations of the 5 theories, the authors evaluate the extent to which the error patterns
predicted by each theory conform with the empirical facts. The results support a theory of spoken word
production that, although interactive, ptaces important restrictions on the extent and locus of interactivity.

In much of current theorizing on cognitive representation and findings from normal and impaired spoken production that need to
processing, the default assumption is that cognitive systems are be accounted for by any theory of spoken word production. Then,
highly interactive. Nonetheless, relatively little attention has been with the help of computer simulation, we evaluate the extent to
directed at examining what is meant by "interactivity" or what which the error patterns predicted by each of the five theories
empirical evidence is required to decide that a system is interac- conform with the empirical facts. We find support for a theory of
tive. In this article, we consider these issues b y examining inter- spoken word production that, although interactive, places impor-
activity in the domain of spoken word production. Specifically, we tant restrictions on the extent and locus of interactivity. Before
consider the role of interactivity in the processes involved in going turning to the specific issues of spoken word production, we
from a word's meaning to its phonological form. Rather than consider several general matters regarding discreteness and
attempting to decide for or against interactivity, we ask the fol- interactivity.
lowing questions: Which aspects of spoken naming are best ac-
counted for by discrete processes and which require that we Interactivity: C o n c e p t s and M e c h a n i s m s
assume interactivity? Where there is interactivity, what is the
nature and extent of the interaction? Concepts
In this article, we identify five theories of spoken word produc- The term interactivityhas been applied to a range of situations
tion that represent different points along a continuum from high that differ from one another in significant ways (see Boland &
discreteness to high interactivity. These five theories differ sys- Cutler, 1996). We use interactivity to refer to multiple processes
tematically with respect to the features of cascading activation, (e.g., Processes A and B) that influence one another as they carry
feedback, domains of interaction, and seriality. We identify a set of out their computations. This use of interactivity is intended to
capture the notion of "mutual influence" that the term typically
evokes.
A distinction can be made between forward-backward and
Brenda Rapp and Matthew Goldrick, Cognitive Science Department,
lateral (side-to-side) interactivity (Figure 1). Forward-backward
Johns Hopkins University.
interactivity occurs when information flows not only forward,
This research was supported by National Institute of Mental Health
Grant R29MH55758 and by the Johns Hopkins University Provost's from "earlier" to "later" processes, but also backward, from later to
Award for Undergraduate Research. earlier ones. In contrast, lateral interactivity refers to the situation
We would like to acknowledge the dedication and insights that Seth in which the flow of information is between processes that are not
Dennis and Brian Glucroft contributed to the scoring and analysis of the ordered with respect to one another.
patient data. Argye Hillis and Alfonso Caramazza kindly provided us with There are, of course, extremely difficult questions regarding
R.G.B.'s data for reanalysis. We are grateful to Gary Dell and David Plaut what counts as a process that we do not address here. We use the
for their willingness both to share their simulation programs and to assist term process in an informational sense to refer to transformations
us in getting them up and running. Michael McCloskey and Paul Smulen- or mappings of one type of information to another (the mapping of
sky provided invaluable intellectual and moral support throughout the
semantic features onto words, the transformation of abstract word
project. Finally, we very much appreciate the patience and the help-
representations into phonemes, etc.). Furthermore, we would like
ful comments of Luigi Burzio, Argye Hillis, Gary Dell, and Michael
McCloskey on earlier versions of this article. to emphasize that although it may be possible to localize a process
Correspondence concerning this article should be addressed to Brenda to a particular part of a network, a direct relationship between
Rapp, Cognitive Science Department, Johns Hopkins University, Krieger processes and layers or nodes in a network is not necessary. Thus,
Hall, 3400 North Charles Street, Baltimore, Maryland 21218-2585. Elec- a process may involve multiple layers, and furthermore, these
tronic mail may be sent to brenda@mail.cog.jhu.edu. layers may be involved in more than one process.

460
INTERACTIVITYIN SPOKEN WORD PRODUCTION 461

@
Forward-Backward Lateral Integration
Interactivity Interactivity
Figure 1. Differenttypes of interactivity.

Forward-backward interactivity has been invoked to explain a put of some subsequent process (e.g., object identification). This
number of phenomena, such as the word superiority effect--the proposal stands in contrast not only to a hypothesis that assumes
observation that people identify letters more accurately in brief that the two prOcesses interact but also with a nonintegrative (e.g.,
visual presentations of words than in brief visual presentations of horse race) hypothesis that assumes that although the two inde-
single letters or random letter strings (Reicher, 1969; Wheeler, pendent processes are engaged, only one (e.g., the quickest) con-
1970). McClelland and Rumelhart (1981; Rumelhart & McClel- tributes to the object identification decision.
land, 1982) proposed that a letter identification process feeds Although it is useful to distinguish between interactivity and
forward to a word recognition process and that the word recogni- integration, it is also important to note that the consequences of
tion process not only feeds forward to subsequent processes (e.g., integration and interactivity are similar in the sense that both serve
semantic ones) but also feeds back to the letter identification to reduce the isolation of processing systems by increasing the
process. The letter identification process then again feeds forward extent to which different processes jointly contribute to an out-
to the word recognition process, creating an interactive cycle (see come. In this article, we are concerned specifically with forward-
Elman & McClelland, 1988, for an example of forward-backward backward interactivity and with a particular form of integration.
interactivity in spoken language perception; see also Samuel, Despite the fact that interactive and integrative mechanisms are
1997). often assumed in cognitive theorizing, solid evidence for interac-
Lateral interactivity can be illustrated by a hypothesis that tivity has often been difficult to come by. For example, the word
edge-based and motion-based processes (which can independently superiority effect is often cited as the prototypical result requiring
compute form) may influence one another in the course of object forward-backward interactivity; nonetheless, this account has not
recognition. According to such a hypothesis, edge- and motion- gone unchallenged. The argument for interactivity rests on estab-
based processes function successfully and autonomously with lishing that the effect stems from word-driven enhancement at the
stimuli that lack either motion or edge cues, respectively. Further- letter level. This is crucial because alternative hypotheses based on
more, these processes are not ordered with respect to one another. strictly feedforward processing assume that the superiority effect
Only with stimuli containing both edge and motion information originates at the word level (see Johnston, 1978; Massaro &
would the two processes interact. Cohen, 1991; Paap, Newsome, McDonald, & Schvaneveldt, 1982).
As well as referring to reciprocal effects, the term interactivity Research on this topic illustrates that establishing strong evidence
is sometimes applied to a somewhat different situation in which for interactivity is generally not straightforward and, crucially, that
the outputs of multiple processes come together and jointly con- it requires a clear demonstration that the effects cannot be ac-
tribute to some other subsequent process or processes (Figure 1). counted for within a noninteractive architecture.
Note that in such cases the computations of the contributing
processes themselves are not affected as a result of their joint Mechanisms of lnteractivity
action. To distinguish this from the reciprocal action situation, we
refer to it as integration. A variety of mechanisms or architectural features may play a
An example illustrating integration is the hypothesis that the role in implementing interactlvity and integration. These include
outputs of relatively early and independent visual and auditory cascading activation, feedback, connectivity distance, extent of
processes jointly contribute--are integrated--to determine the out- interactivity domains, and seriality. Cascading activation refers to
462 RAPP AND GOLDRICK

the flow of activation to a later level before a decision has been activation of a set of semantic features (but see Roelofs, 1992,
reached at the earlier level, feedback refers to the flow of infor- 1997, for a nondecompositional semantics position).
mation from later to earlier levels, connectivity distance refers to Beyond the semantic level, all of the theories assume at least one
the directness of connections between levels, domains of interac- level of lexical representation (i.e., where units correspond to
tion refer to the number and types of processes that are assumed to words) that mediates between the conceptual-semantic level and
interact, and seriality refers to the degree to which there are the level of individual phonemes. However, the nature and orga-
processing steps or decision points between input and output. nization of this lexical level differs across theories.
Later, we elaborate on the contribution these mechanisms make to According to the lemrha-based theories, there is, for each word,
the level of discreteness-interactivity in the system. an amodal, lexically specific node (lemma) that represents the
syntactic characteristics of that word. This node mediates between
T h e o r i e s o f S p o k e n W o r d Production the conceptual and phoneme levels and serves as the basis for
retrieving both orthographic and phonological information. It is
Word retrieval in spoken production is generally referred to as important to note that the lexically specific syntactic information
a two-stage process in which the first stage is meaning-based and represented by the lemma must be accessed to gain access to a
the second is phonology-based (Butterworth, 1989; Garrett, 1980; word's phonological characteristics)
Levelt, 1989). Accordingly, it is assumed that a nonverbal meaning In contrast, Caramazza's (1997) independent network theory
is first mapped onto a word and that the word is then mapped onto assumes that the nodes mediating between semantic features and
its corresponding phonology. phonemes are modality-specific phonological lexeme nodes (with
In this context, the terms stage and process are often used a corresponding set of orthographic lexeme nodes mediating be-
interchangeably. However, we draw a distinction between them. tween semantics and graphemes). These phonological lexemes
As stated earlier, we use process in the informational sense; we use have pointers to features in a syntactic network as well as to
stage in a temporal sense to refer to activities occurring before or phonemes in a segmental network. A central claim of the inde-
after some event or point in time. Furthermore, as is the case for pendent network theory is that access to phoneme information
process, we assume that there need not be a direct relationship does not require prior access to syntactic information.
between stages and layers or levels of a network. In sum, the lemma and independent network theories differ in
Current theories of spoken word production differ primarily two key respects: (a) whether the lexical representations mediating
regarding (a) the number, type, and organization of representa- between semantics and phonemes are m o d a l or modality-specific
tions; (b) whether or not activation spreads or cascades to subse-
and (b) whether access to lexical syntax is a prerequisite for access
quent levels before processing is finished at an earlier level; and
to phonological form.
(c) whether processing across levels is strictly feedforward (But-
The debate between lemma theories and the independent net-
terworth, 1989; Levelt et al., 1991a; Roelofs, 1992, 1997; Schrie-
work account is ongoing, and it is beyond the scope of this article
fers, Meyer, & Levelt, 1990) or involves feedback (Dell, 1986;
to review the relevant evidence (see Caramazza, 1997; Caramazza
Harley, 1984). Although we focus on points (b) and (c), it is useful
& Miozzo, 1997; Miozzo & Caramazza, 1997). Given that our
to first briefly review the debate on representational type and
primary concern is with the discreteness-interactivity question, we
number.
emphasize the commonalities among the theories by assuming a
generic architecture (Figure 3). According to this account, there is
Representational Issues a semantically decomposed level of representation followed by an
L level that corresponds to a representation that may be either
Most accounts of word production posit three major represen- modality-specific (e.g., phonological lexemes) or amodal (e.g.,
tational types: semantic, syntactic, and phonological. 1 These are lemmas) and that may or may not be fundamentally syntactic in
typically conceptualized as pools of units or nodes that both nature.
accumulate activation and pass it on to other units. For example,
Dell and O'Seaghdha (1991, 1992; see also Harley, 1993) posited
semantic feature nodes, lexical nodes, and phonological segment 1 The representation of orthographic form is largely ignored in most
nodes (with the lexical nodes representing lexically specific syn- accounts, although it turns out that issues of orthographic representation .
tactic information such as noun, masculine, etc.; see Figure 2A); play an important role in establishing which aspects of lexical knowledge
Levelt (1992) referred to lexical concepts, lemmas, and word are modal (common to both written and spoken forms) and which are
forms (in which lemmas are syntactic objects; see Figure 2B); modality specific (Caramazza, 1997; Caramazza & Hillis, 1990; Rapp &
Caramazza, 1998).
similarly, Roelofs (1992) proposed conceptual, syntactic, and word
form strata; and Caramazza (1997) described a lexical semantic 2 Furthermore, some theories posit additional sublexical levels of pho-
nological representation corresponding to syllables or syllable constituents,
network, a syntactic network, and phonological lexemes (see Fig-
whereas others do not.
ure 2C). In addition, Dell (1986) and Roelofs (1992, 1997) posited
3 Roelofs's (1992) theory is in the lemma class; however, because
an additional lexical level between the 1emma level and the pho- syntactic feature information is accessed from the amodal lemma node, it
neme level (Figure 2D) that is sometimes referred to as the is not clear whether accessing syntactic feature information is essential for
morpheme level. 2 phonological retrieval. That is to say, the activation of the "empty" lemma
At the conceptual-semantic level, most theories assume a de- node (without proper access to syntactic feature information) could serve
composed semantic representation and depict processing as the as the basis for phonological encoding.
INTERACTIVITYIN SPOKEN WORD PRODUCTION 463

PHONEMES

C D
/~EMANTIC FEATURES . ~

"., ~ ' ~ : : ~ l l'Wd I/\~1 I / \ \ \ I / / \ \ I II \ ~IN\l I

Figure 2. Schematicsof the followingtheories: (A) Dell and O'Seaghdha (1991, 1992): distributedsemantic
concepts map onto amodallexicalnodes, which, in turn, map onto phonemes;(B) Levelt (1989): lexicalconcepts
map onto lemmas specifyingsyntactic information,and these, in turn, map onto word forms; (C) Caramazza
(1997): distributedsemanticconceptsdirectlyaccess modality-specificlexemesas well as syntacticfeatures; and
(D) Roelofs (1992, 1997): nondecomposedconcepts map onto modal lemmas, and these map onto modality-
specific (phonological)morphemes, which, in turn, map onto phonemes.

Cascading Activation and F e e d b a c k ends when a single L-level unit is selected, where selection means
that the activation of the competing L-level units (corresponding to
The literature on spoken word production has been dominated
semantic competitors) is reduced to 0; (c) during Stage 2, phono-
by a debate between proponents of strictly discrete accounts (But-
logical retrieval takes place for only the selected L-level unit (see
terworth, 1989; Garrett, 1980; Kempen & Huijbers, 1983; Levelt,
Figure 4A).
1989, 1992; Levelt et al., 1991a; Roelofs, 1992, 1997; Schriefers
In contrast, the central, characteristics of a highly interactive
et al., 1990) and advocates of interactive theories (Dell, 1985,
position (e.g., Dell & O'Seaghdha, 1991, 1992) are as follows: (a)
1986, 1988; Dell & O'Seaghdha, 1991, 1992; Harley, 1984, 1993;
MacKay, 1987; Stemberger, 1985). Although the work of Dell and Stage 1 begins when semantic information regarding the target
colleagues and Levelt and colleagues refers to a number of detailed produces activation of the target and its semantically related com-
aspects of spoken production that we do not consider here, their petitors, and furthermore, Stage 1 continues as all of the activated
proposals with regard to the discreteness-interactivity issue are L-level units pass on activation to the phoneme level; (b) activa-
representative of discrete and interactive positions, and thus we tion throughout Stages 1 and 2 involves not only a forward flow of
take them as a starting point. activation but also a backward flow of activation between the
The central characteristics of a strict, discrete two-stage position phonological and L levels as well as between the L and semantic
(e.g., Levelt et al., 1991a) are as follows: (a) Stage 1 begins when levels; (c) Stage 1 "ends" with the selection of the most active
semantic information regarding the target (e.g., CAT) produces L-level unit, where selection means that the activation level of the
activation of the target and its semantically related competitors selected unit is raised above that of its competitors (but the
(e.g., DOG, HOG, RAT) at the semantic and L levels; (b) Stage 1 competitors are not turned off); (d) during Stage 2, processing at
464 RAPP AND GOLDRICK

MAT, CAN, COT). For example, if CAT activates the phonemes


Semantic /k/,/ae/, and/t/, then feedback from those phonemes back to the L
Activation
level will both reactivate CAT and activate word nodes corre-
sponding to any words sharing any of those phonemes (e.g, HAT;
Figure 5C). In turn, these active L-level nodes will contribute,
through their forward connections, to the activation of their cor-
L-Level responding phonemes at the phoneme level (e.g, /h/ from HAT)
Selection
and, through their backward connections, also activate their cor-
responding semantic features.

Phoneme Cascading Activation Plus Feedback


Selection
As just indicated, feedback alone adds the phonological neigh-
bors of the target to the "cast of characters." When combined with
Figure 3. The generic two-stage account of spoken naming. Distributed cascading activation, feedback also serves to further strengthen the
semantic concepts map onto lexical nodes, which, in turn, map onto semantic competitors of the target at the L level and at the
phonemes. semantic level. As a result, in a system with cascading activation
and full feedback, semantic and phonological neighbors of the
target will be active at all levels of the system.
all levels continues until the end of the stage, at which time the
most active phoneme units are selected (again by raising their
activation levels above those of their competitors; Figure 4D). Further Interactivity: Connectivity Distance, Seriality,
These two theoretical positions are similar, as indicated earlier, and Domains of Interaction
in terms of the representational types adopted and their commit-
Cascading activation and feedback do not exhaust the possibil-
ment to a two-stage framework. In addition, they share the as-
ities for implementing interactivity in spoken word production. In
sumption that both the target and its semantic competitors are
fact, as Dell and O'Seaghdha (1991) argued, the architecture they
active during Stage 1 processing. However, as pointed out by
proposed is "globally modular but locally interactive" (p. 604). As
Levelt et al. (1991a; see also Stemberger, 1985), the interactive
mentioned earlier, at least three other factors may contribute to a
theory differs from the discrete theory in terms of two key features:
system's discreteness-interactivity: connectivity distance, serial-
the cascading activation assumption and the feedback assumption.
ity, and the extent of interaction domains.

Cascading Activation
Connectivity Distance
The most important 'consequence of cascading activation is that
multiple units at the L level (e.g., those corresponding to the target In Dell and O'Seaghdha's (1991), Dell (1986), and Dell,
CAT and its semantic neighbors DOG and RAT) send activation to Schwartz, Martin, Saffran, and Gagnon (1997), there is interaction
the phoneme level. This allows the phonemes corresponding to the only between adjacent representational levels; nonadjacent levels
target's semantically related neighbors ( e . g . , / d / a n d / r / f r o m DOG interact only indirectly. However, one can imagine a system in
and RAT, respectively) to become "players" at the phoneme level
(see Figure 5B). 4
4 This activation of phonemes of semantic neighbors results from two
In fact, one could say that cascading activation implements a
aspects of the way in which cascading activation is typically assumed to be
form of integration (albeit in a somewhat different manner than implemented: (a) The activation of phonology begins before the semanti-
illustrated in Figure 1). It is a form of integration in that it allows cally driven L-level selection process has been completed (preactivation of
activation generated by two processes (L-level selection and pho- phonology), and (b) all candidates of this semantically driven process (the
nological retrieval) to be combined in determining the final out- target and its semantically related competitors) are allowed to influence
put of the system. This contrasts with a highly discrete ac- second-stage phonological retrieval beyond the point of L level selection.
count in which the nonselected candidates of the semantic process In point of fact, either (a) or (b) would be sufficient for introducing
(DOG-RAT) do not influence events at the phonological level semantically related words as players at the phoneme level. That is, if
(Figure 5A). activation of phonology is allowed to occur as soon as Stage 1 processing
begins but only the top candidate is allowed to continue beyond the end of
Stage 1, then the "pre"activation of phonology would allow the semanti-
Feedback cally driven process to exert an influence on the outcome of Stage 2
Feedback allows phonological neighbors of the target to be processing. Alternatively, even if there were no preactivation of phonology
but the multiple candidates activated at the L level by the semantically
competitive throughout the system. Specifically, through the back-
driven first-stage process were allowed to pass on their activation to
ward flow of activation, the phoneme level can influence the L Stage 2, then semantically generated candidates would also compete in the
level, and (with full feedback) the phoneme level can influence the phonological arena. Although these two aspects of cascading activation can
semantic level. One particularly significant consequence of this is be distinguished, they have largely the same consequences and are typi-
that phonological units receive activation not only from the target cally assumed to go together. Therefore, we assume both when referring to
word but also from the target's phonological neighbors (e.g., HAT, cascading activation.
INTERACTIVITY IN SPOKEN WORD PRODUCTION 465

o 0
@ o 0
@ s.~
0 o 0 .~ ~ o ,ii~"i
@15
¢GX@Je ¢+-@58
GN@,,, '.0.=

©4.@_> o o
0 @ Oo 0.,~@,,~Oo :'-"°°,""=..
0 0 =="
O 0 ='-I
.Ii I

~Io..

0 0 .°o~

8 8

$ _@sq)e$'""
oO=.¢ O+o '-'WXO*o
0 0 o 0
466 RAPP AND GOLDRICK

A DISCRETE SYSTEM

L LEVEL

f% @
PHONEMES

B CASCADING SYSTEM

®@@ L LEVEL

d
@@6) PHONEMES

C FEEDBACK SYSTEM

L LEVEL

PHONEMES

Figure 5. Consequencesof cascading activationand feedback. A: Activationflow in a discrete system--the


only active phonemes are those of the target. B: Consequencesof cascading activation--phonemesof semantic
neighbors of the target are active. C: Consequencesof feedback formalneighborsof the target are active at the
L level, and nontargetphonemes of formal neighborsare active.

which all levels interact directly with one another. For example, multiple levels of the system, then any process that serves to
Van Orden, de Haar, and Bosman (1997) sketched an account of exaggerate the difference between a winner and its competitors
reading involving full connectivity among and between all levels. will reduce the influence of the competitors on all subsequent
We do not further consider connectivity distance as a mechanism processing. In this way, strong selection points reduce the extent
for increasing interactivity; instead, we examine seriality and do- and the time course over which full and free interaction among
mains of interaction. levels is allowed.

Seriality
Domains of lnteraction
In Dell and O'Seaghdha (1992), Dell (1986), and Dell et al.
(1997), at the end of each stage, a selection occurs whereby the The theories considered thus far assume (at least implicitly) that
activation of the most highly active node is increased. Although the most, if not the only, relevant domains for understanding
processing is interactive until each selection point, selection intro- spoken naming are semantic and phonological (as well as syntac-
duces a strong ordering of the processing stages (seriality), thereby tic). However, the input domain (e.g., visually based processes in
reducing interactivity. If interactivity serves to strengthen the the case of picture naming) could also make a significant contri-
influence of competitors that share a target's characteristics across bution to spoken output (Humphreys, Riddoch, & Quinlan, 1988).
".~o~(~1 ~ u!~!,~ s~!unuoa,~,loqsuo.~oouuooa~i.u,u!sjo ONelS UO^!$ e Jo puo oql le polaOlOS suaol! ao tool! oql £iuo (a) ',(iuo
oouosoad at0 o~o!pu! SA~Oa.mpomnD -suogoouuoo o.mourtu,(s'ieuoDooa.rp!q o ~ l s i~u!ssoooad luoa.ma ot0 o~ pom,juoo s! ~u!ssooozd (q) 'uo.rtoa.a!p
ol~o!pu! sl.mn jo ~o,~ I uooA~lOCls~o.tm i~uo.rlooa!p!fl "(VS"IH) lunoo pa¢~ao,~ ,(laO.ms ~ u! spooooad ffu!ssoooad (g) oa~ Vd(I jo so.ns!
-oe ~!l~.Uos A~OI 'UODOV,.IO~U ! ZOqmL30t 0 Nuuo!dop o.~emoqoS "9 aang~.d
-aolot~xeqo ~..quoo oq,L "(Vd(l) ~unooav pa~,~ofpaaf a~aaos~.o •I
:SA~O[[OJS'e O.I~ suo.utsod OAI,~oql '~(lJo.~t "su.retuop uo.nogao~u!
souJouoqd jo uo!suedxo pug £l!Ft.uos poanpoa t0o q oonponu! ota 'faooql
q ~ oq~ zod "~Io~qpoojjo saoa~op IRIOAOSptre UOD~A.UOe~u.Ipgos~o
q)!A~ oamool.rqoa~ oloaos.tp g ~ut.~!pom £II~luomoaou! £q poluomoid
s),!un uepp!q -m[ o.i~ SO.LIOOt0 .inoj lsat.j oql ~uoui~ ,';aI.A.tl0~.IO:;Ut.u.t soouoloj.j.I(I
'(L66I "I~ '1o I[O(I 'E66I 'eqPq~t~oS,O ~g IIO(1:986I 'IIO(I) son~eol
¢ -1oo ptm iioG jo ~oot 0 ot0 tmq~ oA.r,o~aolut, oaom UOAO1unoool~
u~ ol 'suo.~.tsod ole.rpolmolu! leaOAos q~noaq~ 'lunoooe olozos.rp
soJn;8o=l OltUe','oS £[qN.rq e m o ~ ~u#tma suoD!sod l~oDaaoot00Ag pom,Jop OAt~q
OA~'puo s.n0 p~A~O,L "uoDonpoad paoA~ uo~Iods ol omq.~uoo £t~m
su.mmop uoDo~zo~u! ptm ',~.qe.uos '~Ioeqpooj 'uoDeADoe ~u.rpt~os~o
S~,lun uepp!q ) jo so.modoad oq~ A~oq ~m.u.m~aoose u! s! lsoaa~u! ~mtu.ud anO
IIOISUOUII(] .,(][A.~OI~.IO]III--SSOUO]O.K)S.I(I
Ol~q ~IIOIV S]U.IOd :so I.IOOLLL OA.b-I
soJm,eO:l lens!^ ~)
'(L66I "I ~ 1o
-aasqo s.lq,L "mms,(s moaos~ ~(lq~3~ e u! pmoodxo oq plnoA~ tmtll IIOO '.E66I 'ePqgeoS,O ~ IIOO '986I 'lIO(I) son~eOllOOpm~ ilO(i jo
,(ouonboa3 aoleoa~ ql!A~poAaosqo ore oaod <-- ~aoo ptm uaoq ~ uatrp uoD!sod oA.r,o~aolu.to ~ tmtI1 umnuguoo ~I.IA.UORIOlU.I--SSOtlOIO.IOs.Ip
OtI1 ~UOI~ aoqlanj s! ~et0 uo.u!sod ~ dolo^op ol soamool.rqa2m
se qans sao~o 'pon~a~ s! ~! 'sng.L "oamooLrtIOa~o~oaos!p e jo suo!l
-dmnsse o ~ aopun po~oodxo oq pInoA~ tm~ uogo oaotu sosuodsoa a3.oql ~depe p ~ ou!quloz om 'i~u.mmu ua~Iods moqe smrelo og!a
paoA~ looaaoau! soonpoad 'IOAOIlea!i~OlOUOqdo ~ ~e po~drus.rp UOqA~ -ods o ~ t u lou p!p (qg66I '~g66I) oo.ql~qS ptm meId qNnot0w
'mo~s£s uo!~onpoad paoA~Ot0 ~ t 0 m.mlOot0 ol saojoz so!q lVO?x~'-i •po~a~olu! ,(ii~uo!l~luosoadoa oq
£~ua 1! 'aoq~z '.aouu~ua poleNoa~os ~ u! poluasoadoa oq lou £~tu
looJJ~t su!fl IeO!xo-i OtLL uo.q~maoju.~ o!qd~a~oq~ao pue ogu~mos 'uo!~eluomoldm! jo ~aos
s!q~ u I "su.remop mdmo pug lndu.t oq) qloq jo sot.ls!aoloea~q3 oq~
•uo.u:vos Si~tI(Ifl£S ~ISVD oql u! oaoua OA~1uo luoso.ldoa :l~q:l pu~ NuT.uaeoI Jo osznoo o ~ u.t dOlOAOp ~eq~ soam
uodoa o~ pu~ 'suogaos NU!A~OIIO J oql u! osot0 Jo oA~loq.uosop OAk -zna~s Ieuo!letuzoju! se jo lq~noq~ oq uez saoloeaW¢ "~(~OlOUOqd
•s~mp.ug i~o.t.q.dmojo s~os anoj u~idxo ol ~l!l!qu ~oq~ ffu.m.m~xo pu~ sa!m~mos ao so!memos pu~ £qd~z~oq~ao s~ qans summ
£q suo!l!sod ~oDoaooql og.toods osoql o1~ni~^o oA~'o[o.rLre s.rql u I -op pol~ioz £i.u~al!qze uoo~loq ffu!ddetu ~ Nu!ua~oI jo ssoaoad
"(9 oznNM) u~.mop mdu! oam~oj l~nS[.At~ opniou! oql u! doiono p £em sao,tova~w 'mols,(s d G d ~ jo sa!ls.uolae~qo
ol su.mmop uog~ao~u ! oql Jo uo!stmdxo tm (q) pu~ (s~u!od uo!13OlOS oql uo ffu!puodo(I "sod2(] zo.uo jo oouoaanaoo-oa s!q~ I/u!pug~s
jo oouosq~ oql ,(q 1.rod u.t ls~oI ~ po^o.tqo~) ~!I~.Uos poonpoz -zopun uI alOZ ~uelaoduI! ug S£~ld szologzll~ jo uo!lou OtLL
(e) moto ol sppe pLm VIH Jo suo.udtunsse at0 jo II~ sopnput. "([66I 'oa!II~qS ~' uo,u.q-I
VS~_q "(Vg"lld) lunooav ~a~.l~.Uas ~tol 'uo~Aova~m. aag~and "~ OSl~ oos) sod£1 aoaao oldgintu soonpoad (sa.mn uopp!q jo saoggI
"((It, oan~kI) so.nu~mos ole.rpotuaolu! q~noah~) gNolouoqd oluo uato ptm sogtmtuos ol ,(qdea
oa aa~q IOAOI "I Oq) moaJ ~IagqpooJ (o3 osoq~ o~ sppv pu~ 'o~Us -$ot0ao mo~ sd~m leql uog~Inm~.s ~ jo ~md £tm ol uo.[sol OlNU!Se
qogo Jo puo oql ~g slu!od uoDoolos ~u.rpn[ou.[ 'X~I jo suoDdmns luq~ pomoqs £oql 'suoD~a)suomop uoD~inm!s jo so.uos ~ jo stream
-s~ oql [I~ (g) sldope ¥ I H '(VIH) ~unooov uo~.iooaalm. ~IS~.H "17 ~fl 'tuolsgs or0 jo so.modoad I~UOm~unj oaom mo O ao mo~s,~s
"(~)17o-I1~l.t:D[OAOI'-[ Or00] ~IOeq [OaOI omouoqd oql mo O £II~O! ~u!peoa oq~ U.II0.tA~.I001uo!so[ old.~inm moaj polinsoa sodk aoaao jo
-~oods uo.rlt~A!~Oejo A~OlJp~A~loeq ~ s! oaoql ~ql (o) uoDdmnsst~ oouoaanooo-oo s.rql aoq~oqA~poaop!suoo saot0n~ Oq~L"saoaao pox.Ru
oql osoql ol spp~ 1! ptm "o~ls qa~o jo puo oq~ le slm.od uo!loo[os ptm 'saoaao lenS.[A'S.IO,LIOODtreRIos--Su.lp~o:IUI sodk aoaao ald!llnm
(cO s~ ilOta s~ V~ID jo suoDdmnss~ UO.U~A90~8u!p~os~o oql Jo oonpoad sa!o~op Nutpuoa po~nbou q~!A~ sienp.[A!pm, u.relIOO :~I0
Ill~(~) sol~zodaoou! ~ "(VFar)~unooov uo~Aoaaa~u?pa~o.u~sa~t "~ UO!leAaOSqoOt0 aOJ lunooo~ plnoo oamao~.rqoa~d(Id ~s.moDoouuoo
•(fl~ oart~!~ saff~s ffm.~oIIo~ o~ o~ spooooad stuo~! A~oq ql!A~pomoOUO0 £i.uemgd oaoA~(~g661) oa.ql~qs ptre m~M
Il~ jo ~m.ssoooad 'oN~s qoeo jo puo oq~ 1~ s~m.od uo.~aOlOSo.m oaoq~ "[e lo IIOQ £q pomnsse tmql ~!l~.Uos
q~noqll'e (o) ptre '.so~s ~m.ssooo.[d :luonbosqns ol p.mA~oj sop~oseo aO~OA~ se I[O~ S~ su.remop uo.~ouao~u!aop~ozq t0.1~ suog.tsod i~o!
uo.~A.nO~ l~q* 'pealsm. 'mq oN~s Nm.ssoooad moamo oq~ ol Valo.msoa -lozooq~ jo suoD~uomoldU~, st~ (qg66I 'e£66I) oo.qleqs ptm m~Id
lou s! uo.ue^.Ua~ (cO mt0 soumss~ 1! V~t(I o~lu n '.uo.~a'a.qp.p.mmaoj .~q podo|oAop ~u.Iu~u oama!d ptm $ ~ . eoz jo suogt~inm!s (d(Id)
,qlo.ms e m . spooooad Nu!ssoooad (~) leql uo.udtunss~ at0 V~I(I Suissoooad pomq.ms.tp l O l l ~ d oq~ aop!suoo tmo ouo 'oidumxa
tO!m so.mqs V~ID "(VdD) mnooot, paoatJofp~f gm.poosoD "E .IO~I "mnnm3uoo .,~l[.A!loeaolut.at 0 jo puo omoa~xo at0 le lou st
"(Vt, oani~kl) oSels I/u!ssoooad qo~o jo puo oql ~ sm!od (E66I) ~qPq~goS,O ptm liO(i pug '(9860 IlO(2 '(L66I) uou~t~D pug
uogoolos a~op ore oaoq~ (p) ptm 'ouo ~u!~aOllOjaq~ ,e possoaoad oa~ 'tm~J~S 'u.meIAI ' z ~ q o S 'iio(i u! ~OOtla Otla 'oaoloaot 0 '£1a~aiD
L9iz NOLLDflGO~Id (I'dOA~N~AOdS NI hJ3ALLDV~IRJA([
468 RAPP AND GOLDRICK

vation has been used to argue that the spoken production system is successfully compete for selection than the phonemes correspond-
biased to produce words over nonwords. ing to nonwords (/h/should be a stronger competitor than/g/for
Presumably, in a highly discrete system, errors in selecting or the onset position). This is possible only because there is an active
encoding phonemes should result in words or nonwords simply on L-level process "pushing" the final selection outcome in the di-
the basis of chance. That is, according to a discrete account, the rection of the phonemes corresponding to alternative words of the
phoneme level "knows" nothing about words (it is "blind,' to language, such as HAT, BAT, MAT, and RAT (as well as words that
lexicality), and thus there is no reason why disruptions at the are phonologically related to semantic neighbors of the target and
phoneme level should preferentially result in word responses. only indirectly related to the target itself; e.g., CAT ~ DOG
It has not been straightforward to establish the statistical prob- LOG). In contrast, the phonemes corresponding to nonwords do
ability with which a phoneme-level disruption in DFA should not have this L-level support. This contrasts with the situation in
result in another word of the language. Nonetheless, a number of DFA, in which disruption in encoding the initial phoneme of CAT
studies involving both corpora of spontaneous and experimentally should result in a word or a nonword simply on the basis of the
elicited speech errors have made use of a variety of analysis number of words and nonwords that can be formed by changing
procedures and claimed to find lexical error rates higher than the first phoneme.
would be expected by chance under DFA (Baars, Motley, & This discussion illustrates that feedback is necessary for lexical
MacKay, 1975; Dell, 1986; Dell & Reich, 1981; Stemberger, bias effects. Cascading activation, in contrast, is not strictly nec-
1985). In addition, researchers have found evidence of lexical bias essary for lexical bias (see Best, 1996, on this point). For this
in the errors of certain aphasic individuals. For example, Best reason, lexical bias effects are problematic for theories of spoken
(1996) and Gagnon, Schwartz, Martin, Dell, and Saffran (1997) word production that do not allow phonologically related neigh-
found that formal errors (phonologically related word responses) bors of a target to compete for output. Of the five theories under
occurred at rates higher than would be expected under DFA (see consideration, lexical bias effects are clearly problematic for DFA
also Blanken, 1998; Laine & Martin, 1996). and CFA. The other three--RIA, HIA, and FILSA--adopt feed-
Although the existing lexical bias data are not unassailable, a back mechanisms that should, in principle, support lexical bias
true lexical bias effect would be problematic for DFA. Proponents effects. Nonetheless, there may be specific limitations, which, with
of the discrete feedforward position typically account for an ap- the help of computer simulations, we explore later in this article.
parent lexical bias effect by assuming a postencoding (and pre-
sumably prearticulation) "editor" that monitors the output before it The Mixed Error Effect
is articulated and vetoes nonword outcomes (Baars et al., 1975;
Butterworth, 1981; Kempen & Huijbers, 1983; Levelt, 1989). The Speal~ers sometimes substitute a target word for one that is
detection of an error by the editor stops the speech production related to it in meaning (zebra for giraffe, oyster for lobster). A
process, and word retrieval begins anew. This editorial mechanism question that has received considerable attention is the extent to
is, of course, error-prone, and word responses are more likely to which targets and errors in these semantic substitutions are also
slip past unnoticed than are nonword responses. This would result phonologically similar. A number of analyses of spontaneous and
in a lexical bias effect. 5 experimentally elicited speech errors claim that semantic errors
However, the postencoding editor (at least in its current form) is show a higher degree of phonological similarity than would be
a relatively unsatisfactory means of accounting for lexical bias. As predicted by DFA (Butterworth, 1981; Dell & Reich, 1981; Har-
pointed out by Dell and Reich (1980) and Martin, Weisberg, and ley, 1984; Martin et al., 1989; Stemberger, 1983; but see del Viso,
Saffran (1989), not only does it require seemingly needless redu- Igoa, & Garcia-Albert, 1991; Levelt, 1983).
plication of information, but because the specific nature of the As is the case for lexical bias, a central issue has been estab-
mechanism has remained unclear, the proposal is overly powerful. lishing the "chance" rates of mixed errors that would be expected
Rather than assume this type of editorial process, interactive under DFA. In this regard, mixed error analyses can generally be
theories account for lexical bias by pointing to the natural conse- divided into two types: (a) analyses based on predictions of rates
quences of cascading activation and feedback. Lexical bias can be of mixed, semantic, and other errors and (b) analyses of the errors
understood as follows: As activation passes from the L-level themselves based on predicted degree of phonological overlap
representation of a target (CAT) to its phonemes (/kL /ae/, It/), between targets and errors. The former are more typically carried
feedback connections send activation from these phonemes back to out with experimentally induced errors and the latter with corpora
all L-level units that share phonemes with the target, including of spontaneous errors where a determination of rate is often
L-level units corresponding to formal neighbors of the target (e.g., impossible because the number of items attempted and the number
HAT, BAT, MAT, RAT). The L-level nodes of these formal neigh- of correct responses are typically not known.
bors, in turn, activate all their constituent phonemes, including
those that are not shared with the target (/h/for HAT). These then 5 Levelt (1989) argued that independentevidencefor the postencoding,
reactivate their corresponding L-level nodes, creating what Dell prearticulatoryeditorcomes from findingsindicatingthat the magnitudeof
(1986) referred to as "positive feedback loops. ''6 Nonword out- lexical bias effects is strongly influencedby factors one might expect an
comes (e.g., GAT) do not enter into such relationships. When a editor to be sensitiveto. These are factors such as task demands and the
disruption occurs at the phoneme level--for example, difficulty in conversational setting. For example, Baal's et al. (1975) found no lexical
retrieving the first phoneme of CAT--the phonemes corresponding bias effect in an elicitationparadigmwhen all of the stimuliare nonwords.
to the phonological neighbors of CAT that have been activated 6 Positive feedback loops are similar, if not comparable, to attractors
through feedback connections will have the opportunity to more referred to in PDP modeling(e.g., Plaut & Shallice, 1993a).
INTERACTIVITYIN SPOKEN WORD PRODUCTION 469

The baseline rate (chance rate) of mixed errors or the degree of Other Evidence
phonological overlap between targets and errors that would be
expected under DFA is typically assumed to be (a) the rate of In addition to analyses concerned with errors, there are a number
mixed errors or phonological overlap that could be expected if a of experimental results relating to other types of predictions de-
true semantic error (arising from L-level misselection) just hap- rived from discrete and interactive theories. For completeness, we
pened to be phonologically similar to the target plus (b) the rate of review these very briefly.
mixed errors or phonological overlap that would be expected if a For example, a number of researchers have evaluated the ade-
true phonological error just happened to be semantically related to quacy of discrete and interactive accounts with regard to the time
the target plus (c) some correction for the possibility that errors course of semantic and phonological effects (see Levelt, Roelofs,
may occur at both levels on some trials (Martin et al., 1989; Plant & Meyers, 1999, for a review). Some of these studies provide
& Shallice, 1993a; Shallice & McGill, 1978). Analyses based on support for cascading activation and/or feedback (see Cutting &
these assumptions (Dell & Reich, 1981; Martin, Gagnon, Ferreira, 1999; Damian & Martin, 1999; Peterson & Savoy, 1998;
Schwartz, Dell, & Saffran, 1996; Martin et al., 1989) report mixed Starreveld & La Heij, 1995, 1996; but see Roelofs, Meyer, &
error rates significantly greater than predicted. In addition, signif- Levelt, 1996; Schriefers et al., 1990).
icant mixed error effects have also been reported for aphasic In addition, Nickels (1995) examined predictions of interactive
individuals (e.g., Blanken, 1998; but see Best, 1996). versus discrete accounts regarding the effects of semantic
The discrete feedforward position has typically responded to (concreteness-abstractness) and formal (length and frequency)
claims of significant mixed error effects either by pointing to the variables on the errors of aphasic individuals and found that the
weaknesses in the analyses or by invoking the postencoding editor. evidence favored a discrete position (but see Harley, 1995, for a
It can be argued that just as the editor is more likely to overlook response).
word versus nonword errors, it is also less likely to detect mixed Furthermore, in a very comprehensive study, Dell et al. (1997)
versus either semantic or phonological errors because mixed errors examined the picture-naming performance of a set of individuals
share both semantic and phonological characteristics with the with acquired neurological damage in order to determine whether
target. an interactive two-stage theory (with cascading activation and
The interactive position accounts for these "mixed" errors by feedback) could account for the range of individual error patterns
assuming that the feedback connections from phonology to L level (see also Foygel & Dell, in press; Martin & Saffran, 1992; Martin
and from L level to semantics allow for interaction between et al., 1989; Schwartz, Saffran, Bloch, & Dell, 1994). Through a
semantic and phonological processes. Given this connectivity, if series of simulation studies, Dell et al. showed (among other
something were to go wrong in the course of naming, mixed things) that the fit between observed and simulated patterns was
neighbors would be favored over competitors that were either only substantially better than the fit obtained for randomly generated
semantically or only phonologically related to the target. For patterns of errors (pseudopatients). This successful fit between
example, if there were difficulties in encoding the first position patient data and simulation data lent general support for the inter-
phoneme for CAT, the phonemes corresponding to its phonological active two-stage account of naming. We return to this study in the
neighbors HAT, MAT, and RAT would be active as well as those General Discussion section.
corresponding to its semantic neighbors DOG and RAT (these
semantic competitors would not have been turned off at L-level Plan for the Remainder of This Article
selection, and their activation would be further maintained through
the feedback structure). Other things being equal, the phonemes In sum, the evidence from mixed errors and lexical bias provides
corresponding to mixed neighbors such as RAT (which received support for the assumptions of cascading activation and feedback.
support from both the semantic and phonological processes) As indicated earlier, this evidence has not gone unchallenged.
should have the competitive advantage over the phonemes of However, even if we were to assume its validity, the existing
either a semantic or a formal neighbor such as DOG or MAT evidence does not distinguish between the various theoretical
(which has been supported by either semantic or phonological positions that assume cascading activation and feedback (e.g.,
processes, but not both). RIA, HIA, and FILSA).
Although interactive accounts that include both cascading acti- This article has two empirical sections. In the first,, we present
vation and full feedback connectivity can certainly account for three case studies of neurologically injured individuals. These case
mixed error effects, the respective contribution of each of these studies establish two further sets of empirical results that provide
features has not been systematically explored. We show through important constraints on theories of spoken word production. In
our simulation work that cascading activation alone is sufficient to the second empirical section, with the assistance of computer
account for a mixed error effect arising at the phonological level simulation, we carry out a detailed evaluation of the five theoret-
and, furthermore, that cascading activation plus feedback to the L ical positions we have identified. They are evaluated with respect
level is sufficient to account for mixed error effects that arise in the to their ability to account for four sets of findings: the mixed error
course of L-level selection. Thus, mixed error effects that are and lexical bias effects in unimpaired individuals as well as the
problematic for a strictly discrete feedforward account such as two sets of findings from our study of the neurologically injured
DFA can be accommodated under all of the other architectures individuals. We show that these latter results not only provide
(CFA, RIA, HIA, and FILSA). In our simulation work, we explore further evidence in support of cascading activation and feedback
the specific capacities and limitations of the five architectures in but also allow us to go some distance in adjudicating among the
this regard. various interactive accounts of spoken word production.
470 RAPP AND GOLDRICK

Table 1
Results of Lexical Testing for K.E.

Task Correct Semantic Similar word Nonword Other word "Don't know"

Oral picture naming 56% 38% 0% 0% 0% 6%


(n = 335)
Written picture naming 55% 32% 0% 7% a 1%b 5%
(n = 335)
Oral reading 58% 35% 1% 0% 0%b 7%
(n = 335)
Writing to dictation 59% 25% 1% 10%a 2% 4%
(n = 335)
oral tactile naming 53% 45% 0% 0% 0% 2%
(n = 47)
Written tactile naming 60% 34% 0% 2% 0% 4%
(n = 47)
Auditory comprehension 58% 40% 0% 2%¢ 0%
(n = 191)
Reading comprehension 65% 28% 1% 7% ~ 0%
(n = 191)

a Includes misspelled, possible semantic errors (e.g., crab ---* [clam] ~ clim), b Includes possible visual --->
semantic errors (waist ~ [wrist] --* watch) and semantic --~ visual errors (check ~ [rouge] ~ rug). ¢ Rejection
of correct word-picture matches.

Case Studies One important characteristic of K.E.'s performance is that he


produced no phonologically related responses in oral naming, but
Case 1: K . E . m S e m a n t i c Deficit, S e m a n t i c Errors Only instead he produced only semantic errors or " d o n ' t know" respons-
K.E. was a 52-year-old, right-handed man with a master of es. 7 K.E.'s (6%) " d o n ' t know" responses may represent instances
business administration degree who worked as a high-level man- when the semantic system was not able to settle on any likely
ager in a large corporation when he had a thromboembolic stroke 6 candidate. For our purposes, what is important is that there is no
months prior to the initiation of the testing reported in this article evidence that these responses originated in a faulty phonological
(see Hillis, Rapp, Romani, & Caramazza, 1990, for a detailed process. A deficit at a phonological level should result in at least
report). A C T scan 5 days postonset revealed a large area of some phonological errors; the fact that none were observed allows
decreased density in the left fronto-parietal area, consistent with an us to assume that the " d o n ' t know" responses did not have a
infarct in the distribution of the left middle cerebral artery. Testing phonological origin.
indicated intact visual perception, reasoning, and arithmetic skills. The second striking observation is that K.E. made semantic
K.E.'s spontaneous speech was sparse, consisting primarily of errors at similar rates on all tasks--in production and comprehen-
nouns, semantic paraphasias, and overlearned phrases. His repeti- sion; across auditory, written, and tactile input modalities; and
tion, however, was good. K.E. had significant difficulties in spo- across written and spoken output modalities. This pattern points to
ken and written comprehension and production of single words, a common error source--presumably a lexical semantic-level def-
and a number of tasks were administered to examine more sys- icit affecting comprehension and production in all modalities, s The
tematically these lexical difficulties. only alternative account would be that of multiple impairments to
K.E. was tested with two sets of stimulus items. Set 1 consisted numerous input-output mechanisms. This possibility was exam-
of 144 stimulus items from 10 semantic categories. Set 2 consisted ined and rejected in Hillis et al. (1990) on the basis of item
of 47 items from 5 semantic categories. Both sets of stimuli were consistency analyses.
presented in four production tasks (oral picture naming, written
picture naming, oral reading, and writing to dictation) and in two
comprehension tasks (auditory word-picture matching and written 7 Although immediately after the stroke K.E. exhibited some oral-verbal
word-picture matching). In addition, items from Set 2 were also apraxia in attempts to produce words or nonverbal oral movements, this
presented for spoken and written tactile naming. For details of test quickly resolved such that, by the time of the testing reported here,
administration, see Hillis et al. (1990). sound-based errors were virtually absent from his spoken production. In
testing with a different set of items, K.E. made two or three errors in oral
Responses were scored as semantic errors if they clearly shared
reading that consisted of responses that were ambiguous in terms of
a semantic relationship with the target; nonword responses if they
whether they were either visually or phonologically similar to the target.
did not correspond to words of the language; similar words if they Given the absence of such errors in his naming, however, we assumed that
were not semantically related to the target but shared 50% of the in reading these errors arose as a result of the contribution of sublexical
target's phonemes; don't know responses if K.E. did not produce a processing to the oral reading output (Hillis & Caramazza, 1995).
response; or other, which included words that were neither pho- 8 In addition to the central semantic deficit, K.E. showed evidence of an
nologically similar nor obviously semantically related to the target. additional impairment specific to written production. This impairment
Results for all tasks are reported in Table I. manifested itself in numerous misspellings that involved letter deletions,
INTERACTIV1TY IN SPOKEN WORD PRODUCTION 471

Table 2
Results of Lexical Testing for P. W.
Task Correct Semantic Similar word Nonword Other word "Don't know"

Oral picture naming 72% 19% 0% 0% 3% 6%


Written picture naming 46% 12% 3% 17% 6% 16%
Spoken word-picture
matching 95% 5% 0% 0%
Written word-picture
matching 94% 6% 0% 0%
Repetition 100% 0% 0% 0% 0% 0%

In sum, the results of K.E.'s testing clearly point to a lexical be the source of his semantic errors in spoken and written
semantic locus of impairment. Importantly, the fact that K.E. naming. Several additional pieces of evidence support this
produced semantic errors and no sound-based errors indicates the conclusion. P.W.'s semantic errors were often accompanied by
lexical production system, must be structured in such a way that a verbal indication that he knew he had not produced the correct
damage at the semantic level can result in a pattern of pure word (e.g., picture of a zebra --->"horse, not r e a l l y . . , a horse
semantic errors. in the jungle"). This suggests that he was not confused about the
word's meaning. Also, on many occasions when he made a
semantic error in spoken naming, he was able to spell the
Case 2: P. W.--Postsemantic Deficit, correct word (e.g., picture of a tiger --->"lion"; T-I-G-E-R). In
Semantic Errors Only fact, in a series of picture-naming tasks in which P.W. was
P.W. was a 51-year-old, right-handed man who had completed required to say, write, say (or write, say, write) the name of
one semester of college and was employed as a manager of a meat each picture on each triM, he was able to write the name of the
department when he had a left-hemisphere stroke that resulted in picture on about 20% of trials when he was unable to provide
written and spoken language difficulties. CT scanning revealed an the correct name in spoken form (Rapp et al., 1997).
ischemic infarction of the left parietal and posterior frontal areas. ff semantic errors do not originate from a semantic deficit, then
The results we report were obtained between 24 and 48 months how do they come about? Caramazza and Hillis (1990) argued that
after the cerebral vascular accident (CVA; for further information, semantic errors may have either a semantic or a postsemantic
see Rapp, Benzing, & Caramazza, 1997; Rapp & Caramazza, basis. They assumed that semantic representations activate phono-
1998). logical lexemes to a degree proportional to their shared semantics,
P.W. was asked to carry out the following tasks with 258 items such that when one is intending to say "tiger," the phonological
from Snodgrass and Vanderwart's (1980) picture set: spoken pic- lexemes of lion, cat, leopard, and so forth will also be activated.
ture naming, written picture naming, repetition, and written and Given this, if (as a result of postsemantic damage) the tiger lexeme
spoken word-picture matching. The tasks were administered as for is relatively unavailable, then the most highly activated word (e.g.,
K.E., and the results are reported in Table 2. "lion") may be produced (see also Nickels, 1992). With regard to
P.W.'s performance was similar to that of K.E. in one important P.W., Rapp et al. (1997) argued that he had postsemantic damage
respect and crucially different in another. In oral picture naming, affecting the retrieval of both phonological and orthographic
P.W., like K.E., produced a considerable proportion of semantic lexemes. 9
errors but no phonological errors (similar words or nonwords);
sound-based errors were also not observed in repetition. However,
P.W. was markedly different from K.E. in that his word compre-
hension was largely normal, as indicated by his accuracy of 95% Case 3: CS.S.--Postsemantic Deficit,
on the spoken word-picture matching task (for normal elderly Multiple Error Types
control participants [n = 14], accuracy ranged from 96% to
100%). C.S.S. was a 62-year-old, right-handed man with 3 years of
P.W.'s performance on the comprehension task was essen- college who worked as a jet engine testing engineer before having
tially normal, indicating that a semantic deficit was unlikely to a left-hemisphere stroke 26 months before the initiation of our
work with him. CT scanning revealed a left parietal infarct and a

substitutions, additions, and transpositions, for example, pelican --->PELI- 9 We further argued that the deficit in spoken naming was lexical and
CAR, and a number of errors that, presumably, are misspellings of semantic prephonological because we found highly significant effects of lexical
errors (e.g., shark -->ALLIGATE).In other work (Caramazza, Micefi, Villa, frequency on accuracy but we found no significant effects of length or of
& Romani, 1987), these types of errors have been attributed to damage to phonological complexity (number of clusters; Rapp et al., 1997). These are
the graphemic buffer--a postlexical locus of impairment within the spell- factors that, it has been argued (Butterworth, 1992), affect postlexical
ing system. phonological processing.
472 RAPP AND GOLDRICK

Table 3
C.S.S. 's Performance on a Set of Lexical Processing Tasks

Task Correct Semantic Similar word Nonword Other word "Don't know"

Oral picture naming 82% 6% 3% 7% 2% 0%


Written picture naming 82% 2% 0% 8% 8% 0%
Spoken word-picture
matching 100% 0% 0% 0%
Written word-picture
matching 100% 0% 0% 0%
Repetition 96% 1% 1% 2% 0% 0%

small lacunar infarct in the fight basal ganglia. In addition, and apparently be damaged such that a pattern of only semantic errors
prior to the infarct, C.S.S. had had, for a number of years, an can arise from either a semantic or a postsemantic locus of
asymptomatic and stable left frontal lobe menlngioma. damage.
C.S.S.'s spontaneous speech was marked by frequent hesitations
and word-finding difficulties as well as semantic and phonological Phonological Influence on Semantic Errors:
errors. We examined his single-word processing in a number of Mixed Error Effects ?
tasks, including Spoken picture naming, repetition, and spoken and
written word-picture matching. Stimuli for these tasks were A pattern of only semantic errors in naming in the absence of
Snodgrass and Vanderwart's (1980) 258 items that were used with phonologically similar word or nonword errors would seem to
P.W. Task administration and error classification procedures were suggest little or no influence of phonology at the level of the
the same as those described above. The list was administered twice system at which the errors were generated. The absence of such
for the tasks of oral naming and repetition and once for spoken and errors does not, however, rule out the possibility that the semantic
written word comprehension. The results are presented in Table 3. errors themselves show a mixed error effect. Given this, we carried
Like P.W., C.S.S. performed well in spoken and written word out a series of analyses to determine the extent to which the
comprehension. However, unlike P.W. and K.E. and although his semantic errors produced by K.E., P.W., and C.S.S. were phono-
overall accuracy was higher, C.S.S. produced multiple error types logically similar to the target words. Under DFA, semantic errors
in spoken naming: semantic errors, similar word responses, and a should be no more phonologically similar to the target than one
significant number of nonword responses. Furthermore, unlike would expect by the random pairing of words. This is because, as
P.W. and K.E., C.S.S. made similar types of errors (albeit at far indicated earlier, according to this theory there is no mechanism by
lower rates) in repetition as well. The intact comprehension indi- which words that are both semantically and phonologically similar
cates a postsemantic deficit; additionally, the presence of phono- to a target should have an advantage over words that are only
logically similar word and nonword errors is potentially indicative semantically (or only phonologically) similar.
of a deficit to a level of phonological processing. Further indica- To obtain a larger set of errors for the mixed error analyses, we
tions of phonological involvement are that repetition was also collected responses from the administration of various additional
error-prone and there was a significant word length effect. On this sets of stimuli. To create similar conditions across the three indi-
basis, we assume that C.S.S. is likely to have some difficulty viduals, we restricted the analyses to items from eight categories
affecting his ability to retrieve and/or encode the phonemes cor- with which all three were tested (vehicles, vegetables, clothing,
responding to the words he is trying to produce. We cannot, birds, furniture, fruit, animals, and body parts). 1° This procedure
however, be sure that this is the only, or even the primary, source provided us with 67 target-error pairs for C.S.S., 109 for P.W.,
of his errors. We consider single and multiple damage loci for and 63 for K.E.
C.S.S. in evaluating the five candidate theories.
Analysis 1: Overall Phonological Overlap Index
Interim Summary
The Phonological Overlap Index (POI) between targets and
We have described the performance of 3 individuals with def- errors was determined for each individual as follows: (a) Each
icits affecting their spoken naming abilities. Both K.E. and P.W. phoneme of a target word was compared with each phoneme of the
exhibited a pattern of only semantic errors in naming, whereas
C.S.S. produced multiple error types. Importantly, the data indicate
that a pattern of only semantic errors can arise either from a 1oMost semantic errors consisted of items with a coordinate relationship
to the target. There were, however, some responses that were semantic
semantic locus of impairment (K.E.) or from a postsemantic locus
associates of the target (lemon ~ sour) or that were the target's superor-
(P.W.). A pattern of mixed error types (C.S.S.) may involve
dinate category (e.g., caterpillar ---->bug). There were few such responses
additional damage affecting phonological retrieval or encoding. In (two errors for K.E., three errors for C.S.S., and eight errors for P.W.).
this regard, C.S.S. resembles others in the literature (Dell et al., Nonetheless, it is possible that these do not represent "true" semantic errors
1997; Ruml, Caramazza, Sbelton, & Chialant, in press). These but rather an attempt to explain the meaning of the target. If so, a true
results establish the first set of findings from impaired perfor- phonological effect in naming might be obscured. To prevent this, we
mance that we are concerned with: The spoken naming system can removed all of these errors.
INTERACTIVITY IN SPOKEN WORD PRODUCTION 473

Table 4
Phonological Overlap Index (POI) Values for the Three Participants as Well as the
Characteristics of a Baseline Distribution of POI Values Obtained
From 5,000 Random Re-Pairings of Stimulus-Response Sets

Participant POI Estimated p Adjusted POI Estimated p Readjusted POI Estimated p

K.E. .134 >.32 .124 >.722


P.W. .220 <.0002 .201 <.0002
C.S.S. .301 <.0002 .291 <.0002 .259 <.0002

Mean POI SD Minimum Maximum


Baseline distribution .129 .01 .097 .169

response, and each match was assigned a value of 2 (indicating that nological influence on semantically driven processing, then it is
two of the phonemes in the target and response pool were shared); crucial to eliminate any possible artifactual phonological effects.
(b) the proportion of shared phonemes for each target-response We carried out an analysis to determine the average POI value
pair was calculated by dividing the number of matches by the total for items within each of the eight semantic categories, as well as
number of phonemes in target plus response; and (c) the propor- the average POI value for items from different categories. To do
tions of shared phonemes for each target-response pair were then so, we first specified the members of the eight categories by
averaged to generate an overall POI value for each individual.H taking, for each category, all items listed as a member of that
Using this technique, we obtained the following overall POI val- category by at least 20 of the 442 participants in Battig and
ues: .134 for K.E., .220 for P.W., and .301 for C.S.S. Montague's (1969) norms. This provided us with reasonably large
Interpreting these POI values requires determining the baseline categories made up of fairly typical members.
POI values that would be expected given the general characteris- A mean within-category POI for each category was calculated
tics of the English language and the specific types of stimuli with
by randomly pairing items within a category and then computing
which the these individuals were tested. To determine this, we
the mean POI. Random within-category pairings were carded
randomly paired all targets and errors produced by the 3 individ-
out 1,000 times for each category, and then the mean POI for
uals. We then computed a mean baseline POI in the same manner
the 1,000 values was calculated for each category. The mean
as described above. We carded out 5,000 such random pairings of
across-category POI for each category was determined by ran-
the items in the target-error list, generating 5,000 baseline POI
domly pairing the items of one category with items from the other
values. The mean POI value of this set was .129, with a standard
seven categories and then computing a mean POI. Again, this was
deviation of .01 and minimum and maximum POI values of .097
and .169, respectively (Table 4): carded out 1,000 times for each category, and the mean POI values
The frequency with which a value equal to or greater than each for each set of 1,000 values were calculated (Table 5). Averaging
individual's observed POI was generated in the 5,000 baseline runs across the eight categories, the average within-category POI was
provides a direct estimation of p values (see Krauth, 1988, for a .175, and the average across-category POI was .148. That is, on
discussion of distribution-free statistics). As shown in Table 4, the average, words within a category shared 17.5% of their phonology,
results indicate that the POI values observed for P.W. and C.S.S. whereas words across categories shared 14.8%. Importantly, an
were never observed in 5,000 random pairings (p < .0002). average of 2.7% of the phonemes of two items from the same
However, a POI value equal to or greater than K.E.'s was gener- category were shared simply by virtue of their common category
ated 1,605 times (p > .32). Thus, whereas P.W.'s and C.S.S.'s membership. 12 This difference was significant in one-tailed test-
semantic errors were clearly more phonologically similar to the
target words than would be expected given a random pairing of
words, K.E.'s responses were not. t ~A number of other features could have been considered in calculating
phonological overlap, including number of phonemes in the same syllabic
position, number of syllables, stress pattern, and so forth. Any choices are
Analysis 2: Adjusted Phonological Overlap Index obviously theory dependent, in that they rely on assumptions about what
"counts" as similar to the activation process; for example, are only words
We assumed baseline rates to be comparable with the overlap with the same phonemes in the same syllabic position confusable? Current
observed by randomly pairing items. However, given that the understanding of these issues is so limited that our choice was driven
primarily by ease of computation. What is critical is that the same criteria
errors are from the same semantic category as the targets, there is
used in establishing the observed POI values were applied in establishing
a possibility that words within the same category are more pho- chance POI rates.
nologically similar to one another than to words from other cate- ~2Because some categories include items with possibly shared morphol-
gories. If so, then some proportion of the overlap observed in the ogy (e.g., bicycle, tricycle, motorcycle), we re.computed the within- and
semantic errors would come "for free" merely by virtue of their across-category POI, excluding these morphologically related items. There
being semantic errors. This is important because if we take a were relatively few such items, and the re.computed values were virtually
significant mixed error effect to be an indicator of external pho- identical to the original values.
474 RAPP AND GOLDRICK

Table 5 possibility that some of C.S.S.'s semantic errors arose from dam-
Average Phonological Overlap Index (POI) Valuesfor Items age to more than one part of the system. Under DFA, we must
From the Same Semantic Category and POI Valuesfor Items consider the possibility that some of these errors may have come
Across Semantic Categories from a phonologically based error-generation mechanism that pro-
duces phonological errors that just happen to be semantically
Within Across- related to the target. Any such errors need to be removed from
Category category POI category POI Difference
C.S.S.'s corpus of semantic errors and a mean POI value needs to
Transportation .248 .170 .078 be recalculated.
Vegetables .214 .169 .044 C.S.S. is an interesting case in that the analysis of his data
Clothing .193 .143 .050 presents some of the same difficulties that one faces in analyzing
Birds .150 .138 .012
normal slips of the tongue--in both cases it is possible (indeed,
Furniture .143 .145 - .002
Fruit .211 .153 .058 likely) that errors arise at multiple levels of the processing system.
Animals .124 .136 -.012
Body Parts .117 .127 -.010
Analysis 3: Readjusted Phonological Overlap Index
M .175 .148 .027
for C.S.S.

To determine the proportion of C.S.S.'s semantic errors that


may have been generated by a phonologically based error mech-
ing, t(7) = 2.19, p < .05, and marginally so in two-tailed testing.
anism, we collected all of C.S.S.'s nonsemantic word responses to
The finding of an overall POI advantage for within- versus across-
items within the eight semantic categories, that is, all real word
category items has implications not only for the evaluation of
responses that bore no apparent semantic relationship with the
phonological effects in the semantic errors of the 3 individuals we
target (e.g.,/g o t / - - ~ / n o t/); there were 35 such errors. We then
studied but also for the evaluation of mixed error effects in slips of
calculated the phonological overlap for each of the target-error
the tongue of normal speakers.
pairs and compared the distribution of overlap values for these
The finding of a positive and significant POI effect of category
nonsemantic word errors with the overlap distribution for the
membership indicates that we may have overestimated the effect
semantic errors. The two distributions were quite different. The
of external phonological influence in the genesis of the semantic
mean POI value for the distribution of nonsemantic word errors
errors. To take into account the within-category POI advantage, we
was .60 (SD = .18, minimum = .25, maximum = .89), and the
calculated an adjustment value for each of the 3 individuals. To
most frequent overlap values fell in the .60-.80 range. In contrast,
compute these adjustment values, we first counted the number of
the mean POI value for the semantic errors was .301 (SD = .21,
items from each individual's data set in each of the eight catego-
minimum = 0, maximum = .71), and the most common overlap
ries. We multiplied these numbers by the within- versus across-
values were in the .21-.40 range. To eliminate all errors in the
category POI difference for each category, and then an average
semantic distribution that might have been produced by the pho-
adjustment value was calculated for the entire list. For K.E., this
nological process, we needed to reduce the semantic distribution
adjustment value was .009; for P.W., it was .019; and for C.S.S.,
by the largest possible proportion of such errors. At the same time,
it was .010.13 We adjusted the POI values reported above for each
the adjustment needed to conform to the characteristics of the
individual by subtracting the adjustment values from them. The
distribution of the phonologically based errors. To accomplish this,
resulting adjusted POI values were .124 for K.E., .201 for P.W.,
we assumed that all of the semantic errors within the .61-.80 range
and .298 for C.S.S. When we compared the adjusted values with
(the peak overlap range for the phonological process) were actu-
the values in the baseline distribution, we found (Table 4) that the
ally generated by the phonological process. We eliminated all of
adjustment had no significant consequences for the pattern re-
these and then further reduced the remaining semantic error dis-
ported prior to adjustment. A POI value at least as large as that
tribution in a manner proportional to the distribution of nonseman-
observed for K.E. occurred in the random pairings 3,611 out
tic word errors (see Figure 7). Removing these values (13% of the
of 5,000 times (p > .72). In contrast, even with the adjustment,
total number of semantic errors) presumably left us with the set of
P.W.'s and C.S.S.'s POI values were never generated by a random
semantic errors that were not likely to have been generated by the
pairing of items.
phonologically based error mechanism. 14
The results of the mixed error analyses can be summarized as
We then recalculated the POI over the remaining items and
follows: The individual with semantic-level damage (K.E.)
found that the readjusted POI value of .259 (actually .269 - .01 for
showed no mixed error effect, whereas the individual with inde-
the within-category error adjustment) was, as before, never ob-
pendently established postsemantic damage (P.W.) showed a
served in the random baseline distribution. In sum, when we
strong and robust mixed error effect. The situation with C.S.S. is
eliminated all semantic errors that could have arisen accidentally
more complex, however. The mixed error analysis carded out thus
far assumes a single semantically driven, error-generating mech-
anism. As described above, on the basis of performance across a 13 For all 3 individuals, the adjustment value was lower than the average
range of tasks, this assumption is motivated for K.E. and P.W. For across all categories because the data sets included many items from the
K.E., this mechanism is presumably at the conceptual-semantic categories of animals and body parts, where there are slightly negative
level, whereas for P.W., it may be a process involved in L-level within- versus across-category POI differences.
activation, selection, or both. However, we cannot rule out the 14 Gislin Dagnelie deserves credit for proposing this adjustment method.
INTERACTIVITYIN SPOKEN WORD PRODUCTION 475

60% obtained value of .319 indicates a highly significant mixed error


--0-- SemanticErrors ] effect. The similarity in the results for P.W., R.G.B., and C.S.S.
---12-- Adjustment increases our confidence in the finding that semantic errors arising
50% from a post,semantic locus of damage may exhibit a robust mixed
error effect.

40%
The Evaluation of Five Theories
W
In this section, we use computer simulation to assist us in
"6 30%
evaluating the extent to which five theories, representing five
points along a continuum of discreteness-interactivity,can account
fl_ for the following four sets of findings: (a) a mixed error effect in
20°/° unimpaired individuals, Co) a lexical bias effect in unimpaired
individuals, (c) a pattern of only semantic errors arising from either
a semantic (K.E.) or a postsemantic (P.W.) locus of damage, and
10%
(d) the differential effects of phonology according to impairment
locus--the absence of phonological effects on semantic errors
resulting from semantic damage and the presence of phonological
0%
effects on semantic errors resulting from postsemantic damage.
0 1-20 21-40 41-60 61-80 >80
Additionally, we consider the adequacy of each of the theories
with regard to the pattern of multiple error types exhibited by
Percent Target/Error Overlap C.S.S. because this represents a commonly reported pattern (see
Dell et al., 1997; Ruml et al., in press).
Figure7. Distributionof C.S.S.'s semanticerrors in terms of percentage As indicated in the introduction, the five theories we simulate
of phonological overlap between target and error. The line with open are (a) DFA: no cascading activation, no feedback; (b) CFA:
squares indicates the proportion of the distribution removed in order to cascading activation, no feedback; (c) RIA: cascading activation,
correct for the possible contributionof a phonologicalerror process.
partial feedback from phoneme levels to the L level; (d) IliA:
cascading activation, feedback between all adjacent levels; and (e)
FILSA: cascading activation, feedback between all adjacent levels,
low seriality, and extended interaction domain.
from a phonological process, we still observed a strong and sig-
This work has two objectives: (a) to determine which one of
nificant effect of phonology on C.S.S.'s semantic errors.
these particular theoretical positions best accounts for the data and
(b) to draw more general conclusions regarding discreteness and
Summary of Findings From Impaired Naming interactivity in spoken word production.
We first determined the locus of damage for each of the 3
participants on the basis of the pattern of accuracy and naming
errors across a range of tasks. We then examined the possibility of
Simulation Information
phonological influence on the semantic errors. We determined the The simulations of DFA, CFA, RIA, and HIA all use the same
locus of damage independently of, and prior to, examining the
lexicon and the same localist connectionistprocessing architecture.
extent of phonological influence. There are two sets of findings
These are described in the following section. The simulation of
from these case studies that may provide important constraints on
FILSA is based on a PDP architecture used by Plant and Shallice
theories of naming: (a) The naming system can be damaged such
(1993a) and is described in the section on the evaluation of FILSA.
that a pattern of only semantic errors can result from either a
semantic locus of damage or some postsemantic locus, and (b)
whereas a semantic locus of damage need not give rise to a mixed Simulation Neighborhood
error effect (K.E.), a postsemantic locus may do so (P.W. and
C.S.S.). Following Dell et al. (1997), we simulated spoken naming with
Of particular interest is the pattern exhibited by P.W.--a pattern the naming of a single consonant-vowel-consonant(CVC) target
of only semantic errors with a significant mixed error effect. Given word presented in a neighborhood with the error opportunities of
the potential importance of this pattern and in order to consider its an "average" English neighborhood. The error opportunities for
replicability, we carded out an analysis of the semantic errors of English were reported by Dell et al. and are presented in Table 6.
another individual who had a functional deficit apparently quite This table indicates that for any given target word, a random
similar to P.W.'s. R.G.B. (Caramazza & Hillis, 1990) was de- selection of phonemes of the language will yield a nonword
scribed as producing a pattern of only semantic errors in spoken outcome 80% of the time and a word outcome 20% of the time.
naming as a result of postsemantic damage. Hillis and Caramazza When the outcome is a word, it will be an unrelated word 10% of
kindly provided us with the set of R.G.B.'s spoken naming errors, the time, and so forth. The simulation neighborhood was also
and we computed the POI value for R.G.B.'s semantic errors. The designed to be consistent with our findings (reported above) of
476 RAPP AND GOLDRICK

Table 6 L-level units, whereas HIA assumes additional feedback connec-


Error Opportunities in English (Based on Dell et al., 1997) and tions between the L level and semantic feature layers.
for the Neighborhood Adopted in the Simulations of the Discrete
Feedforward Account, the Cascading Feedforward Account, Processing Characteristics and Parameters
the Restricted Interaction Account, and
the High Interaction Account Processing time in the network takes place in a series of steps;
at each step, the activation of units is updated synchronously. Each
Estimated error Simulation error unit's activation is determined by its own activation at the previous
Error category opportunity opportunity time step as well as by the activation (at the previous time step) of
Nonword .80 .75 the units that connect to it. We used the following activation
Word .20 .25 function, adapted from Dell et al. (1997):
Semantic .01 .018
Mixed .004 .009 Aj(t) = Aj(t - 1)(1 - Qj) + .~,P,fi,(t - 1) + GN(Aj(t - 1), Nj).
Formal .09 .099
Unrelated .10 .126 Aj(t) is the activation (A) of any unit j at time (t). Q is the decay
parameter (set in our simulations to .517). P is the parameter
determining the weight on the connections between units. All
weights connecting one layer with another had the same positive
greater phonological overlap within versus across semantic value of .04 except when we carried out manipulations of feedback
categories. 15 strength. Noise is generated by selecting a random number from a
To match the error opportunity and phonological overlap con- Gaussian function (GN). This function has a normal distribution
straints, we designated words from three independent pools of with a mean of 0 and a standard deviation proportional to the
position-specific phonemes--corresponding to the onset, vowel, activity of the unit at the previous time step (Aj[t - 1]) and the
and coda of CVC words. We used seven onsets, four vowels, and noise parameter (Nj). In normal processing, we assume that N is
four codas. The lexical neighborhood we created consisted of a at or near 0, and we assume that brain damage corresponds to a
target word and 28 neighbors (Table 7). This neighborhood well pathological increase in N.
matched the estimated error opportunities in the language (Table In addition to the parameters P, Q, and N, additional parameters
6) and had a greater within (.22) versus across (.173) semantic include step number and jolt size. Processing in the simulation
category POI value. begins with an initial activation or "jolt" to each of the 10 semantic
units comprising the semantic representation of the target word.
Simulation Architecture
We used a single four-layer network to simulate the naming is Dell et al. (1997) used two networks with different neighborhood
process from conceptual processing to phoneme selection (Figure structures to simulate the structure of a single prototypical neighborhood.
8). The network builds on the three-layer network of Dell et al. One neighborhood consisted of a target, one semantic and two formal
(1997), to which we added a concept layer. We did so to examine neighbors, and two unrelated words, and the other neighborhood consisted
K.E.'s pattern of semantic/conceptual-level damage. In effect, we of a target, one semantic neighbor, one mixed neighbor, one formal
simulated a three-stage theory of naming by adding a stage that is neighbor, and two unrelated words. The first was sampled on 90% of
assumed, although not usually discussed, in current theorizing (see naming trials and the second on 10%. Dell et al.'s neighborhoods were
potentially problematic in two ways. First, the manner in which sampling
Levelt et al., 1999). The three stages are Stage 1 (conceptual
from the lexical neighborhood was implemented put a ceiling on the
processing), Stage 2 (L-level selection), and Stage 3 (phonological
number of mixed errors that could be observed. Second, the greater
retrieval). across-category versus within-category POI values for Dell et al.'s lexicon
The first layer consists of 29 lexical concept nodes, one for each overpredicted formal error rates. In the neighborhoods adopted by Dell et
word in the neighborhood. The second layer consists of 281 al., the overall (across the two neighborhoods) within-category POI was
semantic feature nodes. Each concept is connected to 10 semantic .033, and the across-category POI was .32. That is, the structure of the
feature nodes that make up each word's semantic representation. neighborhoods actually favored the activation of formal neighbors over
Words within the same semantic category (the target and its three mixed neighbors.
semantic neighbors) share 3 semantic feature units, and words In a simulation investigation, Ruml et al. (in press) examined the role of
outside the semantic category share none. The semantic feature simulation neighborhood characteristics under various types of simulated
damage. Somewhat surprisingly, and despite the obvious aforementioned
nodes are, in turn, connected to any of the 29 L-level nodes that
shortcomings, Ruml et al. concluded in favor of Dell et al.'s (1997)
they serve to define. Finally, each L-level node is connected to one
neighborhood.
phoneme in each of the three (onset, vowel, coda) phoneme pools.
16 In the simulations of DFA, CFA, and RIA, the fact that there is
In addition to these feedforward connections, all simulations feedback between semantic features and concept units has no consequences
have feedback connections from semantic feature units to concept for evaluating the role of interactivity-discreteness in L-level selection and
units. The feedback from semantic features to concept units en- phonological encoding because the absence of feedback from L-level units
sures that the simulations of the different theories are identical to semantic feature units ensures that phonological effects cannot contrib-
with respect to everything except for the architectural features that ute to semantic-conceptual processing and selection.
are being manipulated. 16 Additionally, for the simulation of RIA, x7 With the exception that, largely for "historical" reasons, Q was set to
there are feedback connections between phoneme-level and .7 at the concept level.
INTERACTIVITY IN SPOKEN WORD PRODUCTION 477

Table 7 Evaluation of the Discrete Feedforward Account


The Number and Characteristics of the Words in the
Prototypical Neighborhood Adopted in the Simulations of the The critical theoretical assumptions of a DFA are (a) processing
Discrete Feedforward Account, the Cascading Feedforward proceeds in a strictly forward direction, (b) processing is confined
Account, the Restricted Interaction Account, to the current processing stage only, and (c) only the item selected
and the High Interaction Account at the end of a given stage is processed at the following one. The
predictions of DFA with regard to the four sets of facts with which
Category No. of words Phonemes shared with the target we are concerned are fairly transparent; nonetheless, we simulate
this position as a check on the basic simulation architecture.
Target 1
Mixed 1 2/3 phonemes
Semantic 2 0/3 phonemes
Formal 11 n = 2, 2/3 phonemes Mixed Error Effect
n = 9, 1/3 phonemes
Unrelated 14 0/3 phonemes DFA does not predict a mixed error effect because it does not
allow for the integration of semantically and phonologically driven
activation required to provide an advantage to a mixed neighbor
over both semantic and formally related neighbors.
After this, there are three subsequent selection points: at the Concept- or L-level noise. If noise at the concept level or the
semantic level, at the L level, and at the phoneme level. These L level results in the lowering of the target's activation levels, then
selection points can be thought of as marking the end of each an error will originate from among the target' s competitors at those
processing stage (conceptual processing, L-level selection, and levels: semantic and mixed neighbors. Given that these two neigh-
phoneme retrieval) and the beginning of the next. In our simula- bor types are identical at the concept and L levels, the long-term
tions, the following are adopted: 10 steps for Stage 1 (conceptual probabilities of producing a mixed neighbor or one of the two
processing), 8 steps for Stage 2 (L-level selection), and 8 steps for semantic neighbors should be equivalent. Because there are twice
Stage 3 (phoneme retrieval). as many semantic neighbors as mixed neighbors, we should expect
At each selection point, the most active unit in a particular layer a 2:1 ratio of semantic to mixed errors from damage at either the
is selected by the introduction of external activation. In the case of concept level or the L level.
a tie, the selection process randomly chooses among all equally Figures 9A and 9B illustrate these consequences. With both
active units. A trial begins with an initial jolt of 1 unit to each of concept- and L-level noise, semantic and mixed error rates in-
the 10 semantic units corresponding to the target, the semantic crease as accuracy decreases, with the rate of mixed errors remain-
selection jolt increases the activation of the semantic feature units ing a constant 50% of semantic errors. Not surprisingly, Z scores
corresponding to the most active concept unit to 10, and the comparing the rates of mixed to semantic errors 18 are not signif-
L-level selection jolt sets the most highly active L-level unit to 4. icantly different from predicted (ranging from - 0 . 6 4 to 1.30 with
Phoneme selection does not involve a jolt; rather, it simply consists concept-level noise and from - 1 . 6 1 to 1.19 with L-level noise).
of noting the identity of the most highly active phoneme unit in Phoneme-level noise. Noise at the phoneme level simulates a
each of the three phoneme pools. situation in which damage is restricted to the phonological re-
trieval stage. With noise at this level, the activation of a target
Simulating Damage phoneme may drop, and another phoneme might be selected in-
stead. A true mixed error effect at the phoneme level is supported
We simulated damage by introducing what Dell et al. (1997)
only when the phonemes of a mixed neighbor have an advantage
referred to as activation noise, which is GN(Aj(t - 1), Nj) in the
over the phonemes of an equivalent formal neighbor (and semantic
activation function above, at either single or multiple levels. Un-
neighbors). Although in the simulation neighborhood there are a
less stated otherwise, each simulation was run for 10,000 trials at
t o t a l of 11 formal neighbors of the target (2 sharing 2 target
each of five values of N.
phonemes and 9 sharing 1 target phoneme), only 1 of them, which
Given that the amount of noise a unit receives is directly
we refer to as formal*, is precisely matched to the mixed neighbor
proportional to its own activation level, the most frequent error
in terms of phonological overlap with the target, with the other
types will involve units whose activation levels are normally very
formal neighbors of the target, and with words unrelated to the
close to the target, the next most frequent error types will involve
target. Under DFA, mixed and formal* errors result from the
those units whose activation levels are the next closest to the
"accidental" misselections of phonemes and should be equivalent;
target, and so forth. We also carded out extensive manipulations of
therefore, we expect the ratio of mixed:formal* errors to be 1:1.
other parameters and found that all of these (with the possible
Even though absolute nonword rates are greater than formal error
exception of P) have similar consequences (see Appendix A for a
rates (as seen in Figure 9C) Z scores comparing rates of matched
more detailed discussion).
mixed and formal* errors are, as predicted, nonsignificant (range:
In evaluating each theoretical position, we discuss response
- 0 . 6 5 to 0.47).
rates subsequent to "damage" at each of three levels: the concept
level, the L level, and the phoneme level. It is essential to keep in
mind that, regardless of the locus of damage, the responses always is Because the output of the system under noisy conditions at the
correspond to the most active phonemes at the end of each simu- concept level is unstable with high levels of noise, we ran 60,000 naming
lated naming trial. trials at the highest concept-level noise values.
478 RAPP AND GOLDRICK

Select
semantic
features.

S e l e c t an
L - L e v e l unit.

Select
phonemes.

Figure8. Schematic of the simulation architecture used in implementing a discrete feedforward account-high
interaction account.

Phoneme plus L-level noise. It is possible, however, that the Only Semantic Errors: K.E. and P.W.
normal spoken word production scenario is better approximated by
noisy conditions at both phonological and L levels. Accordingly, DFA should have no difficulty in accounting for a pattern of
we submitted the DFA simulation to 20,000 trials at each noise only semantic errors originating from either a concept or a post-
value at both phonological and L levels. Determining the predic- semantic (L-level) locus. These patterns are readily accounted for
tions of DFA with regard to mixed error rates subsequent to because at these levels only the target and its semantic neighbors
multilevel noise is fairly complex, and the details of the algorithm are active. Figures 9A and 9B indicate that damage at either level
used are described in Appendix B. When we compared the ob- results in only semantic or mixed errors--matching both K.E.'s
served DFA simulation results with those predicted by a discrete and P.W.'s patterns of only semantic errors. In addition (although
feedforward theory, we found nonsignificant Z scores (ranging not depicted in the figures), damage at multiple levels can easily
from - 0 . 6 3 to 1.55). reproduce the pattern of multiple error types exhibited by C.S.S.

Lexical Bias Differential Effects of Phonology on Semantic Errors


DFA does not predict a mixed error effect even under severe
As discussed earlier, under DFA, there is no mechanism that
damage-noise conditions. This can be seen clearly in Figures 9A
biases the naming system to produce words over nonwords. As a
and 9B, in which the mixed error rate remains the predicted 50%
consequence, we expect rates of word-nonword outcomes to de-
of the semantic error rate (and Z scores are nonsignificant). Thus,
pend entirely on their phonological overlap with the item selected
although this allows DFA to account for K.E.'s pattern, it cannot
at L level. We can document the absence or presence of lexical
reproduce P.W.'s pattern.
bias by comparing the rates of formal neighbors with those of
nonwords that are matched to the formal neighbors in terms of
phonological overlap with target. Although in a system with true Summary of the Evaluation of the Discrete
lexical bias we expect that words will have the advantage over Feedforward Account
matched nonwords, under DFA, we expect them to be comparable.
Figure 9C depicts the overall rate of nonwords that are produced A simulation incorporating the key assumptions of a discrete
as phoneme-level noise is increased. For the lexical bias analysis, feedforward theory of spoken naming did not exhibit either mixed
however, we need to specifically compare rates of matched for- error or lexical bias effects under conditions ranging from low
mal* and nonword* responses. When we do so, we find, as levels of noise to more severe damage. DFA was, however, able to
expected, nonsignificant Z scores (ranging from - 1.40 to 1.37). recreate P.W.'s and K.E.'s patterns of only semantic errors as well
INTERACTIVITYIN SPOKEN WORD PRODUCTION 479

A - 0 - Ooralot
bors (e.g., RAT for the target CAT) to be better competitors than
100%- - 0 - Mixed semantic neighbors (e.g., DOG); they must also be better compet-
Onnfleled
Som~ itors than comparable formal neighbors (e.g., HAT). With cascad-


80%-
Formal
- O - Ner~,~ora
ing activation, phonemes of mixed neighbors receive activation
from both (a) the mixed neighbor nodes themselves at the L level
IG (as a consequence of shared semantics with the target) and Co) the
I~ 4o% I target at the L level (as a consequence of shared phonology with
2O%
the target). This contrasts with the situation of the phonemes of
semantic or formal neighbors. The phonemes of purely semantic
0% neighbors benefit only from the semantically driven activation of
0.10 0.30 0.G0 0.70 0.90
the semantic neighbor nodes at the L level. The phonemes of
NOIse Value
formal neighbors receive their primary input from the target at the
B
100%- L level as a consequence of shared phonology (and they may also
receive input from semantic neighbors of the target with which
80%-
they share phonemes). On this basis, we expect that, with cascad-
ing activation, the phonemes corresponding to a mixed neighbor
"6
should be more active than the phonemes of a matched semantic or
I -0
a formal neighbor.
Figure 10C shows that, although nonwords constitute the prin-
20%-
cipal error type subsequent to phoneme-level damage, the pho-
0%
nemes of the mixed neighbor are indeed selected more often than
'~ "r' PW "w" "w" 'w"
0.10 0.30 0.50 0.70 0.g0 those of either the semantic or formal neighbors. We can statisti-
Noise Value
cally evaluate the specific hypothesis that more mixed errors are
C observed under CFA than would be expected under the assump-
I00% 0
tions of DFA by comparing the observed simulation response rates
8O% for the mixed neighbor and the matched formal neighbor (formal*)
|
with those predicted by a discrete feedforward theory (again see
Appendix B). Analyses of the errors at each noise value yield
15
highly significant Z score values that increase with increasing
noise levels and range from 233.79 to 366.67.
2O% Concept- and L-level noise: Indirect phoneme-level disruption.
We can also consider the consequences of cascading activation on
0% tr, '~" "T "qr
0.10 0.30 0.50 0.70 0.90 noise at either the concept level or the L level. Given that cascad-
Noise Value ing activation exerts its consequences only at the phoneme level,
mixed neighbors should not be favored in selection at either the
Figure 9. Outputat the phoneme level resulting from introduction of concept level or the L level. If, however, the phonological retrieval
activation noise at each level in the discrete feedforward account simula-
process can be indirectly disrupted by noise from higher levels,
tion. A: Concept-levelnoise. The dashed line indicates K.E.'s accuracy
level. B: L-level noise. The dashed line indicatesP.W.'s accuracy level. C: then we might expect the same mixed error effect observed sub-
Phoneme-levelnoise. sequent to direct disruption to the phoneme level. With cascading
activation, we should see indirect phoneme retrieval disruption.
This should occur because the L level continues to provide input to
as C.S.S.'s pattern of multiple error types (see Table 8 for a the phoneme level beyond the point of L-level selection. This
summary). allows L-level instability to create instability throughout phono-
logical retrieval. Whether phonological instability is direct or
Evaluation of the Cascading Feedforward Account indirect, it should favor the phonemes of mixed neighbors.
In the simulation indirect disruption of phoneme retrieval sub-
CFA is similar to DFA in that it also assumes a strictly forward
sequent to concept-level damage is difficult to achieve. This is
flow of information. However, it differs from DFA in that it
indicated by the absence of nonword errors in Figure 10A. How-
incorporates the assumptions that (a) activation is not restricted to
ever, indirect phonological disruption subsequent to L-level dam-
the current processing stage but instead continues forward to
subsequent processing stages and (b) although "winning" items are age is observed (Figure 10B). We see that although semantic and
selected at each stage, processing of all items proceeds to subse- mixed errors predominate, as noise increases, nonword errors
quent stages throughout the entire naming trial. appear and increase. The appearance of nonword errors is an
indication that L-level instability has disrupted phonological re-
trieval (recall that the output we consider is always based on the
Mixed Error Effect
most active phonemes). The mixed error analysis indicates a small,
Phoneme-level noise: Direct phoneme-level disruption. For a yet reliable, mixed error effect at high (P = .7 and .9) noise levels
true mixed error effect to arise, it is not enough for mixed neigh- (with Z scores of 4.44 and 4.60 respectively).
480 RAPP AND GOLDRICK

Table 8
Summary of the Ability of the Various Theoretical Positions to Account for the Four Sets of Facts Under Consideration

Only semantic errors: Only semantic errors: Absent mixed error Mixed error effect:
Normal mixed Normal lexical Semantic locus Postsemantic locus effect: Semantic locus Postsemantic locus
Theory error effect bias effect (K.E.) (P.W.) (K.E.) (P.W., R.G.B.)

DFA x x ~/ ,/ ,/ x
CFA ,/ x ,/ ,/ ,/ x
Only for errors
arising at the
phoneme level
mA 7 ,/ ,/ ,/ ,/
Need to limit
phoneme to
L-level feedback
HIA ,/ ./ ,/ ,./ ~/ ,/
Need to limit L-level to Need to limit Need to limit L-level to
concept-level feedback phoneme to concept-level feedback
L-level feedback
FILSA Not tested but Not tested but Not tested but likely Not tested but
likely to be likely to be to be likely to be
,/ ,/ x x x ,/
Note. DFA = discrete feedforward account; CFA = cascading feedforward account; RIA = restricted interaction account; HIA = high interaction
account; FILSA = further interaction, low seriality account.

In sum, cascading activation allows for mixed error rates beyond L-level deficit. Given that at these levels the only active units are
those expected under the assumptions of DFA subsequent to even those corresponding to the target and its semantic neighbors,
relatively minor disruptions at the phoneme level (e.g., at 99% damage should result in the misselection of a semantic neighbor
accuracy level) and also subsequent to major disruptions at the L (see Figures 10A and 10B). Although, as noted earlier, noise at
level (below 50% accuracy). It is worth noting, however, that higher levels may indirectly affect the phoneme level and give rise
under CFA a significant mixed error effect cannot arise at L-level to multiple error types at the end of the phonological retrieval
selection itself because mixed neighbors are not favored over other stage, the pattern of only semantic errors exhibited by P.W. and
semantic neighbors at the time of L-level selection. 19 K.E. is readily observed over an extensive range of parameter
settings.
Lexical Bias
Differential Effect of Phonology on Semantic Errors
As is the case for DFA, under the assumptions of CFA, there are
no reasons to expect a true lexical bias effect. The analyses K.E.'s pattern of only semantic errors without any evidence of
required to confirm this, however, are more complex than under phonological influence is also readily observed under the assump-
DFA. Under DFA, the activation levels of all nontarget phonemes
(e.g., word initial /r/ or /g/ given the target CAT) should be
comparable. However, this is not necessarily the case with cas-
cading activation. The complication is that in a system with cas-
19 Some researchers have supposed (e.g., Dell et al., 1997; Martin et al.,
cading activation, the activation of nontarget phonemes depends 1996) that mixed errors may arise primarily during L-level selection. This
on the composition of the target word's semantic neighborhood. supposition is based, in large part, on the observation that word errors
Nontarget phonemes shared by a greater number of semantic (semantic, mixed, and formal) typically respect the grammatical category
neighbors of the target (e.g., word i n i t i a l / r / i n the context of a of the target and on the assumption that L-level activation is driven by both
semantic neighborhood composed of CAT, DOG, and RAT) will be grammatical and semantic considerations. However, under these same
more active than nontarget phonemes supported by fewer (or no) assumptions, the preservation of grammatical category in mixed and se-
semantic neighbors (e.g., word i n i t i a l / g / i n such a neighborhood). mantic errors might be expected even under CFA as well. Specifically, if
Given this, it is necessary to consider the phonological inventory we assume that grammatical information is an additional source of input to
the L level, then at the phoneme level, the phonemes corresponding to
of a target word's semantic neighborhood in order to find
mixed or semantic neighbors that share the target's grammatical category
"matched" words and nonwords whose response rates can be will be more highly activated than the phonemes corresponding to mixed
compared. Indeed, when such items were identified, compar- or semantic neighbors that do not. Under this scenario, disruption of
able response rates were observed in simulation testing (see phonological encoding in CFA should yield a significant mixed error effect
Appendix C). favoring mixed neighbors that share the target's grammatical category.
Thus, it seems plausible that preservation of grammatical category in
Only Semantic Errors: K.E. and P.W. lexical errors might be expected under CFA (although not necessarily at the
high rates that have been observed). Nonetheless, it is certainly true that to
CFA (like DFA) should easily account for a pattern of only the extent to which there are good reasons to think that mixed errors arise
semantic errors originating from either a concept-level or an in L-level selection, CFA's credibility is diminished.
INTERACTIVITY IN SPOKEN WORD PRODUCTION 481

-0- Correct
P.W. and R.G.B. (Caramazza & Hillis, 1990). The problem for
A
100% - Mixed CFA, simply put, is that a significant mixed error effect necessarily
Unrelated
Semantic goes hand-in-hand with nonword error production. This is true for
80%- Formal
-0-" Nonword
both direct and indirect phonological disruption. Phoneme-level
noise gives rise to multiple error types (nonwords in particular) as
6(~-
soon as accuracy falls below 95% (Figure 10C). Furthermore,
40*/* - noise values at the L level that are sufficient to cause a significant
mixed error effect downstream also produce nonword errors. We
20%-
illustrate and discuss these issues in more detail in the next section.
0% *- ',r ",r KE -,r ",r
0.10 0.30 0.50 0.70 0.90
Summary of the Evaluation of the Cascading
NoiSe Value
B Feedforward Account
100%
The incorporation of the assumption of cascading activation
8O% allows for the occurrence of true mixed errors. Additionally, CFA
(like DFA) can account for the pattern of only semantic errors
60%
exhibited by K.E. and P.W. CFA does not, however, predict a
40%-
"--(3 significant lexical bias. Most important, the pattern of only seman-
tic errors accompanied by a significant mixed error effect exhib-
20%- ited by P.W. (and R.G.B.) represents a seemingly insurmountable
obstacle for CFA (Table 8). 20
0% q" ",r" P W "~" w" "v"
0.10 0.30 0.50 0.70 0.90
Noise Value Further Reductions in Discreteness Under the
C
100% - Cascading Feedforward Account: Jolt Size,
Selection Strength, and Seriality
80%
W e have moved along the discreteness-interactivity continuum
60% from DFA to CFA by incorporating the assumption of cascading
activation. However, selection strength (jolt size) also contributes
40"1,
to the degree of discreteness-interactivity of a system. Recall that
20"1. at the end of each processing stage the activation of the winning
node is elevated above that of its competitors. 21 As a result, the
0% influence exerted by competitors on all subsequent processing is
0.10 0.30 0.50 0.70 0.90
Noise Value diminished. It is in this way that selection points reduce the extent
and time course over which full and free interaction is allowed.
Figure 10. Output at the phoneme level resulting from introduction of To gain a fuller understanding of CFA, we examined the effects
activation noise at each level in the cascading feedforward account simu- of manipulating selection strength under conditions of L-level
lation. A: Concept-level noise. The dashed line indicates K.E.'s accuracy noise (comparable to P.W.'s hypothesized damage locus). The
level. B: L-level noise. The dashed line indicates P.W.'s accuracy level. C: relationships between selection strength and nonword error rates
Phoneme-level noise.
and mixed error effects are depicted in Figure 1 I. The results
reveal that indirect phonological disruption subsequent to L-level
dons of CFA. As Figure 10A indicates, damage to the concept noise (as indexed by nonword errors) depends on the strength of
level (at K.E.'s 55% accuracy level) yields a pattern of only the selection process ("jolt" size) at the L level: the lower the jolt,
semantic errors. Z scores evaluating a mixed error effect at all the more likely the phonological disruption. Figure 11 also illus-
levels of noise at the concept level were nonsignificant (range = trates the point made above that under CFA a significant mixed
- 0 . 9 6 to 1.12). error effect is necessarily accompanied by nonword errors. Thus,
CFA can also account for C.S.S.'s pattern (a significant mixed P.W.'s pattern of no nonword errors is observed only with jolts greater
error effect in the context of the production of multiple error
types). CFA does not predict C.S.S.'s high rate of semantic and
mixed errors (6%) with phoneme-level damage alone; an addi- 20 Given the importance of the absence of phonologically based errors in
P.W.'s performance, we reviewed all of the spoken picture-naming data for
tional L-level damage locus is required to yield these rates. When
P.W. (reported in Rapp et al., 1997). This consisted of a total of 2,189 trials
we simulate multiple damage loci, C.S.S.'s pattern is readily of spoken picture naming. Across all of these trials, P.W. produced no
matched: simulation accuracy = 85% (C.S.S. = 83%), simulation formal errors and only two nonword responses: "rhinoceros" was named as
semantic + mixed = 8% (C.S.S. = 6%), simulation formal er- /r ai n a s e s e s/and "accordion" was named as/a k o r d i n/.
rors = 1% (C.S.S. = 3%), and simulation nonwords = 6% (C.S.S. 21 Although in the particular algorithm we have adopted this "enchance-
= 6%). ment" of the winner is implemented through an external jolt, there are other
Crucially, however, CFA cannot account for the pattern of only means of accomplishing the same end. For example, a competitive output
semantic errors with a significant mixed error effect exhibited by mechanism should yield comparable results.
482 RAPP AND GOLDRICK

A "-0- Jolt =3 The primary consequences of allowing phonological activation to


10%
Jolt =4 influence L-level processing are to (a) create an activation advan-
PW Jolt ffi 5
Jolt = 10 tage for mixed over semantic neighbors at the L level and (b)
8% provide L-level support for the phonemes of formal neighbors.
i
Mixed Error Effect
In addition to the mixed error effect at the phoneme level
"E observed under CFA, the feedback in RIA allows for a mixed error
~P effect in the course of L-level selection. This is because the
2% feedback allows the advantage of the phonemes for mixed neigh-
bors at the phoneme level (seen in CFA) to be "carded back" to the
L level.
O%
80% 70% 60% 50%
It is not surprising, therefore, that when noise is introduced at
Percent Correct either the phonological level or the L level, mixed error rates are
greater than 50% of the semantic error rates (see Figures 12B and
B
6.00-

5.00--

4.00
Pw / A --0- Cornmt
100% Mixed
-~- Unrelated
3,00
Semantic
80% Formal
W 2.00 -~ . . . . . . p < .05 -~-- Nonworcl

~ 1,011 60%

E
~ 0.~ I
N
-1.00
2~,.-
-2.00
I IP/o
-3.00
I
I I I I I
80% 70% 60% 50% Noise Value
Percent Correct B
100%

Figure 11. Indirect phoneme-level disruption. Effect of L-level jolt size


on phoneme-level output under conditions of L-level noise. The vertical 8O%

dashed line indicates P.W.'s accuracy level. A: Percentage of nonword i


responses at 50%-80% accuracy in cascading feedforward account simu-
lations with varying jolt sizes. B: Z scores testing for a mixed error effect '15

in the same simulations. The horizontal dashed line indicates the Z score I
value for two-tailed probability less than .05. 2O%

than five. The problem for CFA is that (at P.W.'s accuracy level) 0%
',P ",r PW ",r -,r" T
0.10 0.30 0.50 0.70 0.90
these large jolt sizes do not produce a significant mixed error effect. Noise Value
If the selection process is "strong" (enhancing the difference C
100%
between winner and losers), then the primary consequence of
cascading activation--the introduction of multiple semantic com-


80%
petitors on the phonological stage---is diminished or overridden. In
other words, as selection strength increases, interactivity de-
creases. One can easily imagine, in fact, that if selection strength "6
at the L level were strong enough, a system with cascading
activation could be effectively transformed into DFA. The simple 20%
lesson is that to increase interactivity it is not sufficient to simply
incorporate cascading activation (or any other mechanism); other 0%
0.10 0.30 0.50 0~70 0.90
aspects of processing (in this case, selection strength) contribute to
Noise Value
the "functionality" of the interactive mechanism.
Figure 12. Output at the phoneme level resulting from introduction of
Evaluation of the Restricted Interaction Account activation noise at each level in the restricted interaction account simula-
tion. A: Concept-level noise. The dashed line indicates K.E.'s accuracy
Under RIA, discreteness is further reduced with "lower level" level. B: L-level noise. The dashed line indicates P.W.'s accuracy level. C:
feedback connections from the phoneme level back to the L level. Phoneme-level noise.
INTERACTIVITY IN SPOKEN WORD PRODUCTION 483

12C). If we analyze the phoneme-level output, Z scores are sig- ranging from - 0 . 8 9 to 0.54 (Figure 12A). These results extend
nificant at every value of noise at the phoneme level or the L level down to accuracy levels well below K.E.'s.
(range: 8.50 to 27.16). RIA can also account for C.S.S.'s multiple error pattern plus
Additionally (and in contrast to CFA), if we consider the output significant mixed error effect either with multiple damage loci (as
of L-level selection itself (not depicted in Figure 12), we also fred under CFA) or by assuming phoneme-level damage alone. For
significant mixed error effects at all noise values (Z scores ranging example, phoneme-level damage that results in 85% accuracy for
from 4.20 to 9.64). the simulation (C.S.S.: 82%) also results in 5% semantic plus
mixed errors (C.S.S.: 6%), 2% formal errors (C.S.S.: 3%), and 8%
nonwords (C.S.S.: 7%).
Lexical Bias With respect to P.W.'s pattern, as indicated above, disruption to
The feedback connections in RIA should provide a mechanism the L level can produce a pattern of only semantic errors at
by which phonemes of formal neighbors may be preferred over accuracy levels even below P.W.'s. Importantly, at accuracy levels
phonemes of matched nonword outcomes. As activation passes at which only semantic errors are observed, there are also highly
from the target to its phonemes, the feedback connections send significant mixed error effects. Thus, in Figure 12B, it is evident
activation from these phonemes back to the L-level units that share that with L-level damage (at P.W.'s accuracy level) not only is
phonemes with the target, including formal neighbors. The L-level there a pattern of only semantic errors but also the rate of mixed
nodes of these formal neighbors, in turn, activate all their constit- errors is clearly greater than 50% of the semantic errors. This result
uent phonemes, including those that are not shared with the target. is the most important difference between CFA and RIA: Whereas
These will then reactivate their corresponding L-level nodes. CFA cannot account for P.W.'s full pattern, RIA can.
We carded out simulations with a smaller network that al-
lowed us to compare appropriately matched word and nonword Summary of the Evaluation of the Restricted
responses. 22 The results revealed significant word bias effects (Z Interaction Account
score = 2.82), providing confn-mation of this account of lexical
This section has revealed that the combined assumptions of
bias under RIA.
cascading activation and restricted interactivity can readily ac-
count for all four sets of facts under consideration. Critically, and
Only Semantic Errors: K.E. and P.W. in contrast to CFA, the introduction of limited feedback provides
a mechanism that (a) predicts a true lexical bias effect and (b)
Concept-level damage should have comparable consequences accounts for P.W.'s pattern of only semantic errors plus mixed
under RIA as it does under CFA and DFA because, at this level, error effect arising from postsemantic damage (Table 8).
disruption can easily result in the misselection of one of the There are, however, limits to the degree of interactivity that can
target's semantically related neighbors. The misselected item can be tolerated within RIA. Under RIA, the effects of further increas-
then pass activation downward in a relatively undisturbed manner ing interactivity were explored by manipulating selection and
such that what was misselected at the concept level is encoded feedback strength. We found that as selection strength at the L
correctly at the phoneme level. Figure 12A depicts this outcome of level was decreased and feedback from the phonological to the L
concept-level damage. level was increased, P.W.'s pattern of only semantic errors became
The patterns arising from L-level damage, however, may be increasingly difficulty to match. This is illustrated in Figure 13, A
expected to be somewhat more complex. The feedback from the
phonological level to the L level allows formal neighbors to
become competitors at the L level itself, both in the course of
L-level selection and afterward. This creates an opportunity for
formal neighbors to be selected at the L level. This leads us to 22 We were unable to check our thinking on this matter with the full
expect that, at least under certain circumstances, L-level damage network because it was (accidentally) constituted in a way that did not
may lead to both formal and nonword errors. allow for identical L-level input to any word-nonword pair. Instead, we
Given this possibility, it is first important to establish if there are implemented the "BabyFeedbackNet." This simulation consisted of two
conditions under which P.W.'s pattern will be observed under the layers, corresponding to the L level and the phoneme level in our full
simulation. The neighborhood consisted of the target (CAT), two semantic
general assumptions of RIA. Figure 12B indicates that the pattern
neighbors (RAT and BAT), and two formal neighbors (BADand RAP). Each
of only semantic errors ( < 1 % nonsemantic errors) is observed trial consisted of a four-unit activation jolt to the L-level target, CAT.
with L-level damage at accuracy levels well below P.W.'s. Only as Semantic neighbors of the target received one tenth the activation of the
accuracy falls further do nonword and formal errors begin to target (to simulate the effects of prior semantic processing). Activation was
emerge. Thus, under RIA, P.W.'s pattern of only semantic errors then allowed to proceed for eight time steps, at which point the most active
resulting from postsemantic damage can be readily accounted for phonemes in each position were selected. Activation noise at the phoneme
with L-level damage. layer was set to .9, and 100,000 iterations were run. Feedforward connec-
tions were set to P = .04 and feedback connections were to set to P = .01.
Under these conditions a significant word bias (BAD matched nonword
Differential Effect of Phonology on Semantic Errors RAD) was observed (Z = 2.82, p < .05). The strength of such a word bias
effect is clearly dependent on the strength of the feedback connections, and
Under RIA, concept-level damage easily results in a pattern of when we varied feedback strength from P = .0001 to P = 1.0, we found
only semantic errors and no mixed error effect, with Z scores that Z scores increased from 0.34 to 7.85.
484 RAPP AND GOLDRICK

A Not unexpectedly, feedback strength also plays an important

I
-0- Jolt = 3
10% Jolt = 5 role in the rate of formal errors that occur at L-level selection itself.
Jolt = 10
PYV Whereas under CFA, formal errors occur only in the course of
8*/*
phonological retrieval (either as a result of direct or indirect
disruption), under RIA, phonologically related lexieal neighbors
are competitive at the L level and may be selected. As indicated in
== 6%
Figure 14 (with L-level noise), as feedback strength increases from
P = .01 to P = .1, formal error rates dramatically increase at
4% L-level selection. This is another way in which increases in feed-
E
back strength make it increasingly difficult to match P.W.'s
2%
pattern.
In sum, the restricted feedback assumption in RIA allows for a
pattern of only semantic errors in both concept processing and
0%
80% 70% 60% 50%
L-level selection at the accuracy levels exhibited by P.W. and
Percent Correct K.E., while at the same time matching the differential effects of
phonology on their performance. It is also true, however, that
B generating a significant mixed error effect at the L level while
maintaining a pattern of only semantic errors (P.W.'s pattern)
10%
requires a careful balancing of selection strength and feedback.
PYV
Evaluation of the High Interaction Account
8%

Given RIA's ability to account for the four sets of facts under
consideration, we could assume that an even higher degree of
"2 interactivity in the naming system is not required to account for
this set of facts. Nonetheless, it is important to establish whether
more interactive accounts can also account for these facts and, if
#. not, to understand why.
2% HIA adds "upper level" feedback between the L level and the
concept level. This has a number of general consequences; here,
0%
we highlight two of them. First, this feedback provides an oppor-
80% 70% 60% 50% tunity for conditions downstream--at both the phonological and L
percent Correct levels--to contribute to upstream concept-level processing. Spe-
cifically, we might expect that at the concept level itself mixed
Figure 13. The relationship between L-level jolt size and feedback
strength (strong and weak) under conditions of L-level noise (restricted
interaction account [RIA]). The dashed line indicates P.W.'s accuracy
level. A: Percentage of nonword responses at 50%-80% accuracy in RIA 8.0% feedback =. 1
simulations with varying jolt sizes and weak feedback (phoneme- to
L-level feedback = .01). B: Percentage of nonword responses at 50%- 80% 7.0%
accuracy in RIA simulations with varying jolt sizes and stronger feedback
(phoneme- to L-level feedback = .1). (9
-~ 6.0%
CO
"~ 5.0%
and B. This figure compares the effects of lower (.01) and higher 0
LL 4.0%
(. 1) feedback values on nonword error rates. First, it is important
to note that all jolt size-feedback combinations produced a mixed >
.--I 3.0%
error effect at P.W.'s 70% accuracy level. In addition, as Figure 13
.3
indicates, there was a range of selection strength-feedback values
2.0%
that also produced nonword rates of 0% to 0.5% at P.W.'s accu-
racy level. However, these numbers also reveal that as selection 2 j/J feedback = .04
n 1.0%
strength decreased (from jolt size = 10 to 3) and feedback in-
creased (from P = .01 to .1), unacceptably high nonword error .~I-~.. ~feedbackl., , .01
0.0%
rates were observed at P.W.'s accuracy. This is due to the fact that 80% 70% 60% 50%
decreases in selection strength serve to increase indirect phono-
Percent Correct
logical disruption and increases in feedback serve to amplify
support for formal neighbors at the L level. Both of these, in turn, Figure 14. Formal errors occurring at L-level selection in restricted
translate into support for nontarget phonemes and an increase in interaction account simulations with different phoneme- to L-level feed-
nonword responses. back values.
INTERACTIVITY IN SPOKEN WORD PRODUCTION 485

neighbors of the target should develop an advantage over semantic A - ' O - Correct
neighbors and, additionally, that formal neighbors should develop 10o~- Mlxod
-'~ Unrelated
an advantage over unrelated words. Second, with this feedback, the Semantic
80%- Formal*
concept level receives downstream input throughout each naming --0- Nonword
trial. As a consequence, the concept level maintains a higher level ~ 60%-
of activation throughout the naming trial than under any of the
other theories we have examined. The primary consequences are ~ ,ms- t
both to keep activation levels higher throughout the entire network I
20*/.
and to maintain a stronger and more prolonged semantically driven
pressure on the downstream processes. Because HIA supports 0% •w -r ',r KE'W "w
many of the same predictions as RIA, we review these only very 0.10 0.30 0.50 0.70 0.90
Noise Value
briefly, focusing instead on the differences between the two accounts.
B
100% -

Mixed Error Effect 80% -

I
It is not surprising that noise at the phoneme or L level results ~ s0~.-
in statistically significant mixed error effects at all levels of
accuracy, with Z scores ranging from 10.37 to 85.90 (Figure 15, i 40%-

B and C).
20%-

I
Lexical Bias Effect 0%
0.10 0.30 0.50 0.70 o.go
Noise Value
For the same reasons discussed above for RIA, a true lexical
C
bias effect is expected and observed under HIA. 10o~

80%
Only Semantic Errors
0O%
HIA can be damaged at the concept and L levels in ways that
match the pattern of only semantic errors exhibited by both P.W. i 40%

and K.E. This can be seen in Figure 15A and B. We discuss


limitations on this below, a0%
0%

Differential Effect o f Phonology on Semantic Errors 0.10 0.30 0.50 0.70 0.0Q
Noise Value

Both P.W.'s and K.E.'s patterns can be matched under HIA.


However, as we discuss in some detail below, matching either of Figure 15. Output at the phoneme level resulting from introduction of
activation noise at each level in the high interaction account simulation. A:
the patterns occurs under a very limited range of parameters. This
Concept-level noise. The dashed line indicates K.E.'s accuracy level. B:
is especially true for K.E.
L-level noise. The dashed line indicates P.W.'s accuracy level. C:
Phoneme-level noise.
Summary of the Evaluation of the High Interaction Account
HIA can account for the full set of facts under consideration. It is
important to note, however, that it can do so under condi- For example, Figure 16, A and B, indicate that I-I/A requires a
tions that highly restrict interactivity. We now consider those jolt size of 10 or larger to prevent nonword errors, whereas this
conditions. was achieved under RIA (cf. Figure 13, A and B) with much
smaller jolt sizes. Thus, the limitations of RIA that are related to
The Limits on Interactivity in the High Interaction "lower level" feedback strength are magnified in HIA.
Also regarding P.W.'s pattern, we found that upper level feed-
Account: Selection Strength and Feedback
back allows damage at the L level to indirectly disrupt concept-level
P.W.'s pattern of exclusively semantic errors is matched only if processing. Figure 17 reveals that as upper level feedback strength
interactivity is limited. In our evaluation of RIA, we observed that increases from .0001 to 1.0, L-level damage produces increasingly
to match a pattern of only semantic errors arising from postseman- greater rates of errors (semantic and mixed) at the point of
tic (L-level) damage, it was crucial to balance phoneme- to L-level concept-level selection (Figure 17). Such errors are kept to mini-
feedback and L-level selection strength. Otherwise, formal errors mal levels only with feedback values below .01. These findings
arose in L-level selection, and nonword errors were produced on suggest that individuals (e.g., P.W.) with presumed postsemantic
output. Because of HIA's greater potential for interaction, restrict- damage should show conceptual confusion and, presumably, make
ing interactivity becomes an even more significant concern than it errors in comprehension tasks (something that P.W. does not do).
is under RIA. Most problematic for I l i A is the fact that K.E.'s pattern is
486 RAPP AND GOLDRICK

matched under only a very restricted set Of conditions. As indi- 100% A •


w
cated in Figure 18, as upper level feedback strength increases, a
mixed error effect is observed on output. The absence of a mixed
error effect is seen only at parameter settings that dramatically
limit feedback levels (e.g., upper level feedback p = .004; lower 80%

level feedback p = .04). In other words, only when the upper level
feedback signal is reduced to 4/1,000 of the downward signal are _J

significant mixed error effects eliminated. 60%


K
Although the full connectivity assumed under HIA can account
t-
for P.W.' s and K.E.'s pattern of only semantic errors, it is impor- O
o
tant to note that this is achieved by severely limiting the actual
interactivity in the system. This again illustrates the point that the 40%
mere presence of some m e c h a n i s m - - i n this case, connectivity--
does not ensure its functionality. Thus, although HIA can account
for the same set of facts as RIA, it does so only to the extent to
20%
which it approximates R I A ' s assumption of null upper level inter-

-f
Jolt = 3 0% I I I I I
A - c - Jo~= s
10% ~ Jolt = 10 0,0001 0.0010 0.0100 0.1000 1.0000

8% L-level to Semantics Feedback Level

"2

c
6%-

4%-
// Figure 17. Accuracy at the concept level following indirect concept-level
disruption due to L-level noise in the high interaction account.

activity (Table 8). In other words, H I A ' s upper level feedback is at


best irrelevant and at worst a liability (see Harley & MacAndrew,
2%- 1995, for a similar point).

0%
Evaluation of the Further Interaction,
80% 70% 60% 50% Low Seriality Account
Percent Correct

The central tenet of FILSA is that the mapping between input


B and output domains is a highly interactive, continuous process with
low discreteness and seriality. We have already demonstrated that
10%- as interactivity increases and seriality and discreteness decrease, it
PW becomes increasingly more difficult to match the patterns of data
8%- under consideration. FILSA corresponds to a theoretical position
that is more interactive than any examined thus far, and on this
i== 6%-
basis we can anticipate that it will encounter important difficulties.
"2
FILSA's interactivity assumptions are stronger than those of HIA
in two ways: (a) interaction from visual input to phonological
4%- output and (b) extremely limited seriality.

2%- Neighborhood and Architecture in the Further


Interaction, Low Seriality Account
0% 1 I I I
80% 7~ 60% ~%
We used much of the architecture developed by Plaut and Shallice
Pe~ntco~ (1993a). 23 We used their set of 40 words in five different semantic

Figure 16. The relationship between L-level jolt size and feedback
strength (strong and weak) under conditions of L-level noise (high inter- 23 Extremely low seriality was something that we found difficult to
action account [HIA]). The dashed line indicates P.W.'s accuracy level. A: accomplish in our implementation of HIA, presumably because of certain
Percentage of nonword responses at 50%-80% accuracy in HIA simula- characteristics of the simulation architecture. We found that extreme re-
tions with varying jolt sizes and weak feedback (phoneme- to L-level ductions of selection strength (jolt size) in HIA resulted in poor perfor-
feedback = .01). B: Percentage of nonword responses at 50%- 80% accu- mance even under minimal noise conditions. In contrast, our implementa-
racy in IliA simulations with varying jolt sizes and stronger feedback tion of FILSA corresponds to a low seriality system that is more resilient
(phoneme- to L-level feedback = . 1). under damage conditions.
INTERACTIVITYIN SPOKEN WORD PRODUCTION 487

L Processing in the network begins with fixing ("clamping") the


8.00 visual input unit states to the input values corresponding to some
word in the simulation lexicon. All other units have an initial
"~ 7.00 activation of 0. The initial temperature is set to 50, from which it
i1)
:I= exponentially decays toward 1. At each time step, the temperature
uJ 6.00
is lowered, and each noninput unit in the network synchronously
updates according to the activation function outlined above. This
UJ 5.00
process is allowed to continue until no unit state changes more
x 4.00 than 0.01.

3.00 Learning in the Further Interaction,


O
o Low Seriality Account
09 2.00 . . . . . p < .05
N The training process used was contrastive Hebbian learning and
1.00 is described in Appendix D (see also Plant & Shallice, 1993a).
~ k = .004
0.00 I I I -1- I

70% 60% 50% 40% Seriality and Emergent Seriality

Percent Correct As indicted, FILSA was designed to explore a theoretical position


that is more interactive than HIA with regard to seriality and extent of
Figure 18. Z scores testing for a mixed error effect on errors resulting interaction domains. The expansion of the interaction domains was
from concept-levelnoise under different values of L level to semantics implemented with additional representational levels and connectivity.
feedback in the high interactionaccount. However, establishing whether or not the simulation actually imple-
mented extremely low seriality was less straightforward.
Smolensky (1986) examined a simulation that was based on the
categories as well as their semantic and phonological representations. same class of learning and processing principles we have adopted
For the input representations, we adopted the visual representational and that lacked externaUy imposed seriality. He found, however,
scheme described in Plaut and Shallice (1993b).24 that seriality emerged in the course of learning. Specifically, in the
Each of the visible layers (visual, semantic, phonological) is simple four-unit system he examined, the units did not reach their
completely internally connected. In addition to these visible layers, final settling state simultaneously. Instead, two units settled at
two hidden layers of 40 units each were used. The first hidden much the same time, followed by the third and then the fourth. He
layer is completely connected to both visual input and semantics, concluded that "out of the statistical din of parallel microdecisions
and the second is completely connected to both semantic repre- emerges a sequence of macrodecisions"(Smolenksy, 1986, p. 246).
sentations and phonological output. All weights are bidirectional: Given this possibility, it was important to determine if FILSA,
Activation can pass both forward and backward along any con- despite the absence of explicit selection points, could have
nection, and the weight has the same value in both directions (see (through learning) developed a serial processing structure consist-
Figure 6). ing of a largely semantically driven stage followed by a phono-
logically driven stage.
To look at this possibility, we tracked the processing (in the
Processing in the Further Interaction,
undamaged system) o f each unit by examining changes in
Low Seriality Account
distance from final activation state. We did this by computing
We used the deterministic Boltzmann machine/mean field the- (at each processing step) a mean sum squared error based on the
ory framework adopted in Plaut and Shallice (1993a). The state of difference between the units' current activation and their acti-
a particular unit in the network ranges from - 1 to 1 and is vations when the network had settled to its final stable state. We
determined by the following function: then computed the mean of this distance for all units during a
full set of naming trials.
Aj(t) = Aj(t - 1)(1 - Q) + Q t a n h [ l / T ~ wi:A,(t - 1)]. Figure 19A presents hypothetical results that would be expected
The activity (A) of any unit j at time (t) is determined by its in a highly serial system where a "decision" regarding the seman-
activation at the previous time step (t - 1) and the activation of tics of the target is made before much progress has been made
each unit i that connects to unit j (Ai[t - 1]), modulated by the regarding its phonological encoding. Figure 19B depicts the actual
weight on the connection between i andj (wij). Q determines how pattern obtained in the simulation. It shows that the observed
much each factor is allowed to contribute to the new activation. In pattern is clearly quite different from the idealized serial pattern
our simulation, Q was globally equal to .4, meaning that 40% of
the unit's activation came from other units and 60% from its own 24Forty-four visual features were used as an input representation. The
activation at the previous time step. "tanh" is the hyperbolic first 25 of these featuresencodeinformationaboutthe three principalcompo-
tangent function (a sigmoid function), and T is the temperature nentsthat compriseeach object,as well as informationabouttheir relativesize
parameter. When T is high, less input is supplied to the tanh and placement. The remaining 19 features describe each object's general
function, reducing the range and sharpness of the sigmoid. characteristics (e.g., texture, color, and absolutesize of the entire object).
488 RAPP AND GOLDRICK

A
80

70

60

50

!-~
i:5 3O
= = = = = = = = = = = = =~= = = = = = = = = =
I---s'me°ti°",°r I
~ Phonologcal Layerj

~ 2o
:i
'i 0 7 14 21 28 35 42 49 56 63 70 77 84 91 98 1 0 5 1 1 2 1 1 9 1 2 6 1 3 3 1 4 0 1 4 7
Iteration

B
so T

6O

~1
"i
~ 5o

~4o+
\~
I .
~Semantic~yer]
I~".o.o'oo'o'La,e,l

~20

0
0 7 14 21 2B 35 42 49 56 63 70 77 84 91 98 1 0 5 1 1 2 1 1 9 1 2 6 1 3 3 1 4 0 1 4 7
Iteration

Figure 19. Lack of "emergent seriality" in the further interaction, low seriality account (FILSA). A: Mean
distance of the semantic and phonological layer activation vectors from their final states in an idealized discrete
system. Distance is averaged over all words at each iteration. B: Observed mean distance from final state for the
two layers in the FILSA simulation.

and thus that there is little evidence of emergent seriality. If Only Semantic Errors From Semantic or
anything, phonological units drop toward their final state slightly Postsemantic Damage ?
sooner than do the semantic units. Only at the very end of the
naming process do the semantic units settle slightly before the We can fairly safely assume that with its features of cascad-
phonological units. These results provide good evidence that the ing activation and feedback FILSA will exhibit mixed error and
simulation of FILSA we developed does, in fact, correspond to a lexical bias effects. For this reason, and because establishing
theoretical position that assumes very low seriality. chance rates of mixed errors under F I L S A is complex, we focus
INTERACTIVITY IN SPOKEN WORD PRODUCTION 489

our evaluation of FILSA on whether or not it allows for the A perfect match to P.W.'s pattern would similarly require a
pattern of only semantic errors exhibited by P.W. and K.E. pattern of only semantic errors. However, as Figure 20B de-
To examine this question, we simulated semantic and postse- picts, nonsemantic errors were observed at all accuracy levels
mantic damage by "lesioning" a proportion of the weights by subsequent to postsemantic damage. In particular, at P.W.'s
setting them to 0. Semantic-level damage was simulated by lesion- 70% accuracy level, 20% of the errors were semantic errors (of
ing the weights among semantic units; postsemantic damage was these, 4% were purely semantic, 1% were semantic-
simulated by lesioning the weights between the semantic and phonological, 9% were semantic-visual, and 6% were visual-
hidden units. A random set of weights was selected and zeroed, phonological-semantic), whereas 5% were nonsemantic. These
and each damaged network was then presented with the entire set consisted of similar rates of visual-phonological errors and
of 40 words. For each damage level, 20 repetitions of damage were visual errors (2.1% and 1.8% respectively), followed by pho-
performed to obtain an average damage effect. nological errors ( < 1%). Five percent of the responses could not
The network's output for a given trial corresponded to the most be classified as clear exemplars of any error type.
highly active unit in the onset, vowel, and coda pools. Addition-
ally, for a response to be well formed, one unit in each pool had to Summary of the Evaluation of the Further Interaction,
be active above a certain threshold level, and all other units had to Low Seriality Account
be relatively inactive. In this regard, we followed the procedure
used by Plaut and Shallice (1993a) and adopted their criterion level FILSA is clearly unable to account for patterns of only
(see Plaut & Shallice, 1993a, for formal specifications). semantic errors arising from either semantic or postsemantic
The following categorization of target-error types was adopted: loci of damage. Even adopting a classification scheme that
semantic, phonological, visual, visual-semantic, visual-phono- favors semantic over nonsemantic errors, we found significant
logical, or visual-semantic-phonological. Because target-error numbers of nonsemantic errors (visual, visual-phonological, or
pairs vary continuously in their similarity along visual-semantic- phonological; see Table 8).
phonological dimensions, we developed a method for discrete The difficulties of F1LSA are unsurprising given that we have
error categorization, which is described in Appendix E. The fol- already shown that incrementing interactivity even within the
lowing are examples of errors under this categorization scheme: architectural constraints of HIA or RIA leads to increasing diffi-
purely semantic = BONE-PORE, phonological = BOG-DOG, culties in matching the observed patterns. One might, nonethe-
visual = NUT-ROCK, semantic-visual = DOG-CAT, semantic- less, wonder if the FILSA simulation failures are attributable to
phonological = LEG-LIP, visual-phonological = GUT-NUT, and some aspect of the way we implemented the central assump-
visual-phonological-semantic = RAT-CAT. tions of the FILSA theory, and whether perhaps another imple-
mentation of a position with the same assumptions would have
worked. Although for the reasons stated this seems unlikely, it
Resul~ cannot be ruled out.

All outputs were well-formed patterns that could be classified as


words in the simulation lexicon. K.E.'s pattern would be best
General D i s c u s s i o n
matched if semantic damage produced only semantic errors. One
complication is the possibility that errors categorized as semantic- With the help of computer simulations, we have evaluated five
visual, semantic-phonological, or semantic-visual-phonological theories of spoken naming with respect to their ability to account
might be "true" semantic errors. To be as generous as possible with for four sets of facts: (a) a mixed error effect in the speech errors
the hypothesis that only semantic errors were observed following of normal unimpaired individuals; (b) a lexical bias effect in the
semantic-level damage, we considered all of these error types as speech errors of unimpaired individuals; (c) the pattern of only
semantic errors. Nonsemantic errors were members of the visual, semantic errors resulting from either a semantic locus or a
phonological, and visual-phonological categories. postsemantic locus of damage; and (d) the differential phono-
Figure 20A depicts the percentage of semantic and nonse- logical effects on the semantic errors, specifically the absence
mantic errors that resulted from damage to semantic units. It is of phonological effects on semantic errors arising from a se-
evident that at every accuracy level, semantic damage resulted mantic locus of damage and the presence of phonological
in a considerable percentage of errors that were clearly nonse- effects on semantic errors arising from postsemantic loci of
mantic. For example, at 50% correct (near K.E.'s 55% accuracy damage. The five theories vary along the discreteness-
level), 30% of the responses were considered to be semantic interactivity dimension, and we have specifically considered the
errors (of these, 3% were purely semantic, 2% were semantic- roles played by cascading activation, feedback, seriality, and
phonological, 15% were semantic-visual, and 10% were extent of interaction domains in accounting for the set of critical
visual-semantic-phonological), but 11% of responses were observations.
nonsemantic errors. It is interesting to note that, of the nonse- The principal results of this investigation can be summarized
mantic errors, visual errors (7%) were the most common, fol- rather simply. To satisfy the constraints provided by these four sets
lowed by visual-phonological (4%) and then phonological of facts, we must assume that, at a minimum, the naming system
(<1%). Nine percent of the responses at this accuracy level incorporates the following: (a) a mechanism--such as cascading
were not clear exemplars of any error category and could be activation--that allows the phonemes corresponding to the seman-
considered to be unrelated responses. tic neighbors of a target word to be active during phonological
490 RAPP AND GOLDRICK

A FILSA Semantic Level Damage


--O- Semantic
35% -
Nonsemantic

30% -

(D
u~ 25% -
t-
O
Q.
(D 20% -
IZ

15% -
e-

10% -

5%-

0% I I I I I
90% 80% 70% 60% 50%
Percent Correct

B FILSA Post Semantic Damage


35% -

30% -

¢0
25% -
e-

20% -
rr

"6
15% -
~E

Q. 10% -

5%-

0% I I I I I
90% 80% 70% 60% 50%
Percent Correct

Figure 20. Results on output after removing a random set of connections in the further interaction, low seriality
account (FILSA) simulation. "Semantic" includes phonological-semantic, pure semantic, visual-semantic, and
visual-phonological-semantic errors. "Nonsemantic" includes visual-phonological, pure visual, and pure pho-
nological errors. Results are reported as a percentage of 800 trials (20 replications of each damage level, with 40
words presented in each replication). A: Results at different accuracy levels after removing intersemantic
connections. B: Results at different accuracy levels after removing connections between the semantic and the
hidden layer preceding the phoneme layer.

retrieval and (b) a mechanism--such as functional feedback con- As previously stated, we had two objectives with this re-
nections from a phonological layer to a preceding lexical-level search: (a) to determine which of the specific theoretical posi-
layer--that provides a means by which lexical outcomes will be tions examined best accounts for the data and (b) to draw more
preferred over nonlexical ones and phonological information can general conclusions regarding discreteness and interactivity in
influence semantically driven, lexical-level processing. A mecha- spoken word production. With regard to the first objective, we
nism such as cascading activation allows us to understand the found that RIA most readily accounts for the relevant facts.
mixed error effects that occur in normal and impaired perfor- Additionally, HIA is satisfactory to the extent that upper level
mance. A mechanism such as phonological/L-level feedback ac- L-level/semantic feedback levels approximate R I A ' s null feed-
counts for lexical bias effects and mixed error effects occurring back condition. That is, there are conditions under H I A - -
prior to phonological encoding in either impaired or normal namely, strong seriality combined with extremely low levels of
performance. L-level/semantic feedback--that do not create an obstacle for
INTER.ACTIVITYIN SPOKEN WORD PRODUCTION 491

matching K.E.'s error pattern. However, unless there are inde- sume that units at this intermediate level correspond to amodal
pendent grounds motivating this upper level feedback, it would (lemmas) or modality-specific (lexemes) lexical representations.
seem unnecessary at best. One might well wonder if these differences have any bearing on
In additionto providing support for a restricted interactivity the results that we have reported.
account of spoken word production, this research more generally The answer is that they generally do not. First, to account for the
provides insight into the architectural and processing features that four sets of facts under consideration, no theory can do without a
are minimally required to account for the relevant facts. In this mechanism such as cascading activation that allows the phonemes
regard, the important generalization is that although interaction is corresponding to the semantic neighbors of a target word to be
necessary, it is also true that interactivity is problematic as it active at the phoneme level. Furthermore, a mechanism such as
increases beyond some optimal point--whether this increase is feedback connections from the phonological layer to some preced-
achieved by means of lowered seriality, increased feedback ing lexical level is necessary to account for lexical bias effects and
strength, or expansion of domains of interaction. Any architecture mixed errors arising at some lexical level. In this regard, one could
that allows for interaction across domains--be it FII_~A, HIA, say the theories simply vary in terms of what is represented at the
RIA, or even CFA--will encounter difficulties in matching the lexical level preceding the phoneme level--in some accounts this
patterns exhibited by P.W. and K.E. if the interaction is too great. is an amodal lemma level, whereas in others it is a modahty-
specific lexeme level. The data we have examined certainly do not
distinguish between these possibilities.
The Respective Contributions of Patterns of Impaired and
There is, however, one sense in which the lexeme-lemma dif-
Unimpaired Performance ference is indeed relevant. It concerns the broader generalizations
Although P.W.'s and K.E.'s errors are similar to the errors of one may wish to draw from this work. The evidence presented here
unimpaired individuals, what proves to be especially useful about indicates that the levels that most strongly interact are the phoneme
K.E.'s and P.W.'s errors is the fact that only semantic errors are and L levels. Given this, if we assume that the L level is a
produced and that we have independent grounds (from compre- modality-specific phonological lexeme level, then the generaliza-
hension and spelling tasks) for determining the loci (semantic and tion may well be that levels within a common representational
postsemantic, respectively) of these errors. Their contributions can domain (in this case, phonological) interact more strongly than
be summarized in the following way: P.W.'s pattern helps us to levels from different representational domains (phonology vs. se-
begin to set a lower bound on interactivity, whereas K.E.'s pattern mantics). In contrast, if the L level corresponds to an m o d a l
contributes to setting an upper bound. Simply put, there must be lexical level, then this kind of domain-based account of strong
sufficient interactivity for a true mixed error effect to arise in versus weak interactivity cannot be put forward. Thus, a lexeme-
L-level selection (P.W.'s pattern), yet interactivity must be suffi- based interpretation of the L level supports a broader and more
ciently restrained so that phonological effects are not observed elegant generalization.
subsequent to conceptual-level disruption (K.E.'s pattern). We have also reported that there are theories that assume that
Table 8 presents the key observations we have been concerned both lemma and lexeme levels mediate between concept and
with and summarizes the ability of the five theoretical positions to phoneme levels (Dell, 1986; Roelofs, 1992; see Figure 2D). It is
account for them. Table 8 makes it quite clear that (a) P.W.'s worth considering the implications of this assumption. First of all,
pattern creates insurmountable obstacles for DFA and CFA, (b) such a theory would allow for the domain-based generalizationjust
K.E.' s pattern puts limitations on the amount of upper level inter- proposed above. Second, if one were to require P.W.'s semantic
activity permitted under HIA, and (c) both P.W. and K.E. create errors (or the mixed errors produced by normal individuals) to
fundamental problems for FILSA. arise specifically in the course of lemma (rather than lexeme)
The data from normal speech errors make their primary contri- selection, then functional feedback connections would be required
bution in discriminating between DFA and CFA because DFA not only from the phoneme level to the lexeme level but also from
does not provide a mechanism that would predict mixed errors or the lexeme level to the lemma level. Although it is not clear that
a lexical bias effect. If an L-level locus could be convincingly there are good reasons to assume that mixed errors arise in lemma
established for a mixed error effect in normal speech errors, then versus lexeme selection, we point this out as an imphcation of the
these would play the same role as P.W.'s pattern of mixed errors lemma plus lexeme architecture.
originating at a postsemantic-prephonological locus. That is, a Finally, there are implications of a lemma plus lexeme-based
clear L-level locus for the normal mixed error effect would also be architecture for the problematic nature of K.E.'s pattern under
a fundamental obstacle for CFA. HIA. As connectivity distance increases, the functional conse-
quences of feedback connections decrease. In an architecture with
feedback and feedforward connections only between adjacent lev-
L-Level Nodes, Lemmas, and Lexemes
els, an additional level increases the distance between phonolog-
In the introduction, we adopted the convention of referring to ical and concept levels. In the context of K.E.'s data, increasing
the lexical level of representation mediating between concept and connectivity distance has the consequence that--at uniform feed-
phoneme levels as the L level. We did so to capture the common- back values throughout the system--the influence of phonology on
alities between the lemma-based theories of Dell and O'Seaghdha conceptually generated errors will decrease. On this basis, a sys-
(1992), Levelt (1989), Garrett (1980), and others and the lexeme- tem with full feedforward and feedback connectivity as well as
based independent network model proposed by Caramazza (1997). with both lemmas and lexemes will have less difficulty with K.E.'s
Recall that these theories differ with regard to whether they as- data. Crucially, however, to the extent to which connectivity
492 RAPP AND GOLDRICK

distance decreases the problematic nature of K.E.'s error pattern, it The expansion of interaction domains in FILSA does, however,
also renders the system less interactive functionally. To retain true make verifiable predictions regarding error types. That is, if a
interactivity across increasing connectivity distance, one must do functional expansion of interaction domains (visual features
things such as increase feedback strength, reduce seriality, and so through phonemes) has been achieved, we should see error types
forth. This underscores the point made earlier that the mere pres- that are not expected in a system in which these domains do not
ence of connectivity does not ensure its functionality. Thus, it is interact. The fact that we observed visual-phonological errors in
not enough to note that feedback connections are in place; it is the FILSA simulation is evidence that domain expansion was
critical to determine if the connections actually serve to bring achieved and that extensive domain interaction is indeed
together information from distant parts of the system. We have problematic.
made this point strongly with respect to feedback connections, but It is important, however, to make clear that the failures of the
it applies, of course, equally to all other means of implementing FILSA simulation to account for the relevant findings are not a
interactivity. consequence of the fact that the FILSA theory was implemented in
a distributed connectionist architecture rather than in the localist
Dell et al. (1997) connectionist architectures used to implement the other positions.
We assume that FILSA's failures stem from the fact that the
Dell et al. (1997) cautiously drew the conclusion that "parameter interaction domains were too extensive given the overall level of
alterations affect all layers of the network equally and, hence, interactivity in the system. Importantly, we assume that the ques-
differences in error rates across patients are explicable without tion of distributed versus localist connectionist principles is gen-
differential involvement of semantic, lexical or phonological erally orthogonal to the question of interactivity-discreteness with
units" (p. 832). They reached this conclusion, which they referred which we have been concerned. That is, the particular distributed
to as "the globality assumption," because parameter alterations connectionist architecture we used simply provided a means for
across the entire simulated naming system generated a fairly good implementing a more interactive position than HIA. It is presumed
match to the patterns exhibited by the patients they studied (but see that there would have been a way of accomplishing the same
Foygel & Dell, in press). Dell et al. did indicate, however, that it objective within a localist architecture. Likewise, we assume that
may be difficult for the globality assumption to account for pa- PDP principles could also be used to implement more discrete
tients (e.g., K.E. and P.W.) who make only semantic errors at theoretical positions than FILSA. Our conclusions concern the
relatively low levels of overall accuracy. 25 In fact, for these locus and extent of interactivity and discreteness in the spoken
reasons (among others), Foygel and Dell backed away from the naming system and not the classes of algorithms used for simu-
globality claim. lating the various theoretical positions.
Although we did not carry out an exhaustive exploration of all
possible parameter permutations, we did examine a wide range of The Role of Simulation in Evaluating Theoretical Claims
damage situations that involved manipulating multiple parameters We have used simulation as a tool to help us in thinking through
(P, Q, intrinsic noise, activation noise, jolt size, and step number). the predictions and implications of various theoretical claims. We
In the course of doing so, we did not find any global damage
setting that produced a pattern of only semantic errors at the rates
observed for K.E. and P.W. The results we report that indicate a 2~In response to these potentially problematic patterns, Dell et al. (1997)
good fit of the data under HIA were achieved only through local suggested that these patients might have made other kinds of errors had
damage. In addition, we were also able to show that C.S.S.'s different testing materiais and scoring methods been used. These method-
pattern of multiple error types plus a significant mixed error effect ological concerns are, in our view, completely groundless. First, both K.E.
(similar to that of Dell et al.'s [1997] high-weight individuals) and P.W. were tested with hundreds of line-drawing stimuli from a wide
variety of semantic categories (taken largely, but not exclusively, from
could (under both RIA and HIA) be accounted for with only local
Snodgrass and Vanderwart's [1980] set). These did not differ in any
damage. On this basis, we propose (see also Ruml & Caramazza,
apparent way from the stimuli used to test Dell et al.'s participants.
in press) that patterns such as those exhibited by K.E., P.W., and Furthermore, C.S.S., who did produce multiple error types, was tested with
R.G.B. constitute evidence that compels rejection of the assump- most of the same materials. Regarding scoring criteria, we too scored the
tion that brain damage necessarily affects global aspects of pro- first complete response produced and counted single phoneme errors as
cessing (see also Caplan, Vanier, & Baker, 1986, and Hillis, Rapp, errors. In addition, Dell et al. suggested that patients exhibiting a pattern of
& Goldrick, 1998, for other potentially problematic patterns). 26 only semantic errors produced a high degree of no responses and semantic
descriptions. This was not, however, the case for either P.W. or K.E., who
produced approximately 6% omissions. This rate was well within the range
Implementing lnteractivity exhibited by Dell et al.'s participants. Thus, the pattern of only semantic
errors that we report cannot be simply dismissed as artifactual.
Our work illustrates that interactivity can be achieved in a
26 A further claim of Dell et al. (1997) is of a distinction between deficits
variety of ways: with cascading activation, increasing feedback,
of representational integrity (implemented with increased decay parameter
decreasing seriality, and extending the interaction domains. We
values) and deficits of transmission strength (implemented with decreased
have focused primarily on establishing the locus and extent of weights on connections between units; see also Schwartz et al., 1994, and
interactivity rather than on determining the specific contributions Martin, Dell, Saffran, & Schwartz, 1994). The predictions derived from
of each of these mechanisms. Although the different means of this distinction were matched to an impressive degree in Dell et al. (but see
achieving interactivity may make empirically distinguishable pre- Foygel & Dell, in press). For this reason, the proposed distinction may
dictions, this has not been our principal concern here. prove to be a fruitful one.
INTERACTIVITY IN SPOKEN WORD PRODUCTION 493

have proceeded by first identifying the specific theoretical claims Our results suggest, however, that interaction may not be compa-
and the empirical phenomena with which we were concerned. rable between all adjacent levels. Specifically, the results indicate
Only then did we attempt to instantiate the various theoretical the presence of greater interactivity between the phoneme level
claims in simulations. The claims that we have been concerned and the L level than between the concept level and the L level. If,
with are fairly broad--they refer to the degree and locus of as indicated earlier, we take the L level to correspond to a
interactivity and discreteness in spoken word production. Further- modality-specific lexeme level, then we can take our work as
more, the phenomena we have been concerned with are fairly evidence for greater within- versus across-domain interaction. It is
qualitative and "broad" as well--presence of lexical bias, mixed interesting to note that this notion is consistent with generally
error effect, mixed error effect in the absence of phonological accepted facts about cortical connectivity, namely, that short-
errors, and so forth. Clearly, one could have detailed theoretical distance connections far outnumber long-distance connections. 28
claims that make specific quantitative predictions; the theoretical This lends itself to the supposition of a greater degree of functional
claims could include such things as the exact number of processing interaction between adjacent versus nonadjacent networks. If we
steps, the relationship between the strength of feedforward and adopt the further (also widely held) assumption that adjacent
feedback connections, the type and number of semantic features, neurons are likely to be involved in computations within a com-
and aspects of the activation function. In addition, one could be mon domain, then we would expect within-domain neural connec-
interested in far more detailed empirical phenomena--the specific tivity to support greater interactivity than across-domain
magnitude of a lexical bias effect, the specific rates of different connectivity.
erro r types, and so forth. Although this certainly does not count as evidence in favor of
It is important for empirical phenomena and theoretical claims RIA, we mention it to indicate that conclusions regarding restricted
to be commensurate in "grain." It clearly makes no sense to interactivity are at least broadly consistent with our current under-
evaluate theoretical claims that make precise quantitative predic- standing of cortical connectivity.
tions with qualitative behavioral findings. It is equally senseless,
however, to evaluate broad theoretical claims with detailed empir- Why Discreteness ? Why Interactivity ?
ical results. For this reason, we have not been overly concerned
The traditional argument for modularity (discreteness) as a
wi'th data "fitting." We have assumed that, given the theoretical
general design principle is that a modular system is easier to
issues with which we are concerned, the minor details of the data
"debug" or to improve than a highly interactive one. The rationale
are largely irrelevant. For example, C.S.S.'s rate of formal errors
underlying this is that in a modular system problems in one part of
was 3%, and his nonword error rate was 7%. Although CFA's best
the system will not necessarily have widespread consequences.
matches were 1% and 6%, respectively, whereas RIA's best
Likewise, improvements to some aspect of functioning can be
matches were 2% and 8%, respectively, we did not consider such carded out locally and will not necessarily require widespread
things in adjudicating among theories. We ignored them because
modifications (Marr, 1982). A further argument (in the context of
the theoretical claims we were evaluating simply did not make
spoken word production) was made by L e v e r et al. (1991b), who
predictions that were specific enough to make these distinctions.
argued that
We do not consider the simulations to be theories of spoken
word production. They are merely attempts to capture the central lexical selection and phonological encoding fulfill wildly different
aspects of various theoretical positions in ways that are, hopefully, functions. The former involves fast search in a huge lexicon for an
relatively insensitive to the multitude of details that need to be appropriate item; the latter creates an articulatory program for that
specified to get a simulation up and running but that, given the item. Any feedback from the latter to the former level only makes the
system more error prone than necessary. Modularity, one could say, is
theoretical issues under consideration, are irrelevant. To this end,
nature's protection against error. (p. 618)
we have examined the robustness of the behavior of the simula-
tions under a range of parameter settings, noise types, and so forth Interestingly, arguments favoring interactivity make very simi-
and have reported on the extent to which these factors influence lar points. For example, in terms of design principles, distributed
the results we have observed. On the basis of these explorations interactive systems are claimed to be highly robust under damage
and because we have developed some understanding of the mech- (Hinton & Sejnowski, 1986). The reasoning here is that unless
anisms of discreteness and interactivity, we consider our conclu- damage to some part of a system is very substantial, input from (or
sions to be fairly general and not restricted to the specific algo- interaction with) other parts of the system will often be sufficient
rithms and sets of implementational decisions that were made. to "clean up" the noise. Additionally, Dell et al. (1997), in striking
Nonetheless, without formal proof, this must remain an open contrast with Levelt et al. (1991b), argued that
question. 27
the function of lexical access is to get from conceptual representations
to phonological forms. It would be worthwhile if the decision about
Neurobiological Plausibility
Much has been made of the implications for cognitive theorizing 27 Rural et al. (in press) adopted a "data-fitting" approach to evaluating
that, presumably, derive from the observation of extensive neural the adequacy of various simulations including Dell et al.'s (1997) and RIA.
interconnectivity (and particularly extensive feedback connectiv- We consider this to reflect overconcem with matching aspects of the data
ity). In simulations of interactive cognitive systems, this extensive that are not predicted by the theoretical claims under consideration.
connectivity is typically implemented by means of full and com- 2a By some estimates, as many as 70% of synapses may be of local or
parable feedforward and feedback connectivity between levels. intrinsic rather than extrinsic origin (White, 1989).
494 RAPP AND GOLDRICK

which word to choose at lemma access were to be informed about how in 56 categories: A replication and extension of the Connecticut category
retrievable that word's phonological form is. (p. 830) norms. Journal of Experimental Psychology, 80, 1-46.
Best, W. (1996). When racquets are baskets but baskets are biscuits, Where
More generally, arguments favoring interaction stem from the do the words come from? A single case study of formal paraphasic errors
possibilities that interactivity provides for multiple constraint sat- in aphasia. Cognitive Neuropsychology, 13, 443-480.
isfaction. In fact, McCleUand, Rumelhart, and Hinton (1986) ar- Blanken, G. (1998). Lexicalisation in speech production: Evidence from
gued that people are smart because the brain uses a computational form-related word substitutions in aphasia. Cognitive Neuropsychology,
15, 321-360.
architecture that is ideally suited for simultaneously considering
Boland, J., & Cutler, A. (1996). Interaction with autonomy: Multiple
many and diverse pieces of information. This capacity favors
output models and the inadequacy of the great divide. Cognition, 58,
behaviors that are optimal in the sense that they are shaped by the 309 -320.
multiple constraints an organism is trying to satisfy. Butterworth, B. (1981). Speech errors: Old data in search of new theories.
Although this is not the place for an in-depth discussion of this Linguistics, 19, 627-662.
topic, we would like to briefly make three points. First, it is fairly Butterworth, B. (1989). Lexical access in speech production. In W.
easy to see that either modularity or interactivity taken to an Marslen-Wilson (Ed.), Lexical representation and process (pp. 108-
extreme would be likely to have unfavorable consequences. Sec- 135). Cambridge, MA: M1T Press.
ond, the extent to which either is advantageous is likely to vary Butterworth, B. (1992). Disorders of phonological encoding. Cognition,
across domains and even across tasks within a domain. And third, 42, 261-286.
there is a wide range of methods and means by which the neces- Caplan, D., Vanier, M., & Baker, C. (1986). A case study of reproduction
conduction aphasia: I. Word production. Cognitive Neuropsychology, 3,
sary degree of interactivity-discreteness may be achieved.
99-128.
With regard to extremes, in accord with Dell et al. (1997), it is
Caramazza, A. (1997). How many levels of processing are there in lexical
clear that a consideration of the phonological forms that are
access? Cognitive Neuropsychology, 14, 177-208.
available should play some role in expressive language. We would Caramazza, A., & Hillis, A. E. (1990). Where do semantic errors come
not want to be able to only utter the word corresponding to the from? Cortex, 26, 95-122.
precise meaning we have in mind or remain mute. For example, if Caramazza, A., Miceli, G., Villa, G., & Romani, C. (1987). The role of the
one is unable to retrieve an intended phonological form (sofa), a graphemic buffer in spelling: Evidence from a case of acquired dys-
synonym may often (although certainly not always) be satisfactory graphia. Cognition, 26, 59-85.
(couch). However, if one is thinking canine, one does not want the Caramazza, A., & Miozzo, M. (1997). The relation between syntactic and
fact that cat is the most highly accessible phonological form phonological knowledge in lexical access: Evidence from the "tip-of-
to lead one to think that the meaning one wants to express is, the-tongue" phenomenon. Cognition, 64, 309-343.
in fact, cat. Cutting, J. C., & Ferreira, V. S. (1999). Semantic and phonological infor-
marion flow in the production lexicon. Journal of Experimental Psy-
Furthermore, it is likely that success in certain tasks requires
chology: Learning, Memory, and Cognition, 25, 318-344.
maximal consideration of multiple, comparably important con-
Damian, M. F., & Martin, R. C. (1999). Semantic and phonological codes
straints, whereas other tasks require that certain constraints be interact in single word production. Journal of Experimental Psychology:
given high or even exclusive priority. 29 In spoken naming, for Learning, Memory, and Cognition, 25, 345-361.
example, the meaning to be expressed is the driving consideration; Dell, G. S. (1985). Positive feedback in hierarchical connectionist models:
accessibility of the phonological forms is a lesser one. In contrast, Applications to language production. Cognitive Science, 9, 3-23.
when one is planning a reaching movement to a salt shaker on a Dell, G. S. (1986). A spreading activation theory of retrieval in sentence
crowded stove top with hot burners, the locations of the obstacles production. Psychological Review, 93, 283-321.
and the hot burners may provide constraints comparable to those of Dell, G. S. (1988). The retrieval of phonological forms in production: Test
the location of the salt shaker itself. of predictions from a connectionist model. Journal of Memory and
In conclusion, we have discussed a variety of mechanisms that Language, 27, 124-142.
can be used to implement discreteness and interactivity. The work Dell, G. S., & O'Seaghdha, P. G. (1991). Mediated and convergent lexical
priming in language production: A comment on Levelt et al. (1991).
reported in this article represents an attempt to determine the
Psychological Review, 98, 604-614.
balance between discreteness and interactivity that exists in the
Dell, G. S., & O'Seaghdha, P. G. (1992). Stages of lexical access in speech
human word production system and to begin to understand the production. Cognition, 42, 287-314.
mechanisms by which it may be achieved. Dell, G. S., & Reich, P. A. (1980). Toward a unified model of slips of the
tongue. In V. A. Fromkin (Ed.), Errors in linguistic performance (pp.
473-486). New York: Academic Press.
Dell, G. S., & Reich, P. A. (1981). Stages in sentence production: An
29 These ideas are related to the more technical notions of harmony
analysis of speech error data. Journal of Verbal Learning and Verbal
maximization, constraint ranking, and strict domination expressed in har-
Behavior, 20, 611-629.
mony theory (Smolensky, 1986) and optimality theory (Prince & Smolen-
Dell, G. S., Schwartz, M. F., Martin, N., Saffran, E. M., & Gagnon, D. A.
sky, in press).
(1997). Lexical access in aphasic and nonaphasic speakers. Psycholog-
ical Review, 104, 801-838.
References del Viso, S., Igoa, J. M., & Garcia-Albert, J. E. (1991). On the autonomy
of phonological encoding: Evidence from slips of the tongue in Spanish.
Baars, B. J., Motley, J. T., & MacKay, D. (1975). Output editing for lexical Journal of Psycholinguistic Research, 20, 161-I 85.
status from artificially elicited slips of the tongue. Journal of Verbal Elman, J. L., & MeClelland, J. L. (1988). Cognitive penetration of the
Learning and Verbal Behavior, 14, 382-391. mechanisms of perception: Compensation for coarticulation of lexieally
Battig, W. F., & Montague, W. E. (1969). Category norms of verbal items restored phonemes. Journal of Memory and Language, 27, 143-165.
INTERACTIV1TY IN SPOKEN WORD PRODUCTION 495

Foygel, D., & Dell, G. (in press). Models of impaired lexical access in & I-Iavinga, J. (1991b). Normal and deviant lexical processing: l(eply to
speech production. Psychological Review. Dell and O'Seaghdha (1991). Psychological Review, 98, 615-618.
Gaguon, D. A., Schwartz, M. F., Martin, N., Dell, G. S., & Saffran, E. M. MacKay, D. G. (1987). The organization of perception and action: A
(1997). The origins of formal paraphasias in aphasics' picture naming. theory for language and other cognitive skills. New York: Springer-
Brain and Language, 59, 450-472. Verlag.
Garrett, M. F. (1980). Levels of processing in sentence production. In B. Marl D. (1982). Vision. New York: Freeman.
Butterworth (F_xl.), Language production (pp. 177-220). New York: Martin, N., Dell, G. S., Saffran, E. M., & Schwartz, M. F. (1994). Origins
Academic Press. of paraphasias in deep dysphasia: Testing the consequences of a decay
Harley, T. A. (1984). A critique of top-down independent levels models of impairment to an interactive spreading activation model of lexical re-
speech production: Evidence from non-plan-internal speech errors. Cog- trieval. Brain and Language, 47, 609-660.
nitive Science, 8, 191-219. Martin, N., Gagnon, D. A., Schwartz, M. F., Dell, G. S., & Saffran, E. M.
Harley, T. A. (1993). Phonological activation of semantic competitors (1996). Phonological facilitation of semantic errors in normal and apha-
during lexical access in speech production. Language and Cognitive sic speakers. Language and Cognitive Processes, 11, 257-282.
Processes, 8, 291-309. Marlin, N., & Saffran, E. M. (1992). A computational account of deep
Harley, T. A. (1995). Connectionist models of anomia: A comment on dysphasia: Evidence from a single case study. Brain and Language, 43,
Nickels. Language and Cognitive Processes, 10, 47-58. 240-274.
Harley, T. A., & MacAndrew, S. B. G. (1995). Interactive models of Martin, N., Weisberg, R. W., & Saffran, E. M. (1989). Variables influenc-
lexicalisation: Some constraints from speech error, picture naming, and ing the occurrence of naming errors: Implications for models of lexical
neuropsychological data. In J. P. Levy, D. Bairaktaris, J. A. Bullinaria, retrieval. Journal of Memory and Language, 28, 462-485.
& P. Cairns (F_As.), Connectionist models of memory and language (pp. Massaro, D. W., & Cohen, M. M. (1991). Integration versus interactive
311-331). London: UCL Press. activation: The joint influence of stimulus and context in perception.
Hillis, A. E., & Caramazza, A. (1995). Converging evidence for the Cognitive Psychology, 23, 558-614.
interaction of semantic and sublexical phonological information in ac- McClelland, J. L., & Rumelhart, D. E. (1981). An interactive activation
cessing lexical representations for spoken output. Cognitive Neuropsy- model of context effects in letter perception: I. An account of basic
chology, 12, 187-227. findings. Psychological Review, 88, 375-407.
Hillis, A. E., Rapp, B., & Goldrick, M. (1998, June). A pattern of phono- McCleUand, J. L., Rurnelhart, D. E., & Hinton, G. E. (1986). The appeal of
logical errors in spoken word comprehension and production: Implica- parallel distributed processing. In D. E. Rumelhart & J. L. McCleUand
tions for interactive theories of spoken language. Poster presented at the (Eds.), Parallel distributed processing: Explorations in the microstruc-
2rid International Workshop on Neural Modeling of Brain and Cognitive ture of cognition (Vol. 1, pp. 3-44). Cambridge, MA: MIT Press.
Disorders, College Park, MD. Miozzo, M., & Caramazza, A. (1997). Retrieval of lexieal-syntactic fea-
Hillis, A. E., Rapp, B., Romani, C., & Caramazza, A. (1990). Selective tures in tip-of-the-tongue states. Journal of Experimental Psychology:
impairment of semantics in lexical processing. Cognitive Neuropsychol- Learning, Memory, and Cognition, 23, 1410-1423.
ogy, 7, 191-243. Nickels, L. (1992). The autocue? Self-ganerated phonemic cues in the
Hinton, G. E., & Sejnowski, T. (1986). Learning and relearning in Boltz- treatment of a disorder of reading and naming. Cognitive Neuropsychol-
mann machines. In D. E. Rumelhart & J. L. McClelland (Eds.), Parallel ogy, 9, 155-182.
distributed processing: Explorations in the microstructure of cognition Nickels, L. (1995). Getting it right? Using aphasic naming errors to
(Vol. 1, pp. 282-317). Cambridge, MA: MIT Press. evaluate theoretical models of spoken word recognition. Language and
I-Iinton, G. E , & ShaUice, T. (1991). Lesioning an attractor network: Cognitive Processes, 10, 13-45.
Investigations of acquired dyslexia. Psychological Review, 98, 74-95. Paap, K. R., Newsome, S. L., McDonald, J. E., & Schvaneveldt, R. W.
Humphreys, G., Riddoch, M. J., & Quinlan, P. T. (1988). Cascade pro- (1982). An activation verification model for letter and word recognition:
cesses in picture identification. Cognitive Neuropsychology, 5, 67-104. The word superiority effect. Psychological Review, 89, 573-594.
Johnston, J. C. (1978). A test of the sophisticated guessing theory of word Peterson, R. R., & Savoy, P. (1998). Lexical selection and phonological
perception. Cognitive Psychology, 10, 123-153. encoding during language production: Evidence for cascaded process-
Kempen, G., & Huijbers, P. (1983). The lexicalization process in sentence ing. Journal of Experimental Psychology: Learning, Memory, and Cog-
production and naming: Indirect election of words. Cognition, 14, 185- nition, 24, 539-557.
209. Plaut, D. C., & Shallice, T. (1993a). Deep dyslexia: A case study of
Krauth, J. (1988). Distribution-free statistics: An application-oriented ap- connectionist neuropsychology. Cognitve Neuropsychology, 10, 377-
proach. Amsterdam: Elsevier. 500.
Laine, M., & Martin, N. (1996). Lexical retrieval deficit in picture naming: Plant, D. C., & ShaUiee, T. (1993b). Perseverative and semantic influences
Implications for word production models. Brain and Language, 53, on visual object naming in optic aphasia: A cormectionist account.
283-314. Journal of Cognitive Neuroscience, 5, 89-117.
Levelt, W. J. M. (1983). Monitoring and self-repair in speech. Cognition, Prince, A., & Smolensky, P. (in press). Optimality theory: Constraint
14, 41-104. interaction in generative grammar. Cambridge, MA: M1T Press.
Levelt, W. J. M. (1989). Speaking: From intention to articulation. Cam- Rapp, B., Benzing, L., & Caramazza, A. (1997). The autonomy of lexical
bridge, MA: MIT Press. orthography. Cognitive Neuropsychology, 14, 71-104.
Levelt, W. J. M. (1992). Accessing words in speech production: Stages, Rapp, B., & Caramazza, A. (1998). A case of selective difficulty in writing
processes and representations. Cognition, 42, 1-22. verbs. Neurocase, 4, 127-140.
Levelt, W. J. M., Roelofs, A., & Meyers, A. S. (1999). A theory of lexical Reicher, G. M. (1969). Perceptual recognition as a function of meaning-
access in speech production. Behavioral and Brain Sciences, 22, 1-75. fulness of stimulus material. Journal of Experimental Psychology, 81,
Levelt, W. J. M., Schriefers, H., Vorberg, D., Meyer, A. S., Pechnaatm, T., 275-280.
& Havinga, J. (1991a). The time course of lexical access in speech Roelofs, A. (1992). A spreading-activation theory of lemma retrieval in
production: A study of picture naming. Psychological Review, 98, 122- speaking. Cognition, 42, 107-142.
142. Roelofs, A. (1997). The WEAVER model of word-form encoding in
Levelt, W. J. M., Schriefers, H., Vorberg, D., Meyer, A. S., Pechmann, T., speech production. Cognition, 64, 249-284.
496 RAPP AND GOLDRICK

Roelofs, A., Meyer, A. S., & Levelt, W. J. M. (1996). Interaction between microstructure of cognition (Vol. 1, pp. 194-281). Cambridge, MA:
semantic and orthographic factors in conceptually driven naming: Com- MIT Press.
ment on Starreveld and La Heij (1995). Journal of Experimental Psy- Snodgrass, J. G., & Vanderwart, M. A. (1980). A standardized set of 260
chology: Learning, Memory, and Cognition, 22, 246-251. pictures: Norms for name agreement, image agreement, familiarity, and
Rumelhart, D. E., & McClelland, J. L. (1982). An interactive activation visual complexity. Journal of Experimental Psychology: Human Learn-
model of context effects in letter perception: U. The contextual enhance- ing and Memory, 6, 174-215.
ment effect and some tests and extensions of the model. Psychological
Starreveld, P. A., & La Heij, W. (1995). Semantic interference, ortho-
Review, 89, 60-94.
graphic facilitation, and their interaction in naming tasks. Journal of
Ruml, W., & Caramazza, A. (2000). An evaluation of a computational
model of lexical access: Comment on Dell et al. (1997). Psychological
Experimental Psychology: Learning, Memory, and Cognition, 21,686-
Review, 107, 609-634. 698.
Ruml, W., Caramazza, A., Shelton, J., & Chialant, D. (in press). Testing Starreveld, P. A., & La Heij, W. (1996). Time-course analysis of
assumptions in computational theories of aphasia. Journal of Memory semantic and orthographic context effects in picture naming. Journal
and Language. of Experimental Psychology: Learning, Memory, and Cognition, 22,
Samuel, A. G. (1997). Lexical activation produces potent phonemic per- 869-918.
cepts. Cognitive Psychology, 32, 97-127. Stemberger, J. P. (1983). Inflectional malapropisms: Form based errors in
Schriefers, H., Meyer, A. S., & Levelt, W. J. M. (1990). Exploring the time English morphology. Linguistics, 21, 573-602.
course of lexicai access in production: Picture-word interference studies. Stemberger, J. P. (1985). An interactive activation model of language
Journal of Memory and Language, 29, 86-102. production. In A. W. Ellis (Ed.), Progress in the psychology of language
Schwartz, M. F., Saffran, E. M., Bloch, D. E., & Dell, G. S. (1994). (Vol. 1, pp. 143-186). Hillsdale, NJ: Erlbaum.
Disordered speech production in aphasic and normal speakers. Brain and
Van Orden, G. C., de Haar, M. A., & Bosman, A. M. T. (1997). Complex
Language, 47, 52-88.
dynamic systems also predict dissociations, but they do not reduce to
Shallice, T., & McGill, J. (1978). The origins of mixed errors. In J. Requin
autonomous components. Cognitive Neuropsycho logy, 14, 131-165.
(Ed.), Attention and performance VII (pp. 193-208). Hillsdale, NJ:
Erlbaum. Wheeler, D. D. (1970). Processes in word recognition. Cognitive Psychol-
Smolensky, P. (1986). Information processing in dynamical systems: ogy, 1, 59-85.
Foundations of harmony theory. In D. E. Rumelhart & J. L. McClel- White, E. L. (1989). Cortical circuits: Synaptic organization of the cere-
land (Eds.), Parallel distributed processing: Explorations in the bral cortex--Structure, function and theory. Boston: Birkhauser.

Appendix A

Other Means for Simulating Brain Damage

Dell et al. (1997) simulated damage by increasing values of P or Q as over that of the target. At this point, errors will again correspond fairly
well as using intrinsic and activation noise. In spite of apparent differences directly to the activation functions observed under noiseless conditions.
between our respective methods of implementing damage, the conse- However, when absolute activation values become very small relative to
quences are highly similar. Note that Dell et al.'s activation function is intrinsic noise levels, intrinsic noise will begin to approximate a random
identical to the one we adopted except that in addition to the activation error selection process. In sum, Q manipulations merely create the condi-
noise term, it incorporates an "intrinsic error" term. tions whereby noise processes can generate errors, whereas intrinsic or
The first thing to note is that changing the Q (decay) value for a activation noise largely respects the relationships present in the nondam-
particular layer (or globally) will merely decrease every unit's activation aged state.
by the same proportion. Only absolute activation values will be affected, The consequences of manipulating P in a feedforward architecture are
and thus the relationships among units will be unaffected. In fact, the only
much like those for Q. However, the consequences of P manipulation in
way that an increase in decay rate can result in any errors at all is if absolute
interactive systems are less transparent. Dell et al. (1997) argued that
activation values are reduced to a level where noise processes can generate
reductions in P have the effect of reducing the "concurrence" and interac-
errors. Thus, an increase in decay per se does not result in errors; it only
tivity between levels. They claimed that this manifests itself in a variety of
indirectly allows noise to have an impact on the outcome.
ways, including an attenuation of mixed error effects and an increase in
Activation noise will preserve the activation relationships among units
that are apparent in the noiseless situation--strong competitors of the target nonword and unrelated word errors. We have not been able to characterize
(highly active units) are more likely errors than weak competitors. Intrinsic the effects of P damage in any systematic manner (but see Foygel & Dell,
noise also has much the same effect as activation noise,A1 although because in press).
it affects all units comparably, the consequences are less predictable as
absolute activation levels fall to very low levels. As absolute activation
values begin to fall (as time passes or if Q is increased), the difference in A! For example, intrinsic noise is thought to be the primary cause of
activation levels between the target and other units becomes small enough unrelated lexical errors. However, high activation noise at the L level
for intrinsic noise to have an effect and push a competitor's activation level produces high levels (e.g., 48%) of unrelated errors.
INTERACTIVITY IN SPOKEN WORD PRODUCTION 497

Appendix B

Computation of Predicted Mixed Error Rates

Our analysis of mixed errors is based on the following formula and is We derive this estimation as follows. To a first approximation (see
designed to determine if the number of mixed errors we observe in the below), pure semantic errors arise only through the semantic process. As
simulations under noisy conditions at phoneme and L levels is what would shown above, this will be equal to 2/3 of S. Because 1/3 o r S will be mixed
be predicted by the assumptions of a discrete feedforward theory. Accord- errors, we estimate the number of mixed errors arising through the seman-
ing to a discrete feedforward theory, tic process to be 1/2 of the observed pure semantic errors. Furthermore,
because formal* errors arise only through the phonological process, their
Sob. . . . . d = k ( S ) + (1 - k ) ( S ) + c ( F " ) observed error rate will be equal to 1/2 of Fm. This is equal to the number
or (given the simulation lexicon) of mixed errors arising through the phonological process. The estimated
rate of mixed errors will therefore be the sum of mixed errors from both
Sob..... d = 2/3(S) + 1/3(S) + 1/2(Fm). processes: 1/2 the number of pure semantic errors plus the number of
formal* errors.
Sob~ve,t consists of all errors that are semantically related to the target This estimation does not state several minor complications, which we,
(pure semantic plus mixed semantic and phonological errors). These errors nonetheless, did take into account. First, we do not wish to include formal*
can be generated by either a semantic process or a phonological process. S errors that are not strictly the result of the phonological process, for
corresponds to the number of Sobser~~ that are generated from the semantic example, a mixed error at L-level selection followed by a phonological
process. Some percentage of S will, because of the structure of the lexicon, encoding error that yields formal*. Consequently, our calculations use the
not have a phonological relationship with the target. We denote this number of formal* errors that specifically come about as a result of correct
percentage by k, equal to 2/3 in our neighborhood (two of the three selection at the L level followed by an error in phonological encoding.
semantic neighbors do not share phonology with the target). The first term, In addition, there are also slight adjustments for the mixed and pure
k(S) + (1 - k)(S), expresses this: Some percentage k of the errors semantic error rates. The actual mixed error rate M is
generated by the semantic process will share no phonology with the target,
whereas 1 - k of the errors will share phonology. M = (1 - k ) S + c F ~ + (M1Sn~vho~) - (Sno~phJM)-- (Non-S~o~pho,errodM),
Semantic errors can also be generated by a phonological process. F m
corresponds to the number of errors arising from the phonological pro- whereas the "nonphonologicar' semantic error rate, S,o,pnon, is actually
cesses that are as similiar to the target as the mixed error is. Some Sno,pho, = k S + (SnonphonlM)- (MlSnonphon)-- (Non-MIS.o~pbon).
percentage of these errors will also, because of the lexicon structure, have
a semantic relationship with the target. We call this percentage c, which is For the mixed errors, these include such things as mixed errors that result
equal to I/2 in our neighborhood (formal* and the mixed error are equally from the misselection of a semantic neighbor that is then followed by the
similiar to the target; thus, 1/2 of these errors will bear a semantic misselection of phonemes at the phoneme level (MlSnonpbon)or the situa-
relationship). Note that mixed and formal* neighbors are only 2 of the 12 tion in which the selection of a mixed neighbor at the L level is followed
words in the neighborhood that are formally related to the target. by a phonological error that results either in a semantic e r r o r (SnonphonlM)
From this formula, using the pure semantic and formal* errors produced or in some other response (Non-Snonphon IM). Similar adjustments must be
by any of the simulations, we derive the estimated rate of mixed errors made for the pure semantic errors.
expected according to the assumptions of DFA. We then compare (by a Z If we assume that (SnonphonlM) and (MlSnonpbon) ale essentially equal,
test) the rate of expected mixed errors and the rate of simulated mixed then these contributions will cancel out in each formula. If we also assume
errors. If they do not differ, then we conclude that the simulation is that (Non-Snonphon errorIM) and (Non-M errorlSnonphon)are equal, they will
compatible with DFA. If they differ, then we assume a mixed error effect affect each error rate in the same manner and thus will not change the
that is incompatible with a discrete account. relationship between them. Therefore, we can ignore these corrections.

Appendix C

Lexical Bias Effects and the Cascading Activation Account

As discussed in the text, it may be necessary to consider the phonolog- run as before, except that just prior to phoneme selection the onset and
ical inventory of a target word's semantic neighborhood in order to vowel phonemes of the target were "deselected" by getting their activity
establish that the rate of formal errors observed conforms to that predicted levels to 0. This mimics a trial on which an error is made in both the onset
under the theoretical assumptions of CFA. and vowel.
To demonstrate the role of the semantic neighborhood in determining The results of 10,000 runs conformed to what was predicted. First, when
word and nonword outcomes, we ran the CFA simulation with some minor matched for similarity to semantic neighbors of the target, comparable
modifications. We identified two pairs (A and B) of word and nonword error word and nonword rates were observed for the nonsemantic pair (9
outcomes. All four outcomes had the same vowel and coda, but they words vs. 10 nonwords) and the semantic pair (1,179 words vs. 1,195
differed in their onset phonemes. Each pair was matched in connectivity in nonwords). Second, outcomes (word or nonword) supported by a semantic
the onset position. Word A and" Nonword A had onsets that were not neighbor (the semantic pair) were far more likely than outcomes not
constituents of any semantic neighbor (pairs lacking semantic support). supported by semantic neighbors (nonsemantic pairs).
Word B and Nonword B each shared an onset with a single semantic This suggests that in evaluating empirically obtained measures of word
neighbor of the target (pairs with semantic support). The simulation was and nonword outcomes, it may be important to assess the potentially
498 RAPP AND GOLDRICK

distorting effect of the phonological makeup of the semantic category. To activation. The results (Table C1) of word body pairings with the most
determine the extent to which such considerations may actually be neces- frequent onset competitor indicate a wide range of word outcome rates
sary, we established word and nonword error rates predicted under CFA for across categories---from a high of 87% in the category of body parts to a
onset errors in eight semantic categories. low of 0% in the category of fruits. The results obtained from pairings with
For each category, we listed all the phonemes that were in word onset the lower frequency onset phonemes indicate that there is also considerable
positions. These were assumed to be the most likely competitors for the within-category variability with word outcomes ranging from 0% to 57%
onset position of a target word because they would be receiving "cascad- in the fruit category and from 33% to 88% in transportation.
ing" activation from higher levels. (Onset-less items as well as morpho- Given these results, it is possible that if empirically determined
logically complex items such as compounds or plausibly prefixed items, for word-nonword rates are higher than would expected under DFA, they
example, blackberry-blueberryand bicycle-tricycle, were removed.) From may, depending on the semantic category membership of the stimuli
this group, the three onsets that appeared first, second, and third most (and especially in cases in which stimuli are from a single or limited
frequently in each semantic category were identified. These would be the number of categories), be consistent with a cascading activation ac-
competitor phonemes receiving the most support from higher level units. count. In that case, further interactivity assumptions (e.g., feedback)
For each semantic category, we then paired the word body of each item would be unwarranted. Because of these possibilities, definitive con-
with each of the most frequent competitor onset phonemes. The results of clusions regarding whether reported word-nonword rates in normal
pairing word bodies with the most frequent onset competitor constitute the speech errors are actually problematic for CFA await further analyses
most likely error for each word in each category and provide an estimate that take into account the phonological composition of the semantic
of predicted rates of word-nonword outcomes in a system with cascading neighborhoods of target items.

Table C1
Predicted Percentage of Word Outcomes for Onset Errors Across Different Semantic Categories
Under the Assumptions of the Cascading Feedforward Account

Most frequent 2rid most frequent 3rd most frequent


Category onset competitor onset competitor onset competitor

Animals 51 45 58
Furniture 27 37 37
Body parts 87 74 88
Fruit 0 57 29
Clothes 69 39 35
Birds 39 24 9
Transportation 33 88 33
Vegetables 18 28 8

Appendix D

L e a r n i n g in t h e F u r t h e r I n t e r a c t i o n , L o w S e d a l i t y A c c o u n t

The procedure involved two phases: a positive phase and a negative semantics, how to generate visual representations from phonological input
phase. In the positive phase, all three visible layers are clamped, and the through semantics, and how to generate phonology and visual representa-
hidden unit activations are calculated. The positive phase corresponds (in tions from semantic input.
some sense) to the target of learning. In the negative phase, only one visible To balance the positive and negative phases, the pairwise product of
layer is clamped, and hidden and visible layers are allowed to update their the units' positive phase activation was multiplied by 3 before subtract-
states. In contrast to the positive phase, the negative phase measures how ing the three negative phase products. This difference was added into a
well the network has already learned the task. Learning then involves list of pending weight changes. The entire process was repeated for each
changing the weight between two units in proportion to the difference in word, with the positive and negative phase product differences for each
the products between the two units for the positive and negative phases. weight accumulating until every word had been processed. After the set
In the FILSA simulation, the negative phase had three subparts, each was done, the weights were changed (using a weight step of 0.01 and no
corresponding to the clamping of one of the three visible layers. This was momentum). In our network, the 40 words were presented 7,900 times
done to make the system highly interactive, forcing it to be responsive to before the state of each visible unit was within 0.2 of its correct value
the demands of all three visible layers. Essentially, the network was during each of the three negative phases. No attempt was made to
learning three tasks: how to generate phonology from visual input through minimize the training time.

You might also like