Professional Documents
Culture Documents
Controlled and Automatic Human Information Processing - II. Perceptual Learning, Automatic Attending, and A General Theory PDF
Controlled and Automatic Human Information Processing - II. Perceptual Learning, Automatic Attending, and A General Theory PDF
TRIAL I
TRIAL 2 rs63
CM -
TRIAL 3
TRIAL 4
TRIAL I
TRIAL 2 CJK
VM -
TRIAL 3
TRIAL 4
Figure 1. Examples of trials in the multiple-frame search paradigm of Experiment 1, Part I. In all
cases, memory-set size = 4 and frame size = 2. Four varied-mapping (VM) trials and four con-
sistent-mapping (CM) trials are depicted. The memory set is presented in advance of each trial,
then the fixation dot goes on for .5 sec when the subject starts the trial, and then 20 frames are
presented at a fixed time per frame. Either 0 or 1 member of the memory set is presented during
each trial. Frame time, memory-set size, and frame size are varied across conditions.
roles of automatic and controlled processing. frame. Each trial consists of the presentation
Our theory will then be compared and con- of 20 frames in immediate succession. The
trasted with extant theories of search and elements presented are characters (i.e., digits
attention. or consonants) or random dot masks. In ad-
vance of each trial the subject is presented
A. Review of Paradigms and Results From with several items, called the memory set, and
Part I is then required to detect any memory-set
items that appear in the subsequent frames.
The paradigm for most of the studies of The frame time is kept constant across the
Part I (and Part II also) is depicted in 20 frames of each trial, and the basic depen-
Figure 1. Four elements are presented simul- dent variable is the psychometric function
taneously in a square; and their joint presen- relating accuracy to frame time for each
tation for a brief period of time is termed a condition.
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 129
CONSISTENT VARIED
MAPPINGS MAPPINGS
100 100
c/) O)
H
.(4,1)
I80 4,2)
80
r- (4,4)
A" (1,4)
60 LU
60
O
LU LJ
40
120
40 80 120 200 400 600 800
FRAME TIME (msec) FRAME TIME (msec)
Figure 2. Data from Experiment 1, Part I. Hits as a function of frame time for each of the 12
conditions. Three frame times were utilized for each condition. The first number in parentheses
indicates the memory-set size and the second number in parenthesis indicates the frame size.
dons of greatest interest were those in which frame procedure and will infer the presence
two targets per trial were presented. In these of automatic detection or controlled search
cases the spacing between the two targets from the nature of the spacing effect and the
varied from 0 to 4 (0 spacing indicates simul- target-similarity effect.
taneous presentation; spacings of 1, 2, and 4
indicate the number of intervening frame B. Rationale for the Experiments
plus 1). Target similarity also varied, the
two targets being either physically identical We argued in Part I that the cause of the
(II) or different (NI). difference between the CM and VM results
The results for Part I/Experiment 3 was the consistency of the mapping (over
showed markedly different patterns for the trials) of the memory-set items and dis-
VM and CM conditions. Consider target tractors to responses. We argued that con-
similarity first. In the VM conditions de- sistent mapping leads to the development of
tection of identical targets (II) was superior automatic detection, which enables auto-
to detection of different targets (NI). How- matic-attention responses to become attached
ever, in the CM conditions, the reverse to the memory-set items. The automatic-at-
was true: NI detection was superior to II tention responses enable the serial, controlled
detection. Consider target spacing next. In search to be bypassed by a parallel detection
the VM conditions performance was lowest process unaffected by load. However, these
when the targets occurred in successive frames hypotheses must be further tested for the
(spacing 1). However, in the CM conditions following reasons. First, the fact that mem-
performance was lowest when the targets were ory-set items were categorically distinct
simultaneous (spacing 0). These results from the distractors was confounded with the
helped emphasize the qualitative difference consistency of the mapping. These factors
between the automatic detection processing will be separated in Part II/Experiments 1,
mode presumed to be utilized in the CM 2, and 3. Second, the course of development
conditions, and the controlled search mode of the hypothesized automatic detection
presumed to be utilized in the VM conditions. process was not studied in Part I. The course
In certain of the studies in this paper, we of learning will be traced in Part II/Experi-
shall utilize this multiple-target, multiple- ments 1 and 3. Third, there was no demon-
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 131
stration in Part I that an automatic-atten- was always equal to 2 and the memory-set size was
tion response is learned in CM paradigms. always equal to 4. Two disjoint character sets were
used for the memory ensemble and the distractor
Such a fact will be suggested by Part II / set. One consisted of the following nine consonants:
Experiments 1 to 3 and demonstrated in B, C, D, F, G, H, J, K, and L; the other con-
Part II/Experiment 4. Finally, although these sisted of the following nine consonants: Q, R, S,
goals provide an initial justification for the T, V, W, X, Y, and Z.
present experiments, these studies will serve There were four new subjects, two of each sex,
all naive to our tasks.1 Two subjects began training
an even more important purpose in elu- with the consonants from the second half of the
cidating the characteristics and development alphabet 'as the memory ensemble, and two subjects
of automatic processing and in differentiating began with the consonants from the first half of
automatic from controlled processing. the alphabet as the memory ensemble. There were
five blocks of trials per session, each containing
60 test trials. There were no practice trials. Subjects
II. The Development of Automatic Processing: were informed at the start of the experiment con-
Perceptual Learning cerning the nature of the study and the composi-
tion of the memory ensemble and the distractor set.
A. Perceptual Learning and Unlearning in a Each trial began with the presentation of a memory
CM Task Using Letters Only set (M = 4) selected randomly from the memory
ensemble.
Experiment 1 of the present series is simple During the first 1,500 trials of the experiment
in conception. With the use of the same basic for each subject, the frame time was 200 msec;
multiple-frame paradigm used in Part I/ during the following 600 trials, 120 msec. After
these 2,100 trials the memory ensemble and the
Experiment 1 (see Figure 1), performance is distractor set were switched for each subject and
examined as a function of the amount of the frame time was set back to 200 msec. This
training under consistent-mapping conditions. reversal 'Condition was then run at the frame time
One change is made, however: Both the dis- of 200 msec for a total of 2,400 additional trials. The
subjects were informed of the reversal at the time
tractor set and the memory ensemble consist of the switch.
of consonants. This procedure enables us to The subjects responded whenever a target was
study the acquisition of automaticity when detected, or gave a negative response at the end of
the two sets are not already-learned cate- the trial. The accuracy of the response and the
gories. Furthermore, it had been observed reaction time for hits were both recorded. Sub-
jects heard a tone signifying an error after each
informally that the use of digits and con- incorrect response.
sonants (in the CM conditions of Part I/
Experiment 1) led to a relatively rapid 2. Results and Discussion
acquisition of automatic detection. It was
felt that the use of letters only to make up The results are presented in Figure 3. Re-
the two sets would slow down acquisition. sults for each block, averaged across sub-
Values of M and F were chosen so as to jects, are graphed consecutively. Thus, the
make controlled processing difficult (M = 4, graphed points in each interval are based on
F — 2 ) , and a frame time (/) of 200 msec 120 observations.
was chosen so as to make performance low In the initial group of 1,500 trials, the hit
when controlled processing was being utilized. rate rose from just over 50% to about 90%,
(These choices are justified by the data in while the false alarm rate dropped from
Figure 2.) Thus it was expected that per- about 12% to about 3% (and the reaction
formance would be quite poor at the start of
training, when automatic detection had not 1
been learned and controlled search had to be Since the experiments are not reported in their
used, but would improve markedly as auto- chronological order, we list here the experiments in
their original order and indicate the subjects that
matic detection developed. were used in each. Experiment 1 was run with new
subjects. Experiment 4 was run next ) using the
I . Method subjects who took part in Experiment 3 of Part I.
Experiment 2 was then run on three of the four
The CM presentation procedure of Part I/Experi- subjects in Experiment 4. Experiment 3 was run
ment 1 was utilized (see Figure 1). The frame size last, on new subjects.
132 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
100
EC/)
ot 80
0:=
60
INITIAL LEARNING
• REVERSED LEARNING-
40 200 MSEC FRAME TIME
HO: 120 MSEC FRAME TIME _
2<
LU-J
20
*tt*^*fi*.
2 o
600 1200 1800 2400
X 850
o:
£
I-
2 650
g
550
UJ 600 1200 1800 2400
CC
TRIAL NUMBER
Figure 3. Data from Experiment 1. Initial consistent-mapping learning and reversed consistent-
mapping learning for target and distractor sets taken from the first and second halves of the
alphabet. Memory-set size = 4 J frame size = 2, frame times are shown. Percentage of hits, percentage
of false alarms, and mean reaction time for hits are graphed as a function of trial number. After
2,100 trials the target and distractor sets were switched with each other.
time for hits dropped from about 770 msec alarm probabilities in the first 60 trials of
to about 670 msec). the present study were .56 and .13, respec-
It seems plausible that the subjects adopted tively. The estimated probability of detecting
a controlled search strategy at the start of a single target and the observed probability
training. Some support for this hypothesis is of giving a false alarm when no target was
found in a comparison of the data from this present were .60 and .18, respectively, in the
study with the data from Experiment 3 of VM condition of Experiment 3b of Part I in
Part I (though different subjects were in- which M was equal to 4 and F was equal to 2
volved in the two studies). The hit and false (this condition and the first 60 trials of the
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 133
present study were both run at a frame time planted in long-term memory and if the at-
of 200 msec). These two conditions should tention response occurs automatically, then
give similar results if controlled search is it should prove very difficult for the subject
being utilized in both studies; in fact, the to alter or unlearn his automatic response in
probabilities are quite similar. any short period of time. To test this hy-
It should be noted that the subjects all re- pothesis we reversed the memory ensemble
ported extensive, attention-demanding re- and the distractor set for each subject (and
hearsal of the memory set during about the set the frame time back to 200 msec). We
first 600 trials of Experiment 1, but they hypothesized that automatic detection would
gradually became unaware of rehearsal or prove impossible and that the subject would
other attention-demanding controlled pro- be forced to revert to controlled search.
cessing after this point. Both the objective The results of the reversal were quite
evidence and subjective reports, then, sug- dramatic. The hit rate just after reversal
gest that the subjects used controlled search fell to a level well below that seen at the
at the start of Experiment 1 but gradually start of training when the subjects were com-
shifted to automatic detection. pletely unpracticed. Very gradually there-
The hit rate and false alarm rate after after the hit rate recovered, so that after
1,500 trials appeared to be changing only 2,400 trials of reversal training, performance
slowly, but this could have resulted from a reached a level about equal to that seen after
ceiling effect. The frame time was therefore 1,500 trials of original training. In summary,
reduced to 120 msec for the next 600 trials. the original training resulted in quite strong
At the conclusion of this additional training, negative transfer, rather than positive
the hit rate appears to have reached about transfer.
82%, while the false alarm rate dropped to Subject's verbal reports indicated that a
about 5%. These results may be compared shift back to controlled search occurred after
with the CM results from Experiment 1 of reversal. Initially, before reversal, all sub-
Part I (see Figure 2 ) . In that study, with jects reported rehearsing the memory set
/ = 120 msec, M - 4, and F = 2, the CM hit during each trial. After the 2nd day (600
rate was about 92% and the false alarm rate trials) subjects reported that they were no
(not shown) was about 5%. longer rehearsing and only glanced at the
Thus, the present results indicate that memory set. However, subjects reported that
subjects were utilizing automatic detection after reversal they tried various methods to
by the end of the initial 2,100 trials of CM perform the task and eventually ended up
training. First, they performed at a level not rehearsing the memory-set items again, though
far from that of the Part I/Experiment 1 this rehearsal also decreased after a week of
subjects in the comparable CM condition postreversal practice. This pattern of re-
with the same frame time. Furthermore, the ports could indicate a shift from controlled
present study utilized a larger memory en- search to automatic detection during original
semble and distractor set than the VM con- learning, then a shift after reversal to con-
ditions of Part I/Experiment 1, a fact that trolled search, and then finally a return to
could only have increased the task difficulty. automatic detection.
Second, from the Part I/Experiment 1 VM What might be the cause of the negative
data we can estimate that a frame time of transfer after reversal? If the reversal caused
600-800 msec would have been required to subjects to revert to controlled search, then
achieve a similar level of performance had it might have been expected that perform-
controlled search been used. ance would fall to the level seen at the start
At this stage of the experiment, after 2,100 of original learning (when controlled search
trials of CM training, the subjects could be was presumably utilized). The actual results
expected to have developed a well-learned suggest that controlled search is hindered
attention response to the members of the when the distractors are items that subjects
memory ensemble. If such learning is firmly have been previously trained to respond to as
134 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
CM targets (i.e., items that, due to previous always consisted solely of items from the
training, give rise to automatic-attention distractor set in the CM conditions. Thus a
responses). This hypothesis is verified in subject at the conclusion of the series of
Experiment 4, to be reported later.2 Part I studies who had been searching, say,
Beyond the poor performance at the start for consonants in digit distractors, had never
of reversal learning, negative transfer is also been given a consonant as a distractor:
evidenced by the extremely large amount of Whenever a consonant had appeared, it was
training needed to overcome the effects of orig- a target. These facts naturally suggested a
inal learning. After about 900 trials of original study in which the roles of consonants and
practice the hit rate reached about 90%. digits were reversed for these subjects.
However, it took about 2,100 trials to return
to this performance level after reversal. Ap- 1. Method
parently the necessity to unlearn the at-
The procedure was identical to the multiple-
tention responses to the previous memory-set frame, multiple-target paradigm of Experiment 3a
items, or the necessity to overcome some of Part I (whose results were described in the
learned inhibition to the previous distractors, Introduction) in most respects, except that the
or both, causes a reduction in the rate of memory ensemble and the distractor set were
switched in both the CM and VM conditions. Thus
acquisition of automatic processing after in the CM condition the two subjects who had
reversal. been searching for consonants in digit distractors
It would be interesting to know whether now searched for digits in consonant distractors;
negative transfer would obtain in several in the VM condition these two subjects had been
variations of our paradigm. For example, in searching for digits in digits and now searched for
consonants in consonants. These contingencies were
the present study, the initial training might reversed for the remaining two subjects who had
have caused the memory set to become a previously been searching for digits in consonant
learned category, but probably did not cause distractors in their CM conditions.
categorization of the distractor set. If the The frame time (/) was 60 msec for the CM
conditions and 200 msec for the VM conditions.
distractor set was a well-known category, Just as in Experiment 3 a of Part I, M was equal
would reversal still cause a severe perform- to 2, and F was equal to 2 ; zero targets were pre-
ance drop? Furthermore, it could be argued sented on one fourth of the trials, one target was
that the initial training did not continue long presented on one fourth of the trials and two
enough for the negative transfer to reach full targets were presented on one half of the trials.
When two targets were presented, they were either
strength. The amount of negative transfer identical (II) or nonidentical (NI), and the spac-
might depend on the amount of overtraining ing between them was either 0, 1, 2, or 4. Each
during initial learning. Such a view has been subject was given 12 blocks of 132 trials in each
put forth for certain verbal and animal learn- of the VM and CM conditions. The VM and CM
blocks alternated.
ing situations (see Handler, 1962, 196S; Upon 'Completion of the initial set of 24 blocks,
Jung, 1965 for two views on the subject). the frame time was increased to 120 msec and an
Thus it might be asked whether performance additional 18 blocks of CM trials were run for each
drops would be caused by reversal if the subject.
original automatic detection responses were
greatly overlearned. These two questions are 2
One might hypothesize that the basic compari-
answered by Experiment 3. son rate of controlled search is altered and slowed
after reversal, thereby accounting for the negative
B. The Reversal of Consonants and Digits transfer. Two facts argue against this hypothesis.
for Well-Practiced Subjects Experiment 2 shows that controlled search proceeds
at an unchanged rate even when all items consist
Throughout the series of experiments in of targets from previous CM conditions. Experiment
Part I, the memory ensemble in the CM 4 shows that even one CM target presented in a
condition was always the same, either digits to-be-ignored spatial location greatly reduces detec-
tion of a simultaneously presented target in a to-be-
or consonants, and no member of the CM attended location. This result suggests that reversal
memory ensemble was ever a distractor, even performance is harmed because attention tends to
in the VM conditions. The VM conditions be drawn to the distractors.
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 135
It should be noted that the subjects for this ex- tion of two targets is about 80% prior to
periment had had roughly 20,000 trials of practice reversal and about 20% after reversal. Fur-
in the various CM and VM conditions in earlier
studies (see Footnote 1). thermore, these data represent 12 blocks of
training over which very little, if any, re-
covery was taking place. It is interesting to
2. Results and Discussion
note that all of the subjects predicted that
The results are depicted in Figure 4. A no change in their performance would occur
simple correction for guesses and false alarms after the consonants and digits were reversed,
has been carried out on the raw data, as and the subjects were uniformly startled and
described in Part I, but this correction did even dismayed by the extreme difficulty of
not affect the qualitative features of the re- the reversed task.
sults. The circles in the left-hand panel give These results resolve the confounding be-
the comparable VM prereversal data from tween categorization and the mapping condi-
Experiment 3a of Part I ( / = 200 msec), tions. Even when there is a well-learned and
while the circles in the center panel give the well-practiced categorical difference between
comparable CM prereversal data from Ex- memory ensemble and distractor set, reversal
periment 3c of Part I (/ = ,60 msec); the in the CM paradigm gives rise to marked
triangles in the two left-most panels give the impairment of performance. Thus, categorical
data from the present experiment (see Foot- differences between memory ensemble and
note 1). distractor set cannot be the key factor under-
The most dramatic findings of Experiment lying the utilization of automatic detection.
2 are shown in the middle panel of Figure 4, Rather it is the consistency of mapping that
which gives the CM reversal results. It is underlies the acquisition of automatic detec-
evident that performance drops off markedly tion.
after reversal. Estimated percentage of detec- Consider next the VM data given in the
REVERSAL CONDITIONS
— IUU
u h ' ' ' ' ' D
LU °o Q
UJOT 80
01-
,I-O
llJ
zee
UJ<
60
DU
-
. Nf^.
aA
" A ^^°~
A
i v-:
a! ;J 40
o<
^A__^
*^A—Ti-
£
1-
w
i i i i i
0 ' I 2 4
i i i i
0
i i
1 2 4
i t i i ii i
0 1 2 4
SPACING SPACING SPACING
0 I 01 2 01
NUMBER OF TARGETS
Figure 4. Data from Experiment 2. Probabilities of correctly detecting one target when one was
presented, or two targets when two were presented, as a function of spacing, for each condition.
The data shown have been adjusted to remove effects of guessing and false alarms. The observed
percentage of "none" responses to no-target trials is also shown. Circles show the estimated
percentage of detection prior to reversal (data from Experiment 3, Part I), while triangles indicate
performance after reversal in Experiment 2. The squares in the right-hand panel show post-
reversal detection from Experiment 2 after frame time was increased to 120 msec. The open
symbols and dashed lines indicate the II conditions (identical targets); the closed circles and solid
lines indicate the NI conditions (different targets). (VM = varied mapping; CM = consistent
mapping; f = frame time, in msec.)
136 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
left-hand panel of Figure 4. It is evident that search operates normally, with targets com-
the switch from consonants to digits (or pared in a random order of search, and per-
vice versa for other subjects) had almost formance is equivalent to that seen prior to
no effect. The levels of performance remain the switch of character sets.
the same as in Experiment 3a of Part I, and In short, the use in a search task of stimuli
the qualitative pattern of results is still about that elicit automatic-attention responses (de-
the same, showing the pattern indicative of veloped through previous CM training) can
controlled search. That is, worst perform- (a) improve performance over that seen in
ance occurred at spacing 1, and performance the usual VM conditions if these stimuli are
was worst when the targets were different memory-set items but not distractors, in
(NI). These effects indicate the presence of which case automatic detection will be uti-
controlled search. In summary, a change of lized; (b) leave performance unchanged
the character set used in the VM conditions from that seen in the usual VM conditions if
had no appreciable effect under the present these stimuli are both memory-set items and
conditions. distractor items, in which case normal con-
These results are most interesting, be- trolled search will be utilized; and (c) im-
cause after reversal the memory ensemble pair performance from that seen in the usual
and the distractor set both consisted of VM conditions if these stimuli are distractors
items that had come to elicit automatic- but not memory-set items, in which case
attention responses in previous training. In subjects will utilize controlled search that
the reversal conditions of Experiment 1 we will be reduced in effectiveness by automatic
saw that making the distractor set but not the attending to distractors.
memory ensemble consist of such "automatic" Let us now turn to the results of the final
items resulted in considerable negative trans- reversed CM training blocks. We suspected
fer. In the present experiment it is evident that the subjects in the reversed CM condi-
that performance is not worse after reversal tion (the middle panel) were reverting back
when subjects have been trained to re- to the use of controlled search, but perform-
spond to both the targets and distractors in ance levels were so low that it was difficult
previously CM training. to interpret the pattern of results. Thus
These results might be expected on the frame time in the CM reversal condition was
basis of the following reasoning. Just after increased to 120 msec, and 18 blocks of addi-
reversal in both Experiment 1 and in the CM tional CM training were run.
conditions of Experiment 2 the subjects were The results of the trials with / = 120 msec
using controlled search. In Experiment 1, after are represented by the squares in the right-
reversal, the automatic-attention responses hand panel of Figure 4. It should be noted
to the distractor items tended to draw at- first that the pattern of the results is in-
tention away from target items to which sub- dicative of controlled search in that the worst
jects had not previously been trained to re- performance occurs at spacing 1. However,
spond and hence to order the controlled search the results do not show the target-similarity
in such a way that the distractor items were effect that we have been led to expect (in
compared earlier in the search. Thus per- Experiment 3 of Part I) for controlled search:
formance was impaired. (This hypothesis is Results for conditions using II target pairs
confirmed in Experiment 4.) However, in the are not superior to those using NI target
present experiment, after reversal, all items, pairs. Furthermore, the overall level of per-
distractors and targets alike, gave rise to formance is higher than one would expect
attention responses, which could be expected for the VM conditions. At a frame time of
to cancel each other out on the average. 120 msec performance is higher than that
That is, there should be no selective bias to seen for the VM conditions at a frame time
compare distractors prior to targets since of 200 msec (in the left-hand panel of Fig-
both types of items give rise to equivalent ure 4).
attention responses. Thus, the controlled The results can be explained simply and
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 137
elegantly by the hypothesis that the sub- of memory ensemble and distractor set causes
jects were carrying out a controlled search a great decrement in performance in the CM
for the category as a whole. The Experi- conditions. The CM decrement occurs despite
ment 2 reversal condition differed from the the categorical difference between the mem-
Part I/Experiment 3 VM conditions in that ory-set items and distractor items (letters
the memory-set items and the distractor vs. numbers). It is argued that the subjects,
items in the present instance were cate- after reversal, adopt a controlled search
gorically different (numbers and letters). strategy in which they search for the category
This categorical difference obviously did not as a whole rather than search for the in-
allow the subjects to utilize automatic detec- dividual members of the category.
tion after reversal. However, the categorical The results of Experiments 1 and 2 could
difference could very well have helped the hardly have provided a more dramatic demon-
subjects to adopt a more efficient controlled stration of the qualitative difference between
search. Suppose the subject ignored the automatic detection and controlled search,
specific members of the memory set and and of the dependence of the search mecha-
searched instead for any instance of the cate- nism on the nature of the training procedures
gory "numbers" (or "letters" as the case (i.e., the consistency of the stimulus-response
may be). Then only one comparison would mapping).
be needed for each display item in each The results of Experiments 1 and 2 also
frame, regardless of memory-set size. demonstrate the long-term nature of the
Compared with the case when a categoriza- learning underlying the automatic process.
tion is unavailable, a controlled search for Experiment 1 showed that even original
the category of each input saves search time learning of an automatic response could take
in two ways. First, there is no need for mul- thousands of trials of practice to develop if
tiple memory-set comparisons for each frame a previously known categorization did not
item. Second, there is no need for time- distinguish the memory ensemble and the dis-
consuming switches between memory-set tractor set. Relearning after reversal was
items. Thus performance improves consider- even slower to develop. Experiment 2 showed
ably. Furthermore, the distinction between that relearning after reversal could be very
identical and nonidentical multiple targets is retarded even when the sets were categorized,
no longer meaningful—both targets are sim- as long as the original automatic detection
ply category members, regardless of their process was well overlearned.
physical similarity. Thus no difference be-
tween the II and NI conditions would be
predicted. The results in Figure 4 support C. The Role of Categorization in Controlled
these contentions. Search and in the Development
oj Automaticity
As far as we can tell, through a block-by-
block analysis of the reversal results, the sub- The role played by categorization in con-
jects did not begin to recover any appreciable trolled search and in the development of
degree of automatic detection even after 30 automatic detection is suggested by results
blocks of CM reversal training. Undoubtedly, of the preceding experiments, but it is an
the difficulty in relearning (or unlearning) important enough concept to be examined
is related to the great amount of overtrain- in a separate experiment. Furthermore, the
ing with the original mapping of stimuli to nature of the process by which categories
responses. Eventually, however, were the are learned is also worth studying. These
reversal experiment continued, we would ex- considerations underlay the design of Experi-
pect automatic detection to develop once ment 3. We decided to use new subjects and
again. to train them in two conditions, in neither
In summary, then, the results of Experi- of which were the memory and distractor
ment 2 show that a switch of character sets sets preexperimentally categorized. The first
does not affect VM performance, but a switch condition was designed to induce controlled
138 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
100
S 80
60
40
M=2 M=4
o • Categorical
20 a • Mixed
f = 350
V M Training CM Training
SESSIONS
Figure S. Data from Experiment 3. Estimated percentage of detection of both targets, as a func-
tion of session number, averaged over spacing and target-similarity conditions, for each of the
four main conditions. At Session 25 training was switched to a CM procedure. At Session 30, the
frame time was reduced to 160 msec. (VM = varied mapping; CM = consistent mapping; M =
memory-set size; f = frame time, in msec).
search in circumstances such that a cate- ditions, and two memory-set sizes were used in
gorization could not develop. The second con- different blocks: M = 2 and Af = 4.
In the mixed condition, the members of the
dition was also designed to induce controlled memory set were chosen randomly on each trial from
search, but in circumstances such that a the ensemble of eight consonants; the distractor set
categorization of the memory sets could be consisted of four items randomly chosen from
learned. Both of these conditions utilized a those consonants not used in the memory set. These
varied-mapping procedure to prevent auto- four .distractors were chosen randomly to fill the
nontarget, nonmask, positions in the various frames
matic detection, and both were followed by of the trial. The key feature of the mixed condition
a series of CM trials, during which trials was the fact that memory-set items and distractors
automatic detection developed. Thus, the were randomly intermixed from trial to trial.
CM trials traced the course of development In the categorical .condition, the eight ensemble
items were divided into two sets of four that re-
of automatic detection when a categoriza- mained disjoint throughout the experiment for each
tion was, or was not, present at the start of subject. The sets of four are indicated by the posi-
CM training. tion of the comma in the above listing of the con-
sonants making up each set. Note that the two sets
1. Method of four were chosen to be highly confusable with
each other in the sense that pairs of visually con-
The subjects each took part in two conditions fusable letters were separated into the two sets. On
called mixed and categorical, run in different blocks. a given trial one of these sets of four was chosen
In each condition the memory ensemble and dis- randomly to be the distractor set, and the memory-
tractor set were drawn from a total of eight con- set items were then chosen randomly from the
sonants. The two eight-consonant sets did not over- remaining set. Thus, in the categorical condition
lap. The sets were {GMFP, CNHD} and {RVJZ, there were just two disjoint categories of items,
BWTX}. The assignment of these two sets to the one from which the memory set was drawn, and
two conditions was permuted across subjects. one that served as the distractor set. However, the
Both conditions used a VM procedure similar in category that served as the distractor set varied
certain respects to that of Experiment 2 of this randomly from trial to trial. Thus, 'the procedure
report and Experiment 3 of Part I. The multiple- still involved a varied mapping, but the categories
target, multiple-frame paradigm was used with were never intermixed and could eventually become
/ — 250 msec per frame. Only 12 frames were used well learned.
on each trial, with all targets appearing on frames To make it easier for the subjects to learn the
3 through 10. Frame size was equal to 2 in all con- categories, the members of the memory set pre-
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 139
sented at the start of each trial were always pre- 2. Results and Discussion of the VM
sented in alphabetical order (this was also done in Conditions
the mixed 'condition). In each session there were
four blocks, always run in the following order:
(a) categorical condition, M = 2; (b) categorical
The results are summarized in Figures 5
condition, M = 4; (c) mixed condition, M = 2; (d) and 6. Figure 5 gives the estimated prob-
mixed condition, M = 4. There were 96 trials in ability of detecting both targets when two
each block, 24 with no target, 24 with one target, are present, averaged across target similarity,
and 6 for each of the eight combinations of spacing spacing, subjects and certain combinations of
and target similarity when two targets were
presented. sessions. Performance in all four main con-
Four new subjects, one male and three females, ditions tended to improve during VM train-
took part. Each ran in 24 sessions of about 1 hour ing, but is clearly stable over the last 10
each. This finished the first, varied mapping, phase
of the study. After these 24 sessions the subjects sessions (15-24). Initially the M = 2 con-
should have been using controlled search in both dition is superior to the M — 4 condition in
conditions, but should have learned the categories both the categorical and mixed cases.
in the categorical condition. Before turning to the
CM procedure used in subsequent sessions, we shall After training, however, the pattern changes
discuss the VM results. considerably. In the categorical conditions
100
80
o> II NI
o—& M = 2
2 60
Categorical, VM
c i i
o
u 100
*-
0>
lu
•a
c
O
oo
<u
o
w
a>
a.
O
a>
73
a>
o
E
O
-t>
O
ro
Mixed, VM
0 1 0 1 2 4
i Spacing of two t a r g e t s |
Number o f t a r g e t s
Figure 6. Data from Experiment 3. Estimated percentage of detection as a function of number,
spacing, and similarity of targets for VM Sessions 19-23 for all conditions. (VM = varied mapping;
M = memory-set size; II = identical targets; NI = nonidentical targets.)
140 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
the results of the M — 2 and M = 4 condi- dition. This reasoning explains why the cate-
tions converge (indicating category learning gorical conditions are both superior to the
and the use of a categorical search strategy). best mixed condition, which uses a memory-
In the mixed condition performance in the set size of 2.
M = 4 condition remains much lower than in The category hypothesis, however, sug-
the M = 2 condition, but performance in gests that target similarity should not matter,
both of these is worse than in either cate- so that the II and NI functions should be
gorical condition (suggesting that the pres- identical. To explain the spacing 0 results
ence of well-known categories reduces the without abandoning the category model, we
effective memory-set size to 1). suggest the following hypothesis. Suppose
To ascertain whether controlled search was that when a subject locates a target category,
being utilized in these various VM conditions he or she briefly switches to an item mode,
and to determine the nature of that search, it perhaps to check that the input is truly a
is necessary to consider the spacing functions category member. If the second item in the
shown in Figure 6. This figure gives the frame is identical, then it will be found in an
spacing functions for Sessions 19-23, when item-comparison mode, but if nonidentical, it
performance had stabilized, averaged across will be located only if the subject reverts
subjects. For both M = 2 and M = 4 con- quickly enough to a categorical-comparison
ditions, the mixed condition shows the pat- mode. By the next frame, the reversion to a
tern we have come to expect for controlled categorical mode is complete, so the various
search, with performance worse at spacing 1 functions tend to converge. Why would sub-
and worse for different targets (NI). jects tend to switch to an item mode? The
Performance in the categorical condition categories were constructed so as to be ex-
was equivalent when M = 2 and M — 4. tremely confusable with each other. Perhaps
The usual performance impairment at spacing category encoding is learned under these cir-
1 occurs for identical targets (II), but when cumstances but remains somewhat error
the two targets differ (NI), performance in prone. Then it would be logical to check any
the spacing 0 condition is greatly impaired, target category by using an item mode.
a result not seen in previous VM conditions. Whatever the explanation for the details of
Furthermore, the II and NI conditions show the spacing function, the data as a whole
only a small difference except at spacing 0. make a strong case that categories have been
The results from the mixed condition con- learned in the categorical condition during
form to the pattern expected for controlled VM training, and that the presence of cate-
search in three respects; When M = 4 per- gories in a VM situation allows the subject
formance is worse than when M — 2; there to adopt a simpler and more efficient form of
is a performance reduction at spacing 1; and controlled search.
performance is worse for NI than II con- The argument that controlled search is
ditions. operating in these conditions would be sub-
The results from the categorical condition stantiated if it could be demonstrated that
were unexpected in some respects but are performance improves when the subjects
explicable in terms of an hypothesis that switched to CM training. We turn next to
categories were learned and utilized in con- the CM procedure.
trolled search. Let us assume that the pres-
ence of a known category allows one to com- 3. Method
pare the category of the memory set to the Following the 24th session of VM training, a CM
category of any given display item in a single procedure was initiated. In the categorical condi-
operation. Then the M = 2 conditions should tion, one of the categories was fixed thenceforth
not differ from the M — 4 conditions, and in as the memory ensemble, and the other category
fact both conditions should show performance was fixed thereafter as the distractor set. In the
mixed condition, a set of 4 was chosen and was
levels equivalent to those expected for a fixed thereafter as the memory ensemble—the re-
memory-set size of 1 in a normal mixed con- maining 4 items were fixed thereafter as the dis-
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 141
tractor set. The two sets of 4 used in the mixed the first four sessions of CM training). In
condition were those indicated by the position ofother words, the learning of a category is
the comma in the listing of the two sets of eight
consonants: {GMFP, CNHD} and (RVJZ, BWTX}. apparently not a necessary prerequisite for
In other words, these were the same sets that hadthe development of automatic detection.
been used as categories in the categorical conditions
Figure 7 shows the spacing functions for
for other subjects. This CM training procedure was
the six sessions (30-35) of CM training run
identical for both conditions and followed in other
respects the procedures used in the preceding VM at a 160-msec frame time. The results are
conditions. Each subject was given at least 10 averaged over subjects and sessions. In ad-
sessions of VM training. The frame time after CM dition the M — 2 and M = 4 conditions are
Session 4 (Session 24 overall) was reduced to 160lumped together since they did not differ.
msec and after CM Session 11 (Session 35 overall)Somewhat to our surprise the pattern is al-
was .reduced again to 120 msec (at the time of this
writing this study had not been completed). most identical to that seen for the categorical
condition during the latter stages of VM
training, except that the performance level
4. Results and Discussion of the CM is much higher. The comparable pattern from
Conditions the CM conditions of Part I/Experiment 3
The changes over sessions following the was quite different: Detection was about
switch to CM training are depicted in Figure equal in all conditions except for a depressed
5. Consider first the results of Sessions 25 detection rate when spacing was 0 and when
through 28, which immediately followed the the targets were identical. At spacing 0 the
switch to CM training. Quite clearly, a very present data clearly show performance to be
rapid and dramatic rise in performance took higher when the targets were identical.
place, and furthermore, the results of the Disregarding the shape of the spacing
mixed and categorical conditions tend to functions, the large improvement in perform-
converge. By Session 28 the performance ance when CM training commenced does sug-
level in the mixed condition had risen to over gest that the subjects were learning auto-
90% and in the categorical condition had matic detection. An hypothesis that reconciles
risen to over 95%. Since the subjects were these facts suggests that the subjects were
clearly approaching ceiling, the frame time using automatic detection to locate the first
was then reduced to 160 msec and CM train- target, and were then reverting to controlled
ing continued for six additional sessions. search to check the accuracy of the target
Note that performance was still improving detection. Some time may be lost before the
and that the mixed and categorical, and M subjects revert to automatic detection. Thus
= 2 and M = 4, conditions effectively con- the advantage of II at spacing 0 would be
verged. due to the temporary use of a controlled
It is interesting to note that during VM search. This hypothesis is similar to that
training, 20-25 sessions were necessary before used to explain the categorical results in the
the M = 2 and M = 4 performance levels be- VM conditions (Figure 6). In both cases, a
came equal in the categorical condition. Thus tendency to recheck the first detected item
one might tentatively conclude that category may have led to an alteration in search
encoding in the present experimental context strategy. The reason may be the same in both
required 20-25 sessions to develop. Thus cases: Rechecking of located targets may
category encoding would be unlikely to de- have been induced by the stimulus sets,
velop for the mixed condition in just four which were chosen so that the memory set
CM sessions. Yet after only four sessions of and distractor set were maximally visually
CM training, the performance level in the confusable. Thus, both automatic detection
mixed condition rose dramatically and ap- and category encoding may have been some-
proached that of the categorical condition. This what error prone, thereby requiring recheck-
result suggests that automatic detection de- ing in an item mode. We are currently ex-
veloped in the mixed condition in the absence ploring this hypothesis.
of a well-learned category (at least during In summary, a number of conclusions can
142 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
I/) 1 \J\J
g
«- **
c . ° ^x ^p=Z^^ -
i- 80-
d0
•So
- .^.^
^^.~-^"m
Estimate
IINI
detection
O
cr>
Categorical ~
a— •* Mixed C M , f = 160MSEC
i i i i i i
O
-P*
0 1 0 1 2 4
i S p a c i n g of two t a r g e t s t
N umber of t a r g e t s
Figure 7. Data from Experiment 3. Estimated percentage of detection as a function of number,
spacing, and similarity of targets for CM Sessions 30-35, averaged over memory-set size (which
had no effect) for all conditions. (CM — consistent mapping; f = frame time; II = identical targets;
NI = nonidentical targets.)
be drawn from the results of Experiment 3. category. Verbal labels are one common class
First, arbitrary collections of characters can of categories, but other types of categories
be learned as categories. For this learning to might also exist. For example, a visual symbol
occur, it is necessary that the categories re- might represent several objects. Of course, a
main consistently denned across trials, but category in general need not exist in a form
not necessary that the same category re- similar to any of the sensory modalities. A
main the target set across trials. Second, the completely abstract node could represent two
learning of a category in a VM search para- or more objects in memory.
digm enables the subject to adopt a more With respect to search tasks, the import-
efficient controlled search, one in which the ance of categories is clear. When all of the
category of the memory set may be com- members of a memory set are members of
pared in a single operation to the category the same category and no distractors are in
of a display item, thus eliminating the effects that category, then a controlled search can
of variations in memory-set size. Third, a bypass the individual memory-set items and
switch to a CM training procedure results utilize comparisons that involve matching the
in fairly rapid acquisition of automatic de- single category against the category of each
tection, with a concomitant improvement in display item. For category search to be uti-
performance for both the categorical and non- lized effectively, however, the category must
categorical stimuli. These conclusions sup- be learned well enough that each displayed
port those suggested by the results of Ex- element will be encoded automatically and
periment 2 in all important respects. consistently according to its category. Of
course other features, such as the element's
D. What is a Category? A Discussion and name and shape, will also be encoded, but
Selective Review only the category feature is necessary for a
category search. We suggest that the category
1. Some Hypotheses Concerning Categories
coding must be automatic, because if it were
Experiments 2 and 3 showed how search not, a search of long-term memory would
benefits from the learning of a categorical have to be carried out to identify the cate-
distinction between memory ensemble and gory; the time taken for such a long-term
distractor set. But what is a category? Most search would almost certainly wipe out any
generally, we may define any object in mem- gains that are due to the reduction in num-
ory that refers to (stands for, consists of) ber of comparisons allowed by the category
any two or more objects in memory as a search (at least for small memory-set sizes).
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 143
Note that automatic category encoding is members to develop so quickly that advan-
not the same as automatic detection. Auto- tages due to a category response were mini-
matic detection refers to the case when a mized.
stimulus gives rise to an automatic-attention It is conceptually important to keep in
response that bypasses the need for a serial mind the possibility that automatic responses
search through either memory or the display. might be attached to different stages of en-
Automatic category encoding refers to the coding of a single stimulus. In particular,
case when the stimulus gives rise auto- when an automatic-attention response de-
matically to a node representing the category velops for a category, other attention re-
of the stimulus. However, without an at- sponses may develop at a different rate for
tention response attached to the stimulus the stimuli making up that category. Thus,
node or the category node, the presence of if a categorization is available at the start of
the category would have to be deduced CM training, the attention responses to the
through a serial search of the displayed category might develop sooner than atten-
elements. tion responses to some or all of the individual
Our studies have shown that an automatic stimuli in the category. This effect would be
category encoding of arbitrary collections of expected since any one stimulus would appear
characters (consonants, in the case of Experi- only occasionally as a target, whereas every
ment 3) can develop. The cause of the learn- target would be an instance of the category.
ing is not yet clear. Some subjects in early On the other hand, in the cases in which the
versions of Experiment 3 never showed any memory ensemble does not form a category
evidence of categorical search (usually sub- at the start of CM training, there might be
jects with low levels of performance). It may at least some individual stimuli that come
be that these subjects did not notice the to elicit automatic-attention responses while
categorical nature of the memory ensemble the category is still being learned.
and hence could not develop a well-learned Once an automatic-attention or encoding
category node in memory. Alternatively, response develops, we assume that it is no
these subjects could have developed an ap- longer under control of the subject and will
propriate category node but have failed to occur whenever its corresponding stimulus is
use this node to facilitate their search. presented. This assumption will be supported
There are several lines of evidence sug- by Experiment 4. However, when a cate-
gesting that automatic detection may develop gorial encoding (but not an attention re-
faster if a categorization is available at the sponse) is learned, it will not necessarily be
start of consistent training. For example, it utilized by the subject during a controlled
is easy to learn to search for a set of stimuli search. We suppose a category response will
defined by a simple physical feature (see facilitate controlled search only if the subject
Neisser, 1963, 1967, and studies in the next both notices the category and also decides to
section). The CM results in Figure 5 show that alter his search to compare the category
the categorical condition retains some superi- rather than the individual stimuli. These
ority over the mixed condition for about eight considerations would no longer apply if an
sessions (25-32) but the magnitude of the automatic-attention response were learned in
difference is surprisingly small. It is possible response to the category; in such a case sub-
that the high confusability between the two ject strategy would not matter since attention
categories of Experiment 3 makes the cate- would be directed to the category in any
gories and the automatic response difficult to event.
learn. If so, more training prior to the CM
phase could possibly have led to a larger dif- 2. Search Studies Using Categories
ference in the rate of development for the
two groups. Alternatively (or additionally) In a number of memory-search studies (in
the small size of these categories may have these studies F = 1 and M varies), the mem-
allowed automatic responses to the individual ory set consists of several categories, and
144 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
the distractors might be drawn from these the use of some sort of controlled search that
same categories or from different (unknown) is facilitated by the presence of categories in
categories. The results of these studies all the memory set.3
demonstrate the use of some sort of con- The final studies to be considered are those
trolled search, perhaps in conjunction with visual search studies in which M = 1 and F
simultaneous automatic detection, that is varies, but in which the visually presented
facilitated by the presence of categories. items fall in two or more categories, one of
For example, Naus, Glucksberg, and Orn- which matches the category of the memory-
stein (1972) and Naus (1974) used memory
sets consisting of words from several cate- 3
We shall not attempt at this time to explain
gories and a distractor set made up of words why a particular controlled search strategy is adopted
from these same categories. The results in a particular paradigm. Procedural differences
showed linear memory-set-size functions, among the studies would make such explanations
parallel for negative and positive display pure guesswork. Another class of studies using
categories also deserves mention for completeness,
items. However, there was a slope reduction but is not discussed in the main text because of a
when the number of categories increased. The difficulty in ascertaining whether the effects found
authors suggested that categories were suc- in these studies were due to automatic or controlled
cessively searched in random order, each in processing. These are visual search studies (M = 1,
F varied) that have used memory ensembles and
serial, exhaustive fashion, until the category distractor sets from different categories. They have
of the displayed item was reached. When the shown either flat set-size functions or curvilinear set-
search of this category was completed, a size functions with reduced slopes (e.g., Brand,
response was initiated. Homa (1973) pre- 1971; Ingling, 1972; Jonides & Gleitman, 1972,
sented a related paradigm; his results sug- 1976). These studies have utilized CM designs with
low practice levels; this makes it difficult to ascertain
gested that a serial, exhaustive search of whether the slope reductions are due to the presence
categories was followed by a serial, exhaustive of automatic detection, to beneficial effects of cate-
search within the category of the displayed gorization on controlled search, or to some combina-
item. Williams (1971), who presented an- tion of these factors.
other related paradigm, also argued that a Note that the slope reductions for visual set-size
functions normally should be taken as evidence of
serial, exhaustive search of categories was automatic detection. That is, in a controlled search
followed by a serial search within the cate- one would expect a serial comparison process to
gory. Okada and Burrows (1973) used mul- be needed to compare the memorized category
tiple categories in the memory set and some- against the category of each displayed item. How-
ever, it was seen in our VM 'data from Part I/
times cued the subject as to the category of Experiment 2 that a reduction of slope occurred
the displayed item in advance of the trial. when M was equal to 1 and F varied, and we sug-
Their results showed that the search could gested that a "controlled parallel search" might be
be restricted to the cued category. Clifton responsible for this result. A similar argument might
and Gutschera (1971) used memory sets con- be made to explain the slope reductions in the above
visual search studies. We do not particularly favor
sisting of two-digit numbers and showed that such an explanation because it does not explain
on some trials the subjects would first com- why normal set-size functions are found in visual
pare the 10s digits and then would compare search studies that do not use categories (e.g.,
the Is digit only if a 10s digit match was Atkinson, Holmbren, & Juola, 1969).
A finding related to those from studies using
found. Lively and Sanford (1972) used a categories in visual search is that of De Rosa and
memory set from one category and a dis- Morin (1970) in a memory search study. They
tractor set consisting of some members of showed that when the memory set consisted of
that category and other members outside consecutive digits, then both the positive and nega-
that category. Negative items outside the tive reaction times increased as a function of the
numerical closeness of the displayed item to the
category of the memory set showed a set-size boundaries of the consecutive memory set. The CM
function with reduced slope. procedure was used with small amounts of practice,
The various findings in these memory- so it is again difficult to ascertain whether such
effects were due to the action of automatic detection
search studies differ in many details that will or to some controlled search facilitation allowed by
not be discussed here. They all tend to show the categorical character of the memory set.
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 145
set item. Smith (1962) and Green and Ander- tion deficits in these paradigms are due to
son (1956) carried out similar studies in which the limitations of controlled processes, in
a field of colored two-digit numbers was pre- particular, to the limited rate of short-term
sented and the subject was asked to search search.
for a particular two-digit number in a par- Thus, all our comments and conclusions
ticular color. The results showed that search concerning search and detection apply equally
rate depended much more on the number of well to attention studies. In particular, CM
items in the designated color than on the training should lead to the development of
total number of items presented, though both automatic detection that shoulld bypass
effects were present. A VM procedure was divided-attention limitations, while VM
utilized in their studies so that automatic training should cause controlled search that
detection could not have been used to restrict should severely limit the ability to divide
search to the items of a particular color. attention.
Probably a two-phase controlled search was In discussing the development of automatic
utilized in this situation. First, a rather fast detection we have proposed that an auto-
serial search was probably carried out to matic-attention response is learned in re-
locate the next item of the designated color; sponse to the unchanging members of the
then a slow comparison of that two-digit memory ensemble. The present study will
number was probably made to determine test this hypothesis. The test (Experiment
whether it was the target. This explanation is 4d) will entail asking the subject to ignore
supported by the fact that searching took a certain locations and then inserting in those
long time (up to 20 sec), so that a 30-40 msec locations items that subjects had been pre-
search rate for color could have accounted viously trained to respond to as CM targets
for the increases in response time as total (to see whether these targets attract atten-
size increased. These increases were small tion).
only in comparison with the larger within- These studies will also answer the following
category effects. question: To what degree can the subject
The main conclusion to be drawn from the focus attention on a specified subset of the
studies reported in this section, based on re- inputs without distraction from the remain-
peated findings, is that the presence of cate- ing (irrelevant) inputs. Such studies are
gories in search tasks can be used to facilitate, usually termed focused-attention studies, in
benefit, and modify controlled search. contrast to the studies of Part I and Experi-
ments 1 to 3 of the present article, studies
III. Experiments on Focused Attention that would be appropriately termed divided-
In part I we made the case that the attention studies,
processes utilized in attention experiments and
those utilized in search and detection experi- A. Terminology
ments are often the same. In fact, in many There is a problem of terminology in the
cases it is purely arbitrary whether a given studies of this section that is best solved by
study is referred to as an "attention,"
the introduction of the following definitions:
"search," or "detection" study. Experiment 1
in Part I could be described as a divided- 1. A foil refers to any input that appears in
attention study in which attention had to a to-be-ignored display location, whereas a
be divided among M memory-set items and distractor refers to a nontarget that appears
F frame items during each frame. The results in to-be-attended display location.
showed tremendous deficits in dividing at- 2. A CM foil is a foil that has previously
tention in the VM conditions, and virtually been used in CM training as a memory-set
no deficit in dividing attention in the CM item.
conditions. The results of Experiment 2 of 3. A CM target foil is a CM foil that would
Part I, and the fit of the quantitative model have been a target requiring a positive re-
to both studies, showed that divided-atten- sponse if it had appeared in a to-be-attended
146 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
the target foil, when the target foil is iden- inadvertently checked in addition. In this
tical (FIX) and nonidentical (FNI) to the event, target foils would occasionally be
target. The correct rejection rate when no noticed and might harm subsequent detection
target was present was 91%. The results may in a fashion similar to that caused by true
be summarized briefly: The target foil had target detection. Of course, this hypothesis
no effect on the hit rate except when it was is speculative and other explanations are
different from the target (FNI) and preceded undoubtedly available.
the target by 1 frame, in which case detec- Whatever the explanations for the details
tion probability was decreased by .11. of the results, it is safe to summarize the
In the present study a target foil reduced results of Experiments 4a and 4b as follows:
performance when it preceded, but not when Subjects are able to control their search in
it followed, the target, Suppose that this re- varied-mapping paradigms. The degree of
duction is due to the same factors that pro- their control is sufficient to eliminate any
duce decrements in multiple-target detection distracting effect of nontarget foils and to
in VM conditions (Experiment 3, Part I). reduce the distracting effect of target foils
Then one target probably inhibits detection well below the level caused by other targets.
of a subsequent target, but not of an anteced- Thus, attention focusing is quite successful:
ent target. That is, the reduced detection at VM foils have, at most, a small distracting
spacing 1 seen in the VM conditions of Part effect when they appear in to-be-ignored
I/Experiment 3 and also seen in Figure 4 locations during controlled search.
and 6 of the present paper, could have been
caused by either of the two targets' reducing D. The Distracting Effect of "Targets"
detection of the other. The present results During CM Search
provide some reason to argue that it is the
first of two successive targets that reduces Experiments 4a and 4b examined the
detection of the second. ability to focus attention during controlled
Let us assume that the decrement in per- search. We next ask: To what degree does
centage of detection of two targets in VM attention focusing affect automatic detec-
paradigms is in fact a decrement in detection tion? Our previous studies have demonstrated
of the second target. Then it is possible to that automatic detection is not affected by
compare the magnitude of the decrements in frame size, so there would be no point in
the multiple-target studies (e.g., Experi- carrying out a CM counterpart of Experiment
ment 3, Part I) and the present study. In 4a in which the invalid diagonal would con-
fact, the decrements are much larger in the tain distractors only. We decided to carry out
earlier studies in which all display locations a counterpart to Experiment 4b to find out
were valid. For example, the difference in whether a CM target foil would interfere
detection between spacing 1 and spacing 4 with automatic target detection on the valid
in Part I/Experiment 3 was 30% in the NI diagonal.
condition and 8% in the II condition; the M and F were set equal to 2. During each
comparable decrements in the present study frame, each diagonal contained one mask and
are \\% and 0%. Thus a target foil reduces one character. One diagonal was always
detection of a target in the next frame, but valid, the other invalid. On every trial a CM
the reduction is much smaller than that target foil appeared on the invalid diagonal
caused by an actual target. in exactly one frame. A CM target appeared
How can the present findings be reconciled on the valid diagonal on two-thirds of the
with those of Experiment 4a in which foils trials; on one-half of these trials the target
did not impair performance? The pattern of and target foil were identical (FII) and on
findings would be explicable if the valid one-half, nonidentical (FNI). In each of
diagonal were always searched first and then, these cases, the target foil occurred either
whenever that search finished early, one or four frames before, in the same frame with,
more characters on the invalid diagonal were or four frames after the target (-J probability
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 149
Session 4
It is interesting to compare these findings 30-msec frame time 71.5
to those of Eriksen and Eriksen (1974). In FII -4 77.3
FNI -4 78.9
our terminology, they used a single-frame FII 0 80.3
CM task with M — 2, F = 1, and they used FNI 0 81.0
FII +4 78.7
reaction time as the dependent measure. FNI +4 78.2
However, the distractor set and the memory Note. FII = trial in which two identical memory-set items
ensemble were both of size 2 and were con- appear, one in a to-be-attended display position, and one in a
to-be-ignored display position. FNI •= trial in which two dif-
sistently mapped across trials, so that an ferent memory-set items appear, one in a to-be-attended posi-
equally strong but opposite tendency to auto- tion, and one in a to-be-ignored position.
matically respond to the members of the two slowing (since they automatically produced a
sets was probably learned. (In our CM competing response); flanking characters
studies, only the memory-set items can come that were not distractors or memory-set
to elicit automatic responses, because dis- items produced a moderate slowing (regard-
tractors are presented on every trial. The
only exception in our studies occurred when 4
F = 1 in the single-frame paradigm of Part I, The results in Table 2 give some indications
that the disrupting effect may change with practice.
and even then the distractor set was always Over the four sessions, the differences between FII
larger than the memory set.) In the Eriksen and FNI at spacing 0 was, respectively, 6.2%, 3.1%,
and Eriksen study, the relevant item always 3.1%, and .7%. If this trend actually exists, it may
appeared directly above the fixation point, be caused by the development of a new automatic
but three invalid items were placed on each response that restricts search to the valid diagonal.
That is, position-specific information might govern
side of the relevant item. The nature and the automatic-attention response; such an auto-
distance of these invalid items were varied. matic process could be learned because the valid
When the distance to the nearest item reached diagonal never changes over trials. It would be
1 ° of visual angle, the invalid characters had interesting in future investigations to compare per-
formance in this condition with performance in a
little effect on reaction time. At closer dis- condition that alternates the valid diagonal from
tances, all reaction times were slowed, but trial to trial, since alternation should prevent the
flanking distractors produced the greatest long-term learning of position-specific encoding.
150 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
lower than the 84% base rate. When the The primary finding of the present study
target and CM foil were presented simul- is the demonstration that the responses to
taneously, the hit rate dropped to 62%. And CM targets are both well learned and auto-
when the CM foil followed the target by one matic. CM targets cannot be ignored, even
frame the hit rate rose to 77%, a figure still when they are known to be irrelevant and
significantly lower statistically than the 84% occur in consistently invalid display loca-
hit rate when no CM foil was present. In tions and even when subjects are instructed
short, a CM foil provides little hindrance to to ignore them.
the detection of a VM target presented in Our results, of course, do not establish
the following frame, greatly hinders detection that it is impossible to control the detection
of a VM target presented simultaneously, and processes that we have labeled "automatic."
even reduces slightly the detection of a VM In fact, Sperling (Note 1, Note 2) has re-
target presented in the preceding frame. ported some findings that do tend to show
The simplest interpretation of these re- that at least some amount of control over
sults is that a CM foil interrupts the ongoing some aspects of the search process in CM
controlled search and causes a loss of process- multiple-frame tasks is possible. However,
ing time, thereby reducing target detection. the degree of control over, and the ability to
The interruption is caused by what we have ignore, stimuli that subjects have been
termed an automatic-attention response to trained to see as targets in CM situations
the CM foil. We have argued that subjects are clearly much less than in VM situations.
have been trained to respond to CM targets Thus, for the processing mode in the CM
with an automatic-attention response. Even conditions of the present tasks, we prefer
when such a target appears on a to-be-ignored the possibly slightly inaccurate, but de-
display diagonal, it apparently causes an at- scriptive, term automatic to the logically
tention response that interrupts processing more accurate, but less descriptive, term
along the valid diagonal and directs attention systemic, which was used in Shiffrin (197Sa)
to the invalid diagonal. The time lost before to refer to the same type of processing.
attention is returned to the valid diagonal
and the search is resumed causes a consider- F. Limitations on the Ability to
able decrement in performance if the target Focus Attention
is in fact on the valid diagonal during that
frame. The present series of studies (4a-4d) ex-
Since target detection in a frame following amined the subjects' ability to focus atten-
a CM foil is not hindered, the recovery from tion. Experiment 4a showed that attention
the distraction caused by a CM foil must be during controlled search could be focused on
quite rapid; that is, recovery must take place specified spatial locations. Experiment 4b
in a time period under 200 msec. Further- showed that the focusing is not complete,
more, the automatic-attention response must because certain types of stimuli in to-be-
occur fairly rapidly, since a CM foil even ignored locations are sometimes processed
reduces detection of a target in the preceding and when processed, can reduce target detec-
frame. Perhaps the processing of one frame tion. Experiment 4c showed that a CM
occasionally overlaps the start of the next; target in a to-be-ignored location hinders
for example, a comparison initiated but not automatic deletion by a small amount. Ex-
completed before termination of a frame may periment 4d showed that attention is greatly
be completed before controlled processing of distracted from the relevant locations when
the next frame begins. Occasionally, a target a CM target appears in a to-be-ignored loca-
might be the character undergoing compari- tion during a VM search task. This last find-
son when the subsequent frame begins, and ing is particularly interesting because the
a CM foil in that next frame could interrupt distracting stimulus differs from the relevant
comparison of the target and thereby impair stimuli not only in spatial location but also
performance. in category.
152 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
These findings are particularly important messages barely harm shadowing performance
because previous demonstrations of difficul- when they are distinguishable by an obvious
ties in focusing attention have usually utilized physical characteristic (such as spatial origin;
incompatible responses. That is, a stimulus see Cherry, 1953; Cherry & Taylor, 1954).
in response to which a subject has been trained These studies are probably like our Experi-
to emit response x is presented in a to-be- ment 4a in which one diagonal is relevant
ignored location in conjunction with a stimulus during VM search and the other can be
requiring a response y that is incompatible ignored. A result related to Experiment 4b
with x. In the Stroop Color-Word Inter- can be found in studies by Treisman (1964a,
ference Test (1935), for example, the sub- 1964c) in which items of some possible
ject must read names of colors that are relevance are presented in the nonshadowed
printed in ink colors that do not correspond ear (similar to our target foils). For example,
to the names. Reading speed is greatly slowed if the distracting message is in the same
in this case (see Jensen & Rohwer, 1966). voice, but in French, a considerable distrac-
Keele (1972) showed that decrements in tion occurs. Many studies have shown that
processing stimuli resulted when a similar the message in the nonshadowed ear is an-
color-name incompatibility was present. The alyzed to some considerable depth (e.g.,
Eriksen and Eriksen (1974) study discussed Lewis, 1970; Treisman, 1960, 1964b). In
above also showed that response time was these studies it is not clear which type of
dependent on the compatibility of responses processing causes the analysis, but in other
between stimuli to be attended to and those studies it is clear that automatic processing
to be ignored. is responsible for the analysis that occurs
Many other examples of this kind can be (Corteen & Wood, 1972; Von Wright, Ander-
found in the literature, but our demonstra- son, & Stenman, 1975).
tions differ in one very important respect. Further reviews of such studies will not
Namely, the distracting stimulus in our stud- be undertaken here because of the difficulty
ies hurts performance even though subjects in ascertaining whether focused-attention
have been trained to respond to it in exactly deficits are caused by controlled processing of
the same way as to the target stimulus. Both invalid items or by automatic processing of
types of studies demonstrate that the to-be- invalid items.5 If nothing else, our results
ignored stimulus receives processing, but the strongly suggest that future research on
studies utilizing incompatible responses do focused attention should include controls to
not demonstrate any processing interaction differentiate deficits caused by controlled and
short of the motor responses themselves. On automatic processing.
the other hand, our studies suggest that the In summary, we have seen that subjects
focused-attention deficit results from a di- can divide attention almost without deficit
version of attention and an accompanying when automatic detection is utilized (Part I/
loss of processing time, since there is no re-
sponse incompatibility. 6
Our studies have shown that focused-at- Controlled processing might be given to oc-
casional inputs in an irrelevant ear, because ear of
tention deficits can arise due to distraction origin is a fairly difficult cue to utilize (compared
by either CM or VM items in appropriate with spatial location in a visual task, for example).
contexts, though probably due to rather dif- Whether automatic processing or controlled proc-
ferent mechanisms in the two cases. In many essing is used to restrict analysis to the desired ear,
studies in the literature that have shown occasional failures of selection are to be expected,
failures that will cause occasional items in the to-be-
focused-attention deficits, it is not clear which ignored ear to be given controlled processing. In
factor is responsible. For example, in shadow- addition, 'Controlled processing of items in the to-be-
ing tasks the listener repeats aloud a desig- ignored ear may occur on occasion when processing
nated message (usually in one ear) while of the items in the valid ear is completed and a
brief interval occurs before the next valid input
other distracting messages are also presented. arrives (an argument similar to this is used in the
The early studies showed that distracting discussion of Experiment 4b).
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 153
as targets, with other new items as distrac- complex interrelations among nodes. An in-
tors, or with the previously trained VM dividual node may consist of a complex set
items as distractors. The old VM distractors of information elements, including associa-
should severely hinder performance if they tive connections, programs for responses or
elicit attention responses developed during actions, and directions for other types of
previous training. Other similar tests could information processing. What then sets off
also be carried out, but a definitive answer one node from a group of nodes? One node
is not yet available at the time of this writing. is distinguishable from a group of nodes be-
cause it is unitized, that is, when any of its
IV. A Framework for Information Processing elements are activated (i.e., placed in STS),
all of them are activated. One activated node
This section of the paper will be organized
may of course activate another node, but it
as follows. Section A will present an overview
does not do so in all situations, only when the
of the system and an overview of the role of
context or the state of the information-
controlled and automatic processing. Section
processing system is appropriate.
B will elaborate on the fundamental nature
of controlled and automatic processing. Sec-
tion C will present a framework for search, 2. Structural Levels
detection, and attention. Section D will dis-
cuss how automatic and controlled processing It seems likely that the structure of long-
are utilized in memory storage and retrieval. term store, at least in part, is arranged in
levels (perhaps sometimes in a hierarchical
A. An Outline of a General Theory tree structure). By levels we refer to a tem-
poral directionality of processing such that
Memory is conceived to be a large and certain nodes activate other nodes but not
permanent collection of nodes, which become vice versa. In sensory processing, there is a
complexly and increasingly interassociated tendency for increasing information reduction
and interrelated through learning. Most of as successive levels are activated. Thus, a
these nodes are normally passive and inactive visually presented word could first be pro-
and termed long-term store, or LTS, when in cessed as a pattern of contrast regions, colors,
the inactive state. The set of currently ac- regular variations, and so forth; then lines,
tivated nodes is termed short-term store, or angles, and other similar features could be
STS. LTS is thus a permanent, passive repos- activated; then letters and letter names and
itory for information. STS is a temporary verbal or articulatory codes; then the word's
state; information in STS is said to be lost verbal code; and finally, the meaning and
or forgotten when it reverts from an active semantic correlates of the word. This se-
to an inactive phase. Control of the informa- quence is meant as an example, and we do
tion-processing system is carried out through not wish to imply that these are the relevant
a manipulation of the flow of information features, that this is the only possible order-
into and out of STS. These control processes ing, or that this listing is exhaustive. Such
include decisions of all sorts, rehearsal, cod- a sequence of feature encoding should occur
ing, and search of short- and long-term store. automatically to a normally skilled reader.
LTS contains learned sequences of informa-
tion processing which may be initiated by a
control process or by environmental or in- 3. Automatic Processes
ternal information input, but are then ex- An automatic process can be defined within
ecuted automatically with few demands on this system as a sequence of nodes that nearly
the capacity of STS. always becomes active in response to a par-
1. Long-Term Structure ticular input configuration, where the inputs
may be externally or internally generated and
The structure of the nodes making up LTS include the general situational context, and
will be treated as a very general graph with where the sequence is activated without the
156 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
necessity of active control or attention by or intensities even these features will not be
the subject. activated.
An automatic sequence differs from a single Although we have described automatic
node because it is not necessarily unitized. processes as largely beyond subject control,
The same nodes may appear in different some indirect control is possible through ma-
automatic sequences, depending on the con- nipulation of the activation threshold for auto-
text. For example, a red light might elicit matic processes. In particular, according to
a braking response when the perceiver is in what we shall call the contextual hypothesis,
a car, and elicit a walking, halting, or traffic- the activation threshold can be lowered by
scanning response when the perceiver is a the inclusion of information in STS (at the
pedestrian. time of presentation of the activating input)
Since an automatic process utilizes a rela- that is associatively related to the nodes
tively permanent set of associative connec- making up the automatic sequence.6
tions in long-term store, most automatic Note that a lowering of the threshold does
processes will require an appreciable amount not imply that the quality of processing is
of training to develop fully. Furthermore, improved. One result of threshold lowering
once learned, an automatic process will be is that the automatic process will be triggered
difficult to suppress or to alter. by inputs that normally would and should
When an automatic sequence is initiated, not do so. For example, the feature "horse"
its nodes are activated and hence the as- might be incorrectly triggered by the input
sociative information enters STS. This fact "house" if the threshold for "horse" is
does not, however, mean that the various lowered sufficiently.
elements of the automatic process must be
available to consciousness or recallable at a S. Controlled Processes
later time. The activation in STS could be
extremely brief (in msec, say) and unless A controlled process utilizes a temporary
attention is directed to the process when it sequence of nodes activated under control of,
occurs or unless the sequence includes an and through attention by, the subject; the
automatic attention-calling response, then sequence is temporary in the sense that each
the information may be immediately lost activation of the sequence of nodes requires
from STS, and the subject may be quite un- anew the attention of the subject. Because
aware that the process took place. Even active attention by the subject is required,
when an automatic-attention response is part only one such sequence at a time may be con-
of the sequence, it will not necessarily affect trolled without interference, unless two se-
ongoing controlled processing unless the quences each requires such a slow sequence of
strength of the attention response is suffi- activations that they can be interwoven
ciently high. serially. Controlled processes are therefore
tightly capacity-limited, but the cost of ca-
4. Thresholds of Activation pacity limitations is balanced by the benefits
However an automatic process is initiated, deriving from the ease with which such
whether by internal or external input or by a processes may be set up, altered, and applied
control process, it is presumed that the prob- in novel situations for which automatic se-
ability that the process will run to comple-
tion depends on the strength of the initiating 6
It may also be possible to lower the threshold
stimulation. The simplest and most common by a .recent activation of the same automatic se-
examples are seen in studies of psychophysical quence. However, this hypothesis is difficult to
thresholds. If a letter, for example, is visually distinguish from the contextual one, because an
presented at a low-enough intensity or a activation of a sequence is usually accomplished by
the prior, .recent presentation of information as-
short-enough duration, then it may be en- sociatively related to the sequence, and this related
coded not as a letter but as a partial collec- information is still likely to be in STS at the time
tion of line-like features. At lower durations of test.
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 157
quences have never been learned. Controlled STS has two somewhat distinct roles. The
processing operations utilize short-term store, first is the provision of a temporary store-
so the nature of their limitations is deter- house for information currently important to
mined at least in part by the capacity limita- the organism. That is, it acts as a selective
tions of STS. window on LTS to reduce the amount of
information for processing to manageable
6. Short-Term Store (STS) proportions. The second role of STS is the
provision of a work space for decision making,
Short-term store is the labile form of the thinking, and control processes in general.
memory system and consists of the set of
concurrently activated nodes in memory. 7. Learning: Transfer to LTS
The phenomenological feeling of conscious-
ness may lie in a subset of STS, particularly Consider first what is meant by transfer
in the subset that is attended to and given from STS to LTS. Transfer implies the for-
controlled processing.7 mation in permanent memory of information
The capacity of STS is determined sto- not previously there. To be precise, this con-
chastically (see Shiffrin & Cook, Note 3) sists of the association (in a new relation-
so that a large amount of information may ship) of information structures already in
be present (activated) at any one moment, LTS. A minimal requirement for this new
but only a small amount of information will associative structure is the simultaneous
persist for an appreciable time period of presence in STS of the separate elements to
several seconds or more. Forgetting or loss be associated or related. That is, the various
from STS is simply the reversion of cur- nodes to be linked in a new relationship must
rently active information to an inactive state be activated in STS. Thus, transfer to LTS
in LTS. does not imply the removal of the transferred
What determines the loss rate? We sup- information from STS, nor the placing of
pose that the rate of loss of any informational new "subunits" in LTS that do not already
element or node in STS depends on the num- exist in either LTS or STS. Rather, transfer
ber of similar elements simultaneously active means the formation of new associations (or
in STS. By similarity we refer not only to the strengthening of old associations) be-
formal physical similarity but also to sim- tween information structures or nodes not
ilarity of features at comparable levels of previously associated (or strengthened) in
processing (e.g., the typeface of a printed LTS. Most new associative structures will
word will be less likely than the word itself include as a component the context in STS
to cause forgetting of a verbally encoded at the time of the transfer.
second word in memory). When a large The above remarks specify the nature of
amount of similar information is present, the new learning in STS but not the cause under-
loss rate will be rapid but will slow as the lying the storage mechanism. It has been
amount of active information decreases. To
give an example, when a complicated visual 7
An alternative formulation, closer to that sug-
scene is presented to a subject briefly in a gested by Atkinson and Shiffrin (1968), would
tachistoscope, a flood of visual information use the term STS to refer to those nodes that are
enters STS and initiates a series of chains given attention and controlled processing. Other
activated nodes would be referred to by another
of automatic processing that result in higher term, such as "sensory register" in Atkinson and
level features' also entering STS. However, Shiffrin's terminology. At such a general level of
most of the activated information will have discussion, it is doubtful that there are substantive
decayed and will be lost from STS in just differences between the two approaches—versions of
a few hundred milliseconds after physical each could be constructed to be identical to each
other. Each approach has its own heuristic ad-
offset of the scene (see Sperling 1960); just vantages, but a theory is better judged in terms of
a few of the features, perhaps at higher levels, its detailed assumptions and predictions than by
will remain present longer than a few seconds. its choice of either type of terminology.
158 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
shown that rehearsal (and coding rehearsal) trolled search) and have shown how auto-
are strongly implicated in learning (see matic processing (e.g., automatic detection)
Shiffrin, 197Sb, for a discussion). We prefer can bypass attentional limitations. It should
an extension of the rehearsal hypothesis. We not be overlooked that the initial learning of
assume that what is stored is what is at- automatic processing may require a variety
tended to and given controlled processing. of control processes and hence will be sub-
Rote rehearsal is just one form that attention ject to various limitations that may dis-
can take. In fact, maintenance rehearsal may appear when learning has progressed to a
result in storage of low-level auditory or high level.8
verbal codes not helpful for long-term recall,
(but helpful for long-term recognition), while 8. Retrieval of Information from Short-Term
coding rehearsal may result in storage of Store
high-level codes useful for recall. This model Several retrieval modes from STS are pos-
suggests that some degree of attention or sible. An automatic process might direct the
controlled processing is a prerequisite for retrieval process to just a subset of the ac-
storage. Thus, incidental learning situations, tivated information (e.g., only the letters,
in which no extended rehearsal takes place, not the masks, are compared in our VM
will still result in some learning to the degree search studies).
that the input items are attended to during The various controlled search strategies
presentation. that are available are normally serial in na-
These hypotheses that controlled processing ture but a variety of search orders is possible.
will underlie learning apply of course to the Search order may depend upon instructions,
development of automatic processing, and strength of short-term traces, the nature of
in particular, to the development of auto- categories of the traces, the modality of the
matic detection studied in the search and information, and the structure of STS. This
detection paradigms earlier in this paper and last point is worth emphasizing: Since STS
in Part I. In those paradigms it was seen is embedded in LTS, it partakes of the struc-
that automatic detection developed only with ture of LTS and is not a totally undiffer-
consistent training. Consistent training is entiated mass of information. Thus the order-
crucial when the learned sequences in LTS ing of a search can utilize this existent struc-
contains an internal or overt response that ture. In addition, the comparison process may
will be harmful to performance on trials that be exhaustive or terminating, and the in-
are inconsistent. For example, an attention formation located in one phase of the search
response to an item will harm performance on can be used to redirect a later phase of the
trials when that item is a distractor. Purely search.
informational sequences (i.e., sequences that It is the control of search order that is
do not include responses that direct internal responsible for many of the selective atten-
processes or overt actions) will be stored in tion effects that are observed. Information in
LTS when attended to, regardless of con- locations to which attention is first directed
sistency of training. In all cases, however, (i.e., the first locations searched) will in
consistent training and large numbers of general receive faster and more accurate
repetitions should lead to stronger automatic processing for the obvious reasons.
encodings and processes.
An implication of the hypothesis that trans- 8
fer to LTS is engendered by controlled pro- This distinction is helpful in understanding the
relation of our work to studies of attention in
cessing is the important concept that con- infrahuman organisms: the traditional studies of at-
trolled processing, and hence attentional tention in discrimination learning and blocking (e.g.
limitations, will be involved during the ac- see Mackintosh, 1975) are involved with those
quisition of automatic processing. We have limitations that occur during acquisition, while a
number of newer studies are involved with limita-
largely been identifying attentional limita- tions in steady state situations when learning is not
tions with controlled processing (e.g., con- possible (e.g. see Riley and Leith, 1976).
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 159
9. Retrieval of Information from Long-Term their nature imprecise and fallible. When
Store retrieval fails, perhaps temporarily, then we
Both automatic and controlled processing say forgetting has occurred. The causes of
are utilized in long-term retrieval. In gen- such failure are somewhat to the side of the
eral, an automatic associative mechanism will main interests of this paper but may be sum-
cause currently inactive information in LTS marized briefly as follows. First, the probe
to become active when associatively related cues used to activate related information in
information is in STS. The simplest example LTS may be ineffective, either because the
is the perceptual encoding process by which subject chooses them incorrectly or because
information concerning environmental inputs the general context in STS at the moment is
is located in LTS. As each feature is acti- dissimilar to that stored with the desired in-
vated, it can serve as a base for further acti- formation in LTS. Second, the process of
vation of features associatively related to it retrieval itself may harm additional retrieval
and to other previously activated features. attempts, especially if the probe cues are
This same principle applies to long-term not altered from one cycle of the search to
retrieval of higher order information. the next. Third, the search may fail due to
It should be noted well that information premature termination; that is, it may fail
once activated automatically from LTS will due to a decision that further retrievals are
then be in STS and may have to be located not worth the effort, or due to the reaching
through use of any of the short-term retrieval of some sort of time limit available for
mechanisms discussed above. search. (See Shiffrin, 1970, 1976, for a fuller
How does the subject exert control over discussion of these matters.)
LTS retrieval? The subject can utilize re-
hearsal and selective processes to raise the B. The Characteristics of Controlled and
strength of certain information in STS above Automatic Processing
the strength of other information. Then the In capsule form, we have covered the major
activated and retrieved information from phases of the information-processing system.
LTS will tend to be information related to We shall now attempt to define and out-
the selectively accentuated subset, line the characteristics of automatic and con-
In Shiffrin's (1970) paper the details of trolled processing in a more precise fashion.
the controlled LTS search were presented at
length. In summary, the controlled search
1. Controlled Processing
process is a series of cycles. On each cycle
(a) the subject generates probe information It is important to note that not all control
and places it in STS; (b) the probe informa- processes are available to conscious percep-
tion, along with the general contextual infor- tion, and not all control processes can be
mation presently in STS, activates associated manipulated through verbal instruction. It is
information from LTS, called the search set; therefore convenient to divide control pro-
(c) the subject searches the STS search set; cesses into two classes: accessible and veiled.
(d) the subject decides whether the appro- Accessible control processes are those like
priate information has been found and rote rehearsal or alphabetic search of LTS
whether the search is over. The process then that can be instituted and modified by in-
continues cycling. Note that step (c) is a struction. These are generally slow processes
retrieval from STS, so that one phase of con- that are easily perceived by the subject.
trolled LTS retrieval may be STS retrieval. Veiled control processes are those like the
serial comparison of items in short-term store
10. Long-Term Forgetting that are difficult to modify through instruc-
The forgetting of information in LTS is, tion. They are not easy to perceive through
by definition in the present theory, a question introspection because they take place so
of retrieval failure. The retrieval processes quickly.
discussed in the preceding sections are by Both classes of control processes have the
160 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
following general properties: (a) They are trol, but once initiated all automatic processes
limited-capacity processes requiring attention. run to completion automatically (though
Because these limitations prevent multiple some indirect control is possible through
control processes from occurring simulta- manipulation of the contents of STS at the
neously, these processes often consist of the time of the inciting input), (c) They require
stringing together in time of a series of considerable training to develop and are most
singly controlled unitary operations, (b) The difficult to modify, once learned, (d) Their
limitations of control processes are based on speed and automaticity will usually keep
those of STS (such as the limited-comparison their constituent elements hidden from con-
rate and the limited amount of information scious perception, (e) They do not directly
that can be maintained without loss), (c) cause new learning in LTS (though they can
Control processes can be adopted quickly, indirectly affect learning through forced al-
without extensive training, and modified location of controlled processing), (f) Per-
fairly easily (though not always by verbal formance levels will gradually improve over
instruction), (d) Control processes can be trials as the automatic sequence is learned.
used to control the flow of information within In many cases asymptotic performance levels
and between levels, and between STS and may not be reached for thousands of trials.
LTS. In particular, they can be utilized to It is particularly important to specify care-
cause permanent learning (i.e., transfer to fully what we mean when we say that an
LTS) both of automatic sequences and of in- automatic process does not partake of the
formation in general, (e) Control processes capacity limitations of STS. To make this
show a rapid development of asymptotic per- clear, let us suppose that X, Y, and Z are
formance. For a given control process, if nodes that are automatically activated in
automatic processing does not develop (due, turn by input I in the presence of general
say, to varied-mapping procedures) and if context nodes C. We may depict the sequence
the constituent elements of the process do as follows:
not otherwise change due to long-term learn-
ing, then performance level will stabilize very
quickly at an asymptotic value. When per- C C
formance improves over trials, it does so be- As each of X, Y, and Z is activated, it enters
cause the control process is changed, or be-
STS and its future residence in STS will be
cause automatic processing develops, or be- affected by the limitations of STS. In fact,
cause the constituent nodes that are linked if the nodes X, Y, and Z do not include an
by the control process are themselves altered attention-attracting response, then it is quite
by long-term learning.
conceivable that all three nodes will decay
Common examples of control processes in- and be lost within a few hundred milliseconds,
clude maintenance or rote rehearsal, coding
and the subject will be quite unaware that
rehearsal, serial search, long-term memory
such an automatic sequence ever occurred.
search, and decisions and strategies of all
Nevertheless, the sequence itself is not
kinds. These will be discussed in more detail
governed by the limitations of STS in the
in the sections below.
sense that the probability of its being ac-
2. Automatic Processing tivated and the rate of its occurrence will
not be affected by other concurrent automatic
Automatic processes have the following and conrolled processes taking place in STS
properties, (a) They are not hindered by the (at least if context C is present and the
capacity limitations of STS and do not re- other concurrent processes do not also utilize
quire attention. Thus automatic processes nodes X, Y, or Z).
often appear to act in parallel with one an-
other and sometimes appear to be indepen- 3. The Development oj Automatic Processing
dent of each other, (b) Some automatic The tendency for one node to activate
processes may be initiated under subject con- another will be increased if both nodes are
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 161
present simultaneously in STS and if a con- trolled processing, it must not be concluded
trol process and attention are directed toward that automatic processing invariably precedes
these nodes, thereby increasing their salience controlled processing. In fact, automatic and
in STS. controlled processes can proceed in parallel
Before we can describe efficacious training with one another (as in Experiment 4d when
conditions, we must distinguish between two automatic processing of the elements on the
types of automatic processes. One type of invalid diagonal proceeded in parallel with
automatic sequence may be called actional, controlled processing of the valid diagonal).
because it includes phases that direct internal Even more important, controlled processing
processes (like calls for attention) or phases is often used to initiate automatic processing.
that produce overt responses (such as button Particularly in complex processing situations,
presses). A second type, called informational, (such as reading), an ongoing mixture of con-
contains no directions for actions. The dis- trolled and automatic processing is utilized.
tinction is crucial because an informational The next steps in the serial, controlled
sequence strengthened on one trial will not processing sequence are based on the output
have deleterious consequences for perform- of automatic processes initiated earlier; then
ance on other types of trials. On the other new automatic sequences are initiated and
hand an actional sequence may give rise to these run to completion in parallel with the
a response antagonistic to a response re- ongoing controlled processing.
quired on another trial. Thus, to be useful in
a given task, actional sequences require 5. The Benefits of Automatic and Controlled
special, consistent, training conditions. Processing
To make this point clear, suppose that
node B produces a response antagonistic to A system based on the two basic processing
that produced by node C, while nodes D and modes with the characteristics we have de-
E produce no responses. If any of A-B, A-C, scribed has many advantages. In novel situa-
A-D, or A-E is trained alone, then that tions or in situations requiring moment-to-
sequence can be learned. If training on A-D moment decisions, controlled processing may
is mixed from trial to trial with training on be adopted and used to perform accurately,
A-E, then A will come to elicit both D and E. though slowly. Then as the situations become
However, if A-B trials are mixed with A-C familiar, always requiring the same sequence
trials then learning of both will be impossible, of processing operations, automatic processing
since A cannot lead simultaneously to two will develop, attention demands will be eased,
antagonistic responses. The automatic de- other controlled operations can be carried
tection system discussed at length in this out in parallel with the automatic processing,
paper is just an actional sequence, since an and performance will improve. Some of the
automatic-attention response to one stimulus advantages of such a system are (a) It allows
is incompatible with a simultaneous atten- the organism to make efficient use of a
tion response to another stimulus; therefore limited-capacity processing system. The de-
automatic detection requires consistent train- velopment of automatic processing allows the
ing to develop. Thus, in the varied-mapping limited-capacity system to be cleared and
conditions of any of our studies automatic devoted to other types of processing neces-
detection could not develop and controlled sary for new tasks, (b) It allows attention
processing had to be utilized. to be directed (through automatic-attention
responses) to important stimulation, what-
ever the nature of the ongoing controlled
4. Combinations of Automatic and Controlled
Processing processing, (c) It allows the organism to
adjust to changes in the environment that
Although sensory inputs are first encoded make previously learned activity patterns
with the automatic processing system, with useless or harmful, (d) It allows the organism
the results being made available to con- to deal with novel situations for which auto-
162 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
N
K=3— — M=)
S
S "T*
ii v~
/""
P
0
\ , 0
0 i~ 4 ^ ^"'— / / •^,
heb
N U
R ^f
T
Y "V *~ ' / S P
(^- E
s*
tku omatic U
I r es ponse
P T
N
P V. ,<!^ /
R
0
S
U D
T YAttentioiT( C Dntroiled U
S \ director ) response
Automatic C
attention T
response I
Controlled processing 0
N
Figure 11. A model for automatic and controlled processing during tasks requiring detection of
certain input stimuli. Short-term store is the activated subset of long-term store. N levels of auto-
matic encoding are shown, the activated nodes being depicted within each level. The dashed arrows
going from higher to lower levels indicate the possibility that higher level features can sometimes
influence the automatic processing of lower level features. The solid arrow from a node in Level 2
to the attention system indicates that this node has produced an automatic-attention response, and
the large arrow from the attention system to Level 2 indicates that the attention system has
responded. The arrow from level N to the Response Production indicates that this node has called
for an automatic overt response, which will shortly be executed. The arrow from Controlled
Processing to the Response Production indicates the normal mode of responding in which the
response is based on controlled comparisons and decisions. Were it not for the automatic responses
indicated, detection would have proceeded in a serial, controlled search of nodes and levels in an
order chosen by the subject.
matic sequences have not previously been For example, the letter "M" may first be
learned, (e) It allows the organism to learn encoded in features indicating contrast,
increasingly complex modes of processing by color, and position; then curvature, con-
building upon automatically learned sub- vexity, and angles; then a visual letter
systems. code and a verbal, acoustic-articulatory
code, then the codes "letter," "consonant,"
C. A Framework jor Processing in Detection, "capital;" and finally, perhaps, semantic
Search, and Attention Tasks and conceptual codes like "followed by 'N',"
"middle of the alphabet," and the like. (We
The framework we have in mind is built do not necessarily imply that these features
within the general theory already presented. are correct or exhaustive; if, say, amplitude
It is illustrated in Figures 11 and 12. When a components of the spatial-frequency analysis
set of inputs is presented (let us suppose of the inputs prove to be relevant features
for convenience that the inputs are visual) encoded by the system, the theory we de-
then each begins to undergo automatic scribe would be unchanged in all important
processing. The system automatically encodes respects.) What features will become ac-
each stimulus input in a series of stages and tivated depends on the physical nature of the
activates a series of features in the process. nervous system that was predetermined gen-
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 163
etically, the degree and type of prior learning, nodes that are automatically activated may
the physical characteristics of the display initiate a response that will direct subsequent
(like duration and contrast), and the general processing or subsequent actions. For ex-
context, both that in the environment and ample, an attention response might be ac-
that generated in STS by the subject. To the tivated by a feature; the attention response
extent that the subject directs the sensory might direct controlled processing to the cor-
receptor orientation and to the extent that responding set of features representing that
internally generated information can alter the input stimulus, so long as other competing
context in STS, the subject will have at least attention responses do not occur simulta-
some indirect control over automatic sensory neously. As another example, an overt re-
coding. sponse might be engendered by a particular
The automatic processing as described stimulus (such as a startle response to a
above takes place in parallel for each of the sudden loud noise, a galvanic skin response to
input stimuli. The processing of each stimulus an aversively conditioned word, a ducking
is often independent, except for lateral and response to a missile thrown at the head).
temporal interactions at early stages, called If the set of inputs contains a target stimulus
masking, and except for learned relationships that gives rise to a relevant nonconflicting
between items that may affect processing at attention response, or to an overt response,
later stages (if adjacent letters form a word, then we say that automatic detection is
for example). The various features that are operating. Note that the attention response
activated are all placed thereby in STS could be attached to features at any level
where they reside for a short period before of processing and to more than one feature
being lost (i.e., before returning to an in- at a time. In Figure 12, as an example, at-
active state in LTS). tention responses are attached both to the
We propose that some of the features or feature "8" and to the feature "number."
Visual Higher
Visual character Category level
features code code codes
Inputs ^\
IV h"\S I Controlled
processing
Son's age"
'Model
railroad
track" etc.
Figure 12. Another view of the model shown in Figure 11. This figure depicts a conceivable (but
abbreviated) series of feature encodings for a frame in which two characters and two masks are
presented. The 'Consistent-mapping condition is shown in which numbers are memory-set items
and letters are distractors. The arrows skipping levels indicate that a given feature can help
activate features at several different levels. The stimulus 8 is a consistent-mapping target, and
hence both the visual and category codes produce automatic-attention responses. The attention
response has attracted attention to the information deriving from the input 8.
164 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
It is possible that many of the inputs will ever this is the case, the framework described
give rise to attention or other responses and here applies equally to attention tasks.
that these responses will conflict. For ex- Divided-attention tasks study the increments
ample, every input might automatically in performance that may occur when the
engender an attention response; since it is load is decreased. Decreased loads are usually
impossible to direct attention to every item specified through advance instructions that
in the display at once, the various attention certain inputs are irrelevant, thereby de-
responses will conflict and cancel each other. creasing the size of the memory set, the frame
In such cases, the subject will not be governed set, or both.
by the output of the automatic system and According to our framework, experiments
will base subsequent actions and responses that do not show dividing attention to re-
on the results of a controlled search. These duce performance fall into two classes. One
hypotheses were supported in Experiment 2 class includes those studies in which auto-
(see Figure 4, left panel for the results). In matic detection is operating. These are usually
this study, all inputs were items that sub- studies utilizing a CM paradigm and high
ject had previously been trained to respond degrees of practice. Divided-attention deficits
to with automatic-attention responses. The will not be seen because the target stimulus
results showed that the subjects reverted to will be detected automatically, in parallel
the use of controlled search; furthermore, with the other stimuli, and often indepen-
the controlled search was almost identical dently of other stimuli. An exception to this
to that seen in the usual VM conditions in general rule may apply when two (or more)
which none of the inputs had been trained to targets are presented simultaneously on a
elicit automatic-attention responses. trial. Even if both targets generate attention
In cases when the input stimuli do not responses, there may be a difficulty in dis-
activate automatic responses that govern the criminating a double from a single occurrence.
detection process, then a controlled search In such an event, the controlled comparison
must be used. In this event, the subject must system may have to be called into play to
attempt to search through the set of features count the targets. (A detection decrement
that is activated and to use a search strategy may then occur, because the second item
that is as efficient as possible. For example, may have decayed from STS by the time
in our visual search studies, letter codes are the comparison of the first item is complete;
normally compared in preference to (and see Moray, 1975, Shiffrin, 1976, and the
prior to) sets of angles and lines; a category discussion of the 0 spacing effect in the CM
code representing a set of letters is often condition of Experiment 3, Part I.)
compared in preference to (and prior to) The second class of studies in which di-
individual letter codes, as shown in Experi- vided-attention deficits do not occur is that
ment 3. Furthermore, the order of search in which controlled search is utilized but the
and the placement of decisions within the capacity of STS is not stressed. Examples
search are also controlled in an attempt to are seen in the series of studies by Shiffrin
increase efficiency. To give an example, com- and his colleagues, summarized in Shiffrin's
parisons in our studies in Part I took place (1975a) article. In such studies either the
in an order that cycled through the frame load in STS is kept low, or a cue informing
for a given memory-set item before switching the subject which input is relevant appears
to a new memory-set item, and a matching at about the same time as the inputs.
decision was made after every comparison. It is, however, only in specially designed
To give another example, matching decisions situations that capacity limits are not stressed.
in many memory search studies (M varying, In most studies requiring controlled search,
F — 1) are withheld until all comparisons the extra time required to search a larger
are completed (Sternberg, 1975). number of relevant inputs will impair per-
Note finally that many attention tasks formance, whether measured by reaction
utilize search or detection paradigms. When- time or accuracy. This was the case in the
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 165
directed to a perceptually segregated subset at some time after presentation. One example
of the inputs. We are suggesting that an is the "split-span" technique reviewed by
automatic process might develop and direct Broadbent (19S8, 1971). Other examples
the search in situations where the search- are reported by Sperling (1960) and Von
directing response is consistently trained (as Wright (1970), who studied cued partial
is usually true in such studies). We do not report following tachistoscopic exposure of
feel, however, that the segregation of the character displays.
relevant inputs must be based on simple In these memory paradigms, essentially
physical features. With enough consistent the same underlying mechanisms apply as
training, an automatic search-directing re- in the detection paradigms, though memory
sponse could develop for a set of stimuli storage rather than detection is the processing
defined by quite complex features. For ex- goal. When controlled processing is utilized,
ample, Gould and Cam (1973) reported the subject has considerable control over
results suggesting that an automatic process storage: The items that are rehearsed or
was directing search to the locations of items coded are the items that will be recalled.
from a set of 10 potentially relevant letters If an input is presented that generates an
and away from the locations of items from a automatic-attention response, then this in-
set of 8 other letters that never included a put will receive attention and tend to be
target. (Any target was always drawn from recalled regardless of the nature of the con-
the potentially relevant set of 10 letters, but trolled process utilized to store the other
on most trials distractors were chosen from items. An excellent demonstration of these
this set of 10 as well as from the set of 8). points may be found in Kahneman (197S).
Although the evidence is not yet available,
we raise the possibility that the converse of 5. Threshold Detection Tasks
the argument in the preceding paragraph
An input in a threshold detection task, by
might also hold: Even with perceptual seg-
definition, is presented in such a way that
regation of the display into categories deter-
automatic encoding on most trials will be
mined by simple physical features, auto-
incomplete and therefore ambiguous. That is,
matic restriction of the search to the relevant
in Section IV.A.4 we discussed the fact the
category might not be possible unless the
input stimulation must be above some thresh-
relevant category is consistent across trials.
old level for automatic encoding to run to
completion in an accurate fashion. In Figure
4. Attention Tasks 12, for example, the "M," if presented near
threshold, might cause activation of some
The models applying in attention studies of the features at the visual feature level, but
that utilize detection or search paradigms no features at higher levels. This set of
are just those discussed in the preceding partial features will in general be ambiguous
sections. The general rule is that instructional —several letters and numbers might be con-
manipulations affect the order of controlled sistent with the activated feature set. Thus,
search; increases in load cause the time a target will on some trials give rise to feature
needed for controlled search to increase; and sets consistent with either targets or distrac-
consistent training leads to the development tors. Similarly, distractors will on some trials
of automatic responses that allow attentional give rise to feature sets consistent with either
limitations to be bypassed. targets or distractors. Under these conditions
We will say a few words, however, con- it is clear even in CM situations that auto-
cerning attention studies that do not utilize matic-attention responses and automatic de-
detection or search paradigms. One class of tection can only develop very slowly, if at
studies utilizes a memory paradigm. The all, because the internal representations of
inputs are manipulated in ways similar to targets and distractors will be mapped con-
those used in detection studies, but the sub- sistently to responses only on those few trials
ject is asked to recall or recognize the items when the encoding happens to be complete.
168 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
A similar argument applies if encoding is in- detection tasks as well as many others, can
accurate rather than incomplete. be summarized briefly in the following man-
Even in CM threshold tasks, therefore, the ner: Sensory inputs are encoded auto-
subject will be forced to adopt controlled matically, resulting in states of evidence
search on most of the trials, namely the trials (consisting of feature sets) that the subject
on which encoding is incomplete. On such utilizes in subsequent controlled processing.
trials a serial comparison process will be While generally accurate, this view is too
necessary to match the activated features simple in several respects. First, it must not
against the features of the members of the be assumed that there is ever one moment
memory set. Occasionally, however, auto- in time at which all activated features are
matic encoding will manage to process the simultaneously present and available to the
target completely. Suppose that almost al- subject for controlled processing. Rather it
ways, those stimuli that are completely will usually be the case that some features
processed will be encoded correctly. Suppose will still be undergoing the process of en-
also that a CM procedure is being utilized in coding (and hence will not yet be activated)
the task. In these special conditions, an at- while other features have already been acti-
tention response can be learned that might vated and are in the process of being for-
be utilized on those trials when complete gotten. Second, it must not be assumed that
encoding of the target happens to occur. controlled processing will follow automatic
A model very similar to this was used by processing in time. Such an assumption is a
Shiffrin and Geisler (1973) to fit the results fairly accurate simplification when used in
from a threshold detection study by Estes models for certain tasks including threshold
(1972). In that model, stimuli automatically detection. However, it is an inherent feature
processed to a final, incomplete result were of our theory that automatic and controlled
called "component detections" and stimuli processing can often operate in parallel, so
automatically processed to the complete that controlled processing can begin while
(letter) level were called "letter recogni- automatic processing is still progressing. This
tions." The model proposed a two-part con- point becomes crucial in situations in which
trolled search in which the letter recognitions the subject is required to give extremely fast
were searched first (in parallel through mem- responses (including those paradigms pre-
ory, but serially through the display), and senting above-threshold stimuli), since a
then the component detections were matched very fast response might have to be based
serially, feature by feature, against the mem- on the features available at a certain point
bers of the memory set. In light of the in time, even though better features might
present work, we would like to propose for be available at a later point in time.
consideration an alternative but similar
model in which the letter recognitions result D. Automatic and Controlled Processing in
in an automatic-attention response, so that Memory Storage and Retrieval
controlled search need be utilized only if the
In this section we shall review briefly how
target is not among the letter recognitions.
automatic and controlled processing might be
In practice, the predictions of these two
models would not differ greatly. involved in short-term retention, learning,
and in long-term retrieval.
In summary, it seems clear that controlled
search, whether at the feature or character
level, is bound to be the predominant de- 1. Maintenance of Information in Active
tection mechanism in all threshold detection Memory
tasks, but there is a possibility in CM para- We suggest that the control process most
digms that automatic detection can be util- evident to introspection is rehearsal. Atkinson
ized on a subset of the trials when the target and Shiffrin (1968) studied many tasks in
happens to be encoded relatively completely. which subjects utilized a continually updated
Our general view of processing, in threshold rehearsal "buffer" to aid their memory sys-
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 169
tern. Rote rehearsal is of course only one of Craik & Jacoby, 1975; Craik & Lockhart,
many control processes that cause items to 1972; and Craik & Tulving, 1975, for a dis-
have an extended residence in STS; coding, cussion of these issues.)
deciding, retrieving, and the like also cause The other side of the storage question con-
a similar result, though these processes are cerns the nature of LTS transfer when at-
primarily intended to serve other purposes. tention is not directed to an item or sequence
Iq general, items given attention of any sort of items. Studies of incidental learning tend
are maintained in STS, including items to to show that subjects learn what they attend
which attention is drawn by an automatic to, not what they are told to learn (see Craik
process. & Tulving, 1975; Hyde & Jenkins, 1969,
Items from which attention has been re- 1973; Postman, 1964; Schneider & Kintz,
moved do not necessarily leave STS quickly, 1967).
however. When the load is small and new If controlled processing is necessary for
inputs are minimized, items can remain in long-term learning then automatic processing
STS for extended periods (up to 40 sec in without controlled processing should not lead
the Shiffrin, 1973, study). Another example to appreciable retention. One source of evi-
is seen in the overt forced-rehearsal study dence is found by examining retention for
reported in Atkinson and Shiffrin (1971). A distractors in CM tasks. Such items pre-
long list of items was presented and subjects sumably have received almost no controlled
rehearsed aloud a continually updated list processing. However, it may be assumed that
of the three most recent items. As a result, due to prior exposures and prior learning the
the last three items presented were always distractors are given automatic encoding at
recalled on an immediate test. However, the least up to the "name" level (for evidence
probability of recall for items prior to the see Corteen & Wood, 1972; Keele, 1972;
last three decreased systematically as a func- Von Wright et al., 1975, among others).
tion of the spacing from last rehearsal to Does the automatic encoding that these dis-
test. Thus an item removed from controlled tractors receive during the many trials of a
processing still requires a period of time CM task lead to any retention? Moray
before it becomes lost from STS (i.e., be- (1959) had subjects repeat aloud a prose
comes inactive). This persistence in the passage in one ear while a seven-word list
absence of attention might be called the auto- was repeated 35 times in the other ear. Later
matic component of short-term maintenance. recognition for the unattended words was at
It is this automatic component of persistence the chance level. Gordon (1968) showed that
that was studied by Shiffrin and Cook (see a set of four distractors in a CM search task
Note 3) and that determines the basic ca- was recognized near the chance level even
pacity and persistence of STS. In brief, after 10 days of practice. Gleitman and
Shiffrin and Cook propose that the loss Jonides (1976) showed that distractors were
process is a stochastic mechanism whose more poorly recognized in a CM search task
rate increases as the amount of similar ma- than in a VM search task (though, due to
terial concurrently in STS increases. the low levels of practice used in their study,
the effect could have been due to the use of
2. Coding, or Transfer to LTS a controlled search for categories in the VM
The role of control processes in long-term condition).
storage is well established. Rote rehearsal Evidence of a rather different sort is found
appears to transfer low-level auditory-verbal in reading tasks. If automatic processing does
codes to LTS; these are not very useful for not lead to retention, then it might be ex-
long-term recall but can facilitate long-term pected that a skilled reader who automatically
recognition. Coding rehearsal appears to processes surface-structure features and who
facilitate long-term retrieval of all kinds. In gives controlled processing to conceptual
general, what is attended to and rehearsed is properties of a passage would retain little
what is stored in LTS. (See Bjork, 1975; information regarding the surface-structure
170 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
details. Bransford and Franks (1971) have tivate many features (and perhaps responses)
collected data that may be interpreted in and place them in STS. Internally generated
this fashion. Of course, if controlled process- inputs also tend automatically to activate
ing is directed toward levels of analysis associated information in LTS and to place
normally carried out automatically, then the it in STS.
features attended to will be remembered (see In general, the two primary controlled
Postman & Senders, 1946). phases of LTS retrieval consist of the pro-
To summarize, whether or not some cesses the subject uses to search and process
nominal storage manages to be eked out in the information in STS that has just been
the absence of controlled or attentive process- activated, and the processes used to alter the
ing, it seems clear that any phenomenon that probe information from step to step of the
would be appropriately called "automatic LTS search.
storage" is a far less important determinant It is important to note that the selection
of LTS learning than controlled processing. of probe information is not entirely under
Before leaving the general topic of learn- subject control. In fact, the probe informa-
ing it is interesting to speculate on the de- tion includes the general contextual informa-
velopment of complex information-processing tion present in STS at the time of retrieval,
skills. Since controlled processing is limited, in addition to the specific cues the subject
only a small part of the memory system can manipulates to facilitate retrieval. The gen-
be modified at any one time. After invariant eral context is only partly under subject con-
relations are learned at one stage, the pro- trol, since much of it is environmentally de-
cessing becomes automatic and controlled termined and even the internally generated
processing can be allocated to higher levels context (e.g., what the subject is currently
of processing. "thinking about") may be difficult to alter
For example, the child learning to read radically. Thus, changes in probe cues will
would first give control processing and then be under subject control to only a degree,
give automatic processing to various units of and this fact is one of the factors underlying
information, The sequence of automatically retrieval failure.
learned units might be foreground-back- This extremely brief and superficial over-
ground, features, shapes, letters,, words, and view should not be allowed to give the reader
meanings of phrases or sentences. The child an impression that distinguishing the auto-
would be utilizing controlled processing to matic and controlled phases of retrieval is
lay down "stepping stones" of automatic generally a simple matter. Furthermore, the
processing as he moves on to more and more controlled phase can be a most complex and
difficult levels of learning. The transition many-faceted process. To give just one ex-
from controlled to automatic processing at ample, Anderson and Bower (1973) consider
each stage would result in reduced discrimina- how subjects compare test sentences with
tion time, more attention to higher order stored sentences in long-term memory. In
features, and ignoring of irrelevant infor- our model, there are two phases to this ex-
mation. Gibson (1969, chap. 20) describes periment: (a) a retrieval of the appropriate
these effects to be three of the major trends informational nexus from long-term memory
in perceptual development. In short, the through the use of probe information (in-
staged development of skilled automatic per- cluding the test question) and (b) a com-
formance can be interpreted as a sequence parison in short-term memory of the test
of transitions from controlled to automatic question and the retrieved search set. The
processing. Anderson and Bower model, in the present
view, is primarily a model of the second of
3. Retrieval from LTS these processes—the comparison within
Retrieval from LTS has a large automatic short-term memory of the test sentence and
component. Sensory inputs result in an auto- the retrieved information. See Shiffrin's
matic series of stages of encoding that ac- (1970, 1975b, 1976) studies for a more
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 171
sentation of the memory set (Baddeley & the comparison process is carried out in a
Ecob, 1973). speeded fashion for an expected item. An-
4. The outcomes in CM paradigms often other possibility is that the encoding process
differ in fundamental ways from those in or the response production stage is speeded
VM paradigms, especially when some simple when an expected item is tested. Shiffrin and
physical basis separates the memory and dis- Schneider (1974) attempted to carry out a
tractor sets, or when the degree of training test discriminating among the possible mod-
is high. Many such findings have been dis- els. While the general notion of expectancy
cussed extensively in earlier sections of this was supported by the results, the study was
paper and will not be reviewed again here. not conclusive in determining which version
5. Subjects typically give incorrect re- of the model was to be preferred.
sponses on 1-10% of the trials, depending The important point to be emphasized is
upon the condition. The serial search models that we (and Sternberg also) suggest that
do not generally posit explicit mechanisms findings (1) to (3) listed above are not in-
to predict errors. compatible with a controlled search process
that is serial and exhaustive. We suggest these
Before turning to alternative models, it is findings can be explained by factors that af-
useful to consider how these phenomena are fect other stages of the response process than
dealt with in the framework of our theory, the comparison stage or perhaps by a factor
and also to see how Sternberg explains the that affects the comparison process only on
results. certain trials. Many other researchers have
The various findings in CM paradigms are preferred to discard the hypothesis of a serial,
easiest to explain: Automatic detection de- exhaustive comparison process. Some of the
velops that enables the serial search to be models proposed as alternatives will be dis-
bypassed. Much of the research in this paper cussed below.
was directed toward establishing this fact. The final problem for the serial, exhaustive
Sternberg (1975) is less specific but also sug- search model is posed by the occurrence of
gests that some alternative, more efficient, errors. The explanations of controlled search
search process is used in such situations. processes by Sternberg (and by us) for re-
Within varied-mapping paradigms, re- action time tasks do not propose an explicit
course to the hypothesis of automatic detec- mechanism by which errors may occur. Many
tion cannot be used to explain the results. investigators have simply ignored errors as
It would be parsimonious if each of the long as the error rate rate is low, under 5%,
above findings (1) to (3) proved to be the say. This is perhaps justifiable if errors are
result of a common mechanism. Shiffrin and anomalous events that do not interact with
Schneider (1974) proposed one possible mech- any of the other variables being studied. In-
anism—namely, that when information con- deed the stability of findings across numerous
cerning test probabilities is available to the studies that do not attempt to fix error rates
subject, then one item is placed in a special at any given value and that vary consider-
state prior to each test display (called a ably in the observed rates (in the range 0-
state of "expectancy"), and that a test of 10%) lends some support to the view that
the expected item results in a faster response ignoring errors will not distort the conclusions
time than tests of nonexpected items. Stimu- drawn from the reaction time data.
lus probability, serial position, cueing, stim- In principle, however, error rates cannot
ulus repetition, or differential importance be ignored. Pachella (1974) has shown that
could all be expected to determine the prob- instructions used to change error rates by
ability with which stimuli will be expected. just a few percent can cause considerable
The effects of expectancy could possibly changes in the level and pattern of reaction
act at several different stages en route to the times. There are many ways in which error-
execution of the response. One possibility, producing mechanisms can be appended to
also mentioned by Sternberg (1975), is that controlled search models. Two of the most
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 173
important are as follows: First, the subject error rates corresponding to each amount of
might terminate, or be forced to terminate, available search time are collected. The model
the controlled search before the search is derived from one study can then be used to
completed. Then a response would have to predict the results of the other. This is the
be made as a guess based on partial informa- method that was adopted in Experiments 1
tion at most, or perhaps no response at all and 2 of Part I. This approach has the ad-
would be made (an omission). These types vantage of relating two fields of inquiry that
of error resulting from early search termina- are normally treated separately. In the
tion account for almost all errors in search present paper, for example, attention tasks
and attention tasks using above-threshold using accuracy as a measure were linked to
stimuli with accuracy as a measure; in par- search tasks using reaction time as a measure.
ticular, most of the errors in the accuracy It should be noted, by the way, that the
tasks reported in the present paper and in type of error predicted for the results of Part
Part I are errors of the type. Second, the I/Experiment 1 is that caused by premature
search process might be carried out incor- termination of controlled search whose char-
rectly; that is, a comparison might be car- acteristics were derived from the reaction
ried out incorrectly owing to confusions time results of Part I/Experiment 2. How-
among items, forgetting, misperception, and ever, the errors seen in Part I/Experiment 2
the like. When error mechanisms are ap- were not necessarily caused by the same
pended to the basic search model a new ex- mechanism. In fact many, if not most, of
panded theory results, which must be tested them may have resulted from confusions,
through joint consideration of error data and forgetting, or anomalous condition-indepen-
reaction time data. dent factors.
Two basic approaches may be utilized to
test search theories incorporating error pre- B. The Theios Model
dictions. One approach involves manipulat-
ing the error rate systematically within each There are really two separate treatments
condition; the manipulation may be carried to be discussed here: the specific micromodel
out through instruction, deadline training, used by Theios et al. (1973) to predict re-
or signals-to-respond (Reed, 1973). This ap- sults that would otherwise be fit by a serial,
proach has the advantage of making full use exhaustive model, and the general theory
of both the error and reaction time data used by Theios (1973, 197S) to describe the
from a single experiment. It has the dis- production of response times in a variety of
advantage that the manipulations of error tasks.
rates may affect the nature of the controlled
search strategy adopted by the subject. For
example, Reed (1976) carried out a remark- 1. Serial, Terminating Search Through a List
able series of tests of a wide variety of models of Memory Items and Distractors
using data collected in the signal-to-respond In this model a list is constructed in mem-
procedure. Unfortunately, though one model ory, a list on which appear all items that
could be singled out in preference to the might be displayed for test. Each of these
others, the basic data differed in important items has an associated response cue (posi-
respects from those ordinarily found in the tive or negative) attached to it. The memory-
simpler version of the same paradigm. set items and distractors may in general be
The alternative approach involves carrying intermingled in this list, but a variety of
out two separate studies involving the same factors affect list ordering, and in many cases
subjects. One study utilizes the usual re- the memory-set items may tend to appear
action time methodology, with instructions early in the list. During the test, the subject
to keep error rates low. The second study serially scans the list until the test item is
utilizes a paradigm in which the available located, then terminates the search and re-
search time is systematically varied and the sponds according to the cue information
174 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
located along with that item. A group of serial, terminating search is not a good can-
items at the end of the list is assumed, how- didate as a model for the simpler para-
ever, to be searched in parallel, in one step, digms in which linear, parallel set-size func-
if search has not previously terminated. These tions are collected (see Sternberg, 1975, for
last items are supposed to be accessed through the relevant arguments). There are many
LTS, while those scanned serially are in STS. additional reasons for us to reject the hy-
(There are a variety of complications and pothesis of Theios et al; these reasons arise
extensions added to this model that we from the results of the present paper. Bas-
shall not discuss.) In at least some situations, ically, the Theios et al. model was proposed
this model can approximately predict linear to deal with a variety of stimulus prob-
set-size functions that are parallel for nega- ability effects that arise in CM paradigms.
tive and positive trials. We have shown in this article that indeed a
Various versions of the model were fitted different detection process develops in such
to data collected by Theios et al. (1973). paradigms; however, the fully developed
They utilized a CM procedure with a moder- automatic process and the pure controlled
ate amount of training. It seems likely, there- process are two different mechanisms and
fore, that many of the observed effects in both are relatively simple. In effect, Theios
their study, especially those based on differ- et al. have been led to propose a rather com-
ences in stimulus frequency, resulted from plex unitary model to try to fit data that
the development of at least a small degree of probably reflect a mixture of two different
automatic detection. The more frequent items simple search mechanisms.
would come to elicit automatic responses
sooner, causing the observed reduction in 2. A Hybrid Serial-Parallel Model
response reaction times. In our terms, the
A much more general theory of response
subject's strategy was probably a mixture
time generation in search tasks has been
of automatic detection and controlled search,
presented by Theios (1973, 1975). This
making simple conclusions difficult. Never-
theory has many similarities to the theory
theless, even if automatic detection is not
presented in the present article. It postulates
involved, it may be asked whether models
input and identification stages (which we
of the complexity suggested by Theios (or
would call automatic encoding), a response
suggested by the comments above) are neces-
determination stage (which in our terms is
sary to fit his data. Shiffrin and Schneider
either a controlled search or an automatic
(1974) showed that a simple version of the
process or some combination, depending on
expectancy model using serial, exhaustive
the paradigm), and response program selec-
search through the positive set could provide
tion and response output stages (which we
a fit to the data about as good as that pro-
term response production). Theios's de-
vided by the more complex models of Theios
scriptions of the situations in which auto-
et al. (1973). Under these circumstances the
matic processing should occur are quite sim-
data do not provide strong support for the
ilar to those we suggest in this paper. His
Theios model.
distinction between controlled and automatic
Aside from questions concerning the de-
processing is not made as explicit as in the
tails of the Theios et al. (1973) study, one
present article, and in his view the use of
can address the more general hypothesis put
controlled search is tied more to stimulus-
forward by these investigators. This hy-
response compatibility than to the type and
pothesis suggests that most memory search
amount of practice (i.e., the consistency of
studies are best conceived of in terms of a the mapping). Furthermore, the details he
(rather complex) serial, terminating search presents concerning controlled search pro-
model. Although we have collected data cesses are somewhat different and more limited
arguing for a serial, terminating search than we have presented in the present treat-
through the memory set (Experiment 2, Part ments. On the whole, however, the Theios
I), we agree with Sternberg (1975) that a (1975) theory is quite compatible with the
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 175
argument much further and have suggested must be high-level ones, however, because in
that the time utilized during controlled pro- the studies masks were used to delete low-
cessing is in many tasks the immediate, level features from storage. Thus such results
proximal cause of the inability to divide call into question the need for two selective
attention. The results of Experiments 1 and processes, one before and one after genera-
2 in Part I, and our analysis of them, were tion of states of internal evidence. We argue
intended to demonstrate the role of timing instead that only one controlled selection
by relating the accuracy of performance in process occurs, after automatic encoding has
attention tasks to the reaction times produced produced states of internal evidence. Even
in search tasks. if it should eventually be determined that
some small degree of selection occurs during
B. The Shiffrin (1975a) Model initial perceptual processing, the accumula-
The work by Shiffrin and his colleagues tion of data makes it clear that the magnitude
summarized by Shiffrin (1975a) may be re- of attentional effects due to controlled pro-
garded as a preface to the present studies and cessing after automatic encoding is far
present theory. The primary aim of the greater than the magnitude of any effects due
earlier studies was a demonstration that en- to early selection. For further discussion see
coding quality is virtually unaffected by at- Shiffrin & Gardner (1972), Shiffrin, Craig, &
tentional instructions. The results from a Cohen (1973), Shiffrin, Gardner, & Allmeyer
(1973), Shiffrin & Grantham(1974), Shiffrin,
series of experiments demonstrated that in-
formation from a source (location) is pro- Pisoni, & Casteneda-Mendez (1974), and
cessed as well when it is the only information Shiffrin, McKay, & Shaffer (1976). The hy-
requiring processing as when it arrives pothesis supported by these studies, that of
simultaneously with other information also a single selection phase following automatic
encoding, has been proposed by a number of
requiring processing. If filtering or attenua-
tion were occurring between stimulus input researchers and theorists. We turn now to
and the production of internal states of evi- these proposals.
dence, then attention would have been al-
located less effectively when simultaneous C. The Deutsch and Deutsch Theory
inputs were used, and performance would As preface to the Deutsch and Deutsch
have suffered. (1963) model and to other similar theories,
It could well be argued that the results we will review some results in auditory selec-
were obtained only because attentional ca- tive perception from shadowing or dichotic
pacity had not been exceeded. In effect, this listening tasks. Then we will consider the
is our own argument. Such an argument does Deutsch and Deutsch treatment, objections
not help resuscitate the views that early to it, and our present view.
filtering, attenuation, or blocking of pro-
cessing takes place, since the situations in 1. Experiments on Auditory Shadowing and
which the results were obtained were as com- Dichotic Listening
plex at the stimulus input side as most of
the studies in which attentional deficits are Starting with Cherry's (1953) study, nu-
found. merous studies have utilized a shadowing
It should be noted that the Shiffrin technique in which the subject repeats con-
(197Sa) results are easily accommodated in tinually a stream of speech (usually presented
Broadbent's general theory. It must be as- to one ear) while other messages are pre-
sumed that the various inputs are retained in sented simultaneously (usually to the other
the sensory buffer long enough that they may ear). Moray (1959) showed that information
all be passed successfully through the limited- on the nonshadowed ear could not be recalled,
capacity system (in essence this is our ex- recognized, or relearned with savings. How-
planation, also, if "sensory buffer" is replaced ever, the subject's own name on the non-
by "STS"). The features in the sensory buffer shadowed ear could be noticed and recalled
180 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
at the end of the experiment. The traditional tion, overcomes these objections. If the stim-
explanations for such results hold that the uli presented do not cause automatic-atten-
information presented to the nonattended ear tion responses, then a controlled search situa-
is greatly attenuated, but that some informa- tion (like that of Experiment 4a) obtains,
tion, enough to let highly salient inputs be and a similar explanation holds: The subject
noticed, does pass the filter. The basis for will carry on attention-demanding, controlled
the types of information that are noticed on processing primarily on the shadowed or at-
the nonshadowed ear is usually considered to tended message, with only minimal controlled
be rooted in the content of the input. The processing of the nonattended information,
earlier treatments supposed that simple phys- just enough to establish which information
ical cues served as a basis (not only in is to be given deeper processing. Physical
Broadbent, 19S8, but also in Neisser, 1967). cues, when available, will make this relevancy
For example, two messages in the same voice decision easy. On the other hand, if auto-
are confused, but if the messages are in a matic-attention responses have been pre-
man's voice and a woman's voice, then the experimentally attached to stimuli, these
nonshadowed message can be ignored. A stimuli will be attended to and remembered
number of studies by Treisman (see 1969 for even when they are presented on a to-be-
a review) showed that content of messages ignored channel. This fact explains the re-
can also serve as a basis for selection though call for the subject's own name, for example.
usually not as effectively as physical cues. Furthermore, the objection that selection is
difficult without a physical cue does not hold,
2. The Deutsch and Deutsch Approach since we have shown that consistent train-
Faced with selection on the basis of con- ing leads to the development of automatic-
attention responses even without the pres-
tent, Deutsch and Deutsch (1963) suggested
that all inputs are analyzed to a relatively ence of a simple, consistent physical cue
(though a great deal of training may be
high level and that the results of the pro-
cessing are then used to select certain stimuli necessary—see Experiments 1 and 3).
for further processing, for memory, and for To return to the original controversy, some
response. We are in essential agreement with of the most persuasive evidence in favor of
this view, though we have elaborated on it the Deutsch and Deutsch position has been
considerably and introduced the important the subject's ability to divide attention with-
distinction between controlled search and out deficit, even in VM situations when auto-
learned attention responses leading to auto- matic detection could not have been operat-
matic detection. ing. Such data have often been discounted by
The Deutsch and Deutsch model has not Broadbent and others on the following basis.
yet gained general acceptance and in par- It is argued that relatively unanalyzed in-
ticular has been rejected by Broadbent formation is held temporarily in a sensory
(1971) and Treisman and Riley (1969). buffer; thus it should not be surprising that
Within the framework of shadowing studies, multiple stimuli can pass through the filter
several objections have been raised to the as long as a free period exists after presenta-
Deutsch and Deutsch position. First, selection tion that allows the information to be passed
without a physical cue remains difficult even serially from the buffer through the filter.
with wide variations in content. Second, dif- This argument is used to explain the ap-
ferences in content affect selection very little parent ability to analyze multiple simulta-
when a physical cue is present. Third, selec- neous stimuli in a variety of tasks and is
tion of a given word in a dichotic sequence thereby used to discount much of the evi-
depends on the previously attended word and dence supporting the Deutsch and Deutsch
not on the previously presented, but un- position. Broadbent's argument, however,
attended, word. does not account for the results of our multi-
Our present theory, with its distinction be- ple-frame studies, for in these studies there
tween controlled search and automatic detec- were no blank periods to allow serial process-
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 181
ing, and yet the rate of presentation could trolled processing. Thus extrapolating from
even be speeded under the consistent (CM) Norman's theory, visual search for a single
training conditions. We do not disagree that memory item, even in a VM task, should be
under VM conditions a postpresentation free almost capacity-unlimited and parallel. The
period will allow simultaneous inputs to be subject should, by hypothesis, raise the per-
handled serially by the attention system. We tinence value of the single item, and hence
merely wish to point out that this explanation this item should be processed first. However,
cannot help explain the relationship between all of our studies have demonstrated that
our VM and CM conditions in the multiple- such pertinence changes cannot take place on
frame tasks. Instead, Broadbent and Treis- a trial-by-trial basis. It is only after con-
man would have to argue that the salience of siderable CM training that an automatic-
the consistently trained targets is so high attention response develops (or, in Norman's
that the filter plays no role in processing terms, that pertinence is raised). Thus a sub-
whatsoever, in which case it might as well be ject may wish to find a particular stimulus
argued that automatic processing bypasses but does not in general have the attentional
the selective system. control to enable the stimulus to be located
In summary, we argue that the position without a serial, limited, controlled search.
of Deutsch and Deutsch, when extended in
the manner of our present theory, is both E. The Neisser Model
fully defensible and in better accord with the Neisser (1967) has proposed a general
data than the opposing views. model that postulates an early stage of par-
D. The Norman Model allel processing not under subject control
called "preattentive" (similar to our sys-
Norman, alone and with coauthors, has temic, automatic stage), and a later, controll-
been responsible for a number of rather dif- able, serial stage referred to by the term
ferent models. We consider here the version "focal attention." In many respects this gen-
most closely in accord with that of Deutsch eral approach is quite similar to the present
and Deutsch (Norman, 1968, 1969; see also treatment. For example, Neisser discusses in
Lindsay & Norman, 1972). This model is some detail the fact that preattentive pro-
basically an extension of the Deutsch and cesses can control attention or responses (as
Deutsch approach in the sense that it pro- in eye movements). Furthermore, it is sug-
poses a relatively complete analysis of all in- gested that practice can lead to search be-
coming stimuli. As in the Deutsch and coming dependent on preattentive mecha-
Deutsch approach, a mechanism is proposed nisms, and hence lead detection to become
by which certain of these complete encodings automatic.
are selected for further serial processing. This Neisser's own treatment of his visual search
mechanism is called "pertinence." data differed in some respects from our present
In the model, each set of encodings is as- treatment. For example, Neisser argued that
signed a value of pertinence that determines irrelevant targets in CM designs are not pro-
the order of further processing. The value of cessed in as much detail as relevant targets.
pertinence is assumed to derive from two We argue that automatic analysis is equal,
sources: the stimulus encoding and an anal- but that attention is drawn to the relevant
ysis of previous inputs. The item selected for targets. Furthermore, there are some differ-
further analysis is that one whose pertinence ences in emphasis between Neisser's and our
value is highest as determined by a com- models. For example, Neisser tends to em-
bination of sensory encoding and previous phasize that preattentive discriminations are
analysis. In very general terms this model is based on relatively crude physical differ-
highly similar to the theory we have proposed. ences among stimuli, whereas we feel "pre-
However, the Norman model fails to make attentive" processes are determined by
the important and necessary distinction be- conditions of training and practice, with phys-
tween automatic-attention learning and con- ical differences affecting the rate of develop-
182 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
ment of automaticity. However, such differ- determine the ability of features to attract
ences should not be allowed to mask the attention. That is, we have shown how the
similarity of the general approaches.9 attention-attracting ability of features de-
pends on the use of consistent-mapping train-
F. The LaBerge Model ing procedures.
LaBerge (1973, 1975) presents an atten- On the whole, our model and LaBerge's
tional model, in the context of reaction time agree when applied to the same data. Thus,
matching tasks, that has many elements in although the two theories have been built
common with the present theory. He sup- on somewhat different data bases, they are
poses, like Deutsch and Deutsch, that there consistent even to the degree that there are
exists an automatic, learned encoding system similarities in the diagrammatic representa-
that operates in parallel on input stimuli and tions (compare our Figures 11 and 12 with
produces features independently of controlled, the depictions of the model in LaBerge, 1973,
attentive processing. Depending on prior 1975).
learning, there are no limits to the depth of Let us now return to models that are in
such processing. Subsequent to such auto- basic agreement with the Broadbent-Treis-
matic encoding, controlled processing re- man view, as opposed to the Deutsch and
quiring attention is necessary to carry out Deutsch view.
further analyses of the input. The features
resulting from automatic encoding are as- G. The Kahneman Theory
sumed to be capable of calling attention to Kahneman (1973) proposes a rather ex-
themselves. LaBerge proposes, and demon- tensive theory and reviews the literature in
strates empirically, that new automatic en- considerable detail. We cannot begin to do
codings, that is, perceptual learning, may justice to his overall system in the limited
occur with practice. Finally, LaBerge em- space available and will therefore discuss
phasizes that perceptual learning takes place only a few general features. Kahneman pro-
mainly after high accuracy is achieved. All of poses a general theory of the allocation of
these aspects of the system are quite similar capacity according to a factor called "effort."
to ours. In our terms, he is concerned with the general
One additional element of the LaBerge limits on controlled processing: How can
model holds that attention may be directed control be allocated? Can total capacity
toward a not-yet-learned feature, causing it vary? What types of operations can be car-
to act temporarily as an automatically learned ried out together with what losses of
unit, but only while attention is maintained. efficiency?
That is, a feature that would normally re- Interpreting our theory in Kahneman's
quire extended controlled processing to be terms, one would say that we have proposed
encoded from automatically activated, sim- a specific time-based theory determining con-
pler subunits, can, when attention is focused trolled processing capacity in detection tasks,
on it in advance, be activated by the per- and that our theory gives a set of rules or
ceptual encoding process. This suggests at
least some degree of control over encoding.
9
While we have downplayed the role of such One must take care to distinguish the Neisser
control, we have not ruled it out. In fact, (1967) view of attention from the Neisser (1967)
we suggested that the results from Part I/ view of cognitive processing in general. The general
view emphasizes the constructivist aspects of per-
Experiment 2, when M was equal to 1, might ception, sometimes known as "top-down" processing,
well be explained by such a mechanism. in which perception is heavily influenced at all
If there is a difference between our ap- stages by the subject's expectations and general
proach and LeBerge's, it lies in the different knowledge. The essence of this general view seems
to be a bit incompatible with the notion of an
areas of applications and the completeness initial automatic processing system that is fixed,
of the assumptions. For example, we have permanent, and largely unaffected by higher level
elaborated on the conditions of training that controlled processing.
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 183
Many other investigators have provided in humans to models deriving from auto-
classifications of attentional paradigms and matic-attention learning in consistent-map-
results. The Treisman (1969) review, for ping paradigms in animals. The results in the
example, is largely atheoretical and classifies present article should make it clear that such
a variety of paradigms and tasks with a view a correspondence will fail. In fact, however,
toward indicating the types of attention re- it is becoming clear that a comparison of
striction that occur in each. Similarly Moray VM with VM, and CM with CM, studies in
(1969a, 1969b) has classified attentional the two areas reveals similar findings. To
studies and paradigms. Unfortunately, the give just one example, Riley and Leith
variety of results and the experimenter's (1976) review some paradigms in which ani-
ability to design new tasks seem to have mals are forced to utilize what we term con-
grown about as fast as the growth in com- trolled processing; in such cases they reveal
plexity of the classifications. Thus as useful capacity limitations and attentional control
as these classifications have been, they have roughly similar to that found in human sub-
been limited to a posteriori descriptions. jects (see also Mackintosh, 1975, and
Wagner, 1976, for discussions of related
J. Other Models matters).
The brief summary of a number of theories
given above by no means exhausts the field. K. Summary
Keele (1973) and Hochberg (1970) pre-
sented models like that of Deutsch and Many of the elements constituting our
Deutsch and generally similar to ours. Im- theory may be found in previous proposals.
portant and influential reviews, with con- There are certain themes, which have re-
siderable theoretical import, have been curred in earlier theories, about which we
written by Egeth (1967), Moray (1969a, take a particular position.
1969b),Lindsay (1970), and Swetsand Krist- 1. Control oj processing during encoding
offerson (1970). Important research has been and feature abstraction. We argue that con-
carried out by Eriksen and his associates trol is minimal, that subject-controlled filter-
(e.g., Eriksen Si Collins, 1969; Eriksen & ing, blocking, or attenuating does not occur
Eriksen, 1974; Eriksen & Hoffman, 1972; during this stage but subsequent to per-
Eriksen & Rohrbaugh, 1970) and by Posner ceptual encoding.
and his associates (e.g., Posner & Boies, 2. The role oj crude physical differences in
1971; Posner & Snyder, 1975). Although leading to automatic discrimination. We argue
space limitations preclude our reviewing that the conditions of training are crucial to
this work, we feel that the models we have the development of automatic processing and
discussed above represent a cross section of automatic detection; the nature of physical
typical approaches that have been put for- differences among items is of secondary im-
ward for tasks that resemble those in this portance theoretically, though it may in
paper. practice be important in determining the
We would like to finish this discussion of rate of acquisition and levels of perform-
alternative approaches by considering very ance during acquisition.
briefly the relation of our theory to those 3. The limitations on, and capacity oj, con-
arising in the fields of animal discrimination trolled processing. We have presented a
learning and attention (e.g., see Mackintosh, specific, detailed model in which limitations
1975; Sutherland & Mackintosh, 1971). We on rate of processing determine the con-
feel that data and theory in the human and trolling system's capacity.
infrahuman areas are in much closer accord
than previous work might lead one to believe. On the one hand, our theory may be seen
There has been a tendency to compare at- as both a specification and extension of views
tentional models deriving from controlled proposed in various earlier models. On the
processing effects in varied-mapping studies other hand, our theory is grounded in a
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 185
Experiment 3 was designed to test directly 2. Dividing attention is possible when the
the role of categories in the development of targets have been consistently mapped during
automatic detection and the operation of training until automatic detection operates.
controlled search. It was shown that in VM 3. Focused-attention deficits arise when the
conditions, category learning for arbitrary, distracting stimuli initiate automatic-atten-
visually confusable sets of letters eventually tion responses.
took place after 25 sessions of training. At 4. Focusing attention is possible during
this point, controlled search was still being controlled processing (e.g., in VM tasks).
used, but the comparison process had switched
from individual items to categories as a We shall conclude by summarizing briefly
whole (memory-set size had no effect). the progress we feel has been made in organ-
However, when training was then switched izing the phenomena of detection, search,
to a CM procedure, performance improved attention, and perceptual learning. One of
radically and automatic detection was learned. the greatest contributions of the work is the
The results suggest that fact that all these areas have been tied both
empirically and theoretically to each other
1. Both categories and automatic responses and to common mechanisms. Another im-
can be learned for arbitrarily constituted, portant contribution is the delineation of
visually confusable character sets. the differences between, and the character-
2. The role of categories is to improve con- istics of, automatic and controlled processing.
trolled search, and possibly to speed the We make no claims that these experiments
acquisition of automatic detection. have concluded the study of these areas.
Quite to the contrary, one of the most im-
As a group, Experiments 1, 2, and 3 pro- portant results of this work is the host of new
vided a convincing demonstration of the
questions that can now be phrased, and we
qualitatively different characteristics of con- hope, answered.
trolled search and automatic detection.
Experiment 4 was designed to test the sub-
jects' ability to focus attention in the face Reference Notes
of distraction by (a) neutral characters, (b) 1. Sperling, G. Multiple detections in a brief visual
current targets in to-be-ignored display posi- stimulus: The sharing and switching of atten-
tions, (c) items in to-be-ignored positions tion. Paper presented at the 16th annual meeting
that subjects had been trained previously to of the Psychonomic Society, Denver, November
1975.
respond to with automatic-attention re- 2. Sperling, G. Attention operating characteristics
sponses. The results showed that controlled for visual search. Paper presented at the 9th
search can usually be directed to locations annual Mathematical Psychology meeting, New
that the subject desires to attend to but that York, August 1976.
3. Shiffrin, R. M., & Cook, J. R. Short-term for-
automatic-attention responses can overwhelm getting without rehearsal or interference: Data
the controlled processing system and can and theory. Paper presented at the 16th annual
cause attention to be allocated to positions meeting of the Psychonomic Society, Denver,
that should be ignored. Thus automatic-atten- November 1975.
tion responses cannot be ignored and will
interrupt and redirect ongoing controlled References
processing.
Anderson, J. A. A theory for the recognition of
The attention literature as a whole was items from short memorized lists. Psychological
considered, and in light of our results the Review, 1973, 80, 417-438.
following rules were suggested. Anderson, J. R., & Bower, G. H. Human associative
memory. Washington, D.C.: V. H. Winston & Sons,
1. Divided-attention deficits arise from 1973.
Atkinson, R. C., Holmbren, J. E., & Juola, J. F.
limitations on controlled processing. In par- Processing time as influenced by the number of
ticular, detection deficits are due to the elements in a visual display. Perception & Psycho-
limited rate of the serial comparison process. physics, 1969, 6, 321-326.
PERCEPTUAL LEARNING AND AUTOMATIC ATTENDING 187
Atkinson, R. C., & Juola, J. F. Search and decision archical search processes in a recognition memory
processes in recognition memory. In D. H. task. Journal of Verbal Learning and Verbal Be-
Krantz, R. C. Atkinson, R. D. Luce, & P. Suppes havior, 1971, 10, 528-541.
(Eds.), Contemporary developments in mathe- Corballis, M. C. Serial order in recognition and
matical psychology (Vol. 1). San Francisco: recall. Journal of Experimental Psychology, 1967,
Freeman, 1974. 74, 99-105.
Atkinson, R. C. & Shiffrin, R. M. Human memory: Corballis, M. C. Access to memory: An analysis of
A proposed system and its control processes. In recognition times. In P. M. A. Rabbit & S. Domic
K. W. Spence & J. T. Spence (Eds.), The psy- (Eds.), Attention and performance V. New
chology of learning and motivation: Advances in York: Academic Press, 1975.
research and theory (Vol. 2). New York: Academic Corballis, M. C., Kirby, J., & Miller, A. Access to
Press, 1968. elements of a memorized list. Journal of Experi-
Atkinson, R. C., & Shiffrin, R. M. The control of mental Psychology, 1972, 94, 185-190.
short-term memory. Scientific American, 1971, 225, Corteen, R. S., & Wood, B. Autonomic responses to
82-90. shock-associated words in an unattended channel.
Baddeley, A. D., & Ecob, J. R. Reaction time and Journal of Experimental Psychology, 1972, 94,
short-term memory: Implications of repetition ef- 308-313.
fects for the high-speed exhaustive scan hy- Craik, F. I. M., & Jacoby, L. L. A process view
pothesis. Quarterly Journal of Experimental Psy- of short-term retention. In F. Restle et al., (Eds.),
chology, 1973, 25, 229-240. Cognitive theory (Vol. 1). Hillsdale, N.J.: Erlbaum
Biederman, I., & Stacey, E. W., Jr. Stimulus prob- Associates, 1975.
ability and stimulus set size in memory scanning. Craik, F. I. M., & Lockhart, R. S. Levels of pro-
Journal of Experimental Psychology, 1974, 102, cessing: A framework for memory research. Jour-
1100-1107. nal of Verbal Learning and Verbal Behavior, 1972,
Bjork, R. A. Short-term storage: The ordered out- 11, 671-684.
put of a central processor. In F. Restle et al., Craik, F. I. M.. & Tulving, E. Depth of processing
(Eds.), Cognitive theory (Vol. 1). Hillsdale, N.J.: and the retention of words in episodic memory.
Erlbaum Associates, 1975. Journal of Experimental Psychology: General,
Brand, J. Classification without identification in 1975, 104, 268-294.
visual search. Quarterly Journal of Experimental DeRosa, D. V., & Morin, R. E. Recognition reaction
Psychology, 1971, 23, 178-186. time for digits in consecutive and nonconsecutive
Bransford. J. D., & Franks, J. J. The abstraction of memorized sets. Journal of Experimental Psy-
linguistic ideas. Cognitive Psychology, 1971, 2, chology, 1970, 83, 472-479.
331-350. Deutsch, J. A., & Deutsch, D. Attention: Some
Broadbent, D. E. Perception and communication. theoretical considerations. Psychological Review,
London: Pergamon Press, 1958. 1963, 70, 80-90.
Broadbent, D. E. Decision and stress. London: Egeth, H. Selective attention. Psychological Bulletin,
Academic Press, 1971. 1967, 67, 41-57.
Burrows, D. & Okada, R. Serial position effects in Eriksen, C. W., & Collins, J. F. Temporal courses
high-speed memory search. Perception & Psycho- of selective attention. Journal of Experimental
physics, 1971, 10, 305-308. Psychology, 1969, 80, 254-261.
Cavanagh, P. Holographic processes realizable in the Eriksen, B. A., & Eriksen, C. W. Effects of noise
neural realm: Prediction of short-term memory letters upon identification of target in nonsearch
performance (Doctoral dissertation, Carnegie- task. Perception & Psychophysics, 1974, 16, 143-
Mellon University, 1972). Dissertation Abstracts 149.
International, 1973, 33, 3280B. (University Micro- Eriksen, C. W., & Hoffman, J. E. Some character-
films No. 72-234, 79) istics of selective attention in visual perception
Cavanagh, P. Holographic and trace strength determined by vocal reaction time. Perception &
models of rehearsal effects in the item recogni- Psychophysics, 1972, 11, 169-171.
tion task. Memory & Cognition, 1976, 4, 186-199. Eriksen, C. W., & Rohrbaugh, J. W. Some fac-
Cherry, C. Some experiments on the reception of tors determining efficiency of selective attention.
speech with one and with two ears. Journal American Journal of Psychology, 1970, 83, 330-
of the Acoustical Society of America, 1953, 25, 342.
975-979. Estes, W. K. Interactions of signal and background
Cherry, E. C., & Taylor, W. K. Some further experi- variables in visual processing. Perception & Psy-
ments upon the recognition of speech with one chophysics, 1972, 12, 278-286.
and two ears. Journal of the Acoustical Society Gibson, E. J. Principles of perceptual learning and
of America, 1954, 26, 554-559. development. New York: Prentice-Hall, 1969.
Clifton, C., & Birenbaum, S. Effects of serial posi- Gleitman, H., & Jonides, J. The cost of categoriza-
tion and delay of probe in a memory scan task. tion in visual search: Incomplete processing of
Journal of Experimental Psychology, 1970, 86, 69- targets and field items. Perception & Psycho-
76. physics, 1976, 20, 281-288.
Clifton, C., & Gutschera, K. D. Evidence for hier- Gordon, G. T. Interaction between items in visual
188 RICHARD M. SHIFFRIN AND WALTER SCHNEIDER
Bower (Ed.), The psychology of learning and Treisman, A. M. Strategies and models of selective
motivation: Advances in research and theory attention. Psychological Review, 1969, 76, 282-299.
(Vol. 7). New York: Academic Press, 1973. Treisman, A. M., & Riley, J. G. A. Is selective at-
Theios, J. The components of response latency in tention selective perception or selective response?
simple human information tasks. In P. M. A. A further test. Journal of Experimental Psychol-
Rabbit & S. Dornic (Eds.), Attention and per- ogy, 1969, 79, 27-34.
formance V. New York: Academic Press, 197S. Von Wright, J. M. On selection in visual immediate
Theios, J. et al. Memory scanning as a serial self- memory. In A. F. Sanders (Ed.), Attention and
terminating .process. Journal of Experimental Psy- performance (Vol. 3). Amsterdam: North-Holland,
chology, 1973, 97, 323-336. 1970.
Townsend, J. T. A note on the identifiability of Von Wright, J. M., Anderson, K., & Stenman, U.
parallel and serial processes. Perception & Psycho- Generalization of conditioned GSRs in dichotic
physics, 1971, 10, 161-163. listening. In P. M. A. Rabbit & S. Dornic (Eds.),
Treisman, A. M. Contextual cues in selective listen- Attention and performance V. New York: Aca-
ing. Quarterly Journal of Experimental Psy- demic Press, 197S.
chology, 1960, 12, 242-248. Wagner, A. R. Priming in STM: An information
processing mechanism for self-generated or re-
Treisman, A. M. Effect of irrelevant material on
trieval-generated depression in performance. In
the efficiency of selective listening. American Jour- T. J. Tighe & R. N. Leaton (Eds.), Habituatiom:
nal of Psychology, 1964, 77, S33-S46. (a)
Perspectives from child development, animal be-
Treisman, A. M. Monitoring and storage of ir- havior, and neurophysiology. Hillsdale, N.J.:
relevant messages in selective attention. Journal of Erlbaum Associates, 1976.
Verbal Learning and Verbal Behavior, 1964, 3, Williams, J. D. Memory ensemble selection in human
449-459. (b) information processing. Journal of Experimental
Treisman, A. M. Verbal cues, language and mean- Psychology, 1971, 88, 231-238.
relevant messages in selective attention. Journal of
Psychology, 1964, 77, 206-219. (c) Received August 30, 1976 •