You are on page 1of 50

Psychological Review

V O L U M E 8S NUMBER 2 MARCH 1978

A Theory of Memory Retrieval


Roger Ratcliff
University of Toronto, Ontario, Canada
A theory of memory retrieval is developed and is shown to apply over a range
of experimental paradigms. Access to memory traces is viewed in terms of a
resonance metaphor. The probe item evokes the search set on the basis of
probe-memory item relatedness, just as a ringing tuning fork evokes sympa-
thetic vibrations in other tuning forks. Evidence is accumulated in parallel from
each probe-memory item comparison, and each comparison is modeled by a
continuous random walk process. In item recognition, the decision process is
self-terminating on matching comparisons and exhaustive on nonmatching com-
parisons. The mathematical model produces predictions about accuracy, mean
reaction time, error latency, and reaction time distributions that are in good
accord with experimental data. The theory is applied to four item recognition
paradigms (Sternberg, prememorized list, study-test, and continuous) and to
speed-accuracy paradigms; results are found to provide a basis for comparison
of these paradigms. It is noted that neural network models can be interfaced to
the retrieval theory with little difficulty and that semantic memory models may
benefit from such a retrieval scheme.

At the present time, one of the major and guided by phenomena (e.g., those of the
deficiencies in cognitive psychology is the lack Sternberg paradigm and the Brown-Peterson
of explicit theories that encompass more than paradigm). Second, attempts to theorize
a single experimental paradigm. The lack of result in the construction of binary oppositions
such theories and some of the unfortunate (e.g., serial vs. parallel, storage vs. retrieval,
consequences have been discussed recently by and semantic vs. episodic). Thus, an appre-
Allport (1975) and Newell (1973). Two im- ciable portion of research in cognitive psychol-
portant points are made by Newell: First, ogy can be described as testing binary opposi-
research in cognitive psychology is motivated tions within the framework of a particular
experimental paradigm. This style of research
This research was supported by Research Grants does not emphasize or encourage theory con-
APA 146 from the National Research Council of struction; but without theory, it is almost
Canada and OMHF 164 from the Ontario Mental impossible to relate experimental paradigms
Health Foundation to Bennet B. Murdock, Jr. I would
like to thank Ben Murdock for his support, help, and or to substantiate claims that the same pro-
criticism and the use of data from his published and cesses underlie different experimental para-
unpublished experiments. I would also like to thank digms. Furthermore, the concern with binary
the following: Ron Okada and David Burrows for use oppositions tends to obscure the more inter-
of their data; Peter Liepa for programming Experi- esting aspects of data, such as the form of
ment 2; David Andrews for help with the mathematical
development; Gail McKoon, Bill Hockley, and Bob functional relations.
Lockhart for useful discussion; and Howard Kaplan The theory presented in the present article
for general programming assistance. is concerned with providing an account of
Requests for reprints should be sent to Roger
Ratcliff, who is now at the Department of Psychology, processes underlying retrieval from memory.
Dartmouth College, Hanover, New Hampshire 03755. Perhaps the most intensively investigated class
Copyright 1978 by the American Psychological Association, Inc. All rights of reproduction in any form reserved.

59
60 ROGER RATCLIFF

of memory retrieval paradigms has been the when any one of the parallel comparisons
item recognition paradigms, for example, the terminates with a match (self-terminating on
Sternberg paradigm (Sternberg, 1969b), the positives) or when all of the comparisons
study-test paradigm (Murdock & Anderson, terminate with a nonmatch (exhaustive on
1975), the continuous recognition memory negatives). When a decision has been made, a
paradigm (Okada, 1971), and the premem- response is initiated and performed. Figure 1
orized list paradigm (Burrows & Okada, 1975). illustrates the overall retrieval scheme.
In this area of research, several models have
been developed to deal with results from each Search or Evoked Set
paradigm, but each model is inconsistent with
some data (Ratdiff & Murdock, 1976; Stern- In some models developed to account for
berg, 1975). Also, the models are paradigm item recognition data, it is assumed that the
specific and do not generalize across paradigms. only memory items accessed in the comparison
The theory presented here has been designed process are the items in the experimenter-
to deal with all aspects of the data in each designated search set, that is, the positive set
paradigm (accuracy, response latency, and (Schneider & Shiffrin, 1977; Sternberg, 1966).
latency distributions) and has been designed There is a problem with such models because
to apply over a range of paradigms. In the it can be shown that items outside this positive
first section, a qualitative account of the theoryset may be accessed. For example, Atkinson,
is presented; in the second section, the mathe- Herrmann, and Wescourt (1974) report two
matical model is introduced and briefly de- experiments in which the recency of negative
scribed. The article is written so that an under- probes was manipulated. The first used a
standing of the mathematics is not necessary Sternberg (1966) procedure. Subjects were
for comprehension of the theory. A complete presented with a memory set of either 2, 3, 4,
development of the mathematical model is or 5 words, which was followed by a probe
presented in the Appendix. In the third sec- word. It was found that latency of response to
tion, the theory is applied to five experimental a negative probe word was greater and its
paradigms, in which both response latency accuracy lower the more recently it had oc-
and accuracy are measured, and the same set curred. The second experiment used a 25-word
of processing assumptions is used to account study list and a 100-word test list with no
for performance in all of the paradigms. The repeated negatives. After 50 test words, sub-
fourth section relates the theory to neural jects were required to read written instructions;
network models, semantic memory models, 10 words in the instructions served as negative
and propositiomal models, items in the remainder of the test list. Latencies
for the negative items that had been in the
A Description of tie Theory instruction set were significantly longer than
latencies for other negative items. Therefore,
In this section, I present an overview of what to explain the processes involved in item
I term the r^mmal theory. I have mot presented recognition, a model must include some
any mathematical development in this section mechanism that allows items from outside
in order to give a qualitative account of the the experimenter-designated search set to be
theory, which cam be understood independently accessed in the comparison process.
of the mathematical formulation. I will adopt the notion of search set suggested
Let us consider a typical item recognition by a resonance metaphor (Neisser, 1967, p.
task in which & group of items in memory has 65). Suppose each item in memory corresponds
been designated the memory search set and a to a tuning fork and the informational basis
single probe item is presented for testing. on which comparisons are made corresponds
According to the theory, the probe is encoded to frequency. Then, the comparison process
and then compared with each item in the search can be viewed as the probe tuning fork ring-
set simultaneaimsly (ie., in parallel). Each ing and evoking sympathetic vibrations in
individual comparison is assumed to be ac- memory tuning forks of similar frequencies.
complished by a random walk (more speci- Whether or not the probe evokes sympathetic
fically, a diffusion) process. A decision is made vibrations in a memory-set item depends only
A THEORY OF MEMORY RETRIEVAL 61

PROBE INPUT

EVOCATION OF THE
SEARCH SET

EVOKED SET
( POTENTIALLY VERY LARGE)

PARALLEL COMPARISON
(DIFFUSION PROCESSES)
UPPER BOUNDARY MATCH
LOWER BOUNDARY NON- MATCH

DECISION PROCESS
SELF TERMINATING ON MATCH
(OR gale)
EXHAUSTIVE ON NON-MATCH
(AND gate)

RESPONSE OUTPUT

Figure 1. An overview of the item recognition model.

on the similarity of frequencies and is indepen- number of items in memory. The reason is that
dent of whether or not the memory item is in "no" responses are based on exhaustive pro-
the experimenter-designated memory set. The cessing of items in the searchi set, so the deci-
amplitude of resonance drives the random walk sion time for a "no" response is a function of
comparison process: The greater the amplitude the group of slowest individual nonmatch
(better match), the more bias there is toward comparisons. Thus, many fast finishing pro-
a "yes" response; the smaller the amplitude cesses may enter the decision process without
(poorer match), the more bias toward a "no" affecting performance.
response (see Figure 2). I will designate items giving rise to this
So far, I have not indicated how many slower group of latency- and accuracy-deter-
memory items can be contacted by the probe. mining processes as the semck set. In later
It turns out that predicted performance will applications, the search set will usually be
be little affected even if the number of items approximated by the memory set. A demon-
contacted is large, perhaps comparable to the stration that the evoked set (defined as all

MATCH 1. CLOSE PROBE-ITEM MOO*


BOUNDARY FAST TEHMMMflrme PROCESS

2. MODERATE MATCH
SUW TEMHMMMG PROCESS

STARTING.
3. MODERATE NOW-MATOHt
POINT SLOW TERMINATING PROCESS

NON-MATCH . 4. EXfflEME NON-MWTCH


BOUNDARY FAST TERMWATIMG PROCESS

Figure 2. The relation between relatedness and amount of match in the dMfasiott lamdlwn) walk process.
(Relatedness varies from high [Process 1] to low [Process 4].)
62 ROGER RATCLIFF

items contacted by the probe) can theoretically Schulman (1970) studied a complementary
be very large is presented later in this section. task, that is, one requiring the subject to make
There, it is shown that many probe-item the judgment as to whether a probe word was
comparisons with very small resonant ampli- identical to, a homonym of, or a synonym of
tudes can be added to the few performance- a presented word. Ten words were presented to
determining processes (probe-nontarget item the subject (Sternberg varied-set procedure),
comparisons with larger resonances) without the judgment condition was cued, and the
changing latency or accuracy. probe word was presented. Accuracy, as
It is worth noting at this point that the measured by d", and latency behaved in much
retrieval theory outlined in Figure 1 in con- the same way: d' was highest and latency
junction with the resonance metaphor can be shortest for the identical condition, d' was
considered an implementation (for item recog- lower and latency longer for the homonym
nition) of content-addressable memory models condition, and d' was lowest and latency
or broadcast models (Bobrow, 1975). The longest for the synonym condition.
notion of resonance also suggests similarities Some similarity effects can be quite un-
to the pandemonium model (discussed in expected. For example, Morin, DeRosa, and
Neisser, 1967) for feature encoding. Stultz (1967) and Marcel (1977) have shown
that the more numerically remote a negative
Information Entering the Comparison probe is from the memory set, the faster the
Process response (e.g., Probe 1 is faster than Probe 6
Here I examine in more detail factors to Memory Set 798). Thus, numerical related-
that influence the "amplitude of resonance." ness can enter the recognition process.
Seward (1928, cited in Woodworth, 1938) All of these results suggest that similarity
performed an experiment in which sub- between probe and memory-set items is a
jects were presented with 30 patterns ("fancy major determinant of recognition performance,
papers"). Following 10 minutes of unrelated and that performance is best conceived in terms
activity, a yes-no recognition test was given in of discriminability between memory-set items
which 10 test items were identical to the study and distractors.
items, 10 rather similar, 10 slightly similar, and Further, the fact that so many factors in-
10 very different. It was found that as similarity fluence item recognition performance raises
decreased, the proportion of "yes" responses the question, How do we conceptualize the
decreased and latency of "yes" responses in- structure of the memory trace? Tulving and
creased. Similarly, as similarity decreased, the Bower (1974) have suggested that the most
proportion of "no" responses increased and generally accepted conception of the trace is
latency of "no" responses decreased. that of a collection of features or a bundle of
More recently, Juola, Fischler, Wood, and information. In addition, Tulving and Bower
Atkinson (1971) performed an experiment describe methods that have been used to study
with words as stimulus and probe items. the features or combination's of information in
Stimulus words were memorized to a criterion the trace; some of the methods are similar to
of perfect recall, and test words were presented those described earlier in this section. However,
sequentially at a fixed rate. There were three they conclude by stressing that inferences
types of negative items: homophones and about trace structure can only be drawn in the
synonyms of target words and neutral words. context of a process model. The theory pre-
It was found that synonyms were 60 msec sented here specifies a process model in great
slower than neutral words, and homophones detail but does little more than describe the
were 120 msec slower; although when homo- memory trace as a bundle of information while
phones were broken down into two types, allowing some quantitative assessment of the
visually similar and visually dissimilar to extent to which certain (possibly featural)
target items, differences were 200 msec and 40 information is represented in the memory
msec, respectively. These results show that trace.
semantic, visual, and phonemic similarities It seems that many qualitatively different
between probe and memory items affect types of information are contained in the
recognition performance. memory trace. In order to determine how well
A THEORY OF MEMORY RETRIEVAL 63

a probe matches an item in the memory set, counter is decremented. The counter begins at
the interaction of the trace and probe informa- some starting value Z, and if a total of A
tion must be assessed. I will use the single term counts are reached, the probe is declared to
relatedness to describe the amount of match or match the memory-set item (A — Z more
size of resonance. Thus, in Figure 2, Processes feature matches than nonmatches). But if a
1 to 4 have relatedness varying from high to total of zero counts are reached, an item non-
low. Relatedness will be assumed to vary over match is declared. This process is called a
items because whenever a group of nominally random walk (see top panel of Figure 3).
equivalent items is memorized, some items are Relatedness corresponds to the ratio of the
better remembered than others (cf. strength number of matching features to the total
theory and attribute theory; Murdock, 1974). number of features. Two probe-memory-set
In quantification of the mathematical model, item comparisons involving identical related-
relatedness has variance (a normal distribution ness may still have different comparison times.
of relatedness is assumed), which allows the Suppose that in one process, the features are
calculation of a d' measure of discriminability. ordered so that the matching features are
Variability in relatedness is shown later to be processed first, whereas in the second process,
necessary to ensure that asymptotic accuracy nonmatching comparisons are carried out first.
is not infinite. It is possible to assess the extent Then, if the number of feature matches re-
to which different kinds of information con- quired for an item match is sufficiently less
tribute to the memory trace, for example, by than the total number of features, the two
calculating d' discriminability values for dif- comparison processes will give quite different
ferent kinds of negative items in the experi- comparison times (the first fast and the second
ment by Juola et al. (1971) described earlier. slow) or perhaps will give different results
So far, I have made two main points. First, (item match and nonmatch, respectively). In
the resonance metaphor is used in order to general, there is variation in comparison time,
emphasize that we are dealing with a parallel depending on order of feature comparisons and
interaction between the probe and the repre- the distribution of matching and nonmatching
sentation of the memory-set items. The features within that order. Therefore, two
retrieval scheme suggested by the resonance sources of variance have now been identified:
metaphor allows performance to be affected by variance in relatedness and variance in the
items outside the memory set, although this comparison process.
feature will not be used here in any of the
applications of the mathematical model and is
included only for sufficiency. Second, all trace Diffusion Process
information is mapped onto a unidimensional
variable, that of relatedness, and relatedness is In the mathematical formulation of the
the dimension on which discriminability be- theory, the comparison process is represented
tween positive and negative items is assessed. by the diffusion process, the continuous ver-
sion of the random walk in both temporal and
spatial coordinates. The diffusion process has
Comparison Process the advantage that the mean rate of accumula-
tion of information and the variance in that
Comparison of a probe to a memory-set item rate are independent parameters. In the
proceeds by the gradual accumulation of evi- feature-matching model, the process is es-
dence, that is, information representing the sentially binomial, and mean and variance of
goodness of match, over time. It is easiest to a binomial are correlated; whereas the diffusion
conceptualize the comparison process as a process is essentially normal, and mean and
feature-matching process in which probe and variance are independent. The third panel of
memory-set item features are matched one by Figure 3 illustrates the diffusion process.
one. A count is kept of the combined sum of the Diffusion and relatedness. The critical as-
number of feature matches and nonmatches, sumption made is that drift rate in the dif-
so that for a feature match, a counter is in- fusion process (in Figure 3, average distance
cremented, and for a feature nonmatch, the traveled vertically per unit time) is equal to
64 ROGER RATCLIFF

THE RANDOM WALK PROCESS

DISTRIBUTION OF NUMBER
OF STEPS TO THE MATCH
BOUNDARY (HITS)
MATCH BOUNDARY

NON-MATCH BOUNDARY

DISTRIBUTION OF NUMBER OF STEPS TO


THE NON-MATCH BOUNDARY (MISSES)

RELATEDNESS DISTRIBUTIONS FOR TARGET ITEMS (SIGNAL


NON-TARGET ITEMS (NOISE )

NON -TARGET
ITEMS *
TARGET ITEMS

VARIANCE *

RELATEDNESS

THE DIFFUSION PROCESS

( FOR TARGET ITEMS WITH RELATEDNESS DISTRIBUTION MEAN U +• VARIANCE f\f

DISTRIBUTION OF FIRST PASSAGES


TO THE MATCH BOUNDARY! HITS )

MATCH BOUNDARY

T
^-.VARIANCE OF DRIFT «S 2

• DRIFT PARAMETER

NON-MATCH BOUNDARY

DISTRIBUTION OF FIRST
PASSAGES TO THE
NON-MATCH BOUNDARY(MISSES)
Figure 3. An illustration of the random walk and diffusion process, together with relatedness distribu-
tions that drive the diffusion process.

the relatedness value. Thus, any particular relatedness, one for probe-target item com-
probe-memory-set item comparison has drift parisons and one for probe-nontarget item
equal to the relatedness value of that com- comparisons.
parison, so that, for example, the greater the Parameters for the comparison process.
probe-memory-set item relatedness, the faster Figure 3 shows the six parameters of the com-
the match boundary is reached. The second parison process. Over the five paradigms ex-
panel of Figure 3 shows two distributions of amined, variance in relatedness (if) and vari-
A THEORY OF MEMORY RETRIEVAL 65

ance of drift in the diffusion process (s2) were


i— REACTION TIME DISTRIBUTIONS
kept constant. Thus, there were four free
comparison process parameters to be fitted to
data: relatedness parameters u and v and
diffusion process boundary parameters z and a.
These, plus an encoding and response stage
parameter TEE, summarize performance in
item recognition experiments.
Variable criteria. For any particular item
recognition task, there are three variable
criteria to be set by the subject: the zero point time
on the relatedness dimension (corresponding
to (8 in signal detection ^theory) and the two
boundary positions in the diffusion process.
Note that an item with relatedness zero, that
Figure 4. A geometrical illustration of the mapping
is, at the relatedness criterion, will have an from a normal relatedness distribution to a skewed
average rate of accumulation of evidence that reaction time distribution (with variance in drift
leads to no systematic drift toward either the si = 0). (Note that as relatedness decreases, the dis-
match or nonmatch boundaries. Changes in tribution tail skews out. a represents the distance
between the bottom and top boundaries of the diffusion
boundary positions are the basis of speed- process, z represents the distance between the bottom
accuracy trade-off and are discussed in a later boundary and the starting point, and « represents the
section. mean of the normal relatedness distribution.)
Reaction time distributions. One feature
that distinguishes the retrieval theory from process is self-terminating on a match, so
most other models in this area of research is that if enough evidence is accumulated in just
that reaction time distribution shapes are one comparison process, a "yes" response is
predicted and fitted. Figure 4 illustrates how produced. In contrast, the decision process is
the normal distribution of relatedness with exhaustive on nonmatch comparisons, so that
mean u maps into a skewed reaction time dis- a "no" response is produced only when all
tribution. Also shown in Figure 4 is an illu- comparison processes terminate in nonmatches.
stration of the way in which the distribution It has already been noted that the finishing
changes as u changes: When u decreases, the time distribution for a single comparison is
fastest responses slow a little, and there is a similar to observed reaction time distributions.
large elongation of the tail of the distribution. It turns out that the distribution of finishing
Thus, a prediction of the model is that if times of the maximum of a number of diffusion
relatedness decreases, the mean and mode of processes (each with a skewed finishing time
the distribution will diverge. In general, the distribution) is again shaped very much like
mathematical model will be manipulated to observed reaction time distributions. Thus,
yield expressions for reaction time distributions the mathematical model produces acceptable
and accuracy for both hits and correct rejec- shapes for both positive and negative response
tions that can then be fitted to data. time distributions.
Further aspects of the comparison process In most item recognition studies, "no"
are discussed later in relation to speed-ac- responses are almost as fast as "yes" responses,
curacy trade-off and in relation to other models and sometimes, this finding has provided
of the comparison process. problems for certain models (see Collins &
Quillian, 1970; Smith, Shoben, & Rips, 1974).
Decision Process This is not a problem for the retrieval theory,
although it may be hard to see how "no"
The decision process can be viewed as a responses, which are based on exhaustive
combination stage in which the products of processing of the search set, can be fast enough.
the many comparisons are combined to produce In fits of the mathematical model, it is found
either a "yes" or "no" response. The decision that the separation of the starting point and
66 ROGER RATCLIFF

Table 1 Speed-Accuracy Trade-off


Accuracy and Latency as a Function of the
Number of Extra Fast Finishing Processes In many models of item recognition, ac-
for Correct Rejections, with v ~ — A, curacy and latency are not explicitly related,
a = .12, z = .03, and Search Set Size 16 even though it is well known that a subject
can sacrifice accuracy to produce a faster
No. standard ^ response (e.g., Pachella, 1974). There are
deviations r
No. extra (of the m several methods for studying speed-accuracy
processes process dis- trade-off, for example, Wickelgren (1977) lists
m tribution) be- Mean six methods. His six methods fall into three
(evoked low the non- reaction basic classes: (a) methods that induce the
set = m target dis- Proportion time
+ 16} tribution correct (in msec) subject, for example, by use of instructions or
payoffs, to adjust tlje amount of information
0 0 .839 358 required for a response; (b) methods that
too 2, .838 363 force the subject, for example, by use of dead-
500 3, .838 362 lines, response signals, or time bands, to
5,000 4*7 .838 358
25,000 5» .838 363 respond within some time limit; and (c)
100,000 5, .835 358 methods that partition reaction times into
groups (e.g., those between 420 msec and 440
msec, those between 440 msec and 460 msec,
noiunatch boundary (z) is smaller than the and so on) and compare accuracy values within
separation of the starting point and match each partition.
boundary (a — z). Therefore, it is even possible The method of partitioning reaction times
to have "no" responses much faster than "yes" has been discussed by Pachella (1974) and
responses (as found by Rips, Shoben, & Smith, Wickelgren (1977). Pachella claims that the
1973). speed-accuracy function obtained from this
procedure is probably independent of the
Resonance Metaphor speed-accuracy functions obtained from the
It was argued earlier that the search set other two methods. In fact, the speed-ac-
could be made very large, but that reaction curacy functions obtained from partitioning
time and error rates would still be determined simply tell us about the latency density func-
only by the slower comparison processes. tions for error and correct responses and, in
Suppose that in a typical fit to data from the terms of the retrieval theory, are largely un-
study-test paradigm, m extra processes are related to the other speed-accuracy measures
added to the 16 probe-memory-search-set and so are not considered further.
comparison processes. Table 1 shows accuracy In the retrieval theory, the relation between
and reaction time as a function of m and r, speed and accuracy is central, and the diffusion
where r is the number of standard deviations comparison process (plus the decision process)
the extra process distribution is placed below may be viewed as a transformation from a
the nontarget item distribution. Note that the relatedness discriminability scale to observed
number of extra processes is adjusted, so that speed and accuracy measures. The first speed-
the change in latency and accuracy is small for accuracy method, in which the amount of
the particular value of r used; thus, more information required for a response is manipu-
processes than m would produce significant lated, and the second method, in which the time
changes in accuracy and latency. The results in of response is manipulated, can be viewed as
Table 1 show that many probe-item com- complementary in terms of the theory. When
parisons with small relatedness may enter the instructions or payoffs are manipulated, the
decision process without affecting accuracy or process will be referred to as information
latency. These results suggest that the reso- controlled; when response signals or time win-
nance metaphor has the right sort of properties dows are used, the process will be referred to
to be useful in theorizing about item recogni- as time controlled.
tion processes. Information-controlled processing. To
A THEORY OF MEMORY RETRIEVAL 67

demonstrate the way in which speed-accuracy probe-memory-set item comparisons: one


trade-off is modeled when instructions or match and one nonmatch. The assumption is
payoffs are manipulated, let us consider two that payoffs or instructions induce the subject

/\^__DISTRI8UTION OF HITS
' ^ MATCH

PH..8
TIME
Z
PCR'-B
0 NON -MATCH
DISTRIBUTION OF CORRECT REJECTIONS

I NON-MATCH
V BOUNDARY
INCREASED

MATCH BOUNDARY
INCREASED

NON-MATCH
BOUNDARY
INCREASED FURTHER

Figure 5. An example of the change in comparison time distributions and correct response rates (ps is
the probability of a hit and pen. is the probability of a correct rejection) for changes in boundary posi-
tions for one match process and one nonmatch process, with all other parameters constant. (The nu-
merical values and positions and shapes of the latency distributions illustrate the qualitative relations
between the three speed-accuracy conditions but are not quantitatively exact. [Note that in the middle
panel, there will be a slight decrease in ^CR.] Processes 1 and 2 illustrate comparisons that would have
terminated at the wrong boundary if the boundaries had been close as in the top panel, a is the dis-
tance between the two boundaries in the random walk comparison process, and z is the bottom boundary
to starting point distance.)
68 ROGER RATCLIFF

MATCHING
COMPARISONS

NONMATCHING
COMPARISONS

t,

Figure 6. Spread of evidence as a function of time in the unrestricted diffusion process for one matching
and one nonmatching process. (At time t\, there is a large amount of overlap; at ta, the overlap has
reached asymptote, with asymptotic d' — (u — ti)/r; [see middle panel of Figure 3]. « is the mean of
the match relatedness distribution, and D is the mean of the nonmatch relatedness distribution.)

to adjust the positions of the match and non- increase in accuracy as the diffusion process
match boundaries and so to adjust the amount boundaries are moved apart.
of information required for a decision. For Time-controlled processing. Time-controlled
example, Figure 5 illustrates the effect of processing is modeled in a different way than
increasing the position of the nonmatch information-controlled processing. Figure 6
boundary (middle panel) and increasing the shows the way evidence accumulates from two
position of both boundaries (bottom panel). distributions of relatedness: (a) a probe-
When the position of the nonmatch boundary memory-set item match and (b) a nonmatch
is increased, "no" responses are made slower (see Figure 3, middle panel) as a function of
and the number of misses reduced. When both time. The decision assumption is that if a
boundaries are moved apart, latency and particular comparison is stopped at time ti
accuracy of both hits and correct rejections and the amount of evidence is greater than
increase. the starting value, then the comparison is
As the boundaries move apart, the reaction called a match, and if the amount of evidence
time distributions both shift (the fastest is less than the starting value, the comparison
responses or leading edge of the distribution is called a nonmatch, (For mathematical
become slower) and spread (the mode and tractability later, it is assumed that the
mean of the distribution diverge), as shown in match and nonmatch boundaries are far apart
Figure 5. The bottom panel of Figure 5 and that the random walk diffusion process
illustrates the time course of evidence ac- can be considered unrestricted). Figure 6
cumulation for four comparison processes. shows the way evidence has accumulated at
The two processes marked "1" and "2" are times ti, fa, and t3. At early times (ti), there is a
examples of processes that would terminate large amount of overlap, and d' is rather small.
incorrectly with narrow boundaries but that At later times (fe and k), target and nontarget
terminate correctly with the relatively wide distributions have diverged and are approach-
boundaries shown in the bottom panel. It is ing an asymptotic form that corresponds to
the correct termination of processes such as the relatedness distributions shown in the
Processes 1 and 2 that is responsible for the middle panel of Figure 3. Thus d' as a function
A THEORY OF MEMORY RETRIEVAL 69

of time approaches an asymptote. (Mathe- agreed that the exhaustive serial scanning
matical expressions for d' as a function of time model is simple and easily testable, with only
and fits to data are shown later.) three parameters, one slope parameter, and
Thus, we see that the random walk com- two intercept parameters. It is not generally
parison process can account for results from realized, however, that to fit reaction time
both information- and time-controlled speed- distributions (using additive-factors logic and
accuracy experiments. the Pearson system of frequency curves), the
serial scanning model suddenly requires nine
Model Freedom parameters (Sternberg, Note 1). If accuracy
performance were to be fitted also, then still
In the description of the retrieval theory, more parameters would be required. Thus,
I have stressed the flexible nature of the pro- the number of parameters needed for the
cessing system, and this flexibility shows up retrieval theory to fit results from the Stern-
as apparent model freedom. I now discuss berg paradigm may turn out to be much the
the problem of model freedom and indicate same as the number of parameters needed by
which functional relations provide reasonably the serial scanning model. This discussion
constrained tests of the mathematical model. shows that there are certain constraints that
When applied to an experiment, the mathe- are rather hard to evaluate, for example, how
matical model has five variable parameters, many degrees of freedom are taken up in
namely, TEE, z, a, i>, and u. The encoding and representing reaction time distributions.
response parameter TEE. is fixed, so that in It was argued earlier that many kinds of
order to evaluate the effect of independent information enter the comparison process,
variables on performance, TER can be con- including recency information. The decay of
sidered constant. In any situation where experi- recency information (or strength) has been
mental conditions change between trials and modeled several times, and various simple
the subject can know which condition is being models with few parameters have been de-
tested (e.g., set size in the Sternberg paradigm),veloped (Atkinson & Shiffrin, 1968; Murdock,
it is possible that criteria (z, a, v, and u) will1974; Norman & Rumelhart, 1970; Wickel-
change across conditions. So, there are four gren & Norman, 1966). The retrieval theory
parameters free to vary in fitting the data. describes performance as a function of recency
There is a rather weak constraint on the value with a series of relatedness values u. Thus,
of these parameters in that if performance for example, there will be a separate value of
exhibits regular behavior, then changes in relatedness u at each serial position, that is,
z, a, v, and u should be regular. More useful there will be as many free parameters as there
tests of the mathematical model can be found are serial positions. If some assumptions about
in experiments where, on any particular trial, decay in relatedness were made, then this
it is not possible for the subject to know which series of relatedness values could possibly be
condition is being tested prior to retrieval (e.g.,fitted by a simple few-parameter model (as in
serial position in the Sternberg paradigm). the models mentioned above), thus reducing
In such situations, criteria cannot be adjusted the number of free parameters.
to systematically vary with the independent In conclusion, it seems that the problem of
variable. Thus, the only parameter free to vary model freedom is more complex than it would
is relatedness u, and with only u varying, appear at first sight. Simple few-parameter
accuracy and latency must be simultaneously models require the addition of many more
fitted. parameters to account for anything more than
One major aim in developing the mathe- crude summary statistics, and complex models
matical model is to fit reaction time distribu- may be quite tightly constrained in non-
tions. In fact, reaction time distributions are obvious ways.
fitted without the need to add any further
parameters. This account contrasts sharply Mathematical Model
with that given by the serial scanning model In this section, some of the principal param-
for the Sternberg paradigm. It is generally eters, functions, and expressions for the
70 ROGER RATCLIFF

mathematical model are introduced. A much comparison with relatedness £ ; as noted earlier,
more complete discussion of the random walk drift in the diffusion process is set equal to
and diffusion processes with derivation com- relatedness. Let the nonmatch boundary be at
bination rules for exhaustive and self-terminat- zero, the starting point of the process be at z,
ing parallel processing are presented in the and the match boundary be at a. Then, if s2
Appendix. Later in the present section, the is the variance in the drift in the diffusion
method of fitting the theory to data is de- process, the probability of a nonmatch is given
scribed, checks on the fitting program are by
described, and relations to other models are
discussed.
(Throughout this section the subscript minus
Parameters of the Mathematical Model will refer to a nonmatch and the subscript
The parameters used in the mathematical plus will refer to a match.) The finishing time
model arise from different theoretical sources density function for a nonmatch is given by
and may be summarized as follows: (a) for the -ITS'
comparison (diffusion) process: lower bound-
ary zero, starting point in the diffusion process /a'X. (2)
z, and upper boundary a; drift in the diffusion
process (equal to relatedness) and variance in The equivalent expressions for a match
the drift s2; (b) for probe-item relatedness: a [T+(?) and g+(t, £)] can be found by setting
normal distribution N(u, rf) for matching £ = — £ and 2 = a — z in Equations 1 and 2.
comparisons and a normal distribution N(v,»;) Note that g-(t, £) is not a probability density
for nonmatching comparisons; and (c) an function because some proportion of compari-
encoding and response output parameter rBR. sons end up as matches. However, g-(t, £)/
Note that s2 and if are kept constant through- 7-(£) is a probability density function.
out the fits presented in this article, so that the
only variable parameters are a, z, u, v, and Decision Process
TER. Thus, M and v alone represent input from
memory into the decision system. The decision process is self-terminating for
matches and exhaustive for nonmatches. Thus,
Diffusion Process in order for a "no" response to be made, all
the comparison processes must finish, and the
The diffusion process is used to represent decision time is the maximum of the individual
the accumulation of evidence for a single comparison process times. Let Gi-(t) be the
probe-memory-set item comparison. As dis- finishing time distribution function
cussed earlier, one has to select aspects of the
data (summary statistics) as points of contact G-(t, & =
between theory and data. I have chosen to use
mean error rate and reaction time distribution
for a nonmatch of process i, and Gm&K(t) be
statistics, and so it is necessary to derive
the finishing time distribution function for the
expressions for error rate and finishing time maximum of n nonmatch processes. Then,
density functions for the diffusion process. In
the Appendix, expressions for error rate and
finishing time density functions are presented
for the discrete random walk. Taking the
limit as the number of steps becomes large and and also
the size of each step becomes small allows us to
obtain expressions for error rate and finishing (3)
z=-l
time density functions for the diffusion process
(as shown in the Appendix). where 7i-(£) is the probability of a nonmatch
Consider a single probe-memory-set item of the ith comparison, and 7-(£) is the prob-
A THEORY OF MEMORY RETRIEVAL 71

ability of obtaining a nonmatch from the n use the convolution distribution


comparisons. The expressions for the match
decision (minimum of the comparison processes
that terminate in a match) are more compli-
cated and are shown in Equations A19 to A22
in the Appendix.
Equations 1, 2, 3 and A20 form the basis for
computation of predicted reaction time dis-
£
as a meeting point of theory and data. The
(4)

tributions and error rates. convolution model is fitted to data, and


parameter values obtained serve to summarize
that data. Theoretical reaction time distribu-
Probe-Memory-Set Item Relatedness
tions are computed, and the convolution model
is fitted to the theoretical distribution giving
Because relatedness £ has a normal distribu-
theoretical values of the parameters. Figure 7
tion [_N(u, r;) and N(v, y) for matching and
shows some fits of the convolution model to
nonmatching comparisons, respectively], it is the theoretical distributions, and it can be seen
necessary to average G(t, £) and y(l-) over that the curves are similar.
relatedness before obtaining the final theo- In order to fit the theory to data, the equa-
retical expressions for error rates and reaction
tions derived earlier were used to give values
time distributions. of correct response rate pt and reaction time
A complete description of the mathematical
distribution parameters /*< and T«. (The
model, together with all the equations neces-
parameter a of the convolution model can be
sary to obtain reaction time distributions and
left out because generally a does not theo-
error rates, is presented in the Appendix.
retically and empirically change appreciably
with experimental condition, and the values
Fitting the Mathematical Model to Data for both theoretical and empirical estimates
are similar.) The corresponding three empirical
Despite the recent popularity of reaction parameters p,, pe, and re are estimated from
time research, reaction time distributions have the data (averaging across subjects), and the'
been largely ignored as a source of information. function
This is surprising in view of the fact that dis-
tributional information can be useful in (P. - o*. -
evaluating models (Ratcliff & Murdock, 1976;
Sternberg, Note 2). However, one problem in
using reaction time distributions concerns the
choice of statistics used to describe a distribu-
tion. The traditional method that uses mo- (summed over both hits [H] and correct
ments (Sternberg, Note 1) turns out to be of rejections CCR]) is minimized as a function of
little practical use because many thousands of the theoretical parameters. The variances <rM2
observations per experimental condition are and crT2 are asymptotic variance estimates for
required before stable estimates can be ob- the convolution model (see Table 2 of Ratcliff
tained. Furthermore, higher moments (third & Murdock, 1976), andffp2= />.(! - p.)/N,
and fourth) describe the behavior of the ex- where N is the number of observations on
treme tail of the distribution (Ratcliff, Note 3) which pe is based.
and, therefore, are of little practical interest. Because the theoretical reaction time dis-
Ratcliff (Note 3) has suggested that the param- tributions are best fits to the average reaction
eters of the probability density function of time distributions across subjects (/*« and re
the convolution of a normal distribution are averages over subjects), it is difficult to
[_N(IJ,, <r)] and an exponential distribution get any direct idea of how well the theory
C/(0 — (lA)e~"T] provide a good summary of accounts for the data. Ratcliff (Note 3) has
empirical reaction time distributions (see also developed a method of producing group reac-
Ratcliff & Murdock, 1976). I have decided to tion time distributions. Essentially, reaction
72 ROGER RATCLIFF

HITS CORRECT REJECTIONS

11=0.25 V= -.38
V = -. 43 Q= .121
0 = .116 2= o/4
Z- a/4 - .I40ms
u = .MOms 0= .037ms
CTr .037ms = ,255ms
s . 187 ms

03
<
CO
o
oc V= -.45
Q_ Q= .115
Z= 0/4
JU- .114 ms
0-= .031 ms
"t= .165ms

1.2 1.4
TIME (SEC)
7. Some sample theoretical reaction time distributions (solid lines) for theoretical parameters
u, a, z, and v and fits of the convolution model parameters p, a, and T to theoretical distributions (dots).
(These distributions illustrate the adequacy of the convolution approximation to the theoretical
distributions, ms = msec.)

time quantiles are derived for each subject cell, the convolution parameters that are the
and these quantiles are averaged over subjects average, across subjects, of convolution fits to
to give group quantiles. The group quantiles individual subject cells (Ratcliff, Note 3).
are then plotted on the horizontal axis of a
graph. If, for example, 5% quantiles were used, Checks on the Estimation Method
then 5% of the probability density would lie
between adjacent quantiles. Therefore, a In fitting moderately complex models, such
probability density function can be con- as this diffusion process model, there are many
structed by drawing equal area rectangles be- places where errors might occur. Therefore,
tween adjacent quantiles. The theoretical two kinds of checks should be carried out:
probability density function can be compared internal and external checks. For example, an
with the empirical function by plotting the internal check that should be carried out when
two functions on the same graph. This method numerically integrating to infinity involves
is used to show how well the theoretical reac- printing out the integrand over the range of
tion time distributions fit the data. It should integration and checking the values by hand
also be noted that the convolution parameters calculation. External checks can be somewhat
derived by fitting the convolution model to the more involved. The idea here is to compare
reaction time quantiles are almost identical to values obtained from the program with some
A THEORY OF MEMORY RETRIEVAL 73

known or established values. A program to argues that the Markov (no memory) property
compute accuracy and reaction time distribu- of exponential distributions is inconsistent
tions was implemented on an IBM 370-165 with what is meant by processing over time.
computer in WATFIV FORTRAN and, without For an exponential process, the expected time
the integration over relatedness (Equations to termination is independent of the time al-
A24 and A25), was implemented on a PDP- ready elapsed; whereas a more reasonable view
12A computer hi FOCAL language. Letting of cognitive processing would suppose that the
17 —» 0 in the FORTRAN program, the two pro- longer a process has been running, the greater
grams were found to give identical results. The the probability of its termination. Second, the
FOCAL program was checked by calculating use of exponentials to represent stage distribu-
values of first passage time distribution for tions may produce good fits to- mean reaction
a » z. These results were then compared with time but may not lead to theoretical distribu-
values obtained from the one barrier diffusion tions that match the shape of observed reac-
process, Equation 6 (see Cox & Miller, 1965, tion time distributions. Furthermore, when the
p. 221; Feller, 1968, p. 368), with probability additive-factors method has been applied and
density function component stage distributions extracted using
Pearson's method (Sternberg, Note 1), the
underlying distributions are far from ex-
g-(0 = ponential. Thus, a good strategy may be to
develop a model using exponentials and then
These values were identical, and so the evalua- test whether in fact the stage distributions are
tion of the model can be considered satisfactory. exponential. Third, simply assuming that pro-
cessing stages have certain finishing time
Relations to Other Models distributions avoids the crucial fact that both
latency distributions and errors arise from
In this section, I compare and contrast the common stages or mechanisms, and that
present theory with three classes of models and perhaps the major theoretical effort should
theories. The first class of models assumes focus on relating accuracy and latency.
exponentially distributed processing stages; Exponential distributions are an extremely
some of these models attempt to deal with the useful tool for beginning an investigation but
same experimental domain as the retrieval can be somewhat misleading theoretically if
theory. The second and third classes are they represent processing stage distributions in
random walk and counter models; these have a mature model.
been developed to account for both accuracy Random walk models. Several models of
and latency in choice reaction time and re- choice reaction time based on the random
search in perception. walk comparison process have been developed
Models with exponential processing stages. recently (Laming, 1968; Link, 1975; Link &
In several models of memory search and Heath, 1975; Stone, 1960). Here, I consider a
memory retrieval, exponential distributions recent formulation termed relative judgment
have been used to represent the finishing time theory (Link, 1975; Link & Heath, 1975) and
distributions of processing stages (Anderson & note some of the similarities and differences to
Bower, 1973; Atkinson, Holmgren, & Juola, the retrieval theory. :
1969; Townsend, 1971,1972). The exponential In relative judgment theory, a physio-
distribution is useful because it is a one- logically transduced value of the stimulus is
parameter distribution and has a skewed posi- compared to an internal referent or standard.
tive tail just as observed reaction time dis- During a unit of time, the difference obtained
tributions. Furthermore, the exponential is is added to an accumulator of differences. When
easy to work with and quite often allows exact the accumulated difference exceeds one of two
solutions to be obtained for complicated ex- subject-controlled thresholds, a response is
pressions (e.g., Anderson, 1974). However, made.
there are several problems in using the ex- Relative judgment theory has several im-
ponential distribution. First, Sternberg (1975) portant similarities to the theory developed in
74 ROGER RATCLIFF

this article. First and foremost, the decision trieval theory ensures that d' asymptotes as a
process is modeled by a random walk, and function of retrieval time. Third, relative
information discriminating the two stimuli is judgment theory has been developed to deal
mapped onto a single dimension. Second, there with the simplest case, namely, a two-stimulus,
are three criteria to be set by the subject: the two-response discrimination procedure. On the
value of the internal referent, corresponding other hand, the retrieval theory has been
to the zero point of relatedness in the retrieval developed to deal with the problem of match-
theory, and the two boundary positions. Thus, ing one item against many items in memory
there are the same number of variable criteria and discriminating one match against many
in the two mathematical models. Third, the nonmatches. Fourth, relative judgment theory
two theories are both concerned with the joint has provided somewhat cleaner experimental
behavior of accuracy and latency as a func- tests than will be shown in the next section.
tion of independent variables. However, it is somewhat difficult to compare
It is the differences between the theories that the power of the two sets of tests of the two
provide most insight into their relation. First, theories.
the random walk in relative judgment theory From this discussion, it can be seen that
has discrete steps, and the difference accumu- there is potential for making explicit a con-
lated in a particular interval has some dis- tinuity between memory retrieval and per-
tribution ; whereas the retrieval theory assumes ceptual discrimination.
continuous processing. Mathematical expres- Counter models. The class of counter models
sions for the first passage time (latency) dis- provides a serious competitor to random walk
tributions have not yet been obtained for models. Perhaps the major difference between
relative judgment theory, except in the case of counter and random walk models is that
binomial (or trinomial) and normal distribu- random walk models assume that positive and
tions of differences, as they have for the con- negative differences cancel, whereas counter
tinuous random walk. For choice reaction models assume separate counters for positive
time, error responses are faster than correct and negative evidence. Thus, for counter
responses when accuracy is high, and relative models, termination occurs when one of the
judgment theory accounts for this by assum- two counters exceeds a criterial count.
ing that the moment-generating function for Counter models have been examined in some
differences is nonsymmetric. This assumption is detail by Pike (1973) with specific applica-
equivalent to supposing that the individual tion to signal detection. In Pike's article,
step distribution is positively skewed in the many of the main properties of counter models
direction of an incorrect response. In contrast, are listed and discussed. Anderson (1973)
in the retrieval theory, the continuous random combines a counter model with a neural
walk has the normal distribution underlying network model to account for results from
performance [essentially drift is distributed choice reaction time studies and the Sternberg
N(u, s)~\. Second, the internal referent in paradigm. Huesmann and Woocher (1976)
relative judgment theory is assumed to have no have applied a counter model, together with
systematic drift over the course of testing. parallel processing assumptions, to the Stern-
Thus, there is only one source of noise or berg paradigm.
variance and that is in the comparison pro- One major problem with counter models
cess. In the retrieval theory, there is a further concerns the behavior of latency distributions
source of variance and that is in memory-set as a function of count criteria. When count
item-probe relatedness. Therefore, relative criteria are low (e.g., 1 or 2), latency distribu-
judgment theory predicts that if the response
boundaries are set at infinity, discriminability tions are reasonably skewed. However, as the
d' increases as a function of the square root count criteria increase (e.g., 4 to 10), the dis-
of the number of steps. This prediction is tributions become more normal (Pike, 1973,
quite reasonable if, for example, the choice is p. 61). This contrasts with the prediction of
between two well-discriminated tones. In random walk models that the mode and mean
contrast, variance in relatedness in the re- of the latency distribution should diverge as
A THEORY OF MEMORY RETRIEVAL 75

the random walk boundaries (criteria) are cesses. Ratcliff and Murdock (1976) provided
moved apart. a summary of empirical effects, and this
At the moment, counter models as well as summary (see Figure 8) is the basis for much
random walk models have not been investi- of the following discussion. It should be noted
gated in enough detail, and further research that the data actually come from a confidence
may provide useful competitive theories for judgment procedure but are modeled as though
both memory retrieval and perceptual they came from a yes-no procedure. It turns
discrimination. out that results from the two procedures are
similar (Murdock, Hockley, & Muter, 1977),
Experimental Paradigms so this approximation seems reasonable.
To model the study-test paradigm, I will
Up to this point, I have set up the basic assume that a test (or probe) item is encoded,
framework of the theory and derived the and parallel comparisons are made between
necessary mathematical expressions. In the the encoded test item and members of the
following sections, the theory is applied to search set. The search set is approximated by
five experimental paradigms. the 16-item study set, with the same normal
probability density function N ( v , y ) for each
Study-Test Paradigm nonmatching comparison (i.e., it is assumed
that only study items are accessed by the
In a typical study-test experiment, 16 test item and all nonmatching comparisons
stimulus words are presented for study, and have equal relatedness). The probability
then 32 words (16 old and 16 new in random density function of relatedness for a matching
order) are tested for recognition, one at a time. comparison is N(u, i)).
The study phase is paced at about 1 sec per
item, and the test phase is self-paced, with Serial and Test Position Functions
each test item staying in view until a response
is made. Responses are made on a 6-point The main empirical effects to be modeled
confidence scale, and accuracy and latency are are the behavior of latency and accuracy as a
recorded. function of study (or input) and test (or
The study-test paradigm has been studied output) position for old items and of test
recently in some detail (Murdock, 1974; position for new items (see Figures 8b and 8c).
Murdock & Anderson, 1975; Ratcliff & Data are taken from Ratcliff and Murdock's
Murdock, 1976), primarily as the empirical (1976) Experiment 1.
basis for the conveyor belt model (Murdock, Parameter variation. As noted earlier, when
1974) for item recognition. The conveyor belt variation in some experimental variable can
model assumes that a record of the temporal be known to the subject prior to retrieval, there
order of both study and test items is stored in is the possibility of changes in criteria as a
memory. In order to decide whether a test function of that experimental variable. Test
item was in the study list or not, a high speed position is one such variable. As test position
self-terminating backward serial scan is carried increases, forgetting of study items increases,
out over the temporal record of both test and and the subject may set more lenient criteria
study items. The conveyor belt model deals (increases a and z) in an effort to improve
with retrieval from supraspan lists only and accuracy. Thus, relatedness of probe-non-
brings coherence to a number of empirical target item comparisons (ti) and diffusion
results. However, the model has some problems process boundary parameters (z and a) can
and limitations, and these are discussed in change as a function of test position.
Ratcliff and Murdock (1976). Because of the Relatedness of probe-target item compari-
large amount of data collected in Murdock's sons (u) varies as a function of test position, as
experiments and because of the stability of do it, a, and z. However, unlike 11, a, and z, u
the empirical effects, the study-test paradigm varies as a function of study (or serial) position.
provides a very strong empirical foundation for The subject cannot alter criteria as a function
attempts to model recognition memory pro- of serial position because knowledge of the
76 ROGER RATCLIFF

LATENCY A
(o) CONFIDENCE JUDGEMENTS

«•+ CONFIDENCE JUDGEMENT

(b)TEST HITS CORRECT


LATENCY REJECTIONS LATENCY
POSITION
ACCURACY 'ACCURACY

OUTPUT POSITION OUTPUT POSITION

(c) PRIMACY EFFECT A


(INPUT POSITION) ACCURACY

LATENCY

INPUT POSITION

(d)RATE OF PRESENTATION ACCURACY

LATENCY

L5 PRESENTATION
0.6 RATE (SEC/ITEM)
(e) NUMBER OF fLATENCY
STIMULUS 'ACCURACY
PRESENTATIONS

LATENCY
• ' > NUMBER OF
OUTPUT POSITION ' 2
PRESENTATIONS

( f ) L I S T LENGTH LATENCY
64 ( LIST LENGTH )

OUTPUT POSITION

Figure 8. Schematic representation of functional relations obtained in the study-test paradigm (from
Ratcliff & Murdock, 1976). (IP and 2P represent once-presented and twice-presented stimuli,
respectively.)

study position of a probe item can only result position must both be predicted by changes in
from the retrieval of that item. Thus, changes ( the single parameter u.
in accuracy and latency as a function of serial The variance parameters s* and if are kept
A THEORY OF MEMORY RETRIEVAL 77

constant, as mentioned earlier, at (.08)2 and and z. The fits are quite acceptable, and it can
(.18)2, respectively. A single constant time be seen that, indeed, criteria change as a
TEE is assumed for probe encoding, preparation function of test position to compensate for the
for comparison and decision processes, execu- overall drop in discriminability (forgetting).
tion of the decision process, and response out- Hits. Figure 10 shows accuracy and mean
put processes. These processes could be reaction time as a function of study (or serial)
assumed to have some distribution of process- position, together with theoretical fits and
ing times with small variance, without altering values of relatedness u. From these fits, it can
the fits to distributions shown later. be seen that the one-parameter u can fit both
Correct rejections. Figure 9 shows accuracy, accuracy and latency and that changes in
mean reaction time, and reaction time distri- accuracy and latency can be modeled by
bution parameters /i and r for correct rejections changes in the single parameter u. Inspection
as a function of test position, together with of Figure 10 shows that u behaves in a very
theoretical fits and theoretical parameters v, a, regular manner as a function of study and test

.94
ACCURACY MEAN LATENCY
.90 (sec)

.86
.84
.82 .82
.80
.78 .78
.76
74 .74

.54 .30
(sec)
.53 28
.52 26

.51 24

-.38 16

15
-.40

-.42
038
036

2-8 9-16 17-24 25-32 2-8 9-16 17-24 25-32

TEST POSITION
Figure 9. Accuracy, mean reaction time, and reaction time distribution parameters (/u and T) as a func-
tion of test position for the study-test paradigm, together with the theoretical parameters », a, and z,
which are used to generate the theoretical fits. (Data are from Ratcliff and Murdock's, 1976, Experi-
ment 1.)
78 ROGER RATCLIFF

LATENCY msec ACCURACY RELATEDNESS U


0/P 25-32 0/P 25-32 -4
•*!
~* .2
V -^
I t I T I l I I i I I i

0/P 17-24

0/P 9-16

4 6 8 10 12 W 16 " 2 4 6 8 10 12 14 16 2 4 6 8 0 12 14
STUDY OR SERIAL POSITION
Figure 10. Accuracy and mean latency as a function of study position blocked by test, or output, posi-
tion (O/P), which is used to produce the fits. (Other theoretical parameters used in the derivations
are shown in Figure 9.)

position. There is a consistent primacy effect argued that the most satisfactory way to fit
across test (or output) position, and the strong the theoretical reaction time distributions to
recency seen at early test positions has dissi- data (in the case of this theoretical model)
pated by later test positions. Therefore, it is through the use of the convolution model as
seems likely that one could reduce the number an empirical summary of shape. Further, it
of degrees of freedom by modeling the pattern was also argued that the best way to display
of behavior of u just as the pattern of accuracy fits of the theory to data averaged over
performance in free recall has been modeled subjects is by the use of group reaction time
as a function of serial position and delay (i.e., distributions (see also Ratcliff, Note 3).
test position; Murdock, 1974, chap. 8). I have Figure 9 shows both the theoretical and
not attempted to fit such a functional relation empirical values of ju (leading edge) and T (tail),
because it adds little theoretically. Changes in parameters of the convolution model, as a
reaction time distributions as a function of function of test position for correct rejections.
serial position are mainly changes in spread of It is these data that posed problems for the
the distribution or the T parameter in the backward serial scanning (conveyor belt) model
convolution model (Ratcliff & Murdock, (Ratcliff & Murdock, 1976). The conveyor
1976), and this pattern of results is predicted by belt model predicts that most of the increase
the theory. in mean latency will be accounted for by an
Reaction time distributions. In the section increase in latency of the whole distribution
on fitting the mathematical model to data, I (/u parameter) rather than the tail (T param-
A THEORY OF MEMORY RETRIEVAL 79

eter). Although the theoretical distributions times, it was reasonable to use the data from
are fitted to empirical data by using the ^ and high-confidence correct responses because re-
T parameters (i.e., minimizing a function of sults were not changed much by addition of
H and T) of the convolution model, the retrieval lower confidence reaction times. Another
theory makes the strong prediction that reason to use only the high-confidence re-
changes in mean latency will be manifest sponses is that many lower confidence responses
mainly as changes in T (i.e., slower responses may be contaminated by other processes such
getting slower; see Figure 4). In fact, patterns as more than one comparison or a slow guess.
of results with changes in /* greater than In contrast, there are relatively few error
changes in T as a function of test position would responses, and high-confidence responses are
be impossible to accommodate without large typically 150 msec to 600 msec faster than
changes in random walk boundary positions low-confidence responses. The theory makes
a and 0. Large changes in a and 2 would change predictions (using parameter values obtained
error rates and hit latencies in such a earlier in this section for Ratcliff & Murdock's,
way that the set of data could not be fitted 1976, Experiment 1) about error reaction times
simultaneously. that are not too unreasonable. For example,
Figure 11 shows fits of the theoretical the theory predicts that miss reaction times
distributions to the group reaction time increase with output position at about 5
distributions for both hits and correct rejections msec per item (by linear regression), with an
as a function of test position. From these intercept of about 860 msec. The data for
graphs, it can be seen that the theory does a high-confidence responses show a slope of
good job of fitting the reaction time about 11 msec per item, with an intercept of
distributions. 820 msec. All miss responses combined (high,
Error reaction times. It was noted earlier medium, and low confidence) give reaction
that the data used in fitting the theory came times about 200 msec longer. For false alarms,
from a confidence judgment procedure rather predicted slope is about 5 msec per item, and
than a yes-no procedure. For correct reaction the intercept is 1,200 msec. High-confidence

CORRECT REJECTIONS HITS

TEST POSITION 2-8 TEST POSITION 2-8

TEST POSITION 9-16

TEST POSITION 17-24 TEST POSITION 17-24

TEST POSITION 25-32 TEST POSITION 25-32

Figure 11. Group reaction time distributions and theoretical fits to those distributions from Ratcliff
and Murdock's (1976) Experiment 1.
80 ROGER RATCLIFF

Table 2
Fits to Accuracy and Latency for Repeated and Nonrepeated Study Items in the Study-Test
Paradigm (Data are from Ratcliff & Murdoch's, 1976, Experiment 4)

Theoretical Experimental

Response u v a Accuracy M a" T Accuracy M (7 T

Correct rejection — -.38 .1 .74 472 37 216 .74 470 40 220


Once-presented hit .18 -.38 .1 .69 455 34 190 .70 450 40 190
Twice-presented hit .18 -.38 .1 .87 445 26 170 .84 450 40 160

Note, , a, and T are measured in msec; TEB = 350 msec and z = .02.

false alarms have slopes of 8 msec per item and slight theoretical overestimation of accuracy
intercepts of 830 msec, so the theory over- of twice-presented items.
predicts high-confidence false alarms. However,
adding lower confidence false alarms increases
reaction time by about 600 msec. Thus, the Rate of Presentation: Dangers Inherent in
theoretical error reaction times come near to "Between" Designs in Reaction Time
the data in some cases but are not so near in Experiments
other cases.
A major problem for strength- or familiarity-
Number of Stimulus Presentations based theories of retrieval is the result shown
in Figure 8d. As rate of presentation of study
When items are repeated in the study list, items decreases, accuracy increases; but re-
I assume that separate representations are action time increases instead of decreasing,
established. The assumption of separate as a strength theory would predict. In the
representations is made because of the prob- experiment from which Figure 8d was derived
lems that a single-representation strength (Ratcliff & Murdock's, 1976, Experiment 2),
theory appears to have (Anderson & Bower, rate of presentation was a between-sessions
1972; Wells, 1974). With two representations, variable. The retrieval theory would predict
there are two comparison processes with high the same result as a strength theory if all
relatedness values, and these two processes criteria were constant across conditions. How-
race to match. Thus, reaction time will be ever, there is a way out for the retrieval theory,
faster and accuracy higher than for singly and that is to suppose that criteria change
presented items, as is seen in the data (see with conditions.
Figure 8e). To model these data, representative As discussed earlier, one of the stronger
values of accuracy and latency for once- tests of the retrieval theory is made when
presented items are chosen, model parameters criteria cannot be adjusted to experimental
for those values are deduced, and then ac- conditions. Thus, a strong test of the theory
curacy and latency for the two racing processes can be made if presentation time can be made
are computed. To compute fits for twice- a within-trials variable. Then, only relatedness
presented items, it was assumed that the two u can vary as a function of presentation time
racing processes had equal u values, and per item, and variation in both accuracy and
Equation A22 was modified appropriately. latency must be predicted by variation in u
This approximation is reasonable, taking into alone. If the results shown in Figure 8d hold
account the input or serial position functions
up when presentation time per item is made a
shown in Figure 10, because in general, u
values are not too different at different serial within-trials variable, then the theory in its
positions. Table 2 shows theoretical fits and present form must be rejected. In Experiment
experimental results for accuracy and latency 1, presentation time per item is made a within-
distributions. The only discrepant result is the trials variable.
A THEORY OF MEMORY RETRIEVAL 81

Experiment 1 Results. Figure 12 shows the main result:


When presentation time per item is a within-
Method. A standard recognition memory procedure trials variable, then the longer an item is
was employed. A list of 16 study items was presented,
followed by a test list containing all the study items presented, the more accurate and faster is the
and an equal number of new items in random order. response to that item. This result contrasts
The lists were random samples from the University with the result found in Ratcliff and Murdock's
of Toronto word pool, a collection of 1,024 two-syllable (1976) Experiment 2 (shown in Figure 8d),
common English words not more than eight letters
long, with homophones, contractions, and proper where responses became more accurate but
nouns excluded. Each trial consisted of a random selec- slower as duration of presentation increased.
tion from the word pool. Each session had 32 lists, and Also shown in Figure 12 are values of u as a
there were no repetitions within a session. List genera- function of presentation time per item and
tion, display, and response recording were controlled predicted values of accuracy and reaction time.
by a PDP-12A laboratory computer. In each study
list, there were four different study times (.5, .8, 1.2, Thus, the theory makes the correct prediction
and 1.8 sec per item), and four study items (in random and is supported by the experimental results.
presentation order) were assigned to each of four study Implications for reaction time research. Be-
time conditions per list. The study list was terminated sides providing a critical test of the retrieval
by an instruction asking the subject to press a response
key to start the test phase. The test phase was self- theory, these results on presentation time per
paced, and items stayed in view until a response was item have important consequences for the
made. A confidence judgment procedure was employed, methodology of reaction time research. It has
and the subjects had to respond on a 6-point scale from often been argued that subjects in reaction
("sure new") to +++ ("sure old") by pressing
the appropriate response key. For each item tested, time experiments can adopt different speed-
input and output position (output position only for accuracy criteria under different experimental
new items), confidence judgment (i.e., key pressed), conditions, and subjects have been induced
and latency (stimulus onset to response key depression) to do so in many experiments (e.g., Banks &
were recorded. A S-msec time base was used. The four Atkinson, 1974; Lively, 1972). However, in
subjects were undergraduates in psychology at the
University of Toronto and were paid $30 for the 12 most reaction time experiments, there does not
sessions plus 1 practice session. seem to be much evidence of criteria being

.86 V = -0.402
0 : 0. 126
Z =0032
T ER = 297ms
o: CORRECT REJECTIONS
§.82 PREDICTED EMPIRICAL
RT 7&4m« 707i2m»
ACCURACY .844 .848± .003
.80

650-
30
en
E 0-
28 CO
Qj

§
:630

620
0.5 1.0 1.5 2.0 SEC 0.5 1.0 1.5 2.0 SEC
PRESENTATION TIME PER ITEM
Figure 12. Accuracy, mean latency, and relatedness u as a function of presentation time per item for
Experiment 1. (Theoretical parameters v, a, z, and TBR are shown in the figure. RT represents mean
latency; ms = msec.)
82 ROGER RATCLIFF

adjusted as a function of experimental condi- presentations was varied. The theory was able
tion. For example, Wickelgren (1977) states, to account for results from this experiment by
assuming separate representations, so that
To be completely fair, we do not know for certain that there were two processes with high relatedness
subjects' capability to achieve any degree of speed-
accuracy tradeoff (from chance to asymptotic accuracy) racing to the match boundary. Both latency
under speed-accuracy tradeoff instructions implies and accuracy were well fitted by results derived
that this capacity is used under reaction time instruc- from this assumption.
tions. Perhaps in reaction time experiments, all sub- An investigation into rate-of-presentation
jects use the same speed-accuracy criterion and use it
under all conditions, (p. 81) effects exposed important methodological
dangers in between designs involving the use of
The above results on presentation time show latency measurements. Ratcliff and Murdock
that a theoretically important functional (1976) showed that both accuracy and latency
relation (latency as a function of presentation increased as rate of presentation was decreased
time per item) contradicts theories when in a between-sessions design. The latency effect
studied in a between design yet supports the is contrary to many theories, including the
same theories when studied in a within design. theory developed here, if it is assumed that
There are two important consequences: (a) speed-accuracy criteria are kept constant.
In any reaction time experiment where criteria The present Experiment 1 varied presentation
can be changed as a function of experimental time per item within trials. Accuracy increased
condition, any differences in reaction time as as presentation time increased, as before, but
a function of condition may be interpretable reaction time decreased. Therefore, in between
only as a change in speed-accuracy criteria designs, it is possible for the subject to change
and nothing more, (b) Theories of reaction speed-accuracy criteria and thus to make
time should be flexible enough to deal with between latency comparisons subject to
criteria changes such as those demonstrated misinterpretation.
above.
Sternberg Paradigm
Study-Test Paradigm: Summary
The Sternberg paradigm is the most studied
The study-test paradigm provides a firm paradigm in memory research that employs
empirical basis for modeling item recognition latency measures. The method is simple: A
processes. The retrieval theory models results small number of items (digits, words, pictures,
by assuming that parallel comparisons between and so on) within memory span is presented
the probe (test item) and members of the to the subject one at a time. A probe item is
study list are made. The decision process is then presented, and the subject has to decide
self-terminating on matches and exhaustive if that item was in the set of study items or not.
on nonmatches. Test or output position func- The subject pushes one button for a "yes"
tions are well fitted by assuming that v, response, another for a "no" response, and
relatedness of nontarget comparisons, de- reaction time and accuracy are recorded. The
creases and diffusion process boundaries in- result that has been of most interest theoreti-
crease a little with increasing test position. cally has been the reaction-time-set-size func-
Latency and accuracy as a function of study tion. Often, reaction time has been a linear
position show serial position functions with function of set size, with equal slopes for
both primacy and recency. For fixed output "yes" and "no" responses. This result led
position, only the relatedness parameter « Sternberg (1966) to develop a serial exhaustive
can vary with serial position, and fits to both scanning model. The model is easily testable,
accuracy and latency with just this one and several serious problems have been found;
parameter varying are good. Latency distri- for example, the model is unable to deal with
butions and error latencies are also well serial position effects (Corballis, 1967; Cor-
described by theoretical predictions. ballis, Kirby, & Miller, 1972), repetition effects
Ratcliff and Murdock (1976) presented an (Baddeley & Ecob, 1973), stimulus and
experiment in which number of stimulus response probability effects (Theios et al.,
A THEORY OF MEMORY RETRIEVAL 83

1973), and nonlinear set-size effects (Briggs, the way in which the theory is applied to the
1974). There are alternative simple models that Sternberg paradigm. The experiment was
have been set up to account for subsets of designed to replicate typical patterns of results
the empirical findings, but it seems that each found in the varied-set procedure while maxi-
of these is inconsistent with some data (Stern- mizing conditions for serial position effects
berg, 1975), Thus, we find an area of research (fast presentation rate and short probe delay).
in which experiments are relatively easy to Serial position effects were sought because
perform and in which each of the competing changes in accuracy and latency as a function
models (in an unelaborated form at least) is of serial position can only be the result of
falsified. changes in the single relatedness parameter u,
There are more serious problems though. as all criteria are fixed. Two subjects were
First, it is generally found that error rate and each tested for eight experimental sessions in
mean reaction time covary (Banks & Atkinson, order to obtain reasonable estimates of error
1974; Forrin & Cunningham, 1973; Miller & rates and reaction time distributions.
Pachella, 1973), but little attempt has been
made to relate accuracy and latency theoreti- Experiment 2
cally (Wickelgren, 1977). Second, it seems
likely that each of the models could be elabo- Method. Two paid student volunteers served as
rated to account for most of the empirical subjects. Sequence generation, display, and response
recording were controlled by a PDP-12A laboratory
effects (Townsend, 1974), as long as mean computer. The memory set was either three, four, or
reaction time is the only statistic considered, five randomly selected digits, from the set zero to nine,
so that no resolution at this level of analysis with no repetitions. The memory set was presented
can be expected. Third, the main statistic sequentially at .5 sec per digit. One-tenth second
following the presentation of the last digit, a fixation
used to describe reaction time is mean reaction point was displayed, which remained on for .4 sec. The
time. Because of this emphasis, few of the probe digit immediately followed and remained on until
models are able to account for properties of a response was made. Each list length was presented
reaction time distributions (Sternberg, 1975). equally often, and each serial position within each list
This is important because distributional was probed equally often. Also, there were equal num-
bers of old and new probe items. There was a 3-sec delay
properties may prove to be decisive in evalu- between trials, with a fixation point appearing for the
ating models (Ratcliff & Murdock, 1976; last .5 sec of that period. Responses were made by
Sternberg, 1975, Note 1, Note 2). For reviews depressing one of two buttons on a response box, right
of this area of research, see Corballis (1975), hand for "yes" responses and left hand for "no"
responses. The subjects served in 8 sessions preceded
Nickerson (1972), and Sternberg (1969a, by 1 practice session. Each session consisted of 10 blocks
1969b, 1975). of 48 trials. Subjects were instructed to be as fast as
To model the Sternberg paradigm, I will use possible while maintaining high accuracy. Feedback
the same assumptions used in modeling the on accuracy performance was presented at the end of
each session.
study-test paradigm. The probe is encoded,
and parallel comparisons are made between Results. Figure 13 shows mean latency as a
the encoded probe and members of the search function of set size for hits and correct rejec-
set. The search set is approximated by the tions, together with linear least-squares re-
memory set, each item having the same gression lines. These show that the basic
normal probability density function N(v, TJ) finding, namely, parallel, linear set-size-latency
of relatedness for nonmatching comparisons. functions, is replicated. Figure 13 also shows
The probability density function of relatedness serial-position-latency functions, which have
for a match comparison is N(u,ri), where u a large recency effect and replicate Burrows
varies with serial position and set size, v, a, and Okada (1971), Corballis (1967), Corballis
and 2 may vary with set size, and ij and s are et al. (1972), and Forrin and Cunningham
kept constant at the same values used in the (1973).
study-test paradigm. As before, a single Figure 14 shows detailed fits of the mathe-
constant time is assumed for probe encoding, matical model to the data. As noted earlier,
response output, and so on. the mathematical model was fitted to the
Experiment 2 was performed to demonstrate summary statistics (proportion correct and
84 ROGER RATCLIFF

determined from the data, that is, latency


RT distribution parameters n and T and accuracy.
680 The fits to the data, although not perfect,
660
show no more than four deviations of theory
from data greater than about two standard
640 deviations out of 45 fitted statistics.
620 Figure 15 shows fits of the theoretical
600 distributions to the group reaction time
580
distributions. The theoretical distributions
capture the main features of the empirical
560 distributions, and changes in distribution shape
SET SIZE 3-*-1 T SIZE 4
540 as a function of set size and serial position are
520 well modeled by changes in the size of memory
2 3 4 5 set and relatedness u.
SERIAL POSITION
RT Model Freedom
CORRECT REJECTIONS
680 At this point, it may be argued that the
660 Sternberg exhaustive serial scanning model has
^22s+586 far fewer parameters than the random walk
640
model presented above. If mean reaction time
620 as a function of set size is to be fitted, then the
600 Sternberg model has 3 parameters and the
580 retrieval theory (for the above data) has 16
RT=l8s + 531 parameters (TEE, V, a, z, and 12 values of u)
560
or 7 if serial position effects are averaged.
540 However, if distributions are to be modeled,
1 2 3 4 5
the number of parameters in the Sternberg
SET SIZE (s) model jumps to 9 (Sternberg, Note 1): 4
Figure 13. Latency-set-size functions and latency- parameters for the comparison stage distri-
serial-position functions for Experiment 2. (Error bars bution and 4 parameters for the base (en-
are one standard deviation. RT represents mean coding, decision, and response output) distri-
latency in msec.)
bution (plus one parameter for the yes-no
intercept difference). Further, if error rates
latency distribution parameters /* and T) by
as a function of set size are to be fitted, then
weighted least squares for hits by set size and
perhaps as many as 6 more parameters have
serial position and for correct rejections by
to be added. The number of parameters in the
set size. The encoding and response output
two models is now comparable. If serial position
parameter TEE was kept constant across all
functions are considered, then as noted earlier,
these conditions, but it also turned out that
the Sternberg model is falsified, but the
to a good approximation, random walk bound-
retrieval theory does a good job. Thus, the
ary criteria a and z and relatedness of probe-
nontarget item comparisons v were constant problems of model freedom and model com-
across set size. Thus, latency and accuracy parison are not as simple and straightforward
for correct rejections changed as a function as might be imagined.
of number of items in the memory set (no Perhaps the major difference between the
additional degrees of freedom once a, z, and v retrieval theory and the Sternberg model can
were fitted). Latency and accuracy for hits be summarized as follows: The retrieval theory
varied only as a function of probe-target item (unlike the Sternberg model) provides an
relatedness u (and memory set size); thus, intrinsic tie-up between reaction time and
across serial position, the single parameter u accuracy, whereas the Sternberg model (un-
varied to produce fits to the three parameters like the retrieval theory) can make the advance
A THEORY OF MEMORY RETRIEVAL 85

prediction that the reaction-time-set-size func- if a theory of recency effects were developed,
tion is linear. then the prediction of relatedness as a function
Further reduction in the number of param- of serial position would be an important test of
eters in the retrieval theory might be accom- the conjunction of that theory with the re-
plished if the relation between relatedness u trieval theory.
and serial position could be described by a
few-parameter functional relation, for example, Error Reaction Times
exponential decay. I have not attempted to
fit such a functional relation because at In typical reports of Sternberg varied-set
present it adds nothing theoretically. However, experiments, error reaction times are tarely

HITS LIST HITS LIST HITS LIST CORRECT


LENGTH 3 LENGTH 4 LENGTH 5 REJECTIONS

V = -0.471
0 = 0 . 237
Z = 0.099
T£R= 259 mMC

—I—I—i—o_
4 5 1 2 3 4 5 2 3 4 5 2 3 4 5
SERIAL POSITION SET SIZE
Figure 14. Accuracy, latency distribution parameters (from the convolution model) tt and T, and related-
ness u as a function of serial position and list length for hits, together with accuracy and latency dis-
tribution parameters n and T as a function of set size for correct rejections in Experiment 2. (The filled
circles represent the theoretical values. For correct rejections, only number of processes in the search set
changes with set size; for hits, only relatedness u changes with serial position. All other theoretical
parameters [v, a, z, and TER] are fixed at the values shown in the bottom right-hand panel. Error bars
are one standard deviation, ms = msec.)
86 ROGER RATCLIFF

SET SIZE 3 SET SIZE 4 SET SIZE 5


CORRECT REJECTIONS CORRECT REJECTIONS CORRECT REJECTIONS
ffrh

SET SIZE 4 SET SIZE 5


HIT S.P1 HIT S P. 1

J ••
SET SIZE 3 SET SIZE 4 SET SIZE 5
HIT S.P. 2 HIT S P. 2 HIT S.P.2

d° lit,
m SET SIZE 3 SET SIZE 4 SET SIZE 5
HIT S.P 3 HIT S.P. 3 HIT S.P. 3

SET SIZE S SET SIZE 4


S P.5 S P4

4 5 6 7 .8 9 10 .9 IO II
REACTION TIME ( S E C )

Figure IS. Group reaction time distributions (bar graphs) and theoretical fits (dots) for correct rejections
by set size and for hits by set size and serial position (S.P.) for Experiment 2. (These data and fits
should be compared with the distribution parameters M and ^ in Figure 14.)

presented because error rates are usually low, (1975) reported that in his experiments using
so that error latencies are unreliable, and also the Sternberg procedure, error latencies were
because error latencies are not of theoretical generally smaller than latencies for correct
interest. However, error latencies can be responses. In Experiment 2 above, error
useful in deciding between theories (Corballis, latencies for Set Sizes 3, 4, and 5 were 940
1975; Murdock & Dufty, 1972). Corballis msec, 760 msec, and 1,005 msec for misses
A THEORY OF MEMORY RETRIEVAL 87

(average of 902 msec) and 1,307 msec, 909


msec, and 1,055 msec for false alarms (average
of 1,090 msec), respectively. Predictions of CORRECT RT..62sec
ERROR RATE* 1%
mean reaction time from the theory are far ERROR RT = .62 sec
too large. For typical values of parameters
used to fit Experiment 2 (« = .435, v = — .437,
a = .237, z = .099, and Set Size 4), reaction
time for false alarms is 2.05 sec; for misses,
it is 1.59 sec.
Error latency depends to a large extent on § 2 CORRECT RT = .66sec
ERROR RATE = 3 %
the tails of the relatedness distributions. There- > ERROR RT = 1.30 sec
fore, if the choice of normal distributions were 3
inappropriate, everything else in the theory m i
might still be tenable. For example, suppose
the relatedness distributions have extremely
long tails, so that about 1% to 2% of the total
probability density resides in these tails. Then, CORRECT RT*.66sec
theoretical values of latency and accuracy of ERROR RATE" 2 %
correct responses may not be altered ERROR RT = 2.05 sec

much, but error latency may change quite


significantly.
Figure 16 demonstrates the effect of the
shape of the distribution and tails of the -1.0 -.5 0 .5 1.0 1.5
distribution on latency and accuracy. For RELATEDNESS
reference, the bottom panel of Figure 16 shows Figure 16. Effect of the shape of the relatedness dis-
the normal distribution of relatedness for a tribution on correct and error reaction time (RT)
nonmatching comparison with correct and error and error rate. (The bottom panel shows a normal
latencies and error rate. The middle panel distribution of relatedness; the middle panel shows a
normal distribution, with 3% of the probability density
shows the same normal distribution, with 3% forming a rectangular distribution between —2 and +1
of the probability density removed and added relatedness; the top panel shows a rectangular distribu-
as a rectangular distribution over the range of tion of relatedness with the same 3% tail as in the
relatedness —2 to +1. The error rate is middle panel).
increased a little; correct latency is the same
as correct latency in the bottom panel, but Figure 15. The error rate drops to 1%, all errors
error latency is reduced to 1.3 sec, which is coming from the tail of the small rectangular
near the experimental result. distribution that extends into positive related-
The top panel of Figure 16 shows just how ness. But now error latency has decreased to
far it is possible to change the assumption of 620 msec. The assumption of rectangular
normal relatedness distributions without affect- distributions has some precedent in that Luce
ing correct reaction time or accuracy. The (1963) and Krantz (1969) have both used
large rectangle contains 97% of the probability rectangular distributions in high-threshold
density; and as in the middle panel, the small models to account for signal detection.
rectangle, between —2 and +1 relatedness, There are three conclusions that can be
contains 3% of the probability density. With drawn from this exercise. First, reaction time
this assumed relatedness function, mean distributions for correct responses are rather
reaction time for correct responses decreases to insensitive to the shape of the relatedness
.62 sec, with the leading edge (M parameter in distributions so long as there is a lump of
the convolution model) of the reaction time probability density around the appropriate
distribution increasing by 30 msec and the r relatedness value. Second, error rates depend
parameter in the convolution model decreasing on the amount of probability density near and
by 50 msec. The reaction time distribution is greater than the zero point of relatedness.
still very similar in shape to those shown in Third, error latency depends critically on the
88 ROGER RATCLIFF

Table 3
Theoretical Predictions for Latency and Accuracy oj Hits for Repeated Items
for Parameter Values in Experiment 2

Mean
Response type u Accuracy latency M a r

Once-presented hit .35 .966 686 433 47 253


Twice-presented hit .35 .999 566 429 44 137
Once-presented hit .435 .988 605 420 42 185
Twice-presented hit .435 1.0 514 415 38 99
Once-presented hit .55 .998 592 404 36 125
Twice-presented hit .55 1.0 465 399 32 66

Note. Mean latency, /i, <r, and T are in msec. Other parameter values are as follows: TER = 259 msec, v
= — .471, a = .237, z = .099, and memory set size = 4.

shape of the tail of the relatedness distribution such as the study-test paradigm, rather than
near the zero point of relatedness. Thus, a paradigm with error rates around 1% of the
reaction time distributions and error rates are total responses.
reasonably independent of relatedness shape
so long as there is a lump of probability density Repetition Effects
near the mean relatedness value of the normal
distribution used to fit the experimental data Baddeley and Ecob (1973) studied perform-
and so long as there is a reasonable amount of ance on memory sets containing repetitions
overlap. This shape-independent property using a varied-set Sternberg procedure and
suggests that the correct focus of the model is found faster recognition for repeated items.
on error rates and correct response latencies The effects on reaction time (50 msec to 100
rather than on error latencies. msec) were larger than those observed in the
Reed (Note 4) has some evidence that the study-test paradigm (30 msec to 40 msec;
signal distribution (in a signal detection see Table 2). Changes in accuracy were not
analysis of bilingual recognition) changes shape reported, probably because accuracy was near
over the time course of recognition. The signal 100%, and there were not enough observations
distribution starts off as multimodal, with a to allow the detection of significant differences.
signal peak in the middle of the noise distri- There are two possible ways to model the
bution, and then quite quickly becomes effect of repetitions. For repetition effects in
unimodal. This kind of change in relatedness the study-test paradigm, I assumed two
distributions (the theoretical equivalents of matching racing processes. The alternative is
signal and noise distributions in signal detec- to assume one representation twice as strong,
tion analysis) would give rise to fast errors in that is, a compressed memory set (presentation
the retrieval theory and so may be an alter- of Digits 3238 would lead to Memory Set 238
native to non-normal unimodal relatedness with 3 stronger than 2 or 8). However, Baddeley
distributions. Although this finding is a little and Ecob (1973) ruled out this possibility by
tentative, little is known about the time course noting that nonrepeated items from sequences
of recognition, and we should at least keep such with repetitions were no faster than items
possibilities in mind. from sequences (of the same size) with
One further point is that just as very long nonrepetitions.
outlier reaction times are often assumed to be To model the repetition result, I took
spurious, a certain proportion of errors may parameter values used in Experiment 2 and
also be spurious. For example, subjects often computed accuracy and latency for one and
report that they simply hit the wrong button. then two racing matching processes. Results are
Therefore, to get meaningful and stable error presented in Table 3 and show that twice-pre-
latency data, it is better to work with a sented items are between 64 msec and 120 msec
paradigm with reasonably large error rates, faster than once-presented items. Thus, the
A THEORY OF MEMORY RETRIEVAL 89

theory gives quite good quantitative agreement The fits to error latency were poor, and this
with the result presented by Baddeley and led to an investigation of the dependence of
Ecob (1973). fits of the model on the form of the relatedness
distributions. Results showed that mean
Discussion and Summary of the Sternberg latency and the shape of latency distributions
Paradigm for correct responses are rather insensitive to
the shape of the relatedness distribution so
There are many variations on the Sternberg long as there is a lump of probability density
paradigm published in the literature, but none around the appropriate relatedness value;
provide qualitative problems for the theory. further, error rate does not depend on the shape
For example, stimulus probability effects in of the relatedness distribution but on the
the fixed-set procedure have provided problems amount of probability density around the zero
for the serial exhaustive scanning model point of relatedness; finally, error latency is
(Miller & Pachella, 1973; Sternberg, 1975; dependent on the tail of the relatedness
Theios et al., 1973; Theios & Walter, 1974). distribution near the criterion. Although the
In Link's (1975) random walk model for fits to error latency were poor, it was shown
choice reaction time, stimulus probability that better fits could be obtained by changing
effects provide the basis of a successful test the shape of the relatedness distribution, and
of the theory. In effect, stimulus probability such changes could be made without affecting
affects the random walk boundary positions. predictions about correct latency or error rate.
Therefore, it seems likely that the memory Therefore, until a theory of the structure of the
retrieval theory could account for stimulus memory trace can be developed that will
probability effects in much the same way. specify the relatedness distribution, the most
There have been several speed-accuracy useful data for testing the mathematical model
studies using the Sternberg paradigm, but appear to be error rates and latency distri-
because the question of speed-accuracy trade- butions for correct responses.
off is central to the retrieval theory, the topic
is discussed later in a separate section. Supraspan Prememorized List Paradigm
To summarize, the Sternberg paradigm is
modeled in much the same way as the study- It has been argued that in order to compare
test paradigm. Parallel comparisons between reaction time results from experiments using
the probe and members of the search set are long lists (above memory span) with results
made. The decision process is self-terminating from experiments using short lists (subspan),
on matches and exhaustive on nonmatches. error rates should be low and approximately
The search set is approximated by the memory the same. To produce low error rates with long
set, even though representations of items from lists, prememorized lists have been employed.
earlier trials (with probe-item relatedness An experiment that shows the way in which
lower than probe-memory-set item related- the theory applies to the prememorized list
ness) may have to be included to account for paradigm is the Burrows and Okada (1975,
effects of recency of negative probes (Atkinson Experiment 1) study. This experiment used
et al., 1974). An experiment was performed the Sternberg fixed-set procedure, with pre-
to demonstrate fits of the mathematical model memorized memory-set items and list length
to data. All criteria were fixed as a function of varying from 2 to 20. Each word in the positive
set size, and changes in accuracy and latency set was tested once in the test sequence for
of correct rejections as a function of set size that list, together with an equal number of
were well fitted as a result of search set size new words. Subjects had to press one button
changing (no free parameters to produce those to indicate that the test word was in the study
changes). The covariation of accuracy and list, another button to indicate a new word.
the shape of latency distributions as a function The results showed parallel slopes for positive
of serial position was well accounted for by and negative responses as a function of set size
changes in the single parameter u (relatedness and low error rates. Reaction time as a function
for a probe-memory-set item comparison). of set size showed an apparently discontinuous
90 ROGER RATCLIFF

function. Subspan reaction times were fitted syllable words, and it is likely that the per-
by a linear function with a slope approximating ceptual centers of those words could vary even
that typically found in the Sternberg paradigm more than 80 msec. Of course, if the main
(35 msec to 55 msec per item). Supraspan reaction time statistic considered is mean
reaction times were fitted by a linear function reaction time, then there is no problem—just
with a much lower slope (13 msec per item). an addition of a little noise—but if distributions
It was argued that the discontinuity was are considered, then the use of verbal probes
evidence for separate long-term and short-term can lead to fast (as well as slow) outliers.
retrieval processes. However, in the same To obtain better estimates of distribution
article, it was shown that a continuous function parameters, I trimmed off 5% of the fast
(logarithmic) did almost as good a job of reaction times and refitted the convolution
fitting the data as the two linear functions. model. The average value of cr was reduced by
I now demonstrate that the data from the 47 msec to 57 msec. The main effect on the
Burrows and Okada (1975) study can be well other distribution parameters was to increase
fitted by the random walk retrieval theory, r, leaving /t about the same as before trimming.
which assumes only one kind of retrieval The retrieval theory was applied as in the
process. It should be noted that in terms of the study-test and Sternberg paradigms. Parallel
retrieval theory, it is not necessary to have comparisons are made between the encoded
low error rates in studying retrieval from long probe and members of the search set. The
lists. The study-test paradigm uses supraspan search set is approximated by the experimental
lists, and error rates can be as high as 25%. memory set, and each item has a normal
In order to fit the mathematical model to distribution of relatedness N(v, r]) for non-
the data, it is necessary to estimate reaction matching comparisons. Each memory-set item
time distribution parameters using the em- has a normal distribution of relatedness
pirical convolution model, as was done with N(u,rj) for matching comparisons, and the
other paradigms. There were between 70 and variance parameters r? and s are kept constant
90 observations per set size per subject, which at the values assigned earlier.
is barely enough to allow stable estimations The fits of the theory to the data are shown
of distribution parameters. The fits of the in Figures 17 and 18. Figure 17 shows fits to
convolution model were performed, and it was the empirical reaction time distribution param-
immediately apparent that the a parameter eters n and T for hits and correct rejections.
(the standard deviation in the normal com- The error bars represent the maximum likeli-
ponent of the convolution) was far larger than hood standard deviation estimates in param-
in any fits to any other paradigm: The average eters but do not take into account any subject-
value of a across subjects and set size was 104 to-subject variability. Figure 18 shows fits of
msec. The large value of a (and inspection of the theoretical error rates to data, together
the distributions) showed that the reaction with behavior of theoretical parameters u, v, a,
time distributions had slowly rising leading and z as a function of set size. The fits of the
edges, sometimes with short outlier reaction theory to data are by no means perfect. The
times. fact that the theory cannot be made to give
I was at a loss to explain why these reaction better fits demonstrates quite clearly that there
time distributions were much less skewed than are fairly powerful nonobvious constraints on
usual until I noted that the test words were the theory (before the fast reaction times were
presented verbally. A recent article by Morton, trimmed, the best fit was much worse; pre-
Marcus, and Prankish (1976) suggests dangers dicted miss rates were half the size of those
in this procedure. They demonstrated that if shown in Figure 18).
the physical onset of a series of words is fixed, Figure 18 shows values of u and v that allow
the time for perception of the words can be a d' measure of discriminability to be calculated
extremely variable. For example, the delay from d' = (u — v)/ri. d' starts at a value
between onset and perceptual center of the of 5.3 at Set Size 2 and decreases to 4.8 by
spoken digits "seven" and "eight" differs by Set Size 10 and remains at that value to Set
80 msec. Burrows and Okada used two- Size 20. The constancy of d' for longer lists
A THEORY OF MEMORY RETRIEVAL

can be interpreted as showing that the list would correspond to Processes 2 and 4,
prememorization criterion produced equiv- respectively, and repeated presentations would
alent relatedness levels for the larger set sizes. correspond to Processes 1 and 3, respectively,
The continuous change in d' as a function of thus leading to the observed pattern of results.
set size shows quantitative continuity between Several studies have investigated the situa-
subspan and supraspan retrieval in item tion in which there are different sets of memory-
recognition. set items stored in long- and short-term
Much of the research done using this memory (Forrin & Morin, 1969; Scheirer &
paradigm has been performed within the frame- Hanley; 1974; Sternberg, 1969b; Wescourt &
work of the Atkinson and Juola (1973) model. Atkinson, 1973). The general consensus seems
In the experimental procedure used by to favor models that postulate parallel access
Atkinson and Juola (1973), a study list was to short- and long-term sets (Wescourt &
memorized prior to the experimental session. Atkinson, 1976). For example, Wescourt and
In any block of trials, test items consisted of Atkinson (1973) used a procedure in which a
both new positive items and new negative 30-word long-term set was memorized pre-
items, together with all previously presented experimentally. In an experimental trial,
items. Results showed that repetition of subjects were presented with a short-term set
positive items made them faster and more of one to four words (or zero in a control
accurate, whereas repetition of negative items condition) before each test word. Test words
made them slower and less accurate. The way came from either the short- or long-term set,
in which the theory would deal with these with each having a probability of .25, or the
results can be seen by referring to Figure 2. test word was a negative. The set-size-latency
First presentations of positives and negatives function for probes from the short-term set

HITS CORRECT REJECTIONS


msec
800

TOO

600

900

t J50
msec
300

250

200

150

2 4 6 8 10 12 W 16 * 20 2 4 6 8 K) 12 14 16 18 20
SET SIZE
Figure 17. Reaction time distribution parameters (jn and T) as a function of set size in Experiment 2 (Bur-
rows & Okada, 1975), together with theoretical fits (continuous lines). (Note that TBR = 420 msec. Error
bars are one standard deviation.)
92 ROGER RATCLIFF

.04

' -- ' ! I ' L-

3 4 6 8 10 12 14 16 18 20 2 4 6 8 10 12 14 16 18 20
SET SIZE

Figure IS. Error rates as a function of set size in Experiment 2 (Burrows & Okada, 1975), together with
theoretical fits (continuous lines). (Also shown are theoretical parameters «, v, a, and 2 as a function
of set size. Note that TEE = 420 msec. Error bars are one standard deviation.)

was roughly linear with a slope of 21 msec per 1974). In contrast, when material in the two
item. In contrast, the latency function for sets is not highly discriminable, latency is a
probes from the long-term set (as a function joint function of short- and long-term set size.
of short-term set size and excluding Set Size 0)
was nearly flat, 1 msec per item slope, and Summary
about 100 msec slower than response to short-
term probes. Negatives showed the same Results from prememorized list procedures
pattern of results with a slope of about 5 are modeled in the same way as results from
msec per item and responses 40 msec slower the Sternberg and study-test procedures.
than responses to long-term probes. The Application of the theory to the Burrows and
specific model proposed by Wescourt and Okada (1975) study, in which list length was
Atkinson (1973) is similar to the decision varied from subspan to supraspan, showed
component of the retrieval theory, namely, there is no need to assume separate retrieval
parallel independent searches of the short-term processes for subspan and supraspan item
and long-term sets, self-terminating on posi- recognition. Furthermore, in experiments where
tives and exhaustive on negatives. To make the short-term and long-term sets are searched
two models identical, it is only necessary to simultaneously, it seems that the best model
assume independent and parallel probe- for the data assumes parallel search of both
memory-set item comparisons within both the sets in line with the retrieval schemes developed
long-term and short-term stores. Therefore, in this article.
because processing is the same for both sets
of items, there is no basis for separating short- Continuous Recognition Memory Paradigm
and long-term stores in terms of processing.
Another finding that is consistent with the In the continuous recognition memory
theory is the result that when material in the paradigm, single words are presented, and the
short-term set is discriminable from material subject has to press one button if the test
in the long-term set, there is no effect of long- word is new and another if the test word was
term set size on reaction time to short-term presented earlier in the test list. This task is
items and vice versa (Scheirer & Hanley, somewhat different from the three paradigms
A THEORY OF MEMORY RETRIEVAL 93

discussed up to this point in that items change not change the fits very much). At any test
from negative to positive on presentation, and position in the sequence, the test item can be
the search set is continually increasing in size. either new or old with lag values of 0, 1, 2, 4,
The principal independent variable is lag or 6. Therefore, in the theory, all criteria are
between successive presentations of an item, fixed and only relatedness u may vary with
and accuracy and latency are measured. lag. Thus, any single value of u must fit both
Okada (1971) performed two experiments accuracy and reaction time distributions
that investigated performance on this task. (through the parameters n and r). Empirical
In both experiments, mean reaction time as a results are presented in Figure 19 for propor-
function of lag was well approximated by a tion correct and reaction time distribution
negatively accelerating exponential. I have parameters M and T, together with theoretical
fitted the mathematical model to Okada's fits, as a function of lag. In these fits, i and ij
(1971) Experiment 2 to show how the theory is are fixed at .08 and .18 as before. The three
applied. In his Experiment 2, items were experimental statistics (accuracy and latency
presented once, twice, or three times, but only distribution parameters/* and T) are adequately
once- and twice-presented items were fitted fit with just one theoretical parameter u,
because there were not sufficient data to which varies as a function of lag.
perform distributional analyses on thrice- Thus, the pattern of results found in the
presented items. Even though the paradigm continuous recognition memory task can be
is different in many respects from the study- modeled in the same way as results from the
test and Sternberg paradigms, the same study-test and Sternberg paradigms.
processing assumptions are made. The memory
set changes from 0 to 150 in the course of a Speed-Accuracy Trade-off Paradigms
block of trials, but in order to approximate over
the whole block, the search set size was set Experiments to study speed-accuracy trade-
at 40 (though values between 30 and 60 would off are usually performed within the context

ms ms
160 160

140 140

120 120

100 no

80
90
1.00
V = -0.52

§
<r
.96
0 = 0. 13
Z = 0.025 .60 Z
Q
LJ
ffi

2 .92 .50 <t


tE LJ
£ .88 40
o:

.84
2 3 4 5 6 CR 0 I 2 3 4 5 6 CR
LAG
Figure 19. Proportion correct, reaction time distribution parameters M and T, and relatedness u for hits
as a function of lag for Okada's (1971) Experiment 2. (Theoretical fits are shown by open circles; also
shown are correct rejection data [CR]. The number of processes assumed to be in the search set was
40, and TEE = 520 msec. Theoretical parameters 11, a, and z are as shown in the figure.)
94 ROGER RATCLIFF

of a particular paradigm, for example, choice from the retrieval theory are compatible with
reaction time and the Sternberg paradigm the size of the effects obtained by Banks and
(Pachella, 1974), the Brown-Peterson para- Atkinson (1974), typical parameter values
digm (Reed, 1973), sentence processing used in Experiment 2 were taken, and random
(Dosher, 1976), cued recall (Murdock, 1968), walk boundary position parameters a and z
paired associate recall, and recognition (Wickel- were reduced to .04 and .015, respectively.
gren & Corbett, 1977). Because the topic is Latency for both hits and correct rejections
central to the theory developed in this article, decreased by 300 msec, and average error
it is dealt with in this separate section. rates increased from 2% to 18%. Thus, the
In the introduction, speed-accuracy trade- theory produces effects that are quite con-
off methods were partitioned into two classes sistent with results from the payoff method.
(excluding latency partitioning). Information- Payoff or instruction is usually a between-
controlled processing refers to the usual sessions variable; therefore, criteria can change
reaction time method in which the subject sets between conditions, and such experiments do
some limit on the amount of information not provide a strong test of the theory.
required for a response, and time of response Nevertheless, the demonstration that speed
is not limited. Experimentally, the amount of and accuracy can covary over a wide range of
evidence required for a response can be values is important in evaluating models of
manipulated using instructions (e.g., speed vs. retrieval processes. For further discussion of
accuracy) or payoffs. Time-controlled process- these methods, see Pachella (1974) and
ing refers to the class of experimental methods Wickelgren (1975,1977).
where the time of response is experimentally
controlled using methods such as response Time-Controlled Processing
signals, deadlines, or time windows.
In this section, it is shown that time-
Information-Controlled Processing controlled processing methods produce data
that provide a strong test of the retrieval
Each of the preceding item recognition theory. One example of time-controlled pro-
paradigms is an example of information- cessing is the response signal method that was
controlled processing. To check that the developed to avoid the problem that criteria
retrieval theory is capable of accounting for may change between conditions. In essence,
results obtained when speed-accuracy criteria the subject is given the probe followed by a
are changed, I shall show that changes in signal to respond presented a variable amount
boundary criteria are sufficient to account for of time after probe onset (Reed, 1973, 1976;
the kinds of changes observed in experimental Schouten & Bekker, 1967). Reed (1976) used
data when speed-accuracy conditions are the response signal method to investigate the
changed. Banks and Atkinson (1974) used the time course of recognition in the Sternberg
method of payoffs to investigate speed- varied-set procedure. One, two, or four letters
accuracy effects in the Sternberg varied-set comprising the memory set were presented
procedure. They found that with payoffs sequentially. Following probe onset, a signal
stressing accuracy, error rates were about 1%; to respond was presented at a lag varying
whereas with payoffs stressing speed, error from 7 msec to 4,104 msec. Subjects were
rates were between 10% and 30%, the slope required to respond "yes" or "no" as quickly
of the latency-set-size function was halved, as possible after the signal and to make a
and the intercept was reduced by about 200 confidence judgment on how confident they
msec. To model this pattern of results, it is felt at the time of response. There were two
assumed that under speed conditions, the analyses of major interest: d' as a function of
random walk boundaries are moved close response signal lag and response latency as a
together; whereas under accuracy conditions, function of signal lag. It was argued that the
the boundaries are moved relatively far apart first of these indicated the time course of
(see Figure 5 for an illustration). accumulation of evidence on whether the probe
To check that the numerical values generated was a member of the memory set, and the
A THEORY OF MEMORY RETRIEVAL 95

second gave evidence that the decision is based Equation 10 gives an expression for the
on the termination of a discrete process. growth of d' as a function of time for Memory
Before discussing the results, I will derive an Set Size 1. For Set Sizes 2 and 4, the decision
expression for the increase in accuracy as a rule must be taken into account: If at the time
function of signal lag from the theory. of the response signal, there is evidence from
In order to make the problem tractable, let at least one process greater than the starting
us assume that the match and nonmatch value 2, then a "yes" response is produced;
boundaries are placed far from the starting otherwise, a "no" response is produced (see
point, so that in the range of signal lags Equation A23). Therefore, to calculate d'
considered here, the diffusion process can be values for Set Sizes 2 and 4, it is necessary (a)
considered unrestricted. Also, let us consider to calculate d' values from Equation 10, (b) to
only Memory Set Size 1. In an unrestricted use the d' values to give values of hit rate and
diffusion process, the probability density false alarm rate, (c) to combine the hit rate
function h(x, t) at position x and time t with and false alarm rate according to the decision
drift f and variance in the drift *2 is given by rule, and (d) to thus produce a hit rate and
false alarm rate for the combined process from
1 which a d' value can then be calculated.
h(x,f) = (7)
Let p be the hit rate and q the false alarm
rate for Set Size 1 at some time t (derived from
(Feller, 1968, p. 357). In the theory, however, the d' calculated from Equation 10). Then,
relatedness £ has a normal distribution (see for n processes, the probability of a nonmatch
Equation A24). Therefore, in order to calculate w is given by
the distribution of evidence as a function of
time, we must integrate over the distribution W = 1 - (1 - q)n, (11)
of relatedness N(u, TJ) and find
and the probability of a hit r is given by
1
?)«- , (12)
by analogy with Equation A 19. Thus, r and w
-j<3-{fl!/s!< : depend on the particular values of p and q, so
2irs*t~ Vfcnj"8 that for a particular discriminability value
d' = (u — v)/r], the discriminability for n
processes depends on the criterion chosen, and
subjects could maximize the d' value by
y(x, t} = adjusting the criterion. The dependence of
discriminability d' on criterion value is rather
an undesirable property; however, the de-
Thus, the distribution of evidence is given by a pendence is rather small. Consider d' = 1.85;
normal distribution N[_ut, V/(r;2/ + s2)]. For values of p and q that can give this value are
two distributions of relatedness, N(u, rj) for p = .86, .75, and .54 and q = .22, ,12, and .04,
matches and N(v,ri) for nonmatches, d' is respectively. Using Equations 11 and 12,
given by values of d' for n = 2 are 1.51, 1.52, and 1.54,
respectively, and for n = 4 are 1.14, 1.20, and
u —v u— 1.27. Thus d' is reasonably robust to the choice
d1 (9)
(variance)1 + i!/(»?2<)]' of criterion. In Reed's study, hit rates were
usually high (80% to 90%), so that these
Now as t—><x>, the value of d' asymptotes larger p values were used in the following fits.
(ASY) at ^'ASY — (u— »)/ij, that is, d' of the Figure 20 shows a plot of the accuracy-
original relatedness (as expected, given a large response-signal-lag data obtained by Reed
amount of time). Thus, (1976), together with fits from Equation 10
and d' values for Set Sizes 2 and 4, calculated
d' ASY using Equations 11 and 12. The encoding and
d' = (10)
response output parameter was 220 msec, and
96 ROGER RATCLIFF

d1
3.

.2 .4 .6 .8 i.O 1.2
RESPONSE SIGNAL LAG (SEC)
Figure 20. d' as a function of signal lag for a speed-accuracy experiment by Reed (1976) using response
signals in the Sternberg varied-set procedure. (The broken line represents data, and the smooth curves
represent fits ^derived from Equations 10, 11, and 12].)

<*'ASY for Set Size 1 was set at 3.2. It should be Confidence Judgment Procedures
noted that each of the three theoretical curves
was fixed in shape by the initial choice of j2 The study-test paradigm employs a 6-point
and r;2 made at the start of fitting the study- confidence judgment procedure for response
test paradigm, so that shape is described with recording. Up to this point, the task has been
no free parameters. modeled as though it used a yes-no procedure
Reed (1976) has argued that the response- because results from a yes-no procedure show
latency-signal-lag function is particularly use- the same pattern of data. In order to model
ful in discriminating models. These curves (qualitatively) the confidence judgment pro-
show a minimum at some particular lag, cedure, it is necessary to assume a variable
depending on type of response (yes-no) and internal temporal deadline. In terms of the
list length. It was argued that these provide theory presented here, the only information
evidence for the termination of a discrete available about the "strength" of the item
process that leads to an accelerated response during the recognition processes the compari-
latency for a short period. The difference son time. Thus, if a comparison is taking a
between the minimum for "yes" and "no" long time, the subject may reduce his confi-
responses was used to argue for differential dence and respond on a lower confidence key.
termination of the "yes" and "no" decision It should be noted that subjects are quite
process. These two results are exactly what the good at responding to external deadlines
theory predicts if it is now assumed that the (Reed, 1976) and so may be able to set internal
diffusion process boundaries are close to the deadlines. Thus, confidence judgment results
starting point of the process but not close may be explained in terms of subject-set
enough to significantly affect the shape of the internal deadlines on the comparison process.
theoretical accuracy-signal-lag curves: It
should be noted that the latency minima Summary
correspond to a significant increase in response
latency rejections (100 msec < T < 500 msec Speed-accuracy trade-off is a phenomenon
only accepted; see Table 1 of Reed, 1976), that can probably be induced in all reaction
and this may be a problem for interpretation time experiments. Therefore, any retrieval
of these results. model that deals adequately with both latency
A THEORY OF MEMORY RETRIEVAL 97

and accuracy must have mechanisms at the Each comparison process is formalized as a
heart of the model to deal with this trade-off. diffusion process. In the diffusion process,
The theory developed here was designed with evidence for a probe-memory-set item match
this in mind, and so speed-accuracy relations is accumulated over time: the greater the
are fundamental. The theory is quite consistent evidence (greater the resonant amplitude), the
with results from several speed-accuracy pro- more likely a match; the lower the evidence
cedures and fits results from the response signal (smaller the resonant amplitude), the more
procedure (Reed, 1976) especially well. In likely a nonmatch (see Figure 2). The rate of
the theory, two variance parameters (if and s2) accumulation of evidence is determined by the
were fixed throughout fits to previous para- relatedness of the probe to a memory-set
digms. The shape of the accuracy-signal-lag item (see Figure 3), where relatedness is a
function was determined solely by the ratio of single-dimension variable onto which all
those two parameters (s^/if), and the fit was structural, semantic, and phonemic (etc.)
excellent. information is mapped. The decision process is
self-terminating on a comparison process
Discussion match, leading to a "yes" response. For a
"no" response, all comparison processes must
Comparison and Summary of Paradigms have terminated with a nonmatch.
The search set was assumed to be the
Each of the experimental paradigms con- memory set for the study-test, Sternberg, and
sidered so far has been discussed more or less prememorized list paradigms, .though it was
in isolation from the others. Therefore, it is noted that this is only an approximation
appropriate to give a brief summary and because there is evidence that items from earlier
comparison of experimental parameters be- lists enter the comparison process. For the
tween paradigms. The study-test, Sternberg, continuous recognition memory paradigm, a
prememorized list, and continuous recognition search set size of 40 was assumed.
paradigms are all assumed to have the same Table 4 shows parameter values for the
processing stages (see Figure 1). The encoded four item recognition paradigms for the two
representation of the probe is compared in conditions in each experiment in which the
parallel with members of the memory search largest value of d' and the smallest value of d'
set. The search set is defined as those items were obtained. The theoretical value of d'
determining processing rate and is to be represents the difference in the size of reso-
conceptualized as the set of resonating elements nance between an item in memory and the
in the framework of a resonance metaphor. probe when the probe matches and when the

Table 4
Parameter Values for Four Item Recognition Paradigms for the Two Conditions Giving
the Largest and Smallest d' Values

Largest d' vaJue Smallest d' value

Paradigm a z u

Study-test (Experiment 1) .15 .04 .45 -.44 5.0 .16 .04 .19 -.38 3.2 360 msec
Sternberg (Experiment 2) .24 .1 .55 -.47 5.7 .24 .1 .35 -.47 4.6 260 msec
Prememorized lists .16 .09 .4 -.55 5.3 .33 .11 .36 -.52 4.9 420 msecb
Continuous recognition .13 .03 .6 -.52 6.2 .13 .03 .35 -.52 4.8 520 msec0

• In the response signal study by Reed (1976), TEH = 230 msec and d' = 3.2.
b
The test words were spoken, and spoken words have a larger stimulus onset to perception than visually
presented words.
° Stimuli were presented on an IBM typewriter advanced by a solenoid, the pulse triggering the solenoid,
which started a timer. There is a 250-msec delay (approximately) from timer start to stimulus onset with this
apparatus (compare Experiments 1 and 2 of Murdock et al,, 1977, with stimuli presented on a computer and
stimuli presented on a typewriter).
98 ROGER RATCLIFF

probe does not match the item. Therefore, to Anderson (1973), which deals with retrieval
calculate asymptotic d' for any experiment, it from short memorized lists. It should be stated
is necessary to obtain a hit and false alarm rate at the outset that this model is not to be viewed
from the d' value, to use Equations 11 and 12 to as definitive but rather as the first step of a
derive hit and false alarm rates for the com- continuing and developing research program
bination of n processes in the memory set, and (Anderson, 1972, 1976; Anderson, Silverstein,
then to use the derived hit and false alarm rates Ritz, & Jones, 1977; Borsellino & Poggio,
to estimate asymptotic d'. It turns out that in 1973; Cavanagh, 1976; Kohonen, 1976;
all four experiments, asymptotic accuracy is not Kohonen & Oja, 1973). The basic structure of
much more than the accuracy obtained in the the model is fairly simple. It is assumed that
experiment, which suggests that subjects are what is of importance is the simultaneous
working with an emphasis on accuracy. pattern of individual neuron activities in a
Parameters show no more variation within large group of neurons. The patterns interact
a paradigm than a factor of two; and between in the process of storage, so that memory is
paradigms, variations are no more than a simply the sum of past activities in the system.
factor of three. Values of v, that is, probe- In the recognition process, the probe input
memory-set item nonmatch relatedness, are trace is matched to the memory. If a certain
relatively invariant across paradigms, showing level of positive evidence is accumulated, a
that subjects tend to set a relatively constant "yes" response is made, and if a certain level
relatedness criterion with respect to the non- of negative evidence is accumulated, a "no"
target item relatedness (noise) distribution. response is produced. The memory is formally
The encoding and response output param- represented by a vector of N elements, and a
eter (TER)' seems to fall in the range of 250 memory trace /,• is represented as a specific
msec to 350 msec, so that the other 250 msec vector of length N. If there are K traces, then
to 500 msec is comparison and decision time. the memory vector is constructed by
This division of processing time seems much
K
more reasonable than that proposed by other
models, for example, 35-msec comparison and *=£/*.
~ 4-1 ~
(13)
400-msec encoding, decision, arid response
output (Sternberg; 1966) or 5-msec comparison Then, the match between the input probe /
and 600-msec encoding, decision, and response and the memory vector is given by the dot
output (Murdock & Anderson, 1975). product/-5 (or matched filter), that is, each
It must be apparent that such cross- element of the input vector is multiplied with
paradigm comparisons may prove most interest- the corresponding ;element of the memory
ing when the same group of subjects is used in vector, and the sum of these values represents
each of the paradigms. Furthermore, such the amount of match.
comparisons may prove useful in studying If this description is rephrased slightly, then
the locus of individual differences. the recognition model shows remarkable
similarity to the retrieval model developed in
Relation to Neural Network Models this article. For example, the comparison
process can be viewed as resonance, and if
A large part of this article lias been concerned each element of the dot product is accumulated
with the development of a theory that inte- successively through one filter (not two), then
grates a. number of experimental paradigms. the resulting process is a random walk. Thus,
In this and the following section, the comple- the two theories are closely related.
mentary problem is treated, that of the relation However, there are some problems with
between this theory and other more global Anderson's (1973) model. The model suffers
theories. I consider neural network models in many of the problems of strength theory in
this section and semantic network models in that there are no separate records of each
the next section. memorized item, and activities simply sum,
Perhaps one of the simpler neural network as in strength theory. Therefore, the model is
models is the model developed by J. A. not capable of performance levels shown by
A THEORY OF MEMORY RETRIEVAL 99

subjects in judgments of frequency (Wells, and, in particular, the statistic mean reaction
1974) and list discrimination (Anderson & time. One of the main thrusts of this article
Bower, 1972). has been to demonstrate that mean reaction
I noted earlier that this model is part of a time is of limited use as a statistic. Also mean
continuing research program, which can be reaction time can often be misleading because
seen in J. A. Anderson (1976) and Anderson models based on mean latency may be contra-
et al. (1977). In these articles, it is shown how dicted by further analyses of reaction time
network models can deal with associations, data (e.g., Ratcliff & Murdock, 1976). Further-
probability learning, feature analysis and more, in many item recognition studies, it is
decomposition, and categorical perception. found that accuracy and latency covary, and
Therefore, it may be a useful future exercise to this seems to be true in many semantic
attempt to relate in some detail these two memory studies (e.g., Collins & Quillian, 1969;
theories: neural network theory and the Meyer, 1970; Rips et al., 1973). Therefore, any
retrieval theory presented here. model that purports to account for latency
<. effects should also account for accuracy effects
and be capable of dealing with speed-accuracy
Relation to Semantic Network Models and
trade-off.
Prepositional Models
Process models of retrieval latency that are
Semantic processing models. There is no derived from network models assume that
doubt that the human memory system is latency is a function of number of links
highly structured. One approach to modeling traversed. Latency for "yes" responses is well
the structural complexity of the system has modeled using this assumption, but latency
involved the use of semantic networks. The for "no" responses poses serious problems.
use of a network representation has been in For example, in verification of statements such
part a consequence of the appealing digital as "a robin is a mammal," latency can be as
computer metaphor for the human information much as 200 msec slower than for statements
processor. However, just because this form of in which the two nouns are more unrelated
representation is appealing and most useful in semantically, for example, "a robin is a car"
setting out structural relations, it does not (Rips et al., 1973). In attempting to verify
mean that process models suggested by a the latter, one might expect that more links
network representation are true or useful. would be searched than in the former (e.g.,
This point can be illustrated as follows. Collins & Quillian, 1969), but as shown above,
Rips, Shoben, and Smith (1973) developed a the derived prediction is incorrect. It is interest-
set theoretic model of semantic representation. ing to note that the relation between latency
Later, Hollan (1975) pointed out that set and accuracy as a function of "semantic
theoretic representations can be translated relatedness" is similar to the relation between
into networks, that is, there exists an iso- latency and accuracy as a function of related-
morphism between the two representations. ness in the retrieval theory developed here,
However, Rips, Shoben, and Smith (1975) in namely, extreme relatedness leads to fast
a rejoinder made the important point that the reaction times and high accuracy.
choice of representation can lead to sub- Collins and Loftus (1975) have proposed
stantive processing differences. For example, a comparison process for semantic processing,
network models explain reaction time effects wherein evidence is accumulated until either
in terms of the time for retrieval of pathways a positive or negative criterion is reached,
between nodes; whereas for set theoretic whereupon a response is initiated. This com-
models, reaction time effects are determined parison process belongs to the class of random
by comparisons of semantic elements. Thus, walk processes (as Collins and Loftus point
for any network model, there is at least one out) that forms the basis of the retrieval
other isomorphic representation that suggests theory developed here. Collins and Loftus
alternative processing assumptions. (1975, Table 1) listed qualitatively different
Much of the data used to support semantic kinds of evidence that can contribute to a
processing models makes use of reaction time decision. I argued earlier that qualitatively
100 ROGER RATCLIFF

different kinds of information contribute to assumption seems most consistent with data.
relatedness in item recognition. Therefore, it The assumption of complete dependence
seems that the two points of view converge, certainly does not represent serial processing.
and the theory developed earlier at least Rather, it is more consonant with the view
qualitatively extends into the domain of that the greater the number of links, the less
semantic processing. evidence there is for the required relation
Propositioned models. Propositional models holding (for in a network theory, eventually
(J. R. Anderson, 1976; Anderson & Bower, a pathway can be found between any pair of
1973; Kintsch, 1974; Rumelhart, Lindsay, & nodes). Thus, the assumption of complete
Norman, 1972; Schank, 1976; see also Cofer, dependence in processing seems halfway
1976) are close cousins of semantic network toward adoption of a decision process similar
models but have slightly different aims in to the random walk class discussed earlier.
that the central task is to model sentence or J. R. Anderson (1976) has gone a stage
text representation. The ancestry of these beyond earlier models in that the production
models is in linguistic theory, automata theory, system is the first attempt to model control
and the theory of formal languages (Chomsky, processes in a model that makes close contact
1963). Therefore, it is not surprising that a with experimental data. However, the process-
major goal of these theories has been suffi- ing assumptions underlying comparison pro-
ciency, that is, the ability of the theory (at cesses still reflect the influence of the computer
least in principle) to deal with and represent metaphor, and predictions for latency and
the complexities of human language and accuracy come from separate sources (e.g.,
knowledge. Thus, there has been less attention fan size and deadline, respectively) and are not
paid to processes underlying comprehension closely interrelated. This is an important
and retrieval, although the work of J. R. problem because most of the data the model
Anderson (1976), Anderson and Bower (1973), makes contact with are reaction time data,
and Kintsch (1974, 1975) are important and it can be seen that accuracy and latency
exceptions. Even so, Anderson and Bower covary in many of the data presented (J. R.
(1973, p. 6), in their formulation, selected Anderson, 1976).
processing assumptions on the basis of their This discussion suggests that the retrieval
ease of implementation on a digital computer. theory presented in this article may be capable
In particular, serial search of network struc- of representing the retrieval and decision
tures was assumed for matching or comparison processes involved in processing semantic and
processes. more highly structured information (proposi-
J. R. Anderson (1974) developed two mathe- tional and text). Thus, a task for the future is
matical models for retrieval of propositional the integration of the retrieval theory and
information. One model assumed independence theories of semantic and text memory structure.
of processing stages (the usual search model),
whereas the other assumed "complete de- General Conclusions
pendence," in which search time was exponen-
tially distributed with the time constant equal I have presented a theory of memory
to the sum of time constants for individual link retrieval that not only applies over a range of
search times. Anderson claimed that the two paradigms but also deals with experimental
models gave similar predictions for mean data in greater depth and more detail than
latency, but it is easy to see that they give competing models. The theory provides a
opposite predictions for the behavior of skew- rationale for relating accuracy, mean reaction
ness of the latency distributions. As the number time, error latency, and reaction time distri-
of links increases, for complete independence, butions. Such relations have not been dealt
skewness decreases; whereas for complete with explicitly before. Because the theory
dependence, skewness increases. In general, in applies over a range of experimental paradigms,
word recognition experiments, as mean latency it provides a basis for the claim that the same
increases, skewness of the distribution in- processes are involved in each of these para-
creases. Therefore, the complete-dependence digms, allowing theoretical comparisons to be
A THEORY OF MEMORY RETRIEVAL 101

made between paradigms, and thus sets the Search processes in recognition memory. In R. L.
stage for experimental comparison of para- Solso (Ed.), Theories in cognitive psychology: The
Loyola Symposium. Hillsdale, N.J.: Erlbaum, 1974.
digms. Also, the theory makes contact with Atkinson, R. C., Holmgren, J. E., & Juola, J. F.
several other more general theories, namely, Processing time is influenced by the number of ele-
neural network, semantic memory, and prepo- ments in a visual display. Perception &• Psychophysics,
sitional models. The success of the theory so 1969, 6, 321-327.
Atkinson, R. C., & Juola, J. F. Factors influencing
far, and the potential shown for integration speed and accuracy of word recognition. In S.
with other general models, leads to the hope Kornblum (Ed.), Attention and performance IV.
that some measure of theoretical unification New York: Academic Press, 1973.
may soon be achieved. Atkinson, R. C., & Shiffrin, R. M. Human memory:
A proposed system and its control processes. In
K. W. Spence & J. T. Spence (Eds.), The psychology
Reference Notes of learning and motivation: Advances in research and
theory (Vol. 2). New York: Academic Press, 1968.
1. Sternberg, S. Estimating the distribution of additive Baddeley, A. D., & Ecob, J. R. Reaction time and short-
reaction-time components. Paper presented at the term memory: Implications of repetition effects for
meeting of the Psychometric Society, Niagara Falls, the high-speed exhaustive scan hypothesis. Quarterly
Ontario, Canada, October 1964. Journal of Experimental Psychology, 1973, 25, 229-
2. Sternberg, S. Evidence against self-terminating 240.
memory search from properties of RT distributions. Banks, W. P., & Atkinson, R. C. Accuracy and speed
Paper presented at the meeting of the Psychonomic strategies in scanning active memory. Memory &
Society, St. Louis, November 1973. Cognition, 1974, 2, 629-636.
3. Ratcliff, R. Group reaction time distributions and an Bobrow, D. G. Dimensions of representation. In D. G.
analysis of distribution statistics. Paper presented at Bobrow & A. M. Collins (Eds.), Representation and
the Mathematical Psychology meeting, Los Angeles, understanding: Studies in cognitive science. New York:
August 1977. Academic Press, 1975.
Borsellino, A., & Poggio, T. Convolution and correlation
4. Reed, A. V. Discriminability spectra in bilingual algebras, Kybernetik, 1973,13, 113-122.
recognition. Paper presented at the Mathematical Briggs, G. E. On the predictor variable for choice
Psychology meeting, Los Angeles, August 1977. reaction time. Memory & Cognition, 1974, 2, 575-580.
Burrows, D., & Okada, R. Serial position effects in
References high-speed memory search. Perception & Psycho-
physics, 1971, 10, 305-308.
Allport, D. A. Critical notice: The state of cognitive Burrows, D., & Okada, R. Memory retrieval from long
psychology. Quarterly Journal of Experimental and short lists. Science, 1975, 188, 1031-1033.
Psychology, 1975, 27, 141-152. Cavanagh, J. P. Holographic and trace strength models
Anderson, J. A. A simple neural network generating an of rehearsal effects in the item recognition task.
interactive memory. Mathematical Biosciences, 1972, Memory &• Cognition, 1976, 4, 186-199.
14, 197-220. Chomsky, N. Formal properties of grammars. In R. D.
Anderson, J. A. A theory for the recognition of items Luce, R. Bush, & E. Galanter (Eds.), Handbook of
from short memorized lists. Psychological Review, mathematical psychology (Vol. 2). New York: Wiley,
1973, 80, 417-438. 1963.
Anderson, J. A. Neural models with cognitive implica- Churchill, R. V. Fourier series and boundary value prob-
tions. In D. La Berge & S. J. Samuels (Eds.), lems (2nd. ed.). New York: McGraw-Hill, 1963.
Basic processes in reading: Perception and compre- Cofer, C. N. The structure of human memory. San
hension. Hillsdale, N.J.: Erlbaum, 1976. Francisco: Freeman, 1976.
Anderson, J. A., Silverstein, J. W., Ritz, S. A., & Jones, Collins, A. M., & Loftus, E. F. A spreading-activation
R. S. Distinctive features, categorical perception, theory of semantic processing. Psychological Review,
and probability learning: Some applications of a 1975, 82, 407-428.
neural model. Psychological Review, 1977, 84, 413- Collins, A. M., & Quillian, M. R. Retrieval time from
451. semantic memory. Journal of Verbal Learning and
Anderson, J. R. Retrieval of prepositional information Verbal Behavior, 1969, 8, 240-247.
from long-term memory. Cognitive Psychology, 1974, Collins, A. M., & Quillian, M. R. Does category size
6, 451-474. affect categorization time? Journal of Verbal Learning
Anderson, J. R. Language, memory and thought. Hills- and Verbal Behavior, 1970, 9, 432-438.
dale, N.J.: Erlbaum, 1976. Corballis, M. C. Serial order in recognition and recall.
Anderson, J. R., & Bower, G. H. Recognition and re- Journal of Experimental Psychology, 1967, 74, 99-105.
trieval processes in free recall. Psychological Review, Corballis, M. C. Access to memory: An analysis of
1972, 79, 97-123. recognition times. In P. M. A. Rabbitt & S. Dornic
Anderson, J. R., & Bower, G. H. Human associative (Eds.), Attention and performance V. New York:
memory. Washington, D.C.: Winston, 1973. Academic Press, 1975.
Atkinson, R. C., Herrmann, D. J., & Wescourt, K. T. Corballis, M. C., Kirby, J., & Miller, A. Access to
102 ROGER RATCLIFF

elements of a memorized list. Journal of Experimental Miller, J. O., & Pachella, R. G. Locus of the stimulus
Psychology, 1972, 94, 185-190. probability effect. Journal of Experimental Psychol-
Cox, D. R., & Miller, H. D. The theory of stochastic ogy, 1973, 101, 227-231.
processes. London: Methuen, 1965. Morin, R. E., DeRosa, D. V., & Stultz, V. Recognition
Dosher, B. A. The retrieval of sentences from memory: memory and reaction time. In A. F. Sanders (Ed.),
A speed-accuracy study. Cognitive Psychology, 1976, Attention and performance I. Amsterdam: North-
8, 291-310. Holland, 1967.
Feller, W. An introduction to probability theory and its Morton, J., Marcus, S., & Prankish, C. Perceptual
applications (Vol. 1, 3rd. ed.). New York: Wiley, centers (P-centers). Psychological Review, 1976, 83,
1968. 405-408.
Forrin, B., & Cunningham, K. Recognition time and Murdock, B. B., Jr. Response latencies in short-term
serial position of probed item in short-term memory. memory. Quarterly Journal of Experimental Psy-
Journal of Experimental Psychology, 1973, 99, 272- chology, 1968, 20, 79-82.
279. Murdock, B. B., Jr. Human memory: Theory and data.
Forrin, B., & Morin, R. E. Recognition times for items Potomac, Md.: Erlbaum, 1974.
in short- and long-term memory. Ada Psychologica, Murdock, B. B., Jr., & Anderson, R. E. Encoding,
1969, 30, 126-141. storage and retrieval of item information. In R. L.
Hollan, J. D. Features and semantic memory: Set- Solso (Ed.), Theories in cognitive psychology: The
theoretic or network model? Psychological Review, Loyola Symposium. Hillsdale, NJ.: Erlbaum, 1975.
1975, 82, 154-155. Murdock, B. B., Jr., & Duf ty, P. O. Strength theory and
Huesmann, L. R., & Woocher, F. D. Probe similarity recognition memory. Journal of Experimental
and recognition of set membership: A parallel- Psychology, 1972, 94, 284-290.
processing serial-feature-matching model. Cognitive Murdock, B. B., Jr., Hockley, W. E., & Muter, P. M.
Psychology, 1976, 8, 124-162. Two tests of the conveyor belt model for item recog-
Juola, J. F., Fischler, I., Wood, C. T., & Atkinson, nition. Canadian Journal of Psychology, 1977, 31,
R. C. Recognition time for information stored in 71-89.
long-term memory. Perception &• Psychophysics, Neisser, U. Cognitive psychology. New York: Appleton-
1971,10, 8-14. Century-Crofts, 1967.
Kintsch, W. The representation of meaning in memory. Newell, A. You can't play 20 questions with nature and
Hillsdale, NJ.: Erlbaum, 1974. win: Projective comments on the papers of this
Kintsch, W. Memory representations of text. In R. L. symposium. In W. G. Chase (Ed.), Visual information
Solso (Ed.), Information processing and cognition: processing. New York: Academic Press, 1973.
The Loyola Symposium. Hillsdale, NJ.: Erlbaum, Nickerson, R. S. Binary-classification reaction time: A
1975. review of some studies of human information-
Kohonen, T. Associative memory: A system-theoretical processing capabilities. Psychonomic Monograph
approach. Berlin, West Germany: Springer-Verlag, Supplements, 1972, 4(17, Whole No. 65).
1976. Norman, D. A., & Rumelhart, D. E. A system for per-
Kohonen, T., & Oja, E. Fast adaptive formation of ception and memory. In D. A. Norman (Ed.),
orthogonalizing filters and associative memory in Models of human memory. New York: Academic
recurrent networks of neuron-like elements. Bio- Press, 1970.
logical Cybernetics, 1973, 21, 85-95. Okada, R. Decision latencies in short-term recognition
Krantz, D. H. Threshold theories of signal detection. memory. Journal of Experimental Psychology, 1971,
Psychological Review, 1969, 76, 308-324. 90, 27-32.
Laming, D. R. Information theory of choice reaction time. Pachella, R. G. An interpretation of reaction time in
New York: Wiley, 1968. information processing research. In B. Kantowitz
Link, S. W. The relative judgment theory of choice (Ed.), Human information processing: Tutorials in
reaction time. Journal of Mathematical Psychology, performance and cognition. Hillsdale, N J.: Erlbaum,
1975, 12, 114-135. 1974.
Link, S. W., & Heath, R. A. A sequential theory of Pike, R. Response latency models for signal detection.
psychological discrimination. Psychometrika, 1975, Psychological Review, 1973, 80, 53-68.
40, 77-105. Ratcliff, R., & Murdock, B. B., Jr. Retrieval processes
Lively, B. L. Speed/accuracy trade off and practice as in recognition memory. Psychological Review, 1976,
determinants of stage durations in a memory- 83, 190-214.
search task. Journal of Experimental Psychology, Reed, A. V. Speed-accuracy trade-off in recognition
1972, 96, 97-103. memory. Science, 1973, 181, 574-576.
Luce, R. D. A threshold theory for simple detection Reed, A. V. List length and the time course of recogni-
experiments. Psychological Review, 1963, 70, 61-79. tion in immediate memory- Memory &• Cognition,
Marcel, A. J. Negative set effects in character classifica- 1976, 4, 16-30.
tion: A response-retrieval view of reaction time. Rips, L. J., Shoben, E. J., & Smith, E. E. Semantic
Quarterly Journal of Experimental Psychology, 1977, distance and the verification of semantic relations.
29, 31-48. Journal of Verbal Learning and Verbal Behavior,
Meyer, D. E. On the representation and retrieval of 1973, 12, 1-20.
stored semantic information. Cognitive Psychology, Rips, L. J., Shoben, E. J., & Smith, E. E. Set-theoretic
1970, /, 242-300. and network models reconsidered: A comment on
A THEORY OF MEMORY RETRIEVAL 103

Hollan's "Features and semantic memory." Psycho- Theios, J., & Walter, D. G. Stimulus and response
logical Review, 1975, 82, 156-157. frequency and sequential effects in memory scanning
Rumelhart, D. E., Lindsay, P. H., & Norman, D. A. reaction times. Journal of Experimental Psychology,
A process model for long-term memory. In E. Tulv- 1974, 102, 1092-1099.
ing & W. Donaldson (Eds.), Organization of memory. Tikhonov, A., & Samarskii, A. A. Equations of mathe-
New York: Academic Press, 1972. matical physics. New York: Macmillan, 1963.
Schank, R. C. The role of memory in language process- Townsend, J. T. A note on the identifiability of parallel
ing. In C. Cofer (Ed.), The structure of human memory. and serial processes. Perception 6* Psychophysics,
San Francisco: Freeman, 1976. 1971,10, 161-163.
Scheirer, C. J., & Hanley, M. J. Scanning for similar Townsend, J. T. Some results concerning the "identi-
and different material in short- and long-term fiability" of parallel and serial processes. British
memory. Journal of Experimental Psychology, 1974, Journal of Mathematical and Statistical Psychology,
102, 343-346. 1972, 25, 168-199.
Schneider, W., & Shiffrin, R. M. Controlled and auto- Townsend, J. T. Issues and models concerning the
matic human information processing. Psychological processing of a finite number of inputs. In B. H.
Review, 1977, 86, 1-66. Kantowitz (Ed.), Human information processing:
Schouten, J. F., & Bekker, J. A. M. Reaction time and Tutorials in performance and cognition. Hillsdale,
accuracy. Ada. Psychologica, 1967, 27, 143-153. NJ.: Erlbaum, 1974.
Schulman, H. G. Encoding and retention of semantic Tulving, E., & Bower, G. H. The logic of memory
and phonemic information in short-term memory. representations. In G. H. Bower (Ed.), The psychol-
Journal of Verbal Learning and Verbal Behavior, ogy of learning and motivation: Advances in research
1970, 9, 499-508. and theory (Vol. 8). New York: Academic Press,
Seward, G. H. Recognition time as a measure of con- 1974.
fidence. Archives of Psychology, 1928, 99, 723. (Cited Wells, J. E. Strength theory and judgments of recency
in Woodworth, R. S. Experimental psychology. New and frequency. Journal of Verbal Learning and
York: Holt, 1938.) Verbal Behavior, 1974, 13, 378-392.
Smith, E. E., Shoben, E. J., & Rips, L. J. Structure Wescourt, K. T., & Atkinson, R. C. Scanning for
and process in semantic memory: A featural model information in short-term and long-term memory.
for semantic decisions. Psychological Review, 1974, Journal of Experimental Psychology, 1973, 98,
81, 214-241. 95-101.
Sternberg, S. High-speed scanning in human memory.
Wescourt, K. T., & Atkinson, R. C. Fact retrieval
Science, 1966, 153, 652-654.
Sternberg, S. The discovery of processing stages: processes in human memory. In W. K. Estes (Ed.),
Handbook of learning and cognitive processes (Vol. 4).
Extensions of Bonders' method. In W. G. Koster
(Ed.), Attention and performance II. Amsterdam: Hillsdale, N.J.: Erlbaum, 1976.
North-Holland, 1969. (a) Wickelgren, W. A. Dynamics of retrieval. In D. Deutsch
Sternberg, S. Memory-scanning: Mental processes & J. A. Deutsch (Eds.), Short-term memory. New
revealed by reaction-time experiments. American York: Academic Press, 1975.
Scientist, 1969, 57, 421-457. (b) Wickelgren, W. A. Speed-accuracy tradeoff and in-
Sternberg, S. Memory scanning: New findings and formation processing dynamics. Acta Psychologica,
current controversies. Quarterly Journal of Experi- 1977, 41, 67-85.
mental Psychology, 1975, 27, 1-32. Wickelgren, W. A., & Corbett, A. T. Associative
Stone, M. Models for choice reaction time. Psycho- interference and retrieval dynamics in yes-no recall
metrika, 1960, 25, 251-260. and recognition. Journal of Experimental Psychology:
Theios, J., Smith, P. G., Haviland, S. E., Traupmann, Human Learning and Memory, 1977, 3, 189-202.
J., & Moy, M. C. Memory scanning as a serial self- Wickelgren, W. A., & Norman, D. A. Strength models
terminating process. Journal of Experimental Psy- and serial position in short-term recognition memory.
chology, 1973, 97, 323-336. Journal of Mathematical Psychology, 1966,3, 316-347.

Appendix
In this section, the mathematical model used are indicated. Third, equations for the maxi-
to relate the theory to data is formulated. The mum of n processes (exhaustive processing)
organization is as follows: First, the theory of and the minimum of n processes (self-termin-
the random walk (gambler's ruin problem) is ating processing) are derived.
described as an introduction to the diffusion In the development of any model, one has
process. Second, the theory of the diffusion to select aspects of the data, that is, summary
process is outlined, and two ways of deriving statistics, as points of contact between the
first-passage time distributions and error rates model and data. In item recognition, the
104 ROGER RATCLIFF

statistic generally used is mean reaction time, probability of a nonmatch is given by


but as argued earlier, mean error rate and
reaction time distribution statistics are more
useful. Thus, the following development is when
aimed primarily at deriving expressions for
distributions and error rates. qz =

1 —j, when q = p.
Random Walk A
This is the required expression for error rate.
Traditionally, the theory of this process is The next result needed is the first-passage
centered around the example of a gambler time distribution for a nonmatch (the distri-
who wins or loses a dollar with probabilities bution of number of steps to nonmatch).
p and q, respectively. His initial capital is Z Let gz.n denote the probability that the
and his opponent's capital is A — Z. The process ends with the wth step at the barrier 0
game terminates when his capital becomes A (nonmatch at the wth feature comparison).
or 0 dollars (Feller, 1968, p. 342). After the first step, the position is Z + 1 or
It is simple to rephrase the problem in Z— l f o r l < Z < X — 1 and n ^ 1 ; thus,
terms of a feature-matching, memory com-
parison process (cf. Huesmann & Woocher, gZ,n+l = PgZ+l,n (A4)
1976). Suppose the features of a probe item Boundary conditions for this difference equa-
are compared with the features of a memory tion are given by
item, and the probability of a match is p and
of a nonmatch is q (= 1 — p). Then, if Z more gO.n = gA,n = 0
nonmatches than matches are required for a
probe-memory item nonmatch, and A — Z and (AS)
more matches than nonmatches are required go,o = 1, gz,o = 0, when Z > 0.
for a probe-memory item match, this process
is isomorphic to the gambler's ruin problem. Then, the difference Equation A4, with
I shall discuss the random walk within the boundaries given by Equation AS, holds for
feature match framework. all 0 < Z < A and all « > 0.
One of the main methods for finding solu- The solution to Equations A4 and AS is
tions for stochastic processes consists of given by
deriving difference equations for the process 2n+1
£Z,n =
and solving these equations with respect to ~A~f
certain boundary conditions. In our feature .irk . irk .
comparison process, let qz be the probability X cos —7 sin — sin (A6)
k<A/2
of a probe-item nonmatch, given starting
point Z, match at A, and nonmatch at 0. Then, The derivation of Equation A6 is shown in
after the first trial, either Z ~ 1 or Z + 1 Feller (1968, chap. 14) and involves deriving
steps are required for termination with a the probability generating function for first-
nonmatch. Thus, passage times, then performing a partial
fraction expansion on that expression.
qz = pqz+i + qqz~i, (Al) In order to obtain the expressions equivalent
to Equations A3 and A6 for a probe-memory
provided 1 < Z < A — 1. For Z = 1 and item match, q and p are interchanged and Z
Z = A —• 1, the first trial may lead to termi- becomes A — Z. It should also be noted that
gz,» is not a probability density function, but
nation, so we define that the sum of the first-passage time proba-
bilities for matches and nonmatches is a
go 1 and =0 (A2) probability density function.
to be boundary conditions on the random
walk. Thus, the probability of nonmatch qz Diffusion Process
satisfies Equations Al and A2. It was noted earlier that the discrete
Using the method of particular solutions, random walk could be used to represent a
the solution to Equations Al and A2 for the feature comparison process. However, I have
A THEORY OF MEMORY RETRIEVAL 105

chosen to use the continuous random walk where Z —> z/8 and A —» a/8. 7+(£), the proba-
(Wiener diffusion) process for several reasons. bility of a match, can be obtained by replacing
First, my conceptual bias is to think of in- z with a — z and £ with — £. Then, 7+ + y_ = 1.
formation or evidence accumulation as a The limiting form of the first-passage time
continuous process rather than a discrete distribution is given by
process. Second, for the continuous random
walk, drift and variance in drift are in-
dependent, and because these and other
parameters of the model are relatively de- v
coupled, manipulation is simpler. Third, it is X 2J sin ( — )«-* '. (A9)
t-i \ a )
faster and cheaper to integrate numerically
over continuous functions than to sum over g-(t, £) is a function of /, a, z, £, and s2, but it is
discrete functions. only necessary to express it as a function of t
Before proceeding, I will introduce the and £ for further calculations in this section.
following convention: F and G will represent The first-passage time distribution for a match
first-passage time distribution functions, and g+(t, £) can be found by setting £ = — £ and
/ and g will represent first-passage time density 2 = a — z in Equation A9. Again, note that
functions. These are not normalized (i.e., are g-(t, £) and g+(t, £) are not probability density
not probability distributions or density func- functions, but g+(t, $ + g.(t, £), g-.(t, £)/7-(£),
tions), but they may be normalized by division and g+(t, £)/7+(£) are probability density
by the appropriate error or correct probability functions.
7. On the other hand, H and h will represent It was stated earlier that Equation A9, the
proper probability distribution and density first-passage time distribution, can be derived
functions. Subscripts + and — will refer to by solving the diffusion equation. The diffusion
the boundaries giving match and nonmatch, equation can be derived from Equation A4,
respectively. using the limiting expressions in Equation A7
It was noted earlier that the Wiener diffusion for % and s2. Equation A4 becomes
process (Brownian motion) is the result of
taking the limit as step size tends toward zero
in the random walk process. This limiting dt
process will now be described.
In order to apply the random walk in the Methods for the solution of this equation (Cox
case where the number of events per unit time & Miller, 1965) lead into the enormous litera-
is large, but the effect of each event is small ture on the solution of partial differential
(e.g., molecular collisions in Brownian motion), equations (Churchill, 1963; Tikhonov &
one can take the discrete expressions and find Samarskii, 1963). By substituting g-(t, £) from
the limit directly. Let the length of the in- Equation A9 in Equation A10, it can be seen
dividual steps S be small, the number of steps that g-(t, £) is a solution of Equation A10.
per unit time r be large, and (p — q) be small, The first-passage time density function
with the constraint that the following expres- g- (t, £) turns out not to be the best expression
sions approach finite limits : to use in evaluating the maximum and mini-
mum of a number of similar processes. Instead,
(/»- (A7) the first-passage time distribution function

It is necessary for the two expressions to have G-(t,Q f'e-V, Qdi? (All)
finite limits. For example, if s 2 —» 0, then the Jo
process is deterministic; if s 2 —» «, then the
process is infinitely variable, and the proba- proves most useful. Now,
bility of terminating at each boundary is 1/2
with zero time delay. If £—» 0, there is no G-(t,
drift ; if £ —-> ± °o, there is no time delay before
the boundary is reached. The probability of a
nonmatch is obtained by taking the limit in 2k sin [l ~
Equation A3 and is given by

however, this series converges slowly, and a


106 ROGER RATCLIFF

more useful alternative is given by T — max Ti is


!<<<n

t)
Pr( max Ti ^ 0
l<t<«
2ksln(—
\a -. (A12) Pr(Tt ^ / ..... Tn ^ t)

P r (r t ^ t) . . . Pr(Tn ^ /)

^ a2 / (assuming independence)
From this expression, it can be seen that
(A13)

Then, the probability density function of the


maximum of n processes is given by
Relatedness and Diffusion ^maxW =
H maxW

The main assumption made to relate diffu- = II Hi(t) L jj~. (A14)


sion and relatedness is that relatedness is
represented by drift £ in the diffusion process. For misses and correct rejections, we are not
Thus, the greater the relatedness, the greater dealing now with probability distribution
is the value of drift; the less the relatedness, functions because sometimes a process may
the less the drift. The scale on which related- terminate at the wrong boundary. Therefore,
ness is measured is not absolute; rather, it is it is necessary to find the maximum of n
only sensible to talk about the degree of dis- processes, given that there are different proba-
criminability. The subject sets the zero point bilities of termination associated with each
(or criterion) of relatedness, so that the average process. Let G,-_(2) be the first-passage time
drift for a matching comparison is toward the distribution function for a nonmatch of process
match boundary, and average drift for a i, and Gmax(i) be the first-passage time distri-
nonmatching comparison is toward the non- bution function for the maximum of the n
match boundary. The scale of relatedness is nonmatch processes. Then,
linear in drift, and therefore, the scale is
defined in terms of the comparison process. Gmax(0 = P,(T ^ /)
There are many other possible scales and n
relations between relatedness and drift, but (A15)
this one seems the most natural. I will use £
to represent relatedness of probe-memory-set from Equation A13. Also,
item match comparisons and v to represent n
relatedness of probe-memory-set item non-
match comparisons, where v usually has the
7- = n 7»-, (A16)
opposite sign to £. where 7,- is the probability of the ith process
nonmatch, and 7_ is the probability that all of
Decision Process the comparison processes produce nonmatches.
For correct rejections, the model will be
The decision process is self-terminating on applied, with each comparison assumed to
comparison process matches and exhaustive have the same first-passage time distribution
on nonmatches. The distributions of '"yes" G-(t, v). Thus, for correct rejections (CR), the
and "no" responses therefore correspond to the first-passage time distribution function FCR(I)
distributions of minimum and maximum is given by
completion times of the n comparison processes. GL"(*. v). (A17)
Maximum of n processes. Let TI, . . . , Tn Also, for the single nonmatching process when
be independent observations on n pro- a memory-set item is probed, let the first-
cesses, with probability density functions passage time distribution function be G_ (t, £) ;
hi(t), . . . , hn(t) and distribution functions thus, for misses (M),
H\(t), , . . , Hn(t). Let us assume each
process terminates with probability one. Fu(f) = G-(t, k}G-n~l(t, v). (A18)
Then, the probability distribution function of Minimum of n processes. For "yes" re-
A THEORY OF MEMORY RETRIEVAL 107

sponses (hits and false alarms), only one where G+(t, v) can be found by setting
comparison process has to terminate with a z = a — z and v — — v in Equation A12.
match; therefore, all combinations of the Let the first-passage time distribution of a
number of match terminations have to be probe being compared with a memory-set item
considered. For example, if only one match (i.e., for a match) be G+(t, (•). Then, for hits
occurs, then the time for that comparison is (H), the first-passage time distribution func-
the time for the decision ; if two matches occur, tion is
then the decision time is the minimum of those
two comparison times. Therefore, it is neces-
sary to sum over all possible combinations of
processes finishing. First, let us consider the
minimum of n processes finishing. Then, the
distribution function Hmin(t) of T — min T,
l<i<"
is found as follows :

Now 1 rc+(*.»oiM
..... Tn>f) L^rJI
= Pr(Tl>t)...Pr(Tn>t) (A22)
At this stage, we should note that both y+(v)
and 7_(£) are usually small (.1 or less) and
G±(t) tends to 7± as t becomes large ; therefore,
Therefore,
Equations A21 and A22 are adequately ap-
proximated by the first few terms in the series.
(A19)
The probabilities of a correct rejection and
miss are given by
Let Fmin(t) be the first-passage time distri-
bution function (not a probability distribution PCS, = 7-"(") = 1 —
function) for the self-terminating process, and and (A23)
let pi be the probability of a match. Then,
= 1- PH.
Pr(T>t)=E E [Pr(r><|», ij) The approximation in Equation A22 was
y=l all subsets
of size j checked by calculating FH(°°) and by making
sure that the value is equal to pn.
ri p.-if d-w
-II
:*if
fc=*l
Relatedness Distributions

- all subsets
A critical assumption is that relatedness
enters the diffusion process as the drift param-
of size j
eter £. In the last section, it was noted that
x f c l ( l - f c ) , (A20) there should be a distribution of relatedness
{-,'! h=hi } and therefore a distribution over drift values
(different from the variance of the diffusion
using Equation A19. As before, when the process s2). The simplest choice used with most
probe was not a memory-set item, all nonmatch precedence (Murdock, 1974, p. 28) is the
comparison first-passage time distribution normal distribution. This means that the
functions GL (t, v) are assumed equal. Then, for distribution function G(f) and accuracy y for
false alarms (FA), the diffusion process have to be averaged over
the relatedness distribution before being used
to calculate distributions and proportions of
hit, miss, correct rejection, and false alarm
n(n- l ) r 2 G + ( f , i Q _ responses.
Let relatedness £ be distributed N(u, r;) and
(A21) relatedness v be distributed N(v, if). Then, we
108 ROGER RATCLIFF

can compute first-passage time distribution Similarly, 7±(w) and y±(v) can be calculated
functions G±(t, u) and G±(/, v), using using Equations A24 and A25, respectively,
substituting 7±(£) and y±(v) for G±(t, £) and
°° 1 . G-tO, v), respectively.
G±(t {)
/ 00
' VfcH * (A24) The e uations
q developed to this point are
sufficient for the calculation of error rates and
and reaction time distributions.
l
x, , N r* ^ , s _ir.»_i,/,, ,.^rs Received December 17, 1976
CT_I_
±I/ U I "— I CJ_L ( / VJ— £ UfV \ l\.2iJ i
/„ * V2irr)2 Revision received July 12, 1977 •

Erratum for Librarians and Secondary Services

For all six issues of Volume 84 (1977) and the first issue of Volume 85 (January
1978) of Psychological Review, the ISSN Number (8098-7970) on the front
cover is incorrect. The correct ISSN Number is 0033-29SX.

You might also like