The Cambridge Handbook of Thinking and Reasoning (2005) PDF

The Cambridge Handbook of
Thinking and Reasoning

Edited by
Keith J. Holyoak
and
Robert G. Morrison
cambridge university press
Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo
Cambridge University Press

The Edinburgh Building, Cambridge cb2 2ru, UK
Published in the United States of America by Cambridge University Press, New York
www.cambridge.org
Information on this title: www.cambridge.org/9780521824170
© Cambridge University Press 2005
This book is in copyright. Subject to statutory exception and to the provision of

relevant collective licensing agreements, no reproduction of any part may take place
without the written permission of Cambridge University Press.
First published in print format 2005
isbn-13 978-0-511-11330-7 eBook (NetLibrary)

isbn-10 0-511-11330-7 eBook (NetLibrary)
isbn-13 978-0-521-82417-0 hardback

isbn-10 0-521-82417-6 hardback
isbn-13 978-0-521-53101-6 paperback

isbn-10 0-521-53101-2 paperback
Cambridge University Press has no responsibility for the persistence or accuracy of

urls for external or third-party internet websites referred to in this book, and does not
guarantee that any content on such websites is, or will remain, accurate or appropriate.
http://avaxhome.ws/blogs/ChrisRedfield
The editors gratefully dedicate this volume to
Patricia Wenjie Cheng
(from KJH)
and
Deborah Lee Morrison
(from RGM)
Contents
Preface page ix p a r t ii
Contributors xi REASONING
5. The Problem of Induction 95
1 . Thinking and Reasoning: Steven A. Sloman
A Reader’s Guide 1 David A. Lagnado
Keith J. Holyoak
6. Analogy 117
Robert G. Morrison
Keith J. Holyoak
pa r t i 7. Causal Learning 1 43
THE NATURE OF HUMAN Marc J. Buehner
CONCEPTS Patricia W. Cheng
8. Deductive Reasoning 1 69
2 . Similarity 13
Jonathan St. B. T. Evans
Robert L. Goldstone
Ji Yun Son 9. Mental Models and Thought 1 85
3. Concepts and Categories: P. N. Johnson-Laird
Memory, Meaning, and 1 0. Visuospatial Reasoning 209
Metaphysics 37 Barbara Tversky
Douglas L. Medin
Lance J. Rips p a r t iii
JUDGMENT AND DECISION
4. Approaches to Modeling Human
MAKING
Mental Representations: What
Works, What Doesn’t, and Why 73 1 1 . Decision Making 243
Leonidas A. A. Doumas Robyn A. LeBoeuf
John E. Hummel Eldar B. Shafir
vii
viii contents
1 2 . A Model of Heuristic Judgment 267 p a r t vi

Daniel Kahneman ONTOGENY, PHYLOGENY,
Shane Frederick LANGUAGE, AND CULTURE
1 3. Motivated Thinking 295 2 2 . Development of Thinking 5 29
Daniel C. Molden Graeme S. Halford
E. Tory Higgins
2 3. Mathematical Cognition 559
C. R. Gallistel
p a r t iv Rochel Gelman
PROBLEM SOLVING AND
2 4. Effects of Aging on Reasoning 5 89
COMPLEX LEARNING
Timothy A. Salthouse
1 4. Problem Solving 3 21
2 5. Reasoning and Thinking in
Laura R. Novick
Nonhuman Primates 607
Miriam Bassok
Josep Call
1 5. Creativity 351 Michael Tomasello
Robert J. Sternberg
2 6. Language and Thought 63 3
Todd I. Lubart
James C. Kaufman Lila Gleitman
Jean E. Pretz Anna Papafragou
1 6. Complex Declarative Learning 3 71 2 7. Paradigms of Cultural Thought 663

Michelene T. H. Chi Patricia M. Greenfield
Stellan Ohlsson
p a r t vii
1 7. Thinking as a Production System 401
THINKING IN PRACTICE
Marsha C. Lovett
John R. Anderson 2 8. Legal Reasoning 685
1 8. Implicit Cognition and Thought 43 1 Phoebe C. Ellsworth
Leib Litman 2 9. Scientific Thinking and

Arthur S. Reber Reasoning 705
Kevin Dunbar
Jonathan Fugelsang
pa r t v
COGNITIVE AND NEURAL 30. Thinking and Reasoning in
CONSTRAINTS ON HUMAN Medicine 727
THOUGHT Vimla L. Patel
José F. Arocha
1 9. Thinking in Working Memory 45 7 Jiajie Zhang
Robert G. Morrison
31 . Intelligence 75 1
2 0. Cognitive Neuroscience of Robert J. Sternberg
Deductive Reasoning 475
Vinod Goel
32 . Learning to Think: The
Challenges of Teaching Thinking 775
2 1 . Cognitive and Neuroscience Ron Ritchhart
Aspects of Thought Disorder 493 David N. Perkins
Peter Bachman
Tyrone D. Cannon Index 803
Preface
A few decades ago, when the science of soning). Even more crucially, this handbook
cognition was in its infancy, the early text- would provide an entry point into the field
books on cognition began with perception for the next generation of researchers by pro-
and attention and ended with memory. So- viding a text for use in classes on thinking and
called higher-level cognition – the mysteri- reasoning designed for graduate students and
ous, complicated realm of thinking and rea- upper-level undergraduates.
soning – was simply left out. Things have The Cambridge Handbook of Thinking and
changed – any good cognitive text (and there Reasoning is intended to be this previously
are many) devotes several chapters to topics missing handbook. The project was first con-
such as categorization, inductive and deduc- ceived at the meeting of the Cognitive Sci-
tive reasoning, judgment and decision mak- ence Society in Edinburgh, Scotland, dur-
ing, and problem solving. What has still been ing the summer of 2001 . The contents of
missing, however, is a true handbook for the volume are sketched in Chapter 1 . Our
the field of thinking and reasoning – a book aim is to provide comprehensive and au-
meant to be kept close “at hand” by those in- thoritative reviews of all the core topics of
volved in the field. Such a book would bring the field of thinking and reasoning, with
together top researchers to write chapters, many pointers for further reading. Undoubt-
each of which summarizes the basic con- edly, there are still omissions, but we have
cepts and findings for a major topic, sketches included as much as we could realistically
its history, and provides a sense of the di- fit in a single volume. Our focus is on re-
rections in which research is currently head- search from cognitive psychology, cognitive
ing. This handbook would provide quick science, and cognitive neuroscience, but we
overviews for experts in each topic area, and also include work related to developmen-
more importantly for experts in allied topic tal, social, and clinical psychology; philos-
areas (because few researchers can keep up ophy; economics; artificial intelligence; lin-
with the scientific literature over the full guistics; education; law; and medicine. We
breadth of the field of thinking and rea- hope that scholars and students in all these
ix
x preface
fields and others will find this to be a valuable tual problems sometimes experience a mo-
collection. ment of insight – a sense that although many
We have many to thank for their help laborious steps may lay ahead, the basic ele-
in bringing this endeavor to fruition. Philip ments of a solution are already in place. Such
Laughlin, our editor at Cambridge Univer- fortunate people work on happily, confident
sity Press, gave us exactly the balance of that ultimate success is assured. In preparing
encouragement and patience we needed. It this handbook, we also had our moment of
is fitting that a handbook of thinking and “insight.” It came when all these outstanding
reasoning should bear the imprint and in- researchers agreed to join our project. Be-
deed the name of this illustrious press, with fore the first chapter was drafted, we knew
its long history reaching back to the ori- the volume was going to be of the highest
gins of scientific inquiry. Michie Shaw, Se- quality. Along the way, our distinguished au-
nior Project Manager at TechBooks, pro- thors graciously served as each other’s crit-
vided us with close support throughout the ics as we passed drafts around, working to
arduous editing process. At UCLA, Chris- make the chapters as integrated as possible,
tine Vu did a great deal of organizational adding in pointers from one to another. Then
work in her role as our editorial assistant the authors all changed hats again and went
for the entire project. During this period, back to work revising their own chapters in
our own efforts were supported by grants light of the feedback their peers had pro-
R3 05 H03 01 41 from the Institute of Educa- vided. We thank you all for making our own
tion Sciences and SES-00803 75 from the small labors a great pleasure.
National Science Foundation to KJH, and
from Xunesis and National Service Research KEITH J. HOLYOAK
Award MH-064244 from the National Insti- University of California, Los Angeles
tute of Mental Health to RGM.
Then there are the authors. (It would ROBERT G. MORRISON
seem a bit presumptuous to call them “our” Xunesis, Chicago
authors!) People working on tough intellec- October 2 004
Contributors
John R. Anderson Marc J. Buehner

Carnegie Mellon University School of Psychology
Department of Psychology Cardiff University
Pittsburgh, PA 1 5 21 3 -3 890 Tower Building
ja+@cmu.edu Park Place
Cardiff, CF1 0 3 AT
José F. Arocha Wales, UK
Department of Health Studies & Gerontology BuehnerM@Cardiff.ac.uk
University of Waterloo Josep Call
200 University Ave. W. Max Planck Institute for Evolutionary
Waterloo, Ontario Anthropology
Canada N2L 3 G1 Deutscher Platz 6
jfarocha@healthy.uwaterloo.ca D-041 03 Leipzig, Germany
call@eva.mpg.de
Peter Bachman
University of California, Los Angeles Tyrone D. Cannon
Department of Psychology University of California, Los Angeles
Franz Hall Department of Psychology
Los Angeles, CA 90095 -1 5 63 Franz Hall
bachman@psych.ucla.edu Los Angeles, CA 90095 -1 5 63
cannon@psych.ucla.edu
Miriam Bassok Patricia W. Cheng
University of Washington University of California, Los Angeles
Department of Psychology Department of Psychology
Box 3 5 1 5 25 Franz Hall
Seattle, WA 981 95 -1 5 25 Los Angeles, CA 90095 -1 5 63
mbassok@u.washington.edu cheng@psych.ucla.edu
xi
xii contributors
Michelene T. H. Chi Rochel Gelman

University of Pittsburgh Rutgers University
Learning Research and Development Center Psychology and Rutgers Center for Cognitive
3 93 9 O’Hara Street Science
Pittsburgh, PA 1 5 260 1 5 2 Frelinghuysen Road
chi@pitt.edu Piscataway, NJ 0885 4-8020
rgelman@ruccs.rutgers.edu
Leonidas A. A. Doumas
University of California, Los Angeles Lila Gleitman
Department of Psychology University of Pennsylvania
Franz Hall Departments of Psychology and Linguistics
Los Angeles, CA 90095 -1 5 63 Institute for Research in Cognitive Science
adoumas@psych.ucla.edu 3 401 Walnut St. – 4th floor
Philadelphia, PA 1 91 04
Kevin Dunbar gleitman@cattell.psych.upenn.edu
Dartmouth College
Department of Psychological & Brain Sciences
Hanover, NH 03 75 5 Vinod Goel
kevin.n.dunbar@dartmouth.edu York University
Department of Psychology
Phoebe C. Ellsworth Toronto, Ontario
University of Michigan Canada M3 J 1 P3
Department of Psychology vgoel@yorku.ca
5 25 East University
Ann Arbor, MI 481 09-1 1 09 Robert L. Goldstone
pce@umich.edu Indiana University
Psychology Department
Jonathan St. B. T. Evans Psychology Building
University of Plymouth 1 1 01 E 1 0th St.
Centre for Thinking and Language Bloomington, IN 47405 -7007
School of Psychology rgoldsto@indiana.edu
Plymouth PL4 8AA UK
J.Evans@plymouth.ac.uk Patricia M. Greenfield
University of California, Los Angeles
Shane Frederick Department of Psychology
Massachusetts Institute of Technology Franz Hall
Sloan School of Management Los Angeles, CA 90095 -1 5 63
Room E5 6-3 1 7 greenfield@psych.ucla.edu
3 8 Memorial Drive
Cambridge, MA 021 42-1 3 07
Graeme S. Halford
shanefre@mit.edu
University of Queensland
Jonathan Fugelsang School of Psychology
Dartmouth College Brisbane
Department of Psychological & Brain Sciences Queensland 4072
Hanover, NH 03 75 5 Australia
jonf@dartmouth.edu gsh@psy.uq.edu.au
Charles R. Gallistel E. Tory Higgins

Rutgers University Columbia University
Psychology and Rutgers Center for Cognitive Department of Psychology
Science 401 D Schermerhorn
1 5 2 Frelinghuysen Road Mail Code 5 5 01
Piscataway, NJ 0885 4-8020 New York, NY 1 0027-5 5 01
galliste@ruccs.rutgers.edu tory@psych.columbia.edu
contributors xiii
Keith J. Holyoak – editor Marsha C. Lovett

University of California, Los Angeles Carnegie Mellon University
Department of Psychology Department of Psychology
Franz Hall Pittsburgh, PA 1 5 21 3 -3 890
Los Angeles, CA 90095 -1 5 63 lovett+@cmu.edu
holyoak@lifesci.ucla.edu
Todd I. Lubart
John E. Hummel Laboratoire Cognition et Développement
University of California, Los Angeles Institut de Psychologie – Université Paris 5
Department of Psychology 71 , avenue Edouard Vaillant
Franz Hall 92774 Boulogne-Billancourt cedex
Los Angeles, CA 90095 -1 5 63 France
jhummel@psych.ucla.edu lubart@psycho.univ-paris5 .fr
P. N. Johnson-Laird
Douglas L. Medin
Princeton University
Northwestern University
3 -C-3 Green Hall
2029 Sheridan Road
Princeton, NJ 085 44
Evanston, IL 60208
phil@princeton.edu
medin@northwestern.edu
Daniel Kahneman
Princeton University Daniel C. Molden
Woodrow Wilson School Northwestern University
3 24 Wallace Hall Department of Psychology
Princeton, NJ 085 44 2029 Sheridan Road
kahneman@princeton.edu Evanston, IL 60208
molden@northwestern.edu
James C. Kaufman
California State University, San Bernardino Robert G. Morrison – editor
Department of Psychology Xunesis
5 5 00 University Parkway P.O. Box 2691 87
San Bernardino, CA 92407 Chicago, IL 60626-91 87
jkaufman@csusb.edu robertmorrison@xunesis.org
David A. Lagnado Laura R. Novick
Department of Psychology Vanderbilt University
University College London Department of Psychology & Human
Gower Street Development
London, UK WC1 E 6BT Peabody College #5 1 2
d.lagnado@ucl.ac.uk 23 0 Appleton Place
Robyn A. LeBoeuf Nashville, TN 3 7203 -5 721
University of Florida Laura.Novick@vanderbilt.edu
Warrington College of
Business Stellan Ohlsson
Marketing Department University of Illinois, Chicago
PO Box 1 1 71 5 5 Department of Psychology
Gainesville, FL 3 261 1 -71 5 5 Chicago, IL 60607-71 3 7
LeBoeuf@ufl.edu stellan@uic.edu
Leib Litman Anna Papafragou

Brooklyn College of CUNY University of Pennsylvania
Department of Psychology Institute for Research in Cognitive Science
2900 Bedford Avenue 3 401 Walnut Street, Suite 400A
Brooklyn, NY 1 1 21 0 Philadelphia, PA 1 91 04
LeibL@Brooklyn.cuny.edu anna4@linc.cis.upenn.edu
xiv contributors
Vimla L. Patel Eldar B. Shafir

Columbia University Princeton University
Department of Biomedical Informatics and Department of Psychology and the
Psychiatry Woodrow Wilson School of Public Affairs
Vanderbilt Clinic-5 622 West 1 68th Street Green Hall
New York, NY 1 0003 Princeton, NJ 085 44
patel@dbmi.columbia.edu shafir@princeton.edu
David N. Perkins Steven A. Sloman

Harvard Graduate School of Brown University
Education Cognitive & Linguistic Sciences
Project Zero Box 1 978
3 1 5 Longfellow Hall, Appian Way Providence, RI 0291 2
Cambridge, MA 021 3 8 Steven Sloman@brown.edu
david perkins@harvard.edu
Ji Yun Son
Jean E. Pretz Indiana University
Department of Psychology Psychology Department
Illinois Wesleyan University Psychology Building
P.O. Box 2900 1 1 01 E 1 0th St.
Bloomington, IL 61 702-2900 Bloomington, IN 47405 -7007
jpretz@iwu.edu jys@indiana.edu
Arthur S. Reber Robert J. Sternberg

Brooklyn College of CUNY PACE Center
Department of Psychology Yale University
2900 Bedford Avenue P.O. Box 2083 5 8
Brooklyn, NY 1 1 21 0 New Haven, CT 065 20-83 5 8
areber@brooklyn.cuny.edu robert.sternberg@yale.edu
Lance J. Rips Michael Tomasello

Northwestern University Max Planck Institute for Evolutionary
Department of Psychology Anthropology
2029 Sheridan Road Deutscher Platz 6
Evanston, IL 60208 D-041 03 Leipzig, Germany
rips@northwestern.edu tomas@eva.mpg.de
Ron Ritchhart Barbara Tversky

Harvard Graduate School of Stanford University
Education Department of Psychology
Project Zero Building 420
1 24 Mount Auburn Street Stanford, CA 943 05 -21 3 0
Cambridge, MA 021 3 8 bt@psych.stanford.edu
ron ritchhart@pz.harvard.edu
Jiajie Zhang
Timothy A. Salthouse School of Health Information Sciences
University of Virginia University of Texas at Houston
Department of Psychology 7000 Fannin, Suite 600
Charlottesville, VA 22904-4400 Houston, TX 7703 0
salthouse@virginia.edu Jiajie.Zhang@uth.tmc.edu
CHAPTER 1
Thinking and Reasoning:

A Reader’s Guide
Keith J. Holyoak
Robert G. Morrison
“Cogito, ergo sum,” the French philosopher What Is Thinking?

René Descartes famously declared, “I think,
therefore I am.” Every normal human adult We can start to answer this question by look-
shares a sense that the ability to think, to rea- ing at the various ways the word “think-
son, is a part of their fundamental identity. ing” is used in everyday language. “I think
A person may be struck blind or deaf, yet that water is necessary for life” and “George
we still recognize his or her core cognitive thinks the Pope is a communist” both ex-
capacities as intact. Even loss of language, press beliefs (of varying degrees of appar-
the gift often claimed as the sine qua non ent plausibility), that is, explicit claims of
of homo sapiens, does not take away a per- what someone takes to be a truth about the
son’s essential humanness. Unlike language world. “Anne is sure to think of a solution”
ability, which is essentially unique to our carries us into the realm of problem solv-
species, the rudimentary ability to think and ing, the mental construction of an action
reason is apparent in nonhuman primates plan to achieve a goal. The complaint “Why
(see Call & Tomasello, Chap. 25 ); and yet it didn’t you think before you went ahead with
is thinking, not language, that lies closest to your half-baked scheme?” emphasizes that
the core of our individual identity. A person thinking can be a kind of foresight, a way
who loses language but can still make intel- of “seeing” the possible future.1 “What do
ligent decisions, as demonstrated by actions, you think about it?” calls for a judgment,
is viewed as mentally intact. In contrast, the an assessment of the desirability of an op-
kinds of brain damage that rob an individ- tion. Then there’s “Albert is lost in thought,”
ual of the capacity to think and reason are where thinking becomes some sort of mental
considered the harshest blows that can be meadow through which a person might me-
struck against a sense of personhood. Cogito, ander on a rainy afternoon, oblivious to the
ergo sum. world outside.
1
2 the cambridge handbook of thinking and reasoning
Rips and Conrad (1 989) elicited judg- state of affairs, some goal that motivates the
ments from college students about how var- thinker to perform mental work.
ious mentalistic terms relate to one another. Our definition thus includes quite a few
Using statistical techniques, the investigators stipulations, but notice also what is left out.
were able to summarize these relationships We do not claim that thinking necessarily
in two diagrams, shown in Figure 1 .1 . Fig- requires a human (higher-order primates,
ure 1 .1 (A) is a hierarchy of kinds, or cat- and perhaps some other species on this or
egories. Roughly, people believe planning other planets, have a claim to be considered
is a kind of deciding, which is a kind of thinkers) (see Call & Tomasello, Chap. 25 )
reasoning, which is a kind of conceptual- or even a sentient being. (The field of ar-
izing, which is a kind of thinking. People tificial intelligence may have been a disap-
also believe that thinking is part of con- pointment in its first half-century, but we
ceptualizing, which is part of remembering, are reluctant to define it away as an oxy-
which is part of reasoning, and so on [Fig- moron.) Nonetheless, our focus in this book
ure 1 .1 (B)]. The kinds ordering and the parts is on thinking by hominids with electro-
ordering are similar; most strikingly, “think- chemically powered brains. Thinking often
ing” is the most general term in both order- seems to be a conscious activity of which
ings – the grand superordinate of mental ac- the thinker is aware (cogito, ergo sum); how-
tivities, which permeates all the others. ever, consciousness is a thorny philosophi-
It is not easy to make the move from the cal puzzle, and some mental activities seem
free flow of everyday speech to scientific def- pretty much like thinking, except for being
initions of mental terms, but let us nonethe- implicit rather than explicit (see Litman &
less offer a preliminary definition of thinking Reber, Chap. 1 8). Finally, we do not claim
to suggest what this book is about: Thinking that thinking is inherently rational, optimal,
is the systematic transformation of mental rep- desirable, or even smart. A thorough history
resentations of knowledge to characterize ac- of human thinking will include quite a few
tual or possible states of the world, often in chapters on stupidity.
service of goals. Obviously, our definition in- The study of thinking includes several in-
troduces a plethora of terms with meanings terrelated subfields that reflect slightly dif-
that beg to be unpacked, but at which we can ferent perspectives on thinking. Reasoning,
only hint. A mental representation of knowl- which has a long tradition that springs from
edge is an internal description that can be philosophy and logic, places emphasis on the
manipulated to form other descriptions. To process of drawing inferences (conclusions)
count as thinking, the manipulations must from some initial information (premises). In
be systematic transformations governed by standard logic, an inference is deductive if the
certain constraints. Whether a logical deduc- truth of the premises guarantees the truth
tion or a creative leap, what we mean by of the conclusion by virtue of the argument
thinking is more than unconstrained associ- form. If the truth of the premises renders the
ations (with the caveat that thinking may in- truth of the conclusion more credible but
deed be disordered; see Bachman & Cannon, does not bestow certainty, the inference is
Chap. 21 ). The internal representations cre- called inductive.2 Judgment and decision mak-
ated by thinking describe states of some ex- ing involve assessment of the value of an
ternal world (a world that may include the option or the probability that it will yield
thinker as an object of self-reflection) – that a certain payoff ( judgment) coupled with
world might be our everyday one, or per- choice among alternatives (decision mak-
haps some imaginary construction obeying ing). Problem solving involves the construc-
the “laws” of magical realism. Often (not tion of a course of action that can achieve a
always – the daydreamer, and indeed the goal.
night dreamer, are also thinkers), thinking Although these distinct perspectives on
is directed toward achieving some desired thinking are useful in organizing the field
thinking and reasoning: a reader’s guide 3
Figure 1 .1 . People’s conceptions of the relationships among terms for mental activities. A, Ordering
of “kinds.” B, Ordering of “parts.” (Adapted from Rips & Conrad, 1 989, with permission.)
(and this volume), these aspects of thinking eighteenth-century philosophers Immanuel

overlap in every conceivable way. To solve Kant (in Germany) and David Hume (in
a problem, one is likely to reason about the Scotland) laid the foundations for all subse-
consequences of possible actions and make quent work on the origins of causal knowl-
decisions to select among alternative actions. edge, perhaps the most central problem in
A logic problem, as the name implies, is a the study of thinking (see Buehner & Cheng,
problem to be solved (with the goal of de- Chap. 7). If we were to choose one phrase
riving or evaluating a possible conclusion). to set the stage for modern views of think-
Making a decision is often a problem that ing, it would be an observation of the British
requires reasoning. These subdivisions of the philosopher Thomas Hobbes, who, in 1 65 1 ,
field, like our preliminary definition of think- in his treatise Leviathan, proposed, “Rea-
ing, should be treated as guideposts, not soning is but reckoning.” “Reckoning” is an
destinations. odd term today, but in the seventeenth cen-
tury it meant computation, as in arithmetic
calculations.3
It was not until the twentieth century that
A Capsule History the psychology of thinking became a scien-
tific endeavor. The first half of the century
Thinking and reasoning, long the academic gave rise to many important pioneers who
province of philosophy, have over the past in very different ways laid the foundations
century emerged as core topics of empirical for the emergence of the modern field of
investigation and theoretical analysis in the thinking and reasoning. Foremost were the
modern fields known as cognitive psychol- Gestalt psychologists of Germany, who pro-
ogy, cognitive science, and cognitive neuro- vided deep insights into the nature of prob-
science. Before psychology was founded, the lem solving (see Novick & Bassok, Chap. 1 4).
Most notable of the Gestaltists were Karl predictions (much like an engineer might
Duncker and Max Wertheimer, students use a physical scale model of a bridge to
of human problem solving, and Wolfgang anticipate the effects of stress on the ac-
Köhler, a keen observer of problem solv- tual bridge intended to span a river).4 In the
ing by great apes (see Call & Tomasello, 1 960s and 1 970s, modern work on the psy-
Chap. 25 ). chology of reasoning began in Britain with
The pioneers of the early twentieth cen- the contributions of Peter Wason and his col-
tury also include Sigmund Freud, whose laborator Philip Johnson-Laird (see Evans,
complex and ever-controversial legacy in- Chap. 8).
cludes the notions that forms of thought The modern conception of thinking as
can be unconscious (see Litman & Reber, computation became prominent in the
Chap. 1 8) and that “cold” cognition is tan- 1 970s. In their classic treatment of human
gled up with “hot” emotion (see Molden & problem solving, Allen Newell and Herbert
Higgins, Chap. 1 3 ). As the founder of clini- Simon (1 972) showed that the computa-
cal psychology, Freud’s legacy also includes tional analysis of thinking (anticipated by
the ongoing integration of research on nor- Alan Turing, the father of computer science)
mal thinking with studies of thought disor- could yield important empirical and theo-
ders, such as schizophrenia (see Bachman & retical results. Like a program running on a
Cannon, Chap. 21 ). digital computer, a person thinking through
Other early pioneers in the early and a problem can be viewed as taking an in-
mid-twentieth century contributed to vari- put that represents initial conditions and a
ous fields of study that are now embraced goal, and applying a sequence of operations
within thinking and reasoning. Cognitive de- to reduce the difference between the initial
velopment continues to be influenced by the conditions and the goal. The work of Newell
early theories developed by the Swiss psy- and Simon established computer simulation
chologist Jean Piaget (see Halford, Chap. 22) as a standard method for analyzing human
and the Russian psychologist Lev Vygotsky thinking. Their work also highlighted the po-
(see Greenfield, Chap. 27). In the United tential of production systems (see Novick &
States, Charles Spearman was a leader in the Bassok, Chap. 1 4), which were subsequently
systematic study of individual differences in developed extensively as cognitive models
intelligence (see Sternberg, Chap. 3 1 ). In the by John Anderson and his colleagues (see
middle of the century, the Russian neurolo- Lovett & Anderson, Chap. 1 7).
gist Alexander Luria made immense contri- The 1 970s saw a wide range of major de-
butions to our understanding of how think- velopments that continue to shape the field.
ing depends on specific areas of the brain, Eleanor Rosch, building on earlier work by
anticipating the modern field of cognitive Jerome Bruner (Bruner, Goodnow, & Austin,
neuroscience (see Goel, Chap. 20). Around 1 95 6), addressed the fundamental question
the same time, in the United States, Herbert of why people have the categories they do,
Simon argued that the traditional rational and not other logically possible groupings of
model of economic theory should be re- objects (see Medin & Rips, Chap. 3 ). Rosch
placed with a framework that accounted for argued that natural categories often have
a variety of human resource constraints such fuzzy boundaries (a whale is an odd mam-
as bounded attention and memory capac- mal) but nonetheless have clear central ten-
ity and limited time (see LeBoeuf & Shafir, dencies or prototypes (people by and large
Chap. 1 1 , and Morrison, Chap. 1 9). This was agree that a bear makes a fine mammal).
one of the contributions that in 1 978 earned The psychology of human judgment was re-
Simon the Nobel Prize in Economics. shaped by the insights of Amos Tversky and
In 1 943 , the British psychologist Kenneth Daniel Kahneman, who identified simple
Craik sketched the fundamental notion that cognitive strategies, or heuristics, that people
a mental representation provides a kind of use to make judgments of frequency and
model of the world that can be “run” to make probability. Often quick and accurate, these
strategies can in some circumstances lead might be performed using numbers in ei-
to nonnormative judgments. After Tversky’s ther decimal or binary code, starting with
death in 1 996, this line of work was con- either the leftmost or rightmost digit. Fi-
tinued by Kahneman, who was awarded the nally, the level of implementation addresses
Nobel Prize in Economics in 2002. The cur- the question, “How are the representation
rent view of judgment, which has emerged and algorithm realized physically?” The cash
from 3 0 years of research, is summarized by register could be implemented as an elec-
Kahneman and Frederick (Chap. 1 2; also see tronic calculator, a mechanical adding ma-
LeBoeuf & Shafir, Chap. 1 1 ). (Goldstone and chine, or even a mental abacus in the mind of
Son, Chap. 2, review Tversky’s influential the clerk.
theory of similarity judgments.) In his book, Marr stressed the importance
In 1 982, a young vision scientist, David of the computational level of analysis, ar-
Marr, published a book called Vision. Largely guing that it could be seriously misleading
a technical treatment of visual perception, to focus prematurely on the more concrete
the book includes an opening chapter that levels of analysis for a cognitive task with-
lays out a larger vision – a vision of how out understanding the goal or nature of the
the science of mind should proceed. Marr mental computation.5 Sadly, Marr died of
distinguished three levels of analysis, which leukemia before Vision was published, and
he termed the level of computation, the level so we do not know how his thinking about
of representation and algorithm, and the level levels of analysis might have evolved. In
of implementation. Each level, according to very different ways, Marr’s conception of a
Marr, addresses different questions, which computational level of analysis is reflected
he illustrated with the example of a phys- in several chapters in this book (see espe-
ical device, the cash register. At Marr’s most cially Doumas & Hummel, Chap. 4; Buehner
abstract level, computation (not to be con- & Cheng, Chap. 7; Lovett & Anderson,
fused with computation of an algorithm on a Chap. 1 7).
computer), the basic questions are “What is In the most recent quarter-century, many
the goal that the cognitive process is meant other springs of research have fed into the
to accomplish?” and “What is the logic of the river of thinking and reasoning, including
mapping from the input to the output the field of analogy (see Holyoak, Chap. 6),
that distinguishes this mapping from other neural network models (see Doumas &
input–output mappings?” A cash register, Hummel, Chap. 4; Halford, Chap. 22), and
viewed at this level, is used to achieve the cognitive neuroscience (see Goel, Chap. 20).
goal of calculating how much is owed for a The chapters of this handbook collectively
purchase. This task maps precisely onto the paint a picture of the state of the field at the
axioms of addition (e.g., the amount owed dawn of the new millennium.
should not vary with the order in which
items are presented to the sales clerk, a
constraint that precisely matches the com-
mutativity property of addition). It follows Overview of the Handbook
that, without knowing anything else about
the workings of a particular cash register, This volume brings together the contribu-
we can be sure (if it is working prop- tions of many of the leading researchers
erly) that it will be performing addition in thinking and reasoning to create the
(not division). most comprehensive overview of research
The level of representation and algo- on thinking and reasoning that has ever been
rithm, as the name implies, deals with the available. Each chapter includes a bit of his-
questions, “What is the representation of torical perspective on the topic and ends
the input and output?” and “What is the with some thoughts about where the field
algorithm for transforming the former into seems to be heading. The book is organized
the latter?” Within a cash register, addition into seven sections.
Part I: The Nature of Human Concepts Part III: Judgment and Decision Making
The three chapters in Part I address foun- We then turn to topics related to judgment
dational issues related to the representation and decision making. In Chapter 1 1 , LeBoeuf
of human concepts. Chapter 2 by Gold- and Shafir set the stage with a general re-
stone and Son reviews work on the core view of work on decision making. Then,
concept of similarity – how people assess in Chapter 1 2, Kahneman and Frederick
the degree to which objects or events are present an overarching model of heuristic
alike. Chapter 3 by Medin and Rips consid- judgment. In Chapter 1 3 , Molden and Hig-
ers research on categories and how concepts gins review research revealing the ways in
are organized in semantic memory. Think- which human motivation and emotion influ-
ing depends not only on representations of ence judgment.
individual concepts, such as dogs and cats,
but also on representations of the relation- Part IV: Problem Solving
ships among concepts, such as the fact that and Complex Learning
dogs often chase cats. In Chapter 4, Doumas
and Hummel evaluate different compu- The five chapters that comprise this section
tational approaches to the representation deal with problem solving and allied issues
of relations. concerning how people learn in problem-
solving situations. In Chapter 1 4, Novick
and Bassok provide a general overview of
Part II: Reasoning the field of human problem solving. Prob-
lem solving has close connections to the
Chapters 5 to 1 0 deal with varieties of topic of creativity, the focus of Chapter 1 5
the core topic of reasoning. In Chapter 5 , by Sternberg, Lubart, Kaufman, and Pretz.
Sloman and Lagnado set the stage by lay- Beyond relatively routine problem solving,
ing out the issues surrounding induction – there are occasions when people need to re-
using what is known to generate plausi- structure their knowledge in complex ways
ble, although uncertain, inferences. Then, to generate deeper understanding. How such
in Chapter 6, Holyoak reviews the liter- complex learning takes place is the topic of
ature on reasoning by analogy, an impor- Chapter 1 6 by Chi and Ohlsson. In Chap-
tant variety of inductive reasoning that is ter 1 7, Lovett and Anderson review work
critical for learning. The most classic as- on thinking that is based on a particular
pect of induction is the way in which hu- formal approach rooted in work on prob-
mans and other creatures acquire knowledge lem solving, namely, production systems.
about causal relations, which is critical for Finally, in Chapter 1 8, Litman and Reber
predicting the consequences of actions and consider research suggesting that some as-
events. In Chapter 7, Buehner and Cheng pects of thinking and learning depend on im-
discuss research and theory on causal learn- plicit mechanisms that operate largely out-
ing. Then, in Chapter 8, Evans reviews work side of awareness.
on the psychology of deductive reasoning,
the form of thinking with the closest ties
Part V: Cognitive and Neural Constraints
to logic. In Chapter 9, Johnson-Laird de-
on Human Thought
scribes the work that he and others have
performed using the framework of men- High-level human thinking cannot be fully
tal models to deal with various reasoning understood in isolation from fundamental
tasks, both deductive and inductive. Men- cognitive processes and their neural sub-
tal models have close connections to percep- strates. In Chapter 1 9, Morrison reviews the
tual representations that are visuospatial in wealth of evidence indicating that thinking
Chapter 1 0, Barbara Tversky reviews work and reasoning depend critically on what is
on the role of visuospatial representations known as “working memory,” that is, the sys-
in thinking. tem responsible for short-term maintenance
and manipulation of information. Current ferences in the nature and quality of hu-
work is making headway in linking thought man thinking. This section includes three
processes to specific brain structures such as chapters focusing on thinking in particu-
the prefrontal cortex; in Chapter 20, Goel lar practices and two chapters that deal
discusses the key topic of deductive reason- with variations in thinking ability. In Chap-
ing in relation to its neural substrate. Brain ter 28, Ellsworth reviews what is known
disorders, notably schizophrenia, produce about thinking in the field of law. In Chap-
striking disruptions of normal thought pro- ter 29, Dunbar and Fugelsang discuss think-
cesses, which can shed light on how thinking ing and reasoning as manifested in the prac-
takes place in normal brains. In Chapter 21 , tice of science. In Chapter 3 0, Patel, Arocha,
Bachman and Cannon review research and and Zhang discuss reasoning in a field –
theory concerning thought disorder. medicine – in which accurate diagnosis and
treatment are literally everyday matters of
Part VI: Ontogeny, Phylogeny, Language, life and death. Then, in Chapter 3 1 , Stern-
and Culture berg reviews work on the concept of intel-
ligence as a source of individual differences
Our understanding of thinking and reason-
in thinking and reasoning. Finally, Chapter
ing would be gravely limited if we restricted
3 2 by Ritchhart and Perkins concludes the
investigation to young adult English speak-
volume by reviewing one of the major chal-
ers. The six chapters in Part VI deal with the
lenges for education – finding ways to teach
multifaceted ways in which aspects of think-
people to think more effectively.
ing vary across the human lifespan, across
species, across speakers of different lan-
guages, and across cultures. In Chapter 22,
Halford provides an overview of the devel- Examples of Chapter Assignments
opment of thinking and reasoning over the for a Variety of Courses
course of childhood. In Chapter 23 , Gallistel
and Gelman discuss mathematical thinking, This volume offers a comprehensive treat-
a special form of thinking found in rudi- ment of higher cognition. As such, it serves
mentary form in nonhuman animals that un- as an excellent source for courses on think-
dergoes development in children. In Chap- ing and reasoning, both at the graduate
ter 24, Salthouse describes the changes in level and for upper-level undergraduates. Al-
thinking and reasoning brought on by the though instructors for semester-length grad-
aging process. The phylogeny of thinking – uate courses in thinking and reasoning may
thinking and reasoning as performed by apes opt to assign the entire volume as a text-
and monkeys – is discussed in Chapter 25 by book, there are a number of other possibili-
Call and Tomasello. One of the most contro- ties (including using chapters from this vol-
versial topics in the field is the relationship ume as introductions for various topics and
between thinking and the language spoken then supplementing with readings from the
by the thinker; in Chapter 26, Gleitman primary literature). Here are a few examples
and Papafragou review the hypotheses and of possible chapter groupings tailored to a
evidence concerning the connections be- variety of possible course offerings:
tween language and thought. In Chapter 27,
Greenfield considers the ways in which
Introduction to Thinking and Reasoning
modes of thinking may vary in the context
of different human cultures. Chapter 1 Thinking and Reasoning: A
Reader’s Guide
Part VII: Thinking in Practice Chapter 2 Similarity
In cultures ancient and modern, thinking Chapter 3 Concepts and Categories:
is put to particular use in special cultural Memory, Meaning, and
practices. Moreover, there are individual dif- Metaphysics
Chapter 5 The Problem of Induction Chapter 7 Causal Learning

Chapter 6 Analogy Chapter 9 Mental Models and Thought
Chapter 7 Causal Learning Chapter 2 2 Development of Thinking
Chapter 8 Deductive Reasoning Chapter 1 7 Thinking as a Production
Chapter 9 Mental Models and Thought System
Chapter 1 0 Visuospatial Reasoning
Chapter 1 1 Decision Making Applied Thought
Chapter 1 2 A Model of Heuristic
Judgment Chapter 1 4 Problem Solving
Chapter 1 4 Problem Solving Chapter 1 0 Visuospatial Reasoning
Chapter 1 5 Creativity Chapter 2 3 Mathematical Thinking
Chapter 1 6 Complex Declarative Chapter 2 6 Language and Thought
Learning Chapter 1 5 Creativity
Chapter 1 8 Implicit Cognition and Chapter 3 1 Intelligence
Thought Chapter 1 3 Motivated Thinking
Chapter 2 7 Paradigms of Cultural
Thought
Development of Thinking
Chapter 1 6 Complex Declarative
Chapter 2 Similarity Learning
Chapter 3 Concepts and Categories: Chapter 1 8 Implicit Cognition and
Memory, Meaning, and Thought
Metaphysics Chapter 2 8 Legal Reasoning
Chapter 2 2 Development of Thinking Chapter 2 9 Scientific Thinking and
Chapter 2 3 Mathematical Thinking Reasoning
Chapter 2 6 Language and Thought Chapter 3 0 Reasoning in Medicine
Chapter 2 4 Effects of Aging on Reasoning
Chapter 2 5 Reasoning and Thinking in Differences in Thought
Nonhuman Primates
Chapter 1 9 Thinking in Working Memory Chapter 3 1 Intelligence
Chapter 3 1 Intelligence Chapter 1 5 Creativity
Chapter 3 2 Learning to Think: The Chapter 1 9 Thinking in Working Memory
Challenges of Teaching Chapter 2 1 Cognitive and Neuroscience
Thinking Aspects of Thought Disorder
Chapter 2 2 Development of Thinking
Chapter 2 5 Reasoning and Thinking in
Modeling Human Thought Nonhuman Primates
Chapter 2 Similarity Chapter 2 4 Effects of Aging on Reasoning
Chapter 3 Concepts and Categories: Chapter 2 6 Language and Thought
Memory, Meaning, and Chapter 1 3 Motivated Thinking
Metaphysics Chapter 2 7 Paradigms of Cultural
Chapter 4 Approaches to Modeling Thought
Human Mental Chapter 2 9 Scientific Thinking and
Representations: Reasoning
What Works, What Doesn’t, Chapter 3 2 Learning to Think: The
and Why Challenges of Teaching
Chapter 6 Analogy Thinking
Acknowledgments tive reasoning.” In an old Western movie, a

hero in a tough spot might venture, “I reckon
Preparation of this chapter was supported we can hold out till sun-up,” illustrating how
calculation has crossed over to become a
by grants R3 05 H03 01 41 from the Institute
metaphor for mental judgment.
of Education Sciences and SES-00803 75
4. See Johnson-Laird, Chap. 9, for a current view
from the National Science Foundation to
of thinking and reasoning that owes much to
Holyoak, and by Xunesis (www.xunesis.org) Craik’s seminal ideas.
and a National Institute of Mental Health
5 . Indeed, Marr criticized Newell and Simon’s
National Service Research Award (MH- approach to problem solving for paying insuf-
064244) to Morrison. The authors thank ficient attention to the computational level in
Miriam Bassok and Patricia Cheng for his sense.
comments on an earlier draft of this chapter.
References
Notes
Bruner, J. S., Goodnow, J. J., & Austin, G. A.
(1 95 6). A study of thinking. New York: Wiley.
1 . Notice the linguistic connection between
“thinking” and “seeing,” and thought and per- Craik, K. (1 943 ). The nature of explanation. Cam-
ception, which was emphasized by the Gestalt bridge, UK: Cambridge University Press.
psychologists of the early twentieth century. Hobbes, T. (1 65 1 /1 968). Leviathan. London: Pen-
2. The distinction between deduction and in- guin Books.
duction blurs in the study of the psychol- Marr, D. (1 982). Vision. San Francisco: W. H.
ogy of thinking, as we see in Part II of Freeman.
this volume. Newell, A., & Simon, H. A. (1 972). Hu-
3 . There are echoes of the old meaning of man problem solving. Englewood Cliffs, NJ:
“reckon” in such phrases as “reckon the cost.” Prentice Hall.
As a further aside, the term “dead reckon- Rips, L. J., & Conrad, F. G. (1 989). Folk psychol-
ing,” a procedure for calculating the position ogy of mental activities. Psychological Review,
of a ship or aircraft, derives from “deduc- 96, 1 87–207.
Part I
THE NATURE OF
HUMAN CONCEPTS

CHAPTER 2
Similarity
Robert L. Goldstone
Ji Yun Son
Introduction From this perspective, psychological as-

sessments of similarity are valuable to the
Human assessments of similarity are funda- extent that they provide grounds for predict-
mental to cognition because similarities in ing as many important aspects of our world
the world are revealing. The world is an or- as possible (Holland, Holyoak, Nisbett,
derly enough place that similar objects and & Thagard, 1 986; see Dunbar & Fugelsang,
events tend to behave similarly. This fact Chap. 29). Appreciating the similarity be-
of the world is not just a fortunate coinci- tween crocodiles and alligators is helpful
dence. It is because objects are similar that because information learned about one is
they will tend to behave similarly in most generally true of the other. If we learned an
respects. It is because crocodiles and alliga- arbitrary fact about crocodiles, such as they
tors are similar in their external form, in- are very sensitive to the cold, then we could
ternal biology, behavior, diet, and customary probably infer that this fact is also true of
environment that one can often successfully alligators. As the similarity between A and
generalize from what one knows of one to B increases, so does the probability of cor-
the other. As Quine (1 969) observed, “Sim- rectly inferring that B has X upon knowing
ilarity, is fundamental for learning, knowl- that A has X (Tenenbaum, 1 999). This re-
edge and thought, for only our sense of sim- lation assumes we have no special knowl-
ilarity allows us to order things into kinds edge related to property X. Empirically, Heit
so that these can function as stimulus mean- and Rubinstein (1 994) showed that if we do
ings. Reasonable expectation depends on the know about the property, then this knowl-
similarity of circumstances and on our ten- edge, rather than a one-size-fits-all similarity,
dency to expect that similar causes will have is used to guide our inferences. For example,
similar effects” (p. 1 1 4). Similarity thus plays if people are asked to make an inference
a crucial role in making predictions because about an anatomical property, then anatom-
similar things usually behave similarly. ical similarities have more influence than
13
behavioral similarities. Boars are anatomi- Another argument for the importance of
cally but not behaviorially similar to pigs, similarity in cognition is simply that it plays
and this difference successfully predicts that a significant role in psychological accounts
people are likely to make anatomical but of problem solving, memory, prediction,
not behavioral inferences from pigs to boars. and categorization. If a problem is similar
The logical extreme of this line of reason- to a previously solved problem, then the
ing (Goodman, 1 972; Quine, 1 977) is that if solution to the old problem may be applied
one has complete knowledge about the rea- to the new problem (Holyoak & Koh, 1 987;
sons why an object has a property, then gen- Ross, 1 987, 1 989). If a cue is similar enough
eral similarity is no longer relevant to gener- to a stored memory, the memory may be
alizations. The knowledge itself completely retrieved (Raaijmakers & Shiffrin, 1 981 ).
guides whether the generalization is appro- If an event is similar enough to a previ-
priate. Moonbeams and melons are not very ously experienced event, the stored event’s
similar generally speaking, but if one is told outcome may be offered as a candidate pre-
that moonbeams have the property that the diction for the current event (Sloman, 1 993 ;
word begins with Melanie’s favorite letter, Tenenbaum & Griffiths, 2001 ). If an un-
then one can generalize this property to mel- known object is similar enough to a known
ons with very high confidence. object, then the known object’s category
By contrasting the cases of crocodiles, label may be applied to the unknown object
boars, and moonbeams, we can specify the (Nosofsky, 1 986). The act of comparing
benefits and limitations of similarity. We events, objects, and scenes and establishing
tend to rely on similarity to generate in- similarities between them is of critical
ferences and categorize objects into kinds importance for the cognitive processes we
when we do not know exactly what prop- depend on.
erties are relevant or when we cannot eas- The utility of similarity for ground-
ily separate an object into separate proper- ing our concepts has been rediscovered
ties. Similarity is an excellent example of a in all the fields comprising cognitive sci-
domain-general source of information. Even ence (see Medin & Rips, Chap. 3 ). Exem-
when we do not have specific knowledge plar (Estes, 1 994; Kruschke, 1 992; Lamberts,
of a domain, we can use similarity as a de- 2000; Medin & Schaffer, 1 978; Nosofsky,
fault method to reason about it. The contra- 1 986), instance-based (Aha, 1 992), view-
vening limitation of this domain generality based (Tarr & Gauthier, 1 998), case-based
is that when specific knowledge is available, (Schank, 1 982), nearest neighbor (Ripley,
then a generic assessment of similarity is 1 996), configural cue (Gluck & Bower,
no longer as relevant (Keil, 1 989; Murphy, 1 990), and vector quantization (Kohonen,
2002; Murphy & Medin, 1 985 ; Rips, 1 989; 1 995 ) models share the underlying strat-
Rips & Collins, 1 993 ). Artificial laboratory egy of giving responses learned from similar,
experiments in which subjects are asked to previously presented patterns to novel pat-
categorize unfamiliar stimuli into novel cat- terns. Thus, a model can respond to rep-
egories invented by the experimenter are sit- etitions of these patterns; it can also give
uations in which similarity is clearly impor- responses to novel patterns that are likely
tant because subjects have little else to use to be correct by sampling responses to
(Estes, 1 994; Nosofsky, 1 984, 1 986). How- old patterns weighted by their similar-
ever, similarity is also important in many ity to the novel pattern. Consistent with
real world situations because our knowledge these models, psychological evidence sug-
does not run as deep as we think it does gests that people show good transfer to
(Rozenblit & Keil, 2002) and because a gen- new stimuli in perceptual tasks to the ex-
eral sense of similarity often has an influence tent that the new stimuli resemble previ-
even when more specific knowledge ought ously learned stimuli (Kolers & Roediger,
to overrule it (Allen & Brooks, 1 991 ; Smith & 1 984; Palmeri, 1 997). Another common
Sloman, 1 994). feature of these approaches is that they
similarity 15
represent patterns in a relatively raw, un- A Survey of Major Approaches

processed form. This parallels the constraint to Similarity
described previously on the applicability
of similarity. Both raw representations and There have been a number of formal treat-
generic similarity assessments are most use- ments that simultaneously provide theoreti-
ful as a default strategy when one does not cal accounts of similarity and describe how it
know exactly what properties of a stimulus can be empirically measured (Hahn, 2003 ).
are important. One’s best bet is to follow the These models have had a profound practical
principle of least commitment (Marr, 1 982) impact in statistics, automatic pattern recog-
and keep mental descriptions in a relatively nition by machines, data mining, and mar-
raw form to preserve information that may keting (e.g., online stores can provide “peo-
be needed at a later point. ple similar to you liked the following other
Another reason for studying similarity is items . . . ”). Our brief survey is organized in
that it provides an elegant diagnostic tool terms of the following models: geometric,
for examining the structure of our mental feature based, alignment based, and trans-
entities and the processes that operate on formational.
them. For example, one way to tell that a
physicist has progressed beyond the novice Geometric Models and
stage is that he or she sees deep similari- Multidimensional Scaling
ties between problems that require calcu-
lation of force even though the problems Geometric models of similarity have been
are superficially dissimilar (Chi, Feltovich, among the most influential approaches to
& Glaser, 1 981 ; see Novick & Bassok, Chap. analyzing similarity (Carroll & Wish, 1 974;
1 4). Given that psychologists have no mi- Torgerson, 1 95 8, 1 965 ). These approaches
croscope with direct access to people’s rep- are exemplified by nonmetric multidimen-
resentations of their knowledge, appraisals sional scaling (MDS) models (Shepard,
of similarity provide a powerful, if indirect, 1 962a, 1 962b). MDS models represent sim-
lens onto representation/process assemblies ilarity relations between entities in terms
(see also Doumas & Hummel, Chap. 4). of a geometric model that consists of a set
A final reason to study similarity is of points embedded in a dimensionally or-
that it occupies an important ground be- ganized metric space. The input to MDS
tween perceptual constraints and higher- routines may be similarity judgments, dis-
level knowledge system functions. Similar- similarity judgments, confusion matrices,
ity is grounded by perceptual functions. A correlation coefficients, joint probabilities,
tone of 200 Hz and a tone of 202 Hz sound or any other measure of pairwise proximity.
similar (Shepard, 1 987), and the similar- The output of an MDS routine is a geomet-
ity is cognitively impenetrable (Pylyshyn, ric model of the data, with each object of
1 985 ) – enough that there is little that the data set represented as a point in an n-
can be done to alter this perceived similar- dimensional space. The similarity between a
ity. However, similarity is also highly flexi- pair of objects is taken to be inversely related
ble and dependent on knowledge and pur- to the distance between two objects’ points
pose. By focusing on patterns of motion in the space. In MDS, the distance between
and relations, even electrons and planets can points i and j is typically computed by
be made to seem similar (Gentner, 1 983 ; r1
n
Holyoak & Thagard, 1 989; see Holyoak, dissimilarity(i, j) = |Xik − X jk|r
Chap. 6). A complete account of similar- k=1
ity will make contact both with Fodor’s
(2.1 )
(1 983 ) isolated and modularized percep-
tual input devices and the “central system” where n is the number of dimensions, X ik
in which everything a person knows may is the value of dimension k for item i, and r
be relevant. is a parameter that allows different spatial
metrics to be used. With r = 2, a stan- we cannot simultaneously place Russia near

dard Euclidean notion of distance is invoked, Cuba (similarity = 7) and place Russia far
whereby the distance between two points away from Jamaica (similarity = 1 ). In MDS
is the length of the straight line connect- terms, the “stress” of the one-dimensional so-
ing the points. If r = 1 , then distance in- lution would be high. We could increase the
volves a city-block metric where the dis- dimensionality of our solution and position
tance between two points is the sum of the points in two-dimensional space. A per-
their distances on each dimension (“short- fect reconstruction of any set of proximities
cut” diagonal paths are not allowed to di- among a set of n objects can be obtained if
rectly connect points differing on more than a high enough dimensionality (specifically,
one dimension). An Euclidean metric of- n − 1 dimensions) is used.
ten provides a better fit to empirical data One of the main applications of MDS is to
when the stimuli being compared are com- determine the underlying dimensions com-
posed of integral, perceptually fused dimen- prising the set of compared objects. Once
sions such as the brightness and saturation the points are positioned in a way that faith-
of a color. Conversely, a city-block metric is fully mirrors the subjectively obtained simi-
often appropriate for psychologically sepa- larities, it is often possible to give interpreta-
rated dimensions such as brightness and size tions to the axes or to rotations of the axes.
(Attneave, 1 95 0). In the previous example, dimensions may
Richardson’s (1 93 8) fundamental insight, correspond to “political affiliation” and “cli-
which is the basis of contemporary use of mate.” Russia and Cuba would have similar
MDS, was to begin with subjects’ judgments values on the former dimension; Jamaica and
of pairwise object dissimilarity and work Cuba would have similar values on the lat-
backward to determine the dimensions and ter dimension. A study by Smith, Shoben,
dimension values that subjects used in mak- and Rips (1 974) illustrates a classic use of
ing their judgments. MDS algorithms pro- MDS (Figure 2.1 ). They obtained similar-
ceed by placing entities in an n-dimensional ity ratings from subjects on many pairs of
space such that the distances between the birds. Submitting these pairwise similarity
entities accurately reflect the empirically ob- ratings to MDS analysis, they hypothesized
served similarities. For example, if we asked underlying features that were used for rep-
people to rate the similarities [on a scale resenting the birds. Assigning subjective in-
from 1 (low similarity) to 1 0 (high similar- terpretations to the geometric model’s axes,
ity)] of Russia, Cuba, and Jamaica, we might the experimenters suggested that birds were
find represented in terms of their values on di-
mensions such as “ferocity” and “size.” It is
Similarity (Russia, Cuba) = 7 important to note that the proper psycholog-
ical interpretation of a geometric represen-
Similarity (Russia, Jamaica) = 1
tation of objects is not necessarily in terms
Similarity (Cuba, Jamaica) = 8 of its Cartesian axes. In some domains, such
as musical pitches, the best interpretation
An MDS algorithm would try to position the of objects may be in terms of their polar
three countries in a space such that coun- coordinates of angle and length (Shepard,
tries that are rated as being highly similar 1 982). More recent work has extended ge-
are very close to each other in the space. ometric representations still further, repre-
With nonmetric scaling techniques, only or- senting patterns of similarities by general-
dinal similarity relations are preserved. The ized, nonlinear manifolds (Tenenbaum, De
interpoint distances suggested by the simi- Silva, & Lanford, 2000).
larity ratings may not be simultaneously sat- MDS is also used to create a compressed
isfiable in a given dimensional space. If we representation that conveys relative similar-
limit ourselves to a single dimension (we ities among a set of items. A set of n items
place the countries on a “number line”), then requires n(n − 1 )/2 numbers to express
similarity 17
Figure 2 .1 . Two multidimensional scaling (MDS) solutions for sets of birds (A) and animals (B). The
distances between words in the MDS space reflect their psychology dissimilarity. Once an MDS
solution has been made, psychological interpretations for the dimensions may be possible. In these
solutions, the horizontal and vertical dimensions may represent size and domesticity, respectively.
(Reprinted from Rips, Shoben, & Smith, 1 974, by permission.)
all pairwise distances among the items, if for which one can obtain subjective sim-
it is assumed that any object has a dis- ilarity data. Once constructed, these nu-
tance of 0 to itself and distances are sym- meric representations can be used to pre-
metric. However, if an MDS solution fits dict people’s categorization accuracy, mem-
the distance data well, it can allow these ory performance, or learning speed. MDS
same distances to be reconstructed using models have been successful in express-
only ND numbers, where D is the num- ing cognitive structures in stimulus do-
ber of dimensions of the MDS solution. mains as far removed as animals (Smith,
This compression may be psychologically Shoben, & Rips, 1 974), Rorschach ink blots
very useful. One of the main goals of psy- (Osterholm, Woods, & Le Unes, 1 985 ),
chological representation is to create effi- chess positions (Horgan, Millis, & Neimeyer,
cient codes for representing a set of objects. 1 989), and air flight scenarios (Schvaneveldt,
Compressed representations can facilitate 1 985 ). Many objects, situations, and con-
encoding, memory, and processing. Shimon cepts seem to be psychologically structured
Edelman (1 999) proposed that both peo- in terms of dimensions, and a geomet-
ple and machines efficiently code their ric interpretation of the dimensional orga-
world by creating geometric spaces for ob- nization captures a substantial amount of
jects with much lower dimensionality than that structure.
the objects’ physical descriptions (see also
Gardenfors, 2000).
Featural Models
A third use of MDS is to create quan-
titative representations that can be used In 1 977, Amos Tversky brought into promi-
in mathematical and computational models nence what would become the main con-
of cognitive processes. Numeric representa- tender to geometric models of similarity in
tions, namely coordinates in a psychologi- psychology. The reason given for propos-
cal space, can be derived for stories, pic- ing a feature-based model was that subjec-
tures, sounds, words, or any other stimuli tive assessments of similarity did not always
satisfy the assumptions of geometric models similar to a prominent item than vice versa.
of similarity. This is consistent with the result that peo-
ple judge their friends to be more similar
to themselves than they themselves are to
problems with the standard their friends (Holyoak & Gordon, 1 983 ), un-
geometric model
der the assumption that a person is highly
Three assumptions of standard geometric prominent to him- or herself. More recently,
models of similarity are Polk et al. (2002) found that when the fre-
Minimality: D(A,B) ≥ D(A,A ) = 0 quency of colors is experimentally manipu-
Symmetry: D(A,B) = D(B,A ) lated, rare colors are judged to be more sim-
ilar to common colors than common colors
Triangle Inequality: D(A,B) + D(B,C) ≥
are to rare colors.
D(A,C )
According to the triangle inequality as-
where D(A,B) is interpreted as the dissim- sumption (Figure 2.2), the distance/ dissim-
ilarity between items A and B. Accord- ilarity between two points A and B cannot
ing to the minimality assumption, all ob- be more than the distance between A and
jects are equally (dis)similar to themselves. a third point C plus the distance between C
Some violations of this assumption are found and B. Geometrically speaking, a straight line
(Nickerson, 1 972) when confusion rates or connecting two points is the shortest path
RT measures of similarity are used. First, between the points. Tversky and Gati (1 982)
not all letters are equally similar to them- found violations of this assumption when it
selves. For example, in Podgorny and Gar- is combined with an assumption of segmen-
ner (1 979), if the letter S is shown twice tal additivity [D(A,B) + D(B,C) = D(A,C),
on a screen, subjects are faster to correctly if A, B, and C lie on a straight line]. Con-
say that the two tokens are similar (i.e., they sider three items in multidimensional space,
come from the same similarity defined clus- A, B, and C, falling on a straight line such
ter) than if the twice-shown letter is W. By that B is between A and C. Also consider
the reaction time measure of similarity, the a fourth point, E, that forms a right trian-
letter S is more similar to itself than the let- gle when combined with A and C. The tri-
ter W is to itself. Even more troublesome angle inequality assumption cum segmental
for the minimality assumption, two differ- additivity predicts that
ent letters may be more similar to each other
than a particular letter is to itself. The letter D(A,E) ≥ D(A,B) and D(E,C) ≥ D(B,C)
C is more similar to the letter O than W is to or
itself, as measured by interletter confusions.
D(A,E) ≥ D(B,C) and D(E,C) ≥ D(A,B)
In Gilmore, Hersh, Caramazza, and Griffin
(1 979), the letter M is more often recognized Systematic violations of this prediction are
as an H ( p = .3 91 ) than as an M ( p = .1 80). found such that the path going through the
This is problematic for geometric represen- corner point E is shorter than the path going
tations because the distance between a point through the center point B. For example, if
and itself should be zero. the items are instantiated as
According to the symmetry assumption,
(dis)similarity should not be affected by the A= White, 3 inches
ordering of items because the distance from
B= Pink, 4 inches
point A to B is equal to the distance from
B to A. Contrary to this presumed sym- C= Red, 5 inches
metry, similarity is asymmetric on occasion E= Red, 3 inches
(Tversky, 1 977). In one of Tversky’s exam-
ples, North Korea is judged to be more simi- then people’s dissimilarity ratings indi-
lar to Red China than Red China is to North cate that D(A,E) < D(A,B) and D(E,C) <
Korea. Often, a nonprominent item is more D(B,C). Such an effect can be modeled by
similarity 19
6 yield the most satisfactory and interpretable

5 C solutions in low-dimensional space. MDS so-
lutions involving more than six dimensions
4 B are rare. On the third point, the addition of
A E the same feature to a pair of items increases
3
their rated similarity (Gati & Tversky,
size
2 1 984), but this is incompatible with sim-

ple MDS models. If adding a shared feature
1 corresponds to adding a dimension in which
the two items under consideration have the
0
same value, then there will be no change to
redness the items’ dissimilarity because the geomet-
Figure 2 .2 . The triangle inequality assumption ric distance between the points remains the
requires the path from A to C going through B to same. MDS models that incorporate the di-
be shorter than the path going through E. mensionality of the space could predict the
influence of shared features on similarity, but
such a model would no longer relate similar-
geometric models of similarity if r in Eq. 2.1 ity directly to an inverse function of inter-
is given a value less than 1 . However, if r is item distance.
less than 1 , then dissimilarity does not satisfy One research strategy has been to aug-
a power metric, which is often considered a ment geometric models of similarity in ways
minimal assumption for geometric solutions that solve these problems. One solution, sug-
to be interpretable. The two assumptions gested by Carol Krumhansl (1 978), has been
of a power metric are (1 ) distances along to model dissimilarity in terms of both inter-
straight lines are additive, and (2) the short- item distance in a multidimensional space
est path between points is a straight line. and spatial density in the neighborhoods of
Other potential problems with geometric the compared items. The more items there
models of similarity are (1 ) they strictly limit are in the vicinity of an item, the greater
the number of nearest neighbors an item the spatial density of the item. Items are
can have (Tversky & Hutchinson, 1 986), (2) more dissimilar if they have many items sur-
MDS techniques have difficulty describing rounding them (their spatial density is high)
items that vary on a large number of features than if they have few neighboring items. By
(Krumhansl, 1 978), and (3 ) standard MDS including spatial density in an MDS analy-
techniques do not predict that adding com- sis, violations of minimality, symmetry, and
mon features to items increases their sim- the triangle inequality can potentially be ac-
ilarity (Tversky & Gati, 1 982). On the first counted for, as well as some of the influence
point, MDS models consisting of two dimen- of context on similarity. However, the em-
sions cannot predict that item X is the clos- pirical validity of the spatial density hypoth-
est item to 1 00 other items. There would be esis is in some doubt (Corter, 1 987, 1 988;
no way of placing those 1 00 items in two Krumhansl, 1 988; Tversky & Gati, 1 982).
dimensions such that X would be closer to Robert Nosofsky (1 991 ) suggested an-
all of them than any other item. For human other potential way to save MDS models
data, a superordinate term (e.g., fruit) is of- from some of the previous criticisms. He in-
ten the nearest neighbor of many of its ex- troduces individual bias parameters in addi-
emplars (apples, bananas, etc.), as measured tion to the inter-item relation term. Sim-
by similarity ratings. On the second point, ilarity is modeled in terms of inter-item
although there is no logical reason why ge- distance and biases toward particular items.
ometric models cannot represent items of Biases toward items may be due to
any number of dimensions (as long as the attention, salience, knowledge, and fre-
number of dimensions is less than number of quency of items. This revision handles
items minus one), geometric models tend to asymmetric similarity results and the result
that a single item may be the most similar (A ∩ B) is hypothesized to receive more
item to many other items, but it does not di- weight in similarity than difference judg-
rectly address several of the other objections. ments; the distinctive features term receives
relatively more weight in difference judg-
the contrast model ments. As a result, certain pairs of stimuli
In light of the previous potential prob- may be perceived as simultaneously being
lems for geometric representations, Tversky more similar to and more different from each
(1 977) proposed to characterize similarity in other compared with other pairs (Tversky,
terms of a feature-matching process based 1 977). Sixty-seven percent of a group of sub-
on weighting common and distinctive fea- jects selected West Germany and East Ger-
tures. In this model, entities are represented many as more similar to each other than
as a collection of features and similarity is Ceylon and Nepal. Seventy percent of sub-
computed by jects also selected West Germany and East
Germany as more different from each other
S(A, B) than Ceylon and Nepal. According to Tver-
= θ f (A ∩ B) − a f (A − B) − b f (B − A) sky, East and West Germany have more
common and more distinctive features than
(2.2)
Ceylon and Nepal. Medin, Goldstone, and
The similarity of A to B is expressed as a Gentner (1 993 ) presented additional evi-
linear combination of the measure of the dence for nonmirroring between similarity
common and distinctive features. The term and difference, exemplified in Figure 2.3 .
(A ∩ B) represents the features that items A When two scenes share a relatively large
and B have in common. (A − B) represents number of relational commonalities (e.g.,
the features that A has but B does not. (B − scenes T and B both have three objects that
A) represents the features of B that are not in have the same pattern), but also a large num-
A. θ , a, and b are weights for the common ber of differences on specific attributes (e.g.,
and distinctive components. Common fea- none of the patterns in scene T match any
tures, as compared with distinctive features, of the patterns in B), then the scenes tend
are given relatively more weight for verbal as to be judged as simultaneously very similar
opposed to pictorial stimuli (Gati & Tversky, and very different.
1 984), cohesive as opposed to noncohesive A number of models are similar to
stimuli (Ritov, Gati, & Tversky, 1 990), sim- the contrast model in basing similarity
ilarity as opposed to difference judgments on features and in using some combina-
(Tversky, 1 977), and entities with a large tion of the (A ∩ B), (A − B), and (B − A)
number of distinctive as opposed to com- components. Sjoberg (1 972) proposed that
mon features (Gati & Tversky, 1 984). There similarity is defined as f (A ∩ B)/f (A ∪ B).
are no restrictions on what may constitute a Eisler and Ekman (1 95 9) claimed that
feature. A feature may be any property, char- similarity is proportional to f (A ∩ B)/
acteristic, or aspect of a stimulus. Features (f (A) + f(B)). Bush and Mosteller (1 95 1 )
may be concrete or abstract (i.e., “symmet- defined similarity as f (A ∩ B)/f (A). These
ric” or “beautiful”). three models can all be considered spe-
The contrast model predicts asymmetric cializations of the general equation f (A ∩
similarity because a is not constrained to B)/[f (A ∪ B) + af (A − B) + bf (B − A)]. As
equal b and f(A − B) may not equal f(B − such, they differ from the contrast model
A). North Korea is predicted to be more by applying a ratio function as opposed to
similar to Red China than vice versa if Red a linear contrast of common and distinc-
China has more salient distinctive features tive features.
than North Korea, and a is greater than b. The fundamental premise of the con-
The contrast model can also account for trast model, that entities can be described
nonmirroring between similarity and differ- in terms of constituent features, is a pow-
ence judgments. The common features term erful idea in cognitive psychology. Featural
similarity 21
A B
Figure 2 .3. The set of objects in B is selected as both more similar to, and more
different from, the set of objects in T relative to the set of objects in A. From
Medin, Goldstone, and Gentner (1 990). Reprinted by permission.
analyses have proliferated in domains of in neural networks normalize dissimilarities

speech perception (Jakobson, Fant, & Halle, by string length. Normalized Hamming dis-
1 963 ), pattern recognition (Neisser, 1 967; tance functions can be expressed by [(A −
Treisman, 1 986), perception physiology B) + (B − A)]/[ f (A ∩ B)].
(Hubel & Wiesel, 1 968), semantic content
(Katz & Fodor, 1 963 ), and categorization similarities between geometric
(Medin & Schaffer, 1 978; see Medin & Rips, and feature-based models
Chap. 3 ). Neural network representations Although MDS and featural models are of-
are often based on features, with entities be- ten analyzed in terms of their differences,
ing broken down into a vector of ones and they also share a number of similarities.
zeros, where each bit refers to a feature or More recent progress has been made on
“microfeature.” Similarity plays a crucial role combining both representations into a sin-
in many connectionist theories of generaliza- gle model, using Bayesian statistics to deter-
tion, concept formation, and learning. The mine whether a given source of variation is
notion of dissimilarity used in these systems more efficiently represented as a feature or
is typically the fairly simple function “Ham- dimension (Navarro & Lee, 2003 ). Tversky
ming distance.” The Hamming distance be- and Gati (1 982) described methods of trans-
tween two strings is simply their city- lating continuous dimensions into featural
block distance; that is, it is their (A − B) + representations. Dimensions that are sensi-
(B − A) term. “1 0 0 1 1 ” and “1 1 1 1 1 ” bly described as being more or less (e.g., loud
would have a Hamming distance of 2 be- is more sound than soft, bright is more light
cause they differ on two bits. Occasionally, than dim, and large is more size than small)
more sophisticated measures of similarity can be represented by sequences of nested
feature sets. That is, the features of B include Another commonality between geomet-
a subset of A’s features whenever B is louder, ric and featural representations, one that
brighter, or larger than A. Alternatively, for motivates the next major class of similar-
qualitative attributes such as shape or hue ity models that we consider, is that both
(red is not subjectively “more” than blue), use relatively unstructured representations.
dimensions can be represented by chains of Entities are structured as sets of features
features such that if B is between A and or dimensions with no relations between
C on the dimension, then (A ∩ B) ⊃ (A ∩ C) these attributes. Entities such as stories, sen-
and (B ∩ C) ⊃ (A ∩ C). For example, if or- tences, natural objects, words, scientific the-
ange lies between red and yellow on the hue ories, landscapes, and faces are not sim-
dimension, then this can be featurally repre- ply a “grab bag” of attributes. Two kinds
sented if orange and red share features that of structure seem particularly important:
orange and yellow do not share. propositional and hierarchical. A proposi-
An important attribute of MDS mod- tion is an assertion about the relation be-
els is that they create postulated representa- tween informational entities (Palmer, 1 975 ).
tions, namely dimensions, that explain the For example, relations in a visual domain
systematicities present in a set of similar- might include above, near, right, inside, and
ity data. This is a classic use of abductive larger than, which take informational enti-
reasoning; dimensional representations are ties as arguments. The informational enti-
hypothesized that, if they were to exist, ties might include features such as square
would give rise to the obtained similarity and values on dimensions such as 3 inches.
data. Other computational techniques share Propositions are defined as the smallest unit
with MDS the goal of discovering the un- of knowledge that can stand as a separate
derlying descriptions for items of interest assertion and have a truth value. The or-
but create featural rather than dimensional der of the arguments in the predicate is
representations. Hierarchical cluster analy- critical. For example, above (triangle, circle)
sis, such as MDS, takes pairwise proximity does not represent the same fact as above
data as input. Rather than output a geo- (circle, triangle). Hierarchical representations
metric space with objects as points, hierar- involve entities that are embedded in one
chical cluster analysis outputs an inverted- another. Hierarchical representations are re-
tree diagram with items at the root-level quired to represent the fact that X is part of
connected with branches. The smaller the Y or that X is a kind of Y. For example, in
branching distance between two items, the Collins and Quillian’s (1 969) propositional
more similar they are. Just as the dimen- networks, labeled links (“Is-a” links) stand for
sional axes of MDS solutions are given sub- the hierarchical relation between canary and
jective interpretations, the branches are also bird.
given interpretations. For example, in Shep- Some quick fixes to geometric and feat-
ard’s (1 972) analysis of speech sounds, one ural accounts of similarity are possible, but
branch is interpreted as voiced phonemes, they fall short of a truly general capacity to
whereas another branch contains the un- handle structured inputs. Hierarchical clus-
voiced phonemes. In additive cluster analysis tering does create trees of features, but there
(Shepard & Arabie, 1 979), similarity data are is no guarantee that there are relationships,
transformed into a set of overlapping item such as Is-a or Part-of, between the subtrees.
clusters. Items that are highly similar will However, structure might exist in terms of
tend to belong to the same clusters. Each features that represent conjunctions of prop-
cluster can be considered as a feature. More erties. For example, using the materials in
recent progress has been made on efficient Figure 2.4, 20 undergraduates were shown
and mathematically principled models that triads consisting of A, B, and T and were
find such featural representations for large asked to say whether scene A or B was more
databases (Lee, 2002a, 2002b; Tenenbaum, similar to T. The strong tendency to choose
1 996). A over B in the first panel suggests that
similarity 23
Figure 2 .4. The sets of objects T are typically judged to be more similar to the
objects in the A sets than the B sets. These judgments show that people pay
attention to more than just simple properties such as “black” or “square” when
comparing scenes.
the feature “square” influences similarity. However, there are two objects in T,
Other choices indicated that subjects also bringing the total number of features re-
based similarity judgments on the spatial lo- quired to at least two times the six features
cations and shadings of objects as well as required for one object. The number of fea-
their shapes. tures required increases still further if we
However, it is not sufficient to represent include feature triplets such as “left-black-
the leftmost object of T as {left, square, square.” In general, if there are O objects
black} and base similarity on the number of in a scene and each object has F features,
shared and distinctive features. In the second then there will be OF simple features. There
panel, A is again judged to be more simi- will be O conjunctive features that combine
lar to T than is B. Both A and B have the two simple features (i.e., pairwise conjunc-
features “black” and “square.” The only dif- tive features). If we limit ourselves to simple
ference is that for A and T, but not B, the and pairwise features to explain the pattern
“black” and “square” features belong to the of similarity judgments in Figure 2.3 , we still
same object. This is only compatible with will require OF(F + 1 )/2 features per scene,
feature set representations if we include the or OF(F + 1 ) features for two scenes that are
possibility of conjunctive features in addition compared with one another.
to simple features such as “black” and “square” Thus, featural approaches to similarity re-
(Gluck, 1 991 ; Hayes-Roth & Hayes-Roth, quire a fairly large number of features to rep-
1 977). By including the conjunctive feature resent scenes that are organized into parts.
“black-square,” possessed by both T and A, Similar problems exist for dimensional ac-
we can explain, using feature sets, why T is counts of similarity. The situation for these
more similar to A than B. The third panel models becomes much worse when we con-
demonstrates the need for a “black-left” fea- sider that similarity is also influenced by re-
ture, and other data indicate a need for a lations between features such as “black to
“square-left” feature. Altogether, if we want the left of white” and “square to the left
to explain the similarity judgments that peo- of white.” Considering only binary relations,
ple make, we need a feature set representa- there are O2 F2 R -OFR relations within a
tion that includes six features (three simple scene that contains O objects, F features
and three complex) to represent the square per object, and R different types of re-
of T. lations between features. Although more
sophisticated objections have been raised object in the top set. Subjects who had rated
about these approaches by Hummel and col- the similarity of the sets were more likely to
leagues (Holyoak & Hummel, 2000; Hum- choose the rightmost object – presumably
mel, 2000, 2001 ; Hummel & Biederman, because both objects were the smallest ob-
1 992; Hummel & Holyoak, 1 997, 2003 ; see jects in their sets. Subjects who did not first
Doumas & Hummel, Chap. 4), at the very assess similarity had a tendency to select
least, geometric and featural models appar- the middle object because its size exactly
ently require an implausibly large number of matched the target object’s size. These re-
attributes to account for the similarity rela- sults are predicted if similarity judgments
tions between structured, multipart scenes. naturally entail aligning the elements of
two scenes. Additional research has found
that relational choices such as “smallest ob-
Alignment-Based Models
ject in its set” tend to influence similar-
Partly in response to the difficulties that the ity judgments more than absolute attributes
previous models have in dealing with struc- like “3 inches” when the overall amount
tured descriptions, a number of researchers of relational coherency across sets is high
have developed alignment-based models of (Goldstone, Medin, & Gentner, 1 991 ), the
similarity. In these models, comparison is scenes are superficially sparse rather than
not just matching features but determin- rich (Gentner & Rattermann, 1 991 ; Mark-
ing how elements correspond to, or align man & Gentner, 1 993 a), subjects are given
with, one another. Matching features are more time to make their judgments (Gold-
aligned to the extent that they play simi- stone & Medin, 1 994), the judges are adults
lar roles within their entities. For example, rather than children (Gentner & Toupin,
a car with a green wheel and a truck with 1 986), and abstract relations are initially cor-
a green hood both share the feature green, related with concrete relations (Kotovsky &
but this matching feature may not increase Gentner, 1 996).
their similarity much because the car’s wheel Formal models of alignment-based simi-
does not correspond to the truck’s hood. larity have been developed to explain how
Drawing inspiration from work on analog- feature matches that belong to well-aligned
ical reasoning (Gentner, 1 983 ; Holyoak & elements matter more for similarity than
Thagard, 1 995 ; see Holyoak, Chap. 6), in matches between poorly aligned elements
alignment-based models, matching features (Goldstone, 1 994a; Love, 2000). Inspired
influence similarity more if they belong to by work in analogical reasoning (Holyoak &
parts that are placed in correspondence, and Thagard, 1 989), Goldstone’s (1 994a) SIAM
parts tend to be placed in correspondence if model is a neural network with nodes that
they have many features in common and are represent hypotheses that elements across
consistent with other emerging correspon- two scenes correspond to one another. SIAM
dences (Goldstone, 1 994a; Markman & Gen- works by first creating correspondences be-
tner, 1 993 a). Alignment-based models make tween the features of scenes. Once features
purely relational similarity possible (Falken- begin to be placed into correspondence,
hainer, Forbus, & Gentner, 1 989). SIAM begins to place objects into corre-
Initial evidence that similarity involves spondence that are consistent with the fea-
aligning scene descriptions comes from ture correspondences. Once objects begin to
Markman and Gentner’s (1 993 a) result that be put in correspondence, activation is fed
when subjects are asked to determine corre- back down to the feature (mis)matches that
sponding objects, they tend to make more are consistent with the object alignments. In
structurally sound choices when they have this way, object correspondences influence
first judged the similarity of the scenes that activation of feature correspondences at the
contain the objects. For example, in Figure same time that feature correspondences in-
2.5 , subjects could be asked which object in fluence the activation of object correspon-
the bottom set corresponds to the leftmost dences. Activation between nodes spreads
similarity 25
with the clarity of the alignments (Gold-

stone, 1 994a); and (4) under some circum-
stances, adding a poorly aligned feature
match can actually decrease similarity by in-
terfering with the development of proper
alignments (Goldstone, 1 996).
Target Another empirically validated set of pre-
dictions stemming from an alignment-based
approach to similarity concerns alignable
and nonalignable differences (Markman &
Gentner, 1 993 b). Nonalignable differences
between two entities are attributes of one
Size match entity that have no corresponding attribute
Relation match in the other entity. Alignable differences
Figure 2 .5. The target from the gray circles are differences that require the elements
could match either the middle black object of the entities first be placed in correspon-
because they are the same size, or the rightmost dence. When comparing a police car with an
object because both objects are the smallest ambulance, a nonalignable difference is that
objects in their sets. police cars have weapons in them, but am-
bulances do not. There is no clear equivalent
in SIAM by two principles: (1 ) nodes that of weapons in the ambulance. Alignable dif-
are consistent send excitatory activation to ferences include the following: police cars
each other, and (2) nodes that are inconsis- carry criminals to jails rather than carrying
tent inhibit each another (see also Holyoak, sick people to hospitals, a police car is a
Chap. 6). Nodes are inconsistent if they cre- car whereas ambulances are vans, and police
ate two-to-one alignments – if two elements car drivers are policemen rather than emer-
from one scene would be placed into cor- gency medical technicians. Consistent with
respondence with one element of the other the role of structural alignment in similar-
scene. Node activations affect similarity via ity comparisons, alignable differences influ-
the equation ence similarity more than nonalignable dif-
n ∗ ferences (Markman & Gentner, 1 996) and
(match valuei Ai )
similarity = i=1 n , are more likely to be encoded in memory
i=1 Ai (Markman & Gentner, 1 997). Alignable dif-
(2.3 ) ferences between objects also play a dispro-
where n is the number of nodes in the sys- portionately large role in distinguishing be-
tem, Ai is the activation of node i, and the tween different basic-level categories (e.g.,
match value describes the physical similar- cats and dogs) that belong to the same super-
ity between the two features placed in cor- ordinate category (e.g., animals) (Markman
respondence according to the node i. & Wisniewski, 1 997). In short, knowing
By this equation, the influence of a partic- these correspondences affects not only how
ular matching or mismatching feature across much a matching element increases similar-
two scenes is modulated by the degree to ity (Goldstone, 1 994a), but also how much
which the features have been placed in align- a mismatching element decreases similarity.
ment. Consistent with SIAM, (1 ) aligned Thus far, much of the evidence for struc-
feature matches tend to increase similar- tural alignment in similarity has used some-
ity more than unaligned feature matches what artificial materials. Often, the systems
(Goldstone, 1 994a); (2) the differential in- describe how “scenes” are compared, with
fluence between aligned and unaligned fea- the underlying implication that the elements
ture matches increases as a function of pro- comprising the scenes are not as tightly con-
cessing time (Goldstone & Medin, 1 994); nected as elements comprising objects. Still,
(3 ) this same differential influence increases if the structural alignment account proves
to be fertile, it will be because it is applica- ify what transformational operations are

ble to naturally occurring materials. Toward possible.
this goal, researchers have considered struc- In an early incarnation of a transforma-
tural accounts of similarity in language do- tional approach to cognition broadly con-
mains. The confusability of words depends strued, Garner (1 974) stressed the notion of
on structural analyses to predict that “stop” stimuli that are transformationally equiva-
is more confusable with “step” than “pest” lent and are consequently possible alterna-
(the “st” match is in the correct location with tives for each other. In artificial intelligence,
“step” but not “pest”), but more confusable Shimon Ullman (1 996) argued that objects
with “pest” than “best” (the “p” match counts are recognized by being aligned with mem-
for something even when it is out of place). orized pictorial descriptions. Once an un-
Substantial success has been made on the known object has been aligned with all can-
practical problem of determining the struc- didate models, the best match to the viewed
tural similarity of words (Bernstein, Demor- object is selected. The alignment operations
est, & Eberhardt, 1 994; Frisch, Broe, & Pier- rotate, scale, translate, and topographically
rehumbert, 1 995 ). Structural alignment has warp object descriptions. For rigid trans-
also been implicated when comparing more formations, full alignment can be obtained
complex language structures such as sen- by aligning three points on the object with
tences (Bassok & Medin, 1 997). Likewise, three points on the model description. Un-
structural similarity has proven to be a use- like recognition strategies that require struc-
ful notion in explaining consumer prefer- tural descriptions (e.g., Biederman, 1 987;
ences of commercial products, explaining, Hummel, 2000, 2001 ), Ullman’s alignment
for example, why new products are viewed does not require an image to be decomposed
more favorably when they improve over ex- into parts.
isting products along alignable rather than In transformational accounts that are ex-
unalignable differences (Zhang & Markman, plicitly designed to model similarity data,
1 998). Additional research has shown that similarity is usually defined in terms of trans-
alignment-based models of similarity pro- formational distance. In Wiener-Ehrlich,
vide a better account of category-based Bart, and Millward’s (1 980) generative rep-
induction than feature-based models (Las- resentation system, subjects are assumed to
saline, 1 996). Still other researchers have ap- possess an elementary set of transformations
plied structural accounts of similarity to the and invoke these transformations when ana-
legal domain (Hahn & Chater, 1 998; Simon lyzing stimuli. Their subjects saw linear pairs
& Holyoak, 2002). This area of application of stimuli such as {ABCD, DABC} or two-
AB DA
is promising because the U.S. legal system dimensional stimuli such as { CD , BC }. Sub-
is based on cases and precedents, and cases jects were required to rate the similarity of
are structurally rich and complex situations the pairs. The researchers determined trans-
involving many interrelated parties. Retriev- formations that accounted for each subject’s
ing a historic precedent and assessing its rel- ratings from the set {rotate 90 degrees, ro-
evance to a current case almost certainly in- tate 1 80, rotate 270, horizontal reflection,
volves aligning representations that are more vertical reflection, positive diagonal reflec-
sophisticated than assumed by geometric or tion, negative diagonal reflection}. Similar-
featural models. ity was assumed to decrease monotonically
as the number of transformations required
to make one sequence identical to the other
Transformational Models
increased.
A final historic approach to similarity that Imai (1 977) made a similar claim.
has been more recently resuscitated is that The stimuli used were sequences such as
the comparison process proceeds by trans- XXOXXXOXXXOX, where Xs represent
forming one representation into the other. white ovals and Os represent black ovals.
A critical step for these models is to spec- The four basic transformations were mirror
similarity 27
image (XXXXXOO → OOXXXXX), stated in the structural alignment method,

phase shift (XXXXXOO → XXXXOOX), they are implicit in transformational align-
reversal (XXXXXOO → OOOOOXX), ment. The transformational account often
and wavelength (XXOOXXOO → XOX- does produce globally consistent correspon-
OXOXO). The researcher found that se- dences – for example, correspondences that
quences that are two transformations re- obey the one-to-one mapping principle;
moved (e.g., XXXOXXXOXXXO and however, this consistency is a consequent of
OOXOOOXOOOXO require a phase shift applying a patternwide transformation and is
and a reversal to be equated) are rated to not enforced by interactions between emerg-
be less similar than sequences that can be ing correspondences. It is revealing that
made identical with one transformation. In transformational accounts have been applied
addition, sequences that can be made identi- almost exclusively to perceptual stimuli,
cal by more than one transformation (XOX- whereas structural accounts are most often
OXOXO and OXOXOXOX can be made applied to conceptual stimuli such as sto-
identical by mirror image, phase shift, or ries, proverbs, and scientific theories (there
reversal transformations) are more similar are also notable structural accounts in per-
than sequences that have only one identity- ception, i.e., Biederman, 1 987; Hummel,
producing transformation. 2000; Hummel & Biederman, 1 992; Marr &
More recent work has followed up on Nishihara, 1 978). Defining a set of con-
Imai’s research and generalized it to stimulus strained transformations is much more ten-
materials, including arrangements of Lego able for perceptual stimuli. The conceptual
bricks, geometric complexes, and sets of col- similarity between an atom and the solar
ored circles (Hahn, Chater, & Richardson, system could possibly be discovered by
2003 ). According to these researchers’ ac- transformations. As a start, a miniaturization
count, the similarity between two entities transformation could be applied to the so-
is a function of the complexity required to lar system. However, this single transforma-
transform the representation of one into the tion is not nearly sufficient; a nucleus is not
representation of the other. The simpler the simply a small sun. The transformations that
transformation, the more similar they are as- would turn the solar system into an atom are
sumed to be. The complexity of a transfor- not readily forthcoming. If we allow transfor-
mation is determined in accord with Kol- mations such as an “earth-becomes-electron”
mogorov complexity theory (Li & Vitanyi, transformation, then we are simply reex-
1 997), according to which the complexity of pressing the structural alignment approach
a representation is the length of the short- and its part-by-part alignment of relations
est computer program that can generate and objects.
that representation. For example, the condi- Some similarity phenomena that are well
tional Kolmogorov complexity between the explained by structural alignment are not
sequence 1 2 3 4 5 6 7 8 and 2 3 4 5 6 7 easily handled by transformations. To ac-
8 9 is small because the simple instructions count for the similarity of “BCDCB” and
add 1 to each digit and subtract 1 from each “ABCDCBA” we could introduce the fairly
digit suffice to transform one into the other. abstract transformation “add the leftmost
Experiments by Hahn et al. (2003 ) demon- letter’s predecessor to both sides of string.”
strate that once reasonable vocabularies of However, the pair “LMN” and “KLMNK” do
transformation are postulated, transforma- not seem as similar as the earlier pair, even
tional complexity does indeed predict sub- though the same transformation is applied.
jective similarity ratings. A transformation of the form “if the struc-
It is useful to compare and con- ture is symmetric, then add the preceding
trast alignment-based and transformational element in the series to both ends of the
accounts of similarity. Both approaches string” presupposes exactly the kind of anal-
place scene elements into correspondence. ysis in defining “symmetric” and “preceding”
Whereas the correspondences are explicitly that are the bread and butter of propositional
representations and structural alignment. Murphy and Medin argued that the cate-
For this reason, one fertile research di- gorization of the man’s behavior does not
rection would be to combine alignment- depend on matching the man’s features to
based accounts’ focus on representing the the category drunk’s features. It is highly un-
internal structure within individual scenes likely that the category drunk would have
with the constraints that transformational such a specific feature as “jumps into pools
accounts provide for establishing psycholog- fully clothed.” It is not the similarity be-
ically plausible transformations (Hofstadter, tween the instance and the category that de-
1 997; Mitchell, 1 993 ). termines the instance’s classification; it is the
fact that our category provides a theory that
explains the behavior.
Conclusions and Further Directions Developmental psychologists have ar-
gued that even young children have inchoate
theories that allow them to go beyond su-
To provide a partial balance to our largely
perficial similarities in creating categories
historic focus on similarity, we conclude by
(Carey, 1 985 ; Gelman & Markman, 1 986;
raising some unanswered questions for the
Keil, 1 989). For example, Carey (1 985 ) ob-
field. These questions are rooted in a desire
served that children choose a toy monkey
to connect the study of similarity to cogni-
over a worm as being more similar to a hu-
tion as a whole.
man, but that when they are told that hu-
mans have spleens are more likely to in-
Is Similarity Flexible Enough to Provide
fer that the worm has a spleen than that
Useful Explanations of Cognition?
the toy monkey does. Thus, the categoriza-
The study of similarity is typically justi- tion of objects into “spleen” and “no spleen”
fied by the argument that so many theo- groups does not appear to depend on the
ries in cognition depend on similarity as a same knowledge that guides similarity judg-
theoretical construct. An account of what ments. Adults show similar dissociations be-
makes problems, memories, objects, and tween similarity and categorization. In an
words similar to one another often provides experiment by Rips (1 989), an animal that
the backbone for our theories of problem is transformed (by toxic waste) from a bird
solving, attention, perception, and cogni- into something that looks like an insect is
tion. As William James put it, “This sense judged by subjects to be more similar to an
of Sameness is the very keel and backbone insect but is still judged to be a bird. Again,
of our thinking” (James, 1 890/1 95 0, p. 45 9). the category judgment seems to depend on
However, others have argued that simi- biological, genetic, and historic knowledge,
larity is not flexible enough to provide a suf- whereas the similarity judgments seems to
ficient account, although it may be a nec- depend more on gross visual appearance (see
essary component. There have been many also Keil, 1 989; Rips & Collins, 1 993 ).
empirical demonstrations of apparent disso- Despite the growing body of evidence
ciations between similarity and other cog- that similarity appraisals do not always track
nitive processes, most notably categoriza- categorization decisions, there are still some
tion. Researchers have argued that cognition reasons to be sanguine about the contin-
is frequently based on theories (Murphy & ued explanatory relevance of similarity. Cat-
Medin, 1 985 ), rules (Sloman, 1 996; Smith & egorization itself may not be completely
Sloman, 1 994), or strategies that go beyond flexible. People are influenced by similarity
“mere” similarity. To take an example from despite the subjects’ intentions and the ex-
Murphy and Medin (1 985 ), consider a man perimenters’ instructions (Smith & Sloman,
jumping into a swimming pool fully clothed. 1 994). Allen and Brooks (1 991 ) gave sub-
This man may be categorized as drunk be- jects an easy rule for categorizing cartoon
cause we have a theory of behavior and animals into two groups. Subjects were then
inebriation that explains the man’s action. transferred to the animals that looked very
similarity 29
similar to one of the training stimuli but be- that the properties used to assess the simi-
longed in a different category. These animals larity of objects are determined, in part, by
were categorized more slowly and less accu- the compared objects themselves.
rately than animals that were equally similar Similarity judgments not only depend on
to an old animal but also belonged in the the context established by recently exposed
same category as the old animal. Likewise, items, simultaneously presented items, and
Palmeri (1 997) showed that even for the inferred contrast sets, but also on the ob-
simple task of counting the number of dots, server. Suzuki, Ohnishi, and Shigemasu
subjects’ performances are improved when (1 992) showed that similarity judgments de-
a pattern of dots is similar to a previously pend on level of expertise and goals. Expert
seen pattern with the same numerosity and and novice subjects were asked to solve the
worse when the pattern is similar to a previ- Tower of Hanoi puzzle and judge the sim-
ously seen pattern with different numeros- ilarity between the goal and various states.
ity. People seem to have difficulty ignoring Experts’ similarity ratings were based on
similarities between old and new patterns the number of moves required to trans-
even when they know a straightforward and form one position to the other. Less expert
perfectly accurate categorization rule. subjects tended to base their judgments on
There may be a mandatory considera- the number of shared superficial features.
tion of similarity in many categorization Similarly, Hardiman, Dufresne, and Mestre
judgments (Goldstone, 1 994b), adding con- (1 989) found that expert and novice physi-
straints to categorization. At the same time, cists evaluate the similarity of physics prob-
similarity may be more flexible and sophisti- lems differently, with experts basing simi-
cated than commonly acknowledged (Jones larity judgments more on general principles
& Smith, 1 993 ) and this may also serve to of physics than on superficial features (see
bridge the gap between similarity and high- Sjoberg, 1 972, for other expert/novice dif-
level cognition. Krumhansl (1 978) argued ferences in similarity ratings). The depen-
that similarity between objects decreases dency of similarity on observer-, task-, and
when they are surrounded by many close stimulus-defined contexts offers the promise
neighbors that were also presented on pre- that it is indeed flexible enough to subserve
vious trials (also see Wedell, 1 994). Tversky cognition.
(1 977) obtained evidence for an extension
effect according to which features influence
Is Similarity Too Flexible to Provide
similarity judgments more when they vary
Useful Explanations of Cognition?
within an entire set of stimuli. Items pre-
sented within a particular trial also influence As a response to the skeptic of similarity’s
similarity judgments. Perhaps the most fa- usefulness, the preceding two paragraphs
mous example of this is Tversky’s (1 977) could have the exact opposite of their in-
diagnosticity effect according to which fea- tended effect. The skeptic might now be-
tures that are diagnostic for relevant clas- lieve that similarity is much too flexible to be
sifications will have disproportionate influ- a stable ground for cognition. In fact, Nelson
ence on similarity judgments. More recently, Goodman (1 972) put forth exactly this
Medin, Goldstone, and Gentner (1 993 ) ar- claim, maintaining that the notion of similar-
gued that different comparison standards are ity is either vague or unnecessary. He argued
created, depending on the items that are that “when to the statement that two things
present on a particular trial. Other research are similar we add a specification of the
has documented intransitivities in similarity property that they have in common . . . we
judgments situations in which A is judged render it [the similarity statement] superflu-
to be more similar to T than is B, B is more ous” (p. 445 ). That is, all the potential ex-
similar to T than is C, and C is more similar planatory work is done by the “with respect
to T than is A (Goldstone, Medin, & Halber- to property Z” clause and not by the similar-
stadt, 1 997). This kind of result also suggests ity statement. Instead of saying “this object
belongs to category A because it is similar to grating multiple sources of information into

A items with respect to the property ‘red’,” a single assessment of similarity becomes
we can simplify matters by removing any no- particularly important. The four approaches
tion of similarity with “this object belongs to to similarity described in the previous sec-
category A because it is red.” tion provide methods for integrating multi-
There are reasons to resist Goodman’s ple properties into a single similarity judg-
conclusion that “similarity tends under anal- ment and, as such, go significantly beyond
ysis either to vanish entirely or to require simply determining a single “property Z” to
for its explanation just what it purports to attend.
explain” (p. 446). In most cases, similarity A final point to make about the poten-
is useful precisely because we cannot flesh tial overflexibility of similarity is that, al-
out the “respect to property Z” clause with though impressions of similarity can change
just a single property. Evidence suggests that with context and experience, automatic and
assessments of overall similarity are natural “generic” assessments of similarity typically
and perhaps even “primitive.” Evidence from change slowly and with considerable iner-
children’s perception of similarity suggests tia. Similarities that were once effortful and
that children are particularly likely to judge strategic become second nature to the organ-
similarity on the basis of many integrated ism. Roughly speaking, this is the process of
properties rather than analysis into dimen- perceiving what was once a conceptual similar-
sions. Even dimensions that are perceptu- ity. At first, the novice mycologist explicitly
ally separable are treated as fused in sim- uses rules for perceiving the dissimilarity be-
ilarity judgments (Smith & Kemler, 1 978). tween the pleasing Agaricus Bisporus mush-
Children younger than 5 years of age tend room and the deadly Amanita Phalloides.
to classify on the basis of overall similar- With time, this dissimilarity ceases to be ef-
ity and not on the basis of a single criterial fortful and rule based and becomes percep-
attribute (Keil, 1 989; Smith, 1 989). Chil- tual and phenomenologically direct. When
dren often have great difficulty identifying this occurs, the similarity becomes generic
the dimension along which two objects vary, and default and can be used as the ground
even though they can easily identify that the for new strategic similarities. In this way, our
objects are different in some way (Kemler, cognitive abilities gradually attain sophisti-
1 983 ). Smith (1 989) argued that it is rel- cation by treating territory as level ground
atively difficult for young children to say that once made for difficult mental climbing.
whether two objects are identical on a par- A corollary of this contention is that our de-
ticular property but relatively easy for them fault impression of similarity does not typi-
to say whether they are similar across many cally mislead us; it is explicitly designed to
dimensions. lead us to see relations between things that
There is also evidence that adults of- often function similarly in our world. Peo-
ten have an overall impression of similar- ple, with good reason, expect their default
ity without analysis into specific properties. similarity assessments to provide good clues
Ward (1 983 ) found that adult subjects who about where to uncover directed, nonappar-
tended to group objects quickly also tended ent similarities (Medin & Ortony, 1 989).
to group objects like children by consider-
ing overall similarity across all dimensions
Should “Similarity” Even Be a Field
instead of maximal similarity on one dimen-
of Study Within Cognitive Science?
sion. Likewise, Smith and Kemler (1 984)
found that adults who were given a distract- This survey has proceeded under the conve-
ing task produced more judgments by over- nient fiction that it is possible to tell a gen-
all similarity than subjects who were not. To eral story for how people compare things.
the extent that similarity is determined by One reason to doubt this is that the meth-
many properties, it is less subject to drastic ods used for assessing similarity have large
context-driven changes. Furthermore, inte- effects on the resulting similarity viewed.
similarity 31
Similarity as measured by ratings is not coherency of similarity. An alternative per-

equivalent to similarity as measured by spective would use these task differences as
perceptual discriminability. Although these an illuminating source of information in de-
measures correlate highly, systematic differ- veloping a unified account. The systematic
ences are found (Podgorny & Garner, 1 979; nature of these task differences should stim-
Sergent & Takane, 1 987). For example, Beck ulate accounts that include a formal descrip-
(1 966) found that an upright T is rated tion not only of stimulus components but
as more similar to a tilted T than an up- also of task components. Future success in
right L but that it is also more likely to be understanding the task of comparison may
perceptually grouped with the upright Ls. depend on comparing tasks.
Previously reviewed experiments indicate
the nonequivalence of assessments that use
similarity versus dissimilarity ratings, cat-
egorization versus forced-choice similarity Acknowledgments
judgments, or speeded versus leisurely judg-
ments. In everyday discourse we talk about This research was funded by NIH grant
the similarity of two things, forgetting that MH5 6871 and NSF grant 01 25 287. Corre-
this assessment depends on a particular task spondence concerning this chapter should
and circumstance. be addressed to rgoldsto@indiana.edu or
Furthermore, it may turn out that the cal- Robert Goldstone, Psychology Department,
culation of similarity is fundamentally differ- Indiana University, Bloomington, Indiana
ent for different domains (see Medin, Lynch, 47405 . Further information about the labo-
& Solomon, 2000, for a thoughtful discus- ratory can be found at http://cognitrn.psych.
sion of this issue). To know how to calculate indiana.edu
the similarity of two faces, one would need
to study faces specifically and the eventual
account need not inform researchers inter-
ested in the similarity of words, works of References
music, or trees. A possible conclusion is that
similarity is not a coherent notion at all. The Aha, D. W. (1 992). Tolerating noisy, irrelevant
term similarity, similar to the bug or family and novel attributes in instance-based learning
values, may not pick out a consolidated or algorithms. International Journal of Man Ma-
principled set of things. chine Studies, 3 6, 267–287.
Although we sympathize with the im- Allen, S. W., & Brooks, L. R. (1 991 ). Specializ-
pulse toward domain-specific accounts of ing the operation of an explicit rule. Journal of
similarity, we also believe in the value of Experimental Psychology: General, 1 2 0, 3 –1 9.
studying general principles of comparison Attneave, F. (1 95 0). Dimensions of similar-
that potentially underlie many domains. Al- ity. American Journal of Psychology, 63 , 5 1 6–
though we do not know whether general 5 5 6.
principles exist, one justification for pursu- Bassok, M., & Medin, D. L. (1 997). Birds of
ing them is the large payoff that would re- a feather flock together: Similarity judgments
sult from discovering these principles if they with semantically rich stimuli. Journal of Mem-
ory and Language, 3 6, 3 1 1 –3 3 6.
do exist. A historically fruitful strategy, ex-
emplified by Einstein’s search for a law to Beck, J. (1 966). Effect of orientation and of shape
similarity on perceptual grouping. Perception
unify gravitational and electromagnetic ac-
and Psychophysics, 1 , 3 00–3 02.
celeration and Darwin’s search for a uni-
Bernstein, L. E., Demorest, M. E., & Eberhardt,
fied law to understand the origins of humans
S. P. (1 994). A computational approach to ana-
and other animals, has been to understand lyzing sentential speech perception: Phoneme-
differences as parametric variations within to-phoneme stimulus/response alignment.
a single model. Finding differences across Journal of the Acoustical Society of America, 95 ,
tasks does not necessarily point to the in- 3 61 7–3 622.
Biederman, I. (1 987). Recognition-by-compo- Gati, I., & Tversky, A. (1 984). Weighting com-
nents: A theory of human image understand- mon and distinctive features in perceptual and
ing. Psychological Review, 94, 1 1 5 –1 47. conceptual judgments. Cognitive Psychology,
Bush, R. R., & Mosteller, F. (1 95 1 ). A model 1 6, 3 41 –3 70.
for stimulus generalization and discrimination. Gelman, S. A., & Markman, E. M. (1 986). Cate-
Psychological Review, 5 8, 41 3 –423 . gories and induction in young children. Cogni-
Carey, S. (1 985 ). Conceptual change in childhood. tion, 2 3 , 1 83 –209.
Cambridge, MA: Bradford Books. Gentner, D. (1 983 ). Structure-mapping: A the-
Carroll, J. D., & Wish, M. (1 974). Models and oretical framework for analogy. Cognitive Sci-
methods for three-way multidimensional scal- ence, 7, 1 5 5 –1 70.
ing. In D. H. Krantz, R. C. Atkinson, R. D. Luce, Gentner, D., & Rattermann, M. J. (1 991 ). Lan-
& P. Suppes (Eds.), Contemporary developments guage and the career of similarity. In S. A.
in mathematical psychology (Vol. 2, pp. 5 7–1 05 ). Gelman & J. P. Byrnes (Eds.), Perspectives on
San Francisco: Freeman. language and thought interrelations in develop-
Chi, M. T. H., Feltovich, P., & Glaser, R. (1 981 ). ment (pp. 225 –277). Cambridge: Cambridge
Categorization and representation of physics University Press.
problems by experts and novices. Cognitive Sci- Gentner, D., & Toupin, C. (1 986). Systematicity
ence, 5 , 1 21 –1 5 2. and surface similarity in the development of
Collins, A. M., & Quillian, M. R. (1 969). Re- analogy. Cognitive Science, 1 0(3 ), 277–3 00.
trieval time from semantic memory. Journal of Gilmore, G. C., Hersh, H., Caramazza, A., &
Verbal Learning and Verbal Behavior, 8, 240– Griffin, J. (1 979). Multidimensional letter sim-
247. ilarity derived from recognition errors. Percep-
Corter, J. E. (1 987). Similarity, confusability, and tion and Psychophysics, 2 5 , 425 –43 1 .
the density hypothesis. Journal of Experimental Gluck, M. A. (1 991 ). Stimulus generalization and
Psychology: General, 1 1 6, 23 8–249. representation in adaptive network models of
Corter, J. E. (1 988). Testing the density category learning. Psychological Science, 2 , 5 0–
hypothesis: Reply to Krumhansl. Journal of 55.
Experimental Psychology: General, 1 1 7, 1 05 – Gluck, M. A., & Bower, G. H. (1 990). Compo-
1 06. nent and pattern information in adaptive net-
Edelman, S. (1 999). Representation and recognition works. Journal of Experimental Psychology: Gen-
in vision. Cambridge, MA: MIT Press. eral, 1 1 9, 1 05 –1 09.
Eisler, H., & Ekman, G. (1 95 9). A mechanism Goldstone, R. L. (1 994a). Similarity, interactive
of subjective similarity. Acta Psychologica, 1 6, activation, and mapping. Journal of Experimen-
1 –1 0. tal Psychology: Learning, Memory, and Cogni-
Estes, W. K. (1 994). Classification and cognition. tion, 2 0, 3 –28.
New York: Oxford University Press. Goldstone, R. L. (1 994b). The role of similarity in
Falkenhainer, B., Forbus, K. D., & Gentner, D. categorization: Providing a groundwork. Cogni-
(1 989). The structure-mapping engine: Algo- tion, 5 2 , 1 25 –1 5 7.
rithm and examples. Artificial Intelligence, 41 , Goldstone, R. L. (1 996). Alignment-based non-
1 –63 . monotonicities in similarity. Journal of Experi-
Fodor, J. A. (1 983 ). The modularity of mind. mental Psychology: Learning, Memory, and Cog-
Cambridge, MA: MIT Press/Bradford Books. nition, 2 2 , 988–1 001 .
Frisch, S. A., Broe, M. B., & Pierrehumbert, J. B. Goldstone, R. L., & Medin, D. L. (1 994). The
(1 995 ). The role of similarity in phonology: time course of comparison. Journal of Experi-
Explaining OCP-Place. In K. Elenius & P. mental Psychology: Learning, Memory, and Cog-
Branderud (Eds.), Proceedings of the 1 3 th inter- nition, 2 0, 29–5 0.
national conference of the phonetic sciences, 3 , Goldstone, R. L., Medin, D. L., & Gentner, D.
5 44–5 47. (1 991 ). Relational similarity and the noninde-
Gardenfors, P. (2000). Conceptual spaces: The pendence of features in similarity judgments.
geometry of thought. Cambridge, MA: MIT Cognitive Psychology, 222–263 .
Press. Goldstone, R. L., Medin, D. L., & Halberstadt,
Garner, W. R. (1 974). The processing of information J. (1 997). Similarity in context. Memory and
and structure. New York: Wiley. Cognition, 2 5 , 23 7–25 5 .
similarity 33
Goodman, N. (1 972). Seven strictures on similar- Horgan, D. D., Millis, K., & Neimeyer, R. A.
ity. In N. Goodman (Ed.), Problems and projects (1 989). Cognitive reorganization and the de-
(pp. 43 7–446). New York: The Bobbs-Merrill velopment of chess expertise. International
Co. Journal of Personal Construct Psychology, 2 , 1 5 –
Hahn, U. (2003 ). Similarity. In L. Nadel (Ed.), 3 6.
Encyclopedia of cognitive science. London: Hubel, D. H., & Wiesel (1 968). Receptive fields
Macmillan. and functional architecture of monkey stri-
Hahn, U., & Chater, N. (1 998). Understanding ate cortex. Journal of Physiology, 1 95 , 21 5 –
similarity: A joint project for psychology, case- 243 .
based reasoning and law. Artificial Intelligence Hummel, J. E. (2000). Where view-based theo-
Review, 1 2 , 3 93 –427. ries break down: The role of structure in shape
Hahn, U., Chater, N., & Richardson, L. B. (2003 ). perception and object recognition. In E. Diet-
Similarity as transformation. Cognition, 87, 1 – rich & A. Markman (Eds.), Cognitive dynamics:
3 2. Conceptual change in humans and machines
Hardiman, P. T., Dufresne, R., & Mestre, J. P. (pp. 1 5 7–1 85 ). Hillsdale, NJ: Erlbaum.
(1 989). The relation between problem cate- Hummel, J. E. (2001 ). Complementary solutions
gorization and problem solving among experts to the binding problem in vision: Implications
and novices. Memory and Cognition, 1 7, 627– for shape perception and object recognition.
63 8. Visual Cognition, 8, 489–5 1 7.
Hayes-Roth, B., & Hayes-Roth, F. (1 977). Con- Hummel, J. E., & Biederman, I. (1 992). Dynamic
cept learning and the recognition and classifi- binding in a neural network for shape recogni-
cation of exemplars. Journal of Verbal Learning tion. Psychological Review, 99, 480–5 1 7.
and Verbal Behavior, 1 6, 3 21 –3 3 8. Hummel, J. E., & Holyoak, K. J. (1 997). Dis-
Heit, E., & Rubinstein, J. (1 994). Similarity and tributed representations of structure: A theory
property effects in inductive reasoning. Journal of analogical access and mapping. Psychological
of Experimental Psychology: Learning, Memory, Review, 1 04, 427–466.
and Cognition, 2 0, 41 1 –422. Hummel, J. E., & Holyoak, K. J. (2003 ). A
Hofstadter, D. (1 997). Fluid concepts and creative symbolic-connectionist theory of relational in-
analogies: Computer models of the fundamen- ference and generalization. Psychological Re-
tal mechanisms of thought. New York: Basic view, 1 1 0, 220–263 .
Books. Imai, S. (1 977). Pattern similarity and cognitive
Holland, J. H., Holyoak, K. J., Nisbett, R. E., & transformations. Acta Psychologica, 41 , 43 3 –
Thagard, P. R. (1 986). Induction: Processes of 447.
inference, learning, and discovery. Cambridge, James, W. (1 890/1 95 0). The principles of psychol-
MA: Bradford Books/MIT Press. ogy. New York: Dover. (Original work pub-
Holyoak, K. J., & Gordon, P. C. (1 983 ). Social lished 1 890).
reference points. Journal of Personality & Social Jakobson, R., Fant, G., & Halle, M. (1 963 ). Pre-
Psychology, 44, 881 –887. liminaries to speech analysis: The distinctive fea-
Holyoak, K. J., & Hummel, J. E. (2000). The tures and their correlates. Cambridge, MA: MIT
proper treatment of symbols in a connection- Press.
ist architecture. In E. Dietrich & A. Mark- Jones, S. S., & Smith, L. B. (1 993 ). The place of
man (Eds.), Cognitive dynamics: Conceptual perception in children’s concepts. Cognitive De-
change in humans and machines. Hillsdale, NJ: velopment, 8, 1 1 3 –1 3 9.
Erlbaum. Katz, J. J., & Fodor, J. (1 963 ). The structure of
Holyoak, K. J., & Koh, K. (1 987). Surface semantic theory. Language, 3 9, 1 70–21 0.
and structural similarity in analogical transfer. Keil, F. C. (1 989). Concepts, kinds and develop-
Memory and Cognition, 1 5 , 3 3 2–3 40. ment. Cambridge, MA: Bradford Books/MIT
Holyoak, K. J., & Thagard, P. (1 989). Analogi- Press.
cal mapping by constraint satisfaction. Cogni- Kemler, D. G. (1 983 ). Holistic and analytic modes
tive Science, 1 3 , 295 –3 5 5 . in perceptual and cognitive development. In
Holyoak, K. J., & Thagard, P. (1 995 ). Mental leaps: T. J. Tighe & B. E. Shepp (Eds.), Perception, cog-
Analogy in creative thought. Cambridge, MA: nition, and development: Interactional analyses
MIT Press. (pp. 77–1 01 ). Hillsdale, NJ: Erlbaum.
Kohonen, T. (1 995 ). Self-organizing maps. Berlin: Markman, A. B., & Wisniewski, E. J. (1 997).
Springer-Verlag. Similar and different: The differentiation of
Kolers, P. A., & Roediger, H. L. (1 984). Proce- basic-level categories. Journal of Experimental
dures of mind. Journal of Verbal Learning and Psychology: Learning, Memory, and Cognition,
Verbal Behavior, 2 3 , 425 –449. 2 3 , 5 4–70.
Kotovsky, L., & Gentner, D. (1 996). Comparison Marr, D. (1 982). Vision. San Francisco: Freeman.
and categorization in the development of rela- Marr, D., & Nishihara, H. K. (1 978). Represen-
tional similarity. Child Development, 67, 2797– tation and recognition of three dimensional
2822. shapes. Proceedings of the Royal Society of Lon-
Krumhansl, C. L. (1 978). Concerning the appli- don, Series B, 2 00, 269–294.
cability of geometric models to similarity data: Medin, D. L., Goldstone, R. L., & Gentner, D.
The interrelationship between similarity and (1 993 ). Respects for similarity. Psychological
spatial density. Psychological Review, 85 , 45 0– Review, 1 00, 25 4–278.
463 . Medin, D. L., Lynch, E. B., & Solomon, K. O.
Krumhansl, C. L. (1 988). Testing the density (2000). Are there kinds of concepts? Annual
hypothesis: Comment on Corter. Journal of Ex- Review of Psychology, 5 1 , 1 21 –1 47.
perimental Psychology: General, 1 1 7, 1 01 –1 04. Medin, D. L., & Ortony, A. (1 989). Psychologi-
Kruschke, J. K. (1 992). ALCOVE: An exemplar- cal essentialism. In S. Vosniadou & A.
based connectionist model of category learning. Ortony (Eds.), Similarity and analogical rea-
Psychological Review, 99, 22–44. soning. Cambridge, UK: Cambridge University
Lamberts, K. (2000). Information-accumulation Press.
theory of speeded categorization. Psychological Medin, D. L., & Schaffer, M. M. (1 978). A con-
Review, 1 07, 227–260. text theory of classification learning. Psycholog-
Lassaline, M. E. (1 996). Structural alignment in ical Review, 85 , 207–23 8.
induction and similarity. Journal of Experimen- Mitchell, M. (1 993 ). Analogy-making as percep-
tal Psychology: Learning, Memory, and Cogni- tion: A computer model. Cambridge, MA: MIT
tion, 2 2 , 75 4–770. Press.
Lee, M. D. (2002a). A simple method for gen- Murphy, G. L. (2002). The big book of concepts.
erating additive clustering models with limited Cambridge, MA: MIT Press.
complexity. Machine Learning, 49, 3 9–5 8. Murphy, G. L., & Medin, D. L. (1 985 ). The role of
Lee, M. D. (2002b). Generating additive cluster- theories in conceptual coherence. Psychological
ing models with limited stochastic complexity. Review, 92 , 289–3 1 6.
Journal of Classification, 1 9, 69–85 . Navarro, D. J., & Lee, M. D. (2003 ). Combin-
Li, M., & Vitanyi, P. (1 997). An introduction to ing dimensions and features in similarity-based
Kolmogorov complexity and its applications (2nd representations. In S. Becker, S. Thrun, & K.
ed.). New York: Springer-Verlag. Obermayer (Eds.) Advances in Neural Infor-
Love, B. C. (2000). A computational level theory mation Processing Systems, 1 5 , 67–74. MIT
of similarity. Proceeding of the Cognitive Science Press.
Society, 2 2 , 3 1 6–3 21 . Neisser, U. (1 967). Cognitive psychology. New
Markman, A. B., & Gentner, D. (1 993 a). Struc- York: Appleton-Century-Crofts.
tural alignment during similarity comparisons. Nickerson, R. S. (1 972). Binary classification re-
Cognitive Psychology, 2 5 , 43 1 –467. action time: A review of some studies of hu-
Markman, A. B., & Gentner, D. (1 993 b). Splitting man information-processing capabilities. Psy-
the differences: A structural alignment view of chonomic Monograph Supplements, 4 (whole
similarity. Journal of Memory and Language, 3 2 , no. 6), 275 –3 1 7.
5 1 7–5 3 5 . Nosofsky, R. M. (1 984). Choice, similarity, and
Markman, A. B., & Gentner,D. (1 996). Com- the context theory of classification. Journal
monalities and differences in similarity com- of Experimental Psychology: Learning, Memory,
parisons. Memory & Cognition, 2 4, 23 5 – and Cognition, 1 0, 1 04–1 1 4.
249 Nosofsky, R. M. (1 986). Attention, similarity,
Markman, A. B., and Gentner, D. (1 997). The and the identification-categorization relation-
effects of alignability on memory. Psychological ship. Journal of Experimental Psychology: Gen-
Science, 8, 3 63 –3 67. eral, 1 1 5 , 3 9–5 7.
similarity 35
Nosofsky, R. M. (1 991 ). Stimulus bias, asymmet- semantic relations. Journal of Verbal Learning
ric similarity, and classification. Cognitive Psy- and Verbal Behavior, 1 2 , 1 –20.
chology, 2 3 , 94–1 40. Ritov, I., Gati, I., & Tversky, A. (1 990). Differen-
Osterholm, K., Woods, D. J., & Le Unes, A. tial weighting of common and distinctive com-
(1 985 ). Multidimensional scaling of Rorschach ponents. Journal of Experimental Psychology:
inkblots: Relationships with structured self- General, 1 1 9, 3 0.
report. Personality and Individual Differences, 6, Ross, B. H. (1 987). This is like that: The use of
77–82. earlier problems and the separation of similar-
Palmer, S. E. (1 975 ). Visual perception and ity effects. Journal of Experimental Psychology:
world knowledge. In D. A. Norman & D. E. Learning, Memory, and Cognition, 1 3 , 629–
Rumelhart (Eds.), Explorations in cognition 63 9.
(pp. 279–3 07). San Francisco: W. H. Ross, B. H. (1 989). Distinguishing types of super-
Freeman. ficial similarities: Different effects on the access
Palmeri, T. J. (1 997). Exemplar similarity and the and use of earlier problems. Journal of Experi-
development of automaticity. Journal of Experi- mental Psychology: Learning, Memory, and Cog-
mental Psychology: Learning, Memory, and Cog- nition, 1 5 , 45 6–468.
nition, 2 3 , 3 24–3 5 4. Rozenblit, L., & Keil, F. (2002). The misunder-
Podgorny, P., & Garner, W. R. (1 979). Reaction stood limits of folk science: An illusion of ex-
time as a measure of inter-intraobject visual planatory depth. Cognitive Science, 2 6, 5 21 –
similarity: Letters of the alphabet. Perception 5 62.
and Psychophysics, 2 6, 3 7–5 2. Schank, R. C. (1 982). Dynamic memory: A the-
Polk, T. A., Behensky, C., Gonzalez, R., & Smith, ory of reminding and learning in computers
E. E. (2002). Rating the similarity of simple and people. Cambridge: Cambridge University
perceptual stimuli: Asymmetries induced by Press.
manipulating exposure frequency. Cognition, Schvaneveldt, R. (1 985 ). Measuring the struc-
82 , B75 –B88. ture of expertise. International Journal of Man-
Pylyshyn, Z. W. (1 985 ). Computation and cogni- Machine Studies, 2 3 , 699–728.
tion. Cambridge, MA: MIT Press. Sergent, J., & Takane, Y. (1 987). Structures in
Quine, W. V. (1 969). Ontological relativity and two-choice reaction-time data. Journal of Ex-
other essays. New York: Columbia University perimental Psychology: Human Perception and
Press. Performance, 1 3 , 3 00–3 1 5 .
Quine, W. V. (1 977). Natural kinds. In S. P. Shepard, R. N. (1 962a). The analysis of proxi-
Schwartz (Ed.), Naming, necessity, and natural mities: Multidimensional scaling with an un-
kinds (pp. 1 5 5 –1 75 ). Ithaca, NY: Cornell Uni- known distance function. Part I. Psychometrika,
versity Press. 2 7, 1 25 –1 40.
Raaijmakers, J. G. W., & Shiffrin, R. M. (1 981 ). Shepard, R. N. (1 962b). The analysis of proxi-
Search of associative memory. Psychological Re- mities: Multidimensional scaling with an un-
view, 88, 93 –1 3 4. known distance function. Part II. Psychome-
Richardson, M. W. (1 93 8). Multidimensional trika, 2 7, 21 9–246.
psychophysics. Psychological Bulletin, 3 5 , 65 9– Shepard, R. N. (1 972). Psychological representa-
660. tion of speech sounds. In E. E. David, Jr., & P. B.
Ripley B. D. (1 996). Pattern recognition and neu- Denes (Eds.), Human communication: A unified
ral networks. Cambridge: Cambridge Univer- view, (pp. 67–1 1 1 ). New York: McGraw-Hill.
sity Press. Shepard, R. N. (1 982). Geometrical approxima-
Rips, L. J. (1 989). Similarity, typicality, and cate- tions to the structure of musical pitch. Psycho-
gorization. In S. Vosniadu & A. Ortony (Eds.), logical Review, 89, 3 05 –3 3 3 .
Similarity, analogy, and thought (pp. 21 –5 9). Shepard, R. N. (1 987). Toward a universal law
Cambridge: Cambridge University Press. of generalization for psychological science.
Rips, L. J., & Collins, A. (1 993 ). Categories and Science, 2 3 7, 1 3 1 7–1 3 23 .
resemblance. Journal of Experimental Psycho- Shepard, R. N., & Arabie, P. (1 979). Additive
logy: General, 1 2 2 , 468–486. clustering: Representation of similarities as
Rips, L. J., Shoben, E. J., & Smith, E. E. (1 973 ). combinations of discrete overlapping proper-
Semantic distance and the verification of ties. Psychological Review, 86, 87–1 23 .
Simon, D., & Holyoak, K. J. (2002). Structural Solla, & D. A. Cohn (Eds.), Advances in neural
dynamics of cognition: From consistency the- information processing systems 1 1 (pp. 5 9–68).
ories to constraint satisfaction. Personality and Cambridge, MA: MIT Press.
Social Psychology Review, 6, 283 –294. Tenenbaum, J. B., De Silva, V., & Lanford, J. C.
Sjoberg, L. (1 972). A cognitive theory of similar- (2000). A global geometric framework for non-
ity. Goteborg Psychological Reports, 2 (1 0). linear dimensionality reduction. Science, 2 90,
Sloman, S. A. (1 993 ). Feature-based induction. 22–23 .
Cognitive Psychology, 2 5 , 23 1 –280. Tenenbaum, J. B., & Griffiths, T. L. (2001 ).
Sloman, S. A. (1 996). The empirical case for Generalization, similarity and Bayesian infer-
two systems of reasoning. Psychological Bulletin, ence. Behavioral and Brain Sciences, 2 4, 629–
1 1 9, 3 –22. 640.
Smith, E. E., Shoben, E. J., & Rips, L. J. (1 974). Torgerson, W. S. (1 95 8). Theory and methods of
Structure and process in semantic memory: A scaling. New York: Wiley.
featural model for semantic decisions. Psycho- Torgerson, W. S. (1 965 ). Multidimensionsal scal-
logical Review, 81 , 21 4–241 . ing of similarity. Psychometrika, 3 0, 3 79–3 93 .
Smith, E. E., & Sloman, S. A. (1 994). Similarity- Treisman, A. M. (1 986). Features and objects
versus rule-based categorization. Memory and in visual processing. Scientific American, 2 5 5 ,
Cognition, 2 2 , 3 77–3 86. 1 06–1 1 5 .
Smith, J. D., & Kemler, D. G. (1 984). Overall sim- Tversky, A. (1 977). Features of similarity. Psycho-
ilarity in adults’ classification: The child in all of logical Review, 84, 3 27–3 5 2.
us. Journal of Experimental Psychology: General, Tversky, A., & Gati, I. (1 982). Similarity, separa-
1 1 3 , 1 3 7–1 5 9. bility, and the triangle inequality. Psychological
Smith, L. B. (1 989). From global similarity to Review, 89, 1 23 –1 5 4.
kinds of similarity: The construction of dimen- Tversky, A., & Hutchinson, J. W. (1 986). Near-
sions in development. In S. Vosniadou & A. est neighbor analysis of psychological spaces.
Ortony (Eds.), Similarity and analogical reason- Psychological Review, 93 , 3 –22.
ing (pp. 1 46–1 78). Cambridge: Cambridge Uni- Ullman, S. (1 996). High-level vision: Object recog-
versity Press. nition and visual cognition. London: MIT Press.
Smith, L. B., & Kemler, D. G. (1 978). Levels Ward, T. B. (1 983 ). Response tempo and
of experienced dimensionality in children and separable-integral responding: Evidence for
adults. Cognitive Psychology, 1 0, 5 02–5 3 2. an integral-to-separable processing sequence
Suzuki, H., Ohnishi, H., & Shigemasu, K. (1 992). in visual perception. Journal of Experimental
Goal-directed processes in similarity judgment. Psychology: Human Perception and Performance,
Proceedings of the fourteenth annual conference 9, 1 03 –1 1 2.
of the Cognitive Science Society (pp. 3 43 –3 48). Wedell, D. (1 994). Context effects on similar-
Hillsdale, NJ: Erlbaum. ity judgments of multidimensional stimuli: In-
Tarr, M. J., & Gauthier, I. (1 998). Do view- ferring the structure of the emotion space.
point-dependent mechanisms generalize across Journal of Experimental Social Psychology, 3 0,
members of a class? Cognition. Special Issue: 1 –3 8.
Image-Based Object Recognition in Man, Mon- Wiener-Ehrlich, W. K., Bart, W. M., & Millward,
key, and Machine, 67, 73 –1 1 0. R. (1 980). An analysis of generative representa-
Tenenbaum, J. B. (1 996). Learning the structure tion systems. Journal of Mathematical Psychol-
of similarity. In G. Tesauro, D. S. Touretzky, & ogy, 2 1 (3 ), 21 9–246.
T. K. Leen (Eds.), Advances in Neural Informa- Zhang, S., & Markman, A. B. (1 998). Over-
tion Processing Systems 8 (pp. 4–9). Cambridge, coming the early entrant advantage: The
MA: MIT Press. role of alignable and nonalignable differ-
Tenenbaum, J. B. (1 999). Bayesian modeling of ences. Journal of Marketing Research, 3 5 , 41 3 –
human concept learning. In M. S. Kearns, S. A. 426.
CHAPTER 3
Concepts and Categories: Memory,

Meaning, and Metaphysics
Douglas L. Medin
Lance J. Rips
Introduction Our goal in this chapter is to provide an

overview of work on concepts and categories
The concept of concepts is difficult to define, in the last half-century. There has been such
but no one doubts that concepts are funda- a consistent stream of research during this
mental to mental life and human commu- period that one reviewer of this literature,
nication. Cognitive scientists generally agree Gregory Murphy (2002), was compelled to
that a concept is a mental representation that call his monograph, The Big Book of Con-
picks out a set of entities, or a category. That cepts. Our task is eased by recent reviews,
is, concepts refer, and what they refer to are including Murphy’s aptly named one (e.g.,
categories. It is also commonly assumed that Medin, Lynch, & Solomon, 2000; Murphy,
category membership is not arbitrary, but 2002; Rips, 2001 ; Wisniewski, 2002). Their
rather a principled matter. What goes into thoroughness gives us the luxury of writ-
a category belongs there by virtue of some ing a review focused on a single perspective
lawlike regularities. However, beyond these or “flavor” – the relation between concepts,
sparse facts, the concept CONCEPT is up memory, and meaning.
for grabs. As an example, suppose you have The remainder of this chapter is orga-
the concept TRIANGLE represented as “a nized as follows. In the rest of this section,
closed geometric form having three sides.” we briefly describe some of the tasks or
In this case, the concept is a definition, but functions that cognitive scientists have ex-
it is unclear what else might be in your trian- pected concepts to perform. This will pro-
gle concept. Does it include the fact that ge- vide a road map to important lines of re-
ometry books discuss them (although some search on concepts and categories. Next, we
don’t) or that they have 1 80 degrees (al- return to developments in the late 1 960s and
though in hyperbolic geometry none do)? It early 1 970s that raised the exciting possi-
is also unclear how many concepts have def- bility that laboratory studies could provide
initions or what substitutes for definitions in deep insights into both concept represen-
ones that do not. tations and the organization of (semantic)
37
memory. Then we describe the sudden col- gencies – can result in that novel property
lapse of this optimism and the ensuing lines being incorporated into the conceptual rep-
of research that, however intriguing and im- resentation. In other cases, relations between
portant, essentially ignored questions about categories may support inference. For exam-
semantic memory. Next, we trace a number ple, finding out that flashlights can contain
of relatively recent developments under the sirens may lead you to entertain the idea that
somewhat whimsical heading, “Psychometa- cell phones and fire extinguishers might also
physics.” This is the view that concepts are contain sirens. Hierarchical conceptual re-
embedded in (perhaps domain-specific) the- lations support both inductive and deduc-
ories. This will set the stage for returning tive reasoning. If all trees contain xylem and
to the question of whether research on con- hawthorns are trees, then one can deduce
cepts and categories is relevant to semantics that hawthorns contain xylem. In addition,
and memory organization. We use that ques- finding out that white oaks contain phloem
tion to speculate about future developments provides some support for the inductive in-
in the field. In this review, we use all caps to ference that other kinds of oaks contain
refer to concepts and quotation marks to re- phloem. People also use categories to in-
fer to linguistic expressions. stantiate goals in planning (Barsalou, 1 983 ).
For example, a person planning to do some
night fishing might create an ad hoc con-
Functions of Concepts
cept, THINGS TO BRING ON A NIGHT
For purposes of this chapter, we collapse the FISHING TRIP, which would include a
many ways people can use concepts into two fishing rod, tackle box, mosquito repellent,
broad functions: categorization and com- and flashlight.
munication. The conceptual function that Concepts are also centrally involved in
most research has targeted is categorization, communication. Many of our concepts corre-
the process by which mental representations spond to lexical entries, such as the English
(concepts) determine whether some entity word “flashlight.” For people to avoid mis-
is a member of a category. Categorization understanding each other, they must have
enables a wide variety of subordinate func- comparable concepts in mind. If A’s con-
tions because classifying something as a cat- cept of cell phone corresponds with B’s con-
egory member allows people to bring their cept of flashlight, it will not go well if A
knowledge of the category to bear on the asks B to make a call. An important part
new instance. Once people categorize some of the function of concepts in communica-
novel entity, for example, they can use rel- tion is their ability to combine to create an
evant knowledge for understanding and pre- unlimited number of new concepts. Nearly
diction. Recognizing a cylindrical object as a every sentence you encounter is new – one
flashlight allows you to understand its parts, you have never heard or read before – and
trace its functions, and predict its behavior. concepts (along with the sentence’s gram-
For example, you can confidently infer that mar) must support your ability to under-
the flashlight will have one or more batter- stand it. Concepts are also responsible for
ies, will have some sort of switch, and will more ad hoc uses of language. For exam-
normally produce a beam of light when the ple, from the base concepts of TROUT and
switch is pressed. FLASHLIGHT, you might create a new con-
Not only do people categorize in order to cept, TROUT FLASHLIGHT, which in the
understand new entities, but they also use context of our current discussion would pre-
the new entities to modify and update their sumably be a flashlight used when trying
concepts. In other words, categorization sup- to catch trout (and not a flashlight with a
ports learning. Encountering a member of a picture of a trout on it, although this may
category with a novel property – for exam- be the correct interpretation in some other
ple, a flashlight that has a siren for emer- context). A major research challenge is to
concepts and categories 39
understand the principles of conceptual com- A Minihistory

bination and how they relate to commu-
nicative contexts (see Fodor, 1 994, 1 998; Research on concepts in the middle of the
Gleitman & Papafragou, Chap. 26 ; Hamp- last century reflected a gradual easing away
ton, 1 997; Partee, 1 995 ; Rips, 1 995 ; Wisni- from behaviorist and associative learning tra-
ewski, 1 997). ditions. The focus, however, remained on
learning. Most of this research was con-
ducted in laboratories using artificial cate-
Overview gories (a sample category might be any geo-
metric figure that is both red and striped)
So far, we have introduced two roles for con- and directed at one of two questions: (1 )
cepts: categorization (broadly construed) Are concepts learned by gradual increases
and communication. These functions and in associative strength, or is learning all
associated subfunctions are important to or none (Levine, 1 962; Trabasso & Bower,
bear in mind because studying any one in 1 968)?, and (2) Which kinds of rules or
isolation can lead to misleading conclusions concepts (e.g., disjunctive, such as RED
about conceptual structure (see Solomon, OR STRIPED, versus conjunctive, such as
Medin, & Lynch, 1 999, for a review bear- RED AND STRIPED) are easiest to learn
ing on this point). At this juncture, how- (Bourne, 1 970; Bruner, Goodnow, & Austin,
ever, we need to introduce one more plot 1 95 6; Restle, 1 962)?
element into the story we are telling. Pre- This early work tended either to ignore
sumably everything we have been talking real world concepts (Bruner et al., 1 95 6, rep-
about has implications for human memory resent something of an exception here) or
and memory organization. After all, con- to assume implicitly that real world con-
cepts are mental representations, and people cepts are structured according to the same
must store these representations somewhere kinds of arbitrary rules that defined the
in memory. However, the relation between artificial ones. According to this tradition,
concepts and memory may be more inti- category learning is equivalent to finding
mate. A key part of our story is what we out the definitions that determine category
call “the semantic memory marriage,” the membership.
idea that memory organization corresponds
to meaningful relations between concepts. Early Theories of Semantic Memory
Mental pathways that lead from one concept
to another – for example, from ELBOW to Although the work on rule learning set the
ARM – represent relations like IS A PART stage for what was to follow, two develop-
OF that link the same concepts. Moreover, ments associated with the emergence of cog-
these memory relations may supply the con- nitive psychology dramatically changed how
cepts with all or part of their meaning. By people thought about concepts.
studying how people use concepts in cat-
egorizing and reasoning, researchers could turning point 1: models
simultaneously explore memory structure of memory organization
and the structure of the mental lexicon. In The idea of programming computers to
other words, the idea was to unify catego- do intelligent things (artificial intelligence
rization, communication (in its semantic as- or AI) had an important influence on the
pects), and memory organization. As we will development of new approaches to con-
see, this marriage was somewhat troubled, cepts. Quillian (1 967) proposed a hierarchi-
and there are many rumors about its break- cal model for storing semantic information
up. However, we are getting ahead of our in a computer that was quickly evaluated
story. The next section begins with the ini- as a candidate model for the structure of
tial romance. human memory (Collins & Quillian, 1 969).
Figure 3 .1 provides an illustration of part of by trying out hypotheses until they hit on the
a memory hierarchy that is similar to what correct definition. In the early 1 970s, how-
the Quillian model suggests. ever, Eleanor Rosch and her associates (e.g.,
First, note that the network follows a Rosch, 1 973 ; Rosch & Mervis, 1 975 ) argued
principle of cognitive economy. Properties that most everyday concepts are not orga-
true of all animals, such as eating and breath- nized in terms of the sorts of necessary and
ing, are stored only with the animal con- sufficient features that would form a (con-
cept. Similarly, properties that are generally junctive) definition for a category. Instead,
true of birds are stored at the bird node, such concepts depend on properties that are
but properties distinctive to individual kinds generally true but need not hold for every
(e.g., being yellow) are stored with the spe- member. Rosch’s proposal was that concepts
cific concept nodes they characterize (e.g., have a “family resemblance” structure: What
CANARY). A property does not have to determines category membership is whether
be true of all subordinate concepts to be an example has enough characteristic prop-
stored with a superordinate. This is illus- erties (is enough like other members) to be-
trated in Figure 3 .1 , where CAN FLY is as- long to the category.
sociated with the bird node; the few excep- One key idea associated with this view
tions (e.g., flightlessness for ostriches) are is that not all category members are equally
stored with particular birds that do not fly. “good” examples of a concept. If member-
Second, note that category membership is ship is based on characteristic properties and
defined in terms of positions in the hierar- some members have more of these proper-
chical network. For example, the node for ties than others, then the ones with more
CANARY does not directly store the infor- characteristic properties should better ex-
mation that canaries are animals; instead, emplify the category. For example, canaries
membership would be “computed” by mov- but not penguins have the characteristic
ing from the canary node up to the bird node bird properties of flying, singing, and build-
and then from the bird node to the animal ing a nest, so one would predict that ca-
node. It is as if a deductive argument is be- naries would be more typical birds than pen-
ing constructed of the form, “All canaries are guins. Rosch and Mervis (1 975 ) found that
birds and all birds are animals and therefore people do rate some examples of a cate-
all canaries are animals.” gory to be more typical than others and
Although these assumptions about cog- that these judgments are highly correlated
nitive economy and traversing a hierarchi- with the number of characteristic features
cal structure may seem speculative, they an example possesses. They also created
yield a number of testable predictions. As- artificial categories conforming to family
suming traversal takes time, one would pre- resemblance structures, and produced typ-
dict that the time needed for people to ver- icality effects on learning and on goodness-
ify properties of concepts should increase of-example judgments.
with the network distance between the con- Rosch and her associates (Rosch, Mervis,
cept and the property. For example, people Gray, Johnson, & Boyes-Braem, 1 976) also
should be faster to verify that a canary is yel- argued that the family resemblance view
low than to verify that a canary has feath- has important implications for understand-
ers and faster to determine that a canary ing concept hierarchies. Specifically, they
can fly than that a canary has skin. Collins suggested that the correlational structure
and Quillian found general support for these of features (instances that share some fea-
predictions. tures tend to share others) creates natu-
ral “chunks” or clusters of instances that
turning point 2: natural concepts correspond to what they referred to as
and family resemblance basic-level categories. For example, having
The work on rule learning suggested that feathers tends to correlate with nesting in
children (and adults) might learn concepts trees (among other features) in the animal
Has skin
Can move around
Animal Eats
Breathes
Has fins
Has wings
Fish Can swim
Can fly
Bird Has gills
Has feathers
Canary
Ostrich Shark Salmon
Can bite Is pink

Can sing Has long, Is dangerous Is edible
Is yellow thin legs Swims upstream
Is tall to lay eggs
,
Can t fly
Figure 3.1 . A semantic network.
kingdom, and having gills with living in according to their similarity in meaning,
water. The first cluster tends to isolate birds, where similarity is imposed by correlated
whereas the second picks out fish. The and taxonomic structure (see Anderson &
general idea is that these basic-level cat- Bower, 1 973 , and Norman & Rumelhart,
egories provide the best compromise be- 1 975 , for theories and research in this tra-
tween maximizing within-category similar- dition, and Goldstone & Son, Chap. 2, for
ity (birds tend to be quite similar to each current theories of similarity).
other) and minimizing between-category
similarity (birds tend to be dissimilar to
Fragmentation of Semantics and Memory
fish). Rosch et al. showed that basic-level
categories are preferred by adults in nam- Prior to about 1 980, most researchers in
ing objects, are learned first by children, this field saw themselves as investigating “se-
are associated with the fastest categoriza- mantic memory” – the way that long-term
tion reaction times, and have a number of memory organizes meaningful information.
other properties that indicate their special Around 1 980, the term itself became passé,
conceptual status. at least for this same group of researchers,
Turning points 1 and 2 are not unrelated. and the field regrouped under the banner
To be sure, the Collins and Quillian model, of “Categories and Concepts” (the title of
as initially presented, would not predict typ- Smith & Medin’s, 1 981 , synthesis of research
icality effects (but see Collins & Loftus, in this area). At the time, these researchers
1 975 ), and it was not obvious that it con- may well have seen this change as a purely
tained anything that would predict the im- nominal one, but we suspect it reflected a
portance of basic-level categories. Nonethe- retreat from the claim that semantic mem-
less, these conceptual breakthroughs led to ory research had much to say about either
an enormous amount of research premised semantics or memory. How did this change
on the notion that memory groups concepts come about?
memory organization wise studiously avoided a stand on memory

Initial support for a Quillian-type mem- structure.
ory organization came from Quillian’s own Evidence from priming in lexical decision
collaboration with Allan Collins (Collins & tasks also appeared ambiguous. Although
Quillian, 1 969), which we mentioned ear- priming occurs between associatively related
lier. Related evidence also came from ex- words (e.g., “bread” and “butter”), it is not
periments on lexical priming: Retrieving the so clear that there is priming between se-
meaning of a word made it easier to retrieve mantically linked words in the absence of
the meaning of semantically related words such associations. It is controversial whether,
(e.g., Meyer & Schvanevelt, 1 971 ). In these for example, there is any automatic activa-
lexical decision tasks, participants viewed tion between “glove” and “hat” despite their
a single string of letters on each trial and joint membership in the clothing category
decided, under reaction time instructions, (see Balota, 1 994, for a discussion). If mem-
whether the string was a word (“daisy”) or ory is organized on a specifically semantic
a nonword (“raisy”). The key result was that basis – on the basis of word meanings – then
participants were faster to identify a string there should be activation between seman-
as a word if it followed a semantically re- tically related words even in the absence of
lated item rather than an unrelated one. For other sorts of associations. A meta-analysis
example, reaction time for “daisy” was faster by Lucas (2000) turned up a small effect
if, on the preceding trial, the participant had of this type, but as Lucas noted, it is diffi-
seen “tulip” rather than “steel.” This priming cult to tell whether the semantically related
effect is consistent with the hypothesis that pairs in these experiments are truly free of
activation from one concept spreads through associations.
memory to semantically related ones. The idea that memory organization mim-
Later findings suggested, however, that ics semantic organization is an attractive one,
the relation between word meaning and and memory researchers attempted to mod-
memory organization was less straightfor- ify the original Quillian approach to bring
ward. For example, the typicality findings it into line with the results we have just re-
(see turning point 2) suggested that time to viewed (e.g., Collins & Loftus, 1 975 ). The
verify sentences of the form An X is a Y data from the sentence verification and lex-
(e.g., “A finch is a bird”) might be a func- ical decision experiments, however, raised
tion of the overlap in the information that doubts about these theories. Later in this
participants knew about the meaning of X chapter, we consider whether newer tech-
and Y rather than the length of the pathway niques can give us a better handle on the
between these concepts. The greater the in- structure of memory, but for now let’s turn
formation overlap – for example, the greater to the other half of the memory equals
the number of properties that the referents meaning equation.
of X and Y shared – the faster the time to
confirm a true sentence and the slower the
time to disconfirm a false one. For exam- semantics
ple, if you know a lot of common informa- Specifying the meaning of individual words
tion about finches and birds but only a lit- is one of the goals of semantics, but only one.
tle common information about ostriches and Semantics must also account for the mean-
birds, you should be faster to confirm the ing of phrases, sentences, and longer units
sentence “A finch is a bird” than “An ostrich of language. One problem in using a theory
is a bird.” Investigators proposed several the- like Quillian’s as a semantic theory is how to
ories along these lines that made minimal extend its core idea – that the meaning of a
commitments to the way memory organized word is the coordinates of a node in mem-
its mental concepts (McCloskey & Glucks- ory structure – to explain how people under-
berg, 1 979; Smith, Shoben, & Rips, 1 974; stand meaningful phrases and sentences. Of
Tversky, 1 977). Rosch’s (1 978) theory like- course, Quillian’s theory and its successors
can tell us how we understand sentences that word meaning. It seems unlikely, for exam-
correspond to preexisting memory path- ple, that learning the sentence about Fred
ways. We have already seen how the model changes the meaning of “daisy.” Moreover,
can explain our ability to confirm sentences because meaning is a function of the en-
such as “A daisy is a flower.” However, what tire network, the same incidental sentences
about sentences that do not correspond to change the meaning of all words. Learning
preexisting connections – sentences such as about Fred’s daisy placing shifts the meaning
“Fred placed a daisy in a lunchbox”? of seemingly unrelated words such as “hip-
The standard approach to sentence mean- popotamus” if only a bit.
ing in linguistics is to think of the mean- Related questions apply to other psycho-
ing of sentences as built from the meaning logical theories of meaning in the semantic
of the words that compose them, guided memory tradition. To handle the typicality
by the sentence’s grammar (e.g., Chierchia results mentioned earlier, some investigators
& McConnell-Ginet, 1 990). We can under- proposed that the mental representation of
stand sentences that we have never heard or a category such as daisies consists of a pro-
read before, and because there are an enor- totype for that category – for example, a
mous number of such novel sentences, we description of a good example of a daisy
cannot learn their meaning as single chunks. (e.g., Hampton, 1 979; McCloskey & Glucks-
It therefore seems quite likely that we com- berg, 1 979). The meaning of “daisy” in these
pute the meaning of these new sentences. prototype theories would thus include de-
However, if word meaning is the position of fault characteristics, such as growing in gar-
a node in a network, it is hard to see how this dens, that apply to most, but not all, daisies.
position could combine with other positions We discuss prototype theories in more de-
to produce sentence meanings. What is the tail soon, but the point for now is that pro-
process that could take the relative network totype representations for individual words
positions for FRED, PLACE, DAISY, IN, and are difficult to combine to obtain a mean-
LUNCHBOX and turn them into a meaning ing for phrases that contain them. One po-
for “Fred placed a daisy in a lunchbox”? tential way to combine prototypes – fuzzy
If you like the notion of word meaning set theory (Zadeh, 1 965 ) – proved vulner-
as relative position, then one possible solu- able to a range of counterexamples (Osh-
tion to the problem of sentence meaning erson & Smith, 1 981 , 1 982). In general, the
is to connect these positions with further prototypes of constituent concepts can differ
pathways. Because we already have an ar- from the prototypes of their combinations in
ray of memory nodes and pathways at our unpredictable ways (Fodor, 1 994). The pro-
disposal, why not add a few more to en- totype of BIRDS THAT ARE PETS (per-
code the meaning of a new sentence? Per- haps a parakeet-like bird) may differ from
haps the meaning of “Fred placed a daisy in the prototypes of both BIRDS and PETS
the lunchbox” is given by a new set of path- (see Storms, de Boeck, van Mechelen, &
ways that interconnect the nodes for FRED, Ruts, 1 998, for related evidence). Thus, if
PLACE, DAISY, and so on, in a configura- word meanings are prototypes, it is hard to
tion corresponding to the sentence’s struc- see how the meaning of phrases could be
ture. This is the route that Quillian and his a compositional function of the meaning of
successors took (e.g., Anderson & Bower, their parts.
1 973 ; Norman & Rumelhart, 1 975 ; Quil- Other early theories proposed that cate-
lian, 1 969), but it comes at a high price. gory representations consist of descriptions
Adding new connections changes the over- of exemplars of the category in question.
all network configuration and thereby al- For example, the mental representation of
ters the meaning of the constituent terms. DAISY would include descriptions of spe-
(Remember: Meaning is supposed to be rel- cific daisies that an individual had encoded
ative position.) However, it is far from ob- (e.g., Hintzman, 1 986; Medin & Schaf-
vious that encoding incidental facts alters fer, 1 978; Nosofsky, 1 986). However, these
theories have semantic difficulties of their has often been followed over the past few
own (see Rips, 1 995 ). For example, if by decades. Although researchers using artifi-
chance the only Nebraskans you have met cial categories have sometimes been guilty
are chiropractors and the only chiroprac- of treating these categories as ends in them-
tors you have met are Nebraskans, then ex- selves, there are enough parallels between
emplar models appear to mispredict that results with artificial and natural categories
“Nebraskan” and “chiropractor” will be syn- that each area of research informs the other
onyms for you. (see Medin & Coley, 1 998, for a review).
To recap briefly, we have found that ex-
perimental research on concepts and cate- prototype versus exemplar models
gories was largely unable to confirm that One idea compatible with Rosch’s family re-
global memory organization (as in Quillian’s semblance hypothesis is the prototype view. It
semantic memory) conferred word meaning. proposes that people learn the characteristic
In addition, neither the global theories that features (or central tendency) of categories
initiated this research nor the local proto- and use them to represent the category
type or exemplar theories that this research (e.g., Reed, 1 972). This abstract prototype
produced were able to provide insight into need not correspond to any experienced
the basic semantic problem of how we un- example. According to this theory, catego-
derstand the meaning of novel sentences. rization depends on similarity to the pro-
This left semantic memory theory in the un- totypes. For example, to decide whether
enviable position of being unable to explain some animal is a bird or a mammal, a per-
either semantics or memory. son would compare the (representation of )
that animal to both the bird and the mam-
mal prototypes and assign it to the cate-
Functions and Findings gory whose prototype it most resembled.
The prototype view accounts for typicality
effects in a straightforward manner. Good
Current research in this field still focuses
examples have many characteristic proper-
on categorization and communication, but
ties of their category and have few charac-
without the benefit of a framework that
teristics in common with the prototypes of
gives a unified explanation for the functions
contrasting categories.
that concepts play in categorizing, reasoning,
Early research appeared to provide strik-
learning, language understanding, and mem-
ing confirmation of the idea of prototype ab-
ory organization. In this section, we survey
straction. Using random dot patterns as the
the state of the art, and in the following one,
prototypes, Posner and Keele (1 968, 1 970)
we consider the possibility of reuniting some
produced a category from each prototype.
of these roles.
The instances in a category were “distor-
tions” of the prototype generated by mov-
Category Learning and Inference
ing constituent dots varying distances from
One nice aspect of Rosch and Mervis’s their original positions. Posner and Keele first
(1 975 ) studies of typicality effects is that trained participants to classify examples that
they used both natural language categories they created by distorting the prototypes.
and artificially created categories. Finding Then they gave a transfer test in which they
typicality effects with natural (real world) presented both the old patterns and new low
categories shows that the phenomenon is or high distortions that had not appeared
of broad interest; finding these same effects during training. In addition, the prototypes,
with artificial categories provides systematic which the participants had never seen, were
control for potentially confounding variables presented during transfer. Participants had
(e.g., exemplar frequency) in a way that can- to categorize these transfer patterns, but
not be done for lexical concepts. This general unlike the training procedure, the transfer
strategy linking the natural to the artificial test gave participants no feedback about the
correctness of their responses. The tests ei- ample, pitted the number of typical features
ther immediately followed training or ap- against high similarity to particular train-
peared after a 1 -week delay. ing examples and found that categorization
Posner and Keele (1 970) found that cor- was more strongly influenced by the latter.
rect classification of the new patterns de- A prototype model would make the oppo-
creased as distortion (distance from a cat- site prediction.
egory prototype) increased. This is the Another contrast between exemplar and
standard typicality effect. The most striking prototype models revolves around sensitiv-
result was that a delay differentially affected ity to within-category correlations (Medin,
categorization of prototypic versus old train- Altom, Edelson, & Freko, 1 982). A proto-
ing patterns. Specifically, correct categoriza- type representation captures what is on av-
tion of old patterns decreased over time to a erage true of a category, but is insensitive
reliably greater extent than performance on to within-category feature distributions. For
prototypes. In the immediate test, partici- example, a bird prototype could not repre-
pants classified old patterns more accurately sent the impression that small birds are more
than prototypes; however, in the delayed likely to sing than large birds (unless one
test, accuracy on old patterns and proto- had separate prototypes for large and small
types was about the same. This differential birds). Medin et al. (1 982) found that people
forgetting is compatible with the idea that are sensitive to within-category correlations
training leaves participants with represen- (see also Malt & Smith, 1 984, for corre-
tations of both training examples and ab- sponding results with natural object cate-
stracted prototypes but that memory, for gories). Exemplar theorists were also able
examples, fades more rapidly than memory to show that exemplar models could readily
for prototypes. The Posner and Keele results predict other effects that originally appeared
were quickly replicated by others and con- to support prototype theories – differen-
stituted fairly compelling evidence for the tial forgetting of prototypes versus train-
prototype view. ing examples, and prototypes being catego-
However, this proved to be the begin- rized as accurately or more accurately than
ning of the story rather than the end. Other training examples. In short, early skirmishes
researchers (e.g., Brooks, 1 978; Medin & strongly favored exemplar models over pro-
Schaffer, 1 978) put forth an exemplar view of totype models. Parsimony suggested no need
categorization. Their idea was that memory to posit prototypes if stored instances could
for old exemplars by itself could account for do the job. Since the early 1 980s, there have
transfer patterns without the need for posit- been a number of trends and developments
ing memory for prototypes. On this view, in research and theory with artificially con-
new examples are classified by assessing their structed categories, and we give only the
similarity to stored examples and assigning briefest of summaries here.
the new example to the category that has the
most similar examples. For instance, some
unfamiliar bird (e.g., a heron) might be cor- new models
rectly categorized as a bird not because it is There are now more contending models for
similar to a bird prototype, but rather be- categorizing artificial stimuli, and the early
cause it is similar to flamingos, storks, and models have been extensively elaborated.
other shore birds. For example, researchers have generalized
In general, similarity to prototypes and the original Medin and Schaffer (1 978) ex-
similarity to stored examples will tend to emplar model to handle continuous dimen-
be highly correlated (Estes, 1 986). Nonethe- sions (Nosofsky, 1 986), to address the time
less, for some category structures and for course of categorization (Lamberts, 1 995 ;
some specific exemplar and prototype mod- Nosofsky & Palmeri, 1 997a; Palmeri, 1 997),
els, it is possible to develop differential pre- to generate probability estimates in infer-
dictions. Medin and Schaffer (1 978), for ex- ence tasks (Juslin & Persson, 2002), and
to embed it in a neural network (Krus- contrary to empirical evidence that it does

chke, 1 992). (Verguts, Storms, & Tuerlinckx, 2001 ).
Three new kinds of classification theories Neural network or connectionist models
have been added to the discussion: ration- are the third type of new model on the
al approaches, decision-bound models, and scene (see Knapp & Anderson, 1 984, and
neural network models. Anderson (1 990, Kruschke, 1 992, for examples, and Doumas
1 991 ) proposed that an effective approach & Hummel, Chap. 4, for further discussion
to modeling cognition in general and catego- of connectionism). It may be a mistake to
rization in particular is to analyze the infor- think of connectionist models as compris-
mation available to a person in the situation ing a single category because they take many
of interest and then to determine abstractly forms, depending on assumptions about hid-
what an efficient, if not optimal, strategy den units, attentional processes, recurrence,
might be. This approach has led to some new and the like. There is one sense in which
sorts of experimental evidence (e.g., Ander- neural network models with hidden units
son & Fincham, 1 996; Clapper & Bower, may represent a clear advance on proto-
2002) and pointed researchers more in the type models: They can form prototypes in
direction of the inference function of cate- a bottom-up manner that reflects within-
gories. Interestingly, the Medin and Schaf- category structure (e.g., Love, Medin, &
fer exemplar model corresponds to a spe- Gureckis, 2004). That is, if a category com-
cial case of the rational model, and Nosofsky prises two distinct clusters of examples, net-
(1 991 ) discussed the issue of whether the work models can create a separate hidden
rational model adds significant explanatory unit for each chunk (e.g., large birds versus
power. However, there is also some evidence small birds) and thereby show sensitivity to
undermining the rational model’s predic- within-category correlations.
tions concerning inference (e.g., Malt, Ross,
& Murphy, 1 995 ; Murphy & Ross, 1 994; mixed models and multiple
Palmeri, 1 999; Ross & Murphy, 1 996). categorization systems
Decision-bound models (e.g., Ashby & A common response to hearing about var-
Maddox, 1 993 ; Maddox & Ashby, 1 993 ) ious models of categorization is to suggest
draw their inspiration from psychophysics that all the models may be capturing im-
and signal detection theory. Their primary portant aspects of categorization and that
claim is that category learning consists of research should determine in which con-
developing decision bounds around the cat- texts one strategy versus another is likely
egory that will allow people to categorize to dominate. One challenge to this divide
examples successfully. The closer an item is and conquer program is that the predic-
to the decision bound the harder it should tions of alternative models tend to be highly
be to categorize. This framework offers a correlated, and separating them is far from
new perspective on categorization in that it trivial. Nonetheless, there is both empiri-
may lead investigators to ask questions such cal research (e.g., Johansen & Palmeri, 2002;
as How do the decision bounds that hu- Nosofsky, Clark, & Shin, 1 989; Reagher &
mans adopt compare with what is optimal? Brooks, 1 993 ) and theoretical modeling that
and What kinds of decision functions are support the idea that mixed models of cat-
easy or hard to acquire? Researchers have egorization are useful and perhaps neces-
also directed efforts to distinguish decision- sary. Current efforts combine rules and ex-
bound and exemplar models (e.g., Maddox, amples (e.g., Erickson & Kruschke, 1 998;
1 999; Maddox & Ashby, 1 998; McKinley & Nosofsky, Palmeri, & McKinley, 1 994), as
Nosofsky, 1 995 ; Nosofsky, 1 998; Nosofsky well as rules and decision bounds (Ashby,
& Palmeri, 1 997b). One possible difficulty Alfonso-Reese, Turken, & Waldron, 1 998).
with decision-bound models is that they Some models also combine exemplars and
contain no obvious mechanism by which prototypes (e.g., Homa, Sterling, & Trepel,
stimulus familiarity can affect performance, 1 981 ; Minda & Smith, 2001 ; Smith & Minda,
1 998, 2000; Smith, Murray, & Minda, 1 997), advocates both for and against the multi-
but it remains controversial whether the ad- ple systems view (e.g., Filoteo, Maddox, &
dition of prototypes is needed (e.g., Buse- Davis, 2001 ; Maddox, 2002; Nosofsky & Jo-
meyer, Dewey, & Medin, 1 984; Nosofsky hansen, 2000; Palmeri & Flanery, 2002; Re-
& Johansen, 2000; Nosofsky & Zaki, 2002; ber, Stark, & Squire, 1 998a, 1 998b). It is safe
Stanton, Nosofsky, & Zaki, 2002). to predict that this issue will receive continu-
The upsurge of cognitive neuroscience ing attention.
has reinforced the interest in multiple mem-
ory systems. One intriguing line of research
by Knowlton, Squire, and associates (Knowl- inference learning
ton, Mangels, & Squire, 1 996; Knowlton & More recently, investigators have begun to
Squire, 1 993 ; Squire & Knowlton, 1 995 ) fa- worry about extending the scope of cate-
voring multiple categorization systems in- gory learning studies by looking at inference.
volves a dissociation between categoriza- Often, we categorize some entity to help
tion and recognition. Knowlton and Squire us accomplish some function or goal. Ross
(1 993 ) used the Posner and Keele dot pattern (1 997, 1 999, 2000) showed that the category
stimuli to test amnesic and matched con- representations people develop in laboratory
trol patients on either categorization learn- studies depend on use and that use affects
ing and transfer or a new–old recognition later categorization. In other words, models
task (involving five previously studied pat- of categorization ignore inference and use at
terns versus five new patterns). The amne- their peril. Other work suggests that hav-
siacs performed very poorly on the recog- ing a cohesive category structure is more
nition task but were not reliably different important for inference learning than it is
from control participants on the categoriza- for classification (Yamauchi, Love, & Mark-
tion task. Knowlton and Squire took this as man, 2002; Yamauchi & Markman, 1 998,
evidence for a two-system model, one based 2000a, 2000b; for modeling implications see
on explicit memory for examples and one Love, Markman, & Yamauchi, 2000; Love
based on an implicit system (possibly pro- et al., 2004). More generally, this work raises
totype abstraction). On this view, amnesiacs the possibility that diagnostic rules based on
have lost access to the explicit system but superficial features, which appear so promi-
can perform the classification task using their nently in pure categorization tasks, may not
intact implicit memory. be especially relevant for contexts involv-
These claims have provoked a number of ing multiple functions or more meaning-
counterarguments. First, Nosofsky and Zaki ful stimuli (e.g., Markman & Makin, 1 998;
(1 998) showed that a single system (exem- Wisniewski & Medin, 1 994).
plar) model could account for both types
of data from both groups (by assuming the
exemplar-based memory of amnesiacs was feature learning
impaired but not absent). Second, investi- The final topic on our “must mention” list
gators have raised questions about the de- for work with artificial categories is feature
tails of Knowlton and Squire’s procedures. learning. It is a common assumption in both
Specifically, Palmeri and Flanery (1 999) sug- models of object recognition and category
gested that the transfer tests themselves learning that the basic units of analysis or
may have provided cues concerning cate- features remain unchanged during learning.
gory membership. They showed that un- There is increasing evidence and supporting
dergraduates who had never been exposed computational modeling that indicate this
to training examples (the students believed assumption is incorrect. Learning may in-
they were being shown patterns sublimi- crease or decrease the distinctiveness of fea-
nally) performed above chance on trans- tures and may even create new features (see
fer tests in this same paradigm. The debate Goldstone, 1 998, 2003 ; Goldstone, Lippa, &
is far from resolved, and there are strong Shriffin, 2001 ; Goldstone & Stevyers, 2001 ;
Schyns, Goldstone, & Thibaut, 1 998; Schyns organized in terms of theories, we return to
& Rodet, 1 997). some laboratory studies that illustrate this
Feature learning has important implica- fuzzy boundary. For the moment, however,
tions for our understanding of the role of we shift attention to the more language-like
similarity in categorization. It is intuitively functions of concepts.
compelling to think of similarity as a causal
factor supporting categorization – things be- Language Functions
long to the same category because they are
similar. However, this may have things back- Most investigators in the concepts and cat-
ward. Even standard models of categoriza- egories area continue to assume that, in ad-
tion assume learners selectively attend to dition to their role in recognition and cat-
features that are diagnostic, and the work on egory learning, concepts also play a role in
feature learning suggests that learners may understanding language and in thinking dis-
create new features that help partition ex- cursively about things. In addition to de-
amples into categories. In that sense, similar- termining, for example, which perceptual
ity (in the sense of overlap in features) is the patterns signal the appearance of a daisy,
by-product, not the cause, of category learn- the DAISY concept also contributes to the
ing. We take up this point again in discussing meaning of sentences such as our earlier
the theory theory of categorization later in example, “Fred placed a daisy in a lunch-
this review. box.” We noted that early psychological re-
search on concepts ran into problems in
explaining the meaning of linguistic units
reasoning
larger than single words. Most early theories
As we noted earlier, one of the central func- posited representations, such as networks,
tions of categorization is to support reason- exemplars, or prototypes, that did not com-
ing. Having categorized some entity as a bine easily and, thus, complicated the prob-
bird, one may predict with reasonable con- lem of sentence meaning. Even if we reject
fidence that it builds a nest, sings, and can the idea that sentence meanings are compo-
fly, although none of these inferences is cer- sitional functions of word meaning, we still
tain. In addition, between-category relations need a theory of sentence meanings, and no
may guide reasoning. For example, from the obvious contenders are in sight. In this sec-
knowledge that robins have some enzyme in tion, we return to the role that concepts play
their blood, one is likely to be more confi- in language understanding to see whether
dent that the enzyme is in sparrows than in new experiments and theories have clarified
raccoons. The basis for this confidence may this relationship.
be that robins are more similar to sparrows
than to raccoons or that robins and sparrows concepts as positions in memory structures
share a lower-rank superordinate category
One difficulty with the older semantic mem-
than do robins and raccoons (birds versus
ory view of word meaning is that memory
vertebrates). We do not review this literature
seems to change with experience from one
here because Sloman and Lagnado (Chap. 5 )
person to another, whereas meaning must
summarize it nicely.
be more or less constant. The sentences
you have encoded about daisies may differ
summary drastically from those we have encoded be-
Bowing to practicalities, we have glossed a cause your conversation, reading habits, and
lot of research and skipped numerous other other verbal give and take can diverge in
relevant studies. The distinction between ar- important ways from ours. If meaning de-
tificially created and natural categories is pends on memory for these sentences, then
itself artificial – at least in the sense that your meaning for “daisy” should likewise
it has no clear definition or marker. When differ from ours. This raises the question
we take up the idea that concepts may be of how you could possibly understand the
sentences in this chapter in the way we in- but performance trailed off dramatically for
tend or how you could meaningfully dis- larger amounts of noise. Foes of the meaning-
agree with us about some common topic (see as-relative-position theory might claim that
Fodor, 1 994). the poor performance under the .6% noise
It is possible that two people – say, condition proves their contention. Advo-
Calvin and Martha – might be able to main- cates would point to the successful part of
tain mutual intelligibility as long as their the simulations and note that their ability to
conceptual networks are not too different. detect correct correspondences usually im-
It is partly an empirical question as to proved as the number of points increased (al-
how much their networks can vary while though there are some nonmonotonicities in
still allowing Calvin’s concepts to map cor- the simulation results that qualify this find-
rectly into Martha’s. To investigate this issue, ing). Clearly, this is only the beginning of the
Goldstone and Rogosky (2002) carried out empirical side of the debate. For example,
some simulations that try to recover such a the differences between Martha and Calvin
mapping. The simulations modeled Calvin’s are likely to be not only random, but also
conceptual system as the distance between systematic, as in the case in which Martha
each pair of his concepts (e.g., the distance grew up on a farm and Calvin was a city kid.
between DOG and CAT in Calvin’s system
might be one unit, whereas the distance be-
tween DOG and DAISY might be six units). concept combination
Martha’s conceptual system was represented Let’s look at attempts to tackle head-on the
in the same way (i.e., by exactly the same problem of how word-level concepts com-
interconcept distances) except for random bine to produce the meanings of larger lin-
noise that Goldstone and Rogosky added to guistic units. There is relatively little re-
each distance to simulate the effect of dis- search in this tradition on entire sentences
parate beliefs. A constraint-satisfaction algo- (see Conrad & Rips, 1 986; Rips, Smith, &
rithm then applied to Calvin’s and Martha’s Shoben, 1 978), but there has been a fairly
systems that attempted to recover the origi- steady research stream devoted to noun
nal correspondence between the concepts – phrases, including adjective-noun (“edible
to map Calvin’s DOG to Martha’s DOG, flowers”), noun-noun (“food flowers”), and
Calvin’s DAISY to Martha’s DAISY, and so noun-relative clause combinations (“flowers
on. The results of the stimulations show that that are foods”). We’ll call the noun or ad-
with 1 5 concepts in each system (the max- jective parts of these phrases components and
imum number considered and the case in distinguish the main or head noun (“flowers”
which the model performed best) and with in each of our examples) from the adjective
no noise added to Martha’s system, the algo- or noun modifier (“edible” or “food”). The
rithm was always able to find the correct cor- aim of the research in question is to describe
respondence. When the simulation added to how people understand these phrases and, in
each dimension of the interconcept distance particular, how the typicality of an instance
in Martha a small random increment (drawn in these combinations depends on the typ-
from a normal distribution with mean 0 and icality of the same instance in the compo-
standard deviation equal to .004 times the nents. How does the typicality of a marigold
maximum distance), the algorithm recov- in the category of edible flowers depend on
ered the correspondence about 63 % of the the typicality of marigolds in the categories
time. When the standard deviation increased of edible things and flowers? As we already
to .006 times the maximum distance, the al- noticed, this relationship is far from straight-
gorithm succeeded about 1 5 % of the time forward (parakeets are superbly typical as
(Goldstone & Rogosky, 2002, Figure 2). pet birds but less typical pets and even less
What should one make of the Goldstone typical birds).
and Rogosky results? Correspondences may There is an optimistic way of looking at
be recovered for small amounts of noise, the results of this research program and a
pessimistic way as well (for more recent, tors have studied are familiar or, at least,
mostly optimistic, reviews of this work, see have familiar referents. Some people have
Hampton, 1 997; Murphy, 2002; Rips, 1 995 ; experience with edible flowers, for example,
and Wisniewski, 1 997). The optimistic angle and know that they include nasturtiums, are
is that interesting phenomena have turned sometimes used in salads, are often brightly
up in investigating the typicality structure of colored, are peppery tasting, and so on. We
combinations. The pessimistic angle, which learn many of these properties by direct
is a direct result of the same phenomena, is or indirect observation (by what Hampton,
that little progress has been made in figuring 1 987, called “extensional feedback”), and
out a way to predict the typicality of a com- they are sometimes impossible to learn sim-
bination from the typicality of its compo- ply by knowing the meaning of “edible” and
nents. This difficulty is instructive – in part “flower.” Because these properties can affect
because all psychological theories of concept the typicality of potential instances, the typi-
combination posit complex, structured rep- cality of these familiar combinations will not
resentations, and they depict concept combi- be a function of the typicality of their com-
nation either as rearranging (or augmenting) ponents. This means that if we are going to
the structure of the head noun by means of be able to predict typicality in a composi-
the modifier (Franks, 1 995 ; Murphy, 1 988; tional way, we will have to factor out the
Smith, Osherson, Rips, & Keane, 1 988) or as contribution of these directly acquired prop-
fitting both head and modifier into a larger erties. Rips (1 995 ) refered to this filtering as
relational complex (Gagné & Shoben, 1 997). the “no peeking principle” – no peeking at
Table 3 .1 summarizes what is on offer from the referents of the combination. Of course,
these theories. Earlier models (at the top of you might be able to predict typicality if
the table) differ from later ones mainly in you already know the relevant real-world
terms of the complexity of the combination facts in addition to knowing the meaning
process. Smith et al. (1 988), for example, of the component concepts. The issue about
aimed at explaining simple adjective-noun understanding phrases, however, is how we
combinations (e.g., “white vegetable”) that, are able to interpret an unlimited number
roughly speaking, refer to the intersection of new ones. For this purpose, people need
of the sets denoted by modifier and head some procedure for computing new mean-
(white vegetables are approximately the in- ings from old ones that is not restricted by
tersection of white things and vegetables). the limited set of facts they happened to
In this theory, combination occurs when the have learned (e.g., through idiosyncratic en-
modifier changes the value of an attribute counters with edible flowers).
in the head noun (changing the value of the Another reason for lack of progress is
color attribute in VEGETABLE to WHITE) that some of the combinations used in
and boosts the importance of this attribute in this research may be compounds or lexi-
the overall representation. Later theories at- calized phrases [e.g., “White House” (ac-
tempted to account for nonintersective com- cent on “White”) = the residence of the
binations (e.g., “criminal lawyers,” who are President] rather than modifier-head con-
often not both criminals and lawyers). These structions [e.g., “white house” (accent on
combinations call for more complicated ad- “house”) = a house whose color is white].
justments – for example, determining a rela- Compounds are often idiomatic; their mean-
tion that links the modifier and head (a crim- ing is not an obvious function of their
inal lawyer is a lawyer whose clients are in parts (see Gleitman & Gleitman’s, 1 970,
for criminal charges) or extracting a value distinction between phrasal and compound
from the modifier that can then be assigned constructions; and Partee, 1 995 ).
to the head (e.g., a panther lawyer might be There is a deeper reason, however, for
one who is especially vicious or tenacious). the difficulty in predicting compound typ-
So why no progress? One reason is that icality from component typicality. Even if
many of the combinations that investiga- we adhere to the no peeking principle and
Table 3.1 . Some Theories of Concept Combination

Representation
Model Domain of Head Noun Modification Process
Hampton (1 987) Noun-Noun and Schemas Modifier and head
Noun-Relative- (attribute-value lists contribute values to
Clause NPs with attributes combination on the
(conjunctive NPs, varying in basis of importance and
e.g., sports that are importance) centrality
also games)
Smith, Osherson, Rips, Simple Schemas Adjective shifts value on
& Keane (1 988) Adjective-Noun NPs (attribute-value lists relevant attribute in
(e.g., red apple) with distributions of head and increases
values and weighted weight on relevant
attributes) dimension
Murphy (1 988) Adj-Noun and Schemas (lists of slots Modifier fills relevant slot;
Noun-Noun NPs and fillers) then representation is
(esp. non- “cleaned up” on the
predicating NPs, e.g., basis of world
corporate knowledge
lawyer)
Franks (1 995 ) Adj-Noun and Schemas Attribute-values of
Noun-Noun NPs (attribute-value modifier and head are
(esp. privatives, e.g., structures with summed with modifier
fake gun) default values for potentially overriding
some attributes) or negating head values
Gagné & Shoben Noun-Noun NPs Lexical representations Nouns are bound as
(1 997) containing arguments to relations
distributions of (e.g., flu virus = virus
relations in which causing flu)
nouns figure
Wisniewski (1 997) Noun-Noun NPs Schemas (lists of slots 1 . Modifier noun is bound
and fillers, including to role in head noun
roles in relevant (e.g., truck soap =
events) soap for cleaning
trucks)
2. Modifier value is
reconstructed in head
noun (e.g., zebra
clam = clam with
stripes)
3 . Hybridization (e.g.,
robin canary = cross
between robin and
canary)
stick to clear modifier-head constructions, about, a smoky apple (so extensional feed-
the typicality of a combination can depend back does not inform your conception of the
on “emergent” properties that are not part noun phrase), but nevertheless it is plausi-
of the representation of either component ble to suppose that smoky apples are not
(Hastie, Schroeder, & Weber, 1 990; Kunda, good tasting. Having a bad taste, however,
Miller, & Claire, 1 990; Medin & Shoben, is not a usual property of (and is not likely
1 988; Murphy, 1 988). For example, you may to be stored as part of a concept for) ei-
never have encountered, or even thought ther apples or smoky things; on the contrary,
many apples and smoky things (e.g., smoked Of course, we have only considered the
meats, cheese, fish) are often quite good tast- role of schemas or prototypes in concept
ing. If you agree with our assessment that combination, but it is worth noting that
smoky apples are likely to be bad tasting, many of the same problems with semantic
that is probably because you imagine a way composition affect other contemporary the-
in which apples could become smoky (being ories, such as latent semantic analysis (Lan-
caught in a kitchen fire, perhaps) and you in- dauer & Dumais, 1 997), which take a global
fer that under these circumstances the apple approach to meaning. Latent semantic
would not be good to eat. The upshot is that analysis takes as input a table of the fre-
the properties of a combination can depend quencies with which words appear in spe-
on complex inductive or explanatory infer- cific contexts. In one application, for exam-
ences (Johnson & Keil, 2000; Kunda et al., ple, the items comprise about 60,000 word
1 990). If these properties affect the typical- types taken from 3 0,000 encyclopedia en-
ity of an instance with respect to the com- tries, and the table indicates the frequency
bination, then there is little hope of a sim- with which each word appears in each entry.
ple model of this phenomenon. No current The analysis then applies a technique similar
theory comes close to providing an adequate to factor analysis to derive an approximately
and general account of these processes. 3 00-dimensional space in which each word
appears as a point and in which words that
inferential versus atomistic concepts tend to co-occur in context occupy neigh-
Research on the typicality structure of noun boring regions in the space. Because this
phrases is of interest for what it can tell technique finds a best fit to a large corpus
us about people’s inference and problem- of data, it is sensitive to indirect connections
solving skills. However, because these pro- between words that inform their meaning.
cesses are quite complex – drawing on gen- However, the theory has no clear way to
eral knowledge and inductive reasoning to derive the meaning of novel sentences. Al-
produce emergent information – we can not though latent semantic analysis could rep-
predict noun phrase typicality in other than resent a sentence as the average position of
a limited range of cases. For much the same its component words, this would not allow
reason, typicality structure does not appear it to capture the difference between, say,
very helpful in understanding how people The financier dazzled the movie star versus
construct the meaning of a noun phrase The movie star dazzled the financier, which
while reading or listening. By themselves, depend on sentence structure. In addition,
emergent properties do not rule out the pos- the theory uses the distance between two
sibility of a model that explains how people words in semantic space to represent the re-
derive the meaning of a noun phrase from lation between them, and so the theory has
the meaning of its components. Composi- trouble with semantic relations that, unlike
tionality does not require that all aspects distances, are asymmetric. It is unclear, for
of the noun phrase’s meaning are parts of example, how it could cope with the fact
the components’ meanings. It is sufficient that father implies parent but parent does not
to find some computable function from the imply father.
components to the composite that is simple On the one hand, online sentence un-
enough to account for people’s understand- derstanding is a rapid, reliable process. On
ing (see Partee, 1 995 , for a discussion of types the other hand, the meaning of even sim-
of composition). The trouble is that if noun ple adjective-noun phrases seems to re-
phrases’ meanings require theory construc- quire heady inductive inferences. Perhaps
tion and problem solving, such a process is we should distinguish, then, between the
unlikely to explain the ease and speed with interpretation of a phrase or sentence and
which we usually understand them in ongo- its comprehension (Burge, 1 999). On this
ing speech. view, comprehension gives us a more or less
immediate understanding of novel phrases generic sentences seem to hold despite the
based primarily on the word meaning of the existence of numerous exceptions; “Lions
components and syntactic/semantic struc- have manes” seems to be true even though
ture. Interpretation, by contrast, is a po- most lions (e.g., female and immature lions)
tentially unlimited process relying on the do not have manes (see Krifka et al., 1 995 ,
result of comprehension plus inference for an introduction to generic sentences).
and general knowledge. The comprehen- There is an obvious relation between the
sion/interpretation distinction may be more truth or acceptability of generic sentences
of a continuum than a dichotomy, but the and the typicality structure of categories be-
focus on the interpretation end of the con- cause the typical properties of a category
tinuum means that research on concepts is are those that appear in true generic sen-
difficult to apply to comprehension. As we tences. Of course, as Krifka et al. noted, this
have just noticed, it is hard, if not impossi- may simply be substituting one puzzle (the
ble, to compute the typicality structure of truth conditions of generic sentences) for an-
composites. So if we want something read- other (the nature of typical properties), but
ily computable in order to account for com- this may be one place where linguistic and
prehension, we have to look to something cognitive theories might provide mutual in-
simpler than typicality structures (and the sight. Research by Susan Gelman and her
networks, prototypes, schemas, or theories colleagues (see Gelman, 2003 , for a thor-
that underlie them). One possibility (Fodor, ough review) suggests that generic sentences
1 994, 1 998) is to consider a representation are a frequent way for caregivers to convey
in which word meanings are mental units category information to children. Four-year-
not much different from the words them- olds differentiate sentences with bare plurals
selves, and whose semantic values derive (“Lions have manes”) from those explicitly
from (unrepresented) causal connections to quantified by “all” or “some” in comprehen-
their referents. sion, production, and inference tasks (Gel-
man, Star, & Flukes, 2002; Hollander, Gel-
generic noun phrases man, & Star, 2002). It would be of interest
Even if we abandon typicality structures as to know, however, at what age, and in what
accounts of comprehension, however, it does way, children discriminate generics from ac-
not follow that these structures are use- cidental generalizations – for example, when
less in explaining all linguistic phenomena. they first notice the difference between “Li-
More recent research on two fronts seems ons have manes” and “Lions frequently have
to us to hold promise for interactions be- manes” or “Most lions have manes.”
tween psychological and linguistic theories.
First, there are special constructions in En- polysemy
glish that, roughly speaking, describe default A second place to look for linguistic-cogni-
characteristics of members of a category. For tive synergy is in an account of the mean-
example, “Lions have manes” means (ap- ings of polysemous words. Linguists (e.g.,
proximately) that having a mane is a char- Lyons, 1 977, Chap. 1 3 ) traditionally distin-
acteristic property of lions. Bare plural noun guish homonyms such as “mold,” which have
phrases (i.e., plurals with no preceding de- multiple unrelated meanings (e.g., a form
terminers) are one way to convey such a into which liquids are poured vs. a fungus),
meaning as we have just noticed, but indefi- from polysemous terms such as “line,” which
nite singular sentences (“A lion has a mane”) have multiple related meanings (e.g., a geo-
and definite singular sentences (“The lion – metric line vs. a fishing line vs. a line of peo-
Panthera leo – has a mane”) can also convey ple, etc.). What makes polysemous terms in-
the same idea in some of their senses. These teresting to psychologists in this area is that
generic sentences seem to have normative the relations among their meanings often
content. Unlike “Most lions have manes,” possess a kind of typicality structure of their
own. This is the typicality of the senses of the teeming with people in Figure 3 .2). Multiple
expression rather than the typicality of the mechanisms are likely to be at work here.
referents of the expression and is thus a type
of higher-level typicality phenomenon. Fig- summary
ure 3 .2 illustrates such a structure for the We do not mean to suggest that the only lin-
polysemous verb “crawl,” as analyzed by guistic applications of psychologists’ “con-
Fillmore and Atkins (2000). A rectangle cepts” are in dealing with interpretation,
in the figure represents each sense or use generic phrases, and polysemy – far from
and includes both a brief label indicat- it. There are many areas, especially in de-
ing its distinctive property and an exam- velopmental psycholinguistics, that hold the
ple from British corpuses. According to promise of fruitful interactions but that we
Fillmore and Atkins, the central meanings cannot review here. Nor are we suggesting
for crawl have to do with people or crea- that investigators in this area give up the at-
tures moving close to the ground (these tempt to study the use of concepts in im-
uses appear in rectangles with darker out- mediate comprehension. However, concepts
lines in the figure). But there are many for comprehension seem to have different
peripheral uses – for example, time mov- properties from the concepts that figure in
ing slowly (“The hours seemed to crawl the other functions we have discussed, and
by”) and creatures teeming about (“The pic- researchers need to direct more attention to
nic supplies crawled with ants”). The cen- the interface between them.
tral meanings are presumably the original
ones with the peripheral meanings derived
from these by a chaining process. Malt, Theories, Modules, and
Sloman, Gennari, Shi, and Wang (1 999) Psychometaphysics
observed similar instances of chaining
in people’s naming of artifacts, such as bot- We have seen, so far, some downward pres-
tles and bowls, and it is possible that the sure on cognitive theories to portray human
gerrymandered naming patterns reflect the concepts as mental entities that are as simple
polysemy of the terms (e.g., “bottle”) rather and streamlined as possible. This pressure
than different uses of the same meaning. As comes not only from the usual goal of par-
Figure 3 .2 shows, it is not easy to distinguish simony but also from the role that concepts
different related meanings (polysemy) from play in immediate language comprehension.
different uses of the same meaning (contex- However, there is also a great deal of upward
tual variation) and from different unrelated pressure – pressure to include general knowl-
meanings (homonymy). edge about a category as part of its represen-
Some research has attacked the issue of tation. For example, the presence of emer-
whether people store each of the separate gent properties in concept combinations
senses of a polysemous term (Klein & Mur- suggests that people use background knowl-
phy, 2002) or store only the core mean- edge in interpreting these phrases. Similarly,
ing, deriving the remaining senses as needed people may bring background knowledge
for comprehension (Caramazza & Grober, and theories to bear in classifying things even
1 976; Franks, 1 995 ). Conflicting evidence when they know a decision rule for the cat-
in this respect may be due to the fact that egory. Consider psychodiagnostic classifica-
some relations between senses seem relation. Although DSM-IV (the official diag-
tively productive and derivable (regular pol- nostic manual of the American Psycholog-
ysemy, such as the relationship between ical Association) is atheoretical and orga-
terms for animals and their food products, nized in terms of rules, there is clear evidence
e.g., the animal meaning of “lamb” and its that clinicians develop theories of disorders
menu meaning), whereas other senses seem and, contra DSM-IV, weight causally cen-
ad hoc (e.g., the relation between “crawl” = tral symptoms more than causally periph-
moving close to the ground and “crawl” = eral symptoms (e.g., Kim & Ahn, 2002a).
Figure 3.2 . The meanings of crawl: Why it is difficult to distinguish different related meanings
(polysemy) from different uses of the same meaning (contextual variation) and from different
unrelated meanings (homonymy). Adapted from Fillmore & Alking (2000) by permission of Oxford
University Press.
The same holds for laypersons (e.g., Furn- category instance, that controls the nature
ham, 1 995 ; Kim & Ahn, 2002b). of that instance. The source could deter-
In this section, we examine the con- mine, among other things, the instance’s typ-
sequences of expanding the notion of a ical properties, its category membership, and
concept to include theoretical information perhaps even the conditions under which
about a category. In the case of the natu- it comes into and goes out of existence.
ral categories, this information is likely to be Alternatively, people may believe that the
causal because people probably view physi- relevant causal forces are more like a swarm –
cal causes as shaping and maintaining these not necessarily internal to an instance, nor
categories. For artifacts, the relevant infor- necessarily emanating from a unitary spot –
mation may be the intentions of the person but shaping the category in aggregate
creating the object (e.g., Bloom, 1 996). The fashion.
issues we raise here concern the content and The second issue has to do with the cogni-
packaging of these causal beliefs. tive divisions that separate beliefs about dif-
The first of these issues focuses on ferent sorts of categories. People surely be-
people’s beliefs about the locus of these lieve that the causes that help shape daisies
causal forces – what we called “psychometa- differ in type from those that shape teapots.
physics.” At one extreme, people may be- Lay theories about flowers and other liv-
lieve that each natural category is associated ing things include at least crude informa-
with a single source, concentrated within a tion about specifically biological properties,
whereas lay theories of teapots and other ar- IOU for a theory: a belief that there must be
tifacts touch instead on intended and actual something that plays the role of essence even
functions. However, how deep do these di- though they can not supply a description of
visions go? On the one hand, beliefs about it (Medin & Ortony, 1 989).
these domains could be modular (relatively Belief in a hypothetical, minimally de-
clustered, relatively isolated), innate, uni- scribed essence may not seem like the sort
versal, and local to specific brain regions. of thing that could do important cognitive
On the other hand, they may be free float- work, but psychological essentialists have
ing, learned, culturally specific, and dis- pointed out a number of advantages that
tributed across cortical space. This issue is essences might afford, especially to chil-
important to us because it ultimately af- dren. The principal advantage may be in-
fects whether we can patch up the “semantic duction potential. Medin (1 989) suggested
memory” marriage. that essentialism is poor metaphysics but
good epistemology in that it may lead peo-
Essentialism and Sortalism ple to expect that members of a kind will
share numerous, unknown properties – an
psychological essentialism assumption that is sometimes correct. In
What’s the nature of people’s beliefs about short, essences have a motivational role to
the causes of natural kinds? One hypothesis play in getting people to investigate kinds’
is that people think there is something inter- deeper characteristics. Essences also explain
nal to each member of the kind – an essence – why category instances seem to run true to
that is responsible for its existence, cate- type – for example, why the offspring of pigs
gory membership, typical properties, and grow up to be pigs rather than cows. They
other important characteristics (e.g., Atran, also explain the normative character of kinds
1 998; Gelman & Hirschfeld, 1 999; Medin & (e.g., their ability to support inductive ar-
Ortony, 1 989). Of course, it is unlikely that guments and their ability to withstand ex-
people think that all categories of natural ceptions and superficial changes) as well as
objects have a corresponding essence. There people’s tendency to view terms for kinds as
is probably no essence of pets, for example, well defined.
that determines an animal’s pet status. How- Evidence for essentialism tends to be in-
ever, for basic-level categories, such as dogs direct. There are results that show that chil-
or gold or daisies, it is tempting to think that dren and adults do in fact hold the sorts
something in the instance determines cru- of beliefs that essences can explain. By the
cial aspects of its identity. Investigators who time they reach first or second grade, chil-
have accepted this hypothesis are quick to dren know that animals whose insides have
point out that the theory applies to people’s been removed are no longer animals, that
beliefs and not to the natural kinds them- baby pigs raised by cows grow up to be
selves. Biologists and philosophers of science pigs rather than cows (Gelman & Well-
agree that essentialism will not account for man, 1 991 ), and that cosmetic surgery does
the properties and variations that real species not alter basic-level category membership
display, in part because the very notion of (Keil, 1 989). Research on adults also shows
species is not coherent (e.g., Ghiselin, 1 981 ; that “deeper” causes – those that themselves
Hull, 1 999). Chemical kinds, for example, have few causes but many effects – tend to
gold, may conform much more closely to be more important in classifying than shal-
essentialist doctrine (see Sober, 1 980). Nev- lower causes (Ahn, 1 998; Sloman, Love, &
ertheless, expert opinion is no bar to layper- Ahn, 1 998).
sons’ essentialist views on this topic. In addi- However, results like these are evidence
tion, psychological essentialists have argued for essence only if there are no better ex-
that people probably do not have a fully planations for the same results, and it seems
fleshed out explanation of what the essence at least conceivable that children and adults
is. What they have, on this hypothesis, is an make room for multiple types and sources
of causes that are not yoked to an essence. sentialism. Ultimately, the issue boils down
According to Strevens (2000), for example, to determining to what extent causal under-
although people’s reasoning and classifying standings are biased toward the assumption
suggest that causal laws govern natural kinds, of a unique, central cause for a category’s
it may be these laws alone, rather than a uni- usual properties.
fying essence, that are responsible for the
findings. According to essentialists, people sortalism
think there is something (an essence) that is According to some versions of essential-
directly or indirectly responsible for the typ- ism, an object’s essence determines not only
ical properties of a natural kind. According which category it belongs to but also the ob-
to Strevens’ minimalist alternative, people ject’s very identity. According to this view,
think that for each typical property there is it is by virtue of knowing that Fido is a dog
something that causes it and that something that you know (in principle) how to identify
may vary for different properties. It is im- Fido over time, how to distinguish Fido from
portant to settle this difference – the pres- other surrounding objects, and how to de-
ence or absence of a unique central cause – termine when Fido came into existence and
if only because the essentialist claim is the when he will go out of it. In particular, if Fido
stronger one. happens to lose his dog essence, then Fido
Essentialists counter that both chil- not only ceases to be a dog, but he also ceases
dren and adults assume a causal struc- to exist entirely. As we noted in discussing
ture consistent with essence (see Braisby, essentialism, not all categories provide these
Franks, & Hampton, 1 996; Diesendruck & identity conditions. Being a pet, for example,
Gelman, 1 999; and Kalish, 1 995 , 2002, for doesn’t lend identity to Fido because he may
debate on this issue). One strong piece of continue to survive in the wild as a nonpet.
evidence for essentialism is that participants According to one influential view (Wiggins,
who have successfully learned artificial, fam- 1 980), the critical identity-lending category
ily resemblance categories (i.e., those in is the one that answers the question What
which category members have no single fea- is it? for an object, and because basic-level
ture in common) nevertheless believe that categories are sometimes defined in just this
each category contained a common, defining way, basic-level categories are the presumed
property (Brooks & Wood, as cited by Ahn source of the principles of identity. (Theo-
et al., 2001 ). Other studies with artificial ries of this type usually assume that identity
“natural” kinds have directly compared es- conditions are associated with just one cate-
sentialist and nonessentialist structures but gory for each object because multiple iden-
have turned in mixed results (e.g., Rehder tity conditions lead to contradictions; see
& Hastie, 2001 ). It is possible that explicit Wiggins, 1 980). Contemporary British phi-
training overrides people’s natural tendency losophy tends to refer to such categories as
to think in terms of a common cause. sortals, however, and we adopt this termi-
In the absence of more direct evidence nology here.
for essence, the essentialist-minimalist de- Sortalism plays an important role in cur-
bate is likely to continue (see Ahn et al., rent developmental psychology because de-
2001 ; Sloman & Malt, 2003 ; and Strevens, velopmentalists have used children’s mas-
2001 , for the latest salvos in this dispute). tery of principles of identity to decide
Indeed, the authors of this chapter are not whether these children possess the associ-
in full agreement. Medin finds minimalism ated concept. In some well-known studies,
too unconstrained, whereas Rips opines that Xu and Carey (1 996) staged for infants a
essentialism suffers from the opposite prob- scene in which a toy duck appears from one
lem. Adding a predisposition toward parsi- side of an opaque screen and then returns be-
mony to the minimalist view seems like a hind it. A toy truck next emerges from the
constructive move, but such a move would other side of the screen and then returns to
shift minimalism considerably closer to es- its hidden position. Infants habituate after
a number of encores of this performance, These consequences of sortalism may be

at which time the screen is removed to re- reasonable ones, but it is worth considering
veal both the duck and truck (the scene the possibility that sortalism – however well
that adults expect) or just one of the ob- it fares as a metaphysical outlook – incor-
jects (duck or truck). Xu and Carey reported rectly describes people’s views about object
that younger infants (e.g., 1 0-month-olds) identity. Although objects typically do not
exhibit no more surprise at seeing one ob- survive a leap from one basic-level category
ject than at seeing two, whereas older infants to another, it may not be impossible for them
(and adults) show more surprise at the one- to do so. Blok, Newman, and Rips (in press)
object tableau. Xu and Carey also showed in and Liittschwager (1 995 ) gave participants
control experiments that younger and older scenarios that described novel transforma-
infants perform identically if they see a pre- tions that sometimes altered the basic-level
view of the two starring objects together be- category. In both studies, participants were
fore the start of the performance. The inves- more likely to agree that the transformed
tigators infer that the younger infants lack object was identical to the original if the
the concepts DUCK and TRUCK because transformational distance was small. How-
they are unable to use a principle of identity ever, these judgments could not always be
for these concepts to discern that a duck predicted by basic-level membership.
cannot turn into a truck while behind the Results from these sci-fi scenarios should
screen. Xu and Carey’s experiments have be treated cautiously, but they suggest that
sparked a controversy about whether the people think individual objects have an in-
experimental conditions are simple enough tegrity that does not necessarily line up with
to allow babies to demonstrate their grip their basic-level category. Although this idea
on object identity (see Wilcox & Bail- may be flawed metaphysics, it is not unrea-
largeon, 1 998; Xu, 2003 ), but for present sonable as psychometaphysics. People may
purposes what is important is the assump- think that individuals exist as the result of lo-
tion that infants’ inability to reidentify ob- cal causal forces – forces that are only loosely
jects over temporal gaps implies lack of the tethered to basic-level kinds. As long as these
relevant concepts. forces continue to support the individual’s
Sortal theories impose strong constraints coherence, it can exist even if it finds itself in
on some versions of essentialism. We noted a new basic-level category. Of course, not all
that one of essentialism’s strong points is essentialists buy into this link between sor-
its ability to explain some of the norma- talism and essentialism. For example, people
tive properties of concepts – for example, might believe that an individual has both
the role concepts play in inductive infer- a category essence and a history and other
ences. However, sortalism places some re- characteristics that make it unique. Gutheil
strictions on this ability. Members of sortal and Rosengren (1 996) hypothesized that ob-
categories can not lose their essence without jects have two difference essences, one for
losing their existence, even in counterfac- membership and another for identity. Just
tual circumstances. This means that if we are how individual identity and kind identity
faced with a premise such as Suppose dogs can play out under these scenarios could then
bite through wire . . . , we cannot reason about be highly variable.
this supposition by assuming the essence of
dogs has changed in such a way as to make
Domain Specificity
dogs stronger. A dog with changed essence
is not a superdog, according to sortalism, The notion of domain specificity has served
but rather has ceased to exist (see Rips, to organize a great deal of research on con-
2001 ). For the same reason, it is impossible ceptual development. For example, much of
to believe without contradiction both that the work on essentialism has been conducted
basic-level categories are sortals and that ob- in the context of exploring children’s naı̈ve
jects can shift from one basic-level category biology (see also Au, 1 994; Carey, 1 995 ;
to another. Gopnik & Wellman, 1 994; Spelke, Phillips,
& Woodward, 1 995 ). Learning in a given and about 0.5 with higher levels included
domain may be guided by certain skeletal (Atran, 1 999; Bailenson et al., 2002; Medin
principles, constraints, and (possibly innate) et al., 2002). Much of the remaining vari-
assumptions about the world (see Gelman, ance owes to obvious perceptual biases
2003 ; Gelman & Coley, 1 990; Keil, 1 981 ; (Itza’ Maya group bats with birds in the
Kellman & Spelke, 1 983 ; Markman, 1 990; same life form) and local ecological con-
Spelke, 1 990). Carey’s (1 985 ) influential cerns. Contrary to received notions about
book presented a view of knowledge acquisi- the history and cross-cultural basis for folk
tion as built on framework theories that en- biological classification, utility does not ap-
tail ontological commitments in the service pear to drive folk taxonomies (cf. Berlin
of a causal understanding of real-world phe- et al., 1 973 ).
nomena. Two domains can be distinguished These folk taxonomies also appear to
from one another if they represent ontologi- guide and constrain reasoning. For exam-
cally distinct entities and sets of phenomena ple, Coley, Medin, and Atran (1 997) found
and are embedded within different causal that both Itza’ Maya and U.S. undergradu-
explanatory frameworks. These ontological ates privilege the generic species level in in-
commitments serve to organize knowledge ductive reasoning. That is, an inference from
into domains such as naive physics (or me- swamp white oak to all white oaks is little if
chanics), naive psychology, or naive biology any stronger than an inference from swamp
(e.g., see Au, 1 994; Carey, 1 995 ; Gelman white oak to all oaks. Above the level of
& Koenig, 2001 ; Gopnik & Wellman, 1 994; oak, however, inductive confidence takes a
Hatano & Inagaki, 1 994; Keil, 1 994; Spelke sharp drop. In other words, people in both
et al., 1 995 ; Wellman & Gelman, 1 992). In cultures treat the generic level (e.g., oak) as
the following, we focus on one candidate do- maximizing induction potential. The results
main, naı̈ve biology. for undergraduates are surprising because
the original Rosch et al. (1 976) basic-level
studies had suggested that a more abstract
folk biology and universals level (e.g., TREE) acted as basic for under-
There is fairly strong evidence that all cul- graduates and should have been privileged
tures partition local biodiversity into tax- in induction. That is, there is a discrep-
onomies whose basic level is that of the ancy between results with undergraduates
“generic species” (Atran, 1 990; Berlin et al., on basicness in naming, perceptual classifi-
1 973 ). Generic species often correspond to cation, and feature listing, on the one hand,
scientific species (e.g., elm, wolf, robin); and inductive inference, on the other hand.
however, for the large majority of percep- Coley et al. (1 997) suggested that the rea-
tually salient organisms (see Hunn, 1 999), soning task relies on expectations associated
such as vertebrates and flowering plants, a with labeling rather than knowledge and that
scientific genus frequently has only one lo- undergraduates may know very little about
cally occurring species (e.g., bear, oak). In biological kinds (see also Wolff, Medin, &
addition to the spontaneous division of local Pankratz, 1 999). Medin and Atran (in press)
flora and fauna into generic species, cultures cautioned against generalizing results on bi-
seem to structure biological kinds into hi- ological thought from undergraduates be-
erarchically organized groups, such as white cause most have relatively little first-hand
oak/oak/tree. Folk biological ranks vary lit- experience with nature.
tle across cultures as a function of theo-
ries or belief systems (see Malt, 1 994, for a
review). For example, in studies with Na- interdomain differences
tive American and various U.S. and Low- One of the most contested domain distinc-
land Maya groups, correlations between folk tions, and one that has generated much
taxonomies and classical evolutionary tax- research, is that between psychology and bi-
onomies of the local fauna and flora av- ology (e.g., Au & Romo, 1 996, 1 999; Carey,
erage r = .75 at the generic species level 1 991 ; Coley, 1 995 ; Gelman, 2003 ; Hatano
& Inagaki, 1 996, 2001 ; Inagaki, 1 997; In- specific deficits in which patients may lose
agaki & Hatano, 1 993 , 1 996; Johnson & their ability to recognize and name category
Carey, 1 998; Keil, 1 995 ; Keil, Levin, Rich- members in a particular domain of concepts.
man, G. Gutheil, 1 999; Rosengren et al., For example, Nelson (1 946) reported a pa-
1 991 ; Springer & Keil, 1 989, 1 991 ). Carey tient who was unable to recognize a tele-
(1 985 ) argued that children initially under- phone, a hat, or a car but could identify
stand biological concepts such as ANIMAL people and other living things (the opposite
in terms of folk psychology, treating ani- pattern is also observed and is more com-
mals as similar to people in having beliefs mon). These deficits are consistent with the
and desires. Others (e.g., Keil, 1 989) argued idea that anatomically and functionally dis-
that young children do have biologically spe- tinct systems represent living versus non-
cific theories, albeit more impoverished than living things (Sartori & Job, 1 988). An al-
those of adults. For example, Springer and ternative claim (e.g., Warrington & Shallice,
Keil (1 989) showed that preschoolers think 1 984) is that these patterns of deficits are due
biological properties are more likely to be to the fact that different kinds of informa-
passed from parent to child than are so- tion aid in categorizing different kinds of ob-
cial or psychological properties. They ar- jects. For example, perceptual information
gued that this implies that the children have may be relatively more important for recog-
a biology-like inheritance theory. The evi- nizing living kinds and functional informa-
dence concerning this issue is complex. On tion more important for recognizing artifacts
the one hand, Solomon, Johnson, Zaitchik, (see Devlin et al., 1 998; Farah & McClelland,
and Carey (1 996) claimed that preschoolers 1 991 , for computational implementations of
do not have a biological concept of inheri- these ideas). Although the weight of evi-
tance because they do not have an adult’s dence appears to favor the kinds of informa-
understanding of the biological causal mech- tion view (see Damasio et al., 1 996; Forde
anism involved. On the other hand, there & Humphreys, in press; Simmons & Barsa-
is growing cross-cultural evidence that 4- lou, 2003 ), the issue continues to be debated
to 5 -year-old children believe (like adults) (see Caramazza & Shelton, 1 998, for a strong
that the category membership of animals defense of the domain specificity view).
and plants follows that of their progeni-
tors regardless of the environment in which domains and memory
the progeny matures (e.g., progeny of cows The issue of domain specificity returns us
raised with pigs, acorns planted with apple to one of earlier themes: Does memory
seeds) (Atran et al., 2001 ; Gelman & Well- organization depend on the meaning? We
man, 1 991 ; Sousa et al., 2002). Furthermore, have seen that early research on semantic
it appears that Carey’s (1 985 ) results on psy- memory was problematic in this respect be-
chology versus biology may only hold for ur- cause many of the findings that investigators
ban children who have little intimate con- used to support meaning-based organiza-
tact with nature (Atran, et al., 2001 ; Ross tion had alternative explanations. General-
et al., 2003 ). Altogether, the evidence sug- purpose decision processes could produce
gests that 4- to 5 -year-old children do have a the same pattern of results even if the in-
distinct biology, although perhaps one with- formation they operated on was haphaz-
out a detailed model of causal mechanisms ardly organized. Of course, in those olden
(see Rozenbilt & Keil, 2002, for evidence days, semantic memory was supposed to
that adults also only have a superficial unbe a hierarchically organized network like
derstanding of mechanisms). that in Figure 3 .1 ; the network clustered
concepts through shared superordinates and
domains and brain regions properties but was otherwise undifferenti-
Are these hypothesized domains associated. Modularity and domain specificity of-
ated with dedicated brain structure? There fer a new take on semantic-based memory
is intriguing evidence concerning category- structure – a partition of memory space into
distinct theoretical domains. Can large-scale cause the modules will not organize them in
theories like these support memory organi- the same way), spelling is out (both concepts
zation in a more adequate fashion than ho- might be tied to the word “people” in an in-
mogeneous networks? ternal dictionary, but then fungi and metal
One difficulty in merging domain speci- forms are both tied to the word “mold”),
ficity with memory structure is that domain and interconnections are out (because they
theories do not taxonomize categories – they would defeat the idea that memory is or-
taxonomize assumptions. What differenti- ganized by domain). We can not treat the
ates domains is the set of assumptions or multiple PEOPLE concepts as independent
warrants they make available for thinking either because it is important to get back
and reasoning (see Toulmin, 1 95 8, for one and forth between them. For example, the
such theory), and this means that a par- rights and responsibilities information about
ticular category of objects usually falls in people in your law school module has to
more than one domain. To put it another get together with the goals and desires in-
way, domain-specific theories are “stances” formation about people in your psychology
(Dennett, 1 971 ) or “construals” (Keil, 1 995 ) module in case you have to decide, together
that overlap in their instances. Take the with your fellow jury members, whether the
case of people. The naive psychology do- killing was a hate crime or was committed
main treats people as having beliefs and goals with malice aforethought.
that lend themselves to predictions about It is reasonable to think that background
actions (e.g., Leslie, 1 987; Wellman, 1 990). theories provide premises or grounds for in-
The naive physics domain treats people as ferences about different topics, and it is also
having properties such as mass and velocity reasonable to think that these theories have
that warrant predictions about support and their “proprietary concepts.” However, if we
motion (e.g., Clement, 1 983 ; McCloskey, take domain-specific modules as the basis
1 983 ). The naive law school domain treats for memory structure – as a new semantic
people as having properties, such as social memory – we also have to worry about non-
rights and responsibilities, that lead to pre- proprietary concepts. We have argued that
dictions about obedience or deviance (e.g., there must be such concepts because we
Fiddick, Cosmides, & Tooby, 2000). The can reason about the same thing with dif-
naive biology domain (at least in the West- ferent theories. Multiple storage is a possi-
ern adult version) treats people as having bility if you are willing to forego memory
properties such as growth and self-animation economy and parsimony and if you can solve
that lead to expectations about behavior and the identifiability problem that we discussed
development. In short, each ordinary cate- in the previous paragraph. Otherwise, these
gory may belong to many domains. domain-independent concepts have to in-
If domains organize memory, then long- habit a memory space of their own, and
term memory will have to store a concept modules can not be the whole story.
in each of the domains to which it is re-
lated. Such an approach makes some of summary
the difficulties of the old semantic memory We seem to be arriving at a skeptical posi-
more perplexing. Recall the issue of identify- tion with respect to the question of whether
ing the same concept across individuals (see memory is semantically organized, but we
“Concepts as Positions in Memory Struc- need to be clear about what is and what is
tures”). Memory modules have the same not in doubt. What we doubt is that there
problem, but they add to it the dilemma of is compelling evidence that long-term mem-
identifying concepts within individuals. How ory is structured in a way that mirrors lexical
do you know that PEOPLE in your psychol- structure as in the original semantic mem-
ogy module is the same concept as PEOPLE ory models. We do not doubt that mem-
in your physics module and PEOPLE in your ory reflects meaningful relations among con-
law school module? Similarity is out (be- cepts, and it is extremely plausible that these
relations depend to some extent on word concepts. We do not recommend a solemn

meanings. For example, there may well be attitude toward our predictions. However,
a relation in memory that links the concept there are several trends that we have identi-
TRUCKER with the concept BEER, and the fied and, barring unforeseen circumstances
existence of this link is probably due in part (never a safe assumption), these trends
to the meaning of “trucker” and “beer.” What should continue. One property our nomina-
is not so clear is whether memory structure tions share is that they uniformly broaden
directly reflects the sort of relations that, in the scope of research on concepts. Here’s our
linguistic theory, organizes the meaning of shortlist.
words (where, e.g., “trucker” and “beer” are
probably not closely connected). We note, Sensitivity to Multiple Functions
too, that we have not touched (and we do
not take sides on) two related issues, which The prototypical categorization experiment
are themselves subjects of controversy. involves training undergraduates for about
One of these residual issues is whether an hour and then giving transfer tests to as-
there is a split in memory between (1 ) gen- sess what they have learned. This practice is
eral knowledge and (2) personally experi- becoming increasingly atypical, even among
enced information that is local to time and researchers studying artificially constructed
place. Semantic memory (Tulving, 1 972) or categories in the lab. More recently, re-
generic memory (Hintzman, 1 978) is some- searchers have studied functions other than
times used as a synonym for general knowl- categorization, as well as interactions across
edge in this sense, and it is possible that functions. (See also Solomon et al., 1 999.)
memory is partitioned along the lines of this
semantic/episodic difference, even though Broader Applications of Empirical
the semantic side is not organized by lexical Generalizations and Computational
content. The controversy in this case is how Models
such a dual organization can handle learning As a wider range of conceptual functions
of “semantic” information from “episodic” comes under scrutiny, new generalizations
encounters (see Tulving, 1 984, and his crit- emerge and computational models face new
ics in the same issue of Behavioral and Brain challenges (e.g., Yamauchi et al., 2002). Both
Sciences, for the ins and outs of this debate). developments set the stage for better bridg-
The second issue that we are shirking is ing to other contexts and applications. This is
whether distributed brands of connection- perhaps most evident in the area of cognitive
ist models can provide a basis for meaning- neuroscience, where computational models
based memory. One reason for shirking is have enriched studies of multiple categoriza-
that distributed organization means that tion and memory systems (and vice versa).
concepts such as DAISY and CUP are not Norman, Brooks, Coblenz, and Babcock
stored according to their lexical content. (1 992) provided a nice example of exten-
Instead, parts of the content of each con- sions from laboratory studies to medical di-
cept are smeared across memory in over- agnosis in the domain of dermatology.
lapping fashion. It is possible, however, that
at a subconcept level – at the level of fea-
Greater Interactions between Work on
tures or hidden units – memory has a se-
Concepts and Psycholinguistic Research
mantic dimension, and we must leave this
question open. We have pressed the point that research on
concepts has diverged from psycholinguis-
tics because two different concepts of con-
Conclusions and Future Directions cepts seem to be in play in these fields. How-
ever, it cannot be true that the concepts
Part of our charge was to make some pro- we use in online sentence understanding
jections about the future of research on are unrelated to the concepts we employ in
reasoning and categorizing. There is an op- is safe to predict even greater future interest
portunity for theorists and experimenters in these questions.
here to provide an account of the interface
between these functions. One possibility, for All of the Above in Combination
example, is to use sentence comprehension
techniques to track the way that the lexical Concepts and categories are shared by all the
content of a word in speech or text is trans- cognitive sciences, and so there is very little
formed in deeper processing (see Pinango, room for researchers to stake out a single
Zurif, & Jackendoff, 1 999, for one effort in paradigm or subtopic and work in blissful
this direction). Another type of effort at in- isolation. Although the idea of a seman-
tegration is Wolff and Song’s (2003 ) work tic memory uniting memory structure,
on causal verbs and people’s perception of lexical organization, and categorization
cause in which they contrast predictions de- may have been illusory, this does not
rived from cognitive linguistics with those mean that progress is possible by ig-
from cognitive psychology. noring the insights on concepts that
these perspectives (and others) pro-
vide. We may see further fragmentation
Greater Diversity of Participant
in the concepts of concepts, but it will still
Populations
be necessary to explore the relations among
Although research with U.S. undergradu- them. Our only firm prediction is that the
ates at major universities will probably never work we will find most exciting will be re-
go out of style (precedent and convenience search that draws on multiple points of view.
are two powerful staying forces), we expect
the recent increase to continue in the use
of other populations. Work by Nisbett and Acknowledgments
his associates (e.g., Nisbett & Norenzayan,
2002; Nisbett, Peng, Choi, & Norenzayan, Preparation of this chapter was supported
2001 ) has called into question the idea that by grants NSF SBR 9983 260 and NSF SES-
basic cognitive processes are universal, and 9907424. The authors also want to thank
categories and conceptual functions are ba- Serge Blok, Rob Goldstone, Keith Holyoak,
sic cognitive functions. In much of the work Ji Son, and Sandra Waxman for comments
by Atran, Medin, and their associates, un- on an earlier version of the chapter.
dergraduates are the “odd group out” in the
sense that their results deviate from those
of other groups. In addition, cross-linguistic References
studies are often an effective research tool
for addressing questions about the relation- Ahn, W-K. (1 998). Why are different features
ship between linguistic and conceptual de- central for natural kinds and artifacts?: The role
velopment (e.g., Waxman, 1 999). of causal status in determining feature central-
ity. Cognition, 69, 1 3 5 –1 78.
More Psychometaphysics Ahn, W-K., Kalish, C., Gelman, S. A., Medin,
D. L., Luhmann, C., Atran, S., Coley, J. D., &
An early critique of the theory theory is that Shafto, P. (2001 ). Why essences are essential
it suffered from vagueness and imprecision. in the psychology of concepts. Cognition, 82 ,
As we have seen in this review, however, this 5 9–69.
framework has led to more specific claims Anderson, J. R. (1 990). The adaptive character of
(e.g., Ahn’s causal status hypothesis) and the thought. Hillsdale, NJ: Erlbaum.
positions are clear enough to generate the- Anderson, J. R. (1 991 ). Is human cognition
oretical controversies (e.g., contrast Smith, adaptive? Behavioral and Brain Sciences, 1 4,
Jones, & Landau, 1 996 with Gelman, 2000, 471 –5 1 7.
and Booth & Waxman, 2002, in press, with Anderson, J. R., & Bower, G. H. (1 973 ). Human
Smith, Jones, Yoshida, & Colunga, 2003 ). It associative memory. Hillsdale, NJ: Erlbaum.
Anderson, J. R., & Fincham, J. M. (1 996). Catego- ture in folk biology. American Anthropologist,
rization and sensitivity to correlation. Journal 75 , 21 4–242.
of Experimental Psychology: Learning, Memory, Blok, S., Newman, G., & Rips, L. J. (in press).
and Cognition, 2 2 , 25 9–277. Individuals and their concepts. In W-K. Ahn,
Ashby, F. G., Alfonso-Reese, L. A., Turken, A. U., R. L. Goldstone, B. C. Love, A. B. Markman, &
& Waldron, E. M. (1 998). A neuropsycho- P. Wolff (Eds.), Categorization inside and outside
logical theory of multiple systems in cate- the lab. Washington, DC: American Psycholog-
gory learning. Psychological Review, 1 05 , 442– ical Association.
481 . Bloom, P. (1 996). Intention, history, and artifact
Ashby, F. G., & Maddox, W. T. (1 993 ). Rela- concepts. Cognition, 60, 1 –29.
tions between prototype, exemplar, and deci- Bourne, L. E., Jr. (1 970). Knowing and using con-
sion bound models of categorization. Journal of cepts. Psychological Review, 77, 5 46–5 5 6.
Mathematical Psychology, 3 7, 3 72–400. Braosby, N., Franks, B., & Hampton, J. (1 996). Es-
Atran, S. (1 985 ). The nature of folk-botanical life sentialism, word use, and concepts. Cognition,
forms. American Anthropologist, 87, 298–3 1 5 . 5 9, 247–274.
Atran, S. (1 990). Cognitive foundations of natural Brooks, L. R. (1 978). Nonanalytic concept forma-
history. Cambridge, UK: Cambridge University tion and memory for instances. In E. Rosch &
Press. B. B. Lloyd (Eds.), Cognition and categorization
Atran, S. (1 998). Folk biology and the anthropol- (pp. 1 69–21 1 ). New York: Wiley.
ogy of science: Cognitive universals and cul- Bruner, J. S., Goodnow, J. J., & Austin, G. A.
tural particulars. Behavioral and Brain Sciences, (1 95 6). A study of thinking. New York: Wiley.
2 1 , 5 47–609. Burge, T. (1 999). Comprehension and interpreta-
Atran, S. (1 999). Itzaj Maya folk-biological taxon- tion. In L. E. Hahn (Ed.), The philosophy of Don-
omy. In D. Medin & S. Atran (Eds.), Folk biology ald Davidson (pp. 229–25 0). Chicago: Open
(pp. 1 1 9–203 ). Cambridge, MA: MIT Press. Court.
Atran, S., Medin, D., Lynch, E., Vapnarsky, V., Busemeyer, J. R., Dewey, G. I., & Medin, D. L.
Ucan Ek’, E., & Sousa, P. (2001 ). Folkbiol- (1 984). Evaluation of exemplar-based general-
ogy doesn’t come from folkpsychology; Evi- ization and the abstraction of categorical in-
dence from Yukatek Maya in cross-cultural per- formation. Journal of Experimental Psychology:
spective. Journal of Cognition and Culture, 1 , Learning, Memory, and Cognition, 1 0, 63 8–
3 –42. 648.
Au, T. K. (1 994). Developing an intuitive under- Caramazza, A., & Grober, E. (1 976). Polysemy
standing of substance kinds. Cognitive Psychol- and the structure of the subjective lexicon.
ogy, 2 7, 71 –1 1 1 . In C. Raman (Ed.). Semantics: Theory and
Au, T. K., & Romo, L. F. (1 996). Building a coher- Application (pp. 1 81 –206). Washington, DC:
ent conception of HIV transmission. Psychology Georgetown University Press.
of Learning and Motivation, 3 5 , 1 93 –241 . Caramazza, A., & Shelton, J. R. (1 998). Domain
Au, T. K., & Romo, L. F. (1 999). Mechanical specific knowledge systems in the brain. Journal
causality in children’s “Folkbiology.” In D. L. of Cognitive Neuroscience, 1 0, 1 –3 4.
Medin & S. Atran (Eds.), Folkbiology (pp. 3 5 5 – Carey, S. (1 985 ). Conceptual change in childhood.
401 ). Cambridge, MA: MIT Press. Cambridge, MA: MIT Press.
Bailenson, J. N., Shum, M., Atran, S., Medin, Corey, S. (1 991 ). Knowledge acquisition. In
D. L., & Coley, J. D. (2002). A bird’s eye view: S. Carey & R. Gelman (Eds.), The epige-
Biological categorization and reasoning within nesis of mind (pp. 25 7–291 ). Hillsdale, NJ:
and across cultures. Cognition, 84, 1 –5 3 . Erlbaum.
Balota, D. A. (1 994). Visual word recognition: Carey, S. (1 995 ). On the origin of causal under-
A journey from features to meaning. In M. A. standing. In D. Sperber, D. Premack, & A. J.
Gernsbacher (Ed.), Handbook of psycholinguis- Premack (Eds.), Causal cognition: A multidisci-
tics (pp. 3 03 –3 5 8). San Diego: Academic Press. plinary debate (pp. 268–3 08). New York: Ox-
Barsalou, L. W. (1 983 ). Ad-hoc categories. Mem- ford University Press.
ory and Cognition, 1 1 , 21 1 –227. Chierchia, G., & McConnell-Ginet, S. (1 990).
Berlin, B., Breedlove, D., & Raven, P. (1 973 ). Gen- Meaning and grammar: An introduction to se-
eral principles of classification and nomencla- mantics. Cambridge, MA: MIT Press.
Clapper, J., & Bower, G. (2002). Adaptive cat- dence from patients with Huntington’s disease.
egorization in unsupervised learning. Journal Behavioral Neuroscience, 1 1 5 , 786–798.
of Experimental Psychology: Learning, Memory, Fodor, J. (1 994). Concepts: A potboiler. Cogni-
and Cognition, 2 8, 908–923 . tion, 5 0, 95 –1 1 3 .
Clement, J. (1 983 ). A conceptual model dis- Fodor, J. (1 998). Concepts: Where cognitive science
cussed by Galileo and used intuitively by went wrong. Oxford, UK: Oxford University
physics students. In D. Gentner & A. L. Stevens Press.
(Eds.), Mental models (pp. 3 25 –3 40). Hillsdale,
Franks, B. (1 995 ). Sense generation: A “quasi-
NJ: Erlbaum.
classical” approach to concepts and concept
Coley, J. D. (1 995 ). Emerging differentiation of combination. Cognitive Science, 1 9, 441 –5 05 .
folk biology and folk psychology. Child Devel-
Furnbam, A. (1 995 ). Lay beliefs about phobia.
opment, 66, 1 85 6–1 874.
Journal of Clinical Psychology, 5 1 , 5 1 8–5 25 .
Coley, J. D., Medin, D. L., & Atran, S. (1 997).
Gagné, C. L., & Shoben, E. J. (1 997). Influence
Does rank have its privilege? Cognition, 64, 73 –
of thematic relations on the comprehension of
1 1 2.
modifier-head combinations. Journal of Experi-
Collins, A. M., & Loftus, E. F. (1 975 ). A spreading mental Psychology: Learning, Memory, and Cog-
activation theory of semantic processing. Psy- nition, 2 3 , 71 –87.
chological Review, 82 , 407–428.
Gelman, S. A. (2000). The role of essentialism
Collins, A. M., & Quillian, M. R. (1 969). Re- in children’s concepts. In H. W. Reese (Ed.),
trieval time from semantic memory. Journal of Advances in child development and behavior
Verbal Learning and Verbal Behavior, 8, 240– (Vol. 27, pp. 5 5 –98). San Diego: Academic
247. Press.
Conrad, F. G., & Rips, L. J. (1 986). Concep-
Gelman, S. A. (2003 ). The essential child: Origins
tual combination and the given/new distinc-
of essentialism in everyday thought. Oxford, UK:
tion. Journal of Memory and Language, 2 5 , 25 5 –
Oxford University Press.
278.
Gelman, S. A., & Coley, J. D. (1 990). The impor-
Dennett, D. C. (1 971 ). Intensional systems. Jour-
tance of knowing a dodo is a bird: Categories
nal of Philosophy, 68, 87–1 06.
and inferences in 2-year-old children. Develop-
Diesendruck, G., & Gelman, S. A. (1 999). Do- mental Psychology, 2 6(5 ), 796–804.
main differences in absolute judgments of cate-
gory membership. Psychonomic Bulletin and Re- Gelman, S. A., & Hirschfeld, L. A. (1 999). How
view, 6, 3 3 8–3 46. biological is essentialism? In D. L. Medin &
S. Atran (Eds.), Folkbiology (pp. 403 –446).
Erickson, M. A., & Kruschke, J. K. (1 998). Rules
Cambridge, MA: MIT Press.
and exemplars in category learning. Journal of
Experimental Psychology: General, 1 2 7, 1 07– Gelman, S. A., & Koenig, M. A. (2001 ). The
1 40. role of animacy in children’s understanding of
“move.” Journal of Child Language, 2 8(3 ), 683 –
Farah, M. J., & McClelland, J. L. (1 991 ). A com-
701 .
putational model of semantic memory impair-
ment. Journal of Experimental Psychology: Gen- Gelman, S. A., Star, J. R., & Flukes, J. E. (2002).
eral, 1 2 0, 3 3 9–3 5 7. Children’s use of generics in inductive infer-
ence. Journal of Cognition and Development, 3 ,
Fiddick, L., Cosmides, L., & Tooby, J. (2000).
1 79–1 99.
No interpretation without representation: The
role of domain-specific representations and in- Gelman, S. A., & Wellman, H. M. (1 991 ). Insides
ferences in the Wason selection task. Cognition, and essence: Early understandings of the non-
77, 1 –79. obvious. Cognition, 3 8(3 ), 21 3 –244.
Fillmore, C. J., & Atkins, B. T. S. (2000). De- Gleitman, L. R., & Gleitman, H. (1 970). Phrase
scribing polysemy: The case of ‘crawl.’ In Y. and paraphrase. New York: W. W. Norton.
Ravin & C. Leacock (Eds.), Polysemy: Theoreti- Goldstone, R. L. (1 998). Perceptual learning. An-
cal and computational approaches (pp. 91 –1 1 0). nual Review of Psychology, 49, 5 85 –61 2.
Oxford, UK: Oxford University Press. Goldstone, R. L. (2003 ). Learning to perceive
Filoteo, J. V., Maddox, W. T., & Davis, J. D. while perceiving to learn. In R. Kimchi, M.
(2001 ). A possible role of the striatum in linear Behrmann, & C. Olson (Eds.), Perceptual or-
and nonlinear categorization rule learning: Evi- ganization in vision: Behavioral and neural
perspectives (pp. 23 3 –278). Mahwah, NJ: noun phrases. Developmental Psychology, 3 8,

Erlbaum. 883 –894.
Goldstone, R. L., Lippa, Y., & Shiffrin, Homa, D., Sterling, S., & Trepel, L. (1 981 ). Lim-
R. M. (2001 ). Altering object representations itations of exemplar based generalization and
through category learning. Cognition, 78, 27– the abstraction of categorical information. Jour-
43 . nal of Experimental Psychology: Human Learn-
Goldstone, R. L., & Rogosky, B. J. (2002). Using ing and Memory, 7, 41 8–43 9.
relations within conceptual systems to translate Inagaki, K. (1 997). Emerging distinctions be-
across conceptual systems. Cognition, 84, 295 – tween naive biology and naive psychology. In
3 20. H. M. Wellman & K. Inagaki (Eds.) The emer-
Goldstone, R. L., & Stevyers, M. (2001 ). The gence of core domains of thought (pp. 27–44). San
sensitization and differentiation of dimensions Francisco: Jossey-Bass.
during category learning. Journal of Experimen- Inagaki, K., & Hatano, G. (1 993 ). Young chil-
tal Psychology: General, 1 3 0, 1 1 6–1 3 9. dren’s understanding of the mind–body distinc-
Gopnik, A., & Wellman, H. M. (1 994). The tion. Child Development, 64, 1 5 3 4–1 5 49.
Theory Theory. In L. Hirschfeld & S. A. Inagaki, K., & Hatano, G. (1 996). Young chil-
Gelman (Eds.), Mapping the mind (pp. 25 7– dren’s recognition of commonalities between
293 ). Cambridge, UK: Cambridge University animals and plants. Child Development, 67,
Press. 2823 –2840.
Gutheil, G., & Rosengren, K. S. (1 996). A rose Johansen, M. J. & Palmeri, T. J. (in press).
by any other name: Preschoolers understanding Are there representational shifts in category
of individual identity across name and appear- learning? Cognitive Psychology.
ance changes. British Journal of Developmental Johnson, C., & Keil, F. (2000). Explanatory under-
Psychology, 1 4, 477–498. standing and conceptual combination. In F. C.
Hampton, J. (1 979). Polymorphous concepts in Keil & R. A. Wilson (Eds.), Explanation and
semantic memory. Journal of Verbal Learning cognition (pp. 3 27–3 5 9). Cambridge, MA: MIT
and Verbal Behavior, 1 8, 441 –461 . Press.
Hampton, J. (1 987). Inheritance of attributes Johnson, S. C., & Carey, S. (1 998). Knowledge
in natural concept conjunctions. Memory and enrichment and conceptual change in folkbiol-
Cognition, 1 5 , 5 5 –71 . ogy: Evidence from Williams syndrome. Cogni-
Hampton, J. (1 997). Conceptual combination. In tive Psychology, 3 7, 1 5 6–200.
K. Lamberts & D. Shanks (Eds.), Knowledge, Juslin, P., & Persson, M. (2002). Probabilities from
concepts, and categories (pp. 1 3 3 –1 5 9). Cam- Exemplars. Cognitive Science, 2 6, 5 63 –607.
bridge, MA: MIT Press. Kalish, C. W. (1 995 ). Graded membership in an-
Hastie, R., Schroeder, C., & Weber, R. (1 990). imal and artifact categories. Memory and Cog-
Creating complex social conjunction categories nition, 2 3 , 3 3 5 –3 5 3 .
from simple categories. Bulletin of the Psycho- Kalish, C. W. (2002). Essentialist to some degree.
nomic Society, 2 8, 242–247. Memory and Cognition, 3 0, 3 40–3 5 2.
Hatano, G., & Inagaki, K. (1 994). Young chil- Keil, F. C. (1 989). Concepts, Kinds, and Cognitive
dren’s naive theory of biology. Cognition, 5 0, Development. Cambridge, MA: MIT Press.
1 71 –1 88. Keil, F. C. (1 981 ). Constraints on knowledge and
Hatano, G., & Inagaki, K. (1 996). Cognitive and cognitive development. Psychological Review,
cultural factors in the acquisition of intuitive 88(3 ), 1 97–227.
biology. In D. Olson & N. Torrance (Eds.), The Keil, F. C. (1 994). The birth and nurturance of
handbook of education and human development concepts by domains: The origins of concepts
(pp. 63 8–708). Malden, MA: Blackwell. of living things. In L. A. Hirschfeld & S. A.
Hintzman, D. L. (1 978). The psychology of learning Gelman (Eds.), Mapping the mind: Domain
and memory. San Francisco: Freeman. specificity in cognition and culture. (pp. 23 4–
Hintzman, D. L. (1 986). ‘Schema abstraction’ in 25 4). Cambridge, UK: Cambridge University
a multiple-trace memory model. Psychological Press.
Review, 93 , 41 1 –428. Keil, F. C. (1 995 ). The growth of causal un-
Hollander, M. A., Gelman, S. A., & Star, J. derstanding of natural kinds. In D. Sperber,
(2002). Children’s interpretation of generic D. Premack, & A. J. Premack (Eds.), Causal
cognition (pp. 23 4–262). Oxford, UK: Oxford representation of knowledge. Psychological Re-
University Press. view, 1 04, 21 1 –240.
Keil, F. C., Levin, D. T., Richman, B. A., & Leslie, A. M. (1 987). Pretense and representation:
Gutheil, G. (1 999). Mechanism and explana- The origins of “theory of mind.” Psychological
tion in the development of biological thought: Review, 94, 41 2–426.
The case of disease. In D. Medin & S. Atran Liittschwager, J. C. (1 995 ). Children’s reasoning
(Eds.), Folkbiology (pp. 285 –3 1 9). Cambridge, about identity across transformations. Disser-
MA: MIT Press. tation Abstracts International, 5 5 (1 0), 4623 B.
Kellman, P. J., & Spelke, E. S. (1 983 ). Perception (UMI No. 95 083 99).
of partly occluded objects in infancy. Cognitive Love, B. C., Markman, A. B., & Yamauchi, T.
Psychology, 1 5 , 483 –5 24. (2000). Modeling inference and classification
Kim, N. S., & Ahn, W. K. (2002a). Clinical learning. The national conference on artificial in-
psychologists’ theory-based representations of telligence (AAAI-2 000), 1 3 6–1 41 .
mental disorders predict their diagnostic rea- Love, B. C., Medin, D. L., & Gureckis, T. M.
soning and memory. Journal of Experimental (2004). SUSTAIN: A network model of cat-
Psychology: General, 1 3 1 , 45 1 –476. egory learning. Psychological Review, 1 1 1 , 3 09–
Kim, N. S., & Ahn, W. K. (2002b). The influence 3 3 2.
of naive causal theories on concepts of men- Lucas, M. (2000). Semantic priming without as-
tal illness. American Journal of Psychology, 1 1 5 , sociation. Psychonomic Bulletin and Review, 7,
3 3 –65 . 61 8–63 0.
Klein, D. E., & Murphy, G. L. (2002). Paper has Lyons, J. (1 977). Semantics (Vol. 2). Cambridge,
been my ruin: Conceptual relations of polyse- UK: Cambridge University Press.
mous senses. Journal of Memory and Language, Maddox, W. T. (1 999). On the dangers of averag-
47, 5 48–5 70. ing across observers when comparing decision
Knapp, A. G., & Anderson, J. A. (1 984). Theory bound models and generalized context models
of categorization based on distributed mem- of categorization. Perception and Psychophysics,
ory storage. Journal of Experimental Psychology: 61 , 3 5 4–3 75 .
Learning, Memory, and Cognition, 1 0, 61 6– Maddox, W. T. (2002). Learning and atten-
63 7. tion in multidimensional identification, and
Knowlton, B. J., Mangels, J. A., & Squire, L. R. categorization: Separating low-level percep-
(1 996). A neostriatal habit learning system in tual processes and high level decisional pro-
humans. Science, 2 73 , 1 3 99–1 402. cesses. Journal of Experimental Psychology:
Knowlton, B. J., & Squire, L. R. (1 993 ). The learn- Learning, Memory, and Cognition, 2 8, 99–
ing of categories: Parallel brain systems for item 115.
memory and category knowledge. Science, 2 62 , Maddox, W. T., & Ashby, F. G. (1 993 ). Com-
1 747–1 749. paring decision bound and exemplar models of
Krifka, M., Pelletier, F. J., Carlson, G. N., ter categorization. Perception & Psychophysics, 5 3 ,
Meulen, A., Link, G., & Chierchia, G. (1 995 ). 49–70.
Genericity: An introduction. In G. N. Carlson Maddox, W. T., & Ashby, F. G. (1 998). Se-
& F. J. Pelletier (Eds.), The generic book (pp. 1 – lective attention and the formation of linear
1 24). Chicago: University of Chicago Press. decision boundaries: Comment on McKinley
Kruschke, J. K. (1 992). ALCOVE: An exemplar and Nosofsky (1 996). Journal of Experimental
based connectionist model of category learning. Psychology: Human Perception and Performance,
Psychological Review, 99, 22–44. 2 4, 3 01 –3 21 .
Kunda, Z., Miller, D. T., & Claire, T. (1 990). Malt, B. C. (1 994). Water is not H-sub-2O. Cog-
Combining social concepts: The role of causal nitive Psychology, 2 7(1 ), 41 –70.
reasoning. Cognitive Science, 1 4, 5 5 1 –5 77. Malt, B. C., Ross, B. H., & Murphy, G. L. (1 995 ).
Lamberts, K. (1 995 ). Categorization under time Predicting features for members of natural cat-
pressure. Journal of Experimental Psychology: egories when categorization is uncertain. Jour-
General, 1 2 4, 1 61 –1 80. nal of Experimental Psychology: Learning, Mem-
Landauer, T. K., & Dumais, S. T. (1 997). A so- ory, and Cognition, 2 1 , 646–661 .
lution to Plato’s problem: The latent semantic Malt, B. C., Sloman, S. A., Gennari, S., Shi,
analysis theory of acquisition, induction, and M., & Wang, Y. (1 999). Knowing vs. naming:
Similarity and the linguistic categorization of B. H. Ross (Ed.), The psychology of learning and
artifacts. Journal of Memory and Language, 40, motivation: Advances in research and theory, (pp.
23 0–262. 1 –41 ). San Diego: Academic Press.
Malt, B. C., & Smith, E. E. (1 984). Corre- Medin, D. L., & Schaffer, M. M. (1 978). Context
lated properties in natural categories. Journal theory of classification learning. Psychological
of Verbal Learning & Verbal Behavior, 2 , 25 0– Review, 85 , 207–23 8.
269. Medin, D. L., & Shoben, E. J. (1 988). Context and
Markman, A. B., & Makin, V. S. (1 998). Ref- structure in conceptual combination. Cognitive
erential communication and category acquisi- Psychology, 2 0, 1 5 8–1 90.
tion. Journal of Experimental Psychology: Gen- Meyer, D. E., & Schvaneveldt, R. W. (1 971 ). Fa-
eral, 1 2 7, 3 3 1 –3 5 4. cilitation in recognizing pairs of words: Evi-
Markman, E. M. (1 990). Constraints children dence of a dependence between retrieval oper-
place on word meaning. Cognitive Science, 1 4, ations. Journal of Experimental Psychology, 90,
5 7–77. 227–23 4.
McCloskey, M. (1 983 ). Naive theories of mo- Minda, J. P., & Smith, J. D. (2001 ). Prototypes
tion. In D. Gentuer & A. L. Stevens (Eds.), in category learning. Journal of Experimental
Mental models (pp. 299–3 24). Hillsdale, NJ: Psychology: Learning, Memory, & Cognition, 2 7,
Erlbaum. 775 –799.
McCloskey, M., & Glucksberg, S. (1 979). Deci- Murphy, G. L. (1 988). Comprehending complex
sion processes in verifying category member- concepts. Cognitive Science, 1 2 , 5 29–5 62.
ship statements: Implications for models of se- Murphy, G. L. (2002). The big book of concepts.
mantic memory. Cognitive Psychology, 1 1 , 1 – Cambridge, MA: MIT Press.
3 7. Murphy, G. L., & Ross, B. H. (1 994). Predictions
McKinley, S. C., & Nosofsky, R. M. (1 995 ). In- from uncertain categorizations. Cognitive Psy-
vestigations of exemplar and decision bound chology, 2 7, 1 48–1 93 .
models in large, ill defined category structures. Nisbett, R., Peng, K., Choi, I., & Norenzayan, A.
Journal of Experimental Psychology: Human Per- (2001 ). Culture and systems of thought: Holis-
ception and Performance, 2 1 , 1 28–1 48. tic vs. analytic cognition. Psychological Review,
Medin, D. (1 989). Concepts and conceptual 1 08, 291 –3 1 0.
structures. American Psychologist, 45 , 1 469– Nisbett, R. E., & Norenzayan, A. (2002). Cul-
1 481 . ture and cognition. In H. Pashler & D. Medin
Medin, D. L., Altom, M. W., Edelson, S. M., & (Eds.), Strevens’ handbook of experimental psy-
Freko, D. (1 982). Correlated symptoms and chology, Vol. 2 : Memory and cognitive pro-
simulated medical classification. Journal of Ex- cesses (3 rd ed., pp. 5 61 –5 97). New York:
perimental Psychology: Learning, Memory, and Wiley.
Cognition, 8, 3 7–5 0. Norman, D. A., & Rumelhart, D. E. (1 975 ). Ex-
Medin, D. L., & Coley, J. D. (1 998). Concepts and plorations in cognition. San Francisco: W. H.
categorization. In J. Hochberg (Ed.), Handbook Freeman.
of perception and cognition. Perception and cogni- Norman, G. R., Brooks, L. R., Coblentz, C. L., &
tion at century’s end: History, philosophy, theory Babcock, C. J. (1 992). The correlation of tea-
(pp. 403 –43 9). San Diego: Academic Press. ture identification and category judgements in
Medin, D. L., Lynch, E. B., & Solomon, K. O. diagnostic radiology. Memory & Cognition, 2 0,
(2000). Are there kinds of concepts? Annual 3 44–3 5 5 .
Review of Psychology, 5 1 , 1 21 –1 47. Nosofsky, R. M. (1 986). Attention, similarity,
Medin, D. L., & Ortony, A. (1 989). Psy- and the identification–categorization relation-
chological essentialism. In S. Vosniadou & ship. Journal of Experimental Psychology: Gen-
A. Ortony (Eds.), Similarity and analogical eral, 1 1 5 , 3 9–5 7.
reasoning (pp. 1 79–1 95 ). New York: Cambridge Nosofsky, R. M. (1 991 ). Tests of an exemplar
University Press. model for relation perceptual classification and
Medin, D. L., Ross, N., Atran, S., Burnett, R. C., recognition in memory. Journal of Experimental
& Blok, S. V. (2 002). Categorization and rea- Psychology: Human Perception and Performance,
soning in relation to culture and expertise. In 1 7, 3 –27.
Nosofsky, R. M. (1 998). Dissociations between Palmeri, T. J., & Flanery, M. A. (1 999). Learn-
categorization and recognition in amnesic ing about categories in the absence of training:
and normal individuals: An exemplar-based Profound amnesia and the relationship be-
interpretation. Psychological Science, 9, 247– tween perceptual categorization and recogni-
25 5 . tion memory. Psychological Science, 1 0, 5 26–
Nosofsky, R. M., Clark, S. E., & Shin, H. J. (1 989). 5 3 0.
Rules and exemplars in categorization, identifi- Palmeri, T. J., & Flanery, M. A. (2002). Memory
cation, and recognition. Journal of Experimental systems and perceptual categorization. In B. H.
Psychology: Learning, Memory, and Cognition, Ross (Ed.), The psychology of learning and moti-
1 5 , 282–3 04. vation (Vol. 41 , pp. 1 41 –1 89). San Diego: Aca-
Nosofsky, R. M., & Johansen, M. K. (2000). Ex- demic Press.
emplar based accounts of “multiple system” Partee, B. H. (1 995 ). Lexical semantics and com-
phenomena in perceptual categorization. Psy- positionality. In L. R. Gleitman & M. Liber-
chonomic Bulletin and Review, 7, 3 75 –402. man (vol. eds.) & D. N. Osherson (series ed.),
Nosofsky, R. M., & Palmeri, T. J. (1 997a). An ex- Invitation to cognitive science (Vol. 1 : Language,
emplar based random walk model of speeded pp. 3 1 1 –3 60). Cambridge, MA: MIT Press.
classification. Psychological Review, 1 04, 266– Pinango, M. M., Zurif, E., & Jackendoff, R.
3 00. (1 999). Real-time processing implications of
Nosofsky, R. M., & Palmeri, T. J. (1 997b). enriched composition at the syntax–semantics
Comparing exemplar retrieval and decision- interface. Journal of Psycholinguistics Research,
bound models of speeded perceptual classifi- 2 8, 3 95 –41 4.
cation. Perception and Psychophysics, 5 9, 1 027– Posner, M. I., & Keele, S. W. (1 968). On the gen-
1 048. esis of abstract ideas. Journal of Experimental
Nosofsky, R. M., Palmeri, T. J., & McKinley, S. C. Psychology, 77, 3 5 3 –3 63 .
(1 994). Rule-plus-exception model of classifi- Posner, M. I., & Keele, S. W. (1 970). Retention of
cation learning. Psychological Review, 1 01 , 5 3 – abstract ideas. Journal of Experimental Psychol-
79. ogy, 83 , 3 04–3 08.
Nosofsky, R. M., & Zaki, S. R. (1 998). Dissoci- Quillian, M. R. (1 967). Word concepts: A theory
ations between categorization and recognition and simulation of some basic semantic capabil-
in amnesic and normal individuals: An exem- ities. Behavioral Sciences, 1 2 , 41 0–43 0.
plar based interpretation. Psychological Science, Quillian, M. R. (1 969). The teachable language
9, 247–25 5 . comprehender: A simulation program and the-
Nosofsky, R. M., & Zaki, S. R. (2002). Ex- ory of language. Communications of the ACM,
emplar and prototype models revisited: Re- 1 2 , 45 9–476.
sponse strategies, selective attention, and stim- Reagher, G., & Brooks, L. R. (1 993 ). Perceptual
ulus generalization. Journal of Experimental manifestations of an analytic structure: The pri-
Psychology: Learning, Memory, and Cognition, ority of holistic individuation. Journal of Exper-
2 8, 924–940. imental Psychology: General, 1 2 2 , 92–1 1 4.
Osherson, D. N., & Smith, E. E. (1 981 ). On the Reber, P. J., Stark, C. E. L., & Squire, L. R.
adequacy of prototype theory as a theory of (1 998a). Cortical areas supporting category
concepts. Cognition, 1 1 , 3 5 –5 8. learning identified using functional MRI. Pro-
Osherson, D. N., & Smith, E. E. (1 982). Grad- ceedings of the National Academy of Sciences of
edness and conceptual combination. Cognition, the USA, 95 , 747–75 0.
1 2 , 299–3 1 8. Reber, P. J., Stark, C. E. L., & Squire, L. R.
Palmeri, T. J. (1 997). Exemplar similarity and the (1 998b). Contrasting cortical activity asso-
development of automaticity. Journal of Experi- ciated with category memory and recogni-
mental Psychology: Learning, Memory, and Cog- tion memory. Learning and Memory, 5 , 420–
nition, 2 3 , 3 24–3 5 4. 428.
Palmeri, T. J. (1 999). Learning hierarchically Reed, S. K. (1 972). Pattern recognition and cate-
structured categories: A comparison of categorization. Cognitive Psychology, 3 , 3 82–407.
gory learning models. Psychonomic Bulletin and Rehder, B., & Hastie, R. (2001 ). The essence
Review, 6, 495 –5 03 . of categories: The effects of underlying causal
mechanisms on induction, categorization, and development of folkbiological induction. Cog-

similarity. Journal of Experimental Psychology: nitive Development, 1 8, 25 –47.
General, 1 3 0, 3 23 –3 60. Rozenblit, L., & Keil, F. (2002). The misunder-
Restle, F. (1 962). The selection of strategies in stood limits of folk science: An illusion of ex-
cue learning. Psychological Review, 69, 3 29– planatory depth. Cognitive Science, 2 6, 5 21 –
3 43 . 5 62.
Rips, L. J. (1 995 ). The current status of research Sartori, G., & Job, R. (1 988). The oyster with four
on concept combination. Mind and Language, legs. Cognitive Neurosychology, 5 , 1 05 –1 3 2.
1 0, 72–1 04. Schyns, P., Goldstone, R., & Thibaut, J. (1 998).
Rips, L. J. (2001 ). Necessity and natural cate- Development of features in object concepts.
gories. Psychological Bulletin, 1 2 7, 827–85 2. Behavioral and Brain Sciences, 2 1 , 1 –5 4.
Rips, L. J., Smith, E. E., & Shoben, E. J. (1 978). Schyns, P., & Rodet, L. (1 997). Categorization
Semantic composition in sentence verification. creates functional features. Journal of Experi-
Journal of Verbal Learning and Verbal Behavior, mental Psychology: Learning, Memory, and Cog-
1 7, 3 75 –401 . nition, 2 3 , 681 –696.
Rosch, E. (1 973 ). On the internal structure of per- Simmons, W. K., & Barsalau, L. W. (2003 ).
ceptual and semantic categories. In T. E. Moore The similarity-in-topography principle. Cogni-
(Ed.), Cognitive development and the acquisition tive Neurosychology, 2 0, 45 1 –486.
of language (pp. 1 1 1 –1 44). New York: Academic Sloman, S. A., & Malt, B. (2003 ). Artifacts are
Press. not ascribed essences, nor are they treated as
Rosch, E. (1 978). Principles of categorization. belonging to kinds. Language and Cognitive Pro-
In E. Rosch & B. B. Lloyd (Eds.), Cognition cesses, 1 8, 5 63 –5 82.
and categorization (pp. 27–48). Hillsdale, NJ: Sloman, S. A., Love, B. C., & Ahn, W.-K. (1 998).
Erlbaum. Feature centrality and conceptual coherence.
Rosch, E., & Mervis, C. B. (1 975 ). Family Cognitive Science, 2 2 (2), 1 89–228.
resemblances: Studies in the internal struc- Smith, E. E., & Medin, D. L. (1 981 ). Categories
ture of categories. Cognitive Psychology, 7, 5 73 – and concepts. Cambridge, MA: Harvard Univer-
605 . sity Press.
Rosch, E., Mervis, C. B., Gray, W. D., Johnson, Smith, E. E., Osherson, D. N., Rips, L. J., &
D. M., & Boyes-Braem, P. (1 976). Basic objects Keane, M. (1 988). Combining prototypes: A
in natural categories. Cognitive Psychology, 8, selective modification model. Cognitive Science,
3 82–43 9. 1 2 , 485 –5 27.
Rosengren, K. S., Gelman, S. A., Kalish, G. W., & Smith, E. E., Shoben, E. J., & Rips, L. J. (1 974).
McCormick, M. (1 995 ). As time goes by. Child Structure and process in semantic memory: A
Development, 62 , 1 3 02–1 3 20. featural model for semantic decisions. Psycho-
Ross, B. H. (1 997). The use of categories affects logical Review, 81 , 21 4–241 .
classification. Journal of Memory and Language, Smith, J. D., & Minda, J. P. (1 998). Prototypes
3 7, 240–267. in the mist: The early epochs of category
Ross, B. H. (1 999). Postclassification category use: learning. Journal of Experimental Psychology:
The effects of learning to use categories af- Learning, Memory and Cognition, 2 4, 1 41 1 –
ter learning to classify. Journal of Experimental 1 43 6.
Psychology: Learning, Memory, and Cognition, Smith, J. D., & Minda, J. P. (2000). Thirty cate-
2 5 , 743 –75 7. gorization results in search of a model. Journal
Ross, B. H. (2000). The effects of category use on of Experimental Psychology: Learning, Memory,
learned categories. Memory and Cognition, 2 8, and Cognition, 2 6, 3 –27.
5 1 –63 . Smith, J. D., Murray, M. J., Jr., & Minda, J. P.
Ross, B. H., & Murphy, G. L. (1 996). Category (1 997). Straight talk about linear seperability.
based predictions: Influence of uncertainty and Journal of Experimental Psychology: Learning,
feature associations. Journal of Experimental Memory, and Cognition, 2 3 , 65 9–680.
Psychology: Learning, Memory, and Cognition, Smith, L. B., Jones, S. S., & Landau, B. (1 996).
2 2 , 73 6–75 3 . Naming in young children: A dumb atten-
Ross, N., Medin, D., Coley, J. D., Atran, S. (2003 ). tional mechanism? Cognition, 60(2), 1 43 –
Cultural and experimental differences in the 1 71 .
Smith, L. B., Jones, S. S., Yoshida, H., & Colunga, ganization of memory (pp. 3 81 –403 ). New York:
E. (2003 ). Whose DAM account? Attentional Academic Press.
learning explains Booth and Waxman. Cogni- Tulving, E. (1 984). Precis of elements of episodic
tion, 87(3 ), 209–21 3 . memory. Behavioral and Brain Sciences, 7, 223 –
Sober, E. (1 980). Evolution, population thinking, 268.
and essentialism. Philosophy of Science, 47, 3 5 0– Tversky, A. (1 977). Features of similarity. Psycho-
3 83 . logical Review, 84, 3 27–3 5 2.
Solomon, G. E. A., Johnson, S. C., Zaitchik, D., Verguts, T., Storms, G., & Tuerlinckx, F. (2001 ).
& Carey, S. (1 996). Like father like son. Child Decision bound theory and the influence of
Development, 67, 1 5 1 –1 71 . familiarity. Psychonomic Bulletin and Review,
Solomon, K. O., Medin, D. L., & Lynch, E. B. 1 0, 1 41 –1 48.
(1 999). Concepts do more than categorize. Wellman, H. M. (1 990). The child’s theory of mind.
Trends in Cognitive Science, 3 , 99–1 05 . Cambridge, MA: MIT Press.
Sousa, P., Atran, S., & Medin, D. (2002). Essen- Wellman, H. M., & Gelman, S. A. (1 992). Cog-
tialism and folkbiology: Evidence from Brazil. nitive development: Foundational theories of
Journal of Cognition and Culture, 2 , 1 95 – core domains. Annual Review of Psychology, 43 ,
223 . 3 3 7–3 75 .
Spelke, E. S. (1 990). Principles of object percep- Wiggins, D. (1 980). Sameness and sub-
tion. Cognitive Science, 1 4, 29–5 6. stance. Cambridge, MA: Harvard University
Spelke, E. S., Phillips, A., & Woodward, A. L. Press.
(1 995 ). Infants’ knowledge of object motion Wilcox, T., & Baillargeon, R. (1 998). Object in-
and human section. In D. Sperber, D. Premack, dividuation in infancy: The use of featural in-
& A. J. Premack (Eds.), Causal Cognition formation in reasoning about occlusion events.
(pp. 44–78). New York: Oxford University Cognitive Psychology, 3 7, 97–1 5 5 .
Press. Wisniewski, E. J. (1 997). When concepts com-
Springer, K., & Keil, F. C. (1 989). On the develop- bine. Psychonomic Bulletin and Review, 4, 1 67–
ment of biologically specific beliefs: The case 1 83 .
of inheritance. Child Development, 60, 63 7– Wisniewski, E. J. (2002). Concepts and cat-
648. egorization. In D. L. Medin (Ed.), Steven’s
Springer, K., & Keil, F. C. (1 991 ). Early differ- handbook of experimental psychology (3 rd ed.,
entiation of causal mechanisms appropriate to pp. 467–5 3 2). Wiley: New York.
biological and nonbiological kinds. Child Devel- Wisniewski, E. J., & Medin, D. L. (1 994).
opment, 62 , 767–781 . On the interaction of theory and data in
Stanton, R., Nosofsky, R. M., & Zaki, S. (2002). concept learning. Cognitive Science, 1 8, 221 –
Comparisons between exemplar similarity and 281 .
mixed prototype models using a linearly sepa- Wolff, P., Medin, D., & Pankratz, C. (1 999). Evo-
rable category structure. Memory and Cogni- lution and devolution of folkbiological knowl-
tion, 3 0, 93 4–944. edge. Cognition, 73 , 1 77–204.
Storms, G., de Boeck, P., van Mechelen, I., & Wolff, P., & Song, G. (2003 ). Models of causa-
Ruts, W. (1 998). No guppies, nor goldfish, but tion and the semantics of causal verbs. Cogni-
tumble dryers, Noriega, Jesse Jackson, panties, tive Psychology, 47, 241 –275 .
car crashes, bird books, and Stevie Wonder.
Xu, F. (2003 ). The development of object indi-
Memory and Cognition, 2 6, 1 43 –1 45 .
viduation in infancy. In J. Fagen & H. Hayne
Strevens, M. (2000). The essentialist aspect of (Eds.), Progress in infancy research (Vol. 3 ,
naive theories. Cognition, 74, 1 49–1 75 . pp. 1 5 9–1 92). Mahwah, NJ: Erlbaum.
Strevens, M. (2001 ). Only causation matters. Cog- Xu, F., & Carey, S. (1 996). Infants’ metaphysics:
nition, 82 , 71 –76. The case of numerical identity. Cognitive Psy-
Toulmin, S. (1 95 8). The uses of argument. Cam- chology, 3 0, 1 1 1 –1 5 3 .
bridge, UK: Cambridge University Press. Yamauchi, T., Love, B. C., & Markman, A. B.
Trabasso, T., & Bower, G. H. (1 968). Attention in (2002). Learning non-linearly separable cate-
learning. New York: Wiley. gories by inference and classification. Journal
Tulving, E. (1 972). Episodic and semantic mem- of Experimental Psychology: Learning, Memory,
ory. In E. Tulving & W. Donaldson (Eds.), Or- and Cognition, 2 8(3 ), 5 85 –5 93 .
Yamauchi, T., & Markman, A. B. (1 998). Category Yamauchi, T., & Markman, A. B. (2000b). Learn-
learning by inference and classification. Journal ing categories composed of varying instances:
of Memory and Language, 3 9, 1 24–1 49. The effect of classification, inference and struc-
Yamauchi, T., & Markman, A. B. (2000a). Infer- tural alignment. Memory & Cognition, 2 8, 64–
ence using categories. Journal of Experimental 78.
Psychology: Learning, Memory and Cognition, Zadeh, L. (1 965 ). Fuzzy sets. Information and
2 6, 776–795 . Control, 8, 3 3 8–3 5 3 .
CHAPTER 4
Approaches to Modeling Human Mental

Representations: What Works, What
Doesn’t, and Why
Leonidas A. A. Doumas
John E. Hummel
Relational Thinking Relational thinking is so commonplace

that it is easy to assume the psychologi-
A fundamental aspect of human intelligence cal mechanisms underlying it are relatively
is the ability to acquire and manipulate simple. They are not. The capacity to form
relational concepts. Examples of relational and manipulate relational representations
thinking include our ability to appreciate appears to be a late evolutionary develop-
analogies between seemingly different ob- ment (Robin & Holyoak, 1 995 ) closely tied
jects or events (e.g., Gentner, 1 983 ; Gick to the increase in the size and complexity
& Holyoak, 1 980, 1 983 ; Holyoak & Tha- of the frontal cortex in the brains of higher
gard, 1 995 ; see Holyoak, Chap. 6), our abil- primates, especially humans (Stuss & Ben-
ity to apply abstract rules in novel situations son, 1 986). Relational thinking also devel-
(e.g., Smith, Langston, & Nisbett, 1 992), ops relatively late in childhood (see, e.g.,
our ability to understand and learn language Smith, 1 989; Halford, Chap. 22). Along with
(e.g., Kim, Pinker, Prince, & Prasada, 1 991 ), language, the human capacity for relational
and even our ability to appreciate percep- thinking is the major factor distinguish-
tual similarities (e.g., Goldstone, Medin, & ing human cognition from the cognitive
Gentner, 1 991 ; Hummel, 2000; Hummel & abilities of other animals (for reviews, see
Stankiewicz, 1 996; Palmer, 1 978; see Gold- Holyoak & Thagard, 1 995 ; Oden, Thomp-
stone & Son, Chap. 2). Relational thinking son, & Premack, 2001 ; Call & Tomasello,
is ubiquitous in human cognition, under- Chap. 25 ).
lying everything from the mundane (e.g.,
the thought “the mug is on the desk”) to
Relational Representations
the sublime (e.g., Cantor’s use of set the-
ory to prove that the cardinal number of the Central to understanding human relational
reals is greater than the cardinal number of thinking is understanding the nature of the
the integers). mental representations underlying it: How
73
does the mind represent relational ideas such even though its role differs across the two ex-
as “if every element of set A is paired with pressions. At the same time, relational repre-
a distinct element of set B, and there are sentations explicitly specify how arguments
still elements of B left over, then the car- are bound to relational roles. The relation
dinal number of B is greater than the cardi- “murders (Bill, Susan)” differs from “murders
nal number of A,” or even simple relations (Susan, Bill)” only in the binding of argu-
such as “John loves Mary” or “the mag- ments to relational roles, yet the two expres-
azine is next to the phone”? Two prop- sions mean very different things (especially
erties of human relational representations to Susan and Bill).
jointly make this apparently simple question The claim that formal representational
surprisingly difficult to answer (Hummel systems (e.g., propositional notation, mathe-
& Holyoak, 1 997): As elaborated in the matical notation) are symbolic is completely
next sections, human relational representa- uncontroversial. In contrast, the claim that
tions are both symbolic and semantically rich. human mental representations are symbolic
Although these properties are straightfor- is highly controversial (for reviews, see
ward to account for in isolation, account- Halford et al., 1 998; Hummel & Holyoak,
ing for both together has proven much 1 997, 2003 a; Marcus, 1 998, 2001 ). The best-
more challenging. known argument for the role of symbolic
representations in human cognition – the ar-
gument from systematicity – was made by
relational representations are symbolic Fodor and Pylyshyn (1 988). They observed
A symbolic representation is one that rep- that knowledge is systematic in the sense
resents relations explicitly and specifies the that the ability to think certain thoughts
arguments to which they are bound. Rep- seems to imply the ability to think related
resenting relations explicitly means having thoughts. For example, a person who un-
primitives (i.e., symbols, nodes in a network, derstands the concepts “John,” “Mary,” and
neurons) that correspond specifically to rela- “loves,” and can understand the statement
tions and/or relational roles. This definition “John loves Mary,” must surely be able to
of “explicit,” which we take to be uncontro- understand “Mary loves John.” This prop-
versial (see also Halford et al., 1 998; Holland erty of systematicity, they argued, demon-
et al., 1 986; Newell, 1 990), implies that rela- strates that human mental representations
tions are represented independently of their are symbolic. Fodor and Pylyshyn’s argu-
arguments (Hummel & Biederman, 1 992; ments elicited numerous responses from
Hummel & Holyoak, 1 997, 2003 a). That is, the connectionist community claiming to
the representation of a relation cannot vary achieve or approximate systematicity in
as a function of the arguments it happens to nonsymbolic (e.g., traditional connection-
take at a given time, and the representation ist) architectures (for a recent example, see
of an argument cannot vary across relations Edelman & Intrator, 2003 ). At the same
or relational roles.1 time, however, Fodor and Pylyshyn’s defi-
Some well-known formal representa- nition of “systematicity” is so vague that it
tional systems that meet this requirement in- is difficult or impossible to evaluate these
clude propositional notation, labeled graphs, claims of “systematicity achieved or approx-
mathematical notation, and computer pro- imated” (van Gelder & Niklasson, 1 994; for
gramming languages (among many others). an example of the kind of confusion that
For example, the relation murders is repre- has resulted from the attempt to approxi-
sented in the same way (and means the same mate systematicity, see Edelman & Intrator,
thing) in the proposition murders (Bill, Su- 2003 , and the reply by Hummel, 2003 ). The
san) as it is in the proposition murders (Sally, concept of “systematicity” has arguably done
Robert), even though it takes different ar- more to cloud the debate over the role of
guments across the two expressions. Like- symbolic representations in human cogni-
wise, “2” means the same thing in x2 as in 2 x , tion than to clarify it.
approaches to modeling human mental representations 75
We propose that a clearer way to de- statements differ. We take the human abil-
fine symbolic competence is in terms of the ity to appreciate these similarities and differ-
ability to appreciate what different bind- ences as strong evidence that the represen-
ings of the same relational roles and fillers tations underlying human relational thinking
have in common and how they differ (see are symbolic.
also Garner, 1 974; Hummel, 2000; Hummel
& Holyoak, 1 997, 2003 a; Saiki & Hummel, relational representations are
1 998). Under this definition, what matters semantically rich
is the ability to appreciate what “John loves The second fundamental property of hu-
Mary” has in common with “Mary loves man relational representations, and human
John” (i.e., the same relations and argu- mental representations more broadly, is that
ments are involved) and how they differ (i.e., they are semantically rich. It means some-
the role-filler bindings are reversed). It does thing to be a lover or a murderer, and the
not strictly matter whether you can “under- human mental representation of these rela-
stand” the statements, or even whether they tions makes this meaning explicit. As a re-
make any sense. What matters is that you sult, there is an intuitive sense in which loves
can evaluate them in terms of the relations (John, Mary) is more like likes (John, Mary)
among their components. This same ability than murders (John, Mary). Moreover, the
allows you to appreciate how “the glimby meanings of various relations seem to ap-
jolls the ronket” is similar to and differ- ply specifically to individual relational roles
ent from “the ronket jolls the glimby,” even rather than to relations as indivisible wholes.
though neither statement inspires much by For example, it is easy to appreciate that the
way of understanding. To gain a better ap- agent (i.e., killer) role of murders (x, y) is
preciation of the abstractness of this ability, similar to the agent role of attempted-murder
note that the ronket and glimby may not (x, y) even though the patient roles dif-
even be organisms (as we suspect most read- fer (i.e., the patient is dead in the former
ers initially assume they are) but may instead case but not the latter), and the patient role
be machine parts, mathematical functions, of murder (x, y) is like the patient role of
plays in a strategy game, or anything else that manslaughter (x, y) even though the agent
can be named. roles differ (i.e., the act is intentional in the
This definition of symbolic competence former case but not the latter).
admits to more objective evaluation than The semantic richness of human rela-
does systematicity: one can empirically eval- tional representations is also evidenced by
uate, for any f, x, and y, whether someone their flexibility (Hummel & Holyoak, 1 997).
knows what f (x, y) has in common with and Given statements such as taller-than (Abe,
how it differs from f (y, x). It is also important Bill), tall (Charles), and short (Dave), it is
because it relates directly to what we take to easy to map Abe onto Charles and Bill onto
be the defining property of a symbolic (i.e., Dave even though doing so requires the rea-
explicitly relational) representation: namely, soner to violate the “n-ary restriction” (i.e.,
as noted previously, the ability to represent mapping the argument(s) and role(s) of an
relational roles independently of their argu- n-place predicate onto those of an m-place
ments and to simultaneously specify which predicate, where m = n). Given shorter-than
roles are bound to which arguments (see also (Eric, Fred), it is also easy to map Eric onto
Hummel, 2000, 2003 ; Hummel & Holyoak, Bill (and Dave) and Fred onto Abe (and
1 997, 2003 a). It is the independence of roles Charles). These mappings are based on the
and fillers that allows one to appreciate that semantics of individual roles, rather than, for
the glimby in “the glimby jolls the ronket” is instance, the fact that taller-than and shorter-
the same thing as the glimby in “the ron- than are logical opposites: The relation loves
ket jolls the glimby”; and it is the ability (x, y) is in some sense the opposite of hates
to explicitly bind arguments to relational (x, y) [or if you prefer, not-loves (x, y)] but
roles that allows one to know how the two in contrast to taller-than and shorter-than in
which the first role of one relation maps to man mental representations. We review tra-
the second role of the other, the first role ditional symbolic approaches to mental rep-
of loves (x, y) maps to the first role of hates resentation, traditional distributed connec-
(x, y) [or not-loves (x, y)]. The point is that tionist approaches, conjunctive distributed
the similarity and/or mappings of various re- connectionist approaches (based on tensor
lational roles are idiosyncratic and based not products and their relatives), and an ap-
on the formal syntax of propositional nota- proach based on dynamic binding of dis-
tion, but on the semantic content of the indi- tributed and localist connectionist represen-
vidual roles in question. The semantics of re- tations into symbolic structures.
lational roles matter and are an explicit part
of the mental representation of relations.
The semantic properties of relational
Approaches to Modeling Human
roles manifest themselves in numerous other
Mental Representation
ways in human cognition. For example, they
influence both memory retrieval (e.g., Gen-
Symbol-Argument-Argument Notation
tner, Ratterman, & Forbus, 1 993 ; Ross, 1 987;
Wharton, Holyoak, & Lange, 1 996) and our The dominant approach to modeling rela-
ability to discover structurally appropriate tional representations in the computational
analogical mappings (Bassok, Wu, & Olseth, literature is based on propositional notation
1 995 ; Krawczyk, Holyoak, & Hummel, in and formally equivalent systems (including
press; Kubose, Holyoak, & Hummel, 2002; varieties of labeled graphs and high-rank ten-
Ross, 1 987). They also influence which in- sor representations). These representational
ferences seem plausible from a given collec- systems – which we refer to collectively
tion of stated facts. For instance, upon learn- as symbol-argument-argument notation,
ing about a culture in which nephews tra- or “SAA” – borrow conventions directly
ditionally give their aunts a gift on a par- from propositional calculus and are com-
ticular day of the year, it is a reasonable monly used in symbolic models based on
conjecture that there may also be a day on production systems (see Lovett & Anderson,
which nieces in this culture give their uncles Chap. 1 7, for a review), many forms of graph
gifts. This inference is based on the seman- matching (e.g., Falkenhainer et al., 1 989;
tic similarity of aunts to uncles and nieces Keane et al., 1 994) and related algorithms.
to nephews, and on the semantics of gift SAA represents relations and their argu-
giving, not the syntactic properties of the ments as explicit symbols and represents the
give-gift relation. bindings of arguments to relational roles in
In summary, human mental representa- terms of the locations of the arguments in
tions are both symbolic (i.e., they explic- the relational expression. For example, in the
itly represent relations and the bindings of proposition loves (John, Mary), John is
relational roles to their fillers) and seman- bound to the lover role by virtue of appear-
tically rich (in the sense that they make ing in the first slot after the open paren-
the semantic content of individual relational thesis, and Mary to the beloved by virtue of
roles and their fillers explicit). A complete appearing in the second slot. Similarly, in a
account of human thinking must elucidate labeled graph the top node (of the local sub-
how each of these properties can be achieved graph coding “John loves Mary”) represents
and how they work together. An account the loves relation, and the nodes directly be-
that achieves one property at the expense of low it represent its arguments with the bind-
the other is at best only a partial account of ings of arguments to roles captured, for ex-
human thinking. The next section reviews ample, by the order (left to right) in which
the dominant approaches to modeling hu- those arguments are listed. These schemes,
man mental representations, with an em- which may look different at first pass, are in
phasis on how each approach succeeds or fact isomorphic. In both cases, the relation
fails to capture these two properties of hu- is represented by a single symbol, and the
bindings of arguments to relational roles are meaning and other subtleties associated with
captured by the syntax of the notation (as list semantic content. This limitation was a cen-
position within parentheses, as the locations tral focus of the influential critiques of sym-
of nodes in a directed graph, etc.). bolic modeling presented by the connection-
Models based on SAA are meaningfully ists in the mid-1 980s (e.g., Rumelhart et al.,
symbolic in the sense described previously: 1 986). A review of how traditional symbolic
They represent relations explicitly (i.e., in- models have handled this problem (typi-
dependently of their arguments), and they cally with external representational systems
explicitly specify the bindings of relational such as lookup tables or matrices of hand-
roles to their arguments. This fact is no sur- coded “similarity” values between symbols;
prise, given that SAA is based on represen- see Lovett & Anderson, Chap. 1 7) also re-
tational conventions that were explicitly de- veals that the question of semantics in SAA
signed to meet these criteria. However, the is, in the very least, a thorny inconvenience
symbolic nature of SAA is nontrivial because (Hummel & Holyoak, 1 997). However, at
it endows models based on SAA with all the same time, it is tempting to assume it is
the advantages of symbolic representations. merely an inconvenience – that surely there
Most important, symbolic representations exists a relatively straightforward way to add
enable relational generalization – generaliza- semantic coding to propositional notation
tions that are constrained by the relational and other forms of SAA and that a solu-
roles that objects play, rather than simply tion will be found once it becomes impor-
the features of the objects themselves (see tant enough for someone to pay attention to
Holland et al., 1 986; Holyoak & Thagard, it. In the mean time, it is surely no reason to
1 995 ; Hummel & Holyoak, 1 997, 2003 a; abandon SAA as a basis for modeling human
Thompson & Oden, 2000). Relational gener- cognition.
alization is important because, among other However, it turns out that it is more than
things, it makes it possible to define, match, a thorny inconvenience: As demonstrated
and apply variablized rules. (It also makes by Doumas and Hummel (2004), it is logi-
it possible to make and use analogies, to cally impossible to specify the semantic con-
learn and use schemas, and ultimately to tent of relational roles within an SAA rep-
learn variablized rules from examples; see resentation. In brief, SAA representations
Hummel & Holyoak, 2003 a.) For example, cannot represent relational roles explicitly
with a symbolic representational system, it and simultaneously specify how they come
is possible to define the rule “if loves (x, y) together to form complete relations. The
and loves (y, z) and not [loves (y, x)], then reason for this limitation is that SAA repre-
jealous (x, z)” and apply that rule to any sentations specify role information only im-
x, y, and z that match its left-hand (“if ”) plicitly (see Halford et al., 1 998). Specify-
side. As elaborated shortly, this important ing this information explicitly requires new
capacity, which plays an essential role in hu- propositions, which must be related to the
man relational thinking, lies fundamentally original relational representation via a sec-
beyond the reach of models based on non- ond relation. In SAA, this results in a new
symbolic representations (Holyoak & Hum- relational proposition, which itself implies
mel, 2000; Hummel & Holyoak, 2003 a; role representations to which it must be re-
Marcus, 1 998). lated by a third relational proposition, and
Given the symbolic nature of SAA, it is no so forth, ad infinitum. In short, attempt-
surprise that it has figured so prominently in ing to use SAA to link relational roles to
models of relational thinking and symbolic their parent relations necessarily results in
cognition more generally (see Lovett & An- an infinite regress of nested “constituent of”
derson, Chap. 1 7). Less salient are the limi- relations specifying which roles belong to
tations of SAA. It has been known for a long which relations/roles (see Doumas & Hum-
time that SAA and related representational mel, 2004 for the full argument). As a result,
schemes have difficulty capturing shades of attempting to use SAA to specify how roles
form complete relations renders any SAA These representations are distributed in the
system ill-typed (i.e., inconsistent and/or sense that (1 ) any single concept is repre-
paradoxical; see, e.g., Manzano, 1 996). sented as a pattern (i.e., vector) of activa-
The result of this limitation is that SAA tion over many elements (“nodes” or “units”
systems are forced to use external (i.e., non- that are typically assumed to correspond
SAA) structures to represent the meaning of roughly to neurons or small collections of
symbols (or to approximate those meanings, neurons), and (2) any single element will par-
e.g., with matrices of similarity values) and ticipate in the representation of many differ-
external control systems (which themselves ent concepts.2 As a result, two patterns of ac-
cannot be based on SAA) to read the SAA, tivation will tend to be similar to the extent
access the external structures, and relate the that they represent similar concepts: In con-
two. Thus, it is no surprise that SAA-based trast to SAA, distributed connectionist rep-
models rely on lookup tables, similarity ma- resentations provide a natural basis for rep-
trices and so forth to specify how different resenting the semantic content of concepts.
relations and objects are semantically related Similar ideas have been proposed in the con-
to one another: It is not merely a conve- text of latent semantic analysis (Landauer
nience; it is a necessity. & Dumais, 1 997) and related mathemati-
This property of SAA sharply limits its cal techniques for deriving similarity metrics
utility as a general approach to modeling from the co-occurrence statistics of words in
human mental representations. In particu- passages of text (e.g., Lund & Burgess, 1 996).
lar, it means that the connectionist critiques In all these cases, concepts are represented
of the mid-1 980s were right: Not only do as vectors, and vector similarity is taken as
traditional symbolic representations fail to an index of the similarity of the corres-
represent the semantic content of the ideas ponding concepts.
they mean to express, but the SAA represen- Because distributed activation vectors
tations on which they are based cannot even provide a natural basis for capturing the sim-
be adapted to do so. The result is that SAA ilarity structure of a collection of concepts
is ill equipped, in principle, to address those (see Goldstone & Son, Chap. 2), connection-
aspects of human cognition that depend on ist models have enjoyed substantial success
the semantic content of relational roles and simulating various kinds of learning and gen-
the arguments that fill them (which, as sum- eralization (see Munakata & O’Reilly, 2003 ):
marized previously, amounts to a substan- Having been trained to give a particular
tial proportion of human cognition). This output (e.g., generate a specific activation
fact does not mean that models based on vector on a collection of output units) in
SAA (i.e., traditional symbolic models) are response to a given input (i.e., vector of
“wrong” but only that they are incomplete. activations on a collection of input units),
SAA is at best only a shorthand (a very connectionist networks tend to generalize
short hand) approximation of human mental automatically (i.e., activate an appropriate
representations. output vector, or a close approximation of
it) in response to new inputs that are similar
to trained inputs. In a sense, connectionist
Traditional Connectionist
representations are much more flexible than
Representations
symbolic representations based on varieties
In response to limitations of traditional sym- of SAA. Whereas models based on SAA re-
bolic models, proponents of connectionist quire predicates to match exactly in order to
models of cognition (see, e.g., Elman et al., treat them identically,3 connectionist mod-
1 996; Rumelhart et al., 1 986; St. John & Mc- els generalize more gracefully based on the
Clelland, 1 990; among many others) have degree of overlap between trained patterns
proposed that knowledge is represented not and new ones.
as discrete symbols that enter into symbolic In another sense, however, connectionist
expressions but as patterns of activation models are substantially less flexible than
distributed over many processing elements. symbolic models. The reason is that the
distributed representations used by tradi- ble input and output vectors; e.g., O’Reilly,
tional connectionist models are not sym- 2001 ), the resulting models are not capable
bolic in the sense defined previously. That is, of relational generalization (see Hummel &
they cannot represent relational roles inde- Holyoak, 1 997, 2003 a; Marcus, 1 998, 2001 ,
pendently of their fillers and simultaneously for detailed discussions of this point).
specify which roles are bound to which fillers A particularly clear example of the im-
(Hummel & Holyoak, 1 997, 2003 a). Instead, plications of this limitation comes from the
a network’s knowledge is represented as sim- story Gestalt model of story comprehension
ple vectors of activation. Under this ap- developed by St. John (1 992; St. John &
proach, relational roles (to the extent that McClelland, 1 990). In one computational
they are represented at all) are either repre- experiment (St. John, 1 992, simulation 1 ),
sented on separate units from their potential the model was first trained with 1 ,000,000
fillers (e.g., with one set of units for the lover short texts consisting of statements based on
role of the loves relation, another set for the 1 3 6 constituent concepts. Each story instan-
beloved role, a third set for John, a fourth tiated a script such as “<person> decided to
set for Mary, etc.), in which case the bind- go to <destination>; <person> drove <ve-
ings of roles to their fillers is left unspecified hicle> to <destination>” (e.g., “George de-
(i.e., simply activating all four sets of units cided to go to a restaurant; George drove a
cannot distinguish “John loves Mary” from Jeep to the restaurant”; “Harry decided to
“Mary loves John” or even from a statement go to the beach; Harry drove a Mercedes to
about a narcissistic hermaphrodite); or else the beach”).
units are dedicated to specific role-filler con- After the model had learned a network
junctions (e.g., with one set of units for “John of associative connections based on the
as lover” another for “John as beloved”, etc.; 1 ,000,000 examples, St. John tested its abil-
e.g., Hinton, 1 990), in which case the bind- ity to generalize by presenting it with a text
ings are specified, but only at the expense of containing a new statement, such as “John
role-filler independence (e.g., nothing rep- decided to go to the airport.” Although the
resents the lover or beloved roles, indepen- statement as a whole was new, it referred
dently of the argument to which they hap- to people, objects and places that had ap-
pen to be bound). In neither case are the peared in the examples used for training. St.
resulting representations truly symbolic. John reported that when given a new exam-
Indeed, some proponents of traditional ple about deciding to go to the airport, the
connectionist models (e.g., Elman et al., model would typically activate the restau-
1 996) – dubbed “eliminative connectionists” rant or the beach (i.e., the destinations in
by Pinker and Prince (1 988; see also Marcus, prior examples of the same script) as the
1 998) for their explicit desire to eliminate destination, rather than making the contex-
the need for symbolic representations from tually appropriate inference that the per-
models of cognition – are quite explicit in son would drive to the airport. This type
their rejection of symbolic representations as of error, which would appear quite unnat-
a component of human cognition. Instead of ural in human comprehension, results from
representing and matching symbolic “rules,” the model’s inability to generalize relation-
eliminative (i.e., traditional) connectionist ally (e.g., if a person wants to go location x,
models operate by learning to associate vec- then x will be the person’s destination – a
tors of features (where the features corre- problem that requires the system to repre-
spond to individual nodes in the network). sent the variable x and its value, indepen-
As a result, they are restricted to generaliz- dently of its binding to the role of desired
ing based on the shared features in the train- location or destination). As St. John noted,
ing set and the generalization set. Although “Developing a representation to handle role
the generalization capabilities of these net- binding proved to be difficult for the model”
works often appear quite impressive at first (1 992, p. 294).
blush (especially if the training set is judi- In general, although an eliminative con-
ciously chosen to span the space of all possi- nectionist model can make “inferences” on
which it has been directly trained (i.e., The deep reason the eliminative connec-
the model will remember particular associationist model illustrated in Figure 4.1 fails
tions that have been strengthened by learn- to learn the identity function is that it vio-
ing), the acquired knowledge may not gen- lates variable/value (i.e., role/filler) indepen-
eralize at all to novel instantiations that dence. The input and output units in Figure
lie outside the training set (Marcus, 1 998, 4.1 are intentionally mislabeled to suggest
2001 ). For example, having learned that Al- that they represent the concepts “1 ,” “2,” and
ice loved Sam, Sam loved Betty, and Al- so on. However, in fact, they do not repre-
ice was jealous of Betty, and told that John sent these concepts at all. Instead, the unit
loves Mary and Mary loves George, a per- labeled “1 ” in the input layer represents not
son is likely to conjecture that John is likely “1 ,” but “1 as the input to the identity function.”
to be jealous of George. An eliminative That is, it represents a conjunctive binding
connectionist system would be a complete of the value “1 ” to the variable “input to
loss to make any inferences: John, Mary, the function.” Likewise, the unit labeled “1 ”
and George are different people than Al- in the output layer represents, not “1 ,” but
ice, Sam, and Betty (Holyoak & Hummel, “1 ” as output of the identity function. Thus,
2000; Hummel & Holyoak, 2003 a; Phillips & counter to initial appearances, the concept
Halford, 1 997). “1 ” is not represented anywhere in the net-
A particularly simple example that re- work. Neither, for that matter, is the concept
veals such generalization failures is the iden- “input to the identity function”: Every unit in
tity function (Marcus, 1 998). Suppose, for the input layer represents some specific input
example, that a human reasoner was trained to the function; there are no units to repre-
to respond with “1 ” to “1 ,” “2” to “2,” and “3 ” sent input as a generic unbound variable.
to “3 .” Even with just these three examples, Because of this representational con-
the human is almost certain to respond with vention (i.e., representing variable-value
“4” to “4,” without any direct feedback that conjunctions instead of variables and val-
this is the correct output for the new case. In ues), traditional connectionist networks are
contrast, an eliminative connectionist model forced to learn the identity function as a
will be unable to make this obvious general- mapping from one set of conjunctive units
ization. Such a model can be trained to give (the input layer) to another set of conjunc-
specific outputs to specific inputs (e.g., as il- tive units (the output layer). This mapping,
lustrated in Figure 4.1 ). But when training which to our eye resembles an approxima-
is over, it will have learned only the input– tion of the identity function, f(x) = x, is, to
output mappings on which it was trained the network, just an arbitrary mapping. It
(and perhaps those that can be represented is arbitrary precisely because the unit repre-
by interpolating between trained examples; senting “1 as output of the function” bears
see Marcus, 1 998): Because the model lacks no relation to the unit representing “1 as in-
the capacity to represent variables, extrap- put to the function.” Although any func-
olation outside the training set is impossi- tion specifies a mapping [e.g., a mapping
ble. In other words, the model will simply from values of x to values of f(x)], learning a
have learned to associate “1 ” with “1 ,” “2” mapping is not the same thing as learning a
with “2,” and “3 ” with “3 .” A human, by con- function. Among other differences, a func-
trast, will have learned to associate input (x) tion can be universally quantified [e.g., ∀x,
with output (x), for any x; and doing so re- f(x) = x], whereas a finite mapping cannot;
quires the capacity to bind any new number universal quantification permits the function
(whether it was in the training space or not) to apply to numbers (and even nonnum-
to the variable x. Indeed, most people are bers) that lie well outside the “training” set.
willing to generalize even beyond the world The point is that the connectionist model’s
of numbers. We leave it to the reader to give failure to represent variables independently
the appropriate outputs in response to the of their values (and vice versa) relegates it
following inputs: “A”; “B”; “flower.” to (at best) approximating a subset of the
human mind is the product of a symbol

system; hence, any model that succeeds in
eliminating symbol systems will ipso facto
have succeeded in eliminating itself from
contention as a model of the human cog-
nitive architecture.
Conjunctive Connectionist
Representations
Some modelers, recognizing both the es-
sential role of relational representations in
human cognition (e.g., for relational gen-
eralization) and the value of distributed
representations, have sought to construct
symbolic representations in connectionist ar-
chitectures. The most common approach is
based on Smolensky’s (1 990) tensor prod-
ucts (e.g., Halford et al., 1 998) and its
relatives, such as spatter codes (Kanerva,
1 998), holographic reduced representations
(HRRs; Plate, 1 994), and circular convo-
Figure 4.1 . Diagram of a two-layer
connectionist network for solving the identity
lutions (Metcalfe, 1 990). We restrict our
function in which the first three units (those discussion to tensor products because the
representing the numbers 1 , 2, and 3 ) have been properties of tensors we discuss also apply
trained and the last two (those representing the to the other approaches (see Holyoak &
numbers 4 and 5 ) have not. Black lines indicate Hummel, 2000).
already trained connections, whereas grey lines A tensor product is an outer product of
denote untrained connections. Thicker lines two or more vectors that are treated as an
indicate highly excitatory connections, whereas activation vector (i.e., rather than a matrix)
thinner lines signify slightly excitatory or slightly for the purposes of knowledge representa-
inhibitory connections. tion (see Smolensky, 1 990). In the case of a
rank 2 tensor, uv, formed from two vectors,
identity function as a simple, and ultimately u and v, the activation of the ijth element
arbitrary, mapping (see Marcus, 1 998). Peo- of uv is simply the product of the activa-
ple, by contrast, represent variables indepen- tions of the ith and j th elements of u and v,
dently of their values (and vice versa) and respectively: uvij = ui vj . Similarly, the ijk th
so can recognize and exploit the decidedly value of the rank 3 tensor uvw is the product
nonarbitrary relation between the function’s uvwijk = ui vj wk , and so forth, for any num-
inputs and its outputs: To us, but not to ber of vectors (i.e., for any rank).
the network, the function is not an arbitrary Tensors and their relatives can be used
mapping at all, but rather a trivial game of to represent role-filler bindings. For exam-
“say what I say.” ple, if the loves relation is represented by the
As these examples illustrate, the power of vector u, John by the vector v, and Mary
human reasoning and learning, most notably by the vector w, then the proposition loves
our capacity for sophisticated relational gen- (John, Mary) could be represented by the
eralizations, is dependent on the capacity tensor uvw; loves (Mary, John) would be rep-
to represent relational roles (variables) and resented by the tensor uwv. This procedure
bind them to fillers (values). This is precisely for representing propositions as tensors – in
the same capacity that permits composition which the predicate is represented by one
of complex symbols from simpler ones. The vector (here, u) and its argument(s) by the
others (v and w) – is isomorphic with SAA isomorphic tensors (as advocated by Halford
(Halford et al., 1 998): One entity (here, a and colleagues) and role-filler binding-based
vector) represents the relation, other enti- tensors (as advocated by Smolensky and col-
ties represent its arguments, and the bind- leagues). A tensor product is a product of
ings of arguments to roles of the relation two or more vectors, and so the similarity of
are represented spatially (note the differ- two tensors (e.g., their inner product or the
ence between uvw and uwv). This version cosine of the angle between them) is equal
of tensor-based coding is SAA-isomorphic; to the product of the similarities of the ba-
the entire relation is represented by a single sic vectors from which they are constructed.
vector or symbol, and arguments are bound For example, in the case of tensors ab and cd
directly to that symbol. Consequently, it formed from vectors a, b, c, and d:
provides no basis for differentiating the se-
ab · cd = (a · c)(b · d), (4.1 )
mantic features of the various roles of a
relation. where the “·” denotes the inner product, and
Another way to represent relational bind-
cos(ab, cd) = cos(a, c)cos(b, d), (4.2)
ings using tensors is to represent individual
relational roles as vectors, role-filler bind- where cos(x, y) is the cosine of the angle
ings as tensors, and complete propositions between x and y.
as sums of tensors (e.g., Tesar & Smolensky, In other words, two tensor products are
1 994). For example, if the vector l repre- similar to one another to the extent that their
sents the lover role of the loves relation, b the roles and fillers are similar to one another. If
beloved role, j John and m Mary, then loves vectors a and c represent relations (or re-
(John, Mary) would be represented by the lational roles) and b and d represent their
sum lj + bm, and loves (Mary, John) would fillers, then the similarity of the ab binding
be the sum lm + bj. to the cd binding is equal to the similarity of
Tensors provide a basis for representing roles a and c times the similarity of fillers b
the semantic content of relations (in the case and d. This fact sounds unremarkable at first
of tensors that are isomorphic with SAA) blush. However, consider the case in which
or relational roles (in the case of tensors a and c are identical (for clarity, let us re-
based on role-filler bindings) and to repre- place them both with the single vector r),
sent role-filler bindings explicitly. Accord- but b and d are completely unrelated (i.e.,
ingly, numerous researchers have argued that they are orthogonal, with an inner product
tensor products and their relatives provide of zero). In this case,
an appropriate model of human symbolic
(rb · rd) = (r · r)(b · d) = 0. (4.3 )
representations. Halford and his colleagues
also showed that tensor products based on That is, the similarity of rb to rd is zero even
SAA representations provide a natural ac- though both refer to the same relational role.
count of the capacity limits of human work- This result is problematic for tensor-
ing memory and applied these ideas to ac- based representations because a connection-
count for numerous phenomena in relational ist network (and for that matter, probably a
reasoning and cognitive development (see person) will generalize learning from rb to
Halford, Chap. 22). Tensors are thus at least rd to the extent that the two are similar to
a useful approximation of human relational one another. Equation (4.3 ) shows that, if b
representations. and d are orthogonal, then rb and rd will be
However, tensor products and their orthogonal even though they both represent
relatives have two properties that limit bindings of different arguments to exactly
their adequacy as a general model of hu- the same relational role (r). As a result, ten-
man relational representations. First, tensors sor products cannot support relational gener-
necessarily violate role-filler independence alization. The same limitation applies to all
(Holyoak & Hummel, 2000; Hummel & multiplicative binding schemes (i.e., repre-
Holyoak, 2003 a). This is true both of SAA- sentations in which the vector representing
a binding is a function of the product of the ability to represent similar concepts with
the vectors representing the bound ele- similar vectors. This trade-off is a symp-
ments), including HRRs, circular convolu- tom of the fact that tensors are trapped on
tions, and spatter codes (see Hummel & the implicit relations continuum (Hummel &
Holyoak, 2003 a). Biederman, 1 992) – the continuum from
A second problem for tensor-based repre- holistic (localist) to feature-based (dis-
sentations concerns the representation of the tributed), vector-based representations of
semantics of relational roles. Tensors that are concepts – characterizing representational
SAA-isomorphic (e.g., Halford et al., 1 998) schemes that fail to code relations indepen-
fail to distinguish the semantics of differ- dently of their arguments.
ent roles of the relation precisely because
they are SAA-isomorphic (see Doumas &
Role-Filler Binding by Vector Addition
Hummel, 2004): Rather than using sepa-
rate vectors to represent a relation’s roles, What is needed is a way to both represent
SAA-isomorphic tensors represent the rela- roles and their fillers in a distributed fash-
tion, as a whole, using a single vector. Role- ion (to capture their semantic content) and
filler binding tensors (e.g., as proposed by simultaneously bind roles to their fillers in
Smolensky and colleagues) do explicitly rep- a way that does not violate role-filler inde-
resent the semantic content of the individual pendence (to achieve meaningfully symbolic
roles of a relation. However, these represen- representation and thus relational general-
tations are limited by the summing opera- ization). Tensor products are on the right
tion that is used to conjoin the separate role- track in the sense that they represent rela-
filler bindings into complete propositions. tions and fillers in a distributed fashion, and
The result of the summing operation is a “su- they can represent role-filler bindings – just
perposition catastrophe” (von der Malsburg, not in a way that preserves role-filler inde-
1 981 ) in which the original role-filler bind- pendence. Accordingly, in the search for a
ings – and therefore the original roles and distributed code that preserves role-filler in-
fillers – are unrecoverable (a sum underde- dependence, it is instructive to consider why,
termines its addends). mathematically, tensors violate it.
The deleterious effects of this superpo- The reason is that a tensor is a product
sition can be minimized by using sparse of two or more vectors, and so the value of
representations in a very high-dimensional ij th element of the tensor is a function of
space (Kanerva, 1 998; Plate, 1 991 ). This the i th value of the role vector and the j th
approach works because it minimizes the element of the filler vector. That is, a tensor
representational overlap between separate is the result of a multiplicative interaction
concepts. However, minimizing the repre- between two or more vectors. Statistically,
sentational overlap also minimizes the pos- when two or more variables do not interact –
itive effects of distributed representations that is, when their effects are independent, as
(which stem from the overlap between rep- in the desired relationship between roles and
resentations of similar concepts). In the their fillers – their effects are additive (rather
limit, sparse coding becomes equivalent to than multiplicative). Accordingly, the way
localist conjunctive coding with completely to bind a distributed vector, r, representing
separate codes for every possible conjunc- a relational role to a vector, f, representing
tion of roles and fillers. In this case, there its filler is not to multiply them but to add
is no interference between separate bind- them (Holyoak & Hummel, 2000; Hummel
ings, but neither is there overlap between & Holyoak, 1 997, 2003 a):
related concepts. Conversely, as the overlap
between related concepts increases, so does rf = r + f, (4.4)
the ambiguity of sums of separate role bind-
ings. The ability to keep separate bindings where rf is just an ordinary vector (not
separate thus invariably trades off against a tensor).4
Binding by vector addition is most com- Vaadia et al., 1 995 ). It seems that evolution
monly implemented in the neural network and the brain may be happy to exploit “en-
modeling community as synchrony of neural gineering conveniences.” This would be un-
firing (for reviews, see Hummel & Holyoak, surprising given the computational benefits
1 997, 2003 a), although it can also be real- endowed by dynamic binding (namely, re-
ized in other ways (e.g., as systematic asyn- lational generalization based on distributed
chrony for firing; Love, 1 999). The basic idea representations), the ease with which syn-
is that vectors representing relational roles chrony can be established in neural systems,
fire in synchrony with vectors represent- and the ease with which it can be exploited
ing their fillers and out of synchrony with (it is well known that spikes arriving in close
other role-filler bindings. That is, at each in- temporal proximity have superadditive ef-
stant in time, a vector representing a role is fects on the postsynaptic neuron relative to
“added to” (fires with) the vector represent- spikes arriving at very different times). The
ing its filler. mapping between the limitations of human
Binding by synchrony of firing is much WM and the limitations of synchrony cited
reviled in some segments of the connec- by O’Reilly et al. (2003 ) also constitutes in-
tionist modeling community. For example, direct support for the synchrony hypothe-
Edelman and Intrator (2003 ) dismissed it sis, as do the successes of models based on
as an “engineering convenience.” Similarly, synchrony (for reviews, see Hummel, 2000;
O’Reilly et al. (2003 ) dismissed it on the Hummel & Holyoak, 2003 b; Shastri, 2003 ).
grounds that (1 ) it is necessarily transient However, synchrony of firing cannot be
[i.e., it is not suitable as a basis for stor- the whole story. At a minimum, conjunc-
ing bindings in long-term memory (LTM)], tive coding is necessary for storing bindings
(2) it is capacity limited (i.e., it is only in LTM and forming localist tokens of roles,
possible to have a finite number of bound objects, role-filler bindings, and complete
groups simultaneously active and mutually propositions (Hummel & Holyoak, 1 997,
out of synchrony; Hummel & Biederman, 2003 a). It seems likely, therefore, that an ac-
1 992; Hummel & Holyoak, 2003 a; Hum- count of the human cognitive architecture
mel & Stankiewicz, 1 996), and (3 ) bind- that includes both “mundane” acts (such as
ings represented by synchrony of firing must shape perception, which actually turns out
ultimately make contact with stored con- to be relational; Hummel, 2000) and sym-
junctive codes in LTM. These limitations do bolic cognition (such as planning, reason-
indeed apply to binding by synchrony of ing, and problem solving) must incorporate
firing; (1 ) and (2) are also precisely the lim- both dynamic binding (for independent rep-
itations of human working memory (WM) resentation of roles bound to fillers in WM)
(see Cowan, 2000). Limitation (3 ) is meant and conjunctive coding (for LTM storage
to imply that synchrony is redundant: If and token formation) and specify how they
you already have to represent bindings con- are related.
junctively in order to store them in LTM, The remainder of this chapter reviews
then why bother to use synchrony? The an- one example of this approach to knowl-
swer is that synchrony, but not conjunctive edge representation – “LISAese,” the rep-
coding, makes it possible to represent roles resentational format used by Hummel and
independently of their fillers and thus al- Holyoak’s (1 992, 1 997, 2003 a) LISA (Learn-
lows symbolic representations and relational ing and Inference with Schemas and Analo-
generalization. gies) model of analogical inference and
Despite the objections of Edelman and schema induction – with an emphasis on
Intrator (2003 ), O’Reilly et al. (2003 ), and how LISAese permits symbolic representa-
others, there is substantial evidence for bind- tions to be composed from distributed (i.e.,
ing by synchrony in the primate visual cortex semantically rich) representations of roles
(see Singer, 2000, for a review) and frontal and fillers and how the resulting representa-
cortex (e.g., Desmedt & Tomberg, 1 994; tions are uniquely suited to simulate aspects
Figure 4.2 . Representation of propositions in LISAese. Objects and relational roles are represented
both as patterns of activation distributed over units representing semantic features (semantic units;
small circles) and as localist units representing tokens of objects (large circles) and relational roles
(triangles). Roles are bound to fillers by localist subproposition (SP) units (rectangles), and role-filler
bindings are bound into complete propositions by localist proposition (P) units (ovals).
(a) Representation of loves (Susan, Jim). (b) Representation of knows [Jim, loves (Susan, Jim)]. When
one P takes another as an argument, the lower (argument) P serves in the place of an object unit
under the appropriate SP of the higher-level P unit [in this case, binding loves (Susan, Jim) to the SP
representing what is known].
of human perception and cognition (also see ality traits, etc.), and Susan might be rep-
Holyoak, Chap. 6). resented as human and female (along with
LISAese is based on a hierarchy of dis- units for her unique attributes). Similarly,
tributed and localist codes that collectively the lover and beloved roles of the loves re-
represent the semantic features of objects lation would be represented by semantic
and relational roles and their arrangement units capturing their semantic content. At
into complete propositions (Figure 4.2). At the next level of the hierarchy, object and
the bottom of the hierarchy, semantic units predicate units (large circles and triangles in
(small circles in Figure 4.2) represent ob- Figure 4.2) represent objects and relational
jects and relational roles in a distributed roles in a localist fashion and share bidi-
fashion. For example, Jim might be repre- rectional excitatory connections with the
sented by features such as human, and male corresponding semantic units. Subproposi-
(along with units representing his person- tion units (SPs; rectangles in Figure 4.2)
represent bindings of relational roles to their features specifying her height in an analog
arguments [which can either be objects, as in about her playing basketball; see Hummel
Figure 4.2(a), or complete propositions, as in & Holyoak, 2003 a). Thus, whereas the local-
Figure 4.2(b)]. At the top of the hierarchy, ist units represent tokens, the semantic units
separate role-filler bindings (i.e., SPs) are represent types.
bound into a localist representation of the The hierarchy of units depicted in Fig-
proposition as a whole via excitatory connec- ure 4.2 represents propositions both in
tions to a single proposition (P) unit (ovals LISA’s LTM and, when the units become ac-
in Figure 4.2). Representing propositions in tive, in its WM. In this representation, the
this type of hierarchy reflects our assump- binding of roles to fillers is captured by the
tion that every level of the hierarchy must be localist (and conjunctive) SP units. When
represented explicitly as an entity in its own a proposition becomes active, its role-filler
right (see Hummel & Holyoak, 2003 a). The bindings are also represented dynamically
resulting representational system is com- by synchrony of firing. When a P unit be-
monly referred to as a role-filler binding sys- comes active, it excites the SPs to which it
tem (see Halford et al., 1 998). Both rela- is connected. Separate SPs inhibit one an-
tional roles and their fillers are represented other, causing them to fire out of synchrony
explicitly, and relations are represented as with one another. When an SP fires, it acti-
linked sets of role-filler bindings. Impor- vates the predicate and object units beneath
tantly, in role-filler binding systems, rela- it, and they activate the semantic units be-
tional roles, their semantics, and their bind- neath themselves. On the semantic units, the
ings to their fillers are all made explicit in result is a collection of mutually desynchro-
the relational representations themselves. As nized patterns of activation, one for each role
a result, role-filler binding representations binding. For example, the proposition loves
are not subject to the problems inherent (Susan, Jim) would be represented by two
in SAA representations discussed previously such patterns, one binding the semantic fea-
wherein relational roles are left implicit in tures of Susan to the features of lover, and the
the larger relational structures. other binding Jim to beloved. The proposi-
A complete analog (i.e., story, situation, tion loves (Jim, Susan) would be represented
or event) in LISAese is represented by the by the very same semantic units (as well as
collection of P, SP, predicate, object, and se- the same object and predicate units); only
mantic units that code its propositional con- the synchrony relations would be reversed.
tent. Within an analog, a given object, re- The resulting representations explicitly
lational role, or proposition is represented bind semantically rich representations of
by a single localist unit regardless of how relational roles to representations of their
many times it is mentioned in the analog fillers (at the level of semantic features, pred-
[e.g., Susan is represented by the same unit icate and object units, and SPs) and represent
in both loves (Susan, Jim) and loves (Charles, complete relations as conjunctions of role-
Susan)], but a given element is represented filler bindings (at the level of P units). As
by separate localist units in separate analogs. a result, they do not fall prey to the short-
The localist units thus represent tokens of in- comings of traditional connectionist repre-
dividual objects, relations, or propositions in sentations (which cannot dynamically bind
particular situations (i.e., analogs). A given roles to their fillers), those of SAA (which
object or relational role will tend to be can represent neither relational roles nor
connected to many of the same semantic their semantic content explicitly), or those
units in all the analogs in which it is men- of tensors.
tioned, but there may be small differences Hummel, Holyoak, and their colleagues
in the semantic representation, depending have shown that LISAese knowledge rep-
on context (e.g., Susan might be connected resentations, along with the operations that
to semantics describing her profession in act on them, account for a very large
an analog that refers to her work and to number of phenomena in human relational
reasoning, including phenomena surround- semantic content of the entities they repre-
ing memory retrieval, analogy making sent but fail to provide any basis for binding
(Hummel & Holyoak, 1 997), analogical in- those entities together into symbolic (i.e.,
ference, and schema induction (Hummel & relational) structures. This failure renders
Holyoak, 2003 a). They provide a natural ac- them incapable of relational generalization.
count of the limitations of human WM, on- Connectionist models that attempt to
togenetic and phylogenetic differences be- achieve symbolic competence by using ten-
tween individuals and species (Hummel & sor products and other forms of conjunc-
Holyoak, 1 997), the relation between ef- tive coding as the sole basis for role-filler
fortless (“reflexive”; Shastri & Ajjanagadde, binding find themselves in a strange world
1 993 ) and more effortful (“reflective”) forms in between the symbolic and connection-
of reasoning (Hummel & Choplin, 2000), ist approaches (i.e., on the implicit relations
and the effects of frontotemporal degener- continuum) neither fully able to exploit the
ation (Morrison et al., 2004; Waltz et al., strengths of the connectionist approach nor
1 999) and natural aging (Viskontas et al., fully able to exploit the strengths of the sym-
in press) on reasoning and memory. They bolic approach.
also provide a basis for understanding the Knowledge representations based on dy-
perceptual–cognitive interface (Green & namic binding of distributed representations
Hummel, 2004) and how specialized cog- of relational roles and their fillers (of which
nitive “modules” (e.g., for reasoning about LISAese is an example) – in combination
spatial arrays of objects) can work with with a localist representations of roles, fillers,
the broader cognitive architecture in the role-filler bindings, and their composition
service of specific reasoning tasks (e.g., into complete propositions – can simulta-
transitive inference; Holyoak & Hummel, neously capture both the symbolic nature
2000) (see Hummel & Holyoak, 2003 b, for and semantic richness of human mental rep-
a review). resentations. The resulting representations
are neurally plausible, semantically rich,
flexible, and meaningfully symbolic. They
Summary provide the basis for a unified account of hu-
man memory storage and retrieval, analogi-
An explanation of human mental represen- cal reasoning, and schema induction, includ-
tations – and the human cognitive architec- ing a natural account of both the strengths,
ture more broadly – must account both for limitations, and frailties of human rela-
our ability to represent the semantic content tional reasoning.
of relational roles and their fillers and for our
ability to bind roles to their fillers dynam-
ically without altering the representation
Acknowledgments
of either.
Traditional symbolic approaches to cog-
nition capture the symbolic nature of hu- This work was supported by a grant from the
man relational representations, but they fail UCLA Academic Senate. We thank Graeme
to specify the semantic content of roles and Halford and Keith Holyoak for very helpful
their fillers – a failing that, as noted by the comments on an earlier draft of this chapter.
connectionists in the 1 980s, renders them
too inflexible to serve as an adequate ac-
count of human mental representations, and, Notes
as shown by Doumas and Hummel (2004),
appears inescapable. 1 . Arguments (or roles) may suggest different
Traditional distributed connectionist ap- shades of meaning as a function of the roles
proaches have the opposite strengths and (or fillers) to which they are bound. For exam-
weaknesses: They succeed in capturing the ple, “loves” suggests a different interpretation
in loves (John, Mary) than it does in loves (John, further layer of conjunctive coding and further
chocolate). However, such contextual varia- violating role-filler independence.
tion does not imply in any general sense that
the filler (or role) itself necessarily changes its
identity as a function of the binding. For exam- References
ple, our ability to appreciate that the “John”
in loves (John, Mary) is the same person as
Bassok, M., Wu, L., & Olseth, K. L. (1 995 ). Judg-
the “John” in bites (Rover, John) demands ex-
ing a book by its cover: Interpretive effects of
planation in terms of John’s invariance across
content on problem-solving transfer. Memory
the different bindings. If we assume invari-
and Cognition, 2 3 , 3 5 4–3 67.
ance of identity with binding as the general
case, then it is possible to explain contextual Bowers, J. S. (2002). Challenging the widespread
shadings in meaning when they occur (Hum- assumption that connectionism and distributed
mel & Holyoak, 1 997). However, if we assume representations go hand-in-hand. Cognitive
lack of invariance of identity as the general Psychology, 45 , 41 3 –445 .
case, then it becomes impossible to explain Cowan, N. (2000). The magical number 4 in
how knowledge acquired about an individual short-term memory: A reconsideration of men-
or role in one context can be connected to tal storage capacity. Behavioral and Brain Sci-
knowledge about the same individual or role in ences, 2 4, 87–1 1 4.
other contexts. Desmedt, J., & Tomberg, C. (1 994). Transient
2. In the most extreme version of this account, phase-locking of 40 Hz electrical oscillations in
the individual processing elements are not as- prefrontal and parietal human cortex reflects
sumed to “mean” anything at all in isolation; the process of conscious somatic perception.
rather they take their meaning only as part of a Neuroscience Letters, 1 68, 1 26–1 29.
whole distributed pattern. Some limitations of Doumas, L. A. A., & Hummel, J. E. (2004). A
this extreme account are discussed by Bowers fundamental limitation of symbol-argument-
(2002) and Page (2000). argument notation as a model of human re-
3 . For example, Falkenhainer, Forbus, and Gen- lational representations. In Proceedings of the
tner’s (1 989) structure matching engine Twenty-Second Annual Conference of the
(SME), which uses SAA-based representations Cognitive Science Society, 3 27–3 3 2.
to perform graph matching, cannot map loves Edelman, S., & Intrator, N. (2003 ). Towards
(Abe, Betty) onto likes (Peter, Bertha) because structural systematicity in distributed, stati-
loves and likes are nonidentical predicates. To cally bound visual representations. Cognitive
perform this mapping, SME must recast the Science, 2 7, 73 –1 09.
predicates into a common form, such as has- Elman, J., Bates, E., Johnson, M., Karmiloff-
affection-for (Abe, Betty) and has-affection-for Smith, A., Parisi, D., & Plunkett, K. (1 996).
(Alex, Bertha) and then map these identical Rethinking innateness: A connectionist perspec-
predicates. tive on development. Cambridge, MA: MIT
4. At first blush, it might appear that adding two Press/Bradford Books.
vectors where one represents a relational role Falkenhainer, B., Forbus, K. D., & Gentner, D.
and the other its filler should be susceptible (1 989). The structure mapping engine: Algo-
to the very same problem that we faced when rithm and examples. Artificial Intelligence, 41 ,
adding two tensors where each represented a 1 –63 .
role-filler binding, namely the superposition
Fodor, J. A., & Pylyshyn, Z. W. (1 988). Connec-
catastrophe. It is easy to overcome this prob-
tionism and cognitive architecture. Cognition,
lem in the former case, however, by simply
2 8, 3 –71 .
using different sets of units to represent roles
and fillers so the network can distinguish them Garner, W. R. (1 974). The processing of information
when added (see Hummel & Holyoak, 2003 a). and structure. Hillsdale, NJ: Erlbaum.
This solution might also be applied to role- Gentner, D. (1 983 ). Structure-mapping: A the-
filler binding with tensors, although doing so oretical framework for analogy. Cognitive Sci-
would require using different sets of units to ence, 7, 1 5 5 –1 70.
code different role-filler bindings. This solu- Gentner, D., Ratterman, M. J., & Forbus, K. D.
tion would require allocating separate tensors (1 993 ). The roles of similarity in transfer:
to separate role-filler bindings, thus adding a Separating retrievability from inferential
soundness. Cognitive Psychology, 2 5 , 5 24– reasoning. In Proceedings of the twenty-second

5 75 . annual conference of the Cognitive Science Soci-
Gick, M. L., & Holyoak, K. J. (1 980). Analogical ety. Hillsdale, NJ: Erlbaum.
problem solving. Cognitive Psychology, 1 2 , 3 06– Hummel, J. E., & Holyoak, K. J. (1 992). Indirect
355. analogical mapping. Proceedings of the 1 4th an-
Gick, M. L., & Holyoak, K. J. (1 983 ). Schema in- nual conference of the Cognitive Science Society,
duction and analogical transfer. Cognitive Psy- 5 1 6–5 21 .
chology, 1 5 , 1 –3 8. Hummel, J. E., & Holyoak, K. J. (1 997). Dis-
Goldstone, R. L., Medin, D. L., & Gentner, D. tributed representations of structure: A theory
(1 991 ). Relational similarity and the noninde- of analogical access and mapping. Psychological
pendence of features in similarity judgments. Review, 1 04, 427–466.
Cognitive Psychology, 2 3 , 222–262. Hummel, J. E., & Holyoak, K. J. (2003 a). A
Green, C. B., & Hummel, J. E. (2004). Rela- symbolic-connectionist theory of relational in-
tional perception and cognition: Implications ference and generalization. Psychological Re-
for cognitive architecture and the perceptual- view, 1 1 0, 220–263 .
cognitive interface. In B. H. Ross (Ed.), The Hummel, J. E., & Holyoak, K. J. (2003 b).
psychology of learning and motivation (Vol. 44, Relational reasoning in a neurally-plausible
pp. 201 –223 ). San Diego: Academic Press. cognitive architecture: An overview of the
Halford, G. S., Wilson, W. H., & Phillips, S. LISA project. Cognitive Studies: Bulletin of
(1 998). Processing capacity defined by rela- the Japanese Cognitive Science Society, 1 0,
tional complexity: Implications for compara- 5 8–75 .
tive, developmental, and cognitive psychology. Hummel, J. E., & Stankiewicz, B. J. (1 996).
Brain and Behavioral Sciences, 2 1 , 803 –864. An architecture for rapid, hierarchical struc-
Hinton, G. E. (Ed.). (1 990). Connectionist symbol tural description. In T. Inui & J. McClelland
processing. Cambridge, MA: MIT Press. (Eds.), Attention and performance XVI: Informa-
Holland, J. H., Holyoak, K. J., Nisbett, R. E., & tion integration in perception and communication
Thagard, P. (1 986). Induction: Processes of infer- (pp. 93 –1 21 ). Cambridge, MA: MIT Press.
ence, learning, and discovery. Cambridge, MA: Kanerva, P. (1 998). Sparse distributed memory.
MIT Press. Cambridge, MA: MIT Press.
Holyoak, K. J., & Hummel, J. E. (2000). The Keane, M. T., Ledgeway, T., & Duff, S. (1 994).
proper treatment of symbols in a connection- Constraints on analogical mapping: A compari-
ist architecture. In E. Dietrich & A. Markman son of three models. Cognitive Science, 1 8, 3 87–
(Eds.), Cognitive dynamics: Conceptual change 43 8.
in humans and machines (pp. 229–263 ). Hills- Kim, J. J., Pinker, S., Prince, A., & Prasada, S.
dale, NJ: Erlbaum. (1 991 ). Why no mere mortal has ever flown
Holyoak, K. J., & Thagard, P. (1 995 ). Mental leaps: out to center field. Cognitive Science, 1 5 , 1 73 –
Analogy in creative thought. Cambridge, MA: 21 8.
MIT Press. Krawczyk, D. C., Holyoak, K. J., & Hummel,
Hummel, J. E. (2000). Where view-based theo- J. E. (in press). Structural constraints and object
ries break down: The role of structure in shape similarity in analogical mapping and inference.
perception and object recognition. In E. Diet- Thinking and Reasoning.
rich & A. Markman (Eds.), Cognitive dynamics: Kubose, T. T., Holyoak, K. J., & Hummel, J. E.
Conceptual change in humans and machines (pp. (2002). The role of textual coherence in incre-
1 5 7–1 85 ). Hillsdale, NJ: Erlbaum. mental analogical mapping. Journal of Memory
Hummel, J. E. (2003 ). Effective systematicity in, and Language, 47, 407–43 5 .
effective systematicity out: A reply to Edelman Landauer, T. K., & Dumais, S. T. (1 997). A so-
& Intrator (2003 ). Cognitive Science, 2 7, 3 27– lution to Plato’s problem: The latent seman-
3 29. tic analysis theory of acquisition, induction and
Hummel, J. E., & Biederman, I. (1 992). Dynamic representation of knowledge. Psychological Re-
binding in a neural network for shape recogni- view, 1 04, 21 1 –240.
tion. Psychological Review, 99, 480–5 1 7. Love, B. C. (1 999). Utilizing time: Asynchronous
Hummel, J. E., & Choplin, J. M. (2000). Toward binding. Advances in Neural Information Pro-
an integrated account of reflexive and reflective cessing Systems, 1 1 , 3 8–44.
Lund, K., & Burgess, C. (1 996). Producing high- Phillips, S., & Halford, G. S. (1 997). Syste-
dimensional semantic spaces from lexical co- maticity: Psychological evidence with connec-
occurrence. Behavior Research Methods, Instru- tionist implications. In M. G. Shafto & P.
mentation, and Computers, 2 8, 203 –208. Langley (Eds.), Proceedings of the nineteenth
Manzano, M. (1 996). Extensions of first order logic. conference of the Cognitive Science Society
Cambridge: Cambridge University Press. (pp. 61 4–61 9). Hillsdale, NJ: Erlbaum.
Marcus, G. F. (1 998). Rethinking eliminative con- Pinker, S., & Prince, A. (1 988). On language
nectionism. Cognitive Psychology, 3 7(3 ), 243 – and connectionism: Analysis of a parallel dis-
282. tributed processing model. Cognition, 2 8, 73 –
Marcus, G. F. (2001 ). The algebraic mind. Cam- 1 93 .
bridge, MA: MIT Press. Plate, T. (1 991 ). Holographic reduced represen-
Metcalfe, J. (1 990). Composite holographic as- tations: Convolution algebra for compositional
sociative recall model (CHARM) and blended distributed representations. In J. Mylopoulos &
memories in eyewitness testimony. Journal of R. Reiter (Eds.), Proceedings of the 1 2 th interna-
Experimental Psychology: General, 1 1 9, 1 45 – tional joint conference on artificial intelligence
1 60. (pp. 3 0–3 5 ). San Mateo, CA: Morgan Kauf-
mann.
Morrison, R. G., Krawczyk, D., Holyoak, K. J.,
Hummel, J. E., Chow, T., Miller, B., & Knowl- Plate, T. A. (1 994). Distributed representations
ton, B. J. (2004). A neurocomputational model and nested compositional structure. Unpublished
of analogical reasoning and its breakdown in doctoral dissertation, Department of Com-
frontotemporal lobar degeneration. Journal of puter Science, University of Toronto, Toronto,
Cognitive Neuroscience, 1 6, 1 –1 1 . Canada.
Munakata, Y., & O’Reilly, R. C. (2003 ). Devel- Robin, N., & Holyoak, K. J. (1 995 ). Relational
opmental and computational neuroscience ap- complexity and the functions of prefrontal cor-
proaches to cognition: The case of generaliza- tex. In M. S. Gazzaniga (Ed.), The cognitive
tion. Cognitive Studies, 1 0, 76–92. neurosciences (pp. 987–997). Cambridge, MA:
Newell, A. (1 990). Unified theories of cognition. MIT Press.
Cambridge, MA: Harvard University Press. Ross, B. (1 987). This is like that: The use of earlier
O’Reilly, R. C. (2001 ). Generalization in interac- problems and the separation of similarity ef-
tive networks: The benefits of inhibitory com- fects. Journal of Experimental Psychology: Learn-
petition and Hebbian learning. Neural Compu- ing, Memory, and Cognition, 1 3 , 629–63 9.
tation, 1 3 , 1 1 99–1 242. Rumelhart, D. E., McClelland, J. L., & the PDP
O’Reilly, R. C., Busby, R. S., & Soto, R. Research Group. (1 986). Parallel distributed
(2003 ). Three forms of binding and their neu- processing: Explorations in the microstructure of
ral substrates: Alternatives to temporal syn- cognition (Vol. 1 ). Cambridge, MA: MIT Press.
chrony. In A. Cleeremans (Ed.), The unity of Saiki, J., & Hummel, J. E. (1 998). Connected-
consciousness: Binding, integration, and dissocia- ness and the integration of parts with relations
tion (pp. 1 68–1 92). Oxford, UK: Oxford Uni- in shape perception. Journal of Experimental
versity Press. Psychology: Human Perception and Performance,
Oden, D. L., Thompson, R. K. R., & Premack, 2 4, 227–25 1 .
D. (2001 ). Spontaneous transfer of matching Shastri, L. (2003 ). Inference in connectionist net-
by infant chimpanzees. In D. Gentner, K. J. works. Cognitive Studies, 1 0, 45 –5 7.
Holyoak, & B. N. Kokinov (Eds.), The analogi- Shastri, L., & Ajjanagadde, V. (1 993 ). From sim-
cal mind (pp. 471 –497). Cambridge, MA: MIT ple associations to systematic reasoning: A con-
Press. nectionist representation of rules, variables and
Page, M. (2000). Connectionist modelling in dynamic bindings using temporal synchrony.
psychology: A localist manifesto. Behavioral Behavioral and Brain Sciences, 1 6, 41 7–494.
and Brain Sciences, 2 3 , 443 –5 1 2. Singer, W. (2000). Response synchronization, a
Palmer, S. E. (1 978). Fundamental aspects of cog- universal coding strategy for the definition of
nitive representation. In E. Rosch & B. B. Lloyd relations. In M. S. Gazzaniga (Ed.), The new
(Eds.), Cognition and categorization (pp. 25 9– cognitive neurosciences (2nd ed.) (pp. 3 25 –3 3 8).
3 03 ). Hillsdale, NJ: Erlbaum. Cambridge, MA: MIT Press.
Smith, E. E., Langston, C., & Nisbett, R. E. by nonhuman primates: The paleological mon-
(1 992). The case for rules in reasoning. Cog- key and the analogical ape. Cognitive Science,
nitive Science, 1 6, 1 –40. 2 4, 3 63 –3 96.
Smith, L. B. (1 989). From global similarities to Vaadia, E., Haalman, I., Abeles, M., Bergman, H.,
kinds of similarities: The construction of di- Prut, Y., Slovin, H., & Aertsen, A. (1 995 ). Dy-
mensions in development. In S. Vosniadou & A. namics of neuronal interactions in monkey cor-
Ortoney (Eds.), Similarity and analogical rea- tex in relation to behavioural events. Nature,
soning (pp. 1 47–1 77). Cambridge, UK: Cam- 3 73 , 5 1 5 –5 1 8.
bridge University Press. van Gelder, T. J., & Niklasson, L. (1 994). On
Smolensky, P. (1 990). Tensor product variable being systematically connectionist. Mind and
binding and the representation of symbolic Language, 9, 288–3 02.
structures in connectionist systems. Artificial Viskontas, I. V., Morrison, R. G., Holyoak, K. J.,
Intelligence, 46, 1 5 9–21 6. Hummel, J. E., & Knowlton, B. J. (in press).
St. John, M. F. (1 992). The story Gestalt: A model Relational integration, attention and reasoning
of knowledge-intensive processes in text com- in older adults.
prehension. Cognitive Science, 1 6, 271 –3 02. von der Malsburg, C. (1 981 ). The correlation
St. John, M. F., & McClelland, J. L. (1 990). Learn- theory of brain function. Internal Report 81 –
ing and applying contextual constraints in sen- 2. Department of Neurobiology, Max-Planck-
tence comprehension. Artificial Intelligence, 46, Institute for Biophysical Chemistry, Göttingen,
21 7–25 7. Germany.
Stuss, D., & Benson, D. (1 986). The frontal lobes. Waltz, J. A., Knowlton, B. J., Holyoak, K. J.,
New York: Raven Press. Boone, K. B., Mishkin, F. S., de Menezes Santos,
Tesar, B., & Smolensky, P. (1 994, August). M., Thomas, C. R., & Miller, B. L. (1 999). A
Synchronous-firing variable binding is spatio- system for relational reasoning in human pre-
temporal tensor product representation. Pro- frontal cortex. Psychological Science, 1 0, 1 1 9–
ceedings of the 1 6th annual conference of the Cog- 1 25 .
nitive Science Society, Atlanta, GA. Wharton, C. M., Holyoak, K. J., & Lange, T. E.
Thompson, R. K. R., & Oden, D. L. (2000). Cat- (1 996). Remote analogical reminding. Memory
egorical perception and conceptual judgments and Cognition, 2 4, 629–643 .
Part II
REASONING

CHAPTER 5
The Problem of Induction
Steven A. Sloman
David A. Lagnado
In its classic formulation, due to Hume reasoning, Hume proceeded to raise a funda-
(1 73 9, 1 748), inductive reasoning is an ac- mental question now known as “the problem
tivity of the mind that takes us from the of induction” – what are the grounds for such
observed to the unobserved. From the fact inductive or causal inferences? In attempting
that the sun has risen every day thus far, we to answer this question, Hume presents both
conclude that it will rise again tomorrow; a negative and a positive argument.
from the fact that bread has nourished us In his negative thesis, Hume argued that
in the past, we conclude that it will nour- our knowledge of causal relations is not
ish us in the future. The essence of induc- attainable through demonstrative reasoning,
tive reasoning lies in its ability to take us be- but is acquired through past experience. To
yond the confines of our current evidence or illustrate, our belief that fire causes heat, and
knowledge to novel conclusions about the the expectation that it will do so in the fu-
unknown. These conclusions may be partic- ture, is based on previous cases in which one
ular, as when we infer that the next swan has followed the other, and not on any a pri-
we see will be white, or general, as when we ori reasoning. However, once Hume iden-
infer that all swans are white. They may con- tified experience as the basis for inductive
cern the future, as in the prediction of rain inference, he proceeded to demonstrate its
from a dark cloud, or concern something in inadequacy as a justification for these infer-
the past, as in the diagnosis of an infection ences. Put simply, any such argument re-
from current symptoms. quires the presupposition that past experi-
Hume argued that all such reasoning is ence will be a good guide to the future, and
founded on the relation of cause and effect. this is the very claim we seek to justify.
It is this relation that takes us beyond our For Hume, what is critical about our
current evidence, whether it is an inference experience is the perceived similarity be-
from cause to effect, or effect to cause, or tween particular causes and their effects:
from one collateral effect to another. Having “From causes, which appear similar, we ex-
identified the causal basis of our inductive pect similar effects. This is the sum of all our
95
experimental conclusions” (see Goldstone & depend on the environment we experience.

Son, Chap. 2). However, this expectation This idea of a general-purpose associative
cannot be grounded in reason alone because learning system has inspired many contem-
similar causes could conceivably be followed porary accounts of inductive learning (see
by dissimilar effects. Moreover, if one intro- Buehner & Cheng, Chap. 7).
duces hidden powers or mechanisms to ex- Hume’s descriptive account suffers from
plain our observations at a deeper level, the several shortcomings. For one, it seems to as-
problem just gets shifted down. What guar- sume there is an objective sense of similarity
antees that the powers or mechanisms that or resemblance that allows us to pass from
underlie our current experiences will do so like causes to like effects, and vice versa. In
in the future? fact, a selection from among many dimen-
In short, Hume’s negative argument un- sions of similarity might be necessary for a
dermines the assumption that the future will particular case. For example, to what degree
resemble the past. This assumption cannot and in what respects does a newly encoun-
be demonstrated a priori because it is not tered object (e.g., a new type of candy bar)
contradictory to imagine that the course of need to be similar to previously encountered
nature may change. However, neither can it objects for someone to expect a similar prop-
be supported by an appeal to past experience erty (like a similar taste)? If we are to acquire
because this would be to argue in a circle. any predictive habits, we must be able to
Hume’s argument operates at two levels, generalize to some extent from one object
both descriptive and justificatory. At the de- to another, or to the same object at differ-
scriptive level, it suggests that there is no ac- ent times and contexts. How this is carried
tual process of reflective thought that takes out is as much in need of a descriptive ac-
us from the observed to the unobserved. count as the problem of induction itself. Sec-
After all, as Hume points out, even young ond, we might accept that no reflective rea-
infants and animals make such inductions, soning can justify our inductive inferences,
although they clearly do not use reflective but this does not entail that reflective rea-
reasoning. At the justificatory level, it sug- soning cannot be the actual cause of some
gests that there is no possible line of rea- of our inferences. Nevertheless, Hume pre-
soning that could do so. Thus, Hume argues sciently identified the critical role of both
both that reflective reasoning does not and similarity and causality in inductive reason-
could not determine our inductive inferences. ing, the variables that, as we will see, are
Hume’s positive argument provides an at the heart of work on the psychology of
answer to the descriptive question of how induction.
we actually pass from the unobserved to the Hume was concerned with questions of
observed but not to the justificatory one. both description and justification. In con-
He argues that it is custom or habit that trast, the logical empiricists (e.g., Carnap,
leads us to make inferences in accordance 1 95 0, 1 966; Hempel, 1 965 ; Reichenbach,
with past regularities. Thus, after observing 1 93 8) focused only on justification. Having
many cases of a flame being accompanied successfully provided a formal account of de-
by heat, a novel instance of a flame creates ductive logic (Frege, 1 880; Russell & White-
the idea, and hence an expectation, of heat. head, 1 925 ) in which questions of deductive
In this way, a correspondence is set up be- validity were separated from how people ac-
tween the regularities in the world and the tually make deductive inferences (see Evans,
expectations of the mind. Moreover, Hume Chap. 8), philosophers attempted to do the
maintains that this tendency is “implanted same for inductive inference by formulating
in us as an instinct” because nature would rules for an inductive logic.
not entrust it to the vagaries of reason. In Central to this approach is the belief
modern terms, then, we are prewired to ex- that inductive logic, like deductive logic,
pect past associations to hold in the future, concerns the logical relations that hold be-
although what is associated with what will tween statements irrespective of their truth
the problem of induction 97
or falsity. In the case of inductive logic, made equally well using mundane predicates
however, these relations admit of varying or simply in terms of functions (see Hempel,
strengths, a conditional probability measure 1 965 ). Indeed, the problem of drawing a line
reflecting the rational degree of belief that or curve through a finite set of data points
someone should have in a hypothesis given illustrates the same difficulty. Two curves C1
the available evidence. For example, the hy- and C2 may fit the given data points equally
pothesis that “all swans are white” is made well but diverge otherwise. According to the
probable (to degree p) by the evidence state- simple inductive rule, both are equally con-
ment that “all swans in Central Park are firmed and yet we often prefer one curve
white.” On this basis, the logical empiricists over the other. Unfortunately, an inductive
hoped to codify and ultimately justify the logic of the kind proposed by Carnap (1 95 0)
principles of sound inductive reasoning. gives us no grounds to decide which predi-
This project proved to be fraught with dif- cate (or curve) to project.
ficulties, even for the most basic inductive In general, then, Goodman’s (1 95 5 ) prob-
rules. Thus, consider the rule of induction by lem of projectibility concerns how we distin-
enumeration, which states that a universal guish projectible predicates such as “green”
hypothesis H1 is confirmed or made proba- from nonprojectible ones such as “grue.” Al-
ble by its positive instances E. The problem though he concurred with Hume’s claim
is that these very same instances will also that induction consists of a mental habit
confirm a different universal hypothesis H2 formed by past regularities, he argued that
(indeed, an infinity of them), which makes Hume overlooked the further problem (the
an entirely opposite prediction about sub- new riddle) of which past regularities are se-
sequent cases. The most notorious illustra- lected by this mental habit and thus pro-
tion of this point was provided by Goodman jected in the future. After all, it would
(1 95 5 ) and termed “the new riddle of in- appear that we experience a vast range of
duction.” Imagine that you have examined regularities and yet are prepared to project
numerous emeralds and found them all to only a small subset. Goodman himself of-
be colored green. You take this body of evi- fered a solution in terms of entrenchment. In
dence E to confirm (to some degree) the hy- short, a predicate is entrenched if it has a past
pothesis that “All emeralds are green.” How- history of use, where both the term itself,
ever, suppose we introduce the predicate and the extension of the term, figure in this
“grue,” which applies to all objects examined usage. Thus, “green” is entrenched, whereas
so far (before time t) and found to be green “grue” is not because our previous history of
and to all objects not examined and blue. projections involves numerous cases of the
Given this definition and the rule that a uni- former, but none of the latter. In common
versal hypothesis is confirmed by its positive with Hume, then, Goodman gave a descrip-
instances, our evidence set E also confirms tive account of inductive inference, but one
the gruesome hypothesis “All emeralds are grounded in the historic practices of people,
grue.” However, this is highly undesirable and in particular their language use, rather
because each hypothesis makes an entirely than simply the psychology of an individual.
different prediction as to what will happen One shortcoming of Goodman’s proposal
in the future (after time t), when we exis that it hinges on language use. Ultimately,
amine a new emerald. Goodman stated this he attempted to explain our inductive prac-
problem as one of projectibility: How can we tices in terms of our linguistic practices: “the
justify or explain our preference to project roots of inductive validity are to be found
predicates such as “green” from past to fu- in our use of language.” However, surely in-
ture instances, rather than predicates such ductive questions, such as the problem of
as “grue”? projectibility, arise and are solved by infants
Many commentators object that the and animals without language (see Suppes,
problem hinges on the introduction of a 1 994). Indeed, our inductive practices may
bizarre predicate, but the same point can be drive our linguistic practices, rather than
the other way around. Moreover, Goodman order inductions become entrenched if suc-
ruled out, or at least overlooked, the pos- cessful. Although this progress from primi-
sibility that the notions of similarity and tive to theoretical similarity may actually en-
causality are integral to the process of in- gender a qualitative change in our reasoning
ductive reasoning. However, as we will see, processes, the same inductive tendencies ap-
more recent analyses suggest that these are ply throughout. Thus, whether we infer heat
the concepts that will give us the most lever- from a flame, or a neutrino from its path in a
age on the problem of induction. bubble chamber, or even the downfall of an
In his essay, “Natural Kinds” (1 970), empire from the dissatisfaction of its work-
Quine defended a simple and intuitive an- ers, all such inferences rest on our propensity
swer to Goodman’s problem: Projectible to group kindred entities and project them
predicates apply to members of a kind, a into the future on this basis.
grouping formed on the basis of similarity. For Quine, our notions of similarity and
Thus, “green” is projectible, whereas “grue” the way in which we group things become
is not because green things are more simi- increasingly sophisticated and abstract, cul-
lar than grue things; that is, green emeralds minating, he believed, in their eventual re-
form a kind, whereas grue emeralds do not. moval from mature science altogether. This
This shifts the explanatory load onto the conclusion seems to sit uneasily with his
twin notions of similarity and kind, which claims about theoretical similarity. Never-
Quine held to be fundamental to inductive theless, as mere humans, we will always be
inference: “every reasonable expectation de- left with a spectrum of similarity notions
pends on similarity.” For Quine, both hu- and systems of kinds applicable as the con-
mans and animals possess an innate stan- text demands, which accounts for the coex-
dard of similarity useful for making appro- istence of a variety of procedures for carrying
priate inductions. Without this prior notion, out inductive inference, a plurality that ap-
no learning or generalization can take place. pears to be echoed in more recent cognitive
Despite the subjectivity of this primitive psychology (e.g., Cheng & Holyoak, 1 985 ).
similarity standard, Quine believed that its Both Goodman and Quine said little
uniformity across humans makes the induc- about the notion of causality. This is prob-
tive learning of verbal behavior relatively ably a hangover from the logical empiricist
straightforward. What guarantees, however, view of science that sought to avoid all ref-
that our “innate subjective spacing of quali- erence to causal relations in favor of logical
ties” matches up with appropriate groupings ones. Contemporary philosophical accounts
in nature? Here, Quine appealed to an evolu- have striven to reinstate the notion of causal-
tionary explanation: Without such a match, ity into induction (Glymour, 2001 ; Lipton,
and thus the ability to make appropriate in- 1 991 ; Miller, 1 987).
ductions, survival is unlikely. Miller (1 987) and Lipton (1 991 ) provided
Like Hume, then, Quine proposed a nat- numerous examples of inductive inferences
uralistic account of inductive inference, but that depend on the supposition of, or ap-
in addition to the instinctive habit of associa- peal to, causal relations. Indeed, Miller pro-
tion, he proposed an innate similarity space. posed a definition of inductive confirmation
Furthermore, Quine argued that this primi- as causal comparison: Hypotheses are con-
tive notion of similarity is supplemented, as firmed by appropriate causal accounts of the
we advance from infant to adult and from data-gathering process. Armed with this no-
savage to scientist, by ever more developed tion, he claimed that Goodman’s new rid-
senses of “theoretical” similarity. The devel- dle of induction is soluble. It is legitimate to
opment of such theoretical kinds by the re- project “green” but not ”grue” because only
grouping of things, or the introduction of “green” is consistent with our causal knowl-
entirely new groupings, arises through “trial- edge about color constancy and the belief
and-error theorizing.” In Goodman’s terms, that no plausible causal mechanism sup-
novel projections on the basis of second- ports spontaneous color change. He argued
that any adequate description of inductive fixed, fundamental set of dimensions along
reasoning must allow for the influence of which all complex concepts of objects and
causal beliefs. Further development of such predicates can be aligned. This requirement
an account, however, awaits a satisfactory has been by and large rejected for many rea-
theory of causality (for recent advances, see sons. One problem is that concepts tend to
Pearl, 2000). arise in systems, not individually. Even a sim-
In summary, tracing the progress of philo- ple linguistic predicate like “is small” is con-
sophical analyses suggests a blueprint for a strued very differently when applied to mice
descriptive account of inductive reasoning – and when applied to elephants. Many pred-
a mind that can extract relations of similarity icates that people reason about are emer-
and causality and apply them to new cate- gent properties whose existence depends on
gories in relevant ways. In subsequent sec- the attitude of a reasoning agent (consider
tions, we argue that this is the same pic- “is beautiful” or a cloud that “looks like a
ture that is emerging from empirical work in mermaid”). So we cannot simply represent
psychology. predicates as functions of simpler perceptual
properties. Something else is needed, some-
thing that respects the information we have
about predicates via the relations of objects
Empirical Background and predicates to one another.
In the 1 970s, the answer proffered was
Experimental work in psychology on how similarity (see Goldstone & Son, Chap. 2).
people determine the projectibility of a The additional information required to
predicate has its roots in the study of general- project a predicate was the relative posi-
ization in learning. Theories of learning were tion of a category with respect to other
frequently attempts to describe the shape of categories; the question about one category
a generalization gradient for a simple predi- could be decided based on knowledge of
cate applied to an even simpler class often the predicate’s relation to other (similar)
defined by a single dimension. For exam- categories (see Medin & Rips, Chap. 3 ).
ple, if an organism learned that a tone pre- Prior to the 1 970s, similarity had gener-
dicts food, one might ask how the organism ally been construed as a distance in a fairly
would respond to other tones. The function low-dimensional space (Shepard, 1 980). In
describing how a response (such as saliva- 1 977, Tversky proposed a new measure that
tion) varies with the similarity of the stimu- posited that similarity could be computed
lus to the originally trained stimulus is called over a large number of dimensions, that both
a generalization gradient. Shepard (1 987) ar- common and distinctive features were essen-
gued that such functions are invariably neg- tial to determine the similarity between any
atively exponential in shape. pair of objects, and, critically, that the set
If understood as general theories of induc- of features used to measure similarity were
tion, such theories are necessarily reduction- context dependent. Features depended on
ist in orientation. Because they only consider their diagnosticity in the set of objects be-
the case of generalization along specific di- ing compared and on the specific task used to
mensions that are closely tied to the senses measure similarity. Tversky’s contrast model
(often spectral properties of sound or light), of similarity would, it was hoped, prove
the assumption is, more or less explicitly, to have sufficient representational power to
that more complex predicates can be de- model a number of cognitive tasks, including
composed into sets of simpler ones. The pro- categorization and induction.
jectibility of complex predicates is thus be- The value of representing category struc-
lieved to be reducible to generalization along ture in terms of similarity was reinforced
more basic dimensions. by Rosch’s (1 973 ) efforts to construct a
Reductionism of this kind is highly re- similarity-based framework for understand-
strictive. It requires that there exist some ing natural categories. Her seminal work on
1 00 the cambridge handbook of thinking and reasoning
the typicality structure of categories and on arrive at an intermediate level of hierarchical

the basic level of hierarchical category struc- structure. Jones (1 983 ) made this suggestion,
ture provided the empirical basis for her ar- calling it a measure of “collocation.” A more
guments that categories were mentally rep- sophisticated information-theoretic analysis
resented in a way that carved the world at along these lines is presented in Corter and
its joints. She imagined categories as clusters Gluck (1 992) and Fisher (1 987).
in a vast high-dimensional similarity space Another quite different but complemen-
that were devised to maximize the similar- tary line of work going on at about the
ity within a cluster and minimize the simi- same time as Rosch’s, with related implica-
larity between clusters. Her belief that the tions for inductive inference, was Tversky
structure of this similarity space was given and Kahneman’s (1 974) development of
by the world and was not simply a matter of the representativeness heuristic of proba-
subjective opinion implies that the similar- bility and frequency judgment. The rep-
ity space contains a lot of information that resentativeness heuristic is essentially the
can be used for a number of tasks, including idea that categorical knowledge is used to
inductive inference. make probability judgments (see Kahneman
Rosch (1 978) suggested that the main & Frederick, Chap. 1 2). In that sense, it is an
purpose of category structure was to pro- extension of Rosch’s insights about category
vide the evidential base for relating predi- structure. She showed that similarity was
cates to categories. She attempted to moti- a guiding principle in decisions about cat-
vate the basic level as the level of hierarchical egory membership; Kahneman and Tversky
structure that maximized the usefulness of a showed that probability judgment could, in
cue for choosing a category, what she called some cases, be understood as a process of
cue validity, the probability of a category categorization driven by similarity. To illus-
given a cue. Basic-level categories were pre- trate, Linda is judged more likely to be a fem-
sumed to maximize cue validity by virtue inist bankteller than a bankteller (despite the
of being highly differentiated; members of conjunction rule of probability that disal-
a basic-level category have more common lows this conclusion) if she has characteristic
attributes than members of a superordinate, feminist traits (i.e., if she seems like she is a
and they have fewer common attributes with member of the category of feminists).
other categories than do members of a subor- In sum, the importance of similarity for
dinate. Murphy (1 982) observed, however, how people make inductive inferences was
that this will not work. The category with recognized in the 1 970s in the study of nat-
maximum probability given a cue is the most ural category structure and probability judg-
general category possible (“entity”), whose ment and manifested in the development of
probability is 1 (or at least close to it). How- models of similarity. Rips (1 975 ) put these
ever, Rosch’s idea can be elaborated using a strands together in the development of a cat-
measure of inductive projectibility in a way egorical induction task. He told people that
that succeeds in picking out the basic level. all members of a particular species of animal
If the level of a hierarchy is selected by ap- on a small island had a particular contagious
pealing to the inductive potential of the cat- disease and asked participants to guess what
egory, say by maximizing category validity, proportion of other species would also have
the probability of a specific feature given a the disease. For example, if all rabbits have it,
category, then one is driven in the opposite what proportion of dogs would? Rips found
direction of cue validity, namely to the most that judgments went up with the similarity
specific level. Given a particular feature, one of the two categories and with the typicality
is pretty much guaranteed to choose a cate- of the first (premise) category.
gory with that feature by choosing a specific Relatively little work on categorical in-
object known to have the feature. By trading duction was performed by cognitive psy-
off category and cue validity, the usefulness chologists immediately following Rips’s
of a category for predicting a feature and of seminal work. Instead, the banner was pur-
a feature for predicting a category, one can sued by developmental psychologists such as
the problem of induction 1 01
Carey (1 985 ). She focused on the theoretical Boys use GABA as a neurotransmitter.
schema that children learn through devel- Therefore, girls use GABA as a neu-
opment and how they use those schema to rotransmitter.
make inductive inferences across categories.
that can be written schematically as a list of
In particular, she showed that adults and
sentences:
1 0-year-olds used general biological knowl-
edge to guide their inductions about novel P1 . . . Pn /C (5 .1 )
animal properties, whereas small children
based their inductions on knowledge about in which the Pi are the premises of an argu-
humans. Gelman and Markman (1 986) ar- ment and C is the conclusion. Each state-
gued that children prefer to make inductive ment includes a category (e.g., boys) to
inferences using category structure rather which is applied a predicate (e.g., use GABA
than superficial similarity. However, it was as a neurotransmitter). In most of the exam-
the theoretical discussion and mathematical ples discussed, the categories will vary across
models of Osherson and his colleagues, dis- statements, whereas the predicate will re-
cussed in what follows, that led to an ex- main constant. The general question will be
plosion of interest by cognitive psychologists how people go about determining their be-
with a resulting menu of models and phe- lief in the conclusion of such an argument af-
nomena to constrain them. ter being told that the premises are true. We
discuss this question both by trying to de-
scribe human judgment as a set of phenom-
ena and by trying to explain the existence
Scope of Chapter
of these phenomena in terms of more fun-
damental and more general principles. The
To limit the scope of this chapter, in the phenomena will concern judgments of the
remainder we focus exclusively on the psy- strength of categorical arguments or the con-
chology of categorical induction: How peo- vincingness of an argument or some other
ple arrive at a statement of their confidence measure of belief in the conclusion once the
that a conclusion category has a predicate premises are given (reviewed by Heit, 2000).
after being told that one or more premise One way to represent the problem we
categories do. As Goodman’s (1 95 5 ) analysis address is in terms of conditional probabil-
makes clear, this is a very general problem. ity. The issue can be construed in terms of
Nevertheless, we do not address a number how people make judgments of the follow-
of issues related to induction. For example, ing form:
we do not address how people go about se-
lecting evidence to support a hypothesis (see P(Category C has some property |
Doherty et al., 1 996; Klayman & Ha, 1 987; Categories P1 . . . Pn have the property)
Oaksford & Chater, 1 994). We do not ad-
dress how people discover hypotheses but Indeed, some of the tasks we discuss involve
rather focus only on their degree of cer- a conditional probability judgment explic-
tainty in a prespecified hypothesis (cf. the itly. But even those that do not, such as ar-
distinction between the contexts of discov- gument strength, can be directly related to
ery and confirmation; Reichenbach, 1 93 8). judgments of conditional probability.
This rules out a variety of work on the topic Most of the experimental work we ad-
of hypothesis discovery (e.g., Klahr, 2000; dress attempts to restrict attention to how
Klayman, 1 988). Relatedly, we do not cover people use categories to reason by minimiz-
the variety of work on the topic of cue learning the role of the predicate in the reasoning
ing, that is, how people learn the predictive process. To achieve this, arguments are usu-
or diagnostic value of stimuli (see Buehner ally restricted to “blank” predicates – pred-
& Cheng, Chap. 7). icates that use relatively unfamiliar terms
Most of our discussion concerns the eval- (e.g., “use GABA as a neurotransmitter”) so
uation of categorical arguments, such as they do not contribute much to how people
reason about the arguments (Osherson, overlap and because they sometimes speak
Smith, Wilkie, López, & Shafir, 1 990). They at different levels of abstraction.
do contribute some, however. For instance,
all the predicates applied to animals are ob- Similarity-Based Induction
viously biological in nature, thus suggest-
ing that the relevant properties for reason- Perhaps the most obvious and robust pre-
ing are biological. Lo, Sides, Rozelle, and dictor of inductive strength is similarity. In
Osherson (2002) characterized blank pred- the simplest case, most people are willing
icates as “indefinite in their application to to project a property known to be true of
given categories, but clear enough to com- (say) crocodiles to a very similar class, such
municate the kind of property in question” as alligators, with some degree of confidence.
(p. 1 83 ). Such willingness exists either because simi-
Philosophers such as Carnap (1 95 0) and larity is a mechanism of induction (Osherson
Hacking (2001 ) have distinguished inten- et al., 1 990) or because induction and sim-
sional and extensional representations of ilarity judgment have some common an-
probability (sometimes called epistemic vs. tecedent (Sloman, 1 993 ). From the scores of
aleatory representations). Correspondingly, examples of the representativeness heuris-
in psychology we can distinguish modes tic at work (Tversky & Kahneman, 1 974)
of inference that depend on assessment of through Rosch’s (1 973 ) analysis of typicality
similarity structure and modes that depend in terms of similarity, a strong correlation be-
on analyses of set structure [see Lagnado tween probability and similarity is more the
& Sloman, (2004), for an analysis of the rule than the exception. The argument has
correspondence between the philosophical been made that similarity is not a real expla-
and psychological distinctions]. We refer nation at all (Goodman, 1 972; see the review
to the former as the inside view of cate- in Sloman & Rips, 1 998) and phenomena ex-
gory structure and the latter as the out- ist that contradict prediction based only on
side view (Sloman & Over, 2003 ; Tversky similarity (e.g., Gelman & Markman, 1 986).
& Kahneman, 1 983 ). In this chapter, we fo- Nevertheless, similarity remains the key con-
cus on induction from the inside via simi- struct in the description and explanation of
larity structure. We thus neglect a host of inductive phenomena.
work concerning, for example, how people Consider the similarity and typicality
make conditional probability judgments in phenomena (López, Atran, Coley, Medin,
the context of well-defined sample spaces & Smith, 1 997; Osherson et al., 1 990; Rips,
(e.g., Johnson-Laird et al., 1 999), reasoning 1 975 ):
using explicit statistical information (e.g., Similarity
Nisbett, 1 993 ), and the relative advantages Arguments are strong to the extent that
of different kinds of representational format categories in the premises are similar to
(e.g., Tversky & Kahneman, 1 983 ). the conclusion category. For example,
Robins have sesamoid bones.
Therefore, sparrows have sesamoid
Two Theoretical Approaches bones.
to Inductive Reasoning is judged stronger than
Robins have sesamoid bones.
A number of theoretical approaches have Therefore, ostriches have sesamoid
been taken to the problem of categorical in- bones.
duction in psychology. Using broad strokes,
because robins are more similar to spar-
the approaches can be classified into two
rows than to ostriches.
groups: similarity-based induction and induc-
tion as scientific methodology. We discuss each Typicality
in turn. As becomes clear, the approaches are The more typical premise categories are
not mutually exclusive both because they of the conclusion category, the stronger
is the argument. For example, people Therefore, buffaloes have 3 8 chromo-

are more willing to project a predicate somes.
from robins to birds than from penguins
is judged stronger than
to birds because robins are more typical
birds than penguins. Buffaloes have 3 8 chromosomes.
Therefore, tigers have 3 8 chromo-
The first descriptive mathematical ac- somes.
count of phenomena like these expressed
argument strength in terms of similarity. The similarity-coverage model explains it
Osherson et al. (1 990) posited the similarity- by appealing to typicality. Tigers are more
coverage model that proposed that people typical mammals than buffaloes and there-
make categorical inductions on the basis of fore tigers provide more category coverage.
two principles, similarity and category cover- The feature-based model explains it by ap-
age. Category coverage was actually cashed pealing to familiarity. Tigers are more fa-
out in terms of similarity. According to the miliar than buffaloes and therefore have
model, arguments are deemed strong to the more features. So the features of tigers cover
degree that premise and conclusion cate- more of the features of buffaloes than vice
gories are similar and to the degree that versa.
premises “cover” the lowest-level category Differences between the models play out
that includes both premise and conclusion in the analysis of several phenomena. The
categories. The idea is that the categories similarity-coverage model focuses on rela-
present in the argument elicit their com- tions among categories; the feature-based
mon superordinate – in particular, the most model on relations among properties. Con-
specific superordinate that they share. Cate- sider diversity (Osherson et al., 1 990):
gory coverage is determined by the similar-
Diversity
ity between the premise categories and all
The less similar premises are to each
the categories contained in this lowest-level other, the stronger the argument tends to
superordinate. be. People are more willing to draw the
Sloman (1 993 ) proposed a competing conclusion that all mammals love onions
theory of induction that reduces the two from the fact that hippos and hamsters
principles of similarity and category cov- love onions than from the fact that hip-
erage into a single principle of feature pos and rhinos do because hippos and
coverage. Instead of appealing to a class in- rhinos are more similar than hippos and
clusion hierarchy of superordinates and sub- hamsters.
ordinates, this theory appeals to the extent of The phenomenon has been demonstrated
overlap among the properties of categories. on several occasions with Western adults
Predicates are projected from premise cate- (e.g., López, 1 995 ), although some evi-
gories to a conclusion category to the degree dence suggests the phenomenon does not
that the previously known properties of the always generalize to other groups. López
conclusion category are also properties of the et al. (1 997) failed to find diversity ef-
premise categories – specifically, in propor- fects among Itza’ Maya. Proffitt, Coley, and
tion to the number of conclusion category Medin (2000) found that parks mainte-
features that are present in the premise cat- nance workers did not show diversity effects
egories. Both models can explain the simi- when reasoning about trees, although tree
larity, typicality, and asymmetry phenomena taxonomists did. Bailenson, Shum, Atran,
(Rips, 1 975 ): Medin, and Coley (2002) did not find di-
versity effects with either Itza’ Maya or
Asymmetry
bird experts. There is also some evidence
Switching premise and conclusion cate-
that children are not sensitive to diversity
gories can lead to arguments of different
strength: (Carey, 1 985 ; Gutheil & Gelman, 1 997;
López, Gelman, Gutheil, & Smith, 1 992).
Tigers have 3 8 chromosomes. However, using materials of greater interest
to young children, Heit and Hahn (2001 ) did to add more to category coverage than
find diversity effects with 5 - and 6-year-olds. giraffes.
The data show only mixed support for the Monotonicity and Nonmonotonicity
phenomenon. Nevertheless, it is predicted When premise categories are sufficiently
by the similarity-coverage model. Categories similar, adding a supporting premise will
that are less similar will tend to cover the su- increase the strength of an argument.
perordinate that includes them better than However, a counterexample to mono-
categories that are more similar. The feature- tonicity occurs when a premise with a
based model also predicts the phenomenon category dissimilar to all other categories
as a result of feature overlap. When cate- is introduced:
gories differ, their features have relatively Crows have strong sternums.
little overlap, and thus they cover a larger Peacocks have strong sternums.
part of feature space; when categories are Therefore, birds have strong sternums.
similar, their coverage of feature space is
more redundant. As a result, more dissim- is stronger than
ilar premises are more likely to show more Crows have strong sternums.
overlap with a conclusion category. How- Peacocks have strong sternums.
ever, this is not necessarily so and, indeed, Rabbits have strong sternums.
the feature-based model predicts a bound- Therefore, birds have strong sternums.
ary condition on diversity (Sloman, 1 993 ):
The similarity-coverage model explains non-
Feature exclusion monotonicity through its coverage term.
A premise category that has little over- The lowest-level category that must be cov-
lap with the conclusion category should ered in the first argument is birds because all
have no effect on argument strength categories in the argument are birds. How-
even if it leads to a more diverse set of ever, the lowest-level category that must be
premises. For example, covered in the second argument is more
Fact: German Shepherds have sesa- general – animals – because rabbits are not
moid bones. birds. Worse, rabbits are not similar to very
Fact: Giraffes have sesamoid bones. many animals; therefore, the category does
Conclusion: Moles have sesamoid not contribute much to argument strength.
bones. The feature-based model cannot explain this
phenomenon except with added assump-
is judged stronger than
tions – for example, that the features of
Fact: German Shepherds have sesa- highly dissimilar premise categories com-
moid bones. pete with one another – as explanations for
Fact: Blue whales have sesamoid bones. the predicate (see Sloman, 1 993 ).
Conclusion: Moles have sesamoid As the analysis of nonmonotonicities
bones. makes clear, the feature-coverage model dif-
fers from the similarity-coverage model pri-
even though the second argument has a marily in that it appeals to properties of cat-
more diverse set of premises than the first. egories rather than instances in explaining
The feature-based model explains this by ap- induction phenomena and, as a result, in not
pealing to the lack of feature overlap be- appealling to the inheritance relations of a
tween blue whales and moles over and above class inclusion hierarchy. That is, it assumes
the overlap between German Shepherds and people will not in general infer that a cate-
moles. To explain this phenomenon, the gory has a property because its superordi-
similarity-coverage model must make the ad nate does. Instead, it assumes that people
hoc assumption that blue whales are not sim- think about categories in terms of their struc-
ilar enough to other members of the lowest- tural relations, in terms of property over-
level category, including all categories in lap and relations among properties. This
the arguments (presumably mammals), is surely the explanation for the inclusion
fallacy (Osherson et al., 1 990; Shafir, Smith, inductive inference. This might be related
& Osherson, 1 990): to the limited role of inclusion relations in
other kinds of categorization tasks. For ex-
Inclusion Fallacy
ample, Hampton (1 982) showed intransitiv-
Similarity relations can override categor-
ical relations between conclusions. Most ities in category verification using everyday
people judge objects. He found, for example, that people
affirmed that “A car headlight is a kind of a
All robins have sesamoid bones. lamp” and that “A lamp is a kind of furni-
Therefore, all birds have sesamoid ture,” but not “A car headlight is a kind of
bones. furniture.”
to be stronger than People are obviously capable of infer-
ring a property from a general to a more
All robins have sesamoid bones.
specific category. Following an explanation
Therefore, all ostriches have sesamoid
that appeals to inheritance is not difficult (I
bones.
know naked mole rats have livers because all
mammals have livers). However, the inclu-
Of course, ostriches are birds, and so the first
sion fallacy and the inclusion similarity phe-
conclusion implies the second; therefore, the
nomenon show that such information is not
second argument must be stronger than the
inevitably, and therefore, not automatically
first. Nevertheless, robins are highly typical
included in the inference process.
birds and therefore similar to other birds.
Gelman and Markman showed that
Yet they are distinct from ostriches. These
children use category labels to mediate
similarity relations determine most people’s
induction:
judgments of argument strength rather than
the categorical relation. Naming effect
An even more direct demonstration of Children prefer to project predicates be-
failure to consider category inclusion rela- tween objects that look similar rather
tions is the following (Sloman, 1 993 , 1 998): than objects that look dissimilar. How-
ever, this preference is overridden when
Inclusion Similarity the dissimilar objects are given similar
Similarity relations can override even labels.
transparent categorical relations be-
tween premise and conclusion. People do Gelman and Coley (1 990) showed that chil-
not always judge dren as young as 2 years old are also sensitive
Every individual body of water has a to the use of labels. So, on the one hand, peo-
high number of seiches. ple are extremely sensitive to the informa-
tion provided by labels when making induc-
Every individual lake has a high num- tive inferences. On the other hand, the use
ber of seiches. of structured category knowledge for induc-
to be perfectly strong even when they tive inference seems to be a derivative abil-
agree that a lake is a body of water. More- ity, not a part of the fabric of the reasoning
over, they judge process. This suggests that the naming effect
Every individual body of water has a
does not concern how people make infer-
high number of seiches. ences using knowledge about category struc-
ture per se, because if the use of structural
Every individual reservoir has a high knowledge is not automatic, very young chil-
number of seiches. dren would not be expected to use it. Rather,
the effect seems to be about the pragmatics
to be even weaker, presumably because of language – in particular, how people use
reservoirs are less typical bodies of water
language to mediate induction. The nam-
than lakes.
ing effect probably results from people’s ex-
These examples suggest that category inclu- treme sensitivity to experimenters’ linguistic
sion knowledge has only a limited role in cues. Even young children apparently have
the capacity to note that when an experi- constitute the basic-level categories that
menter gives two objects similar labels, the are often used to organize hierarchical lin-
experimenter is giving a hint, a hint that the guistic and conceptual categories (Brown,
objects should be treated similarly at least in 1 95 8; Rosch et al., 1 976; see Murphy, 2002,
the context of the experiment. This ability for a review). Nevertheless, the dominance
to take cues from others, and to use language of generic categories was not expected by
to do so, may well be key mechanisms of hu- Coley et al. (1 997) because Rosch et al.
man induction. (1 976) had found that for the biological cate-
This is also the conclusion of cross- gories tree, fish, and bird, the life-form level
cultural work by Coley, Medin, and Atran was the category level satisfying a number
(1 997). Arguments are judged stronger the of operational definitions of the basic level.
more specific the categories involved. If told For example, Rosch et al.’s American col-
that dalmations have an ulnar artery, peo- lege students preferred to call objects they
ple are more willing to generalize ulnar ar- were shown “tree,” “fish,” or “bird” rather
teries to dogs than to animals (Osherson than “oak,” “salmon,” or “robin.”
et al., 1 990). Coley et al. (1 997) compared Why the discrepancy? Why do American
people’s willingness to project predicates college students prefer to name an object a
from various levels of the hierarchy of liv- tree over an oak, yet prefer to project a prop-
ing things to a more general level. For ex- erty from all red oaks to all oaks rather than
ample, when told that a subspecific category from all oaks to all trees? Perhaps they simply
such as “male black spider monkey” is sus- cannot identify oaks, and therefore fall back
ceptible to an unfamiliar disease, did partic- on the much more general “tree” in order
ipants think that the members of the folk- to name. However, this begs the question:
specific category “black spider monkey” were If students consider “tree” to be informative
susceptible? If members of the specific cat- and precise enough to name things, why are
egory were susceptible, then were members they unwilling to project properties to it?
of the folk-generic category (“spider mon- Coley et al.’s (1 997) answer to this conun-
key”) also susceptible? If members of the drum is that naming depends on knowledge;
generic category were susceptible, then were that is, names are chosen that are precise
members of the life-form category (“mam- enough to be informative given what peo-
mal”) also susceptible? Finally, if the life- ple know about the object being named. In-
form category displayed susceptibility, then ductive inference, they argued, also depends
did the kingdom (“animal”)? Coley et al. on a kind of conventional wisdom. People
found that both American college students have learned to maximize inductive poten-
and members of a traditional Mayan village tial at a particular level of generality (the
in lowland Guatemala showed a sharp drop folk-generic) level because culture and lin-
off at a certain point: guistic convention specify that that is the
most informative level for projecting prop-
Preferred level of induction erties (see Greenfield, Chap. 27). For exam-
People are willing to make an inductive ple, language tends to use a single morpheme
inference with confidence from a subor-
for naming generic level categories. This is
dinate to a near superordinate up to the
a powerful cue that members of the same
folk-generic level; their willingness drops
off considerably when making inferences generic level have a lot in common and that
to categories more abstract. therefore it is a good level for guessing that
a predicate might hold across it. This idea is
These results are consistent with Berlin’s related to Shipley’s (1 993 ) notion of overhy-
(1 992) claim that the folk-generic level is potheses (cf. Goodman, 1 95 5 ): that people
the easiest to identify, the most commonly use categorywide rules about certain kinds
distinguished in speech, and serves best to of properties to make some inductive in-
distinguish categories. Therefore, one might ferences. For example, upon encountering a
imagine that the folk-generic level would new species, people might assume members
of the species will vary more in degree of and a variety of evidence shows that beliefs
obesity than in, say, skin color (Nisbett et al., about what is being perceived affects what
1 983 ) despite having no particular knowl- is perceived (e.g., Gregory, 1 973 ). Never-
edge about the species. theless, as suggested by the following phe-
This observation poses a challenge to nomena, induction is mediated by knowl-
feature- and similarity-based models of in- edge of categories’ role in causal systems;
duction (Heit, 1 998; Osherson et al., 1 990; beliefs about the way the world works influ-
Sloman, 1 993 ). These models all start from ence induction as much as overlap of prop-
the assumption that people induce new erties does. Mandler and McDonough’s data
knowledge about categories from old knowl- provide evidence that this is true even for
edge about the same categories. However, if 1 4-month-olds.
people make inductive inferences using not
only specific knowledge about the categories Induction as Scientific Methodology
at hand but also distributional knowledge
about the likelihood of properties at differ- Induction is of course not merely the
ent hierarchical levels, knowledge that is in province of individuals trying to accom-
part culturally transmitted via language, then plish everyday goals, but also one of the
more enters the inductive inference pro- main activities of science. According to one
cess than models of inductive process have common view of science (Carnap, 1 966;
heretofore allowed. Hempel, 1 965 ; Nagel, 1 961 ; for opposing
Mandler and McDonough (1 998) argued views, see Hacking, 1 983 ; Popper, 1 963 ), sci-
that the basic-level bias comes relatively late, entists spend much of their time trying to
and demonstrated that 1 4-month-old infants induce general laws about categories from
show a bias to project properties within a particular examples. It is natural, therefore,
broad domain (animals or vehicles) rather to look to the principles that govern induc-
than at the level usually considered to be tion in science to see how well they describe
basic. This finding is not inconsistent with individual behavior (for a discussion of sci-
Coley et al.’s (1 997) conclusion because entific reasoning, see Dunbar & Fugelsang,
the distributional and linguistic properties Chap. 29). Psychologists have approached
that they claim mediate induction presum- induction as a scientific enterprise in three
ably have to be learned, and so finding a different ways.
basic-level preference only amongst adults
is sufficient for their argument. Mandler the rules of induction
and McDonough (1 998) argued that infants’ First, some have examined the extent to
predilection to project to broad domains which people abide by the normative rules
demonstrates an initial propensity to rely of inductive inference that are generally ac-
on “conceptual” as opposed to “perceptual” cepted in the scientific community. One
knowledge as a basis for induction, meaning such rule is that properties that do not vary
that infants rely on the very abstract com- much across category instances are more
monalities among animals as opposed to the projectible across the whole category than
perhaps more obvious physical differences properties that vary more. Nisbett et al.
among basic-level categories (pans vs. cups (1 983 ) showed that people are sensitive to
and cats vs. dogs). Of course, pans and cups this rule:
do have physical properties in common that
Variability/Centrality
distinguish them from cats and dogs (e.g.,
People are more willing to project predi-
the former are concave, the latter have ar-
cates that tend to be invariant across cat-
ticulating limbs). Moreover, the distinction egory instances than variable predicates.
between perceptual and conceptual prop- For example, people who are told that
erties is tenuous. Proximal and distal stim- one Pacific island native is overweight
uli are necessarily different (i.e., even the tend to think it is unlikely that all na-
eye engages in some form of interpretation), tives of the island are overweight because
weight tends to vary across people. In are less similar to field mice than to tigers),
contrast, if told the native has dark skin, the second argument might seem stronger
they are more likely to generalize to all because house cats could conceivably be-
natives because skin color tends to be come infected with the parasite Floxum
more uniform within a race. while hunting field mice. Even if you do not
However, sensitivity to variability does find the second argument stronger, merely
not imply that people consider the variabil- accepting the relevance of this infection sce-
ity of predicates in the same deliberative nario undermines the diversity principle,
manner that a scientist should. This phe- which prescribes that the similarity principle
nomenon could be explained by a sensitivity should be determinative for all pairs of ar-
to centrality (Sloman, Love, & Ahn, 1 998). guments. At minimum, it shows that the di-
Given two properties A and B, such that B versity principle does not dominate all other
depends on A but A does not depend on principles of sound inference.
B, people are more willing to project prop- Lo et al. (2002) proved that a different
erty A than property B because A is more and simple principle of argument strength
causally central than B, even if A and B does follow from the Bayesian philosophy
are equated for variability (Hadjichristidis, of science. Consider two arguments with
Sloman, Stevenson, & Over, 2004). More the same conclusion in which the conclu-
central properties tend to be less variable. sion implies the premises. For example, the
Having a heart is more central and less vari- conclusion “every single mammal carries
able among animals than having hair. Cen- the parasite Floxum” implies that “every sin-
trality and variability are almost two sides of gle tiger carries the parasite Floxum” (on the
the same coin (the inside and outside views, assumption that “mammal” and “tiger” re-
respectively). In Nisbett et al.’s case, having fer to natural, warm-blooded animals). In
dark skin may be seen as less variable than such a case, the argument with the less
obesity by virtue of being more central and likely premises should be stronger. Lo et al.
having more apparent causal links to other referred to this as the premise probability
features of people. principle. In a series of experiments, they
The diversity principle is sometimes iden- show that young children in both the United
tified as a principle of good scientific prac- States and Taiwan make judgments that con-
tice (e.g., Heit & Hahn, 2001 ; Hempel, 1 965 ; form to this principle.
López, 1 995 ). Yet, Lo et al. (2002) argued
against the normative status of diversity. induction as naive scientific theorizing
They consider the following argument:
A second approach to induction as a scien-
House cats often carry the parasite tific methodology examines the contents of
Floxum. beliefs, what knowledge adults and children
Field mice often carry the parasite make use of when making inductive infer-
Floxum. ences. Because knowledge is structured in a
way that has more or less correspondence
All mammals often carry the parasite to the structure of modern scientific theo-
Floxum. ries, sometimes to the structure of old or
which they compare to discredited scientific theories, such knowl-
edge is often referred to as a “naive the-
House cats often carry the parasite
Floxum.
ory” (Carey, 1 985 ; Gopnik & Meltzoff, 1 997;
Keil, 1 989; Murphy & Medin, 1 985 ). One
Tigers often carry the parasite Floxum.
strong, contentful position (Carey, 1 985 ) is
All mammals often carry the parasite that people are born with a small num-
Floxum. ber of naive theories that correspond to a
small number of domains such as physics,
Even though the premise categories of the biology, psychology, and so on, and that all
first argument are more diverse (house cats other knowledge is constructed using these
original theories as a scaffolding. Perhaps, for and behavior than to objects’ mere appear-
example, other knowledge is a metaphorical ance, even though appearance is, by defini-
extension of these original naive theories (cf. tion, more directly observable.
Lakoff & Johnson, 1 980). Of course, analogies between everyday
One phenomenon studied by Carey induction and scientific induction have to
(1 985 ) to support this position is exist. As long as both children and scien-
tists have beliefs that have positive induc-
Human bias tive potential, those beliefs are likely to have
Small children prefer to project a prop- some correspondence to the world, and the
erty from people rather than from other knowledge of children and scientists will
animals. Four-year-olds are more likely therefore have to show some convergence.
to agree that a bug has a spleen if told
If children did operate merely on the basis
that a person does than if told that a
bee does. Ten-year-olds and adults do of superficial similarities, such things as pho-
not show this asymmetry and project as tographs and toy cars would forever stump
readily from nonhuman animals as from them. Children have no choice but to be
humans. “little scientists,” merely to walk around the
world without bumping into things. Because
Carey argued that this transition is due to a of the inevitability of such correspondences
major reorganization of the child’s knowl- and because scientific theories take a multi-
edge about animals. Knowledge is consti- tude of different forms, it is not obvious that
tuted by a mutually constraining set of con- this approach, in the absence of a more fully
cepts that make a coherent whole in analogy specified model, has much to offer theories
to the holistic coherence of scientific the- of cognition. Furthermore, proponents of
ories. As a result, concepts do not change this approach typically present a rather im-
in isolation, but instead as whole networks poverished view of scientific activity, which
of belief are reorganized (Kuhn, 1 962). On neglects the role of social and cultural norms
this view, the human bias occurs because and practices (see Faucher et al., 2002). Ef-
a 4-year-old’s understanding of biological forts to give the approach a more principled
functions is framed in terms of human be- grounding have begun (e.g., Gopnik et al.,
havior, whereas older children and adults 2004; Rehder & Hastie, 2001 ; Sloman, Love,
possess an autonomous domain of biologi- & Ahn, 1 998).
cal knowledge. Lo et al. (2002) rejected the approach
A different enterprise is more descriptive; outright. They argue that it just does
it simply shows the analogies between not matter whether people have repre-
knowledge structures and scientific theories. sentational structures that in one way or
For example, Gopnik and Meltzoff (1 997) another are similar to scientific theories.
claimed that, just like scientists, both chil- The question that they believe has both
dren and laypeople construct and revise prescriptive value for improving human
abstract lawlike theories about the world. induction and descriptive value for develop-
In particular, they maintain that the gen- ing psychological theory is whether what-
eral mechanisms that underlie conceptual ever method people use to update their be-
change in cognitive development mirror liefs conforms to principles of good scientific
those responsible for theory change in ma- practice.
ture science. More specifically, even very
young children project properties among
natural kinds on the basis of latent, underly- computational models of induction
ing commonalities between categories rather The third approach to induction as a sci-
than superficial similarities (e.g., Gelman & entific methodology is concerned with the
Coley, 1 990). So children behave like “little representation of inductive structure with-
scientists” in the sense that their inductive out concern for the process by which peo-
inferences are more sensitive to the causal ple make inductive inferences. The approach
principles that govern objects’ composition takes its lead from Marr’s (1 982) analysis of
the different levels of psychological analy- data given in the premises to determine a
sis. Models at the highest level, those that posterior degree of belief in the conclusion.
concern themselves with a description of the Prior beliefs concern relative likelihoods that
goals of a cognitive system without direct de- each combination of categories in the argu-
scription of the manner in which the mind ment would all have the relevant property.
tries to attain those goals or how the system For example, for the argument
is implemented in the brain, are computa-
Cows can get disease X.
tional models. Three kinds of computational
Sheep can get disease X.
models of inductive inference have been sug-
gested, all of which find their motivation in Heit assumes people can generate beliefs
principles of good scientific methodology. about the relative prior probability that both
cows and sheep have the disease, that cows
Induction as Hypothesis Evaluation Mc-
do but sheep do not, and so on. These be-
Donald, Samuels, and Rispoli (1 996) pro-
liefs are generated heuristically; people are
posed an account of inductive inference that
assumed to bring to mind properties shared
appeals to several principles of hypothesis
by cows and by sheep, properties that cows
evaluation. They argued that when judging
have but sheep do not, and so on. The prior
the strength of an inductive argument, peo-
probabilities reflect the ease of bringing each
ple actively construct and assess hypothe-
type of property to mind. Premises contri-
ses in light of the evidence provided by the
bute other information as well – in this case,
premises. They advanced three determinants
that only states in which cows indeed have
of hypothesis plausibility: the scope of the
the disease are possible. This can be used to
conclusion, the number of premises that in-
update priors to determine a posterior de-
stantiate it, and the number of alternatives to
gree of belief that the conclusion is true.
it suggested by the premises. In their experi-
On the basis of assumptions about what
ments, all three factors were good predictors
people’s priors are, Heit (1 998) described
of judged argument strength, although cer-
a number of the phenomena of categori-
tain pragmatic considerations, and a fourth
cal induction: similarity, typicality, diversity,
factor – “acceptability of the conclusion” –
and homogeneity. However, the model is
were also invoked to fully cover the results.
inconsistent with nonmonotonicity effects.
Despite the model’s success in explain-
Furthermore, because it relies on an exten-
ing some judgments, others, such as non-
sional updating rule, Bayes’ rule, the model
monotonicity, are only dealt with by appeal
cannot explain phenomena that are nonex-
to pragmatic postulates that are not de-
tensional such as the inclusion fallacy or the
fended in any detail. Moreover, the model
inclusion-similarity phenomenon.
is restricted to arguments with general con-
Sanjana and Tenenbaum (2003 ) offered a
clusions. Because the model is at a com-
Bayesian model of categorical inference with
putational level of description, it does not
a more principled foundation. The model is
make claims about the cognitive processes
applied only to the animal domain. They de-
involved in induction. As we see next, other
rive all their probabilities from a hypothesis
computational models do offer something
space that consists of clusters of categories.
in place of a process model that McDonald
The model’s prediction for each argument
et al.’s (1 996) framework does not: a rigor-
derives from the probability that the conclu-
ous normative analysis of an inductive task.
sion category has the property. This reflects
Bayesian models of inductive inference the probability that the conclusion category
Heit (1 998) proposed that Bayes’ rule pro- is an element of likely hypotheses – namely,
vides a representation for how people de- that the conclusion category is in the same
termine the probability of the conclusion of cluster as the examples shown (i.e., as the
a categorical inductive argument given that premise categories) and that those hypoth-
the premises are true. The idea is that peo- esized clusters have high probability. The
ple combine degrees of prior belief with the probability of each hypothesis is assumed to
be inversely related to the size of the hy- Unfortunately, current ideas and data place
pothesis (the number of animal types it in- too few constraints on the cognitive pro-
cludes) and to its complexity, the number of cesses and procedures that people actu-
disjoint clusters that it includes. This model ally use.
performed well in quantitative compar-
isons against the similarity-coverage model
and the feature-based model, although its Conclusions and Future Directions
consistency with the various phenomena
of induction has not been reported and is
We have reviewed two ways that cognitive
rather opaque.
scientists have tried to describe how peo-
The principled probabilistic foundation
ple make inductive inferences. We limited
of this model and its good fit to data so far
the scope of the problem to that of cat-
yield promise that the model could serve
egorical induction – how people generate
as a formal representation of categorical in-
degrees of confidence that a predicate ap-
duction. The model would show even more
plies to a stated category from premises con-
promise and power to generalize, however, if
cerning other categories that the predicate is
its predictions had been derived using more
assumed to apply to. Nevertheless, neither
reasonable assumptions about the structure
approach is a silver bullet. The similarity-
of categorical knowledge. The pairwise clus-
based approach has produced the most well-
ter hierarchy Sanjana and Tenenbaum use
specified models and phenomena, although
to represent knowledge of animals is poorly
consideration of the relation between scien-
motivated (although see Kemp & Tenen-
tific methodology and human induction may
baum, 2003 , for an improvement), and there
prove the most important prescriptively and
would be even less motivation in other do-
may in the end provide the most enduring
mains (cf. Sloman, 1 998). Moreover, if and
principles to distinguish everyday human in-
how the model could explain fallacious rea-
duction from ideal – or at least other – in-
soning is not clear.
ductive processes.
A more liberal way to proceed is to ac-
summary of induction as scientific
methodology cept the apparent plurality of procedures
and mechanisms that people use to make in-
Inductive inference can be fallacious, as
ductions and to see this pluralism as a virtue
demonstrated by the inclusion fallacy de-
rather than a vice.
scribed previously. Nevertheless, much of
the evidence that has been covered in this
The Bag of Tricks
section suggests that people in the psychol-
ogist’s laboratory are sensitive to some of the Many computational problems are hard be-
same concerns as scientists when they make cause the search space of possible answers
inductive inferences. People are more likely is so large. Computer scientists have long
to project nonvariable over variable predi- used educated guesses or what are often
cates, they change their beliefs more when called heuristics or rules of thumb to prune
premises are a priori less likely, and their be- the search space, making it smaller and
havior can be modeled by probabilistic mod- thus more tractable at the risk of making
els constructed from rational principles. the problem insoluble by pruning off the
Other work reviewed shows that peo- best answers. The work of Kahneman and
ple, like scientists, use explanations to medi- Tversky imported this notion of heuristics
ate their inference. They try to understand into the study of probability judgment (see
why a category should exhibit a predicate Kahneman & Frederick, Chap. 1 2). They
based on nonobservable properties. These suggested that people use a set of cognitive
are valuable observations to allow psychol- heuristics to estimate probabilities – heuris-
ogists to begin the process of building a tics that were informed, that made people’s
descriptive theory of inductive inference. estimates likely to be reasonable, but left
open the possibility of systematic error in than from tigers to hawks but more likely
cases in which the heuristics that came nat- to project “prefers to feed at night” from
urally to people had the unfortunate conse- tigers to hawks than from chickens to
quence of leading to the wrong answer. hawks.
Kahneman and Tversky suggested the
heuristics of availability, anchoring and ad- More specifically, argument strength de-
justment, simulation, and causality to depends on how people explain why the cat-
scribe how people make probability judg- egory has the predicate. In the example,
ments. They also suggested that people make chickens and hawks are known to have bi-
judgments according to representativeness, ological properties in common, and there-
the degree to which a class or event used fore, people think it likely that a biologi-
as evidence is similar to the class or process cal predicate would project from one to the
being judged. Representativeness is a very other. Tigers and hawks are known to both
abstract heuristic that is compatible with a be hunters and carnivores; therefore “prefers
number of different models of the judgment to feed at night” is more likely to project be-
process. We understand it not so much as tween them. Sloman (1 994) showed that the
a particular claim about how people make strength of an argument depends on whether
probability judgments as the claim that pro- the premise and conclusion are explained in
cesses of categorization and similarity play the same way. If the premise and conclusion
central roles in induction. This is precisely have different explanations, the premise can
the claim of the similarity-based model out- actually reduce belief in the conclusion.
lined previously. The explanations in these cases are causal;
We believe that the bag of tricks describes they refer to more or less well-understood
most completely how people go about mak- causal processes. Medin, Coley, Storms, and
ing inductive leaps. People seem to use a Hayes (2003 ) have demonstrated five dis-
number of different sources of information tinct phenomena that depend on causal intu-
for making inductive inferences, including itions about the relations amongst categories
the availability of featural information and and predicates. For example, they showed
knowledge about feature overlap, linguistic
cues about the distribution of features, the Causal asymmetry
relative centrality of features to one another, Switching premise and conclusion cate-
the relative probability of premises, and ob- gories will reduce the strength of an argu-
ment if a causal path exists from premise
jects’ roles in causal systems.
to conclusion. For example,
Gazelles contain retinum.
Causal Induction
Lions contain retinum.
Our guess is that the treasure trove for fu-
is stronger than
ture work in categorical induction is in the
development of the latter mode of infer- Lions contain retinum.
ence. How do people go about using causal Gazelles contain retinum.
knowledge to make inductions? That they because the food chain is such that lions
do is indisputable. Consider the following eat gazelles and retinum could be trans-
phenomenon due to Heit and Rubinstein ferred in the process.
(1 994):
What is striking about this kind of example
Relevance
is the exquisite sensitivity to subtle (if mun-
People’s willingness to project a predi-
cate from one category to another de- dane) causal relations that it demonstrates.
pends on what else the two categories The necessary causal explanation springs to
have in common. For example, people mind quickly, apparently automatically, and
are more likely to project “has a liver with it does so even though it depends on one
two chambers” from chickens to hawks fact that most people are only dimly aware
of (that lions eat gazelles) among the vast peal to causality, then the appeal to scientific
number of facts that are at our disposal. methodology is trivial.
We do not interpret the importance of Normative models of causal structure
causal relations in induction as support for have recently flowered (cf. Pearl, 2000;
psychological essentialism, the view that Spirtes, Glymour, & Scheines, 1 993 ), and
people base judgments concerning cate- some of the insights of these models seem
gories on attributions of “essential” qualities: to have some psychological validity (Sloman
of a true underlying nature that confers & Lagnado, 2004). Bringing them to bear
kind identity unlike, for example, Kornblith on the problem of inductive inference will
(1 993 ), Medin and Ortony (1 989), and not be trivial. However, the effort should be
Gelman and Hirschfeld (1 999). We rather made because causal modeling seems to be
follow Strevens (2001 ) in the claim that it a critical element of the bag of tricks that
is causal structure per se that mediates in- people use to make inductive inferences.
duction; no appeal to essential properties
is required (cf. Rips, 2001 ; Sloman & Malt,
2003 ). Indeed, the causal relations that sup- Acknowledgments
port inductive inference can be based on
very superficial features that might be very
We thank Uri Hasson and Marc Buehner for
mutable. To illustrate, the argument
comments on an earlier draft. This work was
Giraffes eat leaves of type X. funded by NASA grant NCC2-1 21 7.
African tawny eagles eat leaves of type X.
seems reasonably strong only because both
giraffes and African eagles can reach high References
leaves and both are found in Africa – hardly
a central property of either species. Bailenson, J. B., Shum, M. S., Atran, S., Medin,
The appeal to causal structure is instead D., & Coley, J. D. (2002). A bird’s eye view:
intended to appeal to the ability to pick Biological categorization and reasoning within
and across cultures. Cognition, 84, 1 –5 3 .
out invariants and act as agents to make
use of those invariants. Organisms have a Berlin, B. (1 992). Ethnobiological classification:
Principles of categorization of plants and animals
striking ability to find the properties of
in traditional societies. Princeton, NJ: Princeton
things that maximize their ability to predict
University Press.
and control, and humans seem to have the
Brown, R. (1 95 8). How shall a thing be called?
most widely applicable capacity of this sort.
Psychological Review, 65 , 1 4–21 .
However, prediction and control come from
Carey, S. (1 985 ). Conceptual change in childhood.
knowing what variables determine the val-
Cambridge, MA: MIT Press.
ues of other variables – that is, how one pre-
Carnap, R. (1 95 0). The logical foundations of prob-
dicts future outcomes and knows what to
ability. Chicago: University of Chicago Press.
manipulate to achieve an effect. This is, of
Carnap, R. (1 966). Philosophical foundations of
course, the domain of causality. It seems only
physics. Ed. M. Gardner, New York: Basic
natural that people would use this talent to Books.
reason when making inductive inferences.
Cheng, P. W., & Holyoak, K. J. (1 985 ). Pragmatic
The appeal to causal relations is not nec- reasoning schemas. Cognitive Psychology, 1 7,
essarily an appeal to scientific methodology. 3 91 –41 6.
In fact, some philosophers such as Russell Coley, J. D., Medin, D. L., & Atran, S. (1 997).
(1 91 3 ) argued that theories are not scientific Does rank have its privilege? Inductive infer-
until they are devoid of causal reference, and ences within folkbiological taxonomies. Cogni-
the logical empiricists attempted to exorcise tion, 64, 73 –1 1 2.
the notion of causality from “scientific” phi- Corter, J. & Gluck, M. (1 992). Explaining basic
losophy. Of course, to the extent that sci- categories: feature predictability and informa-
entists behave like other people in their ap- tion. Psychological Bulletin, 1 1 1 , 291 –3 03 .
Doherty, M. E., Chadwick, R., Garavan, H., and property induction. Cognitive Science, 2 8,
Barr, D., & Mynatt, C. R. (1 996). On people’s 45 –74.
understanding of the diagnostic implications of Hampton, J. A. (1 982). A demonstration of in-
probabilistic data. Memory and Cognition, 2 4, transitivity in natural categories. Cognition, 1 2 ,
644–65 4. 1 5 1 –1 64.
Faucher, L., Mallon, R., Nazer, D., Nichols, S., Heit, E. (1 998). A Bayesian analysis of some
Ruby, A., Stich, S., & Weinberg, J. (2002). The forms of inductive reasoning. In M. Oaksford
baby in the lab coat. In P. Carruthers, S. Stich, & N. Chater (Eds.), Rational models of cognition
& M. Siegal (Eds.), The cognitive basis of science. (pp. 248–274). Oxford, UK: Oxford University
Cambridge, UK: Cambridge University Press. Press.
Fisher, D. H. (1 987). Knowledge acquisition Heit, E. (2000). Properties of inductive reasoning.
via incremental conceptual clustering. Machine Psychonomic Bulletin and Review, 7, 5 69–5 92.
Learning, 2 , 1 3 9–1 72. Heit, E., & Hahn, U. (2001 ). Diversity-based rea-
Frege, G. W. (1 880). Posthumous writings. Black- soning in children. Cognitive Psychology, 47,
well, 1 979. 243 –273 .
Gelman, S. A., & Coley, J. D. (1 990). The impor- Heit, E., & Rubinstein, J. (1 994). Similarity and
tance of knowing a dodo is a bird: Categories property effects in inductive reasoning. Journal
and inferences in 2-year-old children. Develop- of Experimental Psychology: Learning, Memory,
mental Psychology, 2 6, 796–804. and Cognition, 2 0, 41 1 –422.
Gelman, S. A., & Hirschfeld, L. A. (1 999). How Hempel, C. (1 965 ). Aspects of scientific explana-
biological is essentialism? In D. L. Medin & S. tion. New York: Free Press.
Atran (Eds.), Folkbiology (pp. 403 –446). Cam- Hume, D. (1 73 9). A treatise of human nature. Ed.
bridge, MA: MIT Press. D. G. C. Macnabb. London: Collins.
Gelman, S. A., & Markman, E. M. (1 986). Cate- Hume, D. (1 748). An enquiry concerning human
gories and induction in young children. Cogni- understanding. Oxford, UK: Clarendon.
tion, 2 3 , 1 83 –209.
Johnson-Laird, P. N., Legrenzi, P., Girotto, V.,
Glymour, C. (2001 ). The mind’s arrows: Bayes Legrenzi, M., & Caverni, J-P. (1 999). Naive
nets and graphical causal models in psychology. probability: a mental model theory of exten-
Cambridge, MA: Bradford Books. sional reasoning. Psychological Review, 106, 62–
Goodman, N. (1 95 5 ). Fact, fiction, and forecast. 88.
Cambridge, MA: Harvard University Press. Jones, G. (1 983 ). Identifying basic categories. Psy-
Gopnik, A., Glymour, C., Sobel, D. M., Schulz, chological Bulletin, 94, 423 –428.
L. E., Kushnir, T., & Danks, D. (2004). A theory Keil, F. C. (1 989). Concepts, kinds, and cognitive
of causal learning in children: Causal maps and development. Cambridge, MA: MIT Press.
Bayes nets. Psychological Review, 1 1 1 , 1 –3 1 .
Kemp, C., & Tenenbaum, J. B. (2003 ). Theory-
Gopnik, A., & Meltzoff, A. N. (1 997). Words, based induction. Proceedings of the twenty-fifth
thoughts, and theories. Cambridge, MA: MIT annual conference of the Cognitive Science Soci-
Press. ety, Boston, MA.
Gregory, R. L. (1 973 ). The intelligent eye. New Klahr, D. (2000). Exploring science: The cognition
York: McGraw-Hill. and development of discovery processes. Cam-
Gutheil, G., & Gelman, S. A. (1 997). The use of bridge, MA: MIT Press.
sample size and diversity in category-based in- Klayman, J., & Ha, Y-W. (1 987). Confirmation,
duction. Journal of Experimental Child Psychol- disconfirmation, and information in hypothesis
ogy, 64, 1 5 9–1 74. testing. Psychological Review, 94, 21 1 –228.
Hacking, I. (1 983 ). Representing and intervening: Klayman, J. (1 988). Cue discovery in probabilistic
Introductory topics in the philosophy of natural environments: Uncertainty and experimenta-
science. Cambridge, UK: Cambridge University tion. Journal of Experimental Psychology: Learn-
Press. ing, Memory, and Cognition, 1 4, 3 1 7–3 3 0.
Hacking, I. (2001 ). An introduction to probability Kornblith, H. (1 993 ). Inductive inference and its
and inductive logic. Cambridge, UK: Cambridge natural ground. Cambridge: MIT Press.
University Press. Kuhn, T. (1 962). The structure of scientific
Hadjichristidis, C., Sloman, S. A., Stevenson, revolutions. Chicago: University of Chicago
R. J., & Over D. E. (2004). Feature centrality Press.
Lagnado, D., & Sloman, S. A., (2004). Inside Nagel, E. (1 961 ). The structure of science: Problems
and outside probability judgment. In D. J. in the logic of scientific explanation. New York:
Koehler & N. Harvey (Eds.), Blackwell hand- Harcourt, Brace and World.
book of judgment and decision making (pp. 1 5 7– Nisbett, R. E. (Ed.) (1 993 ). Rules for reasoning.
1 76). Oxford, UK: Blackwell Publishing. Hillsdale, NJ: Erlbaum.
Lakoff, G. & Johnson, M. (1 980). Metaphors Nisbett, R. E., Krantz, D. H., Jepson, D. H., &
we live by. Chicago: University of Chicago Kunda, Z. (1 983 ). The use of statistical heuris-
Press. tics in everyday inductive reasoning. Psycholog-
Lipton, P. (1 991 ). Inference to the best explanation. ical Review, 90, 3 3 9–3 63 .
New York: Routledge. Oaksford, M., & Chater, N. (1 994). A ratio-
Lo, Y., Sides, A., Rozelle, J., & Osherson, D. nal analysis of the selection task as optimal
(2002). Evidential diversity and premise prob- data selection. Psychological Review, 1 01 , 608–
ability in young children’s inductive judgment. 63 1 .
Cognitive Science, 2 6, 1 81 –206. Osherson, D. N., Smith, E. E., Wilkie, O., López,
Lopez, A. (1 995 ). The diversity principle in the A., & Shafir, E. (1 990). Category-based induc-
testing of arguments. Memory and Cognition, tion. Psychological Review, 97, 1 85 –200.
2 3 , 3 74–3 82. Pearl, J. (2000). Causality. Cambridge, MA:
López, A., Atran, S., Coley, J. D., Medin, D. L., Cambridge University Press.
& Smith E. E. (1 997). The tree of life: Univer- Popper, K. (1 963 ). Conjectures and refutations.
sal and cultural features of folkbiological tax- London: Routledge.
onomies and inductions. Cognitive Psychology, Proffitt, J. B., Coley, J. D., & Medin, D. L. (2000).
3 2 , 25 1 –295 . Expertise and category-based induction. Jour-
Lopez, A., Gelman, S. A., Gutheil, G., & Smith, nal of Experimental Psychology: Learning, Mem-
E. E. (1 992). The development of category- ory, and Cognition, 2 6, 81 1 –828.
based induction. Child Development, 63 , 1 070– Quine, W. V. (1 970). Natural kinds. In N. Rescher
1 090. (Ed.), Essays in honor of Carl G. Hempel (pp. 5 –
Mandler, J. M., & McDonough, L. (1 998). Stud- 23 ). Dordrecht: D. Reidel.
ies in inductive inference in infancy. Cognitive Rehder, B., & Hastie, R. (2001 ). Causal knowl-
Psychology, 3 7, 60–96. edge and categories: The effects of causal be-
Marr, D. (1 982). Vision. New York: W. H. liefs on categorization, induction, and similar-
Freeman and Co. ity. Journal of Experimental Psychology: General,
McDonald, J., Samuels, M., & Rispoli, J. (1 996). 1 3 0, 3 23 –3 60.
A hypothesis-assessment model of categorical Reichenbach, H. (1 93 8). Experience and pre-
argument strength. Cognition, 5 9, 1 99–21 7. diction. Chicago: University of Chicago
Medin, D. L., Coley, J. D., Storms, G., & Press.
Hayes, B. (2005 ). A relevance theory of in- Rips, L. J. (1 975 ). Inductive judgements about
duction. Psychonomic Bulletin and Review, 1 0, natural categories. Journal of Verbal Learning
5 1 7–5 3 2. and Verbal Behavior, 1 4, 665 –681 .
Medin, D. L., & Ortony, A. (1 989). Psycho- Rips, L. (2001 ). Necessity and natural categories.
logical essentialism. In S. Vosniadou & A. Psychological Bulletin, 1 2 7, 827–85 2.
Ortony (Eds.), Similarity and analogical reason- Rosch, E. H. (1 973 ). Natural categories. Cognitive
ing (pp. 1 79–1 95 ). New York: Cambridge Uni- Psychology, 4, 3 28–3 5 0.
versity Press.
Rosch, E., Mervis, C. B., Gray, W. D., Johnson, D.
Miller, R. W. (1 987). Fact and method. Princeton, M., & Boyes-Braem, P. (1 976). Basic objects in
NJ: Princeton University Press. natural categories. Cognitive Psychology, 8, 3 82–
Murphy, G. L. (1 982). Cue validity and levels of 43 9.
categorization. Psychological Bulletin, 91 , 1 74– Rosch, E. (1 978). Principles of categorization.
1 77. In E. Rosch & B. B. Lloyd (Eds.). Cognition
Murphy, G. L. (2002). The big book of concepts. and categorization (pp. 27–48), Hillsdale, NJ:
Cambridge, MA: MIT Press. Erlbaum.
Murphy, G. L. & Medin, D. L. (1 985 ). The role of Russell, B. (1 91 3 ). On the notion of cause.
theories in conceptual coherence. Psychological Proceedings of the Aristotelian Society, 1 3 ,
Review, 92 , 289–3 1 6. 1 –26.
Russell, B., & Whitehead, A. N. (1 925 ). Principia Sloman, S. A., Love, B. C., & Ahn, W. (1 998).
mathematica. Cambridge, UK: Cambridge Uni- Feature centrality and conceptual coherence.
versity Press. Cognitive Science, 2 2 , 1 89–228.
Sanjana, N. E., & Tenenbaum, J. B. (2003 ). Sloman, S. A., & Malt, B. C. (2003 ). Artifacts are
Bayesian models of inductive generalization. not ascribed essences, nor are they treated as
In Becker, S., Thrun, S., & Obermayer, K. belonging to kinds. Language and Cognitive Pro-
(Eds.), Advances in Neural Processing Systems cesses, 1 8, 5 63 –5 82.
1 5 . Cambridge, MA: MIT Press. Sloman, S. A., & Over, D. (2003 ). Probability
Shafir, E., Smith, E. E., & Osherson, D. N. (1 990). judgment from the inside and out. In D. Over
Typicality and reasoning fallacies. Memory and (Ed.), Evolution and the psychology of thinking:
Cognition, 1 8, 229–23 9. The debate (pp. 1 45 –1 69). New York: Psychol-
Shepard, R. N. (1 980). Multidimensional scaling, ogy Press.
tree-fitting, and clustering. Science, 2 1 0, 3 90– Sloman, S. A., & Rips, L. J. (1 998). Similarity
3 98. as an explanatory construct. Cognition, 65 , 87–
Shepard, R. N. (1 987). Towards a universal law 1 01 .
of generalization for psychological science. Sci- Spellman, B. A., López, A., & Smith, E. E. (1 999).
ence, 2 3 7, 1 3 1 7–1 3 23 . Hypothesis testing: Strategy selection for gen-
Shipley, E. F. (1 993 ). Categories, hierarchies, and eralizing versus limiting hypotheses. Thinking
induction. In D. L. Medin (Ed.), The psychology and Reasoning, 5 , 67–91 .
of learning and motivation, 3 0, pp. 265 –3 01 . San Spirtes, P., Glymour, C., & Scheines, R. (1 993 ).
Diego: Academic Press. Causation, prediction, and search. New York:
Sloman, S. A. (1 993 ). Feature based induction. Springer-Verlag.
Cognitive Psychology, 2 5 , 23 1 –280. Strevens, M. (2001 ). The essentialist aspect of
Sloman, S. A. (1 994). When explanations naive theories. Cognition, 74, 1 49–1 75 .
compete: The role of explanatory coherence Suppes, P. (1 994). Learning and projectibility. In
on judgments of likelihood. Cognition, 5 2 , 1 – D. Stalker (Ed.), Grue: The new riddle of induc-
21 . tion (pp. 263 –272). Chicago: Open Court.
Sloman, S. A. (1 998). Categorical inference is Tversky, A., & Kahneman, D. (1 974). Judgment
not a tree: The myth of inheritance hierarchies. under uncertainty: Heuristics and biases. Sci-
Cognitive Psychology, 3 5 , 1 –3 3 . ence, 1 85 , 1 1 24–1 1 3 1 .
Sloman, S. A., & Lagnado, D. A. (2004). Causal Tversky, A. & Kahneman, D. (1 983 ). Extension
invariance in reasoning and learning. In B. Ross versus intuitive reasoning: The conjunction fal-
(Ed.), Handbook of learning and motivation, 44, lacy in probability judgment. Psychological Re-
287–3 25 . view, 90, 293 –3 1 5 .
CHAPTER 6
Analogy
Keith J. Holyoak
Analogy is a special kind of similarity (see gies have figured prominently in the history
Goldstone & Son, Chap. 2). Two situations of science (see Dunbar & Fugelsang, Chap.
are analogous if they share a common pat- 29) and mathematics (Pask, 2003 ) and are of
tern of relationships among their constituent general use in problem solving (see Novick &
elements even though the elements them- Bassok, Chap. 1 4). In legal reasoning, the use
selves differ across the two situations. Typi- of relevant past cases (legal precedents) to
cally, one analog, termed the source or base, is help decide a new case is a formalized appli-
more familiar or better understood than the cation of analogical reasoning (see Ellsworth,
second analog, termed the target. This asym- Chap. 28). Analogies can also function to in-
metry in initial knowledge provides the ba- fluence political beliefs (Blanchette & Dun-
sis for analogical transfer, using the source to bar, 2001 ) and to sway emotions (Thagard
generate inferences about the target. For ex- & Shelley, 2001 ). Analogical reasoning goes
ample, Charles Darwin drew an analogy be- beyond the information initially given, using
tween breeding programs used in agriculture systematic connections between the source
to select more desirable plants and animals and target to generate plausible, although
and “natural selection” for new species. The fallible, inferences about the target. Analogy
well-understood source analog called atten- is thus a form of inductive reasoning (see
tion to the importance of variability in the Sloman & Lagnado, Chap. 5 ).
population as the basis for change in the dis- Figure 6.1 sketches the major compo-
tribution of traits over successive generations nent processes in analogical transfer (see
and raised a critical question about the tar- Carbonell, 1 983 ; Gentner, 1 983 ; Gick &
get analog: What plays the role of the farmer Holyoak, 1 980, 1 983 ; Novick & Holyoak,
in natural selection? (Another analogy, be- 1 991 ). Typically, a target situation serves
tween Malthus’ theory of human population as a retrieval cue for a potentially useful
growth and the competition of individuals in source analog. It is then necessary to es-
a species to survive and reproduce, provided tablish a mapping, or a set of systematic
Darwin’s answer to this question.) Analo- correspondences that serve to align the
117
telligence, metaphor, and the representation

of knowledge.
Psychometric Tradition
Work in the psychometric tradition focuses
on four-term or “proportional” analogies in
the form A:B::C:D, such as HAND: FIN-
GER :: FOOT: ?, where the problem is to
infer the missing D term (TOE) that is re-
lated to C in the same way B is related to
A (see Sternberg, Chap. 3 1 ). Thus A:B plays
the role of source analog and C:D plays the
role of target. Proportional analogies were
discussed by Aristotle (see Hesse, 1 966) and
in the early decades of modern psychology
became a centerpiece of efforts to define
and measure intelligence. Charles Spearman
Figure 6.1 . Major components of analogical (1 923 , 1 927) argued that the best account
reasoning. of observed individual differences in cogni-
tive performance was based on a general or
g factor, with the remaining variance being
elements of the source and target. On the unique to the particular task. He reviewed
basis of the mapping, it is possible to de- several studies that revealed high correla-
rive new inferences about the target, thereby tions between performance in solving anal-
elaborating its representation. In the after- ogy problems and the g factor. Spearman’s
math of analogical reasoning about a pair of student John C. Raven (1 93 8) developed the
cases, it is possible that some form of rela- Raven’s Progressive Matrices Test (RPM),
tional generalization may take place, yielding which requires selection of a geometric fig-
a more abstract schema for a class of situa- ure to fill an empty cell in a two-dimensional
tions, of which the source and target are both matrix (typically 3 × 3 ) of such figures. Sim-
instances. For example, Darwin’s use of anal- ilar to a geometric proportional analogy, the
ogy to construct a theory of natural selection RPM requires participants to extract and ap-
ultimately led to the generation of a more ab- ply information based on visuospatial rela-
stract schema for a selection theory, which tions. (See Hunt, 1 974, and Carpenter, Just,
in turn helped to generate new specific the- & Shell, 1 990, for analyses of strategies for
ories in many fields, including economics, solving RPM problems.) The RPM proved to
genetics, sociobiology, and artificial intelli- be an especially pure measure of g.
gence. Analogy is one mechanism for effect- Raymond Cattell (1 971 ), another student
ing conceptual change (see Chi & Ohlsson, of Spearman, elaborated his mentor’s the-
Chap. 1 6). ory by distinguishing between two compo-
nents of g: crystallized intelligence, which de-
pends on previously learned information or
skills, and fluid intelligence, which involves
A Capsule History reasoning with novel information. As a form
of inductive reasoning, analogy would be
The history of the study of analogy in- expected to require fluid intelligence. Cat-
cludes three interwoven streams of research, tell confirmed Spearman’s (1 946) observa-
which respectively emphasize analogy in re- tion that analogy tests and the RPM pro-
lation to psychometric measurement of in- vide sensitive measures of g, clarifying that
analogy 119
Figure 6.2 . Multidimensional scaling solution based on intercorrelations among the Raven’s
Progressive Matrices test, analogy tests, and other common tests of cognitive function. (From Snow,
Kyllonen, & Marshalek, 1 984, p. 92. Reprinted by permission.)
they primarily measure fluid intelligence processes (e.g., Mulholland, Pellegrino, &
(although verbal analogies based on diffi- Glaser, 1 980; Sternberg, 1 977). The earli-
cult vocabulary items also depend on crys- est computational models of analogy were
tallized intelligence). Figure 6.2 graphically developed for four-term analogy problems
depicts the centrality of RPM performance (Evans, 1 968; Reitman, 1 965 ). The basic
in a space defined by individual differences components of these models were elabora-
in performance on various cognitive tasks. tions of those proposed by Spearman (1 923 ),
Note that numeric, verbal, and geometric including encoding of the terms, accessing
analogies cluster around the RPM at the cen- a relation between the A and B terms, and
ter of the figure. evoking a comparable relation between the
Because four-term analogies and the RPM C and D terms.
are based on small numbers of relatively More recently, four-term analogy prob-
well-specified elements and relations, it is lems and the RPM have figured promi-
possible to manipulate the complexity of nently in neuropsychological and neu-
such problems systematically and analyze roimaging studies of reasoning (e.g., Bunge,
performance (based on response latencies Wendelken, Badre & Wagner, 2004; Kroger
and error rates) in terms of component et al., 2002; Luo et al., 2003 ; Prabhakaran
et al., 1 997; Waltz et al., 1 999; Wharton against inflation” extends the metonymy
et al., 2000). Analogical reasoning depends into metaphor.
on working memory (see Morrison, Chap. Fauconnier and Turner (1 998; Fauconnier,
1 9). The neural basis of working memory in- 2001 ) analyzed complex conceptual blends
cludes the dorsolateral prefrontal cortex, an that are akin to metaphor. A typical exam-
area of the brain that becomes increasingly ple is a description of the voyage of a mod-
activated as the complexity of the problem ern catamaran sailing from San Francisco to
(measured in terms of number of relations Boston that was attempting to beat the speed
relevant to the solution) increases. It has record set by a clipper ship that had sailed
been argued that this area underlies the fluid the same route over a century earlier. A
component of Spearman’s g factor in intelli- magazine account written during the cata-
gence (Duncan et al., 2000), and it plays an maran’s voyage said the modern boat was
important role in many reasoning tasks (see “barely maintaining a 4.5 day lead over the
Goel, Chap. 20). ghost of the clipper Northern Light. . . . ” Fau-
connier and Turner observed that the maga-
zine writer was describing a “boat race” that
Metaphor
never took place in any direct sense; rather,
Analogy is closely related to metaphor and the writer was blending the separate voy-
related forms of symbolic expression that ages of the two ships into an imaginary race.
arise in everyday language (e.g., “the evening The fact that such conceptual blends are
of life,” “the idea blossomed”), in literature so natural and easy to understand attests to
(Holyoak, 1 982), the arts, and cultural prac- the fact that people can readily comprehend
tices such as ceremonies (see Holyoak & novel metaphors.
Thagard, 1 995 , Chap. 9). Similar to anal- Lakoff and Johnson (1 980; also Lakoff &
ogy in general, metaphors are characterized Turner, 1 989) argued that much of human
by an asymmetry between target (conven- experience, especially its abstract aspects,
tionally termed “tenor”) and source (“ve- is grasped in terms of broad conceptual
hicle”) domains (e.g., the target/tenor in metaphors (e.g., events occurring in time
“the evening of life” is life, which is un- are understood by analogy to objects mov-
derstood in terms of the source/vehicle of ing in space). Time, for example, is under-
time of day). In addition, a mapping (the stood in terms of objects in motion through
“grounds” for the metaphor) connects the space as in expressions such as “My birth-
source and target, allowing the domains to day is fast approaching” and “The time for
interact to generate a new conceptualiza- action has arrived.” (See Boroditsky, 2000,
tion (Black, 1 962). Metaphors are a special for evidence of how temporal metaphors in-
kind of analogy in that the source and tar- fluence cognitive judgments.) As Lakoff and
get domains are always semantically distant Turner (1 989) pointed out, the course of a
(Gentner, 1 982; Gentner, Falkenhainer, & life is understood in terms of time in the solar
Skorstad, 1 988), and the two domains are year (youth is springtime; old age is winter).
often blended rather than simply mapped Life is also conventionally conceptualized as
(e.g., in “the idea blossomed,” the target a journey. Such conventional metaphors can
is directly described in terms of an action still be used in creative ways, as illustrated
term derived from the source). In addition, by Robert Frost’s famous poem, “The Road
metaphors are often combined with other Not Taken”:
symbolic “figures” – especially metonymy
Two roads diverged in a wood, and I –
(substitution of an associated concept).
I took the one less traveled by,
For example, “sword” is a metonymic ex- And that has made all the difference.
pression for weaponry, derived from its
ancient association as the prototypical According to Lakoff and Turner, compre-
weapon – “Raising interests rates is the Fed- hension of this passage depends on our im-
eral Reserve Board’s sword in the battle plicit knowledge of the metaphor that life
analogy 1 21
is a journey. This knowledge includes un- and brain accomplish role binding is thus a
derstanding several interrelated correspon- central problem to be solved by any psycho-
dences (e.g., person is a traveler, purposes logical theory of structured knowledge, in-
are destinations, actions are routes, diffi- cluding any theory of analogy (see Doumas
culties in life are impediments to travel, & Hummel, Chap. 4).
counselors are guides, and progress is the In the 1 980s, a number of cognitive sci-
distance traveled). entists recognized the centrality of analogy
Psychological research has focused on as a tool for discovery and its close connec-
demonstrations that metaphors are in- tion with theories of knowledge represen-
tegral to everyday language understand- tation. Winston (1 980), guided by Minsky’s
ing (Glucksberg, Gildea, & Bookin, 1 982; (1 975 ) treatment of knowledge representa-
Keysar, 1 989) and debate about whether tion, built a computer model of analogy that
metaphor is better conceptualized as a kind highlighted the importance of causal rela-
of analogy (Wolff & Gentner, 2000) or a tions in guiding analogical inference. Other
kind of categorization (Glucksberg & Keysar, researchers in artificial intelligence also be-
1 990; Glucksberg, McClone, & Manfredi, gan to consider the use of complex analogies
1 997). A likely resolution is that novel in reasoning and learning (Kolodner, 1 983 ;
metaphors are interpreted by much the same Schank, 1 982), leading to an approach to ar-
process as analogies, whereas more conven- tificial intelligence termed case-based reason-
tional metaphors are interpreted as more ing (see Kolodner, 1 993 ).
general schemas (Gentner, Bowdle, Wolff, & Around 1 980, two research projects in
Boronat, 2001 ). psychology began to consider analogy in
relation to knowledge representation and
eventually integrate computational model-
Knowledge Representation
ing with detailed experimental studies of
The most important influence on analogy human analogical reasoning. Gentner (1 982,
research in the cognitive science tradition 1 983 ; Gentner & Gentner, 1 983 ) began
has been concerned with the representa- working on mental models and analogy in
tion of knowledge within computational sys- science. She emphasized that in analogy,
tems. Many seminal ideas were developed the key similarities lie in the relations that
by the philosopher Mary Hesse (1 966), who hold within the domains (e.g., the flow of
was in turn influenced by Aristotle’s dis- electrons in an electrical circuit is analog-
cussions of analogy in scientific classifica- ically similar to the flow of people in a
tion and Black’s (1 962) interactionist view crowded subway tunnel), rather than in fea-
of metaphor. Hesse placed great stress on tures of individual objects (e.g., electrons
the purpose of analogy as a tool for scien- do not resemble people). Moreover, analog-
tific discovery and conceptual change and on ical similarities often depend on higher-order
the close connections between causal rela- relations – relations between relations. For ex-
tions and analogical mapping. In the 1 970s, ample, adding a resistor to a circuit causes a
work in artificial intelligence and psychol- decrease in flow of electricity, just as adding a
ogy focused on the representation of com- narrow gate in the subway tunnel would de-
plex knowledge of the sort used in scientific crease the rate at which people pass through
reasoning, problem solving, story compre- (where causes is a higher-order relation). In
hension, and other tasks that require struc- her structure-mapping theory, Gentner pro-
tured knowledge. A key aspect of structured posed that analogy entails finding a struc-
knowledge is that elements can be flexibly tural alignment, or mapping, between do-
bound into the roles of relations. For exam- mains. In this theory, alignment between two
ple, “dog bit man” and “man bit dog” have the representational structures is characterized
same elements and the same relation, but by structural parallelism (consistent, one-
the role bindings have been reversed, radi- to-one correspondences between mapped
cally altering the meaning. How the mind elements) and systematicity – an implicit
preference for deep, interconnected systems The retrieval and mapping components are
of relations governed by higher-order rela- first considered followed by inference and
tions, such as causal, mathematical, or func- relational generalization.
tional relations.
Holyoak (1 985 ; Gick & Holyoak, 1 980,
Retrieval and Mapping
1 983 ; Holyoak & Koh, 1 987) focused on the
role of analogy in problem solving with a a paradigm for investigating
strong concern for the role of pragmatics in analogical transfer
analogy – that is, how causal relations that Gick and Holyoak (1 980, 1 983 ) introduced
impact current goals and context guide the a general laboratory paradigm for investigat-
interpretation of an analogy. Holyoak and ing analogical transfer in the context of prob-
Thagard (1 989a, 1 995 ) developed an ap- lem solving. The general approach was first
proach to analogy in which several factors to provide people with a source analog in
were viewed as jointly constraining analogi- the guise of some incidental context, such
cal reasoning. According to their multicon- as an experiment on “story memory.” Later,
straint theory, people tend to find mappings participants were asked to solve a problem
that maximize similarity of corresponding el- that was in fact analogous to the story they
ements and relations, structural parallelism had studied earlier. The questions of cen-
(i.e., isomorphism, defined by consistent, tral interest were (1 ) whether people would
one-to-one correspondences), and prag- spontaneously notice the relevance of the
matic factors such as the importance of el- source analog and use it to solve the target
ements and relations for achieving a goal. problem, and (2) whether they could solve
Gick and Holyoak (1 983 ) provided evidence the analogy once they were cued to consider
that analogy can furnish the seed for form- the source. Spontaneous transfer of the anal-
ing new relational categories by abstracting ogous solution implies successful retrieval
the relational correspondences between ex- and mapping; cued transfer implies success-
amples into a schema for a class of problems. ful mapping once the need to retrieve the
Analogy was viewed as a central part of hu- source has been removed.
man induction (Holland, Holyoak, Nisbett, The source analog used by Gick and
& Thagard, 1 986; see Sloman & Lagnado, Holyoak (1 980) was a story about a general
Chap. 5 ) with close ties to other basic who is trying to capture a fortress controlled
thinking processes, including causal infer- by a dictator and needs to get his army to
ence (see Buehner & Cheng, Chap. 7), cate- the fortress at full strength. Because the en-
gorization (see Medin & Rips, Chap. 3 ), de- tire army could not pass safely along any sin-
ductive reasoning (see Evans, Chap. 8), gle road, the general sends his men in small
and problem solving (see Novick & Bassok, groups down several roads simultaneously.
Chap. 1 4). Arriving at the same time, the groups join
together and capture the fortress.
A few minutes after reading this story
under instructions to read and remember it
Analogical Reasoning: Overview (along with two other irrelevant stories), par-
of Phenomena ticipants were asked to solve a tumor prob-
lem (Duncker, 1 945 ), in which a doctor has
This section provides an overview of the to figure out how to use rays to destroy a
major phenomena involving analogical rea- stomach tumor without injuring the patient
soning that have been established by em- in the process. The crux of the problem is
pirical investigations. This review is orga- that it seems that the rays will have the same
nized around the major components of effect on the healthy tissue as on the tumor –
analogy depicted in Figure 6.1 . These com- high intensity will destroy both, whereas low
ponents are inherently interrelated, so the intensity will destroy neither. The key issue
connections among them are also discussed. is to determine how the rays can be made to
analogy 1 23
impact the tumor selectively while sparing a surgeon treating a brain tumor), whereas
the surrounding tissue. The source analog, only 1 2% retrieved a source from a remote
if it can be retrieved and mapped, can be domain (the general story). This difference
used to generate a “convergence” solution to in ease of access was dissociable from the
the tumor problem, one that parallels the ease of postaccess mapping and transfer be-
general’s military strategy: Instead of using cause the frequency of generating the con-
a single high-intensity ray, the doctor could vergence solution to the radiation prob-
administer several low-intensity rays at once lem once the source analog was cued was
from different directions. In that way, each high and equal (about 86%), regardless of
ray would be at low intensity along its path, whether the source analog was from the
and hence, harmless to the healthy tissue, same or a different domain.
but the effects of the rays would sum to
achieve the effect of a high-intensity ray at differential impact of similarity and
their focal point, the site of the tumor. structure on retrieval versus mapping
When Gick and Holyoak (1 980) asked The main empirical generalization concern-
college students to solve the tumor problem, ing retrieval and mapping is that similar-
without a source analog, only about 1 0% of ity of individual concepts in the analogs
them produced the convergence solution. has a relatively greater impact on retrieval,
When the general story had been studied, whereas mapping is relatively more sensi-
but no hint to use it was given, only about tive to relational correspondences (Gentner
20% of participants produced the conver- et al., 1 993 ; Holyoak & Koh, 1 987; Ross,
gence solution. In contrast, when the same 1 987, 1 989). However, this dissociation is
participants were then given a simple hint not absolute. Watching the movie West Side
that “you may find one of the stories you read Story for the first time is likely to trigger a re-
earlier to be helpful in solving the problem,” minding of Shakespeare’s Romeo and Juliet
about 75 % succeeded in generating the anal- despite the displacement of the characters
ogous convergence solution. In other words, in the two works over centuries and conti-
people often fail to notice superficially nents. The two stories both involve young
dissimilar source analogs that they could lovers who suffer because of the disapproval
readily use. of their respective social groups, causing a
This gap between the difficulty of re- false report of death, which in turn leads
trieving remote analogs and the relative to tragedy. It is these structural parallels be-
ease of mapping them has been replicated tween the two stories that make them anal-
many times, both with adults (Gentner, ogous rather than simply that both stories
Rattermann, & Forbus, 1 993 ; Holyoak & involve a young man and woman, a disap-
Koh, 1 987; Spencer & Weisberg, 1 986) proval, a false report, and a tragedy.
and with young children (Chen, 1 996; Experimental work on story reminding
Holyoak, Junn, & Billman, 1 984; Tunteler confirms the importance of structure, as well
& Resing, 2002). When analogs must as similarity of concepts, in retrieving analogs
be cued from long-term memory, cases from memory. Wharton and his colleagues
from a domain similar to that of the (Wharton et al., 1 994; Wharton, Holyoak,
cue are retrieved much more readily than & Lange, 1 996) performed a series of exper-
cases from remote domains (Keane, 1 987; iments in which college students tried to find
Seifert, McKoon, Abelson, & Ratcliff, 1 986). connections between stories that overlapped
For example, Keane (1 987) measured rein various ways in terms of the actors and ac-
trieval of a convergence analog to the tu- tions and the underlying themes. In a typical
mor problem when the source analog was experiment, the students first studied about
studied 1 to 3 days prior to presentation of a dozen “target” stories presented in the guise
the target radiation problem. Keane found of a study of story understanding. For exam-
that 88% of participants retrieved a source ple, one target story exemplified a theme of-
analog from the same domain (a story about ten called “sour grapes” after one of Aesop’s
fables. The theme in this story is that the proof everyday analogical reminding. Other ev-
tagonist tries to achieve a goal, fails, and then idence indicates that having people generate
retroactively decides the goal had not really case examples, as opposed to simply asking
been desirable after all. More specifically, the them to remember cases presented earlier,
actions involved someone trying unsuccess- enhances structure-based access to source
fully to get accepted to an Ivy League col- analogs (Blanchette & Dunbar, 2000).
lege. After a delay, the students read a set
of different cue stories and were asked to the “relational shift” in development
write down any story or stories from the
Retrieval is thus sensitive to structure and
first session of which they were reminded.
direct similarity of concepts. Conversely,
Some stories (far analogs) exemplified the
mapping is sensitive to direct similarity and
same theme, but with very different char-
structure (e.g., Reed, 1 987; Ross, 1 989).
acters and actions (e.g., a “sour grapes” fairy
Young children are particularly sensitive
tale about a unicorn who tries to cross a river
to direct similarity of objects; when asked
but is forced to turn back). Other stories
to identify corresponding elements in two
were far “disanalogs” formed by reorganizing
analogs, their mappings are dominated by
the characters and actions to represent a dis-
object similarity when semantic and struc-
tinctly different theme (e.g., “self-doubt” –
tural constraints conflict (Gentner & Toupin,
the failure to achieve a goal leads the pro-
1 986). Younger children are particularly
tagonist to doubt his or her own ability or
likely to map on the basis of object simi-
merit). Thus, neither type of cue was simi-
larity when the relational response requires
lar to the target story in terms of individual
integration of multiple relations, and hence,
elements (characters and actions); however,
is more dependent on working memory re-
the far analog maintained structural corre-
sources (Richland, Morrison, & Holyoak,
spondences of higher-order causal relations
2004). The developmental transition to-
with the target story, whereas the far disana-
ward greater reliance on structure in map-
log did not.
ping has been termed the “relational shift”
Besides varying the relation between the
(Gentner & Rattermann, 1 991 ). Greater sen-
cue and target stories, Wharton et al. (1 994)
sitivity to relations with age appears to arise
also varied the number of target stories that
owing to a combination of incremental ac-
were in some way related to a single cue.
cretion of knowledge about relational con-
When only one target story in a set had been
cepts and stage-like increments in working
studied (“singleton” condition), the proba-
memory capacity (Halford, 1 993 ; Halford
bility of reminding was about equal, regard-
& Wilson, 1 980). (For reviews of develop-
less of whether the cue was analogous to the
mental research on analogy, see Goswami,
target. However, when two target stories had
1 992, 2001 ; Halford, Chap. 22 ; Holyoak &
been studied (e.g., both “sour grapes” and
Thagard, 1 995 ).
“self-doubt,” forming a “competition” condi-
tion), the analogous target was more likely to
be retrieved than the disanalogous one. The goal-directed mapping
advantage of the far analog in the competi- Mapping is guided not only by relational
tion condition was maintained even when a structure and element similarity but also by
week intervened between initial study of the the goals of the analogist (Holyoak, 1 985 ).
target stories and presentation of the cue sto- People draw analogies not to find a pris-
ries (Wharton et al., 1 996). tine isomorphism for its own sake but to
These results demonstrate that structure make plausible inferences that will achieve
does influence analogical retrieval, but its their goals. Particularly when the mapping
impact is much more evident when multi- is inherently ambiguous, the constraint of
ple memory traces, each somewhat similar pragmatic centrality – relevance to goals –
to the cue, must compete to be retrieved. is critical (Holyoak, 1 985 ). Spellman and
Such retrieval competition is likely typical Holyoak (1 996) investigated the impact of
analogy 1 25
processing goals on the mappings gener- the sense that multiple constraints converge
ated for inherently ambiguous analogies. In on a solution that satisfies as many differ-
one experiment, college students read two ent constraints as possible (Thagard, 2000).
science fiction stories about countries on Everyday use of analogies depends on the
two planets. These countries were interre- human ability to find coherent mappings –
lated by various economic and military al- even when source and target are complex
liances. Participants first made judgments and the mappings are ambiguous. For ex-
about individual countries based on either ample, political debate often makes use of
economic or military relationships and were analogies between prior situations and some
then asked mapping questions about which current controversy (Blanchette & Dunbar,
countries on one planet corresponded to 2001 , 2002). Ever since World War II, politi-
which on the other. Schematically, planet 1 cians in the United States and elsewhere
included three countries, such that “Afflu” have periodically argued that some military
was economically richer than “Barebrute,” intervention was justified because the cur-
whereas the latter was militarily stronger rent situation was analogous to that lead-
than “Compak.” Planet 2 included four ing to World War II. A commonsensical
countries, with “Grainwell” being richer than mental representation of World War II, the
“Hungerall” and “Millpower” being stronger source analog, amounts to a story figuring
than “Mightless.” The critical aspect of this an evil villain, Hitler; misguided appeasers,
analogy problem is that Barebrute (planet 1 ) such as Neville Chamberlain; and clear-
is both economically weak (like Hunger- sighted heroes, such as Winston Churchill
all on planet 2) and militarily strong (like and Franklin Delano Roosevelt. The coun-
Millpower) and therefore, has two compet- tries involved in World War II included the
ing mappings that are equally supported by villains, Germany and Japan; the victims,
structural and similarity constraints. such as Austria, Czechoslovakia, and Poland;
Spellman and Holyoak (1 996) found that and the heroic defenders, notably Britain and
participants whose processing goal led them the United States.
to focus on economic relationships tended A series of American presidents have used
to map Barebrute to Hungerall rather than the World War II analog as part of their
Millpower, whereas those whose process- argument for American military interven-
ing goal led them to focus on military tion abroad (see Khong, 1 992). These in-
relationships had the opposite preferred clude Harry Truman (Korea, 1 95 0), Lyndon
mapping. The variation in pragmatic cen- Johnson (Vietnam, 1 965 ), George Bush se-
trality of the information thus served to nior (Kuwait and Iraq, 1 991 ), and his son
decide between the competing mappings. George W. Bush (Iraq, 2003 ). Analogies to
One interpretation of such findings is that World War II have also been used to sup-
pragmatically central propositions tend to port less aggressive responses. Most notably,
be considered earlier and more often than during the Cuban missile crisis of 1 962,
those that are less goal relevant and hence, President John F. Kennedy decided against
dominate the mapping process (Hummel & a surprise attack on Cuba in part because he
Holyoak, 1 997). did not want the United States to behave in
a way that could be equated to Japan’s sur-
prise attack on Pearl Harbor.
coherence in analogical mapping The World War II situation was, of course,
The key idea of Holyoak and Thagard’s very complex and is never likely to map per-
(1 989a) multiconstraint theory of analogy is fectly onto any new foreign policy problem.
that several different kinds of constraints – Nonetheless, by selectively focusing on goal-
similarity, structure, and purpose – all in- relevant aspects of the source and target and
teract to determine the optimal set of cor- using multiple constraints in combination,
respondences between source and target. A people can often find coherent mappings in
good analogy is one that appears coherent in situations of this sort. After the Iraqi invasion
of Kuwait in 1 990, President George United States to the United States should
H. W. Bush argued that Saddam Hussein, the instead map Bush to Roosevelt.
Iraqi leader, was analogous to Adolf Hitler During the first 2 days of the U.S.-led
and that the Persian Gulf crisis in general was counterattack against the Iraqi invasion of
analogous to events that had led to World Kuwait, Spellman and Holyoak (1 992) asked
War II a half-century earlier. By drawing the a group of American undergraduates a few
analogy between Hussein and Hitler, Pres- questions to find out how they interpreted
ident Bush encouraged a reasoning process the analogy between the then-current situ-
that led to the construction of a coherent ation in the Persian Gulf and World War II.
system of roles for the players in the Gulf sit- The undergraduates were asked to sup-
uation. The popular understanding of World pose that Saddam Hussein was analogous
War II provided the source, and analogical to Hitler. Regardless of whether they be-
mapping imposed a set of roles on the tar- lieved the analogy was appropriate, they
get Gulf situation by selectively emphasizing were then asked to write down the most
the most salient relational parallels between natural match in the World War II situation
the two situations. Once the analogical cor- for Iraq, the United States, Kuwait, Saudi
respondences were established (with Iraq Arabia, and George Bush. For those stu-
identified as an expansionist dictatorship like dents who gave evidence that they knew the
Germany, Kuwait as its first victim, Saudi basic facts about World War II, the major-
Arabia as the next potential victim, and the ity produced mappings that fell into one of
United States as the main defender of the two patterns. Those students who mapped
Gulf states), the clear analogical inference the United States to itself also mapped
was that both self-interest and moral con- Bush to Roosevelt; these same students also
siderations required immediate military intended to map Saudi Arabia to Great Britain.
tervention by the United States. Aspects of Other students, in contrast, mapped the
the Persian Gulf situation that did not map United States to Great Britain and Bush to
well to World War II (e.g., lack of democracy Churchill, which in turn (so as to maintain
in Kuwait) were pushed to the background. one-to-one correspondences) forced Saudi
Of course, the analogy between the two Arabia to map to some country other than
situations was by no means perfect. Simi- Britain. The mapping for Kuwait (which did
larity at the object level favored mapping not depend on the choice of mappings for
the United States of 1 991 to the United Bush, the United States, or Saudi Arabia)
States of World War II simply because it was usually to one or two of the early vic-
was the same country, which would in turn tims of Germany in World War II (usually
support mapping Bush to President Roo- Austria or Poland).
sevelt. However, the United States did not The analogy between the Persian Gulf sit-
enter World War II until it was bombed uation and World War II thus generated a
by Japan, well after Hitler had marched “bistable” mapping: People tended to pro-
through much of Europe. One might there- vide mappings based on either of two coher-
fore argue that the United States of 1 991 ent but mutually incompatible sets of corre-
mapped to Great Britain of World War II and spondences. Spellman and Holyoak (1 992)
that Bush mapped to Winston Churchill, the went on to perform a second study, using a
British Prime Minister (because Bush, sim- different group of undergraduates, to show
ilar to Churchill, led his nation and West- that people’s preferred mappings could be
ern allies in early opposition to aggression). pushed around by manipulating their knowl-
These conflicting pressures made the map- edge of the source analog, World War II.
pings ambiguous. However, the pressure to Because many undergraduates were lacking
maintain structural consistency implies that in knowledge about the major participants
people who mapped the United States to and events in World War II, it proved pos-
Britain should also tend to map Bush to sible to “guide” them to one or the other
Churchill, whereas those who mapped the mapping pattern by having them first read a
analogy 1 27
slightly biased summary of events in World

War II. The various summaries were all his-
torically “correct,” in the sense of providing
only information taken directly from history Target
books, but each contained slightly differ- Object
ent information and emphasized different
points. Each summary began with an iden-
tical passage about Hitler’s acquisition of
Austria, Czechoslovakia, and Poland and the
efforts by Britain and France to stop him.
The versions then diverged. Some versions
went on to emphasize the personal role of
Churchill and the national role of Britain; Relational
other versions placed greater emphasis on Featural Match
Match
what Roosevelt and the United States did
to further the war effort. After reading one
of these summaries of World War II, the un-
dergraduates were asked the same mapping
questions as had been used in the previous
study. The same bistable mapping patterns
emerged as before, but this time the sum-
maries influenced which of the two coher-
ent patterns of responses students tended Figure 6.3. An example of a pair of pictures
to give. People who read a “Churchill” ver- used in studies of analogical mapping with
sion tended to map Bush to Churchill and arrows added to indicate featural and relational
the United States to Great Britain, whereas responses. (From Tohill & Holyoak, 2000, p. 3 1 .
those who read a “Roosevelt” version tended Reprinted by permission.)
to map Bush to Roosevelt and the United
States to the United States. It thus ap-
pears that even when an analogy is messy working memory in analogical mapping
and ambiguous, the constraints on analog- Analogical reasoning, because it depends on
ical coherence produce predictable inter- manipulating structured representations of
pretations of how the source and target knowledge, would be expected to make crit-
fit together. ical use of working memory. The role of
Achieving analogical coherence in map- working memory in analogy has been ex-
ping does not, of course, guarantee that the plored using a picture-mapping paradigm in-
source will provide a clear and compelling troduced by Markman and Gentner (1 993 ).
basis for planning a course of action to deal An example of stimuli similar to those they
with the target situation. In 1 991 , President used is shown in Figure 6.3 . In their exper-
Bush considered Hussein enough of a Hitler iments, college students were asked to ex-
to justify intervention in Kuwait but not amine the two pictures and then decide (for
enough of one to warrant his removal from this hypothetical example) what object in
power in Iraq. A decade later his son, Presi- the bottom picture best goes with the man
dent George W. Bush, reinvoked the World in the top picture. When this single map-
War II analogy to justify a preemptive inva- ping is considered in isolation, people often
sion of Iraq itself. Bush claimed (falsely, as indicate that the boy in the bottom picture
was later revealed) that Hussein was acquir- goes with the man in the top picture based
ing biological and perhaps nuclear weapons on perceptual and semantic similarity of
that posed an imminent threat to the United these elements. However, when people are
States and its allies. Historical analogies can asked to match not just one object but three
be used to obfuscate as well as to illuminate. (e.g., the man, dog, and the tree in the top
picture to objects in the bottom picture), Inference and Relational Generalization

they are led to build an integrated represen-
tation of the relations among the objects and copy with substitution and generation
of higher-order relations between relations. Analogical inference – using a source analog
In the top picture, a man is unsuccessfully to form a new conjecture, whether it be a
trying to restrain a dog, which then chases step toward solving a math problem (Reed,
the cat. In the bottom picture, the tree is un- Dempster, & Ettinger, 1 985 ; see Novick
successful in restraining the dog, which then & Bassok, Chap. 1 4), a scientific hypoth-
chases the boy. Based on these multiple in- esis (see Dunbar & Fugelsang, Chap. 29),
teracting relations, the preferred match to a diagnosis for puzzling medical symptoms
the man in the top picture is not the boy in (see Patel, Arocha, & Zhang, Chap. 3 0),
the lower scene but the tree. Consequently, or a basis for deciding a legal case (see
people who map three objects at once are Ellsworth, Chap. 28) – is the fundamental
more likely to map the man to the tree on purpose of analogical reasoning. Mapping
the basis of their similar relational roles than serves to highlight correspondences between
are people who map the man alone. the source and target, including “alignable
Whereas Markman and Gentner (1 993 ) differences” (Markman & Gentner, 1 993 ) –
showed that the number of objects to be the distinct but corresponding elements of
mapped influences the balance between the two analogs. These correspondences pro-
the impact of element similarity versus re- vide the input to an inference engine that
lational structure, other studies using the generates new target propositions. The ba-
picture-mapping paradigm have demon- sic form of analogical inference has been
strated that manipulations that constrict called “copy with substitution and genera-
working memory resources have a similar tion” (CWSG; Holyoak et al., 1 994). CWSG
impact. Waltz, Lau, Grewal, and Holyoak involves constructing target analogs of un-
(2000) asked college students to map pic- mapped source propositions by substituting
tures while performing a secondary task the corresponding target element, if known,
designed to tax working memory (e.g., gen- for each source element, and if no corre-
erating random digits). Adding a dual task di- sponding target element exists, postulating
minished relational responses and increased one as needed. This procedure gives rise to
similarity-based responses (see Morrison, two important corollaries concerning infer-
Chap. 1 9). A manipulation that increases ence errors. First, if critical elements are dif-
people’s anxiety level (performing mathe- ficult to map (e.g., because of strong repre-
matical calculations under speed pressure sentational asymmetries such as those that
prior to the mapping task) yielded a sim- hinder mapping a discrete set of elements
ilar shift in mapping responses (Tohill & to a continuous variable; Bassok & Holyoak,
Holyoak, 2000). Most dramatically, degen- 1 989; Bassok & Olseth, 1 995 ), then no in-
eration of the frontal lobes radically impairs ferences can be constructed. Second, if ele-
relation-based mapping (Morrison et al., ments are mismapped, predictable inference
2004). In related work using complex story errors will result (Holyoak et al., 1 994; Reed,
analogs, Krawczyk, Holyoak, and Hummel 1 987).
(2004) demonstrated that mappings (and in- All major computational models of ana-
ferences) based on element similarity ver- logical inference use some variant of CWSG
sus relational structure were made about (e.g., Falkenhainer et al., 1 989; Halford et al.,
equally often when the element similarities 1 994; Hofstadter & Mitchell, 1 994; Holyoak
were salient and the relational structure was et al., 1 994; Hummel & Holyoak, 2003 ;
highly complex. All these findings support Keane & Brayshaw, 1 988; Kokinov & Petrov,
the hypothesis that mapping on the basis of 2001 ). CWSG is critically dependent on
relations requires adequate working mem- variable binding and mapping; hence, mod-
ory to represent and manipulate role bind- els that lack these key computational prop-
ings (Hummel & Holyoak, 1 997). erties (e.g., traditional connectionist models)
analogy 1 29
fail to capture even the most basic as- An important question is when analogi-
pects of analogical inference (see Doumas & cal inferences are made and how inferences
Hummel, Chap. 4). generated by CWSG relate to facts about the
Athough all analogy models use some target analog that are stated directly. One
form of CWSG, additional constraints extreme possibility is that people only make
on this inference mechanism are critical analogical inferences when instructed to do
(Clement & Gentner, 1 991 ; Holyoak et al., so and that inferences are carefully “marked”
1 994; Markman, 1 997). If CWSG were as such so they will never be confused with
unconstrained, then any unmapped source known facts about the target. At the other
proposition would generate an inference extreme, it is possible that some analogi-
about the target. Such a loose criterion for cal inferences are triggered when the tar-
inference generation would lead to ram- get is first processed (given that the source
pant errors whenever the source was not has been activated) and that such inferences
isomorphic to a subset of the target, and are then integrated with prior knowledge
such isomorphism will virtually never hold of the target. One paradigm for address-
for problems of realistic complexity. Sev- ing this issue is based on testing for false
eral constraints on CWSG were demon- “recognition” of potential inferences in a
strated in a study by Lassaline (1 996; also subsequent memory test. The logic of the
see Clement & Gentner, 1 991 ; Spellman recognition paradigm (Bransford, Barclay, &
& Holyoak, 1 996). Lassaline had college Franks, 1 972) is that if an inference has been
students read analogs describing proper- made and integrated with the rest of the
ties of hypothetical animals and then rate target analog, then later the reasoner will
various possible target inferences for the falsely believe that the inference had been
probability that the conclusion would be directly presented.
true given the information in the premise. Early work by Schustack and Anderson
Participants rated potential inferences as (1 979) provided evidence that people some-
more probable when the source and tar- times falsely report that analogical infer-
get analogs shared more attributes, and ences were actually presented as facts.
hence, mapped more strongly. In addition, Blanchette and Dunbar (2002) performed
their ratings were sensitive to structural a series of experiments designed to assess
and pragmatic constraints. The presence when analogical inferences are made. They
of a higher-order linking relation in the had college students (in Canada) read a text
source made an inference more credible. For describing a current political issue, possible
example, if the source and target animals legalization of marijuana use, which served
were both described as having an acute as the target analog. Immediately afterward,
sense of smell, and the source animal was half the students read, “The situation with
said to have a weak immune system that marijuana can be compared to . . . ”, followed
“develops before” its acute sense of smell, by an additional text describing the period
then the inference that the target animal also early in the twentieth century when alco-
has a weak immune system would be bol- hol use was prohibited. Importantly, the stu-
stered relative to stating only that the source dents in the analogy condition were not told
animal had an acute sense of smell “and” how prohibition mapped onto the marijuana
a weak immune system. The benefit con- debate, nor were they asked to draw any in-
veyed by the higher-order relation was inferences. After a delay (1 week in one ex-
creased if the relation was explicitly causal periment, 1 5 minutes in another), the stu-
(e.g., in the source animal, a weak immune dents were given a list of sentences and were
system “causes” its acute sense of smell), asked to decide whether each sentence had
rather than less clearly causal (“develops actually been presented in the text about
before”). (See Hummel & Holyoak, 2003 , marijuana use. The critical items were sen-
for a simulation of this and other inference tences such as “The government could set up
results using a CWSG algorithm.) agencies to control the quality and take over
the distribution of marijuana.” These sen- Holyoak, 1 983 ). Indeed, people will form
tences had never been presented; however, schemas simply as a side effect of applying
they could be generated as analogical infer- one solved source problem to an unsolved
ences by CWSG based on a parallel state- target problem (Novick & Holyoak, 1 991 ;
ment contained in the source analog (“The Ross & Kennedy, 1 990).
government set up agencies to control the In the case of problem schemas, more
quality and take over the distribution of al- effective schemas are formed when the
cohol”). Blanchette and Dunbar found that goal-relevant relations are the focus rather
students in the analogy condition said “yes” than incidental details (Brown et al., 1 986;
to analogical inferences about 5 0% of the Brown, Kane, & Long, 1 989; Gick &
time, whereas control subjects who had not Holyoak, 1 983 ). In general, any kind of pro-
read the source analog about prohibition said cessing that helps people focus on the under-
“yes” only about 25 % of the time. This ten- lying causal structure of the analogs, thereby
dency to falsely “recognize” analogical infer- encouraging learning of more effective prob-
ences that had never been read was obtained lem schemas, will improve subsequent trans-
both after long and short delays and with fer to new problems. For example, Gick
both familiar and less familiar materials. and Holyoak (1 983 ) found that induction
It thus appears that when people notice of a “convergence” schema from two dis-
the connection between a source and target, parate analogs was facilitated when each
and they are sufficiently engaged in an effort story stated the underlying solution prin-
to understand the target situation, analogi- ciple abstractly: “If you need a large force
cal inferences will be generated by CWSG to accomplish some purpose, but are pre-
and then integrated with prior knowledge of vented from applying such a force directly,
the target. At least sometimes, an analogical many smaller forces applied simultaneously
inference becomes accepted as a stated fact. from different directions may work just as
This result obviously has important impli- well.” In some circumstances, transfer can
cations for understanding analogical reason- also be improved by having the reasoner
ing, such as its potential for use as a tool generate a problem analogous to an initial
for persuasion. example (Bernardo, 2001 ). Other work has
shown that abstract diagrams that highlight
the basic idea of using multiple converging
relational generalization forces can aid in schema induction and sub-
In addition to generating local inferences sequent transfer (Beveridge & Parkins, 1 987;
about the target by CWSG, analogical rea- Gick & Holyoak, 1 983 ) – especially when
soning can give rise to relational general- the diagram uses motion cues to convey per-
izations – abstract schemas that establish ception of forces acting on a central target
an explicit representation of the common- (Pedone, Hummel, & Holyoak, 2001 ; see
alities between the source and the target. Figure 6.4, top).
Comparison of multiple analogs can result Although two examples can suffice to es-
in the induction of a schema, which in tablish a useful schema, people are able to
turn will facilitate subsequent transfer to incrementally develop increasingly abstract
additional analogs. The induction of such schemas as additional examples are provided
schemas has been demonstrated in both (Brown et al., 1 986, 1 989; Catrambone &
adults (Catrambone & Holyoak, 1 989; Gick Holyoak, 1 989). However, even with mul-
& Holyoak, 1 983 ; Loewenstein, Thompson, tiple examples that allow novices to start
& Gentner, 1 999; Ross & Kennedy, 1 990) forming schemas, people may still fail to
and young children (Brown, Kane, & Echols, transfer the analogous solution to a prob-
1 986; Chen & Daehler, 1 989; Holyoak et al., lem drawn from a different domain if a
1 984; Kotovsky & Gentner, 1 996). People substantial delay intervenes or if the con-
are able to induce schemas by comparing text is changed (Spencer & Weisberg, 1 986).
just two analogs to one another (Gick & Nonetheless, as novices continue to develop
analogy 1 31
Figure 6.4. Sequence of diagrams used to convey the convergence schema by

perceived motion. Top: sequence illustrating convergence (arrows appear to
move inward in II–IV). Bottom: control sequence in which arrows diverge
instead of converge (arrows appear to move outward in II–IV). (From Pedone,
Holyoak, & Hummel, 2001 , p. 21 7. Reprinted by permission.)
more powerful schemas, long-term transfer structed between analogous situations that
in an altered context can be dramatically fit the schema. As schemas are acquired
improved (Barnett & Koslowski, 2002). For from examples, they in turn guide future
example, Catrambone and Holyoak (1 989) mappings and inferences (Bassok, Wu, &
gave college students a total of three con- Olseth, 1 995 ).
vergence analogs to study, compare, and
solve. The students were first asked a series
of detailed questions designed to encourage Computational Models of Analogy
them to focus on the abstract structure com-
mon to two of the analogs. After this ab- From its inception, work on analogy
straction training, the students were asked in relation to knowledge representation
to solve another analog from a third do- has involved the development of detailed
main (not the tumor problem), after which computational models of the various com-
they were told the convergence solution to ponents of analogical reasoning typically fo-
it (which most students were able to gen- cusing on the central process of structure
erate themselves). Finally, 1 week later, the mapping. The most influential early models
students returned to participate in a dif- included SME (Structure Mapping Engine;
ferent experiment. After the other experi- Falkenhainer, Forbus, & Gentner, 1 989),
ment was completed, they were given the ACME (Analogical Mapping by Constraint
tumor problem to solve. More than 80% Satisfaction; Holyoak & Thagard, 1 989a),
of participants came up with the converg- IAM (Incremental Analogy Model; Keane &
ing rays solution without any hint. As the Brayshaw, 1 988), and Copycat (Hofstadter
novice becomes an expert, the emerging & Mitchell, 1 994). More recently, models
schema becomes increasingly accessible and of analogy have been developed based on
is triggered by novel problems that share its knowledge representations constrained by
structure. Deeper similarities have been con- neural mechanisms (Hummel & Holyoak,
1 992). These efforts included an approach predicate-calculus notation is equivalent to a

based on the use of tensor products for vari- graph structure. An analogical mapping can
able binding, the STAR model (Structured then be viewed as a set of correspondences
Tensor Analogical Reasoning; Halford et al., between partially matching graph structures.
1 994; see Halford, Chap. 22), and another The heart of the SME algorithm is a pro-
based on neural synchrony, the LISA model cedure for finding graph matches that sat-
(Learning and Inference with Schemas and isfy certain criteria. The algorithm operates
Analogies; Hummel & Holyoak, 1 997, 2003 ; in three stages, progressing in a “local-to-
see Doumas & Hummel, Chap. 4). (For a global” direction. First, SME proposes lo-
brief overview of computational models of cal matches between all identical predicates
analogy, see French, 2002.) Three models are and their associated role fillers. It is as-
sketched to illustrate the general nature of sumed similar predicates (e.g., “Führer-of ”
computational approaches to analogy. and “president-of ”; “occupy” and “invade”)
are first transformed into more general pred-
Structure Mapping Engine (SME) icates (e.g.,“leader-of ”; “attack”) that reveal
a hidden identity. (In practice, the program-
SME (Falkenhainer et al., 1 989) illustrates mer must make the required substitutions
how analogical mapping can be performed so similar but nonidentical predicates can be
by algorithms based on partial graph match- matched.) The resulting matches are typi-
ing. The basic knowledge representation for cally inconsistent in that one element in the
the inputs is based on a notation in the style source may match multiple elements in the
of predicate calculus. If one takes a simple target (e.g., Hitler might match either Hus-
example based on the World War II analogy sein or Bush because all are “leaders”). Sec-
as it was used by President George Bush in ond, the resulting local matches are inte-
1 991 , a fragment might look like grated into structurally consistent clusters or
SOURCE:
“kernels” (e.g., the possible match between
Führer-of (Hitler, Germany) Hitler and Bush is consistent with that be-
occupy (Germany, Austria) tween Germany and the United States, and
evil (Hitler) so these matches would form part of a sin-
cause [evil (Hitler), occupy (Germany, gle kernel). Third, the kernels are merged
Austria)] into a small number of sets that are max-
prime-minister-of (Churchill, Great imal in size (i.e., that include matches be-
Britain) tween the greatest number of nodes in the
cause [occupy (Germany, Austria), coun- two graphs), while maintaining correspon-
terattack (Churchill, Hitler)] dences that are structurally consistent and
TARGET: one to one. SME then ranks the result-
president-of (Hussein, Iraq) ing sets of mappings by a structural eval-
invade (Iraq, Kuwait) uation metric that favors “deep” mappings
evil (Hussein)
(ones that include correspondences between
cause [evil (Hussein), invade (Iraq,
Kuwait)]
higher-order relations). For our example,
president-of (Bush, United States) the optimal set will respectively map Hitler,
Germany, Churchill, and Great Britain to
SME distinguishes objects (role fillers, Hussein, Iraq, Bush, and the United States
such as “Hitler”), attributes (one-place pred- because of the support provided by the map-
icates, such as “evil” with its single role filler), ping between the higher-order “cause” rela-
first-order relations (multiplace predicates, tions involving “occupy/invade.” Using this
such as “occupy” with its two role fillers), and optimal mapping, SME applies a CWSG al-
higher-order relations (those such as “cause” gorithm to generate inferences about the
that take at least one first-order relation as a target based on unmapped propositions in
role filler). As illustrated in Figure 6.5 , the the source. Here, the final “cause” relation
analogy 1 33
Figure 6.5. SME’s graphical representation of a source and target analog.
in the source will yield the analogical infer- ular predicate in the corresponding struc-
ence, cause [attack (Iraq, Kuwait), counter- tured representation. (Content vectors are
attack (Bush, Hussein)]. computed automatically from the underly-
SME thus models the mapping and ining structural representations.) The content
ference components of analogical reason- vector for the target is then matched to vec-
ing. A companion model, MACFAC (“Many tors for all analogs stored in memory, and
Are Called but Few Are Chosen”; Forbus, the dot product for each analog pair is cal-
Gentner, & Law, 1 995 ) deals with the ini- culated as an index of similarity. The source
tial retrieval of a source analog from long- analog with the highest dot product, plus
term memory. MACFAC has an initial stage other stored analogs with relatively high dot
(“many are called”) in which analogs are rep- products, are marked as retrieved. In its sec-
resented by content vectors, which code the ond stage, MACFAC uses SME to assess
relative number of occurrences of a partic- the degree of the structural overlap between
the target and each possible source, allowing lated to establish optimal global coherence.
the program to identify a smaller number of Third, if the model is being used to gener-
potential sources that have the highest de- ate inferences and correspondences, CWSG
grees of structural parallelism with the target is applied to generate inferences based
(“few are chosen”). As the content vectors on the correspondences identified in the
used in the first stage of MACFAC do not second step.
code role bindings, the model provides a ACME has a companion model, ARCS
qualitative account of why the retrieval stage (Analog Retrieval by Constraint Satis-
of analogy is less sensitive to structure than faction; Thagard, Holyoak, Nelson, &
is the mapping stage. Gochfeld, 1 990) that models analog re-
trieval. Analogs in long-term memory are
connected within a semantic network (see
Analogical Mapping by Constraint
Medin & Rips, Chap. 3 ); this network of
Satisfaction (ACME)
concepts provides the initial basis by which
The ACME model (Holyoak, Novick, & a target analog activates potential source
Melz, 1 994; Holyoak & Thagard, 1 989a) was analogs. Those analogs in memory that are
directly influenced by connectionist mod- identified as having semantic links to the tar-
els based on parallel constraint satisfac- get (i.e., those that share similar concepts)
tion (Rumelhart, Smolensky, McClelland, then participate in an ACME-like con-
& Hinton, 1 986; see Doumas & Hummel, straint satisfaction process to select the opti-
Chap. 4). ACME takes as input symbolic mal source. The constraint network formed
representations of the source and target by ARCS is restricted to those concepts
analogs in essentially the same form as those in each analog that have semantic links;
used in SME. However, whereas SME fo- hence, ARCS shows less sensitivity to struc-
cuses on structural constraints, ACME in- ture in retrieval than does ACME in map-
stantiates a multiconstraint theory in which ping. Because constraint satisfaction algo-
structural, semantic, and pragmatic con- rithms are inherently competitive, ARCS
straints interact to determine the optimal can model the finding that analogical ac-
mapping. ACME accepts a numeric code cess is more sensitive to structure when sim-
for degree of similarity between predicates, ilar source analogs in long-term memory
which it uses as a constraint on mapping. compete to be retrieved (Wharton et al.,
Thus, ACME, unlike SME, can match simi- 1 994, 1 996).
lar predicates (e.g., “occupy” and “invade”)
without explicitly recoding them as iden-
Learning and Inference with Schemas
tical. In addition, ACME accepts a nu-
and Analogies (LISA)
meric code for the pragmatic importance
of a possible mapping, which is also used Similar to ACME, the LISA model
as a constraint. (Hummel & Holyoak, 1 997, 2003 ) is
ACME is based on a constraint satis- based on the principles of the multicon-
faction algorithm, which proceeds in three straint theory of analogy; unlike ACME,
steps. First, a connectionist “mapping net- LISA operates within psychologically and
work” is constructed in which the units rep- neurally realistic constraints on working
resent hypotheses about possible element memory (see Doumas & Hummel, Chap. 4;
mappings and the links represent specific in- Morrison, Chap. 1 9). The models discussed
stantiations of the general constraints (Fig- previously include at most localist rep-
ure 6.6). Second, an interactive-activation resentations of the meaning of concepts
algorithm operates to “settle” the map- (e.g., a semantic network in the case of
ping network in order to identify the set ARCS), and most of their processing is
of correspondences that collectively repre- performed on propositional representations
sent the “optimal” mapping between the unaccompanied by any more detailed level
analogs. Any constraint may be locally vio- of conceptual representation (e.g., neither
analogy 1 35
PURPOSE SIMILARITY
president-of invade
Führer-of occupy
Saddam Iraq Kuwait
Hitler Germany Austria
Saddam Iraq Iraq Kuwait
Germany Austria Hitler Germany
president-of invade
occupy Führer-of
Figure 6.6. A constraint-satisfaction network in ACME.
ACME nor SME includes any represen- of activation on the semantic units, which in
tation of the meaning of concepts). LISA turn activate propositions in potential source
also goes beyond previous models in that analogs residing in long-term memory. The
it provides a unified account of all the resulting coactivity of source and target el-
major components of analogical reasoning ements, augmented with a capacity to learn
(retrieval, mapping, inference, and re- which structures in the target were coactive
lational generalization). with which in the source, serves as the basis
LISA represents propositions using a hi- for analogical mapping. LISA includes a set
erarchy of distributed and localist units (see of mapping connections between units of the
Figure 4.1 in Doumas & Hummel, Chap. 4). same type (e.g., object, predicate) in sepa-
LISA includes both a long-term memory rate analogs. These connections grow when-
for propositions and concept meanings and ever the corresponding units are active si-
a limited-capacity working memory. LISA’s multaneously and thereby permit LISA to
working memory representation, which uses learn the correspondences between struc-
neural synchrony to encode role-filler bind- tures in separate analogs. They also permit
ings, provides a natural account of the ca- correspondences learned early in mapping to
pacity limits of working memory because it influence the correspondences learned later.
is only possible to have a finite number of Augmented with a simple algorithm for self-
bindings simultaneously active and mutually supervised learning, the mapping algorithm
out of synchrony. serves as the basis for analogical inference
Analog retrieval is accomplished as a form by CWSG. Finally, augmented with a sim-
of guided pattern matching. Propositions in a ple algorithm for intersection discovery, self-
target analog generate synchronized patterns supervised relational learning serves as the
basis for schema induction. LISA has been begun to connect behavioral work on anal-
used to simulate a wide range of data on ogy with research in cognitive neuroscience
analogical reasoning (Hummel & Holyoak, (Morrison et al., 2004). We already have
1 997, 2003 ), including both behavioral some knowledge of the general neural cir-
and neuropsychological studies (Morrison cuits that underlie analogy and other forms
et al., 2004). of reasoning (see Goel, Chap. 20). As
more sophisticated noninvasive neuroimag-
ing methodologies are developed, it should
become possible to test detailed hypothe-
Conclusions and Future Directions ses about the neural mechanisms underly-
ing analogy, such as those based on temporal
When we think analogically, we do much properties of neural systems.
more than just compare two analogs based Most research and modeling in the field
on obvious similarities between their el- of analogy has emphasized quasilinguistic
ements. Rather, analogical reasoning is a knowledge representations, but there is good
complex process of retrieving structured reason to believe that reasoning in general
knowledge from long-term memory, repre- has close connections to perception (e.g.,
senting and manipulating role-filler bind- Pedone et al., 2001 ). Perception provides
ings in working memory, performing self- an important starting point for grounding at
supervised learning to form new inferences, least some “higher” cognitive representations
and finding structured intersections between (Barsalou, 1 999). Some progress has been
analogs to form new abstract schemas. The made in integrating analogy with perception.
entire process is governed by the core con- For example, the LISA model has been aug-
straints provided by isomorphism, similarity mented with a Metric Array Module (MAM;
of elements, and the goals of the reasoner Hummel & Holyoak, 2001 ), which provides
(Holyoak & Thagard, 1 989a). These con- specialized processing of metric information
straints apply in all components of analog- at a level of abstraction applicable to both
ical reasoning: retrieval, mapping, inference, perception and quasispatial concepts. How-
and relational generalization. When analogs ever, models of analogy have generally failed
are retrieved from memory, the constraint of to address evidence that the difficulty of
element similarity plays a large role, but rela- solving problems and transferring solution
tional structure is also important – especially methods to isomorphic problems is depen-
when multiple source analogs similar to the dent on the difficulty of perceptually encod-
target are competing to be selected. For ing key relations. The ease of solving appar-
mapping, structure is the most important ently isomorphic problems (e.g., isomorphs
constraint but requires adequate working of the well-known Tower of Hanoi) can vary
memory resources; similarity and purpose enormously, depending on perceptual cues
also contribute. The success of analogical in- (Kotovsky & Simon, 1 990; see Novick & Bas-
ference ultimately depends on whether the sok, Chap. 1 4).
purpose of the analogy is achieved, but satis- More generally, models of analogy have
fying this constraint is intimately connected not been well integrated with models of
with the structural relations between the problem solving (see Novick & Bassok,
analogs. Finally, relational generalization oc- Chap. 1 4), even though analogy clearly af-
curs when schemas are formed from the fords an important mechanism for solving
source and target to capture those structural problems. In its general form, problem solv-
patterns in the analogs that are most rele- ing requires sequencing multiple operators,
vant to the reasoner’s purpose in exploiting establishing subgoals, and using combina-
the analogy. tions of rules to solve related but non-
Several current research directions are isomorphic problems. These basic require-
likely to continue to develop. Computa- ments are beyond the capabilities of virtually
tional models of analogy, such as LISA all computational models of analogy (but
(Hummel & Holyoak, 1 997, 2003 ), have see Holyoak & Thagard, 1 989b, for an
analogy 1 37
early although limited effort to integrate the National Science Foundation. Kevin
analogy within a rule-based problem-solving Dunbar and Robert Morrison provided valu-
system). The most successful models of able comments on an earlier draft.
human problem solving have been formu-
lated as production systems (see Lovett &
Anderson, Chap. 1 7), and Salvucci and An-
derson (2001 ) developed a model of anal- References
ogy based on the ACT-R production system.
However, this model is unable to solve re- Barnett, S. M., & Koslowski, B. (2002). Solving
liably any analogy that requires integration novel, ill-defined problems: Effects of type of
of multiple relations – a class that includes experience and the level of theoretical under-
analogies within the grasp of young children standing it generates. Thinking & Reasoning, 8,
(Halford, 1 993 ; Richland et al., 2004; see 23 7–267.
Halford, Chap. 22). The integration of anal- Barsalou, L. W. (1 999). Perceptual symbol sys-
ogy models with models of general problem tems. Behavioral and Brain Sciences, 2 2 , 5 77–
solving remains an important research goal. 660.
Perhaps the most serious limitation of Bassok, M., & Holyoak, K. J. (1 989). Interdomain
current computational models of analogy is transfer between isomorphic topics in algebra
that their knowledge representations must and physics. Journal of Experimental Psychology:
be hand-coded by the modeler, whereas hu- Learning, Memory, and Cognition, 1 5 , 1 5 3 –
1 66.
man knowledge representations are formed
autonomously. Closely related to the chal- Bassok, M., & Olseth, K. L. (1 995 ). Object-based
representations: Transfer between cases of con-
lenge of avoiding hand-coding of represen-
tinuous and discrete models of change. Journal
tations is the need to flexibly rerepresent of Experimental Psychology: Learning, Memory,
knowledge to render potential analogies per- and Cognition, 2 1 , 1 5 22–1 5 3 8.
spicuous. Concepts often have a close con- Bassok, M., Wu, L. L., & Olseth, K. L. (1 995 ).
ceptual relationship with more complex re- Judging a book by its cover: Interpretative ef-
lational forms (e.g., Jackendoff, 1 983 ). For fects of content on problem-solving transfer.
example, causative verbs such as lift (e.g., Memory & Cognition, 2 3 , 3 5 4–3 67.
“John lifted the hammer”) have very simi- Bernardo, A. B. I. (2001 ). Principle explanation
lar meanings to structures based on an ex- and strategic schema abstraction in problem
plicit higher-order relation, cause (e.g., “John solving. Memory & Cognition, 2 9, 627–63 3 .
caused the hammer to rise”). In such cases, Beveridge, M., & Parkins, E. (1 987). Visual repre-
the causative verb serves as a “chunked” rep- sentation in analogical problem solving. Mem-
resentation of a more elaborate predicate- ory & Cognition, 1 5 , 23 0–23 7.
argument structure. People are able to “see” Black, M. (1 962). Models and metaphors. Ithaca,
analogies even when the analogs have very NY: Cornell University Press.
different linguistic forms (e.g., “John lifted Blanchette, I., & Dunbar, K. (2000). Analogy use
the hammer in order to strike the nail” might in naturalistic settings: The influence of audi-
be mapped onto “The Federal Reserve used ence, emotion, and goal. Memory & Cognition,
an increase in interest rates as a tool in its 2 9, 73 0–73 5 .
efforts to drive down inflation”). A deeper Blanchette, I., & Dunbar, K. (2001 ). How analo-
understanding of human knowledge repre- gies are generated: The roles of structural and
sentation is a prerequisite for a complete the- superficial similarity. Memory & Cognition, 2 8,
ory of analogical reasoning. 1 08–1 24.
Blanchette, I., & Dunbar, K. (2002). Represen-
tational change and analogy: How analogical
Acknowledgments inferences alter target representations. Journal
of Experimental Psychology: Learning, Memory,
and Cognition, 2 8, 672–685 .
Preparation of this chapter was supported by
Boroditsky, L. (2000). Metaphoric structuring:
grants R3 05 H03 01 41 from the Institute of Understanding time through spatial meta-
Education Sciences and SES-00803 75 from phors. Cognition, 75 , 1 –28.
Bransford, J. D., Barclay, J. R., & Franks, J. J. Duncker, K. (1 945 ). On problem solving. Psycho-
(1 972). Sentence memory: A constructive ver- logical Monographs, 5 8 (Whole No. 270).
sus interpretive approach. Cognitive Psychology, Evans, T. G. (1 968). A program for the solution of
3 , 1 93 –209. geometric-analogy intelligence test questions.
Brown, A. L., Kane, M. J., & Echols, C. H. In M. Minsky (Ed.), Semantic information pro-
(1 986). Young children’s mental models deter- cessing (pp. 271 –3 5 3 ). Cambridge, MA: MIT
mine analogical transfer across problems with a Press.
common goal structure. Cognitive Development, Falkenhainer, B., Forbus, K. D., & Gentner, D.
1 , 1 03 –1 21 . (1 989). The structure-mapping engine: Algo-
Brown, A. L., Kane, M. J., & Long, C. (1 989). rithm and examples. Artificial Intelligence, 41 ,
Analogical transfer in young children: Analo- 1 –63 .
gies as tools for communication and expo- Fauconnier, G. (2001 ). Conceptual blending and
sition. Applied Cognitive Psychology, 3 , 275 – analogy. In D. Gentner, K. J. Holyoak, &
293 . B. N. Kokinov (Eds.), The analogical mind: Per-
Bunge, S. A., Wendelken, C., Badre, D., & spectives from cognitive science (pp. 25 5 –285 ).
Wagner, A. D. (2004). Analogical reasoning Cambridge, MA: MIT Press.
and prefrontal cortex: Evidence for separable Fauconnier, G., & Turner, M. (1 998). Concep-
retrieval and integration mechanisms. Cerebral tual integration networks. Cognitive Science, 2 2 ,
Cortex, x. 1 3 3 –1 87.
Carbonell, J. G. (1 983 ). Learning by analogy: For- Forbus, K. D., Gentner, D., & Law, K. (1 995 ).
mulating and generalizing plans from past ex- MAC/FAC: A model of similarity-based re-
perience. In R. S. Michalski, J. G. Carbonell, & trieval. Cognitive Science, 1 9, 1 41 –205 .
T. M. Mitchell (Eds.), Machine learning: An ar- French, R. M. (2002). The computational model-
tificial intelligence approach (pp. 1 3 7–1 61 ). Palo ing of analogy-making. Trends in Cognitive Sci-
Alto, CA: Tioga. ence, 6, 200–205 .
Carpenter, P. A., Just, M. A., & Shell, P. (1 990). Gentner, D. (1 982). Are scientific analogies
What one intelligence test measures: A theo- metaphors? In D. S. Miall (Eds.), Metaphor:
retical account of the processing in the Raven Problems and perspectives (pp. 1 06–1 3 2).
Progressive Matrices test. Psychological Review, Brighton, UK: Harvester Press.
97, 404–43 1 .
Gentner, D. (1 983 ). Structure-mapping: A the-
Catrambone, R., & Holyoak, K. J. (1 989). Over- oretical framework for analogy. Cognitive Sci-
coming contextual limitations on problem- ence, 7, 1 5 5 –1 70.
solving transfer. Journal of Experimental Psy-
Gentner, D., Bowdle, B., Wolff, P., & Boronat,
chology: Learning, Memory, and Cognition, 1 5 ,
C. (2001 ). Metaphor is like analogy. In D.
1 1 47–1 1 5 6.
Gentner, K. J. Holyoak, & B. N. Kokinov (Eds.),
Cattell, R. B. (1 971 ). Abilities: Their structure, The analogical mind: Perspectives from cognitive
growth, and action. Boston, MA: Houghton- science (pp. 1 99–25 3 ). Cambridge, MA: MIT
Mifflin. Press.
Chen, Z. (1 996). Children’s analogical problem Gentner, D., Falkenhainer, B., & Skorstad, J.
solving: The effects of superficial, structural, (1 988). Viewing metaphor as analogy. In D. H.
and procedural similarity. Journal of Experimen- Helman (Eds.), Analogical reasoning: Perspec-
tal Child Psychology, 62 , 41 0–43 1 . tives of artificial intelligence, cognitive science,
Chen, Z., & Daehler, M. W. (1 989). Positive and and philosophy (pp. 1 71 –1 77). Dordrecht, The
negative transfer in analogical problem solving Netherlands: Kluwer.
by 6-year-old children. Cognitive Development, Gentner, D., & Gentner, D. R. (1 983 ). Flow-
4, 3 27–3 44. ing waters or teeming crowds: Mental models
Clement, C. A., & Gentner, D. (1 991 ). System- of electricity. In D. Gentner & A. L. Stevens
aticity as a selection constraint in analogical (Eds.), Mental models (pp. 99–1 29). Hillsdale,
mapping. Cognitive Science, 1 5 , 89–1 3 2. NJ: Erlbaum.
Duncan, J., Seitz, R. J., Kolodny, J., Bor, D., Her- Gentner, D., & Rattermann, M. (1 991 ). Language
zog, H., Ahmed, A., Newell, F. N., & Emslie, H. and the career of similarity. In S. A. Gelman
(2000). A neural basis for general intelligence. & J. P. Byrnes (Eds.), Perspectives on thought
Science, 2 89, 45 7–460. and language: Interrelations in development
analogy 1 39
(pp. 225 –277). Cambridge, UK: Cambridge J. A. Barnden (Eds.), Analogical connections.
University Press. Advances in connectionist and neural computa-
Gentner, D., Rattermann, M., & Forbus, K. tion theory (Vol. 2, pp. 3 1 –1 1 2). Norwood, NJ:
(1 993 ). The roles of similarity in transfer: Sepa- Ablex.
rating retrievability from inferential soundness. Holland, J. H., Holyoak, K. J., Nisbett, R. E., &
Cognitive Psychology, 2 5 , 5 24–5 75 . Thagard, P. (1 986). Induction: Processes of infer-
Gentner, D., & Toupin, C. (1 986). Systematicity ence, learning, and discovery. Cambridge, MA:
and surface similarity in the development of MIT Press.
analogy. Cognitive Science, 1 0, 277–3 00. Holyoak, K. J. (1 982). An analogical frame-
Gick, M. L., & Holyoak, K. J. (1 980). Analogical work for literary interpretation. Poetics, 1 1 ,
problem solving. Cognitive Psychology, 1 2 , 3 06– 1 05 –1 26.
355. Holyoak, K. J. (1 985 ). The pragmatics of analog-
Gick, M. L., & Holyoak, K. J. (1 983 ). Schema in- ical transfer. In G. H. Bower (Ed.), The psychol-
duction and analogical transfer. Cognitive Psy- ogy of learning and motivation (Vol. 1 9, pp. 5 9–
chology, 1 5 , 1 –3 8. 87). New York: Academic Press.
Glucksberg, S., Gildea, P., & Bookin, H. (1 982). Holyoak, K. J., Junn, E. N., & Billman, D. O.
On understanding nonliteral speech: Can peo- (1 984). Development of analogical problem-
ple ignore metaphors? Journal of Verbal Learn- solving skill. Child Development, 5 5 , 2042–
ing and Verbal Behaviour, 2 1 , 85 –98. 205 5 .
Holyoak, K. J., & Koh, K. (1 987). Surface and
Glucksberg, S., & Keysar, B. (1 990). Understand-
structural similarity in analogical transfer.
ing metaphorical comparisons: Beyond similar-
Memory & Cognition, 1 5 , 3 3 2–3 40.
ity. Psychological Review, 97, 3 –1 8.
Holyoak, K. J., Novick, L. R., & Melz, E. R.
Glucksberg, S., McClone, M. S., & Manfredi, D.
(1 994). Component processes in analogical
(1 997). Property attribution in metaphor com-
transfer: Mapping, pattern completion, and
prehension. Journal of Memory and Language,
adaptation. In K. J. Holyoak & J. A. Barnden
3 6, 5 0–67.
(Eds.), Advances in connectionist and neural
Goswami, U. (1 992). Analogical reasoning in chil- computation theory, Vol. 2 : Analogical connec-
dren. Hillsdale, NJ: Erlbaum. tions (pp. 1 1 3 –1 80). Norwood, NJ: Ablex.
Goswami, U. (2001 ). Analogical reasoning in Holyoak, K. J., & Thagard, P. (1 989a). Analogi-
children. In D. Gentner, K. J. Holyoak, & cal mapping by constraint satisfaction. Cogni-
B. N. Kokinov (Eds.), The analogical mind: Per- tive Science, 1 3 , 295 –3 5 5 .
spectives from cognitive science (pp. 43 7–470). Holyoak, K. J., & Thagard, P. (1 989b). A compu-
Cambridge, MA: MIT Press. tational model of analogical problem solving.
Halford, G. S. (1 993 ). Children’s understanding: In S. Vosniadou & A. Ortony (Eds.), Similarity
The development of mental models. Hillsdale, NJ: and analogical reasoning (pp. 242–266). New
Erlbaum. York: Cambridge University Press.
Halford, G. S., & Wilson, W. H. (1 980). A cat- Holyoak, K. J., & Thagard, P. (1 995 ). Mental leaps:
egory theory approach to cognitive develop- Analogy in creative thought. Cambridge, MA:
ment. Cognitive Psychology, 1 2 , 3 5 6–41 1 . MIT Press.
Halford, G. S., Wilson, W. H., Guo, J., Gayler, Hummel, J. E., & Holyoak, K. J. (1 992). Indi-
R. W., Wiles, J., & Stewart, J. E. M. (1 994). rect analogical mapping. In Proceedings of the
Connectionist implications for processing ca- Forteenth Annual Conference of the Cognitive
pacity limitations in analogies. In K. J. Holyoak Science Society (pp. 5 1 6–5 21 ). Hillsdale, NJ:
& J. A. Barnden (Eds.), Advances in connection- Erlbaum.
ist and neural computation theory, Vol. 2 : Analog- Hummel, J. E., & Holyoak, K. J. (1 997). Dis-
ical connections (pp. 3 63 –41 5 ). Norwood, NJ: tributed representations of structure: A theory
Ablex. of analogical access and mapping. Psychological
Hesse, M. (1 966). Models and analogies in science. Review, 1 04, 427–466.
Notre Dame, IN: Notre Dame University Press. Hummel, J. E., & Holyoak, K. J. (2001 ). A process
Hofstadter, D. R., & Mitchell, M. (1 994). The model of human transitive inference. In M. L.
Copycat project: A model of mental fluid- Gattis (Ed.), Spatial schemas in abstract thought
ity and analogy-making. In K. J. Holyoak & (pp. 279–3 05 ). Cambridge, MA: MIT Press.
Hummel, J. E., & Holyoak, K. J. (2003 ). A A parametric study of relational complexity.

symbolic-connectionist theory of relational Cerebral Cortex, 1 2 , 477–485 .
inference and generalization. Psychological Re- Lakoff, G., & Johnson, M. (1 980). Metaphors
view, 1 1 0, 220–263 . we live by. Chicago: University of Chicago
Hunt, E. B. (1 974). Quote the raven? Nevermore! Press.
In L. W. Gregg (Ed.), Knowledge and cognition Lakoff, G., & Turner, M. (1 989). More than cool
(pp. 1 29–1 5 7). Hillsdale, NJ: Erlbaum. reason: A field guide to poetic metaphor. Chicago:
Jackendoff, R. (1 983 ). Semantics and cognition. University of Chicago Press.
Cambridge, MA: MIT Press. Lassaline, M. E. (1 996). Structural alignment in
Keane, M. T. (1 987). On retrieving analogues induction and similarity. Journal of Experimen-
when solving problems. Quarterly Journal of tal Psychology: Learning, Memory, and Cogni-
Experimental Psychology, 3 9A, 29–41 . tion, 2 2 , 75 4–770.
Keane, M. T., & Brayshaw, M. (1 988). The incre- Loewenstein, J., Thompson, L., & Gentner, D.
mental analogical machine: A computational (1 999). Analogical encoding facilitates knowl-
model of analogy. In D. Sleeman (Ed.), Euro- edge transfer in negotiation. Psychonomic Bul-
pean working session on learning (pp. 5 3 –62). letin and Review, 6, 5 86–5 97.
London: Pitman. Luo, Q., Perry, C., Peng, D., Jin, Z., Xu, D., Ding,
Keysar, B. (1 989). On the functional equivalence G., & Xu, S. (2003 ). The neural substrate of
of literal and metaphorical interpretations in analogical reasoning: An fMRI study. Cognitive
discourse. Journal of Memory and Language, 2 8, Brain Research, 1 7, 5 27–5 3 4.
3 75 –3 85 . Markman, A. B. (1 997). Constraints on analogical
Khong, Y. F. (1 992). Analogies at war: Korea, Mu- inference. Cognitive Science, 2 1, 3 73 –41 8.
nich, Dien Bien Phu, and the Vietnam decisions Markman, A. B., & Gentner, D. (1 993 ). Struc-
of 1 965 . Princeton, NJ: Princeton University tural alignment during similarity comparisons.
Press. Cognitive Psychology, 2 3 , 43 1 –467.
Kokinov, B. N., & Petrov, A. A. (2001 ). Inte- Minsky, M. (1 975 ). A framework for represent-
gration of memory and reasoning in analogy- ing knowledge. In P. Winston (Ed.), The psy-
making: The AMBR model. In D. Gentner, K. J. chology of computer vision (pp. 21 1 –281 ). New
Holyoak, & B. N. Kokinov (Eds.), The analogical York: McGraw-Hill.
mind: Perspectives from cognitive science (pp. 5 9– Morrison, R. G., Krawczyk, D. C., Holyoak, K. J.,
1 24). Cambridge, MA: MIT Press. Hummel, J. E., Chow, T. W., Miller, B. L., &
Kolodner, J. L. (1 983 ). Reconstructive memory: Knowlton, B. J. (2004). A neurocomputational
A computer model. Cognitive Science, 7, 281 – model of analogical reasoning and its break-
3 28. down in frontotemporal lobar degeneration.
Kolodner, J. L. (1 993 ). Case-based reasoning. San Journal of Cognitive Neuroscience, 1 6, 260–271 .
Mateo, CA: Morgan Kaufmann. Mulholland, T. M., Pellegrino, J. W., & Glaser,
Kotovsky, K., & Simon, H. A. (1 990). What R. (1 980). Components of geometric analogy
makes some problems really hard? Explo- solution. Cognitive Psychology, 1 2 , 25 2–284.
rations in the problem space of difficulty. Cog- Novick, L. R., & Holyoak, K. J. (1 991 ). Math-
nitive Psychology, 2 2 , 1 43 –1 83 . ematical problem solving by analogy. Journal
Kotovsky, L., & Gentner, D. (1 996). Comparison of Experimental Psychology: Learning, Memory,
and categorization in the development of rela- and Cognition, 1 7, 3 98–41 5 .
tional similarity. Child Development, 67, 2797– Pask, C. (2003 ). Mathematics and the science of
2822. analogies. American Journal of Physics, 71 , 5 26–
Krawczyk, D. C., Holyoak, K. J., & Hummel, 5 3 4.
J. E. (2004). Structural constraints and object Pedone, R., Hummel, J. E., & Holyoak, K. J.
similarity in analogical mapping and inference. (2001 ). The use of diagrams in analogical prob-
Thinking & Reasoning, 1 0, 85 –1 04. lem solving. Memory & Cognition, 2 9, 21 4–
Kroger, J. K., Saab, F. W., Fales, C. L., 221 .
Bookheimer, S. Y., Cohen, M. S., & Holyoak, Prabhakaran, V., Smith, J. A. L., Desmond, J. E.,
K. J. (2002). Recruitment of anterior dorso- Glover, G. H., & Gabrieli, J. D. E. (1 997). Neu-
lateral prefrontal cortex in human reasoning: ral substrates of fluid reasoning: An fMRI study
analogy 1 41
of neocortical activation during performance of for new information. Journal of Verbal Learning
the Raven’s Progressive Matrices Test. Cogni- and Verbal Behavior, 1 8, 5 65 –5 83 .
tive Psychology, 3 3 , 43 –63 . Seifert, C. M., McKoon, G., Abelson, R. P., &
Raven, J. C. (1 93 8). Progressive matrices: A percep- Ratcliff, R. (1 986). Memory connections be-
tual test of intelligence, individual form. London: tween thematically similar episodes. Journal
Lewis. of Experimental Psychology: Learning, Memory,
Reed, S. K. (1 987). A structure-mapping model and Cognition, 1 2 , 220–23 1 .
for word problems. Journal of Experimental Snow, R. E., Kyllonen, C. P., & Marshalek, B.
Psychology: Learning, Memory, and Cognition, (1 984). The topography of ability and learning
1 3 , 1 24–1 3 9. correlations. In R. J. Sternberg (Ed.), Advances
Reed, S. K., Dempster, A., & Ettinger, M. (1 985 ). in the psychology of human intelligence (pp. 47–
Usefulness of analogical solutions for solving 1 03 ). Hillsdale, NJ: Erlbaum.
algebra word problems. Journal of Experimental Spearman, C. (1 923 ). The nature of intelligence
Psychology: Learning, Memory, and Cognition, and the principles of cognition. London, UK:
1 1, 1 06–1 25 . Macmillan.
Reitman, W. (1 965 ). Cognition and thought. New Spearman, C. (1 927). The abilities of man. New
York: Wiley. York: Macmillan.
Richland, L. E., Morrison, R. G., & Holyoak, Spearman, C. (1 946). Theory of a general factor.
K. J. (2004). Working memory and inhibition as British Journal of Psychology, 3 6, 1 1 7–1 3 1 .
constraints on children’s development of ana- Spellman, B. A., & Holyoak, K. J. (1 992). If
logical reasoning. In K. Forbus, D. Gentner & Saddam is Hitler then who is George Bush?:
T. Regier (Eds.), Proceedings of the Twenty-sixth Analogical mapping between systems of social
Annual Conference of the Cognitive Science Soci- roles. Journal of Personality and Social Psychol-
ety. Mahwah, NJ: Erlbaum. ogy, 62 , 91 3 –93 3 .
Ross, B. (1 987). This is like that: The use of earlier Spellman, B. A., & Holyoak, K. J. (1 996). Prag-
problems and the separation of similarity ef- matics in analogical mapping. Cognitive Psy-
fects. Journal of Experimental Psychology: Learn- chology, 3 1 , 3 07–3 46.
ing, Memory, and Cognition, 1 3 , 629–63 9. Spencer, R. M., & Weisberg, R. W. (1 986).
Ross, B. (1 989). Distinguishing types of superfi- Context-dependent effects on analogical trans-
cial similarities: Different effects on the access fer. Memory & Cognition, 1 4, 442–449.
and use of earlier problems. Journal of Experi- Sternberg, R. J. (1 977). Component processes in
mental Psychology: Learning, Memory, and Cog- analogical reasoning. Psychological Review, 84,
nition, 1 5 , 45 6–468. 3 5 3 –3 78.
Ross, B. H., & Kennedy, P. T. (1 990). Generaliz- Thagard, P. (2000). Coherence in thought and ac-
ing from the use of earlier examples in prob- tion. Cambridge, MA: MIT Press.
lem solving. Journal of Experimental Psychology: Thagard, P., Holyoak, K. J., Nelson, G., &
Learning, Memory, and Cognition, 1 6, 42–5 5 . Gochfeld, D. (1 990). Analog retrieval by con-
Rumelhart, D. E., Smolensky, P., McClelland, straint satisfaction. Artificial Intelligence, 46,
J. L., & Hinton, G. E. (1 986). Schemata and 25 9–3 1 0.
sequential thought processes in PDP models. Thagard, P., & Shelley, C. (2001 ). Emotional
In J. L. McClelland, D. E. Rumelhart, & the analogies and analogical inference. In D.
PDP Research Group (Eds.), Parallel distributed Gentner, K. J. Holyoak, & B. N. Kokinov (Eds.),
processing: Explorations in the microstructure of The analogical mind: Perspectives from cognitive
cognition (Vol. 2). Cambridge, MA: MIT Press. science (pp. 3 3 5 –3 62). Cambridge, MA: MIT
Salvucci, D. D., & Anderson, J. R. (2001 ). Inte- Press.
grating analogical mapping and general prob- Tohill, J. M., & Holyoak, K. J. (2000). The impact
lem solving: The path-mapping theory. Cogni- of anxiety on analogical reasoning. Thinking &
tive Science, 2 5 , 67–1 1 0. Reasoning, 6, 27–40.
Schank, R. C. (1 982). Dynamic memory. New Tunteler, E., & Resing, W. C. M. (2002). Sponta-
York: Cambridge University Press. neous analogical transfer in 4-year-olds: A mi-
Schustack, M. W., & Anderson, J. R. (1 979). Ef- crogenetic study. Journal of Experimental Child
fects of analogy to prior knowledge on memory Psychology, 83 , 1 49–1 66.
Waltz, J. A., Knowlton, B. J., Holyoak, K. J., Wharton, C. M., Holyoak, K. J., Downing, P. E.,
Boone, K. B., Mishkin, F. S., de Menezes Santos, Lange, T. E., Wickens, T. D., & Melz, E. R.
M., Thomas, C. R., & Miller, B. L. (1 999). A (1 994). Below the surface: Analogical similar-
system for relational reasoning in human pre- ity and retrieval competition in reminding. Cog-
frontal cortex. Psychological Science, 1 0, 1 1 9– nitive Psychology, 2 6, 64–1 01 .
1 25 . Wharton, C. M., Holyoak, K. J., & Lange, T. E.
Waltz, J. A., Lau, A., Grewal, S. K., & Holyoak, (1 996). Remote analogical reminding. Memory
K. J. (2000). The role of working memory in & Cognition, 2 4, 629–643 .
analogical mapping. Memory & Cognition, 2 8, Winston, P. H. (1 980). Learning and reasoning
1 205 –1 21 2. by analogy. Communications of the ACM, 2 3 ,
Wharton, C. M., Grafman, J., Flitman, S. S., 689–703 .
Hansen, E. K., Brauner, J., Marks, A., & Honda, Wolff, P., & Gentner, D. (2000). Evidence
M. (2000). Toward neuroanatomical models for role-neutral initial processing of meta-
of analogy: A positron emission tomography phors. Journal of Experimental Psychology:
study of analogical mapping. Cognitive Psychol- Leaning, Memory, and Cognition, 2 6, 5 29–
ogy, 40, 1 73 –1 97. 5 41 .
CHAPTER 7
Causal Learning
Marc J. Buehner
Patricia W. Cheng
Introduction The Nature of the Problem and a

Historical Review: Is Causality an
This chapter is an introduction to the psy- Inscrutable Fetish or the Cement
chology of causal inference using a compu- of the Universe?
tational perspective with the focus on causal
discovery. It explains the nature of the prob- Imagine a world in which we could not rea-
lem of causal discovery and illustrates the son about causes and effects. What would it
goal of the process with everyday and hypo- be like? Typically, reviews about causal rea-
thetical examples. It reviews two approaches soning begin by declaring that causal rea-
to causal discovery, a purely statistical ap- soning enables us to predict and control
proach and an alternative approach that in- our environment and by stating that causal
corporates causal hypotheses in the infer- reasoning allows us to structure an other-
ence process. The latter approach provides wise chaotic flux of events into meaningful
a coherent framework within which to an- episodes. In other words, without causal in-
swer different questions regarding causal inference, we would be unable to learn from
ference. The chapter ends with a discussion the past and incapable of manipulating our
of two additional issues – the level of abstrac- surroundings to achieve our goals. Let us
tion of the candidate cause and the tempo- see how a noncausal world would be grim
ral interval between the occurrence of the and the exact role causal inference plays for
cause and the occurrence of the effect – and adaptive intelligence. We illustrate the non-
a sketch of future directions for the field. causal world by intuitive examples as well
1 43
as by what is predicted by associative and event remained unperturbed. However, one

other purely statistical models – models that might be unable to predict the consequences
do not go through an intermediate step of of actions (e.g., exposure to the sun, deceiv-
positing hypotheses about causal relations in ing the rooster into crowing). Causation, and
the world rather than just in the head. only causation, licenses the prediction of the
We want to see the goals of causal reason- consequences of actions. Both kinds of pre-
ing; we also want to see what the givens are, dictions are obviously helpful (e.g., we ap-
so we can step back and see what the prob- preciate weather reports), but the latter is
lem of causal learning is. One way of casting what allows (1 ) goal-directed behaviors to
this problem is to ask, “What minimal set achieve their goals and (2) maladaptive rec-
of processes would one endow an artificial ommendations that accord with mere corre-
system, so that when put on Planet Earth lations to be dismissed. The examples also
and given the types of information humans illustrate that only causation supports ex-
receive, it will evolve to represent the world planation (Woodward, 2003 ). Whereas one
as they do?” For example, what process must would explain that one’s skin is tanned be-
the system have so it would know that ex- cause of exposure to the sun, one would not
posure to the sun causes tanning in skin but explain that the sun rises because the rooster
bleaching in fabrics? These causal facts are crows, despite the reliable predictions that
unlikely to be innate in humans. The learning one can make in each case. Understanding
process would begin with noncausal obser- what humans do when they reason about
vations. For both cases, the input would be causation is a challenge, and the ability to
observations on various entities (people and build a system that accomplishes what hu-
articles of clothing, respectively) with vary- mans accomplish is a test of one’s under-
ing exposures to sunlight and, in one case, standing of that psychological process.
the darkness of skin color and, in the other, We see that even when there is temporal
the darkness of fabric colors. Consider an- information so one can reliably predict an
other example: Suppose the system is pre- event from an earlier observation (e.g., sun-
sented with observations that a rooster in rise from a rooster’s crowing, a storm from a
a barn crowed soon before sunrise and did drop in the barometric reading), correlation
not crow at other times during the day when need not imply causation. One might think
the sun did not rise. What process must the that intervention (i.e., action, manipulation)
system have so it would predict that the is what differentiates between covariation
sun would soon rise when informed that the and causation: When the observations are
rooster had just spontaneously crowed but obtained by intervention, by oneself or oth-
would not predict the same when informed ers, the covariations are causal; otherwise,
that the rooster had just been deceived into they are not necessarily causal. A growing
crowing by artificial lighting? Neither would body of research is dedicated to the role
the system recommend starting a round-the- of intervention in causal learning, discov-
clock solar energy enterprise even if there ery, and reasoning (e.g., Gopnik et al., 2004;
were reliable ways of making roosters crow. Lagnado & Sloman, 2004; Steyvers, Tenen-
Nor would it, when a sick rooster is ob- baum, Wagenmakers, & Blum, 2003 ). In-
served not to crow, worry about cajoling it deed, the general pattern reported is that ob-
into crowing to ensure that the sun will rise servations based on intervention allow causal
in the morning. In a noncausal world, such inferences that are not possible with mere
recommendations and worries would be nat- observations. However, although interven-
ural (also see Sloman & Lagnado, Chap. 5 ). tion generally allows causal inference, it does
Our examples illustrate that by keep- not guarantee it. Consider a food allergy
ing track of events that covary (i.e., vary test that introduces samples of food into
together, are statistically associated), one the body by needle punctures on the skin.
would be able to predict a future event from The patient may react with hives on all
a covariation provided that causes of that punctured spots, and yet one may not know
causal learning 1 45
whether the patient is allergic to any of the scope of this chapter. Within our framework,
foods. Suppose the patient’s skin is allergic however, the problem has the same general
to needle punctures so hives also appear on core: How would an intelligent system trans-
punctured spots without food. In this exam- form noncausal input into a causal relation
ple, there is an intervention, but no causal as its output? That problem remains, despite
inference regarding food allergy seems war- the additional innate or learned spatiotem-
ranted (Cheng, 1 997). What then are the poral constraints (see, Cheng, 1 993 , for an
conditions that allow causal discovery? Note inductive analysis of the launching effect,
that in this example the intervention was and Scholl & Nakayama, 2002, for a demon-
suboptimal because two interventions oc- stration of inductive components of the vi-
curred concurrently (adding allergens into sual system’s analysis of launching events).
the bloodstream and puncturing the skin), Causal discovery is not the only process
resulting in confounding. with which one would endow the artificial
Historically, causality has been the reasoning system. Many psychologists have
domain of philosophers, from Aristotle addressed a related but distinct issue of up-
through to Hume and Kant, to name just a dating and applying prior causal knowledge.
few. The fundamental challenge since Hume Once causal knowledge is acquired, it would
(1 73 9/1 888) that has been occupying schol- be efficient to apply it to novel situations in-
ars in this area is that causality per se is not volving events of like kind. We are all famil-
directly in the input. This issue fits well in iar with such applications of causal knowl-
the framework of creating an artificial rea- edge transmitted culturally or acquired on
soning system – causal knowledge has to our own. A number of researchers have pro-
emerge from noncausal input. Nothing in posed Bayesian accounts of the integration
the evidence available to our sensory sys- of prior causal knowledge and current in-
tem can ensure someone of a causal rela- formation (Anderson, 1 990; Tenenbaum &
tion between, say, flicking a switch and the Griffiths, 2002). It may seem that there is
hallway lights turning on. Yet, we regularly a ready answer to the updating and appli-
and routinely have strong convictions about cation problem. What may not be straight-
causality. David Hume made a distinction forward, however, is the determination of
between analytic and empirical knowledge. “events of like kind,” the variables in a causal
Moreover, he pointed out that causal knowl- relation. The application of causal knowl-
edge is empirical, and that of this kind of edge therefore highlights an issue that has
knowledge, we can only be certain of the been mostly neglected in the research on
states of observable events or objects (e.g., causal discovery: What determines which
the presence of an event of interest and its categories are formed and the level of ab-
magnitude) and the temporal and spatial re- straction at which they are formed (see
lations between them. Any impression of Medin & Rips, Chap. 3 ; Rosch, 1 978)? Sim-
causality linking two constituent events, he ilarly, what determines which events are re-
argued, is a mental construct. garded as analogous (see Holyoak, Chap. 6)?
Psychologists entered the arena to study The “cause” categories in causal learning ex-
the exact nature and determinants of such periments were typically predefined by the
mental constructs. Michotte (1 946/1 963 ) experimenter in terms of a variable with
investigated the perceptual processing of a single causal value and do not have the
causal events (mostly impact of one mov- structure of natural categories (see Lien &
ing object on another object, the “launching Cheng, 2000, for an exception). If the rela-
effect”). Many researchers since then have tions inferred have no generality, they can-
argued that such perception of causality is not be applied to novel but similar events,
modular or encapsulated (for an overview, thus failing to fulfill a primary function of
see Scholl & Tremoulet, 2000) and not sub- causal inference.
ject to conscious inference. To some, the It is perhaps the segregation of re-
encapsulation puts the process outside the search on category formation and on causal
learning that has engendered the mechanism and a general causal framework in scien-
view, which pits top-down and bottom-up tific explanation. Illustrating with cases in
causal reasoning against each other. It has medical history (e.g., the bacterial theory of
been argued that inferring a causal connec- ulcers), he showed that inferring a causal
tion is contingent on insight into the mecha- connection is not contingent on insight into
nism (i.e., a network of intervening causal an intervening mechanism, but is bolstered
relations) by which the candidate cause by it. The inferred causal networks subse-
brings about its effect (e.g., Ahn, Kalish, quently explain novel instances when the
Medin, & Gelman, 1 995 ). A commonly networks are instantiated by information
used research paradigm involved providing on the instances. Maximizing explanatory
participants with current information con- coherence might be a process closely inter-
cerning the covariation between potential twined with causal discovery, but nonethe-
causes and effects at some designated level less separate from it, that one would incor-
of abstraction but manipulating whether a porate in an artificial reasoning system.
(plausible) causal mechanism was presented In the rest of this chapter, we review the
(Ahn, Kalish, Medin, & Gelman, 1 995 ; Bul- main computational accounts of causal dis-
lock, Gelman, & Baillargeon, 1 982; Shultz, covery. We first review statistical models,
1 982; White, 1 995 ), with the causal mecha- then problems with the statistical approach,
nism implying more reliable covariation in- problems that motivate a causal account that
formation at a different, more abstract, level. incorporates assumptions involving alterna-
The common finding from these studies was tive causes. We follow these accounts with
that participants deemed knowledge about a review of new empirical tests of the two
causal power or force as more significant approaches. We then broaden our scope to
than what was designated as covariational consider the possible levels of abstraction of
information. Studies in this “causal power” a candidate cause and the analogous problem
tradition are valuable in that they demon- of the possible temporal lag of a causal rela-
strate the role of abduction and coherence: tion. These issues have implications for cat-
People indeed strive to link causes and egory formation. We end the chapter with a
effects mentally by postulating the (per- sketch of future research directions from a
haps hypothetical) presence of some known computational perspective.
causal mechanism that connects them in an
attempt to create the most coherent ex-
planation encompassing multiple relevant Information Processing Accounts
pieces of knowledge (see Holland, Holyoak,
Nisbett, & Thagard, 1 986, on abduction; see A Statistical Approach
Thagard, 1 989, for accounts of coherence).
This work shows that coherence plays a key overview
role in the application of causal knowledge Some computational accounts of causal dis-
(also see Lien & Cheng, 2000; coherence also covery are only concerned with statistical
plays a role in causal discovery, see Cheng, information (e.g., Allan & Jenkins, 1 980;
1 993 ). However, the argument that inferring Chapman & Robbins, 1 990; Jenkins & Ward,
a causal relation is contingent on belief in 1 965 ; Rescorla & Wagner, 1 972), ignoring
an underlying causal network is circular – it hypotheses regarding unobservable causal
simply pushes the causal discovery question relations (see Gallistel’s, 1 990, critique of
one step back. How was knowledge about these models as being unrepresentational).
the links in the causal network discovered in Such accounts not only adopt Hume’s
the first place? (1 73 9/1 888) problem but also his solution.
Rather than pitting covariation and To these theorists, causality is nothing more
prior causal knowledge against each other, than a mental habit, a fictional epiphe-
Thagard (2000) offered a complementary nomenon floating unnecessarily on the sur-
view of covariation, prior causal knowledge, face of indisputable facts.1 After all, causal
relations are unobservable. In fact, Karl Pear-

son, one of the fathers of modern statis-
tics, subscribed to a positivist view and con-
cluded that calculating correlations is the
ultimate and only meaningful transforma-
A B
tion of evidence at our disposal: “Beyond
such discarded fundamentals as ‘matter’ and
‘force’ lies still another fetish amidst the in- C D
scrutable arcana of modern science, namely,
the category of cause and effect” (Pearson,
1 892/1 95 7). Correlation at least enables one Figure 7.1 . A standard 2 × 2 contingency table.
to make predictions based on observations A through D are labels for the frequencies of
event types resulting from a factorial
even when the predictions are not accom-
combination of the presence and absence of
panied by causal understanding.
cause c and effect e.
Psychological work in this area was pi-
oneered by social psychologists, most no-
(see Rescorla, 1 968, for a parallel demon-
tably Kelley (1 973 ), who studied causal at-
stration of the role of contingency in rats).
tributions in interpersonal exchanges. His
As for temporal contiguity, Shanks, Pearson,
ANOVA model specifies a set of inference
and Dickinson (1 989) showed that separat-
rules that indicate, for instance, whether
ing cause and effect in time tends to decrease
a given outcome arose owing to particular
impressions of causality (see also Buehner &
aspects of the situation, the involved per-
May, 2002, 2003 , 2004). This pattern of re-
son(s), or both.
sults, Shanks and Dickinson argued, parallels
Around the same time in a different do-
well-established findings from conditioning
main (Pavlovian and instrumental condition-
studies involving nonhuman animals.
ing), prediction based on observations was
Contingency and temporal contiguity are
also the primary concern. Predictive learn-
conditions that enable causal learning. A
ing in conditioning, often involving non-
robust feature of the resultant acquisition
human animals, and causal reasoning in hu-
of causal knowledge is that it is gradual
mans showed so many parallels (Rescorla,
and can be described by a negatively ac-
1 988) that associative learning theorists
celerated learning curve with judgments
were prompted to apply models of condi-
reaching an equilibrium level under some
tioning to explain causal reasoning. Explain-
conditions after sufficient training (Shanks,
ing causal learning with associative theories
1 985 a, 1 987).
implies a mapping of causes to cues (or CSs)
and effects to outcomes (or USs). In a de- A Statistical Model for Situations with One
tailed review, Shanks and Dickinson (1 987; Varying Candidate Cause For situations in-
see Dickinson, 2001 , for a more recent re- volving only one varying candidate cause,
view) noted that the two cornerstones of an influential decision rule for almost four
associative learning, cue-outcome contingency decades has been the P rule:
and temporal contiguity, also drive human
P = p(e | c) − P (e | c̄) (Eq. 7.1 )
causal learning (also see Miller & Matute,
1 996). To a first approximation, association according to which the strength of the rela-
matters: The more likely that a cause will tion between a binary cause c and effect e
be followed by an effect, the stronger paris determined by their contingency or proba-
ticipants believe that they are causally re- bilistic contrast – the difference between the
lated. However, if this probability stays con- probabilities of e in the presence and ab-
stant, but the probability with which the ef- sence of c (see, e.g., Allan & Jenkins, 1 980;
fect occurs in the absence of the cause in- Jenkins & Ward, 1 965 ). P is estimated by
creases, causal judgments tend to decrease; relative frequencies. Figure 7.1 displays a
in other words, it is contingency that matters contingency table where A and B represent
the frequencies of occurrence of e in the tual outcome of each trial (usually 1 .0 if it

presence and absence of c, respectively, and is present and 0 if it is absent); and V is
C and D represent the frequencies of nonoc- the expected outcome defined as the sum
currence of e in the presence and absence of of all associative strengths of all CSs present
c, respectively. P (e|c) is estimated by A A+ B , on that trial. Each time a cue is followed by
C
and P (e|c̄) is estimated by C + D
. an outcome, the association between them is
If P is positive, then c is believed to pro- strengthened (up to the maximum strength
duce e; if it is negative, then c is believed to US can support, λ); each time the cue is pre-
prevent e; and if P is zero, then c and e are sented without the outcome, the association
not believed to be causally related to each weakens (again within certain boundaries,
other. Several modifications of the P rule −V, to account for preventive cues).
have been discussed (e.g., Anderson & Sheu, For situations involving only one varying
1 995 ; Mandel & Lehman, 1 998; Perales & cue, its mean weight at equilibrium accord-
Shanks, 2003 ; Schustack & Sternberg, 1 981 ; ing to the RW algorithm has been shown
White, 2002). All these modifications pa- to equal P if the value of β remains the
rameterize the original rule in one way or same when the US is present and when it
another and thus, by allowing extra degrees is absent (for the λ values just mentioned;
of freedom, manage to fit certain aspects of Chapman & Robbins, 1 990). In other words,
human judgment data better than the orig- this simple and intuitive algorithm elegantly
inal rule. What is common across all these explains why causal learning is a function
models, however, is that they take covaria- of contingency. It also explains a range of
tional information contained in the contin- results for designs involving multiple cues
gency table as input and transform it into a such as blocking (see “Blocking: Illustrating
measure of causal strength as output without an Associationist Explanation” section), con-
any consideration of the influence of alterna- ditioned inhibition, overshadowing, and cue
tive causes. Whenever there is confounding validity (Miller, Barnet, & Grahame, 1 995 ).
by an alternative cause (observed or unob- For some of these designs, the mean weight
served), the P rule fails. of a cue at equilibrium has been shown to
equal P conditional on the constant pres-
A Statistical Model for Situations Involving
ence of other cues that occur in combina-
Multiple Varying Candidate Causes Predic-
tion with that cue (see Cheng, 1 997; Danks,
tive learning, of course, is the subject of
2003 ). Danks derived the mean equilibrium
associative learning theory. An appeal of
weights for a larger class of designs.
this approach is that it is sometimes capa-
ble of explaining inference involving multi-
blocking: illustrating an associationist
ple causes. The most influential such theory explanation
(Rescorla & Wagner, 1 972, and all its vari-
Beyond the cornerstones, the parallels be-
ants since) is based on an algorithm of error
tween conditioning and human causal learn-
correction driven by a discrepancy between
ing are manifested across numerous experi-
the expected and actual outcomes. For each
mental designs often called paradigms in the
learning trial where the cue was presented,
literature. One parallel involves the block-
the model specifies
ing paradigm. Using a Pavlovian condition-
VCS = αCS βUS (λ − V ) (Eq. 7.2) ing paradigm, Kamin (1 969) established cue
B as a perfect predictor for an outcome (B+,
where V is the change in the strength of with “+” representing the occurrence of the
a given CS–US association on a given trial outcome). In a subsequent phase, animals
(CS = conditioned stimulus, e.g., a tone; were presented with a compound consist-
US = unconditioned stimulus, e.g., a foot- ing of B and a new, redundant cue A. The
shock); α and β represent learning rate pa- AB compound was also always followed by
rameters reflecting the saliencies of the CS the outcome (AB+), yet A received little
and US, respectively; λ stands for the ac- conditioning; its conditioning was blocked by
B. According to RW, B initially acquires the in which knowledge about AB and B is ac-
maximum associative strength supported by quired should play a role under a causal
the stimulus. Because the association be- learning perspective. This is not so under
tween B and the outcome is already at an associative learning perspective, however.
asymptote when A is introduced, there is no The standard assumption here is that the
error left for A to explain. In other words, strength of a cue can only be updated when
the outcome is already perfectly predicted that cue is present. In the backward block-
by B, and nothing is left to be predicted by ing paradigm, however, participants retro-
A, which accounts for the lack of condition- spectively alter their estimate of A on the B+
ing to cue A. Shanks (1 985 b) replicated the trials in phase 2. In other words, the P of A,
same finding in a causal reasoning experi- conditional on the presence of B, decreases
ment with human participants, although the over a course of trials in which A is actually
human responses seem to reflect uncertainty absent, and the algorithm fails to track the
of the causal status of A rather than cer- covariation for A.
tainty that it is noncausal (e.g., Waldmann & Several modifications of RW have been
Holyoak, 1 992). proposed to allow the strengths of absent
cues to be changed, for instance, by setting
failure of the rw algorithm to track the learning parameter α negative on trials
covariation when a cue is absent where the cue is absent (see Dickinson &
The list of similarities between animal condi- Burke, 1 996; Van Hamme & Wasserman,
tioning and human causal reasoning seemed 1 994). Such modifications can explain back-
to grow, prompting the interpretation that ward blocking and some other findings
causal learning is nothing more than asso- showing retrospective revaluation (see, e.g.,
ciative learning. However, Shanks’ (1 985 b) Larkin, Aitken, & Dickinson, 1 998; for an
results also revealed evidence for back- extensive review of modifications to asso-
ward blocking; in fact, there is evidence for ciative learning models applicable to human
backward blocking even in young children learning, see De Houwer & Beckers, 2002).
(Gopnik et al., 2004). In this procedure, the However, they also oddly predict that one
order of learning phases is simply reversed; will have difficulty learning that there are
participants first learn about the perfect re- multiple sufficient causes of an effect. For
lation between AB and the outcome (AB+) example, if one sometimes drinks both tea
and subsequently learn that B by itself is also and lemonade, then learning that tea alone
a perfect predictor (B+). Conceptually, for- can quench thirst will cause one to unlearn
ward and backward blocking are identical – that lemonade can quench thirst. They also
at least from a causal perspective. A causal fail when two steps of retrospective revalua-
explanation might go: If one knows that A tion are required. Macho and Burkart (2002)
and B together always produce an effect, and demonstrated that humans are capable of it-
one also knows that B by itself also always erative retrospective revaluation, a backward
produces the effect, one can infer that B is process whereby the causal strength of a
a strong cause. A, however, could be a cause, target cause is disambiguated by evaluating
even a strong one, or noncausal; its causal sta- another cause, which in turn is evaluated
tus is unclear. Typically, participants express by drawing on information about a third
such uncertainty with low to medium rat- cause (see also Lovibond, Been, Mitchell,
ings relative to ratings from control cues that Bouton, & Frohardt, 2003 , for further evi-
have been paired with the effect an equal dence that blocking in human causal reason-
number of times (see Cheng, 1 997, for a ing is inferential, and De Houwer, 2002, for
review). a demonstration that even forward blocking
Beyond increasing susceptibility to atten- recruits retrospective inferences). In these
tion and memory biases (primacy and re- cases, P with other cues controlled coin-
cency, cf. for example, Dennis & Ahn, 2001 ), cides with causal intuitions, but associative
there is no reason why the temporal order models fail to track conditional P.
Causal Inference Goes Beyond ing belongs to a category that does not cause
Covariation Tracking sunrise (and does not belong to any category
predictions by the statistical view that does cause sunrise), and the confounded
and the causal mechanism view on covariation between crowing and sunrise is
some intuitive examples disregarded as spurious.
Even successfully tracked covariation, how- Our consideration shows that, contrary
ever, does not equal causation, as we il- to the causal mechanism view, prior knowl-
lustrated earlier and as every introductory edge of noncausality neither precludes nor
statistics text warns. None of these cases can refutes observation-based causal discovery.
be explained by the P rule in Eq. (7.1 ). For Thagard (2000) gave a striking historic illus-
example, even if the P for rooster crow- tration of this fact. Even though the stomach
ing is 1 , nobody would claim that the crow- had been regarded as too acidic an environ-
ing caused the sun to rise. Although the ment for viruses to survive, a virus was in-
candidate cause, crowing, covaries perfectly ferred to be a cause of stomach ulcer. Prior
with the effect, sunrise, there is an alterna- causal knowledge may render a novel can-
tive cause that covaries with the candidate: didate causal relation more or less plausible
Whenever the rooster crows, the Earth’s ro- but cannot rule it out definitively. Moreover,
tation is just about to bring the farm toward prior causal knowledge is often stochastic.
the sun. Our intuition would say that be- Consider a situation in which one observes
cause there is confounding, one cannot draw that insomia results whenever one drinks
any causal conclusion. This pattern of in- champagne. Now, there may be a straightfor-
formation fits the overshadowing design. If ward physiological causal mechanism link-
crowing is the more salient of the two con- ing cause and effect, but it is also plausible
founded cues, then RW would predict that that the relation is not causal; it could eas-
crowing causes sunrise. ily be that drinking and insomnia are both
Let us digress for a moment to consider caused by a third variable – for example, at-
what the causal mechanism view predicts. tending parties (cf. Gopnik et al., 2004).
Power theorists might argue that the ab- Returning to the pitfall of statistical and
sence of a plausible mechanism whereby a associative models, besides the confounding
bird could influence the motion of stellar problem, we find that there is the overde-
objects, rather than anything that has to do termination problem, where two or more
with covariation, is what prevents us from causes covary with an effect, and each cause
erroneously inducing a causal relation. In by itself would be sufficient to produce
this example, in addition to the confound- the effect. The best-known illustration of
ing by the Earth’s rotation, there happens overdetermination is provided by Mackie
to be prior causal knowledge, specifically, of (1 974): Imagine two criminals who both
the noncausality of a bird’s crowing with re- want to murder a third person who is about
spect to sunrise. Tracing the possible origin of to cross a desert; unaware of each other’s
that knowledge, however, we see that we do intentions, one criminal puts poison in the
have covariational information that allows victim’s water bottle, while the other punc-
us to arrive at the conclusion that the re- tures the bottle. Each action on its own co-
lation is noncausal. If we view crowing and varies perfectly with the effect, death, and
sunrise at a more general level of abstrac- would have been sufficient to bring the ef-
tion, namely, as sound and the movement fect about. However, in the presence of the
of large objects, we no longer have the con- alternative cause of death (a given fact in
founding we noted at the specific level of this example), so that there is no confound-
crowing and sunrise. We have observed that ing, varying each candidate cause in this case
sounds, when manipulated at will so alter- makes no difference; for instance, the P for
native causes do occur independently of the poison with respect to death, conditional on
candidate, thus allowing causal inference, do the presence of the puncturing of the wa-
not move large objects. Consequently, crow- ter canteen, is 0! So, Mackie’s puzzle goes,
which of the two criminals should be called strains the pool of possible candidate causes
the murderer? Presumably, a lawyer could of an effect. A straightforward demonstra-
defend each criminal by arguing that their tion that humans are sensitive to the direc-
respective deed made no difference to the tion of the causal arrow was provided by
victim’s ultimate fate – he would have died Waldmann and Holyoak (1 992).
anyway as a result of the other action (but A corollary of the directional nature of
see Katz, 1 989; also see Ellsworth, Chap. the causal arrow, Waldmann and Holyoak
28; Pearl, 2000; and Wright, 1 985 , on ac- (1 992) reasoned, is that only causes, but
tual causation). Mackie turned to the actual not effects, should “compete” for explana-
manner of death (by poison or by dehydra- tory power. Let us first revisit the blocking
tion) for a solution. But, suppose the death paradigm with a causal interpretation. If B
is discovered too late to yield useful autopsy is a perfect cause of an outcome O, and A
information. Would the desert traveler then is only presented in conjunction with B, one
have died without a cause? Surely our in- has no basis of knowing to what extent, if
tuition says no: The lack of covariation in at all, A actually produces O. Consequently,
this case does not imply the lack of causation the predictiveness of A should be depressed
(see Ellsworth, Chap. 28; Spellman & Kin- relative to B in a predictive situation. How-
cannon, 2001 , for studies on intuitive judg- ever, if B is a consistent effect of O, there is no
ments in situations involving multiple suffi- reason why A cannot also be an equally con-
cient causes). What matters is the prediction sistent effect of O. Alternative causes need
of the consequences of actions, such as poi- to be kept constant to allow causal inference,
soning, which may or may not be revealed but alternative effects do not. Consequently,
in the covariation observed in a particular the predictiveness of A should not be de-
context. pressed in a diagnostic situation.
This asymmetric prediction was tested us-
Empirical Findings on Humans and Rats ing scenarios to manipulate whether a vari-
able is interpreted as a candidate cause or
The observed distinction between covaria- an effect without changing the associations
tion and causation in the causal learning liter- between variables. For example, participants
ature corroborates intuitive judgment in the had to learn the relation between several
rooster and desert traveler examples. It is no light buttons and the state of an alarm sys-
wonder that Pearson’s condemnation of the tem. The instructions introduced the but-
concept of causality notwithstanding, con- tons as causes for the alarm in the predictive
temporary artificial intelligence has whole- condition but as potential consequences of
heartedly embraced causality (see, for exam- the state of the alarm system in the diagnos-
ple, Pearl, 2000). We now review how hu- tic condition.
man causal reasoning capacities exceed the As predicted: There was blocking in the
mere tracking of stimulus–outcome associa- predictive condition, but not in the diagnos-
tions. tic condition. These results reveal that hu-
mans are sensitive to, and make use of, the
the direction of causality direction of the causal arrow.
As mentioned earlier, correlations and asso- Associationists in fact have no reason
ciations are bidirectional (for implications of for objecting to using temporal information.
the bidirectional nature of associations on Unlike causal relations, temporal ordering is
conditioning, see, e.g., Miller & Barnet, 1 993 ; observable. To address the problem raised by
and Savastano & Miller, 1 998) and thus can- Waldmann and Holyoak (1 992), association-
not represent directed causal information. ist models can specify that, when applied
However, the concept of causality is funda- to explain causal learning, candidate causes
mentally directional (Reichenbach, 1 95 6) in can precede their effects, but not vice versa,
that causes produce effects, but effects can- and that the temporal ordering that counts is
not produce causes. This directionality con- that of the actual occurrence of events rather
than that of the recount of events to the rea- entist really conclude that? No, the scientist
soner. Previous associationist models, how- would instead recognize that she has con-
ever, have not made a distinction between ducted a poor experiment. For some reason,
occurrence and presentation order. There- her sample suffered from a preventive ver-
fore, by default, they treat the buttons, for sion of the ceiling effect – the effect never
which information was presented first, as occurred, regardless of the manipulation. If
cues and the alarm, for which information the effect never occurs in the first place, how
was presented second, as an outcome, and can a preventive intervention be expected to
hence, predict equal amounts of cue com- prove its effectiveness?
petition in both scenarios. Even rats seem to appreciate this argu-
Instead of amending associationist mod- ment. When an inhibitory cue, that is, one
els to treat the order of actual occurrence with negative associative strength, is repeat-
as critical, which would be natural un- edly presented without the outcome so that
der a computational approach, researchers the actual outcome is 0 whereas the ex-
criticized Waldmann and Holyoak’s (1 992) pected outcome is negative, associative mod-
findings on technical grounds (Matute, els would predict that the cue reduces its
Arcediano, & Miller, 1 996; Shanks & Lopez, strength toward 0. That is, in a noncausal
1 996). Follow-up work from Waldmann’s world, we would unlearn our preventive
lab (Waldmann, 2000, 2001 ; Waldmann & causes whenever they are not accompanied
Holyoak, 1 997), however, has demonstrated by a generative cause. For example, when we
that the asymmetry in cue competition is in- inoculate child after child with polio vac-
deed a robust finding (Waldmann, 2001 ). cine in a country and there is no occur-
rence of polio in that country, we would
come to believe that the polio vaccine does
ceiling effects and people’s sensitivity not function anymore (rather than merely
to proper experimental design that it is not needed). To the contrary, even
A revealing case of the distinction between for rats, the inhibitory cue retains its nega-
covariation and causation has to do with tive strength (Zimmerhart-Hart & Rescorla,
what is known in experimental design as 1 974). In other words, when an outcome in
a ceiling effect. This case does not involve question never occurred, both when a condi-
any confounding. We illustrate it with the tioned inhibitory cue was present and when
preventive version of the effect, which is it was not, the rats apparently treated the
never covered in courses on experimental zero P value as uninformative and retained
design – the underlying intuition is so pow- the inhibitory status of the cue. In this case,
erful it needs no instructional augmenta- in spite of a discrepancy between the ex-
tion. Imagine that a scientist conducts an pected and actual outcomes, there is no re-
experiment to find out whether a new drug vision of causal strength. We are not aware
cures migraine. She follows the usual proce- of any modification of associative algorithms
dure and administers the drug to an exper- that can accomodate this finding.
imental group of patients, while an equiv- Notice that in the hypothetical migraine
alent control group receives a placebo. At experiment, one can in fact conclude that
the end of the study, the scientist discov- the drug does not cause migraine. Thus, given
ers that none of the patients in the ex- the exact same covariation, one’s conclu-
perimental group, but also none of the pa- sion differs depending on the direction of
tients in the control group, suffered from influence under evaluation (generative vs.
migraine. If we enter this information into preventive). Wu and Cheng (1 999) con-
the P rule, we see that P(e|c) = 0 and ducted an experiment that showed that
P(e|c̄) = 0, yielding P = 0. According to beginning college students, just like experi-
the P rule and RW, this would indicate enced scientists, refrain from making causal
that there is no causal relation; that is, the inferences in the generative and preventive
drug does not cure migraine. Would the sci- ceiling effects situations. People’s preference
to refrain from causal judgment in such sit- a computational-level theory

uations is at odds with purely covariational of causal induction
or associative accounts. What must the pro- Cheng (1 997)’s power PC theory (short for a
cess of human causal induction involve so causal power theory of the probabilistic con-
it will reflect people’s unwillingness to en- trast model) starts with the Humean con-
gage in causal inference in such situations? straint that causality can only be inferred
More generally, what must this process in- using observable evidence (in the form of
volve so it will distinguish causation from covariations and temporal and spatial infor-
mere covariation? mation) as input to the reasoning process.
She combines that constraint with Kant’s
(1 781 /1 965 ) postulate that reasoners have an
a priori notion that types of causal relations
A Causal Network Approach
exist in the universe. This unification can
A solution to the puzzle posed by the best be illustrated with an analogy. Accord-
distinction between covariation and causa- ing to Cheng, the relation between a causal
tion is to test hypotheses involving causal relation and a covariation is like the relation
structures (Cheng, 1 997; Novick & Cheng, between a scientific theory and a model. Sci-
2004; Pearl, 1 988, 2000; Spirtes, Glymour, & entists postulate theories (involving unob-
Scheines, 1 993 /2000). Pearl (2000) and servable entities) to explain models (i.e., ob-
Spirtes et al. (1 993 /2000) developed a for- served regularities or laws); the kinetic the-
mal framework for causal inference based on ory of gases, for example, is used to explain
causal Bayesian networks. In this framework, Boyle’s law. Boyle’s law describes an observ-
causal structures are represented as directed able phenomenon, namely that pressure ×
acyclic graphs, graphs with nodes connected volume = constant (under certain boundary
by arrows. The nodes represent variables, conditions), and the kinetic theory of gases
and each arrow represents a direct causal re- explains in terms of unobservable entities
lation between two variables. “Acyclic” refers why Boyle’s law holds (gases consist of small
to the constraint that the chains formed by particles moving at a speed proportional to
the arrows are never loops. The graphs are their temperature, and pressure is generated
assumed to satisfy the Markov condition, by the particles colliding with the walls of
which states that for any variable X in the the container). Likewise, a causal relation is
graph, for any set S of variables in the graph the unobservable entity that reasoners hope
not containing any direct or indirect effects to infer in order to explain observable regu-
of X, X is jointly independent of the vari- larities between events (Cheng, 1 997).
ables in S conditional on any set of values This distinction between a causal relation
of the set of variables that are direct causes as a distal, postulated entity and covariation
of X (see Pearl, 1 988, 2000; Spirtes et al., as an observable, proximal stimulus implies
1 993 /2000). An effect of X is a variable that that there can be situations in which there is
has (1 ) an arrow directly from X pointing observable covariation but causal inference
into it or (2) a pathway of arrows originating is not licensed. Computationally, this means
from X pointing into it. Gopnik et al. (2004) that causality is represented as an unbound
proposed that people are able to assess pat- variable (cf. Doumas & Hummel, Chap. 4;
terns of conditional independence using the Holyoak & Hummel, 2000) represented sep-
Markov assumption and infer entire causal arately and not bound to covariation, allow-
networks from the patterns. Cheng (1 997) ing situations in which covariation has a def-
proposed instead that people (and perhaps inite value (e.g., 0, as in the ceiling effect)
other species) evaluate one causal relation but causal power has no value. Traditional
in a network at a time while taking into models (Allan & Jenkins, 1 980; Anderson &
consideration other relations in the network. Sheu, 1 995 ; Jenkins & Ward, 1 965 ; Man-
Clearcut evidence discriminating between del & Lehman, 1 998; Schustack & Sternberg,
these two variants is still unavailable. 1 981 ; White, 2002; and Rescorla & Wagner,
1 972), which are purely covariational, do Equation (7.3 ) “explains” that, given that c
not represent causality as a separate vari- has occurred, e is produced by c or by the
able. Hence, whenever there is observed composite a, nonexclusively (e is jointly pro-
covariation, they will always compute a defi- duced by both with a probability that fol-
nite causal strength. In an analogy to percep- lows from the independent influence of c and
tion, one could say that such models never a on e). Equation (7.4) “explains” that given
go beyond describing features of the proxi- that c did not occur, e is produced by a alone.
mal stimulus (observable evidence – covari- It follows from Eqs. (7.3 ) and (7.4) that
ation or image on the retina) and fail to infer
features of the distal stimulus (causal power Pc = qc + P (a | c) · qa − qc · P (a | c) · qa
that produced the covariation or object in −P (a | c̄) · qa (Eq. 7.5 )
the 3D world that produced retinal images).
How then does the power PC theory From Eq. (7.5 ), it can be seen that unless
(Cheng, 1 997) go beyond the proximal stim- c and a occur independently, there are four
ulus and explain the various ways in which unknowns: −qc , qa , P (a | c), and P (a | c̄); it
covariation does not imply causation? The follows that, in general, despite P’s having a
first step in the solution is the inclusion definite value, there is no unique solution for
of unobservable entities, including the de- qc . This failure corresponds to our intuition
sired unknown, the distal causal relation, in that covariation need not imply causation –
the equations. The theory partitions all (ob- an intuition that purely covariational models
served and unobserved) causes of effect e are incapable of explaining.
into the candidate cause in question, c, and In the special case in which a occurs inde-
a, a composite of all alternative causes of pendently of c (e.g., when alternative causes
e. The unobservable probability with which are held constant), Eq. (7.5 ) simplifies to
c produces e (in other words, the probabil- Eq. (7.6),
ity that e occurs as a result of c’s occurring)
is termed the generative power of c, repre- P
qc = (Eq. 7.6)
sented by qc here. When P ≥ 0, qc is the 1 − P (e | c̄)
desired unknown. Likewise, when P ≤ 0,
the preventive power of c is the desired unin which all variables besides qc are observ-
known. Two other relevant theoretical unable. In this case, qc can be solved. Being
knowns are qa , the probability with which able to solve for qc only under the condi-
a produces e when it occurs, and P(a), the tion of independent occurrence explains why
probability with which a occurs. The com- manipulation by free will encourages causal
posite a may include unknown and there- inference (the principle of control in ex-
fore unobservable causes. Because any causal perimental design and everyday reasoning).
power may have a value of 0, or even When one manipulates a variable, that deci-
no value at all, these variables are merely sion by free will is likely to occur indepen-
hypotheses – they do not presuppose that dently of alternative causes of that variable.
c and a indeed have causal influence on e. At the same time, the condition of indepen-
The idea of a cause producing an effect and dent occurrence explains why causal infer-
the idea of a cause preventing an effect are ences resulting from interventions are not
primitives in the theory. always correct. Alternative causes are un-
On the assumption that c and a influ- likely to covary with one’s decision to ma-
ence e independently, the power PC the- nipulate, but sometimes they may, as the
ory explains the two conditional probabil- food allergy example illustrates. Note that
ities defining P as follows: the principle of “no confounding” is a result
in this theory, rather than an unexplained
P (e | c) = qc + P (a | c) · qa − qc · P (a | c) · qa axiomatic assumption, as it is in current
(Eq. 7.3 ) scientific methodology (also see Dunbar &
P (e | c̄) = P (a | c̄) · qa (Eq. 7.4) Fugelsang, Chap. 29).
An analogous explanation yields pc , the causal power involving a single factor will
power of c to prevent e hold. That is, under that condition, it does
−P not matter what the reasoner assumes about
pc = (Eq. 7.7) the independent influence of c and a on e.
P (e | c̄)
Now it is also obvious how the power PC
experimental tests of a computational
theory can explain why the ceiling effects causal power approach
block causal inference (even when there is
The predictions made by the power PC the-
no confounding) and do so under differ-
ory and by noncausal accounts differ in di-
ent conditions. In the generative case, e al-
verse ways. We review three of these dif-
ways occurs, regardless of the manipulation;
ferences in this section. The first concerns
hence, P (e | c) = P (e | c̄) = 1 , leaving qc in
a case in which covariation does not equal
Eq. (7.6) with an undefined value. In con-
causation. The second concerns a qualitative
trast, in the preventive case, e never occurs
pattern of the influence of P (e | c̄), the base
again regardless of the manipulation; there-
rate of e, for candidate causes with the same
fore, P (e | c) = P (e | c̄) = 0, leaving pc in
P. The third concerns the flexible and co-
Eq. (7.7) with an undefined value.
herent use of causal power to make causal
Although the theory distinguishes be-
predictions.
tween generative and preventive causal
powers, this distinction does not constitute More Studies on Covariation and Causa-
a free parameter. Which of the two equation We have already mentioned Wu and
tions applies readily follows from the value Cheng’s (1 999) study on ceiling situations,
of P. On occasions where P = 0, both showing that they distinguish covariation
equations apply and make the same predic- from causation. Lovibond et al. (2003 ) re-
tion, namely, that causal power should be 0 ported a further test of this distinction. Their
except in ceiling effect situations. Here, the experiments are not a direct test of the
reasoner has to make a pragmatic decision power PC theory because they do not in-
on whether he or she is evaluating the ev- volve binary variables only. They do, how-
idence to assess a preventive or generative ever, test the same fundamental idea under-
relation, and whether the evidence at hand lying a distal approach. That is, to account
is meaningful or not for that purpose. for the distinction between covariation and
Most causes are complex, involving not causation, there must be an explicit repre-
just a single factor but a conjunction of fac- sentation of unobservable causal relations.
tors operating in concert. In other words, Lovibond et al. (2003 ) tested human sub-
the assumption made by the power PC the- jects on “backward blocking” and on “re-
ory that c and a influence e independently lease from overshadowing,” when the out-
is false most of the time. When this as- come (an allergic reaction to some food)
sumption is violated, if an alternative cause occurred at what the subjects perceived as
(part of a) is observable, the independent the “ceiling” level for one condition and at
influence assumption can be given up for an intermediate level for another condition.
that cause, and progressively more complex The release-from-overshadowing condition
causes can be evaluated using the same dis- involved a retrospective design, and differed
tal approach that represents causal powers. from the backward blocking condition only
This approach has been extended to eval- in that, when the blocking cue B (the cue
uate conjunctive causes involving two fac- that did appear by itself ) appeared, the out-
tors (see Novick & Cheng, 2004). Even if come did not occur. Thus, considering the
alternative causes are not observable, how- effect of cue A, the cue that never appeared
ever, Cheng (2000) showed that as long as by itself, with cue B held constantly present,
they occur with about the same probability one sees that introducing A made a differ-
in the learning context as in the generaliza- ence to the occurrence of the outcome. This
tion context, predictions according to simple nonzero P implies causality, regardless of
whether the outcome occurred (given the condition? This result adds to the challenges
compound) at a ceiling or nonceiling level. for associative accounts. Both designs in-
The critical manipulation was a “pretrain- volved retrospective revaluation, but even
ing compound” phase during which one modifications of associative models that ex-
group of subjects, the ceiling group, saw plain retrospective revaluation cannot ex-
that a combination of two allergens pro- plain this difference. In contrast, a simple
duced an outcome at the same level (“an and intuitive answer follows from a causal
allergic reaction”) as a single allergen (i.e., account.
the ceiling level). In contrast, the nonceil-
ing group saw that a combination of two Base Rate Influence on Conditions with
allergens produced a stronger reaction (“a Identical P Several earlier studies on hu-
STRONG allergic reaction”) than a single man contingency judgment have reported
allergen (“an allergic reaction”). Following that, although P clearly influences
this pretraining phase, all subjects were pre- causal ratings (e.g., Allan & Jenkins, 1 980;
sented with information regarding various Wasserman, Elek, Chatlosh, & Baker, 1 993 ),
cues and outcomes according to their assign- for a given level of P, causal ratings diverge
ment to the backward-blocking or release- from P as the base rate of the effect e,
from-overshadowing groups. Critically, the P (e | c̄) increases. If we consider Eq. (7.6)
outcome in this main training phase always (the power PC theory) for any constant
only occurred at the intermediate level (“an positive P, causal ratings should increase
allergic reaction”) for both the ceiling and as P (e | c̄) increases. Conversely, according
nonceiling groups. Ingeniously, as a result of to Eq. (7.7), preventive causal ratings
pretraining, subjects’ perception of the level should decrease as P (e | c̄) increases for
of the outcome in the main phase would be the same negative P. Zero contingencies,
expected to differ. For the exact same out- however, regardless of the base rate of e,
come, “an allergic reaction,” the only form should be judged as noncausal (except
of the outcome then, whereas the ceiling when judgment should be withheld due
group would perceive it to occur at the ceil- to ceiling effects). No other current model
ing level, the nonceiling group would per- of causal learning predicts this qualitative
ceive it to occur at an intermediate level. For pattern of the influence of the base rate
the backward-blocking condition for both of e, although some covariational or asso-
groups, cue A made no difference to the ciative learning models can explain one or
occurrence of the outcome (holding B con- another part of this pattern given felicitous
stant, there was always a reaction whether or parameter values. For example, in the RW,
not A was there). However, as explained by if βUS > βUS , causal ratings will always
the power PC theory, whereas a P of 0 im- increase as base rate increases, whereas
plies noncausality (i.e., a causal rating of 0) the opposite trend would be obtained if
when the outcome occurred at a nonceiling the parameter ordering were reversed.
level, the same value does not allow causal Another prominent associative learning
inference when the outcome occurred at a model, Pearce’s (1 987) model of stimulus
ceiling level. In support of this interpreta- generalization, can likewise account for
tion, the mean causal rating for cue A was opposite base rate influences in positive and
reliably lower for the nonceiling group than negative contingencies if the parameters are
for the ceiling group. In contrast, recovery set accordingly, but this model would then
from overshadowing was not dependent on additionally predict a base rate influence on
whether or not the outcome was perceived noncontingent conditions.
to occur at a ceiling level. Figure 7.2 illustrates the intuitiveness of
Why does the level at which the outcome a rating that deviates from P. The rea-
was perceived to occur lead to different re- soning is counterfactual. P (e | c̄) estimates
sponses in the backward-blocking condition the “expected” probability of e in the pres-
but not in the release-from-overshadowing ence of c if c had been absent so that only
Figure 7.2 . Examples of stimulus materials from a

condition in Buehner et al. (2003 ).
causes other than c exerted an influence on e. itive and negative contingencies in the way
A deviation from this counterfactual proba- that power PC predicts. However, contrary
bility indicates that c is a simple cause of to the predictions of the power PC the-
e. Under the assumption that the patients ory, Buehner and Cheng (1 997) also found
represented in the figure were randomly as- that base rate did not only influence con-
signed to the two groups, one that received tingent conditions with equal P values but
the drug and another that did not, one would also influenced noncontingent conditions (in
reason that about one-third of the patients which P = 0). The latter, a robust re-
in the “drug” group would be expected to sult (see Shanks 1 985 a; 1 987; and Shanks,
have headaches if they had not received Holyoak & Mediu, 1 996, for a review) seems
the drug. The drug then would be the sole nonsensical if P had in fact been 0 in the
cause of headaches among the two-thirds input to the reasoner. Furthermore, they
who did not already have headaches caused also found that comparisons between cer-
by other factors. In this subgroup, headaches tain conditions where causal power [as de-
occurred in three-fourths of the patients. fined in Eqs. (7.6) and (7.7)] was constant
One might therefore reason, although P = but P varied showed variations in the di-
1 /2, that the probability the drug will pro- rection of P, as predicted by the RW and
duce headaches is three-fourths. Pearce model.
The initial attempts to test the power PC Many researchers treated Buehner and
theory yielded mixed results. Buehner and Cheng’s (1 997) and similar results (Lober &
Cheng (1 997; see Buehner, Cheng, & Clif- Shanks, 2000) as a given and regarded the
ford,2003 for a more detailed report) var- findings that deviated from the predictions
ied the base rate of e for conditions with of the power PC theory as refutations of
the same value of P using a sequential it. Lober and Shanks (2000) concluded
trial procedure and demonstrated that base that these results fully support RW, even
rate indeed influences the evaluation of pos- though they had to use opposite parameter
orderings of β US and βUS for generative can- these interpretations (see Buehner et al.,
didates, as was the case for preventive candi- 2003 ; Tenenbaum & Griffiths, 2001 ). It
dates, to fit the data. Similarly, Tenenbaum seems plausible that people are capable
and Griffiths (2001 ) concluded that these re- of answering a variety of causal questions.
sults support their Bayesian causal support Moreover, they may be able to do so coher-
model, which evaluates how confident one ently, in which case models of answers to the
is that c causes e. It does so by comparing various questions would be complementary
the posterior probabilities of two causal net- if they are logically consistent.
works, both of which have a background Answers to the various questions (regard-
cause that is constantly present in the learning the same conditions), however, may form
ing context, differing only in that one net- different patterns. Testing the power PC
work has an arrow between c and e. When theory directly requires removing both am-
the posterior probability of the network with biguities. To do so, Buehner et al. (2003 )
the extra arrow is greater than that without adopted a counterfactual question: for ex-
the arrow, then one decides that c causes e. ample, “Imagine 1 00 patients who do not
Otherwise, one decides that c does not cause suffer from headaches. How many would
e. Support is defined as the log of the ratio have headaches if given the medication?”
of the two posterior probabilities. To minimize memory demands, Buehner
et al. presented the trials simultaneously.
They found that causal ratings using the
deviations from normativity counterfactual question and simultaneous
and ambiguous experiments trials were perfectly in line with causal
Buehner et al.’s (2003 ) attempts to test the power as predicted by the power PC the-
qualitative pattern of causal strengths pre- ory. Berry (2003 ) corroborated Buehner et
dicted by the power PC theory illustrate a al.’s findings with a nonfrequentist counter-
modular approach to psychological research. factual question.
This approach attempts to study the mind Buehner et al. (2003 ) explained how the
rather than behavior as it happens to be ambiguity of earlier causal questions can lead
observed. It attempts to isolate the influ- to confounded results that show an influence
ence of a mental process under study, even of P on conditions with identical causal
though tasks in our everyday life typically in- power. However, it cannot account for the
volve confounded contributions from multi- base rate influence on noncontingent con-
ple cognitive processes (e.g., comprehension ditions. But, given the memory demands in
and memory). An analysis of the experimen- typical sequential trial experiments, it is in-
tal materials in Buehner and Cheng (1 997) evitable that some participants would er-
suggests that the deviations from the power roneously misperceive the contingencies to
PC theory are due to factors extraneous to be nonzero, in which case Eqs. (7.6) and
the causal inference process (Buehner et al., (7.7) would predict an influence of base rate.
2003 ). First, the typical dependent variable These equations explain why the mispercep-
used to measure causal judgments is highly tions do not cancel each other out, as one
ambiguous. Participants are typically asked might expect if they were random. Instead,
to indicate how strongly they think c causes for the same absolute amount of misper-
or prevents e. The question may be inter- ception, a positive misperception that oc-
preted to ask how confident one is that c curs at a higher base rate would imply a
causes e, rather than how strongly c causes e. higher generative power, and a negative mis-
Also, it may be interpreted to refer to either perception (leading to a negative causal rat-
the current learning context or a counter- ing) that occurs at a lower base rate would
factual context in which there are no other imply a more negative preventive power. In
causes. both cases, causal ratings for objectively non-
Notably, the distal approach allows for- contingent candidates would increase as base
mulations of coherent answers to each of rate increases. Thus, the base-rate influence
for noncontingent candidates may reflect nation of iterative retrospective revaluation

an interaction between memory and causal (Macho & Burkart, 2002).
reasoning.
Buehner et al. (2003 ) confirmed this in- Iterative Retrospective Revaluation If an
terpretation in two ways. First, when learn- equation in several variables characterizes
ing trials were presented simultaneously, the operation of a system, the equation can
thereby eliminating the possibility of mis- potentially be used flexibly to solve for each
perceiving a zero contingency to be nonzero, variable when given the values of other vari-
participants no longer exhibited a base rate ables, and the solutions would all be logi-
influence in noncontingent conditions. Sec- cally consistent. Evidence suggests that the
ond, they showed that in an experiment equations in the power PC theory are used
involving sequential trials, every judgment this way.
that deviated from 0 was indeed traceable Macho and Burkart (2002, Experiment 2)
to the subject’s misperception of the zero presented trials in two phases: In the first,
contingency. All accurately perceived P of two pairs of candidate causes (TC and CD)
0 was rated as noncausal. Not a single sub- were presented with the outcome e some-
ject did what all nonnormative accounts pre- times occurring, with the same relative fre-
dict – differentially weighing an accurately quency for both combinations; in the sec-
perceived P of 0 to result in a nonzero ond phase, a single disambiguiting candidate,
causal rating. D, was presented. Two experimental groups
In sum, earlier deviations form the power differed only with respect to whether e al-
PC theory’s predictions were the result ways or never occurred with D in the second
of confounding due to comprehension and phase. For these groups, despite the fact that
memory processes. Once these extraneous for both groups T and C were equally absent
problems were curtailed, as motivated by a in the critical second phase, the mean causal
modular approach, causal ratings followed ratings for T were higher than for C in one
exactly the pattern predicted by power PC. group, but lower in the other group. Con-
The complex pattern of results observed sider what one would infer about T and C
cannot be accounted for by any current when D was always accompanied by e in the
associationist model, regardless of how its second phase (without D, e did not occur;
parameters are set. In contrast, the power therefore, D causes e). Holding D constantly
PC theory explains the results without any present, because e occurred less often when
parameters. C was there than when it was not, C prevents
e, and its preventive power can be estimated.
Instantiating Eq. (7.7) for this design, pc is
flexibility and coherence estimable as just mentioned, and P (e | TC)
A general goal of inference is that it is given in phase 1 ; therefore, P (e | T not-
is both flexible and coherent. We men- C), the only unknown in the equation, can
tioned earlier that a distal approach al- be solved. Once this unknown is solved, one
lows a coherent formulation of answers to can next use it to apply Eq. (7.6) to T, which
different questions. These questions may has a positive P: together with the infor-
concern confidence in the existence of a mation that p(e | not-T not-C) = 0 given in
causal relation (Tenenbaum & Griffiths, both phases, a positive generative power of
2001 ); conjunctive causation (Novick & T results. T and C are therefore generative
Cheng, 2004); prediction under a change and preventive, respectively. An analogous
in context, enabling conditions rather than sequence of inference can be made when D
causes (Cheng & Novick, 1 991 ; Gold- is never accompanied by e, resulting in re-
varg & Johnson-Laird, 2001 ); and interven- versed causal powers for T and C (preventive
tions (e.g., Cheng, 1 997; Gopnik et al., 2004; and generative, respectively). Associative
Lagnado & Sloman, 2004; Steyvers et al., models either cannot predict any retrospec-
2003 ). The approach also provides an expla- tive revaluation or erroneously predict that
C and T acquire weights in the same direc- was a three-node network into a four-node
tion in phase 2 for each condition, because network: The amount of TV is controlled
these cues are equally absent in the phase 2 by an external agent, which was not rep-
when their weights are adjusted. resented in the simple three-node network.
These results on iterative revaluations When the amount of TV is manipulated un-
show that the use of Eqs. (7.6) and (7.7) der free will, the external node would oc-
in Cheng’s (1 997) power PC theory is more cur independently of aggression in the causal
flexible than she originally discussed. In her chain structure, because aggression and the
paper, she interpreted the causal power vari- external agent are alternative causes (of con-
ables on the left-hand side (LHS) as the de- sumption of violent TV) in the causal chain
sired unknowns. What Macho and Burkart structure, but not in the common cause
(2002) showed is that, when given the value structure. As mentioned earlier, one is likely
of the variables on the LHS, people are able to assume that alternative causes of an out-
to treat a variable on the right-hand side as come remain constant while that outcome
the desired unknown and solve for it. is manipulated under free will. This assump-
tion, along with the independent occurrence
Intervention. The advantage of interven- condition, together explain why manipula-
tion over observation is most readily ap- tion allows differentiation between the two
preciated when trying to establish which structures.
of several competing hypotheses underlies
a complex data structure. We explained this An Enabling Condition. When asked
abstractly earlier in terms of the likely satis- “What caused the forest fire?” investigators
faction of the independent occurrence con- are unlikely to reply, “The oxygen in the air.”
dition. Let us non consider as an exam- Rather, they are likely to reserve the title of
ple the often reported correlation between cause to to such factors as “lightning,” “ar-
teenage aggression, consumption of violent son,” or the “dryness of the air.” To explain
television or movies, and poor school per- the distinction between causes and enabling
formance. A correlation between these three conditions, a number of theorists argued that
variables could be due to either a common- a causal question invariably implies com-
cause structure: AGGRESSION ← TV → putation within a selected set of events
SCHOOL, where violent television would in which a component cause is constantly
be the cause for both poor school per- present (e.g., Mackie, 1 974). On this view,
formance and increased aggression, or a the forest fire question can be understood as
chain structure: AGGRESSION → TV “What made the difference between this oc-
→ SCHOOL, where increased aggression casion in the forest on which there was a fire
would lead to increased consumption of vi- and other occasions in the forest on which
olent TV, which in turn results in poor there was no fire?” Note that the selected
school performance. Without temporal in- set of events in the expanded question does
formation, these competing causal models not include all events in one’s knowledge
cannot be distinguished by observation lim- base that are related to fire. In particular,
ited to the three-node network alone. How- it does not include events in which oxygen
ever, if one were to intervene on the TV is absent, even though such events (at
node, the two structures make different least in an abstract form) are in a typical
predictions: According to the former, restric- educated person’s knowledge base. The
tions on access to violent TV should lead power PC theory explains the distinction
to both improved school performance and between causes, enabling conditions, and
decreased aggression; according to the lat- irrelevant factors the same way as Cheng
ter, the same restriction would still improve and Novick (1 992) do, except that now
school performance but would have no ef- there is a justification for conditions that
fect on aggressive behavior. Note that the allow causal inference. A varying candidate
intervention on TV effectively turned what cause is a cause if it covaries with the target
effect in the current set of events, the set ticipants could not possibly recruit back-
specified by the expanded question, in ground knowledge (unlike in the smoking/
which other causes and causal-factors are lung cancer example), stimuli varied along
constant. A candidate cause is an enabling two dimensions, color and shape, such that
condition if it is constantly present in the variations could be described at various lev-
current set of events but is a cause ac- els of abstraction (e.g., cool vs. warm col-
cording to another subset of events. Finally, ors, red vs. orange, or particular shades of
a candidate cause is irrelevant if its co- red). Participants in Lien and Cheng’s exper-
variation with the effect is not noticeably iments spontaneously represented the causal
different from 0 in any subset of events relation they learned at the level of abstrac-
that allows causal inference. (See Gold- tion at which P was maximal.
varg & Johnson-Laird, 2001 , for a similar Computing P at an optimal level is con-
explanation.) sistent with an approach to causal learning
that does not begin with well-defined can-
didate causes. In contrast, the current de-
fault assumption in the psychological liter-
Causal Inference and Category ature is that causal discovery depends on
Formation: What Is the Level of the definition of the entities among which
Abstraction at Which ∆P Should relations are to be discovered; categoriza-
Be Computed? tion therefore precedes causal discovery. The
opposite argument can be made, however.
Cheng (1 993 ) noted the problem of the Causal discovery could be the driving force
level of abstraction at which covariations underlying our mental representation of the
should be calculated. Consider the problem world – not only in the sense that we need
of evaluating whether smoking causes lung to know how things influence each other
cancer. The candidate cause “smoking” can but also in the sense that causal relations
be viewed at various levels of abstraction, define what should be considered things in
for instance, “smoking a particular brand of our mental universe (Lewis, 1 929). Lien and
cigarettes” or “inhaling fumes”. If one were Cheng (2000) provided evidence that the
to compute P for smoking with respect to definition of an entity and the discovery of
lung cancer, one would obtain lower values a causal relation operate as a single pro-
for both the narrower and the more abstract cess in which optimal causal discovery is
conceptions of the cause than for “smoking the driving force. Causal discovery therefore
cigarettes.” For example, if one adopted the has direct implications for the formation of
more abstract conception “inhaling fumes,” categories instead of requiring well-defined
P (e | c̄) would remain unchanged, but one candidate causes as givens.
would lower P (e | c) because now other
noncarcinogenic fumes (e.g., steam) contri-
Time and Causal Inference: The Time-
bute to the estimate of this probability. The
Frame of Covariation Assessment
more abstract exception would result in a
smaller overall probability of c to produce e. We have concentrated on theoretical ap-
Causes and effects (like all events, see proaches that specify how humans take the
Vallacher & Wegner, 1 987) can be con- mental leap from covariation to causation.
ceptualized at various levels of abstraction. Irrespective of any differences in theoreti-
Cheng (1 993 ) hypothesized that to evalu- cal perspective, all these approaches assume
ate a causal relation, people represent the covariation can be readily assessed. This as-
relation at the level of abstraction at which sumption is reflected in the experimental
P, with alternative causes held constant, is paradigms most commonly used. Typically,
maximal. Lien and Cheng (2000) showed participants are presented with evidence
that people indeed are sensitive to this idea. structured in the form of discrete, simulta-
In a completely novel situation, where par- neous, or sequential learning trials in which
each trial contains observations on whether Perceptual causality (see beginning of

the cause occurred and whether the effect chapter) refers to the instant impression of
occurred. In other words, in these tasks it is causality that arises from certain stimulus
always perfectly clear whether a cause is fol- displays. The most prominent phenomenon
lowed by an effect on a given occasion. Such is the launching effect. An object A moves
tasks grossly oversimplify the complexities toward a stationary object B until it col-
of causal induction in some situations out- lides with B. Immediately after the colli-
side experimental laboratories: Some events sion, B moves along the same trajectory as A,
have immediate outcomes; others do not re- while A becomes stationary. Nearly all per-
veal their consequences until much later. Be- ceivers report that such displays look as if A
fore an organism can evaluate whether a spe- “launched” B or “made B move” (Michotte,
cific covariation licenses causal conjecture, 1 946/1 963 ; for a recent overview, see Scholl
the covariation needs to be detected and & Tremoulet, 2000). However, if a temporal
parsed in the first place. gap of more than 1 5 0 ms is inserted between
So far, little research effort has been the collision of A and B and the onset of
directed toward this problem. The scarce B’s motion, the impression of causality dis-
evidence that exists comes from two very appears and observers report two distinct,
different theoretical approaches. One is as- unrelated motions.
sociative learning, and the other is per- From a computational perspective, it is
ception of causality. Using an instrumen- easy to see why delays would produce decre-
tal learning paradigm, Shanks, Pearson, and ments in causal reasoning performance.
Dickinson (1 989) instructed participants to Contiguous event pairings are less demand-
monitor whether pressing a key caused a tri- ing on attention and memory. They are also
angle to light up on a computer screen. The much easier to parse. When there is a tem-
apparatus was programmed to illuminate the poral delay and there are no constraints on
triangle 75 % of the time the key was pressed how the potential causes and effects are
and never when the key was not pressed. bundled, as in Shanks et al. (1 989), the basic
However, participants were also told that question on which contingency depends no
sometimes the triangle might “light up on its longer has a clear answer: Should this par-
own.” This actually never happened in any ticular instance of e be classified as occur-
of the experimental conditions but only in a ring in the presence of c or in its absence?
set of yoked control conditions during which Each possible value of temporal lag results in
the apparatus played back an outcome pat- a different value of contingency. The prob-
tern produced in the previous experimental lem is analogous to that of the possible lev-
condition. In other words, in these control els of abstractions of the candidate causes
conditions, participants’ key presses were and the effects at which to evaluate contin-
without any consequences whatsoever. Par- gency (and may have an analogous solution).
ticipants could distinguish reliably between Moreover, for a given e, when alternative in-
experimental and control conditions (i.e., tervening events occur, the number of hy-
they noticed whether their key presses were potheses to be considered multiply. The re-
causally effective). However, when Shanks sult is a harder, more complex inferential
et al. inserted a delay between pressing the problem – one with a larger search space.
key and the triangle’s illumination, the dis- One might think that keeping track of out-
tinction became considerably harder. In fact, come rates and changes in these rates condi-
when the delay was longer than 2 seconds, tional on the presence and absence of other
participants could no longer distinguish events would solve the problem (Gallistel &
between causal and noncausal conditions, Gibbon, 2000). Measuring outcome rates,
even though their key presses were still ef- however, would not help in Shanks et al.’s
fective 75 % of the time. Shanks et al. inter- (1 989) situation. Unless there are additional
preted this finding as supporting an associa- constraints (e.g., discrete entities in which c
tive account of causal judgment. may or may not occur at any moment, but
once it occurs for an entity, c can be consid- ability of the observations resulting from a
ered “present” for that entity, even when it hypothesis. Both Shanks et al.’s (1 989) and
is no longer occurring), the parsing problem Michotte’s (1 946/1 963 ) findings are consis-
remains, as does the proliferation of candi- tent with Einhorn and Hogarth’s (1 986) hy-
date causes that precede an outcome. pothesis. However, these findings cannot be
Until now, we have focused on situations cited as unequivocally demonstrating that
in which there is no prior causal knowl- adults use prior causal knowledge as a ba-
edge. We digress here to discuss a case in sis for event parsing because the inductive
which there is such knowledge. When the problem gets increasingly difficult as the de-
search space is large, constraints provided lay increases, and an account based on prob-
by prior knowledge of types of causal re- lem difficulty alone would predict the same
lations become increasingly important. As- qualitative pattern of results.
sessing maximal covariation among the set Hagmayer and Waldmann (2002) showed
of hypotheses may be impractical given that people use prior knowledge of tempo-
the large search space, or at least ineffi- ral intervals in causal relations to classify ev-
cient given the existence of prior knowl- idence about the presence and absence of c
edge. When there is prior knowledge, why and e in continuous time accordingly. Par-
not use it? Some evidence suggests, however, ticipants in their Experiment 1 were pre-
that children are unable to integrate prior sented with longitudinal information con-
temporal knowledge with frequency obser- cerning the occurrence of mosquito plagues
vations. Schlottmann (1 999) showed that over a 20-year period in two adjacent com-
5 - to 7-year-old children, although able to munities. They were told that one com-
learn about and understand delayed causal munity relied on insecticides, whereas the
mechanisms perfectly, when presented with other employed biological means (planting
a choice between a delayed and immedi- a flower that mosquito larvae-eating beetles
ate cause, always preferred the immediate, need to breed). Although the instructions
contiguous cue, even when they explicitly never mentioned the time frame of the
knew that the causal relation in question in- causal mechanisms in question explic-
volved a delay. Schlottmann interpreted her itly, Hagmayer and Waldmann assumed
findings to indicate that temporal contigu- the insecticide instructions would create
ity is a powerful cue to causality. Because expectations of immediate causal agency,
young children fail to integrate two kinds of whereas mentioning the biological mecha-
evidence (knowledge of a delayed mechanism would create expectation of a delay.
nism and contingency evaluated at the hy- Data were presented in tabular form show-
pothesized delay), they discard the knowl- ing for each of the 20 years whether the in-
edge cue and focus exclusively on temporal tervention had taken place (insecticide de-
contiguity. livered, plants planted) and whether there
Adult reasoners, in contrast, can most was a plague in that year. The data were con-
likely integrate the two kinds of evidence. structed to yield a moderately negative con-
If the reasoner anticipates that a causal re- tingency between intervention and plague
lation might involve a delay, its discovery when considered within the same year but a
and assessment should be considerably eas- positive contingency when considered over
ier. According to Einhorn and Hogarth’s a 1 -year delay. Participants’ evaluation of the
(1 986) knowledge mediation hypothesis, peo- same covariational data varied as a function
ple make use of their prior causal knowledge of the instructions in line with a knowledge-
about the expected length of the delay to mediation account. These results illustrate
reduce the complexity of the inference prob- that people in principle can and do use tem-
lem. They focus on the expected delay for poral knowledge to structure evidence into
a type of causal relation and evaluate ob- meaningful units.
servations with respect to it. In Bayesian Buehner and May (2002, 2003 , 2004) fur-
terms, they evaluate likelihoods, the prob- ther showed that adults are able to reduce
the otherwise detrimental influence of delay posed a model of this process and began to
on causal relations elapsing in real time also evaluate it. The assessment of confidence,
by making use of expectations about the unlike causal strength, can give rise to the
time frame of the causal relation in question. observed gradual acquisition curves.
Buehner and May instructed participants at r How do people evaluate how much an
the outset of the experiment about potential outcome that is known to have occurred
delays. They did this in a number of ways and is attributable to a candidate cause (see
found that both explicit and implicit instruc- Ellsworth, Chap. 28; Spellman, 2000)?
tions about potential delays improved the This issue is important in legal decision
assessment of delayed causal relationships. making. Can a causal power approach
The use of prior temporal knowledge raises overcome the difficulty in cases involving
the question of how that knowledge might overdetermination?
have been acquired. Causal discovery with- r What determines the formation of the
out prior temporal knowledge may be diffi- categories? Does it matter whether the
cult (e.g., longitudinal studies are expensive, variables are linked by a causal relation?
even though they have more constraints for What demarcates an event given that
limiting the search space than in Shanks et events occur in continuous time? Does a
al.’s situation), but it is possible given com- new category form in parallel as a new
putational resources. causal relation is inferred?
r What determines the level of abstraction
at which a causal relation is inferred?
Summary and Future Directions
What determines the choice of the tem-
poral interval between a cause and an ef-
Our chapter has taken a computational perfect for probabilistic causal relations?
spective – in particular, one of construct- r Do people make use of prior causal
ing an artificial intelligence system capable
knowledge in a Bayesian way (Tenen-
of causal learning given the types of non-
baum & Griffiths, 2002)? Are various
causal observations available to the system.
kinds of prior causal knowledge (e.g.,
We have reviewed arguments and empirical
temporal, mechanistic) integrated with
results showing that an approach that inter-
current information in the same way?
prets observable events in terms of a hypo-
What role, if any, does coherence play?
thetical causal framework explains why co-
All models of causal learning in principle
variation need not imply causation and how
allow the use of prior causal knowledge,
one can go beyond predicting future obser-
regardless of whether they are Bayesian.
vations to predicting the consequences of
If a comparison among these models in-
interventions. An additional appeal of this
volves a situation in which the reasoner
approach is that it allows one to address
has prior knowledge, then the default as-
multiple research questions within a coher-
sumption would be to equate the input
ent framework. We compared this frame-
to the models, for example, by supply-
work with an associative framework in our
ing the data on which prior causal knowl-
review of previous theoretical and empirical
edge is based in the input supplied to
research, which focused on the estimation
the non-Bayesian models. They would
of causal strength. There are many other in-
not be alternative models with respect
teresting causal questions that remain to be
to the last two questions, for example.
addressed under this framework. Some of
It seems to us that including the use of
these are
prior knowledge would not make a differ-
r How do people evaluate their confi- ence at Marr’s computational level with
dence in whether a causal relation exists? respect to the issue of what is computed
Tenenbaum and Griffiths (2001 ) pro- in the process but would concern issues of
representation and algorithm. In Bayesian the nineteenth annual conference of the Cogni-
models, there is explicit representative Science Society (pp. 5 5 –60). Hillsdale, NJ:
tion of the prior probability of a causal Erlbaum.
hypothesis. Buehner, M. J., Cheng, P. W., & Clifford, D.
r Are people able to make use of patterns (2003 ). From covariation to causation: A test of
the assumption of causal power. Journal of Ex-
of conditional independence as Bayesian
perimental Psychology: Learning, Memory, and
network models do (Gopnik et al., 2004)
Cognition, 2 9(6), 1 1 1 9–1 1 40.
to infer entire causal networks, rather
Buehner, M. J., & May, J. (2002). Knowledge me-
than infer individual causal relations link
diates the timeframe of covariation assessment
by link as assumed by most current asso- in human causal induction. Thinking and Rea-
ciative and causal accounts? soning, 8(4), 269–295 .
Buehner, M. J., & May, J. (2003 ). Rethink-
Acknowledgments ing temporal contiguity and the judgment of
causality: Effects of prior knowledge, experi-
ence, and reinforcement procedure. Quarterly
Preparation of this chapter was supported Journal of Experimental Psychology Section A –
by grant MH6481 0 from the National In- Human Experimental Psychology, 5 6A(5 ), 865 –
stitute of Mental Health to Cheng. We also 890.
thank Steve Sloman for detailed and help- Buehner, M. J., & May, J. (2004). Abolishing the
ful comments on an earlier version of this effect of reinforcement delay on human causal
chapter. learning. Quarterly Journal of Experimental Psy-
chology Section B – Comparative and Physiolog-
ical Psychology, 5 7B(2), 1 79–1 91 .
Note Bullock, M., Gelman, R., & Baillargeon, R.
(1 982). The development of causal reasoning.
1 . Ulrike Hahn provided this interpretation. In W. J. Friedman (Ed.), The developmental psy-
chology of time (pp. 209–25 4). New York: Aca-
demic Press.
References Chapman, G. B., & Robbins, S. J. (1 990). Cue
interaction in human contingency judgment.
Ahn, W.-K., Kalish, C. W., Medin, D. L., & Memory and Cognition, 1 8(5 ), 5 3 7–5 45 .
Gelman, S. A. (1 995 ). The role of covariation Cheng, P. W. (1 993 ). Separating causal laws from
vs. mechanism information in causal attribu- casual facts: Pressing the limits of statistical rel-
tion. Cognition, 5 4, 299–3 5 2. evance. In D. L. Medin (Ed.), The psychology of
Allan, L. G., & Jenkins, H. M. (1 980). The judg- learning and motivation. Advances in research
ment of contingency and the nature of response and theory (Vol. 3 0, pp. 21 5 –264). San Diego,
alternatives. Canadian Journal of Psychology, CA: Academic Press.
3 4(1 ), 1 –1 1 . Cheng, P. W. (1 997). From covariation to
Anderson, J. R. (1 990). The adaptive character of causation: A causal power theory. Psychological
thought. Hillsdale, NJ: Erlbaum. Review, 1 04(2), 3 67–405 .
Anderson, J. R., & Sheu, C. F. (1 995 ). Causal in- Cheng, P. W. (2000). Causality in the mind:
ferences as perceptual judgments. Memory and Estimating contextual and conjunctive causal
Cognition, 2 3 (4), 5 1 0–5 24. power. In F. Keil & R. Wilson (Eds.), Cognition
Berry, C. J. (2003 ). Conformity to the power PC the- and explanation (pp. 227–25 3 ). Cambridge,
ory of causal induction: The influence of a counter- MA: MIT Press.
factual probe. London, UK: University College Cheng, P. W., & Novick, L. R. (1 991 ). Causes
London. versus enabling conditions. Cognition, 40, 83 –
Buehner, M. J., & Cheng, P. W. (1 997). 1 20.
Causal induction: The power PC theory ver- Cheng, P. W., & Novick, L. R. (1 992). Covaria-
sus the Rescorla–Wagner model. In M. G. tion in natural causal induction. Psychological
Shafto & P. Langley (Eds.), Proceedings of Review, 99(2), 3 65 –3 82.
Danks, D. (2003 ). Equilibria of the Rescorla– Holyoak, K. J., & Hummel, J. E. (2000). The
Wagner model. Journal of Mathematical Psy- proper treatment of symbols in a connec-
chology, 47(2), 1 09–1 21 . tionist architecture. In E. Dietrich & A.
De Houwer, J. (2002). Forward blocking depends Markman (Eds.), Cognitive dynamics: Concep-
on retrospective inferences about the presence tual change in humans and machines (pp. 229–
of the blocked cue during the elemental phase. 263 ). Mahwah, NJ: Erlbaum.
Memory and Cognition, 3 0(1 ), 24–3 3 . Hume, D. (1 73 9/1 888). A treatise of human na-
De Houwer, J., & Beckers, T. (2002). A review ture. In L. A. Selby-Bigge (Ed.), Hume’s treatise
of recent developments in research and theo- of human nature. Oxford, UK: Clarendon Press.
ries on human contingency learning. Quarterly Jenkins, H., & Ward, W. (1 965 ). Judgment of con-
Journal of Experimental Psychology: Compara- tingencies between responses and outcomes.
tive and Physiological Psychology, 5 5 B(4), 289– Psychological Monographs, 7, 1 –1 7.
3 1 0. Kamin, L. J. (1 969). Predictability, surprise, at-
Dennis, M. J., & Ahn, W.-K. (2001 ). Primacy in tention and conditioning. In B. A. Campbell
causal strength judgments: The effect of initial & R. M. Church (Eds.), Punishment and aver-
evidence for generative versus inhibitory rela- sive behavior. New York: Appleton-Century-
tionships. Memory and Cognition, 2 9(1 ), 1 5 2– Crofts.
1 64. Kant, I. (1 781 /1 965 ). Critique of pure reason.
Dickinson, A. (2001 ). Causal learning: An as- London: Macmillan.
sociative analysis. Quarterly Journal of Experi- Katz, L. (1 989). Bad acts and guilty minds.
mental Psychology Section B – Comparative and Chicago: The University of Chicago Press.
Physiological Psychology, 5 4(1 ), 3 –25 . Kelley, H. H. (1 973 ). The processes of causal at-
Dickinson, A., & Burke, J. (1 996). Within- tribution. American Psychologist, 2 8(2), 1 07–
compound associations mediate the retro- 1 28.
spective revaluation of causality judgements. Lagnado, D., & Sloman, S. (2004). The advan-
Quarterly Journal of Experimental Psychology: tage of timely intervention. Journal of Experi-
Comparative and Physiological Psychology, mental Psychology: Learning, Memory and Cog-
49B(1 ), 60–80. nition, 3 0(4), 85 6–876.
Einhorn, H. J., & Hogarth, R. M. (1 986). Judging Larkin, M. J. W., Aitken, M. R. F., & Dickinson,
probable cause. Psychological Bulletin, 99(1 ), A. (1 998). Retrospective revaluation of causal
3 –1 9. judgments under positive and negative con-
Gallistel, C. R. (1 990). The organization of learn- tingencies. Journal of Experimental Psychology:
ing. Cambridge, MA: MIT Press. Learning, Memory, and Cognition, 2 4(6), 1 3 3 1 –
Gallistel, C. R., & Gibbon, J. (2000). Time, rate, 1 3 5 2.
and conditioning. Psychological Review, 1 07(2), Lewis, C. I. (1 929). Mind and the world order. New
289–3 44. York: Scribner.
Goldvarg, Y., & Johnson-Laird, P. N. (2001 ). Lien, Y. W., & Cheng, P. W. (2000). Distin-
Naive causality: A mental model theory of guishing genuine from spurious causes: A
causal meaning and reasoning. Cognitive Sci- coherence hypothesis. Cognitive Psychology,
ence, 2 5 , 5 65 –61 0. 40(2), 87–1 3 7.
Gopnik, A., Glymour, C., Sobel, D. M., Schulz, Lober, K., & Shanks, D. R. (2000). Is causal induc-
L. E., Kushnir, T., & Danks, D. (2004). A the- tion based on causal power? Critique of Cheng
ory of causal learning in children: Causal maps (1 997). Psychological Review, 1 07(1 ), 1 95 –21 2.
and Bayes nets. Psychological Review, 111(1 ), 3 – Lovibond, P. F., Been, S. L., Mitchell, C. J., Bou-
3 2. ton, M. E., & Frohardt, R. (2003 ). Forward and
Hagmayer, Y., & Waldmann, M. R. (2002). How backward blocking of causal judgment is en-
temporal assumptions influence causal judg- hanced by additivity of effect magnitude. Mem-
ments. Memory & Cognition, 3 0(7), 1 1 28–1 1 3 7. ory and Cognition, 3 1 (1 ), 1 3 3 –1 42.
Holland, J. H., Holyoak, K. J., Nisbett, R. N., & Macho, S., & Burkart, J. (2002). Recursive retro-
Thagard, P. (1 986). Induction: Processes of infer- spective revaluation of causal judgments. Jour-
ence, learning, and discovery. Cambridge, MA: nal of Experimental Psychology: Learning, Mem-
MIT Press. ory, and Cognition, 2 8(6), 1 1 71 –1 1 86.
Mackie, J. L. (1 974). The cement of the universe: tioning. Journal of Comparative and Physiologi-
A study on causation. Oxford, UK: Clarendon cal Psychology, 66, 1 –5 .
Press. Rescorla, R. A. (1 988). Pavlovian condition-
Mandel, D. R., & Lehman, D. R. (1 998). Integra- ing: It’s not what you think it is. American
tion of contingency information in judgments Psychologist.
of cause, covariation, and probability. Journal of Rescorla, R. A., & Wagner, A. R. (1 972). A the-
Experimental Psychology: General, 1 2 7(3 ), 269– ory of Pavlovian conditioning: Variations in
285 . the effectiveness of reinforcement and nonre-
Matute, H., Arcediano, F., & Miller, R. R. (1 996). inforcement. In A. H. Black & W. F. Prokasy
Test question modulates cue competition be- (Eds.), Classical conditioning II: Current theory
tween causes and between effects. Journal and research (pp. 64–99). New York: Appleton-
of Experimental Psychology: Learning, Memory, Century-Crofts.
and Cognition, 2 2 (1 ), 1 82–1 96. Rosch, E. (1 978). Principles of categorization. In
Michotte, A. E. (1 946/1 963 ). The perception of E. Rosch & B. Lloyd (Eds.), Cognition and cate-
causality (T. R. Miles, Trans.). London, UK: gorization (pp. 27–48). Hillsdale, NJ: Erlbaum.
Methuen & Co. Savastano, H. I., & Miller, R. R. (1 998). Time as
Miller, R. R., & Barnet, R. C. (1 993 ). The role of content in Pavlovian conditioning. Behavioural
time in elementary associations. Current Direc- Processes, 44(2), 1 47–1 62.
tions in Psychological Science, 2 (4), 1 06–1 1 1 . Schlottmann, A. (1 999). Seeing it happen and
Miller, R. R., Barnet, R. C., & Grahame, N. J. knowing how it works: How children under-
(1 995 ). Assessment of the Rescorla–Wagner stand the relation between perceptual causal-
model. Psychological Bulletin, 1 1 7, 3 63 –3 86. ity and underlying mechanism. Developmental
Miller, R. R., & Matute, H. (1 996). Animal ana- Psychology, 3 5 (5 ), 3 03 –3 1 7.
logues of causal judgment. In D. R. Shanks, Scholl, B. J., & Nakayama, K. (2002). Causal
K. J. Holyoak, & D. L. Medin (Eds.), The psy- capture: Contextual effects on the perception
chology of learning and motivation – Causal of collision events. Psychological Science, 1 3 (6),
learning, Vol. 3 4: (pp. 1 3 3 –1 66). San Diego, 493 –498.
CA: Academic Press. Scholl, B. J., & Tremoulet, P. D. (2000). Percep-
Novick, L. R., & Cheng, P. W. (2004). Assessing tual causality and animacy. Trends in Cognitive
interactive causal influence. Psychological Re- Sciences, 4(8), 299–3 09.
view, 1 1 1 (2), 45 5 –485 .
Schustack, M. W., & Sternberg, R. J. (1 981 ). Eval-
Pearce, J. M. (1 987). A model for stimulus gener- uation of evidence in causal inference. Journal
alization in Pavlovian conditioning. Psychologi- of Experimental Psychology: General, 1 1 0, 1 01 –
cal Review, 94(1 ), 61 –73 . 1 20.
Pearl, J. (1 988). Probabilistic reasoning in intelligent Shanks, D. R. (1 985 a). Continuous monitor-
systems. San Mateo, CA: Morgan Kaufmann. ing of human contingency judgment across
Pearl, J. (2000). Causality: Models, reasoning, and trials. Memory and Cognition, 1 3 (2), 1 5 8–
inference. Cambridge, UK: Cambridge Univer- 1 67.
sity Press. Shanks, D. R. (1 985 b). Forward and backward
Pearson, K. (1 892/1 95 7). The Grammar of Science. blocking in human contingency judgement.
New York: Meridian Books. Quarterly Journal of Experimental Psychology:
Perales, J. C., & Shanks, D. R. (2003 ). Norma- Comparative and Physiological Psychology,
tive and descriptive accounts of the influence 3 7B(1 ), 1 –21 .
of power and contingency on causal judgments. Shanks, D. R. (1 987). Acquisition functions in
Quarterly Journal of Experimental Psychology contingency judgment. Learning and Motiva-
Section A – Human Experimental Psychology, tion, 1 8(2), 1 47–1 66.
5 6A(6), 977–1 007. Shanks, D. R., & Dickinson, A. (1 987). Associa-
Reichenbach, H. (1 95 6). The direction of time. tive accounts of causality judgment. In G. H.
Berkeley & Los Angeles: University of Califor- Bower (Ed.), The Psychology of learning and
nia Press. motivation – Advances in research and theory
Rescorla, R. A. (1 968). Probability of shock in (Vol. 21 , pp. 229–261 ). San Diego, CA: Aca-
the presence and absence of CS in fear condi- demic Press.
Shanks, D. R., Holyoak, K. J., & Media, D. L. elements. Learning and Motivation, 2 5 (2), 1 27–
(Eds.) (1 996). The Psychology of Learning and 151.
Motivation – Causal learning (Vol. 3 4) San Waldmann, M. R. (2000). Competition among
Diego, CA: Academic Press. causes but not effects in predictive and diagnos-
Shanks, D. R., & Lopez, F. J. (1 996). Causal order tic learning. Journal of Experimental Psychology:
does not affect cue selection in human asso- Learning, Memory, and Cognition, 2 6(1 ), 5 3 –76.
ciative learning. Memory and Cognition, 2 4(4), Waldmann, M. R. (2001 ). Predictive versus diag-
5 1 1 –5 22. nostic causal learning: Evidence from an over-
Shanks, D. R., Pearson, S. M., & Dickinson, shadowing paradigm. Psychonomic Bulletin and
A. (1 989). Temporal contiguity and the judg- Review, 8, 600–608.
ment of causality by human subjects. Quar- Waldmann, M. R., & Holyoak, K. J. (1 992). Pre-
terly Journal of Experimental Psychology Section dictive and diagnostic learning within causal
B – Comparative and Physiological Psychology, models: Asymmetries in cue competition.
41(2), 1 3 9–1 5 9. Journal of Experimental Psychology: General,
Shultz, T. R. (1 982). Rules of causal attribution. 1 2 1 (2), 222–23 6.
Monographs of the Society for Research in Child Waldmann, M. R., & Holyoak, K. J. (1 997). De-
Development, 47(1 ), 1 –5 1 . termining whether causal order affects cue se-
Spellman, B., & Kincannon, A. (2001 ). The re- lection in human contingency learning: Com-
lation between counterfactual (“But for”) and ments on Shanks and Lopez (1 996). Memory
causal reasoning: Experimental findings and and Cognition, 2 5 (1 ), 1 25 –1 3 4.
implications for jurors’ decisions. Law and con- Wasserman, E. A., Elek, S. M., Chatlosh, D. L., &
temporary problems, 64, 241 –264. Baker, A. G. (1 993 ). Rating causal relations:
Spirtes, P., Glymour, C., & Scheines, R. (1 993 / Role of probability in judgments of response-
2000). Causation, prediction and search (2nd outcome contingecy. Journal of Experimental
ed.). Boston, MA: MIT Press. Psychology:Learning, Memory, and Cognition,
Steyvers, M., Tenenbaum, J. B., Wagenmakers, 1 9, 1 74–1 88.
E-J., & Blum, B. (2003 ). Inferring causal net- White, P. A. (1 995 ). Use of prior beliefs in the as-
works from observations and interventions. signment of causal roles: Causal powers versus
Cognitive Science, 2 7, 45 3 –489. regularity-based accounts. Memory and Cogni-
Tenenbaum, J. B., & Griffiths, T. L. (2001 ). tion, 2 3 (2), 243 –25 4.
Structure learning in human causal induction. White, P. A. (2002). Causal attribution from co-
In T. K. Leen, T. G. Dietterich, & V. Tresp variation information: The evidential evalua-
(Eds.), Advances in neural processing systems tion model. European Journal of Social Psychol-
(Vol. 1 3 , pp. 5 9–65 ). Cambridge, MA: MIT ogy, 3 2 (5 ), 667–684.
Press. Woodward, J. (2003 ). Making things happen: A
Thagard, P. (1 989). Explanatory coherence. Be- theory of causal explanation. Oxford, UK: Ox-
havioral and Brain Sciences, 1 2 , 43 5 –467. ford University Press.
Thagard, P. (2000). Explaining disease: Corre- Wright, R. W. (1 985 ). Causation in tort law. Cal-
lations, causes, and mechanisms. In F. Keil ifornia Law Review, 73 , 1 73 5 –1 828.
& R. Wilson (Eds.), Cognition and explana- Wu, M., & Cheng, P. W. (1 999). Why causation
tion (pp. 227–25 3 ). Cambridge, MA: MIT need not follow from statistical association:
Press. Boundary conditions for the evaluation of gen-
Vallacher, R. R., & Wegner, D. M. (1 987). What erative and preventive causal powers. Psycho-
do people think they’re doing? Action identi- logical Science, 1 0(2), 92–97.
fication and human behavior. Psychological Re- Zimmerhart & Rescorla (1 974): Zimmer-Hart,
view, 94(1 ), 3 –1 5 . C. L., & Rescorla, R. A. (1 974). Extinction
Van Hamme, L. J., & Wasserman, E. A. (1 994). of Pavlovian conditioned inhibition. Journal of
Cue competition in causality judgments: The Comparative and Physiological Psychology, 86,
role of nonpresentation of compound stimulus 83 7–845 .
CHAPTER 8
Deductive Reasoning
Jonathan St. B. T. Evans
The study of deductive reasoning has been a sion will also be true. Consider a syllogism
major field of cognitive psychology for the (an old form of logic devised by Aristotle)
past 40 years or so (Evans, 2002; Evans, with the following form:
Newstead, & Byrne, 1 993 ; Manktelow,
All C are B.
1 999). The field has its origins in philosophy,
within the ancient discipline of logic, and No A are B.
reflects the once influential view known as Therefore, no A are C.
logicism in which logic is proposed to be the This is valid argument and will remain so no
basis for rational human thinking. This view matter what terms we substitute for A, B,
was prevalent in the 1 960s when psycho- and C. For example,
logical study of deductive reasoning became
an established field in psychology, espe- All frogs are reptiles.
cially reflecting the theories of the great de- No cats are reptiles.
velopmental psychologist Jean Piaget (e.g., Therefore, no cats are frogs.
Inhelder & Piaget, 1 95 8). Logicism was
also influentially promoted to psychologists has two true premises and a true conclusion.
studying reasoning in a famous paper by Unfortunately, the argument is equally valid
Henle (1 962). At this time, rationality was if we substitute terms as follows:
clearly tied to logicality. All frogs are mammals.
So what exactly is deductive logic? (See No cats are mammals.
Sloman & Lagnado, Chap. 5 , for a contrast
Therefore, no cats are frogs.
with induction.) As a model for human rea-
soning, it has one great strength but several A valid argument can allow a true conclu-
serious weaknesses. The strength is that an sion to be drawn from false premises, as pre-
argument deemed valid in logic guarantees viously, which would make it seem a non-
that if the premises are true, then the conclu- sense to most ordinary people (that is, not
1 69
logicians). This is one weakness of logic in No C are B. (1 ) No C are B. (2)

describing everyday reasoning, but there are Some A are B. Some B are A.
others. The main limitation is that deduc- Therefore, Therefore,
tive reasoning does not allow you to learn some A are some C are
anything new at all because all logical argu- not C. not A.
ment depends on assumptions or supposi-
tions. At best, deduction may enable you to Although these arguments look very simi-
draw out conclusions that were only implicit lar, (1 ) is logically valid and (2) is invalid.
in your beliefs, but it cannot add to those Like most invalid arguments, the conclusion
beliefs. There are also severe limitations to (2) is possible given the premises, but not
in applying logic to real world arguments necessary. Hence, it is a fallacy. Here is a case
where premises are uncertain and conclu- in which a syllogism in form (2) seems per-
sions may be made provisionally and later suasive because it has true premises and a
withdrawn (Evans & Over, 1 996; Oaksford & true conclusion:
Chater, 1 998). No voters are under 1 8 years of age.
Although these limitations are nowa- Some film stars are under 1 8 years of age.
days widely recognized, the ability of peo-
Therefore, some voters are not film stars.
ple to reason logically (or the lack of it)
was considered an important enough is- However, we can easily construct a coun-
sue in the past for the use of the deduc- terexample case. A counterexample proves
tion paradigm to become well established. an argument to be invalid by showing that
The standard paradigm consists of giving you could have true premises but a false con-
people premises and asking them to draw clusion, such as
conclusions. There are two key instructions
No bees are carnivores.
that make this a deductive reasoning task.
First, people must be told to assume the Some animals are carnivores.
premises are true and (usually) are told to Therefore, some bees are not animals.
base their reasoning only on these premises. Evans et al. (1 999) actually gave partici-
Second, they must only draw or endorse pants all 64 possible combinations of syllo-
a conclusion that necessarily follows from gistic premises and asked them to decide in
the premises. one group whether each of the four possible
An example of a large deductive reason- conclusions followed necessarily from these
ing study was that more recently reported by premises in line with standard deductive rea-
Evans, Handley, Harper, and Johnson-Laird soning instructions (in this study, all problem
(1 999) using syllogistic reasoning. Syllogisms materials were abstract, using capital letters
have four kinds of statement as follows: for the terms). A relatively small number of
Universal All A are B. syllogisms have necessary (valid) conclusions
Particular Some A are B. or impossible (determinately false) conclu-
Negative universal No A are B. sions. Most participants accepted the former
Negative particular Some A are not B. and rejected the latter in accord with logic.
The interesting cases are the potential falla-
Because a syllogism comprises two premises cies like (2), where the conclusion could be
and a conclusion, there are 64 possible moods true but does not have to be. In accordance
in which each of the three statements can with previous research, Evans et al. found
take each of the four forms. In addition, that fallacies were frequently endorsed, al-
there are four figures produced by chang- though with an interesting qualification to
ing the order of reference to the three linked which we return. They ran a second group
terms, A, B, and C, making 25 6 logically who were instructed to endorse conclusions
distinct syllogisms. For example, the fol- that could be true (that is possible) given
lowing syllogisms have the same mood but their premises. The results suggested that
different figures: ordinary people have a poor understanding
deductive reasoning 1 71
of logical necessity. Possibility instructions where most rules – such as conditional state-
should have selectively increased acceptance ments – do have exceptions.
of conclusions normally marked as fallacies. Some more complex rules involve suppo-
In fact, participants in the possibility groups sitions. In suppositional reasoning, you add
accepted conclusions of all kinds more fre- a temporary assumption to those given that
quently, regardless of the logical argument. is later deleted. An example is conditional
proof (CP), which states that if by assum-
ing p you can derive q, then it follows that
Rule- Versus Model-Based Accounts if p then q, a conclusion that no longer de-
of Reasoning pends on the assumption of p. Suppose the
following information is given:
Logical systems can be described using a syn- If the car is green, then it has four-wheel
tactic or semantic approach, and psycholog- drive.
ical theories of deductive reasoning can be The car has either four-wheel drive or
similarly divided. In the syntactic approach, power steering, but not both.
reasoning is described using a set of abstract
inference rules that can be applied in se- What can you conclude? If you make the
quence. The approach is algebraic in that supposition that the car is in fact green, then
one must start by recovering the logical form you can draw the conclusion, in two steps,
of an argument and discarding the particu- that it does not have power steering. Now
lar content or context in which it is framed. you do not know if the car is actually green,
In standard propositional logic, for example, but the CP rule allows you to draw the con-
several inference rules are applied to conclusion, “If the car is green then it does not
ditional statements of the form if p then q. have power steering.”
These rules can be derived from first prin- Some philosophers described inference
ciples of the logic and provide a short-cut rule systems as “natural logics,” reflecting the
method of deductive reasoning. Here are idea that ordinary people reason by apply-
some examples: ing such rules. This has been developed by
modern psychologists into sophisticated psy-
Modus Ponens (MP) Modus Tollens (MT) chological theories of rule-based reasoning,
If p then q If p then q often described as “mental logics.” The best-
p not-q developed systems are those of Rips (1 994)
Therefore q Therefore, not-p and Braine and O’Brien (1 998). According
to these accounts, people reason by abstract-
For example, suppose we know that “if ing the underlying logical structure of argu-
the switch is down then the light is on.” If ments and then applying inference rules. Di-
I notice that the switch is down, then I can rect rules of inferences, such as MP, are ap-
obviously deduce that the light is on (MP). plied immediately and effortlessly. Indirect,
If I see that the light is off, I can also validly suppositional rules such as CP are more dif-
infer that the switch is not down (MT). One ficult and error prone. Although MT is in-
of the difficulties with testing people’s logi- cluded as a standard rule in propositional
cal ability with such arguments, however, is logic, mental logicians do not include this
that they can easily imagine counterexample as a direct rule of inference for the simple
cases that block such valid inferences (Evans reason that people find it difficult. Here is
et al., 1 993 ). For example, if the light bulb an MT argument:
has burned out, neither MP not MT will de-
liver a true conclusion. That is why the in- If the card has an A on the left, then it has
struction to assume the truth of the premises a 3 on the right.
should be part of the deduction experiment. The card does not have a 3 on the right.
It also shows why deductive logic may have Therefore, the card does not have an A on
limited application in real world reasoning, the left.
Table 8.1 . Truth Table Analysis ordinary conditional of everyday discourse

First premise Second Conclusion
could be a material conditional (Edgington,
Possibility if A then 3 premise not-3 not-A 1 995 ; Evans, Handley, & Over, 2003 ; Evans
& Over, 2004). However, this distinction
A, 3 True False False does not affect the validity of the arguments
A, not-3 False True False discussed here. In the previous example, be-
Not-A, 3 True False True cause there is no case in which true premises
Not-A, True True True
can lead to a false conclusion, the argu-
not-3
ment is valid. Let us contrast this with one
of the classical fallacies of conditional rea-
Whereas MP is made nearly 1 00% of the soning known as affirmation of the conse-
time with such abstract materials, MT rates quent (AC). Suppose we are tempted to ar-
are quite variable but typically around 70% gue from the previous conditional that if the
to 75 % (Evans et al., 1 993 ). Mental logicians letter on the right is known to be a 3 , then
therefore propose that it depends on an indi- the letter on the left must be an A. See
rect suppositional rule known as reductio ad Table 8.2 for the truth table.
absurdum (RAA). This rule states that if a The analysis exposes the argument as a
supposition leads to a contradiction, then the fallacy because there is a state of affairs –
negation of the supposition is a valid conclu- a card that does not have an A on the left
sion. With the previous, we make the sup- but has a 3 on the right – in which the
position that the card has an A on the left. premises would both be true but the con-
Hence, it follows that there is a 3 on the clusion false.
right (MP). However, we are told that there Just as the mental logic approaches do not
is not a 3 on the right, which gives us a con- simply adopt the inference rules of standard
tradiction. Contradictions are not logically logic to account for human reasoning, so the
possible, and so the supposition from which mental models approach does not endorse
it followed must be false. Hence, the conclu- truth table analysis either (Johnson-Laird,
sion given must be true. Byrne, 1 991 ; 2002). Mental models do repre-
A powerful rival account of deductive sent logical possibilities, but the model the-
reasoning is given by the mental model theory adds psychological proposals about how
ory (Johnson-Laird, 1 983 ; Johnson-Laird & people construct and reason with such mod-
Byrne, 1 991 , 2002; see Johnson-Laird, Chap. els. First, according to the principle of truth,
9), which is based on the semantic logical people normally represent only true possi-
approach. The semantic method proves ar- bilities. Hence, the theory proposes that the
guments by examining logical possibilities. full meaning of a “basic conditional” is the
In this approach, for example, the previous explicit set of true possibilities:
MT argument could be proved by truth table
analysis. This involves writing down a line in { pq, ¬ pq, ¬ p¬q}
the truth table for each possibility and evalu-
where ¬ means “not.” Second, owing to
ating both premises and conclusions. An ar-
working memory limitations, people form
gument is valid if there is not a line in the
table where the premises are true and the
conclusion false. A truth table analysis for Table 8.2 . Truth Table Analysis
the previous argument is shown in Table 8.1 . First premise Second Conclusion
It should be noted that the previous anal- Possibility if A then 3 premise 3 A
ysis, in accord with standard propositional
logic, assumes the conditional statement “if A, 3 True True True
p then q” conveys a logical relationship A, not-3 False False True
Not-A, 3 True True False
called material implication. Severe doubts
Not-A, True False False
have been expressed in both the philosoph-
not-3
ical and psychological literatures that the
incomplete initial representations. Thus the premises and not with the conclusion. This
conditional if p then q is normally repre- involves the same semantic principle as truth
sented as table analysis: An argument is valid if there
is no counterexample to it in which the
[ p]q
premises hold and the conclusion does not.
... Although this accounts for deductive com-
where “ . . . ” is a mental footnote to the ef- petence, the main finding on syllogistic rea-
fect that there may be other possibilities, al- soning is that people in fact endorse many
though they are not explicitly represented. fallacies. By analyzing the nature of the fal-
Like the mental logic theory, mental model lacies that people make and those they avoid,
theory gives an account of why MP is eas- Evans et al. (1 999) were able to provide
ier than MT. The square brackets around strong evidence that people do not normally
p in the model for the pq possibility indi- search for counterexample cases during syl-
cate that p is exhaustively represented with logistic reasoning. Some fallacies are made
respect to q (that is, it must be present in as frequently as valid inferences and some as
all models that include q). Hence, when the infrequently as on syllogisms where the con-
premise p is presented, there is no need to clusion is impossible. This strongly suggests
flesh out any other possibilities and the con- that people consider only a single model of
clusion q can be drawn right away (MP). the premises, endorsing the fallacy if this
When the MT argument is presented, how- model happens to include the conclusion.
ever, the second premise is not-q, which This issue has also been addressed in more
is not represented in any explicit model. recent papers by Newstead, Handley, and
Consequently, some people will say that Buck (1 999) and by Bucciarelli & Johnson-
“nothing follows.” Laird (1 999).
Successful MT reasoners, according to Both the mental logic and mental mod-
this theory, flesh out the explicit models for els theories described here provide abstract,
the conditional: general-purpose systems that can account
for human deductive competence across any
pq domain, but that also allow for error. There
¬ pq has been a protracted – and in my view, in-
conclusive – debate between advocates of
¬ p¬q
the two theories with many claims and coun-
The second premise eliminates the first two terclaims that one side or the other had
models, leaving only the possibility ¬ p¬q. found decisive empirical evidence (for re-
Hence, the conclusion not-p must follow. view and discussion, see Evans et al., 1 993 ,
With regard to the MT problem presented Chap. 3 ; Evans & Over, 1 996, 1 997). It is
earlier, this means that people must decide important to note that these two theories
that if there is not a 3 on right of the card, by no means exhaust the major theoreti-
the only possibility consistent with the con- cal attempts to account for the findings in
ditional is that the card does not have an A reasoning experiments, although other the-
on the left either. orists are less concerned with providing a
The model theory was originally devel- general account of deductive competence.
oped to account for syllogistic reasoning of Other approaches include theories framed in
the kind considered earlier (Johnson-Laird & terms of content-specific rules such as prag-
Bara, 1 984). In this version, it was argued matic reasoning schemas (Cheng & Holyoak,
that people formed a model of the premises 1 985 ; Holyoak & Cheng, 1 995 ) or Dar-
and formulated a provisional conclusion winian algorithms (Cosmides, 1 989; Fiddick,
consistent with this model. It was further Cosmides, & Tooby, 2000), which were de-
proposed that people made an effort at designed to account for content and context
duction by searching for a counterexample effects in reasoning discussed in the next sec-
case, that is, a model that agrees with the tion. The heuristic-analytic theory of Evans
(1 984, 1 989) was intended to given an ac- one must then work out that this means that
count of biases in deductive reasoning tasks A must be on the left. The double nega-
to which we now turn. tion effect can also be given an interpre-
tation within mental model theory (Evans,
Clibbens, & Rood, 1 995 ).
Introducing negatives into conditional
Biases in Deductive Reasoning
statements can also cause an effect known
as matching bias (Evans, 1 998). This is best
I have already mentioned that people are illustrated in a problem known as the Wason
very prone to making fallacies in syllogis- selection task (Wason, 1 966). Although not
tic reasoning and that they do not always strictly a deductive reasoning task, the selec-
succeed in drawing valid inferences such as tion task involves the logic of conditionals
MT in conditional reasoning. In fact, peo- and is considered part of the literature on
ple make many logical errors generally on the deduction. In a typical abstract version
deductive reasoning tasks. These errors are of the problem, participants are shown four
not necessarily random but often system- cards lying on a table and told that each has
atic, leading to description by term bias. We a capital letter on one side and a single figure
should note at this point that a bias is by number on the other. The visible sides are
definition a regular deviation from the logic
norm and defer for the time being the ques- B L 2 9
tion of whether biases should be taken to
indicate irrationality. They are told that the following rule ap-
One of the earliest known biases in con- plies to these four cards and may be true or
ditional reasoning was that of “negative con- false:
clusion bias” (Evans, 1 982), which affects
several conditional inferences, including MT If a card has a B on one side, then it has a
(Schroyens, Schaeken, & d’Ydewalle, 2001 ). 2 on the other side.
I gave an example of an MT inference earlier, The task is to decide which cards need to be
with an affirmative conditional statement, turned over in order to check whether the
and said that people solve this about 75 % of rule is true or false. Wason argued that the
the time. Consider a subtly changed version correct choice is B and 9 because only a card
of the earlier problem: with a B on one side and a number other than
If the card does not have an A on the left, 2 on the other side could disprove the rule.
then it has a 3 on the right. Most subsequent researchers have accepted
The card does not have a 3 on the right. this normative analysis, although some ar-
gue against it on the assumption that people
Therefore, the card has an A on the left.
interpret the task as having to do with cate-
The difference is that a negative has been gories rather than specific cards (Oaksford &
introduced into the first part of the condi- Chater, 1 994). In any event, only around
tional and the conclusion is now affirmative. 1 0% of university students typically choose
This argument is still MT and valid, but now the B and 9. The most common choices are
only around 40% to 5 0% of the time do peo- B and 2, or just B. Wason originally argued
ple succeed in making it – a very large and that this provided evidence of a confirmation
reliable difference across many studies. The bias in reasoning (Wason & Johnson-Laird,
most likely account of this bias is a double 1 972). That is, participants were trying to
negation effect. Reasoning by RAA on the discover the confirming combination of B
previous problem will, following discovery and 2 rather than the disconfirming combi-
of the contradiction, lead one to conclude nation of B and 9.
that the supposition that the card does not Wason later abandoned this account,
have an A on the left must be false. How- however, in light of the evidence of Evans
ever, this is a double negative from which and Lynch (1 973 ). These authors argued that
with an affirmative conditional the verifying of whether they were logically appropriate.
cards are also the matching cards in other Of course, in the explicit negation group,
words, those that match the values specified the negative cases really still match because
in the rule. By introducing negative compo- they refer to the letter and number in the
nents, it is possible to separate the two ac- conditional statement. In spite of this strong
counts. For example, suppose the rule was evidence, an alternative theory of match-
ing bias has been promoted by Oaksford
If a card has a B on one side, then it does
and Chater (1 994) based on expected in-
NOT have a 2 on the other side.
formation gain (negative statements con-
Now the matching choice of B and 2 is vey less information). Yama (2001 ) more
also the correct choice because a card with a recently reported experiments trying to sep-
B on one side and a 2 on the other side could arate the two accounts with somewhat ambi-
disprove the rule. Nearly everyone gets the valent findings.
task right with this version – a curious case One of the most important biases inves-
of a negative making things a lot easier. In tigated in the deductive reasoning literature
fact, when the presence of negatives is sys- is the belief bias effect, which is typically
tematically rotated, the pattern of findings but inaccurately described as a tendency to
strongly supports matching bias in both the endorse the validity of arguments when you
Evans and Lynch (1 973 ) study and a number agree with their conclusions. I consider the
of replication experiments reported later in belief bias effect in the following section on
the literature (Evans, 1 998). content and context effects. First, I briefly
What then is the cause of this match- discuss the implications of reasoning biases
ing bias? There is strong evidence that it for the debate about human rationality. Co-
reflects difficulty in processing implicit nega- hen (1 981 ) was one of the first critics to
tion. Evans, Clibbens, and Rood (1 996) pre- launch an attack on research in this field,
sented descriptions of the cards in place of as well as the related “heuristic and biases”
the actual cards. In the materials of the ex- program of work on probability judgment
ample given previously, their descriptions (Gilovich, Griffin, & Kahneman, 2002; Kah-
for an implicit and explicit negation group neman, Slovic, & Tversky, 1 982; see Kahne-
were as follows: man & Frederick, Chap. 1 2). Cohen argued
that evidence of error and bias in experi-
Implicit negation Explicit negation ments on reasoning and judgment should not
The letter on the The letter on the be taken as evidence of human irrationality.
card is a B. card is a B. Cohen’s arguments fall into three categories
The letter on the The letter on the that have also been reflected in writings of
card is an L. card is not a B. subsequent authors: the normative system
The number on the The number on the problem, the interpretation problem, and the
card is a 2. card is a 2. external validity problem (Evans, 1 993 ).
The number on the The number on the The first issue is that people can only be
card is a 9. card is not a 9. judged to be in error relative to some norma-
tive system that may well be disputable. For
The presence of negations was also var- example, philosophers have proposed alter-
ied in the conditionals in order to provide native logics, and the standard propositional
the standard method of testing for matching logic for deductive reasoning can be seen
bias. Whereas the implicit negation group as mapping poorly to real world reasoning,
showed normal strong matching bias, there which allows for uncertainty and the with-
was no matching bias at all in the explicit drawal of inferences in light of new evidence
negation group. However, this group did not (Evans & Over, 1 996; Oaksford & Chater,
perform more logically. They simply picked 1 998). The interpretation problem is that
more of the mismatching cards that would correctness of inference is judged on the as-
normally have been suppressed, regardless sumption that the participant understands
the task as the experimenter intended. This ground belief and knowledge (see the next
is also a pertinent criticism. As I (Evans, section) may prevent us from the kind of ab-
2002, p. 991 ) previously put it: stract reasoning that is needed in a modern
technological society, so different from the
The interpretation problem is a very seri- world in which we evolved.
ous one indeed for traditional users of the
deduction paradigm who wish to assess log-
ical accuracy. To pass muster, participants
are required not only to disregard prob- Content and Context Effects
lem content but also any prior beliefs they
have relevant to it. They must translate the Once thematic materials are introduced into
problem into a logical representation using deductive reasoning experiments, especially
the interpretation of key terms that accord
when some kind of context – however min-
with a textbook (not supplied) of standard
imal – is given, participants’ responses be-
logic . . . whilst disregarding the meaning of
the same terms in everyday discourse. come heavily influenced by pragmatic fac-
tors. This has led paradoxically to claims
The external validity argument is that both that familiar problem content can facil-
the demonstration of cognitive biases and il- itate logical reasoning and that such familiar-
lusions in the psychological laboratory does ity can be cause of bias! The task on which
not necessarily tell us anything about the real facilitation is usually claimed is the deontic
world. This one I have much less sympa- selection task that we examine first.
thy with. The laws of psychology apply in
the laboratory, as well as everywhere else, The Deontic Selection Task
and many of the biases that have been dis-
It has been known for many years that “re-
covered have been shown to also affect ex-
alistic” versions of the Wason selection task
pert groups. For example, base rate neglect
can facilitate correct card choices, although
in statistical reasoning has been shown many
it was not immediately realized that most of
times in medical and other expert groups
these versions change the logic of the task
(Koehler, 1 996), and there are numerous
from one of indicative reasoning to one of
real world studies of heuristics and biases
deontic reasoning. An indicative conditional,
(Fischhoff, 2002).
of the type used in the standard abstract task
One way of dealing with the normative
discussed earlier, makes an assertion about
system problem is to distinguish between
the state of the world that may be true or
normative and personal rationality (Ander-
false. Deontic conditionals concern rules and
son, 1 990; Evans & Over, 1 996). Logical er-
regulations and are often phrased using the
rors on deductive reasoning tasks violate nor-
terms “may” or “must,” although these may
mative rationality because the instructions
be implicit. A rule such as “if you are driv-
require one to assume the premises and draw
ing on the highway then you must keep your
necessary conclusions. Whether they violate
speed under 70 mph” cannot be true or false.
personal rationality is moot, however, be-
It may or may not be in force, and it may or
cause we may have little use for deductive
may not be obeyed.
reasoning in everyday life and carry over in-
A good example of a facilitatory ver-
appropriate but normally useful procedures
sion of the selection task is the drinking
instead (Evans & Over, 1 996). A different
age problem (Griggs & Cox, 1 982). Partici-
distinction is that between individual and
pants are told to imagine that they are police
evolutionary rationality (Stanovich, 1 999;
officers observing people drinking in a bar
Stanovich & West, 2000, 2003 ). Stanovich
and making sure that they comply with the
argues that what serves the interests of the
following law:
genes does not always serve the interests of
the individual. In particular, the tendency If a person is drinking in a bar, then that
to contextualize all problems against back- person must be over 1 9 years of age
(The actual age given depends on which announces the following rule:
population group is being presented with the
If a customer spends more than $1 00,
task and normally corresponds to the local
then he or she may take a free gift.
law it knows.) They are told that each card
represents a drinker and has on one side the The four cards represent customers showing
beverage being drunk and on the other side the amount spent on one side and whether
the age of the drinker. The visible sides of they received a gift on the other: “spent
the four cards show: $1 20,” “spent $75 ,” “received gift,” “did not
take gift.” If participants are given the per-
Drinking Drinking 22 years 1 6 years
beer spective of a store detective looking for
coke of age of age
cheating customers, they turn over cards 2
The standard instruction is to choose and 3 because a cheater would be taking the
those cards that could show that the rule gift without spending $1 00. If they are given
is being violated. The correct choice is the the perspective of a customer checking that
drinking beer and 1 6 year old, and most peo- the store is keeping its promise, however,
ple choose this. Compared with the abstract they turn cards 1 and 4 because a cheating
task, it is very easy. However, the task has store would not provide the gift to customers
not simply been made realistic. It is a deontic who spent the required amount.
task and one in which the context makes not There are several theoretical accounts of
only the importance of violation salient but the deontic selection task in the literature.
also makes it very easy to identify the violat- One of the earliest was the pragmatic rea-
ing case. There have been many replications soning schema theory of Cheng and Holyoak
and variations of such tasks (see Evans et al., (1 985 ). These authors proposed that peo-
1 993 , and Manktelow, 1 999, for reviews). It ple retrieve and apply a permission schema
has been established that real world knowl- comprising a set of production rules. For ex-
edge of the actual rule is not necessary to ample, on the drinking age problem, you
achieve facilitation (see, for example, Cheng need to fulfil the precondition of being older
& Holyoak, 1 985 ). Rules that express per- than 1 9 years of age in order to have permis-
mission or obligation relationships in plau- sion to drink beer in a bar. Once these ele-
sible settings usually lead people to the ap- ments are recognized and encoded as “pre-
propriate card choices. condition” and “action,” the abstract rules of
Most of the elements of presentation of the schema can be applied, leading to ap-
the drinking age problem as originally de- propriate card choices. This theory does not
vised by Griggs and Cox need to be in place, suppose that some general process of logi-
however. Removing the deontic orientation cal reasoning is being facilitated. The authors
of the violation instructions greatly weak- later added an obligation schema to explain
ens the effect (see Evans et al., 1 993 ), and the perspective shift effect discussed previ-
removing the minimal context about the ously (Holyoak & Cheng, 1 995 ). The rules of
police officer blocks most of the facilita- the obligation schema change the pattern of
tion (Pollard & Evans, 1 987). Hence, it is card choices, and the perspective determines
important to evoke pragmatic processes of which schema is retrieved and applied.
some kind that introduce prior knowledge A well-known but somewhat controver-
into the reasoning process. These factors sial theory is that choices on the deontic se-
can override the actual syntax of the condi- lection task are determined by Darwinian
tional rule. Several authors discovered inde- algorithms for social contracts, leading to
pendently that the perspective given to the cheater detection, or else by an innate hazard
participant in the scenario can change card avoidance module (Cosmides, 1 989; Fiddick
choices (Gigerenzer & Hug, 1 992; Mank- et al., 2000). The idea is that such mod-
telow & Over, 1 991 ; Politzer & Nguyen- ules would have been useful in the evolv-
Xuan, 1 992). For example, imagine that a ing environment, although that does not in
big department store, struggling for business, itself constitute evidence for them (Fodor,
2000). Although influential in philosophy ference if the second premise is changed to

and environmental biology, this work has “she does not go to the play,” inferring that
been subject to a number of criticisms in the “she does not meet her friend.” These infer-
psychological literature (Cheng & Holyoak, ences are easily defeated by additional infor-
1 989; Evans & Over, 1 996; Sperber, Cara, mation, however, a process known techni-
& Girotto, 1 995 ; Sperber & Girotto, 2002). cally as defeasible inference (Elio & Pelletier,
One criticism is that the responses that are 1 997; Oaksford & Chater, 1 991 ). Suppose
predicted are those that would be adap- we add an extra statement:
tive in contemporary society and so could
If she meets her friend, she will to go a
be accounted for by social learning in the
play.
lifetime of the individual; another is that
the effects to which the theory is applied If she has enough money, she will go to a
can be accounted for by much more gen- play.
eral cognitive processes. These include the- She meets her friend.
ories that treat the selection task as a decision What follows?
task in which people make choices in accord
In one study (Byrne, 1 989), 96% of par-
with expected utility (Evans & Over, 1 996;
ticipants gave the conclusion “she goes to
Manktelow & Over, 1 991 ; Oaksford &
the play” for the first MP problem, but only
Chater, 1 994), as well as a theory applying
3 8% for the second problem. In standard
principles of pragmatic relevance (Sperber
logic, an argument that follows from some
et al., 1 995 ).
premises must still follow if you add new
Regardless of which – if any – of these ac-
information. What is happening psychologi-
counts may be correct, it is clear that prag-
cally in the second case is that the extra con-
matic process heavily influences the deontic
ditional statement introduces doubt about
selection task. I have more to say about this
the truth of the first. People start to think
in a later section of the chapter when dis-
that, even though she wants to go to the
cussing “dual process” theory.
play with her friend, she might not be able
to afford it, and the lack of money will pre-
Biasing Effects of Content and Context vent her. The same manipulation inhibits the
MT inference.
In contrast with the claims of facilitation ef-
This work illustrates the difficulty of us-
fects on the Wason selection task, psycholo-
ing the term “bias” in deductive reasoning
gists have produced evidence that introduc-
research. Because a valid inference has been
ing real world knowledge may bias responses
suppressed, the effect is technically a bias.
to deductive reasoning tasks. It is known,
However, the reasoning of the participants
for example, that certain logically valid in-
in this experiment seems perfectly reason-
ferences that people normally draw can be
able and indeed more adaptive to everyday
suppressed when people introduce back-
needs than a strictly logical answer would
ground knowledge (see Evans et al., 1 993 ,
have been. A related finding is that, even
pp. 5 5 –61 ). Suppose you give people the
though people may be told to assume the
following problem:
premises of arguments are true, they are re-
If she meets her friend, she will go to a luctant to draw conclusions if they person-
play. ally do not believe the premises. In real life,
She meets her friend. of course, it makes perfect sense to base your
What follows? reasoning only on information that you be-
lieve to be true.
Nearly everyone will say, that she will go In logic, there is a distinction drawn be-
to the play. This is a very simple and, of tween a valid inference and a sound infer-
course, valid argument known in logic as MP. ence. A valid inference may lead to a false
Many participants will also make the MT in- conclusion, if at least one premise is false, as
in the following syllogism: Handley, & Harper, 2001 ). So we might as

well call it belief debias!
All students are lazy. Could people’s preference for sound ar-
No lazy people pass examinations. guments explain the belief bias effect?
Therefore, no students pass examinations. Many experiments in the literature have
failed to control for the believability of
The falsity of the previous conclusion is
premises. However, this can be done by in-
more immediately evident than that of ei-
troducing nonsense linking terms, as in the
ther of the premises. However, the argu-
following syllogism:
ment is valid, and so at least one premise
must be false. A sound argument is a valid All fish are phylones.
argument based on true premises and has All phylones are trout.
the merit of guaranteeing a true conclu-
Therefore, all fish are trout.
sion. Because the standard deductive rea-
soning task includes instructions to assume Because no one knows what a phylone
the premises, as well as to draw necessary is, he or she can hardly be expected to
conclusions, psychologists generally assume have any prior belief about either premise.
they have requested their participants to However, the conclusion is clearly unbeliev-
make validity judgments. However, there is able, and the same technique can be made
evidence that when familiar problem con- to render believable conclusions. Newstead,
tent is used, people respond as though they Pollard, Evans, and Allen (1 992) found sub-
had been asked to judge soundness instead stantial belief bias effects with such syllo-
(Thompson, 2001 ). This might well account gisms. However, it could still be the case
for the suppression of MP. The inference is that people resist arguments with false con-
so obvious that it can hardly reflect a failure clusions because such arguments must by
in reasoning. definition be unsound. As we observed ear-
People are also known to be influenced lier, if the argument is valid and the conclu-
by the believability of the conclusion of the sion false, at least one premise must be false,
argument presented, reliably (and usually even if we cannot tell which one. For fur-
massively) preferring to endorse the valid- ther discussion of this and related issues, see
ity of arguments with believable rather than Evans et al. (2001 ) and Klauer, Musch, and
unbelievable conclusions, the so-called “be- Naumer (2000).
lief bias” effect. The standard experiment
uses syllogisms and independently manipu-
lates the believability of the conclusion and
the validity of the argument. People accept Dual-Process Theory
both more valid arguments (logic effect) and
more believable conclusions (belief effect), The deductive reasoning paradigm has
and the two factors normally interact (Evans, yielded a wealth of psychological data over
Barston, & Pollard, 1 983 ). This is because the past 40 years or so. Understanding
the belief bias effect is much stronger on the issues involved has been assisted by
invalid than valid arguments. The effect is more recent developments in dual-process
really misnamed, however, because as we theories of reasoning (Evans, 2003 ; Evans
saw in our earlier discussion, people tend & Over, 1 996; Sloman, 1 996; Stanovich,
to endorse many fallacies when engaged in 1 999), which have gradually evolved from
abstract syllogistic reasoning. When belief- much earlier proposals in the reasoning lit-
neutral content is included in belief bias erature (Evans, 1 984; Wason & Evans, 1 975 )
experiments, the effect of belief is shown and has been linked with research on im-
to be largely negative: Unbelievable conclu- plicit learning (see Litman & Reber, Chap.
sions cause people to withhold fallacies that 1 8; Dienes & Perner, 1 999; Reber, 1 993 )
they would otherwise have made (Evans, and intuitive judgment (Gilovich & Griffin,
2002; Kahneman & Frederick, 2002; see judgment problems. This clearly implicates
Kahneman & Frederick, Chap. 1 2). The idea system 2.
is that there are two distinct cognitive sys- Consider the Wason selection task, for
tems with different evolutionary histories. example. The abstract indicative version,
System 1 (to use Stanovich’s terminology) which defeats most people, contains no
is the ancient system that relies on asso- helpful pragmatic cues and thus requires
ciative learning through distributed neural abstract logical reasoning for its solution.
networks and may also reflect the opera- Stanovich and West (1 998) accordingly
tion of innate modules. It is really a bundle showed that the small numbers who solve it
of systems that most theorists regarded as have significantly higher SAT scores. How-
implicit, meaning that only the final prod- ever, they also showed no difference in SAT
ucts of such a process register in conscious- scores between solvers and nonsolvers of
ness, and they may stimulate actions without the deontic selection task. This makes sense
any conscious reflection. System 2, in con- because the pragmatic processes that ac-
trast, is evolutionarily recent and arguably count for the relative ease of this task are
unique to humans. This system requires use of the kind attributed in the theory to sys-
of central working memory resources and tem 1 . However, this does call into ques-
is therefore slow and sequential in nature. tion whether the deontic selection task re-
System 2 function relates to general mea- ally requires a process that we would want
sures of cognitive ability such as IQ, whereas to call reasoning. The solution appears to
system 1 function does not (Reber, 1 993 ; be provided automatically, without con-
Stanovich, 1 999). However, system 2 allows scious reflection.
us to engage in abstract reasoning and hy- If the theory is right, then system 2 in-
pothetical thinking. There is more recent tervention occurs mostly because of the
supporting evidence of a neuropsycholog- use of explicit instructions requiring an ef-
ical nature for this theory. When resolv- fort at deduction. We know that the in-
ing belief–logic conflicts in the belief bias structions used have a major influence on
paradigm, the response that dominates cor- the response people make (Evans, Allen,
relates with distinct areas of brain activ- Newstead, & Pollard, 1 994; George, 1 995 ;
ity (Goel, Buchel, Rith, & Olan, 2000; see Stevenson & Over, 1 995 ). The more in-
Goel, Chap. 20). structions emphasize logical necessity, the
Dual-process theory can help us make more logical the responding; when instruc-
sense of much of the research on deductive tions are relaxed and participants are asked
reasoning that we have been discussing. It if a conclusion follows, responses are much
seems that the default mode of everyday rea- more strongly belief based. The ability to
soning is pragmatic, reflecting the associative resist belief in belief–logic conflict prob-
processes of system 1 . Deductive reasoning lems when instructed to reason logically is
experiments, however, include instructions strongly linked to measures of cognitive abil-
that require a conscious effort at deduction ity (Stanovich & West, 1 997), and the same
and often require the suppression of prag- facility is known to decline sharply in old
matic processes because we are asked to dis- age (Gilinsky & Judd, 1 994; see Salthouse,
regard relevant prior belief and knowledge. Chap. 24). This provides strong converging
Hence, reasoning tasks often require strong evidence for dual systems of reasoning (see
system 2 intervention if they are to be solved. also Sloman, 2002).
In support of this theory, Stanovich (1 999)
reviewed a large research program in which
it was consistently shown that participants
with high SAT scores (a measure of general Conclusions and Future Directions
cognitive ability) produced more normative
solutions than those with lower scores on Research on deductive reasoning was origi-
a wide range of reasoning, decision, and nally stimulated by the traditional interest in
logicism – the belief that logic provided the ever, that we understand that this is what
rational basis for human thinking. This ratio- we are doing. It is no longer appropriate to
nale has been considerably undermined over equate performance on deductive reasoning
the past 40 years because many psycholo- tasks with rationality or to assume that logic
gists have abandoned logic, first as a descrip- provides an appropriate normative account
tive and later as a normative system for hu- of everyday, real world reasoning.
man reasoning (Evans, 2002). Research with
the deduction paradigm has also shown, as
indicated in this chapter, that pragmatic pro-
cesses have a very large influence once re- References
alistic content and context are introduced.
Studying such processes using the paradigm Anderson, J. R. (1 990). The adaptive character of
necessarily defines them as biases because thought. Hillsdale, NJ: Erlbaum.
the task requires one to assume premises Braine, M. D. S., & O’Brien, D. P. (Eds). (1 998).
and draw necessary conclusions. However, it Mental logic. Mahwah, NJ: Erlbaum.
is far from clear that such biases should be Bucciarelli, M., & Johnson-Laird, P. N. (1 999).
regarded as evidence of irrationality, as dis- Strategies in syllogistic reasoning. Cognitive Sci-
cussed earlier. ence, 2 3 , 247–3 03 .
The deductive reasoning field has seen Byrne, R. M. J. (1 989). Suppressing valid in-
discussion and debate of a wide range of ferences with conditionals. Cognition, 3 1, 61 –
theoretical ideas, a number of which have 83 .
been described here. This includes the long- Cheng, P. W., & Holyoak, K. J. (1 985 ). Pragmatic
running debate over whether rule-based reasoning schemas. Cognitive Psychology, 1 7,
mental logics or mental model theory pro- 3 91 –41 6.
vides the better account of basic deductive Cheng, P. W., & Holyoak, K. J. (1 989). On the
competence, as well as the development of natural selection of reasoning theories. Cogni-
accounts based on content-specific reason- tion, 3 3 , 285 –3 1 4.
ing, such as pragmatic reasoning schemas, Cohen, L. J. (1 981 ). Can human irrationality be
relevance theory, and Darwinian algorithms. experimentally demonstrated? Behavioral and
It has been a major focus for the develop- Brain Sciences, 4, 3 1 7–3 70.
ment of dual-process theories of cognition, Cosmides, L. (1 989). The logic of social
even though these have a much wider appli- exchange: Has natural selection shaped how
cation. It has also been one of the major fields humans reason? Cognition, 3 1, 1 87–276.
(alongside intuitive and statistical judgment) Dienes, Z., & Perner, J. (1 999). A theory of im-
in which cognitive biases have been studied plicit and explicit knowledge. Behavioral and
and their implications for human rationality Brain Sciences, 2 2 , 73 5 –808.
debated at length. Edgington, D. (1 995 ). On conditionals. Mind,
So where does the future of the deduction 1 04, 23 5 –3 29.
paradigm lie? I have suggested (Evans, 2002) Elio, R., & Pelletier, F. J. (1 997). Belief change
that we should use a much wider range of as propositional update. Cognitive Science, 2 1,
41 9–460.
methods for studying human reasoning, es-
pecially when we are interested in investi- Evans, J. St. B. T. (1 982). The psychology of deduc-
tive reasoning. London: Routledge.
gating the pragmatic reasoning processes of
system 1 . In fact, there is no point at all in Evans, J. St. B. T. (1 984). Heuristic and analytic
processes in reasoning. British Journal of Psy-
instructing people to make an effort at de-
chology, 75 , 45 1 –468.
duction unless we are interested in system
Evans, J. St. B. T. (1 989). Bias in human reasoning:
2 reasoning or want to set the two systems
Causes and consequences. Hove, UK: Erlbaum.
in conflict. However, this conflict is of both
Evans, J. St. B. T. (1 993 ). Bias and rational-
theoretical and practical interest and will un-
ity. In K. I. Manktelow & D. E. Over (Eds.),
doubtedly continue to be studied using the
Rationality: Psychological and philosophical per-
deduction paradigm. It is important, how- spectives (pp. 6–3 0). London: Routledge.
Evans, J. St. B. T. (1 998). Matching bias in condi- role of domain-specific representations and in-
tional reasoning: Do we understand it after 25 ferences in the Wason selection task. Cognition,
years? Thinking and Reasoning, 4, 45 –82. 77, 1 –79.
Evans, J. St. B. T. (2002). Logic and human Fischhoff, B. (2002). Heuristics and biases in
reasoning: An assessment of the deduction application. In T. Gilovich, D. Griffin, & D.
paradigm. Psychological Bulletin, 1 2 8, 978–996. Kahneman (Eds.), Heuristics and biases: The
Evans, J. St. B. T. (2003 ). In two minds: Dual pro- psychology of intuitive judgement (pp. 73 0–
cess accounts of reasoning. Trends in Cognitive 748). Cambridge, UK: Cambridge University
Sciences, 7, 45 4–45 9. Press.
Evans, J. St. B. T., Allen, J. L., Newstead, S. E., Fodor, J. (2000). Why we are so good at catching
& Pollard, P. (1 994). Debiasing by instruction: cheaters? Cognition, 75 , 29–3 2.
The case of belief bias. European Journal of Cog- George, C. (1 995 ). The endorsement of the
nitive Psychology, 6, 263 –285 . premises: Assumption-based or belief-based
Evans, J. St. B. T., Barston, J. L., & Pollard, P. reasoning. British Journal of Psychology, 86, 93 –
(1 983 ). On the conflict between logic and be- 111.
lief in syllogistic reasoning. Memory and Cogni- Gigerenzer, G., & Hug, K. (1 992). Domain-
tion, 1 1 , 295 –3 06. specific reasoning: Social contracts, cheating
Evans, J. St. B. T., Clibbens, J., & Rood, B. (1 995 ). and perspective change. Cognition, 43 , 1 27–
Bias in conditional inference: Implications for 1 71 .
mental models and mental logic. Quarterly Gilinsky, A. S., & Judd, B. B. (1 994). Working
Journal of Experimental Psychology, 48A, 644– memory and bias in reasoning across the life-
670. span. Psychology and Aging, 9, 3 5 6–3 71 .
Evans, J. St. B. T., Clibbens, J., & Rood, B. (1 996). Gilovich, T., & Griffin, D. (2002). Introduction –
The role of implicit and explicit negation in Heuristics and biases: Then and now. In T.
conditional reasoning bias. Journal of Memory Gilovich, D. Griffin, & A. Kahneman (Eds.),
and Language, 3 5 , 3 92–409. Heuristics and biases: The psychology of intuitive
Evans, J. St. B. T., Handley, S. H., & Harper, judgment (pp. 1 –1 8). Cambridge, UK: Cam-
C. (2001 ). Necessity, possibility and belief: A bridge University Press.
study of syllogistic reasoning. Quarterly Jour- Gilovich, T., Griffin, D., & Kahneman, D. (2002).
nal of Experimental Psychology, 5 4A, 93 5 –95 8. Heuristics and biases: The psychology of intuitive
Evans, J. St. B. T., Handley, S. J., Harper, C., & judgement. Cambridge, UK: Cambridge Uni-
Johnson-Laird, P. N. (1 999). Reasoning about versity Press.
necessity and possibility: A test of the mental Goel, V., Buchel, C., Rith, C., & Olan, J.
model theory of deduction. Journal of Experi- (2000). Dissociation of mechanisms underly-
mental Psychology: Learning, Memory and Cog- ing syllogistic reasoning. NeuroImage, 1 2 , 5 04–
nition, 2 5 , 1 495 –1 5 1 3 . 5 1 4.
Evans, J. St. B. T., Handley, S. H., & Over, D. E. Griggs, R. A., & Cox, J. R. (1 982). The elusive
(2003 ). Conditionals and conditional probabil- thematic materials effect in the Wason selec-
ity. Journal of Experimental Psychology: Learn- tion task. British Journal of Psychology, 73 , 407–
ing, Memory and Cognition, 2 9, 3 21 –3 5 5 . 420.
Evans, J. St. B. T., & Lynch, J. S. (1 973 ). Match- Henle, M. (1 962). On the relation between logic
ing bias in the selection task. British Journal of and thinking. Psychological Review, 69, 3 66–
Psychology, 64, 3 91 –3 97. 3 78.
Evans, J. St. B. T., Newstead, S. E., & Byrne, Holyoak, K., & Cheng, P. (1 995 ). Pragmatic rea-
R. M. J. (1 993 ). Human reasoning: The psychol- soning with a point of view. Thinking and Rea-
ogy of deduction. Hove, UK: Erlbaum. soning, 1 , 289–3 1 4.
Evans, J. St. B. T., & Over, D. E. (1 996). Rationality Inhelder, B., & Piaget, J. (1 95 8). The growth of log-
and reasoning. Hove, UK: Psychology Press. ical thinking. New York: Basic Books.
Evans, J. St. B. T., & Over, D. E. (2004). If. Oxford, Johnson-Laird, P. N. (1 983 ). Mental models. Cam-
UK: Oxford University Press. bridge, UK: Cambridge University Press.
Fiddick, L., Cosmides, L., & Tooby, J. (2000). Johnson-Laird, P. N., & Bara, B. G. (1 984). Syllo-
No interpretation without representation: The gistic inference. Cognition, 1 6, 1 –61 .
Johnson-Laird, P. N., & Byrne, R. M. J. (1 991 ). Pollard, P., & Evans, J. St. B. T. (1 987). On the
Deduction. Hove, UK: Erlbaum. relationship between content and context ef-
Johnson-Laird, P. N., & Byrne, R. M. J. (2002). fects in reasoning. American Journal of Psychol-
Conditionals: A theory of meaning, pragmatics ogy, 1 00, 41 –60.
and inference. Psychological Review, 1 09, 646– Reber, A. S. (1 993 ). Implicit learning and tacit
678. knowledge. Oxford, UK: Oxford University
Kahneman, D., & Frederick, S. (2002). Represen- Press.
tativeness revisited: Attribute substitution in Rips, L. J. (1 994). The psychology of proof. Cam-
intuitive judgement. In T. Gilovich, D. Griffin, bridge, MA: MIT Press.
& D. Kahneman (Eds.), Heuristics and biases: Schroyens, W., Schaeken, W., & d’Ydewalle, G.
The psychology of intuitive judgement (pp. 49– (2001 ). The processing of negations in condi-
81 ). Cambridge, UK: Cambridge University tional reasoning: A meta-analytic study in men-
Press. tal models and/or mental logic theory. Thinking
Kahneman, D., Slovic, P., & Tversky, A. (1 982). and Reasoning, 7, 1 21 –1 72.
Judgment under uncertainty: Heuristics and bi- Sloman, S. A. (1 996). The empirical case for
ases. Cambridge, UK: Cambridge University two systems of reasoning. Psychological Bulletin,
Press. 1 1 9, 3 –22.
Klauer, K. C., Musch, J., & Naumer, B. (2000). Sloman, S. A. (2002). Two systems of reason-
On belief bias in syllogistic reasoning. Psycho- ing. In T. Gilovich, D. Griffin, & D. Kahneman
logical Review, 1 07, 85 2–884. (Eds.), Heuristics and biases: The psychology of
Koehler, J. J. (1 996). The base rate fal- intuitive judgment (pp. 3 79–3 98). Cambridge,
lacy reconsidered: Descriptive, normative and UK: Cambridge University Press.
methodological challenges. Behavioral and Sperber, D., Cara, F., & Girotto, V. (1 995 ). Rel-
Brain Sciences, 1 9, 1 –5 3 . evance theory explains the selection task. Cog-
Manktelow, K. I. (1 999). Reasoning and thinking. nition, 5 7, 3 1 –95 .
Hove, UK: Psychology Press. Sperber, D., & Girotto, V. (2002). Use or mis-
Manktelow, K. I., & Over, D. E. (1 991 ). Social use of the selection task? Rejoinder to Fiddick,
roles and utilities in reasoning with deontic Cosmides and Tooby. Cognition, 85 , 277–290.
conditionals. Cognition, 3 9, 85 –1 05 . Stanovich, K. E. (1 999). Who is rational? Studies
Newstead, S. E., Handley, S. H., & Buck, E. of individual differences in reasoning. Mahwah,
(1 999). Falsifying mental models: Testing the NJ: Erlbaum.
predictions of theories of syllogistic reasoning. Stanovich, K. E., & West, R. F. (1 997). Reason-
Journal of Memory and Language, 2 7, 3 44–3 5 4. ing independently of prior belief and individ-
Newstead, S. E., Pollard, P., Evans, J. St. B. T., & ual differences in actively open-minded think-
Allen, J. L. (1 992). The source of belief bias ing. Journal of Educational Psychology, 89, 3 42–
effects in syllogistic reasoning. Cognition, 45 , 3 5 7.
25 7–284. Stanovich, K. E., & West, R. F. (1 998). Cog-
Oaksford, M., & Chater, N. (1 991 ). Against logi- nitive ability and variation in selection task
cist cognitive science. Mind and Language, 6, performance. Thinking and Reasoning, 4, 1 93 –
1 –3 8. 23 0.
Oaksford, M., & Chater, N. (1 994). A rational Stanovich, K. E., & West, R. F. (2000). Individ-
analysis of the selection task as optimal data ual differences in reasoning: Implications for
selection. Psychological Review, 1 01 , 608–63 1 . the rationality debate. Behavioral and Brain Sci-
Oaksford, M., & Chater, N. (1 998). Rationality ences, 2 3 , 645 –726.
in an uncertain world. Hove, UK: Psychology Stanovich, K. E., & West, R. F. (2003 ). Evolu-
Press. tionary versus instrumental goals: How evolu-
Politzer, G., & Nguyen-Xuan, A. (1 992). Reason- tionary psychology misconceives human ratio-
ing about conditional promises and warnings: nality. In D. Over (Ed.), Evolution and the psy-
Darwinian algorithms, mental models, rel- chology of thinking (pp. 1 71 –23 0). Hove, UK:
evance judgements or pragmatic schemas? Psychology Press.
Quarterly Journal of Experimental Psychology, Stevenson, R. J., & Over, D. E. (1 995 ). Deduc-
44, 401 –41 2. tion from uncertain premises. The Quarterly
Journal of Experimental Psychology, 48A, 61 3 – Wason, P. C., & Evans, J. St. B. T. (1 975 ).
643 . Dual processes in reasoning? Cognition, 3 , 1 41 –
Thompson, V. A. (2001 ). Reasoning from false 1 5 4.
premises: The role of soundness in making logi- Wason, P. C., & Johnson-Laird, P. N. (1 972).
cal deductions. Canadian Journal of Experimen- Psychology of reasoning: Structure and content.
tal Psychology, 5 0, 3 1 5 –3 1 9. London: Batsford.
Wason, P. C. (1 966). Reasoning. In B. M. Foss Yama, H. (2001 ). Matching versus optimal data
(Ed.), New horizons in psychology I (pp. 1 06– selection in the Wason selection task. Thinking
1 3 7). Harmondsworth: Penguin. and Reasoning, 7, 295 –3 1 1 .
CHAPTER 9
Mental Models and Thought
P. N. Johnson-Laird
How do we think? One answer is that from my pocket – my pockets are deep and
we rely on mental models. Perception yields I walked slowly back to my room – and so
models of the world that lie outside us. it’s here or in the restaurant.
An understanding of discourse yields mod-
Embedded in this sequence is a logical de-
els of the world that the speaker describes
duction of the form:
to us. Thinking, which enables us to antic-
ipate the world and to choose a course of A or B or C.
action, relies on internal manipulations of Not B.
these mental models. This chapter is about Therefore, A or C.
this theory, which it refers to as the model
theory, and its experimental corroborations. The conclusion is valid: It must be true given
The theory aims to explain all sorts of think- that the premises are true. However, other
ing about propositions, that is, thoughts ca- sorts of thinking occur in the protocol (e.g.,
pable of being true or false. There are other the inference that the book could not have
sorts of thinking – the thinking, for in- fallen out of the protagonist’s pocket).
stance, of a musician who is improvising. A simple way to categorize thinking about
In daily life, unlike the psychological labo- propositions is in terms of its effects on se-
ratory, no clear demarcation exists between mantic information (Johnson-Laird, 1 993 ).
one sort of thinking and another. Here is The more possibilities an assertion rules out,
a protocol of a typical sequence of every- the greater the amount of semantic informa-
day thoughts: tion it conveys (Bar-Hillel & Carnap, 1 964).
Any step in thought from current premises
I had the book in the hotel’s restaurant, to a new conclusion therefore falls into one
and now I’ve lost it. So, either I left it in the of the following categories:
restaurant, or it fell out of my pocket on the
way back to my room, or it’s somewhere r The premises and the conclusion elimi-
here in my room. It couldn’t have fallen nate the same possibilities.
1 85
r The premises eliminate at least one possible actions within its head, it is able to
more possibility over those the conclusion try out various alternatives, conclude which
eliminates. is the best of them, react to future situations
r The conclusion eliminates at least one before they arise, utilize the knowledge of
past events in dealing with the present and
more possibility over those the premises
the future, and in every way to react in
eliminate. a much fuller, safer, and more competent
r The premises and conclusion eliminate manner to the emergencies which face it.
disjoint possibilities.
r The premises and conclusion eliminate This same process of internal imitation of
overlapping possibilities. the external world, Craik wrote, is carried
out by mechanical devices such as Kelvin’s
The first two categories are deductions (see tidal predictor. Craik died in 1 945 , before
Evans, Chapter 1 1 ). The third category in- he could develop his ideas. Several earlier
cludes all the traditional cases of induction, thinkers had, in fact, anticipated him (see
which in general is definable as any thought Johnson-Laird, 2003 ). Nineteenth-century
yielding such an increase in semantic infor- physicists, including Kelvin, Boltzmann, and
mation (see Sloman & Lagnado, Chap. 3 ). Maxwell, stressed the role of models in
The fourth category occurs only when the thinking. In the twentieth century, physicists
conclusion is inconsistent with the premises. downplayed these ideas with the advent of
The fifth case occurs when the conclusion quantum theory (but cf. Deutsch, 1 997).
is consistent with the premises but refutes One principle of the modern theory
at least one premise and adds at least one is that the parts of a mental model and
new proposition. Such thinking goes beyond their structural relations correspond to those
induction. It is associative or creative (see which they represent. This idea has many
Sternberg, Chap. 1 3 ). antecedents. It occurs in Maxwell’s (1 91 1 )
The model theory aims to explain all views on diagrams, in Wittgenstein’s (1 922)
propositional thinking, and this chapter il- “picture” theory of meaning, and in Köhler’s
lustrates its application to the five preceding (1 93 8) hypothesis of an isomorphism be-
categories. The chapter begins with the his- tween brain fields and the world. However,
tory of the model theory. It then outlines the nineteenth-century grandfather of the
the current theory and its account of de- model theory is Charles Sanders Peirce.
duction. It reviews some of the evidence for Peirce coinvented the main system of
this account. It shows how the theory ex- logic known as predicate calculus, which gov-
tends to probabilistic reasoning. It then turns erns sentences in a formal language contain-
to induction, and it describes the uncon- ing idealized versions of negation, sentential
scious inferences that occur in understand- connectives such as “and” and “or,” and quan-
ing discourse. It shows how models underlie tifiers such as “all” and “some.” Peirce devised
causal relations and the creation of expla- two diagrammatic systems of reasoning, not
nations. Finally, it assesses the future of the to improve reasoning, but to display its un-
model theory. derlying mental steps (see Johnson-Laird,
2002). He wrote:
Deduction is that mode of reasoning which

The History of Mental Models examines the state of things asserted in the
premisses, forms a diagram of that state
of things, perceives in the parts of the di-
In the seminal fifth chapter of his book, The agram relations not explicitly mentioned in
Nature of Explanation, Kenneth Craik (1 943 ) the premisses, satisfies itself by mental ex-
wrote: periments upon the diagram that these re-
lations would always subsist, or at least
If the organism carries a “small-scale would do so in a certain proportion of cases,
model” of external reality and of its own and concludes their necessary, or probable,
mental models and thought 1 87
truth (Peirce, 1 .66; this standard notation ents. Similar ideas occurred in psycholinguis-
refers to paragraph 66 of Volume 1 of Peirce, tics (e.g., Bransford, Barclay, & Franks, 1 972),
1 93 1 –1 95 8). linguistics (Karttunen, 1 976), artificial intel-
ligence (Webber, 1 978), and formal seman-
Diagrams can be iconic, in other words, have tics (Kamp, 1 981 ). Experimental evidence
the same structure as what they represent corroborated the hypothesis, showing that
(Peirce, 4.447). It is the inspection of an individuals rapidly forget surface and un-
iconic diagram that reveals truths other than derlying syntax (Johnson-Laird & Stevenson,
those of the premises (2.279, 4.5 3 0). Hence, 1 970), and even the meaning of individ-
Peirce anticipates Maxwell, Wittgenstein, ual sentences (Garnham, 1 987). They re-
Köhler, and the model theory. Mental mod- tain only models of who did what to whom.
els are as iconic as possible (Johnson-Laird, Psycholinguists discovered that models are
1 983 , pp. 1 25 , 1 3 6). constructed from the meanings of sentences,
A resurgence of mental models in cog- general knowledge, and knowledge of hu-
nitive science began in the 1 970s. Theorists man communication (e.g., Garnham, 2001 ;
proposed that knowledge was represented in Garnham & Oakhill, 1 996; Gernsbacher,
mental models, but they were not wed to 1 990; Glenberg, Meyer, & Lindem, 1 987).
any particular structure for models. Hayes Another early discovery was that con-
(1 979) used the predicate calculus to de- tent affects deductive reasoning (Wason
scribe the naive physics of liquids. Other & Johnson-Laird, 1 972; see Evans, Chap.
theorists in artificial intelligence proposed 8), which was hard to reconcile with
accounts of how to envision models and the then dominant view that reason-
use them to simulate behavior (de Kleer, ers depend on formal rules of inference
1 977). Psychologists similarly examined (Braine, 1 978; Johnson-Laird, 1 975 ; Osher-
naive and expert models of various domains, son, 1 974–1 976). Granted that models come
such as mechanics (McCloskey, Caramazza, from perception and discourse, they could
& Green, 1 980) and electricity (Gentner be used to reason (Johnson-Laird, 1 975 ):
& Gentner, 1 983 ). They argued that vi- An inference is valid if its conclusion holds
sion yields a mental model of the three- in all the models of the premises because
dimensional structure of the world (Marr, its conclusion must be true granted that its
1 982). They proposed that individuals use premises are true. The next section spells out
these models to simulate behavior (e.g., this account.
Hegarty, 1 992; Schwartz & Black, 1 996).
They also studied how models develop (e.g.,
Vosniadou & Brewer, 1 992; Halford, 1 993 ), Models and Deduction
how they serve as analogies (e.g., Holland,
Holyoak, Nisbett, & Thagard, 1 986; see Mental models represent entities and per-
Holyoak, Chap. 6), and how they help in sons, events and processes, and the opera-
the diagnosis of faults (e.g., Rouse & Hunt, tions of complex systems. However, what
1 984). Artifacts, they argued, should be de- is a mental model? The current theory is
signed so users easily acquire models of them based on principles that distinguish mod-
(e.g., Ehrlich, 1 996; Moray, 1 990, 1 999). els from linguistic structures, semantic net-
Discourse enables humans to experience works, and other proposed mental represen-
the world by proxy, and so another early tations (Johnson-Laird & Byrne, 1 991 ). The
hypothesis was that comprehension yields first principle is
models of the world (Johnson-Laird, 1 970).
The principle of iconicity: A mental model
The models are iconic in these ways: They
has a structure that corresponds to the
contain a token for each referent in the known structure of what it represents.
discourse, properties corresponding to the
properties of the referents, and relations cor- Visual images are iconic, but mental mod-
responding to the relations among the refer- els underlie images. Even the rotation of
mental images implies that individuals ro- Table 9.1 . The Truth Table for Exclusive
tate three-dimensional models (Metzler & Disjunction
Shepard, 1 982), and irrelevant images im- A B A or else B, but not both
pair reasoning (Knauff, Fangmeir, Ruff, &
Johnson-Laird, 2003 ; Knauff & Johnson- True True False
Laird, 2002). Moreover, many components True False True
of models cannot be visualized. False True True
False False False
One advantage of iconicity, as Peirce
noted, is that models built from premises can
yield new relations. For example, Schaeken,
Johnson-Laird, and d’Ydewalle (1 996) in- This principle is illustrated in sentential
vestigated problems of temporal reasoning reasoning, which hinges on negation and
concerning such premises as such sentential connectives as “if” and “or.”
John eats his breakfast before he listens to In logic, these connectives have idealized
the radio. meanings: They are truth-functional in that
the truth-values of sentences formed with
Given a problem based on several premises them depend solely on the truth-values of
with the form: the clauses that they connect. For example,
A before B. a disjunction of the form: A or else B but not
B before C. both is true if A is true and B is false, and if
A is false and B is true, but false in any other
D while A.
case. Logicians capture these conditions in a
E while C. truth table, as shown in Table 9.1 . Each row
reasoners can build a mental model with in the table represents a different possibility
the structure: (e.g., the first row represents the possibility
in which both A and B are true), and so here
A B C the disjunction is false.
D E Naive reasoners do not use truth tables
(Osherson, 1 974–1 976). Fully explicit mod-
where the left-to-right axis is time, and the els of possibilities, however, are a step to-
vertical axis allows different events to be ward psychological plausibility. The fully ex-
contemporaneous. Granted that each event plicit models of the exclusive disjunction,
takes roughly the same amount of time, A or else B but not both, are shown here on
reasoners can infer a new relation: separate lines:
D before E.
Formal logic less readily yields the conclu- A ¬B
sion. One difficulty is that an infinite num- ¬A B
ber of conclusions follow validly from any
set of premises, and logic does not tell you where “¬” denotes negation. Table 9.2
which conclusions are useful. From the pre- presents the fully explicit models for the
vious premises, for instance, this otiose con- main sentential connectives. Fully explicit
clusion follows: models correspond exactly to the true rows
in the truth table for each connective. As
A before B, and B before C. the table shows, the conditional If A then B
Possibilities are crucial, and the second is treated in logic as though it can be para-
principle of the theory assigns them a central phrased as If A then B, and if not-A then B or
role: not-B. The paraphrase does not do justice to
the varied meanings of everyday condition-
The principle of possibilities: Each mental als (Johnson-Laird & Byrne, 2002). In fact,
model represents a possibility. no connectives in natural language are truth
Table 9.2 . Fully Explicit Models and Mental Models of Possibilities

Compatible with Sentences Containing the Principal Sentential Connectives
Sentences Fully Explicit Models Mental Models
A and B: A B A B
Neither A nor B: ¬A ¬B ¬A ¬B
A or else B but not both: A ¬B A
¬A B B
A or B or both: A ¬B A
¬A B B
A B A B
If A then B: A B A B
¬A B ...
¬A ¬B
If, and only if A, then B: A B A B

¬A ¬B ...
functional (see the section on implicit induc- or in brief:

tion and the modulation of models).
Fully explicit models yield a more effi- ¬A B
cient reasoning procedure than truth tables. Because an inference is valid if its conclu-
Each premise has a set of fully explicit mod- sion holds in all the models of the premises,
els, for example, the premises: it follows that: B. The same rules are
used recursively to construct the models
1 . A or else B but not both.
of compound premises containing multiple
2. Not-A. connectives.
have the models: Because infinitely many conclusions fol-
low from any premises, computer programs
(Premise 1 ) (Premise 2) for proving validity generally evaluate con-
A ¬B ¬A clusions given to them by the user. Hu-
¬A B man reasoners, however, can draw conclu-
sions for themselves. They normally abide
Their conjunction depends on combining by two constraints (Johnson-Laird & Byrne,
each model in one set with each model in 1 991 ). First, they do not throw semantic in-
the other set according to two main rules: formation away by adding disjunctive alter-
r A contradiction between a pair of models natives. For instance, given a single premise,
A, they never spontaneously conclude, A or
yields the null model (akin to the empty
B or both. Second, they draw novel conclu-
set).
sions that are parsimonious. For instance,
r Any other conjunction yields a model of
they never draw a conclusion that merely
each proposition in the two models. conjoins the premises, even though such
a deduction is valid. Of course, human
The result is: performance rapidly degrades with com-
plex problems, but the goal of parsimony
Input Input Output suggests that intelligent programs should
from (1 ) from (2) draw conclusions that succinctly express
A ¬B ¬A null model all the information in the premises. The
¬A B ¬A ¬A B model theory yields an algorithm that draws
such conclusions (Johnson-Laird & Byrne, The ellipsis denotes an implicit model of the
1 991 , Chap. 9). possibilities in which the antecedent of the
Fully explicit models are simpler than conditional is false. In other words, there are
truth tables but place a heavy load on work- alternatives to the possibility in which A and
ing memory. Mental models are still simpler B are true, but individuals tend not to think
because they are limited by the third princi- explicitly about what holds in these possibil-
ple of the theory: ities. If they retain the footnote about what
is false, then they can flesh out these mental
The principle of truth: A mental model rep- models into fully explicit models. The men-
resents a true possibility, and it represents a
tal models of the biconditional, If, and only
clause in the premises only when the clause
if, A then B, as Table 9.2 shows, are identical
is true in the possibility.
to those for the conditional. What differs is
The simplest illustration of the principle is that the footnote now conveys that both A
to ask naive individuals to list what is possi- and B are false in the implicit model. The
ble for a variety of assertions (Barrouillet & program at its higher level uses fully explicit
Lecas, 1 999; Johnson-Laird & Savary, 1 996). models and so makes no errors in reasoning.
Given an exclusive disjunction, not-A or else Inferences can be made with mental mod-
B, they list two possibilities corresponding els using a procedure that builds a set of
to the mental models: models for a premise and then updates them
according to the other premises. From the
¬A premises,
B
A or else B but not both.
The first mental model does not represent Not-A.
B, which is false in this possibility; and the
second mental model does not represent not- the disjunction yields the mental models
A, which is false in this possibility, in other A
words, A is true. Hence, people tend to ne- B
glect these cases. Readers might assume that The categorical premise eliminates the first
the principle of truth is equivalent to the model, but it is compatible with the second
representation of the propositions mentioned model, yielding the valid conclusion, B. The
in the premises. However, this assumption rules for updating mental models are sum-
yields the same models of A and B regardless marized in Table 9.3 .
of the connective relating them. The right The model theory of deduction began
way to conceive the principle is that it yields with an account of reasoning with quanti-
pared-down versions of fully explicit mod- fiers as in syllogisms such as:
els, which in turn map into truth tables. As
we will see, the principle of truth predicts a Some actuaries are businessmen.
striking effect on reasoning. All businessmen are conformists.
Individuals can make a mental footnote Therefore, some actuaries are
about what is false in a possibility, and these conformists.
footnotes can be used to flesh out mental
models into fully explicit models. However, A plausible hypothesis is that people con-
footnotes tend to be ephemeral. The most struct models of the possibilities compati-
recent computer program implementing the ble with the premises and draw whatever
model theory operates at two levels of conclusion, if any, holds in all of them.
expertise. At its lowest level, it makes no use Johnson-Laird (1 975 ) illustrated such an
of footnotes. Its representation of the main account with Euler circles. A premise of
sentential connectives is summarized in Ta- the form, Some A are B, however, is com-
ble 9.2. The mental models of a conditional, patible with four distinct possibilities, and
if A then B, are the previous premises are compatible with
1 6 distinct possibilities. Because the infer-
A B ence is easy, reasoners may fail to consider
· · ·
Table 9.3. The procedures for forming a conjunction of a pair of models. Each procedure is presented
with an accompanying example. Only mental models may be implicit and therefore call for the first
two procedures
1 : The conjunction of a pair of implicit models yields the implicit model:
. . . and . . . yield . . .
2: The conjunction of an implicit model with a model representing propositions yields the null model
(akin to the empty set) by default, for example,
. . . and B C yield nil.
But, if none of the atomic propositions (B C) is represented in the set of models containing the
implicit model, then the conjunction yields the model of the propositions, for example,
. . . and B C yield B C.
3 : The conjunction of a pair of models representing respectively a proposition and its negation yield
the null model, for example,
A ¬B and ¬A yield nil.
4: The conjunction of a pair of models in which a proposition, B, in one model is not represented in
the other model depends on the set of models of which this other model is a member. If B occurs in
at least one of these models, then its absence in the current model is treated as negation, for
example,
A B and A yields nil.
However, if B does not occur in one of these models (e.g., only its negation occurs in them), then its
absence is treated as equivalent to its affirmation, and the conjunction (following the next
procedure) is
A B and A yields A B.
5 : The conjunction of a pair of fully explicit models free from contradiction update the second model
with all the new propositions from the first model, for example,
¬A B and ¬A C yield ¬A B C.
all the possibilities (Erickson, 1 974), or (1 984) described two alternative strategies.
they may construct models that capture Years of tinkering with the models for syl-
more than one possibility (Johnson-Laird & logisms suggest that reasoning does not rely
Bara, 1 984). The program implementing the on a single deterministic procedure. The fol-
model theory accordingly constructs just one lowing principle applies to thinking in gen-
model for the previous premises: eral but can be illustrated for reasoning:
actuary [businessman] conformist The principle of strategic variation: Given
actuary a class of problems, reasoners develop a va-
riety of strategies from exploring manipu-
[businessman] conformist lations of models (Bucciarelli & Johnson-
. . . Laird, 1 999).
where each row represents a different sort of Stenning and his colleagues anticipated this
individual, the ellipsis represents the possi- principle in an alternative theory of syl-
bility of other sorts of individual, and the logistic reasoning (e.g., Stenning & Yule,
square brackets represent that the set of 1 997). They proposed that reasoners focus
businessmen has been represented exhaus- on individuals who necessarily exist given
tively – in other words, no more tokens the premises (e.g., given the premise Some
representing businessmen can be added to A are B, there must be an A who is B).
the model. This model yields the conclusion They implemented this idea in three differ-
that Some actuaries are conformists. There are ent algorithms that all yield the same in-
many ways in which reasoners might use ferences. One algorithm is based on Euler
such models, and Johnson-Laird and Bara circles supplemented with a notation for
necessary individuals, one is based on tokens mises containing more than one quantifier
of individuals in line with the model theory, (Johnson-Laird, Byrne, & Tabossi, 1 989).
and one is based on verbal rules, such as Many such inferences are beyond the scope
If there are two existential premises, that of Euler circles, although the general prin-
is, that contain “some”, then respond that ciples of the model theory still apply to
there is no valid conclusion. them. Consider, for example, the inference
(Cherubini & Johnson-Laird, 2004):
Stenning and Yule concluded from the
equivalence of the outputs from these al- There are four persons: Ann, Bill, Cath,
gorithms that a need exists for data be- and Dave.
yond merely the conclusions that reason- Everybody loves anyone who loves some-
ers draw, and they suggested that reasoners one.
may develop different representational sys- Ann loves Bill.
tems, depending on the task. Indeed, from
What follows?
Störring (1 908) to Stenning (2002), psy-
chologists have argued that some reasoners Most people can envisage this model in
may use Euler circles and others may use which arrows denote the relation of loving:
verbal procedures.
The external models that reasoners con-
structed with cut-out shapes corroborated Ann Bill Cath Dave
the principle of strategic variation: Individ-
uals develop various strategies (Bucciarelli
& Johnson-Laird, 1 999). They also overlook Hence, they infer that everyone loves Ann.
possible models of premises. Their search However, if you ask them whether it follows
may be organized toward finding necessary that Cath loves Dave, they tend to respond
individuals, as Stenning and Yule showed, “no.” They are mistaken, but the inference
but the typical representations of premises calls for using the quantified premise again.
included individuals who were not neces- The result is this model (strictly speaking, all
sary; for example, the typical representation four persons love themselves, too):
of Some A are B was
A B Ann Bill Cath Dave

A B
A
It follows that Cath loves Dave, and people
A focus on necessary individuals is a partic-
grasp its validity if it is demonstrated with
ular strategy. Other strategies may call for
diagrams. No complete model theory exists
the representation of other sorts of individ-
for inferences based on quantifiers and con-
uals, especially if the task changes – a view
nectives (cf. Bara, Bucciarelli, & Lombardo,
consistent with Stenning and Yule’s theory.
2001 ). However, the main principles of the
For example, individuals readily make the
theory should apply: iconicity, possibilities,
following sort of inference (Evans, Handley,
truth, and strategic variation.
Harper, & Johnson-Laird, 1 999):
Some A are B.
Some B are C. Experimental Studies of
Therefore, it is possible that Some A Deductive Reasoning
are C.
Such inferences depend on the representa- Many experiments have corroborated the
tion of possible individuals. model theory (for a bibliography, see the
The model theory has been extended Web page created by Ruth Byrne: www.tcd.
to some sorts of inference based on pre- ie/Psychology/People/Ruth Byrnelmental
models/). This section outlines the corrob- loads working memory. This problem de-
orations of five predictions. feats most people:
Prediction 1 : The fewer the models
needed for an inference, and the simpler they Ann is in Alaska or Beth is in Barbados,
are, the less time the inference should take or both.
and the less prone it should be to error. Fewer Beth is in Barbados or Cath is in Canada,
entities do improve inferences (e.g., Birney & or both.
Halford, 2002). Likewise, fewer models What follows?
improve spatial and temporal reasoning
(Byrne & Johnson-Laird, 1 989; Carreiras & The premises yield five models, from which
Santamarı́a, 1 997; Schaeken, Johnson-Laird, it follows: Ann is in Alaska and Cath is in
& d’Ydewalle, 1 996; Vandierendonck & De Canada, or Beth is in Barbados, or all three.
Vooght, 1 997). Premises yielding one model When the order of the premises reduces the
take less time to read than corresponding number of models to be held in mind, rea-
premises yielding multiple models; how- soning improves (Garcı́a-Madruga, Moreno,
ever, the difference between two and three Carriedo, Gutiérrez, & Johnson-Laird, 2001 ;
models is often so small that it is un- Girotto, Mazzocco, & Tasso, 1 997; Mac-
likely that reasoners construct all three mod- kiewicz & Johnson-Laird, 2003 ).
els (Vandierendonck, De Vooght, Desim- Because one model is easier than many,
pelaere, & Dierckx, 2000). They may build a an interaction occurs in modal reasoning. It is
single model with one element represented easier to infer that a situation is possible (one
as having two or more possible locations. model of the premises suffices as an exam-
Effects of number of models have been ple) than that it is not possible (all the mod-
observed in comparing one sort of sentential els of the premises must be checked for a
connective with another and in examining counterexample to the conclusion). In con-
batteries of such inferences (see Johnson- trast, it is easier to infer that a situation is
Laird & Byrne, 1 991 ). To illustrate these not necessary (one counterexample suffices)
effects, consider the “double disjunction” than that it is necessary (all the models of
(Bauer & Johnson-Laird, 1 993 ): the premises must be checked as examples).
The interaction occurs in both accuracy and
speed (Bell & Johnson-Laird, 1 998; see also
Ann is in Alaska or else Beth is in Barba-
Evans et al., 1 999).
dos, but not both.
Beth is in Barbados or else Cath is in Prediction 2: Reasoners should err as a re-
Canada, but not both. sult of overlooking models of the premises.
What follows? Given a double disjunction (such as the pre-
vious one), the most frequent errors were
conclusions consistent with just a single
Reasoners readily envisage the two possibil-
model of the premises (Bauer & Johnson-
ities compatible with the first premise, but
Laird, 1 993 ). Likewise, given a syllogism of
it is harder to update them with those from
the form,
the second premise. The solution is
None of the A is a B.
All the B are C.
Ann in Alaska Cath in Canada
Beth in Barbados reasoners infer: None of the A is a C (New-
stead & Griggs, 1 999). They overlook the
possibility in which Cs that are not Bs are
People represent the spatial relations: Mod- As, and so the valid conclusion is
els are not made of words. The two models
Some of the C are not A.
yield the conclusion: Either Ann is in Alaska
and Cath is in Canada or else Beth is in Bar- They may have misinterpreted the second
bados. An increase in complexity soon over- premise, taking it also to mean that all
the C are B (Newstead & Griggs, 1 999), 1 995 ), and reasoners’ diagrams have some-
but many errors with syllogisms appear times failed to show their use (e.g., New-
to arise because individuals consider only stead, Handley, & Buck, 1 999). However,
a single model (Bucciarelli & Johnson- when reasoners had to construct exter-
Laird, 1 999; Espino, Santamarı́a, & Garcı́a- nal models (Bucciarelli & Johnson-Laird,
Madruga, 2000). Ormerod proposed a “min- 1 999), they used counterexamples (see
imal completion” hypothesis according to also Neth & Johnson-Laird, 1 999; Roberts,
which reasoners construct only the min- in press).
imally necessary models (see Ormerod, There are two sorts of invalid conclusions.
Manktelow, & Jones, 1 993 ; Richardson & One sort is invalid because the conclusion is
Ormerod, 1 997). Likewise, Sloutsky pos- disjoint with the premises; for example,
tulated a process of “minimalization” in
which reasoners tend to construct only sin- A or B or both.
gle models for all connectives, thereby re- B or else C but not both.
ducing them to conjunctions (Morris & Therefore, not-A and C.
Sloutsky, 2002; Sloutsky & Goldvarg, 1 999).
Certain assertions, however, do tend to The premises have three fully explicit
elicit more than one model. As Byrne models:
and her colleagues showed (e.g., Byrne,
2002; Byrne & McEleney, 2000; Byrne & A ¬B C
Tasso, 1 999), counterfactual conditionals ¬A B ¬C
such as A B ¬C
If the cable hadn’t been faulty then the The conclusion is inconsistent with the
printer wouldn’t have broken premises because it conflicts with each of
their models. But, another sort of invalid
tend to elicit models of both what is factu- conclusion is consistent with the premises
ally the case, that is, but does not follow from them such as the
conclusion A and not-C from the previous
cable faulty printer broken
premises. It is consistent with the premises
and what holds in a counterfactual possibil- because it corresponds to their third model,
ity but it does not follow from them because
the other two models are counterexamples.
¬ cable faulty ¬ printer broken Reasoners usually establish the invalidity of
the first sort of conclusion by detecting its
Prediction 3 : Reasoners should be able to inconsistency with the premises, but they
refute invalid inferences by envisaging coun- refute the second sort of conclusion with a
terexamples (i.e., models of the premises counterexample (Johnson-Laird & Hasson,
that refute the putative conclusion). There 2003 ). An experiment using functional mag-
is no guarantee that reasoners will find a netic resonance imaging showed that reason-
counterexample, but, where they do suc- ing based on numeric quantifiers, such as at
ceed, they know that an inference is in- least five – as opposed to arithmetical cal-
valid (Barwise, 1 993 ). The availability of a culation based on the same premises – de-
counterexample can suppress fallacious in- pended on the right frontal hemisphere. A
ferences from a conditional premise (Byrne, search for counterexamples appeared to ac-
Espino, & Santamarı́a, 1 999; Markovits, tivate the right frontal pole (Kroger, Cohen,
1 984; Vadeboncoeur & Markovits, 1 999). & Johnson-Laird, 2003 ).
Nevertheless, an alternative theory based Prediction 4: Reasoners should succumb
on mental models has downplayed the to illusory inferences, which are compelling
role of counterexamples (Polk & Newell, but invalid. They arise from the principle of
truth and its corollary that reasoners neglect and deontic relations. Table 9.4 illustrates
what is false. Consider the problem: some different illusions. Studies have used
remedial procedures to reduce the illusions
Only one of the following assertions is true (e.g., Santamarı́a & Johnson-Laird, 2000).
about a particular hand of cards: Yang taught participants to think explic-
itly about what is true and what is false.
There is a king in the hand or there is
The difference between illusions and con-
an ace, or both.
trol problems vanished, but performance
There is a queen in the hand or there is
on the control problems fell from almost
an ace, or both.
1 00% correct to around 75 % correct (Yang
There is a jack in the hand or there is a
& Johnson-Laird, 2000). The principle of
ten, or both.
truth limits understanding, but it does so
Is it possible that there is an ace in the without participants realizing it. They were
hand? highly confident in their responses, no less
so when they succumbed to an illusion
Nearly everyone responds, “yes” (Goldvarg than when they responded correctly to a
& Johnson-Laird, 2000). They grasp that control problem.
the first assertion allows two possibilities in The rubric, “one of these assertions is
which an ace occurs, so they infer that an ace true and one of them is false,” is equiva-
is possible. However, it is impossible for an lent to an exclusive disjunction between two
ace to be in the hand because both of the first assertions: A or else B, but not both. This us-
two assertions would then be true, contrary age leads to compelling illusions that seduce
to the rubric that only one of them is true. novices and experts alike, for example,
The inference is an illusion of possibility:
Reasoners infer wrongly that a card is pos- If there is a king then there is an ace, or
sible. A similar problem to which reason- else if there isn’t a king then there is an
ers tend to respond “no” and thereby com- ace.
mit an illusion of impossibility is created by There is a king.
replacing the two occurrences of “there is What follows?
an ace” in the problem with, “there is not
an ace.” When the previous premises were More than 2000 individuals have tackled this
stated with the question problem (see Johnson-Laird & Savary, 1 999),
and nearly everyone responded, “there is an
Is it possible that there is a jack?
ace.” The prediction of an illusion depends
the participants nearly all responded “yes,” not on logic but on how other participants
again. They considered the third assertion, interpreted the relevant connectives in sim-
and its mental models showed that there ple assertions. The preceding illusion occurs
could be a jack. However, this time they with the rubric: One of these assertions is
were correct: The inference is valid. Hence, true and one of them is false applying to the
the focus on truth does not always lead to er- conditionals. That the conclusion is illusory
ror, and experiments have accordingly com- rests on the following assumption, corrobo-
pared illusions with matching control prob- rated experimentally: If a conditional is false,
lems for which the neglect of falsity should then one possibility is that its antecedent
not affect accuracy. is true and its consequent is false. If skep-
The computer program implementing tics think that the illusory responses are
the theory shows that illusory inferences correct, then how do they explain the ef-
should be sparse in the set of all possi- fects of a remedial procedure? They should
ble inferences. However, experiments have then say that the remedy produced illusions.
corroborated their occurrence in reasoning Readers may suspect that the illusions arise
about possibilities, probabilities, and causal from the artificiality of the problems, which
Table 9.4. Some illusory inferences in abbreviated form, with percentages of illusory responses. Each
study examined other sorts of illusions and matched control problems
Percentages of
Premises Illusory responses illusory responses
1 . If A then B or else B. A. B. 1 00
2. Either A and B, or else C and D. A. B. 87
3 . If A then B or else if C then B. A and B. Possibly both are true. 98
4. A or else not both B and C. A and not B. Possibly both are true. 91
5 . One true and one false: not-A or not-B, or neither.
Not-C and not-B. Possibly not-C and not-B. 85
6. Only one is true: At least some A are not B.
No A are B. Possibly No B are A. 95
7. If one is true so is the other: A or else not B. A. A is more likely than B. 95
8. If one is true so is the other: A if and only if B. A. A is equally likely as B. 90
Note: 1 is from Johnson-Laird and Savary (1 999), 2 is from Walsh and Johnson-Laird (2003 ), 3 is from Johnson-
Laird, Legrenzi, Girotto, and Legrenzi (2000), 4 is from Legrenzi, Girotto, and Johnson-Laird (2003 ), 5 is from
Goldvarg and Johnson-Laird (2000), 6 is from Experiment 2, Yang and Johnson-Laird (2000), and 7 and 8 are from
Johnson-Laird and Savary (1 996).
never occur in real life and therefore where A, B, . . . refer to different colored
confuse the participants. The problems may marbles in a box. Some individuals develop
be artificial, although analogs do occur in a strategy based on suppositions. They say,
real life (see Johnson-Laird & Savary, 1 999), for example,
and artificiality fails to explain the cor-
rect responses to the controls or the high Suppose not A. It follows from the first
ratings of confidence in both illusory and premise that not B. It follows from the sec-
control conclusions. ond premise that C. The third premise then
Prediction 5 : Naive individuals should de- implies D. So, yes, the conclusion follows.
velop different reasoning strategies based on
models. When they are tested in the labo- Some individuals construct a chain of con-
ratory, they start with only rough ideas of ditionals leading from one clause in the con-
how to proceed. They can reason, but not clusion to the other – for example: If D then
efficiently. With experience but no feedback C, If C then not B, If not B then not A. Oth-
about accuracy, they spontaneously develop ers develop a strategy in which they enu-
various strategies (Schaeken, De Vooght, merate the different possibilities compatible
Vandierendonck, & d’Ydewalle, 1 999). De- with the premises. For example, they draw
duction itself may be a strategy (Evans, a horizontal line across the page and write
2000), and people may resort to it more down the possibilities for the premises:
in Western cultures than in East Asian cul-
tures (Peng & Nisbett, 1 999). However, A B
deduction itself leads to different strate-
gies (Van der Henst, Yang, & Johnson- C D
Laird, 2002). Consider a problem in which
When individuals are taught to use this
each premise is compound, that is, contains
strategy, as Victoria Bell showed in un-
a connective:
published studies, their reasoning is faster
and more accurate. The nature of the
A if and only if B.
premises and the conclusion can bias rea-
Either B or else C, but not both. soners to adopt a predictable strategy (e.g.,
C if and only if D. conditional premises encourage the use of
Does it follow that if not A then D? suppositions, whereas disjunctive premises
encourage the enumeration of possibilities) The principle of equiprobability: Each

(Van der Henst et al., 2002). mental model is assumed to be equiproba-
Reasoners develop diverse strategies for ble, unless there are reasons to the contrary.
relational reasoning (e.g., Goodwin &
The probability of an event accordingly de-
Johnson-Laird, in press; Roberts, 2000), sup-
pends on the proportion of models in which
positional reasoning (e.g., Byrne & Hand-
it occurs. The theory also allows that mod-
ley, 1 997), and reasoning with quantifiers
els can be tagged with numerals denoting
(e.g., Bucciarelli & Johnson-Laird, 1 999).
probabilities or frequencies of occurrence,
Granted the variety of strategies, there re-
and that simple arithmetical operations
mains a robust effect: Inferences from one
can be carried out on them. Shimojo and
mental model are easier than those from
Ichikawa (1 989) and Falk (1 992) proposed
more than one model (see also Espino,
similar principles for Bayesian reasoning.
Santamarı́a, Meseguer, & Carreiras, 2000).
The present account differs from theirs in
Different strategies could reflect different
that it assigns equiprobability, not to ac-
mental representations (Stenning & Yule,
tual events, but to mental models. And
1 997), but those so far discovered are all
equiprobability applies only by default. An
compatible with models. Individuals who
analogous principle of “indifference” oc-
have mastered logic could make a strategic
curred in classical probability theory, but it
use of formal rules. Given sufficient expe-
is problematic because it applies to events
rience with a class of problems, individuals
(Hacking, 1 975 ).
begin to notice some formal patterns.
Consider a simple problem such as
In the box, there is a green ball or a blue
ball or both.
Probabilistic Reasoning What is the probability that both the
green and the blue ball are there?
Reasoning about probabilities is of two
sorts. In intensional reasoning, individuals The premise elicits the mental models:
use heuristics to infer the probability of green
an event from some sort of index, such as blue
the availability of information. In extensional green blue
reasoning, they infer the probability of an
event from a knowledge of the different ways Naive reasoners follow the equiprobability
in which it might occur. This distinction principle, and infer the answer, “1 /3 .” An ex-
is due to Nobel laureate Daniel Kahneman periment corroborated this and other pre-
and the late Amos Tversky, who together dictions based on the mental models for
pioneered the investigation of heuristics the connectives in Table 9.2 (Johnson-Laird
(Kahneman, Slovic, & Tversky, 1 982; see et al., 1 999).
Kahneman & Frederick, Chap. 1 2). Studies Conditional probabilities are on the bor-
of extensional reasoning focused at first on derline of naive competence. They are dif-
“Bayesian” reasoning in which participants ficult because individuals need to consider
try to infer a conditional probability from the several fully explicit models. Here is a typi-
premises. These studies offered no account cal Bayesian problem:
of the foundations of extensional reasoning.
The model theory filled the gap (Johnson- The patient’s PSA score is high. If he doesn’t
have prostate cancer, the chances of such
Laird, Legrenzi, Girotto, Legrenzi, & Cav-
a value is 1 in 1 000. Is he likely to have
erni, 1 999), and the present section outlines prostate cancer?
its account.
Mental models represent the extensions Many people respond, “yes.” However, they
of assertions (i.e., the possibilities to which are wrong. The model theory predicts the
they refer). The theory postulates error: Individuals represent the conditional
probability in the problem as one explicit The subset of chances of prostate can-
model and one implicit model tagged with cer within the two possibilities of a high
their chances: PSA (rows 1 and 3 ) yields the conditional
probability: P(prostate cancer | high PSA) =
¬ prostate cancer high PSA 1 2/3 . It is high, but far from 999/1 000.
. . . 999 Evolutionary psychologists postulate that
natural selection led to an innate “mod-
The converse conditional probability has ule” in the mind that makes Bayesian in-
the same mental models, and so people as- ferences from naturally occurring frequen-
sume that if the patient has a high PSA cies. It follows that naive reasoners should
the chances are only 1 in 1 000 that he fail the patient problem because it is about
does not have prostate cancer. Because the a unique event (Cosmides & Tooby, 1 996;
patient has a high PSA, then he is highly Gigerenzer & Hoffrage, 1 995 ). In contrast, as
likely to have prostate cancer (999/1 000). the model theory predicts, individuals cope
To reason correctly, individuals must envis- with problems about unique or repeated
age the complete partition of possibilities events provided they can use the subset prin-
and chances. However, the problem fails to ciple and the arithmetic is easy (Girotto &
provide enough information. It yields only: Gonzalez, 2001 ).
The model theory dispels some common
¬ prostate cancer high PSA 1 misconceptions about probabilistic reason-
¬ prostate cancer ¬ high PSA 999 ing. It is not always inductive. Extensional
prostate cancer high PSA ? reasoning can be deductively valid, and it
prostate cancer ¬ high PSA ? need not depend on a tacit knowledge of the
probability calculus. It is not always correct
There are various ways to provide the miss- because it can yield illusions (Table 9.4).
ing information. One way is to give the
base rate of prostate cancer, which can be
used with Bayes’s theorem from the prob- Induction and Models
ability calculus to infer the answer. How-
ever, the theorem and its computations Induction is part of everyday thinking (see
are beyond naive individuals (Kahneman & Sloman & Lagnado, Chap. 5 ). Popper (1 972)
Tversky, 1 973 ; Phillips & Edwards, 1 966). argued, however, that it is not part of sci-
The model theory postulates an alternative: entific thinking. He claimed that science is
based on explanatory conjectures, which ob-
The subset principle: Given a complete
partition, individuals infer the conditional servations serve only to falsify. Some sci-
probability, P(A | B), by examining the sub- entists agree (e.g., Deutsch, 1 997, p. 1 5 9).
set of B that is A and computing its propor- However, many astronomical, meteorologi-
tion (Johnson-Laird et al., 1 999). cal, and medical observations are not tests
of hypotheses. Everyone makes inductions
If models are tagged with their absolute fre- in daily life. For instance, when the starter
quencies or chances, then the conditional will not turn over the engine, your immedi-
probability equals their value for the model ate thought is that the battery is dead. You
of A and B divided by their sum for all the are likely to be right, but there is no guar-
models containing B. A complete partition antee. Likewise, when the car ferry, Herald
for the patient problem might be of Free Enterprise, sailed from Zeebrugge on
March 6, 1 987, its master made the plausi-
¬ prostate cancer high PSA 1 ble induction that the bow doors had been
¬ prostate cancer ¬ high PSA 999 closed. They had always been closed in the
prostate cancer high PSA 2 past, and there was no evidence to the con-
prostate cancer ¬ high PSA 0 trary. However, they had not been closed,
the vessel capsized and sank, and many peo- in induction. Some inductions are implicit:
ple drowned. Induction is a common but They are rapid, involuntary, and unconscious
risky business. (see Litman & Reber, Chap. 1 8). Other in-
The textbook definition of induction – ductions are explicit: They are slow, volun-
alas, all too common – is that it leads from tary, and conscious. This distinction is fa-
the particular to the general. Such argu- miliar (e.g., Evans & Over, 1 996; Johnson-
ments are indeed inductions, but many in- Laird & Wason, 1 977, p. 3 41 ; Sloman, 1 996;
ductions such as the preceding examples Stanovich, 1 999). The next part considers
are inferences from the particular to the implicit inductions, and the part thereafter
particular. That is why the “Introduction” considers explicit inductions and the resolu-
offered a more comprehensive definition: tion of inconsistencies.
Induction is a process that increases semantic
information. As an example, consider again
the inference: Implicit Induction and the Modulation
The starter won’t turn. of Models
Therefore, the battery is dead.
Semantics is central to models, and the con-
Like all inductions, it depends on knowledge tent of assertions and general knowledge can
and, in particular, on the true conditional: modulate models. Psychologists have pro-
If the battery is dead, then the starter posed many theories about the mental rep-
won’t turn. resentation of knowledge, but knowledge is
about what is possible, and so the model the-
It is consistent with the possibilities: ory postulates that it is represented in fully
explicit models (Johnson-Laird & Byrne,
battery dead ¬ starter turn
2002). These models, in turn, modulate the
¬ battery dead ¬ starter turn
mental models of assertions according to
¬ battery dead starter turn
The principle of modulation: The meanings
The premise of the induction eliminates the of clauses, coreferential links between them,
third possibility, but the conclusion goes be- general knowledge, and knowledge of con-
yond the information given because it elim- text, can modulate the models of an asser-
inates the second of them. The availability tion. In the case of inconsistency, meaning
and knowledge normally take precedence
of the first model yields an intensional infer-
over the models of assertions.
ence of a high probability, but its conclusion
rejects a real possibility. Hence, it may be Modulation can add information to mental
false. Inductions are vulnerable because they models, prevent their construction, and flesh
increase semantic information. them out into fully explicit models. As an il-
Inductions depend on knowledge. As lustration of semantic modulation, consider
Kahneman and Tversky (1 982) showed, var- the following conditional:
ious heuristics constrain the use of knowl-
If it’s a game, then it’s not soccer.
edge in inductions. The availability heuris-
tic, illustrated in the previous example, re- Its fully explicit models (Table 9.2), if they
lies on whatever relevant knowledge is avail- were unconstrained by coreference and se-
able (e.g., Tversky & Kahneman, 1 973 ). The mantics, would be
representativeness heuristic yields inferences
game ¬ soccer
dependent on the representative nature of
¬ game ¬ soccer
the evidence (e.g., Kahneman & Frederick,
¬ game soccer
2002; also see Kahneman & Frederick, Chap.
1 2). The present account presupposes these The meaning of the noun soccer entails that
heuristics but examines the role of models it is a game, and so an attempt to construct
the third model fails because it would yield This possibility and the model of the
an inconsistency. The conditional has only premises are used to construct a counterfac-
the first two models. tual conditional:
The pragmatic effects of knowledge have
If it had not been the case that match wet
been modeled in a computer program,
and given match struck, then it might have
which can be illustrated using the example been the case that match lights.
If the match is struck properly, then it Modulation is rapid and automatic, and
lights. it affects comprehension and reasoning
The match is soaking wet and it is struck (Johnson-Laird & Byrne, 2002; Newstead,
properly. Ellis, Evans, & Dennis, 1 997; Ormerod &
What happens? Johnson-Laird, in press). In logic, connec-
tives such as conditionals and disjunctions
In logic, it follows that the match lights, but
are truth functional, and so the truth value
neither people nor the program draws this
of a sentence in which they occur can be
conclusion. Knowledge that wet matches
determined solely from a knowledge of the
do not light overrides the model of the
truth values of the clauses they interconnect.
premises. The program constructs the men-
However, in natural language, connectives
tal model of the premises:
are not truth functional: It is always nec-
essary to check whether their content and
match wet match match lights
context modulate their interpretation.
struck [the model of
the premises]
If a match is soaking wet, it does not light, Explicit Induction, Abduction, and the
and the program has a knowledge base con- Creation of Explanations
taining this information in fully explicit
models: Induction is the use of knowledge to increase
semantic information: Possibilities are elim-
match wet ¬ match lights inated either by adding elements to a mental
¬ match wet ¬ match lights model or by eliminating a mental model al-
¬ match wet match lights together. After you have stood in line to no
avail at a bar in Italy, you are likely to make
The second premise states that the match is
an explicit induction:
wet, which triggers the matching possibility
in the preceding models: In Italian bars with cashiers, you pay the
cashier first and then take your receipt to
match wet ¬ match lights the bar to make your order.
This induction is a general description. You
The conjunction of this model with the may also formulate an explanation:
model of the premises would yield a contra-
diction, but the program follows the princi- The barmen are too busy to make change,
ple of modulation and gives precedence to and so it is more efficient for customers to
knowledge yielding the following model: pay a cashier.
Scientific laws are general descriptions of
match wet match struck ¬ match lights phenomena (e.g., Kepler’s third law de-
scribes the elliptical orbits of the planets).
and so the match does not light. The model Scientific theories explain these regularities
of the premises also triggers another possi- in terms of more fundamental considerations
bility from the knowledge base: (e.g., the general theory of relativity explains
planetary orbits as the result of the sun’s
¬ match wet match lights mass curving space-time). Peirce (1 903 )
called thinking that leads to explanations ab- by having her ears pierced and was spend-
duction. In terms of the five categories of the ing the money on a TV set, and so on. Only
“Introduction,” abduction is creative when it rarely were the participants stumped for an
leads to the revision of beliefs. explanation. They were almost as equally
Consider the following problem: ingenious with the sentences that were not
coreferential.
If a pilot falls from a plane without a Abduction depends on knowledge, es-
parachute, the pilot dies. This pilot did not
pecially of causal relations, which accord-
die, however. Why not?
ing to the model theory refer to tempo-
Most people respond, for example, that rally ordered sets of possibilities (Goldvarg &
Johnson-Laird, 2001 ; see Cheng & Buehner,
The plane was on the ground. Chapter 5 .). An assertion of the form C
The pilot fell into a deep snow drift. causes E is compatible with three fully ex-
plicit possibilities:
Only a minority draws the logically valid
conclusion: C E
The pilot did not fall from the plane without ¬C E
a parachute. ¬C ¬E
Hence, people prefer a causal explanation with the temporal constraint that E cannot
repudiating the first premise to a valid de- precede C. An “enabling” assertion of the
duction, albeit they may presuppose that form C allows E is compatible with the three
the antecedent of the conditional is true. possibilities:
Granted that knowledge usually takes prece-
C E
dence over contradictory assertions, the ex-
planatory mechanism should dominate the C ¬E
ability to make deductions. ¬C ¬E
In daily life, the propensity to explain is This account, unlike others, accordingly dis-
extraordinary, as Tony Anderson and this tinguishes between the meaning and logical
author discovered when they asked partic- consequences of causes and enabling condi-
ipants to explain the inexplicable. The par- tions (pace, e.g., Einhorn & Hogarth, 1 978;
ticipants received pairs of sentences selected Hart & Honoré, 1 985 ; Mill, 1 874). It also
at random from separate stories: treats causal relations as determinate rather
John made his way to a shop that sold TV than probabilistic (pace, e.g., Cheng, 1 997;
sets. Suppes, 1 970). Experiments support both
Celia had recently had her ears pierced. these claims: Participants listed the previous
possibilities, and they rejected other cases
In another condition, the sentences were as impossible, contrary to probabilistic ac-
modified to make them coreferential: counts (Goldvarg & Johnson-Laird, 2001 ).
Of course, when individuals induce a causal
Celia made her way to a shop that sold TV
sets.
relation from a series of observations, they
She had recently had her ears pierced. are influenced by relative frequencies. How-
ever, on the present account, the mean-
The participants’ task was to explain what ing of any causal relation that they induce
was going on. They readily went beyond is deterministic.
the given information to account for what Given the cause from a causal relation,
was happening. They proposed, for example, there is only one possible effect, as the pre-
that Celia was getting reception in her ear- vious models show; however, given the ef-
rings and wanted the TV shop to investigate, fect, there is more than one possible cause.
that she wanted to see some new earrings on Exceptions do occur (Cummins, Lubart,
closed circuit TV, that she had won a bet Alksnis, & Rist, 1 991 ; Markovits, 1 984),
but the principle holds in general. It may “conjunction” fallacy in which a conjunc-
explain why inferences from causes to ef- tion is in error judged to be more probable
fects are more plausible than inferences from than its constituents (Tversky & Kahneman,
effects to causes. As Tversky and Kahneman 1 983 ).
(1 982) showed, conditionals in which the Abductions that resolve inconsistencies
antecedent is a cause such as have been implemented in a computer pro-
gram that uses a knowledge base to create
A girl has blue eyes if her mother has blue causal explanations. Given the preceding ex-
eyes. ample, the program constructs the mental
are judged as more probable than condition- models of the conditional:
als in which the antecedent is an effect: trigger pulled pistol fires
. . .
The mother has blue eyes if her daughter
has blue eyes. The conjunction of the categorical assertion
yields
According to the model theory, when in-
dividuals discover inconsistencies, they try to trigger pistol fires [the model of
construct a model of a cause and effect that pulled the premises]
resolves the inconsistency. It makes possible
the facts of the matter, and the belief that That the pistol did not fire is inconsistent
the causal assertion repudiates is taken to be with this model. The theory predicts that
a counterfactual possibility (in a comparable individuals should tend to abandon their be-
way to the modulation of models by knowl- lief in the conditional premise because its
edge). Consider, for example, the scenario: one explicit mental model conflicts with the
fact that the pistol did not fire (see Girotto,
If the trigger is pulled then the pistol will fire. Johnson-Laird, Legrenzi, & Sonino, 2000,
The trigger is pulled, but the pistol does not for corroborating evidence). Nevertheless,
fire. Why not? the conditional expresses a useful idealiza-
tion, and so the program treats it as the basis
Given 20 different scenarios of this form
for a counterfactual set of possibilities:
(in an unpublished study carried out by
Girotto, Legrenzi, & Johnson-Laird), most trigger ¬pistol fires [the model of
explanations were causal claims that repu- pulled the facts]
diated the conditional. In two further ex- trigger pistol fires [the models of
periments with the scenarios, the partici- pulled counterfactual
pants rated the statements of a cause and possibilities]
its effect as the most probable explanations;
. . .
for example,
People know that a pistol without bullets
A prudent person had unloaded the pistol does not fire, and so the program has in its
and there were no bullets in the chamber.
knowledge base the models:
The cause alone was rated as less probable,
but as more probable than the effect alone, ¬ bullets in pistol ¬ pistol fires
which in turn was rated as more probable bullets in pistol ¬ pistol fires
than an explanation that repudiated the cat- bullets in pistol pistol fires
egorical premise; for example,
The model of the facts triggers the first
The trigger wasn’t really pulled. possibility in this set, which modulates the
model of the facts to create a possibility:
The greater probability assigned to the con-
junction of the cause and effect than to ¬ bullets in trigger ¬ pistol fires
either of its clauses is an instance of the pistol pulled
The new proposition in this model triggers a there are distinguishable variants of the the-
causal antecedent from another set of mod- ory itself (see, e.g., Evans, 1 993 ; Ormerod,
els in the knowledge base, which explains Manktelow, & Jones, 1 993 ; Polk & Newell,
the inconsistency: A person emptied the pis- 1 995 ). The most urgent demands for the
tol and so it had no bullets. The counterfac- twenty-first century are the extension of the
tual possibilities yield the claim: If the per- theory to problem solving, decision making,
son had not emptied the pistol, then it would and strategic thinking when individuals com-
have had bullets, and . . . it would have fired. pete or cooperate.
The fact that the pistol did not fire has been
used to reject the conditional premise, and
available knowledge has been used to create Acknowledgments
an explanation and to modulate the condi-
tional premise into a counterfactual. There This chapter was made possible by a
are, of course, other possible explanations. grant from the National Science Foundation
In sum, reasoners can resolve inconsisten- (Grant BCS 0076287) to study strategies in
cies between incontrovertible evidence and reasoning. The author is grateful to the edi-
the consequences of their beliefs. They use tor, the community of reasoning researchers,
their available knowledge – in the form of and his colleagues, collaborators, and stu-
explicit models – to try to create a causal dents – many of their names are found in
scenario that makes sense of the facts. Their the “References” section.
reasoning may resolve the inconsistency, cre-
ate an erroneous account, or fail to yield any
explanation whatsoever. References
Bar-Hillel, Y., & Carnap, R. (1 964). An outline

Conclusions and Further Directions of a theory of semantic information. In Y. Bar-
Hillel (Ed.), Language and information process-
Mental models have a past in the nineteenth ing. Reading, MA: Addison-Wesley.
century. The present theory was developed Bara, B. G., Bucciarelli, M., & Lombardo, V.
in the twentieth century. In its application to (2001 ). Model theory of deduction: A unified
deduction, as Peirce anticipated, if a conclu- computational approach. Cognitive Science, 25 ,
sion holds in all the models of the premises, 83 9–901 .
it is necessary given the premises. If it holds Barrouillet, P., & Lecas, J-F. (1 999). Mental mod-
in a proportion of the models, then, granted els in conditional reasoning and working mem-
ory. Thinking and Reasoning, 5 , 289–3 02.
that they are equiprobable, its probability
is equal to that proportion. If it holds in Barsalou, L. W. (1 999). Perceptual symbol sys-
tems. Behavioral and Brain Sciences, 2 2 , 5 77–
at least one model, then it is possible. The
660.
theory also applies to inductive reasoning –
Barwise, J. (1 993 ). Everyday reasoning and logi-
both the rapid implicit inferences that un-
cal inference. Behavioral and Brain Sciences, 1 6,
derlie comprehension and the deliberate in- 3 3 7–3 3 8.
ferences yielding generalizations. It offers an
Bauer, M. I., & Johnson-Laird, P. N. (1 993 ). How
account of the creation of causal explana- diagrams can improve reasoning. Psychological
tions. However, if Craik was right, mental Science, 4, 3 72–3 78.
models underlie all thinking with a proposi- Bell, V., & Johnson-Laird, P. N. (1 998). A model
tional content, and so the present theory is theory of modal reasoning. Cognitive Science,
radically incomplete. 2 2 , 25 –5 1 .
What of the future of mental models? The Birney, D., & Halford, G. S. (2002). Cognitive
theory is under intensive development and complexity of suppositional reasoning: An ap-
intensive scrutiny. It has been corroborated plication of relational complexity to the knight-
in many experiments, and it is empirically knave task. Thinking and Reasoning, 8, 1 09–
distinguishable from other theories. Indeed, 1 3 4.
Braine, M. D. S. (1 978). On the relation be- Cummins, D. D., Lubart, T., Alksnis, O., & Rist,
tween the natural logic of reasoning and stan- R. (1 991 ). Conditional reasoning and causa-
dard logic. Psychological Review, 85 , 1 –21 . tion. Memory and Cognition, 1 9, 274–282.
Bransford, J. D., Barclay, J. R., & Franks, J. J. de Kleer, J. (1 977). Multiple representations of
(1 972). Sentence memory: A constructive ver- knowledge in a mechanics problem-solver. In-
sus an interpretive approach. Cognitive Psychol- ternational Joint Conference on Artificial Intelli-
ogy, 3 , 1 93 –209. gence, 299–3 04.
Bucciarelli, M., & Johnson-Laird, P. N. (1 999). Deutsch, D. (1 997). The fabric of reality: The sci-
Strategies in syllogistic reasoning. Cognitive Sci- ence of parallel universes – and its implications.
ence, 2 3 , 247–3 03 . New York: Penguin Books.
Bucciarelli, M., & Johnson-Laird, P. N. (in press). Ehrlich, K. (1 996). Applied mental models in
Naı̈ve deontics: A theory of meaning, represen- human–computer interaction. In J. Oakhill &
tation, and reasoning. Cognitive Psychology. A. Garnham (Eds.), Mental models in cognitive
Byrne, R. M. J. (2002). Mental models and coun- science. Mahwah, NJ: Erlbaum.
terfactual thoughts about what might have Einhorn, H. J., & Hogarth, R. M. (1 978). Confi-
been. Trends in Cognitive Sciences, 6, 426– dence in judgment: Persistence of the illusion
43 1 . of validity. Psychological Review, 85 , 3 95 –41 6.
Byrne, R. M. J., Espino, O., & Santamarı́a, C. Erickson, J. R. (1 974). A set analysis theory of
(1 999). Counterexamples and the suppression behaviour in formal syllogistic reasoning tasks.
of inferences. Journal of Memory and Language, In R. Solso (Ed.), Loyola symposium on cognition
40, 3 47–3 73 . (Vol. 2). Hillsdale, NJ: Erlbaum.
Byrne, R. M. J., & Handley, S. J. (1 997). Reasoning Espino, O., Santamarı́a, C., & Garcı́a-Madruga,
strategies for suppositional deductions. Cogni- J. A. (2000). Activation of end terms in syllo-
tion, 62 , 1 –49. gistic reasoning. Thinking and Reasoning, 6, 67–
Byrne, R. M. J., & Johnson-Laird, P. N. (1 989). 89.
Spatial reasoning. Journal of Memory and Lan- Espino, O., Santamarı́a, C., Meseguer, E., & Car-
guage, 2 8, 5 64–5 75 . reiras, M. (2000). Eye movements during syl-
Byrne, R. M. J., & McEleney, A. (2000). Coun- logistic reasoning. In J. A. Garcı́a-Madruga,
terfactual thinking about actions and fail- N. Carriedo, & M. J. González-Labra (Eds.),
ures to act. Journal of Experimental Psychology: Mental models in reasoning (pp. 1 79–1 88).
Learning, Memory, and Cognition, 2 6, 1 3 1 8– Madrid: Universidad Nacional de Educación a
1331. Distancia.
Byrne, R. M. J., & Tasso, A. (1 999). Deductive Evans, J. St. B. T. (1 993 ). The mental model the-
reasoning with factual, possible, and counter- ory of conditional reasoning: Critical appraisal
factual conditionals. Memory and Cognition, 2 7, and revision. Cognition, 48, 1 –20.
726–740. Evans, J. St. B. T. (2000). What could and
Carreiras, M., & Santamarı́a, C. (1 997). Rea- could not be a strategy in reasoning. In W. S.
soning about relations: Spatial and nonspatial Schaeken, G. De Vooght, A. Vandierendonck,
problems. Thinking and Reasoning, 3 , 1 91 –208. & G. d’Ydewalle (Eds.), Deductive reason-
Cheng, P. W. (1 997). From covariation to ing and strategies. (pp. 1 –22) Mahwah, NJ:
causation: A causal power theory. Psychological Erlbaum.
Review, 1 04, 3 67–405 . Evans, J. St. B. T., Handley, S. J., Harper, C. N. J.,
Cherubini, P., & Johnson-Laird, P. N. (2004). & Johnson-Laird, P. N. (1 999). Reasoning about
Does everyone love everyone? The psychology necessity and possibility: A test of the mental
of iterative reasoning. Thinking and Reasoning, model theory of deduction. Journal of Experi-
1 0, 3 1 –5 3 . mental Psychology: Learning, Memory, and Cog-
Cosmides, L., & Tooby, J. (1 996). Are humans nition, 2 5 , 1 495 –1 5 1 3 .
good intuitive statisticians after all? Rethink- Evans, J. St. B. T., & Over, D. E. (1 996). Rationality
ing some conclusions from the literature on and reasoning. Hove, East Sussex: Psychology
judgment under uncertainty. Cognition, 5 8, Press.
1 –73 . Falk, R. (1 992). A closer look at the probabilities
Craik, K. (1 943 ). The nature of explanation. of the notorious three prisoners. Cognition, 43 ,
Cambridge: Cambridge University Press. 1 97–223 .
Forbus, K. (1 985 ). Qualitative process theory. shop on Advances in Geographic Information Sys-
In D. G. Bobrow (Ed.), Qualitative reasoning tems (pp. 1 1 2–1 1 7). Arlington, VA: Association
about physical systems. Cambridge, MA: MIT for Computing Machinery.
Press. Glenberg, A. M., Meyer, M., & Lindem, K.
Garcı́a-Madruga, J. A., Moreno, S., Carriedo, N., (1 987). Mental models contribute to fore-
Gutiérrez, F., & Johnson-Laird, P. N. (2001 ). grounding during text comprehension. Journal
Are conjunctive inferences easier than disjunc- of Memory and Language, 2 6, 69–83 .
tive inferences? A comparison of rules and Goldvarg, Y., & Johnson-Laird, P. N. (2000). Illu-
models. Quarterly Journal of Experimental Psy- sions in modal reasoning. Memory and Cogni-
chology, 5 4A, 61 3 –63 2. tion, 2 8, 282–294.
Garnham, A. (1 987). Mental models as representa- Goldvarg, Y., & Johnson-Laird, P. N. (2001 ).
tions of discourse and text. Chichester, UK: Ellis Naı̈ve causality: A mental model theory of
Horwood. causal meaning and reasoning. Cognitive Sci-
Garnham, A. (2001 ). Mental models and the inter- ence, 2 5 , 5 65 –61 0.
pretation of anaphora. Hove, UK: Psychology Goodwin, G., & Johnson-Laird, P. N. (in press).
Press. Reasoning about the relations between rela-
Garnham, A., & Oakhill, J. V. (1 996). The mental tions.
models theory of language comprehension. In Hacking, I. (1 975 ). The emergence of probabil-
B. K. Britton & A. C. Graesser (Eds.), Models ity. Cambridge, UK: Cambridge University
of understanding text (pp. 3 1 3 –3 3 9). Hillsdale, Press.
NJ: Erlbaum. Halford, G. S. (1 993 ). Children’s understanding:
Gentner, D., & Gentner, D. R. (1 983 ). Flow- The development of mental models. Hillsdale, NJ:
ing waters or teeming crowds: Mental mod- Erlbaum.
els of electricity. In D. Gentner & A. L. Hart, H. L. A., & Honoré, A. M. (1 985 ). Causa-
Stevens (Eds.), Mental models. Hillsdale, tion in the law (2nd ed.). Oxford, UK: Claren-
NJ: Erlbaum. don Press. (First edition published in 1 95 9.)
Gernsbacher, M. A. (1 990). Language compre- Hayes, P. J. (1 979). Naive physics I – Ontology
hension as structure building. Hillsdale, NJ: for liquids. Mimeo, Centre pour les études Se-
Erlbaum. mantiques et Cognitives, Geneva. (Reprinted
Gigerenzer, G., & Hoffrage, U. (1 995 ). How in Hobbs, J., & Moore, R. (Eds.). (1 985 ). For-
to improve Bayesian reasoning without mal theories of the commonsense world. Hillsdale,
instruction: Frequency format. Psychological NJ: Erlbaum.)
Review, 1 02 , 684–704. Hegarty, M. (1 992). Mental animation: Inferring
Girotto, V., & Gonzalez, M. (2001 ). Solving prob- motion from static diagrams of mechanical sys-
abilistic and statistical problems: A matter of tems. Journal of Experimental Psychology: Learn-
question form and information structure. Cog- ing, Memory, and Cognition, 1 8, 1 084–1 1 02.
nition, 78, 247–276. Holland, J. H. (1 998). Emergence: From chaos to
Girotto, V., Johnson-Laird, P. N., Legrenzi, P., & order. Reading, MA: Perseus Books.
Sonino, M. (2000). Reasoning to consistency: Holland, J. H., Holyoak, K. J., Nisbett, R. E., &
How people resolve logical inconsistencies. Thagard, P. R. (1 986). Induction: Processes of
In J. A. Garcı́a-Madruga, N. Carriedo, & inference, learning, and discovery. Cambridge,
M. González-Labra (Eds.), Mental models in MA: MIT Press.
reasoning (pp. 83 –97). Madrid: Universidad Johnson-Laird, P. N. (1 970). The perception
Naciónal de Educacion a Distanzia. and memory of sentences. In J. Lyons (Ed.),
Girotto, V., Legrenzi, P., & Johnson-Laird, P. N. New horizons in linguistics (pp. 261 –270). Har-
(Unpublished studies). mondsworth: Penguin Books.
Girotto, V., Mazzocco, A., & Tasso. A. (1 997). Johnson-Laird, P. N. (1 975 ). Models of de-
The effect of premise order in conditional duction. In R. Falmagne (Ed.), Reasoning:
reasoning: A test of the mental model theory. Representation and process. Springdale, NJ:
Cognition, 63 , 1 –28. Erlbaum.
Glasgow, J. I. (1 993 ). Representation of spatial Johnson-Laird, P. N. (1 983 ). Mental models: To-
models for geographic information systems. In wards a cognitive science of language, infer-
N. Pissinou (Ed.), Proceedings of the ACM Work- ence and consciousness. Cambridge: Cambridge
University Press; Cambridge, MA: Harvard Kahneman, D., Slovic, P., & Tversky, A. (Eds.).
University Press. (1 982). Judgment under uncertainty: Heuristics
Johnson-Laird, P. N. (1 993 ). Human and machine and biases. Cambridge, UK: Cambridge Uni-
thinking. Hillsdale, NJ: Erlbaum. versity Press.
Johnson-Laird, P. N. (2002). Peirce, logic dia- Kahneman, D., & Tversky, A. (1 973 ). On the psy-
grams, and the elementary operations of rea- chology of prediction. Psychological Review, 80,
soning. Thinking and Reasoning, 8, 69–95 . 23 7–25 1 .
Johnson-Laird, P. N. (in press). The history of Kamp, H. (1 981 ). A theory of truth and se-
mental models. In K. Manktelow (Ed.), Psy- mantic representation. In J. A. G. Groe-
chology of reasoning: Theoretical and historical nendijk, T. M. V. Janssen, & M. B. J. Stokhof
perspectives. London: Psychology Press. (Eds.), Formal methods in the study of lan-
Johnson-Laird, P. N., & Bara, B. G. (1 984). Syllo- guage (pp. 277–3 22). Amsterdam: Mathemati-
gistic inference. Cognition, 1 6, 1 –61 . cal Centre Tracts.
Johnson-Laird, P. N., & Byrne, R. M. J. (1 991 ). Karttunen, L. (1 976). Discourse referents. In J. D.
Deduction. Hillsdale, NJ: Erlbaum. McCawley (Ed.), Syntax and semantics, vol.
7: Notes from the linguistic underground. New
Johnson-Laird, P. N., & Byrne, R. M. J. (2002).
York: Academic Press.
Conditionals: A theory of meaning, pragmatics,
and inference. Psychological Review, 1 09, 646– Knauff, M., Fangmeir, T., Ruff, C. C., &
678. Johnson-Laird, P. N. (2003 ). Reasoning, mod-
els, and images: Behavioral measures and cor-
Johnson-Laird, P. N., Byrne, R. M. J., & Tabossi, P.
tical activity. Journal of Cognitive Neuroscience,
(1 989). Reasoning by model: The case of mul-
4, 5 5 9–5 73 .
tiple quantification. Psychological Review, 96,
65 8–673 . Knauff, M., & Johnson-Laird, P. N. (2002). Im-
agery can impede inference. Memory and Cog-
Johnson-Laird, P. N., & Hasson, U. (2003 ). Coun-
nition, 3 0, 3 63 –3 71 .
terexamples in sentential reasoning. Memory
and Cognition, 3 1 , 1 1 05 –1 1 1 3 . Köhler, W. (1 93 8). The place of value in a world of
facts. New York: Liveright.
Johnson-Laird, P. N., Legrenzi, P., Girotto, P., &
Legrenzi, M. S. (2000). Illusions in reasoning Kroger, J. K., Cohen, J. D., & Johnson-Laird, P. N.
about consistency. Science, 2 88, 5 3 1 –5 3 2. (2003 ). A double dissociation between logic
and mathematics. Unpublished MS.
Johnson-Laird, P. N., Legrenzi, P., Girotto, V.,
Legrenzi, M., & Caverni, J.-P. (1 999). Naive Kuipers, B. (1 994). Qualitative reasoning: Model-
probability: A mental model theory of exten- ing and simulation with incomplete knowledge.
sional reasoning. Psychological Review, 106, 62– Cambridge, MA: MIT Press.
88. Legrenzi, P., Girotto, V., & Johnson-Laird, P. N.
Johnson-Laird, P. N., & Savary, F. (1 996). Illusory (2003 ). Models of consistency. Psychological
inferences about probabilities. Acta Psycholog- Science, 1 4, 1 3 1 –1 3 7.
ica, 93 , 69–90. Mackiewicz, R., & Johnson-Laird, P. N. (in press).
Johnson-Laird, P. N., & Savary, F. (1 999). Illusory Deduction, models, and order of premises.
inferences: A novel class of erroneous deduc- Markovits, H. (1 984). Awareness of the “possible”
tions. Cognition, 71 , 1 91 –229. as a mediator of formal thinking in conditional
Johnson-Laird, P. N., & Stevenson, R. (1 970). reasoning problems. British Journal of Psychol-
Memory for syntax. Nature, 2 2 7, 41 2. ogy, 75 , 3 67–3 76.
Johnson-Laird, P. N., & Wason, P. C. (Eds.). Marr, D. (1 982). Vision. San Francisco: Freeman.
(1 977). Thinking. Cambridge, UK: Cambridge Maxwell, J. C. (1 91 1 ). Diagram. The Encyclopae-
University Press. dia Britannica, Vol. XVIII. New York: Ency-
Kahneman, D., & Frederick, S. (2002). Rep- lopaedia Britannica Co.
resentativeness revisited: Attribute substitu- McCloskey, M., Caramazza, A., & Green, B.
tion in intuitive judgment. In T. Gilovich, (1 980). Curvilinear motion in the absence of
D. Griffin, & D. Kahneman (Eds.), Heuris- external forces: Naı̈ve beliefs about the mo-
tics of intuitive judgment: Extensions and ap- tions of objects. Science, 2 1 0, 1 1 3 9–1 1 41 .
plications. New York: Cambridge University Metzler, J., & Shepard, R. N. (1 982). Transfor-
Press. mational studies of the internal representations
of three-dimensional objects. In R. N. Shep- Peirce, C. S. (1 903 ). Abduction and induction. In

ard & L. A. Cooper (Eds.), Mental images J. Buchler (Ed.), Philosophical writings of Peirce.
and their transformations (pp. 25 –71 ). Cam- New York: Dover, 1 95 5 .
bridge, MA: MIT Press. (Originally published Peirce, C. S. (1 93 1 –1 95 8). Collected Papers of
in Solso, R. L. (Ed.). (1 974). Theories in cog- Charles Sanders Peirce. 8 vols. Hartshorne, C.,
nitive psychology: The Loyola Symposium. Hills- Weiss, P., & Burks, A. (Eds.) Cambridge, MA:
dale, NJ: Erlbaum). Harvard University Press.
Mill, J. S. (1 874). A system of logic, ratiocinative Peng, K., & Nisbett, R. E. (1 999). Culture, dialec-
and inductive: Being a connected view of the prin- tics, and reasoning about contradiction. Amer-
ciples of evidence and the methods of scientific ev- ican Psychologist, 5 4, 741 –75 4.
idence (8th ed.). New York: Harper. (First edi- Phillips, L., & Edwards, W. (1 966). Conservatism
tion published 1 843 .) in a simple probability inference task. Journal
Moray, N. (1 990). A lattice theory approach to of Experimental Psychology, 72 , 3 46–3 5 4.
the structure of mental models. Philosophical Polk, T. A., & Newell, A. (1 995 ). Deduction
Transactions of the Royal Society of London B, as verbal reasoning. Psychological Review, 1 02 ,
3 2 7, 5 77–5 83 . 5 3 3 –5 66.
Moray, N. (1 999). Mental models in theory and Popper, K. R. (1 972). Objective knowledge. Ox-
practice. In D. Gopher & A. Koriat (Eds.), At- ford, UK: Clarendon.
tention & performance XVII: Cognitive regula-
Richardson, J., & Ormerod, T. C. (1 997). Re-
tion of performance: Interaction of theory and ap-
phrasing between disjunctives and condition-
plication (pp. 223 –25 8). Cambridge, MA: MIT
als: Mental models and the effects of thematic
Press.
content. Quarterly Journal of Experimental Psy-
Morris, B. J., & Sloutsky, V. (2002). Children’s chology, 5 0A, 3 5 8–3 85 .
solutions of logical versus empirical problems: Roberts, M. J. (2000). Strategies in relational in-
What’s missing and what develops? Cognitive ference. Thinking and Reasoning, 6, 1 –26.
Development, 1 6, 907–928.
Roberts, M. J. (in press). Falsification and men-
Neth, H., & Johnson-Laird, P. N. (1 999). The tal models: It depends on the task. In W.
search for counterexamples in human reason- Schaeken, A. Vandierendonck, W. Schroyens,
ing. Proceedings of the twenty first annual confer- & G. d’Ydewalle (Eds.), The mental models
ence of the Cognitive Science Society, 806. theory of reasoning: Refinement and extensions.
Newstead, S. E., Ellis, M. C., Evans, J. St. B. T., & Mahwah, NJ: Erlbaum.
Dennis, I. (1 997). Conditional reasoning with Rouse, W. B., & Hunt, R. M. (1 984). Human
realistic material. Thinking and Reasoning, 3 , problem solving in fault diagnosis tasks. In W. B.
49–76. Rouse (Ed.), Advances in man–machine systems
Newstead, S. E., & Griggs, R. A. (1 999). research. Greenwich, CT: JAI Press.
Premise misinterpretation and syllogistic rea- Santamarı́a, C., & Johnson-Laird, P. N. (2000).
soning. Quarterly Journal of Experimental Psy- An antidote to illusory inferences. Thinking and
chology, 5 2 A, 1 05 7–1 075 . Reasoning, 6, 3 1 3 –3 3 3 .
Newstead, S. E., Handley, S. J., & Buck, E. (1 999). Schaeken, W. S., De Vooght, G., Vandierendonck,
Falsifying mental models: Testing the predic- A., & d’Ydewalle, G. (Eds.). (1 999). Deductive
tions of theories of syllogistic reasoning. Mem- reasoning and strategies. Mahwah, NJ: Erlbaum.
ory and Cognition, 2 7, 3 44–3 5 4. Schaeken, W. S., Johnson-Laird, P. N., &
Ormerod, T. C., & Johnson-Laird, P. N. (in press). d’Ydewalle, G. (1 996). Mental models and
How pragmatics modulates the meaning of temporal reasoning. Cognition, 60, 205 –23 4.
sentential connectives. Schwartz, D., & Black, J. B. (1 996). Analog
Ormerod, T. C., Manktelow, K. I., & Jones, imagery in mental model reasoning: Depic-
G. V. (1 993 ). Reasoning with three types of tive models. Cognitive Psychology, 3 0, 1 5 4–
conditional: Biases and mental models. Quar- 21 9.
terly Journal of Experimental Psychology, 46A, Shimojo, S., & Ichikawa, S. (1 989). Intuitive rea-
65 3 –678. soning about probability: Theoretical and ex-
Osherson, D. N. (1 974–1 976) Logical abilities in perimental analyses of the ‘problem of three
children, vols. 1 –4. Hillsdale, NJ: Erlbaum. prisoners’. Cognition, 3 2 , 1 –24.
Sloman, S. A. (1 996). The empirical case for Vadeboncoeur, I., & Markovits, H. (1 999). The
two systems of reasoning. Psychological Bulletin, effect of instructions and information retrieval
1 1 9, 3 –22. on accepting the premises in a conditional rea-
Sloutsky, V. M., & Goldvarg, Y. (1 999). Effects soning task. Thinking and Reasoning, 5 , 97–1 1 3 .
of externalization on representation of inde- Van der Henst, J-B., Yang, Y., & Johnson-Laird,
terminate problems. In M. Hahn & S. Stones P. N. (2002). Strategies in sentential reasoning.
(Eds.), Proceedings of the 2 1 st annual conference Cognitive Science, 2 6, 425 –468.
of the Cognitive Science Society (pp. 695 –700). Vandierendonck, A., & De Vooght, G. (1 997).
Mahwah, NJ: Erlbaum. Working memory constraints on linear reason-
Stanovich, K. E. (1 999). Who is rational? Studies ing with spatial and temporal contents. Quar-
of individual differences in reasoning. Mahwah, terly Journal of Experimental Psychology, 5 0A,
NJ: Erlbaum. 803 –820.
Stenning, K. (2002). Seeing reason: Image and lan- Vandierendonck, A., De Vooght, G., Desim-
guage in learning to think. Oxford: Oxford Uni- pelaere, C., & Dierckx, V. (1 999). Model con-
versity Press. struction and elaboration in spatial linear syl-
Stenning, K., & Yule, P. (1 997). Image and lan- logisms. In W. S. Schaeken, G. De Vooght, A.
guage in human reasoning: A syllogistic illus- Vandierendonck, & G. d’Ydewalle (Eds.), De-
tration. Cognitive Psychology, 3 4, 1 09–1 5 9. ductive reasoning and strategies (pp. 1 91 –207).
Stevenson, R. J. (1 993 ). Language, thought and Mahwah, NJ: Erlbaum.
representation. New York: Wiley. Vosniadou, S., & Brewer, W. F. (1 992). Mental
Störring, G. (1 908). Experimentelle Unter- models of the earth: A study of conceptual
suchungen über einfache Schlussprozesse. change in childhood. Cognitive Psychology, 2 4,
Archiv für die gesamte Psychologie, 1 1 , 1 –27. 5 3 5 –5 85 .
Suppes, P. (1 970). A probabilistic theory of causal- Walsh, C. R., & Johnson-Laird, P. N. (2004). Co-
ity. Amsterdam: North-Holland. reference and reasoning. Memory and Cogni-
tion, 3 2 , 96–1 06.
Tversky, A., & Kahneman, D. (1 973 ). Avail-
ability: A heuristic for judging frequency Wason, P. C., & Johnson-Laird, P. N. (1 972). The
and probability. Cognitive Psychology, 5 , 207– psychology of reasoning. Cambridge, MA: Har-
23 2. vard University Press.
Tversky, A., & Kahneman, D. (1 982). Causal Webber, B. L. (1 978). Description formation and
schemas in judgements under uncertainty. In discourse model synthesis. In D. L. Waltz (Ed.),
D. Kahneman, P. Slovic, & A. Tversky (Eds), Theoretical issues in natural language processing
Judgement under uncertainty: Heuristics and (vol. 2). New York: Association for Computing
biases (pp. 1 1 7–1 28). Cambridge, Cambridge Machinery.
University Press. Wittgenstein, L. (1 922). Tractatus logico-philoso-
Tversky, A., & Kahneman, D. (1 983 ). Extensional phicus. London: Routledge & Kegan Paul.
versus intuitive reasoning: The conjunction fal- Yang, Y., & Johnson-Laird, P. N. (2000). How
lacy in probability judgment. Psychological Re- to eliminate illusions in quantified reasoning.
view, 90, 292–3 1 5 . Memory and Cognition, 2 8, 1 05 0–1 05 9.
CHAPTER 1 0
Visuospatial Reasoning
Barbara Tversky
Visuospatial reasoning is not simply a mat- uity of spatial reasoning, perhaps because
ter of running to retrieve a fly ball or wend- of the naturalness of mapping abstract el-
ing a way through a crowd or plotting a ements and relations to spatial ones, spatial
path to a destination or stacking suitcases reasoning serves as a basis for abstract knowl-
in a car trunk. It is a matter of deter- edge and inference. The prevalence of spa-
mining whether gears will mesh (Schwartz tial figures of speech in everyday talk attests
& Black, 1 996a), understanding how a car to that: We feel close to some people and
brake works (Heiser & Tversky, 2002), dis- remote from others; we try to keep our spir-
covering how to destroy a tumor without de- its up, to perform at the peak of our pow-
stroying healthy tissue (Duncker, 1 945 ; Gick ers, to avoid falling into depressions, pits,
& Holyoak, 1 980, 1 983 ), and designing a or quagmires; we enter fields that are wide
museum (Suwa & Tversky, 1 997). Perhaps open, struggling to stay on top of things and
more surprising, it is also a matter of decid- not get out of depth. Right now, in this sec-
ing whether a giraffe is more intelligent than tion, we establish fuzzy boundaries for the
a tiger (Banks & Flora, 1 977; Paivio, 1 978), current field of inquiry.
whether one event is later than another
(Boroditsky, 2000), and whether a conclu-
sion follows logically from its premises (Bar- Reasoning
wise & Etchemendy, 1 995 ; Johnson-Laird,
1 983 ). All these abstract inferences, and Before the research, a few words about the
more, appear to be based on spatial reason- words are in order. The core of reasoning
ing. Why is that? People begin to acquire seems to be, as Bruner put it years ago, go-
knowledge about space and the things in it ing beyond the information given (Bruner,
probably before they enter the world. In- 1 973 ). Of course, nearly every human ac-
deed, spatial knowledge is critical to survival tivity requires going beyond the information
and spatial inference critical to effective sur- given. The simplest recognition or general-
vival. Perhaps because of the (literal) ubiq- ization task, as well as the simplest action,
2 09
requires going beyond the information and transformations, even under direct ob-
given, for, according to a far more ancient servation of the brain, is another distinction
saying, you never step into the same river fraught with complexity and controversy.
twice. Yet many of these tasks and actions Evidence brought to bear for one can fre-
do not feel cognitive, do not feel like reason- quently be reinterpreted as evidence for the
ing. However, the border between percep- other (e.g., Anderson, 1 978). Both represen-
tual and cognitive processes may be harder tations and transformations themselves can
to establish than the borders between coun- each be decomposed into representations
tries in conflict. Fortunately, psychology is and transformations. Despite these compli-
typically free of territorial politics, and so cations, the distinction has been a productive
establishing boundaries between perception way to think about psychological processes.
and cognition is not essential. There seems to In fact, it is a distinction that runs deep
be a tacit understanding as to what counts as in human cognition, captured in language
perceptual and what as cognitive, although as subject and predicate and in behavior as
for these categories just as for simpler ones, agent/object and action. The distinction will
such as chairs and cups, the centers of the prove useful here more than as a way of or-
category enjoy more consensus than the bor- ganizing the literature (for related discus-
ders. Invoking principles or requirements for sion, see Doumas & Hummel, Chap. 4).
the boundaries between perception and cog- It has been argued that the very estab-
nition – consciousness, for example – seems lishment of representations entails inferen-
to entail more controversy than the separa- tial operations. A significant example is the
tion into territories. Gestalt principles of perceptual organiza-
How do we go beyond the information tion – grouping by similarity, proximity,
given? Going beyond the information given common fate, and good continuity – that
does not necessarily mean adding informa- contribute to scene segmentation and rep-
tion. One way to go beyond the information resentation. These are surely a form of vi-
given is to transform the information given. suospatial inference. Representations are in-
This is the concern of the earlier part of the ternal translations of external stimuli (or
manuscript. Going beyond the information internal data); as such, they not only elimi-
given can also mean transforming the given nate information from the external world –
information, sometimes according to rules, they also add to it and distort it in the ser-
as in deductive reasoning. Another way to vice of interpretation or behavior. Thus, if
go beyond the information given is to make inference is to be understood in terms of
inferences or judgments from it. Inference operating on or manipulating information
and judgment are the concerns of the later to draw new conclusions, then it begins in
part of the manuscript. Now some more dis- the periphery of the sensory systems with
tinctions regarding the visuospatial portion leveling and sharpening and feature detec-
of the title are made. tion and organization. Nevertheless, the field
has accepted a level of description of repre-
sentations and transformations – one higher
Representations and Transformations
than the levels of sensory and perceptual
Truths are hard to come by in science, processing; that level is reflected here.
but useful fictions and approximate truths
abound. One of these is the distinction
between representations and transforma- Visuospatial
tions, between information and processes,
between data and the operations performed What makes visuospatial representations
on data. Representations place limits on visuospatial? Visuospatial transformations
transformations as they select and structure visuospatial? First and foremost, visuospatial
the information captured from the world representations capture visuospatial proper-
or the mind. Distinguishing representations ties of the world. They do this in a way
visuospatial reasoning 21 1
that preserves, at least in part, the spatial– top-down by studying complex reasoning
structural relations of that information (see that has a visuospatial basis. Both ap-
Johnson-Laird, 1 983 ; Pierce in Houser & proaches have been productive. We begin
Kloesel, 1 992). This means that visuospa- with elements.
tial properties that are close or above or be-
low in the world preserve those relations Imagery as Internalized Perception
in the representations. Visual includes static
properties of objects, such as shape, texture, The major research tradition studying visu-
and color, or between objects and reference ospatial reasoning from a bottom-up per-
frames, such as distance and direction. It also spective has been the imagery program pi-
includes dynamic properties of objects such oneered by Shepard (see Finke & Shepard,
as direction, path, and manner of movement. 1 986; Shepard & Cooper, 1 982; Shepard &
By this account, visuospatial transformations Podgorny, 1 978, for overviews) and Kosslyn
are those that change or use visuospatial in- (1 980, 1 994b), which has aimed to demon-
formation. Many of these properties of static strate parallels between visual perception
and dynamic objects and of spatial relations and visual imagery. There are two basic
between objects are available from modal- tenets of the approach, one regarding rep-
ities other than vision. This may explain resentations and the other regarding opera-
why well-adapted visually impaired individ- tions on representations: that mental images
uals are not disadvantaged at many spatial resemble percepts and that mental trans-
tasks (e.g., Klatzky, Golledge, Cicinelli, & formations on images resemble observable
Pellegrino, 1 995 ). Visuospatial representa- changes in things in the world, as in men-
tions are regarded as contrasting with other tal rotation, or perceptual processes per-
forms of representation – notably linguis- formed on things in the world, as in men-
tic. The similarities (e.g., Talmy, 1 983 , 2001 ) tal scanning. Kosslyn (1 994b) has persisted
and differences between visuospatial and in these aims, more recently demonstrat-
linguistic representations provide insights ing that many of the same neural structures
into both. are used for both. Not the demonstrations
Demonstrating properties of internal rep- per se, but the interpretations of them have
resentations and transformations is tricky for met with controversy (e.g., Pylyshyn, 1 978,
another reason; representations are many 1 981 ). In attempting to demonstrate the sim-
steps from either (controlled) input or ilarities between imagery and perception,
(observed) output. For these reasons, the the imagery program has focused both on
study of internal representations and pro- properties of objects and on characteristics
cesses was eschewed not only by behavior- of transformations on objects – the former,
ists but also by experimentalists. It was one representations, and the latter, operations or
of the first areas to flourish after the so- transformations. The thrust of the research
called Cognitive Revolution of the 1 960s programs has been to demonstrate that im-
with a flurry of innovative techniques to ages are like internalized perceptions and
demonstrate form and content of internal transformations of images like transforma-
representations and the transformations per- tions of things in the world.
formed on them. It is to that research that we
now turn. representations
In the service of demonstrating that im-
ages preserve characteristics of perceptions,
Representations and Transformations
Shepard and his colleagues brought evi-
Visuospatial reasoning can be approached dence from similarity judgments as sup-
bottom-up by studying the elementary rep- port. They demonstrated “second-order
resentations and processes that presumably isomorphisms,” similarity spaces for per-
form the building blocks for more com- ceived and imagined stimuli that have the
plex reasoning. It can also be approached same structure, that is, are fit by the
same underlying multidimensional space Additional support for the claim that images
(Shepard & Chipman, 1 970). For example, preserve properties of percepts comes from
similarity judgments of shapes of cutouts tasks requiring construction of images. Con-
of states conform to the same multidimen- structing images takes longer when there are
sional space as similarity judgments of imag- more parts to the image, even when the
ined shapes of states. The same logic was same figure can be constructed from more
used to show that color is preserved in im- or fewer parts (Kosslyn, 1 980).
ages, as well as configurations of faces (see The imagery-as-internalized-perception
Gordon & Hayward, 1 973 ; Shepard, 1 975 ). has proved to be too narrow a view of the
Similar reasoning was used to demonstrate variety of visuospatial representations. In ac-
qualitative differences between pictorial and counting for syllogistic reasoning, Johnson-
verbal representations in a task requiring se- Laird (1 983 ) proposed that people form
quential same–different judgments on pairs mental models of the situations described
of schematic faces and names (Tversky, by the propositions (see Johnson-Laird,
1 969). The pictorial and verbal similarity Chap. 9). Mental models contrast with clas-
of the set of faces was orthogonal so the sic images in that they are more schematic
“different” responses were a clue to the un- than classical images. Entities are repre-
derlying representation; times to respond sented as tokens, not as likenesses, and
“different” were faster when more features spatial relations are approximate, almost
between the pairs differ. These times indi- qualitative. A similar view was developed
cated that when participants expected the to account for understanding text and dis-
target (second) stimulus would be a picture, course, then listeners and readers construct
they encoded the first stimulus pictorially, schematic models of the situations described
whether it had been a picture of a face or (e.g., Kintsch & van Dijk, 1 983 ; Zwaan &
its name. The converse also held: When the Radvansky, 1 998). As is seen, visuospatial
target stimulus was expected to be a name, mental representations of environments, de-
participants coded the first stimulus verbally vices, and processes are often schematic,
irrespective of its presented modality. even distorted, rather than detailed and ac-
To demonstrate that mental images pre- curate internalized perceptions.
serve properties of percepts, Kosslyn and his
colleagues presented evidence from studies
of reaction times to detect features of imag- transformations
ined objects. One aim is to show that prop- Here, the logic is the same for most research
erties that take longer to verify in percepts programs and in the spirit of Shepard’s
take longer to identify in images. For ex- notion of second-order isomorphisms: to
ample, when participants were instructed to demonstrate that the times to make par-
construct images of named animals in order ticular visuospatial judgments in memory
to judge whether the animal had a partic- increase with the times to observe or per-
ular part, they verified large parts of ani- form the transformations in the world. The
mals, such as the back of a rabbit, faster dramatic first demonstration was mental ro-
than small but highly associated ones, such tation (Shepard & Metzler, 1 971 ): time to
as the whiskers of a rat. When participants judge whether two figures in different ori-
were not instructed to use imagery to make entations (Figure 1 0.1 ) are the same or
judgments, they verified small associated mirror images correlate linearly with the an-
parts faster than large ones. When not in- gular distance between the orientations of
structed to use imagery, participants used the figures. The linearity of the relation-
their general world knowledge to make judg- ship – 1 2 points on a straight line – suggests
ments (Kosslyn, 1 976). Importantly, when smooth, continuous mental transformation.
the participants explicitly used imagery, they Although linear functions have been ob-
took longer to verify parts, large or small, tained for the original stimuli, strings of
than when they relied on world knowledge. 1 0 cubes with two bends, monotonic, but not
creases (Bundesen, Larsen, & Farrell, 1 981 ;

Moyer, 1 973 ). New objects can be con-
structed in imagery, which is a skill presum-
ably related to design and creativity (e.g.,
Finke, 1 990, 1 993 ). In a well-known exam-
ple, Finke, Pinker, and Farah (1 989) asked
students to imagine a capital letter J centered
under an upside-down grapefruit half. Stu-
dents reported “seeing” an umbrella. Even
without instructions to image, certain tasks
spontaneously encourage formation of vi-
sual images. For example, when participants
are asked whether a described spatial array,
such as star above plus, matches a depicted
one, response times indicate that they trans-
form the description into a depiction when
given sufficient time to mentally construct
the situation (Glushko & Cooper, 1 978;
Tversky, 1 975 ).
In the cases of mental rotation, mental
movement, and mental size transformations,
objects or object parts undergo imagined
transformations. There is also evidence that
objects can be mentally scanned in a contin-
Figure 1 0.1 . Mental rotation task of Shepard uous manner. In a popular task introduced
and Metzler (1 971 ). Participants determine
by Kosslyn and his colleagues, participants
whether members of each pair can be rotated
into congruence.
memorize a map of an island with several
landmarks such as a well and a cave. Partic-
ipants are then asked to conjure an image
of the map and to imagine looking first at
linear, functions are obtained for other stim-
the well and then mentally scanning from
uli such as letters (Shepard & Cooper, 1 982).
the well to the cave. The general finding is
There are myriad possible mental transfor-
that mental scanning between two imagined
mations, only a few of which have been stud-
landmarks increases linearly as the distance
ied in detail. They may be classified into
between them increases (Denis & Kosslyn,
mental transformations on other objects and
1 999; Kosslyn, Ball, & Rieser, 1 978; Fig-
individuals, and mental transformations on
ure 1 0.2). The phenomenon holds for spa-
oneself. In both cases, the transformations
tial arrays established by description rather
may be global, wholistic, or of the entire en-
than depiction – again, under instructions to
tity – the transformations may be operations
form and use images (Denis, 1 996). Men-
on parts of entities.
tal scanning occurs for arrays in depth and
Mental Transformations on Objects. Ro- for flat perspectives on 3 D arrays (Pinker,
tation is not the only transformation that 1 980). In the previous studies, participants
objects in the world undergo. They can were trained to mentally scan and directed to
undergo changes of size, shape, color, in- do so, leaving open the question of whether
ternal features, position, combination, and it occurs spontaneously. It seems to do so in
more. Mental performance of some of these a task requiring direction judgments on re-
transformations has been examined. The membered arrays. Participants first saw an
time to mentally compare the shapes of array of dots. After the dots disappeared,
two rectangles differing in size increases as an arrow appeared on the screen. The task
the actual size difference between them in- was to say whether the arrow pointed to
the previous location of a dot. Reaction visuospatial reasoning has not only percep-
times increased with distance of the arrow tual, but also motor, foundations.
to the likely dot, suggesting that participants
Mental Transformations of Self. That men-
mentally scan from the arrow to answer
tal imagery is both perceptual and motor
the question (Finke & Pinker, 1 982, 1 983 ).
follows from broadening the basic tenets of
Mental scanning may be part of catching
the classical account for imagery. According
or hitting the ball in baseball, tennis, and
to that account, mental processes are inter-
other sports.
nalizations of external or externally driven
Applying Several Mental Transformations. processes – perceptual ones according to
Other mental transformations on objects are the classic view (e.g., in the chapter title of
possible – for example, altering the internal Shepard & Podgorny, 1 978, “Cognitive pro-
configuration of an object. To solve some cesses that resemble perceptual processes”).
problems, such as geometric analogies, peo- The acts of drawing a figure or construct-
ple need to apply more than one mental ing an object entail both perceptual and
transformation to a figure to obtain the an- motor processes working in concert as do
swer. In most cases, the order of applying many other activities performed in both real
the transformations is optional; that is, first and virtual worlds, from shaking hands to
rotating and then moving a figure yield the wayfinding.
same answer as first moving and then rotat- Evidence for mental transformations of
ing. Nevertheless, people have a preferred self, or motor imagery, rather than or in addi-
order for performing a sequence of mental tion to visual imagery has come from a vari-
transformations, and when this order is vi- ety of tasks. The time taken to judge whether
olated, both errors and performance time a depicted hand is right or left correlates
increase (Novick & Tversky, 1 987). What with the time taken to move the hand into
accounts for the preferred order? Although the depicted orientation as if participants
the mental transformations are performed were mentally moving their hands in or-
in working memory, the determinants of order to make the right/left decision (Parsons,
der do not seem to be related to working 1 987b; Sekiyama, 1 982). Mental reorienta-
memory demands. Move is one of the least tion of one’s body has been used to ac-
demanding transformations, and it is typi- count for reaction times to judge whether
cally performed first, whereas rotate is one a left or right arm is extended in pictures
of the most difficult transformations and is of bodies in varying orientations from up-
performed second. Then transformations of right (Parsons, 1 987a). In those studies, re-
intermediate difficulty are performed. What action times depend on the angle of rotation
correlates with the order of applying succes- and the degree of rotation. For some orienta-
sive mental transformations is the order of tions, notably the picture plane, the degree
drawing. Move determines where the pencil of rotation from upright has no effect. This
is to be put on the paper, the first act of draw- allows dissociating mental transformations
ing. Rotate determines the direction in which of other, in this case, mental rotation from
the first stroke should be taken, and it is the mental transformations of self, in this case,
next transformation. The next transforma- perspective transformations, for the latter do
tions to be applied are those that determine yield increases in reaction times with de-
the size of the figure and its internal details gree of rotation from upright (Zacks, Mires,
(remove, add part, change size, change shading, Tversky, & Hazeltine, 2000; Zacks & Tver-
add part). Although the mental transforma- sky, in press). Imagining oneself interacting
tions have been tied to perceptual processes, with a familiar object such as a ball or a ra-
the ordering of performing them appears zor selectively activates left inferior parietal
to be tied to a motor process, the act of and sensorimotor cortex, whereas imagining
drawing or constructing a figure. This finding another interacting with the same objects
presaged later work showing that complex selectively activates right inferior parietal,
same level and are asked which would spill

first when tilted, they are typically incorrect
from visual imagery. However, if they close
their eyes and imagine tilting each glass un-
til it spills, they correctly tilt a wide glass
less than a narrow one (Schwartz & Black,
1 999). Think of turning a car versus turning a
boat. To imagine making a car turn right, you
must imagine rotating the steering wheel to
the right; however, to imagine making a boat
turn right, you must imagine moving the
rudder lever left. In mental rotation of left
and right hands, the shortest motor path ac-
counts for the reaction times better than the
shortest visual path (Parsons, 1 987b). Men-
tal enactment also facilitates memory, even
for actions described verbally (Englekamp,
1 998). Imagined motor transformations pre-
sumably underlie mental practice of athletic
Figure 1 0.2 . Mental scanning. Participants and musical routines – techniques known
memorize map and report time to mentally scan to benefit performance (e.g., Richardson,
from one feature to another (after Kosslyn, Ball, 1 967).
& Rieser, 1 978).
The reasonable conclusion, then, is that
both internalized perceptual transforma-
tions and internalized motor transformations
precuneus, posterior cingulated, and fron-
can serve as bases for transformations in
topolar cortex (Ruby & Decety, 2001 ).
mental imagery. Perceptual and motor im-
There have been claims that visual and
agery can work in concert in imagery, just
motor imagery, or as we have put it, mental
as perceptual and motor processes work in
transformations of object and of self, share
concert in conducting the activities of life.
the same underlying mechanisms (Wexler,
Kosslyn, & Berthoz, 1 998; Wolschlager &
Wolschlager, 1 998). For example, perform- elementary transformations
ing clockwise physical rotations facilitates The imagery-as-internalized-perception ap-
performing clockwise mental rotations but proach has provided evidence for myriad
interferes with performing counterclock- mental transformations. We have reviewed
wise mental rotations. However, this may evidence for a number of mental per-
be because planning, performing, and mon- ceptual transformations: scanning, changing
itoring the physical rotation require both orientation, location, size, shape, color; con-
perceptual and motor imagery. The work structing from parts; and rearranging parts.
of Zacks and collaborators (Zacks et al., Then we have motor transformations: mo-
2000; Zacks & Tversky, in press) and Ruby tions of bodies, wholes, or parts. This ap-
and Decety (2001 ) suggests that these two proach has the potential to provide a catalog
classes of mental transformations are disso- of elementary mental transformations that
ciable. Other studies directly comparing the are simple inferences and that can combine
two systems support their dissociability: The to enable complex inferences.
consequences of using one can be different The work on inference, judgment, and
from the consequences of using the other problem solving will suggest transformations
(Schwartz, 1 999; Schwartz & Black, 1 999; that have yet to be explored in detail. Here,
Schwartz & Holton, 2000). When people we propose a partial catalog of candidates
imagine wide and narrow glasses filled to the for elementary properties of representations
and transformations, expanding from the re- figures, finding simple figures in more com-
search reviewed: plex ones, loads high on spatial visualization,
and performance on mental rotation tasks
r Determining static properties of entities:
naturally loads high on the mental rotation
figure/ground, symmetry, shape, internal factor. As frequently as they are found, these
configuration, size, color, texture, and three abilities do not span the range of spa-
more tial competencies. Yet another partially in-
r Determining relations between static dependent visuospatial ability is visuospatial
entities: memory, remembering the layout of display
◦ With respect to a frame of reference: (e.g., Betrancourt & Tversky, in press). The
location, direction, distance, and more number of distinct spatial abilities as well as
◦ With respect to other entities, com- their distinctness remain controversial (e.g.,
paring size, color, shape, texture, loca- Carroll, 1 993 ; Hegarty & Waller, in press).
tion, orientation, similarity, and other More recent work explores the relations
attributes of spatial abilities to the kinds of men-
r Determining relations of dynamic and static tal transformations that have been distin-
entities: guished – for example, imagining an object
◦ With respect to other entities or rotate versus imagining changing one’s own
to a reference frame: direction, orientation. The mental transformations, in
speed, acceleration, manner, intersec- turn, are often associated with different
tion/collision brain regions (e.g., Zacks, Mires, Tversky, &
r Performing transformations on entities: Hazeltine, 2000; Zacks, Ollinger, Sheridan,
& Tversky, 2002; Zacks & Tversky, in
change location (scanning); change per-
press). Kozhevniikov, Kosslyn, and Shepard
spective, orientation, size, shape; mov-
in press) proposed that spatial visualiza-
ing wholes; reconfiguring parts; zooming;
tion and mental rotation correspond respec-
enacting
r Performing transformations on self: change tively to the two major visual pathways in
the brain – the ventral “what” pathway un-
of perspective, change of location, derlying object recognition and the dorsal
change of size, shape, reconfiguring parts, “where” pathway underlying spatial loca-
enacting tion. Interestingly, scientists and engineers
score relatively high on mental rotation and
individual differences artists score relatively high on spatial visu-
Yes, people vary in spatial ability. However, alization. Similarly, architects and design-
spatial ability does not contrast with ver- ers score higher than average on embed-
bal ability; in other words, someone can be ded figure tasks but not on mental rota-
good or poor at both, as well as good in one tion (Suwa & Tversky, 2003 ). Associating
and poor in the other. In addition, spatial spatial ability measures to mental transfor-
ability (like verbal ability) is not a single, mations and brain regions are promising
unitary ability. Some of the separate spa- directions toward a systematic account of
tial abilities differ qualitatively; that is, they spatial abilities.
map well onto the kinds of mental transfor-
mations they require. A meta-analysis of a
number of factor analyses of spatial abili-
Inferences
ties yielded three recurring factors (Linn &
Peterson, 1 986): spatial perception, spatial
Inferences from Observing Motion
visualization, and mental rotation. Rod-and-
in Space
frame and water-level tasks load high on spa-
tial perception; this factor seems to reflect To ensure effective survival, in addition to
choice of frame of reference, within an ob- perceiving the world as it is we need to
ject or extrinsic. Performance on embedded also anticipate the world that will be. This
entails inference – inferences from visuospa- From abstract motion paths, adults can
tial information. Some common inferences, make further inferences about what gen-
such as determining where to intersect a fly- erated the motion. In point-light films,
ing object – in particular, a fly ball (e.g., the only thing visible is the movement
McBeath, Shaffer, & Kaiser, 1 995 ) – or what of lights placed at motion junctures of,
moving parts belong to the same object (e.g., for example, the joints of people walk-
Spelke, Vishton, & von Hofsten, 1 995 ) are ing or along branches of bushes swaying.
beyond the scope of the chapter. From sim- From point-light films, people can determine
ple, abstract motions of geometric figures, whether the motion is walking, running, or
people, even babies, infer causal impact and dancing, of men or of women, of friends
basic ontological categories – notably, inani- (Cutting & Kozlowski, 1 977; Johannson,
mate and animate. A striking demonstration 1 973 ; Kozlowski & Cutting, 1 977), of bushes
of perception of causality comes from the or trees (Cutting, 1 986). Surprisingly, from
work of Michotte (1 946/1 963 ; see Buehner point-light displays of action, people are bet-
& Cheng, Chap. 7). Participants watch films ter at recognizing their own movements than
of a moving object, A, coming into contact those of friends, suggesting that motor ex-
with a stationary object, B. When object B perience contributes to perception of mo-
moves immediately, continuing the direction (Prasad, Loula, & Shiffrar, 2003 ). Even
tion of motion suggested by object A, people abstract films of movements of geometric
perceive A as launching B, A as causing B to figures in sparse environments can be inter-
move. When A stops so both A and B are preted as complex social interactions, such
stationary before B begins to move, the per- as chasing and bullying, when they are espe-
ception of a causal connection between A’s cially designed for that (Heider & Simmel,
motion and B’s is lost; their movements are 1 944; Martin & Tversky, 2003 ; Oatley &
seen as independent events. This is a forceful Yuill, 1 985 ) or playing hide-and-seek, but in-
demonstration of immediate perception of terpreting these as intentional actions is not
causality from highly abstract actions, as well immediate; rather, it requires repeated ex-
as of the conditions for perception of causal- posure and possibly instructions to interpret
ity. What seems to underlie the perception the actions (Martin & Tversky, 2003 ).
of causality is the perception that object A Altogether, simply from abstract mo-
acts on object B. Actions on objects turn out tion paths or animated point-light displays,
to be the basis for segmenting events into people can infer several basic ontological
parts (Zacks, Tversky, & Iyer, 2001 ). categories: causal action, animate versus
In Michotte’s (1 946/1 963 ) demonstra- inanimate motion, human motion, motion
tions, the timing of the contact between the of males or females and familiar individuals,
initially moving object and the stationary ob- and social interactions.
ject that begins to move later is critical. If
A stops moving considerably before B be- Mental Spatial Inferences
gins to move, then B’s motion is perceived
to be independent of A’s. B’s movement inferences in real environments
in this case is seen as self-propelled. Self- Every kid who has figured out a short-cut,
propelled movement is possible only for ani- and who has not, has performed a spatial
mate agents, or, more recently in the history inference (for a more recent overview of
of humanity, for machines. Possible paths kids, see Newcombe & Huttenlocher, 2000).
and trajectories of animate motion differ Some of these inferences turn out to be
from those for inanimate motion. Preschool easier than others, often surprisingly. For
children can infer which motion paths are example, in real environments, inferences
appropriate for animate and inanimate mo- about where objects will be in relationship
tion, and even for abstract stimuli; they also to oneself after imagined movement in the
offer sensible explanations for their infer- environment turn out to be relatively ac-
ences (Gelman, Durgin, & Kaufman, 1 995 ). curate when the imagined movement is a
translation, that is, movement forward or also see that actual motor actions affect and
backward aligned with the body. How- reflect the character of mental ones.
ever, if the imagined movement is rota-
tional, a change in orientation, updating is inferences in mental environments
far less accurate (e.g., Presson & Montello, The section on inference opened with spa-
1 994; Reiser, 1 989). When asked to imagine tial inferences made in real environments.
walking forward a certain distance, turning, Often, people make inferences about envi-
walking forward another distance, and then ronments they are not currently in, for ex-
pointing back to the starting point, partic- ample, when they tell a friend how to get
ipants invariably err by not taking into ac- to their house and where to find the key
count the turn in their pointing (Klatzky, when they arrive. For familiar environments,
Loomis, Beall, Chance, & Golledge, 1 998). If people are quite competent at these sorts
they actually move forward, turn, and con- of spatial inferences. The mental represen-
tinue forward, but blindfolded, they point tations and processes underlying these in-
correctly. Spatial updating in real environ- ferences have been studied for several kinds
ments is more accurate after translation than of environments – notably the immediately
after rotation, and updating after rotation surrounding visible or tangible environment
is selectively facilitated by physical rotation. and the environment too large to be seen
This suggests a deep point about spatial in- at a glance. These two situations, the space
ferences and possibly other inferences: that around the body, and the space the body
in inference, mental acts interact with phys- navigates, seem to function differently in our
ical acts. lives, and consequently, to be conceptualized
differently (Tversky, 1 998).
Spatial updating for the space around the
gesture body was first studied using language alone
Interaction of mind and body in inference is to establish the environments (Franklin &
also revealed in gesture. When people de- Tversky, 1 990). It is significant that lan-
scribe space but are asked to sit on their guage alone, with no specific instructions
hands to prevent gesturing, their speech fal- to form images, was sufficient to establish
ters (Rauscher, Krauss, & Chen, 1 996), sug- mental environments that people could up-
gesting that the acts of gesturing promote date easily and without error. In the proto-
spatial reasoning. Even blind children ges- typical spatial framework task, participants
ture as they describe spatial layouts (Iverson read a narrative that describes themselves
& Goldin-Meadow, 1 997). in a 3 D spatial scene, such as a museum
The nature of spontaneous gestures sug- or hotel lobby (Franklin & Tversky, 1 990;
gests how this happens. When describing Figure 1 0.3 ). The narrative locates and de-
continuous processes, people make smooth, scribes objects appropriate to the scene be-
continuous gestures; when describing dis- yond the observer’s head, feet, front, back,
crete ones, people make jagged, discontin- left, and right (locations chosen randomly).
uous ones (Alibali, Bassok, Solomon, Syc, After participants have learned the scenes
& Goldin-Meadow, 1 999). For space, peo- described by the narratives, they turn to a
ple tend to describe environments as if they computer that describes them as turning in
were traveling through them or as if they the environment so they are now facing a dif-
were viewing them from above. The plane ferent object. The computer then cues them
of their gestures differs in each case in cor- with direction terms, front, back, head, and so
respondence with the linguistic perspective on, to which the participants respond with
they adopt (Emmorey, Tversky, & Taylor, the name of the object now in that direc-
2000). Earlier, mental transformations that tion. Of interest are the times to respond,
appear to be internalized physical transfor- depending on the direction from the body.
mations, such as those underlying handed- The classical imagery account would predict
ness judgments, were described. Here, we that participants will imagine themselves in
and slowest to the left/right axis, the pat-

tern obtained for the prototypical situation.
When narratives describe observers as reclin-
ing in the scenes, turning from back to side
to front, then no axis of the body is corre-
lated with gravity; thus, times depend on the
asymmetries of the body, and the pattern
changes. Times to retrieve objects in front
and back are then fastest because the per-
ceptual and behavioral asymmetries of the
front/back axis are most important. This is
the axis that separates the world that can be
seen and manipulated from the world that
cannot be seen or manipulated.
By now, dozens of experiments have ex-
amined patterns of response times to system-
atic changes in the described spatial envi-
Figure 1 0.3. Spatial framework situation. ronment (e.g., Bryant, Tversky, & Franklin,
Participants read a narrative describing objects 1 992; Franklin, Tversky, & Coon, 1 992). In
around an observer (after Bryant, Tversky, & one variant, narratives described participants
Franklin, 1 992).
at an oblique angle outside the environ-
ment looking onto a character (or two!) in-
the environment facing the selected object side the environment; in that case, none of
and then imagine themselves turning to face the axes of the observer’s body is corre-
each cued object in order to retrieve the ob- lated with axes of the characters in the nar-
ject in the cued direction. The imagery ac- rative, and the reaction times to all direc-
count predicts that reaction times should be tions are equal (Franklin et al., 1 992). In an-
fastest to the object in front, then to the ob- other variant, narratives described the scene,
jects 90 degrees away from front, that is, left, a special space house constructed by NASA,
right, head, and feet, and slowest to objects as rotating around the observer instead of
1 80 degrees from front, that is, objects to the the observer’s turning in the scene (Tver-
back. Data from dozens of experiments fail sky, Kim, & Cohen, 1 999). That condition
to support that account. proved difficult for participants. They took
Instead, the data conform to the spatial twice as long to update the environment
framework theory according to which partic- when the environment moved than when
ipants construct a mental spatial framework the observer moved – a case problematic
from extensions of three axes of the body: for pure propositional accounts of mental
head/feet, front/back, and left/right. Times spatial transformations. Once participants
to access objects depend on the asymmetries had updated the environment, retrieval
of the body axes as well as the asymmetries times corresponded to the spatial frame-
of the axes of the world. The front/back and work pattern.
head/feet axes have important perceptual Yet other experiments have varied the
and behavioral asymmetries that are lacking way the environment was conveyed, com-
in the left/right axis. The world also has three paring description, diagram, 3 D model, and
axes, only one of which is asymmetric, the life (Bryant & Tversky, 1 999; Bryant, Tver-
axis conferred by gravity. For the upright ob- sky, & Lanca, 2001 ). When the scene is con-
server, the head/feet axis coincides with the veyed by narrative, life, or a 3 D model, the
axis of gravity, and so responses to head and standard spatial framework pattern obtains.
feet should be fastest, and they are. Accord- However, when the scene is conveyed by
ing to the spatial framework account, times a diagram, participants spontaneously adopt
should be next fastest to the front/back axis an external perspective on the environment.
Their response times are consonant with route perspectives, the mental transforma-
performing a mental rotation of the entire tion needed to understand the location in-
environment rather than performing a men- formation is a transformation of self, an ego-
tal change of their own perspective with re- centric transformation of one’s viewpoint
spect to a surrounding environment (Bryant in an environment. For survey perspectives,
& Tversky, 1 999). Which viewpoint partici- the mental transformation needed to under-
pants adopt, and consequently which mental stand the location information is a transfor-
transformation they perform, can be altered mation of other, a kind of mental scanning
by instructions. When instructed to do so, of an object.
participants will adopt the internal perspec- The prevalence of these two perspectives
tive embedded in the environment in which in imagery, the external perspective viewing
the observer turns from a diagram or the ex- an object or something that can be repre-
ternal perspective from a model in which the sented as an object and the internal perspec-
entire environment is rotated with the pre- tive viewing an environment from within,
dicted changes in patterns of retrieval times. is undoubtedly associated with their preva-
Similar findings have been reported by lence in the experience of living. In life, we
Huttenlocher and Presson (1 979), Wraga, observe changes in the orientation, size, and
Creem, and Proffitt (2000), and Zacks et al. configuration of objects in the world and
(in press). scan them for those changes. In life, we move
around in environments, updating our po-
sition relative to the locations of other ob-
route and survey perspectives jects in the environment. We are adept at
When people are asked to describe envi- performing the mental equivalents of these
ronments that are too large to be seen at a actual transformations. There is a natural
glance, they do so from one of two perspec- correspondence between the internal and
tives (Taylor & Tversky, 1 992a, 1 996). In a external perspectives and the mental trans-
route perspective, people address the listener formations of self and other, but the human
as “you,” and take “you” on a tour of the en- mind is flexible enough to apply either trans-
vironment, describing landmarks relative to formation to either perspective. Although
your current position in terms of your front, we are biased to take an external perspec-
back, left, and right. In a survey perspective, tive on objects and mentally transform them
people take a bird’s eye view of the envi- and biased to take an internal perspective on
ronment and describe locations of landmarks environments and mentally transform our
relative to one another in terms of north, bodies with respect to them, we can take
south, east, and west. Speakers (and writers) internal perspectives on objects and ex-
often mix perspectives, contrary to linguists ternal perspectives on events. The mental
who argue that a consistent perspective is world allows perspectives and transforma-
needed both for coherent construction of tions, whereas the physical world does not.
a message and for coherent comprehen- Indeed, conceptualizing a 3 D environment
sion (Taylor & Tversky, 1 992, 1 996; Tversky, that surrounds us and is too large to be seen
Lee, & Mainwaring, 1 999). In fact, con- at once as a small flat object before the eyes,
struction of a mental model is faster when something people, even children, have done
perspective is consistent, but the effect is for eons whenever they produce a map, is
small and disappears quickly during retrieval a remarkable feat of the human mind (cf.
from memory (Lee & Tversky, in press). Tversky, 2000a).
In memory for locations and directions of
landmarks, route and survey statements are
verified equally quickly and accurately re- effects of language on spatial thinking
gardless of the perspective of learning, pro- Speakers of Dutch and other Western lan-
vided the statements are not taken verbatim guages use both route and survey perspec-
from the text (Taylor & Tversky, 1 992b). For tives. Put differently, they can use either a
visuospatial reasoning 221
relative spatial reference system or an abso- shopping center. Indeed, visuospatial think-
lute (extrinsic) spatial reference system to ing is fundamental to the reasoning processes
describe locations of objects in space. Rela- described in other chapters in this handbook,
tive systems use the spatial relations “left,” as discussed in the chapters on similarity (see
“right,” “front,” and “back” to locate objects; Goldstone & Son, Chap. 2), categorization
absolute or extrinsic systems use terms (see Medin & Rips, Chap. 3 ), induction (see
equivalent to “north,” “south,” “east,” and Sloman & Lagnado, Chap. 5 ), analogical rea-
“west.” A smattering of languages dispersed soning (see Holyoak, Chap. 6), causality (see
around the world do not describe locations Buehner & Cheng, Chap. 7), deductive rea-
using “left” and “right” (Levinson, 2003 ). In- soning (see Evans, Chap. 8), mental models
stead, they rely on an absolute system, so (see Johnson-Laird, Chap. 9), and problem
a speaker of those languages would refer to solving (see Novick & Bassok, Chap. 1 4). For-
your coffee cup as the “north” cup rather tunately for both reader and author, there is
than the one on “your right.” Talk appar- no need to repeat those discussions here.
ently affects thought. Years of talking about
space using an absolute spatial reference sys- Distortions as Clues to Reasoning
tem have had fascinating consequences for
thinking about space. For example, speakers Another approach to revealing visuospa-
of absolute languages reconstruct a shuffled tial reasoning has been to demonstrate the
array of objects relative to extrinsic direc- ways that visuospatial representations differ
tions in contrast to speakers of Dutch, who systematically from situations in the world.
reconstruct the array relative to their own This approach, which can be called the dis-
bodies. What’s more, when speakers of lan- tortions program, contrasts with the classi-
guages with only extrinsic reference sys- cal imagery approach. The aim of the distor-
tems are asked to point home after being tions approach is to elucidate the processes
driven hither and thither, they point with involved in constructing and using men-
impressive accuracy, in contrast to Dutch tal representations by showing their conse-
speakers, who point at random. The view quences. The distortions approach has fo-
that the way people talk affects how they cused more on relations between objects
think has naturally aroused controversy (see and relations between objects and refer-
Gleitman & Papafragou, Chap. 26), but is re- ence frames, as these visuospatial properties
ceiving increasing support from a variety of seem to require more constructive processes
tasks and languages (e.g., Boroditsky, 2001 ; than those for establishing representations
Boroditsky, Ham, & Ramscar, 2002). If we of objects. Some systematic distortions have
take a broader perspective, the finding that also been demonstrated in representations
language affects thought is not as startling. of objects.
Language is a tool, such as measuring in-
struments or arithmetic or writing; learn- representations
ing to use these tools also has consequences Early on, the Gestalt psychologists at-
for thinking. tempted to demonstrate that memory for
figures got distorted in the direction of good
figures (see Riley, 1 962). This claim was con-
tested and countered by increasingly sophis-
Judgments ticated empirical demonstrations. The dis-
pute faded in a resolution: visual stimuli
Complex visuospatial thinking is fundamen- are interpreted, sometimes as good figures;
tal to a broad range of human activity, from memory tends toward the interpretations.
providing directions to the post office and So if o – o is interpreted as “eyeglasses,” par-
understanding how to operate the latest ticipants later draw the connection curved,
electronic device to predicting the conse- whereas if it is interpreted as “barbells,”
quences of chemical bonding or designing a they do not (Carmichael, Hogan, & Walter,
1 93 2). Little noticed is that the effect does Tversky, 1 992, 2000b, 2000c), it is diffi-
not appear in recognition memory (Prentice, cult to clearly attribute error to either rep-
1 95 4). Since then, and relying on the sophis- resentations or processes. Rather the errors
ticated methods developed, there has been seem to be consequences of both, of schema-
more evidence for shape distortion in repre- tized, hence distorted, representations con-
sentations. Shapes that are nearly symmet- structed ad hoc in order to enable specific
ric are remembered or judged as more sym- judgments, such as the direction or distance
metric than they actually are, as if people between pairs of cities. When answering
code nearly symmetric objects as symmetric such questions, it is unlikely that people con-
(Freyd & Tversky, 1 984; McBeath, Schiano, sult a library of “cognitive maps.” Rather, it
& Tversky, 1 997; Tversky & Schiano, 1 989). seems that they draw on whatever informa-
Given that many of the objects and be- tion they have that seems relevant, organiz-
ings that we encounter are symmetric, but ing it for the question at hand. The reliability
are typically viewed at an oblique angle, of the errors under varying judgments makes
symmetry may be a reasonable assump- it reasonable to assume erroneous represen-
tion, although one that is wrong on occa- tations are reliably constructed. Some of the
sion. Size is compressed in memory (Kerst organizing principles that yield systematic
& Howard, 1 978). When portions of ob- errors are reviewed in the next section.
jects are truncated by picture frames, the
objects are remembered as more complete Hierarchical Organization. Dots that are
than they actually were (Intraub, Bender, & grouped together by good continuation, for
Mangels, 1 992). example, parts of the same square out-
lined in dots, are judged to be closer than
representations and transformations: spatial dots that are actually closer but parts of
configurations and cognitive maps separate groups (Coren & Girgus, 1 980).
The Gestalt psychologists also produced An analogous phenomenon occurs in judg-
striking demonstrations that people organize ments of distance between buildings (Hirtle
the visual world in principled ways, even & Jonides, 1 985 ): Residents of Ann Arbor
when that world is a meaningless array (see think that pairs of university (or town) build-
Hochberg, 1 978). Entities in space, espe- ings are closer than actually closer pairs of
cially ones devoid of meaning, are difficult buildings that belong to different groups, one
to understand in isolation but easier to grasp to the university and the other to the town.
in context. People group elements in an array Hierarchical organization of essentially flat
by proximity or similarity or good continua- spatial information also affects accuracy and
tion. One inevitable consequence of percep- time to make judgments of direction. People
tual organizing principles is distorted repre- incorrectly report that San Diego is west of
sentations. Reno. Presumably this error occurs because
Many of the distortions reviewed here people know the states to which the cities
have been instantiated in memory for per- belong and use the overall directions of the
ceptual arrays that do not stand for anything. states to infer the directions between cities in
They have also been illustrated in memory the states (Stevens & Coupe, 1 978). People
for cognitive maps and for environments. As are faster to judge whether one city is east or
such, they have implications for how people north of another when the cities belong to
reason in navigating the world, a visuospa- separate geographic entities than when they
tial reasoning task that people of all ages and are actually farther but part of the same ge-
parts of the world need to solve. Even more ographic entity (Maki, 1 981 ; Wilton, 1 979).
intriguing, many of these phenomena have A variant of hierarchical organization
analogs in abstract thought. occurs in locating entities belonging to a
For the myriad spatial distortions de- bounded region. When asked to remember
scribed here (and analyzed more fully in the location of a dot in a quadrant, people
place it closer to the center of the quadrant,

as if they were using general information
about the area to locate the entity contained
in it (Huttenlocher, Hedges, & Duncan,
1 991 ; Newcombe & Huttenlocher, 2000).
Amount of Information. That representa-

tions are constructed on the fly in the ser-
vice of particular judgments seems to be the
case for other distance estimates. Distances
between A and B, say two locations within a
town, are greater when there are more cross
streets or more buildings or more obstacles
or more turns on the route (Newcombe &
Liben, 1 982; Sadalla & Magel, 1 980; Sadalla
& Staplin, 1 980a, 1 980b; Thorndyke, 1 981 ),
as if people mentally construct a represen-
tation of a path from A to B from that in-
formation and use the amount of informa-
tion as a surrogate for the missing exact
distance information. There is an analogous Figure 1 0.4. Alignment. A significant majority
visual illusion: A line appears longer if bi- of participants think the incorrect lower map is
correct. The map has been altered so the United
sected and longer still with more tick marks
States and Europe and South American and
(at some point of clutter, the illusion ceases Africa are more aligned (after Tversky, 1 981 ).
or reverses).
Perspective. Steinberg regaled generations

of readers of the New Yorker and denizens of Alignment. Hierarchical, perspective, and
dormitory rooms with his maps of views of landmark effects can all be regarded as con-
the world. In the each view, the immediate sequences of the Gestalt principle of group-
surroundings are stretched and the rest of ing. Even groups of two equivalent entities
the world shrunk. The psychological reality can yield distortion. When people are asked
of this genre of visual joke was demonstrated to judge which of two maps is correct, a map
by Holyoak and Mah (1 982). They asked stu- of North and South America in which South
dents in Ann Arbor to imagine themselves America has been moved westward to over-
on either coast and to estimate the distances lap more with North America, or the ac-
between pairs of cities distributed more or tual map, in which the two continents barely
less equally on an east–west axis across the overlap, the majority of respondents pre-
states. Regardless of imagined perspective, fer the former (Tversky, 1 981 ; Figure 1 0.4).
students overestimated the near distances A majority of observers also prefer an in-
relative to the far ones. correct map of the Americas and Europe/
Africa/Asia in which the Americas are
Landmarks. Distance judgments are also moved northward so the United States and
distorted by landmarks. People judge the dis- Europe and South America and Africa are
tance of an undistinguished place to be closer more directly east–west. This phenomenon
to a landmark than vice versa (McNamara has been called alignment; it occurs when
& Diwadkar, 1 997; Sadalla, Burroughs, & people group two spatial entities and then
Staplin, 1 980). Landmark asymmetries vi- remember them more in correspondence
olate elementary metric assumptions, as- than they actually are. It appears not only
sumptions that are more or less realized in in judgments of maps of the world but also
real space. in judgments of directions between cities in
memory for artificial maps and in memory What might a representation that cap-
for visual blobs. tures all these distortions look like? It would
Spatial entities cannot be localized in look like nothing that can be sketched on a
isolation; they can be localized with re- sheet of paper, that is, is coherent in two di-
spect to other entities or to frames of ref- mensions. Landmark asymmetries alone dis-
erence. When they are coded with respect allow that. It does not seem likely that peo-
to another entity, alignment errors are likely. ple make these judgments by retrieving a
When entities are coded with respect to a coherent prestored mental representation, a
frame of reference, rotation errors, described “cognitive map,” and reading the direction or
in the next section, are likely. distance from it. Rather, it seems that people
construct representations on the fly, incorpo-
Rotation. When people are asked to place
rating only the information needed for that
a cutout of South America in a north–south
judgment, the relevant region, the specific
east–west frame, they upright it. A large spa-
entities within it. Some of the information
tial object, such as South America, induces
may be visuospatial from experience or from
its own coordinates along an axis of elon-
maps; some may be linguistic. For these rea-
gation and an axis parallel to that one. The
sons, “cognitive collage” seems a more apt
actual axis of elongation of South America
metaphor than “cognitive map” for what-
is tilted with respect to north–south, and
ever representations underlie spatial judg-
people upright it in memory. Similarly, peo-
ment and memory (Tversky, 1 993 ). Such
ple incorrectly report that Berkeley is east
representations are schematic; they leave
of Stanford when it is actually slightly west.
out much information and simplify others.
Presumably this occurs because they up-
Schematization occurs for at least two rea-
right the Bay Area, which actually runs
sons. More exact information may not be
at an angle with respect to north–south.
known and therefore cannot be represented.
This error has been called rotation; it oc-
More exact information may not even be
curs when people code a spatial entity with
needed because the situation on the ground
respect to a frame of reference (Tversky,
may fill it in. More information may over-
1 981 ; Figure 1 0.5 ). As for rotation, it ap-
load working memory, which is notoriously
pears in memory for artificial maps and un-
limited. Not only must the representation be
interpreted blobs, as well as in memory for
constructed in working memory, but a judg-
real environments. Others have replicated
ment must also be made on the representa-
this error in remembered directions and in
tion. Schematization may hide incoherence,
navigation (e.g., Glicksohn, 1 994; Lloyd &
or it may not be noticed. Schematization
Heivly, 1 987; Montello, 1 991 ; Presson &
necessarily entails systematic error.
Montello, 1 994).
Are Spatial Representations Incoherent? Why do Errors Persist? It is reasonable to
This brief review has brought evidence for wonder why so many systematic errors per-
distortions in memory and judgment for sist. Some reasons for the persistence of er-
shapes of objects, configurations of objects, ror have already been discussed – that there
and distances and directions between objects may be correctives on the ground, that some
that are a consequence of the organization errors are a consequence of the schematiza-
of the visuospatial information. These are tion processes that are an inherent part of
not errors of lack of knowledge; even ex- memory and information processing. Yet an-
perienced taxi drivers make them (Chase other reason is that the correctives are spe-
& Chi, 1 981 ). Moreover, many of these bi- cific – now I know that Rome is north of
ases have parallels in abstract domains, such Philadelphia – and do not affect or even
as judgments about members of one’s own make contact with the general information
social or political groups relative to judg- organizing principle that generated the error
ments about members of other groups (e.g., and that serves us well in many situations
Quattrone, 1 986). (e.g., Tversky, 2003 a).
Figure 1 0.5. Rotation. When asked to place a cutout of South America in a

NSEW framework, most participants upright it, as in the left example (after
Tversky, 1 981 ).
From Spatial to Abstract Reasoning Indeed, spatial reasoning is often studied

in the context of graphics, maps, diagrams,
Visuospatial reasoning does not only entail graphs, and charts. External representations
visuospatial transformations on visuospatial bear similarities to internal representations if
information. Visuospatial reasoning also in- only because they are creations of the human
cludes making inferences from visuospatial mind that is cognitive tools to increase the
information, whether that information is in power of the human mind. They also bear
the mind or in the world. An early demon- formal similarities in that both internal and
stration was the symbolic distance effect (e.g., external representations are mappings be-
Banks & Flora, 1 977; Moyer, 1 973 ; Paivio, tween elements and relations. External rep-
1 978). The time to judge which of two ani- resentations are constrained by a medium
mals is more intelligent or pleasant is faster and unconstrained by working memory; for
when the entities are farther on the dimen- this reason, inconsistencies, ambiguities, and
sion than when they are closer – as if people incompleteness may be reduced in external
were imagining the entities arrayed on a line representations.
corresponding to the abstract dimension. It
is easier, hence faster, to discriminate larger
Graphics: Elements
distances than smaller ones. Note that a sub-
jective experience of creating and using an The readiness with which people map ab-
image does not necessarily accompany mak- stract information onto spatial information
ing these and other spatial and abstract judg- is part of the reason for the widespread use
ments. Spatial thinking can occur regardless of diagrams to represent and convey ab-
of whether thinkers have the sensation of stract information from the sublime – the
using an image. So many abstract concepts harmonies of the spheres rampant in re-
have spatial analogs (for related discussion, ligions spanning the globe – to the mun-
see Holyoak, Chap. 6). dane corporate charts and statistical graphs.
Graphics, such as these, consist of elements poral, quantitative, and preference informa-
and spatial relations among the elements. In tion, for example, to place stickers for TV
contrast to written (alphabetic) languages, shows they loved, liked, or disliked. Almost
both elements and use of space in graph- all the preschoolers put the stickers on a line,
ics can convey meaning rather directly (e.g., preserving ordinal information. Children in
Bertin, 1 967/1 983 ; Pinker, 1 994; Tversky, the middle school years were able to repre-
1 995 , 2001 ; Winn, 1 989). Elements may sent interval information, but representing
consist of likenesses, such as road signs de- more than ordinal information was unusual
picting picnic tables, falling rocks, or deer. El- for younger children, despite strong manipu-
ements may also be figures of depiction, sim- lations to encourage them. Not only did chil-
ilar to figures of speech: synecdoche, where dren (and adults) spontaneously use spatial
a part represents a whole, common in ideo- relations to represent abstract relations, but
graphic writing, for example, using a ram’s children also showed preferences for the di-
horns to represent a ram; or metonomy, rection of increases in abstract dimensions.
where an association represents an entity Increases were represented from right to left
or action, which is common in computer or left to right (irrespective of direction of
menus, such as scissors to denote cut text writing for quantity and preference) or down
or a trashcan to allow deletion of files. to up. Representing increasing time or quan-
tity from up to down was avoided. Rep-
Graphics: Relations resenting increases as upward is especially
robust; it affects people’s ability to make
Relations among entities preserve different inferences about second-order phenomena
levels of information. The information pre- such as rate, which is spontaneously mapped
served is reflected in the mapping to space. In to slope, from graphs (Gattis, 2002; Gattis &
some cases, the information preserved is sim- Holyoak, 1 996). The correspondence of up-
ply categorical; space is used to separate en- ward to more, better, and stronger appears
tities belonging to different categories. The in language – on top of the world, rising to
spaces between words, for example, indi- higher levels of platitude – and in gesture –
cate that one set of letters belongs to one thumbs up, high five – as well as in graph-
meaning and another set to another mean- ics. These spontaneous and widespread cor-
ing. Space can also be used to represent ordi- respondences between spatial and abstract
nal information, for example, listing historic relations suggest they are cognitively natural
events in their order of occurrence, groceries (e.g., Tversky, 1 995 a, 2001 ).
by the order of encountering them in the The demonstrations of spontaneous use
supermarket, and companies by their prof- of spatial language and diagrammatic space
its. Space can be used to represent interval to represent abstract relations suggests that
or ratio information, as in many statistical spatial reasoning forms a foundation for
graphs, where the spatial distances among more abstract reasoning. In fact, children
entities reflect their distances on some used diagrammatic space to represent ab-
other dimension. stract relations earlier for temporal relations
than for quantitative ones, and earlier for
spontaneous use of space to represent quantitative relations than for preference re-
abstract relations lations (Tversky et al., 1 991 ). Corrobora-
Even preschool children spontaneously use tive evidence comes from simple spatial and
diagrammatic space to represent abstract in- temporal reasoning tasks, such as judging
formation (e.g., diSessa, Hammer, Sherin, & whether one object or person is before an-
Kolpakowski, 1 991 ; Tversky, Kugelmass, & other. In many languages, words for spatial
Winter, 1 991 ). In one set of studies (Tver- and temporal relations, such as before, after,
sky et al., 1 991 ), children from three lan- and in between, are shared. That spatial terms
guage communities were asked to place are the foundation for the temporal comes
stickers on paper to represent spatial, tem- from research showing priming of temporal
perspective from spatial perspective but not 65 .1 1 .1 2, ca. 5 00–480 b.c.) depicts a spear
vice versa (Boroditsky, 2000). More support that appears in one piece from the desired
for the primacy of spatial thinking for ab- viewing angle, but in three pieces when
stract thought comes from studies of prob- viewed straight on (J. P. Small, personal com-
lem solving (Carroll, Thomas, & Mulhotra, munication, May 27, 2003 ).
1 980). One group of participants was asked The perceptual and cognitive processes
to solve a spatial problem under constraints, and biases that people bring to graphics in-
arranging offices to facilitate communica- clude the catalog of mental representations
tion among key people. Another group was and transformations that was begun earlier.
asked to solve a temporal analog, arranging In that spirit, several researchers have devel-
processes to facilitate production. The solu- oped models for graph understanding, no-
tions to the spatial analog were superior to tably Pinker (1 990), Kosslyn (1 989, 1 994a),
those to the temporal analog. When exper- and Carpenter and Shah (1 998) (see Shah
imenters suggested using a diagram to yet 2003 /2004, for an overview). These mod-
another group solving the temporal analog, els take account of the particular perceptual
their success equaled that of the spatial ana- or imaginal processes that need to be ap-
log group. plied to particular kinds of graphs to yield
the right inferences. Others have taken ac-
diagrams facilitate reasoning count of perceptual and cognitive processing
in the construction of guidelines for design.
Demonstrating that using a spatial dia-
(e.g., Carswell & Wickens, 1 990; Cleveland,
gram facilitates temporal problem solving
1 985 ; Kosslyn, 1 994a; Tufte, 1 983 , 1 990,
also illustrates the efficacy of diagrams in
1 997; Wainer, 1 984, 1 997). In some cases the
thinking – a finding amply supported, even
design principles are informed by research,
for inferences entailing complex logic, such
but in most they are informed by the au-
as double disjunctions, although to succeed,
thors’ educated sensibilities and/or rules of
diagrams have to be designed with attention
thumb from graphic design.
to the ways that space and spatial entities are
used to make inferences (Bauer & Johnson-
Inferences from Diagrams: Structural and
Laird, 1 993 ). Middle school children study-
Functional. The existence of spontaneous
ing science were asked to put reminders
mapping of abstract information onto spatial
on paper. Those children who sketched dia-
does not mean that the meanings of diagrams
grams learned the material better than those
are transparent and can be automatically and
who did not (Rode & Stern, in press).
easily extracted (e.g., Scaife & Rogers, 1 995 ).
Diagrams can support many different classes
diagrams for communicating of inferences, notably, structural and func-
Many maps, charts, diagrams, and graphs are tional (e.g., Mayer & Gallini, 1 990). Struc-
meant to communicate clearly for travel- tural inferences, or inferences about quali-
ers, students, and scholars, whether they are ties of parts and the relations among them,
professionals or amateurs. To that end, they can be readily made from inspection of a di-
are designed to be clear and easy to com- agram. Distance, direction, size, and other
prehend, and they meet with varying suc- spatial qualities and properties can be “read
cess. Good design takes account of human off” a diagram (Larkin & Simon, 1 987), at
perceptual and cognitive skills, biases, and least with some degree of accuracy. “Reading
propensities. Even ancient Greek vases take off” entails using the sort of mental trans-
account of how they will be seen. Because formations discussed earlier, mental scan-
they are curved round structures, creating ning, mental distance, size, shape, or direc-
a veridical appearance requires artistry. The tion judgments or comparisons. Functional
vase “Achilles and Ajax playing a game” by inferences, or inferences about the behav-
the Kleophrades Painter in the Museum of ior of entities, cannot be readily made from
Metropolitan Art in New York City (Art. inspection of a diagram in the absence of
additional knowledge or assumptions that depend on the order of that pulley in the
are often a consequence of expertise. Spa- mechanics of the system. To answer func-
tial information may provide clues to functional questions, it is as if participants men-
tional information, but it is not sufficient tally animate the pulley system in order to
for concepts such as force, mass, and fric- generate an answer. Mental animation, how-
tion. Making functional inferences requires ever, does not seem to be a continuous pro-
linking perceptual information to concep- cess in the same way as physical animation.
tual information; it entails both knowing Rather, mental animation seems to be a se-
how to “read” a diagram, that is, what vi- quence of discrete steps – for example, the
suospatial features and relations to inspect first pulley goes clockwise, and the rope goes
or transform, and knowing how to interpret under the next pulley to the left of it, so it
that visuospatial information. must go counterclockwise. That continuous
Structural and functional inferences re- events are comprehended as sequences of
spectively correspond to two senses of men- steps is corroborated by research on segmen-
tal model prevalent in the field. In both cases, tation and interpretation of everyday events,
mental model contrasts with image. In one such as making a bed (Zacks, Tversky, &
sense, a mental model contrasts with an im- Iyer, 2001 ).
age in being more skeletal or abstract. This is It has long been known that domain ex-
the sense used by Johnson-Laird in his book, perts are more adept at functional inferences
Mental Models (1 983 ), in his explication of from diagrams than novices. Experts can
how people solve syllogisms (see Johnson- “see” sequences of organized chess moves
Laird, Chap. 9, and Evans, Chap. 8). Here, in a midgame display (Chase & Simon,
a mental model captures the structural re- 1 973 ; De Groot, 1 965 ). Similarly, experts
lations among the parts of a system. In the in Go (Reitman, 1 976), electricity (Egan
other sense, a mental model contrasts with & Schwartz, 1 979), weather (Lowe, 1 989),
an image in having moving parts, in being architecture (Suwa & Tversky, 1 997), and
“runnable” to derive functional or causal in- more make functional inferences with ease
ferences (for related discussion on causal- from diagrams in their domain. Novices
ity, see Buehner and Cheng, Chap. 7, and are no different from experts in structural
on problem solving, see Chi and Ohlsson, inferences.
Chap. 1 6). This is the sense used in another
book also titled Mental Models (Gentner & Inferences from Diagrams of Systems. The
Stevens, 1 983 ). One goal of diagrams is to distinction between structural and func-
instill mental models in the minds of their tional inferences is illustrated by work on
users. To that end, diagrams abstract the es- production and comprehension of diagrams
sential elements and relations of the system for mechanical systems, such as a car brake,
they are meant to convey. As is seen, convey- a bicycle pump, or a pulley system (Heiser
ing structure is more straightforward than & Tversky, 2002; Figure 1 0.6). Participants
conveying function. were asked to interpret a diagram of one of
What does it mean to say that a mental the systems. On the whole, their interpreta-
model is “runnable?” One example comes tions were structural, that is, they described
from research on pulley systems (Hegarty, the relations among the parts of the system.
1 992). Participants were timed to make two Another set of participants was given the
kinds of judgments from diagrams of three- same diagrams enriched by arrows indicat-
pulley systems. For true-false judgments of ing the sequence of action in the systems.
structural questions, such as “The upper left Those participants gave functional descrip-
pulley is attached to the ceiling,” response tions; that is, they described the step-by-step
times did not depend on which pulley in operation of the system. Reversing the tasks,
the system was queried. For judgments of other groups of participants read structural
functional questions, such as “The upper left or functional descriptions of the systems
pulley goes clockwise,” response times did and produced diagrams of them. Those who
Figure 1 0.6. Diagrams of a car brake and a bicycle pump (both after Mayer & Gallini, 1 990), and a
pulley system (after Hegarty, 1 992). Diagrams without arrows encouraged structural descriptions and
diagrams with arrows yielded functional descriptions (Heiser and Tversky, in press).
read functional descriptions used arrows in were able to infer functional information
their diagrams far more than those who read from functional text. This finding suggests
structural descriptions. Arrows are an ex- that people with high expertise/ability can
trapictorial device that have many meanings form unitary diagrammatic mental models
and functions in diagrams, such as point- of mechanical systems that allow spatial and
ing, indicating temporal sequence, causal se- functional inferences with relative ease, but
quence, and path and manner of motion people with low expertise/ability have and
(Tversky, 2001 ). use diagrammatic mental models for struc-
Expertise came into play in a study of tural information but rely on propositional
learning rather than interpretation. Partic- representations for functional information.
ipants learned one of the mechanical sys-
tems from a diagram with or without ar- Enriching Diagrams to Facilitate Functional
rows or from structural or functional text. Inferences. As noted, conveying spatial or
They were later tested on both structural and structural information is relatively straight-
functional information. Participants high in forward in diagrams. Diagrams can use space
expertise/ability (self-assessed) were able to to represent space in direct ways that are
infer both structural and functional infor- readily interpreted, as in maps and archi-
mation from either diagram. In contrast, tectural sketches. Conveying information
participants low in expertise/ability could that is not strictly spatial, such as change
derive structural but not functional informa- over time, forces, and kinematics, is less
tion from the diagrams. Those participants straightforward. Some visual conventions for
conveying information about dynamics or in problem solving. For example, a diagram

forces have been developed in comics and containing many arrows moving toward the
in diagrams (e.g., Horn, 1 998; Kunzle, 1 990; center of a display was superior to a diagram
McCloud, 1 994), and many of these con- with static arrows in suggesting the solution
ventions are cognitively compelling. Arrows to the Duncker radiation problem of how to
are a good example. As lines, arrows in- destroy a tumor without destroying healthy
dicate a relationship, a link. As asymmet- tissue (Pedone, Hummel, & Holyoak, 2001 ;
ric lines, they indicate an asymmetric rela- see Holyoak, Chap. 6, Figure 6.4). The fail-
tionship. The arrowhead is compelling as an ure of animations to improve learning itself
indicator of the direction of the asymme- becomes intuitive on further reflection. For
try because of its correspondence to arrow- one thing, animations are often complex, so
heads common as weapons in the world or it is difficult for a viewer to know where to
its correspondence to Vs created by paths look and to make sense of the timing of many
of downward moving water. A survey of moving components. However, even simple
diagrams in science and engineering texts animations, such as the path of a single mov-
shows wide use of extrapictorial diagram- ing circle, are not superior to static graphics
matic devices, such as arrows, lines, brack- (Morrison & Tversky, in press). The second
ets, and insets, although not always consis- reason for the lack of success of anima-
tently (Tversky, Heiser, Lozano, MacKenzie, tions is one reviewed earlier. If people think
& Morrison, in press). As a consequence, of dynamic events as sequences of steps
these devices are not always correctly in- rather than continuous animations, then
terpreted. Some diagrams of paradigmatic presenting change over time as sequences
processes, such as the nitrogen cycle in bi- of steps may make the changes easier
ology or the rock cycle in geology, contain to comprehend.
the same device, typically an arrow, with
multiple senses, pointing or labeling, indi-
Diagrams for Insight
cating movement path or manner, suggest-
ing forces or sequence, in the same diagram. Maps for highways and subways, diagrams
Of course, there is ambiguity in many words for assembly and biology, graphs for eco-
that appear commonly in scientific and other nomics and statistics, and plans for electri-
prose, words that parallel these graphic de- cians and plumbers are designed to be con-
vices, such as line and relationship. Neverthe- cise and unambiguous, although they may
less, the confusion caused by multiple senses not always succeed. Their inventors want to
of diagrammatic devices in interpreting di- communicate clearly and without error. In
agrams suggests that greater care in design contrast are graphics created to be ambigu-
is worthwhile. ous, to allow reinterpretation and discovery.
An intuitive way to visualize change over Art falls into both those categories. Early de-
time is by animations. After all, an animation sign sketches are meant to be ambiguous, to
uses change over time to convey change over commit the designer to only those aspects
time, a cognitively compelling correspon- of the design that are likely not to change,
dence. Despite the intuitive appeal, a sur- and to leave open other aspects. One reason
vey of dozens of studies that have compared for this is fixation; it is hard to “think out
animated graphics to informationally com- of the box.” Visual displays express, suggest,
parable static graphics in teaching a wide more than what they display. That expres-
variety of concepts, physical, mechanical, sion, in fact, came from solution attempts to
and abstract, did not find a single example the famous nine-dot problem (see Novick
of superior learning by animations (Tversky, & Bassok, Chap. 1 4, Fig. 1 4.4). Connect all
Morrison, & Betrancourt, 2002). Animations nine dots in a 3 × 3 array using four straight
may be superior for purposes other than lines without lifting the pen from the pa-
learning, for example, in maintaining per- per. The solution that is hard to see is to
spective or in calling attention to a solution extend the lines beyond the “box” suggested
Figure 1 0.7. A sketch by an architect designing a museum. Upon

reinspection, he made an unintentional discovery (Suwa, Tversky, Gero, &
Purcell, 2001 ).
by the 3 × 3 array. The Gestalt psychologists configuration in his own design, he was more
made us aware of the visual inferences the likely to invent a new design idea; similarly,
mind makes without reflection, grouping by when he invented a new design idea, he was
proximity, similarity, good continuation, and more likely to see a new configuration in his
common fate. sketch (Suwa et al., 2001 ; Figure 1 0.7).
Underlying these unintended discoveries
in sketches is a cognitive skill termed con-
inferences from sketches structive perception, which consists of two
Initial design sketches are meant to be am- independent processes: a perceptual one,
biguous for several reasons. In early stages of mentally reorganizing the sketch, and a con-
design, designers often do not want to com- ceptual one, relating the new organization
mit to the details of a solution, only the gen- to some design purpose (Suwa & Tversky,
eral outline, leaving open many possibilities; 2003 ). Participants adept at generating mul-
gradually, they will fill in the details. Per- tiple interpretations of ambiguous sketches
haps more important, skilled designers are excelled at the perceptual ability of finding
able to get new ideas by reexamining their hidden figures and at the cognitive ability of
own sketches, by having a conversation with finding remote meaningful associations, yet
their sketches, bouncing ideas off them (e.g., these two abilities were uncorrelated.
Goldschmidt, 1 994; Schon, 1 983 ; Suwa Expertise affects the kinds of inferences
& Tversky, 1 997; Suwa, Tversky, Gero, & designers are able to make from their
Purcell, 2001 ). They may construct sketches sketches. Novice designers are adept at per-
with one set of ideas in mind, but on later ceptual inferences, such as seeing proxim-
reexamination they see new configurations ity and similarity relations. Expert design-
and relations that generate new design ideas. ers are also adept at functional inferences,
The productive cycle between reexamining such as “seeing” the flow of traffic or the
and reinterpreting is revealed in the protocol changes in light from sketches (Suwa &
of one expert architect. When he saw a new Tversky, 1 997).
Conclusions and Future Directions Acknowledgments
Starting with the elements of visuospatial I am grateful to Phil Johnson-Laird and

representations in the mind, we end with Jeff Zacks for insightful suggestions on a
visuospatial representations created by the previous draft. Preparation of this chapter
mind. Like language, graphics serve to ex- and some of the research reported were
press and clarify individual spatial and ab- supported by Office of Naval Research,
stract concepts. Graphics have an advantage Grant Numbers NOOO1 4-PP-1 -O649,
over language in expressiveness (Stenning N0001 401 1 071 7, and N0001 4021 05 3 4 to
& Oberlander, 1 995 ); graphics use elements Stanford University.
and relations in graphic space to convey el-
ements and relations in real or metaphoric
space. As such, they allow inference based References
on the visuospatial processing that people
have become expert in as a part of their
Alibali, M. W., Bassok, M., Solomon, K. O., Syc,
everyday interactions with space (Larkin & S. E., & Goldin-Meadow, S. (1 999). Illuminat-
Simon, 1 997). As cognitive tools, graphics ing mental representations through speech and
facilitate reasoning, both by externalizing, gesture. Psychological Science, 1 0, 3 27–3 3 3 .
thus offloading memory and processing, and Anderson, J. R. (1 978). Arguments concerning
by mapping abstract reasoning onto spatial representations for mental imagery. Psychologi-
comparisons and transformations. Graphics cal Review, 85 , 249–277.
organize and schematize spatial and abstract Banks, W. P., & Flora, J. (1 977). Semantic and
information to highlight and focus the es- perceptual processes in symbolic comparisons.
sential information. Like language, graphics Journal of Experimental Psychology: Human Per-
serve to convey spatial and abstract concepts ception and Performance, 3 , 278–290.
to others. They make private thoughts pub- Barwise, J., & Etchemendy. (1 995 ). In J. Glasgow,
lic to a community that can then use and N. H. Naryanan, & G. Chandrasekeran (Eds.),
revise those concepts collaboratively. Diagrammatic reasoning: Cognitive and compu-
Of course, graphics and physical and men- tational perspectives (pp. 21 1 –23 4). Cambridge,
tal transformations on them are not identi- MA: MIT Press.
cal to visuospatial representations and rea- Bauer, M. I., & Johnson-Laird, P. N. (1 993 ). How
soning; they are an expression of it. Talk diagrams can improve reasoning. Psychological
Science, 6, 3 72–3 78.
about space and actions in it were probably
among the first uses of language, telling oth- Bertin, J. (1 967/1 983 ). Semiology of graphics: Di-
agrams, networks, maps. (Translated by W. J.
ers how to find their way and what to look for
Berg.) Madison: University of Wisconsin Press.
when they get there. Cognitive tools to pro-
Betrancourt, M., & Tversky, B. (in press). Sim-
mote visuospatial reasoning were among the
ple animations for organizing diagrams. Inter-
first to be invented from tokens for property
national Journal of Human-Computer Studies.
counts, believed to be the precursor of writ-
Beveridge, M., & Parkins, E. (1 987). Visual repre-
ten language (Schmandt-Besserat, 1 992), to
sentation in analogical problem solving. Mem-
trail markers to maps in the sand. Spatial ory and Cognition, 1 5 , 23 0–23 7.
thought, spatial language, and spatial graph-
Boroditsky, L. (2000). Metaphoric structuring:
ics reflect the importance and prevalence Understanding time through spatial
of visuospatial reasoning in our lives, from metaphors. Cognition, 75 , 1 –28.
knowing how to get home to knowing how Boroditsky, L. (2001 ). Does language shape
to design a house, from explaining how to thought?: Mandarin and English speakers’ con-
find the freeway to explaining how the judi- ceptions of time. Cognitive Psychology, 43 , 1 –
cial system works, from understanding basic 23 .
science to inventing new conceptions of the Boroditsky, L., Ham, W., & Ramscar, M. (2002).
origins of the universe. Where do we go from What is universal in event perception? Com-
here? Onward and upward! paring English and Indonesian speakers. In
W. D. Gray & C. D. Schunn (Eds.), Proceed- Configurality, stimulus homogeneity, and ob-
ings of the 2 4th annual meeting of the Cognitive ject integration. Perception and Psychophysics,
Science Society (pp. 1 3 6–1 441 ). Mahwah, NJ: 47, 1 5 7–1 68.
Erlbaum. Chase, W. G., & Chi, M. T. H. (1 981 ). Cognitive
Bruner, J. S. (1 973 ). Beyond the information given: skill: Implications for spatial skill in large-scale
Studies in the psychology of knowing. Oxford, environments. In J. H. Harvey (Ed.), Cognition,
UK: Norton. social behavior, and the environment (pp. 1 1 1 –
Bryant, D. J., & Tversky, B. (1 999). Mental rep- 1 3 6). Hillsdale, NJ: Erlbaum.
resentations of spatial relations from diagrams Chase, W. G., & Simon, H. A. (1 973 ). The mind’s
and models. Journal of Experimental Psychology: eye in chess. In W. G. Chase (Ed.), Visual infor-
Learning, Memory, and Cognition, 2 5 , 1 3 7–1 5 6. mation processing. New York: Academic Press.
Bryant, D. J., Tversky, B., & Franklin, N. (1 992). Cleveland, W. S. (1 985 ). The elements of graphing
Internal and external spatial frameworks for data. Monterey, CA: Wadsworth.
representing described scenes. Journal of Mem- Coren, S., & Girgus, J. S. (1 980). Principles of
ory and Language, 3 1 , 74–98. perceptual organization and spatial distortion:
Byrant, D. J., Tversky, B., & Lanca, M. (2001 ). Re- The Gestalt illusions. Journal of Experimental
trieving spatial relations from observation and Psychology: Human Performance and Perception,
memory. In E. van der Zee & U. Nikanne (Eds.), 6, 404–41 2.
Conceptual structure and its interfaces with other Cutting, J. E. (1 986). Perception with an eye for
modules of representation (pp. 1 1 6–1 3 9). Ox- motion. Cambridge, MA: Bradford Books/MIT
ford, UK: Oxford University Press. Press.
Bundesen, C., & Larsen, A. (1 975 ). Visual trans- Cutting J. E., & Kozlowski L. T. (1 977). Recog-
formation of size. Journal of Experimental nizing friends by their walk: Gait perception
Psychology: Human Perception and Performance, without familiarity cues. Bulletin of the Psycho-
1 , 21 4–220. nomic Society, 9, 3 5 3 –3 5 6.
Bundesen, C., Larsen, A., & Farrell, J. E. (1 981 ). De Groot, A. D. (1 965 ). Thought and choice in
Mental transformations of size and orientation. chess. The Hague: Mouton.
In A. Baddeley & J. Long (Eds.), Attention and Denis, M. (1 996). Imagery and the description
performance IX (pp. 279–294). Hillsdale, NJ: of spatial configurations. In M. de Vega & M.
Erlbaum. Marschark (Eds.), Models of visuospatial cogni-
Carmichael, R., Hogan, H. P., & Walter, A. A. tion (pp. 1 28–1 1 97). New York: Oxford Univer-
(1 93 2). An experimental study of the effect of sity Press.
language on the reproduction of visually per- Denis, M., & Kosslyn, S. M. (1 999). Scanning vi-
ceived forms. Journal of Experimental Psychol- sual mental images: A window on the mind.
ogy, 1 5 , 73 –86. Cahiers de Psychologie Cognitive, 1 8, 409–
Carpenter, P. A., & Shah, P. (1 998). A model 465 .
of the perceptual and conceptual processes in diSessa, A. A., Hammer, D., Sherin, B., &
graph comprehension. Journal of Experimental Kolpakowski, T. (1 991 ). Inventing graphing:
Psychology: Applied, 4, 75 –1 00. Meta-representational expertise in children.
Carroll, J. (1 993 ). Human cognitive abilities: A sur- Journal of Mathematical Behavior, 1 0, 1 1 7–1 60.
vey of factor-analytical studies. New York: Cam- Duncker, K. (1 945 ). On problem solving. Psycho-
bridge University Press. logical Monographs, 5 8, (Whole No. 270).
Carroll, J. M., Thomas, J. C., & Malhotra, A. Egan, D. E., & Schwartz, B. J. (1 979). Chunk-
(1 980). Presentation and representation in de- ing in recall of symbolic drawings. Memory and
sign problem solving. British Journal of Psychol- Cognition, 7, 1 49–1 5 8.
ogy, 71 , 1 43 –1 5 3 . Emmorey, K., Tversky, B., & Taylor, H. A. (2000).
Carswell, C. M. (1 992). Reading graphs: Interac- Using space to describe space: Perspective in
tion of processing requirements and stimulus speech, sign, and gesture. Journal of Spatial Cog-
structure. In B. Burns (Ed.), Percepts, concepts, nition and Computation, 2 , 1 5 7–1 80.
and categories (pp. 605 –645 ). Amsterdam: Englekamp, J. (1 998). Memory for action. Hove,
Elsevier. UK: Psychology Press.
Carswell, C. M., & Wickens, C. D. (1 990). The Finke, R. A. (1 990). Creative imagery. Hillsdale,
perceptual interaction of graphic attributes: NJ: Erlbaum.
Finke, R. A. (1 993 ). Mental imagery and creative Glicksohn, J. (1 994). Rotation, orientation, and
discovery. In B. Roskos-Evoldsen, M. J. Intons- cognitive mapping. American Journal of Psy-
Peterson, & R. E. Anderson (Eds.), Imagery, chology, 1 07, 3 9–5 1 .
creativity, and discovery. Amsterdam: North- Glushko, R. J., & Cooper, L. A. (1 978). Spa-
Holland. tial comprehension and comparison processes
Finke, R. A., & Pinker, S. (1 982). Spontaneous im- in verification tasks. Cognitive Psychology, 1 0,
agery scanning in mental extrapolation. Journal 3 91 –421 .
of Experimental Psychology: Learning, Memory, Goldschmidt, G. (1 994). On visual design
and Cognition, 8, 1 42–1 47. thinking: The vis kids of architecture. Design
Finke, R. A., & Pinker, S. (1 983 ). Directional Studies, 1 5 , 1 5 8–1 74.
scanning of remembered visual patterns. Jour- Gordon, I. E., & Hayward, S. (1 973 ). Second-
nal of Experimental Psychology: Learning, Mem- order isomorphism of internal representations
ory, and Cognition, 9, 3 98–41 0. of familiar faces. Perception and Psychophysics,
Finke, R. A., Pinker, S., & Farah, M. J. (1 989). 1 4, 3 3 4–3 3 6.
Reinterpreting visual patterns in mental im- Hegarty, M. (1 992). Mental animation: Inferring
agery. Cognitive Science, 1 2 , 5 1 –78. motion from static displays of mechanical sys-
Finke, R., & Shepard, R. N. (1 986). Visual func- tems. Journal of Experimental Psychology: Learn-
tions of mental imagery. In K. R. Boff, L. Kauf- ing, Memory, and Cognition, 1 8, 1 084–1 1 02.
man, & J. P. Thomas (Eds.), Handbook of percep- Hegarty, M., & Waller, D. (in press). Individual
tion and human performance (vol. II, pp. 1 9–5 5 ). differences in spatial abilities. In P. Shah & A.
New York: Wiley. Miyake (Eds.), Handbook of higher-level visu-
Franklin, N., & Tversky, B. (1 990). Searching ospatial thinking and cognition. Cambridge, UK:
imagined environments. Journal of Experimen- Cambridge University Press.
tal Psychology: General, 1 1 9, 63 –76. Heider, F., & Simmel, M. (1 944). An experimen-
Franklin, N., Tversky, B., & Coon, V. (1 992). tal study of apparent behavior. American Jour-
Switching points of view in spatial mental mod- nal of Psychology, 5 7, 243 –25 9.
els acquired from text. Memory and Cognition, Heiser, J., & Tversky, B. (2002). Diagrams and
2 0, 5 07–5 1 8. descriptions in acquiring complex systems. Pro-
Freyd, J., & Tversky, B. (1 984). The force of sym- ceedings of the meetings of the Cognitive Science
metry in form perception. American Journal of Society.
Psychology, 97, 1 09–1 26. Heiser, J., Tversky, B., Agrawala, M., & Hanra-
Gattis, M. (2002). Structure mapping in spatial han, P. (2003 ). Cognitive design principles for
reasoning. Cognitive Development, 1 7, 1 1 5 7– visualizations: Revealing and instantiating. In
1 1 83 . Proceedings of the Cognitive Science Society meet-
Gattis, M., & Holyoak, K. J. (1 996). Mapping ings.
conceptual to spatial relations in visual reason- Hirtle, S. C., & Jonides, J. (1 985 ). Evidence of hi-
ing. Journal of Experimental Psychology: Learn- erarchies in cognitive maps. Memory and Cog-
ing, Memory, and Cognition, 2 2 , 1 –9. nition, 1 3 , 208–21 7.
Gelman, R., Durgin, F., & Kaufman, L. Hochberg, J. (1 978). Perception. Englewood
(1 995 ). Distinguishing between animates and Cliffs, NJ: Prentice-Hall.
inanimates: Not by motion alone. In D. Sper- Holyoak, K. J., & Mah, W. A. (1 982). Cognitive
ber, D. Premack, & A. J. Premack (Eds.), Causal reference points in judgments of symbolic mag-
cognition: A multidisciplinary debate (pp. 1 5 0– nitude. Cognitive Psychology, 1 4, 3 28–3 5 2.
1 84). Oxford, UK: Clarendon Press. Horn, R. E. (1 998). Visual language. Bainbridge
Gentner, D., & Stevens, A. (1 983 ). Mental models. Island, WA: MacroVu, Inc.
Hillsdale, NJ: Erlbaum. Houser, N., & Kloesel, C. (1 992). The essential
Gick, M. L., & Holyoak, K. J. (1 980). Analogical Pierce, Vol. 1 and Vol. 2 . Bloomington: Indiana
problem solving. Cognitive Psychology, 1 2 , 3 06– University Press.
355. Huttenlocher, J., Hedges, L. V., & Duncan, S.
Gick, M. L., & Holyoak, K. J. (1 983 ). Schema in- (1 991 ). Categories and particulars: Prototype
duction and analogical transfer. Cognitive Psy- effects in estimating spatial location. Psycholog-
chology, 1 5 , 1 –28. ical Review, 98, 3 5 2–3 76.
Huttenlocher, J., Newcombe, N., & Sandberg, scanning. Journal of Experimental Psychology:
E. H. (1 994). The coding of spatial location in Human Perception and Performance, 4,
young children. Cognitive Psychology, 2 7, 1 1 5 – 47–60.
1 47. Kozhevnikov, M., Kosslyn, S., & Shepard, J. (in
Huttenlocher, J., & Presson, C. C. (1 979). The press). Spatial versus object visualizers: A new
coding and transformation of spatial informa- characterization of visual cognitive style. Mem-
tion. Cognitive Psychology, 1 1 , 3 75 –3 94. ory and Cognition.
Intraub, H., Bender, R. S., & Mangels, J. A. (1 992). Kozlowski, L. T., & Cutting, J. E. (1 977). Recog-
Looking at pictures but remembering scenes. nizing the sex of a walker from a dynamic point
Journal of Experimental Psychology: Learning, light display. Perception and Psychophysics, 2 1 ,
Memory, and Cognition, 1 8, 1 80–1 91 . 5 75 –5 80.
Iverson, J., & Goldin-Meadow, S. (1 997). What’s Kunzle, D. (1 990). The history of the comic strip.
communication got to do with it? Gesture in Berkeley: University of California Press.
children blind from birth. Developmental Psy- Lakoff, G., & Johnson, M. (1 980). Metaphors we
chology, 3 3 , 45 3 –467. live by. Chicago: University of Chicago Press.
Johansson, G. (1 973 ). Visual perception of bio- Larkin, J. H., & Simon, H. A. (1 987). Why a
logical motion and a model for its analysis. Per- diagram is (sometimes) worth ten thousand
ception and Psychophysics, 1 4, 201 –21 1 . words. Cognitive Science, 1 1 , 65 –99.
Johnson-Laird, P. N. (1 983 ). Mental models. Cam- Lee, P. U., & Tversky, B. (in press). Costs to switch
bridge, MA: Harvard University Press. perspective in acquiring but not in accessing
Kerst, S. M., & Howard, J. H. (1 978). Memory spatial information.
psychophysics for visual area and length. Mem- Levinson, S. C. (2003 ). Space in language and
ory and Cognition, 6, 3 27–3 3 5 . cognition: Explorations in cognitive diversity.
Kieras, D. E., & Bovair, S. (1 984). The role of a Cambridge, UK: Cambridge University Press.
mental model in learning to operate a device. Linn, M. C., & Petersen, A. C. (1 986). A
Cognitive Science, 1 1 , 25 5 –273 . meta-analysis of gender differences in spatial
Klatzky, R. L., Golledge, R. G., Cicinelli, J. G., & ability: Implications for mathematics and sci-
Pellegrino, J. W. (1 995 ). Performance of blind ence achievement. In J. S. Hyde & M. C.
and sighted persons on spatial tasks. Journal of Linn (Eds.), The psychology of gender: Advances
Visual Impairment and Blindness, 89, 70–82. through metaanalysis (pp. 67–1 01 ). Baltimore:
Klatzky, R. L., Loomis, J. M., Beall, A. C., Johns Hopkins University Press.
Chance, S. S., & Golledge, R. G. (1 998). Spatial Lowe, R. K. (1 989). Search strategies and infer-
updating of self-position and orientation during ence in the exploration of scientific diagrams.
real, imagined, and virtual locomotion. Psycho- Educational Psychology, 9, 27–44.
logical Science, 9, 293 –298. Maki, R. H. (1 981 ). Categorization and distance
Kosslyn, S. M. (1 976). Can imagery be dis- effects with spatial linear orders. Journal of
tinguished from other forms of internal Experimental Psychology: Human Learning and
representation? Memory and Cognition, 4, 291 – Memory, 7, 1 5 –3 2.
297. Martin, B., & Tversky, B. (2003 ). Segmenting am-
Kosslyn, S. M. (1 980). Image and mind. Cam- biguous events. Proceedings of the Cognitive Sci-
bridge, MA: Harvard University Press. ence Society meeting, Boston.
Kosslyn, S. M. (1 989). Understanding charts and Mayer, R. E. (1 998). Instructional technology. In
graphs. Applied Cognitive Psychology, 3 , 1 85 – F. Durso (Ed.), Handbook of applied cognition.
223 . Chichester, UK: Wiley.
Kosslyn, S. M. (1 994a). Elements of graph design. Mayer, R. E., & Gallini, J. K. (1 990). When is an
New York: Freeman. illustration worth ten thousand words? Journal
Kosslyn, S. M. (1 994b). Image and brain: The res- of Educational Psychology, 82 , 71 5 –726.
olution of the imagery debate. Cambridge, MA: McBeath, M. K., Schiano, D. J., & Tversky,
MIT Press. B. (1 997). Three-dimensional bilateral sym-
Kosslyn, S. M., Ball, T. M., & Rieser, B. J. metry bias in judgments of figural identity
(1 978). Visual images preserve metric spatial and orientation. Psychological Science, 8, 21 7–
information: Evidence from studies of image 223 .
McBeath, M. K., Shaffer, D. M., & Kaiser, M. K. Parsons, L. M. (1 987a). Imagined spatial transfor-
(1 995 ). How baseball outfielders determine mation of one’s body. Journal of Experimental
where to run to catch fly balls. Science, Psychology: General, 1 1 6, 1 72–1 91 .
2 68(5 21 0), 5 69–5 73 . Parsons, L. M. (1 987b). Imagined spatial trans-
McCloud, S. (1 994). Understanding comics. New formations of one’s hands and feet. Cognitive
York: HarperCollins. Psychology, 1 9, 1 92–1 91 .
McNamara, T. P., & Diwadkar, V. A. (1 997). Sym- Pedone, R., Hummel, J. E., & Holyoak, K. J.
metry and asymmetry of human spatial mem- (2001 ). The use of diagrams in analogical prob-
ory. Cognitive Psychology, 3 4, 1 60–1 90. lem solving. Memory and Cognition, 2 9, 21 4–
Michotte, A. E. (1 946/1 963 ). The perception of 221 .
causality. New York: Basic Books. Pinker, S. (1 980). Mental imagery and the third
Milgram, S., & Jodelet, D. (1 976). Psychological dimension. Journal of Experimental Psychology:
maps of Paris. In H. Proshansky, W. Ittelson, & General, 1 09, 3 5 4–3 71 .
L. Rivlin (Eds.), Environmental psychology (2nd Pinker, S. (1 990). A theory of graph comprehen-
edition, pp. 1 04–1 24). New York: Holt, Rine- sion. In R. Freedle (Ed.), Artificial intelligence
hart and Winston. and the future of testing (pp. 73 –1 26). Hillsdale,
Montello, D. R. (1 991 ). Spatial orientation and NJ: Erlbaum.
the angularity of urban routes: A field study. Pinker, S., Choate, P., & Finke, R. A. (1 984). Men-
Environment and Behavior, 2 3 , 47–69. tal extrapolation in patterns constructed from
Montello, D. R., & Pick, H. L., Jr. (1 993 ). Integrat- memory. Memory and Cognition, 1 2 , 207–21 8.
ing knowledge of vertically-aligned large-scale Pinker, S., & Finke, R. A. (1 980). Emergent
spaces. Environment and Behavior, 2 5 , 45 7– two-dimensional patterns in images rotated in
484. depth. Journal of Experimental Psychology: Hu-
Morrison, J. B., & Tversky, B. (in press). Failures man Perception and Performance, 6, 224–264.
of simple animations to facilitate learning. Prasad, S., Loula, F., & Shiffrar, M. (2003 ). Who’s
Moyer, R. S. (1 973 ). Comparing objects in there? Comparing recognition of self, friend,
memory: Evidence suggesting an internal psy- and stranger movement. Proceedings of the Ob-
chophysics. Perception and Psychophysics, 1 3 , ject Perception and Memory meeting.
1 80–1 84. Prentice, W. C. H. (1 95 4). Visual recognition of
Newcombe, N., & Huttenlocher, J. (2000). Mak- verbally labeled figures. American Journal of
ing space. Cambridge, MA: MIT Press. Psychology, 67, 3 1 5 –3 20.
Newcombe, N., Huttenlocher, J., Sandberg, E., Presson, C. C., & Montello, D. (1 994). Up-
Lee, E., & Johnson, S. (1 999). What do mises- dating after rotational and translational body
timations and asymmetries in spatial judgment movements: Coordinate structure of perspec-
indicate about spatial representation? Journal tive space. Perception, 2 3 , 1 447–1 45 5 .
of Experimental Psychology: Learning, Memory, Pylyshyn, Z. W. (1 973 ). What the mind’s eye tells
and Cognition, 2 5 , 986–996. the mind’s brain: A critique of mental imagery.
Newcombe, N., & Liben, L. S. (1 982). Barrier Psychological Bulletin, 80, 1 –24.
effects in the cognitive maps of children and Pylyshyn, Z. W. (1 979). The rate of “mental ro-
adults. Journal of Experimental Child Psychol- tation” of images: A test of a holistic analogue
ogy, 3 4, 46–5 8. hypothesis.. Memory and Cognition, 7, 1 9–28.
Novick, L. R., & Tversky, B. (1 987). Cognitive Pylyshyn, Z. W. (1 981 ). The imagery debate: Ana-
constraints on ordering operations: The case logue media versus tacit knowledge. Psycholog-
of geometric analogies. Journal of Experimental ical Review, 88, 1 6–45 .
Psychology: General, 1 1 6, 5 0–67. Quattrone, G. A. (1 986). On the perception of a
Oatley, K., & Yuill, N. (1 985 ). Perception of per- group’s variability. In S. Worchel & W. Austin
sonal and inter-personal action in a cartoon (Eds.), The psychology of intergroup relations
film. British Journal of Social Psychology, 2 4, (pp. 25 –48). New York: Nelson-Hall.
1 1 5 –1 24. Rauscher, F. H., Krauss, R. M., & Chen, Y. (1 996).
Paivio, A. (1 978). Mental comparisons involving Gesture, speech, and lexical access: The role of
abstract attributes. Memory and Cognition, 6, lexical movements in speech production. Psy-
1 99–208. chological Science, 7, 226–23 1 .
Reitman, W. (1 976). Skilled perception in GO: Schwartz, D. L., & Black, J. B. (1 996a). Analog
Deducing memory structures from interre- imagery in mental model reasoning: Depictive
sponse times. Cognitive Psychology, 8, 3 3 6– models. Cognitive Psychology, 3 0, 1 5 4–21 9.
3 5 6. Schwartz, D., & Black, J. B. (1 996b). Shuttling
Richardson, A. (1 967). Mental practice: A review between depictive models and abstract rules:
and discussion. Research Quarterly, 3 8, 95 – Induction and feedback. Cognitive Science, 2 0,
1 07. 45 7–497.
Rieser, J. J. (1 989). Access to knowledge of spatial Schwartz, D. L., & Black, T. (1 999). Inferences
structure at novel points of observation. Journal through imagined actions: Knowing by simu-
of Experimental Psychology: Learning, Memory, lated doing. Journal of Experimental Psychology:
and Cognition, 1 5 , 1 1 5 7–1 1 65 . Learning, Memory, and Cognition, 2 5 , 1 1 6–1 3 6.
Riley, D. A. (1 967). Memory for form. In L. Post- Schwartz, D. L., & Holton, D. L. (2000). Tool
man (Ed.), Psychology in the making (pp. 402– use and the effect of action on the imagina-
465 ). New York: Knopf. tion. Journal of Experimental Psychology: Learn-
Rode, C., & Stern, E. (in press). Diagrammatic ing, Memory, and Cognition, 2 6, 1 65 5 –1 665 .
tool use in male and female secondary school Sekiyama, K. (1 982). Kinesthetic aspects of men-
students. Learning and Instruction. tal representations in the identification of left
Ruby, P., & Decety, J. (2001 ). Effect of subjective and right hands. Perception and Psychophysics,
perspective taking during simulation of action: 3 2 , 89–95 .
A PET investigation of agency. Nature Neuro- Shah, P., & Carpenter, P. A. (1 995 ). Conceptual
science, 4, 5 46–5 5 0. limitations in comprehending line graphs. Jour-
Sadalla, E. K., Burroughs, W. J., & Staplin, nal of Experimental Psychology: General, 1 2 4,
L. J. (1 980). Reference points in spatial cog- 43 –61 .
nition. Journal of Experimental Psychology: Hu- Shah, P., Freedman, E. O., & Vekiri, I.
man Learning and Memory, 5 , 5 1 6–5 28. (2003 /2004). Graphical displays. In P. Shah &
Sadalla, E. K., & Magel, S. G. (1 980). The per- A. Miyake (Eds.), Handbook of higher-level visu-
ception of traversed distance. Environment and ospatial thinking and cognition. Cambridge, UK:
Behavior, 1 2 , 65 –79. Cambridge University Press.
Sadalla, E. K., & Montello, D. R. (1 989). Remem- Shah, P., & Miyake, A. (Eds.). (2003 /2004).
bering changes in direction. Environment and Handbook of higher-level visuospatial thinking
Behavior, 2 1 , 3 46–3 63 . and cognition. Cambridge, UK: Cambridge Uni-
Sadalla, E. K., & Staplin, L. J. (1 980a). An in- versity Press.
formation storage model or distance cognition. Shepard, R. N. (1 975 ). Form, formation, and
Environment and Behavior, 1 2 , 1 83 –1 93 . transformation of internal representations. In
Sadalla, E. K., & Staplin, L. J. (1 980b). The per- R. Solso (Ed.), Information processing and
ception of traversed distance: Intersections. En- cognition: The Loyola symposium. Hillsdale, NJ:
vironment and Behavior, 1 2 , 1 67–1 82. Erlbaum.
Scaife, M., & Rogers, Y. (1 996). External Shepard, R. N., & Chipman, S. F. (1 970). Second-
cognition: How do graphical representa- order isomorphism of internal representations:
tions work? International Journal of Human- Shapes of states. Cognitive Psychology, 1 , 1 –1 7.
Computer Studies, 45 , 1 85 –21 3 . Shepard, R. N., & Cooper, L. (1 982). Mental im-
Schiano, D., & Tversky, B. (1 992). Structure strat- ages and their transformation. Cambridge, MA:
egy in viewing simple graphs. Memory and Cog- MIT Press.
nition, 2 0, 1 2–20. Shepard, R. N., & Feng, C. (1 972). A chronomet-
Schmandt-Besserat, D. (1 992). Before writing, vol- ric study of mental paper folding. Cognitive Psy-
ume 1 : From counting to cuneiform. Austin: Uni- chology, 3 , 228–243 .
versity of Texas Press. Shepard, R. N., & Metzler, J. (1 971 ). Mental rota-
Schon, D. A. (1 983 ). The reflective practitioner. tion of three-dimensional objects. Science, 1 71 ,
New York: Harper Collins. 701 –703 .
Schwartz, D. L. (1 999). Physical imagery: Kine- Shepard, R. N., & Podgorny, P. (1 978). Cognitive
matic vs. dynamic models. Cognitive Psychol- processes that resemble perceptual processes.
ogy, 3 8, 43 3 –464. In W. K. Estes (Ed.), Handbook of learning and
cognitive processes (Vol. 5 , pp. 1 89–23 7). Hills- Tufte, E. R. (1 983 ). The visual display of quantita-
dale, NJ: Erlbaum. tive information. Chesire, CT: Graphics Press.
Shiffrar, M., & Freyd, J. J. (1 990). Apparent mo- Tufte, E. R. (1 990). Envisioning information.
tion of the human body. Psychological Science, Cheshire, CT: Graphics Press.
1 , 25 7–264. Tufte, E. R. (1 997). Visual explanations. Cheshire,
Spelke, E. P., Vishton, P. M., & von Hofsten, CT: Graphics Press.
C. (1 995 ). Object perception, object-directed Tversky, B. (1 969). Pictorial and verbal encod-
action, and physical knowledge in infancy. In ing in short-term memory. Perception and Psy-
M. S. Gazzaniga (Ed.), The cognitive neuro- chophysics, 5 , 275 –287.
sciences (pp. 275 –3 40). Cambridge, MA: MIT Tversky, B. (1 975 ). Pictorial encoding in
Press. sentence-picture comparison. Quarterly Jour-
Stenning, K., & Oberlander, J. (1 995 ). A cognitive nal of Experimental Psychology, 2 7, 405 –
theory of graphical and linguistic reasoning: 41 0.
Logic and implementation. Cognitive Science, Tversky, B. (1 981 ). Distortions in memory for
1 9, 97–1 40. maps. Cognitive Psychology, 1 3 , 407–43 3 .
Stevens, A., & Coupe, P. (1 978). Distortions in Tversky, B. (1 985 ). Categories and parts. In C.
judged spatial relations. Cognitive Psychology, Craig & T. Givon (Eds.), Noun classes and
1 0, 422–43 7. categorization (pp. 63 –75 ). Philadelphia: John
Suwa, M., & Tversky, B. (1 997). What architects Benjamins.
and students perceive in their sketches: A pro- Tversky, B. (1 991 ). Spatial mental models. In
tocol analysis. Design Studies, 1 8, 3 85 –403 . G. H. Bower (Ed.), The psychology of learning
Suwa, M., & Tversky, B. (2003 ). Constructive and motivation: Advances in research and theory
perception: A skill for coordinating perception (Vol. 27, pp. 1 09–1 45 ). New York: Academic
and conception. In Proceedings of the Cognitive Press.
Science Society meeting. Tversky, B. (1 992). Distortions in cognitive maps.
Suwa, M., Tversky, B., Gero, J., & Purcell, T. Geoforum, 2 3 , 1 3 1 –1 3 8.
(2001 ). Seeing into sketches: Regrouping parts Tversky, B. (1 993 ). Cognitive maps, cognitive col-
encourages new interpretations. In J. S. Gero, lages, and spatial mental models. In A. U. Frank
B. Tversky, & T. Purcell (Eds.), Visual and spa- & I. Campari (Eds.), Spatial information theory:
tial reasoning in design (pp. 207–21 9). Sydney, A theoretical basis for GIS (pp. 1 4–24). Berlin:
Australia: Key Centre of Design Computing Springer-Verlag.
and Cognition.
Tversky, B. (1 995 a). Cognitive origins of graphic
Talmy, L. (1 983 ). How language structures space. conventions. In F. T. Marchese (Ed.), Un-
In H. L. Pick, Jr., & L. P. Acredolo (Eds.), Spa- derstanding images (pp. 29–5 3 ). New York:
tial orientation: Theory, research and application Springer-Verlag.
(pp. 225 –282). New York: Plenum.
Tversky, B. (1 995 b). Perception and cognition of
Talmy, L. (2001 ). Toward a cognitive semantics. 2D and 3 D graphics. Human factors in comput-
Vol. 1 : Concept-structuring systems. Vol. 2 : Ty- ing systems. New York: ACM.
pology and process in concept structuring. Cam-
Tversky, B. (1 998). Three dimensions of spatial
bridge, MA: MIT Press.
cognition. In M. A. Conway, S. E. Gather-
Taylor, H. A., & Tversky, B. (1 992a). Descriptions cole, & C. Cornoldi (Eds.), Theories of mem-
and depictions of environments. Memory and ory II (pp. 25 9–275 ). Hove, UK: Psychological
Cognition, 2 0, 483 –496. Press.
Taylor, H. A., & Tversky, B. (1 992b). Spatial men- Tversky, B. (2000a). Some ways that maps and di-
tal models derived from survey and route de- agrams communicate. In C. Freksa, W. Brauer,
scriptions. Journal of Memory and Language, 3 1 , C. Habel, & K. F. Wender (Eds.), Spatial cog-
261 –282. nition II: Integration abstract theories, empirical
Taylor, H. A., & Tversky, B. (1 996). Perspective studies, formal models, and powerful applications
in spatial descriptions. Journal of Memory and (pp. 72–79). Berlin: Springer-Verlag.
Language, 3 5 , 3 71 –3 91 . Tversky, B. (2000b). Levels and structure of cog-
Thorndyke, P. (1 981 ). Distance estimation from nitive mapping. In R. Kitchin & S. M. Freund-
cognitive maps. Cognitive Psychology, 1 3 , 5 26– schuh (Eds.), Cognitive mapping: Past, present
5 5 0. and future (pp. 24–43 ). London: Routledge.
Tversky, B. (2000c). Remembering spaces. In E. Tversky, B., Morrison, J. B., & Betrancourt, M.
Tulving & F. I. M. Craik (Eds.), Handbook (2002). Animation: Does it facilitate? Interna-
of memory (pp. 3 63 –3 78). New York: Oxford tional Journal of Human-Computer Studies, 5 7,
University Press. 247–262.
Tversky, B. (2001 ). Spatial schemas in depictions. Tversky, B., Morrison, J. B., & Zacks, J. (2002). On
In M. Gattis (Ed.), Spatial schemas and ab- bodies and events. In A. Meltzoff & W. Prinz
stract thought (pp. 79–1 1 1 ). Cambridge, MA: (Eds.), The imitative mind: Development evolu-
MIT Press. tion, and brain bases (pp. 221 –23 2). Cambridge,
Tversky, B. (2003 a). Navigating by mind and by UK: Cambridge University Press.
body. In C. Freksa (Ed.), Spatial cognition III Tversky, B., & Schiano, D. (1 989). Perceptual and
(pp. 1 –1 0). Berlin: Springer-Verlag. conceptual factors in distortions in memory
Tversky, B. (2003 b). Structures of mental spaces: for maps and graphs. Journal of Experimental
How people think about space. Environment Psychology: General, 1 1 8, 3 87–3 98.
and Behavior, 3 5 , 66–80. Ullman, S. (1 996). High-level vision: Object recog-
Tversky, B. (in press). Functional significance of nition and visual cognition. Cambridge, MA:
visuospatial representations. In P. Shah & A. MIT Press.
Miyake (Eds.), Handbook of higher-level visu- van Dijk, T. A., & Kintsch, W. (1 983 ). Strategies of
ospatial thinking. Cambridge, UK: Cambridge discourse comprehension. New York: Academic
University Press. Press.
Tversky, B., Heiser, J., Lozano, S., MacKenzie, R., Wainer, H. (1 984). How to display data badly.
& Morrison, J. B. (in press). Enriching anima- The American Statistician, 3 8, 1 3 7–1 47.
tions. In R. Lowe & W. Schwartz (Eds.), Leaving Wainer, H. (1 997). Visual revelations. Graphi-
with animals: Research and Innovation Design. cal tales of fate and deception from Napoleon
New York: Cambridge University Press. Bonaparte to Ross Perot. New York: Springer-
Tversky, B., & Hemenway, K. (1 984). Objects, Verlag.
parts, and categories. Journal of Experimental Wexler, M., Kosslyn, S. M., & Berthoz, A. (1 998).
Psychology: General, 1 1 3 , 1 69–1 93 . Motor processes in mental rotation. Cognition,
Tversky, B., Kim, J., & Cohen, A. (1 999). Men- 68, 77–94.
tal models of spatial relations and transforma- Wilton, R. N. (1 979). Knowledge of spatial
tions from language. In C. Habel & G. Rickheit relations: The specification of information used
(Eds.), Mental models in discourse processing and in making inferences. Quarterly Journal of Ex-
reasoning (pp. 23 9–25 8). Amsterdam: North- perimental Psychology, 3 1 , 1 3 3 –1 46.
Holland. Winn, W. (1 989). The design and use of instruc-
Tversky, B., Kugelmass, S., & Winter, A. (1 991 ). tional graphics. In H. Mandl & J. R. Levin
Cross-cultural and developmental trends in (Eds.), Knowledge acquisition from text and pic-
graphic productions. Cognitive Psychology, 2 3 , tures (pp. 1 25 –1 43 ). Amsterdam: Elsevier.
5 1 5 –5 5 7. Wohlschlager, A., & Wolschlager, A. (1 998).
Tversky, B., & Lee, P. U. (1 998). How space struc- Mental and manual rotation. Journal of Exper-
tures language. In C. Freksa, C. Habel, & K. F. imental Psychology: Human Perception and Per-
Wender (Eds.), Spatial cognition: An interdisci- formance, 2 4, 3 7–41 2.
plinary approach to representation and process- Wraga, M., Creem, S. H., & Proffitt, D. R.
ing of spatial knowledge (pp. 1 5 7–1 75 ). Berlin: (2000). Updating displays after imagined ob-
Springer-Verlag. ject and viewer rotations. Journal of Experimen-
Tversky, B., & Lee, P. U. (1 999). Pictorial and ver- tal Psychology: Learning, Memory, and Cogni-
bal tools for conveying routes. In C. Freksa & tion, 1 5 1 –1 68.
D. M. Mark (Eds.), Spatial information theory: Zacks, J. M., Mires, J., Tversky, B., & Hazeltine, E.
Cognitive and computational foundations of geo- (2000). Mental spatial transformations of ob-
graphic information science (pp. 5 1 –64). Berlin: jects and perspective. Journal of Spatial Cogni-
Springer-Verlag. tion and Computation, 2 , 3 1 5 –3 3 2.
Tversky, B., Lee, P. U., & Mainwaring, S. (1 999). Zacks, J. M., Ollinger, J. M., Sheridan, M., & Tver-
Why speakers mix perspectives. Journal of sky, B. (2002). A parametric study of mental
Spatial Cognition and Computation, 1 , 3 99– spatial transformations of bodies. Neuroimage,
41 2. 1 6, 85 7–872.
Zacks, J. M., & Tversky, B. (in press). Multiple structure in events. Journal of Experimental
systems for spatial imagery: Transformations of Psychology: General, 1 3 6, 29–5 8.
objects and perspective. Zwaan, R. A., & Radvansky, G. A. (1 998). Situ-
Zacks, J., Tversky, B., & Iyer, G. (2001 ). ation models in language comprehension and
Perceiving, remembering and communicating memory. Psychological Bulletin, 1 2 3 , 1 62–1 85 .
Part III
JUDGMENT AND DECISION

MAKING

CHAPTER 1 1
Decision Making
Robyn A. LeBoeuf
Eldar B. Shafir
Introduction (For further reviews and edited collections,

see, among others, Hastie & Dawes, 2001 ;
People make countless decisions every day, Goldstein & Hogarth, 1 997; Kahneman &
ranging from ones that are barely noticed Tversky, 2000.)
and soon forgotten (“What should I drink The classical treatment of decision mak-
with lunch?” “What should I watch on ing, known as the “rational theory of choice”
TV?”), to others that are highly consequen- or the “standard economic model,” posits
tial (“How should I invest my retirement that people have orderly preferences that
funds?” “Should I marry this person?”). In obey a few simple and intuitive axioms.
addition to having practical significance, de- When faced with a choice problem, deci-
cision making plays a central role in many sion makers are assumed to gauge each al-
academic disciplines: Virtually all the social ternative’s “subjective utility” and to choose
sciences – including psychology, sociology, the alternative with the highest. In the face
economics, political science, and law – rely of uncertainty about whether outcomes will
on models of decision-making behavior. This obtain, decision makers are believed to cal-
combination of practical and scholarly fac- culate an option’s subjective expected utility,
tors has motivated great interest in how de- which is the sum of its subjective utilities
cisions are and should be made. Although over all possible outcomes weighted by these
decisions can differ dramatically in scope outcomes’ estimated probabilities of occur-
and content, research has uncovered sub- rence. Deciding then is simply a matter of
stantial and systematic regularities in how choosing the option with the greatest ex-
people make decisions and has led to the pected utility; indeed, choice is believed to
formulation of general psychological prin- reveal a person’s subjective utility functions
ciples that characterize decision-making be- and, hence, his or her underlying preferences
havior. This chapter provides a selective re- (e.g., Keeney & Raiffa, 1 976; Savage, 1 95 4;
view of those regularities and principles. von Neumann & Morgenstern, 1 944).
2 43
Although highly compelling in principle, the fundamental ways in which they conflict
the standard view has met with persistent with normative expectations.
critiques addressing its inadequacy as a de-
scription of how decisions are actually made.
For example, Simon (1 95 5 ) suggested re- Choice Under Uncertainty
placing the rational model with a framework
that accounted for a variety of human re-
In the context of some decisions, the avail-
source constraints, such as bounded atten-
ability of options is essentially certain (as
tion and memory capacity, as well as limited
when choosing items from a menu or cars
time. According to this bounded rationality
at a dealer’s lot). Other decisions are made
view, it was unreasonable to expect decision
under uncertainty: They are “risky” when
makers to exhaustively compute options’ ex-
the probabilities of the outcomes are known
pected utilities.
(e.g., gambling or insurance) or, as with most
Other critiques have focused on system-
real world decisions, they are “ambiguous,”
atic violations of even the most funda-
in that precise likelihoods are not known and
mental requirements of the rational the-
must be estimated by the decision maker.
ory of choice. According to the theory, for
When deciding under uncertainty, a person
example, preferences should remain unaf-
must consider both the desirability of the po-
fected by logically inconsequential factors
tential outcomes and their likelihoods; much
such as the precise manner in which op-
research has addressed the manner in which
tions are described, or the specific proce-
these factors are estimated and combined.
dure used to elicit preferences (Arrow, 1 95 1 ,
1 988; Tversky & Kahneman, 1 986). How-
Prospect Theory
ever, compelling demonstrations emerged
showing that choices failed to obey sim- When facing a choice between a risky
ple consistency requirements and were, in- prospect that offers a 5 0% chance to win
stead, affected by nuances of the decision $200 (and a 5 0% chance to win nothing) ver-
context that were not subsumed by the nor- sus an alternative of receiving $1 00 for sure,
mative accounts (e.g., Lichtenstein & Slovic, most people prefer the sure gain over the
1 971 , 1 973 ; Tversky & Kahneman, 1 981 ). In gamble, although the two prospects have the
particular, preferences appeared to be con- same expected value. (The expected value
structed, not merely revealed, in the making is the sum of possible outcomes weighted
of decisions (Slovic, 1 995 ), and this, in turn, by their probabilities of occurrence. The ex-
was shown to lead to significant and system- pected value of the gamble above is .5 0 *
atic departures from normative predictions. $200 + .5 0 * 0 = $1 00.) Such preference
The mounting evidence has forced a clear for a sure outcome over a risky prospect of
division between normative and descriptive equal expected value is called risk aversion;
treatments. The rational model remains the people tend to be risk averse when choos-
normative standard against which decisions ing between prospects with positive out-
are often judged, both by experts and by comes. The tendency toward risk aversion
novices (cf. Stanovich, 1 999). At the same can be explained by the notion of dimin-
time, substantial multidisciplinary research ishing sensitivity first formalized by Daniel
has made considerable progress in develop- Bernoulli (1 73 8/1 95 4). Bernoulli proposed
ing models of choice that are descriptively that preferences are better described by ex-
more faithful. Descriptive accounts as ele- pected utility than by expected value and
gant and comprehensive as the normative suggested that “the utility resulting from a
model are not yet (and may never be) avail- fixed small increase in wealth will be in-
able, but research has uncovered robust prin- versely proportional to the quantity of goods
ciples that play a central role in the mak- previously possessed,” thus effectively pre-
ing of decisions. In what follows, we review dicting a concave utility function (a func-
some of these principles, and we consider tion is concave if a line joining two points
decision making 2 45
Subjective & Kahneman, 1 992), a highly influential de-

Value (U($)) scriptive theory of choice. The value func-
tion of prospect theory, illustrated in Fig-
ure 1 1 .2, has three important properties: (1 )
it is defined on gains and losses rather than
total wealth, capturing the fact that peo-
ple normally treat outcomes as departures
from a current reference point (rather than
in terms of final assets, as posited by the ra-
tional theory of choice); (2) it is steeper for
$ losses than for gains, thus, a loss of $X is
100 200
more aversive than a gain of $X is attractive,
Figure 1 1 .1 . A concave function for gains. capturing the phenomenon of loss aversion;
and (3 ) it is concave for gains and convex for
losses, predicting, as described previously,
on the curve lies below the curve). The ex- risk aversion in the domain of gains and risk
pected utility of a gamble offering a 5 0% seeking in the domain of losses.
chance to win $200 (and 5 0% nothing) is In addition, according to prospect the-
.5 0 * u($200), where u is the person’s utility ory, probabilities are not treated linearly;
function (u(0) = 0). As illustrated in Fig- instead, people tend to overweight small
ure 1 1 .1 , diminishing sensitivity and a con- probabilities and to underweight large ones
cave utility function imply that the subjec- (Gonzalez & Wu, 1 999; Kahneman &
tive value attached to a gain of $1 00 is more Tversky, 1 979; Prelec, 2000). This, among
than one-half of the value attached to a gain other things, has implications for the attrac-
of $200 (u(1 00) > .5 *u(200)), which entails tiveness of gambling and of insurance (which
preference for the sure $1 00 gain and, hence, typically involve low-probability events),
risk aversion. and it yields substantial discontinuities at the
However, when asked to choose between endpoints, where the passage from impos-
a prospect that offers a 5 0% chance to lose sibility to possibility and from high likeli-
$200 (and a 5 0% chance of nothing) versus hood to certainty can have inordinate im-
losing $1 00 for sure, most people prefer the pact (Camerer, 1 992; Kahneman & Tversky,
risky gamble over the certain loss. This is be- 1 979). Furthermore, research has suggested
cause diminishing sensitivity applies to nega- that the weighting of probabilities can be
tive as well as to positive outcomes: The im- influenced by factors such as the decision
pact of an initial $1 00 loss is greater than that
of an additional $1 00, which implies a con- Value
vex value function for losses. The expected
utility of a gamble offering a 5 0% chance to
lose $200 is thus greater (i.e., less negative)
than that of a sure $1 00 loss: (.5 0*u(−$200)
> u(−$1 00)). Such preference for a risky
prospect over a sure outcome of equal ex-
pected value is described as risk seeking. With Losses Gains
the exception of prospects that involve very
small probabilities, risk aversion is generally
observed in choices involving gains, whereas
risk seeking tends to hold in choices involv-
ing losses.
These insights led to the S-shaped value
function that forms the basis for prospect
theory (Kahneman & Tversky, 1 979; Tversky Figure 1 1 .2 . Prospect theory’s value function.
maker’s feeling of competence in a domain quences. Consider, for example, the fol-
(Heath & Tversky, 1 991 ), or by the level of lowing choice between gambles (Tversky &
affect engulfing the options under consid- Kahneman, 1 981 , p. 45 5 ):
eration (Rottenstreich & Hsee, 2001 ). Such
attitudes toward value and chance entail A. A 2 5 % chance to win $3 0
substantial sensitivity to contextual factors B. A 2 0% chance to win $45
when making decisions, as discussed further
Faced with this choice, the majority (5 8%)
in the next section.
of participants preferred option B. Now, con-
sider the following extensionally equivalent
The Framing of Risky Decisions problem:
The previously described attitudes toward In the first stage of this game, there is a 75 %
risky decisions appear relatively straightfor- chance to end the game without winning
ward, and yet, they yield choice patterns that anything, and a 2 5 % chance to move into
conflict with normative standards. Perhaps the second stage. If you reach the second
the most fundamental are “framing effects” stage, you have a choice between:
(Tversky & Kahneman, 1 981 , 1 986): Because C. A sure win of $3 0
risk attitudes differ when outcomes are seen D. An 80% chance to win $45
as gains as opposed to losses, the same deci-
sion can be framed to elicit conflicting risk The majority (78%) of participants now pre-
attitudes. In one example, respondents were ferred option C over option D, even though,
asked to assume themselves $3 00 richer and when combined with the “first stage” of the
to choose between a sure gain of $1 00 or an problem, options C and D are equivalent to
equal chance to win $200 or nothing. Alter- A and B, respectively. Majority preference
natively, they were asked to assume them- thus reverses as a function of a supposedly
selves $5 00 richer and to choose between irrelevant contextual variation. In this par-
a sure loss of $1 00 and an equal chance ticular case, the reversal is due to the impact
to lose $200 or nothing. The two prob- of apparent certainty (which renders option
lems are identical in terms of final assets: C more attractive) and to another important
Both amount to a choice between $400 for factor, namely, people’s tendency to contem-
sure versus an even chance at $3 00 or $5 00 plate decisions from a “local” rather than a
(Tversky & Kahneman, 1 986). People, how- “global” perspective. Note that a combina-
ever, tend to “accept” the provided frame tion of the two stages in the last problem
and consider the problem as presented, fail- would have easily yielded the same repre-
ing to reframe it from alternate perspectives. sentation as that of the preceding version.
As a result, most people choosing between However, rather than amalgamating across
“gains” show a risk-averse preference for events and decisions, as is often assumed
the certain ($400) outcome, whereas most in normative analyses, people tend to con-
of those choosing between “losses” express template each decision separately, which can
a risk-seeking preference for the gamble. yield conflicting attitudes across choices. We
This pattern violates the normative require- return to the issue of local versus global per-
ment of “description invariance,” according spectives in a later section.
to which logically equivalent descriptions of As a further example of framing, it is
a decision problem should yield the same interesting to note that, even within the
preferences (see Kühberger, 1 995 ; Levin, domain of losses, risk attitudes can re-
Schneider, & Gaeth, 1 998, for reviews). verse depending on the context of deci-
The acceptance of the problem frame, sion. Thus, participants actually tend to
combined with the nonlinear weighting of prefer a sure loss to a risky prospect
probabilities and, in particular, with the el- when the sure loss is described as “insur-
evated impact of perceived “certainty,” has ance” against a low-probability, high-stakes
a variety of normatively troubling conse- loss (Hershey & Schoemaker, 1 980). The
insurance context brings to the forefront Riskless Choice

a social norm, making the insurance pre-
mium appear more like an investment than Not all decisions involve risk or uncertainty.
a loss, with the low-probability, high-stakes For example, when choosing between items
loss acquiring the character of a neglected in a store, we can be fairly confident that
responsibility rather than a considered risk the displayed items are available. (Naturally,
(e.g., Hershey & Schoemaker, 1 980; Kahne- there could be substantial uncertainty about
man & Tversky, 1 979; Slovic, Fischhoff, & one’s eventual satisfaction with the choice,
Lichtenstein, 1 988). but we leave those considerations aside for
The framing of certainty and risk also im- the moment.) The absence of uncertainty,
pacts people’s thinking about financial trans- however, does not eliminate preference mal-
actions through inflationary times, as illus- leability, and many of the principles dis-
trated by the following example. Participants cussed previously continue to exert an im-
were asked to imagine that they were pact even on riskless decisions. Recall that
in charge of buying computers (currently outcomes can be framed as gains or as losses
priced at $1 000) that would be delivered and relative to a reference point, that losses typ-
paid for 1 year later, by which time, due to ically “loom larger” than comparable gains,
inflation, prices were expected to be approx- and that people tend to accept the presented
imately 20% higher (and equally likely to be frame. These factors, even in the absence of
above or below the projected 20%). All par- risk, can yield normatively problematic de-
ticipants essentially faced the same choice: cision patterns.
They could agree to pay either $1 200 (20%
more than the current price) upon delivery
Loss Aversion and the Status Quo
next year, or they could agree to pay the
going market price in 1 year, which would A fundamental fact about the making of de-
depend on inflation. Reference points were cisions is loss aversion: According to loss
manipulated to make one option appear cer- aversion, the pain associated with giving
tain while the other appeared risky: Half up a good is greater than the pleasure
the participants saw the contracts framed associated with obtaining it (Tversky &
in nominal terms so the $1 200 price ap- Kahneman, 1 991 ). This yields “endowment
peared certain, whereas the future nominal effects,” wherein the mere possession of a
market price (which could be more or less good (such that parting with it is rendered
than $1 200) appeared risky. Other partici- a loss) can lead to higher valuation of the
pants saw the contracts framed in real terms, good than if it were not in one’s possession.
so the future market price appeared appro- A classic experiment illustrates this point
priately indexed, whereas precommitting to (Kahneman, Knetsch, & Thaler, 1 990). Par-
a $1 200 price, which could be lower or ticipants were arbitrarily assigned to be sell-
higher than the actual future market price, ers or choosers. The sellers were each given an
seemed risky. As predicted, in both con- attractive mug, which they could keep, and
ditions respondents preferred the contract were asked to indicate the lowest amount for
that appeared certain, preferring the fixed which they would sell the mug. The choosers
price in the nominal frame and the indexed were not given a mug but were instead asked
price in the “real” frame (Shafir, Diamond, & to indicate the amount of money that the
Tversky, 1 997). As with many psychological mug was worth to them. Additional pro-
tendencies, the preference for certainty can cedural details were designed to promote
mislead in some circumstances, but it may truthful estimates; in short, an official mar-
also be exploited for beneficial ends, such as ket price, $X, was to be revealed; all those
when the certainty associated with a partic- who valued the mug at more than $X re-
ular settlement is highlighted to boost the ceived a mug, whereas those who valued the
chance for conflict resolution (Kahneman & mug below $X received $X. All participants,
Tversky, 1 995 ). whether sellers or choosers, essentially faced
the same task of determining a price at which in favor of the limited alternative. Whereas
they would prefer money over the mug. Be- only about 20% of New Jersey drivers chose
cause participants were randomly assigned to acquire the full right to sue, approxi-
to be sellers or choosers, standard expecta- mately 75 % of Pennsylvania drivers chose to
tions are that the two groups would value the retain it. The difference in adoption rates
mugs similarly. Loss aversion, however, sug- resulting from the alternate defaults had
gests that the sellers would set a higher price financial repercussions estimated at nearly
(for what they were about to “lose”) than $200 million (Johnson, Hershey, Meszaros,
the choosers. Indeed, sellers’ median asking & Kunreuther, 1 993 ). Another naturally oc-
price was twice that of choosers. curring “experiment” was more recently ob-
Another manifestation of loss aversion is served in Europeans’ choices to be potential
a general reluctance to trade, illustrated in organ donors (Johnson & Goldstein, 2003 ).
a study in which one-half of the subjects In some European nations drivers are by de-
were given a decorated mug, whereas the fault organ donors unless they elect not to be,
others were given a bar of Swiss choco- whereas in other European nations they are,
late (Knetsch, 1 989). Later, each subject was by default, not donors unless they choose to
shown the alternative gift and offered the be. Observed rates of organ donors are al-
opportunity to trade his or her gift for the most 98% in the former nations and about
other. Because the initial allocation of gifts 1 5 % in the latter, a remarkable difference
was arbitrary and transaction costs minimal, given the low transaction costs and the sig-
economic theory predicts that about one- nificance of the decision.
half the participants would exchange their For another example, consider two candi-
gifts. Loss aversion, however, predicts that dates, Frank and Carl, who are running for
most participants would be reluctant to give election during difficult times and have an-
up a gift in their possession (a loss) to ob- nounced target inflation and unemployment
tain the other (a gain). Indeed, only 1 0% figures. Frank proposes a 42% yearly infla-
of the participants chose to trade. This con- tion rate and 1 5 % unemployment, whereas
trasts sharply with standard analysis in which Carl envisions 23 % inflation and 22% un-
the value of a good does not change when it employment. When Carl’s figures repre-
becomes part of one’s endowment. sent the status quo, Frank’s plans entail
Loss aversion thus promotes stability greater inflation and diminished unemploy-
rather than change. It implies that people ment, whereas when Frank’s figures are the
will not accept an even chance to win or lose status quo, Carl’s plan entails lower inflation
$X, because the loss of $X is more aversive and greater unemployment. As predicted,
than the gain of $X is attractive. In particular, neither departure from the “current” state
it predicts a strong tendency to maintain the was endorsed by the majority of respon-
status quo because the disadvantages of de- dents, who preferred whichever candidate
parting from it loom larger than the advan- was said to represent the status quo (Quat-
tages of its alternative (Samuelson & Zeck- trone & Tversky, 1 988).
hauser, 1 988). A striking tendency to main- The status quo bias can affect decisions
tain the status quo was observed in the con- in domains as disparate as job selection
text of insurance decisions when New Jersey (Tversky & Kahneman, 1 991 ), investment al-
and Pennsylvania both introduced the op- location (Samuelson & Zeckhauser, 1 988),
tion of a limited right to sue, entitling auto- and organ donation (Johnson & Goldstein,
mobile drivers to lower insurance rates. The 2003 ), and it can also hinder the negotiated
two states differed in what they offered con- resolution of disputes. If each disputant sees
sumers as the default option: New Jersey the opponent’s concessions as gains but its
motorists had to acquire the full right to own concessions as losses, agreement will be
sue (transaction costs were minimal: a signa- hard to reach because each will perceive it-
ture), whereas in Pennsylvania, the full right self as relinquishing more than it stands to
was the default, which could be forfeited gain. Because loss aversion renders foregone
gains more palatable than comparable losses often required to translate from one frame
(cf. Kahneman, 1 992), an insightful medi- to another.
ator may do best to set all sides’ refer-
ence points low, thus requiring compromises
over outcomes that are mostly perceived
as gains. Conflict and Reasons
Choices can be hard to make. People often

Semantic Framing
approach difficult decisions by looking for
The tendency to adopt the provided a compelling rationale for choosing one op-
frame can lead to “attribute-framing” effects tion over another. At times, compelling ra-
(Levin, Schneider, & Gaeth, 1 998). A pack- tionales are easy to come by and to articulate,
age of ground beef, for example, can be whereas other times no compelling ratio-
described as 75 % lean or else as 25 % fat. nale presents itself, rendering the conflict be-
Not surprisingly, it tends to be evaluated tween options hard to resolve. Such conflict
more favorably under the former description can be aversive and can lead people to post-
than the latter (Levin, 1 987; see also Levin, pone the decision or to select a “default”
Schnittjer, & Thee, 1 988). Similarly, a com- alternative. The tendency to rely on com-
munity with a 3 .7% crime rate tends to be pelling rationales that help minimize conflict
allocated greater police resources than one appears benign; nonetheless, it can generate
described as 96.3 % “crime free” (Quattrone preference patterns that are fundamentally
& Tversky, 1 988). Attribute-framing effects different from those predicted by normative
are not limited to riskless choice; for exam- accounts based on value maximization.
ple, people are more favorably inclined to-
ward a medical procedure when its chance
Decisional Conflict
of success, rather than failure, is highlighted
(Levin et al., 1 988). One way to avoid conflict in choice is to
Attribute-framing manipulations affect opt for what appears to be no choice at
the perceived quality of items by changing all, namely, the status quo. In one exam-
their descriptions. Part of the impact of such ple (Tversky & Shafir, 1 992a), participants
semantic factors may be due to spreading ac- who were purportedly looking to buy a CD
tivation (Collins & Loftus, 1 975 ), wherein player were presented with a Sony player
positive words (e.g., “crime-free”) activate that was on a 1 -day sale for $99, well below
associated positive concepts, and negative the list price. Two-thirds of the participants
words activate negative concepts. The psy- said they would buy such a CD player. An-
chophysical properties of numbers also con- other group was presented with the same
tribute to these effects. A 96.3 % “crime Sony player and also with a top-of-the-line
free” rate, for example, appears insubstan- Aiwa player for $1 5 9. In the latter case, only
tially different from 1 00% and suggests that 5 4% expressed interest in buying either op-
“virtually all” are law abiding. The difference tion, and a full 46% preferred to wait until
between 0% and 3 .7%, in contrast, appears they learned more about the various mod-
more substantial and suggests the need for els. The addition of an attractive option in-
intervention (Quattrone & Tversky, 1 988). creased conflict and diminished the number
Like the risk attitudes previously described, who ended up with either player, despite the
such perceptual effects often seem natural fact that most preferred the initial alterna-
and harmless in their own right but can tive to the status quo. This violates what is
generate preference inconsistencies that ap- known as the regularity condition, according
pear perplexing, especially given the rather to which the “market share” of an existing
mild and often unavoidable manipulations option – here, the status quo – cannot be in-
(after all, things need to be described one creased by enlarging the offered set (see also
way or another) and the trivial computations Tversky & Simonson, 1 993 ).
A related pattern was documented using patients awaiting surgery ought to be oper-
tasting booths in an upscale grocery store, ated on first. Half the respondents were pre-
where shoppers were offered the opportu- sented with two patients, a woman in her
nity to taste any of 6 jams in one condition, early fifties and a man in his seventies. Others
or any of 24 jams in the second (Iyengar saw the same two patients along with a third,
& Lepper, 2000). In the 6-jams condition, a woman in her early fifties highly compara-
40% of shoppers stopped to have a taste ble to the first, so it was difficult to think of a
and, of those, 3 0% proceeded to purchase rationale for choosing either woman over the
a jam. In the 24-jam condition, a full 60% other. As predicted, more physicians (5 8%)
stopped to taste, but only 3 % purchased. chose to operate on the older man in the
Presumably, the conflict between so many latter version, where the two highly compa-
attractive options proved hard to resolve. rable women presented decisional conflict,
Further studies found that those choos- than in the former version (3 8%), in which
ing goods (e.g., chocolate) from a larger the choice was between only one younger
set later reported lower satisfaction with woman and the man.
their selections than those choosing from The addition of some options can gen-
a smaller set. Conflict among options thus erate conflict and increase the tendency to
appears to make people less happy about refrain from choosing. Other options, how-
choosing, as well as less happy with their ever, can lower conflict and increase the like-
eventual choices. lihood of making a choice. Asymmetric dom-
Decisional conflict tends to favor default inance refers to the fact that in a choice
alternatives, much as it advantages the sta- between options A and B, a third option, A ,
tus quo. In one study, 80 students agreed to can be added that is clearly inferior to A (but
fill out a questionnaire in return for $1 .5 0. not to B), thereby increasing the choice like-
Following the questionnaire, one-half of the lihood of A (Huber, Payne, & Puto, 1 982).
respondents were offered the opportunity to For example, a choice between $6 and an
exchange the $1 .5 0 (the default) for one of elegant pen presents some conflict for par-
two prizes: a metal Zebra pen, or a pair of ticipants. However, when a less attractive
plastic Pilot pens. The remaining subjects pen is added to the choice set, the superior
were only offered the opportunity to ex- pen clearly dominates the inferior pen. This
change the $1 .5 0 for the Zebra. The pens dominance provides a rationale for choos-
were shown to subjects, who were informed ing the elegant alternative and leads to an
that each prize regularly costs just over increase in the percentage of those choos-
$2.00. The results were as follows. Twenty- ing the elegant pen over the cash. Along
five percent opted for the payment over the related lines, the compromise effect occurs
Zebra when Zebra was the only alternative, when the addition of a third, extreme option
but a reliably greater 5 3 % chose the pay- makes a previously available option appear
ment over the Zebra or the Pilot pens when as a reasonable compromise, thus increasing
both options were offered (Tversky & Shafir, its popularity (Simonson, 1 989; Simonson &
1 992a). Whereas the majority of subjects Tversky, 1 992).
took advantage of the opportunity to obtain Standard normative accounts do not deny
a valuable alternative when only one was of- conflict, nor, however, do they assume any
fered, the availability of competing valuable direct influence of conflict on choice. (For
alternatives increased the tendency to retain people who maximize utility, there does not
the default option. appear to be much room for conflict: Ei-
Related effects have been documented ther the utility difference is large and the
in decisions made by expert physicians and decision is easy, or it is small and the de-
legislators (Redelmeier & Shafir, 1 995 ). In cision is of little import.) In actuality, peo-
one scenario, neurologists and neurosur- ple are concerned with making the “right”
geons were asked to decide which of several choice, which can render decisional conflict
influential beyond mere considerations of the reason for the choice. Respondents eas-
value. Conflict is an integral aspect of de- ily generated “reasons” (in which they cited
cision making, and the phenomenology of attributes, such as stocking texture), but
conflict, which can be manipulated via the the reasons they provided bore little resem-
addition or removal of alternatives, yields blance to those that actually guided choice
predictable and systematic violations of stan- (Nisbett & Wilson, 1 977).
dard normative predictions. Finally, and perhaps most normatively
troubling, a reliance on reasons can induce
preference inconsistencies because nuances
Reason-Based Choice
in decisional context can render certain rea-
The desire to make the “right” choice of- sons more or less apparent. In one study
ten leads people to look for good reasons (Tversky & Shafir, 1 992b), college students
when making decisions, and such reliance were asked to imagine that they had just
on reasons helps make sense of phenomena taken and passed a difficult exam and now
that appear puzzling from the perspective had a choice for the Christmas holidays:
of value maximization (Shafir, Simonson, They could buy an attractive vacation pack-
& Tversky, 1 993 ). Relying on good rea- age at a low price, they could forego the va-
sons seems like sound practice: After all, cation package, or they could pay a $5 fee
the converse, making a choice without good to defer the decision by a day. The major-
reason, seems unwise. At the same time, ity elected to buy the vacation package, and
abiding by this practice can be problem- less than one-third elected to delay the deci-
atic because the reasons that come to mind sion. A second group was asked to imagine
are often fleeting, are limited to what is in- that they had taken the exam and failed and
trospectively accessible, and are not neces- would need to retake it after the Christmas
sarily those that guide, or ought to guide, holidays. They were then presented with
the decision. For example, participants who the same choice and, as before, the major-
were asked to analyze why they felt the way ity elected to buy the vacation package; less
that they did about a set of jams showed than one-third preferred to defer. However,
less agreement with “expert” ratings of the when a third group of participants was to
jams than did those who merely stated their imagine they did not know whether they
preferences (Wilson & Schooler, 1 991 ). A had passed or failed the exam, the major-
search for reasons can alter preference in ity preferred to pay to defer the decision
line with reasons that come readily to mind, until the next day, when the exam result
but those reasons may be heavily influenced would be known, and only a minority was
by salience, availability, or momentary con- willing to commit to the trip without know-
text. A heavy focus on a biased set of tem- ing. Apparently, participants were comfort-
porarily available reasons can cause one to able booking the trip when they had clear
lose sight of one’s (perhaps more valid) reasons for the decision – celebrating when
initial feelings (Wilson, Dunn, Kraft, & they passed the exam or recuperating when
Lisle, 1 989). they had failed – but were reluctant to com-
Furthermore, a wealth of evidence sug- mit when their reasons for the trip were
gests that people are not always aware of uncertain. This pattern, which violates the
their reasons for acting and deciding (see sure thing principle (Savage, 1 95 4), has been
Nisbett & Wilson, 1 977). In one example, documented in a variety of contexts, includ-
participants presented with four identical ing gambling and strategic interactions (e.g.,
pairs of stockings and asked to select one prisoner’s dilemmas; see also Shafir, 1 994;
showed a marked preference for the op- Shafir & Tversky, 1 992).
tion on the right. However, despite this ev- The tendency to delay decision for the
idence that choice was governed by posi- sake of further information can have a
tion, no participant mentioned position as significant impact on the ensuing choice.
Consider the following scenario (Bastardi & reliance on reasons in choice leaves decision
Shafir, 1 998): makers susceptible to a variety of contextual
and procedural nuances that render alterna-
For some time, you have considered adding tive potential reasons salient and thus may
a compact disc (CD) player to your stereo lead to inconsistent choices.
system. You now see an ad for a week-long
sale offering a very good CD player for only
$1 2 0, 5 0% off the retail price. Recently,
however, your amplifier broke. You learn Processing of Attribute Weights
that your warranty has expired and that
you have to pay $90 for repairs. Choices can be complex, requiring the eval-
uation of multiattribute options. Consider,
One group (the “simple” condition) was
for example, a choice between two job
asked whether they would buy the CD
candidates: One candidate did well in school
player during the sale, and the vast major-
but has relatively unimpressive work ex-
ity (91 %) said they would. Another (“uncer-
perience and moderate letters of recom-
tain”) group was presented with the same
mendation, whereas the other has a poor
scenario, but was told that they would not
scholastic record but better experience and
know until the next day whether the war-
stronger letters. To make this choice, the de-
ranty covered the $90 repairs. They could
cision maker must somehow combine the
wait until the following day (when they
attribute information, which requires deter-
would know about the warranty) to de-
mining not only the quality or value of each
cide whether to buy the CD player; 69%
attribute, but also the extent to which a
elected to wait. Those who chose to wait
shortcoming on one attribute can be com-
then learned that the warranty had expired
pensated for by strength on another.
and would not cover repairs; upon receiv-
Attribute evaluation may be biased by
ing the news, the majority decided not to
a host of factors known to hold sway over
buy the CD player. Note that this contrasts
human judgment (for a review, see Kahne-
sharply with the unequivocal choice to buy
man & Frederick, Chap. 1 2). Moreover, re-
the CD player when the $90 repair costs
searchers have long known that people have
were a given. Although they faced the same
limited capacity for combining information
decision, only 5 5 % (including those who
across attributes. Because of unreliable at-
waited and those who did not) chose to
tribute weights in human judges, simple lin-
buy the CD player in the uncertain con-
ear models tend to yield normatively better
dition, when they did not know but could
predictions than the very judges on whom
pursue information about the repair costs,
the models are based (Dawes, 1 979; Dawes,
compared with 91 % in the certain condi-
Faust, & Meehl, 1 989). In fact, people’s
tion, when repair costs were known from
unreliable weighting of attributes makes
the start. The decision to pursue informa-
them susceptible to a host of manipulations
tion can focus attention on the information
that alter attribute weights and yield con-
obtained and thereby trigger emergent ratio-
flicting preferences (see Shafir & LeBoeuf,
nales for making the choice, ultimately dis-
2004, for a further discussion of multiattri-
torting preference (Bastardi & Shafir, 1 998).
bute choice).
Similar patterns have been replicated in a
variety of contexts, including one involving
Compatibility
professional nurses in a renal failure ward,
more of whom expressed willingness to do- Options can vary on several dimensions.
nate a kidney (to a hypothetical relative) Even simple monetary gambles, for exam-
when they had purportedly been tested and ple, differ on payoffs and the chance to win.
learned that they were eligible than when Respondents’ preferences among such gam-
they had known they were eligible from the bles can be assessed in different but logically
start (Redelmeier, Shafir, & Aujla, 2001 ). A equivalent, ways (see Schkade & Johnson,
1 989, for a review). For example, partici- and negative attributes, such as gorgeous
pants may be asked to choose among the beaches and great sunshine but cold water
gambles or, alternatively, they may estimate and strong winds, and another that is neutral
their maximum willingness to pay for each in all respects. Some respondents were asked
gamble. Notably, these procedures, although which destination they preferred; others de-
logically equivalent, often result in differ- cided which to forego. Because positive fea-
ential weightings of attributes and, conse- tures are weighed more heavily in choice and
quently, in inconsistent preferences. negative features matter relatively more dur-
Consider two gambles: One offers an ing rejection, the enriched destination was
eight-in-nine chance to win $4 and the other most frequently chosen and rejected. Over-
a one-in-nine chance to win $40. People all, its choice and rejection rates summed to
typically choose the high-probability gamble 1 1 5 %, significantly more than the impover-
but assign a higher price to the high-payoff ished destination’s 85 %, and more than the
gamble, thus expressing conflicting prefer- 1 00% expected if choice and rejection were
ences (Grether & Plott, 1 979; Lichtenstein complementary (see also Downs & Shafir,
& Slovic, 1 971 , 1 973 ; Tversky, Slovic, & 1 999; Wedell, 1 997).
Kahneman, 1 990). This pattern illustrates
the principle of compatibility, according to
Separate Versus Comparative Evaluation
which an attribute’s weight is enhanced by
its compatibility with the response mode Decision contexts can facilitate or ham-
(Slovic, Griffin, & Tversky, 1 990; Tversky, per attribute evaluation, and this can alter
Sattath, & Slovic, 1 988). In particular, a gam- attribute weights. Not surprisingly, an at-
ble’s potential payoff is weighted more heav- tribute whose value is clear can have greater
ily in pricing, where both the price and impact than an attribute whose value is
the payoff are in the same monetary units, vague. The effects of ease of evaluation, re-
than in choice, where neither attribute maps ferred to as “evaluability,” occur, for exam-
onto the response scale (Schkade & Johnson, ple, when an attribute proves difficult to
1 989). As a consequence, the high-payoff gauge in isolation but easier to evaluate in
gamble is valued more in pricing relative a comparative setting (Hsee, 1 996; Hsee,
to choice. Loewenstein, Blount, & Bazerman, 1 999). In
For another type of response compati- one study, subjects were presented with two
bility, imagine having to choose or, alter- second-hand music dictionaries: one with
natively, having to reject, one of two op- 20,000 entries but a slightly torn cover, and
tions. Logically speaking, the two tasks are the other with 1 0,000 entries and an un-
interchangeable: If people prefer one option, blemished cover. Subjects had only a vague
they will reject the second, and vice versa. notion of how many entries to expect in a
However, people tend to focus on the rel- music dictionary; when they saw these one
ative strengths of options (more compati- at a time, they were willing to pay more for
ble with choosing) when they choose, and the dictionary with the new cover than for
on weaknesses (compatible with rejecting) the one with a cover that was slightly torn.
when they reject. As a result, options’ posi- When the dictionaries were evaluated con-
tive features (the pros) loom larger in choice, currently, however, the number-of-entries
whereas their negative features (the cons) attribute became salient: Most subjects ob-
are weighted relatively more during rejec- viously preferred the dictionary with more
tion. In one study, respondents were pre- entries, despite the inferior cover.
sented with pairs of options – an enriched For another example, consider a job that
option, with various positive and negative pays $80,000 a year at a firm where one’s
features, and an impoverished option, with peers receive $1 00,000, compared with a job
no real positive or negative features (Shafir, that pays $70,000 while coworkers are paid
1 993 ). For example, consider two vacation $5 0,000. Consistent with the fact that most
destinations: one with a variety of positive people prefer higher incomes, a majority of
second-year MBA students who compared preferences. Notably, discrepancies between

the two options preferred the job with the separate versus concurrent evaluation have
higher absolute – despite the lower rela- profound implications for intuition and for
tive – income. When the jobs are contem- policy. Outcomes in life are typically experi-
plated separately, however, the precise mer- enced one at a time: A person lives through
its of one’s own salary are hard to gauge, one scenario or another. Normative intu-
but earning less than comparable others ren- itions, however, typically arise from concur-
ders the former job relatively less attractive rent introspection: We entertain a scenario
than the latter, where one’s salary exceeds along with its alternatives. When an event
one’s peers’. Indeed, the majority of MBA triggers reactions that stem from its being
students who evaluated the two jobs sepa- experienced in isolation, important aspects
rately anticipated higher satisfaction in the of the experience will be misconstrued by
job with the lower salary but the higher rela- intuitions that arise from concurrent evalu-
tive position, obviously putting more weight ation (see Shafir, 2002).
on the latter attribute in the context of sep-
arate evaluation (Bazerman, Schroth, Shah,
Diekmann, & Tenbrunsel, 1 994). Local Versus Global Perspectives
In the same vein, decision principles that
are hard to apply in isolated evaluation may
Many of the inconsistency patterns de-
prove decisive in comparative settings, pro-
scribed previously would not have arisen
ducing systematic fluctuations in attribute
were decisions considered from a more
weights. Kahneman and Ritov (1 994), for
global perspective. The framing of decisions,
example, asked participants about their will-
for instance, would be of little consequence
ingness to contribute to several environmen-
were people to go beyond the provided
tal programs. One program was geared to-
frame to represent the decision outcomes in
ward saving dolphins in the Mediterranean
a canonical manner that is description inde-
Sea; another funded free medical check-
pendent. Instead, people tend to accept the
ups for farm workers at risk for skin can-
decision problem as it is presented, largely
cer. When asked which program they would
because they may not have thought of other
rather support, the vast majority chose the
ways to look at the decision, and also be-
medical checkups for farm workers, pre-
cause they may not expect their preferences
sumably following the principle that human
to be susceptible to presumably incidental
lives come before those of animals. How-
alterations. (Note that even if they were to
ever, when asked separately for the largest
recognize the existence of multiple perspec-
amount they would be willing to pay for
tives, people may still not know how to ar-
each intervention, respondents, moved by
rive at a preference independent of a spe-
the animals’ vivid plight, were willing to
cific formulation; cf. Kahneman, 2003 ). In
pay more for the dolphins than for work-
this final section, we review several addi-
ers’ checkups. In a similar application, po-
tional decision contexts in which a limited
tential jurors awarded comparable dollar
or myopic approach is seen to guide deci-
amounts to plaintiffs who had suffered ei-
sion making, and inconsistent preferences
ther physical or financial harm, as long as the
arise as a result of a failure to adopt a more
cases were evaluated separately. However, in
“global” perspective. Such a perspective re-
concurrent evaluation, award amounts in-
quires one to ignore momentarily salient fea-
creased dramatically when the harm was
tures of the decision in favor of other, often
physical as opposed to financial, affirming
less salient, considerations that have long-
the notion that personal harm is the graver
run consequences.
offense (Sunstein, Kahneman, Schkade, &
Ritov, 2001 ).
Repeated Decisions
Attribute weights, which are normatively
assumed to remain stable, systematically Decisions that occur on a regular basis are
shift and give rise to patterns of inconsistent often more meaningful when evaluated “in
the long run.” For example, the choice to diet predicts a pattern of individual decisions that
or to exercise makes little difference on any is inconsistent with what these physicians
one day and can only be carried out under a would endorse from a more global perspec-
long-term perspective that trumps the per- tive. For a more mundane example, people
son’s short-term preferences for cake over report greater willingness to wear a seatbelt –
vegetables or for sleeping late rather than go- and to support proseatbelt legislation – when
ing to the gym early. People, however, often they are shown statistics concerning the life-
do not take this long-term perspective when time risk of being in a fatal accident instead
evaluating instances of a recurring choice; in- of the dramatically lower risk associated with
stead, they tend to treat each choice as an any single auto trip (Slovic et al., 1 988).
isolated event. Similar patterns prompted Kahneman
In one study, participants were offered a and Lovallo (1 993 ) to argue that decision
5 0% chance to win $2000 and a 5 0% chance makers often err by treating each decision as
to lose $5 00. Although most participants re- unique rather than categorizing it as one in
fused to play this gamble once, the major- a series of similar decisions made over a life-
ity were eager to play the gamble five times, time (or, in the case of corporations, made by
and, when given the choice, preferred to play many workers). They distinguish an “inside
the gamble six times rather than five. Appar- view” of situations and plans, characterized
ently, fear of possibly losing the single gam- by a focus on the peculiarities of the case at
ble is compensated for by the high likelihood hand, from an “outside view,” guided by an
of ending up ahead in the repeated version. analysis of a large number of similar cases.
Other participants were asked to imagine Whereas an outside view, based, for exam-
that they had already played the gamble five ple, on base rates, typically leads to a more
times (outcome as yet unknown) and were accurate evaluation of the current case, peo-
given the option to play once more. In this ple routinely adopt an inside view, which
formulation, a majority of participants re- typically overweighs the particulars of the
jected the additional play. Although partici- given case at the expense of base-rate con-
pants preferred to play the gamble six times siderations. Managers, for example, despite
rather than five, once they had finished play- knowing that past product launches have
ing five, the additional opportunity was im- routinely run over budget and behind sched-
mediately “segregated” and treated as a single ule, may convince themselves that this time
instance, which – as we know from the sin- will be different because the team is excel-
gle gamble version – participants preferred lent or the product exceptional. The inside
to avoid (Redelmeier & Tversky, 1 992). view can generate overconfidence (Kahne-
In a related vein, consider physicians, man & Lovallo, 1 993 ), as well as undue op-
who can think of their patients “individ- timism, for example, regarding the chances
ually” (i.e., patient by patient) or “glob- of completing projects by early deadlines
ally” (e.g., as groups of patients with simi- (e.g., the planning fallacy; Buehler, Griffin, &
lar problems). In several studies, Redelmeier Ross, 1 994). The myopia that emerges from
and Tversky (1 990) found that physicians treating repeated decisions as unique leads
were more likely to take “extra measures,” to overly bold predictions and to the neglect
such as ordering an expensive medical test of considerations that ought to matter in the
or recommending an in-person consultation, long run.
when they considered the treatment of an
individual patient than when they consid-
Mental Accounting
ered a larger group of similarly afflicted
patients. Personal concerns loomed larger Specific forms of myopia arise in the con-
when patients were considered individually text of “mental accounting,” the behav-
than when “patients in general” were con- ioral equivalent of accounting done by firms
sidered, with the latter group more likely to wherein people reason about and make
highlight efficiency concerns. Because physi- decisions concerning matters such as in-
cians tend to see patients one at a time, this come, spending, and savings. Contrary to
the assumption of “fungibility,” according to as a substantial ratio in one case and as quite
which money in one account, or from one negligible in the other.
source, is a perfect substitute for money in Specific formulations and contextual de-
another, it turns out that the labeling of ac- tails are not spontaneously reformulated
counts and the nature of transactions have or translated into more comprehensive or
a significant impact on people’s decisions canonical representations. As a consequence,
(Thaler, 1 999). For one example, people’s re- preferences prove highly labile and depen-
ported willingness to spend $25 on a theater dent on what are often theoretically, as well
ticket is unaffected by having incurred a $5 0 as practically, unimportant and accidental
parking ticket but is significantly lowered details. An extensive literature on mental ac-
when $5 0 is spent on a ticket to a sporting counting, as well as behavioral finance, forms
event (Heath & Soll, 1 996). Respondents ap- part of the growing field of behavioral eco-
parently bracket expenses into separate ac- nomics (see, e.g., Camerer, Loewenstein, &
counts so spending on entertainment is im- Rabin, 2004; Thaler 1 993 , 1 999).
pacted by a previous entertainment expense
in a way that it is not if that same expense
Temporal Discounting
is “allocated” to, say, travel. Along similar
lines, people who had just lost a $1 0 bill were A nontrivial task is to decide how much
happy to buy a $1 0 ticket for a play but were weight to give to outcomes extended into
less willing to buy the ticket if, instead of the the distant future. Various forms of uncer-
money, they had just lost a similar $1 0 ticket tainty (regarding nature, one’s own tastes,
(Tversky & Kahneman, 1 981 ). Apparently, and so on) justify some degree of discount-
participants were willing to spend $1 0 on a ing in calculating the present value of future
play even after losing $1 0 cash but found it goods. Thus, $1 000 received next year is typ-
aversive to spend what was coded as $20 on ically worth less than $1 000 received today.
a ticket. As it turns out, observed discount rates tend
Finally, consider the following scenario, to be unstable and often influenced by fac-
which respondents saw in one of two tors, such as the size of the good and its tem-
versions: poral distance, that are not subsumed un-
der standard normative analyses (see Ainslie,
Imagine that you are about to purchase 2001 ; Frederick, Loewenstein, & Donoghue,
a jacket for $1 2 5 [$1 5 ] and a calculator 2002; Loewenstein & Thaler, 1 989, for re-
for $1 5 [$1 2 5 ]. The calculator salesman view). For example, although some people
informs you that the calculator you want prefer an apple today over two apples to-
to buy is on sale for $1 0 [$1 2 0] at the morrow, virtually nobody prefers one apple
other branch of the store, located 2 0 min- in 3 0 days over two apples in 3 1 days (Thaler,
utes drive away. Would you make the trip 1 981 ). Because discount functions are non-
to the other store? (Tversky & Kahneman, exponential (see also Loewenstein & Prelec,
1 981 , p. 45 7) 1 992), a 1 -day delay has greater impact when
that day is near than when it is far. Simi-
Faced with the opportunity to save $5 on larly, when asked what amount of money in
a $1 5 calculator, a majority of respondents the future would be comparable to receiv-
agreed to make the trip. However, when ing a specified amount today, people require
the calculator sold for $1 25 , only a minor- about $60 in 1 year to match $1 5 now, but
ity was willing to make the trip for the they are satisfied with $4000 in a year in-
same $5 savings. A global evaluation of ei- stead of $3 000 today. This implies discount
ther version yields a 20-minute voyage for $5 rates of 3 00% in the first case and of 3 3 % in
savings; people, however, seem to make de- the second. To the extent that one engages
cisions based on what has been referred to as in a variety of transactions throughout time,
“topical” accounting (Kahneman & Tversky, imposing wildly disparate discount rates
1 984), wherein the same $5 saving is coded on smaller versus larger amounts ignores
the fact that numerous small amounts will priming

eventually add up to be larger, yielding At the most basic level, transient mindsets
systematic inconsistency. arise when specific criteria are made mo-
Excessive discounting turns into myopia, mentarily salient. Grocery shopping while
which is often observed in people’s atti- very hungry, for example, is likely to lead to
tudes toward future outcomes (see, e.g., purchases that would not have been made
Elster, 1 984; Elster & Loewenstein, 1 992). under normal circumstances (cf. Loewen-
Loewenstein and Thaler (1 989) discussed a stein, 1 996). In a study of the susceptibil-
West Virginia experiment in which the high ity to temporary criterion salience, partici-
school dropout rate was reduced by one- pants first received a “word perception test”
third when dropouts were threatened with in which either creativity, reliability, or a
the loss of their driving privileges. This im- neutral topic was primed. Participants then
mediate consequence apparently had a sig- completed an ostensibly unrelated “prod-
nificantly greater impact than the far more uct impression task” that gauged their opin-
serious but more distant socioeconomic im- ions of various cameras. Cameras advertised
plications of failing to graduate from high for their creative potential were rated as
school. These authors also mention physi- more attractive by those primed for creativ-
cians’ typical lament that warning about the ity than by those exposed to words related
risk of skin cancer from excessive sun ex- to reliability or a neutral topic (Bettman &
posure has less effect than the warning that Sujan, 1 987). Momentary priming thus im-
such exposure can cause large pores and pacted ensuing preferences, rendering more
acne. In fact, “quit smoking” campaigns have salient criteria that had not previously been
begun to stress the immediate benefits of considered important, despite the fact that
quitting (quick reduction in the chance of a product consumption was likely to occur
heart attack, improved ability to taste foods long after such momentary criterion salience
within 2 days, and such) even more promi- dissipated (see Mandel & Johnson, 2002;
nently than the long-term benefits (Ameri- Verplanken & Holland, 2002; Wright &
can Lung Association, 2003 ). Similar reason- Heath, 2000).
ing applies in the context of promoting safe
sex practices and medical self-examinations,
where immediate gratification or discom- identities
fort often trumps much greater, but tempo- At a broader level, preferences fluctuate
rally distant, considerations. Schelling (1 980, along with momentarily salient identities. A
1 984) thought about similar issues of self- working woman, for example, might think
control in the face of immediate temptation of herself primarily as a mother when in the
as involving multiple “selves”; it is to related company of her children but may see her-
considerations of alternate frames of mind self primarily as a professional while at work.
that we turn next. The list of potential identities can be exten-
sive (Turner, 1 985 ) with some of a person’s
identities (e.g., “mother”) conjuring up strik-
ingly different values and ideals from oth-
Frames of Mind
ers (e.g., “CEO”). Although choices are typ-
Myopic decisions can occur when highly ically expected to reveal stable and coherent
transient frames of mind are momentar- preferences that correspond to the wishes
ily triggered, highlighting values and desires of the self as a whole, in fact, choice often
that may not reflect the decision maker’s fluctuates in accord with happenstance fluc-
more global preferences. Because choices of- tuations in identity salience. In one study,
ten involve delayed consumption, failure to college students whose “academic” identi-
anticipate the labile nature of preferences ties had been triggered were more likely to
may lead to the selection of later-disliked opt for more academic periodicals (e.g., The
alternatives. Economist) than were those whose “socialite”
identities had been made salient. Similarly, & Welch, 2001 ) just as drives and moti-
Chinese Americans whose American iden- vations can influence reasoning more gen-
tities were evoked adopted more stereotyp- erally (see Molden & Higgins, Chap. 1 3 ).
ically American preferences (e.g., for indi- Emotion and affect thus influence people’s
viduality and competition over collectivism preferences; however, because these senti-
and cooperation) compared with when ments are often transient, such influence
their Chinese identities had been triggered contributes to reversals of preference as mo-
(LeBoeuf, 2002; LeBoeuf & Shafir, 2004). mentary emotions and drives fluctuate.
Preference tends to align with currently Inconsistency thus often arises because
salient identities, yielding systematic tension people do not realize that their preferences
anytime there is a mismatch between the are being momentarily altered by situation-
identity that does the choosing and the one ally induced sentiments. Evidence suggests,
likely to do the consuming, as when a par- however, that even when people are aware of
ent commits to a late work meeting only to being in the grip of a transient drive or emo-
regret missing her child’s soccer game once tion, they may not be able to “correct” ade-
back at home. quately for that influence. For example, re-
spondents in one study were asked to predict
whether they would be more bothered by
emotions and drives thirst or by hunger if trapped in the wilder-
Emotions can have similar effects, influenc- ness without water or food. Some answered
ing the momentary evaluation of outcomes, right before exercising (when not especially
and thus choice. The anticipated pain of a thirsty), whereas others answered immedi-
loss is apparently greater for people in a pos- ately after exercising (thus, thirsty). Postex-
itive mood than for those in a negative mood; ercise, 92% indicated that they would be
this leads to greater risk aversion among more troubled by thirst than by hunger in
those in a good mood as they strive for the wilderness, compared with 61 % preexer-
“mood maintenance” (e.g., Isen, Nygren, & cise (Van Boven & Loewenstein, 2003 ). Post-
Ashby, 1 988). Furthermore, risk judgments exercise, people could easily attribute their
tend to be more pessimistic among people in thirst to the exercise. Nonetheless, when
a negative than a positive mood (e.g., John- imagining how they would feel in another,
son & Tversky, 1 983 ). However, valence is quite different and distant situation, peo-
not the sole determinant of an emotion’s ple projected their current thirst. More
influence: Anger, a negative emotion, seems generally, people tend to exhibit “empathy
to increase appraisals of individual control, gaps,” wherein they underestimate the de-
leading to optimistic risk assessment and to gree to which various contextual changes
risk seeking, whereas fear, also a negative will impact their drives, emotions, and
emotion, is not associated with appraisals of preferences (e.g., Van Boven, Dunning, &
control and promotes risk aversion (Lerner Loewenstein, 2000; see also Gilbert, Pinel,
& Keltner, 2001 ). Wilson, Blumberg, & Wheatley, 1 998). This
Emotions, or affect, also influence the can further contribute to myopic decision
associations or images that come to mind making, for people honor present feelings
in decision making. Because images can be and inclinations not fully appreciating the
consulted quickly and effortlessly, an “affect extent to which these may be attributable
heuristic” has been proposed with affective to fairly incidental factors that thus may
assessments sometimes guiding decisions soon dissipate.
(Slovic, Finucane, Peters, & MacGregor,
2002). Furthermore, “anticipatory emo-
tions” (e.g., emotional reactions to being in Conclusions and Future Directions
a risky situation) can influence the cogni-
tive appraisal of decision situations and can A review of the behavioral decision-making
affect choice (Loewenstein, Weber, Hsee, literature shows peoples’ preferences to be
highly malleable and systematically affected perspective more compelling, rather than
by a host of factors not subsumed under highlighting the need for debiasing (Arkes,
the compelling and popular normative the- 1 991 ; LeBoeuf & Shafir, 2003 ; Thaler, 1 991 ).
ory of choice. People’s preferences are heav- Research in decision making is active and
ily shaped, among other things, by par- growing. Among interesting current devel-
ticular perceptions of risk and value, by opments, several researchers have argued for
multiple influences on attribute weights, a greater focus on emotion as a force guid-
by the tendency to avoid decisional con- ing decisions (Hsee & Kunreuther, 2000;
flict and to rely on compelling reasons for Loewenstein et al., 2001 ; Rottenstreich &
choice, by salient identities and emotions, Hsee, 2001 ; Slovic et al., 2002). Others
and by a general tendency to accept deci- are investigating systematic dissociations be-
sion situations as they are described, rarely tween experienced utility, that is, the he-
reframing them in alternative, let alone donic experience an option actually brings,
canonical, ways. from decision utility, the utility implied by
It is tempting to attribute many of the the decision. Such investigations correctly
effects to shallow processing or to a fail- point out that, in addition to exhibiting
ure to consider the decision seriously (see, consistent preferences, one would also want
e.g., Grether & Plott, 1 979; Smith, 1 985 ; decision makers to choose those options
see also Shafir & LeBoeuf, 2002, for fur- that will maximize the quality of experi-
ther review of critiques of the findings). ence (Kahneman, 1 994). As it turns out,
After all, it seems plausible that partici- misprediction of experienced utility is com-
pants who consider a problem more care- mon, in part because people misremember
fully might notice that it can be framed in the hedonic qualities of past events (Kahne-
alternate ways. This would allow a consid- man, Fredrickson, Schreiber, & Redelmeier,
eration of the problem from multiple per- 1 993 ), and in part because they fail to antic-
spectives and perhaps lead to a response ipate how enjoyment may be impacted by
unbiased by problem frame or other “incon- factors such as mere exposure (Kahneman
sequential” factors (cf. Sieck & Yates, 1 997). & Snell, 1 992), the dissipation of satiation
Evidence suggests, however, that the pat- (Simonson, 1 990), and the power of adapta-
terns documented previously cannot be at- tion, even to dramatic life changes (Gilbert
tributed to laziness, inexperience, or lack et al., 1 998; Schkade & Kahneman, 1 998).
of motivation. The same general effects are An accurate description of human de-
observed when participants are provided cision making needs to incorporate those
greater incentives (Grether & Plott, 1 979; and other tendencies not reviewed in this
see Camerer & Hogarth, 1 999, for a review), chapter, including a variety of other judg-
when they are asked to justify their choices mental biases (see Kahneman & Frederick,
(Fagley & Miller, 1 987; LeBoeuf & Shafir, Chap. 1 2), as well as people’s sensitivity to
2003 ; Levin & Chapman, 1 990), when they considerations such as fairness (Kahneman,
are experienced or expert decision makers Knetsch, & Thaler, 1 986a, 1 986b; Rabin,
(Camerer, Babcock, Loewenstein, & Thaler, 1 993 ) and sunk costs (Arkes & Blumer,
1 997; McNeil, Pauker, Sox, & Tversky, 1 982; 1 985 ; Gourville & Soman, 1 998). A suc-
Redelmeier & Shafir, 1 995 ; Redelmeier, cessful descriptive model must allow for
Shafir, & Aujla, 2001 ), or when they are the violations of normative criteria, such as
types (e.g., “high need for cognition”) who procedure and description invariance, dom-
naturally think more deeply about prob- inance, regularity, and, occasionally, transi-
lems (LeBoeuf & Shafir, 2003 ; Levin, Gaeth, tivity. It must also allow for the eventual
Schreiber, & Lauriola, 2002). These findings incorporation of other psychological pro-
suggest that many of the attitudes triggered cesses that might impact choice. For ex-
by specific choice problem frames are at least ample, it has been suggested that taking
somewhat entrenched, with extra thought aspiration levels into account may some-
or effort only serving to render the dominant times predict risky decision making better
than does prospect theory’s reliance only Arkes, H. R., & Blumer, C. (1 985 ). The psychol-
on reference points (Lopes & Oden, 1 999). ogy of sunk cost. Organizational Behavior and
The refinement of descriptive theories is Human Decision Processes, 3 5 , 1 24–1 40.
an evolving process; however, the product Arrow, K. J. (1 95 1 ). Alternative approaches to
that emerges continuously seems quite dis- the theory of choice in risk-taking situations.
tant from the elegant and optimal normative Econometrica, 1 9, 404–43 7.
treatment. At the same time, acknowledged Arrow, K. J. (1 988). Behavior under uncertainty
departures from the normative theory need and its implications for policy. In D. E. Bell, H.
not weaken that theory’s normative force. Raiffa, & A. Tversky (Eds.), Decision making:
Descriptive, normative, and prescriptive interac-
After all, normative theories are themselves
tions (pp. 497–5 07). Cambridge, UK: Cam-
empirical projects, capturing what people bridge University Press.
consider ideal: As we improve our under-
Bastardi, A., & Shafir, E. (1 998). On the pur-
standing of how decisions are made, we
suit and misuse of useless information. Jour-
may be able to formulate prescriptive pro- nal of Personality and Social Psychology, 75 ,
cedures to guide decision makers, in light of 1 9–3 2.
their limitations, to better capture their nor- Bazerman, M. H., Schroth, H. A., Shah, P. P.,
mative wishes. Diekmann, K. A., & Tenbrunsel, A. E. (1 994).
Of course, there are instances in which The inconsistent role of comparison with oth-
people have very clear preferences that no ers and procedural justice in reactions to hypo-
amount of subtle manipulation will alter (cf. thetical job descriptions: Implications for job
Payne, Bettman, & Johnson, 1 992). At other acceptance decisions. Organizational Behavior
times, we appear to be at the mercy of fac- and Human Decision Processes, 60, 3 26–3 5 2.
tors that we would often like to consider Bernoulli, D. (1 73 8/1 95 4). Exposition of a new
inconsequential. This conclusion, well ac- theory on the measurement of risk. Economet-
cepted within psychology, is becoming in- rica, 2 2 , 23 –3 6.
creasingly influential not only in decision re- Bettman, J. R., & Sujan, M. (1 987). Effects of
search, but also in the social sciences more framing on evaluation of comparable and non-
generally, with prominent researchers in law, comparable alternatives by expert and novice
medicine, sociology, and economics exhort- consumers. Journal of Consumer Research, 1 4,
ing their fields to pay attention to findings of 1 41 –1 5 4.
the sort reviewed here in formulating new Buehler, R., Griffin, D., & Ross, M. (1 994).
ways of thinking about and predicting be- Exploring the “planning fallacy:” Why peo-
ple underestimate their task completion times.
havior. Given the academic, personal, and
Journal of Personality and Social Psychology, 67,
practical import of decision making, such de-
3 66–3 81 .
velopments may prove vital to our under-
Camerer, C. (1 992). Recent tests of general-
standing of why people think, act, and de-
izations of expected utility theories. In W.
cide as they do. Edwards (Ed.), Utility theories: Measurement
and applications (pp. 207–25 1 ). Dordrecht:
Kluwer.
Camerer, C., Babcock, L., Loewenstein, G., &
References Thaler, R. (1 997). Labor supply of New York
City cabdrivers: One day at a time. The Quar-
terly Journal of Economics, 1 1 2 , 407–441 .
Ainslie, G. (2001 ). Breakdown of will. New York:
Cambridge University Press. Camerer, C. F., & Hogarth, R. M. (1 999). The
effects of financial incentives in experiments:
American Lung Association. (2003 ). What are
A review and capital-labor-production frame-
the benefits of quitting smoking? Available:
work. Journal of Risk and Uncertainty, 1 9, 7–42.
http://www.lungusa.org/tobacco/quit ben.html.
Arkes, H. R. (1 991 ). Costs and benefits of judg- Camerer, C. F., Loewenstein, G., & Rabin,
ment errors: Implications for debiasing. Psycho- M. (2004). Advances in behavioral economics.
logical Bulletin, 1 1 0, 486–498. Princeton, NJ: Princeton University Press.
Collins, A. M., & Loftus, E. F. (1 975 ). A judgement and decision making. Thousand
spreading-activation theory of semantic pro- Oaks: Sage.
cessing. Psychological Review, 82 , 407–428. Heath, C., & Soll, J. B. (1 996). Mental budgeting
Dawes, R. M. (1 979). The robust beauty of im- and consumer decisions. Journal of Consumer
proper linear models in decision making. Amer- Research, 2 3 , 40–5 2.
ican Psychologist, 3 4, 5 71 –5 82. Heath, C., & Tversky, A. (1 991 ). Preference and
Dawes, R. M., Faust, D., & Meehl, P. E. (1 989). belief: Ambiguity and competence in choice
Clinical verus actuarial judgment. Science, 2 43 , under uncertainty. Journal of Risk and Uncer-
1 668–1 674. tainty, 4, 5 –28.
Downs, J. S., & Shafir, E. (1 999). Why some are Hershey, J. C., & Schoemaker, P. J. H. (1 980). Risk
perceived as more confident and more inse- taking and problem context in the domain of
cure, more reckless and more cautious, more losses: An expected utility analysis. The Journal
trusting and more suspicious, than others: En- of Risk and Insurance, 47, 1 1 1 –1 3 2.
riched and impoverished options in social judg- Hsee, C. K. (1 996). The evaluability hypothesis:
ment. Psychonomic Bulletin and Review, 6, 5 98– An explanation of preference reversals be-
61 0. tween joint and separate evaluations of alterna-
Elster, J. (1 984). Studies in rationality and ir- tives. Organizational Behavior and Human De-
rationality. Revised edition. Cambridge, UK: cision Processes, 67, 247–25 7.
Cambridge University Press. Hsee, C. K., & Kunreuther, H. C. (2000). The
Elster, J., & Loewenstein, G. (Eds.). (1 992). affection effect in insurance decisions. Journal
Choice over time. New York: Russell Sage Foun- of Risk and Uncertainty, 2 0, 1 41 –1 5 9.
dation. Hsee, C. K., Loewenstein, G. F., Blount, S., & Baz-
Fagley, N. S., & Miller, P. M. (1 987). The effects of erman, M. H. (1 999). Preference reversals be-
decision framing on choice of risky vs. certain tween joint and separate evaluations of options:
options. Organizational Behavior and Human A review and theoretical analysis. Psychological
Decision Processes, 3 9, 264–277. Bulletin, 5 , 5 76–5 90.
Frederick, S., Loewenstein, G., & Donoghue, T. Huber, J., Payne, J. W., & Puto, C. (1 982). Adding
(2002). Time discounting and time preference: asymmetrically dominated alternatives: Viola-
A critical review. Journal of Economic Literature, tions of regularity and the similarity hypothesis.
40, 3 5 1 –401 . Journal of Consumer Research, 9, 90–98.
Gilbert, D. T., Pinel, E. C., Wilson, T. D., Blum- Isen, A. M., Nygren, T. E., & Ashby, F. G. (1 988).
berg, S. J., & Wheatley, T. P. (1 998). Immune Influence of positive affect on the subjective
neglect: A source of durability bias in affec- utility of gains and losses: It is just not worth the
tive forecasting. Journal of Personality and Social risk. Journal of Personality and Social Psychology,
Psychology, 75 , 61 7–63 8. 5 5 , 71 0–71 7.
Goldstein, W. M., & Hogarth, R. M. (Eds.). Iyengar, S. S., & Lepper, M. R. (2000). When
(1 997). Research on judgment and decision choice is demotivating: Can one desire too
making: Currents, connections, and controversies. much of a good thing? Journal of Personality
New York: Cambridge University Press. and Social Psychology, 79, 995 –1 006.
Gonzalez, R., & Wu, G. (1 999). On the shape of Johnson, E. J., & Goldstein, D. (2003 ). Do de-
the probability weighting function. Cognitive faults save lives? Science, 3 02 , 1 3 3 8–1 3 3 9.
Psychology, 3 8, 1 29–1 66. Johnson, E. J., Hershey, J., Meszaros, J., & Kun-
Gourville, J. T., & Soman, D. (1 998). Payment reuther, H. (1 993 ). Framing, probability distor-
depreciation: The behavioral effects of tempo- tions, and insurance decisions. Journal of Risk
rally separating payments from consumption. and Uncertainty, 7, 3 5 –5 1 .
Journal of Consumer Research, 2 5 , 1 60–1 74. Johnson, E. J., & Tversky, A. (1 983 ). Affect, gen-
Grether, D., & Plott, C. (1 979). Economic theory eralization, and the perception of risk. Jour-
of choice and the preference reversal phe- nal of Personality and Social Psychology, 45 ,
nomenon. American Economic Review, 69, 623 – 20–3 1 .
63 8. Kahneman, D. (1 992). Reference points, anchors,
Hastie, R., & Dawes, R. M. (2001 ). Rational and mixed feelings. Organizational Behavior
choice in an uncertain world: The psychology of and Human Decision Processes, 5 1 , 296–3 1 2.
Kahneman, D. (1 994). New challenges to the Knetsch, J. L. (1 989). The endowment effect
rationality assumption. Journal of Institutional and evidence of nonreversible indifference
and Theoretical Economics, 1 5 0, 1 8–3 6. curves. American Economic Review, 79, 1 277–
Kahneman, D. (2003 ). A perspective on judg- 1 284.
ment and choice: Mapping bounded rational- Kühberger, A. (1 995 ). The framing of decisions:
ity. American Psychologist, 5 8, 697–720. A new look at old problems. Organizational
Kahneman, D., Fredrickson, B. L., Schreiber, Behavior and Human Decision Processes, 62 ,
C. A., & Redelmeier, D. A. (1 993 ). When more 23 0–240.
pain is preferred to less: Adding a better end. LeBoeuf, R. A. (2002). Alternating selves and con-
Psychological Science, 4, 401 –405 . flicting choices: Identity salience and preference in-
Kahneman, D., Knetsch, J. L., & Thaler, R. H. consistency. Unpublished doctoral dissertation,
(1 986a). Fairness and the assumptions of eco- Princeton University, Princeton, NJ.
nomics. Journal of Business, 5 9, s285 –s3 00. LeBoeuf, R. A., & Shafir, E. (2003 ). Deep
Kahneman, D., Knetsch, J. L., & Thaler, R. H. thoughts and shallow frames: On the suscep-
(1 986b). Fairness as a constraint on profit tibility to framing effects. Journal of Behavioral
seeking: Entitlements in the market. American Decision Making, 1 6, 77–92.
Economic Review, 76, 728–741 . LeBoeuf, R. A., & Shafir, E. (2004). Alternating
Kahneman, D., Knetsch, J. L., & Thaler, R. selves and conflicting choices: Identity salience
(1 990). Experimental tests of the endowment and preference inconsistency. Manuscript under
effect and the Coase theorem. Journal of Politi- review.
cal Economics, 98, 1 3 25 –1 3 48. Lerner, J. S., & Keltner, D. (2001 ). Fear, anger, and
Kahneman, D., & Lovallo, D. (1 993 ). Timid risk. Journal of Personality and Social Psychology,
choices and bold forecasts: A cognitive per- 81 , 1 46–1 5 9.
spective on risk taking. Management Science, 3 9, Levin, I. P. (1 987). Associative effects of informa-
1 7–3 1 . tion framing. Bulletin of the Psychonomic Society,
Kahneman, D., & Ritov, I. (1 994). Determinants 2 5 , 85 –86.
of stated willingness to pay for public goods: A Levin, I. P., & Chapman, D. P. (1 990). Risk taking,
study in the headline method. Journal of Risk frame of reference, and characterization of vic-
and Uncertainty, 9, 5 –3 8. tim groups in AIDS treatment decisions. Jour-
Kahneman, D., & Snell, J. (1 992). Predicting a nal of Experimental Social Psychology, 2 6, 421 –
changing taste: Do people know what they will 43 4.
like? Journal of Behavioral Decision Making, 5 , Levin, I. P., Gaeth, G. J., Schreiber, J., & Lauriola,
1 87–200. M. (2002). A new look at framing effects: Dis-
Kahneman, D., & Tversky, A. (1 979). Prospect tribution of effect sizes, individual differences,
theory: An analysis of decision under risk. and independence of types of effects. Organiza-
Econometrica, 47, 263 –291 . tional Behavior and Human Decision Processes,
Kahneman, D., & Tversky, A. (1 984). Choices, 88, 41 1 –429.
values, and frames. American Psychologist, 3 9, Levin, I. P., Schneider, S. L., & Gaeth, G. J. (1 998).
3 41 –3 5 0. All frames are not created equal: A typology
Kahneman, D., & Tversky, A. (1 995 ). Con- and critical analysis of framing effects. Orga-
flict resolution: A cognitive perspective. In nizational Behavior and Human Decision Pro-
K. J. Arrow, R. H. Mnookin, L. Ross, A. cesses, 76, 1 49–1 88.
Tversky, & R. B. Wilson (Eds.), Barriers to con- Levin, I. P., Schnittjer, S. K., & Thee, S. L. (1 988).
flict resolution (pp. 45 –60). New York: W. W. Information framing effects in social and per-
Norton. sonal decisions. Journal of Experimental Social
Kahneman, D., & Tversky, A. (Eds.). (2000). Psychology, 2 4, 5 20–5 29.
Choices, values, and frames. Cambridge, UK: Lichtenstein, S., & Slovic, P. (1 971 ). Reversals of
Cambridge University Press. preference between bids and choices in gam-
Keeney, R. L., & Raiffa, H. (1 976). Decisions with bling decisions. Journal of Experimental Psychol-
multiple objectives: Preferences and value trade- ogy, 89, 46–5 5 .
offs. Cambridge, UK: Cambridge University Lichtenstein, S., & Slovic, P. (1 973 ). Response-
Press. induced reversals of preferences in gambling:
An extended replication in Las Vegas. Journal Redelmeier, D. A., & Shafir, E. (1 995 ). Medical
of Experimental Psychology, 1 01 , 1 6–20. decision making in situations that offer multi-
Loewenstein, G. (1 996). Out of control: Visceral ple alternatives. Journal of the American Medical
influences on behavior. Organizational Behav- Association, 2 73 , 3 02–3 05 .
ior and Human Decision Processes, 65 , 272– Redelmeier, D. A., & Tversky, A. (1 990).
292. Discrepancy between medical decisions for
Loewenstein, G., & Prelec, D. (1 992). Anoma- individual patients and for groups. New
lies in intertemporal choice: Evidence and an England Journal of Medicine, 3 2 2 , 1 1 62–
interpretation. The Quarterly Journal of Eco- 1 1 64.
nomics, 1 07, 5 73 –5 97. Redelmeier, D. A., & Tversky, A. (1 992). On the
Loewenstein, G., & Thaler, R. H. (1 989). In- framing of multiple prospects. Psychological Sci-
tertemporal choice. Journal of Economic Perspec- ence, 3 , 1 91 –1 93 .
tives, 3 , 1 81 –1 93 . Rottenstreich, Y., & Hsee, C. K. (2001 ). Money,
Loewenstein, G. F., Weber, E. U., Hsee, C. K., & kisses, and electric shocks: On the affective psy-
Welch, N. (2001 ). Risk as feelings. Psychological chology of risk. Psychological Science, 1 2 , 1 85 –
Bulletin, 1 2 7, 267–286. 1 90.
Lopes, L. L., & Oden, G. C. (1 999). The role Samuelson, W., & Zeckhauser, R. (1 988). Status
of aspiration level in risky choice: A compar- quo bias in decision making. Journal of Risk and
ison of cumulative prospect theory and SP/A Uncertainty, 1 , 7–5 9.
theory. Journal of Mathematical Psychology, 43 , Savage, L. J. (1 95 4). The foundations of statistics.
286–3 1 3 . New York: Wiley.
Mandel, N., & Johnson, E. J. (2002). When Web Schelling, T. (1 980). The intimate contest for self-
pages influence choice: Effects of visual primes command. Public Interest, 60, 94–1 1 8.
on experts and novices. Journal of Consumer Schelling, T. (1 984). Self-command in practice, in
Research, 2 9, 23 5 –245 . policy, and in theory of rational choice. Ameri-
McNeil, B. J., Pauker, S. G., Sox, H. C., & Tversky, can Economic Review, 74, 1 –1 1 .
A. (1 982). On the elicitation of preferences for Schkade, D. A., & Johnson, E. J. (1 989). Cog-
alternative therapies. New England Journal of nitive processes in preference reversals. Orga-
Medicine, 3 06, 1 25 9–1 262. nizational Behavior and Human Decision Pro-
Nisbett, R. E., & Wilson, T. D. (1 977). Telling cesses, 44, 203 –23 1 .
more than we can know: Verbal reports on Schkade, D. A., & Kahneman, D. (1 998). Does
mental processes. Psychological Review, 84, living in California make people happy? A fo-
23 1 –25 9. cusing illusion in judgments of life satisfaction.
Payne, J. W., Bettman, J. R., & Johnson, E. J. Psychological Science, 9, 3 40–3 46.
(1 992). Behavioral decision research: A con- Shafir, E. (1 993 ). Choosing versus rejecting: Why
structive processing perspective. Annual Re- some options are both better and worse than
view of Psychology, 43 , 87–1 3 1 . others. Memory and Cognition, 2 1 , 5 46–5 5 6.
Prelec, D. (2000). Compound invariant weighting Shafir, E. (1 994). Uncertainty and the difficulty
functions in prospect theory. In D. Kahneman of thinking through disjunctions. Cognition, 5 0,
& A. Tversky (Eds.), Choices, values, and frames 403 –43 0.
(pp. 67–92). New York: Cambridge University Shafir, E. (2002). Cognition, intuition, and pol-
Press. icy guidelines. In R. Gowda & J. C. Fox (Eds.),
Quattrone, G. A., & Tversky, A. (1 988). Con- Judgments, decisions, and public policy (pp. 71 –
trasting rational and psychological analyses of 88). New York: Cambridge University Press.
political choice. American Political Science Re- Shafir, E., Diamond, P., & Tversky, A. (1 997).
view, 82 , 71 9–73 6. Money illusion. Quarterly Journal of Economics,
Rabin, M. (1 993 ). Incorporating fairness into 1 1 2 , 3 41 –3 74.
game theory and economics. American Eco- Shafir, E., & LeBoeuf, R. A. (2002). Rationality.
nomic Review, 83 , 1 281 –1 3 02. Annual Review of Psychology, 5 3 , 491 –5 1 7.
Redelmeier, D., Shafir, E., & Aujla, P. (2001 ). The Shafir, E., & LeBoeuf, R. A. (2004). Context and
beguiling pursuit of more information. Medical conflict in multiattribute choice. In D. Koehler
Decision Making, 2 1 , 3 76–3 81 . & N. Harvey (Eds.), Blackwell handbook of
judgment and decision making (pp. 3 41 –3 5 9). judgments. Unpublished working paper, The
Oxford, UK: Blackwell. University of Chicago Law School, Chicago,
Shafir, E., Simonson, I., & Tversky, A. (1 993 ). IL.
Reason-based choice. Cognition, 49, 1 1 –3 6. Thaler, R. H. (1 981 ). Some empirical evidence
Shafir, E., & Tversky, A. (1 992). Thinking through on dynamic inconsistency. Economic Letters, 8,
uncertainty: Nonconsequential reasoning 201 –207.
and choice. Cognitive Psychology, 2 4, 449– Thaler, R. H. (1 991 ). The psychology of
474. choice and the assumptions of economics. In
Sieck, W., & Yates, J. F. (1 997). Exposition effects R. H. Thaler (Ed.), Quasi-rational economics
on decision making: Choice and confidence in (pp. 1 3 7–1 66). New York: Russell Sage Foun-
choice. Organizational Behavior and Human dation.
Decision Processes, 70, 207–21 9. Thaler, R. H. (1 993 ). Advances in behavioral fi-
Simon, H. A. (1 95 5 ). A behavioral model of ra- nance. New York: Russell Sage Foundation.
tional choice. Quarterly Journal of Economics, Thaler, R. H. (1 999). Mental accounting matters.
69, 99–1 1 8. Journal of Behavioral Decision Making, 1 2 , 1 83 –
Simonson, I. (1 989). Choice based on reasons: 206.
The case of attraction and compromise effects. Turner, J. C. (1 985 ). Social categorization and
Journal of Consumer Research, 1 6, 1 5 8–1 74. the self-concept: A social cognitive theory of
Simonson, I. (1 990). The effect of purchase group behavior. In E. J. Lawler (Ed.), Advances
quantity and timing on variety seeking behav- in group processes (Vol. 2, pp. 77–1 21 ). Green-
ior. Journal of Marketing Research, 2 7, 1 5 0– wich, CT: JAI Press.
1 62. Tversky, A., & Kahneman, D. (1 981 ). The fram-
Simonson, I., & Tversky, A. (1 992). Choice in ing of decisions and psychology of choice. Sci-
context: Tradeoff contrast and extremeness ence, 2 1 1 , 45 3 –45 8.
aversion. Journal of Marketing Research, 2 9, Tversky, A., & Kahneman, D. (1 986). Rational
289–295 . choice and the framing of decisions. Journal of
Slovic, P. (1 995 ). The construction of preference. Business, 5 9, s25 1 –s278.
American Psychologist, 5 0, 3 64–3 71 . Tversky, A., & Kahneman, D. (1 991 ). Loss aver-
Slovic, P., Finucane, M., Peters, E., & MacGregor, sion in riskless choice: A reference dependent
D. G. (2002). The affect heuristic. In T. model. Quarterly Journal of Economics, 1 06,
Gilovich, D. Griffin, & D. Kahneman (Eds.), 1 03 9–1 061 .
Heuristics and biases: The psychology of intu- Tversky, A., & Kahneman, D. (1 992). Advances
itive judgment (pp. 3 97–420). New York: Cam- in prospect theory: Cumulative representation
bridge University Press. of uncertainty. Journal of Risk and Uncertainty,
Slovic, P., Fischhoff, B., & Lichtenstein, S. (1 988). 5 , 297–3 23 .
Response mode, framing, and information- Tversky, A., Sattath, S., & Slovic, P. (1 988). Con-
processing effects in risk assessment. In D. E. tingent weighting in judgment and choice. Psy-
Bell, H. Raiffa, & A. Tversky (Eds.), Deci- chological Review, 95 , 3 71 –3 84.
sion making: Descriptive, normative, and pre- Tversky, A., & Shafir, E. (1 992a). Choice under
scriptive interactions (pp. 1 5 2–1 66). Cambridge, conflict: The dynamics of deferred decision.
UK: Cambridge University Press. Psychological Science, 3 , 3 5 8–3 61 .
Slovic, P., Griffin, D., & Tversky, A. (1 990). Tversky, A., & Shafir, E. (1 992b). The disjunction
Compatibility effects in judgment and choice. effect in choice under uncertainty. Psychologi-
In R. M. Hogarth (Ed.), Insights in decision cal Science, 3 , 3 05 –3 09.
making: A tribute to Hillel J. Einhorn (pp. 5 –27). Tversky, A., & Simonson, I. (1 993 ). Context-
Chicago: University of Chicago Press. dependent preferences. Management Science,
Smith, V. L. (1 985 ). Experimental economics: 3 9, 1 1 78–1 1 89.
Reply. American Economic Review, 75 , 265 –272. Tversky, A., Slovic, P., & Kahneman, D. (1 990).
Stanovich, K. E. (1 999). Who is rational? Studies The causes of preference reversal. American
of individual differences in reasoning. Mahwah, Economic Review, 80, 204–21 7.
NJ: Erlbaum. Van Boven, L., Dunning, D., & Loewenstein,
Sunstein, C. R., Kahneman, D., Schkade, D., G. (2000). Egocentric empathy gaps between
& Ritov, I. (2001 ). Predictably incoherent owners and buyers: Misperceptions of the
endowment effect. Journal of Personality and Wilson, T. D., Dunn, D. S., Kraft, D., & Lisle,
Social Psychology, 79, 66–76. D. J. (1 989). Introspection, attitude change,
Van Boven, L., & Loewenstein, G. (2003 ). Social and attitude consistency: The disruptive effects
projection of transient drive states. Personality of explaining why we feel the way we do. In
and Social Psychology Bulletin, 2 9, 1 1 5 9–1 1 68. L. Berkowitz (Ed.), Advances in experimental
Verplanken, B., & Holland, R. W. (2002). Mo- and social psychology (pp. 1 23 –205 ). San Diego:
tivated decision making: Effects of activation Academic Press.
and self-centrality of values on choices and Wilson, T. D., & Schooler, J. W. (1 991 ). Thinking
behavior. Journal of Personality and Social Psy- too much: Introspection can reduce the quality
chology, 82 , 43 4–447. of preferences and decisions. Journal of Person-
von Neumann, J., & Morgenstern, O. (1 944). The- ality and Social Psychology, 60, 1 81 –1 92.
ory of games and economic behavior. Princeton, Wright, J., & Heath, C. (2000, November).
NJ: Princeton University Press. Identity-based choice: Who I am determines what
Wedell, D. H. (1 997). Another look at reasons for I choose. Paper presented at the annual meet-
choosing and rejecting. Memory and Cognition, ing of the Society for Judgment and Decision
2 5 , 873 –887. Making, New Orleans, LA.
CHAPTER 1 2
A Model of Heuristic Judgment
Daniel Kahneman
Shane Frederick
The program of research now known as the of a stranger as menacing entails a prediction
heuristics and biases approach began with a of future harm.
study of the statistical intuitions of experts, The ancient idea that cognitive processes
who were found to be excessively confi- can be partitioned into two main families –
dent in the replicability of results from small traditionally called intuition and reason –
samples (Tversky & Kahneman, 1 971 ). The is now widely embraced under the general
persistence of such systematic errors in the label of dual-process theories (Chaiken &
intuitions of experts implied that their intu- Trope, 1 999; Evans and Over, 1 996; Ham-
itive judgments may be governed by funda- mond, 1 996; Sloman, 1 996, 2002; see Evans,
mentally different processes than the slower, Chap. 8). Dual-process models come in
more deliberate computations they had been many flavors, but all distinguish cognitive
trained to execute. operations that are quick and associative
From its earliest days, the heuristics and from others that are slow and governed by
biases program was guided by the idea that rules (Gilbert, 1 999).
intuitive judgments occupy a position – per- To represent intuitive and deliberate rea-
haps corresponding to evolutionary history – soning, we borrow the terms “system 1 ” and
between the automatic parallel operations “system 2” from Stanovich and West (2002).
of perception and the controlled serial op- Although suggesting two autonomous ho-
erations of reasoning. Intuitive judgments munculi, such a meaning is not intended.
were viewed as an extension of percep- We use the term “system” only as a label for
tion to judgment objects that are not cur- collections of cognitive processes that can
rently present, including mental represen- be distinguished by their speed, their con-
tations that are evoked by language. The trollability, and the contents on which they
mental representations on which intuitive operate. In the particular dual-process model
judgments operate are similar to percepts. we assume, system 1 quickly proposes intu-
Indeed, the distinction between perception itive answers to judgment problems as they
and judgment is often blurry: The perception arise, and system 2 monitors the quality of
2 67
these proposals, which it may endorse, cor- In the context of a dual-system view,
rect, or override. The judgments that are errors of intuitive judgment raise two
eventually expressed are called intuitive if questions: “What features of system 1 cre-
they retain the hypothesized initial proposal ated the error?” and “Why was the error not
with little modification. detected and corrected by system 2?” (cf.
The effect of concurrent cognitive tasks Kahneman & Tversky, 1 982). The first ques-
provides the most useful indication of tion is more basic, of course, but the second
whether a given mental process belongs to is also relevant and ought not be overlooked.
system 1 or system 2. Because the over- Consider, for example, the paragraph that
all capacity for mental effort is limited, ef- Tversky and Kahneman (1 974; p. 3 in
fortful processes tend to disrupt each other, Kahneman, Slovic, & Tversky, 1 982) used to
whereas effortless processes neither cause introduced the notions of heuristic and bias:
nor suffer much interference when com-
bined with other tasks (Kahneman, 1 973 ; The subjective assessment of probability re-
Pashler, 1 998). It is by this criterion that we sembles the subjective assessment of physi-
assign the monitoring function to system 2: cal quantities such as distance or size. These
People who are occupied by a demanding judgments are all based on data of lim-
ited validity, which are processed accord-
mental activity (e.g., attempting to hold in
ing to heuristic rules. For example, the ap-
mind several digits) are much more likely
parent distance of an object is determined
to respond to another task by blurting out in part by its clarity. The more sharply
whatever comes to mind (Gilbert, 1 989). By the object is seen, the closer it appears to
the same criterion, the acquisition of highly be. This rule has some validity, because in
skilled performances – whether perceptual any given scene the more distant objects
or motor – involves the transformation of an are seen less sharply than nearer objects.
activity from effortful (system 2) to effort- However, the reliance on this rule leads to
less (system 1 ). The proverbial chess master systematic errors in the estimation of dis-
who strolls past a game and quips, “White tance. Specifically, distances are often over-
mates in three” is performing intuitively estimated when visibility is poor because
the contours of objects are blurred. On the
(Simon & Chase, 1 973 ).
other hand, distances are often underesti-
Our views about the two systems are
mated when visibility is good because the
similar to the “correction model” proposed objects are seen sharply. Thus the reliance
by Gilbert (1 989, 1 991 ) and to other dual- on clarity as an indication leads to com-
process models (Epstein, 1 994; Hammond, mon biases. Such biases are also found in
1 996; Sloman, 1 996; see also Shweder, intuitive judgments of probability.
1 977). We assume system 1 and system 2
can be active concurrently, that automatic This statement was intended to extend
and controlled cognitive operations compete Brunswik’s (1 943 ) analysis of the percep-
for the control of overt responses, and that tion of distance to the domain of intuitive
deliberate judgments are likely to remain thinking and to provide a rationale for us-
anchored on initial impressions. We also ing biases to diagnose heuristics. However,
assume that the contribution of the two the analysis of the effect of haze is flawed:
systems in determining stated judgments It neglects the fact that an observer looking
depends on both task features and individ- at a distant mountain possesses two relevant
ual characteristics, including the time avail- cues, not one. The first cue is the blur of the
able for deliberation (Finucane et al., 2000), contours of the target mountain, which is
mood (Bless et al., 1 996; Isen, Nygren, & positively correlated with its distance, when
Ashby, 1 988), intelligence (Stanovich & all else is equal. This cue should be given
West, 2002), cognitive impulsiveness (Fred- positive weight in a judgment of distance,
erick, 2004), and exposure to statistical and it is. The second relevant cue, which
thinking (Agnoli, 1 991 ; Agnoli & Krantz, the observer can readily assess by looking
1 989; Nisbett et al., 1 983 ). around, is the ambient or general haziness.
a model of heuristic judgment 2 69
In an optimal regression model for estimat- ships break up within a year?” may answer
ing distance, general haziness is a suppressor as if she had been asked “Do instances of
variable, which must be weighted negatively failed long-distance relationships come read-
because it contributes to blur but is uncor- ily to mind?” This would be an applica-
related with distance. Contrary to the argu- tion of the availability heuristic. A profes-
ment made in 1 974, using blur as a cue does sor who has heard a candidate’s job talk and
not inevitably lead to bias in the judgment now considers the question “How likely is it
of distance – the illusion could just as well that this candidate could be tenured in our
be described as a failure to assign adequate department?” may answer the much easier
negative weight to ambient haze. The effect question: “How impressive was the talk?”.
of haziness on impressions of distance is a This would be an example of one form of
failing of system 1 : The perceptual system is the representativeness heuristic.
not designed to correct for this variable. The The heuristics and biases research pro-
effect of haziness on judgments of distance gram has focused primarily on representa-
is a separate failure of system 2. Although tiveness and availability – two versatile at-
people are capable of consciously correcting tributes that are automatically computed
their impressions of distance for the effects and can serve as candidate answers to many
of ambient haze, they commonly fail to do different questions. It has also focused prin-
so. A similar analysis applies to some of the cipally on thinking under uncertainty. How-
judgmental biases we discuss later, in which ever, the restriction to particular heuristics
errors and biases only occur when both sys- and to a specific context is largely arbitrary.
tems fail. Kahneman and Frederick (2002) argued that
In the following section, we present this process of attribute substitution is a
an attribute-substitution model of heuris- general feature of heuristic judgment; that
tic judgment, which assumes that difficult whenever the aspect of the judgmental ob-
questions are often answered by substi- ject that one intends to judge (the target at-
tuting an answer to an easier one. This tribute) is less readily assessed than a related
elaborates and extends earlier treatments property that yields a plausible answer (the
of the topic (Kahneman & Tversky, 1 982; heuristic attribute), individuals may unwit-
Tversky & Kahneman, 1 974, 1 983 ). Fol- tingly substitute the simpler assessment. For
lowing sections introduce a research design an example, consider the well-known study
for studying attribute substitution, as well by Strack, Martin, and Schwarz (1 988) in
as discuss the controversy over the repre- which college students answered a survey
sentativeness heuristic in the context of a that included these two questions: “How
dual-system view that we endorse. The final happy are you with your life in general?” and
section situates representativeness within “How many dates did you have last month?”
a broad family of prototype heuristics, in The correlation between the two questions
which properties of a prototypical exemplar was negligible when they occurred in the
dominate global judgments concerning an order shown, but rose to .66 if the dating
entire set. question was asked first. We suggest that the
question about dating frequency automati-
cally evokes an evaluation of one’s romantic
Attribute Substitution satisfaction and that this evaluation lingers
to become the heuristic attribute when the
The early research on judgment heuris- global happiness question is subsequently
tics was guided by a simple and general encountered.
hypothesis: When confronted with a diffi- To further illustrate the process of at-
cult question, people may answer an eas- tribute substitution, consider a question in
ier one instead and are often unaware of a study by Frederick and Nelson (2004):
the substitution. A person who is asked “If a sphere were dropped into a open
“What proportion of long-distance relation- cube, such that it just fit (the diameter
of the sphere is the same as the interior these attributes must be drawn from the con-
width of the cube), what proportion of straints of ordinary language and are often
the volume of the cube would the sphere imprecise. For example, the conventional in-
occupy?” The target attribute in this judg- terpretation of overall happiness does not
ment (the volumetric relation between a specify how much weight ought to be given
cube and sphere) is simple enough to be un- to various life domains. However, it certainly
derstood but complicated enough to accom- does require that substantial weight be given
modate a wide range of estimates as plau- to every important domain of life and that
sible answers. Thus, if a relevant simpler no weight at all be given to the current
computation or perceptual impression ex- weather or to the recent consumption of a
ists, respondents will have no strong basis for cookie. Similar rules of common sense ap-
rejecting it as their “final answer.” Frederick ply to judgments of probability. For example,
and Nelson (2004) proposed that the areal the statement “John is more likely to survive
ratio of the respective cross-sections serves a week than a month” is clearly true, and,
that function; that is, that respondents an- thus, implies a rule that people would want
swer as if they were asked the simpler two- their probability judgments to follow. Ac-
dimensional analog of this problem (“If a cordingly, neglect of duration in assessments
circle were drawn inside a square, what proof survival probabilities would be properly
portion of the area of the square does the described as a weighting bias, even if there
circle occupy?”). As evidence, they noted were no way to establish a normative prob-
that the mean estimate of the “sphere inside ability for individual cases (Kahneman &
cube” problem (74%) is scarcely different Tversky, 1 996).
from the mean estimate of the “circle inside For some judgmental tasks, information
square” problem (77%) and greatly exceeds that could serve to supplement or correct the
the correct answer (5 2%) – a correct an- heuristic is not neglected or underweighted
swer that most people, not surprisingly, are but simply lacking. If asked to judge the rela-
surprised by. tive frequency of words beginning with K or
R (Tversky & Kahneman, 1 973 ) or to com-
pare the population of a familiar foreign city
Biases with one that is unfamiliar (Gigerenzer &
Whenever the heuristic attribute differs Goldstein, 1 996), respondents have little re-
from the target attribute, the substitution course but to base their judgments on ease
of one for the other inevitably introduces of retrieval or recognition. The necessary re-
systematic biases. In this treatment, we liance on these heuristic attributes renders
are mostly concerned with weighting bi- such judgments susceptible to biasing factors
ases, which arise when cues available to (e.g., the amount of media coverage). How-
the judge are given either too much or ever, unlike weighting biases, such biases of
too little weight. Criteria for determining insufficient information cannot be described
optimal weights can be drawn from sev- as errors of judgment because there is no way
eral sources. In the classic lens model, the to avoid them.
optimal weights associated with different
cues are the regression weights that opti-
Accessibility and Substitution
mize the prediction of an external criterion,
such as physical distance or the grade point The intent to judge a target attribute initi-
average that a college applicant will attain ates a search for a reasonable value. Some-
(Brunswik, 1 943 ; Hammond, 1 95 5 ). Our times this search ends quickly because the
analysis of weighting biases applies to such required value can be read from a stored
cases, but it also extends to attributes for memory (e.g., the answer to the question
which no objective criterion is available, “How tall are you?”) or a current experience
such as an individual’s overall happiness (e.g., the answer to the question “How much
or the probability that a particular patient do you like this cake?”). For other judg-
will survive surgery. Normative standards for ments, however, the target attribute does
not readily come to mind, but the search stitution in a wide variety of affect-laden
for it evokes other attributes that are con- judgments. Indeed, the evidence suggests
ceptually and associatively related. For ex- that a list of major general-purpose heuris-
ample, a question about overall happiness tics should include an affect heuristic (Slovic
may retrieve the answer to a related ques- et al., 2002). Slovic and colleagues (2002)
tion about satisfaction with a particular as- show that a basic affective reaction gov-
pect of life upon which one is currently erns a wide variety of more complex evalua-
reflecting. tions such as the cost–benefit ratio of various
We adopt the term accessibility to refer technologies, the safe level of chemicals, or
to the ease (or effort) with which particu- even the predicted economic performance
lar mental contents come to mind (see, e.g., of various industries. In the same vein, Kah-
Higgins, 1 996; Tulving & Pearlstone, 1 966). neman and Ritov (1 994) and Kahneman,
The question of why thoughts become ac- Ritov, and Schkade (1 999) proposed that an
cessible – why particular ideas come to mind automatic affective valuation is the principal
at particular times – has a long history in psy- determinant of willingness to pay for public
chology and encompasses notions of stimu- goods, and Kahneman, Schkade, and Sun-
lus salience, associative activation, selective stein (1 998) interpreted jurors’ assessments
attention, specific training, and priming. In of punitive awards as a mapping of outrage
the present usage, accessibility is determined onto a dollar scale of punishments.
jointly by the characteristics of the cogni- Attributes that are not naturally assessed
tive mechanisms that produce it and by the can become accessible if they have been re-
characteristics of the stimuli and events that cently evoked or primed (see, e.g., Bargh et
evoke it, and it may refer to different aspects al., 1 986; Higgins & Brendl, 1 995 ). The ef-
and elements of a situation, different ob- fect of temporary accessibility is illustrated
jects in a scene, or different attributes of an by the “romantic satisfaction heuristic” for
object. judging happiness. The mechanism of at-
Attribute substitution occurs when a rela- tribute substitution is the same, however,
tively inaccessible target attribute is assessed whether the heuristic attribute is chronically
by mapping a relatively accessible and re- or temporarily accessible.
lated heuristic attribute onto the target scale. There is sometimes more than one can-
Some attributes are permanent candidates didate for the role of heuristic attribute. For
for the heuristic role because they are rou- an example that we borrow from Anderson
tinely evaluated as part of perception and (1 991 ), consider the question “Are more
comprehension and therefore always acces- deaths caused by rattlesnakes or bees?” A re-
sible (Tversky & Kahneman, 1 983 ). These spondent who has recently read about some-
natural assessments include physical prop- one who died from a snakebite or bee sting
erties such as size and distance and more may use the relative availability of instances
abstract properties such as similarity (e.g., of the two categories as a heuristic. If no
Tversky & Kahneman, 1 983 ; see Goldstone instances come to mind, that person might
& Son, Chap. 2), cognitive fluency in per- consult his or her impressions of the “dan-
ception and memory (e.g., Jacoby & Dallas, gerousness” of the typical snake or bee, an
1 991 ; Schwarz & Vaughn, 2002; Tversky & application of representativeness. Indeed, it
Kahneman, 1 973 ), causal propensity (Hei- is possible that the question initiates both
der, 1 944; Kahneman & Varey, 1 990; Mi- a search for instances and an assessment of
chotte, 1 963 ), surprisingness (Kahneman & dangerousness, and that a contest of accessi-
Miller, 1 986), mood (Schwarz & Clore, bility determines the role of the two heuris-
1 983 ), and affective valence (e.g., Bargh, tics in the final response. As Anderson ob-
1 997; Cacioppo, Priester, & Berntson, 1 993 ; served, it is not always possible to determine
Kahneman, Ritov, & Schkade, 1 999; Slovic a priori which heuristic will govern the re-
et al., 2002; Zajonc, 1 980, 1 997). sponse to a particular problem.
Because affective valence is a natural as- The original list of heuristics (Tver-
sessment, it is a candidate for attribute sub- sky & Kahneman, 1 974) also included an
“anchoring heuristic.” An anchoring effect, get attribute has no upper bound. Kahne-
however, does not involve the substitution of man, Ritov, and Schkade (1 999) discussed
a heuristic attribute for a target attribute: It two situations in which an attitude (or af-
is due to the temporary salience of a particu- fective valuation) is mapped onto an un-
lar value of the target attribute. However, an- bounded scale of dollars: when respondents
choring and attribute substitution are both in surveys are required to indicate how much
instances of a broader family of accessibility money they would contribute for a cause,
effects (Kahneman, 2003 ). In attribute sub- and when jurors are required to specify an
stitution, a highly accessible attribute con- amount of punitive damages against a neg-
trols the evaluation of a less accessible one. ligent firm. The mapping of attitudes onto
In anchoring, a highly accessible value of dollars is a variant of direct scaling in psy-
the target attribute dominates its judgment. chophysics, where respondents assign num-
This conception is compatible with more bers to indicate the intensity of sensations
recent theoretical treatments of anchor- (Stevens, 1 975 ). The normal practice of di-
ing (see, e.g., Chapman & Johnson, 1 994, rect scaling is for the experimenter to pro-
2002; Mussweiler & Strack 1 999; Strack & vide a modulus – a specified number that
Mussweiler, 1 997). is to be associated with a standard stimu-
lus. For example, respondents may be asked
to assign the number 1 0 to the loudness of
Cross-Dimensional Mapping
a standard sound and judge the loudness
The process of attribute substitution in- of other sounds relative to that standard.
volves the mapping of the heuristic at- Stevens (1 975 ) observed that when the ex-
tribute of the judgment object onto the perimenter fails to provide a modulus, re-
scale of the target attribute. Our notion of spondents spontaneously adopt one. How-
cross-dimensional mapping extends Stevens’ ever, different respondents may pick moduli
(1 975 ) concept of cross-modality matching. that differ greatly (sometimes varying by a
Stevens postulated that intensive attributes factor of 1 00 or more); thus, the variability
(e.g., brightness, loudness, the severity of in judgments of particular stimuli is domi-
crimes) can be mapped onto a common scale nated by arbitrary individual differences in
of sensory strength, allowing direct matching the choice of modulus. A similar analysis
of intensity across modalities – permitting, applies to situations in which respondents
for example, respondents to match the loud- are required to use the dollar scale to ex-
ness of sounds to the severity of crimes. Our press affection for a species or outrage to-
conception allows other ways of compar- ward a defendant. Just as Stevens’ observers
ing values across dimensions, such as match- had no principled way to assign a number to
ing relative positions (e.g., percentiles) a moderately loud sound, survey participants
in the frequency distributions or ranges of and jurors have no principled way to scale
different attributes (Parducci, 1 965 ). An im- affection or outrage into dollars. The anal-
pression of a student’s position in the dis- ogy of scaling without a modulus has been
tribution of aptitude may be mapped di- used to explain the notorious variability of
rectly onto a corresponding position in the dollar responses in surveys of willingness to
distribution of academic achievement and pay and in jury awards (Kahneman, Ritov,
then translated into a letter grade. Note & Schkade, 1 999; Kahneman, Schkade, &
that cross-dimensional matching is inher- Sunstein, 1 998).
ently nonregressive: A judgment or predic-
tion is just as extreme as the impression
System 2 : The Supervision of
mapped onto it. Ganzach and Krantz (1 990)
Intuitive Judgments
applied the term “univariate matching” to a
closely related notion. Our model assumes that an intuitive judg-
Cross-dimensional mapping presents spe- ment is expressed overtly only if it is
cial problems when the scale of the tar- endorsed by system 2. The Stroop task
illustrates this two-system structure. Ob- beyond this simple strategy, but they need
servers who are instructed to report the color more than 5 seconds to do so. As this exam-
in which words are printed tend to stum- ple illustrates, judgment by heuristic often
ble when the word is the name of another yields simplistic assessments, which system 2
color (e.g., the word BLUE printed in green sometimes corrects by bringing additional
ink). The difficulty arises because the word is considerations to bear.
automatically read, and activates a response Attribute substitution can be prevented
(“blue” in this case) that competes with the by alerting respondents to the possibility
required response (“green”). Errors are rare that their judgment could be contaminated
in the Stroop test, indicating generally suc- by an irrelevant variable. For example, al-
cessful monitoring and control of the overt though sunny or rainy weather typically af-
response, but the conflict produces delays fects reports of well-being, Schwarz and
and hesitations. The successful suppression Clore (1 983 ) found that weather has no
of erroneous responses is effortful, and its effect if respondents are asked about the
efficacy is reduced by stress and distraction. weather just before answering the well-
Gilbert (1 989) described a correction being question. Apparently, this question re-
model in which initial impulses are often minds respondents that their current mood
wrong and normally overridden. He argued (a candidate heuristic attribute) is influ-
that people initially believe whatever they enced by a factor (current weather) that is
are told (e.g., “Whitefish love grapes”) and irrelevant to the requested target attribute
that it takes some time and mental effort to (overall well-being). Schwarz (1 996) also
“unbelieve” such dubious statements. Here found that asking people to describe their
again, cognitive load disrupts the control- satisfaction with some particular domain of
ling operations of system 2, increasing the life reduces the weight this domain receives
rate of errors and revealing aspects of intu- in a subsequent judgment of overall well be-
itive thinking that are normally suppressed. ing. As these examples illustrate, although
In an ingenious extension of this approach, priming typically increases the weight of that
Bodenhausen (1 990) exploited natural tem- variable on judgment (a system 1 effect), this
poral variability in alertness. He found that does not occur if the prime is a sufficiently
“morning people” were substantially more explicit reminder that brings the self-critical
susceptible to a judgment bias (the conjunc- operations of system 2 into play.
tion fallacy) in the evening and that “evening We suspect that system 2 endorsements of
people” were more likely to commit the fal- intuitive judgments are granted quite casu-
lacy in the morning. ally under normal circumstances. Consider
Because system 2 is relatively slow, its op- the puzzle “A bat and a ball cost $1 .1 0 in to-
erations can be disrupted by time pressure. tal. The bat costs $1 more than the ball. How
Finucane et al. (2000) reported a study in much does the ball cost?” Almost everyone
which respondents judged the risks and ben- we ask reports an initial tendency to answer
efits of various products and technologies “1 0 cents” because the sum $1 .1 0 separates
(e.g., nuclear power, chemical plants, cellu- naturally into $1 and 1 0 cents, and 1 0 cents
lar phones). When participants were forced is about the right magnitude. Many peo-
to respond within 5 seconds, the correlations ple yield to this immediate impulse. Even
between their judgments of risks and their among undergraduates at elite institutions,
judgments of benefits were strongly nega- about half get this problem wrong when it
tive. The negative correlations were much is included in a short IQ test (Frederick,
weaker (although still pronounced) when re- 2004). The critical feature of this problem
spondents were given more time to ponder is that anyone who reports 1 0 cents has ob-
a response. When time is short, the same viously not taken the trouble to check his
affective evaluation apparently serves as a or her answer. The surprisingly high rate
heuristic attribute for assessments of both of errors in this easy problem illustrates
benefits and risks. Respondents can move how lightly system 2 monitors the output of
system 1 : People are often content to trust one group of respondents provides judg-
a plausible judgment that quickly comes to ments of a target attribute for a set of ob-
mind. (The correct answer, by the way, is jects and another group evaluates the hy-
5 cents.) pothesized heuristic attribute for the same
The bat and ball problem elicits many er- objects. The substitution hypothesis im-
rors, although it is not really difficult and plies that the judgments of the two groups,
certainly not ambiguous. A moral of this when expressed in comparable units (e.g.,
example is that people often make quick percentiles), will be identical. This section
intuitive judgments to which they are not examines several applications of heuristic
deeply committed. A related moral is that elicitation.
we should be suspicious of analyses that ex-
plain apparent errors by attributing to re- Eliciting Representativeness
spondents a bizarre interpretation of the
question. Consider someone who answers a Figure 1 2.1 displays the results of two ex-
question about happiness by reporting her periments in which a measure of represen-
satisfaction with her romantic life. The re- tativeness was elicited. These results were
spondent is surely not committed to the ab- published long ago, but we repeat them here
surdly narrow interpretation of happiness because they still provide the most direct
that her response seemingly implies. More evidence for both attribute substitution and
likely, at the time of answering, she thinks the representativeness heuristic. For a more
that she is reporting happiness: A judgment recent application of a similar design, see
comes quickly to mind and is not obviously Bar-Hillel and Neter (1 993 ).
mistaken – end of story. Similarly, we pro- The object of judgment in the study from
pose that respondents who judge probabil- which Figure 1 2.1 (a) is drawn (Kahneman &
ity by representativeness do not seriously be- Tversky, 1 973 ; p. 1 27 in Kahneman, Slovic,
lieve that the questions “How likely is X to & Tversky, 1 982) was the following descrip-
be a Y?” and “How much does X resemble tion of a fictitious graduate student, which
the stereotype of Y?” are synonymous. Peo- was shown along with a list of nine fields of
ple who make a casual intuitive judgment graduate specialization:
normally know little about how their judg- Tom W. is of high intelligence, although
ment came about and know even less about lacking in true creativity. He has a need
its logical entailments. Attempts to recon- for order and clarity and for neat and tidy
struct the meaning of intuitive judgments by systems in which every detail finds its ap-
interviewing respondents (see, e.g., Hertwig propriate place. His writing is rather dull
& Gigerenzer, 1 999) are therefore unlikely and mechanical, occasionally enlivened by
to succeed because such probes require bet- somewhat corny puns and by flashes of
ter introspective access and more coherent imagination of the sci-fi type. He has a
beliefs than people normally muster. strong drive for competence. He seems to
have little feel and little sympathy for other
people and does not enjoy interacting with
others. Self-centered, he nonetheless has a
deep moral sense.
Identifying a Heuristic
Participants in a representativeness group
Hypotheses about judgment heuristics have ranked the nine fields of specialization by
most often been studied by examining the degree to which Tom W. “resembles a
weighting biases and deviations from nor- typical graduate student.” Participants in the
mative rules. However, the hypothesis that probability group ranked the nine fields ac-
one attribute is substituted for another in a cording to the likelihood of Tom W.’s spe-
judgment task – for example, representative- cializing in each. Figure 1 2.1 (a) plots the
ness for probability – can also be tested more mean judgments of the two groups. The
directly. In the heuristic elicitation design, correlation between representativeness and
a Tom W. b Linda
9 7
8
6
mean rank (likelihood)
mean rank (likelihood)

7
5
6
5 4
4
3
3
2
2
1 1
1 2 3 4 5 6 7 8 9 1 2 3 4 5 6 7
mean rank (similarity) mean rank (similarity)
Figure 1 2 .1 . (a) Plot of average ranks for nine outcomes for Tom W. ranked by probability and by
similarity to stereotypes of graduate students in various fields. (b) Plot of average ranks for eight
outcomes for Linda ranked by probability and by representativeness.
probability is nearly perfect (.97). No The logic of probabilistic prediction in

stronger support for attribute-substitution this task suggests that the ranking of out-
could be imagined. However, interpreting comes by their probabilities should be in-
representativeness as the heuristic attribute termediate between their rankings by rep-
in these judgments does require two addi- resentativeness and by base rate frequencies.
tional plausible assumptions – that represen- Indeed, if the personality description is taken
tativeness is more accessible than probabil- to be a poor source of information, proba-
ity, and that there is no third attribute that bility judgments should stay quite close to
could explain both judgments. the base rates. The description of Tom W.
The Tom W. study was also intended to was designed to allow considerable scope
examine the effect of the base rates of out- for judgments of probability to diverge from
comes on categorical prediction. For that judgments of representativeness, as this logic
purpose, respondents in a third group esti- requires. Figure 1 2.1 (a) shows no such di-
mated the proportion of graduate students vergence. Thus, the results of the Tom W.
enrolled in each of the nine fields. By design, study simultaneously demonstrate the sub-
some outcomes were defined quite broadly, stitution of representativeness for probabil-
whereas others were defined more narrowly. ity and the neglect of known (but not explic-
As intended, estimates of base rates var- itly mentioned) base rates.
ied markedly across fields, ranging from 3 % Figure 1 2.1 (b) is drawn from an early
for Library Science to 20% for Humanities study of the Linda problem, the best-known
and Education. Also by design, the descrip- and most controversial example in the rep-
tion of Tom W. included characteristics (e.g., resentativeness literature (Tversky & Kahne-
introversion) that were intended to make man, 1 982) in which a woman named Linda
him fit the stereotypes of the smaller fields was described as follows:
(library science, computer science) better Linda is 3 1 years old, single, outspoken
than the larger fields (humanities and social and very bright. She majored in philoso-
sciences).1 As intended, the correlation be- phy. As a student she was deeply concerned
tween the average judgments of representa- with issues of discrimination and social jus-
tiveness and of base rates was strongly nega- tice and also participated in antinuclear
tive (−.65 ). demonstrations.
As in the Tom W. study, separate groups they were separated by a filler so respondents
of respondents were asked to rank a set of would not feel compelled to compare them.
eight outcomes by representativeness and In the anthropomorphic language used here,
probability. The results are shown in Fig- system 2 was given a chance to correct the
ure 1 2.1 (b). Again the correlation between judgment but was not prompted to do so.
these rankings was almost perfect (.99).1 In view of the confusing controversy that
Six of the eight outcomes that subjects followed, it is perhaps unfortunate that the
were asked to rank were fillers (e.g., ele- articles documenting base rate neglect and
mentary school teacher, psychiatric social conjunction errors did not stop with subtle
worker). The two critical outcomes were #6 tests. Each article also contained an experi-
(bank teller) and the so-called conjunction mental flourish – a demonstration in which
item #8 (bank teller and active in the fem- the error occurred in spite of a manipula-
inist movement). Most subjects ranked the tion that called participants’ attention to the
conjunction higher than its constituent, both critical variable. The engineer–lawyer prob-
in representativeness (85 %) and probabil- lem (Kahneman & Tversky, 1 973 ) included
ity (89%). The observed ranking of the two special instructions to ensure that respon-
items is quite reasonable for judgments of dents would notice the base rates of the
similarity, but not for probability: Linda may outcomes. The brief personality descriptions
resemble a feminist bank teller more than shown to respondents were reported to have
she resembles a bank teller, but she cannot been drawn from a set containing descrip-
be more likely to be a feminist bank teller tions of 3 0 lawyers and 70 engineers (or vice
than to be a bank teller. In this problem, re- versa), and respondents were asked “What
liance on representativeness yields probabilis the probability that this description be-
ity judgments that violate a basic logical rule. longs to one of the 3 0 lawyers in the sample
As in the Tom W. study, the results make two of 1 00?” To the authors’ surprise, base rates
points: They support the hypothesis of at- were largely neglected in the responses, de-
tribute substitution and also illustrate a pre- spite their salience in the instructions. Sim-
dictable judgment error. ilarly, the authors were later shocked to dis-
cover that more than 80% of undergraduates
committed a conjunction error even when
asked point blank whether Linda was more
The Representativeness Controversy likely to be “a bank teller” or “a bank teller
who is active in the feminist movement”
The experiments summarized in Figure 1 2.1 (Tversky & Kahneman, 1 983 ). The novelty
provided direct evidence for the represen- of these additional direct or “transparent”
tativeness heuristic and two concomitant tests was the finding that respondents con-
biases: neglect of base rates and conjunc- tinued to show the biases associated with
tion errors. In the terminology introduced representativeness even in the presence of
by Tversky and Kahneman (1 983 ), the de- strong cues pointing to the normative re-
sign of these experiments was “subtle”: Ad- sponse. The errors that people make in trans-
equate information was available for partic- parent judgment problems are analogous to
ipants to avoid the error, but no effort was observers’ failure to allow for ambient haze
made to call their attention to that informa- in estimating distances: A correct response
tion. For example, participants in the Tom is within reach, but not chosen, and the fail-
W. experiment had general knowledge of the ure involves an unexpected weakness of the
relative base rates of the various fields of spe- corrective operations of system 2.
cialization, but these base rates were not ex- Discussions of the heuristics and biases
plicitly mentioned in the problem. Similarly, approach have focused almost exclusively
both critical items in the Linda experiment on the direct conjunction fallacy and on
were included in the list of outcomes, but the engineer–lawyer problems. These are
also the only studies that have been exten- A more lenient concept, reasoning ra-
sively replicated with varying parameters. tionality, only requires an ability to reason
The amount of critical attention is remark- correctly about the information currently
able because the studies were not, in fact, at hand without demanding perfect consis-
essential to the authors’ central claim. In tency among beliefs that are not simulta-
terms of the present treatment, the claim neously evoked. The best known violation
was that intuitive prediction is an operation of reasoning rationality is the famous “four
of system 1 , which is susceptible to both base card” problem (Wason, 1 960). The failure of
rate neglect and conjunction fallacies. There intelligent adults to reason their way through
was no intent to deny the possibility of sys- this problem is surprising because the prob-
tem 2 interventions that would modify or lem is “easy” in the sense of being easily
override intuitive predictions. Thus, the ar- understood once explained. What everyone
ticles in which these studies appeared would learns, when first told that intelligent peo-
have been substantially the same, although ple fail to solve the four-card problem, is
far less provocative, if respondents had over- that one’s expectations about human rea-
come base rate neglect and conjunction er- soning abilities had not been adequately cal-
rors in transparent tests. ibrated. There is, of course, no well-defined
To appreciate why the strong forms of metric of reasoning rationality, but whatever
base rate neglect and of the conjunction fal- metric one uses, the Wason problem calls
lacy sparked so much controversy, it is use- for a downward adjustment. The surprising
ful to distinguish two conceptions of human results of the Linda and engineer–lawyer
rationality (Kahneman, 2000b). Coherence problems led Tversky and Kahneman to a
rationality is the strict conception that re- similar realization: The reasoning of their
quires the agent’s entire system of beliefs subjects was less proficient than they had an-
and preferences to be internally consistent ticipated. Many readers of the work shared
and immune to effects of framing and con- this conclusion, but many others strongly
text. For example, an individual’s probabil- resisted it.
ity p (“Linda is a bank teller”) should be the The implicit challenge to reasoning ra-
sum of the probabilities p (“Linda is a bank tionality was met by numerous attempts to
teller and a feminist”), and p (“Linda is a bank dismiss the findings of the engineer–lawyer
teller and not a feminist”). A subtle test of and the Linda studies as artifacts of ambigu-
coherence rationality could be conducted by ous language, confusing instructions, conver-
asking individuals to assess these three prob- sational norms, or inappropriate normative
abilities on separate occasions under circum- standards. Doubts have been raised about
stances that minimize recall. Coherence can the proper interpretation of almost every
also be tested in a between-groups design. If word in the conjunction problem, including
random assignment is assumed, the sum of “bank teller,” “probability,” and even “and”
the average probabilities assigned to the two (see, e.g., Dulany & Hilton, 1 991 ; Hilton &
component events should equal the average Slugoski, 2001 ). These claims are not dis-
judged probability of “Linda is a bank teller.” cussed in detail here. We suspect that most
If this prediction fails, then at least some of them have some validity and that they
individuals are incoherent. Demonstrations identified mechanisms that may have made
of incoherence present a significant chal- the results in the engineer–lawyer and Linda
lenge to important models of decision the- studies exceptionally strong. However, we
ory and economics, which attribute to agents note a significant weakness shared by all
a very strict form of rationality (Tversky & these critical discussions: They provide no
Kahneman, 1 986). Failures of perfect coher- explanation of the essentially perfect con-
ence are less provocative to psychologists, sistency of the judgments observed in di-
who have a more realistic view of human rect tests of the conjunction rule and in
capabilities. three other types of experiments: subtle
comparisons, between-Ss comparisons, and are successfully overcome and some circum-
most important, judgments of representa- stances under which intuitions may not be
tiveness (see also Bar-Hillel & Neter, 1 993 ). evoked at all.
Interpretations of the conjunction fallacy
as an artifact implicitly dismiss the results statistical sophistication
of Figure 1 2.1 (b) as a coincidence (for an
exception, see Ayton, 1 998). The story of The performance of statistically sophisti-
the engineer-lawyer problem is similar. Here cated groups of respondents in different ver-
again, multiple demonstrations in which sions of the Linda problem illustrates the ef-
base rate information was used (see Koehler, fects of both expertise and research design
1 996, for a review) invited the inference that (Tversky & Kahneman, 1 983 ). Statistical ex-
there is no general problem of base rate ne- pertise provided no advantage in the eight-
glect. Again, the data of prediction by repre- item version in which the critical items were
sentativeness in Figure 1 2.1 (a) (and related separated by a filler and were presumably
results reported by Kahneman & Tversky, considered separately. In the two-item ver-
1 973 ) were ignored. sion, in contrast, respondents were effec-
The demonstrations that under some con- tively compelled to compare “bank teller”
ditions people avoid the conjunction fallacy with “bank teller and is active in the femi-
in direct tests, or use explicit base rate in- nist movement.” The incidence of conjunc-
formation, led some scholars to the blanket tion errors remained essentially unchanged
conclusion that judgment biases are artifi- among the statistically naive in this condi-
cial and fragile and that there is no need for tion but dropped dramatically for the statis-
judgment heuristics to explain them. This tically sophisticated. Most of the experts fol-
position was promoted most vigorously by lowed logic rather than intuition when they
Gigerenzer (1 991 ). Kahneman and Tversky recognized that one of the categories con-
(1 996) argued in response that the heuris- tained the other. In the absence of a prompt
tics and biases position does not preclude the to compare the items, however, the statis-
possibility of people’s performing flawlessly tically sophisticated made their predictions
in particular variants of the Linda and the in the same way as everyone else does – by
engineer–lawyer problems. Because laypeo- representativeness. As Stephen Jay Gould
ple readily acknowledge the validity of (1 991 , p. 469) noted, knowledge of the truth
the conjunction rule and the relevance of does not dislodge the feeling that Linda is a
base rate information, the fact that they feminist bank teller: “I know [the right an-
sometimes obey these principles is neither a swer], yet a little homunculus in my head
surprise nor an argument against the role of continues to jump up and down, shouting at
representativeness in routine intuitive pre- me – ‘but she can’t just be a bank teller; read
diction. However, the study of conditions the description.’”
under which errors are avoided can help us
understand the capabilities and limitations intelligence
of system 2. We develop this argument fur- Stanovich (1 999) and Stanovich and West
ther in the next section. (2002) observed a generally negative corre-
lation between conventional measures of in-
telligence and susceptibility to judgment bi-
Making Biases Disappear: A Task
ases. They used transparent versions of the
for System 2
problems, which include adequate cues to
Much has been learned over the years about the correct answer and therefore provide
variables and experimental procedures that a test of reasoning rationality. Not surpris-
reduce or eliminate the biases associated ingly, intelligent people are more likely to
with representativeness. We next discuss possess the relevant logical rules and also to
conditions under which errors of intuition recognize the applicability of these rules in
particular situations. In the terms of manipulations of attention

the present analysis, high-IQ respondents The weight of neglected variables can be in-
benefit from relatively efficient system 2 op- creased by drawing attention to them, and
erations that enable them to overcome er- experimenters have devised many ingenious
roneous intuitions when adequate informa- ways to do so. Schwarz et al. (1 991 ) found
tion is available. (However, when a problem that respondents pay more attention to base
is too difficult for everyone, the correlation rate information when they are instructed
may reverse because the more intelligent re- to think as statisticians rather than clini-
spondents are more likely to agree on a plau- cal psychologists. Krosnick, Li, and Lehman
sible error than to respond randomly, as dis- (1 990) exploited conversational conventions
cussed in Kahneman, 2000b.) about the sequencing of information and
confirmed that the impact of base rate in-
formation was enhanced by presenting that
frequency format information after the personality descrip-
Relative frequencies (e.g., 1 in 1 0) are more tion rather than before it. Attention to the
vividly represented and more easily under- base rate is also enhanced when partici-
stood than equivalent probabilities (.1 0) or pants observe the drawing of descriptions
percentages (1 0%). For example, the emo- from an urn (Gigerenzer, Hell, & Blank,
tional impact of statements of risk is en- 1 988) perhaps because watching the draw-
hanced by the frequency format: “1 person ing induces conscious expectations that re-
in 1 000 will die” is more frightening than a flect the known proportions of possible out-
probability of .001 (Slovic et al., 2002). The comes. The conjunction fallacy can also
frequency representation also makes it eas- be reduced or eliminated by manipulations
ier to visualize partitions of sets and detect that increase the accessibility of the rel-
that one set is contained in another. As a evant rule, including some linguistic vari-
consequence, the conjunction fallacy is gen- ations (Macchi, 1 995 ), and practice with
erally avoided in direct tests in which the logical problems (Agnoli, 1 991 ; Agnoli &
frequency format makes it easy to recog- Krantz, 1 989).
nize that feminist bank tellers are a subset of The interpretation of these attentional ef-
bank tellers (Gigerenzer & Hoffrage, 1 995 ; fects is straightforward. We assume most
Tversky & Kahneman, 1 983 ). For similar rea- participants in judgment studies know, at
sons, some base rate problems are more eas- least vaguely, that the base rate is rele-
ily solved when couched in frequencies than vant and that the conjunction rule is valid
in probabilities or percentages (Cosmides & (Kahneman & Tversky, 1 982). Whether they
Tooby, 1 996). However, there is little sup- apply this knowledge to override an intu-
port for the more general claims about the itive judgment depends on their cognitive
evolutionary adaptation of the mind to deal skills (education, intelligence) and on for-
with frequencies (Evans et al., 2000). Fur- mulations that make the applicability of a
thermore, the ranking of outcomes by pre- rule apparent (frequency format) or a rel-
dicted relative frequency is very similar to evant factor more salient (manipulations of
the ranking of the same outcomes by rep- attention). We assume intuitions are less sen-
resentativeness (Mellers, Hertwig, & Kahne- sitive to these factors and that the appear-
man, 2001 ). We conclude that the frequency ance or disappearance of biases mainly re-
format affects the corrective operations of flects variations in the efficacy of corrective
system 2, not the intuitive operations of sys- operations. This conclusion would be circu-
tem 1 . The language of frequencies improves lar, of course, if the corrective operations
respondents’ ability to impose the logic of were both inferred from the observation of
set inclusion on their considered judgments correct performance and used to explain that
but does not reduce the role of representa- performance. Fortunately, the circularity can
tiveness in their intuitions. be avoided because the role of system 2
can be verified – for example, by using ma- observed that people often average or add
nipulations of time pressure, cognitive load, where they should multiply.
or mood to interfere with its operations.
In summary, the factorial design is not
appropriate for testing hypotheses about bi-
within-subjects factorial designs
ases of neglect because it effectively guaran-
The relative virtues of between-subjects and tees that no manipulated factor is neglected.
within-subject designs in studies of judg- Figure 1 2.2 illustrates this claim by sev-
ment are a highly contentious issue. Facto- eral examples of an additive extension effect
rial designs have their dismissive critics (e.g., that we discuss further in the next section.
Poulton, 1 989) and their vigorous defenders The experiments summarized in the differ-
(e.g., Birnbaum, 1 999). We do not attempt ent panels share three important features:
to adjudicate this controversy here. Our nar- (1 ) In each case, the quantitative variable
rower point is that between-subjects designs plotted on the abscissa was completely ne-
are more appropriate for the study of heuris- glected in similar experiments conducted in
tics of judgment. The following arguments a between-subjects or subtle design; (2) in
favor this conclusion: each case, the quantitative variable com-
r Factorial designs are transparent. Partici- bines additively with other information; (3 )
in each case, a compelling normative ar-
pants are likely to identify the variables gument can be made for a quasimulti-
that are manipulated, especially if there plicative rule in which the lines shown in
are many trials and especially in a fully Figure 1 2.2 should fan out. For example, Fig-
factorial design in which the same stimu- ure 1 2.2(c) presents a study of categorical
lus attributes are repeated in varying com- prediction (Novemsky & Kronzon, 1 999) in
binations. The message that the design which respondent 5 judged the relative like-
conveys to the participants is that the ex- lihood that a person was a member of one
perimenter expects to find effects of ev- occupation rather than another (e.g., com-
ery factor that is manipulated (Bar-Hillel puter programmer vs. flight attendant) on
& Fischhoff, 1 981 ; Schwarz, 1 996). the basis of short personality sketches (e.g.,
r Studies that apply a factorial design
“shy, serious, organized, and sarcastic”) and
to judgment tasks commonly involve one of three specified base rates (1 0%, 5 0%,
schematic and impoverished stimuli. The or 90%). Representativeness and base rate
tasks are also highly repetitive. These were varied factorially within subjects. The
features encourage participants to adopt effect of base rate is clearly significant in this
simple mechanical rules that will allow design (see also Birnbaum & Mellers, 1 983 ).
them to respond quickly without forming Furthermore, the effects of representative-
an individuated impression of each stim- ness and base rate are strictly additive. As
ulus. For example, Ordóñez and Benson Anderson (1 996) argued, averaging (a spe-
(1 997) required respondents to judge the cial case of additive combination) is the most
attractiveness of gambles on a 1 00-point obvious way to combine the effects of two
scale. They found that under time pres- variables that are recognized as relevant (e.g.,
sure many respondents computed or esti- “She looks like a bank teller, but the base-rate
mated the expected values of the gambles is low.”). Additivity is not normatively ap-
and used the results as attractiveness rat- propriate in this case – any Bayes-like com-
ings (e.g., a rating of 1 5 for a 5 2% chance bination would produce curves that initially
to win $3 1 .5 0). fan out from the origin and converge again
r Factorial designs often yield judgments at high values. Similar considerations apply
that are linear combinations of the ma- to the other three panels of Figure 1 2.2 dis-
nipulated variables. This is a central cussed later. Between-subjects and factorial
conclusion of a massive research effort designs often yield different results in stud-
conducted by Anderson (1 996), who ies of intuitive judgment. Why should we
a Kahneman, Ritov, & Schkade Data b Schreiber & Kahneman Data
35 10
9
Mean contribution in $
30
8
25 7 71 dB
Aversiveness
Low
20 6
75 dB
Medium 5
15 78 dB
High 4
10 3 80 dB
5 2
1
0 0
0 20 40 60 80 0 5 10 15 20 25 30
Percentage Population Decline Duration in Seconds
c Novemsky & Kronzon Data d Ariely Data
90 80
80 75
Posterior Probability (%)
70 70
65
Aversiveness
60 Programmer Down
60
50 Surgeon Down&Up
55
40 Accountant
50 Up
`
30 Engineer 45 Up&Down
20 40
10 35
30
0
0 20 40 60 80 100 0 10 20 30 40 50
Base-rate (%) Duration in Seconds
Figure 1 2 .2 . (a) Willingness to pay to restore damage to species that differ in popularity as a function
of the damage they have suffered (from Kahneman, Ritov, & Schkade 2000); (b) global evaluations of
aversive sounds of different loudness as a function of duration for subjects selected for their high
sensitivity to duration (from Schreiber & Kahneman, 2000); (c) ratings of probability for predictions
that differ in representativeness as a function of base rate frequency (from Novemsky & Kronzon,
1 999); (d) global evaluations of episodes of painful pressure that differ in temporal profile as a
function of duration (Ariely, 1 998).
believe one design rather than the other? are made and is more likely to evoke the ca-
The main argument against the factorial de- sually intuitive mode of judgment that gov-
sign is its poor ecological validity. Encounter- erns much of mental life in routine situations
ing multiple judgment objects in rapid suc- (e.g., Langer, 1 978).
cession in a rigidly controlled structure is
unique to the laboratory, and the solutions
that they evoke are not likely to be typical.
Direct comparisons among concepts that Prototype Heuristics and the Neglect
differ in only one variable – such as bank of Extension
teller and feminist bank tellers – also provide
a powerful hint and a highly unusual oppor- In this section, we offer a common account
tunity to overcome intuitions. The between- of three superficially dissimilar judgmental
subjects design, in contrast, mimics the hap- tasks: (1 ) categorical prediction (e.g., “In a
hazard encounters in which most judgments set of 3 0 lawyers and 70 engineers, what is the
probability that someone described as ‘charm- ement of the set adds to the overall judg-
ing, talkative, clever, and cynical’ is one of the ment an amount that depends on the el-
lawyers?”); (2) summary evaluations of past ements already included. In simple cases,
events (e.g., “Overall, how aversive was it to conditional adding is just regular adding –
be exposed for 3 0 minutes to your neighbor’s the total weight of a collection of chairs is
car alarm?”); and (3 ) economic valuations the sum of their individual weights. In other
of public goods (e.g., “What is the most you cases, each element of the set contributes
would be willing to pay to prevent 2 00,000 mi- to the overall judgment, but the combina-
grating birds from drowning in uncovered oil tion rule is not simple addition and is most
ponds?”). We propose that a generalization typically subadditive. For example, the eco-
of the representativeness heuristic accounts nomic value of protecting X birds should be
for the remarkably similar biases that are ob- increasing in X, but the value of saving 2000
served in these diverse tasks. birds is for most people less than twice as
The original analysis of categorical pre- large as the value of saving 1 000 birds.
diction by representativeness (Kahneman & The logic of categorical prediction entails
Tversky 1 973 ; Tversky & Kahneman, 1 983 ) that the probability of membership in a cat-
invoked two assumptions in which the word egory should vary with its relative size, or
“representative” was used in different ways: base rate. In prediction by representative-
(1 ) A prototype (a representative exemplar) ness, however, the representation of out-
is used to represent categories (e.g., bank comes by prototypical exemplars effectively
tellers) in the prediction task, and (2) the discards base rates because the prototype of a
probability that the individual belongs to a category (e.g., lawyers) contains no informa-
category is judged by the degree to which the tion about the size of its membership. Next,
individual resembles (is representative of) the we show that phenomena analogous to the
category stereotype. Thus, categorical pre- neglect of base rate are observed in other
diction by representativeness involves two prototype heuristics: The monetary value at-
separate acts of substitution – the substitu- tached to a public good is often insensitive
tion of a representative exemplar for a cat- to its scope, and the global evaluation of a
egory and the substitution of the heuris- temporally extended experience is often in-
tic attribute of representativeness for the sensitive to its duration. These various in-
target attribute of probability. Perhaps be- stantiations of extension neglect (neglect of
cause they share a label, the two pro- base rates, scope, and duration) have been
cesses have not been distinguished in dis- discussed in separate literatures, but all can
cussions of the representativeness heuristic. be explained by the two-part process that
We separate them here by describing proto- defines prototype heuristics: (1 ) A category
type heuristics in which a prototype is sub- is represented by a prototypical exemplar,
stituted for its category, but in which repre- and (2) a (nonextensional) property of the
sentativeness is not necessarily the heuristic prototype is then used as a heuristic attribute
attribute. to evaluate an extensional target attribute of
The target attributes to which prototype the category. As might be expected from the
heuristics are applied are extensional. An ex- earlier discussion of base rate neglect, exten-
tensional attribute pertains to an aggregated sion neglect in all its forms is most likely to be
property of a set or category for which an observed in between-subjects experiments.
extension is specified – the probability that Within-subject factorial designs consistently
a set of 3 0 lawyers includes Jack, the over- yield the additive extension effect illustrated
all unpleasantness of a set of moments of in Figure 1 2.2.
hearing a neighbor’s car alarm, and the per-
sonal dollar value of saving a certain number
Scope Neglect in Willingness to Pay
of birds from drowning in oil ponds. Nor-
mative judgments of extensional attributes The contingent valuation method (CVM)
are governed by a general principle of con- was developed by resource economists (see
ditional adding, which dictates that each el- Mitchell & Carson, 1 989) as a tool for
assessing the value of public goods for pur- for the more complex target attribute of
poses of litigation or cost–benefit analysis. economic value. Other examples of radical
Participants in contingent valuation (CV) insensitivity to scope lend themselves to a
surveys are asked to indicate their willing- similar interpretation. Among others, Kah-
ness to pay (WTP) for specified public goods, neman (1 986) found that Toronto residents
and their responses are used to estimate the were willing to pay almost as much to clean
total amount that the community would pay up polluted lakes in a small region of On-
to obtain these goods. The economists who tario as to clean up all the polluted lakes in
design contingent valuation surveys inter- Ontario, and McFadden and Leonard (1 993 )
pret WTP as a valid measure of economic reported that residents in four western states
value and assume that statements of WTP were willing to pay only 28% more to protect
conform to the extensional logic of con- 5 7 wilderness areas than to protect a single
sumer theory. The relevant logic has been area (for more discussion of scope insensitiv-
described by a critic of CVM (Diamond, ity, see Frederick & Fischhoff, 1 998).
1 996), who illustrates the conditional adding The similarity between WTP statements
rule by the following example: In the ab- and categorical predictions is not limited
sence of income effects, WTP for saving X to such demonstrations of almost complete
birds should equal WTP for saving (X − k) extension neglect. The two responses also
birds, plus WTP to save k birds, where the yield similar results when extension and
last value is contingent on the costless prior prototype information are varied factori-
provision of safety for (X − k) birds. ally within subjects. Figure 1 2.2(a) shows
Strict adherence to Bayes’ rule may be the results of a study of WTP for pro-
an excessively demanding standard for intu- grams that prevented different levels of
itive predictions; similarly, it would be too damage to species of varying popularity
much to ask for WTP responses that strictly (Ritov & Kahneman, unpublished observa-
conform to the “add-up rule.” In both cases, tions, cited in Kahneman, Ritov, & Schkade,
however, it seems reasonable to expect some 1 999). As in the case of base rate [Figure
sensitivity to extension – to the base rate 1 2.2(c)], extensional information (levels of
of outcomes in categorical prediction and to damage) combines additively with nonex-
the scope of the good in WTP. In fact, several tensional information. This rule of combina-
studies have documented nearly complete tion is unreasonable; in any plausible theory
neglect of scope in CV surveys. The best- of value, the lines would fan out.
known demonstration of scope neglect is an Finally, the role of the emotion evoked
experiment by Desvouges et al. (1 993 ), who by a prototypical instance was also exam-
used the scenario of migratory birds that ined directly in the same experiment, us-
drown in oil ponds. The number of birds said ing the heuristic elicitation paradigm intro-
to die each year was varied across groups. duced earlier: Some respondents were asked
The WTP responses were completely insen- to imagine that they saw a television pro-
sitive to this variable; the mean WTPs for gram documenting the effect of adverse eco-
saving 2000, 20,000, or 200,000 birds were logical circumstances on individual mem-
$80, $78, and $88, respectively. bers of different species. The respondents
A straightforward interpretation of this indicated, for each species, how much con-
result involves the two acts of substitution cern they expected to feel while watching
that characterize prototype heuristics. The such a documentary. The correlation be-
deaths of numerous birds are first repre- tween this measure of affect and willingness
sented by a prototypical instance – perhaps to pay, computed across species, was .97.
an image of a bird soaked in oil and drown-
ing. The prototype automatically evokes
Duration Neglect in the Evaluation
an affective response, and the intensity of
of Experiences
that emotion is then mapped onto the dol-
lar scale – substituting the readily accessi- We next discuss experimental studies of the
ble heuristic attribute of affective intensity global evaluation of experiences that extend
over some time, such as a pleasant or a a simple average of the peak affect recorded
horrific film clip (Fredrickson & Kahne- during a film and the end affect reported as
man, 1 993 ), a prolonged unpleasant noise the film was about to end. This has been
(Schreiber & Kahneman, 2000), pressure called the peak/end rule. However, the cor-
from a vise (Ariely, 1 998), or a painful med- relation between retrospective evaluations
ical procedure (Redelmeier & Kahneman, and the duration of the films was negligible –
1 996). Participants in these studies provided a finding that Fredrickson and Kahneman la-
a continuous or intermittent report of hedo- beled duration neglect. The resemblance of
nic or affective state, using a designated scale duration neglect to the neglect of scope and
of momentary affect (Figure 1 2.3 ). When base rate is striking and unlikely to be ac-
the episode had ended, they indicated a cidental. In this analysis, all three are mani-
global evaluation of “the total pain or dis- festations of extension neglect caused by the
comfort” associated with the entire episode. use of a prototype heuristic.
We first examine the normative rules that The peak/end rule and duration neglect
apply to this task. The global evaluation of have both been confirmed on multiple oc-
a temporally extended outcome is an exten- casions. Figure 1 2.3 presents raw data from
sional attribute, which is governed by a dis- a study reported by Redelmeier and Kahne-
tinctive logic. The most obvious rule is tem- man (1 996), in which patients undergoing
poral monotonicity: There is a compelling colonoscopy reported their current level of
intuition that adding an extra period of pain pain every 60 seconds throughout the proce-
to an episode of discomfort can only make dure. Here again, an average of peak and end
it worse overall. Thus, there are two ways pain quite accurately predicted subsequent
of making a bad episode worse – making global evaluations and choices. The duration
the discomfort more intense or prolonging of the procedure varied considerably among
it. It must therefore be possible to trade off patients (from 4 to 69 minutes), but these
intensity against duration. Formal analyses differences were not reflected in subsequent
have identified conditions under which the global evaluations in accord with duration
total utility of an episode is equal to the neglect. The implications of these psycho-
temporal integral of a suitably transformed logical rules of evaluation are paradoxical. In
measure of the instantaneous utility associ- Figure 1 2.3 , for example, it appears evident
ated with each moment (Kahneman, 2000a; that patient B had a worse colonoscopy than
Kahneman, Wakker, & Sarin, 1 997). patient A (on the assumption they used the
Next, we turn to the psychology. scale similarly). However, it is also appar-
Fredrickson and Kahneman (1 993 ) proposed ent that the peak/end average was worse
a “snapshot model” for the retrospective for patient A, whose procedure ended at
evaluation of episodes, which again involves a moment of relatively intense pain. The
two acts of substitution: First, the episode is peak/end rule prediction for these two pro-
represented by a prototypical moment; next, files is that patient A would evaluate the
the affective value attached to the represen- procedure more negatively than patient B
tative moment is substituted for the exten- and would be more likely to prefer to un-
sional target attribute of global evaluation. dergo a barium enema rather than a repeat
The snapshot model was tested in an exper- colonoscopy. The prediction was correct for
iment in which participants provided con- these two individuals and confirmed by the
tinuous ratings of their affect while watch- data of a large group of patients.
ing plotless films that varied in duration and The effects of substantial variations of du-
affective value (e.g., fish swimming in coral ration remained small (although statistically
reefs, pigs being beaten to death with clubs), robust) even in studies conducted in a fac-
and later reported global evaluations of their torial design. Figure 1 2.2(d) is drawn from a
experiences. The central finding was that the study of responses to ischemic pain (Ariely,
retrospective evaluations of these observers 1 998), in which duration varied by a factor of
were predicted with substantial accuracy by 4. The peak/end average accounted for 98%
Patient A unimportant . . . people may be aware of

duration and consider it important in the
8 abstract [but] what comes most readily to
7 mind in evaluating episodes are the salient
moments of those episodes and the affect
6
Pain Intensity
associated with those moments. Duration

5 neglect might be overcome, we suppose, by
4 drawing attention more explicitly to the
3 attribute of time. (p. 5 4)
2
This comment applies equally well to
1 other instances of extension neglect: The ne-
0 glect of base rate in categorical prediction,
0 10 20
the neglect of scope in willingness to pay, the
Time (minutes)
neglect of sample size in evaluations of ev-
idence (Griffin & Tversky, 1 992; Tversky &
Patient B Kahneman, 1 971 ), and the neglect of prob-
ability of success in evaluating a program of
8
species preservation (DeKay & McClelland,
7
1 995 ). More generally, inattention plays a
Pain Intensity
6
similar role in any situation in which the in-
5 tuitive judgments generated by system 1 vio-
4 late rules that would be accepted as valid by
3 the more deliberate reasoning that we asso-
2 ciate with system 2. As we noted earlier, the
1 responsibility for these judgmental mishaps
0 is properly shared by the two systems: Sys-
0 10 20
tem 1 produces the initial error, and system
Time (minutes)
2 fails to correct it, although it could.
Figure 1 2 .3. Pain intensity reported by two
colonoscopy patients.
Violations of Dominance
The conjunction fallacy observed in the
of the systematic variance of global evalua-
Linda problem is an example of a domi-
tions in that study and for 88% of the vari-
nance violation in judgment: Linda must be
ance in a similar factorial study of responses
at least as likely to be a bank teller as to
to loud unpleasant sounds [Schreiber & Kah-
be a feminist bank teller, but people be-
neman, 2000, Figure 1 2.2(b)]. Contrary to
lieve the opposite. Insensitivity to extension
the normative standard for an extensional at-
(in this case, base rate) effectively guaran-
tribute, the effects of duration and of other
tees the existence of such dominance viola-
determinants of evaluation were additive
tions. For another illustration, consider the
[Figures 1 2.2(b) and 1 2.2(d)].
question: “How many murders were there
The participants in these studies were
last year in [Detroit/Michigan]?” Although
well aware of the relative duration of their
there cannot be more murders in Detroit
experiences and did not consciously de-
than in Michigan, because Michigan con-
cide to ignore duration in their evalua-
tains Detroit, the word “Detroit” evokes a
tions. As Fredrickson and Kahneman (1 993 )
more violent image than the word “Michi-
noted, duration neglect is an attentional
gan” (except of course for people who im-
phenomenon:
mediately think of Detroit when Michigan
. . . duration neglect does not imply is mentioned). If people use an impres-
that duration information is lost, nor sion of violence as a heuristic and neglect
that people believe that duration is geographic extension, their estimates of
murders in the city may exceed their esti- ishing pain. In a replication, Schreiber and
mates for the state. In a large sample of Uni- Kahneman (2000, experiment 2) exposed
versity of Arizona students, this hypothesis participants to pairs of unpleasant noises in
was confirmed – the median estimate of the immediate succession. The participants lis-
number of murders was 200 for Detroit and tened to both sounds and chose one to be re-
1 00 for Michigan. peated at the end of the session. The “short”
Violations of dominance akin to the con- noise lasted 8 seconds at 77 db. The “long”
junction fallacy have been observed in sev- noise consisted of the short noise plus an
eral other experiments involving both indi- extra period (of up to 24 seconds) at 66 db
rect (between-subjects) and direct tests. In a (less aversive, but still unpleasant and cer-
clinical experiment reported by Redelmeier, tainly worse than silence). Here again, the
Katz, and Kahneman (2001 ), half of a large longer noise was preferred most of the time,
group of patients (N = 682 ) undergoing a and this unlikely preference persisted over a
colonoscopy were randomly assigned to a series of five choices.
condition that made the actual experience The violations of dominance in these di-
strictly worse. Unbeknownst to the patient, rect tests are particularly surprising because
the physician deliberately delayed the re- the situation is completely transparent. The
moval of the colonoscope for approximately participants in the experiments could eas-
1 minute beyond the normal time. The in- ily retrieve the durations of the two experi-
strument was not moved during the extra pe- ences between which they had to choose,
riod. For many patients, the mild discomfort but the results suggest that they simply
of the added period was an improvement ignored duration. A simple explanation is
relative to the pain than they had just ex- that the results reflect “choosing by liking”
perienced. For these patients, of course, pro- (see Frederick, 2002). The participants in
longing the procedure reduced the peak/end the experiments simply followed the nor-
average of discomfort. As expected, retro- mal strategy of choice: “When choosing be-
spective evaluations were less negative in tween two familiar options, consult your ret-
the experimental group, and a 5 -year follow- rospective evaluations and choose the one
up showed that participants in that group that you like most (or dislike least).” Lik-
were also somewhat more likely to comply ing and disliking are products of system 1 ,
with recommendations to undergo a repeat which do not conform to the rules of ex-
colonoscopy (Redelmeier, Katz, & Kahne- tensional logic. System 2 could have inter-
man, 2001 ). vened, but in these experiments it generally
In an experiment that is directly analo- did not. Kahneman et al. (1 993 ) described a
gous to the demonstrations of the conjunc- participant in their study, who chose to re-
tion fallacy, Kahneman et al. (1 993 ) exposed peat the long cold-pressor experience. Soon
participants to two cold-pressor experiences, after the choice was recorded, the partic-
one with each hand: a “short” episode (im- ipant was asked which of the two expe-
mersion of one hand in 1 4 ◦ C water for riences was longer. As he correctly identi-
60 seconds), and a “long” episode (the short fied the long trial, the participant was heard
episode, plus an additional 3 0 seconds during to mutter “the choice I made doesn’t seem
which the water was gradually warmed to to make much sense.” Choosing by liking
1 5 ◦ C). The participants indicated the inten- is a form of mindlessness (Langer, 1 978),
sity of their pain throughout the experience. which illustrates the casual governance of
When they were later asked which of the system 2.
two experiences they preferred to repeat, Like the conjunction fallacy in direct
a substantial majority chose the long trial. tests, which we discussed earlier, violations
These choices violate dominance, because of temporal monotonicity in choices should
after 60 seconds in cold water anyone will be viewed as an expendable flourish. Be-
prefer the immediate experience of a warm cause the two aversive experiences occurred
towel to 3 0 extra seconds of slowly dimin- within a few minutes of each other and
respondents could accurately recall the dura- others (see also Kahneman, Ritov, &
tion of the two events, system 2 had enough Schkade, 1 999). We attribute extension ne-
information to override choosing by liking. glect and violations of dominance to a lazy
Its failure to do so is analogous to the fail- system 2, and to a prototype heuristic that
ures observed in direct tests of the Linda combines two processes of system 1 : the rep-
problem. In both cases, the violations of resentation of categories by prototypes and
dominance tell us nothing new about sys- the substitution of a nonextensional heuris-
tem 1 ; they only illustrate an unexpected tic attribute for an extensional target at-
weakness of system 2. Just as the theory of tribute. We also propose that people have
intuitive categorical prediction would have some appreciation of the role of extension
remained intact if the conjunction fallacy in the various judgment tasks. Consequently,
had not “worked” in a direct test, the model they will incorporate extension in their judg-
of evaluation by moments would have sur- ments when their attention is drawn to this
vived even if violations of dominance had factor – most reliably in factorial experi-
been eliminated in highly transparent situa- ments, and sometimes (although not always)
tions. The same methodological issues arise in direct tests. The challenge for compet-
in both contexts. Between-subjects experi- ing interpretations is to provide a unified ac-
ments or subtle tests are most appropriate count of the diverse phenomena that have
for studying the basic intuitive evaluations of been considered in this section.
system 1 , and also most likely to reveal com-
plete extension neglect. Factorial designs in
which extension is manipulated practically
guarantee an effect of this variable, and al- Conclusions and Future Directions
most guarantee that it will be additive, as
in Figures 1 2.2(b) and 1 2.2(d) (Ariely, 1 998; The original goal of the heuristics and biases
Ariely, Kahneman, & Loewenstein, 2000; program was to understand intuitive judg-
Schreiber & Kahneman, 2000). Finally, al- ment under uncertainty. Heuristics were de-
though direct choices sometimes yield sys- scribed as a collection of disparate cognitive
tematic violations of dominance, these vio- procedures, related only by their common
lations can be avoided by manipulations that function in a particular judgmental domain –
prompt system 2 to take control. choice under uncertainty. It now appears,
In our view, the similarity of the re- however, that judgment heuristics are ap-
sults obtained in diverse contexts is a com- plied in a wide variety of domains and share
pelling argument for a unified interpreta- a common process of attribute substitution,
tion, and a significant challenge to critiques in which difficult judgments are made by
that pertain only to selected subsets of this substituting conceptually or semantically re-
body of evidence. A number of commenta- lated assessments that are simpler and more
tors have offered competing interpretations readily accessible.
of base rate neglect (Cosmides & Tooby, The current treatment explicitly ad-
1 996; Koehler, 1 996), insensitivity to scope dresses the conditions under which intu-
in WTP (Kopp, 1 992), and duration ne- itive judgments are modified or overridden.
glect (Ariely & Loewenstein, 2000). How- Although attribute substitution provides an
ever, these interpretations are generally spe- initial input into many judgments, it need
cific to a particular task and would not carry not be the sole basis for them. Initial impres-
over to analogous findings in other domains. sions are often supplemented, moderated, or
Similarly, the various attempts to explain the overridden by other considerations, includ-
conjunction fallacy as an artifact do not ex- ing the recognition of relevant logical rules
plain analogous violations of dominance in and the deliberate execution of learned al-
the cold-pressor experiment. The account gorithms. The role of these supplemental or
we have offered is, in contrast, equally ap- alternative inputs depends on characteristics
plicable to all three contexts and possibly of the judge and the judgment task.
Our use of the dual-process terminology ments that are made in a split second. Thus,
does not entail a belief that every mental one need not be committed, a priori, to as-
operation (including each postulated heuris- signing a process to a particular system; the
tic) can be definitively assigned to one sys- data will dictate the best characterization.
tem or the other. The placement of di- The two-system model is a framework
viding lines between “systems” is arbitrary that combines a set of empirical generaliza-
because the bases by which we characterize tions about cognitive operations with a set
mental operations (difficulty of acquisition, of tests for diagnosing the types of cognitive
accessibility to introspection, and disrupt- operations that underlie judgments in spe-
ability) are all continua. However, this does cific situations. The generalizations and the
not make distinctions less meaningful; there specific predictions are testable and can be
is broad agreement that mental operations recognized as true or false. The framework
range from rapid, automatic, perception-like itself will be judged by its usefulness as a
impressions to deliberate computations that heuristic for research.
apply explicit rules or external aids.
Many have questioned the usefulness
of the notion of heuristics and biases by Acknowledgments
pointing to inconsistencies in the degree to
which illusions are manifested across differ-
This chapter is a modified version of a
ent studies. However, there is no mystery
chapter by Kahneman and Frederick (2002).
here to explain. Experimental studies of “the
Preparation of this chapter was supported by
same” cognitive illusions can yield different
grant SES-021 3 481 from the National Sci-
results for two reasons: (1 ) because of vari-
ence Foundation.
ation in factors that determine the accessi-
bility of the intuitive illusion, and (2) be-
cause they vary in factors that determine the
accessibility of the corrective thoughts that Note
are associated with system 2. Both types of
variation can often be anticipated because 1 . The entries plotted in Figure 1 2.1 are averages
of the vast amount of psychological knowl- of multiple judgments, and the correlations are
edge that has accumulated about the differ- computed over a set of judgment objects. It
ent sets of factors that determine the ease should be noted that correlations between av-
erages are generally much higher than corre-
with which thoughts come to mind – from
sponding correlations within the data of indi-
principles of grouping in perception to prin- vidual respondents (Nickerson, 1 995 ). Indeed,
ciples that govern transfer of training in rule group results may even be unrepresentative if
learning (Kahneman, 2003 ). Experimental they are dominated by a few individuals who
surprises will occur, of course, and should produce more variance than others and have
lead to refinements in the understanding of an atypical pattern of responses. Fortunately,
the rules of accessibility. this particular hypothesis is not applicable to
The argument that system 1 will be ex- the experiments of Figure 1 2.1 , in which all re-
pressed unless it is overridden by system 2 sponses were ranks.
sounds circular, but it is not, because empir-
ical criteria can be used to test whether a par-
ticular characterization of the two systems is References
accurate. For example, a feature of the situ-
ation will be associated with system 2 if it is Agnoli, F. (1 991 ). Development of judgmental
shown to influence judgments only when at- heuristics and logical reasoning: Training coun-
tention is explicitly directed to it (through, teracts the representativeness heuristic. Cogni-
say, a within-subjects design). In contrast, a tive Development, 6, 1 95 –21 7.
variable will be associated with system 1 if it Agnoli, F., & Krantz, D. H. (1 989). Suppressing
can be shown to influence even those judg- natural heuristics by formal instruction: The
case of the conjunction fallacy. Cognitive Psy- Birnbaum, M. H. (1 999). How to show that 9 >
chology, 2 1 , 5 1 5 –5 5 0. 221 : Collect judgments in a between-subjects
Anderson, N. H. (1 991 ). Contributions to informa- design. Psychological Methods, 4(3 ), 243 –249.
tion integration theory. Vol. I: cognition. Hillsdale, Birnbaum, M. H., & Mellers, B. A. (1 983 ).
NJ: Erlbaum. Bayesian inference: Combining base rates with
Anderson, N. H. (1 996). A functional theory of opinions of sources who vary in credibility.
cognition. Mahwah, NJ: Erlbaum. Journal of Personality and Social Psychology, 45 ,
Ariely, D. (1 998). Combining experiences over 792–804.
time: The effects of duration, intensity changes Bless, H., Clore, G. L., Schwarz, N., Golisano,
and on-line measurements on retrospective V., Rabe, C., & Wolk, M. (1 996). Mood and
pain evaluations. Journal of Behavioral Decision the use of scripts: Does a happy mood really
Making, 1 1 , 1 9–45 . lead to mindlessness? Journal of Personality and
Ariely, D. (2001 ). Seeing sets: Representation Social Psychology, 71 (4), 665 –679.
by statistical properties. Psychological Science, Bodenhausen, G. V. (1 990). Stereotypes as judg-
1 2 (2), 1 5 7–1 62. mental heuristics: Evidence of circadian vari-
Ariely, D., Kahneman, D., & Loewenstein, G. ations in discrimination. Psychological Science,
(2000). Joint comment on “When does dura- 1 (5 ), 3 1 9–3 22.
tion matter in judgment and decision making?” Brunswik, E. (1 943 ). Organismic achievement
(Ariely & Loewenstein, 2000). Journal of and environmental probability. Psychological
Experimental Psychology: General, 1 2 9, 5 24– Review, 5 0, 25 5 –272.
5 29. Cacioppo, J. T., Priester, J. R., & Berntson,
Ariely, D., & Loewenstein, G. (2000). When does G. G. (1 993 ). Rudimentary determinants of
duration matter in judgment and decision mak- attitudes: II. Arm flexion and extension have
ing. Journal of Experimental Psychology: Gen- differential effects on attitudes. Journal of Per-
eral, 1 2 9, 5 08–5 23 . sonality and Social Psychology, 65 , 5 –1 7.
Ayton, P. (1 998). How bad is human judgment? Chaiken, S., & Trope, Y. (1 999). Dual-process the-
In G. Wright & P. Goodwin (Eds.), Forecasting ories in social psychology. New York: Guilford
with judgment. West Sussex, UK: Wiley. Press.
Bargh, J. A. (1 997). The automaticity of everyday Chapman, G., & Johnson, E. (2002). Incorpo-
life. In R. S. Wyer, Jr. (Ed.), Advances in social rating the irrelevant: Anchors in judgments of
cognition (Vol. 1 0). Mahwah, NJ: Erlbaum. belief and value. In T. Gilovich, D. Griffin, &
Bargh, J. A., Bond, R. N., Lombardi, W. J., & Tota, D. Kahneman (eds.), Heuristics and biases: The
M. E. (1 986). The additive nature of chronic psychology of intuitive judgment (pp. 1 20–1 3 8).
and temporary sources of construct accessibil- New York: Cambridge University Press.
ity. Journal of Personality and Social Psychology, Chase, W. G., & Simon, H. A. (1 973 ). Perception
5 0(5 ), 869–878. in chess. Cognitive Psychology, 4, 5 5 –81 .
Bar-Hillel, M., & Fischhoff, B. (1 981 ). When do Cohen, J. (1 969). Statistical power analysis for the
base-rates affect predictions? Journal of Per- behavioral sciences. San Diego: Academic Press.
sonality and Social Psychology, 41 (4), 671 – Cohen, J. (1 992). A power primer. Psychological
680. Bulletin, 1 1 2 (1 ), 1 5 5 –1 5 9.
Bar-Hillel, M., & Neter, E. (1 993 ). How alike is Cosmides L., & Tooby J. (1 996). Are humans
it versus how likely is it: A disjunction fallacy good intuitive statisticians after all? Rethinking
in probability judgments. Journal of Personality some conclusions from the literature on judg-
and Social Psychology, 41 (4), 671 –680. ment under uncertainty. Cognition, 5 8(1 ), 1 –
Begg, I., Anas, A., & Farinacci, S. (1 992). Disso- 73 .
ciation of processes in belief: Source recollec- DeKay, M. L., & McClelland, G. H. (1 995 ). Prob-
tion, statement familiarity, and the illusion of ability and utility components of endangered
truth. Journal of Experimental Psychology: Gen- species preservation programs. Journal of Ex-
eral, 1 2 1 (4), 446–45 8. perimental Psychology: Applied, 2 , 60–83 .
Begg, I., & Armour, V. (1 991 ). Repetition and Desvouges, W. H., Johnson, F., Dunford, R.,
the ring of truth: Biasing comments. Cana- Hudson, S., Wilson, K., & Boyle, K. (1 993 ).
dian Journal of Behavioural Science, 2 3 , 1 95 – Measuring resource damages with contingent
21 3 . valuation: Tests of validity and reliability.
In Contingent valuation: A critical assessment. European review of social psychology (Vol. 2,

Amsterdam: North Holland. pp. 83 –1 1 5 ). Chichester, UK: Wiley.
Diamond, P. (1 996). Testing the internal consis- Gigerenzer, G., Czerlinski, J., & Martignon,
tency of contingent valuation surveys. Journal L. (2002). How good are fast and frugal
of Environmental Economics and Management, heuristics? In T. Gilovich, D. Griffin, & D.
2 8, 1 5 5 –1 73 . Kahneman (Eds.), Heuristics and biases: The
Dulany, D. E. & Hilton, D. J. (1 991 ). Conver- psychology of intuitive judgment (pp. 5 5 9–5 81 ).
sational implicature, conscious representation, New York: Cambridge University Press.
and the conjunction fallacy. Social Cognition, 9, Gigerenzer, G., & Goldstein, D. G. (1 996).
85 –1 1 0. Reasoning the fast and frugal way: Models
Epstein, S. (1 994). Integration of the cognitive of bounded rationality. Psychological Review,
and the psychodynamic unconscious. American 1 03 (4), 65 0–669.
Psychologist, 49(8), 709–724. Gigerenzer, G., Hell, W., & Blank, H. (1 988). Pre-
Evans, J., Handley, S. J., Perham, N., Over, D. E., sentation and content – The use of base rates
& Thompson, V. A. (2000). Frequency versus as a continuous variable. Journal of Experimen-
probability formats in statistical word prob- tal Psychology – Human Perception and Perfor-
lems. Cognition, 77, 1 97–21 3 . mance, 1 4(3 ), 5 1 3 –5 25 .
Evans, J. St. B. T., & Over, D. E. (1 996). Rationality Gigerenzer, G., & Hoffrage, U. (1 995 ). How
and reasoning. Hove, UK: Psychology Press. to improve Bayesian reasoning without
instruction: Frequency formats. Psychological
Finucane, M. L., Alhakami, A., Slovic, P., &
Review, 1 02 , 684–704.
Johnson, S. M. (2000). The affect heuristic in
judgments of risks and benefits. Journal of Be- Gigerenzer, G., Todd, P. M., & the ABC Group.
havioral Decision Making, 1 3 , 1 –1 7. (1 999). Simple heuristics that make us smart.
New York: Oxford University Press.
Frederick, S. W. (2002). Automated choice
heuristics. In T. Gilovich, D. Griffin, & D. Gilbert, D. (1 989). Thinking lightly about others:
Kahneman (eds.), Heuristics and biases: The Automatic components of the social inference
psychology of intuitive judgment (pp. 5 48–5 5 8). process. In J. Uleman & J. A. Bargh (Eds.),
New York: Cambridge University Press. Unintended thought (pp. 1 89–21 1 ). New York:
Guilford Press.
Frederick, S. W. (2004). (Im)patient, (un)-
adventurous, (in)sensitive, and (im)moral: Cog- Gilbert, D. (1 991 ). How mental systems believe.
nitive (in)ability as a predictor of time preference, American Psychologist, 46(2), 1 07–1 1 9.
risk preference, magnitude sensitivity, and cheat- Gilbert, D. (1 999). What the mind’s not. In S.
ing. Working paper, Massachusetts Institute of Chaiken & Y. Trope (Eds.), Dual-process theo-
Technology, Cambridge, MA. ries in social psychology (pp. 3 –1 1 ). New York:
Frederick, S. W., & Fischhoff, B. (1 998). Scope Guilford Press.
(in)sensitivity in elicited valuations. Risk, Deci- Gould, S. J. (1 991 ). Bully for brontosaurus. Reflec-
sion, and Policy, 3 , 1 09–1 23 . tions in natural history. New York: Norton.
Frederick, S. W., & Nelson, L. (2004). Attribute Griffin, D., & Tversky, A. (1 992). The weighing of
substitution in physical judgments, Working pa- evidence and the determinants of confidence.
per, Massachusetts Institute of Technology, Cognitive Psychology, 2 4(3 ), 41 1 –43 5 .
Cambridge, MA. Hammond, K. R. (1 95 5 ). Probabilistic function-
Fredrickson, B. L., & Kahneman, D. (1 993 ). Du- ing and the clinical method. Psychological Re-
ration neglect in retrospective evaluations of view, 62 , 25 5 –262.
affective episodes. Journal of Personality and So- Hammond, K. R. (1 996). Human judgment and
cial Psychology, 65 (1 ), 45 –5 5 . social policy. New York: Oxford University
Ganzach, Y., & Krantz, D. H. (1 990). The psy- Press.
chology of moderate prediction: I. Experience Heider, F. (1 944). Social perception and phenom-
with multiple determination. Organizational enal causality. Psychological Review, 5 1 , 3 5 8–
Behavior and Human Decision Processes, 47, 3 74.
1 77–204. Hertwig, R., & Gigerenzer, G. (1 999). The ‘con-
Gigerenzer, G. (1 991 ). How to make cognitive junction fallacy’ revisited: How intelligent in-
illusions disappear: Beyond “heuristics and bi- ferences look like reasoning errors. Journal of
ases.” In W. Stroebe & M. Hewstone (Eds.), Behavioral Decision Making, 1 2 (4), 275 –3 05 .
Higgins, E. T., & Brendl, C. M. (1 995 ). Accessi- study in the headline method. Journal of Risk
bility and applicability: Some “activation rules” and Uncertainty, 9, 5 –3 8.
influencing judgment. Journal of Experimental Kahneman, D., Ritov, I., & Schkade, D. (1 999).
Social Psychology, 3 1 , 21 8–243 . Economic preferences or attitude expressions?
Hilton, D. J., & Slugoski, B. R. (2001 ). Conversa- An analysis of dollar responses to public issues.
tional processes in reasoning and explanation. Journal of Risk and Uncertainty, 1 9, 203 –23 5 .
In A. Tesser & N. Schwarz (Eds.), Blackwell Kahneman, D., Schkade, D. A., & Sunstein, C. R.
handbook of social psychology. Vol. 1 : Intrain- (1 998). Shared outrage and erratic awards: The
dividual processes (pp. 1 81 –206). Oxford, UK: psychology of punitive damages. Journal of Risk
Blackwell. and Uncertainty, 1 6, 49–86.
Isen, A. M., Nygren, T. E., & Ashby, F. G. (1 988). Kahneman, D., Slovic, P., & Tversky, A. E. (1 982).
Influence of positive affect on the subjective Judgment under uncertainty: Heuristics and bi-
utility of gains and losses – It is just not worth ases. New York: Cambridge University Press.
the risk. Journal of Personality and Social Psy- Kahneman, D., & Tversky, A. (1 972). Subjective
chology, 5 5 (5 ), 71 0–71 7. probability: A judgment of representativeness.
Jacoby, L. L., & Dallas, M. (1 981 ). On the re- Cognitive Psychology, 3 , 43 0–45 4.
lationship between autobiographical memory Kahneman, D., & Tversky, A. (1 973 ). On the psy-
and perceptual learning. Journal of Experimen- chology of prediction. Psychological Review, 80,
tal Psychology: General, 3 , 3 06–3 40. 23 7–25 1 .
Kahneman, D. (1 986). Valuing environmen- Kahneman, D., & Tversky, A. (1 982). On the
tal goods: An assessment of the contin- study of statistical intuitions. Cognition, 1 1 ,
gent valuation method. In R. Cummings, D. 1 23 –1 41 .
Brookshire, & W. Schulze (Eds.), Valuing envi- Kahneman, D., & Tversky, A. (1 996). On the re-
ronmental goods: An assessment of the contingent ality of cognitive illusions: A reply to Gigeren-
valuation method. Totowa, NJ: Rowman and zer’s critique. Psychological Review, 1 03 , 5 82–
Allanheld. 5 91 .
Kahneman, D. (2000a). Experienced utility and Kahneman, D., & Varey, C. A. (1 990). Propensi-
objective happiness: A moment-based ap- ties and counterfactuals: The loser that almost
proach. In D. Kahneman & A. Tversky (Eds.), won. Journal of Personality and Social Psychol-
Choices, values, and frames. New York: Cam- ogy, 5 9(6), 1 1 01 –1 1 1 0.
bridge University Press and the Russell Sage
Kahneman, D., Wakker, P. P., & Sarin, R. (1 997).
Foundation.
Back to Bentham? Explorations of experienced
Kahneman, D. (2000b). A psychological point of utility. Quarterly Journal of Economics, 1 1 2 ,
view: Violations of rational rules as a diagnostic 3 75 –405 .
of mental processes (commentary on Stanovich Keysar, B. (1 989). On the functional equivalence
and West). Behavioral and Brain Sciences, 2 3 , of literal and metaphorical interpretations in
681 –683 . discourse. Journal of Memory and Language, 2 8,
Kahneman, D. (2003 ). A perspective on judg- 3 75 –3 85 .
ment and choice: Mapping bounded rational- Koehler, J. (1 996). The base-rate fallacy reconsi-
ity. American Psychologist, 5 8, 697–720. dered: Descriptive, normative, and method-
Kahneman, D., & Frederick, (2002). In T. ological challenges. Behavioral and Brain Sci-
Gilovich, D. Griffin, & D. Kahneman (Eds.), ences, 1 9, 1 –5 3 .
Heuristics of intuitive judgment. New York: Kopp, R. (1 992). Why existence value should be
Cambridge University Press. used in cost–benefit analysis. Journal of Policy
Kahneman, D., Fredrickson, D. L., Schreiber, Analysis and Management, 1 1 , 1 23 –1 3 0.
C. A., & Redelmeier, D. A. (1 993 ). When more Krosnick, J. A., Li, F., & Lehman, D. R. (1 990).
pain is preferred to less: Adding a better end. Conversational conventions, order of informa-
Psychological Science, 4, 401 –405 . tion acquisition, and the effect of base rates and
Kahneman, D., & Miller, D. T. (1 986). Norm individuating information on social judgment.
theory: Comparing reality with its alternatives. Journal of Personality and Social Psychology, 5 9,
Psychological Review, 93 , 1 3 6–1 5 3 . 1 1 40–1 1 5 2.
Kahneman, D., & Ritov, I. (1 994). Determinants Langer, E. J. (1 978). Rethinking the role of
of stated willingness to pay for public goods: A thought in social interaction. In Harvey, Ickes,
& Kidd (Eds.), New directions in attribution re- Redelmeier, D., Katz, J., & Kahneman, D. (2001 ).
search (Vol. 2). Potomac, MD: Erlbaum. Memories of colonoscopy: A randomized trial.
Macchi, L. (1 995 ). Pragmatic aspects of the base- Redelmeier, D., & Kahneman, D. (1 996). Pa-
rate fallacy. Quarterly Journal of Experimental tients’ memories of painful medical treatments:
Psychology, 48A, 1 88–207. Real-time and retrospective evaluations of
Mandler, G., Hamson, C., & Dorfman, J. (1 990). two minimally invasive procedures. Pain, 66,
Tests of dual process theory – Word priming 3 –8.
and recognition. Quarterly Journal of Experi- Schreiber, C., & Kahneman, D. (2000). Deter-
mental Psychology, 42 (4), 71 3 –73 9. minants of the remembered utility of aver-
Margolis, H. (1 987). Patterns, thinking & cognition. sive sounds. Journal of Experimental Psychology:
Chicago: University of Chicago Press. General, 1 2 9, 27–42.
McFadden, D., & Leonard, G. K. (1 993 ). Issues Schwarz, N. (1 996). Cognition and communi-
in the contingent valuation of environmental cation: Judgmental biases, research methods, and
goods: Methodologies for data collection and the logic of conversation. Mahwah, NJ: Erlbaum.
analysis. In Hausman (Ed.), Contingent valua- Schwarz, N., Bless, H., Strack, F., Klumpp, G.,
tion. A critical assessment. Amsterdam: North- Rittenauer-Schatka, H., & Simons, A. (1 991 ).
Holland. Ease of retrieval as information: Another look
Mellers, B., Hertwig, R., & Kahneman, D. (2001 ). at the availability heuristic. Journal of Personal-
Do frequency representations eliminate con- ity and Social Psychology, 61 , 1 95 –202.
junction effects? An exercise in adversarial col- Schwarz, N., & Clore, G. L. (1 983 ). Mood, mis-
laboration. Psychological Science, 1 2 , 269–275 . attribution, and judgments of well-being: In-
Michotte, A. (1 963 ). The perception of causality. formative and directive functions of affective
New York: Basic Books. states. Journal of Personality and Social Psychol-
Mitchell, R., & Carson, R. (1 989). Using surveys ogy, 45 (3 ), 5 1 3 –5 23 .
to value public goods: The contingent valuation Schwarz, N., Strack, F., Hilton, D., & Naderer, G.
method. Washington, DC: Resources for the (1 991 ). Base rates, representativeness, and the
Future. logic of conversation: The contextual relevance
Nickerson, C. (1 995 ). Does willingness-to-pay of “irrelevant” information. Social Cognition, 9,
reflect the purchase of moral satisfaction? A re- 67–84.
consideration of Kahneman and Knetsch. Jour- Schwarz, N., & Vaughn, L. A. (2002). The avail-
nal of Environmental Economics and Manage- ability heuristic revisited: Ease of recall and
ment, 2 8, 1 26–1 3 3 . content of recall as distinct sources of infor-
Nisbett, R. E., Krantz, D. H., Jepson, C., & Kunda, mation. In T. Gilovich, D. Griffin, & D. Kahne-
Z. (1 983 ). The use of statistical heuristics in man (Eds.), Heuristics & biases: The psychology
everyday inductive reasoning. Psychological Re- of intuitive judgment (pp. 1 03 –1 1 9). New York:
view, 90(4), 3 3 9–3 63 . Cambridge University Press.
Novemsky, N., & Kronzon, S. (1 999). How are Shweder, R. (1 977). Likeness and likelihood in
base-rates used, when they are used: A compar- everyday thought – Magical thinking in judg-
ison of Bayesian and additive models of base- ments about personality. Current Anthropology,
rate use. Journal of Behavioral Decision Making, 1 8(4), 63 7–65 8.
1 2 , 5 5 –69. Simon, H. A., & Chase, W. G. (1 973 ). Skill in
Ordóñez, L., & Benson, L., III. (1 997). Decisions chess. American Scientist, 61, 3 94–403 .
under time pressure: How time constraint af- Sloman, S. A. (1 996). The empirical case for
fects risky decision making. Organizational Be- two systems of reasoning. Psychological Bulletin,
havior and Human Decision Processes, 71 (2), 1 1 9, 3 –22.
1 21 –1 40. Sloman, S. A. (2002). Two systems of reason-
Parducci, A. (1 965 ). Category judgment: A range- ing. In T. Gilovich, D. Griffin, & D. Kahne-
frequency model. Psychological Review, 72 , man (Eds.), Heuristics & biases: The psychology
407–41 8. of intuitive judgment (pp. 3 79–3 96). New York:
Pashler, H. E. (1 998). The psychology of attention. Cambridge University Press.
Cambridge, MA: MIT Press. Slovic, P., Finucane, M., Peters, E., & MacGre-
Poulton, (1 989). Bias in quantifying judgments. gor, D. G. (2002). The affect heuristic. In T.
London: Erlbaum. Gilovich, D. Griffin, & D. Kahneman (Eds.),
Heuristics & biases: The psychology of intu- probability. Cognitive Psychology, 5 (2), 207–
itive judgment (pp. 3 97–420). New York: Cam- 23 2.
bridge University Press. Tversky, A., & Kahneman, D. (1 974). Judgment
Stanovich, K. E. (1 999). Who is rational?: Studies under uncertainty: Heuristics and biases. Sci-
of individual differences in reasoning. Mahwah, ence, 1 85 (41 5 7), 1 1 24–1 1 3 1 .
NJ: Erlbaum. Tversky, A., & Kahneman, D. (1 982). Judgments
Stanovich, K. E., & West, R. (2002). Individual of and by representativeness. In D. Kahneman,
differences in reasoning: Implications for the P. Slovic, & A. Tversky (Eds.), Judgment under
rationality debate? In T. Gilovich, D. Griffin, uncertainty: Heuristics and biases (pp. 84–98).
& D. Kahneman (Eds.), Heuristics & biases: New York: Cambridge University Press.
The psychology of intuitive judgment (pp. 421 – Tversky, A., & Kahneman, D. (1 983 ). Extensional
440). New York: Cambridge University vs. intuitive reasoning: The conjunction fallacy
Press. in probability judgment. Psychological Review,
Stevens, S. S. (1 95 7). On the psychophysical law. 90, 293 –3 1 5 .
Psychological Review, 64, 1 5 3 –1 81 . Tversky, A., & Kahneman, D. (1 986). Rational
Stevens, S. S. (1 975 ). Psychophysics: Introduction choice and the framing of decisions. Journal of
to its perceptual, neural, and social prospects. Business, 5 9, S25 1 –S278.
New York: Wiley. Wason, P. C. (1 960). On the failure to elimi-
Strack, F., Martin, L. L., & Schwarz, N. (1 988). nate hypotheses in a conceptual task. Quarterly
Priming and communication: The social deter- Journal of Experimental Psychology, 1 2 , 1 29–
minants of information use in judgments of life- 1 40.
satisfaction. European Journal of Social Psychol- Zajonc, R. B. (1 980). Feeling and thinking: Pref-
ogy, 1 8, 429–442. erences need no inferences. American Psychol-
Tversky, A., & Kahneman, D. (1 971 ). Belief in the ogist, 3 5 (2), 1 5 1 –1 75 .
law of small numbers. Psychological Bulletin, 76, Zajonc, R. B. (1 997). Emotions. In D. T. Gilbert,
1 05 –1 1 0. S. T. Fiske, & G. Lindzey (Eds.), Handbook of
Tversky, A., & Kahneman, D. (1 973 ). Availa- social psychology (4th ed., pp. 5 91 –63 2). New
bility: A heuristic for judging frequency and York: Oxford University Press.
CHAPTER 1 3
Motivated Thinking
Daniel C. Molden
E. Tory Higgins
At one time or another, every one of us has ment. More recently however, there has
engaged in “wishful thinking,” or “let our been a sharp increase in attempts to achieve
hearts influence our heads.” That is, every a marriage between these two viewpoints in
one of us has felt the effects of our motiva- a wide variety of research areas. The primary
tions on our thought processes. Given this objective of this chapter is to review these
common everyday experience, it is not sur- attempts and to demonstrate how it is not
prising that an essential part of early psy- only possible, but also desirable, to reintro-
chological research was the idea that drives, duce motivational approaches to the study
needs, desires, motives, and goals can proof basic thought processes. We begin by pro-
foundly influence judgment and reasoning. viding some historical background on such
More surprising is that motivational vari- approaches.
ables play only a small role in current the-
ories of reasoning. Why might this be?
One possible explanation is that since the
cognitive revolution in the 1 960s and 1 970s, A Brief History of Motivated Thinking
researchers studying motivational and cogni-
tive processes have been speaking somewhat Motivational perspectives on thought and
different languages. That is, there has been reasoning originated most prominently with
a general failure to connect traditional moti- Freud’s (1 905 ) clinical theorizing on the
vational concepts, such as drives or motives, psychodynamic conflicts created by uncon-
to information processing concepts, such as scious drives and urges. These perspectives
expectancies or spreading activation, which quickly spread to other areas of psychology.
form the foundation for nearly all contem- Early pioneers of experimental social psy-
porary research on thinking and reasoning. chology gave primary emphasis to motiva-
For a time, this led not only to misunder- tional variables such as drives, goals, and as-
standing but also to conflict between moti- pirations (e.g., Allport, 1 920; Lewin, 1 93 5 ).
vational and cognitive perspectives on judg- The study of personality came to involve the
2 95
identification and classification of different This chapter provides an overview of this

types of needs and motives (e.g., Murray, “second generation” of research on moti-
1 93 8). Even research on sensory and percep- vated thinking and discusses some of the
tual processes was influenced by a motiva- larger principles that have emerged from the
tional approach with the emergence of the study of the motivation/cognition interface.
“New Look” school (e.g., McGinnies, 1 949). We consider two general classes of motiva-
After this early period of growth and tional influences; the first involves people’s
expansion, however, research and theory desires for reaching certain types of outcomes
on motivated thinking became quite con- in their judgments, and the second involves
troversial. With the ascendance of cogni- people’s desires to use certain types of strate-
tive perspectives on thinking and reason- gies while forming their judgments. In so do-
ing in the 1 960s and 1 970s, many supposed ing, we adopt a rather broad focus and dis-
instances of motivated reasoning were re- cuss several different varieties of motivated
cast as merely a product of imperfect infor- thinking. Given space constraints, this broad
mation processing by imperfect perceivers focus necessitates being selective in the phe-
(compare Bruner, 1 95 7, with McGinnies, nomena to be described. We have chosen
1 949; Festinger, 1 95 7, with Bem, 1 967; those programs of research that we believe
Bradley, 1 978, with Nisbett & Ross, 1 980). are representative of the larger literature and
The various “motivation versus cognition” are especially relevant not only to the study
debates that subsequently developed contin- of reasoning but also to other areas in cog-
ued off and on for years before they were nitive psychology.1 After reviewing the sep-
declared not only unwinnable but also coun- arate influences on thinking of outcome-
terproductive. An uneasy armistice was de- and strategy-based motivations, we conclude
clared (Tetlock & Levi, 1 982) that effec- by suggesting potential directions for future
tively quieted the public conflict but did research, giving special attention to circum-
nothing to reconcile the deep conceptual stances in which multiple sources of motiva-
differences that still remained between re- tion might operate simultaneously.
searchers favoring cognitive or motivational
perspectives.
Following this period of conflict, enthu-
siasm for questions concerning motivational Outcome-Motivated Thinking
influences on thinking was dampened in the
1 970s and early 1 980s. Beginning in the late The most prominent approach to motivated
1 980s, however, there was a resurgence of reasoning, in both classic and contemporary
interest in this area (for recent reviews and perspectives, has been to examine the influ-
overviews, see, Dunning, 1 999; Gollwitzer ence on people’s thought processes of their
& Bargh, 1 996; Higgins & Molden, 2003 ; needs, preferences, and goals to reach desired
Kruglanski, 1 996; Kunda, 1 990; Sorrentino outcomes (or avoid undesired outcomes).
& Higgins, 1 986). One reason for this new Although the types of preferred outcomes
life is that instead of revisiting debates about that have been studied are highly diverse,
the workings of motivational versus cog- they can be divided into two general classes:
nitive processes, researchers began to ex- directional outcomes and nondirectional out-
amine the important interactions between comes (see Kruglanski, 1 996; Kunda, 1 990).
these two processes. Thus, more recent Individuals who are motivated by directional
investigations have focused on the iden- outcomes are interested in reaching specific
tification of principles that describe the desired conclusions, such as impressions of
interface between motivation and cogni- themselves as intelligent, caring, and worthy
tion, and the implications of this interface people (e.g., Dunning, 1 999; Pyszczynski &
for thinking, reasoning, and judgment (see Greenberg, 1 987), or positive beliefs about
Kruglanski, 1 996; Kunda, 1 990; Higgins & others whom they find likeable or to whom
Molden, 2003 ). they are especially close (e.g., Murray, 1 999).
motivated thinking 2 97
In contrast, individuals who are motivated different cognitive processes, including attri-
by nondirectional outcomes have more gen- bution, evaluation of evidence, information
eral concerns, such as reaching the most search, recall and knowledge activation, and
accurate conclusion possible (e.g., Fiske & the organization of concepts in memory.
Neuberg, 1 990) or making a clear and con-
cise decision (e.g., Kruglanski & Webster,
1 996), whatever this conclusion or decision effects on attribution
may be. Some of the first evidence for the effects
Whether outcome motivation is direc- on reasoning of motivations for positive self-
tional or nondirectional, however, this moti- evaluation grew out of work on attribution
vation has been conceptualized as affecting (see Kelley, 1 973 ). Early attributional re-
thought and reasoning in the same way: by search found that when people were ex-
directing people’s cognitive processes (e.g., plaining their performance on tasks measur-
their recall, information search, or attribu- ing important abilities they tended to take
tions) in ways that help to ensure they reach responsibility for their success (i.e., cite in-
their desired conclusions. That is, individu- ternal and stable causes, such as “I’m good
als’ preferences for certain outcomes are beat this task”) and to deny responsibility for
lieved to often shape their thinking so as to their failure (i.e., cite external and unstable
all but guarantee that they find a way to be- causes, such as “I was unlucky”). Such find-
lieve, decide, and justify whatever they like. ings were typically described as stemming
In this section, we review several programs from desires for positive beliefs about the
of research that have more closely examined self (for a review, see Bradley, 1 978).
the specific mechanisms by which this can The motivational nature of these find-
occur – first in relation to motivations for ings was questioned, however. Several re-
directional outcomes and then in relation searchers (e.g., Nisbett & Ross, 1 980) ar-
to motivations for nondirectional outcomes. gued that although one’s attributions may
Following this, we discuss several limitations sometimes be biased, this does not neces-
of the effects of outcome motivation on rea- sarily imply that motivational forces are at
soning and identify circumstances in which work (e.g., previous expectancies for success
these motivations are most likely to have could lead people to label an unexpected
an impact. failure as unusual or unlucky). Yet, subse-
quent research has found that, although peo-
ple’s expectancies do play a role in these
Influences of Directional
attributional effects, there is substantial ev-
Outcome Motivation
idence that motivation plays an important
Overall, the kinds of phenomena that have role as well (see Kunda, 1 990; Pyszczynski
been studied most extensively in research on & Greenberg, 1 987).
motivated thinking involve directional out- One type of evidence for the role of
come preferences (i.e., individuals’ desires to motivation in self-serving attributions is
reach specific conclusions about themselves that, independent of expectancies from prior
and others; for reviews, see Dunning, 1 999; success or failure, the more personally im-
Kunda, 1 990; Murray, 1 999; Pyszczynski & portant a success is in any given situation,
Greenberg, 1 987). Although a variety of out- the stronger is the tendency to claim respon-
comes have been investigated, people’s well- sibility for this success but to deny responsi-
documented preference for viewing them- bility for failure (Miller, 1 976). Another type
selves, and those close to them, in a generally of evidence is that people’s attributions be-
positive manner (see Baumeister, 1 998) has, come increasingly self-serving when success
by far, received the most attention. This out- or failure feedback is experienced as highly
come is the primary focus here.2 In the next arousing. For instance, Gollwitzer, Earle, and
sections, we review several effects of desires Stephan (1 982) had participants first com-
for positive self-evaluation involving many plete an intelligence test, then vigorously
ride a stationary bicycle while the test was positive rate). Those who believed TAA
being scored (increasing their arousal), and had positive health consequences, however,
finally, receive feedback about succeeding judged the test to be highly valid regardless
or failing on the test. Feedback was given of its reliability (see also Doosje, Spears, &
1 minute, 5 minutes, or 9 minutes after riding Koomen, 1 995 ).
the bicycle. Both those receiving feedback An example of the second type of in-
after 9 minutes, who were no longer aroused, fluence can be found in a study by Kunda
and those receiving feedback after 1 minute, (1 987). Participants read a scientific article
who were aroused but still associated this reporting that caffeine consumption was re-
arousal with the exercise, showed only lated to serious health problems in women.
small attributional differences following suc- Afterward, women (but not men) who were
cess versus failure feedback. In contrast, heavy caffeine consumers reported that the
those receiving feedback after 5 minutes, article was less convincing than women who
who were still aroused but no longer asso- were light caffeine consumers. In a follow-
ciated this with the exercise, misattributed up study in which people read a similar
their arousal to the feedback concerning article that revealed caffeine caused only
the test and showed a strong tendency to mild health problems, there was no rela-
credit their ability for success and blame tion between their evaluation of the ar-
bad luck for failure (see also Stephan & ticle and their caffeine consumption. Be-
Gollwitzer, 1 981 ). cause, in both studies, people’s reasoning
was altered only when there was a signif-
icant threat to the self, this demonstrates
effects on evidence evaluation the motivational nature of these results (see
Similar to these attribution effects, more re- also Beauregard & Dunning, 1 998; Ditto
cent research has found that motivations for et al., 1 998).
positive self-evaluations also influence the Similar effects of people’s desire to view
way in which people evaluate information themselves positively have also been demon-
that either supports or contradicts these pos- strated in domains that do not directly in-
itive self-evaluations. In general, individuals volve health consequences. For instance,
tend to (1 ) give more credence to, and be people who encounter scientific research
more optimistic about, the validity of infor- that appears to support their cherished at-
mation that supports or confirms their stand- titudes describe this research as being bet-
ing as kind, competent, and healthy people; ter conducted, and its conclusions as being
and (2) be more skeptical and cautious about more valid, than those who encounter the
information that threatens this standing. same research but believe it to be in conflict
An example of the first type of influence with their cherished attitudes (e.g., Lord,
can be found in a study by Ditto, Scepansky, Ross, & Lepper, 1 979). In addition, people
Munro, Apanovitch, and Lockhart (1 998). have been shown to engage in considerable
Individuals were “tested” for the presence of counterfactual thinking (i.e., mentally un-
a fictitious enzyme in the body, TAA, and doing the present state of affairs by imag-
everyone was told that they had tested pos- ining “if only . . .”; see Roese, 1 997) when
itive. Half of the people were informed that evidence supporting predictions from a pre-
this had positive health consequences, and ferred theory or worldview fails to materi-
half were informed that this had negative alize. Such counterfactual thinking allows
health consequences. Those who believed them to generate ways in which they were
TAA had negative health consequences were almost correct. However, when evidence is
largely dismissive of the test when told it consistent with their theories, these same
was slightly unreliable (i.e., had a 1 0% false- individuals do not engage in counterfactual
positive rate) and judged the result to be thinking, which would force them to gener-
only somewhat more valid when told the test ate ways in which they were almost wrong
was highly reliable (i.e., had a .05 % false- (Tetlock, 1 998).
effects on information search sires for positive views of themselves (and

The motivational influences discussed thus certain well-liked others) have also been
far center on the quality of people’s in- found to influence their use of stored knowl-
formation processing during reasoning (e.g., edge in memory such as the selective ac-
biased attributions, more or less critical tivation of concepts and recall of events
evaluations). However, desires for positive that support these views. These phenom-
self-evaluations also affect the quantity of ena are exemplified in a series of studies by
people’s information processing (Kruglan- Santioso, Kunda, and Fong (1 990). Partici-
ski, 1 996). Specifically, such desires moti- pants in these studies read fictitious articles
vate decreased processing and quick accep- revealing that either introverts or extroverts
tance of favorable evidence and increased tend to have more academic and professional
processing and hesitant acceptance of unfa- success. Following this, individuals who be-
vorable evidence. As one example, Ditto and lieved that introversion was linked to success
colleagues (Ditto & Lopez, 1 992; Ditto et al., were more likely to recall, and were faster
1 998) demonstrated that, compared with to recall, autobiographical instances of intro-
evaluating favorable evidence, when people verted behaviors than extroverted behaviors.
evaluate unfavorable evidence they spend a The opposite pattern of results was found for
greater amount of time examining this evi- individuals who believed that extroversion
dence and spontaneously generate more al- was linked to success.
ternate hypotheses about why it might be More recent work has demonstrated that,
unreliable (see also Pyszczynski & Green- in addition to creating selective recall, direc-
berg, 1 987). Moreover, they have also shown tional outcome motivation can also lead to
that individuals who are prevented from the reconstruction of previous memories. For
putting this extra cognitive effort into the instance, McDonald and Hirt (1 997) showed
examination of unfavorable evidence (e.g., people a videotape of a fellow college stu-
participants who are placed under cognitive dent who was portrayed as either likeable
load ) return evaluations that are substan- or unlikable. They then provided some ad-
tially less critical. ditional information about the target, in-
Additional evidence of increased infor- cluding his midterm scores in several classes.
mation processing of information that is in- Later, when the target’s scores on his final
consistent with preferred conclusions comes exams were revealed, those who found the
from Chaiken and colleagues (Giner-Sorolla target likeable remembered some of the tar-
& Chaiken, 1 997; Liberman & Chaiken, get’s midterm scores as lower than they ac-
1 992). In one experiment, for example, peo- tually were in order to make the final scores
ple read scientific reports claiming that there more consistent with improvement. In con-
was either a strong link or a weak link trast, those who found the target unlikable
between caffeine consumption and signi- remembered some of the midterm scores as
ficant health risks similar to the Kunda higher than they actually were in order to
(1 987) studies discussed earlier. As before, make the final scores more consistent with
the group of women who were the most decline (see also Conway & Ross, 1 984).
threatened by this information were the least Finally, besides influencing explicit recall,
convinced by the reports. In addition, the motivations to reach specific preferred con-
study found that the most threatened group clusions also influence more implicit pro-
of participants also expended the most ef- cesses, such as knowledge activation and
fort to find flaws in the studies described and accessibility. In one demonstration of this
identified the most weaknesses. (Sinclair & Kunda, 1 999), individuals either
received positive or negative feedback from
a person who was a member of multiple
effects on recall and knowledge activation social categories. One of these social cate-
In addition to affecting the appraisal and gories (doctor) was associated with mostly
encoding of new information, people’s de- positive stereotypes and another (African
American) was associated with mostly neg- rated only those traits that were central to
ative stereotypes. Those who had received their own orientation (e.g., determined in the
positive feedback from the other person former case versus dependable in the latter)
were faster than baseline to identify doctor- as more prototypical of successful leaders
related words and slower than baseline to (see also Kunda, 1 987). In another study, in-
identify African American–related words on dividuals rated their own characteristics as
a lexical-decision task. Those who had re- more prototypical of positive qualities such
ceived negative feedback showed a reverse as intelligence but as less prototypical of neg-
pattern of activation (see also Spencer, Fein, ative qualities such as aloofness.
Wolfe, Hodgson, & Dunn, 1 998; for a re- Use of the third strategy can be seen
versal of these effects when people are in another series of experiments by Dun-
motivated by egalitarian rather than self- ning and his colleagues (Beauregard & Dun-
serving outcomes, see Moskowitz, Goll- ning, 1 998; Dunning & Cohen, 1 992; see also
witzer, Wasel, & Schaal, 1 999). Alicke et al., 1 997). Participants in these ex-
periments were asked to judge the abilities
effects on organization of concepts of others in several domains (e.g., math, ath-
in memory letics). When participants themselves were
Finally, beyond affecting the activation of highly skilled in the domain they were con-
knowledge from memory, motivation for di- sidering or had just experienced a relevant
rectional outcomes can also influence the personal success, they set higher perfor-
way in which people come to organize this mance standards for others. That is, to dis-
knowledge. The most widely studied exam- tinguish their own superiority, they judged
ple of this concerns how desires for positive others as less successful. However, when par-
self-evaluation lead people to form stronger ticipants themselves were not highly skilled
associations between their self-concepts and in the domain they were considering or had
attributes that they feel are praiseworthy or just experienced a relevant personal failure,
related to success. Three primary strategies they set lower performance standards for
by which people accomplish this have been others. That is, to cast those outperform-
identified: (1 ) altering one’s self-concept ing them as relatively high achievers, they
to include attributes that are believed to judged them as more successful.
bring about successful outcomes (e.g., Klein In sum, motivations for directional out-
& Kunda, 1 992; Kunda & Santioso, 1 989); comes can affect basic cognitive processes
(2) coming to view the attributes that one and influence thinking in several profound
already possesses as essential for success- ways. These types of motivations affect not
ful outcomes (Dunning, Leuenberger, & only how people search for, evaluate, and ex-
Sherman, 1 995 ; Dunning, Perie, & Story, plain information in the world around them
1 991 ; Kunda, 1 987); and (3 ) redefining the but also how they activate, access, and orga-
criteria that must be met before one can nize their knowledge about themselves and
be considered successful or in possession others. The next section reviews research in-
of particular positive and negative qualities dicating that motivations for nondirectional
(Beauregard & Dunning, 1 998; Dunning & outcomes can be equally important.
Cohen, 1 992; see also Alicke, LoSchiavo,
Zerbst, & Zhang, 1 997).
Influences of Nondirectional
The second two strategies are of particu-
Outcome Motivation
lar relevance to the issue of knowledge or-
ganization. Use of the second strategy can Although less research exists concerning the
clearly be seen in a program of research by cognitive effects of nondirectional outcome
Dunning and his colleagues (Dunning et al., motivation, several varieties have been con-
1 995 ; Dunning et al., 1 991 ). In one study, sidered in some depth (e.g., Cacioppo, Petty,
people who considered themselves either Feinstein, & Jarvis, 1 996; Fiske & Neuberg,
more goal-oriented or more people-oriented 1 990; Kruglanski & Webster, 1 996; Lerner &
motivated thinking 301
Tetlock, 1 999). Among these, the two most there is evidence to suggest that they can
prominent are desires for accuracy (Fiske & also be influenced by accuracy and closure
Neuberg, 1 990) and desires for clarity and motivations.
conciseness, or closure (Kruglanski & Web- In one study, Tetlock (1 985 ) had par-
ster, 1 996). Here, we consider the effects of ticipants read an essay either supporting
these two motivations (which, as will be dis- or opposing affirmative action that had os-
cussed, often have opposing effects on in- tensibly been written by someone from a
formation processing) on many of the same previous experiment. They were then in-
cognitive processes examined in the previ- formed that the author of the essay had been
ous section. assigned to take this position by the exper-
Before beginning, however, it should be imenter and asked to judge the extent to
noted that both accuracy and closure mo- which the arguments presented in the essay
tivation have been operationalized in multi- reflected the author’s own attitude. People
ple ways. For example, motivations for accu- who were not provided with any additional
racy have been studied in terms of wanting to motivations displayed the typical fixation on
know as much as possible about a person on a single cause. These individuals reported
whom one is going to be dependent (Neu- that the position taken in the supportive es-
berg & Fiske, 1 987), feelings of accountabil- say could be explained by the positive atti-
ity for one’s judgments (e.g., Tetlock, 1 983 ), tude of the author toward affirmative action,
a “fear of invalidity” (e.g., Kruglanski & Fre- whereas the position taken in the oppos-
und, 1 983 ), and simple desires to be as cor- ing essay could be explained by the nega-
rect as possible (e.g., Neuberg, 1 989). Mo- tive attitude of the author toward affirma-
tivations for closure have been examined in tive action despite knowing that both essays
terms of feelings of time pressure (Kruglan- had been largely coerced by the experi-
ski & Freund, 1 983 ), a desire to quickly menter. However, people who were moti-
complete judgment tasks that are dull and vated to make accurate judgments (by in-
unattractive (Webster, 1 993 ), and desires forming them that they would later be
to escape noisy environments (Kruglanski, discussing the reasons for their impressions
Webster, & Klem, 1 993 ; see Kruglanski & with the experimenter) did consider the al-
Webster, 1 996). In the initial discussion pre- ternative cause represented by the experi-
sented, each of these varieties of accuracy or menter’s coercion. These individuals judged
closure motivation are treated as equivalent; the attitude of the author to be neutral re-
some important differences among the ef- gardless of which essay they read. A study
fects of these various operationalizations are by Webster (1 993 ) using a similar paradigm
considered at the end. showed that, in contrast, when participants’
motivation for closure was increased, the
typical fixation on a single cause became
effects on attribution even more pronounced. Thus, a need for ac-
In addition to self-serving biases that oc- curacy and a need for closure appear to have
cur when people explain their own perfor- opposite effects on people’s considerations
mance, as described previously, research on of alternate causes during attribution (see
attribution has also identified more general Kruglanski & Freund, 1 983 ; Kruglanski &
biases. For example, there is the tendency for Webster, 1 996).
people to fixate on one particular cause for
some action or event and then fail to ade- effects on evidence evaluation and
quately consider alternative causes that are information search
also possible (see Gilbert & Malone, 1 995 ; As discussed earlier, research on directional
see also Buehner & Cheng, Chap. 7; Kahne- outcome motivation has demonstrated that
man & Frederick, Chap. 1 2). Although these people engage in increased evidence eval-
attributional biases have been largely con- uations and prolonged information search
sidered from a purely cognitive standpoint, when encountering evidence unfavorable
to their preferred self-views and reduced to the participants in the study, supported
evidence evaluation and information search a different verdict for each member of the
when encountering evidence favorable to pair. Participants with high (versus low) clo-
their preferred self-views. In contrast, ac- sure motivation attempted to bring about a
curacy motivation produces prolonged in- quick end to the discussion. Moreover, when
formation search, and closure motivation asked before the discussion, they expressed a
produces reduced information search, re- strong preference for a partner who could be
gardless of the circumstances. easily persuaded to their existing viewpoint,
This consequence of accuracy motivation and once the discussion began, they stub-
is evident in a study by Neuberg (1 989), bornly attempted to convince their partner
where people were asked to conduct a tele- to see things their way rather than consider-
phone interview with a peer but were given ing alternative arguments.
unfavorable expectations concerning the in-
terviewee. Those participants who were in- effects on evaluation complexity
structed to “form the most accurate impres-
In addition to affecting the length of peo-
sions possible” of the other person spent
ple’s analysis and evaluation of evidence,
more time listening and provided more op-
nondirectional outcome motivation can also
portunities for the interviewee to elaborate
influence the complexity of this analysis.
his or her opinions. This in turn prevented
Accuracy-motivated individuals form judg-
their unfavorable expectations from creat-
ments that show greater consideration of
ing negative final impressions of the inter-
conflicting opinions and evidence, whereas
viewee, which is what occurred with those
closure-motivated individuals form judg-
participants who were not given any special
ments that show less of this type of con-
instructions for the interview.
sideration. Tetlock and colleagues demon-
Similar consequences of accuracy moti-
strated these effects in experiments in which
vation are also seen in research by Chaiken
participants were asked to write down their
and colleagues (for reviews, see Chen &
thoughts about topics such as affirmative ac-
Chaiken, 1 999; Eagly & Chaiken, 1 993 ).
tion, American foreign policy, and the causes
For example, in one study by Maheswaran
of certain historical events (for a review, see
and Chaiken (1 991 ), participants evaluated
Lerner & Tetlock, 1 999). Responses were
a product based on a detailed review that
then coded for their integrative complexity,
described this product more favorably or
which was defined in terms of the degree
less favorably than similar products. Partic-
to which multiple perspectives on an issue
ipants who were high in accuracy motiva-
were both identified and then integrated into
tion, because they believed their evaluations
a framework that included complex con-
would have important consequences gener-
nections between them. Findings with peo-
ated more thoughts about the strengths and
ple who were both novices and experts on
weaknesses of the specific product-quality
the issues they were analyzing (i.e., college
arguments that were listed in the review
students and professional historians, respec-
than did those who were low in accuracy
tively) indicated that those with increased
motivation. This again attenuated any ef-
accuracy motivation provided a more in-
fects of people’s prior expectations on their
tegratively complex analysis (e.g., Tetlock,
final evaluations.
1 983 ), whereas those with increased clo-
The consequences of closure motiva-
sure motivation provided a less integratively
tion on evidence evaluation and information
complex analysis (Tetlock, 1 998).
search has been shown in several studies by
Kruglanski et al. (1 993 ). People were paired
with someone else for a discussion about effects on recall and knowledge activation
the verdict of a mock trial. Before the dis- Whereas directional outcome motivation
cussion, everyone received a summarized le- was seen earlier to have qualitative ef-
gal analysis of the case which, unbeknownst fects on recall and knowledge activation,
nondirectional outcome motivation has less, participants based their evaluations on

largely quantitative effects. Once again, ac- whichever one of these concepts was most
curacy motivation and closure motivation accessible to a greater extent when their
have opposite influences. closure motivation was high but to a lesser
In an investigation of accuracy moti- extent when their accuracy motivation was
vation on recall during impression forma- high (Ford & Kruglanski, 1 995 ; Thompson
tion, Berscheid and colleagues found that et al., 1 994). These effects have been found
when people observed interviews involv- both when people are making online judg-
ing individuals with whom they might later ments (Kruglanski & Freund, 1 983 ; Schuette
be paired, they paid more attention to & Fazio, 1 995 ) and when they are recon-
the interview and remembered more infor- sidering previously encountered information
mation about the interviewees than when (Sanbonmatsu & Fazio, 1 990; Thompson
they did not expect any future interactions et al., 1 994).
(Berscheid, Graziano, Monson, & Dermer, Overall, then, motivations for nondirec-
1 976; see also Srull, Lichtenstein, & Roth- tional outcomes can also affect basic cog-
bart, 1 985 ). However, in studies of closure nitive processes and profoundly influence
motivation and impression formation, indi- thinking. Whereas motivations for direc-
viduals with chronically high (versus low) tional outcomes were earlier shown to alter
need for closure spent less time reading dif- how people activate, evaluate, and explain
ferent pieces of behavioral information they information during reasoning, motivations
were given about a target and later recalled for nondirectional outcomes (at least in
fewer of these behaviors (Dijksterhuis, van terms of the accuracy and closure moti-
Knippenberg, Kruglanski, & Schaper, 1 996). vations reviewed here) instead alter how
There is also evidence that people with high much activation, evaluation, or explanation,
(versus low) accuracy motivation activate in fact, occurs. Furthermore, as the findings
more pieces of individuating trait and behav- presented here illustrate, such quantitative
ioral information when forming impressions differences in thought can often affect the
of others (Kruglanski & Freund, 1 983 ; Neu- outcomes of people’s judgments and deci-
berg & Fiske, 1 987), whereas people with sions just as much as the qualitative differ-
high (versus low) need for closure display ences described previously.3
an increased tendency to rely solely on cat-
egorical information during impression for- Limits to Outcome-Motivated Thinking
mation (Dijksterhuis et al., 1 996; Kruglanski
& Freund, 1 983 ; see also Moskowitz, 1 993 ). Although, so far, people have been shown to
Similar effects are found for the use have an impressive array of cognitive mech-
of highly accessible knowledge structures anisms at their disposal when attempting
or attitudes in judgment. In typical cir- to reach desired conclusions, limits do ex-
cumstances, concepts or attitudes that have ist concerning when these mechanisms are
been recently or frequently activated will applied. These limits are first described for
lead people to assimilate their judgments directional outcome-motivated thinking and
to this highly accessible information with- then for nondirectional outcome-motivated
out considering any additional information thinking.
(see Fazio, 1 995 ; Higgins, 1 996). Increased
accuracy motivation can attenuate assimila- reality constraints on motivations for
tion effects by increasing the activation of al- directional outcomes
ternative interpretations, whereas increased Although there are often specific outcomes,
closure motivation can exacerbate assimila- such as positive self-views, that people
tion effects by decreasing the activation of al- have some preference for during judgment,
ternative interpretations. For example, when most individuals still acknowledge there
evaluating the behavior of a target person is some kind of “objective reality” about
who was ambiguously adventurous or reck- whatever information they are considering.
That is, motivated thinking related to direc- punctuality or gullibility), people engage in
tional outcomes operates within what Kunda less motivated reasoning (Dunning et al.,
(1 990) has called reality constraints (see also 1 989). Overall, these results suggest that
Pyszczynski & Greenberg, 1 987; cf. Kruglan- thinking and reasoning inspired by direc-
ski, 1 999). Therefore, although there is a tional outcomes do not so much lead peo-
degree to which people adjust their defini- ple to ignore the sometimes disappointing
tions of success, engage in selective recall, reality they face because it inspires them to
or seek to criticize unfavorable evidence, exploit the uncertainties that exist in this re-
this does not make them entirely unrespon- ality to their favor.
sive to world around them, except perhaps
in extreme circumstances (see Bachman & cognitive-resource constraints on
Cannon, Chap. 21 ). accuracy motivation
Indeed, evidence for this principle of re- Virtually all the effects of accuracy motiva-
ality constraints has been repeatedly found tion reviewed here involve increases in the
in the context of the research previously total amount of information processing that
described. For example, a study using a people perform during judgment. There-
paradigm discussed earlier, in which partic- fore, in circumstances in which one’s abil-
ipants first learned that introverts or extro- ity to engage in this information processing
verts were generally more successful before is constrained, the effects of increased accu-
rating themselves on these traits, was per- racy motivation should be minimal (Fiske &
formed using participants who had been pre- Neuberg, 1 990). One demonstration of this
selected as having high trait levels of either was provided by Pendry and Macrae (1 994).
introversion or extroversion (Santioso et al., As described earlier, accuracy-motivated in-
1 990). Although beliefs that one trait was dividuals who were forming an impression
more beneficial than the other increased ev- of a target displayed an increased use of
eryone’s self-ratings concerning that trait, individuating trait and behavioral informa-
demonstrating motivated reasoning, there tion when they possessed their full infor-
was also a large effect of people’s chronic dis- mation processing resources (see Neuberg &
positions. Introverts’ ratings of themselves, Fiske, 1 987). However, accuracy-motivated
were always more introverted than extro- individuals whose processing resources were
verts’ ratings of themselves, no matter how depleted based their impression primarily
beneficial the introverts believed the trait of on categorical information in the same way
extroversion to be. That is, regardless of how as those who had little accuracy motiva-
desirable it would have been, introverts did tion (see also Kruglanski & Freund, 1 983 ).
not suddenly believe themselves to be extro- In addition, Sanbonmatsu and Fazio (1 990)
verts and vice versa. showed that the influence of accuracy moti-
Another example of the influence of vation in reducing people’s assimilation of
reality constraints is that people’s think- their judgments to highly accessible atti-
ing is guided by their preferred outcomes tudes disappears when people are placed un-
to a much greater extent in situations of der time pressure, which prevents extended
uncertainty (e.g., Dunning, Meyerowitz, & information processing.
Holtzberg, 1 989; Hsee, 1 995 ). When there is
more potential for constructing idiosyncratic does motivation for accuracy result in
criteria for a certain judgment (e.g., judging accurate reasoning?
whether one possess somewhat vague traits Another important consideration of the ef-
such as sensibility or insecurity), then peo- fects of accuracy motivation on thinking and
ple use this opportunity to select criteria that reasoning is that even when people high in
allow them to reach their desired conclu- accuracy motivation are free to engage in
sion. However, when there is less potential extended information processing this does
for this construction (e.g., judging whether not guarantee that they will arrive at more
one possesses more precise traits such as accurate judgments. One obvious example
of this situation is finding that evidence be- of this motivation and the larger context in
yond what is immediately and effortlessly which it exists.
available does not exist or has faded from
memory (see, e.g., Thompson et al., 1 994). the influence of information availability on
In an another manifestation, people are af- closure motivation
fected by certain biases outside their aware- Certain qualifications must also be noted in
ness or are aware of such biases but un- the effects of closure motivation. All the
aware of what the proper strategy is to findings discussed so far have involved the
correct them. In all these circumstances, al- tendency for people with increased closure
though accuracy motivation might increase motivation to quickly assimilate their judg-
information search, recall, and considera- ments to readily available or highly acces-
tion of multiple interpretations, it would not sible information, leading to an early “freez-
be expected to eliminate judgment errors ing” of their information search. However, in
(Fischhoff, 1 982), and might even increase situations in which little information is avail-
them (Pelham & Neter, 1 995 ; Tetlock & able, high closure motivation may inspire ef-
Boettger, 1 989). forts to find something clear and concise to
“seize” upon and increase information search
(see Kruglanski & Webster, 1 996). For exam-
distinctions among circumstances that lead ple, in the Kruglanski et al. (1 993 ) studies
to accuracy motivation
described previously that involved partners
As alluded to earlier, the different types discussing the verdict of a mock trial, people
of accuracy motivation inductions reviewed with high closure motivation preferred eas-
here are not always equivalent and can have ily persuadable partners and were unwilling
markedly different effects. For example, al- to consider alternative arguments only when
though having one’s outcomes dependent on they had enough information at their dis-
another person can increase desires for accu- posal (i.e., a summarized legal analysis) to
racy in diagnosing that person’s true charac- form a clear initial impression. When these
ter (e.g., Neuberg & Fiske, 1 987), in other same individuals were not provided with the
cases such circumstances can produce a de- legal analysis and did not begin the discus-
sire to see a person that one is going to be sion with a clear opinion, they expressed a
depending on in the best possible light (e.g., desire to be paired with someone who was
Berscheid et al., 1 976; Klein & Kunda, 1 992; highly persuasive and shifted toward their
see Kruglanski, 1 996). As another exam- partner’s point of view.
ple, although believing that one’s judgment
has important consequences may motivate
Conclusions on Outcome-Motivated
an accurate consideration of all the relevant
Thinking
evidence, it could also motivate a more gen-
eral need to increase elaborative thinking Recent research has uncovered many po-
that is not necessarily focused on accuracy tential routes by which people’s desires for
(see Footnote 3 ; Petty & Wegener, 1 999). Fi- particular judgment outcomes can affect
nally, although justifying one’s judgments to their thinking and reasoning. To summa-
an audience can motivate accuracy when the rize, both directional outcome motivations,
opinion of the audience is unknown, it can where people have a specific preferred con-
also lead to more directional outcome mo- clusion they are trying to reach, and nondi-
tivation, such as ingratiation toward this au- rectional outcome motivations, where peo-
dience, when the opinion of the audience is ple’s preferred conclusions are more general,
known (Tetlock, 1 983 ; see Lerner & Tetlock, alter many basic cognitive processes during
1 999). Therefore, when attempting to antic- reasoning. These include (1 ) the explana-
ipate the effects of accuracy motivation on tion of events and behaviors; (2) the or-
reasoning in a particular situation, it is im- ganization, recall, and activation of knowl-
portant to consider both the current source edge in memory; and (3 ) the pursuit and
evaluation of evidence relevant to decision Regulatory focus theory distinguishes be-

making. Outcome motivation effects involve tween two basic motivational orientations:
both how such cognitive processes are initi- a promotion focus involving concerns with
ated and directed as well as how thoroughly advancement and approaching gains versus
these processes are implemented. Moreover, avoiding nongains, and a prevention focus
in any given situation the specific cognitive involving concerns with security and ap-
processes influenced by outcome motivation proaching nonlosses versus avoiding losses.
are typically those that aid the gathering and Because it centers on the presence and ab-
interpretation of information supporting the sence of positive outcomes, a promotion fo-
favored outcome. In this self-fulfilling way, cus has been found to create preferences
then, people’s outcome-motivated reason- for eager judgment strategies that empha-
ing often successfully brings about their de- size advancement (or, to use signal detec-
sired conclusions. tion terminology, finding hits) and ensure
against overlooking something that might
be important (or, to again use signal de-
tection terminology, avoiding errors of omis-
Strategy-Motivated Thinking sion). In contrast, because it centers on the
presence and absence of negative outcomes,
Although outcome-motivated thinking has a prevention focus has been found to engen-
been the most widely studied form of moder preferences for vigilant judgment strate-
tivated reasoning, other varieties of motiva- gies that emphasize protection (or making
tional influences on cognition are also pos- correct rejections) and ensure against com-
sible. One alternate perspective that has mitting to something that might be a mis-
more recently emerged and complements take (or avoiding errors of commission; see
an outcome-based view proposes that peo- Higgins & Molden, 2003 ). Therefore, even
ple are motivated not only with respect to in circumstances in which individuals are
the outcomes of their judgments but also pursuing the same outcome, they may show
with respect to the manner in which they marked differences in their pursuit of this
go about making these judgments. That is, outcome depending upon whether they are
not only do people have preferred conclu- currently promotion focused or prevention
sions, but they also have preferred strategies focused. The studies reviewed here are in-
for reaching their conclusions (Higgins & tended to illustrate the effects of eager
Molden, 2003 ; cf. Tyler & Blader, 2000). or vigilant strategic motivation on several
Therefore, independent of whatever out- types of thought processes similar to those
come holds the most interest for them, found to be influenced by outcome moti-
people may be motivated to reach these vation (for a larger overview, see Higgins &
outcomes using strategies that “feel right” Molden, 2003 ).
in terms of, and allow them to sustain,
their current motivational orientation (e.g., effects on the consideration of
eagerly gathering evidence that might sup- alternative hypotheses
port a positive self-view or facilitate cog- Considering alternative hypotheses is a fun-
nitive closure versus vigilantly suppressing damental component of many varieties of
evidence that could undermine a positive thinking (see Sloman & Lagnado, Chap. 5 ).
self-view or threaten cognitive closure). How might eager versus vigilant strategic
Several lines of research have examined preferences influence this process? In gen-
how motivations for particular judgment eral, an eager strategy of considering alter-
strategies can also influence people’s ba- natives would involve attempting to attain
sic cognitive processes. In the vast majority hits and to ensure against errors of omission
of these studies, strategic motivations were by generating and selecting any plausible
measured and manipulated in terms of peo- hypotheses that could remotely be correct.
ple’s regulatory focus (see Higgins, 1 997). However, a vigilant strategy of considering
alternatives would involve attempting to compared with those in a prevention focus

make correct rejections and to ensure against (see Kelley, 1 973 ).
errors of commission by generating and se- Finally, additional research by Molden
lecting only the most probable hypotheses and Higgins (2004) has more recently
that seem likely to be correct. Therefore peo- demonstrated similar effects for eager ver-
ple in a promotion focus would be expected sus vigilant strategic preferences on the gen-
to consider a greater number of alternatives eration and selection of alternatives during
during thinking and reasoning than people basic categorization processes. People were
in a prevention focus. given vague descriptions of a target person
This question was addressed in several from which it was not clear how to cate-
studies by Liberman, Molden, Idson, and gorize him or her correctly, and a number
Higgins (2001 ). One important instance of of alternatives could all have been possible.
considering alternatives occurs when people As before, participants with either a chronic
form hypotheses about what they are per- or experimentally induced promotion focus
ceiving (see Tversky, Chap. 1 0). Therefore, generated more possible categories for the
Liberman et al. (2001 ) examined the effects target than those with either a chronic or
of people’s strategic preferences on a task experimentally induced prevention focus.
where people identified vague and distorted Overall, then, people’s eager versus vig-
objects in a series of photographs. Across ilant strategic preferences play a significant
several studies in which a promotion or pre- role in their generation of alternatives during
vention focus was both measured as an indi- a number of important thought processes.
vidual differences variable and induced ex- Moreover, it is important to note that in all
perimentally, results indicated that those in a the studies described in this section, every-
promotion focus generated a greater number one was pursuing the exact same outcome
of alternatives for the identity of the objects (identifying an object, explaining behaviors)
than those in a prevention focus (see also and did not have motivations for any specific
Crowe & Higgins, 1 997). conclusion or end-state. Furthermore, mea-
In addition to examining the effects of sures of people’s motivations for more gen-
strategic preferences on generating alterna- eral outcomes such as accuracy and closure
tive hypotheses for object perception, Liber- were also taken, and these factors were sta-
man et al. (2001 ) also investigated whether tistically removed from all analyses. There-
similar effects occurred for social percep- fore, the observed effects of promotion or
tion. Participants read a scenario describing prevention motivational orientations are dis-
the helpful behavior of a target person and tinct from the outcome motivation effects
were asked to evaluate several equally plau- reviewed earlier and can be attributed to the
sible alternative explanations for this behav- influences of these orientations on people’s
ior. Consistent with the results described strategic preferences.
previously, participants in a promotion fo-
cus again selected a greater number of al-
ternative explanations than participants in effects on counterfactual thinking
a prevention focus. Moreover, these effects Besides generating and evaluating hypothe-
were also found to influence the general im- ses, another way in which people consider al-
pressions people formed of the target. Af- ternatives during reasoning is in their use of
ter selecting their reasons for the target’s counterfactuals. As briefly mentioned, ear-
helpful behavior, participants predicted how lier counterfactual thinking involves men-
helpfully he or she would behave in the fu- tally undoing the present state of affairs and
ture. Those in a promotion focus, because imagining alternative realities “if only” dif-
they were considering more interpretations ferent decisions had been made or actions
of a target’s behavior, formed more equivo- been taken (Roese, 1 997). Several differ-
cal impressions and showed relatively little ent varieties of counterfactual thinking have
generalization about the target’s behavior as been identified. One broad distinction that
has been made is between thoughts that how much regret they inspired (see Roese
concern the reversal of a previous inaction et al., 1 999). Therefore, the results can again
(e.g., if only I had acted, things might have only be explained in terms of differences in
gone better), or additive counterfactuals, and strategic motivation.
thoughts that concern the reversal of a pre-
vious action (e.g., if only I hadn’t acted, effects on fast versus accurate
things wouldn’t be so bad), or subtractive information processing
counterfactuals. A major focus across many areas of psy-
Because additive counterfactuals simu- chology has been when and why people
late the correction of a past error of omis- choose to emphasize either speed or ac-
sion, this type of thinking represents a more curacy in their thinking and decision mak-
eager strategy of considering alternative real- ing (e.g., Josephs & Hahn, 1 995 ; Zelaznik,
ities. In contrast, because subtractive coun- Mone, McCabe, & Thaman, 1 988). Förster,
terfactuals simulate the correction of a past Higgins, and Bianco (2003 ) more recently
error of commission, this type of thinking investigated whether promotion preferences
represents a more vigilant strategy of con- for strategic eagerness would result in faster
sidering alternate realities. Therefore, a pro- information processing and a higher quan-
motion focus should increase the generation tity of output in a search for possible hits,
of additive counterfactuals, and a preven- whereas prevention preferences for strategic
tion focus should increase the generation of vigilance would result in more accurate in-
subtractive counterfactuals. In line with this, formation processing and a higher quality of
Roese, Hur, and Pennington (1 999) found output in an effort to avoid mistakes.
that, both when analyzing hypothetical ex- Participants were given a task involving
amples and when describing particular in- four pictures taken from a children’s “con-
stances of their own behavior, participants nect the dots” drawing book. For each pic-
who considered promotion-related setbacks ture, the objective was to connect sequen-
(i.e., nongains and missed opportunities for tially numbered dots within a given time
advancement) offered a greater number of period in order to complete the outline of
additive counterfactuals, whereas partici- an image. Participants’ speed on each pic-
pants who considered prevention-related ture was assessed by the highest number
setbacks (i.e., losses and missed opportuni- dot they reached by the end of the time
ties to prevent mistakes) offered a greater period for that picture, and their accuracy
number of subtractive counterfactuals. In on each picture was assessed by the num-
the literature that exists on counterfactual ber of dots they skipped (i.e., that were
thinking, it has been traditionally assumed not connected). Across two studies where
that subtractive counterfactuals are more participants’ promotion or prevention focus
common than additive counterfactuals and was both measured and experimentally in-
that failures associated with action inspire duced, promotion-focused individuals were
more regret than failures associated with in- faster and produced a higher quantity of re-
action (Roese, 1 997). However, the results sponses, whereas prevention-focused indi-
of these studies demonstrate that, in some viduals were more accurate and produced
cases, people’s strategic preferences can re- a higher quality of responses over the entire
sult in additive counterfactuals being more task. Moreover, both of these tendencies in-
common and perhaps being associated with creased in intensity as people moved closer
greater regret (see also Camacho, Higgins, & to goal completion, resulting in stronger
Lugar, 2003 ). effects of strategic preferences toward the
It is important to note that care was taken end of a task than toward the beginning of
to make sure the outcomes that participants a task (i.e., the “goal looms larger” effect
were considering in these studies did not dif- in which motivation increases as one’s dis-
fer across any important dimensions such as tance to the completion of a goal decreases;
how painful they were imagined to be or Lewin, 1 93 5 ). This provides strong support
that people’s motivations for different judg- tion that is more meaningful in the context
ment strategies can alter their concerns with of eager strategic preferences, whereas in-
different aspects of information processing dividuals who were chronically prevention-
(e.g., speed versus accuracy). focused showed a stronger sensitivity and re-
call for loss-related information that is more
meaningful in the context of vigilant strate-
effects on knowledge activation and recall gic preferences.
Analogous to the selective recall and ac-
tivation of information from memory that
Strategic Preferences and Regulatory Fit
occurs in the presence of motivations for
directional outcomes, another influence of Although the studies presented thus far
strategic preferences on thinking is to in- have demonstrated how people’s motiva-
crease sensitivities to, and recall of, informational orientations can lead them to pre-
tion that that is particularly relevant to these fer and choose certain judgment strategies,
preferences. A study by Higgins, Roney, situations may exist in which they may be
Crowe, and Hymes (1 994) demonstrated more or less able to follow these preferences.
this by having participants read an essay For example, some situations may gener-
about the life of a hypothetical target person ally require greater use of eager strategies
in which two different types of situations of pursuing gains or vigilant strategies of
were encountered. In one type of situation, preventing mistakes such as when supervi-
the target used eager strategies that were ad- sors demand either innovative and creative
vancement oriented (e.g., waking up early practices of all their employees in search
in order to be on time for a favorite class), of advancement or cautious and responsi-
whereas in the other type of situation, the ble practices in hope of preventing losses.
target used vigilant strategies that were more What might be the consequences of making
protection oriented (e.g., being careful not judgments and decisions in a way that ei-
to sign up for a class whose schedule con- ther suits one’s current strategic preferences
flicted with a desired activity). Individu- (i.e., promotion-focused individuals using
als who had chronic promotion orientations eager strategies and prevention-focused in-
showed a stronger sensitivity for information dividuals using vigilant strategies) or does
related to advancement versus protection not suit one’s preferences (i.e., promotion-
strategies and later showed greater recall for focused individuals using vigilant strategies
these episodes, whereas individuals who had and prevention-focused individuals using ea-
chronic prevention orientations showed the ger strategies)?
reverse effect. Higgins and colleagues have examined
Another study by Higgins and Tykocin- this question and investigated how the regu-
ski (1 992), which again had people read an latory fit between one’s motivational orien-
essay about the life of a hypothetical tar- tation and the means one uses during goal
get person, extends these findings. In this pursuit affects thinking and reasoning (e.g.,
study, the target person experienced situa- Camacho et al., 2003 ; Freitas & Higgins
tions that either involved the presence or 2002; Higgins, Idson, Freitas, Spiegel, &
absence of gains (finding $20 on the street Molden, 2003 ). Although space limitations
or missing a movie that he or she wanted to prohibit a more thorough review of this
see, respectively) or the presence or absence work here (see Higgins, 2000a; Higgins &
of losses (being stuck in a crowded subway Molden, 2003 ), the general findings have
for an extended period of time or getting been that that the primary consequence of
a day off from a particularly arduous class regulatory fit is to increase the perceived
schedule, respectively). Similar to the previ- value of the goal one is pursuing. That
ous study, individuals who were chronically is, regulatory fit (as compared with nonfit)
promotion focused showed a stronger sen- leads people to “feel right” about their goal
sitivity and recall for gain-related informa- pursuit, which then leads them to (1 ) feel
good while pursuing these goals (i.e., what ing and reasoning above and beyond their
feels right feels good; see Freitas & Higgins, outcome motivations.
2002); (2) experience the outcomes they are
striving for as having more value or worth
(i.e., what feels right is good; see Higgins
et al., 2003 ); and (3 ) believe the strate- General Conclusions and Future
gies they are using are inherently right (i.e., Directions
what feels right is right; see Camacho et al.,
2003 ). Therefore, another avenue for fu- The sheer number and diversity of the stud-
ture research on how people’s motivations ies reviewed here is a testament to the return
to use certain judgment strategies can af- of motivational perspectives on cognition
fect their thought processes is the further to the vanguard of psychology. The rich-
refinement and elaboration of the process of ness and consistency of the findings emerg-
regulatory fit. ing from these studies is also a testament to
the utility of this perspective in the study
of thinking and reasoning. We optimistically
Conclusions on Strategy-Motivated
forecast a further expansion of research in-
Thinking
formed by motivational perspectives and, in
In sum, several emerging programs of re- conclusion, briefly outline two general di-
search are beginning to demonstrate that, rections we believe should be priorities for
beyond the effects on reasoning of people’s the future.
desires for particular judgment outcomes, The first direction involves expanding
there are additional effects on reasoning of current conceptualizations of the ways in
people’s desires to use particular judgment which motivational and cognitive processes
strategies. For example, preferences for ea- interact during judgment. Although there
ger judgment strategies, shown by those with is still much to be learned from examining
promotion concerns, versus preferences for the effects on thinking of people’s motiva-
vigilant judgment strategies, shown by those tions for certain outcomes (either directional
with prevention concerns, alter many basic or nondirectional), there may potentially
cognitive processes during reasoning. These be other important sources of motivated
include (1 ) the generation and testing of hy- thought as well. In this chapter, we reviewed
potheses, (2) the use of counterfactual think- our own initial research on one of these
ing, (3 ) an emphasis on fast versus accurate possible sources – people’s motivations for
processing of information, and (4) knowl- employing preferred strategies during judg-
edge activation and recall. Strategy motiva- ment. We expect that further study will lead
tion effects include whether cognitive pro- to the development of additional perspec-
cesses are implemented in order to advance tives on the interface of motivation and cog-
the right decision and avoid errors of omis- nition that go beyond both motivated out-
sion in judgment or to protect against the comes and motivated strategies.
wrong decision and avoid errors of commis- The second direction involves moving
sion in judgment. They also include whether past research that examines different va-
such implementation fits or does not fit one’s rieties of motivated thinking in isolation
current motivational orientation. The imple- from one another (i.e., studying situations in
mentation of cognitive processes for either which people are only motivated to achieve
of these strategic reasons or for regulatory positive self-views or only motivated to be
fit influences what pieces of information are accurate). There is a need to consider how
considered during judgment and how much multiple goals, desires, and motives inter-
this information is valued in a final decision. act to influence the thought process – that
In this way, then, people’s strategic motiva- is, the effects of patterns of motivational
tions have important effects on their think- forces. For instance, it has been noted for
some time now that people possess many failure, showed strong generalizations fol-
potential objectives when processing in- lowing success and almost no generaliza-
formation (e.g., Chen & Chaiken, 1 999). tion after failure. These results demonstrate
Although it is certainly the case that, at the importance of considering the effects of
times, objectives such as accuracy, ingrati- multiple sources of motivated reasoning si-
ation, or self-enhancement may be predom- multaneously (see also Förster, Higgins, &
inant (Kruglanski, 1 999), it is also true that Strack, 2000).
there are many instances in which several One final way in which investigating the
of these objectives are pursued simultane- cognitive effects of interacting motivational
ously. What happens when people not only forces could be fruitfully expanded is by
want to be accurate but also want to please synthesizing work on how motivation influ-
others or boost their own self-esteem? Stud- ences reasoning with work on how affect in-
ies addressing these questions are just be- fluences reasoning (see Forgas, 2000; Mar-
ginning to appear, and early findings are tin & Clore, 2001 ). Great strides have been
indicating that important interactions can made in determining the mechanisms by
occur (Lundgren & Prislin, 1 998; Nienhuis, which affective and emotional states can al-
Manstead, & Spears, 2001 ; Ruscher, Fiske, & ter people’s judgments. Many of the changes
Schnake, 2000). in the quality and quantity of information
Similarly, although we have made a dis- processing found in this research bear a strik-
tinction between outcome- and strategy- ing resemblance to the motivational effects
motivated thinking and discussed their ef- reviewed here. For example, positive moods
fects independently, there are situations in have generally been found to support less
which these two sources of motivation op- thorough and complex information process-
erate in concert. One of these situations ing, similar to closure motivation, whereas
has been the focus of recent studies by negative moods have generally been found to
Molden and Higgins (2004). These studies support more thorough and complex infor-
examined how preferences for eager ver- mation processing, similar to accuracy mo-
sus vigilant decision strategies influence peo- tivation (for a review, see Schwarz & Clore,
ple’s generation of alternative explanations 1 996). This is not to say, however, that the
for their own success and failure. In addi- effects reviewed here are actually just due
tion to replicating both the previously dis- to changes in emotion, because many of the
cussed self-serving pattern of attributions studies discussed carefully controlled for af-
for performance (an outcome-motivated effective influences and continued to find in-
fect) and the selection of a greater number dependent effects. Therefore, it would be
of alternative attributions by those prefer- fruitful to investigate how affective think-
ring eager strategies over vigilant strategies ing may give rise to motivational thinking
(a strategy-motivated effect), these studies (e.g., Erber & Erber, 2000), and how mo-
showed that self-serving and strategic moti- tivational thinking may give rise to affec-
vations interacted to determine the extent to tive thinking (e.g., Higgins, 2000b), in or-
which people generalized their current ex- der to develop a better understanding of
periences to their future performance. Indi- how these two factors are related and what
viduals using eager strategies, because they their combined and separate consequences
tended to consider multiple attributions, in- might be.
cluding both internal and external causes, In conclusion, this chapter reviewed re-
showed only moderate generalization after search that displays the broad applicability
both success and failure. In contrast, indi- of emerging motivational perspectives to the
viduals using vigilant strategies, because they study of thinking and reasoning. Through
tended to consider only a few attributions, this review, we attempted to convey the po-
including primarily internal causes follow- tential utility of these perspectives and to ad-
ing success but external causes following vocate a greater incorporation of principles
of outcome- and strategy-based motivation thinking are to be revealed, future investiga-

in future research. The further refinement tions of outcome-motivated thinking in dif-
and elaboration of these principles, we be- ferent cultures should take care to identify
lieve, will benefit not only the study of think- which specific outcomes are culturally desirable
ing but also cognitive science in general. in those contexts (e.g., proper fulfillment of
one’s social duties to others, high social sta-
tus relative to others; see, e.g., Endo, Heine, &
Lehman, 2000).
Acknowledgments 3 . Another type of nondirectional outcome mo-
tivation that has been the focus of considerable
study is the need for cognition, or a general desire
Preparation of this chapter was supported
for elaborative thinking and increased cogni-
by NIMH grants F3 1 MH-65 772-01 and
tive activity (Cacioppo et al., 1 996). At times,
MH3 9429. The authors want to thank the need for cognition has been considered
Phoebe Ellsworth for her helpful comments equivalent to accuracy motivation (Chen &
on an earlier draft. Chaiken, 1 999). Consistent with this, research
has shown that an increased need for cognition
can affect thinking in the same way as height-
ened accuracy motivation, reducing biases dur-
Notes ing attribution (D’agostino & Fincher-Kiefer,
1 992), increasing recall (Srull et al., 1 985 ), less-
1 . One area of study that is notably absent in ening assimilation to highly accessible attitudes
this review concerns affective and emotional (Florack, Scarabis, & Bless, 2001 ), and increas-
influences on reasoning. This important and ing information search (Verplanken, 1 993 ; see
extensive literature certainly enjoys a central Cacioppo et al., 1 996). However, at times the
place in the study of motivated thinking. How- effects of the need for cognition differ from
ever, the topic of affect and cognition has re- those of accuracy motivation. Accuracy moti-
cently been the subject of several entire hand- vation, because it inspires a thorough consid-
books on its own (see Forgas, 2000; Martin eration of all available evidence, weakens the
& Clore, 2001 ). Therefore, rather than at- tendency to base judgments on early superfi-
tempt an extremely limited overview of this cial impressions (i.e., primacy effects; Kruglan-
major topic alongside the other topics men- ski & Freund, 1 983 ). In contrast, the need
tioned previously, we instead refer the inter- for cognition, because it simply inspires cog-
ested reader to these other sources. The larger nitive elaboration even if this involves only
relation between research on emotional think- part of the available evidence, can lead to in-
ing and the research described here is discussed creased rumination on one’s early superficial
briefly below. impressions and strengthen primacy effects (see
2. It is important to note that, although a wealth Petty & Wegener, 1 999). Given these concep-
of studies have demonstrated people’s broad tual and empirical distinctions, we have not in-
and robust desires for positive self-evaluation, cluded research on need for cognition in our
these studies have almost exclusively been per- larger review of the effects of accuracy moti-
formed on members of Western, and gener- vation and consider it a separate form of nondi-
ally more individualistic cultures (Baumeister, rectional outcome motivation (for a review
1 998). In contrast, recent evidence collected of need for cognition effects, see Cacioppo
from Eastern, and generally more collectivist et al., 1 996).
cultures, has demonstrated that, in these pop-
ulations, such desires for self-evaluation are of-
ten considerably less and that some of the ef-
fects described here are thereby weaker (see References
Greenfield, Chap. 27). Yet, this should not
be taken to mean that the general effects Alicke, M. D., LoSchiavo, F. M., Zerbst, J. I., &
of outcome-motivated thinking are necessar- Zhang, S. (1 997). The person who outperforms
ily culture specific or only apply to West- me is a genius: Maintaining perceived compe-
ern cultures. Instead, this indicates that, if tence in upward social comparison. Journal of
general principles of this type of motivated Personality and Social Psychology, 73 , 781 –789.
Allport, F. H. (1 920). The influence of the group Dijksterhuis, A., van Knippenberg, A., Kruglan-
upon association and thought. Journal of Exper- ski, A. W., & Schaper, C. (1 996). Motivated
imental Psychology, 3 , 1 5 9–1 82. social cognition: Need for closure effects on
Baumeister, R. F. (1 998). The self. In D. Gilbert, memory and judgment. Journal of Experimen-
S. Fiske, & G. Lindzey (Eds.), The handbook of tal Social Psychology, 3 2 , 25 4–270.
social psychology (4th ed., Vol. 1 , pp. 680–740). Ditto, P. H., & Lopez, D. F. (1 992). Motivated
New York: McGraw-Hill. skepticism: Use of differential decision criteria
Beauregard, K. S., & Dunning, D. (1 998). Turn- for preferred and non-preferred conclusions.
ing up the contrast: Self-enhancement motives Journal of Personality and Social Psychology, 63 ,
prompt egocentric contrast effects in social 5 69–5 84.
judgment. Journal of Personality and Social Psy- Ditto, P. H., Scepansky, J. A., Munro, G. D.,
chology, 74, 606–621 . Apanovitch, A. M., & Lockhart, L. K.
Bem, D. J. (1 967). Self-perception: An alter- (1 998). Motivated sensitivity to preference-
native interpretation of cognitive dissonance inconsistent information. Journal of Personality
phenomena. Psychological Review, 74, 1 83 – and Social Psychology, 75 , 5 3 –69.
200. Doosje, B., Spears, R., & Koomen, W. (1 995 ).
Berscheid, E., Graziano, W., Monson, T., & Der- When bad isn’t all bad: Strategic use of sample
mer, M. (1 976). Outcome dependency: At- information in generalization and stereotyping.
tention, attribution, and attraction. Journal Journal of Personality and Social Psychology, 69,
of Personality and Social Psychology, 3 4, 978– 642–65 5 .
989. Dunning, D. (1 999). A newer look: Motivated so-
Bradley, G. W. (1 978). Self-serving biases in the cial cognition and the schematic representation
attribution process: A reexamination of the fact of social concepts. Psychological Inquiry, 1 0,
or fiction question. Journal of Personality and 1 –1 1 .
Social Psychology, 1 8, 68–88. Dunning, D., & Cohen, G. L. (1 992). Egocentric
Bruner, J. S. (1 95 7). On perceptual readiness. Psy- definitions of traits and abilities in social judg-
chological Review, 64, 1 23 –1 5 2. ment. Journal of Personality and Social Psychol-
ogy, 63 , 3 41 –3 5 5 .
Cacioppo, J. T., Petty, R. E., Feinstein, J. A.,
& Jarvis, W. B. G. (1 996). Dispositional dif- Dunning, D., Leuenberger, A., & Sherman, D. A.
ferences in cognitive motivation: The life and (1 995 ). A new look at motivated inference: Are
times of individuals varying in need for cogni- self serving theories of success a product of mo-
tion. Psychological Bulletin, 1 1 9, 1 97–25 3 . tivational forces? Journal of Personality and So-
Camacho, C. J., Higgins, E. T., & Lugar, L. (2003 ). cial Psychology, 69, 5 8–68.
Moral value transfer from regulatory fit: What Dunning, D., Meyerowitz, J. A., & Holtzberg,
feels right is right and what feels wrong is A. D. (1 989). Ambiguity and self-evaluation:
wrong. Journal of Personality and Social Psychol- The role of idiosyncratic trait definitions in self-
ogy, 84, 498–5 1 0. serving assessments of ability. Journal of Person-
Chen S., & Chaiken, S. (1 999). The heuristic- ality and Social Psychology, 5 7, 1 082–1 090.
systematic model in its broader context. In S. Dunning, D. A., Perie, M., & Story, A. L. (1 991 ).
Chaiken & Y. Trope (Eds.), Dual-process theo- Self-serving prototypes of social categories.
ries in social psychology (pp. 73 –96). New York: Journal of Personality and Social Psychology, 61 ,
Guilford. 95 7–968.
Conway, M., & Ross, M. (1 984). Getting what Eagly, A. H., & Chaiken, S. (1 993 ). The psy-
you want by revising what you had. Journal of chology of attitudes. New York: Harcourt Brace
Personality and Social Psychology, 47, 73 8–748. College.
Crowe, E., & Higgins, E. T. (1 997). Regulatory Endo, Y., Heine, S. J., & Lehman, D. R.
focus and strategic inclinations: Promotion and (2000). Culture and positive illusions in close
prevention in decision-making. Organizational relationships: How my relationships are bet-
Behavior and Human Decision Processes, 69, ter than yours. Personality and Social Psychology
1 1 7–1 3 2. Bulletin, 2 6, 1 5 71 –1 5 86.
D’agostino, P. R., & Fincher-Kiefer, R. (1 992). Erber, M. W., & Erber, R. (2000). The role of mo-
Need for cognition and the correspondence tivated social cognition in the regulation of af-
bias. Social Cognition, 1 0, 1 5 1 –1 63 . fective states. In J. P. Forgas (Ed.), Handbook
of affect and social cognition (pp. 275 –292). Giner-Sorolla, R., & Chaiken, S. (1 997). Selective
Mahwah, NJ: Erlbaum. use of heuristic and systematic processing un-
Fazio, R. H. (1 995 ). Attitudes as object- der defense motivation. Personality and Social
evaluation associations: Determinants, conse- Psychology Bulletin, 2 3 , 84–97.
quences, and correlates of attitude accessibil- Gollwitzer, P. M., & Bargh, J. A. (Eds.).
ity. In R. E. Petty & J. A. Krosnick (Eds.), (1 996). The psychology of action: Linking cog-
Attitude strength: Antecedents and consequences nition and motivation to behavior. New York:
(pp. 247–282). Mahwah, NJ: Erlbaum. Guilford Press.
Festinger, L. (1 95 7). A theory of cognitive disso- Gollwitzer, P. M., Earle, W. B., & Stephan, W. G.
nance. Stanford, CA: Stanford University Press. (1 982). Affect as a determinant of egotism:
Fischhoff, B. (1 982). “Debiasing.” In D. Kahne- Residual excitation and performance attribu-
man, P. Slovic, & A. Tversky (Eds.), Judgment tions. Journal of Personality and Social Psychol-
under uncertainty (pp. 23 7–262). Cambridge, ogy, 43 , 702–709.
MA: Cambridge University Press. Higgins, E. T. (1 996). Knowledge activation:
Fiske, S. T., & Neuberg, S. L. (1 990). A contin- Accessibility, applicability, and salience. In
uum of impression formation, from category- E. T. Higgins & A. W. Kruglanski (Eds.), So-
based to individuating processes: Influences of cial psychology: Handbook of basic principles
information and motivation on attention and (pp. 1 3 3 –1 68). New York: Guilford Press.
interpretation. In M. P. Zanna (Ed.), Advances Higgins, E. T. (1 997). Beyond pleasure and pain.
in experimental social psychology (Vol. 23 , pp. 1 – American Psychologist, 5 2 , 1 280–1 3 00.
74). New York: Academic Press. Higgins, E. T. (2000a). Making a good decision:
Florack, A., Scarabis, M., & Bless, H. (2001 ). Value from “fit.” American Psychologist, 5 5 ,
When do associations matter? The use of auto- 1 21 7–1 23 0.
matic associations toward ethnic groups in per- Higgins, E. T. (2000b). Promotion and preven-
son judgments. Journal of Experimental Social tion experiences: Relating emotions to non-
Psychology, 3 7, 5 1 8–5 24. emotional motivational states. In J. P. Forgas
Ford, T. E., & Kruglanski, A. W. (1 995 ). Effects of (Ed.), Handbook of affect and social cognition
epistemic motivations on the use of accessible (pp. 1 86–21 1 ). Mahwah, NJ: Erlbaum.
constructs in social judgment. Personality and Higgins, E. T., Idson, L. C., Freitas, A. L., Spiegel,
Social Psychology Bulletin, 2 1 , 95 0–962. S., & Molden, D. C. (2003 ). Transfer of value
Forgas, J. P. (Ed.). (2000). Handbook of affect and from fit. Journal of Personality and Social Psy-
social cognition. Mahwah, NJ: Erlbaum. chology, 84, 1 1 40–1 1 5 3 .
Förster, J., Higgins, E. T., & Bianco, A. T. (2003 ). Higgins, E. T., & Molden, D. C. (2003 ). How
Speed/accuracy decisions in task performance: strategies for making judgments and decisions
Built in trade-off of separate strategic con- affect cognition: Motivated cognition revisited.
cerns. Organization Behavior and Human De- In G. V. Bodenhausen & A. J. Lambert (Eds.),
cision Processes, 90, 1 48–1 64. Foundations of social cognition: A festschrift in
Förster, J., Higgins, E. T., & Strack, F. (2000). honor of Robert S Wyer, Jr. (pp. 21 1 –23 6). Mah-
When stereotype disconfirmation is a personal wah, NJ: Erlbaum.
threat: How prejudice and prevention focus Higgins, E. T., Roney, C., Crowe, E., & Hymes, C.
moderate incongruency effects. Social Cogni- (1 994). Ideal versus ought predilections for ap-
tion, 1 8, 1 78–1 97. proach and avoidance: Distinct self-regulatory
Freitas, A. L., & Higgins, E. T. (2002). Enjoying systems. Journal of Personality and Social Psy-
goal-directed action: The role of regulatory fit. chology, 66, 276–286.
Psychological Science, 1 3 , 1 –6. Higgins, E. T., & Tykocinski, O. (1 992). Self-
Freud, S. (1 905 ). Fragments of an analysis of a case discrepancies and biographical memory: Per-
of hysteria. In J. Strachey (Ed.), “The Standard sonality and cognition at the level of psycholog-
Edition of the Complete Psychological Works ical situations. Personality and Social Psychology
of Sigmund Freud.” Standard edition, vol. 7. Bulletin, 1 8, 5 27–5 3 5 .
Macmillan: Hogarth, 1 964. Hsee, C. K. (1 995 ). Elastic justification: How
Gilbert, D. T., & Malone, P. S. (1 995 ). The cor- tempting but task-irrelevant factors influence
respondence bias. Psychological Bulletin, 1 1 7, decisions. Organizational Behavior and Human
21 –3 8. Decision Processes, 62 , 3 3 0–3 3 7.
Josephs, R. A., & Hahn, E. D. (1 995 ). Bias and ac- for attributional functions. Journal of Personal-
curacy in estimates of task duration. Organiza- ity and Social Psychology, 80, 5 –1 8.
tional Behavior and Human Decision Processes, Lord, C. G., Ross, L., & Lepper, M. R. (1 979).
61 , 202–21 3 . Biased assimilation and attitude polarizations:
Kelley, H. H. (1 973 ). The process of causal attri- The effects of prior theories on subsequently
bution. American Psychologist, 2 8, 1 07–1 28. considered evidence. Journal of Personality and
Klein, W. M., & Kunda, Z. (1 992). Motivated per- Social Psychology, 3 7, 2098–21 09.
son perception: Constructing justifications for Lundgren, S. R., & Prislin, R. (1 998). Motivated
desired beliefs. Journal of Experimental Social cognitive processing and attitude change. Per-
Psychology, 2 8, 1 45 –1 68. sonality and Social Psychology Bulletin, 2 4, 71 5 –
Kruglanski, A. W. (1 996). Motivated social 726.
cognition: Principles of the interface. In E. T. Maheswaran, D., & Chaiken, S. (1 991 ). Promot-
Higgins & A. W. Kruglanski (Eds.), So- ing systematic processing in low-motivation
cial psychology: Handbook of basic principles settings: Effect of incongruent information on
(pp. 493 –5 20). New York: Guilford Press. processing and judgment. Journal of Personality
Kruglanski, A. W. (1 999). Motivation, cognition, and Social Psychology, 61 , 1 3 –25 .
and reality: Three memos for the next gen- Martin, L., & Clore, G. C. (Eds.). (2001 ). Theories
eration of research. Psychological Inquiry, 1 0, of mood and cognition. Mahwah, NJ: Erlbaum.
5 8. McDonald, H. E., & Hirt, E. R. (1 997). When ex-
Kruglanski, A. W., & Freund, T. (1 983 ). The pectancy meets desire: Motivational effects in
freezing and unfreezing of lay inferences: Ef- reconstructive memory. Journal of Personality
fects on impression primacy, ethnic stereotyp- and Social Psychology, 72 , 5 –23 .
ing and numerical anchoring. Journal of Exper- McGinnies, E. (1 949). Emotionality and percep-
imental Social Psychology, 1 9, 448–468. tual defense. Psychological Review, 5 6, 244–25 1 .
Kruglanski, A. W., & Webster, D. M. (1 996). Miller, D. T. (1 976). Ego involvement and attri-
Motivated closing of the mind: “Seizing” and butions for success and failure. Journal of Per-
“freezing.” Psychological Review, 1 03 , 263 –283 . sonality and Social Psychology, 3 4, 901 –906.
Kruglanski, A. W., Webster, D. M., & Klem, A. Molden, D. C., & Higgins, E. T. (2004). Preferred
(1 993 ). Motivated resistance and openness to strategies for self-attribution: Motivated inference
persuasion in the presence of absence of prior beyond self-serving biases. Manuscript under re-
information. Journal of Personality and Social view.
Psychology, 65 , 861 –876. Molden, D. C., & Higgins, E. T. (2004). Cate-
Kunda, Z. (1 987). Motivated inference: Self- gorization under uncertainty: Resolving vague-
serving generation and evaluation of causal the- ness and ambiguity with eager versus vigilant
ories. Journal of Personality and Social Psychol- strategies. Social Cognition, 2 2 , 248–277.
ogy, 5 3 , 63 6–647. Moskowitz, G. B. (1 993 ). Individual differences
Kunda, Z. (1 990). The case for motivated reason- in social categorization: The influence of per-
ing. Psychological Bulletin, 1 08, 480–498. sonal need for structure on spontaneous trait
Kunda, Z., & Santioso, R. (1 989). Motivated inferences. Journal of Personality and Social Psy-
change in the self-concept. Journal of Experi- chology, 65 , 1 3 2–1 42.
mental Social Psychology, 2 5 , 272–285 . Moskowitz, G. B., Gollwitzer, P. M., Wasel, W.,
Lerner, J. S., & Tetlock, P. E. (1 999). Accounting & Schaal, B. (1 999). Preconscious control of
for the effects of accountability. Psychological stereotype activation through chronic egalitar-
Bulletin, 1 2 5 , 25 5 –275 . ian goals. Journal of Personality and Social Psy-
Lewin, K. (1 93 5 ). A dynamic theory of personality. chology, 77, 1 67–1 84.
New York: McGraw-Hill. Murray, H. A. (1 93 8). Explorations in personality.
Liberman, A., & Chaiken, S. (1 992). Defensive New York: Oxford University Press.
processing of personally relevant health mes- Murray, S. L. (1 999). The quest for conviction:
sages. Personality and Social Psychology Bulletin, Motivated cognition in romantic relationships.
1 8, 669–679. Psychological Inquiry, 1 0, 23 –3 4.
Liberman, N., Molden, D. C., Idson, L. C., & Hig- Neuberg, S. L. (1 989). The goal of forming accu-
gins, E. T. (2001 ). Promotion and prevention rate impressions during social interactions: At-
focus on alternative hypotheses: Implications tenuating the impact of negative expectancies.
Journal of Personality and Social Psychology, 5 6, Santioso, R., Kunda, Z., & Fong, G. T. (1 990). Mo-
3 74–3 86. tivated recruitment of autobiographical memo-
Neuberg, S. L., & Fiske, S. T. (1 987). Motiva- ries. Journal of Personality and Social Psychology,
tional influences on impression formation: De- 5 9, 229–241 .
pendency, accuracy-driven attention, and in- Schuette, R. A., & Fazio, R. H. (1 995 ). Attitude
dividuating information. Journal of Personality accessibility and motivation as determinants of
and Social Psychology, 5 3 , 43 1 –444. biased processing: A test of the MODE model.
Nienhuis, A. E., Manstead, A. S. R., & Spears, Personality and Social Psychology Bulletin, 2 1,
R. (2001 ). Multiple motives and persuasive 704–71 0.
communication: Creative elaboration as a re- Schwarz, N., & Clore, G. L. (1 996). Feelings
sult of impression motivation and accuracy mo- and phenomenal experiences. In E. T. Higgins
tivation. Personality and Social Psychology Bul- & A. W. Kruglanski (Eds.), Social psychology:
letin, 2 7, 1 1 8–1 3 2. Handbook of basic principles (pp. 43 3 –465 ).
Nisbett, R. E., & Ross, L. (1 980). Human New York: Guilford Press.
inference: Strategies and shortcomings of so- Sinclair, L., & Kunda, Z. (1 999). Reactions to
cial judgment. Englewood Cliffs, NJ: Prentice- a black professional: Motivated inhibition and
Hall. activation and conflicting stereotypes. Journal
Pelham, B. W., & Neter, E. (1 995 ). The effect of of Personality and Social Psychology, 77, 885 –
motivation on judgment depends on the dif- 904.
ficulty of the judgment. Journal of Personality Sorrentino, R. M., & Higgins, E. T. (Eds.). (1 986).
and Social Psychology, 68, 5 81 –5 94. Handbook of motivation and cognition, vols. 1 –3 .
Pendry, L. F., & Macrae, C. N. (1 994). Stereotypes New York: Guilford Press.
and mental life: The case of the motivated but Spencer, S. J., Fein, S., Wolfe, C., Hodgson, H. L.,
thwarted tactician. Journal of Experimental So- & Dunn, M. A. (1 998). Stereotype activation
cial Psychology, 3 0, 3 03 –3 25 . under cognitive load: The moderating role of
Petty, R. E., & Wegener, D. T. (1 999). The self-image threat. Personality and Social Psy-
elaboration likelihood model: Current status chology Bulletin, 2 4, 1 1 3 9–1 1 5 2.
and controversies. In S. Chaiken & Y. Trope Srull, T. K., Lichtenstein, M., & Rothbart, M.
(Eds.), Dual-process theories in social psychology (1 985 ). Associative storage and retrieval pro-
(pp. 41 –72). New York: Guilford Press. cesses in person memory. Journal of Experimen-
Pyszczynski, T., & Greenberg, J. (1 987). Toward tal Psychology: Learning, Memory, and Cogni-
an integration of cognitive and motivational tion, 1 1 , 3 1 6–3 45 .
perspectives on social inference: A biased hy- Stephan, W. F., & Gollwitzer, P. (1 981 ). Affect as
pothesis testing model. In L. Berokowitz (Ed.), a mediator of attributional egotism. Journal of
Advances in experimental social psychology Experimental Social Psychology, 1 7, 443 –45 8.
(Vol. 20, pp. 297–3 40). New York: Academic Tetlock, P. E. (1 983 ). Accountability and com-
Press. plexity of thought. Journal of Personality and
Roese, N. J. (1 997). Counterfactual thinking. Psy- Social Psychology, 45 , 74–83 .
chological Bulletin, 1 2 1 , 1 3 3 –1 48. Tetlock, P. E. (1 985 ). Accountability: A social
Roese, N. J., Hur, T., & Pennington, G. L. (1 999). check on the fundamental attribution error. So-
Counterfactual thinking and regulatory focus: cial Psychology Quarterly, 48, 227–23 6.
Implications for action versus inaction and suf- Tetlock, P. (1 998). Close-call counterfactuals and
ficiency versus necessity. Journal of Personality belief-system defense: I was not almost wrong
and Social Psychology, 77, 1 1 09–1 1 20. but I was almost right. Journal of Personality and
Ruscher, J. B., Fiske, S. T., & Schnake, S. B. Social Psychology, 75 , 63 9–65 2.
(2000). The motivated tactician’s juggling act: Tetlock, P. E., & Boettger, R. (1 989). Account-
Compatible vs. incompatible impression goals. ability: A social magnifier of the dilution effect.
British Journal of Social Psychology, 3 9, 241 – Journal of Personality and Social Psychology, 5 7,
25 6. 3 88–3 98.
Sanbonmatsu, D. M., & Fazio, R. H. (1 990). Tetlock, P. E., & Levi, A. (1 982). Attribution
The role of attitudes in memory-based deci- bias: On the inconclusiveness of the cognition-
sion making. Journal of Personality and Social motivation debate. Journal of Experimental So-
Psychology, 5 9, 61 4–622. cial Psychology, 1 8, 68–88.
Thompson, E. P., Roman, R. J., Moskowitz, G. B., nal of Personality and Social Psychology, 65 , 261 –
Chaiken, S., & Bargh, J. A. (1 994). Systematic 271 .
processing and the debasing of covert primacy Verplanken, B. (1 993 ). Need for cognition and
effects in impression formation: Unshackling external information search: Responses to time
the motivated perceiver from constraints of ac- pressure during decision-making. Journal of Re-
cessibility. Journal of Personality and Social Psy- search in Personality, 2 7, 23 8–25 2.
chology, 66, 474–489. Zelaznik, H. N., Mone, S., McCabe, G. P., &
Tyler, T. R., & Blader, S. L. (2000). Cooperation in Thaman, C. (1 988). Role of temporal and spa-
groups: Procedural justice, social identity, and be- tial precision in determining the nature of
havioral engagement. Philadelphia: Psychology the speed-accuracy trade-off in aimed-hand
Press. movement. Journal of Experimental Psychology:
Webster, D. M. (1 993 ). Motivated augmentation Human Perception and Performance, 1 4, 221 –
and reduction of the over-attribution bias. Jour- 23 0.
Part IV
PROBLEM SOLVING AND

COMPLEX LEARNING

CHAPTER 1 4
Problem Solving
Laura R. Novick
Miriam Bassok
Introduction A little thought concerning the light bulb

situation, as well as our other examples, re-
People are confronted with problems on a veals that what constitutes a problem for one
daily basis such as extracting a broken light person may not be a problem for another
bulb from a socket, multiplying eight times person, or for that same person at another
seven, finding the roots of a quadratic equa- point in time. For example, the second time
tion, planning a family vacation, and de- one has to remove a broken light bulb from
ciding whom to vote for in a presidential a socket, the solution likely can be retrieved
election. Although these examples differ in from memory; there is no problem. Simi-
many ways, they share a common core: “A larly, 8 × 7 would generally be considered
problem arises when a living creature has a problem for 8-year-olds but not for read-
a goal but does not know how this goal ers of this chapter. Of course, age here is
is to be reached. Whenever one cannot go just a proxy for prior knowledge, for there
from the given situation to the desired sit- are 6-year-olds for whom this question does
uation simply by action [i.e., by the perfor- not constitute a problem because they know
mance of obvious operations], then there has the standard multiplication table. Given that
to be recourse to thinking” (Duncker, 1 945 , a problem has been identified, the nature
p. 1 ). Consider the broken light bulb. The of people’s background knowledge pertain-
obvious operation – holding the glass part ing to that problem has important implica-
of the bulb with one’s fingers while unscrew- tions for the solution-related thinking they
ing the base from the socket – is prevented by do. To understand this thinking, it is impor-
the fact that the glass is broken. Thus, there tant to distinguish (1 ) the solver’s represen-
must be “recourse to thinking” – for example, tation of the problem (i.e., the solver’s un-
one might try mounting half a potato on the derstanding of the underlying nature of the
broken bulb. problem) and (2) the sequence of steps the
32 1
solver takes to get from the given situation to presented to the solver) to the goal state.
the goal. Consider, for example, the Tower of Hanoi
A problem representation is a model problem: There are three pegs mounted on
of the problem constructed by the solver a base. On the leftmost peg, there are three
to summarize his or her understanding of disks of differing sizes. The disks are arranged
the problem’s essential nature. Ideally, this in order of size with the largest disk on the
model includes information about the goal, bottom and the smallest disk on the top.
the objects and their interrelations, the op- The disks may be moved one at a time,
erations that can be applied (i.e., the steps but only the top disk on a peg may be
that can be taken) to solve the problem, and moved, and at no time may a larger disk be
any constraints on the solution process. Con- placed on a smaller disk. The goal is to move
sider, for example, Posner’s (1 973 , pp. 1 5 0– the three-disk tower from the leftmost peg
1 5 1 ) trains and bird problem: to the rightmost peg. Figure 1 4.2 shows all
the possible legal arrangements of disks on
Two train stations are fifty miles apart. At
pegs. The arrows indicate transitions be-
2 p.m. one Saturday afternoon two trains
start toward each other, one from each sta- tween states that result from moving a sin-
tion. Just as the trains pull out of the sta- gle disk. The shortest path that connects the
tions, a bird springs into the air in front of initial state to the goal state (i.e., the opti-
the first train and flies ahead to the front mum solution) is indicated by the thicker
of the second train. When the bird reaches grey arrows.
the second train it turns back and flies to- Researchers who study problem solving
ward the first train. The bird continues to present people with various types of prob-
do this until the trains meet. If both trains lems for which those people do not have a
travel at the rate of twenty-five miles per prestored solution in memory and attempt
hour and the bird flies at a hundred miles
per hour, how many miles will the bird have
flown before the trains meet?
Figure 1 4.1 shows two different representa-
tions of this problem that imply different
solution methods. Solver A [Figure 1 4.1 (a)]
represents the problem as one concerning
the ongoing flight path of the bird, which
is the focus of the problem as presented.
This perspective yields a problem that would
be difficult for most people to solve (e.g.,
a series of differential equations). In con-
trast, solver B [Figure 1 4.1 ( b)] represents the
problem from the perspective of the paths of
the trains. This perspective yields a relatively
easy distance-rate-time problem. To take an-
other example, the problem 1 4 × 8 might
be represented as 8 groups of 1 4 or as 1 0
groups of 8 plus 4 groups of 8 (or in a variety of
other ways). Figure 1 4.1 . Alternative representations of
Posner’s (1 973 ) trains and bird problem. (From
For some problems, the primary work of
“Transferring symbolic representations across
solution is to find the best representation;
non-isomorphic problems,” by L. R. Novick &
for other problems, there is little uncer- C. E. Hmelo, 1 994, Journal of Experimental
tainty about the representation, and the pri- Psychology: Learning, Memory, and Cognition, 2 0,
mary work is to discover a solution path p. 1 297. Copyright 1 994 by the American
(or the best solution path) from the initial Psychological Association. Adapted with
state of the problem (the situation as initially permission.)
problem solving 32 3
Figure 1 4.2 . All possible problem states for the three-disk Tower of Hanoi problem. The thicker
grey arrows show the optimum solution path connecting the initial state (state #1 ) to the goal state
(state #27).
to find regularities in the resulting problem- likely to induce distinct representations (e.g.,
solving behavior. For example, Greeno the trains and bird problem, problems of in-
(1 978) distinguished problems of inducing ducing structure) or to require repeated se-
structure [e.g., proportional analogies such lection and application of operators within
as those found on standardized tests – for ex- a particular problem representation (e.g.,
ample, bird:fly::snake:?? (solution is slither)], the Tower of Hanoi and other problems of
transformation (e.g., the Tower of Hanoi), transformation, problems of arrangement).
and arrangement [e.g., anagrams – for ex- This division of labor, with its distinct his-
ample, unscramble dnsuo to form an English toric antecedents and research traditions, has
word (solution is sound )], and discussed led to many interesting findings. We review
the processes required to solve problems of the main findings from each tradition and
each type. Regardless of the specific prob- then review results from studies that high-
lem type, problem-solving behavior involves light the interaction between how people
an inherent interaction between construct- understand problems and how they derive
ing a representation and generating a solu- problem solutions.
tion. However, some researchers are most in- The remainder of this chapter is orga-
terested in factors that affect the way solvers nized into five sections. First, we provide
represent problems, whereas others look for a brief historic perspective on problem-
regularities in the way solvers apply opera- solving research. Next, we summarize re-
tors to get from the initial state to the goal search on the step-by-step process of gener-
state. Based on their main focus of interest, ating problem solutions. In the third section,
researchers devise or select problems that are we describe a variety of factors that affect
problem representation. Fourth, we con- ing the formation of a representation from

sider the interplay between constructing a the process of generating a solution. The
representation and generating a solution. Fi- Gestalt psychologists documented the im-
nally, we draw some conclusions and con- pact of changes in perspective on problem
sider directions for future research. Our re- difficulty as well as the effects of extrane-
view focuses on general findings that per- ous assumptions and prior knowledge on the
tain to a wide variety of problems. Research way people understand problems and, there-
on specific types of processes that are in- fore, generate problem solutions.
volved in problem solving and on problem The psychological study of human prob-
solving in particular content domains may lem solving faded into the background af-
be found elsewhere in this volume: induc- ter the demise of the Gestalt tradition, and
tion (see Sloman & Lagnado, Chap. 5 ); anal- problem solving was investigated only spo-
ogy (see Holyoak, Chap. 6); causal learning radically until 1 972, when Allen Newell and
(see Buehner & Cheng, Chap. 7); deductive Herbert Simon’s “Human problem solving”
reasoning (see Evans, Chap. 8); and problem (Newell & Simon, 1 972) sparked a flurry
solving in law (see Ellsworth, Chap. 28), sci- of research on this topic. In contrast to
ence (see Dunbar & Fugelsang, Chap. 29), the Gestalt psychologists, Newell and Simon
and medicine (see Patel, Arocha, & Zhang, emphasized the step-by-step process of
Chap. 3 0). searching for a solution path connecting
the initial state to the goal state. Their re-
search goal was to identify general-purpose
strategies that humans use to solve a variety
A Brief History of problems. Newell and Simon and their
colleagues were heavily influenced by the
Research on human problem solving has information-processing approach to cogni-
its origins in Gestalt psychology, an in- tive psychology and by work in computer
fluential approach in European psychol- science on artificial intelligence. These in-
ogy during the first half of the twentieth fluences led them to construct the General
century. (Behaviorism was the dominant Problem Solver (GPS), a computer program
perspective in American psychology at this that modeled human problem solving (Ernst
time.) Karl Duncker published a book & Newell, 1 969; Newell & Simon, 1 972).
on the topic in his native German in A great strength of GPS was its ability to
1 93 5 , which was subsequently translated solve problems as different as the Tower of
into English and published 1 0 years later Hanoi problem and the construction of logic
as the monograph “On problem-solving” proofs with a single general-purpose strat-
(Duncker, 1 945 ). Max Wertheimer also pub- egy (means-ends analysis, which we discuss
lished a book on the topic in 1 945 , titled in “Generating Problem Solutions”).
“Productive thinking.” An enlarged edition In the mid- to late 1 970s, the role of
published posthumously includes previously background knowledge became an impor-
unpublished material (Wertheimer, 1 95 9). tant research topic in cognitive psychology,
Interestingly, 1 945 seems to have been a wa- particularly in the area of text comprehen-
tershed year for problem solving, for math- sion (e.g., Anderson, Reynolds, Schallert, &
ematician George Polya’s book, “How to Goetz, 1 977; Bransford & McCarrell, 1 974).
solve it,” also appeared then. (A second edi- In the field of problem solving, researchers
tion was published 1 2 years later; Polya, recognized that a fundamental weakness
1 95 7.) Extending the organizational princi- of GPS was its lack of domain knowl-
ples of perception to the domain of problem edge. For every problem type, the general-
solving, the Gestalt psychologists empha- purpose strategy had to be supplemented
sized the importance of problem represen- with domain-specific knowledge. Moreover,
tation – how people view, interpret, or research on expertise in knowledge-rich
organize the given information – distinguish- academic domains, such as mathematics,
physics, and political science, especially dur- solution that transcend domains to the im-
ing the late 1 970s and early 1 980s, made portance of domain-specific knowledge, and
clear the necessity of taking domain knowl- from research that separates issues of repre-
edge into account for understanding prob- sentation and solution generation to a focus
lem solving. This research on expertise (e.g., on their interaction.
Chi, Feltovich, & Glaser, 1 981 ; Silver, 1 979,
1 981 ) provided empirical evidence for asser-
tions first made by Duncker decades earlier: Generating Problem Solutions
In his discussion of expertise differences in
the domain of mathematics, Duncker (1 945 , Algorithmic Versus Heuristic
p. 1 1 0) noted that “with ‘poor’ mathemati- Solution Strategies
cians, the thought-material is from the very
beginning more thoroughly imbued with The step-by-step solution process is the se-
perceptual functions. For the ‘good’ mathe- quence of actions solvers take to find and ex-
matician, on the other hand, there remains a ecute a procedure for generating a solution
more abstract stratum . . . in which only the to the problem as they understand it. Re-
specific mathematical properties still exist” searchers who study solution processes have
(italics removed). made a distinction between algorithmic and
It is perhaps inevitable that the two tra- heuristic strategies.
ditions in problem-solving research – one An algorithm is a procedure that is guar-
emphasizing representation and the other anteed to yield the solution. One type of al-
emphasizing the process of generating a so- gorithm is a mathematical formula. For ex-
lution – would eventually come together. ample, multiplying the length of the base
Although no single publication can be cred- of a rectangle times its height is guaranteed
ited for the rapprochement, one impetus for to yield the rectangle’s area. Similarly, the
a blending of the two traditions was the re- formula
alization that background knowledge plays a √
−b ± b 2 − 4ac
critical role in problem solving. In particular, X= (Eq. 1 4.1 )
2a
differences in background knowledge called
attention to the interdependence between is guaranteed to provide the roots of the
the representation constructed and the solu- quadratic equation
tion method employed, for solvers who con-
aX 2 + bX + c = 0. (Eq. 1 4.2)
structed different representations were ob-
served to generate the solution in different We discuss mathematical problem solving in
ways. Figure 1 4.1 provides a clear example some detail in “The Interplay Between Rep-
of this interdependence for the trains and resentation and Solution” (also see Gallistel
bird problem. To take another example, the & Gelman, Chap. 23 ).
8-year-old son of one of the authors mentally Another type of algorithm – exhaustive
represented the verbally stated multiplica- search – involves checking every possible
tion problem “sixty-seven times ninety-five” move. For example, one could solve the
as (60 × 95 ) + (7 × 95 ) and then proceeded Tower of Hanoi problem by exhaustively
to mentally execute the indicated arithmetic considering every possible move in Fig-
operations to get the answer. In contrast, ure 1 4.2. Similarly, one could solve a four-
most people would represent this problem letter anagram (e.g., idrb) by systematically
as 67 groups of 95 and turn to paper and pen- evaluating the 24 possible permutations
cil to compute the answer using the standard of the given letters (the solution is bird ).
multiplication algorithm (given the absence For problems with a large number of pos-
of a calculator). The structure of this chapter sible states, however, exhaustive search is
aims to capture the evolution of research in impractical or impossible. For example, if
the field of problem solving: from research the task is to find all possible solutions of a
on general principles of representation and five-letter anagram (e.g., ebrda forms bread,
beard, bared, and debar), exhaustive search their view that problem solving can be
would require examination of 1 20 letter or- described as a process of heuristic search
ders. More strikingly, consider the game of within a specific type of representation, and
chess (Holding, 1 985 ): White has 20 possible we consider in some detail two important
opening moves, to which black can respond search heuristics: hill climbing and means-
in any of 20 ways. Thus, on the second turn, ends analysis.
white may be confronted with any of 400
possible board positions. After white’s third
Problem Solving as Search Through
move, there are 7.5 million possible board
a Problem Space
positions; after black’s third move, there are
225 million possible positions. For a game of Newell and Simon (1 972) wrote a magnum
average length, the number of possible posi- opus detailing their theory of problem solv-
tions is approximately 1 0 1 1 7 . ing and presenting several lines of supporting
Clearly, some method is needed to prune evidence. Because their goal was to develop
the number of possible moves to be con- a theory to encompass all human problem
sidered. Such pruning is necessary for hu- solving, they emphasized what is common
man solvers owing to the limited capacity across the diversity of problems and problem
of working memory; it is also necessary for solvers. Their fundamental proposal was that
computers when, as in chess, the number of problem solving could be conceptualized as
possible states is extremely large. Heuristics a process of searching through a problem
are problem-solving strategies that accom- space for a path connecting the initial state
plish this goal. Although heuristics do not of knowledge (the solver’s understanding of
guarantee solution, they are highly likely to the given information) to the goal state (the
lead to success. For example, a good heuristic desired solution).
for solving anagrams, especially those with Problem space is the term Newell and
five or more letters (e.g., dsyha), is to con- Simon (1 972) coined to refer to the solver’s
sider letter pairs that commonly begin words representation of the task as presented (also
of the given length (e.g., Ronning, 1 965 ). see Simon, 1 978). Briefly, a problem space
This heuristic is useful because, by defini- consists of a set of knowledge states (the ini-
tion, most words begin with common letter tial state, the goal state, and various possible
pairs. Application of this heuristic to the ex- intermediate states), a set of operators that
ample should quickly lead to the solution, allow movement from one knowledge state
shady. That considering common initial let- to another, and local information about the
ter pairs is a heuristic rather than an algo- path one is taking through the space (e.g.,
rithm is nicely illustrated by a second ana- the current knowledge state and how one
gram, uspyr, which cannot be solved by this got there). For the three-disk Tower of Hanoi
strategy because it begins with an uncom- problem, the initial state is illustrated at the
mon letter pair (the solution, syrup, is the top of Figure 1 4.2 (state #1 ), and the goal
only five-letter word in English that begins state is illustrated at the bottom right of that
with sy; Novick & Sherman, 2004). figure (state #27). All other knowledge states
A large body of literature has examined shown in the figure are possible intermedi-
the heuristics that people use to generate ate states. The current knowledge state is
problem solutions. Much of this research the one at which the solver is located at any
has focused on puzzle-like problems, such given point in the solution process. For ex-
as the Tower of Hanoi, that require little ample, the current state for a solver who has
domain-specific knowledge. These problems made three moves along the optimum so-
are useful because they enable researchers lution path would be state #9. The solver
to focus their attention primarily on the presumably would know that he or she ar-
process of generating solutions. Newell and rived at this state from state #5 . This knowl-
Simon (1 972) were the pioneers in this area edge allows the solver to recognize a move
of research. In the next section, we discuss that involves backtracking. Finally, the three
HHHOOOb
operators in this problem are moving each 1.
of the three disks from one peg to another.
These operators are subject to the constraint
that a larger disk may not be placed on a HHHO HHOO
2. 3.
smaller disk. OOb HOb
Newell and Simon’s (1 972) primary fo-
cus of investigation was the strategies solvers 14.
HHHOOb
use to find a path connecting the initial state 4. HHHOOOb
to the goal state. That is, they sought to O
discover regularities in how solvers search OOb HOb

HHH 12. 13.
through a problem space. In a nutshell, 5. HHHO HHOO
search is a serial method for making incre- OOOb
mental progress toward the goal by applying

HHHOb O
operators to move from one knowledge state 6. 11.
to another adjacent knowledge state. Newell OO HHHOOb
and Simon discovered that, for a wide vari-

ety of problems, solvers’ search is guided by HO
10.
OOOb
7.
a small number of heuristics. HHOOb HHH
To investigate these heuristics, Newell
and Simon (1 972) relied on two primary HHOOb OO
8. 9.
methodologies – think-aloud protocols (also HO HHHOb
see Duncker, 1 945 ) and computer simula-
Figure 1 4.3. Problem states on the solution
tion. Solvers were required to say out loud
path for the Hobbits and Orcs problem. Each H
everything they were thinking as they solved represents a Hobbit, each O represents an Orc,
the problem – that is, everything that went and the b represents the boat. The two
through verbal working memory. Subjects’ horizontal lines indicate the banks of the river.
verbalizations – their think-aloud protocols – State #1 is the initial state, and state #1 4 is the
were tape-recorded and then transcribed goal state.
verbatim for analysis. This method is ad-
vantageous for studying problem solving be-
may be judged useful. Lovett and Anderson
cause it provides a detailed record of the
(Chap. 1 7) provide an in-depth treatment of
solver’s ongoing solution process. An im-
computer models of thinking.
portant caveat that must be kept in mind
while interpreting a subject’s verbalizations
is that “a protocol is relatively reliable only hill climbing
for what it positively contains, but not for Hill climbing is a heuristic in which, at each
that which it omits” (Duncker, 1 945 , p. 1 1 ). step, the solver applies the operator that
The use of think-aloud protocols to study yields a new state that appears to be most
problem solving was popularized by Newell similar to the goal state. This heuristic can be
and Simon. Ericsson and Simon (1 980) pro- used whenever solvers can define an evalua-
vided an in-depth discussion of the condition function that yields information about
tions under which this method is valid (but the similarity of the problem state gener-
see Russo, Johnson, & Stephens, 1 989, for ated by a candidate operator to the goal
an alternative perspective). To test their in- state. For example, Chronicle, MacGregor,
terpretation of a subject’s verbal protocol, and Ormerod (2004) found evidence that
Newell and Simon created a computer simu- subjects use hill climbing to solve various
lation that was intended to solve the prob- problems in which a set of coins has to be re-
lem the same way the subject did. To the ex- arranged from one configuration to another.
tent that the computer simulation provided We illustrate this heuristic using an exam-
a close approximation of the solver’s step- ple of a river-crossing problem (Figure 1 4.3 ),
by-step solution process, the interpretation one of the classic problem types in the
field: There are three Hobbits, three Orcs, taking the largest uphill step at all times.
and a boat on one side of a river (state #1 ). Sometimes one needs to walk downhill for a
The goal is to use the boat, which has a ca- while to achieve the ultimate goal of reach-
pacity of only two creatures, to ferry all the ing the mountain top.
creatures across the river (state #1 4). At no
time may Orcs outnumber Hobbits on either mean-ends analysis
side of the river because they will eat the Means-ends analysis is a more sophisticated
Hobbits. The solution path for this problem heuristic than hill climbing because it does
is essentially linear, as shown in Figure 1 4.3 . not depend on simple similarity to the goal.
From the initial state, there are two le- This heuristic consists of the following steps:
gal moves available – ferrying two Orcs or
one Orc and one Hobbit across the river. 1 . Identify a difference between the cur-
Both moves yield new states that are equally rent state and the goal (or subgoal) state.
similar to the goal state, and so either may 2. Find an operator that will remove (or re-
be chosen. Use of the hill-climbing heuristic duce) the difference.
proceeds smoothly for the most part until 3 a. If the operator can be directly applied,
the solver reaches state #7 in which there do so, or
is one Hobbit and one Orc on the original 3 b. If the operator cannot be directly ap-
side of the river; the boat and the remaining plied, set a subgoal to remove the obsta-
creatures are on the other (goal) side. The cle that is preventing execution of the
correct move at this point, in fact the only desired operator.
nonbacktracking move, is for one Hobbit and
4. Repeat steps 1 to 3 until the problem is
one Orc to take the boat back to the original
solved.
side of the river. Thomas (1 974) and Greeno
(1 974) found that solvers have particular dif- We illustrate this heuristic with the Tower
ficulty moving from state #7 to state #8: of Hanoi problem. A key difference be-
Both the probability of making an incorrect tween the initial state and the goal state
move and the time taken to make a move are (Figure 1 4.2) is that the large disk is on
quite large for this transition compared with the wrong peg (step 1 ). The move-large-disk
other transitions. According to Wickelgren operator is required to remove this differ-
(1 974), this difficulty occurs for either of ence (step 2). However, this operator can-
two reasons. For solvers who evaluate their not be applied because of the presence of
progress one move at a time, this transition is the medium and small disks on top of the
problematic because one must detour more large disk. Therefore, the solver may set a
than usual by taking two creatures back to subgoal to move that two-disk tower to the
the original side of the river (logically, only middle peg (step 3 b), thereby leaving the
one creature is needed to get the boat back right peg free for the large disk. A key dif-
to the original side). For solvers who evalu- ference between the initial state and this
ate their progress two moves at a time (i.e., new subgoal state is that the medium disk
round trips of the boat from the original is on the wrong peg. Because application of
side back to the original side), this transi- the move-medium-disk operator is blocked,
tion is problematic because it results in no the solver sets another subgoal to move
net progress toward the goal compared with the small disk to the right peg. This sub-
state #6. goal can be satisfied immediately by apply-
The difficulty solvers encounter in moving the move-small-disk operator (step 3 a),
ing from state #7 to state #8 illustrates generating state #3 . The solver then re-
the primary drawback of the hill-climbing turns to the previous subgoal – moving the
heuristic: Sometimes one needs to move ei- tower consisting of the small and medium
ther backward or laterally to move forward. disks to the middle peg. The differences be-
Climbing a mountain can rarely be accom- tween the current state (#3 ) and the subgoal
plished solely by following the strategy of state (#9) can be removed by applying first
the move-medium-disk operator (yielding Hanoi problem as opposed to the problem

state #5 ) and then the move-small-disk op- of finding the roots of a quadratic equation
erator (yielding state #9). Finally, the move- or of unscrambling an anagram.
large-disk operator is no longer blocked. The
solver takes that action, moving the large
Some Conclusions from Research
disk to the right peg, yielding state #1 1 . No-
on Problem Solving as Search
tice that the subgoals are stacked up in the
order in which they are generated so they Newell and Simon’s (1 972) goal was to
pop up in the order of last in first out. Given discover general problem-solving strategies
the first subgoal in our example, repeated ap- that are common across problem solvers
plication of the means-ends analysis heuris- and across problems. One important con-
tic will yield the shortest-path solution indi- tribution of their work concerns the meth-
cated by the thick grey arrows. ods they adopted for studying this issue.
The key difference between hill climbing Duncker (1 945 ) was an early advocate of col-
and mean-ends analysis is the online gen- lecting think-aloud protocols, and he used
eration of subgoals in the latter heuristic. this methodology very successfully to study
Adding new subgoals during problem solv- problem solving. With the rise to dominance
ing greatly increases the power of heuristic of behaviorism and the fall of the Gestalt ap-
search. Subgoals provide direction, and to proach to psychology, however, this method-
the extent that they are appropriate, they ology fell into disfavor. Newell and Simon
can be expected to prune the space of pos- (1 972) brought a high degree of scientific
sible states. Moreover, by assessing progress rigor to the collection of verbal protocols,
toward a required subgoal rather than the fi- enabling this methodology to gain a degree
nal goal, solvers may be able to make moves of acceptance in the field that it did not
that otherwise seem unwise. To take a con- previously enjoy. In addition, Newell and
crete example, consider the transition from Simon were among the early pioneers in the
state #1 to state #3 in Figure 1 4.2. Compar- use of computer simulation as a tool for
ing the initial state with the goal state, we testing theories of psychological processes.
find that this move seems unwise because Both of these methods are now seen as ordi-
it places the small disk on the bottom of nary rather than exotic means of investigat-
the right peg, whereas it ultimately needs ing problem solving (as well as other cogni-
to be at the top of the tower on that peg. tive processes).
However, if one compares the initial state Newell and Simon’s (1 972) goal of un-
with the solver-generated subgoal state of covering general problem-solving strategies
having the medium disk on the middle peg, necessitated a focus on the solution of puz-
this is exactly where the small disk needs to zles such as the Tower of Hanoi and Hobbits
go. More generally, generating subgoals al- and Orcs, which are relatively uncontami-
lows solvers to plan several moves ahead. nated by domain knowledge that necessar-
(Duncker, 1 945 , also talked about the im- ily varies across individuals. This focus was
portance of subgoals.) much like Ebbinghaus’ strategy of investigat-
As we noted in our brief historic review, ing general principles of memory by study-
means-ends analysis is the heuristic that GPS ing nonsense syllables. Using this strategy,
used to successfully model human problem Newell and Simon and their colleagues made
solving across a wide variety of tasks (Ernst important contributions to the field of prob-
& Newell, 1 969; Newell & Simon, 1 972). A lem solving: Means-ends analysis and other
large body of research has found that mean- heuristics are very flexible and general strate-
ends analysis tends to be people’s preferred gies that people frequently use to success-
solution method for novel problems that are fully solve a large variety of problems.
relatively free of specialized content and for Nevertheless, the view of problem solving
which a definite goal is given (Greeno & as search through a problem space does not
Simon, 1 988) – for example, the Tower of provide a complete understanding of how
people solve problems. Although people rely the goal is not well defined, nor is it clear
on general-purpose search heuristics when how to determine that the goal has been ac-
they encounter novel problems, because complished. For example, what constitutes
these heuristics are weak and fallible, they a tasty dinner, and how does one decide that
abort them as soon as they acquire some a particular recipe is tasty enough? It seems
knowledge about the particular problem obvious that a cook’s definition of the goal
space. At that point, they switch to more state will depend on his or her background
specialized strategies (e.g., Anzai & Simon, knowledge. A poor graduate student might
1 979). In general, whenever solvers have picture homemade pizza, a parent of young
some relevant background knowledge, they children might imagine lasagna, an Indian
tend to use stronger, albeit more narrowly couple without children might think of spicy
applicable, domain-specific methods. The lamb vindaloo, and a gourmet cook might
impact of learning and domain knowledge visualize beef Wellington. The tasty dinner
on strategy use led problem-solving re- problem is ill defined in other ways as well.
searchers to turn their attention from the The cook has to define the given informa-
solution of knowledge-lean puzzles and rid- tion (only ingredients found at home or also
dles to problems that made connections to those at the grocery store?), the operators
solvers’ background knowledge. This shift is (e.g., to bake or stir fry or simmer on the
analogous to memory and comprehension stove), and the constraints (e.g., time, cost,
researchers’ switch from studying nonsense the differing tastes of adults and children).
syllables to studying words, paragraphs, and As we noted earlier, the Gestalt psychol-
stories in order to understand the role of ogists focused their attention on the factors
prior knowledge in memory and compre- that affect how people define, understand,
hension. As we noted in the introduction, or represent problems. Greeno (1 977), in
background knowledge plays an important specific counterpoint to Newell and Simon’s
role in determining the representation a (1 972) focus on problem solving as search,
solver constructs for a problem, which, in also highlighted the central importance of
turn, affects the processes the solver uses to representation. More recently, researchers
generate a solution. In the next two sections, who have studied problem solving in par-
we focus on problem representation and the ticular knowledge domains (e.g., mathemat-
interplay between representation and solu- ics, physics, medical diagnosis) have also em-
tion, respectively. phasized the critical role of representation
in successful problem solving. Their investi-
gations have shown that various aspects of
Problem Representation
the problem situation, as well as people’s
background knowledge, affect how people
Overview
represent problems and, in turn, how they
In problems such as the Tower of Hanoi and generate problem solutions. The trains and
Hobbits and Orcs, all the problem compo- bird problem we discussed at the outset
nents – the initial conditions, the goal, the (Figure 1 4.1 ) provides an anecdotal exam-
means for generating and evaluating the so- ple of the importance of the representation
lution, and the constraints – are well defined constructed for the ultimate success of one’s
in the problem as presented. In most real- solution attempt.
world problems, however, the solver has to We stated informally at the outset that
define one or more of the problem compo- a problem representation is a model of the
nents. For example, a person’s desire to cook problem constructed by solvers to summa-
a tasty dinner, a student’s aspiration to write rize their understanding of the problem’s
a term paper that will earn a grade of “A,” essential nature. More specifically, a repre-
and a young executive’s need to find suitable sentation has four components (Markman,
housing are all examples of ill-defined prob- 1 999): (1 ) a represented world – in this case,
lems (Reitman, 1 965 ). In these problems, the description of the problem to be solved,
problem solving 331
(2) a representing world – the set of elements play between representation and solution,
to be used to depict the objects and relations focusing there on studies showing that the
in the represented world, (3 ) a set of rules representation one constructs for a problem
that map elements of the represented world affects how one generates the solution.
to elements of the representing world, and
(4) a process that uses the information in the The Importance of Problem Context
representing world – in this case, to solve
the problem. This last component high- A number of studies have found that various
lights the link between representation and aspects of the problem context have a strong
solution: Without some process that uses the influence on the representations solvers con-
information in the representation for some struct. In this section, we describe three such
purpose, the so-called representation has no studies, which illustrate three different types
symbolic meaning (i.e., it does not serve a of problem context effects. The first study il-
representational function). lustrates an effect of the perceptual form of
The representation a solver uses to sup- the problem, the second study shows an ef-
port and guide problem solving can be either fect of semantic interpretation based on how
internal (residing in working memory) or ex- objects are used, and the third study demon-
ternal (e.g., drawn on paper). In either case, strates an effect of the story content of the
the elements of the representing world may problem.
follow a variety of different formats. Some
representations are best described as verbal perceptual form
or propositional or declarative. Others are Problems that are presented as visual dis-
pictorial or diagrammatic, such as a drawing plays or diagrams may provide informa-
of a pulley system, a matrix or network, and a tion about configuration that solvers deem
bar or line graph (see Hegarty, Carpenter, & relevant to the solution and include in
Just, 1 991 , for a discussion of types of di- their problem representation. This effect is
agrammatic representations). Finally, some nicely illustrated by Maier’s (1 93 0) nine-dot
representations are “runnable” mental mod- problem: Nine dots are arrayed in a 3 ×
els (e.g., a mental abacus – Stigler, 1 984; a 3 grid, and the task is to connect all the
system of interlocking gears – Schwartz & dots by drawing four straight lines without
Black, 1 996). lifting one’s pencil from the paper. People
In the previous section of this chapter, have difficulty solving this problem because
we highlighted how solvers generate prob- their initial representations generally include
lem solutions, leaving in the background the a constraint, inferred from the configuration
question of how they represent the informa- of dots, that the lines cannot go outside the
tion in the problem. In this section, we take boundary of the imaginary square formed by
the opposite perspective, highlighting the the outer dots. With this constraint implied
problem representations that solvers con- by the perceptual form of the dots, the prob-
struct and leaving in the background the lem cannot be solved (but see Adams, 1 979).
methods by which those representations are Without this constraint, the problem may be
used to generate the solution. We consider solved as shown in Figure 1 4.4.
solution only as a dependent measure (i.e., The nine-dot problem is a classic insight
accuracy and/or solution time), illustrating problem. According to the Gestalt view
that differences in problem representation (e.g., Duncker, 1 945 ; Maier, 1 93 1 ; see Ohls-
affect problem solution. Our discussion of son, 1 984a, for a review), the solution to an
research in this area is organized around two insight problem appears suddenly, accom-
classes of factors that have been found to af- panied by an “aha!” sensation, immediately
fect the representation that solvers select or following the sudden restructuring of one’s
construct for the problem at hand – problem understanding of the problem: “The decisive
context and solver’s knowledge. In the next points in thought-processes, the moments of
section of the chapter, we consider the inter- sudden comprehension, of the ‘Aha!,’ of the
new, are always at the same time moments (1 945 ): If an object has been used for one
in which such a sudden restructuring of purpose, or is habitually used for a certain
the thought-material takes place” (Duncker, purpose, it is difficult to see that object as
1 945 , p. 29). For the nine-dot problem, one having properties that would enable it to
view of the required restructuring is that the be used for a dissimilar purpose. Duncker’s
solver relaxes the constraint implied by the basic experimental paradigm involved two
perceptual form of the problem and realizes conditions that varied in terms of whether
that the lines in fact may extend past the the object that was crucial for solution was
boundary of the imaginary square. initially used for a function other than that
To test this view, in one experiment required for solution.
Weisberg and Alba (1 981 ) compared the per- Consider the candles problem, the most
formance of control subjects who were given well-known of the five problems Duncker
20 attempts to solve the nine-dot problem (1 945 ) investigated. Three candles are to be
with that of other subjects who received mounted at eye height on a door. On the ta-
1 0 attempts before a restructuring hint, fol- ble for use in completing this task are some
lowed by 1 0 attempts after the hint. The tacks and three boxes. The solution is to
restructuring hint involved telling subjects tack the three boxes to the door to serve
that they had exhausted all possibilities in- as platforms for the candles. In the control
side the square, and so they had to go outside condition, the three boxes were presented
the square to solve the problem. No sub- to subjects empty. In the functionally fixed
ject in either condition solved the problem condition, the three boxes were filled with
in the first 1 0 tries, and no subject in the candles, tacks, and matches. Thus, in the lat-
control condition ever solved the problem ter condition, the boxes initially served the
(excluding those who had seen the problem function of container, whereas the solution
before). However, 20% of the restructuring requires that they serve the function of plat-
hint group solved the problem in the sec- form. The results showed that 1 00% of the
ond 1 0 tries. A follow-up study that gave subjects who received empty boxes solved
subjects many more solution attempts repli- the candles problem compared with only
cated these results. Interestingly, solution 43 % of subjects who received filled boxes.
was neither quick nor direct following the Every one of the five problems showed a dif-
restructuring hint in either study, for sub- ference favoring the control condition over
jects generally required 5 to 1 1 solution at- the functionally fixed condition with aver-
tempts after the hint before solving the prob- age solution rates across the five problems
lem. Moreover, 75 % to 80% of the subjects of 97% and 5 8%, respectively. In “The Inter-
failed to solve the problem despite the hint. play between Representation and Solution”
Thus, restructuring, as provided by Weisberg we discuss additional examples of object-
and Alba’s hint, appears to be necessary but based inferences that link semantic content
not sufficient for solution. We reconsider the to representation and then to the method of
nature of insight in “The Interplay Between solution adopted.
Representation and Solution.”
story content
object-based inferences In our earlier discussion of the trains and bird
In addition to making inferences from the problem, we mentioned that the text is writ-
perceptual form of a presented figure, solvers ten such that it invites the solver to focus
may draw inferences from the specific enti- on the motion of the bird [Figure 1 4.1 (a)]
ties that appear in a problem, and these in- rather than of the trains [Figure 1 4.1 (b)]. In
ferences may likewise affect the constructed general, the story content and phrasing of
problem representation. A classic example the problem text may affect how the solver
of such inferences is the phenomenon of represents the problem. Hayes and Simon
functional fixedness introduced by Duncker (1 977; also see Kotovsky, Hayes, & Simon,
problem solving 333
globes, only the larger globe may be trans-

ferred; and (3 ) a globe cannot be transferred
to a monster that is holding a larger globe.
In the change variant, subjects were told
that the monsters could shrink and expand
themselves according to the following rules:
(1 ) Only one monster may change size at
a time; (2) if two monsters are the same
size, only the one holding the larger globe
may change size; and (3 ) a monster may not
change size so it becomes the same size as an-
other monster that is holding a larger globe.
Because these two problems are struc-
turally identical, they can be solved by mak-
ing the same sequence of moves in the same
Figure 1 4.4. A solution to the nine-dot problem.
problem space. However, the subjects did
not translate the problems to a common rep-
1 985 ) provided empirical evidence that dif- resentation. Rather, they accepted the cover
ferences in the descriptions of the operators story as given and, depending on the vari-
in two isomorphic (i.e., structurally equiva- ant they received, proceeded to either move
lent) problems yielded quite different rep- globes or change monster sizes. The differ-
resentations with important consequences ent representations and operators adopted
for the problems’ relative difficulty. They were apparent in the written notations pro-
used several variants of the Tower of Hanoi duced by subjects as they solved the prob-
problem that concerned monsters and globes lem (Hayes & Simon, 1 977). Importantly,
that came in three sizes: small, medium, the representation constructed had a large
and large. We discuss one “transfer” variant effect on solution time: The transfer vari-
and one “change” variant used in Hayes and ant took about 1 4 minutes to solve com-
Simon’s research. For both variants, the ini- pared with about 29 minutes for the change
tial state had the small monster holding the variant. The greater difficulty of the change
large globe, the medium-size monster hold- variant is due to an additional step needed
ing the small globe, and the large monster to check that the operator constraints have
holding the medium-size globe. The goal been satisfied.
was for each monster to have a globe pro-
portionate to its own size. Both variants
The Importance of Solvers’ Knowledge
can be mapped onto the Tower of Hanoi
problem states shown in Figure 1 4.2. If we In the previous section, we discussed prob-
map the small, medium, and large monsters lem factors that affect the representations
onto the left, center, and right pegs, respec- solvers construct. However, the extent to
tively, and map increasing globe size onto which solvers respond to various problem
decreasing disc size, both monster variants factors depends on their prior experience
are equivalent to the task of getting from and background knowledge. Consider, for
state #1 2 to state #5 in Figure 1 4.2. example, the following mathematical word
The only difference between the two problem: “Susan has 1 2 cookies and three
monsters and globes isomorphs concerned boxes. How many cookies should she place
the description of the operators. In the trans- in each box in order to divide them up
fer variant, subjects were told that the mon- fairly?” A child who has sufficient experience
sters could transfer the globes from one to with solving such problems is likely to repre-
another as long as they followed three rules: sent this problem in terms of its mathemati-
(1 ) Only one globe may be transferred at a cal structure – simple division. In contrast,
time; (2) if a monster is holding multiple a child who has never encountered such
problems might try to understand it in terms problem. Holyoak (Chap. 6) provides an

of human motivation and behavior. This in-depth treatment of research on analogy.
child might consider the size of the cook- Here, we describe in detail only a single, now
ies and the boxes, or wonder who Susan is classic, study (Gick & Holyoak, 1 980) that
and why she wants to put the cookies in illustrates this line of research.
the boxes. In general, solvers’ background Gick and Holyoak (1 980) used Duncker’s
knowledge affects whether and to what ex- (1 945 ) radiation problem as their target
tent they focus their attention on problem (novel) problem. This problem involves
aspects that are or are not relevant to deter- finding a way to use some rays to destroy
mining the solution. In this section, we dis- a patient’s stomach tumor, without harming
cuss three types of background knowledge the patient. At sufficiently high intensity, the
that pertain to solvers’ understanding of the rays will destroy the tumor. However, at that
problem at hand. First, we consider solvers’ intensity they will also destroy the healthy
prior experience with a structurally similar tissue surrounding the tumor. At lower in-
or analogous problem. Second, we consider tensity, the rays will not harm the healthy
their generalized schemas for types of so- tissue, but they also will not destroy the tu-
lution procedures as well as types of com- mor. The desired solution is to project multi-
mon representational tools (e.g., matrices). ple low-intensity rays at the tumor from sev-
Third, we consider differences in problem eral points around the patient. The rays will
representation that are due to differences in converge on the tumor, where their individ-
solvers’ domain expertise. ual intensities will sum to a level sufficient to
destroy the tumor. Baseline use of this con-
experience with a structurally similar or vergence solution is quite low – about 1 0%
analogous problem (Gick & Holyoak, 1 980). Gick and Holyoak
A large body of research has examined peo- examined whether solvers’ understanding of
ple’s use of specific examples of problems the radiation problem, as indexed by their
to help them understand and solve a cur- use of the convergence solution, might be
rent problem. An example can be helpful facilitated by prior exposure to an analogous
for solving a novel problem only if the two situation. To this end, they had subjects at-
problems have a similar underlying structure tempt to solve the radiation problem after
because a problem’s structure is what de- having previously read a story that described
termines appropriate solution methods (e.g., the following analogous situation: A general
division for the cookie problem). The ex- was trying to capture a fortress controlled by
ample will not be helpful if the problems a dictator. Multiple roads led to the fortress
only share a similar cover story and involve from all directions. However, the roads were
similar objects (e.g., a person, cookies, and mined in such a way that large groups of sol-
boxes) but differ in their underlying struc- diers could not travel on them. The general
ture (e.g., in the example Susan distributes decided to send a separate small group of sol-
cookies among boxes, but in the novel prob- diers down each of the various roads so the
lem Leah removes one cookie from each full army would converge at the fortress. In
box). Research on analogical problem solv- this way, he was able to overthrow the evil
ing (also referred to as analogical trans- dictator and capture the fortress.
fer) shows that solvers’ understanding, or Gick and Holyoak (1 980) found that sub-
representation, of a novel problem can be jects generally did not spontaneously notice
facilitated by prior experience with an anal- that the story about the fortress was relevant
ogous (i.e., structurally equivalent) problem. to solving the radiation problem: Only about
However, people may fail to retrieve an anal- 20% provided the convergence solution to
ogous problem from memory, or fail to ap- that problem after having read the fortress
ply an analogous solution, if they focus their story. However, when these same subjects
attention on the solution-irrelevant differ- were subsequently given a simple hint indi-
ences between the example and the novel cating that one of the stories they had read
problem solving 335
earlier might be helpful for solving the ra- studies have shown that schemas for solution
diation problem, about 75 % generated the procedures can be induced by comparing
convergence solution. These results indicate two or more analogous problems (with their
that solvers may fail to spontaneously notice solutions) or by successfully solving one
the relevance of problems stored in mem- problem by analogy to another (solved)
ory for understanding and solving a current problem, and such schema induction in turn
problem, although they are able to use the facilitates understanding and solution of sub-
prior problem appropriately when its rele- sequent analogous problems (e.g., Bassok
vance is highlighted. & Holyoak, 1 989; Gick & Holyoak, 1 983 ;
An important factor that mediates spon- Novick & Holyoak, 1 991 ; Ross & Kennedy,
taneous retrieval and use of analogous solu- 1 990). Research on solution schemas is
tions is people’s understanding of the learn- discussed in more detail by Holyoak
ed example. Chi, Bassok, Lewis, Reimann, (Chap. 6).
and Glaser (1 989) investigated this issue In the remainder of this section, we dis-
in the domain of physics, using problems cuss some of the recent research on repre-
from elementary mechanics. They found sentation schemas (Hurley & Novick, 2004;
that learners who understood the logic of Novick, 2001 ; Novick, Hurley, & Francis,
textbook examples spontaneously applied 1 999). This research shows that college stu-
the example problems’ solutions to analo- dents possess abstract schemas for three
gous test problems that differed from the spatial diagrams – matrices, networks, and
learned examples in many respects. How- hierarchies – that are important tools for
ever, poor learners failed to recognize the understanding and solving problems from a
structural similarity between the examples variety of domains (see Tversky, Chap. 1 0,
and the novel problems. People’s ability to for a general review of visuospatial reason-
exploit analogous solutions also depends on ing). These schemas presumably were in-
their domain expertise. We discuss expertise duced over the course of students’ in-school
differences in problem representation after and out-of-school experiences with concrete
considering schematic knowledge. Then, in instances of these diagrams in use (Novick,
“The Interplay Between Representation and 2001 ). For example, matrices are used for
Solution” we consider the implications of ex- multiplication tables, time schedules, grade
pertise differences for analogical transfer. books, and seating charts. The spatial dia-
gram schemas seem to be at an intermedi-
ate level of generality (Novick et al., 1 999):
general schemas in memory Each type of diagram is best suited for a par-
In addition to knowledge of specific prob- ticular type of relational structure, regardless
lems encountered in the past, solvers also of the content domain in which that struc-
have in memory abstract schemas for types ture is embedded. For example, a matrix is
of problems, types of solution procedures, appropriate whenever (1 ) all possible com-
and types of representations. These schemas binations of items across two sets must be
are abstract in the sense that they include considered, (2) the relation between items is
information that is common to multiple associative (i.e., nondirectional), and (3 ) it is
problems of a particular type but exclude important to be able to distinguish between
information that is idiosyncratic to the in- items that are related and those that are not
dividual problems over which the abstrac- (Novick & Hurley, 2001 ). The abstract rep-
tion has occurred. For example, an abstract resentation schemas are more useful than are
schema for the convergence solution would specific relevant example problems for un-
specify that multiple, low-intensity forces derstanding the structures of novel problems
converge from different directions on a cen- (Novick et al., 1 999).
tral target, but it would not specify that the To measure problem understanding,
forces are soldiers (or rays) or that the tar- Novick et al. (1 999) asked subjects to select
get is a fortress (or a tumor). A number of the most appropriate type of spatial diagram
to represent the structure of each of 1 2 story prior knowledge by having them think about
problems. (Solving these problems would each abstract diagram for 20 seconds greatly
have required using analytical or mathemati- facilitated understanding of the test prob-
cal reasoning.) In one experiment, some sub- lems compared with spending 6 minutes
jects participated in a specific example con- studying and successfully solving each of the
dition, whereas other subjects participated relevant example problems.
in a general category condition. The initial
task in the specific example condition pro-
vided subjects with three example problems, expertise
each illustrating the use of a different one The studies discussed in the previous two
of the three spatial diagrams. Subjects spent sections examined the effects of back-
6 minutes solving each example problem us- ground knowledge on problem representa-
ing the diagrammatic representation given. tion among typical college students. It has
In contrast, the initial task in the general cat- also proved to be especially interesting to
egory condition was designed to cue the ab- investigate problem representation among
stract schemas that subjects were hypoth- people who differ with respect to their ex-
esized to have in memory. Subjects were pertise in the domain under investigation.
shown (one at a time) an abstract (empty) Duncker (1 945 ) was perhaps the first psy-
hierarchy, matrix, and network. Above each chologist to note that experts and novices in
diagram was a short phrase naming the type a domain focus their attention on different
of diagram (e.g., “a network or system of aspects of that domain, leading them to con-
paths”). Subjects saw the abstract diagrams struct problem representations that are quite
for 20 seconds each and were asked to famil- different: Whereas experts’ representations
iarize themselves with the diagrams so they tend to highlight solution-relevant structural
would have clearly in mind what each one is features (in particular, meaningful causal re-
like for the next task. lations among the objects in the problem),
If college students possess at least rudi- novices’ representations tend to highlight
mentary abstract schemas for the three spa- solution-irrelevant superficial features (e.g.,
tial diagrams, then the brief (20-second) the particular objects themselves or how the
study times for the abstract diagrams question is phrased). Evidence for these rep-
presented in the general category condition resentational differences has been found us-
should have been sufficient to cue those ing a wide variety of experimental tasks and
schemas. Abstract schemas provide a more procedures.
reliable source of knowledge for understand- A number of studies have found that ex-
ing new problems than do specific example perts’ attention is quickly captured by mean-
problems because the schemas do not con- ingful configurations within a presented
tain specific story content (Holyoak, 1 985 ). stimulus, a result that calls to mind the
In contrast, example problems do contain Gestalt view that problem solving is related
specific content, and this content must be to perception. In contrast, novices’ atten-
ignored when it mismatches that of the tion is focused on isolated components of
novel problems. Given this difference be- the stimulus. Perhaps the earliest research
tween abstract schemas and concrete ex- investigating this issue comes from the do-
amples, Novick et al. (1 999) predicted that main of chess (Chase & Simon, 1 973 ; de
subjects in the general category condition Groot, 1 966). In the typical study, subjects
would be more successful than those in the view 20 or more chess pieces arranged on a
specific example condition at selecting the chess board for 5 seconds and then have to
most appropriate type of representation for immediately reconstruct what they saw on a
the test problems that required spatial di- new chess board. The arrangement of chess
agram representations. The results strongly pieces is either from the middle of a real
supported this prediction: Cueing subjects’ game or is random. When the arrangement
problem solving 337
comes from a real game, recall improves dra- representations are not restricted to math-
matically as a function of expertise, from ematical domains. For example, Kindfield
about 5 pieces for novices to about 20 pieces (1 993 /1 994) analyzed the chromosome di-
for players at the level of International Mas- agrams produced by subjects who varied in
ter or above (Gobet & Simon, 1 996). Recall their degree of formal training in genetics as
also improves with expertise for random po- they reasoned about the process of meiosis.
sitions, although the effect is much smaller She found that the more expert subjects pro-
(from about 2.6 to 5 .3 pieces; Gobet & duced more abstract chromosome diagrams
Simon, 1 996). These expertise differences that highlighted the features that were bio-
can be explained by the hypothesis that ex- logically relevant to the problem at hand. In
pert chess players have stored in memory contrast, the diagrams of the less advanced
meaningful groups (chunks) of chess pieces. subjects more literally resembled chromo-
Chase and Simon (1 973 ) found evidence for some appearance under a light microscope,
such chunks based on an analysis of the la- including aspects such as dimensionality and
tencies between recall of consecutive pieces. shape that have no bearing on the process
Better recall of structured or meaningful of meiosis.
stimuli by experts than by novices has been Similar findings also have emerged from
found in many other domains as well: Circuit research involving geometric analogies, a
diagrams (Egan & Schwartz, 1 979), com- problem type that does not seem to in-
puter programming (McKeithen, Reitman, volve detailed domain knowledge. Schiano,
Rueter, & Hirtle, 1 981 ), medicine (Coughlin Cooper, Glaser, and Zhang (1 989) asked
& Patel, 1 987; Myles-Worsley, Johnston, & high school students who had received very
Simons, 1 988), basketball and field hockey low or very high scores on a standardized
(Allard & Starkes, 1 991 ), and figure skating geometric analogy test to sort proportional
(Deakin & Allard, 1 991 ). analogies (of the form A:A ::B:B ) involv-
Evidence for representational differences ing geometric figures into groups of related
between experts and novices also comes problems. They found that the low-scoring
from studies in which subjects were asked students tended to sort the problems accord-
to sort problems into groups based on how ing to superficial perceptual similarities. For
they would be solved. In one of the early example, they put the problems involving
studies using this methodology, Chi et al. circles and those involving partially shaded
(1 981 ) asked students to group physics (me- hexagons into separate piles. In contrast,
chanics) word problems into categories of the high-scoring students tended to sort the
related problems. They found that advanced problems according to the abstract, transfor-
physics graduate students tended to group mational relations underlying solution. For
the problems according to the physics prin- example, they put the problems involving
ciples required for solution (e.g., conserva- rotations and those involving size transfor-
tion of energy). In contrast, undergraduates mations into separate piles.
who had successfully completed an intro- It is important to note that these rep-
ductory physics course tended to group the resentational differences between experts
problems according to the types of objects and novices (or between people who are
presented (e.g., springs versus pulleys versus highly skilled versus less skilled in a do-
inclined planes). main) are a matter of emphasis and de-
Comparable results have been found in gree. With increasing expertise/knowledge,
the domains of mathematics and com- there is a gradual change in the focus of
puter programming using measures based on attention and in the problems that are
both problem sorting and free recall (Adel- seen as related, and the extremes are not
son, 1 981 ; McKeithen et al., 1 981 ; Silver, quite as extreme as summaries of the differ-
1 979, 1 981 ; Weiser & Shertz, 1 983 ). These ences often suggest (e.g., Deakin & Allard,
knowledge-based differences in problem 1 991 ; Hardiman, Dufresne, & Mestre, 1 989;
McKeithen et al., 1 981 ; Myles-Worsley similarity between problems that differ in

et al., 1 988; Schoenfeld & Herrmann, 1 982; appearance.
Silver, 1 981 ). In this section, we review research that
highlights this interplay between represen-
tation and solution generation. The first part
of our review focuses on problem solving in
The Interplay Between Representation mathematics. As suggested by our initial ex-
and Solution ample from Wertheimer (1 95 9), this is a do-
main in which the interplay between rep-
So far, we have considered problem representation and solution generation is easy
resentation and the process of generating to see. We show how the effects on repre-
problem solutions separately. We noted at sentation of several of the solver and prob-
the outset, however, that these topics are lem factors identified in the previous section
inherently interrelated: The representation have consequences for the solution method
one constructs is likely to affect how one employed. In the second part of our re-
goes about generating a solution. A classic view, we revisit the nature of insight prob-
example comes from Wertheimer (1 95 9). lem solving, a topic that is currently receiv-
Students are generally taught how to coming much attention. The research on this
pute the area of a parallelogram as shown in topic aims to sort out the inherent inter-
Figure 1 4.5 (a). Wertheimer distinguished play between representation and solution
two groups of students based on their rep- generation.
resentations of the solution method. Some
students constructed what we might call to- Mathematical Problem Solving
day a procedural representation. They were
able to compute the area by rote appli- domain knowledge
cation of the learned formula. The repre- Wertheimer (1 95 9) found that structural un-
sentations of other students reflected good derstanding helps solvers to see important
conceptual understanding of the solution similarities between problems that differ in
method, namely that a triangle can be cut appearance. Research reviewed in the repre-
off from one side of the geometric figure sentation section showed that experts (i.e.,
and pasted onto the other side to create people with high domain knowledge) bet-
a rectangle to which the learned formula ter understand the structure of problems
then obviously applies. Wertheimer found within their domain of expertise than do
that students who represented the problem novices (i.e., people with low domain knowl-
as one of converting the parallelogram into edge). It therefore seems reasonable to pre-
a rectangle were able to find the area of dict that the expertise-related differences
the quadrilateral in Figure 1 4.5 (b) and that in problem representation would affect the
of the irregularly shaped geometric figure methods that experts and novices attempt to
in Figure 1 4.5 (c) by similar conversion of use to solve novel problems. We review two
those figures into rectangles as shown by studies by Novick (1 988) on mathematical
the superimposed dashed lines. In contrast, problem solving by analogy that provide evi-
students who represented the parallelogram dence for such a link between representation
problem in terms of the appropriate formula and solution.
to apply were stumped by the problems In one experiment, Novick (1 988) rea-
presented in Figures 1 4.5 (b) and 1 4.5 (c), soned that arithmetic experts (i.e., people
because the formula is not applicable to who are highly skilled at arithmetic) would
those problems as presented (because the be more likely than novices (i.e., people
figures are not parallelograms). These results who are less skilled at arithmetic) to ap-
demonstrate that structural understanding ply a learned procedure to an analogous test
(exemplified by the convert-to-rectangle so- problem with a different cover story, be-
lution method) enables solvers to recognize cause only experts would construct similar
problem solving 339
subjects in each condition used the LCM

procedure to solve the marching band prob-
lem. Among the experts, in contrast, 5 6% of
subjects in the experimental condition used
the more efficient LCM procedure, com-
pared with only 6% of subjects in the con-
trol condition. Consistent with these results,
Dunbar (2001 ) reported that when scientists
attempted to resolve puzzles in their own
work, they generally retrieved analogies on
the basis of shared relational structure.
Novick’s (1 988) first experiment focused
on the beneficial consequences of experts’
structurally based representations. Another
experiment focused on potential negative
consequences of novices’ superficially based
representations. All subjects were initially
taught to solve three problems. One prob-
lem was the vegetable garden problem,
which is similar in structure but dissimilar
Figure 1 4.5. Finding the area of (a) a in story content to the marching band prob-
parallelogram, (b) a quadrilateral, and (c) an lem. A second problem concerned seating
irregularly shaped geometric figure. The solid people in rows and columns on an audi-
lines indicate the geometric figures whose areas torium stage. Despite its similarity in story
are desired. The dashed lines show how to content to the band problem, the audito-
convert the given figures into rectangles (i.e.,
rium problem required a different solution
they show solutions with understanding).
procedure (i.e., the problems were struc-
turally dissimilar). (Because the auditorium
representations for the two problems. The problem’s solution procedure is inappropri-
example problem concerned purchasing ate for the band problem, control subjects
plants for a vegetable garden. The test almost never try to use that procedure to
problem concerned arranging members of a solve the band problem.) The third problem
marching band into rows and columns. In was unrelated to the band problem. Thus,
the control condition, subjects attempted to when subjects received the band problem to
solve the band problem after having been solve, they could choose to use the LCM pro-
taught how to solve three unrelated prob- cedure from the analogous vegetable garden
lems. In the experimental condition, one of problem, the incorrect procedure from the
the unrelated problems was replaced by the superficially similar auditorium problem, or
vegetable garden problem. The learned solu- some other solution method. As predicted,
tion procedure for this problem was based on novices were more likely than experts to at-
finding the lowest common multiple (LCM) tempt to apply the incorrect procedure from
of three numbers and then examining mul- the auditorium problem to the band prob-
tiples of the LCM to find a number that lem, and they were more persistent in their
fit certain constraints. This solution proce- attempts to use this procedure. As many in-
dure is also appropriate for the band prob- termediates as novices tried to use the incor-
lem. Alternatively, the band problem can be rect procedure, but fewer tried to do so more
solved by examining multiples of the indi- than once. Thus, superficial features play a
vidual numbers given in the problem. The decreasing role in analogical problem solv-
data strongly supported Novick’s hypothe- ing as expertise increases (also see Dunbar,
sis of differential transfer for experts and 2001 ). Replicating the results of the initial
novices: Among the novice group, 6% of experiment, experts were more likely than
subjects in the other two groups to use the (80% vs. 49% correct, respectively). That
LCM procedure to solve the band problem. is, when solvers had good conceptual un-
derstanding of the numeric quantity they
needed to compute, they were better able to
learning about problem subgoals revisited devise a new method for finding that quan-
As we discussed earlier in connection with tity when the expected information was not
means-ends analysis and the Tower of Hanoi provided in the problem. This result is remi-
problem, solvers often generate subgoals niscent of Wertheimer’s (1 95 9) findings with
when they are unable to directly apply a the parallelogram problem and related area
desired operator. Subgoals also have been problems (Figure 1 4.5 ).
identified as components of task structure
that can be taught to learners (Catrambone,
1 998). For example, in a statistics class, the object-based inferences from story content
task of computing a statistic for testing a In the section on problem representation, we
hypothesis concerning central tendency can described Hayes and Simon’s (1 977) study in
be divided into three subgoals: calculate which differences in the texts of the transfer
the observed value, the hypothesized value, and change monsters and globes problems
and the appropriate standard error. Subgoals led to differences in the representations
in this sense decompose the problem into solvers constructed for those two problem
conceptually distinct and meaningful parts. isomorphs. We also described Duncker’s
Identifying the right subgoals thus implies (1 945 ) candles problem, in which the given
that one has a good understanding of the objects (boxes) evoked inferences pertain-
structure of the problem, that is, a good ing to their functional role (containers).
representation. In related work, Bassok and her colleagues
Catrambone (1 996, 1 998) investigated have found that the objects in the texts of
the consequences for problem solving of mathematical word problems affect (1 ) how
instructional manipulations that affect sub- people represent the described problem sit-
goal learning. He found that manipulat- uation (i.e., the situation model they con-
ing solvers’ opportunity to learn an impor- struct) and, accordingly, (2) which math-
tant subgoal influenced their ability to solve ematical solution, or mathematical model,
probability problems involving the Poisson they select or construct (for a review, see
distribution and to adapt the learned pro- Bassok, 2001 ).
cedure to solve slightly altered problems. In One set of studies varied the objects
one experiment, Catrambone (1 996) manip- in mathematically isomorphic word prob-
ulated subjects’ representations by varying lems involving constant change (Alibali,
whether the solution to the example prob- Bassok, Solomon, Syc, & Goldin-Meadow,
lem provided a label for the subgoal of find- 1 999; Bassok & Olseth, 1 995 ). The objects
ing the total number of objects of type X. were chosen to evoke situation models in-
Then he gave subjects several problems to volving either discrete or continuous change
solve, some of which were isomorphic to the (e.g., constant change in the number of
example problem and some of which pro- books per shelf on consecutive shelves of a
vided somewhat different information about bookcase or constant change in the amount
the objects relevant to the subgoal. He found of air pressed per minute into a hot air bal-
that all subjects were highly successful at loon, respectively). In Alibali et al.’s (1 999)
solving the isomorphic problems, which re- study, subjects had to describe the problems
quired the same solution method as the ex- to a confederate and solve the problems.
ample problem. However, for the test prob- Subjects’ internal representations of the
lems that required a different method for manner of change (i.e., their situation mod-
finding the total number of objects of type els) for each problem were coded from their
X, subjects who had learned the subgoal per- speech and, separately, from their gestures.
formed much better than those who had not The solution method a subject used for each
problem solving 341
problem was categorized as either the sum respect to the objects in the set of assignees
strategy or the average strategy, which are (m) and those in the assigned set (n). Most
compatible, respectively, with a representa- subjects who attempted to solve these novel
tion of change as a set of discrete events problems arrived at incorrect solutions that
or as a single event. The results indicated revealed systematic effects of semantic align-
that when subjects were judged to have con- ment. When the problems involved assign-
structed a situation model involving discrete ment of semantically asymmetric sets (e.g.,
change (based on both speech and gesture), m computers assigned to n secretaries), the
they were most likely to use the discrete sum solutions of most subjects placed the num-
strategy for solution. In contrast, when they bers representing the two sets in mathemati-
constructed a situation model involving con- cally asymmetric structural roles (e.g., m3 /n!
tinuous change, they were most likely to use or m/3 n); however, when the problems in-
the continuous sum strategy for solution. volved assignment of semantically symmet-
Another set of studies varied the seman- ric sets (e.g., m doctors from one hospital
tic symmetry between object pairs in mathe- assigned to n doctors from another hospi-
matical word problems and found that peo- tal), the solutions of most subjects placed the
ple’s solutions of these problems tended to numbers representing the two sets in math-
have a corresponding mathematical symme- ematically symmetric structural roles [e.g.,
try. Bassok, Chase, and Martin (1 998) pro- (m + n)/(mn)3 , 3 /(m + n)!]. That is, the
posed that the objects in a problem (e.g., incorrect solutions students generated to
tulips, vases) activate semantic and prag- the permutation problems were structurally
matic knowledge that evokes relational in- analogous to the semantic relation evoked
ferences (e.g., the “contain” relation), which by the paired sets.
people include in their representations of the Semantic alignments also determine the
described situation. These situation mod- relative difficulty of mathematically isomor-
els, in turn, guide the selection of struc- phic problems. Martin and Bassok (in press)
turally analogous mathematical solutions. In asked middle school, high school, and col-
the tulips and vases example, because the in- lege students to solve simple division word
ferred containment relation between the ob- problems, such as the following: “At a cer-
jects is asymmetric (tulips are in vases rather tain university, there are 3 ,45 0 students.
than vice versa), people select a mathemat- There are 6 times as many students as pro-
ically asymmetric solution (e.g., the divi- fessors. How many professors are there?”
sion operation, which is asymmetric because In this example, the semantic relation be-
a ÷ b = b ÷ a). In a complementary way, tween the described sets is asymmetric (pro-
objects from the same taxonomic category fessors teach students) and therefore seman-
(e.g., tulips, roses) evoke a symmetric se- tically aligned with the correct (asymmetric)
mantic relation (both tulips and roses are division operation. In other problems, the
flowers), and the semantically symmetric sit- semantic relation between the described
uation model leads people to select a mathe- sets was symmetric and therefore misaligned
matically symmetric solution (e.g., the addi- with the correct (asymmetric) division op-
tion operation, which is symmetric because eration. For example: “On a given day, a
a + b = b + a). Bassok et al. refer to this certain factory produces 3 ,45 0 nails. It pro-
two-stage process as semantic alignment. duces 6 times as many nails as screws. How
Semantic alignments affect how students many screws does it produce?” Students at
solve novel mathematical word problems. all grade levels were more successful at solv-
For example, Bassok, Wu, and Olseth (1 995 ) ing the aligned than the misaligned prob-
asked college students to solve unfamiliar lems, although the difference was most pro-
permutation problems that involved random nounced in middle school: 80% of seventh
assignment of three objects from one set to graders solved the students and professors
another set. They used two sets of mathe- problem, but only 40% solved the nails and
matically identical problems that varied with screws problem.
Guthormsen, Bassok, Osterhout, and (e.g., the amount of mental look-ahead re-
Inoue (2002) found evidence from electro- quired to find the solution). A full under-
physiological data that semantic alignments standing of insight problem solving, like non-
occur very early in the solution process, insight problem solving, requires attention
when solvers read mathematical problems. to both representation and process. The in-
People had to solve mathematically aligned terplay between these two factors is illus-
problems, such as 3 tulips + 5 daisies =, trated in the two subsections that follow in
and mathematically misaligned problems, which we consider (1 ) whether insight solu-
such as 3 tulips + 5 vases =. Their event- tions arise full blown and (2) what explains
related potentials (ERPs) revealed a sig- the initial impasse and its resolution.
nificantly larger response of a certain spe-
cific type (the N400 response, a negative
electrical response occurring approximately do insight solutions arise full blown?
400 ms after the event) to the misaligned We noted in our earlier discussion of
target word (vases) than to the aligned tar- Weisberg and Alba’s (1 981 ) research that so-
get word (daisies). This pattern is consistent lution of the nine-dot problem was neither
with other evidence that N400 is evoked by quick nor direct following the restructur-
detection of semantic anomalies. ing hint. For example, subjects who solved
the problem generally required 5 to 1 1 so-
Insight Problem Solving Revisited lution attempts after the hint to achieve
success. Multiple solution attempts were
overview needed because the required restructuring of
We introduced the notion of insight in one’s problem representation – realizing that
our discussion of perceptual factors affect- (1 ) the lines may extend outside the imagi-
ing solvers’ representations of the nine-dot nary square boundary formed by the dots,
problem. As we mentioned, the Gestalt and (2) they may intersect at points in
view (e.g., Duncker, 1 945 ; Maier, 1 93 1 ; see space that do not contain dots (Kershaw &
Ohlsson, 1 984a, for a review) is that insight Ohlsson, 2004) – suggests a new problem
problem solving is characterized by an ini- space, with alternative operators, through
tial work period during which no progress which the solver can search for the correct
toward solution is made (i.e., an impasse), a solution (Lung & Dominowski, 1 985 ; Ohls-
sudden restructuring of one’s problem repre- son, 1 984b; Weisberg & Alba, 1 981 ).
sentation to a more suitable form, followed For other problems, the required restruc-
immediately by the sudden appearance of turing “brings the goal state within the hori-
the solution. Thus, solving insight problems zon of mental look-ahead” (Ohlsson, 1 984b,
is all about representation with essentially p. 1 24), yielding insight in the traditional
no role for a step-by-step process of gen- sense of sudden understanding of the solu-
erating the solution. Although subsequent tion. For example, explain the following sit-
and contemporary researchers concur with uation (Durso, Rea, & Dayton, 1 994, p. 95 ):
the Gestalt view that getting the right rep- “A man walks into a bar and asks for a glass
resentation is crucial, this view does not proof water. The bartender points a shotgun at
vide a complete understanding of the na- the man. The man says ‘Thank you,’ and
ture of insight solutions because the solution walks out.” The solution to this problem typ-
does not necessarily arise suddenly or full- ically pops into mind suddenly and fully in-
blown following restructuring (e.g., Weis- tact, accompanied by an irresistible feeling
berg & Alba, 1 981 ). Kershaw and Ohlsson of “aha!” Moreover, the solver has no aware-
(2004) argued that insight problems are dif- ness of incremental progress toward the goal
ficult because the key behavior required for such as that which accompanies search so-
solution may be hindered by perceptual fac- lutions. (The solution to the barroom puz-
tors (this is the Gestalt perspective), back- zle is that the man had the hiccups. The
ground knowledge, and/or process factors bartender scared him with the gun, which
problem solving 343
cured him.) Anagrams are also known to forced subjects to make their yes/no judg-
yield such “pop-out” solutions (e.g., Mendel- ments based on any partial information that
sohn & O’Brien, 1 974), especially among had accrued prior to the deadline. On av-
highly skilled anagram solvers (Novick & erage, subjects’ responses were made within
Sherman, 2003 a). approximately 65 0 or 1 1 3 0 ms after the on-
For problems that yield pop-out set of the letter string. By testing highly
solutions – that is, for which solvers skilled and less skilled anagram solvers on
have the phenomenological experience of anagrams that were known to yield pop-out
insight – the question remains as to whether solutions (for experts) or not, Novick and
the solutions arise full blown or through Sherman were able to assess whether pop-
the gradual accumulation of relevant partial out solutions arise full blown or are preceded
information as for the nine-dot problem and by the gradual accumulation of partial infor-
noninsight problems (e.g., simplifying alge- mation (outside awareness). Consistent with
bra equations to solve for X ). Durso et al. Durso et al.’s (1 994) results, and contrary to
(1 994) investigated this issue using the the Gestalt view, they found that pop-out so-
barroom puzzle. In one experiment, they lutions arise gradually through the accumu-
collected similarity ratings for 1 2 pairs of lation of relevant partial information (also
concepts at several points during subjects’ see Bowden & Jung-Beeman, 2003 ).
solution attempts – before and after reading Despite this important similarity be-
the puzzle, every 1 0 minutes until the tween insight and noninsight solutions, phe-
puzzle was solved, and immediately after nomenologically, the two types of solutions
the solution. The concept pairs included are different. The solver is aware of the ac-
two insight pairs (surprise/remedy and cumulation of partial information for non-
relieved/thank you) that the results of an ini- insight solutions – for example, consider the
tial experiment showed were connected in Hobbits and Orcs problem or the problem
the conceptual networks of solvers but not of simplifying an algebra equation to solve
nonsolvers. The results suggested that the for X – but that accumulation occurs out-
key restructuring required for solution did side awareness for insight solutions (e.g.,
not arise full-blown contrary to the Gestalt the barroom puzzle, anagrams). Novick and
view of insight: The two insight pairs that Sherman (2003 a, 2003 b) hypothesized that
were critical for solution were seen as pop-out solutions to anagrams, which are
dissimilar at the first two time points, mod- characteristic of experts, may result from
erately similar at the next two time points, a parallel constraint satisfaction process; in
and highly similar after solution. In contrast, contrast, nonpop-out anagram solutions re-
the unrelated pairs (e.g., pretzel/shotgun) sult from a conscious process of serially test-
were seen as dissimilar and the related ing and rejecting hypotheses (e.g., Mendel-
pairs (e.g., shotgun/loaded) as similar at all sohn & O’Brien, 1 974).
time points.
Novick and Sherman (2003 a) noted, the impasse and its resolution
however, that having to repeatedly rate the As discussed by Knoblich, Ohlsson, Haider,
similarity of concepts that were critical for and Rhenius (1 999), theories of insight prob-
solution may have changed subjects’ solu- lem solving need to explain two phenom-
tion strategies. This possibility led them to ena concerning the interplay between rep-
provide an additional test of the hypothesis resentation and solution generation: (1 ) why
using anagrams. The accrual of partial infor- solvers initially reach an impasse in solving a
mation was tested using a solvability judg- problem for which they have the necessary
ment task in which subjects had to indi- knowledge to generate the solution, and (2)
cate whether letter strings (e.g., nrtai, botda) what enables them to break out of the im-
could be unscrambled to form an English passe. Two recent theories have attempted to
word (only the first of the two examples account for these phenomena – MacGregor,
is solvable – train). A deadline procedure Ormerod, and Chronicle’s (2001 ) progress
monitoring theory, and Knoblich et al.’s rep- example, changing II + to III − requires de-
resentational change theory. composition of the plus sign. (The solution
According to the progress monitoring to the this problem is to break apart the first
theory, solvers use hill climbing (see “Prob- V and change it to an X, yielding XI = VIII +
lem Solving as Search through a Problem III). Knoblich et al. found good support for
Space”) in their solution attempts for in- their theory using solution rate and solution
sight as well as noninsight problems. Solvers time as their dependent measures. Knoblich,
are hypothesized to monitor their progress Ohlsson, and Raney (2001 ) found additional
toward solution using a criterion generated support using eye fixation data.
from the problem’s current state. For the Jones (2003 ) attempted to distinguish the
nine-dot problem, for example, this criterion progress monitoring and representational
is the number of dots through which lines change theories using eye fixation data as
have been drawn relative to the number of subjects solved the car park problem. In this
dots remaining. If solvers reach criterion fail- problem, the goal is to maneuver a taxi out of
ure, they seek alternative solutions by trying a car park. Other cars need to be moved out
to relax one or more problem constraints. of the way, and there are constraints on how
The nine-dot problem is difficult, according cars may be moved. Jones’ results supported
to this theory, because criterion failure is not predictions from both theories, although the
reached until the fourth move (recall that effects of the experimental manipulations
the problem must be solved in four moves). suggested that the representational change
MacGregor et al. (2001 ) found support for theory is a better predictor of performance.
this theory using several variants of the nine- Based on his data, Jones argued that the two
dot problem (also see Ormerod, MacGregor, theories should be combined into a single
& Chronicle, 2002). theory. This makes sense because Knoblich
According to Knoblich et al.’s (1 999) rep- et al.’s (1 999) theory focuses more on the
resentational change theory, insight prob- representational aspect of problem solution,
lems are highly likely to evoke initial whereas MacGregor et al.’s (2001 ) theory
representations in which solvers place inap- focuses more on the step-by-step solution
propriate constraints on their solution at- process. Jones noted that the progress moni-
tempts. Impasses are resolved by revising toring theory provides an account of the so-
one’s representation of the problem. They lution process up to the point that the im-
tested this theory using Roman numeral passe is reached and representational change
matchstick arithmetic problems in which is sought. The representational change the-
solvers must move one stick to a new lo- ory picks up at this point and explains how
cation to change a false numeric statement insight may be achieved.
(e.g., VI = VIII + III) into a statement that
is true. According to Knoblich et al.’s the-
ory, rerepresentation may happen through
either of two mechanisms – constraint re- Conclusions and Directions
laxation or chunk decomposition. Constraint for Future Research
relaxation involves deactivating some knowl-
edge element that has constrained the op- In this chapter, we examined two broad
erators being considered, thereby allowing components of the problem-solving
application of new operators: For example, process – representation and solution gener-
changing II + to III − requires relaxation of ation. Although it is possible to focus one’s
the value constraint (numeric values do not research on one or the other of these com-
change except by applying an operation that ponents, a full understanding of problem
produces a compensating change in some solving requires an integration of the two,
other value). Chunk decomposition involves for the representation one constructs for a
breaking the bonds that link components problem determines (or at least constrains)
of a meaningful unit in the problem: For how one goes about trying to generate a
problem solving 345
solution. This interplay is obvious for math- world. Effective problem solving and reason-
ematical problem solving, as we discussed ing, as well as creative invention, all require
in the previous section. Consideration of appropriate models as their starting point.
both representation and solution generation Second, the trend toward examining
also seems to be behind the resurgence of more complex, knowledge-intensive prob-
interest in insight problem solving. This lems should continue. Although the avail-
new strategy for investigating insight seems able evidence suggests that many of the
to be yielding progress in understanding this conclusions about problem solving drawn
fascinating phenomenon that is at the core from research on well-defined problems are
of human creative endeavors. We believe applicable to ill-defined problems, messy,
the interplay between representation and knowledge-intensive, real-world problems
solution generation will lead to significant may not be simply scaled-up versions of lab-
progress in understanding the full range of oratory tasks or of tasks practiced in school.
activities considered to be problem solving. The critical problems of the day, at any given
Elevating this interplay to the status of a point in history, are always ill defined in
core assumption, we want to suggest three some way. Investigation of such problems
directions for future research. (e.g., in science, medicine, and technology) is
First, we would stress the importance of likely to yield both theoretical and practical
conducting educationally relevant research. payoffs.
Students spend a considerable amount of Finally, we come full circle and end where
time both solving problems and learning we began. The last direction is suggested by
how to solve problems. Society expects the definition of a problem given by Karl
that the problem-solving lessons learned in Duncker, arguably the father of research on
school – from how to solve math problems problem solving. He defined a problem as
to how to design and execute a science fair a situation in which a desired goal cannot
project to how to analyze literature – will be attained by direct application of known
transfer to students’ adult lives for the bet- operators, and so “there has to be recourse
terment of the world. We believe that a two- to thinking” (Duncker, 1 945 , p. 1 ). Our
pronged effort is needed here: (1 ) It is im- review of problem-solving research in this
portant to gain a better understanding of chapter has been rather narrow – focusing
students’ contributions to problem solving. on puzzles (e.g., Hobbits and Orcs, Tower
What are their goals, beliefs, strategies, and of Hanoi, anagrams, the nine-dot problem)
conceptions? How do they construct mean- and on mathematical problems. However,
ing and infer structure? (2) At the same Duncker’s reference to thinking is quite
time, there is an objective reality to prob- broad. By Duncker’s definition, humans en-
lems, messy though they may sometimes be. gage in problem solving when they pur-
The nature of a problem’s underlying struc- sue the following goal-directed activities:
ture places constraints on the types of rep- (1 ) placing objects into categories and mak-
resentations that will be useful or appropri- ing inferences based on category member-
ate, which in turn determine the types of ship, (2) making inductive inferences from
solution methods that will be effective and multiple instances, (3 ) reasoning by analogy,
efficient. It is important, therefore, to un- (4) identifying the causes of events, (5 ) de-
derstand the factors that facilitate or hinder ducing logical implications of given infor-
a student’s ability to represent a problem’s mation, (6) making legal judgments, and
structure as well as to investigate methods (7) diagnosing medical conditions from his-
for helping students to succeed in this en- torical and laboratory data. Much of the ma-
deavor. The National Council of Teachers of terial included in the chapters on these top-
Mathematics (2000) similarly promotes the ics in the present volume arguably could
importance of teaching students how to cre- have appeared in our chapter on problem
ate and use a variety of different types of solving. Rather than engaging in a turf battle,
representations to model phenomena in the we would suggest that research on problem
solving be integrated with research in these Bassok, M., Wu, L., & Olseth, L. K. (1 995 ). Judg-
other areas of thinking, or that research in ing a book by its cover: Interpretative effects of
these other areas be informed by insights content on problem solving transfer. Memory
gained from research on what has more tra- and Cognition, 2 3 , 3 5 4–3 67.
ditionally been identified as problem solving. Bowden, E. M., & Jung-Beeman, M. (2003 ). Aha!
Insight experience correlates with solution ac-
tivation in the right hemisphere. Psychonomic
Bulletin and Review, 1 0, 73 0–73 7.
Bransford, J. D., & McCarrell, N. S. (1 974). A
References
sketch of a cognitive approach to compre-
hension: Some thoughts about understanding
Adams, J. L. (1 979). Conceptual blockbusting: what it means to comprehend. In W. B. Weimer
A guide to better ideas (2nd ed.). New York: & D. S. Palermo (Eds.), Cognition and the sym-
Norton. bolic processes (pp. 1 89–229). Hillsdale, NJ:
Adelson, B. (1 981 ). Problem solving and the de- Erlbaum.
velopment of abstract categories in program- Catrambone, R. (1 996). Generalizing solution
ming languages. Memory and Cognition, 9, 422– procedures learned from examples. Journal
43 3 . of Experimental Psychology: Learning, Memory,
Alibali, M. W., Bassok, M., Solomon, K. O., Syc, and Cognition, 2 2 , 1 020–1 03 1 .
S. E., & Goldin-Meadow, S. (1 999). Illuminat- Catrambone, R. (1 998). The subgoal learning
ing mental representations through speech and model: Creating better examples so that stu-
gesture. Psychological Science, 1 0, 3 27–3 3 3 . dents can solve novel problems. Journal of
Allard, F., & Starkes, J. L. (1 991 ). Motor-skill ex- Experimental Psychology: General, 1 2 7, 3 5 5 –
perts in sports, dance, and other domains. In 3 76.
K. A. Ericsson & J. Smith (Eds.), Toward a Chase, W. G., & Simon, H. A. (1 973 ). Perception
general theory of expertise: Prospects and limits in chess. Cognitive Psychology, 4, 5 5 –81 .
(pp. 1 26–1 5 2). New York: Cambridge Univer- Chi, M. T. H., Bassok, M., Lewis, M. W.,
sity Press. Reimann, P., & Glaser, R. (1 989). Self-
Anderson, R. C., Reynolds, R. E., Schallert, D. L., explanations: How students study and use ex-
& Goetz, E. T. (1 977). Frameworks for com- amples in learning to solve problems. Cognitive
prehending discourse. American Educational Science, 1 3 , 1 45 –1 82.
Research Journal, 1 4, 3 67–3 81 . Chi, M. T. H., Feltovich, P. J., & Glaser, R. (1 981 ).
Anzai, Y., & Simon, H. A. (1 979). The theory Categorization and representation of physics
of learning by doing. Psychological Review, 86, problems by experts and novices. Cognitive
1 24–1 40. Science, 5 , 1 21 –1 5 2.
Bassok, M. (2001 ). Semantic alignments in math- Chronicle, E. P., MacGregor, J. N., & Ormerod,
ematical word problems. In D. Gentner, K. J. T. C. (2004). What makes an insight problem?
Holyoak, & B. N. Kokinov (Eds.), The ana- The roles of heuristics, goal conception, and
logical mind: Perspectives from cognitive science solution recoding in knowledge-lean problems.
(pp. 401 –43 3 ). Cambridge, MA: MIT Press. Journal of Experimental Psychology: Learning,
Bassok, M., Chase, V. M., & Martin, S. A. (1 998). Memory, and Cognition, 3 0, 1 4–27.
Adding apples and oranges: Alignment of Coughlin, L. D., & Patel, V. L. (1 987). Processing
semantic and formal knowledge. Cognitive of critical information by physicians and med-
Psychology, 3 5 , 99–1 3 4. ical students. Journal of Medical Education, 62 ,
Bassok, M., & Holyoak, K. J. (1 989). Interdomain 81 8–828.
transfer between isomorphic topics in algebra Deakin, J. M., & Allard, F. (1 991 ). Skilled memory
and physics. Journal of Experimental Psychology: in expert figure skaters. Memory and Cognition,
Learning, Memory, and Cognition, 1 5 , 1 5 3 –1 66. 1 9, 79–86.
Bassok, M., & Olseth, K. L. (1 995 ). Object-based de Groot, A. D. (1 966). Perception and memory
representations: Transfer between cases of con- versus thought: Some old ideas and recent find-
tinuous and discrete models of change. Journal ings. In B. Kleinmuntz (Ed.), Problem solving:
of Experimental Psychology: Learning, Memory, Research, method, and theory (pp. 1 9–5 0). New
and Cognition, 2 1 , 1 5 22–1 5 3 8. York: Wiley.
problem solving 347
Dunbar, K. (2001 ). The analogical paradox: Why relations. Paper presented at the 43 rd annual
analogy is so easy in naturalistic settings, yet meeting of the Psychonomic Society, Kansas
so difficult in the psychological laboratory. City, MO.
In D. Gentner, K. J. Holyoak, & B. Koki- Hardiman, P. T., Dufresne, R., & Mestre, J. P.
nov (Eds.), Analogy: Perspectives from cognitive (1 989). The relation between problem cate-
science (pp. 3 1 3 –3 62). Cambridge, MA: MIT gorization and problem solving among experts
Press. and novices. Memory and Cognition, 1 7, 627–
Duncker, K. (1 945 ). On problem-solving (L. S. 63 8.
Lees, Trans.). Psychological Monographs, 5 8 Hayes, J. R., & Simon, H. A. (1 977). Psychological
(Whole No. 270). (Original work published differences among problem isomorphs. In N. J.
1 93 5 .) Castellan, D. B. Pisoni, & G. R. Potts (Eds.),
Durso, F. T., Rea, C. B., & Dayton, T. (1 994). Cognitive theory (Vol. 2, pp. 21 –44). Hillsdale,
Graph-theoretic confirmation of restructuring NJ: Erlbaum.
during insight. Psychological Science, 5 , 94–98. Hegarty, M., Carpenter, P. A., & Just, M. A.
Egan, D. E., & Schwartz, B. J. (1 979). Chunking (1 991 ). Diagrams in the comprehension of
in the recall of symbolic drawings. Memory and scientific texts. In R. Barr, M. L. Kamil, P.
Cognition, 7, 1 49–1 5 8. Mosenthal, & P. D. Pearson (Eds.), Handbook
Ericsson, K. A., & Simon, H. A. (1 980). Verbal of reading research (Vol. 2, pp. 641 –668). New
reports as data. Psychological Review, 87, 21 5 – York: Longman.
25 1 . Holding, D. H. (1 985 ). The psychology of chess
Ernst, G. W., & Newell, A. (1 969). GPS: A case skill. Hillsdale, NJ: Erlbaum.
study in generality and problem solving. New Holyoak, K. J. (1 985 ). The pragmatics of analog-
York: Academic Press. ical transfer. In G. H. Bower (Ed.), The psychol-
Gick, M. L., & Holyoak, K. J. (1 980). Analogical ogy of learning and motivation (Vol. 1 9, pp. 5 9–
problem solving. Cognitive Psychology, 1 2 , 3 06– 87). New York: Academic Press.
355. Hurley, S. M., & Novick, L. R. (2004). Context
Gick, M. L., & Holyoak, K. J. (1 983 ). Schema and structure: The nature of students’ knowl-
induction and analogical transfer. Cognitive edge about three spatial diagram representations.
Psychology, 1 5 , 1 –3 8. Manuscript under review.
Gobet, F., & Simon, H. (1 996). Recall of rapidly Jones, G. (2003 ). Testing two cognitive theories
presented random chess positions is a function of insight. Journal of Experimental Psychology:
of skill. Psychonomic Bulletin and Review, 3 , Learning, Memory, and Cognition, 2 9, 1 01 7–
1 5 9–1 63 . 1 027.
Greeno, J. G. (1 974). Hobbits and orcs: Acquisi- Kershaw, T. C., & Ohlsson, S. (2004). Multi-
tion of a sequential concept. Cognitive Psychol- ple causes of difficulty in insight: The case of
ogy, 6, 270–292. the nine-dot problem. Journal of Experimental
Greeno, J. G. (1 977). Process of understanding in Psychology: Learning, Memory, and Cognition,
problem solving. In N. J. Castellan, Jr., D. B. 3 0, 3 –1 3 .
Pisoni, & G. R. Potts (Eds.), Cognitive theory Kindfield, A. C. H. (1 993 /1 994). Biology
(Vol. 2, pp. 43 –83 ). Hillsdale, NJ: Erlbaum. diagrams: Tools to think with. Journal of the
Greeno, J. G. (1 978). Natures of problem-solving Learning Sciences, 3 , 1 –3 6.
abilities. In W. K. Estes (Ed.), Handbook of Knoblich, G., Ohlsson, S., Haider, H., & Rhenius,
learning and cognitive processes (Vol. 5 , pp. 23 9– D. (1 999). Constraint relaxation and chunk de-
270). Hillsdale, NJ: Erlbaum. composition in insight problem solving. Journal
Greeno, J. G., & Simon, H. A. (1 988). Prob- of Experimental Psychology: Learning, Memory,
lem solving and reasoning. In R. C. Atkinson, and Cognition, 2 5 , 1 5 3 4–1 5 5 5 .
R. J. Herrnstein, G. Lindzey, & R. D. Luce Knoblich, G., Ohlsson, S., & Raney, G. E. (2001 ).
(Eds.), Stevens’ handbook of experimental psy- An eye movement study of insight problem
chology (2nd ed., Vol. 2, pp. 5 89–672). New solving. Memory and Cognition, 2 9, 1 000–1 009.
York: Wiley. Kotovsky, K., Hayes, J. R., & Simon, H. A. (1 985 ).
Guthormsen, A., Bassok, M., Osterhout, L., & Why are some problems hard? Evidence from
Inoue, K. (2002, November). ERP as a measure Tower of Hanoi. Cognitive Psychology, 1 7, 248–
of alignment between mathematical and semantic 294.
Lung, C.-T., & Dominowski, R. L. (1 985 ). Effects Novick, L. R., & Hmelo, C. E. (1 994). Trans-
of strategy instructions and practice on nine- ferring symbolic representations across non-
dot problem solving. Journal of Experimental isomorphic problems. Journal of Experimental
Psychology: Learning, Memory, and Cognition, Psychology: Learning, Memory, and Cognition,
1 1 , 804–81 1 . 2 0, 1 296–1 3 21 .
MacGregor, J. N., Ormerod, T. C., & Chroni- Novick, L. R., & Holyoak, K. J. (1 991 ). Math-
cle, E. P. (2001 ). Information processing and ematical problem solving by analogy. Journal
insight: A process model of performance on the of Experimental Psychology: Learning, Memory,
nine-dot and related problems. Journal of Ex- and Cognition, 1 7, 3 98–41 5 .
perimental Psychology: Learning, Memory, and Novick, L. R., & Hurley, S. M. (2001 ). To ma-
Cognition, 2 7, 1 76–201 . trix, network, or hierarchy: That is the ques-
Maier, N. (1 93 0). Reasoning in humans. I. On tion. Cognitive Psychology, 42 , 1 5 8–21 6.
direction. Journal of Comparative Psychology, Novick, L. R., Hurley, S. M., & Francis, M. (1 999).
1 0, 1 5 –43 . Evidence for abstract, schematic knowledge of
Maier, N. (1 93 1 ). Reasoning in humans. II. The three spatial diagram representations. Memory
solution of a problem and its appearance in and Cognition, 2 7, 288–3 08.
consciousness. Journal of Comparative Psychol- Novick, L. R., & Sherman, S. J. (2003 a). On the
ogy, 1 2 , 1 81 –1 94. nature of insight solutions: Evidence from skill
Markman, A. B. (1 999). Knowledge representation. differences in anagram solution. The Quarterly
Mahwah, NJ: Erlbaum. Journal of Experimental Psychology, 5 6A, 3 5 1 –
Martin, S. A., & Bassok, M. (in press). Effects of 3 82.
semantic cues on mathematical modeling: Ev- Novick, L. R., & Sherman, S. J. (2003 b). The ef-
idence from word-problem solving and equa- fects of superficial and structural information on
tion construction tasks. Memory and Cognition. on-line problem solving as a function of expertise.
McKeithen, K. B., Reitman, J. S., Rueter, H. H., Manuscript under revision.
& Hirtle, S. C. (1 981 ). Knowledge organization Novick, L. R., & Sherman, S. J. (2004). Type-
and skill differences in computer programmers. based bigram frequencies for five-letter words.
Cognitive Psychology, 1 3 , 3 07–3 25 . Behavior Research Methods, Instruments, and
Mendelsohn, G. A., & O’Brien, A. T. (1 974). The Computers.
solution of anagrams: A reexamination of the Ohlsson, S. (1 984a). Restructuring revisited I.
effects of transition letter probabilities, letter Summary and critique of the Gestalt theory
moves, and word frequency on anagram diffi- of problem solving. Scandinavian Journal of
culty. Memory and Cognition, 2 , 5 66–5 74. Psychology, 2 5 , 65 –78.
Myles-Worsley, M., Johnston, W. A., & Simons, Ohlsson, S. (1 984b). Restructuring revisited II.
M. A (1 988). The influence of expertise on An information processing theory of restruc-
X-ray image processing. Journal of Experimental turing and insight. Scandinavian Journal of Psy-
Psychology: Learning, Memory, and Cognition, chology, 2 5 , 1 1 7–1 29.
1 4, 5 5 3 –5 5 7. Ormerod, T. C., MacGregor, J. N., & Chronicle,
National Council of Teachers of Mathematics. E. P. (2002). Dynamics and constraints in in-
(2000). Principles and standards for school math- sight problem solving. Journal of Experimental
ematics. Reston, VA: Author. Psychology: Learning, Memory, and Cognition,
2 8, 791 –799.
Newell, A., & Simon, H. A. (1 972). Human prob-
lem solving. Englewood Cliffs, NJ: Prentice- Polya, G. (1 95 7). How to solve it (2nd ed.). Prince-
Hall. ton, NJ: Princeton University Press.
Novick, L. R. (1 988). Analogical transfer, prob- Posner, M. I. (1 973 ). Cognition: An introduction.
lem similarity, and expertise. Journal of Ex- Glenview, IL: Scott, Foresman and Company.
perimental Psychology: Learning, Memory, and Reitman, W. R. (1 965 ). Cognition and thought.
Cognition, 1 4, 5 1 0–5 20. New York: Wiley.
Novick, L. R. (2001 ). Spatial diagrams: Key in- Ronning, R. R. (1 965 ). Anagram solution times:
struments in the toolbox for thought. In D. L. A function of the “ruleout” factor. Journal of
Medin (Ed.), The psychology of learning and mo- Experimental Psychology, 69, 3 5 –3 9.
tivation ( Vol. 40, pp. 279–3 25 ). San Diego: Ross, B. H., & Kennedy, P. T. (1 990). Gener-
Academic Press. alizing from the use of earlier examples in
problem solving 349
problem solving. Journal of Experimental Journal for Research in Mathematics Education,

Psychology: Learning, Memory, and Cognition, 1 2 , 5 4–64.
1 6, 42–5 5 . Simon, H. A. (1 978). Information-processing
Russo, J. E., Johnson, E. J., & Stephens, D. L. theory of human problem solving. In W. K.
(1 989). The validity of verbal protocols. Estes (Ed.), Handbook of learning and cognitive
Memory and Cognition, 1 7, 75 9–769. processes (Vol. 5 , pp. 271 –295 ). Hillsdale, NJ:
Schiano, D., Cooper, L. A., Glaser, R., & Zhang, Erlbaum.
H. C. (1 989). Highs are to lows as experts are Stigler, J. W. (1 984). “Mental abacus”: The effect
to novices: Individual differences in the repre- of abacus training on Chinese children’s mental
sentation and solution of standardized figural calculation. Cognitive Psychology, 1 6, 1 45 –1 76.
analogies. Human Performance, 2 , 225 –248. Thomas, J. C., Jr. (1 974). An analysis of behavior
Schoenfeld, A. H., & Herrmann, D. J. (1 982). in the hobbits-orcs problem. Cognitive Psychol-
Problem perception and knowledge structure ogy, 6, 25 7–269.
in expert and novice mathematical problem Weisberg, R. W., & Alba, J. W. (1 981 ). An exam-
solvers. Journal of Experimental Psychology: ination of the alleged role of “fixation” in the
Learning, Memory, and Cognition, 8, 484– solution of several “insight” problems. Journal
494. of Experimental Psychology: General, 1 1 0, 1 69–
Schwartz, D. L., & Black, J. B. (1 996). Shuttling 1 92.
between depictive models and abstract rules: Weiser, M., & Shertz, J. (1 983 ). Programming
Induction and fallback. Cognitive Science, 2 0, problem representation in novice and expert
45 7–497. programmers. International Journal of Man-
Silver, E. A. (1 979). Student perceptions of relat- Machine Studies, 1 9, 3 91 –3 98.
edness among mathematical verbal problems. Wertheimer, M. (1 95 9). Productive thinking
Journal for Research in Mathematics Education, (enlarged ed.). Chicago: The University of
1 0, 1 95 –21 0. Chicago Press.
Silver, E. A. (1 981 ). Recall of mathematical prob- Wickelgren, W. A. (1 974). How to solve problems.
lem information: Solving related problems. San Francisco: Freeman Press.
CHAPTER 1 5
Creativity
Robert J. Sternberg
Todd I. Lubart
James C. Kaufman
Jean E. Pretz
Creativity is the ability to produce work that Creativity may be viewed as taking place
is novel (i.e., original, unexpected), high in in the interaction between a person and the
quality, and appropriate (i.e., useful, meets person’s environment (Amabile, 1 996; Csik-
task constraints) (Lubart, 1 994; Ochse, szentmihalyi, 1 996, 1 999; Feldman, 1 999;
1 990; Sternberg, 1 988a, 1 999c; Sternberg & Feldman, Csikszentmihalyi, & Gardner,
Lubart, 1 995 , 1 996). Creativity is a topic of 1 994; Sternberg, 1 985 a; Sternberg & Lubart,
wide scope that is important at both the in- 1 995 ). According to this view, the essence
dividual and societal levels for a wide range of creativity cannot be captured just as an
of task domains. At an individual level, cre- intrapersonal variable. Thus, we can charac-
ativity is relevant, for example, when solv- terize a person’s cognitive processes as more
ing problems on the job and in daily life. or less creative (Finke, Ward, & Smith, 1 992;
At a societal level, creativity can lead to Rubenson & Runco, 1 992; Weisberg, 1 986),
new scientific findings, new movements in or the person as having a more or less cre-
art, new inventions, and new social pro- ative personality (Barron, 1 988; Feist, 1 999).
grams. The economic importance of creativ- We further can describe the person as having
ity is clear because new products or ser- a motivational pattern that is more or less
vices create jobs. Furthermore, individuals, typical of creative individuals (Hennessey
organizations, and societies must adapt ex- & Amabile, 1 988), or even as having back-
isting resources to changing task demands to ground variables that more or less dispose
remain competitive. that person to think creatively (Simonton,
This chapter attempts to provide readers 1 984, 1 994). However, we cannot fully judge
with a basic understanding of the literature that person’s creativity independent of the
on creativity. It first reviews alternative ap- field and the temporal context in which the
proaches to understanding creativity. Then person works.
it reviews alternative approaches to under- For example, a contemporary artist might
standing kinds of creative work. Finally, it have thought processes, personality, motiva-
draws some conclusions. tion, and even background variables similar
351
to those of Monet, but that artist, painting seen as an empty vessel that a divine be-
today in the style of Monet or of Impression- ing would fill with inspiration. The individ-
ism in general, probably would not be judged ual would then pour out the inspired ideas,
to be creative in the way Monet was. Artists, forming an otherworldly product.
including Monet, have experimented with In this vein, Plato argued that a poet is
Impressionism, and unless the contemporary able to create only that which the Muse dic-
artist introduced some new twist, he or she tates, and even today, people sometimes re-
might be viewed as imitative rather than cre- fer to their own Muse as a source of in-
ative. spiration. In Plato’s view, one person might
The importance of context is illustrated be inspired to create choral songs, another,
by the difference, in general, between cre- epic poems (Rothenberg & Hausman, 1 976).
ative discovery and rediscovery. For ex- Often, mystical sources have been suggested
ample, BACON and related programs of in creators’ introspective reports (Ghiselin,
Langley, Simon, Bradshaw, and Zytkow 1 985 ). For example, Rudyard Kipling re-
(1 987) rediscover important scientific the- ferred to the “Daemon” that lives in the
orems that were judged to be creative dis- writer’s pen: “My Daemon was with me in
coveries in their time. The processes by the Jungle Books, Kim, and both Puck books,
which these discoveries are made via com- and good care I took to walk delicately,
puter simulation are presumably not iden- lest he should withdraw. . . . When your
tical to those by which the original discov- Daemon is in charge, do not think cons-
erers made their discoveries. One difference ciously. Drift, wait, and obey” (Kipling,
derives from the fact that contemporary pro- 1 985 , p. 1 62).
grammers can provide, in their programming The mystical approaches to the study of
of information into computer simulations, creativity have probably made it harder for
representations and particular organizations scientists to be heard. Many people seem
of data that may not have been available to believe, as they believe for love (see
to the original creators. However, putting Sternberg, 1 988b, 1 988c), that creativity is
aside the question of whether the processes something that just does not lend itself to
are the same, a rediscovery might be judged scientific study because it is a more spiritual
to be creative with respect to the rediscov- process. We believe it has been hard for sci-
erer but would not be judged to be creative entific work to shake the deep-seated view of
with respect to the field at the time the re- some that, somehow, scientists are treading
discovery is made. Ramanujan, the famous where they should not.
Indian mathematician, made many such re-
discoveries. A brilliant thinker, he did not
Pragmatic Approaches
have access in his early life to much of the
recent literature on mathematics and so un- Equally damaging for the scientific study of
wittingly regenerated many discoveries that creativity, in our view, has been the takeover
others had made before him. of the field, in the popular mind, by those
who follow what might be referred to as a
pragmatic approach. Those taking this ap-
proach have been concerned primarily with
Alternative Approaches to Creativity
developing creativity, secondarily with un-
derstanding it, but almost not at all with test-
Mystical Approaches to the Study
ing the validity of their ideas about it.
of Creativity
Perhaps the foremost proponent of this
The study of creativity has always been approach is Edward De Bono, whose work
tinged – some might say tainted – with asso- on lateral thinking – seeing things broadly and
ciations to mystical beliefs. Perhaps the ear- from varied viewpoints – as well as other as-
liest accounts of creativity were based on di- pects of creativity has had what appears to
vine intervention. The creative person was be considerable commercial success (e.g., De
creativity 353
Bono, 1 971 , 1 985 , 1 992). DeBono’s concern to see it as less than a serious endeavor for
is not with theory, but with practice. Thus, psychological study.
for example, he suggests using a tool such as
“Positive-Minus-Interesting” (PMI) to focus
The Psychodynamic Approach
on the aspects of an idea that are pluses, mi-
nuses, and interesting. Or he suggests using The psychodynamic approach can be con-
the word “po,” derived from hypothesis, sup- sidered the first of the major twentieth-
pose, possible, and poetry, to provoke rather century theoretical approaches to the study
than to judge ideas. Another tool, that of of creativity. On the basis of the idea that
“thinking hats,” has individuals metaphori- creativity arises from the tension between
cally wear different hats, such as a white hat conscious reality and unconscious drives,
for data-based thinking, a red hat for intu- Freud (1 908/1 95 9) proposed that writers
itive thinking, a black hat for critical think- and artists produce creative work as a way to
ing, and a green hat for generative thinking, express their unconscious desires in a pub-
in order to stimulate seeing things from dif- licly acceptable fashion. These unconscious
ferent points of view. desires may concern power, riches, fame,
DeBono is not alone in this enterprise. honor, or love (Vernon, 1 970). Case stud-
Osborn (1 95 3 ), based on his experiences ies of eminent creators, such as Leonardo da
in advertising agencies, developed the tech- Vinci (Freud, 1 91 0/1 964), were used to sup-
nique of brainstorming to encourage people port these ideas.
to solve problems creatively by seeking Later, the psychoanalytic approach in-
many possible solutions in an atmosphere troduced the concepts of adaptive regres-
that is constructive rather than critical sion and elaboration for creativity (Kris,
and inhibitory. Gordon (1 961 ) developed a 1 95 2). Adaptive regression, the primary pro-
method called synectics, which involves pri- cess, refers to the intrusion of unmodulated
marily seeing analogies, also for stimulating thoughts in consciousness. Unmodulated
creative thinking. thoughts can occur during active problem
More recently, authors such as Adams solving but often occur during sleep, in-
(1 974, 1 986) and von Oech (1 983 ) sug- toxication from drugs, fantasies or day-
gested that people often construct a series of dreams, or psychoses. Elaboration, the sec-
false beliefs that interfere with creative func- ondary process, refers to the reworking and
tioning. For example, some people believe transformation of primary process mate-
that there is only one “right” answer and that rial through reality-oriented, ego-controlled
ambiguity must be avoided whenever possi- thinking. Other theorists (e.g., Kubie, 1 95 8)
ble. People can become creative by identify- emphasized that the preconscious, which
ing and removing these mental blocks. Von falls between conscious reality and the en-
Oech (1 986) also suggested that to be cre- crypted unconscious, is the true source
ative we need to adopt the roles of explorer, of creativity because thoughts are loose
artist, judge, and warrior in order to foster and vague but interpretable. In contrast
our creative productivity. to Freud, Kubie claimed that unconscious
These approaches have had considerable conflicts actually have a negative effect
public visibility, and they may well be use- on creativity because they lead to fixated,
ful. From our point of view as psycholo- repetitive thoughts. More recent work has
gists, however, most of these approaches lack recognized the importance of both pri-
any basis in serious psychological theory as mary and secondary processes (Noy, 1 969;
well as serious empirical attempts to vali- Rothenberg, 1 979; Suler, 1 980; Werner &
date them. Of course, techniques can work Kaplan, 1 963 ).
in the absence of psychological theory or Although the psychodynamic approach
validation. However, the effect of such ap- may have offered some insights into creativ-
proaches is often to leave people associating ity, psychodynamic theory was not at the
a phenomenon with commercialization and center of the emerging scientific psychology.
The early twentieth-century schools of other problem-solving skills. The tests can
psychology, such as structuralism, function- be scored for fluency (total number of rel-
alism, and behaviorism, were devoting prac- evant responses), flexibility (number of dif-
tically no resources at all to the study of cre- ferent categories of relevant responses), orig-
ativity. The Gestaltists studied a portion of inality (the statistical rarity of the responses),
creativity – insight – but their study never and elaboration (amount of detail in the
went much beyond labeling, as opposed to responses). Some of the subtests from the
characterizing the nature of insight. Torrance battery include
Further isolating creativity research, the
psychodynamic approach and other early 1 . Asking questions: The examinee writes
work on creativity relied on case studies out all of the questions he or she can think
of eminent creators. This methodology has of based on a drawing of a scene.
been criticized historically because of the 2. Product improvement: The examinee
difficulty of measuring proposed theoretical lists ways to change a toy monkey so chil-
constructs (e.g., primary process thought), dren will have more fun playing with it.
and the amount of selection and interpreta- 3 . Unusual uses: The examinee lists interest-
tion that can occur in a case study (Weisberg, ing and unusual uses of a cardboard box.
1 993 ). Although there is nothing a priori 4. Circles: The examinee expands empty
wrong with case study methods, the emerg- circles into different drawings and titles
ing scientific psychology valued controlled, them.
experimental methods. Thus, both theoret-
ical and methodological issues served to A number of investigators have studied
isolate the study of creativity from main- the relationship between creativity and in-
stream psychology. telligence – at least as measured by IQ.
Three basic findings concerning creativity
and conventional conceptions of intelligence
Psychometric Approaches
are generally agreed upon (see, e.g., Barron
When we think of creativity, eminent artists & Harrington, 1 981 ; Lubart, 1 994). First,
or scientists such as Michelangelo or Einstein creative people tend to show above-average
immediately come to mind. However, these IQs – often above 1 20 (see Renzulli, 1 986).
highly creative people are quite rare and dif- This figure is not a cutoff but rather an ex-
ficult to study in the psychological labora- pression of the fact that people with low or
tory. In his American Psychological Asso- even average IQs do not seem to be well
ciation address, Guilford (1 95 0) noted that represented among the ranks of highly cre-
these problems had limited research on creative individuals. Cox’s (1 926) geniuses had
ativity. He proposed that creativity could an estimated average IQ of 1 65 . Barron es-
be studied in everyday subjects using paper- timated the mean IQ of his creative writers
and-pencil tasks. One of these was the Un- to be 1 40 or higher based on their scores on
usual Uses Test, in which an examinee thinks the Terman Concept Mastery Test (Barron,
of as many uses for a common object (e.g., a 1 963 , p. 242). It should be noted that the
brick) as possible. Many researchers adopted Concept Mastery Test is exclusively verbal
Guilford’s suggestion, and “divergent think- and thus provides a somewhat skewed esti-
ing” tasks quickly became the main instru- mate of IQ. The other groups in the Institute
ments for measuring creative thinking. The for Personality Assessment (IPAR) studies,
tests were a convenient way of comparing that is, mathematicians and research scien-
people on a standard “creativity” scale. tists, were also above average in intelligence.
Building on Guilford’s work, Torrance Anne Roe (1 95 2, 1 972), who did similarly
(1 974) developed the Torrance Tests of thorough assessments of eminent scientists
Creative Thinking. These tests consist of before the IPAR group was set up, estimated
several relatively simple verbal and figural IQs for her participants ranged between 1 21
tasks that involve divergent thinking plus and 1 94, with medians between 1 3 7 and 1 66,
creativity 355
depending on whether the IQ test was ver- 40-minute test of creative ability called the
bal, spatial, or mathematical. Remote Associates Test (RAT). The test is
Second, an IQ above 1 20, does not seem based on his theory that the creative think-
to matter as much to creativity as it does ing process is the “forming of associative el-
when an IQ is below 1 20. In other words, cre- ements into new combinations which either
ativity may be more highly correlated with meet specified requirements or are in some
IQ below an IQ of 1 20, but only weakly way useful. The more mutually remote the
or not at all correlated with it above an elements of the new combination, the more
IQ of 1 20. [This relationship is often called creative the process or solution” (Mednick,
the threshold theory. See the contrast with 1 962). Because the ability to make these
Hayes’s (1 989) certification theory discussed combinations and arrive at a creative solu-
below.] In the architects’ study, in which the tion necessarily depends on the existence of
average IQ was 1 3 0 (significantly above av- the combinations (i.e., the associative ele-
erage), the correlation between intelligence ments) in a person’s knowledge base, and
and creativity was −.08, not significantly dif- because the probability and speed of attain-
ferent from zero (Barron, 1 969, p. 42). How- ment of a creative solution are influenced
ever, in the military officer study, in which by the organization of the person’s associa-
participants were of average intelligence, the tions, Mednick’s theory suggests that creativ-
correlation was .3 3 (Barron, 1 963 , p. 21 9). ity and intelligence are very related; they are
These results suggest that extremely highly overlapping sets.
creative people often have high IQs, but not Moderate correlations of .5 5 , .43 , and
necessarily that people with high IQs tend .41 have been shown between the RAT
to be extremely creative (see also Getzels & and the WISC (Wechsler Intelligence Scale
Jackson, 1 962). for Children), the SAT verbal, and the
Some investigators (e.g., Simonton, 1 994; Lorge-Thorndike Verbal intelligence mea-
Sternberg, 1 996) have suggested that very sures, respectively (Mednick & Andrews,
high IQ may actually interfere with creativ- 1 967). Correlations with quantitative intel-
ity. Those who have very high IQs may be ligence measures were lower (r = .20 − .3 4).
so highly rewarded for their IQ-like (analyt- Correlations with other measures of cre-
ical) skills that they fail to develop the creative performance have been more variable
ative potential within them, which may then (Andrews, 1 975 ).
remain latent. This psychometric approach for measur-
Third, the correlation between IQ and ing creativity had both positive and nega-
creativity is variable, usually ranging from tive effects on the field. On the positive
weak to moderate (Flescher, 1 963 ; Getzels & side, the tests facilitated research by provid-
Jackson, 1 962; Guilford, 1 967; Herr, Moore, ing a brief, easy to administer, objectively
& Hasen, 1 965 ; Torrance, 1 962; Wallach & scorable assessment device. Furthermore,
Kogan, 1 965 ; Yamamoto, 1 964). The corre- research was now possible with “everyday”
lation depends in part on what aspects of cre- people (i.e., noneminent samples). How-
ativity and intelligence are being measured, ever, there were also some negative ef-
how they are being measured, and in what fects. First, some researchers criticized brief
field the creativity is manifested. The role paper-and-pencil tests as trivial, inadequate
of intelligence is different in art and music, measures of creativity; larger productions
for instance, than it is in mathematics and such as actual drawings or writing samples
science (McNemar, 1 964). should be used instead. Second, other crit-
An obvious drawback to the tests used ics suggested that no fluency, flexibility, orig-
and assessments done by Roe and Guilford inality, or elaboration scores captured the
is the time and expense involved in adminis- concept of creativity. In fact, the definition
tering them, as well as the subjective scor- and criteria for creativity are a matter of on-
ing of them. In contrast, Mednick (1 962) going debate, and relying on the objectively
produced a 3 0-item, objectively scored, defined statistical rarity of a response with
regard to all of the responses of a subject dle problem. This problem requires partici-
population is only one of many options. pants to attach a candle to a wall using only
Other possibilities include using the social objects available in a picture (candle, box of
consensus of judges (see Amabile, 1 983 ). tacks, and book of matches). Langley et al.
Third, some researchers were less enchanted (1 987) made a similar claim about the ordi-
by the assumption that noneminent sam- nary nature of creative thinking.
ples could shed light on eminent levels of As a concrete example of this approach,
creativity, which was the ultimate goal for Weisberg and Alba (1 981 ) had people solve
many studies of creativity (e.g., Simonton, the notorious nine-dot problem. In this
1 984). Thus, a certain malaise developed problem, people are asked to connect all of
and continues to accompany the paper-and- the dots, which are arranged in the shape
pencil assessment of creativity. Some psy- of a square with three rows of three dots
chologists, at least, avoided this measure- each, using no more than four straight lines,
ment quagmire in favor of less problematic never arriving at a given dot twice, and never
research topics. lifting their pencil from the page. The prob-
lem can be solved only if people allow their
line segments to go outside the periphery of
Cognitive Approaches
the dots. Typically, solution of this task had
The cognitive approach to creativity seeks been viewed as hinging upon the insight that
understanding of the mental representations one had to go “outside the box.” Weisberg
and processes underlying creative thought and Alba showed that even when people
(see Lubart, 2000–2001 ). By studying per- were given that insight, they still had diffi-
ception or memory, one would already be culty in solving the problem. In other words,
studying the bases of creativity; thus, the whatever is required to solve the nine-dot
study of creativity would merely represent problem, it is not just some kind of extra-
an extension, and perhaps not a very large ordinary insight.
one, of work that is already being done un- There have been studies with both hu-
der another guise. For example, in the cogni- man subjects and computer simulations of
tive area, creativity was often subsumed un- creative thought. Approaches based on the
der the study of intelligence (see Sternberg, study of human subjects are perhaps proto-
Chap. 3 1 ). We do not argue with the idea typically exemplified by the work of Finke,
that creativity and intelligence are related Ward, and Smith (1 992) (see also contri-
to each other (Lubart, 2003 ; Sternberg butions to Smith, Ward, & Finke, 1 995 ;
& O’Hara, 1 999). However, the subsump- Sternberg & Davidson, 1 994; Ward, Smith,
tion has often been so powerful that re- & Finke, 1 999). Finke and his colleagues
searchers such as Wallach and Kogan (1 965 ), have proposed what they call the Geneplore
among others, had to write at length on why model, according to which there are two
creativity and intelligence should be viewed main processing phases in creative thought –
as distinct entities. In more recent cognitive a generative phase and an exploratory phase.
work, Weisberg (1 986, 1 988, 1 993 , 1 999) In the generative phase, an individual con-
has proposed that creativity involves essen- structs mental representations referred to as
tially ordinary cognitive processes yielding preinventive structures, which have proper-
extraordinary products. A similar point has ties promoting creative discoveries. In the
been made by Perkins (1 981 ). Weisberg at- exploratory phase, these properties are used
tempted to show that the insights depend on to come up with creative ideas. A number
subjects using conventional cognitive proof mental processes may enter into these
cesses (e.g., analogical transfer) applied to phases of creative invention, such as re-
knowledge already stored in memory. He trieval, association, synthesis, transformation
did so through the use of case studies of (see Tversky, Chap. 1 0), analogical trans-
eminent creators and laboratory research, fer (see Holyoak, Chap. 6), and categori-
such as studies with Duncker’s (1 945 ) can- cal reduction (i.e., mentally reducing objects
creativity 357
or elements to more primitive categorical ventional tests. Thus, the components iso-
descriptions). In a typical experimental test lated for relatively novel items would tend
based on the model (Finke & Slayton, 1 988), to correlate more highly with more unusual
participants will be shown parts of objects, tests of fluid abilities than with tests of crys-
such as a circle, a cube, a parallelogram, and tallized abilities. We also found that when
a cylinder. On a given trial, three parts will response times on the relatively novel prob-
be named, and participants will be asked to lems were componentially analyzed, some
imagine combining the parts to produce a components better measured the creative as-
practical object or device. For example, par- pect of intelligence than did others. For ex-
ticipants might imagine a tool, a weapon, or ample, in the “grue-bleen” task mentioned
a piece of furniture. The objects thus pro- previously, the information processing com-
duced are then rated by judges for their prac- ponent requiring people to switch from con-
ticality and originality. Morrison and Wallace ventional green-blue thinking to grue-bleen
(2002) found that judged creativity on such thinking, and then back to green-blue think-
a task correlated strongly with the individu- ing again, was a particularly good measure of
als’ perceived imagery vividness. the ability to cope with novelty.
In work on convergent creative thinking Computer simulation approaches, re-
that required participants to think in un- viewed by Boden (1 992, 1 999), have as their
usual ways, we presented 80 individuals with goal the production of creative thought by a
novel kinds of reasoning problems that had a computer in a manner that simulates what
single best answer. For example, they might people do. Langley, Simon, Bradshaw, and
be told that some objects are green and oth- Zytkow (1 987), for example, developed a
ers blue, whereas still other objects might be set of programs that rediscover basic sci-
grue, meaning green until the year 2000 and entific laws. These computational models
blue thereafter, or bleen, meaning blue until rely on heuristics – problem-solving guide-
the year 2000 and green thereafter. Or they lines – for searching a data set or conceptual
might be told about four kinds of people on space and finding hidden relationships be-
the planet Kyron, blens, who are born young tween input variables. The initial program,
and die young; kwefs, who are born old and called BACON, uses heuristics such as “if
die old; balts, who are born young and die the value of two numeric terms increase to-
old; and prosses, who are born old and die gether, consider their ratio” to search data
young (Sternberg, 1 981 , 1 982; Tetewsky & for patterns. One of BACON’s accomplish-
Sternberg, 1 986). Their task was to predict ments has been to examine observational
future states from past states, given incom- data on the orbits of planets available to
plete information. In another set of stud- Kepler and to rediscover Kepler’s third law
ies, 60 people were given more conven- of planetary motion. This program is un-
tional kinds of inductive reasoning problems, like creative functioning, however, in that
such as analogies, series completions, and the problems are given to it in a struc-
classifications. However, the problems had tured form, whereas creative functioning is
premises preceding them that were either largely about figuring out what the prob-
conventional (dancers wear shoes) or novel lems are (see Runco, 1 994). Further pro-
(dancers eat shoes). The participants had grams have extended the search heuristics,
to solve the problems as though the coun- the ability to transform data sets, and the
terfactuals were true (Sternberg & Gastel, ability to reason with qualitative data and
1 989a, 1 989b). scientific concepts. There are also models
In these studies, we found that correla- concerning an artistic domain. For example,
tions with conventional kinds of tests de- Johnson-Laird (1 988) developed a jazz im-
pended on how novel or nonentrenched the provisation program in which novel devia-
conventional tests were. The more novel the tions from the basic jazz chord sequences
items, the higher the correlations of our tests are guided by harmonic constraints (or
with scores on successively more novel con- tacit principles of jazz) and random choice
when several allowable directions for the Clark, & Lowell, 1 95 3 ), and other motives.
improvisation exist. Amabile (1 983 , 1 996; Hennessey & Ama-
bile, 1 988) and her colleagues conducted
seminal research on intrinsic and extrinsic
Social-Personality and Social-Cognitive
motivation. Studies using motivational train-
Approaches
ing and other techniques have manipulated
Developing in parallel with the cognitive ap- these motivations and observed effects on
proach, work in the social-personality ap- creative performance tasks, such as writing
proach has focused on personality variables, poems and making collages.
motivational variables, and the sociocul- Finally, the relevance of the social envi-
tural environment as sources of creativity. ronment to creativity has also been an active
Researchers such as Amabile (1 983 ), Bar- area of research. At the societal level, Simon-
ron (1 968, 1 969), Eysenck (1 993 ), Gough ton (1 984, 1 988, 1 994, 1 999) conducted nu-
(1 979), MacKinnon (1 965 ), and others merous studies in which eminent levels of
noted that certain personality traits often creativity over large spans of time in diverse
characterize creative people. Through cor- cultures have been statistically linked to en-
relational studies and research contrasting vironmental variables. These variables in-
high and low creative samples (at both emi- clude, among others, cultural diversity, war,
nent and everyday levels), a large set of po- availability of role models, availability of re-
tentially relevant traits has been identified sources (e.g., financial support), and number
(Barron & Harrington, 1 981 ; Feist, 1 999). of competitors in a domain. Cross-cultural
These traits include independence of judg- comparisons (e.g., Lubart, 1 990) and anthro-
ment, self-confidence, attraction to com- pological case studies (e.g., Maduro, 1 976;
plexity, aesthetic orientation, openness to Silver, 1 981 ) have demonstrated cultural
experience, and risk taking. variability in the expression of creativity.
Proposals regarding self-actualization and Moreover, they have shown that cultures dif-
creativity can also be considered within the fer simply in the amount that they value the
personality tradition. According to Maslow creative enterprise.
(1 968), boldness, courage, freedom, spon- The social-cognitive and social-persona-
taneity, self-acceptance, and other traits lead lity approaches have each provided valuable
a person to realize his or her full poten- insights into creativity. However, if you look
tial. Rogers (1 95 4) described the tendency for research that investigates both social-
toward self-actualization as having motiva- cognitive and social-personality variables at
tional force and being promoted by a sup- the same time, you would find only a hand-
portive, evaluation-free environment. These ful of studies. The cognitive work on cre-
ideas, however, seem at odds with the many ativity has tended to ignore the personality
studies that have linked creativity and men- and social system, and the social-personality
tal illness (e.g., Kaufman, 2001 a, 2001 b; approaches tended to have little or nothing
Kaufman & Baer, 2002; Ludwig, 1 995 ). If full to say about the mental representations and
creative potential is truly linked with self- processes underlying creativity.
acceptance and other positive traits, then Looking beyond the field of psychology,
one would not expect to find so many emi- Wehner, Csikszentmihalyi, and Magyari-
nent creative individuals to have such malad- Beck (1 991 ) examined 1 00 more recent doc-
justed and poor coping strategies (Kaufman, toral dissertations on creativity. They found
2002; Kaufman & Sternberg, 2000). a “parochial isolation” of the various stud-
Focusing on motivation for creativity, a ies concerning creativity. There were rele-
number of theorists have hypothesized the vant dissertations from psychology, educa-
relevance of intrinsic motivation (Amabile, tion, business, history, history of science, and
1 983 , 1 996; Crutchfield, 1 962; Golann, other fields, such as sociology and political
1 962), need for order (Barron, 1 963 ), need science. However, the different fields tended
for achievement (McClelland, Atkinson, to use different terms and focus on different
creativity 359
aspects of what seemed to be the same novel and of value, that is, creative. This
basic phenomenon. For example, business process, as well as blind generation, are de-
dissertations used the term “innovation” and scribed further by Cziko (1 998).
tended to look at the organizational level, Does an evolutionary model really ade-
whereas psychology dissertations used the quately describe creativity? Robert Stern-
term “creativity” and looked at the level of berg (1 997, 2003 ) argued that it does not,
the individual. Wehner, Csikszentmihalyi, and David Perkins (1 998) also had doubts.
and Magyari-Beck (1 991 ) described the sit- Sternberg argued that it seems utterly im-
uation with creativity research in terms of plausible that great creators such as Mozart,
the fable of the blind men and the elephant. Einstein, or Picasso were using nothing more
“We touch different parts of the same beast than blind variation to come up with their
and derive distorted pictures of the whole ideas. Good creators, like experts of any
from what we know: ‘The elephant is like a kind, may or may not have more ideas than
snake,’ says the one who only holds its tail; other people have, but they have better
‘The elephant is like a wall,’ says the one ideas, ones that are more likely to be se-
who touches its flanks” (p. 270). lectively retained. The reason they are more
likely to be selectively retained is that they
were not produced in a blind fashion. This
Evolutionary Approaches to Creativity
debate is by no means resolved, however,
The evolutionary approach to creativity was and is likely to continue into the future for
instigated by Donald Campbell (1 960), who some time to come.
suggested that the same kinds of mecha- Perkins (1 995 , 1 998) argued that the
nisms that have been applied to the study analogy between biological evolution and
of the evolution of organisms could be ap- creativity is oversimplified. In particular
plied to the evolution of ideas. This idea (Perkins, 1 998), biological evolution relies
has been enthusiastically picked up by a on massive parallel search for mutations
number of investigators (Simonton, 1 995 , (millions of bacteria, for example, are mutat-
1 998, 1 999). ing every second), whereas humans do not.
The basic idea underlying this approach At the same time, humans can do fairly ex-
is that there are two basic steps in the gen- tensive searches, such as when they seek out
eration and propagation of creative ideas. new antibiotics.
The first is blind variation, by which the cre- Were it the case that an understanding
ator generates an idea without any real idea of creativity required a multidisciplinary ap-
of whether the idea will be successful (se- proach, the result of a unidisciplinary ap-
lected for) in the world of ideas. Indeed, proach might be that we would view a part
Dean Simonton (1 996) argued that creators of the whole as the whole. At the same time,
do not have the slightest idea as to which though, we would have an incomplete ex-
of their ideas will succeed. As a result, their planation of the phenomenon we are seek-
best bet for producing lasting ideas is to go ing to explain, leaving dissatisfied those who
for a large quantity of ideas. The reason is do not subscribe to the particular discipline
that their hit rate remains relatively constant doing the explaining. We believe that tradi-
through their professional life span. In other tionally this has been the case for creativ-
words, they have a fixed proportion of ideas ity. More recently, theorists have begun to
that will succeed. The more ideas they have develop confluence approaches to creativity,
in all, the more ideas they have that will which we now discuss.
achieve success.
The second step is selective retention. In
Confluence Approaches to the Study
this step, the field in which the creator works
of Creativity
either retains the idea for the future or lets
it die out. Those ideas that are selectively Many more recent works on creativity hy-
retained are the ones that are judged to be pothesize that multiple components must
converge for creativity to occur (Amabile, a different “systems” approach and high-
1 983 ; Csikszentmihalyi, 1 988; Gardner, lighted the interaction of the individual, do-
1 993 ; Gruber, 1 989; Gruber & Wallace, main, and field. An individual draws upon
1 999; Lubart, 1 994, 1 999; Lubart, Mouchi- information in a domain and transforms or
roud, Tordjman, & Zenasni, 2003 ; Mumford extends it via cognitive processes, person-
& Gustafson, 1 988; Perkins, 1 981 ; Simonton, ality traits, and motivation. The field, con-
1 988; Sternberg, 1 985 b; Sternberg & sisting of people who control or influence a
Lubart, 1 991 , 1 995 , 1 996; Weisberg, 1 993 ; domain (e.g., art critics and gallery owners),
Woodman & Schoenfeldt, 1 989). Sternberg evaluates and selects new ideas. The domain,
(1 985 b), for example, examined laypersons’ a culturally defined symbol system such as
and experts’ conceptions of the creative alphabetic writing, mathematical notation,
person. People’s implicit theories contain or musical notation, preserves and transmits
a combination of cognitive and personality creative products to other individuals and
elements, such as “connects ideas,” “sees future generations. Gardner (1 993 ; see also
similarities and differences,” “has flexibility,” Policastro & Gardner, 1 999) conducted case
“has aesthetic taste,” “is unorthodox,” “is studies that suggest that the development of
motivated,” “is inquisitive,” and “questions creative projects may stem from an anomaly
societal norms.” within a system (e.g., tension between com-
At the level of explicit theories, Amabile peting critics in a field) or moderate asyn-
(1 983 , 1 996; Collins & Amabile, 1 999) de- chronies between the individual, domain,
scribed creativity as the confluence of intrin- and field (e.g., unusual individual talent for
sic motivation, domain-relevant knowledge a domain). In particular, Gardner (1 993 ) an-
and abilities, and creativity-relevant skills. alyzed the lives of seven individuals who
The creativity-relevant skills include made highly creative contributions in the
twentieth century with each specializing in
1 . a cognitive style that involves coping with one of the multiple intelligences (Gardner,
complexities and breaking one’s mental 1 983 ): Sigmund Freud (intrapersonal), Al-
set during problem solving; bert Einstein (logical-mathematical), Pablo
2. knowledge of heuristics for generating Picasso (spatial), Igor Stravinsky (musical),
novel ideas, such as trying a counterintu- T. S. Eliot (linguistic), Martha Graham
itive approach; and (bodily-kinesthetic), and Mohandas Gandhi
3 . a work style characterized by concen- (interpersonal). Charles Darwin would be
trated effort, an ability to set aside prob- an example of someone with extremely
lems, and high energy. high naturalist intelligence. Gardner pointed
out, however, that most of these individ-
Gruber (1 981 , 1 989) and Gruber and uals actually had strengths in more than
Davis (1 988) proposed a developmental one intelligence and that they also had no-
evolving-systems model for understanding table weaknesses in others (e.g., Freud’s
creativity. A person’s knowledge, purpose, weaknesses may have been in spatial and
and affect grow over time, amplify devi- musical intelligences).
ations that an individual encounters, and Although creativity can be understood in
lead to creative products. Developmen- terms of uses of the multiple intelligences
tal changes in the knowledge system have to generate new and even revolutionary
been documented in cases such as Charles ideas, Gardner’s (1 993 ) analysis goes well be-
Darwin’s thoughts on evolution. Purpose yond the intellectual. For example, Gardner
refers to a set of interrelated goals, which pointed out two major themes in the behav-
also develop and guide an individual’s behavior of these creative giants. First, they tended
ior. Finally, the affect or mood system notes to have a matrix of support at the time
the influence of joy or frustration on the of their creative breakthroughs. Second,
projects undertaken. they tended to drive a “Faustian bargain,”
Csikszentmihalyi (1 988, 1 996; Feldman, whereby they gave up many of the plea-
Csikszentmihalyi, & Gardner, 1 994) took sures people typically enjoy in life to attain
creativity 361
extraordinary success in their careers. How- ties, knowledge, styles of thinking, personal-
ever, it is not clear that these attributes are ity, motivation, and environment.
intrinsic to creativity, per se; rather, they Concerning the confluence of compo-
seem to be associated with those who have nents, creativity is hypothesized to involve
been driven to exploit their creative gifts in more than a simple sum of a person’s
a way that leads them to attain eminence. level on each component. First, there may
Gardner (1 993 ) further followed Csik- be thresholds for some components (e.g.,
szentmihalyi (1 988, 1 996) in distinguishing knowledge), below which creativity is not
between the importance of the domain (the possible regardless of the levels on other
body of knowledge about a particular subcomponents. Second, partial compensation
ject area) and the field (the context in which may occur in which a strength on one
this body of knowledge is studied and elab- component (e.g., motivation) counteracts a
orated, including the persons working with weakness on another component (e.g., envi-
the domain, such as critics, publishers, and ronment). Third, interactions may also oc-
other “gatekeepers”). Both are important to cur between components, such as intelli-
the development, and, ultimately, the recog- gence and motivation, in which high levels
nition of creativity. on both components could multiplicatively
A final confluence theory considered here enhance creativity.
is Sternberg and Lubart’s (1 991 , 1 995 ) invest- In general, confluence theories of creativ-
ment theory of creativity. According to this ity offer the possibility of accounting for di-
theory, creative people are ones who are will- verse aspects of creativity (Lubart, 1 994).
ing and able to “buy low and sell high” in For example, analyses of scientific and artis-
the realm of ideas (see also Lubart & Runco, tic achievements suggest that the median-
1 999; Rubenson & Runco, 1 992, for use rated creativity of work in a domain tends to
of concepts from economic theory). Buying fall toward the lower end of the distribution
low means pursuing ideas that are unknown and the upper – high creativity – tail extends
or out of favor but that have growth poten- quite far. This pattern can be explained
tial. Often, when these ideas are first pre- through the need for multiple components
sented, they encounter resistance. The cre- of creativity to co-occur in order for the
ative individual persists in the face of this highest levels of creativity to be achieved. As
resistance, and eventually sells high, moving another example, the partial domain speci-
on to the next new or unpopular idea. ficity of creativity that is often observed
Preliminary research within the invest- can be explained through the mixture of
ment framework has yielded support for this some relatively domain-specific components
model (Lubart & Sternberg, 1 995 ). This re- for creativity, such as knowledge, and other
search has used tasks such as more domain-general components, such as,
perhaps, the personality trait of persever-
1 . writing short stories using unusual titles ance. Creativity, then, is largely something
(e.g., “the octopus’ sneakers”), that people show in a particular domain.
2. drawing pictures with unusual themes
(e.g., “the earth from an insect’s point of
view”), Alternate Approaches to
3 . devising creative advertisements for bor- Understanding Kinds of Creative
ing products (e.g., cufflinks), and Contributions
4. solving unusual scientific problems (e.g.,
how we could tell if someone had been Generally, we think of creative contribu-
on the moon within the past month?). tions as being of a single kind. However,
a number of researchers on creativity have
This research showed creative performance questioned this assumption. There are many
to be moderately domain specific and to be ways of distinguishing among types of cre-
predicted by a combination of six distinct ative contributions. It is important to re-
but interrelated resources: intellectual abili- member, though, that creative contributions
can be viewed in different ways at different 1 997). However, in no case were these mod-
times. At a given time, the field can never els intended explicitly to distinguish among
be sure of whose work will withstand the types of creative contributions.
judgments of the field over time (e.g., that Maslow (1 967) distinguished more gener-
of Mozart) and whose work will not (e.g., ally between two types of creativity, which
that of Salieri) (Therivel, 1 999). he referred to as primary and secondary.
Theorists of creativity and related top- Primary creativity is the kind of creativity
ics have recognized that there are differ- a person uses to become self-actualized –
ent types of creative contributions (see re- to find fulfillment in him- or herself and
views in Ochse, 1 990; Sternberg, 1 988c; his or her life. Secondary creativity is the
Weisberg, 1 993 ). For example, Kuhn (1 970) kind of creativity with which scholars in
distinguished between normal and revolu- the field are more familiar – the kind that
tionary science. Normal science expands leads to creative achievements recognized by
upon or otherwise elaborates upon an al- a field.
ready existing paradigm of scientific re- Ward, Smith, and Finke (1 999) noted that
search, whereas revolutionary science pro- there is evidence to favor the roles of both fo-
poses a new paradigm (see Dunbar & Fugel- cusing (Bowers et al., 1 990; Kaplan & Simon,
sang, Chap. 29). The same kind of distinction 1 990) and exploratory thinking (Bransford
can be applied to the arts and letters. & Stein, 1 984; Getzels & Csikszentmiha-
Gardner (1 993 , 1 994) also described dif- lyi, 1 976) on creative thinking. In focusing,
ferent types of creative contributions indi- one concentrates on pursuing a single
viduals can make. They include problem-solving approach, whereas in ex-
ploratory thinking one considers many such
1. the solution of a well-defined problem, approaches. A second distinction made by
2. the devising of an encompassing theory, Ward and his colleagues is between domain
3. the creation of a “frozen work,” specific (Clement, 1 989; Langley, Simon,
4. the performance of a ritualized work, Bradshaw, & Zytkow, 1 987; Perkins, 1 981 ;
and Weisberg, 1 986) and universal (Finke, 1 990,
5 . a “high-stakes” performance. Each type of 1 995 ; Guilford, 1 968; Koestler, 1 964) cre-
creativity has as its result a different kind ativity skills. Finally, Ward and his colleagues
of creative product. distinguish between unstructured (Bateson,
1 979; Findlay & Lumsden, 1 988; Johnson-
Other bases for distinguishing among Laird, 1 988) and structured or systematic
types of creative contributions also exist. For (Perkins, 1 981 ; Ward, 1 994; Weisberg, 1 986)
example, psychoeconomic models such as creativity, where the former is displayed in
those of Rubenson and Runco (1 992) and systems with relatively few rules, and the lat-
Sternberg and Lubart (1 991 , 1 995 , 1 996) can ter, in systems with many rules.
distinguish different types of contributions There are tens of thousands of artists, mu-
in terms of the parameters of the models. sicians, writers, scientists, and inventors to-
In the Sternberg–Lubart model, contribu- day. What makes some of them stand out
tions might differ in the extent to which they from the rest? Why will some of them be-
“defy the crowd” or in the extent to which come distinguished contributors in the an-
they redefine how a field perceives a set nals of their field and others be forgotten?
of problems. Although many variables may contribute to
Simonton’s (1 997) model of creativity who stands out from the crowd, certainly
also proposes parameters of creativity, and creativity is one of them. The standouts
various kinds of creative contributions might are often those who are doing particu-
be seen as differing in terms of the extent larly creative work in their line of profes-
to which they vary from other contributions sional pursuit. Are these highly creative in-
and the extent to which they are selected for dividuals simply doing more highly creative
recognition by a field of endeavor (see also work than their less visible counterparts, or
Campbell, 1 960; Perkins, 1 995 ; Simonton, does the creativity of their work also differ
creativity 363
in quality? One possibility is that creative what domain a contribution is creative but
contributors make different decisions regard- of what the type of creative contribution is.
ing how to express their creativity. This What makes one work in biology more cre-
section describes a propulsion theory of ative or creative in a different way from an-
creative contributions (Sternberg, 1 999b; other work in biology, or what makes its cre-
Sternberg, Kaufman, & Pretz, 2002) that ative contribution different from that of a
addresses this issue of how people decide work in art? Thus, a taxonomy of domains
to invest their creative resources. The ba- of work is insufficient to elucidate the nature
sic idea is that creativity can be of dif- of creative contributions. A field needs a ba-
ferent kinds, depending on how it pro- sis for scaling how creative contributions dif-
pels existing ideas forward. When devel- fer quantitatively and, possibly, qualitatively.
oping creativity in children, we can foster For instance,
different kinds of creativity, ranging from
minor replications to major redirections in 1 . Replication. The contribution is an at-
their thinking. tempt to show that the field is in the
Creative contributions differ not only in right place. The propulsion keeps the field
their amounts but also in the types of creativ- where it is rather than moving it. This type
ity they represent. For example, both Sig- of creativity is represented by stationary
mund Freud and Anna Freud were highly motion, as of a wheel that is moving but
creative psychologists, but the nature of staying in place.
their contributions seems in some way or 2. Redefinition. The contribution is an at-
ways to have been different. Sigmund Freud tempt to redefine where the field is. The
proposed a radically new theory of human current status of the field thus is seen from
thought and motivation, and Anna Freud different points of view. The propulsion
largely elaborated on and modified Sigmund leads to circular motion such that the cre-
Freud’s theory. How do creative contribu- ative work leads back to where the field is
tions differ in quality and not just in quantity but as viewed in a different way.
of creativity?
3 . Forward Incrementation. The contribution
The type of creativity exhibited in a cre-
is an attempt to move the field forward
ator’s works can have at least as much of an
in the direction it already is going. The
effect on judgments about that person and
propulsion leads to forward motion.
his or her work as does the amount of cre-
ativity exhibited. In many instances, it may 4. Advance Forward Incrementation. The
have more of an effect on these judgments. contribution is an attempt to move the
Given the importance of purpose, cre- field forward in the direction it is al-
ative contributions must always be defined ready going, but by moving beyond where
in some context. If the creativity of an in- others are ready for it to go. The propul-
dividual is judged in a context, then it will sion leads to forward motion that is accel-
help to understand how the context interacts erated beyond the expected rate of for-
with how people are judged. In particular, ward progression.
what are the types of creative contributions 5 . Redirection. The contribution is an at-
a person can make within a given context? tempt to redirect the field from where it is
Most theories of creativity concentrate on toward a different direction. The propul-
attributes of the individual (see Sternberg, sion thus leads to motion in a direction
1 999b). However, to the extent that creativ- that diverges from the way the field is cur-
ity depends on the interaction of person with rently moving.
context, we would also need to concentrate 6. Reconstruction/Redirection. The contribu-
on the attributes of the individual and the in- tion is an attempt to move the field back
dividual’s work relative to the environmen- to where it once was (a reconstruction of
tal context. the past) so it may move onward from that
A taxonomy of creative contributions point, but in a direction different from
needs to deal with the question not just of in the one it took from that point onward.
The propulsion thus leads to motion that Author Notes

is backward and then redirective.
7. Reinitiation. The contribution is an at- Preparation of this article was supported
tempt to move the field to a different as by Grant No. REC-9979843 from the
yet unreached starting point and then to National Science Foundation and by a grant
move from that point. The propulsion is under the Javits Act Program (Grant No.
thus from a new starting point in a direc- R206R000001 ) as administered by the In-
tion that is different from that the field stitute of Education Sciences, U.S. Depart-
previously has pursued. ment of Education. Grantees undertaking
8. Integration. The contribution is an at- such projects are encouraged to express
tempt to integrate two formerly diverse freely their professional judgment. This ar-
ways of thinking about phenomena into ticle, therefore, does not necessarily repre-
a single way of thinking about a phe- sent the position or policies of the National
nomenon. The propulsion thus is a com- Science Foundation, Office of Educational
bination of two different approaches that Research and Improvement, or the U.S. De-
are linked together. partment of Education, and no official en-
dorsement should be inferred.
The eight types of creative contributions
described previously are largely qualitatively
distinct. Within each type, however, there
can be quantitative differences. For exam-
ple, a forward incrementation can represent References
a fairly small step forward or a substan-
tial leap. An initiation can restart a subfield Adams, J. L. (1 974). Conceptual blockbusting: A
(e.g., the work of Leon Festinger on cogni- guide to better ideas. San Francisco: Freeman
tive dissonance) or an entire field (e.g., the Press.
work of Einstein on relativity theory). Thus, Adams, J. L. (1 986). The care and feeding of ideas:
the theory distinguishes contributions both A guide to encouraging creativity. Reading, MA:
Addison-Wesley.
qualitatively and quantitatively.
Amabile, T. M. (1 983 ). The social psychology of
creativity. New York: Springer–Verlag.
Conclusions and Future Directions Amabile, T. M. (1 996). Creativity in context.
Boulder, CO: Westview.
Andrews, F. M. (1 975 ). Social and psychological
In sum, creativity, which has often been factors which influence the creative process. In
viewed as beyond study, is anything but. I. A. Taylor & J. W. Getzels (Eds.), Perspectives
Creativity can be understood about as well as in creativity (pp. 1 1 7–1 45 ). Chicago: Aldine.
any psychological construct, if appropriate Barron, F. (1 963 ). Creativity and psychological
methods are brought to bear upon its inves- health. Princeton, NJ: D. Van Nostrand.
tigations. The history of creativity theory and Barron, F. (1 968). Creativity and personal freedom.
research is long and interesting. It represents New York: Van Nostrand.
a diversity of attempts to understand the Barron, F. (1 969). Creative person and creative pro-
phenomenon. More recently, scholars have cess. New York: Holt, Rinehart & Winston.
recognized that creativity can be of multi- Barron, F. (1 988). Putting creativity to work. In
ple kinds and have tried to understand these R. J. Sternberg (Ed.), The nature of creativity
different kinds. A full account of creativity (pp. 76–98). New York: Cambridge University
would need to take into account not just Press.
differing amounts of creativity but differing Barron, F., & Harrington, D. M. (1 981 ). Creativ-
kinds. These kinds would include creativity, intelligence, and personality. Annual Review
ity that accepts current paradigms, creativity of Psychology, 3 2 , 43 9–476.
that rejects them, and creativity that synthe- Bateson, G. (1 979). Mind and nature. London:
sizes them into a new whole. Wildwood House.
creativity 365
Boden, M. (1 992). The creative mind: Myths and De Bono, E. (1 992). Serious creativity: Using the
mechanisms. New York: Basic Books. power of lateral thinking to create new ideas. New
Boden, M. A. (1 999). Computer models of cre- York: Harper Collins.
ativity. In R. J. Sternberg (Ed.), Handbook of Duncker, K. (1 945 ). On problem solving. Psycho-
creativity (pp. 3 5 1 –3 72). New York: Cambridge logical Monographs, 68(5 ), whole no. 270.
University Press. Eysenck, H. J. (1 993 ). Creativity and personality:
Bowers, K. S., Regehr, G., Balthazard, C., & A theoretical perspective. Psychological Inquiry,
Parker, K. (1 990). Intuition in the context of 4, 1 47–1 78.
discovery. Cognitive Psychology, 2 2 , 72–1 09. Feist, G. J. (1 999). The influence of personal-
Bransford, J. D., & Stein, B. (1 984). The IDEAL ity on artistic and scientific creativity. In R. J.
problem solver. New York: Freeman Press. Sternberg (Ed.), Handbook of creativity (pp.
Campbell, D. T. (1 960). Blind variation and se- 273 –296). New York: Cambridge University
lective retention in creative thought and other Press.
knowledge processes. Psychological Review, 67, Feldman, D. H. (1 999). The development of cre-
3 80–400. ativity. In R. J. Sternberg (Ed.), Handbook of
Clement, J. (1 989). Learning via model con- creativity (pp. 1 69–1 86). New York: Cambridge
struction and criticism: Protocol evidence on University Press.
sources of creativity in science. In G. Glover, Feldman, D. H., Csikszentmihalyi, M., & Gard-
R. Ronning, & C. Reynolds (Eds.), Handbook ner, H. (1 994). Changing the world: A frame-
of creativity (pp. 3 41 –3 81 ). New York: Plenum. work for the study of creativity. Westport, CT:
Collins, M. A., & Amabile, T. M. (1 999). Moti- Praeger Publishers.
vation and creativity. In R. J. Sternberg (Ed.), Findlay, C. S., & Lumsden, C. J. (1 988). The cre-
Handbook of creativity (pp. 297–3 1 2). New ative mind: Toward an evolutionary theory of
York: Cambridge University Press. discovery and invention. Journal of Social and
Cox, C. M. (1 926). The early mental traits of three Biological Structures, 11, 3 –5 5 .
hundred geniuses. Stanford, CA: Stanford Uni- Finke, R. (1 990). Creative imagery: Discoveries
versity Press. and inventions in visualization. Hillsdale, NJ:
Crutchfield, R. (1 962). Conformity and cre- Erlbaum.
ative thinking. In H. Gruber, G. Terrell, & M. Finke, R. (1 995 ). A. Creative insight and prein-
Wertheimer (Eds.), Contemporary approaches ventive forms. In R. J. Sternberg & J. E. David-
to creative thinking (pp. 1 20–1 40). New York: son (Eds.), The nature of insight (pp. 25 5 –280).
Atherton Press. Cambridge, MA: MIT Press.
Csikszentmihalyi, M. (1 988). Society, culture, Finke, R. A., & Slayton, K. (1 988). Explorations
and person: A systems view of creativity. In of creative visual synthesis in mental imagery.
R. J. Sternberg (Ed.), The nature of creativity Memory and Cognition, 1 6(3 ), 25 2–25 7.
(pp. 3 25 –3 3 9). New York: Cambridge Univer- Finke, R. A., Ward T. B., & Smith, S. M. (1 992).
sity Press. Creative cognition: Theory, research, and appli-
Csikszentmihalyi, M. (1 996). Creativity: Flow and cations. Cambridge, MA: MIT Press.
the psychology of discovery and invention. New Flescher, I. (1 963 ). Anxiety and achievement
York: Harper Collins. of intellectually gifted and creatively gifted
Csikszentmihalyi, M. (1 999). Implications of a children. Journal of Psychology, 5 6, 25 1 –
systems perspective for the study of creativity. 268.
In R. J. Sternberg (Ed.), Handbook of creativity Freud, S. (1 908/1 95 9). The relation of the poet
(pp. 3 1 3 –3 3 5 ). New York: Cambridge Univer- to day-dreaming. In S. Freud (Ed.), Collected
sity Press. papers (Vol. 4, pp. 1 73 –1 83 ). London: Hogarth
Cziko, G. A. (1 998). From blind to creative: In de- Press.
fense of Donald Campbell’s selectionist theory Freud, S. (1 91 0/1 964). Leonardo da Vinci and a
of human creativity. Journal of Creative Behav- memory of his childhood. New York: Norton.
ior, 3 2 , 1 92–208. (Original work published in 1 91 0).
De Bono, E. (1 971 ). Lateral thinking for manage- Gardner, H. (1 983 ). Frames of mind: The theory of
ment. New York: McGraw-Hill. multiple intelligences. New York: Basic Books.
De Bono, E. (1 985 ). Six thinking hats. Boston: Gardner, H. (1 993 ). Multiple intelligences: The the-
Little, Brown. ory in practice. New York: Basic Books.
Gardner, H. (1 994). The stories of the right Hennessey, B. A., & Amabile, T. M. (1 988). The
hemisphere. In W. D. Spaulding (Ed.), Integra- conditions of creativity. In R. J. Sternberg (Ed.),
tive views of motivation, cognition, and emotion. The nature of creativity (pp. 1 1 –3 8). New York:
Nebraska symposium on motivation (Vol. 41 , Cambridge University Press.
pp. 5 7–69). Lincoln: University of Nebraska Herr, E. L., Moore, G. D., & Hasen, J. S. (1 965 ).
Press. Creativity, intelligence, and values: A study
Getzels, J., & Csikszentmihalyi, M. (1 976). The of relationships. Exceptional Children, 3 2 , 1 1 4–
creative vision: A longitudinal study of problem 115.
finding in art. New York: Wiley-Interscience. Johnson-Laird, P. N. (1 988). Freedom and con-
Getzels, J. W., & Jackson, P. W. (1 962). Creativ- straint in creativity. In R. J. Sternberg (Ed.), The
ity and intelligence: Explorations with gifted stu- nature of creativity (pp. 202–21 9). New York:
dents. New York: Wiley. Cambridge University Press.
Ghiselin, B. (Ed.). (1 985 ). The creative process: A Kaplan, C. A., & Simon, H. A. (1 990). In search
symposium. Berkeley: University of California of insight. Cognitive Psychology, 2 2 , 3 74–41 9.
Press. Kaufman, J. C. (2001 a). Genius, lunatics, and
Golann, S. E. (1 962). The creativity motive. Jour- poets: Mental illness in prize-winning authors.
nal of Personality, 3 0, 5 88–600. Imagination, Cognition, and Personality, 2 0(4),
Gough, H. G. (1 979). A creativity scale for the 3 05 –3 1 4.
adjective check list. Journal of Personality and Kaufman, J. C. (2001 b). The Sylvia Plath effect:
Social Psychology, 3 7, 1 3 98–1 405 . Mental illness in eminent creative writers. Jour-
Gordon, W. J. J. (1 961 ). Synectics: The development nal of Creative Behavior, 3 5 (1 ), 3 7–5 0.
of creative capacity. New York: Harper & Row. Kaufman, J. C. (2002). Creativity and confidence:
Gruber, H. (1 981 ). Darwin on man: A psycho- Price of achievement? American Psychologist,
logical study of scientific creativity (2nd ed.). 5 7, 3 75 –3 76.
Chicago: University of Chicago Press. (Origi- Kaufman, J. C., & Baer, J. (2002). I bask in dreams
nal work published 1 974.) of suicide: Mental illness and poetry. Review of
Gruber, H. E. (1 989). The evolving systems ap- General Psychology, 6(3 ), 271 –286.
proach to creative work. In D. B. Wallace & Kaufman, J. C., & Sternberg, R. J. (2000). Are
H. E. Gruber (Eds.), Creative people at work: there mental costs to creativity? Bulletin of Psy-
Twelve cognitive case studies (pp. 3 –24). New chology and the Arts, 1(2), 3 8.
York: Oxford University Press. Kipling, R. (1 985 ). Working-tools. In B. Ghis-
Gruber, H. E., & Davis, S. N. (1 988). Inching our elin (Ed.), The creative process: A sympo-
way up Mount Olympus: The evolving-systems sium (pp. 1 61 –1 63 ). Berkeley: University of
approach to creative thinking. In R. J. Sternberg California Press. (Original article published
(Ed.), The nature of creativity (pp. 243 –270). 1 93 7.)
New York: Cambridge University Press. Koestler, A. (1 964). The act of creation. New York:
Gruber, H. E., & Wallace, D. B. (1 999). The case Dell.
study method and evolving systems approach Kris, E. (1 95 2). Psychoanalytic exploration in art.
for understanding unique creative people at New York: International Universities Press.
work. In R. J. Sternberg (Ed.), Handbook of Kubie, L. S. (1 95 8). The neurotic distortion of the
creativity (pp. 93 –1 1 5 ). New York: Cambridge creative process. Lawrence: University of Kansas
University Press. Press.
Guilford, J. P. (1 95 0). Creativity. American Psy- Kuhn, T. S. (1 970). The structure of scientific revolu-
chologist, 5 , 444–45 4. tions (2nd ed.). Chicago: University of Chicago
Guilford, J. P. (1 967). The nature of human intel- Press.
ligence. New York: McGraw-Hill. Langley, P., Simon, H. A., Bradshaw, G. L., &
Guilford, J. P. (1 968). Intelligence has three Zytkow, J. M. (1 987). Scientific discovery: Com-
facets. Science, 1 60(3 828), 61 5 –620. putational explorations of the creative processes.
Hayes, J. R. (1 989). Cognitive processes in cre- Cambridge, MA: MIT Press.
ativity. In J. A. Glover, R. R. Ronning, & Lubart, T. I. (1 990). Creativity and cross-cultural
C. R. Reynolds (Eds.), Handbook of creativity variation. International Journal of Psychology,
(pp. 1 3 5 –1 45 ). New York: Plenum. 2 5 , 3 9–5 9.
creativity 367
Lubart, T. I. (1 994). Creativity. In R. J. Sternberg Mednick, S. A. (1 962). The associative basis of

(Ed.), Thinking and problem solving (pp. 290– the creative process. Psychological Review, 69,
3 3 2). San Diego: Academic Press. 220–23 2.
Lubart, T. I. (1 999). Componential models of cre- Morrison, R. G., & Wallace, B. (2002). Imagery
ativity. In M. A. Runco & S. Pritzer (Eds.), Ency- vividness, creativity, and the visual arts. Journal
clopedia of creativity (pp. 295 –3 00). New York: of Mental Imagery, 2 5 , 1 3 5 –1 5 2.
Academic Press. Mumford, M. D., & Gustafson, S. B. (1 988). Cre-
Lubart, T. I. (2000–2001 ). Models of the creative ativity syndrome: Integration, application, and
process: Past, present and future. Creativity Re- innovation. Psychological Bulletin, 1 03 , 27–43 .
search Journal, 1 3 (3 –4), 295 –3 08. Noy, P. (1 969). A revision of the psychoanalytic
Lubart, T. I. (2003 ). In search of creative theory of the primary process. International
intelligence. In R. J. Sternberg, J. Lautrey, Journal of Psychoanalysis, 5 0, 1 5 5 –1 78.
& T. I. Lubart (Eds.), Models of intelli- Ochse, R. (1 990). Before the gates of excellence.
gence for the next millennium (pp. 279–292). New York: Cambridge University Press.
Washington, DC: American Psychological Osborn, A. F. (1 95 3 ). Applied imagination (rev.
Association. ed.). New York: Charles Scribner’s Sons.
Lubart, T. I., Mouchiroud, C., Tordjman, S., & Perkins, D. N. (1 981 ). The mind’s best work. Cam-
Zenasni, F. (2003 ). Psychologie de la créativité bridge, MA: Harvard University Press.
[Psychology of creativity]. Paris: Colin.
Perkins, D. N. (1 995 ). Insight in minds and genes.
Lubart, T. I., & Runco, M. A. (1 999). Eco- In R. J. Sternberg & J. E. Davidson (Eds.), The
nomic perspective on creativity. In M. A. nature of insight (pp. 495 –5 3 4). Cambridge,
Runco & S. Pritzer (Eds.), Encyclopedia of cre- MA: MIT Press.
ativity (pp. 623 –627). New York: Academic Perkins, D. N. (1 998). In the country of the blind
Press. an appreciation of Donald Campbell’s vision of
Lubart, T. I., & Sternberg, R. J. (1 995 ). An invest- creative thought. Journal of Creative Behavior,
ment approach to creativity: Theory and data. 3 2 (3 ), 1 77–1 91 .
In S. M. Smith, T. B. Ward, & R. A. Finke (Eds.), Policastro, E., & Gardner, H. (1 999). From case
The creative cognition approach (pp. 269–3 02). studies to robust generalizations: An approach
Cambridge, MA: MIT Press. to the study of creativity. In R. J. Sternberg
Ludwig, A. M. (1 995 ). Price of greatness. New (Ed.), Handbook of creativity (pp. 21 3 –225 ).
York: Guilford Press. New York: Cambridge University Press.
MacKinnon, D. W. (1 965 ). Personality and the Renzulli, J. S. (1 986). The three-ring concep-
realization of creative potential. American Psy- tion of giftedness: a developmental model for
chologist, 2 0, 273 –281 . creative productivity. In R. J. Sternberg &
Maduro, R. (1 976). Artistic creativity in a Brah- J. E. Davidson (Eds.), Conceptions of giftedness
min painter community. Research monograph (pp. 5 3 –92). New York: Cambridge University
1 4. Berkeley: Center for South and Southeast Press.
Asia Studies, University of California. Roe, A. (1 95 2). The making of a scientist. New
Maslow, A. (1 967). The creative attitude. In R. L. York: Dodd, Mead.
Mooney & T. A. Rasik (Eds.), Explorations in Roe, A. (1 972). Patterns of productivity of scien-
creativity (pp. 43 –5 7). New York: Harper & tists. Science, 1 76, 940–941 .
Row. Rogers, C. R. (1 95 4). Toward a theory of creativ-
Maslow, A. (1 968). Toward a psychology of being. ity. ETC: A Review of General Semantics, 1 1 ,
New York: Van Nostrand. 249–260.
McClelland, D. C., Atkinson, J. W., Clark, R. A., Rothenberg, A. (1 979). The emerging goddess.
& Lowell, E. L. (1 95 3 ). The achievement motive. Chicago: University of Chicago Press.
New York: Irvington Publishing. Rothenberg, A., & Hausman, C. R. (Eds.). (1 976).
McNemar, Q. (1 964). Lost: Our intelligence? The creativity question. Durham, NC: Duke
Why? American Psychologist, 1 9, 871 –882. University Press.
Mednick, M. T., & Andrews, F. M. (1 967). Cre- Rubenson, D. L., & Runco, M. A. (1 992). The
ative thinking and level of intelligence. Journal psychoeconomic approach to creativity. New
of Creative Behavior, 1 , 428–43 1 . Ideas in Psychology, 1 0, 1 3 1 –1 47.
Runco, M. A. (Ed.). (1 994). Problem finding, Sternberg, R. J. (1 988b). Mental self-government:

problem solving, and creativity. Norwood, NJ: A theory of intellectual styles and their devel-
Ablex. opment. Human Development, 3 1 , 1 97–224.
Silver, H. R. (1 981 ). Calculating risks: The so- Sternberg, R. J. (Ed.). (1 988c). The nature of
cioeconomic foundations of aesthetic innova- creativity: Contemporary psychological perspec-
tion in an Ashanti carving community. Ethnol- tives. New York: Cambridge University Press.
ogy, 2 0(2), 1 01 –1 1 4. Sternberg, R. J. (1 996). For whom does the Bell
Simonton, D. K. (1 984). Genius, creativity, and Curve toll? It tolls for you. Journal of Quality
leadership. Cambridge, MA: Harvard Univer- Learning, 6(1 ), 9–27.
sity Press. Sternberg, R. J. (1 997). Successful intelligence.
Simonton, D. K. (1 988). Scientific genius. New New York: Plume.
York: Cambridge University Press. Sternberg, R. J. (Ed.). (1 999a). Handbook of cre-
Simonton, D. K. (1 994). Greatness: Who makes ativity. New York: Cambridge University Press.
history and why? New York: Guilford Press. Sternberg, R. J. (1 999b). A propulsion model of
Simonton, D. K. (1 995 ). Foresight in insight: A creative contributions. Review of General Psy-
Darwinian answer. In R. J. Sternberg & J. E. chology, 3 , 83 –1 00.
Davidson (Eds.), The nature of insight (pp. 495 – Sternberg, R. J. (2003 ). WICS: A theory of
5 3 4). Cambridge, MA: MIT Press. wisdom, intelligence, and creativity, synthesized.
Simonton, D. K. (1 996). Creative expertise: A New York: Cambridge University Press.
life-span developmental perspective. In K. A. Sternberg, R. J., & Davidson, J. E. (Eds.). (1 994).
Ericsson (Ed.), The road to excellence (pp. 227– The nature of insight. Cambridge, MA: MIT
25 3 ). Mahwah, NJ: Erlbaum. Press.
Simonton, D. K. (1 997). Creative productivity: A Sternberg, R. J., & Gastel, J. (1 989a). Coping with
predictive and explanatory model of career tra- novelty in human intelligence: An empirical in-
jectories and landmarks. Psychological Review, vestigation. Intelligence, 1 3 , 1 87–1 97.
1 04, 66–89. Sternberg, R. J., & Gastel, J. (1 989b). If dancers
Simonton, D. K. (1 998). Donald Campbell’s ate their shoes: Inductive reasoning with fac-
model of the creative process: Creativity tual and counterfactual premises. Memory and
as blind variation and selective retention. Cognition, 1 7, 1 –1 0.
The Journal of Creative Behavior, 3 2 , 1 5 3 – Sternberg, R. J., Kaufman, J. C., & Pretz, J. E.
1 5 8. (2002). The creativity conundrum: A propulsion
Simonton, D. K. (1 999). Talent and its develop- model of kinds of creative contributions. New
ment: An emergenic and epigenetic mode. Psy- York: Psychology Press.
chological Review, 1 06, 43 5 –45 7. Sternberg, R. J., & Lubart, T. I. (1 991 ). An invest-
Smith, S. M., Ward, T. B., & Finke, R. A. (Eds.). ment theory of creativity and its development.
(1 995 ). The creative cognition approach. Cam- Human Development, 3 4(1 ), 1 –3 1 .
bridge, MA: MIT Press. Sternberg, R. J., & Lubart, T. I. (1 995 ). Defying
Sternberg, R. J. (1 981 ). Intelligence and nonen- the crowd: Cultivating creativity in a culture of
trenchment. Journal of Educational Psychology, conformity. New York: Free Press.
73 , 1 –1 6. Sternberg, R. J., & Lubart, T. I. (1 996). Invest-
Sternberg, R. J. (1 982). Natural, unnatural, and ing in creativity. American Psychologist, 5 1 (7),
supernatural concepts. Cognitive Psychology, 677–688.
14, 45 1 –488. Sternberg, R. J., & O’Hara, L. (1 999). Creativity
Sternberg, R. J. (1 985 a). Beyond IQ: A triarchic and intelligence. In R. J. Sternberg (Ed.), Hand-
theory of human intelligence. New York: Cam- book of creativity (pp. 25 1 –272). New York:
bridge University Press. Cambridge University Press.
Sternberg, R. J. (1 985 b). Implicit theories of in- Suler, J. R. (1 980). Primary process thinking and
telligence, creativity, and wisdom. Journal of creativity. Psychological Bulletin, 88, 5 5 5 –5 78.
Personality and Social Psychology, 49(3 ), 607– Tetewsky, S. J., & Sternberg, R. J. (1 986). Concep-
627. tual and lexical determinants of nonentrenched
Sternberg, R. J. (1 988a). The triarchic mind: A new thinking. Journal of Memory and Language, 2 5 ,
theory of human intelligence. New York: Viking. 202–225 .
creativity 369
Therivel, W. A. (1 999). Why Mozart and not Weisberg, R. W. (1 986). Creativity, genius and
Salieri? Creativity Research Journal, 1 2 , 67–76. other myths. New York: Freeman Press.
Torrance, E. P. (1 962). Guiding creative talent. En- Weisberg, R. W. (1 988). Problem solving and cre-
glewood Cliffs, NJ: Prentice-Hall. ativity. In R. J. Sternberg (Ed.), The nature of
Torrance, E. P. (1 974). Torrance tests of creative creativity (pp. 1 48–1 76). New York: Cambridge
thinking. Lexington, MA: Personnel Press. University Press.
Vernon, P. E. (Ed.). (1 970). Creativity: Selected Weisberg, R. W. (1 993 ). Creativity: Beyond the
readings (pp. 1 26–1 3 6). Baltimore: Penguin myth of genius. New York: Freeman Press.
Books. Weisberg, R. W. (1 999). Creativity and know-
von Oech, R. (1 983 ). A whack on the side of the ledge: A challenge to theories. In R. J.
head. New York: Warner. Sternberg (Ed.), Handbook of creativity (pp.
von Oech, R. (1 986). A kick in the seat of the pants. 226–25 0). New York: Cambridge University
New York: Harper & Row. Press.
Wallach, M., & Kogan, N. (1 965 ). Modes of think- Weisberg, R. W., & Alba, J. W. (1 981 ). An exam-
ing in young children. New York: Holt, Rinehart, ination of the alleged role of “fixation” in the
& Winston. solution of several “insight” problems. Journal
Ward, T. B. (1 994). Structured imagination: The of Experimental Psychology: General, 1 1 0, 1 69–
role of conceptual structure in exemplar gen- 1 92.
eration. Cognitive Psychology, 2 7, 1 –40. Werner, H., & Kaplan, B. (1 963 ). Symbol forma-
Ward, T. B., Smith, S. M., & Finke, R. A. (1 999). tion. Hillsdale, NJ: Erlbaum.
Creative cognition. In R. J. Sternberg (Ed.), Woodman, R. W., & Schoenfeldt, L. F. (1 989).
Handbook of creativity (pp. 1 89–21 2). New Individual differences in creativity: An interac-
York: Cambridge University Press. tionist perspective. In J. A. Glover, R. R. Ron-
Wehner, L., Csikszentmihalyi, M., & Magyari- ning, & C. R. Reynolds (Eds.), Handbook of cre-
Beck, I. (1 991 ). Current approaches used in ativity (pp. 77–91 ). New York: Plenum.
studying creativity: An exploratory investiga- Yamamoto, K. (1 964). Creativity and sociomet-
tion. Creativity Research Journal, 4(3 ), 261 – ric choice among adolescents. Journal of Social
271 . Psychology, 64, 249–261 .
CHAPTER 1 6
Complex Declarative Learning
Michelene T. H. Chi
Stellan Ohlsson
Introduction so forth. Many of these component processes

are still poorly understood so we have even
How do people acquire a complex body less understanding of the complex process of
of knowledge, such as the history of the learning a large body of knowledge.
Panama Canal, the structure of the so- Complex knowledge can be partitioned
lar system, or the explanation for how into two types: declarative knowledge and
the human circulatory system works? Com- procedural knowledge (see Lovett & Ander-
plex learning takes longer than a few min- son, Chap. 1 7). Declarative knowledge has
utes and requires processes that are more traditionally been defined as knowledge of
complicated than the associative processes facts or knowing that, whereas procedural
needed to memorize pairs of words. The knowledge is knowing how (Anderson, 1 976;
materials that support complex learning – Winograd, 1 975 ). Declarative knowledge is
such as texts, illustrations, practice prob- descriptive and use independent. It em-
lems, and instructor feedback presented in bodies concepts, principles, ideas, schemas,
classrooms and elsewhere – are often dif- and theories (Ohlsson, 1 994, 1 996). Exam-
ficult to understand and might require ex- ples of declarative knowledge are the laws
tensive processing. For example, learning of the number system, Darwin’s theory of
about the human circulatory system requires evolution, and the history of the Panama
many component processes, such as inte- Canal. The sum total of a person’s declar-
grating information from several sources, ative knowledge is his or her understanding
generating inferences, connecting new infor- of the way the world, or some part or aspect
mation with existing knowledge, retrieving of the world, works, independently of the
appropriate analogies, producing explana- particular tasks the person undertakes.
tions, coordinating different representations Procedural knowledge, such as how to op-
and perspectives, abandoning or rejecting erate and troubleshoot a machine, how to
prior concepts that are no longer useful, and solve a physics problem, or how to use a
371
computer text editor, is prescriptive and use

Basic Characteristics of Declarative
specific. It consists of associations between
Knowledge
goals, situations, and actions. Research in
cognitive neuroscience supports the reality
Size of Knowledge Base
of this distinction between declarative and
procedural knowledge (Squire, 1 987). The most basic observation one can make
The acquisition of complex procedural about declarative knowledge is that humans
knowledge has been extensively investigated have a lot of it. There are no precise esti-
in laboratory studies of skill acquisition, mates of the amount of knowledge a person
problem solving, and expertise (Ericsson, possesses, but two attempts at an estimate
1 996; Feltovich, Ford, & Hoffman, 1 997; see seem well grounded. The first is an esti-
Novick & Bassok, Chap. 1 4), and in field mate of the size of the mental lexicon. The
studies of practitioners (Hutchins, 1 995 ; average college-educated adult knows be-
Keller & Keller, 1 996). Issues that have tween 40,000 and 60,000 words (Miller,
been explored include the role of percep- 1 996, pp. 1 3 6–1 3 8). The total number of
tual organization in expert decision mak- words in the English language is larger than
ing, the breakdown of goals into subgoals, 1 00,000. Because concepts only constitute a
the effect of ill-defined goals, the nature of subset of declarative knowledge, this repre-
search strategies, choices between compet- sents a lower bound on the size of a person’s
ing strategies, the conditions of transfer of declarative knowledge base. Second, Lan-
problem-solving strategies from one prob- dauer (1 986) estimated how much informa-
lem context to another, the effect of alter- tion, measured in bits, people can remem-
native problem representations, the role of ber from a lifetime of learning. His estimate
collaboration in complex tasks, and so on. is 2 × 1 0ˆ9 bits by age 70. It is not straight-
As is obvious in this chapter, the issues rel- forward to convert bits to concepts or pieces
evant to the study of complex procedural of knowledge, but even very fast comput-
learning are different from those relevant ers use only 3 2 or 64 bits to encode one
to the study of complex declarative learn- basic instruction. If we make the conserva-
ing. Because the acquisition of procedural tive assumption that it requires 1 000 bits to
knowledge has been researched so exten- encode one piece of knowledge, Landauer’s
sively in the past few decades, there are estimate implies that a person’s declarative
several recent reviews (Lovett, 2002; Van- knowledge base eventually approximates
Lehn, 1 989; see Novick & Bassok, Chap. 1 4). 1 million pieces of knowledge.
Therefore, this chapter focuses primarily These estimates apply to the size of the
on the acquisition of a body of declara- knowledge base as a whole. At the level of
tive knowledge. individual domains, estimates of the size of
The study of complex declarative learning domain-specific knowledge bases tend to re-
is still in its infancy and has not yet produced sult in numbers that are comparable to es-
a unified theory or paradigmatic framework. timates of the mental lexicon. For exam-
The organization of this chapter is meant ple, Simon and Gilmartin (1 973 ) estimated
to suggest one form that such a framework the number of chess piece configurations –
might take. In the first section, we describe chunks or patterns – known by master play-
basic characteristics of complex declarative ers to be between 1 0,000 and 1 00,000. We
knowledge. In the second section, we clas- do not know whether this is a coincidence
sify the different types of changes that oc- or a symptom of some deeper regularity.
cur in declarative knowledge as one learns. In short, even without a precise definition
This classification is the main contribution of what is to count as a unit of knowledge,
of the chapter. The third section is a brief the average person’s declarative knowledge
treatment of the so-called learning paradox base must be measured in tens of thousands,
(Bereiter, 1 985 ). We end with a few conclud- or more likely hundreds of thousands, of
ing remarks. units. How all this knowledge – the raw
complex declarative learning 373
material for reasoning and thinking – is ac- a hierarchy. The standard taxonomies for an-
quired is clearly a nontrivial, but under- imals and plants are prototypical examples.
researched, question. In contrast, relations such as cause–effect and
before–after produce chain-like structures. In
general, the representations of domains are
Organization
locally structured by their dominant relations.
Knowledge does not grow as a set of iso- The semantic network idea claims that
lated units but in some organized fashion. all knowledge is interrelated, but it does
To capture the organization of the learners’ not propose any single, overarching struc-
declarative knowledge, cognitive scientists ture for the network as a whole. Concepts
operate with three distinct representational and assertions are components of domains,
constructs: semantic networks, theories, and but domains are not components of a yet
schemas (Markman, 1 999). higher level of organization. Domains relate
The key claim behind semantic networks to each other in a contingent rather than
is that a person’s declarative knowledge base systematic way. Informal observations sup-
can be thought of as a gigantic set of nodes port this notion. We have one concept hi-
(concepts) connected by links (relations). erarchy for tools and another for furniture,
All knowledge is interrelated, and cognitive but the node lamp appears in both. Home
processes, such as retrieval and inferencing, decorating is not a subset of cooking, or vice
operate by traversing the links. Early com- versa, but the two share the kitchen. The con-
puter simulations of long-term memory for cept of tangled hierarchies (Hofstadter, 1 999)
declarative knowledge explored variants of describes one aspect of local, unsystematic
this network concept (Abelson, 1 973 ; An- contact points between internally structured
derson & Bower, 1 973 ; Norman & Rumel- domains. These comments are somewhat
hart, 1 975 ; Quillian, 1 968; Schank, 1 972; see speculative because there is little cognitive
Medin & Rips, Chap. 3 ). research aimed at elucidating the structure
Because the distance between two nodes of the declarative knowledge base as a whole.
in a semantic network is determined by Domains can also be represented as the-
the number of relations one must traverse ories. Theories are “deep” representations
to reach from one to the other, semantic (borrowing a term from social psychologists,
networks implicitly claim that declarative see Rokeach, 1 970) in the sense of having
knowledge is grouped by domain. We use well-articulated center-periphery structures.
the term “domain” to refer to both infor- That is, a theory is organized around a small
mal areas of knowledge, such as home dec- set of core concepts or principles – big ideas –
orating, eating at a restaurant, and watch- on which the rest of the elements in the
ing sports, and formal disciplines, such as domain are dependent. The core knowl-
botany, linguistics, and physics. Pieces of edge elements are typically fundamental and
knowledge that belong to the same domain abstract, whereas the peripheral ones are
are similar in meaning and therefore clus- based on, derived from, or instances of the
ter together functionally. Consistent with core ones. The most pristine examples of
this notion, membership in the same domain center-periphery structures are the formal
tends to produce higher similarity ratings, axiomatic systems of mathematics and logic
stronger priming effects, and other quanti- in which a small set of chosen axioms pro-
tative behavioral consequences; descriptions vide a basis for the proofs of all other the-
of these well-known effects can be found orems in a particular formal theory, and
in textbooks in cognitive psychology (e.g., natural science theories, such as Newton’s
Ashcraft, 2002; Reisberg, 2001 ). theory of mechanical motion, Darwin’s the-
The structure of any domain representa- ory of biological evolution, and the atomic
tion depends on the dominant relations of theory of chemical reactions. These theories
that domain. If the dominant relation is set are obviously experts’ and novices’ represen-
inclusion, the representation is organized as tations of those same domains and may or
may not exhibit a similar structure, indicat- increasing expertise a law of mental repre-
ing that change in structure is one dimension sentation, and to what extent is it an adap-
of complex learning. For example, DiSessa tation to the objective structure of domains,
(1 988, 1 993 ) argued that novice knowledge remains an open question.
of mechanical motion is not theory-like at The network concept codifies the intu-
all but is better thought of as an irregular ition that everything is related to everything
collection of fragments (see Smith, DiSessa, else, and the theory concept codifies the in-
& Roschelle, 1 995 , for a modified version of tuition that some knowledge elements are
this view). more important than others. The concept
Other cognitive scientists, however, pre- of a schema, however, codifies the intuition
fer to represent the novices’ understandings that much of our declarative knowledge
of the natural world as intuitive theories in represents recurring patterns in experience
deliberate analogy with the explicit and cod- (see Holyoak, Chap. 6). Although the term
ified theories of scientists and mathemati- “schema” has never been formally defined,
cians (Gopnik & Meltzoff, 1 997; Gopnik & the key strands in this construct are nev-
Wellman, 1 994; McCloskey, 1 983 ; Wiser & ertheless clear. To a first approximation, a
Carey, 1 983 ). By referring to someone’s schema is a set of relations among a set of
naive representation as a theory, one implies slots or attributes, where the slots can be
specifically that the representation shares thought of as variables that can take values
certain characteristics with explicit theories; within a specified range (Bobrow & Collins,
most prominently that it has a center- 1 975 ; Brewer & Nakamura, 1 984; Marshall,
periphery structure.1 1 995 ; Minsky, 1 975 ; Norman & Rumelhart,
A well-developed center-periphery struc- 1 975 ; Thorndyke, 1 984). Take the concept
ture is often the hallmark of an expert’s rep- of “cousin” as an example. A cousin can be
resentation of a domain, and a comparison defined by a schema containing slots such as
between novices’ and experts’ representa- children, parents, and siblings along with a
tions of the same domain often reveals dif- collection of relations such as parent-of and
ferences in the “depth” of their represen- sibling-of:
tations. However, one can raise the ques-
tion of whether “depth” should also be con- (cousin-of y w) = def[(parent-of x y)
strued as a characteristic of the domain it- ×(sibling-of z x)(parent-of z w)] (Eq. 1 6.1 )
self. That is, are some domains intrinsically
“deep” whereas others not, so that a center- To say that a person understands that Steve
periphery structure is not an appropriate (slot y) and Bob (slot w) are cousins is to say
representation for some domains? If so, we that he or she knows that Steve (slot y) is
would expect neither experts nor novices the son of Carl (slot x), Carl is the brother
to construct “deep” representations of those of John (slot z), and John is the father of
domains. For example, in informal everyday Bob (slot w). The slots are associated with
domains such as home decorating or eating at ranges of appropriate values. Being a child,
a restaurant, the center-periphery structure Steve must be younger than Carl; thus, slot
is certainly less salient. (However, even if an y might have an age range of 1 to 5 0 years
everyday domain such as entertaining might old, and slot x might have an age range of
not have a principled theory, its subdomain 21 to 85 years old. Similarly, slot y can have
of formal table setting does; Bykofsky & Far- the values of being either a male (a son) or a
gis, 1 995 , pp. 1 44–1 46; Tuckerman & Dun- female (a daughter).
nan, 1 995 , pp. 1 76–1 77.) Moreover, even for Schemas are bounded units of knowl-
informal domains such as cooking for which edge, and it is essential to their hypothe-
we as novices might claim to lack deep prin- sized function that they are retrieved or ac-
ciples, many professional chefs would dis- tivated as units. That is, if one part of a
agree. Thus, to what extent is the pervasive schema (relation or slot) is activated, there is
striving for a center-periphery structure with a high probability that the rest of the schema
will also be retrieved. Schemas are typically within the larger notion of a semantic net-
abstract precisely because they represent work. For a schema to be a distinct represen-
recurring patterns in experience. Level of tational entity, there has to be a well-defined
abstraction can vary (Ohlsson, 1 993 a). boundary between the schema and the rest
There are many variants of the schema of the knowledge network. (If not, activa-
idea in the cognitive literature. In the clas- tion will spread evenly across the nodes and
sic chess studies of deGroot (1 965 ) and links in the schema and the nodes and links
Chase and Simon (1 973 ), chess experts were that are not in the schema, which contra-
found to know by heart thousands of board dicts the central claim of schema theory
patterns (each pattern consisting of a few that the probability of spreading from one
chess pieces arranged in a meaningful con- node within the schema to another node
figuration), and these familiar patterns al- within the schema is higher than spread-
tered their perception of the board to suggest ing to a node outside the schema.) How-
promising moves. Similar findings regarding ever, the concept of a network does not
the power of perceptual patterns to influ- provide any obvious way to explain what
ence high-level cognition can be seen in a would constitute such a boundary other than
physician’s ability to read X-rays (Lesgold to assume that links among nodes within a
et al., 1 988) and a fire fighter’s ability to schema are more strongly connected than
size up a fire (Klein, 1 998). Similarly, there links among nodes between schemas (Chi &
is evidence to show that experts’ program- Ceci, 1 987; Rumelhart, Smolensky, McClel-
ming knowledge includes frame-like struc- land, & Hinton, 1 986). The differentiation
tures called plans (Soloway & Erhlich, 1 984), in the strength of linkages can create clusters
which are stereotypical situations that occur that can be conceived of as schemas (Chi &
frequently in programming: looping, accu- Koeske, 1 983 ).
mulating values, and so forth. These basic The relations between a schema and a
plans not only serve as the building blocks theory are equally unclear. One can con-
when writing programs, but they are also ceptualize a schema as a tool for organizing
necessary for comprehension of programs. information, but it is not obvious whether
Scripts are higher-order knowledge struc- a schema makes assertions or claims about
tures that represent people’s knowledge of the world. In this conception, schemas are
informal or everyday events such as eating not theories, but people obviously have the-
in a restaurant or visting the dentist’s office ories. Finally, any explication of the rela-
(Schank & Abelson, 1 977). Explanation pat- tion between networks and theories must
terns are schemas for how to construct ex- specify how the center-periphery structure
planations of particular types (Kitcher, 1 993 ; that is intrinsic to theories can be embedded
Ohlsson, 2002; Ohlsson & Hemmerich, within networks.
1 999; Schank, 1 986). Yet other schema-like In this chapter, we take the stance that
constructs have been proposed (e.g., Collins networks, theories, and schemas are three
& Ferguson, 1 993 ; Keegan, 1 989; Machamer partially overlapping but distinct theoretical
& Woody, 1 992). Chunks, explanation pat- constructs. Different aspects of the organi-
terns, frames, plans, and scripts are vari- zation of declarative knowledge are best un-
ants of the basic idea that much declarative derstood with the help of one or the other
knowledge consists of representations of re- of these constructs, or with some mixture of
curring patterns. For simplicity, we use the the three.
term schema throughout this chapter to refer In summary, declarative knowledge bases
to all these constructs. are very large and they exhibit complex or-
Although the three constructs of net- ganization. The notion of semantic networks
works, theories, and schemas appear side by captures the fact that every part of a per-
side in the cognitive literature, the relations son’s knowledge is related, directly or indi-
between them are unclear. First, it is not rectly, to every other part. Representations
clear how a schema should be understood of particular domains vary in “depth,” that
is, the extent to which they are character- with some factual information that they did
ized by a central set of fundamental ideas not know before, and some of those facts
or principles to which other, more periph- are retained. The declarative knowledge base
eral knowledge units are related. Declarative continues to grow in size throughout the life
knowledge also represents recurring patterns span, albeit perhaps at a slower rate as a
in experience with schemas, small packets person ages (Rosenzweig, 2001 ). Rumelhart
of abstract structural information that are and Norman (1 978) referred to this type of
retrieved as units and used to organize in- cumulative addition of pieces of knowledge
formation. These three types of organization as accretion.
cannot easily be reduced to each other, and For adults, cumulative acquisition of in-
explanations of change in complex knowl- dividual pieces of knowledge – facts – must
edge draw upon one or the other of these be pervasive and account for a large pro-
constructs or on some mixture of the three. portion of all learning. There is little mys-
tery as to the processes of acquisition. People
acquire them via perception and observa-
Types of Changes tion, via comprehension of oral and written
discourse, and via inductive (see Sloman &
Lagnado, Chap. 5 ) and deductive (see Evans,
The purpose of this section is to describe
Chap. 8) reasoning (i.e., by inferring new
different types of changes in the knowl-
facts from prior knowledge, or by integrating
edge base as one learns a body of declara-
new facts with old knowledge and making
tive knowledge. There exists no widely ac-
further inferences from the combination).
cepted taxonomy of changes in a body of
A particularly interesting property of
declarative knowledge. We chose to char-
accretion is that it is self-strengthening.
acterize changes as potentially occurring
Many psychology studies have confirmed
along seven dimensions. Presumably, differ-
that what is encoded, comprehended, and
ent cognitive mechanisms are responsible for
inferred depends on the individual learner’s
changes along different dimensions, but the
prior knowledge. For example, Spilich,
field has not specified with any precision
Vesonder, Chiesi, and Voss (1 979) pre-
learning mechanisms for every dimension.
sented a passage describing a fictitious base-
In each section here, we specify a dimen-
ball game. Not only was the amount of
sion of change, summarize some relevant
recall of the individuals with high prior
empirical evidence, and describe the cogni-
baseball knowledge greater (suggesting that
tive processes and mechanisms, if any, that
the information was properly encoded), but
have been proposed to explain change along
the pattern of recall also differed. The
that dimension.
high-knowledge individuals recalled more
information directly related to the goal
Larger Size
structure of the game (Spilich et al., 1 979)
Cumulative growth in size is a basic di- as well as the actions of the game and
mension of change in a body of declarative the related changes in the game states
knowledge. Adults obviously know more (Voss, Vesonder, & Spilich, 1 980), whereas
about the world in general than do chil- the low-knowledge individuals recalled the
dren (Chi, 1 976), and thus children are of- teams, the weather, and other less impor-
ten referred to as universal novices (Brown tant events and confused the order of the
& DeLoache, 1 978). Similarly, experts ob- actions. Moreover, high-knowledge individ-
viously know more about their domains of uals were better than low-knowledge indi-
expertise than novices (Chi, Glaser, & Farr, viduals at integrating a sequence of sentences
1 988). People routinely accumulate addi- (Chiesi, Spilich, & Voss, 1 979, exp. V ). In
tional facts about the world from sources short, prior knowledge leads to more effec-
such as news programs, texts, pictures, and tive accretion, which in turn generates more
conversations. These sources present people prior knowledge.
Although encoding, comprehending, and ganization of the network. In terms of intu-

inference processes augment the knowledge itive theories, cumulative growth might de-
base, they do not necessarily cause deep velop the relations between the core princi-
changes in prior knowledge. Consider once ples and peripheral knowledge items, while
again a baseball fan reading a newspaper ar- deeper learning either develops the core
ticle about a game. He or she will acquire principles or replaces or alters one or more of
facts that are obviously new – the score in the core principles. We discuss deeper learn-
the eighth inning cannot have been known ing processes later in this chapter.
before the game has been played – but the
facts about past games are not altered, and he
Denser Connectedness
or she is unlikely to acquire a new and differ-
ent conception of the game itself, although In network terms, connectedness can be de-
additional facts about baseball games per se fined as the density of relations between the
may be acquired. The key characteristic that knowledge elements. We would expect the
makes this an instance of accretion is that the density of connections in a representation to
learner already has a schema for a baseball increase as the learner acquires more knowl-
game, which presumably has slots for the ba- edge. This implication was supported by a
sic actions (throwing the ball), the highest- study in which we compared the node-link
level goal (winning the game), and other as- representation of a single child’s knowledge
pects of the game (Soloway, 1 978). Once of 20 familiar dinosaurs with his represen-
that schema has been acquired, to become tation of 20 less familiar dinosaurs (Chi &
increasingly knowledgeable is largely to ac- Koeske, 1 983 ; Figures 1 6.1 and 1 6.2). The
quire more knowledge that fits into those nodes and relations of the network were cap-
slots, as well as knowledge of subgoals and tured from the child’s generation protocols
relations between the basic actions and the of dinosaurs and their attributes. The repre-
goal (Means & Voss, 1 985 ). Similarly, read- sentation of the 20 more familiar dinosaurs
ers of narratives might acquire facts about was better connected into meaningful clus-
some fictional events, but they are unlikely ters in that it had more links relating the di-
to change their conceptions of causality, nosaurs that belonged to the same family,
time, or human motivation, arguably three as well as relating the dinosaurs with their
central schemas in comprehending narra- attributes of diet and habitat. The repre-
tives (Graesser, Singer, & Trabasso, 1 994; sentation of the 20 less familiar dinosaurs
Kintsch, 1 998). had fewer links within clusters, and thus
These observations imply that we need to the cluster were less densely connected, so
distinguish between two levels of learning. they appear less differentiated and more dif-
Comprehension as normally understood refused. In short, the better learned materi-
sults in the construction of a specific instance als were more densely connected in an or-
of a schema or the accretion of schema- ganized way, even though, overall, the two
relevant facts. New information is assimi- networks represented the same number of
lated to existing schemas. This is the basic nodes and links.
mechanism of accretion. The size of the rel- A special case of connectedness is the
evant declarative knowledge base increases mapping between layers. Layers can be de-
without fundamental changes in structure. fined in different ways in different domains.
Deeper learning, however, results in some For example, in the context of computer
structural modification of the learner’s prior programming we can conceive of the speci-
schema. The same distinction can easily be fication (the goals) as the highest layer, and
expressed within the other two theoretical the implementation (the data structures and
frameworks that we use in this chapter. In primitive actions of the program) as the low-
network terms, accretion adds nodes and est level. Designing and comprehending a
links without deleting or altering any prior program requires building a bridge between
ones, while deeper learning requires a reor- the specification and the implementation
Figure 1 6.1 . A child’s representation of 20 familiar dinosaurs. (From Chi & Koeske, 1 983 .)
(Brooks, 1 983 ). This bridge maps the im- The concept of consistency has been ex-
plementation to the specification through plored for decades in many areas of psy-
a series of layers. Expert programmers are chology, philosophy, and education. Social
skilled at linking high-level goals to specific psychologists investigated the consistency of
segments of programming code, whereas belief systems in the 1 95 0s and 1 960s (Abel-
less skilled programmers are more likely to son et al., 1 968; Festinger, 1 962/1 95 7; Fish-
link program goals to triggers like variable bein & Ajzen, 1 975 ; Heider, 1 944; McGuire,
names (Pennington, 1 987). Once again, we 1 968), and it remains an area of active re-
see that a person’s knowledge base appears search (Eagly & Chaiken, 1 993 ; Harmon-
to become more densely connected with in- Jones & Mills, 1 999). In the wake of Thomas
creased knowledge acquisition. Kuhn’s influential book The Structure of Sci-
Another special case of connectedness entific Revolutions (Kuhn, 1 970), the philo-
is between the conditions (declarative sophical debate about theory change in sci-
knowledge) and the actions (procedural ence came to focus on how scientists react to
knowledge). For example, experienced and inconsistencies (anomalies) between theory
inexperienced pilots knew equivalent num- and data, and this perspective carried over
bers of facts, but the inexperienced pilots into contemporary approaches to science ed-
failed to apply them in the context of ac- ucation (Hewson & Hewson, 1 984; Posner,
tions (Stokes, Kemper, & Kite, 1 997). One Strike, Hewson, & Gertzog, 1 982; Strike &
can interpret this to mean that the facts that Posner, 1 985 ). Education researchers were
the inexperienced pilots knew were not con- already primed for this focus by the tradi-
nected to their actions. tional concern in the Piagetian tradition with
Although the cited studies involved contradictions and inconsistencies as driv-
highly domain-specific relations, there are ing forces for cognitive development (Piaget,
many types of connections that play cen- 1 985 ). Unfortunately, the social, philosoph-
tral roles in declarative knowledge bases. ical, educational, and developmental liter-
For example, causal relations play a central atures on cognitive consistency are not as
role in the comprehension of narratives (see tightly integrated as they ought to be in light
Buehner & Cheng, Chap. 7; Trabasso & van of the nearly identical ideas that drive re-
den Broek, 1 985 ) and scientific theories (see search in these fields.
Dunbar & Fugelsang, Chap. 29), and hierar- It is reasonably certain that people pre-
chical relations such as set–subset relations fer consistent over inconsistent beliefs, at
form the backbone of taxonomic or classi- least locally, and that the discovery of lo-
ficatory knowledge structures (see Medin & cal inconsistency (or conflict; Ames & Mur-
Rips, Chap. 3 ). The general point is that, as ray, 1 982) triggers cognitive processes that
knowledge acquisition proceeds in a domain, aim to restore consistency, just as Piaget,
the learner’s representation of that domain Festinger, Kuhn, and others have hypothe-
will increase in connectedness in a meaning- sized. For example, Thagard (1 989, 2000)
ful way. explored a computational network model
called ECHO in which consistency is de-
fined as the lack of contradictions between
assertions and hypotheses. ECHO has suc-
Increased Consistency
cessfully predicted human data from a va-
The consistency of a knowledge represen- riety of situations, including the evaluation
tation refers to the degree to which the of scientific theories in light of data (Tha-
multiple assertions embedded in an intuitive gard, 1 992a) and the outcome of court cases
theory can, in fact, be true at the same time. (Thagard, 1 992b).
A person who claims that the Earth is round However, the relation between experi-
but who refuses to sail on the ocean for fear enced inconsistency and cognitive change
of falling over the edge is inconsistent in is complex. Several investigators sugges-
this sense. ted that conflict triggers efforts to restore
Figure 1 6.2 . A child’s representation of 20 less familiar dinosaurs (From Chi & Koeske, 1 983 .)
consistency only when the conflict is recog- argued that the naive conception of the
nized by the learner him- or herself through circulatory system as a single-loop system
reflection (Chi, 2000; Ohlsson, 1 999; Strike is flawed but nevertheless constrained by
& Posner, 1 992). When learners are alerted a consistent set of identifiable yet inaccu-
to inconsistencies and conflicts by an ex- rate principles. The learner can use such a
ternal source, they are more likely to ei- flawed conception systematically to gener-
ther assimilate or dismiss them (Chinn & ate incorrect explanations. (Chi, 2000). His-
Brewer, 1 993 ). Contradiction highlighted by torically, the Ptolemian epicycle theory of
an external source is likely to trigger change the solar system was as internally consistent
processes only if the learner is dissatisfied as the Keplerian theory, but obviously not
with his or her current conception (Posner as accurate.
et al., 1 982). Furthermore, there are many Consistency should also not be confused
ways to respond to inconsistency (Chinn with level of expertise. A more knowl-
& Brewer, 1 993 ; Darden, 1 992; Kelman & edgeable person does not necessarily have
Baron, 1 968), and not all modes of response a more consistent domain representation
increase consistency (as opposed to bypass- than someone who knows less. Ability to
ing the problem); we return to this topic operate with inconsistency has often been
in “The Learning Paradox: Monotonic and proposed as a sign of intellectual sophistica-
Nonmonotonic Change.” tion, whereas insistence on total consistency
Consistency should not be confused with has long been associated with dogmatism
veridicality. It is possible for a knowledge and lack of intellectual flexibility (Ehrlich
representation to be locally consistent and & Leed, 1 969; Rokeach, 1 960). A famous
yet be inaccurate. For example, we have historic example is the resolution – or lack
of resolution – within quantum mechanics and further knowledge acquisition expands

between the wave and particle models of each component into its constituent parts.
photons. These annoying entities insist on This type of process expands the knowl-
behaving as both waves and particles, and edge base, but in a particular way: It moves
since the time of Niels Bohr physicists have along part-of links (as opposed to kind-of
been content to let them be that way. links). In network terms, what was formerly
Consistency is sometimes used synony- a single node is expanded downward into an
mously with the term coherence, as in Tha- entire subtree.
gard’s (1 992a) use of the term explanatory co- Miyake (1 986) collected protocol data
herence to refer to the consistency between a that illustrated this type of change. She
hypothesis and evidence and other hypothe- showed that dyads, in attempting to under-
ses. However, consistency is distinct from stand how a sewing machine works, would
coherence in that, as a measure of a repre- move to lower and lower levels when they
sentation, coherence can be used to refer to recognized that they had not understood
the more well-defined connectedness in a se- the mechanism. For example, in figuring out
mantic representation in which the notion of how a stitch is made, one can understand it
contradiction or conflict is not an issue (Chi by explaining that the needle pushes a loop
& Koeske, 1 983 ). There is not enough ev- of the upper thread through the material
idence or agreement about the concept of to the underside so the upper thread loops
coherence to warrant discussing it as a sepa- entirely around the lower thread. However,
rate dimension of change. to understand how this looping mechanism
In summary, increased consistency is an works, one has to explain the mechanism at
important type of change in a declarative a yet finer level – namely, in terms of how
knowledge base, but it is distinct from the the bottom thread goes through the loop of
concepts of higher veridicality, more ad- the upper thread.
vanced knowledge, and coherence. Knowledge expansion via finer grain of
representation is quite common in the sci-
ences. The ultimate example is perhaps the
Finer Grain of Representation
reduction by chemists of material substances
Reality is not simple, and almost any aspect to molecules, described in terms of atoms,
of it can be described or represented at dif- which in turn are re-represented by physi-
ferent levels of grain. As one learns more cists in terms of elementary particles. We
about something, one often comes to under- should keep in mind though that it is the ex-
stand it at a finer grain. For example, learn- perts’ representations of these domains that
ing how the human circulatory system works are refined, and novices’ representations do
involves learning the components of the sys- not necessarily follow suit.
tem, such as the heart, the lungs, blood, and In analyzing biological systems such as the
blood vessels, and the relation that the con- circulatory system and machines such as
traction of the heart sends blood to different the sewing machine, the parts are objects of
parts of the body. the same kind as the system itself so they
Given this level of representation, one embody the part-of relations. In these exam-
can then ask, how does the heart contract? ples, valves and veins are of the same kind
To answer this question, one would have to and are both parts of the cardiovascular sys-
learn about the constituents of the heart: tem, and thread and a stitch are of the same
the properties of contractive muscle fibers, kind and are both parts of the sewing process.
the role of ventricle pressure, and so on. The link between the behavior of the parts
The learner might push yet toward another and the behavior of the whole can often be
level by asking how individual muscle fibers understood in terms of direct cause and ef-
contract. At each level the system is un- fect, or in terms of mechanical constraints
derstood in terms of its constituent parts, that force movement in one direction
rather than another, such as the valves in and processes that drive people to expand,
the veins. but the possibility of such expansion is one
However, there are systems in which the important dimension of change in declara-
relation between the finer and coarser levels tive knowledge.
of analysis is not of the same kind and the
behavior of the system is emergent (Chi, in
Greater Complexity
press; Wilensky & Resnick, 1 999). A traffic
jam is an example. A traffic jam is a grid- A distinct type of change in the knowledge
lock of cars such that cars can no longer structure is needed when the learner’s cur-
move at normal speed. However, the cars rent concepts are not sufficient to represent
are not of the same kind as the traffic jam. the phenomenon or system as a whole. The
In this kind of system, the (often) observ- thing to be understood cannot be assimilated
able macrolevel behavior (the traffic jam) within any schema the learner has available.
can be represented independently of the mi- The learner can respond by creating a more
crolevel objects (the moving cars). Each in- complex schema (see Halford, Chap. 22).
dividual car may be following the same sim- Although little is known about how more
ple rule, which is to accelerate if there is no complex schemas are developed, one plau-
car in front within a certain distance and to sible hypothesis is that they are created
slow down when there is another car within by combining or assembling several exist-
that distance. However, the jam itself can ing schemas (Ohlsson & Hemmerich, 1 999;
move backward even though the individual Ohlsson & Lehtinen, 1 997).
cars move forward. Thus, the behavior of the The creation of the theory of evolution
individual cars in a jam is independent of by natural selection is a case in point. In the
the jam. Nevertheless, the macrolevel pat- nineteenth century, many biologists knew
tern (the jam) arises from local interactions that there were variations within species and
among the microlevel individual cars. that many species produce more offspring
Learning about systems of this kind does than survive into adult (reproductive) age,
not necessarily proceed by unpacking parts and the fact (as opposed to the explanation)
into yet smaller parts but might more of- of inheritance was of course commonly ac-
ten occur by acquiring the two represen- cepted. The theory of evolution is the re-
tations of the system separately and then sult of assembling or combining these three
linking them. This type of learning pro- schemas in a very particular way into a new,
cess re-represents the macro in terms of more complex schema. The change process
the relationship between the micro- and here does not move along either kind-of or
the macrolevels to explain the macrolevel part-of relations, and it does not refine the
phenomenon (Chi, in press; Chi & Haus- grain of representation. Instead, it moves to
mann, 2003 ). greater complexity. The resulting schema is
It is not clear how often people are driven more complex than either of the prerequi-
to expand their representations downward site schemas. Such a move does not neces-
to a finer grain of analyses. In everyday life, sarily require a higher level of abstraction
people do not always feel the necessity to (see the next section). The prior principles of
connect phenomena at one level to phenom- intraspecies variation, inheritance, and dif-
ena at more fine-grained levels. For exam- ferential survival were already abstract, and
ple, people appear content to understand there is no significant increase in abstraction
the weather at the level of wind, tempera- in the theory that combines them.
ture, clouds, humidity, rain, and snow, with- The assembly process can be prompted.
out re-representing them at the finer lev- In one study, Ohlsson and Regan (2001 )
els of molecular phenomena available to the studied a laboratory version of the problem
professional meteorologist (Wilson & Keil, of the structure of DNA. Based on published
2000). We do not yet understand the factors historic accounts of the discovery of DNA,
we extracted eight different component con- tended to represent the same problems ac-
cepts that had to be combined to represent cording to their concrete surface compo-
the double-helix structure. These turned out nents, such as pulleys and inclined planes.
to be concepts that most educated adults can The point is that one and the same prob-
be expected to possess (e.g., parallel, pair- lem tends to be represented at these dif-
wise, inverse, complement). We found a lin- ferent levels of abstraction by two groups
ear relationship between the proportion of both of whom know the relevant principles. The
these eight concepts that were primed by novices in the Chi et al. (1 981 ) study knew
exercises prior to problem solving and the the relevant principles in the sense that they
time it took undergraduate students to solve could both state them and use them. How-
the laboratory version of the DNA problem. ever, they did not spontaneously represent
The assembly process can be understood problems in terms of those principles instead
as a combination of schemas. The key step of concrete properties. Somewhere along the
in combining schemas must be to align the path to expertise, the physicists came to do
slots of one schema to those of another. so (see Novick & Bassok, Chap. 1 4).
A natural selection schema does not work Re-representing at a higher level of ab-
unless the species that exhibits variation is straction (using already acquired abstrac-
also the species that is subject to selections) is an interesting dimension of change,
tive pressure. The assembly process might but relevant empirical studies are scarce. As
share features with conceptual combination, is the case with most other types of changes,
although the latter process refers to sin- we lack knowledge of the conditions that
gle lexical concepts consisting of unfamil- prompt people to move along this dimension
iar noun-noun or adjective-noun pairs, such and the exact nature of the relevant cogni-
as pet fish (Costello & Keane, 2000; Hamp- tive mechanism.
ton, 1 997; Medin & Shoben, 1 988; Smith,
Osherson, Rips, & Keane, 1 988; see Medin
Shifted Vantage Point
& Rips, Chap. 3 ). We know little about the
frequency and prevalence of moves toward Changing the level of abstraction is closely
creating greater complexity at either the sin- related to, but different from, the process
gle concept or schema levels, and less about that we in normal parlance call change of
the conditions that prompt people to engage perspective. A classic study by Anderson and
in such moves. Pichert (1 978) demonstrates that this phrase
does not merely refer to a metaphor but to
a concrete psychological process. They gave
Higher Level of Abstraction
subjects a text to read that described a home.
The concept of abstraction, in terms of They instructed subjects to take the perspec-
where it comes from or how it is derived, tive of either a burglar or a prospective home
continues to be controversial after two mil- buyer. The results showed that the instruc-
lennia of scholarship. Besides the issue of tions led the subjects to remember different
how abstractions are formed, there is a details, even when the perspective-taking in-
second, frequently overlooked meaning of structions were given after the subjects had
moving toward higher abstraction: Given a read the text.
preexisting set of abstractions, it is possible Shifting one’s point of view can facilitate
to re-represent an object or a domain at a problem solving. For example, Hutchins and
higher level of abstraction. For example, Chi, Levin (1 981 ) used the occurrence of deictic
Feltovich, and Glaser (1 981 ) showed that verbs, such as “come,” “go,” “take,” “send,”
physicists represented routine physics prob- and “bring,” and place adverbs, such as
lems in terms of the deep principles that “here,” “there,” and “across,” in think-aloud
would be needed to construct a solution, protocols to determine the point of view of
whereas physics novices (those who have subjects solving the Missionaries and Can-
taken one course in college with an A grade) nibals problem. They found that problem
solvers shift perspective as they solve the Gillingham, 1 999; Johnson, Moher, Ohls-
problem. Initially, they view the river that son, & Leigh, 2001 ; Ohlsson, Moher, & John-
the Missionaries and Cannibals have to cross son, 2000). Deep understanding of this topic
from the left bank. Later in the problem- requires that a person can coordinate the
solving process, they view the river from normal – we call it ego-centered – perspec-
the right bank. One of their most interest- tive of a person walking around on the Earth
ing findings was that when solvers were in with an exo-centered perspective from a hy-
an impasse after having two nonprogressive pothetical (and physically unattainable) van-
moves out of their current problem-solving tage point in space. Such perspective coordi-
state, they could resolve the impasse if they nations can be very complex. For example,
shifted their point of view. In short, the consider sunsets. What in the ego-centered
somewhat mysterious process of “taking” a perspective appears as the sun disappear-
particular perspective should not be under- ing behind the horizon appears in the exo-
stood as purely metaphorical; this form of centered perspective as movement of the
re-representation has real consequences for border between light and shadow across the
cognitive processing. surface of the Earth owing to the latter’s ro-
In the cases discussed, the perspective tation. Clearly, the mapping between these
shift was transient. There is some evidence two views of the event is far from natural,
to suggest that children become more able to simple, or direct, and it requires consider-
shift perspective as they grow older (see Hal- able learning and instruction to develop the
ford, Chap. 22). For example, Shatz and Gel- exo-centered perspective and to link it to
man (1 973 ) showed that young 2-year-olds everyday perception.
could not adjust their speech to the age of These and related studies demonstrate
the listener, whereas 4-year-olds did adjust the occurrence of shifting vantage points and
their speech, depending on whether they document the advantages they bring. This
were speaking to another peer or an adult. type of change must be an important dimen-
This suggests that older (but not younger) sion of growth of declarative knowledge.
children are capable of shifting their per-
spectives to that of the listeners. Similarly,
Discussion
Piaget and Inhelder (1 95 6) showed that
older but not younger children are capable We suggest that a complex body of declar-
of understanding what another viewer might ative knowledge over time moves along
see, when the other person views it from an- multiple dimensions of change: size, con-
other perspective. One might assume that as nectedness, consistency, grain, complexity,
children mature they acquire more knowl- abstraction, and vantage point. Undoubt-
edge that enables them to shift perspective, edly, there are other dimensions along which
and another study confirms this interpreta- declarative knowledge also changes dur-
tion because it manipulated knowledge di- ing learning, such as coherence, but each
rectly. We gave high school students oppor- of these has at least some support in
tunities to play with a computer simulation empirical studies.
that allows them to take different roles in a Although we separate these seven di-
business context, such as being the vice pres- mensions analytically for purposes of this
ident of a bank. Students were much more chapter, we do not suggest that a cognitive
able to take the perspective of the client after change typically moves along a single dimen-
playing with the simulation, whereas they sion. Most complex knowledge acquisition
were only able to take the perspective of processes will involve simultaneous move-
the bank before playing with the simulation ment along more than one dimension. For
(Jeong, Taylor, & Chi, 2000). example, learning about chemistry involves
In another series of studies, we attempted thinking of material substances as solids, liq-
to teach first-grade children about the shape uids, and gases, instead of, for example, iron,
of the Earth (Johnson, Moher, Ohlsson, & water, and air; this is a move toward higher
abstraction. At the same time, the chemistry tions of triggering conditions and the pro-
student acquires a finer-grained analysis of cesses of change.
material substances in terms of atoms and The seven types of changes discussed so
molecules and a large number of previously far expand the learner’s prior knowledge
unknown isolated facts about such sub- base in a monotonic way in that the prior
stances (e.g., their melting points). He or she knowledge need not be rejected or over-
might have to assemble a new schema such written. It is possible to move toward larger
as dynamic equilibrium, which involves shift- size, denser connectedness, finer grain of
ing the vantage point between the atomic representation, greater complexity, higher
level (where there are continuous processes) abstraction, and a different vantage point
and the emergent macrolevel (where there without rejecting or replacing one’s prior
is, nevertheless, stability). A year of high knowledge representation. The one excep-
school chemistry is likely to require move- tion is a move toward increased consistency.
ment along all seven of these dimensions. We To achieve increased consistency, one might
suggest that this is typical in the acquisition have to reject or abandon some prior knowl-
of complex declarative knowledge. edge or belief. The next section discusses
Given that a representation can change such nonmonotonic changes.
in all the ways we have described previ-
ously, research on the acquisition of complex
declarative knowledge encounters a partic-
ular difficulty – how to assess the effects The Learning Paradox: Monotonic
of different learning scenarios and training and Nonmonotonic Change
procedures. The study of declarative knowl-
edge contrasts in this respect with the study It is tempting to think of a novice as primar-
of procedural knowledge. Learning of pro- ily lacking knowledge. The learning process
cedural knowledge such as problem solv- is then naturally seen as a process of accre-
ing can be assessed relatively straightfor- tion – filling a void or adding information.
wardly by measuring the degree to which Some of the types of changes described in
a learner’s representation of the procedure the previous sections, such as increased con-
approximates the correct solution procedure nectedness and moves toward finer grain of
in terms of the rules and strategies. Learning representation, also have this cumulative na-
of declarative knowledge, however, must be ture because they significantly extend prior
measured in light of the seven dimensions knowledge. However, several of the other
mentioned previously. This is perhaps the types of changes, such as greater complex-
most important methodological problem in ity, higher level of abstraction, and shifting
the study of complex declarative knowledge. vantage point, do not have this cumulative
Although we understand the character nature. Rather, they go further in that they
of these seven dimensions relatively well, re-represent the domain rather than merely
we know little about what triggers people add to it. However, in either the cumula-
to move along one or the other dimension. tive cases or the re-representation cases, the
What are the factors that trigger someone changes do not require that prior knowl-
to move to a finer grain or to another level edge be rejected or replaced. For exam-
of abstraction? Under what conditions will a ple, re-representing something at a higher
learner move to an alternative vantage point? level of abstraction does not require rejec-
Similarly, we do not fully understand the nation of the prior representation because ab-
ture of the processes that bring about the stract and concrete representations of the
changes in each dimension. Empirical re- same thing are not mutually incompatible.
search has been focused on documenting the We can switch back and forth between con-
psychological reality of each type of change ceptualizing something as a hammer and as a
and has not sufficiently pursued the ques- tool without any need to make a permanent
choice between these two concepts. Thus, in tion is in the direction of the force, an ob-
these types of re-representation process, the ject will move with constant velocity as long
old and the new representation can coexist, as it is under the influence of a constant
as well as the re-representing of two compo- force, and the velocity of an object is pro-
nent concepts or schemas into a more com- portional to the magnitude of the applied
plex concept or schema via assembly. The force. When there is no force, an object will
representations for the original concepts re- either slow down, if it is moving, or remain
main. In short, these types of cumulative and at rest. Motion is thus misconceived as being
re-representational changes are monotonic. produced by force, as opposed to the more
However, there are learning scenarios in accurate view that motion is a natural (i.e.,
which (1 ) the learner has a well-developed equilibrium) state that will continue indef-
intuitive theory of the target domain, and initely unless some force interferes with it.
(2) the subject matter to be acquired di- Students’ intuitive theory is more like the
rectly contradicts one or more of the core impetus theory held by Jean Buridan and
principles or beliefs of that intuitive theory. other fourteenth-century thinkers (Robin &
Successful learning in scenarios with these Ohlsson, 1 989) than like the inertia principle
properties requires that the learner go be- that is central to the Newtonian theory. Mis-
yond mutually compatible representations. conceptions about other topics, such as bio-
The learner has to re-represent the domain logical evolution, are also well documented
in the more fundamental sense of abandon- (Bishop & Anderson, 1 990; Brumby, 1 984;
ing or rejecting (i.e., stop believing) what he Demasters, Settlage, & Good, 1 995 ; Ferrari
or she believed before, and replacing it with & Chi, 1 998; Lawson & Thompson, 1 988).
something else. We refer to this as nonmono- The empirical findings not only show that
tonic change. novices possess well-developed misconcep-
Science education provides numerous ex- tions about many domains (Reiner, Slotta,
amples of prior conceptions that must be Chi, & Resnick, 2000) but that these mis-
abandoned. Research on so-called miscon- conceptions persist in the face of instruction
ceptions has documented that people have and other innovate kinds of intervention.
complex and rich conceptions about do- For example, many science misconceptions
mains in which they have not received ex- in Newtonian mechanics are robust and re-
plicit instruction, but for which everyday main after instruction, even at very selec-
experience provides raw material for intu- tive academic institutions (DiSessa, 1 982;
itive theory formation (Confrey, 1 990). Re- Caramazza, McCloskey, & Green, 1 980).
search on such spontaneous science theories With respect to mechanics, innovative in-
has focused on physics, chemistry, and biol- structional interventions include using care-
ogy, although social science and nonscience fully chosen analogies (Clement, Brown, &
domains have also been investigated (Limon, Zietsman, 1 989; Driver, 1 987), deliberately
2002). (The older social psychology work invoking cognitive conflict (Posner et al.,
on belief systems focused primarily on in- 1 982), engaging in deliberate confrontation
tuitive theories of society and religion; see, (Licht, 1 987), or using a succession of in-
e.g., Abelson et al, 1 968; Rokeach, 1 970.) creasingly sophisticated models (White &
Mechanics (forces and motion) is by Frederiksen, 1 990). Although it is difficult to
far the most investigated domain. The evaluate the outcomes of such interventions,
dominant misconception in this domain is it appears that students at best acquire the
that motion implies force (Clement, 1 982; scientific conception, perhaps in an encapsu-
DiSessa, 1 983 , 1 988; Halloun & Hestenes, lated form, while maintaining their initial in-
1 985 ; McCloskey, 1 983 ; Minstrel, 1 982). tuitive conception (Johsua & Dupin, 1 987),
Students assume that when an object is in which is not quite the intended outcome.
motion, the motion is caused by a force be- There are at least three reasons (presented in
ing applied to the object, the object’s mo- the next section) why misconceptions are so
resistant to instruction that nonmonotonic ductive change processes. Social psycholo-

change often fails. gists (Abelson et al., 1 968) and cognitive re-
searchers (Chinn & Brewer, 1 993 ; Darden,
Distortion via Assimilation 1 992) have converged on very similar lists
of potential modes of response to inconsis-
As was mentioned earlier, in learning, new
tency. They agree that inconsistency often
information is typically assimilated to exist-
triggers evasive maneuvers that dismiss the
ing schemas. Thus, one reason that miscon-
inconsistency in some other way than by
ceptions persist is that, when an instructor
revising the relevant knowledge. The most
states the more veridical theory so it contra-
basic mode of response is abeyance, that is,
dicts the learner’s prior misconceived knowl-
to postpone dealing with a contradiction on
edge, the new information is typically dis-
the grounds that not enough information is
torted in the process of being assimilated
available to decide what, if anything, fol-
to the prior misconceived knowledge. To
lows. One step removed from doing nothing
illustrate, consider a young child who be-
is bolstering: The person who encounters in-
lieves that the Earth is as flat as it looks
formation that contradicts some concept or
to the unaided eye. What happens if he or
belief X hastens to seek out supporting or
she is told that the Earth is round? Nuss-
confirming evidence that supports X. Fes-
baum (1 979; 1 985 ), Nussbaum and Novak
tinger (1 962/1 95 7) and others hypothesized
(1 976), Vosniadou (1 994a, 1 994b), and Vos-
that the need to reduce an inconsistency is
niadou and Brewer (1 992) observed two
proportional to the ratio of supporting to
intuitive schemas that we are tempted to
contradicting pieces of information. Thus,
interpret as consequences of distortion by
by drowning the contradicting piece of in-
assimilation. Some children draw the Earth
formation in a flood of confirming ones, it
as a flat entity with a circular periphery (like
is possible to lower the need to resolve the
a pancake); others claim that the Earth is
contradiction and hence to keep going with-
spherical but hollow and half-filled with dirt
out altering one’s knowledge. Another pro-
(thus providing a flat surface for people to
cess with a similar outcome is recalibration,
walk on). In both cases, the Earth is both
that is, to lower the importance one attaches
flat and round. Instruction to the effect that
to the conflicting thoughts, thus making the
the Earth is round was thus assimilated to a
conflict itself less important and easier to ig-
prior flat-Earth conception without any sig-
nore. (A student might decide that he or
nificant changes in the latter.
she is not interested in science after all, so it
does not matter what they teach in science
Evasion of Conflicts
courses.) These processes constitute evasive
Distortion via assimilation is most plausible modes of response to inconsistent informa-
when the learner is unaware of the conflict tion, but they are not learning processes be-
between his or her prior knowledge and new cause there is no constructive change in the
information. The previous example involv- person’s knowledge.
ing the shape of the Earth illustrates this
well; the young child is not aware that he
Lack of Computational Power
or she is interpreting the adjective “round”
in a different way than that intended by the In describing the seven dimensions of
adult speaker. This type of distortion can be changes, we sometimes speculated on the
reliably triggered in the laboratory by de- processes of change. What would happen if
liberately creating texts that violate a nor- the inconsistent information triggered one or
mal reader’s worldview (Graesser, Kassleer, more of the learning processes that we pro-
Kreuz, & Mclain-Allen, 1 998). posed in previous sections? Take the process
However, even if the conflict between of creating greater complexity via assembly
prior knowledge and new information is de- as example. In that process, a more complex
tected, it does not necessarily trigger pro- representation is created by combining two
or more existing representations. It is doubt- not an extension of old theories or ideas ever
ful whether this process could lead to a new, be acquired? Bereiter (1 985 ) referred to this
more veridical theory. Each of the assembled as the learning paradox.
representations will presumably be consis-
tent with the learner’s prior intuitive the-
ory, so they will lack veridicality. One cannot
Conclusions and Future Directions
combine two nonveridical representations to
create a third, veridical representation. For
example, learners’ naive conception of heat Despite the prevalence of distortion via as-
and temperature, when combined, do not similation to prior knowledge, evasion of
add up to the correct scientific conception conflicts, and lack of computational power,
of heat (Wiser & Carey, 1 983 ), nor can tele- nonmonotonic change does happen.
ological and Lamarckian ideas combine to Children do replace their childhood con-
form the principle of natural selection. ceptions with adult ones, some physics stu-
Although we do not spell out each argu- dents do succeed in learning Newtonian me-
ment here, a similar case could be made re- chanics, and scientists do sometimes replace
garding the processes responsible for each of even their most fundamental theories in the
the seven types of changes discussed in the face of anomalous data. Thus, there must be
previous section. None of them has the com- cognitive mechanisms and processes that can
putational power to create a new conception overcome the learning paradox. A theory of
that goes beyond its own conceptual inputs complex learning should explain both why
because, by definition, they are nonmono- nonmonotonic change has such a low prob-
tonic changes. ability of occurring, and how, by what pro-
In summary, the mere presence of contra- cesses, it happens when it does happen.
dictory information is not sufficient to trig- The study of such noncumulative learn-
ger productive cognitive change of the non- ing processes is as yet in its infancy. In this
monotonic kind. A conflict between prior section, we offer a small number of specu-
knowledge and new information might go lative proposals about how nonmonotonic
undetected, in which case the learner might learning processes can occur. These brief
blithely assimilate the new information to proposals are intended to serve as inspiration
prior knowledge, probably distorting it in the for further research.
process. Even if the learner detects the con-
flict, he or she might hold the new infor- Pathways to Nonmonotonic Change?
mation in abeyance rather than respond to
We describe below four mechanisms, along
it. If he or she feels a need to deal with the
with some empirical support. We then con-
contradiction, there is a repertoire of evasive
sider whether each of them can potentially
maneuvers, including bolstering and recali-
achieve nonmonotonic change.
bration of subjective importance, that will
make the contradiction less disturbing with-
out any revisions in prior knowledge. Finally, transformation via bootstrapping
the productive learning processes discussed One hypothetical path to a new theory is to
previously do not have the computational edit or revise one’s existing theory piece by
power to create a new conception that goes piece until the theory says something signif-
beyond the conceptual inputs to those pro- icantly different from what it said originally.
cesses. The prevalence of these three kinds We can conceptualize such a bootstrapping
of responses to encounters with contradic- process as a series of local repairs of a knowl-
tory information – distortion via assimila- edge structure. Local repairs require simple
tion, evading conflicts, and lacking computa- mechanisms such as adding links, deleting
tional power – raises the question of how an links, reattaching links, and so forth. The
intuitive theory can ever be replaced. That critical condition for local repairs is that the
is, how can a truly new theory or idea that is student recognize that the repairs are needed
by reflecting on the differences between his a stronger center-periphery structure and

or her existing knowledge and new knowl- deeper commitment to the core principles.
edge. We have some evidence that the accu- Rokeach (1 970) presented evidence from
mulation of local repairs can lead to a sig- other than scientific domains that knowl-
nificant transformation of a person’s men- edge elements are more resistant to change
tal model of the circulatory system from the more central they are. It is plausible that
a flawed single-loop model to the correct transformation via bootstrapping a sequence
double-loop model (Chi, 2000). of local repairs is less applicable the “deeper”
As a second example of bootstrapping, the domain, at least as long as the change has
Thagard (1 992a) analyzed the changes in to encompass the core principles to be com-
the French chemist Lavoiser’s conception of plete. So perhaps this bootstrapping process
matter during the critical years of the devel- cannot be considered a true nonmonotonic
opment of the oxygen theory of combustion. change mechanism.
Thagard shows how Lavoiser’s conception
of combustion can be modeled by a seman- replacement
tic network, and how that network is grad-
If stepwise revisions can only go so far to
ually transformed over several years as the
explain nonmonotonic change, what alter-
scientist is reflecting on the outcomes of em-
native is there? Knowledge structures can
pirical experiments. By adding and deleting
be replaced. That is, an alternative represen-
nodes and redrawing links, Thagard depicts
tation of a domain is constructed in paral-
Lavoisier’s knowledge network as undergo-
lel with a prior one through processes that
ing a gradual transformation such that its ini-
do not use the prior one as input. The old
tial state represents the phlogiston theory of
and the new representations then compete
combustion, but its final state represents the
for the control of discourse and behavior in
oxygen theory.
the course of question answering, explana-
How much can transformation via local
tion, reasoning, and problem solving. The
repairs explain? There are multiple expla-
new, presumably more veridical representa-
nations for why local repairs succeed in the
tion frequently wins, and the old one even-
case of the circulatory system. One reason
tually fades from disuse.
is that the transformation from a single-loop
model to a double-loop model crosses no on- Bottom-up Replacement. Replacement can
tological categories (Chi & Roscoe, 2002). proceed either bottom-up or top-down.
Another reason might be the relative lack First, consider a new representation built
of “depth” of this domain in the sense that it bottom-up. This might occur when the
cannot be represented by a center-periphery new knowledge is encountered in a context
structure. The single-loop principle does not that does not necessarily evoke the conflict-
deductively imply the other relevant facts ing prior knowledge. For example, students
about the circulatory system in the manner might experience science instruction as so
in which Newton’s three laws of motion im- distant from everyday experience that they
ply more peripheral statements within the build representations of what is taught in
domain of motion. The looser connection class that are independent from, and un-
between center and periphery might make connected to, the former. The outcome of
the single-loop principle easy to tinker with. such encapsulated knowledge is an ability to
Finally, there is a question of commitment solve textbook problems without enriched
(Ohlsson, 1 999). Although students believe understanding of relevant phenomena en-
that there is a single circulatory loop, this is countered in other contexts (everyday ex-
not one of their most cherished beliefs and perience, news reports, etc.). Owing to the
they probably do not experience it as im- compartmentalization of contexts, the con-
portant to their worldview. Tinkering even flict between the prior intuitive theory and
with the core principle of this domain might the new theory is not salient to the learner,
therefore come easier than in domains with and the construction of the new theory can
proceed without interference from prior Medin & Rips, Chap. 3 ) and social interac-
knowledge. tion (see Greenfield, Chap. 27). Because the
If matters remain in this state, it is doubt- topic of abstraction is discussed elsewhere in
ful whether this can be considered successful this volume, we do not intend to answer this
nonmonotonic learning. The crucial ques- question here.
tion is whether the new theory, once con- Side-stepping the issue of where an ab-
structed, can migrate into and usurp the stract schema comes from in the first place,
territory of the prior intuitive conception. we first need to know whether top-down re-
Successful nonmonotonic learning requires placement is possible, given that an abstract
that a phenomenon previously understood schema exists. To test the feasibility of this
within the intuitive theory begin to be un- top-down replacement process, we are in-
derstood within the new theory instead. structing students about a domain-general
abstract schema that might serve as a tem-
Top-Down Replacement.Consider the po– plate for understanding multiple concepts in
ssibility of top-down generation of a new many domain. One example is the schema
knowledge structure. An abstract schema of emergence (Chi, in press), which has ap-
might be acquired in an alternative domain plications in biology, chemistry, and physics.
and transferred wholesale to a new domain. It is plausible that direct instruction of this
An example of this hypothetical process is sort results in the de novo construction of an
provided by more recent attempts to under- alternative conception, as opposed to grad-
stand the operation of the immune system in ual transformation of a prior conception.
Darwinian terms. Philosophers and theoret-
ical biologists have attempted to formalize
Darwin’s theory of evolution (Thompson, transfer via analogy
1 989), and the resulting abstract schema has Existence of an abstract schema may not be a
been applied to the question of how the im- necessary requisite for the top-down process
mune system could produce antibodies for to work. A concrete schema from another
a wide variety of antigens. The Darwinian domain might serve as template if the two
answer is that the immune system continu- domains are easy enough to align that the
ally generates more or less random antibod- transfer process can operate via analogy (see
ies. High fit between antibodies and antigens Holyoak, Chap. 6). In this hypothetical pro-
triggers increased production of the former; cess, the learner acquires a schema in some
thus, the antigens themselves function as an source domain S; later, he or she is learning
environment that selects for the antibodies about some target domain T for which he or
that fight them (Gazzaniga, 1 992). The ac- she already has an intuitive theory. The new
curacy of this theory of the immune system information about T contradicts his or her
is not the issue here. It is an example of a prior intuitive theory about T but is analo-
process in which a complex abstract schema gous to what is known about S. If the learner
was transferred as a whole to provide a cog- creates a new representation for T based on
nitive template for a novel theory of a phys- what is known about S instead of building di-
iological process far removed from the evo- rectly on his or her current intuitive theory
lutionary processes of speciation and adap- of T, then he or she might avoid distortion
tation for which the schema was originally by assimilation.
constructed. We tested the reality of this transfer of
This top-down process is limited in that it concrete schema process in a virtual reality-
relies on the prior existence of an appropri- based scenario for teaching children that
ate abstract schema, which raises the ques- the Earth is round (Johnson et al., 1 999,
tion of where abstractions originate. This 2001 ; Ohlsson et al., 2000). We created a
issue has remained controversial for more virtual planet that was small enough so the
than two millennia. The standard sugges- consequences of sphericality were immedi-
tions include induction over exemplars (see ately perceivable. For example, even minor
movement through the virtual world made ical status. In those cases, successful learn-
objects visibly “appear” or “disappear” over ing requires that the learner re-represent
the horizon. Having acquired a notion of the entity as belonging to another ontolog-
living on a spherical planet in the context ical category, such as from a kind of sub-
of this fictional asteroid (about which the stance to a kind of process (Slotta, Chi, &
children were not expected to have any dis- Joram, 1 995 ).
torting prior views), we then supported, via This kind of ontological shift replaces a
a one-on-one dialogue, the analogical trans- prior conception with a new conception in
fer of that schema to the context of the terms of an entity’s ontological status. Thus,
Earth. Pre- to posttest comparisons between this process of ontological shift may qualify
the treatment and a dialogue-only control as a kind of a nonmonotonic mechanism.
group showed that the effect of prior learn-
ing in the virtual environment was positive
Toward a Theory of Learning
(albeit small in magnitude). We infer that
the schema for the virtual asteroid to some In 1 965 , Robert M. Gagné published The
extent served as template for the new con- Conditions of Learning, which summarized
ception of the Earth that we tried to teach what was known about learning at the time.
them. Hence, the learning paradox was over- His approach was the unusual one of assum-
come by stimulating the children to build a ing that there are multiple, distinct types of
representation of what life on a sphere is like learning processes distinguishable with re-
independent of their prior knowledge of the spect to their prerequisites, processes, and
Earth, and then encouraging the use of that results. He presented these in order of in-
representation as a template for building a creasing complexity, beginning with “sig-
new representation of the Earth. nal learning” (simple conditioning) and end-
ing with “problem solving” (Gagné, 1 965 ).
The most noteworthy feature of his ap-
ontological shift proach is signaled by the book’s title: For
Ontological categories refer to a set of cate- each type of learning, Gagné asked un-
gories to which people partition the world der which conditions that type of learning
in terms of its most fundamental features might occur.
(as opposed to characteristic and defining In our efforts to summarize what is
features; Chi, 1 997). For example, two high- known about the acquisition of complex
level categories that people are likely to declarative knowledge, we, too, have been
partition the different types of entities in led to present a list of different types of learn-
the world into are substances and processes. ing. In the realm of monotonic learning, we
Each type of entity is conceptualized as distinguish between seven different dimen-
having certain fundamental properties. For sions of change: size, connectedness, con-
example, substances such as sand can be sistency, grain, complexity, abstraction, and
contained in a box, but processes such as vantage point. In the realm of nonmonotonic
a baseball game, cannot; however, processes change, we have specified numerous non-
can last for 2 hours, but substances can- learning modes of response to contradictory
not. Misconceptions are miscategorizations information such as assimilation and evasive
of entities into wrong ontological categories. processes of abeyance, bolstering, recalibra-
For example, students typically misconceive tion, and explained why many of the learn-
heat or electricity as a substance that can ing mechanisms cannot in principle pro-
move from one location to another (Chi, duce true nonmonotonic learning. Finally,
Slotta, & de Leeuw, 1 994). Continued study even our proposals with respect to non-
of some entity that is initially believed as be- monotonic learning break down into multi-
longing to category X might reveal proper- ple processes such as transformation via lo-
ties that are not consistent with its ontolog- cal repairs, bottom-up compartmentalized
replacement, top-down replacement with for Interdisciplinary Research on Con-

the help of abstract schemas, transfer of constructive Learning Environments, CIRCLE,
crete schema via analogies, and ontologi- http://www.pitt.edu/∼circle) to the first
cal shift. It seems likely that, as the study author, and Grant BCS-990783 9 from
of complex learning progresses, cognitive the National Science Foundation to the
scientists will further our understanding of second author.
these replacement processes.
However, as Gagné clearly saw 40 years
ago, a list of learning processes is by itself an References
incomplete theory of learning. One would
expect such a theory to support explana-
Abelson, R. P. (1 973 ). The structure of belief
tion of learning outcomes, to allow us to
systems. In R. C. Schank & K. M. Colby
say why one subject matter is more diffi- (Eds.), Computer models of thought and language
cult to acquire than another, to predict the (pp. 287–3 3 9). San Francisco: Freeman.
success rate of particular instructional sce- Abelson, R. P., Aronson, E., McGuire, W. J.,
narios, and so on. However, to accomplish Newcomb, T. M., Rosenberg, M. J., & Tannen-
these and other theoretical tasks, we need baum, P. H. (Eds.). (1 968). Theories of cogni-
to know when, under which circumstances, tive consistency: A sourcebook. Chicago: Rand
one or the other learning process is likely McNally.
to occur. A predictive science of complex Ames, G. J., & Murray, F. B. (1 982). When
learning requires that we can specify the two wrongs make a right: Promoting cognitive
when and wherefore of the many process hy- change by social conflict. Developmental Psy-
potheses that spring from the imagination of chology, 1 8, 894–897.
the cognitive scientist. Nowhere is this more Anderson, J. R. (1 976). Language, memory, and
obvious than in the case of nonmonotonic thought. Hillsdale, NJ: Erlbaum.
learning. This, we suggest, is the research Anderson, J. R., & Bower, G. H. (1 973 ). Human
front in the study of complex declarative associative memory. New York: Wiley.
learning. Anderson, R. C., & Pichert, J. (1 978). Recall of
previously unrecallable information following
a shift in perspective. Journal of Verbal Learning
and Verbal Behavior, 1 7, 1 –1 2.
Note Ashcraft, M. H. (2002). Cognition (3 rd ed.). Up-
per Saddle River, NJ: Prentice-Hall.
1 . In social cognition research, intuitive theories Bereiter, C. (1 985 ). Toward a solution of the
are called belief systems (Fishbein & Ajzen, learning paradox. Review of Educational Re-
1 975 ; Rokeach, 1 960, 1 970). Although the two search, 5 5 (2), 201 –226.
constructs of intuitive theory and belief sys- Bishop, B., & Anderson, C. (1 990). Student con-
tem are essentially identical, this connection ceptions of natural selection and its role in evo-
between social and cognitive psychology has lution. Journal of Research in Science Teaching,
been overlooked on both sides (but see Schultz 2 7, 41 5 –427.
& Lepper, 1 996). Bobrow, D. G., & Collins, A. (1 975 ). Representa-
tion and understanding: Studies in cognitive sci-
ence. New York: Academic Press.
Brewer, W. F., & Nakamura, G. V. (1 984). The
Acknowledgments
nature and functions of schemas. In R. Wyer,
& T. Srull (Eds.), Handbook of social cognition
Preparation of this chapter was sup- (pp. 1 1 9–1 60). Hillsdale, NJ: Erlbaum.
ported in part by Grant 2001 003 05 from Brooks, R. (1 983 ). Towards a theory of the
the Spencer Foundation and in part by comprehension of computer programs. Inter-
Grant Number 97203 5 9 from the Na- national Journal of Man-Machine Studies, 1 8,
tional Science Foundation (for the Center 5 43 –5 5 4.
Brown, A. L., & DeLoache, J. S. (1 978). Skills, book on innovation (pp. 43 0–444). Oxford, UK:
plans and self-regulation. In R. S. Siegler (Ed.), Elsevier.
Children’s thinking: What develops? (pp. 3 –3 5 ). Chi, M. T. H., & Koeske, R. D. (1 983 ). Network
Hillsdale, NJ: Erlbaum. representation of a child’s dinosaur knowledge.
Brumby, M. (1 984). Misconceptions about the Developmental Psychology, 1 9, 29–3 9.
concept of natural selection by medical biology Chi, M. T. H., & Roscoe, R. D. (2002). The pro-
students. Science Education, 68, 493 –5 03 . cesses and challenges of conceptual change. In
Bykofsky, S., & Fargis, P. (1 995 ). The big book of M. Limon & L. Mason (Eds.), Reconsidering con-
life’s instructions. New York: Galahad. ceptual change: Issues in theory and practice (pp.
Caramazza, A., McCloskey, M., & Green, 3 –27). Netherlands, Dordrecht: Kluwer Aca-
B. (1 980). Naive beliefs in “sophisticated” demic.
subjects: Misconceptions about trajectories of Chi, M. T. H., Slotta, J. D., & de Leeuw, N.
objects. Cognition, 9, 1 1 7–1 23 . (1 994). From things to processes: A theory of
Chase, W. G., & Simon, H. A. (1 973 ). Perception conceptual change for learning science con-
in chess. Cognitive Psychology, 4, 5 5 –81 . cepts. Learning and Instruction, 4, 27–43 .
Chi, M. T. H. (1 976). Short-term memory lim- Chiesi, H. L., Spilich, G. J., & Voss, J. F. (1 979).
itations in children: Capacity or processing Acquisition of domain-related information in
deficits? Memory and Cognition, 4, 5 5 9–5 72. relation to high and low domain knowledge.
Chi, M. T. H. (1 992). Conceptual change within Journal of Verbal Learning and Verbal Behavior,
and across ontological categories: Examples 1 8, 25 7–274.
from learning and discovery in science. In Chinn, C. A., & Brewer, W. F. (1 993 ). The role
R. N. Giere (Ed.), Cognitive models of science of anomalous data in knowledge acquisition:
(pp. 1 29–1 86). Minneapolis: University of Min- A theoretical framework and implications for
nesota Press. science instruction. Review of Educational Re-
Chi, M. T. H. (1 997). Creativity: Shifting across search, 63 , 1 –49.
ontological categories flexibly. In T. B. Ward, Clement, J. (1 982). Students’ preconceptions
S. M. Smith, & J. Vaid (Eds.), Conceptual struc- introductory mechanics. American Journal of
tures and processes: Emergence, discovery and Physics, 5 0, 66–71 .
change (pp. 209–23 4). Washington, DC: Amer- Clement, J., Brown, D., & Zietsman, A. (1 989).
ican Psychological Association. Not all preconceptions are misconceptions:
Chi, M. T. H. (2000). Self-explaining: The dual Finding anchoring conceptions for ground-
processes of generating inferences and repairing instruction on students’ intuitions. Inter-
ing mental models. In R. Glaser (Ed.), Advances national Journal of Science Education, 1 1 , 5 5 4–
in instructional psychology (pp. 1 61 –23 7). Mah- 5 65 .
wah, NJ: Erlbaum. Collins, A., & Ferguson, W. (1 993 ). Epistemic
Chi, M. T. H. (in press). Common sense conforms and epistemic games: Structures and
ceptions of emergent processes. Journal of the strategies to guide inquire. Educational Psychol-
Learning Sciences. ogist, 2 8, 25 –42.
Chi, M. T. H., & Ceci, S. J. (1 987). Content Confrey, J. (1 990). A review of the research on
knowledge: Its role, representation, and re- student cnceptions in mathematics, science,
structuring in memory development. In H. S. and programming. In C. B. Cazden (Ed.), Re-
Reese (Ed.), Advances in child development and view of research in education (Vol. 1 , pp. 3 –5 6).
behavior (Vol. 20, pp. 91 –1 42). New York: Aca- Washington, DC: American Educational Re-
demic Press. search Association.
Chi, M. T. H., Feltovich, P., & Glaser, R. (1 981 ). Costello, F. J., & Keane, M. T. (2000). Efficient
Categorization & representation of physics creativity: Constraint-guided conceptual com-
problems by experts and novices. Cognitive Sci- bination. Cognitive Science, 2 4, 299–3 49.
ence, 5 , 1 21 –1 5 2. (citation classic) Darden, L. (1 992). Strategies for anomaly reso-
Chi, M. T. H., Glaser, R., & Farr, M. (1 988). The lution. In R. Giere (Ed.), Cognitive models of
nature of expertise. Hillsdale, NJ: Erlbaum. science (pp. 25 1 –273 ). Minneapolis: University
Chi, M. T. H., & Hausmann, R. G. M. (2003 ). Do of Minnesota Press.
radical discoveries require ontological shifts? deGroot, A. D. (1 965 ). Thought and choice in
In L. V. Shavinina (Ed.), International hand- chess. The Hague, Netherlands: Mouton.
Demasters, S., Settlage, J., & Good, R. (1 995 ). Gopnik, A., & Meltzoff, A. N. (1 997). Words,
Students’ conceptions of natural selection and thoughts, and theories. Cambridge, MA: MIT
its role in evolution: Cases of replication and Press.
comparison. Journal of Research in Science Gopnik, A., & Wellman, H. M. (1 994). The the-
Teaching, 3 2 , 5 3 5 –5 5 0. ory theory. In L. A. Hirschfeld & S. A. Gelman
diSessa, A. (1 982). Unlearning Aristotelian (Eds.), Mapping the mind: Domain specificity
physics: A study of knowledge-based learning. in cognition and culture (pp. 25 5 –293 ). Cam-
Cognitive Science 6, 3 7–75 . bridge, UK: Cambridge University Press.
diSessa, A. (1 983 ). Phenomenology and the evo- Graesser, A., Kassleer, M. A., Kreuz, R. J., &
lution of intuition. In D. Gentner & A. L. Mclain-Allen, B. (1 998). Verification of state-
Stevens (Eds.), Mental models (pp. 1 5 –3 3 ). ments about story worlds that deviate from nor-
Hillsdale, NJ: Erlbaum. mal conceptions of time: What is true about
diSessa, A. (1 988). Knowledge in pieces. In G. Einstein’s dreams. Cognitive Psychology, 3 5 ,
Forman & P. Pufall (Eds.), Constructivism in 246–3 01 .
the computer age (pp. 49–70). Hillsdale, NJ: Graesser, A., Singer, M., & Trabasso, T. (1 994).
Erlbaum. Constructing inferences during narrative text
diSessa, A. (1 993 ). Toward an epistemology of comprehension. Psychological Review, 1 01 ,
physics. Cognition and Instruction, 1 0, 1 01 –1 04. 1 86–209.
Halloun, I. A., & Hestenes, D. (1 985 ). Common
Driver, R. (1 987). Promoting conceptual change
sense concepts about motion. American Journal
in classroom settings: The experience of the
of Physics, 5 3 , 1 05 6–1 065 .
children’s learning in science project. In J. D.
Novak (Ed.), Proceeding of the second interna- Hampton, J. A. (1 997). Emergent attributes in
tional seminar on misconceptions and educational combined concepts. In T. Ward, S. M. Smith, &
strategies in science and mathematics (Vol. II, J. Vaid (Eds.), Creative thought: An investigation
pp. 97–1 07). Ithaca, NY: Department of Ed- of conceptual structures and processes (pp. 83 –
ucation, Cornell University. 1 1 0). Washington, DC: American Psychologi-
cal Association.
Eagly, A. H., & Chaiken, S. (1 993 ). The psychology
of attitudes. Ft. Worth, TX: Harcourt, Brace & Harmon-Jones, E., & Mills, J. (1 999). Cognitive
Jovanovich. dissonance: Progress on a pivotal theory in social
psychology. Washington, DC: American Psy-
Ehrlich, H. J., & Leed, D. (1 969). Dogmatism,
chological Association.
learning, and resistance to change: A review
and a new paradigm. Psychological Bulletin, 71 , Heider, F. (1 944). Social perception and phenom-
249–260. enal causality. Psychological Review, 5 1 , 3 5 8–
3 74.
Ericsson, K. A. (1 996). The road to excellence.
Hewson, P. W., & Hewson, M. G. A. (1 984).
Mahwah, NJ: Erlbaum.
The role of conceptual conflict in conceptual
Feltovich, P. J., Ford, K. M., & Hoffman, R. R. change and the design of science instruction.
(1 997). Expertise in context. Menlo Park, CA: Instructional Science, 1 3 , 1 –1 3 .
AAAI Press.
Hofstadter, D. (1 999). Godel, Escher, Bach –
Ferrari, M., & Chi, M. T. H. (1 998). The nature An eternal golden braid. New York: Basic
of naı̈ve explanations of natural selection. Inter- Books.
national Journal of Science Education, 2 0, 1 23 1 –
Hutchins, E. (1 995 ). Cognition in the wild. Cam-
1 25 6.
Festinger, L. (1 962/1 95 7). A theory of cognitive
Hutchins, E. L., & Levin, J. A. (1 981 ). Point of view
dissonance. Stanford, CA: Stanford University
in problem solving. CHIP Tech. Rep. No. 1 05 ,
Press.
University of California at San Diego.
Fishbein, M., & Ajzen, I. (1 975 ). Belief, attitude, Jeong, H., Taylor, R., & Chi, M. T. H (2000).
intention and behavior. Reading, MA: Addison- Learning from a computer workplace simu-
Wesley. lation. Proceedings of the 2 2 nd annual meeting
Gagné, R. M. (1 965 ). The conditions of learning. of the Cognitive Science Society (pp. 705 –71 0).
New York: Holt, Rinehart & Winston. Mahwah, NJ: Erlbaum.
Gazzaniga, M. S. (1 992). Nature’s mind. New Johnson, A., Moher, T., Ohlsson, S., &
York: Basic Books. Gillingham, M. (1 999). The Round Earth
Project – Collaborative VR for concep- ence and Mathematics” (Vol. II, pp. 275 –284).
tual learning. IEEE Computer Graphics and Ithaca, NY: Cornell University.
Applications, 1 9(6), 60–69. Limon, M. (2002). Conceptual change in his-
Johnson, A., Moher, T., Ohlsson, S., & Leigh, J. tory. In M. Limon & L. Mason (Eds.), Recon-
(2001 ). Exploring multiple representations in sidering conceptual change: Issues in theory and
elementary school science education. In Pro- practice (pp. 25 9–289). Amsterdam: Kluwer
ceedings of IEEE VR 2 001 (pp. 201 –208). Yoko- Academic.
hama, Japan, Mar 1 3 –1 7. Lovett, M. C. (2002). Problem solving. In H.
Johsua, S., & Dupin, J. J. (1 987). Taking into Pashler & D. Medin (Eds.), Stevens’ handbook of
account student conceptions in instructional experimental psychology, volume 2 : Memory and
strategy: An example in physics. Cognition and cognitive processes (pp. 3 1 7–3 62). New York:
Instruction, 4(2), 1 1 7–1 3 5 . Wiley.
Keegan, R. T. (1 989). How Charles Darwin be- Machamer, P., & Woody, A. (1 992). Models of
came a psychologist. In D. B. Wallace & H. E. intelligibility in science: Using the balance as a
Gruber (Eds.), Creative people at work: Twelve model for understanding the motion of bodies.
cognitive case studies (pp. 1 07–1 25 ). New York: In S. Hills (Ed.), The history and philosophy of
Oxford University Press. science in science education (Vol. 2, pp. 95 –1 1 1 ).
Keller, C. M., & Keller, J. D. (1 996). Cognition and Kingston, Ontario: Queen’s University.
tool use: The blacksmith at work. Cambridge, Markman, A. B. (1 999). Knowledge representation.
UK: Cambridge University Press. Mahwah, NJ: Erlbaum.
Kelman, H. C., & Baron, R. M. (1 968). Deter- Marshall, S. P. (1 995 ). Schemas in problem solving.
minants of modes of resolving inconsistency Cambridge, UK: Cambridge University Press.
dilemmas: A functional analysis. In R. P. Abel- McCloskey, M. (1 983 ). Naı̈ve theories of mo-
son, E. Aronson, W. J. McGuire, T. M. New- tion. In D. Gentner & A. L. Stevens (Eds.),
comb, M. J. Rosenberg, & P. H. Tannenbaum Mental models (pp. 299–3 23 ). Hillsdale, NJ:
(Eds.), Theories of cognitive consistency: A source- Erlbaum.
book (pp. 670–683 ). Chicago: Rand McNally.
McGuire, W. J. (1 968). Theory of the structure of
Kintsch, W. (1 998). Comprehension. Cambridge,
human thought. In R. P. Abelson, E. Aronson,
UK: Cambridge University Press.
W. J. McGuire, T. M. Newcomb, M. J. Rosen-
Kitcher, P. (1 993 ). The advancement of science. berg, & P. H. Tannenbaum (Eds.), Theories of
New York: Oxford University Press. cognitive consistency: A sourcebook (pp. 1 40–
Klein, G. (1 998). Sources of power: How people 1 62). Chicago: Rand McNally.
make decisions. Cambridge, MA: MIT Press.
Means, M. L., & Voss, J. F. (1 985 ). Star wars:
Kuhn, T. (1 970). The structure of scientific revolu- A developmental study of expert and novice
tions. Chicago: University of Chicago Press. knowledge structures. Journal of Memory and
Landauer, T. K. (1 986). How much do people Language, 2 4, 746–75 7.
remember? Some estimates of the quantity Medin, D. L., & Shoben, E. J. (1 988). Context and
of learned information in long-term memory. structure in conceptual combination. Cognitive
Cognitive Science, 1 0, 477–493 . Psychology, 2 0, 1 5 8–1 90.
Lawson, A., & Thompson, L. (1 988). Formal rea- Miller, G. A. (1 996). The science of words. New
soning ability and misconceptions concerning York: Scientific American Library.
genetics and natural selection. Journal of Re-
Minsky, M. A. (1 975 ). A framework for the rep-
search in Science Teaching, 2 5 , 73 3 –746.
resentation of knowledge. In P. Winston (Ed.),
Lesgold, A., Rubinson, H., Feltovich, P., Glaser, The psychology of computer vision. New York:
R., Klopfer, D., & Wang, Y. (1 988). Expertise McGraw-Hill.
in a complex skill: Diagnosing x-ray pictures.
In M. Chi, R. Glaser, & M. Farr (Eds.), The na- Minstrel, J. (1 982). Explaining the “at rest” condi-
ture of expertise (pp. 3 1 1 –3 42). Hillsdale, NJ: tion of an object. The Physics Teacher, January,
Erlbaum. 1 0–1 4.
Licht, P. (1 987). A strategy to deal with concep- Miyake, N. (1 986). Constructive interaction and
tual and reasoning problems in introductory the iterative process of understanding, Cogni-
electricity education. In J. Novak (Ed.), Pro- tive Science, 1 0, 1 5 1 –1 77.
ceedings of the 2nd international seminar “Mis- Norman, D. A., & Rumelhart, D. E. (1 975 ). Ex-
conceptions and Educational Strategies in Sci- plorations in cognition. San Francisco: Freeman.
Nussbaum, J. (1 979). Children’s conception of discovery. Cognitive Science Quarterly, 1 , 23 –

the Earth as a cosmic body: A cross-age study. 277.
Science Education, 63 , 83 –93 . Pennington, N. (1 987). Comprehension strate-
Nussbaum, J. (1 985 ). The Earth as a cosmic body. gies in programming. In G. M. Olson, S. Shep-
In R. Driver, E. Guesne, & A. Tiberghien (Eds.), pard, & E. Soloway (Eds.), Empirical studies of
Children’s ideas in science (pp. 1 70–1 92). Milton programmers: Second workshop (pp. 1 00–1 1 3 ).
Keynes, UK: Open University Press. Norwood, MJ: Ablex.
Nussbaum, J., & Novak, J. D. (1 976). An assess- Piaget, J. (1 985 ). The equilibration of cogni-
ment of children’s concepts of the Earth uti- tive structures. Chicago: University of Chicago
lizing structured interviews. Science Education, Press.
60, 5 3 5 –5 5 0. Piaget, J., & Inhelder, B. (1 95 6). The child’s con-
Ohlsson, S. (1 993 a). Abstract schemas. Educa- ception of space. London: Routledge and Kegan
tional Psychologist, 2 8, 5 1 –66. Paul.
Ohlsson, S. (1 993 b). The interaction between Posner, G. J., Strike, K. A., Hewson, P. W., & Gert-
knowledge and practice in the acquisition of zog, W. A. (1 982). Accommodation of a scien-
cognitive skills. In A. Meyrowitz & S. Chipman tific conception: Toward a theory of conceptual
(Eds.), Foundations of knowledge acquisition: change. Science Education, 66, 21 1 –227.
Cognitive models of complex learning (pp. 1 47– Quillian, M. R. (1 968). Semantic memory. In M.
208). Norwell, MA: Kluwer Academic. Minsky (Ed.), Semantic information processing
Ohlsson, S. (1 994). Declarative and procedu- (pp. 227–270). Cambridge, MA: MIT Press.
ral knowledge. In T. Husen & T. Neville- Reiner, M., Slotta, J. D., Chi, M. T. H., & Resnick,
Postlethwaite (Eds.), The international encyclo- L. B. (2000). Naive physics reasoning: A com-
pedia of education (Vol. 3 , 2nd ed., pp. 1 43 2– mitment to substance-based conceptions. Cog-
1 43 4). London, UK: Pergamon Press. nition and Instruction, 1 8(1 ), 1 –3 4.
Ohlsson, S. (1 996). Learning from performance Reisberg, D. (2001 ). Cognition: Exploring the sci-
errors. Psychological Review, 1 03 , 241 –262. ence of the mind (2nd ed.). New York: Norton.
Ohlsson, S. (1 999). Theoretical commitment and Rips, L., & Medin, D. (in press). Concepts and
implicit knowledge: Why anomalies do not categories: Memory, meaning and metaphysics.
trigger learning. Science and Education, 8, 5 5 9– In K. J. Holyoak & B. G. Morrison (Eds.), Cam-
5 74. bridge handbook of thinking and reasoning. New
Ohlsson, S. (2002). Generating and understand- York: Cambridge University Press.
ing qualitative explanations. In J. Otero, J. A. Robin, N., & Ohlsson, S. (1 989). Impetus then
Leon, & A. C. Graesser (Eds.), The psychology and now: A detailed comparison between Jean
of science text comprehension (pp. 91 –1 28). Mah- Buridan and a single contemporary subject. In
wah, NJ: Erlbaum. D. E. Herget (Ed.), The history and philoso-
Ohlsson, S., & Hemmerich, J. (1 999). Articu- phy of science in science teaching (pp. 292–3 05 ).
lating an explanation schema: A preliminary Tallahassee: Florida State University.
model and supporting data. In M. Hahn & S. Rokeach, M. (1 960). The open and closed mind.
Stones (Eds.), Proceedings of the twenty first an- New York: Basic Books.
nual conference of the Cognitive Science Society Rokeach, M. (1 970). Beliefs, attitudes, and values:
(pp. 490–495 ). Mahwah, NJ: Erlbaum. A theory of organization and change. San
Ohlsson, S., & Lehtinen, E. (1 997). Abstraction Francisco: Jossey-Bass.
and the acquisition of complex ideas. Interna- Rosenzweig, M. R. (2001 ). Learning and neu-
tional Journal of Educational Research, 2 7, 3 7– ral plasticity over the life span. In P. E. Gold
48. & W. T. Greenough (Eds.), Memory consolida-
Ohlsson, S., Moher, T. G., & Johnson, A. (2000). tion (pp. 275 –294). Washington, DC: Ameri-
Deep learning in virtual reality: How to teach can Psychological Association.
children that the Earth is round. In L. R. Gleit- Rumelhart, D. E., & Norman, D. A. (1 978). Ac-
man & A. K. Joshi (Eds.), Proceedings of the cretion, tuning and restructuring: Three modes
twenty-second annual conference of the Cognitive of learning. In J. W. Cotton & R. Klatzky (Eds.),
Science Society (pp. 3 64–3 68). Mahwah, NJ: Semantic factors in cognition (pp. 3 7–5 4). Hills-
Erlbaum. dale, NJ: Erlbaum.
Ohlsson, S., & Regan, S. (2001 ). A function Rumelhart, D. E., Smolensky, P., McClelland,
for abstract ideas in conceptual learning and J. L., & Hinton, G. E. (1 986). Schemata and
sequential thought processes in PDG models. Squire, L. R. (1 987). Memory and brain. New
In J. McClelland & D. E. Rumbelhart (Eds.), York: Oxford University Press.
Parallel distributed processing: Explorations in the Stokes, A. F., Kemper, K., & Kite, K. (1 997).
microstructure of cognition (Vol. 2, pp. 7–5 7). Aeronautical decision making, cue recognition,
Cambridge, MA: MIT Press. and expertise under time pressure. In C. E.
Schank, R. (1 972). Conceptual dependency: A Zsambok & G. Klein (Eds.), Naturalistic deci-
theory of natural language understanding. Cog- sion making. Expertise: Research and applications
nitive Psychology, 3 , 5 5 2–63 1 . (pp. 1 83 –1 96). Mahwah, NJ: Erlbaum.
Schank, R. (1 986). Explanation patterns. Hills- Strike, K. A., & Posner, G. J. (1 985 ). A conceptual
dale, NJ: Erlbaum. change view of learning and understanding. In
Schank, R. C., & Abelson, R. P. (1 977). Scripts, L. West & L. Pines (Eds.), Cognitive structure
plans, goals, and understanding: An inquiry and conceptual change (pp. 21 1 –23 1 ). New York:
into human knowledge structures. Hillsdale, NJ: Academic Press.
Erlbaum. Strike, K. A., & Posner, G. J. (1 992). A revisionist
Schultz, T. R., & Lepper, M. R. (1 996). Cognitive theory of conceptual change. In R. A. Duschl
dissonance reduction as constraint satisfaction. & R. J. Hamilton (Eds.), Philosophy of science,
Psychological Review, 1 03 , 21 9–240. cognitive psychology, and educational theory and
practice (pp. 1 47–1 76). New York: State Uni-
Shatz, M., & Gelman, R. (1 973 ). The develop-
versity of New York Press.
ment of communication skills: Modifications in
the speech of young children as a function of Thagard, P. (1 989). Explanatory coherence. Be-
listener. Monographs of the Society for Re- havioral and Brain Sciences, 1 2 , 43 5 –467.
search in Child Development, Serial number Thagard, P. (1 992a). Conceptual revolutions.
1 5 2, 3 8(5 ). Princeton, NJ: Princeton University Press.
Simon, H. A., & Gilmartin, K. (1 973 ). A simu- Thagard, P. (1 992b). Adverserial problem solving:
lation of memory for chess positions. Cognitive Modeling an opponent using explanatory co-
Psychology, 5 , 29–46. herence. Cognitive Science, 1 6, 1 23 –1 49.
Slotta, J. D., Chi, M. T. H., & Joram, E. (1 995 ). Thagard, P. (2000). Coherence in thought and ac-
Assessing students’ misclassifications of physics tion. Cambridge, MA: MIT Press.
concepts: An ontological basis for conceptual Thompson, P. (1 989). The structure of biological
change. Cognition and Instruction, 1 3 (3 ), 3 73 – theories. New York: State University of New
400. York Press.
Smith, E. E., Osherson, D. N., Rips, L. J., & Thorndyke, P. W. (1 984). Applications of schema
Keane, M. (1 988). Combining prototypes: A theory in cognitive research. In J. R. Anderson
selective modification model. Cognitive Science, & S. M. Kosslyn (Eds.), Tutorials in learning and
1 2 , 485 –5 27. memory (pp. 1 67–1 92). San Francisco: Freeman.
Smith, J. P., III, DiSessa, A. A., & Roschelle, J. Trabasso, T., & van den Broek, P. (1 985 ). Causal
(1 995 ). Misconceptions reconceived: A con- thinking and the representation of narrative
structivist analysis of knowledge in transition. events. Journal of Memory and Language, 2 4,
The Journal of the Learning Sciences, 3 , 1 1 5 –1 63 . 61 2–63 0.
Soloway, E., & Erhlich, K. (1 984). Empiri- Tuckerman, N., & Dunnan, N. (1 995 ). The Amy
cal studies of programming knowledge. IEEE Vanderbilt complete book of etiquette. New York:
Transactions on Software Engineering, 1 0, 5 95 – Doubleday.
609. VanLehn, K. (1 989). Problem solving and cog-
Soloway, E. M. (1 978). Learning = interpreta- nitive skill acquisition. In M. I. Posner (Ed.),
tion + generalization: A case study in knowledge- Foundations of cognitive science (pp. 5 27–5 79).
directed learning. Unpublished doctoral disser- Cambridge, MA: MIT Press.
tation. Amherst: University of Massachusetts. Vosniadou, S. (1 994a). Capturing and modeling
Spilich, G. J., Vesonder, G. T., Chiesi, H. L., & the process of conceptual change. Learning and
Voss, J. F. (1 979). Test processing of domain- Instruction, 4, 45 –69.
related information for individuals with high Vosniadou, S. (1 994b). Universal and culture-
and low domain knowledge. Journal of Verbal specific properties of children’s mental mod-
Learning and Verbal Behavior, 1 8, 275 –290. els of the earth. In L. Hirschfeld & S. Gelman
(Eds.), Mapping the mind (pp. 41 2–43 0). Cam- ing sense of the world. Journal of Science Edu-
bridge, MA: Cambridge University Press. cation and Technology, 8, 3 –1 9.
Vosniadou, S., & Brewer, W. F. (1 992). Mental Wilson, R. A., & Keil, F. C. (2000). The shadows
models of the earth: A study of conceptual and shallows of explanation. In F. C. Keil &
change in childhood. Cognitive Psychology, 2 4, R. A. Wilson (Eds.), Explanation and cognition
5 3 5 –5 85 . (pp. 1 3 7–1 5 9). Cambridge, MA: MIT Press.
Voss, J. F., Vesonder, G. T., & Spilich, G. J. (1 980). Winograd, T. (1 975 ). Frame representations
Text generation and recall by high-knowledge and the declarative/procedural controversy. In
and low-knowledge individuals. Journal of Ver- D. Bobrow & A. Collins (Eds.), Represen-
bal Learning and Verbal Behavior, 1 9, 65 1 –667. tation and understanding: Studies in cognitive
White, B., & Frederiksen, J. (1 990). Causal model science (pp. 1 85 –21 0). New York: Academic
progressions as a foundation for intelligent Press.
learning environments. Artificial Intelligence, Wiser, M., & Carey, S. (1 983 ). When heat and
42 , 99–1 5 7. temperature were one. In D. Gentner & A.
Wilensky, U., & Resnick, M. (1 999). Thinking in Stevens (Eds.), Mental models (pp. 267–297).
levels: A dynamic systems perspective to mak- Hillsdale, NJ: Erlbaum.
CHAPTER 1 7
Thinking as a Production System
Marsha C. Lovett
John R. Anderson
Thinking as a Production System discuss our view of production systems in

future research.
Since their birth (ca. late 1 960s), produc-
Background on Production Systems
tion systems have been developed as a formal
tool not only for describing but for explain- A production system is a set of production
ing how humans think. Indeed, “to advance rules – each of which represents a contin-
our understanding of how humans think” is gency for action – and a set of mechanisms
the stated goal of Newell and Simon’s clas- for matching and applying production rules.
sic book, Human Problem Solving (1 972), in Because the production rule is the funda-
which the first body of work on production- mental unit of this formalism, it is worth giv-
system models of human thought was pre- ing a few examples. Table 1 7.1 presents four
sented (see Novick & Bassok, Chap. 1 4). The sample production rules written in English.
main goal for production systems in psy- Note that each is divided into two parts by
chological research has changed little in the the word “then”: The first part of each pro-
intervening years, and yet the state of the duction rule (before the “then”) specifies the
art has advanced dramatically. The aim of conditions under which that production rule
this chapter is to present a contemporary is applicable, and the second part specifies
production-systems approach to open ques- the actions to be applied. Conditions may re-
tions in problem solving, reasoning, anal- flect an aspect of the external world (e.g., it
ogy, and language. We highlight the ways in is dark) or an internal, mental state (e.g., my
which today’s production systems allow for current goal is to reach a particular location,
more flexibility, stochasticity, and sensitiv- or I can retrieve a particular fact). Likewise,
ity than their predecessors. Besides demon- actions may transform a feature in the real
strating that production systems can offer world (e.g., flip the light switch) or an in-
insight into current questions and add to ternal, mental state (e.g., change my current
our understanding of human thinking, we goal, or add a fact to memory).
401
Table 1 7.1 . Illustrative examples of production rules, written

in English
Number Specification of Production Rule
1 When my current goal involves navigating in a
dark room,
then I flip the light switch in that room.
2 When my current goal is to go to a location that
is more than 3 00 miles away,
then I set a subgoal to go to the local airport.
3 When my current goal is to answer an arithmetic
problem of the form D1 + D2,
then I change the goal to try retrieving the sum
of D1 and D2 from memory.
4 When my current goal is to answer an arithmetic
problem of the form D1 + D2,
then I hold up D2 fingers and change the goal to
count them starting with the number after D1 .
To operate, a production system requires interpreting production rules as program-

a dynamic memory that represents the cur- ming code. Instead, each production rule –
rent state of the system and is used to when it matches dynamic memory – has
match against production rules’ conditions. the potential to fire and change the current
For example, when dynamic memory in- state, thus setting other production rules
cludes the goal “to get to San Francisco,” the into action.
second production rule in Table 1 7.1 would This discussion leads to the question of
match for someone in Pittsburgh, Pennsyl- what it means to model thinking as a produc-
vania. This pattern matching of production tion system: What are the theoretical impli-
rules to dynamic memory leads to a set of po- cations associated with representing knowl-
tentially applicable production rules called edge as production rules? The following
the conflict set. However, not all production are four features commonly attributed to
rules in the conflict set are applied. The pro- production-rule representations:
cess of conflict resolution specifies which pro-
duction rules from the conflict set will ex- 1 . Production rules are modular. Each pro-
ecute their actions or fire. These actions are duction rule represents a well-circum-
likely to change the external and/or internal scribed unit of knowledge such that any
state of the system reflected in a change to production rule can be added, refined,
dynamic memory. Then, a potentially differ- or deleted independently of other pro-
ent set of production rules may comprise the duction rules in the system. Moreover,
conflict set, and the cycle continues. each production rule is atomic such that
One way to view how production rules it would be added, refined, and deleted
operate is by analogy to stimulus–response as whole unit. It is important to note,
associations; that is, when a particular stim- however, that this modularity does not
ulus is present, an associated response is trig- preclude production rules from inter-
gered. This fits with the notion that a pro- acting with each other extensively in a
duction rule cannot be directly verbalized running system. Indeed, adding a new
but rather is observable through behavior. production rule to an existing set can –
This analogy to stimulus–response associa- and often does – completely change the
tions emphasizes the fact that production functioning of the system because of the
systems do not operate via a homunculus way production rules’ actions impact each
thinking as a production system 403
others’ firing. Early production-system The capability to represent abstract re-

modelers (Klahr & Wallace, 1 976; Young lationships allows for transfer of learning
& O’Shea, 1 981 ) took advantage of this across different situations as long as they
feature by adding or deleting produc- fit within the conditions of the given pro-
tion rules to explicitly test how that duction rule. For example, the first pro-
change would impact the system’s be- duction rule in Table 1 7.1 could match to
havior. More recently, production systems a dark dining room, living room, or office,
have been developed with autonomous meaning that experience at flipping the
learning mechanisms that enable the sys- light switch in any of these rooms would
tem’s production rules to change based transfer to the others. Likewise, the third
on experience. In these systems, mod- production rule in Table 1 7.1 could match
ularity is achieved because these learn- to any two-addend addition problem.
ing mechanisms create and modify indi- 4. Production rules cannot be directly verbal-
vidual production rules independently of ized. This feature is based on the notion
other rules. that each production rule represents
2. Production rules are asymmetric. Each pro- knowledge about a contingency for action
duction rule is a unidirectional contin- that is not directly accessible to verbal-
gency for action. This means that the pro- ization. A good example of this occurs
duction rule “When I want to type the when someone knows how to drive a stan-
letter ‘j’, then I punch my right index fin- dard transmission car but cannot explain
ger” is different from “When I punch my it verbally. It is important to note that,
right index finger, then I type the letter while this feature implies that knowledge
‘j’”. Moreover, asymmetry and modularity represented in production-rule form can-
imply that, if these two production rules not be accessed directly, it does not im-
were in the same system, adding, delet- ply that one cannot use other techniques
ing, or refining the former would not di- to talk about performance knowledge. For
rectly change the latter. That is, practicing example, when changing gears in a stan-
typing would exercise the first produc- dard transmission car, it is possible to ob-
tion rule, strengthening the index-finger serve one’s own performance and verbally
response when “j” is the desired letter, but describe these observations. Also, knowl-
it would not strengthen one’s knowledge edge about how to perform a task may be
that “j” appears when touch-typing with represented in multiple forms – some that
that finger. For expert touch-typists, this can be verbalized and some that cannot.
asymmetry is quite noticeable: Without
looking at a keyboard, try to identify the This last point confronts a common miscon-
letter that is typed with your left index ception about production systems – namely,
finger. Tough, isn’t it? Typing the word that knowledge about rules or procedures is
“frog” would have been easier. Such asym- necessarily represented as production rules.
metry has been documented in many con- Whereas knowledge about rules and pro-
texts (see Singley & Anderson, 1 989, for cedures can be represented in production-
a review). rule form, it is not the content of knowl-
3 . Production rules can be abstract. Produc- edge that determines how it is represented.
tion rules allow for generalization be- Instead, the four features listed previously
cause their conditions may be repre- serve as a set of necessary conditions for
sented as templates that match to a wide knowledge to be considered as being repre-
range of patterns. These conditions spec- sented in production-rule form. To illustrate
ify the relationship(s) between items the distinction between knowledge con-
without specifying the items themselves tents and representational form, Table 1 7.2
(e.g., “When A is taller than B and B is shows that the same knowledge content
taller than C, then say A is taller than C” (either column) can be represented in a
is true for any values of A, B, and C). production-rule form (top entry) or not
Table 1 7.2 . Examples Illustrating that the Form (Rows) and Content (Columns) of Knowledge
Are Independent
Knowledge Contents
Representational Form Rulelike Factlike
Production rule When I want to type a letter and I When I want to type the letter “j,”
know its finger move, then then I punch with my right
make that move index finger on the home row
Declarative fact To touch-type, one must make the The letter “j” goes with the right
finger move corresponding to index finger in home-row
the currently desired letter position
(bottom entry as a declarative fact). So, effectors into models. In contrast, current
when considering what it means for knowl- production systems incorporate and empha-
edge to be represented in production-rule size perception and action in their frame-
form, the key is not in what knowledge is works (Anderson & Lebiere, 1 998; Meyer
being represented but rather in how. & Kieras, 1 997). Finally, the fifth feature
that distinguishes early and recent produc-
Production Systems, Then and Now tion systems is so strongly linked to the
early models that it has sometimes been
The first production systems set out to es-
considered a defining feature of produc-
tablish a mechanistic account of how hu-
tion systems. This is the symbolic nature of
man adults perform relatively short, mod-
early production systems. However, almost
erately difficult, symbolic tasks (Newell &
all modern production systems take a hybrid
Simon, 1 972). Besides demonstrating that
view by positing symbolic representations
production systems could solve these tasks,
as important conceptual units and acknowl-
the main goal was to connect the sys-
edging graded representations as a valuable
tem’s processing steps to human problem-
additional layer (e.g., associating continu-
solving steps. Several features distinguish
ously valued quantities with each produc-
these early production systems from their
tion rule).
current-day progeny. First, early production
systems tended to focus on demonstrating
Current Production Systems in Context
human-like performance; current models rely
heavily on learning mechanisms to derive This section provides a brief overview of four
predictions about learning and performance production systems currently being used in
across time. Second, early models focused a variety of cognitive modeling situations.
on reproducing qualitatively the processing The systems to be described are ACT-R
steps of individual problem solvers, whereas (Anderson & Lebiere, 1 998), EPIC (Meyer &
more recent models have been submitted to Kieras, 1 997), Soar (Laird, Newell, & Rosen-
both quantitative analyses of fit to aggre- bloom, 1 991 ), and 4-CAPS (Just, Carpenter,
gate data (e.g., average reaction times for & Varma, 1 999). ACT-R emphasizes the
various conditions) and qualitative analy- notion of a cognitive modeling architecture
ses (e.g., whether the model demonstrates in which the same set of mechanisms and
the same errors as people).1 Third, the role representational schemes are used to cap-
of noise processes has increased drastically ture human learning and performance across
from early models that avoided stochastic tasks. Recently, this has been extended to
processes completely to current day mod- map various ACT-R mechanisms and mod-
els in which stochasticity plays an important ules to particular brain regions for compar-
role (Lebiere, Anderson, & Bothell, 2002; ison with neuroimaging data. EPIC has fo-
Lebiere et al., 2003 ). Fourth, early models cused on capturing the connections among
focused on the “cognitive” layer of process- the cognitive, perceptual, and motor sys-
ing and eschewed integrating receptors and tems. Recently, EPIC has been used to make
quantitative predictions about perception- performance data, ACT-R and Soar have
to-action loops in multiple-task situations particularly focused on production-rule
and across the adult age span. Soar was origi- learning as well. Yet another dimension in
nally developed to address issues in both psy- which these architectures differ is their com-
chology and artificial intelligence. Recently, mitment to hybridization with Soar commit-
it has been particularly successful in simu- ted to a purely symbolic account whereas
lating multiagent, dynamic interactions with ACT-R and 4-CAPS postulate continuously
real world application (e.g., Jones, et al., varying quantities that drive the processing
1 999). The 4-CAPS architecture, like its pre- of symbolic units. EPIC does have contin-
decessor 3 -CAPS (Just & Carpenter, 1 992), uously varying parameters associated with
focuses on individual differences. various modules but does not appear to
To delineate the space of current produc- have information-laden nonsymbolic quan-
tion systems, we next highlight the dimen- tities in its theory of central cognition.
sions along which these systems differ. First, Finally, production systems differ in the
they differ with regard to their degree of pro- role that noise processes play in their pro-
cessing parallelism. Toward one end of the cessing. In Soar, their role is minimal (i.e.,
spectrum, ACT-R posits that only a single when a “tie” between production rules arises,
production rule can fire at a time. However, one of them is chosen at random). In ACT-
ACT-R allows for parallelism in other ways: R and 4-CAPS, noise processes are assumed
asynchronous parallelism among its percep- added to the various continuously varying
tual and motor modules2 , parallel retrieval of computations that influence system perfor-
information from declarative memory, and mance. In EPIC, noise is used more to rep-
parallel production-rule matching and selec- resent variability in system parameters (e.g.,
tion. Soar similarly posits serial processing rate parameter in Fitt’s law governing mo-
in that a single operator is chosen in each tor movements) than to represent a generic
decision phase, but this is preceded by an nondeterminism of the system.
elaboration phase that allows parallel pro-
duction firing. 4-CAPS allows parallel firing
Organization of the Remainder
of production rules for all cycles, but this
of the Chapter
parallelism is subject to a capacity limitation
such that the more production rules firing, Our own research has involved the ACT-
the less rapidly each of them is executed. R system and slight variants. In this chap-
EPIC is the only system with fully parallel ter, we describe six ACT-R models with
production-rule firing. To manage its mul- which we are familiar that address differ-
tiply threaded central cognition, EPIC uses ent aspects of cognition. We do not fo-
task-coordination strategies that impose or- cus on the ACT-R details of these mod-
dering constraints when necessary. els but rather on how they illustrate the
Another dimension along which the sys- general trends in production-system mod-
tems differ is the degree of modularity they els toward softer, more flexible, and highly
propose. Soar is at one end of this spectrum detailed characterizations of human cogni-
because of its unitary structure – a single set tion. We place each model in the multi-
of production rules representing long-term dimensional space described previously by
memory. 4-CAPS posits a number of distinct highlighting the following features: Does
sets of production rules connected to each the model include both performance and
other. In ACT-R and EPIC, multiple mod- learning mechanisms? Does the model make
ules correspond to separate perceptual and use of symbolic (rule-based) and other
motor modalities and to “central cognition.” continuously varying computations? Does
These modules are considered encapsulated, the model draw upon multiple processing
independent processors with their interac- modules beyond a central production-rule
tions handled by the production system. memory? We use the template in Table 1 7.3
Although all four systems produce quan- to summarize how each model fits into this
titative predictions that match well to three-dimensional space in terms of its use
Table 1 7.3. Template for Describing the Knowledge Structures and Mechanisms Involved in Each
ACT-R Model
Performance Mechanisms Learning Mechanisms
Symbolic Subsymbolic Symbolic Subsymbolic
Declarative Knowledge (usually Relative activation Adding new Changing activation
chunks facts) that can be of declarative declarative of declarative
directly chunks affects chunks to the set chunks and
verbalized retrieval changing strength
of links between
chunks
Production Knowledge for Relative utility of Adding new Changing utility of
rules taking particular production rules production rules production rules
actions in affects choice to the set
particular
situations
of various ACT-R representations and mech- ple production rules match the current situ-
anisms. In addition, we comment on how ation, the one with the highest utility value
each model makes use of parallelism and succeeds in firing. This competition occurs
noise processes, as appropriate. among the individual units themselves with-
We use the term “subsymbolic” to refer out any explicit selection by a controlling ho-
to the numerical values and computations munculus and without any conscious access
associated with each symbolic unit. In this to the utility values. Another important kind
sense, the prefix “sub” refers to a level of de- of numerical quantity in our subsymbolic
scription below the symbolic units and that representation is similarities between sym-
determines those symbolic units’ access in bols. With these quantities, a production rule
competition with other symbols. The use can partially match against a symbol similar
of the term subsymbolic from a connec- to the one specified in its condition, allowing
tionist perspective often refers to the fact the system to realize soft constraints. This
that symbols may be represented in a dis- fact further blurs the difference between the
tributed fashion, with the prefix sub refer- two senses of “subsymbolic.” Work exploring
ring to the pieces of the pattern that con- a connectionist implementation of the ACT-
stitute a symbol. For instance, Smolensky R architecture (Lebiere & Anderson, 1 993 )
(1 988, p. 3 ) writes, “The name subsymbolic suggests that symbolic units represented in
paradigm is intended to suggest cognitive a distributed fashion can yield the behavior
descriptions built up of entities that corre- of a symbolic system that has continuously
spond to constituents of the symbols used valued quantities influencing the access and
in the symbolic paradigm; these fine-grained use of its symbols.
constituents could be called subsymbols, and
they are the activities of individual process-
Choice
ing units in connectionist networks.” It is
an interesting question whether these two One of the perennial questions in problem-
views are really in contradiction. The sub- solving research involves how solvers make
symbolic values discussed in this chapter are choices: choices of the next step, of an ap-
updated and used only locally, but at the propriate solution strategy, and of whether
same time have a global impact on the sys- to use weak (domain-general) versus strong
tem’s processing, just as the activations of (domain-specific) methods. Indeed, around
units in a connectionist system do. As an ex- the time when production systems were first
ample of this, consider the utility values as- developed, Newell and Simon introduced
sociated with production rules: When multi- the idea that the very process of problem
Figure 1 7.1 . Initial state and three possible subsequent problem states from the
Building Sticks Task.
solving could be viewed as search in a prob- first time and solved a sequence of problems
lem space, which equates problem solving in which the proportion of problems that
with a series of choices. Research then ad- could be solved by each of the two strategies
dressed the question, “How do solvers make was manipulated (e.g., 3 0% overshoot-only
choices?” by focusing on cases in which problems and 70% undershoot-only prob-
solvers have little or no domain knowledge. lems or vice versa). The results can be sum-
Production-rule models representing various marized in three main findings:
problem-solving heuristics predicted perfor- 1 . Participants’ choices initially followed a
mance and established links between heuris- hill-climbing heuristic with little bias to-
tics and human data. Current research asks, ward undershoot or overshoot.
“How do solvers make choices?” but focuses
2. With experience, participants gradually
on cases in which solvers have prior, relevant
learned to prefer the more successful
experience. This is, at its heart, a question
strategy for their condition.
about learning, so production systems that
learn from their experience may offer addi- 3 . Changes in strategy choice were sensitive
tional insight. to recent experiences in that participants
In a set of studies by Lovett (Lovett were more likely to choose the strategy
& Anderson, 1 996; Lovett, 1 998), partici- that had been successful on the previous
pants’ choice learning in the Building Sticks (or even second-previous) problem.
Task (BST) was studied and modeled within The model that was built for this task has
ACT-R. The BST is an isomorph of the wa- since been applied in various forms to ac-
ter jars task (Luchins, 1 942) such that, in count for choice learning in several other
each problem, solvers must add and subtract tasks (see Lovett, 1 998). Here we describe
the lengths of three building sticks to equal the BST model specifically. The model was
the length of a goal stick (see top of Fig- initially endowed with production rules that
ure 1 7.1 ). Solvers face a choice between two implement two domain-general heuristics,
strategies (see bottom row of Figure 1 7.1 ): hill-climbing and guessing, for the partic-
overshoot, which involves starting with the ulars of this task. For example, the guess–
longest building stick and shortening it by overshoot production rule makes the first
the others, and undershoot, which involves overshoot move regardless of the details of
starting with the short or medium building the problem, and guess–undershoot does this
stick and then lengthening it. In these stud- for undershoot. These productions repre-
ies, participants encountered the BST for the sent an uninformed guess that their action
will lead to a solution and match to any with a (negated) time-weighted average of
BST problem. In addition, the hillclimb– its past costs, where cost is measured in time
overshoot production makes the first over- the production rule “spends” when fired.
shoot move but only matches when this In the case of the BST model, learned
move takes the initial state closest to the goal utility values average in new experiences
state; hillclimb–undershoot does the same of success and failure across trials, allow-
for undershoot. These productions repre- ing the model to gradually increase the util-
sent knowledge for taking the action that ity value for production rules that have had
looks best according to a hill-climbing met- greater success and lower cost, and hence
ric. Note that three of these four produc- to gradually increase the likelihood of fir-
tion rules will match to each BST prob- ing more useful production rules. Thus, the
lem’s initial state – both guess production model shows the same gradual preference
rules and one hillclimb production rule, for the more successful BST strategy, as do
whichever matches the stick lengths of the participants. In addition, because this up-
problem (e.g., hillclimb–undershoot in Figure dating mechanism includes a time-weighted
1 7.1 ). Note also that, although three pro- decay, the impact of recent successes and
duction rules match to an initial stimulus, costs on a production rule’s overall utility
two of them produce the same response but value is greater, leading the model – like
on the basis of different knowledge (i.e., participants – to change strategy choice with
two separate production rules). This empha- greater sensitivity to recent experiences.
sizes that production rules are not simply
stimulus–response associations but repre-
Summary
sent additional information in their condi-
tions, which defines the (potentially differ- This production-system model of problem-
ent) scopes over which they apply. solving choice specifies a set of fairly generic
Beyond the task-specific composition of production rules to represent the heuris-
its production rules, this model’s most tics of guessing and hill-climbing and then
important features come from ACT-R’s gen- draws on ACT-R’s pre-existing production-
eral, subsymbolic computations for pro- rule mechanisms to learn to solve problems
duction-rule utility values. Each production by experience. The major claim, then, is that
rule has an associated utility – learned by ex- strategy-choice learning is strongly guided
perience – that represents a combined esti- by problem-solving experiences of success
mate of how successful and costly that pro- and cost associated with using those strate-
duction rule is likely to be. Whenever the gies and that strategies are effectively repre-
model is faced with multiple matching pro- sented as production rules. More specifically,
duction rules, there is a noisy selection pro- the model posits that choices in problem
cess that fires the production rule with the solving are governed by an implicit competi-
highest subsymbolic utility value. This noise tion among production rules based on their
process serves to lead the model generally utility values (a subsymbolic performance
to choose the production rule that has been mechanism) and that these utilities are up-
most useful in past experience, but to do so dated naturally based on experience (a sub-
a proportion of the time consistent with that symbolic learning mechanism). The corre-
production rule’s utility relative to the com- sponding two subsymbolic, production-rule
peting production rules’ utility (e.g., com- cells have checks in Table 1 7.4. Although
peting production rules with very close util- this model does not address how produc-
ity values are selected virtually at random). tion rules specific to this task are acquired
These utility values are learned from experi- (i.e., there is no symbolic production-rule
ence according to a prespecified mechanism: learning), its initial production rule set is
Specifically, each production rule’s utility is composed mainly of general heuristics that
computed arithmetically as a time-weighted have been adapted only slightly to the con-
average of its past success rate combined text of the particular task. In other words,
Table 1 7.4. Knowledge Structures and Mechanisms Used in a Production-Systems

Model of Choice
√
Declarative chunks
√ √ √
Production rules
for this relatively knowledge-lean task, it is mon feature of these theories is that they
reasonable to suspect that participants and require a mixture of symbolic and subsym-
the model can manage without acquiring bolic processes. The symbolic processes are
many new, task-specific production rules. It required to reason about the structure of
is also interesting that, in this task, produc- the domains, but the softness of subsymbolic
tion rules – with their somewhat broad con- processes is required to stretch the analogy
ditions of applicability – largely determine in semantically plausible ways.
the behavior of the system; although declara- Given the requirement of a mixture of
tive knowledge is involved, it is not involved symbolic and subsymbolic processes, mod-
critically in the explanation of the phenom- ern production systems would seem well
ena. This representational bias is supported designed to model analogy. Salvucci & An-
by the relatively broad, within-task transfer derson (2001 ) describe a relatively success-
that problem solvers show in carrying over ful application of the ACT-R theory to
their strategic preferences from trained BST modeling results in the analogy literature.
problems to novel BST problems. Before reviewing it, we would like to high-
light the value added by such a theory. Al-
though the model incorporates many of the
Analogy insights of the other theories, it is not just
a matter of implementing these theories in
Analogy, the process of finding and using ACT-R. As a complete theory of cognition,
correspondences between concepts, plays the model contributes three factors lack-
a fundamental and ubiquitous role in hu- ing in these other models. First, it naturally
man cognition (see Holyoak, Chap. 6). From maps these processes onto precise predic-
mathematical problem solving (Novick & tions about real world metrics of latency and
Holyoak, 1 991 ) to computer programming correctness, rather than the more qualita-
(Anderson & Thompson, 1 989) to creative tive and ordinal predictions that have typ-
discovery (Holyoak & Thagard, 1 995 ), anal- ified other theories. Second, it integrates the
ogy facilitates better understanding of old process of analogy with the rest of cogni-
knowledge and the formation and inference tion and thus makes predictions about how
of new knowledge. The critical step in anal- processes such as eye movements are inter-
ogy is finding a mapping from objects and re- leaved with the analogy process. Third, it
lations in the source or known domain, where shows that the mechanisms underlying anal-
pre-existing knowledge forms the base of ogy are the same as the mechanisms under-
the analogy, to objects and relations in the lying other aspects of cognitive processing.
target or novel domain, where knowledge Figure 1 7.2 illustrates the representation
from the source domain will be applied. Nu- of the famous solar system analogy (Gentner,
merous researchers have proposed theories 1 983 ) in the Salvucci and Anderson system.
that describe how analogical mapping takes Analogs are represented as higher-order
place (Gentner, 1 983 , 1 989; Hofstadter & structures built up of three components:
Mitchell, 1 994; Holyoak & Thagard, 1 989; objects, relations, and roles. The first two
Hummel & Holyoak, 1 997; Keane, Ledge- components, objects and relations (repre-
way, & Duff, 1 994; Kokinov, 1 998). A com- sented as ovals in Figure 1 7.2) serve the same
purpose as in other theories of analogy: results from Ross (1 989) that showed role
Objects are the semantic primitives of the confusions in probability problems based on
analogs, whereas relations link objects or re- surface similarities between examples. One
lations together according to their function. limitation of the path-mapping process built
The solar-system domain contains the two into this model is that it only considers one
objects ss-sun and ss-planet, along with the proposition at at time. For that reason, the
three relations ss-causes, ss-attracts, and ss- model cannot solve analogies that require
revolves. Similarly, the atom domain con- the consideration of multiple propositions in
tains the two objects at-nucleus and at- parallel, whereas people and other models
electron and the three relations at-causes, can (e.g., Hummel & Holyoak, 1 997).
at-attracts, and at-revolves. The boxes in Fig- On the other hand, the production sys-
ure 1 7.2 represent the third component of an tem control structure leads to other predic-
analog structure – roles, which serve to link tions. Since the model goes from the source
objects and relations to form higher-order to the target, it has a preference for many-to-
conceptual structures. Each role comprises one mappings over one-to-many mappings.
five components: This enables the model to successfully pre-
dict the results of Experiment 2 in Spellman
parent: a pointer to the parent relation and Holyoak (1 996). They presented sub-
parent-type: the semantic type of the par- jects with two stories involving countries on
ent relation different planets and asked subjects to map
slot: the relation slot that the object fills countries on one planet to those on the other.
in the relation The story relations can be summarized as
child: a pointer to the child object or re- follows:
lation Story 1 Story 2
child-type: the semantic type of the child richer (Afflu, richer (Grainwell,
object or relation. Barebrute) Hungerall)
stronger stronger
For example, in the case of the ss-attractor (Barebrute, (Millpower,
role, ss-attracts is the parent, attracts is the Compak) Mightless)
parent-type, attractor is the slot, ss-sun is the
child, and sun is the child-type. The relations include an ambiguous map-
Salvucci and Anderson (2001 ) describe a ping – namely, the mapping of Barebrute
path-mapping process by which the struc- to either Hungerall or Millpower. Subjects
ture in the source is made to correspond to were divided into two conditions: In the 1 –2
the structure in the analog. This mapping condition, subjects mapped objects from
process is achieved by production rules that story 1 to those in story 2; in the 2–1 con-
essentially walk through these graphs look- dition, subjects mapped objects from story
ing for correspondences. The critical step 2 to story 1 . In both conditions, subjects had
in this mapping is retrieving roles from the the option of including any, all, or no objects
target domain to map onto roles in the in their mapping, thus allowing the possibil-
source domain. This is achieved by the par- ity of a one-to-one, one-to-many, or many-
tial matching process in ACT-R that selects to-one mapping, if so desired. Spellman and
the most similar role. Similarity between the Holyoak found that subjects rarely produced
source and target role is determined based on one-to-many mappings (fewer than 2% of
the similarities among the parent-type, slot, subjects), whereas they frequently produced
and child-type components of the roles. One many-to-one mappings (more than 3 0% of
of the consequences is that the model can subjects).
be misled to select inappropriate analogs on In addition to reproducing these results
the basis of surface similarity between the in the literature, Salvucci and Anderson
components of a source and target. For in- had subjects try to determine the analogies
stance, the model successfully simulated the between two stories and collected their eye
thinking as a production system 41 1
SOURCE
ss-causes
ss-cause ss-effect
causes causes
cause effect
attracts revolves
ss-attracts ss-revolves
ss-attractor ss-attracted ss-revolver ss-center

attracts attracts revolves revolves
attractor attracted revolver center
sun planet planet sun
ss-sun ss-planet
TARGET
at-causes
at-cause at-effect
causes causes
cause effect
attracts revolves
at-attracts at-revolves
at-attractor at-attracted at-revolver at-center

attracts attracts revolves revolves
attractor attracted revolver center
nucleus electron electron nucleus
at-nucleus at-electron
Figure 1 7.2 . Sample analogs for the solar-system and atom domains.
movements while they were doing so. The cognition and how it can control – and
data showed that subjects moved their eyes be determined by – processes such as eye
back and forth between the two stories as movements.
they read them and searched for the analogs.
The Salvucci and Anderson model was able
Summary
to predict the eye movement transitions.
This is a critical study because it shows This production-system model of analogy
how analogy is dynamically integrated with specifies a set of production rules that
Table 1 7.5. Knowledge Structures and Mechanisms Used in a Production-Systems Model of

Analogy Making
√ √
Declarative chunks
√
Production rules
implement a path-mapping process through crease, participants’ performance declines.

declaratively represented source and target Moreover, working-memory limitations ap-
structures. That is, the model posits that pear to differ across people such that some
analogies are made and used via an ex- people show a more striking decrease in per-
plicit process of path mapping that is in- formance as a function of task demands than
fluenced by the relative activation levels of others (see also Morrison, Chap. 1 9).
the elements to be mapped. The subsym- Each of the four production systems dis-
bolic mechanisms governing declarative re- cussed thus far has an account for the impact
trieval specify which parts of those declara- of working-memory demands on cognitive
tive structures will be retrieved and when. processing (see Miyake & Shah, 1 999). EPIC
In this way, the model makes specific, quan- implements Baddeley’s articulatory loop via
titative predictions about the results of anal- production rules acting on the auditory store
ogy making and its time course (as observed and vocal/motor processor. These produc-
through eye movement data). Although tion rules implement strategies for rehearsal
analogy making is a process that produces and recall and are constrained by the pro-
new knowledge – the mapping, which in cessing features of the modules they engage
turn can be used to produce new inferences – (e.g., all-or-none decay of items from the au-
the process of analogy usually occurs in a ditory store and time to re-read an item by
single trial without much learning. Thus, Ta- the vocal/motor processor). In contrast, Soar
ble 1 7.5 highlights that this model of analogy assumes no a priori limit to working memory
making draws on three of the four perfor- through its dynamic memory.3 Rather, limi-
mance mechanisms in ACT-R. tations arise when multiple levels of process-
ing are necessary to establish multiple sub-
goals to handle a sequence of impasses. In
Working Memory the CAPS architecture(s), working-memory
limitations are captured through a limita-
Just as the previous section’s model of anal- tion in the amount of activation that can
ogy makes heavy use of declarative knowl- propagate through the system: When less
edge and corresponding mechanisms, so activation is available, production-rule fir-
does this section’s model of working mem- ing takes more processing cycles. CAPS
ory. Working memory has been implicated has been used to model different pat-
in the performance of such diverse tasks terns of sensitivity to working-memory de-
as verbal reasoning and prose comprehen- mands among groups of individuals with
sion (Baddeley & Hitch, 1 974), sentence low, medium, and high capacity (e.g., Just &
processing (Just & Carpenter, 1 992), free Carpenter, 1 992).
recall learning (Baddeley & Hitch, 1 977), In ACT-R, working-memory limitations
prospective memory (Marsh & Hicks, 1 998), are imposed via a limitation to the amount
and note-taking and writing (Engle, 1 994). of attention that can be focused on the cur-
This research has suggested that working- rent goal. This attentional activation (also
memory resources are limited because, as called source activation) serves to maintain
working-memory demands of a task in- elements of the goal as highly active, activate
above their resting levels any goal-relevant

facts in declarative memory, and suppress
below their resting levels any facts negatively
associated with the current goal. Although
they sound similar, the CAPS and ACT-R
limitations in activation are quite different.
CAPS directly limits total activation in the
system, whereas ACT-R limits the ability
to differentially activate goal-relevant infor-
mation above goal-irrelevant information.
In other words, greater source activation in
ACT-R is akin to having a better signal-to-
noise ratio for retrieving facts. It is worth
noting that the working-memory limitations
in both ACT-R and CAPS are imposed as Figure 1 7.3. Graphic illustration of a Modified
constraints on a particular model parameter, Digit Span task trial with a memory set of size 3 .
whereas in other working-memory accounts The differences in the positions of the characters
(e.g., SOAR) the connectionist system LISA on-screen have been exaggerated for clarity.
(Hummel & Holyoak, 1 997) and, to some
degree EPIC, these limitations emerge as a
natural consequence of general processing. An ACT-R model of the MODS task
In this section, we demonstrate how im- successfully fits individual participant’s data
plementation of working memory in ACT-R as a function of set size (Figure 1 7.4) and
can be used to estimate individuals’ working- as a function of serial position for the set
memory capacity from performance on one size six trials (Figure 1 7.5 ) by only vary-
task (call it Task A) and then make accu- ing the source-activation parameter (Daily,
rate zero-parameter predictions of those in- Lovett, & Reder, 2001 ). This suggests that
dividuals’ performance on other tasks – B, C, source activation presents a reasonable im-
and so on. Task A is a Modified Digit Span plementation of working memory that can
task (MODS) designed as an isomorph of the explain the variation in individuals’ MODS
reading span (Daneman & Carpenter, 1 980) performance. Moreover, because source ac-
and the operation span (Turner & Engle, tivation plays the same role in all ACT-R
1 989). In this task, participants perform dual models, this allows for predictions to be
tasks of reading various presented charac- made for the same participants on other
ters and memorizing the exact order of dig- tasks by plugging each participant’s esti-
its only. Figure 1 7.3 shows a sample MODS mated source-activation parameter into the
trial in which the participant would read other task models. In Lovett, Daily, and
“a j 2 b i e 6 c f 8” and then recall the digits in Reder (2000), this is accomplished for the
order (2 6 8). Because the task is fast paced n-back task. Specifically, individual partici-
(there is little time for idiosyncratic strate- pant estimates of source activation were de-
gies), it draws on skills that are highly prac- rived from their MODS task performance
ticed (there is little chance for skill or knowl- and then used to make zero-parameter,
edge differences), and both aspects of the individual participant predictions on the
task are monitored (there is little opportu- n-back task.
nity for different levels of task compliance), The n-back task is a continuous trial
most of the variation in performance on this paradigm in which, for a given block of trials,
task should be attributable to differences in participants are asked to respond to whether
participants’ fundamental processing capac- each letter stimulus is a repeat of the stim-
ities. For our modeling purposes, we take this ulus “n” trials back (e.g., Braver et al., 1 997;
to be variation in source activation. Cohen et al., 1 994). For example, suppose
Subject 221 Subject 201

W = 0.7 W = 0.9
1.0 1.0
0.9 0.9
0.8 0.8
0.7 0.7
0.6 0.6
0.5 0.5
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
0.0 0.0
3 4 5 6 3 4 5 6
Memory Set Size Memory Set Size
W = 1.0 W = 1.1
1.0 1.0
0.9 0.9
0.8 0.8
0.7 0.7
0.6 0.6
0.5 0.5
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
0.0 0.0
3 4 5 6 3 4 5 6
Memory Set Size Memory Set Size
Figure 1 7.4. Model fits for four representative subjects from Daily et al.
(1 999). Filled symbols are subject data; open symbols are the model’s
predictions.
the participant saw the stimuli “U E E R E tions are represented by a fixed amount of
K L L”. In a “1 -back” block, a participant source activation, propagated from the cur-
should say “yes” to the third and last stimu- rent focus of attention to increase the acti-
lus and “no” elsewhere, whereas in a “2-back” vation of goal-relevant items and to decrease
block, the participant should say “yes” to the the activation of goal-irrelevant items. The
fifth stimulus and “no” elsewhere. As “n” in- larger this source activation for a given in-
creases, the working memory demands of dividual, the greater the degree of facilita-
the task increase and, not suprisingly, per- tion (suppression) of goal-relevant (irrele-
formance degrades. Figure 1 7.6 shows high- vant) items. This leads to direct performance
fidelity modeling fits at the individual par- implications as a function of source activa-
ticipant level in the n-back task by using the tion, plus there are indirect effects in the
individualized source activation parameter model (e.g., more rehearsals are possible be-
values that were estimated from the same cause of faster retrievals with high source ac-
participants’ MODS performance. tivation) that can further the implications.
In sum, this working-memory model relies
most heavily on the relative activation lev-
Summary
els of declarative chunks (both those that
This model of working memory includes are part of the initial model and those that
production rules to perform the various tasks are newly acquired as part of task perfor-
studied. Across all tasks, the ACT-R archi- mance); this is highlighted by the check
tecture provides a single theory of working marks filling the declarative chunks row in
memory in which working-memory limita- Table 1 7.6.
Table 1 7.6. Knowledge Structures and Mechanisms Used in a Production-Systems Model of

Working Memory
√ √ √
Declarative chunks
√ √
Production rules
Categorization revolution during which theorists argued

for various hypothesis-testing theories (e.g.,
Research on human category learning has a Trabasso & Bower, 1 964; Levine, 1 975 ). The
history that extends back at least to Hull’s hypothesis-testing theories were based on
(1 920) study of learning to categorize Chi- research with stimuli that had a very simple,
nese symbols and his conclusions in favor often one-dimensional categorical structure.
of an associative learning proposal. It was The 1 970s saw a renewed interest in more
an important domain early in the cognitive complex, fuzzy categories and proposals for

W = 0.7 W = 0.9
1.0 1.0
0.9 0.9
Proportion Conect
0.8 0.8
0.7 0.7
0.6 0.6
0.5 0.5
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
0.0 0.0
1 2 3 4 5 6 1 2 3 4 5 6
Serial Serial
Subject 211 Position Subject 203 Position
W = 1.0 W = 1.1
1.0 1.0
0.9
Proportion Conect
0.9
0.8 0.8
0.7 0.7
0.6 0.6
0.5 0.5
0.4 0.4
0.3 0.3
0.2 0.2
0.1 0.1
0.0 0.0
1 2 3 4 5 6 1 2 3 4 5 6
Serial Serial
Position Position
Figure 1 7.5. Fits to the serial position data (largest set size only) for four typical
subjects. Filled symbols are subject data; open symbols are the model’s
predictions.
prototype theories (Reed, 1 972; Rosch, strengthened by each approach. The sub-
1 975 ) and exemplar theories (e.g., Medin symbolic components of the architecture de-
& Schaffer, 1 978). The rise of connection- termine which production rules and declar-
ist models resulted in the proposal of asso- ative structures are retrieved at any time.
ciative theories (e.g., Gluck & Bower, 1 988) The component of the model incorpo-
not that different from the original Hull rating an EBRW approach retrieves past in-
proposal. Whereas the original research fo- stances from memory as a function of their
cused on accuracy data, new emphasis has similarity to the current stimulus. This de-
been on latency data to help choose among pends critically on the ability of the ACT-R
theories (e.g., Lamberts, 1 998; Nosofsky system to retrieve partially matching traces.
& Palmeri, 1 997). Recently, neuro-imaging Specifically, the probability of retrieving a
and other cognitive neuroscience data have memory in ACT-R is a function of how sim-
been recruited to try to decide among al- ilar it is to the memory probe. Anderson and
ternative theories (e.g., Ashby, et al., 1 998; Betz (2001 ) show that this retrieval function
Smith, Patalano, & Jonides, 1 998). Impres- yields a similar, but not identical, selection
sive growth has been attained in the char- rule to that used in the original Nosfosky
acterizations of the phenomena in category and Palmeri formulation. In addition, the
learning (see Medin and Rips, Chap. 3 ). ACT-R mechanism for chunk strengthening
However, the field does not seem any closer favors the retrieval of more frequently pre-
to coming to consensus regarding what “the” sented items and therefore produces a speed
mechanism of category learning is. increase similar to that in EBRW (which
Anderson and Betz (2001 ) produced a uses multiple traces and a Logan (1 988) race
production system model that reflected the process). Although the original EBRW and
belief that this contest of theories was mis- the ACT-R implementation are not identi-
placed and that different mechanisms were cal, they prove largely indistinguishable in
being used to different degrees in differ- their predictions. This near-equivalence is
ent experiments. In particular, they imple- strongly dependent on the pre-existing sub-
mented two alternative models in ACT-R symbolic processes built into ACT-R.
that have been advanced for categorization – The component of the ACT-R model
Nosofsky, Palmeri, and McKinley’s (1 994) implementing a RULEX approach depends
rule-plus-exception (RULEX) model and more on the symbolic production-level sys-
Nosofsky and Palmeri’s (1 997) exemplar- tem because the actual logic of hypothesis
based random-walk (EBRW) model. The testing in RULEX is quite complex (e.g., dif-
first model proposes that subjects store ex- ferent rules specify when to settle on a hy-
plicit rules for category membership and pothesis, when to switch from single dimen-
possible exceptions. The EBRW model pro- sion to multiple dimension rules, and when
poses that subjects retrieve instances that and how to form exceptions). Nevertheless,
are similar to the test stimulus and assign the subsymbolic level of ACT-R, which gov-
the stimulus to the category that has the erns the selection among production rules
most retrieved exemplars after exceeding based on their ever-changing utility values,
a particular threshold. Whereas the origi- is essential for this model component to cap-
nal models are mathematical characteriza- ture the randomness of RULEX. Indeed, this
tions of participants’ behavior, the ACT-R noisy selection process enables this model
model is a computational system that actu- component to reproduce the wide variety of
ally performs the task. Production rules pro- hypotheses that subjects display.
vide the control structure for how the ACT- The Anderson and Betz effort is a rela-
R model approaches the task (e.g., whether tively successful integration of the two mod-
it employs a RULEX- or EBRW-based ap- els. Moreover, the effort adds value over
proach), whereas declarative memory stores the two original models. First, it establishes
the rules, exceptions, and examples used and that the two theories are not necessarily

W = 0.8 W = 0.9
Proportion Correct 1.00 1.00

0.95 0.95
0.90 0.90
0.85 0.85
0.80 0.80
0.75 0.75
0.70 0.70
0.65 0.65
0.60 0.60
0.55 0.55
0.50 0.50
0 1 2 3 0 1 2 3
N-back Condition N-back Condition
W = 1.0 W = 1.1
1.00 1.00
Proportion Correct
0.95 0.95
0.90 0.90
0.85 0.85
0.80 0.80
0.75 0.75
0.70 0.70
0.65 0.65
0.60 0.60
0.55 0.55
0.50 0.50
0 1 2 3 0 1 2 3
N-back Condition N-back Condition
Figure 1 7.6. N-back performance and model predictions for individual
participants where parameter estimates of individuals’ working-memory
capacities were derived from performance on the Modified Digit Span task.
in opposition and in fact reflect the same cation because the latter is more efficient.
underlying subsymbolic processes but with Figure 1 7.7 shows the tendency for exemplar
different symbolic control. Moreover, those use to increase in two of the models reported
subsymbolic processes are the same ones by Anderson & Betz. This increased exem-
that can be used to model other, very dif- plar use is consistent with reported results
ferent domains of human cognition. Also, of a strategy shift with extensive practice
because both categorization mechanisms are (Johansen & Palmeri, 2002).
able to sit within the same architecture,
Anderson and Betz were able to address the
issue of choice between the two mecha- Summary
nisms. This depends on the relative utility of This contemporary production-system
these two mechanisms. Anderson and Betz model of categorization integrates two app-
show that the mixture of the two strate- roaches (implemented as different sets of
gies is able to account for phenomena that cohabitating production rules) and chooses
cannot be accounted for by either strategy between them (based on the production
alone. They also show a natural tendency for rules’ learned utility values). In one ap-
this mixture of strategies to evolve from be- proach, production rules are the conduit
ing dominated by rule-based classification to for creating and accessing exemplars (im-
being dominated by instance-based classifi- plemented as declarative chunks) in a
Figure 1 7.7. Proportion exemplar use across blocks in two data sets modeled in
Anderson and Betz (2001 ).
context-sensitive and frequency-sensitive place (e.g., Catrambone, 1 996; Chi et al.,

way. In the other approach, production 1 989; VanLehn & Jones, 1 993 ). The other
rules create and manipulate declarative focuses on how skills are refined to achieve
rules for categorizing items. In all cases, the domain expertise (see also Novick & Bassok,
ACT-R subsymbolic learning mechanisms Chap. 1 4). Research in the former category
for production rules and declarative chunks has addressed issues of learning from in-
govern how these kinds of knowledge are struction, transfer, and induction. Research
used. Table 1 7.7 highlights this (see checks in the latter category has addressed issues
in the right column) as well as the fact of generalization, specialization, and auto-
that this model employs ACT-R’s symbolic maticity. A unified approach merges these
learning mechanism for declarative chunks. issues into a single explanation. Production-
systems models – particularly those that ad-
dress the question of production-rule learn-
Skill Learning ing – hold the promise of offering such an
explanation.
Research into skill learning can be roughly Among production-systems models, Soar
divided into two categories. One category holds the most parsimonious view of
focuses on how skills are learned in the first skill learning, with its single mechanism,
Table 1 7.7. This Model of Categorization Relies on Three Out of Four of ACT-R’s
Learning mechanisms
√ √ √ √
Declarative chunks
√ √ √
Production rules
Table 1 7.8. Two Parent Production Rules and the Learned Child Production Rule
Production A Production B Production C
When the goal is to add the When the goal is to add the When the goal is to add 2 to 5 ,
numbers x and y, then try to numbers x and y and the sum then update the goal with 7
retrieve the sum of x and y of x and y has been retrieved as as the answer
z, then update the goal with z
as the answer
chunking. Chunking is invoked whenever the knowledge. Greater power comes from
system encounters an impasse (i.e., when ex- the knowledge’s being faster, no longer
isting production rules do not directly spec- subject to retrieval failures, and incurring
ify the next step). At this point, the system lower working-memory load. Less explic-
creates a subgoal to solve the impasse by itness comes from the fact that the new
applying domain-general production rules. rule transforms a fully inspectable, declara-
Solving the impasse creates a new rule spe- tive fact into the body of a production rule,
cialized for that situation. A similar rule- where knowledge is not open to inspection.
learning process is employed by Cascade, We exemplify ACT-R’s production-rule
a model of skill acquisition that incorpo- learning in the context of an experimental
rates both the impasse-repair-reflect cycle paradigm in which rule-like knowledge is
and analogical problem solving (VanLehn, learned in many different forms (Anderson
1 999). After the new rule is learned, when & Fincham, 1 994; Anderson, Fincham, and
Cascade subsequently encounters the same Douglass, 1 997). This paradigm involves
(or a related) situation, it can apply the new teaching participants a number of sports
rule directly and avoid the extra processing. facts such as “Hockey was played on Satur-
These models employ specialization – mak- day at 3 pm and then on Monday at 1 pm.”
ing a new rule that is a specific version of its After committing these sports facts to mem-
parents – and composition – combining mul- ory, participants are told that each one con-
tiple production rules into one new rule. veys a particular pattern or rule for the game
ACT-R also has a production-rule learn- times for that sport (e.g., Hockey’s second
ing mechanism. This mechanism combines game time is always two days later and two
composition – merging two production rules hours earlier than its first). Participants are
that fire in sequence – and proceduralization – then given practice at using these sports facts
creating a new version of an existing produc- to solve new problems in which either the
tion rule in which the new version avoids first or second time is given and the other
fact retrieval by instantiating necessary in- must be predicted. Figure 1 7.8a shows the
formation directly into the new rule. For ex- speed-up in performance from Anderson &
ample, consider a pair of production rules Fincham (1 994) as participants practiced
that solve addition problems of the form this over three days (each “bump” occurred
x + y = ? by first retrieving the relevant at the beginning of a new day). Figure 1 7.8a
addition fact from memory and then using also shows the predictions of an ACT-R sim-
this fact to make a response (A and B in ulation (Taatgen and Wallach, 2002) that in-
Table 1 7.8). When these production rules volves four representations of the sports-fact
are applied to the problem 2 + 5 = ?, a single knowledge.
production rule is learned (C in Table 1 7.8) Figure 1 7.8b tracks the contribution of
that combines the two steps into one but these four sources of knowledge over the
is specific to the case of 2 = 5 . This mech- three days. The initial representation was
anism treats skill learning as a ubiquitous simply the set of eight studied sports
process of building more specific, more facts represented as declarative facts (see
powerful, and less explicit problem-solving first row of Table 1 7.9). Specifically, each
Table 1 7.9. Model’s Different Representations of the Sports Facts from Anderson and Fincham (1 994)
Declarative vs.
Knowledge Type Production How Generated Sports No. of Steps Required
Original sports fact Declarative Original study 4 ≈20
components
General relationships Declarative Analogy on original 2 ≈1 0
sports fact
components
Procedural relation Production Production compilation 4 ≈6
rule on relationships
Studied instance Declarative Result of previous 2 for each 2
(& often repeated) example
example
sports fact was represented in terms of four and Anderson and Fincham showed that
interrelated chunks to capture the two days such asymmetry develops with extensive
and two times for that sport (e.g., “Hockey’s practice.
first event day was Saturday”, “Hockey’s first The third new representation is a specific
event time was 3 ”, “Hockey’s second event instance representing the solution to a par-
day was Monday”, “Hockey’s second event ticular previous (and often repeated) prob-
time was 1 ”). To solve problems using these lem. This knowledge can complete a new
facts, the model was endowed with a set problem in just two steps (one each for the
of production rules representing the weak day and time). However, it is specific to a
methods of direct retrieval (applicable for particular problem and is only generated af-
the original facts) and analogy. ter the preceding forms of knowledge have
From this initial knowledge base, the paved its way. It predicts that participants
model generated the other three representa- will be faster on frequently repeated prob-
tions of the sports-fact knowledge. The first lems, and Anderson, Fincham, and Dou-
of these represents the rule-like relationships glass (1 997) provide evidence for such item-
of each original sports fact as two declarative specific learning.
chunks (e.g., “Hockey’s day relationship is
+2”, and “Hockey’s time relationship is
Summary
−2”). The model produces this declaratively
represented generalization as a byproduct of The most noteworthy aspect of this
the analogizing process (see second row of production-systems model of skill learning
Table 1 7.9). Once these generalized rela- is that it posits multiple, overlapping stages
tionships are derived, applying them to a in the development of a new skill, some of
new problem is much simpler than solving which represent the new skill knowledge in
by analogy. The second new representation production-rule form and some of which do
of knowledge comes in true production-rule not. Because of the acquisition of new pro-
form. Specifically, a new production rule duction rules and new declarative chunks,
is learned that merges the steps involved the model relies on both symbolic learning
in applying the declarative generalizations mechanisms in ACT-R. In addition, these
just mentioned. Note that this production new knowledge representations are refined
rule is specialized to the sport and direction and strengthened through experience, draw-
(time 1 → time 2 or vice versa) under which ing on ACT-R’s subsymbolic learning mech-
it was generated. Such a directional produc- anisms. Finally, the model chooses among
tion rule should show faster performance the different knowledge representations via
for problems in the practiced direction, the subsymbolic performance mechanisms:
40
35
Data
30
Prediction
Latency (sec)
25
20
15
10
0
0 20 40 60 80 100 120
Trial
Figure 1 7.8. Latency to respond across trials in each session in Anderson and
Fincham, 1 994 (panel a), and proportion of simulation runs in which particular
knowledge representations were used across trials in Taatgen and Wallach, 2002
(panel b).
As declarative representations are strength- Language Learning: Past Tense

ened through use, those with higher activa-
tion will tend to get retrieved, and as new The learning of the English past tense is an-
production rules are used and are successful, other domain in which symbolic and sub-
those with higher utilities will tend to get symbolic models have clashed. The appear-
chosen (over more generic production rules ance of over-regularization errors in chil-
that employ declarative representations). In dren’s past tense (e.g., go–goed as opposed
sum, this model draws on all eight mecha- to go–went) had been originally taken as
nisms presented in Table 1 7.1 0. evidence (e.g., Brown, 1 973 ) that children
Table 1 7.1 0. Knowledge Structures and Mechanisms Used in Production-Systems Model of

Skill Learning
√ √ √ √
Declarative chunks
√ √ √ √
Production rules
were acquiring abstract rules. However, mation driving the past-tense form in the
Rumelhart and McClelland (1 987) showed retrieved source is applied to the tar-
that by learning associations between the get. Eventually, through the production-rule
phonological representations of stems and learning mechanisms in ACT-R, the analogy
past tense it was possible to produce a process will be converted into a production
model that made overgeneralizations with- rule that generatively applies the past-tense
out building any rules into it. It was able rule. Once the past-tense rule is learned, the
to account for the U-shaped learning func- generation of past tenses will be determined
tion demonstrated by children by which they largely by a competition between the gen-
first do not produce such overgeneralization, eral rule and retrieval of specific cases. Thus,
then do, and finally, gradually eliminate the ACT-R has basically a dual-route model of
overgeneralizations. This attracted a great past-tense generation in which both routes
many critiques and, although the fundamen- are implemented by production rules. The
tal demonstration of generalization without rule-based approach depends on general
rules stands, it is acknowledged by all to be production rules whereas the exemplar ap-
seriously flawed as a model of the process proach depends on the retrieval of declara-
of past-tense generation by children. Many tive chunks by production rules that imple-
more recent and more adequate connection- ment an instance-based strategy.
ist models (some reviewed in Elman et al., Figure 1 7.9 graphically displays the vari-
1 996) have been proposed, and many of ety of ways this model can generate the past
these have tried to use the backpropogation tense. Although all of these options are im-
learning algorithm. plemented in ACT-R production rules, only
This would seem like an appropriate do- the two rightmost options represent the ap-
main for production-system models, and plication of general past-tense rules (e.g., add
Taatgen and Anderson (2002) have pro- “ed”). The second and third options initi-
duced a successful model of these phenom- ate procedures for retrieving a memory trace
ena. Significantly, they show that one can ac- that can then be applied directly or by anal-
count for past-tense learning with a similar ogy to the current situation.
dual mechanism model like that of Ander- The general past-tense rule, once discov-
son and Betz (2001 ). The model posits that ered by analogy, gradually enters the com-
children initially approach the task of past- petition as the system learns that this new
tense generation with two strategies. Given rule is widely applicable. This gradual entry,
a particular word like “give,” they can either which depends on ACT-R’s subsymbolic
try to retrieve the past tense for that word or utility-learning mechanisms, is responsi-
they can try to retrieve some other example ble for the onset of overgeneralization. Al-
of a past tense (e.g., live–lived) and try to though this onset is not all-or-none in either
apply this by analogy to the current case. In the model or the data, it is a relatively rapid
the case of analogy, previously encountered transition in both model and data and cor-
present–past tense pairs serve as potential responds to the first turn in the U-shaped
sources, and a source that has a present tense function. However, as this is happening, the
form similar to the target’s present tense ACT-R model is encountering and strength-
form will be retrieved. Then, the transfor- ening the declarative representations of
Other rules
(phonetically defined)
Do nothing Retrieve exact Analogize from Apply general

,,
exemplar distinct exemplar ,,
rule: Add ed
Figure 1 7.9. Different choices the model can make in generating the past tense.
Each option is executed by the firing of a production rule, but only the two
rightmost options actually implement a generalized rule. ACT-R’s
production-rule competition and learning mechanisms govern the model’s
selection among these options.
exceptions to the general rule. Retrieval the utility calculations of ACT-R. Indeed, in
of the exceptions comes to counteract another model that basically invents its own
the overgeneralizations. Retrieval of excep- past-tense grammar without input from the
tions is preferred because they tend to environment, Taatgen showed that it will de-
be shorter and phonetically more regu- velop one or more past-tense rules for low-
lar (Burzio, 2002) than regular past tenses. frequency words but will tend to adopt more
Growth in this retrieval process corresponds efficient irregular forms for high-frequency
to the second turn in the U-shaped function words. In the ACT-R economy the greater
and is much more gradual – again, both in phonological efficiency of the irregular form
model and data. justifies its maintenance in declarative mem-
Note that the Taatgen model, unlike ory if it is of sufficiently high frequency.
many other past-tense models, does not Note that the model receives no feed-
make artificial assumptions about frequency back on the past tenses it generates un-
of exposure but learns given a presentation like most models but in apparent correspon-
schedule of words (both from the environ- dence with the facts about child language
ment and its own generations) like that ac- learning. However, it receives input from the
tually encountered by children. Its ability environment in the form of the past tenses
to reproduce the relatively rapid onset of it hears, and this input influences the base-
overgeneralization and slow extinction de- level activation of the past-tense forms in
pends critically on both its symbolic and declarative memory. The model also uses
subsymbolic learning mechanisms. Symbol- its own past-tense generations as input to
ically, it is learning general production rules declarative memory and can learn its own er-
and declarative representations of excep- rors (a phenomenon also noted in cognitive
tions. Subsymbolically, it is learning the util- arithmetic by Siegler, 1 988). The amount of
ity of these production rules and the activa- overgeneralization displayed by the model
tion strengths of the declarative chunks. is sensitive to the ratio of input it receives
Beyond just reproducing the U-shaped from the environment to its own past-tense
function, the ACT-R model explains why generations.
exceptions should be high-frequency words.
There are two aspects to this explanation.
Summary
First, only high-frequency words develop
enough base-level activation to be retrieved. Although this model of past-tense gener-
Indeed the theory predicts how frequent ation fully depends on the existence (and
a word has to be to maintain an excep- emergence) of rules and symbols, it also criti-
tion. Less obviously, the model explains cally depends on the subsymbolic properties
why so many high-frequency words actu- of ACT-R to produce the observed graded
ally end up as exceptions. This is because the effects. Table 1 7.1 1 highlights the fact that
greater phonological efficiency of the irregu- this model relies on learning of both declar-
lar form promotes its adoption according to ative and procedural knowledge at both the
symbolic and subsymbolic level. This eclec- Another contribution made by several
tic position enables the model to achieve of the models is specifying how multiple
a number of other features not attained by strategic approaches to a given task can be
many other models: integrated. Indeed, a common but often un-
deremphasized feature of high-level cog-
1 . It does not have to rely on artificial as-
nitive tasks is that people can approach
sumptions about presentation frequency.
them in so many ways. The problem-solving
2. It does not need corrective feedback on its model addresses this issue of choice directly
own generations. and illustrates a modern interpretation of
3 . It explains why irregular forms tend to be production-rule conflict resolution. Specif-
high frequency and why high-frequency ically, this model (along with the categoriza-
words tend to be irregular. tion, skill-learning, and past-tense–learning
4. It correctly predicts that novel words will models) demonstrates that a noisy selection
receive regular past tenses. of the production rule with highest utility
5 . It predicts the gradual onset of overgen- (where utility is naturally learned through
eralization and its much more gradual experience by the system) works well to
extinction. choose among different strategies.
A related contribution made by some
of these models is making clear that rule-
like thinking is not always best represented
Conclusions and Future Directions in terms of production rules. The categoriza-
tion, skill-learning, and past-tense–learning
This chapter describes six production- models all use multiple strategic approaches;
systems models accounting for six different in the latter two models, one of the ap-
areas of cognition: problem-solving choice, proaches is based on production-rule repre-
analogy making, working memory, catego- sentations of knowledge and another is not
rization, skill learning, and past-tense learn- based on production-rule representations of
ing. In some cases, an important contri- knowledge. Together, the two representa-
bution of the model lies in specifying a tional forms complement each other in a way
production system that implements a fairly that accounts for the variability in people’s
general reasoning strategy (e.g., analogy behavior. Accounting for variability is prob-
making and categorization). The analogy ably the most notable contribution of the
model specifies a path-mapping process as working-memory model given that it posits
a set of production rules. The categorization a theory of working-memory limitations that
model specifies two processes for categoriza- can be used to estimate individuals’ working-
tion – by rules (with exceptions) and by re- memory capacities and then predict other
trieving multiple category exemplars – both task performance on that basis.
implemented as sets of production rules that What is most striking about these mod-
cohabit a single production system. In both els as a whole, however, is that they make
models, it is not only the production rules use of the same set of mechanisms for learn-
that govern model behavior but also sub- ing and using knowledge across such a dis-
symbolic quantities that influence how the parate set of tasks and that they use the same
production rules do their work. In the anal- two kinds of knowledge representations –
ogy model, subsymbolic activation levels as- production rules and declarative chunks. Al-
sociated with different declarative chunks though each model emphasizes a somewhat
influence which parts of the analogy will different subset of mechanisms (compare
get mapped and when; in the categoriza- Tables 1 7.4–1 7.7, 1 7.1 0, and 1 7.1 1 ), they all
tion model, subsymbolic utility levels asso- fit together in a unified architecture, just as
ciated with different production rules influ- the many processes of human cognition all
ence which categorization approach will be must fit together in the human brain. Like-
chosen and when. wise, modern productions systems offer an
Table 1 7.1 1 . This Model of Past-Tense Generation Relies on All Four of ACT-R’s Learning
mechanisms
√ √ √ √
Declarative chunks
√ √ √ √
Production rules
understanding of how the many components of ACT-R, similar levels of variability in per-
of cognition are integrated. formance among human controllers. Similar
successes are beginning to arise in real world
applications of production-systems models.
Production Systems into the Future
For instance, there is the Soar model that
Given the progress represented by the rela- managed to fly 5 0 Air Force training mis-
tively few models presented here, it is worth- sions (Jones et al., 1 999), and other exam-
while to speculate how production systems ples of production-systems models used in
will continue to be involved in future re- industry and military applications will likely
search on cognition. Two areas in which pro- become more the rule than the exception
duction systems have ventured in the past (pardon the pun!). Some of these will likely
few years are already showing initial levels come in the form of cognitive agents (e.g.,
of success and promise to play a large role in Best, Lebiere, and Scarpinatto, 2002) that act
future developments in modeling. in virtual worlds (e.g., for training purposes
One of these areas involves the devel- with humans) or real environments (e.g., in
opment of production-system models that coordination with robotic systems).
can handle complex tasks. Complexity can Another area of current growth in
arise in many dimensions, but one involves production-systems modeling that promises
the dynamic qualities of the task. Air-traffic to expand is the integration of production-
control is a dynamic task in that it requires systems models (i.e., their components and
continuous attention to changing stimuli and their predictions) with neuroimaging work.
changing task demands. It is complex in that With the growth of functional neuroimaging
it requires the integration of multiple areas as a means of studying cognition (see Goel,
of knowledge (e.g., different skills to handle Chap. 20), the field of cognitive modeling
the different situations) and the integration has another dependent measure for test-
of perceptual, motor, and cognitive processing models’ predictions. To take advantage
ing (e.g., actually working with a graphical of this additional constraint, however, the
and keyboard interface akin to what real air- model must posit some mapping between
traffic controllers use). A modeling compe- model output and neuroimaging results. A
tition to account for human data of vari- direct approach is to map brain locations to
ous sorts on an air-traffic-control task called model functions and then predict localiza-
AMBR (Agent-Based Modeling and Behav- tion of activation on the basis of model func-
ior Representation) set the bar high with re- tions. This basic approach can be elaborated
gard to modeling a complex, dynamic task to account for the time course of brain activ-
(Gluck & Pew, 2001 ). Several production- ity either at a coarse-grain size (e.g., predict-
system models, including one built within ing differential localization of activity early
ACT-R (Lebiere, Anderson, and Bothell, versus late in the task or between conditions)
2001 ) and one built within an EPIC–Soar or at a fine-grain size (e.g., within a single
combination (Chong and Wray, 2002), took trial). Both of these approaches have been
on the challenge and demonstrated success used in tasks ranging from language process-
in accounting for the number and type of ing (Just, Carpenter, and Varma, 1 999) to
errors of human controllers and, in the case equation solving (Anderson, et al., in press),
to planning (Fincham et al., 2002) to task Anderson, J. R., & Fincham, J. M. (1 994). Acquisi-
switching (Sohn et al., 2000). The goal of tion of procedural skills from examples. Journal
this work is to improve our understanding of Experimental Psychology: Learning, Memory,
of brain function and also of the mapping and Cognition, 2 0, 1 3 22–1 3 40.
to production-system models of thought. Anderson, J. R., Fincham, J. M, & Douglass, S.
Whereas production systems have tended to (1 997). The role of examples and rules in the
describe cognitive processes at a high level of acquisition of a cognitive skill. Journal of Ex-
abstraction, the trend has been toward more
Cognition, 2 2 , 25 9–277.
and more fine-grained models, so it is now
Anderson, J. R., & Lebiere, C. (1 998). The
becoming appropriate to consider the neu-
Atomic Components of Thought. Mahwah, NJ:
ral processing implications of many of the Erlbaum.
issues in production-system models.
Anderson, J. R., Qin, Y., Sohn, M-H., Stenger,
V. A. & Carter, C. S. (in press). An information-
processing model of the BOLD response in
Acknowledgments symbol manipulation tasks. Psychonomic Bul-
letin and Review.
The authors thank Stacey Becker, Norma Anderson, J. R., & Thompson, R. (1 989). Use of
Chang, and Niels Taatgen for comments on analogy in a production system architecture. In
an earlier draft. This work was partially spon- S. Vosniadou & A. Ortony (Eds.), Similarity and
sored by the Department of the Navy, Of- Analogy, (pp. 3 67–3 97). New York: Cambridge
University Press.
fice of Naval Research. Any opinions, find-
ings, and conclusions, or recommendations Ashby, F., Alfonso-Reese, L., Turken, A., & Wal-
dron, E. (1 998). A neuropsychological theory
expressed in this material, are those of the
of multiple systems in category learning. Psy-
authors and do not necessarily reflect the chological Review, 1 05 (3 ), 442–481 .
views of the Office of Naval Research.
Baddeley, A. D., & Hitch, G. (1 974). Working
memory. In G. H. Bower (Ed.), The Psychol-
ogy of Learning and Motivation, 8. New York:
Notes Academic Press.
Baddeley, A. D., & Hitch, G. (1 977). Recency re-
1 . Interestingly, recent production-system mod- examined. In S. Dornic (Ed.), Attention and per-
els have returned to embrace an individual formance VI. Hillsdale, NJ: Erlbaum.
participant approach with quantitative analy- Best, B., Lebiere, C., & Scarpinatto, C. (2002). A
ses (e.g., Daily, Lovett, & Reder, 2001 ; Lovett, model of synthetic opponents in MOUT train-
Daily, & Reder, 2000). ing simulations using the ACT-R cognitive ar-
2. Asynchronous parallelism means that each per- chitecture. In Proceedings of the Eleventh Con-
ceptual/motor module can work in parallel ference on Computer Generated Forces and
with others in such a way that the actions of Behavior Representation. Orlando, FL.
each need not be synchronized with actions of Braver, T. S., Cohen, J. D., Nystrom, L. E.,
the others. Jonides, J., Smith, E. E., & Noll, D. C. (1 997).
3 . However, Young and Lewis (1 999) have A parametric study of prefrontal cortex involv-
posited that no more than two items with the ment in human working memory. Neuroimage,
same type of coding (e.g., phonological or syn- 5 , 49–62.
tactic) can be stored in dynamic memory at Brown, R. (1 973 ). A first language: The early stages
a time. Cambridge, MA: Harvard University Press.
Budiu, R. (2001 ) The role of background knowl-
edge in sentence processing. Doctoral Disserta-
References tion, Department of Computer Science, Carnegie
Mellon University. Pittsburgh, PA.
Anderson, J. R., & Betz, J. (2001 ). A hybrid model Burzio, L. (2002). Missing players: Phonology
of categorization. Psychonomic Bulletin and Re- and the past-tense debate. Lingua, 1 1 2 , 1 5 7–
view, 8(4), 629–647. 1 99.
Catrambone, R. (1 996). Generalizing solution Gluck, M. A., & Bower, G. H. (1 988). From
procedures learned from examples. Journal conditioning to category learning: An adap-
of Experimental Psychology: Learning, Memory, tive network model. Journal of Experimental
and Cognition, 2 2 , 1 020–1 03 1 . Psychology: General, 1 1 7, 225 –244.
Chi, M. T. H., Bassok, M., Lewis, M., Reimann, Gluck, K. A., & Pew, R. W. (2001 ). Overview of
P., & Glaser, R. (1 989). Self-explanations: How the agent-based modeling and behavior repre-
students study and use examples in learening sentation (AMBR) model comparison project.
to solve problems. Cognitive Science, 1 3 , 1 45 – To appear in the Proceedings of the Tenth Con-
1 82. ference on Computere Generated Forces and Be-
Chong, R. S., & Wray, R. E. (2002). An EPIC-Soar havior Representation. Norfolk, VA.
model of concurrent performance on a cate- Hofstadter, D. R., & Mitchell, M. (1 994). An
gory learning and a simplified air traffic control overview of the Copycat project. In K. J.
task. In W. D. Gray & C. D. Schunn (Eds.) Pro- Holyoak & J. A. Barnden (Eds.), Advances
ceedings of the 2 0th Annual Conference of the in Connectionist and Neural Computation The-
Cognitive Science Society (pp. 21 –22). Mahwah, ory. Vol 2 : Analogical Connections (pp. 3 1 –1 1 2).
NJ: Erlbaum. Hillsdale, NJ: Erlbaum.
Cohen, J. D., Forman, S. D., Braver, T. S., Casey, Holyoak, K. J., & Thagard, P. R. (1 989). Analog-
B. J., Servan-Schreiber, D., & Noll, D. C. ical mapping by constraint satisfaction. Cogni-
(1 994). Activation of prefrontal cortex in a tive Science, 1 3 , 295 –3 5 5 .
non-spatial working memory task with func- Holyoak, K. J., & Thagard, P. R. (1 995 ). Men-
tional MRI. Human Brain Mapping, 1 , 293 – tal Leaps: Analogy in Creative Thought. Cam-
3 04. bridge, MA: MIT Press.
Daily, L. Z., Lovett, M. C., & Reder, L. M. (2001 ). Hull, C. (1 920). Quantitative aspects of the evo-
Modeling individual differences in working lution of concepts. Psychological Monographs,
memory performance: A source activation ac- 2 8(1 ), 1 –86.
count in ACT-R. Cognitive Science, 25 , 3 1 5 – Hummel, H. E., & Holyoak, K. J. (1 997). Dis-
353. tributed representations of structure: A theory
Daneman, M., & Carpenter, P. (1 980). Individ- of analogical access and mapping. Psychological
ual differences in working memory and read- Review, 1 04, 427–466.
ing. Journal of Verbal Learning and Verbal Be- Johansen, M. K., & Palmeri, T. J. (2002). Are
havior, 1 9, 45 0–466. there representational shifts during category
Elman, J. L., Bates, E. A., Johnson, M. H., learning? Cognitive Psychology, 45 , 482–5 5 3 .
& Karmiloff-Smith, A. (1 996). Rethinking Jones, R. M., Laird, J. E., Nielsen, P. E., Coul-
Innateness: A Connectionist Perspective on De- ter, K. J., Kenny, P. G., & Koss, F. V. (1 999).
velopment. Cambridge, MA: MIT Press. Automated intelligent pilots for combat flight
Fincham, J. M., Carter, C. S., van Veen, V., simulation. AI Magazine, 2 0(1 ), 27–41 .
Stenger, V. A., & Anderson, J. R. (2002). Neu- Just, M. A., & Carpenter, P. N. (1 992) A capac-
ral mechanisms of planning: A computational ity theory of comprehension: Individual differ-
analysis using event-related fMRI. Proceedings ences in working memory. Psychological Review,
of the National Academy of Sciences, 99(5 ), 99, 1 22–1 49.
3 3 46–3 3 5 1 . Just, M. A., Carpenter, P. A., & Varma, S. (1 999).
Engle, R. W. (1 994). Individual differences in Computational modeling of high-level cogni-
memory and implications for learning. In R. tion and brain function. Human Brain Map-
Sternberg (Ed.), Encyclopedia of Intelligence. ping, 8, 1 28–1 3 6.
New York: Macmillan. Kahneman, D., Slovic, P., & Tversky, A. (1 982).
Gentner, D. (1 983 ). Structure-mapping: A the- Judgment under Uncertainty: Heuristics and Bi-
oretical framework for analogy. Cognitive Sci- ases. New York: Cambridge University Press.
ence, 7, 1 5 5 –1 70. Kaplan, C. A., & Simon, H. A. (1 990). In search
Gentner, D. (1 989). The mechanisms of analog- of insight. Cognitive Psychology, 2 2 , 3 74–41 9.
ical learning. In S. Vosniadou & A. Ortony Keane, M. T., Ledgeway, T., & Duff, S. (1 994).
(Eds), Similarity and Analogical Reasoning Constraints on analogical mapping: A compari-
(pp. 1 99–241 ). Cambridge, UK: Cambridge son of three models. Cognitive Science, 1 8, 3 87–
University Press. 43 8.
Klahr, D. & Wallace, J. G. (1 976). Cognitive Lovett, M. C., Daily, L. Z., & Reder, L. M. (2000).
Development: An Information Processing View. A source activation theory of working memory:
Hillsdale, NJ: Erlbaum. Cross-task prediction of performance in ACT-
Kokinov, B. N. (1 998). Analogy is like cognition: R. Cognitive Systems Research, 1 , 99–1 1 8.
Dynamic, emergent, and context-sensitive. In Luchins, A. S. (1 942). Mechanization in problem
K. J. Holyoak, D. Gentner, & B. Kokinov (Eds.), solving. Psychological Monographs, 5 4, 248.
Advances in Analogy Research: Integration of Marsh, R. L., & Hicks, J. L. (1 998). Event-
Theory and Data Form the Cognitive, Compu- based prospective memory and executive
tational, and Neural Sciences. Sofia, Bulgaria: control of working memory. Journal of Experi-
NBU Press. mental Psychology: Learning, Memory, and Cog-
Laird, J. E., Newell, A., & Rosenbloom, P. S. nition, 2 4, 3 3 6–3 49.
(1 991 ). Soar: An architecture for general intel- Medin, D. L., & Schaffer, M. M. (1 978). Context
ligence. Artificial Intelligence, 47, 289–3 25 . theory of classification learning. Psychological
Lamberts, K. (1 998). The time course of cate- Review, 85 , 207–23 8.
gorization. Journal of Experimental Psychology: Meyer, D. E., & Kieras, D. E. (1 997). A com-
Learning, Memory, and Cognition, 2 4, 695 –71 1 . putational theory of executive cognitive pro-
Lebiere, C., & Anderson, J. R. (1 993 ). A con- cesses and multiple-task performance: Part 1 .
nectionist implementation of the ACT-R pro- Basic mechanisms. Psychological Review, 1 04,
duction system. In Proceedings of the Fifteenth 3 –65 .
Annual Meeting of the Cognitive Science Society, Miyake, A. & Shah, P. (1 999). Models of Work-
(pp. 63 5 –640). Hillsdale, NJ: Erlbaum. ing Memory: Mechanisms of Active Maintenance
Lebiere, C., Anderson, J. R., & Bothell, D. (2001 ). and Executive Control. New York: Cambridge
Multi-tasking and cognitive workload in an University Press.
ACT-R model of a simplified air traffic control Newell, A. (1 990). Unified Theories of Cognition.
task. To appear in the Proceedings of the Tenth Cambridge, UK: Cambridge University Press.
Conference on Computere Generated Forces and Newell, A., & Simon, H. A. (1 972). Human Prob-
Behavior Representation. Norfolk, VA. lem Solving. Englewood Cliffs, NJ: Prentice-
Lebiere, C., Gray, R., Salvucci, D., & West, R. Hall.
(2003 ). Choice and learning under uncertainty: Nosofsky, R. M., & Palmeri, T. J. (1 997).
A case study in baseball batting. Proceedings An exemplar-based random walk model of
of the 2 5 th Annual Conference of the Cognitive speeded classification. Psychological Review,
Science Society. Mahwah, NJ: Erlbaum. 1 04, 266–3 00.
Levine, M. (1 975 ). A Cognitive Theory of Learning. Nosofsky, R., Palmeri, T., & McKinley, S.
Hillsdale, NJ: Erlbaum. (1 994). Rule-plus-exception model of classi-
Lewis, R. L. (1 993 ). An Architecturally-based fication learning. Psychological Review, 101 (1 ),
Theory of Human Sentence Comprehension. 5 3 –79.
Ph.D. thesis, Carnegie Mellon University. Com- Novick, L. R., & Holyoak, K. J. (1 991 ). Math-
puter Science Technical Report CMU-CS-93 - ematical problem solving by analogy. Journal
2 2 6. of Experimental Psychology: Learning, Memory,
Logan, G. (1 988). Toward an instance theory of and Cognition, 1 7, 3 98–41 5 .
automatization. Psychological Review, 95 , 492– Reed, S. K. (1 972). Pattern recognition and cate-
5 27. gorization. Cognitive Psychology, 3 , 3 82–407.
Lovett, M. C. (1 998). Cognitive task analysis in Rosch, E. (1 975 ). Cognitive representations of
service of intelligent tutoring systems design: semantic categories. Journal of Experimental
A case study in statistics. In B. P. Goettl, H. M. Psychology: General, 1 04, 1 92–23 3 .
Halff, C. L. Redfield, & V. J. Shute (Eds.) Intelli- Ross, B. H. (1 989). Distinguishing types of super-
gent Tutoring Systems, Lecture Notes in Computer ficial similarities: Different effects on the access
Science, Vol. 1 45 2 (pp. 23 4–243 ). New York: and use of earlier problems. Journal of Experi-
Springer–Verlag. mental Psychology: Learning, Memory, and Cog-
Lovett, M. C., & Anderson, J. R. (1 996). His- nition, 1 5 , 45 6–468.
tory of success and current context in problem Rumelhart, D. E., & McClelland, J. L. (1 987).
solving: Combined influences on operator se- Learning the past tenses of English verbs: Im-
lection. Cognitive Psychology, 3 1 , 1 68–21 7. plicit rules or parallel distributed processing?
In B. MacWhinney (Ed.) Mechanisms of learning the past tense without feedback. Cog-
Language Acquisition (pp. 1 95 –248). Hillsdale, nition, 86(2 ), 1 23 –1 5 5 .
NJ: Erlbaum. Taatgen, N., & Wallach, D. (2002). Whether skill
Salvucci, D. D., & Anderson, J. R. (2001 ). Inte- acquisition is rule or instance based is deter-
grating analogical mapping and general prob- mined by the structure of the task. Cognitive
lem solving: The path-mapping theory. Cogni- Science Quarterly 2 , 1 63 –204.
tive Science, 2 5 , 67–1 1 0. Trabasso, T., & Bower, G. (1 964). Memory in
Siegler, R. S. (1 988). Individual differences in concept identification. Psychonomic Science, 1 ,
strategy choices: Good students, not-so-good 1 3 3 –1 3 4.
students, and perfectionists. Child Develop- Turner, M. L., & Engle, R. W. (1 989). Is work-
ment, 5 9, 83 3 –85 1 . ing memory capacity task dependent? Journal
Singley, Mark K. & Anderson, J. R. (1 989). The of Memory and Language, 2 8, 1 27–1 5 4.
Transfer of Cognitive Skill. Cambridge, MA: VanLehn, K. (1 999). Rule-learning events in the
Harvard University Press. acquisition of a complex skill: An evaluation
Smith, E., Patalano, A., & Jonides, J. (1 998). of cascade. Journal of the Learning Sciences, 8,
Alternative strategies of categorization. Cogni- 71 –1 25 .
tion, 65 , 1 67–1 96. VanLehn, K., & Jones, R. M. (1 993 ). Learning
Smolensky, P. (1 988). On the proper treatment of by explaining examples to oneself: A computa-
connectionism. Behavioral and Brain Sciences, tional model. In S. Chipman & A. L. Meyrowitz
1 1 , 1 –74. (Eds.), Foundations of Knowledge Acquisition:
Sohn, M. H., Ursu, S., Anderson, J. R., Stenger, Cognitive Models of Complex Learning (pp. 25 –
V. A., & Carter, C. S. (2000). The role of 82). Boston: Kluwer Academic.
prefrontal cortex and posterior parietal cor- Young, R. M., & Lewis, R. L. (1 999). The
tex in task-switching. Proceedings of National SOAR cognitive architecture and human work-
Academy of Science USA, 97(2 4), 1 3 448– ing memory. In A. Miyake, & P. Shah (Eds.),
1 3 45 3 . Models of Working Memory: Mechanisms of Ac-
Spellman, B. A., & Holyoak, K. J. (1 996). Prag- tive Maintenance and Executive Control. New
matics in analogical mapping. Cognitive Psy- York: Cambridge University Press.
chology, 3 1 , 3 07–3 46. Young, R. M. & O’Shea, T. (1 981 ). Errors in chil-
Taatgen, N. A. & Anderson, J. R. (2002). Why dren’s subtraction. Cognitive Science, 5 , 1 5 3 –
do children learn to say “broke”? A model of 1 77.
CHAPTER 1 8
Implicit Cognition and Thought
Leib Litman
Arthur S. Reber
Introduction the rules despite being unaware of the na-

ture or even the existence of the rules
The debate about the existence of an uncon- (Reber, 1 967, 1 993 ).
scious mental life is as old as psychology it- r Participants display analogic transfer from
self. An overview of the contemporary opin- one complex problem to another with-
ions about the nature of the unconscious out awareness of the commonalities in the
shows that, despite countless studies, the two sets of conditions (Schunn & Dunbar,
opinions expressed by the researchers work- 1 996).
ing in this field are as diverse today as they r Infants show a similar ability to dis-
were when the debates began. The spectrum criminate rule-governed patterns of pho-
of opinions range from a profound convic- netic and visual elements from those
tion that a significant aspect of mental life is that violate the rules (Gomez & Gerken,
embodied in the unconscious (Erdelyi, 1 985 ; 1 999, 2000; Saffran, Aslin, & Newport,
Pyszczynski, Greenberg, & Solomon, 1 999; 1 996).
Reber, 1 993 ) to opinions that stress that the r fMRI data show that brain areas that
very idea of a complex and abstract uncon- normally process particular stimuli are
scious mental life is virtually a contradiction activated even when the words are
of terms (Perruchet & Vinter, 2003 ; Shanks presented subliminally (Naccache &
& St. John, 1 994). Despite the divergence in Dehaene (2001 ).
the range of opinions, findings such as the r Amnesiacs, who have lost the ability to
following certainly suggest that the notion
form conscious memories, display im-
of unconscious thought has to be taken seri-
proved performance on a variety of tasks
ously, even by the most skeptical.
over time, suggesting that the past ex-
r Subjects learn to differentiate seemingly periences are unconsciously represented
nonsensical sequences of letters that fol- and are influencing thoughts and behav-
low complex rules from those that violate ior (Glisky & Schacter, 1 986).
431
r Patients show residual knowledge for that something is explicitly or consciously

words that were read to them while under represented in memory, for example, is to
anesthesia (Kihlstrom et al., 1 990). say that it can be consciously recalled or rec-
r Subjects with neurological disorders that ognized. When someone explicitly recalls a
limit their ability to perceive certain areas friend’s name, that name, at the time of re-
of the visual field nevertheless respond to call, is consciously represented.
stimuli that are presented in those areas Implicit thought on the other hand is un-
(Weiskrantz, 1 986). conscious, and the content of a memory is
considered to be implicit when it exerts its
No one of these findings is in itself con- influence on thought or action even though
clusive. Arguments have been made that at- it cannot be recalled. Claparède (1 91 1 /1 95 1 )
tack the assumptions and techniques used described the classic case of an amnesic pa-
in each case. In a recent exchange, Perruchet tient whom he pricked with a concealed
and Vinter (2003 ) debate these and other is- needle in his palm. The patient, of course,
sues with upwards of three dozen commen- forgot what happened almost immediately
tators and critics. after the incident in the sense that she no
Here, we present an overview of the ba- longer had explicit memory for the event
sic findings that support the existence of or even, for that matter, Claparède. How-
a sophisticated cognitive system that oper- ever, when he attempted to shake the pa-
ates largely independent of consciousness, tient’s hand some days later she, surpris-
discuss the criticisms mounted against these ingly, refused exclaiming, “One never knows
findings, and outline the various approaches what people carry around in their hands.”
that have been taken in an attempt to Experience with this patient, as well as hun-
overcome those critiques. Included in our dreds of others (Cohen & Squire, 1 980;
overview are areas such as implicit learning Schacter, 1 987; Scollville & Milner, 1 95 7),
and memory, subliminal perception, the role shows that memory for events can influence
of attention, and the impact implicit pro- both thought and behavior even when that
cesses have on problem solving and creativ- memory is no longer available for conscious
ity. Rather than providing a thorough review inspection.
of the findings in any one of these areas, our Describing implicit and explicit thought
focus is on developing a conceptual theme merely in terms of subjective states, how-
that subsumes the diverse field of the studies ever, is hardly sufficient to capture the dis-
involved in investigating implicit cognition. tinctions between them. Implicit represen-
tations are, at least in some ways, not only
unconscious, but are also thought of, at least
by some theorists (e.g. Perruchet & Vinter,
What Implicit Implies 2003 ), as being of a different form than con-
scious representations.
What, exactly, do we mean by implicit and Implicit knowledge may be stored in a dif-
explicit thought? Traditionally the terms “im- ferent form from when it appears in con-
plicit” and “explicit” have been treated as sciousness. Here’s an argument put forward
synonyms of the terms “conscious” and “un- by Perruchet and Vinter (2003 ): imagine a
conscious.” Implicit knowledge is defined as computer representation of a pencil. When
being: “unconscious, covert, tacit, hence of a picture of the pencil is presented on the
a process that takes place largely outside of computer screen the entire pencil is repre-
the awareness of the individual; the term is sented. However, off the screen, there is only
used in this sense to characterize cognitive a bit-wise representation that in no way re-
processes that operate independently of con- sembles its on-screen form. Inside the com-
sciousness” (Reber & Reber, 2001 ). The use puter you will not find a picture of a pencil as
of the terms implicit and explicit in this con- such. The bits representing the pencil might
text refers to states of consciousness. To say exist in very different parts of the hard drive
implicit cognition and thought 433
and, when taken together, will look nothing Some Thoughts on Methodology
like the picture of the pencil when it is pro-
cessed and brought up to the screen. By anal- How can we know exactly when a mental
ogy, tacit knowledge may not contain a full process is unconscious? In examining this
representation of objects – this can only hap- question it is useful to make a distinction be-
pen in consciousness – that is, on the screen. tween two stages of information processing:
This position is consistent with the Lockean encoding and retrieval. The importance of
notion that true mental representations are this distinction is that a suitable method-
only possible in consciousness. ology for demonstrating the implicitness at
Most computational models, in one way one point is not applicable at another. Con-
or another, have endorsed the perspective sider research with amnesiacs in which the
that unconscious representations are not evidence for tacitly held knowledge is found
identical to conscious ones. In Anderson’s at the retrieval stage (i.e., a patient per-
ACT-R model, implicit memory repre- forms better over time but does not remem-
sents subsymbolic information that oper- ber the training session) but not at encod-
ates by controlling access to explicit declara- ing (i.e., amnesiac patients are consciously
tive knowledge (Anderson & Lebiere, 2003 ; aware of what they are learning while they
Lovett & Anderson, Chap, 1 7). Various are learning it). Demonstrating that mem-
other models that are built on connec- ory is not used consciously at the time of re-
tionist architectures (Dienes, 1 992; Cleere- trieval entails a different methodology than
mans, 1 993 ; Servan-Shreiber, Cleeremans, that used to demonstrate that the encod-
& McClelland, 1 991 ) share the notion that ing process was unconscious. For encoding,
implicit knowledge is nonrule based and it is necessary to demonstrate that at the
nonsymbolic. moment of the presentation of the stimu-
The one thing that is clear here is that lus, the subject’s awareness of that stimulus
there is no consensus as to whether implicit was deficient.
knowledge can be symbolic. Our position is It turns out that demonstrating a lack of
that it can. We will have more to say about awareness at encoding is deeply problem-
the nature of the representations of im- atic. How do we know that the stimulus
plicit knowledge later. For now, keep in mind was really not consciously perceived in some,
these two nuanced entailments of implicit: perhaps minimal, way? Because the deci-
the conscious quality of knowledge and the sion as to whether the subject consciously
extent to which the knowledge is symboli- perceived the stimulus relies on one or an-
cally represented. Although often used inter- other form of subjective self-report, we are
changeably, the two senses do not perfectly stuck with having to rely on nonverifiable
overlap. When processes such as rule use measurement.
and symbol manipulation are discussed they
are typically assumed to be conscious, top-
Accessibility and Availability
down operations. However, there is no a pri-
ori reason to conclude that the unconscious Imagine a typical subliminal perception ex-
cannot manipulate symbols and use prede- periment in which subjects are given brief or
fined rules. We favor the notion that im- masked exposure to objects or words. Later,
plicit thought can be based on abstract rep- knowledge of the target is assessed either by
resentations and that such knowledge is not direct tests (what word did you see?) or by
only possible but is responsible for much of indirect tests (changes in response times over
the complexity and adaptiveness of human time). Note that there are actually two con-
behavior (Lovett & Anderson, Chap. 1 7). structs here: the accessibility of the informa-
However, much evidence has been pre- tion at encoding and the availability of the
sented both for and against this view, and stored knowledge sometime after presen-
in what follows we provide an overview of tation. Some (Brody, 1 989; Eriksen, 1 95 9)
this research. maintain that unconscious perception can
only be reliably established if the accessibil- whether new letter strings are “well-formed”
ity of the stimulus at the time of encoding is or not – that is, whether or not they conform
zero. In other words, there should be no dif- to the rules that generated the original set.
ference in accessibility of a stimulus between The clever twist that Mathews and his
the subject and a blind person. This criterion colleagues used was to stop their partici-
is very difficult (if not impossible) to achieve pants from time to time during the “well-
because it is always possible that something formedness” phase and ask them to expli-
was perceived consciously and chance per- cate, in as much detail as possible, what they
formance is attributable to subjects’ not be- knew and how they were making their de-
ing sufficiently confident to make a response cisions. Transcripts of their responses were
based on what little they did see. then given to four different yoked-control
Alternatively, as suggested by Erdelyi groups who, without having any learning ex-
(1 986, 2004), we can require that the avail- perience, were asked to classify the same
ability of the stimulus to consciousness be strings. If subjects were consciously aware of
greater than the extent to which that stim- the knowledge they had acquired and could
ulus was consciously accessible at the time communicate it, the yoked subjects should
of encoding. This approach has an impor- perform at the same level as the experimen-
tant advantage in that it reveals a critical but tal participants.
often unrecognized fact: the impact of un- Mathews et al. found that their exper-
conscious knowledge on behavior is a con- imental subjects could make reliable de-
tinuum and not an either/or issue. cisions on the very first day of the study
but were remarkably inept in communicat-
ing what they had learned – yoked sub-
The Implicit/Explicit Continuum
jects working with the Day 1 transcripts
It is erroneous to say that a behavior has to performed at chance. However, as the ex-
be explained in its entirety by either con- periment progressed, the experimental sub-
scious or unconscious input. Most cognitive jects’ ability to verbalize their knowledge
tasks, including perception, memory, prob- improved dramatically. In fact, the yoked
lem solving, and creativity are products of subjects who received the Day 4 transcripts
the influences of both conscious and uncon- made decisions nearly as well as the experi-
scious processes. The existence of conscious mental participants. Interestingly, the exper-
factors does not in any way preclude the imental subjects’ performance on the pri-
further influence of unconscious ones. The mary task didn’t improve significantly after
findings of two important experiments help the second day although their ability to ex-
make this point. plicate what they knew did. This is an ex-
In the first, Mathews et al. (1 989) had ample of how knowledge is encoded implic-
experimental subjects engage in an implicit itly but over time becomes explicit and can
learning task over a four-day period. The then be retrieved consciously. Another im-
study used what is known as an artificial plication of Mathews teams’ study is that the
grammar (AG). An example of a typical AG implicit and the explicit are bound up in a
is given in Figure 1 8.1 along with several let- delicate synergy and we would be wise to
ter strings that it can generate and a num- refrain from all-or-none distinctions.
ber of nongrammatical or not well-formed In the second study, Knowlton and Squire
strings that contain a single letter violation. (1 994) showed that even when conscious
It is apparent that the system is complex and, knowledge is fully available (i.e. in Erdelyi’s
as Mathews et al., found, not easy to de- [1 986] terms, accessibility equals availabil-
scribe. In the canonical AG learning study, ity) it doesn’t mean that it is necessar-
subjects memorize a number (perhaps 1 5 ily being utilized at all times. They found
or 20) of exemplary letter strings and then, that amnesic patients’ performance was in-
using what knowledge they acquired from distinguishable from normal controls on a
the learning phase, attempt to distinguish standard AG learning task, as described
Figure 1 8.1 . A typical artificial grammar used in many studies of implicit

learning. The grammar generates letter strings by following the arrows from the
input state (S1 ) to the terminal state (S6 ). Several examples of “well-formed”
strings are presented along with others that contain a violation of the grammar.
previously, suggesting that representations plicit knowledge and how each is manifested
of knowledge need not be held in a con- depends as much on the task demands as
scious form to be used to make decisions. on the accessibility of conscious knowledge.
However, when both groups were encour- Second, amnesic patients, whose neurolog-
aged to make decisions by utilizing any sim- ical injuries have compromised their abil-
ilarities between the test stimuli and those ity to form consciously accessible long-term
used during learning, the two groups dif- memories, can still carry out complex im-
fered. The normal controls showed a small plicit learning tasks. The balanced synergy
but significant improvement, whereas the is largely missing in this population, but
patient group’s performance, perhaps not the implicit system appears to be relatively
surprisingly, actually diminished. intact.
There are two implications of these stud- In short, it is both theoretically sounder
ies. First, in these tasks, normal subjects pos- and methodologically more plausible to look
sess a delicate balance of implicit and ex- at the impact of implicit knowledge as
operating along a continuum. Rather than Moray, 1 95 9). For example, Johnson and
ask whether or not a particular task was im- Wilson (1 980) presented ambiguous words
plicitly or explicitly performed, one should like “sock” to one ear while a disambiguat-
examine the extent to which both implicit ing word (“foot” or “punch”) was presented
and explicit factors are playing a role in the in the other. They found that the amount
behavior in question. With this framework of attention allocated to the encoded stimu-
in mind, let’s explore the several domains in lus was critical. When the instructions were
which unconscious mechanisms have been to attend to both channels, the word “foot”
examined. facilitated the interpretation of the ambigu-
ous homophone “sock.” But when attention
Consciousness at Encoding was directed to the channel in which the tar-
get words were presented, items in the unat-
diverted attention tended channel did not influence the per-
One interesting aspect of attention, at least ceived meaning of the targets.
for the purposes of implicit processes, is that Later studies, however, questioned this
attention and consciousness are highly corre- conclusion. Eich (1 984) presented subjects
lated. When something is being attended to, with homophones such as “fare/fair” in one
for example, the words of this sentence, the ear and a modifier of the less frequent mean-
object in the focus of attention becomes con- ing (e.g., “taxi”) in the other. Subjects were
scious. Of course, because of the limits on then given a recognition test for the mod-
attention, some, if not most, of the sensorial ifiers and were asked to spell the target
events in the outside world are not within words (“fare” or “fair”). Eich found a clear
the focus of our attention. When reading a impact of implicit memory. Despite being
book, we “tune out” much of the outside virtually at chance on the recognition task,
world such as conversations and, traffic. In- subjects showed a strong tendency to spell
deed, we ignore most of the events that are the test words according to their primed,
outside of the attentional focus. The ques- but less common, meaning. Similar find-
tion that almost asks itself is, What effect do ings were reported in a series of studies by
the unattended, nonconscious events in our Mulligan (1 997, 1 998) in which subjects
environment have on us? Do they get regis- memorized word lists while their attention
tered unconsciously in some fashion with- was diverted by a secondary task involving
out our awareness, or are the effects of repeating strings of digits of varying lengths.
unattended events trivial and only become Attentional load was manipulated by vary-
important when and if they are consciously ing the length of the digit string from three
attended to? to seven. Mulligan found that increasing
The effects of diverting attention from the attentional load impaired explicit mem-
the stimulus at encoding are usually stud- ory performance for the original words (us-
ied in the context of a dual-task paradigm in ing cued recall) but had essentially no ef-
which attention is diverted by a secondary fect on implicit memory (as measured by
stimulus (Morrison, Chap. 1 9). For example, stem completion). It seems therefore that
in the classic dichotic-listening task two dif- at-encoding stimuli have an impact on subse-
ferent messages are played, one to each ear. quent performance even when they are not
One message is attended to, the other not – consciously perceived.
although the secondary message often con-
tains important information. Afterwards, a
simple memory or priming task is used to under anesthesia
discover the effects of diverted attention. Although diverting attention certainly re-
The initial findings here suggested that, duces the likelihood of the stimuli’s be-
when attention is diverted from a stimu- ing consciously encoded, Kihlstrom et al.
lus, the effect of that stimulus is greatly (1 990) made quite certain that their input
reduced (Broadbent, 1 95 8; Cherry, 1 95 3 ; stimuli were not being attended to. Their
participants were surgical patients who were et al., 1 998) and numerical presenta-
presented with a repeating list of stimulus tions produce parietal activity (Naccache &
items while completely anesthetized. Af- Dehaene, 2001 ).
ter surgery, although patients had no ex- The studies with which we are most
plicit memory for the material, an implicit, comfortable are those that follow Erdelyi’s
free-association test showed that informa- (1 986, 2004) advice cited previously. En-
tion presented during anesthesia was ensure that there are separate and reliable mea-
coded. Shanks and St. John (1 994) criti- sures of accessible and available knowledge –
cized these and other studies on the grounds with the critical inequality being those
that they produced mixed findings and of- situations in which knowledge that is “ac-
ten used questionable methodology. How- cessible” by consciousness is less than knowl-
ever, recently, Merikle and Daneman (1 996) edge that is “available” and can be shown to
conducted a meta-analysis of 44 studies with have some (indirect or implicit) impact on
several thousand participants and concluded behavior. (For more on the issues of measur-
that, taken as a whole, these studies support ing unconscious knowledge, see Merikle and
the argument that items presented during Reingold, 1 992.) Two classic studies that ap-
anesthesia can have an impact on postsur- pear to satisfy this condition (Marcel, 1 983
gical tests. Questions still remain however and Kunst-Wilson & Zajonc, 1 980) are worth
with regard to the possibility that at least a closer look.
some of the subjects were partially conscious In an extended series of experiments,
during stimulus presentation. Marcel (1 983 ) showed that graphic (what
the word looks like) and semantic (what
the word means) information of subliminally
subliminal perception presented words can affect choice behavior.
Historically, subliminal perception studies One of Marcel’s standard protocols involved
have been controversial. They have ranged presenting subjects with two words, one sub-
from the embarrassing “eat popcorn – drink liminally and the other supraliminally. Af-
coke” hoax that was foisted on the pub- ter each presentation, subjects were asked
lic a half-century ago by an overzealous whether or not they saw a word. After the
advertising agent (Pratkanis, 1 992) to the pair was presented, they were asked whether
vigorously debated use of subliminal mes- the two words were physically and seman-
sages in psychotherapeutic settings (Silver- tically related. By systematically varying the
man, 1 983 ; Weinberger, 1 992). Admittedly, sub/supra-liminality of the stimuli, Marcel
much of the early work was suspect and has was able to explore the manner in which
been vigorously criticized (Eriksen, 1 95 9; implicitly and explicitly encoded stimuli af-
Holender, 1 986; Shanks & St. John, 1 994), fected subjects’ choices. The key finding was
and, as we noted earlier, there are numer- a threshold effect whereby subjects were at
ous methodological traps that await the un- chance in determining the presence or ab-
wary. However, the impact of subliminally sence of the subliminally presented word but
presented material on subsequent behavior could reliably report whether the two words
has now been replicated in literally hundreds were semantically and graphically similar.
of experiments and the evidence appears Marcel’s conclusion, based on the full series
to be convincing. Subliminal presentation of experiments, was that although there is a
can have an effect on emotional preferences gradual effect of awareness on performance,
(Kunst-Wilson & Zajonc, 1 980; Murphy & importantly, a complete lack of awareness
Zajonc, 1 993 ) and produce semantic prim- does not entirely remove that effect.
ing (Draine & Greenwald, 1 998). Moreover, In Kunst-Wilson and Zajonc’s (1 980) clas-
similarly undetectable stimuli have been sic study, subjects were subliminally pre-
shown to activate appropriate brain regions – sented with a set of irregular octagons. They
emotionally charged stimuli activate the were then shown pairs of octagons supral-
amygdala (Elliot & Dolan, 1 998; Whalen iminally, one of which was from the set
previously presented and one of which was nally. During testing, subjects were asked to
new. Subjects were asked both to select complete word stems such as MOT with the
the item they thought was presented before restriction that they not use any word they
and to pick the one they preferred. Kunst- thought might have been used in the sub-
Wilson and Zajonc found that, despite being liminal presentation phase. The logic here
at chance on the recognition task, subjects is clever. If the word was consciously per-
showed a preference for the subliminally ceived, the subjects should have been able to
presented octagons over the novel ones, refrain from using that word to complete the
demonstrating that affective preferences can word-stem. However if they did not see the
be influenced by events that were not con- word, then the subliminally presented word
sciously noticed (Kunst-Wilson & Zajonc, should have been used as often as the others.
1 980; Murphy and Zajonc, 1 993 ). The results showed that subjects were typi-
Elliot and Dolan (1 998) extended this cally not able to follow this instruction and
“subliminal mere exposure” effect and tended to use the subliminal primes.
showed that, in addition to preferring the In summary, the suggestion that atten-
previously presented items, different brain tion and consciousness is needed for the en-
regions were activated when old and novel coding of complex, semantically sensitive
stimuli were later presented supraliminally. events (Perruchet & Vinter, 2003 ; Shanks &
This finding is consistent with a large num- St. John, 1 994) is probably unwarranted.
ber of fMRI studies that suggest that implicit These studies, although not uniform in their
and explicit memory retrieval involves the conclusions, suggest that fairly sophisticated
activation of distinct brain regions (for a re- information about complex stimulus dis-
view see Cabeza & Nyberg (2000). Whalen plays can be picked up under severe atten-
et al. (1 998) have also demonstrated that tional load, when the material is presented
subliminally presented faces displaying fear- subliminally and, possibly, under anesthe-
ful emotions activated the amygdala despite sia. They also support the notion that this
a lack of subjective awareness of ever hav- information is not simply logged in some
ing seen those faces. Happy faces presented inert form but has an impact on memo-
for identical time periods had no effect on rial representations, choice behavior, and
these structures. decision making.
Finally, Naccache and Dehaene (2001 )
presented evidence of abstract representa-
tion in subliminal priming. Subjects were
Memory
asked to decide whether a target number
was bigger or smaller than 5 . Each target Virtually every complex living organism has
was preceded by a subliminal prime that was the ability to store the products of expe-
either spelled out (six) or presented in nu- rience to be accessed at some later time.
meric form (6). Subjects displayed faster re- People’s ability to store a seemingly endless
action times when the prime and target were array of episodes, facts, motor skills, and lin-
the same number – regardless of the num- guistic and social knowledge and to retrieve
ber’s form. In addition, fMRI data revealed the appropriate information rapidly and ap-
that the subliminal primes elicited the same propriately are remarkable phenomena –
parietal lobe activity as the supraliminal tar- and one that still remains something of a
gets, suggesting similar cortical processing. mystery. Cognitive investigations of memory
All of the studies above are open to the revealed early on that human memory is not
critique that some awareness of the stim- a single, unified phenomenon. There are dif-
uli might have contaminated the procedure ferent kinds of memories and each is instan-
(Holender, 1 986; Shanks & St. John, 1 994). tiated in a variety of ways. Our concern here
To counter this criticism, Debner and Jacoby is the extent to which memory processes
(1 994) used the process-dissociation proce- are modulated by conscious intentions or are
dure (Jacoby, 1 991 ). In their study, words implicit and operate outside the spotlight of
(e.g. MOTEL) were first presented sublimi- consciousness.
Conscious or explicit memory has tradi- age (see Squire, 1 992 for a review), is
tionally been studied using direct tests in that such people do not suffer from a
which participants are asked to consciously learning deficit, per se, but rather from
recall or recognize previously memorized an inability to consolidate new explicit,
items. The original assumption was that a or declarative, knowledge. Patients with
failure to recall or an inability to recognize MTL damage show no diminished abil-
an item is diagnostic of that item having been ity to recall episodes that occurred prior
forgotten. However, as we noted earlier, just to the trauma, they present a nearly nor-
because people cannot recall something does mal short-term memory profile, and, impor-
not necessarily mean the memory no longer tantly from our perspective, they show rel-
exists. In some ways, it is surprising that it atively intact implicit learning and memory.
took cognitive psychologists so long to ap- Indeed, a large literature has accumulated in
preciate this aspect of human memory. Early recent years showing that the performance
reports by neurologists such as Claparède of anterograde amnesiacs is virtually indis-
and Korsakoff implicated implicit represen- tinguishable from that of normals on a wide
tational systems and, lest we forget, Freudian variety of memory tasks including word-
psychoanalysis was founded on the existence stem completion (Warrington & Weiskrantz,
of nonretrievable memories that play a role 1 968; 1 974), fragment completion (Tulving,
in human behavior (Erdelyi, 1 985 ). Hayman, & MacDonald, 1 991 ), context sen-
The renewed interest in implicit mem- sitive memory (Schacter & Graf, 1 986),
ory was largely attributable to the discov- memory for letter strings generated by an
ery that amnesiac patients, despite being AG (Knowlton, Ramus, & Squire, 1 992;
compromised in their ability to form new Knowlton & Squire, 1 994), and recall of
explicit knowledge, can nevertheless ac- dot patterns (Knowlton & Squire, 1 994).
quire new information implicitly. The laying As Seger (1 994) argued, amnesiac patients
down of consciously retrievable, long-term provide the best empirical support for the
memories has been compellingly shown to proposition that knowledge that is not con-
be dependent on structures in the medial sciously accessible can still have a profound
temporal lobes (MTL), specifically the hip- influence on ongoing behavior.
pocampus (Squire, 1 995 ). When the hip- These discoveries gave rise to a num-
pocampus and its associated areas are dam- ber of significant advances in our under-
aged or destroyed, it becomes difficult and, standing of memory, both implicit and ex-
in extreme cases impossible, for new explicit. Reber (1 992a,b), Schacter (1 987) and
plicit memories to be formed. The dis- Squire (1 992) all argued that the human
covery of the critical role the MTL struc- memorial system can be fruitfully viewed as
tures play here was made in the case of though there were two distinct information-
HM, the first neurological patient to have processing systems – one declarative or
his hippocampus surgically removed (see explicit, the other procedural or implicit.
Corkin, 1 968; Milner, 1 962; Milner, Corkin, The explicit system was theorized to in-
& Teuber, 1 968; Squire, 1 992; Warrington clude declarative, conscious knowledge of
& Weiskrantz, 1 968). HM suffered from se- episodes, facts, and events, whereas the im-
vere, intractable epilepsy, the neural focal plicit system was assumed to be operating
point of which was in the MTL. To alleviate largely outside of consciousness and to in-
his multiple, daily seizures surgeons extir- clude implicit learning and memory, condi-
pated bilaterally the affected brain regions. tioning, and learning various skills and habits
Although the surgery was successful in stop- (sensorimotor learning). Although this dis-
ping the seizures, HM emerged from the tinction is probably a useful one in that it
procedure with profound, chronic antero- draws attention to the ways in which implicit
grade amnesia. and explicit functions can be dissociated,
The standard interpretation of HM, it is probably not the best stance to take
based on the now rather large number from a functionalist point of view. As Reber
of patients with similar neurological dam- (1 993 ) argued, we need to be wary of falling
into a “polarity” fallacy in which we treat two Learning

distinguishable systems that lie at the poles
Implicit learning is the process whereby or-
of a continuum as though they were onto-
ganisms acquire knowledge about the reg-
logically separate and distinct. It is almost
ularities of complex environments without
certainly the case that virtually everything
intending to do so and largely independently
interesting that human beings entails a del-
of conscious awareness of the nature of what
icate synergy between the implicit and the
was learned (Stadler & Frensch, 1 998; Reber,
explicit, the conscious and declarative, and
1 967; Reber, 1 993 ). The complex environ-
the unconscious and procedural. If the im-
ments include virtually every facet of human
plicit and the explicit systems ultimately are
life, including language learning, trait knowl-
shown to be based on neuroanatomically dis-
edge, categorization, acculturation, and the
tinct structures (as we suspect will be done),
development of aesthetic preferences. The
it will still be virtually impossible to find
claim we are making is that people extract
functionally pure instantiations of them.
information about the world more often
In addition, Reber (1 992a,b) argued that
than they are aware and that this knowledge
because human consciousness and its ac-
exists in a tacit form, influencing thought
companying functions are late arrivals on the
and behavior while itself remaining mostly
evolutionary scene, there should be partic-
concealed from conscious awareness.
ular patterns of dissociation between these
two systems. The key predictions of the
model for this discussion are: implicit learning in infants
By the second month of life, infants can al-
(a) Storage and retrieval systems that serve
ready distinguish between utterances spo-
the implicit system should be more ro-
ken in their native language and those spo-
bust and relatively undisturbed by in-
ken in foreign languages. Infants can do this
sult and injury that compromise explicit
although they don’t understand what the
functions.
sentences mean in either language. Interest-
(b) There should be relatively little in the ingly, this effect disappears when the sen-
way of developmental and life-span tences are presented backwards (Dehaene-
changes in implicit compared with ex- Lambertz, & Houston, 1 998; Mehler et al.,
plicit functions. This two-system model 1 988; Ramus et al., 1 999). The implica-
has garnered significant support over the tions here are that, despite not understand-
past decade (see Reber, Allen, & Reber, ing the sentences backwards or forwards,
1 999 and Squire & Knowlton, 2000 for the infants have become attuned to the
reviews). natural flow of language. This natural flow
Taken together, this literature paints a is violated when the sentence is reversed.
clear picture. Human memory has distinct This sensitivity to the structure of linguis-
systems with distinct evolutionary histories tic sounds which seems to be the first stage
and separate, although only partly under- of language acquisition, takes place implic-
stood, neurological underpinnings that map, itly and recruits brain regions similar to
on one hand, into conscious, subjective ex- those of adults as shown by fMRI stud-
perience and, on the other, into a nexus of ies (Dehaene-Lambertz, Dehaene, & Hertz-
encoding, storage, and retrieval systems that Pannier, 2002).
function largely independently of awareness Within a surprisingly short time, infants
and consciousness. However, this picture is extract the phonetic regularities of their
still incomplete, and appreciating the man- linguistic surroundings and can differenti-
ner in which it operates in complex human ate between sound sequences that are well
thinking requires a deeper look at the topic formed and those that are not. The back-
of learning – specifically implicit learning in wards sentences sound ill-formed to the in-
which knowledge about the complexities of fants because their sequential structure is
the environment is acquired without benefit discoordinate with the infant’s experience
of consciously controlled processes. and therefore seem as ill-formed as sentences
in a foreign language. Note the similarity of plicit and explicit memorial systems, or var-
this to the standard implicit learning proce- ious patient populations manifest distinct
dure in Artificial Grammar studies discussed patterns of loss of explicit acquisitional func-
below. tions while maintaining those based on the
These kinds of effects are not restricted implicit processes. Included here are stud-
to natural languages. Rovee-Collier and her ies on motor learning (P.J. Reber & Squire,
colleagues (see Rovee-Collier, 1 990 for a re- 1 998), AG learning (Knowlton & Squire,
view) report that infants rapidly pick up the 1 994, 1 996; Reber, 1 967, 1 989), category
relationships between their own motor ac- learning (Knowlton & Squire, 1 993 ; Squire
tions and the impact that they have on the & Knowlton, 1 996), Pavlovian conditioning
external world. Haith, Wentworth, and Can- (Daum & Ackerman, 1 994; Gabrieli et al.,
field (1 993 ) showed that babies make antic- 1 995 ), decision making in social settings
ipatory eye movements to regularities in the (Lewicki, 1 986a; any of several contribu-
spatial patterns of visual displays. Saffran and tions to Uleman & Bargh, 1 989), the sequen-
her colleagues reported that infants as young tial reaction time task (see Hsiao & Reber,
as 8 months show a similar sensitivity to the 1 998 for a review), the hidden covariation
arbitrary statistical nature of auditory pat- task (Lewicki, 1 986b), preference formation
terns and can learn the rules governing arti- (Gordon & Holyoak, 1 983 ; Manza, Zizak,
ficial word segmentation (Saffran, & Aslin, & Reber, 1 998), the production control task
& Newport, 1 996). Interestingly, in Saffran’s (Berry & Broadbent, 1 988), and dot pattern
studies, the infants performed as well as a classification (P.J. Reber, Stark, & Squire,
group of adults, a result that supports Re- 1 998). The various chapters in Stadler and
ber’s (1 992b) prediction that implicit learn- Frensch’s (1 998) edited volume Handbook
ing systems are present at a very early age and of Implicit Learning are a good resource for a
undergo little developmental change. Simi- more detailed discussion.
larly, Gomez and Gerken (1 999, 2000), us- These many reports are supplemented
ing the AG learning procedure, showed that by additional findings that show that pa-
not only do one-year-olds learn the struc- tients with damage to primary visual cor-
tural characteristics of these rather complex tex learn to respond to objects in their blind
systems, they also transfer this knowledge to fields (Weiskrantz, 1 986), prosopagnosiacs
novel stimulus domains. who cannot consciously recognize the faces
To date, this research has been restricted of family members show virtually normal
largely to sensorimotor, perceptual, and cog- implicit facial memory (De Haan, Young, &
nitive tasks. Surprisingly, little empirical Newcombe, 1 991 ), patients with neglect re-
work has been carried out on behaviors that spond to the meaning of stimuli that they are
are more reflective of social learning. How- unaware of processing (Berti & Rizzolatti,
ever, given the existing database, we suspect 1 992), amnesic patients show improvement
that when processes of socialization are ex- in solving problems (Winter et al., 2001 )
amined from this perspective, they will re- and learn to operate complex equipment
veal a parallel set of operations in which in- (Glisky, Schacter, & Tulving, 1 986) despite
fants gradually become inculcated with the no conscious memory of the earlier training
social mores and ethical codes of their cul- phases of the studies. Issues of the mech-
ture without conscious awareness of what anisms underlying disordered thought are
has been learned and with little in the way pursued in detail elsewhere in this volume
of top-down control of the process. by Bachman and Cannon (see Chap. 21 ).
The model that has emerged from this lit-
erature characterizes implicit learning as a
implicit learning in adults mechanism the primary function of which
In recent years, a rather impressive array of is to monitor the environment for reli-
specific tasks have been discovered to have able relationships between events and to
dissociative elements in that either direct encode those patterns of covariation. In
and indirect tests distinguish between im- all likelihood, the underlying neurological
mechanisms are diffuse neural nets that are Abstraction and Implicit Thought
linked to the modality of input of the stimu-
lus display (Ungerleider, 1 995 ). The under- One possibility might be that these un-
lying representations that are established are conscious, perceptual, and motoric repre-
probably not as flexible or abstract as those sentations are not themselves anything like
that are under conscious control simply be- conscious thoughts in terms of their under-
cause the top-down modulation that comes lying form. Thinking consciously about the
with consciousness allows for deliberative world involves forming abstract mental “pic-
shifts in representation and use of knowl- tures.” We can be thinking about a tree and
edge. However, this issue is a highly con- not necessarily be looking at or remember-
tentious one and we have more to say on ing any specific tree. We can know an abstract
it subsequently. rule such as “if A > B and B > C, then A > C”
In addition, implicit acquisitional mech- that can be applied to any set of objects
anisms appear early in life, well before con- that can be ranked along a single dimension
scious awareness has developed. They show like height or weight. This kind of abstract
relatively little change over the life span memorial code feels very natural to us. We
compared with explicit cognitive functions freely think about legal decisions (guilt or in-
(Howard & Howard, 1 992, 1 997) and rel- nocence), geometry (all plane triangles have
atively little in the way of individual-to- 1 80 degrees), artistic expressions, drama, po-
individual variation (Reber & Allen, 2000). etry, aesthetics, politics, and so on. When we
As noted previously in several places, the do, we have an ineffable sense of manipu-
implicit system demonstrates a rather relating abstract and flexible representations,
markable robustness and continues to func- ones that feel loose and unconstrained by
tion effectively in the face of a wide vari- particular settings or features.
ety of neurological and psychiatric disorders Our personal, introspective experiences
that severely compromise functions based with these daily activities are so compelling
on explicit, consciously modulated mecha- that, historically, consciousness was often
nisms. It seems clear that the implicit sys- viewed as though it were the defining fea-
tem is the critical mental component that ture of human thought. The philosophical
enables the infant and child to learn to nav- traditions that have had the strongest in-
igate the world. Virtually all the essential fluence on psychology are those of Locke
knowledge of the perceptual, sensorimotor, and Descartes, and while these two didn’t
linguistic and social patterns, that make up agree on much, the one proposition they
the environment and eventually become the shared was that cognitive states are transpar-
epistemic foundations of adulthood is ac- ent to introspection. If it’s cognitive, it’s con-
quired through this nondeclarative, proce- scious – and by cognitive states they meant
dural mechanism. This is, indeed, how we those that are semantic, flexible in function
learn about the world around us. For further and representationally abstract. In fact, the
explorations of this and related developmen- notion that there is anything truly cogni-
tal mechanisms, see Halford (this volume). tive about any unconscious process – that
Although these aspects of implicit an implicit mechanism could result in ab-
thought are fairly well established, there are stract mental representations is, according to
two issues that remain deeply problemati- this perspective, self-contradictory nonsense
cal and need to be addressed: First, are (or (Dennett, 1 987).
better, perhaps, can) these implicitly formed In the next two sections we explore re-
representations be regarded as abstract? Sec- lated questions like: Is unconscious, tacit
ond, what role might they play in complex knowledge in any way like conscious knowl-
cognitive processes such as problem solving edge in its complexity? Are implicit rep-
that have been generally regarded as largely, resentations flexible? Can they be charac-
if not completely, explicit and under con- terized as abstract? Do they play a role in
scious control? the computations of problem solving? Or
is unconscious knowledge more like low- common words and objects, the AG experi-
level perceptual processes – rigid, inflexible, ments use novel, arbitrary stimulus displays,
and concrete and playing virtually no role in affording the opportunity to examine the
higher-level functions such as problem solv- representational form of knowledge that was
ing and creativity? acquired in a controlled setting.
The original claim (Reber, 1 967, 1 969)
was that the representations established
Perceptual and Conceptual
while memorizing exemplars from an AG
Representations
like that shown in Figure 1 8.1 are based on
Perceptual representation involves capturing the rules of the grammar and, hence, are ab-
surface features of objects without necessar- stract and independent of the surface fea-
ily understanding what the objects are. A tures of the stimuli. This claim did not go
picture taken by a computer scanner is an ex- uncontested. Brooks and Vokey (1 991 ), Du-
ample of a perceptual but not a conceptual lany, Carlson and Dewey (1 984), Perruchet
representation. A scanner can take a picture and Pacteau (1 991 ), and Shanks and St. John
of an object and store it in memory, while (1 994) all argued that subjects’ performance
having no semantics and not representing in these experiments is also consistent with
anything meaningfully. The meaningful rep- representations based on the micro compo-
resentation of objects involves, among other nents of the exemplar strings. That is, a well-
things, the ability to categorize and form formed sequence like PTVPS is not necessar-
mental representations of the categories ab- ily represented as an instance of a complex
stractly. It has been proposed by a variety rule but may be captured by a concrete in-
of researchers that many of the phenom- stantiation. Some (e.g., Perruchet & Pacteau,
ena discussed in this chapter so far such as 1 994; Servan-Schreiber & Anderson, 1 990)
priming, lexical decision making, word frag- argued for an encoding based on small
ment completion, artificial grammar learn- chunks like bi- and trigrams (PT, TV, VPS).
ing, and dot pattern classification are tapping Others (Brooks & Vokey, 1 991 ) argued for
perceptual – not conceptual – processes a more holistic instantiation of the spe-
(e.g., Perruchet & Vinter, 1 998, 2003 ; Shanks cific stimulus input, but eschew the possi-
& St. John, 1 994). From this perspective, the bility that the implicit memorial forms are
unconscious acts as a purely perceptual sys- abstract.
tem capable, in some ways like a scanner or The key studies that speak to this issue
a camera, of capturing the perceptual or au- are those that use a transfer protocol. That
ditory properties of the world. The uncon- is, subjects learn an AG instantiated in one
scious, according to this perspective, is not symbol set but are switched to stimuli made
particularly smart, and does not contain any up using a different symbol set at some point
real representations at least not those that in the experiment. The argument is that, if
are “about” something. subjects’ implicit memorial forms, are based
Exploring these considerations has be- on concrete representations then transfer to
come a virtual cottage industry. Toth and a novel letter set should seriously compro-
Reingold (1 996) present an overview of the mise their ability to function. If the repre-
work using priming, and Kirsner (1 998) sentations are abstract in nature, the subjects
provides a review of the implicit memory should be comfortable with the transfer con-
literature. Both suggest that, although the is- dition.
sues are complex, implicitly encoded mate- In the first of these studies Reber (1 969)
rial shows both abstract and instance-based asked subjects to memorize letter strings
representations. Here we review a topic that from an AG over 1 2 trial blocks. After the
focuses directly on the issue, transfer in AG sixth block, either the letters used to instan-
learning. Unlike the study of implicit mem- tiate the AG were changed, or the AG it-
ory using priming or stem completion in self was changed. Switching letter sets was
which the stimulus materials tend to be surprisingly benign. So long as the rules
that governed letter order were left intact, jects’ decision making could be traced to
subjects were able to work with novel letter an underlying abstract representation based
sets with little difficulty. However, changing upon the rules of the AG and about half was
the rules for letter order disrupted subjects’ shown to be dependent on abstract analog-
ability to encode and store the materials. ical representations that were linked to the
This study was followed up by a par- physical forms of the input stimuli.
allel series of experiments in which sub- Of course, these studies still leave open
jects memorized letter strings from an AG the question of the actual memorial form
and then had to judge how well novel of the representations. As noted previously,
strings instantiated using new letters had representations could be based not on whole
been formed. Subjects learn about the un- items but on more molecular “chunks.”
derlying regularities of an AG by memo- There is evidence both in favor of and
rizing strings like TSSVVPS but then have against this chunking interpretation. On one
to judge the grammaticality of strings like hand, Knowlton and Squire (1 994) repli-
BXXMMRX. Thus, the surface features of cated Brooks and Vokey’s findings, but when
the stimuli differ from learning to testing, they controlled for “chunk strength” (stim-
but the deep structure remains the same. Us- uli had equal numbers of common bigrams)
ing this technique, numerous studies have they found little effect of overall similarity.
found successful transfer (Altmann, Dienes, On the other, simply encoding the chunking
& Goode, 1 995 ; Brooks & Vokey, 1 991 ; characteristics of an AG cannot be all that
Gomez & Schvaneveldt, 1 994; Knowlton & is learned in these experiments. “Chunk-
Squire, 1 996; Manza & Reber, 1 997; Math- trained” subjects who never learn full strings
ews et al., 1 989; Shanks, Johnstone, & Staggs, perform reasonably well on the grammat-
1 997; Vokey & Brooks, 1 992; Whittlesea & icality task (Perruchet & Pacteau, 1 990),
Dorken, 1 993 ). but they do not show transfer (Manza &
Although it is generally agreed that the Reber, 1 997).
transfer effect is real (Perruchet & Vinter, What seems to be emerging from this line
2003 ; Redington & Chater, 1 998), there of research is that there is no “default” rep-
is still no consensus on interpretation. Al- resentational form (Whittlesea & Dorken,
though the effect would seem to implicate 1 993 ). Rather, representational form is dic-
an abstract representational form, Brooks tated by context effects and task demands.
and Vokey (1 991 ; Vokey & Brooks, 1 992) Manza and Reber (1 997) included a con-
have pointed out that transfer could also dition that supports this functionalist posi-
be a product of the physical similarity be- tion. One group memorized letter strings in-
tween the grammatical strings and the trans- stantiated in one letter set. Another group
formed test strings. For example, what learned structurally identical strings, but half
makes the two sequences given above of them were instantiated using a second let-
“similar” is that they both consist of seven ter set. Both groups were tested using strings
letters, they both contain two repeats af- made up of both old and new letter sets.
ter the first letter, and they end with one The second group showed better transfer on
of those repeating letters. They called this a the items instantiated with a novel letter set.
“relational” or “abstract analogy” for the se- Training with the two distinct instantiations
quences. According to this view, subjects are encouraged a more abstract representational
not learning the deep structure of the gram- form that assisted the subjects when they
mar that can be applied to any domain; they confronted yet another surface form.
have learned a specific set of facts about in- Finally, several researchers have demon-
dividual exemplars. strated another hallmark of abstract rep-
Brooks and Vokey tested their theory by resentation – cross-modality transfer. Alt-
controlling for the physical similarity and the mann, Dienes, and Goode (1 995 ), Howard
grammaticality of the test items and found and Ballas (1 982), and Manza and Reber
evidence for both forms of encoding. That is, (1 997) have all shown that subjects can learn
about half the explainable variance in sub- visual sequences and make judgments about
the well-formedness of auditory sequences gically placed in the room, to one of the
and vice versa. Taken together these find- strings and to swing it like a pendulum.
ings suggest that abstraction is an important Maier found that subjects were much more
factor in AG learning. Whether or not this likely to solve this problem after the ex-
conclusion ultimately applies to all forms of perimenter casually brushed against one of
tacit knowledge is yet to be determined. Our the strings, producing the swinging motion.
best guess is that virtually all forms of im- Interestingly, Maier’s subjects did not re-
plicit learning will yield some memorial rep- port having consciously noticed the manip-
resentations that are abstract but that Whit- ulation. Judson, Cofer, and Gelfand (1 95 6)
tlesea and Dorken’s message is likely correct. found similar facilitation if subjects mem-
The degree to which the underlying memo- orized word lists that contained items re-
rial code is abstract or concrete and what its lated to the problem’s solution such as swing,
detailed form will look like is going to have string or pendulum. Recently, Knoblich and
a good deal to do with the processing con- Wartenberg (1 998) reported a similar ef-
straints placed on individuals in particular fect when the priming words were presented
settings. The manner in which representa- subliminally.
tions get established is critical in determin- Most modern approaches to these issues
ing which relational generalizations can be invoke the notion of spreading activation – an
formed from multiple examples (Holyoak, important theoretical mechanism in many
Chap. 6). contemporary models of human cognition.
The notion is that experience registers in
specific cortical areas and “spreads” to other
“nodes” that are associatively linked with
Creativity and Problem Solving the input. Meyer and Schvaneveldt (1 971 )
showed that the encoding process influences
We began our exploration of implicit, un- the subsequent processing of related words
conscious process in human thought with on indirect tests such as the lexical decision
the “simpler” functions of perception and task. For example, subjects presented with
memory. We’ll end with a quick look at the words like “bread” respond more rapidly to
more complex topics of problem solving and “butter” than to “nurse” – although the re-
creativity (see Novick & Bassok, Chap. 1 4; verse applies if the prime is “doctor.” The
Sternberg et al., Chap. 1 5 ). Although there argument is that the initial prime initiates
hasn’t been much recent study of uncon- a spread of activation and related repre-
scious influence on these functions, the no- sentations in the semantic network are af-
tion that tacit knowledge affects the creative fected. The question that interests us here is
process was a central theme in the Gestalt whether such a process can take place un-
approach (Köhler, 1 925 ; Wertheimer, 1 945 ), consciously. Are processes like intuition, in-
which assumed three main elements of un- sight, and creativity facilitated by the activity
conscious thought: intuition, or the feeling engendered in semantically related but tac-
of directionality of the unconscious process; itly represented memories that are not part
incubation, or the tacit processing of in- of our conscious experience?
formation that eventually leads to problem Yaniv and Meyer (1 987) examined this
solving; and insight, the “aha” experience in possibility by looking at the influence of in-
which the implicit processes become con- accessible material on reaction time in a lex-
scious (Kihlstrom 1 999; Dorfman, Shames, ical decision task. Subjects were first read
& Kihlstrom, 1 996). definitions of rare words and asked to pro-
In the now classic test of this model, vide the word and, when they could not, to
Maier (1 93 1 ) asked subjects to tie together rate their feeling of knowing the word. In
two strings hanging from the ceiling. The the subsequent lexical decision task, subjects
strings were too far apart to grab one string showed faster reaction times for words that
while still holding on to the other. One they could not recall than for control words
solution was to tie a small object, strate- and, interestingly, the “feeling of knowing”
was positively related to the reaction times. P: No idea. It was very painful. (Laughs)
A kind of metacognition appears to be op- E: You did get it, right?
erating in this situation in which subjects P: Yeah, but it was basically luck . . . that I
are sensitive to the contents of tacit knowl- got it.
edge even though the actual material is not
E: You had no idea what you were doing?
available for conscious recall. In an extension
of this idea, Shames, Kihlstrom, and Forster P: Not really.
(1 994) presented subjects with a list of three E: Suppose somebody else was going to
words such as: “goat,” ”pass,” and “green” and do the puzzle who had never seen it
asked them to generate an associate that all before and you had to give them some
three have in common. They found that in hints, tell them how to solve it.
cases in which subjects could not provide P: Well, let’s see. I don’t know what
the correct answer to the triad (“mountain”), to say . . . but, I guess (garbled) the
the reaction time to the correct answer was puzzle . . . the good part was that there
faster on a lexical decision task than an unre- usually wasn’t any more than like one
lated word. These experiments, along with or two choices. I think there was
the Gestalt problem-solving studies, suggest one choice was there any more than
that the initial experience of trying to solve one choice? I don’t know. But I had
a problem, retrieve a rare word, or find the (garbled). Which is why I kept ending
common element in a word-triad seems to up back where I started from, which
set in motion a spread of activation pro- was frustrating. I would tell them, I
cess which, intriguingly, is engaged effec- would tell them, good luck. That’s all.
tively even with knowledge that is tacit and
unavailable for conscious recall. In spite of being unaware of the rules, this
Additional recent evidence of uncon- participant’s overall performance was quite
scious influences in problem solving is seen good. In fact, as Reber and Kotovsky re-
in complex problems that do not require ported, “Immediately after giving this fairly
top-down control such as the balls and boxes uninformative description of his process of
puzzle (P. J. Reber & Kotovsky, 1 997). In solving the puzzle he . . . solved the puzzle
these studies subjects sit in front of a com- in 21 moves (the minimum). . . . superior to
puter screen displaying five boxes, each of most of the other participants.”
which is associated with one of five balls. Ini- Recent experiments provide similar find-
tially all the balls sit outside the boxes and ings with regard to creativity. Marsh, Bink,
the goal is to place all the balls inside the and Hicks (1 999) demonstrated the pos-
boxes. The rule for moving balls in or out sible influence of previously encountered
of boxes is as follows: “The rightmost ball events, which are not necessarily consciously
can always move; other balls can be moved remembered, on creative expression and
if the ball immediately to the right is in its thought. Participants were asked to spend
box and all other balls to the right are out a period of twenty minutes drawing space
of their boxes.” The results showed that par- creatures from their imagination. Before be-
ticipants frequently solved the puzzle while ginning they were shown three pictures
being unaware of the rule system that gov- of fictitious space creatures presented as
erned it. The following is a telling conversa- examples of other participants’ drawings.
tion between the experimenter and one of Each of these creatures had fangs, spikes,
the participants immediately following the or weapons, all objects which are oriented
first completion of the puzzle. around one theme – hostility. Participants
were then asked to draw any type of crea-
Experimenter: Now I want to ask you ture that they wanted as long as they did
about the puzzle you just solved, how not copy any aspect of the creatures shown.
it worked, what you did. The results were intriguing and reminiscent
Participant: No idea of the work of Jacoby and his colleagues with
E: No idea? the process-dissociation procedure discussed
earlier. Although subjects were explicitly such intriguing topics as the implicit cog-
told not to include any of the exemplar nitive factors in sensorimotor skills (Weiss,
characteristics, the core concept around Reber, & Owen, in review), various formal
which these characteristics revolved, hos- models of implicit learning (Cleeremans,
tility, could be seen in most of their 1 993 ; Keele, Ivry, et al., 2003 ), the role that
creative work. implicit processes play in aesthetics (Zizak
Importantly, little influence of the actual & Reber, 2004), social intuition (Lieber-
characteristics of the exemplars could be man, 2000), moral judgment (Haidt, 2001 ),
seen in the creative works of the participants. creativity (Polanyi, 1 95 8; Reber, 1 993 ), the
The elements that made the exemplar crea- time course of memory consolidation and
tures hostile (fangs, spikes, and weapons) sleep (Litman & Reber, 2002), the patterns
were virtually never depicted by the partic- of lost and preserved functions in a vari-
ipants in their novel drawings. Rather it was ety of developmental disorders (Don et al.,
the underlying theme, the shared qualities of 2003 ; Smith, 2003 ), the issue of life-span
the exemplars that were influencing the par- changes and individual differences (Reber &
ticipants’ drawings. In a post-experimental Allen, 2000), implicit acquisition of fear and
interview, only 4 of the 1 42 participants de- other emotions (Phelps, 2004) and the influ-
scribed the original three samples as dis- ence of unconscious thought on psycholog-
playing hostility. These effects were not lim- ical well-being (Pyszczynski, Greenberg, &
ited to visual displays. Subjects who initially Solomon, 1 999).
worked with scrambled sentences exhibiting Over the past several decades, it has be-
a mild hostility-related theme produced sim- come increasingly clear that implicit pro-
ilar data. These results are consistent with cesses, those that operate largely outside of
the spreading activation perspective in that the spotlight of consciousness, play a signif-
the creative process is facilitated by previ- icant role in most of the interesting things
ously encountered, and unconsciously de- that human beings do. We can only hope that
tected, themes in one’s environment. the coming decades will produce a better un-
Taken together, these studies suggest that derstanding of these mechanisms, their un-
complex processes such as problem solv- derlying cortical pathways, and the manner
ing or creative invention can be influenced in which they are integrated into the com-
by previously encountered experiences that plex synergistic interplay of the top-down
are not, at the critical time of the task, and the bottom-up that makes up human
consciously available. There are relatively cognitive functioning.
few studies that have looked directly at
this issue and, of course, we are not sug-
gesting that these experiments are process
pure. It is possible that subjects in these
Acknowledgment
situations, to some extent, have been con-
sciously aware of the previously provided Preparation of this chapter was supported by
material (see Jacoby, Lindsay, & Toth, 1 992). NSF Grant No. 01 1 3 025 to Arthur S. Reber.
Nevertheless, the work is provocative and
is coordinate with the converging lines of
evidence cited previously. See Sternberg References
(Chap. 3 1 ) for additional approaches to the
issue of creativity.
Altmann, G., Dienes, Z., & Goode, A. (1 995 ).
On the modality independence of implicitly
learned grammatical knowledge. Journal of Ex-
Cognition, 2 1 , 899–91 2.
Unhappily, we don’t feel as though we have Anderson, J. R., & Lebiere, L. (2003 ). The Newell
presented more than a dollop of the liter- test for a theory of mind. Behavioral and Brain
ature. We never got to discussing work on Sciences, 26(5 ), 5 87–640.
Berry, D., & Broadbent, D. (1 988). Interac- Dehaene-Lambertz, G, & Houston, D. (1 998).
tive tasks and the implicit-explicit distinc- Faster orientation latency toward native lan-
tion. British Journal of Psychology, 79, 25 1 – guage in two-month-old infants. Language and
272. Speech, 41 , 21 –43 .
Berti, A., & Rizzolatti, G. (1 992). Visual process- Dehaene-Lambertz, G., Dehaene, S., & Hertz-
ing without awareness: Evidence from unilat- Pannier, L. (2002). Functional neuroimaging
eral neglect, Journal of Cognitive Neuroscience, of speech perception in infants. Science, 2 98,
4, 45 –3 5 1 . 201 3 –201 5 .
Broadbent, D. E. (1 95 8). Perception and Commu- Dennett, D. C. (1 987). Consciousness. In G. L.
nication. New York: Pergamon Press. Gregory (Ed.), The Oxford Companion to the
Brody, H. (1 989). Unconscious learning of rules: Mind (pp. 1 61 –1 64). New York: Oxford Uni-
Comment on Reber’s analysis of implicit learn- versity Press.
ing. Journal of Experimental Psychology: Gen- Dienes, Z. (1 992). Connectionist and memory ar-
eral, 1 1 8, 23 6–23 8. ray models of artificial grammar learning. Cog-
Brooks, L. R., & Vokey, J. R. (1 991 ). Abstract nitive Science, 1 6, 41 –79.
analogies and abstracted grammars: A Com- Don, A. J., Schellenberg, E. G., Reber, A. S.,
ment on Reber, and Mathews et al. Journal DiGirolamo, D. M., & Wang, P. P. (2003 ).
of Experimental Psychology: General, 1 2 0, 3 1 6– Implicit learning in individuals with Williams
3 23 . Syndrome. Developmental Neuropsychology, 73
Cabeza, R., & Nyberg, L. (2000). Imaging cog- (1 –2), 201 –775 .
nition II: An empirical review of 275 PET and Dorfman, J., Shames, V. A., & Kihlstrom, J. F.
fMRI studies. Journal of Cognitive Neuroscience, (1 996). Intuition, incubation, and insight: Im-
1 2 , 1 –47 plicit cognition in problem solving. In G. Un-
Cherry, E. C. (1 95 3 ). Some experiments on the derwood (Ed.), Implicit Cognition (pp. 25 7–
recognition of speech with one or two ears. 296). Oxford, UK: Oxford University Press.
Journal of the Acoustical Society of America, 2 5 , Draine, S. C., & Greenwald, A. G. (1 998). Repli-
975 –979. cable unconscious semantic priming. Journal
Claparède, E. (1 95 1 ). Recognition and “me-ness” of Experimental Psychology: General, 1 2 7, 286–
(E. Rapaport, Trans.). In D. Rapaport (Ed.), 3 03 .
Organisation and pathology of thought. New Dulany, D. E., Carlson, R., & Dewey, G. (1 984).
York: Columbia University Press. (Original A case of syntactical learning and judgment:
work published 1 91 1 .) How concrete and how abstract? Journal of Ex-
Cleeremans, A. (1 993 ). Mechanisms of implicit perimental Psychology: General, 1 1 3 , 5 41 –5 5 5 .
learning: Connectionist Models of Sequence Pro- Eich, E. (1 984). Memory for unattended events:
cessing. Cambridge, MA: MIT Press. Remembering with and without awareness.
Cohen N. J., & Squire L. R. (1 980). Preserved Memory and Cognition, 1 2 , 1 05 –1 1 1 .
learning and retention of pattern-analyzing skill Elliot, R., & Dolan, J. R. (1 998). Neural re-
in amnesia: Dissociation of knowing how and sponse during preference and memory judg-
knowing that. Science, 2 1 0, 207–21 0. ments for subliminally presented stimuli: A
Corkin, S. (1 968). Acquisition of motor skill functional neuroimaging study. The Journal of
after bilateral medial temporal-lobe excision. Neuroscience, 1 8, 4697–4704.
Neuropsychologia, 6, 225 –264. Erdelyi, M. H. (1 985 ). Psychoanalysis: Freud’s
Daum, I., & Ackermann, H. (1 994). Frontal-type Cognitive Psychology. New York: Freeman Press.
memory impairment associated with thalamic Erdelyi, M. H. (1 986). Experimental indetermi-
damage. International Journal of Neuroscience, nacies in the dissociation paradigm. Behavioral
77, 1 87–1 98. and Brain Sciences, 9, 3 0–3 1 .
De Haan, E. H. F., Young, A. W., & New- Erdelyi, M. H. (2005 ). Subliminal perception and
combe, F. (1 991 ). Covert and overt recognition its cognates: Theory, indeterminacy, and time.
in prosopagnosia. Brain, 1 1 4, 25 75 –25 91 . Consciousness and Cognition, 1 3 (1 ), 73 –91 .
Debner, J. A., & Jacoby, L. L. (1 994). Unconscious Eriksen, C. W., Unconscious Processes (1 95 9): In
perception: attention, awareness, and control. M. R. Jones (Ed.), Nebraska Symposium on Mo-
Journal of Experimental Psychology: Learning, tivation (1 95 8) (Vol. 6, pp 1 69–277). Lincoln:
Memory and Cognition, 2 0, 3 04–3 1 7. University of Nebraska Press.
Gabrieli, J. D. E., Fleischman D. A., Keane Howard, J. H., & Ballas, J. A. (1 982). Acquisi-
M. M., Reminger S. L., Morrell F. (1 995 ). Dou- tion of acoustic pattern categories by exemplar
ble dissociation between memory systems un- observation. Organisational Behaviour and Hu-
derlying explicit and implicit memory in the man Performance, 3 0, 1 5 7–1 73 .
human brain. Psychological Science, 6, 76–82. Hsiao, A., & Reber, A. S. (1 998). The role of at-
Glisky, E. L., Schacter, D. L., & Tulving, E. (1 986). tention in implicit sequence learning: exploring
Computer learning by memory-impaired the limits of the cognitive unconscious. In M.
patients: Acquisition and retention of complex Stadler & P. French (Eds) Handbook of implicit
knowledge. Neuropsychologia, 2 4, 3 1 3 –3 28. learning. Sage.
Glisky, E. L., Schacter, D. L., & Tulving, E. Hsiao, A., & Reber, A. S. (2001 ). The role of atten-
(1 986). Learning and retention of computer re- tion in the serial reaction time task: Fine tuning
lated vocabulary in memory-impaired patients: the timing. Psychonomic Bulletin and Review, 8,
Method of vanishing cues. Journal of Clin- 3 3 6–3 42.
ical and Experimental Neuropsychology, 8, Jacoby, L. L. (1 991 ). A process dissociation
292–3 1 2. framework: Separating automatic from inten-
Gomez, R. L., & Gerken, L. (1 999). Artificial tional uses of memory. Journal of Memory and
grammar learning by 1 -year-olds leads to spe- Language, 3 0, 5 1 3 –41 .
cific and abstract knowledge. Cognition, 70, Jacoby, L. L., Lindsay, D. S., & Toth, J. P. (1 992).
1 09–1 3 5 . Unconscious processes revealed: A question of
Gomez, R. L., & Gerken, L. (2000). Infant artifi- control. American Psychologist, 47, 802–809.
cial language learning and language acquisition. Johnson, W. A., & Wilson, J. (1 980). Perceptual
Trends in Cognitive Sciences, 4, 1 78–1 87. processing of non-targets in an attention task.
Gomez, R. L., & Schvaneveldt, R. W. (1 994). Memory and Cognition, 8, 3 72–3 77.
What is learned from artificial grammars? Judson, A. J., Cofer, C. N., & Gelfand, S. (1 95 6).
Transfer of simple associations. Journal of Exper- Reasoning as an associative process II: “Direc-
imental Psychology: Learning Memory and Cog- tion” in problem solving as a function of rein-
nition, 2 0, 3 96–41 0. forcement of relevant responses. Psychological
Gordon, P. C., & Holyoak, K. J. (1 983 ). Implicit Reports, 2 , 5 01 –5 07.
learning and generalization of the “Mere Ex- Keele, S. W., Ivry, R., & Mayr, U. (2003 ). The
posure” effect. Journal of Personality and Social cognitive and neural architecture of sequence
Psychology, 45 , 492–5 00. representation. Psychological Review, 1 1 0, 3 1 6–
Haith, M. M., Wentworth, N., & Canfield, R. 3 3 9.
L. (1 993 ). The formation of expectations in Kihlstrom, J. F. (1 999). Conscious versus uncon-
early infancy. In C. Rovee-Collier & L. P. scious cognition. In R. J. Sternberg, The Nature
Lipsitt (Eds.), Advances in Infancy Research. of Cognition. Cambridge, MA: MIT Press.
Norwood, NJ: Ablex. Kihlstrom, J. F., Schacter, D. L., Cork, R. C., Hurt,
Haidt, J. (2001 ). The emotional dog and its ration- C. A., & Behr, S. E. (1 990). Implicit and explicit
al tail: A social intuitionist approach to moral memory following surgical anesthesia. Psycho-
judgment. Psychological Review, 1 08, 81 4–83 4. logical Science, 1 , 3 03 –3 06.
Holender, D. (1 986). Semantic activation with- Kirsner, K. (1 998). Implicit memory. In K.
out conscious identification in dichotic listen- Kirsner, C. Speelman, M. Mayberry, A.
ing, parafoveal vision, and visual masking: A O’Brien-Malone, M. Anderson, & C. MacLeod
survey and appraisal. Behavioral and Brain Sci- (Eds.) Implicit and Explicit Mental Processes.
ences, 9, 1 –23 . Mahwah, NJ: Erlbaum.
Howard, D. V., & Howard, J. H., Jr. (1 992). Knoblich, G., & Wartenberg, F. (1 998). Unbe-
Adult age differences in the rate of learning se- merkte Lösungshinweise erleichern Repräsen-
rial patterns: Evidence from direct and indirect tationswechsel beim Problemlösen. (Unno-
tests. Psychology and Aging, 7, 23 2–241 . ticed hints facilitate representational change
Howard, J. H., Jr., & Howard, D. V. (1 997). in problem solving.) Zeitschrift für Psychologie,
Learning and memory. In A. D. Fisk & W. A. 2 06, 207–23 4.
Rogers (Eds.). Handbook of human factors and Knowlton, B. J., Ramus, S. J., & Squire, L. R.
the older adult (pp. 7–26). New York: Academic (1 992). Intact artificial grammar learning in
Press. amnesia: Dissociation of classification learning
and explicit memory for specific instances Psy- Marsh, R. L., Bink, M. L., & Hiks, J. L. (1 999).
chological Science, 3 , 1 72–1 79. Conceptual priming in a generative problem-
Knowlton, B. J., & Squire, L. R. (1 993 ). The learn- solving task. Memory and Cognition, 2 7, 3 5 5 –
ing of categories: Parallel brain systems for item 3 63 .
memory and category knowledge. Science, 2 62 , Mathews, R. C., Buss, R. R., Stanley, W. B.,
1 747–49. Blanchard-Fields, F., Cho, J. R., & Bruhan, B.
Knowlton, B. J., & Squire, L. R. (1 994). The in- (1 989). Role of implicit and explicit processes
formation acquired in artificial grammar learn- of learning from examples: A synergistic ef-
ing. Journal of Experimental Psychology: Learn- fect. Journal of Experimental Psychology: Learn-
ing, Memory and Cognition, 2 0, 79–91 . ing, Memory, and Cognition, 1 5 , 1 083 –1 1 00.
Knowlton, B. J., & Squire, L. R. (1 996). Arti- Mehler, J., Jusczyk, P., Lambertz, G., Halsted, N.,
ficial grammar learning depends on implicit Bertoncini, J., & Amiel-Tison, C. (1 988). A pre-
acquisition of both rule-based and exemplar- cursor of language acquisition in young infants.
specific information. Journal of Experimental Cognition, 2 9, 1 43 –1 78.
Psychology: Learning, Memory, and Cognition, Merikle, P., & Reingold, E. (1 992). Measur-
2 2 , 1 69–1 81 . ing unconscious perceptual processes. In R.
Köhler, W. (1 925 ). The Mentality of Apes. Bornstein and T. Pittman (Eds.), Perception
(E. Winter, trans.). New York: Harcourt, Brace. without awareness: Cognitive, clinical, and social
Kunst-Wilson, W. R., & Zajonc, R. B. (1 980). Af- perspectives. NY: Guilford Press.
fective discrimination of stimuli that cannot be Merikle, P. M., & Daneman, M. (1 996). Memory
recognized. Science, 2 07, 5 5 7–5 5 8. for unconsciously perceived events: Evidence
Lewicki, P. (1 986a). Nonconscious Social Informa- from anesthetized patients. Consciousness and
tion Processing. New York: Academic Press. Cognition, 5 , 5 25 –5 41 .
Lewicki, P. (1 986b). Processing information Meyer, D. E., & Schvaneveldt, R. W. (1 971 ). Fa-
about covariations that cannot be articulated. cilitation in recognizing words: Evidence of a
Journal of Experimental Psychology: Learning, dependence upon retrieval operations. Journal
Memory, and Cognition, 1 2 , 1 3 5 –1 46. of Experimental Psychology, 90, 227–23 4.
Lieberman, M. D. (2000). Intuition: A social Moray, N. (1 95 9). Attention in dichotic listening:
cognitive neuroscience approach. Psychological Affective cues and the influence of instructions.
Bulletin, 1 2 6, 1 09–1 3 7. Quarterly Journal of Experimental Psychology,
1 1 , 5 6–60.
Litman, L., & Reber, A. S. (2002). On the
temporal course of consolidation of implicit Milner B., Corkin, S., & Teuber, H. L., (1 968).
knowledge. Evolution and Cognition, 8, 1 45 – Alteration of perception and memory in man:
155. Reflections on methods. In L. Weiskrantz (Ed.),
Analysis of Behavioral Change. New York:
Maier, N. (1 93 1 ). Reasoning in humans II: The so-
Harper & Row.
lution of a problem and its appearance in con-
sciousness. Journal of Comparative Psychology, Milner, B. (1 962). Les troubles de la memoire
1 2 , 1 81 –1 94. accompagnant des lesions hippocampiques bi-
laterales [Disorders of memory accompany-
Manza, L., & Reber, A. S. (1 997). Represent-
ing bilateral hippocampal lesions], Physiolo-
ing artificial grammars: Transfer across stimu-
giede l’hippocampe. Paris: Centre National de
lus forms and modalities. In D. C. Berry (Ed.),
la Recherche Scientifique.
How implicit is implicit learning? Oxford, UK:
Oxford University Press. Mulligan, N. (1 997). Attention and implicit
memory tests: The effects of varying atten-
Manza, L., Zizak, D., & Reber, A. S. (1 998). Arti-
tional load on conceptual priming. Memory and
ficial grammar learning and the mere exposure
Cognition, 2 5 , 1 1 –1 7.
effect: Emotional preference tasks and the im-
plicit learning process. In M. A. Stadler & P. A. Mulligan, N. (1 998). The role of attention during
Frensch (Eds.), Handbook of implicit learning. encoding in implicit and explicit memory. Jour-
Thousand Oaks, CA: Sage. nal of Experimental Psychology: Learning, Mem-
ory, and Cognition, 2 4, 27–47.
Marcel, A. J. (1 983 ). Conscious and unconscious
perception: Experiments on visual masking Murphy, S. T., & Zajonc, R. B. (1 993 ). Affect, cog-
and word recognition. Cognitive Psychology, 1 5 , nition and awareness: Affective priming with
1 97–23 7. optimal and suboptimal stimulus exposures.
Journal of Personality and Social Psychology, 64, Reber, A. S. (1 989). Implicit learning and tacit
723 –73 9. knowledge. Journal of Experimental Psychology:
Naccache, L., & Dehaene, S. (2001 ). The priming General, 1 1 8, 21 9–23 5
method: Imaging unconscious repetition prim- Reber, A. S. (1 992a). An evolutionary context for
ing reveals an abstract representation of num- the cognitive unconscious. Philosophical Psy-
ber in the parietal lobes. Cerebral Cortex, 1 1 , chology, 5 , 3 3 –5 1 .
966–974. Reber, A, S. (1 992b). The cognitive unconscious:
Naccache, L., Blandin, E., & Dehaene, S. (2002). An evolutionary perspective. Consciousness
Unconscious masked priming depends on tem- and Cognition, 1 , 93 –1 3 3 .
poral attention. Psychological Science, 1 3 , 41 6– Reber, A. (1 993 ). Implicit Learning and Tacit
424. Knowledge. New York: Oxford University
Perruchet, P., & Pacteau, C. (1 990). Synthetic Press.
grammar learning: Implicit rule abstraction Reber, A. S., & Allen, R. (2000) Individual dif-
or explicit fragmentary knowledge? Journal of ferences in implicit learning. In R. G. Kun-
Experimental Psychology: General, 1 1 9, 264– zendorf & B. Wallace (Eds.), Individual differ-
275 . ences in conscious experience. Philadelphia: John
Perruchet, P., & Pacteau, C. (1 991 ). The im- Benjamins.
plicit acquisition of abstract knowledge about Reber, A. S., Allen, R., & Reber, P. J. (1 999).
artificial grammar: Some methodological and Implicit and explicit learning. In R. Steinberg
conceptual issues. Journal of Experimental (Ed.), The Nature of cognition. Cambridge, MA:
Psychology: General, 1 2 0, 1 1 2–1 1 6. MIT Press.
Perruchet, P., & Vinter, A. (1 998). PARSER: A Reber, A. S., & Reber, E. S. (2001 ). Dictionary of
model for word segmentation. Journal of Mem- psychology, (3 rd ed.). London: Penguin Books,
ory and Language, 3 9, 246–263 . Ltd.
Perruchet, P., & Vinter, R. (2003 ). The self- Reber, P. J., & Kotovsky, K. (1 997). Implicit learn-
organizing consciousness. Behavioral and Brain ing in problem solving: The role of work-
Sciences, 25 (3 ), 297–3 88. ing memory capacity. Journal of Experimental
Phelps, E. A. (2004). The human amygdala and Psychology: General, 1 2 6, 1 78–203 .
awareness: Interactions between emotion and Reber, P. J., & Squire, L. R. (1 998). Encapsulation
cognition. In M.S. Gazzaniga (Ed.), The cogni- of implicit and explicit memory in sequence
tive neurosciences, (3 rd ed.). Cambridge, MA: learning. Journal of Cognitive Neuroscience, 1 0,
MIT Press. 248–263 .
Polanyi, M. (1 95 8). Personal Knowledge: Towards Reber, P. J., Stark, C. E. L., & Squire, L. R. (1 998).
a Post-critical Philosophy. Chicago: University Cortical areas supporting category learning
of Chicago Press. identified using functional magnetic resonance
Pratkanis, A. R. (1 992). The cargo-cult science imaging. Proceedings of the National Academy of
of subliminal persuasion. Skeptical Inquirer, 1 6, Sciences, 95 , 747–75 0.
260–272. Redington, M., Chater, N., & Finch, S. (1 998).
Pyszczynski, T., Greenberg, J., & Solomon, Distributional information: A powerful cue for
S. (1 999). A dual process model of de- acquiring syntactic categories. Cognitive Sci-
fense against conscious and unconscious death- ence, 2 2 , 425 –469.
related thoughts: An extension of terror man- Roberts, P. (1 998). Implicit knowledge and
agement theory. Psychological Review, 1 06, connectionism: What is the connection? In
83 5 –845 . K. Kirsner, C. Speelman, M. Mayberry, A.
Ramus, F., Nespor, M., & Mehler, J. (1 999). Cor- O’Brien-Malone, M. Anderson, & C. MacLeod
relates of linguistic rhythm in the speech signal. (Eds.), Implicit and explicit mental processes.
Cognition, 73 , 265 –292. Mahwah, NJ: Erlbaum.
Reber, A. S. (1 967). Implicit learning of artificial Rovee-Collier, C. (1 990). The “memory system”
grammars. Journal of Verbal Learning and Verbal of prelinguistic infants. Annals of the New York
Behavior, 5 , 85 5 –863 . Academy of Sciences, 608, 5 1 7–5 3 6.
Reber, A. S. (1 969). Implicit learning of artificial Saffran, J. R., Aslin, R. N., & Newport, E. L.
grammars. Journal of Verbal Learning and Verbal (1 996). Statistical learning by 8-month-old in-
Behavior, 6, 85 5 –863 . fants. Science, 2 74, 1 926–1 928.
Saffran, J. R., Newport, E. L., & Aslin, R. N. lished doctoral dissertation. City University of
(1 996). Word segmentation: The role of distri- New York, NY.
butional cues. Journal of Memory and Language, Squire, L. R. (1 992). Memory and the hippo-
3 5 , 606–621 . campus: A synthesis from findings with rats,
Schacter, D. L., & Sherry, D. F. (1 987). The evolu- monkeys, and humans. Psychological Review, 2 ,
tion of multiple memory systems. Psychological 1 95 –23 1 .
Review, 94, 43 9–45 4. Squire, L. R. (1 995 ). Biological foundation of ac-
Schacter, D. L., & Graf, P. (1 986). Preserved learn- curacy and inaccuracy in memory. In D. L.
ing in amnesic patients: Perspectives from re- Schacter (Ed.), Memory Distortion: How Minds,
search on direct priming. Journal of Clinical Ex- Brains, and Societies Reconstruct the Past.
perimental Neuropsychology, 8, 727–743 . Cambridge, MA: Harvard University Press.
Scolville, W. B., & Milner, B. (1 95 7). Loss of re- Squire, L. R., & Knowlton, B. J. (1 995 ). Mem-
cent memory after bilateral hippocampal le- ory, hippocampus, and brain systems. In M. S.
sions. Journal of Neurobiology, Neurosurgery and Gazzaniga (Ed.), The Cognitive Neurosciences.
Psychiatry, 2 0, 1 1 –21 . Cambridge, MA: MIT Press.
Schunn, C. D., & Dunbar, K. (1 996). Priming, Squire, L. R., & Knowlton, B. J. (2000). In
analogy, and awareness in complex reasoning. M. S. Gazzaniga (Ed.)., The New Cognitive Neu-
Memory and Cognition, 2 4, 271 –284. rosciences. Cambridge, MA: MIT Press.
Searle, J. (1 992). The rediscovery of the mind. Stadler, M. A., & Frensch, P. A. (Eds.), (1 998),
Cambridge, MA: MIT Press. Handbook of implicit learning. Thousand Oaks,
Seger, C. A. (1 994). Implicit learning. Psychologi- CA: Sage.
cal Bulletin, 1 1 5 , 1 63 –1 96. Toth, J. P., & Reingold, E. M. (1 996). Beyond
Servan-Schreiber, E., & Anderson, J. R. (1 990). perception: Conceptual contributions to un-
Learning artificial grammars with competitive conscious influences of memory. In G. Under-
chunking. Journal of Experimental Psychology: wood (Ed.), Implicit cognition. New York: Ox-
Learning, Memory, and Cognition, 1 6, 5 92– ford University Press.
608. Tulving E., Hayman C. A. G., & MacDonald,
Servan-Schreiber, D., Cleeremans, A., & Mc- C. (1 991 ). Long-lasting perceptual priming
Clelland, J. L. (1 991 ).Graded state machines: and semantic learning in amnesia: A case ex-
The representation of temporal contingencies periment. Journal of Experimental Psychology:
in simple recurrent networks. Machine Learn- Learning Memory and Cognition, 1 7, 5 95 –
ing, 7, 1 61 –1 93 . 61 7.
Shames, V. A., Kihlstrom, J. F., & Forster, K. I. Uleman, J. S., & Bargh, J. A. (Eds). (1 989). Unin-
(1 994). An implicit test of problem-solving. tended Thought. New York: Guilford Press.
Paper presented at the annual meeting of the Ungerleider, L. G. (1 995 ). Functional brain imag-
American Psychological Society, Washington, ing studies of cortical mechanisms for memory.
D.C. Science, 2 70, 769–775 .
Shanks, D. R., & St. John, M. F. (1 994). Char- Vokey, J. R., & Brooks, L. R. (1 992). Salience
acteristics of dissociable human learning sys- of item knowledge in learning artificial gram-
tems. Behavioral and Brain Sciences, 1 7, 3 67– mars. Journal of Experimental Psychology: Learn-
447. ing, Memory, and Cognition, 1 8, 3 28–3 44.
Shanks, D. R., Johnstone, T., & Staggs, L. (1 997). Warrington, E. K., & Weiskrantz, L. (1 968). New
Abstraction processes in artificial grammar method of testing long-term retention with
learning. Quarterly Journal of Experimental Psy- special reference to amnesic patients. Nature,
chology, 5 0, 21 6–25 2. 2 1 7, 972–974.
Silverman, L. H. (1 983 ). The subliminal psy- Warrington, E. K., & Weiskrantz, L. (1 974). The
chodynamic activation method: Overview and effect of prior learning on subsequent retention
comprehensive listing of studies. In J. Masling in amnesic patients. Neuropsychologia, 1 2 , 41 9–
(Ed.), Empirical studies of psychoanalytic theo- 428.
ries. Hillsdale, NJ: Erlbaum. Weinberger, N. M. (1 992). Beyond neuronal
Smith, C. (2003 ). Intact implicit learning in a excitability: Receptive field analysis reveals
population with autistic spectrum disorder – that association specifically modifies the rep-
provided the stimuli are non-social. Unpub- resentation of auditory information. In L. R.
Squire & N. Butters (Eds.), Neuropsychology of determined: An episodic-processing account

memory (2nd ed.). New York: Guilford Press. of implicit learning. Journal of Experimental
Weiskrantz, L. (1 986). Blindsight. Oxford, UK: Psychology: General, 1 2 2 , 227–248.
Oxford University Press. Winter, W., Broman, M., Rose, A., & Reber, A. S.
Weiss, S. M., Reber, A. S., & Owen, D. R. (in (2001 ). Assessment of cognitive procedural
review). The locus of focus: Attentional strate- learning in amnesia: Why the Tower of Hanoi
gies in the acquisition of sensorimotor skills. has fallen down. Brain and Cognition, 45 , 79–
Wertheimer, M. (1 945 ). Productive Thinking. 96.
New York: Harper & Row. Yaniv, I., & Meyer, D. M. (1 987). Activa-
Whalen, P. J., Rauch, S. L., Etcoff, N. L., tion and metacognition of inaccessible stored
McInerney, S. C., Lee, M. B., & Jenike, M. A. information: Potential bases for incubation ef-
(1 998). Masked presentations of emotional fa- fects in problem solving. Journal of Experimen-
cial expressions modulate amygdala activity tal Psychology: Learning, Memory, and Cogni-
without explicit knowledge. Journal of Neuro- tion, 1 3 , 1 87–205 .
science, 1 8, 41 1 –41 8. Zajonc, (1 980). Feeling and thinking: Preferences
Whalen, P. J. (1 998). Fear, vigilance, and need no inferences. American Psychologist, 3 5 ,
ambiguity: Initial neuroimaging studies of the 1 5 1 –1 75 .
human amygdala. Current Directions in Psycho- Zizak, D., & Reber, A. R. (2004). The struc-
logical Science, 7, 1 77–1 88. tural mere exposure effect: dual role of famil-
Whittlesea, B. W. A., & Dorken, M. D. (1 993 ). iarity. Consciouness and Cognition, 1 3 (2 ), 3 3 6–
Incidentally, things in general are particularly 3 62.
Part V
COGNITIVE AND NEURAL

CONSTRAINTS ON HUMAN
THOUGHT

CHAPTER 1 9
Thinking in Working Memory
Robert G. Morrison
Introduction including dual-task experiments, individual

differences, and cognitive neuroscience.
It is not an accident that this discussion of
working memory is positioned near the cen-
ter of a volume on thinking and reasoning. Multiple Memory Systems?
Central to higher-level cognitive processes
is the ability to form and manipulate men- Although the idea of separate primary mem-
tal representations (see Doumas & Hummel, ory is credited to William James (1 890),
Chap. 4). Working memory is the cogni- Waugh and Norman (1 965 ) and Atkinson
tive construct responsible for the mainte- and Shiffrin (1 968) developed the idea
nance and manipulation of information and of distinct primary (i.e., short-term) and
therefore is neccessary for many of the secondary (i.e., long-term) memory com-
types of complex thought described in this ponents into defined models of the hu-
book. Likewise, the development and fail- man memory system. These multicompo-
ures of working memory are critical to un- nent models of memory were supported
derstanding thought changes with develop- by observations from many different stud-
ment (see Halford, Chap. 22) and aging (see ies during the 1 95 0s and 1 960s. Perhaps the
Salthouse, Chap. 24) as well as many types most familiar justification for separate short-
of higher-level cognitive impairments (see term and long-term memory systems is the
Bachman & Cannon, Chap. 21 ). In spite of serial position effect (e.g., Murdock, 1 962).
its obvious importance for thinking and rea- During list learning, the most recently stud-
soning, working memory’s role in complex ied items show an advantage when tested
thought is just beginning to be understood. immediately – an advantage that goes away
In this chapter, we review several dominant quickly with a delay in test provided that
models of working memory, viewing them participants are prevented from rehearsing.
from different methodological perspectives, This recency effect is presumably the result
457
Figure 1 9.1 . Atkinson and Shiffrin’s (1 968) multicomponent memory model.
of quickly unloading short-term memory at which information was stored in long-term

test. In contrast, the first items in the list memory.
show an advantage that withstands a delay
period. This primacy effect presumably oc-
curs because these initial items have been
stored in long-term memory through prac- A Multi-component Working
tice. Conrad (1 964) provided another im- Memory Model
portant finding justifying distinct systems
when he observed that errors in short- While exploring the issues described in the
term remembering were usually phonolog- previous section, Baddeley and Hitch (1 974)
ical whereas long-term memory was dom- proposed a model that expanded short-term
inated by semantic coding. This suggested memory into the modern concept of working
that rehearsal or storage systems were dif- memory – a term that has been used in several
ferent between the two types of memory. different contexts in psychology.1 Baddeley
Yet another important finding was that, al- (1 986) defined working memory as “a system
though the capacity of long-term memory for the temporary holding and manipula-
was seemingly limitless, short-term mem- tion of information during the performance
ory as observed in a simple digit-span task of a range of cognitive tasks such as com-
was of limited capacity (Miller, 1 95 6) – prehension, learning, and reasoning” (Ref. 3 ,
a finding confirmed using many other ex- p. 3 4). In a recent description of his working-
perimental paradigms. Lastly, around this memory model, Baddeley (2000) proposed
same era, neuropsychological evidence be- a four-component model (Figure 1 9.2), in-
gan to emerge suggesting that at least parts cluding the phonological loop, the visuospa-
of the short- and long-term memory systems tial sketchpad, the central executive, and the
were anatomically distinct. Milner’s (1 966) model’s most recent addition, the episodic
famous amnesic patient, HM, with his long-
term memory deficits but preserved short-
term digit span, and Shallice and Warring-
ton’s (1 970) patient, KF, with his intact
long-term memory but grossly impaired
digit span, presented a double dissociation
favoring at least partially distinct short- and
long-term memory systems. Atkinson and
Shiffrin’s (1 968) memory model was typical
of models from the late 1 960s with distinct
sensory, short-term, and long-term memory
stores (Figure 1 9.1 ). Short-term memory was
viewed as a short-term buffer for informa-
tion that was maintained by active rehearsal. Figure 1 9.2 . Baddeley’s (2000) four-component
It was also believed to be the mechanism by working memory model.
thinking in working memory 459
buffer. This model has primarily been con- yes yes yes yes
ceptualized based on results from behav-
H
ioral dual-task paradigms and neuropsychol-
no no
ogy. For instance, using behavioral methods,
Baddeley and Hitch (1 974) reasoned that
they could identify the separable elements no no
of working memory by looking for task in-
terference. If you assume the various compo-
nents of working memory are capacity lim-
ited, then if the simultaneous performance
* yes yes yes yes
Figure 1 9.3. The Brooks (1 968) letter task.
of a secondary task degrades performance Participants are to image a block letter and then
of a primary task, these two tasks must decide whether each corner of the letter is an
tap a common limited resource – partic- outside edge.
ularly if there exists another primary task
that is unaffected by performance of the sec-
ondary task and is affected by a different number vocally). Thus, verbal span is con-
secondary task that does not affect the first strained by both the amount of information
primary task. Likewise, neuropsychological to be maintained and the time that it takes
evidence such as the existence of patients to rehearse it. In contrast to the phonologi-
with selectively disabled verbal (e.g., patient cal loop, the visuospatial sketchpad has been
KF, Shallice & Warrington, 1 970) and visual more difficult to describe. In a dual-task ex-
(e.g., de Renzi & Nichelli, 1 975 ) digit span periment, Baddeley (1 986) asked subjects to
suggested that verbal and visual working- simultaneously perform a pursuit rotor task
memory systems are somewhat separable (i.e., track a spot of light that followed a
as well. circular path with a stylus) while perform-
Using this type of methodology, Baddeley ing either a verbal or spatial memory task
has suggested that the phonological loop and previously developed by Brooks (1 968; Fig-
visuospatial sketchpad are modality-specific ure 1 9.3 ). The verbal task required subjects
slave systems that are responsible for main- to remember a sentence (e.g., “A bird in hand
taining information over short periods of is not in the bush”) and scan through each
time. The phonological loop is responsible word deciding whether it was a noun or not.
for the maintenance and rehearsal of infor- The correct pattern of output for this exam-
mation that can be coded verbally (e.g., the ple would be: no, yes, no, yes, no, no, no, no,
digits in a digit-span task). It is phonemi- yes. In the visual memory task, participants
cally sensitive (e.g., Ted and Fred are harder are first shown a block letter with one corner
to remember than Ted and Bob), and its ca- marked with an asterisk (Figure 1 9.3 ). They
pacity is approximately equal to the amount are then asked to imagine the letter and, be-
of information that can be subvocally cy- ginning at the marked corner, judge whether
cled in approximately 2 seconds. Baddeley each corner is an outside corner or not. Thus,
(1 986) argues that these two characteris- in both the verbal and visual memory tasks,
tics of verbal working memory are best ex- participants are required to hold a modality-
plained by two components: (1 ) a phonologi- specific object in memory and inspect it, an-
cal store that holds all of the information that swering yes or no to questions about their
is currently active and is sensitive to inspection. Baddeley found that the visual
phonemic interference effects and (2) an memory task, but not the verbal memory
articulatory loop that is used to refresh the task, seriously degraded pursuit rotor track-
information via a process of time-limited ing performance.
subvocal cycling. The articulatory loop Logie (1 995 ) has argued for a visual sim-
is specifically disrupted by the common ilarity effect analogous to the phonemic
phonological loop secondary task, articula- similarity effect used to support the phono-
tory suppression (i.e., repeating a word or logical store. Participants were visually
presented strings of upper- and lowercase them every 4 seconds. Participants per-
letters (e.g., “KcPs” or “gBrQ”). Letters were formed this secondary task while simulta-
chosen based on the similarity of their lower neously performing a primary task involv-
and uppercase characters. Thus Kk, Cc, Pp, ing auditory language comprehension. They
Ss were visually similar while Gg, Bb, Rr, Qq found that language comprehension only
were visually dissimilar. To discourage use of suffered at high concurrent memory load
the phonological loop to perform the task, and not under lower memory load condi-
participants performed simultaneous articu- tions. At low memory load, participants had
latory suppression. After a retention period, sufficient resources to carry out the compre-
participants had to write down the letter hension task; however, at high memory load
sequence in correct order and case. Logie there were insufficient resources for lan-
found that participants made significantly guage comprehension. Adding the results of
more errors when the letter cases were vi- this study to many other similar experiments
sually similar. This finding suggests the ex- and the neuropsychological evidence from
istence of a visual store analogous to the patients like KF, Baddeley and Hitch postu-
phonological store in the phonological loop. lated that comprehension and digit span uti-
It is possible that a visual rehearsal loop anal- lizes separate modules of working memory
ogous to the articulatory loop exists; how- that taps a common resource pool.
ever, to date evidence is limited to introspec- Given the amorphous nature of the cog-
tive accounts of mnemonics. What is clear is nitive tasks for which the central executive
that both visual and spatial qualities of stim- was necessary, Baddeley (1 986) initially em-
uli can be stored in the short term; how- braced Norman and Shallice’s (1 980; 1 986)
ever, the independence of systems responsi- concept of a Supervisory Attentional Sys-
ble for visual and spatial memory is the topic tem as a model for the central executive.
of much debate (see Logie, 1 995 ). Norman and Shallice suggested that most
The third component of Baddeley’s work- well-learned cognitive functions operate via
ing memory model, the central executive, schemata, or sets of actions that run auto-
was initially a catch-all for the working- matically. Although many schemata may be
memory-processes necessary for certain cog- shared by most individuals (e.g., driving a
nitive abilities that did not fit cleanly car, dialing a telephone, composing a simple
into the phonological loop or visuospatial sentence, etc.), additional schemata may be
sketchpad. This category included many acquired through the development of spe-
of the cognitive abilities discussed in this cific expertise (e.g., writing lines of com-
book, including reasoning, problem solv- puter code, swinging a golf club, etc.). At
ing, and language. For instance, Shallice and many times during an ordinary day, we must
Warrington’s (1 970) patient KF had a dras- perform more than one of these schemata
tically degraded verbal span (i.e., two let- concurrently (e.g., talking while driving).
ters) with relatively intact language compre- Norman and Shallice suggest that when we
hension. Believing that both of these abili- must perform multiple schemata, their co-
ties required working memory, Baddeley and ordination or prioritization is accomplished
Hitch (1 974) reasoned that verbal span and via the semi-automatic Contention Scheduler
language comprehension must use separate and the strategically controlled Supervisory
working-memory modules. To test this hy- Attentional System. The Contention Sched-
pothesis, they devised a short-term mem- uler uses priorities and environmental cues
ory load task that balanced maintenance load (e.g., a car quickly pulls in front of me),
and time (Baddeley & Hitch, 1 974). For in- whereas the Supervisory Attentional System
stance, a low-load condition might require tends to follow larger goals (e.g., convinc-
participants to remember three numbers, ing my wife that I’m a good driver). Thus,
outputting them every 2 seconds, while a when the car rapidly pulled in front of me,
high-load condition might require partici- I pressed the brake on the car and then pro-
pants to remember six numbers, outputting ceeded to tell my wife how attentive I am
while on the road. One important charac- executive fractionates the central executive
teristic of the Supervisory Attentional Sys- in the hope that by understanding precisely
tem as a model of the central executive was what the central executive does we might
that it was sensitive to capacity limits. Ac- learn how it does it. Baddeley (1 996) sug-
cording to Norman and Shallice, capacity gested four arguably distinct central exec-
limits constrain thinking and action during utive functions: “(1 ) the capacity to coor-
(1 ) complex cognitive processes such as rea- dinate performance on two separate tasks,
soning or decision making; (2) novel tasks (2) the capacity to switch retrieval strategies
that have not developed schemata; (3 ) life- as reflected in random generation, (3 ) the
threatening or single, difficult tasks; and (4) capacity to attend selectively to one stimu-
functions that require the suppression of lus and inhibit the disrupting effect of oth-
habitual responses. ers, and (4) the capacity to hold and ma-
Baddeley (1 986) suggested that the Su- nipulate information in long-term memory,
pervisory Attentional System provided a as reflected in measures of working memory
useful framework for understanding random span” (Ref. 4, p. 5 ). Thus, Baddeley argued
generation, a task frequently associated with that the central executive is important for
the central executive. In random genera- task switching, inhibition of internal repre-
tion, a participant is asked to generate a sentations or prepotent responses, and the
series of random responses from a prede- activation of information in long-term mem-
termined list (e.g., integers from 0 to 9, ory during an activity that requires the active
for instance: 1 ,8,4,6,0,7,6, 8,4,5 ,6,1 ,2). Re- manipulation of material. In comparison to
sponse patterns from this task usually exhibit the slave systems, relatively little attention
two characteristics: (1 ) certain responses ap- has been paid to the central executive utiliz-
pear at much lower frequencies than oth- ing dual-task methodologies.
ers (e.g, 3 or 9 did not appear whereas The last and most recently added compo-
1 ,4,6, and 8 appeared repeatedly) and (2) nent of Baddeley’s working-memory model
stereotyped responses (e.g., 4,5 ,6 or 1 ,2) is the episodic buffer. One problem encoun-
are much more common than other equally tered by a modal working-memory model is
likely two- or three-number sequences (Bad- the need for integration. How can a com-
deley, 1 966). Baddeley suggested that the plex problem requiring the integration of
higher-order goal of randomness is at odds information across modalities be solved if
with the dominant schemata for the pro- all the information is being held in sepa-
duction of numbers (i.e., counting). Thus, rate distinct buffers? This binding problem,
random generation potently requires the ser- whether it is binding information within
vices of the Supervisory Attentional Sys- a modality or across modalities, is one of
tem to override or inhibit the dominant the central challenges for a working-memory
schemata. When random number genera- system capable of high-level cognition (see
tion is performed with another working- Doumas & Hummel, Chap. 4). To address
memory–intensive task, the resources avail- this issue, Baddeley (2000) has proposed
able to the Supervisory Attentional System a third type of buffer that uses a multidi-
(i.e., central executive) are in even more de- mensional code. Thus, this buffer can main-
mand and responses become more stereo- tain information from several modalities that
typed (Baddeley et al., 1 998). has been bound together by the central ex-
Although the Supervisory Attentional ecutive. Fuster, Bodner, and Kroger (2000)
System describes an important ability that have found evidence of the existence of neu-
underlies complex cognitive processes such rons in prefrontal cortex that seem to be re-
as language comprehension and problem sponsible for this type of function. Another
solving, it fails to offer a tenable account important function of the episodic buffer
of how, short of a homunculus, this direc- is serving as a scratchpad for the develop-
tion would occur. Acknowledging this prob- ment of new mental representations during
lem, Baddeley’s current model of the central complex problem solving. There are many
examples of situations requiring the func- information. As previously discussed, Bad-

tions ascribed to the episodic buffer, but the deley hypothesizes modality-specific buffers
methods for studying such a resource utiliz- for the short-term storage of information
ing the task-interference paradigm are still that coordinate with the Episodic Buffer,
under development. which is responsible for storing integrated in-
formation. In contrast, Cowan suggests that
information is maintained in working mem-
ory simply by activating its representations
Embedded-Processes in long-term memory via short-term – spe-
Working-Memory Model cific neurons in the prefrontal or parietal cor-
tices. This latter view suggests that informa-
Although Baddeley’s multi-component tion from different modalities will behave
working-memory model has dominated differently to the extent that they are coded
the field for much of the past thirty years, differently in long-term memory, a view
there are alternative conceptions of work- somewhat at odds with findings of phonolog-
ing memory. Cowan (1 988, 1 995 ) has ical errors in short-term memory tasks and
proposed a model that tightly integrates semantic errors in long-term memory tasks.
short- and long-term memory systems Cowan counters this objection by noting
with attention. In his Embedded-Processes that different codes are used in the storage of
working-memory model (Figure 1 9.4), information in long-term memory and, de-
Cowan defines working memory as the set pending on the nature of the task, different
of cognitive processes that keep mental codes are likely to be more important. Like-
representations in an easily accessible state. wise, Baddeley has argued that short-term
Within this system, information can either and long-term memory systems are distinct
be within the focus of attention, which based on neuropsychological evidence sug-
Cowan believes is capacity limited, or gesting that short-term and long-term sys-
in active memory, which Cowan suggests tems can be dissociated and therefore must
is time limited. The focus of attention is be distinct systems. This argument, how-
similar to James’s (1 890) concept of primary ever, relies to some extent on the belief that
memory and is equated to the information the individual short- and long-term systems
that is currently in conscious awareness. In are anatomically unitary, an assumption that
contrast, active memory, a concept similar seems unlikely given recent evidence from
to Hebb’s (1 949) cell assemblies or Ericsson cognitive neuroscience. Fuster has argued,
and Kintsch’s (1 995 ) long-term working based on results from single-cell recording
memory, refers to information that has in nonhuman primates, that neurons in pre-
higher activation either from recently being frontal cortex are responsible for maintain-
in the focus of attention or through some ing information in working memory (Fuster
type of automatic activation (e.g., priming). & Alexander 1 971 ); however, disrupting cir-
In the Embedded-Processes model, a central cuits between this area and more poste-
executive, somewhat similar to Norman rior or inferior regions associated with long-
and Shallice’s (1 980, 1 986) Supervisory term storage of information can also result
Attention System, is responsible for bring- in working-memory deficits (Fuster, 1 997).
ing information into the focus of attention Recent evidence from electrophysiology in
while an automatic recruitment of attention humans seems to confirm that areas in pre-
mechanism can bring information into frontal cortex and areas associated with long-
active memory without previously having term storage of information are temporally
been in the focus of attention. coactive during working-memory tasks (see
A critical distinction between Cowan’s Ruchkin et al., 2003 , for a review).
Embedded-Processes model and Baddeley’s A second important distinction be-
multi-component model is how the two tween Baddeley’s multi-component
models deal with the topic of maintenance of working-memory model and Cowan’s
Figure 1 9.4. Simplified diagram of Cowan’s (1 988) Embedded-Processes Model.
Embedded-Processs model is modality memory slave systems, relatively little has

specificity. Specifically, Baddeley has pro- been done using this technique to study
posed independent modules within working high-level cognition or the central executive.
memory for maintaining information from Central to high-level cognitive processes is
different modalities (e.g., visual or verbal). the ability to form and manipulate men-
In contrast, Cowan suggests only a domain- tal representations. Review of the functions
general central executive that, in turn, can of the central executive in either Baddeley
activate networks for various modalities of or Cowan’s models suggests that the cen-
information stored in long-term memory. tral executive should be critical for thinking
Baddeley also proposes a domain-general and reasoning – a hypothesis that has been
central executive, so the main distinction confirmed in several studies. In their semi-
between the models is whether information nal work on working memory Baddeley and
to be maintained in working memory is Hitch (1 974) asked participants to perform
loaded into domain-specific buffers or a reasoning task in which they read a sim-
whether it is simply activated in long-term ple sentence containing information about
memory. From our earlier discussion, there the order of two abstract terms (i.e., A and
seems to be no doubt that it is easier to B). Their task was to judge whether a let-
maintain a certain quantity of information ter sequence presented after the sentence
across several modalities than to maintain reflected the order of the terms in the state-
the same amount of information within just ment. For instance, a TRUE statement would
a single modality. Although this observation be “A not preceded by B” followed by AB
does not necessitate independent buffers, it (Ref. 7, p. 5 0). Baddeley and Hitch varied the
does suggest that capacity limitations may statements with respect to statement voicing
be somewhat domain-specific. (i.e., active or passive), negation, and verb
type (i.e., precedes or follows). They found
that low concurrent memory loads (i.e., one
to two items to remember) had no effect on
Reasoning and Working Memory: reasoning accuracy or response time; how-
Using the Task-Interference Paradigm ever, high concurrent memory load (i.e., six
items to remember) had a reliable effect on
Although the task-interference paradigm response time. Depending on the empha-
has been very useful in exploring working sis of the instructions used, they found that
the decrement in performance was either had any effect on either reasoning accuracy
in the reasoning task or the memory task. or latency. Of the secondary tasks they used,
There was no statistical interaction between only a high concurrent memory load (i.e., six
concurrent memory and the reasoning task digits) affected reasoning performance, and
difficulty. this effect appeared to be limited to difficult
Several other researchers have investi- syllogisms.
gated how working memory is important for Klauer, Stegmaier, and Meiser (1 997) had
deductive reasoning. Gilhooly et al. (1 993 ), participants perform syllogisms and spatial
utilizing methods similar to Baddeley and reasoning tasks that involved transitive in-
Hitch, asked participants to perform ver- ference (see Halford, Chap. 22, for a de-
bal syllogisms (Evans, Chap. 8, for a description of transitive inference tasks). The
scription of syllogistic reasoning) of varying spatial reasoning problems varied in com-
levels of complexity. In a first experiment, plexity from simple transitive inference (e.g.,
participants either viewed the premises of “The circle is to the right of the trian-
the syllogisms visually, all at once, or heard gle. The square is to the left of the tri-
the premises read one at a time. Gilhooly angle.” See Ref. 44, p. 1 3 ) to more com-
et al. hypothesized that verbal presentation plicated transitive inference problems that
would result in a higher working-memory required greater degrees of relational inte-
load because participants would have to gration. Klauer et al. had participants per-
maintain the content of the premises before form a visual tracking task (i.e., follow
they were able to solve the problem. They one object on a screen filled with distrac-
found this result: Participants made more er- tor objects) while listening to the premises
rors in the verbal condition than in the vi- of the reasoning problems. They found
sual condition. An error analysis indicated that this visuospatial secondary task inter-
that the errors made were the result of not fered with spatial reasoning but had little
remembering the premises correctly, not er- effect on syllogism performance. In another
rors made in the process of integration of in- experiment, Klauer et al. presented syllo-
formation between premises. In a second ex- gisms or spatial reasoning problems either
periment, they had participants perform the auditorally (as in the previous experiment)
syllogism task visually while performing one or visually on a computer screen. While
of three different secondary tasks. They performing these primary tasks, participants
found that only random number generation performed random generation either ver-
interfered with performance of syllogisms. bally or spatially, by pressing keys in a ran-
Gilhooly et al. concluded that the central dom pattern. They found that both forms of
executive is critical for relational reason- random generation affected both syllogism
ing and the phonological loop (as interfered and spatial reasoning performance; however,
with by articulatory suppression) may be spatial random generation caused somewhat
involved to a lesser extent. They also con- less interference than verbal random gener-
cluded that the visuospatial sketchpad, as ation – a finding consistent with Baddeley et
interfered with by spatial tapping (i.e., tap- al.’s (1 998) extensive study of random gen-
ping a fixed pattern with the fingers), was not eration. In their final experiment, Klauer et
important for performing verbal syllogisms al. found that articulatory suppression (i.e.,
and thus argued against models of reason- counting repeatedly from 1 to 5 ) had a mild
ing that are at least in principle dependent effect on syllogism and spatial reasoning la-
on involvement of visual working memory tencies. Overall, Klauer et al. found evidence
(e.g., Kirby & Kosslyn, 1 992; Johnson-Laird, for involvement of the central executive (as
1 983 ). In a similar study, Toms, Morris, and interfered with by random generation) and
Ward (1 993 ) found no evidence that a vari- somewhat less interference by slave system
ety of secondary tasks loading on either the tasks consistent with the modality of the rea-
phonological loop or visuospatial sketchpad soning task.
Unlike the examples of deductive and explanation of these results is that analogical
spatial reasoning we discussed previously, reasoning is more resource demanding than
analogical reasoning frequently requires the the deductive and spatial reasoning tasks
extensive retrieval of semantic information previously discussed, and thus even the slave
in addition to the relational processing char- system tasks cause significant interference.
acteristic of all types of reasoning (see Another possibility is that analogical reason-
Holyoak, Chap. 6, for a detailed discussion ing places greater demands specifically on
of analogical reasoning). Waltz et al. (2000) the modality-specific slave systems of work-
had participants perform an analogical rea- ing memory than other forms of relational
soning task while performing one of several reasoning. To investigate this issue, Morri-
secondary tasks. In the analogical reasoning son, Holyoak, and Truong (2001 ) had par-
task (adapted from Markman & Gentner, ticipants perform either a verbal or visual
1 993 ), participants studied pairs of pictures analogy task, while performing articulatory
of scenes with multiple objects (see Figure suppression (i.e., saying the nonword “zorn”
6.3 in Holyoak, Chap. 6). For instance, one once a second), spatial tapping (i.e., touch-
problem showed a boy trying to walk a dog ing one of four red dots each second in a
in one picture while the companion picture predetermined pattern), or verbal random
showed a dog failing to be restrained by a number generation. In the verbal analogy
leash tied to a tree. Participants were asked to task, participants verified verbal analogies,
study each picture and pick one object in the such as BLACK:WHITE::NOISY:QUIET,
second picture that “goes with” a target ob- answering either TRUE or FALSE via a floor
ject in the first picture. In the example prob- pedal. In the visual analogy task, participants
lem in Figure 6.3 , the man in the first picture performed Sternberg’s (1 977) People Pieces
is a featural match to the boy in the second analogy task. In this task, participants ver-
picture while using an analogy the boy is a ify whether the relational pattern of charac-
relational match to the tree in the second teristics between two cartoon characters is
picture. Participants were simply asked to se- the same or different than between a sec-
lect one object; they were thus free to com- ond pair of characters. Morrison, Holyoak,
plete the task based on either featural sim- and Truong found that, for verbal analogies,
ilarity or make an analogical mapping and articulatory suppression and verbal random
inference, answering based on relational sim- number generation resulted in an increase in
ilarity. Waltz et al. found that participants analogy error rate, whereas only verbal ran-
who maintained a concurrent memory load dom number generation increased analogy
or performed verbal random number gener- response time for correct responses. Spatial
ation or articulatory suppression (i.e., saying tapping had no reliable effect on verbal anal-
the word “the” once each second) gave fewer ogy performance. In contrast, for visual anal-
relational responses than a control group not ogy, both spatial tapping and verbal random
performing a dual task. In a recent extension number generation resulted in more analogy
with this task, my lab replicated Waltz et al.’s errors, whereas only random generation in-
articulatory suppression finding (i.e., saying creased analogy response time. Articulatory
the English nonword “zorn” once each sec- suppression had no reliable effect on visual
ond) and also found a similar effect for a vi- analogy performance. Thus, there seems to
suospatial working-memory dual task (man- be a modality-specific role for working mem-
ually tapping a simple spatial pattern). ory in analogical reasoning.
In the previous studies, the extent of in- In summary, all of the reasoning tasks
terference with the analogy task was similar described in the previous section are inter-
for both central executive (concurrent mem- fered with by dual tasks considered to tap
ory load and verbal random number gen- the central executive (e.g., random number
eration) and slave system (articulatory sup- generation or concurrent memory load).
pression and spatial tapping) dual tasks. One The deductive reasoning tasks reported
require the manipulation and the alignment later recall the last word of each sentence in
of premises that are provided in the prob- the correct order. The participant’s span is
lem. In addition to these operations, analog- typically defined as the maximum-sized trial
ical reasoning may require the reasoner to with perfect performance. This measure cor-
retrieve information from semantic memory related relatively well with individuals’ read-
(e.g., the relations that bind the terms in the ing comprehension ability. Unlike a simple
analogy) and then map the resulting rela- short-term memory-span task, the working-
tional statements (and in some cases make memory–span task required the subjects to
an inference that requires retrieving a term do a more complex task while also remem-
that completes the analogy). bering a list of items. In this way, the span
To evaluate the extent that working- task is believed to tap both the mainte-
memory resources are necessary for seman- nance (slave system) and manipulation (cen-
tic memory retrieval and relational binding, tral executive and episodic buffer) aspects
my lab went on to examine the compo- of working memory. Other span tasks have
nent processes in working memory is neces- been developed to vary the nature of the
sary for analogical reasoning. We wondered task that participants perform and what
whether working memory is necessary for they maintain. For example, Turner and En-
the simple process of relational binding or gle (1 989) asked participants to solve sim-
only becomes necessary when multiple rela- ple arithmetic problems and then remem-
tions need to be maintained and compared ber a word presented at the end of each
during the analogical mapping process. To problem. In the n-back task (Figure 1 9.5 ;
address this question, we used the stim- Smith & Jonides, 1 997 for a complete de-
uli from the verbal analogy task but simply scription), the manipulation task is changed
asked participants to verify relational state- to having to continuously update the set
ments instead of comparing two of them as of items. Using this approach, researchers
in the analogy task. Thus, participants would have found working-memory capacity to be
respond TRUE to a statement like “black is an important predictor of performance in a
the opposite of white” and FALSE to the broad range of higher cognitive tasks, includ-
statement “noisy is the opposite of nois- ing reading comprehension (Daneman &
ier.” As in the verbal analogy task, articula- Carpenter, 1 980), language comprehension
tory suppression and verbal random number (Just & Carpenter, 1 992), following direc-
generation affected performance with spa- tions (Engle, Carullo, & Collins, 1 991 ), rea-
tial tapping also having a smaller, but reli- soning (Carpenter, Just, & Shell, 1 990; Kyllo-
able effect. Thus, relational binding, not just nen & Christal, 1 990), and memory retrieval
maintenance and mapping, require use of (Conway & Engel, 1 994).
the working-memory system, including the Researchers using working-memory-span
modality-specific slave systems. measures typically measure participants’
working-memory span using one or more
measures and then use this to predict per-
formance on another task. A high correlation
Individual Differences in suggests that working memory is an impor-
Working Memory tant target for the task. More sophisticated
studies collect a variety of other measures of
An alternative to Baddeley’s dual-task me- information processing ability (e.g., process-
thodology uses individual differences to ing speed or short-term memory span) and
study working memory. Daneman and Car- use either multiple regression or structural
penter (1 980) first used this approach to equation modeling to determine whether
investigate how working memory was in- these various abilities are separable with
volved in language comprehension. They de- respect to the target task. Engle and his
veloped a reading span task that required collaborators (Engle, Kane, & Tuholski,
subjects to read several sentences and then 1 999; Kane & Engle, 2003 b; see also
Figure 1 9.5. The n-back task. Participants see a stream of letters, numbers, or
symbols and have to continuously answer whether the current item was the
same as the item presented “n-back” in the stream. This task requires
maintenance of the current in-set item and continuous updating of this set – an
ability considered to be manipulation of the set.
Salthouse, Chap. 24) have used this ap- are omnipresent in high-level cognition.
proach to argue that, although working- They are also the types of cognitive abil-
memory-span and short-term-memory-span ities necessary to perform traditional tests
tasks share much variance, it is working- of fluid or analytical intelligence such as
memory capacity that best predicts higher the Ravens Progressive Matricies (1 93 8),
cognitive performance as measured by tasks leading researchers to hypothesize that
such as the Ravens Progressive Matrices (see working-memory capacity is the critical
Figure 1 9.6). factor that determines analytical intelli-
Kane and Engle believe that the ability gence (see Kane & Engle, 2003 a; Sternberg,
measured by a working-memory-span task Chap. 3 1 ).
once simple maintenance is stripped away is
best described as controlled attention. They
have argued that working-memory capacity
is a good predictor of task performance in The Where, What, and How
tasks that (a) require maintenance of task of Working Memory and Thought
goals, (b) require scheduling competing ac-
tions or responses, (c) involve response com- So far, we have suggested that there are
petition or (d) involve inhibiting informa- at least two important aspects of working
tion irrelevant to the task (Engle, Kane, & memory for human thinking – a modality-
Tuholski, 1 999). This list is very similar to specific maintenance function that is ca-
the functions that Baddeley (1 996) attribu- pable of preserving information over short
tes to the central executive. Obviously, these periods of time and a manipulation or at-
are the types of cognitive processes that tentional control function that is capable
Figure 1 9.6. Structural Equation Model of the relationship of working memory and short-term
memory and their role in analytic problem solving and intelligence. From Engle, Kane, and Tuholski
(1 999).
of activating, operating, and updating this It is the belief of many of the authors in
information during conscious thought. Re- this volume that high-level cognition is in-
cently, cognitive neuroscientists have de- trinsically relational in nature, a position long
voted much effort to answering the question argued by many scientists (see Fodor and
of where in the brain these working mem- Pylyshyn, 1 988; Spearman, 1 923 ). In this
ory mechanisms operate. This topic is be- account, one critical function for working
yond the scope of this chapter [see Goel, memory to accomplish is the flexible binding
Chap. 20, for a more detailed treatment of of information stored in long-term memory.
the cognitive neuroscience of problem solv- Working memory must also be able to nest
ing and Chein, Ravizza, and Fiez (2003 ) for relations to allow more complex knowledge
a recent appraisal of the ability of Baddeley structures to be used. Halford (Chap. 22) has
and Cowan’s models to account for recent referred to this factor as relational complexity.
neuroimaging findings]; however, we know As the relational complexity of a particular
that at least several areas of the prefrontal problem increases, so do the demands placed
and parietal corticies are critical for these on working memory. Goals are a particu-
functions. Although these areas may be spe- lar subclass of relations that are especially
cific to working memory, there is mount- important in deductive reasoning (see Goel,
ing evidence from both electrophysiology Chap. 20). Maintaining the complex goal hi-
and functional magnetic resonance imaging erarchies (high relational complexity) nec-
(fMRI) that working memory is the result essary for solving complex problems such as
of activation of networks involving many those encountered in chess or in tasks such
brain regions.2 A more interesting question as the Ravens Progressive Matrices or the
than where, is how working memory op- Tower of Hanoi makes great demands on
erates thinking. Unfortunately, much less the working memory system (see Lovett &
attention has been given to this question; Anderson, Chap. 1 7; Carpenter, Just, &
however, several of the computational ap- Shell, 1 990; Newman et al., 2003 ). Most
proaches outlined in this book begin to ad- work directed at understanding how the
dress this topic.3 brain implements working memory has
focused on relatively simple tasks in which NOISY and QUIET, the correct relational
processing of relations is minimal. response, QUIET, may initially be less active
The ways in which the brain’s distributed because of spreading activation in memory
architecture is used to process problems than the distractor item, NOISIER. Thus,
that require relation flexibility and relational during reasoning, it may be necessary to in-
complexity have just begun to be explored hibit information that is highly related but
(see Christoff & Gabrieli, 2000; Christoff et inconsistent with the current goal (Morri-
al., 2001 ; Morrison et al., 2004; Prabhakaran son et al., 2004). This function of work-
et al., 2000; Waltz et al., 1 999). Hum- ing memory has also been ascribed to the
mel and Holyoak’s (1 997, 2003 ; see also prefrontal cortex (see Kane & Engle, 2003 b
Doumas & Hummel, Chap. 4) LISA model and 2003 a; Miller & Cohen, 2001 ; and Shi-
solves the binding problem created by the mamura, 2000, for reviews). Many complex
need for the flexible use of information in executive tasks associated with frontal lobe
a distributed architecture. The LISA model functioning (e.g., Tower of Hanoi or Lon-
dynamically binds roles to their fillers in don, Analogical Reasoning, Wisconsin Card
working memory by temporal synchrony of Sorting) have important inhibitory compo-
firing. This allows the distributed informa- nents [Miyake et al., 2000; Morrison et al.,
tion in long-term memory to be flexibly 2004; Viskontas et al. (in press), and Welsh,
bound in different relations and for the sys- Satterlee-Cartmell, & Stine, 1 999]. Shima-
tem to appreciate that the various entities mura (2000) suggested that the role of pre-
can serve different functions in different re- frontal cortex is to filter information dynam-
lations and relational hierarchies. It is possi- ically – a process that requires the use of
ble that one role of the prefrontal cortex is to both activation and inhibition to keep infor-
control this synchrony process by firing the mation in working memory relevant to the
distributed network of neurons representing current goal. Miller and Cohen (2001 ) ar-
the actual fillers in long-term memory (see gued that “the ability to select a weaker, task-
Doumas & Hummel Chap. 4, and Morrison relevant response (or source of information)
et al., 2004, for a more detailed account of in the face of competition from an otherwise
this approach). Although no direct evidence stronger, but task-irrelevant one [is one of
exists for synchrony of binding in high-level the most] fundamental aspects of cognitive
relational systems, several studies in animals control and goal-directed behavior” (Ref. 48,
(e.g., Gray et al., 1 989) and in humans (e.g., p. 1 70) and is a property of prefrontal cortex.
Müller et al., 1 996; Ruchkin et al., 2003 ) More generally, many researchers believed
suggest that synchrony may be an important that inhibition is an important mechanism
mechanism for other cognitive processes im- for complex cognition (see Dagenbach &
plemented in the brain. This type of sys- Carr, 1 994; Dempster & Brainerd, 1 995 ; and
tem is also consistent with Baddeley’s (2000) Kane & Engle, 2003 a, for reviews) and that
concept of an episodic buffer that binds in- changes in inhibitory control may explain
formation together in working memory. important developmental trends (Bjorklund
Implicit in a working-memory system ca- & Harnishfeger, 1 990; Hasher & Zacks, 1 988;
pable of handling relations is not only the Diamond, 1 990) and individual differences
ability to precisely activate information in (Dempster, 1 991 ; Kane & Engle, 2003 a,
long-term memory but also the ability to 2003 b) in complex cognition.
deactivate or inhibit it. Consider the simple
analogy problem:
BLACK:WHITE::NOISY: ? (1 ) QUIET
(2) NOISIER Working memory is a set of central processes
If the semantic association between NOISY that makes conscious thought possible. It
and NOISIER is stronger than that between flexibly provides for the maintenance and
manipulation of information through both Service Research Award (MH-064244) from

activation and inhibition of information re- the National Institute of Mental Health. Ap-
trieved from long-term memory and newly preciation is due to Marsha Lovett and Keith
accessed from perception. Relations are crit- Holyoak for comments on an earlier draft of
ical to thought and the working-memory sys- this chapter.
tem therefore must provide for the flexible
binding of information. It also allows the
problem solver to maintain goals that allow Notes
successful navigation of single problems but
also allows for integration of various parts of 1 . The term “working memory” was originally
larger problems. Working-memory capacity used to describe rat behavior during radial arm
is limited, and this is an important individ- maze learning [see Olton (1 979) for a de-
ual difference that affects and perhaps even scription of this literature). It was also used
determines analytic intelligence. We know by Newell and Simon (1 972)] to describe the
that working memory is critically dependent component of their computational models that
on prefrontal cortex functioning, but likely holds productions – that is, operations that
involves the successful activation and inhi- the model must perform (see also Lovett and
Anderson, Chap. 1 7).
bition of large networks in the brain. Main-
tainence of information in working mem- 2. Fuster (1 997) has long argued for this approach
to working memory based on electrophysiolog-
ory tends to be somewhat modality specific;
ical and cortical cooling data from nonhuman
however, attentional resources typically as- primates. In Fuster’s model, neurons in pre-
cribed to a central executive tend to be frontal cortex drive neurons in more posterior
more modality independent and allow for brain regions that code for the information to
the connection of information from differ- be activated in long-term memory. This per-
ent modalities. spective is also consistent with Cowan’s (1 988)
The future of working memory research Embedded-Processes model. See also Chein,
resides in better understanding how these Ravizza, and Fiez, 2003 .
processes operate in the brain. Computa- 3 . Both ACT (Lovett and Anderson, Chap. 1 7)
tional approaches allow researchers to make and LISA (Doumas and Hummel, Chap. 4)
precise statements about functional pro- provide accounts of how working memory may
cesses necessary for a working-memory sys- be involved in higher-level cognition. These
tem to perform thinking and can provide theories and computational implementations
provide excellent starting points for investi-
useful predictions for evaluation with cogni-
gating how the brain actually accomplishes
tive neuroscience methods. Whereas much high-level thought. An excellent edited vol-
effort has been placed on understanding ume by Miyake and Shah (1 999) reviews many
where working memory resides in the cor- of the traditional computational perspectives
tex, much less attention has focused on how on working memory.
it functions. Understanding the neural pro-
cesses underlying working memory will al-
most certainly require tight integration of
References
methods that provide good spatial localiza-
tion (e.g., fMRI) and good temporal informa-
Atkinson, R. C. & Shiffrin, R. M. (1 968). Hu-
tion (e.g., electrophysiology) in the brain.
man memory: A proposed system and its con-
trol processes. In K. W. Spence, & J. T. Spence
(Eds.), The Psychology of Learning and Motiva-
Acknowledgments tion Vol. 2 (pp. 89–1 95 ). New York: Academic
Press.
Preparation of this chapter was supported by Baddeley, A. D. (1 966). The capacity for gener-
grants from the Office of Naval Research, ating information by randomization. Quarterly
Xunesis (www.xunesis.org), and National Journal of Experimental Psychology, 1 8, 1 1 9–29.
Baddeley, A. D. (1 986). Working Memory. Ox- dependent inhibition model. Journal of Experi-
ford, UK: Oxford University Press. mental Psychology: General, 1 2 3 , 3 5 4–3 73 .
Baddeley, A. D. (1 996). Exploring the central ex- Cowan, N. (1 988). Evolving conceptions of mem-
ecutive. Quarterly Journal of Experimental Psy- ory storage, selective attention, and their mu-
chology, 49A, 5 –28. tual constraints within the human information
Baddeley, A. D. (2000). The episodic buffer: A processing system. Psychological Bulletin, 1 04,
new component of working memory? Trends 1 63 –1 91 .
in Cognitive Sciences, 4, 41 7–423 . Cowan, N. (1 995 ). Attention and Memory: An In-
Baddeley, A., Emslie, H., Kolodny, J., & Dun- tegrated Framework. New York: Oxford Univer-
can, J. (1 998). Random generation and the ex- sity Press.
ecutive control of working memory. Quarterly Cowan, N. (1 999). An embedded-processes
Journal of Experimental Psychology, 5 1 A, 81 9– model of working memory. In Akira Miyake &
85 2. Priti Shad (Eds.) Models of Working Memory:
Baddeley, A., & Hitch, G. J. (1 974). Working Mechanisms of Active Maintenance and Exec-
memory. In G. H. Bower (Ed.), The Psychology utive Control (pp. 62–1 01 ). Cambridge, UK:
of Learning and Motivation Vol. 8 (pp. 47–89). Cambridge University Press.
New York: Academic Press. Dagenbach, D. E., & Carr, T. H. (1 994). Inhibitory
Bjorklund, D. F., & Harnishfeger, K. K. Processes in Attention, Memory, and Language.
(1 990). The resources construct in cognitive San Diego, CA: Academic Press.
development: Diverse sources of evidence and Daneman, M., & Carpenter, P. A. (1 980). Individ-
a theory of inefficient inhibition. Developmen- ual differences in working memory and read-
tal Review, 1 0, 48–71 . ing. Journal of Verbal Learning and Verbal Be-
Brooks, L. R. (1 968). Spatial and verbal compo- havior, 1 9, 45 0–466.
nents of the act of recall. Canadian Journal of Dempster, F. N. (1 991 ). Inhibitory processes:
Psychology, 2 2 , 3 49–3 68. A neglected dimension of intelligence. Intelli-
Carpenter, P. A., Just, M. A., & Shell, P. (1 990). gence, 1 5 , 1 5 7–1 73 .
What one intelligence test measures: A theo- Dempster, F. N., & Brainerd, C. J. (Eds.). (1 995 ).
retical account of the processing in the raven Interference and inhibition in cognition. San
progressive matrices test. Psychological Review, Diego, CA: Academic Press.
97, 404–43 1 .
de Renzi, E., & Nichelli, P. (1 975 ). Verbal and
Cattell Culture Fair Test (Institute for Personality non-verbal short-term memory impairment
and Ability Testing, 1 973 ). Champaign, IL. following hemispheric damage. Cortex, 1 1 ,
Chein, J. M., Ravizza, S. M., & Fiez, J. A. 3 41 –3 5 4.
(2003 ). Using neuroimaging to evaluate mod- Diamond, A. (1 990). The development and neu-
els of working memory and their implications ral bases of memory functions as indexed by
for language processing. Journal of Neurolin- the A-not-B and delayed response tasks in hu-
guistics, 1 6, 3 1 5 –3 3 9. man infants and infant monkeys. In Diamond
Christoff, K., & Gabrieli, J. D. E. (2000). The A. (Ed.), The Development and Neural Bases of
frontopolar cortex and human cognition: Evi- Higher Cognitive Functions (pp. 267–3 1 7). New
dence for a rostrocaudal hierarchical organiza- York: New York Academy of Sciences.
tion within the human prefrontal cortex. Psy- Engle, R. W., Carullo, J. J., & Collins, K. W.
chobiology, 2 8, 1 68–1 86. (1 991 ). Individual differences in working mem-
Christoff, K., Prabhakaran, V., Dorfan, J., Zhao, ory for comprehension and following direc-
Z., Kroger, J. K., Holyoak, K. J. et al. (2001 ). tions. Journal of Educational Research, 84, 25 3 –
Rostrolateral prefrontal cortex involvement in 262.
relational integration during reasoning. Neu- Engle, R. W., Kane, M. J., & Tuholski, S. W.
roimage, 1 4, 1 1 3 6–1 1 49. (1 999). Working memory and controlled atten-
Conrad, R. (1 964). Acoustic confusion in imme- tion. In Akira Miyake & Priti Shad (Eds.), Mod-
diate memory. British Journal of Psychology, 5 5 , els of Working Memory: Mechanisms of Active
75 –84. Maintenance and Executive Control (pp. 1 02–
Conway, A. R. A., & Engle, R. W. (1 994). 1 3 4). Cambridge, UK: Cambridge University
Working memory and retrieval: A resource- Press.
Ericsson, K. A. & Kintsch, W. (1 995 ). Long-term Kane, M. J., & Engle, R. W. (2003 a). The role
working memory. Psychological Review, 1 02 , of prefrontal cortex in working-memory ca-
21 1 –245 . pacity, executive attention, and general fluid
Fodor, J. A., & Pylyshyn, Z. W. (1 988). Connec- intelligence: An individual-differences per-
tionism and cognitive architecture: A critical spective. Psychonomic Bulletin and Review, 9,
analysis. In S. Pinker & J. Mehler (Eds.), Con- 63 7–671 .
nections and Symbols (pp. 3 –71 ). Cambridge, Kane, M. J., & Engle, R. W. (2003 b). Working-
MA: MIT Press. memory capacity and the control of attention:
Fuster, J. M. & Alexander, G. E. (1 971 ). Neuron The contributions of goal neglect, response
activity related to short-term memory. Science, competition, and task set to Stroop interfer-
1 73 , 65 2–65 4. ence. Journal of Experimental Psychology: Gen-
Fuster, J. M. (1 997). The Prefrontal Cortex: eral, 1 3 2 , 47–70.
Anatomy, Physiology, and Neuropsychology of Kirby, K. N., & Kosslyn, S. M. (1 992). Thinking
the Frontal Lobe. (3 rd ed.). Philadelphia: visually. In G. W. Humphreys (Ed.), Under-
Lippincott-Raven. standing Vision: An Interdisciplinary Perspective.
Fuster, J. M., Bodner, M., & Kroger, J. K. (2000). (pp. 71 –86). Oxford, UK: Blackwell.
Cross-modal and cross-temporal association in Klauer, K. C., Stegmaier, R., & Meiser, T. (1 997).
neurons of frontal cortex. Nature, 405 , 3 47– Working memory involvement in propositional
351. and spatial reasoning. Thinking and Reasoning,
Gilhooly, K. J., Logie, R. H., Wetherick, N. E., & 3 , 9–47.
Wynn, V. (1 993 ). Working memory and strate- Kyllonen, P. C., & Christal, R. E. (1 990). Reason-
gies in syllogistic-reasoning tasks. Memory and ing ability is (little more than) working- mem-
Cognition, 2 1 , 1 1 5 –1 24. ory capacity?! Intelligence, 1 4, 3 89–43 3 .
Gray, C. M., Koenig, P., Engel, A. K., & Singer, Logie, R. H. (1 995 ). Visuo-spatial Working Mem-
W. (1 989). Oscillatory responses in cat visual ory. Hillsdale, NJ: Erlbaum.
cortex exhibit inter-columnar synchronization Markman, A. B., & Gentner, D. (1 993 ). Struc-
which reflects global stimulus properties. Na- tural alignment during similarity comparisons.
ture, 3 3 8: 3 3 4–3 3 7. Cognitive Psychology, 2 5 , 43 1 –467.
Hasher, L., & Zacks, R. T. (1 988). Working mem- Miller, E. K., & Cohen, J. D. (2001 ). An integra-
ory, comprehension, and aging: A review and a tive theory of prefrontal cortex function. An-
new view. In G. Bower (Ed.), The Psychology of nual Review of Neuroscience, 2 4, 1 67–202.
Learning and Motivation: Advances in Research Miller, G. A. (1 95 6). The magical number seven,
and Theory (Vol. 22, pp. 1 93 –225 ). San Diego, plus or minus two: Some limits on our capac-
CA: Academic Press. ity for processing information. Psychological Re-
Hebb, D. O. (1 949). The Organization of Behavior. view, 63 , 81 –97.
New York: Wiley.
Milner, B. (1 966). Amnesia following operation
Hummel, J. E., & Holyoak, K. J. (1 997). Dis- on the temporal lobes. In C. W. M. Whitty &
tributed representations of structure: A theory O. L. Zangwill (Eds.), Amnesia (pp. 1 09–1 3 3 ).
of analogical access and mapping. Psychological London: Butterworths.
Review, 1 04, 427–466.
Miyake, A., Friedman, N. P., Emerson, M. J.,
Hummel, J. E., & Holyoak, K. J. (2003 ). A Witzki, A. H., & Howerter, A. (2000). The
symbolic-connectionist theory of relational in- unity and diversity of executive functions and
ference and generalization. Psychological Re- their contributions to complex ‘frontal lobe’
view, 1 1 0, 220–264. tasks: A latent variable analysis. Cognitive Psy-
James, W. (1 890). Principles of Psychology. New chology, 41 , 49–1 00.
York: Henry Holt & Co. Miyake, A. & Shah, P. (1 999). Models of working
Johnson-Laird, P. N. (1 983 ). Mental Mod- memory: Mechanisms of active maintenance and
els. Cambridge, UK: Cambridge University executive control. New York: Cambridge Uni-
Press. versity Press.
Just, M. A., & Carpenter, P. A. (1 992). A capac- Morrison, R. G., Holyoak, K. J., & Truong,
ity theory of comprehension: Individual differ- B. (2001 ). Working memory modularity in
ences in working memory. Psychological Review, analogical reasoning. Proceedings of the Twen-
99, 1 22–1 49. tyThird Annual Conference of the Cognitive
Science Society (pp. 663 –668). Mahwah, NJ: systems: A state of activated long-term mem-
Erlbaum. ory. Behavioral and Brain Sciences, 2 6, 709–
Morrison, R. G., Krawczyk, D. C., Holyoak, K. J., 777.
Hummel, J. E., Chow, T. W., Miller, B. L., Shallice, T., & Warrington, E. K. (1 970). Inde-
et al. (2004). A neurocomputational model pendent functioning of verbal memory store:
of analogical reasoning and its breakdown in A neuropsychological study. Quarterly Journal
Frontotemporal Lobar Degeneration. Journal of of Experimental Psychology, 2 2 , 261 –273 .
Cognitive Neuroscience, 1 6, 1 –1 1 . Shimamura, A. P. (2000). The role of the pre-
Müller, M. M., Bosch, J., Elbert, T., Kreiter, A., frontal cortex in dynamic filtering. Psychobiol-
Sosa, M. V., Sosa, P. V., et al. (1 996). Visu- ogy, 2 8, 207–21 8.
ally induced gamma-band responses in human Smith, E. E., & Jonides, J. (1 997). Working
electroencephalographic activity: A link to an- memory: A view from neuroimaging. Cognitive
imal studies. Experimental Brain Research, 1 1 2 , Psychology, 3 3 , 5 –42.
96–1 02. Spearman, C. (1 923 ). The Nature of Intelligence
Murdock, B. B., Jr. (1 962). The retention of in- and the Principles of Cognition. London, UK:
dividual items. Journal of Experimental Psychol- MacMillian.
ogy, 64, 482–488. Sternberg, R. J. (1 977). Intelligence, Information
Newman, S. D., Carpenter, P. A., Varma, S. & Processing, and Analogical Reasoning: The Com-
Just, M. A. (2003 ). Frontal and parietal par- ponential Analysis of Human Abilities. Hillsdale,
ticipation in problem solving in the Tower of NJ: Erlbaum.
London: fMRI and computational modeling of Toms, M., Morris, N., & Ward, D. (1 993 ). Work-
planning and high-level perception. Neuropsy- ing memory and conditional reasoning. Quar-
chologia, 41 , 2003 . terly Journal of Experimental Psychology: Human
Newell, A., & Simon, H. A. (1 972). Human prob- Experimental Psychology, 46, 679–699.
lem solving. Englewood Cliffs, NJ: Prentice- Turner, M. L. & Engle, R. W. (1 989). Is work-
Hall. ing memory capacity task dependent? Journal
Norman, D. A., & Shallice, T. (1 980). Atten- of Memory and Language, 2 8, 1 27–1 5 4.
tion to action: Willed and automatic control of Viskontas, I. V., Morrison, R. G., Holyoak, K. J.,
behaviour. University of California, San Diego Hummel, J. E., & Knowlton, B. J. (in press).
CHIP Report 99. Relational integration, inhibition and analog-
Norman, D. A., & Shallice, T. (1 986). Attention ical reasoning in older adults. Psychology and
to action: Willed and automatic control of be- Aging.
havior. In G. E. Schwartz & D. Shapiro (Eds.), Waltz, J. A., Knowlton, B. J., Holyoak, K. J.,
Consciousness and Self-Regulation. New York: Boone, K. B., Mishkin, F. S., de Menezes San-
Plenum Press. tos, M., et al. (1 999). A system for relational
Olton, D. S. (1 979). Mazes, maps and memory. reasoning in human prefrontal cortex. Psycho-
American Psychologist, 3 4, 5 83 –5 96. logical Science, 1 0, 1 1 9–1 25 .
Prabhakaran, V., Narayanan, K., Zhao, Z., & Waltz, J. A., Lau, A., Grewal, S. K., & Holyoak,
Gabrieli, J. D. E. (2000). Integration of di- K. J. (2000). The role of working memory in
verse information in working memory within analogical mapping. Memory and Cognition, 2 8,
the frontal lobe. Nature Neuroscience, 3 (1 ), 85 – 1 205 –1 21 2.
90. Waugh, N. C. & Norman, D. A. (1 965 ). Primary
Raven, J. C. (1 93 8). Progressive Matrices: A Per- memory. Psychological Review, 72 , 89–1 04.
ceptual Test of Intelligence, Individual Form. Welsh, M. C., Satterlee-Cartmell, T., & Stine, M.
London: Lewis. (1 999). Towers of hanoi and london: Contribu-
Ruchkin, D. S., Grafman, J., Cameron, K., & tion of working memory and inhibition to per-
Berndt, R. (2003 ). Working memory retention formance. Brain and Cognition, 41 , 23 1 –242.
CHAPTER 2 0
Cognitive Neuroscience
of Deductive Reasoning
Vinod Goel
Introduction independent of the content of the proposi-

tions. Deductive arguments can be evaluated
It is 4 p.m. and I hear the school bus pull up for validity, a relationship between premises
to the house. Soon there is the taunting of and conclusion involving the claim that
a 1 3 -year-old boy followed by the exagger- the premises provide absolute grounds for
ated screams of an 8-year-old girl. My kids accepting the conclusion (i.e., if the pre-
are home from school. Exasperated, I say to mises are true, then the conclusion must be
my son, “If you want dinner tonight, you bet- true).
ter stop tormenting your sister.” Given he
doesn’t want to go to bed hungry, he needs
to draw the correct logical inference. Sure Psychological Theories of
enough, peace is eventually restored. We are Deductive Reasoning
not surprised by his actions. His behavior is
not a mystery (if he wants his dinner). It Two theories of deductive reasoning (men-
is just an example of the reasoning brain at tal logic and mental models) dominate the
work. cognitive literature. They differ with respect
Reasoning is the cognitive activity of to the competence knowledge upon which
drawing inferences from given information. they draw, the mental representations they
All reasoning involves the claim that one postulate, the mechanisms they invoke, and
or more propositions (the premises) pro- the neuroanatomical predictions they make.
vide some grounds for accepting another Mental logic theories (Braine, 1 978; Henle,
proposition (the conclusion). The aforemen- 1 962; Rips, 1 994) postulate that reason-
tioned example involves a deductive infer- ers have an underlying competence knowl-
ence (see Evans, Chap. 8). A key feature edge of the inferential role of the closed-
of deduction is that conclusions are conform, or logical terms, of the language (e.g.,
tained within the premises and are logically “all, some, none, and,” etc.). The internal
475
representation of arguments preserve the uninformed by knowledge of the brain. This

structural properties of the propositional is not an oversight. Until recently, the central
strings in which the premises are stated. A domains of human reasoning and problem
mechanism of inference is applied to these solving have been largely cognitive and com-
representations to draw conclusions from putational enterprises, with little input from
premises. Essentially, the claim is that de- neuroscience. In fact, an argument advanced
ductive reasoning is a rule-governed process by cognitive scientists – based on the in-
defined over syntactic strings. dependence of computational processes and
By contrast, mental model theory the mechanism in which they are realized
(Johnson-Laird, 1 983 ; Johnson-Laird & (i.e., the brain) – has led many to question
Byrne, 1 991 ; see Johnson-Laird, Chap. 9) the relevance of neuropsychological data to
postulates that reasoners have an underlying cognitive theories.
competence knowledge of the meaning of The “independence of computational
the closed-form, or logical terms, of the level” argument is a general argument against
language (e.g., “all, some, none, and,” etc.)1 the necessity of appealing to neurophysi-
and use this knowledge to construct and ology to capture the generalizations nec-
search alternative scenarios.2 The internal essary to explain human mental life. The
representations of arguments preserve the general idea is that liberation from neuro-
structural properties of the world (e.g., spa- physiology is one of the great virtues of the
tial relations) that the propositional strings cognitive/computational revolution. It gives
are about, rather than the structural proper- us the best of both worlds. It allows us to
ties of the propositional strings themselves. use an intentional vocabulary in our psy-
The basic claim is that deductive reasoning chological theories, and if this vocabulary
is a process requiring spatial manipulation meets certain (computational) constraints,
and search. we get a guarantee (via the Church–Turing
A third alternative is provided by dual hypothesis) that some mechanism will be
mechanism theories. At a very crude level, able to instantiate the postulated process.3
dual mechanism theories make a distinc- Beyond this, we don’t have to worry about
tion between formal, deliberate, rule-based the physical. The psychological vocabu-
processes and implicit, unschooled, auto- lary will map onto the computational vo-
matic processes. However, dual mechanism cabulary, and it is, after all, cognitive/
theories come in various flavors that dif- computational structure, not physical struc-
fer on the exact nature and properties of ture, that captures the psychologically inter-
these two systems. Theories differentially esting generalizations.
emphasize explicit and implicit processes The argument can be articulated as
(Evans & Over, 1 996), conscious and precon- follows:
scious processes (Stanovich & West, 2000),
(P1 ) There are good reasons to believe
formal and heuristic processes (Newell &
that the laws of psychology need to be
Simon, 1 972; also see Kahneman & Fred-
stated in intentional vocabulary (Fodor,
erick, Chap. 1 2), and associative and rule-
1 975 ; Pylyshyn, 1 984).
based processes (Goel, 1 995 ; Sloman, 1 996).
The relationship among these proposals has (P2) Computation (sort of ) gives us such
yet to be clarified. a vocabulary (Cummins, 1 989; Fodor,
1 975 ; Goel, 1 991 , 1 995 ; Newell, 1 980a;
Pylyshyn, 1 984).
(P3 ) Our theory construction is moti-
Relevance and Role vated by computational concepts and
of Neurophysiological Data constrained by behavioral data.
(P4) Computational processes are speci-
The reader will note that these theories fied independently of physics and can
of reasoning are strictly cognitive theories be realized in any physical system.
cognitive neuroscience of deductive reasoning 477
(C1 ) Therefore, there is no way, in princi- realizes the computational process become
ple, that neurological data can constrain very relevant. Presumably neuroscience is
our computational/cognitive theories. where we will learn about the architectural
constraints imposed on the human cogni-
A closer examination will reveal at least tive/computational system. As such, it can
two flaws in the argument. First, premise P4 hardly be ignored.
is not strictly true. Computational processes But this whole line of argument and coun-
cannot be realized in any and every system terargument makes an unwarranted assump-
(Giunti, 1 997; Goel, 1 991 , 1 992, 1 995 ). If tion. It assumes that the only contribu-
it were true, then computational explanation neuroscience can make is in terms of
tions would be vacuous (Searle, 1 990) and specifying mechanisms. However, a glance
our problems much more serious. Now, it is through any neuroscience text (e.g., Kandel,
true that computational processes can be re- Schwartz, & Jessell, 1 995 ) shows that neu-
alized in multiple systems, but that is far re- roscience is still far from making substantive
moved from universal realizability. The for- contributions to our understanding of the
mer gives computational theorizing much of computational architecture of the central
its power; the latter drains computational nervous system. This is many years in the
explanations of much of their substantive future.
content. There are, however, two more immedi-
Second, the conclusion C1 depends on ate contributions – localization and dissocia-
what computational/cognitive theories seek tion – that cognitive neuroscience can make
to explain. It is true that the organization to our understanding of cognitive processes,
of a computing mechanism (for example, including reasoning.
whether a Turing Machine has one head or
two) is irrelevant when we are interested in (1 ) Localization of brain functions: It is now
specifying what function is being computed generally accepted that Franz Joseph
and are concerned only with the mappings of Gall (Forster, 1 81 5 ) was largely right and
inputs to outputs. This is typically a concern Karl Lashley (1 929) largely wrong about
for mathematicians and logicians. If cogni- the organization of the brain. There is
tive theories will only enumerate the func- a degree of modularity in its overall
tions being computed, then the argument organization. Over the years, neuropsy-
would seem to hold. However, cognitive sci- chologists and neuroscientists have ac-
entists (and often computer scientists) have cumulated some knowledge of this or-
little interest in computation under the as- ganization. For example, we know some
pect of functions. Our primary concern is brain regions are involved in processing
with the procedures that compute the func- language and other regions process vi-
tions (Marr, 1 982). Real-time computation sual spatial information. Finding selec-
is a function of architectural considerations tive involvement of these regions in com-
and resource availability and allocation. And plex cognitive tasks such as reasoning can
it is real-time computation – the study of help us differentiate between compet-
the behavioral consequences of different re- ing cognitive theories that make differ-
source allocation and organization models – ent claims about linguistic and visuospa-
that must be of interest to cognitive science tial processes in the complex task (as do
(Newell, 1 980a; Newell & Simon, 1 976), be- mental logic and mental model theories
cause it is only with respect to specific archi- of reasoning).
tectures that algorithms can be specified and (2) Dissociation of brain functions: Brain le-
compared (to the extent that they can be). sions result in selective impairment of
If we are interested in the computational ar- behavior. Such selective impairments are
chitecture of the mind – and we clearly are called dissociations. A single dissociation
(Newell, 1 990; Pylyshyn, 1 984) – then the occurs when we find a case of a lesion in
constraints provided by the mechanism that region x resulting in a deficit of function
a but not function b. If we find another uation, in which one behavioral function
case, in which a lesion in region y re- mapped onto several causally distinct struc-
sults in a deficit in function b but not tures, we would conclude that our indi-
in function a, then we have a double dis- viduation was too coarse-grained and re-
sociation. Recurrent patterns of dissocia- fine it until we achieved a one-to-one map-
tion provide an indication of causal joints ping. One final possibility is a many-to-many
in the cognitive system invisible in un- mapping between our functional individua-
interrupted normal behavioral measures tion and casually individuated physiological
(Shallice, 1 988). Lesion studies identify structures. Here we would have a total cross-
systems necessary for the cognitive pro- classification and would have to assume that
cesses under consideration. Neuroimag- our functional individuations (f1 , f2, f3 )
ing studies identify cortical regions suf- were simply wrong and start over again.5
ficient for various cognitive processes.4 The most famous example of a dissoci-
Both are sources of knowledge regarding ation comes from the domain of language.
dissociation of cognitive functions. In the 1 860s, Paul Broca described a pa-
tient with a lesion to the left posterior infe-
The identification of dissociations is the rior frontal lobe who had difficulties in the
more important of these two contributions production of speech but was quite capa-
and warrants further discussion. Cognitive ble of speech comprehension. This is a case
theories are functional theories. Functional of a single dissociation. In the 1 870s, Carl
theories are notoriously underconstrained. Wernicke described two patients with le-
That is, they are “black box” theories. We sions to the posterior regions of the supe-
usually use them when we do not know the rior temporal gyrus who had difficulty in
underlying causal structure. This devalues speech comprehension but were quite fluent
the currency of functional distinctions. But in speech production. Jointly, the two ob-
if we can show that our functional distinc- servations indicate a double dissociation and
tions map onto causally individuated neuro- tell us something important about the causal
physiological structures, then we can have independence of language production and
much greater confidence in the functional comprehension systems. If this characteri-
individuation. zation is accurate (and there are now some
By way of an example, suppose that we questions about its accuracy), it tells us that
individuate the following three functions on any cognitive theory of speech production
the basis of behavioral data: (f1 ) raise left and comprehension needs to postulate two
arm, (f2) raise left foot, (f3 ) wiggle right causally distinct functions or mechanisms.
ear. If these functions could be mapped
Neuroanatomical Predictions of Cognitive
onto three causally differentiated structures
Theories of Reasoning
in a one-to-one fashion, we would be jus-
tified in claiming to have discovered three Given that the relevance of neuroanatomical
distinct functions. If, however, all three of data to cognitive theories has not been fully
our behaviorally individuated functions map appreciated, it is not surprising that there
onto one causally differentiated structure, in are few explicit neuroanatomical predictions
a many-to-one fashion, we would say that made by these theories. The one exception is
our functional individuation was too fine- mental model theory. Johnson-Laird (1 994)
grained and collapse the distinctions until has predicted that if mental model theory
we achieved a one-to-one mapping. That is, is correct, then reasoning must occur in the
raising the left arm does not constitute a right hemisphere. The rationale here pre-
distinct function from raising the left foot sumably is that mental model theory offers a
and wiggling the right ear, but the conjunc- spatial hypothesis, and anecdotal neuropsy-
tion of the three does constitute a single chological evidence suggests that spatial pro-
function. If we encountered the reverse sit- cessing occurs in the right hemisphere. A
more accurate prediction for mental model basic paradigm and strategy
theory would be that the neural structures We have been presenting subjects with syllo-
for visuospatial processing contribute the ba- gisms, each consisting of two premises and a
sic representational building blocks used for conclusion (e.g., All dogs are pets; All poo-
logical reasoning (i.e., the visuospatial sys- dles are dogs; All poodles are pets), while
tem is necessary and sufficient for reason- they undergo positron emission tomography
ing). I will use the latter prediction. or functional magnetic resonance imaging
By contrast, mental logic theory is a lin- (fMRI) brain scans and asking them to ex-
guistic hypothesis (Rips, 1 994) and needs hibit knowledge of what logically follows
to predict that the neuroanatomical mecha- from the premises by confirming or denying
nisms of language (syntactic) processing un- the given conclusion. Our strategy has been
derwrite human reasoning processes [i.e., to (largely) stay with one type of argument
that the language (syntactic) system is both (syllogisms), manipulate content (holding
necessary and sufficient for deductive rea- the logically relevant information constant),
soning]. Both mental model and mental logic and see how the brain reacts. The specific
theories make explicit localization predic- content manipulations are described in the
tions (i.e., whether linguistic or visuospa- studies discussed subsequently.
tial systems are involved) and implicit dis- Neuroimaging studies typically require
sociation predictions – specifically, that the a rest or baseline condition against which
one system is necessary and sufficient for to compare the active condition. For our
reasoning. baseline tasks (in the fMRI studies) we used
Dual mechanism theory needs to predict trials in which the first two sentences were
the involvement of two different brain sys- related but the third sentence was unrelated
tems in human reasoning, corresponding to (e.g., All dogs are pets; All poodles are dogs;
and the formal, deliberate, rule-based sys- All fish are scaly). Stimuli were presented
tem and the implicit, unschooled, automatic one sentence at a time with each sentence
system. But it is difficult to make a pre- staying up until the end of the trial. Trials ap-
diction about localization without further peared randomly in an event-related design
specification of the nature of the two sys- (Figure 20.1 ). The task in all trials was the
tems. Nonetheless, dual mechanism theory same. Subjects were required to determine
makes a substantive prediction about a dis- whether the conclusion followed logically
sociation in the neural mechanisms underly- from the premises (i.e., whether the argu-
ing the two different forms of reasoning. ment was valid). In baseline trials in which
the first two sentences were related, subjects
would begin to construct a representation of
the problem, but when the third, unrelated,
Functional Anatomy of Reasoning
sentence appeared they would immediately
My colleagues and I have been carrying out disengage the task and respond “no.” In
a series of studies to investigate the neural reasoning condition trials in which the three
basis of logical reasoning (Goel et al., 2000; sentences constituted an argument, subjects
Goel & Dolan, 2000, 2001 , 2003 ; Goel et al., would continue with the reasoning com-
1 995 , 1 997, 1 998). Our initial goal was to ponent of the task after the presentation of
address the hypotheses made by the cogni- the third sentence. The difference between
tive theories of reasoning and, in particu- completing the reasoning task and disen-
lar, differentiate between mental logic and gaging after the presentation of the third
mental model theories. We have made some sentence isolates the reasoning components
progress along these lines (although with of interest. The data were modeled after the
surprising results) and have also provided presentation of the third sentence. The pre-
insights into the role of prefrontal cortex sentation of the first two sentences and sub-
(PFC) in logical reasoning. jects’ motor responses were modeled out as
Event-Related
Stimuli Presentation
TR = 4.1sec
Task: Is argument valid?
Random Jitter
S1 S1 S1
Motor
* S2 S2
Response
S3 (Time Variable)
Reasoning
0 0.5 3.5 6.5 10.25-14.35sec

Stimuli Presentation
Reading/Integrating
Sentences Model BOLD
signal as hrf
Figure 2 0.1 . Stimuli presentation: Stimuli from all conditions were presented
randomly in an event-related design. An “*” indicated the start of a trial at 0
milliseconds. The sentences appeared on the screen one at a time, with the first
sentence appearing at 5 00 milliseconds, the second at 3 5 00 milliseconds, and the
last sentence at 65 00 milliseconds. The duration of trials varied from 1 0.25 to
1 4.3 5 seconds, leaving subjects 3 .75 to 7.85 seconds to respond.
events of no interest. This basic design was gument as opposed to the content of the
used in each of the imaging studies discuss- sentences.
ed subsequently. However, it is well known that the seman-
We chose to use syllogisms (which test tic contents of arguments affect people’s va-
knowledge of quantification and negation) lidity judgments. In a classic study, Wilkins
for technical reasons. Imaging studies re- (1 928) showed that subjects performed bet-
quire multiple presentations of stimuli to ter on syllogisms containing sentences with
register a reliable neural signal. Syllogisms familiar semantic content (e.g., “All apples
come in 64 different forms and therefore are red”) than on syllogisms lacking seman-
allow for multiple trial presentations with tic content (e.g., “All A are B”). When the
minimal or no repetition of form. semantic content of syllogisms was incon-
We chose to manipulate content because, gruent with beliefs (e.g., “All apples are poi-
logically, the content of an argument is irrel- sonous”), performance suffered even more.
evant to the determination of its validity. For These results have been explored and ex-
example, the argument tended in more recent literature (Cherubini
et al., 1 998; Evans, Barston, & Pollard,
All men are mortal
1 983 ; Oakhill & Garnham, 1 993 ; Oakhill,
Socrates is a man
Socrates is mortal Johnson-Laird, & Garnham, 1 989). The ef-
fect is very robust and has challenged all the-
is valid by virtue of the fact it has the follow- ories of reasoning.
ing form: We discuss our key findings subsequently.
They include: (i) a dissociation between a
All A are B frontal-temporal system and a parietal sys-
C is A tem as a function of the familiarity of the
C is B
content of the reasoning material; (ii) asym-
It remains valid irrespective of whether it metrical involvement of right and left PFC,
is about Socrates or elephants. Validity is a with the left PFC being necessary and some-
function of the logical structure of the ar- times sufficient, and the right PFC being
Figure 2 0.2 . Main effect of reasoning [(content reasoning + no content

reasoning) – (content preparation + no content preparation)] revealed activation
of bilateral cerebellum (R > L), bilateral fusiform gyrus, left superior parietal
lobe, left middle temporal gyrus, bilateral inferior frontal gyrus, bilateral basal
ganglia nuclei (centered around the accumbens, caudate nucleus, and putamen),
and brain stem. Reprinted with permission from Goel (2003 ).
sometimes necessary (in unfamiliar, inco- All P are B

herent, conflicting situations) but not suffi- All C are P
cient for logical reasoning; and (iii) clarifying All C are B
roles of right PFC and ventral medial PFC
(VMPFC) in belief–logic conflict resolution. The logically relevant information in both
conditions was identical. Half the argu-
basic findings ments were valid, and the other half
Dissociable neural networks. In Goel et al. were invalid.
(2000), we scanned eleven right-handed, If mental model theory is correct, all rea-
normal subjects using event-related fMRI to soning trials should activate a visuospatial
measure task-related neural activity while system (perhaps parietal cortex). If men-
they engaged in syllogistic reasoning. The tal logic theory is correct, we would ex-
study was designed to manipulate the pres- pect activation of the language system (left
ence of content in logical reasoning. Half frontal and temporal lobe regions). Dual
the arguments contained content sentences, mechanism theory predicts engagement of
such as two distinct (but unspecified) neural sys-
tems, as determined by whether subjects re-
All dogs are pets spond in a schooled, formal manner or an
All poodles are dogs
intuitive, implicit manner. What we actu-
All poodles are pets
ally found was that the main effect of rea-
and the other half contained “no content” soning implicated large areas of the brain
versions of these sentences, such as (Figure 20.2), including regions predicted
by both mental model and mental logic ment of these two systems in the reasoning
theories. process.
However, closer examination revealed Contrary to mental logic theories that
this to be a composite activation consist- predict the language (syntactic) system is
ing of two dissociable neural systems. The necessary and sufficient for deductive rea-
content reasoning trials compared with no- soning and mental model theories that pre-
content reasoning trials revealed activa- dict the visuospatial system is necessary and
tion in left middle and superior temporal sufficient for logical reasoning, Goel et al.
lobe (BA 21 /22), left temporal pole (BA (2000) found evidence for the engagement
21 /3 8), and left inferior frontal lobe (BA 47) of both systems. The presence of semantic
(Figure 20.3 a). This is essentially a language content engages the language and long-term
and memory system. A similar network was memory systems in the reasoning process.
activated in previous PET studies of de- The absence of semantic content engages the
ductive reasoning using contentful sentences visuospatial system in the logically identical
(Goel et al., 1 997, 1 998). reasoning task. Before discussing the impli-
The reverse comparison of no-content cations of these results for cognitive theo-
reasoning trials versus content reasoning tri- ries, let us consider some additional issues
als resulted in activation of bilateral occipital and data.
(BA 1 9), bilateral superior and inferior pari- The Goel et al. (2000) study raises several
etal lobes (BA 7), and bilateral dorsal (BA 6) interesting questions, one of which has to
and inferior (BA 44) frontal lobes (Figure do with the involvement of a parietal visual-
20.3 b). This pattern of activation is known spatial system in the no-content or abstract
to be involved in the internal representa- syllogism condition. A second question has
tion and manipulation of spatial information to do with the exact property of the stimuli
(Jonides et al., 1 993 ; Kosslyn et al., 1 989) that leads to the modulation of neural ac-
and is very similar to that reported for tran- tivity between frontal-temporal and parietal
sitive inference involving geometrical shapes systems. Pursuing the first question led to a
(Acuna et al., 2002) and certain types of clarification of the second question.
mathematical reasoning involving approx- The first question is whether argument
imation of numerical quantities (Dehaene forms involving three-term spatial relations
et al., 1 999). such as:
It is possible to argue that the patterns
The apples are in the barrel
of activation revealed by the direct compar-
The barrel is in the barn
ison of content and no-content conditions The apples are in the barn
are just a function of the presence or ab-
sence of content words, rather than being in- and
dicative of different reasoning mechanisms.
A are in B
To exclude this possibility, we examined the
B is in C
Content (content, no content) by Task (rea- A are in C
soning, baseline) interaction. The modula-
tion of reasoning, by the addition of content are sufficient to engage the parietal sys-
([content reasoning – content baseline] – tem irrespective of the presence of content?
[no-content reasoning – no-content base- One rationale for thinking this might be the
line]) revealed activation in Wernicke’s area. case is subjects’ reported phenomenological
The reverse interaction, which examined the experience of using a visuospatial strategy
effect of the absence of semantic content during these tasks. Secondly, neuroimaging
([no-content reasoning – no-content base- studies have also shown the involvement of
line] – [content reasoning – content base- the parietal system in the encoding of re-
line]), activated left parietal cortex. This lational spatial information (Laeng, 1 994;
interaction analysis eliminates the aforemen- Mellet et al., 1 996). To address this ques-
tioned possibility and confirms the involve- tion, we carried out another fMRI study, this
b c
Figure 2 0.3. (a) The content reasoning–no-content reasoning comparison
revealed activation of the left middle / superior temporal lobe (BA 21 /22), the
left inferior frontal lobe (BA 47), and bilateral (BA 1 7) and lingual gyri (BA 1 8).
(b) The no-content reasoning–content reasoning comparison revealed
activation of (a) bilateral occipital (BA 1 8, 1 9) and (c) bilateral superior and
inferior parietal lobes (BA 7, 40), bilateral precentral gyrus (BA 6), and
bilateral middle frontal gyrus (BA 6). Reprinted from Goel et al. (2000) with
permission from Elsevier.
time using three-term relational arguments lack of temporal lobe (BA 21 /22) activation
like those mentioned previously (Goel & in Goel and Dolan (2 001 ) might be the na-
Dolan, 2001 ). ture of the content used in the two stud-
Goel and Dolan (2001 ) found that reason- ies. The concrete sentences in Goel et al.
ing about abstract and concrete three-term (2000) were of the form “All apples are poi-
relations, as in the aforementioned exam- sonous” whereas the concrete sentences in
ples, recruited a bilateral parietal-occipital Goel & Dolan (2001 ) were of the form “John
system with greater involvement of pari- is to the right of Mary.” The former sentence
etal and occipital lobes in the abstract con- types predicate known properties to known
dition compared with the concrete con- objects. We have beliefs about whether “all
dition. There was an absence of the two apples are poisonous.” By contrast, the latter
dissociable networks for concrete and ab- sentence types do not allow for such beliefs.6
stract reasoning reported in the first study. This leaves open the interesting possibility
In particular, the temporal lobe (BA 21 /22) that involvement of BA 21 /22 in reasoning
activation evident in concrete syllogistic rea- may be specific to content processing in-
soning in the first study was conspicuously volving belief networks rather than just con-
absent in this study. One explanation for the crete contents.
This hypothesis was tested in Goel and reasoned about landmarks in familiar and
Dolan (2003 ) in which subjects were pre- unfamiliar environments.
sented with arguments, such as Half the arguments contained sentences,
such as
No reptiles are hairy
Some elephants are hairy Paris is south of London
No elephants are reptiles London is south of Edinburgh
Paris is south of Edinburgh
containing sentences that subjects could be
expected to have beliefs about, and belief- describing environments with which sub-
neutral arguments, such as jects would be familiar (as confirmed by a
post-scan questionnaire), whereas the other
No codes are highly complex half contained sentences, such as
Some quipu are highly complex
No quipu are codes The AI lab is south of the Roth Centre
Roth Centre is south of Cedar Hall
containing sentences that subjects may not AI lab is south of Cedar Hall
have beliefs about (because they may not
know the meaning of one or more key that subjects could not be familiar with be-
terms). The referential terms in the two con- cause they describe a fictional, unknown
ditions were counterbalanced for abstract environment.
and concrete categories. Our main finding was an interaction be-
The results of this study replicated and tween Task (reasoning and baseline) and
clarified the results of Goel et al. (2000). Spatial Content (familiar and unfamiliar).
Modulation of the reasoning task by absence Modulation of reasoning regarding unfamil-
of belief [(belief-neutral reasoning – belief- iar landmarks resulted in bilateral activation
neutral baseline) – (belief-laden reasoning – of superior and inferior parietal lobule (BA
belief-laden baseline)] revealed activation in 7, 40), dorsal superior frontal cortex (BA 6),
the left superior parietal lobe (BA 7) unique and right superior and middle frontal gyri
to the belief-neutral condition. The re- (BA 8) regions frequently implicated in
verse modulation [(belief-laden reasoning – visuospatial processing. By contrast, mod-
belief-laden baseline) – (belief-neutral rea- ulation of the reasoning task involving fa-
soning – belief-neutral baseline)] revealed miliar landmarks engaged right inferior and
activation of anterior left middle temporal orbital frontal gyrus (BA 1 1 /47), bilateral
gyrus (BA 21 ) unique to the belief-bias con- occipital (BA 1 8, 1 9), and temporal lobes.
dition. These results confirm that a critical The temporal lobe activation included right
(sufficient) factor in the modulation of ac- inferior temporal gyrus (BA 3 7), posterior
tivity between these two neural systems is hippocampus, and parahippocampal gyrus
the presence of familiar or belief-laden con- regions implicated in spatial memory and
tent in the reasoning processes. navigation tasks. These results provide sup-
port for the generalization of our dual mech-
generalization of dissociation to anism account of transitive reasoning and
transitive reasoning highlight the importance of the hippocam-
We have demonstrated dual pathways for pal system in reasoning about landmarks in
reasoning about categorical syllogisms. The familiar spatial environments.
question arises whether the results general-
ize to other forms of logical reasoning, par- evidence for dissociation from patient data
ticularly three-term spatial relations, where If we are correct that reasoning involving fa-
one might think the visuospatial system to miliar situations engages a frontal-temporal
be sufficient. To answer this question, Goel, lobe system and formally identical reasoning
Makale, and Grafman (2004) studied 1 4 vol- tasks involving unfamiliar situations recruit a
unteers using event-related fMRI, as they frontal-parietal visuospatial network – with
greater frontal lobe involvement in the forever, patient performance did not improve
mer than the latter – then frontal lobe le- with the introduction of social knowledge in
sion patients should be more impaired on the form of abstract or concrete permission
reasoning about familiar situations than on schemas as did normal control performance.
unfamiliar situations. To test this hypoth- Furthermore, there was no significant corre-
esis, Goel et al. (2004) administered the lation between volume loss, IQ scores, mem-
Wason 4-Card Selection Task (Wason, 1 966) ory scores, or years of education and perfor-
to 1 9 frontal lobe patients and 1 9 age- and mance in the abstract or concrete permission
education-matched normal controls. schema conditions. The failure of patients to
Wason 4-Card Selection Task (WST) benefit from social knowledge therefore can-
(Wason, 1 966) is the most widely used task not be explained in terms of volume loss, IQ
to explore the role of content in reasoning. scores, memory scores, or years of education.
Subjects are shown four cards. They can see Consistent with the neuroimaging data,
what is printed on one side of each card, but our interpretation is that the arbitrary rule
not the other side. They are given a rule of condition of the WST involves greater acti-
the form: if p then q (e.g., “If a card has vation of the parietal lobe system, whereas
a vowel on one side, it must have an even the permission schema trials result in greater
number on the other side.”) and asked which engagement of a frontal-temporal lobe
cards they must turn over in order to verify system. The normal controls have both
the rule. The visible values on the cards cor- mechanisms intact and can take advantage
respond to the p, not-p, q, and not-q cases of social knowledge cues to facilitate the rea-
of the rule. According to standard proposi- soning process. The patients’ parietal system
tional logic, the correct choices are p (to ver- is intact, and so their performance on the
ify q is on the other side) and not-q (to verify arbitrary rule trial is the same as that of nor-
p is not on the other side). Given an arbitrary mal controls. Their frontal lobe system is
rule like the above, typically fewer than 25 % disrupted, preventing them from taking ad-
of normal subjects will turn over both the p vantage of social knowledge cues in the per-
and the not-q cards. However, the introduc- mission schema trials.7
tion of familiar, meaningful content in a rule
(e.g., “If anyone is drinking beer, then that
person must be over 21 years old.”) greatly hemispheric asymmetry
facilitates performance (Cheng & Holyoak, Our imaging studies have also revealed an
1 985 ; Cosmides, 1 989; Cox & Griggs, 1 982; asymmetry in frontal lobe involvement in
Gigerenzer & Hug, 1 992; Griggs & Cox, logical reasoning. Reasoning about belief-
1 982; Wason & Shapiro, 1 971 ). laden material (e.g., All dogs are pets; All
Specifically, we manipulated the social poodles are dogs; All poodles are pets) ac-
knowledge involved in the task in the form tivates left prefrontal cortex (Figure 20.4a),
of “permission schemas” (Cheng & Holyoak, whereas reasoning about belief-neutral ma-
1 985 ). Subjects performed the task with an terial (e.g., All A are B; All C are A; All
arbitrary rule condition (“If a card has an A C are B) activates bilateral prefrontal cor-
on one side, then it must have a 4 on the tex (Figure 20.4b) (Goel et al., 2000; Goel
other side.”), an abstract permission condi- & Dolan, 2003 ). This asymmetry shows up
tion (“If one is to take action A, then one consistently in patient data.
must first satisfy precondition P.”), and a Caramazza et al. (1 976) administered
concrete permission condition (“If a person two-term problems such as the following:
is to drink alcohol, he or she must be at “Mike is taller than George. Who is taller?”
least 21 .”). to brain-damaged patients. They reported
The principal findings were that, in the that left hemisphere lesion patients were im-
purely logical (arbitrary rule) condition, paired in all forms of the problem, but right
frontal lobe patients performed just as well hemisphere lesion patients were impaired
(or just as poorly) as normal controls. How- only when the form of the question
Figure 2 0.4. (a) Reasoning involving familiar conceptual content activates left
inferior prefrontal cortex. (b) Reasoning involving unfamiliar content activates
bilateral prefrontal cortex. (c) Right prefrontal cortex mediates belief–logic
conflict detection and/or resolution. Reprinted from Goel et al. (2000) with
permission from Elsevier.
was incongruent with the premise and both left and right hemisphere lesion
(e.g., Who is shorter?). Read (1 981 ) tested patients failed the spatial sections. They
temporal lobectomy patients on three-term concluded by emphasizing the critical role
relational problems with semantic content of the left hemisphere in both verbal and
(e.g., George is taller than Mary; Mary is spatial logical reasoning.
taller than Carol; Who is tallest?). Subjects In the WST patient study discussed previ-
were told that using a mental imagery ously (Goel et al., 2004), not only was it the
strategy would help them to solve these case that frontal lobe patients failed to bene-
problems. He reported that left temporal fit from the introduction of familiar content
lobectomy patient performance was more into the task, but the result was driven by the
impaired than right temporal lobectomy poor performance of left hemisphere lesion
patient performance. In a more recent patients. There was no difference in perfor-
study using matched verbal and spatial mance between right hemisphere lesion pa-
reasoning tasks, Langdon and Warrington tients and normal controls but only between
(2000) found that only left hemisphere left hemisphere lesion patients and controls.
lesion patients failed the verbal section, These data show that the left hemisphere
Correct inhibitory trials Incorrect inhibitory trials
a b
Figure 2 0.5. (a) Correct inhibitory trials activate right prefrontal cortex. (b)
Incorrect inhibitory trials activate VMPFC cortex. Reprinted from Goel & Dolan
(2003 ) with permission from Elsevier.
is necessary and often sufficient for reason- jects’ beliefs about the world, the beliefs
ing whereas the right hemisphere is some- are inhibitory to the logical task and de-
times necessary but not sufficient. [This is of crease accuracy (Evans, Barston, & Pollard,
course contrary to the Johnson-Laird (1 994) 1 983 ). Inhibitory belief trials consist of valid
prediction for mental model theory, but, as arguments with unbelievable conclusions
noted previously, we chose to modify this (e.g., No harmful substances are natural; All
prediction to make it consistent with basic poisons are natural; No poisons are harmful)
neuropsychological knowledge.] and invalid arguments with believable con-
clusions (e.g., All calculators are machines;
dealing with belief–logic conflicts All computers are calculators; Some ma-
Although from a strictly logical point of chines are not computers). Performance on
view, deduction is a closed system, we have arguments that are belief-neutral usually
already mentioned that beliefs about the falls between these two extremes (Evans,
conclusion of an argument influence peo- Handley, & Harper, 2001 ).
ple’s validity judgments (Wilkins, 1 928). Goel et al. (2000) noted that when log-
When arguments have familiar content, the ical arguments result in a belief-logic con-
truth value (or believability) of a given conflict, the nature of the reasoning process is
clusion will be consistent or inconsistent changed by the recruitment of the right lat-
with the logical judgment. Subjects per- eral prefrontal cortex (Figure 20.4c). Goel
form better on syllogistic reasoning tasks and Dolan (2003 ) further noted that, within
when the truth value of a conclusion (true the inhibitory belief trials, a comparison of
or false) coincides with the logical rela- correct items with incorrect items (correct
tionship between premises and conclusion inhibitory belief trials – incorrect inhibitory
(valid or invalid) (Evans, Barston, & Pol- belief trials) revealed activation of right in-
lard, 1 983 ). Such trials are facilitatory to ferior prefrontal cortex (Figure 20.5 a). The
the logical task and consist of valid ar- reverse comparison of incorrect response tri-
guments with believable conclusions (e.g., als with correct response trials (incorrect
Some children are not Canadians; All chil- inhibitory belief trials – correct inhibitory
dren are people; Some people are not Cana- belief trials) revealed activation of VMPFC
dians) and invalid arguments with unbeliev- (Figure 20.5 b).
able conclusions (e.g., Some violinists are Within the inhibitory belief trials, the pre-
not mutes; No opera singers are violinists; potent response is associated with belief-
Some opera singers are mutes). When the bias. Correct responses (in inhibitory trials)
logical conclusion is inconsistent with sub- indicate that subjects detected the conflict
between their beliefs and the logical infer- logic theory implicates the syntactic com-
ence, inhibited the prepotent response asso- ponent of language in logical reasoning. Our
ciated with the belief bias, and engaged the studies activate both the syntactic and se-
reasoning mechanism. Incorrect responses in mantic systems and components of long-
such trials indicate that subjects failed to de- term memory.
tect the conflict between their beliefs and Our results do seem compatible with
the logical inference and/or inhibit the pre- some form of dual mechanism theory, which
potent response associated with the belief explicitly predicts a dissociation. However,
bias. Their response is biased by their beliefs. as noted, this theory comes in various fla-
The involvement of right prefrontal cortex vors and some advocates may not be keen
in correct response trials is critical in detect- to accept our conclusions. The distinction
ing and/or resolving the conflict between be- that our results point to is between rea-
lief and logic. Such a role of the right lat- soning with familiar, conceptually coher-
eral prefrontal cortex was also noted in Goel ent material versus unfamiliar, nonconcep-
et al. (2000) and in a study of maintenance tual, or incoherent material. The former
of an intention in the face of conflict be- engages a left frontal-temporal system (lan-
tween action and sensory feedback (Fink guage and long-term memory) whereas the
et al., 1 999). A similar phenomenon has latter engages a bilateral parietal (visuospa-
been noted in the Caramazza et al. (1 976) tial) system. Given the primacy of belief
study mentioned previously in which right bias over effortful thinking (Sloman, 1 996),
hemisphere lesion patients were impaired we believe that the frontal-temporal system
only when there was an incongruency in the is more “basic” and effortlessly engaged. It
form of the question and the premises. By has temporal priority. By contrast, the pari-
contrast, the activation of VMPFC in incor- etal system is effortfully engaged when the
rect trials highlights its role in nonlogical, frontal-temporal route is blocked because of
belief-based responses. a lack of familiar content, or when a conflict
is detected between the logical response and
belief bias.
This is very consistent with the dual
mechanism account developed by Newell &
Simon (1 972) for the domain of problem
Consequences for Cognitive Theories
solving. On this formulation, our frontal-
of Reasoning
temporal system corresponds to the “heuris-
We now briefly address the question of how tic” system whereas the parietal system
these data map onto the cognitive theories of corresponds to the “universal” system. Rea-
reasoning with which we began our discus- soning about familiar situations automati-
sion. This is a complex question because the cally utilizes situation-specific heuristics that
data do not fit neatly with any of the three are based on background knowledge and ex-
theories. First and foremost, we show a dis- perience. When no such heuristics are avail-
sociation in mechanisms involved in belief- able (as in reasoning about unfamiliar sit-
neutral and belief-laden reasoning. The two uations), universal (formal) methods must
systems we have identified are roughly the be used to solve the problem. In the case of
language system and the visuospatial system, syllogistic reasoning, this may well involve a
which is what mental logic theory and men- visuospatial system.
tal model theory respectively predict. How- Our results go beyond addressing cogni-
ever, neither theory anticipates this dissociative theories of reasoning and provide new
tion. Each theory predicts that the system it insight into the role of the prefrontal cor-
postulates is necessary and sufficient for rea- tex in human reasoning. In particular, the
soning. This implies that the neuroanatom- involvement of the prefrontal cortex in log-
ical data cross-classify these cognitive theo- ical reasoning is selective and asymmetric.
ries. A further complication is that mental Its engagement is greater in reasoning about
familiar, content-rich situations than unfa- dren, with which I began, if I say to my
miliar, content-sparse situations. The left son, “If you want dinner tonight, you bet-
prefrontal cortex is necessary and often suffi- ter stop tormenting your sister” in a calm,
cient for reasoning. The right prefrontal cor- unconcerned voice, it usually has an effect.
tex is sometimes necessary but not sufficient However, if I state the same proposition in
for reasoning. It is engaged in the absence of an angry, threatening voice, the impact is
conceptual content and in the face of con- much more complete and immediate. Given
flicting or conceptually incoherent content that the logic of the inference is identical in
(as in the belief–logic conflicts discussed pre- the two cases, the emotions introduced into
viously). Finally, the VMPFC is engaged by the situation through the modulation of my
nonlogical, belief-biased responses. voice are clearly contributing to the result-
ing behavior. In fact, emotions can be intro-
duced into the reasoning process in at least
Future Directions
three ways: (i) in the content or substance of
Although some progress has been made over the reasoning material; (ii) in the presenta-
the past eight years, the cognitive neuro- tion of the content of the reasoning material
science of reasoning is in its infancy. The next (as in voice intonation); and (iii) in the pre-
decade should be an exciting time of rapid existing mood of the reasoning agent. We are
development. There are a number of issues currently channeling much of our research
that we see as particularly compelling for efforts to understanding the neural basis of
further investigation. The first is how well the interaction between emotions and ratio-
the results can be generalized. Will the renal thought.
sults regarding syllogisms, which are quite
difficult, generalize to basic low-level infer-
ences such as modus ponens and modus
tollens? Second, all the imaging studies to Acknowledgments
date have utilized a paradigm involving the
recognition of a given conclusion as valid or This research has been supported by
invalid. It remains to be seen whether the a McDonnell-Pew Program in Cognitive
generation of a conclusion would involve the Neuroscience Award, National Science and
same mechanisms. Third, given the involve- Engineering Research Council of Canada
ment of visuospatial processing systems in (NSERC) & Canadian Institutes of Health
much of reasoning and the postulated differ- Research (CIHR) grants, and a Premier’s Re-
ences between males and females in process- search Excellence Award to Vinod Goel.
ing spatial information (Jones, Braithwaite,
& Healy, 2003 ), one might expect neural-
level differences in reasoning between the
Notes
genders. Fourth, the issue of task difficulty
has not been explored. As reasoning trials
become more difficult, are additional neural 1 . Whether there is any substantive difference
between “knowing the inferential role” and
resources recruited, or are the same struc-
“knowing the meaning” of the closed-form
tures activated more intensely? Fifth, what terms is a moot point, debated in the litera-
is the effect of learning on the neural mech- ture.
anisms underlying reasoning? Sixth, most 2. See Newell (1 980b) for a discussion of the re-
imaging studies to date have focused on de- lationship between search and inference.
duction. Although deduction is interesting, 3 . The Church–Turing hypothesis makes the con-
much of human reasoning actually involves jecture that all computable functions belong to
induction. The relationship between the two the class of functions computable by a Turing
at the neural level is still an open question. Machine. So if we constrain the class of func-
Finally, reasoning does not occur in a vac- tions called for by our psychological theories to
uum. Returning to the example of my chil- the class of computable functions, then there
will be some Turing Machine that can compute Dehaene, S., Spelke, E., Pinel, P., Stanescu, R.,
the function. & Tsivkin, S. (1 999). Sources of mathemati-
4. These are, of course, logical claims about neu- cal thinking: Behavioral and brain-imaging ev-
roimaging and lesion studies. As in all empirical idence. Science, 2 84(5 41 6), 970–974.
work, there are a number of complicating fac- Evans, J. S., Handley, S. J., & Harper, C. N. (2001 ).
tors, including the relationship between statis- Necessity, possibility and belief: A study of
tical significance (or insignificance) and reality syllogistic reasoning. The Quarterly Journal
of an observed effect. of Experimental Psychology A, 5 4(3 ), 93 5 –
5. Again, I am making a logical point, indepen- 95 8.
dent of the usual complexities of mapping be- Evans, J. S. B. T., Barston, J., & Pollard, P. (1 983 ).
havior onto causal mechanisms. On the conflict between logic and belief in syl-
6. It is possible to generate relational sentences logistic reasoning. Memory and Cognition, 1 1 ,
one can have beliefs about; for example, 295 –3 06.
“London is north of Rome” or “Granite is Evans, J. S. B. T., & Over, D. E. (1 996). Rationality
harder than diamonds.” and Reasoning. New York: Psychology Press.
7. See also Bachman and Cannon, Chap. 21 , for Fink, G. R., Marshall, J. C., Halligan, P. W., Frith,
further discussion of disrupted thinking in pa- C. D., Driver, J., Frackowiak, R. S., & Dolan,
tient populations. R. J. (1 999). The neural consequences of con-
flict between intention and the senses. Brain,
1 2 2 (3 ), 497–5 1 2.
Fodor, J. A. (1 975 ). The Language of Thought.
References Cambridge, MA: Harvard University Press.
Forster, T. (1 81 5 ). Sketch of the new anatomy
Acuna, B. D., Eliassen, J. C., Donoghue, J. P., & and physiology of the brain and nervous sys-
Sanes, J. N. (2002). Frontal and parietal lobe ac- tem of Drs. Gall and Spurzheim: considered
tivation during transitive inference in humans. as comprehending a complete system of zoon-
Cerebral Cortex, 1 2 (1 2 ), 1 3 1 2–1 3 21 . omy. London: Law and Whittaker.
Braine, M. D. S. (1 978). On the relation be- Gigerenzer, G., & Hug, K. (1 992). Domain-
tween the natural logic of reasoning and stan- specific reasoning: Social contracts, cheating,
dard logic. Psychological Review, 85 (1 ), 1 –21 . and perspective change. Cognition, 43 (2 ), 1 27–
1 71 .
Caramazza, A., Gordon, J., Zurif, E. B., &
DeLuca, D. (1 976). Right-hemispheric damage Giunti, M. (1 997). Computation, Dynamics, and
and verbal problem solving behavior. Brain and Cognition. New York: Oxford University Press.
Language, 3 (1 ), 41 –46. Goel, V. (1 991 ). Notationality and the informa-
Cheng, P. W., & Holyoak, K. J. (1 985 ). Pragmatic tion processing mind. Minds and Machines,
reasoning schemas. Cognitive Psychology, 1 7(4), 1 (2 ), 1 29–1 65 .
3 91 –41 6. Goel, V. (1 992). Are Computational Explanations
Cherubini, P., Garnham, A., Oakhill, J., & Vacuous? Paper presented at the Proceedings of
Morley, E. (1 998). Can any ostrich fly?: Some the Fourteenth Annual Conference of the Cogni-
new data on belief bias in syllogistic reasoning. tive Science Society, Bloomington, Indiana.
Cognition, 69(2 ), 1 79–21 8. Goel, V. (1 995 ). Sketches of Thought. Cambridge,
Cosmides, L. (1 989). The logic of social MA: MIT Press.
exchange: Has natural selection shaped how Goel, V. (2003 ). Evidence for dual neural path-
humans reason? Studies with the Wason se- ways for syllogistic reasoning Psychologica, 3 2 ,
lection task [see comments]. Cognition, 3 1 (3 ), 3 01 –3 09.
1 87–276. Goel, V., Buchel, C., Frith, C., & Dolan, R. J.
Cox, J. R., & Griggs, R. A. (1 982). The effects of (2000). Dissociation of mechanisms underly-
experience on performance in Wason’s selec- ing syllogistic reasoning. NeuroImage, 1 2 (5 ),
tion task. Memory and Cognition, 1 0(5 ), 496– 5 04–5 1 4.
5 02. Goel, V., & Dolan, R. J. (2000). Anatomical seg-
Cummins, R. (1 989). Meaning and Mental regation of component processes in an induc-
Representation. Cambridge, MA: The MIT tive inference task. Journal of Cognitive Neuro-
Press. science, 1 2 (1 ), 1 –1 0.
Goel, V., & Dolan, R. J. (2001 ). Functional neu- Kandel, E. R., Schwartz, J. H., & Jessell, T. M.
roanatomy of three-term relational reasoning. (Eds.). (1 995 ). Essentials of Neural Science and
Neuropsychologia, 3 9(9), 901 –909. Behaviour. Norwalk, CT: Appleton & Lange.
Goel, V., & Dolan, R. J. (2003 ). Explaining mod- Kosslyn, S. M., Koenig, O., Cave, C. B., Tang, J.,
ulation of reasoning by belief. Cognition, 87(1 ), & Gabrieli, J. D. E. (1 989). Evidence for two
B1 1 –22. types of spatial representations: Hemispheric
Goel, V., Gold, B., Kapur, S., & Houle, S. (1 997). specialization for categorical and coordinate re-
The seats of reason: A localization study of lations. Journal of Experimental Psychology: Hu-
deductive & inductive reasoning using PET man Perception and Performance, 1 5 (4), 723 –
(O1 5 ) blood flow technique. NeuroReport, 73 5 .
8(5 ), 1 3 05 –1 3 1 0. Laeng, B. (1 994). Lateralization of categorical &
Goel, V., Gold, B., Kapur, S., & Houle, S. (1 998). coordinate spatial functions: A study of unilat-
Neuroanatomical correlates of human reason- eral stroke patients. Journal of Cognitive Neuro-
ing. Journal of Cognitive Neuroscience, 1 0(3 ), science, 6(3 ), 1 89–203 .
293 –3 02. Langdon, D., & Warrington, E. K. (2000). The
Goel, V., Grafman, J., Sadato, N., & Hallet, M. role of the left hemisphere in verbal and
(1 995 ). Modelling other minds. NeuroReport, spatial reasoning tasks. Cortex, 3 6(5 ), 691 –
6(1 3 ), 1 741 –1 746. 702.
Goel, V., Makale, M., & Grafman, J. (2004). Lashley, K. S. (1 929). Brain Mechanisms and
The hippocampal system mediates logical Intelligence: A Quantitative Study of Injuries
reasoning about familiar spatial environments. to the Brain. Chicago: University of Chicago
Journal of Cognitive Neuroscience, 1 6(4), 65 4– Press.
664. Marr, D. (1 982). Vision: A Computational In-
Goel, V., Shuren, J., Sheesley, L., & Graf- vestigation into the Human Representation and
man, J. (2004). Asymmetrical involvement of Processing of Visual Information. San Francisco:
frontal lobes in social reasoning. Brain, 1 2 7, Freeman.
783 –790. Mellet, E., Tzourio, N., Crivello, F., Joliot, M.,
Griggs, R. A., & Cox, J. R. (1 982). The elu- Denis, M., & Mazoyer, D. (1 996). Functional
sive thematic-materials effect in Wason’s se- anatomy of spatial mental imagery generated
lection task. British Journal of Psychology, 73 , from verbal instructions. The Journal of Neuro-
407–420. science, 1 6(2 0), 65 04–65 1 2.
Henle, M. (1 962). On the relation between logic Newell, A. (1 980a). Physical symbol systems.
and thinking. Psychological Review, 69(4), 3 66– Cognitive Science, 4, 1 3 5 –1 83 .
3 78. Newell, A. (1 980b). Reasoning, problem solving,
Johnson-Laird, P. N. (1 983 ). Mental Models: To- and decision processes: The problem space as a
wards a Cognitive Science of Language, Inference, fundamental category. In R. S. Nickerson (Ed.),
and Consciousness. Cambridge, MA.: Harvard Attention and Performance VIII. Hillsdale, NJ:
University Press. Erlbaum.
Johnson-Laird, P. N. (1 994). Mental mod- Newell, A. (1 990). Unified Theories of Cognition.
els, deductive reasoning, and the brain. In: Cambridge, MA: Harvard University Press.
M. S. Gazzaniga (Ed.), The Cognitive Neuro- Newell, A., & Simon, H. A. (1 972). Human Prob-
sciences (pp. 999–1 008). Cambridge, MA: MIT lem Solving. Englewood Cliffs, NJ: Prentice-
Press. Hall.
Johnson-Laird, P. N., & Byrne, R. M. J. (1 991 ). Newell, A., & Simon, H. A. (1 976). Com-
Deduction. Hillsdale, NJ: Erlbaum. puter science as empirical inquiry: symbols
Jones, C. M., Braithwaite, V. A., & Healy, S. D. and search. Communications of the ACM,
(2003 ). The evolution of sex differences in 1 9(March), 1 1 3 –1 26.
spatial ability. Behavioral Neuroscience, 1 1 7(3 ), Oakhill, J., & Garnham, A. (1 993 ). On theories
403 –41 1 . of belief bias in syllogistic reasoning. Cognition,
Jonides, J., Smith, E. E., Koeppe, R. A., Awh, 46(1 ), 87–92; discussion 93 –87.
E., & Minoshima, S. (1 993 ). Spatial working Oakhill, J., Johnson-Laird, P. N., & Garnham, A.
memory in humans as revealed by PET. Nature, (1 989). Believability and syllogistic reasoning.
3 63 , 623 –625 . Cognition, 3 1 (2 ), 1 1 7–1 40.
Pylyshyn, Z. W. (1 984). Computation and Sloman, S. A. (1 996). The empirical case for
Cognition: Toward a Foundation for Cognitive two systems of reasoning. Psychological Bulletin,
Science. Cambridge, MA: MIT Press. 1 1 9(1 ), 3 –22.
Read, D. E. (1 981 ). Solving deductive-reasoning Stanovich, K. E., & West, R. F. (2000). Individ-
problems after unilateral temporal lobectomy. ual differences in reasoning: Implications for
Brain and Language, 1 2 , 1 1 6–1 27. the rationality debate. Behavioral and Brain Sci-
Rips, L. J. (1 994). The Psychology of Proof: Deduc- ences, 2 2 , 645 –665 .
tive Reasoning in Human Thinking. Cambridge, Wason, P. C. (1 966). Reasoning. In B. Foss (Ed.),
MA: MIT Press. New Horizons in Psychology (pp. 1 3 5 –1 5 1 ).
Searle, J. R. (1 990). Is the Brain a Digital Baltimore: Penguin.
Computer? Paper presented at the Sixty-fourth Wason, P. C., & Shapiro, D. A. (1 971 ). Natural
Annual Pacific Division Meeting for the American and contrived experience in a reasoning prob-
Philosophical Association, Los Angeles, CA, lem. Quarterly Journal of Experimental Psychol-
March 3 0, 1 990. ogy, 2 3 , 63 –71 .
Shallice, T. (1 988). From Neuropsychology to Men- Wilkins, M. C. (1 928). The effect of changed ma-
tal Structure. Cambridge, UK: Cambridge Uni- terial on the ability to do formal syllogistic rea-
versity Press. soning. Archives of Psychology, 1 6(1 02 ), 5 –83 .
CHAPTER 2 1
Cognitive and Neuroscience

Aspects of Thought Disorder
Peter Bachman
Tyrone D. Cannon
Introduction tion. For we formed our constitution, it was

taxation without representation is treason”
During the course of assessments carried out (Johnston & Holzman, 1 979, p. 2 63 ).
at two clinical research centers, the follow-
ing responses were provided to standardized Reading these two responses, and imag-
questions intended to be open-ended and to ining hearing them aloud in conversation,
elicit relatively abstract responses: almost surely evokes the feeling that some-
thing is not quite right – the statements are
[Examiner A] “Can you explain the somehow disordered. For instance, the ob-
proverb, ‘Speech is the picture of the jects referred to in the first response seem to
mind’?” be related to each other only indirectly and
[Subject A] “You see the world through along varying linguistic dimensions. Conse-
speech. Like my grandfather used to speak quently, by the end of the statement, the re-
to me of Alaskans and Alsatians and blood sponse deviates dramatically from the con-
getting thicker and thinner in the Eskimo. tent requested by the examiner. The reply
He was against the Kents in England. I to the second examiner’s question does not
can’t smoke a Kent cigarette to this day” follow such a rapidly digressing course but
(Harrow & Quinlan, 1 985 , p. 44). instead seems to fixate on an idea, or perhaps
[Examiner B] “Why should people pay a phrase (“taxation without representation”)
taxes?” indirectly related to the content of the ques-
tion, seeming to repeat and elaborate on that
[Subject B] “Taxes is an obligation as
phrase without offering additional ideas.
citizens of our country. To our nation, to
this country, the United States. As a citi- Apart from describing how these state-
zen, I think we have an obligation. I think ments are disordered, understanding why
that’s carried to an extreme. Within reason, the speakers produced them in such a
taxes within reason. Taxation, we have manner is a daunting task. The process
representation, so therefore we have taxa- of comprehending and generating speech
493
integrated into an ongoing conversation Recent work adopting this approach has
involves numerous interrelated cognitive demonstrated, for instance, that certain neu-
mechanisms (Levelt, 1 989), any or all of rocognitive disruptions in schizophrenia are
which could contribute to abnormal speech associated with genetic vulnerability to the
comprehension or production. Moreover, illness, whereas other traits are associated
thought disorder tends to occur within the with disease expression (e.g., Cannon et al.,
context of a more extensive psychopathol- 2000; Cannon et al., 2002).
ogy (Andreasen & Grove, 1 986), including This more complex, integrative view
diagnoses as diverse as schizophrenia, mood of the pathology of thought disorder en-
disorders, certain personality disorders, and dows the investigator with a more powerful
autism (American Psychiatric Association, heuristic for grappling with the multitude
2000; Andreasen & Grove, 1 986). In fact, of intertwined cognitive domains and lev-
the patient quoted in the first reply was di- els of analysis active in the study of thought
agnosed with schizophrenia, the condition and how thought might come to be disor-
perhaps most closely associated with the dered in particular disease states (Cannon &
presence of thought disorder (e.g., Bleuler, Rosso, 2002). Perhaps, for instance, the in-
1 91 1 /1 95 0). The second quote, however, was tegration of behavioral genetic and experi-
provided by a individual who was not her- mental psychopathology approaches might
self diagnosed with a psychiatric disorder but yield the finding that deficits in two infor-
who has a daughter diagnosed with schizoaf- mation processing systems may – on their
fective disorder (a condition thought to be own – be necessary but not sufficient for the
closely related to schizophrenia; American phenotypic manifestation of thought disor-
Psychiatric Association, 2000), highlighting der, whereas the coincidence of these deficits
the role of heritable factors contributing to may result in its overt expression.
significant symptom expression even in the In applying this framework to a larger dis-
absence of a clear diagnostic label. cussion of thought disorder, we attempt to
Despite the prevalence of thought dis- elucidate a set of neurobiological and cog-
order across diagnostic populations, sys- nitive conditions that may participate in the
tematic efforts to study the pathology of generation of thought disorder through their
thought disorder across diverse conditions, collective action. More specifically, we focus
looking for common disease mechanisms, first on the expression of thought disorder
are rare. Rather, most investigators have in psychopathology, highlighting descriptive
chosen to study thought disorder strictly approaches. Subsequently, we shift to dis-
within the context of a particular discussion of a prominent model of speech
ease entity such as schizophrenia. Unfortu- production, as well as two models of dis-
nately, the multitude of difficulties many ordered thinking in schizophrenia, to help
schizophrenia patients face – including de- us identify cognitive mechanisms likely dis-
graded information processing capabilities; rupted in individuals displaying thought dis-
presence of debilitating symptoms (includ- order. Finally, we attempt to integrate find-
ing hallucinations and delusions); medica- ings from distinct levels of analysis (e.g.,
tion side-effects; social and occupational behavioral and molecular genetics, structural
morbidity; stressful relationships with rel- and functional neuroanantomy, behavioral
atives, who may themselves be burdened performance) to characterize diverse aspects
by psychiatric disorders – defies models of psychiatric disorder as traits specific to
of etiology based on a single underlying disease expression, which we characterize as
deficit. In recognition of this complexity, involving abnormal activation of the brain’s
psychopathology researchers have begun to temporal lobe structures critical to the for-
dissect disorders whose manifestation coin- mation and retrieval of long-term memo-
cides with the presentation of thought dis- ries and other types of concrete information
order into more fundamental neurocognitive and also involving traits specific to genetic
traits that participate in symptom formation. vulnerability, which tend to involve more
cognitive and neuroscience aspects of thought disorder 495
frontal lobe–mediated functions such as the Several clinical examples are pertinent
online maintenance and manipulation of in- to our description of formal thought disor-
formation (also see Morrison, Chap. 1 9). der as a result of ideational disorganization.
A thought-disordered individual might pro-
duce a neologism or a novel word formed
Defining Thought Disorder by the unique integration of parts of other
words. A neologism would therefore be
Perhaps the most common usage of the term conceptualized as the loosening of nor-
“thought disorder,” at least within clinical mal associative relationships between in-
settings, is as shorthand for “formal thought dividual word parts (perhaps at the level
disorder,” which refers to a taxonomy of of grammatical encoding, discussed sub-
symptoms involving abnormal speech (An- sequently). Johnston and Holzman (1 979,
dreasen, 1 979, 1 982). In this usage, thought p. 1 00) quoted one patient as responding to
disorder is typically conceptualized as the an examiner’s request to define the word
product of a loosening of associations lead- “remorse” by replying, “Moisterous, being
ing to a loss of continuity between or- moistful,” combining legal word parts to form
dered elements inferred to underlie a spoken lexically invalid words.
utterance (Maher, 1 991 ). The “formal” Similarly, an affected individual’s speech
distinction, specifically with respect to may be characterized by lexically valid but
schizophrenia, harkens back to the notion unrelated words strung together to make an
that pathologies of thought can be charac- unintelligible statement – a loosening of as-
terized as disorders of thought content or sociations between words. An example of
as disorders of thought form. The former of this type of disordered comment is, “If things
the two categories refers primarily to hal- turn by rotation of agriculture or levels or
lucinations, or well-defined percepts gener- timed in regard to everything . . . ” (Maher,
ated endogenously but experienced as, and 1 966, p. 3 95 ). In its extreme form, clinicians
attributed to, exogenous events and to delu- sometimes refer to this type of disorgani-
sions – objectively false and often bizarre zation as “word salad,” indicating its highly
beliefs held with a high level of conviction jumbled presentation.
(i.e., the patient maintains the belief in the Formal thought disorder may also man-
face of counter-evidence; American Psychi- ifest itself in an abrupt shift between in-
atric Association, 2000). A common exam- directly related topics, representing a loos-
ple of a delusion is the belief that people in ened association between ideas or clauses
the patient’s environment have the intention within or between sentences. For example,
of monitoring and even harming the patient when one patient with formal thought dis-
(i.e., a paranoid delusion).1 The latter cate- order was asked to explain why people who
gory, disorders of thought form, involves a are born deaf are usually unable to talk,
disorganization of underlying thought pro- he replied, “When swallow in your throat
cesses indicated by abnormal speech such like a key it comes out, but not as scissors.
as that quoted at the outset of this chapter. A robin, too, it means spring” (Harrow &
Factor analytic studies of symptom preva- Quinlan, 1 985 , p. 429). In this instance, the
lence in schizophrenia (e.g., Liddle, 1 987) patient seems to have switched from em-
have generally supported this form-versus- ploying one meaning of the word swallow
content distinction, for ratings of formal (i.e., the verb) to an alternate meaning (i.e.,
thought disorder tend to covary with ratings the type of bird) and then articulating a con-
of disorganized behavior on a factor includ- cept (i.e., “robin”) semantically related to the
ing neither delusions nor hallucinations, sug- alternate meaning.
gesting that thought disorder symptoms in- Perhaps even more salient in this last ex-
deed reflect a disorganization of ideational ample than the abrupt shift is that the pa-
elements not necessarily specific to articu- tient’s response seems only very tangentially
lated speech. consistent with the interviewer’s question.
In fact, disordered speech can involve state- referred to as “pressure” of speech) and the
ments that are overly vague or overly con- latter involving speech that is impoverished
crete, or otherwise do not seem congruent in terms of average number and length ut-
with the semantic or interpersonal demands terance or with respect to ideational content
implied by the comment or question posed (as was the case in the second quote cited in
by the other participant in the conversation. the Introduction). Interestingly, this positive
In contrast to the taxonomy of speech versus negative dimension seemed to effec-
abnormalities described previously, which tively discriminate thought-disordered pa-
resulted from application of the ideational tients with mania from thought disordered
confusion definition, Andreasen (1 986) de- patients with schizophrenia.
veloped a descriptive system of assessing
thought disorder intended to eschew the-
oretical assumptions regarding the pathol-
ogy resulting in disordered speech, and to Levelt’s Model of Normal
enhance clinical assessors’ statistical reliabil- Speech Production
ity. In all, Andreasen (1 986) identified eight-
een classes of speech abnormality, most of To provide an organizing framework for our
which mapped loosely to more traditional consideration of models relevant to formal
clinical conventions. For instance, the no- thought disorder, we turn first to a model of
tion of loosened associations was replaced by normal speech production. Levelt (Levelt,
a set of five somewhat more technical cat- 1 989, 1 999; Levelt, Roelofs, & Meyer, 1 999)
egories, including “derailment,” which the described such a model particularly useful
authors characterized as a consistent flow here because of its comprehensive incorpo-
of ideas only tangentiality related to each ration of diverse cognitive processes critical
other within the context of the spontaneous for effective interpersonal communication.
production of speech (sometimes referred As shown in Figure 21 .1 , Levelt’s model
to by clinicians as “flight of ideas”; And- involves a serial process by which a message
reasen, 1 986). intended for communication moves through
For our current purposes, the specific dis- a succession of stages, each of which plays
tinctions within Andreasen’s (1 986) cata- a unique role in transforming the message
log of categories are less important than into an articulated sound wave. The first set
some of the larger constructs that emerged of stages along this speech production se-
from a factor analysis of observations of the quence constitutes what Levelt refers to as
prevalence of the abnormalities in a wide- a “rhetorical/semantic/syntactic system” re-
ranging study of several psychiatric popula- sponsible for filtering a given communica-
tions (Andreasen & Grove, 1 986). Indeed, tive intention through the speaker’s model
as mentioned previously, five of the eight- of how the listener will perceive and un-
een types of abnormality clustered together derstand the message, which can be influ-
to form a “loose associations” dimension that enced by the speaker’s mental model of the
seemed to indicate the overall level of behav- listener. This system also sequences ideas in
ioral disorganization shown by patients with a logical order and places that sequence in
psychotic disorders (Andreasen & Grove, a propositional format (specific to linguis-
1 986). Another distinction appeared be- tic expression) that includes the selection
tween what the authors characterized as as- of lexical concepts, in turn triggering the
pects of “positive” and “negative” thought retrieval of appropriate lemmas from the
disorder (not to be confused with the analo- mental lexicon (for a discussion of compu-
gous positive–negative schizophrenia symp- tational evidence for the model’s lexical se-
tom distinction) with the former involving lection mechanism, see Levelt, Roelofs, &
aspects of loosened associations (e.g., derail- Meyer, 1 999; also see Medin & Rips, Chap.
ment) in combination with a significant level 3 ). The retrieval of the appropriate lem-
of rapidity and volume of speech (sometimes mas from the mental lexicon engages the
rhetorical/semantic/syntactic
system model of addressee
(ToM)
conceptual preparation knowledge of external

and internal world
discourse model, etc.

preverbal message
parsed
speech grammatical encoding
lemmas
self-perception surface structure mental lexicon
morpho-phonological
codes
morpho-phonological
encoding
phonological score
phonetic encoding syllabary
gestural scores
articulatory score
articulation
phonological/phonetic system
overt speech
Figure 2 1 .1 . Levelt’s model of normal speech production. Reprinted with permission from Levelt,
1 999.
syntactic construction of the message, for the message, a relative process, the prod-
lemmas must agree syntactically with each uct of which does not necessarily respect
other and with the overall communicative the boundaries of the superordinate lem-
intent of the speaker. mas. Next, during the process of phonetic
This retrieval of lemmas from the men- encoding, the accumulation of the phono-
tal lexicon, which also entails retrieval logical syllables, or the phonological score,
of each lemma’s inherent morpho-phono- retrieves from a “mental syllabary” a gestu-
logical code, serves as a transition out of the ral, or articulatory score, completing the pro-
“rhetorical/semantic/syntactic system” and cess by which a fully formed syntactic and
into the “phonological/phonetic system.” In- phonological message retrieves an appropri-
deed, the lemmas’ score in the mental lex- ate articulatory motor plan. Subsequently,
icon represents the basic stage at which articulation, the generation of overt speech,
semantic and phonological information is is the physical realization of the selected
bound together. motor plan.
Accordingly, the phonological codes asso- The production of overt speech, however,
ciated with each lemma’s morphemes com- does not represent the final stage in Lev-
bine according to the predetermined se- elt’s model of speech production. In fact,
quence to form the syllabic structure of the model also includes a feedback loop by
which the speaker can perceive and monitor tegration of semantic information be likely
his or her own speech for errors or external to activate the network previously described
interference, re-engaging the model at the (Indefrey & Levelt, 1 999), but it would also
level of conceptual preparation to make ap- likely activate a relatively more anterior re-
propriate corrections if necessary. gion of left inferior prefrontal cortex (Gold
On a neural level, Indefrey and Levelt & Buckner, 2002; Kounios et al., 2003 ) that
(1 999) describe the functioning of Levelt’s appears to facilitate controlled selection of
model as being implemented in a primar- information stored in long-term memory by
ily left-hemisphere–lateralized cortical net- resolving interference from activated, non-
work. They propose that the initial process target pieces of information (Thompson-
of conceptual preparation occurs in range of Schill et al., 2002).
heteromodal and cortical association areas
(specific to the modality of contextual in-
formation preceding the present production
process), the activity of which converges Thought Disorder or
with the selection of a lexical concept oc- Speech Disorder?
curring in the left middle temporal gyrus.
Subsequently, Wernicke’s area (roughly the An area of long-running controversy in the
temporal-parietal junction) is activated by study of formal thought disorder is whether
the retrieval of phonological codes associ- the phenomenon is ultimately a disorder
ated with retrieved lexical concepts followed of thought itself, or a disorder of overt
by activation of Broca’s area (posterior left speech. Specifically, rather than considering
inferior frontal cortex) and the left mid- this markedly disrupted ability to commu-
superior temporal lobe, the sites at which nicate a speech production problem, we in-
phonological encoding continues indepen- fer that the locus of pathology lies in the
dent of lexical information. Broca’s area then thought processes underlying the intentional
remains active and is joined by activation production of speech (see Chaika, 1 982, for
in other supplementary motor areas and in additional discussion of this inference). Un-
the cerebellum during the process of ar- fortunately, as discussed in depth by Critch-
ticulation. Indefrey and Levelt (1 999) fur- ley (1 964) and Maher (1 991 ), these thought
ther specify that self-monitoring, whether processes themselves are not directly observ-
occurring covertly or overtly, activates re- able. Therefore, measurement of any puta-
gions of superior temporal lobe, as well tive disruption must necessarily occur indi-
as supplementary motor areas related to rectly – usually with the assumption that the
articulation. psychomotor transformation from a thought
Critically, the authors specify that this to a spoken utterance occurs with a nor-
proposed speech production network is ac- mal range of fidelity. It is certainly worth-
tivated as such only during relatively auto- while considering whether this assumption
matic (i.e., seemingly without effort or con- is warranted.
scious awareness and potentially occurring Referring back to Levelt’s model of nor-
in parallel with other processes) speech pro- mal speech production (Figure 20.1 ), we
duction as opposed to the process of speech can consider each of the putative process-
production specifically engaged during more ing stages and attempt to infer what the
controlled (effortful, conscious processing observable product of “lesioning” each in iso-
requiring capacity-limited attention and lation would sound like. Let us first examine
operating in a serial fashion; Schneider the processing stage most closely affiliated
& Shiffrin, 1 977) information processing, with the actual act of speaking, the process
as would be more likely during the perfor- of physical articulation. If an intact would-
mance of an experimental cognitive task. be utterance moves into the stage of phys-
Not only would speech production involv- ical articulation only to be compromised,
ing controlled selection, retrieval, and in- one might expect the output to contain the
intended words encoded grammatically but which words are selected and ordered ap-
spoken in a manner that systematically dis- propriately, but the particular form of each
torts the articulatory score of the phrase. word is not adjusted to accommodate the
Such a spoken product would not resem- grammatical demands of nearby words or
ble formally disordered thought but instead the phrase as a whole (e.g., verbs are not con-
the product of conditions such as dysarthria jugated correctly; Indefrey & Levelt, 1 999).
or speech apraxia (Dick et al., 2001 ), two Although, as apparent in the quotes at
disorders familiar to neurologists and speech the beginning of the chapter, patients with
pathologists. formal thought disorder make grammatical
Similarly, if the lesion underlying formal mistakes in their speech, it is not necessar-
thought disorder involved the process of ily clear that they make such mistakes more
phonetic encoding, one would expect spo- frequently than non–thought-disordered in-
ken output to resemble speech characteris- dividuals do.
tics of what Dodd and Crosbie (2002) refer Evidence does exist (Andreasen & Grove,
to as “speech programming deficit disor- 1 986; Berenbaum & Barch, 1 995 ), however,
der” in which speech is produced fluently, that patients with formal thought disorder
but the distorted phonological score would show a small but significant level of word
yield speech devoid of normal patterns of substitution and approximation, which is
pitch and syllabification – perhaps sound- the predictable consequence of faulty re-
ing severely slurred – in the absence of trieval of lemmas from the mental lexicon,
dysarthria or speech apraxia. the initial process occurring under the head-
The immediately preceding stage, mor- ing of grammatical encoding. We therefore
phophonological encoding, the first point in conclude that we can rule out lesions to
the process at which a word’s phonolog- all processing stages occurring after lemma
ical code is processed independent of se- retrieval up to and including the articula-
mantic content, has also been shown to be tion of overt speech. The cause of formal
compromised in isolation specifically in pa- thought disorder must therefore exist some-
tients suffering from an anomic aphasia or where along the way through the rhetor-
a word-finding deficit (Indefrey & Levelt, ical/semantic/syntactic system (including
1 999). Typically, such patients describe a application of a mental model of the lis-
sense of frustration over feeling that they tener, the conceptual preparation, etc.) or in
have particular verbal concepts in mind but the self-monitoring feedback loop. Although
cannot retrieve the phonological code – can- where one draws the line between “thought”
not think of how to say the corresponding and “speech” is a somewhat of a philosoph-
word. Certainly this condition is debilitat- ical issue, we propose that all processes un-
ing, but apart from the superficial similarity derlying these two suspect components cer-
with thought blocking (sometimes consid- tainly warrant the label “thought,” justifying
ered a feature of negative formal thought the term “formal thought disorder” rather
disorder, but not a construct included in than “speech disorder.”
Andreasen’s and Grove’s system), anomic
aphasia does not resemble formal thought
disorder.
Finally, having ruled out deficits in the Overview of Cognitive Models of
stages of speech production constituting Thought Disorder
Levelt’s (1 989, 1 999) phonological/phonetic
system, we work backward to the stage at The first major psychological discussion of
which lemmas are selected and retrieved the pathology of thought disorder was pro-
from the mental lexicon, initiating the gram- vided by Eugen Bleuler, a Swiss psychi-
matical encoding process. Agrammatic pa- atrist and theorist contemporary to both
tients, such as patients suffering from Broca’s Sigmund Freud, founder of modern clini-
aphasia, are characterized by speech in cal psychology, and Wilhelm Wundt, often
cited as the founder of modern experimental Cohen et al., 1 999; Cohen & Servan-
psychology. Based on his observation of pa- Schreiber, 1 992) have proposed a model of
tients with psychotic disorders (i.e., psy- schizophrenic information processing (see
chiatric disorders manifesting both severe Figure 21 .2) in which at least a subset of
reality distortion symptoms, such as hal- information processing deficits observed in
lucinations and delusions, and a significant schizophrenia patients results from a distur-
level of behavioral disorganization), includ- bance in the interaction between a cogni-
ing schizophrenia, Bleuler (1 91 1 /1 95 1 ) ar- tive module specialized for the representa-
gued that the cause of psychosis involves a tion, active maintenance, and updating of
fundamental “loosening of associations” be- information regarding stimulus context and
tween ideational elements, which results in a module responsible for the storage of
a conceptual confusion that manifests itself learned behavioral contingencies. Given
in disordered speech (in addition to other that an individual’s repertoire of stimulus–
symptoms) – an idea preserved almost ex- response associations must be directly ac-
actly in its original form in more contempo- cessible to the behavioral selection process
rary definitions of thought disorder,2 as dis- bridging the gap between the encoding of
cussed earlier. a stimulus and the execution of a response,
Furthermore, Bleuler’s conceptualization the existence of a pathway allowing interac-
of the pathology of psychosis is analogous tion between these stored behavioral contin-
to more contemporary cognitive explana- gencies (i.e., long-term memories, including
tions of disordered information processing motor plans) and the context processing
(e.g., Andreasen et al., 1 999; Goldman- module allows contextual information to in-
Rakic, 1 995 ; Oltmanns & Neale, 1 975 ; Sil- fluence the selection and execution of ongo-
verstein et al., 2000) in that he proposed ing behavioral plans, ideally biasing behavior
that a critical parameter of a fundamental in a goal-appropriate manner (Braver, Barch,
cognitive mechanism is abnormal and that & Cohen, 1 999). Context information there-
the consequences of this single defect ac- fore mediates the selection of learned asso-
count parsimoniously for the diverse phe- ciations, which otherwise would be dictated
nomena observed in the behavior of many by environmental stimuli.
psychiatric patients. Like Bleuler, the propo- To serve this function, context informa-
nents of these contemporary models iden- tion must be represented and maintained in
tify the pathological cognitive mechanism a manner that leaves it both buffered against
and delineate how the functional conse- interference (from task-irrelevant stimuli)
quences of the abnormality are propagated and available to be updated as required
through subsequent processing stages, cur- by changing task demands. In an extreme
tailing the normal integration of thought example, this system must be capable of
and behavior or, more specifically with re- exercising cognitive control: utilizing infor-
spect to schizophrenia, the inability to use mation from a previous stimulus to bias pro-
contextual information in the efficient guid- cessing of other relevant information and
ance of ongoing, goal-directed behavior. We to suppress processing of irrelevant infor-
shall discuss at length two such models that mation and then reflecting that critical in-
were created in investigation of informa- formation in the selection of appropriate
tion processing abnormalities in schizophre- goal-directed behavior even in the face of
nia and examine how these models might ac- competition from more salient behavioral
count for symptoms such as formal thought responses.3
disorder. Citing computational evidence (Braver,
Barch, & Cohen, 1 999), the investigators
argue that variable efficiency of the in-
Cohen’s and Braver’s Model
teraction between the context processing
Cohen, Braver, and colleagues (Braver, module and the learned behavioral con-
Barch, & Cohen, 1 999; Braver et al., 2001 ; tingencies module could, in fact, account
Figure 2 1 .2 . Cohen, Braver, and colleagues’ model of information processing disruption in

schizophrenia. Reprinted with permission from Braver, Barch, & Cohen, 1 999.
for schizophrenia patients’ apparent insen- stimulus with the stimulus presented im-
sitivity to contextual information. They mediately prior (Cohen & Servan-Schreiber,
elaborate (Braver, Barch, & Cohen, 1 999) 1 992). In each task, the difficulty of main-
that a gating mechanism must exist be- taining context information and using it to
tween the two modules, allowing contextual select appropriate behavior was manipu-
information to be encoded and main- lated by varying the length of time during
tained without interference from irrelevant which context information must be main-
perceptual information under certain cir- tained prior to response selection, as well
cumstances and, at other times, making con- as the salience of task-appropriate responses
text information available for updating or to relative to task-inappropriate responses (i.e.,
influence activation in the association stor- the demand for cognitive control during
age module. Disrupted information process- the behavioral selection process). The in-
ing in schizophrenia is therefore the con- vestigators argue that, overall, schizophrenia
sequence of failure of this “gate” between patients display a differential insensitivity
the association storage module and the con- to contextual information, and this insen-
text processing module to properly open sitivity interacts with variable information
and close, degrading the fidelity of encoding maintenance demands in two out of three
and maintenance of goal-related informa- experiments (Cohen et al., 1 999). Addi-
tion, as well as the effectiveness of its biasing tionally, the investigators report a signifi-
influence. cant negative correlation between context
In support of their model, Cohen, Braver, sensitivity and severity of disorganization
and colleagues (Cohen et al., 1 999) have pre- symptoms (including formal thought disor-
sented data from schizophrenia patients and der) among schizophrenia patients, suggest-
controls performing three tasks – a single- ing that the ability to effectively and flexibly
trial version of the Stroop task (Stroop, bind ideational elements to an appropriate
1 93 5 ), a lexical disambiguation task, and a context underlies both the production of or-
“one-back” continuous performance task re- ganized speech and successful performance
quiring subjects to continuously match each on these context-heavy tasks.
Cohen, Braver, and colleagues also bring Although perhaps an extreme argument,
to bear evidence that this contextual infor- this proposition raises a question regarding
mation is actively maintained, updated, and how this model or any like it is distinct from
buffered against interference in the dorso- one that simply predicts that schizophre-
lateral prefrontal cortex (Barch et al., 1 997; nia patients will perform any given task in-
Braver et al., 1 997; Cohen et al., 1 997; also correctly. The distinction indeed exists and
see Goel, Chap. 20), which is also implicated highlights the reason why cognitive control
in the exercise of cognitive control (Braver, is so critical to the model’s successful im-
Reynolds, & Donaldson, 2003 ). More re- plementation. Specifically, patients will per-
cently, Miller and Cohen (2 001 ; also, Kane form a given task correctly when the cor-
and Engle, 2002, and Duncan, 2001 ) have rect behavioral response is somehow most
reviewed evidence that the prefrontal cor- salient or dominant with respect to other po-
tex is not only capable of maintaining rep- tential responses; in this case, the represen-
resentations of context despite interference tation of context and the prepotency of the
but is also critical in the modulation of activ- correct response are redundant mechanisms.
ity in other regions of the brain thought to When the correct response is less salient
be associated with modality-specific buffers, or less “prepotent” than an incorrect, dis-
with the ability to hold long-term memo- tracter response, patients will tend to choose
ries at a high level of activation and the sub- the distracter. Nonpsychotic subjects, con-
sequent selection of goal-directed behavior. versely, will be more capable of using
Incorporating an additional level of analysis, representations of context to inhibit the pre-
Cohen, Braver, and colleagues cite evidence potent distracter and select the appropriate,
that phasic dopamine activity modulates the less salient behavioral response – they will be
gate between the prefrontal context process- more capable of exercising cognitive control.
ing module and the individual’s repertoire This focus on cognitive control therefore
of learned behavioral contingencies (Braver, represents a critical step in the develop-
Barch, & Cohen, 1 999). ment of this model – a process that should
Although we agree that dopaminergic continue to advance, incorporating findings
modulation of cortical activity certainly from studies of neural correlates of cognitive
plays a role in the pathology of impaired in- control (e.g., Braver et al., 2003 ), the cogni-
formation processing in psychosis (e.g., Abi- tive mechanisms underlying recognition and
Dargham et al., 2002; Okubo et al., 1 997), resolution of response conflict (Botvinick
the particular mechanism Cohen and col- et al., 2001 ), and the specificity of the find-
leagues propose (i.e., increased tonic and ings to patients suffering from psychosis
decreased phasic dopamine activity; Braver, (Barch et al., 2003 ).
Barch, & Cohen, 1 999), however, remains Finally, two additional issues awaiting res-
controversial (see, for example, Grace, 1 991 , olution are also worthy of brief mention.
or Laruelle, Kegeles, & Abi-Dargham, 2003 ). The first area involves the mechanism by
In addition to being able to account which particular behavioral responses ac-
qualitatively for the cognitive deficits the quire their levels of salience, or prepotency.
model was designed to simulate (Braver, Cohen, Braver, and colleagues refer to be-
Barch, & Cohen, 1 999), the proposition that havioral learning principles to account for
schizophrenia patients fail to appropriately how associations are formed between par-
use contextual information to guide ongo- ticular pieces of contextual information and
ing behavior in a goal-directed manner cer- specific outcomes (Braver, Barch, & Carter,
tainly has face validity.4 One might argue, 1 999), linking contextual information to in-
however, that any behavior judged to be ab- centive salience, and therefore to behavioral
normal, or more specifically, deficient with response salience; however, they do not ac-
respect to a given goal state, could be ex- count for the initial identification and cate-
plained by a failure of this context process- gorization of pieces of information (unless a
ing mechanism. stochastic process of sampling reward value
Stored
Regularities
Generator of
The World Comparator Predictions
Plans
Figure 2 1 .3. Hemsley’s and Gray’s model of disrupted information processing in

schizophrenia. Reprinted with permission from Gray et al., 1 991 .
from among the set of available stimuli is traction in our attempt to understand how
assumed), nor do they argue that associa- information processing abnormalities might
tions among behaviors, contextual informa- contribute to the manifestation of thought
tion, and outcomes will generalize across disorder.
situations.
Additionally, future discussion of con-
Hemsley’s and Gray’s Model
textual information, as defined by Cohen,
Braver, and colleagues, might benefit from A model with properties analogous to as-
consideration of how this particular con- pects of the model developed by Cohen
struct relates to definitions of context in and colleagues, but with important incon-
other fields of research within cognitive psy- gruities as well, has developed in a body
chology and neuroscience. Borrowing an ex- of publications authored by Hemsley, Gray,
ample from the study of conditioning in non- and colleagues over the past two decades
human animals, investigators predictably (Gray, 1 982; Gray, 1 995 ; Gray, 1 998; Gray
define context as the aspects of the phys- et al., 1 991 ; Hemsley, 1 987; Hemsley, 1 993 ;
ical setting in which a particular condition- Hemsley, 1 994; Weiner, 1 990). As summa-
ing trial takes place that are immediately ob- rized most recently by Gray (1 998), this
servable by the animal (e.g., Fanselow, 2000; model of disordered information process-
Goddard, 2001 ; Rudy & O’Reilly, 2001 ). ing in schizophrenia involves a disruption
This definition of context is relatively consis- in the processes by which past regulari-
tent and uniform across studies, facilitating ties of experience are integrated with on-
the construct’s incorporation into behavioral going stimulus recognition and behavior se-
models and the subsequent generalization of lection and monitoring (see Figure 21 .3 ).
those models to analogous, ecologically valid This failure to engage information fluidly
situations for which the model can generate from longer-term memory in the interpre-
behavioral predictions (such as the behavior tation of the current perceptual state of
of a recovered drug addict in a physical set- the world and the prediction of subsequent
ting with which drug use is associated; e.g., states is essentially the failure of an informa-
Shaham et al., 2003 ). Cohen, Braver, and tion processing system to identify and utilize
colleagues (Cohen et al., 1 999) on the other contextual information in the automatic
hand, seem to define context purely in terms guidance of goal-directed behavior. As de-
of performance an cognitive tasks. These am- lineated by Gray (1 998), what should seem
biguities aside, as we will discuss, this and the familiar to the patient and elicit auto-
following model provide critical theoretical matic processing of information (seemingly
without effort or conscious awareness and, nomic orienting response observed in hu-
potentially, in parallel with other processes) mans and other animals occurs in order to
instead seems novel, engaging finite, con- increase the cognitive resources available for
trolled information processing resources (ef- sensory processing.
fortful, conscious processing requiring atten- For individuals suffering from schi-
tional focus and operating in a serial fashion; zophrenia, the fundamental deficit “ . . . lies
Schneider & Shiffrin, 1 977). Consistent with at the moment of integration of past ex-
an earlier proposal put forth by Nuechterlein perience with current information han-
and Dawson (1 984), Hemsley (1 994) and dling” (Gray, 1 998, p. 261 ). Specifically,
Gray (1 998) argue that schizophrenia pa- information about past regularities of expe-
tients are significantly more likely to engage rience is not integrated fluently with cur-
these controlled processes than are nonpsy- rent perceptual information, preventing the
chotic subjects, resulting in patients’ en- system from making an appropriate pre-
gaging information processing bottlenecks diction about the next state of the per-
significantly more frequently, and, through ceptual world and markedly decreasing the
physiological mechanisms discussed subse- likelihood that a match signal will be gen-
quently, this disparity leads to the conscious erated. Consequently, the impaired individ-
experience of psychosis. ual experiences the detection of novelty in
Similar to the model discussed previously, the perceptual environment much more fre-
Hemsley’s and Gray’s model accounts for quently than would an individual generat-
the influence of contextual information on ing more frequent match signals. In light of
the goal-oriented direction of behavior. Un- the increased sensory processing demands
like the previous model, however, Hemsley and the concomitantly increased demand
and Gray (Gray et al., 1 991 ) mention explic- placed on Gray’s comparator mechanism,
itly that their model includes a dedicated as well as the need to select and initiate a
comparator mechanism that examines the different motor program, the cognitive pro-
products of regular perceptual sampling of cessing demands once fulfilled automatically
the environment within the context of a pre- now require capacity-limited, controlled
dicted model of the perceptual world (cor- processing.
rected for the influence of ongoing motor This conjecture does seem to reflect
plans, as well as other dynamic aspects of the subjective experiences reported by many
perceptual world stored in long-term mem- schizophrenia patients, who describe feel-
ory). The results of this comparison are then ing overwhelmed by a somehow foreign-
abstracted according to the degree to which seeming, disjointed perceptual landscape,
they match the prediction and transmitted unable to discern more meaningful features
to the motor programming system, which of the environment from less meaningful
interprets a relative “match” signal as an in- features (Davis & Cutting, 1 994; McGhie &
dication that it should allow the current mo- Chapman, 1 961 ). Indeed, Hemsley’s (1 994)
tor program to continue (i.e., “the behaviors and Gray’s (1 998) proposal that actively psy-
executed are having the predicted effects”) chotic schizophrenia patients engage their
and a relative “mismatch” signal as an indi- sensory environment in a much more at-
cation that it should interrupt the ongoing tentionally intensive manner, all the while
motor plan because something novel or un- sensing endogenous indications that some
expected has occurred. aspect of that environment is novel or un-
However, the presence of a relative mis- expected, appears to account for patients’
match signal orients the individual’s atten- reports of attributing increased significance
tion to the possibility of a meaningful change to aspects of the environment that non-
in the perceptual environment, increasing schizophrenic individuals would consider in-
the intensity of sensory processing (Gray, significant (Davis & Cutting, 1 994), poten-
1 998) – a proposition that converges with tially participating in the development of
Sokolov’s (1 963 ) suggestion that the auto- delusional beliefs (e.g., Maher, 2002).
Moreover, Gray (1 998) delineates a sec- dopaminergic inhibition, effectively disrupt-

ond consequence of the pathologically fre- ing the ability of the ventral frontal lobe to
quent interruption of ongoing motor pro- communicate match signals and in turn set-
grams in schizophrenia patients: disruption ting off a chain reaction of inhibitory steps
in the “labeling” of interrupted motor pro- throughout the basal ganglia, eventually in-
grams as internally generated – a conse- hibiting the reticular nucleus of the thala-
quence of impaired self-monitoring. That is, mus. Once the reticular nucleus is inhibited,
the patient recognizes the results of the (at the excitatory, largely feed-forward loops
least partial) execution of a motor program, comprising thalamocortical sensory infor-
but does not attach a sense of personally mation processing circuits are left relatively
willed intention to the motor program – a unchecked – a consequence possibly related
mechanism first proposed by Frith (1 987). to the subjective sense of increases in the de-
Considering that speech – even covert or gree of conscious sensory processing under-
subvocal speech – essentially constitutes a way (also see Grace, Moore, & O’Donnell,
complex motor program, a consequence of 1 998). Moreover, this thalamocortical disin-
a failure to recognize that motor program as hibition and concomitant sense of increased
a behavior willfully enacted by oneself may conscious processing of stimuli facilitates pa-
lead to the conclusion that the speech expe- tients’ and controls’ differential engagement
rienced was generated by an agent other than of highly controlled cognitive processes – a
the individual – the definition of an auditory functional dissociation seen most strikingly
hallucination (see Ford et al., 2002, for a pos- in the prefrontal cortex (Jansma et al., 2001 ).
sible fronto-temporal correlate of this phe- Furthermore, this thalamic disinhibition dis-
nomenon). In this manner, Gray (1 998) and rupts the functioning of parietal and inter-
Frith and colleagues (1 992) argue the failure connected prefrontal areas active during the
to associate willed intention with the exe- attribution of overt behavior as being self-
cution of a motor program can lead to the generated (Frith et al., 1 992) – an operation
experience of a significant perceptual abnor- closely related to the functioning of the ven-
mality, such as a hallucination. tral prefrontal comparator.
With respect to its neural implementa- Two areas of concern warrant brief men-
tion, Hemsley’s and Gray’s model (Gray, tion. The first involves the possibility that
1 998) focuses on regulatory functions of the model’s predictions might prove rel-
dopamine, as does Cohen’s and Braver’s; un- atively nonspecific with respect to the
like Cohen’s and colleagues’ model, how- primary locus of pathology: Any number
ever, it places greatest emphasis on the of disruptions in the proposed information
dopaminergic modulation of a structure processing system would result in a marked
other than the prefrontal cortex – namely, drop in the number of match signals re-
the nucleus accumbens, a site of great inte- ceived, leading to a greater degree of con-
gration of disparate neural circuits, located trolled processing. In addition to this po-
in the basal ganglia. Gray (1 998) posits that tential nonspecificity, one might argue that
the comparison between predicted and ob- the behavioral evidence cited in support
served perceptual information is carried out of the model does not easily map onto
in the ventral portion of the frontal lobe,5 the clinical phenomena for which it at-
and the results are transmitted through a me- tempts to account. Although Hemsley’s and
dial temporal lobe pathway to the nucleus Gray’s model seems to relate in meaning-
accumbens. Importantly, this excitatory, ful ways to the subjective experiences of
glutamatergic input to the nucleus accum- schizophrenia patients, much of the behav-
bens is paired with an inhibitory efferent ioral evidence supporting it (Gray et al.,
connection from the dopamine-releasing nu- 1 991 ; Gray, 1 998) is drawn from studies of
clei of the midbrain. Gray suggests that when latent inhibition (Lubow, 1 95 9) – a phe-
the excitatory input is disrupted, the nucleus nomenon of classical conditioning defined as
accumbens receives a relative overload of the difference in amplitude or intensity of
conditioned responses to conditioned stim- cognitive mechanisms relatively proximal to

uli of two types: stimuli to which the subject speech production, such as those included in
was already exposed prior to association Levelt’s phonological/phonetic system (e.g.,
with the present response and stimuli oth- Barch & Berenbaum, 1 996, and Goldberg
erwise novel to the subject when first et al., 1 998). In agreement with our con-
associated with the present response. An clusion, Kerns and Berenbaum (2002) re-
association usually occurs more readily be- port only a very minor relationship between
tween the “non–pre-exposed” stimulus and phonological/phonetic system impairment
the response, which is a phenomenon be- and ratings of thought disorder and argue
lieved to be associated with an inhibition of that this relationship is carried entirely by
association formation caused by the persist- measures of anomia and word substitution
ing representation of the “pre-exposure” ex- and approximation, deficits likely related to
perience of the stimulus. The contextual in- the retrieval of lemmas from the mental lexi-
formation comprised by the representation con (Indefrey & Levelt, 1 999). The vast ma-
of the pre-exposure therefore influences the jority of clinical phenomena related to for-
efficiency with which a subsequent behav- mal thought disorder (Andreasen & Grove,
ioral association is formed. Taken alone, ev- 1 986), however, is left unaccounted for by
idence that schizophrenia patients do not deficient speech production.
show expected latent inhibition effects may Kerns’ and Berenbaum’s (2002) second
be interpreted as a failure by patients to uti- category of hypothesized deficit involves in-
lize contextual information in the behav- creased amount of activation spreading au-
ioral conditioning domain. One might ques- tomatically between nodes in semantic net-
tion, however, whether evidence taken pri- works (assumed to operate like standard
marily from classical conditioning serves as neural network models; Dell & O’Seaghdha,
an adequate foundation for an information 1 991 ), resulting in increased priming of
processing model as wide-ranging and com- nearby semantic associates of a target word,
plex as Gray’s and Hemsley’s and that carries raising the probability that one of these non-
implications for elusive aspects of cognition target words will be retrieved and integrated
such as conscious awareness. into ongoing speech. A relatively intense
area of study in schizophrenia research (for
a review, see Minzenberg et al., 2002), in-
Studies of Information Processing Deficits
vestigators looking for evidence of abnormal
Related to Formal Thought Disorder
semantic network priming have reported
To help us fill in the gap in available seemingly contradictory findings, with some
theory between mechanisms underlying a showing evidence of hyper-priming at tested
modality-nonspecific degradation in infor- nodes (suggesting increased amount of ac-
mation processing ability and the mech- tivation spreading throughout the network;
anisms generating organized, goal-directed Spitzer et al., 1 993 , 1 994; Weisbrod et al.,
speech, we turn to the literature on 1 998) and others showing evidence of hypo-
(quasi) experimental approaches to study- priming at tested nodes (suggestive of a re-
ing the pathology of formal thought disorder. duced amount of activation; Blum & Frei-
Thankfully, this work has been examined des, 1 995 ; Passerieux et al., 1 997; Besche
capably in a recent meta-analysis by Kerns et al., 1 997). This contradiction prompted
and Berenbaum (2002), who organize the the suggestion that thought-disordered pa-
range of published hypotheses involving spe- tients actually experience an increase in
cific cognitive impairments associated with distance of activation spread, while main-
formal thought disorder into four general taining an overall level of activation compa-
categories. rable with controls, effectively yielding an
We have already discussed the first of increased number of nodes activated, with
these categories, involving investigations of none activated to as high a degree as controls’
nodes (Spitzer, 1 997). In their analysis, Kerns response selection: Specifically, informa-
and Berenbaum (2002) reject the hyper- tion from (long-term) semantic memory is
priming hypothesis, and indeed report that a continuously retrieved and integrated into
small amount of evidence exists supporting comprehension and online production of
increased distance of activation spread and verbal behavior. Failure of this fluid integra-
decreased amount of activation at any given tion therefore has the ability to ultimately
node, suggesting that a thought-disordered prevent a match signal from being gener-
patient should be slightly more likely than ated, engaging (albeit indirectly) capacity-
controls to retrieve a word relatively dis- limited, controlled processing resources, and
tantly related to the target word. likely recruiting activation of left inferior
Aside from this evidence of a relatively prefrontal cortex to facilitate the otherwise
minor contribution to the expression of for- automatic selection of semantic information
mal thought disorder, the first deficit shown mediated by activity in the left middle tem-
by Kerns and Berenbaum (2002) to con- poral gyrus (Gold & Buckner, 2002; Indefrey
tribute significantly to thought disorder in- & Levelt, 1 999).
volves semantic memory functioning rel- This process of semantic memory re-
atively distinct from automatic spreading trieval and integration itself may be mod-
of activation, such as impairment of con- ulated by the subject of Kerns’ and Beren-
trolled retrieval of information from seman- baum’s (2002) fourth category of cognitive
tic memory, which may itself have an ab- deficit contributing to formal thought dis-
normal netware structure (because of the order – namely, impaired executive func-
cumulative effects of a chronic inability to tioning. As a composite construct, Kerns and
encode semantic information, for instance). Berenbaum (2002) demonstrate that execu-
Relevant studies (e.g., Allen et al., 1 993 ; tive function abnormality is strongly related
Goldberg et al., 1 998; Kerns et al., 1 999) to the presence of formal thought disorder.
tend to employ fluency tasks requiring re- Of course, executive function itself entails
trieval of information from semantic mem- a number of critical subsystems (Baddeley,
ory by means such as a controlled imple- 1 986), including a mechanism for process-
mentation of retrieval strategy (Ruff et al., ing contextual information (and effectively
1 997). In agreement with conclusions of- inhibiting irrelevant, noncontextual infor-
fered by Minzenberg, Ober, & Vinogradov mation), a mechanism for allocation of at-
(2002) and by Baving and colleagues (2001 ), tentional capacity serving to maintain in-
all of whom argue that semantic retrieval is formation over a delay, and a mechanism
most consistently and robustly impaired in for monitoring one’s own behavior, in-
schizophrenia patients when a high degree cluding speech.
of controlled processing is required, Kerns
and Berenbaum (2002) present evidence of context/selective attention
a strong, consistent association between this Consistent with Cohen’s and Braver’s
type of semantic processing abnormality and model, there is indeed considerable evi-
presence of formal thought disorder. They dence that thought-disordered patients suf-
argue additionally that the current literature fer from abnormal processing of contex-
does not offer evidence permitting a disam- tual information. In fact, Levelt’s model of
biguation between abnormal network struc- speech production incorporates contextual
ture and impaired information retrieval. information at numerous stages, such as dur-
Evidence of impaired semantic retrieval ing conceptual preparation (when interper-
associated with formal thought disorder is sonal context is considered, for instance).
consistent with Hemsley’s and Gray’s (Gray Additionally, the process of lexical selec-
et al., 1 991 ; Gray, 1 998) model’s focus tion may be influenced by discourse context
on the smooth integration of stored in- (Horn & Ward, 2001 ), which describes the
formation with incoming information and representation of previously uttered verbal
information one must hold in mind to en- context of the question implied the non-
sure that subsequent utterances will show dominant meaning.
adequate structural continuity with and se- Of course, excessive yielding to normal
mantic and conceptual relevance to the over- biases is the logical complement of Cohen’s
arching conversation. Numerous investiga- and Braver’s cognitive control mechanism,
tors have examined schizophrenia patients’ which is defined by its ability to overcome
capacity to use discourse context to guide these normal biases. Accordingly, Cohen,
selection of verbal behaviors. Studies us- Braver, and colleagues argue that an indi-
ing the traditional cloze procedure (Taylor, vidual’s representation of discourse context,
1 95 3 ), in which the subject reads a block as well as his or her goals for the inter-
of text missing every fourth or fifth word action (e.g., make a particular point, com-
and must attempt to use the context pre- municate in a certain manner) constitute
ceding each blank to guess what word is re- contextual information, guiding the ongo-
quired, have found that psychotic patients ing implementation of related semantic con-
tend to show impaired performance (re- cepts (Botvinick et al., 2001 ). Failure to
viewed in Cozolino, 1 983 ); however, several encode, update, or maintain this contextual
marked methodological limitations of the information therefore leads to a failure to
procedure (Maher, 1 991 ) cast uncertainty utilize discourse context to constrain and se-
on interpretation of those findings. A great lect subsequent verbal output, appearing to
number of studies have taken a different the observer as a relative lack of association
approach (Benjamin & Watt, 1 969; Chap- between units of language output.
man & Chapman, 1 973 ; Cohen & Servan- Moreover, if failure to encode, main-
Schreiber, 1 992; Kuperberg, McGuire, & tain, or implement contextual information
David, 1 998; Sitnikova et al., 2002) using is, in fact, a mechanism underlying formal
various lexical disambiguation tasks that re- thought disorder, it may explain a long-held
quire the subject to use contextual informa- piece of clinical wisdom – specifically, dis-
tion from preceding clauses to determine the ordered speech is more likely to be elicited
relevant meaning of a homograph, or a word by abstract, ambiguous, open-ended stim-
with multiple possible definitions. uli (such as the general question posed to
These and other investigators have gener- the quoted subjects at the beginning of this
ally concluded that patients with psychotic chapter, or even Rorschach inkblots; John-
disorders fail to demonstrate sensitivity to ston & Holzman, 1 979) than by specific,
the biasing influence of preceding contextual closed-ended prompts. In other words, the
information; however, Chapman and Chap- fewer structural demands and intermediate
man (1 973 ) refined this conclusion, arguing goal states provided explicitly, the more dif-
that patients fail to demonstrate sensitivity ficult it is to practice cognitive control. Un-
to discourse context only when it suggests a der these circumstances, not only is spe-
homograph’s nondominant meaning. They cific contextual information either never en-
characterized this deficit as “excessive yield- coded or lost from active maintenance, but
ing to normal biases,” or a tendency to utilize the context processing module loses the con-
dominant meanings. For instance, when one comitant ability to inhibit the activation of
patient was asked to interpret the proverb competing pieces of information, exposing
“One swallow does not make a summer,” he the system to increased memory retrieval
responded, “When you swallow something, interference (Anderson & Spellman, 1 995 )
it could be all right, but the next minute you and subsequent loss of goal orientation in
could be coughing, and dreariness and all produced speech.
kind of miserable things coming out of your
throat” (Harrow & Quinlan, 1 985 , p. 43 6). capacity allocation
The patient clearly demonstrated a bias to- Given this continued focus on controlled
ward the more dominant meaning of the processing as critical to information process-
word “swallow,” despite the fact that the ing abnormalities related to formal thought
disorder, it is important to consider the allo- impairment in the ability to self-correct erro-
cation of working memory capacity, a pro- neous behaviors (e.g., Malenka, et al., 1 982)
cess shown to involve activation of dorso- and that patients with formal thought dis-
lateral prefrontal cortex as well as more order demonstrate significant impairment
modality-specific regions of posterior cortex in self-monitoring of motor behavior (e.g.,
(e.g., Garavan et al., 2000), as well as avail- Kircher & Leube, 2003 ), Barch and Beren-
ability of free capacity, which appears to baum (1 996) administered a task requiring
be reflected in the activity of dorso- (Cal- patients to read separate word lists and then,
licott et al., 1 999) and ventrolateral pre- later, to recall whether presented words were
frontal cortex (Rypma, Berger, & D’Espasito read aloud or silently or were novel to the
2002). Numerous studies (e.g., Docherty & testing phase of the study. Patients who
Gordinier, 1 999; Harvey & Pedley, 1 989; demonstrated worse performance on this
Nuechterlein et al., 1 986) have found corre- task tended to produce a greater number of
lational evidence of a relationship between verbal derailments (i.e., switching tangen-
working-memory capacity and aspects of tially between topics of discussion) in the
formal thought disorder. Attempting to independent speech sample, suggesting that,
clarify the direction of this relationship, whereas amount and content of disordered
Barch and Berenbaum (1 994) report that, speech are strongly affected by working-
among nonill subjects, reduction in overall memory capacity available, the coherence
processing capacity (achieved through a and goal directedness of speech are influ-
dual-task manipulation) is associated with enced to a great degree by contextual pro-
decreases in verbosity and syntactic com- cessing and self-monitoring ability.
plexity, which are verbal phenomena in-
cluded in formal thought disorder – particu- integration of cognitive deficits
larly “negative thought disorder” (Andreasen contributing to formal thought disorder
& Grove, 1 986). Melinder and Barch (2003 ) Hemsley and Gray also argue for impair-
extend this approach to include psychotic ment in self-monitoring among schizophre-
patients, showing that they, too, manifest nia patients, proposing that disruptions in
increased negative thought disorder with this capability result in a failure to gen-
decreasing availability of working-memory erate match signals and a consequential
capacity. These results are particularly note- increase in the extent of controlled, ef-
worthy because the investigators were able fortful processing engaged. Given that this
to demonstrate that reduced processing ca- repeated failure results in a reduction in
pacity can actually cause speech to become availability of online processing resources
disordered rather than to show a correla- concomitant with the shift from automatic
tion between reduced processing capacity to more controlled functioning, it should
and thought disorder. Indeed, this repre- lead to a change in the manner in which in-
sents one instance out of many in which formation is retrieved from semantic mem-
schizophrenia research has shown working- ory (Badre & Wagner, 2002). Specifically,
memory capacity to act as a bottleneck, the retrieval of target information should be
limiting the production or implementation biased by the activation as well as by the
of abstract ideas (e.g., Glahn et al., 2000; inhibition of nontarget information (Neely,
Silver et al., 2003 ). 1 977) consistent with the notion of cog-
nitive control.
An investigation by Titone, Levy, and
self-monitoring Holzman (2000) provides further empiri-
Additionally, inspired by Levelt’s (1 989) as- cal support for the presence and opera-
sertion that the production of nondisordered tion of these pathological semantic memory
speech requires the speaker to monitor his retrieval and executive functions contribut-
or her own speech, and consistent with ev- ing to formal thought disorder. The au-
idence that schizophrenia patients show an thors reported the results of a cross-modal
semantic priming study in which particular the ability to predict patient performance
meanings of otherwise semantically ambigu- data in a variety of experimental cogni-
ous words were biased either moderately or tive tasks. In addition, these models con-
strongly by the context of a preceding sen- verge with descriptive analyses of the expe-
tence. Experimental parameters were opti- rience of thought disorder in patients with
mized to increase the likelihood that more psychotic disorders. Yet the parsimony that
controlled retrieval of semantic information these models gain in attributing context-
would be utilized. Schizophrenia patients processing deficits in thought-disordered
showed a pattern of priming identical to con- patients to a disturbance in a particular pro-
trols in the strong contextual bias condition cessing component (i.e., either a disruption
but exhibited a greater degree of priming in in short-term representations of stimulus
the moderate contextual bias condition (i.e., context or in the integration of current con-
patients showed priming effects for both rel- textual information with memories of prior
atively dominant and relatively subordinate stimulus contexts) also leaves them vulnera-
meanings, whereas controls showed priming ble to refutation inasmuch as disturbances in
facilitation only for subordinate meanings). other (or multiple) processing components
The authors point out that retrieval of a par- of the complex, integrated circuitry medi-
ticular meaning of a word requires not only ating willed behavior could account equally
activation of the word within a semantic net- well for a wide variety of thought-disordered
work but also inhibition of nearby, less rel- phenomena. That is to say, demonstrating
evant meanings. Patients were able to per- that a particular neurocognitive impairment
form this selection process normally when could account for a particular behavioral ab-
strong contextual bias was present, but when normality does not necessarily demonstrate
this influence was more subtle, the patients’ that the impairment does cause the abnormal
degraded, retrieval-related inhibitory mech- behavior to occur.
anism failed to filter out alternate meanings, Indeed, as mentioned in the Introduc-
creating interference with the most imme- tion, more than four decades of intensive
diately relevant meaning. neuroscientific investigations have failed to
Therefore, to the extent that the study identify conclusively a single defining lesion
indeed engaged controlled processing mech- in patients with schizophrenia or other
anisms (and consequently did not rely en- forms of psychosis. Rather, as discussed in
tirely on the automatic spread of activation more detail subsequently, these syndromes
in a semantic network), the results support manifest with deficits to many neural sys-
the hypothesis that disordered speech results tems (e.g., cortico-cortical, fronto-striatal,
from disrupted executive-assisted seman- temporo-limbic) across several levels of
tic memory retrieval mechanisms involv- analysis (e.g., alterations in gray matter
ing both abnormal activation-based retrieval volume, dendritic arborization in cortical
of information from semantic memory and neurons, and neurotransmitter receptor dis-
impaired executive function involving re- tributions). In light of this complexity, we
duced inhibition of irrelevant, noncontex- attempt to apply a relatively new analytical
tual information. Additionally, recognizing framework that has become the dominant
this possibility, Kerns and Berenbaum (2002) paradigm in psychopathology research –
call for more direct testing of hypotheses in- that is, the endophenotype approach
volving a primarily inhibitory deficit funda- (Gottesman & Gould, 2003 ) – to theoretical
mental to formal thought disorder. accounts of thought disorder. The basic
premise of the endophenotype approach
is that a given clinical syndrome such as
Ex cogito, Dementia
schizophrenia is composed of multiple neu-
As the foregoing discussion illustrates, cur- rocognitive trait deficits, each of which may
rent cognitive models of thought disorder be determined by at least partially indepen-
have many merits, not the least of which is dent mechanisms. A major consequence of
cognitive and neuroscience aspects of thought disorder 51 1
this model is that a certain trait deficit may complications, particularly those associated
be necessary but not sufficient for the pheno- with fetal hypoxia or oxygen deprivation,
typic manifestation of a syndrome; thus, the are robustly associated with an increased
trait deficit will be shared by individuals with risk for schizophrenia. Complications asso-
a vulnerability to the syndrome regardless of ciated with fetal hypoxia are also of inter-
whether they manifest the syndrome phe- est because fetal oxygen deprivation repre-
notypically. Other deficits may be specific sents a plausible mechanism for explaining
to individuals who manifest the syndrome much of the structural pathology of the brain
phenotypically; these latter deficits may detected in neuroimaging studies of adult
thus potentiate the expression of a symptom schizophrenia patients (Cannon, 1 997).
in those who carry vulnerability (i.e., those Applying the conclusion that such ge-
who have deficits in other neurocognitive netic and environmental influences com-
domains that are necessary but not sufficient bine (additively or interactively) to deter-
for overt disease expression). To develop mine an individual’s risk for expressing a
this framework further in the context of a psychotic disorder to the study of neu-
discussion of thought disorder, it will first rocognitive traits helps demonstrate which
be useful to explicate a number of facts such traits are likely necessary, but not suf-
about the genetic epidemiology and clinical ficient, for the expression of a psychosis
neuroscience of schizophrenia. phenotype (or to the expression of any
phenotype, including specific symptoms, for
example). Specifically, deficits related en-
The Genetic Epidemiology
tirely to the genetic diathesis for develop-
of Schizophrenia
ing the given phenotype may be necessary
Although we are aware of only one study re- but clearly are not sufficient for the mani-
porting on the heritability of formal thought festation of that phenotype. This endophe-
disorder itself (Gambini et al., 1 997), a great notype should be present in any individual
deal of evidence is available demonstrating carrying the genetic vulnerability. Conse-
that genetic factors contribute substantially quently, if one member of a set of monozy-
to the development of schizophrenia, ac- gotic twins (who, by definition, have identi-
counting for about 80% of the risk of de- cal genomes) displays a vulnerability-specific
veloping the disorder. The transmission pat- trait, the other must as well. Additionally,
tern, however, is complex, involving at least any trait not shared by both monozygotic
several different genes as well as environ- twins must result to some degree from the
mental factors (Cannon et al., 1 998; Tsuang, influence of unique environmental events.
Stone, & Faraone, 1 999; Tsuang & Faraone,
1 999). One consequence of the complex-
Neural System Abnormalities
ity of the inheritance pattern in schizophre-
in Schizophrenia
nia is that an individual may carry some de-
gree of genetic predisposition to the illness Although neither the specific neurobiolog-
without expressing it phenotypically – or at ical processes associated with the expres-
least without expressing it to a degree severe sion of formal thought disorder nor those
enough to meet diagnostic criteria. Stated associated with psychosis in general have
differently, only a subset of genetically vul- been definitively isolated, disturbances in
nerable individuals actually develops a psy- prefrontal and temporo-limbic systems and
chotic disorder. For many with such a genetic their interconnections are likely to play criti-
predisposition, an environmental contribu- cal roles in both (Cohen & Servan-Schreiber,
tion (to which genetically predisposed indi- 1 992; Grace & Moore, 1 998; Gray et al.,
viduals might be differentially sensitive) to 1 991 ). The prefrontal cortex is thought
development of a psychotic disorder is also to support higher-order cognitive processes
required. Among the environmental factors such as working memory, the strategic allo-
that may be involved, prenatal and perinatal cation of attention, reasoning, planning, and
other forms of abstract thought (Goldman- of genetic loading for schizophrenia (Can-
Rakic, 1 995 ; Kane & Engle, 2002; Miller & non et al., 2000; Cannon et al., 2002; Glahn
Cohen, 2001 ). The medial temporal lobe et al., 2002). Interestingly, global and dor-
structures (i.e., hippocampus, amygdala) solateral prefrontal volumetric deficits have
and adjacent temporal cortex are involved in been found to correlate with performance
learning and recall of episodic information, deficits on tests sensitive to diverse working-
emotion (especially the amygdala), and cer- memory processes (Maher et al., 1 995 ;
tain aspects of language processing (Squire Seidman et al., 1 994). The nature of the
& Zola, 1 996). pathological mechanism underlying these
Neuropsychological studies have shown correlations is not necessarily obvious, how-
that, against a background of generalized ever. Rather than a loss of neurons or in-
information processing impairment, schi- terneurons, it has been suggested that gross
zophrenia patients manifest profound gray matter volume decrements reflect a
deficits in the areas of long-term and work- reduction of interneuronal neuropil – the
ing memory (Cannon et al., 2000; Saykin space between neural cells consisting largely
et al., 1 994). These deficits appear not to of neurons, dendrites, and axons – in the pre-
be merely secondary effects of impaired frontal region in patients with schizophre-
attention, disease chronicity, or medica- nia and result in impaired working-memory
tion exposure (Cirillo & Seidman, 2003 ). functioning through hypoactive dopamin-
Such findings have been corroborated by ergic modulation of pyramidal cell activity
evidence of abnormal physiologic activity (Goldman-Rakic & Selemon, 1 997). Rather
(i.e., altered blood flow) in prefrontal and than subcortical dopaminergic dysregula-
temporal lobe regions in patients with tion, in this case, dopamine would be acting
schizophrenia during performance of tests within the cortex (although affecting a dis-
assessing these same domains of functioning tinct set of receptors). This prediction has
(Berman et al., 1 992; Callicott et al., 1 998; been supported by a position emission to-
Heckers et al., 1 998; Yurgelun-Todd et al., mography investigation that found signifi-
1 996). At the structural anatomical level, cantly decreased dopamine receptor binding
schizophrenia patients show a variety of in the prefrontal cortex of schizophrenia
volumetric changes throughout the brain, patients (Okubo et al., 1 997). Notably,
including reduced cortical, hippocampal, dopamine receptor reduction predicted cer-
and thalamic volumes (Pfefferbaum & tain types of symptoms, as well as working-
Marsh, 1 995 ). Recent neuroimaging work memory impairment (but also see Abi-
indicates a relatively greater degree of Dargham et al., 2002). It is also of interest
reduction in frontal and temporal cortical in this context that treatment with med-
volumes compared with posterior cortical ication modulating cortical dopamine lev-
volumes (Cannon et al., 1 998). els is associated with normalization of
blood flow in the prefrontal cortex and in-
creased behavioral accuracy during perfor-
Prefrontal Cortex and Working-Memory
mance of a working-memory test (Honey &
Deficits
Andrew, 1 999).
Several lines of evidence suggest that Given that abnormalities of working
working-memory deficits and associated ab- memory and prefrontal structure and func-
normalities in prefrontal cortical structure tion are associated with genetic liability to
and function are reflective of an inherited schizophrenia, it should be possible to iden-
diathesis to schizophrenia. In a Finnish twin tify specific genes that underlie these dis-
sample, we found that impaired perfor- turbances, especially in light of accumulat-
mance on tests of spatial working-memory ing evidence of physiological abnormality.
capacity and structural abnormalities in po- Weinberger and colleagues have reported ev-
lar and dorsolateral prefrontal regions var- idence of one such genetic influence – the
ied in a dose-dependent fashion with degree MET/VAL polymorphism of the COMT
gene (located on chromosome 22), with patients and some of their first-degree rela-
VAL alleles promoting more rapid break- tives to share a certain degree of compromise
down of synaptic dopamine, leading to in prefrontal cortical systems, nongenetic,
prefrontal hypofunction in patients with disease-specific influences cause the dorso-
schizophrenia (Egan et al., 2001 ). We have lateral prefrontal cortex to be further deviant
been interested in another potential sus- in the patients.
ceptibility locus that may affect prefrontal
function in schizophrenia – this one on
chromosome 1 . temporal lobe and episodic memory deficits
Inspired by independent reports of a lo- Several microscopic abnormalities of the
cus of susceptibility within a specific region hippocampus have been documented in
on chromosome 1 (Ekelund et al., 2000; Mil- schizophrenia, including alterations in neu-
lar et al., 2000; St. Clair et al., 1 990), we ronal density (Falkai & Bogerts, 1 986; Hut-
performed linkage and association analyses tenlocher, 1 979; Jeste & Lohr, 1 989; Zaidel,
across the chromosome 1 region of interest Esiri, & Harrison, 1 997), size (Arnold, 2000;
using quantitative neuropsychological mea- Benes, Sorensen, & Bird, 1 991 ), and orienta-
sures of liability in our sample of twins dis- tion (Conrad et al., 1 991 ; Conrad & Scheibel,
cordant for schizophrenia (Gasperoni et al., 1 987; Kovelman & Scheibel, 1 984). These
2003 ). Analyses revealed that the Visual hippocampal volume decrements appear to
Span subtest of the Wechsler Memory Scale, be present at disease onset (Bilder et al.,
an indicator of spatial working-memory 1 995 ; Velakoulis et al., 1 999) and also
function, was significantly and uniquely sen- appear to be present to some degree in
sitive to allelic varation of a gene within a healthy biological relatives of schizophrenia
highly specific portion of the chromosome – patients, suggesting hippocampal volume is
very likely to be the DISC1 gene. The DNA related to the genetic diathesis for develop-
sequence of the DISC1 gene is most ho- ing schizophrenia (Lawrie et al., 1 999; Sei-
mologous to proteins involved in axon guid- dman et al., 1 999; Seidman et al., 2002).
ance, synaptogenesis, and intracellular ax- Postmortem and magnetic resonance imag-
onal and dendritic transport. Recently, the ing (MRI) studies of schizophrenia patients,
protein was shown to promote neurite out- however, have reported positive correlations
growth (Ozeki et al., 2003 ). This function between hippocampal volume and age at
may help explain the reductions in neuropil onset (Bogerts et al., 1 990; Dauphinais et
volume observed in postmortem studies of al., 1 990; Stefanis et al., 1 999; Van Erp
schizophrenia patients. et al., 2002), suggesting a relationship be-
Together, these findings strongly impli- tween hippocampal volume and the dis-
cate genetic factors as playing a role in ease process, which complicates any simple
the abnormalities of prefrontal cortex and interpretation.
working memory in schizophrenia. Because From a neurocognitive perspective, im-
deficits on tests sensitive to working mem- paired declarative memory processes that
ory have also been observed in children at depend on the integrity of the hippocam-
elevated genetic risk (Cosway et al., 2000), pus (Faraone et al., 2000) have been re-
it is tempting to conclude that disturbances ported in both high-risk adolescents (Byrne
in the prefrontal cortex in schizophrenia et al., 1 999) and nonpsychotic relatives
are reflective of an inherited vulnerability of schizophrenia patients (Cannon et al.,
to the disorder that is present from early in 1 994), suggesting they derive, in part, from
life. Nevertheless, patients with schizophre- an inherited genotype. However, because
nia have been found to show even greater long-term memory deficits are specifically
disturbances in dorsolateral prefrontal cor- more pronounced in patients compared with
tex function and structure than their nonill their own healthy monozygotic twins, non-
monozygotic twins (Cannon et al., 2002). genetic, disease-specific factors must also be
Thus, although genetic factors may cause involved (Cannon et al., 2000). Importantly,
two studies have shown a significant rela- teraction of these genes with experience of
tionship between deficits in verbal declara- fetal hypoxia. Together, these findings indi-
tive memory and smaller hippocampal vol- cate that, whereas hippocampal volume in
umes in relatives of schizophrenia patients healthy subjects is under substantial genetic
(O’Driscoll et al., 2001 ; Seidman et al., control, hippocampal volume in schizophre-
2002). Furthermore, initial evidence indi- nia patients and their relatives appears to be
cates that impairment in long-term verbal influenced to a greater extent by unique and
memory and, to a lesser extent, executive shared environmental factors (Van Erp et al.,
function is associated with the occurrence in press).
of psychotic symptoms in subjects thought
to be at significantly elevated risk for even-
Integrating Cognitive Models
tually developing a diagnosable psychotic
and Endophenotypes
disorder such as schizophrenia, suggesting
that these deficits may mark the pathophys- It appears possible to unify components of
iological processes underlying functional the two cognitive models of disrupted infor-
deterioration during the earliest phase of dis- mation processing in schizophrenia patients
ease onset (Cosway et al., 2000). and the findings related specifically to formal
Given the putative importance of the thought disorder reviewed in the first part of
hippocampus to verbal and executive func- this chapter with the research on neurocog-
tion and, therefore, its possible role in pro- nitive endophenotypes in schizophrenia just
ducing disordered speech, it is of interest summarized. At the cognitive level of anal-
to revisit the issue of genetic versus envi- ysis, two mechanisms appear to be neces-
ronmental contributions to hippocampal in- sary for the expression of formal thought
tegrity. Compared with other parts of the disorder: an executive, online processing
brain, the hippocampus is acutely vulnera- system responsible for encoding, maintain-
ble to hypoxic–ischemic damage (Vargha- ing, and updating of goal-related informa-
Khadem et al., 1 997; Zola & Squire, 2001 ) – tion (context information in Cohen’s and
that is, insult temporarily depriving neural Braver’s model) and an integrated system in-
cells of oxygen. In monozygotic twins discor- volving the retrieval of information from se-
dant for schizophrenia, relatively reduced mantic memory and its fluid integration into
hippocampal volume in the ill twin was verbal behavior (i.e., the key component of
significantly related to the presence of la- Hemsley’s and Gray’s model).
bor or delivery complications and to pro- In terms of the endophenotype frame-
longed labor, which are both risk factors work described previously, individuals at el-
associated with fetal oxygen deprivation evated genetic risk but not expressing the
(McNeil et al., 2000). We have previously schizophrenia phenotype show mildly im-
found, in a Helsinki birth cohort, that paired functioning of executive systems and
schizophrenia patients who experienced related working memory and attention com-
fetal hypoxia have smaller hippocampal ponents. These executive processing deficits
volumes than in those who did not – a therefore appear to be associated with the
difference not noted within unaffected sib- diathesis necessary, but not sufficient, for the
lings and healthy comparison subjects (Van development of thought disorder. Beyond
Erp et al., 2002). At the same time, hip- this diathesis, the abnormal interaction of
pocampal volume differences occurred in executive and semantic memory systems –
a stepwise fashion with increase in genetic likely in service of controlled retrieval of
vulnerability for developing schizophrenia information and its integration into ongo-
(consistent with the findings of Seidman ing speech – is associated with a psychosis-
et al., 2002), suggesting that, in patients specific factor itself related to both genetic
with schizophrenia spectrum disorders, hip- vulnerability and exposure to environmen-
pocampal volume is influenced in part by tal risk factors. Individuals with schizophre-
schizophrenia susceptibility genes and an in- nia and their unaffected twins show a
qualitatively similar pattern of prefrontal In summary, research related to the cog-

structural and functional abnormality – nitive, genetic, and neural pathologies of
somewhat greater in severity in the patients. thought disorder in general, and schizophre-
Patients and their relatives additionally show nia specifically, has necessarily taken on a
temporal lobe abnormalities; however, the complex, interactive structure. As we have
degree of difference in temporal lobe abnor- seen, cognitive models designed to predict
mality between schizophrenia patients and particular behavioral outcomes can, in fact,
genetically vulnerable individuals is signifi- help researchers to understand the func-
cantly larger than the corresponding differ- tional correlates of anatomical abnormalities
ence in prefrontal abnormality. measured between genetically defined risk
Taken together, these results suggest that groups. Similar permutations involving these
mild impairment in prefrontal cortex and as- and numerous other levels of analysis equip
sociated impairment of the functioning of us with heuristics that guide our struggle to
online cognitive processing systems (i.e., ex- unravel the complexities of neuropsychiatric
ecutive functions, including working mem- phenomena such as formal thought disorder.
ory and selective attention) constitute a We have attempted to present such a heuris-
necessary but not sufficient (i.e., contribut- tic framework based on links we have ob-
ing) cause of thought disorder, which itself served between bodies of research into the
derives from a genetic diathesis to develop- pathology of thought disorder; some of these
ing a psychotic disorder such as schizophre- links cross between levels of analysis, ideally
nia. An additional factor related etiologi- helping us to map genetic, neurological, and
cally to exposure to an environmental insult cognitive systems onto each other.
interacting with genetic predisposition and
also necessary but not sufficient for the ex-
pression of schizophrenia involves disrupted
interaction between an executive, online Future Directions
processing system and a semantic memory
storage and selection system loosely map- Along the way to accomplishing this inte-
ping onto schizophrenia patients’ prefrontal grative goal, a great deal more work needs
and temporal lobe abnormalities, respec- to be done. Ideally, the parsing of formal
tively. That is, abnormalities in both the thought disorder into necessary and suffi-
prefrontal or executive-related circuitry and cient functional components – such as the
in the temporal lobe circuitry (i.e., me- work being carried out on the level of cogni-
dial temporal lobe for episodic memory tive specification by Barch, Berenbaum, and
and nearby middle temporal gyrus for se- colleagues – will be complemented by fur-
mantic memory; Kircher et al., 2001 ) may ther study of the physiological and genetic
be required to account for the full range variations associated with the production of
of thought disorder observed in patients abnormal speech.
with schizophrenia, whereas only the former This line of work will likely be facili-
may be required to account for the subtler tated by cognitive neuroscience’s growing
thought disturbances seen in genetically vul- ability to study the activity of particular
nerable individuals who do not manifest the brain mechanisms during the production of
full schizophrenia syndrome phenotypically speech, overcoming previously prohibitive
(perhaps the case with the interviewee in the practical obstacles caused by movement arti-
second quote at the beginning of this chap- facts detrimental to work utilizing functional
ter). Of course, it is also possible that severity MRI (Barch et al., 1 999) and EEG (e.g.,
of phenotypic thought disorder scales with Ford et al., 2002). Prior to these method-
severity of compromise of both components ological advances, only speech production
of the system rather than to their conjunc- studies employing covert vocalization were
tion per se. Further work is needed to segre- practical; however, these investigations typ-
gate these two possibilities. ically fall short of describing compellingly
aspects of formal thought disorder itself – or frequency below diagnostic threshold),

a phenomenon measured entirely in terms we found a significant level of formal
of overt speech production. thought disorder – interestingly, coupled
Additional progress in the study of with significant impairment of selective at-
thought disorder involves application of tention – which remitted with relatively
paradigms from the emerging field of so- low-dose antipsychotic pharmacotherapy
cial cognitive neuroscience (e.g., Adolphs, (Cannon et al., 2002). These and similar
2003 ; Wood, 2003 ) to the study of inter- findings raise interesting questions regarding
personal deficits in schizophrenia (e.g., Penn the potential utility of formal thought dis-
et al., 2002; Pinkham et al., 2003 ), includ- order as a prodromal indicator of psychosis
ing the distinctly interpersonal task of verbal as well as the potential benefits of symptom-
communication (Grossman & Harrow, 1 996; based treatment outside the context of a ma-
Racenstein et al., 1 999). For instance, the jor psychiatric diagnosis.
study of communication deviance (including
aspects of formal thought disorder) within
the families of patients with psychotic dis- Acknowledgments
order diagnoses or patients thought to be at
high risk for developing a psychotic disor- Preparation of this manuscript was sup-
der has been an area of active research for ported by grants MH5 285 7, MH65 079,
some time (e.g., Docherty, 1 995 ; Sass et al., and MH66286 from the National Institute
1 984; Wahlberg et al., 2000). Applying this of Mental Health; by grant RR00827 to
established framework to the examination the FIRST Biomedical Informatics Research
of neuronal correlates of receptive and pro- Network (http://www.nbirn.net), which is
ductive aspects of intrafamily communica- funded by the National Center for Re-
tion – potentially distinct from communi- search Resources at the National Institutes
cation with nonfamily individuals because of Health; and by a gift to the University
of the role of factors such as increased in- of California, Los Angeles Foundation, from
terpersonal familiarity and less predictable Garen and Shari Staglin.
affective modulation of cognitive processes
involved in communication – offers a novel
perspective with the potential to reinvigo- Notes
rate this important line of thought disorder
research. 1 . Delusions and similar disorders of thought con-
Another area of thought disorder research tent are not the central focus of this chapter
deserving continued attention involves the but might be of interest to cognitive psychol-
study of formal thought disorder in popu- ogists. For instance, the study of development
lations other than those currently meeting and maintenance of delusions is an area of ac-
diagnostic criteria for a major mental ill- tive research. See Bermudez (2001 ), Garety
ness. Although modern antipsychotic med- and Freeman (1 999), Gold and Hohwy (2000),
ications appear to be relatively effective at and/or Maher (2002) for debate over whether
or not delusions represent products of flawed
helping psychotic patients organize their
inferential reasoning.
speech (e.g., Wirshing et al., 1 999), signif-
2. Bleuler is likely also the source of the distinc-
icant levels of thought disorder often appear
tion between thought form and content dis-
noticeable in groups of patients who would
cussed earlier because he drew the distinction
not typically be treated with therapeutic (Bleluer, 1 91 1 /1 95 0) between what he labeled
doses of such medications (Andreasen & “fundamental symptoms,” including, but not
Grove, 1 986). For instance, in a sample of limited to, the loosening of ideational associa-
patients judged to be at significantly elevated tions, and “accessory symptoms,” including hal-
risk for developing a psychotic disorder (in lucinations and delusions.
part because they were displaying some psy- 3 . Working in parallel with Cohen, Braver, and
chotic symptoms but at a level of intensity colleagues, Kane and Engle (2002) and their
brain and cognition in thought disorder 51 7
collaborators have proposed a model of in- their reliability. Archives of General Psychiatry,
formation maintenance and behavioral re- 3 6(1 2 ), 1 3 1 5 –1 3 21 .
sponse selection entirely compatible with Co- Andreasen, N. C. (1 982). Should the term
hen, Braver, and colleagues’ model. Rather “thought disorder” be revised? Comprehensive
than referring to a context-processing mod- Psychiatry, 2 3 (4), 291 –299.
ule, however, Kane and Engle (2002) deem the
Andreasen, N. C. (1 986). Scale for the assess-
same cognitive mechanism “controlled atten-
ment of thought, language, and communica-
tion” and survey implications of applying the
tion (TLC). Schizophrenia Bulletin, 1 2 (3 ), 473 –
model to patients with frontal lobe lesions, al-
482.
though this mechanism is certainly relevant to
psychosis. Andreasen, N. C., & Grove, W. M. (1 986).
Thought, language, and communication
4. In fact, Bleuler (1 91 1 /1 95 1 , as discussed in
in schizophrenia: Diagnosis and prognosis.
Chapman & Chapman, 1 973 ), writing nearly
Schizophrenia Bulletin, 1 2 (3 ), 3 48–3 5 9
one century ago, argued that formal thought
disorder in schizophrenia patients involves a Andreasen, N. C., Nopoulos, P., O’Leary, D. S.,
failure to utilize context information to bind Miller, D. D., Wassink, T., & Flaum, M. (1 999).
ideational elements together in logical se- Defining the phenotype of schizophrenia: Cog-
quence. However he blamed this disorder on nitive dysmetria and its neural mechanisms. Bi-
the breaking of the “associative thread” link- ological Psychiatry, 46(7), 908–920.
ing a given goal to the appropriate contextual Arnold, S. E. (2000). Cellular and molecular neu-
influence, rather than considering the goal con- ropathology of the parahippocampal region in
text itself. schizophrenia. Annals of the New York Academy
5 . See Andreasen et al. (1 999) for the descrip- of Science, 91 1 , 275 –292.
tion of an alternate, circuit-based comparator Baddeley, A. D. (1 986). Working Memory. New
mechanism. York: Oxford University Press.
Badre, D., & Wagner, A. D. (2004). Selection,
integration, and conflict monitoring: Assessing
the nature and generality of prefrontal cogni-
tive control mechanisms. Neuron, 41 (3 ), 473 –
References 487.
Barch, D., & Berenbaum, H. (1 994). The rela-
Abi-Dargham, A., Mawlawi, O., Lombardo, I., tionship between information processing and
Gil, R., Martinez, D., Huang, Y., et al. (2002). language production. Journal of Abnormal Psy-
Prefrontal dopamine D1 receptors and work- chology, 1 03 (2 ), 241 –25 0.
ing memory in schizophrenia. Journal of Neu- Barch, D. M., & Berenbaum, H. (1 996). Lan-
roscience, 2 2 (9), 3 708–3 71 9. guage production and thought disorder in
Adolphs, R. (2003 ). Cognitive neuroscience of schizophrenia. Journal of Abnormal Psychology,
human social behavior. Nature Reviews Neuro- 1 05 (1 ), 81 –88.
science, 4(3 ), 1 65 –1 78. Barch, D. M., Braver, T. S., Nystrom, L. E., For-
Allen, H. A., Liddle, P. F., & Frith, C. D. (1 993 ). man, S. D., Noll, D. C., & Cohen, J. D. (1 997).
Negative features, retrieval processes and ver- Dissociating working memory from task dif-
bal fluency in schizophrenia. British Journal of ficulty in human prefrontal cortex. Neuropsy-
Psychiatry, 1 63 , 769–775 . chologia, 3 5 (1 0), 1 3 73 –1 3 80.
American Psychiatric Association. (2000). Diag- Barch, D. M., Carter, C. S., MacDonald, A. W.,
nostic and Statistical Manual of Mental Disor- Braver, T. S., & Cohen, J. D. (2003 ). Context-
ders, Fourth Edition, Text Revision. Washington, processing deficits in schizophrenia: Diagnostic
DC: American Psychiatric Association. specificity, 4-week course, and relationships to
Anderson, M. C., & Spellman, B. A. (1 995 ). clinical symptoms. Journal of Abnormal Psychol-
On the status of inhibitory mechanisms in ogy, 1 1 2 (1 ), 1 3 2–1 43 .
cognition: Memory retrieval as a model case. Barch, D. M., Sabb, F. W., Carter, C. S., Braver,
Psychology Review, 1 02 (1 ), 68–1 00. T. S., Noll, D. C., & Cohen, J. D. (1 999).
Andreasen, N. C. (1 979). Thought, language, Overt verbal responding during fMRI scanning:
and communication disorders. I. Clinical as- Empirical investigations of problems and po-
sessment, definition of terms, and evaluation of tential solutions. Neuroimage, 1 0(6), 642–65 7.
Baving, L., Wagner, M., Cohen, R., & Rockstroch, Botvinick, M. M., Braver, T. S., Barch, D. M.,
B. (2001 ). Increased semantic and repetition Carter, C. S., & Cohen, J. D. (2001 ). Conflict
priming in schizophrenic patients. Journal of monitoring and cognitive control. Psychological
Abnormal Psychology, 1 1 0(1 ), 67–75 . Review, 1 08(3 ), 624–65 2.
Berenbaum, H., & Barch D. (1 995 ). The cate- Braver, T. S., Barch, D. M., & Cohen, J. D. (1 999).
gorization of thought disorder. Journal of Psy- Cognition and control in schizophrenia: A
cholinguistic Research, 2 4(5 ), 3 49–3 76. computational model of dopamine and pre-
Benes, F. M., Sorensen, I., & Bird, E. D. (1 991 ). frontal function. Biological Psychiatry, 46(3 ),
Reduced neuronal size in posterior hippocam- 3 1 2–3 28.
pus of schizophrenic patients. Schizophrenia Braver, T. S., Barch, D. M., Keys, B. A., Carter,
Bulletin, 1 7(4), 5 97–608. C. S., Cohen, J. D., Kaye, J. A., et al. (2001 ).
Benjamin, T. B., & Watt, N. F. (1 969). Psy- Context processing in older adults: Evidence
chopathology and semantic interpretation of for a theory relating cognitive control to neu-
ambiguous words. Journal of Abnormal Psychol- robiology in healthy aging. Journal of Ex-
ogy, 706–71 4. perimental Psychology: General, 1 3 0(4), 746–
763 .
Berman, K. F., Torrey, E. F., Daniel, D. G., &
Braver, T. S., Cohen, J. D., Nystrom, L. E.,
Weinberger, D. R. (1 992). Regional cerebral
Jonides, J., Smith, E. E., & Noll, D. C. (1 997). A
blood flow in monozygotic twins discordant
parametric study of prefrontal cortex involve-
and concordant for schizophrenia. Archives of
ment in human working memory. Neuroimage,
General Psychiatry, 49(1 2 ), 927–93 4.
5 (1 ), 49–62.
Bermudez, J. L. (2001 ). Normativity and ratio-
Braver, T. S., Reynolds, J. R., & Donaldson, D. I.
nality in delusional psychiatric disorders. Mind
(2003 ). Neural mechanisms of transient and
and Language, 1 6(5 ), 45 7–493 .
sustained cognitive control during task switch-
Besche, C., Passerieux, C., Segui, J., Sarfati, ing. Neuron, 3 9(4), 71 3 –726.
Y., Laurent, J. P., & Hardy-Bayle, M. C.
Byrne, M., Hodges, A., Grant, E., Owens, D. C.,
(1 997). Syntactic and semantic processing in
& Johnstone, E. C. (1 999). Neuropsycholog-
schizophrenic patients evaluated by lexical-
ical assessment of young people at high ge-
decision tasks. Neuropsychology, 1 1 (4), 498–
netic risk for developing schizophrenia com-
5 05 .
pared with controls: Preliminary findings of the
Bilder, R. M., Bogerts, B., Ashtari, M., Wu, Edinburgh High Risk Study (EHRS). Psycholog-
H., Alvir, J. M., Jody, D., et al. (1 995 ). An- ical Medicine, 2 9(5 ), 1 1 61 –1 1 73 .
terior hippocampal volume reductions pre- Callicott, J. H., Ramsey, N. F., Tallent, K.,
dict frontal lobe dysfunction in first episode Bertolino, A., Knable, M. B., Coppola, R., et al.
schizophrenia. Schizophrenia Research, 1 7(1 ), (1 998). Functional magnetic resonance imag-
47–5 8. ing brain mapping in psychiatry: Methodologi-
Bleuler, E. (1 91 1 ). Dementia Praecox, or the cal issues illustrated in a study of working mem-
Group of Schizophrenias. Translated by J. Zinkin ory in schizophrenia. Neuropsychopharmacol-
(1 95 0). New York: International Universities ogy, 1 8(3 ), 1 86–1 96.
Press. Callicott, J. H., Mattay, V. S., Bertolino, A., Finn,
Bleuler, E. (1 91 1 /1 95 1 ). Autistic-undisciplined K., Coppola, R., Frank, J. A., et al. (1 999).
Thinking. Reprinted in D. Rapaport (Ed.), Or- Physiological characteristics of capacity con-
ganization and Pathology of Thought. New York: straints in working memory as revealed by
Columbia University Press. functional MRI. Cerebral Cortex, 9(1 ), 20–26.
Blum, N. A., & Freides, D. (1 995 ). Investigat- Cannon, T. D. (1 997). On the nature and mecha-
ing thought disorder in schizophrenia with the nisms of obstetric influences in schizophrenia:
lexical decision task. Schizophrenia Research, A review and synthesis of epidemiologic stud-
1 6(3 ), 21 7–224. ies. International Review of Psychiatry, 9(4),
Bogerts, B., Ashtari, M., Degreef, G., Alvir, J. M., 3 87–3 97.
Bilder, R. M., & Lieberman, J. A. (1 990). Cannon, T. D., Huttunen, M.O, Dahlstróm, M.,
Reduced temporal limbic structure volumes Larmo, I., Rásánen, P., & Juriloo, A. (2002).
on magnetic resonance images in first episode Antipsychotic drug treatment in the prodro-
schizophrenia. Psychiatry Research, 3 5 (1 ), mal phase of schizophrenia. Archives of General
1 –1 3 . Psychiatry, 1 5 9(7), 1 23 0–1 23 2.
Cannon, T. D., Huttunen, M. O., Lonnqvist, J., Clark, H. H. (1 996). Using Language. Cambridge,
Tuulio-Henriksson, A., Pirkola, T., Glahn, D., UK: Cambridge University Press.
et al. (2000). The inheritance of neuropsy- Cohen, J. D., Barch, D. M., Carter, C.,
chological dysfunction in twins discordant for & Servan-Schreiber, D. (1 999). Context-
schizophrenia. American Journal of Human Ge- processing deficits in schizophrenia: Converg-
netics, 67(2 ), 3 69–3 82. ing evidence from three theoretically mo-
Cannon, T. D., Kaprio, J., Lonnqvist, J., Hut- tivated cognitive tasks. Journal of Abnormal
tunen, M., & Koskenvuo, M. (1 998). The Psychology, 1 08(1 ), 1 20–1 3 3 .
genetic epidemiology of schizophrenia in a Cohen, J. D., Perlstein, W. M., Braver, T. S., Nys-
Finnish twin cohort. A population-based mod- trom, L. E., Noll, D. C., Jonides, J., et al. (1 997).
eling study. Archives of General Psychiatry, Temporal dynamics of brain activation during
5 5 (1 ), 67–74. a working memory task. Nature, 3 86(662 5 ),
Cannon, T. D., & Rosso, I. M. (2002). Lev- 604–608.
els of analysis in etiological research on Cohen, J. D., & Servan-Schreiber, D. (1 992).
schizophrenia. Developmental Psychopathology, Context, cortex, and dopamine: A connec-
1 4(3 ), 65 3 –666. tionist approach to behavior and biology in
Cannon, T. D., Rosso, I. M., Hollister, J. M., schizophrenia. Psychological Review, 99(1 ), 45 –
Bearden, C. E., Sanchez, L. E., & Hadley, 77.
T. (2000). A prospective cohort study of ge-
Conrad, A. J., Abebe, T., Austin, R., Forsythe, S.,
netic and perinatal influences in the etiology
& Scheibel, A. B. (1 991 ). Hippocampal pyra-
of schizophrenia. Schizophrenia Bulletin, 2 6(2 ),
midal cell disarray in schizophrenia as a bilat-
3 5 1 –3 66.
eral phenomenon. Archives of General Psychia-
Cannon, T. D., Thompson, P. M., van Erp, try, 48(5 ), 41 3 –41 7.
T. G., Toga, A. W., Poutanen, V. P., Hut-
Conrad, A. J., & Scheibel, A. B. (1 987).
tunen, M., et al. (2002). Cortex mapping re-
Schizophrenia and the hippocampus: The em-
veals regionally specific patterns of genetic and
bryological hypothesis extended. Schizophrenia
disease-specific gray-matter deficits in twins
Bulletin, 1 3 (4), 5 77–5 87.
discordant for schizophrenia. Proceedings of the
National Academy of Sciences, U.S.A., 99(5 ), Cosway, R., Byrne, M., Clafferty, R., Hodges,
3 228–3 23 3 . A., Grant, E., Abukmeil, S. S., et al. (2000).
Neuropsychological change in young people
Cannon, T. D., van Erp, T. G., Huttunen,
at high risk for schizophrenia: Results from
M., Lonnqvist, J., Salonen, O., Valanne, L.,
the first two neuropsychological assessments of
et al. (1 998). Regional gray matter, white mat-
the Edinburgh High Risk Study. Psychological
ter, and cerebrospinal fluid distributions in
Medicine, 3 0(5 ), 1 1 1 1 –1 1 21 .
schizophrenic patients, their siblings, and con-
trols. Archives of General Psychiatry, 5 5 (1 2 ), Cozolino, L. J. (1 983 ). The oral and written
1 084–1 091 . productions of schizophrenic patients. Progress
Cannon, T. D., Zorrilla, L. E., Shtasel, D., Gur, in Experimental Personality Research, 1 2 , 1 01 –
R. E., Gur, R. C., Marco, E. J., et al. (1 994). 1 5 2.
Neuropsychological functioning in siblings dis- Critchley, M. (1 964). The neurology of psychotic
cordant for schizophrenia and healthy volun- speech. British Journal of Psychiatry, 1 1 0, 3 5 3 –
teers. Archives of General Psychiatry, 5 1 (8), 3 64.
65 1 –661 . Dauphinais, I. D., DeLisi, L. E., Crow, T. J.,
Chaika, E. (1 982). Thought disorder or speech Alexandropoulos, K., Colter, N., Tuma, I.,
disorder in schizophrenia? Schizophrenia Bul- et al. (1 990). Reduction in temporal lobe size
letin, 8(4), 5 87–5 94. in siblings with schizophrenia: A magnetic
Chapman, L. J., & Chapman, J. P. (1 973 ). Dis- resonance imaging study. Psychiatry Research,
ordered Thought in Schizophrenia. New York: 3 5 (2 ), 1 3 7–1 47.
Appleton-Century-Crofts. David, A. S., & Cutting, J. C. (1 994). The Neu-
Cirillo, M. A., & Seidman, L. J. (2003 ). ropsychology of Schizophrenia. Hove, UK: Psy-
Verbal declarative memory dysfunction in chology Press.
schizophrenia: From clinical assessment to ge- DeLisi, L. E. (2001 ). Speech disorder in
netics and brain mechanisms. Neuropsychology schizophrenia: Review of the literature and ex-
Review, 1 3 (2 ), pp. 43 –77. ploration of its relation to the uniquely human
capacity for language. Schizophrenia Bulletin, Archives of Psychiatry and Neurological Science,
2 7(3 ), 481 –496. 2 3 6(3 ), 1 5 4–1 61 .
Dell, G. S., & O’Seaghdha, P. G. (1 991 ). Me- Fanselow, M. S. (2000). Contextual fear, Gestalt
diated and convergent lexical priming in lan- memories, and the hippocampus. Behavioral
guage production: A comment on Levelt et al. Brain Research, 1 1 0(1 –2 ), 73 –81 .
(1 991 ). Psychological Review, 98(4), 604–61 4. Faraone, S. V., Seidman, L. J., Kremen, W. S.,
Dell, G. S., Schwartz, M. F., Martin, N., Saffran, Toomey, R., Pepple, J. R., & Tsuang, M. T.
E. M., & Gagnon, D. A. (1 997). Lexical access (2000). Neuropsychologic functioning among
in aphasic and nonaphasic speakers. Psycholog- the nonpsychotic relatives of schizophrenic
ical Review, 1 04(4), 801 –83 8. patients: The effect of genetic loading. Biologi-
Dick, F., Bates, E., Wulfeck, B., Utman, J. A., cal Psychiatry, 48(2 ), 1 20–1 26.
Dronkers, N., & Gernsbucher, M. A. (2001 ). Ford, J. M., Mathalon, D. H., Whitfield, S., Faust-
Language deficits, localization, and grammer: man, W. O., & Roth, W. T. (2002). Reduced
Evidence for a distributive model of language communication between frontal and temporal
breakdown in aphasic patrents and neurolog- lobes during talking in schizophrenia. Biological
ically intact individuals. Psychological Review, Psychiatry, 5 1 (6), 485 –492.
1 08(4), 75 9–788. Frith, C. (2002). Attention to action and aware-
Docherty, N. M. (1 995 ). Expressed emotion ness of other minds. Consciousness and Cogni-
and language disturbances in parents of stable tion, 1 1 (4), 481 –487.
schizophrenia patients. Schizophrenia Bulletin, Frith, C. D. (1 987). The positive and nega-
2 1 (3 ), 41 1 –41 8. tive symptoms of schizophrenia reflect impair-
Docherty, N. M., & Gordinier, S. W. (1 999). Im- ments in the perception and initiation of ac-
mediate memory, attention and communication. Psychological Medicine, 1 7(3 ), 63 1 –648.
tion disturbances in schizophrenia patients and Frith, C. D., Friston, K. J., Liddle, P. F., & Frack-
their relatives. Psychological Medicine, 2 9(1 ), owiak, R. S. (1 992). PET imaging and cognition
1 89–1 97. in schizophrenia. Journal of the Royal Society of
Dodd, B., & Crosbie, S. (2002). Language and Medicine, 85 (4), 222–224.
cognition: Evidence from disordered language. Gambini, O., Campana, A., Macciardi, F., &
In Goswami, U. (Ed.), Blackwell Handbook Scarone, S. (1 997). A preliminary report of a
of Childhood Cognitive Development. Malden, strong genetic component for thought disorder
MA: Blackwell. in normals: A twin study. Neuropsychobiology,
Dronkers, N. F., Redfern, B. A., & Knight, R. T. 3 6(1 ), 1 3 –1 8.
(1 999). The neural architecture of language Garety, P. A., & Freeman, D. (1 999). Cognitive
disorders. In M. S. Gazzaniga, (Ed.), The New approaches to delusions: A critical review of
Cognitive Neurosciences, (2nd ed.). Cambridge, theories and evidence. British Journal of Clinical
MA: MIT Press. Psychology, 3 8(2 ), 1 1 3 –1 5 4.
Duncan, J. (2001 ). An adaptive coding model of Garavan, H., Ross, T. J., Li, S. J., & Stein, E. A.
neural function in prefrontal cortex. Nature Re- (2000). A parametric manipulation of central
views Neuroscience, 2 (1 1 ), 820–829. executive functioning. Cerebral Cortex, 1 0(6),
Egan, M. F., Goldberg, T. E., Kolachana, B. S., 5 85 –5 92.
Callicott, J. H., Mazzanti, C. M., et al. (2001 ). Gasperoni, T. L., Ekelund, J., Huttunen, M.,
Effect of COMT Val1 08/1 5 8 Met genotype on Palmer, C. G., Tuulio-Henriksson, A., Lon-
frontal lobe function and risk for schizophre- nqvist, J., Kaprio, J., Peltonen, L., & Cannon,
nia. Proceedings of the National Academy of Sci- T. D. (2003 ). Genetic linkage and association
ences U.S.A., 98(1 2 ), 691 7–6922. between chromosome 1 q and working memory
Ekelund, J., Lichtermann, D., Hovatta, I., El- function in schizophrenia. American Journal of
lonen, P., Suvisaari, J., Terwilliger, J. D., et al. Medical Genetics, 1 1 6B(1 ), 8–1 6.
(2000). Genome-wide scan for schizophrenia Glahn, D. C., Cannon, T. D., Gur, R. E., Ragland,
in the Finnish population: Evidence for a locus J. D., & Gur, R. C. (2000). Working memory
on chromosome 7q22. Human Molecular Ge- constrains abstraction in schizophrenia. Biolog-
netics, 9(7), 1 049–1 05 7. ical Psychiatry, 47(1 ), 3 4–42.
Falkai, P., & Bogerts, B. (1 986). Cell loss in Glahn, D. C., Kim, J., Cohen, M. S., Poutanen,
the hippocampus of schizophrenics. European V. P., Therman, S., Bava, S., et al. (2002).
Maintenance and manipulation in spatial work- Septo-Hippocampal System. New York: Oxford
ing memory: Dissociations in the prefrontal University Press.
cortex. Neuroimage, 1 7(1 ), 201 –21 3 . Gray, J. A. (1 995 ). Dopamine release in the nu-
Goddard, M. J. (2001 ). Context modulation of cleus accumbens: The perspective from aber-
US signal value following explicit and nonex- rations of consciousness in schizophrenia. Neu-
plicit training. Behavioral Processes, 5 6(2 ), 67– ropsychologia, 3 3 (9), 1 1 43 –1 1 5 3 .
74. Gray, J. A. (1 998). Integrating schizophrenia.
Gold, B. T., & Buckner, R. L. (2002). Common Schizophrenia Bulletin, 2 4(2 ), 249–266.
prefrontal regions coactivate with dissociable Gray, J. A., Feldon, J., Rawlins, J. N., Hemsley,
posterior regions during controlled semantic D. R., Young, A. M. J., Warburton, E. C., et al.
and phonological tasks. Neuron, 3 5 (4), 803 – (1 991 ). The neuropsychology of schizophrenia.
81 2. Behavioral and Brain Sciences, 1 4(1 ), 1 –84.
Gold, I., & Hohwy, J. (2000). Rationality and Grossman, L. S., & Harrow, M. (1 996). In-
schizophrenic delusion. Mind and Language, teractive behavior in bipolar manic and
1 5 (1 ), 1 46–1 67. schizophrenic patients and its link to thought
Goldberg, T. E., Aloia, M. S., Gourovitch, M. L., disorder. Comprehensive Psychiatry, 3 7(4),
Missar, D., Pickar, D., & Weinberger, D. R. 245 –25 2.
(1 998). Cognitive substrates of thought disor- Harrow, M., & Quinlan, D. M. (1 985 ). Disor-
der, I: The semantic system. American Journal dered Thinking and Schizophrenic Psychopathol-
of Psychiatry, 1 5 5 (1 2 ), 1 671 –1 676. ogy. New York: Gardner Press.
Goldman-Rakic, P. S. (1 995 ). Architecture of the Harvey, P. D., & Pedley, M. (1 989). Auditory and
prefrontal cortex and the central executive. An- visual distractability in schizophrenia: Clinical
nals of the New York Academy of Science, 769, and medication status correlates. Schizophrenia
71 –83 . Research, 2 (3 ), 295 –3 00.
Goldman-Rakic, P. S., & Selemon, L. D. (1 997). Heckers, S., Rauch, S. L., Goff, D., Savage, C. R.,
Functional and anatomical aspects of prefrontal Schacter, D. L., Fischman, A. J., et al. (1 998).
pathology in schizophrenia. Schizophrenia Bul- Impaired recruitment of the hippocampus dur-
letin, 2 3 (3 ), 43 7–45 8. ing conscious recollection in schizophrenia.
Gottesman, II, & Gould, T. D. (2003 ). The en- Nature Neuroscience, 1 (4), 3 1 8–3 23 .
dophenotype concept in psychiatry: Etymol- Hemsley, D. R. (1 987). An experimental psycho-
ogy and strategic intentions. American Journal logical model for schizophrenia. In W. Janza-
of Psychiatry, 1 60(4), 63 6–645 . vik (Ed.), Search for the Causes of Schizophrenia
Grace, A. A. (1 991 ). Phasic versus tonic dop- (pp. 1 79–1 88). Stuttgart: Springer-Verlag.
amine release and the modulation of dopamine Hemsley, D. R. (1 993 ). A simple (or simplistic?)
system responsivity: A hypothesis for the eti- cognitive model for schizophrenia. Behavioral
ology of schizophrenia. Neuroscience, 41 (1 ), 1 – Research and Therapy, 3 1 (7), 63 3 –645 .
24. Hemsley, D. R. (1 994). A cognitive model for
Grace, A. A., & Moore, H., & O’Donnell, schizophrenia and its possible neural basis.
P. (1 998). The modulation of corticoaccum- Acta Psychiatrica Scandinavica, Suppl. 3 84, 80–
bens transmission by limbic afferents and 86.
dopamine: A model for the pathophysiology of Honey, R. C., & Watt, A. (1 999). Acquired re-
schizophrenia. Advances in Pharmacology, 42 , lational equivalence between contexts and fea-
721 –724. tures. Journal of Experimental Psychology: Ani-
Grace, A. A., & Moore, H. (1 998). Regula- mal Behavior Processes, 2 5 (3 ), 3 24–3 3 3 .
tion of information flow in the nucleus acc- Honigfeld, G. (1 963 ). Effect of an hallucinogenic
umbens: A model for the pathophysiology of agent on verbal behavior. Psychological Reports,
schizophrenia. In M. F. Lenzenweger & R. H. 1 3 (2 ), 3 83 –3 85 .
Dworkin (Eds). Origins and Development of Horn, L., & Ward, G. (2001 ). Pragmatics. In
Schizophrenia: Advances in Experimental Psy- F. C. Keil (Ed.), The MIT Encyclopedia of the
chopathology. Washington, DC: American Psy- Cognitive Sciences (MITECS), (Vol. 1 ).
chological Association. Cambridge, MA: MIT Press.
Gray, J. A. (1 982). The Neuropsychology of Huttenlocher, P. R. (1 979). Synaptic density in
Anxiety: An Enquiry into the Functions of the human frontal cortex - developmental changes
and effects of aging. Brain Research, 1 63 (2 ), Kounios, J., Bachman, P., Casasanto, D., Gross-
1 95 –205 . man, M., Smith, R. W., & Yang, W. (2003 ).
Indefrey, P., & Levelt, W. J. M. (1 999). Chap- Novel concepts mediate word retrieval from
ter 5 9. The neural correlates of language pro- human episodic associative memory: Evidence
duction. In M. S. Gazzaniga (Ed.), The New from event-related potentials. Neuroscience Let-
Cognitive Neurosciences (2nd ed.), (pp. 845 – ters, 3 45 (3 ), 1 5 7–1 60.
866). Cambridge, MA: MIT Press. Kovelman, J. A., & Scheibel, A. B. (1 984). A neu-
Jansma, J. M., Ramsey, N. F., Slagter, H. A., & rohistological correlate of schizophrenia. Bio-
Kahn, R. S. (2001 ). Functional anatomical cor- logical Psychiatry, 1 9(1 2 ), 1 601 –1 621 .
relates of controlled and automatic processing. Kuperberg, G. R., McGuire, P. K., & David,
Journal of Cognitive Neuroscience, 1 3 (6), 73 0– A. S. (1 998). Reduced sensitivity to lin-
743 . guistic context in schizophrenic thought
Jeste, D. V., & Lohr, J. B. (1 989). Hippocampal disorder: Evidence from on-line monitoring for
pathologic findings in schizophrenia. A mor- words in linguistically anomalous sentences.
phometric study. Archives of General Psychia- Journal of Abnormal Psychology, 1 07(3 ), 423 –
try, 46(1 1 ), 1 01 9–1 024. 43 4.
Johnston, M. H., & Holzman, P. S. (1 979). As- Landre, N. A., & Taylor, M. A. (1 995 ). Formal
sessing Schizophrenic Thinking. San Francisco: thought disorder in schizophrenia. Linguistic,
Jossey-Bass. attentional, and intellectual correlates. Journal
Kane, M. J., & Engle, R. W. (2002). The role of Nervous and Mental Disease, 1 83 (1 1 ), 673 –
of prefrontal cortex in working-memory ca- 680.
pacity, executive attention, and general fluid Laruelle, M., Kegeles, L. S., & Abi-Dargham,
intelligence: An individual-differences per- A. (2003 ). Glutamate, dopamine, and schi-
spective. Psychonomic Bulletin and Review, 9(4), zophrenia: From pathophysiology to treat-
63 7–671 . ment. Annals of the New York Academy of Sci-
Kerns, J. G., Berenbaum, H., Barch D. M., Ban- ences, 1 003 , 1 3 8–1 5 8.
ick, M. T., & Stolar, N. (1 999). Word produc- Lawrie, S. M., Whalley, H., Kestelman, J. N.,
tion in schizophrenia and its relationship to Abukmeil, S. S., Byrne, M., Hodges, A., et al.
positive symptoms. Psychiatry Research, 87(1 ), (1 999). Magnetic resonance imaging of brain in
29–3 7. people at high risk of developing schizophre-
Kerns, J. G., & Berenbaum, H. (2002). Cognitive nia. Lancet, 3 5 3 (91 46), 3 0–3 3 .
impairments associated with formal thought Levelt, W. J. M. (1 989). Speaking: From Inten-
disorder in people with schizophrenia. Journal tion to Articulation. Cambridge, MA: The MIT
of Abnormal Psychology, 1 1 1 (2 ), 21 1 –224. Press.
Kircher, T. T., & Leube, D. T. (2003 ). Self- Levelt, W. J. M. (1 999). Language production: A
consciousness, self-agency, and schizophrenia. blueprint of the speaker. In C. Brown & P. Ha-
Consciousness and Cognition, 1 2 (4): 65 6–669. goort, (Eds.), Neurocognition of Language. Ox-
Kircher, T. T., Liddle, P. F., Brammer, M. J., ford, UK: Oxford University Press.
Williams, S. C., Murray, R. M., & McGuire, Levelt, W. J., Roelofs, A., Meyer, A. S. (1 999). A
P. K. (2001 ). Neural correlates of formal theory of lexical access in speech production.
thought disorder in schizophrenia: Preliminary Behavioral and Brain Sciences, 2 2 (1 ), 1 –75 .
findings from a functional magnetic resonance Liddle, P. F. (1 987). The symptoms of chronic
imaging study. Archives of General Psychiatry, schizophrenia. A re-examination of the
5 8(8), 769–774. positive-negative dichotomy. British Journal of
Kolb, B., & Whishaw, I. Q. (1 996) Fundamentals Psychiatry, 1 5 1 , 1 45 –1 5 1 .
of human neuropsychology, (fourth edition). New Lubow, R. E. M. (1 95 9). Latent inhibition:
York: Freeman Press. The effect of nonreinforced pre-exposure to
Kostova, M., Passerieux, C., Laurent, J. P., Hardy- the conditional stimulus. Journal of Compar-
Bayle, M. C. (2003 ). An electrophysiologic ative and Physiological Psychology, 5 2 , 41 5 –
study: Can semantic context processes be mo- 41 9.
bilized in patients with thought-disordered Maher, B. A. (1 966). Principles of Psychopathology:
schizophrenia?. Canadian Journal of Psychia- An Experimental Approach. New York:
try, 48(9), 61 5 –623 . McGraw-Hill.
Maher, B. A. (1 991 ). Language and schizophre- A review and synthesis. Journal of the Interna-
nia. In J. H. Gruzelier (Ed.), Neuropsychology, tional Neuropsychology Society, 8(5 ), 699–720.
Psychophysiology, and Information Processing. Neely, J. H. (1 977). Semantic priming and re-
Handbook of Schizophrenia, Vol. 5 , (pp. 43 7– trieval from lexical memory: Roles of inhi-
464). New York: Elsevier Science. bitionless spreading activation and limited-
Maher, B. A. (2002). Psychopathology and capacity attention. Journal of Experimental
delusions: Reflections on methods and models. Psychology: General, 1 06(3 ), 226–25 4.
In M. F. Lenzenweger (Ed.), Principles of Ex- Nestor, P. G., Kimble, M. O., O’Donnell, B. F.,
perimental Psychopathology: Essays in Honor of Smith, L., Niznikiewicz, M., Shenton, M. E.,
Brendan A. Maher, (1 st ed.). Washington, DC: et al. (1 997). Aberrant semantic activation
American Psychological Association. in schizophrenia: A neurophysiological study.
Maher, B. A., Manschreck, T. C., Woods, B. T., American Journal of Psychiatry, 1 5 4(5 ), 640–
Yurgelun-Todd, D. A., & Tsuang, M. T. (1 995 ). 646.
Frontal brain volume and context effects in Nestor, P. G., Akdag, S. J., O’Donnell, B. F.,
short-term recall in schizophrenia. Biological Niznikiewicz, M., Law, S., Shenton, M. E., et
Psychiatry, 3 7(3 ), 1 44–1 5 0. al. (1 998). Word recall in schizophrenia: A con-
Malenka, R. C., Angel, R. W., Hampton, B., & nectionist model. American Journal of Psychia-
Berger, P. A. (1 982). Impaired central error- try, 1 5 5 (1 2 ), 1 685 –1 690.
correcting behavior in schizophrenia. Archives Nuechterlein, K. H., & Dawson, M. E. (1 984).
of General Psychiatry, 3 9(1 ), 1 01 –1 07. Information processing and attentional func-
Martin, A., Chao, L. L. (2001 ). Semantic memory tioning in the developmental course of
and the brain: Structure and processes. Current schizophrenic disorders. Schizophrenia Bulletin,
Opinion in Neurobiology, 1 1 (2 ), 1 94–201 . 1 0(2 ), 1 60–203 .
Martin, R. C., & Freedman, M. L. (2001 ). Nuechterlein, K. H., Edell, W. S., Norris, M., &
Short-term retention of lexical-semantic repre- Dawson, M. E. (1 986). Attentional vulnerabil-
sentations: Implications for speech production. ity indicators, thought disorder, and negative
Memory, 9(4), 261 –280. symptoms. Schizophrenia Bulletin, 1 2 (3 ), 408–
McGhie, A., & Chapman, J. (1 961 ). Disorders of 426.
attention and perception in early schizophre- O’Driscoll, G. A., Florencio, P. S., Gagnon,
nia. British Journal of Medical Psychology, 3 4: D., Wolff, A. V., Benkelfat, C., Mikula, L.,
1 03 –1 1 6. et al. (2001 ). Amygdala-hippocampal volume
McNeil, T. F., Cantor-Graae, E., & Weinberger, and verbal memory in first-degree relatives
D. R. (2000). Relationship of obstetric compli- of schizophrenic patients. Psychiatry Research,
cations and differences in size of brain struc- 1 07(2 ), 75 –85 .
tures in monozygotic twin pairs discordant for Okubo, Y., Suhara, T., Suzuki, K., Kobayashi,
schizophrenia. American Journal of Psychiatry, K., Inoue, O., et al. (1 997). Decreased pre-
1 5 7(2 ), 203 –21 2. frontal dopamine D1 receptors in schizophre-
McNeill, D., & S. Duncan. (1 999). Growth nia revealed by PET. Nature, 3 85 (661 7), 63 4–
points in thinking for speaking. In E. D. 63 6.
McNeill (Ed.), Language and Gesture: Win- Oltmanns, T. F., & Neale, J. M. (1 975 ).
dow into Thought and Action. Cambridge, UK: Schizophrenic performance when distractors
Cambridge University Press. are present: Attentional deficit or differential
Millar, J. K., Wilson-Annan, J. C., Anderson, S., task difficulty? Journal of Abnormal Psychology,
Christie, S., Taylor, M. S., Semple, C. A., et al. 84(3 ), 205 –209.
(2000). Disruption of two novel genes by a Ozeki, Y., Tomoda, T., Kleiderlein, J., Kamiya, A.,
translocation co-segregating with schizophre- Bord, L., Fujii, K., et al. (2003 ). Disrupted-in-
nia. Human Molecular Genetics, 9(9), 1 41 5 – Schizophrenia-1 (DISC-1 ): Mutant truncation
1 423 . prevents binding to NudE-like (NUDEL) and
Miller, E. K., & Cohen, J. D. (2001 ). An integra- inhibits neurite outgrowth. Proceedings of the
tive theory of prefrontal cortex function. An- National Academy of Sciences, U.S.A., 1 00(1 ),
nual Review of Neuroscience, 2 4, 1 67–202. 289–294.
Minzenberg, M. J., Ober, B. A., & Vinogradov, Passerieux, C., Sergui, J., Besche, C., Chevalier,
S. (2002). Semantic priming in schizophrenia: J. F., Widlocher, D., & Hardy-Bayle, M. C.
(1 997). Heterogeneity of cognitive function- Schneider, W. S., & Shiffrin, R. M. (1 977).

ing of schizophrenic patients evaluated by a Controlled and automatic human information
lexical decision task. Psychological Medicine, processing: I. Detection, search, and attention.
2 7(6), 1 295 –1 3 02. Psychological Review, 84(1 ), 1 –66.
Penn, D. L., Ritchie, M., Francis, J., Combs, Seidman, L. J., Faraone, S. V., Goldstein, J. M.,
D., & Martin, J. (2002). Social perception in Goodman, J. M., Kremen, W. S., Toomey,
schizophrenia: The role of context. Psychiatry R., et al. (1 999). Thalamic and amygdala-
Research, 1 09(2 ), 1 49–1 5 9. hippocampal volume reductions in first-degree
Pfefferbaum, A., & Marsh, L. (1 995 ). Structural relatives of patients with schizophrenia: An
brain imaging in schizophrenia. Clinical Neu- MRI-based morphometric analysis. Biological
roscience, 3 (2 ), 1 05 –1 1 1 . Psychiatry, 46(7), 941 –95 4.
Pinkham, A. E., Penn, D. L., Perkins, D. O., & Seidman, L. J., Faraone, S. V., Goldstein, J. M.,
Lieberman, J. (2003 ). Implications for the neu- Kremen, W. S., Horton, N. J., Makris, N.,
ral basis of social cognition for the study of et al. (2002). Left hippocampal volume as
schizophrenia. American Journal of Psychiatry, a vulnerability indicator for schizophrenia:
1 60(5 ), 81 5 –824. A magnetic resonance imaging morphomet-
Racenstein, J. M., Penn, D., Harrow, M., & ric study of nonpsychotic first-degree relatives.
Schleser, R. (1 999). Thought disorder and psy- Archives of General Psychiatry, 5 9(9), 83 9–
chosocial functioning in schizophrenia: The 849.
concurrent and predictive relationships. Jour- Seidman, L. J., Yurgelun-Todd, D., Kremen,
nal of Nervous and Mental Disease, 1 87(5 ), 281 – W. S., Woods, B. T., Goldstein, J. M., Faraone,
289. S. V., et al. (1 994). Relationship of prefrontal
Rosso, I. M., Cannon, T. D., Huttunen, T., Hut- and temporal lobe MRI measures to neu-
tunen, M. O., Lonnqvist, J., & Gasperoni, ropsychological performance in chronic schizo-
T. L. (2000). Obstetric risk factors for early- phrenia. Biological Psychiatry, 3 5 (4), 23 5 –
onset schizophrenia in a Finnish birth cohort. 246.
American Journal of Psychiatry, 1 5 7(5 ), 801 – Shaham, Y., Shalev, U., Lu, L., De Wit, H., &
807. Stewart, J. (2003 ). The reinstatement model
Rudy, J. W., & O’Reilly, R. C. (2001 ). Con- of drug relapse: History, methodology and
junctive representations, the hippocampus, major findings. Psychopharmacology, 1 68(1 –2 ),
and contextual fear conditioning. Cognitive, 3 –20.
Affective and Behavioral Neuroscience, 1 (1 ), Silver, H., Feldman, P., Bilker, W., & Gur, R. C.
66–82. (2003 ). Working memory deficit as a core neu-
Ruff, R. M., Light, R. H., Parker, S. B., & Levin, ropsychological dysfunction in schizophrenia.
H. S. (1 997). The psychological construct of American Journal of Psychiatry, 1 60(1 0), 1 809–
word fluency. Brain and Language, 5 7(3 ), 3 94– 1 81 6.
405 . Silverstein, S. M., Kovacs, I., Corry, R., & Valone,
Rypma, B., Berger, J. S., & D’Esposito, M. (2002). C. (2000). Perceptual organization, the disor-
The influence of working-memory demand and ganization syndrome, and context processing in
subject performance on prefrontal cortical ac- chronic schizophrenia. Schizophrenia Research,
tivity. Journal of Cognitive Neuroscience, 1 4(5 ), 43 (1 ), 1 1 –20.
721 –73 1 . Sitnikova, T., Salisbury, D. F., Kuperberg, G.,
Sass, L. A., Gunderson, J. G., Singer, M. T., & & Holcomb, P. I. (2002). Electrophysio-
Wynne, L. C. (1 984). Parental communica- logical insights into language processing in
tion deviance and forms of thinking in male schizophrenia. Psychophysiology, 3 9(6), 85 1 –
schizophrenic offspring. Journal of Nervous and 860.
Mental Disease, 1 72 (9), 5 1 3 –5 20. Sokolov, E. N. (1 963 ). Orienting reflex as
Saykin, A. J., Shtasel, D. L., Gur, R. E., a cybernetic system/Orientirovochnyi refleks
Kester, D. B., Mozley, L. H., Stafiniak, P., kak kiberneticheskaia sistema. Zhurnal Vysshei
et al. (1 994). Neuropsychological deficits in Nervnoi Deyatel’nosti, 81 6–83 0.
neuroleptic naive patients with first-episode Spitzer, M., Braun, U., Hermle, L., & Maier,
schizophrenia. Archives of General Psychiatry, S. (1 993 ). Associative semantic network dys-
5 1 (2 ), 1 24–1 3 1 . function in thought-disordered schizophrenic
patients: Direct evidence from indirect seman- Van Erp, T. G., Saleh, P. A., Rosso, I. M., Hut-
tic priming. Biological Psychiatry, 3 4(1 2 ), 864– tunen, M., Lonnqvist, J., Pirkola, T., et al.
877. (2002). Contributions of genetic risk and fe-
Spitzer, M., Weisker, I., Winter, M., Maier, S., tal hypoxia to hippocampal volume in patients
Hermle, L., & Maher, B. A. (1 994). Seman- with schizophrenia or schizoaffective disorder,
tic and phonological priming in schizophrenia. their unaffected siblings, and healthy unre-
Journal of Abnormal Psychology, 1 03 (3 ), 485 – lated volunteers. American Journal of Psychia-
494. try, 1 5 9(9), 1 5 1 4–1 5 20.
Spitzer, M. (1 997). A cognitive neuroscience Van Erp, T. G. W., Saleh, P., Huttunen,
view of schizophrenic thought disorder. M. O., Lonnqvist, J., Kaprio, J., Salonen,
Schizophrenia Bulletin, 2 3 (1 ), 29–5 0. O., et al. (in press). Hippocampal volumes
in schizophrenic twins. Archives of General
Squire, L. R., & Zola, S. M. (1 996). Structure
Psychiatry
and function of declarative and nondeclarative
memory systems. Proceedings of the National Vargha-Khadem, F., Gadian, D. G., Watkins,
Academy of Sciences, U.S.A., 93 (2 4), 1 3 5 1 5 – K. E., Connelly, A., Van Paesschen, W., &
1 3 5 22. Mishkin, M. (1 997). Differential effects of
early hippocampal pathology on episodic and
St. Clair, D., Blackwood, D., Muir, W., Carothers, semantic memory. Science, 2 77(5 3 2 4), 3 76–
A., Walker, M., Spowart, G., et al. (1 990). As- 3 80.
sociation within a family of a balanced auto-
Velakoulis, D., Pantelis, C., McGorry, P. D.,
somal translocation with major mental illness.
Dudgeon, P., Brewer, W., Cook, M., et al.
Lancet, 3 3 6(8706), 1 3 –1 6.
(1 999). Hippocampal volume in first-episode
Stefanis, N., Frangou, S., Yakeley, J., Sharma, T., psychoses and chronic schizophrenia: A
O’Connell, P., Morgan, K., et al. (1 999). Hip- high-resolution magnetic resonance imaging
pocampal volume reduction in schizophrenia: study. Archives of General Psychiatry, 5 6(2 ),
Effects of genetic risk and pregnancy and birth 1 3 3 –1 41 .
complications. Biological Psychiatry, 46(5 ),
Wahlberg, K. E., Wynne, L. C., Oja, H., Keski-
697–702.
talo, P., Anais-Tanner, H., Koistinen, P., et al.
Stroop, J. R. (1 93 5 ). Studies of interference in (2000). Thought disorder index of Finnish
serial verbal reactions. Journal of Experimental adoptees and communication deviance of their
Psychology, 1 8, 643 –662. adoptive parents. Psychological Medicine, 3 0(1 ),
Taylor, W. L. (1 95 3 ). “Cloze procedure”: A 1 27–1 3 6.
new tool for measuring readability. Journalism Weisbrod, M., Maier, S., Harig, S., Himmelsbach,
Quarterly, 3 0, 41 5 –43 3 . U., & Spitzer, M. (1 998). Lateralised semantic
Thompson-Schill, S. L., Jonides, J., Marshuetz, and indirect semantic priming effects in people
C., Smith, E. E., D’Esposito, M., Kan, I. P., with schizophrenia. British Journal of Psychia-
et al. (2002). Effects of frontal lobe damage on try, 1 72 , 1 42–1 46.
interference effects in working memory. Cogni- Weiner, I. (1 990). Neural substrates of latent
tive, Affective and Behavioral Neuroscience, 2 (2 ), inhibition: The switching model. Psychological
1 09–1 20. Bulletin, 1 08(3 ), 442–461 .
Titone, D., Levy, D. L., & Holzman, P. S. (2000). Wirshing, D. A., Marshall, B. D., Green, M. F.,
Contextual insensitivity in schizophrenic lan- Mintz, J., Marder, S. R., & Wirshing, W. C.
guage processing: Evidence from lexical ambi- (1 999). Risperidone in treatment-refractory
guity. Journal of Abnormal Psychology, 1 09(4), schizophrenia. American Journal of Psychiatry,
761 –767. 1 5 6(9), 1 3 74–1 3 79.
Tsuang, M. T., & Faraone, S. V. (1 999). The Wood, J. N. (2003 ). Social cognition and the pre-
concept of target features in schizophrenia re- frontal cortex. Behavioral and Cognitive Neuro-
search. Acta Psychiatrica Scandinavica, Suppl. science Reviews, 2 (2 ), 97–1 1 4.
99(3 95 ), 2–1 1 . Yurgelun-Todd, D. A., Waternaux, C. M., Cohen,
Tsuang, M. T., Stone, W. S., & Faraone, S. V. B. M., Gruber, S. A., English, C. D., & Ren-
(1 999). Schizophrenia: A review of genetic shaw, P. F. (1 996). Functional magnetic reso-
studies. Harvard Review of Psychiatry, 7(4), nance imaging of schizophrenic patients and
1 85 –207. comparison subjects during word production.
American Journal of Psychiatry, 1 5 3 (2 ), 200– schizophrenia. American Journal of Psychiatry,

205 . 1 5 4(6), 81 2–81 8.
Zaidel, D. W., Esiri, M. M., & Harrison, P. J. Zola, S. M., & Squire, L. R. (2001 ). Relation-
(1 997). Size, shape, and orientation of neurons ship between magnitude of damage to the hip-
in the left and right hippocampus: Investiga- pocampus and impaired recognition memory
tion of normal asymmetries and alterations in in monkeys. Hippocampus, 1 1 (2 ), 92–98.
Part VI
ONTOGENY, PHYLOGENY,
LANGUAGE, AND CULTURE

CHAPTER 2 2
Development of Thinking
Graeme S. Halford
It is appropriate to begin a review of research Two ideas that were central to Pi-
on cognitive development with the work of aget’s conception of thought were struc-
pioneering researchers such as Luria, Piaget, ture and self-regulation, both of which were
and Vygotsky, who provided much of the also held by the Gestalt school. How-
conceptual foundation on which later con- ever, a distinguishing feature of Piaget’s
tributions were built. We will begin with a theory was that it was based on logico-
survey of this legacy, then proceed to more mathematical concepts, including function,
contemporary theories, and finally consider operation, group, and lattice. Although
a number of key empirical research topics. he did not claim that logic defined the
laws of thought (cf. Boole, 1 85 4/1 95 1 ), he
used modified logics or “psycho-logics” to
model thought.
Early Influences Piaget’s very extensive empirical investi-
gations into the development of infants’ and
The single most powerful influence on past children’s cognitions were conceptualized
research into the development of thinking by a succession of distinct logics, which have
has been the work of Piaget and his col- come to be known as “stages” of cognitive de-
laborators (Inhelder & Piaget, 1 95 8, 1 964; velopment. The first was the sensorimotor
Piaget, 1 95 0, 1 95 2, 1 95 3 , 1 95 7, 1 970), but stage, lasting from birth to about one-and-
the influence of Vygotsky (1 962) appears to a-half to two years, characterized by struc-
be increasing with time. The work of Luria tured, organized activity but not thought.
(1 976) deservedly had a major influence on During this stage, a structure of actions be-
early cognitive development research, but came elaborated into a mathematical group,
not primarily devoted to thinking. In this meaning that an integrated, self-regulating
chapter, I consider Piaget first, followed by system of actions developed. Piaget believed
Vygotsky, and then the common ground that the concept of objects as real and
between them. permanent emerged as this structure was
52 9
elaborated. The preoperational stage lasted to perform as Piaget reported on the tests
from approximately two to seven years, and he used. The major challenges to his find-
during this time semiotic or symbolic func- ings have been based on different meth-
tions were developed, including play, draw- ods of assessment, the claim being that his
ing, imagery, and language. Thought at this methods underestimated the cognitive ca-
stage was conceptualized in terms of what pabilities of young children (Baillargeon,
Piaget called “function logic,” the essential 1 995 ; Bryant, 1 972; Bryant & Trabasso, 1 971 ;
idea of which is a representation of a link Donaldson, 1 971 ; Gelman, 1 972). How-
between two variables. At the concrete op- ever, these claims also have been subject
erational stage, lasting from eight to about to controversy. Miller (1 976) showed that
fourteen years, thought was conceptualized nonverbal assessments did not demonstrate
in terms of what Piaget called “groupings,” improved reasoning if the cognitive skills
which were equivalent to the mathematical employed were taken into account, and a
concept of a groupoid, meaning a set with similar point was made about subsequent re-
a single binary operation (Sheppard, 1 978). search by Halford (1 989). However, there
The essential idea here is the ability to com- were also some hundreds of training stud-
pose classes, sets, relations, or functions, into ies, reviewed by Field (1 987) and Halford
integrated systems (Halford, 1 982). Con- (1 982), that were sometimes interpreted as
cepts such as conservation (invariance of showing that cognitive development could
quantity, number, weight, and volume), se- be accelerated and depended more on ex-
riation or ordering of objects, transitive in- perience than on development of thought
ference, classification, and spatial perspec- structures. The stage concept has also been
tives emerge as a result of the more elaborate heavily criticized for theoretical inadequa-
thought structures that develop during this cies (Brainerd, 1 978) and for lack of empir-
time. At the formal operational stage, begin- ical support (Bruner, Olver, & Greenfield,
ning in adolescence, the ability to compose 1 966). In particular, acquisition tends to be
concrete operations into higher-level struc- gradual and experience-based rather than
tures emerges with the result that thought sudden or “stage-like,” and the concurrence
has greater autonomy and flexibility. between acquisitions at the same stage of-
Cognitive development depended, accor- ten has not been as close as Piagetian the-
ding to Piaget, on assimilation of experi- ory might be taken to imply. However, there
ence to cognitive structures with accom- have also been some spirited defenses of Pi-
modation of the structure to the new aget (Beilin, 1 992; Lourenco & Machado,
information. The combination of assimila- 1 996), and Smith (2002) has given a con-
tion and accommodation amounts to a pro- temporary account of Piagetian theory. See
cess of self-regulation that Piaget termed also the special issue of Cognitive Develop-
“equilibration.” He rejected the association- ment edited by Bryant (2002) on “Construc-
ist learning theories of the time, although his tivism Today.”
conceptions in many ways anticipated mod- The underlying problem here seems to
ern conceptions of information processing have been that it is difficult to operational-
and dynamic systems. ize Piagetian concepts in the methodologies
The work of the Piagetian school has been that evolved in Anglo-Saxon psychology to
one of the most controversial topics in the about 1 970. His conceptions have been more
field, and claims that Piaget was wrong in compatible with methodologies that devel-
many important respects are not uncommon oped after the “cognitive revolution,” includ-
(Bjorklund, 1 997; Gopnik, 1 996). The fol- ing information processing and dynamic sys-
lowing points are intended to help provide tems theories. In the next section I consider
a balanced account of this issue. First, Pi- alternative ways of conceptualizing the de-
aget’s empirical findings have been widely velopment of children’s thought.
replicated (Modgil, 1 974; Sigel & Hooper, The work of Vygotsky (1 962) was the
1 968). That is, children have been found other major influence on research into the
development of thinking 531
development of thinking, and his contri- Development of Theory

bution is becoming increasingly influential
even today (Lloyd & Fernyhough, 1 999). Theory of development of reasoning diversi-
Three of Vygotsky’s most important contri- fied in numerous directions in the latter half
butions were his ideas on the relation be- of the twentieth century and our concep-
tween thought and language, his emphasis tions of reasoning processes have undergone
on the role of culture in the development some fundamental changes. Perhaps one of
of thinking, and the zone of proximal de- the most important is that there is much less
velopment. Early in the history of cognitive reliance on logic as a norm of reasoning and
development research, there was consider- more emphasis on the interaction between
able debate as to whether thought depends reasoning processes and the child’s experi-
on language development, as implied by ence. Information processing theories were
Bruner, Olver, & Greenfield. (1 966), or the one of the first lines of development follow-
reverse, as implied by Slobin (1 972). Vy- ing the impact of Piaget and Vygotsky, so it
gotsky (1 962) proposed that thought and is appropriate to consider them first.
language have different origins both in evo-
lution and in development. Language was
Information Processing Theories
initially social in character, whereas problem
solving was initially motor. Language and An attempt to conceptualize development
thought develop independently for some of thinking in terms of information pro-
time after infancy; then the young child de- cessing concepts was made by what be-
velops egocentric speech, the beginning of came known as the Neo-Piagetian school
the representational function. Finally, chil- (Case, 1 985 , 1 992a; Case et al., 1 996; Chap-
dren develop “inner speech,” which serves man, 1 987, 1 990; Fischer, 1 980; Halford,
the symbolic function of thought. Vygot- 1 982, 1 993 ; McLaughlin, 1 963 ; Pascual-
sky emphasized the interaction between bi- Leone, 1 970; Pascual-Leone & Smith, 1 969).
ological maturation and social experience. These models, reviewed in detail by Halford
As the child matures, language becomes (2002), reconceptualize Piaget’s stages in
an increasingly important influence on the terms of the information processing de-
development of thought and is the chief mands they make. All of them postulate
means by which culture is absorbed by the that higher information processing capac-
child. Vygotsky’s concept of the zone of ity becomes available with development ei-
proximal development, which means that ther through maturation (Halford, 1 993 ) or
new developments are close to existing cog- increased processing efficiency that leaves
nitive abilities, is broadly consistent with more capacity available for working mem-
Piaget’s notion that new knowledge is as- ory (Case, 1 985 ). Note that these processes
similated to existing structure. This is part are not mutually exclusive. Chapman and
of a larger picture in which both Piaget and Lindenberger (1 989, p. 23 8) attempted to
Vygotsky saw cognitive development as an synthesize these theories under the prin-
active organizing process that tends toward ciple that “the total capacity requirement
an equilibrium with its own internal proof a given form of reasoning is equal to
cesses and with the external environment. the number of operatory variables that are
Piaget’s work had greater early influence, assigned values simultaneously in employ-
but the impact of Vygotsky’s work is ining that form of reasoning in a particular
creasing at what appears to be an acceler- task.”
ating rate. Among the many areas in which Other theoretical developments were
it has been important are the development more independent of the Piagetian tradi-
of education theory (Gallimore & Tharp, tion. An important class of theories was
1 999) and research on collaborative problem based on computer simulations first using
solving (Garton, 2004; see also Greenfield, symbolic architectures (Halford, Wilson, &
Chap. 27.) McDonald, 1 995 ; Klahr & Wallace, 1 976;
Simon & Klahr, 1 995 ) and later using neu- lace (1 976). With repeated experience, the
ral nets (Elman, 1 990; McClelland, 1 995 ; model gradually learns to classify the ac-
Shultz, 1 991 ; Shultz, et al, 1 995 ). The tion of spacing out the items as a conserving
model of Klahr and Wallace (1 976) was con- transformation, using the learning mecha-
cerned with quantification operators, includ- nism of the SOAR model, called “chunking,”
ing subitizing (direct estimation of small which has been shown to have considerable
sets without counting), counting, and esti- generality.
mation (approximate quantification of large Acquisition of transitive inference was
sets such as crowds). It was used to model simulated by the self-modifying produc-
conservation or understanding that a quan- tion system model of Halford, Smith, et al.
tity remains invariant despite transforma- (1 995 ). Development of transitive inference
tions of physical dimensions. In a typical sim- strategies is guided by a concept of order
ple number conservation task, two rows of based on any representation of an ordered
beads are placed in one-to-one correspon- set of at least three elements. When no pro-
dence. Then one row is transformed (e.g., duction rule exists for a given problem, the
by spacing objects more widely and thus model uses analogical mapping and means-
increasing the length of the row without end analysis to determine the correct an-
adding any items); then the child is asked swer; then a production rule is created to
whether each row still contains the same handle that case. Rules are strengthened or
number or whether they are different. Pre- weakened by subsequent experiences with
conserving children cannot answer this ques- success or failure.
tion correctly because they have not learned
that the transformation leaves number in-
Neural Net Models
variant. In the model of Klahr and Wallace
(1 976) the task is performed initially by Neural net models of thinking are reviewed
quantifying first one row followed by the by Doumas and Hummel (Chap. 4), but
other in the pretransformed display and then the contribution of neural net models to
comparing the results. The transformed row cognitive development is considered here.
is quantified again after the transformation A good way to illustrate neural net mod-
and found to be still the same as the other els of cognitive development is to examine
row. With repeated quantification before McClelland’s (1 995 ) model of children’s un-
and after a transformation, the rule that pre- derstanding of the balance scale. The net is
and post-transformed quantities are equal is shown schematically in Figure 22.1 together
learned, and the quantification operators are with a balance scale problem. It is a three-
no longer employed. (See also Chap. 1 7 by layered net, which means that activation is
Lovett & Anderson, on production system propagated from the input units to the hid-
models of thinking.) den (middle) layer and then to the output
The Q-SOAR model of Simon and Klahr layer. There are four sets of five input units
(1 995 ) applied Newell’s (1 990) SOAR ar- representing one-to-five weights on pegs one
chitecture to Gelman’s (1 982) study of num- to five steps from the fulcrum on both left
ber conservation acquisition. Children are and right sides. The units that are activated
shown two equal rows of objects, asked to are shown as black. The activations in the
count each row in turn and say how many input units represent the problem in the
each contains, then to say whether they are top of the figure. In the first set of input
the same or different. Then one row is trans- units, representing number of weights on
formed and the preconserving child is un- the left, unit 3 is activated, coding the three
able to say whether they are the same or weights on the left. Similarly, in the second
different. This is represented in Q-SOAR set of input units, representing weights on
as an impasse. The model then searches the right, unit 4 is activating, coding four
for a solution to the problem using the weights on the right. Distances are coded
quantification procedure of Klahr and Wal- in a similar way by the two sets of input
Output
Left Right Units
- -
+ +
- + + - -
+ + + +
- - -
Input
Units
left right left right
Weight Distance
Figure 2 2 .1 . Balance scale model of McClelland (1 995 ). By permission of the author and Oxford
University Press.
units on the right. In the first set, unit 3 is weight on the right is greater than on the
activated, coding the weights on peg 3 on left. The first hidden unit has the opposite
the left, whereas in the second set, unit 2 pattern of weights and will be more strongly
is activated, coding weights on peg 2 on the activated if weight is greater on the left. The
right. second hidden unit also has positive connec-
There are four hidden units (shown in the tions to the right output unit. Thus, greater
middle of the net), two of which compare weight on the right will tend to produce
weights and two that compare distances. greater activation on the right output unit,
The units that are more highly activated are representing a tendency for the right side go-
shown as black, although activations would ing down. The second pair of hidden units
be graded, rather than all-or-none. Finally, compare distances in corresponding fashion.
there are the output units that compute the The activations of the output units depend
balance state. Activation of an output unit on activations of hidden units comparing
represents the corresponding side of the bal- both weights and distances. In this case the
ance beam going down. If the beam is bal- greater weight on the right tends to make
anced, the activations in the output units the right side go down, but this is countered
would be equal, which is defined as being by the greater distance on the left; thus, the
within 0.3 of each other. predicted position of the beam will be ap-
The operation of the unit can be under- proximately balanced, although, in fact, the
stood from the connection weights between left side would go down. The network does
units, which are shown schematically in Fig- not compute the product of weight and dis-
ure 22.1 as +\−. The second hidden unit has tance but compares the influences of weights
positive connections to all input units rep- and distances on each side.
resenting weight on the right and negative The network was trained by backpropa-
connections to all input units representing gation; that is, comparing the network’s out-
weight on the left (although only a single put on each trial with the correct output and
arrow is shown in each case for simplicity). then adjusting the connection weights to re-
This unit is more strongly activated because duce the discrepancy. The training would
result in the units representing larger cept (Surber & Gzesh, 1 984). Other restric-
weights or larger distances having greater tions are that, as Marcus (1 998a, 1 998b) has
connection weights to the hidden units. pointed out, if the model is trained on two or
Thus, metrics for weight and distance three weights on either side, it cannot gener-
emerge as a result of training and are not alize to problems with four or five weights.
predefined in the net. This is possibly the Again, however, it would be reasonable to
most important property of the model be- expect that children would generalize in this
cause it shows how a structured representa- way. The conclusions therefore are that the
tion can emerge from the process of learn- model can be trained to compute one func-
ing to compute input-output functions that tion implicated by the balance scale, albeit
match those in the environment. under restricted conditions, and that it does
The model also captures a number of not fully capture understanding of the con-
crucial developmental results. Its progress cept but is nevertheless an important step
through training corresponded with the forward in our understanding of cognitive
course of development as defined by development because it shows how struc-
Siegler’s (1 981 ) rules. According to Rule I, tured representation can emerge.
judgments are based on weight, irrespective The balance scale model by McClelland
of distance. In Rule II states that distance is (1 995 ) is a three-layered, or backpropaga-
considered if the weights are first found tion, net. This type of architecture has been
to be equal. Rule III asserts that weight used in a great many models, in cognitive
and distance are considered but difficulty development and elsewhere. One reason is
is encountered when weight is greater on that it can, in principle, compute any input–
one side and distance is greater on the output function. The simple recurrent net
other. Rule IV (torque rule) involves com- (Elman, 1 990) is an important model in this
paring the product of weight and distance class. In this type of net, activations in the
on the left side with the product of weight hidden units are copied over into context
and distance on the right side. The model units. On the next trial, activations in the
also captured the torque difference effect – hidden units are influenced by activations in
that is, the difference between the product both the input units and the context units.
of weight and distance on the left (Wl × Dl ) The result is that the output of the net
and on the right (Wr × Dr ) affects children’s is influenced by representations on previ-
performance, because they are more likely ous trials as well as by the current input.
to recognize that one side will go down if The net therefore takes account of links be-
torque difference is large even though there tween events in a sequence. The model was
is no logical basis for this given that even a trained to predict the next word in a sen-
small torque difference will cause one side tence. Training was based on a large cor-
to go down. This is one of many ways in pus of sentences by representing each suc-
which neural net models capture psycholog- cessive word in the input units, and the
ical properties of task performance. output units were trained to represent the
This model computes the balance state next word. Feedback was given concerning
as a function of weight and distance on left the accuracy of the output, thereby adjust-
and right sides of the balance beam. How- ing the connection weights to improve the
ever, understanding the balance beam also model’s prediction. The model learned to
entails determining weight or distance val- predict the next word in a sentence and re-
ues that will make the beam balance. There spected grammatical categories even when
are effectively five variables here: Wl , Dl , words in related categories spanned embed-
Wr , Dr , and balance state. Complete under- ded clauses. Cluster analysis of the hidden
standing of the balance scale would include unit activations showed that words in the
being able to determine any variable given same grammatical category, such as nouns
the other four; that is, compute all five func- or verbs, tended to have similar activations.
tions implicated by the balance scale con- Semantically similar words, such as those
representing animals or foods, also tended Cascade Correlation Models

to have similar hidden unit representations.
Cascade correlation models provide a mech-
Elman (1 990) was careful not to predefine
anism by which the dimensionality of rep-
categories, and the inputs used were orthog-
resentations can be increased to handle in-
onal; thus, no pre-existing similarities were
creased dimensions in the task. They do this
supplied to the model. Similarities were
by adding units to the hidden layer. The ini-
created in the hidden unit activations that
tial net has minimal hidden units and some-
reflected the input–output functions the
times starts with none. Training takes place
model was required to learn. Therefore,
in two modes. In the first mode, weights are
to the extent that categories developed,
adjusted to yield the appropriate output for
they are an emergent property of the
each input. In the second mode, hidden units
model and one that reflects contingencies in
are recruited to increase the accuracy of the
the environment. This model, like that of
output. Recruitment is based on correlation
McClelland (1 995 ), offers a possible mech-
between a candidate’s activation and the ex-
anism by which structured representation
isting error of the network. After recruit-
might be acquired.
ment of a hidden unit, training continues
The ability of simple recurrent nets to
in the first mode and the system cycles be-
predict sequences has been utilized to model
tween the modes until a learning criterion
infants’ expectations of the reappearance of
is reached.
occluded objects (Mareschal, Plunkett, &
Cascade correlation models have been
Harris, 1 995 ; Munakata et al., 1 997) thereby
used to model a number of developmen-
simulating infants’ understanding of the ob-
tal phenomena (Shultz, 1 991 ; Shultz et al.,
ject concept (Baillargeon et al., 1 990). These
1 995 ; Sirois & Shultz, 1 998). Shultz and col-
models are basically consistent with the
leagues used cascade correlation to model
model of Smith et al. (1 999). Again, how-
the same balance scale problem modeled by
ever, there have been limitations. Marcus
McClelland (1 995 ). The initial net was sim-
(1 998a, 1 998b) found that the model of Mu-
ilar to that used by McClelland and shown
nakata et al. (1 997) did not generalize to ob-
in Figure 22.1 , but without hidden units. Ini-
jects in new positions on the display.
tial training was with problems varying only
The potential of models such as this to
in weight, and the net performed consistent
learn regularities in the environment and ac-
with Siegler’s (1 981 ) Rule 1 . Once the dis-
quire concepts has inspired a whole new
tance variable was introduced, the net re-
approach to cognitive development (Elman
cruited a single hidden unit. It then pro-
et al., 1 996). Elman et al. (1 996) see con-
gressed to Rule 2 and higher rules that take
nectionism as giving more powerful means
account of distance, effectively simulating
to analyze the gene–environment interac-
the developmental progression in a manner
tions that are the basis of development. They
similar to McClelland’s model (1 995 ).
advocate a form of connectionism that is
founded in biology, is influenced by devel-
Neural Net Models and Symbolic
opmental neuroscience, and that can pro-
Processes
duce neurologically plausible computational
models. Although they see an undoubted Concern that three-layered net models do
role for innateness in cognitive develop- not capture symbolic processes has been
ment, they argue some nativist conceptions expressed by Fodor and Pylyshyn (1 988).
underestimate the potential for new cogni- Properties that are considered essential by
tive forms to emerge from the interaction Fodor and Pylyshyn (1 988) are composition-
of neural processes. The simple recurrent ality and systematicity. The essential idea of
net nicely illustrates how representations compositionality is that symbols must re-
that respect distinctions between word cat- tain their identity and their meaning when
egories emerge from the model’s interaction combined into more complex representa-
with the environment. tions. Thus, the cognitive symbols for “dog”
and “happy” must retain their identity when 1 993 ; Halford, Wilson, & Phillips, 1 998).
combined into the symbol for “happy dog.” See Halford (2002) for a summary of this
Prototypes are not necessarily compositional approach.
in this way (Fodor, 1 995 ). One problem for
three-layered net models is that the repre-
Dynamic Systems Models
sentations in the hidden units do not neces-
sarily include the components of the input in Dynamic systems models (Fischer & Bidell,
a form that is recognizable to the performer. 1 998; Fischer & Pare-Blagoev, 2000; van
Any structure that exists in the hidden layer Geert, 1 991 , 1 998, 2000) have offered new
must be discovered by an external observer ways to analyze developmental data. A dy-
(the experimenter) using techniques such as namic system is a formal system, the state
cluster analysis (Elman, 1 990). Representa- of which depends on its state at a previous
tions in hidden units are not accessible to point in time. The dynamic system model
strategic processes. They are more like im- of van Geert (1 998) was designed around
plicit knowledge (Karmiloff-Smith, 1 994). principles derived from the work of Piaget
Systematicity, in essence, means that cog- and Vygotsky and has a number of interest-
nitive processes are subject to structural ing properties. It can account for different
constraints independent of content. Three- types of cognitive growth, such as slow linear
layered nets lack strong systematicity (Mar- increase and sudden discontinuities, within
cus, 1 998a, 1 998b; Phillips, 1 994), meaning the same system. It can also show how a
they cannot generalize to an element that has complex, self-regulating system can emerge
not occurred in the same role before, even if from the interaction of a few variables. The
the element is familiar. Thus, a net trained model was fitted to a number of develop-
on “John loves Mary” and “Tom loves Jane” mental data sets, and some important devel-
could generalize to “John loves Jane” but not opmental phenomena, including conserva-
to “Jane loves John” or even to “Mary loves tion acquisition, were simulated. Links have
John.” Nets of this type learn representa- also been made between dynamic systems
tions that are needed to compute the input– models and neural net models.
output functions on which they are trained, Dynamic systems models have also
but they do not learn abstract relations. been linked to other issues. Raijmakers,
Although three-layered net models have van Koten, and Molenaar (1 996) analyzed
real potential to advance research on cog- McClelland’s (1 995 ) neural net model of the
nitive development (Bray et al., 1 997), it balance scale and found no evidence of the
appears they lack the structural properties flags indicating discontinuities that are found
that have long been regarded as character- in empirical data. They suggest that back-
istic of higher cognition (Chomsky, 1 980; propagation models simulate the type of
Humphrey, 1 95 1 ; Mandler & Mandler, 1 964; stimulus-response associations that are char-
Miller, Galanter, & Pribram, 1 960; Newell, acteristic of animals and young children but
1 990; Piaget, 1 95 0; Wertheimer, 1 945 ). One do not simulate the rule-governed behavior
response to this problem (Smolensky, 1 988) characteristic of older children and adults. In
is that neural net models seek to explain many respects, this finding is consistent with
symbols as emergent properties of more ba- the analysis of the model presented earlier.
sic processes, as was illustrated earlier. A sec- On the other hand, backpropagation mod-
ond approach has been to develop symbolic els incorporate learning functions that have
neural net models of higher cognitive pro- been missing from models of higher cogni-
cesses (Doumas & Hummel, Chap. 4; Shas- tive processes. As we have seen, they show
tri & Ajjanagadde, 1 993 ; Smolensky, 1 990; how structured representations begin to
but see also Halford, Wilson, & Phillips, emerge as a result of learning input-output
1 998). A symbolic connectionist account functions.
of cognitive development has been given Although there are acknowledged diffi-
by Halford and his collaborators (Halford, culties with dynamic systems models (van
Geert, 1 998), they provide much more so- change of background) depends on matura-
phisticated implementations of important tion of the hippocampus.
developmental theories, including that of At a more general level, Quartz and Se-
Piaget and Vygotsky. This does not mean jnowski (1 997) have argued that synaptic
that Piaget and Vygotsky are fully vindicated growth, axonal arborization, and dendritic
by dynamic systems models, but concepts development play a role in processing ca-
such as equilibration and self-regulation, pacity increase with age. They also point out
which are at the core of their theories, do that neural plasticity would cause capacity to
seem to have a new lease on life. Most impor- increase as a function of experience. This im-
tantly, dynamic systems models have poten- plies that the issue of whether cognitive de-
tial to deepen our understanding of cognitive velopment depends on capacity, knowledge,
developmental processes. And, as Fischer or both may need to be redefined. It might
and Pare-Blagoev (2000) point out, there are be that cognitive development depends on
tools based on Lotus 1 23 or Microsoft Excel growth of capacity, which is at least partly
that make dynamic system modeling more produced by experience.
accessible.
Strategy Development
Problem-solving strategies are important to
Links to Brain Development
reasoning in children and adults, and much
The finding by Thatcher, Walker, and Giu- of the improvement in children’s reasoning
dice (1 987) of brain growth spurts that ap- can be attributed to development of more
peared to correspond to stage transitions in powerful strategies. It is appropriate there-
cognitive development stimulated consider- fore that much research has been devoted to
able interest in the explanatory potential of development of strategies. Following work
neural maturation. One of the important on rule assessment (Briars & Siegler, 1 984;
landmarks in infant development is the A Siegler, 1 981 ), Siegler and his collaborators
not-B error: If infants are shown a toy hid- conducted an extensive study of strategy
den at A several times and allowed to re- (Siegler, 1 999; Siegler & Chen, 1 998; Siegler
trieve it and then see it hidden at B, be- & Jenkins, 1 989; Siegler & Shipley, 1 995 ;
fore approximately 1 2 months of age they Siegler & Shrager, 1 984). Two of the mod-
tend to search for it at A. Studies by Di- els were concerned with development of ad-
amond (1 988) and Goldman-Rakic (1 987) dition strategies in young children. When
showing the link between frontal lobe func- asked to add two single-digit numbers, they
tion and the A not-B error were important chose between a set of strategies including
stimuli to work on infant brain development. retrieving the answer from memory, decom-
Case (1 992a, 1 992b) and Fischer (1 987; Fis- posing the numbers (e.g., 3 + 5 = 4 + 4 = 8),
cher & Rose, 1 996) have drawn interest- counting both sets (counting right through
ing parallels between cognitive development a set of three and a set of five, perhaps using
and the growth of connections between the fingers), and the min strategy of counting on
frontal lobes and other brain regions. Robin from the top number in the larger set (e.g.,
and Holyoak (1 995 ) and Waltz et al. (1 999) 5 , 6, 7, 8, so 3 + 5 = 8).
have also drawn attention to the role of the Siegler and Shrager’s early strategy choice
frontal cortex in processing relations of the model (1 984) was based on distribution of
kind described by Halford and his collabo- associations. The idea is that each addition
rators (Halford, 1 993 ; Halford, Bain et al., sum is associated with answers of varying
1 998; Halford, Wilson, & Phillips, 1 998). In strengths, and so for a given sample of chil-
a different context, Rudy, Keith, and Geor- dren, 2 + 1 might yield the answer “3 ” 80%
gen (1 993 ) present evidence that configu- of the time; “1 ” or “2,” 4%; “4,” 3 %; and so
ral learning (e.g., conditional discrimination, on. The chance of an answer being chosen is
in which a cue-response link is reversed on a function of its associative strength relative
to competing answers. The more peaked the have tended to be regarded as equivalent
distribution, the more likely it will be that if they require the same knowledge domain
a single answer will occur. However, it will and are similar methodologically, or if they
be adopted only if it is above the confidence have similar difficulties on a psychometric
criterion. If not, alternative strategies, such scale. Although these criteria have great util-
as counting, are sought. ity, they have not led to an understanding of
In their later work, Siegler and his col- factors that underlie complexity, nor do they
laborators developed the Adaptive Strategy explain why tasks that differ in content or
Choice Model (ASCM, pronounced “Ask- procedure can be of equivalent complexity
em”) which makes more active strategy whereas tasks that are superficially similar
choices. At the beginning, ASCM knows can be very different in complexity. With-
only the small set of strategies typically used out a means of assigning cognitive tasks to
by 4-year-olds, but it has general cognitive equivalence classes with common properties
skills for choosing and evaluating strategies. and relating tasks in different classes to each
The model is trained on a set of elementary other in an orderly way, psychology is in a
addition facts; then the min strategy is added position similar to that of chemistry with-
to the model’s repertoire. This entails count- out the periodic table (Frye & Zelazo, 1 998).
ing on from the larger number to be added, Two metrics for cognitive complexity have
so if the sum is 5 + 3 , the procedure is to been developed in the past decade.
count 5 , 6, 7, 8. The model chooses a strat-
egy for each problem on the basis of the past cognitive complexity and control
speed and accuracy of the strategy and on (ccc) theory
similarity between the current problem and Frye, Zelazo, & Palfai (1 995 ; Zelazo & Frye,
past problems in which a strategy has been 1 998) analyze complexity according to the
used. Each time a strategy is used, the record number of hierarchical levels of rules re-
of its success is updated, and the projected quired for the task. A simple task entails
strength of the strategy for that problem is rules that link an antecedent to a conse-
calculated. The strength of association be- quent, a → c, whereas complex tasks have
tween a problem and a specific answer is in- rules that are embedded in a higher-order
creased or decreased as determined by the rule that modifies the lower level rules; thus,
success of the answer. One of the strengths another level is added to the hierarchy. The
of the model is that it can account for vari- dimensional change card sort task has been
ability both between children and between a fruitful implementation of this theory. In
different strategies used by the same child a simple sorting task, a green circle might
for a particular class of problems. Most im- be assigned to the green category and a red
portantly, it provides a reasonably accurate triangle to the red category, where cate-
account of strategy development in children gories are indicated by templates comprising
as they age. a green triangle and a red circle. In a com-
plex task, sorting depends on whether the
higher order rule specifies sorting by color,
Complexity
as just mentioned, or by shape. If sorting is
Children become capable of more complex by shape, the green circle is sorted with the
reasoning with age, and it is therefore im- red circle, and the red triangle is sorted with
portant to have some way of comparing the the green triangle. Normative data (e.g., Ze-
complexities of reasoning tasks. A concep- lazo & Frye, 1 998; Zelazo & Jacques, 1 996)
tual complexity theory and accompanying indicate that children typically process a sin-
metric that discriminate tasks of different gle rule by two years of age, a pair of rules
difficulty and explain why they differ is es- by three years, and a pair of rules embed-
sential to understanding cognitive develop- ded under a higher order rule by four years.
ment. It is also necessary to define equiv- The dimensional change card sort task has
alence in cognitive tasks. In the past, tasks been a useful predictor of other cognitive
performances such as concept of mind (Frye variables bound in a representation. There-

Zelazo, & Palfai, 1 995 ). fore RC theory is directly applicable both to
hierarchical and nonhierarchical tasks. Also,
the principles of segmentation and concep-
the relational complexity (rc) metric tual chunking imply that difficult tasks are
Halford (Halford, Wilson, & Phillips, 1 998) those that cannot be decomposed into sim-
defines complexity as a function of the num- pler tasks. In the sorting task discussed with
ber of variables that can be related in a single respect to CCC theory, it is necessary to keep
cognitive representation. This corresponds in mind that we are sorting by color in order
to the arity, or number of arguments (slots) to determine that the green circle is sorted
of a relation (an n-ary relation is a set of with the green triangle. This means the task
points in n-dimensional space). Normative cannot be decomposed into two subtasks
data indicate that quaternary relations (four that are performed independently, because
related variables) are the most complex that the conflicting dimension is always present.
can be processed in parallel by most humans, Andrews and Halford (2002) showed that
although a minority can process quinary re- with four- to eight-year-old children, in the
lations under optimal conditions. Children domains of transitivity, hierarchical classi-
can process unary relations at one year, bi- fication, cardinality, comprehension of rel-
nary relations at two years, ternary relations ative clause sentences, hypothesis testing,
at five years, and the adult level is reached at and class inclusion, a single relational com-
1 1 years (median ages). plexity factor accounted for approximately
Complex tasks are segmented into compo- 5 0% of variance and factor scores correlated
nents that do not overload capacity to pro- with fluid intelligence (r = .79) and working
cess information in parallel. However, rela- memory (r = .66).
tions between variables in different segments
become inaccessible (just as a three-way in-
Increased Dimensionality
teraction would be inaccessible if two-way
analyses were performed). Processing loads Taking account of extra dimensions is a fun-
can also be reduced by conceptual chunking, damental requirement for cognitive devel-
which is equivalent to compressing variables opment. For example, the progression from
(analogous to collapsing factors in a multi- an undifferentiated concept of heat to a con-
variate experimental design). For example, cept that distinguishes heat and temperature
velocity = distance/time but can be recoded entails taking account of the dimensions of
to a binding between a variable and a con- mass and specific heat: Heat = temperat-
stant (e.g., speed = 80 kph) (Halford, Wil- ure × specific heat × mass. Similarly, the dis-
son, & Phillips, 1 998. Section 3 .4.1 ). Con- tinction between weight and density de-
ceptual chunking reduces processing load, pends on taking into account volume and
but chunked relations become inaccessible specific gravity: Weight = specific gravity ×
(e.g., if we think of velocity as a single vari- volume. Taking account of the extra dimen-
able, we cannot determine what happens to sions enables children to progress from un-
velocity if we travel the same distance in half differentiated concepts of heat or weight to
the time). Complexity analyses are based on more sophisticated concepts that recognize
the principle that variables can be chunked the distinction between heat and tempera-
or segmented only if relations between them do ture or between weight and specific gravity.
not need to be processed. Tasks that impose Thus, they become capable of recognizing
high loads are those in which chunking and that a piece of aluminium weighs less than
segmentation are constrained. a similar volume of lead, but a sufficiently
CCC and RC theories have some com- large piece of aluminium can weight more
mon ground, but whereas CCC attributes than a piece of lead. Arguably, the progres-
complexity to the number of levels of a hi- sion that children make here parallels the de-
erarchy, RC attributes it to the number of velopment of these concepts in the history of
science (Carey & Gelman, 1 991 ). In an en- (Carey & Gelman, 1 991 ; Ceci & Howe, 1 978;
tirely different context, acquisition of con- Keil, 1 991 ). Cognitive development can also
servation of continuous quantity arguably be seen as analogous to acquisition of ex-
entails taking account of height and width pertise, so the reasoning of young children
of containers, rather than fixating on height is analogous to that of the novice in a do-
alone (Piaget, 1 95 0). The essential point main. The effect of domain expertise on
here is that cognitive representations must even the most basic cognitive functions was
include sufficient dimensions to take ac- demonstrated by Chi (1 976), who showed
count of the variations in a phenomenon, that child chess experts outperformed adult
so children must represent volume and spe- chess novices on a simple recall test of chess
cific gravity to take account of variations in pieces on a board. On recall of digits, the
weight, and so on. children performed according to age norms,
The importance of cascade correlation and well below the level of adults. This
models, considered earlier, is that they offer experiment cannot be interpreted validly
a possible mechanism by which extra dimen- as showing that memory capacity does not
sions can be added to cognitive representa- change with age, because capacity is not
tions to take account of variations in the task. measured and the experiment is quite con-
The model does not have to be told what di- sistent with an increase in capacity with age
mensions to include. It creates dimensions that is overridden by differences in domain
in its own representations, contained in the expertise. The capacity question requires
hidden units, as required for input–output quite a different methodology. However, the
functions on which it is trained. This can be study does show how powerful effects of
seen as modeling the increased dimension- domain knowledge can be. Carey (1 991 ) ar-
ality of children’s cognitive representations gues that differentiation of heat and mass
as they learn to predict variations in the en- by young children is similarly attributable to
vironment. This mechanism illustrates the knowledge acquisition. There is, of course,
potential for neural nets to provide the long- no logical reason to assume that explanations
hoped-for basis of constructivism without based on knowledge acquisition are neces-
postulating that all the dimensions children sarily incompatible with explanations based
attend to are innately determined (Elman on growth in capacity. Most of the evidence
et al., 1 996, but see also Marcus, 1 998a, suggests an interaction of these processes.
1 998b). Mareschal & Shultz (1 996) suggest Although not in the mainstream of
that cascade correlation models can provide knowledge research in cognitive develop-
a way to increase the computational power ment, Halford and Wilson (1 980) and Hal-
of a system, thereby overcoming a criticism ford, Bain et al. (1 998) investigated possible
by Fodor (1 980) of constructivist models of mechanisms for acquisition of structured
cognitive development. knowledge along lines similar to the in-
duction theory proposed by Holland et al.
(1 986). See also a special issue of Human
Knowledge and Expertise
Development (Kuhn, 1 995 ) on reconceptual-
The theories considered so far have placed izing the intersection between development
major emphasis on development of reason- and learning.
ing processes, but acquisition and organiza- Advances in our understanding of chil-
tion of knowledge is equally important. Fur- dren’s knowledge have had a pervasive influ-
thermore, knowledge acquisition interacts ence on research in the field, and it would be
with development of reasoning processes to hard to think of a domain that has not been
determine how effectively children can rea- touched by it. In this review, knowledge is
son and solve problems. considered in relation to children’s exper-
Several important lines of research have tise in specific domains, including conser-
recognized acquisition of knowledge as vation, transitivity, classification, prototype
a major factor in cognitive development formation, theory of mind, and scientific and
mathematical concepts. (See also Chap. 1 4 of both domain-specific and domain-general

by Novick & Bassok on problem solving and processes.
expertise.)
Reasoning Processes
Domain Specificity versus Generality
Piaget based his theory of cognitive devel-
The view that cognitive processes are opment on the child’s progression through
domain-specific rather than domain-general increasingly complex logics, but this ap-
has developed in parallel with knowledge proach has not been generally successful
acquisition theories of cognitive develop- as a way of modeling children’s reasoning
ment and has been reinforced by Fodor’s (Halford, 1 993 ; Osherson, 1 974). Consider-
proposal (1 983 ) that many cognitive pro- able success has been achieved in account-
cesses are performed by specialized mod- ing for adult reasoning using mental models
ules. For example, it has been proposed (Johnson-Laird & Byrne, 1 991 ), analogies
that conditional reasoning (i.e., reasoning (Gentner & Markman, 1 997; Hofstadter,
in which the major premise has the form 2001 ; Holyoak & Hummel, 2001 ; Holyoak
“if-then”) might depend on a module for & Thagard, 1 995 ), schemas (Cheng et al.,
cheater detection (Cheng & Holyoak, 1 989; 1 986), and heuristics (Kahneman, Slovic, &
Cosmides & Tooby, 1 992; but see Cosmides Tversky, 1 982).
& Tooby, 1 989), that understanding mathe- Analogical reasoning is reviewed by
matics might depend on innate enumeration Holyoak (Chap. 6), but the implications for
processes (Gelman, 1 991 ), or that reasoning understanding cognitive development are
about cause might be facilitated by a mod- considered here. An analogy is a structure-
ule for processing causal information (Leslie preserving map from a base or source to
& Keeble, 1 987). One achievement has been a target (Gentner, 1 983 ; Holyoak & Tha-
to show that young children understand gard, 1 989). The map is validated by struc-
the distinction between artifacts and natu- tural correspondence rather than similar
ral kinds (Keil, 1 991 ) and have considerable elements. Structural correspondence is de-
knowledge of basic facts about the world. fined by two principles; uniqueness of map-
For example, they understand that animals ping implies that an element in the base
move autonomously, have blood, and can is mapped to one and only one element
die (Gelman, 1 990; Keil, 1 995 ). The disin the target; symbol-argument consistency
tinction between animate and inanimate ob- implies that if a relation symbol r in one
jects even seems to be appreciated in infancy structure is mapped to the relation symbol
(Gergely et al., 1 995 ). One result of these r in the other structure, the arguments
developments has been an increasing bio- of r are mapped to the arguments of r
logical perspective in theories of children’s and vice versa. These principles operate as
reasoning (Kenrick, 2001 ). Domain-specific soft constraints and can be violated in small
knowledge must now be seen as having a ma- parts of the mapping if the overall mapping
jor influence on the developing cognitions of conforms to the criteria. Success in map-
children, but it does not displace domain- ping depends on representation of the corre-
general knowledge entirely. Basic cognitive sponding relations in the two structures and
operations such as memory retrieval, and on ability to retrieve the relevant representa-
basic reasoning mechanisms such as anal- tions, which, in turn, depends on knowledge
ogy and means-end analysis, are applicable of the domain. Research on children’s ana-
across domains. Furthermore, some higher logical reasoning is reviewed by Goswami
reasoning processes such as transitive infer- (1 998, 2002).
ence and classification are found to corre- Numerous studies have assessed young
spond across domains (Andrews & Halford, children’s ability to perform simple propor-
2002). Theories such as that of Case (1 985 ; tional analogy – that is, problems of the
Case et al., 1 996) recognize the importance form A is to B as C is to D. Brown (1 989)
showed that children as young as three years Mental models have been found effec-
could use analogies for both learning and tive for providing explanations of human
problem solving if they understood the rel- reasoning (Johnson-Laird & Byrne, 1 991 ;
evant relations, were able to retrieve them Johnson-Laird, Chap. 9). A mental model
from memory, and understood the aims of is more content-specific than a logical rule
the task. This was borne out by Goswami and is used by analogy. Gentner and Gen-
(1 989), who showed that three-, four-, and tner (1 983 ) showed that high school and
six-year old children could perform analo- college students could use water flowing
gies based on relations they understood, such in pipes as mental models of electricity,
as cutting or melting (e.g., chocolate:melted so pipes were mapped to conductors, con-
chocolate::snowman:melted snowman). In strictions in pipes were mapped to resis-
a less tightly structured context, Gentner tors, water pressure to voltage, water flow
(1 977) showed that four- to six-year-old to electric current, and reservoirs to bat-
children could map human body parts to teries. Furthermore, reservoirs placed above
inanimate objects such as trees (e.g., if a tree one another were mapped into batteries in
had a knee it would be on the trunk a short series, and the increase in water pressure
distance above the ground). There appears was mapped to the increase in voltage, and
to be consensus now that young children so on.
can perform analogies with simple relations It appears that mental models are also an
if they have the relevant domain knowledge effective way of accounting for development
and if the test format is appropriate to the of reasoning in children (Barrouillet, Gros-
age of the children. set, & Lecas, 2000; Barrouillet & Lecas, 1 999;
Young children can also use analogies for Halford, 1 993 ; Markovits & Barrouillet,
problem solving (Brown, Kane, & Echols, 2002). Marcovits and Barrouillet (2002)
1 986; Crisafi & Brown, 1 986; Holyoak, Junn, have developed a mental models theory that
& Billman, 1 984). In the study by Holyoak accounts for most of the data on children’s
et al. (1 984), children were told a story about conditional (if-then) reasoning. Condition-
a genie who transferred jewels from one als may refer to classes (e.g., if X is a dog
bottle to another by rolling his magic car- then X is an animal, or simply, all dogs are
pet into a tube and rolling the jewels down animals) or to causal relations (e.g., if it rains,
it. Then they were given the problem of the ground will get wet). For the problem,
transferring gumballs from one jar to an- if p then q, p therefore q (modus ponens),
other using a tube made by rolling a sheet of construction of a mental model begins with
heavy paper. Even four-year-olds showed ev- the following representation:
idence of analogical reasoning. Gholson et al.
(1 996) tested children from first to fifth p q
grade on transfer from missionaries and can- ...
nibals problems to jealous husbands prob-
This represents the case in which p and
lems, both of which require a sequence of
q are both true. The model could now be
moves to be selected for transferring peo-
fleshed out with other possibilities as follows
ple from one place to another without vi-
(where ¬p is read as “not p”):
olating constraints. In a second experiment
they used similar problems that required a p q
sequence of arithmetic steps to be chosen. ¬p ¬q
The children showed evidence of analogi- ¬p q
cal transfer, based on representation of com-
mon relations. Pauen and Wilkening (1 997) The second premise “p” is processed by
found evidence that second- and fourth- selecting those components of the model
grade children transferred selected aspects where p is true, in this case, the first line;
of balance scale problems to simple physi- then inference is made by examining these
cal force problems. cases. In this model, in the only case in which
p is true, q is also true; thus, the inference is and Barrouillet (2002) can handle both
“q.” For example, if the major premise were content effects (Barrouillet & Lecas, 1 998;
“if an animal is a dog then it has legs,” then Leevers & Harris, 1 999) and complexity ef-
the initial model would be fects (Halford, Wilson, & Phillips, 1 998).
Other studies of children’s conditional
dog legs
reasoning have utilized the Wason selection
task (Wason, 1 966; see also Evans, Chap. 8).
This could be fleshed out with alternative
The task entails four cards containing (say)
cases such as
an A, B, 4, and 7, and participants are told
not dog no legs that there is a letter on one side of each
not dog legs card and a number on the other. They are
asked which cards must be turned over to
Thus the premises are processed as rela- test the proposition that if there is an A on
tional propositions, referring to specific in- one side there must be a 4 on the other.
stances, and are fleshed out by retrieving The correct answer, cards containing the A
relevant information from semantic mem- and 7, is rare even among adults. There are
ory. The accuracy of children’s reasoning de- well-known content effects, and it has been
pends on the fleshing out process, which is shown that versions of the task based on per-
influenced by availability of relevant infor- mission (Cheng et al., 1 986) or cheater de-
mation in memory and by working memory tection (Cosmides & Tooby, 1 992) are per-
capacity. The minor premise “not dog” can formed better. Similar improvements have
produce the fallacious inference “no legs” been observed in children (Cummins, 1 996;
(denial of the antecedent) if the second line Light et al., 1 989).
of the mental model is missing. This could The literature supports the claim that
occur if the child failed to retrieve any cases conditional reasoning is possible for chil-
of things that are not dogs but have legs. dren, even as young as four, and improve-
Similarly, the minor premise “legs” can pro- ments can be produced by more appro-
duce the fallacious inference “dog” (affirma- priate task presentation (Markovits et al.,
tion of the consequent). Markovits (2000) 1 996) and by experience, but consider-
has shown that children are more likely to able development occurs throughout child-
recognize that these inferences are not justi- hood (Muller, Overton, & Reene, 2001 ).
fied if they can readily generate the alterna- Among the relatively late-developing com-
tive cases. In the aforementioned example, it petences are understanding of logical ne-
is easy to generate instances of things that are cessity (Falmagne, Mawby, & Pea, 1 989;
not dogs but have legs. In a problem such as Kuhn, 1 977; Morris & Sloutsky, 2002; Os-
“if something is a cactus then it has thorns,” herson & Markman, 1 975 ) and reasoning
generation of alternative cases is more diffi- that requires representation of complex
cult, and children are less likely to recognize relations.
the fallacies. The second major factor is pro-
cessing (working memory) capacity. More
complex problems entail representation of
more relations. The example just given ef- Elementary Concepts
fectively entails three relations correspond-
ing to the three lines of the mental model. Conservation, transitivity, and classification
A simpler problem would consist of only are three concepts that have long been con-
the first and second lines and corresponds sidered fundamental to children’s reasoning.
to a biconditional interpretation of the ma- All have been controversial because Piaget’s
jor premise. This representation is simpler. claim that they are concrete operational and
Increase in effective capacity with age en- unattainable before seven to eight years has
ables children to reason correctly on more been contested by many researchers. We
complex problems. The model of Marcovits briefly consider each in turn.
Conservation claimed, but not as early as claimed by

Bryant (1 972).
Perhaps the most widely researched concept
in the field, conservation, is still not well
Transitivity and Serial Order
understood. The Q-SOAR model of conser-
vation acquisition was briefly reviewed ear- A transitive inference has the form: aRb,
lier, and other models exist (Caroff, 2002; bRc implies aRc, if R is a transitive rela-
Halford, 1 970; Shultz, 1 998; Siegler, 1 995 ). tion. For example, a > b, b > c implies a >
However, the issues that have received most c. Piaget’s claim of late attainment was chal-
attention in the literature concern how con- lenged by Bryant and Trabasso (1 971 ), who
servation should be measured and the age trained three- to six-year-old children to
at which children master it. A number remember the relative lengths of adjacent
of authors have argued that the Piagetian sticks in a series (e.g., a < b, b < c, c < d,
tests misled children and therefore under- d < e). Then they were tested on all possi-
estimated their understanding. A common ble pairs. The crucial pair is b?d because this
cause of the alleged misunderstanding is was not learned during training and must be
that an increase in the length of a row of inferred from b < c, c < d. Also, the bd pair
objects (or an increase in the height of a avoids the end elements, which tend to be
column of liquid) makes the number (or labeled as small (a) or large (e). Bryant and
amount) appear greater. This received sub- Trabasso found that three- and four-year-
stantial support from a study by Gelman old children performed above chance on the
(1 969), who used an oddity training pro- bd pair, suggesting that they made a tran-
cedure to induce five-year-old children to sitive inference. Riley and Trabasso (1 974)
attend to number rather than length and showed that both children and adults per-
showed that they conserved number. This formed the task by ordering the elements –
interpretation received further support from that is, they formed the ordered set a,b,c,d,e.
McGarrigle and Donaldson (1 975 ), who im- This in itself does not affect the validity
proved conservation in children aged four of the test because an asymmetric, transi-
to six years by having a “naughty teddy” tive binary relation is a defining property
perform the transformation, thereby mak- of an ordered set, so the children presum-
ing it accidental and removing any sugges- ably utilized transitivity in some way while
tion that an increase in amount was in- ordering the elements. The problem, how-
tended. These studies, like a host of others, ever, was that, to facilitate acquisition, the
showed improved performance in children premise pairs were presented initially in as-
about five to six years of age. However, cending or descending order (a < b, b < c,
Bryant (1 972) eliminated length cues and c < d, d < e, or the reverse). This clearly
showed that three- and four-year-old chil- gave children undue help in ordering the
dren carried a pretransformation judgment elements. Furthermore, children who failed
over into the posttransformation situation. to learn the premise pairs were eliminated,
However, his claim that this demonstrated and elimination rates were as high as 5 0% in
conservation was disputed by Halford and some experiments. The problem with this
Boyle (1 985 ). Sophian (1 995 ) also failed to is that children might have failed to learn
replicate Bryant’s finding of early conserva- the premises because they could not deter-
tion and showed that conservation was re- mine the correct order, which, in turn, might
lated to understanding of counting, suggest- reflect lack of understanding of transitivity.
ing that conservation reflects some aspects When Kallio (1 982) and Halford and Kelly
of children’s quantitative concepts. From ex- (1 984) eliminated these extraneous sources
tensive reviews of the conservation literature of help, success was not observed below five
(Halford, 1 982, 1 989), it seems there is clear years of age. Subsequent research (Andrews
evidence of conservation at approximately & Halford, 1 998; Pears & Bryant, 1 990)
five years of age, which is earlier than Piaget has confirmed that transitive inference is
understood by only a minority of four-year- Rosch et al., 1 976) and can recognize that
olds, and the median age of attainment is a set of objects with similar features, such as
about five years. For more extensive reviews animals (dogs, cats), form categories (Quinn,
and theoretical discussions, see Brainerd & 2002). They are also sensitive to the correla-
Reyna (1 993 ), Breslow (1 981 ), Thayer & tion between attributes (Younger & Fearing,
Collyer (1 978), and Halford (1 982, 1 993 ). 1 999). However, prototypes are arguably
A derivative of the Bryant and Trabasso subsymbolic because they are well simulated
(1 971 ) paradigm, transitivity of choice, has by three-layered nets (Quinn & Johnson,
found wide use in animal studies (Boysen 1 997) and do not have properties such as
et al., 1 993 ; Chalmers & McGonigle, 1 984; compositionality that are basic to symbolic
von Fersen et al., 1 991 ; McGonigle & processes (Fodor, 1 995 ; Halford, Phillips, &
Chalmers, 1 977; Terrace & McGonigle, Wilson, unpublished manuscript). Mandler
1 994). Participants are trained to choose one (2000) argues that infants make the transi-
member of each pair in a series. For example, tion from perceptual categories, which en-
they are rewarded for choosing A in prefer- able objects to be recognized by their ap-
ence to B, B in preference to C, C in prefer- pearance, to conceptual categories, defined
ence to D, and D in preference to E. Tran- by the role objects play in events and that
sitivity of choice is indicated by choice of B serve as a basis for inductive inference.
in preference to D. However, whereas tran- Neural nets are a very suitable basis for
sitive inference implies an ordinal scale of constructing models of prototype forma-
premise elements, in transitivity of choice tion, and McClelland and Rumelhart (1 985 )
there is no such scale (Markovits & Du- produced an early prototype model that fun-
mas, 1 992). Furthermore, whereas the transi- damentally changed the way we view cate-
tive inference task is performed dynamically gorization. Quinn and Johnson (1 997) devel-
in working memory, following a single pre- oped a three-layered net model of prototype
sentation of premises, the premise pairs in formation in infants. There were thirteen in-
transitivity of choice are learned incremen- put nodes that encoded attributes of picto-
tally over many trials, and the task can be rial instances of four animals (cats, dogs, ele-
performed by associative processes (Wynne, phants, rabbits) and four kinds of furniture
1 995 ). Although both paradigms are impor- (beds, chairs, dressers, tables). There were
tant, transitive inference and transitivity of three hidden units and ten output units, two
choice should not be regarded as equivalent of which coded for category (animals, furni-
tests of the transitivity concept. ture) and the remainder coded for the eight
instances. After the net was trained to rec-
Classification ognize categories and instances, the repre-
Concepts and categories are reviewed by sentations in the hidden units were exam-
Medin and Rips (Chap. 3 ). Developmen- ined. Initially there was no differentiation,
tally, categorization appears to progress from then the units differentiated mammals and
prototypes, arguably the most basic form of furniture, then instances were distinguished
categorization, to more advanced categories, within the categories. The study is important
including those based on rules or theories. for showing how categories can be formed
All advanced categories appear to have a la- by a learning algorithm. As with the mod-
bel or symbol (e.g., “dog” for the dog cate- els of McClelland (1 995 ) and Elman (1 990)
gory) so it will be convenient to deal with discussed earlier, the representations emerge
them under the heading of symbolic cate- from the learning process.
gories.
symbolic categories
prototype models of classification Even young children form categories based
There is evidence that infants can form pro- on, and draw inductive inferences about, es-
totypic categories (Rosch & Mervis, 1 975 ; sential or nonobvious properties (Gelman,
2000). This means that objects are catego- ternative interpretations. Furthermore, tech-
rized on the basis of hidden properties that niques for estimating the number of answers
cause their surface or observable features, so attributable to misinterpretation or guess-
animals contain essential biological material ing have been developed (Hodkin, 1 987;
that enable them to move, eat, make charac- Thomas & Horton, 1 997).
teristic sounds, and reproduce animals of the Alternative assessments have been de-
same kind. Categorization by essential prop- vised, one based on a sorting task that was
erties is sometimes interpreted as evidence isomorphic to class inclusion but did not in-
that people have theories about the domain, clude potentially misleading questions (Hal-
such as theories about the nature of animals, ford & Leitch, 1 989) and one based on
although there are also interpretations based property inference (Greene, 1 994; Johnson,
on causal laws rather than essences (Re- Scott, & Mervis, 1 997). Understanding class
hder & Hastie, 2001 ; Strevens, 2000). There inclusion entails recognition of the asym-
is strong evidence that young children can metric relation between categories at dif-
make inductive inferences based on category ferent levels of the hierarchy. For example,
membership. Gelman and Markman (1 986) properties of fruit apply to apples, but the re-
presented four-year-olds with a picture of a verse is not necessarily true, because apples
bird, told the children a property of the bird may have properties not shared with other
(feeds its young with mashed up food) and fruit. Halford, Andrews, & Jensen (2002)
found that the children attributed the prop- assessed category induction and class in-
erty to other, even dissimilar, birds but not clusion by equivalent methods, based on
to a different category such as bats. Young property inference. Relational complexity
children appear to generalize even nonob- analysis showed that category induction is
servable properties on the basis of category binary relational, because it entails a com-
membership, independent of appearance. parison of a class with its complement
The ease with which young children (e.g., birds and non-birds). Class inclusion is
make inductive inferences about categories ternary relational because it necessarily en-
contrasts with the difficulty they have in rea- tails an inclusive class (e.g., fruit), a sub-
soning about hierarchically structured cate- class (e.g., apples), and a complementary
gories. For example, given twelve apples and subclass (non-apple fruit). When assessed
three oranges, when asked whether there are by equivalent methods, class inclusion was
more apples or more fruit, they tend to say found to be more difficult and performance
there are more apples. This task is a deriva- on it was predicted by ternary relational
tive of class inclusion items originally used by tasks from other domains. This suggests that
Inhelder & Piaget (1 964). The Piagetian hy- category induction and class inclusion are
pothesis was that children lacked a concept really the same paradigm at two levels of
of inclusion till they reached the concrete complexity.
operations stage, but many alternative hy- Conservation, transitivity, and class inclu-
potheses have been proposed (McGarrigle, sion are all ternary relational (Andrews &
Grieve, & Hughes, 1 978; Siegel et al., 1 978; Halford, 1 998; Andrews & Halford, 2002;
Winer, 1 980). Misinterpretation of the ques- Halford, Wilson, & Phillips, 1 998) and this
tion is the common feature in these pro- level of complexity is attainable by ap-
posals. That is, children interpret “more approximately twenty percent of four-year
ples or more fruit” to mean “more apples olds, 5 0% of five-year-olds, 70% of 6-year-
or more other kinds of fruit,” and because olds, and 78% of seven- and eight-year-olds.
there are only three pieces of non-apple fruit There is no age at which children sud-
(oranges), they say there are more apples. denly attain all these concepts, as implied
On the other hand Halford (1 989) argued by some interpretations of Piagetian stage
that many of the improved performances theory. Rather, the proportion of children
produced by alternative tests were no bet- who succeed increases according to a bio-
ter than chance, or were amenable to al- logical growth function. We can conclude
that all are attained at a median age of five or module, that is specialized for processing
years. social cues indicating mood, interest, or at-
tention.
There is growing evidence that concept of
Concept of Mind
mind is related to executive function (Carl-
Children’s ability to understand other peo- son & Moses, 2001 ; Perner, Lang, & Kloo,
ple’s mental states has been one of the 2002) and is partly a function of ability
most intensively researched topics in the to deal with the appropriate level of com-
past two decades. See Astington (1 993 ) or plexity. Halford (1 993 ; Halford, Wilson, &
Wellman, Cross, and Watson (2001 ) for re- Phillips, 1 998) analyzed the complexity of
views. Two main types of tasks have been concept of mind tasks and showed it en-
employed – appearance-reality and false be- tails integrating three variables – the envi-
lief. Appearance-reality is tested by present- ronmental cue, the setting condition, and the
ing children with an object that appears to be person’s representation. Appearance-reality
something else and asking them what it re- requires processing the relation between ob-
ally is and what it appears to be. For example, ject color (white), the color of the filter
Flavell, Green, and Flavell (1 986) showed (blue), and the percept (white or blue). Evi-
children a small white fish and then covered dence that complexity is a factor in concept
it with a blue filter and asked what color it of mind has been produced by several groups
was really and what color it appeared to their of researchers (Andrews et al., 2003 ; Davis &
eyes. Children below about four years have Pratt, 1 995 ; Frye, Zelazo, & Palfai, 1 995 ; Gor-
difficulty recognizing both that the object is don & Olson, 1 998; Halford, 1 993 ; Keenan,
really white and that it appears blue. In a Olson, & Marini, 1 998).
typical false-belief task (Wimmer & Perner, The analysis showing that concept of
1 983 ), Person 1 hides an object in a box and mind requires processing ternary relations
leaves the room, then Person 2 shifts the ob- suggests this should not be possible for chim-
ject to a basket, and then Person 1 returns. panzees because the most complex relation
Before age four, children have difficulty rec- they have been shown to process is binary
ognizing that Person 1 will look for the object (Halford, Wilson, & Phillips, 1 998). Al-
in the box because he or she did not see it though the issue has been controversial, a
moved to the basket. well-controlled study by Call and Tomasello
Numerous factors have been shown to (1 999) tends to support this prediction. (See
influence children’s concept of mind, in- also Tomasello & Call, Chap. 25 .)
cluding social-perceptual knowledge (Tager-
Flusberg & Sullivan, 2000), understanding
Scientific Thinking
of mental states (Bretherton & Beeghly,
1 982), and language (Astington & Jenkins, The topics reviewed so far on children’s un-
1 999). Astington & Gopnik (1 991 ) proposed derstanding of conservation, transitive infer-
a theory-theory, meaning that children’s ence, serial order, classification, cause, and
concepts of belief, desire, and pretence biological processes are all important to the
are linked in an explanatory framework. development of scientific and mathemati-
The more neutral term “concept of mind” cal thinking. In this section, we consider
is used here, even though “theory of mind” some of the more advanced forms of sci-
is in common use, because there are still entific and mathematical reasoning in chil-
doubts that children’s understanding of the dren. (See also Chap. 5 on causal reasoning
mind amounts to a theory. For example, by Buehner & Cheng, Chap. 23 on mathe-
telling children that people’s thoughts can be matical thinking by Gallistel & Gelman, and
wrong or reminding them of their own false Chap. 29 on reasoning in science by Dunbar
beliefs does not raise three-year-olds’ per- & Fugelsang.)
formance above chance. Leslie (1 987) pro- Whether children think as young scien-
posed an innate theory of mind mechanism, tists has been a major question of interest
motivated partly by evidence that catego- Children from five years to adulthood
rization and concept of mind are driven showed evidence of the multiplicative rule.
by theories of the domain, as noted earlier. A similar assessment of children’s under-
Kuhn et al. (1 988) investigated how children standing of area indicated gradual progres-
assessed evidence in order to test hypothe- sion from additive rule (area = length +
ses, and concluded that there was a con- width) to a multiplicative rule (area =
siderable lack of scientific objectivity, espe- length × width) by adulthood. A cascade
cially among the younger children. Similarly, correlation model of time, distance, and ve-
Klahr, Fay, and Dunbar (1 993 ) found strong locity judgments is provided by Buckingham
developmental effects in a study of ability to and Shultz (2000).
design experiments to determine rules un-
derlying operation of a robot. On the other Causal Reasoning
hand, Ruffman et al. (1 993 ) found evidence
Infants are able to perceive causal links be-
that six-year-olds have some understanding
tween entities (Leslie & Keeble, 1 987), but
of how covariation evidence has implications
the causal reasoning of older children seems
for hypotheses about factors responsible for
to be influenced by complexity (Brooks,
an event. There is also evidence that chil-
Hanauer, & Frye, 2001 ; Frye et al., 1 996)
dren as young as five recognize the eviden-
or by concept availability (Ackerman, Silver,
tial diversity principle – that we can be more
& Glickman, 1 990). The explanation may
confident of a induction from a set of diverse
be that, as Leslie and Keeble (1 987) sug-
premises than from a set of similar premises
gest, the causal recognitions of infants are
(Heit & Hahn, 2001 ; Lo et al., 2002; Sloman,
based on a modular process that is essen-
Chap. 3 , this volume). A theoretical account
tially perceptual. Modular processes are not
of the development of inductive reasoning is
typically influenced by cognitive complex-
given by Kuhn (2001 ), and a review of the
ity. The causal reasoning of older children
development of scientific reasoning skills is
probably depends on more conceptual or
provided by Zimmerman (2000).
symbolic processes (Schlottman, 2001 ).
Time, Speed, Distance, and Area Balance Scale

Understanding time, speed, and distance is Siegler (1 981 ) applied the rule assessment
interesting because it entails relations among approach to children’s performance on the
three variables; speed = distance − time−1 ; balance scale, yielding the four rules that
and this relation should be accessible from were discussed in connection with McClel-
other directions, so distance = speed × time, land’s neural model (1 995 ). Siegler’s data
and so on. Matsuda (2001 ) found a pro- showed Rule I (judgments based solely on
gression from considering relations between weight) was used by five-year-olds, and
two variables (e.g., between duration and they could also be taught to use Rule II
distance or between distance and speed) (distance considered if weights are equal).
at four years to integration of all three di- Surber and Gzesh (1 984) used an informa-
mensions by age 1 1 . Wilkening (1 980) used tion integration approach and found that
an information integration theory approach five-year-olds tended to favor the distance
in which the variance in children’s judg- rule. Case (1 985 ), Marini (1 984), and Jansen
ments of distance was assessed as a function and van der Maas (1 997) generally supported
of speed and duration. In the information Siegler’s findings and saw little understand-
integration approach, reliance on a factor ing of the balance scale before age five.
is indicated by a main effect, and reliance Relational complexity theory (Halford,
on the product of speed and duration is in- 1 993 ; Halford, Wilson, & Phillips, 1 998) pro-
dicated by an interaction of these factors. poses that discrimination of weights with
Integration by an additive rule, speed + distance constant, or distances with weight
distance, is indicated by two main effects. constant, entails processing binary relations
and should be possible for two-year olds. extensive assessment of the literature to try
This prediction was contrary to previous the- to identify some of the more promising de-
ory (Case, 1 985 ) and empirical observation velopments. My bias is to look for develop-
(Siegler, 1 981 ). Integration of weight and ments that might provide a coherent body
distance requires at least ternary relations of theory because this is what the field of
and should emerge along with other ternary cognitive development, like the rest of psy-
relational concepts at age five years. These chology, needs most. I have identified four
predictions were confirmed by Halford trends I feel deserve consideration in this
et al. (2002). respect.
Concept of the Earth Neuroscience and the Biological

The development of children’s concept of Perspective
the earth (Hayes et al., 2003 ; Samarapunga- The greatly increased knowledge of neuro-
van, Vosniadou, & Brewer, 1 996; Vosniadou science and the use of brain imaging as a
& Brewer, 1 992) has special interest because converging operation to help constrain theo-
it entails a conflict between the culturally ries of cognitive development represent ma-
transmitted conception that the Earth is a jor developments in the past two decades.
sphere and everyday experience that tends Combined with the biological perspective,
to make it appear flat. Resolution of this they do offer some hope of a coherent
conflict entails recognition that the huge cir- framework for viewing cognitive develop-
cumference of the earth makes it appear flat mental data. The identification of changes
from the surface. There is also a conflict bein rates of neural and cognitive develop-
tween gravity, naively considered as making ment is one example of what this field has
objects fall down, and the notion that people achieved.
can stand anywhere on the Earth’s surface,
including the southern hemisphere, which Dynamic Systems
is conventionally regarded as “down under.”
This can be resolved by a concept of gravity Dynamic systems models have made con-
as attraction between two masses, the Earth siderable progress, and they are much more
and the body (person) on the surface, but clearly linked to data than was the case a
there is little basis for this concept in ev- decade ago. They can provide new perspec-
eryday life. The development of children’s tives on important issues such as whether de-
concept of the Earth provides an interest- velopment is continuous or discontinuous or
ing study in the integration of complex re- the fact that performance might be uneven
lations into a coherent conception. Young across different indicators of the same task.
children were found to attempt resolution of The relative importance of different classes
the conflicting ideas by, for example, draw- of observations might change fundamentally
ing a circular earth with a horizontal plat- with this perspective.
form inside for people to stand on, or as a
flattened sphere to provide more standing Transition Mechanisms
room at the top. Nevertheless, there was a
Transition mechanisms and more advanced
clear tendency for ideas to develop toward
conceptions of learning have provided real
coherence.
conceptual advances and some of the most
important empirical findings in the past
decade. Neural net models have defined
Conclusions and Future Directions some potential mechanisms of concept ac-
quisition that would almost certainly never
Although acknowledging that predictions have been recognized intuitively and would
of future developments are inherently have been very difficult to discover using our
hazardous, it seems appropriate after an contemporary empirical methods.
Analyses of Underlying Processes language and theory-of-mind development.

Developmental Psychology, 3 5 , 1 3 1 1 –1 3 20.
Analyses of underlying processes using
Baillargeon, R. (1 995 ). A model of physical rea-
methods from general cognitive psychol- soning in infancy. In C. Rovee-Collier & L. P.
ogy and cognitive science can help bring Lipsitt (Eds.), Advances in Infancy Research,
order and clarity to the field. There are vol. 9 (pp. 3 05 –3 71 ). Norwood, NJ: Ablex.
many examples of tasks that are superficially Baillargeon, R., Graber, M., DeVos, J., & Black,
similar (such as transitive inference and J. (1 990). Why do young infants fail to search
transitivity of choice) yet entail fundamen- for hidden objects? Cognition, 3 6, 25 5 –284.
tally different processes, and it only creates Barrouillet, P., Grosset, N., & Lecas, J. (2000).
confusion to categorize them together. Cor- Conditional reasoning by mental models:
respondingly, there are tasks that are super- Chronometric and developmental evidence.
ficially very different, yet may entail un- Cognition, 75 , 23 7–266.
derlying cognitive processes with important Barrouillet, P., & Lecas, J. (1 998). How can men-
common properties. An example would be tal models theory account for content effects
the corresponding difficulties of the dimen- in conditional reasoning? A developmental per-
sional change card sort task, and ternary respective. Cognition, 67, 209–25 3 .
lational tasks such as transitivity, class inclu- Barrouillet, P., & Lecas, J. (1 999). Mental models
sion, and the concept of mind. We cannot in conditional reasoning and working memory.
order tasks for difficulty, nor discover impor- Thinking and Reasoning, 5 , 289–3 02.
tant equivalences, unless we look beneath Beilin, H. (1 992). Piaget’s enduring contribution
surface properties. Cognitive psychology has to developmental psychology. Developmental
progressed to the point at which we can do Psychology, 2 8(2), 1 91 –204.
this with reasonable confidence. Bjorklund, D. F. (1 997). In search of a metatheory
for cognitive development (or Piaget is dead
and I don’t feel so good myself). Child Devel-
opment, 68, 1 44–1 48.
References Boole, G. (1 85 4/1 95 1 ). An Investigation of the
Laws of Thought, on which are founded the Math-
Ackerman, B. P., Silver, D., & Glickman, I. (1 990). ematical Theories of Logic and Probabilities. New
Concept availability in the causal inferences York: Dover.
of children and adults. Child Development, 61 , Boysen, S. T., Berntson, G. G., Shreyer, T. A.,
23 0–246. & Quigley, K. S. (1 993 ). Processing of or-
Andrews, G., & Halford, G. S. (1 998). Children’s dinality and transitity by chimpanzees (pan
ability to make transitive inferences: The im- troglodytes). Journal of Comparative Psychol-
portance of premise integration and structural ogy, 1 07, 1 –8.
complexity. Cognitive Development, 1 3 , 479– Brainerd, C. J. (1 978). The stage question in cog-
513. nitive developmental theory. The Behavioral
Andrews, G., & Halford, G. S. (2002). A complex- and Brain Sciences, 2 , 1 73 –21 3 .
ity metric applied to cognitive development. Brainerd, C. J., & Reyna, V. F. (1 993 ). Memory in-
Cognitive Psychology, 45 , 1 5 3 –21 9. dependence and memory interference in cog-
Andrews, G., Halford, G. S., Bunch, K.M, Bow- nitive development. Psychological Review, 1 00,
den, D., Jones, T. (2003 ). Concept of mind 42–67.
and relational complexity. Child Development, Bray, N. W., Reilly, K. D., Villa, M. F., & Grupe,
74(5 ), 1 476–1 499. L. A. (1 997). Neural network models and
Astington, J. W. (1 993 ). The Child’s Discovery mechanisms of strategy development. Develop-
of Mind. Cambridge, MA: Harvard University mental Review, 1 7, 5 25 –5 66.
Press. Breslow, L. (1 981 ). Reevaluation of the literature
Astington, J. W., & Gopnik, A. (1 991 ). Theoreti- on the development of transitive inferences.
cal explanations of children’s understanding of Psychological Bulletin, 89, 3 25 –3 5 1 .
the mind. British Journal of Developmental Psy- Bretherton, I., & Beeghly, M. (1 982). Talking
chology, 9, 7–3 1 . about internal states: The acquisition of an ex-
Astington, J. W., & Jenkins, J. M. (1 999). A plicit theory of mind. Developmental Psychol-
longitudinal study of the relation between ogy, 1 8, 906–921 .
Briars, D., & Siegler, R. S. (1 984). A featural anal- Case, R. (1 992a). The Mind’s Staircase: Explor-
ysis of preschoolers’ counting knowledge. De- ing the Conceptual Underpinnings of Children’s
velopmental Psychology, 2 0, 607–61 8. Thought and Knowledge. Hillsdale, NJ: Erl-
Brooks, P. J., Hanauer, J. B., & Frye, D. (2001 ). baum.
Training 3 -year-olds in rule-based causal rea- Case, R. (1 992b). The role of the frontal lobes in
soning. British Journal of Developmental Psychol- the regulation of cognitive development. Brain
ogy, 1 9, 5 73 –5 95 . and Cognition, 2 0, 5 1 –73 .
Brown, A. L. (1 989). Analogical learning and Case, R., Okamoto, Y., Griffin, S., McKeough, A.,
transfer: What develops? In S. Vosniadou & A. Bleiker, C., Henderson, B., et al. (1 996). The
Ortony (Eds.), Similarity and Analytical Rea- role of central conceptual structures in the de-
soning (pp. 3 69–41 2). New York: Cambridge velopment of children’s thought. Monographs
University Press. of the Society for Research in Child Development.,
Brown, A. L., Kane, M. J., & Echols, C. H. 61 .
(1 986). Young children’s mental models deter- Ceci, S. J., & Howe, M. J. (1 978). Age-related
mine analogical transfer across problems with a differences in free recall as a function of re-
common goal structure. Cognitive Development, trieval flexibility. Journal of Experimental Child
1, 1 03 –1 21 . Psychology, 2 6, 43 2–442.
Bruner, J. S., Olver, R. R., & Greenfield, P. M. Chalmers, M., & McGonigle, B. (1 984). Are chil-
(1 966). Studies in cognitive growth. New York: dren any more logical than monkeys on the
Wiley. five-term series problem? Journal of Experimen-
Bryant, P. (Ed.) (2002). Constructivism today. tal Child Psychology, 3 7, 3 5 5 –3 77.
Cognitive Development, 1 7(3 –4), 1 283 –1 5 08. Chapman, M. (1 987). Piaget, attentional capac-
Bryant, P. E. (1 972). The understanding of invari- ity, and the functional limitations of formal
ance by very young children. Canadian Journal structure. Advances in Child Development and
of Psychology, 2 6, 78–96. Behavior, 2 0, 289–3 3 4.
Bryant, P. E., & Trabasso, T. (1 971 ). Transitive in- Chapman, M. (1 990). Cognitive development
ferences and memory in young children. Na- and the growth of capacity: Issues in NeoPiage-
ture, 2 3 2 , 45 6–45 8. tian theory. In J. T. Enns (Ed.), The Development
Buckingham, D., & Shultz, T. R. (2000). The of Attention: Research and Theory (pp. 263 –
developmental course of distance, time, and 287). Amsterdam: Elsevier Science Publishers.
velocity concepts: A generative connectionist Chapman, M., & Lindenberger, U. (1 989). Con-
model. Journal of Cognition and Development, crete operations and attentional capacity. Jour-
1, 3 05 –3 45 . nal of Experimental Child Psychology, 47, 23 6–
Call, J., & Tomasello, M. (1 999). A nonverbal 25 8.
false belief task: The performance of children Cheng, P. W., & Holyoak, K. J. (1 989). On the
and great apes. Child Development, 70, 3 81 – natural selection of reasoning theories. Cogni-
3 95 . tion, 3 3 , 285 –3 1 3 .
Carey, S. (1 991 ). Knowledge acquisition: Enrich- Cheng, P. W., Holyoak, K. J., Nisbett, R. E., &
ment or conceptual change? In S. Carey & R. Oliver, L. M. (1 986). Pragmatic versus syntac-
Gelman (Eds.), The Epigenesis of Mind: Essays tic approaches to training deductive reasoning.
on Biology and Cognition (pp. 25 7–291 ). Hills- Cognitive Psychology, 1 8, 293 –3 28.
dale, NJ: Erlbaum. Chi, M. T. H. (1 976). Short-term memory lim-
Carey, S., & Gelman, R. (1 991 ). The Epigenesis of itations in children: Capacity or processing
Mind: Essays on Biology and Cognition. Hills- deficits? Memory and Cognition, 4, 5 5 9–5 72.
dale, NJ: Erlbaum. Chomsky, N. (1 980). Rules and representations.
Carlson, S. M., & Moses, L. J. (2001 ). Individual The Behavioral and Brain Sciences, 3 , 1 –61 .
differences in inhibitory control and children’s Cosmides, L., & Tooby, J. (1 989). Evolution-
theory of mind. Child Development, 72 , 1 03 2– ary Psychology and the generation of culture:
1 05 3 . II. Case study: A computational theory of so-
Caroff, X. (2002). What conservation anticipa- cial exchange. Ethology and Sociobiology, 1 0,
tion reveals about cognitive change. Cognitive 5 1 –97.
Development, 1 7, 1 01 5 –1 03 5 . Cosmides, L., & Tooby, J. (1 992). Cognitive adap-
Case, R. (1 985 ). Intellectual Development: Birth to tations for social exchange. In J. H. Barkow &
Adulthood. New York: Academic Press. L. Cosmides & J. Tooby (Eds.), The Adapted
Mind: Evolutionary Psychology and the Genera- Rumsey & N. Kresnegor (Eds.), Developmen-
tion of Culture (pp. 1 63 –228). New York: Ox- tal Neuroimaging: Mapping the Development of
ford University Press. Brain and Behavior (pp. 263 –283 ). San Diego:
Crisafi, M. A., & Brown, A. L. (1 986). Analogi- Academic Press.
cal transfer in very young children: Combining Flavell, J. H., Green, F. L., & Flavell, E. R.
two separately learned solutions to reach a goal. (1 986). Development of knowledge about the
Child Development, 5 7, 95 3 –968. appearance-reality distinction. Monographs of
Cummins, D. D. (1 996). Evidence of deontic rea- the Society for Research in Child Development,
soning in 3 - and 4-year-old children. Memory 5 1 , 1 –89.
and Cognition, 2 4, 823 –829. Fodor, J. (1 980). Fixation of belief and con-
Davis, H. L., & Pratt, C. (1 995 ). The develop- cept acquisition. In M. Piatelli-Palmarini
ment of children’s theory of mind: The work- (Ed.), Language and Learning: The De-
ing memory explanation. Australian Journal of bate between Chomsky and Piaget (pp. 1 43 –
Psychology, 47, 25 –3 1 . 1 49). Cambridge, MA: Harvard University
Diamond, A. (1 988). Abilities and neural mech- Press.
anisms underlying A not-B performance. Child Fodor, J. (1 994). Concepts: A potboiler. Cogni-
Development, 5 9, 5 23 –5 27. tion, 5 0, 95 –1 1 3 .
Donaldson, M. (1 971 ). Preconditions of inference. Fodor, J. A. (1 983 ). Modularity of Mind: An Essay
Lincoln: University of Nebraska Press. on Faculty Psychology. Cambridge, MA: MIT
Elman, J. L. (1 990). Finding structure in time. Press.
Cognitive Science, 1 4, 1 79–21 1 . Fodor, J. A., & Pylyshyn, Z. W. (1 988). Connec-
Elman, J. L., Bates, E. A., Johnson, M. H., tionism and cognitive architecture: A critical
Karmiloff-Smith, A., Parisi, D., et al. (1 996). analysis. Cognition, 2 8, 3 –71 .
Rethinking Innateness: A Connectionist Per- Frye, D., & Zelazo, P. D. (1 998). Complexity:
spective on Development. London: The MIT From formal analysis to final action. Behavioral
Press. and Brain Sciences, 2 1 , 83 6–83 7.
Falmagne, R. J., Mawby, R. A., & Pea, R. D. Frye, D., Zelazo, P. D., Brooks, P. J., & Samuels,
(1 989). Linguistic and logical factors in recog- M. C. (1 996). Inference and action in early
nition of indeterminacy. Cognitive Develop- causal reasoning. Developmental Psychology, 3 2 ,
ment, 4, 1 41 –1 76. 1 20–1 3 1 .
Field, D. (1 987). A review of preschool conserva- Frye, D., Zelazo, P. D., & Palfai, T. (1 995 ). The-
tion training: An analysis of analyses. Develop- ory of mind and rule-based reasoning. Cognitive
mental Review, 7, 21 0–25 1 . Development, 1 0, 483 –5 27.
Fischer, K. W. (1 980). A theory of cognitive Gallimore, R., & Tharp, R. (1 999). Teaching
development: The control and construction of mind in society: Teaching, schooling, and
hierarchies of skills. Psychological Review, 87, literate discourse. In P. Lloyd & C. Fer-
477–5 3 1 . nyhough (Eds.), Lev Vygotsky Critical As-
Fischer, K. W. (1 987). Relations between brain sessments. Vol III. (pp. 296–3 5 0). London:
and cognitive development. Child Develop- Routledge.
ment, 5 8, 623 –63 2. Garton, A. F. (2004) Exploring Cognitive
Fischer, K. W., & Bidell, T. R. (1 998). Dynamic Development: The Child as Problem Solver.
development of psychological structures in ac- Oxford, UK: Blackwell.
tion and thought. In W. Damon & R. M. Lerner Gelman, R. (1 969). Conservation acquisition: A
(Eds.), Handbook of Child Psychology. Vol. 1 : problem of learning to attend to relevant at-
Theoretical Models of Human Development (5 th tributes. Journal of Experimental Child Psychol-
ed., pp. 467–5 61 ). New York: Wiley. ogy, 7, 1 67–1 87.
Fischer, K. W., & Pare-Blagoev, J. (2000). From Gelman, R. (1 972). Logical capacity of very
individual differences to dynamic pathways young children: Number invariance rules.
of development. Child Development, 71 , 85 0– Child Development, 43 , 75 –90.
85 3 . Gelman, R. (1 982). Accessing one-to-one
Fischer, K. W., & Rose, S. P. (1 996). Dynamic correspondence: Still another paper about
growth cycles of brain and cognitive develop- conservation. British Journal of Psychology, 73 ,
ment. In R. W. Thatcher & G. R. Lyon & J. 209–220.
Gelman, R. (1 990). First principles organize at- reasoning. In U. Goswami (Ed.), Blackwell
tention to and learning about relevant data: Handbook of Childhood Cognitive Development
Number and the animate-inanimate distinc- (pp. 282–3 02). Oxford, UK: Blackwell.
tion. Cognitive Science, 1 4, 79–1 06. Greene, T. R. (1 994). What kindergartners know
Gelman, R. (1 991 ). Epigenetic foundations of about class inclusion hierarchies. Journal of Ex-
knowledge structures: Initial and transcendant perimental Child Psychology, 5 7, 72–88.
constructions. In S. Carey & R. Gelman (Eds.), Halford, G. S. (1 970). A theory of the acquisition
The Epigenesis of Mind (pp. 293 –3 22). Hillsdale, of conservation. Psychological Review, 77, 3 02–
NJ: Erlbaum. 3 1 6.
Gelman, S. (2000). The role of essentialism in Halford, G. S. (1 982). The Development of
children’s concepts. Advances in Child Devel- Thought. Hillsdale, NJ: Erlbaum.
opment and Behavior, 2 7, 5 5 –98. Halford, G. S. (1 989). Reflections on 25 years
Gelman, S. A., & Markman, E. M. (1 986). Cate- of Piagetian cognitive developmental psychol-
gories and induction in young children. Cogni- ogy, 1 963 –1 988. Human Development, 3 2 , 3 25 –
tion, 2 3 , 1 83 –209. 3 87.
Gentner, D. (1 977). If a tree had a knee, where Halford, G. S. (1 993 ). Children’s Understanding:
would it be? Children’s performance on simple The Development of Mental Models. Hillsdale,
spatial metaphors. Papers and Reports on Child NJ: Erlbaum.
Language Development, 1 3 , 1 5 7–1 64.
Halford, G. S. (2002). Information process-
Gentner, D. (1 983 ). Structure-mapping: A the- ing models of cognitive development. In U.
oretical framework for analogy. Cognitive Sci- Goswami (Ed.), Blackwell Handbook of Child-
ence, 7, 1 5 5 –1 70. hood Cognitive Development. Oxford, UK:
Gentner, D., & Gentner, D. R. (1 983 ). Flow- Blackwell.
ing waters or teeming crowds: Mental models Halford, G. S., Andrews, G., Dalton, C., Boag, C.,
of electricity. In D. Gentner & A. L. Stevens & Zielinski, T. (2002). Young children’s perfor-
(Eds.), Mental Models (pp. 99–1 29). Hillsdale, mance on the balance scale: The influence of
NJ: Erlbaum. relational complexity. Journal of Experimental
Gentner, D., & Markman, A. B. (1 997). Structure Child Psychology, 81 , 41 7–445 .
mapping in analogy and similarity. American
Halford, G. S., Andrews, G., & Jensen, I. (2002).
Psychologist, 5 2 , 45 –5 6.
Integration of category induction and hierar-
Gergely, G., Nadasky, Z., Csibra, G., & Biro, chical classification: One paradigm at two lev-
S. (1 995 ). Taking the intentional stance at 1 2 els of complexity. Journal of Cognition and De-
months of age. Cognition, 5 6, 1 65 –1 93 . velopment, 3 , 1 43 –1 77.
Gholson, B., Smither, D., Buhrman, A., & Dun- Halford, G. S., Bain, J. D., Maybery, M., &
can, M. (1 996). The sources of children’s Andrews, G. (1 998). Induction of relational
reasoning errors during analogical problem schemas: Common processes in reasoning and
solving. Applied Cognitive Psychology, 1 0, complex learning. Cognitive Psychology, 3 5 ,
s85 –s97. 201 –245 .
Goldman-Rakic, P. S. (1 987). Development of Halford, G. S., & Boyle, F. M. (1 985 ). Do young
cortical circuitry and cognitive function. Child children understand conservation of number?
Development, 5 8, 601 –622. Child Development, 5 6, 1 65 –1 76.
Gopnik, A. (1 996). The post-Piaget era. Psycho- Halford, G. S., & Kelly, M. E. (1 984). On
logical Science, 7, 221 –225 . the basis of early transitivity judgements.
Gordon, A. C. L., & Olson, D. R. (1 998). The re- Journal of Experimental Child Psychology, 3 8,
lation between acquisition of a theory of mind 42–63 .
and the capacity to hold in mind. Journal of Halford, G. S., & Leitch, E. (1 989). Process-
Experimental Child Psychology, 68, 70–83 . ing load constraints: A structure-mapping
Goswami, U. (1 989). Relational complexity and approach. In M. A. Luszcz & T. Nettel-
the development of analogical reasoning. Cog- beck (Eds.), Psychological Development: Per-
nitive Development, 4, 25 1 –268. spectives Across the Life-Span (pp. 1 5 1 –1 5 9).
Goswami, U. (1 998). Cognition in Children. East Amsterdam: North-Holland.
Sussex, UK: Psychology Press. Halford, G. S., Phillips, S., & Wilson, W. H.
Goswami, U. (2002). Inductive and deductive (under review). Structural complexity in
cognitive processes: The concept of represen- Holyoak, K. J., & Thagard, P. (1 989). Analogi-
tational rank. cal mapping by constraint satisfaction. Cogni-
Halford, G. S., Smith, S. B., Dickson, J. C., May- tive Science, 1 3 , 295 –3 5 5 .
bery, M. T., Kelly, M. E., et al. (1 995 ). Mod- Holyoak, K. J., & Thagard, P. (1 995 ). Mental
elling the development of reasoning strategies: Leaps. Cambridge, MA: MIT Press.
The roles of analogy, knowledge, and capacity. Humphrey, G. (1 95 1 ). Thinking: An Introduc-
In T. Simon & G. S. Halford (Eds.), Develop- tion to its Experimental Psychology. London:
ing Cognitive Competence: New Approaches to Methuen.
Cognitive Modelling (pp. 77–1 5 6). Hillsdale, NJ: Inhelder, B., & Piaget, J. (1 95 8). The Growth
Erlbaum. of Logical Thinking from Childhood to Adoles-
Halford, G. S., & Wilson, W. H. (1 980). A cat- cence. A. Parsons (Trans.). London: Routledge
egory theory approach to cognitive develop- & Kegan Paul. (Original work published 1 95 5 .)
ment. Cognitive Psychology, 1 2 , 3 5 6–41 1 . Inhelder, B., & Piaget, J. (1 964). The Early Growth
Halford, G. S., Wilson, W. H., & McDonald, M. of Logic in the Child. London: Routledge &
(1 995 ). Complexity of structure mapping in hu- Kegan Paul.
man analogical reasoning: A PDP model. Pa- Jansen, B. R. J., & van der Maas, H. L. J. (1 997). A
per presented at the Proceedings of the Sev- statistical test of the rule assessment methodol-
enteenth Annual Conference of the Cognitive ogy by latent class analysis. Developmental Re-
Science Society. Pittsburgh, PA. view, 1 7, 3 21 –3 5 7.
Halford, G. S., Wilson, W. H., & Phillips, S. Johnson, K. E., Scott, P., & Mervis, C. B. (1 997).
(1 998). Processing capacity defined by rela- Development of children’s understanding of
tional complexity: Implications for compara- basic-subordinate inclusion relations. Develop-
tive, developmental, and cognitive psychology. mental Psychology, 3 3 , 745 –763 .
Behavioral and Brain Sciences, 2 1 , 803 –83 1 . Johnson-Laird, P. N., & Byrne, R. M. J. (1 991 ).
Hayes, B. K., Heit, E., Goodhew, A., & Gillan, Deduction. Hillsdale, NJ: Erlbaum.
J. E. (2003 ). The role of diverse instruction Kahneman, D., Slovic, P., & Tversky, A. E. (1 982).
in conceptual change. Journal of Experimental Judgment under Uncertainity: Heuristics and
Child Psychology, 86, 25 3 –276. Biases. Cambridge, UK: Cambridge University
Heit, E., & Hahn, U. (2001 ). Diversity-based rea- Press.
soning in children. Cognitive Psychology, 43 , Kallio, K. D. (1 982). Developmental change on a
243 –273 . five-term transitive inference. Journal of Exper-
Hodkin, B. (1 987). Performance model analy- imental Child Psychology, 3 3 , 1 42–1 64.
sis in class inclusion: An illustration with two Karmiloff-Smith, A. (1 994). Precis of beyond
language conditions. Developmental Psychology, modularity: A developmental perspective on
2 3 , 683 –689. cognitive science. Behavioral and Brain Sci-
Hofstadter, D. R. (2001 ). Analogy as the core ences, 1 7, 693 –745 .
of cognition. In D. Gentner, K. J. Holyoak, & Keenan, T., Olson, D. R., & Marini, Z. (1 998).
B. N. Kokinov (Eds.), The Analogical Mind: Per- Working memory and children’s developing
spectives from Cognitive Science (pp. 499–5 3 8). understanding of mind. Australian Journal of
Cambridge, MA: MIT Press. Psychology, 5 0, 76–82.
Holland, J. H., Holyoak, K. J., Nisbett, R. E., & Keil, F. C. (1 991 ). The emergence of theoretical
Thagard, P. R. (1 986). Induction: Processes of beliefs as constraints on concepts. In S. Carey
Inference, Learning and Discovery. Cambridge, & R. Gelman (Eds.), The Epigenesis of Mind:
MA: Bradford Books/MIT Press. Essays on Biology and Cognition (pp. 23 7–25 6).
Holyoak, K. J., & Hummel, J. E. (2001 ). To- Hillsdale, NJ: Erlbaum.
ward an understanding of analogy within a bi- Keil, F. C. (1 995 ). An abstract to concrete shift
ological symbol system. In D. Gentner & K. J. in the development of biological thought: The
Holyoak & B. N. Kokinov (Eds.), The Analog- insides story. Cognition, 5 6, 1 29–1 63 .
ical Mind: Perspectives from Cognitive Science Kenrick, D. T. (2001 ). Evolutionary psychol-
(pp. 1 61 –1 95 ). Cambridge, MA: MIT Press. ogy, cognitive science, and dynamical systems:
Holyoak, K. J., Junn, E. N., & Billman, D. O. Building an integrative paradigm. Current Di-
(1 984). Development of analogical problem- rections in Psychological Science, 1 0, 1 3 –1 7.
solving skill. Child Development, 5 5 , 2042– Klahr, D., Fay, A. L., & Dunbar, K. (1 993 ).
205 5 . Heuristics for scientific experimentation: A
developmental study. Cognitive psychology, Marcus, G. F. (1 998a). Can connectionism save

2 4(1 ), 1 1 1 –1 46. constructivism? Cognition, 66, 1 5 3 –1 82.
Klahr, D., & Wallace, J. G. (1 976). Cognitive Marcus, G. F. (1 998b). Rethinking eliminative
Development: An Information Processing View. connectionism. Cognitive Psychology, 3 7, 243 –
Hillsdale, NJ: Erlbaum. 282.
Kuhn, D. (1 977). Conditional reasoning in chil- Mareschal, D., Plunkett, K., & Harris, P.
dren. Developmental Psychology, 1 3 , 3 42–3 5 3 . (1 995 , July 22–25 , 1 995 ). Developing Object
Kuhn, D. (Ed.). (1 995 ). Development and Permanence: A Connectionist Model. Paper pre-
learning. Reconceptualizing the intersec- sented at the Proceedings of the seventh annual
tion [Special Issue]. Human Development, conference of the Cognitive Science Society.
3 8(6). Pittsburgh, PA.
Kuhn, D. (2001 ). Why development does (and Mareschal, D., & Shultz, T. R. (1 996). Genera-
does not) occur: Evidence from the domain tive connectionist networks and constructivist
of inductive reasoning. In J. D. McClelland & cognitive development. Cognitive Development,
R. S. Seigler (Eds.), Mechanisms of Cognitive 1 1 , 5 71 –603 .
Development: Behavioral and Neural Perspec- Marini, Z. (1 984). The Development of Social and
tives. London: Erlbaum. Physical Cognition in Childhood and Adoles-
Kuhn, D., Amsel, E., O’Loughlin, M., Schauble, cence. Unpublished doctoral dissertation. Uni-
L., Leadbeater, B., & Yotive, W. (1 988). The versity of Toronto (OISE), Toronto.
Development of Scientific Thinking Skills. San Markovits, H. (2000). A mental model analysis
Diego, CA: Academic Press. of young children’s conditional reasoning with
Leevers, H. J., & Harris, P. L. (1 999). Persisting meaningful premises. Thinking and Reasoning,
effects of instruction on young children’s syllo- 6, 3 3 5 –3 48.
gistic reasoning with incongruent and abstract Markovits, H., & Barrouillet, P. (2002). The de-
premises. Thinking and Reasoning, 5 , 1 45 –1 73 . velopment of conditional reasoning: A mental
Leslie, A., & Keeble, S. (1 987). Do six-month-old models account. Developmental Review, 2 2 , 5 –
infants perceive causality? Cognition, 2 5 , 265 – 3 6.
288. Markovits, H., & Dumas, C. (1 992). Can pigeons
Leslie, A. M. (1 987). Pretense and representation: really make transitive inferences? Journal of
The origins of “theory of mind.” Psychological Experimental Psychology: Animal Behavior Pro-
Review, 94, 41 2–426. cesses, 1 8, 3 1 1 –3 1 2.
Light, P., Blaye, A., Gilly, M., & Girotto, V. Markovits, H., Venet, M., Janveau-Brenman, G.,
(1 989). Pragmatic schemas and logical reason- Malfait, N., Pion, N., & Vadeboncoeur, I.
ing in 6- to 8-year-old children. Cognitive De- (1 996). Reasoning in young children: Fantasy
velopment, 4, 49–64. and information retrieval. Child Development,
Lloyd, P. & Fernyhough, C. (1 999). Lev Vygot- 67, 285 7–2872.
sky Critical Assessments. Vol IV. London: Rout- Matsuda, F. (2001 ). Development of concepts of
ledge. interrelationship among duration, distance, and
Lo, Y., Sides, A., Rozelle, J., & Osherson, D. speed. International Journal of Behavioral Devel-
(2002). Evidential diversity and premise prob- opment, 2 5 , 466–480.
ability in young children’s inductive judgment. McClelland, J. L. (1 995 ). A connectionist per-
Cognitive Science, 2 6, 1 81 –206. spective on knowledge and development. In
Lourenco, O., & Machado, A. (1 996). In defense T. Simon & G. S. Halford (Eds.), Develop-
of Piaget’s theory: A reply to 1 0 common crit- ing Cognitive Competence: New Approaches to
icisms. Psychological Review, 1 03 , 1 43 –1 64. Cognitive Modelling (pp. 1 5 7–204). Hillsdale,
Luria, A. R. (1 976). Cognitive Development: Its NJ: Erlbaum.
Cultural and Social Foundations. Cambridge, McClelland, J. L., & Rumelhart, D. E. (1 985 ).
MA: Harvard University Press. Distributed memory and the representation of
Mandler, J. M. (2000). Perceptual and conceptual general and specific information. Journal of Ex-
processes in infancy. Journal of Cognition and perimental Psychology: General, 1 1 4, 1 5 9–1 88.
Development, 1 , 3 –3 6. McGarrigle, J., & Donaldson, M. (1 975 ). Conser-
Mandler, J. M., & Mandler, G. (1 964). Thinking: vation accidents. Cognition, 3 , 3 41 –3 5 0.
From Association to Gestalt. New York: McGarrigle, J., Grieve, R., & Hughes, M. (1 978).
Wiley. Interpreting inclusion: A contribution to the
study of the child’s cognitive and linguistic de- Pears, R., & Bryant, P. (1 990). Transitive infer-
velopment. Journal of Experimental Child Psy- ences by young children about spatial position.
chology, 2 6, 5 28–5 5 0. British Journal of Psychology, 81 , 497–5 1 0.
McGonigle, B. O., & Chalmers, M. (1 977). Are Perner, J., Lang, B., & Kloo, D. (2002). Theory of
monkeys logical? Nature, 2 67, 3 5 5 –3 77. mind and self-control: More than a common
McLaughlin, G. H. (1 963 ). Psycho-logic: A pos- problem of inhibition. Child Development, 73 ,
sible alternative to Piaget’s formulation. British 75 2–767.
Journal of Educational Psychology, 3 3 , 61 –67. Phillips, S. (1 994). Connectionism and system-
Miller, G. A., Galanter, E., & Pribram, K. H. aticity. In A. C. Tsoi & T. Downs (Eds.), Pro-
(1 960). Plans and the Structure of Behavior. ceedings of the Fifth Australian Conference on
New York: Holt, Rinehart & Winston. Neural Networks (pp. 5 3 –5 5 ). St. Lucia, Qld.,
Miller, S. A. (1 976). Nonverbal assessment of Australia: University of Queensland Electrical
Piagetian concepts. Psychological Bulletin, 83 , and Computer Engineering.
405 –43 0. Piaget, J. (1 95 0). The Psychology of Intelligence. M.
Modgil, S. (1 974). Piagetian Research: A Hand- Piercy & D. E. Berlyne (Trans.). London: Rout-
book of Recent Studies. Windsor, Berkshire: ledge & Kegan Paul. (Original work published
NFER Publishing. 1 947).
Morris, B. J., & Sloutsky, V. (2002). Children’s Piaget, J. (1 95 2). The Child’s Conception of Num-
solutions of logical versus empirical problems: ber. C. Gattengo & F. M. Hodgson (Trans.).
What’s missing and what develops? Cognitive London: Routledge & Kegan Paul. (Original
Development, 1 6, 907–928. work published 1 941 ).
Muller, U., Overton, W. F., & Reene, K. (2001 ). Piaget, J. (1 95 3 ). The Origin of Intelligence in the
Development of conditional reasoning: A lon- Child. London: Routledge & Kegan Paul.
gitudinal study. Journal of Cognition and devel- Piaget, J. (1 95 7). Logic and Psychology New York:
opment, 2 , 27–49. Basic Books.
Munakata, Y., McClelland, J. L., Johnson, M. H., Piaget, J. (1 970). Structuralism. C. Maschler
& Siegler, R. S. (1 997). Rethinking infant (Trans.). New York: Basic Books, (Original
knowledge: Toward an adaptive process ac- work published 1 968).
count of successes and failures in object per- Quartz, S. R., & Sejnowski, T. J. (1 997). The neu-
manence tasks. Psychological Review, 1 04, 686– ral basis of cognitive development: A construc-
71 3 . tivist manifesto. Behavioral and Brain Sciences,
Newell, A. (1 990). Unified Theories of Cognition. 2 0, 5 3 7–5 96.
Cambridge, MA: Harvard University Press. Quinn, P. C. (2002). Category representation in
Osherson, D. N. (1 974). Logical Abilities in Chil- young infants. Current Directions in Psychologi-
dren, Volume I: Organization of Length and Class cal Science, 1 1 , 66–70.
Concept: Empirical Consequences of a Piagetian Quinn, P. C., & Johnson, M. H. (1 997). The
Formalism. Potomac, MD: Erlbaum. emergence of perceptual category representa-
Osherson, D. N., & Markman, E. (1 975 ). Lan- tions in young infants: A connectionist analysis.
guage and the ability to evaluate contradictions Journal of Experimental Child Psychology, 66,
and tautologies. Cognition, 3 , 21 3 –21 6. 23 6–263 .
Pascual-Leone, J. A. (1 970). A mathematical Raijmakers, M. E. J., van Koten, S., & Molenaar,
model for the transition rule in Piaget’s devel- P. C. M. (1 996). On the validity of simulat-
opmental stages. Acta Psychologica, 3 2 , 3 01 – ing stagewise development by means of PDP
3 45 . networks: Application of catastrophe analysis
Pascual-Leone, J. A., & Smith, J. (1 969). The en- and an experimental test of rule-like network
coding and decoding of symbols by children: performance. Cognitive Science, 2 0, 1 01 –1 3 6.
A new experimental paradigm and a neo- Rehder, B., & Hastie, R. (2001 ). Causal knowl-
Piagetian model. Journal of Experimental Child edge and categories: The effects of causal be-
Psychology, 8, 3 28–3 5 5 . liefs on categorization, induction, and similar-
Pauen, S., & Wilkening, F. (1 997). Children’s ana- ity. Journal of Experimental Psychology: General,
logical raesoning about natural phenomenon. 1 3 0, 3 23 –3 60.
Journal of Experimental Child Psychology, 67, Riley, C. A., & Trabasso, T. (1 974). Comparatives,
90–1 1 3 . logical structures and encoding in a transitive
inference task. Journal of Experimental Child Siegel, L. S., McCabe, A. E., Brand, J., &
Psychology, 1 7, 1 87–203 . Matthews, J. (1 978). Evidence for the un-
Robin, N., & Holyoak, K. J. (1 995 ). Relational derstanding of class inclusion in preschool
complexity and the functions of prefrontal cor- children: Linguistic factors and training effects.
tex. In M. S. Gazzaniga (Ed.), The Cognitive Child Development, 49, 688–693 .
Neurosciences (pp. 987–997). Cambridge, MA: Siegler, R. S. (1 981 ). Developmental sequences
MIT Press. within and between concepts. Monographs of
Rosch, E., & Mervis, C. B. (1 975 ). Family the Society for Research in Child Development,
resemblences: Studies in the internal struc- 46, 1 –84.
ture of categories. Cognitive Psychology, 7, 5 73 – Siegler, R. S. (1 995 ). How does change occur:
605 . A microgenetic study of number conservation.
Rosch, E., Mervis, C. B., Gray, W. D., Johnson, Cognitive Psychology, 2 8, 225 –273 .
M. D., & Boyes-Braem, P. (1 976). Basic objects Siegler, R. S. (1 999). Strategic development.
in natural categories. Cognitive Psychology, 8, Trends in Cognitive Science, 3 , 43 0–43 5 .
3 82–43 9. Siegler, R. S., & Chen, Z. (1 998). Developmen-
Rudy, J. W., Keith, J. R., & Georgen, K. (1 993 ). tal differences in rule learning: A microge-
The effect of age on children’s learning of prob- netic analysis. Cognitive Psychology, 3 6, 273 –
lems that require a configural association solu- 3 1 0.
tion. Developmental Psychobiology, 2 6, 1 71 –1 84. Siegler, R. S., & Jenkins, E. A. (1 989). How Chil-
Ruffman, T., Perner, J., Olson, D. R., & Doherty, dren Discover New Strategies. Hillsdale, NJ:
M. (1 993 ). Reflecting on scientific thinking: Erlbaum.
Children’s understanding of the hypothesis- Siegler, R. S., & Shipley, C. (1 995 ). Variation,
evidence relation. Child Development, 64, selection, and cognitive change. In T. Simon
1 61 7–1 63 6. & G. S. Halford (Eds.), Developing Cognitive
Samarapungavan, A., Vosniadou, S., & Brewer, Competence: New Approaches to Process Mod-
W. F. (1 996). Mental models of the earth, sun eling (pp. 3 1 –76). Hillsdale, NJ: Erlbaum.
and moon: Indian children’s cosmologies. Cog- Siegler, R. S., & Shrager, J. (1 984). Strategy
nitive Development, 1 1 , 491 –5 21 . choices in addition and subtraction: How do
Schlottman, A. (2001 ). Perception versus knowl- children know what to do? In C. Sophian (Ed.),
edge of cause and effect in children: When see- Origins of Cognitive Skills (pp. 229–293 ). Hills-
ing is believing. Current Directions in Psycholog- dale, NJ: Erlbaum.
ical Science, 1 0, 1 1 1 –1 1 5 . Sigel, I., & Hooper, F. H. E. (1 968). Logical Think-
Shastri, L., & Ajjanagadde, V. (1 993 ). From sim- ing in Children. New York: Holt, Rinehart &
ple associations to systematic reasoning: A con- Winston.
nectionist representation of rules, variables, and Simon, T., & Klahr, D. (1 995 ). A computational
dynamic bindings using temporal synchrony. theory of children’s learning about number
Behavioral and Brain Sciences, 1 6, 41 7–494. conservation. In T. Simon & G. S. Halford
Sheppard, J. L. (1 978). A Structural Analysis of (Eds.), Developing Cognitive Competence: New
Concrete Operations. London: Wiley. Approaches to Process Modeling (pp. 3 1 5 –3 5 3 ).
Shultz, T. R. (1 991 ). Simulating Stages of Human Hillsdale, NJ: Erlbaum.
Cognitive Development with Connectionist Mod- Sirois, S., & Shultz, T. R. (1 998). Neural network
els. Paper presented at the 8th International modeling of developmental effects in discrimi-
Workshop of Machine Learning. nation shifts. Journal of Experimental Child Psy-
Shultz, T. R. (1 998). A computational analysis chology, 71 , 23 5 –274.
of conservation. Developmental Science, 1 , 1 03 – Slobin, D. I. (1 972). Some Questions about
1 26. Language Development. Harmondsworth, UK:
Shultz, T. R., Schmidt, W. C., Buckingham, D., Penguin.
& Mareschal, D. (1 995 ). Modeling cognitive Smith, L. (2002). Piaget’s model. In U. Goswami
development with a generative connectionist (Ed.), Blackwell Handbook of Childhood Cogni-
algorithm. In T. J. Simon & G. S. Halford tive Development (pp. 5 1 5 –5 3 7). Oxford, UK:
(Eds.), Developing Cognitive Competence: New Blackwell.
Approaches to Process Modeling (pp. 205 –261 ). Smith, L. B., Thelen, E., Titzer, R., & McLin, D.
Hillsdale, NJ: Erlbaum. (1 999). Knowing in the context of acting: The
task dynamics of the a-not-b error. Psychological Vosniadou, S., & Brewer, W. F. (1 992). Mental
Review, 1 06, 23 5 –260. models of the earth: A study of conceptual
Smolensky, P. (1 988). On the proper treatment of change in childhood. Cognitive Psychology, 2 4,
connectionism. Behavioral and Brain Sciences, 5 3 5 –5 85 .
1 1 , 1 –74. Vygotsky, L. S. (1 962). Thought and Language.
Smolensky, P. (1 990). Tensor product variable Cambridge, MA: MIT Press. (Original work
binding and the representation of symbolic published 1 93 4).
structures in connectionist systems. Artificial Waltz, J. A., Knowlton, B. J., Holyoak, K. J.,
Intelligence, 46, 1 5 9–21 6. Boone, K. B., Mishkin, F. S., de Menezes San-
Sophian, C. (1 995 ). Representation and reason- tos, M., et al. (1 999). A system for relational
ing in early numerical development: Counting, reasoning in human prefrontal cortex. Psycho-
conservation and comparison between sets. logical Science, 1 0, 1 1 9–1 25 .
Child Development, 66, 5 5 9–5 77. Wason, P. C. (1 966). Reasoning. Harmondsworth,
Strevens, M. (2000). The essentialist apect of UK: Penguin.
naive theories. Cognition, 74, 1 49–1 75 . Wellman, H. M., Cross, D., & Watson, J. (2001 ).
Surber, C. F., & Gzesh, S. M. (1 984). Reversible Meta-analysis of theory-of-mind development:
operations in the balance scale task. Journal of The truth about false belief. Child Develop-
Experimental Child Psychology, 3 8, 25 4–274. ment, 72 , 65 5 –684.
Tager-Flusberg, H., & Sullivan, K. (2000). A com- Wertheimer, M. (1 945 ). Productive Thinking.
ponential view of theory of mind: Evidence New York: Harper.
from Williams syndrome. Cognition, 76, 5 9–89. Wilkening, F. (1 980). Development of dimen-
Terrace, H. S., & McGonigle, B. (1 994). Mem- sional integration in children’s perceptual
ory and representation of serial order by chil- judgment: Experiments with area, volume and
dren, monkeys, and pigeons. Current Directions velocity. In F. Wilkening, J. Becker, & T. Tra-
in Psychological Science, 3 , 1 80–1 85 . basso (Eds.), Information Integration in Children
Thatcher, R. W., Walker, R. A., & Giudice, S. (pp. 47–69). Hillsdale, NJ: Erlbaum.
(1 987). Human cerebral hemispheres develop Wimmer, H., & Perner, J. (1 983 ). Beliefs about
at different rates and ages. Science, 2 3 6, 1 1 1 0– beliefs: Representation and constraining func-
1113. tion of wrong beliefs in young children’s
Thayer, E. S., & Collyer, C. E. (1 978). The de- understanding of deception. Cognition, 1 3 ,
velopment of transitive inference: A review of 1 03 –1 28.
recent approaches. Psychological Bulletin, 85 , Winer, G. A. (1 980). Class-inclusion reasoning in
1 3 27–1 3 43 . children: A review of the empirical literature.
Thomas, H., & Horton, J. J. (1 997). Competency Child Development, 5 1 , 3 09–3 28.
criteria and the class inclusion task: Model- Wynne, C. D. L. (1 995 ). Reinforcement accounts
ing judgments and justifications. Developmental for transitive inference performance. Animal
Psychology, 3 3 , 1 060–1 073 . Learning and Behavior, 2 3 , 207–21 7.
van Geert, P. (1 991 ). A dynamic systems model Younger, B. A., & Fearing, D. D. (1 999). Parsing
of cognitive and language growth. Psychological items into separate categories: Developmental
Review, 98, 3 –5 3 . change in infant categorization. Child Develop-
van Geert, P. (1 998). A dynamic systems model of ment, 70, 291 –3 03 .
basic developmental mechanisms: Piaget, Vy- Zelazo, P. D., & Frye, D. (1 998). Cognitive com-
gotsky, and beyond. Psychological Review, 1 05 , plexity and control: II. The development of ex-
63 4–677. ecutive function in childhood. Current Direc-
van Geert, P. (2000). The dynamics of develop- tions in Psychological Science, 7, 1 21 –1 26.
mental mechanisms: From Piaget and Vygotsky Zelazo, P. D., & Jacques, S. (1 996). Children’s
to dynamic systems models. Current Directions rule use: Representation, reflection and cogni-
in Psychological Science, 9, 64–68. tive control. Annals of Child Development, 1 2 ,
von Fersen, L., Wynne, C. D. L., Delius, J. D., 1 1 9–1 76.
& Staddon, J. E. R. (1 991 ). Transitive inference Zimmerman, C. (2000). The development of sci-
in pigeons. Journal of Experimental Psychology: entific reasoning skills. Developmental Review,
Animal Behavior Processes, 1 7, 3 3 4–3 41 . 2 0, 99–1 49.
CHAPTER 2 3
Mathematical Cognition
C. R. Gallistel
Rochel Gelman
Mathematics is a system for representing and do not lie in language and the language fac-
reasoning about quantities, with arithmetic ulty. The ability to estimate quantities and
as its foundation. Its deep interest for our un- to reason arithmetically with those estimates
derstanding the psychological foundations of exists in the brains of animals that have no
scientific thought comes from what Eugene language. The same or very similar nonver-
Wigner called “the unreasonable efficacy of bal mechanisms appear to operate in paral-
mathematics in the natural sciences.” From lel with verbal estimation and reasoning in
a formalist perspective, arithmetic is a sym- adult humans. They also operate to some
bolic game, like tic-tac-toe. Its rules are more extent before children learn to speak and
complicated, but not a great deal more com- before they have had any tutoring in the el-
plicated. Mathematics is the study of the ements of arithmetic. These findings suggest
properties of this game and of the systems that the verbal expression of number and of
that may be constructed on the foundation it arithmetic thinking is based on a nonverbal
provides. Why should this symbolic game be system for estimating and reasoning about
so powerful and resourceful when it comes discrete and continuous quantity, which we
to building models of the physical world? share with many nonverbal animals. A rea-
And on what psychological foundations does sonable supposition is that the neural sub-
the human mastery of this game rest? strate for this system arose far back in the
The first question is metaphysical – why evolution of brains precisely because of the
is the world the way it is? We do not treat puzzle to which Wigner called attention:
it, because it lies beyond the realm of exper- Arithmetic reasoning captures deeply im-
imental behavioral science. We review the portant properties of the world, which the
answers to the second question suggested by animal brain must represent in order to act
experimental research on human and non- effectively in it.
human animal cognition. The recognition that there is a nonver-
The general nature of the answer is that bal system of arithmetic reasoning in hu-
the foundations of mathematical cognition man and many nonhuman animals is recent,
559
but it influences most contemporary ex- tal ordering, then they do not function as
perimental work on mathematical cogni- numbers.
tion. This review is organized around the
questions: (1 ) What are the properties of
this nonverbal system? (2) How is it related Kinds of Numbers
to the verbal system and written numeri-
cal systems? The ancient Greeks had considerable success
axiomatizing geometry, but mathematicians
did not axiomatize the system of numbers
until the nineteenth century, after it had un-
What Is a Number? dergone a large, historically documented ex-
pansion. Before this expansion, it was too
Arithmetic is one of the few domains messy and incomplete to be axiomatized,
of human thought that has been ex- because it lacked closure. A system of num-
tensively formalized. This formalization bers is closed under a combinatorial oper-
did not begin in earnest until the mid- ation if, when you apply the operation to
dle of the nineteenth century (Boyer & any pair of numbers, the result is a number.
Merzback, 1 989). In the process of formal- Adding or multiplying two positive integers
izing the arithmetic foundations of mathe- always produces a positive integer, so the
matics, mathematicians changed their minds positive integers are closed under addition
about what a number is. Before formal- and multiplication. They are also closed un-
ization, an intuitive understanding of what der the operation of ordering. For any pair of
a number is determined what could legit- numbers, a ≥ b = 1 if a is greater or equal to
imately be done with it. Once the for- than b, and 0 if not. These three operations –
mal “games” about number were made ex- addition, multiplication, and ordering – are
plicit, anything that played by the rules was the core operations of arithmetic. They and
a number. their inverses make the system what it is.
This formalist viewpoint is crucial to an The problem comes from the inverse op-
understanding of issues in the current sci- erations of subtraction and division. When
entific literature on mathematical cognition. you subtract a bigger number from a smaller,
Many of them turn on questions of how we the result is not a positive integer. Should
are to recognize and understand the proper- one regard the result as a number? Un-
ties of mental magnitudes. Mental magnitude til well into the nineteenth century, many
refers to an inferred (but, one supposes, po- professional mathematicians did not. Thus,
tentially observable and measurable) entity subtracting a bigger number from a smaller
in the head that represents either numeros- number was not a legitimate mathematical
ity (for example, the number of oranges in operation. This was inconvenient, because it
a case) or another magnitude (for example, meant that in the course of algebraic reason-
the length, width, height, and weight of the ing (reasoning about unspecified numbers),
case) and that has the formal properties of a one might unwittingly do something that
real number. was illegitimate. This purely practical con-
For a mental magnitude to represent an sideration strongly motivated the admission
objective magnitude, it must be causally re- of the negative numbers and zero to the set
lated to that objective magnitude. It must of numbers acknowledged to be legitimate.
also be shown that it is a player in a men- When one divides one integer by an-
tal game (a functionally cohesive collection other, the result, called a rational number,
of brain processes) that operates accord- or, more colloquially, a fraction, is rarely an
ing to at least some of the rules of arith- integer. From the earliest times from which
metic. When putative mental numbers do we have written records, people who worked
not validly enter into, at a minimum, men- with written numbers included at least some
tal addition, mental subtraction, and men- rational numbers among the numbers, but,
mathematical cognition 561
like school children to this day, they had ex- Clearly, caution and clear thinking are go-
traordinary difficulties in figuring out how to ing to be necessary if we want to treat as
do arithmetic with rational numbers in gen- numbers entities that you do not get by
eral. What is the sum of 1 /3 and 1 1 /1 7? That counting. But, humans do want to do this,
was a hard question in ancient Egypt and and they have wanted to since the beginning
remains so today in classrooms all around of recorded history. We measure quantities
the world. like lengths, weights, and volumes in order
The common notation for a fraction spec- to represent them with numbers. What the
ifies a number not by giving it a unique measuring does – if it is done well – is give
name like two but rather by specifying a way us “the right number” or at least one usable
of generating it (divide the number one by for our purposes. Measuring and the result-
the number two). The practice of specifying representation of continuous quantities
ing a number by giving an arithmetic proce- by numbers go back to the earliest written
dure that will generate it to whatever level records. Indeed, it is often argued that writ-
of precision is required has grown stronger ing evolved from a system for recording the
over the millenia. It is the key to a rigor- results of measurements made in the course
ous handling of both irrational and complex of commerce (bartering, buying, and sell-
numbers and to the way in which digital ing), political economy (taxation), survey-
computers operate with real numbers. But ing, and construction (Menninger, 1 969).
it is discomfiting, for several reasons. First, The ancient Greeks believed that, in prin-
there are an infinity of different notations ciple, all measurable magnitudes could be
for the same number: 1 /2, 2/4, 3 /6, and so represented by rational numbers. Everything
on, all specifying the same number. More- was a matter of proportion, and any propor-
over, for most rational numbers, there is no tion could be expressed as the ratio of two
complete decimal representation. Carrying integers. They were also the first to try to for-
out the division gives a repeating decimal. In malize mathematical thinking. In doing so,
short, you cannot write down a symbol for they discovered, to their horror, that frac-
most rational numbers that is both complete tions did not suffice to represent all possible
and unique.1 proportions. They discovered that the pro-
Finally, when fractions are allowed to be portion between the side of a square and its
numbers, the discrete ordering of the num- diagonal could not be represented by a frac-
bers is lost. It no longer is possible to specify tion. The Pythagorean formula for calculat-
the next number in the sequence, because ing the diagonal of a square says that the di-
there are an infinite number of rational num- agonal is equal to the square root of the sum
bers between any two rational numbers. For of the squares of the √sides. In this
√ case, the
all these reasons, admitting fractions to the diagonal is equal to (1 2 + 1 2 ) = (1 +1 ) =
√
system of numbers makes the system more 2. The Greeks proved that there is no frac-
difficult to work with in the concrete, albeit tion that, when multiplied by itself, is equal
more powerful in the abstract, because the to 2. If only integers and fractions are num-
system of rational numbers is, with one ex- bers, then the length of the diagonal of the
ception, closed under division. unit square cannot be represented by a num-
Allowing negative numbers and fractions ber. Put another way, you can measure the
to be numbers also creates problems with side of the square or you can measure its di-
what otherwise seem to be sound principles agonal, but you cannot measure them both
for reasoning about numbers. For example, exactly within the same measuring system –
it seems to be sound to say that dividing the unless you are willing to include among the
bigger of two numbers by the smaller gives a numbers in that system numbers that are
number that is bigger than the number one not integers (cannot be counted) and are not
gets if one divides the smaller by the bigger. even the ratio of two integers. You must in-
What then are we to make of the “fact” that clude what the Greeks called the irrational
1 / − 1 = −1 /1 = −1 ? numbers. But if you do include the irrational
numbers, how do you go about specifying

them in the general case?
Many irrationals can be specified by the
operation of extracting roots, which is the
inverse of the operation of raising a number
to a power. Raising any positive integer to
the power of any other always produces a
positive integer. Thus, the system of positive
integers is closed under raising to a power.
The problem, as usual, comes from the in-
verse operation – extracting roots. For most
pairs of integers, a and b, the ath root of b
is not a positive integer, nor even a rational
number; it is an irrational number. The need Figure 2 3.1 . The number system on which
within algebra to have an arithmetic that was modern mathematics is based. Not shown in this
closed under the extraction of roots was a diagram are the algebraic numbers, which are the
powerful motivation for mathematicians to numbers that may be obtained through the
admit both irrational numbers and complex extraction of roots (the solving of polynomial
numbers to the set of numbers. By admitting equations), nor the transcendental numbers,
irrational numbers, they created the system which may be obtained only by solving equations
of so-called real numbers, which was essen- with trigonometric, exponential, or logarithmic
terms. These are subcategories of the irrational
tial to calculus. To this day, there are pro-
numbers.
fessional mathematicians who question the
legitimacy of irrational numbers. Nonethe-
less, the real numbers, which include the
irrationals (see Figure 23 .1 ), are taken for hierarchy of kinds of numbers shown in Fig-
granted by all but a very few contemporary ure 23 .1 – has grown up over historical time
mathematicians. with much of the growth culminating only
The notion of a real number and that in the preceding two centuries. The psycho-
of a magnitude (for example, the length of logical question is, “What is it in the minds
a line) are formally identical. This means, of humans (and perhaps also nonhuman an-
among other things, that for every line seg- imals) that has been driving this process?”
ment, there is a real number that uniquely And how and under what circumstances
represents the length of that segment (in does this mental machinery enable educated
a given system of measurement) and con- modern humans to master the basics of for-
versely, for every real number, there is a mal mathematics, when, and to the extent
line segment that represents the magnitude that they do so?
of that number. Therefore, in what follows,
when we mention a mental magnitude, we
mean an entity in the mind (brain) that func-
tions within a system with the formal prop- Numerical Estimation and Reasoning
erties of the real number system. Like the in Animals
real number system, we assume that this sys-
tem is a closed system: All of its combinato- The development of verbalized and written
rial operations, when applied to any pair of reasoning about number that culminated in
mental magnitudes, generate another mental a formalized system of real numbers isomor-
magnitude. phic to continuous magnitudes was driven
As this brief sketch indicates, the system by the fact that humans apply numerical rea-
of number recognized by almost all contem- soning to continuous quantity just as much
porary professional mathematicians as “the as they do to discrete quantity. In consid-
number system” – the ever more inclusive ering the literature on numerical estimation
and reasoning in animals, we begin by re-

viewing the evidence that they estimate and
reason arithmetically about the quintessen-
tially continuous quantity time.
Common laboratory animals, such as the
pigeon, the rat, and the monkey, measure
and remember continuous quantities, such
as duration, as has been shown in a variety
of experimental paradigms. One of these is
the so-called peak procedure. In this proce-
dure, a trial begins with the onset of a stimu-
lus signaling the possible availability of food
at the end of a fixed interval, called the feed-
ing latency. Responses made at or after the
interval has elapsed trigger the delivery of
food. Responses prior to that time have no
consequences. On twenty to fifty percent of
the trials, food is not delivered. On these tri-
als, the key remains illuminated, the lever
remains extended, or the hopper remains il-
luminated for between four and six times
longer than the feeding latency. On these tri-
als, called probe trials, responding after the Figure 2 3.2 . Representative peak procedure
feeding latency has past is pointless. data: Probability that the mouse’s head was in
Peak-procedure data come from these un- the feeding hopper as a function of the time
rewarded trials. On such trials, the subject elapsed since the beginning of a trial and the
abruptly begins to respond some time before feeding latency. (The feeding latency varied
the expected end of the feeding latency and between blocks of trials.) A. The original data.
These peak curves are the cumulative
continues to peck or press or poke for some
distribution of start times (rising phase) minus
time after it has passed before abruptly stop- cumulative distribution of stop times (falling
ping. The interval during which the subject phase). These are the raw distributions (no curve
responds brackets its subjective estimate of has been fitted.) B. Same data as in A, data
the feeding latency. Representative data are replotted as a proportion of the feeding latency.
shown in Figure 23 .2. Because the variability in the onsets and offsets
Figure 23 .2A shows seemingly smooth of responding is proportional to the feeding
increases and decreases in the probability latency, as are the location of the means of the
that the mouse is making an anticipatory re- distributions relative to the target times, the peak
sponse (poking its head into the feeding hop- curves superpose when plotted as a proportion of
per in anticipation of food delivery) on either this latency. Data originally published by King,
McDonald, & Gallistel (2001 ).
side of the feeding latency. The smoothness
is an averaging artifact. On any one trial, the
onset and offset of anticipatory responding
is abrupt, but the temporal locus of these When the data in Figure 23 .2A are re-
onsets and offsets varies from trial to trial plotted against the proportion of the feed-
(Church, Meck, & Gibbon, 1 994). The peak ing latency elapsed, rather than against the
curves in Figure 23 .2, like peak curves in latency itself, the curves superpose (Fig-
general, are the cumulative start distribu- ure 23 .2B). Thus, both the location of the
tions minus the cumulative stop distribu- distributions relative to the target latency
tions, where start and stop refer to the on- and the trial-to-trial variability in the onsets
set and offset of sustained food anticipatory and offsets of responding are proportional to
behavior. the remembered latency. Put another way,
Figure 2 3.3. The probability of breaking off to try the feeding

alcove as a function of the number of presses made on the arming
lever and the number required to arm the food-release beam at the
entrance to the feeding alcove. Subjects were rats. Redrawn from
Platt & Johnson, 1 971 , with permission.
the probabilities that the subject will have break-off number also gets proportionately
begun to respond or will have stopped re- greater. Thus, behavior based on number
sponding are determined by the proportion also exhibits scalar variability.
of the remembered feeding latency that has The fact that behavior based on remem-
elapsed. This property of remembered dura- bered numerosity exhibits scalar variability
tions is called scalar variability. just like the scalar variability seen in behav-
Rats, pigeons, and monkeys also count ior based on the remembered magnitude of
and remember numerosities (Brannon & continuous quantities such as duration sug-
Roitman, 2003 ; Church & Meck, 1 984; De- gests that numerosity is represented in the
haene, 1 997; Dehaene, Dehaene-Lambertz, brains of nonverbal vertebrates by mental
& Cohen, 1 998; Gallistel, 1 990; Gallistel magnitudes; that is, by entities with the for-
& Gelman, 2000). One of the early pro- mal properties of the real numbers, rather
tocols for assessing counting and numerical than by discrete symbols such as words or
memory was developed by Mechner (1 95 8) bit patterns. When a device such as an analog
and later used by Platt and Johnson (1 971 ). computer represents numerosities by differ-
The subject must press a lever some num- ent voltage levels, noise in the voltages leads
ber of times (the target number) to arm to confusions between nearby numbers. If,
the infrared beam at the entrance to a feed- by contrast, a device represents countable
ing alcove. When the beam is armed, inter- quantity by countable (that is, discrete) sym-
rupting it releases food. Pressing too many bols, as do digital computers and written
times before trying the alcove incurs no number systems, then one does not expect
penalty beyond that of having made super- to see the kind of variability seen in Figures
numerary presses. Trying the alcove prema- 23 .2 and 23 .3 . The bit-pattern symbol for
turely incurs a 1 0-second time-out, which fifteen is 01 1 1 1 , for example, and for sixteen
the subject must endure before returning to it is 1 0000. Although the numbers are adja-
the lever to complete the requisite number cent in the ordering of the integers, the dis-
of presses. Data from such an experiment crete binary symbols for them differ in all
are shown in Figure 23 .3 . They look strik- five bits. Jitter in the bits (uncertainty about
ingly like the temporal data. The number whether a given bit was 0 or 1 ) would make
of presses at which subjects are maximally fourteen (01 1 1 0), thirteen (01 1 01 ), eleven
likely to break off pressing and try the al- (01 01 1 ), and seven (001 1 1 ) all equally and
cove peaks at or slightly beyond the required maximally likely to be confused with fif-
number for required numbers ranging from teen, because the confusion arises in each
four to twenty four. As the remembered tar- case from the misreading of one bit. These
get number gets larger, the variability in the dispersed numbers should be confused with
fifteen much more often than is the adja-

cent sixteen. (For an analysis of the error pat-
terns to be expected in cascade counters, see
Killeen & Taylor, 2001 ). Similarly, a scribe
copying a handwritten English text is pre-
sumably more likely to confuse “seven” and
“eleven” than “seven” and “eight.” The na-
ture of the variability in a remembered tar-
get number therefore suggests that what is
being remembered is a magnitude – some-
thing that behaves like a continuous quan-
tity, which is to say something with the for-
mal properties of a real number.
Figure 2 3.4. The accumulator model for the
nonverbal counting process. At each count, the
Numerosity and Duration Are brain increments a quantity – an operation
Represented by Comparable formally equivalent to pouring a cup into a
Mental Magnitudes graduate. The final magnitude (the contents of
the graduate at the conclusion of the count) is
stored in memory, where it represents the
Meck and Church (1 983 ) pointed out that numerosity of the counted set. Memory is noisy
the mental accumulator model that Gibbon (represented by the wave in the graduate),
(1 977) had proposed to explain the gen- which is to say that the values read from
eration of mental magnitudes representing memory on different occasions vary. The
durations could be modified to make it gen- variability in the values read from memory is
erate mental magnitudes representing nu- proportional to the mean value of the
merosities. Gibbon had proposed that while distribution (scalar variability).
a duration was being timed a stream of im-
pulses fed an accumulator, so that the ac- tially the same, differing only in the mapping
cumulation grew in proportion to the dura- process that generates them and, therefore,
tion of the stream. When the stream ended in what it is they refer to. Put another way,
(when timing ceased), the resulting accumu- both numerosity and duration are repre-
lation was read into memory, where it represented mentally by real numbers. Meck and
sented the duration of the interval. Meck and Church (1 983 ) compared the psychophysics
Church postulated that to get magnitudes of number and time representation in the rat
representing numerosity, the equivalent of a and concluded that the coefficient of varia-
pulse former was inserted into the stream of tion, the ratio between the standard devi-
impulses, so that for each count there was ation and the mean, was the same, which
a discrete increment in the contents of the is further evidence for the hypothesis that
accumulator, as happens when a cup of liq- the same system of real numbers is used in
uid is poured into a graduated cylinder (Fig- both cases.
ure 23 .4). At the end of the count, the re- The model in Figure 23 .4 was originally
sulting accumulation is read into memory, proposed to explain behavior based on the
where it represents the numerosity. numerosity of a set of serial events (for ex-
The model in Figure 23 .4 is the well- ample, the number of responses made), but
known accumulator model for nonverbal it may be generalized to the case in which
counting by the successive incrementation the items to be counted are presented all at
of mental magnitudes. It is also the origin of once – for example, as a to-be-enumerated
the hypothesis that the mental magnitudes visual array. In that case, each item in the
representing duration and the mental mag- array can be assigned a unit magnitude, and
nitudes representing numerosity are essen- the unit magnitudes can then be summed
(accumulated) across space, rather than over 2003 ; Dehaene, 1 997; Gallistel, 1 990;
time. Dehaene and Changeux (1 993 ) devel- Spelke & Dehaene, 1 999).
oped a neural net model based on this idea.
In their model, the activity aroused by each Adding Numerosities
item in the array is reduced to a unit amount
Boysen and Berntson (1 989) taught chim-
of activity, so that it is no longer proportional
panzees to pick the Arabic numeral corre-
to the size, contour, and so on, of the item.
sponding to the number of items they ob-
The units of activity corresponding to the
served. In the last of a series of tests of this
entities in the array are summed across the
ability, they had their subjects go around a
visual field to yield a mental magnitude rep-
room and observe either caches of actual or-
resenting the numerosity of the array.
anges in two different locations or Arabic
numerals that substituted for the caches
Nonhuman Animals Reason themselves. When they returned from a trip,
Arithmetically the chimps picked the Arabic numeral cor-
responding to the sum of the two numerosi-
ties they had seen, whether the numerosities
We have repeatedly referred to the real num-
had been directly observed (hence, possibly
ber system because numbers (or magnitudes)
counted) or symbolically represented (hence
are truly that only if they are arithmeti-
not counted). In the latter case, the mag-
cally manipulated. Being causally connected
nitudes corresponding to the numerals ob-
to something that can be represented nu-
served were presumably retrieved from a
merically does not make an entity in the
memory map relating the arbitrary symbols
brain or anywhere else a number. It also
for number (the Arabic numerals) to the
must be processed suitably. The defining fea-
mental magnitudes that naturally represent
tures of a numerical representation are: (1 )
those numbers. Once retrieved, they could
There is a causal mapping from discrete and
be added very much like the magnitudes
continuous quantities in the world to the
generated by the nonverbal counting of the
numbers. (2) The numbers are arithmeti-
caches. (For further evidence that nonver-
cally processed. (3 ) The mapping is usefully
bal vertebrates sum numerical magnitudes,
(validly) invertible: The numbers obtained
see Beran, 2001 ; Church & Meck, 1 984;
through arithmetic processing correctly re-
Hauser, 2001 , and citations therein; Olthof,
fer through the inverse mapping back to the
Iden, & Roberts, 1 997; Olthof & Roberts,
represented reality.
2000; Rumbaugh, Savage-Rumbaugh, &
There is a considerable experimental liter-
Hegel, 1 987.)
ature demonstrating that laboratory animals
reason arithmetically with mental magni-
Subtracting Durations and Numerosities
tudes representing numerosity and duration.
They add, subtract, divide, and order subjec- On each trial of the time-left procedure
tive durations and subjective numerosities; (Gibbon & Church, 1 981 ), subjects are of-
they divide subjective numerosities by sub- fered an ongoing choice between a steadily
jective durations to obtain subjective rates diminishing delay on the one hand (the time-
of reward; and they multiply subjective rates left option) and a fixed delay on the other
of reward by the subjective magnitudes of hand (the standard option). At an unpre-
the rewards to obtain subjective incomes. dictable point in the course of a trial, the
Moreover, the mapping between real mag- opportunity to choose ends. Before it gets
nitudes and their subjective counterparts is its reward, the subject must then endure the
such that their mental operations on subjec- delay associated with the option it was ex-
tive quantities enable these animals to be- ercising at that moment. If it was respond-
have effectively. Here we summarize a few ing at the so-called standard station, it must
of the relevant studies. (For reviews, see Boy- endure the standard delay; if it was respond-
sen & Hallberg, 2000; Brannon & Roitman, ing at the time-left station, it must endure
the time left. At the beginning of a trial, In this experiment, subjects chose the
the time left is much longer than the stan- number-left key when the subjective num-
dard delay, but it grows shorter as the trial ber left was less than some fraction of the
goes on, because the time so far elapsed in subjective number of flashes required on the
a trial is subtracted from the initial value standard key. Their behavior therefore was
to yield the time left. When the subjective controlled by the subjective ordering of a
time left is less than the subjective standard, subjective numerical difference and a sub-
subjects switch from the standard option to jective numerical standard. For an example
the time-left option. The subjective time of spontaneous subtraction in monkeys, see
left is the subjective duration of a remem- Sulkowski and Hauser (2001 ).
bered initial duration (subjective initial du- There also is evidence that the mental
ration) minus the subjective duration of the magnitudes representing duration and rates
interval elapsed since the beginning of the are signed – there are both positive and nega-
trial. In this experiment, therefore, subjects’ tive mental magnitudes (Gallistel & Gibbon,
behavior depends on the subjective order- 2000; Savastano & Miller, 1 998). In other
ing of a subjective difference and a subjec- words, there is evidence for subtraction and
tive standard (two of the basic arithmetic for the hypothesis that the system for arith-
operations). metic reasoning with mental magnitudes is
In the number-left procedure (Brannon, closed under subtraction.
et al., 2001 ), pigeons peck a center key to
generate flashes and to activate two choice
Dividing Numerosity by Duration
keys. The flashes are generated on a vari-
able ratio schedule, which means that the When vertebrates, from fish to humans, are
number of pecks required to generate each free to forage in two different nearby lo-
flash varies randomly between one and eight. cations, moving back and forth repeatedly
When the choice keys are activated, the pi- between them, the ratio of the expected
geons can get a reward by pecking either durations of the stays in the two locations
of them, but only after their pecking gener- matches the ratios of the numbers of rewards
ates the requisite number of flashes. For one obtained per unit of time (Herrnstein, 1 961 ).
of the choice keys, the so-called standard key, Until recently, it had been assumed that this
the requisite number is fixed and indepen- matching behavior depended on the law of
dent of the number of flashes already gener- effect. When subjects do not match, they get
ated. For the other choice, the number-left more reward per unit of time invested in one
key, the requisite number is the difference patch than per unit of time invested in the
between a fixed starting number and the other. Only when they match do they get
tally of flashes already generated by peck- equal returns on their investments. Match-
ing the center key. The flashes generated by ing therefore could be explained on the as-
pecking a choice key are also delivered on a sumption that subjects try different ratios
variable ratio schedule. of investments (different ratios of expected
The use of variable ratio schedules for stay durations) until they discover the ra-
flash generation partially dissociates time tio that equates the returns (Herrnstein &
and number. The number of pecks required Vaughan, 1 980).
to generate any given number of flashes – Gallistel et al. (2001 ) showed that rats
and, hence, the amount of time spent peck- adjust to changes in the scheduled rates of
ing – varies greatly from trial to trial. This reward as fast as it is in principle possi-
makes possible an analysis to determine ble to do so; they are ideal detectors of
whether subjects’ choices are controlled by such changes. They could not adjust so
the time spent pecking the center key or by rapidly if they were discovering by trial
the number of flashes generated. The analy- and error the ratio of expected stay dura-
sis shows that it was number, not duration, tions that equated their returns. The im-
that controlled the pigeons’ choices. portance of this in the present context
is that a rate is the number of events – that changing the ratio of the rates of reward
a discrete or countable quantity, which is the by a given factor changed the ratio of the ex-
kind of thing naturally represented by pos- pected stay durations by that factor, regard-
itive integers – divided by a continuous or less of the ratio of the reward magnitudes,
(uncountable) quantity – the duration of the thereby proving that subjective magnitudes
given interval, which is the kind of thing that combine multiplicatively with subjective
can be represented only by a real number. rates to determine the ratio of expected stay
Gallistel and Gibbon (2000) review the durations.
evidence that both Pavlovian and instrumen-
tal conditioning depend on subjects’ estimat-
Ordering Numerosities
ing rates of reward. They argue that rate of
reward is the fundamental variable in con- Most of the paradigms that demonstrate
ditioned behavior. The importance of this mental addition, subtraction, multiplication,
in the present context is twofold. First, it and division also demonstrate the order-
is evidence that subjects divide mental mag- ing of mental magnitudes, because the sub-
nitudes. Second, it shows why it is essen- ject’s choice depends on this ordering. Bran-
tial that countable and uncountable quantity non and Terrace (2000) demonstrated more
be represented by commensurable mental directly that monkeys order numerosities
symbols – symbols that are part of the by presenting simultaneously several arrays
same system and can be combined arithmeti- differing in the numerosity of the items
cally without regard to whether they repre- constituting each array and requiring their
sent countable or uncountable quantity. If macaque subjects to touch the arrays in
countable quantity were represented by one the order of their numerosity. When sub-
system (say, a system of discretely ordered jects had learned to do this for numerosi-
symbols, formally analogous to the list of ties between one and four, they generalized
counting words) and uncountable (continu- immediately to numerosities between five
ous) quantity by a different system (a system and nine.
of mental magnitudes), it would not be pos- The most interesting feature of Brannon
sible to estimate rates. The brain would have and Terrace’s results was that they found it
to divide mental apples by mental oranges.2 impossible to teach subjects to touch the ar-
rays in an order that did not conform to the
order of the numerosities (either ascending
Multiplying Rate by Magnitude
or descending). This implies that the order-
When the magnitudes of the rewards ob- ing of numerosities is highly salient for a
tained in two different locations differ, then monkey. It cannot ignore their natural order-
the ratio of the expected stay durations ing to learn an unnatural one. It also suggests
is determined by the ratio of the incomes that the natural ordering is not itself learned;
obtained from the two locations (Catania, it is inherent in the monkey’s representation
1 963 ; Harper, 1 982; Keller & Gollub, 1 977; of numerosity. What is learned is to respond
Leon & Gallistel, 1 998). The income from on the basis of numerical order, not the or-
a location is the product of the rate and the dering itself.
reward magnitude. This result implies that For further evidence that nonverbal ver-
subjects multiply subjective rates by sub- tebrates order numerosities and durations,
jective magnitudes to obtain subjective in- see Biro and Matsuzawa (2001 ), Brannon
comes. The signature of multiplicative com- and Roitman (2003 ), Brannon and Terrace
bination is that changing one variable by (2002), Carr and Wilkie (1 997), Olthof,
a given factor – for example, doubling the Iden, and Roberts (1 997), Rumbaugh and
rate – changes the product by the same factor Washburn (1 993 ), and Washburn and Rum-
(doubles the income) regardless of the value baugh (1 991 ).
of the other factor (the magnitude of the In summary, research with vertebrates,
rewards). Leon and Gallistel (1 998) showed some of which have not shared a common
ancestor with humans since before the rise

of the dinosaurs, implies that they represent
both countable and uncountable quantity by
means of mental magnitudes. The system
of arithmetic reasoning with these mental
magnitudes is closed under the basic opera-
tions of arithmetic; that is, mental magni-
tudes may be mentally added, subtracted,
multiplied, divided, and ordered without
restriction.
Humans Also Represent Numerosity

with Mental Magnitudes
The Symbolic Size and Distance Effects

It would be odd if humans did not share
with their remote vertebrate cousins (pi-
geons) and near vertebrate cousins (chim-
panzees) the mental machinery for repre-
senting countable and uncountable quantity
by means of a system of real numbers. That Figure 2 3.5. The symbolic and nonsymbolic size
humans do represent integers with mental and distance effects on the human reaction time
magnitudes was first suggested by Moyer while judging numerical order in the range from 1
and Landauer (1 967; 1 973 ) when they dis- to 9. In three of the conditions, the numerosities
covered what has come to be called the to be judged were instantiated by two dot arrays
symbolic distance effect (Figure 23 .5 ). When (nonsymbolic numerical ordering). The dots
subjects are asked to judge the numerical or- within each array were in either a regular
der of Arabic numerals as rapidly as possible, configuration, an irregular configuration that did
their reaction time is determined by the rela- not vary upon repeated presentation, or in
tive numerical distance: The greater the dis- randomly varying configurations. In the fourth
condition, the numerosities were represented
tance between the two numbers, the more
symbolically by Arabic numerals. The top panel
quickly their order may be judged. Sub- plots mean reaction times as a function of the
sequently, Parkman (1 971 ) further showed numerical difference. The bottom plots it as a
that the greater the numerical value of the function of the size of the smaller comparand.
smaller digit, the longer it takes to judge Replotted from Figures 23 .1 and 23 .2 in Buckley
their order (the size effect). The two effects & Gillman, 1 974.
together may be summarized under a sin-
gle law, namely that the time to judge the
numerical order of two numerals is a func- pared are actually instantiated (by visual ar-
tion of the ratio of the numerical magni- rays of dots) and when they are represented
tudes they represent. Weber’s law that the symbolically by Arabic numerals (Buckley &
ability of two magnitudes to be discrimi- Gillman, 1 974). The symbolic distance and
nated is a function of their ratio therefore ap- size effects are observed in the single-digit
plies to symbolically represented numerical range and in the double-digit range (De-
magnitude. haene, Dupoux, & Mehler, 1 990; Hinrichs,
The size and distance effects in human Yurko, & Hu, 1 981 ). That this effect of nu-
judgments of the ordering of discrete and merical magnitude on the time to make an
continuous quantities are robust. They are order judgment should appear for symbol-
observed when the numerosities being com- ically represented numerosities between 1
Figure 2 3.6. The reaction time and accuracy functions for monkey (Rhesus
macaque) and human subjects in touching the more numerous of two random
dot visual arrays presented side by side on a touch-screen video monitor.
Reproduced from Brannon & Terrace, 2002 with permission.
and 1 00 is decidedly counterintuitive. If in- and monkeys for judgments of the numer-
trospection were any guide to what one’s ical ordering of pairs of visually presented
brain was doing, one would think that the dot arrays are almost exactly the same (Fig-
facts about which numbers are greater than ure 23 .6).
which are stored in a table of some kind and Buckley and Gillman (1 974) modeled
simply looked up. In that case, why would it the underlying comparison process. In their
take longer to look up the ordering of 2 and model, numbers are represented in the brain
3 (or 65 and 62) than 2 and 5 (or 65 and 47)? by noisy signals (mental magnitudes) with
It does, however, and this suggests that the overlapping distributions. The closer two
comparison that underlies these judgments numerosities are in the ordering of nu-
operates with noisy mental magnitudes. Ac- merosities, the more their corresponding sig-
cording to this hypothesis the brain maps nal distributions overlap. When the subject
the numerals to the noisy mental magnitudes judges the ordering of two numerosities, the
that would be generated by the nonverbal brain subtracts the signal representing the
numerical estimation system if it enumer- one numerosity from the signal represent-
ated the corresponding numerosity. It then ing the other, and puts the signed difference
compares those two noisy mental magni- in an accumulator – a mechanism that adds
tudes to decide which numeral represents up inputs over, in this case, time. The accu-
the bigger numerosity. mulator for the ordering operation has fixed
On this hypothesis, the comparison that positive and negative thresholds. When its
mediates the verbal judgment of the numer- positive threshold is exceeded, it reports the
ical ordering of two Arabic numerals uses one number to be greater than the other
the same mental magnitudes and the same and vice versa when its negative threshold
comparison mechanism as that used by the is exceeded. If neither accumulator thresh-
nonverbal numerical reasoning system that old is exceeded, the comparator resamples
we are assumed to share with many nonver- the two signals, computes a second differ-
bal animals. Consistent with this hypothe- ence, based on the two new samples, and
sis is Brannon and Terrace’s (2002) finding adds it to the accumulator. The resampling
that reaction time functions from humans explains why it takes longer (on average) to
make the comparison when the numerosi- not to count but to say about how many
ties being compared are closer. The closer times they thought the dot had flashed. As
they are, the more their corresponding signal in the first experiment, the mean number es-
distributions overlap. The more these distri- timated increased in proportion to the num-
butions overlap, the more samples will have ber of flashes and the standard deviation of
to be made and added together (accumu- the estimates increased in proportion to the
lated) before (on average) a decision thresh- mean estimate. This implies that the map-
old is reached. ping between the mental magnitudes gener-
ated by nonverbal counting and the verbal
symbols for numerosities is bidirectional; it
Nonverbal Counting in Humans
can go from a symbol to a mental magnitude
Given the evidence from the symbolic size that is comparable to the one that would be
and distance effects that humans represent generated by nonverbal counting, and it can
number with mental magnitudes, it seems go from the mental magnitude generated by
likely that they share with the nonverbal a nonverbal count to a roughly correspond-
animals in the vertebrate clade a nonver- ing verbal symbol. In both cases, the variabil-
bal counting mechanism that maps from nu- ity in the mapping is scalar.
merosities to the mental magnitudes that Whalen et al. (1 999) gave several rea-
represent them. If so, then it should be pos- sons for believing that their subjects did not
sible to demonstrate nonverbal counting in count subvocally. We will not review them
humans when verbal counting is suppressed. here, because a subsequent experiment
Whalen, Gallistel, and Gelman (1 999) pre- speaks more directly to this issue (Cordes
sented subjects with Arabic numerals on a et al., 2001 ).
computer screen and asked them to press a Cordes et al. (2001 ) suppressed articula-
key as fast as they could without counting tion by having their subjects repeat a com-
until it felt like they had pressed the number mon phrase (“Mary had a little lamb”) while
signified by the numeral. The results from they attempted to press a target number of
humans looked very much like the results times, or by having subjects say “the” co-
from pigeons and rats: The mean number of incident with each press. In control exper-
presses increased in proportion to the target iments, subjects were asked to count their
number and the standard deviations of the presses out loud. In all conditions, subjects
distributions of presses increased in propor- were asked to press as fast as possible.
tion to their mean, so that the coefficient of The variability data from the condition
variation was constant. under which subjects were required to say
This result suggests, first, that subjects “the” coincident with each press are shown
could count nonverbally, and, second, that in Figure 23 .7 (filled squares). As in Whalen
they could compare the mental magnitude et al. (1 999), the coefficient of variation was
thus generated to a magnitude obtained us- constant (scalar variability). The best-fitting
ing a learned mapping from numerals to line has a slope that does not differ signif-
mental magnitudes. Finally, it implies that icantly from zero. The contrasting results
the mapping from numerals to mental mag- from the control conditions, in which sub-
nitudes is such that the mental magnitude jects counted out loud, are the open squares.
given by this mapping approximates the Here, the slope – on this log–log plot – does
mental magnitude generated by counting deviate very significantly from zero. In ver-
the numerosity signified by a given numeral. bal counting, one would expect counting
In a second task, subjects observed a dot errors – double counts and skips – to be the
flashing very rapidly but at irregular inter- most common source of variability. On the
vals. The rate of flashing (eight per sec- assumption that the probability of a count-
ond) was twice as fast as estimates of the ing error is approximately the same at suc-
maximum speed of verbal counting (Man- cessive steps in a count, the resulting vari-
dler & Shebo, 1 982). Subjects were asked ability in final counts should be binomial
nonverbal counting conditions and, more

importantly, it was binomial rather than
scalar. The mean slope of the subject-by-
subject regression lines in the two control
conditions was significantly less than zero
and not significantly different from −0.5 .
The contrasting patterns of variability in the
counting-out-loud and nonverbal counting
conditions strengthen the evidence against
the hypothesis that subjects in the non-
verbal counting conditions were counting
subvocally.
In sum, nonverbal counting may be de-
monstrated in humans, and it looks just like
nonverbal counting in nonhumans. More-
over, mental magnitudes (real numbers)
comparable to those generated by nonver-
bal counting appear to mediate judgments of
the numerical ordering of symbolically pre-
Figure 2 3.7. The coefficients of variation (σ /µ) sented integers. This suggests that the non-
are plotted against the numbers of presses for verbal counting system is what underlies and
the conditions in which subjects counted gives meaning to the linguistic representa-
nonverbally and for the condition in which they tion of numerosity.
fully pronounced each count word (double
logarithmic coordinates). In the former
condition, there is scalar variability; that is, a Nonverbal Arithmetic Reasoning
constant coefficient of variation. The slope of in Humans
the regression line relating the log of the
coefficient of variation to the log of mean In humans, as in other animals, nonver-
number of presses does not differ from zero. In bal counting would be pointless if they did
the latter, the variability is much less and it is not reason arithmetically with the result-
binomial; the coefficient of variation decreases ing mental magnitudes. Recent experiments
in proportion to the square root of the target give evidence that they can.
number. In the latter case, the slope of the Barth (2001 ; see also Barth et al., under
regression line relating the log of the coefficient review 2004) tested adults’ performance on
of variation to the log of the mean number of tasks that required the addition, subtraction,
presses differs significantly from zero but does multiplication, and division of nonverbally
not differ significantly from −0.5 , which is the
estimated numerosities, under conditions in
slope predicted by the binomial variability
which verbally mediated arithmetic was un-
hypothesis. Reproduced from Cordes et al.,
2001 , with permission. likely. Subjects were given instances of two
numerosities in rapid sequence, each in-
stance presented too quickly to be countable
rather than scalar. It should increase in pro- verbally. Then, they were given an instance
portion to the square root of the target value, of a third numerosity, and they indicated
rather than in proportion to the target value. by pressing one of two buttons whether the
If the variability is binomial rather than sum, or difference, or product, or quotient
scalar, then when the coefficient of variation of the first two numerosities was greater or
is plotted against the target number on a log– less than the third.
log plot, it should form a straight line with a The numerosities were presented either
slope of −0.5 . This, in fact, is what was ob- as dot arrays (with dot density and area
served in the out-loud counting conditions: covered controlled) or as tone sequences.
The variability was much less than in the In some conditions, presentation modalities
between a billion and a billion and one is

only one. The uncertainty (estimation noise)
in the operands must propagate to the result
of the operation, so the uncertainty about
the true value of a difference must depend
in no small measure on the magnitude of the
operands from which it derived. If one looks
only at the ratio of the difference to the other
comparand, one fails to take account of the
presumably inescapable impact of operand
magnitude on the noise in the difference.
Barth’s experiments establish by direct
Figure 2 3.8. The accuracy of order judgments test the human ability to combine noisy
for two nonverbally estimated numerosities. The nonverbal estimates of numerosity in accord
estimates of numerosity were based on direct with the combinatorial operations that de-
instantiations in the first condition (N1 < N2). fine the system of arithmetic. In her data
In the other conditions, one of them was derived (Figure 23 .8), as the proportion between the
from the composition of two other estimates. smaller and larger comparand increases to-
Data replotted from Barth, 2001 , p. 1 09.
ward unity, the accuracy of the comparisons
degrades in a roughly parallel fashion regard-
less of the derivation of the first comparand.
were mixed, so, for example, subjects com- This suggests that the scalar variability in
pared the sum of a tone sequence and a dot the nonverbal estimates of numerosity prop-
array to either another tone sequence or an- agates to the mental magnitudes produced
other dot array. by the composition of those estimates.
In Barth’s results, there was no effect of Barth’s data, however, do not directly
comparand magnitude on reaction time or demonstrate the variablity in the results of
accuracy, only an effect of their ratio. That composition nor allow one to estimate the
is, it did not matter how big the two nu- quantitative relation between the noise in
merosities were; only the proportion of the the operands and the noise in the result.
smaller to the larger affected reaction time Cordes et al. (submitted 2004) used the
and accuracy. The same proved to be true in previously described key-tapping paradigm
Barth’s experiments involving mental mag- to demonstrate the nonverbal addition and
nitudes derived by arithmetic composition. subtraction of nonverbal numerical esti-
This enables a comparison between the case mates and the quantitative relation between
in which the comparands are both given di- the variability in the estimates of the sums
rectly and the case in which one comparand and differences and the variability in the es-
is the estimated sum or difference of two estimates of the operands.
timated numerosities. As Figure 23 .8 shows, In the baseline condition of the Cordes
the accuracy of comparisons involving a sum et al. (submitted 2004) experiment, sub-
was only slightly less at each ratio of the com- jects saw a sequence of rapid, arrhythmic,
parands than the accuracy of a comparison variable-duration dot flashes on a computer
between directly given comparands. screen at the conclusion of which they at-
At a given comparand ratio, the accu- tempted to make an equivalent number of
racy of comparisons involving differences taps on one button of a two-button response
was less than the accuracy of a comparison box, tapping as rapidly as they could while
between directly given comparands (Figure saying the out loud coincident with each
23 .8). This could hardly be otherwise. For tap. In the compositional conditions, sub-
addition, the sum increases as the magni- jects saw one sequence on the left side of the
tude of the pair of operands increases, but screen, a second sequence on the right side,
for subtraction, it does not; the difference and were asked to tap out either the sum or
the difference. In the subtraction condition, was expected from the sum of the estimated
they pressed the button on the side they be- variances in the operands, but the variabil-
lieved to have had the fewer flashes as many ity in the addition results was greater than
times as they felt were required to make up expected.
the difference.
Sample results are shown in Figure 23 .9.
The numbers of responses subjects made, Retrieving Number Facts
in all cases, were approximately linear func-
tions of the numbers they were estimating, There is an extensive literature on reac-
demonstrating the subjects’ ability to add tion times and error rates in adults do-
and subtract the mental magnitudes repre- ing single-digit arithmetic (Ashcraft, 1 992;
senting numerosities. In the baseline condi- Campbell, 1 999; Campbell & Fugelsang,
tion, the variability in the numbers tapped 2001 ; Campbell & Gunter, 2002; Camp-
out was an approximately scalar function of bell, 2005 ; Campbell & Fugelsang, 2001 ;
the target number, although there was some Noel, 2001 ). It resists easy summary. How-
additive and binomial variability. ever, magnitude effects analogous to those
The variability in the addition data was found for order judgments are a salient and
also, to a first approximation, a scalar func- robust finding: The bigger the numerosities
tion of the objective sum. Not surprisingly, represented by a pair of digits, the longer
however, the variability in the subtraction it takes to recall their sum or product and
data was not. In addition, answer magni- the greater the likelihood of an erroneous
tude covaries with operand magnitude: The recall. The same is true in children (Camp-
greater the magnitude of the operands, the bell & Graham, 1 985 ). For both sets of num-
greater the magnitude of their sum.3 In sub- ber facts, there is a notable exception to this
traction, answer magnitude is poorly corre- generalization. The sums and products of
lated with operand magnitude because large- ties (for example, 4 + 4 or 9 × 9) are re-
magnitude operands often produce small called much faster than is predicted by the
differences. Insofar as the scalar variability in regressions for non-ties, although ties, too,
the estimates of operand magnitudes prop- show a magnitude effect (Miller, Perlmutter,
agates to the variability in the results of the & Keating, 1 984).
operations, there will be large variability in There is a striking similarity in the effect
these small differences. of operand magnitude on the reaction times
Cordes et al. (submitted 2004) fit regres- for both addition and multiplication. The
sion models with additive, binomial, and slopes of the regression lines (reaction time
scalar variance parameters to the baseline versus the sum or product of the numbers in-
data, and to the addition and subtraction volved) are not statistically different (Geary,
data. These fits enabled them to assess the Widman, & Little, 1 986). More importantly,
extent to which the magnitude of the pair Miller, Perlmutter, & Keating (1 984) found
of operands predicted the variability in their that the best predictor of reaction times for
sum and difference. On the assumption that digit multiplication problems was the reac-
there is no covariance in the operands, the tion times for digit addition problems, and
variance in the results of both subtraction vice versa. In other words, the reaction-time
and addition should be equal to the sum of data for these two different sets of facts,
the variances for the two operands. When which are mastered at different ages, show
Cordes et al. plotted predicted variabilty very similar microstructure.
against directly estimated variability (Figure These findings suggest a critical role
23 .9D), they found that the subtraction data for mental magnitudes in the retrieval
did conform approximately to expectations of the basic number facts (the addition
but that the addition data clearly fell above and multiplication tables) upon which ver-
the line. In other words, the variability in re- bally mediated computation strategies de-
sults of subtraction was approximately what pend. Whalen’s (1 997) diamond arithmetic
A. B.
C. D.
Figure 2 3.9. A. Number of responses (key taps) as a function of the number of flashes for one subject.
B. Number of responses as a function of the sum of the numbers of flashes in two flash sequences. C.
Number and sign (side) of the responses as a function of the difference between the numbers of
flashes in two sequences of flashes. D. Predicting the variability in the sums and differences from the
variability in the operands. Adapted from Cordes et al. (under review 2004) with permission.
experiment showed that these effects de- Two Issues

pend primarily on the magnitude of the
operands, not on the magnitude of the an-
What is the Form of Mapping from
swers, nor on the frequency with which dif-
Magnitudes to Mental Magnitudes?
ferent facts are retrieved (although these
may also contribute). Whalen (1 997) taught Weber’s law, that the discriminability of two
subjects a new arithmetic operation of his magnitudes (two sound intensities or two
own devising, the diamond operation. It was light intensities) is a function of their ra-
such that there was no correlation between tio, is the oldest and best established quan-
operand magnitude and answer magnitude. titative law in experimental psychology. Its
Subjects received equal practice on each implications for the question of the quanti-
fact, so explanations in terms of differen- tative relation between directly measurable
tial practice did not apply. When subjects magnitudes (hereafter called objective magni-
had achieved a high level of proficiency at tudes) and the mental magnitudes by which
retrieving the diamond facts, Whalen mea- they are represented (hereafter called sub-
sured their reaction times. He obtained the jective magnitudes) have been the subject of
same pattern of results seen in the retrieval analysis and debate for more than a cen-
of the facts of addition and multiplication. tury. This line of investigation led to work on
the mathematical foundations of measure- From Moyer and Landauer (1 967) to the
ment, work concerning the question of what present (Dehaene, 2002), this has been taken
it means to measure something (Krantz et to imply that subjective numerosity is a log-
al., 1 971 ; Krantz, 1 972; Luce, 1 990; Stevens, arithmic function of objective numerosity.
1 95 1 , 1 970). The key insight from work If that were so, and if subjects estimated
on the foundations of measurement is that the arithmetic differences between objec-
the quantitative form of the mapping from tive magnitudes from the arithmetic differ-
things to their numerical representives can- ences in the corresponding subjective mag-
not be separated from the question of the nitudes, then the Barth (2001 ) and Cordes
arithmetic operations that are validly per- et al. (submitted 2004) subtraction exper-
formed on the results of that mapping. The iments would have failed, and so would
question of the form of the mapping is mean- the experiments demonstrating subtraction
ingful only at the point at which the num- of time and number in nonverbal animals,
bers (magnitudes) produced by the mapping because the arithmetic difference between
enter into arithmetic operations. the logarithms of two magnitudes repre-
The discussion began when Fechner used sents their quotient, not their arithmetic
Weber’s results to argue that subjective mag- difference.
nitudes (for example, loudness and bright- In short, when subjects respond appropri-
ness) are logarithmically related to the ately to the arithmetic difference between
corresponding objective magnitudes (sound two numerical magnitudes, their behavior is
and light intensity). Fechner’s reasoning is not based on the arithmetic difference be-
echoed to the present day by authors who tween mental (subjective) magnitudes that
assume that Weber’s law implies logarithmic are proportional to the logarithms of the ob-
compression in the mapping from objective jective magnitudes. That much is clear. Ei-
numerosity to subjective numerosity. These ther (Model 1 ): The behavior is based on the
conjectures are uninformed by the literature arithmetic difference in mental magnitudes
on the measurement of subjective quantities that are proportional to the objective mag-
spawned by Fechner’s assumption. In deriv- nitudes (a proportional rather than loga-
ing logarithmic compression from Weber’s rithmic mapping). Or (Model 2): Dehaene
law, Fechner assumed that equally discrim- (2001 ) has suggested that mental magni-
inable differences in objective magnitude tudes are proportional to the logarithms
correspond to equal differences in subjec- of objective magnitudes and that, to ob-
tive magnitude. When you directly ask sub- tain from them the mental magnitude cor-
jects whether they think just discriminable responding to the objective difference, the
differences in, for example, loudness, repre- brain uses a look-up table, a procedure anal-
sent equal differences, however, they do not; ogous to the procedure that Whalen’s (1 997)
they think a just discriminable difference be- subjects used to retrieve the facts of diamond
tween two loud sounds is greater than the arithmetic. In this model, the arithmetic dif-
just discriminable difference between two ference between two mental magnitudes is
soft sounds (Stevens, 1 95 1 ). irrelevant; the two magnitudes serve only
The reader will recognize that Barth per- to specify where to enter the look-up ta-
formed both experiments – the discriminable – where in memory the answer is to
tion experiment (Weber’s experiment) and be found.
the difference judging experiment – but with In summary, there are two intimately in-
numerosities instead of noises. In the dis- terrelated unknowns concerning the map-
crimination experiment, she found that We- ping from objective to subjective magni-
ber’s law applied: Two pairs of nonverbally tudes – the form of the mapping and the
estimated numerosities can be correctly or- formal character of the operations on the re-
dered 75 % of the time when N1 /N2 = sults of the mapping. Given the experimen-
N3 /N4 = .83 , where N now refers to the tal evidence showing valid arithmetic pro-
(objective) numerosity of a set (Figure 23 .8). cessing, knowing either would fix the other.
In the absence of firm knowledge about

either, can behavioral experimental evidence
decide between the alternative models? Per-
haps not definitively, but there are relevant
considerations. The Cordes et al. (submit-
ted 2004) experiment estimates the noise in
the results of the mental subtraction oper-
ation at and around zero difference (Figure
23 .9C). There is nothing unusual about the
noise around answers of approximately zero.
It is unclear what assumptions about noise
would enable a logarithmic mapping model
to explain this. The logarithm of a quantity
goes to minus infinity as the quantity ap-
proaches zero, and there are no logarithms
for negative quantities. On the assumption
that realizable mental magnitudes, like real-
izable nonmental magnitudes, cannot be in-
finite, the model has to treat zero as a spe-
cial case. How the treatment of that special
case could exhibit noise characteristics of a
piece with the noise well away from zero
is unclear.
It is also unclear how the logarithmic- Figure 2 3.1 0. Estimates of dot numerosity (top)
mapping-plus-table-lookup model can deal and time to make an estimate (bottom) as
with the fact that the sign of a difference functions of the number of dots in
is not predictable a priori. In this model, tachistoscopically presented arrays of randomly
positioned dots. Plotted from the data for the
a bigger magnitude (number) cannot be
speeded instruction group in Table 1 of Kaufman
subtracted from a smaller, because the re- et al., 1 949, p. 5 1 0.
sulting negative number does not have a
logarithm; there is no way to represent a
negative magnitude in a scheme in which jects arrive at estimates for numerosities of
magnitudes are represented by their loga- five or fewer are distinct from the processes
rithms. Thus, this model is not closed under by which they arrive at estimates for nu-
subtraction. merosities of seven or more. Kaufman et
al. (1 949) coined the term subitizing to de-
scribe the process that operates in the range
Is There a Distinct Representation for
below six.
Small Numbers?
When the dot array to be enumerated is
When instantiated as arrays of randomly ar- displayed until the subject responds, rather
ranged small dots, presented for a fraction than very briefly by a tachistoscope, the re-
of a second, small numerosities can be es- action time function is superimposable on
timated more quickly than large ones, but the one shown in Figure 23 .1 0, up to and
only up to about six. Thereafter, the esti- including numerosity six. It does not level
mates increase more or less linearly with the off at six, however; rather, it continues with
number of dots, but the reaction time is flat the same slope (about 3 25 ms/dot) indefi-
(Figure 23 .1 0). nitely (Jensen, Reese, & Reese, 1 95 0). This
Subjects’ confidence in their estimates slope represents the time it takes to count
also falls off precipitously after six (Kauf- subvocally. The discontinuity at six there-
man et al., 1 949; Taves, 1 941 ). This led Taves fore represents the point at which a non-
to argue that the processes by which sub- verbal numerosity–estimating mechanism or
process takes over from the process of are directly perceived, as was first pointed
verbal counting, because, presumably, it is out by the authors who coined the term
not possible to count verbally more than six subitizing (Kaufman, et al., 1 949).
items under tachistoscopic conditions. Gallistel and Gelman (1 992) and De-
The nonverbal numerosity-estimating haene and Cohen (1 994) suggested that,
process is probably the basis for the demon- in the subitizing range, there is a transi-
strated capacity of humans to compare (or- tion from a strategy based on mapping from
der) large numerosities instantiated either nonverbally estimated mental magnitudes to
visually or auditorily (Barth, Kanwisher, & a strategy based on verbal counting. This
Spelke, 2003 ). The reaction times and accu- hypothesis has recently received important
racies for these comparisons show the We- support from a paper by Whalen and West
ber law characteristic, which is a signature (2001 ). By strongly encouraging rapid, ap-
of the process that represents numerosities proximate estimates and taking measures
by mental magnitudes rather than by dis- to make verbal counting more difficult,
crete wordlike symbols (Cordes et al., 2001 ). Whalen & West (2001 ) obtained a reaction
The assumption that the representation is by time function with a slope of 47 ms per item,
mental magnitudes regardless of the mode from one to sixteen items.
of presentation is consistent with the finding The coefficient of variation in the esti-
that there is no cost to cross-modal compar- mated numbers was constant from 1 to 1 6, at
isons of large numerosities; these compar- about 1 4.5 %, which is close to the value of
isons take no longer and are no more inaccu- 1 6% in the animal timing literature (Gallis-
rate than comparisons within presentation tel, King, & McDonald, 2004). The Whalen
modes (Barth et al., 2003 ). et al. data therefore show scalar variabil-
There is controversy about the implica- ity in rapid number estimates all the way
tions of the reaction time function within down to estimates of one and two, as do
the subitizing range below six. In this range, the data of Cordes et al. (2001 ). Whalen,
there is approximately a 3 0-ms increment in & West (2001 ) show that with this level
going from one to two dots, an 80-ms incre- of noise in the mental magnitudes being
ment in going from two to three, and a 200- mapped to number words, the expected
ms increment in going from three to four. percent errors in the resulting verbal esti-
These are large increments. The net incre- mates of numerosity are close to zero in
ment from one to four is about 3 00 ms, half the range one to three and increase rapidly
the total latency to respond to a one-item ar- thereafter – in close accord with the exper-
ray (Jensen, Reese, & Reese 1 95 0; Kaufman imentally observed percent errors in their
et al., 1 949; Mandler & Shebo, 1 982). More- speeded condition (Figure 23 .1 1 ). This ex-
over, the increments increase at each step. plains why subjects in experiments in which
In particular, the step from two to three is it is not strongly discouraged switch to sub-
significantly greater than the step from one vocal verbal counting somewhere between
to two in almost every data set. four and six, and why their confidence in
It is often claimed that there is a discon- their speeded estimates falls off rapidly af-
tinuity in the reaction time function within ter six (Kaufman et al., 1 949; Taves, 1 941 ).
the subitizing range (Davis & Pérusse, 1 988; Whalen et al. (under review) attribute the
Klahr & Wallace, 1 973 ; Piazza et al., 2003 ; constant slope of 47 ms/item in the speeded
Simon, 1 999; Strauss & Curtis, 1 984; Wood- reaction time function to a serial nonver-
worth & Schlosberg, 1 95 4); but it also often bal counting process. In short, the reaction
has been pointed out that there is no em- time function does not support the hypoth-
pirical support for this claim (Balakrishnan esis that there are percepts of twoness and
& Ashby, 1 992). Because the reaction time threeness, constituting a representation of
function is neither flat nor linear in the range small numerosities incommensurable with
from one to three, it offers no support for the mental magnitudes that represent other
the common theory that very small numbers numerosities.
broad agreement on this conclusion within

the literature on numerical cognition be-
cause of the abundant evidence for Weber-
law characteristics in symbolic numerical
behavior. The literature on the deficits in nu-
merical reasoning seen in brain-injured pa-
tients is broadly consistent with this same
conclusion (Dehaene, 1 997; Noel, 2001 ).
It also seems plausible that the nonver-
bal system of numerical reasoning mediates
verbally expressed numerical reasoning. It
seems plausible, for example, that adults be-
Figure 2 3.1 1 . The observed percent errors as a lieve that (2 + 1 ) > 2 and four minus two
function of number of dots in Whalen’s speeded is less than four because that is the behav-
condition compared with the percent expected ior of the mental magnitudes to which they
from the hypothesis that the estimates were (unconsciously) refer those symbols to en-
obtained by way of a mapping from nonverbal dow them with meaning and reference to
mental magnitudes to the corresponding number
the world.
words and that the mental magnitudes had scalar
Empiricists offer as an alternative the hy-
variability with a coefficient of variation of 0.1 45 .
Reproduced from Whalen et al. (under review) pothesis that adults believe these symbolic
with permission. propositions because they have repeatedly
observed that the properties of the world
to which the words or symbols refer behave
in this way. Adults know, for example, that
The Development of Verbal the word two refers to every set that can
Numerical Competence be placed in one–one correspondence with
some foundational set of two and likewise,
It appears that the system of nonverbal men- mutatis mutandis, for the word one, and that
tal magnitudes plays a fundamental role the phrase plus refers to the uniting of sets,
in verbal numerical behavior: When verbal and that the phrase greater than refers to the
counting is too slow to satisfy time con- relation between a set and its proper subsets,
straints, it mediates the finding of a number and so on. From an empiricist’s perspective,
word that specifies approximately the nu- the words have these real world references
merosity of a set. It mediates the ordering of only by virtue of the experiences adults have
the symbolic numbers and the numerosities had, which are ubiquitous and universal.
they represent. And it mediates the retrieval Nativists or rationalists respond that ref-
of the verbal number facts (the addition and erence to the world by verbal expressions is
multiplication tables) upon which verbal mediated by preverbal world-referring sym-
computational procedures rest. All of these bolic systems in the mind of the hearer and
roles require a mapping between the men- that the ubiquity and universality of the ex-
tal magnitudes that represent numerosity periences that are supposed to have created
and number words and written numerals. In world-reference for these expressions are
the course of ordinary development, there- grounds for supposing that symbolic systems
fore, humans learn a bidirectional mapping with these properties are part of the innate
between the mental magnitudes that repre- furniture of the mind. We will not pursue
sent numerosity and the words and numerals this old debate further, except to note the
that represent numerosity (Gallistel & Gel- possible relevance of the experiments previ-
man, 1 992; Gelman & Cordes, 2001 ). They ously reviewed demonstrating that nonver-
make use of this bidirectional mapping in bal animals reason arithmetically about both
talking about number and the effects of com- numerosities (integer quantities) and magni-
binatorial operations with numbers. There is tudes (continuous quantities).
We turn instead to the experimental lit- Gelman and her collaborators argue that
erature on numerical competence in very the principles that govern nonverbal count-
young children. It is difficult to demonstrate ing inform the child’s counting behav-
conclusively behavior based on numerosity ior from its inception (Gelman & Gallis-
in infants because it is hard not to confound tel, 1 978). Children recognize that number
variation in one or more continuous quanti- words reference numerosities because they
ties with variation in numerosity, and infants implicitly recognize that they are generated
often respond on the basis of continuous di- by a process homomorphic to the nonverbal
mensions of the stimulus (Clearfield & Mix, counting of serially considered sets. Num-
1 999; Feigenson, Carey, & Spelke, 2002; see ber words have meaning for the child, as for
Mix, Huttenlocher, & Levine, 2002, for re- the adult, because it recognizes at an early
view). Nonetheless, there are studies that age that they map to the mental magnitudes
appear to demonstrate sensitivity to numer- by which the nonverbal mind represents nu-
ical order in infants (Brannon, 2002). More- merosities. On this account, the child’s mind
over, the ability of infants to discriminate tries to apply from the outset the Gelman
sets on the basis of numerosity extends to and Gallistel counting principles (Gelman &
pairs as large as eight versus sixteen (Lipton Gallistel, 1 978) – that counting must involve
& Spelke, 2003 ; Xu & Spelke, 2000). As a a one–one assignment of words to items in
result, there is reason to suppose that prever- the set, that the words must be taken from
bal children share with nonverbal animals a a stably ordered list, and that the last word
nonverbal representation of numerosity. represents the cardinality of the set. It takes
The assumption that preverbal children a long time to learn the list and to implement
represent numerosities by a system of men- the verbal counting procedure flawlessly, be-
tal magnitudes homologous to the system cause list learning is hard, because the im-
found in nonverbal animals is the foundation plementation of the procedure is challeng-
of the account of the development of verbal ing (Gelman & Greeno, 1 989), and because
numerical competence suggested by Gel- the child is often confused about what the
man and her collaborators (Gelman & Bren- experimenter wants.
neman, 1 994; Gelman & Cordes, 2001 ; Gel- Critical to Gelman’s account is evidence
man & Williams, 1 998). They argue that the that during the period when they are learn-
development of verbal numerical compe- ing to count children already understand
tence begins with learning to count, which is that the last count word represents a prop-
guided from the outset by the child’s recog- erty of the set about which it is appropri-
nition that verbal counting is homomorphic ate to reason arithmetically. Without such
to nonverbal counting. In nonverbal count- evidence, there is no ground for believing
ing, the pouring of successive cups into the that the child has a truly numerical rep-
accumulator (the addition of successive unit resentation. Evidence on this crucial point
magnitudes to a running sum) creates a one- comes from the so-called magic experiments
to-one correspondence between the items in (Brannon & Van de Walle, 2001 ; Bullock &
the enumerated set and a sequence of men- Gelman, 1 977; Gelman, 1 972, 1 977, 1 993 ).
tal magnitudes. Although the mental magni- These experiments drew children into a
tudes thus created have the formal proper- game in which a winner and loser plate could
ties of real numbers, the process that creates be distinguished on the basis of the num-
them generates a discretely ordered se- ber of toy mice they contained. The task en-
quence of mental magnitudes, an ordering gaged children’s attention and caused them
in which each magnitude has a next magni- to justify their judgments as to whether an
tude. The final magnitude represents the nu- uncovered plate was or was not the win-
merosity of the set. Verbal counting does the ner. Children as young as two and a half
same thing; it assigns successive words from years indicated that the numerosity was the
an ordered list to successive items in the set decisive dimension, and they spontaneously
being enumerated, with the final word rep- counted to justify their judgment that the
resenting the cardinality of the set. plate with the correct numerosity was the
winner. On magic trials, a mouse was sur- in mind in this connection is the finding
reptitiously added or subtracted from the that monkeys cannot be taught to order nu-
winner plate during the shuffling, so that it merosities in other than a numerical order
had the same numerosity as the loser plate. (Brannon & Terrace, 2000), even though
Now, both plates when uncovered were re- they can be taught to order things other than
vealed to be loser plates. In talking about numerosities in an arbitrary, experimenter-
what surprised them, children indicated that imposed order (Terrace, Son, & Brannon,
something must have been added or sub- 2003 ). This implies that numerical order is
tracted, and they counted to justify them- spontaneously salient to a monkey.
selves. This is strong evidence that chil- The account offered by Carey (Carey,
dren as young as two and one half years of 2001 a, 2001 b) begins with the assumption
age understand that counting gives a rep- that convincing cases of infant number dis-
resentation of numerosity about which it crimination involving numbers less than four
is appropriate to reason arithmetically. This may depend on the object tracking system.
is well before they become good counters In Wynn’s (1 992a) experiment, for exam-
(Fuson, 1 988; Gelman & Gallistel, 1 978; ple, the infants saw an object appear to join
Hartnett & Gelman, 1 998). Surprised two- or leave one or two objects behind an oc-
and-half-year-olds made frequent use of cluding screen. They were surprised when
number words. They used them in idiosyn- the screen was removed to reveal a number
cratic ways, but ways that nonetheless con- of objects different from the number that
formed to the counting principles (Gelman, ought to have been there. This surprise may
1 993 ), including the cardinality principle. have arisen only from the infant’s belief in
A second account of the development object permanence.
of counting and numerical understanding When an infant sees an object move be-
grows, first, out of the conviction of many hind an occluding screen, the subsequent re-
researchers that, although two-year-olds moval of which fails to reveal an object, the
count, albeit badly, they do not understand infant is surprised (Baillargeon, 1 995 ; Bail-
what they are doing (Carey, 2001 a, 2001 b; largeon, Spelke, & Wasserman, 1 985 ). The
Fuson, 1 988; Mix, Huttenlocher, & Levine child’s surprise presumably is mediated by
2002; Wynn, 1 990; Wynn, 1 992b). It rests, a system for tracking objects, such as the
secondly, on evidence suggesting that in the object file system suggested by Kahneman,
spontaneous processing of numerosities by Treisman, and Gibbs (1 992) or the FINST
infants and monkeys, there is a discontinuity system suggested by Pylyshyn and Storm
between numbers of four or less and big- (1 988). This system maintains a marker (ob-
ger numbers. In some experiments, the in- ject file or FINST) for each object it is track-
fant and monkey subjects discriminate all ing, but it can only track about four objects
numerosity pairs in the range one to four (Scholl & Pylyshyn, 1 999). As a result, in-
but fail to discriminate pairs that include a fants in experiments like Wynn’s are sur-
numerosity outside that range (e.g., <3 ,6>), prised for the same reason as in original
even when, as in the example, their ratio object-permanence experiments: An object
is greater than the ratio between discrim- is missing. The infant has an active mental
inable pairs of four or less (Feigenson, Carey, marker or pointer that no longer points to
& Hauser, 2002; Uller, et al., 1 999; Uller, an object. Alternatively, there is an object
Hauser, & Carey, 2001 ). for which it has no marker.
How to reconcile these latter findings Carey argues that sets of object files are
with the finding that infants do discriminate the foundations on which the understanding
the pair <8,1 6> (Lipton & Spelke, 2003 ; Xu of integers rests. The initial meaning of the
& Spelke, 2000) is unclear. Similarly, it is words one, two, three, and four does not come
unclear how to reconcile the monkey find- from the corresponding mental magnitudes;
ings with the literature showing the discrim- rather, it comes from sets of object files. The
ination of numerosities small and large in child comes to recognize the ordering of the
nonverbal animals. Particularly to be borne referents of one, two, three, and four because
a set of two active object files has as a proper <2,3 >, <2,4>, <3 ,3 >, and <3 ,4>). Five
subset a set of one object file, and so on. The of the nine pairs, when composed (united)
child comes to recognize that addition ap- yield a set too numerous to be a set of ob-
plies to the things referred to by these words ject files. From this foundation, the mind of
because the union of two sets of object files the child is said to infer that the numbers
yields another set of object files (provided may be extended indefinitely by addition.
the union does not create a set greater than One wants to know what the inference rule
four). This is the foundation of the child’s is that ignores the many negative instances
belief in the successor principle: Every inte- in the base data set.
ger has a unique successor.
This account seems to ignore the basic
function of a set of, for example, two object
files (FINSTs, pointers), which is to point Conclusions and Future Directions
to two particular objects. If two referred to
a particular set of two object files, it pre- There is a widespread consensus, backed by
sumably would be usable only in connection a large and diverse experimental literature,
with the two objects it pointed to. It would that adult humans share with nonverbal ani-
be a name for that pair of objects, not for mals a nonverbal system for representing dis-
all sets that share with that set the property crete and continuous quantity that has the
of twoness. formal properties of continuous magnitudes.
A particular set of pointers cannot sub- Mental magnitudes represent quantities in
stitute for (is not equal to) another such set the same sense that, given a proper mea-
without loss of function, because its function surement scheme, real numbers represent
is to point to one pair of objects, whereas line lengths. That is, the brains of nonverbal
the function of another such set is to point animals perform arithmetic operations with
to a different pair. There is no reason to be- mental magnitudes; they add, subtract, mul-
lieve that there is any such thing as a gen- tiply, divide, and order them. The processes
eral set of two pointers – a set that does or mechanisms that map numerosities (dis-
not point to any particular set of two ob- crete quantities) and magnitudes (continu-
jects, but represents all the sets that do so ous quantities) into mental magnitudes, and
point. Any set of two object files is an in- the operations that the brain performs on
stance of a set with the twoness property (a them, are together such that the results of
token of twoness), but it can no more repre- the operations are approximately valid, al-
sent twoness than a name that picks out one beit imprecise; the results of computations
particular dog (e.g., Rover) can represent the on mental magnitudes map appropriately
concept of a dog. A precondition of Rover’s back onto the world of discrete and contin-
serving the latter function is that it not serve uous quantity.
the former. By contrast, any instance of the Scalar variability is a signature of the men-
numeral 2 can be substituted for any other tal magnitude system. Scalar variability and
without loss of function, and so can a pair of Weber’s law are different sides of the same
hash marks. coin: Models that generate scalar variabil-
A second problem with this account is ity also yield Weber’s law. There are two
that it is unclear how a system so lacking such models. One assumes that the mapping
in closure could be the basis for inferring a from objective quantity to subjective quan-
system, the function of which depends so tity (mental magnitude) is logarithmic; the
strongly on closure. The Carey suggestion is other assumes that it is scalar. Both assume
motivated by findings that the maximum nu- noise. That is, they assume that the signal
merosity of a set of active object files is at corresponding to a given objective quantity
most four. There are only nine numerically varies from occasion to occasion in a man-
distinct unordered pairs of sets of four or ner described by a Gaussian probability den-
less (<1 ,1 >, <1 ,2>, <1 ,3 >, <1 ,4>, <2,2>, sity function. The variation is on the order
of 1 5 % in both animal timing and human account of the development of verbal nu-
speeded number estimation. merical competence assumes that it is di-
The first model (logarithmic mapping) as- rected from the outset by the mental magni-
sumes that scalar behavioral variability re- tude system. The homomorphism between
flects a constant level of noise in the sig- serial nonverbal counting and verbal count-
nal distributions. This yields proportional ing is what causes the child to appreciate the
(scalar) variability, because constant log- enumerative function of the count words.
arithmic intervals correspond to constant The child attends to these words because of
proportions in the corresponding nonlog- the homomorphism. Learning their mean-
arithmic magnitudes. The second model ing is the process of learning their mapping
(scalar mapping) assumes scalar variability to the mental magnitudes. Another account
in the underlying signal distributions. The assumes that the count words from one to
overlap in the two signal distributions is a four are initially understood to refer to sets
function only of the ratio between the rep- of object files – mental pointers that pick
resented numerosities in both models, which out particular objects. On this account, the
is why they both predict Weber’s law. learning of the mapping to mental magni-
Both models assume there is only one tudes comes later, after the child has exten-
mapping from objective quantities to sub- sive counting experience.
jective quantities (mental magnitudes), but
there is no compelling reason to accept this
assumption. The question of the quantita- Acknowledgments
tive form of the mapping makes sense only
at the point at which the mental magni-
Some of the research by the authors reported
tudes enter into combinatorial operations.
in this chapter was supported by NSF Grants
The form may differ for different combina-
SBR-9209741 and NSF DFS-920974 to Gel-
torial operations. In the future, the analysis
man, NIH Grant MH 63 866 to Gallistel, and
of variability in the answers from nonverbal
NSF No.SPR9720402 to both Gelman and
arithmetic may decide between the models.
Gallistel.
An important component of future models,
therefore, must be the specification of how
variability propagates from the operands to
the answers. Notes
The system of mental magnitudes plays
many important roles in verbalized adult 1 . Technically, not really true, because Cantor
number behavior. For example, it mediates discovered a way to assign a unique positive
judgments of numerical order and the re- integer to every rational number. The integers
trieval of the verbal number facts (addition his procedure assigns, however, are useless for
and multiplication tables) upon which ver- computational purposes.
balized and written calculation procedures 2. Fortran and C programmers, who have made
depend. It also mediates the finding of num- the mistake of dividing an integer variable by
ber words to represent large numerosities, a floating point variable will know whereof
we speak.
presented too briefly to be verbally counted,
and, more controversially, the rapid retrieval 3 . The magnitude of a pair of numbers is the
square root of the sum of their squares.
of number words to represent numerosities
in the subitizing range (one through six).
Any account of the development of verbal
numerical competence must explain how References
subjects learn the bidirectional mapping be-
tween number words and mental magni- Ashcraft, M. H. (1 992). Cognitive arithmetic: A
tudes, without which mental magnitudes review of data and theory. Cognition, 44(1 –2),
could not play the roles just described. One 75 –1 06.
Baillargeon, R. (1 995 ). Physical reasoning in in- imental Psychology: Animal Behavior Processes,
fancy. In M. S. Gazzaniga (Ed.), The Cognitive 2 6(1 ), 3 1 –49.
Neurosciences (pp. 1 81 –204). Cambridge, MA: Brannon, E. M., & Van de Walle, G. A. (2001 ).
MIT Press. The development of ordinal numerical compe-
Baillargeon, R., Spelke, E., & Wasserman, E. tence in young children. Cognitive Psychology,
(1 985 ). Object permanence in 5 -month-old in- 43 (1 ), 5 3 –81 .
fants. Cognition, 2 0, 1 91 –208. Brannon, E. M. & Terrace, H. S. (2002). The evo-
Balakrishnan, J. D., & Ashby, F. G. (1 992). lution and ontogeny of ordinal numerical abil-
Subitizing: Magical numbers or mere supersti- ity. In M. Bekoff, C. Allen, et al. (Eds.), The
tion. Psychological Research, 5 4, 80–90. Cognitive Animal: Empirical and Theoretical Per-
Barth, H., Kanwisher, N., & Spelke, E. (2003 ). spectives on Animal Cognition (pp. 1 97–204).
The construction of large number representa- Cambridge, MA: MIT Press.
tions in adults. Cognition, 86(3 ), 201 –221 . Brannon, E. M., Wusthoff, C. J., Gallistel, C. R.,
Barth, H., La Mont, K., Lipton, J., Dehaene, & Gibbon, J. (2001 ). Numerical subtraction
S., Kanwisher, N., & Spelke, E. (under review in the pigeon: Evidence for a linear subjec-
2003 ). Nonsymbolic arithmetic in adults and tive number scale. Psychological Science, 1 2 (3 ),
young children. 23 8–243 .
Barth, H. C. (2001 ). Numerical Cognition in Buckley, P. B., & Gillman, C. B. (1 974). Com-
Adults: Representation and Manipulation of parisons of digits and dot patterns. Journal of
Nonsymbolic Quantities. Unpublished disserta- Experimental Psychology, 1 03 , 1 1 3 1 –1 1 3 6.
tion, MIT, Cambridge, MA. Bullock, M., & Gelman, R. (1 977). Numerical
Beran, M. J. (2001 ). Summation and numerous- reasoning in young children: The ordering prin-
ness judgments of sequentially presented sets of ciple. Child Development, 48, 427–43 4.
items by chimpanzees (Pan troglodytes). Journal Campbell, J. I. D. (1 999). Division by multi-
of Comparative Psychology, 1 1 5 , 1 81 –1 91 . plication. Memory and Cognition, 2 7(5 ), 791 –
Biro, D., & Matsuzawa, T. (2001 ). Chimpanzee 802.
numerical competence: Cardinal and ordi- Campbell, J. I. D., & Fugelsang, J. (2001 ). Strat-
nal skills. In Matsuzawa, Tetsuro (Ed.), Pri- egy choice for arithmetic verification: Effects of
mate Origins of Human Cognition and Behavior numerical surface form. Cognition, 80(3 ), B21 –
(pp. 1 99–225 ). New York: Springer-Verlag. B3 0.
Boyer, C. B., & Merzback, U. C. (1 989). A Campbell, J. I. D., & Epp, L. J. (2005 ). Architec-
History of Mathematics (2nd Ed.) New York: tures for arithmetic. In J. I. Campbell (Ed.),
Wiley. Handbook of Mathematical Cognition. (3 47–
Boysen, S. T., & Berntson, G. G. (1 989). Nu- 3 60) Psychology Press.
merical comptetence in a chimpanzee (Pan Campbell, J. I. D., & Fugelsang, J. (2001 ). Strat-
troglodytes). Journal of Comparative Psychology, egy choice for arithmetic verifcation: Effects of
1 03 , 23 –3 1 . numerical surface form. Cognition, 80(3 ), B21 –
Boysen, S. T., & Hallberg, K. I. (2000). Primate B3 0.
numerical competence: Contributions toward Campbell, J. I. D., & Graham, D. J. (1 985 ). Mental
understanding nonhuman cognition. Cognitive multiplication skill: Structure, process, and ac-
Science, 2 4(3 ), 423 –443 . quisition. Canadian Journal of Psychology. 3 9,
Brannon, E. M. (2002). The development of ordi- 3 3 8–3 66.
nal numerical knowledge in infancy. Cognition, Campbell, J. I. D., & Gunter, R. (2002). Calcula-
83 (3 ), 223 –240. tion, culture, and the repeated operand effect.
Brannon, E. M., & Roitman, J. D. (2003 ). Nonver- Cognition, 86(1 ), 71 –96.
bal representations of time and number in ani- Carey, S. (2001 a). Cognitive foundations of
mals and human infants. In W. H. Meck (Ed.), arithmetic: Evolution and ontogenesis. Mind
Functional and Neural Mechanisms of Interval and Language, 1 6, 3 7–5 5 .
Timing (pp. 1 43 –1 82). Boca Raton, FL: CRC Carey, S. (2001 b). On the very possibility of dis-
Press. continuities in conceptual development. In E.
Brannon, E. M., & Terrace, H. S. (2000). Rep- Dupoux (Ed.), Language, Brain, and Cognitive
resentation of the numerosities 1 –9 by rhesus Development (pp. 3 04–3 24). Cambridge, MA:
macaques (Macaca mulatta). Journal of Exper- MIT Press.
Carr, J. A. R., & Wilkie, D. M. (1 997). Rats use Dehaene, S., Dupoux, E., & Mehler, J. (1 990). Is
an ordinal timer in a daily time-place learning numerical comparison digital? Analogical and
task. Journal of Experimental Psychology: Animal symbolic effects in two-digit number compari-
Behavior Processes, 2 3 (2), 23 2–247. son. Journal of Experimental Psychology: Human
Catania, A. C. (1 963 ). Concurrent performances: Perception and Performance, 1 6, 626–641 .
A baseline for the study of reinforcement mag- Feigenson, L., Carey, S., & Hauser, M. (2002).
nitude. Journal of the Experimental Analysis of The representations underlying infants’ choice
Behavior, 6, 299–3 00. of more: Object files versus analog magnitudes.
Church, R. M., & Meck, W. H. (1 984). The Psychological Science, 1 3 , 1 5 0–1 5 6.
numerical attribute of stimuli. In H. L. Roit- Feigenson, L., Carey, S., & Spelke, E. (2002). In-
blatt, T. G. Bever, & H. S. Terrace (Eds.), An- fants’ discrimination of number vs. continuous
imal Cognition (pp. 445 –464). Hillsdale, NJ: extent. Cognitive Psychology, 44(1 ), 3 3 –66.
Erlbaum. Fuson, K. (1 988). Children’s Counting and Con-
Church, R. M., Meck, W. H., & Gibbon, J. (1 994). cepts of Number. New York: Springer Verlag.
Application of scalar timing theory to individ- Gallistel, C. R. (1 990). The Organization of Learn-
ual trials. Journal of Experimental Psychology: ing. Cambridge, MA: Bradford Books/MIT
Animal Behavior Processes, 2 0(2), 1 3 5 –1 5 5 . Press.
Clearfield, M., & Mix, K. (1 999). Number ver- Gallistel, C. R., & Gelman, R. (1 992). Preverbal
sus contour length in infants’ discrimination of and verbal counting and computation. Cogni-
small visual sets. Psychological Science, 1 0, 408– tion, 44, 43 –74.
41 1 . Gallistel, C. R., & Gelman, R. (2000). Non-verbal
Cordes, S., Gallistel, C. R., Gelman, R., & numerical cognition: From reals to integers.
Latham, P. E. (under review 2004). Nonverbal Trends in Cognitive Sciences, 4, 5 9–65 .
arithmetic in humans. Gallistel, C. R., & Gibbon, J. (2000). Time, rate
Cordes, S., Gelman, R., Gallistel, C. R., & and conditioning. Psychological Review, 1 07,
Whalen, J. (2001 ). Variability signatures distin- 289–3 44.
guish verbal from nonverbal counting for both Gallistel, C. R., King, A., & McDonald, R. (2004).
large and small numbers. Psychonomic Bulletin Sources of variability and systematic error in
and Review, 8(4), 698–707. mouse timing behavior. Journal of Experimental
Davis, H., & Pérusse, R. (1 988). Numerical com- Psychology: Animal Behavior Processes, 3 0, 3 –
petence in animals: Definitional issues, current 1 6.
evidence, and a new research agenda. Behav- Gallistel, C. R., Mark, T. A., King, A., & Latham,
ioral and Brain Sciences., 1 1 , 5 61 –61 5 . P. E. (2001 ). The rat approximates an ideal de-
Dehaene, S. (1 997). The Number Sense. Oxford, tector of changes in rates of reward: Implica-
UK: Oxford University Press. tions for the law of effect. Journal of Experi-
Dehaene, S. (2001 ). Subtracting pigeons: Loga- mental Psychology: Animal Behavior Processes,
rithmic or linear? Psychological Science, 1 2 (3 ), 2 7, 3 5 4–3 72.
244–246. Geary, D. C., Widman, K. F., & Little, T. D.
Dehaene, S. (2002). Single neuron arithmetic. Sci- (1 986). Cognitive addition and mulitplication:
ence, 2 97, 1 65 2–1 65 3 . Evidence for a single memory network. Mem-
Dehaene, S., & Changeux, J. P. (1 993 ). Develop- ory and Cognition, 1 4, 478–487.
ment of elementary numerical abilities: A neu- Gelman, R. (1 972). Logical capacity of very
ronal model. Journal of Cognitive Neuroscience, young children: Number invariance rules.
5 , 3 90–407. Child Development, 43 , 75 –90.
Dehaene, S., & Cohen, L. (1 994). Dissociable Gelman, R. (1 977). How young children reason
mechanisms of subitizing and counting: Neu- about small numbers. In N. J. Castellan, D. B.
ropsychological evidence from simultanagnos- Pisoni & G. R. Potts (Eds.), Cognitive Theory
tic patients. Psychological Review, 2 0, 95 8– (Vol. 2, pp. 27–61 ). Hillsdale, NJ: Erlbaum.
975 . Gelman, R. (1 993 ). A rational-constructivist ac-
Dehaene, S., Dehaene-Lambertz, G., & Cohen, count of early learning about numbers and ob-
L. (1 998). Abstract representations of numbers jects. In D. Medin (Ed.), Learning and Motiva-
in the animal and human brain. Trends in Neu- tion. (Vol. 3 0, pp. 61 –96). New York: Academic
rosciences, 2 1 , 3 5 5 –3 61 . Press.
Gelman, R., & Brenneman, K. (1 994). First prin- Hinrichs, J. V., Yurko, D. S., & Hu, J. M. (1 981 ).
ciples can support both universal and culture- Two-digit number comparison: Use of place
specific learning about number and music. information. Journal of Experimental Psychol-
In L. Hirschfeld & S. Gelman (Eds.), Map- ogy. Human Perception and Performance, 7, 890–
ping the Mind: Domains, Culture and Cognition 901 .
(pp. 3 69–3 90). Cambridge, UK: Cambridge Jensen, E. M., Reese, E. P., & Reese, T. W. (1 95 0).
University Press. The subitizing and counting of visually pre-
Gelman, R., & Cordes, S. A. (2001 ). Counting in sented fields of dots. The Journal of Psychology,
animals and humans. In E. Dupoux (Ed.), Lan- 3 0, 3 63 –3 92.
guage, Brain, and Cognitive Development: Essays Kahneman, D., Treisman, A., & Gibbs, B. J.
in honor of Jacques Mehler (pp. 279–3 01 ). Cam- (1 992). The reviewing of object files: Object-
bridge, MA: MIT Press. specific integration of information. Cognitive
Gelman, R., & Gallistel, C. R. (1 978). The Child’s Psychology, 2 4, 1 75 –21 9.
Understanding of Number. Cambridge, MA: Kaufman, E. L., Lord, M. W., Reese, T. W., &
Harvard University Press. Volkman, J. (1 949). The discrimination of vi-
Gelman, R., & Greeno, J. G. (1 989). On the na- sual number. American Journal of Psychology,
ture of competence: Principles for understand- 62 , 498–5 25 .
ing in a domain. In L. B. Resnick (Ed.), Know- Keller, J. V., & Gollub, L. R. (1 977). Duration and
ing and Learning: Issues for a Cognitive Science rate of reinforcement as determinants of con-
of Instruction. Hillsdale, NJ: Erlbaum. current responding. Journal of the Experimental
Gelman, R., & Williams, E. (1 998). Enabling Analysis of Behavior, 2 8, 1 45 –1 5 3 .
constraints on cognitive development. In D. Killeen, P. R., & Taylor, T. (2001 ). How the
Kuhn & R. Siegler (Eds.), Cognition, Percep- propagation of error through stochastic coun-
tion and Language. Vol. 2 . Handbook of Child ters affects time discrimination and other psy-
Psychology. 5 th ed. (pp. 5 75 –63 0). New York: chophysical judgments. Psychological Review,
Wiley. 1 07, 43 0–45 9.
Gibbon, J. (1 977). Scalar expectancy theory and King, A. S., McDonald, R., & Gallistel, C. R.
Weber’s law in animal timing. Psychological Re- (2001 ). Screening for mice that remember in-
view, 84, 279–3 3 5 . correctly. International Journal of Comparative
Gibbon, J., & Church, R. M. (1 981 ). Time left: Psychology, 1 4, 23 2–25 7.
Linear versus logarithmic subjective time. Jour- Klahr, D., & Wallace, J. G. (1 973 ). The role of
nal of Experimental Psychology: Animal Behavior quantification operators in the development
Processes, 7(2), 87–1 07. of conservation. Cognitive Pschology, 4, 3 01 –
Harper, D. G. C. (1 982). Competitive foraging in 3 27.
mallards: Ideal free ducks. Animal Behaviour, Krantz, D., Luce, R. D., Suppes, P., & Tversky, A.
3 0, 5 75 –5 84. (1 971 ). The Foundations of Measurement. New
Hartnett, P., & Gelman, R. (1 998). Early under- York: Academic Press.
standings of numbers: Paths or barriers to the Krantz, D. H. (1 972). Measurement structures
construction of new understandings? Learning and psychological laws. Science, 1 75 , 1 427–
and Instruction, 8(4), 3 41 –3 74. 1 43 5 .
Hauser, M. D. (2001 ). What do animals think Leon, M. I., & Gallistel, C. R. (1 998). Self-
about numbers? In P. Sherman, J. Alcock, stimulating rats combine subjective reward
(Eds.), Exploring Animal Behavior: Readings magnitude and subjective reward rate multi-
from American Scientist (3 rd ed.) Research plicatively. Journal of Experimental Psychology:
Triangle Park, NC: Sigma Xi. Animal Behavior Processes, 2 4(3 ), 265 –277.
Herrnstein, R. J. (1 961 ). Relative and absolute Lipton, J. S., & Spelke, E. S. (2003 ). Origins
strength of response as a function of frequency of number sense: Large-number discrimination
of reinforcement. Journal of the Experimental in human infants. Psychological Science, 1 4(5 ),
Analysis of Behavior, 4, 267–272. 3 96–401 .
Herrnstein, R. J., & Vaughan, W. J. (1 980). Me- Luce, R. D. (1 990). What is a ratio in ratio scal-
lioration and behavioral allocation. In J. E. R. ing. In S. J. Bolanowski & G. A. Gescheider
Staddon (Ed.), Limits to Action: The Alloca- (Eds.), Ratio Scaling of Psychological Magnitude:
tion of Individual Behavior (pp. 1 43 –1 76). New In Honor of the Memory of S. S. Stevens. Hills-
York: Academic Press. dale, NJ: Erlbaum.
Mandler, G., & Shebo, B. J. (1 982). Subitizing: ior chain: Effects of error contingencies. Learn-
An analysis of its component processes. Journal ing and Motivation, 2 , 3 86–41 4.
Experimental Psychology General, 1 1 , 1 –22. Pylyshyn, Z. W., & Storm, R. W. (1 988). Tracking
Mechner, F. (1 95 8). Probability relations within multiple independent targets: Evidence for a
response sequences under ratio reinforcement. parallel tracking system. Spatial Vision, 3 , 1 79–
Journal of the Experimental Analysis of Behavior, 1 97.
1 , 1 09–1 22. Rumbaugh, D. M., Savage-Rumbaugh, S., &
Meck, W. H., & Church, R. M. (1 983 ). A mode Hegel, M. T. (1 987). Summation in the chim-
control model of counting and timing pro- panzee (Pan troglodytes). Journal of Experimen-
cesses. Journal of Experimental Psychology: Ani- tal Psychology: Animal Behavior Processes, 1 3 ,
mal Behavior Processes, 9, 3 20–3 3 4. 1 07–1 1 5 .
Menninger, K. (1 969). Number Words and Num- Rumbaugh, D. M., & Washburn, D. A. (1 993 ).
ber Symbols. Cambridge, MA: MIT Press. Counting by chimpanzees and ordinality judg-
Miller, K. F., Perlmutter, M., & Keating, D. ments by macaques in video-formatted tasks.
(1 984). Cognitive arithmetic: Comparison of In S. T. Boyesen & E. J. Capaldi (Eds.), The
operations. Journal of Experimental Psychology. Development of Numerical Competence: Animal
Learning, Memory, and Cognition, 1 0, 46–60. and Human Models (pp. 87–1 06). Hillsdale, NJ:
Mix, K. S., Huttenlocher, J., & Levine, S. C. Erlbaum.
(2002). Quantitative Development in Infancy Savastano, H. I., & Miller, R. R. (1 998). Time as
and Early Childhood. Oxford, UK: Oxford Uni- content in Pavlovian conditioning. Behavioural
versity Press. Processes, 44(2), 1 47–1 62.
Moyer, R. S., & Landauer, T. K. (1 967). Time re- Scholl, B. J., & Pylyshyn, Z. (1 999). Tracking mul-
quired for judgments of numerical inequality. tiple items through occlusion: Clues to visual
Nature, 2 1 5 , 1 5 1 9–1 5 20. objecthood. Cognitive Psychology, 3 8, 25 9–290.
Moyer, R. S., & Landauer, T. K. (1 973 ). Deter- Simon, T. J. (1 999). The foundations of numerical
minants of reaction time for digit inequality thinking in a brain without numbers. Trends in
judgments. Bulletin of the Psychonomic Society, Cognitive Sciences, 3 (1 0), 3 63 –3 64.
1 , 1 67–1 68. Spelke, E., & Dehaene, S. (1 999). Biological foun-
Noel, M. P. (2001 ). Numerical cognition. In dations of numerical thinking. Trends in Cogni-
B. Rapp (Ed.), The Handbook of Cognitive tive Sciences, 3 (1 0), 3 65 –3 66.
Neuropsychology: What Deficits Reveal About Stevens, S. S. (1 95 1 ). Mathematics, measure-
the Human Mind (pp 495 –5 1 8). Philadelphia ment and psychophysics. In S. S. Stevens (Ed.),
Psychology Press. Handbook of Experimental Psychology (pp. 1 –
Olthof, A., Iden, C. M., & Roberts, W. A. (1 997). 49). New York: Wiley.
Judgments of ordinality and summation of Stevens, S. S. (1 970). Neural events and the
number symbols by squirrel monkeys. Jour- psychophysical law. Science, 1 70(3 962), 1 043 –
nal of Experimental Psychology: Animal Behavior 1 05 0.
Processes, 2 3 (3 ), 3 25 –3 3 9. Strauss, M. S., & Curtis, L. E. (1 984). Develop-
Olthof, A., & Roberts, W. A. (2000). Summa- ment of numerical concepts in infancy. In C.
tion of symbols by pigeons (Columbia livia): Sophian (Ed.), Origins of Cognitive Skills (pp.
The importance of number and mass of re- 1 3 1 –1 5 5 ). Hillsdale, NJ: Erlbaum.
ward items. Journal of Comparative Psychology, Sulkowski, G. M., & Hauser, M. D. (2001 ). Can
1 1 4(2), 1 5 8–1 66. rhesus monkeys spontaneously subtract? Cog-
Parkman, J. M. (1 971 ). Temporal aspects of digit nition, 79(3 ), 23 9–262.
and letter inequality judgments. Journal of Ex- Taves, E. H. (1 941 ). Two mechanisms for the per-
perimental Psychology, 91 , 1 91 –205 . ception of visual numerousness. Archives of Psy-
Piazza, M., Giacomini, E., Le Bihan, D., & De- chology, 3 7(No. 265 ).
haene, S. (2003 ). Single-trial classification of Terrace, H. S., Son, L. K., & Brannon, E. M.
parallel pre-attentive and serial attentive pro- (2003 ). Serial expertise of Rhesus macaques.
cesses using functional magnetic resonance Psychological Science, 1 4, 66–73 .
imaging. Proceedings of the Royal Society (Lon- Uller, C., Carey, S., Huntley-Fenner, G., & Klatt,
don) Series B, 2 70, 1 23 7–1 245 . L. (1 999). What representations might under-
Platt, J. R., & Johnson, D. M. (1 971 ). Localiza- lie infant numerical knowledge? Cognitive De-
tion of position within a homogeneous behav- velopment, 1 4(1 ), 1 –3 6.
Uller, C., Hauser, M., & Carey, S. (2001 ). Whalen, J., & West, V. (2001 ). Why you shouldn’t
Spontaneous representation of number in count on subitizing: Evidence from counting
cotton-top tamarins (Saguinus oedipus). Jour- and estimating. Psychomics Society Abstracts, 6,
nal of Comparative Psychology, 1 1 5 (3 ), 248– p. 66, No 45 3 .
25 7. Woodworth, R. S., & Schlosberg, H. (1 95 4).
Washburn, D. A., & Rumbaugh, D. M. (1 991 ). Experimental Psychology: Revised Edition. New
Ordinal judgments of numerical symbols by York: Holt, Rinehart & Winston.
macaques (Macaca mulatta). Psychological Sci- Wynn, K. (1 990). Children’s understanding of
ence, 2 , 1 90–1 93 . counting. Cognition, 3 6, 1 5 5 –1 93 .
Whalen, J. (1 997). The Influence of Semantic Rep- Wynn, K. (1 992a). Addition and subtraction by
resentations of Number on Arithmetic Fact Re- human infants. Nature, 3 5 8, 749–75 0.
trieval. Unpublished dissertation, Johns Hop- Wynn, K. (1 992b). Children’s acquisition of the
kins, Baltimore. number words and the counting system. Cog-
Whalen, J., Gallistel, C., & Gelman, R. (1 999). nitive Psychology, 2 4, 220–25 1 .
Nonverbal counting in humans: The psy- Xu, F., & Spelke, E. S. (2000). Large number dis-
chophysics of number representation. Psycho- crimination in 6-month-old infants. Cognition,
logical Science, 1 0(2), 1 3 0–1 3 7. 74(1 ), B1 –B1 1 .
CHAPTER 2 4
Effects of Aging on Reasoning
Timothy A. Salthouse
This chapter reviews empirical research on tion of the relevant literature, but limitations
adult age differences in reasoning. It is im- of space preclude comprehensive coverage
portant to begin with three disclaimers, of all of the research related to the topic of
however. First, although many types of rea- aging and reasoning. A more inclusive review
soning have been identified (e.g., deductive, of the earlier literature on this topic can be
inductive, analogical, and visuospatial; see found in Salthouse (1 992a).
articles in this volume by Evans, Chap. 8; Some of the most convincing data on
Sloman & Lagnado, Chap. 5 ; Buehner & the relations between age and reasoning are
Cheng, Chap. 7; Holyoak, Chap. 6; and those derived from standardized tests be-
Tversky, Chap. 1 0), few age-comparative cause the variables were designed to opti-
studies have included more than two or mize psychometric properties such as sensi-
three different reasoning variables and, as a tivity, reliability, and construct validity, and
result, there is little evidence for distinctions the normative samples have typically been
among various types of reasoning in stud- moderately large and selected to be rep-
ies of aging. Different reasoning tasks there- resentative of the general population (see
fore are considered together in this chap- Sternberg, Chap. 3 1 , for discussion of intelli-
ter, although it is recognized that combining gence tests). Three recent cognitive test bat-
them in this manner may be obscuring po- teries have each included at least two mea-
tentially important distinctions. Second, the sures of reasoning. The tests included in the
discussion is limited to reasoning tasks with Kaufman Adult Intelligence Test (Kaufman
minimal involvement of knowledge. Because & Kaufman, 1 993 ) were described on page 6
knowledge is likely relevant in most every- of the test manual in the following manner:
day reasoning, the tasks discussed may refer Logical Steps – “Examinees attend to logical
to only a subset of real-life reasoning. The premises presented both visually and aurally,
third disclaimer is that most of the discus- and then respond to a question making use of
sion refers to research derived from my lab- the logical premises;” and Mystery Codes –
oratory. This obviously represents only a por- “Examinees study the identifying codes
589
0.5
0.0
Young Adult SD Units
-0.5
-1.0
-1.5
KAIT- Logical Steps

-2.0 KAIT- Mystery Codes
WAIS III-Similarities
WAIS III - Matrix Reasoning
-2.5 WJ III - Concept Formation
WJ III - Analysis Synthesis
-3.0
20 40 60 80
Chronological Age
Figure 2 4.1 . Relations of reasoning performance to age in variables from
standardized tests. Sample sizes were 1 ,3 5 0 for the Kaufman Adolescent and
Adult Intelligence Test (KAIT), 2,05 0 for the Wechsler Adult Intelligence Scale
(WAIS) III, and 2,5 05 for the Woodcock Johnson (WJ) III.
associated with a set of pictorial stimuli scale, and a convenient scale for this purpose
and then figure out the code for a novel is standard deviation units. (These particu-
pictorial stimulus.” Two reasoning tests in- lar variables could have been expressed in
cluded in the latest version of the Wechsler units of percentage correct, but that scale is
test battery, the Wechsler Adult Intelligence not as widely applicable because, for exam-
Scale III (Wechsler, 1 997) were described in ple, it is not meaningful when the variables
Table 24.1 of the Administration and Scoring are measured in units of time.) The manuals
Manual as follows: Similarities – “A series of for these tests did not present the norma-
orally presented pairs for which the exam- tive data in a form that would allow con-
inee explains the similarity of the common version of the scores to standard deviation
objects or concepts they represent;” and Ma- units of the total sample. However, it was
trix Reasoning – “A series of incomplete grid- possible to express the scores in standard de-
ded patterns that the examinee completes viations of a young adult group, which has
by pointing to or saying the number of the the advantage that the magnitude of the age-
correct response from five possible choices.” related effect can be expressed relative to the
Finally, two reasoning tests included in peak level of performance achieved across all
the Woodcock–Johnson III (Woodcock, ages. Age relations in the six reasoning tests
McGrew, & Mather, 2001 ) battery were de- just described therefore are portrayed in Fig-
scribed in Table 4.2 of the Examiner’s Man- ure 24.1 in standard deviation units of a ref-
ual as follows: Concept Formation – “Identi- erence group of young adults.
fying, categorizing, and determining rules;” Examination of the figure reveals that all
and Analysis–Synthesis – “Analyzing puzzles of the variables exhibit the same trend of
(using symbolic formulations) to determine lower performance with increased age. In
missing components.” particular, for most of the variables, the av-
To allow across-variable comparisons, the erage seventy-year-old is performing about
variables must be converted into the same one standard deviation below the average
effects of aging on reasoning 591
Matrix Reasoning Analytical Reasoning

Jason and Jessica are planning a dinner party and have
invited six guests: Mark and Meredith, Christopher and
Courtney, and Shawn and Samantha. Their table seats
three people on each side and one at each end. In planning
the seating arrangements they need to: have Jason and
Jessica sit at opposite ends of the table; place Christopher
at a corner with no one on his left; not have Mark seated
next to Samantha; and have Courtney seated next to Meredith.
Which of the following is an acceptable arrangement of

diners along one side of the table?
• Jason, Samantha, Mark
• Christopher, Jessica, Shawn
• Mark, Courtney, Samantha
• Meredith, Shawn Courtney
• Shawn,Christopher,Meredith
Series Completion Integrative Reasoning
F and G do the SAME

13 – 15 – 20 – 28 – 39 - ??? E and F do the OPPOSITE
G and H do the OPPOSITE
If E increases will H decrease?
Figure 2 4.2 . Examples of problems in four different reasoning tasks used in studies by Salthouse and
colleagues. See text for details.
of the young adults. The age trends are not mine the item that provides the best contin-
completely uniform because the age effects uation of the sequence of items. In analyt-
appear to be later and smaller for some ical reasoning tasks, the examinee uses the
variables (e.g., Similarities) than for other presented information to determine which
variables (e.g., Analysis–Synthesis). How- of several alternatives best satisfies the spec-
ever, it is important to note that there is ified constraints. Finally, examinees in inte-
also considerable across-sample variation be- grative reasoning tasks use the information
cause the age gradients are shallower for in the premises to answer a question about
both Wechsler subtests (i.e., Similarities and the relation between two of the variables.
Matrix Reasoning) than for the subtests from Although no formal evidence is available,
the other batteries. it seems likely that these four tests repre-
Relations between age and measures of sent somewhat different types of reasoning,
reasoning can also be illustrated with four and they certainly involve different require-
reasoning tasks used in several studies in my ments and types of material.
laboratory. Examples of problems in each Because the tasks were each administered
type of task are portrayed in Figure 24.2. in two or more studies from my laboratory,
In matrix reasoning tasks (such as Raven’s the data have been combined across stud-
Progressive Matrices, Raven, 1 962), the ex- ies. The research participants in the studies
aminee attempts to select the best comple- were all similar in that they ranged from 1 8
tion of the missing cell from the alternatives to over 80 years of age, had an average of
presented below the matrix. The goal in se- between 1 4 and 1 7 years of education, and
ries completion tasks (such as the Shipley generally reported themselves to be in good
Abstraction Test, Zachary, 1 986) is to deter- to excellent health.
0.5
0.0
Young Adult SD Units
-0.5
-1.0
Matrix Reasoning (N = 1976)

-1.5 Series Completion (N = 1283)
Analytical Reasoning (N = 1160)
Integrative Reasoning (N = 985)
-2.0
20 40 60 80
Chronological Age
Figure 2 4.3. Means and standard errors of performance in four different reasoning tasks as a function
of age. Data from various studies by Salthouse and colleagues.
Age relations in these four tasks are por- among letters in reverse alphabetical se-
trayed in Figure 24.3 in the same format used quence, or among words with particular
to display results of the tests from the psy- semantic relations. Additional support for
chometric test batteries. It can be seen that this differential-involvement-of-knowledge
the pattern with these data closely resembles interpretation of the different age trends is
that from the normative samples in the stan- provided by the correlations of the reasoning
dardized test batteries. In particular, there variables with a composite vocabulary vari-
is an approximately linear decline in perfor- able, as the correlations were 0.3 7 for ma-
mance with increased age, such that the av- trix reasoning, 0.23 for analytical reasoning,
erage at age seventy is about one standard 0.24 for integrative reasoning, and 0.66 for
deviation below the average of the reference series completion.
group of young adults. Although not apparent in Figures 24.1
The age relations for three of the vari- and 24.3 , other results indicate that the age
ables in Figure 24.3 were nearly identi- relations on variables assessing reasoning are
cal, but the age function was shallower for as large or, in some cases, even larger than
the series completion variable. This may be the age relations on other types of cogni-
because several items in the Shipley Ab- tive variables. For example, Verhaeghen and
straction series completion test (from which Salthouse (1 997) reported a meta-analysis
these data were derived) have considerable in which the weighted correlation (based on
reliance on verbal knowledge, which tends 9,3 42 individuals across thirty-eight studies)
to be relatively well preserved across this between age and measures of reason-
age range. For example, some of the items ing was −.40, and the weighted corre-
in that test involve determining relations lation (based on 5 ,871 individuals across
twenty-nine studies) between age and mea- tigated in this manner are comprehension,
sures of episodic memory was −0.3 3 . Fur- speed, strategy, and working memory. Em-
thermore, in analyses to be described later, pirical research relevant to each of these po-
the correlations between age and factor tential determinants is briefly summarized in
scores were very similar for factors based this section.
on memory (r = −0.48) and reasoning (r =
−0.49) variables. Comprehension
Despite their similar magnitude, age dif-
It is conceivable that at least some of the
ferences in reasoning are not as widely rec-
age differences in reasoning are simply
ognized as age differences in memory. A
attributable to greater difficulties associated
possible reason may be that considerable
with increased age in understanding exactly
knowledge is required in many everyday sit-
what is required to perform the task suc-
uations that involve reasoning, such that any
cessfully. This is an important possibility to
age effects might not be noticed either be-
consider because age differences in reasoning
cause of a large positive relation between age
would probably not be of much theoretical
and knowledge, or because any deficiencies
interest if they merely reflected comprehen-
are attributed to lack of relevant knowledge
sion problems.
instead of to problems of reasoning.
The primary means by which the com-
The primary question in light of age
prehension interpretation has been inves-
differences such as those apparent in Fig-
tigated restricted comparisons to individu-
ures 24.1 and 24.3 is, “What is responsible
als for whom there is evidence that they
for the large negative relations between age
understood the task requirements. For ex-
and performance on measures of reasoning?”
ample, participants with accuracy less than
Much of the research that has been con-
some criterion value have been excluded
ducted to address this question can be clas-
from the analyses in integrative reasoning
sified into one of two broad categories. One
(Salthouse, 1 992b, 1 992c) and matrix rea-
category consists of investigations of the in-
soning (Salthouse & Skovronek, 1 992) tasks,
fluence of factors such as comprehension,
and analyses have been restricted to partici-
speed, strategy, and working memory on
pants with correct responses on the first two
the age differences in the performance of a
items in the matrix reasoning (Salthouse,
particular reasoning task. The second cate-
1 993 ) task. In each of these cases, strong neg-
gory of research has involved examining age-
ative age relations were evident among the
related effects on measures of reasoning in
participants who understood the tasks well
the context of age-related effects on other
enough to answer several problems correctly.
cognitive abilities. In the remaining sections
These results therefore suggest that age dif-
of this chapter, the two approaches are illus-
ferences in simple comprehension proba-
trated with research from my laboratory.
bly are not responsible for much, if any,
of the age differences observed in measures
of reasoning.
Process-Oriented Research
Speed
The majority of the empirical research in the Another relatively uninteresting possibility
area of cognitive aging has focused on a sin- is that age differences in measures of rea-
gle cognitive variable (with different stud- soning might merely reflect a slower rate
ies concentrating on different variables), and of reading or of responding, without any
has attempted to determine the relative con- detrimental effects on the quality of perfor-
tribution of different processes to the age dif- mance. Because effects of age-related slow-
ferences on that particular variable. Among ing have been extensively documented (e.g.,
the potential determinants of age differences Salthouse, 1 996a), it is important to con-
in reasoning variables that have been inves- sider whether age differences in reasoning
might be attributable to slower peripheral 1 994) tasks, and on the accuracy of early
processes associated with encoding or re- items in matrix reasoning and analytical rea-
sponding to the information. soning tests that were attempted by every-
One way in which the role of slower rates one (Salthouse, 2000, 2001 ).
of input or output has been investigated in- Taken in combination, the results just de-
volves examining age relations on reason- scribed suggest that adult age differences
ing tasks administered under untimed, or in reasoning are not simply attributable to
self-paced, conditions. Most of the com- slower rates of reading or responding. The
parisons have revealed significant age dif- speed of internal mental operations may be
ferences even when the participants are a factor in some of the performance differ-
allowed to control the duration of the stim- ences (see Salthouse, 1 996a), but because
ulus presentation, and take as long as they sizable age differences in accuracy are found
want to respond. Age differences in deci- when there are no external time constraints,
sion accuracy under these conditions have the differences do not appear to be solely the
been found in geometric analogies (Salt- result of slower rates of input or output.
house, 1 987), series completion (Salthouse
& Prill, 1 987), matrix reasoning (Salthouse,
Strategy
1 993 ; 1 994; Salthouse & Skovronek, 1 992),
and integrative reasoning (Salthouse, 1 992c; One of the most popular interpretations of
Salthouse et al., 1 989; 1 990) tasks, and in age differences in cognitive functioning, at
the Wisconsin Card Sorting Test (WCST; least in part because it implies that the age
Salthouse et al., 1 996; Fristoe, Salthouse, & differences might be amenable to interven-
Woodard, 1 997; Salthouse et al., 2003 ). tion, attributes them to the use of different
The role of speed on age differences in strategies by adults of different ages. It is im-
matrix reasoning was examined more closely portant to consider two issues when evalu-
in two studies by Salthouse (1 994) by ob- ating this distinction: whether or not adults
taining separate measures of study time, de- of different ages actually do use different
cision time, and decision accuracy from each strategies when performing the task and, if
participant. Not only were significant age so, what is responsible for those differences.
differences found on each measure, but anal- Information about the existence of pos-
yses revealed that some of the age-related ef- sible strategy differences has been obtained
fects on the decision accuracy measure were by examining the distribution of study times
statistically independent of the age-related across different parts of the reasoning prob-
effects on the study time and decision time lem. For example, the research participant
measures. At least in this project, therefore, could be instructed to press a key to view
older adults took longer than younger adults each element of the problem, and then the
to work on the problems and to communi- time between successive keystrokes could be
cate their decisions, and their decisions were recorded to determine the time devoted to
less accurate. inspecting or studying each element. Vari-
A second method of investigating the role ants of this method have been used in a num-
of limited time on age differences in reason- ber of reasoning tasks with comparable out-
ing involves examining age differences in the comes. Specifically, the relative distribution
percentage of items answered correctly only of inspection or study times has been found
for attempted items, as inferred by the pres- to be similar in young and old adults in se-
ence of an overt response. Strong negative ries completion (Salthouse & Prill, 1 987), in-
age relations have been found even when tegrative reasoning (Salthouse et al., 1 990),
only attempted items were considered in in- and geometric analogies (Salthouse 1 987).
tegrative reasoning (Salthouse, 1 992b), ge- To the extent that relative time allocation
ometric analogies (Salthouse, 1 992b), and across different elements of a problem can
matrix reasoning (Salthouse, 1 991 ; 1 993 ; be considered as evidence of a particular
strategy, therefore, these results imply that Compared with young adults, older adults
young and old adults were using a similar had lower percentages of both types of be-
strategy. havior, and statistical control of a compos-
Additional evidence relevant to the strat- ite measure of feedback usage reduced the
egy interpretation of age-related differences age-related variance in a measure of WCST
in reasoning is based on an examination of performance by 74%. These results clearly
possible age differences in the pattern of in- indicate that the young and old adults in this
correct alternatives selected when choosing study performed the task in a somewhat dif-
a response. The rationale is that adults of different fashion and that the difference was
ferent ages might be expected to differ in related to success in the task. However, be-
the frequency of selecting particular incor- cause there was no evidence that the older
rect alternatives if they were relying on dif- adults were as capable as the young adults of
ferent rules or strategies to select their an- performing in the same optimal manner, it
swers. However, no age differences in the is questionable whether the differences ob-
relative percentages of different types of er- served in the way the task was performed
rors in a matrix reasoning task were found should be considered evidence for differ-
by Salthouse (1 993 ; also see Babcock, 2002), ences in strategy, which has a voluntary or
which suggests that adults of different ages optional connotation.
were probably using the same strategies but Although only a limited amount of rele-
that the effectiveness of the strategies was vant evidence is currently available, it does
lower with increased age. not appear that much, if any, of the age-
Finally, a study by Fristoe, Salthouse, & related differences in reasoning can be ex-
Woodard (1 997) was designed to investigate plained by differences in the strategies used
the manner in which young and old adults to perform the task. Furthermore, it is im-
performed the WCST. The WCST is a con- portant to recognize that, even if evidence
cept identification test in which the stim- of strategy differences were available, inter-
uli consist of cards that vary in the number, pretations based on strategy differences are
color, and shape of objects. An unusual fea- likely to be somewhat ambiguous unless an
ture of the test is that the rule (i.e., num- explanation is also provided for why peo-
ber, color, or shape) used to determine the ple of different ages used different strate-
correct sorting of the cards changes after ev- gies. That is, if strategy differences were to
ery 1 0 correct sorts without informing the be found, a critical question is whether the
participant. The participants in the Fristoe, most effective or optimal strategy is still fea-
Salthouse, & Woodard (1 997) study were sible for older adults but not used for some
asked to indicate the dimension that they reason, or whether older adults are less able
were using in making their decisions about to use the more powerful or optimal strategy
how to sort stimulus cards. By combining than young adults. As a result, a difference
this information with the responses selected in strategy might be viewed merely as a dif-
and the feedback received after each referent level of description, such that if age
sponse, it was possible to determine the per- differences were to be found, they would
centage of times each participant maintained still need to be explained, just as would
the same hypothesis after receiving positive age differences in measures of overall task
feedback (i.e., “win–stay”), and the percent- performance.
age of times he or she changed hypothe-
ses after receiving negative feedback (i.e.,
Working Memory
“lose–shift”).
Optimal performance in this type of An interpretation that has generated consid-
feedback-based concept identification situ- erable interest, particularly since a provoca-
ation would be manifested in high percent- tive article by Kyllonen and Christal (1 990)
ages of “win–stay” and “lose–shift” behavior. that reported a very strong relation between
measures of working memory (WM) and memory until it is needed. (In these partic-
measures of reasoning (see Morrison, Chap. ular studies, the task was administered on a
1 9), is that at least some of the age- computer and only one premise was visible
related differences in reasoning might be at a time.)
attributable to age differences in WM. A consistent finding in each of these stud-
Because WM has been defined as the abil- ies (i.e., Salthouse, 1 992c; Salthouse et al.,
ity to preserve information while processing 1 989; Salthouse et al., 1 990) was that the re-
the same or other information, and because lation of accuracy to the number of premises
many reasoning tasks require that informa- was nearly identical when only one premise
tion be maintained in order for it to be oper- was relevant and when two or more premises
ated upon, interpreting the age differences were relevant. Furthermore, this pattern was
in reasoning as a function of WM has con- similar across adults of all ages. These re-
siderable intuitive plausibility. sults therefore suggest that the primary rea-
One method used to investigate the son why accuracy was lower when the prob-
role of WM in reasoning involves manip- lems contained more premises was related to
ulation of the number of premises pre- the availability of information and not to dif-
sented in integrative reasoning problems. ficulties in integrating relevant information.
The rationale that increasing the number of The fact that the pattern was similar in adults
premises would increase the WM require- of all ages further implies that the age dif-
ments, which might then be expected to ferences in this task are largely attributable
increase the magnitude of the age differ- to differences in the availability of relevant
ences in reasoning performance if at least information.
some of those differences are attributable An additional expectation from the
to WM limitations. Support for this expec- information-availability interpretation is
tation was provided in four independent that age differences should be evident in
studies (Salthouse, 1 992b, 1 992c; Salthouse the shape of the serial position functions
et al., 1 989; Salthouse et al., 1 990). In each relating decision accuracy to sequential
study, reasoning accuracy decreased as the position of the relevant premise. In fact,
number of premises increased, and the mag- Salthouse et al. (1 990) did find that young
nitude of this decrease was greater for older adults exhibited a classical serial position
adults than for young adults. function, with higher accuracy for the more
Another manipulation incorporated in recent premises, whereas the function for
several integrative reasoning studies in- older adults was flat. However, for reasons
volved the presentation of trials in which that are not yet clear, this pattern was not
only one of the premises was relevant to the replicated in a later study by Salthouse
decision. Consider the problem portrayed in (1 992c).
the lower right panel of Figure 24.2, for ex- Manipulation of the number of problem
ample. In the version displayed, all of the elements has also been examined in geomet-
premises are relevant to the decision and ric analogy and matrix reasoning tasks, with
would need to be considered to reach a somewhat different patterns of results. To
valid conclusion. If, instead of referring to illustrate, three studies found that age dif-
variables E and H, the question referred to ferences in measures of decision time, de-
variables E and F, however, all of the infor- cision accuracy, or both, were larger when
mation relevant to the decision would have there were more relevant elements in geo-
been presented in a single premise. These metric analogy problems (Salthouse, 1 987,
“one-relevant” trials are interesting because 1 988, 1 992c). In several studies reported by
no across-premise integration of informa- Salthouse (1 993 ) and in a study by Salthouse
tion is required for a correct decision, and (1 994), however, age differences in a matrix
the major determinant of quality of per- reasoning task were nearly constant across
formance therefore is presumably the abil- increases in the number of relations among
ity to maintain the relevant information in elements, and in none of these studies was
there a significant interaction between age the exact extent of that involvement, and
and number of relations in the problem. Spe- the role of other factors in the age differ-
cific characteristics of the tasks may be re- ences, remain to be determined.
sponsible for the different patterns of re-
sults across integrative reasoning, geometric
analogy, and matrix reasoning tasks, but the Correlational Analyses
exact nature of those characteristics is not
yet known.
The second major approach to investigat-
Another method used to investigate the
ing adult age differences in cognition has re-
role of WM in reasoning involves assess-
lied upon correlational data to attempt to
ing on-line availability of information dur-
specify the number and nature of statisti-
ing the performance of the task. For ex-
cally distinct age-related influences operat-
ample, Salthouse (1 993 ) and Salthouse and
ing on different types of cognitive variables.
Skovronek (1 992) presented a successive
In this section, results relevant to under-
version of the matrix reasoning task in which
standing effects of aging on reasoning based
each matrix cell was numbered. To view a
on mediational, componential, correlated-
cell in the matrix, the participant had to type
factors, and hierarchical structure models are
the corresponding number. In three sepa-
described briefly.
rate studies, older adults were found to ex-
amine the same cell more frequently than
Mediational Models
young adults, as though the information in-
spected earlier was no longer functionally The goal of mediational models is to exam-
available to them. Furthermore, when pre- ine the role of one or more constructs as
sented with probes of information examined potential mediators of the age differences
earlier, older adults were less accurate than in measures of reasoning by means of sta-
young adults in recognizing the contents of tistical adjustment. The rationale is that if
previously viewed cells (Salthouse, 1 993 ). age-related effects on variable Y are at least
A final piece of evidence relevant to the partially attributable to age-related effects
WM interpretation of age differences in rea- on variable X, then statistical control of X
soning is that Salthouse (1 992c) found a should reduce the magnitude of the age-
qualitatively similar pattern of differences related effects on Y. For the purpose of these
between young and old adults, and between analyses, X could be a measure of any factor
young adults with and without a concur- hypothesized to be important in the target
rent memory load (of five random digits). variable, Y. Most of the mediational models
To the extent that a concurrent mem- applied to reasoning have used measures of
ory load is viewed as simulating reduced WM in the role of X because of the assump-
WM capacity, this finding is consistent with tion that reasoning tasks frequently require
the hypothesis that at least some of the that earlier information be preserved when
age differences in the integrative reason- processing later information, and individuals
ing task are attributable to age differences who are better able to do that, as reflected by
in WM. higher scores on WM tasks, therefore would
In summary, results from a number of dif- be expected to perform at higher levels on
ferent types of comparisons in a variety of reasoning tasks.
reasoning tasks lend credibility to the inter- Several studies in my laboratory have re-
pretation that the ability to maintain rele- lied upon two tasks to assess WM. Both
vant information during the performance of require participants to remember informa-
reasoning tasks likely contributes to at least tion while simultaneously processing other
some of the adult age differences in reason- information. In the computation span task,
ing. Although the available evidence sug- for example, arithmetic problems had to be
gests that working memory is probably in- answered while remembering the last digit
volved in the age differences in reasoning, in each problem, and in the listening span
task, questions about a sentence had to be ance after statistical control of WM, with
answered while remembering the last word reductions of 5 7% (Salthouse et al., 1 989),
in each sentence. Measures of performance 88% (Salthouse, 1 992b; also see Salthouse,
in these tasks have been found to exhibit 1 991 ), and 48% (Salthouse, 1 992c) in inte-
good reliability, and to be negatively corre- grative reasoning tasks, 65 % in a geomet-
lated with age. ric analogies task (Salthouse, 1 992b), and
Three sets of results are necessary to 43 % to 84% in matrix reasoning tasks (Salt-
establish the plausibility of a mediational house, 1 993 ). Similar findings have been
interpretation of age-related differences in reported by other researchers with matrix
reasoning. The first is the demonstration of reasoning (Babcock, 1 994) and syllogistic
age-related differences in the expected di- reasoning (Fisk & Sharp, 2002; Gilinsky &
rection in measures of the hypothesized me- Judd, 1 994) tasks. Sizable reductions in the
diator, because a construct cannot mediate age-related differences after control of WM
age differences in other variables or con- have been found even with percentage cor-
structs if it is not related to age. The second rect measures (Salthouse, 1 992b) and on the
necessary result is the existence of a moder- accuracy of individual items in a matrix rea-
ate relation between the hypothesized medi- soning task (Salthouse, 1 993 ). A significant
ator and the target variable it is presumed to relation of WM on two-premise and three-
explain, because no mediation is possible if premise integrative reasoning problems also
the suspected mediator and the target vari- has been found after control of the influence
able are not related to one another. Third, of one-premise problems (Salthouse, 1 992b,
age-related differences in the target variable 1 996b), which implies that WM specifically
should be reduced after statistical control of contributes to the maintenance of informa-
the mediator, with the magnitude of the re- tion needed in more complex problems.
duction serving as an approximate index of This pattern of results clearly is consistent
the degree of mediation. This last result is with the hypothesized influence of WM on
critical because mediation is not plausible age-related differences in reasoning. How-
if the relations of age to the target variable ever, it is important to recognize that com-
are not at least moderately attenuated when parable, and sometimes even larger, reduc-
the variability in the hypothesized mediator tions in the age-related effects in reasoning
is eliminated. have been found after statistical control of
A variety of procedures can be used to other theoretical constructs, such as percep-
statistically control the hypothesized medi- tual speed (e.g., Salthouse, 1 991 , 1 993 , 1 994,
ator, such as partial correlation, semipartial 1 996a). Because most cognitive variables are
correlation (available from hierarchical re- positively correlated with one another, some
gression), analysis of covariance, and so on. attenuation of the age-related effects on one
In each case, the goal is to eliminate the cognitive variable likely would be expected
variance in the target variable that is re- after statistical control of almost any other
lated to the mediator such that relations be- cognitive variable. A discovery of attenuated
tween age and the target variable can be age-related variance after statistical control
examined when differences in the level of of a hypothesized mediator therefore should
the mediator no longer influence the target be considered only necessary, but not suf-
variable. ficient, evidence for the validity of media-
The most relevant comparisons from me- tional hypotheses.
diational analyses of WM on reasoning are
those between the initial age relation on the
Componential Models
reasoning variable and the age–reasoning re-
lation after statistical control of the WM Componential models are more complex
measure. A consistent finding across several than mediational models because they pos-
different types of reasoning tasks has been a tulate that nearly every cognitive task in-
substantial reduction in the age-related vari- volves multiple processes or components,
and that performance of the task is in- tion applied to the initial figure would match
fluenced by the efficiency or effectiveness a comparison figure.
of each component. Componential models A critical prerequisite for a componen-
have been investigated by relying upon the tial analysis is that the pattern of correlations
pattern of correlations among measures of and, specifically, the results from a confirma-
the components and a measure of perfor- tory factor analysis, should provide evidence
mance on the target reasoning task to de- for distinct constructs. That is, only if there
termine the relative contribution of each is evidence that the variables represent sep-
hypothesized component. For example, a re- arate constructs is it meaningful to examine
searcher might postulate that components their relative contributions to the age dif-
A, B, and C are required to perform a partic- ferences in the performance of the criterion
ular task, administer tasks to obtain variables reasoning task. The results of the two stud-
that reflect A, B, and C as directly as possible, ies reported by Salthouse (2001 ) were not
and then examine correlations among the consistent with the existence of three sepa-
variables based on the reasoning tasks and rate factors because all of the variables had
the component tasks. Componential models similar correlations with one another. To il-
can be applied to research on aging by deter- lustrate, the correlation between the two-
mining the degree to which age-related ef- rule identification variables was 0.5 0, and
fects on the target reasoning task are altered their correlations with variables hypothe-
when variability in measures of the compo- sized to reflect the rule application compo-
nents is statistically controlled. nent ranged from 0.48 to 0.62. Because there
Componential models of the matrix rea- was no evidence that the hypothesized com-
soning and analytical reasoning tasks were ponents represented distinct dimensions of
investigated by Salthouse (2001 ), and a individual differences (i.e., exhibited con-
somewhat different componential analysis struct validity), it was impossible in these
of age differences in matrix reasoning was studies to decompose the age differences in
reported by Babcock (1 994). Salthouse hy- the target tasks into discrete components.
pothesized three components were involved There are at least three possible inter-
in each of the tasks: rule identification, pretations of results such as those just de-
rule application, and information integra- scribed. First, the theoretical models may
tion in the matrix reasoning task; and sim- not have been valid because the designated
ple comprehension, information integration, components are not actually required to per-
and condition verification in the analytical form the tasks. Second, the models could
reasoning task. Primarily on the basis of in- have been valid and the components might
tuition and judgments of face validity, two have been relevant to performance on the
variables were selected to represent each target task, but the components were not
hypothesized component. To illustrate, the accurately assessed with the selected tasks.
rule identification component was assessed And third, the models may not have been
by a Figure Classification test, in which ex- valid because the hypothesized components
aminees determine the basis by which differ- do not actually exist as distinct entities. Un-
ent figures are related to one another, and by fortunately, the available data do not allow
a Location test, in which examinees deter- these alternatives to be distinguished. How-
mine the rule governing the position of a set ever, it is worth considering whether a simi-
of Xs in each row of a matrix. The rule ap- lar situation may exist in componential mod-
plication component was assessed with two els of other cognitive tasks but has not been
tasks (i.e., Pattern Transformation and Geo- recognized because there have seldom been
metric Transformation) in which the exami- any attempts to investigate the construct va-
nee views an initial line pattern or geometric lidity of the hypothesized processes or com-
figure, carries out a specified transformation ponents. Results of the Salthouse (2001 )
(such as rotation, subtraction, or addition), project therefore suggest that it is important
and then decides whether the transforma- to obtain empirical evidence of the construct
validity of hypothesized components before tal of 6,828 individuals. The major variables
investigating their role in cognitive tasks. included in the aggregate data set are listed
in Table 24.1 together with the respective
sample sizes and age correlations. Entries
Correlated Factor Models
in the right-most columns in Table 24.1 are
The variables included in mediational and the factor loadings from a confirmatory fac-
componential models typically have been se- tor analysis in which factors corresponding
lected because of their presumed relevance to reasoning, spatial visualization, episodic
to the target variable one is trying to ex- memory, perceptual speed, and vocabulary
plain. An alternative approach based on cor- abilities were postulated. As expected, the
relational data would be to consider the loadings of the variables on the factors all
interrelations among a broad variety of cog- were high, with only four below 0.7, and
nitive variables in terms of some organiza- the factors were moderately correlated with
tional structure and then examine relations one another. A second model examined re-
of age to the target variable within the con- lations between age and each of the ability
text of that structure. factors. These (standardized) relations were
The simplest organizational structure is −0.49 for reasoning, −0.41 for space, −0.48
one in which the variables are grouped into for episodic memory, 0.63 for speed, and
several first-order factors or abilities, with 0.25 for vocabulary.
the factors allowed to correlate with one an- Inspection of the coefficients in the rea-
other. Age-related effects on specific reason- soning column reveals that the matrix rea-
ing variables can be investigated in this type soning and analytical reasoning variables
of correlated-factors structure by determin- both had high loadings on the reasoning fac-
ing the degree to which the age-related ef- tor and therefore can be considered proto-
fects on the target reasoning variable are di- typical reasoning tasks. The contributions of
rect or are indirect and operate through one the five abilities to these two variables there-
or more cognitive abilities. fore were examined by modifying the anal-
The ideal data set for analyses involving ysis to specify relations of each of the five
cognitive abilities would involve a wide va- abilities to these variables. In effect, these
riety of cognitive variables, and as large and analyses are asking what abilities contribute,
diverse a sample of participants as possible. and by how much, to the individual differ-
No single study is likely to possess all of ences in performance of these tests. The top
these characteristics, but an approximation panel of Table 24.2 summarizes results of
to this ideal can be obtained by aggregating these analyses, where it can be seen that,
data across different studies involving differ- as expected, the strongest relation of each
ent combinations of variables. Aggregation variable was with the reasoning factor. How-
of the data in this way essentially treats the ever, it is important to note that each vari-
individuals as though they were participants able also had significant relations with fac-
in a single large study but with missing val- tors representing other cognitive abilities.
ues for the variables that were not collected Both the matrix reasoning and the analytical
in the particular study in which an individ- reasoning variables were positively related
ual participated. Although data with a large to spatial visualization ability and negatively
proportion of missing values can be compli- related to vocabulary ability. This latter re-
cated to analyze, meaningful analyses can be lation is rather puzzling because it suggests
conducted by relying on an algorithm such that, when other relations are taken into con-
as the full information maximum likelihood sideration, people with higher levels of vo-
procedure (e.g., Enders & Bandalos, 2001 ) to cabulary tend to perform somewhat worse
take advantage of all available information. on these reasoning tasks than people with
A combined data set of this type was cre- lower levels of vocabulary.
ated by aggregating data across 3 3 separate This simple structure can be used to esti-
studies from my laboratory involving a to- mate the indirect effects of age on reasoning
Table 2 4.1 . Results of a Confirmatory Factor Analysis on Data Aggregated across Multiple Studies
Factor Loading
Variable N Age r Rea Spc Mem Spd Voc
Matrix reasoning 1 976 −.5 0 .87
Analytical reasoning 1 1 60 −.46 .76
Shipley abstraction 1 283 −.29 .87
Integrative reasoning 985 −.3 5 .62
Figure classification 45 8 −.60 .74
Cattell matrices 420 −.48 .82
Letter sets 1 1 79 −.26 .80
Geometric analogies 75 6 −.3 6 .78
PMA reasoning 3 05 −.41 .86
Grammatical reasoning 229 −.3 5 .80
Series completion 150 −.3 7 .80
Analysis synthesis 204 −.3 6 .79
Power letter series 150 −.47 .93
WCST number of 71 1 −.28 .5 6
categories
Diagramming relations 449 −.40 .76
Locations 449 −.41 .60
Spatial relations 1154 −.3 4 .91
Paper folding 994 −.43 .81
Form boards 847 −.3 8 .80
Surface development 63 9 −.3 2 .72
PMA space 3 05 −.3 9 .76
Block design 463 −.3 9 .89
Object assembly 25 9 −.41 .81
Cube assembly 1 272 −.1 7 .60
Paired associates 1 769 −.3 8 .72
Free recall 1 764 −.42 .84
Logical memory 793 −.24 .72
Free recall of transfer list 1 05 4 −.3 5 .77
Digit symbol 2041 −.5 7 .78
Letter comparison 6082 −.43 .79
Pattern comparison 6082 −.5 2 .82
Cross out 204 −.71 .92
Digit symbol reaction 241 7 −.5 6 .77
time
WAIS vocabulary 795 .1 3 .86
WJ picture vocabulary 795 .3 0 .80
Antonym vocabulary 3 5 09 .1 8 .90
Synonym vocabulary 3511 .27 .89
Shipley vocabulary 25 9 .22 .93
Factor correlations
Reasoning (Rea) – .88 .73 .79 .47
Space (Spc) – .65 .67 .46
Memory (Mem) – .70 .42
Speed (Spd) – .28
Vocabulary (Voc) –
Notes: N = number, Age r = , Rea = reasoning, Spc = space, Mem = memory, Spd = speed, Voc = vocabulary,
PMA = Primary Mental Abilities, WCST = Wisconsin Card Sorting Test, WAIS = Wechsler Adult Intelligence
Scale, WJ = Woodcock–Johnson.
Table 2 4.2 . Loadings of Matrix Reasoning and Analytical Reasoning Variables on Five
Cognitive Abilities
Rea Spc Mem Spd Voc
All
Matrix reasoning .86* .25 * −.06 −.07 −.20*
Analytical reasoning .76* .25 * −.04 −.1 3 −.1 7*
Matrix reasoning
Under age 5 0 .97* .1 8 −.1 1 −.08 −.20*
Age 5 0 and over .79* .3 0* −.02 −.02 −.21 *
Analytical reasoning
Under age 5 0 .91 * .01 −.04 −.04 −.1 0
Age 5 0 and over .5 0* .46* −.04 −.09 −.1 7
*p < .01
Note: None of the coefficients for the under-age-fifty group and the age-fifty-and-over group was significantly
different from one another.
Rea = reasoning, Spc = space, Mem = memory, Spd = speed, Voc = vocabulary.
variables by incorporating information about a significant loss of fit to the data indicated
the relations between age and each ability. that the group differences were not statis-
To illustrate, because the standardized coef- tically significant. It therefore appears from
ficient for the relation from age to the rea- these results that the two reasoning variables
soning ability factor was −0.49, and that represent nearly the same combination of
for the relation between the reasoning fac- abilities at different ages. These particular
tor and the matrix reasoning variable was results should be replicated before reaching
0.87, it can be inferred that −0.43 (i.e., any strong conclusions, but they serve to il-
−0.49 × 0.87) of the total −0.5 0 age effect lustrate how correlational results can be in-
on matrix reasoning (cf. Table 24.1 ) is asso- formative about the possibility of qualitative
ciated with influences through the reasoning differences in performance at different ages.
ability factor.
The correlated-factors structure can also
Hierarchical Structure Models
be used to investigate whether the variables
represent the same constructs to the same The correlated-factors model can be con-
degree at different ages (i.e., the issue of sidered relatively simple because, although
measurement equivalence). The preceding the factors are allowed to correlate with one
analyses therefore were repeated in samples another, there is no attempt to explain the
of adults under and over the age of 5 0, with basis for those correlations in the context
the results summarized in the bottom panels of the model. A somewhat more compli-
of Table 24.2. Inspection of the entries indicated model involves a hierarchical structure
cates that the pattern of ability relations for in which one or more higher-order factors
the matrix reasoning variable was very simi- are postulated to be responsible for the rela-
lar in the two age groups, consisting of a large tions among the first-order factors (Carroll,
positive relation with the reasoning factor, a 1 993 ). An advantage of hierarchical models
small positive relation with the spatial visu- for the investigation of age-related effects is
alization factor, and a small negative relation that they allow broad (on the higher-order
with the vocabulary factor. Although the common factor) and narrow (on the first-
pattern appears somewhat different across order ability factors) age-related influences
the two age groups for the analytical rea- to be examined simultaneously.
soning variable, a direct test in which the A hierarchical analysis was conducted
parameters were constrained to be equal in on the combined data summarized in
the two samples to determine if there was Table 24.1 by examining the relations of age
Age
.68 -.50 -.13
Common
-.30
.86 .66
.97
.86 .71
Voc Space Reas Mem Spd
Predicted: .25 -.43 -.49 -.49 -.63

Observed .25 -.41 -.49 -.48 -.63
Figure 2 4.4. Hierarchical structural model of age relations on different cognitive
abilities based on the data summarized in Table 24.1 . Numbers adjacent to the
arrows are standardized regression coefficients, and numbers in the bottom two
rows are correlations between age and the latent construct directly above.
to a second-order factor representing vari- highest-order factor, a moderate positive in-

ance common to the first-order factors and fluence on the vocabulary factor, and small
to each first-order factor, and then deleting to moderate negative influences on factors
all relations from age that were not signifi- corresponding to speed and memory abil-
cantly different from zero. Because the ag- ities. A very similar pattern recently was
gregation of data from samples with differ- found by Salthouse and Ferrer-Caja (2003 )
ent combinations of variables results in a in analyses of three separate data sets, so
very high proportion of missing values for these results apparently are robust.
most variables, conventional measures of fit The hierarchical structure represented
are not readily available in analyses with this in Figure 24.4, together with the factor
type of data. However, the observed age- loadings presented in Table 24.1 , can be
factor correlations can be compared with used to estimate age-related influences on
those predicted from the parameters of the individual variables. Because the product
model, and inspection of the entries at the of the standardized path coefficients pro-
bottom of Figure 24.4 indicates that the pre- vides an estimate of the expected correla-
dicted age correlations were very close to the tion between the variables, the product of
observed age correlations, implying that the the age-common, common-reasoning, and
model is plausible. reasoning-variable coefficients can be com-
The coefficients provided from the hi- pared with the observed age-variable cor-
erarchical structure analysis on these data relation to determine how accurately the
are portrayed in Figure 24.4, where it can model and its estimated parameters repro-
be seen that four statistically independent duce the actual relations in the data. The
age-related influences were identified. There predicted age correlation for the matrix rea-
was a large negative influence of age on the soning variable was −0.42, the observed
correlation was −0.5 0, and corresponding functioning, future research likely will ben-
predicted and observed values for the an- efit from a broader, more multivariate, per-
alytical reasoning variable were −0.3 7 and spective than that typically employed in con-
−0.46, respectively. With these particular temporary research and by considering the
variables, therefore, the age relations are un- effects of aging on what is common to many
derestimated by the model, which implies different types of cognitive variables instead
that additional paths, such as a direct neg- of focusing exclusively on the determinants
ative relation from age to the variable, may of age-related differences in one particular
be necessary to provide more accurate esti- task.
mates of the true covariations in the data.
One of the most interesting results in Fig-
ure 24.4, which was also apparent in the Acknowledgments
analyses reported by Salthouse and Ferrer-
Caja (2003 ), is that the reasoning factor was Research described in this chapter was sup-
the first-order factor with the strongest reported by National Institute on Aging Grants
lation to what is common to all variables. R3 7-06826; RO1 -0685 8, and RO1 -1 9627.
In fact, the standardized coefficient of 0.97
in Figure 24.4 indicates that there was al-
most complete overlap of the individual dif-
References
ferences in the reasoning factor with the in-
dividual differences in what is common to
Babcock, R. L. (1 994). Analysis of the adult age
several different cognitive abilities. This find-
differences on the Raven’s Advanced Progres-
ing is intriguing because it suggests that an
sive Matrices Test. Psychology and Aging, 9,
explanation of the age differences in reason- 3 03 –3 1 4.
ing likely also will explain much of the age-
Babcock, R. L. (2002). Analysis of age differences
related influences on other cognitive abili- in types of errors on the Raven’s Advanced Pro-
ties, and vice versa. gressive Matrices. Intelligence, 3 0, 485 –5 03 .
Carroll, J. B. (1 993 ). Human Cognitive Abilities:
A Survey of Factor Analytic Studies. New York:
Conclusions and Future Directions Cambridge University Press.
Enders, C. K., & Bandalos, D. L. (2001 ). The
Large age differences have been found in relative performance of full information maxi-
many measures of reasoning, and in some mum likelihood estimation for missing data in
cases the differences are as large as those structural equation models. Structural Equation
found in measures of other cognitive abilities Modeling, 8, 43 0–43 7.
such as memory. There still is no convinc- Fisk, J. E., & Sharp, C. (2002). Syllogistic rea-
ing explanation of the causes of age-related soning and cognitive aging. Quarterly Journal
effects on reasoning, although the available of Experimental Psychology, 5 5 , 1 273 –1 293 .
evidence suggests that aspects of WM likely Fristoe, N. M., Salthouse, T. A., & Woodard, J. L.
contribute to at least some of these effects. (1 997). Examination of age-related deficits on
Results of correlational analyses suggest that the Wisconsin Card Sorting Test. Neuropsychol-
reasoning variables are central to what is ogy, 1 1 , 428–43 6.
common across a wide variety of cognitive Gilinsky, A. S., & Judd, B. B. (1 994). Working
abilities and to the age differences in differ- memory and bias in reasoning across the life
span. Psychology and Aging, 9, 3 5 6–3 71 .
ent cognitive abilities. It therefore seems rea-
sonable to expect that an understanding of Kaufman, A. S. & Kaufman, N. L. (1 993 ).
Kaufman Adolescent and Adult Intelligence Test
age-related effects on reasoning may help ex-
(KAIT). Circle Pines, MN: American Guidance
plain much of the age-related differences in Service.
a broad variety of cognitive variables. Finally,
Kyllonen, P. C., & Christal, R. E. (1 990). Rea-
because of the centrality of reasoning to the soning ability is (little more than) working-
individual differences in much of cognitive memory capacity?! Intelligence, 1 4, 3 89–43 3 .
Raven, J. (1 962). Advanced Progressive Matrices, Salthouse, T. A., & Ferrer-Caja, E. (2003 ). What
Set II. London: H. K. Lewis. needs to be explained to account for age-
Salthouse, T. A. (1 987). The role of representa- related effects on multiple cognitive variables?
tions in age differences in analogical reasoning. Psychology and Aging, 1 8, 91 –1 1 0.
Psychology and Aging, 2 , 3 5 7–3 62. Salthouse, T. A., Legg, S. E., Palmon, R., &
Salthouse, T. A. (1 988). The role of processing Mitchell, D. R. D. (1 990). Memory factors
resources in cognitive aging. In M. L. Howe, & in age-related differences in simple reasoning.
C. J. Brainerd (Eds.), Cognitive Development in Psychology and Aging, 5 , 9–1 5 .
Adulthood. New York: Springer-Verlag. Salthouse, T. A., Mitchell, D. R. D., Skovronek,
Salthouse, T. A. (1 991 ). Mediation of adult age E., & Babcock, R. L. (1 989). Effects of adult
differences in cognition by reductions in work- age and working memory on reasoning and spa-
ing memory and speed of processing. Psycho- tial abilities. Journal of Experimental Psychology:
logical Science, 2 , 1 79–1 83 . Learning, Memory, and Cognition, 1 5 , 5 07–
Salthouse, T. A. (1 992a). Reasoning and spatial 5 1 6.
abilities. In F. I. M. Craik & T. A. Salthouse Salthouse, T. A., & Prill, K. A. (1 987). Inferences
(Eds.)., Handbook of Aging and Cognition. about age impairments in inferential reasoning.
Hillsdale, N.J.: Erlbaum. Psychology and Aging, 2 , 43 –5 1 .
Salthouse, T. A. (1 992b). Why do adult age dif- Salthouse, T. A., Atkinson, T. M., & Berish, D. E.
ferences increase with task complexity? Devel- (2003 ). Executive functioning as a potential
opmental Psychology, 2 8, 905 –91 8. mediator of age-related cognitive decline in
Salthouse, T. A. (1 992c). Working memory me- normal adults. Journal of Experimental Psychol-
diation of adult age differences in integrative ogy, General, 1 3 2 , 5 66–5 94.
reasoning. Memory and Cognition, 2 0, 41 3 – Salthouse, T. A., Fristoe, N., & Rhee, S. H.
423 . (1 996). How localized are age-related effects
Salthouse, T. A. (1 993 ). Influence of working on neuropsychological measures? Neuropsy-
memory on adult age differences in matrix rea- chology, 1 0, 272–285 .
soning. British Journal of Psychology, 84, 1 71 – Salthouse, T. A., & Skovronek, E. (1 992). Within-
1 99. context assessment of age differences in work-
Salthouse, T. A. (1 994). The nature of the influ- ing memory. Journal of Gerontology: Psycholog-
ence of speed on adult age differences in cog- ical Sciences, 47, P1 1 0–P1 20.
nition. Developmental Psychology, 3 0, 240–25 9. Verhaeghen, P., & Salthouse, T. A. (1 997).
Salthouse, T. A. (1 996a). The processing speed Meta-analyses of age-cognition relations in
theory of adult age differences in cognition. adulthood: Estimates of linear and non-linear
Psychological Review, 1 03 , 403 –428. age effects and structural models. Psychological
Salthouse, T. A. (1 996b) Where in an ordered se- Bulletin, 1 2 2 , 23 1 –249.
quence of variables do independent age-related Wechsler, D. (1 997). Wechsler Adult Intelligence
effects occur? Journal of Gerontology: Psycholog- Scale – Third Edition. San Antonio, TX: The
ical Sciences, 5 1 B, P1 66–P1 78. Psychological Corp.
Salthouse, T. A. (2000). Item analyses of age rela- Woodcock, R. W., McGrew, K. S., & Mather, N.
tions on reasoning tests. Psychology and Aging, (2001 ). Woodcock-Johnson III Tests of Cognitive
1 5 , 3 –8. Abilities. Itasca, IL: Riverside.
Salthouse, T. A. (2001 ). Attempted decomposi- Zachary, R. A. (1 986). Shipley Institute of Living
tion of age-related influences on two tests of Scale – Revised. Los Angeles, CA: Western Psy-
reasoning. Psychology and Aging, 1 6, 25 1 –263 . chological Services.
CHAPTER 2 5
Reasoning and Thinking in

Nonhuman Primates
Josep Call
Michael Tomasello
Fifty years ago, a chapter with the title adaptations may be considered cognitive in
“Reasoning and Thinking in Nonhuman Pri- the sense that they involve the individual or-
mates” would have been a very short chapter. ganism’s learning and reasoning and thinking
Behaviorists, of course, did not believe in rea- on the basis of its own individual experience
soning and thinking, and people who studied before deciding on the best way to act in
animals in their natural habitats (eventu- a given circumstance. There are specifiable
ally known as ethologists) were interested ecological circumstances in which evolution
in other things. In the 1 960s, the cognitive favors the greater flexibility afforded by cog-
revolution transformed the way psycholo- nitive adaptations, as opposed to, for exam-
gists studied human behavior and cognition, ple, hardwiring specific behavioral responses
but much of this research was about hu- to specific environmental stimuli (Boyd &
man symbolic, propositional representations Richerson, 1 985 ).
(“the language of thought”) and was not eas- In the case of nonhuman primates in par-
ily applied to research with nonhuman an- ticular, there were actually two pioneers in
imals. The cognitive revolution thus came cognitive research in the early part of the
to the study of animal behavior only very twentieth century. In Germany, Wolfgang
slowly. But during the past two decades, Köhler was a Gestalt psychologist inter-
it has arrived, and in the modern study of ested in intelligence as something that took
animal behavior, questions of cognition are organisms beyond punctate sensations and
among the most prominent. blind trial-and-error learning. He studied a
Scientists who study animals typically small group of chimpanzees in a variety
have a background in biology, so every- of problem-solving situations, looking for
thing flows from the theory of evolution. cases of perceptual restructuring and insight
These behavioral biologists and psychobiol- (Köhler, 1 925 ). In America, Robert Yerkes
ogists are interested in how animals adapt to studied a variety of behavioral phenomena
their environments – both physically and bein a number of primate species. His work in-
haviorally. In this context, some behavioral cluded studies in which animals had to solve
607
complex cognitive problems. In the mid- of research. Here we explore two aspects:
dle part of the century, behaviorists studied (1 ) how individuals navigate in large-scale lo-
such things as the speed with which differ- comotor space while traveling and (2) how
ent species could be taught through reward- individuals search for objects more locally
based training to make discriminations, form in small-scale manipulatory space. In both
learning sets, and the like (Harlow, 1 95 9; cases, the key skills involved in thinking and
Rumbaugh, 1 970) – phenomena which, to- reasoning enable an individual to predict
day, could be given interesting cognitive in- things – namely, the best path for its own
terpretations. locomotion or the likely future position of
The most exciting work in the modern moving objects.
context comes under the two titles “com-
parative cognition” and “cognitive ethology,” Travel Strategies
however. The former often refers to exper-
detours
imental work in the laboratory, and the lat-
ter often refers to observational work in the The use of detours was one of the main
natural environment. Ideally, for any given issues investigated by Köhler (1 925 ) with
phenomenon, the two approaches provide chimpanzees. He found that they were ca-
complementary types of information. pable of taking alternative routes to a goal
Our aim in this chapter is to provide an when the direct route was blocked. Since
up-to-date overview of research on think- then, little additional research has been done
ing and reasoning in nonhuman primates except in other animals species such as
(henceforth, simply “primates”). Thinking chickens or dogs. Recently, however, several
and reasoning, in our view, are character- researchers used computerized systems to
ized by mental transformations or leaps, not present mazes. Here, the subject does not
just direct perception or memory of partic- move, but it moves a cursor through the
ular stimuli; going “beyond the information maze to get to a goal box. This is a good
given.” We therefore focus on primates solv- tool to investigate detours because mazes
ing novel problems – that is, those that re- often involve the use of detours in which
quire them to do more than simply learn subjects have to move away from the direct
and remember. In terms of content, we focus approach to the goal box and use an indi-
on topics that constitute aspects of human rect route to reach it. Iversen and Matsuzawa
cognition represented by other chapters in (2001 ) trained two chimpanzees to navigate
the current volume, focusing in each case on through mazes presented on a computer
both selected classic studies and the latest re- touch screen. Chimpanzees gradually mas-
search findings. Our main topics are spatial tered a series of mazes of increasing diffi-
(Tversky, Chap. 1 0), relational, analogical culty. One of the chimpanzees learned to use
(Holyoak, Chap. 6), inferential, quantita- detours when the route on a familiar maze
tive (Gallistel & Gelman, Chap. 23 ), causal was blocked and later was able to use detours
(Buehner & Cheng, Chap. 7), and social rea- on novel mazes. The authors indicated, how-
soning and thinking. Although the chapter is ever, that subjects did not fully develop a
mainly about primates, readers interested in generalized ability to solve mazes, and some
fuller accounts of animal cognition in general practice with the particular mazes seemed
are referred to books published in the past to be required to solve the problem.
few years (Pepperberg, 1 999; Roberts, 1 998;
Shettleworth, 1 998; Tomasello & Call, 1 997; shortcuts
Vauclair, 1 996). Fieldworkers often report that several
species of primates travel from certain lo-
cations to others in an efficient manner –
Spatial Reasoning that is, taking the shortest routes possi-
ble (Garber, 1 989; Sigg, 1 986). Menzel
The spatial behavior and cognition of pri- (1 973 ) tested the ability of four young
mates and other animals is a very large field captive chimpanzees to use least-distance
reasoning and thinking in nonhuman primates 609
strategies, traveling the least while obtain- dergone different invisible displacements. In
ing the most food rewards in a large enclo- object permanence tasks, the experimenter
sure. He found that chimpanzees minimized places a piece of food under a small container
the distance traveled. Similarly, Boesch and that is displaced under several other contain-
Boesch (1 984) found that wild chimpanzees ers and the food is left under one of them. To
traveled efficiently when collecting stones solve this problem effectively, subjects have
needed to crack open nuts. They selected to search under all and only boxes under
stones that were closer to their current loca- which the food might have been deposited
tion. Recently, a number of researchers have given the trajectory of the box that initially
described the use of least-distance strategies contained the food. Several apes pass this
in vervet monkeys, common marmosets, and task but monkeys do not (Call, 2000; De
yellow-nosed monkeys (Cramer & Gallistel, Blois, Novak, & Bond, 1 998; De Blois &
1 997; MacDonald, Pang, & Gibeault, 1 994; Novak, 1 994; Dumas & Brunet, 1 994; Natale
MacDonald & Wilkie, 1 990). et al., 1 986), although there are individual
There also is a computerized version of exceptions (Schino, Spinozzi, & Berlinguer,
the shortcut task. Washburn (1 992; see also 1 990). Apes also have problems if the two
Washburn & Rumbaugh, 1 992) presented locations visited are not adjacent; that is, the
rhesus monkeys with a moving target on a experimenter visits the box on the right and
screen that they had to intercept with a cur- the left, leaving the center box untouched
sor that subjects controlled with a joystick. (Call, 2000; De Blois, Novak, & Bond, 1 998;
To do so appropriately, subjects not only had Natale et al., 1 986). This is interpreted as
to chase the target but on many occasions reconstructing the trajectory of the reward.
had to predict its location and use shortcuts Other types of displacements recently
to ambush it because the target speed was have been investigated with apes. In ro-
equal or superior to that of the cursor. In tational displacements, a reward is hidden
other words, they had to take shortcuts to in- under one of two cups and the platform is
tercept the moving target. This skill may be rotated circularly – for instance, 1 80 degrees.
more demanding than using shortcuts when In transpositions, the reward is placed under
traveling between various food sources be- one of various containers and their locations
cause subjects have to adapt to a moving tar- are swapped while the platform remains
get. This may be a useful skill in intercept- stationary. Results show that chimpanzees,
ing prey or competitors who hold valuable orangutans, and bonobos are capable of solv-
resources. These authors found that sub- ing these displacements (Beran & Minahan,
jects again were more effective at intercept- 2000; Call, 2003 ). Taken together, this
ing targets when they followed predictable means that subjects can track a variety of dis-
rather than unpredictable paths. Although placements based on the movement of the
subjects required some experience to learn object (object permanence), the containers
the paths of the targets, results from pre- (transpositions), or the substrate on which
sentation of novel target paths (e.g., the tar- the object and containers rest (rotations). All
get disappearing on the top and reappearing these result in changes in the location of the
on the bottom of the screen) suggested that object and subjects can infer its position.
monkeys had learned a general rule about In summary, primates are capable of trav-
the target’s behavior rather than a set of elling efficiently by using detours and short-
stimulus–response associations. In a similar cuts and they track the displacement of
vein, Iversen and Matsuzawa (2001 ) indi- hidden objects and infer their new locations
cated that when mazes had one short and a after various spatial transformations.
long route to get to the goal box, the chim-
panzee selected the short one.
Relational Reasoning
search for moving objects
Several studies investigated the ability of pri- In simple discrimination problems, subjects
mates to retrieve objects after they have unlearn to respond to a single stimulus or to a
stimulus category at some level of abstrac- Fujita, 1 982; but see Wright, Shyan, &
tion. Discrimination learning of relational Jitsumori, 1 990). Monkeys trained with
categories, on the other hand, involves con- shapes and capable of solving identity prob-
cepts that can be learned only by compar- lems with novel shapes, for instance, do
ing stimuli to one another and inducing a not transfer their identity concept to other
relation (e.g., “same as” “larger than”). The dimensions such as color (see Doumas &
three most studied instances of relational Hummel, Chap. 4, for a discussion of
concepts are the identity relation as manifest relational generalization). The rule that
in generalized match-to-sample problems, monkeys seem to learn is therefore better
the oddity relation as manifest in general- characterized as “pick the same shape” rather
ized oddity problems, and the sameness– than “pick the same.”
difference relation as manifest in general-
ized relation-matching problems. In all three Oddity
cases, the basic idea is that the subject is
Numerous studies demonstrate that many
given some problems in training that can be
primate species can acquire the concept
solved by attending to a relation and then is
of oddity, as evidenced by their ability to
given transfer tests that use completely dif-
solve novel problems after a period of train-
ferent objects that can be seen as instanti-
ing (King & Fobes, 1 982; Rumbaugh &
ations of that same relation. If learning is
McCormack; 1 967; Thomas & Boyd, 1 973 ).
relatively fast in the transfer phase, the infer-
Some primate species have also been able
ence is that the subject acquired a relational
to solve dimension-abstracted oddity prob-
concept in the training phase and is now ap-
lems in which the odd object must be distin-
plying it in the transfer phase. If the learning
guished from four other alternatives that are
is at the same basic rate in training and trans-
not identical to one another (as in traditional
fer (with some allowance for the formation
oddity problems) but only resemble one
of a learning set), the inference is that the
another with respect to some dimensions
subject has not learned a relational concept
(e.g., objects of different shapes that are all
but is treating each new problem as a sepa-
red, as opposed to the odd object, which
rate entity with its own particular stimulus
is blue). Macaques, squirrel monkeys, chim-
characteristics.
panzees, and gorillas were capable of solv-
ing this problem (Bernstein, 1 961 ; Thomas
Identity & Frost, 1 983 ). Human children have been
presented with oddity problems in a number
Several studies have shown that monkeys
of studies and generally perform very well in
and chimpanzees can solve identity prob-
the earliest trials of transfer (e.g., Lipsett &
lems based on generalized matching to sam-
Serunian, 1 963 ).
ple (D’Amato & Salmon, 1 984; D’Amato
et al., 1 986; Nissen, Blum, & Blum, 1 948;
Sameness–Difference
Oden, Thompson, & Premack, 1 988). In
the only study of which we are aware in In the previous two tasks, subjects have to
which human children were tested in this respond either to similarity or difference.
same type of procedure, they, like the chim- Some tasks have investigated whether sub-
panzees, generalized immediately to new jects can decide whether a pair of stim-
match-to-sample problems using only two uli are similar or different simultaneously.
sets of stimuli in training (Weinstein, 1 941 ). Several monkey species, chimpanzees, and
It should be noted, however, that this suc- orangutans were capable of judging whether
cessful performance with one stimulus di- two stimuli were “same” or “different”
mension (e.g., shape) does not generalize in (Wright, Santiago, & Sands; 1 984; Fujita,
most studies across other stimulus dimen- 1 983 ; King & Fobes; 1 975 ; Robinson, 1 95 5 ,
sions (D’Amato & Colombo, 1 985 ; Jackson 1 960; King, 1 973 ). These studies invari-
& Pegram, 1 970a, 1 970b; Kojima; 1 979; ably involved subjects making judgments
reasoning and thinking in nonhuman primates 61 1
at the perceptual level, not the functional Gillan, Premack, and Woodruff (1 981 )
level. Recently, however, Bovet and Vauclair presented the language-trained chimpanzee
(2001 ) investigated the ability of baboons Sarah (Premack, 1 976) with pairs of objects
to make same–different judgments based that had various relations; Sarah’s job was
on the functional properties of the stimuli. to identify another pair that had an analo-
They presented baboons with pairs of stim- gous relation. In so-called figural problems,
uli corresponding to two different categories Sarah was presented with an odd shape with
(i.e., food vs. nonfood). The items belong- a dot on it and that same shape without the
ing to these two categories varied in their dot; she was then presented with another
perceptual features. Results indicated that shape with a dot and had to choose from
baboons were capable of judging as “same” a pair of alternatives that same shape with-
items belonging to the same category despite out the dot (i.e., the analogous relation of
their perceptual dissimilarities and as “differ- two shapes with and without a dot). In so-
ent” items belonging to different categories. called conceptual problems Sarah was pre-
sented with household items with which she
was familiar and asked to draw analogies, for
example, between a key and lock and a can
Analogical Reasoning opener and can. On figural items, Sarah per-
formed correctly about three-quarters of the
Premack (1 983 ) argued that identity, odd- time and on conceptual items she was cor-
ity, and sameness–difference tasks as tradi- rect at a slightly higher rate. Having ruled
tionally administered do not require the kind out various possible alternative explanations,
of relational concepts that investigators have the investigators concluded that Sarah was
claimed. Because the matching takes place able to understand the relation in the first
across trials in all of these tasks, he claimed pair of stimuli at a level of abstraction suf-
that “the animal simply reacts to whether it ficient to allow her to identify it in subse-
has experienced the item before. Old/new quent stimulus pairs, both perceptually and
or familiar/unfamiliar would be better tags conceptually.
for this case than same/different” (Ref. 86, Recently, Thompson, Oden, and Boysen
p. 3 5 4). Instead, he advocated use of a gener- (1 997) found that language-naı̈ve chim-
alized match-to-sample procedure in which panzees were also able to solve analogies if,
the matching to be accomplished involves prior to testing, they had been trained to
the relations between items. Premack (1 983 ) associate a token of one shape with pairs
presented chimpanzees with a sample pair of of similar items and a different token for a
stimuli that either matched (so-called AA pair of items that were not similar. Presented
pairs, such as two apples) or that did not with the same token, they selected the sim-
match (so-called CD pairs, such as a pear and ilar pair, and presented with the different
an orange). Their task was to pick which of token, they selected the pair with unequal
two alternatives matched the relation exem- items (this is comparable to Burdyn and
plified in the sample – either a pair of new Thomas, 1 984, in which squirrel monkeys
items that matched (a so-called BB pair, such used a figure to choose between the identical
as two bananas) or a pair of new items that or the different objects in the pair). On the
did not match (a so-called EF pair, such as testing phase, subjects were presented with
a plum and a grape). When the sample was a pair of identical or different objects as a
AA, the subject was to choose BB (rather sample and two choices (one that bore the
than EF) because the relation between items same the relation as the sample and another
in both cases is one of “sameness.” If the with a different relation). Chimpanzees per-
sample was CD, the subject should choose formed above chance, indicating that they
EF (rather than BB) because the relation could identify the relation between relations.
between items in each case was one of In contrast, rhesus monkeys presented with
“difference.” an analogous procedure were unable to solve
this problem even though they were able to their instrumental behavior. Interestingly,
solve other kinds of relations such as cor- children perform like chimpanzees and dis-
rectly identifying the perceptual analogies tinguish the relations, whereas monkeys do
(Washburn, Thompson, & Oden, cited in not (Thompson & Oden, 2000). Thompson
Thompson & Oden, 2000). and Oden (2000), however, have indicated
Several authors have indicated that only that these represent implicit rather than ex-
chimpanzees that had received some train- plicit judgments of the kind shown in gen-
ing that involved using tokens represent- eralized relation-matching tasks (see Litman
ing “same” and “different” were able to & Reber, Chap. 1 8 on implicit thinking).
solve analogies (Premack, 1 983 ; Thompson Second, and more importantly, Vonk
& Oden, 2000). They argued that learn- (2003 ) has recently shown that orangutans
ing a symbolic code such as language fun- and a gorilla can solve analogies without
damentally changed the nature of the cog- any token experience or extensive training
nitive representations used by chimpanzees (see also Smith et al., 1 975 ) using a delayed
by providing them with an abstract proposi- matching to sample (DMTS) task in which
tional code (rather than a concrete imaginal subjects had to match the relation repre-
code) in terms of which they might inter- sented by a pair of geometric figures to those
pret their experience. This idea, however, of one of the two alternatives provided – the
has been challenged from two main direc- same method used by other authors to test
tions. First, Oden, Thompson, and Premack analogical reasoning in chimpanzees. There
(1 990) used a different procedure and found is also a study with baboons that showed
that four infant chimpanzees (around one they can match a sample depicting a set
year of age with no language training) also of identical or different items to the corre-
engaged in the matching of relations. They sponding alternative (Fagot, Wasserman, &
simply presented the subject with a sample Young, 2001 ). Unlike previous studies with
stimulus that consisted of a pair of objects apes, however, baboons reached high perfor-
mounted on a small board; the pair could mance only when the sample and alterna-
either match (AA) or not match (CD). Sub- tives were formed by multiple items. When
jects could play with this sample as desired, the number of items was reduced, there was
and their play time was recorded. They were a clear decrement in accuracy, particularly
then presented with two test pairs of objects, for the “different” samples. This effect, as
also mounted on board, that they might play well as the extensive training involved (ba-
with; one was a matching pair (BB) and one boons received thousands of trials before
was a nonmatching pair (EF). Subjects’ ini- they mastered the initial task), opens the
tial play with the sample affected their han- door to other interpretations based on the
dling time with the new test pairs. If sub- perception of perceptual entropy.
jects had played with the sample pair that One area in analogical reasoning that
matched (AA) they were no longer inter- has received some recent research attention
ested in the matching relation and so played from a comparative perspective is that of
more with the nonmatching test pair (EF); spatial analogy. Using a task pioneered by
if they had played with the nonmatching DeLoache and colleagues (DeLoache 1 995 ),
sample (CD), they played more with the Kuhlmeier, Boysen, and Mukobi (1 999)
matching test pair (BB). The conclusion of tested the ability of chimpanzees to make
these investigators was that chimpanzees can spatial analogies. Subjects were presented
understand relations among relations, even with a very accurate three-dimensional scale
if they do not always show this competence model of a room and subjects witnessed
in tasks in which they must actively choose how the experimenter placed an object (e.g.,
stimuli. The modified conclusion of these soda can) in a particular location in the
authors was that, although chimpanzees un- scale model (e.g., inside a cupboard). Sub-
derstand second-order relations, language jects then moved to the real room and were
training helps them incorporate this into allowed to search the room. Chimpanzees
were capable of using the scale model to pairs (D’Amato et al., 1 985 ; D’Amato &
accurately predict the location of the ob- Colombo, 1 988; Gillan 1 981 ; Boysen et al.,
ject in the room and vice versa; they were 1 993 ). This includes cases in which sub-
able to point to a location in the scale model jects have been trained with more than three
that corresponded to a location in the actual stimuli. This is important because the most
room. Initially, female chimpanzees were interesting cases are those that involve in-
more proficient than males at this task. Male termediate stimuli – that is, stimuli that are
chimpanzees tended to search the room in not the first or the last of the sequence, be-
a predetermined pattern until they eventu- cause those are always or never reinforced,
ally found the object rather than going to the respectively. D’Amato and Colombo (1 988)
specific places indicated in the scale model. trained capuchin monkeys to touch five ar-
When reward delivery was made contingent bitrary items in a specified order (labeled
on visiting the specific location on their first A, B, C, D, and E). After they had mas-
try, however, males’ performance became tered this task they were presented with
comparable to that of female chimpanzees novel pairs. Of particular importance were
(Kuhlmeier & Boysen, 2001 ). the internal pairs B–C, C–D, and B–D. The
In summary, primates are capable of per- B–D comparison was especially important
ceiving various types of relations between because these two items were both internal
objects. Moreover, apes can solve analogies to the series and were nonadjacent to one
regarding the similarity or difference be- another in the previous training. Subjects
tween pairs of objects, and chimpanzees can ordered these three internal pairs correctly
also solve spatial analogies involving the use 81 % to 88% of the time, well above chance.
of scale models. When presented with triplets from which
they were to choose the highest item, they
ordered the internal triplet B-C-D correctly
94% of the time, also well above chance.
Inferential Reasoning
This finding essentially replicates, with even
stronger results, the findings of McGonigle
Transitivity
and Chalmers (1 977) with squirrel monkeys.
The use of transitive inference has re- These authors also found evidence for a sym-
ceived much research attention. Although bolic distance effect – the farther apart two
human studies on transitivity (see Halford, items, the more successful the subjects, pre-
Chap. 22) have often used stimuli that vary sumably because the items were easier to
systematically and naturally along a quanti- distinguish.
tative dimension such as height (e.g., Piaget, One open question is, What is the mecha-
1 95 2), most studies with primates have used nism responsible for this performance? Two
so-called associative transitivity. This con- mechanisms have been postulated – the as-
sists of presenting subjects with pairs of ar- sociative mechanisms based on responding
bitrary stimuli and differentially reinforcing to the differential reinforcement and asso-
one of the stimulus of the pair, thereby creat- ciative strength of the stimuli, and the re-
ing different values. For instance, the red cup lational or linear mechanism based on cre-
is always reinforced when presented with ating a mental order of the stimuli. Bond,
the blue cup, whereas the blue cup is always Kamil, and Balda (2003 ) argued that, under
reinforced when presented with the yellow an associative mechanism, errors increase at
cup, and so on. Once the initial pairs are the end of the sequence, whereas latencies
trained, subjects are presented with pairs of should be unaffected regardless of the po-
stimuli that have not been paired before, for sition of the items. In contrast, the rela-
instance, red versus yellow cup. tional mechanism predicts that accuracy will
There is ample evidence showing that remain unchanged, whereas the latency to
primates can make transitive inferences respond will be affected. First, subjects’ la-
when subjects are presented with novel tency to respond to the first item of a pair
increased as that item moved down the se- ing with something more than an associative
ries: They responded most quickly to pairs chain; they were operating with some men-
in which the first item was A, then for pairs tally represented sequence of items in which
in which the first item was B, then C, then the ordinal position of each item was essen-
D. The implication is that each time they are tial information.
presented with a pair, the subjects are men-
tally reconstructing the entire five-item se-
Ordinality
ries (D’Amato & Colombo, 1 988). Second,
animals responded most quickly to the sec- Many of the studies on transitivity have al-
ond item of a pair for pairs with adjacent ready indicated that monkeys learn some-
items (e.g., A–B, C–D, etc.), then for pairs thing about the linear representation in a se-
separated by one gap (e.g., A–C, C–E, etc.), ries – they learn the order in which the items
then for pairs separated by two gaps (i.e., should appear. Some studies have pushed
A–D, B–E), and they were slowest on the this argument a bit further and have substi-
second item when the gap was three (A–E). tuted boxes by Arabic numerals represent-
Again, the implication is that subjects are ing different quantities. Boysen et al. (1 993 )
going through the entire series mentally on found that after chimpanzees were trained
every trial. Swartz, Chen, and Terrace (1 991 ) with pairs of adjacent numerals in the same
essentially replicated these results – both in pairwise as used in the studies previously re-
terms of ordinal judgments and in terms of ported, they were then presented with the
reaction times – for rhesus macaques. novel pair 2–4. In this study, after appropri-
Although these results are quite convinc- ate training with the initial pairs, subjects
ing, D’Amato and Colombo (1 989) pointed all were able to successfully choose the 4
out that the results of this study are com- over the 2 in the novel pairwise test. The
patible with an associative chain interpre- investigators concluded that with appropri-
tation in which each item simply serves as ate training, chimpanzees can learn the se-
a discriminative stimulus evoking the next rial order of symbolic stimuli. Washburn
item, obviating the need for some represen- and Rumbaugh (1 991 ) taught two rhesus
tation of serial order. To investigate whether macaques to associate Arabic numerals with
capuchin monkeys were also associating a the reception of a corresponding number of
specific serial position with each item in the food pellets. Because monkeys try to max-
associative chain, D’Amato and Colombo imize their food intake, they learned to se-
(1 989) used a procedure that essentially lect the larger quantity represented by the
broke the chain. Using monkeys who had various numerals that were presented to
already learned the ABCDE sequence, on them. The authors reserved some of the pairs
some trials they introduced a “wild card” for their transitivity tests. One of the two
item at a particular point in the sequence subjects was above chance in choosing the
(e.g., ABCXE). This was a novel item that larger member of the novel pair in the very
had never been used as part of the training first set of trials. Similarly, presented with
and therefore had no associations with any five numerals simultaneously, result of both
other items. These investigators found that were above chance immediately in choosing
no matter the position in which the wild card the largest one. These investigators interpret
item appeared, subjects treated it in a man- their results as indicating that the monkeys
ner similar to the item it replaced at above- formed a representation of a “matrix of val-
chance levels, touching it at the appropriate ues” corresponding to the numerals.
place in the sequence approximately 60% of Additional evidence for ordinality is pro-
the time. They performed just as well with vided by two studies not based on a transi-
sequences containing two wild card items. tivity paradigm. First, Brannon and Terrace
Consequently, D’Amato and Colombo ar- (1 998) trained rhesus macaques to touch
gued that the monkeys in this study, and a series of stimuli depicting different nu-
presumably in previous studies, were operat- merosities in ascending order. Initially, the
monkeys were trained on sets of stimuli de- the fruit. Chimpanzees solved this prob-
picting numerosities ranging from 1 to 4. In lem quickly, without trial-and-error, show-
transfer tests, monkeys were able to solve ing that they were able to infer that if the ex-
problems involving numerosities ranging 5 perimenter was eating the banana, the box
to 9. The authors argued that their results where the banana was deposited would be
demonstrated that rhesus macaques can rep- empty.
resent numerosities ranging from 1 to 9 in an More recently, Call (2004) presented all
ordinal manner. In a follow-up study, Bran- four great apes with two cups (one baited)
non and Terrace (2000) trained monkeys to and gave visual or auditory information
touch a series of stimuli in descending or- about the contents of one or both cups. Vi-
der. They failed to transfer the descending sual information consisted of removing the
rule into new numerosities, however. Mon- top of the cup so that subjects could look
keys also failed to learn to select numerosi- inside it. Auditory information consisted of
ties in a monotonic series despite extensive shaking the cup so that it produced a rattling
training. This suggests that ordinality may be sound when the food was inside. Subjects
an especially salient dimension for monkeys. correctly selected the baited cup both when
The accuracy in responding and the latency they saw the food and when they heard it.
indicated distance effects similar to those More importantly, subjects also selected the
in other studies, including human studies correct cup when only the empty cup was ei-
(e.g., Moyer & Landaeur, 1 967). Second, ther shown or shaken. This means that sub-
Kawai and Matsuzawa (2000) showed that jects chose correctly without having seen or
the chimpanzee Ai was capable of select- heard the food. Control tests showed that,
ing Arabic numerals presented on the com- in general, subjects were not more attracted
puter screen in ascending order. She had 90% to noisy cups or avoided shaken noiseless
accuracy with four-numeral series and 65 % cups. Also, subjects were unable to learn to
accuracy with five-numeral series. The re- use other comparable auditory cues such as
sponse latency was longest for the first item tapping on the baited cup to find the food.
in the series compared with the remaining The author argued that apes made inferences
ones, which suggests the chimpanzee was about the food location, rather than just as-
planning the sequence before executing the sociating an auditory cue with the reward.
entire sequence. This suggests that subjects understood that
the food caused the noise, not simply that
the noise was associated with the food.
Conjunctive Negation
There are also two studies in which chim-
This refers to the ability to infer that if a panzees were able to solve inferential ex-
given object can be located in one of two clusion in a matching to sample paradigm.
containers and, upon searching the first con- Hashiya and Kojima (2001 ) presented a
tainer, is not found there, then it must be in chimpanzee with two pictures of people
the other one. Premack and Premack (1 994) she knew and the voice of one of them.
presented chimpanzees with two boxes and The chimpanzees successfully matched the
two types of fruit, such as a banana and an voice with the correct picture. Then Hashiya
apple. Chimpanzees were allowed to witness and Kojima (2001 ) presented her with two
the experimenter deposit each fruit in one pictures (one of someone she knew and
of the boxes so that both boxes were baited. the other of someone she did not know)
Later, subjects saw the experimenter eating and an unfamiliar voice. The chimpanzee
one of the fruits (e.g., banana) and the ques- correctly matched the unfamiliar voice to
tion was whether given the opportunity to the unfamiliar picture. Beran and Washburn
select either box, they would select the one (2002) presented chimpanzees with pictures
in which the experimenter had deposited and lexigrams as samples and alternatives,
the food he was not currently eating (i.e., respectively. Pictures and lexigrams could
apple), presumably because it still contained be either familiar or unfamiliar. Familiar
samples and alternatives consisted of pic- their choices. Beran and Rumbaugh (2001 )
tures and lexigrams, respectively, that sub- argued that chimpanzees’ performance
jects had learned to associate before the test. decreased proportionally to the magnitude
As expected, chimpanzees reliably selected of the numerals presented. The authors
the familiar appropriate lexigrams repre- argued that the chimpanzees seemed to rep-
senting the sample pictures, but, in addi- resent quantities in a continuous rather than
tion, chimpanzees also were able to select a discrete fashion. This characterization
unfamiliar lexigrams when presented with differs in some way from that of Rumbaugh
familiar lexigrams and an unfamiliar picture. et al. (1 989), who indicated that these
Their success in this task did not translate subjects were counting in a way similar to
into the acquisition of the unfamiliar lexi- human children in that they knew not only
gram as a representation for the unfamiliar the ordinality but also the cardinality of the
picture, however. Arabic numerals involved.
In summary, primates are capable of Boysen and Berntson (1 989) trained the
making inferences about pieces of missing chimpanzee Sheba to count objects using
information in transitivity and conjunctive Arabic numerals using a different method.
negation problems of various types. First, they administered a one-to-one corre-
spondence task in which she had to place one
and only one object in each of six compart-
Quantitative Reasoning ments of a divided tray. Then, she was re-
quired to pick a card with the same number
of dots as the number of food items (rang-
Primates can perform operations on quanti-
ing from one to three pieces) presented on a
ties by adapting to novel arrays when some
tray. The researchers then replaced the cards
quantities are added, subtracted, or simply
with dots with cards having Arabic numer-
change in appearance but remain constant.
als and continued the training until Sheba
They also have some skills in counting, on
was able to select the Arabic numeral corre-
which these more complex skills depend.
sponding to the number of dots on a card.
Finally, the authors trained the subject in
Counting
Arabic numeral comprehension so that, pre-
Rumbaugh and coalleagues (Beran & sented with an Arabic numeral, she had to
Rumbaugh, 2001 ; Rumbaugh et al., 1 989) select the card with the corresponding num-
presented two chimpanzees with a com- ber of dots. After she mastered these tasks,
puter task in which subjects had to collect Boysen and Berntson conducted two trans-
the number of dots from the bottom of fer tests. First, they presented her with one,
the screen specified by an Arabic numeral two, or three common household items and
presented on the top of the screen. Subjects asked her to pick the corresponding Ara-
indicated when they had finished their bic numeral, which she readily did. Second,
selection with the use of the cursor. The they introduced the Arabic numerals 4, 5 ,
chimpanzees performed above chance and 0 directly (without first using cards with
with the numerals up to six and seven, dots), and Sheba readily learned to associate
respectively. Rumbaugh et al. (1 989) also these with the correct number of objects as
indicated that the chimpanzee Lana could well.
solve this task even if the squares disap- During this training of Sheba, Boysen and
peared as she touched them (with a tone Bernston noticed that she often engaged in
sounding as they disappeared) – implying “indicating acts” as she counted. That is, she
that she could keep track mentally of how touched, displaced, or “pointed to” objects
many she had already touched. The authors serially in attempting to determine the ap-
ruled out some explanations such as subitiz- propriate Arabic numeral – much as human
ing or using the temporal pattern rather children touch or otherwise indicate objects
than the number of dots as the basis for as they count them. In a follow-up study,
therefore, Boysen et al. (1 995 ) looked to see understood as a mental operation, however,
whether the number of indicating acts Sheba because subjects were not required to
used as she engaged in these tasks correlated perform any mental operations. Directly
with the number of items in the array (by the perceiving the larger of the two overall
time of this study, Sheba knew the numer- quantities in either side would suffice to
als 0–7). They gave her some counting tasks, solve this task. In other words, this can be
using the numerals 0 to 7, and found that seen as a relative numerousness judgment
she correctly counted 5 4% of the time (with over a large area, without any operation
errors distributed equally across the 1 to 7 beyond perception being implicated.
range). They found further that whereas the In an attempt to solve this problem, Call
absolute number of Sheba’s indicating acts (2000) presented orangutans with two quan-
did not correspond to the number of items tities in two dishes and then added a third
in the array precisely, typically being about one into one of the dishes. In some trials,
twice as large, these did correlate signifi- this resulted in the smaller of the two ini-
cantly (r = .74). It is unclear whether this tial quantities having more and sometimes
correlation is attributable to counting or to did not change. Call (2000) also subtracted
the fact that determining the numerosity of quantities from the initial quantities and
the larger numerals requires more time, so showed how much he had subtracted. The
that a constant rate of indicating acts across important point is that subjects never saw
all numerals would lead to the correlation. the final quantities directly, but they had
In any case, the investigators concluded that to decide based on how much had been
Sheba was counting objects in much the added to or subtracted from the quanti-
same way as human children, and that her in- ties. Orangutans were capable of perform-
dicating acts were serving a mediating func- ing above chance in both addition and
tion in the process. subtraction.
Sulkowski and Hauser (2001 ) also investi-
gated subtraction in rhesus macaques. They
Summation and Subtraction
showed subjects two quantities (up to three
Rumbaugh and colleagues (Perusse & items each), hid each of them in two sep-
Rumbaugh, 1 990; Rumbaugh, Savage- arate adjacent locations and then removed
Rumbaugh, & Hegel, 1 987; Rumbaugh, either one or no items from each location.
Savage-Rumbaugh, & Pate, 1 988) pre- Rhesus macaques selected the location with
sented two language-trained chimpanzees more items even when subtractions occurred
(Sherman & Austin) with two unequal from both locations and when some noned-
sets of candies (M&M) but presented as ible items rather than food were subtracted.
spatially distinct subsets. For instance, a Beran (2001 ) found that two chim-
trial consisted of presenting four and three panzees were also able to add quantities
candies compared with five and one candies. presented sequentially up to nine pieces of
Chimpanzees were capable of comparing candy (M&M). Unlike the previous study,
these two sets and combining the spatially each piece of candy was added individually
distinct subsets (e.g., 4 + 3 vs. 5 + 1 ) to net to one of two cups rather than presenting the
the larger total array (up to a maximum of array in its totality. In different experiments,
seven candies) on more than 90% of the subjects witnessed the experimenter plac-
trials. Although the investigators did not ing different quantities into the cups in var-
claim that their subjects “added” numbers ious steps. All candies may be added at one
in anything resembling the human method, time in a given cup or the addition rounds
they argued that the skills required for this could be alternated between the two cups.
task go beyond simple subitizing, because In the final experiment, subjects witnessed
the items in each of the two quantities to be the experimenter removing one candy from
compared are separated into two spatially one of the cups before being allowed
distinct subsets. This is far from summation to choose. Both chimpanzees performed
above chance in the addition trials, whereas ward the formation of logical thinking in
only one chimpanzee was above chance children. Seven-year-old children and older
in the subtraction trials. These two stud- understand that if two quantities were the
ies suggest that orangutans and chimpanzees same prior to a perceptual transformation
can represent quantities and mentally op- (and nothing has been added or removed)
erate on those quantities to net the largest they must be the same after the transforma-
array. tion has taken place. This logical necessity is
Olthof, Iden, and Roberts (1 997) raised the cornerstone of the conservation experi-
the bar a bit further and replaced the ac- ments (see also Halford, Chap. 22).
tual quantities by Arabic numerals. First the Although some studies on conservation
monkeys were trained to identify the numer- in monkeys have been done (e.g.,Thomas
als corresponding to quantities that subjects & Peay, 1 976), the lack of information
knew. Then subjects were given a choice be- about how subjects judge the two quanti-
tween different combinations of numerals ties prior to a transformation prevents us
involving two numerals in each pair, one nu- from drawing any conclusions. Two stud-
meral and two numerals, or three numerals ies with chimpanzees collected this informa-
in each pair. For instance (1 + 1 + 3 ) against tion and therefore can be interpreted more
(2 + 2 + 2). Squirrel monkeys were capable accurately. First, Woodruff, Premack, and
of selecting the larger total quantity. Addi- Kennel (1 978) presented Sarah with liquid,
tional tests indicated that this effect could solid, and number conservation tasks. Before
not be explained by choosing the largest this test, Sarah had learned to use plastic
numeral available or avoiding the smallest tokens to indicate whether a pair of stim-
number available. uli were “same” or “different.” In the liquid
Similarly, Boysen and Berntson (1 989) conservation task, she was presented with a
also reported that Sheba was able to visit pair of equal or unequal liquid quantities
three locations in the room, look for hid- in identical containers and asked to judge
den items that might be there and report them with the tokens. One of the quan-
the total number of items (up to four) at a tities was then poured into another con-
different location by picking up a card de- tainer with a different shape, and she was
picting the arabic numeral corresponding to asked to make a judgment on the novel
the total number of items available in the stimuli. Results indicated that she correctly
room. Sheba was able to do this by using ei- judged the quantities in liquid and solid
ther actual objects or Arabic numerals with but not in number conservation tasks. Addi-
an overall accuracy of 75 % (chance = 25 %). tional tests also showed that Sarah was un-
Given that Sheba also can make transitive able to judge correctly when she was pre-
inferences with an ordered series of items vented from seeing the quantities presented
(Boysen et al., 1 993 ) and uses indicating acts in identical containers first. This result led
as she attempts to determine the numeri- Woodruff, Premack, & Kennel (1 978) to con-
cal value of sets of objects (see previous sec- clude that she based her judgments on log-
tion), the investigators hypothesize that she ical necessity rather than perceptual esti-
is actually counting, in a human-like way, mation. Similarly, Muncer (1 983 ) reported
in these foraging tasks, and that her number that a chimpanzee was capable of selecting
concept is very much like that of a young the larger of two quantities after applying a
human child. transformation that changed the appearance
of the liquid. As in the previous study, the
chimpanzee was unable to select the larger
Conservation
quantity if she was prevented from seeing
Piaget and Inhelder (1 941 ) considered the the pretransformation quantities displayed
ability to understand that physical quanti- in identical containers.
ties remain constant after changing their per- Call and Rochat (1 996) investigated the
ceptual appearance an important step to- ability of four orangutans to solve liquid
conservation using a modified version of studies were not comparable, it is diffi-

Muncer’s procedure. Subjects were pre- cult to know whether chimpanzees and
sented with a pair of identical containers orangutans truly differ in the mechanisms
with unequal amounts of juice. Once sub- they use to solve conservation problems or
jects had indicated their choice by pointing the differences were a result of the methods
(which invariably was to the larger quan- used in each set of studies. Recently, Suda
tity), the experimenter transferred the liquid and Call (in press) set out to resolve this
quantities into a pair of unequal containers. discrepancy by studying chimpanzees and
In different experiments, the authors var- orangutans with the same procedures. They
ied either the shape of the containers or the presented apes with various liquid conser-
number of containers available (while keep- vation problems in which the initial quanti-
ing the shape constant). Although some apes ties were transferred into containers of dif-
still selected the larger quantity after a shape ferent shapes or into multiple containers, di-
transformation, this performance deterio- viding the total quantity. Results supported
rated when the contrast between the shape the notion that most apes relied on percep-
of the containers was increased. In contrast, tual estimation rather than logical necessity
some of the six- to eight-year-old children with orangutans being slightly more profi-
they tested performed satisfactorily. Further- cient than chimpanzees.
more, none of the apes solved the task when In summary, primates can solve quanti-
the quantities were transferred into mul- tative problems that require combining or
tiple containers. Call and Rochat (1 996) dissociating quantities, and they can develop
concluded that orangutans depended upon the notion of ordinality. In contrast, there is
perceptual information rather than logical little evidence that primates use logical ne-
necessity, thereby demonstrating “pseudo- cessity when confronted with various Piage-
conservation.” In a follow-up study, Call and tian conservation problems.
Rochat (1 997) investigated the use of per-
ceptual strategies underlying the orangutans’
pseudoconservation. The authors examined Causal Reasoning
three possible perceptual strategies to iden-
tify the larger amount of liquid: visual es-
Causal reasoning is a complex topic, and
timation of the liquid in the container, the
much hinges on the chosen definition of
use of information about quantity based
causality. Some researchers interpret causal-
on pouring the liquid, and a tracking strat-
ity as the ability to form stimulus–stimulus
egy that consisted of following the liq-
or stimulus–response associations. In this
uid that subjects had initially chosen. Re-
broad sense there is no doubt that many an-
sults indicated that the visual estimation
imals are sensitive to causality. We concen-
strategy best accounted for the orangutan’s
trate more narrowly on the understanding of
pseudoconservation. Overall, these investi-
the underlying “structures” and “forces” that
gators interpreted their results as indicating
are responsible for certain effects. This has
that orangutans are very good at estimating
been most studied in the domain of tool use,
quantities and at tracking the quantity they
but it has also been investigated in a variety
prefer across various spatial displacements,
of other types of physical events in which
but they do not conserve quantities across
the subject does not manipulate but only
perceptual transformations in a humanlike
observes.
manner.
The studies cited with chimpanzees sug-
Tool Use
gest the use of logical reasoning, whereas
studies with orangutans suggest the use of Many introductory texts to psychology men-
perceptual estimation in the solution of liq- tion the experiments involving tool use
uid conservation problems. Because both by chimpanzees as groundbreaking stud-
the species and methods employed in the ies. Since then, it has been shown that
several primate species use tools in a va- Hauser and colleagues recently investi-
riety of ways and for a variety of pur- gated this problem in detail with cotton-top
poses (see Beck, 1 980; Tomasello & Call, tamarins (Hauser, Kralik & Botto-Mahan,
1 997, for reviews). We concentrate on three 1 999). Their studies questioned whether
tasks that have been used to investigate these monkeys can distinguish between rele-
causality. vant and irrelevant features of tools – in this
case, the cloth. They found that tamarins
were able to master this problem. In par-
support problem ticular, presented with two cloths and two
In this problem, a reward is placed on a rewards from which to choose, they pulled
cloth. The reward itself is outside the sub- the cloth on which the reward rested. In
ject’s reach, but one of the ends of the cloth another experiment, subjects selected the
is within reach. The solution to this problem cloth that was connected somehow to the
consists of pulling the cloth to bring the re- reward but avoided the cloth that was not
ward within reach. Piaget (1 95 2) studied this connected to the reward. Once subjects had
problem in human infants and indicated that mastered these two problems, the authors
by 1 2 months of age children not only readily presented monkeys with both relevant and
pull in the cloth but, more importantly, they irrelevant changes to the problem. Relevant
withhold pulling when the reward is not in changes included the position of the re-
contact with the cloth. This indicates that ward in relation to the reward or the con-
children at this age understand that spatial nectedness between two pieces of cloth; ir-
contact is necessary for the tool to act on relevant changes included variations in the
the reward. color, texture, or shape of the cloth. The
Spinozzi and Potı̀ (1 989) tested several in- tamarins ignored irrelevant changes to the
fant primates (one Japanese macaque, two tool such as color or shape. They failed to
capuchin monkeys, two longtail macaques, solve some problems involving changes in
and one gorilla) on this problem. In one con- the relevant features, although they mas-
dition, the reward was placed on the cloth, tered those problems with additional expe-
whereas in another condition the reward was rience. The authors interpreted this as an
placed off the cloth to the side. All primates ability to distinguish between relevant and
responded appropriately by pulling in the irrelevant features.
cloth when the reward was on the cloth and Hauser et al. (2002) investigated whether
withheld pulling when the reward was off experience with tool use played a role in de-
the cloth. In a second experiment, Spinozzi ciding what constituted the relevant func-
and Potı̀ (1 989) tested the generality of these tional features of a tool. They presented
findings by modifying the conditions of the monkeys with a number of cloth problems
off-cloth condition by placing the reward that varied along several parameters except
near the end of the cloth rather than to that the correct alternative was always in-
the side of it. The authors reasoned that if dicated by the same color. Subjects there-
subjects had simply learned to respond ap- fore could solve the various problems by
propriately to a specific configuration of the either attending to the relevant features of
cloth and the reward rather than a more the problem (e.g., connectedness) or the
general relation between them, they would color of the cloth. Once animals had mas-
respond inappropriately to this novel con- tered the series of preliminary tests, they
figuration. Results confirmed their previous were presented with novel problems but
findings: All subjects pulled in the on-cloth with the color contingency reversed so that
condition but not in the off-cloth condition. color always signaled the incorrect alterna-
Recently, Spinozzi and Potı̀ (1 993 ) admin- tive. Results indicated that tool-experienced
istered the same support problem to two monkeys relied less on irrelevant cues such
infant chimpanzees and only one of them as color than tool-naı̈ve individuals in solv-
succeeded. ing the cloth problem. Nevertheless, all
monkeys experienced a postreversal decre- they varied either relevant or irrelevant fea-
ment performance, albeit performance of tures of the task, as previously done with
tool-experienced monkeys suffered less. the cloth problem. Results mirrored those
of the cloth problem and indicated that in-
dividuals selected tools most often based on
stick and hook problem relevant, as opposed to irrelevant, functional
A more challenging task than the sup- features. Hauser, Pearson, and Seelig (2002)
port problem consists of using a tool to recently investigated the role of experience
bring in a reward that is not in direct con- in the ability to distinguish relevant from
tact with the tool. This situation entails irrelevant features. They found that infant
putting the tool into contact with the re- tamarins, without much experience with
ward and then sweeping the reward within tools, also selected tools based on relevant
reach. According to Natale (1 989), solving features, reproducing the results of the adult
this task demonstrates an ability to under- subjects.
stand complex causal relations such as that
the stick must be of the appropriate size
and material (e.g., long and rigid) and that tube and trap problem
only certain kinds of contact (e.g., with a In this problem, the reward is placed inside
certain force and directionality) would be the middle portion of a transparent tube,
successful. and subjects have to use a stick to push the
Natale (1 989) presented eight subjects reward out the end opposite to which the
from the same four species tested by reward was inserted. In a series of studies,
Spinozzi and Potı̀ (1 989) with an out-of- Visalberghi and colleagues explored the abil-
reach reward and a stick placed in differ- ity of capuchin monkeys and apes (mainly
ent positions relative to the object in dif- chimpanzees) to solve this problem and to
ferent experimental conditions. Three of adjust to novel variations of this problem.
the four capuchin monkeys and the go- Visalberghi and Trinca (1 989) found that
rilla were moderately successful in obtaining three of four capuchin monkeys succeeded
the reward in various tool-reward spatial ar- in the basic version of the problem, and then
rangements. These results have been con- the authors administered three variations
firmed by other studies (see Beck, 1 980; of the problem involving different types of
Tomasello & Call, 1 997 for a review). Al- tools that required different solutions. In the
though none of the macaques tested by bundle task, subjects were given a bundle
Natale (1 989) was able to obtain the re- of sticks taped together that, as a whole,
ward with the stick, other studies have was too wide to fit in the tube; the solu-
shown that macaques and other primates, tion consisted of breaking the sticks apart.
including baboons, orangutans, and chim- In the short-sticks task, subjects were given
panzees, are capable of solving the stick three short sticks that, together, added up
problem (see Tomasello & Call, 1 997 for a to the length required; the solution consist-
review). ing of putting them all in the same end of
A refinement of the stick problem con- the tube to displace the food out the other
sists of presenting a hook-shaped tool and side. Finally, in the H-tool task, subjects were
a straight tool as alternatives for retrieving given a stick with transverse pieces on ei-
the reward. Hauser (1 997) presented cotton- ther end that prevented its insertion into the
top tamarins with two hooked tools, only tube; the solution consisted of removing the
one of which had the reward inside the blocking piece from the tool. Although all
hook so that pulling it would bring the re- three subjects eventually solved these varia-
ward. Once tamarins consistently solved this tions of the task, they made a number of er-
problem – that is, they preferred the stick rors such as attempting to insert the whole
with the reward inside the hook – the au- bundle or inserting one short stick in one
thors presented novel problems in which end of the tube and another short stick in
the other end. Moreover, these errors did formance of the majority of subjects actu-
not decrease significantly over trials, suggest- ally deteriorated over trials, indicating they
ing that capuchins understood little about may not have come to understand the causal
the causal relations between the elements in relations involved, although their young age
the task. Visalberghi, Fragaszy, and Savage- may have explained their poor performance.
Rumbaugh (1 995 ) essentially replicated the Visalberghi Fragaszy, & Savage-Rumbaugh
results of the bundle and the H-tool tasks (1 995 ) presented the bundle and H-tool
with six other capuchins. tasks to subadult and adult apes (four bono-
One recent study, however, suggests that bos, five chimpanzees, and one orangutan).
some capuchins may understand more about Eight of the ten apes solved the basic tube
causal relations than previously thought. task on the first trial, and the other two
Anderson and Henneman (1 995 ) tested the were successful later. When given a bun-
ability of two adult capuchin monkeys to an- dle of sticks, all subjects immediately disas-
ticipate (and solve) a variety of problems assembled the bundle and, unlike capuchins,
sociated with using a stick to extract honey no ape attempted to insert the bundle as
from a box with multiple holes. In a se- a whole. Apes proved less successful in the
ries of experiments of increasing complex- H-tool task, however, making some of the
ity, subjects were required to select a stick same mistakes as the capuchins. Indeed, a
of the appropriate diameter to fit the holes, statistical comparison of the two species in
rake in a stick of the appropriate diameter this condition revealed no significant differ-
with the help of another tool, modify a stick ence. Although there was an overall group
that was too thick or too twisted to fit the tendency to decrease the number of errors
holes, or construct a rake that would per- across trials, some subjects increased their
mit them to obtain a suitable stick to ex- errors.
tract the honey. Results indicated that both To examine further the understanding of
capuchins (especially the male) readily se- causal relations in the tube task, Visalberghi
lected sticks of a diameter suitable to fit the and Limongelli (1 994) presented a new tube
holes. This even included cases in which the problem that punished subjects who did not
box and the sticks available were not within foresee the consequences of their behavior.
the same visual field. This result contrasts The authors presented four capuchin mon-
with Visalberghi’s (1 993 ) findings in which keys with a tube that had a trap in its bottom
capuchins failed to select appropriate tools center, and placed the food next to the trap.
to solve the tube task when the tools were If subjects pushed the food in the direction
left in a room adjacent to the tube with food of the trap, it would fall in it and they would
in it. Moreover, Anderson and Henneman lose it; to get the food out, they had to push
(1 995 ) noted that one capuchin modified the food away from the trap toward the other
tools in a very purposeful manner without end of the tube. Visalberghi and Limongelli
committing the sort of errors described by (1 994) found that only one subject solved
Visalberghi and Trinca (1 989). The same ca- the task, systematically pushing the reward
puchin also used a tool (itself not suitable for away from the trap. Although this subject
honey-dipping) to rake in appropriate sticks seemed to be planning her moves in advance,
for honey-dipping. Neither of the subjects, the authors noted that in half the trials, she
however, was able to construct a rake to ob- inserted the tool in the wrong side of the
tain honey-dipping sticks. tube and, upon seeing that the reward was
The tube task has also been administered moving into the trap, withdrew the tool,
to apes. First, Bard, Fragaszy, and Visalberghi reinserted it in the other end, and pushed
(1 995 ) administered this task to young chim- out the reward. Visalberghi and Limongelli
panzees (two to four years old) and found (1 994) probed further her understanding of
that in the two most difficult versions of the the relation between the trap and the reward
task (i.e., short-sticks and H-tool), the per- by inverting the trap 1 80 degrees so that the
trap was on top of the tube, where it was more clearly seen in the tube task and its
no longer effective. The subject, however, variations.
persisted in her strategy of pushing the food
away from the trap, suggesting that she had
Perceiving and Judging Physical Events
apparently simply learned to push the food
away from the trap side without understand- One area that has received considerable at-
ing the causal relations between the trap and tention is that of object knowledge in in-
the reward. fants. These studies present subjects with a
Limongelli, Boysen, and Visalberghi series of events – some that follow the laws
(1 995 ) presented the trap-tube task to five of physics such as solidity or gravity and
chimpanzees who behaved at chance levels others that violate those laws. Using look-
for the first seventy trials, although two of ing measures, numerous studies have found
them learned to avoid the trap during sev- that human infants respond selectively to
enty additional trials. The authors admin- the violation of physics laws (Baillargeon,
istered an additional test to assess whether 1 995 ; Spelke et al., 1 995 ). These authors
chimpanzees understood the relationship have argued that even at this young age, chil-
between the position of the reward with re- dren show object knowledge. Hauser and
spect to the trap or whether they were sim- colleagues have been instrumental in intro-
ply using the simple rule of pushing the reducing this area of research in nonhuman
ward out the side to which it was closest, primates. They have concentrated on two
thus avoiding the trap. Limongelli, Boysen, topics: gravity and solidity.
and Visalberghi (1 995 ) varied the location In the gravity area, Hood et al. (1 999) pre-
of the trap in the tube. In some cases, the sented cotton-top tamarins with three con-
trap was located very close to one end with tainers arranged in a straight line. One of the
the food just beyond it, so that subjects ac- containers was connected to an opaque tube
tually had to push the food out the end from through which the experimenter dropped
which it was farthest. In other cases, the food. Subjects consistently searched for the
opposite arrangement was used. Both sub- food in the container over which the food
jects solved these variations easily, with al- was dropped. They did this regardless of
most no errors, so the researchers concluded whether the tube was connected to the
that these two chimpanzees understood the container or not. This indicates that mon-
causal relations in this task better than the keys failed to understand that the reward’s
capuchin monkeys. It should be noted, how- straight-fall trajectory can be deviated by the
ever, that the variations used in this experi- tube. This bias persisted despite variations
ment could still be solved by the rule “push on the incentives offered to the subjects for
the food away from the trap,” which could successful performance. Children presented
have been learned during the previous tri- with the same task also show a gravity bias,
als. Unfortunately, the authors did not in- although older children can eventually over-
vert the trap as was previously done with come it (Hood, Care, & Prasada, 2000). In a
capuchins. follow-up experiment, Hauser et al. (2001 )
In summary, this section has shown reported that when the reward trajectory
that various primates have some knowledge was horizontal rather than vertical (as in the
about causal relations regarding what makes original test), tamarins performed better and
a tool effective. They know that objects have the biases observed previously disappeared.
to be in contact for a tool to be effective, rec- Also subjects with experience with the hor-
ognize the relevant and irrelevant functional izontal version of the task performed bet-
features of a tool, and can choose the ap- ter in the original task (i.e., free-fall reward)
propriate dimensions of an effective tool in than subjects without such experience, even
a particular task. Nevertheless, these studies though the gravity bias was still apparent.
have also shown clear limitations, perhaps Taken together, these results suggest that
tamarins have a pervasive gravity bias that ure was especially striking because subjects
impairs their search for hidden objects. were tested after they had solved the initial
Hauser (2001 ) investigated the same problem. Second, their performance in this
topic with a different paradigm. He pre- tapping test was comparable to performance
sented rhesus macaques with a table and two in learning novel stimuli with arbitrary rela-
boxes. The first box was placed on top of the tions – for instance, learning that a green cup
table and the second box was placed under has food and a yellow cup does not. Sub-
the table right under the first box. The exper- jects responded correctly to the auditory cue
imenter then raised a screen that occluded when it held a causal connection to the food
both boxes (and the table) and dropped a re- but failed to do so when the auditory cue
ward over the top box. Because of the screen, held a noncausal connection to the food.
the monkeys never saw where the reward In a second study, Call (unpublished data)
entered the box, they just saw it falling to- investigated the ability of apes to use the
ward the table and disappearing behind the shape of objects to locate food. In the initial
screen. Monkeys searched for the food in problem, he presented two rectangular trays
the bottom box, thus showing a gravity bias. on a platform and hid a piece of food un-
Control tests indicated that subjects did not der one of them. One of the trays therefore
have a preference for the bottom box, nor rested flat on the platform whereas the other
did they avoid the top box in the absence rested in an inclined orientation (because of
of the reward drop. Interestingly, Santos and the food placed under it). Subjects selected
Hauser (2002) found that rhesus monkeys preferentially the inclined tray but failed to
tested with the same paradigm but using a do so in a control test in which the inclined
violation of expectation measures solved this tray was substituted by a wooden wedge
problem. In other words, subjects looked that produced the same visual effect as the
longer in trials in which the reward appeared inclined tray. This result was important be-
on the bottom (apparently going through a cause it ruled out the possibility that subjects
solid partition) than in trials in which the simply preferred the perceptual appearance
reward stayed on top of the partition. of the inclined tray, perhaps because it had
Call (2004) recently investigated two been reinforced in the past. More impor-
other aspects of the object knowledge that tantly, subjects failed to select the wedge,
subjects may use to find food. The first is an arbitrary stimulus, after they were suc-
whether apes know that food inside a con- cessful in the inclined tray test. This result
tainer when shaken makes a noise. He found is analogous to that of the previous study
that apes are capable of using the noise in which subjects failed to respond above
made by shaking food to identify the correct chance to stimuli with noncausal connec-
container (see inferential reasoning section). tions to the food after they had succeeded
Although one may argue that this simply with very similar stimuli with causal connec-
involves detecting an association between tions. It therefore is found again that when
the food and the cue rather than an un- there are arbitrary (i.e., noncausal) relations
derstanding that the food causes the noise, between the food and the elements of the
there are several lines of evidence that sug- problem, subjects perform poorly compared
gest that this interpretation oversimplifies with when the connection is nonarbitrary
the phenomenon. First, subjects performed (i.e., causal). It is unlikely that these results
well from the beginning, with no evidence are based solely on learning to associate a
of gradual improvement over trials. If sub- cue with a response without any insight into
jects had learned to associate a noise with the structure of the problem. Instead, it is
food in the past, it is unclear why, in control conceivable that subjects understood that it
tests, they failed to associate a noise made by was the food that caused the noise or made
tapping the baited cup, which was compara- the tray incline, not simply that the food was
ble to that made by shaking the food inside associated with the presence of the noise or
the cup, with the presence of food. This fail- the shape.
In summary, primates have some knowl- The basic finding was that subordinates
edge about the physical properties of did go for food only they could see much
objects, and they can use this knowledge to more often than they went for food that both
predict the location in which rewards can be they and the dominant could see. In some
found. cases, the subordinate may have been mon-
itoring the behavior of the dominant, but in
other cases this possibility was ruled out by
giving subordinates a small headstart, forcing
Social Reasoning them to make their choice (to go to the food
that both competitors could see, or to go to
Primates’ social cognition represents a large the food that only they could see) before the
area of research in its own right. At the dominant was released into the area. More-
most basic level, it involves how individu- over, we ran two other control conditions. In
als understand and predict the behavior and, one, the dominant’s door was lowered before
perhaps, the perceptual activities of others. the two competitors were let into the room
Of course, it also may involve how indi- (and again the subordinate got a small head-
viduals understand the psychological states start), so that the subordinate could not see
and activities of others, which are less di- which piece the dominant was looking at un-
rectly observable. So the question is whether der the door (i.e., it is possible that in the first
primates can reason about the psycholog- studies the subordinate saw that the domi-
ical states and activities of others. Despite nant was looking at the out-in-the-open food
many richly interpreted anecdotes, until re- and so went for the other piece). The re-
cently there was very little evidence that sults were clear. Subordinates preferentially
primates reasoned about what others were targeted the hidden piece. In the other con-
seeing, intending, wanting, and thinking (see trol study, we followed the same basic pro-
Tomasello & Call, 1 997, for a review). Some cedure as before (one piece of food in the
recent studies have demonstrated that pri- open, one on the subordinate’s side of a bar-
mates can reason about some – although rier) but we used a transparent barrier that
clearly not all – of the psychological states did not prevent the dominant from seeing
about which humans reason. the food behind it. In this case, chimpanzees
Hare et al. (2000) placed a subordinate chose equally between the two pieces of
and a dominant chimpanzee into rooms on food, seeming to know that the transparent
opposite sides of a third room. Each had a barrier was not serving to block the dom-
guillotine door leading into the third room inant’s visual access (and so her “control”
which, when cracked at the bottom, allowed of the food). The findings of these studies
them to observe two pieces of food at vari- suggest that chimpanzees know what con-
ous locations within that room – and to see specifics can and cannot see and, further, that
the other individual looking under her door. they use this knowledge to make inferences
After the food had been placed, the doors about what their competitor is about to do
for both individuals were opened and they next.
were allowed to enter the third room. The In a follow-up study, Hare, Call, and
basic problem for the subordinate in this sit- Tomasello (2001 ) investigated whether
uation is that the dominant will take all of chimpanzees were also able to take into ac-
the food it can see. In some cases, however, count past information such as whether the
things were arranged so that the subordinate dominant had seen the baiting. For these
could see a piece of food that the dominant experiments, two barriers and one piece of
could not see – because it was on her side of food were used, and what the dominant
a small barrier. The question in these cases saw was manipulated. In experimental tri-
was whether the subordinate knew that the als, dominants had not seen the food hid-
dominant could not see a particular piece of den, or food they had seen hidden was
food, so it was safe for her to go for it. moved to a different location when they
were not watching, whereas in control trials, quite often have some components, or some
they saw the food being hidden or moved. aspects, of the human version, and in some
Subordinates, on the other hand, always cases they possess skills that humans do not
saw the entire baiting procedure and could have or do not have to the same degree (e.g.,
monitor the visual access of the dominant some of the memory skills demonstrated by
competitor. Subordinates preferentially re- food-caching birds; Shettleworth, 1 998). We
trieved and approached the food that domi- need to compare the skills in detail if we
nants had not seen hidden or moved, which want to provide an anatomy of their struc-
suggests that subordinates were sensitive to ture from an evolutionary point of view.
what dominants had or had not seen during Because this book is mainly about human
baiting a few moments before. In this case, reasoning and thinking, we should conclude
deciding which piece of food to approach with a word about what we believe makes
depended on the subordinate’s making infer- human cognition different from that of other
ences about what the dominant knew about primates. The answer, in a word, is culture
the situation. (Tomasello & Call, 1 997; Tomasello, 1 999).
These studies of what may be called so- The thought experiment we use to demon-
cial problem solving demonstrate that some strate the point is to imagine a human child
primates may make inferential leaps not raised on a desert island without any social
just about directly perceivable things, but contacts. Our contention is that in adult-
also about less observable things such as hood this adult’s cognitive skills would not
what others do and do not see, or even differ very much – perhaps a little, but not
have or have not seen in the immediate very much – from those of other great apes.
past. This person would certainly not invent by
him or herself a natural language, or algebra
or calculus, or science or government. The
human cognitive skills that make the most
Conclusions and Future Directions difference are those that enable individu-
als of the species Homo sapiens, in a sense,
There was a time when the dominant view to pool their cognitive resources – to create
in the Western intellectual tradition was that and participate in collective cultural activ-
human beings were rational and all other an- ities and products. When viewed from the
imals were simply preprogrammed brutes or perspective of the individual mind, the cog-
automata. That view is demonstrably false. nitive skills necessary for cultural creation
All the evidence reviewed in this chapter and learning may not differ so very much
suggests that nonhuman primates interact from those of other primate species.
with their worlds in many creative ways, re- In any case, much can be learned about
lying on a variety of cognitive processes to do human cognition by looking at how it is
so. They reason and make inferences about similar to and how it is different from that
space, causality, objects, quantities, and the of closely related species. We hope to have
psychological states of other individuals, and shown in this chapter that, in many fun-
in some cases they can engage in relational damental respects, human cognition is sim-
and analogical reasoning concerning partic- ply one form of primate cognition. The vast
ular objects or even categories of objects. gulf that seems to separate what humans and
The main pitfall to avoid in attempting to other primates can do cognitively – in the do-
integrate our knowledge about the cognitive main of mathematics, as just one instance –
skills of other animals with our knowledge in many, if not most, cases is the result of
about human cognition is oversimplification. fairly small differences of individual psy-
Asking dichotomously whether or not ani- chology that enable humans to accumulate
mals reason or think or have a theory of mind knowledge across generations and to use col-
generally is not very useful (Tomasello, Call, lective artifacts such as linguistic and math-
& Hare, 2003 a, 2003 b). Nonhuman animals ematical symbols.
Acknowledgment Bovet, D., & Vauclair, J. (2001 ). Judgment of con-

ceptual identity in monkeys. Psychonomic Bul-
The authors wish to thank Keith Holyoak letin and Review, 8, 470–475 .
for comments made on an earlier draft of Boyd, R., & Richerson, P. (1 985 ). Culture and the
this chapter. Evolutionary Process. Chicago: The University
of Chicago Press.
Boysen, S. T., & Berntson, G. G. (1 989). Nu-
merical competence in a chimpanzee (Pan
References troglodytes). Journal of Comparative Psychology,
1 03 , 23 –3 1 .
Anderson, J. R. & Henneman, M. C. (1 995 ). So- Boysen, S. T., Berntson, G. G., Shreyer, T. A., &
lutions to a tool-use problem in a pair of Cebus Hannan, M. B. (1 995 ). Indicating acts during
apella. Mammalia, 5 8, 3 5 1 –3 61 . counting by a chimpanzee (Pan troglodytes).
Baillargeon, R. (1 995 ). Physical reasoning in in- Journal of Comparative Psychology, 1 09,
fancy. In M. Gazzaniga (Ed.), The Cognitive 47–5 1 .
Neurosciences (pp. 1 81 –204) Cambridge, MA: Boysen, S. T., Berntson, G. G., Shreyer, T. A.,
MIT Press. & Quigley, K. S. (1 993 ). Processing of ordi-
Bard, K. A., Fragaszy, D. M., & Visalberghi, nality and transitivity by chimpanzees (Pan
E. (1 995 ). Acquisition and comprehension of troglodytes). Journal of Comparative Psychology,
a tool-using behavior by young chimpanzees 1 07, 208–21 5 .
(Pan troglodytes): Effects of age and modelling. Brannon, E. M., & Terrace, H. S. (1 998). Ordering
International Journal of Comparative Psychol- of the numerosities 1 to 9 by monkeys. Science,
ogy, 8, 47–68. 2 82 , 746–749.
Beck, B. B. (1 980). Animal Tool Behavior. New Brannon, E. M., & Terrace, H. S. (2000). Rep-
York: Garland Press. resentation of the numerosities 1 –9 by Rhesus
Beran, M. J. (2001 ). Summation and numerous- macaques (Macaca mulatta). Journal of Exper-
ness judgments of sequentially presented sets of imental Psychology: Animal Behavior Processes,
items by chimpanzees (Pan troglodytes). Journal 2 6, 3 1 –49.
of Comparative Psychology, 1 1 5 , 1 81 –1 91 . Burdyn, L. E., & Thomas, R. K. (1 984). Condi-
Beran, M. J., & Minahan, M. F. (2000). Moni- tional discrimination with conceptual simulta-
toring spatial transpositions by bonobos (Pan neous and successive cues in the squirrel mon-
paniscus) and chimpanzees (P. troglodytes). In- key (Saimiri sciureus). Journal of Comparative
ternational Journal of Comparative Psychology, Psychology, 98, 405 –41 3 .
1 3 , 1 –1 5 . Call, J. (2000). Estimating and operating on
Beran, M. J., & Rumbaugh, D. M. (2001 ). “Con- discrete quantities in orangutans (Pongo pyg-
structive” enumeration by chimpanzees (Pan maeus). Journal of Comparative Psychology,
troglodytes) on a computerized task. Animal 1 1 4, 1 3 6–1 47.
Cognition, 4, 81 –89. Call, J. (2001 ). Object permanence in orangu-
Beran, M. J., & Washburn, D. A. (2002) Chim- tans (Pongo pygmaeus), chimpanzees (Pan
panzee responding during matching to sample: troglodytes), and children (Homo sapiens). Jour-
Control by exclusion. Journal of the Experimen- nal of Comparative Psychology, 1 1 5 , 1 5 9–
tal Analysis of Behavior, 78, 497–5 08. 1 71 .
Bernstein, I. S. (1 961 ). The utilization of visual Call, J. (2003 ). Spatial rotations and transposi-
cues in dimension-abstracted oddity by pri- tions in orangutans (Pongo pygmaeus) and chim-
mates. Journal of Comparative and Physiological panzees (Pan troglodytes). Primates, 44, 3 47–
Psychology, 5 4, 243 –247. 3 5 7.
Boesch, C., & Boesch, H. (1 984). Mental map Call, J. (2004). Inferences about the location of
in wild chimpanzees: An analysis of hammer food in the great apes. Journal of Comparative
transports for nut cracking. Primates, 2 5 , 1 60– Psychology, 1 1 8, 23 2–241 .
1 70. Call, J., & Rochat, P. (1 996). Liquid conservation
Bond, A. B., Kamil, A. C., & Balda, R. P. (2003 ). in orangutans. Individual differences and per-
Social complexity and transitive inference in ceptual strategies Journal of Comparative Psy-
Corvids. Animal Behaviour, 65 , 479–487. chology, 1 00, 21 9–23 2.
Call, J., & Rochat, P. (1 997). Perceptual strate- Fagot, J., Wasserman, E. A. & Young, M. E.
gies in the estimation of physical quantities by (2001 ). Discriminating the relation between
orangutans (Pongo pygmaeus). Journal of Com- relations: The role of entropy in abstract con-
parative Psychology, 1 1 1 , 3 1 5 –3 29. ceptualization by baboons (Papio papio) and
Cramer, A. E., & Gallistel, C. R. (1 997). Vervet humans (Homo sapiens). Journal of Experimen-
monkeys as travelling salesmen. Nature, 3 87, tal Psychology: Animal Behavior Processes, 2 7,
464. 3 1 6–3 28.
D’Amato, M. R., & Colombo, M. (1 985 ). Au- Fujita, K. (1 982). An analysis of stimulus con-
ditory matching-to-sample in monkeys (Ce- trol in two-color matching-to-sample behav-
bus apella). Animal Learning and Behavior, 1 3 , iors of Japanese monkeys (Macaca fuscata fus-
3 75 –3 82. cata). Japanese Psychological Research, 2 4, 1 24–
135.
D’Amato, M. R., & Colombo, M. (1 988). Rep-
resentation of serial order in monkeys (Cebus Fujita, K. (1 983 ). Formation of the sameness-
apella). Journal of Experimental Psychology: An- difference concept by Japanese monkeys from
imal Behavior Processes, 1 4, 1 3 1 –1 3 9. a small number of color stimuli. Journal of
the Experimental Analysis of Behavior, 40, 289–
D’Amato, M. R., & Colombo, M. (1 989). Serial
3 00.
learning with wild card items by monkeys (Ce-
bus apella): Implications for knowledge of ordi- Garber, P. (1 989). Role of spatial memory in pri-
nal position. Journal of Comparative Psychology, mate foraging patterns: Saguinus mystax and
1 03 , 25 2–261 . Saguinus fuscicollis. American Journal of Prima-
tology, 1 9, 203 –21 6.
D’Amato, M. R., & Salmon, D. P. (1 984). Cog-
nitive processes in cebus monkeys. In H. L. Gillan, D. J. (1 981 ). Reasoning in the chimpanzee:
Roitblat, T. G. Bever, & H. S. Terrace (Eds.), II. Transitive inference. Journal of Experimental
Animal Cognition (pp. 1 49–1 68). Hillsdale, NJ: Psychology: Animal Behavior Processes, 7, 1 5 0–
Erlbaum. 1 64.
D’Amato, M. R., Salmon, D. P., Loukas, E., & Gillan, D. J., Premack, D., & Woodruff, G. (1 981 ).
Tomie, A. (1 985 ). Symmetry and transitiv- Reasoning in the chimpanzee: I. Analogical rea-
ity of conditional relations in monkeys (Cebus soning. Journal of Experimental Psychology: An-
apella) and pigeons (Columba livia). Journal of imal Behavior Processes, 7, 1 –1 7.
the Experimental Analysis of Behavior, 44, 3 5 – Hare, B., Call, J., Agnetta, B., & Tomasello, M.,
47. (2000). Chimpanzees know what conspecifics
D’Amato, M. R., Salmon, D. P., Loukas, E., & do and do not see. Animal Behaviour, 5 9, 771 –
Tomie, A. (1 986). Processing of identity and 785 .
conditional relations in monkeys (Cebus apella) Hare, B., Call, J., & Tomasello, M., (2001 ). Do
and pigeons (Columba livia). Animal Learning chimpanzees know what conspecifics know
and Behavior, 1 4, 3 65 –3 73 . and do not know? Animal Behaviour, 61 , 1 3 9–
De Blois, S. T., & Novak, M. A. (1 994). Object 151.
permanence in rhesus monkeys (Macaca mu- Harlow, H. F. (1 95 9). The development of learn-
latta). Journal of Comparative Psychology, 1 08, ing in the rhesus monkey. American Scientist,
3 1 8–3 27. 47, 45 9–479.
De Blois, S. T., Novak, M. A., & Bond, M. (1 998). Hashiya, K., & Kojima, S. (2001 ). Hearing and
Object permanence in orangutans (Pongo pyg- auditory-visual intermodal recognition in the
maeus) and squirrel monkeys (Saimiri sci- chimpanzee. In T. Matsuzawa (Ed.). Primate
ureus). Journal of Comparative Psychology, 1 1 2 , Origins of Human Cognition and Behavior
1 3 7–1 5 2. (pp. 1 5 5 –1 89). Berlin: Springer-Verlag.
DeLoache, J. S. (1 995 ). Early understanding and Hauser, M. D., Kralik, J., & Botto-Mahan, C.
use of symbols: The model model. Currents Di- (1 999). Problem solving and functional design
rections in Psychological Science, 4, 1 09–1 1 3 . features: Experiments on cotton-top tamarins,
Dumas, C. & Brunet, C. (1 994). Pernamence de Saguinus oedipus. Animal Behaviour, 5 7, 5 65 –
l’objet chez le singe capucin (Cebus apella): 5 82.
Etude des desplacements invisibles. Revue Hauser, M., Pearson, H., & Seelig, D. (2002).
Canadienne de psychologie experimentale, 48, Ontogeny of tool use in cotton-top tamarins,
3 41 –3 5 7. Saguinus oedipus: Innate recognition of
functionally relevant features. Animal Be- King, J. E. (1 973 ). Learning and generalization
haviour, 64, 299–3 1 1 . of a two-dimensional sameness-difference con-
Hauser, M. D. (1 997). Artifactual kinds and func- cept by chimpanzees and orangutans. Journal of
tional design features: What a primate under- Comparative and Physiological Psychology, 84,
stands without language. Cognition, 64, 285 – 1 40–1 48.
3 08. Köhler, W. (1 925 ). The Mentality of Apes. London:
Hauser, M. D. (2001 ). Searching for food in Routledge & Kegan Paul.
the wild: A nonhuman primate’s expectations Kojima, T. (1 979). Discriminative stimulus con-
about invisible displacement. Developmental text in matching-to-sample of Japanese mon-
Science, 4, 84–93 . keys. Japanese Psychological Research, 2 1 , 1 89–
Hauser, M. D., Santos, L. R., Spaepen, G. M., & 1 94.
Pearson, H. E. (2002). Problem solving, inhi- Kuhlmeier, V. A., Boysen, S. T., & Mukobi, K. L.
bition and domain-specific experience: Exper- (1 999). Scale-model comprehension by chim-
iments on cotton-top tamarins, Saguinus oedi- panzees (Pan troglodytes). Journal of Compara-
pus. Animal Behaviour, 64, 3 87–3 96. tive Psychology, 1 1 3 , 3 96–402.
Hauser, M. D., Williams, T., Kralik, J. D., & Kuhlmeier, V. A., & Boysen, S. T. (2001 ).
Moskovitz, D. (2001 ). What guides a search The effect of response contingencies on scale
for food that has disappeared? Experiments model task performance by chimpanzees (Pan
on cotton-top tamarins (Saguinus oedipus). troglodytes). Journal of Comparative Psychology,
Journal of Comparative Psychology, 1 1 5 , 1 40– 1 1 5 , 3 00–3 06.
151. Limongelli, L., Boysen, S. T., & Visalberghi, E.
Hood, B., Carey, S. & Prasada, S. (2000). Pre- (1 995 ). Comprehension of cause-effect rela-
dicting the outcomes of physical events: Two- tions in a tool-using task by chimpanzees (Pan
year-olds fail to reveal knowledge of solidity troglodytes). Journal of Comparative Psychology,
and support. Child Development, 71 , 1 5 40– 1 09, 1 8–26.
1 5 5 4. Lipsett, L. P., & Serunian, S. A. (1 963 ). Oddity-
Hood, B. M., Hauser, M. D., Anderson, L., & problem learning in young children. Child De-
Santos, L. (1 999). Gravity biases in a non- velopment, 3 4, 201 –206.
human primate? Developmental Science, 2 , 3 5 – MacDonald, S. E., Pang, J. C., & Gibeault, S.
41 . (1 994). Marmoset (Callithrix jacchus jacchus)
Iversen, I. H., & Matsuzawa, T. (2001 ). Ac- spatial memory in a foraging task: Win-stay ver-
quisition of navigation by chimpanzees (Pan sus win-shift strategies. Journal of Comparative
troglodytes) in an automated fingermaze task. Psychology, 1 08, 3 28–3 3 4.
Animal Cognition, 4, 1 79–1 92. MacDonald, S. E., & Wilkie, D. M. (1 990).
Jackson, W. J., & Pegram, G. V. (1 970a). Ac- Yellow-nosed monkeys’ (Cercopithecus asca-
quisition, transfer and retention of matching nius whitesidei) spatial memory in a simulated
by Rhesus monkeys. Psychological Reports, 2 7, foraging environment. Journal of Comparative
83 9–846. Psychology, 1 04, 3 82–3 87.
Jackson, W. J., & Pegram, G. V. (1 970b). Com- McGonigle, B. O., & Chalmers, M. (1 977). Are
parison of intra- vs. extradimensional transfer monkeys logical?. Nature, 2 67, 694–696.
of matching by Rhesus monkeys. Psychonomic Menzel, E. W. Jr. (1 973 ). Chimpanzee spa-
Science, 1 9, 1 62–1 63 . tial memory organization. Science, 1 82 , 943 –
Kawai, N., & Matsuzawa, T. (2000). Numerical 945 .
memory span in a chimpanzee. Nature, 403 , Muncer, S. J. (1 983 ). “Conservations” with a
3 9–40. chimpanzee. Developmental Psychobiology, 1 6,
King, J. E., & Fobes, J. L. (1 975 ). Hypothesis anal- 1 –1 1 .
ysis of sameness-difference learning-set by ca- Natale, F. (1 989). Causality II: The stick problem.
puchin monkeys. Learning and Motivation, 6, In F. Antinucci (Ed.), Cognitive Structure and
1 01 –1 1 3 . Development in Nonhuman Primates (pp. 1 21 –
King, J. E., & Fobes, J. L. (1 982). Complex learn- 1 3 3 ). Hilldsale, NJ: Erlbaum.
ing by primates. In J. L. Fobes, & J. E. King Natale, F., Antinucci, F., Spinozzi, G., & Potı̀,
(Eds.), Primate Behavior. (pp. 3 27–3 60). New P. (1 986). Stage 6 object concept in nonhu-
York: Academic Press. man primate cognition: A comparison between
gorilla (Gorilla gorilla gorilla) and Japanese difference discrimination problem. Journal of
macaque (Macaca fuscata). Journal of Compar- Comparative and Physiological Psychology, 5 3 ,
ative Psychology, 1 00, 3 3 5 –3 3 9. 3 68–3 70.
Nissen, H. W., Blum, J. S., & Blum, R. A. (1 948). Rumbaugh, D. M., & McCormack. (1 967). The
Analysis of matching behavior in chimpanzee. learning skills of primates: A comparative study
Journal of Comparative and Physiological Psy- of apes and monkeys. In D. Starck, R. Schhei-
chology, 41 , 62–74. der, & H. J. Ruhn (Eds.), Progress in Prima-
Oden, D. L., Thompson, R. K. R., & Premack, tology (pp. 289–3 06). Stuggart: Gustav Fischer
D. (1 988). Spontaneous transfer of matching Verlag.
by infant chimpanzees (Pan troglodytes). Jour- Rumbaugh, D. M. (1 970). Learning skills of an-
nal of Experimental Psychology: Animal Behavior thropoids. In L. A. Rosenblum (Ed.), Primate
Processes, 1 4, 1 40–1 45 . Behavior: Developments in Field and Labora-
Oden, D. L., Thompson, R. K. R., & Premack, D. tory Research (pp. 1 –70). New York: Academic
(1 990). Infant chimpanzees spontaneously per- Press.
ceive both concrete and abstract same/different Rumbaugh, D. M., Hopkins, W. D., Washburn,
relations. Child Development, 61 , 621 – D. A., & Savage-Rumbaugh, E. S. (1 989). Lana
63 1 . chimpanzee learns to count by “Numath”: A
Olthof, A., Iden, C. M., & Roberts, W. A. summary of videotaped experimental report.
(1 997). Judgments of ordinality and summa- Psychological Record, 3 9, 45 9–470.
tion of number symbols by squirrel monkeys Rumbaugh, D. M., Savage-Rumbaugh, E. S., &
(Saimiri sciureus). Journal of Experimental Psy- Hegel, M. T. (1 987). Summation in the chim-
chology Animal Behavior Processes, 2 3 , 3 25 – panzee (Pan troglodytes). Journal of Experimen-
3 3 9. tal Psychology: Animal Behavior Processes, 1 3 ,
Pepperberg, I. M. (1 999). The Alex studies. 1 07–1 1 5 .
Cognitive and communicative abilities of grey Rumbaugh, D. M., Savage-Rumbaugh, E. S., &
parrots. Cambridge, MA: Harvard University Pate, J. L. (1 988). Addendum to “Summation
Press. in the chimpanzee (Pan troglodytes)”. Journal of
Perusse, R., & Rumbaugh, D. M. (1 990). Summa- Experimental Psychology: Animal Behavior Pro-
tion in chimpanzees (Pan troglodytes): Effects cesses, 1 4, 1 1 8–1 20.
of amounts, number of wells, and finer ratios. Santos, L. R., & Hauser, M. D. (2002). A non-
International Journal of Primatology, 1 1 , 425 – human primate’s understanding of solidity:
43 7. Dissociations between seeing and acting. De-
Piaget, J. (1 95 2). The Origins of Intelligence in Chil- velopmental Science, 5 , 1 –7.
dren. New York: Norton. Schino, G., Spinozzi, G., & Berlinguer, L. (1 990).
Piaget, J., & Inhelder, B. (1 941 ). Le développement Object concept and mental representation in
des quantités physiques chez l’enfant. Neuchâtel, Cebus apella and Macaca fascicularis. Primates,
Switzerland: Delachaux; and Paris: Niestlé. 3 1 , 5 3 7–5 44.
Premack, D., & Premack, A. J. (1 994). Levels of Shettleworth, S. J. (1 998). Cognition, Evolution,
causal understanding in chimpanzees and chil- and Behavior. New York: Oxford University
dren. Cognition, 5 0, 3 47–3 62. Press.
Premack, D. (1 976). Intelligence in Ape and Man. Sigg, H. (1 986). Ranging patterns in hamadryas
Hillsdale, NJ: Earlbaum. baboons: Evidence for a mental map. In J. G.
Premack, D. (1 983 ). The codes of man and Else, & P. C. Lee (Eds.), Primate Ontogeny, Cog-
beasts. Behavioral and Brain Sciences, 6, 1 25 – nition and Social Behaviour (pp. 87–91 ). Cam-
1 67. bridge, UK: Cambridge University Press.
Roberts, W. A. (1 998). Principles of Animal Cog- Smith, H. J., King, J. E., Witt, E. D., & Rickel, J. E.
nition. Boston: McGraw-Hill. (1 975 ). Sameness–difference matching from
Robinson, J. S. (1 95 5 ). The sameness-difference sample by chimpanzees. Bulletin of the Psycho-
discrimination problem in chimpanzee. Jour- nomic Society, 6, 469–471 .
nal of Comparative and Physiological Psychology, Spelke, E. S., Phillips, A., & Woodward, A. L.
48, 1 95 –1 97. (1 995 ). Infants’ knowledge of object motion
Robinson, J. S. (1 960). The conceptual basis of the and human action. In D. Sperber, D. Premack
chimpanzee’s performance on the sameness- & A. J. Premack (Eds.). Causal Cognition.
reasoning and thinking in nonhuman primates 631
A Multidisciplinary Debate (pp. 44–78). New question is which ones and to what extent.
York: Oxford University Press. Trends in Cognitive Sciences, 7, 1 5 3 –1 5 6.
Spinozzi, G. & Potı̀, P. (1 989). Causality I: The Tomasello, M., Call, J. & Hare, B. (2003 b). Chim-
support problem. In F. Antinucci (Ed.), Cog- panzees versus humans: It’s not that simple.
nitive Structure and Development in Nonhu- Trends in Cognitive Sciences, 7, 23 9–240.
man Primates (pp. 1 1 3 –1 1 9). Hilldsale, NJ: Vauclair, J. (1 996). Animal Cognition. An In-
Erlbaum. troduction to Modern Comparative Cognition.
Spinozzi, G. & Potı̀, P. (1 993 ). Piagetian Stage 5 in Cambridge, MA: Harvard University Press.
two infant chimpanzees (Pan troglodytes): The Visalberghi, E. (1 993 ). Tool use in a South
development of permanence of objects and the American monkey species: An overview of the
spatialization of causality. International Journal characteristics and limits of tool use in Cebus
of Primatology, 1 4, 905 –91 7. apella. In A. Berthelet, & J. Chavaillon (Eds.),
Suda, C. & Call, J. (in press). Piagetian liquid con- The Use of Tools by Human and Non-Human Pri-
servation in the great apes. Journal of Compar- mates (pp. 1 1 8–1 3 1 ). New York: Oxford Uni-
ative Psychology. versity Press.
Sulkowski, G. M & Hauser, M. D. (2001 ). Can Visalberghi, E., Fragaszy, D. M., & Savage-
Rhesus monkeys spontaneously subtract? Cog- Rumbaugh, E. S. (1 995 ). Performance in
nition, 79, 23 9–262. a tool-using task by common chimpanzees
Swartz, K., Chen, S., & Terrace, H. (1 991 ). Serial (Pan troglodytes), bonobos (Pan paniscus), an
learning by rhesus monkeys I: Acquisition and orangutan (Pongo pygmaeus), and capuchin
retention of multiple four item lists. Journal of monkeys (Cebus apella). Journal of Compara-
Experimental Psychology: Animal Behavior Pro- tive Psychology, 1 09, 5 2–60.
cesses, 1 7, 3 96–41 0. Visalberghi, E., & Limongelli, L. (1 994). Lack
Thomas, R. K., & Boyd, M. G. (1 973 ). A com- of comprehension of cause-effect relations in
parison of Cebus albifrons and Saimiri sciureus tool-using capuchin monkeys (Cebus apella).
on oddity performance. Animal Learning and Journal of Comparative Psychology, 1 08, 1 5 –
Behavior, 1 , 1 5 1 –1 5 3 . 22.
Thomas, R. K., & Frost, T. (1 983 ). Oddity and Visalberghi, E., & Trinca, L. (1 989). Tool use
dimension-abstracted oddity (DAO) in squirrel in capuchin monkeys: Distinguishing between
monkeys. American Journal of Psychology, 96, performing and understanding. Primates, 3 0,
5 1 –64. 5 1 1 –5 21 .
Thomas, R. K., & Peay, L. (1 976). Length Vonk, J. (2003 ). Gorilla (Gorilla gorilla gorilla)
judgments by squirrel monkeys: Evidence for and orangutan (Pongo abelii) understanding of
conservation? Developmental Psychology, 1 2 , first- and second-order relations. Animal Cog-
3 49–3 5 2. nition, 6, 77–86.
Thompson, R. K. R., & Oden, D. L. (2000). Cat- Washburn, D. A. (1 992). Human factors with
egorical perception and conceptual judgments nonhumans: Factors that affect computer-task
by nonhuman primates: The paleological mon- performance. International Journal of Compar-
key and the analogical ape. Cognitive Science, ative Psychology, 5 , 1 91 –204.
2 4, 3 63 –3 96. Washburn, D. A., & Rumbaugh, D. M. (1 991 ).
Thompson, R. K. R., Oden, D. L., & Boysen, Ordinal judgements of numerical symbols by
S. T. (1 997). Language-naive chimpanzees (Pan macaques (Macaca mulatta). Psychological Sci-
troglodytes) judge relations between relations ence, 2 , 1 90–1 93 .
in a conceptual matching-to-sample task. Jour- Washburn, D. A., & Rumbaugh, D. M.
nal of Experimental Psychology: Animal Behavior (1 992). Comparative assessment of psychomo-
Processes, 2 3 , 3 1 –43 . tor performance: Target prediction by humans
Tomasello, M. (1 999). The cultural origins of hu- and macaques (Macaca mulatta). Journal of
man cognition. Cambridge, MA: Harvard Uni- Experimental Psychology: General, 1 2 1 , 3 05 –
versity Press. 3 1 2.
Tomasello, M., & Call, J. (1 997). Primate Cogni- Weinstein, B. (1 941 ). Matching-from-sample by
tion. New York: Oxford University Press. Rhesus monkeys and by children. Journal of
Tomasello, M., Call, J. & Hare, B. (2003 a). Chim- Comparative and Physiological Psychology, 3 1 ,
panzees understand psychological states – the 1 95 –21 3 .
Woodruff, G., Premack, D., & Kennel, K. probe delay effects. Journal of Experimental
(1 978). Conservation of liquid and solid quan- Psychology: Animal Behavior Processes, 1 0, 5 1 3 –
tity by the chimpanzee. Science, 2 02 , 991 – 5 29.
994. Wright, A. A., Shyan, M., & Jitsumori, M.
Wright, A. A., Santiago, H. C., & Sands, S. F. (1 990). Auditory same/different concept learn-
(1 984). Monkey memory: Same/different con- ing by monkeys. Animal Learning and Behavior,
cept learning, serial probe acquisition, and 1 8, 287–294.
CHAPTER 2 6
Language and Thought
Lila Gleitman
Anna Papafragou
Possessing a language is one of the cen- some prior state of one’s nature. But do such
tral features that distinguishes humans from states of mind arise because one is literally
other species. Many people share the intu- thinking in some new representational for-
ition that they think in language and the ab- mat by speaking in a different language? Af-
sence of language therefore would be the ter all, many people experience the same or
absence of thought. One compelling ver- related changes in sociocultural orientation
sion of this self-reflection is Helen Keller’s and sense of self when they are, say, wear-
(1 95 5 ) report that her recognition of the ing their battered old jeans versus some re-
signed symbol for ‘water’ triggered thought quired business suit or military uniform; or
processes that had theretofore – and conse- even more poignantly when they reexperi-
quently – been utterly absent. Statements to ence a smell or color or sound associated
the same or related effect come from the with dimly recalled events. Many such ex-
most diverse intellectual sources: “The limits periences evoke other times, other places.
of my language are the limits of my world” But according to many anthropological
(Wittgenstein, 1 922); and “The fact of the linguists, sociologists, and cognitive psychol-
matter is that the ‘real world’ is to a large ex- ogists, speaking a particular language ex-
tent unconsciously built upon the language erts vastly stronger and more pervasive in-
habits of the group” (Sapir, 1 941 , as cited in fluences than an old shoe or the smell of
Whorf, 1 95 6, p. 75 ). boiling cabbage. The idea of “linguistic rel-
The same intuition arises with regard to ativity” is that having language, or having a
particular languages and dialects. Speaking particular language, crucially shapes mental
the language of one’s childhood seems to life. Indeed, it may not be only that a spe-
conjure up a host of social and cultural at- cific language exerts its idiosyncratic effects
titudes, beliefs, memories, and emotions, as as we speak or listen to it – that language
though returning to the Casbah or to Av- might come to “be” our thought; we may
enue L and East 1 9 th Street and conversing have no way to think many thoughts, con-
with the natives opens a window back into ceptualize many of our ideas, without this
633
language, or outside of and independent of “program and guide for an individual’s men-
this language. From such a perspective, dif- tal activity” (Ref. 1 43 , p. 21 2), including cate-
ferent communities of humans, speaking dif- gorization, memory, reasoning, and decision
ferent languages, would think differently to making. If this is right, then the study of
the extent that languages differ from one an- different linguistic systems may throw light
other. But is this so? Could it be so? That onto the diverse modes of thinking encour-
depends on how we unpack the notions al- aged or imposed by such systems. Here is
luded to so informally thus far. a recent formulation of this view (Pederson
In one sense, it is obvious that language et al., 1 998, p. 5 86):
use has powerful and specific effects on
We surmise that language structure . . . pro-
thought. That’s what it is for, or at least that
vides the individual with a system of rep-
is one of the things it is for – to transfer resentation, some isomorphic version of
ideas from one mind to another mind. Imag- which becomes highly available for incor-
ine Eve telling Adam “Apples taste great.” poration as a default conceptual represen-
This fragment of linguistic information, as tation. Far more than developing simple ha-
we know, caused Adam to entertain a new bituation, use of the linguistic system, we
thought with profound effects on his world suggest, actually forces the speaker to make
knowledge, inferencing, and subsequent be- computations he or she might otherwise
havior. Much of human communication is not make.
an intentional attempt to modify others’ Even more dramatically, according to
thoughts and attitudes in just this way. This stronger versions of this general position, we
information transmission function is crucial can newly understand much about the de-
for the structure and survival of cultures and velopment of concepts in the child mind:
societies in all their known forms. One acquires concepts as a consequence of
But the language-and-thought debate is their being systematically instantiated in the
not framed to query whether the content exposure language (Bowerman & Levinson,
of conversation can influence one’s attitudes 2001 , p. 1 3 ):
and beliefs, for the answer to that question
is too obvious for words. At issue, rather, is Instead of language merely reflecting the
the degree to which natural languages pro- cognitive development which permits and
vide the format in which thought is neces- constrains its acquisition, language is
thought of as potentially catalytic and
sarily (or at least habitually) couched. Do
transformative of cognition.
formal aspects of a particular linguistic sys-
tem (e.g., features of the grammar or the The importance of this position cannot
lexicon) organize the thought processes of be underestimated: Language here becomes
its users? One famous “Aye” to this question a vehicle for the growth of new concepts –
appears in the writings of B. L. Whorf in the those that were not theretofore in the mind,
first half of the twentieth century. Accord- and perhaps could not have been there with-
ing to Whorf (1 95 6, p. 21 4), the grammatical out the intercession of linguistic experience.
and lexical resources of individual languages It therefore poses a challenge to the vener-
heavily constrain the conceptual representa- able view that one could not acquire a con-
tions available to their speakers. To quote: cept that one could not antecedently enter-
tain (Plato, 5 th–4th b.c.e.; Descartes, 1 662;
We are thus introduced to a new principle Fodor, 1 975 , inter alia].
of relativity, which holds that all observers Quite a different position is that language,
are not led by the same physical evidence
although being the central human conduit
to the same picture of the universe, unless
their linguistic backgrounds are similar, or for thought in communication, memory, and
can in some way be calibrated. planning, neither creates nor materially dis-
torts conceptual life: Thought is first; lan-
This relativistic view, in its strictest form, guage is its expression. This contrasting view
entails that linguistic categories will be the of cause and effect leaves the link between
language and thought 635
language and mind as strong as ever and quite novel and, in its strongest interpreta-
just as relevant for understanding mental tions, revolutionary. At the limit, it is a pro-
life. From Noam Chomsky’s universalist per- posal for how new thoughts can arise in the
spective (1 975 , p. 4), for example, the forms mind as a result of experience with language
and contents of all particular languages de- rather than as a result of experience with the
rive, in large part, from an antecedently spec- world of objects and events.
ified cognitive substance and architecture Before turning to the recent literature
and therefore provide a rich diagnostic of on language and thought, we want to em-
human conceptual commonalities: phasize that there are no ideologues ready
to man the barricades at the absolute ex-
Language is a mirror of mind in a deep and
tremes of the debate just sketched. To our
significant sense. It is a product of human
knowledge, none – well, very few – of those
intelligence . . . By studying the properties of
natural languages, their structure, organi- who are currently advancing linguistic–
zation, and use, we may hope to learn some- relativistic themes and explanations believe
thing about human nature; something sig- that infants enter into language acquisition
nificant, if it is true that human cognitive in a state of complete conceptual naked-
capacity is the truly distinctive and most ness later redressed (perhaps we should say
remarkable characteristic of the species. “dressed”) by linguistic information. Rather,
by general acclaim, infants are believed to
This view of concepts as prior to and pro-
possess some “core knowledge” that enters
genitive of language is not proprietary to
into first categorization of objects, proper-
the rationalist position for which Chomsky
ties, and events in the world (e.g., Carey,
is speaking here. This commonsensical po-
1 982; Kellman, 1 996; Baillargeon, 1 993 ;
sition is maintained – rather, presupposed –
Gelman & Spelke, 1 981 ; Leslie & Keeble,
by students of the mind who differ among
1 987; Mandler, 1 996; Quinn, 2001 ; Spelke
themselves in almost all other regards. The
et al., 1 992). The general question is how
early empiricists, for example, took it for
richly specified this innate basis may be
granted that our concepts derive from expe-
and how experience refines, enhances, and
rience with properties, things, and events in
transforms the mind’s original furnishings.
the world and not, originally, from language
The specific question is whether language
(Hume, 1 73 9; Book I):
knowledge may be one of these formative or
To give a child an idea of scarlet or or- transformative aspects of experience. To our
ange, of sweet or bitter, I present the ob- knowledge, none – well, very few – of those
jects, or in other words, convey to him these who adopt a nativist position on these mat-
impressions; but proceed not so absurdly, ters reject as a matter of a priori conviction
as to endeavor to produce the impressions the possibility that there could be salience
by exciting the ideas. effects of language on thought. For instance,
And as a part of such experience of some particular natural language might for-
objects, language learning will come along mally mark a category whereas another does
for the ride (Locke, 1 690, Book 3 .IX.9; not; two languages might draw a category
emphasis ours): boundary at different places; two languages
might differ in the computational resources
If we will observe how children learn lan- they require to make manifest a particular
guages, we shall find that, to make them distinction or category.
understand what the names of simple ideas We will try to draw out aspects of these
or substances for, people ordinarily show issues within several domains in which com-
them the thing whereof they would have
mentators and investigators are trying to dis-
them have the idea; and then repeat to
them the name that stands for it . . .
entangle cause and effect in the interaction
of language and thought. We cannot dis-
Thus linguistic relativity, in the sense of cuss it all, of course, or even very much
Whorf and many recent commentators, is of what is currently in print on this topic.
There is too much of it (for recent antholo- ical theorizing. Consider the sentence It is
gies, see Gumperz & Levinson, 1 996; Bower- raining. Does this sentence directly – that is,
man & Levinson, 2001 ; Gentner & Goldin- as an interpretive consequence of the linguis-
Meadow, 2003 ). tic representation itself – convey an assertion
about rain falling here, in the immediate ge-
ographical environment of the speaker? Or
does the sentence – the linguistic represen-
Do We Think In Language? tation – convey only that rain is falling, leav-
ing it for the common sense of the listener to
We begin with a very simple question: Do deduce that the speaker likely meant raining
our thoughts take place in natural language? here and now rather than raining today in
If so, it would immediately follow that Bombay or on Mars; likely, too, that if the
Whorf was right all along, since speak- sentence was uttered indoors, the speaker
ers of Korean and Spanish, or Swahili and more likely meant here to convey “just out-
Hopi would have to think systematically dif- side of here” than “right here, as the roof
ferent thoughts. is leaking.” The exact division of labor be-
If language directly expresses our tween linguistic semantics and pragmatics
thought, it seems to make a poor job of it. has implications for the language–thought
Consider for example the final (nonparen- issue, because the richer (one claims that)
thetical) sentence in the preceding section: the linguistic semantics is, the more likely it
1 . There is too much of it. is that language guides our mental life. With-
out going into detail, we will argue that lin-
Leaving aside, for now, the problems guistic semantics cannot fully envelop and
of anaphoric reference (what is “it”?), the substitute for inferential interpretation, and
sentence still has at least two interpre- the representations that populate our mental
tations that are compatible with its dis- life therefore cannot be identical to the rep-
course context: resentations that encode linguistic (seman-
1 a. There is too much written on linguistic tic) meaning.
relativity to fit into this article.
1 b. There is too much written on linguistic
relativity. (Period!) Language Is Sketchy, Thought Is Rich
We authors had one of these two inter-
pretations in mind (guess which one). We There are several reasons to believe that
had a thought and expressed it as (1 ) but thought processes are not definable over rep-
English failed to render that thought unam- resentations that are isomorphic to linguis-
biguously, leaving doubt between (1 a) and tic representations. One is the pervasive am-
(1 b). One way to think about what this ex- biguity of words and sentences. Bat, bank,
ample portends is that language cannot, or in and bug all have multiple meanings in En-
practice does not, express all and only what glish and are associated with multiple con-
we mean. Rather, language use offers hints cepts, but these concepts themselves are
and guideposts to hearers, such that they can clearly distinct in thought, as shown inter
usually reconstruct what the speaker had in alia by the fact that one may consciously
mind by applying to the uttered words a construct a pun. Moreover, several linguis-
good dose of common sense – aka thoughts, tic expressions including pronouns (he, she)
inferences, and plausibilities – in the world. and indexicals (here, now) crucially rely on
The question of just how to apportion the context for their interpretation whereas the
territory between the underlying semantics thoughts they are used to express are usu-
of sentences and the pragmatic interpreta- ally more specific. Our words are often se-
tion of the sentential semantics, of course, is mantically general – i.e., they fail to make
far from settled in linguistic and philosoph- distinctions that nevertheless are present in
thought: Uncle in English does not semanti- In limiting cases, competent listeners ig-
cally specify whether the individual comes nore linguistically encoded meaning if it
from the mother’s or the father’s side, or patently differs from what the speaker in-
whether he is a relative by blood or mar- tended – for instance, by smoothly and
riage, but usually the speaker who utters rapidly repairing slips of the tongue. Oxford
“my uncle . . . ” possesses the relevant infor- undergraduates had the wit, if not the grace,
mation. Indeed, lexical items typically take to snicker when Reverend Spooner reput-
on different interpretations tuned to the oc- edly said, “Work is the curse of the drinking
casion of use (He has a square face. The room classes.” Often, the misspeaking is not even
is hot.) and depend on inference for their consciously noticed but is repaired to fit the
precise construal in different contexts (e.g., thought – evidence enough that the word
the implied action is systematically differ- and the thought are two different matters.1
ent when we open an envelope/a can/an um- The same latitude for thought to range be-
brella/a book, or when an instance of that yond established linguistic means holds for
class of actions is performed to serve dif- the speakers, too. Wherever the local lin-
ferent purposes: Open the window to let in guistic devices and locutions seem insuffi-
the evening breeze/the cat). Moreover, there cient or overly constraining, speakers invent
are cases in which linguistic output does or borrow words from another language, de-
not even encode a complete thought or vise similes and metaphors, and sometimes
proposition (tomorrow, maybe). Finally, the make permanent additions and subtractions
presence of implicatures and other kinds to the received tongue. It would be hard to
of pragmatic inference ensures that – to understand how they do so if language were
steal a line from the Mad Hatter – although itself, and all at once, both the format and
speakers generally mean what they say, they vehicle of thought.
do not and could not say exactly what All the cases just mentioned refer to par-
they mean. ticular tokenings of meanings in the id-
From this and related evidence, it ap- iosyncratic interactions between people. A
pears that linguistic representations under- different problem arises when languages
determine the conceptual contents they are categorize aspects of the world in ways that
used to convey: Language is sketchy com- are complex and inconsistent. An example is
pared with the richness of our thoughts (for reported by Malt et al. (1 999). They exam-
a related discussion, see Fisher & Gleitman, ined the vocabulary used by English, Span-
2002). In light of the limitations of lan- ish, and Chinese subjects to label the various
guage, time, and sheer patience, language containers we bring home from the grocery
users make reference by whatever catch- store full of milk, juice, ice cream, bleach, or
as-catch-can methods they find handy, in- medicine (e.g., jugs, bottles, cartons, boxes).
cluding the waitress who famously told an- As the authors point out, containers share
other that “The ham sandwich wants his names based not only on some perceptual re-
check” (Nunberg, 1 978). What chiefly mat- semblances but also on very local and partic-
ters to talkers and listeners is that successful ular conditions with size, shape, substance,
reference be made, whatever the means at contents, and nature of the contents, not
hand. If one tried to say all and exactly what to speak of the commercial interests of the
one meant, conversation could not happen; purveyor, all playing interacting and shift-
speakers would be lost in thought. Instead, ing roles. In present-day American English,
conversation involves a constant negotiation for instance, a certain plastic container that
in which participants estimate and update looks like a bear with a straw stuck in its
each others’ background knowledge as a ba- head is called a juice box, although it is not
sis for what needs to be said given what is boxy either in shape (square or rectangu-
mutually known and inferable (e.g., Grice, lar) or typical constitution (your prototypi-
1 975 ; Sperber & Wilson, 1 986; Clark, 1 992; cal American box is made of cardboard). The
Bloom, 2002). languages Malt et al. studied differ markedly
in the set of terms available for this domain, et al., 1 967; Kuhl et al., 1 992). Children
and also in how their subjects extended these begin life with the capacity and inclination
terms to describe diverse new containers. to discriminate among all of the acoustic–
Speakers of the three languages differed in phonetic properties by which languages en-
which objects (old and new) they classified code distinctions of meaning – a result fa-
together by name. For example, a set of ob- mously documented by Peter Eimas (Eimas
jects distributed across the sets of jugs, con- et al., 1 971 ) using a dishabituation paradigm
tainers, and jars by English speakers were (for details and significant expansions of this
unified by the single label frasco by Spanish basic result, see Jusczyk, 1 985 ; and for ex-
speakers. Within and across languages, not tensions with neonates, Peña et al., 2003 ).
everything square is a box, not everything These authors showed that an infant will
glass is a bottle, not everything not glass is not work (e.g., turn its head or suck on a nip-
a bottle, and so on. The naming, in short, is ple) to hear a syllable such as ba. After some
a complex mix resulting from perceptual re- period of time, the infant habituates; that
semblances, historical influences, and a gen- is, its sucking rate decreases to some base
erous dollop of arbitrariness. Yet Malt et al.’s level. The high sucking rate can be rein-
subjects did not differ much (if at all) from stated if the syllable is switched to, say, pa,
each other in their classification of these con- demonstrating that the infant detects the dif-
tainers by overall similarity rather than by ference. These effects are heavily influenced
name. Nor were the English and Spanish, as by linguistic experience. Infants only a year
one might guess, more closely aligned than, or so of age – just when true language is
say, the Chinese and Spanish. So here we making its appearance – have become in-
have a case in which cross-linguistic practice sensitive to phonetic distinctions that are
groups objects in a domain in multiple ways not phonemic (play no role at higher lev-
that have only flimsy and sporadic correla- els of linguistic organization) in the expo-
tions with perception without discernible ef- sure language (Werker & Tees, 1 984). Al-
fect on the nonlinguistic classificatory behav- though these experience-driven effects are
iors of users.2 not totally irreversible in cases of long-term
So far, we have emphasized that language second-language immersion, they are perva-
is a relatively impoverished and underspeci- sive and dramatic (for discussion, see Werker
fied vehicle of expression that relies heavily & Logan, 1 985 ; Best, McRoberts, & Sithole,
on inferential processes outside the linguistic 1 988). Without special training or unusual
system for reconstructing the richness and talent, the adult speaker–listener can effec-
specificity of thought. If correct, this seems tively produce and discriminate the phonetic
to place rather stringent limitations on how categories required in the native tongue,
language could serve as the original engine and little more. Not only that, these dis-
and sculptor of our conceptual life. Never- criminations are categorical in the sense
theless, it is possible to maintain the idea that sensitivity to within-category phonetic
that certain formal properties of language distinctions is poor and sensitivity at the
causally affect thought in more subtle, but phonemic boundaries is especially acute. Al-
still important, ways. though the learning and use of a specific lan-
guage has not created perceptual elements
de novo, certainly it has refined, organized,
and limited the set of categories at this level
Use It or Lose It: Language in radical ways. As we will discuss, sev-
Determines the Categories of Thought eral findings in the concept-learning litera-
ture have been interpreted analogously to
We begin by mentioning the most famous this case.
and compelling case of a linguistic influ- An even more intriguing effect in this gen-
ence on perception: categorical perception eral domain is the reorganization of phonetic
of the phoneme (Liberman, 1 970; Liberman elements into higher-level phonological
categories as a function of specific lan- terms for hue and brightness (Berlin & Kay,
guage spoken. For example, American En- 1 969; cf. Kay & Regier, 2002). Do psy-
glish speech regularly lengthens vowels in chophysical judgments differ accordingly?
syllables ending with a voiced consonant For instance, are adjacent hues that share a
(e.g., ride and write) and neutralizes the t/d name in a particular language judged more
distinction in favor of a dental flap in cer- similar by its speakers than equal-magnitude
tain unstressed syllables. The effect is that differences in wavelength and intensity that
(in most dialects) the consonant sounds in are consensually given different names in
the middle of rider and writer are physically that language? And are the similarity spaces
the same. Yet the English-speaking listener of speakers of other languages different in
seems to perceive a d/t difference in these the requisite ways? Such language-caused
words all the same, and – except when asked distinctions have been measured in various
to reflect carefully – fails to notice the char- ways – for example, discrimination across
acteristic difference in vowel length that his hue labeling boundaries (speed, accuracy,
or her own speech faithfully reflects. The confusability), memory, and population
complexity of this phonological reorganiza- comparisons. By and large, the results of such
tion is often understood as a reconciliation cross-linguistic studies suggest a remarkable
(interface) of the cross-cutting phonetic and independence of hue perception from label-
morphological categories of a particular lan- ing practice (e.g., Brown & Lenneberg, 1 95 4;
guage. Ride ends with a d sound; write ends Heider & Oliver, 1 972). One relevant finding
with a t sound; morphologically speaking, comes from red–green color-blind individ-
rider and writer are just ride and write with uals (Jameson & Hurwich, 1 978). The per-
er added on; therefore, the phonetic entity ceptual similarity space of the hues for such
between the syllables in these two words individuals is systematically different from
must be d in the first case and t in the sec- that of individuals of normal vision; that is
ond. Morphology trumps phonetics (for dis- what it means to be colorblind. Yet a large
cussion see Bloch & Trager, 1 942; Chomsky, subpopulation of red–green colorblind in-
1 964; Gleitman & Rozin, 1 977). dividuals names hues, even of new things,
When considering linguistic relativity, consensually with normal-sighted individu-
one might be tempted to write off the pho- als and orders these hue labels consensually.
netic categorical perception effect as one That is, these individuals do not perceptually
that merely tweaks the boundaries of acous- order a set of color chips with the reds at one
tic distinctions built into the mammalian end, the greens at the other, and the oranges
species – a not-so-startling sensitizing effect somewhere in between; yet they organize
of language on perception. But the phono- the words with red semantically at one end,
logical effect just discussed is no mere tweak. green at the other, and orange somewhere
There has been a systemic reorganization in between. In short, the naming practices
creating a new set of lawfully recombinato- and perceptual organization of color mis-
rial elements – one that varies very signifi- match in these individuals, which is a fact
cantly cross-linguistically. that they rarely notice until they enter the
Much of the literature on linguistic rel- vision laboratory.
ativity can be understood as raising related Overall, the language–thought relations
issues in various perceptual and conceptual for one perceptual domain (speech-sound
domains. Is it the case that distinctions of perception) appear to be quite differ-
lexicon or grammar made regularly in one’s ent from those in another perceptual do-
language sensitize one to these distinctions main (hue perception). Language influences
and suppress or muffle others? Even to the acoustic phonetic perception much more
extent of radically reorganizing the domain? than it influences hue perception. As a re-
An important literature has investigated this sult, there is no deciding in advance that lan-
issue using the instance of color names and guage does or does not influence perceptual
color perception. Languages differ in their life. Moreover, despite the prima facie
relevance of these cases and the elegance tions natural (Pinker, 1 984; Gleitman, 1 990;
of the literature that investigated them, the Fisher, 1 996; Bloom, 1 994a; Lidz, Gleitman,
perception of relatively low-level percep- & Gleitman, 2003 ; Baker, 2001 , inter alia).
tual categories, the organization of which Brown saw his result the other way around.
we share with many nonhuman species, are He supposed that languages would vary ar-
less than ideal places to look for the lin- bitrarily in these mappings onto conceptual
guistic malleability of thought.3 However, categories. If that is so, then language can-
these instances serve to scaffold discussion not play the causal role that Pinker and oth-
of language influences at higher levels and ers envisaged for it – that is, as a cue to an-
therefore for more elusive aspects of concep- tecedently “prepared” correlations between
tual organization. linguistic and conceptual categories. Rather,
those world properties yoked together by
language would cause a (previously uncom-
mitted) infant learner to conceive them as
Do the Categories of Language meaningfully related in some ways (Brown,
Become the Categories of Thought? 1 95 7, p. 5 ):
A seminal figure in reawakening interest in In learning a language, therefore, it must be

linguistic relativity was Roger Brown, the useful to discover the semantic correlates
great social and developmental psychologist for the various parts of speech; for this dis-
who framed much of the field of language ac- covery enables the learner to use the part-
quisition in the modern era. Brown (1 95 7) of-speech membership of a new word as a
performed a simple and elegant experiment first cue to its meaning . . . Since [grammat-
that demonstrated an effect of lexical cate- ical categories] are strikingly different in
gorization on the inferred meaning of a new unrelated languages, the speakers [of these
word. Young children were shown a picture, languages] may have quite different cogn-
for example, of hands that seemed to be itive categories.
kneading confettilike stuff in an overflow-
ing bowl. Some children were told Show As recent commentators have put this po-
me the sib. They pointed to the bowl (a sition, linguistic regularities are part of the
solid rigid object). Others were told Show correlational mix that creates ontologies, and
me some sib. They pointed to the confetti language-specific properties therefore will
(an undifferentiated mass of stuff ). Others bend psychological ontologies in language-
were told Show me sibbing. They pointed specific ways (Smith, Colunga, & Yoshida,
to the hands and made kneading motions 2001 ). The forms of particular languages – or
with their own hands (an action or event). the habitual language usage of particular lin-
Plainly, the same stimulus object was repre- guistic communities – by hypothesis, could
sented differently depending on the linguis- yield different organizations of the funda-
tic cues to the lexical categories count noun, mental nature of one’s conceptual world:
mass noun, and verb. That is, the lexical cate- what it is to be a thing or some stuff, or a
gories themselves have notional correlates – direction or place, or a state or event. We
at least in the minds of these young Eng- will discuss some research on these category
lish speakers. types and their cross-linguistic investigation.
Some commentators have argued that the But before doing so, we want to mention
kinds of cues exemplified here – that per- another useful framework for understand-
sons, places, and things surface as nouns – ing potential relations between language and
are universal and can play causal roles in thought: that the tweakings and reorganiza-
the acquisition of language – of course, by tions language may accomplish happen un-
learners who are predisposed to find just der the dynamic control of communicative
these kinds of syntactic–semantic correla- interaction, of “thinking for speaking.”
Thinking for Speaking forth. These words play rather specific gram-
matical roles in marking the ways in which
It is natural to conceive conversation as be- noun phrases relate to the verb and how
ginning with a thought or mental message the predications within a sentence relate to
one wishes to convey. This thought is the each other. These same grammatical words
first link in a chain of mental events that, usually also have semantic content – for ex-
on most accounts, gets translated into suc- ample, the directional properties of from in
cessively more languagelike representations, John separated the wheat from the chaff. Slobin
eventuating in a series of commands to the has given a compendium of the semantic
articulatory system to utter a word, phrase, functions known to be expressed by such
or sentence (Levelt, 1 989; Dell, 1 995 ). As items and these number at least in the several
we have just described matters, there is a hundreds, including not only tense, aspect,
clear distinction at the two ends of this pro- causativity, number, person, gender, mood,
cess – what you meant to say and how you definiteness, and so on, found in English, but
express it linguistically. But this is not so also first-hand versus inferred knowledge,
clear. Several commentators, notably Dan social status of the addressee, existence–
Slobin (1 996, 2003 ), have raised the possi- nonexistence, shape, and many others. Both
bility of a more dynamic and interactive pro- Slobin and Levelt have argued as follows: As
cess in which what one chooses to mean and a condition of uttering a well-formed English
the expressive options that one’s language sentence, the speaker of English must decide
makes available are not so neatly divorced. for example, whether the number of crea-
It may not be that speakers of every language tures being referred to is one or more in order
set out their messages identically all the way to choose the dog or the dogs. Some mod-
up to the time that they arrange the jaw, icum of mental resources, no matter how
mouth, and tongue to utter one two three ver- small, must be devoted to this issue repeat-
sus un deux trois. Instead, the language one edly – hundreds of times a day every day,
has learned causes one to “intend to mean” in every week, every year – by English speak-
somewhat different ways. For instance, and ers. But speakers of Mandarin need not think
as we will discuss in more detail, it may be about number, except when they particu-
that as a speaker of English, with its myriad larly want to, because its expression is not
verbs of manner of motion, one comes to in- grammaticized in their language. The same
spect – and speak of – the world in terms of is true for all the hundreds of other prop-
such manners, whereas a speaker of Greek erties. So either all speakers of languages
or Spanish, with a vocabulary emphasizing covertly compute all these several hundred
verbs relating to path of motion, inspects – properties as part of their representations of
and speaks of – the world more directly in the contents of their sent and received mes-
terms of the paths traversed. The organi- sages or they compute only some of them –
zation of the thought, on this view, might primarily those that they must compute
be dynamically impacted along its course by to speak and understand the language of
specific organizational properties of the in- their community. On information-handling
dividual language. grounds, one would suspect that not all
Slobin (2001 ) and Levelt (1 989) have these hundreds of conceptual interpreta-
pointed to some cases in which a distinctions and their possible combinations are
tion across languages in the resources de- computed at every instance. But if one com-
voted to different conceptual matters seems putes only what one must for the com-
almost inevitable. This case is the closed- bined purposes of linguistic intelligibility
class functional vocabulary, the “grammat- and present communicative purpose, then
ical” words such as modals, auxiliaries, tense speakers of different languages, to this ex-
and aspect markers, determiners, comple- tent, must be thinking differently. As Slobin
mentizers, case markers, prepositions, and so (2001 , p. 442) puts it, “From this point of
view, grammaticizable notions have a role in these distinctions in their speech. Soja et al.
structuring language-specific mental spaces, (1 991 ) taught these children words in refer-
rather than being there at the beginning, ence to various types of unfamiliar displays.
waiting for an input language to turn them Some were solid objects such as a T-shaped
on.” On the basis of this reasoning, it is plau- piece of wood, and others were nonsolid sub-
sible to entertain the view of a language- stances such as a pile of hand cream with
based difference in the dynamics of convert- sparkles in it. The children were shown such
ing thought to speech. How far such effects a sample, named with a term presented in
percolate downstream is the issue to which a syntactically neutral frame that identified
we now turn. Do differences in phraseology, it neither as a count nor as a mass noun –
grammatical morphology, and lexical seman- for example, This is my blicket or Do you see
tics of different languages yield underlying this blicket? In extending these words to new
disparities in their modes of thought? displays, two-year-olds honored the distinc-
tion between object and substance. When
the sample was a hard-edged solid object,
they extended the new word to all objects
Semantic Arenas of the Present Day
of the same shape, even when made of a dif-
Language–Thought Investigation
ferent material. When the sample was a non-
solid substance, they extended the word to
Objects and Substances
other-shaped puddles of that same substance
The problem of reference to stuff versus ob- but not to shape matches made of different
jects has attracted considerable attention be- materials. Soja et al. took this finding as ev-
cause it starkly displays the indeterminacy in idence of a conceptual distinction between
how language refers to the world (Chomsky, objects and stuff, independent of and prior
1 95 7; Quine, 1 960). Whenever we indicate a to the morphosyntactic distinction made in
physical object, we necessarily indicate some English.
portion of a substance as well; the reverse This interpretation was put to stronger
is also true. Languages differ in their ex- tests by extending such classificatory tasks
pression of this distinction (Lucy & Gaskins, to languages that differ from English in
2001 ). Some languages make a grammatical these regards: Either these languages do not
distinction that roughly distinguishes object grammaticize the distinction, or they orga-
from substance. Count nouns in such lan- nize it in different ways (see Lucy, 1 992;
guages denote individuated entities; such as, Lucy & Gaskins, 2001 , for findings from
object kinds. These are marked in English Yucatec Mayan; Mazuka & Friedman, 2000;
with determiners and are subject to counting Imai & Gentner, 1 997, for Japanese). Es-
and pluralization (a horse, horses, two horses). sentially, nouns in these languages all start
Mass nouns typically denote nonindividu- life as mass terms, requiring a special gram-
ated entities – that is, substance rather than matical marker (called a classifier) to be
object kinds. These are marked in English counted. One might claim, then, that sub-
with a different set of determiners (more stance is in some sense linguistically basic for
porridge) and need an additional term that Japanese whereas objecthood is basic for En-
specifies quantity to be counted and plu- glish speakers because of the dominance of
ralized (a tube of toothpaste rather than a its count-noun morphology.4 So if children
toothpaste). Soja, Carey, and Spelke (1 991 ) are led to differentiate object and substance
asked whether children approach this aspect reference by the language forms themselves,
of language learning already equipped with the resulting abstract semantic distinction
the ontological distinction between things should differ cross-linguistically. To test this
and substance or whether they are led to notion, Imai and Gentner replicated the
make this distinction through learning count tests of Soja et al. with Japanese and En-
and mass syntax. Their subjects, English- glish children and adults. Some of their find-
speaking two-year-olds, did not yet make ings appear to strengthen the evidence for a
universal prelinguistic ontology that permits As usual, neither the findings nor the in-
us to think about both individual objects and terpretations of such experiments are easy
portions of stuff because both American and to attain at the present state of the art. For
Japanese children (even two-year-olds) ex- one, thing, Mazuka and Friedman (2000)
tended names for complex hard-edged non- failed to reproduce Lucy’s effects for Mayan
sense objects on the basis of shape rather versus English-speaking subjects’ classifica-
than substance. The lack of separate gram- tory performance in the predicted further
matical marking did not put Japanese chil- case of Japanese. As these authors point
dren at a disadvantage in this regard. out, the sameness in this regard between
Another aspect of the results hints at a Japanese and English speakers, and the dif-
role for language in categorization, however. ference in this regard between Mayan and
Japanese children tended to extend names English speakers, may best be thought of as
for mushy hand cream displays according to arising from cultural and educational differ-
their substance, for example, whereas Amer- ences between the populations rather than
ican children were at chance for these items. linguistic differences.
There were also discernible language effects In light of all the findings so far reviewed,
on word extension for certain very simple there is another interpretation of the results
stimuli (e.g., a kidney bean–shaped piece of that does not implicate an effect of language
colored wax) that seemed to fall at the on- on thought but only an effect of language
tological midline between object and sub- on language: One’s implicit understanding
stance. Whereas the Japanese at ages two and of the organization of a specific language
four years were at chance on these items, En- can influence one’s interpretation of con-
glish speakers showed a tendency to extend versation. Interpretations from this perspec-
words for them by shape. tive have been offered by many commen-
How are we to interpret these results? tators. Bowerman (1 996), Brown (1 95 8),
Several authors have concluded that onto- Landau and Gleitman (1 985 ), and Slobin
logical boundaries literally shift to where (1 996, 2001 ) propose that native speakers
language makes its cuts; that the substance not only learn and use the individual lexical
versus object distinction works much like items their language offers but also learn the
the categorical perception effects we no- kinds of meanings typically expressed by a
ticed for phonemes (and perhaps colors; particular grammatical category in their lan-
for an important statement, see Gentner & guage and come to expect new members
Boroditsky, 2001 ). Lucy and Gaskins (2001 ) of that category to have similar meanings.
bolster this interpretation with evidence that Slobin calls this “typological bootstrapping.”
populations speaking different languages dif- Languages differ strikingly in their common
fer increasingly in this regard with age. forms and locutions – preferred fashions of
Whereas young Mayan speakers do not differ speaking, to use Whorf’s phrase. These prob-
much from their English-speaking peers, by abilistic patterns could bias the interpreta-
age nine years members of the two commu- tion of new words. Such effects occur in
nities differ significantly in relevant classifi- experiments when subjects are offered lan-
catory and memorial tasks. The implication guage input (usually nonsense words) un-
is that long-term use of a language influ- der conditions in which implicitly known
ences ontology with growing conformance form-to-meaning patterns in the language
of concept grouping to linguistic group- might hint at how the new word is to be
ing. Of course, the claim is not for a ram- interpreted.
pant Procrustean reorganization of thought; Let us reconsider the Imai and Gentner
only for boundary shifting. For displays object–substance effects on this hypothe-
that blatantly fall to one side or the other sis. As we saw, when the displays them-
of the object/substance boundary, therefore, selves were of nonaccidental-looking hard-
the speakers of all the tested languages sort edged objects, subjects in both language
the displays in the same ways. groups opted for the object interpretation.
But when the world was uninformative (e.g., cation or geometric information that is more
for softish waxy lima bean shapes), the lis- typically specified by a spatial preposition in
teners fell back upon linguistic cues, if avail- English. To describe a scene in which a cas-
able. No relevant morphosyntactic clues ex- sette tape is placed into its case, for exam-
ist in Japanese, so Japanese subjects chose ple, English speakers would say “We put the
at random for these indeterminate stimuli. tape in the case.” Korean speakers typically
For English-speaking subjects, the linguis- use the verb kkita to express the put in rela-
tic stimulus in a formal sense also was in- tion for this scene. Kkita does not have the
terpretively neutral: This blicket is a tem- same extension as put in. Both put in and
plate that accepts both mass and count kkita describe an act of putting an object in
nouns (this horse/toothpaste). But here prin- a location; but put in is used for all cases of
ciple and probability part company. Re- containment (fruit in a bowl, flowers in a
cent experimentation leaves no doubt that vase) whereas kkita is used only in case the
child and adult listeners incrementally ex- outcome is a tight fit between two matching
ploit probabilistic facts about word use to shapes (tape in its case, one Lego piece on
guide the comprehension process on line another, glove on hand). Notice that there
(e.g., Snedeker, Thorpe, & Trueswell, 2001 ). is a cross-classification here: Whereas En-
In the present case, any English speaker glish appears to collapse across tightnesses
equipped with even a rough subjective prob- of fit, Korean makes this distinction but
ability counter should take into account the conflates across putting in versus putting on,
massive preponderance of count nouns over which English regularly differentiates. Very
mass nouns in English and conclude that a young learners of these two languages have
new word, blicket, used to refer to some in- already worked out the language-specific
determinate display, is probably a new count classification of such motion relations and
noun rather than a new mass noun. Count events in their language, as shown by both
nouns, in turn, tend to denote individuals their usage and their comprehension (Choi
rather than stuff and so have shape pre- & Bowerman, 1 991 ).
dictivity (Smith, 2001 ; Landau, Smith, & Do such cross-linguistic differences have
Jones, 1 998). implications for spatial cognition? Mc-
Applying this interpretation, it is not that Donough, Choi, and Mandler (2003 ) fo-
speaking English leads one to tip the scales cused on spatial contrasts between relations
toward object representations of newly seen of tight containment versus loose support
referents for perceptually ambiguous items, (grammaticalized in English by the prepo-
but that hearing English leads one to tip sitions in and on and in Korean by the verbs
the scales toward count-noun representa- kkita and nohta) and tight versus loose con-
tion of newly heard nominals in linguis- tainment (both grammaticalized as in in
tically ambiguous structural environments. English but separately as kkita and nehta
Derivatively, then, count syntax hints at ob- in Korean). They showed that prelinguis-
ject representation of the newly observed tic infants (nine to fourteen months old)
referent. Notice that such effects can be in both English- and Korean-speaking en-
expected to increase with age as massive vironments are sensitive to such contrasts,
lexical–linguistic mental databases are built, and so are Korean-speaking adults (see also
consistent with the findings of Lucy and Hespos & Spelke, 2000, who show that five-
Gaskins (2001 ).5 month-olds are sensitive to this distinction).
Their English-speaking adult subjects, how-
ever, showed sensitivity only to the tight
Spatial Relationships
containment versus loose support distinc-
Choi and Bowerman (1 991 ) studied the ways tion, which is grammaticalized in English
in which common motion verbs in Korean (in versus on). The conclusion drawn from
differ from their counterparts in English. these results was that some spatial relations
First, Korean motion verbs often contain lo- that are salient during the prelinguistic stage
become less salient for adult speakers if lan- various laboratory effects in dealing with
guage does not systematically encode them: spatial relations.
“Flexible infants become rigid adults.”
This interpretation again resembles that
Motion
for the perception of phoneme contrasts, but
by no means as categorically. The fact that Talmy (1 985 ) described two styles of
English speakers learn and readily use verbs motion expression characterizing different
such as jam, pack, and wedge weakens any languages: Some languages, including En-
claim that the lack of common terms seri- glish, typically use a verb plus a separate
ously diminishes the availability of catego- path expression to describe motion events.
rization in terms of tightness of fit. One pos- In such languages, manner of motion is en-
sibility is that the observed language-specific coded in the main verb (e.g., walk, crawl,
effects with adults are attributable to verbal slide, or float), and path information appears
mediation: Unlike preverbal infants, adults in nonverbal elements such as particles, ad-
may have turned the spatial classification verbials, or prepositional phrases (e.g., away,
task into a linguistic task. It therefore is use- through the forest, out of the room). In Greek
ful to turn to studies that explicitly compare or Spanish, the dominant pattern instead
performance when subjects from each lan- is to include path information within the
guage group are instructed to classify ob- verb itself (e.g., Greek bjeno, “exit” and beno,
jects or pictures by name, as opposed to “enter”); the manner of motion often goes
when they are instructed to classify the same unmentioned or appears in gerunds, preposi-
objects by similarity. In one such study, Li tional phrases, or adverbials (trehontas, “run-
et al. (1 997) showed Korean- and English- ning”). These patterns are not absolute.
speaking subjects pictures of events such as Greek has motion verbs that express manner,
putting a suitcase on a table (an example and English has motion verbs that express
of “on” in English, and of “loose support” path (enter, exit, cross). But several studies
in Korean). For half the subjects from each have shown that children and adults have
language group (each tested fully in their learned these dominance patterns. Slobin
own language), these training stimuli were (1 996) showed that child and adult Spanish
labeled by a videotaped cartoon character and English speakers vary in the terms they
who performed the events (I am Miss Picky typically use to describe the same picture-
and I only like to put things on things. See?), book stories with English speakers display-
and for the other subjects, the stimuli were ing greater frequency and diversity of man-
described more vaguely (. . . and I only like to ner of motion verbs. Papafragou, Massey,
do things like this. See?). Later categorization and Gleitman (2002) showed the same ef-
of new instances followed language in the fects for the description of motion scenes
labeling condition: English speakers identi- by Greek- versus English-speaking children
fied new pictures showing tight fits (e.g., a and, much more strongly, for Greek-versus
cap put on a pen) as well as the original English-speaking adults.
loose-fitting ones as belonging to the cate- Do such differences in event encoding af-
gory that Miss Picky likes, but Korean speak- fect the way speakers think about motion
ers generalized only to new instances of loose events? Papafragou et al. (2002) tested their
fits. These language-driven differences rad- English- and Greek-speaking subjects on ei-
ically diminished in the similarity sorting ther memory of path or manner details of
condition in which the word (on or nohta) motion scenes, or categorization of motion
was not invoked; in this case the catego- events on the basis of path or manner sim-
rization choices of the two language groups ilarities. Even though speakers of the two
were essentially the same. The “language on languages exhibited an asymmetry in encod-
language” interpretation we commended in ing manner and path information in their
discussing the object/substance distinction verbal descriptions, they did not differ in
in this case, too, seems to encompass the terms of classification or memory for path
and manner.6 Similar results have been ob- influences listeners’ interpretation of the
tained for Spanish versus English by Gen- speaker’s intended meaning if the stimu-
nari et al. (2002). Corroborating evidence lus situation leaves such interpretation unre-
also comes from studies by Munnich, Lan- solved. In another important demonstration
dau, and Dosher (2001 ), who compared En- of this language-on-language effect, Naigles
glish, Japanese, and Korean speakers’ naming and Terrazas (1 998) asked subjects to de-
of spatial locations and their spatial memory scribe and categorize videotaped scenes –
for the same set of locations. They found for example, of a girl skipping toward a
that, even in aspects in which languages tree. They found that Spanish- and English-
differed (e.g., encoding spatial contact or speaking adults differed in their preferred
support), there was no corresponding dif- interpretations of new (nonsense) motion
ference in memory performance across lan- verbs in manner-biasing (She’s kradding to-
guage groups. ward the tree or Ella está mecando hacia el
Relatedly, the same set of studies sug- árbol) or path-biasing (She’s kradding the tree
gests that the mental representation of mo- or Ella está mecando el árbol) sentence struc-
tion and location is independent of linguis- tures. The interpretations were heavily influ-
tic naming even within a single language. enced by syntactic structure. But judgments
Papafragou et al. (2002) divided their Eng- also reflected the preponderance of verbs in
lish- and Greek-speaking subjects’ verbal de- each language – Spanish speakers gave more
scriptions of motion according to whether path interpretations and English speakers
they included a path or manner verb, re- gave more manner interpretations. Similar
gardless of native language. Although En- effects of language-specific lexical practices
glish speakers usually chose manner verbs, on presumed word extension have been
sometimes they produced path verbs; the found for adjectives (Waxman, Senghas, &
Greek speakers also varied but with the pre- Benveniste, 1 997).
ponderances reversed. It was found that verb A fair conclusion from this and related
choice did not predict memory for path or evidence is that verbal descriptions are un-
manner aspects of motion scenes or choice der the control of many factors related to
of path or manner as a basis for catego- accessibility, including the simple frequency
rizing motion scenes. In the memory task, of a word’s use, as well as of faithfulness
subjects who had used a path verb to de- as a description of the scene. As several
scribe a scene were no more likely to detect authors have argued, the dynamic process
later path changes to that scene than sub- of expressing one’s thoughts is subject to
jects who had used a manner verb (and vice the exigencies of linguistic categories that
versa for manner). In the classification task, can vary from language to language. Given
subjects were not more likely to name two the heavy information-processing demands
motion events they had earlier categorized as of rapid conversation, faithfulness often is
most similar by using the same verb. Naming sacrificed to accessibility. For these and other
and cognition, then, are distinct under these reasons, verbal reports do not come any-
conditions: Even for speakers of a single where near exhausting the observers’ mental
language, the linguistic resources mobilized representations of events. Language use, in
for labeling underrepresent the cognitive this sense, is “sketchy.” Rather than “think-
resources mobilized for cognitive processing in words,” humans seem to make easy lin-
ing (e.g., memorizing, classifying, reason- guistic choices that, for competent listeners,
ing, etc.). serve as rough but usually effective pointers
An obvious conclusion from these stud- to those ideas.
ies of motion representation is that the
conceptual organization of space and mo-
Spatial Frames of Reference
tion is robustly independent of language-
specific labeling practices. Just as obvious, Certain linguistic communities (e.g., Tene-
however, is that specific language usage japan Mayans) customarily use an externally
referenced (absolute) spatial coordinate sys- fluences the choice of absolute versus spatial
tem to refer to nearby directions and po- coordinate frameworks. After all, the influ-
sitions (“to the north”); others (e.g., Dutch ence of such landmark information on spa-
speakers) use a viewer-perspective (relative) tial reasoning has been demonstrated with
system (“to the left”). Brown and Levinson nonlinguistic (rats; Restle, 1 95 7) and prelin-
(1 993 ) and Pederson et al. (1 998) recently guistic (infants; Acredolo & Evans, 1 980) an-
suggested that these linguistic practices af- imals. To examine this possibility, Li and
fect spatial reasoning in language-specific Gleitman replicated Brown and Levinson’s
ways. In one of their experiments, Tenejapan rotation task with English speakers, but they
Mayan and Dutch subjects were presented manipulated the presence or absence of
with an array of objects (toy animals) on a landmark cues in the testing area. The re-
tabletop; after a brief delay, subjects were sult, just as for the rats and the infants, was
taken to the opposite side of a new table that English-speaking adults respond abso-
(they were effectively rotated 1 80 degrees), lutely in the presence of landmark informa-
handed the toys, and asked to reproduce the tion (after rotation, they set up the animals
array “in the same way as before.” The over- going in the same cardinal direction) and rel-
whelming majority of Tenejapan (absolute) atively when it is withheld (they set up the
speakers rearranged the objects so they were animals going in the same relative – left or
heading in the same cardinal direction after right – direction).
rotation, whereas Dutch (relative) speakers Flexibility in spatial reasoning in this re-
massively preferred to rearrange the objects gard should come as little surprise. The abil-
in terms of left–right directionality. This co- ity to navigate in space is hard-wired in the
variation of linguistic terminology and spa- brain of moving creatures, including bees
tial reasoning seems to provide compelling and ants. For all of these organisms, re-
evidence for linguistic influences on nonlin- liable orientation and navigation in space
guistic cognition. are crucial for survival (Gallistel, 1 990).
As so often is the case in this literature, Accordingly, neurobiological evidence from
however, it is quite hard to disentangle cause humans and other species that the brain
and effect. For instance, it is possible that routinely uses a multiplicity of coordinate
the Tenejapan and Dutch groups think about frameworks in coding for the position of ob-
space differently because their languages jects to prepare for directed action (Gallistel,
pattern differently; but it is just as possi- 2002). It would be quite amazing if, among
ble that the two linguistic–cultural groups all the creatures that walk, fly, and crawl on
developed different spatial-orientational vo- the earth, only humans, by virtue of acquir-
cabulary to reflect (rather than cause) dif- ing a particular language, lose the ability to
ferences in their spatial reasoning strate- use both absolute and relative spatial coordi-
gies. Li and Gleitman (2002) investigated nate frameworks flexibly. The case is by no
this second position. They noted that ab- means closed even on this issue, however,
solute spatial terminology is widely used in because successive probes of the rotation
many English-speaking communities whose situation have continued to yield conflict-
environment is geographically constrained ing results both within and across languages
and includes large stable landmarks such as (e.g., Levinson, Kita, & Haun, 2002; Li &
oceans and looming mountains. The abso- Gleitman, in preparation]. One way of rec-
lute terms uptown, downtown, and crosstown onciling these findings and theories has to
(referring to North, South, and East–West) do with the level of analysis to which the
are widely used to describe and navigate in Levinson groups’ findings are thought to ap-
the space of Manhattan Island, Chicagoans ply. Perhaps we are prisoners of language
regularly make absolute reference to the only in complex and highly derived tasks and
lake, etc. It is quite possible, then, that the only when behavior is partly under the con-
presence or absence of stable landmark in- trol of verbal instructions that include vague
formation rather than language spoken in- expressions such as “make it the same.” But
it is fair to say that the jury is still out on this also able to attribute knowledge of the con-
phenomenon. tents of a container to a character who had
looked inside but not to another character
Evidentiality who had had no visual access to its content.
Furthermore, Korean learners were more
One of Whorf’s most interesting conjectures
advanced in their nonlinguistic knowledge
concerned the possible effects of evidentials
of sources of information than in their
(linguistic markers of information source) on
knowledge of the meaning of linguistic evi-
the nature of thought. Whorf pointed out
dentials. In this case, then, learned linguistic
that Hopi – unlike English – marked evi-
categories do not seem to serve as a guide
dential distinctions in its complementizer
for the individual’s nonlinguistic categories
system. Comparing the sentences I see that
in the way that Whorf conjectured. Rather,
it is red vs. I see that it is new, he remarked
the acquisition of linguistically encoded
(Whorf, 1 95 6, p. 85 ):
distinctions seems to follow, and build upon,
We fuse two quite different types of rela- the conceptual understanding of evidential
tionship into a vague sort of connection ex- distinctions. The conceptual understanding
pressed by ‘that’, whereas the Hopi indi- itself appears to proceed similarly across
cates that in the first case seeing presents a diverse language-learning populations.
sensation ‘red,’ and in the second that see-
ing presents unspecified evidence for which
is drawn the inference of newness. Time
Whorf concluded that this grammatical Thus far, we have focused on grammati-
feature was bound to make certain concep- cal and lexical properties of linguistic sys-
tual distinctions easier to draw for the Hopi tems and their possible effects on conceptual
speaker because of the force of habitual lin- structure. Here we consider another aspect
guistic practices. of languages as expressive systems – their
Papafragou, Li, Choi, and Han (in systematically differing use of certain net-
preparation) sought to put this proposal works of metaphor; specifically, metaphor
to test. They compared English, which for talking about time (Boroditsky, 2001 ).
mainly marks evidentiality lexically (I English speakers predominantly talk about
saw/heard/inferred that John left), with time as if it were horizontal (one pushes
Korean, in which evidentiality is encoded deadlines back, expects good times ahead, or
through a set of dedicated morphemes. moves meetings forward), whereas Mandarin
Given evidence that such morphemes are speakers more usually talk about time in
produced early by children learning Korean terms of a vertical axis (they use the Man-
(Choi, 1 995 ), they asked whether Korean darin equivalents of up and down to refer
children develop the relevant conceptual to the order of events, weeks, or months).
distinctions earlier and with greater reli- Boroditsky showed that these differences
ability than learners of English, in which predict aspects of temporal reasoning by
evidentiality is not grammatically encoded. speakers of these two languages. In one of
In a series of experiments, they compared her manipulations, subjects were shown two
the acquisition of nonlinguistic distinctions objects in vertical arrangement, say, one fish
between sources of evidence in three- and following another one downward, as they
four-year-olds learning English or Korean: heard something like The black fish is win-
No difference in nonlinguistic reasoning in ning. After this vertically oriented prime,
these regards was found between the English Mandarin speakers were faster to confirm
and Korean group. For instance, children in or disconfirm temporal propositions (e.g.,
both linguistic groups were equally good March comes earlier than April ) than if they
at reporting how they found out about the were shown the fish in a horizontal array.
contents of a container (e.g., by looking The reverse was true for English speakers.
inside or by being told); both groups were Boroditsky concluded that spatiotemporal
metaphors in language affect how people the basic principles underlying the adult
reason about time. She has suggested, more number system are innate; gaining access
generally, that such systematic linguistic to these principles gives children a way of
metaphors are important in shaping habit- grasping the infinitely discrete nature of
ual patterns of thought. natural numbers, as manifested by their
However, these results are again more ability to use verbal counting (Gelman &
complex than they seem at first glance. For Gallistel, 1 978; Gallistel & Gelman, Chap.
one thing, and as Boroditsky acknowledges, 23 ). Other researchers propose that children
vertical metaphors of time are by no means come to acquire the adult number system
absent from ordinary English speech (e.g., I by conjoining properties of the two prelin-
have a deadline coming up), although they are guistic number systems via natural language.
more sporadic than in Mandarin. So again we Specifically, they propose that grasping the
have a cross-linguistic difference of degree, linguistic properties of number words (e.g.,
rather than a principled opposition. More- their role in verbal counting or their seman-
over, Boroditsky briefly trained her English- tic relations to quantifiers such as few, all,
speaking subjects to think about time ver- many, most; see Spelke & Tsivkin, 2001 a and
tically, as in Mandarin. After such training, Bloom, 1 994b; Carey, 2001 respectively)
the English speakers exhibited the vertical enables children to put together elements of
(rather than the former horizontal) priming the two previously available number systems
effect. Apparently, fifteen minutes of train- to create a new, generative number faculty.
ing on the vertical overcame and completely In Bloom’s (1 994b, p. 1 86) words, “in the
reversed twenty-plus years of the habitual course of development, children ‘bootstrap’
use of the horizontal in these speakers. The a generative understanding of number out of
effects of metaphor, it seems, are transient the productive syntactic and morphological
and fluid without long-term influence on structures available in the counting system.”
the nature of conceptualization or its im- Upon hearing the number words in a
plicit deployment to evaluate propositions in counting context, for instance, children re-
real time. alize that these words map onto both
specific representations delivered by the
exact-numerosities calculator and inexact
Number
representations delivered by the approxima-
Prelinguistic infants and nonhuman pri- tor device. By conjoining properties of these
mates share an ability to represent both ex- two systems, children gain insight into the
act numerosities for very small sets (roughly properties of the adult conception of num-
up to three objects) and approximate nu- ber (e.g., that each of the number words
merosities for larger sets (Dehaene, 1 997). picks out an exact set of entities, that adding
Human adults possess a third system for rep- or subtracting exactly one object changes
resenting number that allows for the rep- number, etc.). Ultimately, it is hypothesized
resentation of exact numerosities for large that this process enables the child to com-
sets; in principle has no upper bound on set pute exact numerosities even for large sets
size; and can support the comparison of nu- (such as seven or twenty-three) – an ability
merosities of different sets, as well as pro- not afforded by either of the prelinguistic
cesses of addition and subtraction. Crucially, calculation systems.
this system is generative because it possesses Spelke and Tsivkin (2001 a, b) experi-
a rule for creating successive integers (the mentally investigated the thesis that lan-
successor function) and therefore is charac- guage contributes to exact large-number cal-
terized by discrete infinity (see Gallistel & culations. In their studies, bilinguals who
Gelman, Chap. 23 ). were trained on arithmetic problems in a
How do young children become capable single language and later tested on them
of using this uniquely human number were faster on large-number arithmetic if
system? One powerful answer is that tested in the training language; however, no
such advantage of the training language ap- proposed as a possible candidate (Carey,
peared with estimation problems. The con- 2001 ). However, familiar quantifiers lack the
clusion from this and related experiments hallmark properties of the number system:
was that the particular natural language is They are not strictly ordered with respect
the vehicle of thought concerning large ex- to one another, and their generation is not
act numbers but not about approximate nu- governed by the successor function. In fact,
merosities. Such findings, as Spelke and her several quantifiers presuppose the compu-
collaborators have emphasized, can be part tation of cardinality of sets – for example,
of the explanation of the special “smartness” neither and both apply only to sets of two
of humans. Higher animals, like humans, can items (Keenan & Stavi, 1 986; Barwise &
reason to some degree about approximate Cooper, 1 981 ). Moreover, quantifiers com-
numerosity, but not about exact numbers. pose in quite different ways from numbers.
Beyond this shared core knowledge, how- For example, the expression most men and
ever, humans have language. If language is women cannot be interpreted to mean a large
a required causal factor in exact number majority of the men and much less than half
knowledge, in principle this could explain the women (A. Joshi, personal communica-
the gulf between creatures like us and creation). In light of the semantic disparities be-
tures like them. tween the quantifier and integer systems, it
How plausible is the view that the is hard to see how it is possible to bootstrap
adult number faculty presupposes linguis- the semantics of one from the other.
tic mediation? Recall that, on this view, Recent experimental findings suggest,
children infer the generative structure of moreover, that young children understand
number from the generative structure of certain semantic properties of number
grammar when they hear others count- words well before they know those of quan-
ing. However, counting systems vary cross- tifiers. One case involves the scalar interpre-
linguistically, and in a language like English, tation of these terms. In one experiment,
their recursive properties are not really ob- Papafragou and Musolino (2003 ) had five-
vious from the outset. Specifically, until year-old children watch as three horses were
number eleven, the English counting sys- shown jumping over a fence. The children
tem presents no evidence of regularity, much would not accept Two of the horses jumped
less of generativity: A child hearing one, two, over the fence as an adequate description of
three, four, five, six, up to eleven, would have that event (even though it is necessarily true
no reason to assume – based on properties that if three horses jumped, then certainly
of form – that the corresponding numbers two did). But at the same age, they will
are lawfully related (namely, that they suc- accept Some of the horses jumped over the
cessively increase by one). For larger num- fence as an adequate description even though
bers, the system is more regular, even though it is true that all of the horses jumped. In
not fully recursive because of the presence another experiment, Hurewitz, Papafragou,
of several idiosyncratic features (e.g., one Gleitman, and Gelman (in review) found
can say eighteen or nineteen but not tenteen that three-year-olds understand certain se-
for twenty). In sum, it is not so clear how mantic properties of number words such as
the “productive syntactic and morphological two and four well before they know those
structures available in the counting system” of quantifiers such as some and all. It seems,
will provide systematic examples of discrete then, that the linguistic systems of number
infinity that can then be imported into num- and natural-language quantification are de-
ber cognition (see Grinstead et al., 2003 , for veloping rather independently. If anything,
detailed discussion). the children seem more advanced in knowl-
Can properties of other natural language edge of the meaning of number words than
expressions bootstrap a generative under- quantifiers so it is hard to see how the
standing of number? Quantifiers have been semantics of the former lexical type is to
be bootstrapped from the semantics of the encoding of the relevant terms in a specifi-
latter. cally linguistic format.
In a recent review article, Carruthers
(2002) suggests even more strongly that in
number, space, and perhaps other domains,
Orientation
language is the medium of intermodular
A final domain we discuss is spatial orien- communication, a format in which repre-
tation. Cheng and Gallistel (1 984) found sentations from different domains can be
that rats rely on geometric information to combined to create novel concepts. In stan-
reorient themselves in a rectangular space, dard assumptions about modularity, how-
and seem incapable of integrating geomet- ever, modules are characterized as compu-
rical with nongeometrical properties (e.g., tational systems with their own proprietary
color, smell, etc.) in searching for a hidden vocabulary and combinatorial rules. Because
object. If they see food hidden at the cor- language itself is a module in this sense,
ner of a long and a short wall, they will its computations and properties (e.g., gen-
search equally at either of the two such walls erativity, compositionality) cannot be trans-
of a rectangular space after disorientation; ferred to other modules because they are
this is true even if these corners are dis- defined over – and can only apply to –
tinguishable by one of the long walls be- language-internal representations. One way
ing painted blue or having a special smell, out of this conundrum is to give up the as-
and so on. Hermer and Spelke (1 994, 1 996) sumption that language is – on the appropri-
reported a very similar difficulty in young ate level – modular:
children. Both animals and young children
Language may serve as a medium
can navigate and reorient by the use of ei-
for this conjunction . . . because it is a
ther geometric or nongeometric cues; it is domain-general, combinatorial system to
integrating across the cue types that creates which the representations delivered by the
trouble. These difficulties are overcome by child’s . . . [domain-specific] nonverbal sys-
older children and adults, who are able, for tems can be mapped. (Spelke & Tsivkin,
instance, to go straight to the corner formed 2 001 b, p. 84).
by a long wall to the left and a short blue
wall to the right. Hermer and Spelke found Language is constitutively involved in
that success in these tasks was significantly (some kinds of) human thinking. Specifi-
predicted by the spontaneous combination cally, language is the vehicle of nonmodu-
of spatial vocabulary and object properties lar, nondomain-specific, conceptual think-
ing which integrates the results of modular
such as color within a single phrase (e.g., to
thinking (Carruthers, 2 002 , p. 666).
the left of the blue wall ).7 Later experiments
(Hermer-Vasquez, Spelke, and Katsnelson, On this view, the output of the linguistic
1 999) revealed that adults who were asked system just is Mentalese: There is no other
to shadow speech had more difficulty in level of representation in which the infor-
these orientation tasks than adults who were mation to the left of the blue wall can be en-
asked to shadow a rhythm with their hands; tertained. This picture of language is novel
however, verbal shadowing did not disrupt in many respects. In the first place, replac-
subjects’ performance in tasks that required ing Mentalese with a linguistic representa-
the use of nongeometric information only. tion challenges existing theories of language
The conclusion was that speech-shadowing, production and comprehension. Tradition-
unlike rhythm-shadowing, by taking up lin- ally, and as discussed earlier, it is assumed
guistic resources, blocked the integration of the production of sentences begins by en-
geometrical and object properties, which is tertaining the corresponding thought, which
required to solve complex orientation tasks. then mobilizes the appropriate linguistic re-
In short, success at the task seems to require sources for its expression (e.g., Levelt, 1 989).
On recent proposals, however, Carruthers, or like prelinguistic infants and rats. This
(2002, p. 668) observed: prediction, although merely carrying the
original proposal to its apparent logical con-
We cannot accept that the production of a clusion, is quite radical: It allows a striking
sentence ‘The toy is to the left of the blue discontinuity among members of the human
wall’ begins with a tokening of the thought species, contingent not upon the presence or
THE TOY IS TO THE LEFT OF THE
absence of human language and its combi-
BLUE WALL (in Mentalese), since our hy-
pothesis is that such a thought cannot be natorial powers (as the original experiments
entertained independently of being framed seem to suggest) or even upon cultural and
in a natural language. educational differences, but on vagaries of
the lexicon in individual linguistic systems.
Inversely, language comprehension clas- Despite its radical entailments, there is a
sically is taken to unpack linguistic repre- sense in which Spelke’s proposal to inter-
sentations into mental representations that pret concept configurations on the basis of
then can trigger further inferences. But in the combinatorics of natural language can be
Carruthers’ proposal, after hearing The toy construed as decidedly nativist. In fact, we so
is to the left of the blue wall, the interpretive construe it. Spelke’s proposal requires that
device cannot decode the message into the humans be equipped with the ability to con-
corresponding thought because there is no struct novel structured syntactic represen-
level of Mentalese independent of language tations, insert lexical concepts at the termi-
in which the constituents are lawfully con- nal nodes of such representations (left, blue,
nected to each other. Interpretation can only etc.), and interpret the outcome on the ba-
dismantle the utterance and send its consis of familiar rules of semantic composition
cepts back to the geometric and landmark (to the left of the blue wall). In other words,
modules to be processed. In this sense, un- humans are granted principled knowledge of
derstanding an utterance such as The picture how phrasal meaning is to be determined by
is to the right of the red wall turns out to be lexical units and the way they are composed
a very different process than understanding into structured configurations. That is, what
superficially similar utterances such as The is granted is the ability to read the seman-
picture is to the right of the wall, or The pic- tics off of phrase structure trees. Further, the
ture is on the red wall, which do not, on this assumption is that this knowledge is not at-
account, require cross-domain integration. tained through learning but belongs to the
Furthermore, if language is to serve as a in-built properties of the human language
domain for cross-module integration, then device. But notice that granting humans the
the lexical resources of each language be- core ability to build and interpret phrase
come crucial for conceptual combination. structures is already granting them quite a
Lexical gaps in the language will block con- lot. Exactly these presuppositions have been
ceptual integration, for instance, because the hallmark of the nativist program in lin-
there would be no relevant words to insert guistics and language acquisition (Chomsky,
into the linguistic string. We know that color 1 95 7; Pinker, 1 984; Gleitman, 1 990; Lidz,
terms vary across languages (Kay & Regier, Gleitman, & Gleitman, 2002; Jackendoff,
2002); more relevantly, not all languages 1 990) and the target of vigorous dissent else-
have terms for left and right (Levinson, where (Tomasello, 2000; Goldberg, 1 995 ).
1 996). It follows that speakers of these lan- To the extent that Spelke and Tsivkin’s ar-
guages should fail to combine geometric and guments about language and cognition rely
object properties in the same way as do En- on the combinatorial and generative powers
glish speakers to recover from disorientation. of language, they already make quite deep
In other words, depending on the spatial vo- commitments to abstract (and unlearnable)
cabulary available in their language, disori- syntactic principles and their semantic re-
ented adults may behave either like Spelke flexes. Notice in this regard that because
and Tsivkin’s English-speaking population these authors hold that any natural language
will serve as the source and vehicle for the thinking (Varley & Siegal, 2000). Animals
required inferences, the principles at work can form representations of space, artifacts,
here must be abstract enough to wash out and perhaps even mental states without
the diverse surface-structural realizations of linguistic crutches (Hauser & Carey, 1 998;
to the left of the blue wall in the languages of Gallistel, 1 990; Hare, Call, & Tomasello,
the world. Independently of particular expe- 2001 ; and Call & Tomasello, Chap. 25 ). In
riences, an organism with such principles in light of all these language–thought dispari-
place could generate and systematically com- ties, it would seem perverse to take an equa-
prehend novel linguistic strings with mean- tive position on relations between the two.
ings predictable from the internal organiza- At the same time, compelling experi-
tion of those strings and, for different but mental studies again and again document
related reasons, just as systematically fail to intimate, seemingly organic, relationships
understand other strings such as to the left of among language, thought, and culture, of
the blue idea. We would be among the last much the kind that Whorf and Sapir drew
to deny such a proposal in its general form. out of their field experiences. What is to
We agree that there are universal aspects explain these deep correlations between
of the syntax–semantics interface. Whether culturally divergent ways of thinking and
these derive from or augment the combina- culturally divergent ways of talking? In cer-
torial powers of thought is the question at tain cases, we argued that cause and effect
issue here. For the present commentators, it had simply been prematurely placed on one
is hard to see how shifting the burden of the foot or another because of the crudeness
acquisition of compositional semantics from of our investigative tools. Inconveniently
the conceptual system to the linguistic sys- enough, it is often hard to study language
tem diminishes the radical nativist flavor of development apart from conceptual and
the position. cultural learning or to devise experiments in
which these factors can be prevented from
interacting, so it is hard to argue back to
origins. On the other hand, the difficulty
Conclusions and Future Directions of even engineering such language–thought
dissociations in the laboratory is one signifi-
We have just tried to review the burgeoning cant point in favor of a linguistic–relativistic
psychological and anthropological literature view. Why should it be so hard to pry them
that attempts to relate language to thought. apart if they are so separate?
We began with the many difficulties in- Over the course of the discussion, our
volved in radical versions of the linguistic rel- reading of the evidence provides source
ativity position, including the fact that lan- global support for what we take to be the “ty-
guage seems to underspecify thought and to pological bootstrapping” and “thinking for
diverge from it regarding the treatment of speaking” positions articulated in various
ambiguity, paraphrase, and deictic reference. places by Slobin [1 996; 2001 ; 2003 , inter
Moreover, there is ample evidence that sev- alia]. Language influences thought “on line”
eral forms of cognitive organization are in- and in many ways. For the learner, the par-
dependent of language: Infants who have no ticular speech events that one experiences
language are able to entertain relatively com- can and do provide cues to nonlinguistic
plex thoughts; for that matter, they can learn categorization – that is, a new linguistic la-
languages or even invent them when the bel “invites” the learner to attend to certain
need arises (Goldin-Meadow, 2003 ; Senghas types of classification criteria over others.
et al., 1 997). Many bilinguals, as a mat- Markman and Hutchinson (1 984) found that
ter of course, “code-switch” between their if one shows a two-year-old a new object
known languages even during the utterance and says See this one; find another one, the
of a single sentence (Joshi, 1 985 ). Aphasics child typically reaches for something that
sometimes exhibit impressive propositional has a spatial or encyclopedic relation to the
original object (e.g., finding a bone to go with tive discontinuities among speakers of dif-
the dog). But if one uses a new word (See this ferent languages. But other commentators
fendle, find another fendle), the child typically see this cross-linguistic diversity as much
looks for something from the same category more limited and superficial than the bloom-
(e.g., finding another dog to go with the ing, buzzing confusion coming out of the
first dog). Similar effects have been obtained tower of Babel. For instance, many stud-
with much younger children: Balaban and ies in morphosyntax show that apparently
Waxman (1 997) showed that labeling can fa- distinct surface configurations of linguistic
cilitate categorization in infants as young as elements in different languages can be ana-
nine months (cf. Xu, 2002). Beyond catego- lyzed in terms of underlying structural simi-
rization, labeling has been shown to guide larities (Chomsky, 2000; Baker, 2001 ). Stud-
infants’ inductive inference (e.g., expecta- ies in linguistic semantics suggest that the
tions about nonobvious properties of novel properties and meanings of syntactic entities
objects), even more so than perceptual sim- (e.g., determiners) are severely constrained
ilarity (Welder & Graham, 2001 ). Other re- cross-linguistically (Keenan & Stavi, 1 986).
cent experimentation shows that labeling Many of these principles of language organi-
may help children solve spatial tasks by zation seem to map quite transparently from
pointing to specific systems of spatial rela- core knowledge of the kinds studied in in-
tions (Loewenstein & Gentner, 2003 ). For fants (e.g., Quinn, 2001 ; Baillargeon, 1 993 ;
learners, then, the presence of linguistic la- and other sources mentioned throughout).
bels constrains criteria for categorization and For instance, scenes of kangaroos jumping
serves to foreground a codable category out come apart into the kangaroo (argument)
of all the possible categories to which a stim- part and jumping (predicate) part in every
ulus could be said to belong. natural language, but also in the prelinguis-
To what extent these linguistic influences tic parsing of events by children, including
result in mere tweaks – slight shifts in the those learning language under circumstances
boundaries between categories – or to more of extreme linguistic and sensory deprivation
radical reorganizations of the learners’ con- (e.g., blind or isolated deaf children: Goldin-
ceptual world (as in the reorganizational Meadow, 2003 ; Landau & Gleitman, 1 985 ;
principles that stand between phonetics and Senghas et al., 1 997). Focus on this kind
phonology) is hard to say at the present of evidence suggests that cross-linguistic di-
time. For competent adult users, thinking for versity is highly constrained by rich and
speaking effects arise again to coax the lis- deep underlying similarities in the nature of
tener toward certain interpretations of the thought. Thus, rather than pointing to cogni-
speech he or she is hearing as a function tive discontinuities among speakers of differ-
of probabilistic features of a particular lan- ent languages, cross-linguistic diversity could
guage. The clearest example in the analy- reveal principled points of departure from
sis we presented is the series of inferences an otherwise common linguistic–conceptual
that lead to different cross-linguistic catego- blueprint humans share as a consequence of
rizations of novel not-clearly-individuatable their biological endowment.
stimulus items with nonsense names: If it is
an English noun, it is probably an English
count-noun; if it is an English count-noun, it
is probably naming an individuatable object. Acknowledgments
It appears to us that much discussion
about the relationship between language and We thank Jerry Fodor for a discussion of
thought has been colored by an underlying the semantics of raining, Ray Jackendoff
disagreement about the nature of language for a discussion of phonology, as well as
itself. Many commentators, struck by ob- Dedre Gentner for her comments on this
served cross-linguistic diversity in semantic chapter. Much of our perspective derives
and syntactic categories, have taken this di- from our collaborative work with Cynthia
versity as a possible source of deeper cogni- Fisher, Henry Gleitman, Christine Massey,
Kimberly Cassidy, Jeff Lidz, Peggy Li, and its words start out as mass nouns and be-
Barbara Landau. Writing of this chapter come countable entities only through adding
was supported by NIH Grant No. 1 -R01 - the classifiers the and a (compare brick the sub-
HD3 75 07-02 to J. Trueswell and L. R. Gleit- stance to a brick, the object). Detailed linguis-
man and NIH Grant No. 1 F3 2MH65 020- tic analysis, however, suggests there is a gen-
uine typological difference here (Slobin, 2001
01 A2 to A. Papafragou.
and Lucy & Gaskins, 2001 ; Chierchia, 1 998;
Krifka, 1 995 , for discussion). The question is
whether, because all languages formally mark
the mass or count distinction in one way or
Notes another, the difference in particular linguis-
tic means could plausibly rebound to impact
1 . In one experimental demonstration, subjects ontology.
were asked: When an airplane crashes, where 5 . We should point out that this hint, at best, is a
should the survivors be buried? They rarely no- weak one, another reason why the observed
ticed the meaning discrepancy in the question interpretive difference for Japanese and En-
(Barton & Sanford, 1 996). glish speakers, even at the perceptual midline,
2. The similarity test may not be decisive for this is also weak. Notoriously, English often violates
case, as Malt, Sloman, and Gennari (2003 ), as the semantic generalization linking mass noun
well as Smith, Colunga, and Yoshida (2001 ), morphology with substancehood (compare,
among others, have pointed out. Similarity for example, footwear, silverware, furniture).
judgments applied as the measuring instru- 6. Subsequent analysis of the linguistic data re-
ment could systematically mask various non- vealed that Greek speakers were more likely
perceptual determinants of organization in to include manner of motion in their ver-
a semantic–conceptual domain, some poten- bal descriptions when manner was unexpected
tially language-caused. Over the course of this or noninferable, whereas English speakers in-
chapter, we will return to consider other do- cluded manner information regardless of in-
mains and other psychological measures. For ferability (Papafragou, Massey, & Gleitman,
further discussion of the sometimes arbitrary 2003 ). This suggests that speakers may mon-
and linguistically varying nature of the lexi- itor harder-to-encode event components and
con, even in languages that are typologically choose to include them in their utterances
and historically closely related, see Kay (1 996). when especially informative. This finding rein-
He points out, for example, that English speak- forces the conclusion that verbally encoded as-
ers use screwdriver whereas the Germans use pects of events vastly underdetermine the sub-
Schraubenzieher (literally, “screwpuller”), and tleties of event cognition.
the French tournevise (literally, “screwturner”) 7. Further studies show that success in this task
for the same purposes; our turnpike exit–entry among young children is sensitive to the size
points are marked exit, whereas the Brazilians of the room: In a large room, more four-year-
have entradas; and so forth. olds succeed in combining geometric and land-
3 . Categorical perception for speech sounds has mark information (Learmonth, Nadel, & New-
been documented for other species, includ- combe, in press). Moreover, it is claimed that
ing chinchillas and macaques (e.g., Kuhl & other species (chickens, monkeys) can use both
Miller, 1 978). Moreover, studies from Kay and types of information when disoriented (Val-
Kempton (1 984) and Roberson, Davies, and lortigara, Zanforlin, & Pasti, 1 990; Gouteux,
Davidoff (2000) suggest that even for hue Thinus-Blanc, & Vauclair, in press). For discus-
perception, the relationship between linguis- sion, see Carruthers (2002).
tic and perceptual categorization is not so
clear with categorical perception effects ob-
tained or not obtained depending on very
delicate choices of experimental procedure
and particular characteristics of the stimulus. References
For an important review, see Munnich and
Landau (2003 ). Acredolo, L., & Evans, D. (1 980). Developmental
4. This argument is not easy. One might argue changes in the effects of landmarks on infant
that English is a classifier language much like spatial behavior. Developmental Psychology, 1 6,
Yucatec Mayan or Japanese – that is, that all 3 1 2–3 1 8.
Baillargeon, R. (1 993 ). The object concept Bowerman, M. (1 996). The origins of children’s
revisited: New directions in the investigation of spatial semantic categories: Cognitive versus
infants’ physical knowledge. In C. E. Granrud linguistic determinants. In J. Gumperz, & S. C.
(Ed.), Carnegie Mellon Symposia on Cognition, Levinson (Eds.), Rethinking Linguistic Relativ-
Vol. 2 3 : Visual Perception and Cognition in In- ity (pp. 1 45 –1 76). Cambridge, UK: Cambridge
fancy (pp. 265 –3 1 5 ). Hillsdale, NJ: Erlbaum. University Press.
Baker, M. (2001 ). The Atoms of Language. New Bowerman, M., & Levinson, S. C. (2001 ). Intro-
York: Basic Books. duction. In M. Bowerman and S. C. Levinson
Balaban, M. T., & Waxman, S. R. (1 997). Do (Eds.), Language Acquisition and Conceptual
words facilitate object categorization in 9- Development (pp. 1 –1 6). Cambridge, UK: Cam-
month-old infants? Journal of Experimental bridge University Press.
Child Psychology, 64, 3 –26. Brown, P., & Levinson, S. C. (1 993 ). “Uphill” and
Barton, S. B., & Sanford, A. J. (1 993 ). A case “downhill” in Tzeltal. Journal of Linguistic An-
study of anomaly detection: Shallow semantic thropology, 3 , 46–74.
processing and cohesion establishment. Mem- Brown, R. (1 95 7). Linguistic determinism and the
ory and Cognition, 2 1 , 477–487. parts of speech. Journal of Abnormal and Social
Barwise, J., & Cooper, R. (1 981 ). Generalized Psychology, 5 5 , 1 –5 .
quantifiers and natural language. Linguistics and Brown, R., & Lenneberg, E. (1 95 4). A study of
Philosophy, 4, 1 5 9–21 9. language and cognition. Journal of Abnormal
Berlin, B., & Kay, P. (1 969). Basic Color Terms: and Social Psychology, 49, 45 4–462.
Their Universality and Evolution. Berkeley: Uni- Carey, S. (1 982). The child as word learner. In M.
versity of California Press. Halle, J. Bresnan, & G. Miller (Eds.), Linguistic
Best, C., McRoberts, G., & Sithole, N. (1 988). Theory and Psychological Reality (pp. 264–293 ).
The phonological basis of perceptual loss for Cambridge, MA: MIT Press.
non-native contrasts: Maintenance of discrim- Carey, S. (2001 ). Whorf versus continuity
ination among Zulu clicks by English-speaking theorists: Bringing data to bear on the de-
adults and infants. Journal of Experimental bate. In M. Bowerman, & S. Levinson (Eds.),
Psychology: Human Perception and Performance, Language Acquisition and Conceptual Develop-
1 4, 3 45 –3 60. ment (pp. 1 85 –21 4). Cambridge, UK: Cam-
Bloch, B., & Trager, G. L. (1 942). Outline of Lin- bridge University Press.
guistic Analysis. Baltimore: Waverly Press. Carruthers, P. (2002). The cognitive functions
Bloom, P. (1 994a). Possible names: The role of of language. Behavioral and Brain Sciences, 2 5 ,
syntax-semantics mappings in the acquisition 65 7–674.
of nominals. Lingua, 92 , 297–3 29. Cheng, K., & Gallistel, C. R. (1 984). Testing the
Bloom, P. (1 994b). Generativity within language geometric power of an animal’s spatial repre-
and other cognitive domains. Cognition, 5 1 , sentation. In H. Roitblat, T. G. Bever, & H. Ter-
1 77–1 89. race (Eds.), Animal Cognition (pp. 409–423 ).
Hillsdale, NJ: Erlbaum.
Bloom, P. (2000). How Children Learn the Mean-
ing of Words. Cambridge, MA: MIT Press. Chierchia, G. (1 998). Reference to kinds across
languages. Natural Language Semantics, 6,
Boroditsky, L. (2001 ). Does language shape
3 3 9–405 .
thought?: Mandarin and English speakers’ con-
ception of time. Cognitive Psychology, 43 , Choi, S., & Bowerman, M. (1 991 ). Learning to
1 –22. express motion events in English and Korean:
The influence of language-specific lexicaliza-
Bowerman, M. & Choi, S. (2001 ). Shaping mean-
tion patterns. Cognition, 41 , 83 –1 21 .
ings for language: Universal and language-
specific in the acquisition of spatial semantic Choi, S. (1 995 ). The development of epistemic
categories. In M. Bowerman, & S. C. Levin- sentence-ending modal forms and functions in
son (Eds.), Language Acquisition and Concep- Korean children. In J. Bybee, & S. Fleischman
tual Development (pp. 475 –5 1 1 ). Cambridge, (Eds.), Modality in Grammar and Discourse
UK: Cambridge University Press. (pp. 1 65 –204). Amsterdam: Benjamins.
Bowerman, M., & Levinson, S. C. (Eds.) (2001 ). Chomsky, N. (1 95 7). Syntactic Structures. The
Language Acquisition and Conceptual Develop- Hague: Mouton.
ment. Cambridge, UK: Cambridge University Chomsky, N. (1 964) Current Issues in Linguistic
Press. Theory. The Hague: Mouton.
Chomsky, N. (1 965 ). Aspects of the Theory of Syn- quisition and Conceptual Development (pp. 21 5 –
tax. Cambridge, MA: MIT Press. 25 6). Cambridge, UK: Cambridge University
Chomsky, N. (1 975 ). Reflections on Language. Press.
New York: Pantheon. Gentner, D., & Goldin-Meadow, S., (Eds.)
Chomsky, N. (2000). New Horizons in the Study (2003 ). Language in Mind: Advances in the
of Language and Mind. Cambridge, UK: Cam- Study of Language and Thought. Cambridge,
bridge University Press. MA: MIT Press.
Clark, H. (1 992). Arenas of Language Use. Gleitman, L. (1 990). The structural sources of
Chicago: University of Chicago Press. verb meaning. Language Acquisition, 1 , 1 –5 5 .
Dehaene, S. (1 997). The Number Sense. New Glietman, L., Abarbanell, L., Gallistel, G. R., Lee,
York: Oxford University Press. P., & Papafragou, A. (in preparation). On the
Dell, G. (1 995 ). Speaking and mispeaking. In L. flip side: Spatial reasoning of Mayan and English
Gleitman, & M. Liberman (Eds.), Language: An speakers.
Invitation to Cognitive Science, (pp. 1 83 –208). Gleitman, L., & Rozin P. (1 977). The structure
Cambridge, MA: MIT Press. and acquisition of reading I: Relations between
Descartes, R. (1 662). Trait de l’homme. E. S. Hal- orthographies and the structure of language. In
dane, & G. R. T. Ross (Trans). Cambridge, UK: A. Reber, & D. Scarborough (eds.), Toward a
Cambridge University Press. Psychology of Reading. Hillsdale, NJ: Erlbaum.
Eimas, P., Siqueland, E., Jusczyk, P., & Vigorito, Goldberg, A. (1 995 ). Constructions: A Construc-
J. (1 971 ). Speech perception in infants. Science, tion Grammar Approach to Argument Structure.
1 71 , 3 03 –3 06. Chicago: University of Chicago Press.
Fisher, C. (1 996). Structural limits on verb Goldin-Meadow, S. (2003 ). Thought before
mapping: The role of analogy in children’s in- language: Do we think ergative? In D. Gentner,
terpretations of sentences. Cognitive Psychol- & S. Goldin-Meadow (Eds.), Language in Mind:
ogy, 3 1 , 41 –81 . Advances in the Study of Language and Thought,
(pp. 493 –5 22). Cambridge, MA: MIT Press.
Fisher, C., & Gleitman, L. R. (2002). Breaking
the linguistic code: Current issues in early lan- Gouteux, S., Thinus-Blanc, C., & Vauclair, S.
guage learning. In H. F. Pashler (series Ed.), & (in press). Rhesus monkeys use geometric and
R. Gallistel (vol. Ed.), Steven’s Handbook of Ex- non-geometric information during a reorienta-
perimental Psychology, Vol. 1 : Learning and Mo- tion task. Journal of Experimental Psychology:
tivation (3 rd ed.). New York: Wiley. General, 1 3 0, 5 05 –5 1 9.
Fodor, J. (1 975 ). The Language of Thought. New Grice, P. (1 975 ). Logic and conversation. In P.
York: Crowell. Cole, & J. Morgan (Eds.), Syntax and Semantics,
Gallistel, C. R., & Gelman, R. (1 992). Preverbal Vol. 3 : Speech Acts. New York: Academic Press.
and verbal counting and computation. Cogni- Grinstead, J., MacSwan, J., Curtiss, S., & Gelman,
tion, 44, 43 –74. R. (2 003 ). (under review) The independence
Gallistel, R. (1 990). The Organization of Learning. of language and number.
Cambridge, MA: MIT Press. Gumperz, J., & Levinson, S., (Eds.) (1 996). Re-
Gallistel, R. (2002). Language and spatial frames thinking Linguistic Relativity. Cambridge, UK:
of reference in mind and brain. Trends in Cog- Cambridge University Press.
nitive Science, 6, 3 21 –3 22. Hare, B., Call, J., & Tomasello, M. (2001 ). Do
Gelman, R., & Spelke, E. (1 981 ). The develop- chimpanzees know what conspecifics know?
ment of thoughts about animate and inanimate Animal Behaviour, 61 , 1 3 9–1 5 1 .
objects: Implications for research on social cog- Hauser, M., & Carey, S. (1 998). Building a cog-
nition. In J. H. Flavell, & L. Ross (Eds.), So- nitive creature from a set of primitives. In
cial Cognitive Development: Frontiers and Pos- D. Cummins, & C. Allen (Eds.), The Evolu-
sible Futures (pp. 43 –66). Cambridge, UK: tion of Mind. Oxford, UK: Oxford University
Cambridge University Press. Press.
Gennari, S., Sloman, S., Malt, B., & Fitch, W. Heider, E., & Oliver, D. C. (1 972). The structure
(2002). Motion events in language and cogni- of color space in naming and memory for two
tion. Cognition, 83 , 49–79. languages. Cognitive Psychology, 3 , 3 3 7–3 5 4.
Gentner, D., & Boroditksy, L. (2001 ). Individua- Hermer, L., & Spelke, E. (1 994). A geometric
tion, relativity and early word learning. In M. process for spatial representation in young chil-
Bowerman, & S. Levinson (Eds.), Language Ac- dren. Nature, 3 70, 5 7–5 9.
Hermer, L., & Spelke, E. (1 996). Modularity and R. Fox (Eds.), Neonate Cognition: Beyond the
development: The case of spatial reorientation. Blooming Buzzing Confusion. Hillsdale, NJ:
Cognition, 61 , 1 95 –23 2. Erlbaum.
Hermer-Vasquez, L., Spelke, E., & Katsnelson, Kay, P. & Regier, T. (2002). Resolving the ques-
A. (1 999). Sources of flexibility in human tion of color naming universals. Proceedings
cognition: Dual-task studies of space and lan- of the National Academy of Sciences, 1 00(1 5 ),
guage. Cognitive Psychology, 3 9, 3 –3 6. 9085 –9089.
Hespos, S., & Spelke, E. (2000). Conceptual pre- Kay, P. (1 996). Intra-speaker relativity. In J.
cursors to spatial language: Categories of con- Gumperz, & S. Levinson (Eds.), Rethinking Lin-
tainment. Paper presented at the Meeting of guistic Relativity (pp. 97–1 1 4). Cambridge, UK:
the International Society on Infant Studies. Cambridge University Press.
Brighton, UK. Kay, P., & Kempton, W. (1 984). What is the
Hume, D. (1 73 9). D. F. Norton, & M. Norton, Sapir-Whorf hypothesis? American Anthropol-
(Ed.). A Treatise on Human Nature. New York: ogist, 86, 65 –79.
Oxford University Press, 2000. Keenan, E., & Stavi, J. (1 986). A semantic charac-
Hurewitz, F., Papafragou, A., Gleitman, L., & terization of natural language determiners. Lin-
Gelman, R. (in prep.). The acquisition of num- guistics and Philosophy, 9, 25 3 –3 26.
bers and quantifiers. Rutgers University and Keller, H. (1 95 5 ). Teacher: Anne Sullivan Macy.
University of Pennsylvania. Westport, CT: Greenwood Press.
Imai, M., & Mazuka, R. (1 997). A crosslinguistic
Kellman, P. (1 996). The origins of object percep-
study on the construal of individuation in lin-
tion. In R. Gelman, & T. Au (Eds.), Percep-
guistic and non-linguistic contexts. Paper pre-
tual and Cognitive Development (pp. 3 –48). San
sented at the Biannual Meeting of the Society
Diego: Academic Press.
for Research in Child Development. Washing-
ton, DC. Krifka, M. (1 995 ). Common nouns: A contrastive
analysis of Chinese and English. In G. Carl-
Imai, M. (2000). Universal ontological knowl-
son, & F. J. Pelletier (Eds.), The Generic Book
edge and a bias toward language-specific cat-
(pp. 3 98–41 1 ). Chicago & London: University
egories in the construal of individuation. In S.
of Chicago Press.
Niemeier, & R. Dirven (Eds.), Evidence for Lin-
guistic Relativity (pp. 1 3 9–1 60). Amsterdam: Kuhl, P. K., & Miller, J. D. (1 978). Speech per-
Benjamins. ception by the chinchilla: Identification func-
tions for synthetic VOT stimuli. Journal of
Imai, M., & Gentner, D. (1 997). A crosslinguis-
the Acoustical Society of America, 63 , 905 –
tic study of early word meaning: Universal on-
91 7.
tology and linguistic influence. Cognition, 62 ,
1 69–200. Kuhl, P., Williams, K., Lacerda, F., Stevens, K.,
& Lindblom, B. (1 992). Linguistic experience
Imai, M., Gentner, D., & Uchida, N. (1 994). Chil-
alters phonetic perception in infants by six
dren’s theories of word meaning: The role of
months of age. Science, 2 5 5 , 606–608.
shape similarity in early acquisition. Cognitive
Development, 9, 45 –76. Landau, B., & Gleitman, L. (1 985 ). Language and
Experience: Evidence from the Blind Child. Cam-
Jackendoff, R. (1 990). Semantic Structures. Cam-
bridge, MA: Harvard University Press.
Landau, B., Smith, L., & Jones, S. (1 998). The
Jameson, D., & Hurwich, L. M. (1 978). Dichro-
importance of shape in early lexical learning.
matic color language: “reds” and “greens” do not
Cognitive Development, 3 , 299–3 21 .
look alike but their colors do. Sensory Processes,
2 , 1 46–1 5 5 . Learmonth, A., Nadel, L., & Newcombe, N.
Joshi, A. (1 985 ). How much context-sensitivity is (in press). Children’s use of landmarks: Im-
necessary for assigning structural descriptions: plications for modularity theory. Psychological
Tree adjoining grammars. In D. Dowty, L. Kart- Science.
tunen, & A. Zwicky (Eds.), Natural Language Leslie, A., & Keeble, S. (1 987). Do six-month-
Parsing. Cambridge, UK: Cambridge Univer- old infants perceive causality? Cognition, 2 5 ,
sity Press. 265 –288.
Jusczyk, P. (1 985 ). On characterizing the devel- Levelt, W. (1 989). Speaking: From Intention to Ar-
opment of speech perception. In J. Mehler, & ticulation. Cambridge, MA: MIT Press.
Levinson, S. C. (1 996). Frames of reference and tifacts. Journal of Memory and Language, 40,
Molyneux’s question: Crosslinguistic evidence. 23 0–262.
In P. Bloom, M.,Oederson, L. Nadel & M. Mandler, J. (1 996). Preverbal representation and
Garrett, Language and Space (pp. 1 09–1 69). language. In P. Bloom, M. Peterson, L. Nadel,
Cambridge, MA: MIT Press. & M. Garrett (Eds.), Language and Space (pp.
Levinson, S. C., Kita, S., & Haun, D. (2002). Re- 3 65 –3 84). Cambridge, MA: MIT Press.
turning the tables: Language affects spatial rea- Markman, E., & Hutchinson, J. (1 984). Children’s
soning. Cognition, 84, 1 5 5 –1 88. sensitivity to constraints on word meaning:
Li, P. & Gleitman, L. (in prep.). Language and Taxonomic versus thematic relations. Cognitive
spatial reasoning. Psychology, 1 6, 1 –27.
Li, P., & Gleitman, L. (2002). Turning the tables: Mazuka, R., & Friedman, R. (2000). Linguis-
Spatial language and spatial cognition. Cogni- tic relativity in Japanese and English: Is lan-
tion, 83 , 265 –294. guage the primary determinant in object
Li, P., Gleitman, H., Gleitman, L., & Landau, B. classification? Journal of East Asian Linguistics,
(1 997), Spatial language in Korean and English. 9, 3 5 3 –3 77.
Proceedings from the 1 9 th Annual Boston Uni- McDonough, L., Choi, S., & Mandler, J. M.
versity Conference on Language Development. (2003 ). Understanding spatial relations: Flexi-
Somerville: Cascadilla Press. ble infants, lexical adults. Cognitive Psychology,
Liberman, A. M. (1 970). The grammars of speech 46, 229–25 9.
and language. Cognitive Psychology, 1 , 3 01 –3 23 . Munnich, E., & Landau, B. (2003 ). The effects
Liberman, A.M, Cooper, F. S., Shankweiler, D. P., of spatial language on spatial representation:
& Studdert-Kennedy, M. (1 967). Perception of Setting some boundaries. In D. Gentner,
the speech code. Psychological Review, 74, 43 1 – & S. Goldin-Meadow (Eds.), Language in
461 . Mind, (pp. 1 1 3 –1 5 5 ). Cambridge, MA: MIT
Press.
Lidz, J., Gleitman, H., & Gleitman, L. (2002).
Understanding how input matters: Verb learn- Munnich, E., Landau, B., & Dosher, B. A. (2001 ).
ing and the footprint of universal grammar. Spatial language and spatial representation:
Cognition, 87, 1 5 1 –1 78. A cross-linguistic comparison. Cognition, 81 ,
1 71 –207.
Locke, J. (1 690/1 964). A. D. Woozley (Ed.).
An Essay Concerning Human Understanding. Naigles, L., & Terrazas, P. (1 998). Motion-verb
Cleveland: Meridian Books. generalizations in English and Spanish: Influ-
ences of language and syntax. Psychological Sci-
Loewenstein, J., & Gentner, D. (Submitted
ence, 9, 3 63 –3 69.
2003 ). Relational language and the de-
velopment of relational mapping. Cognitive Nunberg, G. (1 978). The Pragmatics of Reference.
Psychology. Bloomington, IN: Indiana University Linguis-
tics Club.
Lucy, J. (1 992). Grammatical Categories and
Cognition: A Case Study of the Linguistic Rel- Papafragou, A, Li, P., Choi, Y., & Han,
ativity Hypothesis. Cambridge, UK: Cambridge C. (in preparation). Evidentiality and the
University Press. language/cognition interface.
Lucy, J., & Gaskins, S. (2001 ). Grammatical cat- Papafragou, A., & Musolino, J. (2003 ). Scalar
egories and the development of classification implicatures: Experiments at the semantics-
preferences: A comparative approach. In M. pragmatics interface. Cognition, 86, 1 5 3 –1 82.
Bowerman, & S. Levinson (Eds.), Language Ac- Papafragou, A., Massey, C., & Gleitman, L.
quisition and Conceptual Development (pp. 25 7– (2002). Shake, rattle ‘n’ roll: the representation
283 ). Cambridge, UK: Cambridge University of motion in language and cognition. Cognition,
Press. 84, 1 89–21 9.
Malt, B., Sloman, S., & Gennari, S. (2003 ). Papafragou, A., Massey, C., & Gleitman, L.
Universality and language specificity in object (2003 ). Motion event conflation and clause
naming. Journal of Memory and Language, 49, structure. To appear, Proceedings of the 3 9 th
20–42. Annual Meeting of the Chicago Linguistics So-
Malt, B., Sloman, S., Gennari, S., Shi, M., & ciety. University of Chicago.
Wang, Y. (1 999). Knowing versus naming: Sim- Pederson, E., Danziger, E., Wilkins, D., Levin-
ilarity and the linguistic categorization of ar- son, S., Kita, S. & Senft, G. (1 998). Semantic
typology and spatial conceptualization. Lan- Slobin, D. (2003 ). Language and thought online:
guage, 74, 5 5 7–5 89. Cognitive consequences of linguistic relativity.
Peña, M., Maki, A., Kovacic, D., Dehaene- In D. Gentner & S. Goldin-Meadow (Eds.),
Lambertz, G., Koizumi, H., Bouquet, F. et al. Language in Mind: Advances in the Investigation
(2003 ). Sounds and silence: An optical topog- of Language and Thought (pp. 1 5 7–1 91 ). Cam-
raphy study of language recognition at birth. bridge, MA: MIT Press.
Proceedings of the National Academy of Science Smith, L. (2001 ). How domain-general processes
U S A, 1 00(20), 1 1 702–5 . may create domain-specific biases. In M. Bow-
Pinker, S. (1 984). Language Learnability and Lan- erman, & S. C. Levinson (Eds.), Language Ac-
guage Development. Cambridge, MA: Harvard quisition and Conceptual Development (pp. 1 01 –
University Press. 1 3 1 ). Cambridge, UK: Cambridge University
Quine, W. V. O. (1 960). Word and Object. Cam- Press.
bridge, MA: MIT Press. Smith, L., Colunga, E., & Yoshida (2001 ). Making
Quinn, P. (2001 ). Concepts are not just for an ontology: Cross-linguistic evidence. In D.
objects: Categorization of spatial relational in- Rakison, & L. Oakes (Eds.), Early Category and
formation by infants. In D. Rakison, & L. Oakes Object Development: Making Sense of the Bloom-
(Eds.), Early Category and Object Development: ing, Buzzing Confusion (pp. 275 –3 02). Oxford,
Making Sense of the Blooming, Buzzing Con- UK: Oxford University Press.
fusion. Oxford, UK: Oxford University Snedeker, J., Thorpe, K., & Trueswell, J. (2001 ).
Press. On choosing the parse with the scene: The
Restle, F. (1 95 7). Discrimination of cues in role of visual context and verb bias in ambigu-
mazes: A resolution of the place-vs.-response ity resolution. Proceedings of the 23 rd Annual
question. Psychological Review, 64, 21 7–228. Conference of the Cognitive Science Society.
Mahwah, NJ: Erlbaum.
Roberson, D., Davies, I., & Davidoff, J. (2000).
Color categories are not universal: Replica- Soja, N., Carey, S., & Spelke, E. (1 991 ). Onto-
tions and new evidence from a stone-age cul- logical categories guide young children’s induc-
ture. Journal of Experimental Psychology: Gen- tions of word meaning: Object terms and sub-
eral, 1 2 9, 3 69–3 98. stance terms. Cognition, 3 8, 1 79–21 1 .
Rosch, E., Mervis, C. B., Gray, W. D., Johnson, Spelke, E., & Tsivkin, S. (2001 a). Language and
D. M., & Boyes-Braem, P. (1 976). Basic objects number: A bilingual training study. Cognition,
in natural categories. Cognitive Psychology, 8, 78, 45 –88.
3 82–43 9. Spelke, E., & Tsivkin, S. (2001 b). Initial knowl-
Sapir, E. (1 941 ). In L. Spier, Language, cul- edge and conceptual change: space and num-
ture and personality: Essays in memory of ber. In M. Bowerman, & S. C. Levinson (Eds.),
Edward Sapir. Menasha, WI: Memorial Pub- Language Acquisition and Conceptual Devel-
lication Fund. Cited in Whorf (1 95 6, Ref. 1 43 , opment (pp. 70–1 00). Cambridge, UK: Cam-
p. 1 3 4). bridge University Press.
Senghas, A., Coppola, M., Newport, E., & Spelke, E., Breinliger, K., Macomber, J., &
Suppala, T. 1 997. Argument structure in Jacobson, K. (1 992). The origins of knowledge.
Nicaraguan Sign Language: The emergence Psychological Review, 99, 605 –63 2.
of grammatical devices. In Proceedings of Sperber, D., & Wilson, D. (1 986). Relevance: Com-
BUCLD 21 . Somerville: Cascadilla Press. munication and Cognition. Cambridge, MA:
Slobin, D. (1 996). From ‘thought and language’ Harvard University Press.
to ‘thinking for speaking’. In J. Gumperz, & S. Talmy, L. (1 985 ). Lexicalization patterns: Seman-
C. Levinson (Eds.), Rethinking Linguistic Rela- tic structure in lexical forms. In T. Shopen
tivity (pp. 70–96). Cambridge, UK: Cambridge (Ed.), Language Typology and Syntactic Descrip-
University Press. tion (pp. 5 7–1 49). New York: Cambridge Uni-
Slobin, D. (2001 ). Form-function relations: How versity Press.
do children find out what they are? In M. Bow- Tomasello, M. (2000). Do young children have
erman, & S. Levinson (Eds.), Language Acqui- adult syntactic competence? Cognition, 74,
sition and Conceptual Development (pp. 406– 209–25 3 .
449). Cambridge, UK: Cambridge University Vallortigara, G., Zanforlin, M., & Pasti, G.
Press. (1 990). Geometric modules in animals’ spatial
representations: A test with chicks. Journal of on infants’ inductive inferences about nonob-
Comparative Psychology, 1 04, 248–25 4. vious object properties. Child Development, 72 ,
Varley, R., & Siegal, M. (2000). Evidence for cog- 1 65 3 –1 673 .
nition without grammar from causal reasoning Werker, J., & Logan, J. (1 985 ). Cross-language ev-
and ‘theory of mind’ in an agrammatic aphasic idence for three factors in speech perception.
patient. Current Biology, 1 0, 723 –726. Perception and Psychophysics, 3 7, 3 5 –44.
Waxman, S. R., Senghas, A., & Benveniste, S. Werker, J., & Tees, R. (1 984). Cross-language
(1 997). A cross-linguistic examination of the speech perception: Evidence for perceptual re-
noun-category bias: Its existence and specificity organization during the first year of life. Infant
in French- and Spanish-speaking preschool- Behavior and Development, 7, 49–63 .
aged children. Cognitive Psychology, 43 , 1 83 – Whorf, B. L. (1 95 6). J. Carroll (Ed.). Language,
21 8. Thought and Reality. Cambridge, MA: MIT
Waxman, S., & Markow, D. (1 995 ). Words as in- Press.
vitations to form categories: Evidence from 1 2- Wittgenstein, L. (1 922). D. F. Pears (Ed.). Trac-
to 1 3 -month-old infants. Cognitive Psychology, tatus Logico-Philosophicus. London: Routledge,
2 9, 25 7–3 02. 1 981 .
Welder, A. N., & Graham, S. A. (2001 ). The in- Xu, F. (2002). The role of language in acquiring
fluence of shape similarity and shared labels object kind concepts. Cognition, 85 , 223 –25 0.
CHAPTER 2 7
Paradigms of Cultural Thought
Patricia M. Greenfield
Two Paradigms of Thought: Western and Wolof school children were dis-
Phenomena, Theory, and playing psychological mindedness; they dis-
Methodology tinguished between their own thought or
statement about something and the thing
In 1 963 , Jerome Bruner gave me the chance itself. In contrast, the unschooled Wolof
of a lifetime – to go to Senegal to do my dis- children were not making this distinction.
sertation on relations between culture and They were assuming the world on one plane
the development of thought. While there I with thought and object of thought as one
made an unexpected discovery, one that led unified reality.
me into two radically different paradigms of I am going to use this difference to pro-
cultural thought. I found that unschooled vide some historical background for the the-
Wolof children, participating in a classic Pi- oretical theme of this chapter – that there are
agetian conservation task, were unable to re- two major paradigms of cultural thought, an
ply to the question, “Why do you think (or individualistic one and a collectivistic one,
say) this glass has more (or equal) water?”; and that each is part of a larger pathway
yet they quickly answered an alternative of development that encompasses the social
form of the question: “Why does this glass as well as the cognitive (Greenfield et al.,
have more (or equal) water?” (Greenfield, 2003 ). Although this theme leads to a very
1 966). U.S. or Swiss children, of course, selective review of research on culture and
had no difficulty in understanding the first thinking, it also provides theoretical coher-
question. Neither did Wolof schoolchildren. ence for a diverse body of literature.
What did this difference mean? At first this I took the terminology of individual-
seemed to be a methodological problem. ism and collectivism from anthropologists
Later I realized it was a reflection of deep Florence Kluckhohn and Fred Strodtbeck’s
differences in cultural psychology: In pro- pathbreaking 1 961 book, Variations in Value
viding a reason for their thoughts or words, Orientation. For me, collectivism was a world
663
view in which people were more connected possible a natural experiment. Some chil-
both to each other and to the physical world dren went to school; others, even from the
than in the individualistic worldview. The same families, did not. There was no selec-
terminology was not perfect and continues tion for school attendance on the basis of in-
to be problematic (e.g., Oyserman, Coon, & telligence. We therefore could see what dif-
Kemmelmeier, 2002). The important point ference school made. Indeed, it suppressed
for me, however, was that a worldview the action reasons for inequality judgments
and a value system had significant cognitive with what we called at the time “astonish-
implications. ing absoluteness”; there was not one instance
The intrinsic connectedness of the phys- among all the school children, either in the
ical and social worlds for our unschooled village or in the capital city of Dakar (Green-
Wolof participants was substantiated by the field & Bruner, 1 966/1 969). This was my sec-
distinctive causal reasoning of unschooled ond hint that school functions to create an
children who had not yet attained conser- individualistic psychology. One route to this
vation. Children who believed the quantity effect might be that, in school, one is always
of water had changed after the experimenter being asked to give reasons for things. At
transfered it to a taller, thinner beaker (or di- the time, however, my best candidate was
vided it into several smaller beakers) would literacy, introduced into the oral Wolof cul-
often say that the amount had changed be- ture by the school, of French colonial ori-
cause “you poured it.” This justification con- gin. In the written word, a thought clearly
trasted with the more usual perceptual rea- has a separate physical manifestation from
sons I had seen in the United States – for its referents in the real world; this could be
example, the amount has changed because the beginning of understanding self as sep-
“the water is higher.” At first, I thought that arate from world and thought as separate
“a natural phenomenon was being explained from its referent (Greenfield, 1 972/1 975 ).
by attributing special, magical powers to But the finding also shows that worldviews
intervening human agents” (Greenfield & are not immutable; they are constructed
Bruner, 1 966/69). But then we realized this by experience.
was an ethnocentric interpretation. We drew Finally, a learning experiment helped us
upon Kohler (1 93 7/1 962), who points out analyze further the thought processes of the
that such phenomena are made possible by unschooled children. We devised a proce-
a worldview, dure in which the child, rather than the ex-
perimenter, first transfered the water from
in which animate and inanimate phenom- one beaker to a taller, thinner one, then to six
ena occupy a single plane of reality. That is,
tiny ones. We thought that the child might
the child in the conservation experiment is
be willing to attribute powers to an author-
faced with the following sequence of events:
(1 ) water a certain way, (2 ) experimenter’s ity figure that he was was not willing to at-
action, (3 ) water changed. When the child tribute to himself. Indeed, at all ages (from
says the amount is not the same because six to thirteen), conservation performance
the experimenter poured it, he is basing was much better when the child poured than
his causal inference on contiguity – the when the experimenter poured, and there
usual procedure even in our society. But un- was good transfer of the conservation judg-
der ordinary circumstances, we would ac- ment to posttests in which the experimenter
cept an explanation in terms of contiguous did the pouring (Greenfield, 1 966). We con-
physical events or contiguous social events, cluded that the experimenter as authority
but not a causal chain that included both
figure was considered to have causal power
kinds of event. Thus “magic” only exists
to change the amount of water. Once the
from the perspective of a dualistic ontology.
(Greenfield & Bruner, 1 969, p. 63 9). child had a chance to “do-it-himself or her-
self,” the powers of the experimenter were
The presence of a school in the bush vil- somehow diminished. Only recently have
lage where I worked, Taiba N’Diaye, made I come to realize that the action reason
paradigms of cultural thought 665
for inequality, reflecting the importance of day life of children and their caregivers. She
social authority, is also part of the collectivis- found clues, for example, in adult interpreta-
tic worldview. tions of the child’s developing motor capac-
We connected these patterns of thought ities. Whereas we, in the United States or
to early Wolof socialization on the one hand France, would get excited about the child’s
and to African philosophy on the other. First first step as an index of developing skill and
we reasoned as follows: even independence, a Wolof mother would
likely interpret it as signifying the child’s de-
It may be that a collective, rather than in- sire in relation to a person in his surrounding;
dividual, value orientation develops where for example, she might say something like
the individual lacks power over the physi- “Look, he’s walking toward you!” (Rabain-
cal world. Lacking personal power, he has
Zempléni, 1 965 ).
no notion of personal importance. In terms
of his cognitive categories, now, he will be Thus, adult interpretation of the child’s first
less likely to set himself apart from others actions would seem to be paradigmatic for
and the physical world, he will be less self- the choice between an individualistic and
conscious at the same time that he places a collective orientation; a social interpre-
less value on himself. Thus, mastery over tation of an act not only relates the actor
the physical world and individualistic self- to the group but also relates the group, in-
consciousness will appear together in a cul- cluding the actor, to physical events. When
ture, in contrast to a collective orientation on the other hand, acts are given an inter-
and a . . . world view in which people’s at- pretation in terms of motoric competence,
titudes and actions are not placed in sep- other people are irrelevan and, moreover,
arate conceptual pigeonholes from physi- the act is separated from the motivations,
cal events. (Greenfield & Bruner, 1 969, interntions, and desires of the actor himself.
p. 640). (Greenfield & Bruner, 1 969, p. 641 )
Indeed, I had noted that the unschooled Such selective interpretations serve an im-
Wolof children had never spontaneously ma- portant socializing function: They expose
nipulated the materials in the conserva- the child to what is considered important in
tion experiment. I saw this as indicative of a particular culture.
the absence of a sense of power over the Rabain also found the first clues that col-
physical world. lectivism was associated with de-emphasis
of the world of objects. She noted that ma-
nipulation of objects was an occasional and
The Importance of Ethnography
secondary activity for the Wolof child from
Was there a developmental reason in early two to four years and that self-image rested
socialization for this dichotomy between in- more on power over people than power
dividual mastery over the physical world and over objects. She noted further that verbal
a collectivistic value orientation? I turned to exchanges between adults and children of-
the anthropological method of ethnography ten concerned valued relations between peo-
to find out. Ethnography is often defined in ple but rarely concerned explanations of the
anthropology as participant observation; in physical world (Rabain-Zempléni, 1 965 ).
the course of developing an appropriate par- Because scientific thinking is so linked to the
ticipant role or roles in a real-life cultural world of objects, this was a clue that col-
setting, the researcher is able to record, tra- lectivistic world view might privilege social
ditionally by means of in-depth field notes, thinking, thinking about people and their
everyday life and discourse relevant to a par- relations, over scientific thinking. Later re-
ticular topic or multiple topics. search has confirmed this paradigm of early
My colleague and friend in Senegal, socialization for a world that emphasizes
Jacqueline Rabain, working on an ethno- thinking about people rather than things
graphic dissertation for the Sorbonne, found (Greenfield et al., 2003 ). It contrasts greatly
some clues to early socialization in the every- with a paradigm that emphasizes learning to
manipulate and understand objects, in the patterns found in my experiments. Equally

form of toys, from early infancy on (Green- fascinating were parallels on the broader cul-
field et al., 2003 ). tural level of social ideology. Aimée Césaire
Most intriguing, because it related di- had developed a concept of négritude or
rectly to my conservation experiment, was blackness, a worldview that distinguished
Rabain’s observation that, in the everyday Black values from White. In opposition
situation of sharing a quantity among several to the individualism of European cultures,
persons (a situation not too different from négritude emphasizes “solidarity, born of the
the second half of my conservation experi- cohesion of the . . . clan” (Kestelhoof, 1 962).
ment, in which a quantity of water was di- The poet and president of Senegal, Leopold
vided among six breakers), Wolof bush chil- Senghor, defined négritude as “participation
dren pay more attention to who receives of the subject in the object, participation
what, when, than to the amount received of the man in cosmic forces, communion
(Rabain-Zempléni, 1 965 ). It parallels the of man with all other men” (Monteil, 1 964,
“magical” action reason for nonconservation: p. 3 1 , my translation). This formulation of
More attention is focused on the person social and cultural ideology looked like my
pouring, the social aspect of the situation, experimental results in Senegal writ large!
than on the purely physical aspect, the It was therefore not surprising that
amount of water. This observation could cultural world view also permeated the
also explain why Wolof children in Senegal second cognitive domain of my dissertation
achieved conservation in the standard ex- research in Senegal, the development of cat-
periment later than children in the United egorization. If unschooled Wolof children
States or Switzerland. were assuming that the world exists on one
This work illustrates the way in which plane, with thought and object of thought
ethnography can complement experiments as one unified reality, then it followed
to deepen understanding of paradigms of that the notions of individual viewpoints
cultural thought. Ethnography has a very and different points of view would also
special role to play because it introduces be meaningless. Data from a study of
cultural interpretations of behavior – it re- picture categorization were relevant to this
veals that the very same behavior can have implication (Greenfield, Reich, & Olver,
an opposite meaning in two different cul- 1 966/1 972). Children of different ages were
tural settings. In a sense, when we do ex- given triads of pictures and asked to pick the
periments in the United States, we already pair that was most alike. After unschooled
have done our ethnography. Because we are Wolof children had selected a pair, the
members of the society, we have a good idea pictures were replaced and the participants
of the cultural meaning of our results. This were asked to find two different pictures
is not the case when we study a culture dif- from the same set that were also alike. In
ferent from our own. Ethnography also con- fact, each set of three images had been
nects our findings in the laboratory to the designed to have three bases of similarity –
real world phenomena of everyday life. Fi- form, function, and color. But unschooled
nally, because cultural values are implicit in Wolof children did not find a second basis
the very design of our experiments, often of similarity; they saw the stimuli from only
without our realizing it, ethnography is re- one point of view. Researchers working in
quired to design valid cross-cultural experi- other parts of the nonindustrial world found
ments. We omit this first ethnographic stage parallel results (Cole et al., 1 971 ; Irwin &
of cross-cultural research at our own peril, McLaughlin, 1 970). Thus, categorization
as the reader will see later in this chapter. behavior also revealed indications of taking
for granted a single perspective on the
world. (See Goldstone & Son, Chap. 2, for a
The Level of Social Ideology
review of theories of similarity; and Medin
Rabain’s ethnography did not uncover only & Rips, Chap. 3 , for a review of studies of
socialization antecedents to the thinking categorization.)
In their landmark 1 974 book, Culture and uations are diagnostic of particular kinds of
Thought, Cole and Scribner noted the need cognitive capacities or processes.
for integrative theory “to pull together a va- 2 . Psychological processes are treated as “en-
riety of disconnected experiments” (Cole & tities” which a person “has” or “does not
Scribner, 1 974, p. 1 72). I did not realize have.” In other words, they are considered
that the two paradigms of thought I had a property of the person rather than the sit-
stumbled upon in Senegal formed the ba- uation.
(Cole & Scribner, 1 974, p. 1 73 ).
sis of such an integrative theory. Data on
culture and thought that could later be in-
serted into this larger framework contin- There is another problem in this story that
ued to accumulate. Like my problem in also can be considered methodological –
developing questions that were meaningful the ethnocentrism of the criteria for “cor-
to elicit reasoning in a conservation exper- rect” sorting. Such methodological prob-
iment, many of the findings were initially lems led Cole and Scribner (1 974) to rec-
seen as methodological barriers to be over- ommend that researchers take into account
come rather than as deep cultural differences “knowledge about the culture and behav-
in cognitive functioning. ior of the people gained from the work of
Let me give an example from Cole et al. anthropologists, linguists, and other social
(1 971 ). These researchers took a categoriza- scientists.” (Ref. 8, p. 1 96). They went a
tion task to Liberia, where they presented step “further in suggesting that the meth-
it to their Kpelle participants. This task in- ods of these relevant fields need to be in-
volved a set of 20 objects that divided evenly tegrated. . . . Field and laboratory, anthropo-
into the linguistic categories of foods, imple- logical observation and psychological ex-
ments, food containers, and clothing. When perimentation, can yield knowledge from
asked to group objects that were similar, the different perspectives about the same func-
Kpelle participants did not do the taxonomic tion” (Ref. 8, p. 1 96). We already have seen
sorts expected by the researchers. Instead this advice in action; collection of both qual-
participants consistently made functional itative and quantitative data is part of the
pairings (Glick, 1 968). For example, rather methodological armoire of the cultural psy-
than sorting objects into groups of tools and chologist (Greenfield, 1 997a).
foods, participants would put a potato and But the problems of “wise” and “fool-
a knife together because “you take the knife ish” sorting also get to the substantive heart
and cut the potato” (Cole et al., 1 971 , p. 79). of the collectivistic paradigm of cognition.
According to Glick, participants often jus- From the vantage point of a collectivis-
tified their pairings by stating “that a wise tic worldview, I would submit that the
man could only do such and such” (Glick, “wise man’s” pairings were of social util-
1 968, p. 1 3 ). In total exasperation, the re- ity, whereas the “foolish man’s” taxonomic
searchers “finally said, ‘How would a fool do groupings of five items each were socially or
it?’ The result was a set of nice linguistically pragmatically useless. I believe that is why,
ordered categories – four of them with five for the Kpelle, a wise man would make func-
items each” (Glick, 1 968, p. 1 3 ). tional pairs, whereas only a fool would make
From the methodological perspective of taxonomic sorts.
a cognitive psychologist, the researchers had This analysis leads us to an even deeper
failed to tap into the participants’ obvious level of cultural definitions of intelligence:
competence in categorization with their first In the Kpelle example, the researchers’
procedure. This example illustrates what criterion for intelligent behavior was the
Cole and Scribner (1 974) viewed as two participants’ criterion for foolish; the partic-
general problems in the cross-cultural study ipants’ criterion for wise behavior was the
of thought: researchers’ criterion for stupid (Greenfield,
1 997b). Underlying these interpretations of
1 . There is a great readiness to assume that the experiment are different ethnotheories,
particular kinds of tests or experimental sit- that is, folk theories of intelligence. Most
profoundly, our theories of what kind of explorations of intelligence concepts in dif-

thought is worth studying are very much ferent cultures began to appear (Dasen & de
influenced by our ethnotheories of what Ribeaupierre, 1 987; Serpell, 1 994; Sternberg
constitutes intelligent behavior. And what et al., 1 981 ; Wober, 1 974); all challenged the
constitutes intelligent behavior depends on assumption that technological or scientific
what is adaptive and valued in a particular intelligence was a universal endpoint of de-
ecocultural environment. The investigation velopment (Greenfield, 1 974). Indeed, so-
of ethnotheories of intelligence proved to cial intelligence turned out to be the pre-
greatly deepen understanding of cultural dominant ideal in Africa and Asia (e.g.,
paradigms of thought (see Sternberg, Chap. Wober, 1 974; Super, 1 983 ; Dasen, 1 984; Gill
3 1 , for further discussion of intelligence). & Keats, 1 980; Serpell, 1 994; Nsamenang &
Lamb, 1 994; Grigorenko et al., 2001 ). Intel-
ligence in all these investigations includes a
concern with responsible ways of contribut-
Theories and Ethnotheories ing to the social world. The central feature
of Intelligence of the Baoulé concept of intelligence in Ivory
Coast, West Africa, for example, is willing-
Clearly, human intelligence and the brain ness to help others (Dasen 1 984). In gen-
structure that supports it are keys to our eral, African cultures not only emphasize
adaptation as a species. Yet within this broad social intelligence but also see the role of
rubric of human intelligence, different forms technical skills as a means to social ends
of intelligence are valued and adaptive in (Dasen 1 984). This sort of ethnotheory of in-
different ecocultural niches. Mundy-Castle telligence could explain why the taxonomic
(1 974/1 976) contrasted technological intel- sorter was a foolish man in Kpelle eyes.
ligence, which is more developed in the As a group, such conceptions can be seen
independent, individualist characteristic of as collectivistic conceptions of intelligence
Europe, and social intelligence, which is (Segall et al., 1 999). Note that these con-
more developed in the interdependent, col- ceptions are not all-or-none. Differences to
lectivist characteristic of Africa. Closely re- a great extent, are a matter of differen-
lated to technological intelligence (and per- tial priorities. At the same time, there is
haps indistinguishable from it) is scientific not one collectivistic conception of intelli-
intelligence. Indeed, underlying Piaget’s the- gence, nor a single individualistic concep-
ory of cognitive development is a theory tion of intelligence. There are cross-cultural
of intelligence as scientific thinking (Green- surface variations for each underlying theme
field, 1 974). By his own admission, un- (Greenfield, 2000).
derstanding the basis for Western scientific
thought was Piaget’s most fundamental the-
oretical concern (Piaget, 1 963 /1 977). Under
Inhelder’s leadership, Piaget investigated the Who and What Are the Individualists
development of scientific thought (chem- and Collectivists?
istry and physics) in a set of experimental
studies (Inhelder & Piaget, 1 95 8). This body This is perhaps the place to stop and de-
of theory and research implies the impor- fine who are the individualists and who
tance of scientific intelligence as a develop- are the collectivists. In doing so, I will not
mental goal for processes of thinking. Sci- present a simple picture. Instead, I will dis-
entific or technological intelligence as a folk cuss ideal cases, in-between cases, culture
theory supports thinking skills that relate to change, biculturalism, and culture contact.
the world of things rather than people; this These complexities take me beyond simple
would include most of the items and subtests binary distinctions that have bothered some
of standardized intelligence tests. (Rogoff, 2003 ).
Following Mundy-Castle’s depiction of My nonbinary starting point is that all hu-
technological and social intelligence, related man beings are both individual and social.
What varies is the extent to which cultures these are the bottom line of this theoretical
try to maximize one or the other facet of paradigm.
the human experience. Correlated with this Who are the collectivists? Harry Trian-
maximization are different forms that the dis notes that they include 70% of the
social and the individual take within each world’s population – the populations of
paradigm. So, for example, social behavior Africa, Asia, Latin America, and Native
tends to be more automatic in the collec- America (Triandis, 1 989). Equally impor-
tivistic system and more by choice, providing tant, there are demographic, ecological, and
individual autonomy, in the individualistic historical factors that are inputs into the ex-
system. The other side of the maximization pressed value system. Some of the most im-
coin is the fact that the major mode of one portant demographic factors are economic
cultural paradigm may be the minor mode level [rich are more individualistic than poor
of the other. For example, in the society of (Segall et al., 1 999)], the urban–rural con-
the United States, we might see religions as trast [large-scale urban more individualistic
often emphasizing the communitarian in a than small-scale rural (Kim & Choi, 1 994)],
primarily individualistic surround. The uni- formal education [which functions as an in-
versal existence of both modes can be seen dividualizer (Reykowski, 1 994)], high tech-
in priming studies in which the minor mode nology [which functions as an individu-
(individualism in the case of Asians, collec- alizer (Mundy-Castle, 1 974)], immigration
tivism in the case of North Americans) can and migration (making people more indi-
be elicited by a relevant prime (Gardner, vidualistic), agricultural subsistence versus
Gabriel, & Lee, 1 999). commerce [the latter functioning as an indi-
It is also important to realize that we are vidualizer (Greenfield, Maynard, & Childs,
talking about cultural systems, not isolated 2003 ; Greenfield, 2004)], and religion (some
attributes (cf. Kitayama, 2002). The distri- are more individualistic; e.g., Protestantism,
bution of autonomy and obedience between others more collectivistic; e.g., Catholicism).
men and women in a collectivistic culture Indeed, it is useful to see the two
has been used as an argument against the paradigms as originating as adaptations to
very concept of collectivistic culture and for different ecologies. Demographic factors in-
the notion that autonomy and obedience are fluence ecology and, through ecology, they
individual difference variables rather than form psychologies. Thus, rich people do not
culture-level characteristics (Turiel, 2000). need to cooperate with a larger group for
In response to this argument, I note that their survival; poor people do. The urban en-
one essence of a collectivistic culture is re- vironment contains many strangers, and so
lations of obedience between women and community relations become less functional
men, clearly providing more autonomy for (Kim, 1 994). In formal education, the irre-
men than for women. Similarly, the rela- ducible unit of performance is the individual
tion of equality among individuals provides who must receive an individual grade and
more autonomy for both women and men performance evaluation (Greenfield, 1 994).
in an individualistic culture. It is not the Complex technology functions as an individ-
existence of autonomy that is important in ualizer in multiple ways – through providing
the characterization of a culture according large dwellings and office buildings with the
to the present paradigm; it is the pattern- opportunity for private space and through
ing that counts. Indeed, I would see the em- substituting interaction with a machine for
phasis on individuals as separate rather than interaction with people (e.g., television re-
as interrelated (the hallmark of psychology placing frequent face-to-face visits).
founded upon the independent individual When you immigrate to a new country or
as the unit of analysis) as an individualis- migrate to a new location within a country,
tic perspective on social science itself. Cul- you often leave extended family behind. As
ture as a system of relations, the patterning a consequence, a high rate of geographical
of attributes, the forms of individual and so- mobility should increase individualism. This
cial behavior, and the system of priorities – might be a reason why Europeans are less
individualistic than Americans. Note, too, advanced technology would be the paradig-
that nation states composed primarily of im- matic case on the collectivistic end of the
migrants at their founding – for example, spectrum. The large, mobile, rich, urban
the United States, Canada, and Australia – neighborhood with a high level of formal
are generally among the most individualistic schooling and advanced technology would
(Hofstede, 1 980; Oyserman, Coon, & Kem- be the paradigmatic case on the individual-
melmeier, 2002). istic end of the spectrum. Clearly, all other
In subsistence agriculture settings, all cases would fall between these extremes.
must cooperate to produce mainly perish- A particular type of in-between case is the
able goods. In a commercial setting, it is de- immigrant family who has come, most gen-
sirable to maximize the monetary resources erally, from a poorer, more collectivistic soci-
of an individual to accumulate nonperish- ety into a richer, more individualistic one. In
able consumer goods like cars or televi- general, such immigrants will be at a point
sions (Collier, 2003 ). Catholicism empha- between their compatriots in the ancestral
sizes the communal, including a pathway country and natives of the host country on
to God through another human being, the cognitive tasks that tap into individualis-
priest; Protestantism emphasizes the inde- tic and collectivistic paradigms of thought
pendent individual with a direct pathway to (Nisbett, 2003 ). In addition, we expect, as
God. It is interesting that, as commerce de- generations in the host country increase, the
velops in Mexico and Central America and host country culture will make an increas-
when immigrants come to the commercial ingly large mark on patterns of thought.
environment of the United States from the Because of the development of the world
more agricultural environment of Mexico, in the direction of a dense urban, com-
evangelical Protestantism has become much mercial, high-tech environment, there is a
more popular whereas Catholicism has de- worldwide movement toward increasing in-
clined in popularity. dividualism. Finally, because of high rates
It is also important to note that, because of immigration, there is also increasing con-
of all these factors, individualism and col- tact between more individualistic and more
lectivism are relative terms, their system- collectivistic cultures in the world. This of-
atic nature notwithstanding. If one tests ten leads to mismatches and misunderstand-
rural versus urban populations in the same ings. I will give an example of a cognitive
country (e.g., Mexico), one will usually find mismatch and misunderstanding later in the
the rural population to be more collectivistic chapter. But let me now turn to some ad-
(e.g., Madsen & Shapira, 1 970). On the other ditional thought processes in which the two
hand, if you compare Latino immigrant fam- paradigms manifest themselves, yielding in-
ilies in Los Angeles, an urban setting, and teresting cross-cultural differences.
Euro-American families in Los Angeles, the
urban Latino families will respond more
collectivistically than the Euro-Americans Thinking about People: Theory
(Raeff, Greenfield, & Quiroz, 2000). In of Mind
other words, the nature of these demo-
graphic variables is such as to make individ- Given what I had observed in Senegal con-
ualism and collectivism graded, rather than cerning the absence of a notion of point-of-
all-or-none systems. Because they are so cen- view, I became very skeptical when theory of
tral to adaptation, they are clearly very sen- mind became popular in cognitive develop-
sitive to environmental factors. ment research. The claims for universality
Multiple demographic factors create of the sort of calculus that requires a par-
paradigmatic cases on the extremes (H. ticipant to know, for example, what some-
Keller, personal communication, June, one knows a third party has said to a fourth
2003 ): The small, stable, poor, agrarian party (e.g., Does Mary know the ice-cream
village with an oral culture and without man has talked to John?”, Baron-Cohen,
1 989) seemed to involve too much differ- others and the capacity to encode the so-
entiation of viewpoints for children whose cial effects of one’s own and others’ action –
world view emphasized unity with the world provide the groundwork for two distinct
and those around them. I wanted to think cultural emphases in the development of
through the individualistic assumptions that person knowledge. Some cultures empha-
might be being made in this line of re- size the individual psyche, individual traits,
search and to think about what a col- and the individual intentions behind action
lectivistic alternative might look like. This (Vinden & Astington, 2000); other cultures
search eventuated in one section of an ar- emphasize the social effects and social con-
ticle, “Cultural Pathways through Univer- text of a person’s action (Duranti, 1 988,
sal Development” (Greenfield, et al., 2003 ), 1 993 ; Shweder & Bourne, 1 984; Fiske et al.,
which I present here. 1 998). The latter also see mind and heart
Understanding self and others is part as integrated rather than separate (Lillard,
of our universal evolutionary heritage 1 998: Zambrano, 1 999). We see the former
(Tomasello, 1 999; Whiten, 2002). The mir- as the individualistic emphasis and the latter
ror neuron system of the cerebral cortex as the collectivistic or sociocentric emphasis.
reveals a common neuromuscular activa- Most literature on theory of mind – the
tion for acting oneself and for understand- ability to think about other people’s mental
ing the actions of others (Fadiga et al., states – has assumed an emphasis on indi-
1 995 ; Iacoboni et al., 1 999). In ontogeny, vidual minds (Flavell, 1 999). I, however, see
the first step in understanding self and oth- theory of mind as a special culturally canal-
ers occurs at birth, when infants discrimi- ized case of person knowledge (cf. Hobson,
nate people from things (Trevarthen, 1 980). 1 993 ). I therefore review the literature in-
Comprehension of agency as the produc- dicating the existence of these two differ-
tion of goal-directed action begins in early ent cultural emphases – individual psyche
infancy (Gelman & Lucariello, 2002). An versus social effects or context – in the de-
ability to distinguish between self and oth- velopment of social understanding or person
ers as intentional agents develops at eight or knowledge.
nine months of age (Piaget, 1 95 2; Tomasello, Although it claims universality, I utilize
1 999; Trevarthen, 1 980). the classical literature on theory of mind to
At the one-word stage of language devel- complete the picture of the individualistic
opment (between one and two years of age), pathway to person knowledge. Early steps
infants code the intentional action not just of along this pathway have to do with the ac-
self but of others (Greenfield & Smith, 1 976; quisition of mentalistic terms; children as
Greenfield, 1 980), and this encoding seems young as twenty-two months first produce
to have ancient phylogenetic roots (Green- mentalistic terms such as know and pretend
field & Savage-Rumbaugh, 1 990; Greenfield (Wellman, 1 990). Later, the child is able to
& Lyn, in press). The linguistic encoding of imagine a mental state of affairs in another
intentional action becomes more complex person different from the information avail-
with age and the acquisition of language able to oneself (e.g., Perner, 1 991 ). Similar
(Bloom, Lightbown, & Hood, 1 975 ). At the trends occur in literate, developed countries,
same time, there is very early understand- both Western and non-Western (Wellman,
ing of the effects of action on other people. Cross, & Watson, 2001 ). The differentia-
Script knowledge, which begins in the section and individuation of people according
ond year of life, involves the understanding to their states of mind is basic to this devel-
of both intentions and effects of human ac- opmental pathway to social understanding.
tion (Gelman & Lucariello, 2002). It also re- In the other pathway, however, mental-
quires an understanding of the coordination istic terms are lacking in the lexicon, are
of action by more than one person. not understood in the same way as the En-
These two universal capacities – the ca- glish equivalents, and are not applied to one-
pacity to encode the intentions of self and self. This phenomenon has been found in a
number of subsistence ecologies (Greenfield a particular culture (LeVine, 1 997). “With

& Bruner, 1 966/1 969; Vinden, 1 996, 1 999). a collectivist or group orientation, personal,
As mentioned earlier, however, both school- mental, and emotional states are relatively
ing, with its demand for justifications, and unimportant” (Vinden & Astington, 2000,
literacy, with its separation of thought (on p. 5 1 2). In line with the notion that school
paper) from thinker, leads to an understand- ecology favors the development of atten-
ing of the mentalistic term think (Greenfield tion to the individual psyche, schooled chil-
& Bruner, 1 966/1 969). (See Lillard (1 998) dren performed better on several of the
for a cross-cultural review of the theory-of- tasks relating to predicting an individual’s
mind literature). behavior or emotion in a nonsocial situation
In a nonliterate subsistence ecology in (Vinden, 1 999).
Africa, children between two and four years On the other hand, in a culturally impor-
old were given a theory-of-mind task em- tant situation involving social responsibil-
bedded into a context of social action. In ad- ity, young children from small, face-to-face
dition, the task used the term heart rather societies with subsistence traditions show
than thought (Avis & Harris, 1 991 ). Un- advanced understanding of the knowledge
der these circumstances, Baka children in state and feelings of another person whose
southeast Cameroon showed the develop- knowledge differs from one’s own. In a suc-
ment of social understanding that had been cessful apprenticeship situation, the expert
found in the United States and Europe. must be aware of how much less the novice
The results contrasted strongly with another knows in comparison with self. The expert
study that (1 ) decontextualized the task, must also be aware of the novice’s need for
presenting it as a task involving only one materials and the novice’s motivations. In a
actual person, the subject; (2) asked about video study of naturalistic teaching interac-
the deceived’s thought rather than action tions, Zinacantec Maya children as young as
in reference to a hidden object; and (3 ) four years old were able to supply necessary
asked about mind rather than heart. Under materials and model tasks for their younger
these conditions, Quechua children between siblings (Maynard, 2002). They were also
about four and eight performed at chance able to provide useful verbal guidance in
levels (Vinden, 1 996). Somewhat more teaching, such as narrating a task they were
contextualized tasks led to somewhat im- demonstrating and giving commands to the
proved performance in subsistence groups younger child. By the age of eight, chil-
in Cameroon, West Africa (the Mofu), and dren were very adept at simplifying the
Papua New Guinea (the Tainae and Tolai) task for younger children by giving them
(Vinden, 1 999). parts of tasks, one at a time, and at scaf-
Meta-analysis indicates that, around the folding the task by providing complex ver-
world, children from subsistence cultures bal information. These advanced thinking
solve theory-of-mind tasks better when skills showed an understanding of the knowl-
these are presented in context (Wellman, edge state, material needs, and motivation of
Cross, & Watson, 2001 ). However, Vinden the younger children. Sibling caregiving as
(1 999) found a lag in age in all groups relative an important social responsibility may have
to children of European-derived cultures; played a role in the young children’s de-
false belief (the understanding that another sire and skill in teaching their younger sib-
person has been misled into believing that lings. Similar sibling teaching practices were
something is true that, in fact, is false) as- found in another sibling-caregiving culture –
sessed using the word “think” was at chance the Wolof of Senegal (Rabain-Jamin, May-
levels at all ages in the two groups most iso- nard, & Greenfield, 2003 ). Future research is
lated from the outside world of European needed to explore the relationship between
culture. the cognitive operations of person knowl-
Here we interpret a lag as indicating edge in sibling caregiving and in experimen-
that the skill in question is not valued in tal tasks.
Indeed, it may be culturally significant tudes (“personal belief that guns were an
that person knowledge has been measured important means to redress grievances”),
so frequently by false belief, the dominant and psychological problems (“a darkly dis-
theory-of-mind task. In a false-belief task, turbed man who drove himself to success
the participant must understand that an- and destruction,” “a psychological problem
with being challenged”). He asked his fel-
other person has a different perspective (the
low student Kaiping Peng what kinds of ac-
false belief) from his or her own. It is a
counts of the murder were being given in
task that requires individuation of one’s per- Chinese newspapers. They could scarcely
spective from that of another. Individuation have been more different. Chinese reporters
is an important component of the devel- emphasized causes that had to do with the
opment of the independent self. It may be context in which Lu operated. Explana-
that socialization in interdependent cultures tions centered on Lu’s relationships (“did
emphasizes shared perspectives more than not get along with his advisor,” “rivalry
different perspectives. Only future research with slain student,” “isolation from Chi-
can tell us whether this may be another nese community”), pressures in Chinese so-
reason for relatively poor performance on ciety (“victim of Chinese ‘Top Student’ edu-
cational policy”) and aspects of the Amer-
false-belief tasks in collectivistic, subsistence
ican context (“availability of guns in the
cultures.
U.S.”). (Morris & Peng, pp. 1 1 1 –1 1 2).
Ideally, cross-cultural comparison would
involve a developmental analysis of tasks Morris and Peng found the same pattern of
tapping into both of these cultural emphases differences when the incident involved a stu-
within the context of universal develop- dent from the United States. The Chinese
ments. A pioneering study of social explana- focused on the killer’s relation to context,
tion in India and the United States by Joan particularly social context, in explaining his
Miller (1 984) did exactly that: Children in behavior. U.S. reporters focused on quali-
both the United States and India improved ties of the individual. A whole series of ex-
at social explanation with age (the universal periments on causal attribution led to the
development). At the same time, children conclusion that “Westerners attend primar-
in the United States increasingly formulated ily to the focal . . . person and Asians attend
their social explanations of events in terms more broadly to the field and to the relations
of an individual’s stable traits (emphasis on between the object and the field” (Nisbett,
the individual psyche). Indian children, in 2003 , p. 1 27). Thus, a pattern of cultural
contrast, increasingly formulated their social differnces found in the developing child by
explanations in terms of contextual factors, Miller also show up in adulthood, the end-
particularly factors in the social surround point or outcome of development.
(emphasis on social context). Hong Kong is a setting in which two cul-
Miller’s findings were replicated in a real tures, one more collectivistic (Chinese) and
world situation by Morris and Peng (1 994). one more individualistic (British) coexist.
They found that when a Chinese physics Hong et al. (2000) showed the dynamism
student at the University of Iowa shot his of the bicultural mind in the arena of social
advisor and several other people after los- explanation. When primed with symbols of
ing an award competition, the reasons given Western culture (e.g., Mickey Mouse) in
were quite different in U.S. and Chinese an experiment concerning social explanation
newspapers: (participants had to explain why, in a pic-
ture, one fish was swimming in front of the
Michael Morris, a graduate student at
other fish), participants constructed more
Michigan at the time, noticed that the ex-
planations for Gang Lu’s behavior in the explanations in terms of individual motiva-
campus newpapers focused almost entirely tion. When primed with symbols of Chinese
on Lu’s presumed qualities – the mur- culture (e.g., with a dragon), participants
derer’s psychological foibles (“very bad tem- constructed more explanations in terms of
per,” “sinister edge to his character”), atti- the other fish or the context.
These same differences in thinking about egorization task, rather than sort by taxo-
people can affect a sense of one’s own con- nomic categories. Taxonomic categories, in
tinuity of self over time. Parallel to the contrast, revolve around a defining trait or
two modes of social explanation discov- traits of its members. These defining traits
ered by Miller, researchers Lalonde, Chan- are decontextualized from the social util-
dler, and Sokol (1 999) identify two cultural ity of the object or from other parts of the
modes of addressing the problem of self- physical world. We saw this same contrast
continuity over time in autobiographical nar- between an emphasis on inner traits that
ratives. This is the problem of how to expe- transcend context and contextualized expla-
rience and conceptualize a continuing self nation when we examined two paradigms of
in the presence of dramatic changes over social reasoning (Miller, 1 984).
the course of development. They term the If the Kpelle mode of categorization typi-
first model “an ‘Essentialist’ or ‘Entity’ no- fies a collectivistic worldview, then it should
tion of selfhood” (Ref. 5 5 , 1 999, p. 1 ); these appear in other collectivistic cultures. In-
narratives focus attention upon some aspect deed, this is the case. Ji, Zhang, and Nis-
of the self “that is thought to remain un- bett (2002) compared U.S. college students
touched by time and change” (Lalonde, Ref. with students from China and Taiwan on a
5 5 , 1 999, p. 1 ). The pathway of the inde- triadic test of categorization. In each triad
pendent, autonomous self requires a source (e.g., panda, monkey, banana), there were
of self-continuity that is functional in the two pictures that could be paired on the
face of separation from parents, the modal basis of taxonomic similarity (in this triad,
adolescent identity formation in the United panda and monkey are both animals), and
States and Canada. Internal essences or en- there were two that could be paired on the
tities would fill this requirement; this is basis of functional relationships (in this triad,
the way in which most non-Native Canadi- the monkey eats the banana). When asked
ans explain self-continuity (Chandler et al., which two of the three pictures were most
2003 ). And, as we have seen from Miller’s closely related, U.S. college students pre-
research, internal traits or essences are gen- ferred to group “on the basis of common cat-
erally used in causal attribution in the indi- egory membership: Panda and monkey fit in
vidualistic paradigm. to the animal category.” The Chinese par-
They call the second model a ticipants showed a preference for grouping
“relationship-centered” notion of self. on the basis of thematic relationships (e.g.,
It uses narrative to connect the self across monkey and banana) and justified their an-
different time periods. The narratives often swers in terms of relationships: “Monkeys
situate the speakers in family and com- eat bananas” (Ji, Zhang, & Nisbett, 2002,
munity relationships that continue across p. 1 40–1 41 ). This same cross-cultural differ-
various periods in the life cycle. This is ence developed in childhood (Chiu, 1 972).
the way most Native Canadians explain But, again, cultural preferences do not nec-
self-continuity (Lalonde, Chandler, & Sokol, essarily exclude the development of a minor
1 999). mode. Illustrating this point, a study by Wis-
niewski and Bassok (1 999) indicates that,
in the absence of a forced choice between
the taxonomic similarity and functional re-
Thinking about Things: lationships, U.S. college students can and do
Categories, Physical Relations, use both modes of thought as an implicit
and Social Relations basis for similarity judgments and other cog-
nitive operations.
A more collectivistic ethnotheory of intel- Perhaps the most basic difference be-
ligence that values relationships and social tween the two modes of thinking is the
utility can explain why the wise Kpelle per- collectivistic tendency to contextualize the
son would make functional pairs in a cat- world of objects in a web of social relations
versus the individualistic tendency to see the occur frequently in classrooms with immi-
world of physical objects as operating in its grant Latino students. But what is really hap-
own plane of reality. The former is what we pening here?
saw in the causal reasoning among the un- Our theoretical analysis rests on the fol-
schooled Wolof children; the latter is what lowing two points: What counts as thinking
we expect in the world of physical science. for the teacher is thinking about the phys-
These two modes of thinking about things ical world apart from the social world. It is
are socialized very early (Bakeman et al., the teacher’s definition of scientific thinking,
1 990; Clancy, 1 986; Fernald & Morikawa, and, in her mind, this is a science lesson.
1 993 ; Rabain, 1 979; Rabain-Jamin, 1 994; Her focus is on one part of her instruc-
Zempléni-Rabain, 1 973 ). tions, “Describe eggs.” The child, in con-
trast, is responding more to the other part
Cross-Cultural Conflict in What of the teacher’s instructions – “Think about
Counts as Thinking the times you have cooked and eaten eggs”
and, based on a different set of assump-
When families with a collectivistic cultural tions about what counts as thinking, focuses
heritage emmigrate to an individualistic soon the social aspect of her experience with
ciety, the two paradigms can come into eggs, in particular, a family experience. This
sharp conflict, particularly at school. Cul- is the first aspect of the misunderstanding
tural models not only have values attached and cultural mismatch between teacher and
to them – what counts as good and bad, what learner.
takes priority over what – but they also have The second aspect of the mismatch is that
epistemologies – what counts as knowledge. the child who was passed over is providing
These cultural models are so basic they nor- a narrative, also valued in her home culture,
mally remain implicit. As long as everyone whereas the teacher is expecting a simple
interacting in the same social world shares statement of fact. Implicitly, the teacher is
the same model, the implicit quality of the making Bruner’s distinction between narra-
models does not cause a problem. In fact, it tive thought and logical–scientific thought.
provides an underlying set of shared assump- Bruner’s analysis is very relevant here:
tions that makes social life – for example, life
in school – run smoothly. The next example There appear to be two broad ways in
is about what happens in a bicultural class- which human beings organize and manage
room when teachers and learners have differ- their knowledge of the world, indeed struc-
ent implicit understandings of what counts ture even their immediate experience: One
as thinking. seems more specialized for treating of phys-
ical “things,” the other for treating people
and their plights. These are conventionally
In a pre-kindergarten class, the teacher
known as logical–scientific thinking and
held an actual chicken egg. She asked
narrative thinking. (Bruner, 1 996, p. 3 9).
the children to describe eggs by think-
ing about the times they had cooked and
The child who talks about cooking and eat-
eaten eggs. One of the children tried three
times to talk about how she cooked eggs ing eggs with grandmother is responding in
with her grandmother, but the teacher the narrative mode; but the teacher expects
disregarded these comments in favor of the logical–scientific mode: “What are the
a child who explained how eggs are bare facts about eggs?” she wants to know.
white and yellow when they are cracked. Narrative is, in the dominant culture, associ-
(Greenfield, Raeff, & Quiroz, 1 996). ated with the humanities, logical–scientific
thought is associated with the sciences. As
The two features of this incident – the first Bruner says, the value of logical–scientific
child’s emphasis on a family-based story and thinking “is so implicit in our highly tech-
the teacher’s disregard and devaluation of nological culture that its inclusion in school
the child’s seemingly unscientific answer – curricula is taken for granted” (Bruner, 1 996,
p. 41 ). It is so taken for granted that, as the SUBJECT: The reason is that black deer al-
egg incident shows, the narrative mode be- ways walks about all day eating green
comes invisible to the teacher. leaves in the bush. When it rests for a while
it gets up again and goes to eat.
Logic (Cole et al., 1 971 , p. 1 87).
The same type of contrast applies to logi- In essence, this participant rejects the ab-
cal thought (see Evans, Chap. 8). Deductive stract, decontextualized structure of the log-
logic is intrinsically decontextualized from ical problem. This type of response was
its content (Nisbett et al., 2001 ; Nisbett, typical of a group of nonliterate Kpelle
2003 ). We therefore would expect it would adults. In line with our notion of school
be part of individualistic but not collec- as promoting an individualistic worldview,
tivistic habits of thought. Instead, a col- Kpelle high school students generally an-
lectivist might recontextualize a deductive swered the logical problems in the way the
problem. This phenomenon was first iden- researchers had in mind – as decontextual-
tified by Luria in the 1 93 0s with unedu- ized logical deductive problems.
cated Soviet peasants in Central Asia (Luria, Again, if this distinction is typical of the
1 971 ). Inspired by Luria, Cole et al. (1 971 ) two paradigms of thought, it should apply
gave such problems to nonliterate Kpelle to other groups who might differ on the
adults in a rural area of Liberia. Here is individualism–collectivism worldview. Us-
an example of a deductive logic problem ing different methods, Nisbett and his col-
and how the participant refuses to deal leagues showed that East Asians, like the
with the decontextualized structure and, Kpelle, rejected decontextualized abstract
instead, recontextualizes it, first by ask- logic and preferred to reason on the basis
ing more questions concerning context and of experience (Nisbett et al., 2001 ).
then by applying his own experience to
the problem:
EXPERIMENTER: At one time spider went to a Visual Pattern Construction: A Case
feast. He was told to answer this question of Historical Change
before he could eat any of the food. The
question is: Spider and black deer always The worldwide direction of change on
eat together. Spider is eating. Is black deer all critical demographic variables – to-
eating? ward greater population density, formal ed-
SUBJECT: Were they in the bush?
ucation, technology, and commerce-based
EXPERIMENTER: Yes.
wealth – yields an historical push toward the
SUBJECT: They were eating together?
EXPERIMENTER: Spider and black deer always
pole of individualism. I will use the domain
eat together. Spider is eating. Is black deer of visual representation to provide an exam-
eating? ple of how historical change can move cog-
SUBJECT: But I was not there. How can I nition in the direction of the individualis-
answer such a question? tic paradigm of thought. One of the marks
EXPERIMENTER: Can’t you answer it? Even if of a collectivistic cultural system is respect
you were not there you can answer it. for elders and their traditions. The individ-
SUBJECT: Ask the question again for me to ualistic side of this coin places a value on
hear. novelty and innovation. The typical econ-
EXPERIMENTER: (repeats the question)
omy in which respect for elders predomi-
SUBJECT: Oh, oh black deer was eating.
nates is agricultural subsistence. Innovation,
EXPERIMENTER: Black deer was eating?
SUBJECT: Yes.
in turn, is an important value in commercial
EXPERIMENTER: Black deer was eating? entrepreneurship. An experiment demon-
SUBJECT: Yes. strated how a shift from one economy to
EXPERIMENTER: What is your reason for say- another affected the representation of cul-
ing that black deer was eating? turally novel patterns.
In 1 969 and 1 970, I did a pattern repre- whole as it moved from subsistence agricul-
sentation experiment in a Zinacantec Maya ture to money and commerce.
community of Chiapas, Mexico (Green- In terms of the socialization processes
field & Childs, 1 977) that involved, among that could develop these new cognitive
other things, continuing both culturally styles, we found an historical change in
novel and culturally familiar (from tradi- weaving apprenticeship that also had moved
tional weaving) striped patterns. The ex- toward a more individualistic model. In
perimenter would place sticks of differ- commercial families, weaving apprentice-
ent colors in a rectangular wooden frame, ship had, between 1 970 and the early 1 990s,
providing three repetitions of the pattern moved from help and guidance from the
(for example, green, green, green, yellow teacher to a more independent trial-and-
would be a single repetition of one of the error learning process for the novice weaver.
patterns). She would then ask the sub- Moreover, we also found a correlation be-
ject to continue the same pattern. At that tween the more independent, individual-
time, the dominant economy was agricul- istic mode of weaving apprenticeship skill
tural subsistence with relatively little cash and continuing the novel patterns in our
or commerce. experiment.
I returned to the community in 1 991 af- So these basic cultural paradigms of
ter a period of economic development in thinking are not constant. They are adapta-
which commercial entrepreneurship and a tions to social conditions, including social-
cash economy had grown greatly with a ization processes, that change over time. As
corresponding decline in agricultural subsis- the world becomes more commercial, more
tence. I predicted that skill in continuing dense, and more formally educated, the Zin-
novel (not familiar) patterns would have in- acantecs illustrate this worldwide trend from
creased, and this is exactly what I found. a more collectivistic to a more individualistic
Even more interesting, I was able to relate paradigm of thought.
this skill with novel representations directly
to participation in commerce. Change had conclusions and future directions
been uneven, and children whose families Identifying two basically different paradigms
were most involved in commercial activities of thought, value, and behavior has linked
in both their business dealings and as con- together phenomena in the domain of cul-
sumers showed the most skill in constructing ture and thinking that were once consid-
the novel patterns. Structural equation mod- ered unrelated. With this linking thread has
eling indicated a causal relationship between come deeper understanding of basic cul-
correct completion of the novel patterns and tural differences. Although providing theo-
commercial involvement. retical coherence, it has also removed some
At the same time in this community, of the ethnocentrism from earlier accounts
where weaving was the most important skill of difference, in which, for example, collec-
learned by all girls, there had been a shift tivistic forms of categorization, reasoning,
in woven patterns from tradition to novelty. and logic were considered the absence of
At the earlier period, there was a closed set Western skills rather than as examples of a
of about four patterns that girls and women different set of values about the nature of
wove for clothing and other utilitarian pur- intelligence.
poses. By the time we went back in 1 991 , The primary omission in the preced-
the basic patterns still existed, but they had ing account is probably the ecocultural ap-
been supplemented by an ongoing process proach to everyday cognition and particu-
of innovation through girls and women who larly the role of cultural artifacts in thinking.
created an infinite number of woven and em- For good reviews from these perspec-
broidered designs. So skill in representing tives, I recommend Everyday Cognition by
culturally novel patterns in our experiment Schliemann, Carraher, and Ceci (1 997) and
was a reflection of change in the culture as a Culturally Situated Cognition by Wang, Ceci,
Williams, and Kopko (2004). The empirical Dasen, P. (Ed.) (1 977). Piagetian Psychology:
body of work generated by these approaches Cross-cultural Contributions. New York: Gard-
is not at all antithetical to the theoretical ner Press.
paradigm presented here. In the future, I Dasen, P. R. (1 984). The cross-cultural study of
believe further theoretical integration will intelligence: Piaget and the Baoulé. Interna-
take place. tional Journal of Psychology, 1 9, 407–3 4.
Dasen P. R. & de Ribeaupierre, A. (1 987). Neo-
Piagetian theories: Cross-cultural and differen-
tial perspectives. International Journal of Psy-
References chology, 2 2 , 793 –83 2.
Duranti, A. (1 988). Intentions, language and so-
Avis, J., & Harris, P. L. (1 991 ). Belief-desire rea- cial action in a Samoan context. Journal of Prag-
soning among Baka children: Evidence for a matics, 1 2 , 1 3 –3 3 .
universal conception of mind. Child Develop- Duranti, A. (1 993 ). Intentions, self, and
ment, 62 , 460–467. responsibility: An essay in Samoan ethno-
Bakeman, R., Adamson, L. B., Konner, M., & Barr, pragmatics. In J. H. Hill, & J. T. Irvine (Eds.),
R. G. (1 990). !Kung infancy: The social context Responsibility and Evidence in Oral Discourse
of object exploration. Child Development, 61 , (pp. 24–47). Cambridge, UK: Cambridge
794–809. University Press.
Baron-Cohen, S. (1 989). The autistic child’s the- Fadiga, L., Fogassi, L., Pavesi, G., & Rizolatti,
ory of mind: A case of specific developmental G. (1 995 ). Motor facilitation during action
delay. Journal of Child Psychology and Psychia- observation: A magnetic stimulation study.
try, 3 0, 285 –297. Journal of Neurophysiology, 73 , 2608–261 1 .
Bloom, L., Lightbown, P., & Hood, L. (1 975 ). Fernald, A., & Morikawa, H. (1 993 ). Common
Structure and variation in child language. themes and cultural variation in Japanese and
Monographs of the Society for Research in Child American mothers’ speech to infants. Child De-
Development, 40–97. velopment, 64, 63 7–65 6.
Bruner., J. S. (1 996). The Culture of Education. Fiske, A. P., Kitayama, S., Markus, H., & Nisbett,
Cambridge, MA: Harvard University Press. D. (1 998). The cultural matrix of social psy-
chology. In D. Gilbert, S. Fiske, & G. Lindzey
Chandler, M., Lalonde, C. E., Sokol, B., & Hal-
(Eds.), Handbook of Social Psychology (Fourth
lett, D. (2003 ). Personal persistence, identity,
Edition) (pp. 91 5 –981 ). New York: McGraw-
development, and suicide: A study of native
Hill.
and non-native North American adolescents.
Monographs of the Society for Research in Child Flavell, J. (1 999). Cognitive development: Chil-
Development, 68, No. 2, Serial No. 273 . dren’s knowledge about the mind. Annual Re-
view of Psychology, 5 0, 21 –45 .
Chiu, L.-H. (1 972). A cross-cultural comparison
of cognitive styles in Chinese and American Gardner, W. L., Gabriel, S., & Lee, A. (1 999).
children. International Journal of Psychology, 7, “I” value freedom, but “we’ value relationships:
23 5 –242. Self-construal priming mimics cultural differ-
ences in judgment. Psychological Science, 1 0,
Clancy, P. M. (1 986). The acquisition of commu-
3 21 –3 26.
nicative style in Japanese. In B. B. Schieffelin
& E. Ochs (Eds.), Language socialization across Gelman, R., & Lucariello, J. (2002). Role of learn-
cultures. Cambridge, UK: Cambridge Univer- ing in cognitive development. In C. R. Gallistel
sity Press. (Ed.), Stevens’ Handbook of Experimental Psy-
chology (Third Edition). Vol. 3 , Learning, Moti-
Cole, M., Gay, J., Glick, J., & Sharp, J. (1 971 ). The
vation, and Emotion (pp. 3 95 –443 ). New York:
Cultural Context of Learning and Thinking: An
Wiley.
Exploration in Experimental Anthropology. New
York: Basic Books. Gill, R., & Keats, D. M. (1 980). Elements of intel-
lectual competence: Judgments by Australian
Cole, M., & Scribner, S. (1 974). Culture and
and Malay university students. Journal of Cross-
Thought: A Psychological Introduction. New
Cutlural Psychology, 1 1 , 23 3 –43 .
York: Wiley.
Glick, J. (1 968, February). Cognitive style among
Collier, J. F. (2003 ). Presentation at UCLA Sloan
the Kpelle of Liberia. Paper presented at the
Center Workshop on Kinship and Family. Jan-
meeting on Cross-Cultural Cognitive Studies,
uary 3 1 , 2003 .
American Educational Research Association, Greenfield, P. M. (2004). Weaving generations

Chicago. together: Evolving creativity in the Zinacantec
Greenfield, P. M. (1 966). On culture and con- Maya. Santa Fe, CA: SAR Press.
servation. In J. S. Bruner, R. R. Olver, & P. Greenfield, P. M., & Bruner, J. S. (1 966). Culture
M. Greenfield, Studies in Cognitive Growth and cognitive growth. International Journal of
(pp. 225 –25 6). New York: Wiley. Psychology, 1 , 89–1 07.
Greenfield, P. M. (1 972). Oral or written Greenfield, P. M., & Bruner, J. S. (1 969). Cul-
language: The consequences for cognitive ture and cognitive growth, revised. In D.
development in Africa, the United States, Goslin (Ed.), Handbook of Socialization Theory
and England. Language and Speech, 1 5 , 1 69– (pp. 63 3 –660). Chicago: Rand McNally.
1 78. Greenfield, P. M., & Childs, C. P. (1 977). Un-
Greenfield, P. M. (1 974). Cross-cultural research derstanding sibling concepts: A developmental
and Piagetian theory: Paradox and progress. study of kin terms in Zinacantan. In P. Dasen
Dossiers Pedagogiques, March–April, 3 4–3 9. (Ed.), Piagetian Psychology: Cross-cultural con-
French translation. tributions (pp. 3 3 5 –3 5 8). New York: Gardner
Greenfield, P. M., (1 975 ). Oral or written Press.
language: The consequences for cognitive Greenfield, P. M., Keller, H., Fuligni, A., & May-
development in Africa, the United States, and nard, A. (2003 ). Cultural pathways through
England. Reprinted in M. Maer & W. M. universal development. Annual Review of Psy-
Stallings (Eds.), Culture, child, and school: Socio- chology, 5 4, 461 –490.
cultural influences on learning. Brooks/Cole Greenfield, P. M. & Lyn, H. (in press). Symbol
Publishing. combination in Pan: Language, action, culture.
Greenfield, P. M. (1 976). Cross-cultural research In D. Washburn (Ed.), Emergents and Ratio-
and Piagetian theory: Paradox and progress. In nal Behaviorism: Essays in Honor of Duane M.
English in K. F. Riegel & J. A. Meacham (Ed.), Rumbaush (working title). Washington, DC:
The developing individual in a changing world. American Psychological Association.
Vol. 1 . Historical and cultural issues. The Hague: Greenfield, P. M., Maynard, A. E., & Childs,
Mouton, pp. 3 22–3 3 3 . C. P. (2003 ). Historical change, cultural learn-
Greenfield, P. M. (1 980). Towards an operational ing, and cognitive representation in Zinacan-
and logical analysis of intentionality: The use of tec Maya children. Cognitive Development, 1 8,
discourse in early child language. In D. R. Ol- 45 5 –487.
son (Ed.), The Social Foundations of Language Greenfield, P. M., Raeff, C., & Quiroz, B. (1 996).
and Thought: Essays in Honor of J. S. Bruner Cultural values in learning and education. In B.
(pp. 25 4–279). New York: Norton. Williams (Ed.), Closing the Achievement Gap:
Greenfield, P. M. (1 994). Independence and inter- A Vision for Changing Beliefs and Practices
dependence as developmental scripts: Implica- (pp. 3 7–5 5 ). Alexandria, VA: Association for
tions for theory, research, and practice. In P. Supervision and Curriculum Development.
M. Greenfield, & R. R. Cocking (Eds.), Cross Greenfield, P. M., Reich, L. C., & Olver, R. R.
Cultural Roots of Minority Child Development (1 966). On culture and equivalence–II. In J. S.
(pp. 1 –3 7). Hillsdale, NJ: Erlbaum. Bruner, R. R. Olver, P. M. Greenfield, et al.,
Greenfield, P. M. (1 997a). Culture as process: Studies in Cognitive Growth (pp. 270–3 1 8).
Empirical methods for cultural psychology. In New York: Wiley.
J. W. Berry, Y. Poortinga, & J. Pandey (Eds.), Greenfield, P. M., Reich, L. C., & Olver, R.
Handbook of Cross-cultural Psychology: Vol. 1 . R. (1 972). On culture and equivalence–II.
Theory and Method (pp. 3 01 –3 46). Boston: Reprinted in P. Adams (Ed.), Language in
Allyn & Bacon. Thinking (pp. 21 7–23 5 ). Baltimore: Penguin
Greenfield, P. M. (1 997b). You can’t take it Books.
with you: Why ability assessments don’t Greenfield, P. M., & Savage-Rumbaugh, E. S.
cross cultures. American Psychologist, 5 2 , 1 1 1 5 – (1 990). Grammatical combination in Pan
1 1 24. Paniscus: Processes of learning and inven-
Greenfield, P. M. (2000). Three approaches to tion in the evolution and development of
the psychology of culture: Where do they come language. In S. Parker, & K. Gibson (Eds.),
from? Where can they go? Asian Journal of So- “Language” and Intelligence in Monkeys and
cial Psychology, 3 , 223 –240. Apes: Comparative Developmental Perspectives
(pp. 5 40–5 78). Cambridge, UK: Cambridge Kluckhohn, F. R. & Strodtbeck, F. L. (1 961 ). Vari-
University Press. ations in Value Orientations. Evanston, IL: Row,
Greenfield, P. M., & Smith, J. H. (1 976). The Peterson.
Structure of Communication in Early Language Kohler, W. (1 93 7). Psychological remarks on
Development. New York: Academic Press. some questions of anthropology. American
Grigorenko, E. L., Geissler P. W., Prince, R., Journal of Psychology, 5 8, 271 –278.
Okatcha F., Nokes, C., Kenney, D. A., et al. Kohler, W. (1 962). Psychological remarks on some
(2001 ). The organisation of Luo conceptions questions of anthropology. Reprinted in M.
of intelligence: A study of implicit theories in a Henle (Ed.), Documents of Gestalt psychology
Kenyan village. International. Journal of Behav- (pp. 203 –221 ). Berkeley: University of Califor-
ioral Development, 2 5 , 3 67–3 78. nia Press.
Hobson, R. P. (1 993 ). Autism and the Development Lalonde, C., Chandler, M. J., & Sokol, B. (1 999,
of Mind. Hillsdale, NJ: Erlbaum. June). Alternative solutions to the problem
Hofstede, G. (1 980). Culture’s Consequences: In- of self-continuity among Canada’s culturally
ternational Differences in Work-related Values. mainstream and aboriginal youth. Paper pre-
Beverly Hills, CA: Sage. sented to the 29th annual meeting of the Jean
Hong, Y. Y., Morris, M., Chiu, C. Y., & Benet- Piaget Society. Mexico City.
Martinez, V. (2000). Multicultural minds: A LeVine, R. A. (1 997). Mother–infant interac-
dynamic constructivist approach to culture and tion in cross-cultural perspective. In N. L.
cognition. American Psychologist, 5 5 , 709–720. Segall, G. E. Weisfeld, & C. C. Weisfeld (Eds.),
Iacoboni, M., Woods, R. P., Brass, M., Bekkering, Uniting Psychology and Biology: Integrative Per-
H., Mazziotta, J. C., & Rizzolatti, G. (1 999). spectives on Human Development (pp. 3 3 9–
Cortical mechanisms of human imitation. Sci- 3 5 4). Washington, DC: American Psycholog-
ence, 2 86, 25 26–25 28. ical Association.
Inhelder B., & Piaget J. (1 95 8). The Growth of Log- Lillard, A. (1 998). Ethnopsychologies: Cultural
ical Thinking from Childhood to Adolescence: An variations in theories of mind. Psychological Bul-
Essay on the Construction of Formal Operational letin, 1 2 3 , 3 –3 2.
Structures. New York: Basic Books. Luria, A. R. (1 971 ). Towards the problem of
Irwin, M. H., & McLaughlin, D. H. (1 970). Abil- the historical nature of psychological processes.
ity and preference in category sorting by Mano International Journal of Psychology, 6, 25 9–
schoolchildren and adults. Journal of Social Psy- 272.
chology, 82 , 1 5 –24. Madsen, M., & Shapira, A. (1 970). Coopera-
Ji, L., Zhang, Z., & Nisbett, R. E. (2002). tive and competitive behavior of urban Afro-
Culture, language, and categorization. Un- Americna, Anglo-American, Mexican Ameri-
published manuscript. Queens University, can, and Mexican village children. Developmen-
Kingston, Ontario. tal Psychology, 3 , 1 6–20.
Kestelhoof, L. (1 962). Aimée Césare. Paris: Edi- Maynard, A. E. (2002). Cultural teaching: The
tions Presse Seghers. development of teaching skills in Zinacantec
Kim, U. (1 994). Individualism and collectivism: Maya sibling interactions. Child Development,
Conceptual clarification and elaboration. In U. 73 , 969–982.
Kim, H. C. Triandis, C. Kagitcibasi, S. C. Choi, Miller, J. G. (1 984). Culture and the develop-
& Yoon, G., Individualism and Collectivism: ment of everyday social explanation. Journal
Theory, Method, and Applications (pp. 1 9–40). of Personality and Social Psychology, 46, 961 –
Thousand Oaks, CA: Sage. 978.
Kim, U., & Choi, S.-H. (1 994). Individualism, Monteil, V. (1 964). L’islam noir. Paris: Editions
collectivism, and child development: A Ko- du Seuil.
rean perspective. In P. M. Greenfield & R. R. Morris, M. W., & Peng, K. (1 994). Culture and
Cocking (Eds.), Cross-cultural roots of minority cause: American and Chinese attributions for
child development (pp. 227–25 7). Hillsdale, NJ: social and physical events. Journal of Personality
Erlbaum. and Social Psychology, 67, 949–971 .
Kitiyama, S. (2002). Culture and basic psycholog- Mundy-Castle, A. C. (1 974). Social and tech-
ical processes. Psychological Bulletin, 1 2 8, 89– nological intelligence in Western and non-
96. Western cultures. Universitas 4, 46–5 2.
Nisbett, R. E. (2003 ). The Geography of Thought: the Child, New Directions in Child Development.
How Asians and Westerners Think Differ- San Francisco: Jossey-Bass.
ently . . . and Why. New York: Free Press. Reykowski, J. (1 994). Collectivism and individ-
Nisbett, R. E., Peng, K., Choi, I., & Norenzayan, ualism as dimensions of social change. In U.
A. (2001 ). Culture and systems of thought: Kim, H. C. Triandis, C. Kagitcibasi, S. C. Choi,
Holistic versus analytic cognition. Psychological & Yoon, G., Individualism and Collectivism:
Review, 1 08(2), 291 –3 1 0. Theory, Method, and Applications (pp. 1 9–40).
Nsamenang, B. & Lamb, M. (1 994). Socialization Thousand Oaks, CA: Sage.
of Nso children in the Bamenda grassfields of Rogoff, B. (2003 ). The Cultural Nature of Human
Northwest Cameroon. In P. M. Greenfield, & Development. Oxford, UK: Oxford University
R. R. Cocking (Eds.), Cross-cultural Roots of Mi- Press.
nority Child Development (pp. 1 3 3 –46). Hills- Schliemann, A., Carraher, D., & Ceci, S. J. (1 997).
dale, NJ: Erlbaum. Everyday cognition. In J. W. Berry, P. R. Dasen,
Oyserman, D., Coon, H. M., & Kemmelmeier, & T. S. Saraswathi (Eds.), Handbook of Cross-
M. (2002). Rethinking individualism and cultural Psychology. Vol. 2 : Basic Processes and
collectivism: Evaluation of theoretical assump- Human Development (pp. 1 77–21 6). Boston:
tions and meta-analyses. Psychological Bulletin, Allyn & Bacon.
1 2 8, 3 –72. Scribner, S., & Cole, M. (1 981 ). The Psychology of
Perner, J. (1 991 ). Understanding the Representa- Literacy. Cambridge, MA: Harvard University
tional Mind. Cambridge, MA: MIT Press. Press.
Piaget, J. (1 95 2). The Origins of Intelligence in Chil- Segall, M. H., Dasen, P. R., Berry, J. W., &
dren. New York: Basic Books. Poortinga, Y. H. (1 999). Human Behavior in
Piaget J. (1 963 /1 977). Intellectual operations and Global Perspective: An Introduction to Cross-
their development. Reprinted in H. E. Gruber, cultural Psychology, (2nd ed.). Boston: Allyn &
H. & J. J. Vonéche (Eds.), The Essential Piaget: Bacon.
An Interpretive Reference and Guide (pp. 3 42– Serpell, R. (1 993 ). The Significance of Schooling:
5 8). New York: Basic Books Life Journeys in an African Society. Cambridge,
Price-Williams, D. R. (1 961 ). A study concerning UK: Cambridge University Press.
concepts of conservation of quantities among Serpell, R. (1 994). The cultural construction of
primitive children. Acta Psychologica, 1 8, 297– intelligence. In W. L. Lonner, & R. S. Malpass
3 05 . (Eds.), Psychology and Culture (pp. 1 5 7–63 ).
Rabain, J. (1 979). L’enfant du lignage. Paris: Payot. Boston: Allyn & Bacon.
Rabain-Jamin, J. (1 994). Language and social- Shweder, R. A., & Bourne, E. J. (1 984). Does the
ization of the child of African families liv- concept of the person vary cross-culturally? In
ing in France. In P. M. Greenfield, & R. R. R. A. Shweder, & R. A. Levine, Culture Theory:
Cocking (Eds.), Cross-cultural Roots of Minor- Essays on Mind, Self, and Emotion (pp. 1 5 8–
ity Child Development (pp. 276–292). Hillsdale, 1 99). New York: Cambridge University Press.
NJ: Erlbaum. Sternberg, R. J., Conway, B. E., Ketron, J. L., &
Rabain-Jamin, J., Maynard, A. E., & Greenfield, Bernstein, M. (1 981 ). People’s conceptions of
P. M. (2003 ). Implications of sibling caregiving intelligence. Journal of Personality and Social
for sibling relations and teaching interactions in Psychology, 4, 3 7–5 5 .
two cultures. Ethos, 3 1 , 204–23 1 . Super, C. M. (1 983 ). Cultural variation in
Rabain-Zempléni, J. (1 965 ). Quelques réfléxions the meaning and uses of children’s “intelli-
sur les modes fondamentaux de relations chez gence.” In J. Deregowski, S. Dziurawiec, & R.
l’enfant wolof du sevrage à l’intégration dans Annis (Eds.), Explorations in Cross-cultural Psy-
la classe d’âge. Paris: Association Universitaire chology (pp. 1 99–21 2). Amsterdam: Swets &
pour le Développement de l’Enseignement et Zeitlinger.
de la culture en Afrique et à Madagascar. Tomasello, M. (1 999). The Cultural Origins of Hu-
Raeff, C., Greenfield, P. M., & B. Quiroz (2000). man Cognition. Cambridge, MA: Harvard Uni-
Developing interpersonal relationships in the versity Press.
cultural contexts of individualism and collec- Trevarthen, C. (1 980). The foundations of
tivism. In S. Harkness, C. Raeff, & C. R. Super intersubjectivity: Development of interper-
(Eds.), Variability in the Social Construction of sonal and cooperative understanding in infants.
In D. Olson (Ed.), The Social Foundations of Wellman, H. M., Cross, D., & Watson, J. (2001 ).
Language and Thought (pp. 3 1 6–3 42). New Meta-analysis of theory-of-mind development:
York: Norton. The truth about false belief. Child Development,
Triandis, H. (1 989). Cross-cultural studies of in- 72 , 65 5 –684.
dividualism and collectivism. Nebraska Sympo- Whiten, A. (2002). Chimpanzee cognition and
sium on Motivation, 3 7, 41 –1 3 3 . the question of mental re-representation. In
Turiel, E. (2000). Conflict, social development, D. Sperber (Ed.), Metarepresentation (pp. 1 3 9–
and cultural change. In E. Turiel (Ed.), Develop- 1 67). Oxford, UK: Oxford University Press.
ment and Cultural Change: Reciprocal Processes. Whiting, B. B. (1 976). Unpackaging variables. In
New Directions for Child and Adolescent Devel- K. F. Riegel, & J. A. Meacham (Eds.), The
opment, 83 , 77–92. Changing Individual in a Changing World (pp.
Vinden, P. (1 996). Junin Quechua children’s un- 3 03 –3 09). Chicago: Aldine.
derstanding of mind. Child Development, 67, Wisniewski, E. J., & Bassok, M. (1 999). What
1 701 –1 706. makes a man similar to a tie? Stimulus compat-
Vinden, P. (1 999). Children’s understanding of ibility with comparison and integration. Cogni-
mind and emotion: A multi-culture study. Cog- tive Psychology, 3 9, 208–23 8.
nition and Emotion, 1 3 , 1 9–48. Wober, M. (1 974). Toward an understand-
Vinden, P., & Astington, J. (2000). Culture ing of the Kiganda concept of intelligence.
and understanding Other Minds. In S. Baron- In J. W. Berry, & P. R. Dasen, Cul-
Cohen (Ed.), Understanding Other Minds: Per- ture and Cognition (pp. 261 –280). London:
spectives from Developmental Cognitive Neuro- Methuen.
science (pp. 5 03 –5 1 9). Oxford, UK: Oxford Zambrano, I. (1 999). From na’ to know: Power,
University Press. epistemology and the everyday forms of state for-
Wang, Q., Ceci, S. J., Williams, W. M., & Kopko, mation in Mitontik, Chiapas (Mexico). Doc-
K. A. (2004). In R. Sternberg (Ed.), Culture toral dissertation. Cambridge, MA: Harvard
and Competence. Washington: American Psy- University.
chological Association. Zempleni-Rabain, J. (1 973 ). Food and the strat-
Wellman, H. M. (1 990). The Child’s Theory of egy involved in learning fraternal exchange
Mind. Cambridge, MA: MIT. studies. London: Oxford University Press.
Part VII
THINKING IN PRACTICE

CHAPTER 2 8
Legal Reasoning
Phoebe C. Ellsworth
For more than a century, lawyers have writ- evidence they’ve heard to reach a verdict.
ten about legal reasoning, and the flow of They are asked to engage in “legal reason-
books and articles describing, analyzing, and ing.” Clients approach their attorneys with
reformulating the topic continues unabated. rambling stories and a strong, if somewhat
The volume and persistence of this “unre- vague, sense of injustice, and it is the at-
lenting discussion” (Simon, 1 998, p. 4) sug- torney’s job to figure out the laws, prece-
gests that there is no solid consensus about dents, and facts that most favor the client
what legal reasoning is. Legal scholars have and to integrate them into a persuasive case.
a tenacious intuition – or at least a strong This task involves legal reasoning, but the
hope – that legal reasoning is distinctive, reasoning is driven by the desired outcome.
that it is not the same as logic, or scientific The goal is not to reach the right decision
reasoning, or ordinary decision making, and but to make the best argument for one side.
there have been dozens of attempts to de- The evidence, as orchestrated by the lawyers
scribe what it is that sets it apart from these and the legal arguments they make, form
other forms of thinking. These attempts gen- the raw materials for the judge’s decision, al-
erate criticism, the critics devise new formu- though judges (like juries) may also draw on
lations that generate further criticism, and their own background knowledge and expe-
the process continues. In this chapter, I de- rience and their own interpretations of the
scribe the primary forms of legal reason- evidence and (unlike juries) their own un-
ing, the most important schools of thought derstanding of the law.
about legal reasoning, and some of the ma- When scholars write about “legal reason-
jor differences between legal reasoning and ing,” they are writing about judges. The
scientific reasoning. lawyer does not have to decide the case,
The first question is, “Whose legal reason- but only to make the strongest appeal for
ing are we talking about?” Jurors are given one side; lawyers’ reasoning is discussed in
instructions on the law at the end of every courses and writings on advocacy. Jurors in-
trial and are asked to apply that law to the terpret the evidence to decide what actually
685
happened and apply the law given to them soning. They are commonly described in re-
in the judge’s instructions to reach a verdict. lation to scientific reasoning as well. What is
The judge must also seek out the appropri- distinctive about these forms of reasoning in
ate legal authority, deciding which laws and the legal context is not so much the process
previous cases are applicable. Jurors are not but the context, the raw materials to which
supposed to reason about the law itself; that the processes are applied, and the nature of
is the task of the judge. Judges are trained in the rules.
the law, they know the statutes and prece-
dents, and they have the experience of judg-
ing many cases and reading the decisions of
Deductive and Analogical Reasoning
other judges. Jurors do not provide reasons
in Law
for their verdicts; judges often do. Finally,
much of what is written about legal rea-
Deductive (Rule-Based) Reasoning
soning is about appellate court decisions, in
which judges are primarily concerned with In deductive scientific reasoning (see Dun-
legal procedure and the law itself, not about bar & Fugelsang, Chap. 29), there is a gen-
who wins and loses, and in which they al- eral law or theory, and the scientist uses that
most always must provide legal explanations theory to infer what will happen in some
for their decisions. particular fact situation, makes a prediction,
In the subsequent historical section, I de- and designs an experiment to test it. If the
scribe how basic visions of the nature of le- prediction is not confirmed, there are three
gal reasoning have changed over time. Most possibilities: The deduction was flawed,
judges, if they thought about their thought the experiment was flawed, or the theory
processes at all, have probably accepted the is flawed. In deductive legal reasoning, the
commonsense background theory prevalent decision maker begins with a specific set of
in the legal culture of their era. Some, how- facts, looks at the law that applies to those
ever, including some of the greatest judges, facts, and reaches a verdict. If Joe’s Liquor
have recognized that they really can’t ex- Store sells beer to 1 6-year-old Richard, and
plain how they reach decisions (Holmes, there is a law prohibiting the sale of alco-
1 897; and cf. Nisbett & Wilson, 1 977). In hol to anyone under the age of 21 , then Joe’s
1 921 , Benjamin Cardozo began his classic Liquor Store is guilty. The reasoning is ba-
work, The Nature of the Judicial Process, with sically syllogistic, and in many cases the ap-
the observation that “[A]ny judge, one might plication of the law is unproblematic (see
suppose, would find it easy to describe the Evans, Chap. 8). These are called easy cases.
process which he had followed a thousand In practice, there are many ways in
times and more. Nothing could be farther which ambiguity can creep into this appar-
from the truth” (1 921 , p. 9). ently clear logical process. First, the decision
But that does not mean there are no com- maker is faced with a specific set of facts. If
monly accepted characteristics of legal rea- he or she is a judge, there are almost always
soning. There are. The problem that vexes two versions of the facts. It is the attorneys’
legal scholars is that they are incomplete. job to organize the facts in a way that fits the
Although they undoubtedly influence judi- legal outcome they wish to achieve, and they
cial reasoning, they are insufficient either do this by emphasizing different facts and,
to predict future outcomes or to provide often, different legal precedents. “[T]he law
a fully satisfactory account for past ones. determines which facts are relevant while at
The two most common reasoning strate- the same time, the facts determine which
gies, taught in every law school course on law is relevant” (Burton, 1 995 , p. 1 41 ). There
legal reasoning and writing, are the deduc- may be more than one law that is poten-
tive method (rule-based reasoning) and the tially applicable. There may be several statu-
analogical method (case-based reasoning). tory provisions that might be relevant, and
These strategies are not unique to legal rea- the two opposing counsel may argue that a
legal reasoning 687
different rule is the one that should con- performance on tasks that were less familiar.
trol this case. The statute itself may violate a He applied a more general principle that
higher rule, such as the state or federal con- explained the apparently contradictory re-
stitution. The rule may be ambiguous, as in sults of past research and made sense of the
a ban on “excessive noise,” or the application field. He then went on to devise a situation
of the “reasonable person” standard (“Would in which the new principle could be tested.
a reasonable person have believed that her The judge begins where the scientist ends,
life was in danger?”). with a specific situation in which the out-
In preparing a case, an attorney will go come must be decided – not predicted and
back and forth between developing a co- tested but decided by examining the sim-
herent version of the facts that fits the law ilarities and differences between this new
and conducting legal research to find out case and the previous cases and choosing an
which laws frame the facts in the best pos- outcome that corresponds to the holdings
sible way. The judge, faced with two com- of the cases it most resembles. In the ad-
peting arguments, may choose one of them, versarial system, the lawyers emphasize the
or may bring in additional factual interpreta- prior cases that were decided the way they
tions or legal considerations not mentioned want this one to be decided, finding crucial
by either of the parties. Thus, even the ap- differences in the prior cases that went the
parently simplest form of legal reasoning – “wrong way” so as to argue that their hold-
deciding whether the law covers the specific ings are inapplicable in the present context.
fact situation – is often quite complicated in The lawyers have a certain leeway in their se-
practice. The commonsense idea that there is lection of which facts to emphasize, in their
a behavior, there is a law, and the ques- interpretation of the facts, and in their de-
tion is “Does the behavior conform to the scription of the legal significance of those
law?” is much too simple to apply to interes- facts (Llewellyn, 1 93 0, p. 70). Like the scien-
ting cases. tist, the lawyer may identify some principle
that explains why the current case should
be considered an example of the first group
Analogical (Case-Based) Reasoning
rather than the second. The judge examines
In the Anglo-American common law the strengths and weaknesses of the argu-
tradition,1 cases are decided by examining ments of the two parties and either chooses
the patterns of decisions in earlier, related between them or develops a different princi-
cases. No case has meaning in isolation, ple for placing the present case in the context
and general rules and propositions are of the past ones.
useless without “the heaping up of concrete When legal educators claim that the basic
instances” (Llewellyn, 1 93 0, p. 2), except in mission of the first year of law school is to
very simple cases. A somewhat similar form train the student to “think like a lawyer,” it
of reasoning occurs in science when a scien- is this sort of analogical reasoning they gen-
tist examines a series of studies with appar- erally have in mind – the ability to spot the
ently inconsistent results and tries to come factual and legal similarities and (more im-
up with a general principle that will explain portant) differences between the case un-
the inconsistencies. In research on social der study and related previous cases and
facilitation, for example, some researchers to recognize which similarities and differ-
found that people performed better on a task ences are relevant (e.g., the defendant’s state
when other people were around, but other of mind) and which are not (e.g., the de-
researchers found that people performed fendant’s name). This entails defining the
better when they were alone. In 1 965 , universe of possibly applicable cases and de-
Robert Zajonc resolved this controversy by ciding which ones match the current case
showing that the emotional arousal caused most closely and which, although apparently
by the presence of others enhanced perfor- similar, do not apply. The focus is on the
mance on well-learned tasks but impaired particular cases, and the reasoning is more
like feature matching than like the applica- and in his effort to make law an academic
tion of a general principle (Sunstein, 1 996, discipline rather than a mere trade, Langdell
p. 67; see Holyoak, Chap. 6, for further dis- embraced the idea that law is a science
cussion of analogical reasoning). (Langdell, 1 880). He did not originate this
Finally, as with deductive reasoning, the view, which can be found in Blackstone’s
significance of a particular fact depends on Commentaries and earlier (Kennedy, 1 973 ),
its legal significance, and the significance but he promulgated it enthusiastically. An
of a particular law or previous holding de- obvious problem with this analogy is that
pends on the exact fact pattern of the in law there is no means of experimenta-
case. The legal reasoner must consider both tion, no access to previously unknown data.
simultaneously. The “data” consisted of the writings of ear-
lier judges: “We have constantly inculcated
the idea that the library is the proper work-
shop of professors and students alike; that it
Theories of Legal Reasoning
is to us all that the laboratories of the uni-
versity are to the chemists and physicists, the
Formalism2
museum of natural history to the zoologists,
That “legal reasoning” is considered to be a and the botanical gardens to the botanists
distinctive form of reasoning worthy of be- (Langdell, 1 887, p. 1 24; emphasis added).
ing included as a separate topic in the Cam- The data were what judges had said, and new
bridge Handbook on Thinking and Reasoning data were what new judges said, based on
is attributable in large measure to Christo- their readings of their predecessors. Langdell
pher Columbus Langdell, who became the did not argue that law as it existed actually
first Dean of the Harvard Law School in achieved the beautiful hierarchical organi-
1 870, and who revolutionized legal educa- zation from clear, highly abstract principles
tion. He introduced the case-based tech- down to lower levels that would finally allow
nique of teaching law; he created the image precise derivations that would fit any new set
of the law faculty as a group of perma- of particular facts; creating such an arrange-
nent scholars devoted to legal research, ment was a goal of legal science.
explicitly promoting the analogy to the fac- Of course this view of science as a closed
ulty of a science department; and he advo- deductive system strikes most modern sci-
cated a view of legal reasoning known as “le- entists as unrealistic and simplistic – a view
gal formalism.” of science that we were taught in eighth
The essence of legal formalism is the idea grade but that rarely seems like a descrip-
that “a few basic top-level categories and tion of what we actually do or how we
principles formed a conceptually ordered actually think. The behavioral sciences espe-
system above a large number of bottom-level cially (and it seems natural to us that if law
rules. The rules themselves were, ideally, the is to be considered a science at all it should
holdings of established precedents, which be a behavioral science) seem a poor fit for
upon analysis could be seen to be discovered such an abstract deductive model of reason-
from the principles” (Grey, 1 983 , p. 1 1 ). In ing. Even in 1 870, the excitement of observa-
other words, there is a pyramid of rules with tion, empiricism, and induction were rapidly
a very few fundamental “first principles” at replacing earlier deductive views of science.
the top, from which mid-level and finally a Langdell’s model of science was more like
large number of specific rules could be de- the taxonomic system of Linnaeus than like
rived. The legal decision maker, faced with empirical science. Families of plants and an-
a case to be decided, would study the body imals were organized under phyla (the fun-
of law and discover the rule that determined damental principles), genera under families,
the correct result. and species under genera. During the explo-
In 1 870, science represented the pin- rations of the eighteenth and nineteenth cen-
nacle of human intellectual achievement, turies, an astonishing variety of new plant
legal reasoning 689
and animal species was discovered, and each Two widely different cases suggest a general
one could be compared with others at the distinction which is a clear one when stated
species level and classified appropriately in broadly. But as new cases cluster around
its place in the ruling structure. In the same the opposite poles, and begin to approach
way, each new legal case could be examined each other, the distinction becomes more dif-
ficult to trace; the determinations are made
for its similarities and differences to previ-
one way or the other on a very slight pre-
ously decided cases, which in turn had been
ponderance of feeling, rather than articu-
classified according to the general taxon- late reason; and at last a mathematical
omy, and so could be decided accurately. In line is arrived at by the contact of contrary
law, “the fundamental principles of common decisions, which is so far arbitrary that it
law were discerned by induction from cases, might equally well have been drawn a little
rules of law were then derived from princi- further the one side or the other (Holmes,
ples conceptually, and, finally, cases were de- 1 873 , p. 65 2).
cided, also conceptually, from rules” (Grey,
1 983 , 1 9). Although the idealistic theory behind
There were critics of legal formalism from formalism has largely been abandoned (cf.
the very beginning. The alternative view is il- Kennedy, 1 973 ; Gordon, 1 984; Grey, 1 983 ;
lustrated in two famous remarks by Oliver Simon, 1 998), its categories and its ana-
Wendell Homes, Jr.: “The life of the law lytic methods persist. Its classifications are
has not been logic: It has been experience” still robust – substantive versus procedural
(Holmes, 1 881 , p. 1 ), and “general princi- law; contracts, torts, property. They deter-
ples do not decide concrete cases” (dissent- mine how the first year of law school is
ing opinion in Lochner v. New York, 1 905 , p. structured. No comprehensive new organi-
76). Holmes and, later, critics such as Pound, zational scheme has replaced the categories
Llewellyn, and Cardozo argued that legal of formalism, and they therefore continue to
principles were not “discovered” by careful “influence judgment much as the agenda for
research into the rules and principles, and a meeting influences the results of its delib-
that such research, however diligent, would erations” (Grey, 1 983 , p. 5 0).
not yield definite and incontrovertible an- The tenets of legal formalism still ex-
swers in any but the easiest cases. Instead of ercise a strong influence on the way judi-
clear distinctions between the cases decided cial opinions are written. Decisions typically
in one way and those decided in the other are presented as the inevitable consequence
(for the plaintiff or the defendant in a med- of a careful analysis of the facts and
ical malpractice case, for example), there is the applicable law based on the classifi-
overlap and fuzziness at the boundary and, cation of this case in relation to previous
in the end, the judge creates the defining dis- cases. The correct decision and the govern-
tinction rather than discovering it (Cardozo, ing principles are described as discovered,
1 921 , p. 1 67). The distinctions were often not created, by the judge (Schauer, 1 995 ,
arbitrary, not logical, and influenced by the p. 642, note 23 ), and are expressed with
judge’s own sense of what the right outcome great certainty, as though there were no
should be. The fundamental principles and room for doubt. “It seems that this neo-
legal rules were important and provided con- formalist form of jurisprudence – typified
siderable guidance to the judge but, in most by a self-reported experience of constraint,
cases, they were insufficient to determine high confidence and singular correctness,
the outcome. The certainty and sense of in- all couched in the rhetoric of closure – is
evitability expressed in judicial opinions was the predominant, albeit unofficial, mode of
quite unjustified. As time goes by and the judicial reasoning in current American legal
legal landscape becomes dense with more culture” (Simon, 1 998, p. 1 1 ). In part, this
and more intermediate cases, the failures of persistence is attributable to the strong be-
formalism become increasingly apparent. As lief that the law requires stability. For peo-
Holmes put it ple to have faith in the legal system, judges’
decisions must be predictable, and for judges The first move toward legal realism was
to make predictable, logical decisions there “Sociological Jurisprudence,” which was ex-
must be a fixed framework from which pounded most explicitly by Roscoe Pound
those decisions are derived. A major differ- (1 91 2). Like Holmes, Pound felt that the
ence between law and science, as discussed “mechanical jurisprudence” of the formal-
subsequently, is that uncertainty and change ists was out of touch with social real-
are a sign of a healthy scientific climate; ity and that legal scholarship and judicial
they would definitely not signal a healthy norms were standing still, out of touch with
legal climate. exciting developments in philosophy and,
particularly, the social sciences. “Jurispru-
dence,” he argued, “is the last in the march
of sciences away from the method of de-
Legal Realism
duction from predetermined conceptions”
Legal realism arose in opposition to formal- (Pound, 1 909, p. 464). The strict doctrinal
ism and can be seen as an extension and elab- approach blinded legal writers to two essen-
oration of Holmes’s early skepticism. Legal tial considerations: first, the purposes of the
realists rejected the formalist ideas that the law – the goal of doing justice rather than
law was a self-contained logical system pro- following the letter of the law; and second,
viding for the scientific, deductive derivation the social, cultural, and psychological factors
of the right answer in all new cases. They that influenced behavior, including the be-
regarded this view as a vain daydream dis- havior of lawmakers and judges. Blind adher-
connected from the real world influences on ence to the abstract law-on-the-books might
legal decision makers – hence the label “legal make for greater certainty and predictability,
realism.” but “reasonable and just solutions of individ-
In a strict formalist analysis, two different ual cases” were “too often sacrificed” (Pound,
judges should always judge the same case 1 91 2, p. 5 1 5 ). The law treated all individuals
in the same way unless one of them was as equivalent regardless of their social back-
mistaken in his3 understanding of the facts ground or position. Thus, for example, the
or the law. Clearly this was not the case. right of an employee to quit was legally the
In the nineteenth century, as now, courts same as the right of the employer to fire him.
were often divided. There were judges in Both were free agents enjoying the “liberty of
the majority and there were dissenters, and contract.” But of course the employer could
no one seriously argued that the dissenters easily find another employee, but the em-
were incompetent or in need of retraining. ployee would have lost his livelihood and
Of course the formalists did not believe this might have a very hard time finding another
was the way the world really worked, but job. The law’s refusal to acknowledge these
they did believe that the legal system could obvious social truths was a major stimulus to
approximate that ideal and that it was an sociological jurisprudence.
ideal worth striving for. The legal realists be- Pound argued that legal scholarship and
lieved that it was an impossible ideal and that judicial decisions should “take more ac-
it was a waste of time to strive for it. count, and more intelligent account, of the
According to the legal realists, instead of social facts upon which law must proceed
reflecting an abstract set of nearly immutable and to which it is to be applied” (1 91 2,
principles, the law reflects historical, social, p. 5 1 3 ). The focus should not be on the ab-
cultural, political, economic, and psycholog- stract content of the laws but on how they
ical forces, and the behavior of individual actually work. It is important to consider
legal decision makers is a product of these the purpose of laws and to modify them if
forces. It therefore is not surprising that dif- these purposes are not being achieved. And
ferent judges, with different goals and back- judges should regard the law as suggestive
grounds, should decide cases differently, and rather than determinative of their decisions:
contrary decisions do not imply that some If strict application of the law would result
judges must be “wrong.” in an outcome that is unjust or contrary to
legal reasoning 691
the purpose of the law, then flexibility in the making – and that that was how it should
cause of justice is appropriate. be. Karl Llewellyn, one of the most impor-
The basic views of Holmes and Pound tant figures in the group, argued that law was
were quite similar – pragmatic and open- about “disputes to be settled and disputes to
minded. Pound, however, was a far stronger be prevented” (1 93 0, p. 2), not about rules;
proponent of an interdisciplinary solution about what legal decision makers do, not
to the problems of formalism. The social what they say. Legal rules were regarded as,
sciences were very much on the rise at at best, post hoc justifications and, at worst,
the beginning of the twentieth century and criteria that could lead judges to unjust de-
seemed “progressive” in a way that law was cisions. Advocates in a trial could usually
not. Their ideas stretched the imaginations describe the facts and the law so as to pro-
of the more intellectually curious law produce coherent, complete, persuasive argu-
fessors and challenged some of the most ments for two diametrically opposite con-
fundamental assumptions of the law. The so- clusions. Llewellyn even wrote an article on
ciologists (the most influential group) sug- statutory interpretation showing that each
gested that the equality of all assumed by of 28 basic legal propositions could be ar-
the law (e.g., the “liberty of contract”) was a gued either way: “A statute cannot go beyond
myth because status and power significantly its text”/“To effect a purpose a statute may
affected a person’s choices, the anthropolo- be implemented beyond its text”; “Where
gists revealed a wide range of peaceful so- design has been distinctly stated no place
cieties with entirely different kinds of legal is left for construction”/“Courts have the
systems, and psychologists raised questions power to inquire into real – as distinct from
about the essential legal concepts of free will ostensible – purposes” (Llewellyn, 1 95 0,
and responsibility, suggesting that behavior pp. 401 , 403 ).
was determined by psychological and social The agenda of the legal realists was both
factors beyond the control of the individual descriptive and prescriptive. According to
(Green, 1 995 ). Felix Cohen, “Fundamentally, there are only
The period identified as the flowering two significant questions in the field of law.
of legal realism was the period between One is, ‘How do courts actually decide cases
the wars (Fisher, Horwitz, & Reed, 1 993 ). of a given kind?’ The other is, ‘How ought
Holmes and Pound were the inspirational they to decide cases of a given kind?’”(1 93 5 ,
figures from the past,4 but now there were p. 824). The answer to the descriptive ques-
enough like-minded scholars so they could tion was that courts do not decide cases on
legitimately be called a “school” or a “move- the basis of laws because the law always
ment,” although never an organization. Like allows for multiple answers. In considering
the cognitive psychologists who shook off what sort of forces do influence case out-
the shackles of behaviorism in the 1 960s and comes, different scholars emphasized social
1 970s, they were an eclectic group united and cultural forces (Cohen, 1 93 5 ; Lasswell,
mainly by their opposition to the old ways. 1 93 0; Yntema, 1 928), unconscious psycho-
Some tried to do empirical research, some logical drives (Frank, 1 93 0), or just a pro-
were political activists (and some eventually cess of intuition that eventually culminated
became part of the New Deal government), in a Gestalt-like “Aha effect” after long ru-
some continued as legal scholars but preach- mination (Hutcheson, 1 929). These influ-
ing a new faith, and some were articulate ences affect the assessment of the actual
gadflies. Some were and are highly respected facts of the case – the credibility of the
figures in the history of legal scholarship, witnesses, the plausibility of the stories, as
some were but are no longer, and some were well as the judge’s “sense of how the law
always seen as fringe elements. ought to respond to these facts” (Fisher, Hor-
As with their predecessors, their primary witz, & Reed, 1 993 , p. 1 65 ). Legal real-
unifying theme was a rejection of the old ists were ridiculed as believing that judicial
ways and a passionate belief that legal doc- decisions depended on what the judge ate
trine played a limited role in legal decision for breakfast. However, the realists generally
did not believe that judicial decisions were sential to answer the second question, “How
idiosyncratic or unpredictable. “Law is not ought [courts] to decide cases of a particular
a mass of unrelated decisions nor a prod- kind?” Judicial decisions affect human be-
uct of judicial bellyaches. Judges are hu- havior, often favoring one group’s interests
man, but they are a particular breed of over another, and they affect future judicial
humans, selected to a type and held to ser- decisions. Careful study of these conse-
vice under a potent system of governmen- quences would allow for better-informed ju-
tal controls” (Cohen, 1 93 5 , p. 843 ). Because dicial decisions and better laws.
most judges come from the same social class, Prescriptively, the realists argued first that
receive the same legal education, and are in applying the law, judges ought to con-
subject to the same social and historical in- sider the purpose of the law and, second,
fluences and the same role demands, their that they should focus on the particulars
decisions will resemble each other. of the case and compare it with the partic-
The intellectual enterprise of legal schol- ulars of preceding cases, rather than look-
arship, therefore, should be to describe the ing for broad general principles. Consid-
actual behavior of courts, taking account of eration of the purposes of the law was
the broader social context. The realists were supposed to enhance the fairness and the
confident that this behavior would not be consistency of decisions, and blind applica-
predictable from written legal doctrine or tion to the rule without considering its pur-
statutes. Instead, the legal rules and con- pose would lead to bad decisions (Llewellyn,
cepts would turn out to be consequences, 1 942). To facilitate this approach, legisla-
rather than causes, of judges’ behavior. To tors and judges should make the reasons
understand how judges reach their decisions, for the law explicit; to provide appropri-
it is important to analyze their social back- ate guidance to future judges: “Only the
grounds, previous experience, and role de- rule which shows its reason on its face has
mands and the general political, social, and ground to claim maximum chance of contin-
economic pressures of the times. Because uing effectiveness” (Llewellyn, 1 942, p. 260).
these same forces affected the behavior of Because social conditions were constantly
the parties of the case, the relation between changing, however, judges should be free
the judge’s position in society and that of the to revise and reject even rules with clearly
litigants should also be explored. This gen- stated purposes; the development of law,
eral set of ideas was easy to demonstrate in like the development of science, should be
particular cases. Then, as now, the opinions a never-ending process of examination and
of individual judges on particular issues were re-examination.
often easy to predict. Defense lawyers “shop” Specific comparisons of the particular
for judges known to be sympathetic to of- case to be decided and the facts of related
fenders who resemble their client (judges cases, through analogical reasoning, was the
who believe that drug laws are too harsh, for preferred method. Just as a case read by it-
example). On some issues, it is easy to pre- self is meaningless (Llewellyn, 1 93 0, p. 49),
dict Supreme Court Justices’ positions based a case read with reference to the law and
on their previous opinions and their general without reference to other cases was also
ideology. Coming up with a more general meaningless. Close factual comparisons will
mid-level theory, something between vague reveal the empirically grounded rules and
abstract statements about “social forces” and cultural beliefs that actually explain legal
predictions of what a particular judge would decisions because “legal rules are simply for-
say in a particular case, was a much greater mulae describing uniformities of judicial de-
challenge and one the realists never actually cision” (Cohen, 1 93 5 , p. 848). Some of the
accomplished. realists believed that close examination of
The description of what courts actually the prior body of cases required more than a
do was supposed to explore not only the reading of the cases alone. Some felt that an
causes of judicial decisions but also their education in social science was necessary to
consequences. A study of consequences is es- fully understand the social forces influencing
legal reasoning 693
the parties and the judge. Others felt that scholars should collect detailed statistical
legal researchers should create databases on information about the causes and conse-
the background of judges and their decisions, quences of various rules, conducting in-
the frequency with which laws on the books terdisciplinary empirical research, and that
were actually enforced, whether they are en- courts should consider social science data
forced against some groups more than oth- in deciding cases. The method of mar-
ers, whether patterns of enforcement have shaling social scientific evidence in argu-
changed over time (e.g., obscenity laws), and ing a case was pioneered by Louis Brandeis
so on. and Josephine Goldmark in the famous
The legal realists have been identified “Brandeis brief” in Muller v. Oregon (208 U.S.
with a “social science” point of view, but this 41 2). In arguing that it was constitution-
meant different things to different scholars. ally permissible to restrict women’s work-
Most of them probably shared Pound’s being hours to ten hours a day, they presented
lief that, although other scientific disciplines hundreds of excerpts from various articles
were making huge progress, law was stag- and reports claiming that long working hours
nating, backwards looking, and clinging to a were damaging to women’s health. Most of
static, deductive model that had been aban- these were not actually scientific reports, but
doned by other sciences. The law, because it they were an effort (successful) to force the
deals with ever-changing values, opportuni- court to consider the social facts involved
ties, and norms of behavior should keep pace in the legal question and the social conse-
with these changes. Most also were some- quences of the decision. The “Brandeis brief”
what shaken by the ways in which sociology is legendary, and the inclusion of social sci-
and psychology were undermining the no- ence research in legal arguments is now com-
tion of free will central to the law (Green, mon. Modern trial and appellate courts rou-
1 995 ). Most of them agreed that the focus of tinely consider social science data, although
attention should be on how judges think, not their actual influence is probably less than
on the written rules. They were fairly unified most social scientists would like to believe
in describing what was wrong with formalism (Ellsworth & Getman, 1 986).
but never fully agreed on the remedies and, There were some efforts to compile
indeed, proposed very few. databases (Pound and Frankfurter, 1 922; and
Beyond this general sense that the law cf. Schlegel, 1 980) and a few attempts to ac-
should develop as society develops and tually carry out systematic research projects.
take general account of progress in the so- However, these attempts generally failed to
cial sciences, the realists followed different achieve the grand purposes their authors had
paths. Some more or less stopped there. in mind. In writing a traditional law review
For others, the “critical realists” in Horwitz’s article, the author typically knows what the
(1 992) terminology, social science mainly conclusion is at the beginning; empirical re-
meant a concern with social policy. Politi- search, as any honest scientist knows, often
cally they were progressives, and flourished forces agonizing rethinking and sometimes
under the New Deal. Cardozo, Brandeis, produces data so ambiguous that nothing
Frankfurter, and Douglas followed Holmes can be concluded. So, in 1 928, the future
to the Supreme Court, and several others Supreme Court Justice William O. Douglas
moved to important positions in the New conducted a study of business failures de-
Deal administration. For them, the social sci- signed to produce revolutionary insights but
ence that mattered was the sociologists’ em- ended up with two small, inconclusive arti-
phasis on social class and a generally socialist cles (Fisher, Horwitz, & Reed, 1 993 , p. 23 3 ).
view of what should guide the government Underhill Moore, a Yale law professor in one
and the courts. For them, as for many of the of the three experimental law and social sci-
social scientists of the time, social science ence interdisciplinary programs, attempted
meant social activism. a behaviorist (Hullian) analysis of the ef-
Another group, the “constructive real- fects of parking tickets (Moore and Callahan,
ists” (Horwitz, 1 992), believed that legal 1 943 ) that provoked intense ridicule even
from other realists [Llewellyn later called it scendants that persist as independent cur-
“the nadir of idiocy” (1 95 6, p. 400)]. Em- rents. One, called Critical Legal Studies, is
pirical research by legal scholars has slowly a reincarnation of the Progressive political
increased over the past 5 0 or 60 years, but themes of Legal Realism, and the other two
at the time, the admonishments of the le- (the Law and Economics movement and the
gal realists only produced a brief spate of at- Law and Society movement) are develop-
tempts, nothing like a major change in orien- ments of the interdisciplinary social science
tation. It is still the case that some law pro- endeavor.
fessors regard empirical research as mindless Law and Economics scholars are fairly
and mechanical with data a crutch for those traditional in terms of economic theory
whose mental capacities are insufficient to [e.g., Tversky, Kahneman, and the behav-
reach the truth on their own. ioral economists so far have had mini-
Although the excesses of Legal Realism mal influence (Kahneman & Tversky, 2000;
are still parodied in well-worn clichés (such Kahneman, Slovic, & Tversky, 1 982; Thaler,
as the “what the judge had for breakfast” 1 992)], taking as given the assumption that
cliché), in the main, it has been absorbed people rationally assess their circumstances
into American legal thought; thus, only the and do what will maximize their own wel-
excesses stand out as distinctive. Close com- fare. The potential criminal calculates the
parison of cases is the standard method of probabilities of getting caught, being pun-
legal education, and consideration of the ished, and the potential severity of pun-
social context, purposes, and policy impli- ishment and weighs these considerations
cations of the law is common. The chal- against the beneficial consequences of the
lenge posed by the realists – the relative crime (money, the extermination of a goal-
role of law versus social and personal con- blocking person) and accordingly decides
siderations – still looms over the study of whether or not to commit the crime. They
law and defines the questions. Databases attempt to fit legal decisions into a stan-
are everywhere, especially in the criminal dard economic framework and, if they do
justice system, but also in the civil arena. not fit, to argue that they should.5 Although
The American Bar Association regularly pro- they are often described as descendants of
poses guidelines based on statistical data as the legal realists, in some ways the Law
do government commissions. No one still and Economics movement bears a closer re-
believes in strict Langdellian formalism, al- semblance to the formalists. It has a for-
though many law courses are an uneasy mal model with a set of first principles: “Be-
blend of formalism and the considerations havior always takes the form of constrained
raised by the legal realists, and judicial opin- maximization. The actor chooses from some
ions are written in formalist language. And specified set of options, selecting the option
the later developments of legal realism, al- that maximizes some objective function. In
though never quite mainstream, are thriv- orthodox theory, consumers have preferences
ing. In 1 93 5 , Felix Cohen wrote that “It is that are represented by a utility function,
reasonable to expect that some day even and they choose in a way that maximizes
the impudencies of Holmes and Llewellyn their utility . . .” (Kreps, 1 990, p. 4, cited in
will appear sage and respectable” (1 93 5 , Hanson & Yosifon, 2003 ). Explanations and
p. 847), and that prophecy has certainly recommendations follow deductively from
come true. the basic premises. Law and Economics has
little to say about what is distinctive about
legal reasoning; it is primarily another ex-
Critical Legal Studies, Law and
ample of the economic model of reasoning
Economics, and the Law and
in general.
Society Movement
By contrast, the Law and Society schol-
Although many of the ideas of the legal ars are open-minded, eclectic, and devoid
realists have been incorporated into the of any theoretical mission. Instead, they are
mainstream of law, there are three direct de- committed to the social science method of
legal reasoning 695
inquiry and to the idea that history, culture, ing the requirement that all decisions must
and social context matter. Friedman (1 986) be justified by legal authority and precedent?
has proposed that Law and Society is a field Or are they totally unaware of their own
like “Area Studies” in which scholars from biases?
many disciplines study law the way scholars Duncan Kennedy, one of the founders of
from many disciplines study Latin America Critical Legal Studies, draws on the psychol-
or Southeast Asia. Their concern with con- ogy of Kohler, Lewin, and Piaget to explore
text and actual behavior means that they the thought processes of judges in a way
are relatively uninterested in “purely intel- that is less fuzzy and more nuanced than
lectual forces – the role of legal thinkers, for- the general realist point of view (Kennedy,
mal doctrine, philosophy and theory of law; 1 986). His hypothetical judge is a politi-
the role of abstract ideas” (Friedman, 1 986) cal reformist, of course, who is faced with
because such forces are mainly epiphenom- a conflict between what the law seems to
ena, not fundamentally causal. A great deal require and “how I want it to come out”:
of important and interesting work has come “imagine that I think the rule that seems to
from this school, but it is not really about apply is bad because it strikes the wrong
legal reasoning in general. In fact Law and balance between two identifiable conflict-
Society scholars would reject the idea that ing groups, and does so as part of a gener-
there is such a thing as legal reasoning in ally unjust overall arrangement that includes
general. many similar rules, all of which ought in the
Critical Legal Studies is the bad boy of name of justice to change” (Kennedy, 1 986,
the bunch, and in this regard it is more ob- p. 5 1 9). The judge may reinterpret the facts,
viously connected to the Legal Realists in reinterpret the legal precedents, reinterpret
their role as iconoclastic rebels. Like the re- the basic purpose of the law in the light of
alists, they argue that interpretation of the social policy, or make other moves. Judges
law is subjective, and they emphasize the will also consider how the public and other
role of power and political ideology more judges will view their decision, and finally,
strongly than most of the realists. Like the re- they really do care about the law and prece-
alists, they have been more effective as crit- dent; thus, the dilemma is a real cognitive
ics than as authors of an alternative vision dilemma, not just a matter of imposing their
(Kennedy, 1 997), and some of them have personal political motives. The decision will
glorified “trashing” as a sufficient contribu- become part of the law that other judges
tion (Tushnet, 1 984). In some ways, they must consider when they decide cases, so
resemble the postmodernists of other disci- the judge also must worry about its future
plines, insisting that there is “no there there,” ramifications. “Legal argument is the process
that all legal concepts, like all other social of creating the field of law through restate-
concepts, are socially constructed (except of ment rather than rule application” (Kennedy,
course for power and dominance). 1 986, p. 5 62). The thought process evolves
However, some of their analyses of le- in time, beginning as a conflict and ending
gal reasoning went beyond what the legal as certainty. Once a strategy is chosen, the
realists had produced. In arguing that the judge no longer can imagine any compelling
legal realists’ decisions were based on per- counterargument. Simon recently updated
sonal and social values, not law, the legal this analysis in the light of more recent re-
realists didn’t quite get at the process by search in social and cognitive psychology and
which a judge’s preference is turned into a showed that it has considerable power even
legal justification. Is the judge’s reference in cases in which the judge has no particular
to the law or precedent a “noble lie” in political motivation: An incoherent mass of
Dworkin’s (1 986) terms, resorted to because contradictions develops into a coherent de-
personal preferences or partisan political cision in which no opposing argument carries
preferences could never be publicly stated any weight, but all turn out upon close ex-
as good reasons for justifying a decision? amination to support the decision (Simon,
Are judges simply unquestioningly follow- 1 998).
Of course these biases – hindsight, hy- quate, has the same precedential force. It is
pothesis confirmation, motivated informa- final. The scientist’s conclusions are never fi-
tion processing, ultimate overconfidence, nal, always tentative.
and others – are not unique to legal rea- The judge must also decide for one side
soners. They are true of us all, including or the other; the scientist’s decision that the
scientists. Still, there are several important truth lies somewhere between the extreme
differences between legal reasoning and sci- points of view is typically not available to
entific reasoning. the judge. As I will argue, these role con-
straints in legal reasoning encourage cate-
gorical thinking and a corresponding distrust
of probabilistic reasoning, overconfidence,
Differences Between Scientific and a strong dispositional bias in which
Reasoning and Legal Reasoning situational factors and attributional biases
are overlooked, and the idea of free will is
As Llewellyn said, legal reasoning is not sci- preserved.
entific reasoning, although it shares some an-
alytic strategies, most notably the “method
Lack of Opportunity for Empirical Testing
of comparison and difference” (Llewellyn,
1 93 0, p. 43 ) or, as we might say, “conver- Scientists and judges must both decide be-
gent and discriminant validity” (Campbell & tween competing explanations. But when
Fiske, 1 95 9) and the technique of simultane- scientists are trying to decide among rival
ously considering alternative explanations or hypotheses, or even when testing a single hy-
“multiple working hypotheses” (Chamber- pothesis, sooner or later they put the ques-
lin, 1 890; Campbell & Stanley, 1 966). In fact, tion to nature. They design a study that will
the legal decision maker in an adversarial sys- create new information, information that is
tem is forced to consider at least two com- not already in the system, that will help them
peting hypotheses proposed by the parties. to answer the question and to move forward
In this sense, the judge has some marginal in the way they think about the issues. In
protection against the thoughtless hypothe- legal reasoning, there is no empirical op-
sis confirmation to which scientists occasion- tion. Judges must work with the information
ally fall prey. This is not to say that judges given to them, and that information consists
are immune from hypothesis-confirming bi- entirely of what other people have said and
ases, only that at the beginning of the process the judge’s own knowledge. Judges listen to
they are forced to consider at least two rival testimony and arguments and read the law,
hypotheses. scholarly works, and the opinions of other
Nonetheless, the judge and the scientist judges; they arrange and rearrange these el-
have different tools available to them, dif- ements, selecting, interpreting, and looking
ferent constraints, and different goals. Sci- for a rule that “holds good for the matter at
ence demands no final decisions; it is an on- hand” (Llewellyn, 1 93 0, p. 72). The conclu-
going process. If the evidence is murky, sci- sion that the judge finally reaches is not em-
entists can wait, can reserve judgment until pirically tested and cannot be disconfirmed.
they can conduct further research. And they Of course, the judge may consider empir-
can figure out what further research needs to ical data as part of the factual evidence in a
be done to answer the question, and do it. case. Most cases involve experts of one sort
Judges can neither reserve judgment nor go or another – some who present the results of
beyond the data presented in court, how- diagnostic tests (e.g., of bullets, blood, dan-
ever ambiguous those data might be. They gerousness, mental illness, almost anything
cannot carry out further research, nor wait you can think of ), some who present the re-
until others have done so; they must decide. sults of empirical work specifically related
And the judge’s decision, whether the ev- to the case (e.g., contamination of the jury
idence is conclusive or completely inade- pool through pretrial publicity, evidence of
legal reasoning 697
racial discrimination in a company’s promo- pirical research, they cannot wait for new
tion policies), some who describe the results information before making a decision.
of general research that is germane to the When the courts use available scientific
issue (e.g., evidence that some substance in- data in reaching a decision, this finality can
creases the risk of cancer, or of factors affect- be a source of frustration to scientific re-
ing the reliability of eyewitness testimony). searchers. In 1 970, the Supreme Court held
The legal realists would be pleased about this that the size of a jury (six versus twelve
increasing prevalence of social science evi- members) does not affect its functioning
dence in legal decision making, but the judge (Williams v. Florida, 1 970), and in 1 972, it
does not collect new evidence. held that deliberation would be just as thor-
The scientist is searching for truth. The ough in juries that were not required to
judge wants to get the facts right, but that reach a unanimous verdict as in those that
is not the whole task. The judge also wants were (Johnson v. Louisiana, 1 972; Apodaca
to settle the dispute in a way that is consis- et al. v. Oregon, 1 972). In the early 1 970s,
tent with the law and the decisions in pre- when these decisions were handed down,
vious disputes and that is just. So it could there was almost no research on the ef-
be argued that the whole concept of an em- fects of group size or the unanimity require-
pirical test of the final decision is irrelevant, ment. Social scientists were stunned that
that there is no empirical test of justice. such important decisions could be made on
If two scientists make opposite predictions, the basis of so little information, and a flood
someone will do a study to try to choose of studies and commentaries quickly fol-
between them or otherwise clarify the ques- lowed, many of them suggesting that twelve-
tion. If a judge makes a decision, it is fi- person, unanimous juries deliberate more
nal unless it is appealed. If it is appealed, thoroughly than six-person or nonunani-
the appellate court rarely re-examines the mous juries (Lempert, 1 975 ; Saks & Ostrum,
facts and certainly does not invite new evi- 1 975 ; Zeisel, 1 971 , on jury size; Hastie,
dence but decides whether the lower court Penrod, & Pennington, 1 983 , on unanimity).
made a legal (procedural) error (Mathieson However, the Court had already held that
& Gross, 2004). The final decision is the neither the size of the jury nor the una-
decision of the majority, and a five to four nimity requirement affected deliberations,
decision in the Supreme Court has the same and that six-person and nonunanimous ju-
precedential authority as a unanimous de- ries were constitutional. Although it is cer-
cision. When the Court is split four to four, tainly true that in science bad research can
the views of the ninth, “swing” Justice decide exert a baleful influence on the field for far
the case and can have precedential force – longer than it should (because the finding is
even if those views are quite idiosyncratic exciting, or because it is what people want to
(e.g., Johnson v. Louisiana, 1 972; Regents of believe, or because the researcher is very fa-
the University of California v. Bakke, 1 978). mous, or for various other reasons), it doesn’t
have the same force as legal precedent. It
is more acceptable and less costly for a sci-
Need for an Immediate, Final Decision
entist to reject a theory than for a judge
Unlike the judge, the scientist can reserve to overturn a previous precedent. Authority
judgment and can say that, given the mud- matters in law; in science nothing enhances
dled state of the current evidence, there are a career more than a convincing refutation
many questions that we can’t answer yet and of authority.
that further research is necessary. The judge Still, there have been cases in which
has to decide, and usually he has to decide the Supreme Court has expressed a more
one way or the other, without the range of provisional, scientific point of view. In
compromise solutions that are often avail- Witherspoon v. Illinois (1 968) the Court
able to the scientist. Just as judges cannot had before it sketchy evidence based on
create new information by conducting em- three unpublished studies suggesting that
excluding opponents of the death penalty Compromise decisions are usually impossi-
from juries in capital cases (the common ble, and in an adversary system, the judge
practice known as “death qualification”) bi- is faced with two attorneys, each making
ased the jury toward a guilty verdict, and the strongest possible case for diametrically
so when a defendant’s life was at stake he opposed outcomes and thus minimizing any
would face a greater risk of conviction than ambiguities.6 Experts may agree on most
he would if the prosecutor had not asked for of the data in their field, but those are not
the death penalty. The Court decided that the data that make for effective adversarial
the research was, as yet, “too tentative and persuasion; thus, they are not likely to be
fragmentary” to reject death-qualification as presented in court, and the judge or jury is
unconstitutional but that future data might not likely to get a sense of how much con-
justify such a move. From a scientific point sensus actually exists. The attorneys do their
of view, such a holding is far more accept- best to make every fact and every precedent
able than a holding that said, “We have re- fit their argument, trying to make it look
viewed the evidence and we conclude that as though the field is “impacted” (Kennedy,
death-qualification does not create a bias and 1 986), with little room for doubt, and that
therefore is constitutional,” which would be everything about this case places it clearly
analogous to the Williams holding on jury on one side of the line. The combination of
size. From a practical point of view, how- adversarial presentation and the need for a
ever, leaving a question open invites more dichotomous decision may eventually make
litigation, and if the practice later is found the legal reasoning of judges resemble that
to be unconstitutional, there is the problem of advocates. The facts and law may begin by
of retroactivity – that is of what to do about seeming to be a mass of contradictions, and
all those people who were convicted by bi- the judge may be plagued by “the doubts
ased, death-qualified juries. and misgivings, the hope and fears” (Car-
dozo, 1 921 , p. 1 67) common in significant
enterprises that are fraught with uncertainty
Categorical Thinking, Lack of
and ambiguity; however, judicial opinions
Compromise, and Certainty
almost never suggest that there was ever any
The need to decide the particular case one uncertainty. Once the judge realizes which
way or the other also pushes legal reasoning way he will probably decide the case and
toward categorical thinking: A person is ei- the rudiments of the justifications, “one of
ther sane (guilty) or insane (not guilty); an the effects . . . is a kind of tunnel vision: One
unfit parent (someone else gets the child) or is inside the strategy, sensitive to its internal
fit (he or she may get the child); a future economy, its history of trade-offs, attuned
danger to society (execution permitted) or to developing it further but at least tem-
not (execution not permitted, barring other porarily unable to imagine any other way to
aggravating factors). Psychologists consider go” (Kennedy, 1 986, p. 5 43 ). As in normal
sanity, fitness, and dangerousness to be con- memory processes, strong pressures toward
tinuous variables with no great gulf between consistency and coherence arise, and the ar-
the sane and the insane, the fit and the un- guments and evidence that initially seemed
fit, the safe and the dangerous, and many to favor the other side evaporate. “This sense
intermediate cases. But a legal case has to be of unequivocal support for the one decision
decided for one party or the other, and so generates a sense of inevitability, of singular
variables that are continuous are forced to correctness” (Simon, 1 998, p. 84), and judi-
become dichotomous. Sometimes there are cial opinions are generally written as though
more than two categories (first-degree mur- all arguments support the conclusion, and
der, second-degree murder, and manslaugh- there is no uncertainty whatever. Simon
ter), but a line must always be drawn. attributes this movement toward certainty
The fact that the decision must be to basic cognitive processes, and certainly
categorical very likely exercises an influence this form of thinking is not unique to law;
on the process of legal reasoning itself. it is however exaggerated, I think, by the
legal reasoning 699
adversarial presentation of evidence (with is no accident that psychiatrists and clinical

little or no attention to the ambiguous, in- psychologists had close ties to the legal sys-
between facts and law) and by the necessity tem long before research psychologists did.
of always having to choose one side. Explaining (or predicting) the behavior of a
The feeling that there must be a cer- specific individual in a specific set of circum-
tain outcome, and that expressions of uncer- stances is not what most scientists do and not
tainty by a judge are a sign of weakness or what statistics are designed for. Experts will-
incompetence (Simon, 1 998, p. 1 2) seem ing to testify to the exact probability that a
quite bizarre in a world in which the basic given defendant will commit a future crime
insights of the legal realists are widely ac- are viewed as charlatans by the scientific
cepted. But it is real. Despite the fact that community. However, statistical probabilis-
majority and dissenting justices are perfectly tic data may be quite useful in illuminating
certain (so presumably either one side is other questions that judges must consider,
dead wrong or there is some uncertainty), such as whether a company is guilty of dis-
and despite the fact that everyone knows crimination in hiring or whether a particular
that as soon as the next case comes along drug causes birth defects. These questions
“the legal materials lose their recently ac- are typically addressed with aggregate data
quired character, and return to their ambigu- in which the results of many different stud-
ous existence within the world of multiple ies involving many different people are pro-
meanings” (Simon, 1 998, p. 1 27), nonethe- vided by an expert. Judges have become far
less certainty is still valued as some sort of more receptive to statistical, empirical, ag-
mastery and uncertainty as a sign of indeci- gregate studies over the past fifty years, but
siveness at best and incompetence at worst. there is still a core reluctance. Experts who
The decision must be justified in terms of testify about the factors affecting eyewitness
the law, and it would be dangerous, in law reliability often have to overcome a certain
as in chess or sports, to suggest that the law judicial skepticism about the value of their
itself is ambiguous. testimony because they have not examined
this particular eyewitness but are only talk-
ing about the circumstances that affect most
Mistrust of Probabilistic Thinking
eyewitnesses most of the time. Large-scale
and Aggregate Data
studies of pervasive racial discrimination in
This concern with certainty and the need capital sentencing (Baldus, Woodworth, &
to make dichotomous judgments may help Pulaski, 1 990; Gross & Mauro, 1 989) were
explain why judges and legal scholars rejected by the Supreme Court in McCleskey
are often uncomfortable with probabilistic vs. Kemp (1 987) in part because the ap-
statements and probabilistic data. Scientists pellant had not shown that the particular
regularly make explicit quantified probabil- jury that tried McCleskey was influenced
ity judgments; lawyers and judges do not – by racial bias. The Court held that in order
certainly not about the ultimate issues. For to succeed with a claim of racial discrimi-
example, they strongly resist placing a nu- nation, an appellant must prove either (1 )
merical value on the “reasonable doubt” “that the decision makers in his case acted
standard: Is it 95 % certainty, 99% certainty? with discriminatory purpose” [emphasis in
Jurors are generally just given the stock original], or (2) “that the Georgia legislature
phrase, sometimes supplemented by other enacted or maintained the death penalty
phrases, such as “to a moral certainty” or statute because of an anticipated racially dis-
“firmly convinced.” criminatory effect” [emphasis in original]
This hesitation to consider probabilities is (McCleskey vs. Kemp, 1 987, p. 1 769).
not unreasonable given that the judge has to
make a yes or no decision about a particular
Free Will and the Dispositional Bias
individual. The judge’s task is more analo-
gous to that of a doctor or clinical psycholo- Aggregate data are threatening in another
gist than to that of a research scientist, and it way; they imply that many people in the
same circumstances would behave in the bility. Even when exceptions are made, they
same way and thereby threaten the notion generally are made on the basis of internal,
of autonomy and free will so deeply rooted dispositional factors (e.g., insanity, youth)
in the minds of legal thinkers. The law sees and rarely on the basis of situational forces.
behavior as caused by people’s beliefs, de-
sires, and preferences. Ideas of free choice
and free will are still fundamental to legal
thinking and largely unquestioned. This em- Conclusions and Future Directions
phasis creates another source of tension be-
tween law and the social sciences because Legal reasoning is a form of expert reason-
social science takes a much more determining. Einstein argued that expert reasoning –
istic point of view, emphasizing cultural, so- in particular, scientific reasoning – is “noth-
ciological, psychological, biological, and, es- ing but a refinement of our everyday think-
pecially in psychology, situational forces on ing” (1 93 6, in Bargmann [trans.] 1 95 4, p.
behavior (Ross and Nisbett, 1 991 ). The fact 290). Like everyday problem solving and sci-
that economics is the social science that has entific reasoning, legal reasoning begins by
been most successful in law schools is not examining a set of facts and figuring out
surprising given this model; of all the social what happened and why. Of course, some
sciences, economics is the one most wedded of the “facts” may be fictions, and the judge
to a free choice theory of behavior. must decide which to believe and which to
The law has developed a highly elaborate reject, but that is true of all natural prob-
set of definitions of various degrees of per- lem solving. Information is selected and re-
sonal responsibility, including deliberation, jected as part of the process of creating a
intention, knowledge, recklessness, and neg- coherent story.
ligence, but has been relatively untouched It is the “refinements” that make one form
by psychological research on attributional of expert reasoning different from another.
biases and particularly by the research on Like other forms of expert reasoning, the
the dispositional bias (fundamental attribu- law has its own terminology, its own uni-
tion error) or by social psychological reverse of acceptable data, and its own rules.
search demonstrating that situations play a In law, the rules are more flexible than they
far greater role than personal preferences are in some domains and more central than
and dispositions in determining people’s be- they are in most. They are more flexible than
havior (Ross & Nisbett, 1 991 ). When situa- the rules of chess, for example, because in
tional forces are considered, such as in the complex cases there are often many possible
concepts of necessity and duress, the situ- rules and precedents from which to choose,
ations are generally so extreme as to be ir- and both the facts and the rules can be inter-
relevant to everyday life – a person breaks preted and reinterpreted in relation to each
into a lonely cabin in a blizzard because other until the judge is satisfied with the to-
he is freezing to death or signs a contract tal combination – satisfied with the fitness
because someone is holding a gun to her or coherence of the overall picture, and sat-
head – and can be taken as the exceptions isfied that the decision is just.
that prove the rule that the pervasive power The rules are more central in that every
of the situation in all aspects of our lives decision must be justified by explicit dis-
is largely ignored by the law (Hanson & cussion of the relevant rules: The rules are
Yosifon, 2003 ; Ross & Shestowsky, 2003 ). not just a framework for decision making;
The validity of the concept of free will has they are an essential part of the process.
in fact troubled a sprinkling of legal schol- The sine qua non of empirical scientific re-
ars for a century (Pound, Green, Hanson), search is a clear description of the research
and these doubts have occasionally influ- method. The judge has a mass of materials
enced sentencing practices but have rarely to work with, ranging from the incoherent,
affected the basic attribution of guilt or lia- self-serving blabbering of a witness to the
legal reasoning 701
decisions of other judges to the Constitution 2. This section owes much to the work of Robert
itself, and the sine qua non of legal reason- Gordon (1 984), Duncan Kennedy (1 973 ), and,
ing is the explanation of why this decision especially, Thomas C. Grey (1 983 ).
is the right one (Schauer, 1 995 ), an expla- 3 . In the era of formalism, judges were men, so I
nation ultimately expressed as argument. refer to them as “he.” For the sake of balance,
This explanation “is meant not only to jus- I refer to scientists as she.
tify the judgment in terms of an authorita- 4. By this time, Holmes had been on the Supreme
tive past but to constitute an authority to Court for many years, and Pound had become
be referred to in the future” (White, 1 985 , more conservative and more prosaic.
p. 240). 5 . Of course there are exceptions, and a brief de-
Despite the major developments in le- scription like this one must always be, in some
ways, a caricature.
gal scholars’ interpretations of legal reasoning
over the past century and a half, legal rea- 6. In actuality, compromise is pervasive in the
legal system, because most civil cases are re-
soning itself has not changed substantially,
solved by settlement and most criminal cases
and it is unlikely to do so in the near future. by plea bargain. The study of legal reasoning,
Law is a socially defined and socially con- however, focuses on the small minority of cases
structed system that is generally seen as serv- that are litigated and decided by judges.
ing its purposes well. Undoubtedly there will
be further changes in the nature of the fac-
tual evidence judges consider relevant with
increasing attention to general scientific re-
search, but the form of legal reasoning, the References
rules of the game, cannot change without
major changes in the system itself, and there Apodaca et al. v. Oregon, (1 972) 406 U.S. 408.
is no indication of any such changes in the Baldus, D., Woodworth, G., & Pulaski, C. (1 990).
near future. Equal Justice and the Death Penalty: A Legal Em-
pirical Analysis. Boston: Northeastern Univer-
sity Press.
Burton, S. J. (1 995 ). Law and Legal Reasoning
Acknowledgments (2nd ed.). Aspen, CO: Aspen Publishers.
Campbell, D. T., & Fiske, D. W. (1 95 9). Con-
Research for this chapter was supported by vergent and discriminant validation by the
the Cook Fund of the University of Michi- multitrait–multimethod matrix. Psychological
gan. I am grateful to Hanoch Dagan, Thomas Bulletin, 5 6, 81 –1 05 .
A. Green, Sallyanne Payton, and Grace Ton- Campbell, D. T., & Stanley, J. C. (1 966). Ex-
ner for guidance in the initial stages of perimental and Quasi-experimental Designs for
this work, to Samuel Gross, Don Herzog, Research.Chicago: Rand McNally.
and Dan Molden for extremely useful com- Cardozo, B. N. (1 921 ). The Nature of the Judicial
ments on an earlier draft, and to Barbara Process. New Haven, CT: Yale University Press.
Zezulka Brown for turning it into a readable Chamberlin, T. C. (1 890). The method of multi-
manuscript. ple working hypotheses. Science, 1 5 , 92 (old
series); reprinted in Science, 1 48, 75 4–75 9
(1 965 ).
Cohen, F. S. (1 93 5 ). Transcendental nonsense and
Notes the functionalist approach. Columbia Law Re-
view, 3 5 , 809–849.
1 . European civil law systems differ from com- Dworkin, R. (1 986). Law’s Empire. Cambridge,
mon law systems in many respects, such as a MA: Harvard University Press.
more active role for the trial court judge, less Einstein, A. (1 93 6). Physics and reality. In S.
emphasis on precedent, and reconsideration of Bargmann (Trans.) (1 95 4) (p. 290). Ideas and
the facts at the appellate level. They are be- Opinions of Albert Einstein. New York: Crown
yond the scope of this chapter. Publishers.
Ellsworth, P. C., & Getman, J. G. (1 986). Social Kahneman, D., Slovic, P., & Tversky, A. (Eds.)
science in legal decision-making. In L. Lipson & (1 982). Judgment Under Uncertainty: Heuristics
S. Wheeler (Eds.). Law and the Social Sciences. and Biases. Cambridge, UK: Cambridge Uni-
New York: Russell Sage. versity Press.
Fisher, W. W. III, Horwitz, M. J., & Reed, T. A. Kennedy, D. (1 973 ). Legal formality. Journal of
(1 993 ). American Legal Realism. New York and Legal Studies, 1 1 , 3 5 1 –3 98.
Oxford, UK: Oxford University Press. Kennedy, D. (1 986). Freedom and constraint in
Frank, J. (1 93 0). Law and the Modern Mind. New adjudication: A critical phenomenology. Jour-
York: Bretano’s. nal of Legal Education, 3 6, 5 1 8–5 62.
Friedman, L. M. (1 986). The law and society Kennedy, D. (1 997). A Critique of Adjudication
movement. Stanford Law Review, 3 8, 763 – (fin de siecle). Cambridge, MA: Harvard Uni-
780. versity Press.
Gordon, R. W. (1 984). Critical legal histories. Kreps, D. M. (1 990). A Course in Macroeconomic
Stanford Law Review, 3 8, 3 7–1 25 Theory. Princeton, NJ: Princeton University
Green, T. A. (1 995 ). Freedom and criminal re- Press.
sponsibility in the age of Pound: An essay Langdell, C. C. (1 880). Summary of the Law of
on criminal justice. Michigan Law Review, 93 , Contracts. Boston: Little, Brown.
1 91 5 –205 3 . Langdell, C. C. (1 887). Speech delivered by
Grey, T. C. (1 983 ). Langdell’s orthodoxy. Univer- C. C. Langdell at the quarter-millennial cel-
sity of Pittsburgh Law Review, 45 , 1 –5 3 . ebration of Harvard University, 5 November,
Gross, S., & Mauro, R. (1 989) Death and 1 887. Reprinted in The Law Quarterly Review,
Discrimination: Racial Disparities in Capital 3 , 1 23 –1 25 (1 887).
Sentencing. Boston: Northeastern University Lasswell, H. (1 93 0). Self-analysis and judicial
Press. thinking. International Journal of Ethics, 40,
Hanson, J., & Yosifon, D. (2003 ). The situation: 3 5 4–3 62.
An introduction to the situational character, Lempert, R. O. (1 975 ). Uncovering “nondiscern-
critical realism, power economics, and deep able” differences: Empirical research and the
capture. University of Pennsylvania Law Review, jury-size cases. Michigan Law Review, 73 , 643 –
1 5 2 , pp. 1 29–3 46. 708.
Hastie, R., Penrod, S. D., & Pennington, N. Llewellyn, K. N. (1 93 0). The Bramble Bush.
(1 983 ). Inside the Jury. Cambridge, MA: Har- Columbia Law School. Later published as The
vard University Press. Bramble Bush, or, Our Law and Its Study. New
Holmes, O. W. (1 873 ). The theory of torts. Amer- York: Oceana, 1 960.
ican Law Review, 7, 65 2–663 . Llewellyn, K. N. (1 942). On the good, the true,
Holmes, O. W. (1 881 ). The Common Law. Boston: the beautiful, in law. University of Chicago Law
Little, Brown. Review, 9, 224–265 .
Holmes, O. W. (1 897). The path of the law. Har- Llewellyn, K. N. (1 95 0). Remarks on the theory
vard Law Review, 1 0, 45 7–478. of appellate decision and the rules or canons
Horwitz, M. (1 992). The Transformation of Amer- about how statutes are to be construed. Van-
ican Law: 1 870–1 960: The Crisis of Legal derbilt Law Review, 3 , 3 95 –406.
Orthodoxy. New York: Oxford University Llewellyn, K. N. (1 95 6). On what makes legal re-
Press. search worthwhile. Journal of Legal Education,
Hutcheson, J. C., Jr. (1 929). The judgment 8, 3 99–421 .
intuitive: The function of the ‘hunch’ in judi- Lochner v. New York (1 905 ), 1 98 U.S. 45 .
cial decision. Cornell Law Quarterly, 1 4, 274– Mathieson, A., & Gross, S. R. (2004). Review for
288. error, Law, Probability and Risk, 2 , pp. 25 9–
Johnson v. Louisiana (1 972) 406 U.S. 3 5 6. 268.
Kahneman, D., & Tversky, A. (Eds.) (2000). McCleskey v. Kemp (1 987). 481 U.S. 279.
Choices, Values, and Frames. New York: Russell Moore, U., & Callahan, C. C. (1 943 ). Law and
Sage Foundation; Cambridge, UK: Cambridge learning theory: A study in legal control. Yale
University Press. Law Journal, 5 3 , 1 –1 3 6.
legal reasoning 703
Muller v. Oregon (1 908) 208 U.S. 41 2. Schlegel, J. (1 980). American legal realism and
Nisbett, R. E., & Wilson, T. (1 977). Telling more empirical social science: From the Yale experi-
than we can know: Verbal reports on men- ence. Buffalo Law Review, 2 8, 45 9–5 86.
tal processes. Psychological Review, 84, 23 1 – Simon, D. (1 998). A psychological model of judi-
25 9. cial decision making. Rutgers Law Journal, 3 0,
Pound, R. (1 909). Liberty of contract. The Yale 1 –1 42.
Law Journal, 1 8, 45 4–487. Sunstein, C. R. (1 996). Legal Reasoning and Po-
Pound, R. (1 91 2). The scope and purpose of so- litical Conflict. New York: Oxford University
ciological jurisprudence. Harvard Law Review, Press.
2 5 , 489–5 1 6. Thaler, R. H. (1 992). The Winner’s Curse: Para-
Pound, R., & Frankfurter, F. (1 922). Criminal Jus- doxes and Anomalies of Economic Life. New
tice in Cleveland. Cleveland, OH: The Cleve- York: Free Press.
land Foundation. Tushnet, M. (1 984). Trashing. Stanford Law Re-
Regents of the University of California v. Bakke view, 3 6, 293 –3 48.
(1 978) 43 8 U.S. 265 . White, J. B. (1 985 ). Heracles’ bow: Essays on the
Ross, L. D., & Nisbett, R. E. (1 991 ). The Person Rhetoric and Poetics of the Law. Madison: Uni-
and the Situation. Boston: McGraw-Hill. versity of Wisconsin Press.
Ross, L., & Shestowsky, D. (2003 ). Contempo- Williams v. Florida (1 970) 3 99 U.S. 78.
rary psychology’s challenges to legal theory and Witherspoon v. Illinois (1 968) 3 91 U.S. 5 1 0.
practice. Northwestern University Law Review, Yntema, H. (1 928). The hornbook method and
97, pp. 1 081 –1 1 1 4. the conflict of laws. Yale Law Journal, 3 7, 468–
Saks, M. J., & Ostrum, T. M. (1 975 ). Jury size and 483 .
consensus requirements: The laws of probabil- Zajonc, R. B. (1 965 ). Social facilitation. Science,
ity vs. the law of the land. Journal of Contem- 1 49, 269–274.
porary Law, 1 , 1 63 –1 73 . Zeisel, H. (1 971 ). And then there were none:
Schauer, F. (1 995 ). Giving reasons. Stanford Law The diminution of the federal jury. University
Review, 47, 63 3 –65 9. of Chicago Law Review, 3 8, 71 0–724.
CHAPTER 2 9
Scientific Thinking and Reasoning
Kevin Dunbar
Jonathan Fugelsang
What Is Scientific Thinking volves investigating thinking that has scien-

and Reasoning? tific content. A number of overlapping re-
search traditions have been used to investi-
Scientific thinking refers to the mental gate scientific thinking. We cover the history
processes used when reasoning about the of research on scientific thinking and the dif-
content of science (e.g., force in physics), ferent approaches that have been used, high-
engaged in typical scientific activities (e.g., lighting common themes that have emerged
designing experiments), or specific types of over the past fifty years of research.
reasoning that are frequently used in sci-
ence (e.g., deducing that there is a planet
beyond Pluto). Scientific thinking involves A Brief History of Research
many general-purpose cognitive operations on Scientific Thinking
that human beings apply in nonscientific do-
mains such as induction, deduction, anal- Science is often considered one of the hall-
ogy, problem solving, and causal reason- marks of the human species, along with
ing. These cognitive processes are covered art, music, and literature. Illuminating the
in many chapters of this handbook (see thought processes used in science there-
Sloman & Lagnado, Chap. 5 on induction; fore reveals key aspects of the human mind.
Holyoak, Chap. 6 on analogy; Buehner and The thought processes underlying scientific
Cheng, Chap. 7 on causality; Evans, Chap. thinking have fascinated both scientists and
8 on deduction; Novick and Bassok, Chap. nonscientists because the products of sci-
1 4 on problem solving; Chi and Ohllson, ence have transformed our world and be-
Chap. 1 6 on conceptual change). What dis- cause the process of discovery is shrouded
tinguishes research on scientific thinking in mystery. Scientists talk of the chance dis-
from general research on cognition is that covery, the flash of insight, the years of per-
research on scientific thinking typically inspiration, and the voyage of discovery. These
705
images of science have helped make the years (e.g., Bacon, 1 620; Galilei, 1 63 8; Klahr,
mental processes underlying the discovery 2000; Tweney, Doherty, & Mynatt, 1 981 ).
process intriguing to cognitive scientists as Understanding the nature of scientific think-
they attempt to uncover what really goes ing has been an important and central issue
on inside the scientific mind and how sci- not only for our understanding of science,
entists really think. Furthermore, the ques- but also for our understating of what it is to
tions, “Can scientists be taught to think bet- be human. Bacon’s Novumm Organum, in
ter, avoiding mistakes of scientific thinking?” 1 620, sketched out some of the key features
and “Could the scientific process be auto- of the ways that experiments are designed
mated such that scientists are no longer nec- and data interpreted. Over the ensuing 400
essary?” make scientific thinking a topic of years, philosophers and scientists vigorously
enduring interest. One of the most com- debated the appropriate methods that scien-
pelling accounts of science that makes the tists should use (see Giere, 1 993 ). These de-
reader want to understand science and why bates over the appropriate methods for sci-
science is interesting recently appeared in ence typically resulted in the espousal of a
the journal Popular Science. In this article, particular type of reasoning method such as
Charles Hirshberg discusses his mother, sci- induction or deduction. It was not until the
entist Joan Feynman, and her scientific con- Gestalt psychologists began working on the
tributions as well as the difficulties of being a nature of human problem solving during the
woman scientist. The following excerpt cap- 1 940s that experimental psychologists began
tures the excitement and thrill that even a to investigate the cognitive processes under-
household encounter with science can gen- lying scientific thinking and reasoning.
erate and that is thought to be at the root The Gestalt Psychologist Max Werthei-
of many scientists’ desire to conduct science mer initiated the first investigations of sci-
(Hirschberg, 2003 ). entific thinking in his landmark book, Pro-
ductive Thinking (Wertheimer, 1 945 ; see
My introduction to chemistry came in Novick & Bassok, Chap. 1 4). Wertheimer
1 970, on a day when my mom was bak- spent a considerable amount of time corre-
ing challah bread for the Jewish New Year. sponding with Albert Einstein, attempting
I was about ten, and though I felt cooking to discover how Einstein generated the con-
was unmanly for a guy who played short-
cept of relativity. Wertheimer argued that
stop for Village Host Pizza in the Menlo
Park, California, Little League, she had Einstein had to overcome the structure of
persuaded me to help. When the bread was Newtonian physics at each step in his the-
in the oven, she gave me a plastic pill bot- orizing and the ways that Einstein actually
tle and a cork. She told me to sprinkle a achieved this restructuring were articulated
little baking soda into the bottle, then a lit- in terms of Gestalt theories. For a recent
tle vinegar, and cork the bottle as fast as and different account of how Einstein made
I could. There followed a violent and com- his discovery, see Galison (2003 ). We will
pletely unexpected pop as the cork flew off see later how this process of overcoming al-
and walloped me in the forehead. Explod- ternative theories is an obstacle with which
ing food: I was ecstatic! “That’s called a both scientists and nonscientists need to
chemical reaction,” she said, rubbing my
deal when evaluating and theorizing about
shirt clean. “The vinegar is an acid and the
soda is a base, and that’s what happens the world.
when you mix the two.” After that, I never One of the first investigations of the cog-
understood what other kids meant when nitive abilities underlying scientific think-
they said that science was boring. ing was the work of Jerome Bruner and his
colleagues at Harvard (Bruner, Goodnow, &
The cognitive processes underlying sci- Austin, 1 95 6). They argued that a key ac-
entific discovery and day-to-day scientific tivity in which scientists engage is to deter-
thinking have been a topic of intense mine whether or not a particular instance
scrutiny and speculation for almost 400 is a member of a category. For example, a
scientific thinking and reasoning 707
scientist might want to discover which sub- that the rule was even numbers increasing by
stances undergo fission when bombarded by two. They then attempted to confirm their
neutrons and which substances do not. Here, hypothesis by generating a triad like 8-1 0-
scientists have to discover the attributes that 1 2, then 1 4-1 6-1 8. These triads are consis-
make a substance undergo fission. Bruner tent with the rule and the participants were
et al. (1 95 6) saw scientific thinking as the told yes, that the triads were indeed con-
testing of hypotheses and collecting of data sistent with the rule. However, when they
with the end goal of determining whether proposed the rule, even numbers increas-
something is a member of a category or not. ing by two, they were told that the rule
They invented a paradigm in which people was incorrect. The correct rule was num-
were required to formulate hypotheses and bers of increasing magnitude. From this re-
collect data that test their hypotheses. Us- search Wason concluded that people try and
ing this approach, Bruner et al. identified a confirm their hypotheses, whereas norma-
number of strategies people use to formu- tively speaking, they should try and discon-
late and test hypotheses. They found that firm their hypotheses. One implication of
a key factor determining which hypothesis this research is that confirmation bias is not
testing strategy people use is the amount of just restricted to scientists but is a general
memory capacity the strategy takes up (see human tendency.
also Morrison, Chap. 1 9, on working mem- It was not until the 1 970s that a general
ory). Another key factor they discovered was account of scientific reasoning was proposed.
that it is much more difficult for people to Herbert Simon, often in collaboration with
discover negative concepts (e.g., not blue) Allan Newell (e.g., Newell & Simon, 1 972),
than positive concepts (e.g., blue). Although proposed that scientific thinking is a form
the Bruner et al. research is most com- of problem solving. He proposed that prob-
monly thought of as work on concepts, they lem solving is a search in a problem space.
saw their work as uncovering a key compo- Newell and Simon’s (1 972) theory of prob-
nent of scientific thinking. lem solving is discussed in many places in
A second early line of research on scien- this volume, usually in the context of spe-
tific thinking was developed by Peter Wa- cific problems (see especially Novick & Bas-
son and his colleagues. Like Bruner et al., sok, Chap. 1 4, on problem solving). Herbert
Wason (1 968) saw a key component of sci- Simon (1 977), however, devoted consider-
entific thinking as being the testing of hy- able time to understanding many different
potheses. Whereas Bruner et al. focused on scientific discoveries and scientific reason-
the different types of strategies people use ing processes. The common thread in his re-
to formulate hypotheses, Wason focused on search was that scientific thinking and dis-
whether people adopt a strategy of trying covery is not a mysterious magical process
to confirm or disconfirm their hypotheses. but a process of problem solving in which
Using Popper’s (1 95 9) theory that scien- clear heuristics are used. Simon’s goal was to
tists should try and falsify rather than con- articulate the heuristics that scientists use in
firm their hypotheses, Wason devised a de- their research at a fine-grained level. He built
ceptively simple task in which participants many programs that simulated the process of
were given three numbers, such as 2-4-6, scientific discovery and articulated the spe-
and were asked to discover the rule under- cific computations that scientists use in their
lying the three numbers. Participants were research (see subsequent section on compu-
asked to generate other triads of numbers, tational approaches to scientific thinking).
and the experimenter would tell the partic- Particularly important was Simon and Lea’s
ipant whether the triad was consistent or (1 974) work demonstrating that concept for-
inconsistent with the rule. They were told mation and induction consist of a search in
that when they were sure they knew what two problem spaces: a space of instances and
the rule was they should state it. Most par- a space of rules. This idea has been highly
ticipants began the experiment by thinking influential on problem-solving accounts of
scientific thinking that will be discussed in arching framework to understand the scien-
the next section. tific mind. One framework that has had a
Overall, the work of Bruner, Wason, and great influence in cognitive science is that
Simon laid the foundations for contempo- scientific thinking and scientific discovery
rary research on scientific thinking. Early can be conceived as a form of problem solv-
research on scientific thinking is conve- ing. Simon (1 977) argued that both scientific
niently summarized in Tweney, Doherty, thinking in general and problem solving in
and Mynatt’s 1 981 book, On Scientific Think- particular could be thought of as a search in
ing, in which they sketched out many of the a problem space (see Chapter 1 1 ). A prob-
themes that have dominated research on sci- lem space consists of all the possible states
entific thinking over the past few decades. of a problem and all the operations that a
Other more recent books, such as Ronald problem solver can use to get from one state
Giere’s Cognitive Models of Science (1 993 ); to the next (see problem solving entry). Ac-
David Klahr’s Explaining Science (2000); Pe- cording to this view, by characterizing the
ter Carruthers, Steven Stich, and Michael types of representations and procedures peo-
Siegal’s Cognitive Basis of Science (2002); and ple use to get from one state to another, it
Gorman and colleagues’ Scientific and Tech- is possible to understand scientific thinking.
nical Thinking (2005 ) provide detailed anal- Scientific thinking therefore can be charac-
yses of different aspects of scientific discov- terized as a search in various problem spaces
ery. In this chapter, we discuss the main ap- (Simon, 1 977). Simon investigated a num-
proaches that have been used to investigate ber of scientific discoveries by bringing par-
scientific thinking. ticipants into the laboratory, providing the
One of the main features of investigations participants with the data to which a sci-
of research on the scientific mind has been entist had access, and getting the partici-
to take one aspect of scientific thinking that pants to reason about the data and rediscover
is thought to be important and investigate a scientific concept. He then analyzed the
it in isolation. How does one go about in- verbal protocols participants generated and
vestigating the many different aspects of sci- mapped out the types of problem spaces in
entific thinking? Numerous methodologies which the participants searched (e.g., Qin &
have been used to analyze the genesis of sci- Simon, 1 990). Kulkarni and Simon (1 988)
entific concepts, theories, hypotheses, and used a more historical approach to uncover
experiments. Researchers have used experi- the problem-solving heuristics that Krebs
ments, verbal protocols, computer programs, used in his discovery of the urea cycle. Kulka-
and analysis of particular scientific discover- rni and Simon analyzed Krebs’s diaries and
ies. A recent development has been to inves- proposed a set of problem-solving heuristics
tigate scientists as they reason “live” (in vivo that he used in his research. They then built a
studies of scientific thinking) in their own computer program incorporating the heuris-
laboratories (Dunbar, 1 995 , 2002). From a tics and biological knowledge that Krebs had
“thinking and reasoning” standpoint, the ma- before he made his discoveries. Of particular
jor aspects of scientific thinking that have importance are the search heuristics the pro-
been most actively investigated are prob- gram uses such as the experimental proposal
lem solving, analogical reasoning, hypothe- heuristics and the data interpretation heuris-
sis testing, conceptual change, collaborative tics built into the program. A key heuristic
reasoning, inductive reasoning, and deduc- was an unusualness heuristic that focused
tive reasoning. on unusual findings and guided the search
through a space of theories and a space of
experiments.
Klahr and Dunbar (1 988) extended the
Scientific Thinking as Problem Solving search in a problem space approach and pro-
posed that scientific thinking can be thought
One important goal for accounts of scien- of as a search through two related spaces –
tific thinking has been to provide an over- an hypothesis space and an experiment
scientific thinking and reasoning 709
space. Each problem space that a scientist historians, and experimental psychologists
uses will have its own types of representa- have devoted a considerable amount of re-
tions and operators used to change the rep- search to “confirmation bias.” This occurs
resentations. Search in the hypothesis space when scientists consider only one hypoth-
constrains search in the experiment space. esis (typically the favored hypothesis) and
Klahr and Dunbar found that some partic- ignore alternative hypotheses or other po-
ipants move from the hypothesis space to tentially relevant hypotheses. This impor-
the experiment space, whereas others move tant phenomenon can distort the design of
from the experiment space to the hypothesis experiments, formulation of theories, and
space. These different types of searches lead interpretation of data. Beginning with the
to the proposal of different types of hypothe- work of Wason (1 968) and as discussed pre-
ses and experiments. More recent work viously, researchers have repeatedly shown
has extended the dual-space approach to that when participants are asked to design
include alternative problem-solving spaces, an experiment to test a hypothesis, they pre-
including those for data, instrumentation, dominantly design experiments they think
and domain-specific knowledge (Schunn & will yield results consistent with the hypoth-
Klahr, 1 995 , 1 996; Klahr & Simon, 1 999). esis. Using the 2-4-6 task mentioned ear-
lier, Klayman and Ha (1 987) showed that
in situations in which one’s hypothesis is
likely to be confirmed, seeking confirmation
Scientific Thinking as Hypothesis is a normatively incorrect strategy, whereas
Testing when the probability of confirming one’s
hypothesis is low, then attempting to con-
Many researchers have regarded testing spe- firm ones hypothesis can be an appropri-
cific hypotheses predicted by theories as one ate strategy. Historical analyses by Tweney
of the key attributes of scientific thinking. (1 989) on the way that Faraday made his dis-
Hypothesis testing is the process of evalu- coveries and experiments investigating peo-
ating a proposition by collecting evidence ple testing hypotheses have revealed that
regarding its truth. Experimental cognitive people use a confirm early–disconfirm late
research on scientific thinking that specifi- strategy: When people initially generate or
cally examines this issue has tended to fall are given hypotheses, they try to gather ev-
into two broad classes of investigations. The idence that is consistent with the hypoth-
first class is concerned with the types of esis. Once enough evidence has been gath-
reasoning that lead scientists astray, block- ered, people attempt to find the boundaries
ing scientific ingenuity (see also Sternberg, of their hypothesis and often try to discon-
Chap. 1 5 on creativity). A large amount of firm their hypotheses.
research has been conducted on the poten- In an interesting variant on the con-
tially faulty reasoning strategies that both firmation bias paradigm, Gorman (1 989)
participants in experiments and scientists has shown that when participants are told
use such as considering only one favored hy- there is the possibility of error in the data
pothesis at a time and how this prevents they receive, they assume any data incon-
scientists from making discoveries. The sec- sistent with their favored hypothesis are at-
ond class is concerned with uncovering the tributable to error. The possibility of error
mental processes underlying the generation therefore insulates hypotheses against dis-
of new scientific hypotheses and concepts. confirmation. This hypothesis has not been
This research has tended to focus on the use confirmed by other researchers (Penner &
of analogy and imagery in science as well as Klahr, 1 996) but is an intriguing one that
the use of specific types of problem-solving warrants further investigation.
heuristics (see also Holyoak, Chapter 6 Confirmation bias is very difficult to over-
on analogy). come. Even when participants are asked
Turning first to investigations of what di- to consider alternate hypotheses, they of-
minishes scientific creativity, philosophers, ten fail to conduct experiments that could
potentially disconfirm their hypothesis. One important issue in the causal rea-
Tweney and his colleagues provide an excel- soning literature that is directly relevant to
lent overview of this phenomenon in their scientific thinking is the extent to which sci-
classic monograph “On Scientific Think- entists and nonscientists are governed by the
ing” (1 981 ). The precise reasons for this search for causal mechanisms (i.e., the chain
type of block are still widely debated. Re- of events that lead from a cause to an effect)
searchers such as Michael Doherty have ar- versus the search for statistical data (i.e., how
gued that limitations in working memory often variables co-occur). This dichotomy
make it difficult for people to consider more can be boiled down to the search for quali-
than one hypothesis. Consistent with this tative versus quantitative information about
view, Dunbar and Sussman (1 995 ) showed the paradigm the scientist is investigating.
that when participants are asked to hold Researchers from a number of cognitive psy-
irrelevant items in working memory while chology laboratories have found that peo-
testing hypotheses, participants are unable ple prefer to gather more information about
to switch hypotheses in the face of inconsis- an underlying mechanism than covariation
tent evidence (see also Morrison, Chap. 1 9, between a cause and an effect (e.g., Ahn
on working memory). Although limitations et al., 1 995 ). That is, the predominant strat-
of working memory are involved in the phe- egy that students in scientific thinking simu-
nomenon of confirmation bias, even groups lations use is to gather as much information
of scientists can display confirmation bias. as possible about how the objects under in-
The recent controversies over cold fusion vestigation work rather than collecting large
are an example of confirmation bias. Here, amounts of quantitative data to determine
large groups of scientists had other hypothe- whether the observations hold across mul-
ses available to explain their data but yet tiple samples. These findings suggest that a
maintained their hypotheses in the face of central component of scientific thinking may
other, more standard alternative hypotheses. be to formulate explicit mechanistic causal
Mitroff (1 974) provides some interesting ex- models of scientific events.
amples of scientists at the National Aero- One place where causal reasoning has
nautical and Space Administration demon- been observed extensively is when scientists
strating confirmation bias that highlights obtain unexpected findings. Both historical
the roles of commitment and motivation in and naturalistic research has revealed that
this process. reasoning causally about unexpected find-
ings has a central role in science. Indeed,
scientists themselves frequently state that a
finding was attributable to chance or was un-
Causal Thinking in Science expected. Given that claims of unexpected
findings are such a frequent component of
Much of scientific thinking and scientific scientists’ autobiographies and interviews
theory building pertains to the development in the media, Dunbar (1 995 , 1 997, 1 999;
of causal models between variables of inter- Dunbar & Fugelsang, 2004; Fugelsang et al.,
est. For example, does smoking cause cancer, 2004) decided to investigate the ways that
Prozac relieve depression, or aerosol spray scientists deal with unexpected findings. In
deplete the ozone layer? (See also Buehner & 1 991 –1 992 Dunbar spent one year in three
Cheng, Chap. 7, on causality.) Scientists and molecular biology laboratories and one im-
nonscientists alike are constantly bombarded munology laboratory at a prestigious U.S.
with statements regarding the causal rela- university. He used the weekly laboratory
tionship between such variables. How does meeting as a source of data on scientific dis-
one evaluate the status of such claims? What covery and scientific reasoning. (This type of
kinds of data are informative? How do sci- study he has called InVivo cognition). When
entists and nonscientists deal with data that he examined the types of findings the sci-
are inconsistent with their theory? entists made, he found that more than 5 0%
scientific thinking and reasoning 71 1
Figure 2 9.1 . Causal thinking in science. Potential mechanisms of human

immunodeficiency virus integration into host DNA. The diagram shows two
potential causal mechanisms – cellular (left branch) and viral (right branch).
were unexpected and that these scientists esis was supported or refuted, participants
had evolved a number of important strate- spent the majority of their time consid-
gies for dealing with such findings. One clear ering unexpected findings. An analysis of
strategy was to reason causally about the participants’ verbal protocols indicates that
findings: Scientists attempted to build causal much of this extra time is spent formu-
models of their unexpected findings. This lating causal models for the unexpected
causal model building resulted in the exten- findings.
sive use of collaborative reasoning, analog- Scientists are not merely the victims
ical reasoning, and problem-solving heuris- of unexpected findings but plan for unex-
tics (Dunbar, 1 997; 2001 ). pected events to occur. An example of the
Many of the key unexpected findings ways that scientists plan for unexpected con-
that scientists reasoned about in the InVivo tingencies in their day-to-day research is
studies of scientific thinking were inconsis- shown in Figure 29.1 . Figure 29.1 is an ex-
tent with the scientists’ pre-existing causal ample of a diagram in which the scientist is
models. A laboratory equivalent of the bi- building causal models about the ways that
ology labs therefore was to create a situa- human immunodeficiency virus (HIV) inte-
tion in which students obtained unexpected grates itself into the host deoxyribonucleic
findings that were inconsistent with their acid (DNA) taken from a presentation at
pre-existing theories. Dunbar and Fugelsang a lab meeting. The scientist proposes two
(2005 ; see also Fugelsang et al., 2004) ex- main causal mechanisms by which HIV in-
amined this issue by creating a scientific tegrates into the host DNA. The main event
causal thinking simulation in which exper- that must occur is that gaps in the DNA
imental outcomes were either expected or must be filled. In the left-hand branch of
unexpected. (Dunbar [1 995 ] called this type Diagram 2, he proposes a cellular mech-
of study of people reasoning in a cognitive anism whereby cellular polymerase fills in
laboratory InVitro cognition). They found gaps as the two sources of DNA integrate.
that students spent considerably more time In the right-hand branch, he proposes that
reasoning about unexpected findings than instead of cellular mechanisms filling in the
expected findings. Second, when assessing gaps, viral enzymes fill in the gap and join
the overall degree to which their hypoth- the two pieces of DNA. He then designs an
experiment to distinguish between these rule that governs them. Once a rule is dis-
two causal mechanisms. Clearly, visual and covered, scientists can extrapolate from the
diagrammatic reasoning is used here and is rule to formulate theories of the observed
a useful way of representing different causal and yet to be observed phenomena. One ex-
mechanisms (see also Tversky, Chap. 1 0 on ample is using inductive reasoning in the dis-
visuospatial reasoning). In this case, the vi- covery that a certain type of bacterium is a
sual representations of different causal paths cause of many ulcers (Thagard, 1 999). In a
are used to design an experiment and predict fascinating series of articles, Thagard docu-
possible results. Thus, causal reasoning is a ments the reasoning processes that Marshall
key component of the experimental design and Warren went through in proposing this
process. novel hypothesis. One key reasoning pro-
When designing experiments, scientists cess was the use of induction by generaliza-
know that unexpected findings occur of- tion. Marshall and Warren noted that almost
ten and have developed many strategies to all patients with gastric enteritis had a spi-
take advantage of them (Baker & Dunbar, ral bacterium in their stomachs and formed
2000). Scientists build different causal mod- the generalization that this bacterium is the
els of their experiments incorporating many cause of many stomach ulcers. There are nu-
conditions and controls. These multiple merous other examples of induction by gen-
conditions and controls allow unknown eralization in science, such as Tycho Brahe
mechanisms to manifest themselves. Rather induction about the motion of planets from
than being the victims of the unexpected, his observations, Dalton’s use of induction in
the scientists create opportunities for unex- chemistry, and the discovery of prions as the
pected events to occur, and once these events source of mad cow disease. Many theories
do occur, they have causal models that al- of induction have used scientific discovery
low them to determine exactly where in the and reasoning as examples of this important
causal chain their unexpected finding arose. reasoning process.
The results of these InVivo and InVitro stud- Another common type of inductive rea-
ies all point to a more complex and nuanced soning is to map a feature of one member
account of how scientists and nonscientists of a category to another member of a cate-
test and evaluate hypotheses. gory. This is called categorical induction. This
type of induction projects a known prop-
erty of one item onto another item from the
same category. Thus, knowing that the Rous
The Roles of Inductive and Deductive Sarcoma virus is a retrovirus that uses RNA
Thinking in the Scientific Mind rather than DNA, a biologist might assume
that another virus that is thought to be a
One of the most basic characteristics of sci- retrovirus also uses RNA rather than DNA.
ence is that scientists assume that the uni- Although research on this type of induction
verse that we live in follows predictable typically has not been discussed in accounts
rules. Very few scientists in this century of scientific thinking, this type of induction
would refute the claim that the earth ro- is common in science. For an important con-
tates around the sun, for example. Scien- tribution to this literature see Smith, Shafir,
tists reason from these rules using a variety and Osherson (1 993 ), and for a review of
of different strategies to make new scien- this literature see Heit (2000).
tific discoveries. Two frequently used types Turning now to deductive thinking, many
of reasoning strategies are inductive (see thinking processes to which scientists adhere
Sloman & Lagnado, Chap. 5 ) and deductive follow traditional rules of deductive logic.
reasoning (see Evans, Chap. 8). In the case These processes correspond to conditions in
of inductive reasoning, a scientist may ob- which a hypothesis may lead to, or is de-
serve a series of events and try to discover a ducible to, a conclusion. Although they are
not always phrased in syllogistic form, de- tent errors. A common context error that
ductive arguments can usually be phrased as people often make is to assume that con-
“syllogisms,” or as brief mathematical state- ditional relationships are, in fact, bicondi-
ments in which the premises lead to the con- tional. The conditional statement “if some-
clusion. Deductive reasoning is an extremely one has AIDS then they also have HIV,”
important aspect of scientific thinking be- for example, does not necessarily imply that
cause it underlies a large component of how “if someone has HIV then they also have
scientists conduct their research. By looking AIDS.” This is a common error in deduc-
at many scientific discoveries, we can often tive reasoning that can result in logically in-
see that deductive reasoning is at work. De- correct conclusions being drawn. A common
ductive reasoning statements all contain in- content error people often make is to modify
formation or rules that state an assumption the interpretation of a conclusion based on
about how the world works and a conclu- the degree to which the conclusion is plau-
sion that would necessarily follow from the sible. Here, scientists may be more likely to
rule. A classic example that is still receiv- accept a scientific discovery as valid if the
ing much scientific investigation today is the outcome is plausible. You can see how this
case of Planet X. In the early twentieth cen- second class of errors in deductive logic can
tury, Percival Lowell coined the term “Planet have profound implications for theory de-
X” when referring to any planet yet to be dis- velopment. Indeed, if scientists are overly
covered. Around that time and continuing to blinded by the plausibility of an outcome,
this day, based on rather large residual orbital they may fail to objectively evaluate the
perturbations of Uranus and Neptune, many steps in their deductive process.
scientists are convinced there exists a yet to
be discovered planet in our solar system. Be-
cause it is assumed as fact that only large ob-
jects that possess a strong gravitational force The Roles of Analogy in Scientific
can cause such perturbations, the search for Thinking
such an object ensued. Given Pluto’s rather
meager stature, it has been dismissed as a One of the most widely mentioned rea-
candidate for these perturbations. We can soning processes used in science is analogy.
apply these statements to deductive logic Scientists use analogies to form a bridge
as follows: between what they already know and what
they are trying to explain, understand, or dis-
Premise 1: The gravitational force of large cover. In fact, many scientists have claimed
planetary bodies causes perturbations in or- that the use of certain analogies was instru-
bits of planetary bodies mental in their making a scientific discovery,
Premise 2: Uranus and Neptune have per-
and almost all scientific autobiographies and
turbations in their orbits
Conclusion: The gravitational force of a biographies feature an important analogy
large planetary body influences the orbits that is discussed in depth. Coupled with the
of Uranus and Neptune fact that there has been an enormous re-
search program on analogical thinking and
Of course, the soundness of the logical de- reasoning (see Holyoak, Chap. 6), we now
duction is completely dependent on the have a number of models and theories of
accuracy of the premises. If the premises analogical reasoning that show exactly how
are correct, then the conclusion will analogy can play a role in scientific discovery
be correct. (see Gentner, Holyoak, & Kokinov, 2001 ).
Inductive and deductive reasoning, even By analyzing the use of analogies in sci-
by successful scientists, is not immune to ence, Thagard and Croft (1 999), Nersessian
error. Two classes of errors commonly found (1 999), Gentner and Jeziorski (1 993 ), and
in deductive reasoning are context and con- Dunbar and Blanchette (2001 ) all have
shown that analogical reasoning is a key as- hitherto unknown feature of the target or
pect of scientific discovery. restructures the target into a new set of rela-
Traditional accounts of analogy distin- tions. Interestingly, research on analogy has
guish between two components of analog- shown that participants do not easily use
ical reasoning – the target and the source. analogy (see Gentner et al., 1 997; Holyoak
The target is the concept or problem that & Thagard, 1 995 ). Participants tend to fo-
a scientist is attempting to explain or solve. cus on the sharing of a superficial feature
The source is another piece of knowledge between the source and the target, rather
that the scientist uses to understand the tar- than the relations among features. In his
get, or to explain the target to others. What InVivo studies of science, Dunbar (1 995 ,
the scientist does when he or she makes an 2001 , 2002) investigated the ways that scien-
analogy is to map features of the source onto tists use analogies while they are conducting
features of the target. By mapping the fea- their research and found that scientists use
tures of the source onto the target, new fea- both relational and superficial features when
tures of the target may be discovered, or the they make analogies. The choice of whether
features of the target can be rearranged so to use superficial or relational features de-
that a new concept is invented and a scien- pends on their goals. If their goal is to fix
tific discovery is made. A common analogy a problem in an experiment, their analogies
used with computers is to describe a harmful are based upon superficial features. If their
piece of software as a computer virus. Once goal is to formulate hypotheses, they focus
a piece of software is called a virus, people on analogies based upon sets of relations.
can map features of biological viruses, such One important difference between scien-
as they are small, spread easily, self-replicate tists and participants in experiments is that
using a host, and cause damage. Not only the scientists have deep relational knowl-
do people map a single feature of the source edge of the processes they are investigat-
onto the target but also the systems of re- ing and can use that relational knowledge to
lations between features from the source to make analogies.
the target. They also make analogical infer- Analogies sometimes lead scientists and
ences. If a computer virus is similar to a bi- students astray. Evelyn Fox-Keller (1 985 )
ological virus, for example, an immune sys- shows how an analogy between the pulsing
tem can be created on computers that can of a lighthouse and the activity of the slime
protect computers from future variants of mold dictyostelium led researchers astray for
a virus. One of the reasons scientific anal- a number of years. Likewise, the analogy
ogy is so powerful is that it can generate between the solar system (the source) and
new knowledge such as the creation of a the structure of the atom (the target) has
computational immune system having many been shown to be potentially misleading to
of the features of a real biological immune students taking more advanced courses in
system. This also leads to predictions that physics or chemistry. The solar system anal-
there will be newer computer viruses that ogy has a number of misalignments to the
are the computational equivalent of retro- structure of the atom, such as electrons be-
viruses, which lack DNA or standard ining repelled rather than attracted by each
structions that will elude the computational other and that electrons do not have individ-
immune system. ual orbits like planets, but have orbit clouds
The process of making an analogy in- of electron density. Furthermore, students
volves a number of key steps – retrieval of have serious misconceptions about the na-
a source from memory, aligning the features ture of the solar system, which can com-
of the source with those of the target, map- pound their misunderstanding of the nature
ping features of the source onto those of of the atom (Fischler & Lichtfield, 1 992). Al-
the target, and possibly making of new infer- though analogy is a powerful tool in science,
ences about the target. Scientific discoveries as is the case with all forms of induction, in-
are made when the source highlights a correct conclusions can be reached.
Isaac Newton’s Fluxions (1 73 6). It displays

Conceptual Change in the the ancient Greeks looking on in amaze-
Scientific Mind ment at an English hunter who shoots at a
bird using Newton’s new method of fluxions.
Many researchers have noted that an im- Clearly they had not undergone the concep-
portant component of science is the gen- tual change needed to understand Newto-
eration of new concepts and modification nian physics.
of existing ones. Scientific concepts, like all One area in which students show great
concepts, can be characterized as contain- difficulty in understanding scientific con-
ing representations of words, thoughts, accepts is in physics. Analyses of students
tions, objects, and processes. How does one’s changing conceptions, using interviews, ver-
knowledge of scientific concepts change over bal protocols, and behavioral outcome mea-
time? The large-scale changes that occur in sures indicate that large-scale changes in
conceptual structures have been labeled con- students’ concepts occur in physics educa-
ceptual change (see Chi & Ohlsson, Chap. tion (see McDermott and Redish 1 999 for
1 6; Nersessian, 2002; Thagard, 1 992). The- a review of this literature). Following Kuhn
ories of conceptual change focus on two (1 962), researchers have noted that students’
main types of shifts. One is the addition changing conceptions are similar to the se-
of knowledge to a pre-existing conceptual quences of conceptual changes in physics
structure. Here, there is no conflict between that have occurred in the history of science.
the pre-existing conceptual knowledge and These notions of radical paradigm shifts and
the new information the student is acquir- ensuing incompatibility with past knowl-
ing. Such minor conceptual shifts are rela- edge states have drawn interesting parallels
tively easy to acquire and do not demand between the development of particular sci-
restructuring of the underlying representa- entific concepts in children and in the history
tions of scientific knowledge. The second of physics.
type of conceptual shift is what is known as Investigations of naı̈ve people’s under-
“radical conceptual change” (see Keil, 1 999, standing of motion indicate that students
and Nersessian, 1 998, for reviews of this lit- have extensive misunderstandings of mo-
erature). In this type of situation, it is nec- tion. This naı̈ve physics research indicates
essary for a new conceptual system to be that many people hold erroneous be-
acquired that organizes knowledge in new liefs about motion similar to a medieval
ways, adds new knowledge, and results in “Impetus” theory (McCloskey, Caramazza,
a very different conceptual structure. This & Green, 1 980). Furthermore, students ap-
radical conceptual change is thought to be pear to maintain “Impetus” notions even af-
necessary for acquiring many new concepts ter one or two courses in physics. In fact,
in physics and is regarded as the major source some authors have noted that students who
of difficulty for students. The factors at the have taken one or two courses in physics
root of this conceptual shift view have been may perform worse on physics problems
difficult to determine, although a number of than naı̈ve students (Mestre, 1 991 ). It is
studies in human development (Carey, 1 985 ; only after extensive learning that we see
Chi, 1 992; Chi & Roscoe 2002), in the his- a conceptual shift from “Impetus” theo-
tory of science (Nersessian, 1 998; Thagard, ries of motion to Newtonian scientific the-
1 992), and in physics education (Clement, ories. How one’s conceptual representation
1 982; Mestre, 1 991 ) give detailed accounts shifts from “naı̈ve” to Newtonian is a mat-
of the changes in knowledge representation ter of contention because some have argued
that occur when people switch from one that the shift involves a radical conceptual
way of representing scientific knowledge to change, whereas others have argued that the
another. A beautiful example of concep- conceptual change is not really complete.
tual change is shown in Figure 29.2. This il- Kozhevnikov and Hegarty (2001 ) argue that
lustration is taken from the first edition of much of the naı̈ve “Impetus” notions of
motion are maintained at the expense of scientific mind. In particular, we have shown
Newtonian principles even with extensive how the scientific mind possesses many cog-
training in physics. They argue that such nitive tools that are applied differently de-
“Impetus” principles are maintained at an pending on the task at hand. Research in
implicit level. Thus, although students can thinking and reasoning has recently been ex-
give the correct Newtonian answer to prob- tended to include a systematic analysis of the
lems, their reaction times to respond indicate brain areas associated with scientific reason-
they are also using impetus theories. ing using techniques such as functional mag-
Although conceptual changes are thought netic resonance imaging (fMRI), positron
to be large-scale changes in concepts that emission topography, and event related po-
occur over extensive periods of time, it has tentials. There are two main reasons for
been possible to observe conceptual change taking this approach. First, these approaches
using InVivo methodologies. Dunbar (1 995 ) allow the researcher to look at the en-
reported a major conceptual shift that oc- tire human brain, making it possible to see
curred in immunologists, in which they ob- the many different sites involved in sci-
tained a series of unexpected findings that entific thinking and to gain a more com-
forced the scientists to propose a new con- plete understanding of the entire range of
cept in immunology that, in turn, forced the mechanisms involved in scientific think-
change in other concepts. The drive behind ing. Second, these brain-imaging approaches
this conceptual change was the discovery of allow researchers to address fundamental
a series of different unexpected findings or questions in research on scientific thinking.
anomalies that required the scientists to re- One important question concerns the extent
vise and reorganize their conceptual knowl- to which ordinary thinking in nonscientific
edge. Interestingly, this conceptual change contexts and scientific thinking recruit sim-
was achieved by a group of scientists reason- ilar versus disparate neural structures of the
ing collaboratively, rather than by one scien- brain. Dunbar (2002) proposed that scien-
tist working alone. Different scientists tend tific thinking uses the same cognitive mech-
to work on different aspects of concepts, and anisms all human beings possess, rather than
also different concepts, that, when put to- being an entirely different type of thinking.
gether, lead to a rapid change in entire con- He has proposed that in scientific thinking,
ceptual structures. standard cognitive processes are used, but
Overall, accounts of conceptual change are combined in ways that are specific to a
in individuals indicate it is, indeed, similar particular aspect of science or a specific dis-
to that of conceptual change in entire scien- cipline of science. By comparing the results
tific fields. Individuals need to be confronted of brain imaging investigations of scientific
with anomalies that their pre-existing theo- thinking with brain imaging studies of non-
ries cannot explain before entire conceptual scientific thinking, we can see both whether
structures are overthrown. However, re- and when common versus dissociated brain
placement conceptual structures have to be sites are invoked during different cognitive
generated before the old conceptual struc- tasks. This approach will make it possible to
ture can be discarded. Often, people do articulate more clearly what scientific think-
not overthrow their naı̈ve conceptual the- ing is and how it is both similar to and differ-
ories and have misconceptions in many fun- ent from the nonscientific thinking typically
damental scientific concepts that are main- examined in the cognitive laboratory (also
tained across the lifespan. see Goel, Chap. 20).
Considering the large arsenal of cognitive
tools researchers have at their disposal, de-
The Scientific Brain termining the neurological underpinning of
scientific thinking becomes mainly a mat-
In this chapter, we have provided an ter of dissecting the processes thought to
overview of research into the workings of the be involved in the reasoning process and
Figure 2 9.2 . Conceptual change in science: The ancient Greeks look on in

amazement as a hunter uses Newtonian principles to shoot down a bird. This
figure is taken from the frontispiece of his Method of Fluxions and Infinite Series;
with its Application to the Geometry of Curve Lines. Frontispiece in Bodelian
Library.
conducting systematic experiments on these undoubtedly possess common and distinct

subprocesses. What might these subpro- neural signatures. A number of cognitive
cesses be? As the previous sections of neuroscientists recently examined problem
this chapter show, scientific thinking in- solving (Fincham et al., 2002; Goel &
volves many cognitive capabilities including, Grafman, 1 995 ; Colvin, Dunbar, & Graf-
but not limited to, analogical reason- man, 2001 ), analogical reasoning (Wharton
ing, casual reasoning, induction, deduction, et al., 2000; Kroger et al., 2002), hypothe-
and problem solving: These subprocesses sis testing (Fugelsang & Dunbar, in press),
inductive reasoning (Goel & Dolan, 2000; theories in graphical models. More often
Seger et al., 2000), and deductive reason- than not, scientific theories are represented
ing (Parsons & Osherson, 2001 ; Osherson in both modalities to some degree.
et al., 1 998). They all pointed to the role of Based on what we know about hemi-
the dorsolateral prefrontal/parietal network spheric differences in the brain, there are
for tasks requiring these higher level cogni- several clear predictions about how spatial
tive capacities. It is important to note that and verbal thinking styles would be repre-
this brain network has been implicated in sented in the brain. If scientific thinking were
tasks that are highly demanding on attention predominantly based on verbal or linguistic
and working-memory. representations, for example, we would ex-
One question cognitive neuroscience in- pect activations of the basic language neu-
vestigations of scientific thinking are be- ral structures such as the frontal and inferior
ginning to address is the neurological temporal regions in the left hemisphere. If
underpinnings of conceptual change. Using scientific thinking were predominately based
fMRI to investigate students who have and on visual-spatial representations, one would
who have not undergone conceptual change expect activation of the basic perception
in scientific areas, it is possible to uncover the and motor control neural structures such
neural changes that accompany conceptual as those found in the parietal and occipital
change. Fugelsang and Dunbar (submitted) lobes, particularly in the right hemisphere.
have found shifts from ventral pathways to To date, findings from research on this issue
dorsal pathways in the brain when students have been quite mixed. Goel and colleagues
shift from naı̈ve impetus theories of motion (e.g., Goel et al., 1 998; Goel Chap. 20) have
to Newtonian theories of motion. These cog- found significant activations for deductive
nitive neuroscience investigations reveal the reasoning to occur predominantly in the left
ways that knowledge is organized in the sci- hemisphere. Parsons and Osherson (2001 )
entific brain and provide detailed accounts of using a similar, but different, task of deduc-
the nature of the representation of scientific tive reasoning, found that such tasks recrui-
knowledge. ted resources predominantly from the right
The extent to which these processes are hemisphere.
lateralized in the right or left hemisphere Much research has been conducted to de-
is a matter of recent debate, especially as termine the cause of these different results
it pertains to inductive and deductive rea- and Goel (Chap. 20) provides a detailed ac-
soning. Hemispheric differences in scientific count of recent research on the brain and
deductive thinking potentially can be quite deductive reasoning. One result regarding
revealing about the nature of the represen- hemispheric differences important for stud-
tations of the scientific mind. For exam- ies of scientific thinking is that of Roser et
ple, recent cognitive neuroscience research al., (in press). They conducted experimen-
can provide important new insights into tal examinations of hemispheric differences
one of the most fundamental questions that in scientific causal thinking in a split-brain
has perplexed many scientists for decades – patient. They found that the patient’s right
namely, whether complex scientific think- hemisphere was uniquely able to detect
ing processes, such as deductive and induc- causality in perceptually salient events (i.e.,
tive reasoning, are represented in terms of colliding balls), whereas his left hemisphere
linguistic or visual–spatial representations. was uniquely able to infer causality based
Anecdotal claims are equivocal as to the na- on a more complex, not directly perceivable,
ture of such representations. When think- chain of events. These data add to our grow-
ing about scientific concepts and devising ing understanding of how the brain contains
theoretical explanations for phenomena, for specialized neural structures that contribute
example, scientists may verbally represent to the interpretation of data obtained from
their theories in text or visually represent the environment. The obvious experiments
that need to be done would involve allowing specialize in the development of programs
scientists to think and reason naturally about devised to search large databases in the
their own theories versus theories from dif- hope of making new scientific discoveries
ferent domains while being imaged. This (Langley, 2000, 2002). This process is com-
would allow one to decompose the effects monly known as “data mining.” Not until rel-
of scientific thinking and familiarity. Clearly, atively recently has this technique proven vi-
research on the scientific brain is about able because of recent advances in computer
to begin. technology. An even more recent develop-
ment in the area of data mining is the use
of distributed computer networks that take
advantage of thousands, or even millions, of
Computational Approaches to computers worldwide to jointly mine data
Scientific Thinking in the hope of making significant scientific
discoveries. This approach has shown much
Along with recent brain imaging studies, promise because of its relative cost effec-
computational approaches have provided a tiveness. The most powerful supercomput-
more complete account of the scientific ers currently cost over 1 00 million dollars,
mind. Computational models provide spe- whereas a distributed network server may
cific detailed accounts of the cognitive pro- cost only tens of thousands of dollars for
cesses underlying scientific thinking. Early roughly the same computational power.
computational work consisted of taking a Another recent shift in the use of com-
scientific discovery and building computa- puters in scientific discovery is to have com-
tional models of the reasoning processes puters and people make discoveries together,
involved in the discovery. Langley et al. rather than expecting computers to make
(1 987) built a series of programs that sim- an entire scientific discovery. Now, instead
ulated discoveries such as those of Coper- of using computers to mimic the entire sci-
nicus and Stahl. These programs have vari- entific discovery process used by humans,
ous inductive reasoning algorithms built into computers can use powerful algorithms that
them and, when given the data the scientists search for patterns on large databases and
used, were able to propose the same rules. provide the patterns to humans who can
Computational models make it possible to then use the output of these computers to
propose detailed models of the cognitive make discoveries from the human genome
subcomponents of scientific thinking that to the structure of the universe.
specify exactly how scientific theories are
generated, tested, and amended (see Darden
1 997; Shrager & Langley, 1 990, for accounts
of this branch of research). More recently, Scientific Thinking and Science
the incorporation of scientific knowledge Education
into the computer programs resulted in a
shift in emphasis from using programs to Science education has undergone many
simulate discoveries to building programs changes over the past hundred years that
that help scientists make discoveries. A num- mirrored wider changes in both education
ber of these computer programs have made and society. In the early 1 900s, science edu-
novel discoveries. For example, Valdes-Perez cation was seen as a form of nature study –
(1 994) built systems for discoveries in chem- particularly in the kindergarten through
istry, and Fajtlowicz has done this in mathe- eight grades. Each decade has seen a re-
matics (Erdos, Fajtlowicz, & Staton, 1 991 ). port on the need to improve science edu-
These advances in the fields of computer cation. Starting in the 1 93 0s, proponents of
discovery have led to new fields, confer- the progressive education movement began
ences, journals, and even departments that a movement that continues to this day. They
argued that children should be taught more person thinking alone. These changes in sci-
than just facts and should be taught meth- ence education parallel changes in method-
ods and general principles, as well as ways in ologies used to investigate science, such as
which science relate to the child’s world. In analyzing the ways that scientists think and
1 93 8, a report by the Progressive Education reason in their laboratories.
Association noted that the psychology of the By looking at science as a complex, multi-
learner should be at the core of science edu- layered, and group activity, many researchers
cation, as well as making a link to children’s in science education have adopted a con-
everyday lives. Various reports on science ed- structivist approach. This approach sees
ucation appeared over the ensuing years, but learning as an active rather than a passive
it was the launch of the Sputnik satellite in process and proposes that students learn
1 95 7 that transformed science education in through constructing their scientific knowl-
the United States. Seeing the Soviets launch edge. The goal of constructivist science edu-
a rocket before the United States galvanized cation often is to produce conceptual change
the nation into training better scientists and through guided instruction in which the
identifying the brightest students. The net teacher or professor acts as a guide to dis-
result for science education was that text- covery rather than the keeper of all the facts.
books were updated, a factually based cur- One recent and influential approach to sci-
riculum was maintained, and the notion of ence education is the inquiry-based learning
science as a voyage of discovery entered the approach. Inquiry-based learning focuses on
popular imagination. By the 1 980s, however, posing a problem or a puzzling event to stu-
many cultural changes had occurred, and sci- dents and asking them to propose a hypoth-
ence students in the United States appeared esis that can be used to explain the event.
to be falling behind those in other countries. Next, students are asked to collect data that
Numerous reports by science teachers and test the hypotheses, reach conclusions, and
scientists recommended widespread changes then reflect upon both the original problem
in the ways that science is taught. Most im- and the thought processes they used to solve
portant in these changes was the move to a the problem. Students often use computers
constructivist view of education. According that aid in their construction of new knowl-
to this view, students construct their knowledge. The computers allow students to learn
edge rather than being the passive recipients many of the different components of scien-
of scientific knowledge (see also Ritchhart & tific thinking. For example, Reiser and his
Perkins, Chap. 3 2, on teaching thinking). colleagues have developed a learning envi-
Beginning in the 1 980s, a number of re- ronment for biology in which students are
ports, often constructivist, stressed the need encouraged to develop hypotheses in groups,
for teaching scientific thinking skills and not codify the hypotheses, and search databases
just methods and content. The addition of to test them (Reiser et al., 2001 ).
scientific thinking skills to the science cur- One of the myths of science is the lone
riculum from kindergarten through adult- scientist toiling under a naked lightbulb,
hood was a major shift in focus. Many of suddenly shouting “Eureka, I have made a
the particular scientific thinking skills em- discovery!” Instead, InVivo studies of scien-
phasized were covered in previous sections tists (e.g., Dunbar, 1 995 , 2002), historical
of this chapter, such as deductive and induc- analyses of scientific discoveries (Nersessian,
tive thinking strategies. Rather than focusing 1 999), and InVivo studies of children learn-
on one particular skill, such as induction, re- ing science at museums all point to collab-
searchers in education have focused on how orative scientific discovery mechanisms as
the different components of scientific think- being one of the driving forces of science
ing are put together in science. Furthermore, (Crowley et al., 2001 ). What happens during
science educators have focused on situations collaborative scientific thinking is that there
in which science is conducted collabora- is usually a triggering event, such as an unex-
tively, rather than being the product of one pected result or situation that a student does
not understand. This results in other mem- thermore, they found that discovery learn-
bers of the group adding new information ing children did not have richer or deeper
to the person’s representation of knowledge, knowledge than direct instruction children.
often adding new inductions and deduc- This type of finding suggests that pure dis-
tions that both challenge and transform covery learning, although intuitively appeal-
the reasoner’s old representations of knowl- ing, benefits only a few children and that
edge (Dunbar, 1 998). This means that social guided discovery coupled with explicit in-
mechanisms play a key component in fos- struction is one of the most effective educa-
tering changes in concepts that have been tional strategies in science.
ignored in traditional cognitive research but
are crucial for both science and science edu-
cation. In science education, there has been a Conclusions and Future Directions
shift to collaborative learning, particularly at
the elementary level, but in university edu- Although much is known regarding certain
cation, the emphasis is still on the individual components of scientific thinking, much re-
scientist. Because many domains of science mains to be discovered. In particular, there
now involve collaborations across scientific has been little contact among cognitive, neu-
disciplines, we expect the explicit teach- roscience, social, personality, and motiva-
ing of collaborative science heuristics to tional accounts of scientific thinking. Clearly,
increase. the relations among these different aspects
What is the best way to teach and of scientific thinking need to be combined
learn science? Surprisingly, the answer to to produce a comprehensive picture of the
this question has been difficult to unscientific mind. One way to achieve this is
cover. Although there has been consider- by using converging multiple methodolo-
able research on the benefits of using a gies as outlined previously, such as natu-
particular way of learning science, few com- ralistic observation, controlled experiments
parative studies of different methods have in the cognitive laboratory, and functional
been conducted. Following Seymour Pa- brain imaging techniques. Theoretical devel-
pert’s book MindStorms, for example, (1 980) opments into the workings of the scientific
many schools moved to discovery learning mind would greatly benefit from more un-
in which children discover aspects of pro- constrained analyses of the neuroanatomical
gramming and mathematics through writ- correlates of the scientific reasoning process.
ing their own computer programs in the We, as scientists, are beginning to get a rea-
LOGO programming language. This discov- sonable grasp of the inner workings of the
ery learning approach, which thousands of subcomponents of the scientific mind (i.e.,
schools have adopted, has been presented as problem solving, analogy, induction) and sci-
an alternative to more didactic approaches entific thought. However, great advances re-
to teaching and learning. By allowing stu- main to be made concerning how these pro-
dents to discover principles on their own cesses interact so scientific discoveries can
and to set their own goals, students are pur- be made. Future research will focus on both
ported to have deeper knowledge that trans- the collaborative aspects of scientific think-
fers more appropriately. Although there is ing and the neural underpinnings of the sci-
much anecdotal evidence on the benefits of entific mind.
discovery learning, only recently has a di-
rect comparison of discovery learning with
more traditional methods been conducted. Acknowledgments
Klahr and Nigam (2004) conducted a study
of third and fourth grade children learning The authors would like to thank the fol-
about experimental design. They found that lowing organizations: Dartmouth College,
many more children learned from direct in- McGill University, The Spencer Foundation,
struction than from discovery learning. Fur- The National Science Foundation, and the
Engineering Research Council of Canada for achievement in the water jug task. Journal of
funding research discussed in this chapter. Cognitive Neuroscience, 1 3 , 1 1 29–1 1 47.
The comments of Keith Holyoak, Vimla Pa- Darden, L. (1 997). Strategies for discovering
tel, and an anonymous reviewer were all mechanisms: Schema instantiation, modular
helpful in making this a better chapter. subassembly, forward chaining/backtracking.
Proceedings of the 1 997 Biennial Meeting of the
Philosophy of Science Association.
Dunbar, K. (1 995 ). How scientists really reason:
References Scientific reasoning in real-world laborato-
ries. In R. J. Sternberg, & J. Davidson (Eds.),
Ahn, W., Kalish, C. W., Medin, D. L., & Gelman, Mechanisms of Insight. Cambridge, MA: MIT
S. A. (1 995 ). The role of covariation versus Press.
mechanism information in causal attribution, Dunbar, K. (1 997). How scientists think: Online
Cognition, 5 4, 299–3 5 2. creativity and conceptual change in science.
Azmitia, M. A. & Crowley, K. (2001 ). The In T. B. Ward, S. M. Smith, & S. Vaid (Eds.),
rhythms of scientific thinking: A study of col- Conceptual Structures and Processes: Emergence,
laboration in an earthquake microworld. In K. Discovery and Change. Washington, DC: APA
Crowley, C. Schunn, & T. Okada (Eds.) De- Press.
signing for Science: Implications from Everyday, Dunbar, K. (1 998). Problem solving. In W.
Classroom, and Professional Settings. Mawah, Bechtel, & G. Graham (Eds.). A Companion to
NJ: Erlbaum. Cognitive Science. London, UK: Blackwell.
Bacon, F. (1 620/1 85 4). Novum Organum. (B. Dunbar, K. (1 999). The scientist InVivo: How sci-
Monatgue, Trans.). Philadelphia: Parry & entists think and reason in the laboratory. In L.
McMillan. Magnani, N. Nersessian, & P. Thagard (Eds.),
Baker, L. M., & Dunbar, K. (2000). Experimental Model-based Reasoning in Scientific Discovery.
design heuristics for scientific discovery: The New York Plenum.
use of baseline and known controls. Interna- Dunbar, K. (2001 ). The analogical paradox: Why
tional Journal of Human Computer Studies, 5 3 , analogy is so easy in naturalistic settings, yet
3 3 5 –3 49. so difficult in the psychology laboratory. In D.
Bruner, J. S., Goodnow, J. J., & Austin, G. A. Gentner, K. J. Holyoak, & B. Kokinov, Analogy:
(1 95 6). A Study of Thinking. New York: Science Perspectives from Cognitive Science. Cambridge,
Editions. MA: MIT Press.
Carey, S. (1 985 ). Conceptual Change in Child- Dunbar, K. & Blanchette, I. (2001 ). The in-
hood. Cambridge, MA: MIT Press. vivo/invitro approach to cognition: the case of
Carruthers, P., Stich, S., & Siegal, M. (2002). analogy. Trends in Cognitive Sciences, 5 , 3 3 4–
The Cognitive Basis of Science. New York: Cam- 3 3 9.
bridge University Press. Dunbar, K. (2002). Science as category: Implica-
Chi, M. (1 992). Conceptual change within and tions of InVivo science for theories of cognitive
across ontological categories: Examples from development, scientific discovery, and the na-
learning and discovery in science. In R. Giere ture of science. In P. Caruthers, S. Stich, & M.
(Ed.), Cognitive Models of Science. Minneapolis: Siegel (Eds.), Cognitive Models of Science. New
University of Minnesota Press. York: Cambridge University Press.
Chi, M. T. H., & Roscoe, R. D. (2002). The pro- Dunbar, K., & Fugelsang, J. (2004). Causal think-
cesses and challenges of conceptual change. In ing in science: How scientists and students in-
M. Limon, & L. Mason (Eds.), Reconsidering terpret the unexpected. To appear in M. E.
Conceptual Change: Issues in Theory and Prac- Gorman, A. Kincannon, D. Gooding, & R. D.
tice (pp 3 –27). The Netherlands: Kluwer Aca- Tweney (Eds.), New Directions in Scientific and
demic Publishers. Technical Thinking. Hillsdale, NJ: Erlbaum.
Clement, J. (1 982). Students’ preconceptions in Dunbar, K. & Sussman, D. (1 995 ). Toward a cog-
introductory mechanics. American Journal of nitive account of frontal lobe function: Simu-
Physics, 5 0, 66–71 . lating frontal lobe deficits in normal subjects.
Colvin, M. K., Dunbar, K., & Grafman, J. (2001 ). Annals of the New York Academy of Sciences,
The effects of frontal lobe lesions on goal 769, 289–3 04.
Erdos, P., Fajtlowicz, S., & Staton, W. (1 991 ). De- Goel, V., & Grafman, J. (1 995 ). Are the frontal
gree sequences in the triangle-free graphs. Dis- lobes implicated in “planning” functions? In-
crete Mathematics, 92 (91 ), 85 –88. terpreting data from the Tower of Hanoi. Neu-
Fincham, J. M., Carter, C. S., van Veen, V., ropsychologia, 3 3 , 623 –642.
Stenger, V. A., & Anderson, J. R. (2002). Neu- Gorman, M. E. (1 989). Error, falsification and
ral mechanisms of planning: A computational scientific inference: An experimental inves-
analysis using event-related fMRI. Proclama- tigation. Quarterly Journal of Experimental
tions of the National Academy of Science, 99, Psychology: Human Experimental Psychology,
3 3 46–3 3 5 1 . 41 A, 3 85 –41 2.
Fischler, H., & Lichtfeldt, M. (1 992). Modern Gorman, M. E., Kincannon, A., Gooding, D., &
physics and students conceptions. International Tweney, R. D. (2004). New Directions in Sci-
Journal of Science Education, 1 4, 1 81 –1 90. entific and Technical Thinking. Lawrence Erl-
Fox-Keller, E. (1 985 ). Reflections on Gender and baum. Hillsdale, NJ.
Science. New Haven, CT: Yale University Press. Heit, E. (2000). Properties of inductive reasoning.
Fugelsang, J., & Dunbar, K. (submitted). How the Psychonomic Bulletin & Review, 7, 5 69–5 92.
brain uses theory to interpret data. Hirschberg, C. (2003 ). My mother, the scientist.
Fugelsang, J., & Dunbar, K. (in preparation). How In R. Dawkins, & T. Folger (Eds.), The Best
the brain learns physics. American Science and Nature Writing 2 003 .
Fugelsang, J., Stein, C., Green, A., & Dunbar, K. New York: Houghton Mifflin.
(2004). Theory and data interactions of the sci- Holyoak, K. J., & Thagard, P. (1 995 ). Mental
entific mind: Evidence from the molecular and Leaps. Cambridge, MA: MIT Press.
the cognitive laboratory. Canadian Journal of Keil, F. C. (1 999). Conceptual change. In R.
Experimental Psychology, 5 8, 1 3 2–1 41 . Wilson., & F. Keil (Eds.) The MIT Encyclope-
Galilei, G. (1 63 8/1 991 ). A. de Salvio, & H. Crew dia of Cognitive Science, Cambridge, MA: MIT
trans. Dialogues Concerning Two New Sciences. press.
Amherst, NY: Prometheus Books. Klahr, D. (2000). Exploring Science: The Cognition
Galison, P. (2003 ). Einstein’s Clocks, Poincaré’s and Development of Discovery Processes. Cam-
Maps: Empires of Time. New York, NY: W. W. bridge, MA: MIT Press.
Norton. Klahr, D., & Dunbar, K. (1 988). Dual space search
Gentner, D., Brem, S., Ferguson, R. W., during scientific reasoning. Cognitive Science,
Markman, A. B., Levidow, B. B., Wolff, P., et 1 2, 1 –48.
al. (1 997). Analogical reasoning and concep- Klahr, D., & Nigam, M. (2004). The equivalence
tual change: A case study of Johannes Kepler. of learning paths in early science instruction:
The Journal of the Learning Sciences, 6(1 ), 3 – Effects of direct instruction and discovery
40. learning Psychological Science, 1 5 , 661 –667.
Gentner, D., Holyoak, K. J., & Kokinov, B. (2001 ). Klahr, D., & Simon, H. (1 999). Studies of sci-
The Analogical Mind: Perspectives from Cognitive entific discovery: Complementary approaches
Science. Cambridge, MA: MIT Press. and convergent findings. Psychological Bulletin,
Gentner, D., & Jeziorski, M. (1 993 ). The shift 1 2 5 , 5 24–5 43 .
from metaphor to analogy in western science. Klayman, J., & Ha, Y. (1 987). Confirmation, dis-
In A. Ortony (Ed.), Metaphor and Thought (2nd confirmation, and information in hypothesis
ed., pp. 447–480). Cambridge, England: Cam- testing. Psychological Review, 94, 21 1 –228.
bridge University Press. Kozhevnikov, M., & Hegarty, M. (2001 ). Impe-
Giere, R. (1 993 ). Cognitive Models of Science. Min- tus beliefs as default heuristic: Dissociation be-
neapolis, MN: University of Minnesota Press. tween explicit and implicit knowledge about
Goel, V., & Dolan, R. J. (2000). Anatomical seg- motion. Psychonomic Bulletin and Review, 8,
regation of component processes in an induc- 43 9–45 3 .
tive inference task. Journal of Cognitive Neuro- Kroger, J. K., Sabb, F. W., Fales, C. L.,
science, 1 2 , 1 1 0–1 1 9. Bookheimer, S. Y., Cohen, M. S., & Holyoak, K.
Goel, V., Gold, B., Kapur, S., & Houle, S. (1 998). J. (2002). Recruitment of anterior dorsolateral
Neuroanatomical correlates of human reason- prefrontal cortex in human reasoning: A para-
ing. Journal of Cognitive Neuroscience, 1 0, 293 – metric study of relational complexity. Cerebral
3 02. Cortex, 1 2 , 477–485 .
Kuhn, T. (1 962). The Structure of Scientific Revolu- Papert, S. (1 980). Mindstorms: Children Com-
tions. Chicago: University of Chicago Press. puters and Powerful ideas. New York: Basic
Kulkarni, D., & Simon, H. A. (1 988). The pro- Books.
cesses of scientific discovery: The strategy of Parsons, L. M., & Osherson, D. (2001 ). New ev-
experimentation. Cognitive Science, 1 2 , 1 3 9– idence for distinct right and left brain systems
1 76. for deductive versus probabilistic reasoning.
Langley, P. (2000). Computational support of Cerebral Cortex, 1 1 , 95 4–965 .
scientific discovery. International Journal of Penner, D. E., & Klahr, D. (1 996). When to trust
Human-Computer Studies, 5 3 , 3 93 –41 0. the data: Further investigations of system er-
Langley, P. (2002). Lessons for the computational ror in a scientific reasoning task. Memory and
discovery of scientific knowledge. In the Pro- Cognition, 24(5 ), 65 5 –668.
ceedings of the First International Workshop on Popper, K. R. (1 95 9). The Logic of Scientific Dis-
Data Mining Lessons Learned. covery. London, UK: Hutchinson.
Langley, P., Simon, H. A., Bradshaw, G. L., & Qin, Y., & Simon, H. A. (1 990). Laboratory repli-
Zytkow, J. M. (1 987). Scientific Discovery: Com- cation of scientific discovery processes. Cogni-
putational Explorations of the Creative Processes. tive Science, 1 4, 281 –3 1 2.
Cambridge, MA: MIT Press. Reiser, B. J., Tabak, I., Sandoval, W. A., Smith, B.,
McCloskey, M., Caramazza, A., & Green, B. Steinmuller, F., & Leone, T. J., (2001 ). BGuILE:
(1 980). Curvilinear motion in the absence of Stategic and conceptual scaffolds for scientific
external forces: Naive beliefs about the motion inquiry in biology classrooms. In S. M. Carver,
of objects. Science, 2 1 0, 1 1 3 9–1 1 41 . & D. Klahr (Eds.). Cognition and Instruction:
McDermott, L. C., & Redish, L. (1 999). Research Twenty Five Years of Progress. Mahwah, NJ:
letter on physics education research. American Erlbaum.
Journal of Psychics, 67, 75 5 . Roser, M., Fugelsang, J., Dunbar, K., Corballis, P.,
Mestre, J. P. (1 991 ). Learning and instruction in & Gazzaniga, M. (Submitted). Causality per-
pre-college physical science. Physics Today, 44, ception and inference in the split-brain.
5 6–62. Schunn, C. D., & Klahr, D. (1 995 ). A 4-space
Mitroff, I. (1 974). The Subjective Side of Science. model of scientific discovery. In the Proceedings
Amsterdam: Elsevier. of the 1 7th Annual Conference of the Cognitive
Nersessian, N. (1 998). Conceptual change. In W. Science Society. Pittsburgh, PA.
Bechtel, & G. Graham (Eds.). A Companion to Schunn, C. D., & Klahr, D. (1 996). The prob-
Cognitive Science. London, UK: Blackwell. lem of problem spaces: When and how to go
Nersessian, N. (1 999). Models, mental mod- beyond a 2-space model of scientific discov-
els, and representations: Model-based reason- ery. Part of symposium on building a theory of
ing in conceptual change. In L. Magnani, N. problem solving and scientific discovery: How
Nersessian, & P. Thagard (Eds.). Model-based big is N in N-space search? In the Proceedings
Reasoning in Scientific Discovery. New York: of the 1 8th Annual Conference of the Cognitive
Plenum. Science Society. San Diego, CA.
Nersessian, N. J. (2002). The cognitive basis Seger, C., Poldrack, R., Prabhakaran, V., Zhao,
of model-based reasoning in science In P. M., Glover, G., & Gabrieli, J. (2000). Hemi-
Carruthers, S. Stich, & M. Siegal (Eds). The spheric asymmetries and individual differences
Cognitive Basis of Science. New York: Cam- in visual concept learning as measured by func-
bridge University Press. tional MRI. Neuropsychologia, 3 8, 1 3 1 6–1 3 24.
Newton, I. (1 73 6). Method of Fluxions and Infinite Shrager, J., & Langley, P. (1 990). Computational
Series; with its Application to the Geometry of Models of Scientific Discovery and Theory For-
Curve Lines. Frontispiece in Bodleian Library. mation. San Mateo, CA: Morgan Kaufmann.
Newell, A., & Simon, H. A. (1 972). Human Prob- Simon, H. A. (1 977). Models of Discovery. Dor-
lem Solving. Oxford, UK: Prentice-Hall. drecht, The Netherlands: D. Reidel.
Osherson, D., Perani, D., Cappa, S., Schnur, T., Simon, H. A., & Lea, G. (1 974). Problem solving
Grassi, F., & Fazio, F. (1 998). Distinct brain and rule induction. In H. Simon (Ed.), Models
loci in deductive versus probabilistic reasoning. of Thought (pp. 3 29 – 3 46). New Haven, CT:
Neuropsychologia, 3 6, 3 69–3 76. Yale University Press.
Smith, E. E., Shafir, E., & Osherson, D. (1 993 ). Tweney, R. D., Doherty, M. E., & Mynatt,
Similarity, plausibility, and judgments of prob- C. R. (1 981 ). On Scientific Thinking. New York:
ability. Cognition. Special Issue: Reasoning and Columbia University Press.
Decision Making, 49, 67–96. Valdes-Perez, R. E. (1 994). Conjecturing hidden
Thagard, P. (1 992). Conceptual Revolutions. Cam- entities via simplicity and conservation laws:
bridge, MA: MIT Press. Machine discovery in chemistry. Artificial In-
Thagard, P. (1 999). How Scientists Explain telligence, 65 (2), 247–280.
Disease. Princeton: Princeton University Wason, P. C. (1 968). Reasoning about a rule.
Press. Quarterly Journal of Experimental Psychology,
Thagard, P., & Croft, D. (1 999). Scientific 20, 273 –281 .
discovery and technological innovation: Ul- Wertheimer, M. (1 945 ). Productive Thinking.
cers, dinosaur extinction, and the program- New York: Harper.
ming language Java. In L. Magnani, N. Ners- Wharton, C., & Grafman, J. (1 998). Reasoning
essian, & P. Thagard (Eds.). Model-based and the human brain. Trends in Cognitive Sci-
Reasoning in Scientific Discovery. New York: ence. 2 , 5 4–5 9.
Plenum. Wharton, C. M., Grafman, J., Flitman, S. S.,
Tweney, R. D. (1 989). A framework for the cog- Hansen, E. K., Brauner, J., Marks, A., et
nitive psychology of science. In B. Gholson, A. al. (2000). Toward neuroanatomical models
Houts, R. A. Neimeyer, & W. Shadish (Eds.), of analogy: A positron emission tomography
Psychology of Science and Metascience. Cam- study of analogical mapping. Cognitive Psychol-
bridge, MA: Cambridge University Press. ogy, 40, 1 73 –1 97.
CHAPTER 3 0
Thinking and Reasoning in Medicine
Vimla L. Patel
José F. Arocha
Jiajie Zhang
What Is Medical Reasoning? basic to all higher-level cognitive processes

in medicine such as problem solving and
Medical reasoning describes a form of qual- medical text comprehension. On the other
itative inquiry that examines the cogni- hand, the structure of medical reasoning it-
tive (thought) processes involved in making self is the subject of considerable scrutiny.
medical decisions. Clinical reasoning, med- For example, the directionality of reason-
ical problem solving, diagnostic reasoning, ing in medicine has been an issue of con-
and decision making are all terms used in siderable controversy in medical cognition,
a growing body of literature that examines medical education, and artificial intelligence
how clinicians make clinical decisions. Med- (AI) in medicine. Conventionally, we can
ical cognition refers to studies of cognitive partition medical reasoning into clinical and
processes, such as perception, comprehen- biomedical or basic science reasoning. These
sion, decision making (see LeBoeuf & Shafir, are some of the central themes that consti-
Chap. 1 1 ), and problem solving (see Novick tute this chapter.
& Bassok, Chap. 1 4) in medical practice itself
or in tasks representative of medical prac-
tice. These studies use subjects who work in
medicine, including medical students, physi- Early Research on Medical Problem
cians, and biomedical scientists. The study Solving and Reasoning
of medical reasoning has been the focus of
much research in cognitive science and artifi- Medical cognition is a subfield of cogni-
cial intelligence in medicine. Medical reason- tive science devoted to the study of cogni-
ing involves an inferential process for making tive processes in medical tasks. Studies of
diagnostic or therapeutic decisions or un- medical cognition include analyses of perfor-
derstanding the pathology of a disease pro- mance in “real world” clinical tasks as well
cess. On the one hand, medical reasoning is as in experimental tasks. Understanding the
72 7
thought processes involved in clinical rea- processing psychology accelerated dramati-

soning in order to promote more effective cally. Problem solving was conceived of as a
practices has been the subject of concern for search in a problem space in which a prob-
nearly a century (Osler, 1 906). lem solver was viewed as selecting an option
Human information processing research (e.g., a hypothesis or an inference) or per-
typically has focused on the individual. The forming an operation (from a set of possible
dual focus on in-depth task analysis and on operations) in moving toward a solution or a
the study of human performance is a central goal state (e.g., diagnosis or treatment plan).
feature of a cognitive science approach. (See Novick & Bassok, Chap. 1 4, for a discus-
There have been two primary approaches sion of problem solving.) This conceptual-
to research investigating clinical reasoning in ization had an enormous impact in both cog-
medicine – the decision – analytic approach nitive psychology and artificial intelligence
and the information-processing or problem- research. It also led to rapid advances
solving approach. Decision analysis uses a in medical reasoning and problem-solving
formal quantitative model of inference and research, as exemplified by the seminal
decision making as the standard of compari- work of Elstein, Shulman, & Sprafka (1 978).
son (Dowie & Elstein, 1 988). It compares the They were the first to use experimen-
performance of a physician with the mathe- tal methods and theories of cognitive sci-
matical model by focusing on reasoning “fal- ence to investigate clinical competency.
lacies” and biases inherent in human clin- Their extensive empirical research led to
ical decision making (Leaper et al., 1 972). the development of an elaborated model
In contrast, the information-processing ap- of hypothetico-deductive reasoning, which
proach focuses on the description of cog- proposed that physicians reason by first gen-
nitive processes in reasoning tasks and the erating and then testing a set of hypothe-
development of cognitive models of perfor- ses to account for clinical data (i.e., reason-
mance, typically relying on protocol analysis ing from hypothesis to data). This model
(Ericsson and Simon, 1 993 ) and other obser- of problem solving had a substantial influ-
vational techniques. ence on studies of both medical cognition
Systematic investigations of medical ex- and medical education.
pertise began more than forty years ago with In the late 1 970s and early 1 980s, ad-
the research by Ledley and Lusted (1 95 9) vances into the nature of human expertise
on clinical inquiries. They proposed a two- were paralleled by developments in medi-
stage model of clinical reasoning involving cal AI – particularly expert systems tech-
a hypothesis-generation stage followed by nology. Artificial intelligence in medicine
a hypothesis-evaluation stage in which the and medical cognition mutually influenced
latter stage was amenable to formal deci- each other in a number of ways, including
sion analytic techniques. Probably the ear- (1 ) providing a basis for developing formal
liest empirical studies of medical reason- models of competence in problem-solving
ing can be traced to the work of Rimoldi tasks, (2) elucidating the structure of medi-
(1 961 ), who conducted experimental studies cal knowledge and providing important epis-
of diagnostic reasoning contrasting students temological distinctions, and (3 ) character-
with medical experts in simulated problem- izing productive and less-productive lines
solving tasks. The results emphasized the of reasoning in diagnostic and therapeutic
greater ability of expert physicians to se- tasks. Gorry (1 973 ) conducted a series of
lectively attend to relevant information and studies comparing a computational model
to narrow the set of diagnostic possibilities of medical problem solving with the actual
(i.e., consider fewer hypotheses). As cogni- problem-solving behavior of physicians. This
tive science came into prominence in the analysis provided a basis for characterizing
early 1 970s spearheaded by the immensely a sequential process of medical decision-
influential work of Newell and Simon (1 972) making – one that differs in important re-
on problem solving, research in information- spects from early diagnostic computational
thinking and reasoning in medicine 72 9
systems based on Bayes’ theorem. Pauker studied the knowledge-based solution strate-
and colleagues (1 976) capitalized on some of gies of expert cardiologists as evidenced by
the insights of Gorry’s earlier work and de- their pathophysiological explanations of a
veloped the Present Illness Program, a pro- complex clinical problem. The results indi-
gram designed to take the history of a patient cated that subjects who accurately diagnosed
with edema. Several of the questions guiding the problem employed a forward-oriented
this research, including the nature and orga- reasoning strategy – using patient data to
nization of expert knowledge, were of cen- lead toward a complete diagnosis (i.e., rea-
tral concern to both developers of medical soning from data to hypothesis). In contrast,
expert systems and researchers in medical subjects who misdiagnosed or partially diag-
cognition. The development and refinement nosed the patient problem used a backward
of the program was partially based on studies reasoning strategy. These research findings
of clinical problem solving. presented a challenge to the hypothetico-
Medical expert consultation systems such deductive model of reasoning as espoused by
as Internist (Miller, Pople, & Myers, 1 984) Elstein, Shulman, & Sprafka (1 978), which
and MYCIN (Shortliffe, 1 976) introduced did not differentiate expert from nonexpert
the ideas about knowledge-based reason- reasoning strategies.
ing strategies across a range of cognitive Much of the early research in the study
tasks. MYCIN, in particular, had a sub- of reasoning in domains such as medicine
stantial influence on cognitive science. It was carried out in laboratory or experimen-
contributed several advances (e.g., repre- tal settings. In more recent times, a shift oc-
senting reasoning under uncertainty) in the curred toward examining cognitive issues in
use of production systems as a represen- naturalistic medical settings, such as med-
tation scheme in a complex knowledge- ical teams in intensive care units (Patel,
based domain. MYCIN also highlighted the Kaufman, & Magder, 1 996), anesthesiolo-
difference between medical problem solv- gists working in surgery (Gaba, 1 992), nurses
ing and the cognitive dimensions of med- providing emergency telephone triage (Lep-
ical explanation. Clancey’s work (Clancey rohon & Patel, 1 995 ), and reasoning with
& Lefsinger, 1 984, 1 985 ) in GUIDON and technology by patients in the health care
NEOMYCIN was particularly influential in system (Patel et al., 2002). This research
the evolution of models of medical cog- was informed by work in the area of dy-
nition. Clancey endeavored to reconfigure namic decision making (Salas & Klein, 2001 ),
MYCIN to employ the system to teach med- complex problem solving (Frensch & Funke,
ical students about meningitis and related 1 995 ), human factors (Hoffman & Deffen-
disorders. NEOMYCIN was based on a more bacher, 1 993 ; Vicente & Rasmussen, 1 990),
psychologically plausible model of medical and cognitive engineering (Rasmussen,
diagnosis. This model differentiated data- Pejtersen, & Goodstein, 1 994). Such studies,
directed and hypothesis-directed reasoning conducted in the workplace, reshaped our
and separated control knowledge from the views of human thinking by shifting the onus
facts upon which it operates. of cognition from being the unique province
Feltovich and colleagues (Feltovich et al., of the individual to being distributed across
1 984), drawing on models of knowledge social and technological contexts.
representation from medical AI, character-
ized fine-grained differences in knowledge
organization between subjects with differ-
ent levels of expertise in the domain of Models of Medical Reasoning
pediatric cardiology. These differences ac-
counted for subjects’ inferences about di- The traditional view of medical reasoning
agnostic cues and evaluation of competing has been to treat diagnosis as similar to
hypotheses. Patel and Groen (1 986), incor- the scientist’s task of making a discovery or
porating distinctions introduced by Clancey, engaging in scientific experimentation (see
Dunbar & Fugelsang, Chap. 29). Coherent of the domain knowledge. These accounts
with this view of science is the assump- simply make the assumption that some do-
tion that diagnostic inference follows a main of knowledge exists and that all of the
hypothetico-deductive process of reaching hypotheses needed to explain a problem are
conclusions by testing hypothesis based on available when the diagnostic process begins.
clinical evidence. From a cognitive perspec- Within this generic framework, various
tive, as we saw previously, this view of the models of diagnostic reasoning may be
diagnostic process in medicine was first pro- constructed. Following Patel and Ramoni
posed in the influential work of Elstein, (1 997), we could distinguish between two
Shulman, and Sprafka (1 978). The view of major models of diagnostic reasoning: heuris-
medical reasoning as hypothetico-deductive tic classification (Clancey, 1 985 ) and cover
has been challenged from various points, em- and differentiate (Eshelman, 1 988). How-
pirical research, and philosophical discourse, ever, these models can be seen as special
as we will see in the following section. cases of a more general model: the select
and test model, in which the processes of
hypothesis generation and testing can be
characterized in terms of four types of in-
Toward a Model of Reasoning in ferences (Peirce, 1 95 5 ) – abstraction, abduc-
Medicine: Induction, Deduction, tion, deduction, and induction. The first two
and Abduction inference types drive hypothesis generation
whereas the latter two types drive hypothe-
It generally is agreed that there are two basic sis testing. During abstraction, data are fil-
forms of reasoning. One is deductive reason- tered according to their relevance for the
ing (see Evans, Chap. 8), which consists of problem solution and chunked in schemas
deriving a particular valid conclusion from a representing an abstract description of the
set of general premises, and the other is in- problem at hand (e.g., abstracting that an
ductive reasoning (see Sloman & Lagnado, adult male with hemoglobin concentration
Chap. 5 ), which consists of deriving a likely less than 1 4 d/gl is an anemic patient). Fol-
general conclusion from a set of particular lowing this, hypotheses that could account
statements. However, reasoning in the “real for the current situation are related through
world” does not appear to fit neatly into a process of abduction characterized by a
these basic types. For this reason, a third “backward flow” of inferences across a chain
form of reasoning has been recognized in of directed relations that identify initial con-
which deduction and induction are com- ditions from which the current abstract rep-
bined. This was termed abductive reasoning resentation of the problem originates. This
by Peirce (1 95 5 ). provides tentative solutions to the problem
Basically, all theories of medical reasoning at hand by way of hypotheses. For example,
characterize diagnosis as an abductive, cycli- knowing that disease A will cause symptom
cal process of generating possible explana- b, abduction will try to identify the explana-
tions (i.e., identification of a set of hypothe- tion for b, and deduction will forecast that
ses that are able to account for the clinical a patient affected by disease A will manifest
case on the basis of the available data) and symptom b: Both inferences are using the
testing those explanations (i.e., evaluation of same relation along two different directions.
each generated hypothesis on the basis of its These three types of reasoning in medicine
expected consequences) for the abnormal are described in a paper by Patel and
state of the patient at hand (Elstein, Shul- Ramoni (1 997).
man, & Sprafka, 1 978; Kassirer, 1 989; Joseph In the testing phase, hypotheses are tested
& Patel, 1 990; Ramoni et al., 1 992). Tra- incrementally according to their ability to
ditional accounts of medical reasoning de- account for the whole problem, and de-
scribed the diagnostic process in a way that duction serves to build up the possible
is independent of the underlying structure world described by the consequences of each
thinking and reasoning in medicine 731
hypothesis. This kind of reasoning is cus- pothesis to data). Physicians first generated
tomarily regarded as a common way to eval- a small set of hypotheses very early in the
uate diagnostic hypotheses (Kassirer, 1 989; case, as soon as the first pieces of data became
Patel, Evans, & Kaufman, 1 989; Joseph & Pa- available. Second, they were selective in the
tel, 1 990; Patel, Arocha, & Kaufman, 1 994, data they collected, focusing only on the rel-
2001 ). As predictions are derived from hy- evant data. Third, physicians made use of the
potheses, they are matched to the case hypothetico-deductive process, which con-
through a process of induction in which a sists of four stages – cue acquisition, hy-
prediction generated from a hypothesis can pothesis generation, cue interpretation, and
be matched with one specific aspect of the hypothesis evaluation. Cues in the clinical
patient problem. The major feature of in- case led to the generation of a few selected
duction, therefore, is the ability to rule out hypotheses, and each cue was interpreted
hypotheses, the expected consequences of as positive, negative, or noncontributory to
which turn out to be not in agreement with each hypothesis generated. Then each hy-
the patient’s problem. This is because there pothesis was evaluated for consistency with
is no way to logically confirm a hypothe- the cues. Using this framework, these in-
sis, but we can only disconfirm or refute it vestigators were unable to find differences
in the presence of contrary evidence. This between superior physicians (as judged by
evaluation process closes the testing phase their peers) and other physicians (Elstein,
of the diagnostic cycle. Moreover, it deter- Shulman, & Sprafka, 1 978).
mines which information is needed to dis-
criminate among hypotheses and, therefore,
which information has to be collected.
Forward-Driven and Backward-Driven
Reasoning
Hypothesis Testing and Later, Patel and Groen (1 986) studied

Clinical Reasoning knowledge-based solution strategies of ex-
pert cardiologists as evidenced by their
Although a model such as the one just pre- pathophysiological explanations of a com-
sented can be used to account for a large plex clinical problem. The results indi-
part of the medical diagnostic process, em- cated that subjects who accurately diagnosed
pirical literature points to various strategies the problem employed a forward-oriented
of diagnostic reasoning that underscore the (data-driven) reasoning strategy – using pa-
relative importance of deduction, induction, tient data to lead toward a complete diagno-
or abduction. In their seminal work, Elstein sis (i.e., reasoning from data to hypothesis).
and colleagues (1 978) studied the problem- This was in contrast to subjects who mis-
solving processes of physicians by drawing diagnosed or partially diagnosed the patient
on then-contemporary methods and theories problem, who tended to use a backward
of cognition. Their view of problem solving or hypothesis-driven reasoning strategy. The
had a substantial influence on both stud- results of this study presented a challenge
ies of medical reasoning and medical edu- to the hypothetico-deductive model of rea-
cation. They were the first to use experi- soning as espoused by Elstein and colleagues
mental methods and theories of cognitive (1 978), which did not differentiate expert
science to investigate clinical competency. from nonexpert reasoning strategies.
Their research findings led to the develop- A hypothesis for reconciling these seem-
ment of an elaborated model of hypothetico- ingly contradictory results is that forward
deductive reasoning, which proposed that reasoning is used in clinical problems in
physicians reasoned by first generating and which the physician has ample experi-
then testing a set of hypotheses to account ence. When reasoning through unfamil-
for clinical data (i.e., reasoning from hy- iar or difficult cases, however, physicians
resort to backward reasoning because their is highly efficient, it often is error prone
knowledge base does not support a pattern- in the absence of adequate domain knowl-
matching process. To support this expla- edge because there are no built-in checks
nation, Patel, Groen, and Arocha (1 990) on the legitimacy of the inferences that a
looked for the conditions under which person makes. Pure data-driven reasoning is
forward reasoning breaks down. Cardiol- successful only in constrained situations in
ogists and endocrinologists were asked to which one’s knowledge of a problem can re-
solve diagnostic problems in both fields. sult in a complete chain of inferences from
They showed that under conditions of case the initial problem statement to the prob-
complexity and uncertainty, the pattern lem solution. In contrast, hypothesis-driven
of forward reasoning was disrupted. More reasoning is slower and requires high mem-
specifically, the breakdown occurred when ory load, because one has to keep track of
nonsalient cues in the case were tested goals and hypotheses. It therefore is most
for consistency against the main hypoth- likely to be used when domain knowledge
esis, even in subjects who had generated is inadequate or the problem is complex.
the correct diagnosis. Otherwise, the re- Hypothesis-driven reasoning is an exemplar
sults supported previous studies in that sub- of a weak method of problem solving in the
jects with accurate diagnoses used pure sense that it is used in the absence of relevant
forward reasoning. prior knowledge and when there is uncer-
If forward reasoning breaks down when tainty about problem solution. In problem-
case complexity is introduced, then experts solving terms, strong methods engage knowl-
and novices should reason differently be- edge, whereas weak methods refer to general
cause routine cases for experts would not be strategies. Weak does not necessarily imply
so for less-than-expert subjects. Investigat- ineffectual in this context.
ing clinical reasoning in a range of contexts Studies also showed that data-driven rea-
of varying complexity (Patel & Groen, 1 991 ; soning can break down because of uncer-
Patel, Arocha, Kaufman, 1 994), the authors tainty (Patel, Groen, & Arocha, 1 990). These
found that novices and experts have differ- conditions include the presence of “loose
ent patterns of data-driven and hypothesis- ends” in explanations in which some piece
driven reasoning. As before, experts used of information remains unaccounted for and
data-driven reasoning, which depends on isolated from the overall explanation. Loose
the physician’s possessing a highly organized ends trigger explanatory processes that work
knowledge base about the patient’s disease, by hypothesizing a disease, for instance, and
including sets of signs and symptoms. Fur- trying to fit the loose ends within it in
thermore, because of their extensive knowl- a hypothesis-driven reasoning fashion. The
edge base and the high-level inferences they presence of loose ends may foster learning
make, experts typically skip steps in their as the person searches for an explanation for
reasoning. In contrast, because of their lack them. A medical student or a physician may
of substantive knowledge or their inability to encounter a sign or a symptom in a patient
distinguish relevant from irrelevant knowl- problem, for instance, and look for infor-
edge, less-than-expert subjects (novices and mation that may account for the finding by
intermediates) used more hypothesis-driven searching for similar cases seen in the past,
reasoning, resulting often in very complex reading a specialized medical book, or con-
reasoning patterns. Similar patterns of rea- sulting a domain expert. (See Chi & Ohls-
soning have been found in other domains son, Chap. 1 6, for a discussion of such com-
(Larkin et al., 1 980). plex forms of learning.)
The fact that experts and novices rea- In some circumstances, however, the use
son differently suggests that they might of data-driven reasoning may lead to a
reach different conclusions (e.g., decisions heavy cognitive load. When students are
or understandings) when solving medical given problems to solve while they are be-
problems. Although data-driven reasoning ing trained in the use of problem-solving
strategies, for instance, the situation pro- stances to compare and interpret a current
duces a heavy load on cognitive resources, clinical case. In such studies, mainly involv-
which may diminish students’ ability to fo- ing visual diagnosis – based on data sources
cus on the task. The reason is that stu- such as radiographs, dermatological slides,
dents have to share cognitive resources (e.g., and electrocardiograms – specific similarity
attention, memory) between learning the to previous cases accounts for about 3 0%
problem-solving method and learning the of diagnoses made (see Goldstone & Son,
content of the material. Research (Sweller, Chap. 2; Medin & Rips, Chap. 3 ). Further-
1 988) suggests that when subjects use a strat- more, errors made by experts in identifying
egy based on data-driven reasoning, they abnormalities in images are affected by the
are more able to acquire a schema for the prior history of the patient. That is, if the
problem. In addition, other characteristics prior history of the patient mentioned a pos-
associated with expert performance were sible abnormality, expert physicians more of-
observed, such as a reduced number of ten identified abnormalities in the images
moves to the solution. When subjects used even when none existed, which also sup-
a hypothesis-driven reasoning strategy, their ports the effect of specific past cases on the
problem-solving performance suffered. The interpretation of a current case.
study of medical reasoning has been summa- In pursuing their explanation, Norman
rized in a series of articles (e.g. Patel et al., and colleagues (Norman and Brooks, 1 997)
1 994; Patel, Kaufman, & Arocha, 2002) and argued against the hypothesis that expert
papers in edited volumes (Clancey & Short- physicians diagnose clinical cases by “ana-
liffe, 1 984; Szolovits, 1 982). lyzing” signs and symptoms and developing
correspondences among those signs, symp-
toms, and diagnoses, as traditional cognitive
research in medical reasoning suggests. They
The Role of Similarity in Diagnostic suggest instead the “nonanalytic” basis for
Reasoning medical diagnosis, in which diagnostic rea-
soning is characterized by the unanalyzed
The fact that physicians make use of forward retrieval of a similar case previously seen
reasoning in routine cases suggests a type of in medical practice to interpret the current
processing that is fast enough to be able to case – a kind of exemplar-based or case-
lead to the recognition of a set of signs and based reasoning. This discussion has its
symptoms in a patient and generate a diag- counterpart in the psychology of categoriza-
nosis based on such recognition. Most often tion, in which two accounts have been pro-
this has been interpreted as a type of specific- posed – categorization works either by a re-
to-general reasoning (e.g., reasoning from an liance on prototypes or by exemplars (Medin
individual case to a clinical schema or proto- & Rips, Chap. 3 ).
type). However, consistent with the model Exemplar-based thinking is certainly a
of abductive reasoning, some philosophers fundamental aspect of human cognition.
(Schaffner, 1 986) and empirical researchers There is ample evidence of the conditions
(Norman & Brooks, 1 997) have supported under which reasoning by analogy to previ-
an alternative hypothesis that consists of ous cases is used (Gentner & Holyoak, 1 997;
specific-to-specific reasoning. That is, ex- Holyoak & Thagard, 1 997; see Holyoak,
perts also use knowledge of specific instances Chap. 6). Furthermore, given the complex-
(e.g., particular patients with specific disease ity of natural reasoning in a highly dense
presentations) to interpret particular cases, knowledge domain such as medicine, it is
rather than relying only on general clinical highly likely that more than one type of rea-
knowledge (Kassirer & Kopelman, 1 990). soning is actually employed. Seen in this
Brooks and colleagues (Brooks, Norman, light, the search for a single manner in
& Allen, 1 991 ; Norman & Brooks, 1 997) ar- which clinicians diagnose clinical problems
gued that clinicians make use of specific in- may not be a reasonable goal. The inherent
adaptability of humans to different kinds found in other scientific domains. In this

of knowledge domains, situations, problems, view, the medical disciplines, notably clin-
and cases may call for the use of a variety of ical medicine, are organized vertically, and
reasoning strategies, which is what, after all, reasoning by analogy (see Holyoak, Chap-
the notion of abductive medical reasoning ter 6) plays a more important role than
tries to formalize (Patel & Ramoni, 1 997). causal reasoning. Based on such a distinction,
Alongside rule-based and prototype reason- it has been argued that reasoning in the phys-
ing, a model of clinical reasoning may allow ical sciences and reasoning in the biomedical
for case-based, nonanalytical reasoning, in sciences are of different types.
which recognizing similarity between partic- In particular, it has been argued that rea-
ulars may be the main cognitive mechanism. soning in the physical sciences, to some ex-
A reason for the variety of strategies used tent, can be conceptualized as a “deductive
in actual diagnostic problems may be found systematization of a broad class of general-
in the inherent organization of medical izations under a small number of axioms,”
knowledge. but this characterization cannot be applied
to the biomedical sciences. The latter are
characterized by what Shaffner (1 986, p. 68)
calls “a series of overlapping interleaved tem-
Reasoning and the Nature of poral models” that are based on familiarity
Medical Knowledge with shared exemplars to a much greater
degree than is necessary in the physical
Although a motivation for looking at med- sciences. Shaffner’s characterization, unlike
ical reasoning was to establish its relation- that of Blois, applies to both biomedical re-
ship with reasoning in other fields, such as search and clinical medicine. In biomedical
science, the prevalent view in the philoso- research, an organism such as a Drosophila,
phy of medicine (Blois, 1 990) has been that for instance, is used as an exemplar embody-
medical knowledge has an extremely coming a given disease mechanism that, by anal-
plex organization, requiring the use of dif- ogy, applies to other organisms, including
ferent reasoning strategies than those used in humans. In the clinical sciences, the patient
other more formal scientific disciplines, such is seen as an exemplar to which generaliza-
as physics. Disciplines such as physics, chem- tions based on multiple overlapping models
istry, and some subfields of biology, are said are applied from diseases and the population
to be horizontally organized, which means of similar patients.
these domains are characterized by the con- In the empirical research on medical
struction of causal relations among concepts reasoning, the distinction between reason-
and by the application of general principles ing from cases versus reasoning from pro-
to specific instances (Blois, 1 990). By this, totypes has not been established. Medical
it is meant that such scientific fields are or- knowledge consists of two categories of
ganized in a hypothetico-deductive manner knowledge – clinical knowledge, includ-
in which particular statements are gener- ing knowledge of disease processes and as-
ated from general statements and causality sociated findings; and basic science knowl-
plays a major role. This type of reasoning, in edge, incorporating subject matter such as
which one connects one concept to another biochemistry, anatomy, and physiology. Ba-
by forming causal networks, has been called sic science or biomedical knowledge is sup-
horizontal reasoning (Blois, 1 990). These posed to provide a scientific foundation for
philosophers argued that causal reasoning clinical reasoning. The conventional view
does not play such an important role in the is that basic science knowledge can be
medical domain. They argue, instead, that seamlessly integrated into clinical knowl-
reasoning in medicine requires vertical think- edge analogous to the way that learning the
ing. This kind of reasoning makes more use rules of the road can contribute to one’s
of the analogy than the reasoning typically mastery of driving a car. In this capacity,
Figure 30.1 . Idealized representation of the “intermediate effect.” The straight line gives a commonly
assumed representation of performance development by level of expertise. The curved, U-shaped,
line represents the actual development from novice to expert. The y-axis may represent performance
variables, such as the number of errors made, irrelevant concepts recalled, conceptual elaborations, or
number of extraneous hypotheses generated in a variety of tasks.
a particular piece of biomedical knowledge (Thibodeau et al., 1 989; Chi et al., 1 989).
could be automatically elicited in a range of This continuum also points to the partic-
clinical contexts and tasks in more or less the ular nature of medical knowledge and its
same fashion. acquisition.
Changes have been described in this pro-
cess that serve to characterize the various
phases medical trainees go through to be-
Knowledge Organization and come expert clinicians. An important char-
Changes in Directionality acteristic of this process is the intermediate
effect. This refers to the fact that, although
Following Blois (1 988) and Schaffner it seems reasonable to assume that perfor-
(1 986), it can be argued that the way medical mance improves with training or time on
knowledge is organized can be a determinant task, there appear to be particular transitions
factor explaining why experts do not use the in which subjects exhibit a certain drop in
hypothetico-deductive method of reasoning. performance. This is an example of what is
Maybe the medical domain is too messy referred to as nonmonotonicity in the devel-
to allow its neat partitioning and deductive opmental literature (Strauss & Stavy, 1 982)
use of reasoning strategies. Although the the- and is also observed in skill acquisition. The
ory of reasoning in medicine is basically a symptom is a learning curve or developmen-
theory of expert knowledge, reaching the tal pattern that is shaped like a U or an
level of efficient reasoning of the expert clin- inverted U, as illustrated in Figure 3 0.1 . In
ician reflects the extended continuum of medical expertise development, intermedi-
training and levels of reasoning performance ates’ performance reflects the degradation in
reasoning that results from the acquisition solidated, the intermediate is more likely to
of knowledge through a time during which engage in unnecessary search. Whether this
such knowledge is not well-organized and knowledge, painfully acquired during med-
irrelevant associations abound in the inter- ical training, is really necessary for clinical
mediate’s knowledge base. In contrast, the reasoning has been a focus of intensive re-
novice’s knowledge base is too sparse, con- search and great debate. If expert clinicians
taining very few associations, whereas the do not explicitly use underlying biomedi-
expert’s knowledge base is well pruned of cal knowledge, does that mean that it is not
the irrelevancies that characterize interme- necessary? Or could it be simply that this
diates. It should be noted that not all in- knowledge remains “dormant” until is really
termediate performance is nonmonotonic; needed? This raises an important question of
for example, on some global criteria such whether expert medical knowledge is deep
as diagnostic accuracy, there appears to be or shallow.
a steady improvement.
The intermediate effect occurs with many
tasks and at various levels of expertise. The
tasks vary from comprehension of clinical Causal Reasoning in Medicine
cases and explanation of clinical problems
to problem solving to generating laboratory The differential role of basic science knowl-
data. The phenomenon may be attributable edge (e.g., physiology and biochemistry) in
to the fact that intermediates have acquired solving problems of varying complexity and
an extensive body of knowledge but have the differences between subjects at different
not yet reorganized this knowledge in a levels of expertise (Patel et al., 1 994) have
functional manner. Intermediate knowledge been a source of controversy in the study of
therefore has a sort of network structure medical cognition (Patel & Kaufman, 1 995 )
that results in considerable search, which as well as in medical education and AI.
makes it more difficult for intermediates to As expertise develops, the disease knowl-
set up structures for rapid encoding and edge of a clinician becomes more dependent
selective retrieval of information (Patel & on clinical experience, and clinical problem
Groen, 1 991 ). In contrast, expert knowledge solving is increasingly guided by the use of
is finely tuned to perform various tasks, and exemplars and analogy and becomes less de-
experts can readily filter out irrelevant infor- pendent on a functional understanding of
mation using their hierarchically organized the system in question. However, an in-
schemata. The difference is reflected in the depth conceptual understanding of basic sci-
structural organization of knowledge and the ence plays a central role in reasoning about
extent to which it is proceduralized to per- complex problems and is also important in
form different tasks. generating explanations and justifications for
Schmidt and Boshuizen (1 993 ) reported decisions.
that intermediate nonmonotonicity recall Researchers in AI were confronted with
effects disappear by using short exposure similar problems in extending the utility of
times (about thirty seconds), which sug- systems beyond their immediate knowledge
gests that under time-restricted conditions, base. Biomedical knowledge can serve differ-
intermediates cannot engage in extraneous ent functional roles depending on the goals
search. Whereas a novice’s knowledge base of the system. Most models of diagnostic rea-
is likely to be sparse and an expert’s knowl- soning in medicine can be characterized as
edge base is intricately interconnected, the being shallow. A shallow medical expert sys-
knowledge base of an intermediate possesses tem (e.g., MYCIN and INTERNIST) reasons
many of the pieces of knowledge but lacks by relating observations to intermediate hy-
the extensive connectedness of an expert. potheses that partition the problem space
Until this knowledge becomes further con- and further by associating intermediate
hypotheses with diagnostic hypotheses. This sponses. The intermediate effect can also be
is consistent with the way physicians appear accounted for as a stage in the encapsula-
to reason. tion process in which a trainee’s network of
There are other medical reasoning sys- knowledge has not yet become sufficiently
tem models that propose a “deep” mode differentiated, resulting in more extensive
of reasoning as a main mechanism. Chan- processing of information.
drasekeran, Smith, & Sticklen, (1 989) char- Knowledge encapsulation provides an ap-
acterize a deep system as one that embod- pealing account of a range of developmen-
ies a causal mental model of bodily function tal phenomena in the course of acquiring
and malfunction, similar to the models used medical expertise. The integration of ba-
in qualitative physics (Bobrow, 1 985 ). Sys- sic science in clinical knowledge is a rather
tems such as MDX-2 (Chandrasekeran et al., complex process, however, and encapsula-
1 989) or QSIM (Kuipers, 1 987) have ex- tion is likely to be only part of the knowledge
plicit representations of structural compo- development process. Basic science knowl-
nents and their relations, the functions of edge plays a different role in different clini-
these components (in essence their pur- cal domains. For example, clinical expertise
pose), and their relationship to behavi- in perceptual domains, such as dermatology
oral states. and radiology, necessitates a relatively ro-
To become licensed physicians, medical bust model of anatomical structures that is
trainees undergo a lengthy training process the primary source of knowledge for diag-
that entails the learning of biomedical sci- nostic classification. In other domains, such
ences, including biochemistry, physiology, as cardiology and endocrinology, basic sci-
anatomy, and others. The apparent contra- ence knowledge has a more distant relation-
diction between this type of training and ship with clinical knowledge. The miscon-
the absence of deep biomedical knowledge ceptions evident in physicians’ biomedical
during expert medical reasoning has been explanations would argue against their hav-
pointed out. To account for such appar- ing well-developed encapsulated knowledge
ent inconsistency, Boshuizen and Schmidt structures in which basic science knowledge
(1 992) proposed a learning mechanism – can easily be retrieved and applied when
knowledge encapsulation. Knowledge encap- necessary.
sulation is a learning process that involves The results of research into medical prob-
the subsumption of biomedical propositions lem solving are consistent with the idea that
and their interrelations in associative clusters clinical medicine and biomedical sciences
under a small number of higher-level clini- constitute two distinct and not completely
cal propositions with the same explanatory compatible worlds with distinct modes of
power. Through exposure to clinical train- reasoning and quite different ways of struc-
ing, biomedical knowledge presumably be- turing knowledge (Patel, Arocha, & Kauf-
comes integrated with clinical knowledge. man, 1 994). Clinical knowledge is based
Biomedical knowledge can be “unpacked” on a complex taxonomy that relates dis-
when needed but is not used as a first line ease symptoms to underlying pathology. In
of explanation. contrast, biomedical sciences are based on
Boshuizen and Schmidt (1 992) cite a general principles defining chains of causal
wide range of clinical reasoning and re- mechanisms. Learning to explain how a set
call studies that support this kind of learn- of symptoms is consistent with a diagnosis
ing process. Of particular importance is the therefore may be very different from learn-
well-documented finding that with increasing how to explain what causes a disease.
ing levels of expertise, physicians produce (See Buehner & Cheng, Chap. 7, for a dis-
explanations at higher levels of generality, cussion of causal learning.)
using fewer and fewer biomedical concepts The notion of the progression of men-
while producing consistently accurate re- tal models (White & Frederiksen, 1 990) has
been used as an alternative framework for differs in two important senses. The pro-
characterizing the development of concep- cess of compiling knowledge is not one of
tual understanding in biomedical contexts. subsumption or abstraction, and the origi-
Mental models are dynamic knowledge nal knowledge (uncompiled mental model)
structures composed to make sense of expe- may no longer be available in a similar
rience and to reason across spatial or tempo- form (Kuipers & Kassirer, 1 984). The sec-
ral dimensions (see Johnson-Laird, Chap. 9). ond difference is that mental models are
An individual’s mental models provide pre- composed dynamically out of constituent
dictive and explanatory capabilities of the pieces of knowledge rather than prestored
function of a given system. The authors unitary structures. The use of mental mod-
employed the progression of mental models is somewhat opportunistic and the learn-
els to explain the process of understanding process is less predictable. The compi-
ing increasingly sophisticated electrical cir- lation process can work in reverse as well.
cuits. This notion can be used to account That is to say, discrete cause-and-effect re-
for differences between novices and experts lationships can be integrated into a mental
in understanding circulatory physiology, de- model as a student reasons about complex
scribing misconceptions (Patel, Arocha, & physiological processes.
Kaufman, 1 994), and explaining the gen-
eration of spontaneous analogies in cau-
sal reasoning. Errors and Medical Reasoning
Running a mental model is a potentially
powerful form of reasoning but it is also
According to the report from the Institute
cognitively demanding. It may require an
of Medicine (Kohn, Corrigan, & Donaldson,
extended chain of reasoning and the use
1 999), medical error is the eighth leading
of complex representations. It is apparent
cause of death in the United States ahead
that skilled individuals learn to circumvent
of deaths attributable to motor vehicle acci-
long chains of reasoning and chunk or com-
dents, breast cancer, or acquired immunod-
pile knowledge across intermediate states of
eficiency syndrome. Cognitive mechanisms,
inference (Chandrasekaran, 1 994; Newell,
such as mistakes of reasoning and decision
1 990). This results in shorter, more direct,
making and action slips of skilled perfor-
inferences that are stored in long-term mem-
mance, are the major factors contributing
ory and are directly available to be retrieved
to medical errors. A cognitive taxonomy is
in the appropriate contexts. Chandrasekaran
essential to understanding, explaining, and
(1 994) refers to this sort of knowledge as
predicting medical errors and to develop-
compiled causal knowledge. This term refers
ing interventions to reduce medical errors.
to knowledge of causal expectations that
Based on the definition and preliminary tax-
people compile directly from experience
onomy by Reason (1 990) and the action the-
and partly by chunking results from previ-
ory by Norman (1 986), Zhang et al. (2004,
ous problem-solving endeavors. The goals
in review) developed a cognitive taxonomy
of the individual and the demands of re-
for human errors in medicine.
curring situations largely determine which
pieces of knowledge get stored and used.
When physicians are confronted with a simi-
lar situation, they can employ this compiled A Cognitive Taxonomy of
knowledge in an efficient and effective man- Medical Errors
ner. The development of compiled knowl-
edge is an integral part of the acquisition One critical step toward understanding the
of expertise. cognitive mechanisms of various errors in
The idea of compiling declarative knowl- medical reasoning is to categorize the er-
edge bears a certain resemblance to the idea rors along cognitively meaningful dimen-
of knowledge encapsulation, but the claim sions. Reason (1 990) defines human error as
a failure to achieve the intended outcome child has fever or toothache and the weight
in a planned sequence of mental or physical of the child is 24–3 5 lbs.”
activities. He divides human errors into two Goal mistakes and intention mistakes
major categories: (1 ) slips that result from are caused by many complex factors such
the incorrect execution of a correct action as incorrect knowledge, incomplete knowl-
sequence and (2) mistakes that result from edge, and misuse of knowledge; biases;
the correct execution of an incorrect ac- faulty heuristics; and information overload.
tion sequence. Norman’s theory of action For example, neglect of base rate informa-
(Norman, 1 986) decomposes a human action could result in incorrect diagnosis of
tivity into seven stages. a disease. This is a well-documented find-
Based on Reason’s definition of human ing in human decision making (Tversky &
error and Norman’s action theory, Zhang Kahneman, 1 974; Kahneman & Frederick,
and colleagues developed a cognitive tax- Chap. 1 2). As another example, the goal of
onomy. Under this taxonomy, errors are di- “treating the disease as pneumonia” could
vided into slips and mistakes, just like Rea- be a mistake if it is a misdiagnosis based on
son’s two main categories. Then slips are incomplete knowledge (e.g., without radio-
divided into execution slips and evaluation graphic images). Intention mistakes can be
slips. Execution slips include goal, intention, caused by similar factors, such as the follow-
action specification, and action execution ing example: A physician treating a patient
slips, whereas evaluation slips include per- with oxygen set the flow control knob be-
ception, interpretation, and evaluation slips. tween one and two liters per minute, not
Similarly, mistakes can be divided into exe- realizing that the scale numbers represented
cution mistakes that include goal, intention, discrete, rather than continuous, settings. As
action specification, and action execution a result, the patient did not receive any oxy-
mistakes and evaluation mistakes that in- gen. This is a mistake caused by incomplete
clude perception, interpretation, and evalu- knowledge. The use of heuristics is another
ation mistakes. This taxonomy can cover ma- common source of goal and intention mis-
jor types of medical errors, because a medical takes. A heuristic often used is the reliance
error is a human error in an action and any on disease schemata during clinical diagno-
action goes through the seven stages of the sis. Disease schemata are knowledge struc-
action cycle. Most reasoning and decision- tures that have been formed from previ-
making errors in medicine are under the cat- ous experience with diagnosing diseases and
egory of mistakes in the taxonomy. They contain information about relevant and ir-
are attributable to incorrect or incomplete relevant signs and symptoms. When physi-
knowledge. cians and medical students diagnose pa-
tients, they tend to rely on their schemata
and base their reasoning on the apparent
similarity of patient information with these
Reasoning and Decision-Making schemata instead of a more objective anal-
Mistakes in Medicine ysis of patient data. The schemata used in
diagnosis often guide future reasoning about
In the cognitive taxonomy, goal and inten- the patient, affecting what tests are run
tion mistakes are mistakes about declarative and how data are interpreted. Arocha and
knowledge – knowledge about factual state- Patel (1 995 ) found that medical students and
ments and propositions, such as “Motrin is trainees maintained their initial hypotheses,
a pain reliever and fever reducer.” Action even if subsequent data were contradictory.
specification mistakes and action execu- Therefore, if the initial hypothesis is wrong,
tion mistakes are mistakes about procedural errors in diagnosis and treatment are likely
knowledge – knowledge about procedures to occur. Preliminary presentation of the pa-
and rules, such as “give 1 tsp Motrin to a tient (e.g., signs and symptoms), then, be-
child per dosage up to 4 times a day if the comes very important because it can suggest
strongly held hypotheses (i.e., lead to the use or incomplete knowledge leads a person to
of schemata). judge the completion or incompletion of
Action specification and action execution a goal erroneously.
mistakes are procedural mistakes that can
be caused by many factors, such as lack of
correct rules, overgeneralized application of
good rules, misapplication of good rules, en- Medical Reasoning and
coding deficiencies in rules, and dissociation Decision Research
between knowledge and rules. Overgener-
alized application of good rules, for exam- Decision making is central to medical activ-
ple, can cause an error because the condi- ity. Although health-care professionals are
tion part of a condition-action rule could be generally highly proficient decision mak-
misidentified and mismatched, causing the ers, their erroneous decisions have be-
firing of the action part of the rule. Proce- come the source of considerable public
dural mistakes caused by encoding efficien- scrutiny (Kohn, Corrigan, & Donaldson,
cies of action rules are usually attributable to 1 999).
the evolving nature of the rules and unfore- Decisions involve the application of rea-
seeable conditions that cannot be encoded soning to select some course of action
in the rules. A good rule may be misused that achieves the desired goal (see LeBoeuf
because the user may have incorrect or in- & Shafir, Chap. 1 1 ). Hastie (2001 ) identi-
complete knowledge about the condition of fied three components of decision making:
the rule in a specific context. The knowl- (1 ) choice options and courses of actions;
edge of a rule and the knowledge of how (2) beliefs about objective states, processes,
to use a rule are not always automatically and events in the world, including out-
linked without extensive practice. This dis- come states and means to achieve them; and
sociation, attributable to the lack of expe- (3 ) desires, values, or utilities that describe
rience and practiced skills, may also lead to the consequences associated with the out-
action execution mistakes. comes of each action–event combination.
Perception mistakes can be caused by Reasoning plays a major role in this pro-
expectation-driven processing. What we cess. In this context, a major thrust of re-
perceive is a function of the input and our search has been the study of hypothesis test-
expectations. This mechanism is what al- ing, which has been studied widely in the
lows us to read sloppy handwriting or rec- medical domain. Such research has shown
ognize degraded images. However, our ex- the pervasiveness of confirmation bias, ev-
pectations can also lead to misperceptions. idenced by the generation of a hypothesis
Interpretation mistakes are the incorrect in- and the subsequent search for evidence con-
terpretation of feedback caused by incorrect sistent with the hypothesis, often leading to
or incomplete knowledge. Suppose, for in- failure to adequately consider alternative di-
stance, that an intravenous infusion pump, agnostic possibilities. This bias may result in
a device often used in critical care environ- a less-than-thorough investigation with pos-
ments to give medications, indicates readi- sible adverse consequences for the patient.
ness to begin infusing medications using a A desire to confirm one’s preferred hypoth-
steady green light and indicates the infusion esis, moreover, may contribute to increased
is in progress by flashing the green light. If inefficiency and costs by ordering additional
the device user does not know the mean- laboratory tests that will do little to revise
ing of the steady green light, he or she one’s opinion, providing largely redundant
may incorrectly interpret it as an indication data (Chapman & Elstein, 2000).
that the infusion has begun. Generating dif- Health-care team decision making is the
ferent interpretations and treatment proce- rule rather than the exception in medicine.
dures from the same evidence is another Naturalistic decision-making (NDM) is con-
source of interpretation mistake. An action cerned with the study of cognition in real
evaluation mistake occurs when incorrect world work environments that are often
dynamic (i.e., rapidly changing) (Klein, sions as choosing among known alternatives,
1 993 ). The majority of this research com- real-world decision-making is best investi-
bines conventional protocol analytic meth- gated by a naturalistic approach in which
ods with innovative methods designed to reasoning is constrained by dynamic factors,
investigate reasoning and behavior in re- such as stress, time pressure, risk, and team
alistic settings (Woods, 1 993 ; Rasmussen, interactions. Looking at medical reasoning
Pejtersen, & Goodstein, 1 994). The study in social and collaborative settings is even
of decision making in the work context more important when information tech-
necessitates an extended cognitive science nologies are part of the ebb and flow of clini-
framework beyond typical characterizations cal work.
of knowledge structures, processes, and
skills to include modulating variables such
as stress, time pressure, and fatigue, as Reasoning and Medical Education
well as communication patterns in team
performance. The failures and successes of reasoning
Among the issues investigated in NDM strategies and skills can be traced back to
are understanding how decisions are jointly their sources – education. There is evidence
negotiated and updated by participants dif- suggesting that the way physicians reason
fering substantially in their areas of expertise follows from the way they were educated.
(e.g., pharmacology, respiratory medicine), Medical education in North America as well
how the complex communication process in as in the rest of the world has followed a sim-
these settings occurs, what role technology ilar path – from practice-based training to an
plays in mediating decisions and how it af- increasingly scientific type of training.
fects reasoning, and what sources of error Motivated by the increasing importance
exist in the decision making process. of basic scientific knowledge in the context
Patel, Kaufman, and Magder (1 996) stud- of clinical practice, problem-based learning
ied decision-making in a medical intensive (PBL) was developed on the premise that
care unit with the particular objective of de- not only should physicians possess the or-
scribing jointly negotiated decisions, com- dered and systematic knowledge of science,
munication processes, and the development but they should think like scientists during
of expertise. Intensive care decision-making their practices. Consistent with this idea,
is characterized by a rapid serial evaluation an attempt was made to teach hypothetico-
of options leading to immediate action in deductive reasoning to medical students
which reasoning is schema-driven in a for- to provide an adequate structure to med-
ward direction toward action with minimal ical problem solving. After all, this was
inference or justification. When patients do the way scientists were supposed to make
not respond in a manner consistent with the discoveries.
original hypothesis, however, the original de- Based on cognitive research in other
cision comes under scrutiny. This strategy knowledge domains, some researchers ar-
can result in a brainstorming session in which gued, however, that the hypothetico-
the team retrospectively evaluates and re- deductive method might not be the most
considers the decision and considers possible efficient way of solving clinical problems. To
alternatives. In such circumstances, various investigate how the kind of training medi-
patterns of reasoning are used to evaluate cal students received affected their reasoning
alternatives in these brainstorming sessions. patterns, Patel, Groen, and Norman (1 993 )
These include probabilistic reasoning, diag- looked at the problem-solving processes of
nostic reasoning, and biomedical causal rea- students in two medical schools with dif-
soning. Supporting decision making in clini- ferent modes of instruction – classical and
cal settings necessitates an understanding of problem-based. They found that students in
communication patterns. the problem-based curriculum reasoned in
In summary, although traditional ap- a way that was consistent with their train-
proaches to decision making looked at deci- ing methods, showing a preponderance of
hypothetico-deductive reasoning and exten- learning seems to promote a type of learn-

sive elaborations of biomedical information. ing in which basic biomedical knowledge
The PBL students have been shown to becomes so tightly tied to specific clinical
use hypothesis-driven reasoning – from the problem types that it becomes difficult to
hypothesis to explain the patient data – decouple this knowledge in context to trans-
whereas non-PBL students use mainly data- fer to a new situation (Anderson, Reder, &
driven reasoning – from data toward the hy- Simon, 1 996; Holyoak, 1 984).
pothesis. In explaining clinical cases, PBL This outcome is consistent with how
students produce extensive elaborations us- biomedical information is taught in the class-
ing detailed biomedical information, which room in PBL schools – by encouraging use of
is relatively absent from non-PBL students’ the hypothetico-deductive method, result-
explanations. However, these elaborations ing in a predominantly backward-directed
result in the generation of errors. Problem- mode of reasoning. Elaborations are accom-
based learning promotes the activation and panied by a tendency to generate errors of
elaboration of prior knowledge. scientific fact and flawed patterns of ex-
Patel and colleagues (Patel, Arocha, & planation, such as circular reasoning. Even
Lecissi, 2001 ) also investigated the effects though a student’s explanation may be rid-
of non-PBL curricula on the use and inte- dled with bugs and misconceptions, their
gration of basic science and clinical knowl- harmful effects may be dependent on the di-
edge and their relationship to reasoning in rection of reasoning. If they reason forward,
diagnostic explanation. The results showed then they are likely to view their existing
that biomedical and clinical knowledge are knowledge as adequate. In this case, miscon-
not integrated and that very little biomedi- ceptions may be long lasting and difficult
cal information is used in routine problem- to eradicate. If they reason backward, mis-
solving situations. There is significant use of conceptions might best be viewed as tran-
expert-like, data-driven strategies, however, sient hypotheses that, in the light of expe-
in non-PBL students’ explanations. The use rience, are refuted or modified to form the
of biomedical information increases when kernel of a more adequate explanation. In-
the clinical problems are complex; at the terestingly, differences in the patterns of rea-
same time, hypothesis-driven strategies re- soning acquired in both PBL and non-PBL
place the data-driven strategies. medical curricula are found to be quite sta-
Students from a PBL school integrated ble, even after the students have completed
the two types of knowledge and, in contrast medical school and are in residency training
to the non-PBL students, they spontaneously programs (Patel, Arocha, Lecissi, 2001 ; Patel
used biomedical information in solving even & Kaufman, 2001 ).
routine problems. We concluded that, for Instruction that emphasizes decontextu-
students in the non-PBL curriculum, the alized abstracted models of phenomena has
clinical components of problems are treated not yielded much success in medicine or
separately from the biomedical science com- in other spheres of science education. It is
ponents. The two components of problem widely believed that the amount of transfer
analysis seem to be viewed as serving dif- will be a function of the overlap between
ferent functions. When needed, however, the original domain of learning and the tar-
biomedical knowledge is and seems to act get domain (Holyoak, 1 984). Problem-based
as a “glue” that ties the two kinds of infor- learning’s emphasis on real-world problems
mation together. represents a very good source of transfer to
In the PBL curriculum, the integration of clinical situations. However, it is very chal-
basic science and clinical knowledge is so lenging to create a problem set that most ef-
tight that students appear unable to separate fectively embodies certain biomedical con-
the two. As a result, PBL students generate cepts while maximizing transfer. Knowledge
unnecessarily elaborate explanations, lead- that is overly contextualized actually can re-
ing to errors of reasoning. Problem-based duce transfer.
Technology-Mediated Reasoning tasks cease to exist or completely change

in nature.
All technologies mediate human perfor- Diagrams, graphs, pictures, and informa-
mance. Technologies, whether they be tion displays are typical external represen-
computer-based or in some other form, tations. They are used in many cognitive
transform the ways individuals and groups tasks such as problem solving, reasoning,
behave. They do not merely augment, en- and decision making. In studies of the rela-
hance, or expedite performance, although tionship between mental images and exter-
a given technology may do all of these nal pictures, Chambers and Reisberg (1 985 ;
things. The difference is not one of quan- Reisberg, 1 987) showed that external rep-
titative change but one that is qualitative resentations could give people access to
in nature. Technology, tools, and artifacts knowledge and skills that are unavailable
enhance people’s ability to perform tasks from internal representations. This advan-
and change the way they perform tasks. tage typically arises because internal rep-
In cognitive science, this ubiquitous phe- resentations are already interpreted and
nomenon is called the representational effect, difficult to change, whereas external repre-
which refers to the phenomenon that dif- sentations are subject to interpretations and
ferent representations of a common abstract can lead to different understandings under
structure can generate dramatically differ- different conditions. In their studies of dia-
ent representational efficiencies, task com- grammatic problem solving, Larkin & Simon
plexities, and behavioral outcomes (Zhang & (1 987; Larkin, 1 989) show that diagram-
Norman, 1 994). matic representations help reasoning and
problem solving because they support oper-
ators that can recognize features easily and
make inferences directly. In studies of log-
Technology as External ical reasoning with diagrams, Stenning and
Representations Oberlander (1 994) demonstrated that dia-
grammatic representations such as Euler cir-
One approach to the study of how tech- cles limit abstraction and thereby ease pro-
nology mediates thinking and reasoning is cessing effort. It is well known that different
to consider technology as external repre- forms of graphic displays have different rep-
sentations (Zhang & Norman, 1 994, 1 995 ; resentational efficiencies for different tasks
Zhang, 1 997). External representations are and can cause different cognitive behav-
the knowledge and structure in the envi- iors. For example, Kleinmuntz and Schkade
ronment as physical symbols, objects, or di- (1 993 ) showed that different representa-
mensions (e.g., written symbols, beads of tions (graphs, tables, and lists) of the same in-
an abacus, dimensions of a graph), and as formation can dramatically change decision-
external rules, constraints, or relations em- making strategies: With a tabular display,
bedded in physical configurations (e.g., spa- people made one decision, but with a graph
tial relations of written digits, visual and display of the same information, people
spatial layouts of diagrams, physical con- made a different decision.
straints in abacuses). The information in ex-
ternal representations can be picked up, an-
alyzed, and processed by perceptual systems
alone, although the top-down participation The Impact of Technology on
of conceptual knowledge from internal rep- Thinking in Medicine
resentations sometimes facilitates or inhibits
the perceptual processes. External represen- The mediating role of technology can be
tations are more than inputs and stimuli evaluated at several levels. For example, elec-
to the internal mind. For many tasks, they tronic medical records alter the practice of
are intrinsic components without which the individual clinicians in significant ways, as
discussed subsequently. Changes to an infor- effects of technology on human reasoning

mation system substantially impact organi- in medicine.
zational and institutional practices, from re- As the basis for many medical decisions,
search to billing to quality assurance. Even diagnostic reasoning requires collecting,
the introduction of patient-centered medical understanding, and using many types of
records early in the twentieth century neces- patient information, such as history, lab-
sitated changes in hospital architecture and oratory results, symptoms, prescriptions,
considerably affected work practices in clin- images, and so on. It is affected by the
ical settings. Salomon, Perkins, and Glober- expertise of the clinicians and the way the
son (1 991 ) introduced a useful distinction information is acquired, stored, processed,
in considering the mediating role of tech- and presented. If we consider clinicians
nology on individual performance – the ef- as rational decision makers, the format of
fects with technology and the effects of tech- a display, as long as it contains the same
nology. The former is concerned with the information, should not affect the outcome
changes in performance displayed by users of the reasoning and decision-making
while equipped with the technology. For process. But the formats of displays do
example, when using an effective medical affect many aspects of clinicians’ task per-
information system, physicians should be formance. Several recent studies examined
able to gather information more systemat- how different displays of information in
ically and efficiently. In this capacity, med- EMR affect clinicians’ behavior. Three
ical information technologies may alleviate major types of displays have been studied –
some of the cognitive load associated with source-based, time-based, and concept-
a given task and permit physicians to focus based. Source-based displays organize
on higher-order thinking skills, such as hy- medical data by the sources of the data,
pothesis generation and evaluation. The ef- such as encounter notes, laboratory results
fects of technology refer to enduring changes and reports, medications, radiology imaging
in general cognitive capacities (knowledge and reports, physical examinations, and so
and skills) as a consequence of interaction on. Time-based displays organize medical
with a technology. For example, frequent data as a temporal history. Concept-based
use of information technologies may re- displays organize medical data by clinically
sult in lasting changes in medical decision- meaningful concepts or problems. In this
making practices even in the absence of case, all data related to each specific prob-
the system. lem are displayed together. For example, if
In several studies involving the mediating a patient has symptoms such as coughing,
role of technology in clinical practice, Patel chest pain, and fever, the laboratory results,
and colleagues (Patel et al., 2000) observed imaging reports, prescriptions, assessments,
the change in thinking and reasoning pat- and plans are displayed together. A study
terns caused by the change in methods of by Zeng, Cimino, & Zou, (2002) found that
writing patient records, from paper records different displays were good for different
to electronic medical records (EMR). They tasks. Source-based displays are good for
found that before using EMR, physicians fo- clinicians to retrieve information for a
cused on exploration and discovery, used specific test or procedure from a specific
complex propositions, and tended to use department, for example, whereas concept-
data-driven reasoning. With EMR, which based displays are good for searching for
structures data, physicians focus on problem information related to a specific disease.
solving, use simple propositions, and tend to With the rapid growth of computer-based
use problem-directed and hypothesis-driven information systems, we are interacting
reasoning. The change of behavior caused by more and more with computer-generated
the use of EMR remains when physicians go health information displays. If these displays
back to paper records, showing the enduring are to generate the information people need
for informed reasoning effectively and accu- The recent concern with understanding and
rately, good design is necessary. reducing medical errors provides an oppor-
tunity for cognitive scientists to apply cogni-
tive theories and methodologies to a press-
Conclusions and Future Directions ing practical problem. A trend in health
care, spurred partly by the advent of infor-
The process of medical reasoning is one area mation technologies that foster communi-
in which advances in cognitive science have cation, is the shift in health-care systems to
made significant contributions to investiga- become increasingly multidisciplinary, col-
tion. In particular, reasoning in medical con- laborative, and geographically spanning re-
texts involving a dense population and a gions. In addition, increasing costs of health
high degree of uncertainty (such as criti- care and rapid knowledge growth have ac-
cal care environments), compounded with celerated the trend toward collaboration of
constraints imposed by resource availabil- health-care professionals in sharing knowl-
ity, leads to increased use of heuristic strate- edge and skills. Comprehensive patient care
gies. The utility of heuristics lies in limit- necessitates the communication of health-
ing the extent of purposeful search through care providers in different medical domains,
data sets, which have substantial practical thereby optimizing the use of their exper-
value by reducing redundancy. A signifi- tise. Research on reasoning will need to con-
cant part of a physician’s cognitive effort is tinue to move toward a distributed model
based on heuristic thinking, but its use in- of cognition. This model will include a fo-
troduces considerable bias in medical rea- cus on both socially shared and technology-
soning, often resulting in a number of con- mediated reasoning.
ceptual and procedural errors. These include
misconceptions about laws governing prob-
ability, instantiation of general rules to a
specific patient at the point of care, prior Acknowledgment
probabilities and actions, and false valida-
tion. Much of physicians’ reasoning is in- This chapter is dedicated to the memory of
ductive with attached probability. Human the late Yogesh C. Patel, who devoted his life
thought is fallible and we cannot appreciate to the advancement of biomedical science.
the fallibility of our thinking unless we draw Through his deeds and words, he inspired us
on understanding of how physicians’ think- to devote ourselves to superior pursuits and
ing processes operate in the real working to aspire to higher scientific standards.
environment.
Cognitive studies are increasingly mov-
ing toward investigations of real-world phe-
nomena. The constraints of laboratory-based References
work prevent capturing the dynamics of
real-world problems. This problem is partic- Anderson, J. R., Reder, L. M., & Simon, H. A.
ularly salient in high-velocity critical care en- (1 996). Situated learning and education. Edu-
vironments. In the best-case scenarios, this is cational Researcher, 2 5 (4), 5 –1 1 .
creating the potential for great synergy be- Arocha, J. F., & Patel, V. L. (1 995 ). Construction-
tween laboratory-based research and cogni- integration theory and clinical reasoning.
tive studies in the “wild.” As discussed in this In C. A. I. Weaver, C. R. Fletche, & S.
Mannes (Eds.), Discourse Comprehension: Es-
chapter, studies of thinking and reasoning
says in Honor of Walter Kintsch (pp. 3 5 9–3 81 ).
in medicine, including a focus on medical Hillsdale, NJ: Erlbaum.
errors and technology-mediated cognition,
Blois, M. S. (1 990). Medicine and the nature
are increasingly paying attention to dimen- of vertical reasoning. New England Journal of
sions of medical work in clinical settings. Medicine, 3 1 8, 847–85 1 .
Bobrow, D. G. (Ed.). (1 985 ). Qualitative Reason- Dowie, J., & Elstein, A. S. (Eds.). (1 988). Profes-
ing about Physical Systems (1 st MIT Press ed.). sional Judgment: A Reader in Clinical Decision
Cambridge, MA: MIT Press. Making. Cambridge, UK: Cambridge Univer-
Boshuizen, H. P. A., & Schmidt, H. G. (1 992). sity Press.
On the role of biomedical knowledge in clin- Elstein, A. S., Kleinmuntz, B., Rabinowitz, M.,
ical reasoning by experts, intermediates, and McAuley, R., Murakami, J., et al. (1 993 ). Di-
novices. Cognitive Science, 1 6(2), 1 5 3 –1 84. agnostic reasoning of high- and low-domain-
Brooks, L. R., Norman, G. R., & Allen, S. W. knowledge clinicians: A reanalysis. Medical De-
(1 991 ). Role of specific similarity in a med- cision Making, 1 3 (1 ), 21 –29.
ical diagnostic task. Journal of Experimental Elstein, A. S., Shulman, L. S., & Sprafka, S. A.
Psychology: General, 1 2 0(3 ), 278–287. (1 978). Medical Problem Solving: An Analysis
Chambers, D., & Reisberg, D. (1 985 ). Can mental of Clinical Reasoning. Cambridge, MA: Harvard
images be ambiguous? Journal of Experimental University Press.
Psychology: Human Perception and Performance, Ericsson, K. A., & Simon, H. A. (1 993 ). Protocol
1 1 (3 ), 3 1 7–3 28. Analysis: Verbal Reports as Data (Revised ed.).
Chandrasekaran, B. (1 994). The functional rep- Cambridge, MA: MIT Press.
resentation and causal process. In M. Yovitz Ericsson, K. A., & Smith, J. (1 991 ). Toward a Gen-
(Ed.), Advances in Computing. New York: Aca- eral Theory of Expertise: Prospects and Limits.
demic Press. New York: Cambridge University Press.
Chandrasekaran, B., Smith, J. W., & Sticklen, J. Eshelman, L. (1 988). MOLE: A knowledge ac-
(1 989). Deep models and their relation to di- quisition tool for Cover-and-Differentiatiate
agnosis. Artificial Intelligence in Medicine, 1 , 29– systems. In S. C. Marcus (Ed.), Automat-
40. ing Knowledge Acquistion for Expert Systems
Chapman, G. B., & Elstein, A. S. (2000). Cog- (pp. 3 7–80). Boston: Kluwer Academic.
nitive processes and biases in medical decision Feltovich, P. J., Johnson, P. E., Moller, J. H., &
making. In G. B. Chapman & A. Frank (Eds.), Swanson, D. B. (1 984). LCS: The role and
Decision Making in Health Care: Theory, Psy- development of medical knowledge in diag-
chology, and Applications (pp. 1 83 –21 0). Cam- nostic expertise. In W. J. Clancey, & E. H.
bridge, UK: Cambridge University Press. Shortliffe (Eds.), Readings in Medical Artifi-
Chi, M. T. H., Bassok, M., Lewis, M. W., Reiman, cial Intelligence: The First Decade (pp. 275 –3 1 9).
P., & Glaser, R. (1 989). Self explanations: How Reading, MA: Addison-Wesley.
students study and use examples in learning to Flexner, A. (1 91 0). Medical Education in the
solve problems. Cognitive Science, 1 3 , 1 45 –1 82. United States and Canada. A Report to the
Chi, M. T. H., & Glaser, R. (1 981 ). Categoriza- Carnegie Foundation for the Advancement of
tion and representation of physics problems by Teaching. Boston: Updyke.
experts and novices. Cognitive Science, 5 , 1 21 – Fordyce, J., Blank, F. S., Pekow, P., Smithline,
1 5 2. H. A., Ritter, G., Gehlbach, S., et al. (2003 ).
Clancey, W. J. (1 985 ). Heuristic classification. Ar- Errors in a busy emergency department. An-
tificial Intelligence, 2 7, 289–3 5 0. nals of Emergency Medicine, 42 (3 ), 3 24–3 3 3 .
Clancey, W. J., & Letsinger, R. (1 984). Frensch, P. A., & Funke, J. (1 995 ). Complex Prob-
NEOMYCIN: Reconfiguring a rule-based lem Solving: The European Perspective. Hillsdale,
expert system for application to teaching. NJ: Erlbaum.
In W. J. Clancey, & E. H. Shortliffe (Eds.), Frijda, N. H., & Elshout, J. J. (1 979). Problem
Readings in medical artificial intelligence: The solving and thinking. In J. A. Michon, E. G. J.
first decade (pp. 3 61 –81 ). Reading, MA: Eijkman, & L. F. W. De Klerk (Eds.), Handbook
Addison-Wesley. of Psychonomics (Vol. 2). Amsterdam: North-
Clancey, W. J., & Shortliffe, E. H. (1 984). Read- Holland.
ings in medical artificial intelligence: The first Gaba, D. M. (1 992). Dynamic decision-making
decade. Reading, MA: Addison-Wesley. in anesthesiology: Cognitive models and train-
Coiera, E. (2000). When conversation is bet- ing approaches. In D. A. Evans, & V. L. Patel
ter than computation. Journal of the Ameri- (Eds.), Advanced Models of Cognition for Med-
can Medical Informatics Association, 7(3 ), 277– ical Training and Practice (pp. 1 23 –1 47). New
286. York: Springer-Verlag.
Gentner, D., & Holyoak, K. J. (1 997). Reasoning Kuipers, B. (1 987). Qualitative simulation as
and learning by analogy: Introduction. Ameri- causal explanation. IEEE Transactions on Sys-
can Psychologist, 5 2 (1 ), 3 2–3 4. tems, Man, and Cybernetics, 1 7, 43 2–444.
Gorowitz, S., & McIntyre, A. (1 978). Toward a Kuipers, B., & Kassirer, J. P. (1 984). Causal reason-
theory of medical fallibility. Journal of Medicine ing in medicine: Analysis of a protocol. Cogni-
and Philosophy, 1 , 5 1 –71 . tive Science, 8(4), 3 63 –3 85 .
Gorry, G. A. (1 973 ). Computer-assisted clini- Larkin, J. H., McDermott, J., Simon, H. A., &
cal decision-making. Methods of Information in Simon, D. P. (1 980). Expert and novice per-
Medicine, 1 2 (1 ), 45 –5 1 . formances in solving physics problems. Science,
Hardiman, P. T., Dufresne, R., & Mestre, J. P. 2 08, 1 3 3 5 –42.
(1 989). The relation between problem cate- Larkin, J., & Simon, H. A. (1 987). Why a diagram
gorization and problem solving among experts is (sometimes) worth ten thousand words. Cog-
and novices. Memory and Cognition, 1 7(5 ), nitive Science, 1 1 (1 ), 65 –99.
627–63 8. Leaper, D. J., Horrocks, J. C., Staniland, J. R., &
Hastie, R. (2001 ). Problems for judgment and de- De Dombal, F. T. (1 972). Computer-assisted
cision making. Annual Review of Psychology, 5 2 , diagnosis of abdominal pain using “estimates”
65 3 –683 . provided by clinicians. British Medical Journal,
4(83 6), 3 5 0–3 5 4.
Hoffman, R. R., & Deffenbacher, K. A. (1 993 ).
An analysis of the relations of basic and applied Ledley, R. S., & Lusted, L. B. (1 95 9). Reasoning
science. Ecological Psychology, 5 , 3 1 5 –5 2. foundations of medical diagnosis. Science, 1 3 0,
9–21 .
Holland, J. H. (1 986). Induction : Processes of Infer-
ence, Learning, and Discovery. Cambridge, MA: Leprohon, J., & Patel, V. L. (1 995 ). Decision-
MIT Press. making strategies for telephone triage in emer-
gency medical services. Medical Decision Mak-
Holyoak, K. J. (1 985 ). The pragmatics of analogi-
ing: An International Journal of the Society of
cal transfer. The Psychology of Learning and Mo-
Medical Decision Making, 1 5 (3 ), 240–25 3 .
tivation, 1 9, 5 9–87.
Lin, L., Isla, R., Doniz, K., Harkness, H., Vicente,
Holyoak, K. J., & Thagard, P. (1 997). The ana-
K. J., & Doyle, D. J. (1 998). Applying human
logical mind. American Psychologist, 5 2 (1 ), 3 5 –
factors to the design of medical equipment:
44. Patient-controlled analgesia. Journal of Clinical
Joseph, G. M., & Patel, V. L. (1 990). Domain Monitoring, and Computing, 1 4(4), 25 3 –263 .
knowledge and hypothesis generation in di- Miller, R. A., Pople, H. E., & Myers, J. D.
agnostic reasoning. Medical Decision Making, (1 984). Internist-I, an experimental computer-
1 0(1 ), 3 1 –46. based diagnostic for general internal medicine.
Kassirer, J. P. (1 989). Diagnostic reasoning. An- In W. J. Clancey, & E. H. Shortliffe (Eds.),
nals of Internal Medicine, 1 1 0(1 1 ), 893 –900. Readings in Medical Artificial Intelligence: The
Kassirer, J. P., & Kopelman, R. I. (1 990). Diagno- First Decade (pp. xvi, 5 1 2 p). Reading, MA:
sis and the structure of memory. 2. Exemplars, Addison-Wesley.
scripts, and simulation. Hospital Practice (Office Newell, A. (1 990). Unified Theories of Cognition.
Edition), 2 5 (1 1 ), 29–3 3 , 3 6. Cambridge, MA: Harvard University Press.
Klein, G. A. (1 993 ). Decision Making in Action: Newell, A., & Simon, H. A. (1 972). Human Prob-
Models and Methods. Norwood, NJ: Ablex. lem Solving. Englewood Cliffs, NJ: Prentice-
Kleinmuntz, D. N., & Schkade, D. A. (1 993 ). In- Hall.
formation displays and decision processes. Psy- Norman, D. A. (1 986). Cognitive engineering.
chological Science, 4(4), 221 –227. In D. A. Norman, & S. W. Draper (Eds.),
Kohn, L. T., Corrigan, J., & Donaldson, M. S. User Centered System Design: New Perspectives
(1 999). To Err Is Human: Building a Safer on Human-computer Interaction (pp. 3 1 –61 ).
Health System. Washington, DC: National Hillsdale, NJ: Erlbaum.
Academy Press. Norman, D. A. (1 988). The Psychology of Everyday
Kuhn, D. (1 995 ). Scientific thinking and knowl- Things. New York: Basic Books.
edge acquisition. Monographs of the Society Norman, G. R., & Brooks, L. R. (1 997). The non-
for Research in Child Development, 60(4), analytical basis of clinical reasoning. Advances
1 5 2–1 5 7. in Health Sciences Education, 2 (2), 1 73 –1 84.
Norman, G. R., Brooks, L. R., & Allen, S. W. Patel, V. L., Groen, G. J., & Norman, G. R. (1 993 ).
(1 989). Recall by expert medical practitioners Reasoning and instruction in medical curricula.
and novices as a record of processing atten- Cognition and Instruction, 1 0(4), 3 3 5 –3 78.
tion. Journal of Experimental Psychology: Learn- Patel, V. L., & Kaufman, D. R. (1 995 ). Clinical
ing, Memory, and Cognition, 1 5 (6), 1 1 66–1 1 74. reasoning and biomedical knowledge: Impli-
Osler, W. (1 906). Aequanimitas. With other ad- cations for teaching. In J. Higgs, & M. Jones
dresses to medical students, nurses and practition- (Eds.), Clinical Reasoning in the Health Profes-
ers of medicine. Philadelphia: Blakiston’s Son & sions (pp. 1 1 7–1 28). Oxford, UK: Butterworth
Co. Heinemenn.
Patel, V. L., Arocha, & Kaufman, D. R. (1 994). Patel, V. L., & Kaufman, D. R. (2001 , Feb 02).
Diagnostic reasoning and expertise. Psychology Medical education isn’t just about solving
of Learning and Motivation, 3 1 , 1 3 7–25 2. problems. The Chronicle of Higher Education,
Patel, V. L., Kaufman, D. R., & Magder, S. A. B1 2.
(1 996). The acquisition of medical expertise in Patel, V. L., Kaufman, D. R., Allen, V. G., Short-
complex dynamic environments. In Ericsson, liffe, E. H., Cimino, J. J., & Greenes, R. A.
K. A. (Ed.), The road to excellence: The acquisi- (1 999). Toward a framework for computer-
tion of expert performance in the arts and sciences, mediated collaborative design in medical in-
sports and games. (pp. 1 26–65 ). Hillsdale, NJ: formatics. Methods of Information in Medicine,
Erlbaum. 3 8(3 ), 1 5 8–1 76.
Patel, V. L., & Arocha, J. F. (2001 ). The nature Patel, V. L., Kaufman, D. R., & Arocha, J. F.
of constraints on collaborative decision mak- (2002). Emerging paradigms of cognition in
ing in health care settings. In E. Salas, & G. medical decision-making. Journal of Biomedical
Klein (Eds.), Linking Expertise and Naturalistic Informatics, 3 5 , 5 2–75 .
Decision Making (pp. 3 83 –405 ). Mahwah, NJ: Patel, V. L., Kaufman, D. R., & Magder, S. A.
Erlbaum. (1 991 ). Causal reasoning about complex phys-
Patel, V. L., Arocha, J. F., & Kaufman, D. R. iological concepts in cardiovascular physiology
(2001 ). A primer on aspects of cognition for by medical students. International Journal of Sci-
medical informatics. Journal of the American ence Education, 1 3 , 1 71 –1 85 .
Medical Informatics Association, 8(4), 3 24–3 43 . Patel, V. L., Kushniruk, A. W., Yang, S., & Yale,
Patel, V. L. Arocha, J., & Lecissi, M. (2001 ). J. F. (2000). Impact of a computer-based pa-
Impact of undergraduate medical training on tient record system on data collection, knowl-
housestaff problem solving performance, im- edge organization, and reasoning. Journal of the
plications for health education in problem- American Medical Informatics Association, 7(6),
based curricula. Journal of Dental Education. 5 69–5 85 .
65 (1 1 ) 1 1 99–1 21 8. Patel, V. L., & Ramoni, M. F. (1 997). Cogni-
Patel, V. L., Evans, D. A., & Kaufman, D. R. tive models of directional inference in expert
(1 989). Cognitive framework for doctor- medical reasoning. In P. J. Feltovich, & K. M.
patient communication. In D. A. Evans, & Ford (Eds.), Expertise in Context: Human and
V. L. Patel (Eds.), Cognitive Science in Medicine: Machine (pp. 67–99): Cambridge, MA: MIT
Biomedical Modeling (pp. 25 7–3 1 2). Cam- Press.
bridge, MA: MIT Press. Patil, R. S., Szolovits, P., & Schwartz, W. B.
Patel, V. L., & Groen, G. J. (1 986). Knowledge- (1 985 ). Causal understanding of patient ill-
based solution strategies in medical reasoning. ness in medical diagnosis. In J. A. Reggia, & S.
Cognitive Science, 1 0, 91 –1 1 6. Tuhrim (Eds.), Computer-assisted Medical Deci-
Patel, V. L., & Groen, G. J. (1 991 ). The general sion Making (Vol. 2, pp. 272–292). New York:
and specific nature of medical expertise: A crit- Springer-Verlag.
ical look. In K. A. Ericsson, & J. Smith (Eds.), Peirce, C. S. (1 95 5 ). Abduction and induction.
Toward a General Theory of Expertise: Prospects In C. S. Peirce, & J. Buchler (Eds.), Philosophi-
and Limits (pp. 93 –1 25 ). New York: Cambridge cal Writings of Peirce (pp. 1 5 0–1 5 6). New York:
University Press. Dover.
Patel, V. L., Groen, G. J., & Arocha, J. F. (1 990). Perkins, D. N., & Simmons, R. (1 988). An inte-
Medical expertise as a function of task diffi- grative model of misconceptions. Review of Ed-
culty. Memory and Cognition, 1 8(4), 3 94–406. ucational Research, 5 8, 3 03 –3 26.
Ramoni, M. F., Stefanelli, M., Magnani, L., & Stefanelli, M., & Ramoni, M. F. (1 992). Episte-
Barosi, G. (1 992). An epistemological frame- mological constraints on medical knowledge-
work for medical knowledge based system. based systems. In D. A. Evans, & V. L. Patel
IEEE Transactions on Systems, Man, and Cyber- (Eds.), Advanced Models of Cognition for Med-
netics, 2 2 , 1 3 61 –1 3 75 . ical Training and Practice (Vol. 97, pp. 3 –20).
Rasmussen, J., Pejtersen, A. M., & Goodstein, Heidelberg: Springer-Verlag.
L. P. (1 994). Cognitive Systems Engineering. Stenning, K., & Oberlander, J. (1 994). A cognitive
New York: Wiley. theory of graphical and linguistic reasoning:
Reason, J. T. (1 990). Human Error. Cambridge, Logic and implementation. Cognitive Science,
UK: Cambridge University Press. 1 9, 97–1 40.
Reisberg, D. (1 987). External representations and Strauss, S., & Stavy, R. (1 982). U-shaped
the advantages of externalizing one’s thoughts. Behavioral Growth. New York: Academic
In Proceedings of the Eighth Annual Conference Press.
of the Cognitive Science Society. Hillsdale, NJ: Sweller, J. (1 988). Cognitive load during problem
Erlbaum. solving: Effects on learning. Cognitive Science,
Rimoldi, H. J. A. (1 961 ). The test of diagnos- 1 2 , 25 7–85 .
tic skills. Journal of Medical Education, 3 6, 73 – Szolovits, P. (Ed.) (1 982). Artificial Intelligence in
79. Medicine (Vol. 5 1 ). Boulder, CO: Published by
Rogoff, B., & Lave, J. (1 984). Everyday Cognition: Westview Press for the American Association
Its Development in Social Context. Cambridge, for the Advancement of Science.
MA: Harvard University Press. Thibodeau, P., Hardiman, P. T., Dufresne, R.,
Salas, E., & Klein, G. A. (2001 ). Linking Expertise & Mestre, J. P. (1 989). The relation between
and Naturalistic Decision Making. Mahwah, NJ: problem categorization and problem-solving
Erlbaum. among experts and novices. Memory and Cog-
Salomon, G., & Perkins, D. N. (1 989). Rocky nition, 1 7, 627–3 8.
roads to transfer: Rethinking mechanisms of a Tversky, A., & Kahneman, D. (1 974). Judgment
neglected phenomenon. Educational Psycholo- under uncertainty: Heuristics and biases. Sci-
gist, 2 4, 1 1 3 –1 42. ence, 1 85 (41 5 7), 1 1 24–1 1 3 1 .
Salomon, G., Perkins, D. N., & Globerson, T. Vicente, K. J., & Rasmussen, J. (1 990). The ecol-
(1 991 ). Partners in cognition: Extending hu- ogy of human-machine systems. II: Mediating
man intelligence with intelligent technologies. “direct perception” in complex work domains.
Educational Researcher, 2 0(3 ), 2–9. Ecological Psychology, 2 , 207–25 0.
Schaffner, K. F. (1 986). Exemplar reasoning White, B. Y., & Frederiksen, J. R. (1 990). Causal
about biological models and diseases: A rela- model progressions as a foundation for intelli-
tion between the philosophy of medicine and gent learning environments. In W. J. Clancey
philosophy of science. Journal of Medicine and & E. Soloway (Eds.), Artificial Intelligence and
Philosophy, 1 1 , 63 –80. Learning Environments, Special Issues of Ar-
Schauble, L. (1 996). The development of tificial Intelligence: An International Journal
scientific reasoning in knowledge-rich con- (pp. 99–1 5 7). Cambridge, MA: MIT Press.
texts. Developmental Psychology, 3 2 (1 ), 1 02– Woods, D. D. (1 993 ). Process-tracing methods for
1 1 9. the study of cognition outside of the experi-
Schmidt, H. G., & Boshuizen, H. P. A. (1 993 ). mental psychology laboratory. In G. A. Klein,
On the origin of intermediate effects in clinical & J. Orasanu (Eds.), Decision Making in Action:
case recall. Memory and Cognition, 2 1 , 3 3 8– Models and Methods (pp. 228–25 1 ). Norwood,
351. NJ: Ablex.
Shortliffe, E. H. (1 976). Computer-based Med- Woods, D. D., & Cook, R. (1 998). A tale of
ical Consultations, MYCIN. New York: two stories, contrasting views of patient safety.
Elsevier. National Health Care Safety Council of the
Simon, D. P., & Simon, H. A. (1 978). Individ- National Patient Safety Foundation at the
ual differences in solving physics problems. AMA.
In R. Siegler (Ed.), Children’s Thinking: What Zeng, Q., Cimino, J. J., & Zou, K. H. (2002). Pro-
Develops? Hillsdale, NJ: Erlbaum. viding concept-oriented views for clinical data
using a knowledge-based system: An evalua- Zhang, J., & Norman, D. A. (1 994). Represen-
tion. Journal of the American Medical Informat- tations in distributed cognitive tasks. Cognitive
ics Association, 9(3 ), 294–3 05 . Science, 1 8, 87–1 22.
Zhang, J. (1 996). A representational analysis of Zhang, J., & Norman, D. A. (1 995 ). A representa-
relational information displays. International tional analysis of numeration systems. Cognition,
Journal of Human-Computer Studies, 45 (1 ), 5 9– 5 7, 271 –295 .
74. Zhang J., Patel VL, Johnson TR, Shortliffe EH
Zhang, J. (1 997). The nature of external represen- (2004). A cognitive taxonomy of medical er-
tations in problem solving. Cognitive Science, rors. Journal of Biomedical Informatics, 3 7, 1 93 –
2 1 (2), 1 79–21 7. 204.
CHAPTER 3 1
Intelligence
Robert J. Sternberg
What is intelligence? This chapter discusses learn from experience and the ability
the nature of intelligence and related is- to adapt to the surrounding environ-
sues. The chapter is divided into several ma- ment. Sixty-five years later, Sternberg and
jor parts: The first discusses people’s con- Detterman (1 986) asked twenty-four cog-
ceptions of intelligence, also referred to as nitive psychologists with expertise in intel-
implicit theories of intelligence; the second ligence research the same question. They,
presents a brief discussion of intelligence too, underscored the importance of learning
testing; the third offers a review of major ap- from experience and adapting to the envi-
proaches to understanding intelligence; the ronment. They also broadened the definition
fourth discusses how intelligence can be im- to emphasize the importance of metacog-
proved; and the last part briefly draws some nition – people’s understanding and control
conclusions. The chapter does not discuss ar- of their own thinking processes. Contempo-
tificial intelligence and computer simulation rary experts also more heavily emphasized
(see Lovett & Anderson, Chap. 1 7), neural the role of culture, pointing out that what
networks, or parallel distributed processing is considered intelligent in one culture may
(see Doumas & Hummel, Chap. 4). be considered stupid in another (Serpell,
2000). Intelligence, then, is the capacity
to learn from experience, using metacogni-
tive processes to enhance learning, and the
Implicit Theories of Intelligence ability to adapt to the surrounding environ-
ment, which may require different adap-
What do people believe intelligence to be? tations within different social and cultural
In 1 921 , when the editors of the Jour- contexts.
nal of Educational Psychology asked 1 4 fa- According to the Oxford English Dictio-
mous psychologists that question, the re- nary, the word intelligence entered our lan-
sponses varied but generally embraced two guage in about the twelfth century. Today,
themes: Intelligence involves the capacity to we can look up intelligence in numerous
751
dictionaries, but most of us still have our & Salovey, 1 997; Mayer, Salovey, & Caruso,
own implicit (unstated) ideas about what it 2000; Salovey & Sluyter, 1 997), although
means to be smart; that is, we have our own the evidence is mixed (Davies, Stankov, &
implicit theories of intelligence. We use our Roberts, 1 998).
implicit theories in many social situations, A related concept is that of social intel-
such as when we meet people or when we ligence, the ability to understand and inter-
describe people we know as being very smart act with other people (Kihlstrom & Cantor,
or not so smart. 2000). Research also shows that person-
Within our implicit theories of intelli- ality variables are related to intelligence
gence, we also recognize that it has differ- (Ackerman, 1 996).
ent meanings in different contexts. A smart Explicit definitions of intelligence fre-
salesperson may show a different kind of in- quently take on an assessment-oriented fo-
telligence than a smart neurosurgeon or a cus. In fact, some psychologists, such as Ed-
smart accountant, each of whom may show win Boring (1 923 ), have defined intelligence
a different kind of intelligence than a smart as whatever it is that the tests measure.
choreographer, composer, athlete, or sculp- This definition, unfortunately, is circular
tor (see Sternberg et al., Chap. 1 5 , for a and, moreover, what different tests of intelli-
discussion of the related concept of creativ- gence measure is not always the same. Differ-
ity). We often, use our implicit and context- ent tests measure somewhat different con-
relevant definitions of intelligence to make structs (Daniel, 1 997, 2000; Embretson &
assessments of intelligence. Is your mechanic McCollam, 2000; Kaufman, 2000; Kaufman
smart enough to find and fix the problem in & Lichtenberger, 1 998), so it is not feasible
your car? Is your physician smart enough to to define intelligence by what tests test, as
find and treat your health problem? Is this though they all measured the same thing. Al-
attractive person smart enough to hold your though most cognitive psychologists do not
interest in a conversation? go to that extreme, the tradition of attempt-
Western notions about intelligence are ing to understand intelligence by measuring
not always shared by other cultures (Stern- various aspects of it has a long history (Brody,
berg & Kaufman, 1 998). For example the 2000).
Western emphasis on speed of mental pro-
cessing (Sternberg et al., 1 981 ) is not shared
in many cultures. Other cultures may even
Intelligence Testing
be suspicious of the quality of work that
is done very quickly. Indeed, other cultures
History
emphasize depth rather than speed of pro-
cessing. Even in the West, some prominent Contemporary measurements of intelli-
theorists have pointed out the importance gence usually can be traced to one of two
of depth of processing for full command of very different historical traditions. One tra-
material (e.g., Craik & Lockhart, 1 972). dition concentrated on lower level, psy-
Even within the United States, many peo- chophysical abilities (such as sensory acuity,
ple have started viewing as important not physical strength, and motor coordination);
only the cognitive aspects but also the emo- the other focused on higher level, judgment
tional aspects of intelligence. Mayer, Salovey, abilities (which we traditionally describe as
and Caruso (2000, p. 3 96) defined emo- related to thinking).
tional intelligence as “the ability to perceive Francis Galton (1 822 –1 91 1 ) believed that
and express emotion, assimilate emotion in intelligence was a function of psychophysical
thought, understand and reason with emo- abilities and, for several years, Galton main-
tion, and regulate emotion in the self and tained a well-equipped laboratory where vis-
others.” There is good evidence for the exis- itors could have themselves measured on a
tence of some kind of emotional intelligence variety of psychophysical tests. These tests
(Ciarrochi, Forgas, & Mayer, 2001 ; Mayer measured a broad range of psychophysical
intelligence 753
skills and sensitivities, such as weight dis- widely used, as are the competitive Wechsler
crimination (the ability to notice small dif- scales. The Wechsler tests yield three scores –
ferences in the weights of objects), pitch a verbal score, a performance score, and an
sensitivity (the ability to hear small differ- overall score. The verbal score is based on
ences between musical notes), and phys- tests such as vocabulary and verbal simi-
ical strength (Galton, 1 883 ). One of the larities in which the test-taker has to say
many enthusiastic followers of Galton, Clark how two things are similar. The performance
Wissler (1 901 ), attempted to detect links score is based on tests such as picture com-
among the assorted tests, which would unify pletion, which requires identification of a
the various dimensions of psychophysically missing part in a picture of an object; and
based intelligence. Much to Wissler’s dis- picture arrangement, which requires rear-
may, no unifying association could be de- rangement of a scrambled set of cartoon-like
tected. Moreover, the psychophysical tests pictures into an order that tells a coherent
did not predict college grades. The psy- story. The overall score is a combination of
chophysical approach to assessing intelli- the verbal and performance scores.
gence soon faded almost into oblivion, al- Although Wechsler clearly believed in the
though it would reappear many years later. worth of attempting to measure intelligence,
An alternative to the psychophysical he did not limit his conception of intelli-
approach was developed by Alfred Bi- gence to test scores. Wechsler believed that
net (1 85 7–1 91 1 ). He and his collaborator, intelligence is not represented just by a test
Theodore Simon, also attempted to assess score or even by what we do in school. We
intelligence, but their goal was much more use our intelligence not just in taking tests
practical. Binet had been asked to devise a and in doing homework, but also in relating
procedure to distinguish normal from men- to people, in performing our jobs effectively,
tally retarded learners in an academic setting and in managing our lives in general.
(Binet & Simon, 1 91 6). In Binet’s view, judg-
ment not psychophysical acuity, strength, or
skill, is the key to intelligence. For Binet
Approaches to Intelligence
(Binet & Simon, 1 91 6), intelligent thought –
mental judgment – comprises three distinct
Psychometric Approaches to Intelligence
elements: direction, adaptation, and criti-
cism. The importance of direction and adap- Psychologists interested in the structure of
tation certainly fits with contemporary views intelligence have relied on factor analysis as
of intelligence, and Binet’s notion of criti- an indispensable tool for their research. Fac-
cism actually seems prescient, considering tor analysis is a statistical method for sepa-
the current appreciation of metacognitive rating a construct – intelligence in this case –
processes as a key aspect of intelligence. Bi- into a number of hypothetical factors or abil-
net viewed intelligence as a broad potpourri ities the researchers believe to form the basis
of cognitive and other abilities and as highly of individual differences in test performance.
modifiable. The specific factors derived, of course, still
depend on the specific questions being asked
and the tasks being evaluated.
Major Intelligence Scales
Factor analysis is based on studies of cor-
Lewis Terman of Stanford University built relation. The idea is that the more highly
on Binet and Simon’s work in Europe and two tests are correlated the more likely they
constructed the earliest version of what has are to measure the same thing. In research
come to be called the Stanford–Binet In- on intelligence, a factor analysis might in-
telligence Scales (Terman & Merrill, 1 93 7, volve these steps: (1 ) Give a large number
1 973 ; Thorndike, Hagen, & Sattler, 1 986). of people several different tests of ability.
For years, the Stanford-Binet test was the (2) Determine the correlations among all
standard for intelligence tests, and it is still those tests. (3 ) Statistically analyze those
correlations to simplify them into a relatively ties: verbal comprehension, measured by vo-
small number of factors that summarize peo- cabulary tests; verbal fluency, measured by
ple’s performance on the tests. The investi- time-limited tests requiring the test-taker to
gators in this area have generally agreed on think of as many words as possible that be-
and followed this procedure, yet the result- gin with a given letter; inductive reason-
ing factorial structures of intelligence have ing, measured by tests such as analogies
differed among theorists such as Spearman, and number-series completion tasks; spatial
Thurstone, Guilford, Cattell, Vernon, and visualization, measured by tests requiring
Carroll. mental rotation of pictures of objects, num-
ber, measured by computation and simple
spearman: theory of g mathematical problem-solving tests; mem-
ory, measured by picture and word-recall
Charles Spearman is usually credited with
tests; and perceptual speed, measured by
inventing factor analysis (Spearman, 1 927).
tests that require the test-taker to recognize
Using factor-analytic studies, Spearman con-
small differences in pictures or to cross out
cluded that intelligence can be understood
a “each time it appear in a string” of varied
in terms of both a single general factor that
letters.
pervades performance on all tests of mental
ability and a set of specific factors, each of
which is involved in performance on only a guilford: the structure of intellect
single type of mental-ability test (e.g., arith- At the opposite extreme from Spearman’s
metic computations). In Spearman’s view, single g-factor model is J. P. Guilford’s (1 967,
the specific factors are of only casual inter- 1 982, 1 988) structure-of-intellect model,
est because of their narrow applicability. To which includes up to 1 5 0 factors of the mind
Spearman, the general factor, which he la- in one version of the theory. According to
beled “g,” provides the key to understand- Guilford, intelligence can be understood in
ing intelligence. Spearman believed g to be terms of a cube that represents the intersec-
attributable to “mental energy.” Many psy- tion of three dimensions – operations, con-
chologists still believe Spearman’s theory to tents, and products. Operations are simply
be essentially correct (e.g., Jensen, 1 998; see mental processes, such as memory and evalu-
essays in Sternberg & Grigorenko, 2002). ation (making judgments, such as determin-
The theory is useful in part because g ac- ing whether a particular statement is a fact
counts for a sizable, although not fixed, per- or opinion). Contents are the kinds of terms
centage of variance in school and job per- that appear in a problem, such as seman-
formance, usually somewhere between 5 % tic (words) and visual (pictures). Products
and 40% (Jensen, 1 998). Spearman (1 923 ) are the kinds of responses required, such as
provided a cognitive theory of intelligence. units (single words, numbers, or pictures),
He suggested that intelligence comprises classes (hierarchies), and implications. Thus,
apprehension of experience (encoding of Guilford’s theory, like Spearman’s, had an
stimuli), eduction of relations (inference of explicit cognitive component.
relations), and eduction of correlates (appli-
cation of what is learned). He therefore may
cattell, vernon, and carroll:
have been the earliest serious cognitive the- hierarchical models
orist of intelligence.
A more parsimonious way of handling a
number of factors of the mind is through
thurstone: primary mental abilities a hierarchical model of intelligence. One
In contrast to Spearman, Louis Thurstone such model, developed by Raymond Cat-
(1 887–1 95 5 ) concluded (Thurstone, 1 93 8) tell (1 971 ), proposed that general intelli-
that the core of intelligence resides not in gence comprises two major subfactors – fluid
one single factor but in seven such factors, ability (speed and accuracy of abstract rea-
which he referred to as primary mental abili- soning, especially for novel problems) and
intelligence 755
crystallized ability (accumulated knowledge study intelligence differ primarily in terms

and vocabulary). Subsumed within these of the complexity of the processes being
two major subfactors are other, more spe- studied. Among the advocates of this ap-
cific factors. A similar view was proposed by proach have been Ted Nettelbeck, Arthur
Philip E. Vernon (1 971), who made a gen- Jensen, Earl Hunt, Herbert Simon, and my-
eral division between practical-mechanical self. Each of these researchers has considered
and verbal-educational abilities. both the speed and the accuracy of infor-
More recently, John B. Carroll (1 993 ) pro- mation processing to be important factors
posed a hierarchical model of intelligence in intelligence. In addition to speed and ac-
based on his analysis of more than 460 data curacy of processing, Hunt considered ver-
sets obtained between 1 927 and 1 987. His bal versus spatial skill, as well as attentio-
analysis encompasses more than 1 3 0,000 nal ability.
people from diverse walks of life and even
countries of origin (although non–English- inspection time
speaking countries are poorly represented
Nettelbeck (e.g., 1 987; Nettelbeck & Lally,
among his data sets). The model Carroll pro-
1 976; Nettelbeck & Rabbitt, 1 992; see also
posed, based on his monumental undertak-
Deary, 2000, 2002; Deary & Stough, 1 996)
ing, is a hierarchy comprising three strata –
suggested a speed-related indicator of intelli-
Stratum I, which includes many narrow,
gence involving the encoding of visual infor-
specific abilities (e.g., spelling ability, speed
mation for brief storage in working memory.
of reasoning); Stratum II, which includes
But what is critical in this view is not speed
various broad abilities (e.g., fluid intelli-
of response but rather the length of time
gence, crystallized intelligence); and Stra-
a stimulus must be presented for the sub-
tum III, a single general intelligence, much
ject to be able to process that stimulus. The
like Spearman’s g.
shorter the presentation length, the higher
In addition to fluid intelligence and crys-
the score. The key variable is the length of
tallized intelligence, Carroll includes in the
time for the presentation of the target stim-
middle stratum learning and memory pro-
ulus, not the speed of responding by press-
cesses, visual perception, auditory percep-
ing the button. Nettelbeck operationally de-
tion, facile production of ideas (similar to
fined inspection time as the length of time
verbal fluency), and speed (which includes
for presentation of the target stimulus after
both sheer speed of response and speed of
which the participant still responds with at
accurate response). Although Carroll does
least 90% success. Nettelbeck (1 987) found
not break new ground in that many of the
that shorter inspection times correlate with
abilities in his model have been mentioned
higher scores on intelligence tests [e.g., var-
in other theories, he does masterfully inte-
ious subscales of the Wechsler Adult Intel-
grate a large and diverse factor-analytic lit-
ligence Scale (WAIS)] among differing pop-
erature, thereby giving great authority to
ulations of participants. Other investigators
his model. Whereas the factor-analytic ap-
have confirmed this finding (e.g., Deary &
proach has tended to emphasize the struc-
Stough, 1 996).
tures of intelligence, the cognitive approach
has tended to emphasize the operations of
intelligence. choice reaction time
Arthur Jensen (1 979, 1 998, 2002) empha-
sized a different aspect of information-
Cognitive Approaches to Intelligence
processing speed; specifically, he proposed
Cognitive theorists are interested in study- that intelligence can be understood in terms
ing how people (or other organisms; Zentall, of speed of neuronal conduction. In other
2000) mentally represent and process what words, the smart person is someone whose
they learn and know about the world. The neural circuits conduct information rapidly.
ways in which various cognitive investigators When Jensen proposed this notion, direct
measures of neural-conduction velocity tion about words (e.g., letter names) stored
were not readily available, so Jensen primar- in our long-term memories. To measure this
ily studied a proposed proxy for measuring speed, Hunt proposed a letter-matching RT
neural-processing speed – choice reaction task (Posner & Mitchell, 1 967).
time, the time it takes to select one answer For example, suppose that you are one
from among several possibilities. For exam- of Hunt’s participants. You would be shown
ple, suppose that you are one of Jensen’s pairs of letters, such as “A A,” “A a,” or “A b.”
participants. You might be seated in front For each pair, you would be asked to indi-
of a set of lights on a board. When one of cate whether the letters constitute a match
the lights flashed, you would be expected in name (e.g., “A a” match in name of let-
to extinguish it by pressing as rapidly as ter of the alphabet but “A b” do not). You
possible a button beneath the correct light. would also be given a simpler task, in which
The experimenter would then measure your you would be asked to indicate whether the
speed in performing this task. Jensen (1 982) letters match physically (e.g., “A A” are phys-
found that participants with higher intelli- ically identical, whereas “A a” are not). Hunt
gence quotients (IQs) are faster than par- would be particularly interested in discern-
ticipants with lower IQs in their reaction ing the difference between your speed for
time (RT), the time between when a light the first set of tasks, involving name match-
comes on and the finger leaves the home ing, and your speed for the second set, in-
(central) button. In some studies, partici- volving matching of physical characteristics.
pants with higher IQs also showed a faster Hunt would consider the difference in your
movement time, the time between letting reaction time for each task to indicate a mea-
the finger leave the home button and hitting sure of your speed of lexical access. Thus,
the button under the light. Based on such he would subtract from his equation the
tasks, Reed and Jensen (1 991 , 1 993 ) pro- physical-match reaction time. For Hunt, the
pose that their findings may be attributable response time in indicating that “A A” is a
to increased central nerve-conduction veloc- physical match is unimportant. What inter-
ity, although at present this proposal remains ests him is a more complex reaction time –
speculative. that for recognizing names of letters. He and
More recently, researchers have suggested his colleagues have found that students with
that various findings regarding choice RT lower verbal ability take longer to gain ac-
may be influenced by the number of re- cess to lexical information than do students
sponse alternatives and the visual-scanning with higher verbal ability.
requirements of Jensen’s apparatus rather Earl Hunt and Marcy Lansman (1 982)
than being attributable to the speed of RT also studied people’s ability to divide their
alone (Bors, MacLeod, & Forrin, 1 993 ). In attention as a function of intelligence. For
particular, Bors and colleagues found that example, suppose that you are asked to solve
manipulating the number of buttons and the mathematical problems and simultaneously
size of the visual angle of the display could to listen for a tone and press a button as soon
reduce the correlation between IQ and RT. as you hear it. We can expect that you would
Thus, the relation between reaction time and both solve the math problems effectively
intelligence is unclear. and respond quickly to hearing the tone.
According to Hunt and Lansman, one thing
lexical access speed and speed of that makes people more intelligent is that
simultaneous processing they are better able to timeshare between
Like Jensen, Earl Hunt (1 978) suggested two tasks and to perform both effectively.
that intelligence be measured in terms of In sum, process timing theories attempt
speed. However, Hunt has been particu- to account for differences in intelligence
larly interested in verbal intelligence and by appealing to differences in the speed
has focused on lexical-access speed – the of various forms of information process-
speed with which we can retrieve informa- ing; inspection time, choice RT, and lexical
intelligence 757
access timing all have been found to cor- little more than working memory! Dane-
relate with measures of intelligence. These man and Carpenter (1 983 ) had participants
findings suggest that higher intelligence read sets of passages and, after they had read
may be related to the speed of various the passages, try to remember the last word
information-processing abilities, including of each passage. Recall was highly corre-
encoding information more rapidly into lated with verbal ability. Turner and Engle
working memory, accessing information in (1 989) had participants perform a variety
long-term memory more rapidly, and re- of working-memory tasks. In one task, for
sponding more rapidly. example, the participants saw a set of sim-
Why would more rapid encoding, re- ple arithmetic problems, each of which was
trieval, and responding be associated with followed by a word or a digit. An example
higher intelligence test scores? Do rapid would be “Is ((3 × 5 ) − 6 = 7?” TABLE. The
information processors learn more? Other participants saw sets of from two to six such
research on learning in aged persons investi- problems and solved each one. After solving
gated whether there is a link between age- the problems in the set, they tried to recall
related slowing of information processing the words that followed the problems. The
and (1 ) initial encoding and recall of infor- number of words recalled was highly corre-
mation and (2) long-term retention (Nettel- lated with measured intelligence. It there-
beck et al., 1 996; Bors & Forrin, 1 995 ). The fore appears that the ability to store and
findings suggest that the relation between in- manipulate information in working memory
spection time and intelligence may not be may be an important aspect of intelligence,
related to learning. In particular, Nettelbeck although probably not all there is to intelli-
et al. found there is a difference between gence (see Morrison, Chap. 1 9 for discussion
initial recall and actual long-term learning – of working memory and thinking).
whereas initial recall performance is me-
diated by processing speed (older, slower the componential theory and complex
participants showed deficits), longer-term problem solving
retention of new information (preserved in In my early work on intelligence, I (Stern-
older participants) is mediated by cognitive berg, 1 977) began using cognitive ap-
processes other than speed of processing, proaches to study information processing in
including rehearsal strategies. This implies more complex tasks, such as analogies, se-
speed of information processing may influ- ries problems (e.g., completing a numerical
ence initial performance on recall and in- or figural series), and syllogisms (Sternberg,
spection time tasks, but speed is not related 1 977, 1 983 , 1 985 ). The goal was to find out
to long-term learning. Perhaps faster infor- just what made some people more intelli-
mation processing aids participants in per- gent processors of information than others.
formance aspects of intelligence test tasks, The idea was to take the kinds of tasks used
rather than contributing to actual learn- on conventional intelligence tests and to iso-
ing and intelligence (see also Salthouse, late the components of intelligence – the
Chap. 24). Clearly, this area requires more mental processes used in performing these
research to determine how information- tasks, such as translating a sensory input
processing speed relates to intelligence. into a mental representation, transforming
one conceptual representation into another,
or translating a conceptual representation
working memory into a motor output (Sternberg, 1 982). Since
Recent work suggests that a critical compo- then, many people have elaborated upon
nent of intelligence may be working mem- and expanded this basic approach (Lohman,
ory (see Morrison, Chap. 1 9 for a discussion 2000).
of working memory in thinking). Indeed, Componential analysis breaks down peo-
Kyllonen (2002) and Kyllonen and Christal ple’s reaction times and error rates on these
(1 990) have argued that intelligence may be tasks in terms of the processes that make
up the tasks. This kind of analysis revealed rect. Thus, brighter people may take longer
that people may solve analogies and simi- to do something than will less bright peo-
lar tasks by using several component pro- ple when taking more time is advantageous.
cesses including encoding the terms of the For example, the brighter person might
problem, inferring relations among at least spend more time researching and planning
some of the terms, mapping the inferred re- a term paper but less time in actually writ-
lations to other terms that would be pre- ing it. This same differential in time alloca-
sumed to show similar relations, and apply- tion has been shown in other tasks as well
ing the previously inferred relations to the (e.g., in solving physics problems; Larkin et
new situations. al., 1 980; Sternberg, 1 979, 1 985 ); that is,
Consider the analogy, LAWYER : more intelligent people seem to spend more
CLIENT :: DOCTOR : (a. PATIENT b. time planning for and encoding the prob-
MEDICINE). To solve this analogy, you lems they face but less time in the other
need to encode each term of the problem, components of task performance. This may
which includes perceiving a term and re- relate to the previously mentioned metacog-
trieving information about it from memory. nitive attribute many include in their notions
You then infer the relationship between of intelligence. The bottom line, then, is that
lawyer and client – that the former provides intelligence may reside as much in how peo-
professional services to the latter. You ple allocate time as it does in the amount of
then map the relationship in the first half time it takes them to perform cognitive tasks.
of the analogy to the second half of the In a similarly cognitive approach, Simon
analogy, noting that it will involve that same studied the information processing of people
relationship. Finally, you apply that inferred engaged in complex problem-solving situa-
relationship to generate the final term of tions, such as when playing chess and per-
the analogy, leading to the appropriate forming logical derivations (Newell & Si-
response of PATIENT. Studying these common, 1 972; Simon, 1 976). A simple, brief
ponents of information processing reveals task might require the participant to view an
more than measuring mental speed alone arithmetic or geometric series, figure out the
(see Holyoak, Chapter 6, for a detailed rule underlying the progression, and guess
discussion of analogical reasoning). what numeral or geometric figure might
When measuring speed alone, I found sig- come next; for example, more complex tasks
nificant correlations between speed in exe- might include some problem-solving tasks
cuting these processes and performance on (e.g., the water jugs problems; see Estes,
other traditional intelligence tests. However, 1 982). These problems were similar or iden-
a more intriguing discovery is that partici- tical to those used on intelligence tests.
pants who score higher on traditional intel-
ligence tests take longer to encode the terms
Biological Approaches to Intelligence
of the problem than do less intelligent par-
ticipants, but they make up for the extra Although the human brain is clearly the
time by taking less time to perform the re- organ responsible for human intelligence,
maining components of the task. In general, early studies (e.g., those by Karl Lashley
more intelligent participants take longer dur- and others) seeking to find biological in-
ing global planning – encoding the problem dices of intelligence and other aspects of
and formulating a general strategy for attack- mental processes were a resounding fail-
ing the problem (or set of problems) – but ure despite great efforts. As tools for study-
they take less time for local planning – forming the brain have become more sophisti-
ing and implementing strategies for the de- cated, however, we are beginning to see the
tails of the task (Sternberg, 1 981 ). possibility of finding physiological indica-
The advantage of spending more time tors of intelligence. Some investigators (e.g.,
on global planning is the increased likeli- Matarazzo, 1 992) believe that we will have
hood that the overall strategy will be cor- clinically useful psychophysiological indices
intelligence 759
of intelligence very early in the current ever, failed to find a strong relation between
millennium, although widely applicable in- neural-conduction velocity (as measured by
dices will be much longer in coming. In the neural-conduction speeds in a main nerve
meantime, the biological studies we now of the arm) and intelligence (as measured
have are largely correlational, showing sta- on the Multidimensional Aptitude Battery).
tistical associations between biological and Surprisingly, neural-conduction velocity ap-
psychometric or other measures of intelli- pears to be a more powerful predictor of
gence. The studies do not establish causal IQ scores for men than for women, so gen-
relations (see Goel, Chapter 20, for a de- der differences may account for some of the
scription of the neural basis of deductive differences in the data (Wickett & Vernon,
reasoning). 1 994). Additional studies on both males and
females are needed.
brain size
positron emission tomography, functional
One line of research looks at the relation- magnetic resonance imaging
ship of brain size to intelligence (see Jerison,
An alternative approach to studying the
2000; Vernon et al., 2000). The evidence
brain suggests that neural efficiency may be
suggests that, for humans, there is a mod-
related to intelligence; such an approach is
est but significant statistical relationship
based on studies of how the brain metabo-
between brain size and intelligence. It is dif-
lizes glucose (simple sugar required for brain
ficult to know what to make of this rela-
activity) during mental activities. Richard
tionship, however, because greater brain size
Haier and colleagues (Haier et al., 1 992)
may cause greater intelligence, greater intel-
cited several other researchers who support
ligence may cause greater brain size, or both
their own findings that higher intelligence
may depend on some third factor. Moreover,
correlates with reduced levels of glucose
it probably is more important how efficiently
metabolism during problem-solving tasks –
the brain is used than what size it is. On aver-
that is, smarter brains consume less sugar
age, for example, men have larger brains than
(and hence expend less effort) than do less
women, but women have better connections
smart brains doing the same task. Further-
of the two hemispheres of the brain through
more, Haier and colleagues found that cere-
the corpus callosum. So it is not clear which
bral efficiency increases as a result of learning
gender, on average, would be at an advan-
on a relatively complex task involving visu-
tage, and probably neither would be. It is
ospatial manipulations (the computer game
important to note that the relationship be-
Tetris). As a result of practice, more in-
tween brain size and intelligence does not
telligent participants show not only lower
hold across species (Jerison, 2000). Rather,
cerebral glucose metabolism overall but also
what holds seems to be a relationship be-
more specifically localized metabolism of
tween intelligence and brain size relative to
glucose. In most areas of their brains, smarter
the rough general size of the organism.
participants show less glucose metabolism,
but in selected areas of their brains (believed
speed of neural conduction to be important to the task at hand), they
Complex patterns of electrical activity in show higher levels of glucose metabolism.
the brain, which are prompted by specific Thus, more intelligent participants may have
stimuli, appear to correlate with scores on learned how to use their brains more effi-
IQ tests (Barrett & Eysenck, 1 992). Several ciently to focus their thought processes on a
studies (e.g., McGarry-Roberts, Stelmack, & given task.
Campbell, 1 992; Vernon & Mori, 1 992) ini- More recent research by Haier and col-
tially suggested that speed of conduction leagues suggests that the relationship be-
of neural impulses correlates with intelli- tween glucose metabolism and intelligence
gence as measured by IQ tests. A follow- may be more complex (Haier et al., 1 995 ;
up study (Wickett & Vernon, 1 994), how- Larson et al., 1 995 ). Whereas Haier’s group
(1 995 ) confirmed the earlier findings of in- Evolutionary Theory

creased glucose metabolism in less smart par-
Some theorists have tried to understand
ticipants (in this case, mildly retarded par-
intelligence in terms of how it has
ticipants), the study by Larson et al. (1 995 )
evolved over the eons (e.g., Bjorklund &
found, contrary to the earlier findings, that
Kipp, 2002; Bradshaw, 2002; Byrne, 2002;
smarter participants had increased glucose
Calvin, 2002; Corballis, 2002; Cosmides &
metabolism relative to their average compar-
Tooby, 2002; Flanagan, Hardcastle, & Nah-
ison group.
mias, 2002; Grossman & Kaufman, 2002;
One problem with earlier studies is that
Pinker, 1 997). The basic idea in these mod-
the tasks used were not matched for diffi-
els is that we are intelligent in the ways
culty level across groups of smart and av-
we are because it was important for our
erage individuals. The Larson et al. study
distant ancestors to acquire certain sets of
used tasks that were matched to the ability
skills. According to Cosmides and Tooby
levels of the smarter and average partici-
(2002), for example, we are particularly sen-
pants and found that the smarter partici-
sitive at detecting cheating because people in
pants used more glucose. Moreover, the glu-
the past who were not sensitive to cheaters
cose metabolism was highest in the right
did not live to have children, or had fewer
hemisphere of the more intelligent partic-
children. Evolutionary approaches stress the
ipants performing the hard task – again
continuity of the nature of intelligence over
suggesting selectivity of brain areas. What
long stretches of time, and in some theo-
could be driving the increases in glucose
ries, across species. However, during evolu-
metabolism? Currently, the key factor ap-
tion, the frontal lobe increased in size, so it
pears to be subjective task difficulty with
is difficult to know whether changes in in-
smarter participants in earlier studies simply
telligence are just a manifestation of physio-
finding the tasks too easy. Matching task dif-
logical changes or the other way around.
ficulty to participants’ abilities seems to in-
dicate that smarter participants increase glu-
Contextual Approaches to Intelligence
cose metabolism when the task demands it.
The preliminary findings in this area need to According to contextualists, intelligence
be investigated further before any conclusive cannot be understood outside its real-world
answers are reached. context. The context of intelligence may
Some neuropsychological research (e.g., be viewed at any level of analysis, focus-
Dempster, 1 991 ) suggests that performance ing narrowly, on the home and family en-
on intelligence tests may not indicate a cru- vironment, or extending broadly, on entire
cial aspect of intelligence – the ability to cultures (see Greenfield, Chap. 27). Even
set goals, to plan how to meet them, and cross-community differences have been cor-
to execute those plans. Specifically, persons related with differences in performance on
with lesions in the frontal lobe of the brain intelligence tests; such context-related dif-
frequently perform quite well on standard- ferences include those of rural versus urban
ized IQ tests, which require responses to communities, low versus high proportions
questions within a highly structured situa- of teenagers to adults within communities,
tion, but do not require much in the way and low versus high socioeconomic status
of goal setting or planning. If intelligence of communities (see Coon, Carey, & Fulker,
involves the ability to learn from experi- 1 992). Contextualists are particularly in-
ence and to adapt to the surrounding envi- trigued by the effects of cultural context
ronment, the ability to set goals and to de- on intelligence.
sign and implement plans cannot be ignored. In fact, contextualists consider intelli-
An essential aspect of goal setting and plan- gence so inextricably linked to culture that
ning is the ability to attend appropriately to they view intelligence as something that
relevant stimuli and to ignore or discount a culture creates to define the nature of
irrelevant stimuli. adaptive performance in that culture and to
intelligence 761
account for why some people perform bet- to get from one place to another (Gladwin,
ter than others on the tasks that the culture 1 970). Were Puluwat sailors to devise intel-
happens to value (Sternberg, 1 985 ). Theo- ligence tests for us and our fellow Amer-
rists who endorse this model study just how icans, we might not seem very intelligent.
intelligence relates to the external world in Similarly, the highly skilled Puluwat sailors
which the model is being applied and eval- might not do well on American-crafted tests
uated. In general, definitions and theories of of intelligence. These and other observations
intelligence will more effectively encompass have prompted quite a few theoreticians to
cultural diversity by broadening in scope. Be- recognize the importance of considering cul-
fore exploring some of the contextual the- tural context when assessing intelligence.
ories of intelligence, we will look at what The preceding arguments may make it
prompted psychologists to believe that cul- clear why it is so difficult to come up with
ture might play a role in how we define and a test that everyone would consider culture-
assess intelligence. fair – equally appropriate and fair for mem-
People in different cultures may have bers of all cultures. If members of differ-
quite different ideas of what it means to be ent cultures have different ideas of what it
smart. One of the more interesting cross- means to be intelligent, then the very be-
cultural studies of intelligence was per- haviors that may be considered intelligent in
formed by Michael Cole and colleagues one culture may be considered unintelligent
(Cole et al., 1 971 ). These investigators asked in another. Take, for example, the concept of
adult members of the Kpelle tribe in Africa mental quickness. In mainstream American
to sort concept terms. In Western culture, culture, quickness is usually associated with
when adults are given a sorting task on an in- intelligence. To say someone is “quick” is to
telligence test, more intelligent people typ- say that the person is intelligent and, indeed,
ically sort hierarchically. For example, they most group tests of intelligence are quite
may sort names of different kinds of fish to- strictly timed. Even on individual tests of
gether, and then the word fish over that, with intelligence, the test-giver times some re-
the name animal over fish and over birds, and sponses of the test-taker. Many information-
so on. Less intelligent people typically sort processing theorists and even psychophys-
functionally. They may sort fish with eat, for iological theorists focus on the study of
example, because we eat fish, or clothes with intelligence as a function of mental speed.
wear, because we wear clothes. The Kpelle In many cultures of the world, people
sorted functionally – even after investigators believe that more intelligent people do not
unsuccessfully tried to get the Kpelle sponta- rush into things. Even in our own culture,
neously to sort hierarchically. Finally, in des- no one will view you as brilliant if you de-
peration, one of the experimenters (Glick) cide on a marital partner, a job, or a place
asked a Kpelle to sort as a foolish person to live in the 20 to 3 0 seconds you might
would sort. In response, the Kpelle quickly normally have to solve an intelligence-test
and easily sorted hierarchically. The Kpelle problem. Thus, given that there exist no
had been able to sort this way all along; they perfectly culture-fair tests of intelligence, at
just hadn’t done it because they viewed it least at present, how should we consider
as foolish – and they probably considered context when assessing and understanding
the questioners rather unintelligent for ask- intelligence?
ing such stupid questions. Several researchers have suggested that
The Kpelle people are not the only ones providing culture-relevant tests is possi-
who might question Western understand- ble (e.g., Baltes, Dittmann-Kohli, & Dixon,
ings of intelligence. In the Puluwat culture of 1 984; Jenkins, 1 979; Keating, 1 984); that
the Pacific Ocean, for example, sailors navi- is, tests that employ skills and knowledge
gate incredibly long distances, using none of that relate to the cultural experiences of
the navigational aids that sailors from tech- the test-takers. Baltes and his colleagues, for
nologically advanced countries would need example, designed tests measuring skill in
dealing with the pragmatic aspects of ev- chasing food but had great difficulty with
eryday life. Designing culture-relevant tests it when hypothetically purchasing medici-
requires creativity and effort but probably nal herbs (Schliemann & Magalhües, 1 990).
is not impossible. A study by Daniel Wag- Brazilian children whose poverty had forced
ner (1 978), for example, investigated mem- them to become street vendors showed no
ory abilities – one aspect of intelligence as difficulty in performing complex arithmetic
our culture defines it – in our culture ver- computations when selling things but had
sus the Moroccan culture. Wagner found great difficulty performing similar calcula-
that level of recall depended on the content tions in a classroom (Carraher, Carraher, &
that was being remembered, with culture- Schliemann, 1 985 ). Thus, test performance
relevant content being remembered more may be affected by the context in which
effectively than irrelevant content (e.g., the test terms are presented. In this study,
compared with Westerners, Moroccan rug the investigators looked at the interaction
merchants were better able to recall com- of cognition and context. Several investiga-
plex visual patterns on black-and-white pho- tors have proposed theories that seek explic-
tos of Oriental rugs). Wagner further sug- itly to examine this interaction within an in-
gested that when tests are not designed to tegrated model of many aspects of intelli-
minimize the effects of cultural differences, gence. Such theories view intelligence as a
the key to culture-specific differences in complex system.
memory might be the knowledge and use
of metamemory strategies, rather than ac- Systems Approaches to Intelligence
tual structural differences in memory (e.g.,
memory span and rates of forgetting). gardner: multiple intelligences
In Kenya, research has shown that ru- Howard Gardner (1 983 , 1 993 ) proposed a
ral Kenyan school children have substantial theory of multiple intelligences, in which in-
knowledge about natural herbal medicines telligence is not just a single, unitary con-
they believe fight infection; Western chil- struct. Instead of speaking of multiple abil-
dren, of course, would not be able to ities that together constitute intelligence
identify any of these medicines (Sternberg (e.g., Thurstone, 1 93 8), Gardner (1 999)
et al., 2001 ; Sternberg & Grigorenko, 1 997). speaks of eight distinct intelligences that are
In short, making a test culturally rele- relatively independent of each other. Each
vant appears to involve much more than is a separate system of functioning, although
just removing specific linguistic barriers these systems can interact to produce what
to understanding. we see as intelligent performance.
Stephen Ceci (Ceci & Roazzi, 1 994) In some respects, Gardner’s theory
found similar context effects in childrens’ sounds like a factorial one because it specifies
and adults’ performance on a variety of tasks. several abilities that are construed to reflect
Ceci suggests that the social context (e.g., intelligence of some sort. However, Gardner
whether a task is considered masculine or views each ability as a separate intelligence,
feminine), the mental context (e.g., whether not just as a part of a single whole. Moreover,
a visuo-spatial task involves buying a home a crucial difference between Gardner’s the-
or burgling it), and the physical context (e.g., ory and factorial ones is in the sources of evi-
whether a task is presented at the beach or dence Gardner used for identifying the eight
in a laboratory) all affect performance. For intelligences. Gardner used converging op-
example, fourteen-year-old boys performed erations, gathering evidence from multiple
poorly on a task when it was couched as sources and types of data.
a cupcake-baking task but performed well Gardner’s view of the mind is modular,
when it was framed as a battery-charging Because as a major task of existing and fu-
task (Ceci & Bronfenbrenner, 1 985 ). Brazil- ture research on intelligence is to isolate the
ian maids had no difficulty with propor- portions of the brain responsible for each
tional reasoning when hypothetically pur- of the intelligences. Gardner has speculated
intelligence 763
regarding at least some of these locales, but which we have no previous experience, to
hard evidence for the existence of these sep- a completely familiar task, with which we
arate intelligences has yet to be produced. have vast, extensive experience. As a task
Furthermore, Nettelbeck and Young (1 996) becomes increasingly familiar, many aspects
question the strict modularity of Gardner’s of the task may become automatic, requir-
theory. Specifically, the phenomenon of pre- ing little conscious effort to determine what
served specific cognitive functioning in autis- step to take next and how to implement
tic savants (persons with severe social and that next step. A novel task makes demands
cognitive deficits, but with corresponding on intelligence different from those of a
high ability in a narrow domain) as evidence task for which automatic procedures have
for modular intelligences may not be justi- been developed.
fied. According to Nettelbeck and Young, According to the triarchic theory, rela-
the narrow long-term memory and specific tively novel tasks – such as visiting a foreign
aptitudes of savants is not really intelligent. country, mastering a new subject, or acquir-
As a result, there may be reason to question ing a foreign language – demand more of a
the intelligence of inflexible modules. person’s intelligence. In fact, a completely
unfamiliar task may demand so much of the
sternberg: the triarchic theory of person as to be overwhelming.
successful intelligence How intelligence relates to the external
Whereas Gardner emphasizes the separate- world. The triarchic theory also proposes
ness of the various aspects of intelligence, I that the various components of intelli-
tend to emphasize the extent to which they gence are applied to experience to serve
work together in the triarchic theory of suc- three functions in real-world contexts –
cessful intelligence (Sternberg, 1 985 , 1 988, adapting ourselves to our existing environ-
1 996, 1 999). According to the triarchic (tri-, ments, shaping our existing environments
“three”; -archic, “governed”) theory, intelli- to create new environments, and selecting
gence comprises three aspects, dealing with new environments.
the relation of intelligence (1 ) to the internal According to the triarchic theory, people
world of the person, (2) to experience, and may apply their intelligence to many differ-
(3 ) to the external world. ent kinds of problems. Some people may
How intelligence relates to the internal be more intelligent in the face of abstract,
world. This part of the theory emphasizes academic problems, for example, whereas
the processing of information, which can be others may be more intelligent in the face
viewed in terms of three different kinds of of concrete, practical problems. The the-
components: (1 ) metacomponents – execu- ory does not define an intelligent person as
tive processes (i.e., metacognition) used to someone who necessarily excels in all as-
plan, monitor, and evaluate problem solving; pects of intelligence. Rather, intelligent per-
(2) performance components – lower order sons know their own strengths and weak-
processes used to implement the commands nesses and find ways in which to capitalize
of the metacomponents; and (3 ) knowledge- on their strengths and either to compensate
acquisition components – the processes used for or to correct their weaknesses.
to learn how to solve the problems in In a recent comprehensive study testing
the first place. The components are highly the validity of the triarchic theory and its
interdependent. usefulness in improving performance, we
How intelligence relates to experience. The predicted that matching students’ instruc-
theory also considers how prior experi- tion and assessment to their abilities would
ence may interact with all three kinds of lead to improved performance (Sternberg
information-processing components. That et al., 1 996, 1 999). Students were selected
is, each of us faces tasks and situations with for one of five ability patterns: high only in
which we have varying levels of experience, analytical ability, high only in creative abil-
ranging from a completely novel task, with ity, high only in practical ability, high in all
three abilities, or not high in any of the three p. 497), although it is not entirely clear
abilities. Then students were assigned at ran- what any of these terms means. Perkins
dom to one of four instructional groups that believes this aspect of intelligence to be
emphasized memory-based, analytical, cre- largely genetically determined and unlearn-
ative, or practical learning followed by sub- able. This kind of intelligence seems to be
sequent assessment. We found that students somewhat similar to Cattell’s (1 971 ) idea of
who were placed in an instructional condi- fluid intelligence.
tion that matched their strength in terms of The experiential aspect of intelligence is
ability pattern (e.g., a high-analytical student what has been learned from experience. It is
being placed in an instructional condition the extent and organization of the knowl-
that emphasized analytical thinking) out- edge base and thus is similar to Cattell’s
performed students who were mismatched (1 971 ) notion of crystallized intelligence.
(e.g., a high-analytical student being placed The reflective aspect of intelligence refers
in an instructional condition that empha- to the role of strategies in memory and prob-
sized practical thinking). lem solving, and appears to be similar to
Teaching all students to use all of their the construct of metacognition or cogni-
analytic, creative, and practical abilities has tive monitoring (Brown & DeLoache, 1 978;
resulted in improved school achievement Flavell, 1 981 ).
for all students, whatever their ability pat- No empirical test of the theory of true
tern (Grigorenko, Jarvin, & Sternberg, 2002; intelligence has been published, so it is diffi-
Sternberg, Torff, & Grigorenko, 1 998). One cult to evaluate the theory at this time. Like
important consideration in light of such find- Gardner’s (1 983 ) theory, Perkins’s theory is
ings is the need for changes in the assess- based on literature review, and, as noted pre-
ment of intelligence (Sternberg & Kaufman, viously, such literature reviews often tend to
1 996). Current measures of intelligence are be selective and then interpreted in a way
somewhat one-sided, measuring mostly an- that maximizes the fit of the theory to the
alytic abilities with little or no assessment available data.
of creative and practical aspects of intel-
ligence (Sternberg et al., 2000; Wagner,
2000). A well-rounded assessment and in- the bioecological model of intelligence
struction system could lead to greater ben- Ceci (1 996) proposed a bioecological model
efits of education for a wider variety of stu- of intelligence, according to which multi-
dents – a nominal goal of education. ple cognitive potentials, context, and knowl-
edge all are essential bases of individual
differences in performance. Each of the mul-
true intelligence tiple cognitive potentials enables relation-
Perkins (1 995 ) proposed a theory of what he ships to be discovered, thoughts to be moni-
refers to as true intelligence, which he believes tored, and knowledge to be acquired within
synthesizes classic views as well as new ones. a given domain. Although these potentials
According to Perkins, there are three basic are biologically based, their development is
aspects of intelligence – neural, experiential, closely linked to environmental context, and
and reflective. it is difficult, if not impossible, to cleanly
Neural intelligence concerns what Perkins separate biological from environmental
believes to be the fact that some people’s contributions to intelligence. Moreover, abil-
neurological systems function better than do ities may express themselves very differ-
the neurological systems of others, running ently in different contexts. For example, chil-
faster and with more precision. He men- dren given essentially the same task in the
tions “more finely tuned voltages” and “more context of a video game versus a labora-
exquisitely adapted chemical catalysts” as tory cognitive task performed much better
well as a “better pattern of connecticity in when the task was presented in the video
the labyrinth of neurons” (Perkins, 1 995 , game context.
intelligence 765
The bioecological model appears in many riched home environment. A particularly

ways more to be a framework than a theory. successful project has been the Abecedar-
At some level, the theory must be right. Cer- ian Project, which showed that the cogni-
tainly, both biological and ecological factors tive skills and achievements of lower socioe-
contribute to the development and manifes- conomic status children could be increased
tation of intelligence. Perhaps what the the- through carefully planned and executed in-
ory needs most at this time are specific and terventions (Ramey & Ramey, 2000).
clearly falsifiable predictions that would set Bradley and Caldwell (1 984) found sup-
it apart from other theories. port for the importance of home environ-
ment with regard to the development of
intelligence in young children. These re-
Improving Intelligence searchers found that several factors in the
early (preschool) home environment were
Although designers of artificial intelligence correlated with high IQ scores – emotional
have made great strides in creating programs and verbal responsivity of the primary care-
that simulate knowledge and skill acquisi- giver and the caregiver’s involvement with
tion, no existing program even approaches the child, avoidance of restriction and pun-
the ability of the human brain to enhance ishment, organization of the physical envi-
its own intelligence. Human intelligence is ronment and activity schedule, provision of
highly malleable and can be shaped and appropriate play materials, and opportuni-
even increased through various kinds of in- ties for variety in daily stimulation. Further,
terventions (Detterman & Sternberg, 1 982; Bradley and Caldwell found that these fac-
Grotzer & Perkins, 2000; Perkins & Grotzer, tors more effectively predicted IQ scores
1 997; Sternberg et al., 1 996; Sternberg et al., than did socioeconomic status or family-
1 997; see Ritchhart & Perkins, Chap. 3 2, for structure variables. It should be noted, how-
a review of work on teaching thinking skills). ever, that the Bradley–Caldwell study is
Moreover, the malleability of intelligence correlational and therefore cannot be inter-
has nothing to do with the extent to which preted as indicating causality. Furthermore,
intelligence has a genetic basis (Sternberg, their study pertained to preschool children,
1 997). An attribute (such as height) can be and children’s IQ scores do not begin to pre-
partly or even largely genetically based and dict adult IQ scores well until age four years.
yet be environmentally malleable. Moreover, before age seven years, the scores
The Head Start program was initiated are not very stable (Bloom, 1 964). More re-
in the 1 960s to provide preschoolers with cent work (e.g., Pianta & Egeland, 1 994) sug-
an edge on intellectual abilities and accom- gested that factors such as maternal social
plishments when they started school. Long- support and interactive behavior may play a
term follow-ups have indicated that by mid- key role in the instability of scores on tests
adolescence, children who participated in of intellectual ability between ages two and
the program were more than a grade ahead eight years.
of matched controls who did not receive the The Bradley and Caldwell data should not
program (Lazar & Darlington, 1 982; Zigler be taken to indicate that demographic vari-
& Berman, 1 983 ). The children in the pro- ables have little effect on IQ scores. To the
gram also scored higher on a variety of tests contrary, throughout history and across cul-
of scholastic achievement, were less likely to tures, many groups of people have been as-
need remedial attention, and were less likely signed pariah status as inferior members of
to show behavioral problems. Although such the social order. Across cultures, these dis-
measures are not truly measures of intelli- advantaged groups (e.g., native Maoris vs.
gence, they show strong positive correlations European New Zealanders) have shown dif-
with intelligence tests. ferences in tests of intelligence and apti-
An alternative to intellectual enrichment tude (Steele, 1 990; Zeidner, 1 990). Such was
outside the home may be to provide an en- the case of the Burakumin tanners in Japan,
who, in 1 871 , were granted emancipation any attribute that is partly genetic, there is a
but not full acceptance into Japanese society. reaction range – that is, the attribute can be
Despite their poor performance and under- expressed in various ways within broad lim-
privileged status in Japan, those who immi- its of possibilities. Thus, each person’s intel-
grate to America and are treated like other ligence can be developed further within this
Japanese immigrants – perform on IQ tests broad range of potential intelligence (Grig-
and in school achievement at a level compa- orenko, 2000). We have no reason to believe
rable to that of their fellow Japanese Amer- that people now reach their upper limits in
icans (Ogbu, 1 986). the development of their intellectual skills.
Similar positive effects of integration To the contrary, the evidence suggests that
were shown on the other side of the world. we can do quite a bit to help people become
In Israel, the children of European Jews score more intelligent (for further discussion of
much higher on IQ tests than do children of these issues, see R. Mayer, 2000, and Neisser
Arabic Jews – except when the children are et al., 1 996).
reared on kibbutzim in which the children Environmental as well as hereditary fac-
of all national ancestries are raised by spe- tors may contribute to retardation in intelli-
cially trained caregivers in a dwelling sepa- gence (Grigorenko, 2000; Sternberg & Grig-
rate from their parents. When these children orenko, 1 997). Environmental influences be-
shared the same child-rearing environments, fore birth may cause permanent retardation,
there were no national-ancestry-related dif- which may result from a mother’s inade-
ferences in IQ. quate nutrition or ingestion of toxins such
Altogether, there is now abundant ev- as alcohol during the infant’s prenatal devel-
idence that people’s environments (e.g., opment (Grantham-McGregor, Ani, & Fer-
Ceci, Nightingale, & Baker, 1 992; Reed, nald, 2002; Mayes & Fahy, 2001 ; Olson,
1 993 ; Sternberg & Wagner, 1 994; Wagner, 1 994), for example. Among the other en-
2000), their motivation (e.g., Collier, 1 994; vironmental factors that can negatively im-
Sternberg & Ruzgis, 1 994), and their train- pact intelligence are low social and eco-
ing (e.g., Feuerstein, 1 980; Sternberg, 1 987) nomic status (Ogbu & Stern, 2001 ; Seifer,
can profoundly affect their intellectual skills. 2001 ), high levels of pollutants (Bellinger &
Thus, the controversial claims made by Her- Adams, 2001 ), inadequate care in the fam-
rnstein and Murray (1 994) in their book, The ily or divorce (Fiese, 2001 ; Guidubaldi &
Bell Curve, regarding the futility of interven- Duckworth, 2001 ), infectious diseases (Al-
tion programs, are unfounded when one con- cock & Bundy, 2001 ), high levels of radiation
siders the evidence in favor of the possibility (Grigorenko, 2001 ), and inadequate school-
of improving cognitive skills. Likewise, Her- ing (Christian, Bachnan, & Morrison, 2001 ).
rnstein and Murray’s appeal to “a genetic fac- Physical trauma can injure the brain, causing
tor in cognitive ethnic differences” (Herrn- mental retardation.
stein & Murray, 1 994, p. 270) falls apart in
light of the direct evidence against such ge-
netic differences (Sternberg, 1 996) and re-
sults from a misunderstanding of the heri- Conclusions and Future Directions
tability of traits in general.
Heredity certainly plays a role in indi- In conclusion, many approaches have been
vidual differences in intelligence (Loehlin, taken to improve understanding of the na-
2000; Loehlin, Horn, & Willerman, 1 997; ture of intelligence. Great progress has been
Plomin, 1 997), as does the environment made in elaborating the construct but much
(Grigorenko, 2000, 2002; Sternberg & Grig- less progress in converging upon either a
orenko, 1 999; Wahlsten & Gottlieb, 1 997). definition or a universally accepted theory.
Genetic inheritance may set some kind of Much of current debate revolves around
upper limit on how intelligent a person may trying to figure out what the construct is
become. However, we now know that for and how it relates to other constructs, such
intelligence 767
as learning, memory, and reasoning. Intel- Barrett, P. T., & Eysenck, H. J. (1 992).
ligence can be measured, to some extent, Brain evoked potentials and intelligence: The
and it can be improved. Improvements are Hendrickson Paradigm. Intelligence, 1 6, 3 61 –
not likely to eliminate individual differences, 3 81 .
however, because attempts to improve intel- Bellinger, D. C., & Adams, H. F. (2001 ) Environ-
ligence can help people at all levels and with mental pollutant exposures and children’s cog-
diverse kinds of intelligence. No matter how nitive ability. In R. J. Sternberg, & E. L. Grig-
orenko (Eds.), Environmental Effects on Cog-
high one’s intelligence, there is always room
nitive Abilities (pp. 1 5 7–1 88). Mahwah, NJ:
for improvement; and no matter how low,
Erlbaum.
there are always measures that can be taken
Binet, A., & Simon, T. (1 91 6). The Development
to help raise it.
of Intelligence in Children. Baltimore: Williams
& Wilkins. (Originally published in 1 905 ).
Bjorklund, D. F., & Kipp, K. (2002). Social cogni-
Acknowledgments tion, inhibition, and theory of mind: The evo-
lution of human intelligence. In R. J. Sternberg,
Preparation of this article was supported & J. C. Kaufman, (Eds.), The Evolution of Intel-
by Grant REC-9979843 from the National ligence (pp. 27–5 4). Mahwah, NJ: Erlbaum.
Science Foundation and by a grant un- Bloom, B. S. (1 964). Stability and Change in Hu-
der the Javits Act Program (Grant No. man Characteristics. New York: Wiley.
R206R000001 ) as administered by the In- Boring, E. G. (1 923 , June 6). Intelligence as the
stitute of Education Sciences, U.S. Depart- tests test it. New Republic, 3 5 –3 7.
ment of Education. Grantees undertaking Bors, D. A., Forrin, B. (1 995 ). Age, speed of in-
such projects are encouraged to express formation processing, recall, and fluid intelli-
freely their professional judgment. This ar- gence. Intelligence, 2 0(3 ), 229–248.
ticle, therefore, does not necessarily repre- Bors, D. A., MacLeod, C. M., & Forrin, B. (1 993 ).
sent the position or policies of the National Eliminating the IQT correlation by eliminating
Science Foundation, Office of Educational an experimental confound. Intelligence, 1 7(4),
Research and Improvement, or the U.S. 475 –5 00.
Department of Education, and no official en- Bradley, R. H., & Caldwell, B. M. (1 984). 1 74
dorsement should be inferred. Children: A study of the relationship between
home environment and cognitive development
during the first 5 years. In A. W. Gottfried
(Ed.), Home Environment and Early Cogni-
References tive Development : Longitudinal Research. San
Diego, CA: Academic Press.
Ackerman, P. L. (1 996). A theory of adult intel- Bradshaw, J. L. (2002). The evolution of intellect:
lectual development: Process, personality, in- Cognitive, neurological, and primatological as-
terests, and knowledge. Intelligence, 2 2 , 229– pects and hominid culture. In R. J. Sternberg,
25 9. & J. C. Kaufman, (Eds.), The Evolution of Intel-
Alcock, K. J., Bundy, D. A. P. (2001 ). The impact ligence (pp. 5 7–5 8). Mahwah, NJ: Erlbaum.
of infectious disease on cognitive development. Brody, N. (2000). History of theories and mea-
In R. J. Sternberg, & E. L. Grigorenko (Eds.), surements of intelligence. In R. J. Sternberg
Environmental Effects on Cognitive Abilities (Ed.), Handbook of Intelligence (pp. 1 6–3 3 ).
(pp. 221 –25 3 ). Mahwah, NJ: Erlbaum. New York: Cambridge University Press.
Baltes, P. B., Dittmann-Kohli, F., & Dixon, R. A. Brown, A. L., & DeLoache, J. S. (1 978). Skills,
(1 984). New Perspectives on the development plans, and self-regulation. In R. Siegler (Ed.),
of intelligence in adulthood: Toward a dual- Children’s Thinking: What Develops? Hillsdale,
process conception and a model of selective op- NJ: Erlbaum.
timization with compensation. In P. B. Baltes, Byrne, R. W. (2002). The primate origins of hu-
& O. G. Brim, Jr. (Eds.), Life-span Development man intelligence. In R. J. Sternberg & J. C.
and Behavior (Vol. 6, pp. 3 3 –76). New York: Kaufman, (Eds.), The Evolution of Intelligence
Academic Press. (pp. 79–96). Mahwah, NJ: Erlbaum.
Calvin, W. H. (2002). Pumping up intelligence: Corballis, M. C. (2002). Evolution of the gen-

Abrupt climate jumps and the evolution of erative mind. In R. J. Sternberg, & J. C.
higher intellectual functions during the Ice Kaufman, (Eds.), The Evolution of Intelligence
Ages. In R. J. Sternberg, & J. C. Kaufman, (pp. 1 1 7–1 44). Mahwah, NJ: Erlbaum.
(Eds.), The Evolution of Intelligence (pp. 97– Cosmides, L., & Tooby, J. (2002). Unraveling the
1 1 6). Mahwah, NJ: Erlbaum. enigma of human intelligence: Evolutionary
Carraher, T. N., Carraher, D., & Schliemann, A. psychology and the multimodular mind. In R. J.
D. (1 985 ). Mathematics in the streets and in Sternberg, & J. C. Kaufman, (Eds.), The Evolu-
schools. British Journal of Developmental Psy- tion of Intelligence (pp. 1 45 –1 98). Mahwah, NJ:
chology, 3 , 21 –29. Erlbaum.
Carroll, J. B. (1 993 ). Human Cognitive Abilities: Craik, F. I. M., & Lockhart R. S. (1 972). Levels of
A Survey of Factor-analytic Studies. New York: processing: A framework for memory research.
Cambridge University Press. Journal of Verbal Learning and Verbal Behavior,
Cattell, R. B. (1 971 ). Abilities: Their Structure, 1 1 , 671 –684.
Growth and Action. Boston: Houghton Mifflin. Daneman, M., & Carpenter, P. A. (1 983 ). Individ-
Ceci, S. J. (1 996). On Intelligence . . . More or Less ual differences in integrating information be-
(expanded ed.). Cambridge, MA: Harvard Uni- tween and within sentences. Journal of Experi-
versity Press. mental Psychology: Learning, Memory, and Cog-
Ceci, S. J., & Bronfenbrenner, U. (1 985 ). Don’t nition, 9(4), 5 61 –5 84.
forget to take the cupcakes out of the oven: Daniel, M. H. (1 997). Intelligence testing: Status
Strategic time-monitoring, prospective mem- and trends. American Psychologist, 5 2 , 1 03 8–
ory and context. Child Development, 5 6, 1 75 – 1 045 .
1 90. Daniel, M. H. (2000). Interpretation of intel-
Ceci, S. J., Nightingale, N. N., & Baker, J. G. ligence test scores. In R. J. Sternberg (Ed.),
(1 992). The ecologies of intelligence: Chal- Handbook of Intelligence (pp. 477–491). New
lenges to traditional views. In D. K. Detter- York: Cambridge University Press.
man (Ed.), Current Topics in Human Intelligence Davies, M., Stankov, L., & Roberts, R. D. (1 998).
(Vol. 2 ). Is Mind Modular or Unitary? (pp. 61 – Emotional intelligence: In search of an elusive
82). Norwood, NJ: Ablex. construct. Journal of Personality and Social Psy-
Ceci, S. J., & Roazzi, A. (1 994). The effects of chology, 75 , 989–1 01 5 .
context on cognition: Postcards from Brazil. In Deary, I. J. (2000). Simple information process-
R. J. Sternberg, & R. K. Wagner (Eds.), Mind in ing. In R. J. Sternberg (Ed.), Handbook of Intel-
Context: Interactionist Perspectives on Human In- ligence (pp. 267–284). New York: Cambridge
telligence (pp. 74–1 01 ). New York: Cambridge University Press.
University Press. Deary, I. J. (2002). g and cognitive elements
Christian, K., Bachnan, H. J., & Morrison, F. J. of information processing: An agnostic view.
(2001 ). Schooling and cognitive development. In R. J. Sternberg, & E. L. Grigorenko
In R. J. Sternberg, & E. L. Grigorenko (Eds.), (Eds.), The General Factor of Intelligence: How
Environmental Effects on Cognitive Abilities General Is It? (pp. 1 5 1 –1 81 ). Mahwah, NJ:
(pp. 287–3 3 5 ). Mahwah, NJ: Erlbaum. Erlbaum.
Ciarrochi, J., Forgas, J. P., & Mayer, J. D. (Eds.) Deary, I. J., & Stough, C. (1 996). Intelligence and
(2001 ). Emotional Intelligence in Everyday Life: inspection time: Achievements, prospects, and
A Scientific Inquiry. Philadelphia: Psychology problems. American Psychologist, 5 1, 5 99–608.
Press. Dempster, F. N. (1 991 ). Inhibitory processes:
Cole, M., Gay, J., Glick, J., & Sharp, D. W. (1 971 ). A neglected dimension of intelligence. Intelli-
The Cultural Context of Learning and Thinking. gence, 15 , 1 5 7–1 73 .
New York: Basic Books. Detterman, D. K., & Sternberg, R. J. (Eds.)
Collier, G. (1 994). Social Origins of Mental Ability. (1 982). How and How Much Can Intelligence
New York: Wiley. Be Increased? Norwood, NJ: Erlbaum.
Coon, H., Carey, G., & Fulker, D. W. (1 992). Embretson, S., & McCollam, K. (2000). Psycho-
Community influences on cognitive ability. In- metric approaches to the understanding and
telligence, 1 6(2), 1 69–1 88. measurement of intelligence. In R. J. Sternberg
intelligence 769
(Ed.), Handbook of Intelligence (pp. 423 –444). Effects on Intellectual Functioning (pp. 25 5 –286).
New York: Cambridge University Press. Mahwah, NJ: Erlbaum.
Estes, W. K. (1 982). Similarity-related channel in- Grigorenko, E. L. (2002). Other than g: The value
teractions in visual processing. Journal of Exper- of persistence. In R. J. Sternberg, & E. L. Grig-
imental Psychology: Human Perception and Per- orenko (Eds.), The General Factor of Intelligence:
formance, 8(3 ), 3 5 3 –3 82. Fact or Fiction (pp. 299–3 27). Mahwah, NJ:
Feuerstein, R. (1 980). Instrumental Enrichment: Erlbaum.
An Intervention Program for Cognitive Modifi- Grigorenko, E. L., Jarvin, L., & Sternberg, R. J.
ability. Baltimore: University Park Press. (2002). School-based tests of the triarchic the-
Fiese, B. H. (2001 ). Family matters: A systems ory of intelligence: Three settings, three sam-
view of family effects on children’s cognitive ples, three syllabi. Contemporary Educational
health. In R. J. Sternberg, & E. L. Grigorenko Psychology, 2 7, 1 67–208.
(Eds.), Environmental Effects on Cognitive Abil- Grossman, J. B., & Kaufman, J. C. (2002). Evo-
ities (pp. 3 9–5 7). Mahwah, NJ: Erlbaum. lutionary psychology: Promise and perils. In R.
Flanagan, O., Hardcastle, V. G., & Nahmias, E. J. Sternberg, & J. C. Kaufman, (Eds.), The Evo-
(2002). Is human intelligence an adaptation? lution of Intelligence (pp. 9–25 ). Mahwah, NJ:
Cautionary observations from the philosophy Erlbaum.
of biology. In R. J. Sternberg, & J. C. Kaufman, Grotzer, T. A., & Perkins, D. A. (2000). Teaching
(Eds.), The Evolution of Intelligence (pp. 1 99– of intelligence: A performance conception. In
222). Mahwah, NJ: Erlbaum. R. J. Sternberg (Ed.), Handbook of Intelligence
Flavell, J. H. (1 981 ). Cognitive monitoring. In W. (pp. 492–5 1 5 ). New York: Cambridge Univer-
P. Dickson (Ed.), Children’s Oral Communica- sity Press.
tion Skills (pp. 3 5 –60). New York: Academic Guidubaldi, J., & Duckworth, J. (2001 ). Divorce
Press. and children’s cognitive ability. In E. L. Grig-
Galton, F. (1 883 ). Inquiry into Human Faculty and orenko, & R. J. Sternberg (Eds.), Family Envi-
Its Development. London: Macmillan. ronment and Intellectual Functioning (pp. 97–
Gardner, H. (1 983 ). Frames of Mind: The Theory 1 1 8). Mahwah, NJ: Erlbaum.
of Multiple Intelligences. New York: Basic Books. Guilford, J. P. (1 967). The Nature of Human In-
Gardner, H. (1 993 ). Multiple Intelligences: The telligence. New York: McGraw-Hill.
Theory in Practice. New York: Basic Books. Guilford, J. P. (1 982). Cognitive psychology’s
Gardner, H. (1 999). Are there additional inte- ambiguities: Some suggested remedies. Psycho-
lligences? The case for naturalist, spiritual, and logical Review, 89, 48–5 9.
existential intelligences. In J. Kane (Ed.), Ed- Guilford, J. P. (1 988). Some changes in the
ucation, Information, and Transformation (pp. structure-of-intellect model. Educational and
1 1 1 –1 3 1 ). Upper Saddle River, NJ: Prentice- Psychological Measurement, 48, 1 –4.
Hall. Haier, R. J., Chueh, D., Touchette, R., Lott, I.,
Gladwin, T. (1 970). East Is a Big Bird. Cambridge, et al. (1 995 ). Brain size and cerebral glucose
MA: Harvard University Press. metabolic rate in nonspecific mental retarda-
Grantham-McGregor, S., Ani, C., & Fernald, L. tion and Down’s syndrome. Intelligence, 2 0,
(2002). The role of nutrition in intellectual de- 1 91 –21 0.
velopment. In R. J. Sternberg, & E. L. Grig- Haier, R. J., Siegel, B., Tang, C., Abel, L., &
orenko (Eds.), Environmental Effects on Cog- Buchsbaum, M. S. (1 992). Intelligence and
nitive Abilities (pp. 1 1 9–1 5 5 ). Mahwah, NJ: changes in regional cerebral glucose metabolic
Erlbaum. rate following learning. Intelligence, 1 6, 41 5 –
Grigorenko, E. L. (2000). Heritability and intel- 426.
ligence. In R. J. Sternberg (Ed.), Handbook of Herrnstein, R. J, & Murray, C. (1 994). The Bell
Intelligence (pp. 5 3 –91 ). New York: Cambridge Curve. New York: Free Press.
University Press. Hunt, E. B. (1 978). Mechanics of verbal ability.
Grigorenko, E. L. (2001 ). The invisible danger: Psychological Review, 85 , 1 09–1 3 0.
The impact of ionizing radiation on cognitive Hunt, E. B., & Lansman, M. (1 982). Individ-
development and functioning. In R. J. Stern- ual differences in attention. In R. J. Sternberg
berg, & E. L. Grigorenko (Eds.), Environmental (Ed.), Advances in the Psychology of Human
Intelligence (Vol. 1 , pp. 207–25 4). Hillsdale, NJ: Larkin, J. H., McDermott, J., Simon, D. P., &
Erlbaum. Simon, H. A. (1 980). Expert and novice per-
Jenkins, J. J. (1 979). Four points to remember: A formance in solving physics problems. Science,
tetrahedral model of memory experiments. In 2 08, 1 3 3 5 –1 3 42.
L. S. Cermak, & F. I. M. Craik (Eds.), Levels Larson, G. E., Haier, R. J., LaCasse, L., &
of Processing in Human Memory (pp. 429–446). Hazen, K. (1 995 ). Evaluation of a “mental ef-
Hillsdale, NJ: Erlbaum. fort” hypothesis for correlation between cor-
Jensen, A. R. (1 979). g: Outmoded theory or un- tical metabolism and intelligence. Intelligence,
conquered frontier? Creative Science and Tech- 2 1 , 267–278.
nology, 2 , 1 6–29. Laughon, P. (1 990). The dynamic assessment of
Jensen, A. R. (1 982). Reaction time and psycho- intelligence: A review of three approaches.
metric g. In H. J. Eysenck (Ed.), A Model for School Psychology Review, 1 9, 45 9–470.
Intelligence. Heidelberg: Springer-Verlag. Lazar, I., & Darlington, R. (1 982). Lasting effects
Jensen, A. R. (1 998). The g Factor: The of early education: A report from the consor-
Science of Mental Ability. Westport, CT: tium for longitudinal studies. Monographs of the
Praeger/Greenwood. Society for Research in Child Development, 47,
Jensen, A. R. (2002). Psychometric g: Defini- (Serial No. 1 95 , 2–3 ).
tion and substantiation. In R. J. Sternberg, & Loehlin, J. C. (2000). Group differences in in-
E. L. Grigorenko (Eds.), The General Factor telligence. In R. J. Sternberg (Ed.), Handbook
of Intelligence: How General Is It? (pp. 3 9–5 3 ). of Intelligence (pp. 1 76–1 93 ). New York: Cam-
Mahwah, NJ: Erlbaum. bridge University Press.
Jerison, H. J. (2000). The evolution of Intelli- Loehlin, J. C., Horn, J. M., & Willerman, L.
gence. In R. J. Sternberg (Ed.), Handbook of In- (1 997). Heredity, environment, and IQ in the
telligence (pp. 21 6–244). New York: Cambridge Texas adoption project. In R. J. Sternberg, & E.
University Press. L. Grigorenko (Eds.), Intelligence, Heredity, and
Kaufman, A. S. (2000). Tests of intelligence. In Environment (pp. 1 05 –1 25 ). New York: Cam-
R. J. Sternberg (Ed.), Handbook of Intelligence bridge University Press.
(pp. 445 –476). New York: Cambridge Univer- Lohman, D. F. (2000). Complex information
sity Press. processing and intelligence. In R. J. Sternberg
Kaufman, A. S., & Lichtenberger, E. O. (1 998). (Ed.), Handbook of Intelligence (pp. 285 –3 40).
Intellectual assessment. In C. R. Reynolds New York: Cambridge University Press.
(Ed.), Comprehensive Clinical Psychology. Vol. Matarazzo, J. D. (1 992). Psychological testing and
4: Assessment (pp. 203 –23 8). Tarrytown, NY: assessment in the 21 st century. American Psy-
Elsevier Science. chologist, 47(8), 1 007–1 01 8.
Keating, D. P. (1 984). The emperor’s new clothes: Mayer, R. E. (2000). Intelligence and education.
The “new look” in intelligence research. In R. J. In R. J. Sternberg (Ed.), Handbook of Intelligence
Sternberg (Ed.), Advances in the Psychology of (pp. 5 1 9–5 3 3 ). New York: Cambridge Univer-
Human Intelligence (Vol. 2, pp. 1 –45 ). Hillsdale, sity Press.
NJ: Erlbaum. Mayer, J. D., & Salovey, P. (1 997). What is emo-
Kihlstrom, J., & Cantor, N. (2000). Social intelli- tional intelligence? In P. Salovey, & D. J. Sluyter
gence. In R. J. Sternberg (Ed.), Handbook of In- (Eds.), Emotional Development and Emotional
telligence (pp. 3 5 9–3 79). New York: Cambridge Intelligence: Educational Implications (pp. 3 –
University Press. 3 4). New York: Basic Books.
Kyllonen, P. C. (2002). g: Knowledge, speed, Mayer, J. D., Salovey, P., & Caruso, D. (2000).
strategies, or working-memory capacity? A sys- Emotional intelligence. In R. J. Sternberg (Ed.),
tems perspective. In R. J. Sternberg, & E. Handbook of Intelligence (pp. 3 96–421 ). New
L. Grigorenko (Eds.), The General Factor of York: Cambridge University Press.
Intelligence: How General Is It? (pp. 41 5 –445 ). Mayes, L. C., & Fahy, T. (2001 ). Prenatal drug
Mahwah, NJ: Erlbaum. exposure and cognitive development. In R. J.
Kyllonen, P., & Christal, R. (1 990). Reasoning Sternberg, & E. L. Grigorenko (Eds.), Environ-
ability is (little more than) working-memory mental Effects on Cognitive Abilities (pp. 1 89–
capacity? Intelligence, 1 4, 3 89–43 3 . 21 9). Mahwah, NJ: Erlbaum.
intelligence 771
McGarry-Roberts, P. A., Stelmack, R. M., & mance at 24, 48, and 96 months. Intelligence,
Campbell, K. B. (1 992). Intelligence, reaction 1 8(2), 1 45 –1 63 .
time, and event-related potentials. Intelligence, Pinker, S. (1 997). How the Mind Works. New
16(3 , 4), 289–3 1 3 . York: Norton.
Neisser, U., Boodoo, G., Bouchard T. J., Boykin, Plomin, R. (1 997). Identifying genes for cogni-
W. A., Brody, N., Ceci, S. J., et al. (1 996). tive abilities and disabilities. In R. J. Sternberg,
Intelligence: Knowns and unknowns. American & E. L. Grigorenko (Eds.), Intelligence, Hered-
Psychologist, 5 1 (2), 77–1 01 . ity, and Environment (pp. 89–1 04). New York:
Nettelbeck, T. (1 987). Inspection time and in- Cambridge University Press.
telligence. In P. A. Vernon (Ed.), Speed of Posner, M. I., & Mitchell, R. F. (1 967). Chrono-
Information-processing and Intelligence (pp. 295 – metric analysis of classification. Psychological
3 46). Norwood, NJ: Ablex. Review, 74, 3 92–409.
Nettelbeck, T., & Lally, M. (1 976). Inspection Ramey, C. T., & Ramey, S. L. (2000). Intelligence
time and measured intelligence. British Journal and public policy. R. J. Sternberg (Ed.), Hand-
of Psychology, 67(1 ), 1 7–22. book of Intelligence (pp. 5 3 4–5 48). New York:
Nettelbeck, T., & Rabbitt, P. M. (1 992). Aging, Cambridge University Press.
cognitive performance, and mental speed. In- Reed, T. E. (1 993 ). Effect of enriched (complex)
telligence, 1 6(2), 1 89–205 . environment on nerve conduction velocity:
Nettelbeck, T., Rabbitt, P. M. A., Wilson, C., & New data and review of implications for the
Batt, R. (1 996). Uncoupling learning from ini- speed of information processing. Intelligence,
tial recall: The relationship between speed and 17(4), 5 3 3 –5 40.
memory deficits in old age. British Journal of Reed, T. E., & Jensen, A. R. (1 991 ). Arm nerve
Psychology, 87, 5 93 –607. conduction velocity (NCV), brain NCV, reac-
Nettelbeck, T., & Young, R. (1 996). Intelligence tion time, and intelligence. Intelligence, 1 5 , 3 3 –
and savant syndrome: Is the whole greater than 47.
the sum of the fragments? Intelligence, 2 2 , 49– Reed, T. E., & Jensen, R. (1 993 ). A somatosensory
67. latency between the thalamus and cortex also
Newell, A., & Simon, H. A. (1 972). Human Prob- correlates with level of intelligence. Intelligence,
lem Solving. Englewood Cliffs, NJ: Prentice- 17, 443 –45 0.
Hall. Salovey, P., & Sluyter, D. J. (Eds.) (1 997). Emo-
Ogbu, J. U. (1 986). The consequences of the tional Development and Emotional Intelligence:
American caste system. In U. Neisser (Ed.), The Educational Implications. New York: Basic
School Achievement of Minority Children. Hills- Books.
dale, NJ: Erlbaum. Schliemann, A. D., & Magalhües, V. P. (1 990).
Ogbu, J. U., & Stern, P. (2001 ). Caste status and Proportional reasoning: From shops, to
intellectual development. In R. S. Sternberg, kitchens, laboratories, and, hopefully, schools.
& E. L. Grigorenko (Eds.), Environmental Ef- Proceedings of the 1 4th International Con-
fects on Intellectual Functioning. Hillsdale, NJ: ference for the Psychology of Mathematics
Erlbaum. Education. Oaxtepec, Mexico.
Olson, H. C. (1 994). Fetal alcohol syndrome. In Seifer, R. (2001 ). Socioeconomic status, multi-
R. J. Sternberg (Ed.), Encyclopedia of Human ple risks, and development of intelligence. In
Intelligence (Vol. 1 , pp. 43 9–443 ). New York: R. J. Sternberg, & E. L. Grigorenko (Eds.),
Macmillan. Environmental Effects on Cognitive Abilities
Perkins, D. N. (1 995 ). Insight in minds and genes. (pp. 5 9–81 ). Mahwah, NJ: Erlbaum.
In R. J. Sternberg, & J. E. Davidson (Eds.), The Serpell, R. (2000). Intelligence and culture. In
Nature of Insight (pp. 495 –5 3 4). Cambridge, R. J. Sternberg (Ed.), Handbook of Intelligence
MA: MIT Press. (pp. 5 49–5 80). New York: Cambridge Univer-
Perkins, D. N., & Grotzer, T. A. (1 997). Teaching sity Press.
intelligence. American Psychologist, 5 2 , 1 1 25 – Simon, H. A. (1 976). Identifying basic abilities
1133. underlying intelligent performance of complex
Pianta, R. C., & Egeland, B. (1 994). Predictors tasks. In L. B. Resnick (Ed.), The Nature of In-
of instability in children’s mental test perfor- telligence (pp. 65 –98). Hillsdale, NJ: Erlbaum.
Spearman, C. (1 923 ). The Nature of ‘Intelli- struction, and assessment of gifted children: A
gence’ and the Principles of Cognition (2nd ed.). construct validation of a triarchic model. Gifted
London: Macmillan. (1 923 edition reprinted in Child Quarterly, 40(3 ), 1 29–1 3 7.
1 973 by Arno Press, New York). Sternberg, R. J., Forsythe, G. B., Hedlund, J.,
Spearman, C. (1 927). The Abilities of Man. Horvath, J., Snook, S., Williams, W. M., et al.
London: Macmillan. (2000). Practical Intelligence in Everyday Life.
Steele, C. (1 990, May). A conversation with New York: Cambridge University Press.
Claude Steele. APS Observer, pp. 1 1 –1 7. Sternberg, R. J., & Grigorenko, E. L. (Eds.)
Sternberg, R. J. (1 977). Intelligence, Information (1 997). Intelligence, Heredity, and Environment.
Processing, and Analogical Reasoning: The Com- New York: Cambridge University Press.
ponential Analysis of Human Abilities. Hillsdale, Sternberg, R. J., & Grigorenko, E. L. (1 999).
NJ: Erlbaum. Our Labeled Children: What Every Parent and
Sternberg, R. J. (1 979). Is absolute time relatively Teacher Needs to Know About Learning Disabil-
interesting? Behavioral and Brain Sciences, 2 , ities. Reading, MA: Perseus.
281 –282. Sternberg, R. J., & Grigorenko E. L. (Eds.) (2002).
Sternberg, R. J. (1 981 ). Intelligence and nonen- The General Factor of Intelligence: How General
trenchment. Journal of Educational Psychology, Is It?. Mahwah, NJ: Erlbaum.
73 , 1 –1 6. Sternberg, R. J., Grigorenko, E. L., Ferrari, M., &
Sternberg, R. J. (1 982). Natural, unnatural, and Clinkenbeard, P. (1 999). The triarchic model
supernatural concepts. Cognitive Psychology, 14, applied to gifted identification, instruction, and
45 1 –488. assessment. In N. Colangelo, & S. G. Assouline
(Eds.), Talent Development III: Proceedings from
Sternberg, R. J. (1 983 ). Components of human
the 1 995 Henry B. and Jocelyn Wallace Na-
intelligence. Cognition, 15 , 1 –48.
tional Research Symposium on Talent Develop-
Sternberg, R. J. (1 985 ). Beyond IQ: A Triarchic ment (pp. 71 –80). Scottsdale, AZ: Gifted Psy-
Theory of Human Intelligence. New York: Cam- chology Press.
bridge University Press.
Sternberg, R. J., & Kaufman, J. C. (1 996). Innova-
Sternberg, R. J. (1 987). Teaching intelligence: tion and intelligence testing: The curious case
The application of cognitive psychology to of the dog that didn’t bark. European Journal of
the improvement of intellectual skills. In J. B. Psychological Assessment, 1 2 (3 ), 1 75 –1 82.
Baron, & R. J. Sternberg (Eds.), Teaching Think- Sternberg, R. J., & Kaufman J. C. (1 998). Human
ing Skills: Theory and Practice (pp. 1 82–21 8). abilities. Annual Review of Psychology, 49, 479–
New York: Freeman Press. 5 02.
Sternberg, R. J. (1 988). The Triarchic Mind: A Sternberg, R. J., Nokes, K., Geissler, P. W., Prince,
New Theory of Human Intelligence. New York: R., Okatcha, F., Bundy, D. A., et al. (2001 ).
Viking. The relationship between academic and prac-
Sternberg, R. J. (1 996). For whom does the Bell tical intelligence: A case study in Kenya. Intel-
Curve toll? It tolls for you. Journal of Quality ligence, 2 9, 401 –41 8.
Learning, 6(1 ), 9–27. Sternberg, R. J., Powell, C., McGrane, P. A.,
Sternberg, R. J. (1 997). Successful Intelligence. & McGregor, S. (1 997). Effects of a para-
New York: Plume. sitic infection on cognitive functioning. Jour-
Sternberg, R. J. (1 999). A triarchic approach to nal of Experimental Psychology: Applied, 3 , 67–
the understanding and assessment of intelli- 76.
gence in multicultural populations. Journal of Sternberg, R. J., & Ruzgis, P. (Eds.) (1 994). Per-
School Psychology, 3 7, 1 45 –1 5 9. sonality and Intelligence. New York: Cambridge
Sternberg, R. J., Conway, B. E., Ketron, J. L., & University Press.
Bernstein, M. (1 981 ). People’s conceptions of Sternberg, R. J., Torff, B., & Grigorenko, E. L.
intelligence. Journal of Personality and Social (1 998). Teaching triarchically improves school
Psychology, 41, 3 7–5 5 . achievement. Journal of Educational Psychology,
Sternberg, R. J., & Detterman, D. K. (1 986). 90, 3 74–3 84.
What Is Intelligence? Norwood, NJ: Ablex. Sternberg, R. J., & Wagner, R. K. (Eds.) (1 994).
Sternberg, R. J., Ferrari, M., Clinkenbeard, P. R., Mind in Context. New York: Cambridge Uni-
& Grigorenko, E. L. (1 996). Identification, in- versity Press.
intelligence 773
Terman, L. M., & Merrill, M. A. (1 93 7). Measur- Wahlsten, D. & Gottlieb, G. (1 997). The invalid
ing Intelligence. Boston: Houghton Mifflin. separation of effects of nature and nurture:
Terman, L. M., & Merrill, M. A. (1 973 ). Stanford– Lessons from animal experimentation. In R. J.
Binet Intelligence Scale: Manual for the Third Re- Sternberg, & E. L. Grigorenko (Eds.), Intelli-
vision. Boston: Houghton Mifflin. gence, Heredity, and Environment (pp. 1 63 –1 92).
Thorndike, R. L., Hagen, E. P., & Sattler, J. New York: Cambridge University Press.
M. (1 986). Technical Manual for the Stanford– Wechsler, D. (1 980). Wechsler Preschool and Pri-
Binet Intelligence Scale: 4th Edition. Chicago: mary Scale of Intelligence – Revised. (WPPSI).
Riverside. San Antonio, TX: Psychological Corporation.
Thurstone, L. L. (1 93 8). Primary Mental Abilities. Wechsler, D. (1 991 ). Manual for the Wechsler Intel-
Chicago: University of Chicago Press. ligence Scales for Children (3 rd ed.) (WISC-III).
Turner, M. L., & Engle, R. W. (1 989). Is work- San Antonio, TX: Psychological Corporation.
ing memory capacity task dependent? Journal Wechsler, D. (1 997). Manual for the Wechsler
of Memory and Language, 2 8(2), 1 27–1 5 4. Adult Intelligence Scales (WAIS-III). San Anto-
Vernon, P. E. (1 971 ). The Structure of Human Abil- nio, TX: Psychological Corporation.
ities. London: Methuen. Wickett, J. C., & Vernon, P. A. (1 994). Peripheral
Vernon, P. A., & Mori, M. (1 992). Intelligence, re- nerve conduction velocity, reaction time, and
action times, and peripheral nerve conduction intelligence: An attempt to replicate Vernon
velocity. Intelligence, 8, 273 –288. and Mori. Intelligence, 18, 1 27–1 3 2.
Vernon, P. A., Wickett, J. C., Bazana, P. G., & Stel- Wissler, C. (1 901 ). The correlation of mental and
mack, R. M. (2000). The neuropsychology and physical tests. Psychological Review, Monograph
psycholophysiology of human intelligence. In Supplement, 3 (6).
R. J. Sternberg (Ed.), Handbook of Intelligence Zeidner, M. (1 990). Perceptions of ethnic group
(pp. 245 –264). New York: Cambridge Univer- modal intelligence: Reflections of cultural
sity Press. stereotypes or intelligence test scores? Journal
Wagner, D. A. (1 978). Memories of Morocco: of Cross-Cultural Psychology, 2 1 (2), 21 4–23 1 .
The influence of age, schooling and environ- Zentall, T. R. (2000). Animal intelligence. In
ment on memory. Cognitive Psychology, 1 0, 1 – R. J. Sternberg (Ed.), Handbook of Intelligence
28. (pp. 1 97–21 5 ). New York: Cambridge Univer-
Wagner, R. K. (2000). Practical intelligence. In R. sity Press.
J. Sternberg (Ed.), Handbook of Human Intel- Zigler, E., & Berman, W. (1 983 ). Discerning the
ligence (pp. 3 80–3 95 ). New York: Cambridge future of early childhood intervention. Ameri-
University Press. can Psychologist, 3 8, 894–906.
CHAPTER 3 2
Learning to Think: The Challenges

of Teaching Thinking
Ron Ritchhart
David N. Perkins
The idea that thinking can be taught, or at ing or even support. Indeed, neurological
least productively nurtured along its way, is findings suggest that the brain is hard-wired
ancient. Beginning with the efforts of Plato for just such activities as a basic mechanism
and the introduction of Socratic dialog, we for facilitating language development, so-
see attention to improving intelligence and cialization, and general environmental sur-
promoting effective thinking as a recurring vival. Furthermore, it might be assumed
educational trend throughout the ages. Early that these basic thinking skills are already
in the twentieth century, Dewey (1 93 3 ) enhanced through the regular processes of
again focused North American’s attention schooling, as students encounter the work
on the importance of thinking as an educa- of past thinkers, engage in some debate,
tional aim. At the same time, Selz (1 93 5 ) write essays, and so on. Why, then, should
was advocating the idea of learnable intel- we concern ourselves with the teaching and
ligence in Europe. In the 1 970s and 1 980s, learning of thinking? Addressing these is-
specific programs designed to teach think- sues entails looking more closely at a fuller
ing took shape, many of which continue in range of thinking, particularly what might
schools today. Efforts to teach thinking have be called high-end thinking, as well as ex-
proliferated in the new millennium, often amining the role education plays in promot-
becoming less programmatic in nature and ing thinking.
more integrated within the fabric of schools. Although it is true that the human mind
Despite this long history of concern with comes readily equipped for a wide variety of
thinking, one reasonably might ask: Why do thinking tasks, it is equally true that some
we need to “teach” thinking anyway? After kinds of thinking run against these natural
all, given reasonable access to a rich cultural tendencies. For example, probabilistic think-
surround, individuals readily engage in sit- ing is often counterintuitive in nature or
uated problem solving, observing, classify- doesn’t fit well with our experience (Tversky
ing, organizing, informal theory building and & Kahneman,1 993 ; also see Kahneman &
testing, and so on, without much prompt- Frederick, Chap. 1 2). We have a natural
775
tendency toward favoring our own posi- natural ability to make inferences becomes
tion and interests – my-side bias (Molden & much more sophisticated through system-
Higgins, Chap. 1 3 ) – that can lead to poor atized processes of reasoning with evidence,
conclusions in decision making and discern- weighing evidentiary sources, and drawing
ments of truth (Baron, et al. 1 993 ). We justifiable conclusions. Indeed, for most
frequently draw conclusions and inferences thinking abilities that might be considered
based on limited evidence (Perkins, 1 989, naturally occurring, one can usually identify
1 995 ). The fundamental attribution error a more sophisticated form that such think-
(Harvey, Town, & Yarkin, 1 981 ) names the ing might take with some deliberate nurtur-
tendency, particularly in Westerners, to as- ing. This type of thinking is what is often
cribe characterological traits to others based referred to as high-end thinking or criti-
on limited but highly salient encounters. cal and creative thinking. Such thinking ex-
Furthermore, sometimes our natural ways tends beyond a natural processing of the
of making sense of the world actually stand world into the realm of deliberative thinking
in the way of more effective ways of think- acts aimed at solving problems, making de-
ing. For instance, our ability to focus at- cisions (see LeBoeuf & Shafir, Chap. 1 1 ), and
tention can lead to narrowness of vision forming conclusions.
and insight. Our natural tendency to detect The contribution of schooling to the de-
familiar patterns and classify the world can velopment of thinking is a vexed matter (see
lock us into rigid patterns of action and Greenfield, Chap. 27, for a cross-cultural
trap us in the categories we invent (Langer, perspective on the impact of schooling). On
1 989). Relatedly, already developed under- the one hand, it is clear that schooling en-
standings constitute systems of knowledge hances performance of various kinds on for-
that are much more readily extended than mal tasks and IQ-like instruments (Grotzer
displaced: We tend to dismiss or recast chal- & Perkins, 2000; Perkins, 1 985 ; see Stern-
lenges rather than rethinking our under- berg, Chap. 3 1 , for a discussion of intelli-
standings, which is a deep and general gence). For the most part, however, schools
problem of learning (see Chi and Ohlsson, have addressed knowledge and skill acqui-
Chap. 1 6). Our emotional responses to sition. The narrowness of this focus and
situations can easily override more de- absence of strong efforts to nurture think-
liberative thinking (Goleman, 1 995 ). The ing were criticized by Dewey at the turn
phenomenon of groupthink, in which the of the century. Such critiques have contin-
dominant views of the group are readily ued until today from a variety of sources. In
adopted by group members, can lead to lim- a series of empirical investigations, Perkins
ited processing and discernment of infor- and colleagues (Perkins, Allen, & Hafner,
mation (Janis, 1 972). These are just a few 1 983 ; Perkins, Faraday & Busheq, 1 991 ) in-
thinking shortfalls suggesting that truly good vestigated the impact of conventional ed-
thinking does not automatically develop in ucation at the high school, university, and
the natural course of events. graduate school levels on informal reasoning
Even when our native tendencies do not about everyday issues. Cross-sectional stud-
lead us astray, they can usually benefit from ies examining the impact of three years of
development. The curiosity of the child for high school, college, and graduate school re-
discovering and making sense of the world vealed only marginal gains (Perkins, 1 985 ).
does not automatically evolve into an intel- Several national reports on schooling in the
lectual curiosity for ideas, knowledge, and 1 980s discussed how schools were domi-
problem solving (Dewey, 1 93 3 ), for exam- nated by rote work and involved very lit-
ple. Our ability to see patterns and rela- tle thinking (Boyer, 1 983 ; National Com-
tionships forms the basis for inductive rea- mission on Excellence in Education, 1 983 ;
soning (see Sloman & Lagnado, Chap. 5 ), Goodlad, 1 983 ).
but the latter requires a level of precision The problems of overcoming thinking
and articulation that must be learned. Our shortfalls while enhancing native thinking
the challenges of teaching thinking 777
processes through education therefore con- is that of transfer, a pivotal concern within
stitute an important rationale for the ex- the teaching of thinking. We conclude with
plicit teaching of thinking. Furthermore, as a fifth challenge, that of creating cultures of
knowledge and information become at the thinking, in which we examine the social
same time more complex and more acces- context and environment in which think-
sible, critics argue that teaching thinking ing is being promoted. Each of these chal-
should be considered even more of a pri- lenges involves key philosophical and prac-
ority (Resnick, 1 987). In this setting, it is tical issues that all efforts to teach thinking,
not enough to simply consume predigested whether undertaken by a single teacher or
knowledge, one must also become a knowl- a major research university, must confront.
edge builder (Scardamalia, Bereiter, & La- We review the ways in which various efforts
mon, 1 994) and problem solver (Polya, 1 95 7; to teach thinking address these challenges
Schoenfeld, 1 982; Selz, 1 93 5 ). to clarify just what is involved in teaching
This need for thinking instruction has led thinking.
to a rapid increase in efforts to teach thinking
over the past thirty years. During this time, a
few well-established thinking programs have The Challenge of Attaining Results
taken hold in schools and sustained their de-
velopment, while a plethora of new pro- As is the case with any class of educational
grams, often small interventions based on interventions, one of the most fundamental
current cognitive theory, have flourished. questions to be asked is: Do they work –
In addition, an increasing array of subject- at least with some populations under some
based programs and designed learning en- circumstances? This is especially important
vironments aimed at developing students’ for an area like the teaching of thinking,
thinking also have emerged. These programs which is haunted by skepticism on the part
deal with many different aspects of think- of lay people and some scholars.
ing, including critical and creative thinking It may seem premature to turn to findings
(for more on creative thinking, see Sternberg without discussing details about background
et al. Chap. 1 5 ), reflective and metacognitive theories and issues in the field, but letting
thinking, self-regulation, decision-making, the question of impact hover for many pages
and problem solving, as well as disciplinary while we deal with such matters also seems
forms of thinking. troublesome. After all, if there isn’t at least
All of these programs – whether aimed some indication that thinking can be taught,
at developing thinking as part of a stand- then the remaining challenges become aca-
alone course within the context of teach- demic. Accordingly, we turn to this ultimate
ing a particular subject or as part of a larger challenge first, asking whether, at least some-
design of the instructional environment – times, coordinated efforts to teach thinking
confront at least five important challenges work in a reasonable sense, also taking it as
in their efforts to develop thinking. We use an opportunity to put quick profiles of sev-
these as the basis for the present review. eral interventions on the table to give readers
The first challenge relates to the bottom a feel for the range of approaches.
line: Can thinking be taught with some rea- In looking for success, it is helpful to bear
sonable signs of success? The second chal- in mind three broad criteria – magnitude,
lenge concerns what is meant when one talks persistence, and transfer (Grotzer & Perkins,
about good thinking. Programs and efforts to 2000). An intervention appears successful
teach thinking are shaped largely by the an- to the extent that it shows some magni-
swer to this question. The third challenge tude of impact on learners’ thinking with
deals with the dispositional side of think- effects that persist well beyond the period
ing, not just skills and processes but atti- of instruction and with transfer to other
tudes and intellectual character (Ritchhart contexts and occasions. Previous reviewers
2002; Tishman 1 994). The fourth challenge of thinking programs pointed out that the
empirical evidence needed to assess program would have been expected, and significantly
effectiveness is often hard to come by in better than GE subjects by about a third of
the research literature (e.g., Adams, 1 989; a standard deviation on incidental follow-
Nickerson, Perkins, & Smith, 1 985 ; Stern- up testing on an Army Intelligence test
berg, 1 986), often because of the lack of (DAPAR) two years later (Feuerstein et al.,
funding for careful long-term program eval- 1 981 ; Rand, Tannenbaum, & Feuerstein,
uation. We emphatically do not limit this ar- 1 979). These findings show both magnitude
ticle only to those programs receiving exten- and persistence of effects, with some trans-
sive evaluation, but we do focus this section fer. The program uses testlike activities, so
on a few such programs. The good news is the transfer to a nonverbal intelligence test
that the history of efforts to teach thinking might be considered a case of near trans-
provides proofs for achieving all three crite- fer (Perkins & Salomon, 1 988). Evidence of
ria, at least to some extent. transfer to school tasks – far transfer – seems
Programs designed to teach thinking to depend on the individual teacher or in-
come in many different styles. For instance, structor, who is responsible for providing the
some programs are designed to develop dis- bridging (Savell, Twohig, & Rachford, 1 986;
crete skills and processes such as classifica- Sternberg, 1 986).
tion and sequencing as means of developing These findings have proved less easily
the building blocks for thinking. Paul (1 984) replicated with students of average or above-
refers to these programs as “micrological” in average ability. What is consistent, however,
nature. They often find their theoretical jus- is the change in behavior and attitude stu-
tification in theories of intelligence (see next dents experience, generally in terms of in-
section for more on how various programs creased confidence in abilities and a more
define good thinking), and they often use de- positive attitude toward school work (Blagg,
contextualized and abstract materials similar 1 991 ; Kriegler, 1 993 ).
to those one might find on standardized psy- Another type of program to teach think-
chometric tests. ing tends to be more “macrological” in na-
Perhaps the best-known program of this ture (Paul, 1 984), being contextualized and
type is Instrumental Enrichment (IE) (Feuer- real world oriented, focusing on more broad-
stein, 1 980). It uses very abstract, test-like based skills such as considering multiple
activities to develop skills in areas such points of view, dealing with complex in-
as comparisons, categorization, syllogisms, formation or creative problem solving. Phi-
and numerical progressions, among others. losophy for Children (Lipman, 1 976), and
Instructors are encouraged to “bridge” the CoRT (Cognitive Research Trust) (de Bono,
abstract exercises by relating the skills to 1 973 ), are examples of this approach. The
world problem solving. Instrumental en- Philosophy for Children program engages
richment was designed to bring students students in philosophical discussions around
who show marked ability deficits into main- a shared book to cultivate students’ abil-
stream culture, although it can be used with ity to draw inferences, make analogies, form
other students as well. hypotheses, and so forth. The CoRT pro-
In one study, matched samples of gram teaches a collection of thinking “op-
low functioning, low socio-economic status erations,” defined by acronyms for creative
(SES) twelve- to fifteen-year-olds partici- and critical thinking; operations these aim
pated in IE or general enrichment (GE) pro- to broaden and organize thinking and fa-
grams providing direct help, such as math cilitate dealing with information. Through
or science tutoring. Instrumental enrich- a developed set of practice problems, for
ment subjects made greater pre- to post- instance, students learn to apply the PMI
test gains on tests of interpersonal con- operation (plus, minus, interesting), iden-
duct, self-sufficiency, and adaptation to work tifying the pluses, minuses, and interesting
demands. Instrumental enrichment subjects but otherwise neutral points about a matter
scored slightly above normal, far better than at hand.
Both of these programs have been around Another program worth mentioning is a
long enough to develop a strong base and unique hybrid. The Odyssey (Adams, 1 986)
avid followers, resulting in a wealth of anec- program developed through a collaboration
dotal evidence and reports of effectiveness. between Harvard Project Zero, Bolt Beranek
Indeed, observers of these programs tend to and Newman, Inc., and the Venezuela Min-
be impressed with the involvement of stu- istry of Education was specifically designed
dents and the level of thinking demonstrated to systematically build macrological skills
(Adams, 1 989). Furthermore, some evidence upon micrological skills. The first lessons of
can be found to support both programs. Ed- the program deal with micrological skills, or
wards (1 994) reports that twelve-year-olds what the program developers call first-order
taught all sixty lessons of the CoRT pro- processes of classification, hierarchical clas-
gram showed improved scores on quantita- sification, sequencing, and analogical reason-
tive as well as qualitative measures. Com- ing, to build the foundation for the macro-
pared with other seventh grade students, logical process of dimensional analysis.
scores of CoRT students ranged from 48% to Processes often are introduced in the ab-
62% above the national mean on standard- stract, but then application is made to varied
ized tests, whereas other seventh graders’ contexts. The program takes the form of a
scores ranged from 25 % to 43 % above the separate course with 1 00 lessons, but it seeks
national norm of 3 1 %, indicating a mag- to connect directly to the scholastic activi-
nitude effect. Teachers reported improve- ties of students and provide links to everyday
ments in student thinking and confidence. life as well. The Odyssey program has been
Although students reported using the skills evaluated only in Venezuela. In a relatively
in other areas of their lives, there was no for- large evaluation of the program involving
mal measure of transfer on this evaluation. roughly 900 students in control and exper-
Other evaluations revealed mixed results on imental groups across twenty-two seventh
transfer (Edwards & Baldauf, 1 983 , 1 987). grade classes, the group gains of the exper-
The program produces an interesting find- imental group were 1 1 7 percent more than
ing with respect to persistence that should that of the control group on course-designed
be noted. Although reviews of research on pre- and postmeasurements – a strong indi-
CoRT suggest that the effects were short- cator of magnitude of effects. A battery of
term (Edwards, 1 991 a, 1 991 b), it was found tests were used to assess for transfer, includ-
that a small amount of follow-up reinforce- ing those of general ability, word problems,
ment given in the two years after the inter- and nonverbal reasoning. All showed signif-
vention resulted in increased persistence of icant gains for the experimental group, indi-
effects with scores that were one-third better cating both magnitude and transfer of effects
than controls three years after the interven- (Herrnstein, et al., 1 986).
tion (Edwards, 1 994). The abovementioned programs, whether
With respect to Philosophy for Children, focusing on micrological or macrological
evaluations have shown that children in skills, were stand-alone interventions with
grades four to eight display significant gains perhaps a modest degree of integration. A
in reading comprehension or logical think- number of programs are fully integrated and
ing (Lipman, 1 983 ). Transfer is built into the connected to the curriculum. A few of these
program because the discussions are text- are Intuitive Math (Burke, 1 971 ) and Problem
based and consequently deepen comprehen- Solving and Comprehension (Whimbey and
sion while teaching and modeling thinking Lochhead, 1 979), both focused on mathe-
strategies within the real world contexts of matics, and Think (Adams, 1 971 ) and Re-
the stories. As Adams (1 989, p. 3 7) points ciprocal Teaching (Brown & Palincsar, 1 982),
out, the texts give “Lipman the freedom to which are focused on language arts and read-
introduce, reintroduce, and elaborate each ing. All of these programs are designed to
logical process across a diversity of real- connect thinking processes to specific school
world situations.” content to enhance student understanding
and thinking. Think and Intuitive Math fo- The Challenge of Defining
cus on skills such as classification, structure Good Thinking
analysis, and seeing analogies. Problem Solv-
ing and Comprehension uses a technique Any program that aspires to teach think-
called “paired problem solving” to develop ing needs to face the challenge of defining
metacognitive awareness of one’s thinking good thinking, not necessarily in any ulti-
during problem solving. Reciprocal Teaching mate and comprehensive sense but at least in
is not so much a program as an approach to some practical, operational sense. With the
teaching reading comprehension. Through a foregoing examples of programs in mind, it
dialog with the teacher, students engage in will come as no surprise that many differ-
cycles of summarizing, question generating, ent approaches have been taken to answer
clarifying, and predicting. All of these inter- this challenge.
ventions have been shown to produce im- To begin, it is useful to examine some gen-
pressive results for their target populations, eral notions about the nature of good think-
generally low-achieving students, within the ing. There are a number of very broad char-
domains of their focus. In addition, transfer acterizations. Folk notions of intelligence, in
effects have been documented for Intuitive contrast with technical notions, boil down
Math and Think (Worsham & Austin, 1 983 ; to good thinking. A number of years ago,
Zenke & Alexander, 1 984). Sternberg et al. (1 981 ) reported research
As promised, these examples – and oth- synthesizing the characteristics people en-
ers discussed later – offer a kind of exis- vision when they think of someone as in-
tence proof regarding the challenge of attain- telligent. Intelligent individuals reason sys-
ing results (more reviews of these and other tematically, solve problems well, think in a
thinking programs can be found in Adams, logical way, deploy a good vocabulary, make
1 989; Grotzer & Perkins, 2000; Hamers & use of a rich stock of information, remain fo-
Overtoom, 1 997; Idol, 1 991 ; McGuinness & cused on their goals, and display intelligence
Nisbet, 1 991 ; Nickerson et al., 1 985 ; Perkins, in practical as well as academic ways. Perkins
1 995 ; Sternberg, 1 986). They give evidence (1 995 ) summed up a range of research on
that instruction designed to improve learn- difficulties of thinking by noting the human
ers’ thinking can advance it, with persistent tendency to think in ways that are hasty (im-
impact, and with some degree of transfer to pulsive, insufficient investment in deep pro-
other contexts and occasions. Along the way, cessing and examining alternatives), narrow
they also illustrate how rather different ap- (failure to challenge assumptions, examine
proaches can serve this purpose. other points of view), fuzzy (careless, im-
This is not to say that such results demon- precise, full of conflations), and sprawling
strate overwhelming success. Impacts on (general disorganization, failure to advance
learners’ thinking are typically moderate or conclude). Baron (1 985 ) advanced a
rather than huge. The persistence of effects search-and-inference framework that em-
tapers off after a period of months or years, phasized effective search and inference
particularly when learners return to settings around forming beliefs, making decisions,
that do not support the kind of development and choosing goals. Ennis (1 986) offered a
in question. Transfer effects are often spotty list of critical thinking abilities and disposi-
rather than sweeping. These limitations are tions, including traits such as seeking and of-
signs that the grandest ambitions regarding fering reasons, seeking alternatives, and be-
the teaching of thinking are yet to be real- ing open-minded. There are many others
ized. That said, enough evidence is at hand as well.
to show that the prospects of teaching think- The overlap among such conceptions is
ing cannot simply be dismissed on theoreti- apparent. They can be very useful for a broad
cal or empirical grounds. This opens the way overview and for the top level of program
for a deeper consideration of the challenges design, but they are not virtues of thinking
of doing so in the upcoming sections. that learners can straightforwardly learn or
teachers teach. They do not constitute a texts. Heuristic analyses have been devised
good theory of action (e.g., Argyris, 1 993 ; and taught for many generic thinking prac-
Argyris & Schön, 1 996) that would guide tices – everyday decision making, problem
and advise learners about how to improve solving, evaluating of claims, creative think-
their thinking, or guide and advise teachers ing, and so on.
and program designers about how to culti- Looking to programs mentioned earlier
vate thinking. With this general challenge for examples, we note that the CoRT pro-
in mind, we turn to describing three ap- gram teaches “operations” such as PMI (con-
proaches through which researchers and ed- sider plus, minus, and interesting factors in a
ucators have constructed theories of action situation) and OPV (consider other points
that characterize good thinking – by way of of view) (de Bono, 1 973 ). The Odyssey
norms and heuristics, models of intelligence, program teaches strategies for decision-
and models of human development. making, problem solving, and creative de-
sign, among others, foregrounding familiar
strategies such as looking for options be-
Norms and Heuristics
yond the obvious, trial and error, and ar-
One common approach to defining good ticulation of purposes (Adams, 1 986). Polya
thinking is to characterize concepts, stan- (1 95 4, 1 95 7) offered a well-known analysis
dards, and cognitive strategies that serve of strategies for mathematical problem solv-
a particular kind of thinking well. These ing, including examining special cases, ad-
guide performance as norms and heuristics. dressing a simplified form of the problem
When people know the norms and heuris- first, and many others. This led to a num-
tics, they can strive to improve their practice ber of efforts to teach mathematical prob-
accordingly. The result is a kind of “craft” lem solving, with unimpressive results, until
conception: Good thinking is a matter of Schoenfeld (1 982 ; Schoenfeld & Herrmann,
mastering knowledge, skills, and habits ap- 1 982) demonstrated a very effective inter-
propriate to the kind of thinking in question vention that included the instructor’s work-
as guided by the norms and heuristics. ing problems while commenting on strate-
Norms provide criteria of adequacy for gies as they were deployed, plus emphasis
products of thinking such as arguments or on the students’ self-management of the
grounded decisions. Examples of norms in- problem-solving process. Many simple read-
clude suitable conditions for formal deducing strategies have been shown to improve
tion or statistical adequacy, formal (e.g., af- student retention and understanding when
firming the consequent) or informal (e.g., ad systematically applied, including, for exam-
hominem argument) fallacies to be avoided, ple, the previously mentioned “reciprocal
or maximized payoffs in game theory (Ham- teaching” framework in which young read-
blin, 1 970; Nisbett, 1 993 ; Voss, Perkins, & ers interact conversationally in small groups
Segal, 1 991 ). Heuristics guide the process of around a text to question, clarify, summa-
thinking, but without the guarantees of suc- rize, and predict (Brown & Palincsar, 1 982).
cess that an algorithm provides. For instance, Nisbett (1 993 ) reported a series of stud-
mathematical problem solvers often do well ies conducted by himself and colleagues
to examine specific cases before attempting about the effectiveness of teaching norms
a general proof or to solve a simpler related and heuristics of statistical, if-then, cost-
problem before tackling the principal prob- benefit, and other sorts of reasoning, mainly
lem (Polya, 1 95 4, 1 95 7). to college students. Nisbett concluded that
The norms and heuristics approach fig- instruction in rules of reasoning was consid-
ures widely in educational endeavors. Train- erably more effective than critics of general,
ing in norms of argument goes back at least context-free rules for reasoning had claimed.
to the Greek rhetoriticians (Hamblin, 1 970) To be sure, student performance displayed
and continues in numerous settings of for- a range of lapses and could have been bet-
mal education today with many available ter. Nonetheless, students often applied the
patterns of reasoning that they were study- consider general intelligence in the sense
ing quite widely, well beyond the content of Spearman’s g factor to be unmodifiable
foregrounded in the instruction. Relatively by direct instructional interventions (Brody,
abstract and concise formulations of princi- 1 992; Jensen, 1 980, 1 998). On the other
ple alone led to some practical use of rules hand, a single factor does not afford much of
for reasoning, and this improved when in- a theory of action, because it does not break
struction included rich exploration of exam- down the learning problem into components
ples. Nisbett emphasized that we could cer- that can be addressed systematically.
tainly teach rules for reasoning much better Models of intelligence with components
than we do. Nonetheless, the basic enterprise offer more toward a theory of action.
appeared to be sound. J. P. Guilford’s 1 967 (Guilford & Hoepfner,
To summarize, the characteristic peda- 1 971 ) Structure of Intellect (SOI) model,
gogy of the approach through norms and for example, proposes that intelligence in-
heuristics follows from its emphasis on volves no fewer than 1 5 0 different com-
thinkers’ theories of action. Programs of this ponents generated by a three-dimensional
sort typically introduce norms and heuris- analysis involving several cognitive opera-
tics directly, demonstrate their application, tions (cognition, memory, evaluation, con-
and engage learners in practice with a range vergent production, divergent production)
of problems, often with an emphasis on crossed with several kinds of content (be-
metacognitive awareness, self-management, havioral, visual figural, and more) and cog-
and reflection on the strategies, general char- nitive products (units, classes, relations,
acter, and challenges of thinking. and more). An intervention developed by
Readily grasped concepts and standards, Meeker (1 969) aims to enhance the func-
strategies with three or four steps, and the tioning of a key subset of these compo-
like characterize the majority of norms and nents. Feuerstein (1 980) argues that in-
heuristics approaches. One objection to such telligence is modifiable through mediated
simplicity is that it can seem simpleminded. learning (with a mediator scaffolding learn-
“Everyone knows” that people should con- ers on the right kinds of tasks). His In-
sider both sides of the case in reasoning or strumental Enrichment program offers a
look for options beyond the obvious. How- broad range of mediated activities orga-
ever, as emphasized in the introduction to nized around three broad categories of cog-
this article, such lapses are commonplace. nitive process – information input, elabora-
Everyone does not know, and those who tion, and output – to work against problems
do know often fail to do so. The point of such as blurred and sweeping perception,
norms and heuristics most often is not to re- impulsiveness, unsystematic hypothesis test-
veal novel or startling secrets of a particular ing, and egocentric communication.
kind of thinking but to articulate some ba- Sternberg (1 985 ) developed the triarchic
sics and help bridge from inert knowledge to theory of intelligence over a number of years,
active practice. featuring three dimensions of intelligence –
analytic (as in typical IQ tests), practical
(expert “streetwise” behavior in particular
Models of Intelligence
domains), and creative (invention, innova-
The norms and heuristics approach to defin- tion). Sternberg, et al. (1 996) report an in-
ing and cultivating good thinking may be the tervention based on Sternberg’s (1 985 ) tri-
most common, but another avenue looks di- archic theory of intelligence: High school
rectly to models of intelligence (see Stern- students taking an intensive summer col-
berg, Chap. 3 1 ). Not so often encoun- lege course were grouped by their strengths
tered in the teaching of thinking is good according to Sternberg’s three dimensions
thinking defined through classic intelligence and taught the same content in ways build-
quotient (IQ) theory. On the one hand, ing on their strengths. The study included
many, although by no means all, scholars other groups not matched with their
strengths. Matched students exhibited supe- evaluation compared CASE students with
rior performance. control students on school science achieve-
The typical pedagogy of interventions ment tests with delayed posttesting. For
based on models of intelligence empha- some groups, substantial and statistically
sizes not teaching norms and heuristics but significant differences emerged for science,
rather providing abundant experience with mathematics, and English performance two
the thinking processes in question in moti- years after participation in CASE, demon-
vated contexts with strong emphasis on at- strating magnitude, persistence, and transfer
tention and self-regulation. Often, although of impact, the criteria used in the foregoing
by no means always – the Sternberg inter- results section (Adey & Shayer, 1 994, p. 92).
vention is an exception here, for example – Although this example takes a stage-like
the tasks have a rather abstract character view of human development, another tra-
on the theory that the learning activities dition looks to the work of Vygotsky and
are enhancing the functioning of fundamen- his followers, seeing development more as
tal cognitive operations and content is best a process of internalization from social situ-
selected for minimal dependence on back- ations that scaffold for the thinking of the
ground knowledge. That said, it is impor- participant (1 978). In addition to its Pi-
tant to recognize that no matter what the agetian emphasis, the work of Adey and
underlying theory – norms and heuristics, Shayer draws upon social scaffolding. Scar-
intelligence-based, or developmental, as in damalia and colleagues developed an initia-
the following section – interventions often tive initially called CSILE (Computer Sup-
pragmatically combine a variety of methods ported Intentional Learning Environments)
rather than proceeding in a purist manner. and now Knowledge Forum, that engages
students in the collaborative construction of
knowledge through an online environment
Models of Human Development that permits building complex knowledge
structures and labels for many important
Another approach to defining good think- epistemic elements such as hypotheses and
ing looks to models of human development evidence (Scardamalia, et al., 1 989). The
that outline how cognitive development nor- social character of the enterprise and the
mally advances, often through some se- forms of discourse it externalizes through
quence of stages that represent plateaus the online environment create conditions for
in the complexity of cognition, as with Vygotskian internalization of patterns of
the classic concrete and formal operational thinking. Studies of impact have shown gains
stages of Inhelder and Piaget (1 95 8; see Hal- in students’ depth of explanation and knowl-
ford, Chap. 22). For example, the program edge representation, capability in dealing
called Cognitive Acceleration through Sci- with difficult texts, recall of more infor-
ence Education (CASE) (Adey & Shayer, mation from texts, and deeper conceptions
1 993 , 1 994) teaches patterns of thinking in of the nature of learning with more of a
science – for instance the isolation and con- mastery emphasis (Scardamalia, Bereiter, &
trol of variables – based on Piagetian princi- Lamon, 1 994).
ples of uncovering students’ prior concep- Of course, developmental psychology has
tions and creating opportunities for them evolved greatly since the days of Vygotsky
to reorganize their thinking. Lessons intro- and Piaget. For example, the past half
duce cognitive dissonance around particular century has seen development explained
puzzles so students are led to examine their in terms of expansion in, and more effi-
assumptions and rethink their prior con- cient use of, working memory (e.g., Case,
ceptions. In addition to the thinking skills, 1 985 ; Fischer, 1 980; Pascual-Leone, 1 978);
the program focuses explicitly on fostering semi-independent courses of development
metacognition and transferring knowledge traced in different domains (e.g., Case,
and strategies between contexts. A formal 1 992; Fischer, 1 980; Carey, 1 985 ); strands of
development attributed to the modularity you cannot teach directly the underlying
of mind, with innate mental structures an- logical structures. Learners must attain them
ticipating certain kinds of knowledge (e.g., by wrestling with the right kinds of problems
Detterman, 1 992; Hirschfeld & Gelman, under appropriately reflective and support-
1 994), and so on. ive conditions.
It is not the role of this chapter to review
the complexities of contemporary develop-
What Effect Does a Theory of Good
mental psychology, especially because as far
Thinking Have?
as we know, few approaches to the teaching
of thinking have based themselves on recent With approaches to defining good thinking
developmental theory. Quite likely, there through heuristic analysis, intelligence, and
are substantial opportunities that have not human development on the table, perhaps
been taken. To give a sense of the promise, the most natural question to ask is which
Case (1 992) advanced the idea of central con- approach is “right” and therefore would lead
ceptual structures, which are core structures to the most powerful interventions. Unfor-
in broad domains such as quantity, narra- tunately, the matter is far too complex to
tive, and intentionality that lie at the foun- declare a winner. One complication is that
dations of development in these domains all programs, despite their theoretical differ-
and enable further learning. Working from ences, share key features. All programs en-
this notion, Griffin, Case, and Capodilupo gage learners in challenging thinking tasks
(1 995 ) designed and assessed an interven- that stretch beyond what they normally un-
tion called Rightstart to develop the cen- dertake. All programs place some empha-
tral conceptual structure for number and sis on focused attention and metacognitive
advance kindergarteners’ preparation for self-regulation. It may be that these de-
learning basic arithmetic operations through mand characteristics are the factors that in-
formal instruction. Testing demonstrated fluence an intervention’s success more than
that the children in the treatment group the underlying theory. Furthermore, as un-
indeed acquired a more fully developed derscored earlier, programs are often eclectic
central conceptual structure for number, in their means: Their methods overlap more
displayed greater understanding of number than their philosophies.
in content areas not included in the train- To further complicate declaring a win-
ing, and responded with substantially greater ner, different programs speak to the distinc-
gains to later formal instruction in the basics tive needs of different audiences – children
of arithmetic as well as showing far transfer of marked disabilities with unsystematic
to sight reading in music and to the notion of and impulsive ways of thinking, students of
distributive justice, areas related to the cen- elementary science conceptually confused
tral conceptual structure for number. about themes such as control of variables,
As these examples illustrate, the general math students in college struggling with
pedagogical style of the developmental ap- strategies of proof, and so on.
proach is to harness “natural” footholds and Another confounding factor is that a tech-
mechanisms of development to accelerate nically well-grounded theory may not be
and perhaps reach levels that the learner that helpful as a theory of action. As noted
otherwise would not attain. As theories of earlier, this is a problem with classic g theory.
action, models of human development, like Finally, and somewhat paradoxically, a the-
models of intelligence, do not so much offer ory, that is, in some ways suspect may lead
strategic advice to learners as they address to an intervention that proves quite effec-
teachers and especially designers, suggesting tive. For example, Piagetian theory has been
how they might arrange activities and expe- challenged in a number of compelling ways
riences that will push development forward. (e.g., Brainerd, 1 983 ; Case, 1 984, 1 985 ), yet
Indeed, a common, although questionable, applying certain key aspects of it appears
tenet of much developmental theory is that to serve the demonstrably effective CASE
program very well (Adey & Shayer, 1 993 , of mind; Baron (1 985 ) as part of his search-
1 994), perhaps because the kinds of think- inference framework; Ennis (1 986) and
ing it foregrounds are important to complex Norris (1 995 ) as part of analyses of criti-
cognition of the sort targeted, putting aside cal thinking; Langer (1 989, p. 44), with the
the standing of Piagetian theory as a whole. notion of mindfulness, which she defined as
In summary, although approaches based “an open, creative, and probabilistic state of
on norms and heuristics, theories of intelli- mind”; and Facione et al. (1 995 ). Models of
gence, and models of development can be self-regulation have emphasized volitional
identified, it is difficult at present to dismiss aspects of thinking and individuals’ moti-
any of them as misguided. As with much of vation to engage thoughtfully (Schunk &
human enterprise, the devil is in the details – Zimmerman, 1 994). We and our colleagues
here, the details of particular programs’ have done extensive work in this area, re-
agendas, the learners they mean to serve, ferring to intellectual character as a partic-
and the extent to which their conceptions ular perspective on dispositions (Ritchhart,
of good thinking provide helpful theories 2002; Tishman, 1 994, 1 995 ) and to disposi-
of action. tions themselves (Perkins, Jay, & Tishman,
That said, there is a general limitation 1 993 ; Perkins et al., 2000; Perkins &
to all three approaches: They all concern Tishman, 2001 ; Perkins & Ritchhart, 2004).
what it is to think well when you are think- Accordingly, it is important to exam-
ing. Such criteria are certainly important, ine the dispositional side of the story and
but this leaves room to ask: What if you appraise its importance in the teaching
don’t feel moved to think about the mat- of thinking.
ter at hand, or what if you don’t even no-
tice that the circumstances invite thinking?
The Logical Case for Dispositions
This brings us to the next fundamental
challenge of teaching thinking – the role One line of argument for the importance of
of dispositions. dispositions looks to logic and common ex-
perience. There is a natural tendency to as-
sociate thinking with blatant occasions – the
test item, the crossword puzzle, the choice
The Challenge of Attending of colleges, the investment decision. Plainly,
to Thinking Dispositions however, many situations call for thinking
with a softer voice all too easily unheard –
We discussed earlier how approaches to the politician’s subtle neglect of an alterna-
teaching thinking needed to address the tive viewpoint, your own and others’ rea-
question: What is good thinking? In a soning from ethnic stereotypes, the comfort
sense, that question was incomplete. Good of “good enough” solutions that are not all
thinkers, after all, are more than people who that good. Even when we sense opportuni-
simply think well when they think: They also ties for deeper thinking in principle, there
think at the right times with the right com- are many reasons why we often shun them –
mitments – to truth and evidence, creativity blinding confidence in one’s own view, obliv-
and perspective taking, sound decisions, and iousness to the possibilities for seeing things
apt solutions. Views of thinking that bring differently, aversion to complexities and am-
this to the fore are often called dispositional biguities, and the like. Such lapses seem all
because they look not just to how well peo- too common, which is why, for example,
ple think when trying hard but what kinds Dewey (1 922) emphasizes the importance
of thinking they are disposed to undertake. of good habits of mind that can carry people
Most views of thinking are abilities- past moments of distraction and reluctance.
centered, but several scholars have devel- Scheffler (1 991 , p. 4), writing about cogni-
oped dispositional perspectives – for in- tive emotions, put the point eloquently in
stance Dewey (1 922), who wrote of habits stating that “emotion without cognition is
blind, and . . . cognition without emotion is they persist. An extended program of re-
vacuous.” search has shown that these traits are inde-
It also is notable that the everyday lan- pendent of cognitive abilities but often affect
guage of thinking includes a range of terms cognitive performance greatly. Also, teach-
for positive and negative dispositional traits ing style and classroom culture can shape the
considered to be important: A person may be extent to which students adopt entity versus
open-minded or closed-minded, curious or incremental mindsets.
indifferent, judicious or impulsive, system- Using self-report measures of dogmatism,
atic or careless, rational or irrational, gullible categorical thinking, openness, counterfac-
or skeptical. Such contrasts have more to tual thinking, superstitious thinking, and ac-
do with how well people actually use their tively open-minded thinking, Stanovich and
minds than how well their minds work. West (1 997) found these measures predicted
performance on tests of argument evalu-
ation even after controlling for cognitive
The Empirical Case for Dispositions
capacities.
The foregoing arguments from logic and These studies support the notion that dis-
common sense give some reason to view positional constructs do influence behavior
the dispositional side of thinking as im- and can be useful in predicting performance,
portant. Beyond that, a number of re- although perhaps not in any absolute sense.
searchers have investigated a range of dis- One can be curious in one situation and not
positional constructs and provided empirical in another, for instance. Likewise with dispo-
evidence of their influence on thinking, their sitions such as friendliness or skepticism. Al-
trait-like character, and their distinctness though there is evidence for cross-situational
from abilities. stability for some dispositional constructs
Research on dispositional constructs such (Webster & Kruglanski, 1 994), the value of
as the need for cognitive closure (Kruglanski, the dispositional perspective does not rest
1 990) and the need for cognition (describ- on an assumed cross-situational character.
ing an individual’s tendency to seek, engage Indeed, rather than acting in a top-down,
in, and enjoy cognitively effortful activity; trait-like fashion, dispositions offer a more
Cacioppo & Petty, 1 982) has shown that they bottom-up explanation of patterns of behav-
influence when and to what extent individu- ior consistent with emerging social-cognitive
als engage in thinking and has demonstrated theories of personality (Cervone, 1 999;
test–retest reliability (Kruglanski, 1 990; Cervone & Shoda, 1 999). A dispositional
Cacioppo et al., 1 996). Measures of an perspective takes into account both the situ-
individual’s need for cognition developed ational context and individual motivational
by Cacioppo and colleagues show that it factors, positing that patterns of behavior are
is a construct distinguishable from ability emergent and not merely automatic. To bet-
(Cacioppo et al., 1 996). ter understand how such behavior emerges
Dweck and colleagues investigated an- and how dispositions differ from traits, it is
other dispositional construct for a number necessary to break apart dispositional behav-
of years – the contrast between entity learn- ior into its distinct components.
ers and incremental learners (Dweck, 1 975 , For a number of years, the authors and
2000). Broadly speaking, learners with an their colleagues have sustained a line of re-
entity mindset believe that “you either get search on the nature of dispositions, as cited
it or you don’t,” and if you don’t, you prob- earlier. Although most scholars view dispo-
ably are not smart enough. As a result, they sitions as motivating thinking, we have ana-
tend to quit in the face of intellectual chal- lyzed the dispositional side of thinking into
lenges. In contrast, learners with an incre- two components – sensitivity and inclina-
mental mindset believe their abilities can tion. Sensitivity does not motivate think-
be extended through step-by-step effort, so ing as such but concerns whether a person
notices occasions in the ongoing flow of positions does – and should – play in the
events that might call for thinking, such as teaching of thinking. Most programs do not
noticing a hasty causal inference, a sweeping attend directly and systematically to dispo-
generalization, a limiting assumption to be sitional aspects of thinking, although they
challenged, or a provocative problem to be may foster dispositions as a side-effect. In-
solved. Inclination concerns whether a per- deed, it is inconvenient to address disposi-
son is inclined to invest effort in thinking tions through programs that focus on direct
the matter through because of curiosity, per- instruction and regimens of practice. The
sonal relevance, and so on. dispositional side of thinking concerns notic-
Our empirical research argues that sensi- ing when to engage thinking seriously, which
tivity is supremely important. We used sto- inherently does not come up in abilities-
ries that portrayed people thinking through centered instruction that point-blank directs
various problems and decisions with embed- students to think about this or that problem
ded shortfalls in their thinking, such as not using this or that strategy.
going beyond the obvious options or not ex- One solution to this suggests that cul-
amining the other side of the case (Perkins ture is the best teacher of dispositions (cf.
et al., 2000; Perkins & Tishman, 2001 ). In Dewey, 1 922, 1 93 3 ; Ritchhart, 2002; Tish-
multiple studies, we found that subjects man, Jay, & Perkins, 1 993 ; Tishman, Perkins,
detected only about 1 0% of the thinking & Jay, 1 995 ; Vygotsky, 1 978). A culture in
problems, although, when prompted, they the classroom, the family, or the workplace
showed good ability, readily brainstorming that foregrounds values of thinking and en-
further options or generating arguments on courages attention to thinking would plausi-
the other side of the case. Inclinations played bly instill the attitudes and patterns of alert-
an intermediate role in their engagement ness called for.
in thinking. Interventions that wrap learners in a
In one study, we examined test–retest cor- culture include the Philosophy for Chil-
relations on sensitivity scores for detecting dren program developed by Lipman and
thinking shortfalls and found correlations of colleagues (Lipman, 1 988; Lipman, Sharp,
about 0.8 for a ninth grade sample and 0.6 & Oscanyon, 1 980), which foregrounds
for a fifth grade sample. The findings provide Socratic discussion, and the online col-
evidence that sensitivity to the sorts of short- laborative knowledge-building environment
falls examined is a somewhat stable charac- CSILE (Scardamalia & Bereiter, 1 996;
teristic of the person. In several studies, we Scardamalia et al., 1 989, 1 994), both of
examined correlations between our disposi- which were discussed earlier. Instrumental
tional measures and various measures of cog- Enrichment (Feuerstein, 1 980) involves a
nitive ability with results ranging from no to strong culture of support between media-
moderate correlation but lower than correla- tor and learners. We have also worked on
tions within ability measures (Perkins et al., programs with a cultural emphasis, includ-
2000; Perkins & Tishman, 2001 ). The finding Keys to Thinking (Perkins, Tishman, &
ings suggest that sensitivity and inclination Goodrich, 1 994; Cilliers et al., 1 994) and one
are not simply reflections of cognitive ability now under development (Perkins & Ritch-
as usually conceived: Dispositions are truly hart, 2004), and have published a book for
another side of the story of thinking. teachers with this emphasis – The Thinking
Classroom (Tishman, Perkins, & Jay, 1 995 ).
The theme of cultures of thinking is impor-
Cultivating Thinking Dispositions
tant in other ways as well, so, rather than
These lines of evidence support the funda- elaborating further, we will return to it in a
mental importance of dispositions in under- later section.
standing what it is to be a good thinker. The It is reasonable to ask whether such in-
question remains what role attention to dis- terventions have been shown to enhance
learners’ thinking dispositions. Unfortu- The Challenge of Transfer

nately, evidence on this question is sparse.
Although most of these programs have Like education in general, efforts to teach
been formally evaluated, the assessments thinking do not simply target the here and
by and large are abilities-oriented. Their now: They mean to serve the there and
performance-on-demand character does not then. What learners acquire today in the
estimate what students are disposed to do way of thinking skills, strategies, cognitive
in the absence of explicit demands, which schemata, underlying cognitive operations,
is what dispositions are all about. That dispositions, metacognitive capabilities, and
acknowledged, it is worth recalling that the like aims to help them there and then
CSILE students revealed deeper concep- make a difficult personal decision or study
tions of the nature of learning, a tendency quantum physics or manage a business or
to make mastery-oriented choices in their draft and deliver a compelling political state-
learning, and an avowed valuing of deep ment. In other words, the teaching of think-
thinking (Scardamalia, Bereiter, & Lamon, ing reaches for transfer of learning. Some-
1 994). Low-ability students responding to times the ambition for transfer is modest –
IE show marked increases in self-confidence experiences with reading for understanding
(Feuerstein et al., 1 981 ; Rand, Tannenbaum, or mathematical problem solving here and
& Feuerstein, 1 979). The authors think it now should improve performance for the
likely that many programs have at least same activities later in other contexts. Not
some impact on learners’ dispositions, but uncommonly, however, the ambition is far
an extensive empirical case remains to more grand – fundamental and far-reaching
be made. transformation of the person as a thinker.
In summary, both folk psychology and Some have charged that such ambitions
a good deal of academic psychology give are overwrought. Although thinking may be
abilities center stage in explaining good and cultivated in particular contexts for partic-
not-so-good thinking and thinkers. Along ular purposes, far-reaching transformation
with this abilities-centered view of think- may be impossible. Relatedly, some have
ing comes a concomitant view of what it argued that it may be impossible to teach
is to teach thinking: To get people to think thinking in an abstract way – say, with
better and improve their abilities, teach puzzle-like problems and through stepwise
problem-solving skills, learning skills, self- strategies – with gains that will spread far
management skills, and so on. All this cer- and wide.
tainly has value as far as it goes. However, the Empirical research shows us that the
arguments advanced here question the com- prospects of transfer cannot be utterly bleak.
pleteness of the storyline. They challenge In the second section of this article, we of-
whether performance-on-demand tasks are fered a number of existence proofs for mag-
a good model of how thinking works in ev- nitude, persistence, and transfer of impact,
eryday life and urge that well-rounded ef- and more appeared in the subsequent sec-
forts to teach thinking attend to dispositional tion. Before looking further at such results,
development as well as the development let us hear the case for meager transfer. At
of abilities. least three lines of scholarship pose a chal-
As is the case with abilities development, lenge to transfer – research on transfer it-
dispositions need to be considered from the self, research on expertise and the role of
standpoint of transfer of learning. Not only knowledge in cognition, and research on sit-
skills, but dispositions need to be generalized uated cognition. We will look briefly at each
broadly from their initial contexts of learn- in turn.
ing for them to develop a robust nature. This Transfer of learning has a vexed his-
brings us to our next challenge, that of teach- tory, particularly with respect to far transfer,
ing transfer. a somewhat informal term for transfer
to contexts very different from that of the of research has shown the fundamental im-
initial learning (see Holyoak, Chap. 6, for portance of familiarity with the knowledge,
a review of work on transfer by use of strategies, values, challenges, and other fea-
analogies). We can touch only briefly on tures of particular disciplines and endeavors
this complex literature. The classic studies (e.g., Bereiter & Scardamalia, 1 993 ; Erics-
are Thorndike’s (1 923 , Thorndike & Wood- son & Smith, 1 991 ; Ericsson, 1 996). For a
worth, 1 901 ) demonstrations that the intel- classic example, de Groot (1 965 ) and, build-
lectual rigor of studying Latin did not lead ing on his work, Chase and Simon (1 973 )
to improved performance on other fronts. demonstrated that skillful chess play de-
Since that time, numerous reviews and com- pends on a large repertoire of strategic pat-
pilations have shown that far transfer is hard terns about chess specifically accessed in a
to come by (e.g., Detterman, 1 992; Detter- perception-like way (see Novick & Bassok,
man & Sternberg, 1 992; Salomon & Perkins, Chap. 1 4).
1 989). For an interesting echo of Thorndike’s Evidence from a range of professions ar-
era, a number of efforts in the 1 980s to teach gues that naturalistic decision-making de-
various versions of computer programming pends on quick typing of situations to link
as, it might be said, “the new Latin,” gener- them to prototypical solutions that can be
ally showed no cognitive gains beyond the adjusted to the immediate circumstances
programming skills themselves (Salomon & (Klein, 1 999). In the same spirit, path anal-
Perkins, 1 987). Thorndike’s view that trans- yses of performance in practical job con-
fer depended on “identical elements” and is texts has shown specific knowledge to be a
less likely to apply to domains far removed much more direct predictor of performance
from one another remains a tempting expla- than general intelligence (Hunter, 1 986).
nation of the difficulties. Several scholars have argued that intelligent
A more recent view in a somewhat sim- behavior is deeply context bound (e.g. Ceci,
ilar spirit, Transfer Appropriate Processing, 1 990; Detterman, 1 992b; Glaser, 1 984; Lave,
holds that the prospects of transfer de- 1 988). Effective thinking depends so much
pend on a match between the features fore- on a large repertoire of reflexively activated,
grounded during initial encoding and the context-specific schemata that substantial
kinds of features called for in the target con- transfer of expert thinking from one domain
text. Initial encoding may tie the learning to another is impossible. Everyday support
to extraneous or unnecessarily narrow fea- for this comes from the informal observation
tures of the situation, limiting the prospects that people rarely manage to display high-
of transfer to other situations that happen to level thinking in more than one field.
share the same profile (Morris, Bransford, & Interventions consistent with this view in-
Franks, 1 977). Another rather different bar- clude programs in mathematics and science
rier reflects the position held by many IQ education that focus on a particular domain
theorists that there is nothing to train and and try to advance learners’ expertise. For
transfer: Very general cognitive capabilities example, Schoenfeld and Herrmann (1 982)
simply are not subject to improvement by documented how subjects in a previously
direct training, although genetics, nutrition, mentioned experimental intervention based
long-term enculturation by schooling, and on heuristics became more expert-like in
other factors may influence general cogni- their mathematical problem solving, coding
tive capability. problems more in terms of their deep struc-
Research directly on transfer aside, more ture than surface features.
damage to the prospects comes from studies Further skepticism about the prospects
of expertise and the importance of domain- for far transfer derives from studies of the
specific knowledge. Although it might be situated character of cognition and learning
thought that skilled cognition reflects gen- (Brown, Collins, & Duguid, 1 989; Kishner &
eral cognitive capabilities, an extensive body Whitson, 1 997; Lave, 1 988; Lave & Wenger,
1 991 ). The general point here is that skilled Second, the positions on transfer, exper-
activity is socially and physically situated in tise, and situated cognition just outlined
particular contexts, depending for its flu- have their critics as well as their propo-
ency and depth on a web of interactions nents. Many moderate positions take the
with peers, mentors, physical and symbolic most severe implications of these views
tools, and so on. Skill and knowledge do with a large grain of salt. For example, Sa-
not so much sit in the heads of individu- lomon and Perkins (1 989) outlined a two-
als as they are distributed through the social channel model of transfer specifying con-
and physical setting (Salomon, 1 993 ) and ditions for transfer by way of reflective
constituted through that setting. Individuals abstraction and by way of automatization
off-load certain thinking tasks onto the en- of routines, pointing out that there certainly
vironment by use of note-taking, organiza- were some successes reported in the trans-
tional mechanisms, fellow collaborators, and fer literature, and explaining a range of fail-
other technological tools to free up mental ures by the absence of conditions that would
space for more complex forms of thinking support transfer along one channel or the
(Pea, 1 993 ). other. In similar spirit, Gick and Holyoak
Accordingly, complex cognition is more (1 980, 1 983 ) (see Holyoak, Chap. 6) demon-
likely to develop through “cognitive appren- strated effective transfer between quite dif-
ticeship” (Collins, Brown, & Newman, 1 989) ferent problem-solving contexts when sub-
in the context of rich social and physical jects spontaneously or upon prompting
support than through instruction that at- reflectively abstracted underlying principles.
tempts to teach abstract schemas. Within Bassok and Holyoak (1 993 ) summarize ex-
such environments, individuals may first par- periments by making the case that super-
ticipate on the periphery of the group or ficial content context was not as limiting
with high-levels of support and gradually as some had argued. In many cases, learn-
progress to more independent and central ers bridged quite effectively from one con-
forms of operation as their expertise and tent context to another quite different, al-
comfort level increases (Lave & Wenger, though mismatches in the character of key
1 991 ). Because cognition is so situated, the variables in source and target sometimes in-
story goes, it is hard to uproot patterns of duced considerable interference. Bransford
cognition and transplant them into very dif- and Schwartz (1 999) urged reframing the
ferent contexts where they can still thrive. problem of transfer in terms of readier learn-
Interventions consistent with this view ining in the future, not of direct gains in per-
clude, for example, the CSILE collabora- formance, arguing that this afforded ample
tive online knowledge building environment opportunity for far transfer.
mentioned earlier (Scardamalia, Bereiter, & Turning to the theme of expertise, it can
Lamon, 1 994) and the Jasper Woodbury probe acknowledged that a rich collection of
gram, which helps youngsters build math- schemata constitutes an essential engine for
ematical skills and insights through situ- high-level thinking in a domain. Although
ating problem solving within compelling necessary in itself this engine is not suffi-
narratives and by making it a social endeavor cient. Expert status does not protect a person
(Van Haneghan et al., 1 992). from blind spots such as failure to examine
This triple challenge to the prospects of the other side of the case (Perkins, Farady, &
transfer seems daunting indeed. However, Bushey, 1 991 ). Indeed, people who “ought
it is important to emphasize that these cri- to know better” can behave with remark-
tiques by and large address the prospects able obtuseness (Sternberg, 2002). In keep-
of far transfer. They allow ample room for ing with this, many norms and heuristics
CSILE, the Jasper Woodbury program, writ- for good thinking address not the complex
ers’ workshops, design studios, philosophy knowledge characteristic of domain mas-
classes and the like, where the aim is to get tery but broad patterns of processing, such
better at a particular kind of thinking. as engaging anomalies seriously, examining
other perspectives, or questioning assump- quite different contexts. The point of all
tions, the neglect of which commonly en- this is certainly not to argue the opposite –
traps even those with well-developed knowl- that transfer comes easily, expertise depends
edge in a domain (see Chi and Ohlsson, largely on general cognitive capabilities, and
Chap. 1 6). learning is not somewhat entangled in its par-
Moreover, expert thinking is misleading ticular contexts – but rather to point out that
as a gold standard. Producing expert think- the most dire readings of the prospects of
ing by no means is the sole aim of the transfer do not seem to be warranted.
teaching of thinking. In many contexts, good Although the foregoing treats the general
thinking needs to be understood not as debate, the evidence on transfer from efforts
good-for-an-expert but good-for-a-learner to teach thinking also warrants considera-
or good-for-an-amateur. Some scholars have tion. As cited earlier, Nisbett (1 993 ) sum-
observed that there seems to be such a thing marized a number of studies in which efforts
as “expert novices,” and “expert learners” to teach statistical, if–then, cost–benefit, and
who bring to learning situations a range of at- other sorts of reasoning had led to transfer
titudes and strategies highly conducive to de- across content domains. As emphasized un-
veloping expertise more quickly (Bereiter & der the first challenge we addressed, there
Scardamalia, 1 993 ; Brown, Ferrera, & Cam- is considerable evidence for persistent far
pione, 1 983 ; Bruer, 1 993 ). Moreover, in transfer of improvements in thinking from
many facets of complex modern life – con- a number of studies. The signs of such trans-
sider filing income taxes, functioning as refer include impact on general reading skills,
sponsible citizens, purchasing a new car or IQ-like measures, thinking in various sub-
home – most of us operate as perpetual am- ject matters, the general cognitive compe-
ateurs. We do not engage in such activities tence of retarded people, and more. It will
enough to build deep expertise. The ques- be recalled that the philosophies and meth-
tion is less whether good general thinking ods of these programs are quite diverse, with
enables us to behave like an expert – it does some using rather abstract tasks well re-
not – and more whether good general think- moved from any particular subject matter or
ing enables us to perform better than we oth- natural community.
erwise would by leveraging more effectively In summary, we suggest that the de-
what knowledge we do have and helping us bate around transfer, expertise, and situated
to acquire more as we go. learning has been overly polarized and ideo-
Turning to the related theme of situated logical, leading to sweeping declarations on
knowledge, Anderson, Reder, and Simon both sides regarding what is possible or im-
(1 996) identified four core claims character- possible that do not stand up to empiri-
istic of the situated position – that action cal examination. The relationship between
is grounded in concrete situations, knowl- general cognitive structures and particular
edge does not transfer between tasks, train- situations perhaps needs to be understood
ing by abstraction is of little use, and in- as more complex and dynamic. Perkins and
struction must be done in complex social Salomon (1 989) offer the analogy of the
environments – and proceeded to summa- human hand gripping something. The hu-
rize empirical evidence contrary to all of man hand plainly is a very flexible gen-
them as universal generalizations. Bereiter eral instrument, but it always functions in
(1 997) and Salomon and Perkins (1 998) un- context, gripping different things in differ-
derscored how learners productively learn ent ways. Moreover, we need to learn to
under many degrees and kinds of social re- grasp objects according to their affordances:
lations and situatedness. Greeno, Smith, and You don’t hold a baby the same way you
Moore (1 992) offered an account of trans- hold a brick. Likewise, one can acknowl-
fer from the perspective of situated cogni- edge a broad range of general strategies,
tion, explaining how people sometimes ex- cognitive operations, and schemata with-
port systems of activity to other superficially out naı̈vely holding that they operate in
context-neutral ways. Adjustments are al- nect us to our last and final challenge, the
ways made – sometimes easily, sometimes challenge of creating cultures of thinking.
with difficulty. Skilled cognition involves Culture has been mentioned briefly
complex interarticulations of the general and in previous sections, but one still might
the specific. ask: What is it about culture, and cul-
So the prospects of transfer escape tures of thinking in particular, that de-
these skirmishes with skepticism – but not mands attention (see Greenfield, Chap. 27,
unscathed! Indeed, there are pointed lessons for further discussions on the role of
to be drawn. We can learn from research culture)? Three important motives are wor-
on the difficulties of transfer that transfer is thy of attention: First, the supporting struc-
nothing to take for granted. Well-designed tures of culture are needed to sustain gains
efforts to cultivate thinking will face up and actualize intelligent behavior over time,
to the challenge, for instance by incorpo- as opposed to merely building short-term
rating episodes of reflective abstraction to capacity (Brown & Campione, 1 994; Scar-
help learners to decontextualize patterns of damalia et al., 1 994; Tishman, Perkins, &
thinking and by providing practice across Jay, 1 993 ). It is through the culture of the
multiple distinct contexts. Well-designed ef- classroom that strategies and practices take
forts to cultivate thinking will look closely on meaning and become connected to the
at the behavior of experts to construct their work of learning. Second, culture helps to
heuristic analyses, and will not expect gen- shape what we attend to, care about, and
eral norms and heuristics to do the job of focus our energies upon (Bruner, Olver,
norms and heuristics tailored to particular & Greenfield, 1 966; Dasen, 1 977; Super,
endeavors such as writing or mathemati- 1 980). Thus, culture is integrally linked to
cal problem solving. Well-designed efforts the dispositional side of thinking and to
to cultivate thinking will recognize the dis- the cultivation of inclination and sensitivity.
tributed nature of cognition, and take advan- Third, researchers and program developers
tage of social and physical support systems to increasingly have recognized that thinking
advance individual and collective thinking. programs are not merely implemented but
are enacted, developed, and sustained in a
social context. As a result, they have found it
necessary to move away from teacher-proof
The Challenge of Creating Cultures materials, which view learning as an isolated
of Thinking individual process, and toward approaches
that pay more attention to the underlying
Thus far, we’ve examined four challenges conditions of learning.
that efforts to teach thinking traditionally As a result of the awareness of the
have faced. As teachers and program devel- role culture plays in learning, the past two
opers seek to meet those challenges, a host decades have seen efforts to teach think-
of additional concerns arise; for example; ing shift from programmed strategy in-
How do we provide enough time, context, struction aimed at students as individu-
and diverse applications so that new pat- als to broad-based approaches aimed at
terns of thinking actually take hold? How building classroom cultures supportive of
can we best take into account that school the active social construction of knowl-
learning happens in a social context within a edge among groups. These approaches take
classroom among a group of individuals? Is a variety of forms, such as cognitive ap-
the development of individual thinking best prenticeship (Collins, Brown, & Newman,
served and supported by the development 1 989), fostering a community of learners
of group learning practices? How do we un- (Brown & Campione, 1 994), group knowl-
cover the thinking that is going on in individ- edge building (Bereiter & Scardamalia, 1 996;
uals and within the group so we can respond Scardamalia & Bereiter, 1 996), inquiry-
to it and learn from it? These questions con- based teaching (Lipman, 1 983 ), and the
development of patterns of thinking (Tish- form resources that, in turn, have an ef-
man, Perkins, & Jay, 1 995 ) and habits of mind fect on practice. At the same time, resources
(Costa & Kallick, 2002). Several programs and practice provide supports for distributed
associated with these approaches were men- intelligence, scaffolding intelligent behavior
tioned previously – CISLE/Knowledge Fo- beyond that which can be displayed by an
rum, Philosophy for Children, and Keys to individual mind (Salomon, 1 993 ).
Thinking among them. We’ll examine a few This dialectical interplay between prac-
additional ones subsequently. Before doing tice and resources informs our understand-
so, however, it may be useful to take a closer ing of just what the “it” is in which individu-
look at just what is meant by culture in the als become enculturated. But, how does this
cultural approach. enculturation happen? How are a culture’s
Culture, construed broadly, refers to the practice and resources conveyed and learned
context and general surround in which we by group members? In a study of thought-
operate. This doesn’t tell us much about ful classrooms, Ritchhart (Ritchhart, 2002)
what it means to become enculturated, how- identified seven cultural forces at work in
ever. To illuminate this issue it is helpful to classrooms that facilitated the process of en-
look at particular intellectual subcultures or culturation in thinking: (1 ) messages from
communities of practice, say of mathemati- the physical environment about thinking,
cians or writers or even mechanics. What (2) teacher modeling of thinking and dis-
does it mean to be a part of these cultures? positions, (3 ) the use of language of think-
A frame that we have found useful is based ing, (4) routines and structures for thinking,
on two top-level conceptions: resources and (5 ) opportunities created for thinking, (6)
practice (Roth, 1 995 ). Resources are the conveyance of expectations for thinking, and
things upon which members of the culture (7) interactions and relationships supportive
of practice draw when they do their work. of thinking.
Resources can be physical in nature: com- These cultural forces act as direct and in-
puters, books, instruments, tools, and the direct vehicles for teaching. For example, the
like. There are also social resources such as use of routines and structures for thinking,
colleagues, coworkers, editors, peer-review which connects to the idea of norms and
boards, and so on. These types of resources heuristics mentioned previously, is a highly
help distribute cognition outside the individ- integrated but still direct form of teaching.
ual thinker’s mind (Salomon, 1 993 ). In ad- By introducing “thinking routines” (Ritch-
dition, there are conceptual resources con- hart, 2002), teachers provide students with
sisting of the conceptual, knowledge, and highly transportable tools for thinking that
belief systems in which the subculture read- they learn in one context and then transfer
ily traffics. Also included in the conceptual to other situations over time until the strat-
resources are the symbol systems and nota- egy has become a routine of the classroom.
tional structures evolved to support abstract We and our colleagues are currently capital-
thought (Gardner, 1 983 ; Goodman, 1 976; izing on this approach in the design of a new
Olson, 1 974). thinking program. The use of the language of
Practice captures the constructive acts en- thinking (Tishman & Perkins, 1 997; Ritchart,
gaged in by the cultural group – what it is 2002) – which includes process (justify-
they do, the kind of work that is valued and ing, questioning, analyzing), product (theory,
rewarded, the methods they employ. This conjecture, summation), stance (challenge,
connects the group to the socio-historically agree, concur), and state (confused, puzzled,
valued ways of knowing and thinking, such intrigued) words – is a much more indirect
as the epistemic forms of the disciplines that method of promoting thinking that gives stu-
are part of the group’s heritage (Collins & dents the vocabulary for talking about think-
Ferguson, 1 993 ; Perkins, 1 994, 1 997). Re- ing. By combining the direct (routines and
sources and practice interact dialectically in structures, and opportunities) and the indi-
that individual and group practice trans- rect (modeling, language, relationships and
interactions, environment, expectations), a they went through in solving problems or

culture of thinking is built and sustained. coming to conclusions, group ideas and
One can see these cultural forces at play conjectures are recorded and reviewed, the
in the Community of Learners approach artifacts of thinking are put on display in the
(Brown & Campione, 1 994). In this ap- classroom, and so on. At the heart of these
proach, a premium is placed on research, efforts lies reflection on one’s thinking and
knowledge-building, and critical thinking, cognitive monitoring, the core processes
thus communicating expectations for think- of metacognition. Ultimately, teaching
ing to students through the types of oppor- students to be more metacognitive and
tunities provided. In this environment, in- reflective, providing rich opportunities for
dividual responsibility is coupled with the thinking across various contexts, setting up
communal sharing of expertise. Discourse an environment that values thinking, and
(constructive discussion, questioning, and making the thinking of group members visi-
criticism) is the norm, making use of the ble contribute a great deal to the formation
language of thinking and interactions and of a culture of thinking. The cultural forces
relationships supportive of thinking. Ritual, can be leveraged toward this end. Within
familiar participant structures, and routines such a culture of thinking, other efforts to
are introduced to help students navigate and teach thinking, both formal and informal,
work within the new culture. All of this is have a greater likelihood of taking hold
accomplished within an environment that because they will be reinforced through the
makes thinking visible for students. culture and opportunities for transfer and
Research suggests that, at least in this reflection will increase.
particular case, a broad-based cultural ap- In summary, in some sense, a fully de-
proach was superior to one based on teach- veloped culture of thinking in the class-
ing heuristics. Approximately ninety fifth room or, indeed, in other settings such as the
and sixth graders in the Community of home or the workplace, represents the cohe-
Learners (CL) group outperformed a group sive culmination of the separate challenges
using only a reciprocal teaching technique of achieving results, defining the thinking,
in which students led the learning in read- attaining transfer, and attending to think-
ing discussions on criterion-referenced tests ing dispositions. A thoroughgoing culture of
of reading comprehension (and this result thinking attends to all of these. Unfortu-
occurred even though the group was given nately, the converse is certainly not so. It
twice as much practice as the CL group). is possible to attend assiduously to the first
There was no improvement in a reading-only four – say, every Tuesday and Thursday from
control group. Scores on questions dealing 1 1 to 1 2, or when we do math projects for a
with inference, gist, and analogy improved day at the end of each unit – and still fall
dramatically. The results show magnitude of far short of a pervasive culture of thinking.
effects but require further study to assess the Results reviewed earlier in this article sug-
generality and persistence of effects. Further gest that even limited treatments may well
research is needed to determine whether the benefit students’ thinking. However, one has
effects are sustaining in the sense of ongoing to ask about the rest of their learning. In the
repertoire, the ultimate goal of a cultural ap- end, the point of a culture of thinking is not
proach, or whether their impact is limited to just to serve the development of thinking but
behaviors in the immediate environment. to serve the breadth and depth of students’
A common thread running through learning on all fronts.
cultural approaches to teaching thinking is
the effort to make thinking visible, often
through the various cultural forces. This Conclusions and Future Directions
occurs as teachers model their thought pro-
cesses before the class, students are asked to This review of the teaching of thinking has
share their thinking and discuss the processes cast a wide net to look at programs for which
adequate data exist for examination and with quite different theories seem to have
discussion. These programs address a great achieved substantial success. Why should
variety of thinking – creative and critical this be? Does theory matter at all? As with
thinking, problem solving, decision making, the first challenge, the answer to effective-
and metacognition as well as subject-specific ness may lie more with certain demand char-
types of thinking. Even so, we have only acteristics of programs than with any single
scratched the surface of the ongoing efforts theoretical approach. Increased explicit in-
to teach thinking. Why does the teaching volvement with thinking and systematic at-
of thinking continue to be such a central tention to managing one’s thinking may be
question in education? Why do we even the most critical conditions. To untangle this
need to teach thinking? As discussed ear- issue empirically, one would need to com-
lier, efforts to teach thinking deal with both pare the effectiveness of programs with dif-
amplifying native tendencies and addressing ferent theoretical bases but with the same
problems of thinking shortfalls. In addition, demands for thinking and reflection. Unfor-
a major goal of most thinking interventions tunately, it is rare in the literature on the
is to enhance learning and promote deeper teaching of thinking to find alternative ap-
understanding. The idea that deep and last- proaches addressing the same kinds of think-
ing learning is a product of thinking provides ing and the same sorts of learners pitted
a powerful case for the teaching of thinking. against one another.
Indeed, we venture that the true promise of The third challenge dealt with the dispo-
the teaching of thinking will not be realized sitional side of thinking. We showed how the
until learning to think and thinking to learn effective teaching of thinking is more than
merge seamlessly. just the development of ability, demand-
Toward this end, we singled out five ing the development of awareness and in-
challenges that must be dealt with along clination as well. In particular, the lack of
the way. The first addressed the question a sensitivity to occasions for thinking ap-
of whether or not thinking can be taught pears to be a major bottleneck when it comes
with some reasonable signs of success. We to putting one’s abilities into action. It is
reviewed several programs as a kind of our belief that some programs accomplish
existence proof that, indeed, it is possi- this. Although most data focus on abilities,
ble to produce impacts with substantial leaving impact on sensitivity and inclination
magnitude, persistence, and transfer. These unassessed, there are a few indications of im-
programs spanned a variety of philosoph- pact on dispositions. Certainly, more work is
ical and methodological approaches while needed in this area.
sharing the common characteristics of in- Transfer, a pivotal concern within the
creasing the demand for thinking, devel- teaching of thinking, constituted our fourth
oping thinking processes, and paying at- challenge. Although some have argued that
tention to metacognitive self-regulation. transfer cannot be obtained because all
These common demand characteristics ap- knowledge is bound to context, the empiri-
pear to be key elements in the teaching cal record of successful programs has shown
of thinking. clearly that some degree of transfer is pos-
The second challenge concerned what sible across domains of content knowledge.
one means when talking about good think- This is by no means automatic, however.
ing. We showed how efforts to teach think- Transfer must be designed deliberately into
ing are shaped largely by how they answer interventions by highlighting key features of
this question. Thus, the content, sequence, the situation that need attention, promot-
and methods of instruction for a particular ing reflective abstraction of underlying prin-
intervention arise from a single or collec- ciples, and providing practice across multiple
tive set of grounding theories, be they linked contexts. Even then, one is more likely to see
to norms and heuristics, intelligence, or hu- near transfer of thinking to similar contexts
man development. Interestingly, programs than far transfer.
Our fifth challenge, that of creating cul- ful are existing programs at developing
tures of thinking, examined the social con- the dispositional side of thinking? What
text and environment in which thinking is kinds of practices and interventions ef-
fostered. Efforts to teach thinking cannot be fectively foster students’ inclination and
removed from their social context. Context sensitivity? Are dispositions bound to the
provides important avenues for the devel- social context in which they are devel-
opment of supporting inclinations toward oped or do they transfer to new settings?
thinking, learning from more accomplished How does attention to the development
peers, focusing attention, and access to the of sensitivity to occasions affect transfer of
resources and practices of the group. In class- thinking skills? Efforts to teaching think-
rooms, a set of cultural forces directs and ing skills are sometimes done in a limited
shapes students’ learning experiences both time frame, raising the question: What is
directly and indirectly. These cultural forces the appropriate time frame for the devel-
convey to students how much and what opment of dispositions?
kinds of thinking are valued, what meth-
ods the group uses to go about thinking, Perhaps the biggest question about the
and what expectations there are regarding teaching of thinking concerns how to inte-
thinking. Furthermore, the thinking of indi- grate it with other practices, in school and
viduals and groups is made visible through out of school, in an effective way. We already
these forces. know enough about the teaching of think-
Our review of these five challenges sug- ing to have a substantial impact, and yet the
gests several fronts for further investigation: reality of collective practice falls short. We
must ask ourselves: How can thinking ini-
r The questions of transfer and sustained tiatives be sustained and integrated with the
impact need to be better understood. many other agendas faced by schools, mu-
In particular, little is known about the seums, clubs, corporate cultures, and other
impact of extended interventions. One settings in which thinking might thrive?
might expect that broad multi-year in- Only when we understand how to foster
terventions would yield wide impact sus- cultures of thinking not just within in-
tained for many years, but the empirical dividual families or classrooms but across
work has not been done to our knowl- entire schools, communities, and, indeed, so-
edge. Relatedly, what would be the ef- cieties, will scholarly insights and the prac-
fect of a cross-subject thinking interven- tical craft of teaching thinking achieve their
tion in which students encounter the mutual promise.
same practices concurrently in multiple
disciplines?
r An exploration of the trade-offs among
the norms and heuristics, models of intel- Acknowledgments
ligence, and developmental approaches is
needed to better understand the role of Some of the ideas and research reported here
theory in successful interventions. How were developed with much-appreciated
and where does the underlying theory support from the Stiftelsen Carpe Vitam
of thinking matter? When demands for Foundation and the John D. and Cather-
thinking are held constant, does one ine T. MacArthur Foundation. The posi-
theoretical approach work better than tions taken by the authors, of course, are
another? What is it that makes success- not necessarily those of the foundations. The
ful programs work? What characteris- authors would like to thank Stellan Ohls-
tics and practices are most pivotal to son and Keith Holyoak for their thought-
success? ful editorial suggestions in response to the
r Within the realm of thinking dispositions, first draft and Nicole Weiss for her research
there is much to be learned. How success- assistance.
References Blagg, N. (1 991 ). Can We Teach Intelligence? A

Comprehensive Evaluation of Feuerstein’s In-
Adams, C. (Ed.) (1 971 ). Think, Stanford, CT: In- structional Enrichment Program. Hillsdale, NJ:
novative Sciences. Erlbaum.
Adams, M. J. (1 989). Thinking skills curricula: Boyer, E. (1 983 ). High School: A Report on
Their promise and progress. Educational Psy- Secondary Education in America. New York:
chologist, 2 4(1 ), 25 –77. Harper & Row.
Adams, M. J. (Ed.) (1 986). Odyssey: A Curricu- Brainerd, C. J. (1 983 ). Working-memory sys-
lum for Thinking. Watertown, MA: Charles- tems and cognitive development. In C. J.
bridge. Brainerd (Ed.), Recent Advances in Cognitive-
Adey, P., & Shayer, M. (1 993 ). An exploration of developmental Theory: Progress in Cognitive De-
long-term far-transfer effects following an ex- velopment Research (pp. 1 67–23 6). New York:
tended intervention program in the high school Springer-Verlag.
science curriculum. Cognition and Instruction, Bransford, J. D., & Schwartz, D. L. (1 999). Re-
1 1 (1 ), 1 –29. thinking transfer: A simple proposal with in-
Adey, P., & Shayer, M. (1 994). Really Raising teresting implications. In A. Iran-Nejad, & P. D.
Standards: Cognitive Intervention and Academic Pearson (Eds.), Review of Research in Education
Achievement. London: Routledge. (Vol. 24, pp. 61 –1 01 ). Washington, DC: Amer-
ican Educational Research Association.
Anderson, J. R., Reder, L. M., & Simon, H. A.
(1 996). Situated learning and education. Edu- Brody, N. (1 992). Intelligence. New York: Aca-
cational Researcher, 2 5 (4), 5 –1 1 . demic Press.
Argyris, C. (1 993 ). On Organizational Learning. Brown, A. L., Bransford, J. D., Ferrara, R. A., &
Cambridge, MA: Blackwell. Campione, J. C. (1 983 ). Learning, remember-
ing, and understanding. In P. H. Mussen (ED.)
Argyris, C., & Schön, D. A. (1 996). Organiza-
Handbook of Child Psychology. Vol. 3 : Cognitive
tional Learning II: Theory, Method, and Practice.
Development. New York: Wiley.
New York: Addison-Wesley.
Brown, J. S., Collins, A., & Duguid, P. (1 989).
Bassok, M., & Holyoak, K. J. (1 993 ). Pragmatic
Situated cognition and the culture of learning.
knowledge and conceptual structure: Deter-
Educational Researcher, 1 8(1 ), 3 2–42.
minants of transfer between quantitative do-
mains. In D. K. Detterman, & R. J. Sternberg Brown, A. L., & Campione, J. C. (1 994). Guided
(Eds.), Transfer on Trial: Intelligence, Cognition, discovery in a community of learners. In K.
and Instruction. Norwood, NJ: Ablex. McGilly (Ed.), Classroom Lessons: Integrating
Cognitive Theory and Classroom Practice. Cam-
Baron, J. (1 985 ). Rationality and Intelligence. New
bridge, MA: MIT Press/Bradford Books.
York: Cambridge University Press.
Brown, A. L., & Palincsar, A. S. (1 982). Introduc-
Baron, J. B., Granato, L., Spranca, M., & Teubel,
ing strategic learning from text by means of in-
E. (1 993 ). Decision-making biases in chil-
formed, self-control training. Topic in Learning
dren and early adolescents: Exploratory stud-
and Learning Disabilities, 2 (1 ), 1 –1 7.
ies. Merrill-Palmer Quarterly, 3 9(1 ), 22–46.
Bruner, J. S., Olver, R., & Greenfield, P. (1 966).
Bereiter, C. (1 997). Situated cognition and how
Studies in Cognitive Growth. New York: Wiley.
to overcome it. In D. Kishner, & J. A. Whitson
(Eds.), Situated Cognition: Social, Semiotic, and Burke, T. (Ed.) (1 971 ). Intuitive Math. Stanford,
Psychological Perspectives (pp. 281 –3 00). Hills- CT: Innovative Sciences.
dale, NJ: Erlbaum. Cacioppo, J. T., & Petty, R. E. (1 982). The need
Bereiter, C., & Scardamalia, M. (1 993 ). Sur- for cognition. Journal of Personality and Social
passing Ourselves: An Inquiry into the Nature Psychology, 42 , 1 1 6–1 3 1 .
and Implications of Expertise. Chicago: Open Cacioppo, J. T., Petty, R. E., Feinstein, J. A.,
Court. & Jarvis, W. B. G. (1 996). Dispositional dif-
Bereiter, C., & Scardamalia, M. (1 996). Rethink- ferences in cognitive motivation: The life
ing learning. In D. R. Olson, & N. Torrance and times of individuals varying in need for
(Eds.), The Handbook of Education and Human cognition. Psychological Bulletin, 1 1 9(2), 1 97–
Development: New Models of Learning, Teaching 25 3 .
and Schooling (pp. 485 –41 3 ). Cambridge, MA: Carey, S. (1 985 ). Conceptual Change in Child-
Basil Blackwell. hood. Cambridge, MA: MIT Press.
Carey, S. (1 985 b). Conceptual Change in Child- Detterman, D. K. (Ed.) (1 992a). Current Topics
hood. Cambridge, MA: MIT Press. in Human Intelligence. Vol. 2 : Is Mind Modular
Case, R. (1 984). The process of stage transition: or Unitary? Norwood, NJ: Ablex.
A neo-Piagetian viewpoint. In R. J. Sternberg Detterman, D. K. (1 992). The case for the
(Ed.), Mechanisms of Cognitive Development prosecution: Transfer as an epiphenomenon.
(pp. 1 9–44). New York: Freeman. In D. K. Detterman, & R. J. Sternberg (Eds.),
Case, R. (1 985 ). Intellectual Development: Birth to Transfer on Trial (pp. 1 –24). Norwood, NJ:
Adulthood. New York: Academic Press. Ablex.
Case, R. (1 992). The Mind’s Staircase: Explor- Detterman, D., & Sternberg, R. (Eds.) (1 992).
ing the Conceptual Underpinnings of Chil- Transfer on Trial. Norwood, NJ: Ablex.
dren’s Thought and Knowledge. Hillsdale, NJ: Dewey, J. (1 922). Human Nature and Conduct.
Erlbaum. New York: Holt.
Ceci, S. J. (1 990). On Intelligence . . . More or Less: Dewey, J. (1 93 3 ). How We Think: A Restatement
A Bio-ecological Treatise on Intellectual Develop- of the Relation of Reflective Thinking to the Ed-
ment. Englewood Cliffs, NJ: Prentice-Hall. ucative Process. Boston: Heath.
Cervone, D. (1 999). Bottom-up explanation in Dweck, C. S. (1 975 ). The role of expectations
personality psychology: The case of cross- and attributions in the alleviation of learned
situational coherence. In D. Cervone, & Y. helplessness. Journal of Personality and Social
Shoda (Eds.), The Coherence of Personality: Psychology, 3 1 , 674–685 .
Social-cognitive Bases of Consistency (pp. 3 03 – Dweck, C. S. (2000). Self-theories: Their Role
3 41 ). New York: Guilford Press. in Motivation, Personality, and Development.
Cervone, D., & Shoda, Y. (1 999). Beyond traits Philadelphia: Psychology Press.
in the study of personality coherence. Cur- Edwards, J., & Baldauf, R. B. (1 983 ). Teach-
rent Directions in Psychological Science, 8(1 ), ing thinking in secondary science. In W.
27–3 2. Maxwell (Ed.), Thinking: The Expanding Fron-
Chase, W. C., & Simon, H. A. (1 973 ). Perception tier. Philadelphia: Franklin Institute Press.
in chess. Cognitive Psychology, 4, 5 5 –81 . Edwards, J., & Baldauf, R. B. (1 987). The effects
Cilliers, C., Botha, L., Capdevielle, B., Perkins, of the CoRT-1 thinking skills program on stu-
D. N., & van der Vyver, D. (1 994). The de- dents. In D. N. Perkins, J. Lochhead & J. C.
velopment of a curriculum for thinking skills. Bishop (Eds.), Thinking: The Second Interna-
International Journal of Special Education, 9,(3 ), tional Conference. Hillsdale: Erlbaum.
25 7–270. Edwards, J. (1 991 a). The direct teaching of think-
Collins, A., & Ferguson, W. (1 993 ). Epistemic ing skills. In G. Evans (Ed.), Learning and Teach-
forms and epistemic games: Structures and ing Cognitive Skills (pp. 87–1 06). Melbourne:
strategies to guide inquiry. Educational Psychol- Australian Council for Educational Research.
ogist, 2 8(1 ), 25 –42. Edwards, J. (1 991 b). Research work on the CoRT
Collins, A., Brown, J. S., & Newman, S. F. method. In S. Maclure, & P. Davies (Eds.),
(1 989). Cognitive apprenticeship: Teaching Learning to Think: Thinking to Learn (pp. 1 9–
the craft of reading, writing, and mathemat- 3 0). Oxford, UK: Pergamon.
ics. In L. B. Resnick (Ed.), Knowing, Learning, Edwards, J. (1 994). Thinking, education, and hu-
and Instruction: Essays in Honor of Robert Glase man potential. In J. Edwards (Ed.), Thinking:
(pp. 45 3 –494). Hillsdale, NJ: Erlbaum. International Interdisciplinary Perspectives
Costa, A. L., & Kallick, B. (2002). Habits of Mind (pp. 6–1 5 ). Melbourne: Hawker Brownlow
(Vol. I–IV). Alexandria, VA: Association for Education.
Supervision and Curriculum Development. Ennis, R. H. (1 986). A taxonomy of critical think-
Dasen, P. R. (Ed.) (1 977). Piagetian Psychology: ing dispositions and abilities. In J. B. Baron, &
Cross-cultural Contributions. New York: Gard- R. S. Sternberg (Eds.). Teaching Thinking Skills:
ner Press. Theory and Practice (pp. 9–26). New York:
de Bono, E. (1 973 ). CoRT Thinking Program: Freeman.
Workcards and Teacher Notes. Sydney: Direct Ericsson, K. A. (Ed.) (1 996). The Road to
Educational Services. Excellence: The Acquisition of Expert Performance
de Groot, A. D. (1 965 ). Thought and Choice in in the Arts and Sciences, Sports, and Game.
Chess. The Hague: Mouton. Mahwah, NJ: Erlbaum.
Ericsson, K. A., & Smith, J. (Eds.) (1 991 ). To- R. J. Sternberg (Ed.), Handbook of Intelligence.
ward a General Theory of Expertise: Prospects New York: Cambridge University Press.
and Limits. Cambridge, UK: Cambridge Uni- Guilford, J. P. (1 967). The Nature of Human In-
versity Press. telligence. New York: McGraw-Hill.
Facione, P. A., Sanchez, C. A., Facione, N. C., Guilford, J. P., & Hoepfner, R. (1 971 ). The Anal-
& Gainen, J. (1 995 ). The disposition toward ysis of Intelligence. New York: McGraw-Hill.
critical thinking. Journal of General Education,
Hamblin, C. L. (1 970). Fallacies. London:
44(1 ), 1 –25 .
Methuen.
Feuerstein, R. (1 980). Instrumental Enrichment:
Hamers, J. H. M., & Overtoom, M. (Eds.) (1 997).
An Intervention Program for Cognitive Modifi-
Teaching Thinking in Europe: Inventory of Euro-
ability. Baltimore: University Park Press.
pean Programmes. Utrecht: Sardes.
Feuerstein, R., Miller, R., Hoffman, M. B., Rand,
Harvey, J. H., Town, J. P., & Yarkin, K. L. (1 981 ).
Y., Mintzker, Y., & Jensen, M. R. (1 981 ).
How fundamental is “the fundamental attribu-
Cognitive modifiability in adolescence: Cog-
tion error”? Journal of Personality and Social Psy-
nitive structure and the effects of interven-
chology, 40(2), 3 46–3 49.
tion. The Journal of Special Education, 1 5 ,
269–286. Herrnstein, R. J., Nickerson, R. S., Sanchez, M.,
& Swets, J. A. (1 986). Teaching thinking skills.
Fischer, K. W. (1 980). A theory of cognitive de-
American Psychologist, 41 , 1 279–1 289.
velopment: The control and construction of hi-
erarchies of skills. Psychological Review, 87(6), Hirschfeld, L. A., & Gelman, L. A. (Eds.) (1 994).
477–5 3 1 . Domain Specificity in Cognition and Cul-
ture. Cambridge, UK: Cambridge University
Gardner, H. (1 983 ). Frames of Mind. New York: Press.
Basic Books.
Hunter, J. E. (1 986). Cognitive ability, cogni-
Gick, M. L., & Holyoak, K. J. (1 980). Analogical tive aptitudes, job knowledge, and job perfor-
problem solving. Cognitive Psychology, 1 2 , 3 06– mance. Journal of Vocational Behavior, 2 9, 3 40–
3 65 . 3 62.
Gick, M. L., & Holyoak, K. J. (1 983 ). Schema in- Idol, L., Jones, B. F. (Ed.) (1 991 ). Educational Val-
duction and analogical transfer. Cognitive Psy- ues and Cognitive Instruction: Implications for
chology, 1 5 , 1 –3 8. Reform. Hillsdale, NJ: Erlbaum.
Glaser, R. (1 984). Education and thinking: The Inhelder, B., & Piaget, J. (1 95 8). The Growth of
role of knowledge. American Psychologist, 3 9, Logical Thinking from Childhood to Adolescence.
93 –1 04. New York: Basic Books.
Goleman, D. (1 995 ). Emotional Intelligence. New Janis, I. (1 972). Victims of Groupthink: Psychologi-
York: Bantam Books. cal Study of Foreign-Policy Decisions and Fiascoes
Goodlad, J. I. (1 983 ). A Place Called School: (2nd ed.). Boston: Houghton Mifflin.
Prospects for the Future. New York: McGraw- Jensen, A. R. (1 980). Bias in Mental Testing. New
Hill. York: Free Press.
Goodman, N. A. (1 976). Languages of Art. Jensen, A. R. (1 998). The g Factor: The Science of
Indianapolis: Hackett. Mental Ability. Westport, CT: Praeger.
Greeno, J. G., Smith, D. R., & Moore, J. L. (1 992). Kishner, D., & Whitson, J. A. (Eds.)(1 997). Situ-
Transfer of situated learning. In D. Detterman, ated Cognition: Social, Semiotic, and Psychologi-
& R. Sternberg (Eds.), Transfer on Trial (pp, 99– cal Perspectives. Hillsdale, NJ: Erlbaum.
1 67). Norwood, NJ: Ablex. Klein, G. (1 999). Sources of Power: How Peo-
Griffin, S. A., Case, R., & Capodilupo, S. (1 995 ). ple Make Decisions. Cambridge, MA: MIT
Teaching for understanding: The importance of Press.
central conceptual structures in the elementary Kriegler, S., van Niekerk, H. (1 993 ). IE: A con-
school mathematics curriculum. In A. McK- tribution to cultivating a culture of learning?
eough, J. Lupart, & A. Marini (Eds.), Teaching International Journal of Cognitive Education and
for Transfer: Fostering Generalization in Learn- Mediated Learning, 3 (1 ), 21 –26.
ing. Hillsdale, NJ: Erlbaum.
Kruglanski, A. W. (1 990). Motivations for
Grotzer, T., & Perkins, D. N. (2000). Teach- judging and knowing: Implications for causal
ing intelligence: A performance conception. In attribution. In E. T. Higgins, & R. M. Sorrentino
(Eds.), The Handbook of Motivation and Pascual-Leone, J., Goodman, D., Ammon, P., &
Cognition: Foundation of Social Behavior Subelman, I. (1 978). Piagetian theory and neo-
(Vol. 2, pp. 3 3 3 –3 68). New York: Guilford Piagetian analysis as psychological guides in ed-
Press. ucation. In J. M. Gallagher (Ed.), Knowledge
Langer, E. (1 989). Mindfulness. Reading, MA: and Development (Vol. 2). New York: Plenum.
Addison-Wesley. Paul, R. W. (1 984). Critical thinking: Fundamen-
Lave, J. (1 988). Cognition in Practice: Mind, Math- tal to education for a free society. Educational
ematics and Culture in Everyday Life. Cam- Leadership, 42 (1 ), 4–1 4.
bridge, UK: Cambridge University Press. Pea, R. D. (1 993 ). Practices of distributed intelli-
Lave, J., & Wenger, E. (1 991 ). Situated Learning: gence and designs for education. In G. Salomon
Legitimate Peripheral Participation. New York: (Ed.), Distributed Cognitions (pp. 47–87). New
Cambridge University Press. York: Cambridge University Press.
Lipman, M. (1 976). Philosophy for children. Perkins, D. N. (1 985 ). Postprimary education has
Metaphilosophy, 7, 1 7–1 9. little impact on informal reasoning. Journal of
Education Psychology, 77(5 ), 5 62–5 71 .
Lipman, M. (1 983 ). Thinking Skills Fostered by
Philosophy for Children. Montclair, NJ: Insti- Perkins, D. N. (1 994). The hidden order of
tute for the Advancement of Philosophy for open-ended thinking. In J. Edwards (Ed.),
Children. Thinking: Interdisciplinary Perspectives. Victo-
ria, Australia: Hawker Brownlow Education.
Lipman, M. (1 988). Philosophy Goes to School.
Perkins, D. N. (1 995 ). Outsmarting IQ: The Emerg-
Philadelphia: Temple University.
ing Science of Learnable Intelligence. New York:
Lipman, M., Sharp, A., & Oscanyon, F. (1 980). Free Press.
Philosophy in the Classroom. Philadelphia: Tem-
Perkins, D. N. (1 989). Reasoning as it is and could
ple University.
be. In D. Topping, D. Crowell, & V. Kobayashi
McGuinness, C., & Nisbet, J. (1 991 ). Teaching (Eds.), Thinking: The Third International Con-
thinking in Europe. British Journal of Educa- ference. Hillsdale, NJ: Erlbaum.
tional Psychology, 61 , 1 74–1 86.
Perkins, D. N. (1 995 ). Outsmarting IQ. New York:
Meeker, M. N. (1 969). The Structure of Intellect: Free Press.
Its Interpretation and Uses. Columbus, OH:
Perkins, D. N. (1 997). Epistemic games. Interna-
Charles E. Merrill.
tional Journal of Educational Research, 2 7(1 ),
Morris, C. D., Bransford, J. D., & Franks, J. J. 49–61 .
(1 977). Levels of processing versus transfer ap- Perkins, D. N., Allen, R., & Hafner, J. (1 983 ). Dif-
propriate processing. Journal of Verbal Learning ficulties in everyday reasoning. In W. Maxwell
and Verbal Behavior, 1 6, 5 1 9–5 3 3 . (Ed.), Thinking: The Frontier Expands (pp. 1 1 7–
National Commission on Excellence in Educa- 1 89). Hillsdale, NJ: Erlbaum.
tion (1 983 ). A Nation at Risk: The Imperative Perkins, D. N., Farady, M., & Bushey, B. (1 991 ).
for Educational Reform. Washington, DC: U.S. Everyday reasoning and the roots of intelli-
Department of Education. gence. In J. Voss, D. N. Perkins & J. Segal (Eds.),
Nickerson, R. S., Perkins, D. N., & Smith, E. E. Informal Reasoning (pp. 83 –1 05 ). Hillsdale, NJ:
(1 985 ). The Teaching of Thinking. Hillsdale, NJ: Erlbaum.
Erlbaum. Perkins, D. N., & Goodrich-Andrade, H. (1 998).
Nisbett, R. E. (Ed.) (1 993 ). Rules for Reasoning. Learnable intelligence and intelligent learning.
Hillsdale, NJ: Erlbaum. In R. J. Sternberg, & W. M. Williams (Eds.), In-
Norris, S. P. (1 995 ). The meaning of critical think- telligence, Instruction, and Assessment (67–94).
ing test performance: The effects of abilities Hillsdale, NJ: Erlbaum.
and dispositions on scores. In D. Fasko, Jr. Perkins, D. N., & Ritchhart, R. (2004). When is
(Ed.), Critical Thinking: Current Research, The- good thinking? In D. Y. Dai & R. J. Sternberg
ory, and Practice. Dordect, The Netherlands: (Eds.), Motivation, Emotion, and Cognition:
Kluwer Academic. Integrative Perspectives on Intellectual Function-
Olson, D. (Ed.) (1 974). Media and Symbols: ing and Development. Mahwah, NJ: Erlbaum.
The Forms of Expression, Communication, and Perkins, D. N., & Salomon, G. (1 988). Teaching
Education. Chicago: University of Chicago for transfer. Educational Leadership, Sept., 22–
Press. 3 2.
Perkins, D. N., & Salomon, G. (1 989). Are cog- Salomon, G., & Perkins, D. N. (1 998). Individ-
nitive skills context bound? Educational Re- ual and social aspects of learning. In P. D. Pear-
searcher, 1 8(1 ), 1 6–25 . son, & A. Iran-Nejad (Eds.), Review of Research
Perkins, D. N., Jay, E., & Tishman, S. (1 993 ). in Education: Vol. 2 3 . (pp. 1 –24). Washington,
Beyond abilities: A dispositional theory of DC: AERA.
thinking. The Merrill-Palmer Quarterly, 3 9(1 ), Savell, J. M., Twohig, P. T., & Rachford, D. L.
1 –21 . (1 986). Empirical Status of Feuerstein’s “Instru-
Perkins, D. N., Tishman, S., & Goodrich, mental Enrichment” (FIE) as a Method of Teach-
H. (1 994). Keys to Thinking. Johannesburg: ing Thinking Skills (No. 699). Arlington, VA:
UPTTRAIL Trust. U.S. Army Research Institute.
Perkins, D. N., Tishman, S., Ritchhart, R., Do- Scardamalia, M., & Bereiter, C. (1 996). Engaging
nis, K., & Andrade. A. (2000). Intelligence in students in a knowledge building society. Edu-
the wild: A dispositional view of intellectual cational Leadership, November, 6–1 0.
traits. Educational Psychology Review, 1 2 (3 ), Scardamalia, M., & Bereiter, C. (1 996). Adapta-
269–293 . tion and understanding: A case for new cultures
Perkins. D. N., & Tishman, S. (2001 ). Disposi- of schooling. In S. Vosniadou, E. DeCorte, R.
tional aspects of intelligence. In S. Messick, & Glaser, & H. Mandl (Eds.), International Per-
J. M. Collis (Eds.), Intelligence and Personality: spectives on the Design of Technology-supported
Bridging the Gap in Theory and Measurement Learning Environments (pp 1 49–1 63 ). Mahwah,
(pp. 23 3 –25 7). Mahwah, NJ: Erlbaum. NJ: Erlbaum.
Polya, G. (1 95 4). Mathematics and Plausible Rea- Scardamalia, M., Bereiter, C., & Lamon, M.
soning (2 vols.). Princeton, NJ: Princeton Uni- (1 994). The CSILE Project: Trying to bring the
versity Press. classroom into world 3 . In K. McGilly (Ed.),
Polya, G. (1 95 7). How to Solve It: A New As- Classroom Lessons: Integrating Cognitive Theory
pect of Mathematical Method. Garden City, NY: and Classroom Practice (pp. 201 –228). Cam-
Doubleday. bridge, MA: MIT Press.
Rand, Y., Tannenbaum, A. J., & Feuerstein, R. Scardamalia, M., Bereiter, C., McLean, R. S.,
(1 979). Effects of instrumental enrichment on Swallow, J., & Woodruff, E. (1 989). Computer-
the psychoeducational development of low- supported intentional learning environments.
functioning adolescents. Journal of Educational Journal of Educational Computing Research,
Psychology, 71 , 75 1 –763 . 5 (1 ), 5 1 –68.
Resnick, L. B. (1 987). Education and Learning Scheffler, I. (1 991 ). In praise of cognitive emo-
to Think. Washington, DC: National Academy tions. In I. Scheffler (Ed.), In Praise of Cognitive
Press. Emotions (pp. 3 –1 7). New York: Routledge.
Ritchhart, R. (2002). Intellectual Character: What Schoenfeld, A. H. (1 982). Measures of problem-
It Is, Why It Matters, and How to Get It. San solving performance and of problem-solving in-
Francisco: Jossey-Bass. struction. Journal of Research in Mathematics
Roth, W.-M. B., Michael, G. (1 995 ). Knowing and Teaching, 1 3 , 3 1 –49.
interacting: A study of culture, practices, and Schoenfeld, A. H., & Herrmann, D. J. (1 982).
resources in a Grade 8 open-inquiry science Problem perception and knowledge structure
classroom guided by a cognitive apprenticeship in expert and novice mathematical problem
metaphor. Cognition and Instruction, 1 3 (1 ), 73 – solvers. Journal of Experimental Psychology:
1 28. Learning, Memory, and Cognition, 8, 484–
Salomon, G. (Ed.) (1 993 ). Distributed Cognitions. 494.
New York: Cambridge University Press. Schunk, D. H., & Zimmerman, B. J. (Eds.) (1 994).
Salomon, G., & Perkins, D. N. (1 987). Transfer Self-regulation of Learning and Performance: Is-
of cognitive skills from programming: When sues and Educational Applications. Hillsdale,
and how? Journal of Educational Computing Re- NJ: Erlbaum.
search, 3 , 1 49–1 69. Selz, O. (1 93 5 ). Attempt to raise the level of in-
Salomon, G., & Perkins, D. N. (1 989). Rocky telligence. Zeitschrift für Psychologie, 1 3 4, 23 6–
roads to transfer: Rethinking mechanisms of a 3 01 .
neglected phenomenon. Educational Psycholo- Stanovich, K. E., & West, R. F. (1 997). Reason-
gist, 2 4(2), 1 1 3 –1 42. ing independently of prior belief and individual
differences in actively open-minded thinking. cational Research Association, April 4–8. New
Journal of Educational Psychology, 89(2), 3 42– Orleans, LA.
3 5 7. Tishman, S. (1 995 ). High-level thinking, ethics,
Sternberg, R. J. (1 985 ). Beyond IQ: A Triarchic and intellectual character. Think: The Magazine
Theory of Human Intelligence. Cambridge, UK: on Critical and Creative Thinking, October, 9–
Cambridge University Press. 1 4.
Sternberg, R. J., Ferrari, M., Clinkenbeard, P., Tishman, S., & Perkins, D. N. (1 997). The lan-
& Grigorenko, E. L. (1 996). Identification, in- guage of thinking. Phi Delta Kappan, 78(5 ),
struction, and assessment of gifted children: A 3 68–3 74.
construct validation of a triarchic model. Gifted Tishman, S., Perkins, D. N., & Jay, E. (1 993 ).
Child Quarterly, 40(3 ), 1 29–1 3 7. Teaching thinking dispositions: From transmis-
Sternberg, R. J. (Ed.) (2002). Why Smart People sion to enculturation. Theory into Practice, 3 ,
Can Be So Stupid. New Haven, CT: Yale Uni- 1 47–1 5 3 .
versity Press. Tishman, S., Perkins, D. N., & Jay, E. (1 995 ). The
Sternberg, R. J., Conway, B. E., Ketron, J. L., & Thinking Classroom: Learning and Teaching in
Bernstein, M. (1 981 ). People’s conceptions of a Culture of Thinking. Needham Heights, MA:
intelligence. Journal of Personality and Social Allyn & Bacon.
Psychology, 41 (1 ), 3 7–5 5 . Tversky, A., & Kahneman, D. (1 993 ). Probabilis-
Sternberg, R. J., & Bhana, K. (1 986). Synthe- tic reasoning. In A. I. Goldman (Ed.), Readings
sis of research on the effectiveness of intel- in Philosophy and Cognitive Science (pp. 43 –68).
lectual skills programs: Snake-oil remedies or Cambridge, MA: MIT Press.
miracle cures? Educational Leadership, 44(2), Van Haneghan, J., Barron, L., Young, M.,
60–67. Williams, S., Vye, N., & Bransford, J. (1 992).
Super, C. (1 980). Cognitive development: Look- The Jasper series: An experiment with new
ing across at growing up. In C. Super, & ways to enhance mathematical thinking. In
S. Harkness (Eds.), New Directions for Child D. Halpern (Ed.), Enhancing Thinking Skills
Development: Anthropological Perspectives on in the Sciences and Mathematics (pp. 1 5 –3 8).
Child Development (pp. 5 9–69). San Francisco, Hillsdale, NJ: Erlbaum.
Jossey-Bass. Voss, J. F., Perkins, D. N., & Segal, J. W. (1 991 ). In-
Swartz, R. J., & Parks, S. (1 994). Infusing the Teach- formal Reasoning and Education. Hillsdale, NJ:
ing of Critical and Creative Thinking into Ele- Erlbaum.
mentary Instruction: A Lesson Design Handbook. Vygotsky, L. S. (1 978). Mind in Society: The Devel-
Pacific Grove, CA: Critical Thinking Press and opment of Higher Psychological Processes. Cam-
Software. bridge, MA: Harvard University Press.
Swartz, R. J., & Perkins, D. N. (1 989). Teaching Webster, D. M., & Kruglanski, A. W. (1 994). In-
Thinking: Issues and Approaches. Pacific Grove, dividual differences in need for cognitive clo-
CA: Midwest Publications. sure. Journal of Personality and Social Psychol-
Thorndike, E. L. (1 923 ). The influence of first ogy, 67(6), 1 049–1 062.
year Latin upon the ability to read English. Whimbey, A., & Lochhead, J. (1 986). Prob-
School Sociology, 1 7, 1 65 –1 68. lem Solving and Comprehension. Hillsdale, NJ:
Thorndike, E. L., & Woodworth, R. S. (1 901 ). Erlbaum.
The influence of improvement in one mental Worsham, A. W., & Austin, G. R. (1 983 ). Effects
function upon the efficiency of other functions. of teaching thinking skills on SAT scores. Edu-
Psychological Review, 8, 247–261 . cational Leadership, 41 (3 ), 5 0–5 1 .
Tishman, S. (1 994) Thinking Dispositions and In- Zenke, L., & Alexander, L. (1 984). Teaching
tellectual Character. Paper presented at the thinking skills in Tulsa. Educational Leadership,
1 994 Annual Meeting of the American Edu- 42 (1 ), 81 –84.
Author Index
Abebe, T., 5 1 3 Allan, L.G., 1 46, 1 47, 1 5 3 , 1 5 6

Abel, L., 75 9 Allard, F., 3 3 7
Abelson, R.P., 1 23 , 3 73 , 3 75 , 3 80, 3 87, 3 88 Allen, H.A., 5 07
Abi-Dargham, A., 5 02, 5 1 2 Allen, S.W., 1 4, 28, 1 79, 1 80, 440, 442, 447, 73 3
Ackerman, B.P., 5 48 Allport, F.H., 295
Ackerman, P.L., 75 2 Altmann, G., 444
Ackermann, H., 441 Altom, M.W., 45
Acredolo, L., 647 Altran, S., 5 6, 5 9, 1 02, 1 03 , 1 06
Acuna, B.D., 482 Amabile, T.M., 3 5 1 , 3 5 6, 3 5 8, 3 60
Adams, H.F., 766 American Psychiatric Association, 494, 495
Adams, J.L., 3 3 1 , 3 5 3 Ames, G.J., 3 80
Adams, M.J., 778, 779, 781 Anderson, C., 3 87
Adamson, L.B., 675 Anderson, J.A., 46, 76, 77
Adelson, B., 3 3 7 Anderson, J.R., 4, 41 , 43 , 46, 1 29, 1 3 7, 1 45 , 1 48, 1 5 3 ,
Adey, P., 783 , 785 1 74–1 76, 21 0, 280, 3 27, 3 71 , 3 73 , 403 , 404, 406,
Adolphs, R., 5 1 6 407, 409, 41 0, 41 6, 41 9, 420, 422, 425 , 43 3 , 443 ,
Agnetta, B., 625 468, 470, 5 3 2, 622, 71 7, 742, 75 1 , 791
Agnoli, F., 268, 279 Anderson, L., 623
Aha, D.W., 1 4 Anderson, M.C., 5 08
Ahn, W-K., 5 4, 5 6, 5 7, 1 08, 1 09, 1 46, 1 49, 71 0 Anderson, N.H., 271
Ainslie, G., 25 6 Anderson, R.C., 3 24, 3 84
Aitken, M.R.F., 1 49 Anderson, T., 201
Ajjanagadde, V., 87, 5 3 6 Andrade, A., 785
Ajzen, I., 3 80, 3 93 Andreasen, N.C., 494, 495 , 496, 499, 5 00, 5 06, 5 09,
Alba, J.W., 3 3 2, 3 42, 3 5 6 516
Alcock, K.J., 766 Andrews, F.M., 3 5 5
Alexander, G.E., 462 Andrews, G., 5 3 7, 5 3 9, 5 41 , 5 44, 5 46, 5 47, 5 49
Alexander, L., 780 Ani, C., 766
Alfonso-Reese, L.A., 46, 41 6 Anzai, Y., 3 3 0
Alibali, M.W., 21 8, 3 40 Apanovitch, A.M., 298
Alicke, M.D., 3 00 Arabie, P., 22
Alksnis, O., 201 Arcediano, F., 1 5 2
803
804 author index
Argyris, C., 781 Barnet, R.C., 1 48, 1 5 1

Ariely, D., 284, 287 Barnett, S.M., 1 3 1
Aristotle, 1 1 8, 1 21 , 1 45 Baron, J.B., 776, 780, 785
Arkes, H.R., 25 9 Baron, R.M., 3 81
Arnold, S.E., 5 1 3 Baron-Cohen, S., 670
Arocha, J., 1 28, 3 24, 73 1 , 73 2, 73 7, 73 9, 742 Barr, R.G., 675
Aronson, E., 3 80 Barrett, P.T., 75 9
Arrow, K.J., 244 Barron, F., 3 5 1 , 3 5 4, 3 5 8
Ashby, F.G., 46, 25 8, 268, 41 6, 5 78 Barron, L., 790
Ashcraft, M.H., 3 73 , 5 74 Barrouillet, P., 1 90, 5 42, 5 43
Aslin, R.N., 43 1 , 441 Barsalou, L.W., 3 8, 60, 1 3 6
Astington, J.W., 5 47, 671 , 672 Barston, J.L., 1 79, 480
Atkins, B.T.S., 5 4 Bart, W.M., 26
Atkinson, J.W., 3 5 8 Barth, H.C., 5 72, 5 73 , 5 76, 5 78
Atkinson, R.C., 45 7 Barwise, J., 1 94, 209, 65 0
Atran, S., 60, 63 Bassok, M., 1 5 , 26, 76, 1 1 7, 1 28, 1 3 1 , 1 3 6, 21 8, 221 ,
Attneave, F., 1 6 23 0, 3 3 5 , 3 40, 3 41 , 3 42, 3 72, 3 84, 401 , 41 8, 5 41 ,
Au, 5 8, 5 9 674, 705 , 706, 707, 727, 728, 73 5 , 789, 790
Aujla, P., 25 2, 25 9 Bastardi, A., 25 2
Austin, G.A., 4, 3 9, 706 Bateson, G., 3 62
Austin, G.R., 780 Batt, R., 75 7
Austin, R., 5 1 3 Bauer, M.I., 1 93 , 227
Avis, J., 672 Baumeister, R.F., 297, 3 1 2
Awh, E., 482 Baving, L., 5 07
Ayton, P., 278 Bazana, P.G., 75 9
Bazerman, M.H., 25 3 , 25 4
Babcock, L., 62, 25 9, 3 92 Beall, A.C., 21 8
Babcock, R.L., 5 95 , 5 98, 5 99 Beauregard, K.S., 298, 3 00
Bachman, 3 04, 441 , 45 7 Beck, B.B., 620, 621
Bachnan, H.J., 766 Beck, J., 3 1
Bacon, F., 706 Beckers, T., 1 49
Baddeley, A.D., 41 2, 45 8, 45 9, 460, 461 , 462, 463 , Beeghly, M., 5 47
464, 467, 468, 469, 5 07 Been, S.L., 1 49
Badre, 5 09 Behr, S.E., 43 2
Baer, J., 3 5 8 Beilin, H., 5 3 0
Bailenson, J.N., 5 9, 1 03 Bell, V., 1 93 , 1 96
Baillargeon, R., 5 8, 1 46, 5 3 0, 5 3 5 , 5 81 , 623 , 63 5 , 65 4 Bellinger, D.C., 766
Bain, J.D., 5 3 7, 5 40 Bem, D.J., 296
Bakeman, R., 675 Bender, R.S., 222
Baker, A.G., 1 5 6 Benes, F.M., 5 1 3
Baker, J.G., 766 Benet-Martinez, V., 673
Baker, L.M., 71 2 Benjamin, T.B., 5 08
Baker, M., 640, 65 4 Benson, D., 73
Balaban, M.T., 65 4 Benveniste, S., 646
Balakrishnan, J.D., 5 78 Beran, M.J., 5 66, 609, 61 5 , 61 6, 61 7
Balda, R.P., 61 3 Bereiter, C., 3 72, 3 89, 777, 783 , 787, 788, 789, 790,
Baldauf, 779 791 , 792
Baldus, D., 699 Berenbaum, H., 499, 5 06, 5 07, 5 09, 5 1 0
Ball, T.M., 21 3 Berlin, 5 9, 1 06
Ballas, J.A., 444 Berlin, B., 63 9
Balota, D.A., 42 Berman, K.F., 5 1 2
Baltes, P.B., 761 Berman, W., 765
Bandalos, D.L., 600 Bernardo, A.B.I., 1 3 0
Banks, W.P., 209, 225 Bernstein, I.S., 61 0
Bara, B.G., 1 73 , 1 91 , 1 92 Bernstein, L.E., 26
Barch, D., 499, 5 00, 5 02, 5 06, 5 09, 5 1 5 Bernstein, M., 75 2, 780
Barclay, J.R., 1 29, 1 87 Berntson, G.G., 271 , 5 45 , 5 66, 61 3 , 61 4, 61 6, 61 7, 61 8
Bard, K.A., 622 Berry, C.J., 1 5 8
Bargh, J.A., 271 , 296, 441 Berry, D., 441
Bargmann, 700 Berry, J.W., 669
Bar-Hillel, Y., 1 85 , 274, 278, 280 Bersheid, E., 3 03 , 3 05
author index 805
Berthoz, A., 21 5 Bookin, H., 1 21

Berti, A., 441 Boole, G., 5 29
Bertin, J., 226 Booth, 63
Besche, C, 5 06 Boring, E., 75 2
Best, B., 425 Boroditsky, L., 1 20, 209, 221 , 227, 643 , 648, 649
Best, C., 63 8 Boronat, C., 1 21
Betrancourt, M., 21 6, 23 0 Bors, D.A., 75 6, 75 7
Bettman, J.R., 25 7, 260 Boshuizen, H.P., 73 6, 73 7
Betz, J., 41 6, 422 Botha, L., 787
Beveridge, M., 1 3 0 Bothell, D., 404, 425
Bianco, A.T., 3 08 Botto-Mahan, C., 620
Bidell, T.R., 5 3 6 Botvinick, M.M., 5 02
Biederman, I., 24, 26, 27, 74, 83 , 84 Bouquet, F., 63 8
Bilder, R.M., 5 1 3 Bourne, L.E., 3 9, 671
Billman, D.O., 1 23 , 5 42 Bouton, M.E., 1 49
Binck, 446 Bovet, D., 61 1
Binet, A., 75 3 Bowden, D., 5 47
Bird, E.D., 5 1 3 Bowden, E.M., 3 43
Birnbaum, M.H., 280 Bowdle, B., 1 21
Birney, D., 1 93 Bower, G.H., 1 4, 3 9, 41 , 43 , 46, 3 73 , 41 5 , 41 6
Biro, S., 5 41 , 5 68 Bowerman, M., 63 6, 643 , 644
Bishop, B., 3 87 Bowers, K.S., 3 62
Bjorklund, D.F., 469, 5 3 0, 760 Boyd, M.G., 61 0
Black, J., 5 3 5 Boyd, R., 607
Black, J.B., 1 87, 209, 3 3 1 Boyer, C.B., 5 60
Black, M., 1 20, 1 21 Boyer, E., 776
Black, T., 21 5 Boyes-Braem, P., 40, 5 45
Blader, S.L., 3 06 Boyle, F.M., 5 44
Blagg, N., 778 Boysen, S.T., 5 45 , 5 66, 61 1 , 61 2, 61 3 , 61 4, 61 6, 61 7,
Blanchette, I., 1 1 7, 1 24, 1 25 , 1 29, 71 3 61 8, 623
Blank, H., 279 Bradley, G.W., 296, 297
Blaye, A., 5 43 Bradley, R.H., 765
Bless, H., 25 8, 268, 3 1 2 Bradshaw, G.L., 3 5 2, 3 5 7, 3 62, 71 9
Bleuler, E., 494, 499, 5 1 6 Bradshaw, J.L., 760
Bloch, B., 63 9 Braine, M.D.S., 1 71 , 1 87, 475 –476
Blois, M.S., 73 4 Brainerd, C.J., 469, 5 3 0, 5 45 , 784
Blok, S., 5 8 Braisby, 5 7
Bloom, B.S., 765 Braithwaite, V.A., 489
Bloom, L., 671 Brand, J., 5 46
Bloom, P., 63 7, 640, 649 Brandeis, L., 693
Blount, S., 25 3 Brannon, E., 5 64, 5 66, 5 67, 5 68, 5 70, 5 80, 5 81 , 61 4,
Blum, B., 1 44 61 5
Blum, J.S. Bransford, J.D., 1 29, 1 87, 3 24, 3 62, 789, 790
Blum, N.A., 5 06 Brauner, J., 71 7
Blum, R.A., 61 0 Braver, T.S., 41 3 , 5 00, 5 01 , 5 02, 5 07, 5 08, 5 1 6
Blumberg, S.J., 25 8 Bray, N.W., 5 3 6
Blumer, C., 25 9 Brayshaw, M., 1 28, 1 3 1
Boag, C., 5 49 Breinliger, K., 63 5
Bobrow, D.G., 3 74, 73 7 Brem, S., 71 4
Boden, M.A., 3 5 7 Brendl, C.M., 271
Bodenhausen, G.V., 273 Brenneman, K., 5 80
Bodner, M., 461 Breslow, L., 5 45
Boeck, P. de, 43 Bretherton, I., 5 47
Boesch, C., 609 Brewer, W.F., 1 87, 3 74, 3 81 , 3 88, 5 49
Boesch, H., 609 Briars, D., 5 3 7
Boettger, R., 3 05 Broadbent, D.E., 43 6, 441
Bogerts, B., 5 1 3 Broca, P., 478
Bohr, N., 3 82 Brody, H., 43 3
Boltzmann, 1 86 Brody, N., 75 2, 782
Bond, M., 609, 61 3 Broe, M.B., 26
Bookheimer, S.Y., 71 7 Broman, M., 441
806 author index
Bronfenbrenner, U., 762 Calvin, W.H., 760

Brooks, L.R., 1 4, 28, 45 , 46, 5 7, 62, 443 , 444, 45 9, Camacho, C.J., 3 08, 3 09, 3 1 0
73 3 Camerer, C., 245 , 25 6, 25 9
Brooks, P.J., 5 48 Campbell, D., 3 5 9, 3 62
Brooks, R., 3 80 Campbell, D.T., 696
Brown, A.L., 1 3 0, 3 76, 764, 779, 781 , 791 , 792, 794 Campbell, J.I., 5 74
Brown, D., 1 06, 3 87, 5 41 , 5 42 Campbell, K.B., 75 9
Brown, J.S., 789 Campione, J.C., 791 , 792, 794
Brown, R., 421 , 63 9, 640, 643 , 647 Canfield, R.L., 441
Bruer, 791 Cannon, 3 04, 441 , 45 7
Brumby, M., 3 87 Cannon, T.D., 494, 5 09, 5 1 1 , 5 1 2, 5 1 3 , 5 1 6
Bruner, J.S., 4, 3 9, 209, 296, 5 3 0, 5 3 1 , 663 , 664, 672, Cantor, N., 75 2
675 , 706, 707, 708, 792 Capedevielle, B., 787
Brunet, C., 609 Capodilupo, S., 784
Brunswik, E., 268, 270 Cappa, S., 71 8
Bryant, D.J., 21 9 Cara, F., 1 78
Bryant, P.E., 5 3 0, 5 44 Caramazza, A., 1 8, 5 4, 60, 1 87, 3 87, 485 , 488, 71 5
Bucciarelli, M., 1 73 , 1 92, 1 94, 1 97 Carbonell, J.G., 1 1 7
Buchel, C., 1 80, 479 Cardozo, B., 686, 689, 693 , 698
Buck, E., 1 73 , 1 94 Carey, G., 760
Buckinghham, D., 5 3 2, 5 3 5 , 5 48 Carey, S., 28, 5 7, 5 8, 5 9, 1 01 , 1 03 , 1 08, 1 09, 3 74, 3 89,
Buckley, P.B., 5 69, 5 70 5 40, 5 80, 5 81 , 5 82, 63 5 , 642, 649, 65 0, 65 3 , 71 5 ,
Buckner, E., 498 783
Buehler, R., 25 5 Carlson, R., 443
Buehner, M.J., 96, 1 01 , 1 22, 1 47, 1 5 7, 1 5 8, 1 5 9, 1 63 , Carlson, S.M., 5 47
21 7, 221 , 228, 3 01 , 3 24, 3 80, 5 47, 5 89, 608, 705 , Carmichael, R., 221
71 0, 73 7 Carnap, R., 96, 97, 1 02, 1 07, 1 85
Buhrman, A., 5 42 Caroff, X., 5 44
Bullock, M., 1 46, 5 80 Caroll, J.D., 1 5
Bunch, K.M., 5 47 Carpenter, P.A., 1 1 8, 227, 3 3 1 , 404, 405 , 41 2, 41 6, 425 ,
Bundesen, C., 21 3 466, 468, 75 7
Bundy, D.A., 766 Carr, J.A., 5 68
Burdyn, L.E., 61 1 Carr, T.H., 469
Burge, T., 5 2 Carraher, D., 677, 762
Burgess, C., 78 Carraher, T.N., 762
Buridan, J., 3 87 Carreiras, M., 1 93 , 1 97
Burke, J., 1 49, 779 Carriedo, N., 1 93
Burkhart, J., 1 49, 1 5 9, 1 60 Carroll, J.B., 602, 75 4, 75 5
Burroughs, E.K., 223 Carroll, J.M., 21 6, 227
Burton, S.J., 686 Carruthers, P., 65 1 , 708
Burzio, L., 423 Carson, R., 282
Buschbaum, M.S., 75 9 Carswell, C.M., 227
Busemeyer, J.R., 47 Carter, C.S., 425 , 5 00, 5 02, 71 7
Bush, G.H.W., 1 26 Carullo, J.J., 466
Bush, G.W., 1 25 , 1 27, 1 3 2 Caruso, D., 75 2
Bush, R.R., 20 Case, R., 5 3 1 , 5 3 7, 5 41 , 5 48, 783 , 784
Bushy, 790 Catania, A.C., 5 68
Buss, R.R., 43 4 Catrambone, 1 3 0, 3 40, 41 8
Bykofsky, S., 3 74 Cattell, R., 1 1 8, 75 4, 764
Byrne, M., 5 1 3 Cave, C.B., 482
Byrne, R.M.J., 1 69, 1 72, 1 73 , 1 78, 1 87, 1 88, 1 89, Caverni, J-P., 1 97
1 92–1 97, 1 99, 200, 476, 5 41 , 5 42 Ceci, S.J., 3 75 , 5 40, 677, 762, 764–765 , 766, 789
Byrne, R.W., 760 Cervone, D., 786
Césaire, A., 666
Cabeza, R., 43 8 Chaika, E., 498
Cacioppo, J.T., 271 , 3 00, 3 1 2, 786 Chaiken, S., 267, 299, 3 02, 3 1 1 , 3 1 2, 3 80
Caldwell, B.M., 765 Chalmers, M., 5 45 , 61 3
Call, J., 73 , 5 47, 608, 609, 61 3 , 61 5 , 61 7, 61 9, 620, Chamberlain, N., 1 25
621 , 624, 625 , 626, 65 3 Chamberlin, T.C., 696
Callahan, C.C., 693 Chambers, D., 743
Callicott, J.H., 5 09, 5 1 2 Chance, S.S., 21 8
author index 807
Chandler, M.J., 674 Clibbens, J., 1 74–1 76

Chandrasakeran, B., 73 7, 73 8 Clifford, D., 1 5 7, 1 5 8
Changeaux, J.P., 5 66 Clinkenbeard, P.R., 763 , 765 , 782
Chapman, D.P., 25 9 Clore, G.L., 271 , 273 , 3 1 1 , 3 1 2
Chapman, G.B., 1 46, 1 48, 272, 740 Coblenz, 62
Chapman, J.P., 5 04, 5 08 Cofer, C.N., 445
Chapman, L.J., 5 08 Cohen, A., 21 9
Chapman, M., 5 3 1 Cohen, F.S., 691 , 692, 694
Chase, V.M., 3 41 Cohen, G.L., 3 00
Chase, W.C., 789 Cohen, J.D., 1 94, 41 3 , 469, 5 00, 5 01 , 5 07, 5 08, 5 1 1 ,
Chase, W.G., 224, 228, 268, 3 3 6, 3 3 7, 3 75 5 1 6, 5 78
Chater, N., 26, 27, 1 01 , 1 70, 1 74–1 75 , 1 76, 1 77, 1 78, Cohen, L.J., 1 75 –1 77, 5 64
444 Cohen, M.S., 71 7
Chatlosh, D.L., 1 5 6 Cohen, N.J., 43 2
Chein, J.M., 468, 470 Cole, M., 666, 667, 676, 761
Chen, S., 3 02, 3 1 1 , 3 1 2, 61 4 Coley, J.D., 44, 5 9, 1 02, 1 03 , 1 05 , 1 06, 1 07, 1 09
Chen, Y., 21 8 Collier, G., 766
Chen, Z., 1 23 , 1 3 0, 5 3 7 Collier, J.F., 670
Cheng, P.W., 96, 98, 1 01 , 1 22, 1 45 , 1 46, 1 48, 1 49, 1 5 2, Collins, A.M., 1 4, 22, 28, 3 9, 41 , 42, 249, 3 74, 3 75 ,
1 5 3 , 1 5 4, 1 5 5 , 1 5 7, 1 5 8, 1 5 9, 1 60, 1 61 , 1 70–1 77, 1 78, 789, 792, 793
201 , 21 7, 221 , 228, 3 01 , 3 24, 3 80, 485 , 5 41 , 5 43 , Collins, K.W., 466
5 47, 5 89, 608, 65 1 , 705 , 71 0, 73 7 Collins, M.A., 3 60
Cherry, E.C., 43 6 Collyer, C.E., 5 45
Cherubini, P., 1 92, 480 Colombo, M., 61 0, 61 3 , 61 4
Chi, M.T.H., 1 5 , 1 1 8, 224, 228, 3 25 , 3 3 5 , 3 3 7, 3 75 , Colunga, E., 63 , 640
3 76, 3 77, 3 81 , 3 82, 3 83 , 3 84, 3 85 , 3 87, 3 90, 3 91 , Colvin, M.K., 71 7
3 92, 41 8, 705 , 71 5 , 721 , 73 2, 73 5 , 776, 791 Confrey, J., 3 87
Chierchia, G., 43 Conrad, A.J., 5 1 3
Chiesi, H.L., 3 76 Conrad, F.G., 7, 49
Childs, C.P., 677 Conrad, R., 45 8
Chinn, C.A., 3 81 , 3 88 Conway, A.R.A., 466
Chipman, S.F., 21 2 Conway, B.E., 75 2, 780
Chiu, C.Y., 673 , 674 Conway, M., 299
Choi, I., 63 , 644, 676 Cook, B., 5 78
Choi, S., 648 Coon, H.M., 664, 670, 760
Chomsky, N., 5 3 6, 63 5 , 63 9, 642, 65 2, 65 4 Coon, V., 21 9
Chong, R.S., 425 Cooper, F.S., 63 8
Choplin, J.M., 87 Cooper, L., 21 1 , 21 3 , 3 3 7
Christal, R.E., 466, 5 95 , 75 7 Cooper, R., 65 0
Christian, K., 766 Coppola, M., 65 3
Christoff, K., 469 Corballis, M.C., 760
Chronicle, E.P., 3 27, 3 43 , 3 44 Corballis, P., 71 8
Chueh, D., 75 9 Cordes, S., 5 71 , 5 73 , 5 74, 5 76, 5 77, 5 78, 5 79, 5 80
Church, R.M., 5 63 , 5 64, 5 65 , 5 66 Coren, S., 222
Churchill, W., 1 25 , 1 26 Cork, R.C., 43 2
Ciarrochi, J., 75 2 Corkin, S., 43 9
Cicinelli, J.G., 21 1 Corrigan, J., 73 8
Cilliers, C., 787 Corry, R., 5 00
Cirillo, M.A., 5 1 2 Corter, 1 00
Claire, T., 5 1 Corter, J.E., 1 9
Clancey, W.J., 675 , 729, 73 0, 73 3 Cosmides, L., 61 , 1 73 , 1 77, 1 98, 279, 287, 485 , 5 41 ,
Claparède, E., 43 2, 43 9 5 43 , 760
Clapper, J., 46 Costa, A.L., 793
Clark, H., 63 7 Costello, F.J., 3 84
Clark, R.A., 3 5 8 Cosway, R., 5 1 3 , 5 1 4
Clark, S.E., 46 Coughlin, L.D., 3 3 7
Clearfield, M., 5 80 Coupe, P., 222
Cleeremans, A., 43 3 , 447 Cowan, N., 84, 462, 463 , 468, 470
Clement, C.A., 1 29 Cox, C.M., 3 5 4
Clement, J., 61 , 3 62, 3 87, 71 5 Cox, J.R., 1 75 –1 76, 1 77, 485
Cleveland, W.S., 227 Cozolino, L.J., 5 08
808 author index
Craik, F.I., 75 2 De Leeuw, N., 3 92

Craik, K., 1 86, 203 de Ribeauspierre, A., 668
Cramer, A.E., 609 De Silva, V., 1 6
Creem, S.H., 220 De Vooght, G., 1 93 , 1 96
Crisafi, M.A., 5 42 De Wit, H., 5 03
Critchley, M., 498 Deakin, J.M., 3 3 7
Croft, D., 71 3 Deary, I.J., 75 5
Crosbie, S., 499 Debner, J.A., 43 8
Cross, D., 5 47, 671 , 672 Decety, J., 21 5
Crowe, E., 3 07, 3 09 Deffenbacher, 729
Crowley, 720 Dehaene, S., 43 1 , 43 7, 43 8, 440, 482, 5 64, 5 66, 5 69,
Crutchfield, R., 3 5 8 5 76, 5 78, 5 79, 649
Csibra, G., 5 41 Dehaene-Lambertz, G., 440, 5 64, 63 8
Csikszentmihalyi, M., 3 5 1 , 3 5 8, 3 5 9, 3 60, DeKay, M.L., 285
3 61 , 3 62 Delius, J.D., 5 45
Cummins, D.D., 201 , 5 43 Dell, G.S., 5 06, 641
Cummins, R., 476 DeLoache, J.S., 3 76, 61 2, 764
Curtis, L.E., 5 78 Demasters, S., 3 87
Curtiss, S., 65 0 Demorest, M.E., 26
Cutting, J.C., 5 04 Dempster, F.N., 1 28, 469, 760
Cutting, J.E., 21 7 Denis, M., 21 3
Cziko, G.A., 3 5 9 Dennett, D.C., 61 , 442
Dennis, M.J., 1 49, 200
da Vinci, L., 3 5 3 Dennis, S., 5 3 1
Daehler, M.W., 1 3 0 DeRenzi, E., 45 9
Dagenbach, D.E., 469 Dermer, M., 3 03
D’agostino, P.R., 3 1 2 Descartes, R., 442, 63 4
Daily, L.Z., 41 3 Desimpelaere, C., 1 93
Dallas, M., 271 Desmedt, J., 84
Dalton, C., 5 49 Desvouges, W.H., 283
Damasio, 60 Detterman, D.K., 75 1 , 765 , 784, 789
D’Amato, M.R., 61 0, 61 3 , 61 4 Deutsch, D., 1 86, 1 98
Daneman, M., 41 3 , 43 7, 466, 75 7 Devlin, 60
Daniel, D.G., 5 1 2 DeVos, J., 5 3 5
Daniel, M.H., 75 2 Dewey, G.
Danks, D., 1 09, 1 44, 1 48 Dewey, G.I., 47
Darden, L., 3 81 , 3 88, 71 9 Dewey, J., 775 , 776, 785 , 787
Darlington, R., 765 Diamond, A., 469, 5 3 7
Darwin, C., 1 1 7, 1 1 8, 3 60 Diamond, P., 247, 283
Dasen, P.R., 668, 669, 792 Dickinson, A., 1 49, 1 62
Daum, I., 441 Diekmann, K.A., 25 4
Dauphinais, I.D., 5 1 3 Dienes, Z., 1 79, 43 3 , 444
David, A.S., 5 08 Dierckx, V., 1 93
Davidoff, J., 65 5 Diesendruck, 5 7
Davidson, J.E., 3 5 6 Dijksterhuis, A., 3 03
Davies, I., 65 5 DiSessa, A.A., 226, 3 74, 3 87
Davies, M., 75 2 Dittmann-Kohli, F., 761
Davis, A.S., 5 04 Ditto, P.H., 298, 299
Davis, H., 5 78 Diwadkar, V.A., 223
Davis, H.L., 5 47 Dixon, R.A., 761
Davis, J.D., 47 Docherty, N.M., 5 09, 5 1 6
Davis, S.N., 3 60 Dodd, B., 499
Dawes, R.M., 243 , 25 2 Doherty, M.E., 1 01 , 5 48, 706, 708
Dawson, M.E., 5 04 Dolan, J.R., 43 7, 43 8, 479, 483 , 485 , 487, 71 8
Dayton, T., 3 42 Dominowski, R.L., 3 42
De Blois, S.T., 609 Don, A.J., 447
de Bono, E., 3 5 2, 778, 781 Donaldson, M.S., 5 3 0, 5 44, 73 8
de Groot, A.D., 228, 3 3 6, 3 75 , 789 Donis, K., 785
De Haan, E.H.F., 441 Donoghue, J.P., 482
De Houwer, J., 1 49 Donoghue, T., 25 6
de Kleer, J., 1 87 Doosje, B., 298
author index 809
Dorfman, J., 445 Ekelund, J., 5 1 3

Dorken, M.D., 444 Ekman, G., 20
Dosher, B.A., 646 Elek, S.M., 1 5 6
Douglas, W.O., 693 Eliassen, J.C., 482
Douglass, S., 41 9, 420 Elio, R., 1 78
Doumas, L.A.A., 24, 46, 77, 83 , 87, 1 21 , 1 29, 1 3 2, 1 3 4, Eliot, T.S., 3 60
1 3 5 , 1 5 3 , 21 0, 45 7, 461 , 469, 470, 5 3 2, 5 3 6, 61 0, Elliot, R., 43 7, 43 8
75 1 Ellis, M.C., 200
Dowie, J., 728 Ellsworth, P.C., 1 1 7, 1 28, 1 5 1 , 1 64, 3 24, 693
Downs, J.S., 25 3 Elman, J.L., 78, 79, 422, 5 3 4, 5 3 5 , 5 3 6, 5 40,
Draine, S.C., 43 7 5 45
Driver, R., 3 87 Elstein, A.S., 728, 729, 73 0, 73 1 , 740
Dronkers, N.F., 499 Elster, J., 25 7
Duckworth, J., 766 Embretson, S., 75 2
Duff (Dugg?), S., 409 Emmorey, K., 21 8
Dufresne, R., 29, 3 3 7, 73 5 Enders, C.K., 600
Duguid, P., 789 Endo, Y., 3 1 2
Dulaney, D.E., 277, 443 Engle, R.W., 41 2, 41 3 , 466, 467, 469, 5 02, 5 1 2, 5 1 6,
Dumais, S.T., 5 2, 78 75 7
Dumas, C., 5 45 , 609 Englekamp, J., 21 5
Dunbar, K., 1 3 , 1 07, 1 1 7, 1 24, 1 25 , 1 28, 1 29, 1 5 4, 3 24, Ennis, R.H., 780, 785
3 3 9, 3 62, 3 80, 43 1 , 5 47, 5 48, 686, 708, 709, 71 0, Epstein, S., 268
71 1 , 71 2, 71 3 , 71 4, 71 6, 71 7, 71 8, 720, 721 , 73 0 Erber, M.W., 3 1 1
Duncan, J., 1 20, 5 02 Erber, R., 3 1 1
Duncan, M., 5 42 Erdelyi, M.H., 43 1 , 43 4, 43 7, 43 9
Duncan, S., 223 Erdos, P., 71 9
Duncker, K., 4, 1 22, 209, 3 21 , 3 24, 3 25 , 3 27, 3 29, 3 3 1 , Erhlich, K., 3 75
3 3 2, 3 3 4, 3 3 6, 3 40, 3 42, 3 45 , 3 5 6 Erickson, J.R., 1 91
Dunn, D.S., 25 1 Erickson, M.A., 46
Dunn, M.A., 3 00 Ericsson, K.A., 3 27, 3 72, 462, 728, 789
Dunnan, N., 3 74 Eriksen, C.W., 43 3 , 43 7
Dunning, D., 25 8, 296, 297, 298, 3 00, 3 04 Ernst, G.W., 3 24, 3 29
Dupin, J.J., 3 87 Eshelman, L., 73 0
Dupoux, E., 5 69 Esiri, M.M., 5 1 3
Duranti, A., 671 Espino, O., 1 94, 1 97
Durgin, F., 21 7 Estes, W.K., 1 4, 45 , 75 8
Durso, F.T., 3 42, 3 43 Etchemendy, 209
Dweck, C.S., 786 Ettinger, 1 28
Dworkin, R., 695 Evans, D., 647, 73 1
d’Ydewalle, G., 1 74–1 75 , 1 76, 1 88, 1 93 , 1 96 Evans, J.S., 1 69, 1 70, 1 71 , 1 72, 1 73 , 1 74–1 75 , 1 76, 1 77,
1 78, 1 79, 1 80, 1 81 , 1 87, 1 92, 1 96, 1 99, 200, 203 , 221 ,
Eagley, A.H., 3 02, 3 80 228, 267, 279, 3 24, 3 76, 464, 475 , 476, 480, 487,
Earle, W.B., 297 5 43 , 5 89, 676, 686, 705 , 71 2, 73 0
Ebbinghaus, H., 43 2 Evans, T.G., 96, 1 1 9, 1 22
Eberhardt, S.P., 26 Eysenck, H.J., 3 5 8, 75 9
Echols, C.H., 1 3 0, 5 42
Edelman, S., 1 7, 74, 84 Facione, N.C., 785
Edelson, S.M., 45 Facione, P.A., 785
Edginton, D., 1 72 Fadiga, L., 671
Edwards, J., 779 Fagley, N.S., 25 8
Edwards, W., 1 98 Fagot, J., 61 2
Egan, D.E., 228, 3 3 7 Fahy, T., 766
Egan, M.F., 5 1 3 Fajtlowicz, S., 71 9
Egeland, B., 765 Fales, C.L., 71 7
Ehrlich, H.J., 3 81 Falk, R., 1 97
Ehrlich, K., 1 87 Falkai, P., 5 1 3
Eich, E., 43 6 Falkenhainer, B., 24, 76, 88, 1 20, 1 28, 1 3 1
Eimas, P., 63 8 Falmagne, R.J., 5 43
Einhorn, H.J., 1 63 , 201 Fangmeir, T., 1 88
Einstein, A., 3 1 , 3 60, 700, 706 Fanselow, M.S., 5 03
Eisler, H., 20 Fant, G., 21
81 0 author index
Farady, 790 Flavell, J.H., 5 47, 764

Farah, M.J., 60, 21 3 Fleischman, D.A., 441
Faraone, S.V., 5 1 1 , 5 1 3 Flitman, S.S., 71 7
Fargis, P., 3 74 Flora, J., 209, 225
Farr, M., 3 76 Florack, A., 3 1 2
Farrell, J.E., 21 3 Flukes, J.E., 5 3
Faucher, L., 1 09 Fobes, J.L., 61 0
Fauconnier, G., 1 20 Fodor, J.A., 1 5 , 21 , 3 9, 43 , 5 3 , 74, 1 77, 468, 476, 5 3 5 ,
Faust, D., 25 2 5 3 6, 5 40, 5 41 , 5 45 , 63 4
Faustman, W.O., 5 05 Fong, G.T., 299
Fay, A.L., 5 48 Forbus, K.D., 24, 74–88, 1 23 , 1 3 1 , 1 3 3 , 71 4
Fazio, F., 71 8 Ford, J.M., 5 05 , 5 1 5
Fazio, R.H., 3 03 , 3 04 Ford, K.M., 3 72
Fearing, D.D., 5 45 Ford, T.E., 3 03
Fechner, 5 76 Forde, 60
Feigenson, L., 5 80, 5 81 Forgas, J.P., 3 1 1 , 3 1 2, 75 2
Fein, S., 3 00 Forrin, B., 75 6, 75 7
Feinstein, J.A., 3 00, 786 Förster, J., 3 08, 3 1 1
Feist, G.J., 3 5 1 , 3 5 8 Forster, K.I., 446
Feldman, D.H., 3 5 1 , 3 60 Forsythe, S., 5 1 3
Feltovich, P.J., 1 5 , 3 25 , 3 72, 3 75 , 3 84, 729 Fox-Keller, E., 71 4
Ferguson, R.W., 71 4 Fragaszy, D.M., 622
Ferguson, W., 3 75 , 793 Francis, M., 3 3 5
Fernald, 675 Frank, J., 691 , 789
Fernald, L., 766 Frankfurter, F., 693
Fernyhough, C., 5 3 1 Franklin, N., 21 8, 21 9
Ferrara, R.A., 791 Franks, B., 5 0, 5 4, 5 7
Ferrari, M., 3 87, 763 , 765 , 782 Franks, J.J., 1 29, 1 87
Ferrer-Caja, E., 603 , 604 Frederick, S.W., 1 00, 1 1 1 , 1 75 –1 77, 1 80, 1 97, 1 99, 25 2,
Fersen, L., 5 45 25 6, 25 9, 268, 269, 270, 273 , 283 , 286, 3 01 , 476,
Festinger, L., 296, 3 80, 3 88 73 9, 775
Feuerstein, R., 766, 778, 782, 787, 788 Frederickson, B.L., 25 9, 284, 285
Feynman, J., 706 Frederickson, J., 3 87
Fiddick, L., 61 , 1 73 , 1 77 Frederiksen, J.R., 73 7
Field, D., 5 3 0 Frege, G.W., 96
Fiese, B.H., 766 Freides, D., 5 06
Fiez, J.A., 468, 470 Freitas, A.L., 3 09, 3 1 0
Fileteo, J.V., 47 Freko, D., 45
Fillmore, C.J., 5 4 Frensch, P.A., 440, 441 , 729
Fincham, J.M., 46, 41 9, 420, 426, 71 7 Freud, A., 3 63
Fincher-Kiefer, R., 3 1 2 Freud, S., 4, 295 , 3 5 3 , 3 60, 3 63 , 499
Findlay, C.S., 3 62 Freund, T., 3 01 , 3 03 , 3 04, 3 1 2
Fink, G.R., 488 Freyd, J., 222
Finke, R., 21 1 , 21 3 , 21 4, 3 5 1 , 3 5 6, 3 62 Friedman, L.M., 695
Finucane, M., 25 8, 268, 273 Friedman, R., 642, 643
Fischer, K.W., 5 3 1 , 5 3 6, 5 3 7, 783 Frisch, S.A., 26
Fischler, H., 71 4 Fristoe, N., 5 94, 5 95
Fischoff, B., 1 74–1 76, 247, 280, 283 , 3 05 Frith, C., 479, 5 05
Fishbein, M., 3 80, 3 93 Frohardt, R., 1 49
Fisher, C., 63 7, 640 Frost, R., 1 20
Fisher, D.H., 1 00 Frost, T., 61 0
Fisher, W.W., 691 , 693 Frye, D., 5 3 8, 5 47, 5 48
Fisk, J.E., 5 98 Fugelsang, J., 1 3 , 1 07, 1 1 7, 1 5 4, 3 24, 3 62, 3 80, 5 47,
Fiske, A.P., 671 5 73 , 5 74, 686, 71 0, 71 1 , 71 8, 73 0
Fiske, D.W., 696 Fujita, K., 61 0
Fiske, S.T., 297, 3 00, 3 03 , 3 04, 3 05 , 3 1 1 Fuligni, A., 663 , 671
Fitch, W., 646 Fulker, D.W., 760
Flanagan, O., 760 Funke, J., 729
Flanery, M.A., 47 Furnham, 5 5
Flavell, E.R., 5 47 Fuson, K., 5 81
Flavell, J., 671 Fuster, J.M., 461 , 462, 470
author index 81 1
Gaba, D.M., 729 Giacomini, E., 5 78

Gabriel, S., 669 Gibbon, J., 1 62, 5 63 , 5 65 , 5 66, 5 67, 5 68
Gabrieli, J.D.E., 441 , 469, 482, 71 8 Gibbs, B.J., 5 81
Gaeth, G.J., 246, 249, 25 9 Gick, M.L., 73 , 1 1 7, 1 22, 1 23 , 1 3 0, 209, 3 3 4, 3 3 5 , 790
Gagné, C.L., 5 0 Giere, R., 706, 708
Gagne, R.M., 3 92, 3 93 Gigerenzer, G., 1 75 –1 77, 1 98, 270, 274, 278, 279, 485
Gainen, J., 785 Gilbert, D.T., 25 8, 25 9, 267, 268, 273 , 3 01
Galanter, E., 5 3 6 Gildea, P., 1 21
Galiliei, G., 706 Gilhooly, K.J., 464
Galison, P., 706 Gilinsky, A.S., 1 80, 5 98
Gall, Franz Joseph, 477 Gill, R., 668
Gallimore, R., 5 3 1 Gillan, J.E., 5 49, 61 1 , 61 3
Gallini, J.K., 227 Gillingham, M., 3 85
Gallistel, C.R., 1 46, 1 62, 3 25 , 5 47, 5 64, 5 66, 5 67, 5 68, Gillman, C.B., 5 69, 5 70
5 71 , 5 73 , 5 78, 5 79, 5 80, 5 81 , 608, 609, 647, 649, Gilly, M., 5 43
65 1 , 65 3 Gilmartin, K., 3 72
Galton, F., 75 2 Gilmore, G.C., 1 8
Gambini, O., 5 1 1 Gilovich, T., 1 75 –1 77, 1 79
Gandhi, M., 3 60 Giner-Sorolla, R., 299
Garavan, H., 5 09 Girgus, J.S., 222
Garber, P., 608 Girotto, V., 1 78, 1 93 , 1 97, 1 98, 202, 5 43
Garcı́a-Madruga, J.A., 1 93 , 1 94 Giunti, M., 477
Gardenfors, P., 1 7 Gladwin, T., 761
Gardner, H., 3 5 1 , 3 60, 3 61 , 3 62, 762, 764, Glahn, D.C., 5 09, 5 1 2
793 Glaser, R., 1 5 , 1 1 9, 3 25 , 3 3 5 , 3 3 7, 3 75 , 3 76, 3 84, 73 5 ,
Gardner, W.L., 669 789
Garner, W.R., 1 8, 26, 3 1 , 75 Gleitman, 3 9, 221
Garnham, A., 1 87, 480 Gleitman, H., 5 0, 640, 645
Garton, A.F., 5 3 1 Gleitman, L.R., 5 0, 63 7, 63 9, 640, 643 , 645 , 647, 65 0,
Gaskins, S., 642, 643 , 644–646 65 2, 65 4
Gasperoni, T.L., 5 1 3 Glenberg, A.M., 1 87
Gastel, J., 3 5 7 Glick, J., 666, 667, 676, 761
Gati, I., 1 8, 1 9, 20, 21 Glickman, I., 5 48
Gattis, M., 226 Glicksohn, J., 224
Gauthier, I., 1 4 Glisky, E.L., 43 1 , 441
Gay, J., 666, 676, 761 Globerson, 744
Gazzaniga, M., 71 8 Glover, G., 71 8
Gazzinga, M.S., 3 91 Gluck, M.A., 1 4, 23 , 1 00, 41 6
Geary, D.C., 5 74 Glucksberg, S., 42, 43 , 1 21
Gelfand, S., 445 Glushko, 21 3
Gelman, L.A., 784 Glymour, C., 98, 1 09, 1 1 3 , 1 44, 1 5 3
Gelman, R., 5 3 0, 5 3 2, 5 40, 5 41 , 5 44, 5 47, 5 64, 5 71 , Gobet, F., 3 3 7
5 73 , 5 78, 5 79, 5 80, 5 81 , 608, 63 5 , 649, 65 0, Gochfeld, D., 1 3 4
671 Goddard, M.J., 5 03
Gelman, S.A., 28, 5 3 , 5 6, 5 7, 5 9, 60, 63 , 1 01 , 1 02, 1 03 , Goel, V., 1 20, 1 3 6, 1 80, 425 , 468, 476, 477, 479, 481 ,
1 05 , 1 09, 1 1 3 , 1 46, 21 7, 3 25 , 5 45 , 5 46, 71 0 482, 483 , 484, 485 , 486, 487, 488, 5 02, 71 6, 71 7,
Gennari, S., 5 4, 63 7, 646, 65 5 71 8, 75 9
Gentner, D., 1 87, 228, 409, 465 , 5 41 , 5 42, 63 6, 642, Goetz, E.T., 3 24
643 , 65 4, 71 3 , 71 4, 73 3 Golann, S.E., 3 5 8
Gentner, D.R., 1 21 Gold, B.T., 479, 498, 71 8
George, C., 1 80 Goldberg, A., 65 2
Georgen, K. Goldberg, T.E., 5 06, 5 07
Gergely, G., 5 41 Goldin-Meadow, S., 21 8, 3 40, 63 6, 65 3 , 65 4
Gerken, L., 43 1 , 441 Goldman-Rakic, P.S., 5 00, 5 1 2, 5 3 7
Gernsbacher, M.A., 1 87 Goldmark, J., 693
Gero, B., 23 1 Goldschmidt, G., 23 1
Gertzog, W.A., 3 80 Goldstein, D., 248, 270
Getman, J.G., 693 Goldstein, W.M., 243
Getzels, J.W., 3 5 5 , 3 62 Goldstone, R.L., 20, 24, 25 , 29, 41 , 47, 49, 73 , 78, 96,
Ghiselin, B., 5 6, 3 5 2 99, 1 1 7, 221 , 666, 73 3
Gholson, B., 5 42 Goldvarg, Y., 1 5 9, 1 61 , 1 94, 1 95 , 201
81 2 author index
Goleman, D., 776 Grether, D., 25 3 , 25 9

Golledge, R.G., 21 1 , 21 8 Grewal, S.K., 1 28
Gollub, L.R., 5 68 Grey, T.C., 688, 689
Gollwitzer, P.M., 296, 297, 3 00 Grice, P., 63 7
Gomez, R.L., 43 1 , 441 , 444 Grieve, R., 5 46
Gonzalez, M., 1 98 Griffin, D., 1 75 –1 77, 1 79, 25 3 , 25 5 , 285
Gonzalez, R., 245 Griffin, J., 1 8
Good, R., 3 87 Griffin, S.A., 784
Goode, A., 444 Griffiths, T.L., 1 4, 1 45 , 1 5 8, 1 5 9, 1 64
Goodhew, A., 5 49 Griggs, R.A., 1 75 –1 76, 1 77, 1 93 , 485
Goodlad, J.I., 776 Grigorenko, E.L., 668, 75 4, 762, 763 , 765 , 766, 782
Goodman, N., 1 4, 29, 3 0, 97, 98, 1 01 , 1 02, 1 06, 793 Grinstead, J., 65 0
Goodnow, J.J., 4, 3 9, 706 Grober, E., 5 4
Goodrich, H., 787 Groen, G.J., 729, 73 1 , 73 2, 73 6, 741
Goodstein, L.P., 729 Gross, S., 699
Goodwin, G., 1 97 Gross, S.R., 697
Gopnik, A., 5 8, 5 9, 1 08, 1 09, 1 44, 1 49, 1 5 0, 1 5 3 , 1 5 9, Grosset, N., 5 42
1 65 , 3 74, 5 3 0, 5 47 Grossman, J.B., 760
Gordon, A.C., 5 47 Grossman, L.S., 5 1 6
Gordon, I.E., 21 2 Grotzer, T.A., 765 , 776, 777, 780
Gordon, P.C., 3 5 3 , 441 Grove, W.M., 494, 496, 499, 5 06, 5 09, 5 1 6
Gordon, R.W., 689 Gruber, H.E., 3 60
Gorman, A., 708, 709 Grupe, L.A., 5 3 6
Gorry, G.A., 728 Guidice, S., 5 3 7
Goswami, U., 1 24, 5 41 Guidubaldi, J., 766
Gottesman, I., 5 1 0 Guilford, J.P., 3 5 4, 3 5 5 , 3 62, 75 4, 782
Gottlieb, G., 766 Gumperz, J., 63 6
Gough, H.G., 3 5 8 Gunter, R., 5 74
Gould, S.J., 278 Gur, R.E., 5 09
Gould, T.D., 5 1 0 Gureckis, T.M., 46
Gourville, J.T., 25 9 Gustafson, S.B., 3 60
Graber, M., 5 3 5 Gutheil, G., 5 8, 1 03
Grace, A.A., 5 02, 5 05 , 5 1 1 Guthormsen, A., 3 42
Graesser, A., 3 77, 3 88 Gutiérrez, F., 1 93
Graf, P., 43 9 Gzesh, S.M., 5 3 4, 5 48
Grafman, J., 479, 71 7
Graham, D.J., 5 74 Ha, Y., 709
Graham, M., 3 60 Ha, Y-W., 1 01
Graham, S.A., 65 4 Hacking, I., 1 02, 1 07, 1 97
Grahame, N.J., 1 48 Hadjichristidis, C., 1 08
Granato, L., 776 Hagen, E.P., 75 3
Grant, E., 5 1 3 Hagmayer, Y., 1 63
Grantham-McGregor, S., 766 Hahn, E.D., 3 08
Grassi, F., 71 8 Hahn, U., 1 5 , 26, 27, 1 04, 1 08, 5 48
Gray, J.A., 5 03 , 5 04, 5 05 , 5 07, 5 09, 5 1 1 Haider, H., 3 43
Gray, R., 404, 469 Haidt, J., 447
Gray, W.D., 40, 5 45 Haier, R.J., 75 9
Graziano, W., 3 03 Haith. M.M., 441
Green, A., 71 0 Halberstadt, J., 29
Green, B., 1 87, 3 87, 71 5 Halford, G.S., 73 , 74, 77, 80, 81 , 82, 83 , 1 24, 1 28, 1 3 2,
Green, C.B., 87 1 3 7, 1 87, 1 93 , 3 83 , 3 85 , 442, 45 7, 464, 468, 5 3 0,
Green, F.L., 5 47 5 3 1 , 5 3 2, 5 3 6, 5 3 7, 5 3 9, 5 40, 5 41 , 5 42, 5 43 , 5 44,
Green, T.A., 691 , 693 5 45 , 5 46, 5 47, 5 48, 61 3 , 61 8, 783
Greenberg, J., 296, 297, 299, 3 04, 43 1 Hallberg, K.I., 5 66
Greene, T.R., 5 46 Halle, M., 21
Greenfield, P.M., 1 06, 3 1 2, 3 91 , 5 3 0, 5 3 1 , 663 , 664, Hallet, M., 479
666, 667, 668, 669, 670, 671 , 672, 677, 760, 776, Halloun, I.A., 3 87
792 Ham, W., 221
Greeno, J.G., 3 23 , 3 28, 3 29, 3 3 0, 5 80, 793 Hamblin, C.L., 781
Greenwald, A.G., 43 7 Hamers, J.H., 780
Gregory, R.L., 1 07 Hammer, D., 226
author index 81 3
Hammond, K.R., 267, 268, 270 Henneman, M.C., 622

Hampton, J.A., 3 9, 43 , 5 0, 5 7, 1 05 , 3 84 Hennessey, B.A., 3 5 1 , 3 5 8
Han, C., 648 Hermer, L., 65 1
Hanauer, J.B., 5 48 Hermer-Vasquez, L., 65 1
Handley, S.J., 1 70, 1 72, 1 73 , 1 79, 1 92, 1 94, 1 97, 487 Hernstein, R.J., 5 67, 766, 779
Hannan, M.B., 61 7 Herr, E.L., 3 5 5
Hansen, E.K., 71 7 Herrmann, D.J., 3 3 8, 781 , 789
Hanson, J., 694, 700 Herrnstein, R.J., 766
Hardcastle, V.G., 760 Hersh, H., 1 8
Hardiman, P.T., 29, 3 3 7, 73 5 Hershey, J.C., 246, 248
Hare, B., 625 , 65 3 Hertwig, R., 274, 279
Harlow, H.F., 608 Hertz-Pannier, L., 440
Harmon-Jones, E., 3 80 Hespos, S., 644
Harnishfeger, K.K., 469 Hesse, M., 1 1 8, 1 21
Harper, C.N., 1 70, 1 79, 1 92, 487 Hestenes, D., 3 87
Harper, D.G., 5 68 Hewson, M.G.A., 3 80
Harrington, D.M., 3 5 4, 3 5 8 Hewson, P.W., 3 80
Harris, P.L., 5 3 0, 5 43 , 672 Hicks, J.L., 41 2, 446
Harrison, P.J., 5 1 3 Higgins, E.T., 25 8, 271 , 296, 3 03 , 3 06, 3 07, 3 08, 3 09,
Harrow, M., 495 , 5 08, 5 1 6 3 1 0, 3 1 1 , 776
Hart, H.L.A., 201 Hilton, D.J., 277
Hartnett, P., 5 81 Hinrichs, J.V., 5 69
Harvey, 5 09 Hinton, G.E., 79, 1 3 4, 3 75
Harvey, J.H., 776 Hintzman, D.L., 43 , 62
Hasen, J.S., 3 5 5 Hirschfeld, L.A., 5 6, 1 1 3 , 784
Hasher, L., 469 Hirshberg, C., 706
Hashiya, K., 61 5 Hirt, E.R., 299
Hasson, U., 1 94 Hirtle, S.C., 222, 3 3 7
Hastie, R., 5 1 , 5 7, 1 09, 243 , 5 46, 697, 740 Hitch, G., 41 2, 45 8, 45 9, 460, 463 , 464
Hatano, 5 9 Hitler, A., 1 25 , 1 26
Haun, D., 647 Hobbes, T., 3
Hauser, M.D., 5 66, 5 81 , 61 7, 620, 621 , 623 , 624, 65 3 Hobson, R.P., 671
Hausman, C.R., 3 5 2 Hochberg, J., 222
Hausmann, R.G., 3 83 Hodges, A., 5 1 3
Hayes, B.K., 5 49 Hodgson, H.L., 3 00
Hayes, J.R., 3 3 2, 3 3 3 , 3 40, 3 5 5 Hodkin, B., 5 46
Hayes, P.J., 1 87 Hoepfner, R., 782
Hayes-Roth, B., 23 Hoffman, R.R., 3 72
Hayes-Roth, F., 23 Hoffmann, 729
Hayman, C.A.G., 43 9 Hoffrage, U., 1 98, 279
Hayward, S., 21 2 Hofstadter, D., 28, 1 28, 1 3 1 , 3 73 , 409, 5 41
Hazeltine, E., 21 4, 21 6 Hoftstede, G., 670
Hazen, K., 75 9 Hogan, H.P., 221
Healy, S.D., 489 Hogarth, R.M., 1 63 , 201 , 243 , 25 9
Heath, C., 246, 25 6, 25 7 Holcomb, P.I., 5 08
Hebb, D.O., 462 Holding, D.H., 3 26
Heckers, S., 5 1 2 Holender, D., 43 7, 43 8
Hedges, L.V., 223 Holland, J.H., 1 3 , 74, 77, 1 22, 1 46, 1 87, 5 40
Hegarty, M., 1 87, 21 6, 228, 3 3 1 , 71 5 Holland, R.W., 25 7
Hegel, M.T., 5 66, 61 7 Hollander, M.A., 5 3
Heider, E., 63 9 Holman, P.S., 5 08
Heider, F., 21 7, 271 , 3 80 Holmes, O.W., 686, 689
Heine, S.J., 3 1 2 Holton, D.L., 21 5
Heiser, J., 209, 228, 23 0 Holtzberg, A.D., 3 04
Heit, E., 1 3 , 1 01 , 1 04, 1 07, 1 08, 1 1 0, 1 1 2, 5 48, 5 49, 71 2 Holyoak, K.J., 1 3 , 1 4, 1 5 , 24, 25 , 26, 73 , 74–88, 98,
Heivly, 224 1 1 7, 1 20, 1 22, 1 23 , 1 24, 1 25 , 1 26, 1 28, 1 29, 1 3 0, 1 3 1 ,
Hell, W., 279 1 3 4, 1 3 6, 1 45 , 1 46, 1 49, 1 5 1 , 1 5 3 , 1 73 , 1 75 –1 77, 1 78,
Hemmerich, J., 3 75 , 3 83 1 87, 209, 221 , 223 , 225 , 226, 23 0, 3 24, 3 3 4, 3 3 5 ,
Hempel, C., 96, 97, 1 07, 1 08 3 3 6, 3 5 6–3 5 7, 3 74, 3 91 , 409, 41 0, 41 3 , 441 , 465 ,
Hemsley, D.R., 5 03 , 5 04, 5 09 469, 485 , 5 3 7, 5 40, 5 41 , 5 42, 5 89, 608, 688, 705 ,
Henle, M., 1 69, 475 709, 71 3 , 71 4, 71 7, 73 3 , 73 4, 742, 75 8, 789, 790
81 4 author index
Holzman, P.S. Indefrey, P., 498, 499, 5 06, 5 07

Homa, D., 46 Inhelder, B., 1 69, 3 85 , 5 29, 5 46, 61 8, 668,
Honda, M., 71 7 783
Honey, R.C.W., 5 1 2 Inoue, K., 3 42
Hong, Y.Y., 673 Intrator, N., 74, 84
Honoré, A.M., 201 Intraub, H., 222
Hood, B.M., 623 Irwin, M.H., 666
Hood, L., 671 Isen, A.M., 25 8, 268
Hooper, F.H.E. Iversen, I.H., 608, 609
Hopkins, W.D., 61 6 Iverson, J., 21 8
Horgan, D.D., 1 7 Ivry, R., 447
Horn, J.M., 766 Iyenger, S.S., 25 0
Horn, L., 5 07 Iyer, G., 21 7, 228
Horn, R.E., 23 0
Horton, J.J., 5 46 Jackendoff, R., 63 , 1 3 7, 5 3 7–5 47, 65 2
Horwitz, M.J., 691 , 693 Jackson, P.W., 3 5 5
Houle, S., 479, 71 8 Jackson, W.J., 61 0
Houser, N., 243 Jacobson, K., 63 5
Houston, D., 440 Jacoby, L.L., 271 , 43 8, 446, 447
Howard, D.V., 442 Jacques, S., 5 3 8
Howard, J.H., 222, 442, 444 Jakobson, R., 21
Howe, M.J., 5 40 James, W., 28, 45 7, 462
Hsaio, A., 441 Jameson, D., 63 9
Hsee, C.K., 246, 25 3 , 25 8, 25 9, 3 04 Janis, I., 776
Hu, J.M., 5 69 Jansen, B.R., 5 48
Hubel, D.H., 21 Jansma, J.M., 5 05
Huber, J., 25 0 Jarvin, L., 764
Hug, K., 1 77, 485 Jarvis, W.B.G., 3 00, 786
Hughes, M., 5 46 Jay, E., 785 , 787, 792, 793
Hull, C., 5 6, 41 5 Jenkins, E.A., 5 3 7
Hume, D., 3 , 95 , 96, 97, 98, 1 45 , 1 46 Jenkins, H., 1 46, 1 47, 1 5 3 , 1 5 6
Hummel, J.E., 24, 26, 27, 46, 73 , 74–88, 1 21 , 1 25 , 1 28, Jenkins, J.J., 761
1 29, 1 3 1 , 1 3 4, 1 3 6, 1 5 3 , 21 0, 23 0, 409, 41 0, 45 7, 461 , Jenkins, J.M., 5 47
469, 5 3 2, 5 3 6, 5 41 , 61 0, 75 1 Jensen, A.R., 75 4, 75 5 , 75 6, 782
Humphrey, G., 5 3 6 Jensen, E.M., 5 77, 5 78
Humphreys, 60 Jensen, I., 5 46
Hunn, 5 9 Jeong, H., 3 85
Hunt, E.B., 1 1 8, 75 5 , 75 6 Jepson, D.H., 1 07
Hunt, R.M., 1 87 Jerison, H.J., 75 9
Hunter, J.E., 789 Jessell, T.M., 477
Huntley-Fenner, G., 5 81 Jeste, D.V., 5 1 3
Hur, T., 3 08 Jeziorski, M., 71 3
Hurewitz, F., 65 0 Ji, L., 674
Hurley, S.M., 3 3 5 Jitsumori, M., 61 0
Hurt, C.A., 43 2 Job, 60
Hurvich, L.M., 63 9 Johannson, G., 21 7
Hussein, S., 1 26 Johansen, M.J., 46, 47
Hutcheson, J.C., 691 Johansen, M.K., 41 7
Hutchins, E., 3 72, 3 84 Johnson, A., 3 85 , 3 91
Hutchinson, J.W., 1 9, 65 3 Johnson, C., 46, 60, 1 09
Huttenlocher, J., 21 7, 220, 223 , 5 1 3 , 5 80 Johnson, D.M., 40, 5 64
Huttunen, M.O., 494, 5 1 1 , 5 1 2, 5 1 3 , 5 1 6 Johnson, E.J., 248, 25 2, 25 3 , 25 7, 25 8, 260, 272,
Hymes, C., 3 09 3 27
Johnson, K.E., 5 46
Iacoboni, M., 671 Johnson, L.B., 1 25
Ichikawa, S., 1 97 Johnson, M., 1 20
Iden, C.M., 5 66, 61 8 Johnson, M.D., 5 45
Idol, L.J., 780 Johnson, M.H., 5 3 5 , 5 45
Idson, L.C., 3 07, 3 09 Johnson, P.E., 729
Imai, S., 27, 642, 643 Johnson, T.R., 73 8
Inagaki, 5 9, 60 Johnson, W.A., 3 3 7, 43 6
author index 81 5
Johnson-Laird, P., 4, 1 02, 1 5 9, 1 61 , 1 70, 1 72, 1 73 , Kaufman, J.C., 3 5 8, 3 63 , 75 2, 760

1 74–1 76, 1 79, 1 85 , 1 86, 1 87, 1 88, 1 89, 1 90, 1 91 , 1 92, Kaufman, L., 21 7
1 93 , 1 94, 1 95 , 1 96, 1 97, 1 99, 200, 201 , 202, 209, 21 1 , Kaufman, N.L., 5 89
21 2, 221 , 227, 228, 3 5 7, 3 62, 464, 476, 478, 480, Kawai, N., 61 5
487, 5 41 , 5 42, 73 8 Kay, P., 63 9, 65 2, 65 5
Johnston, M.H., 5 08 Keane, M.M., 5 0, 76, 1 28, 1 3 1 , 441
Johnstone, E.C., 5 1 3 Keane, M.T., 1 23 , 3 84, 409
Johnstone, M.H., 495 Keating, D., 5 74
Johnstone, T., 444 Keating, D.P., 761
Jones, C.M., 489 Keats, D.M., 668
Jones, G., 3 44 Keeble, S., 5 41 , 5 48, 63 5
Jones, G.V., 1 00, 1 94, 203 Keegan, R.T., 3 75
Jones, R.M., 405 , 41 8, 425 Keele, S.W., 44, 45 , 47, 447
Jones, S.S., 29, 63 , 644 Keenan, E., 65 0, 65 4
Jones, T., 5 47 Keenan, T., 5 47
Jonides, J., 222, 41 6, 466, 482 Keeney, R.L., 243
Joram, E., 3 92 Keil, F.C., 1 4, 28, 3 0, 46, 5 6, 5 9, 60, 61 , 1 08, 3 83 , 5 40,
Joseph, G.M., 73 0, 73 1 5 41 , 71 5
Josephs, R.A., 3 08 Keith, J.R., 5 3 7
Joshi, A., 65 3 Keller, C.M., 3 72
Joshua, S., 3 87 Keller, H., 63 3 , 663 , 670, 671
Judd, B.B., 1 80, 5 98 Keller, J.D., 3 72
Judson, A.J., 445 Keller, J.V., 5 68
Jung-Beeman, M., 3 43 Kelley, H.H., 1 47, 297, 3 07
Junn, E.N., 1 23 , 5 42 Kellman, P.J., 5 9, 63 5
Jusczyk, P., 63 8 Kelly, M.E., 5 44
Juslin, 45 Kelman, H.C., 3 81
Just, M.A., 45 , 1 1 8, 3 3 1 , 404, 405 , 41 2, 425 , 466, 468 Keltner, D., 25 8
Kelvin, 1 86
Kahn, R.S., 5 05 Kemler, D.G., 3 0
Kahneman, D., 4, 1 00, 1 02, 1 1 1 , 1 75 –1 77, 1 80, 1 97, Kemmelmeier, M., 664
1 98, 1 99, 202, 243 , 244, 245 , 246, 247, 248, 25 2, Kemp, C., 1 1 1
25 3 , 25 4, 25 5 , 25 6, 25 9, 267, 268, 269, 270, 271 , Kemper, K., 3 80
272, 274, 275 , 276, 277, 278, 279, 282, 283 , 284, Kempton, W.
285 , 286, 287, 3 01 , 476, 5 41 , 5 81 , 694, 73 9, 775 Kennedy, D., 688, 689, 695 , 698
Kaiser, M.K., 21 7 Kennedy, J.F., 1 25
Kalish, C., 5 7, 1 46, 71 0 Kennedy, P.T., 1 3 0, 3 3 5
Kallick, B., 793 Kenney, P., 405 , 425
Kallio, K.D., 5 44 Kenrick, D.T., 5 41
Kamil, A.C., 61 3 Kerns, J.G., 5 06, 5 07, 5 1 0
Kamin, L.J., 1 48 Kershaw, T.C., 3 40, 3 42
Kamp, H., 1 87 Kerst, S.M., 222
Kandel, E.R., 477 Kestelhoof, L., 666
Kane, M.J., 1 3 0, 466, 467, 469, 5 02, 5 1 2, 5 1 6, 5 42 Ketron, J.L., 75 2, 780
Kanerva, P., 81 , 83 Keysar, B., 1 21
Kant, I., 3 , 1 45 , 1 5 3 Khong, Y.F., 1 25
Kanwisher, N., 5 78 Kieras, D.E., 404
Kaplan, B., 3 5 3 , 3 62 Kihlstrom, J.F., 43 2, 43 6, 445 , 446, 75 2
Kaprio, J., 5 1 1 Killeen, P.R., 5 65
Kapur, S., 479, 71 8 Kim, 5 4
Karmiloff-Smith, A., 5 3 6 Kim, J.J., 73 , 21 9
Karttunen, L., Kim, U., 669
Kassirer, J.P., 73 0, 73 1 , 73 3 , 73 8 Kindfield, A.C.H., 3 3 7
Kassleer, M.A., 3 88 King, A., 5 67, 5 78
Katsnelson, A., 65 1 King, J.E., 61 0, 61 2
Katz, J.J., 21 , 286 Kintsch, W., 21 2, 3 77, 462
Katz, L., 1 5 1 Kipling, R., 3 5 2
Kaufman, A.S., 5 89, 75 2 Kipp, K., 760
Kaufman, D.R., 729, 73 1 , 73 2, 73 6, 73 7, 73 8, 741 , Kirby, K.N., 464
742, 764, 777 Kircher, T.T., 5 09, 5 1 5
Kaufman, E.L., 5 77, 5 78 Kirsner, K., 443
81 6 author index
Kishner, D., 789 Kraik, K., 4

Kita, S., 647 Kralik, J.D., 620, 623
Kitayama, S., 669, 671 Krantz, D.H., 1 07, 268, 279, 5 76
Kitcher, P., 3 75 Krauss, R.M., 21 8
Kite, K., 3 80 Krawczyk, D.C., 76, 1 28
Klahr, D., 1 01 , 403 , 5 3 1 , 5 3 2, 5 48, 5 78, 706, 708, 709, Kreps, D.M., 694
721 Kreuz, R.J., 3 88
Klatt, L., 5 81 Kriegler, S., 778
Klatzky, R.L., 21 1 , 21 8 Krifka, M., 5 3
Klauer, K.C., 1 79, 464 Kris, E., 3 5 3
Klayman, J., 1 01 , 709 Kroger, J.K., 1 1 9, 1 94, 461 , 71 7
Klein, D.E., 5 4 Kronzon, S., 280
Klein, G., 3 75 , 789 Krosnick, J.A., 279
Klein, G.A., 729, 741 Kruglanski, A.W., 296, 297, 299, 3 00, 3 01 , 3 02, 3 03 ,
Klein, W.M., 3 00, 3 05 3 04, 3 05 , 3 1 1 , 3 1 2, 786
Kleinmuntz, D.N., 743 Krumhansl, C.L., 1 9, 29
Klem, A., 3 01 Kruschke, J.K., 1 4, 46
Kloesel, C., 21 1 Kubie, L.S., 3 5 3
Kloo, D., 5 47 Kubose, T.T., 76
Klopfer, D., 3 75 Kugelmass, S., 226
Kluckhohn, F., 663 Kühberger, A., 246
Knapp, A.G., 46 Kuhl, P., 63 8, 65 5
Knauff, M., 1 88 Kuhlmeier, V.A., 61 2, 61 3
Knetsch, J.L., 247, 248, 25 9, 283 Kuhn, D., 5 43 , 5 48
Knoblich, G., 3 43 , 3 44, 445 Kuhn, T.S., 1 09, 3 62, 3 80, 71 5
Knowlton, B.J., 47, 43 4, 43 9, 440, 441 , 444 Kuipers, B., 73 7, 73 8
Koehler, J.J., 1 74–1 76, 278, 287 Kulkarni, D., 708
Koenig, M.A., 5 9, 469, 482 Kunda, Z., 5 1 , 1 07, 296, 297, 298, 299, 3 00, 3 04, 3 05
Koeppe, R.A., 482 Kunreuther, H., 248
Koeske, R.D., 3 75 , 3 77, 3 82 Kunreuther, H.C., 25 9
Koestler, A., 3 62 Kunst-Wilson, W.R., 43 7
Kogan, N., 3 5 5 , 3 5 6 Kunzle, D., 23 0
Koh, K., 1 4, 1 22, 1 23 Kuperberg, G.R., 5 08
Köhler, W., 4, 1 86, 445 , 607, 608, 664, 695 Kushnir, T., 1 09, 1 44
Kohn, L.T., 73 8, 740 Kuyshniruk, A.W., 729
Kohonen, T., 1 4 Kyllonen, P.C., 466, 5 95 , 75 2
Koizumi, H., 63 8
Kojima, T., 61 0, 61 5 LaCasse, L., 75 9
Kokinov, B.N., 1 28, 409, 71 3 Laeng, B., 482
Kolers, P.A., 1 4 Lagnado, D., 48, 1 02, 1 1 3 , 1 1 7, 1 22, 1 44, 1 5 9, 1 69, 1 98,
Kolodner, J.L., 1 21 221 , 3 06, 3 24, 3 76, 5 89, 705 , 71 2, 73 0, 776
Kolpakowski, T., 226 Laird, J.E., 404, 405 , 425
Konner, M., 675 Lakoff, G., 1 09, 1 20
Koomen, W., 298 Lally, M., 75 5
Kopelman, R.I., 73 3 Lalonde, C., 674
Kopko, K.A., 678 Lamb, M., 668
Kopp, R., 287 Lamberts, K., 1 4, 45 , 41 6
Kornblith, H., 1 1 3 Lamon, M., 777, 783 , 788, 790
Korsakoff, 43 9 Lanca, M., 21 9
Koskenvuo, M., 5 1 1 Landau, B., 63 , 643 , 644, 645 , 646, 65 4, 65 5
Koslowski, B., 1 3 1 Landauer, T.K., 5 2, 78, 3 72, 5 69, 5 76, 61 5
Koss, F.V., 405 , 425 Lanford, J.C., 1 6
Kosslyn, S.M., 21 1 , 21 2, 21 3 , 21 5 , 21 6, 227, 464, 482 Lang, B., 5 47
Kotovsky, L., 24, 1 3 0, 1 3 6, 3 3 2, 446 Langdell, C.C., 688
Kounios, J., 498 Langdon, D., 486
Kovacic, D., 63 8 Lange, T.E., 76, 1 23
Kovacs, I., 5 00 Langer, E.J., 281 , 286, 776, 785
Kovelman, J.A., 5 1 3 Langley, P., 3 5 2, 3 5 6, 3 5 7, 3 62, 71 9
Kozhenikov, M., 21 6, 71 5 Langston, C., 73
Kozlowski, L.T., 21 7 Lansman, M., 75 6
Krafit, D., 25 1 Larkin, J., 73 2, 743
author index 81 7
Larkin, J.H., 227, 23 2, 75 8 Lewin, K., 295 , 3 08, 695

Larkin, M.J., 1 49 Lewis, C.I., 1 61
Larsen, A., 21 3 Lewis, M.W., 3 3 5 , 73 5
Larson, G.E., 75 9 Li, F., 279
Laruelle, M., 5 02 Li, M., 27
Lashley, K., 477, 75 8 Li, P., 645 , 647, 648
Lassaline, M.E., 26, 1 29 Liben, L.S., 223
Lasswell, H., 691 Liberman, A.M., 299, 3 07, 63 8
Latham, P.E., 5 67, 5 73 Licht, P., 3 87
Lau, A., 1 28 Lichtenberger, E.O., 75 2
Lauriola, M., 25 9 Lichtenstein, M., 3 03
Lave, J., 789, 790 Lichtenstein, S., 244, 247, 25 3
Lavoiser, 3 90 Liddle, P.F., 495
Law, K., 1 3 3 Lidz, J., 640, 65 2
Lawrie, S.M., 5 1 3 Lieberman, M.D., 447
Lawson, A., 3 87 Lien, Y.W., 1 46, 1 61
Lazar, I, 765 Light, P., 5 43
Le Bihan, D., 5 78 Lightbown, P., 671
Le Unes, A., 1 7 Liittschwager, J.C., 5 8
Lea, G., 707 Lillard, A., 671 , 672
Leaper, D.J., 728 Limon, M., 3 87
Lebiere, C., 404, 406, 425 , 43 3 Limongelli, L., 622, 623
LeBoeuf, R.A., 25 2, 25 8, 25 9, 727, 740, 776 Lindblom, B., 63 8
Lecas, J.F., 1 90, 5 42, 5 43 Lindem, K., 1 87
Lecissi, M., 742 Lindenberger, U., 5 3 1
Ledgeway, T., 409 Lindsay, D.S., 447
Ledley, R.S., 728 Linn, M.C., 21 6
Lee, A., 669 Lipman, M., 779, 787, 792
Lee, M.D., 21 , 22 Lippa, Y., 47
Lee, P.U., 220 Lipsett, L.P., 61 0
Leed, D., 3 81 Lipshitz, 741
Leevers, H.J., 5 43 Lipton, J.S., 5 80, 5 81
Legrenzi, M., 1 97 Lipton, P., 98
Legrenzi, P., 1 97, 202 Lisle, D.J., 25 1
Lehman, D.R., 1 48, 1 5 3 , 279, 3 1 2 Litchfield, M., 71 4
Lehtinen, E., 3 83 Litman, L., 1 79, 1 99, 447, 61 2
Leigh, J., 3 85 Little, T.D., 5 74
Leitch, E., 5 46 Llewellyn, K.N., 687, 689, 691 , 692, 694, 696
Lempert, R.O., 697 Lloyd, 224
Lennenberg, E., 63 9 Lloyd, P., 5 3 1
Leon, M.I., 5 68 Lo, Y., 1 02, 1 08, 1 09, 5 48
Leonard, G.K., 283 Lober, K., 1 5 7
Leone, T.J., 720 Lochhead, 779
Lepper, M.R., 25 0, 298, 3 93 Locke, J., 442
Leprohon, J., 729 Lockhart, L.K., 298
Lerner, J.S., 25 8, 3 00, 3 02, 3 05 Lockhart, R.S., 75 2
Lesgold, A., 3 75 Loehlin, J.C., 766
Leslie, A.M., 61 , 5 41 , 5 47, 5 48, 63 5 Loewenstein, G.F., 25 3 , 25 6, 25 7, 25 8, 25 9, 287
Leube, D.T., 5 09 Loewenstein, J., 1 3 0, 65 4
Leuenberger, A., 3 00 Loftus, E.F., 41 , 42, 249
Levelt, W.J.M., 494, 496, 498, 499, 5 06, 5 07, 5 09, Logan, G., 41 6
641 , 65 1 Logan, J., 63 8
Levi, A., 296 Logie, R.H., 45 9, 460
Levidow, B.B., 71 4 Lohman, D.F., 75 7
Levin, I.P., 246, 249, 25 9 Lohr, J.B., 5 1 3
Levin, J.A., 3 84 Lombardo, V., 1 92
Levine, M., 3 9, 41 5 Long, C., 1 3 0
LeVine, R.A., 672 Lonnqvist, J., 5 1 1
Levine, S.C., 5 80 Loomis, J.M., 21 8
Levinson, S.C., 221 , 63 6, 647, 65 2 Lopes, L.L., 260
Lewicki, P., 441 López, A., 1 02, 1 03 , 1 08
81 8 author index
Lopez, D.F., 299 Magyari-Beck, I., 3 5 8, 3 5 9

Lopez, F.J., 1 5 2 Mah, W.A., 223
Lord, C.G., 298 Maher, B.A., 495 , 498, 5 04, 5 08, 5 1 2
LoSchiavo, F.M., 3 00 Maheswaran, D., 3 02
Loukas, E., 61 0, 61 3 Maier, N., 3 3 1 , 3 42, 445
Loula, F., 21 7 Mainwaring, S., 220
Lourenco, O., 5 3 0 Maki, A., 63 8
Lovallo, D., 25 5 Maki, R.H., 222
Love, B.C., 24, 46, 47, 5 6, 84, 1 08, 1 09 Makin, V.S., 47
Lovett, M.C., 76, 77, 1 3 7, 3 27, 3 71 , 3 72, 407, 41 3 , Malenka, R.C., 5 09
43 3 , 468, 470, 5 3 2, 75 1 Malone, P.S., 3 01
Lovibond, P.F., 1 49, 1 5 5 Malt, B.C., 45 , 46, 5 4, 5 7, 5 9, 1 1 3 , 63 7, 646, 65 5
Lowe, R.K., 228 Mandel, D.R., 1 48, 1 5 3
Lowell, E.L., 3 5 8 Mandel, N., 25 7
Lowell, P., 71 3 Mandler, G., 5 3 6, 5 71 , 5 78
Lozano, S., 23 0 Mandler, J.M., 1 07, 5 3 6, 5 45 , 63 5 , 644
Lu, L., 5 03 Manfredi, D., 1 21
Lubart, T., 201 , 3 5 1 , 3 5 4, 3 5 6, 3 5 8, 3 60, 3 61 , 3 62, Mangels, J.A., 47
777 Mangels, J.A.., 222
Lubow, R.E.M., 5 05 Manktelow, K.I., 1 69, 1 75 –1 77, 1 78, 1 94, 203
Lucariello, J., 671 Manschreck, T.C., 5 1 2
Lucas, M., 42 Manstead, A.S.R., 3 1 1
Luce, R.D., 5 76 Manza, L., 441 , 444
Luchins, A.S., 407 Manzano, M., 78
Lucy, J., 642, 643 , 644 Marcel, A.J., 43 7
Ludwig, A.M., 3 5 8 Marcus, G.F., 74, 77, 79, 80, 81 , 5 3 4, 5 3 5 , 5 3 6, 5 40
Lugar, L., 3 08 Mareschal, D., 5 3 2, 5 3 5 , 5 40
Lumsden, C.J., 3 62 Marini, Z., 5 3 1 , 5 47, 5 48
Lund, K., 78 Mark, T.A., 5 67
Lundgren, S.R., 3 1 1 Markman, A.B., 24, 25 , 26, 47, 5 9, 1 27, 1 28, 1 29, 3 3 0,
Lung, C-T., 3 42 3 73 , 465 , 5 41 , 71 4
Luo, Q., 1 1 9 Markman, E.M., 28, 1 01 , 1 02, 1 05 , 5 43 , 5 46, 65 3
Luria, A.R., 4, 5 29, 676 Markovits, H., 1 94, 201 , 5 42, 5 43 , 5 45
Lusted, L.B., 728 Marks, A., 71 7
Lyn, H., 671 Markus, H., 671
Lynch, E.B., 3 1 , 3 7, 3 9 Marr, D., 5 , 27, 1 09, 1 87, 477
Lynch, J.S., 1 74–1 76 Marsh, L., 5 1 2
Lyons, J., 5 3 Marsh, R.L., 41 2, 446
Marshall, S.P., 3 74
Macchi, L., 279 Martin, B., 21 7
MacDonald, C., 43 9 Martin, L.L., 269, 3 1 1 , 3 1 2
MacDonald, S.E., 609 Martin, S.A., 3 41
MacGregor, D.G., 25 8 Maslow, A., 3 5 8, 3 62
MacGregor, J.N., 3 27, 3 43 , 3 44 Massey, C., 645
Machado, A., 5 3 0 Matarazzo, J.D., 75 8
Machamer, P., 3 75 Mathalon, D.H., 5 05
Macho, S., 1 49, 1 5 9, 1 60 Mather, N., 5 90
MacKenzie, R., 23 0 Mathews, R.C., 43 4, 444
Mackie, J.L., 1 5 0, 1 60 Mathieson, A., 697
Mackiewicz, R., 1 93 Matsuda, F., 5 48
MacKinnon, D.W., 3 5 8 Matsuzawa, T., 5 68, 608, 609, 61 5
MacLeod, C.M., 75 6 Matthews, J., 5 46
Macomber, J., 63 5 Matute, H., 1 5 2
Macrae, C.N., 3 04 Mauro, R., 699
MacSwan, J., 65 0 Mawby, R.A., 5 43
Maddox, W.T., 46, 47 Maxwell, J.C., 1 86, 1 87
Madsen, M., 670 May, J., 1 47, 1 63
Maduro, R., 3 5 8 Maybery, M., 5 3 7
Magalhües, V.P., 762 Mayer, J.D., 75 2
Magder, S.A., 729, 73 8, 741 Mayer, R., 766
Magel, S.G., 223 Mayer, R.E., 227
author index 81 9
Mayes, L.C., 766 Meimeyer, R.A., 1 7

Maynard, A., 663 , 671 , 672 Melinder, 5 09
Mazuka, R., 642, 643 Mellers, B., 279, 280
Mazzoco, A., 1 93 Mellet, E., 482
McBeath, M.K., 21 7, 222 Meltzoff, A.N., 1 08, 1 09, 3 74
McCabe, A.E., 5 46 Melz, E.R., 1 3 4
McCabe, G.P., 3 08 Mendelsohn, G.A., 3 43
McCarrell, N.S., 3 24 Menninger, K., 5 61
McClelland, J.L., 60, 77, 78, 79, 1 3 4, 285 , 3 5 8, 3 75 , Menzel, E.W., 608
422, 43 3 , 5 3 2, 5 3 4, 5 3 5 , 5 3 6, 5 45 , 5 48 Merikle, P.M., 43 7
McClone, M.S., 1 21 Merrill, M.A., 75 3
McCloskey, M., 42, 43 , 61 , 1 87, 3 74, 3 87, 71 5 Mervis, C.B., 40, 44, 5 45 , 5 46
McCloud, S., 23 0 Merzback, U.C., 5 60
McCollam, K., 75 2 Meseguer, E., 1 97
McConnell-Ginet, S., 43 Mestre, J.P., 29, 3 3 7, 71 5 , 73 5
McCormack, 61 0 Meszaros, J., 248
McDermott, J., 75 8 Metzler, J., 1 88, 21 2
McDermott, L.C., 71 5 Meyer, A.S., 496
McDonald, H.E., 299 Meyer, D.E., 42, 404, 445
McDonald, J., 1 1 0 Meyer, M., 42, 1 87
McDonald, M., 5 3 1 Meyerowitz, J.A., 3 04
McDonald, R., 5 78 Michotte, A.E., 1 45 , 1 62, 1 63 , 21 7, 271
McDonough, L., 1 07, 644 Mill, J.S., 201 , 3 80
McEleney, A., 1 94 Millar, J.K., 5 1 3
McFadden, D., 283 Miller, D.T., 5 1 , 271 , 297
McGarrigle, J., 5 44, 5 46 Miller, E.K., 469, 5 02, 5 1 2
McGarry-Roberts, P.A., 75 9 Miller, G.A., 3 72, 45 8, 5 3 6
McGhie, A., 5 04 Miller, J.G., 65 5 , 673 , 674
McGinnies, E., 296 Miller, K.F., 5 74
McGonigle, B., 5 45 , 61 3 Miller, P.M., 25 9
McGrane, P.A., 765 Miller, R.A., 729
McGregor, S., 765 Miller, R.R., 1 47, 1 48, 1 5 1 , 1 5 2, 5 67
McGrew, K.S., 5 90 Miller, R.W., 98
McGuinness, C., 780 Miller, S.A., 5 3 0
McGuire, P.K., 5 08 Millis, K., 1 7
McGuire, W.J., 3 80 Millward, R., 26
McKeithen, K.B., 3 3 7, 3 3 8 Milner, B., 43 2, 43 9, 45 8
McKeough, A., 5 3 1 Minahan, M.F., 609
McKinley, S.C., 46, 41 6 Minda, J.P., 46, 47
McKoon, G., 1 23 Minoshima, S., 482
Mclain-Allen, B., 3 88 Minsky, M., 1 21 , 3 74
McLaughlin, D.H., 666 Minstrel, J., 3 87
McLaughlin, G.H., 5 3 1 Minzenberg, M.J., 5 06, 5 07
McLean, R.S., 783 Mires, J., 21 4, 21 6
McLin, D., 5 3 5 Mitchell, C.J., 1 49
McNamara, T.P., 223 Mitchell, M., 28, 1 28, 1 3 1 , 409
McNeil, B.J., 25 9 Mitchell, R., 282, 75 6
McNeil, T.F., 5 1 4 Mitroff, I., 71 0
McNemar, Q., 3 5 5 Mix, K., 5 80, 5 81
McRoberts, G., 63 8 Miyake, N., 3 82, 41 2, 469
Means, M.L., 3 77 Modgil, S., 5 3 0
Mechner, F., 5 64 Moher, T., 3 85
Meck, W.H., 5 63 , 5 64, 5 65 , 5 66 Molden, D.C., 25 8, 296, 3 06, 3 07, 3 09, 3 1 1 , 776
Medin, D.L., 1 4, 20, 21 , 24, 25 , 26, 28, 29, 3 0, 3 1 , 3 7, Molenaar, P.C.M., 5 3 6
3 9, 41 , 43 , 44, 45 , 46, 47, 5 1 , 5 6, 5 7, 5 9, 63 , 73 , 99, Moller, J.H., 729
1 02, 1 03 , 1 06, 1 08, 1 1 3 , 1 22, 1 3 4, 1 45 , 1 46, 221 , 3 73 , Mone, S., 3 08
3 80, 3 84, 3 91 , 41 6, 496, 5 45 , 666, 71 0, 73 3 Monson, T., 3 03
Mednick, S.A., 3 5 5 Monteil, V., 666
Meehl, P.E., 25 2 Montello, D., 21 8, 224
Meeker, M.N., 782 Moore, G.D., 3 5 5
Mehler, J., 440, 5 69, 63 8 Moore, H., 5 05
82 0 author index
Moore, J.L., 791 National Commission on Excellence in

Moore, U., 693 Education, 776
Moray, N., 1 87, 43 6 National Council of Teachers of Mathematics, 3 45
Morel, F., 441 Naumer, B., 1 79
Moreno, S., 1 93 Navarro, D.J., 21
Morgenstern, O., 243 Neale, J.M., 5 00
Mori, M., 75 9 Neely, J.H., 5 09
Morikawa, 675 Neisser, U., 21 , 766
Morley, E., 480 Nelson, G., 60, 1 3 4
Morris, B.J., 1 94, 5 43 Nelson, L., 269, 270
Morris, C.D., 789 Nersessian, N., 71 3 , 71 5 , 720
Morris, M.W., 673 Nestor, P.G., 5 07
Morris, N., 464 Neter, E., 3 05
Morrison, F.J., 766 Neter, N.E., 274, 278
Morrison, J.B., 23 0 Neth, H., 1 94
Morrison, R.G., 87, 1 20, 1 24, 1 28, 1 3 4, 1 3 6, Nettelbeck, T., 75 5 , 75 7, 763
3 5 6–3 5 7, 41 2, 43 6, 465 , 469, 495 , Neuberg, S.L., 297, 3 00, 3 01 , 3 02, 3 03 , 3 04, 3 05
5 96, 707, 71 0, 75 7 Neuchterlein, K.H., 5 04, 5 09
Moses, L.J., 5 47 Newcomb, T.M., 3 80
Moskovitz, D., 623 Newcombe, N., 21 7, 223 , 441
Moskowitz, G.B., 3 00, 3 03 Newell, A., 4, 74, 1 94, 203 , 3 24, 3 26, 3 27, 3 29, 3 3 0,
Mosteller, F., 20 401 , 404, 470, 476, 477, 488, 5 3 2, 5 3 6, 707, 728,
Mouchiroud, C., 3 60 73 8, 75 8
Moyer, R.S., 21 3 , 225 , 5 69, 5 76, 61 5 Newman, G., 5 8
Mukobi, K.L., 61 2 Newman, S.D., 468
Mulholland, T.M., 1 1 9 Newman, S.F., 790, 792
Mulhotra, A., 227 Newport, E.L., 43 1 , 441 , 65 3
Müller, M.M., 469 Newstead, S.E., 1 69, 1 73 , 1 79, 1 80, 1 93 , 1 94, 200
Muller, U., 5 43 Newton, I., 71 5
Mulligan, N., 43 6 Nguyen-Xuan, A., 1 75 –1 77
Mumford, D., 3 60 Nichelli, P., 45 9
Munakata, Y., 78, 5 3 5 Nickerson, C., 288
Muncer, S.J., 61 8 Nickerson, R.S., 1 8, 778, 779, 780
Mundy-Castle, A.C., 668, 669 Nielsen, P.E., 405 , 425
Munnich, E., 644–646, 65 5 Nienhuis, A.E., 3 1 1
Munro, G.D., 298 Nigam, 721
Murdock, B.B., 45 7 Nightengale, N.N., 766
Murphy, G.L., 1 4, 28, 3 7, 46, 5 0, 5 1 , 5 4, 1 00, Niklasson, T.J., 74
1 06, 1 08 Nisbet, J., 780
Murphy, S.T., 43 7, 43 8 Nisbett, R.E., 1 3 , 63 , 73 , 1 02, 1 07, 1 08, 1 22, 1 46, 1 87,
Murray, C., 766 1 96, 25 1 , 268, 296, 297, 5 40, 5 41 , 670, 671 , 673 ,
Murray, F.B., 3 80 674, 676, 686, 700, 781 , 791
Murray, H.A., 296 Nishihara, H.K., 27
Murray, M.J., 46 Nissen, H.W., 61 0
Murray, S.L., 296, 297 Noel, M.P., 5 74, 5 79
Musch, J., 1 79 Norenzayan, A., 63 , 675
Musolini, J., 65 0 Norman, D.A., 41 , 43 , 62, 3 73 , 3 74, 3 76, 45 7, 460,
Mussweiler, 272 462, 73 8, 73 9, 741 , 743
Myers, J.D., 729 Norman, G.R., 73 3
Myles-Worsley, M., 3 3 7, 3 3 8 Norris, S.P., 785
Mynatt, C.R., 706, 708 Nosofsky, R.M., 1 4, 43 , 45 , 46, 47, 41 6
Novak, J.D., 3 88
Naccache, L., 43 1 , 43 7, 43 8 Novak, M.A., 609
Nadasky, Z., 5 41 Novemsky, N., 280
Nagel, 1 07 Novick, L.R., 1 5 , 1 1 7, 1 28, 1 3 0, 1 3 4, 1 3 6, 1 5 3 , 1 5 5 ,
Nahmias, E., 760 1 5 9, 1 60, 21 4, 221 , 23 0, 3 26, 3 3 5 , 3 3 6, 3 3 8, 3 3 9,
Naigles, L., 646 3 43 , 3 72, 3 84, 401 , 409, 41 8, 5 41 , 705 , 706, 707,
Nakamura, G.V., 3 74 727, 728, 789
Nakayama, K., 1 45 Noy, P., 3 5 3
Narayanan, K., 469 Nsamenang, B., 668
Natale, F., 609, 621 Nunberg, G., 63 7
author index 82 1
Nussbaum, J., 3 88 Pankratz, C., 5 9

Nyberg, L., 43 8 Papafragou, A., 3 9, 221 , 645 , 646, 648, 65 0
Nygren, T.E., 25 8, 268 Papert, S., 721
Parducci, A., 272
Oakhill, J.V., 1 87, 480 Pare-Blageov, J., 5 3 6, 5 3 7
Oaksford, M., 1 01 , 1 70, 1 74–1 75 , 1 76, 1 77, 1 78 Parkins, E., 1 3 0
Oatley, K., 21 7 Parkman, J.M., 5 69
Oberlander, J., 23 2, 743 Parsons, L.M., 21 4, 21 5 , 71 8
O’Brien, A.T., 3 43 Partee, B.H., 3 9, 5 0, 5 2
O’Brien, D.P., 1 71 Pascual-Leone, J.A., 5 3 1 , 783
Ochse, R., 3 5 1 , 3 62 Pashler, H.E., 268
Oden, D.L., 73 , 77, 61 0, 61 1 , 61 2 Pask, C., 1 1 7
Oden, G.C., 260 Passerieux, C., 5 06
O’Donnell, P., 5 05 Patalano, A., 41 6
O’Driscoll, G.A., 5 1 4 Pate, J.L., 61 7
Ogbu, J.U., 766 Patel, V.L., 1 28, 3 24, 3 3 7, 729, 73 0, 73 1 , 73 2, 73 3 ,
O’Hara, L., 3 5 6 73 4, 73 6, 73 7, 73 8, 73 9, 741 , 742, 744
Ohlsson, S., 1 1 8, 228, 3 3 1 , 3 42, 3 43 , 3 71 , 3 75 , 3 81 , Pauen, S., 5 42
3 83 , 3 85 , 3 87, 3 90, 3 91 , 705 , 71 5 , 73 2, 776, 791 Pauker, S.G., 25 9, 729
Ohnishi, H., 29 Paul, 778
Okubo, Y., 5 02, 5 1 2 Payne, J.W., 25 0, 260
Olan, J., 1 80 PDP Research Group, 77
Oliver, D.C., 63 9 Pea, R.D., 5 43 , 790
Oliver, L.M., 5 41 Pearce, J.M., 1 5 6
Ollinger, J.M., 21 6 Pearl, J., 99, 1 1 3 , 1 5 1 , 1 5 3
Olseth, K.L., 76, 1 28, 1 3 1 , 3 40, 3 41 Pearlstone, 271
Olson, D.R., 5 47, 5 48, 766 Pears, R., 5 44
Olson, H.C., 766, 793 Pearson, H.E., 620, 621
Olthof, A., 5 66, 5 68, 61 8 Pearson, K., 1 47, 1 62
Oltmanns, T.F., 5 00 Peay, L., 61 8
Olton, D.S., 470 Pederson, E., 647
Olver, R.R., 5 3 0, 666, 792 Pedley, 5 09
Ordóñez, L., 280 Pedone, R., 1 3 0, 1 3 6, 23 0
O’Reilly, R.C., 78, 84, 5 03 Pegram, G.V., 61 0
Ormerod, T.C., 1 94, 200, 203 , 3 27, 3 43 , 3 44 Pejtersen, A.M., 729
Ortony, A., 3 0, 5 6, 1 1 3 Pelham, B.W., 3 05
Osborn, A.F., 3 5 3 Pellegrino, J.W., 1 1 9, 21 1
Oscanyon, F., 787 Pelletier, F.J., 1 78
O’Seaghdha, P.G., 5 06 Peña, M., 63 8
O’Shea, T., 403 Pendry, L.F., 3 04
Osherson, D.N., 43 , 5 0, 1 01 , 1 02, 1 03 , 1 06, 1 07, 1 08, Peng, K., 63 , 1 96, 673 , 676
1 09, 1 87, 1 88, 3 84, 5 41 , 5 43 , 5 48, 71 2, 71 8 Penn, D.L., 5 1 6
Osler, W., 728 Penner, D.E., 709
Osterholm, K., 1 7 Pennington, G.L., 3 08, 3 80
Osterhout, L., 3 42 Pennington, N., 697
Ostrum, T.M, 697 Penrod, S.D., 697
Over, D.E., 1 02, 1 08, 1 70, 1 72, 1 73 , 1 74–1 75 , 1 76, 1 77, Pepperberg, I.M., 608
1 78, 1 80, 1 99, 267, 476 Perales, J.C., 1 48
Overton, W.F., 5 43 Perani, D., 71 8
Overtoom, M., 780 Perie, M., 3 00
Owen, D.R., 447 Perkins, D.A., 765
Owens, D.C., 5 1 3 Perkins, D.N., 45 , 3 5 6, 3 5 9, 3 60, 3 62, 720, 744, 764,
Oyserman, D., 664, 670 776, 777, 778, 780, 781 , 785 , 787, 789, 790, 791 ,
Ozeki, Y., 5 1 3 792, 793
Perlmutter, M., 5 74
Pacteau, C., 443 , 444 Perner, J., 1 79, 5 47, 5 48, 671
Paivio, A., 209, 225 Perruchet, P., 43 1 , 43 2, 43 8, 443 , 444
Palfai, T., 5 3 8 Persson, 45
Palincsar, A.S., 779, 781 Perusse, 61 7
Palmer, S.E., 22, 73 Peters, E., 25 8
Palmeri, T.J., 1 4, 29, 45 , 46, 47, 41 6, 41 7 Peterson, A.C., 21 6
82 2 author index
Petrov, A.A., 1 28 Puto, C., 25 0

Petty, R.E., 3 00, 3 05 , 3 1 2, 786 Pylyshyn, Z.W., 1 5 , 74, 21 1 , 468, 476, 477,
Pfefferbaum, A., 5 1 2 5 3 5 , 5 81
Phillips, L., 1 98 Pyszczynski, T., 296, 297, 299, 3 04, 43 1
Phillips, S., 5 8, 80, 5 3 6, 5 45
Piaget, J., 4, 1 69, 3 80, 3 85 , 5 29, 5 3 0, 5 3 1 , 5 3 6, 5 3 7, Qin, Y., 425 , 708
5 40, 5 46, 61 3 , 61 8, 620, 668, 671 , 695 , 783 Quartz, S.R., 5 3 7
Pianta, R.C., 765 Quattrone, G.A., 224, 248, 249
Piazza, M., 5 78 Quigley, K.S., 5 45 , 61 3 , 61 4
Picasso, P., 3 60 Quillian, M.R., 22, 3 9, 41 , 42, 43 , 3 73
Pierce, C.S., 1 86, 200, 203 , 21 1 , 73 0 Quine, W.V., 1 3 , 1 4, 98, 642
Pierrehumbert, J.B., 26 Quinlan, D.M., 495 , 5 08
Pinango, M.M., 63 Quinn, P.C., 5 45 , 63 5 , 65 4
Pinel, E.C., 25 8 Quiroz, B., 670
Pinel, P., 482
Pinker, S., 73 , 79, 21 3 , 226, 227, 640, 65 2, 760 Raaijmakers, J.G.W., 1 4
Pinkham, A.E., 5 1 6 Rabain, J., 665 , 666, 675
Pitchert, J., 3 84 Rabain-Jamin, J., 672, 675
Plate, T.A., 81 , 83 Rabain-Zempléni, J., 665 , 666, 675
Plato, 3 5 2, 775 Rabbitt, P.M., 75 5 , 75 7
Platt, J.R., 5 64 Rabin, M., 25 6, 25 9
Plomin, R., 766 Racenstein, J.M., 5 1 6
Plott, C., 25 3 , 25 9 Rachford, D.L., 778
Plunkett, K., 5 3 0 Radvansky, G.A., 21 2
Podgorny, P., 1 8, 3 1 , 21 1 , 21 4 Raeff, C., 670
Poldrack, R., 71 8 Ragland, J.D., 5 09
Policastro, E., 3 60 Raiffa, H., 243
Politzer, G., 1 75 –1 77 Raijmakers, M.E.J., 5 3 6
Polk, T.A., 1 8, 1 94, 203 Ramey, C.T., 765
Pollard, P., 1 75 –1 77, 1 79, 1 80, 480 Ramey, S.L., 765
Polya, G., 3 24, 777, 781 Ramoni, M.F., 73 0, 73 4
Polyani, M., 447 Ramscar, M., 221
Poortinga, Y.H. Ramsey, N.F., 5 05
Pople, H.E., 729 Ramus, F., 440
Popper, K.R., 1 98, 707 Ramus, S.J., 43 9
Posner, M.I., 44, 45 , 47, 3 3 5 , 3 80, 3 81 , 3 87, Rand, Y., 778, 788
75 6 Raney, G.E., 3 44
Potı̀, P., 620, 621 Rasmussen, J., 729, 741
Poulton, 280 Ratcliff, R., 1 23
Pound, R., 689, 690, 693 Rattermann, M.J., 24, 76, 1 23 , 1 24
Powell, C., 765 Rauch, S.L., 43 7
Prabhakaran, V., 1 1 9, 469, 71 8 Rauscher, F.H., 21 8
Prasada, S., 73 , 21 7, 623 Raven, J.C., 1 1 8, 5 91
Pratkanis, A.R., 43 7 Ravizza, S.M., 468, 470
Pratt, C., 5 47 Rea, C.B., 3 42
Prelec, D., 245 , 25 6 Read, D.E., 486
Premack, A.J., 61 5 Reagher, G., 46
Premack, D., 73 , 61 0, 61 1 , 61 2, 61 5 , 61 8 Reason, J.T., 73 8
Prentice, W.C.H., 222 Reber, A.S., 1 79, 1 80, 1 99, 43 1 , 43 2, 43 9, 440, 441 ,
Presson, C.C., 21 8, 220, 224 442, 443 , 444, 446, 447, 61 2
Pretz, J.E., 3 63 , 777 Reber, E.S., 43 2, 440
Pribram, K.H., 5 3 6 Reber, P.J., 47, 441 , 446
Priester, J.R., 271 Reddish, L., 71 5
Prill, K.A., 5 94 Redelmeier, D.A., 25 0, 25 2, 25 5 , 25 9, 284, 286
Prince, A., 73 , 79 Reder, L.M., 41 3 , 742, 791
Prislin, R., 3 1 1 Redington, M., 444
Proffitt, 1 03 Reed, S.K., 44, 1 24, 1 28, 41 6
Proffitt, D.R., 220 Reed, T.A., 691 , 693
Prusse, R., 5 78 Reed, T.E., 75 6, 766
Pulaski, C., 699 Reene, K., 5 43
Purcell, T., 23 1 Reese, E.P., 5 77
author index 82 3
Reese, T.W., 5 77 Roediger, H.L., 1 4

Regan, S, 3 83 Roelofs, A., 496
Regier, T., 63 9, 65 2 Roese, N.J., 298, 3 07, 3 08
Rehder, B., 5 7, 1 09, 5 46 Rogers, C.R., 3 5 8
Reich, L.C., 666 Rogers, Y., 227
Reichenbach, H., 96, 1 01 , 1 5 1 Rogoff, B., 668
Reilly, K.D., 5 3 6 Rogosky, B.J., 49
Reiman, P., 73 5 Roitman, J.D., 5 64, 5 66, 5 68
Reimann, P., 3 3 5 Rokeach, M., 3 73 , 3 81 , 3 87, 3 90, 3 93
Reiner, 3 87 Romo, 5 9
Reingold, E., 43 7, 443 Roney, C., 3 09
Reisberg, D., 3 73 , 743 Ronning, R.R., 3 26
Reiser, B.J., 720 Rood, B., 1 74–1 76
Reiser, J.J., 21 8 Roosevelt, F.D., 1 25
Reitman, J.S., 3 3 7 Rosch, E., 4, 40, 41 , 42, 44, 5 9, 99, 1 00, 1 02, 1 06, 1 45 ,
Reitman, W., 1 1 9, 228, 3 3 0 41 6, 5 45
Reminger, S.L., 441 Roschelle, J., 3 74
Renzulli, J.S., 3 5 4 Roscoe, R.D., 3 90, 71 5
Rescorla, R.A., 1 46, 1 47, 1 48, 1 5 2, 1 5 3 Rose, A., 441
Resing, W.C.M., 1 23 Rose, S.P., 5 3 7
Resnick, L.B., 777 Rosenberg, M.J., 3 80
Resnick, M., 3 83 , 3 87 Rosenbloom, P.S., 404
Restle, F., 3 9, 647 Rosengren, K.S., 5 8, 60
Reuter, H.H., 3 3 7 Rosenzweig, M.R., 3 76
Reykowski, J., 669 Roser, M., 71 8
Reyna, V.F., 5 45 Ross, B.H., 1 4, 46, 47, 60, 76, 1 23 , 1 24, 1 3 0, 3 3 5 , 41 0
Reynolds, R.E., 3 24 Ross, L.D., 296, 297, 298, 700
Rhenius, D., 3 43 Ross, M., 25 5 , 299
Richardson, A., 21 5 Rosso, I.M., 494
Richardson, J., 1 94 Roth, W., 5 05 , 793
Richardson, L.B., 27 Rothbart, M., 3 03
Richardson, M.W., 1 6 Rothenberg, A., 3 5 2, 3 5 3
Richart, 720 Rottenstreich, Y., 246, 25 9
Richerson, P., 607 Rouse, W.B., 1 87
Richland, L.E., 1 24, 1 3 7 Rovee-Collier, C., 441
Rickel, J.E., 61 2 Rozelle, J., 1 02, 1 08, 1 09, 5 48
Rieser, B.J., 21 3 Rozenblit, L., 1 4, 60
Riley, D.A., 221 , 5 44 Rozin, P., 63 9
Rimoldi, H.J., 728 Rubensen, D.L., 3 5 1 , 3 61 , 3 62
Ripley, B.D., 1 4 Rubinson, H., 3 75
Rips, L.J., 6, 1 4, 1 6, 1 7, 21 , 28, 3 7, 3 9, 42, 44, 49, 5 0, Rubinstein, J., 1 3 , 1 1 2
5 7, 5 8, 99, 1 00, 1 02, 1 03 , 1 1 3 , 1 22, 1 3 4, 1 45 , 1 71 , Ruby, P., 21 5
221 , 3 73 , 3 80, 3 84, 3 91 , 41 6, 475 , 479, 496, 5 45 , Ruchkin, D., 462, 469
666, 73 3 Rudy, J.W., 5 03 , 5 3 7
Rispoli, J., 1 1 0 Ruff, C.C., 1 88, 5 07
Rist, R., 201 Ruffman, T., 5 48
Ritchhart, R., 777, 785 , 787, 793 Rumbraugh, D.M., 5 66, 5 68, 608, 609, 61 0, 61 4, 61 6,
Rith, C., 1 80 61 7
Ritov, I., 20, 25 1 , 271 , 272, 283 , 287 Rumelhart, D.E., 41 , 43 , 77, 78, 1 3 4, 3 73 , 3 74, 3 75 ,
Rizzolatti, G., 441 3 76, 422, 5 45
Robbins, S.J., 1 46, 1 48 Runco, M.A., 3 5 1 , 3 5 7, 3 61 , 3 62
Roberson, D., 65 5 Ruscher, J.B., 3 1 1
Roberts, M.J., 1 94, 1 97 Russell, B., 96, 1 1 3
Roberts, R.D., 75 2 Russo, J.E., 3 27
Roberts, W.A., 5 66, 608, 61 8 Ruts, W., 43
Robin, N., 73 , 3 87, 5 3 7 Ruzgis, P., 766
Robinson, J.S., 61 0 Rypma, B., 5 09
Rochat, P., 61 8, 61 9
Rode, C., 227 Sabb, F.W., 5 1 5 , 71 7
Rodet, L., 48 Sadalla, E.K., 223
Roe, A., 3 5 4 Sadato, N., 479
82 4 author index
Saffran, J.R., 43 1 , 441 Schkade, D.A., 25 2, 25 3 , 25 4, 25 9, 271 , 272, 283 , 287,

Saiki, J., 75 743
Saks, M.J., 697–699 Schlegel, J., 693
Salas, E., 729 Schliemann, A.D., 677, 762
Salisbury, D.F., 5 08 Schlosberg, H., 5 78
Salmon, D.P., 61 0, 61 3 Schlottmann, A., 1 63 , 5 48
Salomon, G., 744, 778, 789, 790, 791 , 793 Schmandt-Besserat, D., 23 2
Salovey, P., 75 2 Schmidt, H.G., 73 6, 73 7
Salthouse, T.A., 1 80, 45 7, 467, 5 89, 5 92, 5 93 , 5 94, Schmidt, W.C., 5 3 2, 5 3 5
5 95 , 5 96, 5 97, 5 98, 5 99, 603 , 604, 75 7 Schnake, S.B., 3 1 1
Salvucci, D.D., 1 3 7, 404, 409, 41 0 Schneider, S.L., 246, 249
Samarapungavan, A., 5 49 Schneider, W.S., 498, 5 04
Samuels, M., 1 1 0, 5 48 Schnittjer, S.K., 249
Samuelson, W., 248 Schnur, T., 71 8
Sanbonmatsu, D.M., 3 03 , 3 04 Schoemaker, P.J.H., 246
Sanchez, C.A, 785 Schoenfeld, A.H., 3 3 8, 777, 781 , 789
Sanchez, M., 779 Schoenfeldt, L.F., 3 60
Sandoval, W.A., 720 Scholl, B.J., 1 45 , 1 62
Sands, S.F., 61 0 Schön, D.A., 23 1 , 781
Sanes, J.N., 482 School, B.J., 5 81
Sanjana, N.E., 1 1 0 Schooler, J.W., 25 1
Santamarı́a, C., 1 93 , 1 94, 1 95 , 1 97 Schreiber, C.A., 25 9, 284, 286, 287
Santiago, H.C., 61 0 Schreiber, J., 25 9
Santioso, R., 299, 3 04 Schroeder, C., 5 1
Santos, L.R., 620, 623 , 624 Schroth, H.A., 25 4
Sapir, E., 63 3 , 65 3 Schroyens, W., 1 74–1 75
Sarin, R., 284 Schuette, R.A., 3 03
Sartori, 60 Schultz, T.R., 3 93
Sass, L.A., 5 1 6 Schulz, L.E., 1 09, 1 44
Sattath, S., 25 3 Schunk, D.H., 785
Satterlee-Cartmell, T., 469 Schunn, C.D., 43 1 , 709
Sattler, J.M., 75 3 Schustack, M.W., 1 29, 1 48, 1 5 3
Savage, L.J., 243 , 25 1 Schvaneveldt, R., 1 7, 42, 444, 445
Savage-Rumbraugh, S., 5 66, 61 6, 61 7, 622, Schwartz, B.J., 3 3 7
671 Schwartz, D.L., 1 87, 209, 21 5 , 228, 3 3 1 , 790
Savary, F., 1 90, 1 96 Schwartz, J.H., 477
Savastano, H.I., 1 5 1 , 5 67 Schwarz, N., 269, 271 , 273 , 279, 280, 3 1 1
Savell, J.M., 778 Schyns, P., 48
Saykin, A.J., 5 1 2 Scollville, 43 2
Scaife, M., 227 Scott, 43 7
Scarabis, M., 3 1 2 Scott, P., 5 46
Scardamalia, M., 777, 783 , 787, 788, 789, 790, 791 , Scribner, S., 667
792 Searle, J.R., 477
Scarpinatto, C., 425 Seelig, D., 621
Scepansky, J.A., 298 Segal, J.W., 781
Schaal, B., 3 00 Segall, M.H., 668, 669
Schacter, D.L., 43 1 , 43 2, 43 9, 441 Seger, C.A., 43 9, 71 8
Schaeken, W., 1 74–1 75 , 1 88, 1 93 , 1 96 Seidman, L.J., 5 1 2, 5 1 3 , 5 1 4
Schaffer, M.M., 1 4, 21 , 43 , 45 , 46, 41 6 Seifer, R., 766
Schaffner, K.F., 73 3 , 73 4 Seifert, C.M., 1 23
Schallert, D.L., 3 24 Sejnowski, T.J., 5 3 7
Schank, R.C., 1 4, 1 21 , 3 73 , 3 75 Sekiyama, K., 21 4
Schaper, C., 3 03 Selemon, L.D., 5 1 2
Schauer, F., 689, 701 Selz, O., 775 , 777
Scheffler, I., 785 Senghas, A., 646, 65 3 , 65 4
Scheibel, A.B., 5 1 3 Senghor, L., 666
Scheines, R., 1 1 3 , 1 5 3 Sergent, J., 3 1
Schellenberg, E.G., 447 Serpell, R., 668, 75 1
Schelling, T., 25 7 Serunian, S.A., 61 0
Schiano, D.J., 222, 3 3 7 Servan-Schreiber, D., 43 3 , 443 , 5 00, 5 01 , 5 08, 5 1 1
Schino, G., 609 Settlage, J., 3 87
author index 82 5
Shaffer, D.M., 21 7 Sigg, H., 608

Shafir, E., 1 02, 1 05 , 247, 249, 25 0, 25 1 , 25 2, 25 3 , 25 8, Silver, 5 09
25 9, 268, 71 2, 727, 740, 776 Silver, D., 5 48
Shah, P.P., 227, 25 4, 41 2 Silver, E.A., 3 25 , 3 3 7, 3 3 8
Shaham, Y., 5 03 Silver, H.R., 3 5 8
Shalev, U., 5 03 Silverman, L.H., 43 7
Shallice, T., 60, 45 8, 45 9, 460, 462, 478 Silverstein, S.M., 5 00
Shames, V.A., 445 , 446 Simmel, M., 21 7
Shanks, D.R., 1 47, 1 48, 1 49, 1 5 2, 1 5 7, 1 62, 1 63 , 43 1 , Simmons, 60
43 7, 43 8, 443 , 444 Simon, D., 26, 685 , 689, 695 , 698, 75 8
Shankweiler, D.P., 63 8 Simon, H.A., 4, 1 3 6, 227, 228, 23 2, 244, 268, 3 24, 3 26,
Shapira, A., 670 3 27, 3 29, 3 3 0, 3 3 2, 3 3 3 , 3 3 6, 3 3 7, 3 40, 3 5 2, 3 5 7,
Shapiro, D.A., 485 3 62, 3 72, 3 75 , 401 , 404, 470, 476, 477, 488, 5 78,
Sharp, A., 787 707, 708, 709, 71 9, 728, 742, 743 , 75 5 , 75 8, 780,
Sharp, C., 5 98 791
Sharp, D.W., 761 Simon, T., 75 3
Sharp, J., 666, 676 Simons, M.A., 3 3 7
Shastri, L., 84, 87, 5 3 6 Simons, T., 5 3 2
Shatz, M., 3 85 Simonson, I., 249, 25 0, 25 1 , 25 9
Shayer, M., 783 , 785 Simonton, D.K., 3 5 1 , 3 5 5 , 3 5 6, 3 5 8, 3 5 9, 3 60, 3 62
Shebo, B.J., 5 71 , 5 78 Sinclair, L., 299
Shell, P., 1 1 8, 466, 468 Singer, M., 3 77, 469
Shelley, C., 1 1 7 Singer, W., 84
Shelton, 60 Singley, M.K., 403
Shepard, J., 21 6 Sirois, S., 5 3 5
Shepard, R.N., 1 5 , 1 6, 22, 99, 1 88, 21 1 , 21 2, 21 3 Sisqueland, E., 63 8
Sheppard, J.L., 5 3 0 Sithole, N., 63 8
Sheridan, M., 21 6 Sitnikova, T., 5 08
Sherin, B., 226 Sjoberg, L., 20, 29
Sherman, D.A., 3 00 Skorstad, J., 1 20
Sherman, S.J., 3 26, 3 43 Skovronek, E., 5 93 , 5 97, 5 98
Shertz, J., 3 3 7 Slagter, H.A., 5 05
Shestowsky, D., 700 Slayton, K., 3 5 6–3 5 7
Shettleworth, S.J., 608, 626 Slobin, D.I., 5 3 1 , 641 , 643 , 645 , 646, 65 3
Sheu, C.F., 1 48, 1 5 3 Sloman, S.A., 1 4, 28, 48, 5 4, 5 6, 5 7, 1 02, 1 03 , 1 04, 1 05 ,
Shi, M., 5 4, 63 7 1 07, 1 08, 1 09, 1 1 1 , 1 1 2, 1 1 3 , 1 1 7, 1 22, 1 44, 1 5 9, 1 69,
Shiffrar, M., 21 7 1 79, 1 80, 1 98, 1 99, 221 , 267, 268, 3 06, 3 24, 3 76,
Shiffrin, R.M., 1 4, 47, 45 7, 45 8, 498, 5 04 476, 488, 5 48, 5 89, 63 7, 646, 65 5 , 705 , 71 2, 73 0,
Shigemasu, K., 29 776
Shimamura, A.P., 469 Slotta, J.D., 3 87, 3 92
Shimojo, S., 1 97 Sloutsky, V.M., 1 94, 5 43
Shin, H.J., 46 Slovic, P., 1 75 –1 77, 1 97, 244, 247, 25 3 , 25 5 , 25 8, 25 9,
Shipley, 1 06 268, 271 , 274, 279, 5 41 , 694
Shipley, C., 5 3 7 Slugoski, B.R., 277
Shoben, E.J., 1 6, 1 7, 42, 49, 5 0, 5 1 , 3 84 Sluyter, D.J., 75 2
Shoda, Y., 786 Smilansky, B., 766
Shortliffe, E.H., 729, 73 3 , 73 8 Smith, B., 720
Shrager, J., 5 3 7, 71 9 Smith, C., 447
Shreyer, T.A., 5 45 , 61 3 , 61 4, 61 7 Smith, D.R., 791
Shulman, L.S., 728, 73 0 Smith, E.E., 1 4, 1 6, 1 7, 28, 41 , 42, 43 , 45 , 49, 5 0, 73 ,
Shultz, T.R., 1 46, 5 3 2, 5 3 5 , 5 40, 5 44, 5 48 1 02, 1 03 , 1 05 , 3 84, 41 6, 466, 482, 71 2, 778
Shum, 1 03 Smith, H.J., 61 2
Shweder, R., 268, 671 Smith, J.D., 46
Shyan, M., 61 0 Smith, J.H., 671
Sides, A., 1 02, 5 48 Smith, J.P., 3 74, 5 3 1 , 5 3 2, 789
Sieck, W., 25 9 Smith, L.B., 29, 3 0, 63 , 73 , 5 3 0, 5 3 5 , 640, 644, 65 5
Siegal, M., 65 3 , 708 Smith, S.M., 3 5 1 , 3 5 6, 3 62
Siegel, B., 75 9 Smith, V.L., 25 9
Siegel, L.S., 5 46 Smither, D., 5 42
Siegler, R.S., 423 , 5 3 4, 5 3 5 , 5 3 6, 5 3 7, 5 44, 5 48, 5 49 Smolensky, P., 81 , 82, 83 , 1 3 4, 3 75 , 406, 5 3 6
Sigel, I., 5 3 0 Snedeker, J., 644
82 6 author index
Snell, J., 25 9 Stavy, R., 73 5

Sobel, D.M., 1 09, 1 44 Steele, C., 765
Sober, E., 5 6 Stefanis, N., 5 1 3
Soh, M-H., 425 Stein, B, 3 62
Soja, N., 642 Stein, C., 71 0
Sokol, B., 674 Steinmuller, F., 720
Sokolov, E.N., 5 04 Stelmack, R.M., 75 9
Soll, J.B., 25 6 Stenger, V.A., 425 , 71 7
Solomon, K.O., 3 1 , 3 7, 3 9, 60, 62, 21 8, 3 40 Stenning, K., 1 91 , 1 92, 1 97, 23 2, 743
Solomon, S., 43 1 Stephan, W.G., 297
Soloway, E.M., 3 75 , 3 77 Stephens, D.L., 3 27
Soman, D., 25 9 Sterling, S., 46
Son, J.Y., 3 9–41 , 73 , 78, 96, 99, 1 1 7, 221 , 271 , 666, Stern, E., 227
73 3 Stern, P., 766
Son, L.K., 5 81 Sternberg, R.J., 1 1 8, 1 1 9, 1 48, 1 5 3 , 3 5 1 , 3 5 2, 3 5 5 , 3 5 6,
Song, G., 63 3 5 7, 3 5 8, 3 5 9, 3 60, 3 61 , 3 62, 3 63 , 465 , 467, 668,
Sonino, M., 202 709, 75 1 , 75 2, 75 4, 75 7, 75 8, 761 , 762, 763 , 764,
Sophian, C., 5 44 765 , 766, 776, 777, 778, 780, 782, 789, 790
Sorensen, I., 5 1 3 Stevens, A., 222, 228
Sorrentino, R.M., 296 Stevens, K., 63 8
Sousa, 60 Stevens, S.S., 272, 5 76
Sox, H.C., 25 9 Stevenson, R.J., 1 08, 1 80, 1 87
Spaepen, G.M., 620 Stevyers, M., 47, 1 44, 1 5 9
Spearman, C., 4, 1 1 8, 1 1 9, 468, 75 4, 75 5 Stewart, J., 5 03
Spears, R., 298, 3 1 1 Stich, S., 708
Spelke, E.P., 5 8, 21 7, 482, 5 66, 5 78, 5 80, 5 81 , 623 , Stigler, J.W., 3 3 1
63 5 , 642, 644, 649, 65 1 , 65 2 Stine, M., 469
Spellman, B.A., 1 24, 1 26, 1 29, 1 5 1 , 1 64, 41 0, 5 08 Stokes, A.F., 3 80
Spencer, R.M., 1 23 , 1 3 0 Stone, W.S., 5 1 1
Spencer, S.J., 3 00 Storm, R.W., 5 81
Sperber, D., 1 78, 63 7 Storms, G., 43 , 46
Spiegel, S., 3 09 Störring, G., 1 92
Spilich, G.J., 3 76 Story, A.L., 3 00
Spinozzi, G., 620, 621 Stough, C., 75 5
Spirtes, P., 1 1 3 , 1 5 3 Strack, F., 269, 272, 3 1 1
Spitzer, M., 5 06 Strauss, M.S., 5 78
Sprafka, S.A., 728, 73 0 Strauss, S., 73 5
Spranca, M., 776 Stravinsky, I., 3 60
Springer, 60 Strevens, M., 5 7, 1 1 3 , 5 46
Spurzheim, J., 477 Strike, K.A., 3 80, 3 81
Squire, L.R., 47, 3 72, 43 2, 43 4, 43 9, 440, 441 , 444, Strodtbeck, F., 663
5 1 2, 5 1 4 Stroop, 5 01
Srull, T.K., 3 03 , 3 1 2 Studdert-Kennedy, M., 63 8
St. Clair, D., 5 1 3 Stuss, D., 73
St. John, M.F., 78, 79, 43 1 , 43 7, 43 8, 443 Suda, C., 61 9
Staddon, J.E., 5 45 Sujan, M., 25 7
Stadler, M.A., 440, 441 Suler, J.R., 3 5 3
Staggs, L., 444 Sulkowski, G.M., 61 7
Stanescu, R., 482 Sullivan, K., 5 47
Stankiewicz, B.J., 73 , 84 Sunstein, C.R., 25 4, 271 , 272, 688
Stankov, L., 75 2 Super, C.M., 668, 792
Stanley, J.C., 696 Suppala, T., 65 3
Stanovich, K.E., 1 74–1 76, 1 79, 1 80, 1 99, 244, 25 8, 267, Suppes, P., 97, 201 , 5 76
278, 476, 786 Surber, C.F., 5 3 4, 5 48
Stanton, R., 47 Sussman, D., 71 0
Staplin, L.J., 223 Suwa, M., 209, 21 6, 228, 23 1
Star, J.R., 5 3 Suzuki, H., 29
Stark, C.E.L., 47, 441 Swallow, J., 783
Starkes, J.L., 3 3 7 Swartz, K., 61 4
Staton, W., 71 9 Sweller, J., 73 3
Stavi, J., 65 0, 65 4 Swets, J.A., 779
author index 82 7
Syc, S.E., 21 8, 3 40 Tomasello, M., 73 , 5 47, 608, 620, 621 , 625 , 626, 65 2,
Szolovits, P., 73 3 65 3 , 671
Tomberg, C., 84
Taatgen, N., 41 9, 422, 423 Tomie, A., 61 0, 61 3
Tabak, I., 720 Toms, M., 464
Tabossi, P., 1 92–1 93 Tooby, J., 61 , 1 73 , 1 98, 279, 287, 5 41 , 5 43 , 760
Tager-Flusberg, H., 5 47 Tordjman, S., 3 60
Takane, Y., 3 1 Torff, B., 764
Talmy, L., 21 1 , 645 Torgerson, W.S., 1 5
Tang, C., 75 9 Torrance, E.P., 3 5 4, 3 5 5
Tang, J., 482 Torrey, E.F., 5 1 2
Tannenbaum, A.J., 778, 788 Toth, J.P., 443 , 447
Tannenbaum, P.H., 3 80 Toulmin, S., 61
Tarr, M.J., 1 4 Toupin, C., 24, 1 24
Tasso, A., 1 93 , 1 94 Town, J.P., 776
Taves, E.H., 5 77, 5 78 Trabasso, T., 3 9, 3 77, 3 80, 41 5 , 5 3 0, 5 44, 5 45
Taylor, H.A., 21 8, 220 Trager, G.L., 63 9
Taylor, R., 3 85 Treisman, A., 5 81
Taylor, T., 5 65 Treisman, A.M., 21
Tees, R., 63 8 Tremoulet, P.D., 1 45 , 1 62
Tenbrunsel, A.E., 25 4 Trepel, L., 46
Tenenbaum, J.B., 1 3 , 1 4, 1 6, 22, 1 1 0, 1 1 1 , 1 44, 1 45 , 1 5 8, Trevarthen, C., 671
1 5 9, 1 64 Triandis, H., 669
Terman, L., 75 3 Trinca, L., 621 , 622
Terrace, H.S., 5 45 , 5 68, 5 70, 5 81 , 61 4, 61 5 Trope, Y., 267
Terrazas, P., 646 Trueswell, J., 644
Tesar, B., 82 Truman, H., 1 25
Tetewsky, S.J., 3 5 7 Truong, B., 465
Tetlock, P.E., 296, 298, 3 01 , 3 02, 3 05 Tsivkin, S., 482, 649, 65 2
Teubel, E., 776 Tsuang, M.T., 5 1 1 , 5 1 2
Teuber, H.L., 43 9 Tuckerman, N., 3 74
Thagard, P.R., 1 3 , 1 5 , 24, 73 , 77, 1 1 7, 1 20, 1 22, 1 24, Tuerlinckx, F., 46
1 25 , 1 3 1 , 1 3 4, 1 3 6, 1 46, 1 5 0, 1 87, 3 80, 3 82, 3 90, Tufte, E.R., 227
409, 5 40, 5 41 , 71 2, 71 3 , 71 4, 71 5 , 73 3 Tulhoski, S.W., 466
Thaler, R.H., 247, 25 6, 25 7, 25 9, 694 Tulving, E., 62, 271 , 43 9, 441
Thaman, C., 3 08 Tunteler, E., 1 23
Tharp, R., 5 3 1 Turiel, E., 669
Thatcher, R.W., 5 3 7 Turing, A., 4
Thayer, E.S., 5 45 Turken, A., 41 6
Thee, S.L., 249 Turken, A.U., 46
Thelen, E., 5 3 5 Turner, J.C., 25 7
Therivel, W.A., 3 62 Turner, M.L., 1 20, 41 3 , 466, 75 7
Thibaut, J., 48 Tushnet, M., 695
Thibodeau, 73 5 Tversky, A., 4, 1 7, 1 9, 20, 21 , 29, 42, 99, 1 00, 1 02, 1 1 1 ,
Thomas, H., 5 46 1 75 –1 77, 1 97, 1 98, 1 99, 202, 243 , 244, 245 , 246,
Thomas, J.C., 227, 3 28 247, 248, 249, 25 0, 25 1 , 25 3 , 25 5 , 25 6, 25 8, 25 9,
Thomas, R.K., 61 0, 61 1 , 61 8 267, 268, 269, 270, 271 , 274, 275 , 276, 277, 278,
Thompson, E.P., 3 03 , 3 05 279, 282, 285 , 3 07, 3 3 5 , 3 5 6–3 5 7, 5 41 , 5 76, 5 89,
Thompson, L., 1 3 0, 3 87, 3 91 608, 694, 71 2, 73 9, 775
Thompson, P.M., 494 Tversky, B., 6, 209, 21 2, 21 3 , 21 4, 21 6, 21 7, 21 8, 21 9,
Thompson, R.K.R., 73 , 77, 409, 61 0, 61 1 , 61 2 220, 222, 223 , 224, 226, 228, 229, 23 0, 23 1
Thompson, V.A., 1 79 Tweney, R.D., 706, 708, 709, 71 0
Thompson-Schill, S.L., 498 Twohig, P.T., 778
Thorndike, R.L., 75 3 , 789 Tykocinski, O., 3 09
Thorndyke, P., 223 , 3 74 Tyler, T.R., 3 06
Thorpe, K., 644
Thurstone, L.L., 75 4, 762 Uleman, J.S., 441
Tishman, S., 777, 785 , 787, 792, 793 Uller, C., 5 81
Titone, D., 5 09 Ullman, S., 26
Titzer, R., 5 3 5 Ungerleider, L.G., 442
Tohill, J.M., 1 28 Ursu, S., 426
82 8 author index
Vaadia, E., 84 Wahlberg, K.E., 5 1 6

Vadeboncouer, I., 1 94 Wahlsten, D., 766
Valdes-Perez, R.E., 71 9 Wainer, H., 227
Vallacher, R.R., 1 61 Wakker, P.P., 284
Valone, C., 5 00 Waldmann, M.R., 1 49, 1 5 1 , 1 5 2, 1 63
Van Boven, L., 25 8, 268 Waldron, E., 41 6
Van de Walle, G.A., 5 80 Waldron, E.M., 46
van den Broek, P., 3 80 Walker, R.A., 5 3 7
Van der Henst, J.B., 1 96, 1 97 Wallace, B., 3 5 7, 3 60
van der Maas, H.L., 5 48 Wallace, J.G., 403 , 5 3 1 , 5 3 2, 5 78
van der Vyver, D., 787 Wallach, D., 41 9
van Dijk, 21 2 Wallach, M., 3 5 5 , 3 5 6
Van Erp, T.G., 5 1 2, 5 1 3 , 5 1 4 Waller, M., 21 6
van Geert, P., 5 3 6 Walter, A.A., 221
van Gelder, T.J., 74 Waltz, J.A., 87, 1 20, 1 28, 465 , 469, 5 3 7
Van Hamme, L.J., 1 49 Wang, Q., 677
Van Haneghan, J., 790 Wang, Y., 5 4, 3 75 , 63 7
van Knippenberg, A., 3 03 Ward, D., 464
van Koten, S., 5 3 6 Ward, G., 5 07
van Mechelen, I., 43 Ward, T.B., 3 0, 1 47, 1 5 3 , 3 5 1 , 3 5 6, 3 62
Vandierendonck, A., 1 93 , 1 96 Ward, W., 1 46
VanLehn, K., 3 72, 41 8, 41 9 Warrington, E.K., 60, 43 9, 45 9, 460, 486
vanVeen, V., 426, 71 7 Wartenbereg, F., 445
Varey, C.A., 271 Wasel, W., 3 00
Vargha-Khadem, F., 5 1 4 Washburn, D.A., 5 68, 609, 61 2, 61 4, 61 5 , 61 6
Varley, R., 65 3 Wason, P., 4, 1 74–1 76, 1 79, 1 87, 1 99, 277, 485 , 5 43 ,
Varma, S., 404, 425 707, 708, 709
Vauclair, J., 608, 61 1 Wasserman, E.A., 1 49, 1 5 6, 5 81
Vaughn, L.A., 271 Watson, J., 671 , 672
Vaughn, W.J., 5 67 Watt, N.F., 5 08
Velakoulis, D., 5 1 3 Waugh, N.C., 45 7
Verguts, T., 46 Waxman, S., 63 , 646, 65 4
Verhaeghen, P., 5 92 Webber, B.L., 1 87
Vernon, P.A., 75 5 , 75 9 Weber, E.U., 25 8
Vernon, P.E., 3 5 3 , 75 4 Weber, R., 5 1
Verplanken, B., 25 7, 3 1 2 Webster, D.M., 297, 3 00, 3 01 , 3 05 , 786
Vesvonder, G.T., 3 76 Wedell, D., 29, 25 3
Vicente, K.J., 729 Wegener, R.E., 3 05 , 3 1 2
Vigorito, J., 63 8 Wegner, D.M., 1 61
Villa, M.F., 5 3 6 Wehner, L., 3 5 8
Vinden, P., 671 , 672 Weinberger, D.R., 5 1 2
Vinter, R., 43 1 , 43 2, 43 8, 443 , 444 Weinberger, N.M., 43 7
Visalberghi, E., 621 , 622 Weiner, I., 5 03
Vishton, P.M., 21 7 Weiner-Ehrlich, W.K., 26
Viskontas, I.V., 87 Weinstein, B., 61 0
Vitanyi, P., 27 Weisberg, R.W., 1 23 , 1 3 0, 3 3 2, 3 42, 3 5 1 , 3 5 4, 3 5 6,
Vokey, J.R., 443 , 444 3 60, 3 62
von der Malsburg, C., 83 Weisbrod, 5 06
von Hofsten, C., 21 7 Weisel, 21
von Neumann, J., 243 Weiser, M., 3 3 7
von Oech, R., 3 5 3 Weiskrantz, L., 43 2, 43 9, 441
Vonk, J., 61 2 Weiss, S.M., 447
Vosniadou, S., 1 87, 3 88, 5 49 Welch, N., 25 8
Voss, J.F., 3 76, 3 77, 781 Welder, A.N., 65 4
Vye, N., 790 Wellman, H.M., 5 6, 5 8, 60, 61 , 3 74, 5 47, 671 ,
Vygotsky, L., 4, 5 29, 5 3 0, 5 3 1 , 5 3 6, 5 3 7, 783 , 787 672
Welsh, M.C., 469
Wagenmakers, E-J., 1 44 Wenger, E., 789, 790
Wagner, 5 09 Wentworth, N., 441
Wagner, A.R., 1 46, 1 48, 1 5 3 Werker, J., 63 8
Wagner, D., 762, 764, 766 Werner, H., 3 5 3
author index 82 9
Wernicke, C., 478, 498 Wittgenstein, L., 1 86, 1 87, 63 3

Wertheimer, M., 4, 3 24, 3 3 8, 3 40, 445 , 5 3 6, 706 Wober, M., 668
Weschler, D., 75 3 , 778 Wolfe, C., 3 00
West, R.F., 1 74–1 76, 1 80, 267, 268, 278, 404, 476, Wolff, P., 5 9, 63 , 1 21 , 71 4
786 Wolschlager, A., 21 5
West, V., 5 78 Wolschlager, A.b, 21 5
Wexler, M., 21 5 Wood, J.N., 5 7, 5 1 6
Whalen, J., 5 71 , 5 74, 5 75 , 5 76, 5 78 Woodcock, R.W., 5 90
Whalen, P.J., 43 7, 43 8 Woodman, R.W., 3 60
Wharton, C.M., 76, 1 20, 1 23 , 1 24, 1 3 4, 71 7 Woodruff, E., 783
Wheatley, T.P., 25 8 Woodruff, G., 61 1 , 61 8
Whimbey, 779 Woods, B.T., 5 1 2
White, B., 3 87 Woods, D.D., 741
White, B.Y., 73 7 Woods, D.J., 1 7
White, J.B., 701 Woodward, J., 5 9, 1 44
White, P.A., 1 46, 1 48, 1 5 3 Woodworth, G., 699
Whitehead, A.N., 96 Woodworth, R.S., 5 78, 789
Whiten, A., 671 Woody, A., 3 75
Whitfield, S., 5 05 Worsham, A.W., 780
Whitson, J.A., 789 Wraga, M., 220
Whittlesea, B.W.A., 444 Wray, R.E., 425
Whorf, B.L., 63 3 , 63 4, 63 5 , 63 6, 643 , 648, 65 3 Wright, A.A., 61 0
Wickelgren, W.A., 3 28 Wright, J., 25 7
Wickens, C.D., 227 Wright, R.W., 1 5 1
Wickett, J.C., 75 9 Wu, G., 245
Widman, K.F., 5 74 Wu, L., 76, 1 3 1 , 3 41
Wiggins, D., 5 7 Wu, M., 1 5 2, 1 5 5
Wigner, E., 5 5 9 Wundt, W., 499
Wilcox, T., 5 8 Wusthoff, C.J., 5 67
Wilensky, U., 3 83 Wynn, K., 5 81
Wilkening, F., 5 42, 5 48 Wynne, C.D., 5 45
Wilkie, D.M., 5 68, 609
Wilkie, O., 1 02 Xu, F., 5 7, 5 8, 5 80, 5 81 , 65 4
Wilkins, M.C., 480, 487
Willerman, L., 766 Yama, H., 1 75
Williams, E., 5 80 Yamamoto, K., 3 5 5
Williams, K., 63 8 Yamauchi, T., 47, 62
Williams, S., 790 Yang, Y., 1 95 , 1 96
Williams, T., 623 Yaniv, I., 445
Williams, W.M., 678 Yarkin, K.L., 776
Wilson, C., 75 7 Yates, J.F., 25 9
Wilson, D., 63 7 Yerkes, R., 607
Wilson, J., 43 6 Yntema, H., 691
Wilson, R.A., 3 83 Yoshida, H., 63 , 640
Wilson, T.D., 25 1 , 25 8, 686 Yosifon, D., 694, 700
Wilson, W.H., 1 24, 5 3 1 , 5 3 6, 5 3 7, 5 3 9, 5 40, 5 43 , 5 45 , Young, A.W., 403 , 441
5 46, 5 47 Young, M., 790
Wilton, R.N., 222 Young, R.M., 403 , 763
Wimmer, H., 5 47 Younger, B.A., 5 45
Winer, G.A., 5 46 Yuill, N., 21 7
Winn, W., 226 Yuille, 1 48
Winograd, T., 3 71 Yule, P., 1 91 , 1 92, 1 97
Winston, P.H., 1 21 Yurgelun-Todd, D.A., 5 1 2
Winter, A., 226 Yurko, D.S., 5 69
Winter, W., 441
Wirshing, D.A., 5 1 6 Zachary, R.A., 5 91
Wiser, M., 3 74, 3 89 Zacks, J.M., 21 4, 21 6, 21 7, 220, 228
Wish, M., 1 5 Zacks, R.T., 469
Wisniewski, E.J., 25 , 3 7, 3 9, 47, 5 0, 674 Zadeh, L., 43
Wissler, C., 75 3 Zaidel, D.W., 5 1 3
Witt, E.D., 61 2 Zajonc, R.B., 271 , 43 7, 687
830 author index
Zaki, S.R., 47 Zhang, S., 26, 1 28, 3 00, 3 24, 3 3 7

Zambrano, I., 671 Zhang, Z., 674
Zeckhauser, R., 248 Zhao, M., 71 8
Zeidner, M., 765 Zhao, Z., 469
Zeisel, H., 697 Zielinski, T., 5 49
Zeitsman, A., 3 87 Zigler, E., 765
Zelaznik, H.N., 3 08 Zimmerhart-Hart, 1 5 2
Zelazo, P.D., 5 3 8, 5 48 Zimmerman, B.J., 785
Zenasni, F., 3 60 Zimmerman, C., 5 48
Zeng, Q., 744 Zizak, D., 441 , 447
Zenke, L., 780 Zola, S.M., 5 1 2, 5 1 4
Zentall, T.R., 75 5 Zurif, E., 63
Zerbst, J.I., 3 00 Zwaan, R.A., 21 2
Zhang, J., 73 8, 743 Zytkow, J.M., 3 5 2, 3 5 7, 3 62, 71 9
Subject Index
abductive reasoning, 73 0 production-rule learning in, 405

Abecedarian Project, 765 RULEX model in, 41 6
abeyance, 3 88 source activation in, 41 4
abstraction levels subsymbolism in, 406, 408, 422
for causal learning, 1 61 WM and, 41 2, 41 3 –41 4, 41 6, 470
in declarative knowledge, 3 84 adaptive regression, 3 5 3
accessibility effects, 272 Adaptive Strategy Choice Model. See ASCM
in directional-outcome motivation, 299 additive extension effect, 280, 282
accretion, 3 76 factorial designs and,
ACME (Analogical Mapping by Constraint WTP and, 283
Satisfaction) affective thinking, 3 1 1
as analogy model, 1 3 4 affective valence, 271
ARCS model and, 1 3 4 Affirmation of the Consequent (deductive reasoning),
connectionist mapping networks in, 1 3 4 1 72
constraint-satisfaction network, 1 3 4 AG (artificial grammar)
LISA vs., 1 3 4–1 3 5 in implicit cognition, 43 4
multiconstraint theory in, 1 3 4 in implicit learning, 441
predicate matching in, 1 3 4 standard learning tasks for, 43 4
ACT-R (production system), 1 3 7, 404, 408, 425 Agent-Based Modeling and Behavior Representation.
analogy in, 409 See AMBR
attentional activation in, 41 2 aging (cognitive)
BST in, 407 componential models for, 5 98–600
chunk strengthening in, 41 6 comprehension and, 5 93
cognitive modeling architecture in, 404, 406 correlated factor models for, 600–602
differential activation in, 41 3 correlation analysis and, 5 97–604
in implicit cognition, 43 3 factor loading variables and, 603
knowledge structures in, 405 Figure Classification test and, 5 99
language learning and, 422 geometric analogies and, 5 94, 5 96
latency responses in, 41 9 hierarchical structure models and, 602–604
MODS trial in, 41 3 LISA and, 87
parallelism in, 405 Location test and, 5 99
past tense generation in, 423 matrix reasoning and, 5 94, 602
831
832 subject index
aging (cont.) LISA and, 86, 1 3 4–1 3 6

mediational models for, 5 97–5 98 mapping in, 1 1 7, 1 24–1 27, 41 0, 424, 5 3 1
process-oriented research on, 5 93 –5 97 in mental model theory, 1 87
reasoning and, 5 67–5 68, 5 90, 5 91 –5 93 metaphor and, 1 20–1 21
relative distribution of inspection and, 5 94 motion cue diagrams for, 1 3 0
response speed and, 5 93 –5 94 multiple comparisons in, 1 3 0
serial position functioning and, 5 96 non-human primates and, 61 1 –61 3
series completion tasks and, 5 94 People Pieces task, 465
strategy use and, 5 94–5 95 political uses of, 1 25 –1 27
study times, relative to, 5 94 problem schemas in, 1 3 0
WCST and, 5 94, 5 95 , 670 problem solving and, 1 22, 1 3 6–1 3 7
WM and, 5 95 –5 97 processing goal impact within, 1 25
algorithms in production systems, 402, 409–41 2
in analysis, 5 psychometric tradition in, 1 1 8–1 20
exhaustive search, 3 25 relational generalization as part of, 1 3 0–1 3 1
in MDS models, 1 6 “relational shift” in, 1 24
in problem solving, 3 25 relations in, 1 21 –1 22, 1 24
alignment-based models (for similarity), 24–26 retrieval impact as part of, 1 23 –1 24
alignable differences in, 25 roles in, 41 0
formal, 24–25 science and, role in, 1 1 7–1 1 8
matching features within, 24 in scientific reasoning, 71 3 –71 4
non-alignable differences in, 25 SME model and, 1 3 2–1 3 4
object sets in, 24 solar systems model for, 409
rigid transformations within, 26 source analogs as part of, 1 1 7, 1 22, 71 4
SIAM, 24 STAR model for, 1 3 2
verbal structural similarity in, 26 “story memory” as part of, 1 22
AMBR (Agent-Based Modeling and Behavior structural parallelism in, 1 22
Representation), 425 symbol-argument consistency in, 5 3 1
American Bar Association, 694 target analogs as part of, 1 1 7, 1 23 –1 24, 71 4
anagrams, 3 43 task-interference paradigm and, 465
pop-out solutions and, 3 43 transfer paradigms for, 1 22–1 23
Analog Retrieval by Constraint Satisfaction model. See WM and, 1 20
ARCS WM in, 1 27–1 28
Analogical Mapping by Constraint Satisfaction. See analysis, 5 . See also thinking
ACME computation in, 5
analogs implementation as part of, 5
in visuospatial reasoning, 225 representation/algorithm as part of, 5
analogy, 5 . See also similarity anaphoric reference, 5 62–5 64, 63 6
abstract, 444 animations
ACME model for, 1 3 4 diagrams and, 23 0
ACT-R production systems and, 1 3 7, 409 limitations of, 23 0
“alignable differences” in, 1 28 anomic aphasia, 499
case-based reasoning and, 1 21 ANOVA model
categorization in, 1 22 for causal learning, 1 47
causal references in, 1 22 ARCS (Analog Retrieval by Constraint Satisfaction)
in cognitive development, 5 41 model, 1 3 4
coherence as part of, 1 25 ACME and, 1 3 4
component transfer processes for, 1 1 7, 1 22 arithmetic
computational models of, 1 3 1 –1 3 6 look-up tables and, 5 76
convergence schemas in, 1 3 0 mathematical cognition and, 5 60
Copycat model for, 1 3 1 number fact retrieval in, 5 74–5 75
CWSG as part of, 1 28 reaction time predictors in, 5 74
false recognition in, 1 29 artificial intelligence systems
4-term, 1 1 8, 1 1 9 medical reasoning and, 728
geometric, 5 94, 5 96 mental model theory and, 1 87
IAM, 1 3 1 ASCM (Adaptive Strategy Choice Model), 5 3 8
inference generalization as part of, 1 28–1 3 1 associative learning systems
intelligence and, 75 8 causal learning and, 1 47
knowledge representation in, 1 21 –1 22 ceiling effects and, 1 5 2–1 5 3
“known” domains in, 409 cue-outcome contingency as part of, 1 47
in legal reasoning, 686, 687–688 dual process theory and, 1 80
subject index 833
inductive reasoning and, 96 in naı̈ve theory (inductive reasoning), 1 09

temporal contiguity as part of, 1 47 negative conclusion, 1 74–1 75
asymmetry normative system problem and, 1 73 –1 75
causal, 1 1 2 path, 646
in conflict decisions, 25 0 in similarity-based induction, 1 07
hemispheric, 485 –487 weighting, 270
predictions (causal) and, 1 5 1 The Big Book of Concepts (Murphy), 3 7
in similarity-based induction, 1 03 binding
asynchronous parallelism, 426 argument, 76
atomic chemical reaction theory, 3 73 in conjunctive connectionist representations, 84
atomistic concepts, 5 2–5 3 dynamic, 84
attribute framing, 249 multiplicative schemes, 82
attributes relational representation, 82
affective valence and, 271 role-filler, 82, 83 –84, 87
compatibility between, 25 2–25 3 “black box” theories, 478
in directional outcome-motivation, 297–298 blocking, 1 48
evaluation of, 25 2–25 4 backward, 1 49, 1 5 5
extensional, 282 bootstrapping
framing of, 249 knowledge structure repairs in, 3 89
natural assessments for, 271 in language, 643 , 649, 65 3
non-directional outcome-motivation and, 3 01 nonmonotonicity and, 3 89–3 90
spreading activation and, 249 Boyle’s law
substitution of, 269–273 , 274, 287 in power PC theory, 1 5 3
target, 269 Bradley-Caldwell (intelligence) studies, 765
auditory hallucinations, 5 05 brain regions
domain specificity and, 60
backpropagation models BST (Building Sticks Task), 407
in dynamic systems models, 5 3 6 in ACT-R, 407
in neural net models, 5 3 3 guess-overshoot production rules in, 407
BACON program, 3 5 7 guess-undershoot production rules in, 407
balance scale model, 5 3 2 hillclimb-overshoot in, 408
cognitive development and, 5 48–5 49 hillclimb-undershoot in, 408
diagram, 5 3 2, 5 3 3 leaned utility values average in, 408
structure of, 5 3 4 overshoot in, 407
Bayesian models undershoot in, 407
of categorical inference, 1 1 0 Building Sticks Task. See BST
extensional reasoning and, 1 97
for inductive reasoning, 1 1 0–1 1 1 Cambridge Handbook on Thinking and Reasoning, 688
in WTP surveys, 283 CAPS
belief bias, 1 75 –1 77, 1 79 constraints for, 41 3
belief systems, 3 . See also intuitive theory WM in, 41 2
belief-laden material cascade correlation models, 5 3 5 , 5 40, 5 48
cerebellum effects of, 485 Cascade model (skill acquisition), 41 9
hemispheric asymmetry and, 485 CASE (Cognitive Acceleration through Science
belief-logic conflicts Education) program, 783
cerebellum effects of, 487 case-based reasoning
cognitive neuroscience and, 487–488 in analogy, 1 21
inhibitory trials for, 487 categorization (conceptual), 3 8, 41 5 –41 7
belief-neutral material analogy and, 1 22
cerebellum effects of, 485 basic level, 40
The Bell Curve (Herrnstein/Murray), 766 causal learning and, 1 61
bias decision bound, 46
belief, 1 75 –1 77, 1 79 EBRW model, 41 6
content/context effects of, 1 78–1 79 exemplar views of, 45
in deductive reasoning, 1 74–1 75 , 1 76 family-resemblance, 5 7
external validity problem and, 1 74–1 75 feature learning within, 47–48
heuristics and, 268, 270 goal planning and, 3 8
interpretation problem and, 1 75 identity-lending, 5 7
in language, 646 in inductive reasoning, 1 00
matching, 1 74–1 76 inference and, 44–48
in MDS models, 1 9–20 inference learning in, 47
834 subject index
categorization (cont.) problems in, 1 45

inside view of, 1 02 release-from-overshadowing condition in, 1 5 5 –1 5 6
learning and, 3 8 situational statistical models for, 1 47–1 48
in legal reasoning, 698–699 support models for, 1 5 8
multiple function sensitivity in, 62 temporal contiguity and, 1 47
multiple strategy approach for, 424 time and, 1 61 –1 64
multiple systems of, 46–47 tradition rule-based models for, 1 5 3
neural network, 46 unbound variables in, 1 5 3
new models of, 45 –46 causal thinking
outside view of, 1 02 in medical reasoning, 73 6–73 8
predicates’ role in, 1 00 in scientific reasoning, 71 0, 71 1
prototype views within, 44 “cause and effect.” See also causal learning
rational, 46 asymmetry in, 1 1 2
reasoning as function of, 48 inductive reasoning, role in, 95 , 98, 99,
RULEX model, 41 6 1 1 2–1 1 3
sensitivity within, 45 relevance in, 1 1 2
structure within, 47 CCC (Cognitive Complexity and Control) theory,
transfer tests in, 47 5 3 8–5 3 9
typicality effects within, 44 relational complexity metric and, 5 3 9
causal learning ceiling effects
abstraction levels for, 1 61 associative learning models and, 1 5 2–1 5 3
acyclic constraints in, 1 5 3 in causal learning, 1 5 2–1 5 3
ANOVA model for, 1 47 covariation vs. causation in, 1 5 2
applications of, 1 45 , 1 46 inhibitory cues and, 1 5 2
associative learning theory and, 1 47 power PC theory and, 1 5 5
Bayesian integration accounts as part of, 1 45 central executive
bi-directional associations and, 1 5 1 in Embedded Processes model, 462
blocking in, 1 48, 1 49, 1 5 5 functions of, 461
category formation and, 1 61 in multiple working memory models, 45 8
ceiling effects in, 1 5 2–1 5 3 Supervisory Attentional System in, 460–461
coherence of, 1 5 9 central executive (memory model), 45 8, 460
computational-level induction theory for, 1 5 3 –1 5 5 certification theory, 3 5 5
contiguous event pairings in, 1 62 childhood development
contingency tables for, 1 47 relational thinking during, 73
correlations as part of, 1 5 1 choice. See also decision making
covariation vs., 1 44, 1 5 4 BST and, 407
cue-outcome contingency and, 1 47 in production systems, 406–409
cues as part of, 1 47 chunk decomposition, 3 44
diagnostic conditions as part of, 1 5 1 chunking
directions of, 1 5 1 –1 5 2 conceptional, 5 3 9
empirical knowledge and, 1 45 in Q-SOAR model, 5 3 2
enabling conditions in, 1 60–1 61 in Soar, 41 9
flexibility of, 1 5 9 Church-Turing hypothesis, 476, 489–490
historical background of, 1 45 circular convolutions, 81 , 83
human contingency judgment studies for, 1 5 6–1 5 8 city-block metrics
inference and, 21 7 in MDS models, 1 6
intervention in, 1 44, 1 60 class inclusion (cognitive development), 5 46
iterative retrospective revaluation in, 1 5 9–1 60 classical conditioning, 5 05
knowledge mediation hypothesis as part of, 1 63 classification (cognitive development), 5 45 –5 47
“launching effects” in, 1 45 , 1 62 prototypes models of, 5 45
learning process and, 1 44 clinical psychology, 4
mechanism view of, 1 46 “code-switching,” 65 3
“no confounding” principle in, 1 5 4 cognition
noncausal observations and, 1 44 creativity and, 3 5 6–3 5 8
normative deviations in, 1 5 8–1 5 9 mathematical, 5 5 9
outcomes as part of, 1 47 medical, 727
perception and, 1 45 , 1 62 motivated thinking vs., 296
power PC theory of, 1 5 3 , 1 5 4 non-directional outcome-motivation and, 3 1 2
predictions as part of, 1 44, 1 5 1 , 1 5 2 similarity in, 1 4
predictive learning vs., 1 47 Cognitive Acceleration through Science Education
probabilistic contrast in, 1 47 program. See CASE
subject index 835
Cognitive Basis of Science (Carruthers/Siegal/Stich), 708 perspective in, 223

“cognitive collage” rotation and, 224
cognitive maps vs., 224 Cognitive Models of Science (Giere), 708
Cognitive Complexity and Control theory. See CCC cognitive neuroscience, 5
theory belief-logic conflicts and, 487–488
cognitive deficit integration brain localization functions, role in, 477
thought disorder and, 5 09–5 1 0 for deductive reasoning, 476–477
Cognitive Development (Bryant), 5 3 0 dissociation and, 477–478
cognitive development (children), 5 29–5 3 0. See also fundamental processes of, 6–7
thinking future applications of, 489
analogy in, 5 41 hemispheric asymmetry and, 485 –487
“apprenticeship” in, 790, 792 mental logic theory and, 479
ASCM and, 5 3 8 mental model theory and, 478–479
balance scale in, 5 48–5 49 real-time computational models for, 477
brain development links for, 5 3 7 Cognitive Research Trust program. See CoRT
CASE program for, 783 Cognitive Revolution, 21 1 , 295
category induction in, 5 46 cognitive development and, 5 3 0
causal reasoning in, 5 48 non-human primates, role in, 5 93
CCC theory in, 5 3 8–5 3 9 Cohen & Braver’s model (thought disorder),
class inclusion in, 5 46 5 00–5 03
classification in, 5 45 –5 47 cognitive control mechanism as part of, 5 08
Cognitive Revolution and, 5 3 0 diagram, 5 00
complexity in, 5 3 8–5 3 9 lexical disambiguation task in, 5 01
concept of Earth in, 5 49 “one-back” continuous performance task in, 5 01
concept of mind in, 5 47 phasic dopamine activity in, 5 02
concrete operational stage as part of, response levels in, 5 02–5 03
530 Stroop task in, 5 01
conservation in, 5 44 coherence
criticism of, 5 3 0 consistency and, 3 82
domain knowledge and, 5 40 collectivism
domain specificity in, 5 41 cultural thought and, 665 , 669, 672
dynamic systems models in, 5 49 gender differences within, 669
equilibration in, 5 3 0 individualism vs., 675 –676
evidential diversity principle and, 5 48 object contextualization in, 674–675
expertise and, 5 40–5 41 communication
formal operational stage as part of, 5 3 0 within concepts, 3 8
“function logic” in, 5 3 0 diagrams for, 227–23 0
groupings as part of, 5 3 0 inter-modular and (language), 65 1
increased dimensionality in, 5 3 9–5 40 Community of Learners program, 794
information integration theory for, 5 48 compatibility, 25 2–25 3
Knowledge Forum for, 783 in attribute evaluation, 25 2–25 3
language and, role in, 5 3 1 principle of, 25 3
preoperational stage of, 5 3 0 complex knowledge systems, 3 71
reasoning processes in, 5 41 –5 43 declarative, 3 71
relational complexity metric in, 5 3 9, 5 48 procedural, 3 71 –3 72
scientific thinking about, 5 47–5 48 componential theory (intelligence), 75 7–75 8
sensorimotor stage of, 5 29 compromise effect, 25 0
strategy development for, 5 3 7–5 3 8 computational models
symbolic categories in, 5 45 –5 47 ACME, 1 3 4
time/distance/velocity and, 5 48 for analogy, 1 3 1 –1 3 6
transition mechanisms in, 5 49 for causal learning (induction), 1 5 3 –1 5 5
transitivity in, 5 44–5 45 cause in, 1 3 7
Wason selection task and, 5 43 CWSG, 1 28
zone of proximal development in, 5 3 1 data mining in, 71 9
cognitive maps limitations of, 1 3 7
alignment and, 223 –224 LISA, 1 3 4–1 3 6
“cognitive collage” vs., 224 in psychometric tradition (analogy), 1 1 9
in distortions program, 222 in scientific methodology, 1 09–1 1 1
hierarchical organization in, 222–223 for scientific reasoning, 71 9
information levels in, 223 SME, 1 3 2–1 3 4
landmarks in, 223 stimulus generalization (causal learning), 1 5 6
836 subject index
computer conflict evasion

language programming systems, 74 abeyance in, 3 88
simulations, 4 bolstering in, 3 88
Computer Supported Intentional Learning in declarative knowledge, 3 88
Environments program. See CSILE recalibration in, 3 88
concept of mind, 5 47 conflict sets, 402
appearance-reality testing for, 5 47 conjunctive connectionist representations,
executive function relation to, 5 47 81 –83
false-belief tasks for, 5 47 binding storage and, 84
social-perceptual knowledge and, 5 47 circular convolutions in, 81 , 83
“theory of mind” vs., 5 47 HRRs in, 81 , 83
concepts implicit relations continuum as part of, 83
broad applications of, 62 LISA and, 84
categorization within, 3 8, 40 LTM storage, 84
combination theories, 3 9, 40, 49–5 2, 60 SAA isomorphism and, 82
communication within, 3 8 sparse coding in, 83
comprehension of, 3 8 spatter codes in, 81 , 83
direct observation within, 5 0 “superposition catastrophe” within, 83
domain specificity within, 5 8–62 tensor products in, 81
functions of, 3 7, 3 8–3 9, 44–48 connectionist representations
hypothesis testing, 40 conjunctive, 81 –83
indirect observation within, 5 0 diagram of, 80
inferential/atomistic, 5 2–5 3 distributed activation vectors in, 78
language functions within, 48–5 4 eliminative, 79
memory and, 3 9, 40–41 , 48–49 flexibility of, 78–79
mental pathways between, 3 9 identity functions for, 80
as mental representation, 3 7 latent semantic analysis in, 78
nature of, 6 patterns in, 78
“No Peeking Principle” as part of, 5 0 relational capacity of, 81
nonproprietary, 61 SAA vs., 78
prediction for, 3 8 specific function inputs for, 80
preexisting structures for, 71 5 Story Gestalt model and, 79
proprietary, 61 summary of, 87
psychodiagnostic classification within, 5 4 as symbolic relational representation, 78–81
psycholinguistic research and, 62–63 conservation
psychological essentialism, 5 6–5 7 acquisition in, 5 44
psychometaphysics and, 5 5 in cognitive development, 5 44
radical change for, 71 5 Q-SOAR model and, 5 44
“semantic memory marriage” as part of, 3 9 consistency (declarative knowledge), 3 80–3 82
sortalism, 5 7–5 8 coherence and, 3 82
theories of, 5 4–5 6 vs. veridicality, 3 81
conceptual knowledge constraint relaxation, 3 44
perceptual vs., 1 07 Contention Scheduler
in similarity-based induction, 1 07 in Supervisory Attentional System, 460
Conditional Proof (suppositional reasoning), 1 71 context
conditional reasoning. See deductive reasoning bias and, effects on, 1 78–1 79
conditioned inhibition, 1 48 in deductive reasoning, 1 74–1 76, 1 79
The Conditions of Learning (Gagne), 3 92 perceptual form, 3 3 1 –3 3 2
“Confirmation Bias” in problem solving, role of, 3 3 1 –3 3 2
in scientific reasoning, 709 in visuospatial reasoning, 225
variants in, 709 contingent valuation method. See CVM
WM limits and, 71 0 The Contrast Model
conflict, 249–25 1 asymmetric similarity prediction by, 20
asymmetric dominance and, 25 0 featural analyses in, 20–21
compromise effect as result of, 25 0 “Hamming distances” in, 21
default alternatives and, 25 0 MDS and, 20
evasion, 3 88 neural network representations in, 21
regularity condition and, 249 object sets in, 20, 23
resolution, 402 copy with substitution and generation. See CWSG
sets, 402 “correction model”
status quo in, 249 dual process theory vs., 268
subject index 837
CoRT (Cognitive Research Trust) program (thinking), CSILE (Computer Supported Intentional Learning
778, 779, 781 Environments) program, 783 , 787, 790, 793
counterfactual thinking cue-outcome contingency
additives in, 3 08 in associative learning theory, 1 47
in directional outcome-motivation, 298 blocking in, 1 48
in regulatory focus theory, 3 08 in causal learning, 1 47
in strategy-motivated thinking, 3 07–3 08 conditioned inhibition in, 1 48
subtractives in, 3 08 cue validity in, 1 47, 1 48
covariation, 1 44, 1 46 overshadowing in, 1 48
causal learning vs., 1 44, 1 5 4 cultural thought. See also culture
in ceiling effects, 1 5 2 behavioral interpretations and, 666
studies on, 1 5 5 –1 5 6 collectivism and, 665 , 669, 672
creative thinking. See creativity demographic factors for, 669
creativity, 3 5 1 difference patterns in, 673
adaptive regression in, 3 5 3 in dispositional thinking, 787–788
BACON program and, 3 5 7 enthnotheories of intelligence and, 668
blind variations in, 3 5 9 “Essentialist” model of, 674
candle problem and, 3 5 6 ethnography and, 665 –666, 667
cognitive approaches to, 3 5 6–3 5 8 false-belief tasks and, 673
computer simulation approaches to, 3 5 7 importance of, 792
confluence approaches to, 3 5 9–3 61 logic and, 676
contribution scaling for, 3 63 –3 64 narrative vs. logical-scientific in, 675
divergent thinking tasks for, 3 5 4 négritude and, 666
elaboration in, 3 5 3 practice as part of, 793
environmental role in, 3 5 1 problems in, 667
evolutionary approaches to, 3 5 9 “Relationship” model for, 674
evolving-systems model for, 3 60 religion as factor in, 670
explicit theories for, 3 60 resources in, 793
Genoplore model for, 3 5 6 sibling caregiving and, 672
implicit theories for, 3 60 social ideology levels and, 666–668
intelligence and, 3 5 4–3 5 5 theory of mind and, 670–674
intrinsic motivation for, 3 5 8 for thinking, 796
investment theory of, 3 61 “thinking routines” and, 793
lateral thinking and, 3 5 2 visual pattern construction in, 676–677
Lorge-Thorndike Verbal Intelligence scores and, “Culturally Situated Cognition”
355 (Ceci/Kopko/Wang/Williams), 677
mystical approaches to, 3 5 2 culture
9-dot problem and, 3 5 6 positive self-evaluation and, 3 1 2
pragmatic approaches to, 3 5 2–3 5 3 reasoning as effect of, 626
primary, 3 61 Culture and Thought (Cole/Scribner), 667
propulsion theory of, 3 63 CVM (contingent valuation method)
psychodynamic approach to, 3 5 3 –3 5 4 scope neglect and, 282
psychometric approaches to, 3 5 4–3 5 6 CWSG (copy with substitution and generation)
RAT for, 3 5 5 analog inference and, 1 28
SAT scoring and, 3 5 5 computational models’ use of, 1 28
secondary, 3 62 constraints on, 1 29
selective retention in, 3 5 9 SME and, 1 3 2
self-actualization and, 3 5 8 variable binding/mapping in, 1 28–1 29
social environment and, 3 5 8
social-personality approaches to, 3 5 8–3 5 9 DAPAR test (thinking), 778
synectics in, 3 5 3 decision making
“thinking hats” in, 3 5 3 attribute evaluation in, 25 2–25 4
Torrance Tests of Creative Thinking and, 3 5 4 bounded rationality in, 244
Unusual Uses Test in, 3 5 4 conflicts in, 249–25 1
WISC and, 3 5 5 decision utility and, 25 9
Critical Legal Studies movement, 694, 695 –696 default alternatives in, 249, 25 0
judges and, 695 description invariance in, 246
legal realism theory and, 695 emotions, role in, 25 8
cross-dimensional mapping (for heuristics), 272 experienced utility and, 25 9
cross-modality matching and, 272 frame of mind for, 25 7–25 8
univariate matching and, 272 framing effects for, 246
838 subject index
decision making (cont.) biases in, 1 74–1 75 , 1 76

insurance and, 245 , 246–247 cognitive neuroscience and, 476–477
local/global perspectives in, 25 4–25 8 content effects on, 1 87
loss aversion, 247–249 context effects on, 1 74–1 76, 1 79
mental accounting in, 25 5 –25 6 Darwinian algorithms, 1 73
normative intuitions in, 25 4 defeasible inferences in, 1 78
perceived certainty in, 246 Deontic Selection Task for, 1 74–1 76, 1 78
preference inconsistencies in, 25 1 dual mechanism theories and, 476
prospect theory of, 244–246, 260 dual process theory and, 1 79–1 80, 1 81
rational theory of choice and, 243 , 244 errors in, 71 3
reason-based, 25 1 –25 2 hueristic-analytic theory and, 1 73 –1 75
reasoning and, 2 inference rules systems for, 1 71 –1 72, 475
repetition in, 25 4–25 5 language and, 9
risk framing in, 246–247 legal reasoning by, 686–687
riskless choice in, 247–249 mental logic theories and, 475 –476
segregated opportunity in, 25 5 mental model theory and, 1 72, 1 81 , 1 90–1 91
semantic framing in, 249 ‘natural logics’ as part of, 1 71 , 1 81
separate vs. comparative evaluation in, 25 3 –25 4 origins of, 1 69
status quo bias in, 248–249 pragmatic reasoning schemas in, 1 73
subjective utility in, 243 principle of truth in, 1 72
Sure Thing Principle in, 25 1 psychological theories of, 475 –476
temporal discounting in, 25 6–25 7 in scientific reasoning, 71 2–71 3
under uncertainty, 244–247 syllogisms and, 1 69, 71 3
decision-analysis (medical reasoning), 728 Deontic Selection Task, 1 74–1 76, 1 78, 1 80
quantitative model of inference in, 728 Darwinian algorithms in, 1 77
declarative knowledge, 3 71 pragmatic reasoning schemas in, 1 77
abstraction levels in, 3 84 Wason selection task and, 1 74–1 76, 1 80
accretion of, 3 76 “derailment,” 496
assembly process within, 3 83 –3 84 verbal, 5 09
assimilated information as part of, 3 77 developmental psychology
assimilation distortion in, 3 88 central conceptual structures within, 784
atomic chemical reaction theory in, 3 73 evolution of, 783 –784
center-periphery structures within, 3 73 –3 74 sortalism’s role in, 5 7
complexity within, 3 83 –3 84 diagnosticity effects
computational power in, 3 88–3 89 in similarity, 29
conceptual combinations within, 3 84 diagram
conflict evasion in, 3 88 motion cue, 1 3 0
connectedness within, 3 77–3 80 diagrams
consistency in, 3 80–3 82 animations and, 23 0
domain grouping in, 3 73 for communication, 227–23 0
dynamic equilibrium within, 3 86 for connectionist representations, 80
egocentrism in, 3 85 Duncker radiation problem and, 23 0
evolution theory and, 3 71 , 3 73 enrichment of, 229–23 0
exocentrism in, 3 85 expertise and, 229
explanation patterns as part of, 3 75 extra-pictorial devices for, 23 0
finer grain of representation in, 3 82–3 83 graph comprehension models and, 227
intuitive theories within, 3 74 iconic, 1 87
knowledge base size in, 3 74, 3 76–3 77 inferences from, 227–229
learning paradox in, 3 89 for insight, 23 0–23 1
monotonicity in, 3 86 in mental model theory, 1 86
nonmonotonicity in, 3 89–3 92 motion cue, 1 3 0
organization in, 3 73 –3 76 multiple sense of, 23 0
perspective change in, 3 84 9-dot problem, 23 0
plans in, 3 75 “reading off,” 227
schemas in, 3 74–3 75 in visuospatial reasoning tasks, 21 9, 227
scripts as part of, 3 75 direct conjunction fallacy, 276
semantic networks within, 3 73 directional outcome-motivation, 296, 297
theory representation within, 3 73 –3 74 accessibility in, 299
vantage point shifting in, 3 84–3 85 attribution in, effects on, 297–298
deductive reasoning, 2 circumstance distinction within, 3 05
Affirmation of the Consequent as part of, 1 72 closure motivation in, influences on, 3 05
subject index 839
cognitive processes and, 3 00 psychology (naı̈ve), 5 9

cognitive-resource constraints on, 3 04 semantic-based memory structures and, 60
concept organization, effects on, 3 00 transitive inference within, 5 41
counterfactual thinking in, 298 domain-general information sources, 1 4
evidence evaluation for, effects on, 298 domains
extended information processing in, 3 04–3 05 before-after structures in, 3 73
influences on, 297 cause-effect in, 3 73
information search in, effects on, 299 in declarative knowledge, 3 73
knowledge activation in, 299 hierarchies in, 3 73
memory reconstruction in, 299 local structuring in, 3 73
positive self-evaluation within, 298, 3 00 naı̈ve biology, 5 9, 61
reality constraints on, 3 03 –3 04 naı̈ve law-school, 61
recall and, 299–3 00 naı̈ve physics, 5 9, 61
dispositional bias, 699–700 dopamine
dispositional thinking, 777, 785 , 795 functions, 5 05
behavioral effects of, 786 phasic activity, 5 02
cognitive closure in, 786 receptor binding, 5 1 2
cultivated, 787–788 dot numerosities, 5 77
empiricism of, 786–787 subtizing for, 5 77
entity learners in, 786 dot-arrays, 5 72
habits of mind in, 785 “double disjunction”
incremental learners in, 786 in mental model theory, 1 93
logic and, 785 –786 dual mechanism theories
dissociable neural networks, 481 –482 deductive reasoning and, 476, 479
dissociation (brain functions), 478 dual process theory
cognitive neuroscience and, 477–478 associative learning and, 1 80
4-card Selection Task and, 485 central working memory resources as part of, 1 80
language and, 478 “correction model” vs., 268
patient data evidence for, 484–485 in deductive reasoning, 1 79–1 80, 1 81
transitive reasoning and, 484 factors for, 268
distortions program heuristics and, 267–268
cognitive maps in, 222 implicit learning and, 1 79
representations and, 221 –222 intuitive judgment and, 1 79
symmetry in, 222 System 1 in, 1 80, 267
in visuospatial reasoning, 221 System 2 in, 1 80, 267
diversity duration neglect
age-based sensitivity for, 1 03 as attentional phenomenon, 285
in scientific induction methodology, 1 08 End Affect in, 284
in similarity-based induction, 1 03 in experience evaluation, 283 –285
diverted attention, 43 6 Peak Affect averages and, 284
dual-task paradigms in, 43 6 Peak/End Rule in, 284
DMTS tasks, 61 2 in prototype heuristics, 282
domain specificity substantial variations effects of, 284
in analogy, 409 dynamic systems models (cognitive development),
biology (naı̈ve), 5 9 5 3 6–5 3 7, 5 49
brain regions and, 60 backpropagation models in, 5 3 6
child development and, 5 41
classification and, 5 41 EBRW (exemplar-based random walk) model, 41 6
in cognitive development, 5 41 in WM, 41 6
within conceptual development, 5 8–62 economics, 700
folkbiology and, 5 9 legal reasoning and, 700
inheritance theory in, 60 elaboration (creativity), 3 5 3
interdomain differences and, 5 9–60 electronic medical records. See EMR
mechanics, 3 87 elementary transformations, 21 5 –21 6
memory and, 60–61 candidate catalog in, 21 5 –21 6
multiple storage theories within, 61 eliminative connectionist representations, 79
nonproprietary concepts and, 61 inferences within, 79
“novel,” 409 Embedded-Processes memory model, 462–463
physics (naı̈ve), 5 9 active memory in, 462
in production systems, 409 central executive as part of, 462
proprietary concepts and, 61 diagram for, 462
840 subject index
Embedded-Processes (cont.) recall and, 3 3 7

“focus of attention” in, 462 similarity and, 29
modality specificity in, 463 in transfer of learning processes, 790–791
emotional intelligence, 75 2 Explaining Science (Klahr), 708
emotions explicit induction, 1 99, 200–203
affect heuristics and, 25 8 “conjunction” fallacy in, 202
anticipatory, 25 8 explanations abduction as part, 201
decision making and, 25 8 Keplers third law, 200
empathy gaps and, 25 8 extension effects
frame of mind, role in, 25 8 in similarity, 29
inconsistency as result of, 25 8 extensional reasoning, 1 97, 1 98
reasoning and, 3 1 2 Bayesian reasoning and, 1 97
empathy gaps, 25 8 external validity problem (biases), 1 74–1 75
EMR (electronic medical records), 744
concept-based, 744 false-belief tasks
displays for, 744 cultural thought and, 673
source-based, 744 feature exclusion
time-based, 744 in similarity-based induction, 1 04
endowment effects, 247 feature learning, 47–48
entity learners, 786 similarity and, 48
entrenchment theory “feeling of knowing,” 445
inductive reasoning, 97 fetal hypoxia
kind in, 98 schizophrenia and, 5 1 4
limitations of, 97–98 thought disorder and, role in, 5 1 1
similarity in, 98 Figure Classification test, 5 99
enumeration finer grain of representation
inductive reasoning by, 97 in declarative knowledge, 3 82–3 83
EPIC (production system), 404–405 emergent systems in, 3 83
parallel production-rule firing in, 405 FINST system, 5 81
production rules within, 41 2 pointers, 5 82
Soar combination with, 425 Fluxions (Newton), 71 5
WM in, 41 3 fMRI (functional magnetic resonance imaging),
episodic buffers 468
functions of, 461 –462 conceptual change and, 71 8
in multiple working memory models, 45 8 in scientific reasoning, 71 6
equiprobability. See principle of equiprobability “focus of attention”
ERPs (event-related potentials), 3 42 in Embedded-Processes memory model, 462
in scientific reasoning, 71 6 primary memory vs., 462
essentialism. See psychological essentialism folkbiology, 5 9. See also domain specificity
Euclidean metrics taxonomies within, 5 9
in MDS models, 1 6 foresight, 4
Euler circles formalism theory (legal reasoning)
medical technology and, 743 criticisms of, 689
in mental model theory, 1 90, 1 91 , 1 92 “first principles” in, 688
event-related potentials. See ERPs Law and Economics movement and, 694
“Everyday Cognition” (Carraher/Ceci/Schliemann), in legal reasoning, 688–690
677 4-CAPS (production system), 404, 405
evidential diversity principle, 5 48 parallel firing in, 405
evolution theory, 3 71 , 3 73 4-card Selection Task, 277, 485
replacement systems and, 3 91 content in reasoning and, 485
evolving-systems model, 3 60 dissociation and, 485
exemplar-based random walk model. See EBRW permission schemas in, 485
experiment spaces 4-term analogies, 1 1 8, 1 1 9
hypothesis spaces vs., 709 fractions, 5 60, 5 61
in scientific reasoning, 708 frame of mind
expertise decision making, role in, 25 7–25 8
cognitive development and, 5 40–5 41 emotions and, 25 8
diagrams and, 229 identities and, 25 7–25 8
inference, effects on, 23 1 mood maintenance and, 25 8
problem representations and, 3 3 6 priming in, 25 7
in problem solving, 3 3 6–3 3 8 framing effects, 246
subject index 841
frequency format dedicated comparator mechanism as part of, 5 04

conjunction fallacy in, 279 diagram, 5 03
in System 2 (dual process theory), 279 dopamine functions in, 5 05
fully explicit models excitatory input disruption in, 5 05
in principle of iconicity, 1 88 interrupted motor labeling programs in, 5 05
for sentential reasoning, 1 88, 1 89, 1 90 “match”/“mismatch” signals within, 5 05
truth tables vs., 1 90 reticular nucleus inhibition within, 5 05
functional anatomy (reasoning), 479–481 thalamocortical disinhibition in, 5 05
basic paradigms for, 479–481 heuristics
cerebellum effects, 481 accessibility within, 270–272
content/no content effects, 482 additive extension effect in, 280, 282
dissociable neural networks and, 481 –482 affect, 5 8, 25 8
semantic content and, 482 anchoring, 272
stimuli presentations of, 479 attribute substitution and, 269–274
study findings for, 481 –484 bias and, 268, 270
functional fixedness, 3 3 2 “choosing by liking” in, 286
fuzzy set theory, 43 cognitive, use of, 1 1 1 –1 1 2
coherence rationality and, 277
GE (General Enrichment) programs, 778 cross-dimensional mapping for, 272
General Enrichment programs. See GE in deductive reasoning, analysis of, 1 73 –1 75
General Problem Solver. See GPS direct conjunction fallacy in, 276
generative representation systems dominance violations in, 285 –287
in transformational models, 26 elicitation, 274
generic noun phrases, 5 3 factorial designs in, 280–281 , 287
Genoplore model hill-climbing, 3 27–3 28
for creativity, 3 5 6 identification of, 274–276
exploratory phase of, 3 5 6–3 5 7 in inductive reasoning, 1 1 1 , 1 99
generative phase of, 3 5 6 judgments, role in, 267
geometric models (of similarity), 1 5 –1 7 mean-ends analysis, 3 28–3 29
problems with, 1 9 medical mistakes, role in, 73 9
symmetry assumptions within, 1 8 in medical reasoning, 73 0
Gestalt psychology, 3 in problem solving, 3 26
linguistic connections within, 9 prototype, 281 –287
principles of perceptual organization in, 21 0 representativeness in, 1 1 2, 1 99, 272,
problem solving, origins in, 3 24 274–281
scientific reasoning and, role in, 706 Hierarchical Cluster Analysis
Story model and, 79 and MDS, 22
gesture hierarchical structure models, 602, 603
inference and, 21 8 aging’s effect on, 602–604
spontaneous, 21 8 first-order factors for, 602, 604
GPS (General Problem Solver), 3 24, 3 29 for intelligence, 75 4–75 5
weaknesses of, 3 24 second-order factors for, 603
graphics hierarchies
elements, 225 –226 clustering in, 22
language vs., 23 2 in domains, 3 73
relations, 226 Hierarchical Cluster Analysis, 22
visuospatial reasoning context for, 225 in MDS models, 22
groupthink, 776 representational, 22
GUIDON consultation systems, 729 tangled, 3 73
in thinking (kinds), 2
“Hamming distances” hill climbing
in The Contrast Model, 21 means-end analysis vs., 3 29
Handbook of Implicit Learning (Frensch/Stadler), 441 in problem solving, 3 27–3 28
Head Start program, 765 Hobbits and Orcs problem
hemispheric asymmetry problem solving and, 3 27
belief-laden material and, 485 solution path for, 3 28
belief-neutral material and, 485 holographic reduced representations. See HRRs
cognitive neuroscience and, 485 –487 How to Solve It (Polya), 3 24
Hemsley’s & Gray model (thought disorder), 5 03 –5 06 HRRs (holographic reduced representations), 81 , 83
auditory hallucinations in, 5 05 Human Development, 5 40
classical conditioning and, 5 05 Human Problem Solving (Newell/ Simon), 3 24, 401
842 subject index
hypothesis spaces Incremental Analogy Model, Se IAM

experiment spaces and, 709 incremental learners, 786
in scientific reasoning, 708 incubation, 445
hypothesis testing indirect inference rules systems. See suppositional
medical reasoning and, 73 1 reasoning
scientific reasoning and, 707, 709–71 0 individualism
collectivism vs., 675 –676
IAM (Incremental Analogy Model), 1 3 1 inductive reasoning, 2, 95 , 1 98–1 99
ideational confusion definition, 496 associative learning systems and, 96
IE (Instrumental Enrichment) programs, 778, 782, availability heuristic in, 1 99
787, 788 categorical, 71 2
illusory inferences, 1 94, 1 95 categorical structures and, 1 00, 1 1 1
imagery “cause and effect” and, 95 , 98, 99, 1 1 2–1 1 3
perception vs., 21 1 –21 6, 21 8 collocation as part of, 1 00
“Impetus” theory, 71 5 computational-level causal theory for, 1 5 3 –1 5 5
intuitive theory vs., 3 87 conditional probability in, 1 01
Newtonian scientific theories and, 71 5 contrast models of similarity and, 99
implicit cognition descriptive level of, 96
abstraction in, 442–445 entrenchment in, 97
ACT-R models and, 43 3 by enumeration, 97
AG study for, 43 4 experience as part of, 95
age-based abilities in, 43 1 explicit, 1 99
amnesia and, 43 1 by generalization, 71 2
definition of, 43 2 generalization gradient as part of, 99
“feeling of knowing” in, 445 heuristics as part of, 1 1 1
incubation in, 445 implicit, 1 99
information accessibility in, 43 3 justificatory level of, 96
insight in, 445 naturalistic accounts of, 98
intuition as part of, 445 predicates’ role in, 99
knowledge availability in, 43 3 problems with, 95
learning in, 440–442 projectibility, 97
memory, 43 2, 43 8–440 reflective reasoning and, 96
methodology for, 43 3 –43 6 “riddle” of, 97, 98
as non-rule based, 43 3 as scientific methodology, 1 02
opinions about, 43 1 –43 2 in scientific reasoning, 71 2
perceptual representations in, 443 –445 similarity-based, 1 02–1 07, 1 1 1
problem solving in, 445 –447 tendency as instinct in, 96
spreading activation in, 445 inference
storage of, 43 2–43 3 as concept, 5 2–5 3
symbol manipulation in, 43 3 from diagrams, 227–229
transfer protocols in, 443 expertise, effect on, 23 1
implicit induction, 1 99–200 gesture and, 21 8
principle of modulation in, 1 99 illusory, 1 94, 1 95
truth functional in, 200 learning, 47
implicit learning, 440–442 in mental environments (visuospatial), 21 8–220
acquisitional mechanisms in, 442 in mental model theory, 1 92, 1 94
in adults, 441 –442 motion in space observation, 21 6–21 7
AG procedures in, 441 object-based, 3 3 2–3 3 3 , 3 40–3 42
artificial word segmentation in, 441 perception of causality and, 21 7
in infants, 440–441 in real environments (visuospatial), 21 7–21 8
implicit memory, 43 2, 43 8–440 from similarity, 1 3
under anesthesia, 43 6–43 7 from sketches, 23 1
attentional load in, 43 6 transitive, 61 3 –61 4
context sensitive, 43 9 in visuospatial reasoning, 21 6–221
diverted attention in, 43 6 inference rules systems (direct)
fragment completion for, 43 9 for deductive reasoning, 1 71 –1 72, 475
“polarity” fallacy in, 440 Modus Ponens, 1 71 –1 72
subliminal perception and, 43 7–440 information processing theories, 5 3 1 –5 3 2
implicit relations continuum, 83 cascade correlation models and, 5 3 5
inclusion fallacy dynamic systems models, 5 3 6–5 3 7
in similarity-based induction, 1 05 Neo-Piagetian school and, 5 3 1
subject index 843
neural nets models and, 5 3 2 history of, 75 2–75 3

Q-SOAR model, 5 3 2 major scales for, 75 3
symbolic architectures within, 5 3 1 mental context in, 762
information-processing (in medical reasoning), 728 physical context in, 762
protocol analysis in, 728 social context in, 762
insight problems, 3 42–3 44 Stanford-Binet test, 75 3
anagrams and, 3 43 Weschler scale, 75 3
9-dot problem as, 3 3 1 intensional reasoning, 1 97
pairs in, 3 43 “intermediate effect”
Instrumental Enrichment programs. See IE idealized representation of, 73 5
insurance, 245 in medical reasoning, 73 5
in risk decision framing, 246–247 internal representations, 2
integrative reasoning, 5 93 interpretation problem (for biases), 1 74–1 75 , 1 76
aging and, 5 94, 5 96 intuition, 445
WM and, 5 96 Intuitive Math programs, 779, 780
intelligence intuitive theory
analogy and, 75 8 belief systems vs., 3 93
bioecological model of, 764–765 declarative knowledge and, 3 74
biological approaches to, 75 8–760 formation of, 3 87
Bradley-Caldwell studies on, 765 impetus theory vs., 3 87
brain size and, 75 9 nonmonotonicity and, 3 87
choice reaction times for, 75 5 –75 6 IPAR studies, 3 5 4
cognitive approaches to, 75 5 –75 8 IQ (intelligence quotient)
componential theory for, 75 7–75 8 certification theory in, 3 5 5
contextual approaches to, 760–762 creativity and, 3 5 4
contextual differences in, 75 2 IPAR studies and, 3 5 4
conventional conceptions of, 3 5 4 neural-conduction speed and, 75 9
creativity and, 3 5 4–3 5 5 reaction times and, 75 6
emotional, 75 2 Terman Concept Mastery Test for, 3 5 4
environmental influences on, 766 threshold theory in, 3 5 5
ethnotheories of, 668 irrational numbers (mathematical cognition), 5 61
evolution theory of, 760 extracting roots operation for, 5 62
glucose metabolism and, 75 9 isomorphism
heredity’s role in, 766 hypothesis of, 1 86
hierarchical models of, 75 4–75 5 in SAA, 76, 82
home environment and, effects of, 765 “second-order,” 21 1
implicit theories of, 75 1 –75 2 in semantic alignment, 3 41
improvements to, 764–765 , 766 iterative retrospective revaluation
inspector time indicators for, 75 5 in causal learning, 1 5 9–1 60
integration effects on, 766
mediated learning and, 782 Jasper Woodbury program, 790
multiple intelligences theory, 762–763 Journal of Educational Psychology, 75 1
neural conduction speed and, 75 9 judgment. See also decision making
PET and, 75 9–760 in heuristics, 267
physiological indicators for, 75 8 influences on, 6
primary mental abilities for, 75 4 intuitive, 1 79
psychometric approaches to, 75 3 –75 5 reasoning and, 2
simultaneous processing speed and, 75 6–75 7 similarity in, 29
social, 75 2 thinking and, 1
SOI model for, 75 4 visuospatial thinking and, 221 –224
syllogisms’ role in, 75 7
systems approaches to, 762–765 “kernels,” 1 3 2
testing, 75 2–75 3 Keys to Thinking program, 787, 793
theory of “g” and, 75 4, 782 key-tapping paradigms
Triarchic theory for, 763 –764 baseline conditions for, 5 74
“true,” 764 in numerical estimation (animals), 5 73 , 5 74, 5 77
WM and, 75 7 kind. See also similarity
intelligence quotient. See IQ in entrenchment theory, 98
intelligence testing, 75 2–75 3 knowledge encapsulation, 73 7
culture-relevant, 761 in medical reasoning, 73 7
elements of, 75 3 Knowledge Forum, 783 , 793
844 subject index
knowledge systems language learning (production system), 421 –423

analytic vs. empirical, 1 45 ACT-R and, 422
complex, 3 71 fact representation in, 422
conceptual, 1 07 past tense generation models for, 422
key aspects of, 1 21 lateral thinking, 3 5 2
in metaphor, 1 21 –1 22 “launching effects,” 1 45
perceptual, 1 07 law. See legal reasoning
similarity and, 1 5 Law and Economics movement, 694
Kolmogorov complexity theory formalism theory and, 694
conditional, 27 Law and Society movement, 694–695
within transformational models (of similarity), 27 LCM (lowest common multiple), 3 3 9
learning
labeled graphs, 74 conceptual categorization and, 3 8
language feature, 47–48
ambiguity in, 63 6–63 7 implicit, 440–442
anaphoric reference and, 63 6 signal
bootstrapping in, 643 , 649, 65 3 Learning and Inference with Schemas and Analogies.
classificatory tasks for, 642–643 See LISA
closed-class functional vocabulary and, 641 legal realism theory, 690–694
“code-switching” in, 65 3 American Bar Association and, 694
in cognitive development, 5 3 1 case comparisons in, 692–693
components of, 49 constructive realists and, 693
compositionality within, 5 2 Critical Legal Studies movement and, 695
concept development and, 63 4, 63 5 critical realists and, 693
conceptual functions of, 48–5 4 judges’ role in, 692
“core knowledge” and, 63 5 sociocultural forces’ effect on, 691 –692
count-noun morphology in, 642 “Sociological Jurisprudence” and, 690
deductive reasoning within, 9 sociology and, 691
dissociation (brain functions) and, 478 statutory interpretation in, 691
evidentiality within, 648 legal reasoning, 685
generic noun phrases as part of, 5 3 analogical method (case-based) in, 686, 687–688
Gestalt psychology and, 9 categorical thinking in, 698–699
graphics vs., 23 2 certainty as part of, 699
inconsistency in, 63 7–63 8, 65 0 Critical Legal Studies movement in, 694, 695 –696
inter-modular communication and, 65 1 decision-making in, 697–698
labeling in, 65 4 deductive method (rule-based) in, 686–687
landmark information in, 647 dispositional bias in, 699–700
learning (production system), 421 –423 Economics and, 700
linguistic relativity and, 63 3 , 63 5 , 63 9 empirical testing for, 696
manner-biasing in, 646 formalism theory of, 688–690
in mental model theory, 1 87 group size effects on, 697
motion expression in, 645 –646 Law and Economics movement in, 694
numerosities and, 649–65 1 Law and Society movement in, 694–695
objects’ role in, 642–644 legal realism theory in, 690–694
ontology’s influence on, 643 “noble lie” in, 695
orientation and, 65 1 –65 3 probabilistic data and, 699
path-biasing in, 646 refinements in, 700
perception and, influences on, 63 8 as science, 688
phonemes and, 63 8 scientific reasoning vs., 696–700
phonetic reorganization in, 63 8 Leviathan (Hobbes), 3
polysemy in, 5 3 –5 4 lexical disambiguation task, 5 01
quantifiers within, 65 0 linguistic relativity, 63 3 , 63 5 , 63 9
regularities in, 640 study of, 640
similarity tests for, 65 5 LISA (Learning and Inference with Schemas and
sound categorization and, 65 5 Analogies), 1 3 4–1 3 6
spatial relationships in, 644–645 , 646–648 ACME vs., 1 3 4–1 3 5
spatial, spontaneous use of, 226 aging’s effects on, 87
thinking and, 7 analogs in, 86, 1 3 5
“thinking for speaking” and, 65 3 , 65 4 code hierarchy within, 85 , 86
time and, 648–649 front-temporal degeneration and, 87
visuospatial reasoning, effect on, 220–221 LTM in, 86
subject index 845
MAM and, 1 3 6 mathematical cognition

mapping connections in, 1 3 5 bidirectional mapping in, 5 79, 5 83
propositional representation in, 85 , 86, 1 3 4 children and, 5 79–5 82
as representational format, 84 FINST system and, 5 81
role-filler bindings in, 86 foundations of, 5 5 9
sub-proposition units in, 85 infant number discrimination and, 5 81
unit distribution hierarchy, 1 3 5 magic experiments and, 5 80
WM and, 86, 1 3 5 , 41 3 , 469, 470 numerical estimation as part of, 5 62–5 64
Location test, 5 99 object tracking systems and, 5 81
logic problems, 3 verbal numerical competence and, 5 79–5 82, 5 83
logics systems mathematical problem solving, 3 3 8–3 42
cultural thought and, 676 domain knowledge in, 3 3 8–3 40
mental, 1 71 ERPs in, 3 42
natural, 1 71 LCM in, 3 3 9
predicate calculus, 1 86–1 87 mathematical models in, 3 40
long term memory. See LTM object based inferences in, 3 40–3 42
Lorge-Thorndike Verbal Intelligence test, 3 5 5 semantic alignment in, 3 41
loss aversion, 245 , 247–249 semantic symmetry in, 3 41
endowment effects from, 247 subgoals generation in, 3 40
stability as result of, 248 mathematics
trade reluctance from, 248 arithmetic as part of, 5 60
lowest common multiple. See LCM formalist perspective of, 5 5 9
LTM (long term memory). See also memory mental magnitudes and, 5 60, 5 62
conjunctive coding for, 84 numbers systems and, 5 60, 5 62
in LISA, 86 objective magnitudes and, 5 60
proportion in, 5 61
MACFAC (“Many are Called but Few Are Chosen”) Pythagorean formula, role in, 5 61
model matrix reasoning, 5 91 , 5 93
content vectors in, 1 3 3 aging and, 5 94, 602
SME and, 1 3 3 –1 3 4 tasks, 5 91
magnitudes McClesky v. Kemp, 697–699
answer, 5 74 MDS (multidimensional scaling) models
comparand, 5 73 algorithms, 1 6
mental, 5 60, 5 62 applications of, 1 6
objective, 5 60 city-block metrics within, 1 6
operand, 5 73 , 5 74 compressed representations in, 1 6–1 7
rational numbers and, 5 61 contemporary uses of, 1 6
Weber’s law and, 5 69, 5 75 The Contrast Model within, 20
MAM (Metric Array Module) density as part of, 1 9
LISA and, 1 3 6 Euclidean metrics within, 1 6
“Many are Called but Few Are Chosen.” Hierarchical Cluster Analysis and, 22
See MACFAC hierarchical structures in, 22
mapping input to, 1 5
ACME networks and, 1 3 4 inter-item distances in, 1 9
in analogy, 1 1 7, 1 24–1 27, 41 0, 424, 5 3 1 item bias within, 1 9–20
bidirectional, 5 79, 5 83 output of, 1 5
bistable, 1 26, 1 27 postulated representations in, 22
coherence in, 1 25 –1 27 propositional structures in, 22
cross-dimensional (for heuristics), 272 quantitative representations in, 1 7
in CWSG, 1 28–1 29 of similarity, 1 5
goal-directed, 1 24–1 25 space dimensionality in, 1 9
in LISA, 1 3 5 MDX-2 systems, 73 7
in metaphor, 1 20 mean-ends analysis, 3 28–3 29
relational responses in, 1 27 hill climbing vs., 3 29
in SME, 1 3 2 Tower of Hanoi problem and, 3 28
structure, 1 21 mechanics, 3 87
WM in, 1 27–1 28 medial temporal lobes. See MTL
“match” signals medical education
in thought disorder, 5 05 , 5 07, 5 09 medical reasoning and, 741 –742
matching bias, 1 74–1 76 PBL in, 741
Wason selection tasks and, 1 74–1 76 problem solving and, 741
846 subject index
medical errors distinct primary, 45 7

medical reasoning and, 73 8 domain specificity and, 60–61
mistakes, 73 9 dynamic, 402
slips, 73 9 generic, 62
taxonomy of, 73 8–73 9 implicit cognition in, 43 2
theory of action and, 73 9 in LISA, 86
medical knowledge, 73 4–73 5 LTM storage and, 84
basic scientific, 73 4 multiple component systems of, 47, 45 8
clinical, 73 4 organization within, 42
compiled causal, 73 8 primacy effects for, 45 8
medical reasoning, 727 Quillian-type, 42
artificial intelligence and, 728 recency effects for, 45 7
causal reasoning as part of, 73 6–73 8 secondary components of, 45 7
“cover and differentiate” reasoning models as part of, semantic, 3 9–41 , 62
73 0 separate primary, 45 7–45 8
data-driven reasoning as part of, 73 2 “story,” 1 22
decision research and, 740–741 mental accounting
decision-analytic approach to, 728 in decision making, 25 5 –25 6
“deep” expertise in, 73 7 topicality in, 25 6
directionality in, 727 mental accumulator model
early history of, 727–729 for mental magnitudes, 5 65
errors and, 73 8, 73 9–740 mental logic theories
forward-oriented strategies in, 729, 73 1 –73 2 cognitive neuroscience and, 479
GUIDON consultation systems and, 729 deductive reasoning and, 475 –476
heuristic classification in, 73 0 inferential roles within, 475
horizontal organization in, 73 4 mental model theory vs., 476
hypothesis testing in, 73 1 mental magnitudes, 5 60, 5 62, 5 82
hypothesis-evaluation in, 728 adult number behavior and, role in, 5 83
hypothesis-generation in, 728 for duration, 5 65 , 5 67
hypothetico-deductive reasoning within, 728, 729, mapping between, 5 71
73 1 , 742 mental accumulator model for, 5 65
induction in, 73 1 numerosities and, 5 68
information-processing approach to, 728 for numerosity, 5 65
“intermediate effect” in, 73 5 scalar variability and, 5 64, 5 71 , 5 73 , 5 82
Internist consultation systems and, 729 subjective quantity in, 5 82
knowledge encapsulation in, 73 7 mental model theory (deductive reasoning), 1 72, 1 86,
MDX-2 systems in, 73 7 1 90–1 91
medical education and, 741 –742 analogy and, 1 87
medical knowledge and, 73 4–73 5 artificial intelligence and, 1 87
mental model progression in, 73 7 behavior simulation in, 1 87
models of, 728 categorization and, 1 85 –1 86
MYCIN consultation systems and, 729 cognitive neuroscience and, 478–479
NDM and, 740 compound premises in, 1 96
NEOMYCIN consultation systems and, 729 deduction strategies in, 1 96
nonmonotonicity in, 73 5 development of, 203
PIP and, 729 diagrammatic systems within, 1 86
QSIM systems, 73 7 “double disjunction” in, 1 93
“real-world” clinical tasks and, 727 Euler circles in, 1 90, 1 91 , 1 92
“select and test” models in, 73 0 fault diagnosis and, 1 87
“shallow” expertise in, 73 6 history of, 1 86–1 87
similarity in, 73 3 –73 4 induction cases as part of, 1 86
technology-mediated, 743 inferences in, 1 92, 1 94, 1 95
technology’s effects on, 744 isomorphism hypothesis in, 1 86
“vertical” role in, 73 4 in medical reasoning, 73 7
medical technology mental logic theory vs., 476
EMR, 744 “minimal completion” hypothesis and, 1 94
Euler circles and, 743 modal reasoning and, 1 93
as external representations, 743 Modus Ponens and, 1 73
medical reasoning and, 743 –745 ‘picture’ theory of meaning and, 1 86
memory prediction corroboration for, 1 93
concepts and, 3 9, 40–41 , 48–49 principle of iconicity in, 1 87–1 88
subject index 847
principle of strategic variations, 1 91 –1 92 monotonicity

principle of truth in, 1 90, 1 94, 1 95 in declarative knowledge, 3 86
psycholinguistics and, 1 87 in similarity-based induction, 1 04
semantic information effects in, 1 85 motion
semantics and, 1 72, 1 87 abstract paths for, 21 7
suppositional reasoning for, 1 96 expression, in language, 645 –646
syllogisms in, 1 90 self-propelled, 21 7
tidal predictors and, 1 86 motivated thinking
truth-functional meanings in, 1 88 affective thinking, influence on, 3 1 1
updating in, 1 90 cognitive vs., 296
mental models history of, 295 –296
“runnable,” 228, 3 3 1 “New Look” school of, 296
for visuospatial reasoning, 228 outcome, 296–3 06
Mental Models (Gentner/Stevens), 228 strategy, 3 06–3 1 0
Mental Models (Johnson-Laird), 228 motivations
mental representations, 2 for creativity, 3 5 8
mental scanning tasks, 21 3 –21 4 drives and, 295
mental transformations, 21 3 –21 4 expectancies and, 295
application of, 21 4 spreading activation and, 295
dissociability in, 21 5 MTL (medial temporal lobes)
“move” as part of, 21 4 schizophrenia, role in, 5 1 2, 5 1 3 –5 1 4, 5 1 5
“rotate” as part of, 21 4 structures, role of, 43 9
of self, 21 4–21 5 subliminal perception, role in, 43 9
metaphor Muller v. Oregon, 693
analogy and, 1 20–1 21 multidimensional scaling models. See MDS
mapping in, 1 20 multiple component working memory models,
metonymy in, 1 20 45 8–462
source domain in, 1 20 articulatory loop as part of, 45 9
tenor (target) as part of, 1 20 central executive as part of, 45 8, 460
time and, 1 20–1 21 Embedded-Processes, 462–463
metonymy, 1 20 episodic buffer in, 45 8, 461 –462
metrics four components for, 45 8–45 9
city-block, 1 6 phonological loop in, 45 8
Euclidean, 1 6 slave systems for, 45 9
power, 1 9 task-interference paradigm in, 463 –466
MindStorms (Papert), 721 visuospatial sketchpad in, 45 8, 45 9–460
“minimal completion” hypothesis, 1 94 multiple intelligences theory, 762–763
mirror image transformations, 26 cognitive modularity as part of, 762–763
mistakes (medical errors), 73 9 multiple systems
action specification, 740 in categorization (conceptual), 46–47
evaluation, 73 9 for memory, 47
execution, 73 9, 740 multiplicative binding schemes, 82
goal, 73 9 MYCIN consultation systems, 729
heuristics and, 73 9
intention, 73 9 naı̈ve biology domain, 61
medical reasoning and, 73 9–740 domain specificity and, 5 9
perception, 740 naı̈ve law-school domain, 61
procedural, 740 naı̈ve physics domain, 61
modality specificity domain specificity and, 5 9
in Embedded-Processes memory model, naı̈ve theory (inductive reasoning), 1 08–1 09
463 age-based knowledge systems and, 1 09
for WM, 467, 470 human bias as part of, 1 09
Modified Digit Span task. See MODS naming effects, 1 05 –1 06
MODS (Modified Digit Span task) in similarity-based induction, 1 05
in ACT-R, 41 3 “Natural Kinds” (Quine), 98
sample trial for, 41 3 naturalistic decision making. See NDM
Modus Ponens, 1 71 –1 72, 1 78, 1 79 The Nature of Explanation (Craik), 1 86
mental model theory and, 1 73 The Nature of the Judicial Process (Cardozo),
Modus Tollens, 1 71 , 1 72, 1 73 , 1 74–1 75 , 1 78 686
material implications in, 1 72 n-back task, 466
truth table analysis of, 1 72 NDM (naturalistic decision making), 740
848 subject index
negative conclusion bias, 1 74–1 75 physical event judgment for, 623 –625
double negation effects in, 1 74–1 76 quantitative reasoning in, 61 6–61 9
explicit negation as part of, 1 75 –1 77 relational reasoning in, 609–61 1
implicit negation as part of, 1 75 –1 77 rotational displacement and, 609
RAA reasoning and, 1 74–1 76 sameness-difference comprehension in, 61 0–61 1
négritude shortcut use by, 608–609
cultural thought and, 666 social reasoning for, 625 –626
neologism spatial reasoning in, 608–609, 61 2
thought disorder and, 495 stick and hook tasks for, 621
NEOMYCIN consultation systems, 729 summation tasks for, 61 7–61 8
Neo-Piagetian school (cognitive development) support problems and, 620–621
information processing theories and, 5 3 1 “theory of mind” and, 626
neural net models, 5 3 2–5 3 5 tool use by, 61 9–620
backpropagation in, 5 3 3 transitivity in, 61 3 –61 4
balance scale and, 5 3 2 tube and trap problems and, 621 –623
balance state in, 5 3 4 nonmonotonicity
cluster analysis for, 5 3 6 bootstrapping and, 3 89–3 90
in information processing theories, 5 3 2 feature coverage model analysis for, 1 04–1 05
prototype formation in, 5 45 intuitive theory and, 3 87
rules in, 5 3 4 in medical reasoning, 73 5
symbolic processes in, 5 3 5 , 5 3 6 ontological shifts as result of, 3 92
systematicity in, 5 3 6 replacement as part of, 3 90–3 91
three-layered, 5 3 6 in similarity-based induction, 1 04, 1 1 0
neural network models, 5 transfer via analogy in, 3 91 –3 92
categorization in, 46 normal speech production models, 498, 5 07
New Directions in Scientific and Technical Thinking diagram, 496, 498
(Gorman), 708 lemma retrieval in, 496–497
9-dot problem, 23 0, 3 3 1 on neural levels, 498
creativity and, 3 5 6 “rhetorical/semantic/syntactic” system within,
as insight problem, 3 3 1 496
solution for, 3 3 1 thought disorder and, 496–498
“No Peeking Principle,” 5 0 normative system problem (for biases), 1 73 –1 75
concepts and, 5 0 notion of overhypotheses, 1 06
“noble lie” (legal reasoning), 695 Novum Organum (Bacon), 706
non-directional outcome-motivation, 296, 3 00–3 01 number-left procedures, 5 67
accuracy in, 3 01 flash generation in, 5 67
analytic complexity in, effects of, 3 02 variable ratio schedules for, 5 67
attribution, effects on, 3 01 numbers systems (mathematics), 5 60, 5 62
closure in, 3 01 bit pattern symbols in, 5 64
cognition and, need for, 3 1 2 bit patterns within,
evidence evaluation, effects on, 3 01 defining features of, 5 66
fear of invalidity in, 3 01 fractions, 5 60
knowledge activation, effects on, 3 03 irrational, 5 61
recall, effects on, 3 02–3 03 measuring quantities in, 5 61
non-human primates negative, 5 61
analogical reasoning in, 61 1 –61 3 notation system for, 74, 5 62
causal reasoning for, 61 9–625 rational, 5 60
Cognitive Revolution, role in, 5 93 real, 5 62
conjunctive negation in, 61 5 –61 6 types of, 5 60–5 62
conservation experiments for, 61 8–61 9 numerical estimation (animals), 5 63 , 5 64
counting ability in, 61 6–61 7 key-tapping paradigms in, 5 73 , 5 74
detour use by, 608 in mathematical cognition, 5 62–5 64
DMTS tasks and, 61 2 multiplying rates in, 5 68
identity tasks and, 61 0 nonverbal arithmetic reasoning and, 5 72–5 74
inferential reasoning in, 61 3 –61 6 nonverbal counting, 5 71 –5 72
least-distance strategies in, 609 number-left procedures in, 5 67
moving object search by, 609 numerosity in, 5 64, 5 66
object permanence tasks by, 609 rate of reward in, 5 68
oddity conceptualization in, 61 0 reaction time/accuracy in, 5 70
ordinality for, 61 4–61 5 subtracting durations for, 5 66–5 67
perceptual strategy use by, 61 9 time-left procedures in, 5 66–5 67
subject index 849
numerosities permission schemas, 1 77

adding, 5 66 PET (positron emission topography)
difference judging experiments with, 5 76 intelligence and, 75 9–760
discrimination experiments with, 5 76 in scientific reasoning, 71 6
division of, 5 67–5 68, 5 91 –5 93 phase shift transformations, 27
dot, 5 77 Philosophy for Children program (thinking), 778, 779,
dot-arrays and, 5 72 787
estimating mechanism for, 5 77 PMI operation as result of, 778
language and, 649–65 1 phonemes, 63 8
map, 5 82 phonological loop
mental magnitudes and, 5 68, 5 69 articulatory suppression in, 45 9
nonverbal estimated, 5 72 in multiple working memory models, 45 8
in numerical estimation (animals), 5 64 phonological store within, 45 9
objective, 5 76 role of, 45 9
ordering, 5 68–5 69 ‘picture’ theory of meaning, 1 86
rate of reward and, 5 68 PIP (Present Illness Program), 729
symbolic distance/size effect and, 5 47, 5 69–5 71 plans (declarative knowledge), 3 75
tone sequences for, 5 72 polysemy, 5 3 –5 4
linguistic differentiation within, 5 4
Odyssey program (thinking), 779, 781 typicality structures in, 5 3
“On Problem Solving” (monograph) (Duncker), 3 24 Popular Science, 706
On Scientific Thinking (Doherty/Mynatt/Tweney), 708, positive self-evaluation
71 0 within directional-outcome motivation, 298,
“one-back” continuous performance task, 5 01 3 00
ontology individualistic cultures and, 3 1 2
language’s influence on, 643 strategies for, 3 00
organizational models positron emission topography. See PET
horizontal, 73 4 Posner’s trains-and-bird problem, 3 22, 3 25
of memory, 3 9–40 alternative representations for, 3 22
outcome-motivated thinking, 296–3 06 power metrics, 1 9
directional, 296, 297 Power PC theory
limits to, 3 03 –3 05 Boyle’s law and, 1 5 3
non-directional, 296, 3 00–3 01 in causal learning, 1 5 3 , 1 5 4, 1 5 5 , 1 60
overshadowing, 1 48 ceiling effects and, 1 5 5
deviations from, 1 5 8, 1 5 9
patterns flexibility of, 1 60
in connectionist representations, 78 PPP (Premise Probability Principle)
explanation, 3 75 in scientific methodology, 1 08
matching (production rules), 402 pragmatic reasoning schemas, 1 73
transmission (schizophrenia), 5 1 1 in Deontic Selection Task, 1 77
PBL (problem-based learning), 741 predicate calculus (logic system), 1 86,
backward-directed reasoning modes and, 1 87
742 predicates
reasoning errors as result of, 742 ACME matching, 1 3 4
Peak/End Rule, 284 in category structure, 1 00
application of, 284 inductive reasoning, role in, 99
People Pieces analogy task, 465 in similarity-based induction, 1 03
perception prediction
in causal learning, 1 45 , 1 62 in causal learning, 1 44, 1 5 1 , 1 5 2
constraints in, 1 5 for concepts, 3 8
constructive, 23 1 predictive learning
functions in, 1 5 causal learning vs., 1 47
language’s effect on, 63 8 preferences
mistakes (medical errors), 740 in strategy-motivated thinking, 3 09–3 1 0
subliminal, 43 7–440 Premise Probability Principle. See PPP
perceptual knowledge Present Illness Program. See PIP
conceptual vs., 1 07 prevention focus
in similarity-based induction, 1 07 promotion focus vs., 3 07
perceptual representation systems, 443 –445 in regulatory focus theory, 3 06
context effects in, 444 subtractive counterfactuals and, 3 08
task demands in, 444 primacy effects, 45 8
850 subject index
primary memory mathematical, 3 3 8–3 42

distinct, 45 7 mean-ends analysis in, 3 28–3 29
“focus of attention” vs., 462 medical education and, 741
separate, 45 7–45 8 in medical reasoning, 728
priming object based inferences in, 3 3 2–3 3 3
in frame of mind, 25 7 paired, 780
momentary, 25 7 Posner’s trains-and-bird problem and, 3 22
principle of equiprobability, 1 97 problem spaces in, 3 26–3 27
sentential reasoning models in, 1 97 production rules in, 424
principle of iconicity (mental model theory) progress monitoring theory in, 3 44
exclusive disjunction table for, 1 88 reasoning and, 2
formal logic in, 1 88 representational change theory and, 3 44
fully explicit possibility models in, 1 88, 1 89 representations as part of, 3 3 0–3 3 1 , 3 3 5
in mental model theory, 1 87–1 88 scientific thinking as, 708–709
sentential reasoning in, 1 88, 1 90 solution comprehension in, 3 3 8
principle of “indifference” solver knowledge, role in, 3 3 3 –3 3 4
in probabilistic reasoning, 1 97 story content in, 3 3 2–3 3 3
principle of least commitment subgoals in, 3 29
in similarity, 1 5 Tower of Hanoi problem and, 3 22, 3 25 , 3 29
principle of modulation, 1 99 transfer variant in, 3 3 3
principle of strategic variations Problem Solving and Comprehension (Whimby), 779
in mental model theory, 1 91 –1 92 problem spaces
principle of truth (mental model theory), 1 90, 1 94, 1 95 computer simulations for, 3 27
in deductive reasoning, 1 72 current knowledge state in, 3 26
mental footnotes as part of, 1 90 in problem solving, 3 26–3 27
probabilistic reasoning, 1 97 in scientific thinking, 708, 709
extensional reasoning and, 1 97, 1 98 searchs for, 3 27
intensional reasoning and, 1 97 think-aloud protocols, 3 27
principle of equiprobability, 1 97 in Tower of Hanoi problem, 3 26
principle of “indifference” in, 1 97 problem-based learning. See PBL
subset principle in, 1 98 procedural knowledge, 3 71 –3 72
probability acquisition of, 3 72
conditional, 1 01 process-dissociation procedures, 43 8
extensional representations of, 1 02 production rules, 401
in prospect theory, 245 abstract, 403
problem representations, 3 3 0, 3 3 5 in ACT-R, 405
expertise and, 3 3 6 asymmetry in, 403
four components of, 3 3 0–3 3 1 in EPIC, 41 2
functional fixedness in, 3 3 2 illustrative examples of, 401 , 402, 403
runnable mental models as, 3 3 1 modularity in, 402–403
problem schemas in problem solving, 424
in analogy, 1 3 0 in Soar, 405
problem solving verbalization of, 403
algorithmic solution strategies in, 3 25 production systems (thinking), 4
analogous experience and, role in, 3 3 5 abstract, 403
analogy and, 1 22 action execution in, 402
change variant in, 3 3 3 ACT-R, 404
complex learning and, 6 AMBR and, 425
context in, 3 3 1 –3 3 2 analogy and, 402, 409–41 2
convergence solution in, 3 3 4 asymmetry in, 403
expertise, role in, 3 3 6–3 3 8 background on, 401 –404
general memory schemas and, 3 3 5 –3 3 6 categorization, 41 5 –41 7
Gestalt psychology and, 3 24 choice in, 406–408
goal state in, 3 22 conflict resolution in, 402
GPS in, 3 24, 3 29 conflict sets in, 402
heuristic solution strategies in, 3 26 dynamic memory in, 402
hill climbing, 3 27–3 28 EPIC, 404
history of, 3 24–3 25 4-CAPS, 404
Hobbits and Orcs problem and, 3 27, 3 28 future applications for, 425 –426
ill-defined problems in, 3 3 0 hybrid view of, 404
initial state in, 3 22 knowledge content in, 403
subject index 851
language learning, 421 –423 neuroimaging studies for, 1 1 9

modularity in, 402–403 neurophysical studies for, 1 1 9
noise processes, role in, 404 RPM tests in, 1 1 8, 1 1 9
pattern matching in, 402 Pythagorean formula, 5 61
problem-solving choice in, 408–409
rules within, 401 , 402 QSIM systems, 73 7
skill learning, 41 8–420, 421 Q-SOAR model
Soar, 404 chunking in, 5 3 2
verbalization in, 403 in conservation, 5 44
WM, 41 2–41 4 information processing theories and, 5 3 2
Productive Thinking (Wertheimer), 3 24, 706 Quillian-type memory organization, 42
progress monitoring theory, 3 44
representation change theory vs., 3 44 RAA (Reductio ad Absurdum), 1 72
Progressive Education Association, 720 in negative conclusion bias, 1 74–1 76
projectibility random number generation
in inductive reasoning, 97 syllogisms, effect on, 464
problems with, 97 in task-interference paradigm,
promotion focus RAT (Remote Associates Test), 3 5 5
additive counterfactuals and, 3 08 rational numbers (mathematical cognition), 5 60
prevention focus vs., 3 07 magnitude measurement and, 5 61
in regulatory focus theory, 3 06 rational theory of choice, 243 , 244
propositional notation, 74 preferences in, 244
prospect theory (decision making), 244–246 rationality concepts
concave utility functions in, 245 coherence, 277
insurance and, 245 Raven’s Progressive Matrices. See RPM
loss aversion in, 245 reasoning
probabilities in, 245 abductive, 73 0
risk aversion in, 244 aging, effects on, 5 67–5 68, 5 90, 5 91 –5 93
risk seeking in, 245 analogical (non-human primates), 61 1 –61 3
value function of, 245 analytical tasks for, 5 91
prototype heuristics, 281 –287 in categorization, 48
base rate neglect in, 282 causal (non-human primates), 61 9–625
duration neglect and, 282 cognitive development and, 5 41 –5 43
extensional attributes and, 282 collective cultural activities and, 626
scope neglect in, 282 decision making and, 2, 25 1 –25 2
prototype models (categorization), 44–45 deductive, 2, 1 69
differential forgetting in, 45 emotions and, 3 1 2
distortion within, 44, 45 functional anatomy of, 479–481
psychodynamic psychology given information in, 209–21 0
creativity and, approaches to, 3 5 3 –3 5 4 horizontal, 73 4
methodology of, 3 5 4 hypothetico-deductive, 728, 729, 742
psychological essentialism, 5 6–5 7 inductive, 2, 95 , 96
in cultural thought, 674 inferential (non-human primates), 61 3 –61 6
evidence for, 5 6 integrative, 5 91 , 5 93 , 5 94
family-resemblance categories within, 5 7 intensional, 1 97
minimalist alternativism in, 5 7 legal, 685
restrictions within, 5 8 matrix tasks for, 5 91 , 5 93
“psycho-logics,” 5 29 medical, 727
psychology probabilistic, 1 97
clinical, 4 problem solving within, 2
Gestalt, 3 with quantifiers, 1 97
similarity assessments within, 1 3 quantitative (non-human primates), 61 6–61 9
of thinking, 3 reflective, 96
psychometaphysics, 3 8, 5 8, 63 relational, 1 97, 609–61 1
concepts and, 5 5 representations in, 21 0
psychometric tradition (analogy), 1 1 8–1 20 scientific, 696–700
computational models for, 1 1 9 sentential, 1 88
in creativity, 3 5 4–3 5 6 series completion tasks for, 5 91
crystallized intelligence in, 1 1 8 Shipley Abstraction series test for, 5 92
fluid intelligence in, 1 1 8 social (non-human primates), 625 –626
4-term analogies in, 1 1 8, 1 1 9 spatial (non-human primates), 608–609, 61 2
852 subject index
reasoning (cont.) representation systems

specific-to-specific, 73 3 compressed, 1 6–1 7
suppositional, 1 71 computer languages (programming), 74
syntactic approach to, 1 71 conjunctive features and, 23
thinking and, 2 distortions program and, 221 –222
transitive, 484 Gestalt principles of perceptual organization, 21 0
varieties of, 6 hierarchical, 22
visuospatial, 209, 21 0 internal, 2
WAIS III, 5 90 labeled graphs, 74
reasoning rationality, 277 mathematical notation, 74
“four card” problem and, 277 mental, 2
reasoning tasks perceptual, 443 –445
for AG, 43 4 postulated, 22
analytical, 5 91 problem, 3 3 0, 3 3 5
for concept of mind, 5 47 propositional notation, 74
divergent thinking, 3 5 4 quantitative, 1 7
DMTS, 61 2 relational, 74–75
DMTS (non-human primates), 61 2 simple features and, 23
false-belief, 5 47, 673 transformations, effect on, 21 0
identity (non-human primates), 61 0 visuospatial, 21 0–21 1
matrix, 5 91 , 5 93 representational change theory, 3 44
mental scanning, 21 3 –21 4 chunk decomposition in, 3 44
object permanence (non-human primates), 609 constraint relaxation in, 3 44
series completion, 5 91 , 5 94 progress monitoring theory vs., 3 44
stick and hook (non-human primates), 621 representativeness (in heuristics), 274–276
summation (non-human primates), 61 7–61 8 computations for, 288
visuospatial, 21 9, 227 conjunction items in, 276
Wason selection, 1 74–1 76 controversy over, 276–281
recency effects, 45 7 elicited, 274, 275 , 276
Reciprocal Teaching (Brown/Palincsar), 779, group participants and, 274
781 reversal transformations, 27
Reciprocal Teaching program, 780 risk
Reductio ad Absurdum. See RAA aversion, 244
regulatory focus theory decision framing, 246–247
counterfactual thinking in, 3 08 seeking, 245
prevention focus as part of, 3 06 “The Road Not Taken” (Frost), 1 20
promotion focus as part of, 3 06 role-filler binding
in strategy-motivated thinking, 3 06 LISA and, 86
relational complexity metric, 5 3 9, 5 48 by synchrony of firing, 84
CCC theory and, 5 3 9 tensor products and, 82, 83
conceptual chunking within, 5 3 9 by vector addition, 83 –87
segmentation in, 5 3 9 roles (analogy), 41 0
relational reasoning, 73 –76 Romeo and Juliet (Shakespeare), 1 23
arguments within, 87–88 route perspective, 220
childhood development and, 73 in visuospatial reasoning, 220
flexibility within, 75 RPM (Ravens Progressive Matrices) task
generalization within, 77 WM and, 467, 468
manipulation of, 73 performance graphics for, 1 1 9
semantics within, 75 –76 for psychometric tradition (analogy), 1 1 8, 1 1 9
symbolic representations within, 74–75 rules-plus-exception model. See RULEX
systematicity in, 74 RULEX (rules-plus-exception) model, 41 6
release-from-overshadowing condition, in ACT-R production system, 41 6
1 5 5 –1 5 6
religion SAA (symbol-argument-argument) notation
cultural thought and, factor in, 670 argument binding in, 76
Remote Associates Test. See RAT connectionist criticism of, 78
replacement systems connectionist representations vs., 78, 86
bottom-up, 3 90–3 91 external structure use in, 78
evolution theory and, 3 91 ill-typed, 78
in nonmonotonicity, 3 90–3 91 implicit role information in, 77
top-down, 3 91 isomorphism in, 76, 82
subject index 853
limits of, 77–78 conceptual changes in, 71 5 –71 6

schemes in, 76 “confirmation bias” in, 709
semantics in, 77 deductive reasoning as part of, 71 2–71 3
SME and, 88 empirical testing for, 696–697
as symbolic relational representation, 76–78 ERP use in, 71 6
tensor products and, 82, 83 experiment spaces as part of, 708
SAT (Scholastic Aptitude Test), 3 5 5 fMRI use and, 71 6
scalar variability Gestalt psychology and, 706
mental magnitudes and, 5 64, 5 71 , 5 73 , hemispheric differences in, 71 8
5 82 history of, 705 –708
patterns in, 5 72 hypothesis spaces as part of, 708
Weber’s law vs., 5 82 hypothesis testing as part of, 707, 709–71 0
schemas “Impetus” theory in, 71 5
convergence, 1 3 0 inductive reasoning as part of, 71 2
in declarative knowledge, 3 74–3 75 legal reasoning vs., 696–700
permission, 1 77, 485 Novum Organum (Bacon) and, 706
pragmatic reasoning, 1 73 PET use for, 71 6
problem, 1 3 0 as problem solving, 708–709
spatial diagram, 3 3 5 science education and, 71 9–721
transfer processes for, 3 91 statistical data’s role in, 71 0
schizophrenia, 494 unexpected findings and, 71 0, 71 1
in Cohen & Braver’s model (thought disorder), scope neglect (prototype heuristics), 282
5 00–5 03 Bayes rule application in, 283
contextual bias in, 5 1 0 CVM, 282
dopamine receptor binding and, 5 1 2 in WTP, 282–283
fetal hypoxia and, 5 1 4 scripts (declarative knowledge)
genetic epidemiology of, 5 1 1 in “theory of mind,” 671
genetic factors for, 5 1 3 “select and test” models (medical reasoning),
hippocampus abnormalities and, 5 1 3 , 5 1 4 73 0
interpersonal deficits in, 5 1 6 four inference types in, 73 0
long-term memory and, effects on, 5 1 2 semantic alignment
MTL structures and, 5 1 2, 5 1 3 –5 1 4, 5 1 5 isomorphic problems and, 3 41
neural system abnormalities in, 5 1 1 –5 1 2 in mathematical problem solving, 3 41
prefrontal cortex and, 5 1 1 , 5 1 2–5 1 3 semantic framing, 249. See also attribute framing
thought disorder and, 494, 495 semantic memory, 3 9–41 , 62. See also memory
transmission patterns in, 5 1 1 domain specificity and, 60
verbal communication and, 5 1 6 fragmentation of, 41 –44
verbal declarative memory deficits and, 5 1 4 information overlap in, 42
WM and, effects on, 5 1 2 lexical decision priming, 42
WMS and, 5 1 3 natural concepts and, 40–41
Scholastic Aptitude Test. See SAT organizational models, 3 9–40
science education “semantic memory marriage,” 3 9
history of, 71 9–720 semantics, 42–44
inquiry-based approach to, 720 comprehension within, 5 2
scientific reasoning and, 71 9–721 in connectionist representations, 78
scientific methodology (inductive reasoning), 1 02, in declarative knowledge, 3 73
1 07–1 1 1 fuzzy set theory within, 43
Bayesian models as part of, 1 1 0–1 1 1 latent analysis of, 5 2
computational models, 1 09–1 1 1 linguistic sentence meaning in, 43
diversity principle as part of, 1 08 mental model theory and, 1 85 , 1 87
hypothesis evaluation in, 1 1 0 phrase interpretation in, 5 2
induction rules within, 1 07–1 08 in relational representations, 75 –76
naı̈ve theory as part of, 1 08–1 09 in SAA, 77
PPP in, 1 08 tensor products and, 82
prior beliefs as part of, 1 1 0 sentential reasoning
variability/centrality as part of, 1 07–1 08 fully explicit models for, 1 88, 1 89, 1 90,
scientific reasoning 1 99
analogy use in, 71 3 –71 4 in principle of iconicity, 1 88
brain analysis and, 71 6–71 9 series completion tasks, 5 91
causal thinking as part of, 71 0–71 2 aging and, 5 94
computational models for, 71 9 Shipley Abstraction series test, 5 92
854 subject index
short-term memory skill learning (production system), 41 8–420, 421

capacity of, 45 8 ACT-R and, 41 9
SIAM model (alignment-based), 24, 25 composition and, 41 9
similarity fact representation in, 41 9, 420
alignment-based models of, 24–26 proceduralization and, 41 9
assessments of, 1 3 , 3 0–3 1 Soar and, 41 8
asymmetry in, 1 8, 20 slave systems
automatic assessments of, 3 0 in multiple working memory models, 45 9
in cognition, 1 4 slips (medical errors), 73 9
comparative analysis within, 27 evaluation, 73 9
contrast models of, 99 execution, 73 9
diagnosticity effects in, 29 SME (Structure Mapping Engine)
dissociations within, 28 algorithm in, 1 3 2–1 3 3
as domain-general information source, as analogy model, 1 3 2–1 3 4
14 CWSG as part of, 1 3 2
in entrenchment theory, 98 “deep” mapping in, 1 3 2
expertise and, 29 “kernels” as part of, 1 3 2
extension effects in, 29 “local-to-global” direction in, 1 3 2
featural models of, 1 7–24 MACFAC and, 1 3 3 –1 3 4
feature learning and, 48 predicate-calculus notation as part of, 1 3 2
flexibility of, 1 5 , 29–3 0 in SAA notation, 88
“generic” assessments of, 3 0 Soar (production system), 404
geometric models of, 1 5 –1 7 chunking in, 41 9
inclusion, 1 05 EPIC combination with, 425
induction, 1 02–1 07 limitations for, 41 2
inference from, 1 3 production-rule learning in, 405
judgments in, 29 serial processing in, 405
limitations of, 1 4 skill learning and, 41 8
mandatory considerations of, 29 WM in, 41 3
MDS models of, 1 5 social intelligence, 75 2
in medical reasoning, 73 3 –73 4 “Sociological Jurisprudence,” 690
mental entity structures within, 1 5 Socratic dialogue, 775
perceptual constraints and, 1 5 SOI (structure-of-intellect) model (intelligence), 75 4,
practical applications of, 1 5 782
principle of least commitment in, 1 5 sortalism, 5 7–5 8
psychological assessments of, 1 3 developmental psychology, role in, 5 7
reasonable expectation and, 1 3 identity-lending categories within, 5 7
Standard Geometric Models of, 1 8 source analogs, 1 1 7, 1 22, 71 4
transformational models of, 26–28 spatial diagram schemas, 3 3 5
similarity-based induction, 1 02–1 07 generality level of, 3 3 5
age-based sensitivity in, 1 03 matrices, 3 3 5
asymmetry in, 1 03 spatial framework tasks
basic level bias in, 1 07 theory for, 21 9
category coverage as part of, 1 03 in visuospatial reasoning, 21 8
diversity as part of, 1 03 spatial tapping, 465
feature exclusion in, 1 04 spatter codes, 81 , 83
folk-generic levels in, 1 06 speech disorders
inclusion fallacy and, 1 05 anomic aphasia, 499
inclusion similarity in, 1 05 programming deficit, 499
inference in, 1 06 thought disorder vs., 498–499
monotonicity/nonmonotonicity in, spreading activation
1 04 attributes and, 249
naming effects as part of, 1 05 in implicit cognition, 445
notion of overhypotheses, 1 06 motivations and, 295
predicates as part of, 1 03 standard economic model (decision making). See
preferred levels in, 1 06 rational theory of choice
similarity in, 1 02 Stanford-Binet test, 75 3
typicality in, 1 02–1 03 status quo bias, 248–249
sketches Story Gestalt model (story comprehension), 79
constructive perception and, 23 1 testing for, 79
inferences from, 23 1 “story memory,” 1 22
subject index 855
strategy-motivated thinking, 3 06–3 1 0 generalization within, 77

accuracy vs. speed in, 3 08–3 09 SAA, 76–78
alternative hypothesis and, effects on, 3 06–3 07 symmetry
counterfactual thinking, effects on, 3 07–3 08 in distortions program, 222
eager vs. vigilant, 3 07, 3 09 in mathematical problem solving, 3 41
future applications for, 3 1 0–3 1 1 in Standard Geometric Models (for similarity), 1 8
preferences in, 3 09–3 1 0 synchrony of firing, 84
preferred, 3 06 in frontal cortex, 84
recall and, effects on, 3 09 limitations of, 84
regulatory fit in, 3 09 in primate visual cortex, 84
regulatory focus theory in, 3 06 synectics, 3 5 3
Stroop task, 272 System 1 (dual process theory), 1 80, 267
schizophrenia and, 5 01 impression of distance in, 269
Structure Matching Engine. See SME judgment problems and, 267
The Structure of Scientific Revolutions (Kuhn), System 2 (dual process theory), 1 80, 267
3 80 additive extension effect in, 280, 282
subliminal perception, 43 7–440 attention manipulation in, 279–280
abstract representation in, 43 8 attribute substitution in, 273
brain regions for, 43 7 frequency format in, 279
emotional preferences, effects on, 43 7 intelligence and, 278–279
“mere exposure” effect in, 43 8 intuitive judgments within, 273
MTL and, 43 9 judgments of distance in, 269
process-dissociation procedures in, 43 8 proposal quality and, 267
semantic priming as result of, 43 7 statistical sophistication, effect on, 278
subset principle, 1 98 Stroop task for, 272
subsymbolism time pressure effects on, 273
in ACT-R, 406, 408, 422 within-subjects factorial designs in, 280–281
constituents within, 406 systematic transformations, 2
proportion exemplar model for, 41 7, 41 8 systematicity
“superposition catastrophe,” 83 in neural net models, 5 3 6
effects of, 83 in relational thinking, 74
Supervisory Attentional System, 460–461 three-layered nets and, 5 3 6
capacity limits for, 461
Contention Scheduler as part of, 460 target analogs, 1 1 7, 71 4
schemata in, 460 stories as part of, 1 23 –1 24
suppositional reasoning, 1 71 task-interference paradigm, 463 –466
Conditional Proof, 1 71 analogy and, 465
for mental model theory, 1 96 articulatory suppression effects within, 465
Modus Tollens, 1 71 memory loads for, 463
RAA and, 1 72 random number generation in,
Sure Thing Principle, 25 1 spatial tapping, effects of, 465
survey perspective, 220 verbal syllogisms in, 464
in visuospatial reasoning, 220 temporal contiguity
syllogisms in associative learning theory, 1 47
in deductive reasoning, 1 69, 71 3 in causal learning, 1 47
figures as part of, 1 70 temporal discounting
four statements of, 1 70–1 71 in decision making, 25 6–25 7
intelligence and, 75 7 excessive, 25 7
in mental model theory, 1 90 tensor products, 81
model adjustment for, 1 91 definition of, 81 –83
moods as part of, 1 70 limits of, 82
random number generation’s effect on, 464 multiplicative binding schemes and, 82
in task-interference paradigm, 464 relational binding representation and, 82
visuospatial sketchpad and, 464 relational generalization support from, 82
symbol-argument-argument notation. See SAA role-filler binding, 82, 83
symbolic distance effect, 225 SAA-isomorphic, 82, 83
symbolic distance/size effect, 5 47, 5 69–5 71 semantic relation content and, 82
symbolic relational representations, 74–75 . See also Terman Concept Mastery Test, 3 5 4
relational reasoning theory of “g” (intelligence), 75 4, 782
conjunctive connectionist, 81 –83 correlate eduction in, 75 4
connectionist (traditional), 78–81 experience apprehension in, 75 4
856 subject index
theory of “g” (cont.) Reciprocal Teaching program for, 780

factor analysis as part of, 75 4 Think (Adams), 779
mental energy in, 75 4 Think and Intuitive Math programs, 779, 780
relation eduction in, 75 4 transfer effects in, 780
“theory of mind” The Thinking Classroom (Jay/Perkins/Tishman), 787
concept of mind vs., 5 47 thought disorder, 494
cultural thought and, 670–674 capacity allocation in, 5 08–5 09
in non-human primates, 626 cognitive deficit integration and, 5 09–5 1 0
script knowledge in, 671 Cohen & Braver’s model, 5 00–5 03
Think (Adams), 779 context attention in, 5 07–5 08
thinking, 2. See also reasoning “controlled attention” in, 5 1 6–5 1 7
affective, 3 1 1 definition of, 495 –496
beliefs and, 1 “derailment” in, 496, 5 09
cognitive stages for, 5 29–5 3 0 endophenotype approach to, 5 1 0–5 1 1 , 5 1 4–5 1 5
computer simulations for, 4 environmental factors for, 5 1 1
counterfactual, 298, 3 07–3 08 executive system functioning in, 5 1 4
cultures of, 796 form vs. content in, 5 1 6
definition of, 2, 780–781 Hemsley’s & Gray model, 5 03 –5 06
development of, 5 29 heritability of, 5 1 1
dispositional of, 777, 785 hyper-priming hypothesis and, 5 07
early influences on, 5 29–5 3 1 hypothesized deficits and, 5 06
educational systems’ and, 776 ideational confusion definition in, 496
foresight and, 1 “match” signals in, 5 05 , 5 07, 5 09
Gestalt school and, 5 29 memory retrieval interference as result of, 5 08
groupthink and, 776 negative, 5 09
hierarchy of kinds in, 2 neologism production as result of, 495
high-end, 776 normal bias yielding, 5 08
instruction, 777 normal speech production models and, 496–498
internal representations from, 2 pathology of, 494
judgment and, 1 phonological/phonetic system and, 5 06
lateral, 3 5 2 positive vs. negative, 496
mental representations and, 2 schizophrenia and, 494, 495
modern conceptions of, 4 self-monitoring within, 5 09
motivated, 295 –296 semantic retrieval in, 5 07
norms in, 781 , 782 social cognitive neuroscience applications for, 5 1 6
in practice, 7 speech disorder vs., 498–499
production systems and, 4 symptom taxonomy for, 495
“psycho-logics” and, 5 29 trait deficits of, 5 1 0–5 1 1
psychology of, 3 threshold theory, 3 5 5
reasoning and, 2 time
relational, 73 –76 in metaphor, 1 20–1 21
results attainment for, 777–780 time-left procedures, 5 66–5 67
self-regulation in, 5 29 Torrance Tests of Creative Thinking, 3 5 4
Socratic dialogue and, 775 Tower of Hanoi problem, 3 22, 3 25 , 3 29, 3 3 3
speech and, 641 –642 mean-ends analysis and, 3 28
structure in, 5 29 possible solutions for, 3 22, 3 25 –3 26
systematic transformations and, 2 problem space in, 3 26
theory of action in, 781 Transfer Appropriate Processing, 789
transfer processes in, 777 transfer of learning processes, 788–792
“thinking for speaking,” 65 3 , 65 4 expertise in, 790–791
“thinking hats,” 3 5 3 far, 790
“thinking” instruction, 777 history of, 788–789
CoRT program for, 778, 779, 781 interventions as part of, 789
DAPAR test for, 778 transfer protocols
GE programs for, 778 abstract analogy in, 444
IE programs for, 778 chunk strength in, 444
Odyssey program for, 779, 781 in implicit cognition, 443
paired problem solving in, 780 transformational models (of similarity), 26–27, 28
Philosophy for Children program, 778, 779 generative representation systems in, 26
Problem Solving and Comprehension (Whimby), 779 global consistent correspondences in, 27
Reciprocal Teaching (Brown/Palincsar), 779, 781 Kolmogorov complexity theory within, 27
subject index 857
memorized pictorial descriptions within, 26 “second-order isomorphisms” and, 21 1

mirror image, 26 space and, spontaneous use of, 226–227
phase shift, 27 spatial perception in, 21 6
reversal, 27 spatial visualization in, 21 6
rigid, 26 symbolic distance effect in, 225
stimuli pairs within, 26 transformations in, 21 2–21 6
wave length, 27 updating in, 21 8
transformations visuospatial representations, 21 0–21 1
elementary, 21 5 –21 6 object properties as part of, 21 0, 21 1
mental (objects), 21 3 –21 4 visuospatial sketchpad
in visuospatial reasoning, 21 2–21 5 memory tasks for, 461
transitive inference in multiple working memory models, 45 8, 45 9–460
associative, 61 3 syllogisms and, 464
in non-human primates, 61 3 –61 4 vocabulary
transitive reasoning closed-class functional, 641
dissociation and, 484
triangle inequality assumption, 1 8 WAIS (Weschler Adult Intelligence Scale) III, 5 90
violations in, 1 8 Wason selection tasks
Triarchic theory for Successful Intelligence, 762, cognitive development and, 5 43
763 –764 content effects within, 5 43
experience’s role in, 763 Deontic Selection Task and, 1 74–1 76, 1 78, 1 80
external world relations in, 763 matching bias and, 1 74–1 76
internal world relations in, 763 wave length transformations, 27
“true” intelligence, 764 WCST (Wisconsin Card Sorting Test)
experience as aspect of, 764 aging and, 5 94, 5 95 , 670
neural intelligence as aspect of, 764 feedback-based concept identification in, 5 95
reflective aspect of, 764 parameters for, 5 95
truth tables Weber’s law
fully explicit models vs., 1 90 characteristic for, 5 78
Modus Tollens, analysis of, 1 72 numerical magnitudes and, 5 69, 5 75 , 5 76, 5 83
typicality effects, 44 scalar variability vs., 5 82
in similarity-based induction, 1 02–1 03 Wechsler Memory Scale. See WMS
Weschler Adult Intelligence Scale. See WAIS
unconscious thought. See implicit cognition Weschler Intelligence Scale for Children. See WISC
Unusual Uses Test, 3 5 4 Weschler scale (intelligence testing), 75 3
West Side Story, 1 23
variability Williams v. Florida, 697
in scientific methodology, 1 07–1 08 willingness to pay. See WTP
sensitivity to, 1 08 WISC (Weschler Intelligence Scale for Children), 3 5 5
Variations in Value Orientation (Strodtbeck), 663 , 664 Wisconsin Card Sorting Test. See WCST
veridicality Witherspoon v. Illinois, 697
consistency vs., 3 81 WM (working memory). See also memory
Vision (Marr), 5 in ACT-R, 41 2, 41 3 –41 4, 41 6, 470
visuospatial reasoning, 209, 21 0 aging and, 5 95 –5 97
abstract concepts in, 225 in analogical mapping, 1 27–1 28
analogs in, 225 analogical reasoning and, 1 20
brain pathways and, 21 6 animal behavior and, 470
diagrams and, 21 9, 227 attentional control function in, 467
distortions program and, 221 capacity limits for, 466
framework tasks in, 21 8 in CAPS, 41 2
graphics context and, 225 “Confirmation Bias” limitations in, 71 0
individual differences in, 21 6 definition of, 45 7, 466
judgments and, 221 –224 EBRW and, 41 6
language effects on, 220–221 in EPIC, 41 3
mental models for, 228 fMRI and, 468
mental rotation tasks for, 21 2, 21 6 high fidelity modeling in, 41 4
mental scanning tasks for, 21 3 –21 4 individual differences in, 466–467
narratives in, 21 9 integrative reasoning and, 5 96
perception vs. imagery in, 21 1 –21 6, 21 8 intelligence and, 75 7
reference systems in, 221 in LISA, 86, 1 3 5 , 41 3 , 470
route/survey perspectives as part of, 220 mediational analysis of, 5 98
858 subject index
WM (working memory) (cont.) WMS (Wechsler Memory Scale)

modality specificity for, 467, 470 schizophrenia and, 5 1 3
n-back task and, 466 working memory. See WM
neural basis of, 1 20 WTP (willingness to pay)
prefrontal cortex functioning and, 466 additive extension effect and, 283
in production systems, 41 2–41 4 “add-up” rule in, 283
relational complexity in, 468 categorical prediction similarity and,
RPM task for, 467, 468 283
schizophrenia and, effects on, 5 1 2 in scope neglect, 282–283
in Soar, 41 3
task as part of, 41 2 zone of proximal development
variability in, 424 in cognitive development, 5 3 1

The Cambridge Handbook of Thinking and Reasoning (2005) PDF

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

The Cambridge Handbook of Thinking and Reasoning (2005) PDF

Uploaded by

Copyright:

Available Formats

The Cambridge Handbook of

Thinking and Reasoning

Cambridge University Press

© Cambridge University Press 2005

This book is in copyright. Subject to statutory exception and to the provision of

First published in print format 2005

isbn-13 978-0-511-11330-7 eBook (NetLibrary)

isbn-13 978-0-521-82417-0 hardback

isbn-13 978-0-521-53101-6 paperback

Cambridge University Press has no responsibility for the persistence or accuracy of

1 2 . A Model of Heuristic Judgment 267 p a r t vi

1 6. Complex Declarative Learning 3 71 2 7. Paradigms of Cultural Thought 663

Leib Litman 2 9. Scientific Thinking and

John R. Anderson Marc J. Buehner

Michelene T. H. Chi Rochel Gelman

Charles R. Gallistel E. Tory Higgins

Keith J. Holyoak – editor Marsha C. Lovett

Leib Litman Anna Papafragou

Vimla L. Patel Eldar B. Shafir

David N. Perkins Steven A. Sloman

Arthur S. Reber Robert J. Sternberg

Lance J. Rips Michael Tomasello

Ron Ritchhart Barbara Tversky

Thinking and Reasoning:

“Cogito, ergo sum,” the French philosopher What Is Thinking?

(and this volume), these aspects of thinking eighteenth-century philosophers Immanuel

Chapter 5 The Problem of Induction Chapter 7 Causal Learning

Acknowledgments tive reasoning.” In an old Western movie, a

Introduction From this perspective, psychological as-

represent patterns in a relatively raw, un- A Survey of Major Approaches

metrics to be used. With r = 2, a stan- we cannot simultaneously place Russia near

6 yield the most satisfactory and interpretable

2 1 984), but this is incompatible with sim-

analyses have proliferated in domains of in neural networks normalize dissimilarities

with the clarity of the alignments (Gold-

to be fertile, it will be because it is applica- ify what transformational operations are

image (XXXXXOO → OOXXXXX), stated in the structural alignment method,

belongs to category A because it is similar to grating multiple sources of information into

Similarity as measured by ratings is not coherency of similarity. An alternative per-

Concepts and Categories: Memory,

Introduction Our goal in this chapter is to provide an

understand the principles of conceptual com- A Minihistory

Can bite Is pink

Figure 3.1 . A semantic network.

memory organization wise studiously avoided a stand on memory

to embed it in a neural network (Krus- contrary to empirical evidence that it does

Table 3.1 . Some Theories of Concept Combination

a number of encores of this performance, These consequences of sortalism may be

relations depend to some extent on word concepts. We do not recommend a solemn

perspectives (pp. 23 3 –278). Mahwah, NJ: noun phrases. Developmental Psychology, 3 8,

mechanisms on induction, categorization, and development of folkbiological induction. Cog-

Approaches to Modeling Human Mental

Relational Thinking Relational thinking is so commonplace

human mind is the product of a symbol

soundness. Cognitive Psychology, 2 5 , 5 24– reasoning. In Proceedings of the twenty-second

The Problem of Induction

experimental conclusions” (see Goldstone & depend on the environment we experience.

the typicality structure of categories and on arrive at an intermediate level of hierarchical

is the argument. For example, people Therefore, buffaloes have 3 8 chromo-

telligence, metaphor, and the representation

slightly biased summary of events in World

picture to objects in the bottom picture), Inference and Relational Generalization

Figure 6.4. Sequence of diagrams used to convey the convergence schema by