Professional Documents
Culture Documents
I What Is A Diagram and How Does It Function?
I What Is A Diagram and How Does It Function?
May not be reproduced in any form without permission from the publisher, except fair uses permitted under U.S. or applicable copyright law.
EBSCO Publishing : eBook Collection (EBSCOhost) - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS
AN: 1289667 ; Sybille Krmer, Christina Ljungberg.; Thinking with Diagrams : The Semiotic Basis of Human Cognition
Account: ns153208.main.eds
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Michael Marrinan
1 On the “thing-ness” of diagrams
Icons are so completely substituted for their objects as hardly to be distinguished from
them. A diagram, indeed, so far as it has a general signification, is not a pure icon; but
in the middle part of our reasonings we forget that abstractness in great measure, and
the diagram is for us the very thing. So in contemplating a painting, there is a moment
when we lose the consciousness that it is not the thing, the distinction of the real and
the copy disappears, and it is for the moment a pure dream – not any particular
existence, and yet not general. At that moment we are contemplating an icon.
– Charles Sanders Peirce (W 5: 163)
Any art historian interested in looking at pictures must attend to this passage in
which a founding figure of the theory of signs – most commonly recognized
in the triad Icon/Index/Symbol – reflects upon the act of looking at a painting.
An art historian also interested in diagrams cannot afford to ignore the way
Peirce couples this act of contemplation to his account of diagrams (Bender
and Marrinan 2010), especially his claim that for a moment “the diagram is for
us the very thing.” In this moment, when the distinction between “the real and
the copy disappears,” we confront the icon. What does it mean for a diagram
to become “the very thing”? How does Peirce explain this transformation? Does
his account square with the way art historians look at pictures? Are there other
ways to think of diagrams as “things” in their own right? These are some of the
questions I will address in this essay.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
24 Michael Marrinan
whose parts shall present a complete analogy with those of the parts of the
objects of reasoning, of experimenting upon this image in the imagination, and
of observing the results so as to discover unnoticed and hidden relations among
the parts” (W 5: 164). At about the same time, he articulates four steps of deduc-
tive reasoning in which diagram construction plays the essential role, noting
that “even though the diagram exists only in the imagination . . . after it has
once been created, though the reasoner has power to change it, he has no power
to make the creation already past and done different from what it is. It is, there-
fore, just as real an object as if drawn on paper” (Peirce 1976a, 4: 275–276). A few
years later, he insists that “it is by icons only that we really reason, and abstract
statements are valueless in reasoning except so far as they aid us to construct
diagrams” (CP 4: 127). Peirce begins “Prolegomena for an Apology to Pragma-
tism” of 1906, his most detailed discussion of diagrams, by stating: “All neces-
sary reasoning is diagrammatic; and the assurance furnished by all other
reasoning must be based on necessary reasoning. In this sense, all reasoning
depends directly or indirectly upon diagrams” (Peirce 1976b, 4: 314). Stjernfelt’s
study of this essential text helps us to isolate the principal ideas underpinning
Peirce’s account of diagrammatic reasoning.
Most important, Peirce insists the diagram “is an Icon of a set of rationally
related objects” so that it is amenable to rational experiments and compatible
with reasoning (Peirce 1976b, 4: 316). Stjernfelt points out that “as soon as an
icon is contemplated as a whole consisting of interrelated parts whose relations
are subject to experimental change, we are operating on a diagram” (Stjernfelt
2007: 92). Peirce writes in an alternate version of his text that “the Diagram
represents a definite Form of Relation. This Relation is usually one that actually
exists,” although that condition is not “essential to the Diagram as such” (Peirce
1976b, 4: 316). He suggests that “the pure Diagram is designed to represent and
to render intelligible, the Form of Relation merely” (Peirce 1976b, 4: 316). Peirce
wrestles with the problem of how to align mental apprehension with the fact
that “Diagrams remain in the field of perception and imagination”; he settles
on a two-pronged formula: the drawn diagram and its universal signification
“taken together constitute what we shall not too much wrench Kant’s term in
calling a Schema, which is on one side an object capable of being observed
while on the other side it is General” (Peirce 1976b, 4: 318; see also Stjernfelt
2007: 82–83 and 94–95). On Peirce’s account, pure diagrams – comprised only
of rational relations – depend upon what Stjernfelt calls “applied” or “token
diagrams” to communicate their ideal entity (Stjernfelt 2007: 96).1 They do so
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 25
2 Updated and revised in Blackwell and Englehardt 2002. Also see Schmidt-Burkhardt 2012: 41–
68.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
26 Michael Marrinan
3 The variant of this passage reads: “But the action of the Diagram does not stop there. It
has the same percussive effect on the Interpreter that any other Experience has. It does not
stimulate any immediate counter-action, nor does it, in its function as a Diagram, contribute
particularly to any expectation. As Diagram, it excites curiosity as to the effect of a transforma-
tion of it” (Peirce 1976b, 4: 317).
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 27
. . . and one at once prescinds from the accidental characters that have no significance.
They disappear altogether from one’s understanding of the Diagram . . . one can contem-
plate the Diagram and perceive that it has certain features which would always belong
to it however its insignificant features might be changed. What is true of the geometrical
diagram drawn on paper would be equally true of the same Diagram put on the black-
board (Peirce 1976b, 4: 317, note).
Logic requires great subtlety of thought, throughout; and especially in distinguishing those
characters which belong to the diagram with which one works, but which are not signifi-
cant features of it considered as the Diagram it is taken for, from those that testify to the
Form represented. For not only may a Diagram have features that are not significant at all,
such as being drawn upon “laid” or upon “wove” paper; not only may it have features that
are significant but not diagrammatically so; but one and the same construction may be,
when regarded in two different ways, two altogether different diagrams (Peirce 1976b, 4:
324).
4 Scribing has a particular meaning for Peirce (1933b, 4: 397): “The graphist may place replicas
of graphs upon the sheet of assertion; but this act, called scribing a graph on the sheet of
assertion, shall be understood to constitute the assertion of the truth of the graph scribed.”
Also see Peirce 1976b, 4: 322 and Peirce 1933c 4: 418–423.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
28 Michael Marrinan
Figure 9: Pietro Perugino, Christ giving the Keys to Saint Peter, ca 1481–82. Fresco. Rome
(Vatican), Sistine Chapel (330 x 550 cm). Photo credit: Public Domain via Wiki Commons.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 29
figures – they are further away. But actually, of course, they are not; all the
figures lie in the same wall plane at an equal distance from where Peirce is
standing. The apprehension of “depth” belongs to the imaginary moment that
requires the support to disappear. His attention returns to the diagram inscribed
mentally on the surface: Peirce realizes that an imaginary line drawn between
himself and the door to the temple coincides with a large key seen in profile,
which is connected to a second key held by a figure just to the left, and grasped
by the figure to the right. In a second imaginary moment, Peirce eliminates the
support to recognize that the left figure is handing the set of keys to the right
figure – an action that the diagram has helped him to discover. And so on:
a give-and-take process of experiment and discovery, motivated and directed
by the diagram his imagination inscribes invisibly on the surface, but whose
impact can only be assessed when that surface is thought away. In this textbook
case, I draw attention to the polyvalence of the image support that is integral to
the process but whose permutations are largely ignored.
I will complicate the issue a bit by placing our fictive Peirce before a charac-
teristic landscape of Jacob van Ruisdael (Figure 10). A picture like this does not
readily present an armature of perspective like Perugino’s fresco. Ruisdael’s
engagement with an unorthodox representation of space that does not begin by
thinking of the sky as a void has long been recognized. Eugène Fromentin, in a
famous text of nineteenth-century criticism, explains that Ruisdael:
. . . felt things differently and established once and for all a much more daring and true
principle. He thought of the immense vault that rises spherically over the countryside
or sea as if it were the real, dense, solid ceiling of his pictures. He bends it, unfurls it,
measures it, calculates its tonal brightness relative to the accidents of light scattered on
the distant ground; he nuances its large areas, models them in relief, makes them – in a
word – concrete, as if an item of primary interest (Fromentin 1876: 251–252).
For Fromentin, Ruisdael does not privilege the geometry of sight organized around
a fixed point of view essential to the diagram of linear perspective; rather, he
celebrates an embodied, mobile, and time-dependent process of comprehension.
Fromentin suggests that Ruisdael’s concept of a picture is less a window through
which we look onto a parallel world towards infinity – the model of painting
practiced by Perugino – than a continuous, palpable mass manipulated – almost
sculpted – in the process of representation.
Traces of this process are numerous in our example. Note the shared rela-
tionships of shape and mass that connect visually and perceptually the stand
of trees to the formations of dense clouds. Note the three widely-distributed
areas of bright light – hillock and building at center-right, farmhouse at far left,
blasted tree rising from the running water in the lower right – that encourage
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
30 Michael Marrinan
Figure 10: Jacob van Ruisdael, Landscape near Muiderberg, ca 1652. Oil on canvas. Oxford,
Ashmolean Museum, inv. A875 (66 x 75 cm). Photo credit: Ashmolean Museum, Oxford.
our eye to traverse the image laterally in a circular motion. Note the darkened
and clotted center of the image that refuses to provide us with a commanding
view of deep space. We can imagine that our fictive Peirce performs similar per-
ceptual experiments on the picture while trying to determine its diagrammatic
relations.
We might also imagine that he is thrown off at times by the intruding pres-
ence of Ruisdael’s pictorial matter: colored pigment applied in various densities
that present homologies between, for example, the visibly brushed clouds and
the thickly painted stand of trees that meet one another materially on the
surface of the canvas. Ruisdael’s picture will probably disturb the confidence
with which Peirce usually relates form to matter, as when he writes:
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 31
. . . there is a rational explanation of the precedence of Form over Matter in natural classi-
fications. For such classifications are intended to render the composition of the entire
classified collection rationally intelligible, – no matter what else they may be intended to
show; and Form is something that the mind can “take in,” assimilate, and comprehend;
while Matter is always foreign to it, and though recognizable, is incomprehensible (Peirce
1976b, 4: 322).
The same precedence of Form over Matter is seen in the classification of psychical products.
Some of Rafael’s greatest pictures, – the Christ bearing the cross, for example, – are suffused
with a brick red tinge, intended, I doubt not, to correct for the violet blueness of the deep
shade of the chapels in which they were meant to be hung. But who would classify Rafael’s
paintings according to their predominant tinges instead of according to the nature of the
composition, or the stages of Rafael’s development? There is no need of insisting upon a
matter so obvious (Peirce 1976b, 4: 321–322).
Nevermind that art history does, in fact, classify paintings “according to their
predominant tinges,” as when discussing Picasso’s “Blue” or “Rose” periods or
the “sour period” of Renoir. The point to retain from my little detour is that the
thrust of Peirce’s thinking will disincline him to attend to the “recognizable” but
“incomprehensible” manipulations of material that are essential to any paint-
ing, and especially in works like the Ruisdael.
To seek and to isolate “form” in such pictures, to “prescind” much of their
material presence, is how Peirce keeps in line and under control the curiosity
and experiment that constitute the dangers of the “imaginary moment.” The
paradox is that when Peircean prescission takes over, some elements integral
to a configuration like Ruisdael’s are forced to relinquish their meaning, to
become only “background” or “support” for the rest. When writing about
diagrams in general, Peirce both admits and dismisses the difficulty I have in
mind:
It is, however, a very essential feature of the Diagram per se that while it is as a whole an
Icon, it yet contains parts which are capable of being recognized and distinguished by the
affixion to each of a distinct Semantic Index (or Indicatory Seme, if you prefer the phrase).
Letters of the alphabet commonly fulfill this office. How characteristic these Indices are of
the Diagram is shown by the fact that in one form or another they are indispensable to
using the diagram, yet they are seldom wanted for the general enunciation of the proposi-
tion which the Diagram is used for demonstrating (Peirce 1976b, 4: 317).
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
32 Michael Marrinan
then, do these letters exist if they are not part of the image defined by the plane
of material support? Pictures like the Ruisdael accentuate this difficulty because
its indices are not easily isolated letters or numbers, but integral components of
the image itself.
Frederik Stjernfelt recognizes this problem in Peirce’s thought. He proposes
to amend slightly Peirce’s requirement that diagrams be intentional (for example,
the diagram of perspective deployed by Perugino), by suggesting that “as soon
as an icon is contemplated as a whole consisting of interrelated parts whose
relations are subject to experimental change, we are operating on a diagram . . .
as soon as you imagine yourself wandering along the path into the landscape,
you are operating on the icon – but doing so in this way is possible only by
treating it as a diagram” (Stjernfelt 2007: 92 and note; 278–279). Stjernfelt’s
amendment, with which Peirce might well disagree, invests the entire configura-
tion with significance, as if the fact of being a painting pre-empts the prescission
that Peirce takes to be essential. Stjernfelt’s scheme also raises the problem that
looking at pictures is unlike looking at anything else (Stjernfelt 2007: 285). He
apparently embraces this consequence when stating “a (sufficiently complicated)
picture, however, only exists to the extent that it is already a diagram” (Stjernfelt
2007: 285).
Finally, Stjernfelt moves quite far from Peircean foundations when sketching
the process of viewing an abstract picture; he uses the example of Suprematist
Composition: White on White (Figure 11), painted by Kazimir Malevich in 1918, to
remark: “in viewing such a picture, you are forced, as an observer, to an abductive
trial-and-error process, attempting the use of diagrams and consecutive manipu-
lations with no other guaranty than that of perception for one diagram being
better than another” (Stjernfelt 2007: 288). It is not surprising that a long foot-
note to this section of text introduces the problem of phenomenology and the
body. Stjernfelt takes up those issues with a discussion of Husserl in the follow-
ing chapter, where he ultimately posits the viewing of pictures as a form of
“wandering access” that is “highly dependent upon the bouquet of possible
peripathetical [sic] action possibilities determining which kind of access to the
pictorial space is possible“ (Stjernfelt 2007: 310). I will eventually agree with
him in principle, but not because I believe pictures materialize the Peircean
model of diagrams.
I want to introduce a third type of example, one that bears an elliptical rela-
tionship of subject to the Ruisdael and recurs to Peirce’s comments about letters
aiding our use of a diagram. Consider the first plate of Diderot and d’Alembert’s
Encyclopédie, an image devoted to “Agriculture” (Figure 12). The large, upper
image appears to depict space rather like the Ruisdael: a series of planes maps
the distance from the foreground tree at far left to the ruined fortress on the
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 33
Figure 11: Kazimir Malevich, Suprematist Composition: White on White, 1918. Oil on canvas.
New York, The Museum of Modern Art, inv. 817.1935 (79.4 x 79.4 cm). Photo credit: Digital
Image © The Museum of Modern Art / Licensed by SCALA / Art Resource, New York.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
34 Michael Marrinan
Figure 12: “Agriculture, Labourage,” pl. I. Engraving by Robert Bénard for Diderot and
d’Alembert, Recueil des Planches, vol. 1. Courtesy Department of Special Collections, Stanford
University Libraries. Photo credit: Michael Marrinan.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 35
Figure 13: “Agriculture, Labourage,” pl. II. Engraving by Robert Bénard after a design by Louis-
Jacques Goussier for Diderot and d’Alembert, Recueil des Planches, vol. 1. Courtesy Department
of Special Collections, Stanford University Libraries. Photo credit: Michael Marrinan.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
36 Michael Marrinan
we think they are inscribed in the fields; however, they are inscribed on the
sheet of paper supporting the entire image. These elements point deictically to
captions and verbal explanations that supplement the visual forms and – most
important – direct our attention to other near-at-hand textual sources via many
cross-references. Here the paper support of the image performs two simultaneous
roles: on one hand, it presents markings that conjure up a fictive, three-
dimensional world; on the other, it supports letters and numbers that link the
entire icon to the space of a text. The deictic markers and accompanying lan-
guage structure join forces to diagram the image as surely as the geometry of
perspective diagrams the fresco of Perugino, but they are no longer imaginary
additions; rather, they coexist paradoxically with icons of horses and trees,
buildings and people to elicit a virtual explosion of attention.
That coexistence is drawn even more sharply beneath the line, inside the
smaller box presenting two different types of plows. The machines appear to be
situated on a ground plane marked by tufts of grass and cast shadows, but they
are also entirely surrounded by letters and numbers marking the various parts of
each device. If the ground plane is taken seriously the numbers and letters float
mysteriously in space. This is unlikely. However, when seen alongside its com-
panion plate (Figure 13), we immediately grasp that we are invited to recon-
ceptualize the blank sheet from spatial box to measurable flat surface (note the
scale of measurement), even if the move is ambiguous when we attend to the
lingering shadows and three-quarter views.
The ensemble of related imagery (I illustrate only two of the four plates) is
noteworthy for striving to keep both types of support simultaneously viable in a
dialectic of experiment. Before this set of images, I doubt our fictive Pierce
would exit cleanly his “imaginary moment” of curiosity and experiment, for the
unfolding information tends to branch continuously in many directions. Indeed,
that was the purpose of the Encyclopédie’s configuration, as explained in this
famous passage from d’Alembert’s Preliminary Discourse of 1751:
The reader opens a volume of the plates; he sees a machine that whets his curiosity; it
is, for example, a powder mill or a paper mill, a sugar mill, or a silk mill, etc. Opposite it
he will read: figure 50, 51, or 60, etc., powder mill, sugar mill, paper mill, silk mill, etc.
Following that he will find a succinct explanation of these machines with references to
the articles “Powder”, “Paper”, “Silk”, etc (d’Alembert 1963 [1751]: 126–127).
D’Alembert makes perfectly clear that users of the Encyclopédie will move across
the discursive breach between images and texts when working with the assembled
materials. This is not because the publication mobilizes new technologies of image-
making or textual presentation, but because it brings together and interconnects
existing forms in new ways. We will return shortly to consider this attention to
a systemic coupling of parts.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 37
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
38 Michael Marrinan
are made, contradictions become visible, and positive feedback begins to influ-
ence new thinking “as soon as one is able to muster a large number of mobile,
readable, visible resources at one spot to support a point” (Latour 1986: 13).
The Encyclopédie of Diderot and d’Alembert was exactly a project designed
to assemble a large corpus of such resources in one spot; it was also “mobile”
insofar as it was published in bound volumes that traveled around the globe.
Latour lists nine advantages of what he calls the inscriptions of “paper
work,” of which four are of special interest to me: inscriptions can easily be re-
shuffled, superimposed, integrated into a written text, and merged with geometry
(Latour 1986: 21–22). Such operations are possible because inscriptions exhibit an
“optical consistency” established by the sheet of paper (Latour 1986: 7–9 and
22–23), only one of which is the “space-making” system of Renaissance perspec-
tive as practiced by Perugino (Figure 9). Latour summarizes the effect:
On paper hybrids can be created that mix drawings from many sources. Perspective is
not interesting because it provides realistic pictures; on the other hand, it is interesting
because it creates complete hybrids: nature seen as fiction, and fiction seen as nature,
with all the elements made so homogeneous in space that it is now possible to reshuffle
them like a pack of cards (Latour 1986: 9).
If one raises the objection that the alphabetical order will ruin the coherence of our system
of human knowledge, we will reply: since that coherence depends less on the arrangement
of topics than on their interconnections, nothing can destroy it; we will be careful to
clarify it by the ordering of subjects within each article and by the accuracy and frequency
of cross-references (Diderot 1994, 1: 235).
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 39
5 en.wikipedia.org/wiki/Mashup_(web_application_hybrid).
6 Bender and Marrinan 2010: 19–33 for further discussion of these examples.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
40 Michael Marrinan
Figure 14: “Patissier, Tour à Pâte, Bassines, Mortier, &c,” pl. I. Engraving by Robert Bénard
for Diderot and d’Alembert, Recueil des Planches, vol. 8. Courtesy Department of Special
Collections, Stanford University Libraries. Photo credit: Michael Marrinan.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 41
7 Elkins (1999: 85–89) invokes the idea of “negative white space” in his response to the
nominalism of Nelson Goodman when considering the conceptual intersection among writing,
pictorial elements, and notations.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
42 Michael Marrinan
Figure 15: “Dessein, Jambes et Pieds,” pl. XIII. Engraving by Robert Bénard after a design by
Edmé Bouchardon for Diderot and d’Alembert, Recueil des Planches, vol. 3. Courtesy Depart-
ment of Special Collections, Stanford University Libraries. Photo credit: Michael Marrinan.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 43
Figure 16: Dessein, Mains,” pl. XII. Engraving by Robert Bénard for Diderot and d’Alembert,
Recueil des Planches, vol. 3. Courtesy Department of Special Collections, Stanford University
Libraries. Photo credit: Michael Marrinan.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
44 Michael Marrinan
Figure 17: “Anatomie,” pl. Seconde IV. Engraving by Robert Bénard for Diderot and and
d’Alembert, Recueil des Planches, vol. 1. Courtesy Department of Special Collections, Stanford
University Libraries. Photo credit: Michael Marrinan.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 45
shadows to develop something like believable volume and mass. Figure numbers
referring to an external text again lurk at the edges of each fragment, much as
the objects arranged in the visual array of our pastry plate (Figure 14). But there
is also a fifth vignette of a hand, marked by secondary reference points of small
letters very like the feet just discussed; it casts no shadow. Here too, the optical
consistency of the paper’s whiteness provokes an ongoing dialectic between two
very different modes of seeing and thinking. It also provokes even more pro-
nounced hybrids: for example, juxtaposition with a plate two volumes away
in the section on “Anatomie” where hands and feet are subject to a completely
different kind of analysis (Figure 16). The dialogue between an anatomist’s view
of the body’s interior and an artist’s view of its exterior, enabled by visual cross-
references and textual prompts, elicits an understanding of one’s physical self
that is located in neither of the images separately. Rather, it is generated by an
active process of correlating two disparate disciplines, a process made possible
by the optical consistency that links the plates and accompanying texts in a
unified system of data display.
5 Systemic Thinking
My account of the Encyclopédie depends absolutely upon the idea of an active
user who turns pages, looks up references, reads text, and continues the process
for as long as the material continues to generate curiosity. There is no “end”
towards which one works, especially because the Encyclopédie was never thought
to be a teaching manual. Could we really expect to learn how to make pastries
by looking at Diderot’s plates and reading his long and quite detailed explana-
tions? Probably not: we would be better served buying flour, eggs, sugar and
butter to get our hands into real dough. Yet there is a dimension of learning – a
cognitive component – to the mental and physical interactions established
within what I will call the “space” of the Encyclopédie. Like the carrefour of a
city created when several individual streets intersect, but without a structure of
its own, the Encyclopédie creates an environment for cognitive activity. I prefer
to talk about “users” of the Encyclopédie rather than viewers or readers because
to make it work we must physically and psychically enter the space of its
components – plates, captions, and texts – held together by the material white-
ness of its pages.
The interactive turn crucial to the Encyclopédie seems to mark an historical
moment. Consider the parallels of its experience and chronology with the carte
chronographique [chronological map] invented about 1753 by Jacques Barbeu-
Dubourg, who was also the French translator of Benjamin Franklin’s works and
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
46 Michael Marrinan
8 On the device see Rosenberg and Grafton 2010: 112–113. See Ferguson 1991 for a detailed
study of the only extant working example now in the Princeton University Library.
9 Schmidt-Burkardt 2012: 69–80 for full discussion of the diagrammatic importance of Barbeu-
Dubourg’s device.
10 The “Avertissement” at the head of volume 3 (page xv) reports that “M. Barbeu du Bourg,
Docteur en Medecine de la Faculté de Paris, nous a communiqué sa machine chronologique &
l’explication de cette machine.”
11 Free indirect discourse (called style indirect libre in French theory) consists of the use of
third-person, past-tense grammar to represent present first-person thoughts and mental states
in narration. The literature on FID is very large but see the following for good overviews:
Banfield 1982: esp. 225–254; Banfield 1987; Brown 2005: chapters 3 and 4.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 47
reader encounters “in the very language of the novel a real and empirically
determinable possibility, the disturbing presence of something impersonal,
inhuman, past and, in that instant, distant” (Banfield 1987: 278–279). Her account
of the impersonal narration of thought in FID, evoking the appearance of data
without any speaker, parallels the material whiteness of support in the Encyclo-
pédie that mobilizes a user’s deictic gestures of seeing, reading, and attention
to produce forms of knowledge not explicitly stated. I am naturally interested
in the irruption of reality from this matrix, for it recalls the “thing-ness” of
diagrams that deeply concerns Peirce.
My brief trip through free indirect discourse is seconded by a development
in the visual arts of the eighteenth century that Michael Fried calls “absorption”
(Fried 1980, esp. Ch.1). Writing about certain pictures of Jean-Baptiste-Siméon
Chardin (Figure 18)12 and Jean-Baptiste Greuze (Figure 19)13 whom Diderot also
admired, Fried suggests that depictions of self-absorbed figures tend to screen
out the audience, “to deny its existence, or at least refuse to allow the fact
of its existence to impinge upon the absorbed consciousness of his figures.
Precisely that refusal, however, seems to have given Greuze’s contemporaries a
deep thrill of pleasure and in fact to have transfixed them before the canvas”
(Fried 1980: 68). The paradox is that “because his presence was neutralized in
that way, the beholder was held and moved by Greuze’s paintings as by the
work of no other artist of his time” (Fried 1980: 69). The rapture invoked by
Fried depends upon extricating representation from the usual parameters of
placing some thing or action before a spectator in favor of a situation that simul-
taneously alerts the viewer to his non-existence even as the subject displays
itself. “What is called for,” remarks Fried, “is at one and the same time the
creation of a new sort of object – the fully realized tableau – and the constitution
of a new sort of beholder – a new “subject” – whose innermost nature would
consist precisely in the conviction of his absence from the scene of represen-
tation” (Fried 1980: 104). Following the critical lead of Grimm, Diderot and
Shaftesbury, Fried argues that such a tableau presents its material in a pictorially
unified, instantaneous, self-sufficient instant (Fried 1980: 82–92). To which I will
add: a tableau under absorption, like FID in literature, makes evident that it
addresses no one in particular – least of all its spectator – and for that very
12 Jean-Baptiste-Siméon Chardin, The House of Cards, ca 1737. Oil on canvas. Washington, DC,
The National Gallery of Art, Andrew W. Mellon Collection, inv. 1937.1.90 (82.2 × 66 cm). Photo
credit: The National Gallery of Art, Washington DC.
13 Jean-Baptiste Greuze, A Boy with a Lesson Book, exhibited 1757. Oil on canvas. Edinburgh,
Scottish National Gallery, inv. NG 436 (62.5 × 49 cm). Photo credit: Erich Lessing / Art Resource,
New York.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
48 Michael Marrinan
Figure 18: Jean-Baptise-Siméon Chardin: The House of Cards, ca 1737. Oil on canvas.
Washington DC, The National Gallery of Art, Andrew W. Mellon Collection, inv. 1937.1.90
(82.2 x 66)
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 49
Figure 19: Jean-Baptiste Greuze: A Boy with a Lesson Book, exhibited 1757. Oil on canvas.
Edinburgh, Scottish National Gallery, inv. NG 436 (62.5 x 49 cm). Photo credit: Erich Lessing /
Art Resource, New York.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
50 Michael Marrinan
reason holds the beholder spellbound by a fiction of real presence that we might
call its “thingness.” Thus, Fried’s paradox returns us by a route through visual
culture to that “moment of a pure dream – not any particular existence, and
yet not general” when the “thingness” of a diagram leads Peirce to contemplate
the icon.
The visual arrays of the Encyclopédie’s plates – frontality of presentation,
discontinuity with adjacent spaces rendered in perspective, pervasive material
whiteness joining visual arrays to a network of numbers and letters keyed to
texts – fail to converge in a single vantage point or entity that might be called
a viewer in the sense that Perugino takes for granted. Users of the Encyclopédie
and Barbeu-Dubourg’s carte chronographique resemble the impersonal readers
of FID described by Banfield and the alienated beholders of pictures described
by Fried. Quite unlike embodied viewers, they are impersonal but functional
components, inseparable from the systems of representation in which they are
embedded. They are empowered to initiate a process of correlation, even as
they realize that their subjective presence is almost non-existent; very like the
non-person to whom free indirect discourse is addressed or the unacknowledged
viewer of absorptive paintings. Taken together, these phenomena define histori-
cally the decades when diagrammatic knowledge moved to center stage.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 51
14 See Varela, Thompson and Rosch (1992: 133–145) for a discussion of the legacy of “The
Cartesian Anxiety.”
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
52 Michael Marrinan
Figure 20: “Histoire Naturelle, Le Pou vu au Microscope,” pl. LXXXIV. Engraving by Robert
Bénard after a design by François-Nicolas Martinet for Diderot and d’Alembert, Recueil des
Planches, vol. 6. Courtesy Department of Special Collections, Stanford University Libraries.
Photo credit: Michael Marrinan.
mean “how a perceiver can guide his actions in his local situation” when that
situation is not fixed but changes constantly in response to his activity of
perceiving.
On their account, the sensorimotor activity of perception effectively guides
and determines what is perceived. This part of their thinking is compatible with
Gibson’s idea of information pickup. By contrast, they part company with
Gibson concerning the nature of the information likely to be picked up: he
believes there are “invariances in the stimulus flux” that relate to constancies
in the environment (Gibson 1986: 139 and 249 – 250). They reject the idea
of “invariances” because it assumes that the environment is independent
and that perception is “direct detection” whereas their view holds that percep-
tion is “sensorimotor enactment” and can only be studied by attending to the
“structural coupling” between a perceiver and the surrounding environment.
“Cognition is no longer seen as problem solving on the basis of representa-
tions,” they write; “instead, cognition in its most encompassing sense consists
in the enactment or bringing forth of a world by a viable history of structural
coupling” (Varela, Thompson and Roach 1992: 204 – 205). My point in taking
this detour through the arguments of Varela and colleagues is to introduce the
idea of a feed-back loop that couples perceiver to environment, along with their
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 53
insight that cognition is a process that brings forth a world under the aegis of
this coupling. I will eventually incorporate these points into my analysis of
how users manipulate the plates and texts of the Encyclopédie, but must first
explore some of their consequences.
Once cognitive scientists shrug off the anxiety of Descartes to dissolve the
barrier of skin and skull separating body and environment, it is inevitable that
more radical formulations will follow. Andy Clark, one of the leaders of this
tendency, published in 1998 a polemical article with David Chalmers that has
become a classic of cognitive literature (Clark and Chalmers 1998).15 In it they
argue for “an active externalism, based on the active role of the environment in
driving cognition processes” including “the general paraphenalia of language,
books, diagrams, and culture” in which “the individual brain performs some
operations, while others are delegated to manipulations of external media”
(Clark and Chalmers 1998: 7–8). Essential to their view is the potential that “the
human organism is linked with an external entity in a two-way interaction,
creating a coupled system that can be seen as a cognitive system in its own
right . . . our thesis is that this sort of coupled process counts equally well as a
cognitive process, whether or not it is wholly in the head” (Clark and Chalmers
1998: 8–9). One example they cite is a Scrabble player who constantly arranges
and re-arranges letters on his tray: in this case, they argue, “the re-arrangement
of tiles on the tray is not part of the action; it is part of thought” (Clark and
Chalmers 1998: 10). Equally essential to their account is the requirement that a
coupling system be reliable so that the external resources will be at hand when
they are needed: this does not mean that the resources must be always present
(as in the brain), but present when required (Clark and Chalmers 1998: 11–12).
They advance a notorious example that been much discussed in the cognitive
literature: a fictive Otto suffering from Alzheimer’s disease who relies heavily
upon information he carries in a notebook when making his way in the world.
“We are advocating a point of view,” they write, “on which Otto’s internal pro-
cesses and his notebook constitute a single cognitive system” (Clark and Chalmers
1998: 16).
Clark and Chalmers assign an essential role to language, which they describe
as “a central means by which cognitive processes are extended into the world”
(Clark and Chalmers 1998: 11). Later in the essay, after citing several examples
from everyday life, they conclude:
15 The thesis is much expanded and developed in Clark 2011, where the entire 1998 article is
reprinted as an appendix.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
54 Michael Marrinan
In each of these cases, the major burden of the coupling between agents is carried by
language. Without language, we might be much more akin to discrete Cartesian “inner”
minds, in which high-level cognition relies largely on internal resources. But the advent
of language has allowed us to spread this burden into the world. Language, thus
constructed, is not a mirror of our inner states but a complement to them. It serves as a
tool whose role is to extend cognition in ways that on-board devices cannot (Clark and
Chalmers 1998: 18).
I invoke the literature of cognitive science to set the stage for re-thinking what
occurs when users of the Encyclopédie manipulate the plates and pages of corre-
sponding text in order to follow a chain of ideas sketched by Diderot’s cross-
references. Such users are not simply looking, not simply reading, but are
actually thinking through the materials about ideas, problems, or technical matters
that exist nowhere but in the interaction of their bodies with the oversized folios
of the publication. They are coupled to a cognitive system that does not merely
catalogue component parts but generates knowledge by eliciting study of those
parts in new and unpredictable formations guided to a large extent by the indi-
vidual user’s personal orientation, life experience, and expertise. Curiosity, the
hunger to know, is the motor force that brings to life the inert materials of ink
and paper.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
On the “thing-ness” of diagrams 55
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
56 Michael Marrinan
Acknowledgments
Much of my thinking about diagrams depends upon the many hours of discus-
sion, and sometimes argument, that I spent with my colleague and co-author
John Bender during the time that we wrote The Culture of Diagram. I am grateful
to him for those months of invigorating collaboration. I also thank Christina
Ljungberg and Sybille Krämer for the invitation to participate in this volume. I
presented some of this material to the conference “Thinking with Diagrams”
organized in July 2011 at the Freie Universität Berlin. The good discussions, and
probing questions about why Charles Sanders Peirce figures so little in The
Culture of Diagram, inspired the present essay.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Sun-Joo Shin
2 The role of diagrams in abductive
reasoning
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
58 Sun-Joo Shin
the leap from data to a theory, mainly how to construct it, how to justify it, and
how to evaluate it. A theory is waiting to see how well reality will bear it out.
While practitioners of deductive reasoning are interested in mechanization of
the process, scientists who exercise ampliative reasoning aim to conquer an
indefinitely expanded territory of our knowledge. Both sides trade on their
merits, and nothing is wrong with that.
How about philosophers? One of the main tasks of philosophy is to investi-
gate a subject at a meta-level. What kinds of meta-issues arise from deductive
and inductive processes? First of all, there is an issue of justification. Sure
enough, Hume’s skepticism has generated a tremendous amount of literature
about the justification of induction. The justification of deduction has also been
discussed rather extensively in the literature (Carroll 1895, Prior 1960 and Belnap
1962). As Hume himself admitted, skepticism toward justification stays at a
theoretical level and life goes on. We keep making inductive judgments all the
time, from ordinary daily events to scientific activities. This is even more so
in the case of deductive reasoning. Mathematicians do not stop to dwell on the
justification of modus ponens. The majority of them would not even feel the
need of its justification.
Relating to the justification issue, the philosophy of science has raised more
specific questions as to the practice of ampliative reasoning involved in scien-
tific theories – how to come up with a theory, how to select one out of many
possible theories, and how to test out a theory for indefinitely many future
cases, etc. As a contrast, discussions in the philosophy of logic and the philosophy
of mathematics have focused on more foundational questions, for example, logical
truth, mathematical truth, ontology of numbers, etc., instead of the deduction
process itself. As a result, there is less overlapping concern between philoso-
phers and practitioners in explicative reasoning than in ampliative reasoning.
One of the reasons for the discrepancy is that many believe the deduction pro-
cess, being certain, is less controversial than the induction process, being only
probable. Logicians and mathematicians focus on formal systems themselves,
without much ontological commitment. Axioms and inference rules should fill
in a gap between premises and conclusion, and we know how to check whether
each step is valid within a system.
On the other hand, scientists have to find a way to get to a hypothesis in
order to establish their own theories for explaining phenomena, and there is no
specific guideline to bridge between observation and a hypothesis. What’s more
intriguing, no hypothesis could fill in a gap between data and a theory. There
is no 100% proof, so to speak. We expect constant challenges both from other
theories and from future events. It is not surprising why many philosophers find
these questions more interesting at a theoretical level than finding algorithms
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
The role of diagrams in abductive reasoning 59
and establishing formal systems for explicative reasoning. Hence, much of the
literature in philosophy of science directly addresses the inductive process while
the literature in the philosophy of logic and mathematics is more occupied with
fundamental questions instead of the deductive process itself. I claim that what
exactly is going on with a deductive step has been massively overlooked and
argue that deductive reasoning is full of mysteries, in spite of its certainty.
Suppose a system is given to us. Suppose a deductive step is taken by citing
an axiom (or a previous line) and an inference rule. Then, we may check its
correctness easily since a finite number of axioms and inference rules are
involved. A deductive proof consists of this process only finitely many times.
We do not seem to have any interesting theoretical questions inside a system.
Only a few questions about a system seem to remain, such as “Is the system
valid and/or complete?” “How powerful is the system?” “Are there logically
equivalent systems?” If that were the case, deductive steps themselves would
be mystery-free.
Let me raise simple-minded questions. Why did it take more than three cen-
turies to prove Fermat’s last theorem successfully? Does the fact that Goldbach’s
conjecture is not yet proven show the lack of rules or axioms of a system we are
working on? Or, are we waiting for more effort, more time, and more insight?
More mundane questions: Why do we struggle in a geometry class to prove a
theorem? We know axioms, definitions, and rules. We are told that the proposi-
tion we are supposed to prove is a theorem, which means that the proposition
follows as a consequence from axioms, definitions, or previous theorems. All
we are doing is to extract the information of the proposition from the informa-
tion of the previous theorems and axioms. After all, the conclusion is contained
in the premises, we say. The number of axioms, previous theorems, and rules
are all finite as well. Then, why are we having a hard time to prove the logical
consequence from the premises to the conclusion? Let me call this question the
mystery of deduction, and this is the mystery that has been overlooked in the
literature which I would like to take up in the paper.
Interestingly enough, the deduction process itself, unlike the inductive process,
has not drawn much philosophical attention. Deductive steps have been con-
sidered as a crystal clear procedure while guessing and agony have been granted
for inductive steps. I believe the division between explicative versus ampliative
reasoning has provided us with assurance of deductive reasoning while a gap
between premises and conclusion in inductive reasoning has constantly fascinated
philosophers.
Let us pause here to examine what has been assured in explicative reason-
ing, and how. Deduction, which is used in explicative reasoning, if correctly
carried out, assures us of the truth of the conclusion if the premises are true.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
60 Sun-Joo Shin
Hence, two conditions for the assurance are: (i) true premises and (ii) valid infer-
ence steps. The first condition is assumed and is not expected to be shown
within an argument. The second condition requires a meta-theory where the
rules of our system are valid, that is, the soundness of the system. Then, given
(i) and (ii), any step taken from premises adopting an inference rule is assured
to be correct, and so is the proposition reached in the step.
This is a question: Do (i) and (ii) assure us what a next deductive step will
be? I could make a valid step, by citing a previously proven theorem in the
system, say ‘(p v ~p).’ The truth of the statement is assured, but not its useful-
ness. Most probably, this is not the step I need to take to get to the conclusion in
an argument. That is, the assurance is given to the result, not to the process
itself. Even when the number of premises and the number of inference rules
are finite, it seems as if there is an infinite number of valid steps we could take
in a deductive argument. This is where the mystery of deduction lies, I claim. In
spite of the ‘containment’ relation in information between premises and con-
clusion, many deductive arguments are not easy to make or even to follow.
Some of them are so hard that many decades or centuries pass until a genius
guides us through a mysterious deduction forest.
Why are some deductive steps are trivial while some are not? Are there dif-
ferent kinds of deduction? For non-trivial cases, what is the source of difficulty?
In the case of ampliative reasoning where the conclusion contains more than
what the premises say, difficulty is inherent when we move to a next step. For
this reason, how to search a hypothesis and how to set up a theory have been
major topics in the philosophy of science. In the next section I would like to
justify a division within deductive reasoning − trivial versus non-trivial. After
examining the source of the difficulty of deductive reasoning in the third section,
I borrow Charles Peirce’s discussions of abduction to argue in the fourth section
that non-trivial deduction requires abductive reasoning. In the fifith section, I
make a case for the claim that a form of representation is closely related to the
success of abductive reasoning as I characterize it in the paper.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
The role of diagrams in abductive reasoning 61
us anything about the sum of its interior angles. There is a difference between
mathematical truth and an analytic statement such as “All bachelors are un-
married.” Hence, Kant classifies mathematical truth as synthetic.
On the other hand, mathematical truth seems to be different in two aspects
from other synthetic truth, say, “The earth has one moon” or “It is sunny today.”
First, mathematical statements are universally accepted unlike with other syn-
thetic contingent truth. Second, even though mathematical judgments are not
made by the analysis of concepts, they do not come from our experience, either.
Hence, Kant classifies mathematical truth as a priori. Here is the birth of the
category ‘synthetic a priori.’ Then what is the source of our mathematical judg-
ments? This is where Kant’s long journey starts.
For our discussion, we do not need to follow his journey, but will focus on
Kant’s division between analytic a priori and synthetic a priori. We are aware
that Kant’s logic is different from ours, and modern logic covers a larger territory
for conceptual analysis than Kant’s logic. Hence, many would like to say Kant’s
synthetic a priori category is outdated. Also, the analytic/synthetic distinction
itself has been controversial, and some do not want to buy into the distinction.
Nonetheless, I believe Kant’s distinction is illuminating: he saw something dif-
ferent between simple and complicated deduction processes. In Kant’s mind,
mathematical truth is something beyond a process of deduction. His logic is
limited to Aristotelian monadic logic. When the logic of relations gets in the
picture, his terminology ‘synthetic’ might be questionable, but his original divi-
sion still stands as a demarcation between monadic and relational logic. This
demarcation is directly related to our investigation into the deduction mystery,
I claim.
There is a deduction sequence to show that the sum of the interior angles of
a triangle is 180 degrees. Whether the proposition is reached by analyzing the
concept of triangle is not an issue, but it is clearly an explicative process. Then,
is the triangle proposition of the same kind as Kant’s analytic example of “All
bachelors are unmarried”? Some might think so, believing that the difference
lies only in the degree of difficulty. Similarly, those who believe Kant’s distinc-
tion between (Aristotelian) logical and mathematical truth comes from Kant’s
ignorance of modern logic might say the distinction should be extinct. I do not
deny that monadic is simpler than relational logic. When we move to relational
logic, carrying out deduction is more complicated, and hence more difficult. I
claim the difference is not a matter of degree, but a matter of kind, and a different
kind of difficulty involved in relational logic is the source of the deduction
mystery itself. In that sense, Kant’s separation of mathematical truth from
monadic logical truth is remarkable.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
62 Sun-Joo Shin
The peculiarity of theorematic reasoning is that it considers something not implied at all
in the conceptions so far gained, which neither the definition of the object of research
nor anything yet known about could of themselves suggest, although they give room for
it. Euclid, for example, will add lines to his diagram which are not at all required or
suggested by any previous proposition. (Peirce 1976: 49)
1 I made a case for an (approximate) correspondence between Kant’s and Peirce’s distinction in
Shin 1997.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
The role of diagrams in abductive reasoning 63
(4) AC = AB.
(5) BC = BA.
(6) Hence, triangle CAB is an equilateral triangle. QED
Steps (1)−(3) require construction and steps (4)−(6) are taken by observation.
Anyone who tried to solve the proposition would agree that the construction
process is more difficult than the observation steps. Among the construction
steps, drawing the first circle in step (1) is the most difficult one. Even though
connecting points in (3) to form a triangle is a construction process, it is rela-
tively easier than step (1). Borrowing Peirce’s characterization of theorematic
reasoning, we may say that step (1) requires ‘an ingenious experiment’ on line
AB. Why is the construction ingenious? The first answer would be: It brings in
a new geometric object in the proof (Hintikka 1980). This is not incorrect, but
invites the next question: Why does the introduction of a new object cause
much trouble?
2 The point I am making here could be even stronger in the case of later propositions.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
64 Sun-Joo Shin
This is a crucial question to answer and this is where the mystery resides in
the deduction process! In the above proof, given line AB, we are almost certain
that something should be constructed with the line. That is, we need to intro-
duce a permissible geometrical object into a proof. There is no clear hint or
implication about what should be the right object we need to bring in at this
stage. This is, I claim, what Peirce meant by “not implied at all in the concep-
tions so far gained.”
Let me suggest three components of a theorematic step and name them as
follows:
e [Multiple Choices] There are other legitimate geometrical objects (instead of
a circle ABD) we could introduce at that stage.
e [No Algorithm] There is no specific guideline which object should be drawn
for a proof.
e [Ex-post Evaluation] Which is the right one among multiple permissible
choices will be known only after a proof is complete.
Let me continue the above proof example to illustrate the three components.
Suppose we introduced the following objects, instead of a circle ABD:
All of them are justifiable: not only a circle ABD but also these objects follow
from postulates or definitions.3 All of them being deductively permissible
objects, each construction would be a valid deductive step. This is the Multiple
Choices component. Then, how do we come to choose the circle ABD over the
other choices? The No-Algorithm clause answers “We are on our own.” If so,
what prevents us from introducing other objects? What if we choose to draw a
parallel line, instead of circle ABD? As the Multiple Choices clause says, it is a
deductively valid step and we are in the middle of a deductive proof. However,
we do not want to take any of these steps other than drawing circle ABD.
How do we know circle ABD is the right object we should be looking for, not
any other objects? It seems only after we may prove that the circle ABD proves
(!) the proposition! There is no ex-ante rationalization, but only an ex-post
evaluation.
3 Let’s recall this is the first proposition, and we can easily imagine more possibilities for later
propositions, citing previously proven propositions.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
The role of diagrams in abductive reasoning 65
The three components together tell us that deduction, which delivers certainty,
does not provide us with certitude as to how to deliver certainty. Yes, that is an
ironical mystery. Embracing it, we should stop wondering why we often, more
often than we would like, get stuck in deductive reasoning even when we are
assured that a given conclusion follows from given premises.
How to make the right choice is at the heart of the deduction mystery. Ex-
tending the mystery to the context of deduction in general beyond geometrical
proofs, we may generalize the issue in the following way: Without an explicit
guideline for the next step we need to take, how do we navigate through avail-
able premises and rules to make the right choice? Which axiom, which theorem,
which definition, which inference rule do we want to recall to take the next step?
This is indeed the most immediate question for those who actually carry out
deductive reasoning to prove a proposition. My hunch is: We make the right
choice in a theorematic step, not deductively, but abductively. In the next
section I would like to present arguments to back up this bold suggestion.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
66 Sun-Joo Shin
Many philosophers have been fascinated with the ampliative aspect of a scien-
tific theory. My question is: Is the ampliative nature the only source of these
hotly debated discussions? Are these hypothesis-related issues controversial
and interesting just because a hypothesis covers more than what data show?
Without any definite answer in mind, I am going to apply the three com-
ponents of a theorematic (hence, deductive) step we presented in the previous
section to a hypothesis proposal in a scientific theory. (i) Are there multiple
candidate hypotheses for given data? Definitely, “yes.” More than one hypothesis
is compatible with the given data a theory aims to explain. (ii) Is an algorithm
available for our choice of a hypothesis? Definitely, “no.” Lack of mechanical pro-
cedure has been taken for granted in the search for a hypothesis and has been
attributed solely to its inductive and ampliative nature. (iii) How do we evaluate
hypotheses? How do we know which hypothesis is better than or superior to
others? Not by its own truth-value alone: but its merit absolutely depends on
the scope of its coverage or its ability to predict future data.
Multiple choices, lack of an algorithm, and evaluation − all of the three
features are common between a theorematic step and a hypothesis proposal.
Now it is time to tap into the related rich literature to hypothesis in the philosophy
of science in order to understand the deduction mystery better. Charles Peirce’s
abduction is a good place to start. Two preliminary remarks are in order.
First, please note that theorematic reasoning − where I placed the deduction
mystery − is also Peirce’s concept. However, interestingly enough, Peirce himself
never connected theorematic reasoning and abductive reasoning. On the con-
trary, he placed these two categories of reasoning at the opposite sides of a
spectrum, one at an explicative and the other at an ampliative end. Therefore,
my view that abductive reasoning could provide us with insight about the
mystery of theorematic reasoning would be almost against Peirce’s categoriza-
tion of reasoning.
Second, Peirce’s classification of reasoning went through various stages,
and so did his abduction.4 Hence, Peirce’s abduction itself, i.e. what Peirce
meant by ‘abduction,’ has formed an interesting and independent research topic
both among Peircean scholars and among philosophers of science, and I do not
intend to get into those debates in this paper. Instead I rely on well-accepted
and uncontroversial parts of Peirce’s own writing about abduction and advance
my claim how deduction requires abductive elements.
Not surprisingly, Peirce’s tireless inquiry into scientific discoveries and scien-
tific explanations centered around reasoning related to hypotheses. Considering
4 For the development of abduction in Peirce’s own philosophy, please refer to Fann 1970. For
general introduction of the topic, refer to Burks 1946; Douven 2011a; Douven 2011b.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
The role of diagrams in abductive reasoning 67
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
68 Sun-Joo Shin
This step of adopting a hypothesis as being suggested by the facts is what I call abduction.
. . . There would be no logic in imposing rules, and saying that they ought to be followed,
until it is made out that the purpose of hypothesis requires them. (CP 7.202)
What is good abduction? . . . [I]t must explain the facts. . . . Its end is . . . to the avoidance of
all surprise and to the establishment of a habit of positive expectation . . . (CP 5.197)
A hypothesis will be evaluated in terms of its explanatory power and its ability
to predict the future. That is, its legitimacy is proven in terms of its utility. Many
of us could easily be reminded of Peirce’s pragmatism, and realize why abduc-
tion plays such an important role in Peirce’s philosophy.
I argue a deductive step chosen in theorematic reasoning is accepted or
rejected in a similar way. Being true is necessary, but far from being sufficient.5
Is there something similar to the explanatory power of a hypothesis in the case
of a deductive step? A scientific theory aims to explain certain observable data
with our current knowledge. We may consider a deductive proof as an explana-
5 One could repeat tautologies which might be technically valid, but of no use, period.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
The role of diagrams in abductive reasoning 69
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
70 Sun-Joo Shin
puzzle. They are explicative steps, meaning that a deductive step logically
follows from previous ones. Nonetheless, our practice shows how difficult it
could be to find a correct step in a deductive proof. In the third section, we iden-
tified the deduction mystery as a process of making a right choice among many
alternatives without any guideline. Even though each alternative is a logical
consequence of the previous steps along with axioms or definitions, a choice
itself cannot be made deductively, but (I claim) only abductively. That is, meta-
reasoning involved in interestingly difficult deductive steps requires abductive
reasoning.
In the previous section, we looked into Peirce’s abduction which has been
applied and discussed only in the context of constructing hypotheses and recog-
nized that there are striking similarities between the process of searching for
deductive steps and Peirce’s abductive reasoning toward hypotheses. Both take
us to the meta-reasoning level. Given more than one legitimate choice, both
processes need to appeal to our experience, insight, and intuition since there is
no algorithm we could rely on. After a hypothesis is proposed or a deductive
step is constructed, its evaluation could be made only after we have a chance
to check its utility − how a proposed hypothesis or a presented deductive step
leads us to the final goal more smoothly or more comprehensively.6
If my argument for abductive elements in deductive reasoning is convincing,
I would like to go back to the deduction mystery and raise the following prac-
tical question: What helps us to perform abduction more successfully? We
realized that there is no set menu for the search process and creative and in-
genious work is needed. In this section I claim that different forms of represen-
tation could make a difference in steering our mind to find a right step. I do not
intend to generalize about the relationship between representation and finding
a deductive step in the paper, but would like to open discussions about how
representation affects our theorematic reasoning.
Let’s go back to the example of section 3, to construct an equilateral triangle
for a given straight line. In a proof, the crucial theorematic step is to draw a
circle whose radius is the given straight line. How could one come up with an
idea to draw a circle? First, it is important to realize that we need to bring in a
new object to the proof. Second, we need to choose or find a correct object,
while there are many other possibilities involving with the line. We could have
drawn a parallel line to the line, we could try to bisect the line, etc. I claim this
is where forms of representation could make a difference.
Suppose we do not draw a line and rely on symbolization only. We may
arithmetize the entire proof without using a diagram at all. We do not lose any
6 Of course, the final goal of a hypothesis and the final goal of a deductive step are different
from each other.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
The role of diagrams in abductive reasoning 71
rigor or legitimacy of the proposition.7 Hence, one might conclude that the form
of representation is only a vehicle which expresses the same proposition and
the same process from premises to the conclusion. If we focus only on what is
represented in each step of a proof, we are ignoring a meta-process − how to get
to each step. It would be just like zooming in on the process from a hypothesis
to observed data or to a prediction of future events. What would be missing is to
reason how to reason toward a hypothesis, which is meta-reasoning. However,
when we draw our attention to the meta-level, that is, how to form a hypothesis
or how to bring in a new object into a proof, the mode of representing informa-
tion becomes highly relevant to the process. Therefore, abduction, which is a
process of search, is sensitive to a form of representation.
There being no clear algorithm, insight and guessing are at play in abduc-
tive reasoning, as we discussed above. How we may become more creative or
more insightful so that we may propose the right guess is a fascinating question
whose answers are far from being clear. One safe response is to list items, e.g.
being more knowledgeable, being more experienced or trained, etc. I propose
that how we represent information, e.g., symbolic representation or diagram-
matic representation, is one of the items which facilitate abductive reasoning
more efficiently. Why is it the case?
I claim diagrammatic representation stimulates our mind so that we may
introduce a new object more easily than with corresponding symbolic represen-
tation. In the case of our example in section 3 of drawing a circle using the given
line as a radius, it would be even more difficult to come up with an idea of
drawing a circle if we took our premise represented only symbolically, say line
AB, instead of looking at a line drawn on the sheet of paper. When we encounter
a geometric line visually, our imagination is more active to search for various
geometric objects to be added to the line. Also, after adding a new object, it
would be easier for the mind to manipulate newly added information or to read
off information from newly constructed one. In our example, when two circles
are drawn, radii appear in a new diagram and all we need is to connect lines to
show all of the three lines are of the same length since AB and AC are radii of
one circle and BA and BC are radii of the other circle.
Let’s consider another often-cited example before we explore further why
diagrammatic representation is helpful for abductive reasoning:
7 On the contrary, some might think symbolization is more rigorous and more legitimate than
drawing diagrams.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
72 Sun-Joo Shin
Proof: Let AB be the given straight line. [We show how to bisect it.]
(2) Bisect the angle ACB and draw line CD. (Proposition 9)
(3) Then, triangle ACD and triangle BCD are congruent to each other, since AC =
BC (by the definition of an equilateral triangle), CD is common, and angle
ACD = BCD (by (2) above).
(4) Therefore, AD = BD. QED
Both (1) and (2) are theorematic steps which require abductive reasoning: (i)
New geometric objects are introduced. (ii) An equilateral triangle and a bisecting
line are not the only objects we can introduce at those moments. (iii) There is no
guideline to tell us which objects need to be brought in. (iv) Whether these are
correct objects is known only after they deliver the conclusion (smoothly).
In spite of legitimate warnings against the misuse of diagrams in a rigorous
proof, we cannot deny that figures in the proof prompt our mind to bring in
another figure more easily than does symbolic representation. It is true that
what we are seeing on the paper are tokens of geometric figures with specific
sizes and shapes, and some of these particular properties could mislead us.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
The role of diagrams in abductive reasoning 73
8 I have pointed out an asymmetry between geometric and non-geometric problems: We adopt
diagrams in both kinds of problems, but we hardly adopt symbolic representations for geometric
problems at the brainstorming stage. Refer to Shin 2012.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
74 Sun-Joo Shin
However, this credit has not been theoretically justified. Why are diagrams
more helpful than symbols for brainstorming? What’s worse, the emphasis on
the heuristic side of diagrams has wrongly implicated the weakness of diagrams
in a non-heuristic proof context. The arguments presented in the paper provide
us with the rationale why diagrams could be a better form of representation at
a heuristic level, without ruling them out from a formal proof.9 Abduction is
crucial in navigating us in a directionless search for a proof step, and I argue
that diagrams are in many cases more useful for performing successful abduc-
tion. In the case of a geometric proof, finding an auxiliary geometric object is
more easily prompted by diagrammatic than symbolic representation. After all,
we need to bring in a figure into an existing figure. Also, after adding a new
object, it is easier for us to perceive new geometric figures and in many cases
these new objects are the key to the next step.
How about non-geometric problems? A diagonal proof, I believe, makes my
case. Let’s try to prove that truth functions for the set of sentence symbols
(which is countably infinite) are not countable. A proof is given by reductio ad
absurdum. I present the same information in two slightly different ways, one
being more diagrammatic than the other.
[1] Suppose (by reductio) that we enumerate the truth functions for the set
{A1, A2, A3, . . . }, say v1, v2, v3, . . . . Then, vn = {<A1, X1>, <A2, X2 >, <A3, X3 >, . . . }
[2] Suppose (by reductio) we assume that v1, v2, v3, . . . are enumerated in the
following way:
9 There is no good reason why we accept one form of representation as a heuristic tool, but not
as a formal proof, as long as we can formalize representation. Refer to Shin 1994.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
The role of diagrams in abductive reasoning 75
The next step is to construct a truth function, say vk which is not on our list.
How?
The diagrammatic representation above hints for us to diagonalize out our
list. That is,
Figure 23: Constructing a truth function which is not on our list by diagonalization
Let vk = {<A1, X*11>, <A2 , X*22 >, <A3, X*33 >, . . . }, where X*nn ≠ Xnn.
Suppose vk were on our list. Let’s check what is the truth-value of Ak by vk. Two
cases:
(i) It is when vk (Ak) = T. That is, Xkk = T in the above diagram. Since X*nn ≠
Xnn, X*kk = F.
(i) It is when vk (Ak) = F. That is, Xkk = F in the above diagram. Since X*nn ≠
Xnn, X*kk = T.
But, X*kk = vk (Ak) by our definition of vk. By (i) and (ii), vk (Ak) = T if and only if
vk (Ak) = F, which is a contradiction! Therefore, vk cannot be on the list. QED
Of course, we can construct vk in the other representation [1] as well without the
diagram of [2]. This is a subtle but important point I have been making: What
we are interested in is not whether we can represent information in one system
and not in another, but how we get inspiration for a crucial step. That is, we are
making an inquiry as to which representation helps us with the abduction
process. Even though our truth-function example is not intrinsically diagram-
matic (unlike geometry), representing truth functions diagrammatically in [2]
stimulates our mind so that if we diagonalize out the list, we can easily con-
struct a truth function which is not on our list.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
76 Sun-Joo Shin
6 Conclusion
Deduction, being explicative, does not produce any new knowledge in some
sense. On the other hand, in a scientific theory there is a leap from given data
to a hypothesis. Logical consequence would not provide us with a hypothesis
we are looking for. This is one of the main reasons why philosophers have been
focusing on the hypothesis searching process. In the paper I argued that there
are two different levels on which we can make our inquiry, and a confusion
between the two levels is one of the main reasons why we have been puzzled
with the difficulty encountered in deductive proofs. While each step follows
from previous ones, why is it difficult to make progress in a proof? In the case
of finding a hypothesis, we need new ideas which say more than what we have.
Strictly speaking, a valid deductive step should not introduce a new idea, but
extract a piece of information from given information. Then, why is finding a
logical consequence so difficult? Indeed, this is the mystery of deduction.
When philosophers are interested in exploring hypothesis-related theoretical
issues, the focus could be either on a logical leap from data to a hypothesis or on
how to come up with a hypothesis from data. Let me call each step, the first-
level and the meta-level of reasoning, respectively. As far as the first level goes,
there is a clear difference between a deductive step and the construction of
a hypothesis, in terms of logical consequence. However, at the meta-level,
we find striking similarities between difficult deductive steps and hypothesis-
finding processes: (i) There are more than one deductive steps which follow from
previous ones and more than one hypotheses which are consistent with given
data. (ii) There is no set menu to follow, and insight and clever guessing are at
play. (iii) Which is the right step or the right hypothesis is known in the context
of a bigger picture, that is, in an overall proof or in the prediction of future events.
Based on these important features shared between deductive steps and
hypothesis proposals, I suggested we identify the meta-reasoning process for
deduction as abduction, which Peirce introduced to explore the hypothesis
proposal procedure. How to perform abduction more successfully and more
economically, I believe, could be one of the exciting topics in various areas,
both theoretically and practically. In the last section of the paper I presented
one way to approach the topic, by exploring the relation between forms of
representation and abduction. Stimulating the mind to find a right path to the
conclusion can be done in different ways, and how information is represented
must be one of the factors to steer our mind in a creative way. How to gather
and organize information, not surprisingly, should make a huge difference in
how we navigate through information so that we may get to where we aim to
reach.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Valeria Giardino
3 Behind the diagrams: cognitive issues
and open problems
Valeria Giardino, Institut Jean Nicod, Département d’Etudes Cognitives, Ecole Normale Supérieure
– PSL Research University
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
78 Valeria Giardino
We have three pulleys, two weights and some ropes arranged as follows:
1. The first weight is suspended from the left end of a rope over Pulley A. The
right hand of this rope is attached to, and partially supports, the second
weight.
2. Pulley A is suspended from the left end of a rope that runs over Pulley B, and
under Pulley C. Pulley B is suspended form the ceiling. The right end of the
rope that runs under Pulley C is attached to the ceiling.
3. Pulley C is attached to the second weight, supporting it jointly with the right
end of the first rope.
The same information is then displayed by relying on the diagram in Figure 24.
Figure 24: The diagram of the pulley system in Larkin and Simon (1987)
According to the authors, the two representations of the pulley system − its
linguistic description and its diagram − are to be considered as informationally
equivalent, since they happen to have the same content: the statements in the
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 79
description express the same information as the one displayed by the arrange-
ment of the elements of the diagram. However, consider someone who has to
solve a problem about the pulley system; for example, she is asked what the
ratio of the second weight to the first would be when the system is in equilib-
rium. Of course, some extra information is needed: from the context, one knows
that the pulleys and ropes are to be considered as weightless and the pulleys
as frictionless; furthermore, the rope segments are all vertical, except when
they run over or under the pulleys. In any case, in order to solve the problem,
whoever reads the description without having been shown the diagram would
very spontaneously reach for paper and pencil and draw a diagram based on
the description, thus obtaining something similar to Figure 24. In fact, the
linguistic description and the diagram do not seem to be equivalent from a
computational point of view: it is much easier to extract information about the
pulley system by looking at its diagram than it is by reading its linguistic
description. Despite being informationally equivalent, the two representations
are therefore computationally different.
The question to ask then is which of the diagram’s features would make this
‘computational difference’. Larkin and Simon’s suggestion is that, differently
from the linguistic description, the diagram allows for the information at one
location or close to it to be simultaneously accessed and presented. For this
reason, the search for the relevant information takes shorter time, thus facilitat-
ing computation. Note that in some sense both the linguistic description and the
diagram externalize the information about the pulley system in space, i.e., they
put information ‘out there’ for the user to be available for reasoning; however,
the diagram clearly exploits space in a way that is for some reason much more
convenient to enhance inference. Furthermore, both representations must be
interpreted in order to be used: one has to understand to which element of the
pulley system the words in the sentences as well as the elements of the diagram
refer to; nonetheless, the diagram seems to facilitate not only the search for
the relevant objects but also the very recognition of the objects involved in the
reasoning.
Let me clarify at this point what I will intend by ‘diagram’ in this context.
For the moment, my assumption is that a diagram is a two-dimensional arrange-
ment of content. To some extent, also the linguistic description above has
diagrammatic elements, because it is given as a numbered list; however, the
fact that the sentences are given in an ordered sequence is in this case not
enough to make it computationally more advantageous than the diagram. Con-
sider now a simpler problem that still illustrates the distinction between infor-
mational equivalence and computational difference. Imagine someone working
in a Parisian RER station who wants to display on a signpost the hours of the
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
80 Valeria Giardino
trains going from Paris to Versailles from 9 to 11 a.m. during the week, so as to
prevent too many tourists to keep asking him. He could choose to write the
hours in the three following ways:
(1) 10:13 08:01 08:32 09:14 09:32 10:48 09:52 10:02 09:01 10:33 08:12
(2) 08:01 08:12 08:32 09:01 09:14 09:32 09:52 10:02 10:13 10:33 10:48
As anyone can see, (1) to (3) have the same content, and therefore they are infor-
mationally equivalent. However, from a computational point of view, (2) is better
than (1) for the reason that the hours in (2) are chronologically ordered while in
(1) they are randomly displayed; moreover, in (3) the content is not only chrono-
logically ordered but also arranged along three different horizontal lines, repre-
senting the time intervals 8–9 a.m, 9–10 a.m, and 10–11 a.m. respectively. It is
then evident that (3) would be the more appropriate information display to
avoid long queues of tourists not sure about which train they are supposed to
take. Why is it so? (3) is computationally more efficacious because of its spatial
configuration, which allows the user to know almost immediately where to look
for to find the relevant information: now it is 9.15 a.m., it is a Monday morning,
and I see that there are 2 more trains going from Paris to Versailles before 10 a.m.
This example shows even more strikingly than the previous one that an
appropriate spatial configuration can facilitate inference, even when the infor-
mation is given in the same format (in this case, by displaying numbers repre-
senting the train hours). If the pulley system case still seems to allude to a
distinction between visual and sentential reasoning by comparing a linguistic
description and a diagram having the same content, the timetable example
centers the discussion on one crucial feature characterizing diagram-like repre-
sentations: diagram-like representations externalize information in space in a
relevant fashion thus supporting reasoning and inference. Note that such facili-
tation may also result from the interaction between different formats. Consider
again the pulley system diagram and the crucial role played in it by labels:
letters and numbers (for example A, W1, . . .) serve there as indexes, because
they are placed in the appropriate spatial location so as to ‘name’ the various
relevant elements of the diagram. As Peirce has suggested, “The index asserts
nothing; it only says “There!” It takes hold of our eyes, as it were, and forcibly
directs them to a particular object, and there it stops” (Peirce 1885: 181). Being
symbolic, letters and numbers are not ‘diagrammatic’ in nature, at least not in
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 81
any intuitive meaning of the term; nonetheless, they assume here an indexical
function, that is, they guide the attention of the observer towards the relevant
objects in the diagram, and they can do that precisely because, as Larkin and
Simon claim, they are in the same location or ‘close to’ these very objects.
I hope to have already shown the limits of a sharp distinction between
sentences on the one hand and diagrams on the other. Other arguments can be
given as well to reject such a distinction. Take for example specific diagram-
matic practices, such as the use of diagrams in geometry, where something
analogous to what we have just seen in the pulley system example happens:
displaying the problem spatially helps recognizing the relevant objects and find
connections with previous background knowledge. Nonetheless, the case of
geometrical proofs and Euclidean proofs in particular is of course more complex.
As it has been thoroughly discussed, diagrams and text in Euclid are interdepen-
dent, in such a way that the latter gives instructions on how to interpret the
former (see for reference Netz 1999 and Manders 2008). As Netz explains,
The perceived diagram does not exhaust the geometrical object. This object is partly
defined by the text, e.g. metric properties are textually defined. But the properties of the
perceived diagram form a true subset of the real properties of the mathematical object.
This is why diagrams are good things to think with. (Netz 1999: 35, emphasis added)
For reasons of space, I won’t go much further into this issue here, but I have
done it elsewhere (Giardino forthcoming 2016).
Based on these examples, we can come to some tentative conclusions that
will be the starting point of this article. First, diagrams are widespread in many
different cultures because they amount to cognitive advantages, that is, they
happen to be very good inferential shortcuts in problem solving. By externalizing
the information in space, they reduce the amount of search required to find the
relevant information and solve the task. Second, diagrams cannot be considered
in isolation from a context they refer to, and therefore they are not only spatial
tools: the space of the diagrams is an interpreted space, in line with textual and
background knowledge. If this is true, then diagrams have a twofold nature and
possess both cognitive and cultural features: they certainly exploits our spatial
cognition, but the spatial elements that can be visually identified are then open
to contextual interpretation. As a consequence, it is pointless to consider them
as visual tools in opposition to language; instead, the more appropriate and
interesting question concerns their computational efficacy, and how it happens
to emerge from the interaction between more spontaneous abilities and the
production of cultural artifacts.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
82 Valeria Giardino
In the rest of the article, my objective will be to sketch out the elements for a
conceptual framework explaining diagrams’ inferential powers. This will bring
me at first far from diagrams, and then back to a richer landscape involving not
only diagrams but also writing and notation, i.e., cognitive tools in general. In
Section 2, I will present the more general cognitive background. First, I will
introduce the view according to which most of human cognition is embodied,
in a sense to define, and then I will discuss how this feature of cognition relates
to the use of space in representing information. Such a framework would pro-
vide some insights on the cognitive reasons why the appeal to diagrams is
so spontaneous and widespread across cultures. In Section 3, I will focus on
another aspect characterizing diagrams that is at one time cognitive and cultural,
which is their relation to writing and more in general to written signs. In Section
4, I will present another crucial element that has been almost neglected in con-
sidering diagrammatic reasoning, which is the analysis of the actions that
are performed on the diagrams. Finally, in Section 5, I will propose a possible
‘big picture’ to offer as an account of diagrammatic reasoning, based on the
assumption that diagrams are kinesthetic tools open to change and manipula-
tion, and I will introduce what I consider as the typically human ability of
diagramming. As I will show, the consideration of the role of action in and on
the space of diagrams will be crucial in order to make sense of their computa-
tional efficacy.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 83
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
84 Valeria Giardino
pirical studies show that human cognition can in fact be considered for the most
part as embodied because information obtained from proprioception and kines-
thetics happens to be often reused for other more abstract tasks. The empirical
part of Goldman’s claim is to assume the existence of so-called bodily represen-
tational codes, a subset of mental codes that are primarily or fundamentally
applied in forming interoceptive or directive representations of one’s own bodily
states and activities (see for reference Goldman and Vignemont 2009). The phil-
osophical part of his claim is to consider that the brain reuses or redeploys cog-
nitive processes having different original uses to solve new tasks in new con-
texts; when it comes to bodily representational codes, these appear to be
extremely pervasive. Goldman’s proposal leads to a “moderate” conception of
embodiment-oriented cognitive science precisely because it specifies the role of
the body in cognition: first, by defining what bodily representational codes are
and then by explaining how they happen to have an influence on some
cognitive processes. We will see later in Section 5 that these considerations are
crucial to consider the relevance of performing actions on diagrams and to
explain the cognitive advantages of their use in some particular tasks.
Other approaches in cognitive science may be of interest. Barsalou (1999),
for example, has focused on the question about the influence that perception
seems to have on cognition. In his reconstruction of human cognition, he
observes that symbolic operations can do many things and are in fact essential
for intelligent activity. By symbolic operations, he intends things such as bind-
ing types to tokens or arguments to values, drawing inductive inferences from
category knowledge, predicating properties and relations of individuals, com-
bining symbols to form complex symbolic expressions, or representing abstract
concepts that interpret metacognitive states. If this is the case, then what cogni-
tive mechanisms underlie these operations?
Standard theories in cognitive science have claimed that core knowledge
representations in cognition are amodal data structures that get processed inde-
pendently of the brain’s modal systems for perception (e.g. vision, audition),
action (e.g. movement, proprioception) and introspection (e.g. internal percep-
tion of motivational states, affective states, goals, beliefs, cognitive operations,
meta-cognition and so forth). According to Barsalou, the somewhat paradoxical
result of the Cognitive Revolution of the 50s has been to ban imagery from
psychology as not sufficiently scientific. Interestingly, in his view, this attitude
was mainly due to the major developments that were taking place at the same
time in logic, linguistic, statistics and programming languages. Mental images
were replaced by amodal language-like symbols deriving from predicate calculus
and propositional logic: the idea was that such symbols are transduced from
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 85
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
86 Valeria Giardino
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 87
abstract than visual images and on the other hand they are more concrete than
propositional representations.
The interesting point in Knauff’s general framework is that he wants to
move against a set of dichotomies that he considers dangerous. As he explains,
one of his motivations is to overcome two common tendencies in cognitive
science: reductionism, claiming that unveiling the functioning of the brain
would be sufficient to understand the mind, and mentalism, endorsing the oppo-
site, that is that brain’s mechanisms would not have any role in explaining the
functioning of the mind. Knauff’s long-term objective is to give a contribution
to the study of human rationality, “barely scratching the surface” of this phe-
nomenon (Knauff 2013: 222): in his view, our rationality depends on the number
of models that we actually consider in reasoning, and this number varies accord-
ing to prior knowledge, previous beliefs, and one’s working memory capacity.
Note that Knauff’s theory has recently been considered in relation to Eucli-
dean diagrammatic reasoning. According to Hamami and Mumma (2013), a
cognitive account of Euclidean diagrammatic reasoning would be compatible
with Knauff’s framework. In fact, as it has been already mentioned, Euclidean
diagrammatic reasoning is not supported by visual images alone since one
needs to postulate the cognitive representations of the spatial information con-
tained in the premises of Euclidean diagrammatic inferences: it is the space of
the diagram that matters, not its visual appearance.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
88 Valeria Giardino
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 89
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
90 Valeria Giardino
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 91
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
92 Valeria Giardino
composed of squares with arrows drawn on them indicating the gluings i.e.
the homeomorphisms would be able to make new and in a sense unexpected
objects such as the Klein bottle emerge. Such algebraic topology notation has
characteristics that are typical of algebra, because it is a calculation device,
but also of geometry, because it gives the instructions for constructing the
object. For this reason, “we have a powerful piece of notation here that does
some genuine mathematical work for us” (Colyvan 2012: 139, emphasis added).
A good notation facilitates the economization of cognitive resources, calcula-
tion, possible advances in mathematics and it also provides more perspicuous
mathematical explanations. Notation or, to go back to Krämer’s terminology,
operative writing, is thus a very relevant aspect of the mathematical practice
that has been unfortunately neglected by the standard views in philosophy of
mathematics. One reason among others could have been that too much attention
was put on the supposed descriptive function of mathematical signs as opposed
to their creative function, as Rotman (2000) has proposed. This has obscured
their crucial importance in the practice of mathematics. We can think of building
a “semiotics” for mathematics: in Rotman’s view thoughts and scribbles are
mutually constitutive, and “mathematicians at the same time think their
scribbles and scribble their thoughts” (Rotman 2000: 35). This is again related
to writing, of which we have already seen the constitutive nature; however,
the standard approaches to mathematics such as formalism, Platonism and
intuitionism, have failed to acknowledge “the intricate interplay of imagining
and symbolizing, familiar on an everyday basis to mathematicians within their
practice” (Rotman 2000: 48): mathematics is first of all “a certain kind of traffic
with symbols, a written discourse” (Rotman 2000: 51).
If this is the right picture, then diagrams in particular, be them drawn or
simply imagined, are:
the work of the body; they are created and maintained as entities and attain significance
only in relation to human visual-kinetic presence, only in relation to our experience of the
culturally inflected world. As such, they not only introduce the historical contingency
inherent to all cultural activity but also, more to the present point, call attention to the
materiality of all signs and of the corporeality of those who manipulate them” (Rotman
2000: 57).
For Rotman, and in line with what I have claimed so far, the attitude typical of
the standard approaches of avoiding diagrams “like the plague” (Rotman 2000:
56) has ended up in overshadowing the importance for mathematics of things
such as materiality, embodiment, corporality, as well as its historical and social
situatedness.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 93
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
94 Valeria Giardino
Recently, recalling Lakoff and Núñez’ work, Sinclair and Gol Tabaghi (2010)
have claimed that, both to improve mathematical education and to define mathe-
matical understanding, it is necessary to provide a detailed analysis of the
use that mathematicians make of many heterogeneous representations: speech,
gestures, diagrams, and so on. To this aim, they have interviewed six mathema-
ticians and asked them to explain the meaning of the mathematical concept
of “eigenvector”; these interviews were filmed, to the aim of evaluating the
mathematicians’ embodied way of reasoning. The videos show that the six
interviewed mathematicians make use of various representations, moving from
one to the other without difficulties, thus blurring the alleged boarders between
the mathematical and the physical world. They all come from different disciplinary
domains – pure and applied – ranging from number theory to discrete applied
mathematics and they all well know the formal “manual” definition of eigen-
vector, which is single, atemporal and static; nonetheless, with no exception,
they offer a description of eigenvectors that alludes to a very different interpre-
tation, including also temporal and kinesthetic elements, as shown both in the
terms and in the gestures they use. Metaphors strike back. Some of the mathe-
maticians focus on the transformations of the vectors: for example, they say and
show in their gestures how the vectors “shrink” or “turn”; others describe the
vectors’ personality: they “go in the same direction”, they “align”, and so on.
In other words, none of the mathematicians speaks of eigenvectors only in terms
of algebraic equalities. Moreover, some metaphors are perceptual (for example,
one mathematician thinks of the quadratic function as a “goblet”) but most
of the metaphors used have a movement component. Furthermore, since the
mathematicians come from different specializations, there is high variability in
their gestures that seems to depend on their respective education and com-
petence, i.e., on their relevant background knowledge. The authors of the study
conclude that gestures, compared to language, would give even more possibilities
than simple speech to express continuity, time and movement, thus confirming
the intuition of the French mathematician Châtelet (1993) who saw in a mathe-
matical diagram the “crystallization” of a gesture.
In this example, we have seen that gestures, which are actions, can be used
to represent abstract objects and to talk about them. Therefore, they can embody
the reasoning about them, thus being close to the diagrams and notations that
were discussed in the previous sections. Nonetheless, gestures are volatile,
while diagrams and notations are not: they do not leave traces, as it happens
when drawing a diagram or writing an equation. Moreover, gestures are not
artifacts, at least not in any intuitive meaning of the term. However, it is true
that gestures, at least the ones of the mathematicians interviewed in the
reported study, are produced from within a written culture. As Goody explains,
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 95
memory is surely different in oral and written cultures; if this is true, then it
would be possible to think that also spontaneous gesture can be affected by
writing. More work needs to be done on the relation between gesture and
writing, in particular in specific contexts such as mathematics, where writing
plays a crucial role. For the moment, let us focus our attention on actions, and
consider if they can still play a role when we take into account the way we deal
with mathematical signs. Once more, this would give some plausibility to the
moderate embodied approach: as we will see, human beings seem to be able to
recruit not only space but also action for new tasks that are different from the
original ones for which space and action are used and have a specific cognitive
nature.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
96 Valeria Giardino
fact, they rotate and move them around to the aim of testing different alternative
solutions before letting them fall below. On this basis, the authors propose to
distinguish between pragmatic and epistemic actions: the aim of the actions of
the first kind is to bring the agent closer to his or her physical goal (in this
case, the one of creating a line without gaps); actions of the second kind instead
“use world to improve cognition” (Kirsh and Maglio 1994: 513), i.e. they are
“physical actions that make mental computation easier” (Kirsh and Maglio
1994: 513). An epistemic action is performed outside on the physical objects that
are available, and it is precisely the performance of this action that enhances
our inferential capacities. Note here again that the authors talk about facilitating
reasoning through actions on external tools.
Despite the obvious limits of the Tetris case in relation to more sophisticated
cognitive tasks, the concept of epistemic action can be extended to other more
complex contexts, for example to the use of signs in mathematics. As De Cruz
and De Smedt (2013) have recently proposed, mathematical symbols “enable us
to perform mathematical operations that we would not be able to do in the mind
alone, they are epistemic actions” (De Cruz and De Smedt 2013: 4). Of course,
such a view is not epistemologically innocuous, since it assumes once again
that mathematical signs are intimately linked to the concepts they represent –
or, in Macbeth’s view, of which they formulate the content – and vice versa.
Recently, together with De Toffoli, we have considered the practice of proof
in topology, in particular in knot theory and in low dimensional topology (De
Toffoli and Giardino 2014, 2015). We have proposed an interpretation of knot
diagrams as dynamic tools: in perceiving a diagram, one has to see the possible
moves that can be applied to it. For this reason, experts have developed a
specific form of enhanced manipulative imagination, which allows them to
draw inferences from knot diagrams by imagining and in some cases actually
performing epistemic actions on them. That is why, as Macbeth suggests, knot
theory implies a form of re-writing. Also the actual practice of proving in low-
dimensional topology involves a kind of reasoning that cannot be reduced to
formal statements without loss of intuition. The representations that are com-
monly used in low-dimensional topology are heterogeneous, being neither
entirely propositional nor entirely visual. This form of reasoning is shared by
experts: it is the kind of reasoning that one has to master to become a practi-
tioner. Moreover, the manipulations allowed on the representations as well as
the representations themselves are epistemologically relevant, because they are
integral parts both of the reasoning and of the justification provided. As a case-
study, we took Rolfsen’s demonstration of the equivalence of two presentations
of the Poincaré homology sphere (see, for reference, Rolfsen 1976) and explained
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 97
how most of the steps into which the proof can be broken down are transforma-
tions that lead from one picture to another, as clarified by the text giving the
instructions on how to interpret the pictures. Therefore, the representations give
a material form to the transformations (and in this sense they embody them),
thus allowing experts to perform epistemic actions (and in particular to draw
inferences) on them. These actions are controlled by the shared practice: the
set of legitimate transformations is limited and determined by the context.
As Larvor (2012) explained in a recent article by comparing different forms of
mathematical arguments:
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
98 Valeria Giardino
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 99
For example, we have recently shown that gestures can improve memory and
facilitate inferences about the features of an environment (Jamalian et al. 2013).
A small text illustrates the environments in two different alternative ways: as
in a topographical map (in a bird’s eye view), which is, in Krämer’s terminology,
the model for structural space, and as in a participant-oriented description of a
series of movements (“enter the room”, “go left”, “you see in front of you”, etc.),
which are instead the prototypical representational modality for movement
space. Recall that according to Krämer, thinking and understanding are in fact
inherently spatial. My suggestion is that even if we do not want to endorse
such a strong claim, we can accept that the power of cognitive tools resides in
their being an occasion for various cognitive systems to work concertedly. The
hypothesis about the existence of diagramming is in line both with the approach
of the moderate embodied cognition, according to which some abilities are re-
cruited for new tasks, and with the space to reason theory. Moreover, diagram-
ming and writing are clearly connected, in a non-trivial way: in fact, diagramming
seems to be the condition for the introduction of writing, but at the same time,
writing, once developed, can shape our forms of diagramming, as I have dis-
cussed above. The framework I propose will also be able to clarify how impor-
tant mathematics is from a cognitive point of view: in mathematics, we have
writing, we have manipulations and operations, we have space plus action,
all this in continuous interaction with the definition and the formulation of
concepts. By doing mathematics, we explore our ability of diagramming and we
invent new spaces, new objects to embed in them and new actions to perform
in and on the available representations
To sum up, I have first pointed out that, instead of assuming a sharp
distinction between visual and sentential thinking, it is more interesting to
differentiate two representations on the basis of their informational equivalence
and their computational differences. I have claimed that diagrams are in most
cases computationally advantageous in comparison to language, because our
cognition is (moderately) embodied and space can have a cognitive representa-
tional role. Moreover, diagrams are also ‘written’ objects, which are traced on a
piece of paper or shown on a computer screen. This brought me to the investiga-
tion of writing and its cognitive and cultural relevance. Finally, I have con-
sidered the role of action in diagrammatic reasoning, both in gestures and in
epistemic actions on mathematical signs, and proposed that diagrams’ function-
ing depends on diagramming, which is the typically human ability of facilitating
the interaction of different cognitive systems by means of some particular cogni-
tive tool, so as to enhance inference and reasoning.
As I said in my Introduction, the aim of this article was not to provide a full-
fledged theory on diagrammatic reasoning, but to sketch out the elements that
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
100 Valeria Giardino
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Behind the diagrams: cognitive issues and open problems 101
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Frederik Stjernfelt and Svend Østergaard
4 Diagrammatic problem solving
1 Problem solving
In this paper, we shall look at types of problems and their interplay with dia-
grammatic representation. The general notion of diagram goes back to Peirce
(e.g. “PAP”, in Peirce 1976b, 316ff), see (Stjernfelt 2007, chapter 4) for an in depth
introduction. Here, we will focus upon prototypical diagrams, a two dimensional,
stylized topological (geometric) representation of some subject matter. In some
cases this representation is analogical, as for instance, in a map of a country,
but in other cases it is a geometric representation of quantitative relations
as, for instance, in a pie chart diagram. The difference is whether we map an
already existing geometric relation onto the diagram, or whether it is quantita-
tive or other relations that are geometrized. If we add time as a dimension to
the traditional spatial dimensions, we can also interpret temporal relations as
geometric. Hence, for instance, a flowchart diagram of information currents in
a company will qualify as a diagram of geometric relations. However, there are
other relations than spatial/temporal and quantitative; for instance, interpersonal
relations: a crime investigator might set up a diagram of possible interpersonal
relations between the different suspects and associates in order to get an over-
view of the problem. In short, a prototypical diagram is a two-dimensional geo-
metric representation of something we may qualify as “relations” which might
then be spatial/temporal relations, quantitative relations, interpersonal or other
relations.
When we have a diagram it is possible to explore it. This exploration can
take two forms. 1) the exploration does not add anything to the diagram but con-
sists in an examination of possible true statements that may be deduced directly
from the diagram. This is the case of most information in both the pie chart
diagram and the map of the country. From the map we can read the distance
between A and B (given the map scale), find the shortest route between A and
B etc. without adding anything further to the map.1 2) In the second type of
1 Even in this case, however, it could be argued that something is added – namely the points of
departure and destination and the line routes explored between them. In this sense, the distinc-
tion we are making is rather one of a continuum between less and more manipulation/addition.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
104 Frederik Stjernfelt and Svend Østergaard
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Diagrammatic problem solving 105
The classical example mentioned by Peirce is Euclid’s proof that the sum of
the angles in a triangle is 180, a version of which is given above (Figure 25).
Given the triangle ABC, you extend the line BC to D, and from C you draw a
new line CE parallel with AB. The angle between this new line CE and AC is
identical to the angle A and the angle between the new line and the extended
line CD is identical to the angle B. So, the three angles meeting in C are the
same as the three angles of the triangle, and as BD is a straight line, the three
add up to 180°. Hence, the sum of the angles is 180°. Although two new ele-
ments are added, they do not come from outside the problem space, on the
contrary they, as well as the triangle, are part of the two dimensional space
and they are drawn according to the general axioms of Euclidean geometry –
which are analogous to the chess player who imagines moves in accordance
with the general rules of chess. In both cases, however, what is additionally
required for the problem solver is some strategic information. It is not sufficient
to know Euclid’s axioms and chess rules – in both cases, some sort of strategy
must be pursued: the relevant auxiliary lines to add must be chosen over an
infinity of others, and strategically clever moves sequences must be chosen
among the finite set of possible moves.
A subclass of problems in the general category of information extraction
from the diagram space is what we could call a perspective shift. This might
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
106 Frederik Stjernfelt and Svend Østergaard
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Diagrammatic problem solving 107
To the left we have the different positions that the problem solver will move
between without getting the solution, but she might accidentally cross the
boundary and hit the target element on the right, in which case the solution
becomes trivial.
The insight problems are opposed, of course, to problems that do not require
insight. The border between the two classes is slippery, though; take for instance
a chess problem. One can make a program that can solve any problem instantly,
not by insight but by systematically going through all possibilities until it hits
upon the solution. For humans, however, the cognitive process that leads to the
solution will proceed along strategic schemata and have the characteristics of
insight which possibly is measurable in the problem-solving individual as an
increase of entropy in behaviour just before discovering the target move. For
instance, in Metcalfe and Wiebe (1987), people’s feeling of warmth are recorded
while they solve insight and non-insight problems. There was a progressive
increase in warmth during non-insight problems. But with insight problems
the warmth ratings remained at the same low level before suddenly increas-
ing dramatically shortly before the solution was reached. Similarly, Stephen,
Boncoddo, Magnuson and Dixon (2009) show that when people solve a gear
problem (see below), there is an increase in entropy just before they see the
optimal strategy of counting the gears. In this case, entropy is measured through
eye tracking, i.e., the gaze pattern is stable for a long time, but just before reach-
ing the solution it becomes instable. So it seems that, in terms of behavioural
dynamics, the insight problems are determined by a catastrophic point, namely
reaching the target element, whereas solving non insight problems is determined
by a continuous process reflecting that these problems are routine tasks and tasks
where information is used incrementally, as, e.g., in the Tower of Hanoi problem
(Ohlsson 1992).
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
108 Frederik Stjernfelt and Svend Østergaard
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Diagrammatic problem solving 109
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
110 Frederik Stjernfelt and Svend Østergaard
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Diagrammatic problem solving 111
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
112 Frederik Stjernfelt and Svend Østergaard
counting may also proceed as “odd-even, odd-even, . . .” or “A-B, A-B, . . .”). The
purpose of the experiment is to see if the participants get to this insight, and
if so, how. If they do, to what extent can this be predicted from the gestures
(diagrams) they produce from the very beginning?
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Diagrammatic problem solving 113
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
114 Frederik Stjernfelt and Svend Østergaard
1. Drawing a full circle for each wheel, typically with just one finger on the
screen
2. Drawing the half arc of a circle before continuing to the next wheel, typically
with one finger.
3. Wriggling the open hand left or right over each wheel
4. Drawing one continuous curve following the outlines of the gears
5. Pointing sequentially to wheel after wheel, typically with one finger
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Diagrammatic problem solving 115
These different gestural and verbal behaviours are found to combine in the
following sets of stable patterns:
gesture language
f Chaotic use of gestures f Causal
f Circle/half circle f Causal
f Circle/half circle f Direction
f Wriggle f Causal
f Wriggle f Direction
f Continuous curve f Silence
f Pointing f Direction
f Pointing f Counting
Figure 30: Flowchart diagram of different phase transitions between stable solution strategies.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
116 Frederik Stjernfelt and Svend Østergaard
There is not any very simple pattern of phase transition between two states
to be found, as suggested by Stephen, Boncoddo, Magnuson & Dixon (2009).
Rather, a more complicated landscape of phase transition appears, with two
attractors: the continuous curve-following and the discrete marking of the two
directions, terminating in counting. Gestures accompanied by causal reasoning
are generally instable as a strategy and will develop into either of the two stable
states, the Continuous-silence and the Pointing-counting positions. Obviously,
not all pairs of respondents reach one of these end points, but it seems as
if those settling upon the less than optimal Continuous-silence strategy will
typically remain there. All the different strategies characterized by a verbal
indication of direction, however, may develop further to Pointing-direction or,
ultimately, to the Pointing-counting strategy.
There are some indications that early behaviour during the trials may indi-
cate later strategy choices. Early alternation between left-right movements might
be indicative of later discretization in terms of Wriggle-direction or Pointing-
counting, while, on the other hand, reliance upon causal arguments following
wheel contours may be indicative of settling for the Continuity-silence strategy.
It is interesting to compare the strategies by reaction time at the last trial:
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Diagrammatic problem solving 117
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
118 Frederik Stjernfelt and Svend Østergaard
This may give rise to the following hypothesis. Not all reasoning is based on
immediate embodied experience – but rather on abstraction from immediate,
concrete and local embodied experience. It should be added, however, that this
abstraction is by no means completely disembodied but accompanied with
specific embodied strategies supporting abstraction – in this case the wriggling
and pointing gestures.
As to the collaboration issue, the provisional results of the Cog Wheel
experiment seem to indicate that mixed-media strategies involving both language
and gesture are the more efficient. Moreover, comparisons with individuals solv-
ing the same task seem to show that pairs are more efficient than individuals.3
Figure 32: Reaction time on last trial showing the advantage of collaboration.
Why do pairs perform better? A hypothesis may be that several reasons com-
bine here. One explanation is that the task affords abstraction – and as pairs
speak, language may form a route to finding the more abstract solution strategies.
Another explanation is that individuals may tend to get stuck with their first
3 These provisional data were collected at a public performance at the “Science in the City”
festival at the ESOF conference, Copenhagen, June 2014.
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
Diagrammatic problem solving 119
solution, while pairs may bring different perspectives and strategies motivating
the intuition that there might be more than one solution strategy to be tried
out. A further explanation may be feedback between the two parties, in the
shape of one participant watching the other’s gesture and drawing further
conclusions; more generally in the shape of collaboration or competition or
both (the two should not be seen as mutually exclusive, rather as two feedback
aspects which may even enhance each other).
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use
EBSCOhost - printed on 4/6/2023 12:15 PM via UNIVERSITT FR WEITERBILDUNG KREMS. All use subject to https://www.ebsco.com/terms-of-use