Professional Documents
Culture Documents
Matthew O. Peterson
To cite this article: Matthew O. Peterson (2018): Aspects of visual metaphor: an operational
typology of visual rhetoric for research in advertising, International Journal of Advertising, DOI:
10.1080/02650487.2018.1447760
Article views: 7
Introduction
In 2004, advertising researchers Barbara Phillips and Edward McQuarrie published a signif-
icant theoretical paper on aspects of visual rhetoric in advertising, going ‘beyond visual
metaphor’ but still largely inclined towards it. They outlined a typology that classifies pic-
torially rhetorical ads in terms of the effort required of viewers to understand the message.
Their contribution appeared to fulfil a body of literature that began with attempts to clas-
sify pictorial persuasion according to distinctions derived from verbal persuasion, but
increasingly addressed the image on its own terms, sensitive to the fundamental condi-
tions of that representational code. I will herein address the visual metaphor components
of Phillips and McQuarrie’s typology and propose two kinds of additions. These additions
are meant to go beyond the elegant but perhaps artificial clarity of the typology to better
address the diversity of visual metaphor expression as it relates to viewer processing
demands. On the one hand, many instances of ads that are ambiguous regarding Phillips
and McQuarrie’s variable of visual structure are rendered typical through expansion, pri-
marily with the definition of intermediate types. On the other hand, new variables are
proposed that do not fit neatly into the typology but rather suggest a profile for ads that
helps to explain the interpretational challenges they pose.
Ultimately, this contribution is intended to further operationalize Phillips and McQuar-
rie’s work, especially for use in experiments on visual metaphor in advertising. Despite the
clarity of their typology, subsequent researchers have developed their own alternatives,
rather than more directly building on it or simply utilizing it. Ideally, the present typologi-
cal expansion, in contrast to those parallel efforts, represents progress.
Of special note regarding progress, this expansion seeks to more finely parse viewer
processing operations in relation to the types. This is done by distinguishing stages of
visual metaphor construction and predicting which stages are differentially problematized
by each metaphor type. The processing distinctions indirectly follow the work of Sorm and
Steen (2013).
distinguish visual metaphor. Conceptual metaphor theory (Lakoff and Johnson 2003) pos-
its that metaphor is no mere manipulation of language, but represents a primary means
by which we humans make sense of the world: our conceptual knowledge is metaphori-
cally constituted. Furthermore, complex concepts can be traced back, through successive
metaphorical mappings, to the bodily experiences we began to organize in infancy
(Johnson 1987). Thus what was understood as metaphor in traditional rhetoric represents
the verbal expression of an underlying capacity, not an invention all its own.
Dual coding theory (Sadoski and Paivio 2001) describes information processing and
memory in terms of dedicated cognitive architecture. Working memory, the seat of con-
sciousness, includes two separate but complementary resources for processing informa-
tion. The phonological loop is evolved to process language, be it encountered through
sound (speech), sight (text), or touch (braille), and works exclusively with the serial or lin-
ear structure inherent to language. The visuospatial sketchpad is evolved to process imag-
ery, and works in a parallel structure where schematic relationships are retained. The
former is the verbal system and the latter the visual system. Information is typically stored
in long-term memory in relation to the code in which it was encountered, and there it
retains its fundamental characteristics. Thus we have distinct resources for two primary
codes; they are not equivalent.
In relation to conceptual metaphor theory and dual coding theory, metaphor pre-exists its
formal expressions. And those verbal and visual metaphor expressions, being of cognitively
distinct codes, are themselves fundamentally distinct. If we accept these well-supported the-
ories, then we should not expect visual metaphor to derive directly from its previously stud-
ied cousin, verbal metaphor. Earlier attempts to describe visual metaphor depended upon
distinctions from traditional verbal rhetoric, and sought equivalents to relationships such as
simile to metaphor, a distinction that is entirely linguistic (e.g. Durand 1987; Forceville 1996;
McQuarrie and Mick 1996). If we begin our study of pictures with assumptions from verbal
studies, we are likely to overlook that which does not neatly fit into that frame. The following
section provides an overview of the study of metaphor in advertising, which ultimately sepa-
rated visual rhetoric from its cousin. However, first, it is useful to reflect on what benefits are
derived from the use of metaphor, especially in advertising.
Phillips (2003) lists the basic outcomes of metaphor use in advertising as attention,
elaboration, pleasure, and liking. Attention appears to be especially important in advertis-
ing as viewers typically encounter ads when they are engaged with something else.
Advertisers thus face proposition rejection, and metaphor can mitigate this (Scott 1994).
Elaboration is a process wherein the viewer draws an inference, generates assumptions,
and integrates them with prior knowledge. If the search for a simple inference fails, alter-
natives are considered, and working memory is increasingly taxed (Phillips 2003). A rather
insidious potential outcome of metaphor as tied to elaboration is that viewers may have
fewer cognitive resources to ‘counterargue’ an ad’s claims because the search for infer-
ences requires significant effort (303, citing Kardes [1993]).
Pleasure and liking are a function of the inherent puzzle-solving process that metaphor
requires (McQuarrie and Mick 2003). Pleasure is also a function of aptness, which refers to
the inherent relation between the source and target domains (Phillips 2003). If source and
target are too similar, then the metaphor comes across as trite. Conversely, if the domains
are excessively dissimilar, the metaphor may prove too difficult to resolve, resulting in dis-
pleasure (Phillips 2003).
4 M. O. PETERSON
Metaphors can also strengthen an ad’s memory trace, especially when the message is
visually constructed (McQuarrie and Mick 2003). Of special interest to advertisers is that a
metaphorical comparison can ultimately become reasonable in the viewer’s mind, despite
it not literally being true (Phillips 2003, 298).
Kenney and Scott (2003) provide an extensive review of the visual rhetoric literature rel-
evant to advertisers. I will now give an especially abbreviated version primarily concerned
with the ultimate arrival of Phillips and McQuarrie’s (2004) paper on the structures and
operations of visual metaphor.
Visual structure
Phillips and McQuarrie (2004) describe visual rhetoric, inclusive of visual metaphor,
according to two dimensions: visual structure and meaning operation. It is the former that
is more clearly described, emergent from earlier literature, and the continued focus of sub-
sequent literature. I will describe this dimension, specifically in terms of visual metaphor,
and compare it to the corresponding dimension in other typologies before addressing
meaning operation and other differentiations across typologies.
Phillips and McQuarrie describe visual structure as the physical constitution of the two
domains (source and target) in a visual metaphorical ad. They identify juxtaposition, fusion,
and replacement structures, and predict that in this order the types represent increasing
complexity in terms of what the viewer must do to identify the two domains. This is an
important distinction as it lends theoretical weight to the classification.
Juxtaposition structures in visual metaphor depict both source and target, and each is
a complete entity. In the classic Saab ad reproduced in Figure 1, a car (target) is related
to a jet (source) – they are positioned beside one another on a runway – and takes on
some of its attributes. Fusion structures also picture both domains, but they are artificially
combined to create a hybrid entity. In Figure 2, a bottle of ketchup (target) is made up of
slices of tomatoes (source) in what is, in the photorealistic original, a convincing combi-
nation. Phillips and McQuarrie explain that fusion is more complex than juxtaposition
because the viewer must do the work of separating out source and target from a com-
bined entity.
Table 1. Typologies related to visual metaphor structure.
Phillips and Van Mulken, Le Pair, and Gkiouzepas and
Aspect Kaplan (1990) Kaplan (1992) Forceville (1994) Forceville (1996) McQuarrie (2004) McQuarrie (2008a) Forceville (2010) Hogg (2011)
Structural Metaphor structure Metaphor form Visual structure Figurative visual structure Spatial layout (increasing Objects’ mode of
dimension title (increasing (increasing complexity) deviation from expectations representation
(assumption) complexity) and increasing complexity) (increasing
elaboration)
Juxtaposition Association Juxtaposition (verbal M2, Metaphor with two Pictorial simile Juxtaposition (two (1) Juxtaposition (A beside B) Simile (both source and target Juxtaposition (whole
equivalent equivalent: simile) pictorially present terms; side-by-side domains are visually entities; equated with
Metaphor in praesentia images) presented separately) rhetorical scheme)
Fusion equivalent Transformation Identity (verbal (included in M2) M2, Metaphor with Fusion (two (4) Fusion (A fused to B to Hybrid metaphor (combines Synthesis (part of the
equivalent: two pictorially combined images) form AB) target and source domain into entities; equated with
metaphor) present terms; a single gestalt) rhetorical trope)
Metaphor in
praesentia
Replacement (none) (none) M1, Metaphor with one M1, Metaphor with Replacement (image (5) Replacement (A in place of Contextual metaphor (the source (second dimension is
equivalent pictorially present term; one pictorially present points to B) or the target domain is also largely structural
Metaphor in absentia present term; an absent image) visually absent, in such a way and includes
Metaphor in that the absent domain is Replacement)
absentia evoked by the visual context)
Additional Second-order reference (2) Inclusion (A inside B)
structural (metaphor within a (3) Combination (A
types metaphor) combined with B to form C)
(6) Removal (A and not-B)
Second (multiple) (multiple) (multiple) (multiple) Meaning operation (multiple) Visual scenario
dimension (increasing (increasing artificiality
(assumption) richness) or deviance)
Second Metaphor type: Lever; Web; Type of tension Aspects or possibilities Aspects or Connection (A is Layout: Documentary Realistic symbiosis (no
dimension Machine in the garden; exemplified: mentioned: possibilities associated with B) (picture confined to intruder)
types Synthesis of old and new Linguistic; Pragmatic; Text anchoring (when text mentioned: Similarity window); Pictorial (picture Replacement
values; Revolutionary; Hermeneutic message reinforces rather Verbo-pictorial comparison (A is takes over ad) (intruder)
Other Metaphor content: than directs, pictorial metaphor like B) Look-through (picture is a Artificial symbiosis
Role of the text: Needed to Orientational; metaphor comes close to Verbal anchoring Opposition copy of objects); Look-at (not applicable)
understand; Reinforces but Ontological standing on its own) Pictorial context comparison (A is (picture presents itself
not necessary; No Metaphor creates similarity not like B) rather than representing
relationship (does not reveal a pre- objects)
existing similarity) Genre: Tableau; Frozen
Linguistic message narrative; Rhetorical figure
Decoded message
Connoted message
Source (term) Vehicle Source (states B-term (figurative); SS, Figurative secondary (Image element) (Pictorial element) Source (Metaphorical object)
preference over Secondary subject subject
vehicle)
Target (term) Tenor Target (states A-term (literal); PS, Primary Literal primary (Image element) (Pictorial element) Target (Metaphorical object)
INTERNATIONAL JOURNAL OF ADVERTISING
Figure 1. Example of a juxtaposition structure. Illustration by Scott Durand after the original Saab ad.
Replacement structures entirely omit one of the domains, either source or target, and
suggest it through context. In Figure 3, which is an example provided by Phillips and
McQuarrie, the source of wine is never shown, but the positioning of Welch’s grape juice
bottles on a wine rack in a cellar suggests it. Replacement is predicted to be the most
complex of structures because the viewer must entirely imagine one domain.
Phillips and McQuarrie’s terms are entirely visual-code-dependent. Broad typologies
such as McQuarrie and Mick (1996), Forceville (1996), and Durand (1987) use terminology
that is adopted from verbal rhetoric.
Kaplan (1990, 1992) ultimately identifies types equivalent to two of Phillips and
McQuarrie’s structures, association (1990) or juxtaposition (1992) for juxtaposition and
transformation (1990) or identity (1992) for fusion, but nothing like replacement. Forceville
(1994) initially collapses juxtaposition and fusion into M2, ‘metaphor with two pictorially-
present terms,’ or metaphor in praesentia. He later separates juxtaposition out as pictorial
simile (1996). Replacement is identified from the start, as M1, ‘metaphor with one pictori-
ally-present term,’ or metaphor in absentia.
Though Phillips and McQuarrie appear to have elegantly processed this earlier work,
settled on concise terminology, and added an actionable prediction regarding complexity,
subsequent work continues the negotiation of visual structure typology. McQuarrie
(2008b) even provides a ‘toolkit’ for classification exercises. Forceville’s work continues in
the collaboration (Van Mulken, Le Pair, and Forceville 2010), where yet newer terms are
adopted for the same concepts: simile (actually from Forceville 1996), hybrid metaphor,
and contextual metaphor. ‘Contextual metaphor’ highlights the significance of context in
replacement structures, but context can also play a role in juxtaposition and fusion ads
INTERNATIONAL JOURNAL OF ADVERTISING 7
Figure 2. Example of a fusion structure. Illustration by Matthew Peterson after the original Heinz ad.
(the runway environment of Figure 1, for instance, helps to identify source and target from
the two domains in a juxtaposition structure).
Gkiouzepas and Hogg (2011) give an example of the expansion of visual structure
along with reconsideration of established types. They provide two dimensions, with juxta-
position (equated with rhetorical schemes) and synthesis (equated with rhetorical tropes
and equivalent to fusion) as modes of representation, and realistic symbiosis, replacement,
and artificial symbiosis as visual scenarios. Gkiouzepas and Hogg raise some interesting
issues. In particular, they appear to separate seemingly confounded aspects of Phillips
and McQuarrie’s (2004) juxtaposition and replacement: it is possible for one domain to
conceptually replace another in a picture, while other examples of the replaced domain
are also present. They give the example of a car positioned amidst deck chairs surround-
ing a swimming pool (Figure 4). However, when Gkiouzepas and Hogg provide their own
examples of the six types described by their matrix structure (two modes of
8 M. O. PETERSON
Figure 3. Example of a replacement structure. Illustration by Eric Pryor after the original Welch’s ad.
representation by three visual scenarios), the differences between some types appear
inconsequential. Unfortunately, their types do not appear to be as clearly defined as Phil-
lips and McQuarrie’s (2004).
McQuarrie (2008a) provides an expansion of the visual structure dimension from Phil-
lips and McQuarrie (2004). Phillips and McQuarrie had only allowed that visual structure
could be further subdivided within their types (for instance, they mention vertical or hori-
zontal arrangement of juxtaposition, which is a more trivial distinction).
INTERNATIONAL JOURNAL OF ADVERTISING 9
Figure 4. Example of a juxtaposition-replacement structure, from Gkiouzepas and Hogg (2011). This
structure is called replacing juxtaposition in the present expanded typology. Illustration by Matthew
Peterson after the original Chrysler ad.
McQuarrie (2008a) expands the list to six types in the following order: juxtaposition,
inclusion, combination, fusion, replacement, and removal. However, he does not provide
any visual examples and only describes one instance of each, with a single sentence per
type. This makes it difficult to evaluate the expanded typology. He describes the three
new types roughly as follows: inclusion finds one domain positioned within another; com-
bination is a lesser form of fusion where the domains are less distorted by each other; and
removal withholds an ‘expected complement’ to one domain (106–107). McQuarrie’s
expanded typology is difficult to evaluate or utilize with the level of detail provided.
10 M. O. PETERSON
For instance, if the distinction between combination and fusion is one of degree, a func-
tional rubric would be needed to distinguish the two. However, with this as in most of the
typologies listed thus far, more than visual structure is addressed.
I here propose an expansion of the typology for the purpose of better explaining the
variation of visual metaphor in advertising and thus supporting its use in research.
McQuarrie (2008a) expanded the typology in a similar vein but his distinctions are not
clear and do not appear to remove the same problems as the following proposal. Phillips
and McQuarrie (2004) originally allowed that only subcategories might exist, and further-
more, they asserted that their list was functionally ‘exhaustive’ and that any additional
types were not likely to produce meaningful differences in advertising (118).
The problems that inspired the following typological expansion were first encountered
by the author and collaborators, whilst preparing stimuli for a series of experiments using
Phillips and McQuarrie’s visual structure (two of the three studies to date have been sum-
marized: Peterson et al. 2017; Wise et al. 2017). As an example, replacement as defined by
Phillips and McQuarrie omits either source or target and suggests it through context. It
would appear that replacement thus has paired distinguishing aspects: one domain con-
ceptually replaces the other, and this is how the absent one can be identified; and one
domain, either source or target, is absent. But these aspects can be decoupled, as in the
earlier example from Figure 4 of the car amidst deck chairs by a pool. In this Chrysler ad,
the car conceptually replaces a deck chair, yet as other deck chairs are pictured, the viewer
needs not call to mind the image of a deck chair (it is visualized in multiple). This appears
to present the viewer with a slightly different task from either juxtaposition or replace-
ment. It is not clear from Phillips and McQuarrie’s definitions if the concept of replacement
is crucial, or if the absence of one domain is the lone defining aspect of such structures.
The expanded typology proposed here supports leaving this issue open to investigation
by isolating types according to such variation. As such, the typology is provisional: it
retains separations where they may reasonably prove significant. Future work could very
well suggest collapsing types. The following typology appears to more clearly identify a
wider range of visual metaphor structures. Most are represented in Figure 5, based on the
Welch’s ad from Figure 3, where Welch’s grape juice is target and wine is source. Fusion is
represented in Figure 6, along with further examples of replacing fusion, using other
source–target pairings.
(1) Identification: only one domain is represented pictorially; the other takes textual
form and the relationship is often established by typographic conventions (i.e.
arrangement). Identification utilizes the basic typographic strategy of labelling, as
seen in Figure 5(a,b). Similar to Forceville’s (1994) text anchoring and (1996) verbo-
pictorial metaphor.
(2) Pairwise juxtaposition: source and target entities are presented singly and fully.
They are related largely by association, by virtue of their likely unexpected pairing
in relative isolation (Figure 5(c,f)). Contextual factors (such as an environment or
awareness of the advertiser) help the viewer determine the directionality of map-
ping. Equal to a strict interpretation of Phillips and McQuarrie’s (2004) juxtaposition.
(3) Categorical juxtaposition: a source entity is presented in relation to a target set,
which calls to mind a category to which all pictured members of the set belong
(Figure 5(g)). The target set members are each unique. Since the target set members
are pictured and not the target itself, the target takes the form of a concept and not
an entity, which cannot be accomplished directly with any other type that utilizes
imagery alone. The source is thus related to the target category more so than it is to
12 M. O. PETERSON
Figure 5. Expanded typology exemplars, from identification to replacing fusion, adapted from Welch’s
example from Figure 3. Identification (a, b); pairwise juxtaposition (c–f); categorical juxtaposition (g);
replacing juxtaposition (h); replacement (i) with requisite context (setting); and replacing fusion (j).
Illustrations by Lucas Albrecht.
INTERNATIONAL JOURNAL OF ADVERTISING 13
Figure 6. Expanded typology exemplars, differentiating fusion (d, e) from replacing fusion (a–c) visual
structures. Illustrations by Scott Durand (b) and Matthew Peterson after the original ads.
(6) Replacing fusion: a part of one entity is replaced by either another whole entity or a
part thereof (Figure 5(j), where a juice bottle’s lid becomes a wine bottle’s cork). The
resulting hybrid entity is an artificial creation. Often the part replaced has a special
function. When one entity is more complete and partially modified (through the
concept of replacement) – a common relationship – this builds directionality into
the visual structure, making the designation of source and target more readily
apparent. This may be akin to McQuarrie’s (2008a) distinction of combination from
fusion.
(7) Fusion: two entities are fused together to create a new hybrid entity (Figure 6(d,e)).
The viewer must separate out the entwined entities to identify them individually.
Context is likely required to determine the directionality of mapping. Equal to Phil-
lips and McQuarrie’s (2004) fusion with the distinction that the hybridization is
wholesale instead of focused on one functional part.
The subtleties exhibited in Figures 5 and 6 beg further elaboration. The examples of
Figure 5(c–f) are proposed as pairwise juxtaposition variations. Figure 5(c) (like 5d)
relates two bottles, a Welch’s grape juice bottle and a wine bottle. This is a most direct
or ‘apt’ comparison, yet still figurative. Figure 5(e,f) utilizes a glass of wine and a cork-
screw instead of a wine bottle. Since, in all cases the source is ultimately wine, the glass
and corkscrew may appear related to replacement, with the juice bottle conceptually
replacing the wine bottle that would more sensibly be positioned beside glass or cork-
screw. However, this line of reasoning is more convoluted than simply considering a
glass of wine or a corkscrew as conventionally related to wine (with some immediacy),
and thus executing a juxtaposition comparison of grape juice (via juice bottle) to wine
(via wine glass or corkscrew). Indeed, the juice bottle itself is essentially standing in for
the grape juice it ostensibly contains, as a more direct puddle of juice is not pictured. It
is assumed here that the more direct line of reasoning is more likely to occur, and thus
Figure 5(e,f) is a better example of pairwise juxtaposition than some form of replace-
ment. Nevertheless, Figure 5(f) would appear a relatively more complicated form of
pairwise juxtaposition than 5(c) – not all examples of one visual structure are created
equal.
Figure 5(c,g) compares pairwise juxtaposition to categorical juxtaposition, respectively,
despite apparent similarity. Both source and target are presented as complete, and thus
each is a type of juxtaposition. It is theorized here that in the pairwise juxtaposition exam-
ple, the viewer must relate two entities together despite a weak visual relationship
between them. The visual relationship in the categorical juxtaposition example appears to
be more explicit: the juice bottle is within a set of wine bottles. Thus it is like a type of
wine bottle. A more extreme and more forcefully rhetorical example better demonstrates
this distinction, though it is not pictured here. The concept of technology could be sug-
gested by the following set of illustrations: campfire, stone wheel, aqueduct, printing
press, light bulb, biplane, rocket, and personal computer. If a product were inserted into
this arrangement – e.g. a new bra design, a smartwatch – the implication would be that
this unexpected ‘intruder’ represents a major technological advancement. In this case, the
bra or smartwatch is not likely compared with each other entity individually (e.g. bra–
wheel, bra–aqueduct), but rather it is related to the concept they represent as a set: tech-
nology. Thus, technology is source, and bra or smartwatch is target.
INTERNATIONAL JOURNAL OF ADVERTISING 15
which requires a directionality for mapping between entities. If the two halves are part
of the same man with meaningful symmetry, this might be considered something like a
visual portmanteau; otherwise, it might be considered to house multiple reflective
metaphors, with athlete mapping to father and father mapping to athlete. But the ad is
for a fitness club and it is aimed at parents; thus the context determines source (athlete)
and target (father): Anytime Fitness makes the father into an athlete. Phillips and
McQuarrie (2004) use a similar example to illustrate fusion. Here it is clarified as replac-
ing fusion, with part of the source replacing part of the target. It should be clear that
the expanded typology does not settle all debates outright. It does, however, give
grounds to settle them.
In most cases of visual structure, the domains are discrete picturable entities, for the
simple reason that in visual metaphor, as I have defined it here, domains are only repre-
sented through pictures. Identification is an exception, but it is not strictly visual meta-
phor as defined. (It is useful here to bracket off such cases with a described type.) But
categorical juxtaposition does permit the advertiser to work with a concept without
using text. There are also conventional pictures that are used to represent concepts, such
as the heart shape that signifies love. It would thus seem as though any visual structure
could be used to position a concept as source or target. However, in cases such as these,
where the picture is so conventionalized, it is functioning more symbolically in the man-
ner of language.
It is no mistake that the order of types differs from Phillips and McQuarrie’s original. The
ordering here is purely logical: replacement is listed between juxtaposition and fusion
because it shares border types with both. Conversely, there does not appear to be a theo-
retical border type between juxtaposition and fusion (and I have found no ads to suggest
otherwise). This reordering need not suggest a stance at odds with Phillips and McQuar-
rie’s hypothesis that replacement is more complex than fusion, which in turn is more com-
plex than juxtaposition. Peterson et al. (2017) did find evidence, using an experimental
design, that fusion is more complex (requires more cognitive resources) than juxtaposi-
tion, though this result begs reproduction and could involve confounding factors. What-
ever the general complexity inherent in visual structures, the expanded typology offers
insight into the source of variations in complexity.
Figure 7. Example of multiple mappings. Peahen maps to woman and peacock maps to man in parallel
mappings of identification structure. These metaphors in turn are then related to one another through
an opposition comparison operation with juxtaposition structure. This second-order relationship (pea-
hen-woman to peahen-man) is a nested mapping. Thus the ad has both parallel and nested mappings.
It also features a third-order relationship between the nested metaphors and the magazine. Illustration
by Bryan Lorenz after the original Garage Magazine ad.
(9) Multiple mappings: nested, parallel, or redundant. Upon inspection, some ads fea-
ture multiple source–target pairs (see Figure 7 and its caption for a convoluted
example). The individual metaphors in an ad will together suggest an ultimate
message, but each technically must have its own implicature. Nested mappings
build one implicature into another. For instance, in an example from advertising
INTERNATIONAL JOURNAL OF ADVERTISING 19
not reproduced here: a cheetah’s spots over the body of a whale imbue the whale
with speed, the primary association of cheetahs; the cheetah–whale in turn sug-
gests that a media storage device, the object of the ad, is both fast and large. Paral-
lel mappings contribute to the same message in comparison with one another.
Some otherwise parallel mappings are entirely redundant.
(10) Informational loci (in terms of quantity): when using authentic ads for empirical
study, researchers should consider how many points of attention – or ‘areas of
interest’ – are featured in the ad, including logos and supporting text. Total word
count is of interest as well.
(11) Illustration style, especially in terms of complexity: illustration style variation
includes grainy photographs, line drawings in a political cartoon style, beautifully
rendered watercolour paintings, and countless other possibilities (variations in
detail between black line art examples are demonstrated by comparing Figures 2
and 5). The rendering of pictures should impact engagement duration and
enjoyment.
Figure 8 illustrates multiple variables in the profile. This is a replacement image: a target
dog is replaced by a source elephant (albeit a small one); the elephant–dog is thus heavy.
The object of the ad, a Nissan truck, is external to that metaphor, making it indirect.
The children in Figure 8(a) are distracters (Figure 8(b) is true to the original). The versions
presented vary in environmental apportion, with Figure 8(a) isolated and Figure 8(b)
contextualized.
Figure 8. A visual metaphorical ad (a) with and (b) without distracter entities. Illustration by Sander
Weeks after the original Nissan ad.
A number of these stages can be repeated and thus the order can change.
Sorm and Steen’s processing model can be understood in part by considering the pri-
mary sources of Leder et al. (2004) and Steen (1994). Leder et al.’s (2004) model of aes-
thetic experience, which considered art and not advertising, includes five core stages
with additional inputs and outputs. For instance, inputs include previous experience
and pre-classification. The core stages are perceptual analyses, implicit memory integra-
tion, explicit classification, cognitive mastering, and evaluation. Leder et al.’s model is
itself of limited efficacy here because most of its concerns fall outside of the unique proc-
essing demands of visual metaphors. Much of the processing of a visual metaphor in
structural terms (i.e. recognizing an incongruity, relating target to source, transferring
attributes, and forming an implicature and a message) would fall under a single subpro-
cess of the model: content under explicit classification. This is because visual metaphor is
primarily a matter of meaning. As argued above concerning the profile of visual meta-
phor, illustration style, which could be analyzed according to Leder et al.’s model, should
impact interactions with ads that feature visual metaphor. However, illustration style or
aesthetic quality is effectively an independent variable, external to the structure of visual
metaphor.
Steen’s (1994) model, incorporated into Sorm and Steen’s (2013) model, is of verbal
metaphor processing. It is an inclusive model that includes one subprocess of reflection
(i.e. metaphor identification, in terms of a reader’s expression of awareness), one that is
elective or by my estimation may only rarely occur (i.e. metaphor context construction,
where the reader explicitly considers the author’s intentions), one specific to verbal struc-
tures (i.e. metaphor refunctionalization, where the reader returns to a previous sentence),
and another of aesthetic judgements (i.e. metaphor appreciation). The remaining
22 M. O. PETERSON
There are clear parallels between the above subprocesses and those of both Steen (1994)
and Sorm and Steen (2013). But certain fundamental differences suggest utility for this
new framework. Sorm and Steen’s model includes subprocesses that appear to represent
changes in understanding but that are not themselves processes that occur through time.
In particular, metaphor recognition and metaphor construction are outcomes rather than
processes that occur in time, and that are described in other subprocesses. The proposed
visual metaphor processing stages are alike in that they should each require time to
resolve, and are dependent upon one another for completion. In particular, they are predi-
cated on the schematic arrangement of elements particular to visual metaphor. That
domains (which are visualized entities in all cases save the absent domain in replacement
images) must be individually perceived (a more difficult prospect in fusion images),
individually fixed (i.e. recognized as meaningful), and related to each other is a matter of
two-dimensional schematic arrangement and can be partially observed by tracking eye
movements. The nearest equivalents to these proposed processes in Sorm and Steen’s
model are perceptual analyses, implicit information integration, and explicit classification,
which were all adopted from Leder et al. (2004) and concern aesthetic judgements. The
construction of meaning imagined here differs from aesthetics.
Figure 9 relates the proposed visual metaphor processing stages to the expanded
typology of visual structure. Each visual structure should be expected to problematize
some subprocesses while better automating others. This is indicated with scale in Figure 9
in terms of how difficult a subprocess is likely to be as related to visual structure, with a
larger circle indicating that a subprocess is hypothesized as being the most problematic
for a given visual structure, and a smaller circle indicating the opposite. For instance, ges-
ture recognition is hypothesized to be relatively difficult (indicated by a large circle) for
pairwise juxtaposition because the picture presents both source and target entities as
complete. Thus the image is more straightforward and it is not unlikely that the viewer
fails to recognize the incongruity. Conversely, gesture recognition should be relatively
easy (indicated by a small circle) for fusion because the forcible combination of source
and target entities is an overt manipulation by an illustrator or designer. Essentially, the
forcible combination of entities that defines fusion is itself a gesture.
The estimations of subprocess difficulty in terms of visual structure in Figure 9 are not
intended to be quantitatively accurate when comparing across subprocesses (e.g. though
both feature larger circles in the diagram, gesture recognition in pairwise juxtaposition
may generally require more engagement than domain perception in fusion), but merely
to suggest general expected relations when comparing across visual structures. The esti-
mations also assume all other things being equal, there being many other factors covered
in the proposed visual metaphor profile that will impact functional complexity.
As is evident given these assumptions, it is difficult to declare one visual structure as
generally more complex than others, as Phillips and McQuarrie (2004) have done. Instead,
this framework seeks to qualify complexity with some level of detail. In laboratory use,
24 M. O. PETERSON
visual structures may prove more or less complex by virtue of how complexity is
measured.
Despite the provisional nature of the predictions in Figure 9, it would be more useful if,
following empirical study, some visual structures do indeed prove to be overall more com-
plex than others. It is too early, however, to make such a wholesale prediction when sub-
processes appear to complicate the matter.
Discussion
Experimental issues with visual metaphor
The visual metaphor profile, as summarized in Table 2, acknowledges the complexity of
visual metaphors as well as the ads that present them. However inelegant its extensive-
ness, each variable should be considered when investigating visual metaphor in
INTERNATIONAL JOURNAL OF ADVERTISING 25
advertising. I do not mean to suggest that each variable is necessarily worthy of study on
its own. Rather, when a variable of visual metaphor is under investigation, be it included
in the profile or not, profile variables will ideally be manipulated or selected against to
remove their confounding effects. For instance, replacement images by definition have an
environmental apportion. This suggests that any comparison between replacement and
other visual structures should only include ads with environmental apportion. Of course,
many of the profile variables do appear to be worthy of study.
This proposal is necessarily limited. It is an argument that begs empirical support.
Indeed, I have attempted to provide a structure to which hypotheses may be formed and
tested, while arguably offering less definitive predictions than did Phillips and McQuarrie
originally. I make the case that visual metaphor is complicated enough that complexity
and richness must follow the consideration of individual subprocesses of interpretation.
Researchers can use the visual metaphor profile, with a sensitivity to the subprocesses
of interpretation, to refine investigation into the nature of visual metaphor. Even if
researchers wish to test Phillips and McQuarrie’s hypotheses as limited to their three-struc-
ture typology, the expanded typology presented here should help to ensure that ad stim-
uli represent unambiguous versions of juxtaposition, fusion, and replacement structures,
by explicitly avoiding examples of categorical juxtaposition, replacing juxtaposition, and
26 M. O. PETERSON
Figure 10. Predictions for peaks of confusion and joy (both measurable under experimental conditions
with facial recognition software) according to pairwise juxtaposition, replacement, and fusion visual
structures.
source and target, so the gaze point cannot differentiate these entities. Furthermore,
replacement entirely omits one domain as it is only suggested through environmental
apportion, though in that case environmental fixations in gaze patterns may imply atten-
dance to the missing domain. Suffice it to say, as an internal process, visual metaphor
requires clever work from advertising researchers, and under given experimental condi-
tions, visual structures are not likely all created equal.
Due to the nature of meaningful imagery, it can be difficult, even with a developed the-
ory on visual metaphor, to ensure that stimuli prepared for a study properly qualify as
visual metaphor. I use the following simple notation for visual metaphor:
SðaÞ: T ¼ I
The notation tracks source (S), the attributes (a) of the source that are mapped (with direc-
tional mapping indicated with a colon), the target (T) onto which they are mapped, and
the resulting implicature (I). In cases where structure is mapped rather than attributes, an
‘s’ appears in the parentheses. The example from Figure 1 is thus represented as follows.
Jet ðwell engineered; fastÞ: car ¼ “the car is well engineered and fast”
While such articulation seems straightforward enough, it can offer some level of clarity
when faced with complicated pictorial messages.
vie for viewers’ attention, but also ideally maintain it for as long as possible under adverse
conditions. What is happening during sustained engagement? Visual metaphor is a kind
of puzzle to be solved by viewers. Thus advertisers who employ visual metaphor are
implicitly (though surely not always consciously) designing experiences.
Acknowledgments
Thanks to Kevin Wise at the University of Illinois for his previous collaborations on visual metaphor in
advertising, which helped focus my own thoughts as presented here.
The illustration work featured in Figures 1, 3, 6(b), 7, and 8 was supported by a College of Fine
and Applied Arts Creative Research Award from the University of Illinois.
Disclosure statement
The author has no potential conflicts of interest with respect to the research, authorship, or publica-
tion of this article.
Funding
College of Fine and Applied Arts, University of Illinois at Urbana-Champaign: Creative Research
Award.
Notes on contributor
Matthew O. Peterson holds a PhD degree in Design from North Carolina State University. He has
extensive professional experience in graphic and interaction design. His research investigates the
interpretation of meaningful visual media, including issues such as text–image integration, visual
INTERNATIONAL JOURNAL OF ADVERTISING 29
metaphor, and visual narrative. This theoretical and empirical work spans disciplines including
advertising, communication, and science education.
References
Ang, S.H., and E.A.C. Lim. 2006. The influence of metaphors and product type on brand personality
perceptions and attitudes. Journal of Advertising 35, no. 2: 39–53.
Chang, C.-T., and C.-T. Yen. 2013. Missing ingredients in metaphor advertising: The right formula of
metaphor type, product type, and need for cognition. Journal of Advertising 42, no. 1: 80–94.
Durand, J. 1987. Rhetorical figures in the advertising image. In Marketing and semiotics: New direc-
tions in the study of signs for sale, ed. J. Umiker-Sebeok, 295–318. New York: De Gruyter.
Feng, D., and K.L. O’Halloran. 2013. The visual representation of metaphor: A social semiotic
approach. Review of Cognitive Linguistics 11, no. 2: 320–35.
Forceville, C. 1994. Pictorial metaphor in advertisements. Metaphor and Symbolic Activity 9, no. 1: 1–
29.
Forceville, C. 1996. Pictorial metaphor in advertising. London: Routledge.
Gentner, D., and D.R. Gentner. 1983. Flowing waters or teeming crowds: Mental models of electricity.
In Mental models, eds. D. Gentner and A.L. Stevens, 99–129. Hillsdale, NJ: Lawrence Erlbaum Asso-
ciates, Publishers.
Gkiouzepas, L., and M.K. Hogg. 2011. Articulating a new framework for visual metaphors in
advertising: A structural, conceptual, and pragmatic investigation. Journal of Advertising 40, no. 1:
103–20.
Johnson, M. 1987. The body in the mind: The bodily basis of meaning, imagination, and reason. Chi-
cago, IL: University of Chicago Press.
Kaplan, S.J. 1990. Visual metaphors in the representation of communication technology. Critical Stud-
ies in Mass Communication 7, 37–47.
Kaplan, S.J. 1992. A conceptual analysis of form and content in visual metaphors. Communication 13,
197–209.
Kardes, F.R. 1993. Consumer inference: Determinants, consequences, and implications for advertis-
ing. In Advertising exposure, memory, and choice, ed. A.A. Mitchell, 163–91. Hillsdale, NJ: Lawrence
Erlbaum Associates.
Kenney, K., and L.M. Scott. 2003. A review of the visual rhetoric literature. In Persuasive imagery: A
consumer response perspective, eds. L.M. Scott and R. Batra, 297–310. Mahwah, NJ: Lawrence Erl-
baum Associates, Inc.
Lakoff, G. 1990. Women, fire and dangerous things. Chicago, IL: University of Chicago Press.
Lakoff, G., and M. Johnson. 2003. Metaphors we live by. Chicago, IL: University of Chicago Press.
Lagerwerf, L., and Meijers, A. 2008. Openness in metaphorical and straightforward advertisements:
Appreciation effects. Journal of Advertising 37, no. 2: 19–30.
Leder, H., B. Belke, A. Oeberst, and D. Augustin. (2004). A model of aesthetic appreciation and aes-
thetic judgments. British Journal of Psychology 95, no. 4: 489–508.
McQuarrie, E.F. 2008a. Differentiating the pictorial element in advertising: A rhetorical perspective. In
Visual marketing: From attention to action, eds. M. Wedel and R. Pieters, 91–112. New York: Law-
rence Erlbaum Associates.
McQuarrie, E.F. 2008b. A visit to the rhetorician’s workbench: Developing a toolkit for differentiating
advertising style. In Go figure! New directions in advertising rhetoric, eds. E.F. McQuarrie and B.J.
Phillips, 257–76. Armonk, NY: M. E. Sharpe.
McQuarrie, E.F., and D.G. Mick. 1996. Figures of rhetoric in advertising language. Journal of Consumer
Research 22, no. 4: 424–38.
McQuarrie, E.F., and D.G. Mick. 2003. The contribution of semiotic and rhetorical perspectives to the
explanation of visual persuasion in advertising. In Persuasive imagery: A consumer response per-
spective, eds. L.M. Scott and R. Batra, 191–221. Mahwah, NJ: Lawrence Erlbaum Associates, Inc.
McQuarrie, E.F., and B.J. Phillips. 2005. Indirect persuasion in advertising: How consumers process
metaphors presented in pictures and words. Journal of Advertising 34, no. 2: 7–20.
30 M. O. PETERSON
Niebert, K., S. Marsch, and D.F. Treagust. 2012. Understanding needs embodiment: A theory-guided
reanalysis of the role of metaphors and analogies in understanding science. Science Education
96, no. 5: 849–77.
Peterson, M., K. Wise, R. Lindgren, D. Cox, and N. Mathayas. 2015. Understanding and implementing
visual metaphor. (White paper: Illinois Learning Sciences Design Initiative). http://textimage.org/
indexing/pdf/Understanding-Visual-Metaphor.pdf
Peterson, M., K. Wise, Y. Ren, Z. Wang, and J. Yao. 2017. Memorable metaphor: How different ele-
ments of visual rhetoric affect resource allocation and memory for advertisements. Journal of
Current Issues & Research in Advertising 38: 65–74.
Phillips, B.J. 2003. Understanding visual metaphor in advertising. In Persuasive imagery: A consumer
response perspective, eds. L.M. Scott and R. Batra, 297–310. Mahwah, NJ: Lawrence Erlbaum Asso-
ciates, Inc.
Phillips, B.J. 2015. Introduction to the virtual special issue on visual rhetoric in advertising. Journal of
Advertising (Virtual Special Issue on Visual Rhetoric in Advertising). http://explore.tandfonline.
com/content/bes/ja_virtualissue2.
Phillips, B.J., and E.F. McQuarrie. 2004. Beyond visual metaphor: A new typology of visual rhetoric in
advertising. Marketing Theory 4: 113–36.
Rosch, E. 1978. Principles of categorization. In Cognition and categorization, eds. E. Rosch and B.B.
Lloyd, 27–48. New York: John Wiley & Sons.
Sadoski, M., and A. Paivio. 2001. Imagery and text: A dual coding theory of reading and writing.
Mahwah, NJ: Lawrence Erlbaum Associates, Publishers.
Scott, L.M. 1994. The bridge from text to mind: Adapting reader-response theory to consumer
research. Journal of Consumer Research 21, no. 3: 461–80.
Steen, G.J. 1992. Metaphor in literary reception (doctoral dissertation, unpublished). Vrije Universiteit
Amsterdam. Cited in Sorm and Steen (2013).
Steen, G.J. 1994. Understanding metaphor in literature: An empirical approach. London: Longman.
Sorm, E., and G. Steen. 2013. Processing visual metaphor: A study in thinking out loud. Metaphor and
the Social World 3, no. 1: 1–34.
Van Mulken, M., R. Le Pair, and C. Forceville. 2010. The impact of perceived complexity, deviation and
comprehension on the appreciation of visual metaphor in advertising across three European
countries. Journal of Pragmatics 42: 3418–30.
Wise, K., M. Peterson, S. Xiong, and Z. Wang. 2017. How different visual metaphors influence resource
allocation and memory for advertisements. Interventions: Communication Research and Practice
(ICA 2017 Conference) Proceedings, San Diego, CA.