Professional Documents
Culture Documents
net/publication/235966209
CITATIONS READS
41 4,248
2 authors, including:
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by William Dezheng Feng on 16 May 2014.
1 Introduction
While there are a number of cognitive studies that focus on how film devices elic-
it emotion from the viewer (e.g., Carroll 2003; Smith 2003; Tan 1996), few theo-
rists provide a systematic account of how emotions are represented in film. Com-
plementary to cognitive theories that attribute the understanding of film to the
cognitive capacity of viewers, semioticians argue that films are constructed in
ways that guide interpretation prior to handing over the task of understanding to
Following social semiotic theory, texts consist of choices made at different strata
(Halliday 1994). In the case of film, the causes and the character’s linguistic and
nonverbal expressions of emotion are not spontaneous as in real life, but are semi-
otic discursive constructs designed by filmmakers. The semiotic approach enables
us to move beyond cognitive psychological studies to examine how emotions are
“designed” in film. The phenomenology of the causes and expressions of emotion
in real life provides resources for the semiotic choices and the psychological theo-
ries of emotion provide us with tools to categorize those resources (cf. Newman
2005).
Combining the social semiotic and cognitive approaches, we develop frame-
works for investigating the multimodal construction of emotion in film. The film-
maker’s semiotic choices are examined in relation to the cognitive structure of
emotion. In Section 2, a brief account of the cognitive components of emotion is
provided. Following this, the representation of eliciting conditions and expres-
sions of emotion is examined in Section 3, and the configuration of eliciting con-
dition and expression through film editing is explored in Section 4. We conclude
with a description of how the social semiotic approach, combined with cognitive
theories of emotion, is able to provide a theoretical account of emotion represen-
tation in film in Section 5.
The main theoretical basis for investigating emotion representation is the cogni-
tive appraisal theory (e.g., Frijda 1986; Scherer and Ellgring 2007), which argues
It seems safe to assume that basic emotions and their eliciting condition and
expression in films can be understood by most audiences. As Ortony et al. (1988:
3) note, “it is apparent that writers can reliably produce in readers an awareness
of a character’s affective states by characterizing a situation whose construal is
assumed to give rise to them.” The reason is that appraisal of an eliciting condi-
tion is generally shared amongst members and groups of a society (Bless et al.
2004). Experiments have also shown that both children and adults can report and
agree on typical antecedents of several common emotions (e.g., Smith and
Ellsworth 1985).
As a result, filmmakers are able to speculate (correctly most of the time)
viewers’ emotional reactions based on cultural knowledge. It is thus possible for
filmmakers to “design” film emotions to optimize engagement with viewers. The
filmic and discursive strategies for designing emotion are elaborated in Sections
3 and 4.
Modern studies of emotion have been modality specific; that is, they focus on
language (e.g., Kövecses 2000; Martin and White 2005; Wierzbicka 1990), the face
(e.g., Ekman and Friesen 1975, 1978), the voice (e.g., Banse and Scherer 1996;
Scherer 2003) or the body (e.g., Wallbott 1998).
In terms of facial expression, it is generally accepted that certain configura-
tions of facial muscle groups are universally judged to be associated with particu-
lar emotions (Ekman and Friesen 1975). Accordingly, psychologists have devel-
oped portraits of facial patterns to account for basic emotions of happiness,
surprise, fear, anger, disgust, and sadness (e.g., Ekman and Friesen 1975, 1978;
Izard 1971). However, Carroll and Russell (1997: 165) argue that patterns of facial
expressions arise only secondarily, through the coincidental co-occurrence of two
or more different components. In Hollywood films, although professional actors’
happiness is represented by smiles in 97% of cases, surprise, anger, disgust or
sadness rarely show the predicted pattern of facial expression (found in 0 to 31%
of cases; Carroll and Russell 1997). This study challenges the position that facial
expressions are hardwired in the emotion experience and suggests the need for a
framework that accommodates comprehensive multimodal analysis of the repre-
sentation of emotion.
The evidence for emotion-specific patterns in vocal features is not as strong
as that for facial expression (Wallbott 1998: 880). These parameters are generally
considered in relation to the arousal level of emotion. The emotive meanings of
body movements, gestures and actions are even less clear, in that differential pat-
terns of bodily activity do not fall into clusters characteristic of discrete emotions
(Planalp 1998: 34). Therefore, it is reasonable to consider these resources as con-
tinuous expressions of underlying dimensions of emotion, such as arousal and
valence.
In what follows, frameworks for the multimodal construction of the eliciting con-
dition (EC) and expression (Ex) of emotion are developed and are illustrated with
examples from well known films and television series. The basic organizing as-
sumption is that meaning is constructed across the semiotic strata in which the
cognitive components of emotion, organized by shots and syntagmas, are real-
ized by the audio-visual resources rendered as audio and visual tracks in film (cf.
Figure 1).
of eliciting condition and expression and these syntagmas are essential for the
understanding of filmic representation of emotion, as shall be discussed in
Section 4. The eliciting condition and the expression are mostly sequentially
arranged, in which some emotion-inducing event happens first and the charac-
ter’s emotional reactions follow. The emotion-inducing event is significant in
filmic communication of emotion. First, it doesn’t only enable us to infer the char-
acter’s emotion and but also makes us anticipate the character’s emotional reac-
tion. Second, it may provoke the viewer’s feeling. As is noted in Section 2.2, the
appraisal of many events is culturally shared. Therefore, we are not only able to
infer the character’s emotional reaction, but also feel the emotion to some extent
based on our identification with the character.
The film character’s linguistic and nonverbal expressions of emotion are not
spontaneous as in real life, but are semiotic discursive constructs designed by
filmmakers. Therefore, the first dimension in our multimodal framework involves
the resources of verbal and nonverbal expression. The framework also includes
discursive choices, which include the quantity of expression (simple/complex) as
well as the context of expression (individual/interactive). The dimensions with
their respective systems are displayed in Figure 3. The dimensions of verbal and
non-verbal expressions and discursive choices are elaborated in Section 3.2.1. The
stylistic choices, including camera positioning, music, and so forth, are not dis-
cussed separately, but are pointed out where relevant in the ensuing discussion.
iconic stage in the process of signification and consider the visually represented
behavior as icons of indexes, rather than indexes themselves. Therefore, in Figure
4 in Section 3.2.2, the character mimicries the real expression of happiness that
indexes the emotion. Meanwhile, this study does not aim to work out a “gram-
mar” of nonverbal behavior (see Feng and O’Halloran forthcoming; Martinec
2001 for attempts of this kind), rather, as the nonverbal expressions of emotion
can normally be unambiguously recognized in Hollywood movies, we shall
merely interpret the meanings of facial expressions or vocal features based on the
studies reviewed in Section 2.3.
Episode 12, Season 4 of Friends in Figure 6 illustrate how the choices in reverse
shots are made. In the first shot (Move 1), Monica (K1) expresses her emotion to
Rachel by recounting the eliciting condition that she is offered the job of head
chef in high pitched, loud voice, accompanied with the facial expression of smile.
Rachel (K2) responds to Monica in the second shot (Move 2) with surprise.
tion (e.g., Scherer and Ellgring 2007). As noted in Section 2.3, the limitation of
this approach is that the limited variables are unable to account for the complex-
ity of emotion expressions, which include idiosyncratic actions and emotions
which take place over time. These complex expressions are significant in the rep-
resentation of emotion. Very often, the immediate reaction is followed by several
shots or scenes of individual expression, or the emotion is expressed in multiple-
turn interaction (interactive expression). Many complex expressions involve both
individual and interactive expressions and may extend across several shots or
even several scenes. In a shot from Tom Shadyac’s Patch Adams (1998), Patch’s
emotion is expressed with several resources in several shots after being kissed by
Corinne, the girl who he admires. Patch first makes the “wow” sound, which
shows his enjoyment, then he laughs happily, and dances as he walks away. The
expression includes facial expression, linguistic expression, and material action
and communicates the intensity of Patch’s happiness.
As a discursive choice, the expression depicted is determined by many fac-
tors, in particular the intensity of the emotion and the genre of the film. Complex
expressions of emotion over long durations of time tend to appear in female-
oriented genres like melodrama and romance, while in male-oriented genres like
action movie, emotions are often expressed over shorter time periods. In the
melodrama Patch Adams, for example, Patch’s grief after his girlfriend Corinne
was murdered is expressed over approximately nine minutes. The expressions
include his immediate facial reaction after learning about the news, crying at
Corinne’s coffin, leaving the medical school, conversing with his two classmates,
attempting to jump off the cliff, and his speech, which blames God for the murder.
Such full-fledged expression is undoubtedly motivated by his intense grief and
despair, but the filmmaker’s choice to allocate nine minutes to Patch’s display of
emotion is certainly a discursive choice. The discursive choice is quite different in
Ridley Scott’s (2000) Gladiator, which is a Roman epic and a male-oriented ac-
tion film. When Maximus sees that his wife and son have been murdered, he cries
with much anguish at the sight of their corpses. However, this is the only expres-
sion of grief and the film gives it several seconds before moving on to another
stage of the narrative. Maximus’ emotion may be no less intense than Patch’s, but
the filmmaker chooses a more compact way to depict the emotion.
accurate depiction of emotion. A further issue to address, which is also a key as-
pect of filmic representation of emotion, is how they are co-deployed, or orga-
nized. Previous studies only explain the working mechanism of one or two filmic
resources, for example, Carroll’s (1996) theorization of the POV structure. In this
section, we provide a comprehensive account of the shot-connecting devices and
examine how causal relations between the eliciting condition and expression
are represented by formally connected shots. The framework counts as a step
towards explaining how the textual logic of film enables interpersonal meaning
(cf. Bateman and Schmidt 2011).
In our model, the eliciting condition-expression configuration is systemati-
cally organized by shots and syntagmas. However, as with previous models, there
is no one-to-one correspondence between the choices from eliciting condition-
expression configuration and the choices of their filmic organization. For exam-
ple, two interaction turns can be realized by reverse shots or a single shot. Never-
theless, patterns can be found between the semantic layer and the expression
layer.
To account for the eliciting condition-expression configuration, we draw
upon the “grande syntagmatique” (the syntagmatic categories for narrative film)
proposed by Metz (1974; see also Bateman 2007). The options for syntagma are
significantly fewer than Metz’s grande syntagmatique because the causal-
temporal relations between the eliciting condition and expression mean that only
narrative syntagmas are relevant for emotion representation. Other syntagmas,
such as parallel syntagma, which depicts conceptual relations (e.g., classifica-
tion) between events, are not relevant. The shots and syntagmas available for
representing eliciting condition-expression configuration are shown in Figure 7.
We are concerned with the shot relation which connects the eliciting condition
and the immediate linguistic or kinetic response within the basic unit of syn
tagma. There are cases where the eliciting condition and the expression are not
organized in one syntagma. First, the eliciting condition is presented to the viewer
as a narrative event and somehow the emoter knows it but we do not know how
he/she accesses it (the case of EC5 in Figure 2). Second, the filmmaker creates a
separate scene for the character to express his/her emotion. In an episode in
Friends (Crane and Kauffman 1998), Rachel is given the job of assistant buyer dur-
ing her conversation with her supervisor Joana. There are naturally emotional
reactions immediately after learning about the news, but the film cuts to another
scene and Rachel only expresses her emotion in the scene after that. Third, as
pointed out in Section 3.2.2, complex expressions may extend across several
scenes and hence extend beyond the autonomous syntagma.
Projection depicts the character and what he/she sees and thinks. We shall focus
on the former, which is represented by the POV structure. POV structure typically
portrays what the character sees and how he/she reacts to it, constituting the
EC2^Expression type (cf. Figure 2). Carroll (1996) develops a cogent theory of POV
Two points need to be stressed, based on Carroll’s (1996) classic theory. First, POV
structure is only one of the many mechanisms of emotion representation, as
pointed out by Plantinga (1999) and suggested in our system. Second, there are
variations to the POV structure. One obvious variation is the order of the object
and reaction. Naming the point/object and point/glance shot A and shot B re-
spectively, we can get A^B and B^A structures. Then it seems that Carroll’s (1996)
treatment of reaction as “ranger finder” and object as “focuser” only applies to
the B^A structure. Another variation is that the object and the reaction can be
represented in one shot, either within the same frame or by a panning/tilting
camera. Third, the POV shots may be elaborated by subsequent shots. That is, the
object or the reaction may be portrayed by more than one shot, as they often
are. Taking A^B structure as an example, it is often reiterated by another pair of
object-glance configuration (A^B + A′^B′), showing the object from a different
angle and the character’s reaction with slight variation, as in shot 3 and shot 4
in Figure 8, which follow the first two shots immediately. Variations of this
reiteration include showing the object again without showing the character
(A^B + A′ + A′′ + . . .) and showing the character’s reaction in several shots
(A^B + B′ + B′′ + . . .). The multiple reaction shots are commonly used to highlight
the character’s emotion, together with the long duration and close distance of the
shot. This technique is used not only to guarantee our recognition of the emotion
portrayed, but also to invoke our empathy (Plantinga 1999).
shot. The reaction is unambiguously surprise, with the verbal signal “oh, my god”
in high pitch voice and the open mouth. The eliciting condition and expression
corresponds to the speaker turns, organized by reversed shots, realized by facial
expression, language and vocal features, and finally rendered as audio-visual
tracks. The relation between different layers of semiosis is illustrated in Figure 10.
Linear narrative syntagma captures the eliciting condition and expression as two
successive actions, namely, the action and the reaction. The actions may be con-
tinuous or discontinuous in form, but two shots depict them as succeeding ac-
tions from one participant. The shots feature what the character does or says
(EC1) as eliciting condition and how he/she responds to his/her action/speech.
However, such configuration of eliciting condition and expression is less com-
mon because reaction usually does not immediately follow action. The emoter
often responds to the effect of his/her action, instead of the action, so there is
typically a shot of the result of the action before the reaction shot. For example,
Episode 12, Season 1 in Friends, there is a scene in which Monica is playing table
football with others. The first shot shows her action of playing the ball, the second
shot shows the ball she scored and the third shot shows her excitement. The suc-
cessive action is interrupted by the second shot which forms a POV structure with
the third shot.
To summarize, this section examines the discursive resources for organizing
the eliciting condition and expression. It shows that different configurations of
eliciting condition and expression are organized in different syntagmas, as shown
in Figure 12.
5 Conclusion
This study provides a semiotic theorization of how emotion is represented in film,
complementing cognitive approaches which focus on how film elicits emotion
from the viewers. We develop a semiotic framework in which the filmic represen-
tation of emotion is seen as semiotic discursive choices and we apply the strati-
fied semiotic model to film discourse to investigate how emotive meaning is real-
ized through the choices of verbal/nonverbal resources and filmic devices.
Meanwhile, the framework also draws upon the cognitive components of emotion
which provide structure to the representational choices at the semantic level.
Then choice systems for the representation of the two main components of elicit-
ing condition and expression are developed. At the discursive level, the choices
available in the shot organization of the eliciting condition and expression are
examined.
The paper concludes that the social semiotic approach, combined with the
cognitive account of emotion structure, is able to explain how emotion is con-
structed in film, although not all resources are fully discussed (e.g., the use of
music, color, etc.). Such semiotic discussions complements current studies which
focus on film viewer’s emotional response. It does not only explain how various
film techniques work to represent emotion, but is also significant for the study of
viewer emotion since character emotion is the most important source that elicits
viewer emotion.
References
Banse, Rainer & Klaus Scherer. 1996. Acoustic profiles in vocal emotion expression. Journal of
Personality and Social Psychology 70(3). 614–636.
Bartlett, Charles F. 1932. Remembering. Cambridge: Cambridge University Press.
Bateman, John. 2007. Towards a grande paradigmatique of film: Christian Metz reloaded.
Semiotica 167(1/4): 13–64.
Bateman, John & Karl-Heinrich Schmidt. 2011. Multimodal film analysis: How films mean.
London: Routledge.
Bednarek, Monica. 2008. Emotion talk across corpora. Basingstoke: Palgrave.
Bless, Herbert, Klaus Fiedler & Fritz Strack. 2004. Social cognition: How individuals construct
social reality. Hove: Psychology Press.
Carroll, Noël. 1996. Theorizing the moving image. Cambridge: Cambridge University Press.
Carroll, Noël. 2003. Engaging the moving image. New Haven: Yale University Press.
Carroll, James M. & James A. Russell. 1997. Facial expressions in Hollywood’s portrayal of
emotion. Journal of Personality and Social Psychology 72. 164–176.
Coplan, Amy. 2006. Catching characters’ emotions: emotional contagion responses to narrative
fiction film. Film Studies 8. 26–38.
Ekman, Paul & Wallace V. Friesen. 1975. Unmasking the face. Englewood Cliffs, NJ: Prentice Hall.
Ekman, Paul & Wallace V. Friesen. 1978. The facial action coding system. Palo Alto, CA:
Consulting Psychologists Press.
Feng, Dezheng & O’Halloran, Kay. L. (forthcoming). Representing emotions in visual images: A
social semiotic approach. Journal of Pragmatics.
Forceville, Charles. 2005. Visual representations of the idealized cognitive model of anger in
the Asterix album La Zizanie. Journal of Pragmatics 37. 69–88.
Frijda, Nico. 1986. The emotions. Cambridge: Cambridge University Press.
Halliday, Michael. 1994. An introduction to functional grammar. London: Arnold.
Izard, Carroll E. 1971. The face of emotion. New York: Appleton Century Crofts.
Kövecses, Zoltán. 2000. Metaphor and emotion: Language, culture, and body in human feeling.
Cambridge: Cambridge University Press.
Lazarus, Richard. 1991. Emotion and adaptation. New York: Oxford University Press.
Martin, James R. & Peter White. 2005. The language of evaluation. New York: Palgrave
Macmillan.
Martin, James R. & David Rose. 2007. Working with discourse. London: Continuum.
Martinec, R. 2001. Interpersonal resources in action. Semiotica 135(1/4). 117–145.
Metz, Christian. 1974. Film language: A semiotics of the cinema. Oxford: Oxford University Press.
Newman, Michael. 2005. Characterization in American independent film. Madison: University of
Wisconsin-Madison dissertation.
O’Donnell, Michael. 1990. A dynamic model of exchange. Word 41. 293–327.
Ortony, Andrew, Gerald Clore & Allen Collins. 1988. The cognitive structure of emotions. New
York: Cambridge University Press.
Planalp, Sally. 1998. Communicating emotion in everyday life. In Peter Andersen & Laura
Guerrero (eds.), Handbook of communication and emotion, 30–48. San Diego: Academic
Press.
Plantinga, Carl. 1999. The scene of empathy and the human face on film. In Carl Plantinga &
Greg Smith (eds.), Passionate views: Film, cognition, and emotion, 239–255. Baltimore:
Johns Hopkins University Press.
Scherer, Klaus. R. 2003. Vocal communication of emotion: a review of research paradigms.
Speech Communication 40. 227–256.
Scherer, Klaus. R. & Heiner Ellgring. 2007. Multimodal expression of emotion: Affect programs
or componential appraisal patterns. Emotion 7. 158–171.
Smith, Craig & Phoebe Ellsworth. 1985. Patterns of cognitive appraisal in emotion. Journal of
Personality and Social Psychology 48. 813–838.
Smith, Greg R. 2003. Film structure and the emotion system. Cambridge: Cambridge University
Press.
Tan, Ed S. 1996. Emotion and the structure of narrative film: Film as an emotion machine.
Mahwah, NJ: Lawrence Erlbaum.
Tseng, C. 2009. Cohesion in film and the construction of filmic thematic configuration: A
functional perspective. Bremen: University of Bremen dissertation.
Wallbott, Herald G. 1998. Bodily expression of emotion. European Journal of Social Psychology
28. 879–896.
Wierzbicka, Anna. 1990. The semantics of emotions: fear and its relatives in English. Australian
Journal of Linguistics 10(2). 359–375.
Bionotes
Dezheng Feng (b. 1983) is a Research Assistant Professor at the Hong Kong Poly-
technic University 〈dezhengfeng@gmail.com〉. His research interests include
social semiotics, multimodal discourse analysis, and cognitive linguistics. His
publications include “Visual space and ideology: A critical cognitive analysis of
spatial orientations in advertising” (2011); “Intertextual voices and engagement
in TV advertisements” (with P. Wignell, 2011); “The visual representation of meta-
phor: A social semiotic perspective” (with K. O’Halloran, 2013); and “Multimodal
engagement in television advertising discourse” (2013).