Professional Documents
Culture Documents
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at http://www.jstor.org/page/
info/about/policies/terms.jsp
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content
in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship.
For more information about JSTOR, please contact support@jstor.org.
Penn State University Press is collaborating with JSTOR to digitize, preserve and extend access to Style.
http://www.jstor.org
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
Audio Description on the Thought-
Action Continuum
Alan Palmer
Independent scholar
Andrew Salway
Uni Research Computing
Abstract: The term “audio description” refers to the spoken commentary that is
added to film soundtracks to enable visually impaired people to follow the action on the
screen by supplementing the information that is already available from character dia-
logue and other parts of the soundtrack. This essay introduces some core narratological
concepts and discusses how they are relevant to audio description research and practice,
especially to the issue of characters’ mental states. It then describes how corpus analysis
techniques can be used to analyze larger samples of audio description and develop a pre-
liminary classification of ways in which describers convey information about characters’
mental states. It closes by discussing the synergies between audiovisual translation, nar-
ratology, and corpus linguistics. The essay is intended to be a contribution to the debate
within the field of audio description studies regarding the possibility and desirability of
“objectivity” in audio descriptions.
1. I N TR OD U CT ION
Style, Vol. 49, No. 2, 2015. Copyright © 2015 The Pennsylvania State University, University Park, PA
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
of relevance to all those who work in the much wider fields of narrative
theory, stylistics and literary theory generally. We have in mind, in par-
ticular, our questioning of the apparently obvious distinction between
descriptions of characters’ actions and descriptions of characters’ mental
processes in fictional narratives.
Specifically, this essay seeks to show how concepts from narratology and
techniques from corpus linguistics can be applied to the analysis of audio
description. We are concerned with the issue of what should be described in
audio description for feature films, and how it should be worded. We intro-
duce narratological concepts that can help to better articulate this issue and
to better analyze and compare examples of audio description. To comple-
ment the application of narratological concepts for a close reading of audio
description samples, we show how corpus linguistics techniques can be used
to learn about what is being described, and how, in a corpus of ninety-one
audio description scripts. Our focus is on how an audio description utter-
ance can, and, as it turns out, often does, describe a character’s actions as
depicted on-screen while simultaneously giving some information about the
character’s thoughts.
Central to the practice and theory of audio description is the question
of what should be included in a description (Vercauteren). Concerning the
issue of what to describe, and how to describe it, some recent guidelines for
audio description state:
“The best audio describers objectively recount the visual aspects of an image.
Subjective or qualitative judgments or comment get in the way – they constitute an
interpretation on the part of the describer and are unnecessary and unwanted . . .
Describers must differentiate between emotion or reasoning (which requires an
interpretation on the part of the observer) and the physical characteristics of emo-
tion or reasoning (which are more concrete and allow description users to conjure
their own interpretations).” (Snyder 17)
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
“that is not apparent on the screen,” without really explaining how to determine
what is “apparent.” It seems to us that such guidelines leave unanswered ques-
tions about what information is necessary for an audience to understand and
enjoy a film, and what “interpretation” means in this context. Also, and this
becomes the main theme of the essay, with regards to what can be seen and
hence described, we take issue with the supposedly clear-cut distinction that
is made between a character’s “emotion or reasoning” (thoughts, which should
not be mentioned according to the guidelines) and their “apparent” physical
manifestations (actions, which may be described).
This essay seeks to contribute to audio description theory and practice in
two ways. First, we introduce more rigorously defined terms from narratol-
ogy to clarify and simplify the issues alluded to in the previous paragraph.
Second, we show how corpus linguistics techniques can be used to charac-
terize the information about characters’ actions and thoughts that audio
description actually provides. We, thus, hope to help circumvent the “mis-
guided dilemma of subjectivity versus objectivity” (Kruger and Orero 141)
and further develop the multidisciplinary nature of audiovisual translation
research (cf. Matamala and Orero).
Recent years have seen the uptake of various narratological concepts and
techniques by audio description researchers. Through a corpus-based analy-
sis, Salway (2007) showed how the language of audio description is influ-
enced by its narrative function, i.e., it must convey information about “chains
of events in cause-effect relationships” where “the agents of cause and effect
are characters with goals, beliefs and emotions” (152). With the aim of teach-
ing audio describers, Remael and Vercauteren (2007) show how to identify
“narratively significant cues” for inclusion in descriptions in relation to the
exposition phase of filmic narratives. Likewise, Ibañez (2012) argues for the
importance of explicating film narrative in the teaching of audio descrip-
tion. An innovative approach correlated eye-tracking data with viewers’ nar-
rative constructions to understand how different visual elements contribute
to storytelling (Kruger). Narratological theory was drawn on to argue for a
“more subjective and cultural-historical reference point for audio describers”
(Finbow 215) while Vandaele (2012) also extended consideration of narra-
tive beyond the depicted sequences of events in films to include the hypo-
thetical events that are triggered in the viewer’s mind.
Of particular relevance for the current essay is Kruger’s argument that
the filmic text, shorn of the moving image, must be “re-narrativised” for
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
blind and visually impaired audiences (Kruger 231–32). This means giving
consideration to how the story told by the film can be retold by com-
bining a new speech track with the dialogue, sound effects, and music.
Regarding the accessibility of films, Kruger makes the case for giving
less priority to a precise recounting of on-screen action and more pri-
ority to the “narrative implication or effect of what can be seen” (234).
To achieve this, “audio narration” (233) is suggested as an alternative to
audio description, where there is a continuum from objective description
(that may be achieved in the audio description of a documentary) to a
narration that may use, for example, “markers of focalization . . . to enable
the audience to situate themselves in the fictional world” (243). At the
core of the current essay is the somewhat analogous and complementary
thought–action continuum.
Informed by a narratological perspective, our premise is that understand-
ing a story entails knowing about the mental states of its characters. In other
words, it is essential for the audience of a film to learn about the causal net-
work of beliefs, desires, and goals that motivate and explain the characters’
actions. At first glance, this perspective may seem to be at odds with the
guidelines mentioned above: can it be possible to see, and hence to describe
mental states? Within the moving image of a film—for which audio descrip-
tion is a surrogate—actors act, and with their actions and through their
facial expressions, they reveal something about characters’ mental states;
of course, the other components of a film (dialogue, music, and film tech-
niques) contribute to the storytelling too.
The central question that we address here is this: how much information
about the “minds,” i.e., mental states, of characters in films is audio descrip-
tion able to convey—given the constraints of being objective (not “interpret-
ing”), allowing audiences to make their own inferences about a story (not
“giving away the plot”), and the limited time available for description (which
must fit between the film’s dialogue)?
In section 2, we introduce some core narratological concepts and discuss
how they are relevant to audio description research and practice, especially to
the issue of characters’ mental states. In section 3, we describe how corpus anal-
ysis techniques can be used to analyze larger samples of audio description and
develop a preliminary classification of ways in which describers convey infor-
mation about characters’ mental states. We close in section 4 by discussing the
synergies between audiovisual translation, narratology, and corpus linguistics.
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
2. A
PPLY ING A THEO RY OF F ICTI O N AL M I N DS TO
AU D IO D ESCRIPTION
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
reader looks at the word “ hiding,” the more like a description of consciousness
it becomes. Put another way, the word “standing” is at the action end of the
thought–action continuum; the word “hiding” is nearer the middle, because
it describes the action but also contains a reference to the mental functioning
behind the action. It is worth mentioning that the mental states referred to in
words in the middle of the continuum are often, unsurprisingly of course, rea-
sons, motives, and intentions relating to action.
In the Philosophical Investigations (1958), the philosopher Ludwig
Wittgenstein quotes the statement “I noticed that he was out of humour,”
and asks, “Is this a report about his behaviour or his state of mind.” When
“Wittgenstein’s question” (see Palmer 120–21) is asked of the statement
“They are standing behind the curtain,” the answer is that it is a report of
their behavior; when asked of the statement “They are hiding behind the
curtain,” the answer is that it is a report of both their behavior and their state
of mind. And Wittgenstein’s question can usefully be asked of many of the
statements made in action descriptions.
The other large group of mental states that often occur in these statements
is emotions. For example, in one of the scenes from the film The English
Patient to be discussed in the following section, one of the audio descrip-
tions refers to the female protagonist, Katherine, as simply stopping and
turning toward Almasy. That is at the action end of the continuum. The other
description says that “She rounds on Almasy.” The second description con-
veys much more information about her emotions of anger and frustration.
So it is in the middle of the continuum. Similarly, for example, in the same
scene, “Katherine glares” at Almasy, rather than the more neutral “Katherine
looks at him.” A hypothetical example at the thought end of the continuum
would be something like “Katherine is angry with Almasy.”
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
Scene one: As part of the communal entertainments around the desert campfire,
Katherine is telling the expedition party a story:
“Katherine hesitates as she meets Almasy’s gaze across the campfire.” (A)
Scene two: During a dance at the Cairo hotel, Katherine has just finished dancing with a
member of the group:
“Almasy steps between them and takes Katherine in his arms. Seeing them, Geoffrey
sips his wine thoughtfully. On the dance floor, Almasy does not answer. He holds
her stiffly in his arms, gazing directly at her finely sculpted features and pale blond
hair. Disconcerted, Katherine meets his gaze, then glances away uncertainly. Without
speaking they glide across the floor between the dancing couples circling beneath
the crystal chandeliers.” (A)
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
“Almasy cuts in and leads Katherine. She smiles. Almasy studies her intently and
she drops her gaze. When she lifts her eyes again he is still staring. He holds her in
a stiff embrace, his face inches from hers, his hand resting on the small of her back
pressing her close to him.” (B)
In this scene, the two descriptions are quite similar in some ways: both
emphasize Almasy’s taking control of the situation; both use the word “stiff ” to
describe Almasy’s dancing; and both have Katherine glancing away. However,
there are some differences. Version A makes more of Katherine’s turning
away movement, using the words “disconcerted” and “uncertainly” to provide
important information about her mental states. Also, it is odd that, as in the
last scene, only one description takes note of Clifton’s reaction to the relation-
ship between Katherine and Almasy. But this time, it is A rather than B.
In addition, the difference in what may be called focus is interesting.
B emphasizes the almost claustrophobic intimacy of the physical contact
between the pair: his face inches from hers, pressing her close. It con-
veys the tension and the growing attraction between them. By contrast,
A stresses the social and physical context of the dancing couples and the
crystal chandeliers. The use of the word “glide” suggests a much more
relaxed encounter. Very different mental states are implied: it makes the
two characters’ states of mind sound misleadingly relaxed and uninvolved
by comparison.
Scene three: Back in the desert, Katherine shows Almasy some of her paintings:
“She offers them to him. He takes the paintings and looks at them. He holds them
out. She takes the pictures back. Katherine walks a few steps then stops and turns. He
looks embarrassed. Quickly she walks away. Almasy looks after her uncertainly.” (A)
“Almasy takes them and glances up at her. Katherine smiles as he looks through
them. He hands them back. Katherine’s face falls. Reluctantly she takes the paint-
ings and starts to go. Then hesitates. She rounds on Almasy. He opens his mouth to
speak, but says nothing. Katherine glares, turns and marches off, heading up the side
of the dune. Almasy blinks, sighs and shakes his head. He returns to his book.” (B)
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
“glares,” “turns and marches off.” Much of this is veering toward the thought
end of the thought–action continuum. Her conflicted thought processes are
made clear. By contrast, A stays at the action end of the continuum in rela-
tion to Katherine. Significantly though, the A focus is on Almasy’s mental
functioning: he “looks embarrassed”; he looks after her “uncertainly.”
We did not notice any consistent patterns of difference between the two
descriptions across the three scenes. For example, Clifton’s reactions were
noted in description B for scene 1 and in description A in scene 2. So there
were no obvious patterns of identification, say, with one character in one
description and another character in another. It may be that this point could
form the basis of at least one good criterion for objectivity, i.e., descriptions
should not over-identify with, or get over-involved in, one character at the
expense of the others. What we did notice in both descriptions were some
common devices used to give information about mental states. These include
descriptions of different types of smile (“smile fades,” “smiles awkwardly”)
and different ways of looking at other characters (“studies her intently,”
“looks after her uncertainly,” and descriptions of characters appearing to be
in a particular state (“he is transfixed,” “disconcerted,” “looks embarrassed”).
These devices become the starting point for our corpus analysis in section 3.
Given that the two descriptions are so informative about the three main
characters’ states of mind, the question arises as to whether they meet the
guidelines’ requirement to be objective. Our close reading of some audio
description fragments leads us to claim that it is possible to remain objec-
tive while being informative about mental states in a concise and consistent
way, and that such information is indeed essential for the audience to under-
stand the story. To put the point in narratological terms: the descriptions
are in appropriate places on the thought–action continuum. We have also
shown how small differences in descriptions of characters’ actions can result
in big potential differences for an audience’s understanding of a story, which
is perhaps something to be considered in future guidelines.
3. A
PPLY ING CORP U S A NALY S I S TECH N I Q UES TO
AUD IO D ESCRIPTION
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
The detailed analysis that follows may be summarized in this way. Some
of the ways in which audio describers can give information about charac-
ters’ mental states are listed here roughly in order from action-oriented to
thought-oriented. This order implies an increase in the extent to which the
describer makes explicit for the AD audience what the sighted audience is
expected to infer from the moving image.
(1) Description of simple actions that are intended to imply mental states,
e.g., “Robin rolls his eyes.”
(2) Description of facial expressions, such as “smiles,” “frowns,” and “grins.”
(3) Modification of the description of actions by adding an adverb, e.g.,
“walks cautiously,” by using a troponym, e.g., “creeps,” or with a phrase
like “smiling in relief.” The particular choice of adverb, troponym, or
abstract noun will lead to a different positioning on the thought–
action continuum.
(4) Description of a character as appearing to be in a particular mental
state, e.g., “Harry looks confused.”
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
The uses in the former sense (looking at something) tend to be nearer the
middle of the spectrum because there is an action involved. In both descrip-
tions A and B “looks” is modified by an adverb seven times to give more
insight into mental states. It is clear that, even in the action uses of “looks,”
there is plenty of information being given about mental states. Indeed, it
is notable that several words convey the presence of a good deal of highly
self-conscious thought: “thoughtfully,” “pensively,” “uncertainly,” “curiously,”
and “questioningly.” This is not surprising given the subject matter of this
film—highly intelligent and sensitive people falling in love.
Of course, aside from smiling, looking, and walking, there are many other
actions that could be modified by adverbs to inform about mental states in
audio description. In order to focus on how these actions are performed, the
next step was to search the two descriptions for words taking a common and
easily identifiable form of English adverbs, i.e., those ending in -ly. From
inspecting the concordances of -ly adverbs, we found that in description
A over 100 of the 149 instances convey information about characters’ mental
states; in B it is over 90 of 136 occurrences.
As an alternative to modifying “looks” with an adverb, audio describers
working in some languages, such as English, have a variety of troponyms at
their disposal with which they can specify a particular manner of looking,
and in so doing give more insight into characters’ mental states, e.g., “gazes”
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
Troponym A B
stares 16 4
gazes 9 5
peers 8 3
glances 5 2
eyes 4 0
studies 3 0
glares 2 0
regards 0 1
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
Table 2a: Frequencies of word forms ending -ly that follow the word forms “looks.”
directly 9
anxiously 8
sadly, steadily, thoughtfully 7
nervously 6
fearfully, grimly, longingly, quizzically 4
curiously, slowly 3
blankly, briefly, carefully, frantically, intently, pensively, searchingly, 2
seriously, sheepishly, sympathetically, uneasily, urgently
accusingly, appealingly, awkwardly, bitterly, calmly, cautiously, closely, 1
coldly, contemptuously, defiantly, disappointedly, disapprovingly,
earnestly, enquiringly, entreatingly, firmly, forlornly, gravely, guiltily,
impatiently, incredulously, malevolently, meaningfully, miserably,
mournfully, pathetically, pityingly, pleadingly, pointedly, questioningly,
quickly, resignedly, skeptically, shakily, sharply, shyly, solemnly, surrep-
titiously, tenderly, unsurely, vacantly, warily, wistfully
Table 2b: Frequencies of word forms ending -ly that follow the word form “walks.”
slowly 57
purposefully 9
briskly 8
carefully, cautiously, confidently, quickly, stiffly 4
hesitantly, nervously, steadily, tentatively, unsteadily 3
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
Table 3: Frequencies of word forms ending -ed that follow the word form “looks.”
puzzled 17
confused 15
troubled 11
shocked 10
concerned 7
embarrassed, surprised 6
amazed, disappointed, worried 5
bemused, pleased, stunned 4
annoyed, dazed, perplexed, terrified, unconvinced 3
agitated, alarmed, exasperated, horrified, interested, relieved, unim- 2
pressed, unnerved
aggrieved, amused, ashamed, astonished, baffled, bewildered, bored, 1
captivated, chuffed, defeated, deflated, delighted, determined, disgrun-
tled, disheartened, dismayed, dispirited, distracted, disturbed, drained,
entranced, excited, flattered, frightened, impressed, irritated, moved,
mystified, peeved, perturbed, petrified, saddened, scared, shell-shocked,
startled, strained, stumped, unconcerned
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
NOTES
1. Unless specified otherwise, all frequency counts here relate to specific word forms,
rather than lemmas. We think it is appropriate to use frequency here, rather than relative
frequency, because we are comparing descriptions of the same film.
2. Indeed, the ITC guidelines (ITC 2000) suggest the use of adverbs (point 3.9) and the
use of “specific” verbs (point 3.11).
3. http://www.perl.org.
4. Expressions that can specify word forms and patterns to be searched for in a corpus;
for more examples, see http://www.regular-expressions.info/.
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
RE F ERE NCES
Anthony, Laurence. AntConc (Version 3.4.2) [Computer Software]. Tokyo, Japan: Waseda
University, 2011. Web. 24 Feb. 2015. http://www.antlab.sci.waseda.ac.jp/.
Biber, Douglas, Conrad, Susan, and Reppen, Randi. Corpus Linguistics: Investigating
Language Structure and Use. Cambridge: Cambridge UP, 1998. Print.
Fellbaum, Christiane, ed. WordNet: An Electronic Lexical Database. Cambridge, MA: MIT
Press, 1998. Print.
Finbow, Steve. “The state of audio description in the UK – from description to narration.”
Perspectives: Studies in Translatology 20.1 (2010): 215–29. Print.
Herman, David, ed. The Cambridge Companion to Narrative. Cambridge: Cambridge UP,
2007. Print.
Ibañez, Anna. “Evaluation criteria and film narrative. A frame to teaching relevance
in audio description.” Perspectives: Studies in Translatology 18.3 (2010): 143–53.
Print.
ITC. “ITC Guidance on Standards for Audio Description.” 2009. Web. 10 Aug. 2013. http://
www.ofcom.org.uk/static/archive/itc/itc_publications/codes_guidance/audio_
description/index.asp.html.
Kruger, Jan-Louis. “Audio narration: re-narrativising film.” Perspectives: Studies in
Translatology, 18(3) (2010): 231–49. Print.
———. “Making meaning in AVT: eye tracking and viewer construction of narrative.”
Perspectives: Studies in Translatology 20.1 (2012): 67–86. Print.
Kruger, Jan-Louis and Orero, Pilar. “Audio description, audio narration – a new era in
AVT.” Perspectives: Studies in Translatology 18.3 (2010): 141–42. Print.
Matamala, Anna and Orero, Pilar. “Audiovisual translation. When modalities merge.”
Perspectives: Studies in Translatology 21.1 (2013): 2–4. Print.
Mazur, Iwona and Chmiel, Agnieszka. “Towards common European audio description
guidelines: results of the Pear Tree Project.” Perspectives: Studies in Translatology
20.1 (2012): 5–23. Print.
Palmer, Alan. Fictional Minds. Lincoln and London: U of Nebraska P, 2004. Print.
Remael, Aline and Vercauteren, Gert. “Audio describing the exposition phase of films.
Teaching students what to choose.” Trans. Revista de Traductología 11 (2007): 73–
93. Print.
Ryan, Marie-Laure, ed. Narrative Across Media: The Languages of Storytelling. Lincoln:
U of Nebraska P, 2004. Print.
Salway, Andrew. “A Corpus-based Analysis of Audio Description.” Media for All: Subtitling
for the Deaf, Audio Description and Sign Language. Eds. Jorge Díaz Cintas, Pilar
Orero, and Aline Remael. Amsterdam: Rodopi, 2007. 151–74. Print.
———. “The Computer-based Analysis of Narrative and Multimodality.” New Perspectives
on Narrative and Multimodality. Ed. Ruth Page. London and New York: Routledge,
2010. 50–64. Print.
Snyder, Joel, ed. “Audio Description Guidelines and Best Practices.” Web. 10 Aug. 2013.
http://www.acb.org/adp/ad.html.
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions
Vandaele, Jeroen. “What meets the eye. Cognitive narratology for audio description.”
Perspectives: Studies in Translatology 20.1 2012: 87–102. Print.
Vercauteren, Gert. “Towards a European guideline for audio description.” Media for
All: Subtitling for the Deaf, Audio Description and Sign Language. Eds. Jorge Díaz
Cintas, Pilar Orero, and Aline Remael. Amsterdam: Rodopi, 2007. 139–50.
Print.
This content downloaded from 128.250.144.144 on Mon, 15 Feb 2016 12:01:58 UTC
All use subject to JSTOR Terms and Conditions