Professional Documents
Culture Documents
Contributors viii
Introduction 1
ANDREW ROTHWELL, ANDY WAY AND ROY YOUDALE
PART 1
The Automated and Post-Edited Machine Translation of
Literature 25
PART 2
Machine Translation Applications in Literary Translation 89
PART 3
Corpus Linguistics, Text-Visualisation and
Literary Translation 187
PART 4
Applying Specialised Electronic Tools to
Literary Translation 235
Index282
Contributors
Editors
Andrew Rothwell is Professor Emeritus of French and Translation Studies
at Swansea University, UK. He has research interests in contemporary
French literature, especially the poet and art writer Bernard Noël, trans-
lation technologies and translation theory. He has published numerous
literary translations into English, including poetry by Bernard Noël and
two novels by Emile Zola, and is currently working on a new transla-
tion of Proust’s La Prisonnière.
Andy Way is Professor of Computing at Dublin City University, Ireland, and
Deputy Director of the ADAPT Centre for Digital Content Technology.
He is currently writing books on Sign Language Machine Translation
(MT) and European Language Equality arising from projects he coor-
dinates and a textbook on Translation and Automation. He has pub-
lished over 400 papers, was editor of the Machine Translation journal
from 2007–21, and in 2019 received the Award of Honour from the
International Association for Machine Translation for his services to
the community.
Roy Youdale is Research Associate in Translation Studies at the University
of Bristol, UK, where his research interests include corpus stylistics, text
visualisation, and using Computer-Assisted Translation (CAT) tools
for literary translation. He is the author of Using Computers in the
Translation of Literary Style: Challenges and Opportunities. He is also
a literary translator from Spanish to English.
Contributors
Aletta G. (Lettie) Dorst is Senior Lecturer in Translation Studies and English
Linguistics at Leiden University Centre for Linguistics, the Netherlands.
Her research focuses primarily on the translation of metaphor, style
and gender between different languages and cultures, between differ-
ent genres and domains, and between different modes and modalities.
Contributors ix
Avraham J. Roos did his PhD studies at the Amsterdam School of Historical
Studies, which is part of the University of Amsterdam, the Netherlands.
Roos did research comparing 106 English variant retranslations of the
Hebrew Passover Haggadah―a semi-liturgical work recited by Jews on
Passover Eve―using both algorithmically enhanced semi-close-reading
techniques and regular close reading. His goal was to visualise the differ-
ences by means of digital tools and to explain these differences against a
cultural-historical backdrop. Roos lectures at Herzog Academic College
and Orot Yisrael College in Israel.
Sasha Mile Rudan is completing his PhD at the Department of Informatics,
University of Oslo, Norway, on collaborative face-to-virtual systems for
augmenting social processes, knowledge management and dialogue. He
is a researcher at the Department of Linguistics and Philology, Uppsala
University, Sweden, on infrastructure for researching hagiographic
texts. He founded and co-leads the LitTerra Foundation which digitally
supports literature and cultural heritage. It provides digital humanities
(DH) platforms for computational text analysis (http://litterra.net/buk-
vik) and visualisation (http://litterra.net/litterra).
Sinisa Rudan is а researcher, IT developer and poetry performer. His
research is in the domain of collaboration and creativity, applied
on socio-technical ecosystems for social good/activism. He leads sev-
eral art-science multidisciplinary projects. He pursues his interests
through several regional and international positions: Co-founder of
‘ReMaking Tesla – Practices That Make a Genius’―International
Forum of Interactive and IT-Augmented Education; Co-founder
of ChaOS―an NGO uniting artists and scientists on cultural and
humane projects.
Tereza Šplíchalová is a freelance translator and a PhD student in English-
Czech Translation Studies at Masaryk University, Czech Republic. Her
research revolves around the interdisciplinarity of translation studies,
be it in relation to narratology, applied linguistics or logic and currently
focuses on the interface between Translation Studies and fictional world
theories.
Antonio Toral is Assistant Professor in Language Technology at
the University of Groningen, the Netherlands. He holds a PhD in
Computational Linguistics from the University of Alicante, Spain, and
has been researching in the field of Machine Translation (MT) since
2010. His research interests include the application of MT to literary
texts, MT for under-resourced languages, and the analysis of transla-
tions produced by machines and humans.
xii Contributors
DOI: 10.4324/9781003357391-1
2 Andrew Rothwell et al.
the style of a literary text involves the use of such figures as metaphor,
ambiguity, and repeated patterns. These figures may be present in any
type of text, but they will often be less frequent, less complex and less
subtle in non-literary texts.
(2010, p. 30)
the translation of texts from one language to another where the texts
themselves pivot broadly on the human creativity employed in their
production. They rely heavily on aesthetics for their existence, more
than texts that aim to bring about an outcome directly, as in the case
of technical texts.
(2022, pp. 5–6)
It has also been suggested that there is such a thing as ‘literary language’
which distinguishes literary from non-literary texts (Fabb, 2010). This
proposition we unreservedly reject, since (the same) language is used for
whatever people want to use it for. Literature often uses very simple lan-
guage, as exemplified by Ernest Hemingway for instance (see Section 2.1,
Chapter 2, this volume), while so-called technical/commercial translation
often uses complex and unusual language. Nor do we think that literary
texts can be clearly distinguished by their use of formal linguistic features
such as metaphor (but see Chapter 9, this volume, for a discussion of
metaphors and MT), wordplay etc. since these are employed in a wide
variety of text-types albeit to varying degrees. Toral et al. (Section 1.4,
Introduction 3
1 It should be noted that by the 15th century movable type had already been in use in East
Asia for at least two centuries (Brokaw & Kornicki, 2016).
4 Andrew Rothwell et al.
2 https://ilcs.sas.ac.uk/node/12053.
3 https://www.academia.edu/38505151/Machine_Translation_Summit_2019_Workshop
_Call_for_Papers_The_Qualities_of_Literary_Machine_Translation.
4 https://www.academia.edu/38087247/Call_for_papers_Translation_technologies_for_cre-
ative_text_translation.
5 https://calt2021conference.wordpress.com/.
6 https://www.iatis.org/index.php/7th-conference-barcelona-2021/item/2242-panels#P13.
7 https://www.asling.org/tc43/.
6 Andrew Rothwell et al.
We believe that this is the first book to bring together a collection of studies
which both illustrates the range and depth of research being undertaken in
CALT and assesses its development and future potential. The traditional
model of a single translator producing a translation of a literary work,
while probably still the norm, has been supplemented by other ways of
working including co-translation, collaborative translation where members
of a group share their drafts for comment and criticism and crowd-sourced
and fan translation where non-professional enthusiasts post their versions
of genre fiction on the internet (see Chapter 7, this volume). And it is the
availability of new electronic tools that makes all this possible.
What unites all the work displayed in this collection is a focus on the
application of theory to the practice of literary translation. In contrast
to the underlying philosophy of Descriptive Translation Studies (Toury,
2012)—that the job of TS is not to prescribe how translation should be
done, but to describe how it is done—CALT is founded on the idea that
the possibilities held out by translation technologies, and technologies
that can be used for translation, should be explored by both scholars and
practitioners of the art. In contrast to corpus-based Translation Studies,
it is concerned with how technology can be used in the process of literary
translation, and not simply in the analysis of already published translations,
valuable though that is.
that ‘while the preferred system set-ups were not so different from
those that had been conceived some time before (e.g. Forcada and
Ñeco, 1997), the hardware that facilitated the huge explosion in com-
putation required was now sufficiently powerful to allow these sys-
tems to be built in practice.’
Introduction 7
8 NMT training is computationally expensive, so some decisions are taken which although
unprincipled from a linguistic or translational perspective are valid computationally. For
instance, the vocabulary is usually limited which results in some bona fide words of either
source or target language being unknown to the system. In order to try to process such
out-of-vocabulary (OOV) items in terms of similar looking words that actually are con-
tained in the training data, prior to training and to translating the data is segmented into
8 Andrew Rothwell et al.
the NMT system is frozen as the final model and used to tackle unseen
sentences from the test set (see below).
More importantly, Bahdanau et al. (2015) introduced the concept of
‘attention’ to the basic RNN architecture, similar in spirit and intention to
source–target ‘alignments’ in SMT. The attention mechanism helped the
decoder choose the most appropriate target word for a particular source
item by allowing it to consider the most relevant other source-sentence
words to use in translation as contextual clues for translationally ambigu-
ous words in the same string.
Vaswani et al. (2017) took this a step further by using attention as
‘all you need’ to translate in their ‘Transformer’ architecture (cf. Van de
Cruys, Section 8.2.4, Chapter 8, this volume). Transformer-based systems
have achieved state-of-the-art (SOTA) performance on many benchmarks
in language processing, so it is unsurprising that Toral et al., Oliver and
Van de Cruys all use such systems for their MT experiments in this volume.
Before we describe how MT was used for literary translation elsewhere,
it is worth explaining the three different phases of building, tuning and
evaluating corpus-based (SMT and NMT) systems. Traditionally, large
amounts of high-quality parallel data were needed to build such systems
(and for SMT, large amounts of monolingual data, too). This comprised
source-language sentences and their human translations, such as the
proceedings of the European Parliament that have been used for MT engine-
training for many years (Koehn, 2005). For most official EU language-
pairs, around 2 million parallel sentence pairs exist in this collection,
which is usually enough to build good-quality MT systems, certainly as
far as SMT is concerned.9 Most of this material would be used for MT
system building (the ‘training’ set), but small subsets of it would be used as
‘held-out’ data for tuning the parameters of the system (the ‘development’
set) and a separate set for evaluating the quality of the tuned system (the
‘test’ set). Note that both the development and test sets are held out from
the training set so as not to unfairly skew (‘bias’) the performance of the
system.
subword units (not necessarily morphemes in the linguistic sense) during pre-processing.
Post-processing after translation consists of merging subword units into full wordforms
(e.g. “straight ##forward” becomes “straightforward”).
9 Khayrallah and Koehn (2018) demonstrate that with the same amount of training data
NMT models underperform compared to SMT, but with much larger corpora they can
outperform such systems. Accordingly, lots of work has gone into generating synthetic
(MT-generated) data via back-translation (Sennrich et al., 2016; Poncelas et al., 2018),
as larger amounts of authentic (human-produced) data are simply unavailable. Despite
the fact that back-translated data might be considered noisy, NMT systems trained with
such additional data typically outperform those trained solely on human-generated ‘gold-
standard’ corpora (cf. Chapter 1, this volume).
Introduction 9
10 https://translate.google.com.
11 Bringing this more up to date, Trancoso (2022) notes that ‘the volume of words that are
daily translated using publicly available machine translation engines reaches several tril-
lion words’, citing a 3-year-old SDL blog written by K. Vashee: https://www.rws.com/blog
/The-Issue-of-Data-Security-and-Machine-Translation/.
10 Andrew Rothwell et al.
In the mid-2010s, Way and Toral chose as a research topic what they
thought was the ‘last bastion of human translation’, namely the translation
of literature, as ‘the perceived wisdom is that MT [wa]s of no use for
the translation of literature’ (Toral and Way, 2015a, p. 123). Despite the
impressive improvements in MT that had been seen by that time, neither
Toral nor Way really believed that good translation quality was possible
in what is perhaps the most creative task a human translator can take on,
using an MT system.
They soon discovered that there was, in fact, very little related work
on the topic. MT had been used for some literary translation experiments;
for example, both Genzel et al. (2010) and Greene et al. (2010) examined
the use of SMT for poetry, but the former article did not test their French–
English system on poems (just news texts), while the latter evaluated the
MT output only from a qualitative perspective. Voigt and Jurafsky (2012)
examined the topic of referential cohesion of French prose and poetry
translated into English. Closer to what they planned to do was the work of
Besacier (2014), where PEMT (performed by non-professional translators)
of a short story was used to translate a short story from English into French.
However, they could not find any previous work where MT had been
applied to novels. In the data-gathering phase with which any MT developer
is familiar, they found quite a lot of literary parallel data available in the
form of e-books. Initially, then, in Toral and Way (2015a), they built a
tailored SMT system for a contemporary best-selling author, Carlos Ruiz
Zafón, and then used it to translate one of his novels—El Prisionero del
Cielo [The Prisoner of Heaven] (2011)—between two closely-related
languages, namely Spanish to Catalan; if the MT quality was too poor
between languages with such similarity, they could abandon their venture
and declare the bastion to be impregnable!
In fact, far from being poor, their results were surprisingly good. Not
only was the performance of their system better than three other systems
(including Google Translate) according to three standard automatic metrics,
but for 20% of the sentences, the translations produced by the MT system
and a professional translator (i.e. those taken from the published novel
in the target language) were identical. In addition, a human evaluation
revealed that for over 60% of the sentences, native speakers deemed the
translations of the MT system to be of the same quality as those of the
professional translator.
Duly encouraged by these findings they decided to move on to a less
closely related language pair (English to Catalan) and to extend their
experiments with the newly emerging NMT paradigm which at the time
was based on RNNs with attention (Bahdanau et al., 2015). In Toral and
Way (2018), they compared the performance of (at the time) large-scale
models of SMT and NMT, using 133 parallel novels as training data—a
Introduction 11
12 Note that some research on translationese purely in human translation contexts exists,
e.g. Meldrum (2009).
12 Andrew Rothwell et al.
13 https://www.deepl.com/en/translator.
Introduction 13
14 Note that the work Toral and Way described in this section either uses MT systems built
by the authors themselves, or freely available MT such as Google Translate. Of course,
where systems have been built which are outside the control of those testing their quality,
there is no guarantee that the test data used in these studies is not contained in the train-
ing data of those freely online MT systems, which could have skewed the results. Clearly
where authors were able to build their own systems, this can be avoided.
14 Andrew Rothwell et al.
book serve to deepen the translator’s relationship with both the ST and
the TL (target language) into which it is being translated, empowering
them to make use of the added information without applying it through
any automated process. In recent years, literary translators have begun
to appropriate a range of tools primarily developed for use in corpus
linguistics and text visualisation for their own purposes, not just to compare
features of existing translations (see especially Cheesman et al., 2017), but
to inform and guide their approach to performing new ones. Different
types of computer-enabled analysis applied to the literary ST, treated
as (part of) an electronic corpus, can yield a wealth of insights into its
thematic and lexical structures which go far beyond what can typically be
derived from a conventional linear reading, however detailed. As Youdale
(2020) has shown, both close and distant (Moretti, 2013) corpus analyses
can reveal specific features of an ST author’s style, either in general or in
a particular work (see also Chapter 3, this volume) which the translator
can then decide whether to try to emulate in the TT. Data visualisation
(see also Chapters 11 and 14, this volume) can make such decisions clearer
by showing in graphical form, for instance, where in the ST a particular
theme is prominent, how patterns of imagery cluster around a particular
character, or where the length of sentences departs from the average,
raising the question of the author’s literary objective in doing these things.
While the specific use (if any) that the translator makes of this additional
rich information remains entirely a matter for them, and the analysis tools
from which it derives in no way dictate or determine the decisions they will
subsequently make, the result of using them in a preparatory way can be a
better-informed, stylistically aware translation.
From the wealth of imaginative, sometimes ludic applications of
computer technology to literary translation described in the different
chapters of this book (see Part 5 below) it is clear that practitioners and
researchers alike are discovering myriad ways of enriching their analytical
engagement with source texts and multiplying their perspectives on the
production of new target texts. The internet-connected, tools-equipped
literary translator of the 21st century appears to have a much more powerful
relationship with their materials, both cultural and linguistic, than one
working even a couple of decades ago with little more than a book, paper
or CD-ROM dictionaries, and a word processor. Although coined with
‘human-in-the-loop’, MT-derived commercial translation in mind, the
concept of the augmented translator (Lommel, 2018) now seems at least as
applicable, if not more so, to state-of-the-art literary translation practice.
While new applications specifically designed for the literary domain are
being prototyped (custom-trained MT systems, translation comparison
and analysis tools such as KonText and Bukvik, see Chapters 10 and 14,
this volume), others are being experimentally appropriated from fields
Introduction 15
such as NLP and text visualisation. This is especially the case with the
kind of interactive environment offered by commercial CAT tools which
allow the ST and developing TT (target text) to be presented on screen in
parallel alignment, avoiding the need for the translator’s gaze to flit back
and forth between page and screen with the attendant risk of omission
while subsequently facilitating revision of the draft TT. Many CAT tools
additionally allow the integration of MT into the workbench environment,
giving the translator instant access to a range of relevant lexical (including
collocational) options, even if rarely to a fully satisfactory translation
proposal (see Chapter 5, this volume). We might speculate that such tools
will eventually be customised to offer the augmented literary translator
of the future an integrated workbench suite giving fine-grained access
to the patterns underlying the ST as well as leverage of a range of TL
linguistic resources, from online dictionaries and thesauri to MT proposals
and, through alignment and translation memory, the decisions made by
previous translators (see Chapter 14, this volume).
and the source text, as well as impacts from the stimulus of the MT version
on the translator’s voice, along with potential changes to literary language
itself. Despite the obvious improvements we are seeing in MT capability,
it is worth reflecting on whether MT systems will ever be able to incorpo-
rate the type of world knowledge in their decision-making processes that
we see the human experts using in Kolb’s chapter. Finally, in ‘Mark My
Keywords: A Translator-Specific Exploration of Style in Literary Machine
Translation’, Dorothy Kenny and Marion Winters present a keyword
analysis of a novel post-edited by the internationally acclaimed translator
Hans-Christian Oeser. The analysis identifies words that are key in Oeser’s
post-edited text compared to the machine-translated version. It goes on to
investigate whether these keywords are characteristic of Oeser’s broader
translation work and of German literary fiction in general.
Part 2, containing six chapters on Machine Translation Applications
in Literary Translation, starts with a global assessment by James Hadley
of ‘MT and CAT: Challenges, Irrelevancies, or Opportunities for Literary
Translation?’, which investigates whether MT is likely to challenge literary
translators in the short to medium term, and looks towards the develop-
ment of CAT tools specifically designed to assist them. This is followed
by a case study, ‘Retranslating Proust Using CAT, MT and Other Tools’,
in which Andrew Rothwell describes his use of current translation tech-
nologies to retranslate into English Marcel Proust’s classic 1923 novel La
Prisonnière. The CAT editing environment of Déjà Vu X3 enables, for
each segment, a close comparison between the canonical translation pub-
lished in 1929 by C.K. Scott Moncrieff and a new interactive MT version
generated by DeepL. This, he argues, brings translation issues into focus
and helps identify creative solutions, taking away some of the spadework
of translating from scratch and allowing the translator to concentrate on
higher-level issues of style and literary effect. Antoni Oliver then presents
an experiment to develop ‘Author-Tailored Neural Machine Translation
Systems for Literary Works’, translated from English into Spanish and
Catalan, for the use of students and language learners. After a general
MT engine has been trained, it is tuned using a parallel corpus of human-
translated works by the same author, and the results are evaluated using
automated metrics, comparison with the output of free online NMT sys-
tems, and various measures of post-editing effort. The following chapter
by Shuyin Zhang shifts the focus of investigation from author to genre,
and from European languages to Chinese. ‘The Machine Translation of
Chinese Fantasy (Xianxia) Novels: An Investigation into the Leading
Websites Translating Net Literature from Chinese to English’ examines
the features of Xianxia novels that make MT particularly suitable, with
pre- and post-editing by their extensive fan-base, to support online inter-
cultural communication and the dissemination of Chinese literature. Next,
Introduction 17
Acknowledgements
The ADAPT Centre for Digital Content Technology (www.adaptcentre
.ie) at Dublin City University is funded by the Science Foundation Ireland
Research Centres Programme (Grant 13/RC/2106) and is co-funded by the
European Regional Development Fund.
References
Bahdanau, D., Cho, K., & Bengio, Y. (2015). Neural machine translation by jointly
learning to align and translate. In 3rd International Conference on Learning
Representations, ICLR 2015, Conference Track Proceedings, San Diego (15p).
https://doi.org/10.48550/arXiv.1409.0473
Baker, M. (1993). Corpus linguistics and translation studies: Implications and
applications. In M. Baker, G. Francis, & E. Tognini-Bonelli (Eds.), Text and
technology: In honour of John Sinclair (pp. 233–50). John Benjamins. https://
doi.org/10.1075/z.64.15bak
Bentivogli, L., Bisazza, A., Cettolo, M., & Federico, M. (2016). Neural versus
phrase-based machine translation quality: A case study. In Proceedings of the
2016 Conference on Empirical Methods in Natural Language Processing,
Austin (pp. 257–67). https://aclanthology.org/D16-1025
Besacier, L. (2014). Traduction automatisée d’une œuvre littéraire: Une étude
pilote. Traitement Automatique du Langage Naturel (TALN). Marseille.
https://hal.inria.fr/hal-01003944
Boase-Beier, J. (2010). Stylistic approaches to translation. Routledge. https://doi
.org/10.4324/9781315759456.
Brokaw, C., & Kornicki, P. (2016). The history of the book in East Asia. Routledge.
https://doi.org/10.4324/9781315239200
Castilho, S., Moorkens, J., Gaspari, F., Calixto, I., Tinsley, J., & Way, A. (2017).
Is neural machine translation the new state-of-the-art? Prague Bulletin of
Mathematical Linguistics, 108, 109–120. https://core.ac.uk/download/pdf
/195384513.pdf
Introduction 19
Cheesman, T., Flanagan, K., Thiel, S., Rybicki, J., Laramee, R.S., Hope, J., &
Roos, A. (2017). Multi-retranslation corpora: Visibility, variation, value, and
virtue. Digital Scholarship in the Humanities, 32(4), 739–60. https://www
.academia.edu/28423745/Multi_Retranslation_Corpora_Visibility_Variation
_Value_and_Virtue
Cho, K., Van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk,
H., & Bengio, Y. (2014). Learning phrase representations using RNN encoder–
decoder for statistical machine translation. In Proceedings of the 2014 Conference
on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar
(pp. 1724–34). Association for Computational Linguistics. https://aclanthology
.org/D14-1179/
Cronin, M. (2010). The translation crowd. Tradumàtica: traducció i tecnologies
de la informació i la comunicació, 8. https://revistes.uab.cat/tradumatica/article
/view/n8-cronin/pdf_15
Daems, J. (2022). Dutch literary translators’ use and perceived usefulness of
technology. In J. Hadley, K. Taivalkoski-Shilov, C. Teixeira, & A. Toral (Eds.),
Using technologies for creative-text translation (pp. 40–65). Routledge. https://
doi.org/10.4324/9781003094159
Eagleton, T. (2014). How to read literature. Yale University Press. https://tinyurl
.com/4x5xxr2j
Fabb, N. (2010). Is literary language a development of ordinary language? Lingua,
120(5), 1219–32. https://doi.org/10.1016/j.lingua.2009.07.007
Farrell, M. (2018). Machine translation markers in post-edited machine translation
output. In Proceedings of the 40th Conference Translating and the Computer,
London (pp. 50–59). https://www.asling.org/tc40/wp-content/uploads/TC40
-Proceedings
Forcada, M. (2017), Making Sense of neural machine translation. Translation
Spaces, 6(2), 291–309. https://doi.org/10.1075/ts.6.2.06for
Forcada, M., & Ñeco, R. (1997). Recursive hetero-associative memories for
translation. In Biological and Artificial Computation: From Neuroscience to
Technology (International Work-Conference on Artificial and Natural Neural
Networks, IWANN’97, Proceedings), Lanzarote, Canary Islands, Spain (pp.
453–62). https://link.springer.com/chapter/10.1007/BFb0032504
Genzel, D., Uszkoreit, J., & Och, F.J. (2010). ‘Poetic’ statistical machine
translation: Rhyme and meter. In Proceedings of the 2010 Conference on
Empirical Methods in Natural Language Processing, Cambridge, MA (pp.
158–66). https://aclanthology.org/D10-1016/
Greene, E., Bodrumlu, T., & Knight, K. (2010). Automatic analysis of rhythmic
poetry with applications to generation and translation. In Proceedings of the
2010 Conference on Empirical Methods in Natural Language Processing,
Cambridge, MA (pp. 524–33). https://aclanthology.org/D10-1051/
Guerberof-Arenas, A., & Toral, A. (2019). Measuring readers’ engagement
in literary texts: A study comparing human translation to machine assisted
translation. In Proceedings of the 2019 Congress of the European Society for
Translation Studies, Stellenbosch, South Africa.
Hadley, J.L., Taivalkoski-Shilov, K., Teixeira, C.S.C., & Toral, A. (Eds.). (2022).
Using technologies for creative-text translation. Routledge. https://doi.org/10
.4324/9781003094159
20 Andrew Rothwell et al.
Hassan, H., Aue, A., Chen, C., Chowdhary, V., Clark, J., Federmann, C., Huang,
X., Junczys-Dowmunt, M., Lewis, W., Li, M., Liu, S., Liu, T-Y., Luo, R.,
Menezes, A., Qin, T., Seide, F., Tan, X., Tian, F., Wu, L., … Zhou, M. (2018).
Achieving human parity on automatic Chinese to English news translation.
ArXiv. https://arxiv.org/abs/1803.05567
Khayrallah, H., & Koehn, P. (2018). On the impact of various types of noise on
neural machine translation. In Proceedings of the 2nd Workshop on Neural
Machine Translation and Generation, Melbourne, Australia (pp. 74–83).
Association for Computational Linguistics. https://aclanthology.org/W18-2709/
Koehn, P. (2005). Europarl: A parallel corpus for statistical machine translation. In
MT Summit X, Conference Proceedings: The Tenth Machine Translation Summit,
Phuket, Thailand (pp. 79–86). https://aclanthology.org/2005.mtsummit-papers.11/
Koehn, P. (2020). Neural machine translation. Cambridge University Press. https://
doi.org/10.1017/9781108608480
Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N.,
Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin,
A., & Herbst, E. (2007). Moses: Open source toolkit for statistical machine
translation. In Proceedings of the 45th Annual Meeting of the Association for
Computational Linguistics Companion Volume Proceedings of the Demo and
Poster Sessions, Prague, Czech Republic (pp. 177–80). https://aclanthology.org
/P07-2045/
Koehn, P., Och, F.J., & Marcu, D. (2003). Statistical phrase-based translation.
In Proceedings of the 2003 Human Language Technology Conference of the
North American Chapter of the Association for Computational Linguistics,
Edmonton, Alberta, Canada (pp. 127–33). https://aclanthology.org/N03-1017/
Kuzman, T., Vintar, Š., & Arčan, M. (2019). Neural machine translation of literary
texts from English to Slovene. In Proceedings of Machine Translation Summit
XVII: The Qualities of Literary Machine Translation, Dublin, Ireland (pp. 1–9).
European Association for Machine Translation. https://aclanthology.org/W19
-7301.pdf
Landers, C. (2001). Literary translation: A practical guide. Multilingual Matters.
https://www.perlego.com/book/571617/literary-translation-a-practical-guide
-pdf
Lee, T. (2011). The death of the translator in machine translation. Target:
International Journal of Translation Studies, 23(1), 92–112. https://doi.org/10
.1075/target.23.1.06lee
Littau, K. (2011). First steps towards a media history of translation. Translation
Studies, 4(3), 261–81. https://www.tandfonline.com/doi/abs/10.1080
/14781700.2011.589651
Lommel, A. (2018). Augmented translation: A new approach to combining
human and machine capabilities. In Proceedings of the 13th Conference of the
Association for Machine Translation in the Americas (Volume 2: User Track),
Boston, MA (pp. 5–12). https://aclanthology.org/W18-1905
Matusov, E. (2019). The challenges of using neural machine translation for
literature. In Proceedings of Machine Translation Summit XVII: The Qualities
of Literary Machine Translation, Dublin, Ireland (pp. 10–19). European
Association for Machine Translation. https://aclanthology.org/W19-7302
Introduction 21
1 Literary-Adapted Machine Translation
in a Well-Resourced Language Pair
Explorations with More Data and Wider
Contexts
1.1 Introduction
Research on machine translation (MT) for literary texts has recently gained
a foothold in the field. A popular strand therein leverages well-established
domain adaptation techniques to tailor neural MT (NMT) systems to the
literary domain (Toral and Way, 2018; Kuzman et al., 2019, Matusov,
2019). To the best of our knowledge, the largest amount of literary in-
domain data used to date is 1.1 million bilingual sentence pairs (Toral
and Way, 2018) and 10 million monolingual sentences (Matusov, 2019).
Our study tackles a relatively well-resourced language pair and goes a step
beyond this previous work in terms of the amount of data used: we train
an English-to-Dutch MT system on around 5 million sentence pairs and
over 20 million monolingual target-language sentences.
In addition, we note that the MT systems trained in the aforementioned
studies deal with sentences in isolation, i.e. they translate one sentence at
a time, independently of the surrounding sentences. However, it has been
shown that literary texts have particularly dense reference chains (Voigt and
Jurafsky, 2012), so incorporating discourse features beyond the level of the
sentence is an important research focus for applying MT to literary texts.
Accordingly, in this study, we also build MT systems that go beyond the sen-
tence level. These are usually referred to in the literature as document-based
MT systems (Popel et al., 2019) and, to the best of our knowledge, have not
yet been explored for literary texts except for Baas (2021).
The word literary, such as in the established term ‘literary MT’, is often
used to refer to fiction novels in general. However, in this chapter, we are
also interested in a narrower, more exclusive sense of the word when used
in the phrase ‘literary fiction’ which we elaborate on here. In the rest of this
chapter, we use the term novel for fiction novels in general which also form
the domain we consider. In the book market, publishers attribute a genre
label to books as a means to communicate with their prospective readers.
Usually, a distinction is being made between genre fiction, such as romance
DOI: 10.4324/9781003357391-3
28 Antonio Toral et al.
or suspense, and general fiction, also often called literary fiction (Squires,
2009). Seen from this perspective, literary fiction is anything that is not
clearly belonging to a certain genre (cf. Squires, 2009, pp. 4–5; Baldick,
2015). What is literary is difficult to define; in the words of Squires (2009,
p. 5): ‘Literary fiction is that published by literary imprints such as [she
names several publishing houses here]’. This circular way of reasoning has
been quite common in book history and literary studies. Other theories
claim that specific formal features determine a text’s literariness (Russian
Formalism; cf. Baldick, 2008; see also Section I.1, Introduction, this vol-
ume for more on the literary/non-literary distinction). In contrast, the pro-
ject The Riddle of Literary Quality1 took an empirical and computational
approach. The concept of literature was triangulated in a data-driven man-
ner using a corpus of novels, a reader survey, and machine learning. The
large reader survey asked respondents how literary they found the novels
(Koolen et al., 2020), and machine learning was applied to estimate to
what extent the text of the novels can explain the reader’s judgments (Van
Cranenburgh et al., 2019). In our chapter, we apply this machine learning
approach to explore the literariness of translations (see Section 1.4.1 for
more details).
In this work we aim to answer four main research questions:
1 https://www.huygens.knaw.nl/en/projecten/the-riddle-of-literary-quality-2/.
Literary-Adapted Machine Translation 29
The datasets used to train, develop, and evaluate the MT systems are listed
in Table 1.1. The training data includes both in-domain corpora (parallel
English–Dutch and monolingual Dutch novels) and general-domain cor-
pora (the first 20 million sentence pairs from Paracrawl v9,2 a parallel
corpus automatically compiled from web text). The in-domain corpora
consist mostly of popular novels; although they include literary novels,
genre fiction (suspense, sci-fi, fantasy, romance) predominates. The devel-
opment set (see Section 1.2.4) consists of one English-language novel
(Boyne, 2006) and its published Dutch translation (Boyne, 2010), chosen
as a representative example of recent popular fiction. Finally, the test set
(see Section 1.2.3) consists of 31 book pairs, chosen to be a representative
sample of fiction, as it encompasses different periods (from 1884 to 2012,
M = 1983, Mdn = 2007) as well as different varieties (suspense, romance,
and literary novels). The list of novels in the test set is shown in Appendix
A (see Section I.2, Introduction, this volume, for more on how data is split
into training, development, and test sets).
*means that the English side of this corpus is synthetic, created by means of back-translation
(see Section 1.2.3).
• S1. Trained on the in-domain parallel data using the Transformer base
network.
• S2. Compared to S1, adds the in-domain monolingual data previ-
ously back-translated (Sennrich et al., 2016; Poncelas et al., 2018; see
Section I.2, Introduction, this volume, for more on synthetic data crea-
tion) using a reverse version (i.e. Dutch-to-English) of S1. After initial
training finishes, the system is fine-tuned on the in-domain parallel data.
Fine-tuning refers to a second training phase. Usually, the first train-
ing phase involves larger amounts of general-domain data, while the
fine-tuning phase uses smaller amounts of domain-specific data. In our
case, the first phase uses all training data while the second uses only the
parallel novels. The remaining systems are fine-tuned in the same way.
• S2Big. Equal to S2 except for using Transformer big.
• S3Big. Adds to S2Big the general-domain parallel data.
• S3BigDocX00. Equal to S3Big except for the fact that fine-tuning is per-
formed at document level, up to either 400 or 800 characters on each
side, following Baas (2021).
• Combo. An ensemble of S3Big, S3BigDoc400 and S3BigDoc800.
The motivation for building ensembles like Combo is that its underlying
systems may have complementary strengths since they were trained with
different granularities (isolated sentences, and small and larger amounts
of multiple consecutive sentences). For each sentence, Combo selects the
translation from the three MT systems with the highest quality according
to COMET QE.
We consider three baselines in our study, namely, two popular gen-
eral-purpose online systems (DeepL4 and Google Translate5), as well as
3 https://marian-nmt.github.io/.
4 https://www.deepl.com/translator.
5 https://translate.google.com/.
32 Antonio Toral et al.
1.3 Automatic Evaluation
Table 1.2 shows the results for BLEU and COMET, as well as the training
time taken by each system. We observe that the use of additional training
data, be it monolingual in-domain (S2 vs S1) or parallel general-purpose
(S3Big vs S2Big), results in consistent improvements according to both
BLEU and COMET. The same trend can be observed when increasing the
size of the network (S2Big vs S2). We also trained a Transformer big vari-
ant of S1. However, this did not lead to an improvement, suggesting that
the additional network parameters in the S2Big variant offer an edge only
when sufficient amounts of training data are available.
However, the two metrics did behave differently when comparing a sys-
tem trained on isolated sentences to a variant trained on multiple consecu-
tive sentences (S3Big vs S3BigDoc400 and S3BigDoc800): BLEU slightly
prefers the former while COMET prefers the latter. Given that COMET
correlates better with human judgments as well as the positive outcomes
of document-level MT for the translation of pronouns (e.g. Baas, 2021),
we decided to give COMET the benefit of the doubt in this case. Finally,
a combination based on system selection (taking as inputs the MT out-
puts produced by S3Big, S3BigDoc400 and S3BigDoc800) leads to the best
Table 1.2 Results with automatic evaluation metrics (BLEU and COMET estimate
translation quality; higher is better).
▵% denotes relative differences with respect to DeepL. MT training times given as hours for
first training plus hours for fine-tuning. Best result per metric shown in bold.
6 https://github.com/Helsinki-NLP/Opus-MT.
Literary-Adapted Machine Translation 33
1.4 Quantitative Analyses
1.4.1 Literariness Predictions
Figure 1.1 C
OMET scores by DeepL, Opus-MT and our Combo system on the 31
test set novels.
of the variation in ratings (R2; Field et al., 2012, p. 262). For a given novel,
the predicted rating is, on average, 0.74 points too low or high on the 1–7
Likert scale.
However, this evaluation was based on predicting the literary ratings
of different novels. In the present work, we are concerned with differ-
ent translations of the same work, which is arguably a more challeng-
ing task since the differences may be more subtle. Our aim is to explore
the predicted degrees of literariness across the translations and determine
whether they agree with the judgments from the human evaluation survey
(Section 1.5.2). Since the model expects 1000-word chunks, for each novel
we obtain a sequence of literariness predictions, which we reduce to a sin-
gle score per novel using the mean.
Given that DeepL is a general-purpose system, while our MT systems
have been fine-tuned on novels, we expected our systems to produce trans-
lations with higher literariness than DeepL. However, we find that DeepL
scores higher than our Combo system for 27 out of the 31 novels that
make up the test set; the four novels on which our Combo system scores
higher are Boyne, French, Tolkien, and Twain. The differences are small
but significant (p < 0.01, paired t-test; Field et al., 2012, pp. 386–93) for
26 out of the 31 novels, with Cohen’s d (Cohen, 1988, pp. 20–27) ranging
between 0.24 and 0.77, indicating small and medium effect sizes. For 17
out of 31 novels, the human translation (HT) is predicted to be more liter-
ary than any of the MT outputs (and these 17 novels have high literariness,
4.5 or higher) while the 3 least literary HTs are less literary than the MT
systems, indicating a wider range for HT (see Figure 1.2). The difference
between our sentence- and document-level systems is negligible. Lastly,
when comparing our Combo system to Opus-MT, translations by Combo
are predicted to be more literary for half the books, including the most
literary novels. The differences between the two systems are significant for
20 out of 31 novels (p < 0.01, paired t-test).
1.4.2 Lexical Variety
Figure 1.2 M
ean of predicted literariness scores across fragments (error bars show
standard error; Fields et al., 2012: 42) for the human translation and
three MT systems.
p < 0.05). Overall, these results are consistent with those on literariness
(Section 1.4.1).
1.4.3 Correlations
Finally, we study whether there are any correlations between the auto-
matic evaluation metric COMET (Section 1.3) and the quantitative metrics
used for analyses, literariness (Section 1.4.1), TTR (Section 1.4.2) as well
Literary-Adapted Machine Translation 37
as n-gram overlap. The latter was already explored by Toral and Way
(2018) to measure the novelty of a text with respect to the training data
(considered as representative of fiction), by checking how many of the
unique 4-grams (i.e. sequences of 4 consecutive words) in a novel are also
present in the training data. Thus, the lower the overlap, the more distinct
the novel is compared to the training data.
We find moderate Spearman correlations (Field et al., 2012, pp. 219–22)
for the difference between the COMET scores of the Combo system and
DeepL on the one hand, and n-gram overlap (ρ = 0.44, p < 0.05, Figure 1.3)
as well as the literariness of the human translation (ρ = –0.5, p < 0.01,
Figure 1.4) on the other. A crucial part of Figures 1.3 and 1.4 is the 0 point
on the x-axis; to the right of the shaded areas we find all the books on
Figure 1.3 L
inear correlation of the difference between the COMET scores of the
Combo system and DeepL, and n-gram overlap between our training
and test set.
38 Antonio Toral et al.
Figure 1.4 L
inear correlation between the absolute difference between the
COMET scores of the Combo system and DeepL and predicted
literariness of the human translations.
which Combo scores better than DeepL. This means that on most books
with high literariness, our system performed better in terms of (i) a higher
level of n-gram overlap indicating an advantage of our Combo system
compared to DeepL translations, but (ii) higher literariness is associated
with less of an advantage for our Combo system compared to DeepL, i.e.
the correlation shows that the difference is smaller for books which are
deemed to be more literary. Note that Pearson (parametric) correlations
did not meet all the assumptions, which is why we report Spearman (non-
parametric). However, we still show linear regressions in Figures 1.3–1.5
as that makes the interpretation easier since they have the original values
in the two axes. The bands in the plots show the 95% confidence intervals
Literary-Adapted Machine Translation 39
(Field et al., 2012, pp. 138, 211), to visualise how well the straight line
from the linear regression describes the observed data.
We find similar correlations between DeepL and Opus-MT, both in
the case of n-gram overlap (ρ = 0.44, p < 0.05; note that this refers to
n-gram overlap with the training data of our system, not the training data
of DeepL or Opus–MT) and literariness (ρ = –0.39, p < 0.05). However,
there is no such correlation between Opus-MT and the Combo system. We
do not find any significant correlation involving COMET scores and TTR.
We then explore whether n-gram overlap and literariness correlate
(Figure 1.5). They do in fact and lead to the strongest correlation so far
( ρ = –0.76, p < 0.001). A possible explanation for this is that literary works
tend to contain more creative and original language, meaning that it is
Figure 1.5 L
inear correlation between n-gram overlap of our training and test set
and predicted literariness of the human translations.
40 Antonio Toral et al.
more likely that n-grams contained in the literary works have not been
seen in the training data.
1.5 Human Evaluation
We performed a human evaluation of almost every system built in this
project in order to track the effects of the system’s development and to
compare the human evaluation to the BLEU and COMET scores in the
automatic evaluation, particularly in the case of the final, combined sys-
tem. Below we describe how we conducted two of the human evaluation
experiments and their results.
At this point in our project, the two automatic evaluation metrics, BLEU
and COMET, started to behave differently for systems trained on isolated
sentences (S3Big) and on chunks of consecutive sentences (S3BigDoc800).
Therefore, we performed a human evaluation to compare this to the pic-
ture presented by the automatic metrics (cf. Way (2018) for more on the
relative merits of both evaluation types).
In this step, a professional translator blindly compared the Dutch trans-
lations from systems S3Big and S3BigDoc800. The source text was the first
page (approx. 380 words) of Kurt Vonnegut’s short story ‘2BR02B’. They
analysed the following aspects: the number of differences between the two
translations and which translation was preferred; which error category
the non-preferred translations fell into (meaning, style, lexicon, gram-
mar, cohesion; roughly based on Short and Leech, 2007), and whether the
errors concerned accuracy or fluency. Our hypothesis was that the system
trained on chunks of consecutive sentences would perform better in terms
of cohesion.
The two translations show 26 differences; in four instances, the transla-
tion choices are considered equal. Thirteen translations by the sentence-
level system are preferred, and nine by the document-level system, i.e. the
sentence-level system shows fewer issues. The issues are distributed as
shown in Table 1.3.
We compared the systems using the TAUS Dynamic Quality Framework
(Görög, 2014), an error typology often used in MT evaluation research to
distinguish accuracy (in terms of transferring meaning) and fluency errors.
The document-level system shows six accuracy (five are serious mistransla-
tions) and seven fluency issues. The sentence-level system shows three accu-
racy and five fluency issues, plus one locale convention issue. Most notably,
the document-level system more often interprets the original wrongly (5
vs 2), once leading to a completely unintelligible translation. The docu-
ment-level system produces five lexical issues, but the sentence-level system
Literary-Adapted Machine Translation 41
[S3Big] Wehling was zesenvijftig, niet meer dan een groentje in een
bevolking van gemiddeld honderdnegenentwintig jaar.
The final stage of our study led to the hypothesis that combining MT
systems that are trained on instances with different granularity (single
sentences and multiple consecutive sentences) can lead to better output
than a generic MT system. We performed an elaborate survey evaluation
to see whether the human evaluation would complement the automatic
evaluation.
In the survey, we asked respondents to compare two different Dutch
translations, one generated by DeepL and one by our Combo system, of an
English self-contained excerpt (approx. 400–600 words) from one of the
following four novels:
In half of the PDF files (containing the original excerpt plus the two trans-
lations) respondents received, we swapped the order of the two transla-
tions so as to avoid order bias. The translations were labelled [translation
1] and [translation 2] so as to ensure a blind comparison.
We selected these four novels from our test set to represent literary fic-
tion, though with different degrees of literariness (see the HT predictions
in Figure 1.2). From these novels, we selected three types of excerpts that
we deemed would lead to significant insights: (a) the opening passage,
(b) a passage from the chunk with the highest predicted literariness (see
Section 1.4.1) for the human translation (which was not included in the
survey), and (c) a passage from the chunk with the largest difference in
predicted literariness between the human translation and the average score
for all of the MT systems. The latter choice reflects the idea that, this way,
our combined system would not be favoured.
In the survey, we asked respondents to rank the two translations and
motivate their ranking using examples. After that, we asked respondents
to indicate to what extent meaning, style, grammar, and cohesion were
preserved in each of the two translations. In addition, we asked them to
provide examples to substantiate their opinion with each of the four cate-
gories. Lastly, respondents were asked how literary they found each trans-
lation to be on a scale of 1 to 7, again complemented with the request to
provide examples as to why.
Nineteen respondents completed our survey. Sixteen respondents iden-
tify as female (three as male); all respondents are aged over 35 and have
completed higher education. Sixteen indicated they read for pleasure, and
Literary-Adapted Machine Translation 43
7 https://www.lezen.nl/sites/default/files/Leesmonitor_lr.pdf.
44 Antonio Toral et al.
Combo DeepL
Meaning 3.0 (1.4) 4.0 (0.82)
Style 3.0 (0.82) 3.8 (0.96)
Grammar 4.0 (0.82) 4.3 (0.58)
Cohesion 3.7 (1.5) 4.3 (0.58)
Literariness (1–7) 2.7 (1.5) 4.7 (2.3)
Combo DeepL
Meaning 3.4 (1.3) 3.6 (0.89)
Style 3.2 (1.3) 3.4 (0.89)
Grammar 3.4 (1.1) 4.0 (0.71)
Cohesion 3.4 (1.1) 4.4 (0.89)
Literariness (1–7) 4.4 (1.3) 5.2 (0.84)
(1) Several respondents find that the Combo translations contain ‘wrong
choices’, ‘errors’, ‘omissions’, and ‘mistranslations’, whereas the DeepL
translations contain fewer of these and are ‘more correct in word
choice and punctuation’. One respondent indicates they are ‘bothered
by the many wrong choices’ in the Combo translation. However, one
respondent disagrees and finds that in both translations ‘word choice is
wrong and too literal’, particularly in the DeepL translation.
(2) Several respondents find that the Combo translation is ‘too literal’.
One of them asks whether it was ‘perhaps done by Google Translate?’.
Another respondent indicates they ‘could hear the English original’ in
the Combo translation, and that they found the DeepL translation
Literary-Adapted Machine Translation 45
‘more active’. Lastly, one respondent finds ‘both translations are very
bad and literal and do not take Dutch into account, at the sentence as
well as the word level’.
(3) Dovetailing with these comments on source-language interference,
respondents also comment on the translations’ relationship with the
original. Several indicate that ‘the atmosphere and ‘way of speak-
ing’ dovetail better with the original’ in the DeepL translation, and
that it is ‘better in line with what the author wants to convey’. One
respondent disagrees, however, indicating that word choice in the
Combo translation ‘is closer to the original, causing its atmosphere to
be closer to the original too’ (in their case ‘equally pompous’).
(4) Many respondents refer to the fluency of both translations, prefer-
ring the DeepL translation: it is considered ‘more idiomatic’ (twice)
and ‘more natural’. Surprisingly, in terms of fluency, two respondents
indicate that in the DeepL translation ‘sentences do not always flow’
and that in the Combo translation ‘sentences flow better’. However,
two other respondents state that the Combo translation is ‘less fluent’
than the DeepL one, which ‘reads like better Dutch’.
sticks to their guns, stating that ‘the pompous language in the original
serves a purpose’, which the Combo output ‘serves better’.
Remarkably, when asked about grammaticality, all but one of the exam-
ples which respondents provided relate to verbs, more specifically to tenses,
collocations, and source-language interference. Two examples where the
DeepL translation is preferred are related to tenses: ‘had always fascinated
me’ translated as ‘had me altijd gefascineerd’ (had fascinated; note that ‘had’
in the translation is an OOV, despite being quite a common English word)
by Combo, and as ‘heeft mij altijd gefascineerd’ (has fascinated) by DeepL,
and the Combo translation ‘en vroeg wie er behalve ik ging’ (went) (correct
would be ‘zou gaan’ (would go)). An example of a wrong collocation can
be found in the Combo output ‘Hij heeft nu een hoop geld. Dat deed hij
vroeger niet’ (He has lots of money now. Before he hadn’t.). Respondents
seem to be quite satisfied with the level of cohesion of both translations,
which is reflected in the scores above (3.4 for Combo vs 4.4 for DeepL)
and supported by comments such as ‘an important ‘break’ in the original is
transferred well to both translations’, ‘both translations are rather cohesive’
or ‘equally cohesive as the original’. Two respondents clearly preferred the
DeepL translation: it has a ‘better rhythm’, whereas the Combo output ‘is
incohesive’ and ‘sometimes sounds like a bullet-pointed list, not a story (but
both cover the content of the original)’.
When asked for the motivation of their literariness scores, two respond-
ents clearly indicated they needed a definition of ‘literary’: ‘I don’t under-
stand the meaning of literary’ and ‘please define literary’. One respondent
indicated they found both translations ‘not literary because ‘nothing was
done’ with the original, it was not turned into a stand-alone or let alone
Dutch version, but they are word-for-word translations based on the first
dictionary available’.
We decided to reveal the survey’s objective and the ‘translators’ to the
respondents only after they had completed the survey: ‘With this survey
we aim to gain an insight into (1) whether a translation machine built for
literature performs better than a generic one, and (2) to what extent the
output of these translation machine types is conceived as literary’. We also
revealed to them which translation came from the literary-adapted system
and which from the ‘generic’ one, and invited them to leave a reaction. We
want to include two responses here: ‘It’s funny, I thought the first transla-
tion [Combo] was made by a machine and the second [DeepL] by a bad
translator’. The other respondent ‘instinctively’ did not like the first trans-
lation [Combo] but was ‘still surprised it was made by a machine’.
RQ4. Our findings provide support for the validity of the predictive
literariness model. Translations by DeepL tend to have slightly
higher predicted literariness than our Combo system, and for most
novels, the human translation is predicted as most literary. When
presented with fragments of four novels, the survey respondents
agreed with this ranking of DeepL and our Combo system.
Finally, we outline two main directions for future work. First, we note that
literary adaptation provides the biggest edge over DeepL on genre fiction,
even if the adaptation followed is rather general, for fiction as a whole. It
may, therefore, be better to conduct a more fine-grained adaptation at the
level of genre (e.g. suspense) or even at the author and/or translator level,
although, of course, the more fine-grained the experiment, the less data will
be available. Second, we found that the advantage of the literary-adapted
system tends to be smaller for novels with higher predicted literariness, and
in general, produces translations with lower predicted literariness and lexi-
cal variety. Methodologies to foster more natural and varied output could
be explored (Freitag et al., 2022). It may also be promising to modify the
MT system so that it prefers more literary outputs. Finally, multilingual
models and larger networks may lead to MT systems that generalise better.
Acknowledgements
We would like to thank the Center for Information Technology of the
University of Groningen for their support and for providing access to the
Peregrine high-performance computing cluster. We are grateful to Karina
van Dalen-Oskam and the reviewers for their helpful feedback.
References
Baas, M. (2021). Context is key: The effects of contextual information in English–
Dutch neural machine translation for different domains [Unpublished master’s
thesis]. University of Groningen. https://arts.studenttheses.ub.rug.nl/29168/
Baldick, C. (2008). Literariness. In The Oxford dictionary of literary terms (3rd
edn.). Oxford University Press. https://oxfordreference.com/view/10.1093/acref
/9780199208272.001.0001/acref-9780199208272-e-661
Baldick, C. (2015). Genre fiction. In The Oxford dictionary of literary terms (4th
edn.). Oxford University Press. https://oxfordreference.com/view/10.1093/acref
/9780198715443.001.0001/acref-9780198715443-e-494
Boyne, J. (2006). The boy in the striped pyjamas. David Fickling Books.
Boyne, J. (2010). De jongen in de gestreepte pyjama (J. de Jonge, Trans.). Boekerij
(Original work Boyne, 2006).
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd edn.).
Lawrence Erlbaum Associates. https://doi.org/10.4324/9780203771587
Literary-Adapted Machine Translation 49
Field, A., Miles, J., & Field, Z. (2012). Discovering statistics using R. Sage
Publications. https://nyu-cdsc.github.io/learningr/assets/discoveringstatistics.pdf
Freitag, M., Vilar, D., Grangier, D., Cherry, C., & Foster, G. (2022). A natural
diet: Towards improving naturalness of machine translation output. In Findings
of the Association for Computational Linguistics: ACL 2022, Dublin, Ireland
(pp. 3340–53). Association for Computational Linguistics. https://aclanthology
.org/2022.findings-acl.263/
Görög, A. (2014). Quality evaluation today: The dynamic quality framework. In
Proceedings of Translating and the Computer, 36, 155–64. https://aclanthology
.org/2014.tc-1.21.pdf
Junczys-Dowmunt, M., Grundkiewicz, R., Dwojak, T., Hoang, H., Heafield, K.,
Neckermann, T., Seide, F., Germann, U., Fikri Aji, A., Bogoychev, N., Martins,
A.F.T., & Birch, A. (2018). Marian: Fast neural machine translation in C++. In
Proceedings of ACL 2018, System Demonstrations, Melbourne, Australia (pp.
116–21). Association for Computational Linguistics. https://aclanthology.org/
P18-4020/
Kocmi, T., Federmann, C., Grundkiewicz, R., Junczys-Dowmunt, M., Matsushita,
H., & Menezes, A. (2021). To ship or not to ship: An extensive evaluation
of automatic metrics for machine translation. In Proceedings of the Sixth
Conference on Machine Translation, online (pp. 478–94). Association for
Computational Linguistics. https://aclanthology.org/2021.wmt-1.57/
Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N.,
Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin,
A., & Herbst, E. (2007). Moses: Open source toolkit for statistical machine
translation. In Proceedings of the 45th Annual Meeting of the Association for
Computational Linguistics Companion Volume Proceedings of the Demo and
Poster Sessions, Prague, Czech Republic (pp. 177–80). https://aclanthology.org
/P07-2045/
Koolen, C., van Dalen-Oskam, K., van Cranenburgh, A., & Nagelhout, E. (2020).
Literary quality in the eye of the Dutch reader: The national reader survey.
Poetics, 79. https://doi.org/10.1016/j.poetic.2020.101439
Kudo, T., & Richardson, J. (2018). SentencePiece: A simple and language
independent subword tokenizer and detokenizer for neural text processing. In
Proceedings of the 2018 Conference on Empirical Methods in Natural Language
Processing: System Demonstrations, Brussels, Belgium (pp. 66–71). Association
for Computational Linguistics. https://aclanthology.org/D18-2012/
Kuzman, T., Vintar, Š., & Arčan, M. (2019). Neural machine translation of literary
texts from English to Slovene. In Proceedings of Machine Translation Summit
XVII: The Qualities of Literary Machine Translation, Dublin, Ireland (pp. 1–9).
European Association for Machine Translation. https://aclanthology.org/W19
-7301.pdf
Le, Q., & Mikolov, T. (2014). Distributed representations of sentences and
documents. In Proceedings of the 31st International Conference on Machine
Learning, Beijing, China (pp. 1188–96). https://doi.org/10.48550/arXiv.1405
.4053
Matusov, E. (2019). The challenges of using neural machine translation for
literature. In Proceedings of Machine Translation Summit XVII: The Qualities
50 Antonio Toral et al.
Toral, A., Oliver, A., & Ribas Ballestín, P. (2020). Machine translation of novels
in the age of transformer. In J. Porsiel (Ed.), Maschinelle Übersetzung für
Übersetzungsprofis (pp. 276–95). BDÜ Fachverlag. https://doi.org/10.48550/
arXiv.2011.14979
Van Cranenburgh, A., Van Dalen-Oskam, K., & Van Zundert, J. (2019). Vector
space explorations of literary language. Language Resources & Evaluation 53,
625–50. https://www.researchgate.net/publication/330990258_Vector_space
_explorations_of_literary_language
Van Noord, G. (2006). At last parsing is now operational. In Verbum Ex Machina.
Actes de la 13e conférence sur le traitement automatique des langues naturelles,
Leuven, Belgium (pp. 20–42). https://aclanthology.org/2006.jeptalnrecital
-invite.2/
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, Ll, Gomez, A.N., Kaiser,
L., & Polosukhin, I. (2017). Attention is all you need. In Advances in Neural
Information Processing Systems 30: Annual Conference on Neural Information
Processing Systems 2017, Long Beach, CA (pp. 5998–6008). https://doi.org/10
.48550/arXiv.1706.03762
Voigt, R., & Jurafsky, D. (2012). Towards a literary machine translation: The
role of referential cohesion. In D. Elson, A. Kazantseva, R. Mihalcea, & S.
Szpakowicz (Eds.), Proceedings of the NAACL-HLT 2012 Workshop on
Computational Linguistics for Literature (pp. 18–25). Association for
Computational Linguistics. https://aclanthology.org/W12-25.pdf
Way, A. (2018). Quality expectations of machine translation. In J. Moorkens, S.
Castilho, F. Gaspari, & S. Doherty (Eds.), Translation quality assessment: From
principles to practice (pp. 159–78). Springer. https://doi.org/10.48550/arXiv
.1803.08409
52 Antonio Toral et al.
Waltraud Kolb
2.1 Introduction
At the International Literature Festival in Berlin in 2019, a panel com-
posed of two authors/translators and a computer linguist1 discussed the
future role of machine translation (MT) in the literary domain. It was a
remarkable early effort to draw the reading public’s attention to recent
developments in the field. Towards the end of the event, moderator Gregor
Dotzauer, a literary editor at the German newspaper Tagesspiegel, sin-
gled out Ernest Hemingway as an obvious candidate for MT because he
‘has this Latin clarity’ (Internationales Literaturfestival Berlin, 2019).
Hemingway is indeed well known for his straightforward prose, and one
of his early short stories also serves as the source text for the explorations
of literary post-editing that will be presented in this chapter. Post-editing
lies at the very heart of any investigation of the opportunities and chal-
lenges engendered by digital technologies in the literary translation domain
as MT is still widely considered ‘unsuitable for translating literature’
(European Commission, Directorate-General for Education, Youth, Sport
and Culture, 2022, p. 60), unless followed by human post-editing. So far,
we do not know very much about what post-editing actually entails for
literary translation, be it literary translation as a process or as a product.
The aim of this chapter is to examine the differences between translation
from scratch and post-editing processes in the literary domain from a
primarily cognitive perspective and identify correlations with the resulting
target texts. In which ways is post-editing a short story by Hemingway
different from translating it from scratch? Is it faster, does it involve
less typing? Do post-editors reflect on the same issues as translators and
go through the same decision-making routines? Do both engage in the
same way with Hemingway’s choices? Are post-editors primed by MT
suggestions? To explore these questions, I will compare findings from two
DOI: 10.4324/9781003357391-4
54 Waltraud Kolb
2.2 Related Research
Post-editing (PE) is ‘the activity of revising a text that has been translated
automatically by a Machine Translation (MT) system’ (O’Brien, 2021,
p. 177), usually with the aim of increasing productivity. In this context,
productivity can be measured as the effort that goes into post-editing a
machine-translated text, compared to the effort that goes into translating
the same text from scratch, with lower effort signifying higher productivity.
Krings (2001) proposed three levels to study PE effort that have since been
widely applied in research, namely the temporal level (time spent on the
task), the technical level (number of keystrokes), and the cognitive level
(e.g. number and duration of pauses). Regarding literary texts, Moorkens
et al. (2018) and Toral et al. (2018), reporting on the same experiment,
found substantial productivity gains for PE (for both statistical MT and
neural MT) compared to human translation (HT) for their (English–
Catalan) literary-adapted MT systems. Their participants, six professional
translators with experience in literary translation, were faster post-editing
than translating, generating fewer keystrokes, and PE resulted in fewer but
longer pauses (see Moorkens et al., 2018, for the participants’ perception
of the task). The same parameters were also used in a later study by
Guerberof-Arenas and Toral (2022) involving the language pairs English–
Catalan and English–Dutch with four literary translators as participants
(see Chapter 1, this volume); their results were mixed, with a lower
technical effort for PE for both language pairs but a higher temporal effort
for PE compared to HT for Dutch. For both target languages, the average
number of pauses was also lower for PE than for HT, which the authors
interpreted as an indicator of lower cognitive effort, adding, however, that
the higher cognitive effort in HT might indicate that translators ‘“think
harder” for a more creative solution’ (p. 13). Both studies used keylogging
as a tool and based their analysis of cognitive effort on pause patterns.
In order to gain more insight into the cognitive aspects of translatorial
processes as well as what happens during pauses and how translators and
post-editors make their decisions, Krings (2001) and Vieira (2015)—both
2 www.deepl.com/en/translator.
‘I Am a Bit Surprised’ 55
2.3 Research Design
In a study of literary translation processes carried out in 2009/2010 (Study
1), five literary translators (four female, one male; henceforth referenced as
T–1 to T–5) translated a short story by Ernest Hemingway into German.
Their processes were captured by keylogging and think-aloud, product data
comprised first drafts, interim, and final versions; information about the
participants’ educational and professional backgrounds and work routines
was collected through questionnaires (Kolb, 2011, 2013, 2017, 2021). To
explore literary PE and compare it to HT, a follow-up study was carried
out in 2020/2021 (Study 2), in which five different literary translators
(all female: PE–1 to PE–5) post-edited a DeepL version of the same story,
using the same type of tools. DeepL was selected as a general-domain
neural MT system as it is widely used in German-speaking countries and
would, in practice, be a likely choice for literary translators (who would
usually not be in a position to train their own MT systems; see Section I.2,
‘I Am a Bit Surprised’ 57
3 www.audacityteam.org/.
58 Waltraud Kolb
For the purposes of this study, temporal effort is measured as the total
time of the verbal record. Participants were asked to switch on Audacity
whenever they worked on the task so that the verbal record includes time
spent working at the computer (including silent periods) as well as periods
they spent on manually revising the printout of an interim target version.
It does not cover some preliminary task-related activities of unknown
length, such as setting up the tools, or periods in which they reflected on
task-related issues away from their desk, e.g. during breakfast. However,
whenever participants at a later point summarized such deliberations
for the record, these brief sequences of retrospective verbalization are
included in their total time count. On three occasions, translators put on
record their desire to pause the recording and read the target text silently;
these periods are also not included. Since the participants worked in their
authentic working environments rather than in a supervised lab, some
technical hiccups occurred which also impacted time measurements: three
participants from Group 1 (T–1, T–2, T–3) had to re-type passages that had
not been saved by Translog; these periods have been subtracted from their
total times (see Kolb, 2017 for more details) so as to not skew correlations
with the respective keystroke numbers. Given the above limitations, the
time measurements in Table 2.1 are approximate values only but still serve
to indicate general tendencies for the ten participants.
As shown in Table 2.1, the post-editors spent on average less time on
the task than the translators (approx. 24% less, a number that is slightly
lower than results from other studies, e.g. Toral et al. (2018)), with the
median being only approx. 10% lower. However, a closer look reveals
that there are significant differences between individual participants. Inter-
subject variation is particularly great in Group 1, ranging from 1 hour
‘I Am a Bit Surprised’ 59
Total time of verbal record and total number of keystrokes for transla-
Table 2.1
tors and post-editors, charcut for post-editors.
2.4.2 Cognitive Processes
The quantitative data described above, i.e. temporal and technical effort
or inter-text similarity, are shaped by the underlying cognitive processes.
In the case of PE, these processes relate to either the source text, the raw
MT output, or the target text, whereas with HT the cognitive load is dis-
tributed between two texts only rather than three. Studies on non-literary
PE have shown that the distribution of attention tends to differ between
the two modalities, with the source text usually receiving less attention in
PE than in HT (Krings, 2001; Mesa-Lao, 2014). This is hardly surprising,
given the fact that in HT drafting the first version is usually the phase in
which translators most intensely engage with the source text, its interpreta-
tion, its style, and the author’s choices; in PE, the first draft―and with it,
the first interpretation of the source text―is supplied by the MT engine.
The following discussion of cognitive processes and the role played by
MT priming will draw on data from the participants’ verbal reports. TAPs
include various kinds of verbalization, ranging from emotional reactions
(sighing, laughing) to reading out loud snippets of source or target text
or raw MT, from disjointed sequences of thinking aloud to explanations
or rationalizations addressed to the researcher (Ericsson & Simon, 1993).
‘I Am a Bit Surprised’ 61
One hot evening in Padua they carried him up onto the roof and he
could look out over the top of the town. There were chimney swifts
[Example 1] in the sky. After a while it got dark and the searchlights
came out. The others went down and took the bottles with them. He
and Luz could hear them below on the balcony. Luz sat on the bed
[Example 2]. She was cool and fresh in the hot night.
(Hemingway, 1925/1930, p. 83)
PE–5: should I check now if this is indeed a bird […] yes […] but I am
a bit surprised now that it flies around in Italy, even though it is an
American swift, but ok [laughs]
In Example 2, the post-edited target texts are also identical, while the
translations again vary. Here, however, priming occurred on the level of
interpretation. Ambiguity is a typical feature of literary writing, includ-
ing Hemingway’s, notwithstanding his ‘Latin clarity’ (Internationales
Literaturfestival Berlin, 2019), highlighting the translator’s role in inter-
pretation and meaning construction. As Hermans (2007, p. 30) phrased it,
different translations ‘flesh out the interpretive potential of a given text’.
The original sentence Luz sat on the bed is ambiguous in that the verb sat
can be read in two ways, either as the action of sitting down (setzte sich
in German) or as the state of being seated (saß in German). Both inter-
pretations make perfect sense in the context of the story, and indeed we
find both options in the translations (setzte sich in two, saß in three trans-
lations), whereas all five post-edited target texts present the same scene,
in all cases the one suggested by DeepL (setzte sich). PE–1 was the only
post-editor who considered the possibility of an alternative reading but
ultimately found DeepL’s version ‘acceptable’; the verbal data and keylogs
of the other four post-editors do not contain any indication that they were
aware of this interpretative potential or questioned the MT suggestion in
any way. In Group 1, two translators automatically, without any obvious
hesitation, chose the second option, while T–1 and T–2 explicitly reflected
on the ambiguity, T–1 even revising her choice twice before settling on her
final interpretation. In T–4’s audio-recording we can hear a slight hesita-
tion before she typed in the sentence, which may be taken as an indicator
of some state of cognitive uncertainty, though it was obviously not strong
enough to prompt further deliberations.
Example 3 is taken from a later part of the story and serves to illustrate
two more instances of priming, now regarding syntax normalization and
literalism, both indicative of post-editese (italics added):
Original: When they had to say good-bye, in the station at Milan, they
kissed good-bye, but were not finished with the quarrel. (Hemingway,
1925/1930, p. 84)
DeepL normalized the syntax of the first part of the sentence, shifting the
phrase in the station at Milan to the front of had to say good-bye. In
German, retaining the original’s syntax is possible but results in a slightly
unusual rhythm; four of the five translators opted for it, while four of
the five post-editors retained the normalized German syntax suggested by
64 Waltraud Kolb
DeepL, again, without any hesitation. Of some interest is also the preposi-
tion im (in the). Both in English and German, one could use either in the
station (im Bahnhof) or at the station (am Bahnhof), and in both lan-
guages, the second option leaves more room for interpretation (where
exactly do they say good-bye4: in front of the building, inside the building,
by the train’s door on the platform?). In both languages, the second option
would also be the more common way to describe the scene (we can only
speculate that Hemingway might have opted for in here to avoid another
at in the same phrase). The MT suggestion (im) is a literal translation
and was retained by four post-editors (the exception being PE–5), while
three of the five translators opted for the less literal and more common
(and ambiguous) am (at the). The following excerpt from PE–3’s TAP is
particularly interesting (italics highlight the words of interest and do not
indicate any change in intonation):
PE–3: [reading German MT:] als sie sich am, im Bahnhof von Mailand
verabschiedeten, mussten … verabschiedeten sie sich [laughs], [reading
English original:] they kissed good-bye, [translating:] küssten sie sich
[…] [reading German MT:] im Bahnhof, well, ok … well now we have
one, two, three saying good-bye, one of them completely wrong
We can hear how PE–3 stumbles when she first reads out loud the German
MT version, subconsciously correcting the MT output into the more com-
mon am (at the); she then immediately corrects herself and reads what is
actually on her screen (im); a second later she seems to experience another
moment of uncertainty (‘im Bahnhof, well, ok’), but does not react in any
way to it, probably because the MT error (omission of kissed) monopolizes
her attention. As Krings (2001) has shown, post-editors ‘tend to intervene
correctively in the machine translation subconsciously during reading and
to recognize the correct (and thus familiar) well-formed nature of a text
even where it is not present at all’ (p. 364). And indeed, when reading out
loud the raw MT output or their various interim or final versions, all post-
editors (with the exception of PE–4) subconsciously switched several times
between im (in the) and am (at the).
2.5 Conclusion
Post-edited target texts are synthetic texts that are part human-generated
and part machine-generated, containing both edited and unedited mate-
rial. In the examples discussed above, PE resulted in target texts that
contain distinct traces of priming through MT and are more similar to
4 Hemingway’s spelling.
‘I Am a Bit Surprised’ 65
each other than the translations from scratch. Even though the post-
editors in this study went well beyond securing accuracy and fluency,
their target texts still do not ‘flaunt [their] identity through the difference
with other translative interpretations’ (Hermans, 2007, p. 31) the same
way translations do; put differently, the voices of the post-editors are less
manifest in the target texts than those of the translators (see also Kenny
and Winters, 2020).
On average, PE was found to be faster than HT and involved fewer
keystrokes; however, inter-subject variation regarding speed and work-
ing style was considerable in both groups of participants. Regarding the
underlying cognitive processes associated with the two modalities, dif-
ferent patterns emerged. In line with the post-editors’ role as evaluators
(Guerberof-Arenas & Toral, 2022), their attention was primarily on the
raw MT output, e.g. verifying terminology (chimney swifts) or identifying
and correcting MT errors (omission of kissing); at the same time, their
engagement with the source text and its author’s choices (and on occa-
sion also their final target texts, e.g. regarding imagery) was less extensive
and remained more superficial than in the case of the translators. This
is closely linked to priming effects, which occurred on multiple levels in
the PE modality, be it on the level of terminology (chimney swifts), inter-
pretation (sat), normalization or literalism (in the station). Some of these
priming effects were more subtle than others, but in each case, several of
the post-editors were primed by the same MT suggestion. Together with
the fact that priming occurred on so many levels despite Hemingway’s
rather straightforward language, this clearly points to the great impact
of MT stimuli in a PE task, an impact which can be assumed to be even
greater with stylistically more demanding source texts. While individual
post-edited target texts are not necessarily of lower quality, MT priming
seems to be an intrinsic element of PE processes. Priming effects are here
shown to constrain the post-editor’s agency and lead to more homogene-
ous target texts. It is likely that, in the long run, they will also lead to vari-
ous impoverishments of translated literary language (see also Toral, 2019;
Vanmassenhove et al., 2019).
As the participants in this study had no prior PE training or experience,
further studies will be needed to ascertain whether priming would have
less impact for more experienced post-editors. As far as research tools go,
TAPs provide very rich data for learning more about how HT and PE differ
in terms of translatorial cognition and decision-making; in future studies,
eye-tracking might yield additional insights into patterns of attention
distribution and cognitive rhythms. We can assume that at least in some
sectors of the book market, MT with subsequent human PE will become
more widespread before long (see also Chapter 7 in the present volume).
Therefore, it seems of paramount importance that both literary translators
66 Waltraud Kolb
who take on PE work and publishers who commission it are aware of the
potential implications entailed by the specificities of the cognitive processes
associated with literary PE.
References
Angelone, E. (2010). Uncertainty, uncertainty management and metacognitive
problem solving in the translation task. In G. M. Shreve & E. Angelone (Eds.),
Translation and cognition (pp. 17–40). Benjamins. https://doi.org/10.1075/ata
.xv.03ang
Borg, C. (2022). A literary translation in the making: A process-oriented perspective.
Routledge. https://doi.org/10.4324/9781003150909
Castilho, S., & Resende, N. (2022a). Post-editese in literary translations.
Information 13(2), 66. https://doi.org/10.3390/info13020066
Castilho, S., & Resende, N. (2022b). MT-pese: Machine translation and post-
editese. Proceedings of the 23rd Annual Conference of the European Association
for Machine Translation (pp. 305–306). https://lt3.ugent.be/media/uploads/
eamt2022/proceedings- eamt2022.pdf
Castilho, S., Resende, N., & Mitkov, R. (2019). What influences the features of
post-editese? A preliminary study. Proceedings of the 2nd Workshop on Human-
Informed Translation and Interpreting Technology (pp. 19–27). https://doi.org
/10.26615/issn.2683-0078.2019_003
Čulo, O., & Nitzke, J. (2016). Patterns of terminological variation in post-editing
and of cognate use in machine translation in contrast to human translation.
Baltic Journal of Modern Computing 4(2), 106–114. https://aclanthology.org
/W16-3401/
Daems, J., De Clercq, O., & Macken, L. (2017). Translationese and post-editese:
How comparable is comparable quality? Linguistica Antverpiensia 16, 89–103.
https://lans-tts.uantwerpen.be/index.php/LANS-TTS/article/view/434
Ericsson, K. A., & Simon, H. A. (1993). Protocol analysis. Verbal reports as data.
MIT Press.
European Commission, Directorate-General for Education, Youth, Sport and
Culture. (2022). Translators on the cover: multilingualism & translation:
Report of the Open Method of Coordination (OMC) working group of EU
Member State experts. Publications Office of the European Union. https://data
.europa.eu/doi/10.2766/017
Farrell, M. (2018). Machine translation markers in post-edited machine
translation output. Proceedings of the 40th Conference Translating and the
Computer (pp. 50–59). https://www.asling.org/tc40/wp-content/uploads/
TC40-Proceedings
Guerberof-Arenas, A., & Toral, A. (2020). The impact of post-editing and machine
translation on creativity and reading experience. Translation Spaces 9(2), 255–
282. https://doi.org/10.1075/ts.20035.gue
Guerberof-Arenas, A., & Toral, A. (2022). Creativity in translation: Machine
translation as a constraint for literary texts. Translation Spaces 11(2), 184–212.
https://doi.org/10.1075/ts.21025.gue
‘I Am a Bit Surprised’ 67
3.1 Introduction
Since the mid-2010s, growing attention has been paid to how machine
translation (MT) might be applied in the translation of literary texts, to
the extent that literary MT is now emerging as a field in its own right. The
interests of researchers working in the area are many and varied (see Kenny
and Winters, forthcoming; and the other chapters in the present volume),
but one important strand focuses on the customisation of MT engines to
improve their performance in translating literary texts. Such customisation
can be individualised: engines can be (partly) trained on texts written by
particular authors (see Chapters 1 and 6, this volume) and/or translated
by particular translators, yielding what we call ‘author-specific personali-
sation’ and ‘translator-specific personalisation’, respectively (Kenny and
Winters, op. cit.). This kind of personalisation has as its implicit or explicit
aim the recreation of a certain ‘style’ in MT, namely that of the source-text
author or the individual translator. It happens upstream, that is, at the
training stage, before any new translation is attempted. It is also possible
to imagine downstream personalisation, however, in the guise of literary
post-editing. In Kenny and Winters (op. cit.) we thus characterise the post-
editing of a machine-translated text by an experienced literary translator
whose brief is to ‘make the text his [i.e. Oeser’s] own’ as a kind of down-
stream translator-specific personalisation (TSP). Whether or not the TSP
in question is successful would depend not on how the outputs were rated
using standard MT evaluation metrics (e.g. Castilho et al., 2018; Way,
2018), but rather on whether the post-edited version could be shown to
reflect the translator’s known style. In this chapter, we continue on this
track, presenting the results of an empirical investigation into literary
translator Hans-Christian Oeser’s style when he works in post-editing as
DOI: 10.4324/9781003357391-5
70 Marion Winters and Dorothy Kenny
and then to hypothesise causes for the observed formal features of texts, in
the form of particular translators, post-editors, and so on.
The particular formal features that can be studied and the ways in which
they can be processed run into the hundreds (Herrmann et al., 2015, p. 45).
Previous corpus-based and corpus-driven studies in translator style (e.g.
Baker, 2000; Saldanha, 2011; Youdale, 2020) have started, for example,
by focusing on basic statistics like lexical density, average sentence length,
(standardised) type–token ratios and other measures of lexical variety. These
studies usually branch into richer qualitative analyses on the basis of their
initial quantitative findings. Similar metrics have been used in studies that
1 We are using ‘conventional’ here to designate translation completed without the use of MT
for lack of a better term. Alternative terms used in the literature create as many problems
as they solve: ‘human translation’, for example, is often used to contrast with ‘machine
translation’ but post-editing is very much a human activity too, a fact that reduces the
discriminating capacity of the epithet ‘human’ in this instance. Likewise, the term ‘from
scratch’ seems inadequate as it appears to present translation completed without the use of
MT as a kind of ex nihilo activity, which is clearly not the case. That said, ‘from scratch’ is
so commonly used to designate translation completed without a machine-translated first
draft that it is difficult to displace.
Mark My Keywords 71
Higher-order stylistic features are derived from these basic style markers by
relating selected markers to each other and/or through the application of
various statistical techniques and tests. Among the ‘well understood’ (ibid.)
methods used to generate such higher-order features, Herrmann et al. (ibid.)
list keyness measures. In short, an item is considered ‘key’ in a given text or
corpus if it occurs with unusual frequency compared with its frequency in
another text or (often larger) corpus. The second (larger) corpus is usually
called the reference corpus, although Scott (2022) uses the term comparison
corpus, a usage we follow in this chapter. Although keyness can be attributed
to any of the items listed above (lemmata, word classes, etc.) it is frequently
word forms that are studied in corpus-based translation studies. In the
paradigm case (see Rayson, 2019), a word form (or ‘type’) is said to be a
keyword in a given text, if it occurs with greater than expected frequency in
that text given its frequency in the comparison corpus. Such a keyword is
described as a positive keyword. A negative keyword, in contrast, is one that
occurs with less than expected frequency in the text in question.
The computation of keyness usually involves comparing the relative
frequencies of word forms in the focus text and the comparison corpus,
and then conducting a test to ascertain whether any difference between
these relative frequencies is statistically significant. The score produced by
this test can then be used to rank word forms in terms of their keyness in
the text in question.
Mark My Keywords 73
There has been much debate over which statistical significance test is
most suitable for the identification and ranking of keywords, as well as
general criticism over assumptions made by many of the tests in question
(see Rayson, 2019). The log likelihood test has been favoured for some
time (ibid.), although it is not without its detractors (e.g. Gabrielatos,
2018). Jeaco (2020, p. 149), however, holds that log likelihood-based
keyword calculations ‘can be used effectively for a range of different
kinds of research, but often work best with texts and moderately large
collections of texts rather than with very large corpora at the entire
corpus level’. He adds (ibid.) that log likelihood can be used effectively in
combination with related measures such as Bayesian Information Criterion
(BIC). The corpus-processing software WordSmith Tools now integrates
BIC alongside a variety of keyness measures including log likelihood, the
settings for which are often user-adjustable.3
Keyword analysis has also come under criticism for its prioritisation
of difference over similarity in corpus studies (see e.g. Taylor, 2018). But
even the techniques most associated with the privileging of difference
can be turned to the analysis of similarity (ibid., p. 21) and it is possible,
for example, to investigate whether keywords generated for a given text
remain key in other texts by the same author or translator, given a different
comparison corpus. This is the approach taken in this study.
All told, keyword analysis offers a number of advantages in corpus
stylistic studies. Mastropierro (2018, p. 66), for example, argues that
using keywords means that the analyst works with a controlled number of
automatically generated items, whose frequency is statistically significant,
which makes for ‘an efficient way to begin a study’ and helps to minimise
researcher bias. Keyword lists typically contain content and function words,
both of which can characterise an author’s, translator’s or character’s style
(Culpeper, 2009) and thus are of interest to stylisticians. This fact can
differentiate keyword analysis from ‘most frequent word’ analyses, which
typically revolve around function words. Keywords also provide a useful
exploratory bridge between more quantitative analyses of textual features
and more qualitative, interpretative analyses, as is customary in corpus
stylistics (Herrmann, 2017). And, as Mastropierro (2018, p. 67) reminds
us, the generation of a keyword list is not an end in itself; rather, it can be
the starting point for further quantitative and qualitative examination of
what, upon careful analysis, is likely to turn out to be important meaningful
features of the text in question.
3 www.lexically.net/wordsmith/.
74 Marion Winters and Dorothy Kenny
3.5 Research Design
3.5.1 Data
3.5.2 Procedure
We start our analysis by generating a keyword list for Oeser PE using the
keywords function in WordSmith Tools 8.0 and taking DeepL MT as our
comparison corpus.10 In general, the closer the comparison corpus is to the
text under study in terms of genre and other extratextual factors the better
(Culpeper, 2009). In our case, the comparison corpus is extremely close to
the study text, differing only in the translation condition (post-edited MT
vs raw MT). This reduces the chances that the keyword analysis will high-
light words that indicate what the study text is ‘about’, which might serve
as a distractor in a study of style (Scott & Tribble, 2006). The output is a
6 Oeser 12 contains 699,315 tokens and 52,427 types and has a standardized TTR of 50.47.
7 Correct as of 08 March 2022.
8 The IDS allows users to build a customised sub-corpus from the German Reference Cor-
pus. However, there is no function to create frequency lists or other statistics. Counts for
selected types (see Section 3.6.4) were thus done by looking them up individually.
9 We would like to thank David Woolls for developing Tetrapla and optimizing it for Ger-
man.
10 The relevant settings are: maximum wanted = 500; minimum frequency of occurrence of
candidate keywords in the study text = 3; minimum BIC score = 2.5; minimum log ratio
= 0; p value = 0.1. The p value is set high as it can effectively be ignored if BIC values are
used (Scott, 2022).
76 Marion Winters and Dorothy Kenny
3.6.2 Fictional Dialogue
12 125 negative keywords were generated in Oeser PE using the same settings. Due to space
constraints, we will not elaborate on them here except to the extent that they shed light
on the positive keywords we discuss.
13 Log likelihood (LL) values can be interpreted as follows, according to Rayson (See fn. 11):
LL of 3.8 or higher is significant at the level of p < 0.05
LL of 6.6 or higher is significant at p < 0.01
95th percentile; 5% level; p < 0.05; critical value = 3.84
99th percentile; 1% level; p < 0.01; critical value = 6.63
99.9th percentile; 0.1% level; p < 0.001; critical value = 10.83
99.99th percentile; 0.01% level; p < 0.0001; critical value = 15.13
BIC scores can be interpreted as follows, according to Gabrielatos (2018):
below 0: not trustworthy (or evidence in favour of H0 according to Rayson (ibid.)
0–2: not worth more than a bare mention
2–6: positive evidence against H0
6–10: strong evidence against H0
>10: very strong evidence against H0
78 Marion Winters and Dorothy Kenny
habe ‘[I] have’). They are used in depictions of direct speech by Oeser
but not by DeepL. The elevated frequency of these forms in Oeser PE
is consistent with what is already known about Oeser’s skill in writing
convincing dialogue.
Oeser’s frequent use of interjections and (potential) modal particles
such as ach (ranked 1), doch (ranked 2), gar (ranked 3), schon (ranked
9), eben (ranked 11) and ja (ranked 19) may also be interpretable in this
light, although it should be noted that instances of ach and o (ranked 17)
tend to be straightforward replacements for the untranslated particle Oh
in DeepL MT, and that the other forms mentioned here are generally
polysemous, so each instance needs to be inspected individually to come
to sound conclusions. Even the overrepresentation of the polysemous
pronoun sie/Sie (ranked 5), one use of which is as a formal translation of
‘you’, is at least partly attributable to its use in Oeser’s fictional dialogue.
Nearly half of the keywords identified in Table 3.2 may thus be linked
to Oeser’s particular way of handling fictional dialogue. Space restrictions
prevent us from analysing these data further here. For now, we simply note
the importance of keyword analysis in leading us towards higher-order
features of texts (such as the treatment of dialogue) that differentiate the
post-editor’s work from that of the machine.
3.6.3.1 Vermutlich
In no case does Oeser’s edit affect the meaning or epistemic stance of the
speaker. The changes from the verbal ich nehme an to the adverbial ver-
mutlich do, however, influence the sentence structure, which changes from
a hypotactic structure to a simple main clause in 12 instances (Example 1)
or from a main clause to an incomplete sentence in five instances (Example
2).14 The former allows inversion, which Oeser is known to like (see
Section 3.3). The latter, it could be argued, yields a better approximation
of spoken language.
Example 1
(1a) Isherwood ST I suppose I still regarded marriage as a kind of
game.
(1b) DeepL MT Ich nehme an, ich betrachtete die Ehe immer noch
als eine Art Spiel.
(1c) Oeser PE Vermutlich betrachtete ich die Ehe noch immer als
eine Art Spiel.
Example 2
(2a) Isherwood ST ‘Yes. I suppose so.’
(2b) DeepL MT „Ja. Ich nehme es an.”
(2c) Oeser PE „Ja. Vermutlich.”
14 Note that particular examples are chosen as they attest the use of the word form in ques-
tion in a relatively short sentence with few ‘distractors’. By their very nature they may
underrepresent the amount of editing that Oeser does across the text as a whole.
80 Marion Winters and Dorothy Kenny
Example 3
(3a) Isherwood ST Yes, the Jane-situation still existed, and would
continue to exist, probably, for a long time.
(3b) DeepL MT Ja, die Jane-Situation existierte noch und würde
wahrscheinlich noch lange bestehen bleiben.
(3c) Oeser PE Ja, die Jane-Situation existierte noch und würde ver-
mutlich noch lange existieren.
3.6.3.2 Bestimmt
and some edits (five instances) may have been made in an effort to create
natural-sounding dialogue (Example 4).
Example 4
(4a) Isherwood ST ‘I’m sure you look cute in it.’
(4b) DeepL MT „Ich bin sicher, du siehst darin süß aus.”
(4c) Oeser PE „Du siehst bestimmt niedlich darin aus.”
Example 5
(5a) Isherwood ST I would miss her, certainly.
(5b) DeepL MT Ich würde sie sicherlich vermissen.
(5c) Oeser PE Bestimmt würde ich sie vermissen.
3.6.3.4 Gewiss
Example 6
(6a) Isherwood ST Certainly not because I imagine you’ll disap-
prove of him.
(6b) DeepL MT Sicherlich nicht, weil ich mir vorstelle, dass du ihn
missbilligen wirst.
(6c) Oeser PE Gewiss nicht, weil ich glaube, dass du ihn missbilligen
wirst.
Example 7
(7a) Isherwood ST ‘Sure, I understand all about that, Bob.
(7b) DeepL MT Sicher, ich verstehe das alles, Bob.
(7c) Oeser PE Gewiss, das alles verstehe ich, Bob.
The flip side of Oeser’s preference for gewiss seems to be a dislike of the
synonymous sicher and sicherlich. He reduces their frequency considerably,
by a third in the case of sicher (from 168 to 106 instances) and by three-
quarters in the case of sicherlich (from 46 to ten instances). Unsurprisingly,
both sicher and sicherlich thus appear as negative keywords for Oeser PE
when DeepL MT is the comparison corpus.16
3.6.3.5 Weshalb
While the above three keywords can be grouped on semantic and prag-
matic grounds, weshalb stands alone as the only interrogative form in the
keyword list in Table 3.2. It translates as ‘why’ and is used by Oeser on
all occasions as an interrogative or relative adverb to replace the more
common synonymous form warum in the DeepL output. It is of par-
ticular interest to us, as its use has previously been identified as one of
Oeser’s ‘quirks’ (Kenny & Winters, 2020, p. 143). Although Oeser regards
weshalb as being of higher register than warum (personal communication,
22/04/2022), the decision to change warum to weshalb is based entirely on
his personal preference.
Table 3.6 gives the absolute and relative frequencies for the four key-
words addressed above in the texts/corpora in question, sorted in descend-
ing order of absolute frequency in Oeser PE.17
As the bolding in Table 3.6 highlights, vermutlich, bestimmt and gewiss
are all relatively more frequent in Oeser PE than in his wider work (Oeser
12). Only weshalb occurs relatively less frequently in Oeser PE than it
does in Oeser 12. All four forms are relatively more common in Oeser PE
than in German Original Literature. The comparison between Oeser 12
and German Original Literature suggests that Oeser generally uses ver-
mutlich and weshalb more than other writers in the target language while
he is less likely than others to use bestimmt and gewiss. In his post-editing
work then, he appears to be asserting his attested lexical style in the use
of vermutlich and weshalb. He uses bestimmt and gewiss, however, not
because they are characteristic of his style, but rather to avoid using sicher
and sicherlich, which he generally disprefers. That said, he ‘overuses’ them
in his post-edited work most likely because of the influence of the MT
(although the source text could also be exerting an influence here) (see
Table 3.7).
As indicated above, differences in frequencies can be tested for statistical
significance using log likelihood, which compares observed and expected
Table 3.7 Lexical items Oeser generally disprefers (sorted by freq. in Oeser PE).
3.7 Conclusions
This chapter has explored the use of keywords as a way of eliciting data
for the analysis of a post-editing translator’s style. The generation of an
unlemmatised keyword list proved to be an efficient way of eliciting unbi-
ased data for further examination, leading us to at least one higher-order
Mark My Keywords 85
References
Baker, M. (2000). Towards a methodology for investigating the style of a literary
translator. Target 12(2), 241–266. https://benjamins.com/online/target/articles
/target.12.2.04bak
Burrows, J.F. (2002). Delta: A measure of stylistic difference and a guide to likely
authorship. Literary and Linguistic Computing 17, 267–287. https://doi.org/10
.1093/llc/17.3.267
Castilho, S., Doherty, S., Gaspari, F. & Moorkens, J. (2018). Approaches to
human and machine translation quality assessment. In J. Moorkens, S. Castilho,
F. Gaspari & S. Doherty (Eds), Translation quality assessment: From principles
to practice (pp. 9–38). Springer. https://www.researchgate.net/publication
/326386260 _ Approaches _ to _ Human _ and _ Machine _ Translation _ Quality
_Assessment_From_Principles_to_Practice
Culpeper, J. (2009). Keyness: Words, parts-of-speech and semantic categories in
the character-talk of Shakespeare’s Romeo and Juliet. International Journal
of Corpus Linguistics 14(1), 29–59. https://www.researchgate.net/publication
/233493422_Keyness_Words_parts-of-speech_and_semantic_categories_in_the
_character-talk_of_Shakespeare's_Romeo_and_Juliet
Gabrielatos, C. (2018). Keyness analysis: Nature, metrics and techniques. In
C. Taylor & A. Marchi (Eds.), Corpus approaches to discourse: A critical
review (pp. 225–258). Routledge. https://www.researchgate.net/publication
/319208347_Keyness_analysis_Nature_metrics_and_techniques
86 Marion Winters and Dorothy Kenny
Scott, M. (2022). How key words are calculated. WordSmith tools help. https://
lexically.net/downloads/version_64_8/HTML/keywords_calculate_info.html
Scott, M. & Tribble, C. (2006). Textual patterns: Key words and corpus analysis
in language education. John Benjamins. https://www.researchgate.net/
publication/238737537_Textual_Patterns_Key_Words_and_Corpus_Analysis
_In_Language_Education
Taylor, C. (2018). Similarity. In C. Taylor & A. Marchi (Eds.), Corpus approaches
to discourse: A critical review (pp. 19–37). Routledge. https://doi.org/10.4324
/9781315179346
Toral, A. (2019). Post-editese: An exacerbated translationese. In Proceedings of
Machine Translation Summit XVII: Research Track (pp. 273–281). European
Association for Machine Translation. https://aclanthology.org/W19-6627
Way, A. (2018). Quality expectations of machine translation. In J. Moorkens, S.
Castilho, F. Gaspari & S. Doherty (Eds.), Translation quality assessment: From
principles to practice (pp. 159–178). Springer. https://doi.org/10.48550/arXiv
.1803.08409
Winters, M. (2007). F. Scott Fitzgerald’s Die Schönen und Verdammten: A corpus-
based study of speech-act report verbs as a feature of translators’ style. Meta
52(3), 412–425. https://doi.org/10.7202/016728ar
Winters, M. (2009). Modal particles explained: How modal particles creep into
translations and reveal translators’ styles. Target 21(1), 74–97. https://benjamins
.com/online/target/articles/target.21.1.04win
Winters, M. (2015). Hans-Christian Oeser: Translation strategies of a literary
translator. In S. Egger (Ed.), Cultural/literary translators: Selected Irish-German
biographies II (pp. 210–224). Wissenschaftlicher Verlag Trier.
Woolls, D. (2021). Tetrapla plus. CFL Software Ltd.
Youdale, R. (2020). Using computers in the translation of literary style. Routledge.
https://doi.org/10.4324/9780429030345
Part 2
Machine Translation
Applications in Literary
Translation
4 MT and CAT
Challenges, Irrelevancies, or Opportunities
for Literary Translation?
4.1 Introduction
Although technology explicitly created for the facilitation of translating
texts has a history that is virtually as old as Translation Studies as an
academic field of study, both having developed in the wake of the Second
World War (Tymoczko, 2018, p. 156), the question of whether and to what
extent such technology holds any benefit for the translation of specifically
literary texts has been largely moot for much of that time. Until recently,
Machine Translation (MT) technology had not reached the point at which
literary texts could be considered seriously as points of analysis, and
Computer-Assisted Translation (CAT) tools were similarly rudimentary
from the perspective of operations key to literary translation. Meanwhile,
literary translation has come to be questionable in multiple contexts in
terms of its commercial sustainability, particularly from the perspective of
paying its translators fairly (Taivalkoski-Shilov, 2018, p. 695).
Here, ‘literary translation’ refers to the translation of texts that are
engaged with primarily for aesthetic, rather than functional, reasons (see
also Section I.1, Introduction, this volume). Functional texts, such as
instruction manuals, legal statutes, and recipes describe phenomena and
facilitate actions in the physical world. Literary texts, on the other hand,
frequently evoke or depict the physical world, but the reactions they elicit
in readers are primarily emotional rather than physical. Naturally, literary
texts may have perlocutionary, educational, or informative effects in the
same way that sculptures, paintings, pieces of music or any other artistic
artefact may have. However, these effects are generally secondary to, or
caused by a reaction to, the aesthetic qualities of the texts. Thus, while
advertisements have the underlying perlocutionary aim of encouraging
observers to purchase the products or services being advertised, the trigger
encouraging them to do so is the aesthetic, and thereby emotional, reaction
to the text itself rather than a functional message such as ‘Step 1: select
product. Step 2: pay for product’.
DOI: 10.4324/9781003357391-7
92 James Luke Hadley
The term used here for the translation of functional texts is ‘technical
translation’ in order to avoid confusion with the functionalist school
of translation, which is not exclusive to functional texts (Snell-Hornby,
2006, p. 162). Technical translators have been able to increase their
efficiency dramatically with the use of CAT tools, producing many times
more translated words than they could have without them, and can now
make use of MT functionality to assist in the translation process itself,
making technical translation not only a viable commercial activity but
a lucrative one for the highly skilled. Literary translators, on the other
hand, have certainly benefitted from many aspects of technology, including
easily searchable online dictionaries and corpora, digitised archives, and
word processing technology. However, generally speaking, they have
not historically benefitted from technological tools that are specifically
designed to assist them in the translation of literature (Ruffo, 2022, pp.
26–29). The economics of literary translation are also very different from
those of technical translation. In many contexts, national bodies such as
cultural outreach centres subsidise the translation of national literature
into the languages of the world because the amounts of money consumers
are accustomed to paying for literary works generally do not differ
significantly between translated and non-translated works. Therefore, the
additional work associated with translating a text into another language
requires a subsidy. Alternatively, translators can be paid the equivalent
of a tiny amount per hour for completing the work, effectively turning
literary translation into a hobby, and at the same time preventing it from
being a viable career choice per se for many.
Before CAT tools, including translation memories (TMs), electronic
dictionaries, and project-management software, translation in general was
a labour-intensive, expensive activity. Logistics effectively limited how
much material could be translated each year. As the world has gradually
become ever more interconnected since the end of the Second World War
though, translation has played an ever more important role in facilitating
communication. Without the assistance of CAT tools, it would be unfea-
sible for the same amount of material to be translated as we see today.
Possibly because necessity is the mother of invention, the technology, the
translators, and the technical-translation industry have continually shifted
to keep pace with demand. Many times more translation work can now be
expected and delivered by a single translator than was previously imagi-
nable. Thus, translators without access to such tools can quickly be priced
out of the market. In the third decade of the twenty-first century, it is not
unreasonable to assert that many professional technical translators mak-
ing no use of CAT tools would struggle to remain competitive in terms of
output and may be severely limited in terms of the projects they are able
to take on since so many require the use of specific TMs or termbases.
MT and CAT 93
4.2 Irrelevancies
For many years, from the perspective of MT and CAT, literature was
simply irrelevant. Similarly, from the perspective of literature, MT and
CAT tools were irrelevant. MT had originally surfaced as a development
of cryptography during the early years of the Cold War when the USA and
its allies had a particular interest in understanding the contents of Russian-
language communications (see Harper, 1955, p. 41; Lennon, 2014, p. 137).
These machines belonged to the same tradition as the early computers that
had been put to work by codebreakers during the Second World War to
decode enemy transmissions (Gambier, 2018, pp. 132–133). The principle
underpinning them was that a mechanical approach to the deciphering of
cyphers, or indeed whole languages, could yield quicker, more reliable,
and less easily problematised results than the comparatively messy, highly
time- and resource-intensive manual approaches that made use of the
human brain. This principle correlates with Saussurean linguistics, under
which paradigm arbitrary lexical signs are used as signifiers and can be
94 James Luke Hadley
and Kolb and Miller, 2022), are not included in the functionality of such
tools or may be impeded by the assumptions underpinning the tools.
Therefore, one can be seen as something approaching an irrelevance from
the perspective of the other.
4.3 Challenges
In more recent years, as the quality of MT outputs has improved and CAT
tools have become further integrated into the technical translator’s work-
flow such that clients could insist on the use of specific termbases or TMs,
the attitude that MT systems and CAT tools in general are simply irrel-
evant to the production of literary translations has begun to give way to a
sense that MT systems and CAT tools might pose an existential threat to
the human translator’s livelihood.
Partly because it is a labour-intensive, poorly paid endeavour, trans-
lators working in literary translation often do not do so exclusively
(Wechsler, 1998, p. 22). Many also work as technical translators and have
seen the effects of technology’s arrival on the technical-translation industry
first-hand. They have seen that the advent of CAT technology has tended
to increase the pace and urgency of the industry as a whole, has shifted the
focus of translation away from whole texts towards individual segments,
has required translators to invest in the correct software, and in some
cases has introduced a greater proportion of editing and post-editing to
the workflow of translators than had previously been the case. At the same
time, literary translators have observed the gradual creep of MT systems
into fields of technical translation, such as legal and medical areas, which
have historically been lucrative because of the specialist knowledge they
require, and observed how MT systems have been able to cope relatively
well with the formulaic text types and terminology that are often found in
these fields (see, for example, Wiesmann, 2019).
The concern among literary translators who observe these developments
may be about what the literary translation industry might look like if it
were to come under the same or similar pressures after the introduction
of technology. They may imagine a world in which novels and poems are
similarly translated by the segment, where some segments are pre-filled
based on other works of literature that have already been translated, or
where their role as translators shifts to quality control for the outputs of
MT systems. They may imagine the already meagre fees associated with
literary translation being squeezed even more by unscrupulous editors
counting pre-translated segments with an eye on the bottom line.
Thus, in every respect, if the only translation technology currently in
existence which is made for the translation of technical texts is anything to
go on, the introduction of technology to the literary-translation industry
could be perceived as a threat to the workflow, livelihood, and job
MT and CAT 97
4.4 Opportunities
It is easy in an already stretched industry to assume that any change will
necessarily constitute a challenge. Thus, what risks being missed in discussions
related to the introduction of technology to a sector which has traditionally
tended towards manual approaches is the potential opportunities associated
98 James Luke Hadley
with augmenting human workflows with tools that have not previously
existed. While it is not true to say that literary translation during the third
decade of the twenty-first century has the same relationship with technology
as it had twenty years before, it is true to say that the scale and nature of
technology’s impact on the workflow of the translators involved is very
different to those working in many forms of technical translation.
In many contexts, and in the case of much technical translation, it has
become difficult or even impossible for a professional translator to survive
without the use of CAT tools of one form or another. Particularly in the
case of multinational companies and those for whom business pivots on
translation, TMs, and the ability of a translator to make use of them have
become prerequisites to maintain consistency (Taivalkoski-Shilov, 2018,
p. 691). Translators in such situations are routinely sent TMs or termbases
which must be used (ibid). These tools substantially increase productiv-
ity on the part of translators (ibid.). They systematise and speed up the
process of translating by segmenting texts and taking formatting out of
the equation. They prefill segments for which a similar segment exists in
the memory and suggest translations for terminology already listed in the
termbase (Brkić et al., 2009, pp. 354–355). Viewed from the companies’
perspective, the technology also increases efficiency by reducing or obviat-
ing the reproduction of work in the form of retranslating already translated
segments (Taivalkoski-Shilov, 2018, p. 691). Ultimately, this functionality
also creates the opportunity for the companies to separate the segments
which have been translated by the human from those prefilled by the CAT
tool and adjust the fee paid to the translator accordingly. Thus, the trans-
lator’s fee is no longer tied to the length of the document itself, but to the
fraction of that work for which their human input has been indispensable
(Gouadec, 2007, p. 18).
As already noted, this model tends to work on the assumption that
the translation provided for any given segment in one document could be
translated in a similar or identical way if it appeared in another document
or in a different part of the same document. In many contexts, particularly
in the context of technical translation, this assumption may be relatively
safe. However, even in the context of technical documents, it does not
always hold true and has led to the so-called ‘peep-hole’ phenomenon,
whereby translators’ only access to the text they are involved in translating
is through individual or disembodied segments (Melby et al., 2014, p. 671).
In the case of literature especially, working this way may lead to additional
work. Sentences that are structurally similar in the source text may have
substantially different perlocutionary functions from one another because
of the different contexts underpinning different texts. Thus, translators
may be forced to rework inappropriately reproduced segments rather than
translate them afresh.
MT and CAT 99
1 www.ofai.at/software/puncat.
2 www.hup.harvard.edu/collection.php?cpk=1031.
100 James Luke Hadley
language does not exist in the target language, and therefore, that the
translation will necessarily need to do something else.
This latter case can be illustrated by the idea of translating Shakespeare’s
sonnets into Japanese. Shakespeare’s sonnets belong to the English son-
net tradition which, like those of other European languages, hinges on its
strict use of metre and end rhyme. It makes use of iambic pentameter, a
term indicating five unstressed syllables interspersed by five stressed syl-
lables on each line and a pattern of three alternately rhymed quatrains
terminating in a final couplet, meaning that the ends of each of the lines
rhyme in the pattern ABAB CDCD, EFEF, GG (Walters et al., 2016, p. 5).
Unlike English and ancient Greek, Japanese does not function on the basis
of iambs, or patterns of stressed and unstressed syllables. Neither does
Japanese function on the same basis of syllable count as languages such
as French and Spanish. Rather, the basic unit of the Japanese language is
the mora, which is somewhat comparable to a syllable. It includes either a
consonant and a vowel, a vowel only, or a terminal ‘n’ only (Poser, 1990,
pp. 78–79). Although the pitch and occasionally tone of each mora shifts
according to the word’s pronunciation and position in the sentence, the
stress is constant (ibid).
It would be an overstatement to claim that the translation of rhyme
is impossible in Japanese. However, the structure of the language, where
verb endings are highly regular and fall by default at the end of sentences
has hampered the development of a tradition of end rhyme in poetry
(Chamberlain, 2013, pp. 29).
Thus, what are arguably the two defining features of the English sonnet
as it exists in Shakespeare's source texts would not generally be present in
Japanese translations. Does this mean that the texts cannot be translated at
all? Certainly not. But it does mean that a translator’s judgement to focus
on other elements of the text than these key features is both fully justifiable
and indeed expected.
Over the coming years, we will likely see the development of a suite of
individual tools that aim to facilitate the work of translators of literary
texts. It would be relatively simple for such tools to automatically identify
features such as rhyme patterns (see Chapter 8, this volume), syllable
counts, and sentence lengths in both source texts and draft translations as
a means of deciphering the formal features of the source text in question
in the first place and then as a means of assisting the translator in checking
the translation they have produced. CAT tools already routinely check for
formatting tags, such as hyperlinks or quotation marks present in source
texts but not in translations so as to flag these as possible errors. CAT tools
focused on literary translation could apply a similar approach to other
features, including, for example, rhymes. However, in addition to being
maximally useful to literary translators, such tools would need to include
MT and CAT 101
the added flexibility of not only attempting to enforce what exists in the
source text or what is theoretically possible in a target text, but also what
the translator’s chosen strategy is with regards to the work of literature.
In a similar way that CAT tools in the domain of technical translation
have developed over time, not only in terms of quality but also in terms
of functionality, it may be that translation technology that is particular
to the translation of literature will develop in an iterative manner. Since
observing, identifying, categorising, highlighting, and matching such
formal features in texts is not a radical departure from functionality already
relatively common in text-based applications, most notably, perhaps, in
existing CAT applications, it could be that these will embody some of the
first steps in creating translation tools specifically for literary translation.
A subsequent step may be to integrate some MT functionality. This could
give gloss translations of individual terms found in source texts. Next,
such functionality may be extended to give a range of rough translations
for whole segments, with the aim of prompting the human translator’s
creativity. An early version of this functionality already exists in some
CAT systems, such as Phrase TMS (formerly Memsource Cloud), though
it is not intended specifically for literary translation.3 Later, the systems
used for identifying and categorising formal features could be modified to
inform the outputs of the MT system, such that lists of suggestions could
be given to the human translator of solutions that reproduce any specified
textual pattern such as rhyme. Part of this stage would probably include
functionality that would allow the translator to rank the list of identified
formal features in terms of priority since translation often involves
balancing many competing interests.
A later stage would be to shift the relationship between the technology
and the human slightly, such that instead of the machine making suggestions
to the human for solutions to isolated translation problems, the human
may begin setting parameters for the MT output which they could then
postedit. It is unlikely that TMs of the kinds seen in technical translation
would prove useful in the context of literary works since style can be so
idiosyncratic on the one hand, and such a large part of the appeal of the
work on the other. None of the increments would remove agency from
the literary translator altogether, since the human would be required to
select the parameters underpinning the translation throughout and would
ultimately be needed to postedit the output.
An evolution of this kind would bring literary translation to a similar
stage of technological development that some areas of technical translation
find themselves in at the beginning of the third decade of the twenty-first
3 https://phrase.com/.
102 James Luke Hadley
century. It is clear that the impact on the industry as a whole would likely
be as profound as the impact of the gradually advancing CAT tools has
been for technical translation. Each increment would hold the potential
to increase a translator’s efficiency in terms of the number of words they
would be able to translate in one day. This increased efficiency could mean
that a vastly greater amount of translated literature could find its way
into circulation since the translation of literature could be made financially
viable, where today in many contexts, the amount of work required to
produce a single translated novel can outweigh the revenue that can be
produced by selling it. What is less easy to see at this stage is whether making
literary translators substantially more efficient would have the effect of
saturating the market with translated literature. If it did, this could either
lead to subsidies being increased to match the market-based competition,
or cut altogether, since the current need to incentivise literary-translation
practice would no longer hold. Either way, it seems likely that if literary
translation were made substantially more efficient than it currently is, the
subsidies would be less necessary as the industry as a whole would become
more efficient. Viewed from this hypothetical perspective, the introduction
of technology can be seen as an opportunity.
4.5 Discussion
This chapter set out to ask whether technology can be seen most reasonably
as an irrelevance, a challenge, or an opportunity for literary translation. It
almost goes without saying that all three and many more perspectives are
simultaneously valid. However, the perspectives that tend to characterise
literary translation’s relationship with technology at any given point in
history may correspond to those which are most closely influenced by what
is and what is not technically possible at the time.
In the early years of translation technology, when the overwhelming
focus was on technical texts, and the use of technology was more of a
luxury than a prerequisite, it was, from the perspective of literature, largely
irrelevant. MT systems were not even at the stage of being able to process
technical texts reliably, and while CAT tools were used in some contexts,
the idea of investing in expensive technology to translate literature would
likely have been a relatively extravagant one. Therefore, under this
paradigm, where the technology was manifestly unable to work with most
forms of literature, it is difficult to see it as anything other than irrelevant
from a literary translation’s point of view.
But, as MT systems passed from statistical to neural models in the
second decade of the twenty-first century, the quality of the outputs
increased substantially, such that post-editing came to be conceivable as
part of the workflow of professional translation for more than the most
formulaic texts (Matusov, 2019, p. 11). At the same time, CAT tools were
MT and CAT 103
However, it may also be that resisting the use of technology per se could
hasten that time rather than hinder it.
References
ALPAC. (1966). Language and machines: Computers in translation and linguistics
(1416). National Academy of Sciences. https://nap.nationalacademies.org/
resource/alpac_lm/ARC000005.pdf
Brkić, M., Seljan, S., & Bašić Mikulić, B. (2009). Using translation memory to
speed up translation process. In Proceedings of the 2nd International Conference
‘The Future of Information Sciences: INFuture2009—Digital Resources and
Knowledge Sharing’, Zagreb (pp. 353–363). Department of Information
Sciences, Faculty of Humanities and Social Sciences, University of Zagreb.
http://infoz.ffzg.hr/INFuture/2009/papers/INFuture2009.pdf
Chamberlain, B. H. (2013). The classical poetry of the Japanese. Routledge. https://
doi.org/10.4324/9781315012384
De Saussure, F. (1916). Cours de linguistique générale. Payot.
Gambier, Y. (2018). Institutionalization of translation studies. In L. d’Hulst & Y.
Gambier (Eds.), A history of modern translation knowledge: Sources, concepts,
effects (pp. 131–142). John Benjamins. https://doi.org/10.1075/btl.142.24gam
Gouadec, D. (2007). Translation as a profession. John Benjamins. https://doi.org
/10.1075/btl.73
Harper, K. E. (1955). Translating Russian by machine. Journal of Communication,
5(2), 41–46. https://doi.org/10.1111/j.1460-2466.1955.tb01099.x
Kolb, W., & Miller, T. (2022). Human–computer interaction in pun translation. In
J. L. Hadley, K. Taivalkoski-Shilov, C. S. C. Teixeira, & A. Toral (Eds.), Using
technologies for creative-text translation (pp. 66–87). Routledge. https://doi.org
/10.4324/9781003094159
Lennon, B. (2014). Machine translation: A tale of two cultures. In S. Bermann &
C. Porter (Eds.), A companion to translation studies (pp. 133–146). John Wiley
and Sons. https://doi.org/10.1002/9781118613504.ch10
Macdonald, N. (1954). Language translation by machine—A report of the first
successful trial. Computers and Automation, 3(2), 2–10.
Matusov, E. (2019). The challenges of using neural machine translation for
literature. In Proceedings of Machine Translation Summit XVII: The Qualities of
Literary Machine Translation, Dublin, Ireland (pp. 10–19). https://aclanthology
.org/W19-7302
Melby, A. K., & Wright, S.-E. (2014). Translation memory. In S.-W. Chan (Ed.),
Routledge encyclopedia of translation technology (pp. 662–677). https://www
.routledgehandbooks.com/doi/10.4324/9781315749129.ch41
Poser, W. J. (1990). Evidence for foot structure in Japanese. Language, 66(1), 78–
105. https://muse.jhu.edu/article/452744
Ruffo, P. (2021). In-between role and technology: Literary translators on navigating
the new socio-technological paradigm [Unpublished PhD dissertation]. Heriot-
Watt University. https://www.academia.edu/45534446
Ruffo, P. (2022). Collecting literary translators' narratives: Towards a new
paradigm for technological innovation in literary translation. In J. L. Hadley,
MT and CAT 105
5.1 Introduction
Until recent years, specialised translation tools were assumed to be
unusable, by definition, by literary translators:
DOI: 10.4324/9781003357391-8
Retranslating Proust 107
1 https://atril.com/product/deja-vu-x3-professional/.
108 Andrew Rothwell
5.2 Retranslation Brief
Until the publication of the new translation of the complete In Search
of Lost Time by Penguin Books, including Carol Clark’s acclaimed The
Prisoner (Proust, 2003), Scott Moncrieff’s version was (and for many
readers still is) the voice of Proust (1871–1922) in English. The reservation
most often expressed about his translations is that they changed Proust’s
tone, making him sound precious, over-ornate, and affected: in his essay
‘Proust in English’, for instance, John Sturrock writes of ‘the unhappy way
in which Scott Moncrieff contrived to play down the stringent intelligence
of his author by conveying it in an English prose that is constantly looking
to prettify’ (Sturrock, 1998, p. 115). His English seems to want to live up
to the conventions of Edwardian novel-writing, systematically preferring
‘high-style’ (less common) lexical options (e.g. ‘upon’ for ‘on’, ‘behold’
for ‘see’, ‘impart’ for ‘give’, ‘presently’ for ‘soon’, ‘notwithstanding’ for
‘despite’, ‘inasmuch as’ for ‘since’, ‘falsehood’ for ‘lie’, etc.) and using
occasional archaisms (e.g. ‘he would fain’, and ‘shew’ systematically for
‘show’). Proust’s sentences are often long (the longest in La Prisonnière
being over 400 words, but there are many around a quarter of a page in
length) and always carefully crafted, with intricately balanced and inter-
related clauses which can make it seem as if he is aiming for a logician’s
exposition of a given issue. This is an impression Scott Moncrieff tends to
promote by the explicitation and amplification of connections that Proust
leaves implicit. One aim of the new English cycle is to strip away the
stuffiness of his translation and produce a tighter, lighter, more confidential
and modern version, conveying the first-person narrator’s voice as he muses
ironically, at times amusingly, on his life and times rather as a stream of
2 www.deepl.com/translator.
Retranslating Proust 109
Figure 5.2 F
uzzy match of 87% between SL segment and TM segment, with
tracked changes to indicate differences (deletions struck through,
additions underlined).
The creative and cognitive reasons for working with existing TM mate-
rial, and other sources of ‘raw’ text from an NMT engine, dictionaries
etc. rather than starting from scratch will be discussed in Sections 5.4 and
5.5, below. There are also significant practical advantages to working in
a CAT environment such as DVX (other CAT tools offer similar facilities
and advantages) despite its not being designed for this type of translation
(see also Hansen, 2022, p. 51):
• The whole translation task is on the same screen bilingually all the time,
obviating constant shifting of attention between book and computer,
and attendant risk of oversight (e.g. omitting part or all of a sentence).
• ST segments are presented in text order, which helps the translator keep
track of any issues of reference and coherence between segments. DVX’s
real-time scrollable preview pane also allows the work to be read as a
coherent text rather than a sequence of disconnected segments.
• The ST and target text (TT) are locked permanently in alignment,
making it easy to backtrack and check coherence between the new
segment and previously translated material (particularly helpful when it
comes to revising the draft translation, perhaps weeks or months later).
Retranslating Proust 113
4 https://lilt.com/.
114 Andrew Rothwell
5 https://lilt.com/technology/translate.
6 www.deepl.com/en/translator.
Retranslating Proust
115
Figure 5.3 DeepL interactive proposal in a pop-up window alongside a TM proposal in DVX.
116 Andrew Rothwell
Moncrieff’s translation of the same segment from the TM, allowing the
translator to assess the different solutions, decide which one to work with
and paste it in.
Offering a different and arguably even more flexible method of
translator interaction from Lilt, DeepL proposes a complete translation of
the segment but also allows the translator to click on any TL word to open
a drop-down list of alternatives, both lexical and syntactic, which come
from lower-ranked translation proposals generated by the system but
not initially displayed. If one of the alternatives from the list is preferred
and inserted (in the example in Figure 5.3, ‘endearments…’ is about
to be selected in preference to ‘tenderness’, with the option to add that
translation to the user’s Glossary for future re-use), the rest of the proposal
from that point will be recalculated, as in Lilt. Equally, the translator can
intervene at any point in the TL sentence and manually over-type one
or more word(s) with a preferred solution not from DeepL’s list, again
resulting in a recalculation of the rest of the sentence. In combination, these
two ways of interacting with the MT proposal enable a kind of ‘guided
post-editing’ that is quick and easy to implement, so that quite substantial
adjustments can be made before the TL segment is inserted back into the
CAT environment for further polishing.
5.3.3 Dictionaries, Thesauri
The usefulness of DeepL’s app does not end with its interactive MT
proposals. It is also capable of exposing, in the form of a simplified
bilingual dictionary entry, word-by-word equivalence data drawn from
the vast tagged corpora on which the system is trained, and which can also
be accessed via the partner website of the Linguee bilingual concordancer7
(which pre-dates DeepL). In the example shown in Figure 5.3, selecting
the polysemic SL word ‘tendresse’, challenging to the translator because
its psychological extension in French is wider than that of its English
cognate, which also possesses a physical sense that the French word lacks,
generates an item ‘Dictionary entries for “tendresse”’ at the bottom of the
app window that can be expanded to display the data shown in Figure 5.4.
In addition to basic grammatical information, some examples of
usage and links to pronunciation sound files, the list of possible English
translations for ‘tendresse’, with the bottom row classified as ‘less
common’, constitutes a basic thesaurus which can give the translator a
quick overview of some available TL options within the same interface as
the MT proposal.
7 https://www.linguee.com/.
Retranslating Proust 117
as unwanted artefacts by the NMT process (Raunak et al., 2021; see also
the Introduction and Chapter 1 of this book), this feature can allow the
translator not just to review near-synonyms, but to select a possible solu-
tion and evaluate how well (or not) it works in the TL draft sentence. This
lexical interactivity makes it easy to experiment with different options and
instantly reverse those that turn out to be unsatisfactory: in this respect,
DeepL’s interactive NMT app can play the role of a sandbox (a meta-
phor borrowed from the field of Computer Science), a safe environment
in which to explore different options and combinations before committing
a preferred version to the CAT editor for further work. It is important
to emphasise that such experimentation is not just a matter of linguistic
juggling: it can be an important part of the cognitive activity of learning
the complexities of the ST by judging the implications, both within and
beyond the active segment, of different options—which in the case of a
stylist such as Proust can be very numerous and complex—and narrowing
them down to a manageable number of avenues for further exploration.
Of course, there are times when the lexical resources of a single tool
such as DeepL are not sufficient for the task at hand, and more complete
or specialised input is needed. For instance, the Thesaurus.com website8
offers a wider range of near-synonyms, often classified by tabs into different
superordinate categories. These are colour-coded from red (most relevant),
then dark or light orange, to yellow, for semantic proximity to the search
term, and clickable to allow exploration down a particular semantic
path, opening a list of the synonyms for one of the words in the original
list. The Trésor de la langue française informatisé online dictionary9
also proved indispensable for digging deeper into the precise meaning of
rare SL words (e.g. the shape and material of the ‘toque’ headgear that
Albertine hankers after (Proust, 1988, p. 572)). It was useful to have these
and other resources permanently open in different browser windows,
despite the attendant screen clutter and the time taken to navigate between
them. Sometimes, too, it was necessary to turn away from the computer
for confirmation of a meaning and consult general and specialised paper
dictionaries. For instance, the revised 1934 edition of Harrap’s Standard
French and English Dictionary (Mansion, 1940) was an essential source of
‘period’ vocabulary relating to technology, fashion, social conventions etc.
(e.g. ‘petit bleu’ (Proust, 1988, p. 838) meaning not, as DeepL accurately,
but in context erroneously suggested, ‘little bruise’, but an urgent message
sent across Paris by pneumatic tube); while the Dictionnaire du français
non conventionnel (Cellard & Rey, 1981) documents the Belle Epoque
homosexual slang associated with the character of the Baron de Charlus,
8 www.thesaurus.com/.
9 http://atilf.atilf.fr/.
Retranslating Proust 119
(P) Proust (1988, p. 621) (SM) Scott Moncrieff (DL) DeepL (unedited)
(1929)
Ses silences n’étaient So her silences were Her silences were therefore
donc que des voiles, merely screens, her only veils, her surface
ses tendresses de surface affection tenderness was only
surface ne faisaient merely kept beneath holding back a thousand
donc que retenir au the surface a thousand memories that would
fond mille souvenirs memories which have torn me apart ―
qui m’eussent déchiré would have rent my her life was therefore
— sa vie était donc heart, her life was full of those facts whose
pleine de ces faits full of those incidents mocking account, the
dont le récit moqueur, the derisive account, laughing chronicle,
la rieuse chronique the comic history of constitutes our daily
constituent nos which form our daily chatter about others,
bavardages quotidiens gossip at the expense about the indifferent,
au sujet des autres, des of other people, but which, as long as a
indifférents, mais qui, people who do not being remains misplaced
tant qu’un être reste matter, but which, in our heart, seem to us
fourvoyé dans notre so long as a person to be such a precious
cœur, nous semblent remains lost in the enlightenment of his or
un éclaircissement dark forest of our her life that, in order
si précieux de sa vie heart, seem to us so to get to know this
que pour connaître precious a revelation underlying world, we
ce monde sous-jacent of her life that, for the would gladly give up
nous donnerions privilege of exploring ours. (88 words)
volontiers la nôtre. (78 that subterranean
words) world, we would
gladly sacrifice our
own. (94 words)
120 Andrew Rothwell
are just some of the linguistic and textual cruxes highlighted by the kind
of three-way comparison described here, which allows the retranslator
to identify rapidly and work on the high-level challenges posed by the
ST while treating the verbal details of the two solutions as available raw
materials for further exploration (e.g. via a synonym search in DeepL, or a
thesaurus, or a concordance search of the TMs).
In the event, it was judged that DeepL’s version with some interactive
adjustments would require less modification than Scott Moncrieff’s, so
that was the one pasted into DVX and further edited, with the above
considerations in mind to produce the following draft translation:
This is not yet a definitive effort; it may well be revised in a second and
final draft, but it does try to offer solutions to the specific problems
identified by comparing P, SM, and DL which are compatible with the new
edition’s objectives of verbal economy, fluency, and relative informality,
emphasising the stream-of-consciousness aspect of Proust’s labyrinthine
meditations rather than their complex logical structure.
5.5 Discussion
In this retranslation case study, the main inputs supplied by current
translation technologies (a historical TM, interactive MT), which often
supplied significantly different material from each other, almost never
offered ready-made translation solutions but were found to save time at
the preliminary comprehension stage to expose through comparison many
of the key translation issues in a given segment, and to provide useful raw
materials for the retranslator’s further exploration notably via a synonyms
search (for confirmation from other literary translators, see Daems, 2022,
p. 53). It is mainly in this sense that the combination of ST, TM, and
MT in a standard CAT tool, in partnership with the retranslator’s own
linguistic and cultural expertise, can be said to constitute an ‘extended
cognitive system’ which, in literary translation perhaps even more than
in more commercial types, both augments the translator’s understanding
of the text’s challenges and expands, in a quickly assimilable manner, the
options available for working on them. A particularly apt metaphor for
the translation environment that results is indeed the sandbox: relieved
122 Andrew Rothwell
10 Daems (2022, pp. 58–9) also notes that, although most of her survey respondents ‘did
not want to think about’ how tools might be optimised, several ‘would like to be able to
click a word and get a variety of information from different resources such as (synonym)
dictionaries’.
124 Andrew Rothwell
the information they contain via a quick key combination, in the manner
of TM proposals in a current CAT tool. In this way the translation tool of
the future, rather than adding ever more numerous internal features and
fine-grained routines for extracting information from TMs, would act as
an outward-facing creative portal augmenting the literary translator with
flexible access, within a single interface, to potentially numerous linguistic
and stylometric resources, facilitating the analysis of translation problems
and discovery of creative solutions. In Chapter 14 of this book, Rudan
et al. set out the principles on which an even more ambitious translator-
augmenting CALT tool should be based and present their progress to date
in developing its key components. The opportunity to make an informed
contribution and influence the design of such future tools is one more
reason for literary translators to engage now with the technologies already
available.
Acknowledgements
I am grateful to Atril Solutions and DeepL SE for permission to reproduce
screenshots of their translation tools, Déjà Vu X3 and DeepL Translator
respectively.
References
Balashov, Y. (2020). The translator’s extended mind. Minds and Machines, 30(3),
349–383. https://doi.org/10.1007/s11023-020-09536-5
Berman, A. (2021 [1985]). Translation and the trials of the foreign. In L. Venuti
(Ed. & Trans.), The translation studies reader (4th ed., pp. 247–260). Routledge.
https://doi.org/10.4324/9780429280641
Cellard, J., & Rey, A. (1981). Dictionnaire du français non conventionnel.
Masson.
Daems, J. (2022). Dutch literary translators’ use and perceived usefulness of
technology: The role of awareness and attitude. In J. L. Hadley et al. (Eds.),
Using technologies for creative-text translation (pp. 40–65). Routledge. https://
doi.org/10.4324/9781003094159
Hadley, J. L., Taivalkoski-Shilov, K., Teixeira, C. S. C., & Toral, A. (Eds.). (2022).
Using technologies for creative-text translation. Routledge. https://doi.org/10
.4324/9781003094159
Hansen, D. (2022). The challenges and relevance of computer-assisted literary
translation. In The many faces of translation (pp. 46–62). European Parliament,
Directorate-General for Translation. https://data.europa.eu/doi/10.2861
/653362
Lommel, A. (2018). Augmented translation: A new approach to combining human
and machine capabilities. Proceedings of the 13th Conference of the Association
for Machine Translation in the Americas (Volume 2: User Track) (pp. 5–12).
https://aclanthology.org/W18-1905
Retranslating Proust 125
6.1 Introduction
Literature is one of the areas where machine translation (MT) has not
been used until very recent times. Nevertheless, the impressive increase in
quality obtained with neural machine translation (NMT) systems has led
some authors to start researching the use of MT in literature and other
types of creative texts (see e.g. Toral & Way, 2015a, 2015b, 2018). The
use of source texts and translations of literary texts to create parallel
corpora to train MT systems can lead to copyright issues (see Moorkens
& Lewis, 2019). Many original texts are available in the public domain,
as copyright expires after a given number of years from the death of the
author (between 50 and 100 years, depending on the country’s legislation).1
The same copyright issue applies to translations, making it difficult to find
good translations in the public domain.
In this chapter, a novel method for training NMT systems tailored to
a given author (or a given series of novels, as in our work) is presented.
The method uses the source-language texts of the novels to select parallel
segments from very large parallel corpora. In this way, when training a
tailored system for an author or series in the public domain, no copyrighted
human translations are used.
The goal of the trained MT systems is not the production of translated
versions to be published and read, but the creation of bilingual e-books.
Readers want to read these bilingual e-books in the original language, and
we expect them to access the translated version only to understand difficult
sentences or paragraphs.
DOI: 10.4324/9781003357391-9
Author-Tailored Neural Machine Translation 127
2 https://translate.google.com/.
128 Antoni Oliver
6.4 Experiments
6.4.1 Source Texts: The Sherlock Holmes Series by Conan Doyle
3 https://inlector.wordpress.com/.
Author-Tailored Neural Machine Translation 129
Table 6.1 Size of the corpora available at Opus for the English–Spanish language
pair with a size larger than 5M segments.
4 https://opus.nlpl.eu/.
5 https://commoncrawl.org/.
130 Antoni Oliver
browses the World Wide Web and stores the text of the webpages and
archives. This organization provides this data for free, and a new crawl is
completed approximately every month by randomly exploring URLs. In
this way, terabytes of web pages in various languages are obtained each
month. Each of these crawls is a snapshot of what is available at that
moment in time. Using different snapshots, a set of pre-processing steps are
performed: elimination of repeated paragraphs, automatic determination
of the language in which the web page is written, and filtering of low-qual-
ity content. Once the snapshots are pre-processed, a multilingual sentence
embedding space is learned. In this space, sentences (in their vectorized
forms in the multilingual embedding space, not in their original form con-
taining the actual source-language words) that are semantically similar
occur close to one another, regardless of the language they are written in.
To determine the extent to which two sentences in different languages are
mutual translations or not, we can calculate the distance between their
vector representations; an English sentence like, ‘Long live the King!’ and a
French sentence, ‘Vive le roi!’ should be very close together in the multilin-
gual embedding space, such that without any further knowledge or com-
plex calculation, we can assume them to be translations of one another.6
In this way, a large amount of parallel data for many language pairs is
created. Each parallel corpus provides a score for each segment pair indicating
the confidence of the segments being translation equivalents, which can
be used if required to prune low-scoring pairs. Schwenk et al. (2021) also
demonstrate that NMT systems trained with CCMatrix for several language
pairs performed better than systems trained with other popular corpora.
6.4.3 Corpus Combination
Table 6.2 Works used to create the monolingual English corpus for selection. The
numbers are for unique segments and tokens.
8 A measure used very often is perplexity, a measure that indicates how well a given model
(in our case the language model) predicts or describes a sample (in our case the given
sentence).
132 Antoni Oliver
9 https://github.com/aoliverg/MTUOC-eval.
Author-Tailored Neural Machine Translation 133
et al., 2002); NIST (Doddington, 2002); Word Error Rate (WER) (Nießen,
2000); Edit Distance, based on the Levenshtein distance (Levenshtein,
1965); and Translation Edit Rate (TER) (Snover et al., 2006). See Way
(2018) and Section I.2, Introduction, this volume, for an explanation of
human and automatic evaluation of MT system quality.
For the evaluation in this experiment, we used a set of 1,000 segments
from The Adventures of Sherlock Holmes and a public domain translation
into Spanish. All the trained systems as well as Google Translate and
DeepL were evaluated, meaning that all 1,000 sentences in the test set were
translated by each of the systems, and the outputs were compared against
the human reference translations for those sentences. The overall score is
an average over all 1,000 sentence translations.
In Table 6.3 we can observe the evaluation scores for the English–Spanish
MT system. For BLEU and NIST, higher scores indicate better quality,
whereas for WER and TER lower values are better quality. %EdDist,
indicates the percentage of edit operations (usually insertions, deletions,
and substitutions) needed to convert the raw MT output into the human-
produced reference translation, so again, lower values mean better quality.
The system CCMatrix Sherlock Holmes 10M obtains the best scores for
BLEU and %EdDist, whereas DeepL receives better scores for NIST, WER,
and TER. As a conclusion, we can confirm that the CC Matrix Sherlock
Holmes 10M obtains competitive scores, very close to those from the two
popular commercial systems. It is also worth mentioning that CC Matrix
Sherlock Holmes 10M outperforms the CCMatrix 10M system, indicating
that the corpus selection technique is very productive. Even those systems
trained with the selection of 5M segments achieve better results for all
metrics, and the system with the selection of 1M segments obtains better
values on all metrics except %EditDist.
Given its superior performance, in the remaining experiments in this
chapter, we will use the CCMatrix Sherlock Holmes 10M system, along
with Google Translate and DeepL.
10 Longer than the mean time plus two times the standard deviation.
Author-Tailored Neural Machine Translation 135
All the scripts necessary to create the bilingual e-books using an MT system
are publicly available on GitHub.11 The full process can be divided into the
following steps:
Table 6.4 Figures from the analysis of the post-editing tasks (figures in bold indi-
cate the best value for this measure).
11 https://github.com/aoliverg/InLector.
136 Antoni Oliver
• Obtaining the source text: the main source for our books is Project
Gutenberg.12 As a source file, we can use two different formats: EPUB
or plain text.
• Conversion of the source text to DocBook, a standard XML file format
for books and other documents.
• Creation of the bilingual DocBook: using the MT system, we create a
DocBook file where each source segment is linked to the corresponding
machine-translated target segment.
• Creation of the bilingual e-book in the final format: we create an EPUB
and an HTML file using the provided scripts or other standard tools for
handling DocBook files.
To assess the reading experience and the user’s opinions of the bilingual
e-books, we distributed a questionnaire developed in Google Forms with
12 questions. The questionnaire was distributed via a mailing list for bach-
elor and master students of translation at UOC. This mailing list is also
open to former students and to any person interested in translation. No
personal contact data was collected from the participants. The users were
asked to read one of the adventures in The Adventures of Sherlock Holmes
(whichever they wanted), in the English–Spanish bilingual version, and to
answer the questions after reading the adventure.
We received a total of eight answers to the questionnaire. Here we
present a summary of the results.
Question 1. In which format did you read the adventure? Half of the
users read it in HTML and the other half in EPUB.
Question 2. On which device did you read the adventure? Most of the
users (seven out of eight) used a computer and one used a
tablet.
Question 3. Which adventure did you read? Here we saw a great
disparity:
• Two read V. The Five Orange Pips.
• Two read IV. The Boscombe Valley.
• One read II. The Red-Headed League.
• One read VI. The Man with the Twisted Lip.
• One read VIII. The Adventure of the Speckled Band.
• One read XI. The Adventure of the Beryl Coronet.
12 www.gutenberg.org/.
Author-Tailored Neural Machine Translation 137
6.5 Discussion
In this research, we trained an NMT system tailored for translating literary
works for a given author using only the original works in the public
domain and a freely available very large parallel corpus. A small subset of
the human translation in the public domain of the chosen work was used
13 www.coe.int/en/web/common-european-framework-reference-languages.
138 Antoni Oliver
to evaluate the trained system along with two popular commercial NMT
systems, Google Translate and DeepL. The NMT system was trained
with the goal of producing a bilingual e-book, useful for learners of the
source language to read the novel in the original version and to consult the
translation of difficult sentences.
As expected, most of the metrics used for evaluation are worse for our
trained system. However, is it still a useful system for the production of
bilingual e-books? The evaluation of the reading experience shows that
the trained system produces useful translations, as 62.5 % of the read-
ers thought that the published translations helped them understand dif-
ficult sentences, and 75% thought that the translation was good enough to
understand the content. Most of the readers (87.5 %) would like to read
more books in this bilingual electronic format, and 100% would recom-
mend this kind of e-book to their friends. It is also worth noting that 100%
of the respondents think that these bilingual e-books can promote reading
the original version of novels.
References
Barrachina S., Bender, O., Casacuberta, F., Civera, J., Cubel, E., Khadivi, S.,
Lagarda, A., Ney, H., Tomás, J., Vidal, E., & Vilar, J.M. (2009). Statistical
approaches to computer-assisted translation. Computational Linguistics 35 (1),
3–28. https://doi.org/10.1162/coli.2008.07-055-R2-06-29
Cybenko, G., O’Leary, D.P., & Rissanen, J. (Eds.). (1999). The mathematics of
information coding, extraction and distribution. Springer. https://link.springer
.com/book/10.1007/978-1-4612-1524-0
Doddington, G. (2002). Automatic evaluation of machine translation quality using
n-gram co-occurrence statistics. In Proceedings of the Second Conference on
Human Language Technology (HLT 2002) (pp. 128–132). https://aclanthology
.org/www.mt-archive.info/HLT-2002-Doddington.pdf
Guerberof, A., & Toral, A. (2022). Creativity in translation: Machine translation
as a constraint for literary texts. Translation Spaces, 11, 184–212. https://doi
.org/10.1075/ts.21025.gue
Hadley, J.L. (2020). Literary machine translation: Are the computers coming
for our jobs? Counterpoint, 4, 14–18. https://www.academia.edu/44701250/
Literary_machine_translation_Are_the_computers_coming_for_our_jobs
Junczys-Dowmunt, M., Grundkiewicz, R., Dwojak, T., Hoang, H., Heafield, K.,
Neckermann, T., Seide, F., Germann, U., Fikri Aji, A., Bogoychev, N., Martins,
A.F.T., & Birch, A. (2018). Marian: Fast neural machine translation in C++. In
Proceedings of ACL 2018, System Demonstrations, Melbourne, Australia (pp.
116–121). Association for Computational Linguistics. https://aclanthology.org
/P18-4020/
Kudo, T. & Richarsdon, J. (2018). SentencePiece: A simple and language
independent subword tokenizer and detokenizer for Neural Text Processing.
https://doi.org/10.48550/arXiv.1808.06226
Kuzman, T., Vintar, Š., & Arčan, M. (2019). Neural machine translation of literary
texts from English to Slovene. In Proceedings of Machine Translation Summit
XVII: The Qualities of Literary Machine Translation (pp. 1–9). European
Association for Machine Translation. https://aclanthology.org/W19-7301.pdf
Levenshtein, V.I. (1965). Binary codes capable of correcting deletions, insertions,
and reversals. Soviet Physics Doklady, 10(8), 707–710. https://nymity.ch/
sybilhunting/pdf/Levenshtein1966a.pdf
Matusov, E. (2019). The challenges of using neural machine translation for
literature. In Proceedings of Machine Translation Summit XVII: The Qualities
of Literary Machine Translation, Dublin, Ireland (pp. 10–19). European
Association for Machine Translation. https://aclanthology.org/W19-7302.pdf
Moorkens, J., & Lewis, D. (2019). Research questions and a proposal for the
future governance of translation data. JosTrans, 32, 2–25. https://www.jostrans
.org/issue32/art_moorkens.pdf
Nießen, S., Och, F., Leusch, G., & Ney, H. (2000). An evaluation tool for machine
translation: Fast evaluation for MT research. In Proceedings of the Second
International Conference on Language Resources and Evaluation (LREC’00)
(7p.). European Language Resources Association (ELRA). https://aclanthology
.org/L00-1210/
140 Antoni Oliver
Oliver A., Toral, A., & Guerberof, A. (2019). InLéctor neural machine translation
for the creation of bilingual ebooks. In Proceedings of Machine Translation
Summit XVII: The Qualities of Literary Machine Translation, Dublin, Ireland
(p. vii). European Association for Machine Translation.
Oliver, A. (2020). MTUOC: Easy and free integration of NMT systems in
professional translation environments. In Proceedings of the 22nd Annual
Conference of the European Association for Machine Translation (pp. 467–
468). https://aclanthology.org/2020.eamt-1.55
Oliver, A., Alvarez, S., & Badia, T. (2020). PosEdiOn: Post-editing assessment
in PythOn. In Proceedings of the 22nd Annual Conference of the European
Association for Machine Translation (pp. 403–410). https://aclanthology.org
/2020.eamt-1.43
Oliver, A., Toral, A., & Guerberof-Arenas, A. (2022). Bilingual e-books via
neural machine translation and their reception. In J.L. Hadley, K. Taivalkoski-
Shilov, C.S.C. Teixeira, & A. Toral (Eds.), Using technologies for creative-text
translation (pp. 89–115). Routledge. https://doi.org/10.4324/9781003094159
Papineni, K., Roukos, S., Ward, T., & Zhu, W-J. (2002). BLEU: A method for
automatic evaluation of machine translation. In ACL–2002: 40th Annual
meeting of the Association for Computational Linguistics, Philadelphia, PA (pp.
311–318). Association for Computational Linguistics. https://aclanthology.org
/P02-1040/
Schwenk, H., Wenzek, G., Edunov, S., Grave, E., & Joulin, A. (2021). CCMatrix:
Mining billions of high-quality parallel sentences on the web. In Proceedings
of the 59th Annual Meeting of the Association for Computational Linguistics
and the 11th International Joint Conference on Natural Language Processing
(Volume 1: Long Papers) (pp. 6490–6500). Association for Computational
Linguistics. https://aclanthology.org/2021.acl-long.507
Snover, M., Dorr, B., Schwartz, R., Micciulla, L., & Makhoul, J. (2006). A study
of translation edit rate with targeted human annotation. In Proceedings of the
7th Conference of the Association for Machine Translation in the Americas:
Technical Papers (pp. 223–231). Association for Machine Translation in the
Americas. https://aclanthology.org/2006.amta-papers.25/
Toral, A., & Way, A. (2015a). Translating literary text between related languages
using SMT. In Proceedings of the Fourth Workshop on Computational
Linguistics for Literature, NAACL, Denver, CO (pp. 123–32). https://
aclanthology.org/W15-0714.pdf
Toral, A., & Way, A. (2015b). Machine-assisted translation of literary text: A case
study. Translation Spaces, 4(2), 240–67. https://doi.org/10.1075/ts.4.2.04tor
Toral, A., & Way, A. (2018). What level of quality can neural machine translation
attain on literary text? In J. Moorkens, S. Castilho, F. Gaspari, & S. Doherty
(Eds.), Translation quality assessment: From principles to practice (pp. 263–
87). Springer. https://arxiv.org/abs/1801.04962
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L.L., Gomez, A.N.,
Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. In Advances
in Neural Information Processing Systems 30: Annual Conference on Neural
Author-Tailored Neural Machine Translation 141
Shuyin Zhang
7.1 Introduction
It is often said that translating literary texts is ‘the last bastion of human
translation’ (Toral & Way, 2014, p. 174) in the competition between
human and machine modes of translation. This is evident in the following
statement:
DOI: 10.4324/9781003357391-10
Machine Translation of Xianxia Novels 143
1 www.qidian.com/.
2 https://chuangshi.qq.com/.
3 www.zongheng.com/.
4 www.jjwxc.net/.
5 This project was initiated in 1981 by Chinese Literature Press, presenting the literary face
of China to the world.
6 This term refers to ‘interconnectedness of modern media of communication’ (Jensen, 2016,
p. 1).
144 Shuyin Zhang
7 www.wuxiaworld.com/page/general-glossary-of-terms.
8 Webnovel (www.webnovel.com) is owned by the Tencent-backed digital content company
China Literature Limited.
9 www.wuxiaworld.com
10 lnmtl.com
Machine Translation of Xianxia Novels 145
Phoenix. The translation model for these early platforms is fan translation
by human translators, a pull model where ‘end-users increasingly demand
translation or do it themselves’ (Drugan, 2013, p. 159). It is worth noting
that both websites were acquired by larger companies that own the rights
to the works in the last two years.
Commercial platforms with the copyright to the novels to produce offi-
cial translations adopt a push model where established clients and produc-
ers decide what to translate and deliver to consumers (Drugan, 2013, p.
159), offering professional translation by human translators. Webnovel,
the international version of Qidian, which translates Chinese novels from
Qidian, is a prime example. Founded by China Literature Limited in 2017,
Webnovel, with more than 375,000 original works, is one of the largest
internet literature websites. Unlike its predecessors, Webnovel offers not
only Xianxia but many other genres and content in six languages. Today,
the number of profit-oriented platforms is increasing, and their competi-
tors—the fan-based online translation communities—have undergone
a transformation and become more professional, and some have estab-
lished a collaboration or merged with official websites. However, a gap
still exists between the large-scale overseas propagation of Chinese net
literature and the current capacity for HT. The speed of HT varies widely
and is generally too slow to cater to the readers’ curiosity concerning
updates. Besides, some novels have not yet been selected for translation
by human translators and commercial websites, all of which has fueled
the need for MT.
Many fans have thus started to produce MT output themselves, mainly
using freely available platforms, which is also a case of a pull model. This
model that relies on MT (usually assisted by human pre-and post-editing)
rather than HT has two subtypes, a collective one involving more than one
translator, and an individual one. In terms of the collective model, LNMTL
is one of the earliest Xianxia literature translation platforms to adopt this
translation method. Compared with Wuxiaworld, LNMTL is smaller, less
organised and ‘wilder’, offering only machine-translated Xianxia novels,
but it is popular despite its dubious legal status. Founded in 2016 by an
engineer with no knowledge of Chinese, LNMTL’s purpose is to ‘make it
easy to read machine translation of Chinese web literature’.11 According to
the discussion on the forum on Novel Updates,12 it is believed that LNMTL
uses Systran to translate the novels although no explicit statement can be
found on LNMTL’s website.13 When reading content on LNMTL, readers
11 https://lnmtl.com/about.
12 Novel Updates (https://forum.novelupdates.com) is a directory of Asian translated novels
and a platform for users who enjoy reading web novels.
13 https://forum.novelupdates.com/threads/how-does-lnmtl-translate-their-chapters.19240/.
148 Shuyin Zhang
Figure 7.1 S creenshot of LNMTL’s translation of Volume 45, Book 17, Desolate
Era (tonally inverted). https://lnmtl.com/chapter/desolate-era-book-45
-chapter-17.
can spot terms highlighted, and by clicking on them, the original texts
in Chinese will appear which allows the reader to look up the terms in
the dictionary more efficiently, thus smoothing the reading experience, as
shown in Figure 7.1.
In addition, readers are involved in the translation process by freely
communicating with one another under the translation posts and are
encouraged to polish the MT output. LNMTL’s post-editing process is
open to all its readers. Specifically, they can add glossary terms, modify
phrases, adjust sentence structures and rewrite sentences by creating new
propositions. Once the number of changes reaches 200, retranslation will
be triggered. LNMTL also urges its readers to ‘vote on which series to pick
up’ and ‘message about the blunders/missing chapters’,14 giving its readers
some of the same rights as editors and translators.
LNMTL’s model has been widely acknowledged by its readers judg-
ing by the feedback left on the website. There have, however, been com-
plaints about the translation quality. LNMTL has defended itself by
14 https://lnmtl.com/about.
Machine Translation of Xianxia Novels
149
Figure 7.2 Screenshot of the introductory page of Desolate Era (tonally inverted). https://lnmtl.com/novel/desolate-era.
150 Shuyin Zhang
15 https://lnmtl.com/faq.
16 https://lnmtl.com/novel/desolate-era.
17 https://forum.novelupdates.com/threads/so-how-do-you-translate.66056/.
18 https://forum.novelupdates.com/threads/best-mtl.120521/#post-6157669.
Machine Translation of Xianxia Novels 151
19 https://vipreader.qidian.com/chapter/2502372/82320305/.
20 https://lnmtl.com/chapter/desolate-era-book-45-chapter-17.
21 www.wuxiaworld.com/novel/desolate-era/de-book-45-chapter-17.
22 https://forum.novelupdates.com/threads/best-mtl.120521/#post-6157669.
23 https://forum.novelupdates.com/threads/opinions-on-mtlers.117455/
152 Shuyin Zhang
24 This refers to the process of fans translating and subtitling multimedia materials.
Machine Translation of Xianxia Novels 153
25 Chinese internet literature adopts a different length system than print literature. An online
novel usually contains more than 3 million Chinese characters (Pei, 2022). However, print
literature, for example, the Mao Dun Literature Prize, the most prestigious literature prize
in China would categorise a piece above 130,000 Chinese characters as a novel.
154 Shuyin Zhang
the translations by post-editing them. For example, hyperlinks are one way
of sharing additional information to inform readers of translation chal-
lenges and explain any cultural implications inherent in the material. These
hiccups seem not to bother the readers of Xianxia literature much accord-
ing to their feedback on Novel Updates. Some giant Chinese internet com-
panies have noticed the merits of MT and have consequently started to
invest in researching and developing AI translation for net literature. A
prime example is funstory.ai,26 a Chinese company that offers AI trans-
lation services for internet literature platforms and currently has a very
significant daily update volume, with over 2,000 works updated simulta-
neously, covering more than 180 countries and regions (Chai, 2019).
In contrast, the intrinsic characteristics of Xianxia novels that make the
genre popular, discussed in Section 7.1—its unique terminology, straightfor-
ward language and relatively fixed plot pattern with its feel-good storyline—
also support the feasibility of MT. Notably, the genre uses a large amount
of terminology that consists of culture-specific or genre-specific Chinese
elements. Writers sometimes also devise terms to create a new world, but
the cultivation systems referred to in the novels largely stem from Daoism,
which means there will always be some regular or predictable terms, neces-
sitating the use of a terminology database within one novel and across dif-
ferent novels. Examples include 因果 (karma), 轮回 (samsara), 五行 (five
elements) and 阴阳 (yin and yang) to name a few. Meanwhile, the readership
is not expecting serious literature. ‘Research suggests that most foreign read-
ership of Chinese internet literature constitutes unemployed males (students)
under the age of 30, whose reading purpose is more for entertainment and
pleasure than for literary appreciation’ (iResearch, 2017, pp. 25–8). The
genre gives the readers an utterly unconventional experience and demands
no hard work on the part of a translator, so that there is room for machines,
even though the extraction of terminology and the maintenance of transla-
tion memories requires extra effort, and the formatting, linguistic and cul-
tural issues still need human intervention.
While the translation market for Xianxia literature is rapidly expand-
ing, many translation platforms are still struggling to recruit professional,
experienced literary translators. Unlike translating formal literature, trans-
lating internet literature does not give translators the prestige required to
offset their low income, which makes recruitment even harder. In the mean-
time, MT is becoming progressively more prevalent in different industries
and everyday life through technological advancements and internationali-
sation, and the ready availability of MT engines gives readers a chance to
26 https://funstory.ai/.
Machine Translation of Xianxia Novels 155
switch their roles from readers to translators and editors, thus establishing
a more intimate relationship with a text. This trend is also applicable to
certain literary genres, undesirable as it might seem to many professional
translators and literary critics, who, in the past, have castigated MT as
unsuitable for literary translation.
7.5 Conclusions
The gap between the fast-growing overseas market for Xianxia novels and
their ensuing translation activity and the time lag for MT applied in liter-
ary translation highlights the necessity for research in this area. The present
chapter explores the potential and practicalities of literary MT by reviewing
Chinese internet literature and the characteristics of the Xianxia genre and
analysing the translation models used by leading platforms. Xianxia literature,
which epitomises commercialised Chinese net literature, with its specific termi-
nology, reduced literary quality and set patterns, complies with the pervasive
use of interactive MT in the digital age and international dissemination of
Chinese culture. The MTPE model, which cuts costs, saves time and main-
tains quality to a large extent, can contribute to fan-based and commercial
websites. The future of literary MT is promising as it is establishing a more
intimate relationship between readers, translators and texts, and creating a
channel for better communication between different cultures while at the same
time enhancing cooperation between humans and machines.
However, MT still has a long way to go in terms of literary translation,
even with pre- and post-editors handling cultural and linguistic issues.
There are still insurmountable gaps for machines to overcome, at least for
now. Technical constraints are primary concerns in this regard, but some
ethical issues also need to be addressed to promote literary MT. The most
urgent questions are: will MT shake the status quo enjoyed by profes-
sional literary translators? Will machine-translated literature be regarded
as inferior to human-translated literature—a ‘secondary’ work’s ‘second-
ary’ work? If so, is it offensive to the original author if someone machine-
translates their work? Does the translation industry still need human
translators for particular net literature genres or will human editors suf-
fice? Apart from answering these questions, empirical research involving a
comparison of different MT outputs of an internet novel also needs to be
conducted in support of literary MT.
References
Chai, R. (2019). Ta yong AI fanyi wangluo xiaoshuo shouge 30 yi yangfensi
他用AI翻译网络小说收割30亿洋粉丝 [AI translated web novels attract 3 billion
foreign fans]. https://funstory.ai/2019/09/24/
156 Shuyin Zhang
8.1 Introduction
Computers have made remarkable strides in processing natural language in
recent years. Machine translation (MT) applications are advancing rapidly,
and the newest language models enable computers to generate coherent
and meaningful text autonomously. In this article, we will delve into the
developments underlying these advances, and explore how the algorithms
can be augmented for automatic poetry generation and translation.
About 70 years ago, the first algorithms for automatic natural language
processing (NLP) appeared as part of the emerging field of artificial
intelligence (AI). Over time, several different algorithms have emerged,
which can be broadly classified into three paradigms: symbolic, statistical,
and neural networks.
The symbolic paradigm, which is the oldest, uses handwritten rules to
model language. Generative grammar, developed by Chomsky (1957), is
an example of a rule-based system for NLP. Rule-based systems have also
been used in MT to transform sentences from the source language to the
target language. However, designing these rules can be a tedious process,
and their coverage is limited.
The statistical paradigm (Koehn, 2009), which emerged in the late
1980s, aims to equip computers with self-learning capabilities for NLP.
The goal of statistical models is to learn a model of language from large
text collections and perform tasks such as MT automatically. In contrast
to the symbolic paradigm, which relies on manually designed rules, the
statistical approach allows for a more automatic approach to NLP.
The most recent and successful models for automatic language processing
use neural networks (Goldberg, 2017). Although neural networks were
first proposed in the 1950s, their use in NLP has only become widespread
in the last decade. Neural networks are a type of machine learning, where
the model learns from training data to make predictions for new data.
However, unlike the statistical model, neural networks possess the ability
to build complex representations based on the input data, leading to
DOI: 10.4324/9781003357391-11
Up and About, or Betwixt and Between? 159
8.2.2 Word Embeddings
Word embeddings are one of the most important developments for neural
NLP. A monolingual word embedding is a vector of values that captures
a word’s similarity with the other words in a language, but multilingual
language-agnostic word embeddings are often used too. In both monolingual
and multilingual word embeddings, words with similar meanings have
values close to each other, while words with different meanings have values
further apart. Word embeddings offer an improvement over the statistical
paradigm, which represents words as atomic, indivisible features without
any underlying meaning structure.
One of the most attractive features of word embeddings is that they
are automatically derived from the data. Each word is assigned a set of
parameters, and the neural network is tasked with learning the appropriate
values for these parameters. A language model, which predicts the next word
in a text based on the previous words, is commonly used to learn embeddings.
Figure 8.2 shows a schematic representation of a network architecture for
inducing monolingual word embeddings. Matrix V contains the embeddings
for the entire vocabulary. The embeddings of the three preceding words are
selected as input to the network. Based on these inputs, the network induces
a hidden representation, and that representation is used to predict the
next word. The parameters of the network (including the embeddings) are
initialized with random weights, and the network automatically induces the
values that yield the best possible prediction. If the network assigns similar
embeddings to words such as ‘drink’ and ‘sip’, ‘a’ and ‘the’, and ‘white’ and
‘red’, it will be easier to make the correct prediction.
Up and About, or Betwixt and Between? 161
The network model described earlier performs well for the induction of
word embeddings. However, it can also be used as a language generation
model, where the goal is to predict the next word in a sentence based
on the preceding three words. The disadvantage of this approach is that
the network’s structure is fixed, requiring a fixed context of three words.
It would be better if the entire sentence context could be considered for
predicting the next word. This is often achieved using a recurrent neural
162 Tim Van de Cruys
network (RNN), which allows for the entire sentence context to be captured
(Mikolov et al., 2010). A graphical representation of such a network can
be found in Figure 8.3.
In an RNN, a sentence is read word by word, and the hidden
representation of the network calculated in the previous step is reused in
the next step. To illustrate how the generation process works, we will walk
through several calculation steps. Suppose we start a sentence with the word
‘I’. In this case, we take the embedding for the word ‘I’ (represented as x)
and input it into the network, along with an initial context representation
a0 indicating that we are at the beginning of a sentence. Based on these
two representations, a hidden representation (a1) is calculated, indicating
that we have seen the word ‘I’ at the beginning of the sentence. This
hidden representation is then used to predict a probability distribution
that reflects the likelihood of each word in our vocabulary. We select a
word with a high probability (such as ‘see’) as the next word. We then
repeat the same calculation with the previously selected word as input. We
take the embedding for the word ‘see’ and input it into the network, along
with the context representation we built in the previous step (meaning
that representation a1 replaces a0). Based on this input, we calculate a
new hidden representation that now includes the sentence context ‘I see’.
This hidden representation leads to a new probability distribution, based
on which we can choose the next word. In this way, we build a sentence
word by word, where the hidden representation of the network (through
the recurrent connection) always represents the full-sentence context,
allowing us to consider the entire context seen so far for predicting the
next word.
An RNN can be used not only to generate language but also to build
a representation for an existing sentence. To do this, we read the existing
sentence word by word into the network, and the recurrent connection
gradually builds the representation of the sentence. We can then use this
final representation for a prediction regarding a specific language-pro-
cessing application (such as sentiment analysis), or to encode a source
sentence into a representation that is used to predict a target translation
(Section 8.2.5).
8.2.4 Transformers
Until around 2015, the statistical method provided the best MT quality.
However, the emergence of the neural paradigm quickly changed this (see
Section I.2, Introduction, this volume, for a more historical perspective).
A neural MT (NMT) system considers the translation process as a
large mathematical function, modelled using neural networks. The
NMT system consists of two networks: an encoder and a decoder (see
Section I.2, Introduction, this volume, for more explanation of how
NMT systems are built). The encoder builds a representation of the
sentence in the source language, which is then passed on to the decoder.
The decoder then decodes that representation word by word into the
target language. RNNs can be used as an underlying architecture for
the encoder and decoder, in which case two RNNs are linked together.
164 Tim Van de Cruys
The first network, the encoder, sequentially processes the words of the
sentence in the source language, building up a hidden representation.
This representation is then passed as the initial hidden representation
to the second network, the decoder. Based on that representation, the
decoder predicts word by word the translation of the sentence in the
target language. The model is trained on a parallel text corpus. The
sentence in the source language serves as input to the encoder, and based
on the resulting representation, the decoder is tasked with reproducing
the sentence in the target language as correctly as possible by adjusting
the weights of the entire network. Instead of RNNs, Transformers can
also be embedded in an encoder-decoder architecture, but using a simpler
feed-forward network as in Figure 8.1. This is currently the method that
produces the best possible MT quality, although LLMs like GPT-3 are
likely to provide strong competition to these bespoke NMT systems in
the near future.
1 https://en.wiktionary.org/wiki/Wiktionary:Main_Page.
166 Tim Van de Cruys
The encoder encodes the current verse, and the final representation
is given to the decoder, which predicts the next verse word by word in
reverse. The rhyme prior is applied to the first-time step in the decoding
process, and the topic prior is optionally applied to all time steps. This
prevents the generation of an ill-chosen rhyme word that does not fit well
with the rest of the verse.
The model is used to generate a large number of candidate verses.
By using an optimization procedure, the best sample can be chosen
according to specific constraints. Each candidate is scored based on
various criteria, including language model score, compliance with rhyme
and topical constraints, and an optimal number of syllables. These scores
are normalized, and the harmonic mean of all scores is taken as the final
composite score for each candidate. The candidate with the highest score is
then chosen and added to the poem. This process ensures that the generated
verse meets the specific constraints and results in a high-quality poem.
An example of a poem generated by the system is given below.
There will come soft rains and the smell of the ground,
And swallows calling with their shimmering sound;
And frogs in the pools singing at night,
And wild plum-trees in tremulous white;
Robins will wear their feathery fire
Whistling their whims on a low fence-wire;
And not one will know of the war, not one
Will care at last when it is done.
Not one would mind, neither bird nor tree
If mankind perished utterly;
And Spring herself, when she woke at dawn,
Would scarcely know that we were gone.
2 https://marian-nmt.github.io/.
168 Tim Van de Cruys
both the semantics and form of the source poem (see also Chapter 13.8,
this volume). Some preliminary experiments seem to indicate that this is not
the case. When prompted with the task of providing a literary translation
of the poem by Teasdale that preserves the rhyme of the original poem, the
ChatGPT model yields the following translation (prompted by the author
on 25th February 2023):
that fit the imposed poetic form. These constrained neural architectures
show promising results, successfully producing poetry, as well as
successfully translating existing poetry.
Despite these advances, fully automatic literary translation has not yet
arrived. While NMT systems can produce translations that are generally
accurate, they still struggle with more complex aspects of literary language,
such as idiomatic expressions and cultural nuances. However, constrained
neural architectures offer an attractive source of inspiration that can
provide authors and translators with ideas, giving rise to a multi-faceted
co-creative setup.
Moving forward, further exploration of these constrained models
may lead to even more significant breakthroughs for creative language
generation and translation. By incorporating additional constraints, such
as those derived from linguistic theory or cultural context, we can improve
the quality and creativity of language models and enhance their ability
to produce more complex and creative language. Ultimately, this could
lead to more accurate and nuanced translations, as well as new forms of
creative language generation that push the boundaries of what is possible
with current architectures.
References
Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., Neelakantan,
A., Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger,
G., Henighan, T., Child, R., Ramesh, A., Ziegler, D.M., Wu, J., Winter, C.,
Hesse, C., Chen, M., Sigler, E., Litwin, M., Gray, S., Chess, B., Clark, J., Berner,
C., McCandlish, S., Radford, A., Sutskever, I., Amodei, D. (2020). Language
models are few-shot learners. In Proceedings of 34th Conference on Neural
Information Processing Systems (NeurIPS 2020), Vancouver, Canada (pp.
1877–1901). https://dl.acm.org/doi/abs/10.5555/3495724.3495883
Chomsky, N. (1957). Syntactic structures. Mouton.
Ghazvininejad, M., Shi, X., Choi, Y., & Knight, K. (2016). Generating topical
poetry. In Proceedings of the 2016 Conference on Empirical Methods in
Natural Language Processing, Austin, Texas (pp. 1183–91). Association for
Computational Linguistics. https://aclanthology.org/D16-1126/
Goldberg, Y. (2017). Neural network methods for natural language processing.
Springer. https://link.springer.com/book/10.1007/978-3-031-02165-7
Junczys-Dowmunt, M., Grundkiewicz, R., Dwojak, T., Hoang, H., Heafield, K.,
Neckermann, T., Seide, F., Germann, U., Fikri Aji, A., Bogoychev, N., Martins,
A.F.T., & Birch, A. (2018). Marian: Fast neural machine translation in C++. In
Proceedings of ACL 2018, System Demonstrations, Melbourne, Australia (pp.
116–21). Association for Computational Linguistics. https://aclanthology.org/
P18-4020/
Koehn, P. (2009). Statistical machine translation. Cambridge University Press.
https://doi.org/10.1017/CBO9780511815829
172 Tim Van de Cruys
Lau, J.H., Cohn, T., Baldwin, T., Brooke, J., & Hammond, A. (2018). Deep-speare:
A joint neural model of poetic language, meter and rhyme. In Proceedings of
the 56th Annual Meeting of the Association for Computational Linguistics
(Volume 1: Long Papers), Melbourne, Australia (pp. 1948–58). Association for
Computational Linguistics. https://aclanthology.org/P18-1181/
Lee, D.D., & Seung, H.S. (2000). Algorithms for non-negative matrix factorization.
In Proceedings of 13th Conference on Neural Information Processing Systems
(NeurIPS 2000), Denver, CO (pp. 556–62). https://proceedings.neurips.cc/
paper_files/paper/2000/file/f9d1152547c0bde01830b7e8bd60024c-Paper.pdf
Lison, P., Tiedemann, J., & Kouylekov, M. (2018). OpenSubtitles2018: Statistical
rescoring of sentence alignments in large, noisy parallel corpora. In Proceedings
of the 11th International Conference on Language Resources and Evaluation
(LREC 2018), Miyazaki, Japan (pp. 1742–8). European Language Resources
Association (ELRA). https://aclanthology.org/L18-1275/
Mikolov, T., Karafiát, M., Burget, L., Černocký, J., & Khudanpur, S. (2010).
Recurrent neural network based language model. In Proceedings of Interspeech
2010, 11th Annual Conference of the International Speech Communication
Association, Makuhari, Chiba, Japan (pp. 2045–8). https://www.researchgate
.net / publication / 221489926 _ Recurrent _ neural _ network _ based _ language
_model
Mikolov, T., Yih, W.-T., & Zweig, G. (2013). Linguistic regularities in continuous
space word representations. In Proceedings of the 2013 Conference of the
North American Chapter of the Association for Computational Linguistics:
Human Language Technologies, Atlanta, Georgia (pp. 746–51). Association
for Computational Linguistics. https://aclanthology.org/N13-1090/
Murphy, B., Talukdar, P., & Mitchell, T. (2012). Learning effective and interpretable
semantic models using non-negative sparse embedding. In Proceedings of
COLING 2012, Mumbai, India (pp. 1933–50). https://aclanthology.org/C12
-1118.pdf
Van de Cruys, T., Poibeau, T., & Korhonen, A. (2011). Latent vector weighting for
word meaning in context. In Proceedings of the 2011 Conference on Empirical
Methods in Natural Language Processing, Edinburgh, Scotland (pp. 1012–22).
Association for Computational Linguistics. https://aclanthology.org/D11-1094/
Van de Cruys, T. (2020). Automatic poetry generation from prosaic text. In
Proceedings of the 58th Annual Meeting of the Association for Computational
Linguistics, Online (pp. 2471–80). Association for Computational Linguistics.
https://aclanthology.org/2020.acl-main.223/
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L.L., Gomez, A.N.,
Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. In Advances
in Neural Information Processing Systems 30: Annual Conference on Neural
Information Processing Systems 2017, Long Beach, CA (pp. 5998–6008).
https://doi.org/10.48550/arXiv.1706.03762
9 Metaphor in Literary Machine
Translation
Style, Creativity and Literariness
Aletta G. Dorst
9.1 Introduction
Metaphor and translation are no strangers to each other. In fact, they share
not only an etymology (metaphor originating from Greek meta [change,
e.g. of place] and pherein [to carry] and translate from Latin trans [across]
and ferre [to carry; past participle latus]) but also their main concepts and
terminology (e.g. source, target, substitution and transfer). And although
there is a relatively steady stream of studies on metaphor in translation to
be found in the academic literature (see Schäffner, 2017 and Shuttleworth,
2017 for useful overviews), these studies have tended to focus either on
offering theoretical deliberations on the translatability of metaphor (e.g.
Nida, 1964; Dagut, 1976) or on providing models for metaphor translation
in terms of translation procedures proposed for different types of metaphor
(e.g. Van den Broeck, 1981; Newmark, 1988).
Despite the fact that as academic disciplines both Metaphor Studies
and Translation Studies have undergone the same developmental phases
from an initial focus on language in ‘traditional approaches’, followed by a
cognitive and cultural turn in the 1980s, many researchers from Translation
Studies still approach metaphor as a predominantly linguistic phenomenon
(cf. Burmakova & Marugina, 2014; Dagnev & Chervenkova, 2020),
focusing heavily on unique and creative instances of metaphor. Likewise,
many researchers from Metaphor Studies still treat translation as a rather
simplistic process of linguistic substitution, disregarding extratextual
factors that influence the translation process and the decisions translators
make.
Metaphor translation can be approached from a linguistic or conceptual
perspective, as illustrated by the models developed by Newmark (1988) and
Schäffner (2004). Many studies focus on the type of linguistic metaphor
involved and whether the languages involved provide equivalent metaphors
(e.g. Newmark, 1988). More cognitively oriented studies consider whether
the cultures involved employ similar or different conceptual metaphors
to understand a particular phenomenon (e.g. Schäffner, 2004) or on
DOI: 10.4324/9781003357391-12
174 Aletta G. Dorst
rather than identity’ (p. 155). In such cases, translators should opt for any
of the seven procedures proposed by Newmark except the first (retention of
the SL image). Park emphasises that in order to ‘make a proper metaphor
translation, translators should recognize that metaphor translation needs
relevant knowledge and substantial background study’ (p. 170).
Dagnev and Chervenkova (2020) identified the conceptual metaphors
underlying metaphorical linguistic expressions in five canonical works—
Joseph Conrad’s Heart of Darkness, Virginia Woolf’s Mrs. Dalloway,
George Orwell’s 1984, Richard Russo’s Empire Falls and James Joyce’s
The Dead—and how they were translated into Bulgarian to determine
whether metaphors are lost or transformed. Distinguishing between
‘sleeping metaphors (lexicalised—non-transparent)’, ‘linguistically
expressed conceptual metaphors (entrenched—conventional and activated
in the text)’ and ‘creative metaphors (novel)’ (p. 103), and focusing on
whether they occur in the same or a different mapping condition, they
found that lexicalised metaphors were more often transformed than
sleeping or creative ones, most likely due to their culture-specific nature
and their linguistic entrenchment. Creative metaphors translated most
readily, while foregrounded metaphors, grammatical metaphors and
culturally bound ones turned out to be the ones most difficult to translate
(p. 110). Contrary to their expectations, they did not find many instances
of metaphor explication, paraphrase or loss, and the target texts were not
inferior to the source texts in terms of their metaphor depth and variety,
suggesting that ‘Toury’s law of growing standardization (1995) is not
applicable to the translation of highbrow literature by well-established and
experienced literary translators’ (ibid., p. 110).
This last claim is interesting in light of the common distinction between
experienced and novice translators. Ali (2006) looked specifically at
students training to become literary translators. Starting from Newmark’s
list of procedures, he warns against ‘the dangers involved in the application
of the set of translation procedures proposed by some translation theorists
as “alternative solutions” in cases where a given SL metaphor does not lend
itself to being retained in a TL’ (p. 134). Ali shows that Newmark’s model
does not make clear or explicit how translators should choose between the
different procedures, ‘thus turning metaphor translation into a random
process’ (ibid.). He argues that translators need a solid understanding
of the complex nature of metaphor and the many different forms and
functions it may have in authentic discourse. They also need to understand
the relevance of the metaphor in both the source and target culture, as
well as their own personal views on the purpose of their translation, i.e.
‘whether the aim is to produce a semantic translation (loyal to the SL
culture) or a communicative translation (primarily oriented towards the
TL reader)’ (ibid).
Metaphor in Literary Machine Translation 177
This brings us to the focus of the current chapter, as it suggests that for
the translation of metaphor in literature to be ‘correct’ or ‘appropriate’,
an MT system would need to adhere to an intricate set of rules and
guidelines that capture both the complex nature of metaphor as well as
the complex nature of translation. A small but growing number of studies
(e.g. Genzel et al., 2010; Green et al., 2010; Voigt & Jurafsky, 2012; Jones
& Irvine, 2013; Besacier, 2014; Toral & Way, 2015a, 2015b, 2018) has
investigated the usefulness of statistical and neural machine translation
(NMT) systems for literary translation (see Introduction of this volume
for a more detailed discussion) and obtained promising results. However,
no studies have thus far focused specifically on metaphor, so we do not at
present know whether the perceived quality of literary MT or the errors
observed are in any way related to the presence of particular types or uses
of metaphor. Nevertheless, a recent study by Guerberof-Arenas and Toral
(2022) suggests that metaphor is one important unit of ‘creative potential’
in literary texts, and as such, it is expected that metaphors may pose a
problem in translation that requires a creative solution, while MT tends
to be direct, which in this case means that it will retain the metaphor by
default.
‘clustering’), and the fact that they are all extended and creative is particularly
noteworthy seeing as the number of similes is generally low even in fiction
(Steen et al., 2010; Dorst, 2011). The four similes in the relevant passage
(1996, p. 20) are also semantically related via ‘bodies’—‘heads’—‘voices’:
(1) […] all the different kinds of mischief that our bodies, like unruly
children, can get into.
(2) We would nod our heads as punctuation to each other’s voices, sign-
aling that yes, we know all about it.
(3) [;] gently we would complain, our voices soft and minor key and
mournful as pigeons in the eaves troughs.
(4) I hear where you're coming from, as if the voice itself were a traveler,
arriving from a distant place.
Not surprisingly, Google Translate retains all of these metaphors and pro-
duces a direct translation that follows the sentence structure and word
order very closely:
[1] alle verschillende soorten onheil waar ons lichaam, net als onhan-
delbare kinderen, in terecht kan komen. [all different sorts of evil/
mischief/calamity where our body, just like unruly/unmanageable
children, can end up]
[2] We knikten met onze hoofden als interpunctie voor elkaars stemmen,
om aan te geven dat ja, we er alles van weten. [We nodded with our
heads like punctuation to each other’s voices, to indicate that yes, we
know all about it]
[3] zachtjes klaagden we, onze stemmen zacht en mineur en treurig als
duiven in de dakgoottroggen. [softly complained we, our voices soft
and minor and sad like pigeons in the gutter troughs]
[4] ik hoor waar je vandaan komt, alsof de stem zelf een reiziger is, die
van een verre plaats komt. [I hear where you from come, as if the
voice itself a traveller is, who from a far place comes]
What is perhaps surprising is that this translation is error free and reads
naturally. Despite Atwood’s liking for long and interrupted sentences with
lots of commas, the Dutch sentences are idiomatic and correct and read as
natural conversation, even in places where you would expect the machine
to run into difficulty. The simple conclusion, therefore, is that such original
metaphors are not problematic for the machine since they normally require
metaphor retention, and their linguistic expression as a simile is well-suited
to direct translation.
When looking at the human translations, the students also behaved as
would be expected based on previous research into metaphor translation
Metaphor in Literary Machine Translation 179
(see Section 9.2) and the procedures proposed by Newmark (1988): for all
of these similes, the students opted for metaphor retention, just like Google
Translate. The official translation, however, shows some clear deviations
from this expectation. Though similes 3 and 4 have been retained, in
simile 1 ‘mischief’ has been substituted by an alternative metaphor ‘last’
[burden] (2020, p. 25). This alternate metaphor not only loses the specific
connotations of ‘mischief’ and its connection to ‘unruly children’, but is
also a highly conventional one, since ‘last hebben van’ [be troubled by] is
the common expression for health problems, unlike the ST’s description of
aches and illnesses as ‘mischief’. In simile 2, the most creative part of the
simile has been deleted: ‘We zouden elkaars stemmen met hoofdknikjes
begeleiden’ [We would accompany each other’s voices with nods] (2020,
p. 25). Though it can be argued that this partial substitution evokes
associations with music (‘begeleiden’ means ‘accompany’, as in ‘be
accompanied on the violin’), and the idea of nodding as punctuation is
perhaps not essential to the novel’s main themes, it is particularly strange
that such a creative and original image has simply been deleted.
There are also a number of variations in the human translations that
indicate where the translators struggled, but also where they may have
created added literary value. One metaphor the student translators seemed
to have struggled with is ‘I hear where you’re coming from’. Considering
how the simile develops, it makes sense to translate the metaphor directly,
yet the fact that this is not a fixed expression in Dutch may have prompted
the students to look for alternatives. In fact, they produced eight different
translations, with only one student opting for the literal ‘Ik hoor waar
je vandaan komt’ [I hear where you’re coming from], which is also the
solution the professional translator chose. The other solutions are ‘Ik snap
waar je vandaan komt’ [I understand where you’re coming from], losing
the connection with sound, as do the solutions ‘Ik zie waar je vandaan
komt’ [I see where you’re coming from], ‘Ik begrijp waar je vandaan komt’
[I understand where you’re coming from’] and ‘Ik zie wat je bedoelt’ [I
see what you mean]; the latter also deletes the ‘coming’ metaphor, thus
losing the connection with ‘traveller’. Two opt for alternative movement/
travel metaphors, namely ‘Ik kan erin komen’ [I can get into it > I can
understand] and ‘Ik ga helemaal met je mee’ [I go with you all the way >
I totally agree]; these display a certain level of creativity but are not well
matched with the continuing simile of the voice as a traveller.
One important aspect of stylistics where the human translator can
create added literary value is by retaining and recreating wordplay and
sound effects, both of which occur at different points in the excerpt,
though perhaps only noticeable to the careful reader. One instance occurs
in the simile in the opening of the chapter: ‘Above, on the white ceiling,
a relief ornament in the shape of a wreath, and in the center of it a blank
180 Aletta G. Dorst
space, plastered over, like the place in a face where the eye has been taken
out’ (1996, p. 17; emphasis added). This metaphor is in itself not difficult
to understand or translate, and both Google Translate and the human
translators (professional and student) retain the metaphor image, with only
minor variations in lexical choice between synonyms that do not affect
the metaphor image (e.g. ‘ruimte’ [space] versus ‘plek’ [spot]). Yet what
makes this metaphor creative and interesting is the sound effect created
by ‘space—place—face’ combined with the wordplay on ‘blank face’.
The reference to the meaning of ‘blank’ evoked by its collocation with
‘face’—that is, expressionless and devoid of emotion—is now projected
onto ‘space’. This elusive reference is impossible to translate directly into
Dutch since the two meanings ‘empty’ and ‘expressionless’ require two
different words.
The student translations show that they have all but one opted for the
more literal interpretation that applies to the ceiling, ‘leeg’ [empty]. Only
the choice for ‘ijlte’ counts as a creative solution here, marked and more
poetic in terms of lexical choice and providing an elusive reference to the
verb ‘ijlen’ [talk in a way that shows you are delirious]. The same student
who opted for ‘ijlte’ also found a way to recreate the sound effect, namely
by being more creative in translating ‘plastered over’ to create a rhyme with
‘face’: ‘een ijlte, met pleister gedicht, zoiets als de plek in een gezicht waar
een oog uit is gestoken’ [a hollowness, with plaster closed off, something
like the place in a face where an eye has been stabbed out]. In both
decisions, the human translator has added creativity and literariness to the
translation, without altering the metaphor. Conversely, and surprisingly,
the professional translator has altered and normalised the structure of the
sentence, combining ‘plastered over’ with ‘empty space’, thus reducing the
number of interruptions in the sentence and changing its flow.
While the creative, more literary metaphors may thus be relatively
unproblematic for both human and machine translation, it may be more
interesting to look at those cases of metaphor that may well slip through
the cracks of our attention, but which, especially in literary texts, may
actually be an inherent part of the fabric of authorial style. There are several
conventional metaphors in the excerpt. One of the least striking examples
at first glance is perhaps the use of ‘touch’ in ‘This is the kind of touch they
like: folk art, archaic, made by women, in their spare time, from things that
have no further use’ (1996, p. 17) where the noun is used in its conventional
metaphorical sense of ‘quality’ or feature’. Yet Atwood is cleverly playing
on two common collocations of ‘touch’, namely ‘a personal touch’ and ‘a
nice touch’. Normally, this kind of description of a room would include
a reference to ‘a personal touch’ which makes the omission of ‘personal’
in this context appear deliberate, reinforcing the idea that everything and
everyone in this world has been de-personalised. The formulation ‘touch
Metaphor in Literary Machine Translation 181
they like’ subtly suggests that this is the opposite of ‘a nice touch’; it is the
last thing the narrator likes, which is reinforced by the later confession, ‘I
hunger to touch something, other than cloth or wood. I hunger to commit
the act of touch.’ (1996, p. 21) where the unusual collocation ‘commit the
act of touch’ signals that touching is considered a crime. A quick corpus
search of the novel (see also Chapter 12, this volume) reveals that different
forms of the lemma ‘touch’ occur 42 times, creating a lexical network
throughout the novel that supports this main theme.
Looking at the translations, the most natural way to translate ‘touch’ is
by reducing it to sense, since the Dutch word ‘aanraking’ does not have the
same polysemy and Dutch readers might not understand the description of
a carpet as a form of touching. As pointed out by Guerberof-Arenas and
Toral (2022), creativity only works if it is both original and acceptable.
Retaining the metaphor may be original here, but probably not acceptable.
Google Translate of course retains the metaphor. Apparently, nothing
in the immediate context has led the algorithm to privilege a non-direct
translation (see Chapter 1, this volume). Interestingly, if we add ‘personal’
to the English sentence, the output produced by Google Translate changes
into the non-direct ‘Dit is het soort persoonlijke touch waar ze van houden’.
The human translations show that the naturalness of using ‘aanraking’
must indeed have played a role: of the eight students, only one has opted
for retention, three have used ‘detail’ [detail], one used ‘stijl’ [style] (also
the word used in the official translation), one used ‘sfeer’ [atmosphere],
one used ‘toets’ [brushstroke] and one opted for deletion of the metaphor:
‘Dit is waar ze van houden’ [This is what they like]. Though the English
metaphor is conventional and stylistically relevant, it is very hard to
translate and the preferred solution is clearly an idiomatic translation.
Equally hard to translate, for both humans and machine, is the
colloquial ‘I've heard them at it sometimes’ (1996, p. 21). Google Translate
mistranslates the idiomatic metaphor ‘to be at it’ (which conceptualises
actions and states as locations) as ‘Ik heb ze er wel eens naar horen
luisteren’ [I’ve heard them listen to it at times], which is an incredibly
complex formulation and a fluent translation but the wrong meaning.
But the machine is not the only one confused by this inconspicuous
metaphor; three of the eight students also misunderstood it and produced
incorrect translations: ‘Ik hoorde ze er soms over’ [I heard them about
it sometimes], ‘Ik hoorde ze af en toe gaan’ [I heard them go sometimes]
and ‘Ik heb ze er af en toe over gehoord’ [I have heard them about it
sometimes]. The same happens for the conventional idiomatic metaphor
‘but they found her out all right’ (1996, p. 21), which Google Translate
incorrectly translates as ‘maar ze hebben haar goed gevonden’ [they did
well in finding her]. Only one student also misunderstood the expression,
producing ‘maar ze vonden haar in prima gezondheid’ [but they found
182 Aletta G. Dorst
her in good health]. The other seven students translate the expression
correctly, but once again we see that there is much more variation in
the options they choose: ‘maar ze hadden haar door’ [but they had her
through > they were unto her], ‘maar ze hebben haar even goed te pakken
gekregen’/‘maar ze hebben haar net zo goed gepakt’ [but they caught
her all right], ‘maar ze is wel gesnapt hoor’ [but they caught her], ‘maar
ze hebben haar mooi doorzien’ [but they saw through her all right] and
‘maar ze zijn er natuurlijk achter gekomen’/‘maar ze zijn er toch achter
gekomen’ [but they came behind it of course > they discovered it all
right].
For the conventional metaphor ‘eat up’ in, ‘Jealousy, it must have
been, eating her up.’ (1996, p. 21) there is more consensus amongst
the student translators, with only one opting for the direct translation
‘opeten’ and one opting for the normal Dutch collocation with ‘verteerd
worden door’ [be digested/consumed by], and five students choosing the
near-synonym ‘opvreten’ (‘vreten’ being the vulgar form of ‘eten’, i.e.
eat without manners), which is commonly used in the Dutch expression
‘vreten aan’ for when you are plagued by negative emotions (English ‘a
gnawing guilt’). One student used a different metaphor, namely ‘vermor-
zeld door jaloezie’ [crushed by jealousy], a novel combination in Dutch.
The official translation deletes the eating metaphor, opting for ‘Ze was
natuurlijk straaljaloers’ [She was of course really jealous] (2020, p. 25),
which draws attention because of the novel compound ‘straaljaloers’.
Since ‘straal’ [beam, jet] normally combines with ‘bezopen’ [drunk] in
‘straalbezopen’, meaning ‘very drunk’, this marked lexical choice may be
intended to suggest ‘drunk with jealousy’ to the reader, creating a seman-
tic link with the following sentences, in which the women talk about a
man drinking toilet cleaner and not noticing because he was probably
too drunk.
Google Translate of course predictably retains the metaphor image:
‘Jaloezie, dat moet het geweest zijn, haar opeten’ [Jealousy, that must have
been it, eating her]. However, the interrupted sentence structure has resulted
in a grammatically incorrect (and rather ridiculous-sounding) translation,
since ‘haar opeten’ shows no agreement with the subject ‘Jaloezie’. Yet
it is not the metaphor itself which is problematic here, only Atwood’s
preference for complex and interrupted sentences. When presented with
the same metaphor in a normalized sentence structure, Google Translate
produces a translation that is idiomatic, namely ‘Paula werd dagenlang
verteerd door schuldgevoelens’ [Paula was for days consumed by feelings
of guilt]. This shows that when we analyse MT output, we have to be
aware that problems and errors in metaphor translation may not in fact
be a metaphor problem but rather a problem of linguistic realization and
sentence structure.
Metaphor in Literary Machine Translation 183
9.4 Conclusion
Overall, the results support Ali’s (2006) claim that both metaphor and
translation are highly complex phenomena that require considerable
knowledge of the language and cultures involved and the process of
metaphor translation cannot be reduced to a simply mechanical application
of translation procedures for pre-defined types of metaphor. Interestingly
though, and perhaps against common expectations, the results also show
that metaphor translation is not necessarily difficult or problematic for
MT engines, even in literary texts. Similar to Dagnev and Chervenkova
(2020), the current study found that creative metaphors translate quite
readily, while foregrounded metaphors and grammatical metaphors are
more problematic. This can now be argued to be the case for both humans
and machines.
While Chita and Stavrou (2020) and Dagnev and Chervenkova (2020)
found that the professional translators preserved the literary style of the
source texts very carefully, sometimes even unsuccessfully, and as such
did not show signs of Toury’s norm of standardisation, the current study
found that it was the professional translator who most often opted for
deletion and normalisation, especially for creative metaphor, similar to the
findings of Dorst (2018). Yet the substituting metaphors could often also
be related to stylistic considerations, which is in line with Park’s (2009)
claim that translators do need to keep the reader in mind and may opt
for equivalence when retaining the metaphor image disrupts the reading
experience. While humans are clearly more creative in the solutions they
consider, and their solutions in themselves may be creative and original,
this does in fact sometimes mean that the stylistic coherence of metaphor
patterns is altered, disrupted or removed.
For both humans and machines, the creative, more clearly literary
metaphors appeared to be the easiest to translate, most likely since direct
translation is simply the most suitable solution. Of course, we should
be careful not to suggest that the process leading up to these solutions
is similar since human translation involves decision-making while MT
involves computation. The machine does not reflect on the different options
in light of the text’s genre, style, skopos and the expectations of the reader.
It calculates what is the most likely translation for words in context, and
as such the decision to retain or alter a metaphor may be affected more
by sentence context, and the training data, than the metaphor’s form or
function.
When metaphors are conventional, especially in the form of fixed
collocations or idiomatic expressions, human translators show more
variation in their solutions and the machine starts making mistakes.
What makes such metaphors difficult to translate, especially for a
machine, is that they require careful attention to the way the metaphor
184 Aletta G. Dorst
References
Al-Hasnawi, A. R. (2007). A cognitive approach to translating metaphors.
Translation Journal, 11(3). https://translationjournal.net/journal/41metaphor
.htm
Ali, A. S. M. (2006). On the translation of metaphor: Notions and pedagogical
implications. International Journal of Arabic–English Studies, 7, 121–36. http://
www.ijaes.net/article/FullText/6?volume=7&issue=1
Alvarez, A. (1993). On translating metaphor. Meta, 38(3), 479–90. https://www
.erudit.org/fr/revues/meta/1993-v38-n3-meta341/001879ar.pdf
Atwood, M. (1996 [1985]). The Handmaid’s tale. Vintage Books.
Atwood, M. (2020 [1987]). Het verhaal van de dienstmaagd. Trans. G. de Blaauw.
Prometheus.
Besacier, L. (2014). Traduction automatisée d’une œuvre littéraire: Une étude
pilote. Traitement Automatique du Langage Naturel (TALN), Marseille.
https://hal.inria.fr/hal-01003944
Berman, A. (2021 [1985]). Translation and the trials of the foreign. In L. Venuti
(Ed. & Trans.), The translation studies reader (4th ed., pp. 247–60). Routledge.
https://doi.org/10.4324/9780429280641
Burmakova, E., & Marugina, N. (2014). Cognitive approach to metaphor
translation in literary discourse. The XXV annual international academic
conference, language and culture, 20–22 October 2014. Procedia—Social and
Behavioral Sciences, 154, 527–33. https://tinyurl.com/4xwwckmw
Chita, A., & Stavrou, C. (2020). The metaphor in literature and the effect on
translation. Nordic Journal of English Studies, 19(2), 117–42. https://njes
-journal.com/articles/abstract/10.35360/njes.562/
Dagnev, I., & Chervenkova, Z. (2020). Comparative study of metaphor in literary
texts and their translations. 5th International e-Conference on Studies in
Humanities and Social Sciences. http://centerprode.com/conferences/5IeCSHSS
.html
Dagut, M. (1976). Can metaphor be translated? Babel, 22(1), 21–33. https://doi
.org/10.1075/babel.22.1.05dag
Dorst, A. G. (2011). Metaphor in fiction: Language, thought, communication.
Boxpress. https://tinyurl.com/3x8fcd8d
Metaphor in Literary Machine Translation 185
Toral, A., & Way, A. (2015a). Translating literary text between related languages
using SMT. Proceedings of the Fourth Workshop on Computational Linguistics
for Literature, NAACL, Denver, CO (pp. 123–32). https://aclanthology.org/
W15-0714.pdf
Toral, A., & Way, A. (2015b). Machine-assisted translation of literary text: A case
study. Translation Spaces, 4(2), 240–67. https://doi.org/10.1075/ts.4.2.04tor
Toral, A., & Way, A. (2018). What level of quality can neural machine translation
attain on literary text? In J. Moorkens, S. Castilho, F. Gaspari, & S. Doherty
(Eds.), Translation quality assessment: From principles to practice (pp. 263–
87). Springer. https://arxiv.org/abs/1801.04962
Toury, G. (1995). Descriptive translation studies—And beyond. John Benjamins.
https://doi.org/10.1075/btl.4
Van Den Broeck, R. (1981). The limits of translatability exemplified by metaphor
translation. Poetics Today, 2, 73–87. https://doi.org/10.2307/1772487
Venuti, L. (Ed.). (2021). The translation studies reader, 4th ed. Routledge. https://
doi.org/10.4324/9780429280641
Voigt, R., & Jurafsky, D. (2012). Towards a literary machine translation: The
role of referential cohesion. In D. Elson, A. Kazantseva, R. Mihalcea, & S.
Szpakowicz (Eds.), Proceedings of the NAACL-HLT 2012 Workshop on
Computational Linguistics for Literature, Montréal, Canada (pp. 18–25).
Association for Computational Linguistics. https://aclanthology.org/W12-25
.pdf
Part 3
10 KonText in Trilingual Studies—
Supporting Phraseology
Translation Based on the EPB
Corpus
Angelika Peljak-Łapińska
10.1 Introduction
Phraseology is notoriously one of the most difficult parts of translation,
as well as areas of learning. It is not only problematic to find the best
analogue in the target language, but also to identify fixed expressions in the
source language. The stakes are high, as the correct usage of phrasemes, or
fixed expressions, makes the final translation seem natural and determines
whether it is successful or not. It also influences the mutual understanding
between two speakers of a language. Therefore, translators, as well as
language learners, seek the best resources for finding the solutions they
need, and these are often parallel corpora. They are perfect for the task, as
will be illustrated below.
This chapter starts by discussing the problem of translating phraseology,
where it comes from and why it is not trivial. The following section presents
the KonText application, discussing in detail various options available for
the users, as well as the composition of the English–Polish–Belarusian (EPB)
Parallel Literary Corpus which can be queried in KonText. Finally, the
chapter focuses on the case study of make as a phrasal verb and as a core of
the phrase. The examples discussed are make up, make […] difference and
make up one’s mind. These phraseological expressions and their Polish
and Belarusian translations are used to showcase the benefits of using a
trilingual corpus and the KonText application in translating phraseology.
10.2 Phraseology in Translation
Countless articles have been devoted to the problem of translating
phraseology. Often students spend an entire semester of their Translation
Studies programme1 learning exclusively about the intricacies of
1 Such as ‘Phraseology and Translation’ for the Master’s in Translation and Interpret-
ing at UC Louvain or ‘Comparative Phraseology’ for students in Applied Linguistics at
DOI: 10.4324/9781003357391-14
190 Angelika Peljak-Łapińska
(cf. Chapter 14, this volume). Belarusian translators, when facing the lack
of a direct English–Belarusian dictionary or corpus, very often resort to
Russian–English or Polish–English resources. And recently one additional
trilingual resource has been made available, namely the English–Polish–
Belarusian Parallel Literary Corpus. The following section discusses
the composition and advantages of the EPB Corpus, not only for the
translators or learners of the minority Belarusian language, but also for
Polish.
After logging into the KonText application, the translator or scholar has
access to the entirety of the EPB Corpus and also to the aligned part (over
700,000 words in total). After logging in (setting up an account is required
to fulfil the copyright requirements) at https://kontext.clarin-pl.eu, we get
access to the KonText interface with all its functions.
The default window, Search, in the corpus, has three main elements:
Corpus, Query type and Query. Query is a window for entering the search
term. Query type allows for specifying whether the search term is a spe-
cific word form, lemma, phrase, character or a query in a specific query
language (Corpus Query Language (CQL) in KonText). The Corpus but-
ton shows available corpora; only after choosing a corpus which is part
of a larger parallel corpus does a new option appear. The ‘Add a Corpus’
button displays a list of subcorpora aligned with the user’s first choice.
Choosing ‘Parallel Corpus’ results in adding the second three-element win-
dow allocated to the other corpus. This operation can then be repeated to
produce query windows for two or three aligned corpora. Additionally, the
user can narrow down the search by filtering selected word forms (option
‘Specify Context’) and by choosing texts with specific properties (option
‘Specify Query’ according to the meta-information), such as publication
date or the title of the text.
Typing the query is obligatory for the first corpus. Such a search returns
segments containing the search term (be it a character, word or phrase) in
the first selected corpus and the aligned segments from the parallel corpus
(or corpora). The search can be narrowed down by typing the query term
for the parallel corpus (or corpora) as well. In that case, only the segments
containing the query in both (or more) languages will be displayed.
KonText highlights words that were introduced in the Query window
and shows some basic statistics: overall number of hits, number of hits per
million (in relation to the whole corpus) and the average reduced frequency
(ARF), which combines the simple word count and document count, that
is the number of occurrences in each document. Clicking the highlighted
word or the segment (when no word is highlighted) opens a pop-up
window displaying a 150-word fragment of text surrounding the search
term (or the segment). By default, the results are shuffled, which means
that even if a search term appears a few times in one text the excerpts
from this particular text will not be displayed one after another but among
fragments of other texts containing the query. This is one of the ways of
protecting copyright, but it makes it more difficult to read substantial parts
of the whole text using the corpus search interface.
The search results can be enhanced by means of filtering (Filter tab).
KonText allows positive filtering, that is, selecting part of the result, and
negative filtering, namely removing some concordances according to words
(characters, phrases or any other criteria specified with the use of the query
194 Angelika Peljak-Łapińska
10.4
Make in the EPB Corpus
As an example to illustrate the use of the EPB Corpus and the KonText
application, we shall take a close look at the verb make, as a centre of both
phrasal verbs and phraseological expressions. This choice is motivated by
the broad usage of the word in the source language and its high frequency
KonText in Trilingual Studies 195
Polish Belarusian
make up (1) posprzątać [to tidy up] прыбраць [to clean]
make up (2) układać [to compose] прыдумляць [to come
up with]; скласці [to
compose]
make up (3) wypisywać [to write]; sporządzić none
[to draw up]
make up (4) sprawić sobie [to buy oneself зрабіць [to make]
something]
make up (5) zniechęcić [to daunt] адгаварыць [to dissuade]
make up (6) być ceną [to be a price] адплата [a payback]
196 Angelika Peljak-Łapińska
Based on the material from the EPB Corpus we can deduce that Belarusian
exhibits a low level of phraseological equivalence (PE) in the case of the
expression make […] difference. A trilingual corpus gives a Belarusian
translator a double advantage. First, it makes the translator aware of that
low PE and offers a whole spectrum of translation possibilities. At the same
time proving that there is no regularity informs the translator that there is
a space for creativity. Moreover, the trilingual corpus shows increased PE
in another language (in the case of the EPB Corpus, Polish) and points to
a solution with better prospects of becoming the regular one. Similarly to
the case of the phrasal verb make up, the exercise of searching for various
KonText in Trilingual Studies 197
Table 10.2 Continued
Table 10.2 Continued
If it was Fond of A jeśli lubi, to czy nie robi А калі Ставіцца Прыязна,
Pigs, did it make mu różnicy, jaki to jest ці адыгрывае нейкую
any difference rodzaj prosiaczka? ролю—гэта племянны
what sort of Pig? [And if he likes, then does Парсючок ці не
is make any difference to племянны?
him, what kind of piglet [And if he Is Friendly, then
is it?] does it play any role if
it is Piglet’s relative or
not?]
10.5 Conclusions
The trilingual EPB Corpus gives translators real-life, ready-to-use solu-
tions, not only in the minority, Belarusian language but also in Polish.
Such solutions are especially important in phraseology translation because
translating phraseology is crucial for target text quality. A translator
equipped with a versatile tool such as KonText can efficiently analyse the
available material and use it to firstly identify, and secondly make well-
informed decisions concerning the translation of fixed expressions in the
source language.
When a minority language is involved, even a small bilingual parallel
corpus is a huge help for the translator, but multilingual settings provide
even more solutions. The material in another language provides addi-
tional information about the context of certain words and expressions.
Moreover, when the translation in a minority language is omitted, the
example in the other language might serve as the primary source of infor-
mation, as shown above with the example of the phrasal verb make up
in its various contexts.
Based on the examples provided in this study, it seems that minority lan-
guages have lower levels of phraseological equivalence, and consequently
they often contain more diverse options for translating fixed expressions.
Therefore, a trilingual corpus can also be a source of inspiration for non-
minority translators, as is shown by the examples of Belarusian and Polish
translations of the English expressions make […] difference and make up
one’s mind. Additionally, such a resource is an advantage for language
learners.
References
BNkorpus. (2021a). Biblijny korpus [Biblical corpus]. http://biblija.bnkorpus.info
/index.html
BNkorpus. (2021b). Biełaruski N–korpus [Belarusian N–corpus]. http://bnkorpus
.info/
Colson, J.-P. (2020). Computational phraseology and translation studies: From
theoretical hypotheses to practical tools. In J.-P. Colson & G. Corpas Pastor
(Eds.), Computational phraseology (pp. 66–81). John Benjamins. https://doi.org
/10.1075/ivitra.24.04col
Deutscher Wortschatz. (2018). Leipzig Corpora Collection—Wortschatz
Belarusian. http://corpora.uni-leipzig.de/de?corpusId=bel_newscrawl_2011
Hareide, L., & Hofland, K. (2012). Compiling a Norwegian–Spanish parallel
corpus: Methods and challenges. In M. P. Oakes & M. Ji (Eds.), Quantitative
methods in corpus-based translation studies, 51 (pp. 75–114). John Benjamins.
https://doi.org/10.1075/scl.51.04har
Machálek, T. (2020). KonText: Advanced and flexible corpus query interface. In
Proceedings of the Twelfth Language Resources and Evaluation Conference,
202 Angelika Peljak-Łapińska
Lisa Horenberg
11.1 Introduction
To explore the opportunities for non-automated translation technologies
in literary translation, as outlined in the introductory chapter to this vol-
ume, this chapter focuses on technological innovations for literary source-
text analysis (STA) in particular. Inspired by traditional definitions of
computer-assisted translation (CAT) tools, this chapter provides insight
into the benefits of corpus linguistics in the preliminary, STA phase of the
literary translation process. More specifically, it tests the hypothesis that
text-analysis software Voyant Tools (Sinclair & Rockwell, 2022; hence-
forth, ‘VT’) can assist literary translators by providing them with basic yet
valuable information about their source text (ST), which might help them
achieve a more detailed understanding of the ST and start their translation
process more efficiently.
To identify possible benefits for literary translators using VT for STA,
this chapter focuses on a workflow-oriented case study1 in which VT was
used to analyse Evelyn Waugh’s (1903–1966) short story Mr Loveday’s
Little Outing (1951, first published in 1936; henceforth, ‘MLLO’). VT is a
free, online, and open-source text-analysis program that comprises a range
of visualisations to support distant reading (see Section 11.2). ‘Designed to
facilitate reading and interpretive practices for digital humanities students
and scholars as well as for the general public’,2 the tool efficiently provides
quantitative data about one or more texts. Its interface (referred to as a
‘skin’ by VT) is easy to use and requires little experience. The user copies
and pastes or uploads their text or document on the home page, and VT
calculates word frequencies, correlations and other types of quantifiable
textual data. This data is then presented in five visualisation panes that
1 This case study was first carried out for my (2019) master’s thesis, which also involved
Sketch Engine and Trados Studio, and a computer-assisted (re)translation of the ST.
2 https://voyant-tools.org/docs/#!/guide/about.
DOI: 10.4324/9781003357391-15
204 Lisa Horenberg
3 This chapter discusses some but not nearly all of VT’s functionalities, nor does it cover how
each of the tool’s features work. For more specific information on VT, see www.voyant
-tools.org/docs/.
4 Although the modern reality of Kenny’s (1999, p. 67) observation that ‘in MT the machine
seeks to replace the translator by fully automating the translation process’ is more nuanced
(see Chapter 1 of this volume), being made expendable by MT remains a genuine fear
among translators (Daems, 2021) and cases in which MT approaches human-quality trans-
lation exist (e.g. Popel et. al, 2020).
5 Coined by Moretti (2000), distant reading essentially entails applying computational
approaches (including corpus linguistics) to texts.
Voyant Tools’ Little Outing 205
MT and CAT tools both generally reside in categories four and five; Alcina
(2008, p. 98) explains that language tools and resources are ‘designed for
the collection and organisation of linguistic data’ and mainly consist of
‘electronic dictionaries, databases and text corpora’, and that translation
tools ‘work with at least one [ST] and one [Target Text (TT)] at the same
time and establish relationships between both texts on a segment level or
on the whole text level’ (ibid.). Using text-analysis tools as translation
technologies surpasses Alcina’s categorisation: while some text-analysis
tools allow users to create bilingual corpora and/or help organise linguistic
data, they do not completely fit the categories CAT technologies typically
inhabit. Instead, text-analysis tools can logically be seen as a new, sixth
subcategory of translation technology, which better represents their added
value to the field of translation.
In this case, the primary selection criteria for the ST were that its length
would allow for analyses and (re)translation into Dutch of the full text
within the scope of the project, that the text had to have been written
by an established author and that its existing Dutch translation was due
for an update. The selected ST was Evelyn Waugh’s story Mr Loveday’s
Little Outing. Waugh was an English author mainly known for the novel
Brideshead Revisited (1945), whose earlier works include Vile Bodies
(1930) and A Handful of Dust (1934). Described as ‘the funniest man of
his generation’ (A. Waugh, quoted in Wykes, 2016, p. 1) and someone
who ‘liked things to go wrong’ (A. Fleming, quoted in ibid.), Waugh had
a tendency to ‘make fun of situations usually regarded as tragic’ (Bucaria,
2008, pp. 218–19) that resulted in elegantly written, dark-humorous
stories.
To summarise, Mr Loveday’s Little Outing is set in early 20th-century
Britain and consists of three numbered parts that succeed each other quite
seamlessly. The story follows Angela Moping, a young lady who for the
first time joins her mother (Lady Moping) on a visit to her mentally ill
father (Lord Moping) in an asylum. During Angela’s visit, she meets her
father’s officious secretary and caretaker, Mr Loveday. Angela learns that
Loveday is also an inmate and has been for 35 years after he killed a young
206 Lisa Horenberg
woman. Convinced that his time at the asylum has restored Loveday’s
mental sanity, Angela arranges for him to be released. At the end of the
story Loveday voluntarily returns to the asylum, just a few hours later.
11.3.2 Methodology
Christensen et al. (2017, p. 14) explain that existing research into the
technical aspects of translation technology usually highlights the tools’
functions and that existing workflow-oriented and industrial research
often focuses ‘more on the implementation of technology in the language
industry than on the impact of this on translation processes’, or how
the implementation of technology ‘affects [translators’] minds and work
processes’ (ibid.). This case study combines the research categories of
Christensen et al. by incorporating aspects of technology-oriented research
into a primarily workflow-oriented method that is focused on the practical
use and functionality of computer-assisted STA.
The case study is inspired by and gratefully borrows from Youdale’s
aforementioned CDR model. The ST is first analysed without special
technology to highlight textual features that are deemed important or
can cause translation problems, to which the subsequent computer-
assisted STA can be compared. This first traditional type of STA is usually
an explicit step in the process for translation students but is ‘peculiarly
internalised’ in more experienced translators as ‘experience makes [them]
feel when special attention is necessary’ (Pym, 2011, p. 2).
The second part contains a freestanding6 two-step computer-assisted
STA7 that first looks at VT’s basic results and then explores them in more
detail. This method covers the first two parts of the translation process
identified in Youdale’s (2020, p. 41) CDR model, respectively ‘making
notes during and after initial reading’, and ‘preparing to translate’. By
following VT’s layout, the computer-assisted STA is expected to include a
set of elements that could be stylistically relevant on either a narratological,
lexical, or grammatical level, or on the level of context and cohesion, e.g.
‘plot structure’, ‘character names, attributes, dialogues, registers, etc.’,
6 My experience analysing the text for Part One (and translating it without CAT, which was
part of the original research) of the case study was likely to interfere with the computer-
assisted STA in Part Two to some degree. This interference was minimised by scheduling a
gap of nearly two months between working on Parts One and Two, as well as by perform-
ing the computer-assisted STA as objectively as possible. Limitations of the research are
also discussed in Section 11.4.
7 Although VT can process PDF documents, the PDF ST was converted to Word and cleaned
of formatting and special characters to increase the reliability of results before being
uploaded to VT. This process is mentioned in the original research project (Horenberg,
2019).
Voyant Tools’ Little Outing 207
11.4 Case Study
11.4.1 Traditional STA
This section provides a brief STA by commenting on themes in the text and
how they are reflected stylistically. Considering the ST’s translation into
Dutch, it discusses translation issues on both the micro and macro level
and comments on what approach can best be taken for their translation.
Mr Loveday’s Little Outing represents Waugh’s iconic style through its
dark-humorous, rather ironic tone and content. With regard to content,
one of the main features that can be identified throughout the story is
Angela’s personal development. In the first part of the story when Angela
visits her father at the asylum, she is rather passive. However, this part
ends with Angela stating that Loveday’s situation ‘doesn’t seem fair’
(MLLO, p. 11) as ‘[she has] never seen anyone saner’ (MLLO, p. 10). This
is a stepping-stone to the story’s second part, which provides Angela with
more agency as it elaborates on her surrogate feeling of injustice and her
determination to help Loveday when he explains to her that he would like
to have just one little outing. In the third and final part of the story, Angela
truly becomes ‘a woman with a cause’ (MLLO, p. 13) as she works for
and achieves Loveday’s release, which in turn can be seen as a big personal
accomplishment for her. However, events turn dark when it is revealed,
albeit implicitly, that he has killed another young woman who passed him
on the road. Angela’s development is the leitmotif of the story, and the plot
twist affects her as well as Loveday, as it negates her development.
Angela’s personal development is also reflected stylistically, as her
passive role in the first part of the short story is emphasised by descriptions
in a passive voice, such as: ‘it was being made at her own suggestion’
(MLLO, p. 7). Angela is more assertive in the second part of the story,
and her thoughts and feelings are voiced more actively and clearly. This
continues in the story’s third part, where she has ‘a new purpose in life’
(MLLO, p. 13) and finally achieves something real by herself. Stylistically,
Angela’s newfound attitude and drive are represented in the flow of the
text, with its speed increasing as the narrative proceeds. These stylistic
features together reinforce Angela’s role in the story and build up suspense
for the plot twist that concludes the story.
Similarly, the story shows a pattern of confusion and uncertainty in
both style and content. A first example of this is the sudden shift from
clear, sunny skies to black squalls at a garden party in the beginning of
the story. The changing of the weather is described in phrases that are
separated by semicolons, providing a rather chaotic paragraph that mirrors
208 Lisa Horenberg
the ongoing chaos stylistically. Secondly, there are many factors that create
a certain level of unrest in the reader, including a general confusion and
inconsistency with regard to Loveday and his role at the asylum as well
as in the story itself. In these and other instances, Waugh shifts between
dialogues and background information, between direct and indirect speech
and the story’s omniscient narrator frequently changes perspective. These
shifts unsettle the reader throughout the text and foreshadow the coming
plot twist.
The plot-twist at the end of the story that negates Angela’s development
by revealing Loveday’s murderous relapse is built up to and foreshadowed
throughout the text in both style and content. These stylistic and narrative
turns are reinforced by ironic undertones that define Waugh’s text. These
features should therefore ideally all be maintained in translation. The
translation strategy thus requires the translator to bear in mind the macro-
level of the text when it comes to specific translation issues, as it is precisely
the way different elements are intertwined that characterises this text.
8 https://voyant-tools.org/docs/#!/guide/stopwords.
Figure 11.1 Initial results of the case study’s computer-assisted STA in Voyant Tools. https://voyant-tools.org/docs/#!/guide/about.
Voyant Tools’ Little Outing 209
210 Lisa Horenberg
focus on that specific word as well. For example, selecting ‘angela’ in Cirrus
causes the standard top-centre pane, Reader, to show the distribution of
‘angela’ throughout the text. By default, Reader shows the full text and
can be navigated by scrolling down or selecting words. Reader can also
be switched to TermsBerry, another word cloud that shows how words
‘co-occur (that is, to what extend [sic] they appear in proximity with one
another)’.9 TermsBerry reveals that ‘angela’ has the most collocates with
other frequent words. Interestingly, this is not the case for ‘loveday’, but
it is for ‘don’t’, which initially seems a less significant word with a total of
only eight occurrences in the whole text. ‘Said’ shows the highest number
of the same collocates with ‘angela’, ‘doctor’, ‘lord’ and ‘father’. Not much
can be derived from these isolated results just yet, although there seems to
be some sort of dialogue between or about Angela, Lord Moping and the
doctor. When ‘said’ is selected, Reader additionally shows that this word
occurs most often in a section just before the middle of the text, indicating
that some dialogue might take place in that part of the text. These views
allow for some cautious statements about the contents and structure of
the text but require additional information from other views to yield more
telling results.
The top-right Trends view in Figure 11.1 provides some additional
insight. Trends by default shows graphs of the five most frequent words,
with a relative frequency (number of occurrences per million words) on
the vertical axis and ten default document segments on the horizontal axis
(each one-tenth of the text’s total length in words), which allow for visual
comparison of relative word frequencies. Once again, more details are
provided by the alternative top-right pane called Document Terms, which
shows absolute values of relative word frequencies in a table containing
dozens of different words. Trends provides particularly interesting
information in this case. A focus on ‘said’ seems to confirm there is indeed
dialogue just before the middle of the text and some more at about three-
quarters of the way through. When only the two most frequently recurring
terms are selected in Trends, something interesting happens as well, as
the relative frequencies for ‘angela’ and ‘loveday’ seem to correlate to a
certain extent. Both names barely occur in the first two text segments,
but then occur more and more frequently, although exact numbers vary
between segments. Considering both terms are names of characters,
logically, they would indicate some sort of character relevance to the
story. The increasing mentions of Angela point towards her increasing role
in the story. The number of occurrences of Loveday, however, starts to
decrease after his initial introduction halfway through the story. It is clear
9 https://voyant-tools.org/docs/#!/guide/termsberry.
Voyant Tools’ Little Outing 211
that ‘angela’ is barely mentioned in the final quarter of the story and that,
contrary to the direction of the graph in the first eight segments, ‘loveday’
suddenly emerges again in segments nine and ten. This shift alerts users
that something is happening. Similarly, Trends shows that ‘moping’ is
mentioned quite often in the first parts of the story, but is then gradually
mentioned less frequently, indicating Lord and Lady Moping’s roles might
decrease as the story develops.
Moving on, VT’s default interface offers two more panes that each
contain various views. In the bottom-left pane, Summary initially provides
a brief summary of basic textual data (Figure 11.1). There is one single
document uploaded to VT, which contains 2,655 words with 912 unique
word forms. It also shows vocabulary density (the number of unique
words divided by the total word count), in this case 0.342, and the average
number of words per sentence, in this case 13.7. In this pane’s Documents
view, more documents can be added and the documents that are used can
be downloaded. The third view that can be used in this pane is Phrases,
which by default shows recurring phrases that are between two and
30 words long, but these criteria can be customised. The phrases in the
current default selection state there are a number of four-word phrases
that occur twice and a few shorter phrases that occur more frequently. To
learn more about the role of these repeated phrases in the ST and to be able
to decide if consistency is desired in the TT as well, this information needs
a closer look (see Section 11.4.3).
In the bottom-right pane, Contexts provides textual context the most
systematically by showing all occurrences of a word or phrase and the
words that precede and follow it (commonly referred to as Key Word in
Context (KWIC) concordance). Users can customise this view by specifying
how many words they would like to see as context both left and right of
the selected word or phrase, which is set to five by default. This pane
can also present Bubblelines, again visualising the distribution of frequent
terms across the text, whereby the text is represented as a horizontal line
and terms are represented by circles (bubbles) placed on that line. Finally,
this bottom right pane can alternatively show Correlations, a table that
is based on a ‘correlation coefficient [that] is calculated by comparing the
relative frequencies of terms (relative to each document for the corpus or
relative to each segment of document)’.10 For the current case study, these
three views confirm there are some repeating words and phrases. However,
the size of the text seems to limit the number of repetitions and respective
context which affects correlation results, indicating that the text is too
short to collect relevant data using the correlation coefficient.
10 https://voyant-tools.org/docs/#!/guide/correlations.
212 Lisa Horenberg
These initial results in VT hint at textual features and some of the tool’s
functionalities that call for a closer look. Firstly, as previously mentioned,
VT uses a default Stoplist to filter out words that would typically distort
results, but this list can be customised. The list not only contains words
but also numbers, punctuation, abbreviations and individual letters. This
list of Stopwords may include words that are relevant to the analysis after
all. Whereas words like ‘the’, ‘to’, ‘and’, ‘you’, ‘of’ and ‘he’ might indeed
generally not reveal much about a text, frequent use of words such as
‘whereas’, ‘however’, ‘moreover’, ‘rather’ and ‘whom’ may be stylistically
relevant as they could attest to a certain register (see Chapter 3, this volume,
which demonstrates exactly this for the German language).
Including all words by deactivating the Stoplist puts results in a different
perspective (Figure 11.2). Comparing frequent words with and without
using the Stoplist can cause words that are normally included in the
Stoplist—and so do not show up in initial results—to stand out. Although
most of these words, such as ‘a’, ‘of’, ‘to’, ‘the’ and ‘and’ are irrelevant in
this case, ‘you’ and ‘i’, which occur 42 and 48 times, respectively, are not.
Trends shows two clear peaks for ‘i’ and one peak for ‘you’ (Figure 11.3).
The peak for ‘you’ coincides with the first one for ‘i’ and corresponds
to the dialogue that occurs at the end of the first half of the story. This
is confirmed in the Reader, which also indicates that the second peak of
‘i’ occurs when Loveday answers Angela’s questions. Waugh’s use of ‘i’
and ‘you’ therefore mirrors and draws attention to the fact that Loveday
barely talks about himself in the first part of the story, where he is more
concerned about Lord Moping and Angela, but that the focus in the later
part of the story shifts towards Loveday himself.
In the case of ‘you’ and ‘i’, the required additional context is provided
by other panes. However, isolating words from their context can also be
beneficial, and the removal of capitalisation allows for more objective
interpretations or analyses. Words that are normally capitalised now do
not draw more attention than those that never are, and isolating words
from their context may furthermore invite users to contemplate the
underlying meaning behind them and reconsider their role in the text. For
example, ‘angel’ and ‘love’ stand out more prominently as parts of the
names ‘angela’ and ‘loveday’. This emphasises Angela’s angelic aspect,
referring to her role in the story in the way she steps up and cares for
Loveday’s fate. Similarly, the meaning of ‘loveday’ adds to the irony of the
story given the day of Loveday’s release can be seen as ‘the settlement of
a dispute’ (OED, n.d.) (i.e. the initially friendly rectification of him having
lived as an inmate for too long). These underlying layers cause translation
issues because the meaning behind these names is less obvious if the names
are maintained in the TT. However, attempting to find target language
Figure 11.2 Results of computer-assisted STA in Voyant Tools, Stoplist deactivated. https://voyant-tools.org/docs/#!/guide/about.
Voyant Tools’ Little Outing 213
214 Lisa Horenberg
Figure 11.3 T
rends showing ‘you’ and ‘i’. https://voyant-tools.org/docs/#!/guide/
about.
equivalents can also have undesirable effects as they would domesticate the
story, while it can be argued its English setting is quintessential to the text.
The recurring use of ‘time’, ‘day’ and ‘little’ also requires a closer look.
First, the frequent use of ‘day’ and ‘time’ is interesting given both terms
contain direct or indirect references to times in the past and/or the general
passing of time, which may hint at a central theme in the story. However,
a Context search for ‘time’ and ‘day’ shows little correlation between
the occurrences of the words, nor does Trends show clear similarities or
contrasts between their distribution. Nonetheless, in a more general sense
and with regard to the theme of time, the words likely both refer to the
time Loveday has spent at the asylum and the day of his release. Similarly,
the frequent recurrence of ‘little’ is of course interesting due to the title of
the story. Trends shows the term is used regularly throughout the text, and
Context shows the term is used mostly in an ironic or derogatory sense,
Voyant Tools’ Little Outing 215
for example: ‘there is a very nice little flower garden’, ‘nasty little river’,
and ‘quite a little fortune’ or ‘on our little visit yesterday’. Whereas the
repeating of ‘time’ and ‘day’ refers to the story’s theme, the repeating of
‘little’ contributes to a clear voice. All three words are likely to be deliberate
stylistic features of the text and build up the story, and it can therefore be
argued that consistency in the translation is desirable in this case.
As mentioned earlier, VT not only presents recurring individual words
but also recurring phrases. In this case, there are short phrases of two
words that occur quite frequently, such as ‘mr loveday,’ ‘lady moping’
and ‘he is’. The longest recurring phrases consist of four words, and all
occur twice and seem more significant to the story. Selecting these phrases
then shows users where and how they are used in the text as the phrase
is highlighted in the other panes. This shows all longer repeated phrases
happen in close proximity to each other and even appear within the same
linear segment (e.g. ‘i do feel that’), whereas shorter phrases may occur
throughout the text (e.g. ‘a great deal’). Although these repetitions are
subtle, their close distributional proximity to each other allows for them to
be argued to stylistically foreshadow Loveday repeating himself as well and
committing a second murder, which is why it is most sensible to maintain
these recurring phrases consistently in the translation. Although only
single words and short phrases are repeated in this case, and longer texts
might yield more compelling results, identifying repetitions is important
for translators to make conscious and well-informed decisions about them
as possible stylistic features and how this affects their translation strategy.
11.5 Discussion
With VT’s initial, basic results, the user quickly gets a sense of their text
simply by looking into frequent words and their distribution, which this
case study shows can already reveal quite a bit about a story (i.e. main
characters and themes, sections with direct speech and the plot-twist). VT’s
interface invites users to further explore their data, and having a closer look
at textual features as well as some of the tool’s functionalities (and how
these affect the results) can add nuance to otherwise standardised results.
In this way VT can help translators in the early stage of STA by providing
a general data overview of the quantified text, recognising frequent words
and giving users an idea of the structure of the text and its possible patterns.
The tool guides users to look at their text from a data-driven perspective
that allows them to easily investigate aspects they find interesting, which
can lead to more objective or even completely new insights.
In this case, the qualitative and quantitative approaches of traditional
and computer-assisted STA complement each other well because both
analyses bring forward points of attention that are not immediately picked
up in the other. The traditional STA identifies stylistic representations
216 Lisa Horenberg
11.6 Conclusion
Although not all of VT’s functions are fully employed due to the limitations
and scope of this study, the tool’s varied representations of quantified
textual data give significant insight into the ST. Other benefits include the
fact that VT allows users to explore their text with minimal effort, pointing
out recurring words and phrases that might indicate themes and patterns in
the text, yet offers varied opportunities for more thorough investigations.
As such, the basic but relevant information VT gathers provides a useful
starting point for a translator to explore results and apply their findings to
their translation at their discretion.
VT demonstrates its potential to yield more sophisticated results with
regard to recurring phrases or sentences and other patterns throughout
one or more (longer) texts. Although this chapter covers only the
application of the tool to the preliminary stage of the translation process,
the original research also reflects on using VT’s findings as a reference
during translation and employs two other tools as well. Further research
might elaborate on such literary applications of text-analysis tools and
other types of translation technology. This chapter not only showcases
VT’s benefits, but it also substantiates the expansion of Alcina’s
categorisation of translation technology to include text-analysis tools as
a sixth subcategory and underlines the general relevance of computer-
assisted STA.
References
Alcina, A. (2008). Translation technologies: Scope, tools and resources. Target,
20(1), 79–102. https://doi.org/10.1075/target.20.1.05alc
Bucaria, C. (2008). Dubbing dark humour: A case study in audiovisual translation.
Lodz Papers in Pragmatics, 4(2), 215–40. https://www.degruyter.com/document
/doi/10.2478/v10016-008-0014-2/html?lang=en
Christensen, T.P., Flanagan, M., & Schjoldager, A. (2017). Mapping translation
technology research in translation studies. An introduction to the thematic
section. Hermes, 56, 7–20. https://tidsskrift.dk/her/article/view/97199
218 Lisa Horenberg
Daems, J. (2021). Wat denken literaire vertalers echt over technologie? Filter. https://
www . tijdschrift -filter .nl /webfilter /dossier /literair -vertalen-en-technologie /
januari-2021/wat-denken-literaire-vertalers-echt-over-technologie/
Horenberg, L.W. (2019). Using CAT in literary translation—How tools may
support translators in source-text analysis, translation and retranslation: A case
study of ‘Mr Loveday’s little outing’ [Unpublished Master’s thesis]. Utrecht
University. https://studenttheses.uu.nl/handle/20.500.12932/33956
Jänicke, S., Franzini, G., Cheema, M.F., & Scheuermann, G. (2015). On close and
distant reading in digital humanities: A survey and future challenges. EuroVis
(STARs), 83–103. http://diglib.eg.org/handle/10.2312/eurovisstar.20151113
.083-103
Kenny, D. (1999) CAT tools in an academic environment: What are they good for?
Target, 11(1), 65–82. https://doi.org/10.1075/target.11.1.04ken
Moretti, F. (2000). Conjectures on world literature. New Left Review, 1, 54–68.
https://newleftreview . org / issues / ii1 / articles / franco - moretti - conjectures - on
-world-literature
OED. (n.d.). Loveday. Oxford English dictionary online. https://www.oed.com
Popel, M., Tomkova, M., Tomek, J., Kaiser, Ł., Uszkoreit, J., Bojar, O., &
Žabokrtský, Z. (2020) Transforming machine translation: A deep learning
system reaches news translation quality comparable to human professionals.
Nature Communications, 11. https://doi.org/10.1038/s41467-020-18073-9
Pym, A. (2011). What technology does to translating. Translation and Interpreting
Research, 3(1), 1–9. http://trans-int.org/index.php/transint/article/viewFile/121
/81
Sinclair, S. & Rockwell, G. (2022). Voyant tools (version 2.6.0). http://voyant
-tools.org/
Waugh, E. (1951). Mr Loveday’s little outing. In Work suspended and other
stories: together with Scott-King’s Modern Europe (pp. 7–14). Penguin Books.
Wykes, D. (2016) Evelyn Waugh: A literary life. Springer.
Youdale, R. (2020). Using computers in the translation of literary style: Challenges
and opportunities. Routledge. https://doi.org/10.4324/9780429030345
12 (Re)creating Equivalence of Stylistic
Effect
A Corpus-Aided Methodology
Tereza Šplíchalová
12.1 Introduction
We are living in the digital age, and the translation industry has been revo-
lutionised by digitalisation. While there are many subdomains of transla-
tion and interpreting where technology has become inseparably integrated
into the translator’s everyday agenda, the practice of literary translation
appears to be relatively unaffected by this development. The debate on
‘how’ and potentially ‘whether at all’ literary translators can adopt com-
puter-assisted workflows has not passed unnoticed within academia and
studies focused on different facets of this development are increasing in
number (Rothwell & Svoboda, 2019; Kenny & Winters, 2020; Youdale,
2020; Kolb, 2021; Way, Rothwell, & Youdale, 2023), collectively assert-
ing that the productive integration of technology within literary transla-
tion processes is not at all unfeasible. Along the broad spectrum of MT
systems, CAT tools, and methods stemming from corpus linguistics, this
chapter aligns itself with the latter and introduces a corpus-aided method-
ology intended to explore the particularities of translating literary style, or,
more precisely, the intensional structure of a narrative text. This approach
to style stems from a firm belief that narratology and Translation Studies
(TS) would mutually benefit from some further dialogue. Research in TS
that is primarily preoccupied with narratological issues exists (Slater,
2011; Prince, 2014; Hirsch & Lessinger, 2020), but much is still unex-
plored. One such topic would be the potential application of the theory of
fictional worlds within TS. Some have suggested the possibility (Doležel,
1998; Fořt, 2005), and some have already researched it (Kamenická, 2007;
Loponen, 2009), but overall, it does not seem to be well established yet.
Hence, the aim of this chapter is twofold: (i) to introduce intensional func-
tions—an integral part of the theory of fictional worlds—as possible vehi-
cles of style; and (ii) to investigate to what extent these functions can be
described with the aid of a corpus-aided methodology.
DOI: 10.4324/9781003357391-16
220 Tereza Šplíchalová
12.2 Theoretical Framework
To begin with, the relationship between intensional functions and the
broad concept of style, no matter which of the numerous definitions
one chooses to adopt, should be clarified. The premise is that narrative
texts—or fictional worlds—are purely semantic entities that are defined
by their extensional and intensional structures (Doležel, 1998). According
to Doležel, the extensional structure of a literary text encompasses the
semantic elements, both explicitly and implicitly given, that are not tied to
the exact wording of a world-constructing text and can thus be subjected
to paraphrasing. These can manifest as any constituents of the fictional
world in question, be it the entities that inhabit the said fictional world,
its shapes, or its representations. Conversely, the intensional structure
excludes any paraphrasable elements and firmly connects to the very
texture of the text, upon which intensional functions arise. These manifest
as global regularities of texture that affect the structuring of the fictional
world and represent an indivisible element of the style of the original. To
put it differently, the relationship between extensional and intensional
properties of a narrative text is thus not dissimilar to Frege’s notions of
‘reference’ and ‘sense’ (Doležel, 1979), i.e. the relation between a linguistic
expression and an entity to which it refers. As a simplified case in point,
the mere mention of the second planet from the Sun in a narrative text
would introduce the said fictional entity into a fictional world as a part of
the extensional structure. Its particular form, however, be it ‘a morning
star’, ‘an evening star’, or simply ‘Venus’, would then have the capability
to shape the intensional structure by indexing an additional meaning such
as communicating the time of day without explicitly referring to it or
mentioning it at all.
In line with this theoretical framework, my intention is to approach
style as patterns of choices where the ‘patterns’ manifest as intensional
functions. The very straightforward idea this chapter revolves around is
that both structures—extensional and intensional—should be considered
in the process of translation, provided one wishes to recreate a fictional
world as faithfully as possible. I am aware that there are many strategies
one can employ when translating a narrative text, and that there are many
possible goals one can strive to achieve. Therefore, it needs to be made
clear that this chapter operates under the assumption that ‘to translate a
fictional world faithfully’ can be a viable goal of one’s translation activity.
In practice, this would entail producing a stylistically aware translation
that successfully recreates both the extensional and, where possible,
the intensional structure. Such a translation would then prioritise the
faithfulness to the source—the world-constructing text—over any other
motivations. I recognise that translation does not happen in a vacuum and
such an overly specific translation goal might not translate that well into
(Re)creating Stylistic Effect 221
I share this viewpoint and propose that one of the ways to address the
equivalence of stylistic effect is to look at one’s ST through the lens of
fictional worlds theories; the equivalence of stylistic effect would thus
equate to a successful rendition of a fictional world in another language.
Youdale also emphasises the necessity of uncovering certain ‘“hidden”
layers of information’ (2020, p. 24) that are essential if one wishes to
reverse-engineer the desired effect, which would, in the context of this
chapter, correspond to recovering the fully operative intensional functions.
In TS, this would mean correctly identifying intensional functions in
the ST and making sure they remain operative in the TT as well. While
this chapter only looks at the three functions identified by Doležel—the
naming, saturation, and authentication functions—an unspoken consensus
between researchers interested in fictional worlds theories is that there
are many more intensional functions that are yet to be described. One
of the secondary objectives of employing the corpus-aided methodology
is that it might, in the long run, help identify potential candidate
functions, but it is not a topic that can be discussed within the scope of
this chapter. Instead, the focus is put on the three intensional functions
that are already established. More details on the individual functions
can be found in Doležel’s Heterocosmica (1998), but in a nutshell, the
naming function manifests as a regularity in the use of proper names and
definite descriptions: for instance, a fictional entity can be referred to
by a certain moniker in selected situations only, which can then prompt
the reader to decode the situation accordingly. The saturation function
manifests as a projection of the density of a fictional text onto a fictional
world, which then governs what information is revealed to the reader,
when it is revealed, and in what manner. This can, for example, be an
222 Tereza Šplíchalová
12.3 Methodology
To reiterate, the aim is to explore the possibility of accessing the
intensional structure by means of a corpus-aided methodology, i.e. with
individual narrative texts conceived as micro-sized corpora consisting of
one novel and its translation. Even though such a limited scope largely
defies the underlying principles of corpus linguistics and the basics of
corpus building, it is not an uncommon measure in TS to base one’s
study on a small-scale corpus consisting of only a few narrative texts (e.g.
Wang & Li, 2011; Ji & Oakes, 2012) or one text and its translation (e.g.
Youdale, 2020). This situates corpus-based studies of literary translation
‘at the intersection of corpus linguistics and digital scholarship’ (Zanettin,
2017), allowing us to align the purpose of one’s investigation with either
linguistic or literary objectives. This chapter pursues the latter, and in
order to do so, it makes use of a powerful tool of choice, Sketch Engine
(SkE), a corpus-management system co-developed under the Natural
Language Processing Centre at Masaryk University (Kilgarriff et al.,
2014). Its flagship function would probably be the word-sketch feature
and accompanying word-sketch grammar that provides an at-a-glance
snapshot of the behaviour of a selected word and its collocational patterns
(Kilgarriff et al., 2014, pp. 3–5), but its appeal is also grounded in the
possibility of readily accessing a multitude of corpora and eventually
creating one’s own. SkE is accessible via institutional or inexpensive
private subscription.
When considering methodology and workflow, my two main concerns
were replicability and user-friendliness. After acquiring an electronic copy
of the ST, The Road (2006) by Cormac McCarthy, and the TT, a transla-
tion into Czech by Jiří Hrubý titled Cesta (2008), both texts were stripped
of any non-essential typographical characters, such as the hyphens used
for word breaks at the end of a line, and loaded into open-source aligner
(Re)creating Stylistic Effect 223
12.4 Results
The Road tells a story about a father and a son journeying through a post-
apocalyptic wasteland. This fictional world encompasses two domains—
‘before’ and ‘after’ the apocalypse—and different narrative modes are
used to introduce and establish fictional facts falling under the respective
domains. Initial reading indicates that all three intensional functions are
fully operative, in the sense that the regularity in use creates an additional
layer of meaning for the reader to discover, but revealing their distribution
1 https://youalign.com/.
224 Tereza Šplíchalová
12.4.1 Naming Function
For the sake of brevity, only the results relating to the two main fictional
entities are commented on here.
Visualising the distribution requires two tasks: to identify the noun
phrases used to refer to the two fictional entities, and to review them in
context. Due to the different tagsets SkE uses for different languages, it is
much easier to complete the first task in the Czech corpus since it is pos-
sible to use the wordlist feature and list only masculine, animate nouns.
The English tagset does not have a gender attribute, so going through the
resulting list of nouns would be slightly more time-consuming, but not at
all impossible. An alternative solution which has proved effective is to use
the concordance feature to search for the third-person singular masculine
pronoun ‘he’, keep only the lines that also include a noun, and see what the
pronoun refers to. Although somewhat inelegant, the search yielded sev-
eral promising candidate terms that were then displayed in context via the
concordance feature for the second task. Some of them were discounted,
and the remainder were examined with the aid of the word sketch feature.
The results show that there are three definite descriptions used for each
entity: the first entity is referred to as the man/father/papa, and the second
entity as the boy/child/son. The collocative patterns brought to light via
word sketch show that the descriptions are not assigned haphazardly,
but alternate, in the case of Entity 1, on the basis of the syntactic frame
and/or collocative meaning (Šplíchalová, 2021). This fictional entity is
referred to as the man in third-person narrative and descriptive passages
only, father when addressed in relation to the boy (thus in possessive
structures only), and papa when directly addressed by the boy. In the
case of Entity 2, the choice of definite description depends on the level
of investment expressed through social/affective meaning (Šplíchalová,
2021). The boy occurs in neutral and descriptive passages, the child in
passages highlighting vulnerability, and (his) son in the most tender and
emotionally charged moments.
After establishing that there is a pattern in assigning the definite descrip-
tions, ST and TT lines were compared to reveal that the pattern is dis-
rupted in the TT; the definite descriptions are treated as interchangeable,
and a fourth description, kluk [the lad], is introduced for Entity 2, as in
the following example:
ST: Then he took the boy with him into the woods.
TT: Pak chlapce [the boy] odvedl do lesa.
ST: He didnt [sic] think the boy could travel much more.
TT: Ten kluk [the lad] už zřejmě daleko nedojde.
(McCarthy, 2006, pp. 42, 57; 2008, pp. 41, 55)
These two segments illustrate that the boy can either be translated as
chlapec [the boy] or kluk [BOYCOLLOQ; the lad]. The naming function has
226 Tereza Šplíchalová
not been kept intact, which, in combination with treating the rest of the
definite descriptions interchangeably, renders the function inoperative. It
should be noted that the boy is the most frequent noun phrase that occurs
596 times in the ST. In the TT, chlapec occurs 585 times, while kluk occurs
only 31 times so only in a handful of cases. An interpretation that sug-
gests itself would be Toury’s ‘avoidance of repetition’ (1995), i.e. a transla-
tor’s inadvertent tendency to avoid repeating the same noun over and over
again, albeit to the detriment of the otherwise very symmetrical naming
system introduced in the ST. Despite the low number of cases featuring
kluk, introducing the fourth descriptor and treating all of them as inter-
changeable changes the distribution of the naming function and alters the
intensional meaning of the translated fictional world. Alternatively stated,
it robs the readers of the TT of an extra level of meaning that could have
been communicated by introducing a comparable tripartite system. An
attentive translator-reader might be able to spot the pattern just as easily
without any computer-assisted or corpus-aided analyses, meaning SkE is
not needed to reveal the pattern itself but has been instrumental in survey-
ing the overall distribution of the function. I recognise that identifying the
individual definite descriptions can remain a matter of close reading, but
SkE queries, namely those run through word sketch and parallel concord-
ance, proved useful in illuminating their respective behaviour and differ-
ences in TT renditions.
12.4.2 Saturation Function
The second intensional function arises from the notion that fictional worlds
are, by their very nature, always constructed as incomplete; the distribution
of fictional facts into the domains of the determinate, the indeterminate,
and gaps forms the explicit, implicit, or zero texture (Doležel, 1998). In
principle, this function governs the extent to which the fictional world is
‘saturated’ with information and in what manner this is presented to the
reader: either explicitly, implicitly, or not at all. The latter represents an
enticing topic for both narratology and TS and generally refers to pieces
of information the narrative does not provide and leaves up to the reader’s
imagination. Doležel refers to the domain conditioned by zero texture
as ‘gaps’, but it is not uncommon to encounter different terminology
denoting the same concept, such as ‘empty spaces’ or ‘silences of the story’
(Lessinger, 2018).
In terms of relevance to TS, there is more to focus on than just the
narrative gaps. For instance, Kamenická (2007) shows that the saturation
function can be explored against the backdrop of the explicitation
hypotheses to establish a translator’s explicitation profile, but the shifts
between the domains of the determinate and indeterminate can also be
interpreted with the text’s global texture in mind. This would align well
(Re)creating Stylistic Effect 227
with the findings of Hirsch and Lessinger (2020, p. 756), proposing that in
certain cases, ‘the linguistic explicitation corresponds to an implicitation of
one of the narrative layers of the story, and of the text’s global meaning’.
They illustrate this proposition in situations where making something more
explicit, for instance, a subtle clue, or any part of a text where obscurity
or sense of mystery is valued, intrinsically affects the global meaning of the
text and renders it more implicit. An example would be a series of hints or
clues that the reader is meant to uncover at certain points in a narrative; if
explicitated in the TT, this ‘guessing game’ is not mediated for the readers
of the TL and the resulting structure, with a layer of intensional meaning
missing, can be thus understood as globally more implicit.
With respect to these two approaches, the first logical step would be to
see whether SkE can be instrumental in identifying instances of explicita-
tion. The aligned layout and powerful filtering options of parallel concord-
ance provide an ideal environment for close reading, but a more detailed
query can be devised as well. In line with the findings reported on the use
of names and definite descriptions in The Road, a possible area of inter-
est could reside in the category of changes of logical cohesive ties, which
includes the replacement of personal pronouns with proper names (Blum-
Kulka, 1986, quoted in Hirsch and Lessinger (2020, p. 756)). As there are
virtually no proper names in The Road, I propose that replacing a personal
pronoun with a definite description of choice can, in the context of this
chapter, be considered within the same framework. If so, it is theoreti-
cally possible to put together a simple query in Corpus Query Language
(CQL) and search for pronouns in the ST that are tagged as nouns in the
TT corpus, but I do not want to assume that all literary translators and
researchers are well-versed in CQL building. Having said that, SkE offers
a basic query that allows you to search for a specific example, such as the
pronoun ‘he’ translated as ‘muž’ [the man]. In the case of The Road, the
search yields 113 results of the type in the next example:
ST: He pulled the boy closer.
TT: Muž [the man] k sobě přitáhl chlapce ještě blíž.
(McCarthy, 2006, p. 9; 2008, p. 10)
This might be an interesting option if one wishes to search for lines that
might include an example of explicitation, but I am hesitant to conclude
that occurrences such as these definitively alter an intensional function.
The use of anaphoric/cataphoric referencing in translation does impel the
translator to choose a suitable substitution or drop a definite description,
but the overall impact on the texture seems to be rather trivial. However,
with regard to the global meaning of the text, the same phenomenon can
have much greater impact. There is a short passage in The Road that
describes an encounter with a hostile road rat. A suspenseful skirmish
228 Tereza Šplíchalová
ensues, culminating in the man shooting the road rat dead. The confronta-
tion begins as follows:
ST: The man had already dropped to the ground and he swung
with him and leveled the pistol and fired from a two-handed
position balanced on both knees at a distance of six feet. The
man fell back instantly and lay with blood bubbling from the
hole in his forehead.
TT: To už se otec [the father] bleskurychle přikrčil; revolver
držel oběma rukama a vkleče vystřelil z necelých dvou metrů.
Chlap [the guy] okamžitě padl na záda; z díry na čele mu bub-
lala krev.
(McCarthy, 2006, p. 40; 2008, p. 39)
ST and TT. Upon further inspection, it becomes clear that the translator
favoured a domestication strategy and transferred the units of distance
from the imperial system to the metric system. The values were in most
cases re-calculated, as can be seen in the following excerpt:
ST: It looks to be about eight miles or so.
TT: Bude to tak třináct kilometrů. [thirteen kilometres]
(McCarthy, 2006, p. 26; 2008, p. 26)
12.4.3 Authentication Function
12.5 Conclusion
This chapter demonstrates that approaching intensional functions as
vehicles of style creates many opportunities and poses many challenges.
(Re)creating Stylistic Effect 231
Due to the scope of this chapter, many other issues were not addressed,
and it is thus advisable to view the findings presented as a cautious survey
of the intersection of the disciplines involved rather than a comprehensive
overview. The proposed methodology relies on a close-reading analysis
supported by, not entirely based upon, a distant-reading approach put into
practice via Sketch Engine. The objective was to see whether it is possible
to make use of corpus-aided methods to uncover differences in the ST and
TT intensional functions, and the results are, to an extent, affirmative.
In a general sense, querying the one-novel corpus appears to be the most
advantageous if one works towards a verification of hypotheses based on a
close-reading analysis. This creates a general framework of what to look for
and what software functionalities to use but leaves enough space to discover
the unexpected. The targeted queries in parallel concordance, for example,
quickly reveal inconsistencies in terminology that can be interpreted as
causal agents of changes and alterations of intensional meaning. The rest
of the features mentioned above (i.e. word sketch, wordlist, and keywords)
are indeed instrumental, but highlight the importance of knowing what
to look for beforehand. It does not need to be understood as a downside,
though; this reflection consolidates the usefulness of a method combining
close and distant reading by proposing that any hypotheses formed by
the act of reading can be subsequently confirmed or disproved in Sketch
Engine. The exception would be the naming function, given that an SkE
query can, albeit somewhat inelegantly, reveal the definite descriptions in
use and show the distribution of the naming function without having to
search for the individual elements well in advance. It has been suggested
that the naming function is perhaps the most suitable for this type of
investigation, and the results confirm this. By running a set of searches
in SkE, one can effectively identify the definite descriptions used for
different fictional entities, obtain an overview of their distribution with an
aligned list of their renditions in the TT, and gain a deeper understanding
of their collocative patterns. Regarding the second intensional function,
the saturation function, the SkE queries seem to be usable only with a
prior knowledge of what to investigate. As discussed earlier, the saturation
function can be linked to a so-called translation universal, namely the
explicitation theory, and processed accordingly. Lastly, recovering the
distribution of the authentication function in both ST and TT is a task that
benefits the most from a close-reading analysis, but cross-referencing one’s
findings in SkE can eventually help illuminate the shifts affecting certain
narrative modes. All in all, having an aligned text with the possibility of
zooming in on any interesting shifts or occurrences helped to shed some
light on the differences in ST and TT intensional structure.
My aim has been to present a method that would be, albeit fractionally,
applicable to literary translation practice. With some reservations detailed
232 Tereza Šplíchalová
References
Doležel, L. (1979). Extensional and intensional narrative worlds. Poetics, 8(1–2),
193–211. https://doi.org/10.1016/0304-422x(79)90020-2
Doležel, L. (1998). Heterocosmica: Fiction and possible worlds. Johns Hopkins
University Press.
Fořt, B. (2005). Úvod do sémantiky fikčních světů [An introduction to semantics
of fictional worlds]. Host.
Hirsch, G., & Lessinger, E. (2020). Obscuring the speaker’s stance: When
explicitating results in implicitation. Meta, 65(3), 745–765. https://doi.org/10
.7202/1077412ar
Ji, M., & Oakes, M. P. (2012). A corpus study of early English translations of Cao
Xueqin’s Hongloumeng. Quantitative Methods in Corpus-Based Translation
Studies, 177–208. https://doi.org/10.1075/scl.51.07ji
Kamenická, R. (2007). Dreaming the original: Original and translation as
two different fictional worlds. In K. Vránková & C. E. Koy (Eds.), Dream,
imagination and reality in literature (pp. 15–21). Jihočeská univerzita v Českých
Budějovicích. https://www.pf.jcu.cz/stru/katedry/aj/doc/sbaas01-kamenicka.pdf
Kenny, D., & Winters, M. (2020). Machine translation, ethics and the literary
translator’s voice. Translation Spaces, 9(1), 123–149. https://doi.org/10.1075
/ts.00024.ken
Kilgarriff, A., Baisa, V., Bušta, J., Jakubíček, M., Kovář, V., Michelfeit, J., Rychlý,
P., & Suchomel, V. (2014). The sketch engine: Ten years on. Lexicography,
1(1), 7–36. https://doi.org/10.1007/s40607-014-0009-9
Kolb, W. (2021, May 13). Literary translation and post-editing: Priming effects
and engagement with the text [Conference session]. Computer-Assisted Literary
Translation Conference, Swansea. https://docs.google.com/document/d/e
/2PACX-1vRWAAgOpMCWqGniXJAULzCra1e4We2U7lQj53e7qbKNIgH
vjLGmHrwlsqbqCJCQIwJ11tztjQVmXMu6/pub
Lessinger, E. (2018). Genesis of a poetics of silence. Sillages Critiques, 25. https://
doi.org/10.4000/sillagescritiques.7421
Loponen, M. (2009). Translating irrealia – Creating a semiotic framework for
the translation of fictional cultures. Chinese Semiotic Studies, 2(1), 165–175.
https://doi.org/10.1515/css-2009-0117
McCarthy, C. (2006). The Road. Picador.
(Re)creating Stylistic Effect 233
McCarthy, C. (2008). Cesta (J. Hrubý, Trans.). Argo (Original work published
2006).
Prince, G. (2014). Narratology and translation. Language and Literature:
International Journal of Stylistics, 23(1), 23–31. https://doi.org/10.1177
/0963947013510647
Rothwell, A., & Svoboda, T. (2019). Tracking translator training in tools and
technologies: Findings of the EMT survey 2017. Journal of Specialised
Translation, 32, 26–60. https://www.jostrans.org/issue32/art_rothwell.php
Slater, C. (2011). Location, location, translation: Mapping voice in translated
storyworlds. Storyworlds: A Journal of Narrative Studies, 3, 93–115. https://
doi.org/10.5250/storyworlds.3.2011.0093
Šplíchalová, T. (2021). On translatability of an intensional function within computer-
assisted literary translation. The ESSE Messenger, 30(2), 99–107. https://
essenglish.org/messenger/wp-content/uploads/sites/2/2022/02/splichalova.pdf
Toury, G. (1995). Descriptive translation studies—And beyond. John Benjamins.
https://doi.org/10.1075/btl.4
Wang, Q., & Li, D. (2011). Looking for translator’s fingerprints: A corpus-based
study on Chinese translations of Ulysses. Literary and Linguistic Computing,
27(1), 81–93. https://doi.org/10.1093/llc/fqr039
Way, A., Rothwell, A., & Youdale, R. (2023). Why literary translators should
embrace translation technology. Tradumática.
Youdale, R. (2020). Using computers in the translation of literary style. Routledge.
https://doi.org/10.4324/9780429030345
Zanettin, F. (2017). Issues in computer-assisted literary translation studies.
inTRAlinea Special Issue: Corpora and Literary Translation. https://www
.intralinea.org/specials/article/2254
Part 4
13 The Experiment
Avraham J. Roos
13.1 Introduction
In 1997, Douglas R. Hofstadter published his book Le Ton Beau de Marot
(Hofstadter, 1997) with multiple English (re)translations of a minor
French poem by the Renaissance poet Clement Marot. These translations
were created by Hofstadter’s friends, students, colleagues, family, and
noted poets and translators (not all Francophone) who had been given
instructions by Hofstadter concerning the literary constraints of the original
French poem and a literal translation into English. His book explores the
meaning, strengths, failures, and beauty of translation.
This chapter describes a similar experiment, conducted at the May
CALT 2019 conference at Swansea University. The aim was to see if
through computer-assisted literary translation tools, participants with no
prior knowledge of Hebrew and minimal understanding of the biblical
and mystical background of a short Hebrew poem, would be able to
collaboratively construct an English rendering of it, in verse, following the
constraints of the original rhyming scheme.
During the workshop, the 30 (mostly) non-Jewish participants were
instructed to ‘translate’ into English an 8-line poem in Biblical Hebrew,
created by the 11th-century French Talmudist R. Yosef Tov Elem Bonfils
(Guggenheimer, 1995). The poem uses a simple AAAA BBBB rhyming
scheme and participants were asked to create a parallel English text using
an identical scheme. Given a corpus of prose translations and digital tools,
participants (individually or collaboratively) not only retranslated these
prose translations into an English-versed version following the original
rhyming scheme, but some also produced a Polish poem and even a
Chinese one (that followed the rules of classical Chinese poetry—each line
with exactly five characters).
This chapter describes the corpus, tools, process, and some of the final
products. It also demonstrates how Google tools were used to create,
record, and later analyse the collaborative attempt. Several of the key
DOI: 10.4324/9781003357391-18
238 Avraham J. Roos
13.2.1 The Poem
Table 13.1 presents the Hebrew source text and its transliteration. Hebrew
is dextrosinistral (reads from right-to-left) and simple character compari-
son will reveal, even to non-Hebrew readers, that the last two characters on
the left of the first four lines are the same, as are the last two characters on
the left of each line in the second strophe. This indicates the rhyming pat-
tern AAAA BBBB as can also be discerned from the transliteration on the
left. However, most translators, even those who were professional poets,
opted for a prose translation as is shown below in Table 13.2. Even the
ones that did create versed translations used a different rhyming pattern.
My digitized PhD corpus of over a hundred variant Haggadah translations
dating from 1770 to now, includes only 16 such versed translations. This
raises the question why.
1808 Sevice for the First Two Nights #516 Carefully revised and First revision of David Levi’s Hyam Barnett, London, UK
of Passover corrected by Isaac translation by Isaac Levi
Levi
1914 Hagadah for Passover Service #2618 Not listed M. Lehberger & Co.
for the first two nights of Rȍdelheim, Germany
Passover
1928 Hagada. The narrative of #3149 J.D. Eisenstein With illustrations by Lola New York, USA
Israel’s redemption from
Egypt
1934 The HAGGADAH #3482 Cecil Roth Illustrations Donia Nachshen Soncino Press, London, UK
1936 The Haggadah for Children #3593 Rabbi J. P. Rudin Bloch Publishing Company,
New York, USA
1950 Israel Passover Haggadah #4187 A. Greenbaum With commentary by M. American Biblical Encyclopedia
M. Kasher and lengthy Society, New York, USA
introduction by translator
1951 The Haggadah of Passover #4280 Isidore Fishman Edward Goldston & Son,
London, UK
1953 Birnbaum Haggadah #4345 Paltiel/ Philip Montauk Book Mfg. Co, New
Birnbaum York, USA
2012 The New American Haggadah NA Nathan Englander Little, Brown and Company,
USA
The Experiment 241
13.3 The Experiment
After a short introduction about the Haggadah, the poem, and a discus-
sion on the form vs meaning controversy, the 30 workshop participants
from both Anglo-Saxon countries (GB, USA, Canada) and non-Anglo-
phone countries (e.g. Portugal, The Netherlands, Japan, China, Poland),
were challenged to create a versed AAAA BBBB retranslation of this poem.
They were given the choice to either translate the poem into their native
tongue or into English, doing so either individually or collaboratively. A
shared Google Document was provided for collaborative work on which
participants could share their attempts, comment on the draft translation,
and edit it together.
The historical English prose translations from Table 13.2 were provided in
digital form as a reference frame for the participants. This was especially
helpful as almost none of the participants could read, let alone understand,
the original Hebrew text. In addition to these 11 English translations from
1770–2012, participants were offered prose translations in 13 other lan-
guages so that each workshop participant had at least one translation in
their native tongue. There were translations in Arabic, Chinese, Dutch
(3), French (3), German (4), Greek, Hungarian (3), Italian (2), Japanese,
1 Referenced at the end of this chapter. These translations were chosen by the author from a
pool of 106 English translations to represent a variety of translation options from a large
time span and from three different countries (UK, USA, and Germany).
242 Avraham J. Roos
Polish (2), Portuguese, Russian (4), and Spanish (4). These translations
were copied from other language Haggadah translations collected by the
author. This corpus of 41 variant translations was uploaded to a desig-
nated Google Sites webpage2 created by the author.
In addition to the corpus, various digital tools were made available to
the workshop participants, some online, others downloadable. The links
to these tools were provided on the aforementioned web page. The online
tools included Thesaurus3 and Reverse Lookup,4 which were aimed at
helping writers find the right word they were looking for. The Reverse
Lookup tool in particular was deemed useful for writers who have the word
on the tip of their tongue but cannot recall it, as typing in a description
would often lead to the word.
To support the workshop assignment of creating a rhyming version of
the translation, a number of rhyming tools were added, including Rhyme
Zone,5 B-Rhymes,6 and Rhymes.7 RhymeZone provides a list of real
English-language rhymes, while B-Rhymes offers near rhymes and Rhymes
allows one to find rhyming words in languages other than English, espe-
cially useful for the multi-lingual workshop participants. An online sylla-
ble counter, HowManySyllables,8 was added to help the participants with
meter and scanning.
In addition to the online tools, links were provided to three download-
able poetry analysis tools: Poemage,9 VersePerfect,10 and RhymeDesign.11
These tools were chosen for their unique features. For example, Poemage
and RhymeDesign provide a visual representation of the complex
sonic devices in a poem, thereby offering a new way to analyse poetry.
Meanwhile, VersePerfect offers a comprehensive toolbox for poets with
a built-in thesaurus, rhyming suggestions as you type, and links to quotes
from famous poems that use the rhyming words being used.
Participants were given 30 minutes to create their translations, armed
with both the corpus and the digital tools. Unfortunately, some of these
tools were not used by the participants, probably due to time constraints,
as will be discussed below.
2 https://sites.google.com/view/calt-2019-swansea-roos/home.
3 www.thesaurus.com.
4 www.onelook.com/thesaurus.
5 www.rhymezone.com.
6 www.b-rhymes.com.
7 www.rhymes.com.
8 www.howmanysyllables.com/syllable_counter.
9 www.sci.utah.edu/~nmccurdy/Poemage.
10 https://verseperfect.en.softonic.com/download.
11 www.sci.utah.edu/~nmccurdy/rhymeDesign.
The Experiment 243
13.4 The Results
Eight individual English translations were created, one of these being a chain
translation via Portuguese. Only two of these actually followed the guide-
lines of an 8-line poem with rhyming scheme AAAA BBBB (more will be said
about these in 13.4.1). Four non-English individual versions were created,
two in Chinese, one in Portuguese, and one in Polish. The only submission in
this group that followed the guidelines was the Polish one (more about these
versions in 13.4.2 below). Seven participants collaborated on creating a joint
version, which will be discussed in 13.4.3 and in 13.5.
During the concluding discussion of the workshop, most participants
expressed pride in their creations but complained that 30 minutes was not
enough to accomplish the mission to their complete satisfaction. They all
stated that the corpus and tools helped them and that they enjoyed the
experience. We will now look at the individual submissions before focus-
ing on the collaborative project.
I
Passover Seder is here
According to laws and customs we adhere
As we have permitted the uprear
Of the performance that is premier
Oh, Adonai,
Raise up thy
To the place we lie,
Zion, we sing of the nearby.
II
We've done the Seder with all the dishes
Did it like the good Lord wishes
What a blessing to perform it this is
We'll be sure next year not to be remiss-ish
G-d sitting up there in the sky
See your people multiply
Raise us up like flowers growing high
Heading for Israel the saints go by
Both I and II use the word ‘Seder’ which is the Hebrew word for the
Passover ceremony and is used in the original Hebrew text. This kind of
244 Avraham J. Roos
III
Porządek uroczystości już spełniony
według przepisów i zwyczajów skończony.
Oby szczęśliwie był powtórzony,
tak jak dziś był wypełniony!
Najczystszy, królujący na wysokości!
Podnieś Swe zgromadzenie z nicości,
zaprowadź Swe plemię ku Syjonu jasności
ku zbawieniu w tryumfie przyszłości!
This Polish poem not only follows the rhyming scheme of the original
(AAAA BBBB), but also stays very close to the meaning of the original
Hebrew text, thus making it a perfect example of both form and meaning
translation. The English translation of the poem reads:
During the closing discussion of the workshop, one of the non-native speak-
ers, Song Runjuan (School of Foreign Studies of Hebei Normal University,
China), expressed dismay over the instructions that the rhyming scheme of
the original had to be used. She claimed that translation is not just trans-
ferring a text from one language to another, but also from one culture to
another. Whereas the AAAA BBBB rhyming scheme might be suitable for
Hebrew and English, Chinese classical poetry has its own rules—each line
needs to have exactly five characters. I asked her if she could create such a
poem as a translation for the Hebrew one. Two weeks later she sent me the
following by email (presented here with her permission):
节庆家宴闭
吾辈遵父言
今年何其幸
来年续此福
高位圣神灵
创世及世人
望您予关怀
世人永欢唱
13.4.3 Insights—Collaborative Attempt
have the latest version to work on, that collaborators can work whenever
and wherever they want, and that the same document is available through
various browsers and operating systems. A disadvantage sometimes sug-
gested is that a team member might accidentally delete something from
the document for all. However, the platform keeps track of all changes
and earlier versions/change histories are easily accessible. This important
feature is further discussed in 13.5.
The collaborative poem thus created was:
IV
A subpar Haggadah?!
A so-la-la Haggadah?!
A blahdiblah Haggadah?!
The Passover rites have been completed
We knew the rules while we were seated
Just as by their order, we’ve not been defeated
May also in future this ceremony be oft repeated.
Oh Lord who is our one true Sire
To be one people we do aspire
Raise us up the uncounted, we are your choir!
Singing ‘Zion is free’, will never us tire
The use of the term ‘rites’ in this version is interesting as none of the his-
torical versions in the corpus used it. It is, however, one of the suggestions
on Thesaurus.com as a replacement for ceremony, which is used in the
corpus participants had access to (see Table 13.2). This version also uses
the word ‘order’ in line three like many late 20th-century translations.
The poem refrains from referring to the request that the Jewish People be
returned to Israel in the last two verses, and this may be a political decision
by the participants. On the other hand, the allusion to the Divine promise
to Abraham that his offspring will be uncountable like the stars in the sky
(Gen. 15:5), is correctly incorporated in this poem, the only poem to do so.
Three playful titles were added to this poem, maybe the participants did
not have time to decide which to choose.
On the whole, this collaborative poem has the best flow and best con-
veys the general message of the original. The joint effort, through the
digital use of Google Documents, seemed to have paid off, and the par-
ticipants expressed great satisfaction with the co-operation, but the real
results for the researcher of technology in literary translations lie in find-
ing out what went on behind the scenes during the process of the crea-
tion, and this can be done using Google document features and add-ons.
248 Avraham J. Roos
12 https://chrome.google.com/webstore/detail/draftback/nnajoiemfpldioamchanognpjmocg-
kbg.
The Experiment 249
which set the rhyme for strophe one from the onset. The first strophe
was considered done within 6.5 minutes after which the participants
shifted their attention to the second strophe. Line one of strophe one
never changed throughout the process and made it into the final version,
in contrast to the struggle with the first line of the second strophe. That
line went through several changes getting its final form only 8 minutes
before the deadline:
13.6 Discussion
Song Runjuan’s claim that cultural considerations should be taken into
account when translating a poem certainly holds true when translating
a medieval Hebrew poem into classical Chinese. One could argue that it
would make sense for English translators to translate this Hebrew AAAA
BBBB poem into the rhyming pattern of a classical English sonnet (ABAB
CDCD EFEF GG), in which case more lines than the original eight would
be required. Haggadah translator Maurice Myers came closest to this in
his ‘translation in prose and verse’ (Myers, 1920, p. 67), the very first
attempt to create a versed translation of this poem, but as it was not a real
sonnet, one must question his reasons for neglecting the original’s rhyming
scheme:
13.6.1 Collaborative Translation
13.6.2 Group Dynamics
When people are working in a group setting, they often take on distinct
roles and behaviours, as already described by Lewin in the late 1940s. He
stressed the importance of leadership roles, the self-value of team members,
and the need for a supportive community (Lewin, 1947). An interesting
observation gained through Draftback is that there was a clear difference
in the way native and non-native speakers offered their input. The majority
of non-native speakers hedged their suggestions with a question mark or an
explanatory comment. They were less prone to edit the main text directly
in comparison to the native speakers who often edited the text without
hedging. This was either due to lower levels of confidence in their language
proficiency or to cultural differences. It was mostly the Asian participants
who were hesitant to make the changes themselves. Still, the fact that these
non-native speakers participated shows they felt sufficiently confident as
valuable team members, which in turn reinforces the idea that a supportive
community is key for collaborative work. Indeed, in most cases, their
suggestions were accepted by the other team members. Team members
both revised their own contributions (co-operation) or took responsibility
for the text as a whole (collaboration).
It is interesting to note how these digital tools which were designed for
monolingual collaboration can be made to serve collaborative translation.
13.7 Conclusions
To sum up, notwithstanding the fact that none of the Haggadah transla-
tors used the original AAAA BBBB rhyming scheme to create an English
13 Starting with the Septuagint, most Bible translations were created by groups of Bible
scholars due to the scope of the project and in order to avoid bias. A well-known example
is the King James Bible.
The Experiment 253
13.8 Epilogue
Since this chapter was first drafted, a new player has entered the scene. In
recent years, the field of artificial intelligence (AI) has undergone tremen-
dous progress, with AI now making inroads into various aspects of human
life, including language processing. Among the many AI applications that
have emerged, at the time of writing, ChatGPT stands out as a state-of-the-
art natural language processing (NLP) chatbot developed by OpenAI (see
254 Avraham J. Roos
Section I.2, Introduction, and Section 8.4, both in this volume, for more
discussion on the capabilities of these tools). Through its use of deep learn-
ing and cutting-edge technologies, ChatGPT can engage people in natural
conversations and generate text that closely mimics human-written text
in a variety of styles and formats, including rudimentary poetry and song
lyrics.
Despite previous attempts to generate poetry using machines, includ-
ing as far back as 1845, creating poetry with a specific metre, such
as rhyme, has remained a challenge. However, the latest update of
OpenAI's GPT-3 feels like a step forward in complexity that comes from
integrating knowledge about a wide variety of subjects and styles into
one model that writes coherent text (and GPT-4 has just come out in
March 2023). In November 2022, OpenAI released a new model in the
GPT series that has rhyming capabilities previous GPT models did not
have. As a follow-up to the experiment in this chapter, ChatGPT was
asked to produce an AAAA BBBB rendering of the above-mentioned
poem. This exposed its limitations in following the AAAA BBBB rhym-
ing scheme.
Below are some of its initial attempts:
Although these were interesting, they did not comply with the guidelines
provided. A final attempt was to make ChatGPT a member in a collabora-
tive attempt. The beginning was promising:
The Experiment 255
References
Alfer, A. (2017). Entering the Translab. Translation as collaboration, collaboration
as translation, and the third space of ‘translaboration’. Translation and
Translanguaging in Multilingual Contexts, 3(3), 275–290. https://doi.org/10
.1075/ttmc.3.3.01alf
Fauconnier, G., & Turner, M. (1998). Conceptual integration networks. Cognitive
Science: A Multidisciplinary Journal, 22(2), 133–187. https://doi.org/10.1207/
s15516709cog2202_1
Gigenheimer, H. (1995). The scholar’s Haggadah. Jason Aronson.
Hofstadter, D. R. (1997). Le ton beau de Marot. BasicBook.
Holmes, J. (1972). The cross-temporal factor in verse translation. Translator’s
Journal, 17(2), 102–110. https://www.erudit.org/en/journals/meta/1972-v17
-n2-meta256/003078ar.pdf
Huss, J. T. (2018). Collaborative translation. In K. Washbourne & B. Van
Wyke (Eds.), The Routledge handbook of literary translation (pp. 448–467).
Routledge. https://www.academia.edu/37517705/Collaborative_Translation
Kahn, A.D. (1994). The chosen nation Haggadah. CFR Publications.
Lefevere, A. (1975). Translating poetry: Seven strategies and a blueprint. Van
Gorcum.
Lewin, K. (1947). Frontiers in group dynamics. Human Relations, 1(1), 5–41.
https://doi.org/10.1177/001872674700100103
Liberman, M. (2009). Rhymes. Language Log, 8 Dec. 2009. https://languagelog
.ldc.upenn.edu/nll/?p=1946.
Myers, M. (1920). The passover Hagadah. Geographia.
Risku, H., & Windhager, F. (2013). Extended translation: A sociocognitive research
agenda. Target, 25(1), 33–45. https://doi.org/10.1075/target.25.1.04ris
The Experiment 257
Primary Sources
Below is a list of all the Haggadot presented in Table 13.2, ordered by date.
The reason for this organization is that many do not include an author,
editor, or name and most use the same title. It was, therefore, deemed more
effective to order them by their most distinctive feature, the publication
date, especially as Table 13.2 presents them this way. The Yudlov numbers
refer to a comprehensive Haggadah bibliography published by the Hebrew
University, Israel (Yudlov, 1997).
14 Augmenting and Informing the
Translation Process through
Workflow-Enabled CALT Tools
Sasha Mile Rudan, Eugenia Kelbert,
Lazar Kovacevic, Matthew Reynolds, and
Sinisha Rudan
14.1 Introduction
CALT is an emerging field in the process of defining itself, but it is already,
we feel, a misnomer. Adding an ‘L’ (Literary) to the pre-existing notion
of Computer-Assisted Translation (CAT) creates continuity with CAT
research; Machine Translation (MT) is another key technological precur-
sor (cf. Hadley et al., 2022). However, by taking this course of action the
field is perhaps doing itself a disservice in overlooking the potential of
the ‘A’ (or assistance) in a literary context where CALT can qualitatively
shape a translator’s work rather than merely allow a translator to perform
a given task faster and more efficiently. In other words, the ‘A’ in CALT
should ideally stand for translator augmentation (see Lommel, 2018), and
we explain this in what follows.
CAT and MT may have changed the experience of many users, but they
still receive mixed reviews from some professional translators (Penkale &
Way, 2013; Way, 2013). The current placement of CALT within the same
paradigm is one reason why CALT tools are perceived as inhibiting the
creativity and freedom of the literary translator (Russo, 2022, pp. 30–31).
Additionally, they are associated with cognitive friction together with the
inability to freely move the translation across various tools of the potential
computer-assisted toolkit, which gives rise to the (largely correct) assump-
tion that literary translators are generally sceptical when it comes to trans-
lation technology (Russo, 2022, p. 26).
This chapter seeks to look beyond the tradition where technical accuracy
has come to define, yet limit, the field’s ambitions. It thus reconsiders the
potential of technology in literary translation as a shaping factor that can
augment translation practice, in close dialogue with advances in translation
theory. Today, most translators use non-CAT-specific technology on a daily
basis (Russo, 2022, p. 28) in ways that anticipate the potential of CALT to
go, with the help of CAT/MT, beyond a focus on efficiency or outsourcing
towards controlled shifts to the translation process. A modern CALT tool,
we argue, must embrace a wider definition of technology that has hitherto
DOI: 10.4324/9781003357391-19
Augmenting and Informing 259
1 https://prismaticjaneeyre.org/.
260 Sasha Mile Rudan et al.
This introduction addresses the motivation for our work and establishes
our research questions. Section 14.2 considers the three most important
(for our work) constituents of the act of translation: translation theories,
actors, and workflows. In Section 14.3 we discuss the translator’s augmen-
tation through CALT together with how they are guided through criti-
cal points in the translation process (in canonical settings where previous
translations or literary scholarship help identify important aspects of a
text and the translation process). Having established our motivation, con-
text and informing strategies, and demonstrated the principles through
our CALT toolbelt (named WriTerra), in Section 14.4 we set out the key
aspects of infrastructural change required to accommodate the CALT par-
adigm shift (and address the CALT design challenges) which, together with
the informing strategies, constitutes the main contribution of this chapter.
Section 14.5 presents in greater detail the current implementation of our
CALT tool. It provides a list of the tool’s features and stands as an impor-
tant reference for the components mentioned throughout this chapter.
Finally, we conclude the chapter in Section 14.6.
Augmenting and Informing 261
14.2 CALT in Context
14.2.1 Translation Theories
2 Bukvik is a beech grove in Serbian; it is also a derivative of the Russian bukva, meaning a
letter, and a rather obvious pun on the English word book.
262 Sasha Mile Rudan et al.
Figure 14.1 A
n example of augmenting translators using CALT with Bukvik’s etymological layer to permit comparison of the current
text with other texts and help improve lexical selection.
Augmenting and Informing 263
3 http://icame.uib.no/brown/bcm.html.
264 Sasha Mile Rudan et al.
14.2.3 Translation Workflows
One key to a successful CALT tool involves enhanced integration with the
real-life (natural) translation process of a translator, rather than ‘bending’
the translator to conform to a CALT-enforced translation process. This is
challenging primarily because the real-life workflows of literary translators
are as varied as they are understudied. Existing research focuses, with the
rare exception of Borg’s in-depth case study, on short passages (Borg, 2022,
p. 30). Yet we know from a flood of anecdotal evidence—translators’ own
accounts of the processes they follow—that variation from one translator
to another can be striking (Rossi, 2018). Some start, like Borg’s subject
Aquilina, by extensive research and familiarisation with the source text
while others prefer to read as they translate. Some translators use word
processing throughout while others, like Aquilina, write their first draft
with pen and paper and type it up afterwards. Some systematically
compare their solutions to previous translations of the same work (see e.g.
Youdale & Rothwell, 2022), while others deliberately avoid consulting
them. Some (like Aquilina) mostly stick to a well-established routine from
one translation to another, whereas others go with the flow.
Consider the simple workflow example in Figure 14.3 showing how
CALT fits in with real-world translation processes today, presented by
Schiaffino (2016). This reasonably straightforward process focuses primar-
ily on segmentation and assumes a completely linear workflow between
draft and revision with only one choice to make. On the one hand, even
though the workflow represents a fairly simple translation process, it still
introduces significant friction as datasets must be exported (T3) and even-
tually reimported (T5) for revision elsewhere, while on the other hand, it
is easy to imagine that a more realistic translation process would require
multiple translation drafts and therefore multiple reiterations over friction
tasks in the workflow (T3 and T5).
266 Sasha Mile Rudan et al.
Figure 14.3 A
n example of a CALT creative-translation workflow, adapted from
the full workflow proposed by Schiaffino (2016).
5 https://wordnet.princeton.edu/.
270 Sasha Mile Rudan et al.
Figure 14.5 (a), (b) and (c) Using the WriTerra CALT toolset with VTUs.
Augmenting and Informing 271
Alongside informing the translation process, the CALT tools discussed here
have the capacity to involve additional stakeholders, notably researchers,
to guide the translator. The versatility of the workflow model means that
most of the toolkit’s components can be used for textual analysis (this
is especially the case for Bukvik and Annotata), which can then be fed
back into the CALT toolkit. One prominent example of this approach
is the work on keywords conducted as part of the Prismatic Jane Eyre
project. Several keywords were selected and voted on for closer study by
the project team and their renderings in the published translations of Jane
Eyre into 26 languages were then traced (see Rudan et al., 2021a, pp. 3–4
for the relevant workflow). Once the ST and a number of TTs are cross-
annotated in this way through LitTerra, this knowledge (together with the
scholarly commentaries and essays on relevant topics) can be used in a
number of CALT settings, from retranslations of the novel (the translator
can be alerted to the keywords and the complexities of their translations
discovered by researchers) to translator training on the corpus of the
annotated translations, and finally to training users on the CALT toolkit.
14.4.1 Modular Tool
sort of FMR). Equally, the authors had similar issues testing their algo-
rithm with real translators (hence their use of a simulation against the pre-
translated DGT-TM corpus).
This is a clear example which calls for democratising CALT tools to
thus enable: (i) creative translators to embrace the latest CALT features;
(ii) researchers and developers interested in a particular aspect of a CALT
tool to test it in a real CALT environment; and (iii) the provision of a freely
accessible version of a CALT tool.
Primarily, a new generation CALT tool should not be monolithic in
its nature and design but rather modular, and therefore extendable with
critical features adapted to a given translation process workflow. Most
modern tools support extensions through plugins, but these are mostly for
specific well-identified border cases, like importing/exporting documents
or supporting special dictionaries. When it comes to core concepts like
how to form a TU, how to integrate TM functionality, how to annotate
the ST with a third-party tool for the purpose of NER or POS-tagging etc.,
most CA(L)T tools fail and make it impossible to provide an alternative
third-party solution. However, interoperability is one of the keys to full
tool openness. Our CALT design strategy is modular-by-design, with each
functional section of our system being a separate component.
14.4.2 Components
6 https://colabo.space/en/flow/.
Augmenting and Informing 273
14.4.3 Tool Democratisation
14.4.4 Component Generalisation
7 https://colabo.space/en/datatalks/.
274 Sasha Mile Rudan et al.
14.4.5 Workflows
14.4.6 Perspectives
Translation perspectives provide different views of the CALT tool and the
text to be translated; each perspective adapts the CALT tool and makes the
translating experience appropriate to each particular task of the process.
The above example of switching between TU-based translation with parallel
ST and TT texts and a ‘zen-like’ pure TT-only editor demonstrates two
such perspectives. Others might include perspectives like: (i) initial reading
of the ST; (ii) annotation of the ST; (iii) exploring previous translations;
and (iv) exploring scholarly findings on the ST and any relevant corpora.
It is important that the perspectives should not be seen as equivalent
to workflow tasks (CALT features); each perspective usually consists of
(applies to) multiple tasks and/or components sometimes even the same
set of tasks but with different configurations (like different TT alignment
types).
Figure 14.7 (split into Parts (a) and (b) for the sake of readability) demon-
strates the interface of the WriTerra CALT toolset in practice. We can see
the ST, multiple TTs together with the text currently being translated (the
2nd pane in Part (a) of the screenshot), and collaborative commenting (the
last overlay pane in Part (b) of the screenshot) presented side by side, with
embedded multi-annotations (‘AnnoTatas’), semantic editor and stylomet-
ric insights chart fed from the underlying Bukvik workflow and ezGlot
service.8 For more details, see Rudan et al. (2020, 2021a).
14.6 Conclusion
In this chapter, we addressed the major challenges that a translator faces
when it comes to integrating CALT tools into their standard translation
processes. We established a clear set of expectations for an improved
CALT tool and argued for a shift in focus away from ‘assisting’ towards
‘augmenting’ the translator, which reshapes the relationship between the
CALT tool and the translation stakeholders. Consequently, we proposed
a paradigm shift in designing and implementing a modular CALT tool
with flexible workflow support and democratised data content across the
whole translation process, rather than only at boundary points. This work
is informed and illustrated by the analysis of translation processes and
translation theories evaluated on projects such as Prismatic Jane Eyre.
We believe that the proposed approach is in itself conducive to research
developments in and beyond CALT. To give just one example, a workflow-
based infrastructure such as the one outlined here can gather and analyse
workflow models that translators opt for (anonymously or otherwise),
and thus provide a retrospective analysis for the study of the translation
process. Since the augmented translation process is affected by the use of
technology, these changes can also be fed back into the development of
further CALT tools. At the same time, CALT can provide a retrospective
8 www.ezglot.com/.
278 Sasha Mile Rudan et al.
Figure 14.7 ( a) and (b) An example of using the WriTerra CALT toolset in
translation (using annotations, collaboration, machine alignment and
stylometric charts).
Augmenting and Informing 279
workflow for the study of the translation process, doing away with the
current impediments to the study of the process for longer projects.
Even though we have already implemented the majority of the CALT
components introduced here, we still need to tighten the integration
between the set of tools that inform the translator in their translation
process (primarily Bukvik) and the pure CALT-editing tool (WriTerra) in
order to apply it in real-world experiments with translators (initially, in
translator training environments), and open it up for external research and
evaluation. Enabling such external evaluation and learning from real-life
applications of our toolkit would be the primary focus of our future work.
References
Berman, A. (1985). La traduction comme épreuve de l’étranger. Texte, 4, 67–81.
Berman, A. (2021 [1985]). Translation and the trials of the foreign. In L. Venuti
(Ed. & Trans.), The translation studies reader (4th ed., pp. 247–60). Routledge.
https://doi.org/10.4324/9780429280641
Borg, C. (2022). A literary translation in the making: A process-oriented perspective.
Routledge. https://doi.org/10.4324/9781003150909
Cronin, M. (2010). The translation crowd. Tradumàtica: traducció i tecnologies
de la informació i la comunicació, 8. https://revistes.uab.cat/tradumatica/article
/view/n8-cronin/pdf_15
Daems, J. (2022). Dutch literary translators’ use and perceived usefulness of
technology: The role of awareness and attitude. In J. L. Hadley et al. (Eds.),
Using technologies for creative-text translation (pp. 40–65). Routledge. https://
doi.org/10.4324/9781003094159
Hadley, J. L., Taivalkoski-Shilov, K., Teixeira, C. S. C., & Toral, A. (Eds.). (2022).
Using technologies for creative-text translation. Routledge. https://doi.org/10
.4324/9781003094159
Hvelplund, K. T. (2017). Translators’ use of digital resources during translation.
HERMES – Journal of Language and Communication in Business, 56. https://
doi.org/10.7146/hjlcb.v0i56.97205
Jiménez-Crespo, M. A. (2021). The ‘technological turn’ in translation studies: Are
we there yet? A transversal cross-disciplinary approach. Translation Spaces,
9(2), 314–41. https://doi.org/10.1075/ts.19012.jim
Kelbert, E. (2022). Appearances: Character description as a network of signification
in Russian translations of Jane Eyre. Target, 34(2), 219–50. https://doi.org/10
.1075/target.20079.kel
Kurzweil, R. (2005). The singularity is near: When humans transcend biology.
Duckworth.
Latour, B. (2007). Reassembling the social: An introduction to actor-network
theory. Oxford University Press.
Lommel, A. (2018). Augmented translation: A new approach to combining human
and machine capabilities. Proceedings of the 13th Conference of the Association
for Machine Translation in the Americas (Volume 2: User Track) (pp. 5–12).
https://aclanthology.org/W18-1905
280 Sasha Mile Rudan et al.
Lotman, J. (1977). The structure of the artistic text (R. Vroon, Trans.). University
of Michigan.
Markovitch, A. (1996). Zametki francuzskogo perevodchika Dostoevskogo. In G.
M. Fridlender (Ed.), Dostoevsky. Materialy i issledovaniya (Vol. 12, pp. 255–
9). Nauka.
Moretti, F. (2013). Distant reading. Verso.
Nord, C. (1997). Translating as a purposeful activity: Functionalist approaches
explained. St. Jerome. https://doi.org/10.4324/9781315760506
Ortega, J., Sánchez-Martínez, F., & Forcada, M. (2016). Fuzzy-match repair using
black-box machine translation systems: What can be expected? Conferences
of the Association for Machine Translation in the Americas: MT Researchers’
Track (pp. 27–39). https://aclanthology.org/2016.amta-researchers.3
Penkale, S., & Way, A. (2013). Tailor-made quality-controlled translation.
Proceedings of Translating and the Computer 35. TC 2013, London. https://
aclanthology.org/2013.tc-1.13
Poncelas, A., Lohar, P., Hadley, J., & Way, A. (2020). The impact of indirect
machine translation on sentiment classification. Proceedings of the 14th
Conference of the Association for Machine Translation in the Americas
(Volume 1: Research Track) (pp. 78–88). https://aclanthology.org/2020.amta
-research.7
Rossi, C. (2018). Literary translation and disciplinary boundaries: Creative writing
and interdisciplinarity. In K. Washbourne & B. Van Wyke (Eds.), The Routledge
handbook of literary translation (pp. 42–57). Routledge.
Rothwell, A. (2021). Uses of alignment: Retranslating Zola with/against Vizetelly.
La Main de Thôt, 9. https://www.researchgate.net/publication/359617463
Rudan, S. M., Kelbert, E., Kovacevic, L., Rudan, S., & Reynolds, M. (2020).
Twin Talk: Bukvik+LitTerra+Colabo.Space – An example of DH collaboration
across disciplines, languages, and style (Vol. 2717, pp. 15–29). CEUR–WS.
https://ueaeprints.uea.ac.uk/id/eprint/84216/
Rudan, S. M., Rudan, S., Kelbert, E., Sagic, A., Kovacevic, L., & Reynolds, M.
(2021a). Colabo.Space—Participatory platform for evolving research and
publishing workflows. In G. Berget, M. M. Hall, D. Brenn, & S. Kumpulainen
(Eds.), Linking theory and practice of digital libraries (pp. 112–17). Springer.
https://doi.org/10.1007/978-3-030-86324-1_13
Rudan, S. M., Rudan, S., & Møller-Pedersen, B. (2021b). Extending BPM(N)
to support face-to-virtual (F2V) process modelling. Proceedings of the
9th International Conference on Model-Driven Engineering and Software
Development (pp. 350–61). https://doi.org/10.5220/0010349203500361
Ruffo, P. (2022). Collecting literary translators’ narratives: Towards a new
paradigm for technological innovation in literary translation. In J. L. Hadley,
K. Taivalkoski-Shilov, C. S. C. Teixeira, & A. Toral (Eds.), Using technologies
for creative-text translation (pp. 18–39). Routledge. https://doi.org/10.4324
/9781003094159
Schiaffino, R. (2016). CAT tools and translation style. https://www.aboutranslation
.com/2016/04/cattools-and-translation-style.html
Van de Meer, J. (2021). Translation economics of the 2020s. Multilingual Magazine.
https://multilingual.com/issues/july-august-2021/translation-economics-of-the
-2020s/
Augmenting and Informing 281
ranking 42, 44, 48, 72–3, 83, Sketch Engine 203, 222–31
101, 122 skopos 183
rationalisation 120 social network (text as) 268
readability 56 software 3, 12, 73–4, 92, 96, 99, 103,
reader: as editor 148; as translator 114, 192, 203, 223, 231, 248
146; survey 33 speech 71, 78–9, 99, 208, 215,
reading experience 127–8, 136, 138, 229–30
148, 183 statistical machine translation (SMT)
recurrent neural network (RNN) 7–8, 6–13, 54, 102, 163, 177
10, 13, 161–3 statistical significance 73, 83
reference 27, 224 stoplist 208, 212–13
register 45, 57, 71, 79, 81–2, 99, 120, string 7–8, 12, 110, 123
145, 212 style 2–3, 14, 16–17, 40–3, 45, 47,
repetition 106, 123, 211, 215–16, 226 60, 65, 72, 95, 99, 106, 108, 120,
retranslation 16, 106–24, 127, 148, 122, 151, 174–5, 177, 179, 183–4,
238, 241, 251, 263, 271 206–8, 215–16, 219–20, 232, 254,
Reverse Lookup 242 276; author’s 69, 73, 127, 180;
revision 4, 15, 56, 112–13, 240, 249, character’s 73; translator's 4, 13, 16,
265, 274 56, 69–85, 261
reweighting 165 stylometry 259, 261, 276, 277
rhyme 17–18, 95, 99–101, 164, 166–8, subconscious correction 64
170, 180, 241, 244, 249, 254–5; subsegment recall 123
constraint 167; prior 166; scheme subsidy 92, 102
168–9, 237–9, 243–5, 250, 252–4 subword unit 8, 30
RhymeDesign 242 support vector machine 71
RhymeZone 242 syllable 100, 166, 242, 253
rhythm 46, 63, 99, 122, 246 synonym 79, 81–2, 118, 121, 123,
180, 182, 268
sandbox 118, 121 syntactic structure 12
screen clutter 118 synthetic data 8, 11, 29, 31, 64
segment 16, 61, 95–8, 101, 109–10, Systran 147
112–14, 116, 118–19, 121–3,
128, 130–1, 134–6, 192–3, 205, tagging 223, 225, 272
211, 215, 230, 266, 268; length target audience 109
120; parallel 126, 129–30, 138, TAUS Dynamic Quality Framework
230; source 130, 136; target (DQF) 40
113, 136 technical effort 54, 58–60
segmentation 7, 30, 95, 98, 103, technological singularity 259, 263
109, 265 technological turn 261
self-translation 261 temporal effort 11, 54, 58–60
semantic model 264 tense 46–7, 57
semantic proximity 118 TER 9, 134
sentence: length 11, 14, 55, 70, termbase 92–3, 96, 98–9, 154
99–100, 207; pair 8, 27, 29, terminology 55, 58, 61–2, 65, 93,
30; structure 79, 113, 148, 95–6, 98, 106, 150, 154, 173,
178, 182 226, 231
SentencePiece 30, 132 test set 8, 29, 33–5, 37, 42, 133
sentiment analysis 163 testing 13, 57
shift 56, 59, 63, 175, 226, 228, 231, Tetrapla 75
249, 268 text: creative 2, 126; cultural 5;
simile 99, 175, 177–9 electronic 3, 109; focus 72;
simplification 12, 55 formulaic 93, 96, 102; functional
Index 289