Professional Documents
Culture Documents
The authors are practising translators or translator trainers in the fields con-
cerned. Although specialists, they explain their professional insights in a manner
accessible to the wider learning public.
Anthony Pym
Series Editor
This page intentionally left blank
Electronic Tools for Translators
Frank Austermühl
First published 2001 by St. Jerome Publishing
Notices
Knowledge and best practice in this field are constantly changing. As new research and experience
broaden our understanding, changes in research methods, professional practices, or medical
treatment may become necessary.
Practitioners and researchers must always rely on their own experience and knowledge in
evaluating and using any information, methods, compounds, or experiments described herein. In
using such information or methods they should be mindful of their own safety and the safety of
others, including parties for whom they have a professional responsibility.
To the fullest extent of the law, neither the Publisher nor the authors, contributors, or editors,
assume any liability for any injury and/or damage to persons or property as a matter of products
liability, negligence or otherwise, or from any use or operation of any methods, products,
instructions, or ideas contained in the material herein.
Cover design by
Steve Fieldhouse, Oldham, UK
Typeset by
Delta Typesetters, Cairo, Egypt
Glossary 177
References 182
Index 189
This page intentionally left blank
How to use this book
This book is addressed to translation trainers, students and professionals inter-
ested in the manifold possibilities offered by electronic tools for translators. It
can also be used as a textbook for regular classes on computer-assisted transla-
tion or for translation practice classes, as well as for self-learning. Each chapter
(with the exception of the chapters on the Internet) deals with a different tool
and can be approached as an individual lesson.
The chapters are all structured in terms of five basic parts:
At the end of the book there is a general bibliography and lists of the Internet
addresses and the products mentioned, along with an extensive index.
Although it is recommended that the reader work through the book in the
order of presentation, individual access to selected topics is also possible. When
you use the table of contents or the index provided, the book can easily become
a reference work.
Although a number of tools are described in great detail, this book clearly
does not claim to cover each and every single product available on the market.
However, the most relevant types of tools are presented, and examples of each
type are given. The focus is on general strategies, explained through numerous
examples relevant to translation. Readers are invited to develop these strategies
in terms of their own experience as translators. The tasks will hopefully deepen
your understanding of the subject matter and arouse your appetite for more.
When reading this book, please bear in mind that the term ‘electronic trans-
lation tools’ does not refer exclusively to machine translation (MT). MT systems
are only one of many kinds of translation tools. Indeed, since MT systems ne-
glect the communicative, cultural and encyclopedic dimensions of translation,
it is questionable whether they really provide ‘translation’ at all.
It is also important to understand that the tools demonstrated in this book are
not replacements for human translation. There is no such thing as a computer-
aided cure-all that will make a poor translator into a good one. However, used
properly, the right tools can help good translators improve the efficiency and
quality of their work. Indeed, quality is the primary yardstick for assessing elec-
tronic translation tools.
And if so, do those tools need to be electronic? Instead of answering these ques-
tions immediately, I would like to set the stage by describing one of the contexts
in which translation is currently being carried out. It is a context characterized
by globalization, specialization and digitization.
Globalization
The concept of globalization entails another crucial question, one that might
make the entire translation profession obsolete: Do we, as citizens of the
‘global village’, really need translation at all? Why bother with 4,000 to 6,000
different languages if we can make do with only one, namely English? After all,
English is the dominant language in international politics, business, technol-
ogy, science, education, aviation, seafaring, etc. It is the lingua franca of the
global market economy, especially in the field of business–to–business com-
munication. For example, 80 percent of all business transactions in Denmark
are carried out in English (Lockwood 1998:16). International business negotia-
tions and contract talks among non-native English speakers from Algeria and
Germany are conducted in English. The importance of English as a promoter of
economic growth is further demonstrated by the fact that within the Organiza-
tion for Economic Co-operation and Development (OECD), English-speaking
countries receive three times as much foreign direct investment as those where
English is not the official language (Mai & Wettach 1999:130). Globally, many
large corporations have adopted English as their official language. About 85
percent of international organizations use English as their working language. In
Europe, 99 percent of all international organizations have English as one of
their official languages (Mai & Wettach 1999:130). The world of technology,
too, is heavily influenced by English, or more accurately by its American vari-
ety. Roughly 80 percent of the contents of the 1 billion Internet pages on the
web at the time of writing are in English. Almost 60 percent of all Internet users
are native speakers of English. The lion’s share of the 8,000 online databases
currently available is taken by information in the English language. The global
Electronic Tools for Translation 3
characteristics of the local market. The globalized economy means that busi-
nesses are trying to sell the same goods – from iMacs to Eastpacks to Big
Macs – in every corner of the world. This requires traders to accommodate their
marketing and selling strategies to local idiosyncrasies. Globalization thus au-
tomatically entails localization, as is expressed in the term glocalization and
slogans such as ‘Think globally, act locally’.
A powerful catalyst for translation has thus been created by the rapid inter-
nationalization of markets, particularly by the need to localize not only products
but also the methods of designing, producing, marketing and distribution.
users, with a rapidly growing interest in non-English sites as the Net becomes
genuinely multilingual. Websites in Spanish, Portuguese, German, Japanese,
Chinese and Scandinavian languages are showing the strongest growth rates. It
is estimated that although 57.4 percent of Internet users were based in English-
speaking countries in 1999, this figure will drop to 43 percent by 2005. Growth
rates in the number of Internet users in non-English-speaking countries are al-
ready much higher than in English-speaking countries. While the number of
English-speaking Internet users is expected to rise by 60 percent over the next
six years, the number of non-English-speaking users is expected to increase by
150 percent. At the same time, it is estimated that the total number of Internet
users worldwide will increase from 171 million in 1999 to 345 million in 2005.
This will lead to a growing number of multilingual sites and thus to a growing
demand for website translation.
Part of this increase in translation demand will be absorbed by more sophis-
ticated methods of machine-aided translation. Also, owing to international
competition, translation pay-rates might be expected to stagnate or even decline
in the future. Nevertheless, globalization is definitively promoting the demand
for translations and translators. This is also confirmed by the increasing number
of translation programmes in universities worldwide. In Europe alone, there are
about 80,000 students in 150 institutions studying translation on the level of
higher education (ASSIM 1997). For an updated list of translator training insti-
tutions, see Anthony Pym’s page at http://www.fut.es/~apym/tti/tti.htm (see also
(see also Snell-Hornby 1998 and Pym 1998).
For translators there is no longer any question of whether or not to use comput-
ers and networks. The use of information and communication technology (ICT)
is a fait accompli in the lives of today’s language professionals. The electronic
handling of orders and the digitized delivery of language services are now taken
for granted as standard client services. For example, the European Commis-
sion’s Language Service requires its external freelance translators to use a rather
sophisticated IT infrastructure. More than 50 percent of the translators inter-
viewed for the 1997 ASSIM report were using electronic dictionaries, and about
one-third of the translators were using translation memory systems.
The application of institutional pressure is not the only way translators will
be convinced to make extensive use of modern ICT systems. Exposure to the
right approaches in using electronic tools will lead to a bottom-up automation
of the translator’s workplace, enabling translators to increase significantly their
professional translation service quality and efficiency. These two effects – the
improvement of quality and increased productivity – will clearly strengthen the
position of any freelance translator in a rapidly changing and increasingly com-
petitive global language market (Wood 1998).
Trends toward globalization and international networking have resulted in
organizational changes in enterprises worldwide. In the resulting virtual enter-
prises, outsourcing and workflow-management systems have impacted on the
translation business so greatly that we can now talk about ‘tele-translation’.
This concept is based on the assumption that translators can communicate with
clients across the globe; they can compete for orders regardless of their loca-
tion. A sophisticated IT infrastructure thus becomes a significant competitive
advantage in terms of client acquisition. Furthermore, quick access to relevant
8 Frank Austermühl
and reliable online and offline data has become a key issue in the retrieval of
encyclopedic and linguistic information necessary for the tasks being performed
(Austermühl forthcoming).
It is obvious that the growing demand for high-quality translations of techni-
cal texts is no longer manageable without the use of computer-based methods.
The increase in efficiency needed to deal with this growing demand can only be
achieved by the expanded use of modern forms of ICT.
Electronic resources offer several distinct advantages. For a start, much of
the content of specialized print dictionaries are already out of date by the time
the books reach the market. To understand the reason for this, just look at the
speed with which domain knowledge is being produced in some innovative dis-
ciplines such as ICT, bio-technology or immunology, and then compare this
process with the time it takes to put together and publish a print dictionary.
Electronic dictionaries, on the other hand, can be published online or offline
(i.e. on CD-ROM) without any delay and can easily be kept up to date over the
Internet. Many publishing houses have changed their strategies with regard
to publishing reference works in general and have turned to publishing their
works on CD-ROM or even online. The powerful presence of the Encyclo-
paedia Britannica (http://www.britannica.com) on the Internet underlines this
process. Many specialized encyclopedias or scientific journals that have long
been valuable resources for translators are no longer published in print but are
only delivered digitally. Many newspapers and magazines offer online access
to their archives, thereby allowing a comprehensive search of all articles within
seconds. Printed versions simply cannot compete.
At the same time, the exchange of texts and information between translators
and their clients is taking place almost exclusively over electronic channels.
The traditional ‘snail mail’ has been replaced by its electronic brethren. In all
these areas, dramatic modifications are being made to the way the business of
translation is conducted. The advantages of modern information and communi-
cation systems are introducing new criteria of speed, flexibility, timeliness and
comprehensiveness.
Figure 1 shows the three levels of the translator’s workstation proposed by Alan
Melby in the early 1980s.
Text Processing
Level 1 Telecommunications Software
Terminology Management Systems
Others (DTP, Converter)
Text Analysis
Level 2 Automatic Dictionary Look-up
Bilingual Text Retrieval
Other (SGML)
Only level 1 elements of Melby’s model had been used on a broad scale by
the mid-1990s; tools at levels 2 and 3 had only been partially implemented. At
the end of twentieth century, however, these products were widely used in the
professional translation world. CD-ROM-based dictionaries, encyclopedias
and terminology management systems are now standard equipment for most
translators.
Translation memory systems are being used by all major translation agen-
cies, translation departments, and many freelances as well. Universal data
exchange formats, such as SGML (Standard Generalized Markup Language),
10 Frank Austermühl
When dealing with the use of computers for translation, all kinds of acronyms
are used to explain the different types of tools. A list of some of the more com-
mon ones, which readers can use as a quick self-test, includes: MT, CAT, HT,
HAMT, MAHT, FAHQMT. Figure 2 puts these various acronyms into some
kind of perspective.
The acronyms are now points on a continuum that shows the gradual in-
crease in the automation of the translation process caused by the use of the tools
human involvement
mechanization
FullyAutomatic
Fully Automatic Human-Aided
Human-Aided Machine-Aided
Machine-Aided Human
Human
High Quality
High Quality Machine
Machine Human
Human Translation
Translation
Translation
Translation Translation
Translation Translation
Translation
(FAHQT)
(FAHQT) (HAMT)
(HAMT) (MAHT)
(MAHT) (HT)
(HT)
Computer-Assisted
Computer-Assisted
Translation(CAT)
Translation (CAT)
and the consequent decrease in human involvement. Let’s take a closer look at
this continuum.
It is quite obvious that both extremes shown in Figure 2 are easily discarded
as valid models for contemporary translation. The antiquated image of a lone
translator, armed only with a pencil or a typewriter and surrounded by dusty
books, is no longer realistic. However, the idea of an independently acting, error-
free translating machine is equally unrealistic and will not become a reality for
a long time to come, if at all.
Having eliminated HT and FAHQT, our interest is focused on the area of
translation where humans and computers co-operate to improve the overall
quality of professional translation. Here, depending on whether humans or ma-
chines do the lion’s share of a translation, we speak either of machine-aided
human translation (MAHT) – translation aided by text-processing software,
terminology databases or electronic dictionaries – or human-aided machine
translation (HAMT) – translation memories (which come empty and initially
have to be filled with translations from human translators) or MT systems that
require extensive human pre- and post-editing.
Both terms, MAHT and HAMT, are often grouped together under the term
computer-assisted translation (CAT).
Transfer
ST M ap Cultural Adaptation
TT M ap
Form
Text U nderstanding
t io n
u la t io
Information Retrieval
Text V erification
Problem Analysis
Text A rchiving
n
Text A nalysis
Source Target
Text (ST) Direct Transfer Text (TT)
details, we can see how electronic tools can be used regardless of the type of
text being translated. We have added some information to the model in order to
show the sub-processes during the reception, transfer and formulation phases.
Holmes states that the reception of a source text leads to an abstract form, a
mental concept of this text. He calls this a map. This abstraction is heavily
influenced by contextual, intertextual, situational and individual factors. Ac-
cording to Holmes, the translator transforms the source text map into the target
text map. In literary translations, but more frequently in technical texts, this
step can involve a high degree of cross-cultural adaptation. The resulting men-
tal image of the source text is then put into written form through a step that
Holmes calls ‘formulation’.
The three main elements in this process – reception, transfer and formulation –
demand different competencies and activities by the translators, and different
features by the tools used in their support.
Figure 4 replaces the sub-phases mentioned in the first model shown in Fig-
ure 3 with the tools that can be used to support the translation process. These
tools will all be featured in this book.
Transfer
ST Map Integrated Hypermedia TT Map
Knowledge Bases
Reception Formulation
Terminology Database Terminology
Reception Dictionaries b
Production Dictionaries
Electronic Encylopedia Electronic Archives
Knowledge Database Digital Corpora
Term Extraction Document Management
Direct Transfer
Source Target
Text (ST) Translation Memories, Text (TT)
Machine Translation
Let’s take a closer look at these sub-processes and the different tools appli-
cable to each.
background knowledge plays a central role. As Brown & Yule point out, “un-
derstanding discourse is [...] essentially a process of retrieving stored information
from memory and relating it to the encountered discourse” (Brown & Yule
1983:236). In order to understand the source text, world and domain knowl-
edge, encyclopedias, knowledge databases and information retrieval systems
are all equally important, as are the contacts to domain experts through news-
groups or mailing lists.
What else happens during the reception phase? The source text is analyzed,
primarily with regard to functional aspects (Nord 1997) but also in order to
discover the translator’s information needs (Hönig 1998). This first step can
also comprise an automated statistical text analysis which can be assisted by
electronic tools such as Word Cruncher, WordSmith, or MonoConc or other
terminology extraction tools, the aim being to filter the special terminology used
in the text. This process can result in type-token lists, and in KWIC concord-
ances showing the single term, the key word, in its context (Austermühl,
Einhauser & Kornelius 1998). These lists can then be used by translators –
especially a team of translators handling a large translation project – to ensure
terminological consistency throughout the translation. The resulting KWIC con-
cordances can also be directly imported into terminology databases such as
MultiTerm or Microsoft Access, where further information and links to other
resources can be added (Austermühl & Coners forthcoming).
Terminology databases or electronic dictionaries (either on CD-ROM or on
the Internet) can then be used to decode the linguistic information of the source
text. This kind of ‘reception’ dictionary – as opposed to the production (or en-
coding) dictionary used in the formulation phase – needs to provide a high degree
of paradigmatic information, for example synonyms, antonyms and hyperonyms.
Such dictionaries assist the understanding of the text-internal knowledge by
positioning a given term within its lexical context.
The above model calls for individual use of electronic translation tools with
regard to the specific sub-processes of the translation process. However, in the
past 5 to 10 years the language industry has spent enormous amounts of money
creating and marketing products that are able to cover the total translation pro-
cess in one go. The products designed to automate the process of translation
(almost) completely are translation memories, software localization tools and
machine translation systems.
16 Frank Austermühl
Translation memories
mechanized version of Douglas Adams’ Babel Fish – a black box that converts
the source language input into a (perfect) target language output without any
human interaction. After spending a great deal of money on the development of
MT systems, researchers gave up the idea of Fully Automatic High-Quality
Translation (FAHQT). They have instead focused on using MT for the transla-
tion of texts covering a highly restricted encyclopedic and terminological field
and consisting of very simple sentence structures. Machine translation and mod-
ern MT systems are dealt with extensively in Chapter 10.
Tasks
Blatt, A., K.-H. Freigang, K.-D. Schmitz and G. Thome (eds) (1985) Computer und
Übersetzen, Hildesheim, Zürich, New York: Georg Olms.
Brace, C., M. Vasconcellos and C. Miller (1995) MT Users and Usage: Europe and
the Americas, http://www.lim.nl/eamt/archive/summit95.html. (1 March 2001)
Bradley, Stephen P., Richard L. Nolan and Jerry A. Hausman (eds) (1993) Globali-
zation, Technology, and Competition: the fusion of computers and telecommu-
nications in the 1990s, Boston, Mass.: Harvard Business School Press.
Brown, G. and G. Yule (1983) Discourse Analysis, Cambridge: Cambridge Univer-
sity Press.
Brungs, B. (1996) Translation Memories als Komponente Integrierter
Übersetzungssysteme, Saarbrücker Studien zur Sprachdatenverarbeitung und
Übersetzen, Band 7, Saarbrücken: Fachbereich 8.6 – Angewandte Sprachwiss-
enschaft sowie Übersetzen und Dolmetschen, Universität des Saarlands.
Chiu A. L. S., E. Sherry and X. Phung (1999) ‘Just try to be specific’, in Nature
401/6749, 111.
Cohen, J. and J. Ward (1998) ‘World’s wide Web needs to speak its languages’,
CNN interactive, http://www.cnn.com/TECH/computing/9810/26/lang.idg.
(1 March 2001)
Cole, R.A., J. Mariani, J. Uszkoreit, A. Zaenen and V. Zue (1996) Survey of the
State of the Art in Human Language Technology, http://cslu.cse.ogi.edu/
HLTsurvey/. (1 March 2001)
Der Spiegel (1996) ‘Klick in die Zukunft’, in Der Spiegel 11/96, 66-99.
Dohler, P. (1996) ‘Facets of Software Localization’, in Language Partners Interna-
tional, http://www.languagepartners.com/reference-center/whitepapers/l10nwp/
triacom.htm. (1 March 2001)
Dyson, E. (1998) Release 2.1: A Design for Living in the Digital Age, New York:
Broadway Books.
Esselink, B. (1998) A Practical Guide to Software Localization, Amsterdam &, Phila-
delphia: John Benjamins.
European Commission (1999) A Multilingual Community at Work. The European
Commission’s Translation Service, Luxembourg: Office for Official Publications
of the European Communities. http://europa.eu.int/comm/translation/en/
enintro.html. (1 March 2001)
Feldmann, Doris, Fritz-Wilhelm Neumann and Thomas Rommel (eds) (1997)
Anglistik im Internet – Proceedings of the 1996 Erfurt Conference on Comput-
ing in the Humanities, Heidelberg: Universitätsverlag C. Winter.
Gantz, J. (1998) ‘Coming soon: Language barriers’, in Computerworld, 2 Feb. 1998.
http://www.computerworld.com/cwi/story/0,1199,NAV47_STO13396,00.html.
(1 March 2001)
Gralla, Preston (ed), Sarah Ishida, Mina Reimer and Stephen Adams (illustrators)
(1999) How the Internet Works: Millennium Edition, Indianapolis: Que.
Gross, A. (1992) ‘Limitations of Computers as Translation Tools’, in J. Newton
(ed) Computers in Translation, London: Routledge, 96-130.
Gurian, Phil (1996) E-mail Business Strategies, Spokane: Grand National Press.
Harris, R. (1997) ‘Evaluating Internet Research Sources’, http://www.sccu.edu/ fac-
ulty/R_Harris/evalu8it.htm. (1 March 2001)
184 Frank Austermühl