Professional Documents
Culture Documents
Ben Zimmer
like the only serious contender.” Candidly, Rundell stated that “for
dictionary publishers, these developments are not altogether welcome: a
reliable revenue stream is drying up, and an alternative business model
has not yet emerged.”
Leaving aside for a moment the business concerns that are
afflicting the entire publishing industry, I would like to focus on exactly
how the move to digital can become a “liberating” moment for the
creators of dictionaries and thesauruses, and how users stand to benefit
from the electronic experience. In her introduction to the Electronic
Lexicography volume, Sylviane Granger lays out six areas of innovation that
lexicographers are currently exploring in online reference: hybridization,
more and better data, efficiency of access, customization, corpus integration, and
user input (Granger 2012). Let us consider each of these innovations in
turn, and how online resources are taking advantage of them.
dictionary data from multiple licensed sources, with Wordnik adding not
only corpus examples but real-time results from Twitter (for text) and
Flickr (for images). The danger then becomes overwhelming the user
with a surfeit of information about a lexical item, which can become
even more unwieldy if advertisements are also running on a web page.
Judicious use of web design requires knowing which kinds of
language-related data (text, audio, or visual) will be of most use and
how best to feature them in an online environment. This will depend
largely on the goals of the dictionary or thesaurus in question. Lexical
reference sites targeting English language learners, for instance, may
make use of simpler definitions and information about how words are
used in collocations.
By way of example, word pages in the Vocabulary.com Dictionary
feature more conversational explanations of words in addition to typical
dictionary definitions supplied by WordNet, a lexical database developed at
Princeton University. This more discursive approach to defining words by
means of “full-sentence definitions” has been explored in the past primarily
in dictionaries designed for learners of English as a foreign language
(Atkins and Rundell 2008, 441). But the value of explaining words in an
engaging style is just as important for native speakers of English who would
not be able to appreciate the full range of how a word is typically used from
a traditional dictionary definition. Researchers in how students engage
with dictionaries have long called for “definitions that characterize a word’s
use and are phrased in language that makes meaning easily accessible to
young learners” (McKeown 1991, 154). The Vocabulary.com Dictionary
takes exactly this approach to elucidate vocabulary words with the goal of
ensuring that the explanations will be memorable.
Consider the word quadruped, which a standard dictionary would
define simply as “an animal with four feet.” In the Vocabulary.com Dictionary,
the word page for quadruped leads with this light-hearted explanation:
A squirrel, a zebra, a deer, a wolf, and a grizzly bear meet in a
field. Yes, a disaster in the making, but also a bunch of quadrupeds
— animals that walk on four feet.
-ped is for the feet: think of centipedes and millipedes, insects that
have so many feet it’s disturbing. A human is a biped because
they walk on two feet. If you meet a human with four feet, you
could call him a quadruped. You could also call the circus and let
them know their quadruped is loose.
The move from the static printed page to the dynamic web page
encourages a responsive design that is tailored to a user’s needs. This
becomes a crucial concern in creating resources for English language
learners. As Granger (2012) suggests, customization in an online dictionary
should ideally be both adaptive (automatically adjusting to user input)
and adaptable (open to the user’s own personalization).
The Vocabulary.com Dictionary is fully integrated in an adaptive
gameplay environment that quickly adjusts to the level of the user by
means of item response theory (IRT), so that the learner’s time is not
282 Ben Zimmer
wasted by vocabulary items too far above or below his or her proficiency.
IRT is the central model in computerized adaptive testing (currently
used in the US for such standardized tests as GRE and GMAT), and
it allows appropriate questions to be presented to test-takers, zeroing
in quickly to their skill level based on their successive responses. For
such an approach to work effectively, a large pool of questions must be
continually calibrated, scored according to difficulty and discrimination
(that is, how well a question discriminates between users of different
ability levels). Vocabulary.com has an ever-increasing pool of questions,
and the voluminous user data collected over millions of sessions provides
accurate models for the appropriate questions to present to a vocabulary
learner at any given time (Lohr 2013).
The Vocabulary.com Dictionary is not only integrated into an
adaptive learning system but is adaptable as well. Users can create their
own custom-made vocabulary lists to study from and then learn those
lists in the same gamified environment as the main Vocabulary.com
Challenge. This also allows teachers to focus on those vocabulary items
that are appropriate to their students’ particular needs. A learning
environment that is both adaptable and adaptive can lend itself to
vocabulary instruction in a way that transcends static flash cards and the
rote memorization of word lists.
entries, and also for innovative “front-end” features that allow users to
plumb corpus data on their own. Rather than relying on the manual
accumulation of citation slips, lexicographers can now consult a
seemingly limitless supply of relevant text to illuminate the usage of any
given item. This embarrassment of riches presents new challenges, of
course. Corpus data is most useful when lexical items are identified by
part of speech; fortunately, automated part-of-speech taggers are making
great strides in accuracy. But this type of pre-processing must still be
accompanied by tasks that, at least thus far, only humans can adequately
perform, such as “word sense disambiguation” (WSD) to determine
which definition of a word in a dictionary or thesaurus entry corresponds
with a usage in a particular sentence culled from the corpus.
The payoff from this semi-automated work is sizable, however.
The online entries for Oxford Dictionaries, for instance, are enriched
by numerous corpus-derived example sentences broken down by
individual senses, reflecting the results of a major WSD project enlisting
freelance lexicographers in both the US and UK. A similar project has
been undertaken for Vocabulary.com, where the goal is not simply to
assign corpus sentences to appropriate word senses, but then to use such
sentences (derived largely from literary and journalistic sources) as the
basis for adaptive vocabulary instruction in a quiz format.
was begun online in 2005. The suggestions serve as inspiration for the
annual additions of new words, alerting editors to fresh formations and
shifts in meaning and usage. Collins has recently gone even further in
this direction. User suggestions are now tightly integrated into their
online dictionary, with contributors credited by name in the new entries.
And the Collins publishing schedule is greatly accelerated, with user
contributions folded into online updates about once a month. The
Collins new-words initiative, while clearly successful as a marketing
gambit, raises some questions about the shaky status of dictionaries in
the digital era. Is this kind of crowdsourcing a worthwhile endeavor for
dictionary makers, beyond providing valuable publicity for publishers
facing a tough consumer market? Or could the reliance on the wisdom
of the crowds end up diluting the authority that the leading print
dictionaries have traditionally held?
As reference publishers grapple with these questions, it is clear
that dictionaries and thesauruses need to adapt to their new online
homes and understand the opportunities that web-based interaction
presents. The lessons of search engine optimization, for instance, can be
applied to boost the online visibility of a dictionary or thesaurus, and web
traffic analytics can be studied to determine exactly how users approach
a site, down to which words are being searched for at any given moment
(Lannoy 2010). Such a fine-grained understanding of how online lexical
references build and maintain an audience will surely be a key to their
future survival. It is also vital that dictionary and thesaurus sites be
enriched with data-driven and visually stimulating features in order to
engage with “digital natives.” This means meeting the digital natives on
their own technologized terrain, fulfilling expectations of interactivity
and fluidity in sophisticated interfaces.
I believe that lexicographers, working with web and app
developers, are up to the challenge. Dictionaries and thesauruses
are being reinvented and in the process are becoming more valuable
resources than ever. This is a golden opportunity to build much better
lexical references, ones that honor the legacy of the print tradition while
taking full advantage of the benefits of online content and design. The
content can now more accurately reflect language as it is actually used,
with insights from corpus-driven methods illuminating semantic nuances
as never before. Improvements in design can present this linguistic data
more engagingly. And in so doing, lexicographers will introduce new
generations to the interwoven splendor of language.
Lexicography 2.0: Reimagining Dictionaries for the Digital Age 285
References
Atkins, B[eryl] T. S[ue] 1996. Bilingual dictionaries: Past, present and future.
Euralex ’96 Proceedings, Part II, 515–546. Göteburg University, Department
of Swedish.
———, and Michael Rundell. 2008. The Oxford Guide to Practical Lexicography.
Oxford: Oxford University Press.
Dziemianko, Anna. 2012. On the usefulness of paper and electronic dictionaries.
In Electronic Lexicography, edited by Sylviane Granger and Magali Paquot,
319–342. Oxford: Oxford University Press.
Fatsis, Stefan. 2011. Word Freak: Heartbreak, Triumph, Genius, and Obsession in the
World of Competitive Scrabble Players. Tenth Anniversary Edition. Penguin.
Flanagan, Padraic. 2014. RIP for OED as world’s finest dictionary goes out of
print. The Telegraph, Apr. 20. <http://www.telegraph.co.uk/culture/
culturenews/10777079/RIP-for-OED-as-worlds-finest-dictionary-goes-
out-of-print.html>
Granger, Sylviane. 2012. Electronic lexicography: From challenge to opportunity.
In Electronic Lexicography, edited by Sylviane Granger and Magali Paquot,
1–11. Oxford: Oxford University Press.
Heffernan, Virginia. 2008. The medium: Lexicographical longing. New York
Times Magazine. May 11. <http://www.nytimes.com/2008/05/11/
magazine/11wwln-medium-t.html>
Kilgarriff, Adam. 2012. Re: End of print dictionaries at Macmillan. Euralex
mailing list, Nov. 6. <http://www.freelists.org/post/euralex/DSNA-RE-
End-of-print-dictionaries-at-Macmillan,2>
Lannoy, Vincent. 2010. Free online dictionaries: Why and how? In eLexicography
in the 21st Century: New Challenges, New Applications (Proceedings of eLex
2009), edited by Sylviane Granger and Magali Paquot, 173–182. Presses
Universitaires de Louvain.
Lew, Robert. 2012. How can we make electronic dictionaries more effective? In
Electronic Lexicography, edited by Sylviane Granger and Magali Paquot,
343–361. Oxford: Oxford University Press.
Lohr, Steve. 2013. A vocabulary site shows how to tailor online education.
The New York Times Bits blog, Apr. 10. <http://bits.blogs.nytimes.
com/2013/04/10/a-vocabulary-site-shows-how-to-tailor-online-
education/>
McKeown, Margaret G. 1991. Learning word meanings from definitions:
Problems and potential. In The Psychology of Word Meanings, edited by
Paula J. Schwanenflugel, 137–156. Hillsdale, NJ: Lawrence Erlbaum
Associates.
Mount, Harry. 2010. R.I.P. the OED: As the world’s greatest dictionary goes out
of print, why it spells disaster for everyone who loves words. Daily Mail.
Aug. 31. <http://www.dailymail.co.uk/news/article-1307580/R-I-P-
Oxford-English-Dictionary-spells-disaster-loves-words.html>
286 Ben Zimmer
Pratt, Dan. 2012. Re: End of print dictionaries at Macmillan. Euralex mailing list,
Nov. 5. <http://www.freelists.org/post/euralex/DSNA-RE-End-of-print-
dictionaries-at-Macmillan>
Rundell, Michael. 2012a. Stop the presses—the end of the printed
dictionary. Macmillan Dictionary Blog, Nov. 5. <http://www.
macmillandictionaryblog.com/bye-print-dictionary>
———. 2012b. The road to automated lexicography: An editor’s viewpoint. In
Electronic Lexicography, edited by Sylviane Granger and Magali Paquot,
15–30. Oxford: Oxford University Press.
Siegel, Muffy E. A. 2007. What do you do with a dictionary? A study of
undergraduate dictionary use. Dictionaries 28: 23–47.
Zimmer, Ben. 2007. Charting the digital future of dictionary research: Prospects
for online collaborative lexicography. Paper presented at the 2007
Dictionary Society of North America biennial conference, Chicago, IL.
———. 2012a. Reconsideration: The thesaurus. Lapham’s Quarterly 5, 2: 207–213.
<http://www.laphamsquarterly.org/reconsiderations/word-for-word.
php>
———. 2012b. Remodeling “the Warehouse of Roget”: How the thesaurus
is being reinvented as a writer’s tool. Introduction to Oxford American
Writer’s Thesaurus. Third edition. Compiled by Christine A. Lindberg, xi–
xvi. Oxford: Oxford University Press.