P
the Thesaurus of English Words and Phrases in 1852, but not until 1957 was the term “thesaurus” used in the context of information retrieval by Peter Luhn of IBM [1]. As the impact of computers and the Internet on information retrieval has increased, printed thesauri have evolved to include other types of controlled vocabularies.

This article is meant to be used as a starting point toward further study of controlled vocabularies.

Julia Marshall has a Master’s Degree in Library Science from Catholic University and has been indexing since 1998. While working part-time at the American Institutes for Research and working with online indexes for the National Center for Education Statistics, she became interested in controlled vocabularies. A past chair of the Washington DC chapter of ASI, she is expanding her skills to include information architecture as well as indexing. She would like to thank Bonnie Jo Dopp, Sue Nedrow, and Pilar Wyman for their help in editing this article.

Defining the Vocabulary

When you plan a controlled vocabulary, you need to begin gathering information on the client to determine the best solution for their information retrieval needs. You need to ask a lot of questions, which will generally fall into four main categories.

1) What is the material being searched?
2) Who is doing the searching?
3) How will the controlled vocabulary be implemented?
4) How will the controlled vocabulary be maintained?

The first question concerns content. Is the content mostly text, or will pictures or even sound bites be involved? Is the content already online or is it in print format? Consider also the specificity and the stability of the terms. How many fine distinctions will you need to make among your terms?

Fast, Leise, and Steckel use the example of camping gear. An outdoor company that has 100 types of tents is going to need more specific terms than a company that sells only 7-10 styles [2]. You could have terms for “three-season tents,” “expedition tents,” and “screen house tents.” You could have terms for the number of people that will fit in a tent, from “bivy-sack tents” to “family tents.” Designs of tents include “A-frame,” “umbrella hub,” and “hoop tents.” Tents can also be “fiberglass-framed.” And this list does not include the tents used for commercial or military purposes [3]. To think I used to sleep in a plain old “pup tent” in the backyard when I was a kid!

Stability refers to how often vocabularies change. When I hiked as a youngster, we carried plastic jugs of water with us. Now people carry “hydration systems.” If you had a term for “drinking water,” would you change that term to “hydration systems,” or would you make “hydration systems” a synonym for “drinking water?” If you added “hydration systems” to a hierarchy, where would it fit and how would it affect other terms like “water purification systems” or “giardiasis”?

The second question addresses the end user or target audience. Sometimes this is an easy one. The end users for a company intranet are the employees of that company. But how web-savvy are the company employees? Are new employees trained on the intricacies of the intranet, or are they left to fend for themselves? What documents are company employees most likely to be searching? If your target audience is the general public, answering these questions can be quite daunting. Ask the marketing department for information on customer profiles. If a department routinely handles queries from the public, interview staff about what kinds of questions people ask. Rosenfeld and Morville have some excellent information on user research in the 2002 edition of Information Architecture [4].

If the controlled vocabulary will be used in an online environment, you will need to ask how the vocabulary will be accessed. Who will be responsible for “plugging” the controlled vocabulary into the site? Is he or she a willing participant in the process of creating a controlled vocabulary? Cooperation from the person responsible for the implementation of the controlled vocabulary is crucial in an online environment. Will the controlled vocabulary be used as a browsing list with hyperlinks or will it be hidden within the search function? If it is in the search function, will your client be using a database program such as SQL Server or MS Access? Or will they be using XML or CGI scripts? Will they be using a third-party search engine such as Inktomi or a custom-made search function? All of these can significantly affect how a controlled vocabulary works within a site. Since this is such a complicated area, I will be writing more about this in the next installment of this series.

Finally, consider how the controlled vocabulary will be used by the staff to maintain and add records to the database. Will the staff be trained information specialists who are full-time employees, or interns who change every three to six months? Trained employees will be much more capable of working with complex hierarchies and relationships. Interns might be better off with a simpler synonym ring. If the staff finds a new term that they want to add to the controlled vocabulary, what will be the procedure? Will anyone be able to add terms, or only designated staff? Milstead writes, “A thesaurus is never ‘finished,’ unless it is no longer being used for indexing or its database is no longer being updated. Plan for maintenance before you even begin developing your thesaurus. A