Professional Documents
Culture Documents
To cite this article: Kay Cahill & Renee Chalut (2009) Optimal Results: What Libraries Need to
Know About Google and Search Engine Optimization, The Reference Librarian, 50:3, 234-247,
DOI: 10.1080/02763870902961969
INTRODUCTION
Address correspondence to Kay Cahill, Vancouver Public Library, 350 West Georgia St.,
Vancouver, British Columbia, Canada V6B 6B1. E-mail: kay.cahill@vpl.ca
234
Google and Search Engine Optimization 235
sponsored search, there are enormous payoffs for companies that can make
their product stand out from the crowd.
Search engine optimization (SEO) is currently one of the key aspects of
successfully marketing products and services on the modern web. SEO
techniques are not in themselves new, but over time they have been refined
and reworked until they have come to have significant influence on the
pages most heavily relied on for Internet users seeking information—the
results of their Google searches.
This is significant for libraries because as we move increasingly toward
a model where our users expect to be able to search independently, our
role shifts more toward becoming the guide who provides the critical infor-
mation literacy instruction that enables them to perform those searches
effectively and evaluate the authority and relevance of the results they
receive in return. If SEO techniques are leading to changes in the order of
search results and the quantity of what amounts to targeted advertising
within those results, our users need to be aware of this and it’s our respon-
sibility, through the courses we teach and the advice that we give, to help
create that awareness. The more we can help library users become aware of
the limitations and benefits of search engines and teach them to recognize
authoritative sources versus self-styled experts and product placement, the
more effectively they will be able to construct their searches and dissemi-
nate the information they find.
This article seeks to explore both the evolution and effects of SEO
techniques, including the efforts of search engines to reduce the effects of
“black hat,” or illegitimate, SEO on search results, and the role of the library
in helping users understand what influences search results and how to
better assess their relevance and accuracy.
on results lists because they had paid to be there rather than because of
their relevance to the search terms (Hedger 2004).
Then Google came on the scene, a young upstart determined to
change the face of Internet searching. Google eschewed the lure of both
banner advertising and paid inclusion in favor of AdWords, or keyword-
based, advertising. Google was not the first search engine to use this type of
sponsored search; it was pre-dated by both Overture in 1998 and BeFirst in
1999 (Fain and Pedersen 2006), but AdWords quickly overtook its competi-
tors and by 2008 Google was estimated to be in control of 69% of the online
advertising market (Baker 2008).
The premise behind AdWords and similarly sponsored search products
is simple. Keywords are used to match brief advertisements to searches
entered by Google users. In addition to the main list of search results, the
advertisements retrieved are displayed under a separate “Sponsored Links”
header. The keyword matching is intended to ensure that only relevant
advertisements are shown, increasing the likelihood of the searcher clicking
through to the advertiser’s web site.
Revenue from AdWords derives from two sources: the number of click-
throughs on the advertisements themselves (the standard pay-per-click
model) and the results of keyword auctions. Advertisers bid on what they
consider to be the most relevant search words to increase their chances of
successful sales; for example, an online winter sports retailer might bid on
keywords including “skis,” “snowboard,” and “snowshoes.” The AdSense
program also offers a pay-per-click revenue-sharing scheme to third party
web sites, which choose to host sponsored AdWords advertisements on
their sites.
Google AdWords ads are part and parcel of Google searching, but they
are also clearly separated from the main search results list. Although the
color scheme is the same, the placement on the page and the small grey
“Sponsored Links” header are a clear sign to all but the most novice
searcher that these are not a part of the main search results. Basic Internet
instruction courses offered by libraries almost always include an advisory
note to help new users differentiate between sponsored results and regular
results.
Clearly there is a huge market for AdWords and enough people are
clicking through and following up with purchases to keep both Google
itself and its millions of third-party advertisers interested and subsidized.
However, AdWords remains first and foremost an obvious marketing ploy.
Even the name makes it clear that, regardless of the connotations of the
public label “sponsored,” they are advertisements.
What’s wrong with obvious advertising? First and foremost, the Internet
is pretty tired of it. Users who lived through the era of endless pop-ups,
animated gifs, and garish banner ads have grown to loathe web advertise-
ments, especially the obvious ones. Witness the enormous furor that erupted
Google and Search Engine Optimization 237
on LiveJournal early in 2008 when Sup, the site’s Russian owners, announced
that they were removing basic accounts and forcing users to choose
between paid accounts and free “sponsored” accounts that covered users’
blogs in third party advertisements. Sixty-eight pages (5,000 individual com-
ments) of furious user responses and a 1-day “content strike,” where many
of LiveJournal’s most popular bloggers boycotted the site entirely, rapidly
ensued, forcing Sup to retract the cancellation of the basic accounts (Gladkova
2008).
As a result of this growing antipathy toward traditional web advertising,
businesses are turning to other solutions. YouTube, for example, is proving
a popular venue for ad agencies who carefully construct video advertising
campaigns to look like typical amateur videos, often to the disgust of site
users when the truth emerges (Sidebar 2008).
The holy grail for many companies is one simple thing: achieving as
prominent a place as possible not in the sponsored links section but in the
main results list. Whether a company has products to sell or is simply trying
to achieve a wider audience or more page hits, the obvious way to achieve
its goal is to appear in the place where most of the searcher’s attention is
focused. This is where SEO comes in. SEO is about creating a web page that
will appear as high as possible on the search engine results page (SERP) for
a particular keyword search.
At the heart of Google’s search results (and Google itself; Backrub, the Google
prototype, evolved from a research project by founders Sergei Page and
Larry Brin during their days at Stanford University) lies PageRank. PageRank
is based on a concept familiar to every library school student: citation analysis.
On the web, this means that the unique PageRank algorithm essentially uses
the popularity of a page to define its place in the search engine results list
for a particular search. A hyperlink to Page A from Page B counts as one
vote for Page B on the PageRank scale.
A simple concept, but the execution is somewhat more sophisticated
than may first appear. PageRank not only measures the number of times an
individual page is linked to, but also weights the relative importance of the
pages that are linking to it on a scale of 1 to 10 to provide a level of authority
to the individual vote. Therefore, a link from the Harvard University web
site is accorded significantly more importance by PageRank than a link from
Dave’s Blog.
As soon as Google released PageRank, webmasters and web site owners
set to work figuring out how to make it work to their advantage. Manipulating
the ranking system of Google and other search engines is known as SEO.
For SEO, the key lies in manipulating these rankings so that their site rises
238 K. Cahill and R. Chalut
up within the SERP listings. Their goal is not just to be on page one of the
results, but to achieve one of the top five or ten spots that are most likely to
be clicked on by the information seeker who entered the original search.
SEO practices are divided into two camps: white hat and black hat. The
intentions of the two camps are similar, as are many of the practices. The
former are (or represent) legitimate businesses or organizations with infor-
mation to present or products to sell the web searcher. Our aforementioned
winter sports retailer wants to make its site findable by someone who wants
to buy a snowboard, so it will employ certain accepted tools and techniques
to optimize accessibility to potential snowboard buyers.
Black hat consultants or programmers and site owners want you to
look at their web sites too. Site owners may have a legitimate product to
sell or something interesting to show you, but often it is not what you were
looking for. Black hat SEO consultants do not actually care whether any-
body buys anything in particular; they get paid by the click, so their goal is
purely to get more visitors to a site regardless of what users do there. As a
result, they are more than happy to resort to rather devious and sleazy tricks
to send the user to sites that are not relevant to their search purely in the
interest of increasing traffic to those sites. Our poor snowboarder is simply
trying to find a new board, but the black hatters lure him or her into a site
advertising time shares, online casinos, or fake Rolex watches. White hat SEO
sites potentially want you to buy a product, but they do aim for a certain
relevance when pushing their sites to users.
People generally associate the term “spam” with unwanted e-mails that
clog their inboxes, but spam also refers to black hat SEO techniques that
mislead web searchers for commercial purposes and the sites themselves.
Google defines search engine spam as practices that attempt to deceive the
search engine and ultimately the searcher by manipulating search engine
results to push irrelevant pages to the top of the rankings. Accepted SEO
practices, which maintain a degree of relevance to the commercial page push,
are not considered search engine spam; black hat techniques, however, are.
Both legitimate and illicit SEO techniques are mainly based around two
types of manipulation: page content and links. The earliest tricks of the SEO
trade involved modification of content to improve a page’s chance of
matching a searcher’s keyword inquiries. Google’s PageRank was devel-
oped as a direct response to the early practice of keyword stuffing, which
led to porn sites (which rapidly became very good at this) regularly appear-
ing at the top of search results regardless of what keywords the searcher
entered in the search engine. The web designers simply populated the
metadata field in the HTML code with keywords that they anticipated users
were likely to search, such as “president” or “united states,” often repeating
the keywords ad infinitum.
Keyword stuffing still exists today, but search engines have smartened
up and now look for repeated keywords in code as an indication of
Google and Search Engine Optimization 239
results. Part prank, part social commentary, the perpetrators of this and
other Google bombs liberally pepper their own and other web sites and
blogs with links to the White House site using “miserable failure” as anchor
text.
For quite a long time, Google’s response to the bombing was to do
nothing, citing their desire to refrain from manipulating what it saw as the
“objectivity” of their PageRank system. (Meyer 2005). However, after many
well publicized bombs, Google seemed to realize that PageRank was too
easily manipulated to be entirely objective and had received numerous
complaints from users who thought the company endorsed the opinion that
Bush or Jimmy Carter or others were “Miserable Failures.” In January 2007,
Google announced it was tweaking its search algorithms to fight Google
bombs (Cuffs 2007).
most of the decade since its launch (the verb “to google” first appeared in
the New York Times in 2001), and for most people (not just casual searchers
but even many trained information professionals) it remains not just the first
port of call but often the only port of call when looking for information.
As of December 2008, according to Internet marketing company Hitwise,
Google was the search engine of choice for over 72% of U.S. searches.
Google has almost made the search process too easy: it has trained
information seekers to expect useful, relevant results with almost every
single search, regardless of the vagaries of the keywords selected. The
smarter Google’s 200 signals get at weeding out irrelevancies, correcting
misspellings, and accurately identifying when a user is looking for Turkey
(the country) as opposed to turkey (the bird), the more people trust and
rely on that first page of results returned by their search.
The result of this is that searchers have gotten lazy and are getting
lazier. Jansen and Spink’s (2006) study into user search practices found that
73% of search engine users do not look at any search results beyond the
first page. A recent iProspects (2006) survey revealed that an increasing
number of search engine users are unwilling to look past the first three
pages of results. The impact of this can be clearly seen with a sample search
for McDonald’s; all results in the first three pages are produced by some
branch of the corporation.
These are significant and somewhat frightening statistics. It means that
three-quarters of searchers never venture beyond that first page, that top ten
listing targeted so aggressively by myriad black and white hatted SEO prac-
titioners. More interesting still would be a similar study into the search
behavior of reference personnel—we who think of ourselves as search pro-
fessionals—how many search engines do we use and how many page
results will we look through before we give up?
The take-home message for libraries is that the end users of search engines,
our patrons, are not fully aware of all the factors that contribute to the SERP
that they see in front of them following a Google search, and that one of the
keys to helping them fully understand and exploit the capacity of search
engines is changing their own entrenched information seeking behaviors.
One of the ways to download knowledge and tools to our users is increase
our own knowledge and use the tools ourselves. As search engines become
more important to reference work on both sides of the desk, a sound pro-
fessional familiarity with their workings is essential. Including web sites and
blogs such as Search Engine Watch and Search Engine Land in your list of
professional reading is essential to keep up to date with new developments
in what’s happening in search engine world.
244 K. Cahill and R. Chalut
the searcher to sort, resort, compare, and contrast the products on offer,
rather than simply being presented with a list whose order is dictated by the
marketing prowess of the company. Moreover, the inclusion of reviews and
ratings allows the searcher to access the collective experience of all the
users who have previously bought the product or interacted with the ven-
dor. This has been taken a step further by sites like Get Satisfaction (<http://
www.getsatisfaction.com>) and Restaurantica (<http://www.restaurantica.com>),
which are organization- and community-driven ranking sites. Although user
ratings and reviews are not necessarily objective, they provide immediate
feedback on commercial products and services from the people who have
actually used them, which is invaluable for the searcher trying to filter out
the best product from the array on offer.
Libraries also play a role in teaching users to look beyond not just the
first page of their Google results, but beyond Google itself. There is a
wealth of information on the deep web locked behind barriers, such as reg-
istration and authentication, that users who never look beyond Google are
missing. Introducing users to subject-specialized resources, full-text article
databases, and vertical search products gives them access to a new range of
information and authority. For most library staff, these things are second
nature; we use them without even thinking about it to verify, double-check,
and compare and contrast the information we are finding. What is so critical
in the modern information environment, when more data than one person
could process in his or her lifetime is just a mouse-click away, is sharing
these skills with our patrons to help them see his or her own search results
through that all-important critical eye.
CONCLUSION
Without a more extensive study into the impact of SEO practices on search
engine results, it is hard to say how detrimental SEO actually is on SERP list-
ings. Moreover, not all SEO has a malign influence on search results. The
influence of SEO may be self-correcting to some degree; as SEO practices
become more widespread and better understood, increasing numbers of
web sites will simply incorporate even the more sophisticated optimization
techniques as standard, eliminating some of the advantage held by those
web sites that currently use the most effective SEO practices. The search
engines themselves are strongly committed to the elimination of black hat
SEO wherever possible, although fighting this particular battle to some
extent mirrors the two steps forward, one step back campaigns against e-mail
spam, spyware, and various other scourges of the digital age.
Some have speculated that the PageRank system and SEO tactics, both
white and black hat, have led to a World Wide Web where the information
users end up looking at (those vital top ten results) is overwhelmingly
246 K. Cahill and R. Chalut
REFERENCES
Hedger, Jim. 2004. Paid inclusion going the way of the dodo. Zeromillion.com. http://
www.zeromillion.com/webmarketing/web-site-submission-google.html (accessed
January 5, 2009).
iProspect. 2006. iProspect search engine user behavior study. www.iprospect.com/
premiumPDFs/WhitePaper_2006_SearchEngineUserBehavior/ (accessed January
27, 2009).
Jansen, B., and A. Spink. 2006. How we are searching the world wide web. Informa-
tion Processing and Management 42: 248–263.
Meiners, Janet. 2008. Search engine marketing projected to reach $25.2 billion in 2011.
Marketing Pilgrim. http://www.marketingpilgrim.com/2008/03/search-engine-
marketing-to-projected-to-reach-252-billion-in-2011.html (accessed January 28,
2009).
Meyer, Marissa. 2005. Google bombing failure. The official Google Blog. http://
googleblog.blogspot.com/2005/09/googlebombing-failure.html (accessed January 28,
2009).
QuestionPoint. 2009. QuestionPoint statistical reports. http://www.questionpoint.org
(accessed January 30, 2009).
Segev, Elad. 2008. Search engines and power: A politics of online (mis-)informa-
tion. Webology no. 5, http://www.webology.ir/2008/v5n2/a54.html (accessed
January 30, 2009).
Sullivan, Danny. 2007. Google kills Bush’s miserable failure search & other Google
bombs. Search Engine Land. http://searchengineland.com/google-kills-bushs-
miserable-failure-search-other-google-bombs-10363 (accessed January 29, 2009).
Wii Fit girlfriend viral video is actually an ad. 2008. Sidebar: The Gamespot Newsblog.
http://www.gamespot.com/pages/news/show_blog_entry.php?topic_id=
26429442&part=rss&subj=6192391 (accessed January 5, 2009).