You are on page 1of 9

HOW GOOGLE SEARCH ENGINE

WORKS

INTRODUCTION TO GOOGLE
Name- Google, because it is a common spelling of googol, or 10100 and fits well with our
goal of building very.

Google is an American multinational corporation specializing in Internet-related services and


products. These include online advertising technologies, search, cloud computing,
and software. Most of its profits are derived from AdWords , an online advertising service that
places advertising near the list of search results.
Google was founded by Larry Page and Sergey Brin while they were Ph.D. students at Stanford
University. Together they own about 14 percent of its shares but control 56 percent of the
stockholder voting power through supervoting stock. They incorporated Google as a privately
held company on September 4, 1998. An initial public offering followed on August 19, 2004.
Its mission statement from the outset was "to organize the world's information and make it
universally accessible and useful, and its unofficial slogan was "Don't be evil. In 2004, Google
moved to its new headquarters in Mountain View, California, nicknamed the Googleplex.

Introduction to Google Search Engine

Google Search, commonly referred to as Google Web Search or just Google, is a web search
engine owned by Google Inc. It is the most-used search engine on the World Wide
Web, handling more than three billion searches each day. As of February 2015 it is the most
used search engine in the US with 64.5% market share.
The order of search on Google's search-results pages is based, in part, on a priority rank called a
"PageRank". Google Search provides many different options for customized search,
using Boolean operators such as: exclusion ("-xx"), alternatives ("xx OR yy OR zz"),
and wildcards ("Winston * Churchill" returns "Winston Churchill", "Winston Spencer Churchill",
etc.) The same and other options can be specified in a different way on an Advanced Search
page.

How Google Search Engine Works


Google consists of three distinct parts, each of which is run on a distributed network of thousands
of low-cost computers and can therefore when we enter a query.
1) Carry out fast parallel processing - Parallel processing is a method of computation in
which many calculations can be performed simultaneously, significantly speeding up data
processing
2) Googlebot - A web crawler that finds and fetches web pages.
The indexer that sorts every word on every page and stores the resulting index of words in a
huge database.
3) The query processor, which compares your search query to the index and recommends the
documents that it considers most relevant.

The web server sends the query to the index servers. The content inside the index
servers is similar to the index in the back of a book--it tells which pages contain the
words that match any particular query terms.

The query travels to the doc servers, which actually retrieve the stored documents.
Snippets are generated to describe each search result

The search results are returned to the user in a fraction of a second.

Crawling
This technology is needed to gather the web documents and keep them up to date. Storage
space must be used efficiently to store indices and, optionally, the documents themselves.

Indexing
The indexing system must process hundreds of gigabytes of data efficiently. Queries must be
handled quickly, at a rate of hundreds to thousands per second.
Google is designed to scale well to extremely large data sets. It makes efficient use of
storage space to store the index.
Its data structures are optimized for fast and efficient access Further, we expect that the cost
to index and store text or HTML will eventually decline relative to the amount that will be
available This will result in favourable scaling properties for centralized systems like Google.

The Google search engine has two important features that help it produce high precision
results.
First - It makes use of the link structure of the Web to calculate a quality ranking for each
web page. This ranking is called PageRank.
Second - Google utilizes link to improve search results.

PageRank: Bringing Order to the Web

The citation (link) graph of the web is an important resource that has largely gone unused in
existing web search engines. They have created maps containing as many as 518 million of
these hyperlinks These maps allow rapid calculation of a web pages "PageRank", an
objective measure of its citation importance that corresponds well with peoples subjective
idea of importance Because of this correspondence, PageRank is an excellent way to
prioritize the results of web keyword searches For most popular subjects, a simple text
matching search that is restricted to webpage titles performs admirably when PageRank
prioritizes the results For the type of full text searches in the main Google system, PageRank
also helps a great deal.

Anchor Text
The text of links is treated in a special way in our search engine. Most search engines
associate the text of a link with the page that the link is on. In addition, we associate it with
the page the link points to.This has several advantages. First, anchors often provide more
accurate descriptions of web pages than the pages themselves. Second, anchors may exist
for documents which cannot be indexed by a text-based search engine, such as images,
programs, and databases. This makes it possible to return web pages which have not
actually been crawled. Note that pages that have not been crawled can cause problems,
since they are never checked for validity before being returned to the user. In this case, the
search engine can even return a page that never actually existed, but had hyperlinks pointing
to it. However, it is possible to sort the results, so that this particular problem rarely happens.

BIBLIOGRAPHY
www.google.com
www.wikipedia.com
www.slideshare.net

You might also like