Professional Documents
Culture Documents
Assignment Number 1
Subject: ICT
Title: How Search Engine Works
Submitted to:
Sir Ashfaq Hussain Farooqi
Submitted by:
Muhammad Imran Taj
Reg Number:
SP18-BCS-116
Date of Submission:
26 Feb, 2018
Contents
1 Introduction.........................................................................................................................................3
1|Page
How Search Engine Works
1.1 History.........................................................................................................................................3
2 Working...............................................................................................................................................4
2.1 Crawling.......................................................................................................................................4
2.2 Indexing.......................................................................................................................................5
2.2.1 Design Factor.......................................................................................................................6
2.2.2 Data Structures:...................................................................................................................6
2.3 Retrieval.......................................................................................................................................7
2.3.1 The Information Retrieving Cycle:........................................................................................7
2.4 Advantages and Disadvantages:..................................................................................................8
2.4.1 Advantages:.........................................................................................................................8
2.4.2 Disadvantages:.....................................................................................................................9
3 Important Search Engines:...................................................................................................................9
3.1 Market Share:..............................................................................................................................9
4 Search Engine Submission:................................................................................................................10
References.................................................................................................................................................11
5 Following table contains information about different elements of MS word, formatting patterns or
styles used in the report:...........................................................................................................................12
1 Introduction
A Search Engine is a program software available through the internet
that searches documents and files for keywords and returns the results
2|Page
How Search Engine Works
1.1 History
The following table shows the history of search engine form first Search engine till Google.
Archie was the first tool ever made as search engine on the internet. The name stands for
“archive” without the “v”. It was created Alan Emtage, Bill Heelan and J. Peter computer
Science students at McGill University of Canada.
Google adopted the idea of selling search terms in 1998, from a small search engine
Company named goto.com This move had a significant effect on Se business.[ CITATION Pur18 \l
1033 ]
3|Page
How Search Engine Works
2 Working
Every search engine has three main functions:
Crawling (to discover content).
Indexing (to discover and store content).
Retrieval (to fetch relevant content when users query the search engine).
Now we discuss them in some detail:
2.1 Crawling
This involves scanning sites and collecting details about each page: titles,
Images, keywords, other linked pages, etc.
Different crawlers may look for different details like page layouts, where
advertisements are placed, where links are crammed in, etc.
Web crawler also revisit past papers to see if any change is happened.
4|Page
How Search Engine Works
Some sites are crawled more frequently, and some are crawled to greater depths, but
sometimes a crawler may give up if a site’s page hierarchy is too complex.
2.2 Indexing
Indexing is when the data from a crawl is processed and placed in database.
After crawling has been done, the results have been into Google’s index
According to Layman’s terms, indexing is the process of adding webpages in
to Google search
By Default, every WordPress post and page is indexed.
The actual search engine index is the place where all the data the search engine
has collected is stored. It provides the results of search queries and the pages
that are stored within the search index that appear on the search index page.
Here’s is a sneak peek of one of Google’s search center.
Suffix tree: Supports linear time lookup and is structured like a tree.
Tree: An ordered tree data structure that stores and associative array.
Inverted index: Stores a list of occurrence in the form of hash table.
Citation Index: Stores citation between certain documents.
Term Document Matrix: It stores the occurrences of words in documents
in a two dimensional sparse document.
2.3 Retrieval
Information Retrieval is the activity of obtaining information relevant to an
Information need from a collection of information resources.
It is the science of searching for information in a document, searching for
documents themselves, and also searching for metadata that describes
data, and for database of texts.
Searches and communication are the two most popular purposes of computer.
An information retrieval process begins when a user enters query into the
System. In information retrieval a query does not uniquely identify a single
object in the collection. Instead, several objects may match the query,
perhaps with different degrees of relevancy.
6|Page
How Search Engine Works
Query Formation.
Query Search.
Ranked List Selection.
Documents Examination.
Documents Delivery.
2.4.1 Advantages:
2.4.1.1 Variety:
An Internet search can generate a variety of sources for information.
This variety allows anyone searching for information to choose the
types of sources they would like to use, or to use a variety of sources
to gain a greater understanding of a subject.
2.4.1.2 Precision:
Search engines have the ability to provide refined or more precise results.
Some search engines, such as Google or Yahoo, enable you to specify
the type of web sources to be searched.
Being able to search more precisely allows you to cut down on the amount
of information generated by your search
Search engines within a website allow you to search information only on
that website, filtering out information from other web sources and giving
more precision in a user's search for information.
7|Page
How Search Engine Works
2.4.1.3 Organization:
Internet search engines help to organize the Internet and individual websites.
Search engines aid in organizing the vast amount of information that can sometimes
be scattered in various places on the same web page into an organized list that can be
used more easily.
2.4.2 Disadvantages:
Search engine show way too much useless on our screen.
Sometimes you even cannot find anything useful from searching results.
It wastes us much time to pick up useful information from seas of searching results.
Those who use search engine frequently may become lazy even stupid.
Search engine may bring people to various pornographic websites. Those websites are
especially harmful for children. And search engine has not found the best way to keep
those evil websites from kids now.
Google 44.52%
Bing 38.49%
8|Page
How Search Engine Works
Baidu 10.98%
Yahoo! 08.41%
Add an entirely new web site without waiting for a search engine to discover it,
And have a web site's record updated after a substantial redesign.
9|Page
How Search Engine Works
References
Baeza-Yates, R. (n.d.). Query Recommendation Using Query Logs in Search Engines.
Silverstein, C. (1999). Analysis of a very large web search engine query log. 12.
Sullivan, D. (October 14, 2002 ). How Search Engines Work . 12.
10 | P a g e
How Search Engine Works
11 | P a g e