You are on page 1of 12

SEARCH ENGINE

 A search engine is a software system that is particular information


specified in a textual web search query. The search results are generally
presented in a line of results, often referred to as search engine results
pages (SERPs) The information may be a mix of links to web pages,
images, videos, infographics, articles, research papers, and other types
of files. Some search engines also mine data available in databases or
open directories. Unlike web directories, which are maintained only by
human editors, search engines also maintain real time information by
running an algorithm on a web crawler. Internet content that is not
capable of being searched by a web search engine is generally described
as the deep web.
 A system for locating published information intended to overcome the
ever increasing difficulty of locating information in ever-growing
centralized indices of scientific work was described in 1945 by Vannevar
Bush, who wrote an article in The Atlantic Monthly titled "As We May
Think" in which he envisioned libraries of research with connected
annotations not unlike modern hyperlinks. Link analysis would
eventually become a crucial component of search engines through
algorithms such as Hyper Search and PageRank.
 Birth of search engines :-

 The first internet search engines predate the debut of the Web in
December 1990: WHOIS user search dates back to 1982,[5] and the
Knowbot Information Service multi-network user search was first
implemented in 1989.[6] The first well documented search engine that
searched content files, namely FTP files, was Archie, which debuted on
10 September 1990.[7] Prior to September 1993, the World Wide Web
was entirely indexed by hand. There was a list of webservers edited by
Tim Berners-Lee and hosted on the CERN webserver.

 One snapshot of the list in 1992 remains,[8] but as more and more web
servers went online the central list could no longer keep up. On the
NCSA site, new servers were announced under the title "What's
New!"[9] The first tool used for searching content (as opposed to users)
on the Internet was Archie. [10] The name stands for "archive" without
the "v".,[11] It was created by Alan Emtage[11][12][13][14] computer
science student at McGill University in Montreal, Quebec, Canada.

 The program downloaded the directory listings of all the files located on
public anonymous FTP (File Transfer Protocol) sites, creating a
searchable database of file names; however, Archie Search Engine did
not index the contents of these sites since the amount of data was so
limited it could be readily searched manually.
 Founder of search engines :-
 Fortunately for early users, the first web designers set to work
creating search engines. Indeed the first of them, the WWW Virtual
Library, was created by the Web's creator, Tim Berners-Lee. The
Virtual Library, and its most well-known clone, 1994's Yahoo, weren't

really search engines.

 Sir Timothy John Berners-


Lee OM KBE FRS FREng FRSA FBCS (born 8 June 1955), also
known as TimBL, is an English computer scientist best known as the
inventor of the World Wide Web. He is a Prosseferial Fellow of
Computer Science at the  and a professor at University of Oxfard the
Massachuttus Institute of Technology  (MIT). Berners-Lee proposed an
information management system on 12 March 1989, then
implemented the first successful communication between a Hypertext
Transfer Protocol (HTTP) client and server via the Internet in mid-
November.

 He was named in Time magazine's list of the 100th Most Important


People Of 20th Century and has received a Number of Other
Accoldes for his invention. He was honoured as the "Inventor of the
World Wide Web" during the 2012 Summer Olympics Opening
Ceremony in which he appeared working with a vintage Next
Computer. He tweeted "This is for everyone"which appeared in LCD
lights attached to the chairs of the audience. He received the 2016
Turing Award "for inventing the World Wide Web, the first web
browser, and the fundamental protocols and algorithms allowing the
Web to scale.
 2000s–present: Post dot-com bubble
 Around 2000, Google's search engine rose to prominence. The company
achieved better results for many searches with an algorithm called
PageRank, as was explained in the paper Anatomy of a Search Engine
written by Sergey Brin and Larry Page, the later founders of Google. This
iterative algorithm ranks web pages based on the number and PageRank
of other web sites and pages that link there, on the premise that good or
desirable pages are linked to more than others. Larry Page's patent for
PageRank cites Robin Li's earlier RankDex patent as an influence. Google
also maintained a minimalist interface to its search engine. In contrast,
many of its competitors embedded a search engine in a web portal. In
fact, the Google search engine became so popular that spoof engines
emerged such as Mystery Seeker.

 By 2000, Yahoo! was providing search services based on Inktomi's search


engine. Yahoo! acquired Inktomi in 2002, and Overture (which owned
AlltheWeb and AltaVista) in 2003. Yahoo! switched to Google's search
engine until 2004, when it launched its own search engine based on the
combined technologies of its acquisitions.

 A search engine maintains the following processes in near real time: 1.


Web crawling 2. Indexing 3. Searching Web search engines get their
information by web crawling from site to site. The "spider" checks for the
standard filename robots.txt, addressed to it. The robots.txt file contains
directives for search spiders, telling it which pages to crawl and which
pages not to crawl. After checking for robots.txt and either finding it or
not, the spider sends certain information back to be indexed depending
on many factors, such as the titles, page content, JavaScript, Cascading
Style Sheets (CSS), headings, or itmetadata in HTML meta tags. After a
certain number of pages crawled, amount of data indexed, or time spent
on the website, the spider stops crawling and moves on. "[N]o web
crawler may actually crawl the entire reachable web. Due to infinite
websites, spider traps, spam, and other exigencies of the real web,
crawlers instead apply a crawl policy to determine when the crawling of a
site should be deemed sufficient. Some websites are crawled
exhaustively, while others are crawled only partially".
 High-level architecture of a standard Web crawler

 Typically when a user enters a query into a search engine it is a few


keywords. The index already has the names of the sites containing the
keywords, and these are instantly obtained from the index. The real
processing load is in generating the web pages that are the search results
list: Every page in the entire list must be weighted according to
information in the indexes. Then the top search result item requires the
lookup, reconstruction, and markup of the snippets showing the context
of the keywords matched. These are only part of the processing each
search results web page requires, and further pages (next to the top)
require more of this post processing.

 Beyond simple keyword lookups, search engines offer their own GUI- or
commanddriven operators and search parameters to refine the search
results. These provide the necessary controls for the user engaged in the
feedback loop users create by filtering and weighting while refining the
search results, given the initial pages of the first search results.
 Different Types of Search Engines :-
1. Google
2. Bing
3. Samsung Internet
4. Yahoo
5. Yandex

1.Google :-
 Google LLC is an American Multinational Technological
Company that specializes in Internet -related services and
products, which include Online Advertising Technologies a
Search Engine, cloud computing , software, and hardware. It is
considered one of the Big Five companies in the American
Information Technology industry, along with Amazon, Apple ,
Meta (Facebook) and Microsoft .

 Google was founded on September 4, 1998, by Larry


Page and Sergey Brein while they were P.H.d. students
at Stanford University in California. Together they own about
14% of its publicly-listed shares and control 56% of the
stockholder voting power through super-voting stock. The
company went public via an Initial Public offering (IPO) in 2004.
In 2015, Google was reorganized as a wholly-owned
subsidiary of Alphabet Inc. Google is Alphabet's largest
subsidiary and is a holding company for Alphabet's Internet
properties and interests. Sunder Pichai was appointed CEO of
Google on October 24, 2015, replacing Larry Page, who
became the CEO of Alphabet. On December 3, 2019, Pichai
also became the CEO of Alphabet.
2. Bing :-
 Microsoft Bing (commonly known as Bing) is a  web search
engine owned and operated by Microsoft. The service has its
origins in Microsoft's previous search engines: MSN
Search, Windows Live Search and later Live Search. Bing
provides a variety of search services, including web, video,
image and map search products. It is developed
using ASP.NET.

 Bing, Microsoft's replacement for Live Search, was unveiled


by Microsoft CEO Steve Ballmer on May 28, 2009, at the All
things Digital conference in San Diego, California, for release
on June 3, 2009. Notable new features at the time included
the listing of search suggestions while queries are entered
and a list of related searches (called "Explore pane") based
on semantic technology from powerset, which Microsoft had
acquired in 2008.

 In July 2009, Microsoft and Yahoo! announced a deal in


which Bing would power Yahoo! Search. Yahoo! finished the
transition in 2012.
3. Samsung Internet:-
 Samsung Internet Browser (or
simply Samsung Internet or S Browser )

is a mobile web
browser for smartphones and tablets devel
oped by Samsung Electronics. It is based
on the open-source Chromium project. It
comes pre-installed on Samsung
Galaxy devices. Since 2015, it has been
available for download from Google
Play, and recently it is also available for
their Tizen-based smartwatch via
the Samsung Galaxy Store. Samsung
estimated that it had around 400
million monthly active users in 2016.
According to StatCounter, it had a market
share of around 4.98% (among 53.26% for
all Chrome variants) around May 2018.
4.Yahoo:-
 Yahoo (/ˈjɑːhuː/, styled as yahoo!)[7][8] is an American web
services provider. It is headquartered in Sunnyvale,
California and operated by the namesake company Yahoo!
Inc., which is 90% owned by investment funds managed
by Apollo Global Management and 10% by Verizon
Communications.
 It provides a web portal, search engine Yahoo Search, and
related services, including My Yahoo!, Yahoo Mail, Yahoo
News, Yahoo Finance, Yahoo Sports and its advertising
platform, Yahoo! Native.
 Yahoo was established by Jerry Yang and David Filo in
January 1994 and was one of the pioneers of the early
Internet era in the 1990s. n 2000, it was the most popular
website worldwide. Usage declined in the late 2000s as it lost
market share to Google. However, Yahoo domain websites
are still among the most popular websites, ranking 12th in
global engagement according to both Alexa
Internet[13] and SimilarWeb.
5.Yandex:-
 Yandex N.V. (/ˈjʌndɛks/; Russian: Яндекс) is a multinational
corporation primarily for Russian and Russian-language
users, providing 70 Internet-related products and services,
including transportation, search and information services, e-
commerce, navigation, mobile applications, and online
advertising.
 The firm is registered in Schiphol, the Netherlands as
a naamloze vennootschap (Dutch public limited
company), but the company founders and most of the team
members are located in Russia. It primarily serves audiences
in Russia and the Commonwealth of Independent States, and
also has 30 offices worldwide.
 The firm is the largest technology company in Russia[10] and
the second largest search engine on the Internet in Russian,
with a market share of over 42%. It also has the largest
market share of any search engine from Europe and
the Commonwealth of Independent States and is the 5th
largest search engine worldwide after Google, Baidu, Bing,
and Yahoo!.

You might also like