You are on page 1of 34

Internet & HTML Programming

Unit - II
WWW- History
• World Wide Web is also commonly known as ‘The Web’. The WWW is a system to
access the Internet. The WWW is hypertext based information retrieval tool, it uses the
hypertext to access the various forms of information available on the world’s different
networks.
• 1989-1990 – Tim Berners-Lee invents World Wide Web at CERN.
• On 30 April 1993, CERN put the World Wide Web software in
the public domain.
• Tim moved from CERN to the Massachusetts Institute of
Technology in 1994 to found the World Wide Web
Consortium (W3C), an international community devoted to
developing open web standards.
• Established as a common language for sharing information
on computers.
WWW
•The World Wide Web is a system of interlinked hypertext documents accessed via the
Internet.

•With a web browser, one can view web pages that may contain text, images, videos, and
other multimedia and navigate between them via hyperlinks.

•The WWW is a network of online content that is formatted in HTML and accessed via HTTP.
The term refers to all the interlinked HTML pages that can be accessed over the Internet.

•It is the way of exchanging information between the computers on the Internet .

•It use a protocol called HTTP – Hypertext Transfer Protocol.

•HTTP defines how messages are formatted and transmitted, and what actions Web servers
and browsers should take in response to various commands.
Cont..
• WWW VS Internet
• The internet is the physical network of computers all over the world.
• The World Wide Web is a virtual network of web sites connected by hyperlinks (or
"links"). Web sites are stored on servers on the internet, so the World Wide Web is a part
of the internet.
• Internet:
 Internet is a global system of interconnected computer network.
 Massive network of networks.
 This computers communicate with each other using a common protocol (TCP/IP i.e.
Transmission Control Protocol / Internet protocol)
• World Wide Web (WWW) :
 WWW is one of the services that run on internet.
 WWW is a system of interlinked hypertext documents accessed via Internet.
 Way of accessing the information
3D Map of World Wide Web
WWW Components
• Structural Components :
 Clients/Browsers – to dominant implementations.
 Servers – Run on sophisticated hardware.
 Caches – Many interesting implementations.
 Internet – The global infrastructure which facilitates
data transfer.
• Semantic Components :
 Hyper Text Transfer Protocol (HTTP)
 Hyper Text Markup Language (HTML)
 Extensible Markup Language (XML)
 Uniform Resource Identifiers (URIs)
Website
• A website is a set of related web pages typically served
from a single web domain.
• A website is a collection of Web pages, images, videos or other
digital assets that is hosted on one or more web servers, usually
accessible via the internet.
• The pages of a website can usually be accessed from a
common root URL called the homepage and usually
reside on the same physical server.
Web Page
• Web page- a document, typically written in HTML, which is almost
always accessible via HTTP, a protocol that transfers information
from the Web server to display in the user's Web browser.
• Homepage - The default entry page to a website.

• The home page is used to facilitate navigation to other pages on


the site by providing links to prioritize and recent articles and
pages, and possibly a search box
Cont..
• Hypertext , Hyperlink & Hypermedia
• HyperText is a document containing words that connect
to other documents.
• Rather than remaining static like traditional text,
hypertext makes possible a dynamic organization of
information through links and connections (called
hyperlinks).
• Hyperlink is a reference to data that the reader can
directly follow by clicking or by hovering.
• Hypermedia is term used for Hypertext which is not
constrained to be text; it can include video and sound.
Classification based on style
• Static
– A static website is one that has web pages stored on the server in the format that is
sent to a client web browser. It is primarily coded in Hypertext Markup Language
(HTML)
– Static web pages are made of “fixed code,” and unless the site developer makes
changes, nothing will change on the page.
– A static website is delivered to a user exactly the way it’s stored. That means that
nothing on the page will change by the user or even the site administrator unless
there’s a redesign of the site, or the site administrator goes directly into the code
to change it. 
– A static site is the most basic kind of website, and the easiest to create. It
requires no server-side (also called back-end) processing, only client-side.
• Dynamic
– A dynamic website is one that changes or customizes itself
frequently and automatically, based on certain criteria.
– dynamic sites allows to create a user profile, comment on a post, or make
a reservation.
– Dynamic site examples include:
• E-commerce sites
• Blogs
• Any site with information that must be updated regularly
Web Server & Web Client
• Web servers speak the HTTP protocol, so they are often called HTTP
servers.
• HTTP servers store the Internet’s data and provide the data when it
is requested by HTTP clients.
• The clients send HTTP requests to servers, and servers return the
requested data in HTTP responses. Together, HTTP clients and HTTP
servers make up the basic components of the World Wide Web.
• The most common client is a web browser.
• Web browsers request HTTP objects from servers and display the
objects on your screen.
• Web Server:
Web server is basically system the deals with the client’s
request and provides a client with web pages by Hyper Text
Transfer Protocol (HTTP) and files via File Transfer Protocol
(FTP). URL (universal resource locator) is basically the address
where you want to receive data from. When you type a URL
in the address bar of your web browser, the web server sends
a request to the location where the domain name of this URL
is saved. Then the data you requested (i.e. the webpage) is
accessed and provided you by a server. This is how it works.
So, we can conclude that the processing and providing a
webpage to the client is the main duty of a web server.
• Web Client:
The web client can be said as an application or web browser
(like Google Chrome, Internet Explorer, Opera, Firefox, Safari)
which is installed in a computer and used to interact with
Web servers upon user’s request. It is basically a consumer
application which collects processed data from servers. A
Client and a Server are two parts of a connection, these are
two distinct machines, web client requests information, and
the web server is basically a PC that is designed to accept
requests from remote computers and send on the
information requested. Web server is responsible for storing
the information in order to be viewed by the clients and is
also usually a Web Host. A Web host allows connections to
the server to view said stored info.
• Cookie - A small file sent by a Web Server (usually
through a website) to a Web Browser which is saved on
the users computer. This file can then be referenced
when the user revisits the same Web Sever (website).
Web Application
• A web application (or web app) is an application software that
runs on a web server.
•  Web applications are accessed by the user through a web
browser with an active internet connection.
• A Web application (Web app) is an application program  that is
stored on a remote server and delivered over the Internet
through a browser interface.
• Commonly used Web applications can include webmail, online
calculators, or e-commerce shops
Domains and Subdomains
• In addition to an IP address, most Internet hosts or servers have a
Domain Name System (DNS) address, which uses words.

• A domain name identifies the type of institution that owns the


computer. An Internet server owned by IBM might have the
domain name ibm.com.

• Some enterprises have multiple servers, and identify them


with subdomains, such as products.ibm.com.
Domain Name Addressing
• Most web browsers do not use the IP address to locate Web sites and
individual pages.
• They use domain name addressing.
• A domain name is a unique name associated with a specific IP
address by a program that runs on an Internet host computer.
• This program, which coordinates the IP addresses and domain names
for all computers attached to it, is called DNS (Domain Name
System ) software.
• The host computer that runs this software is called a domain name
server.
Uniform Resource
Locators
• The IP address and the domain name each identify a particular computer on the
Internet.

• However, they do not indicate where a Web page’s HTML document resides on that
computer.

• To identify a Web pages exact location, Web browsers rely on Uniform Resource
Locator (URL).

• URL is a four-part addressing scheme that tells the Web browser:


 What transfer protocol to use for transporting the file

 The domain name of the computer on which the file resides

 The pathname of the folder or directory on the computer on which the file resides

 The name of the file


Structure of a Uniform Resource
Locators

protocol pathname

http://www.chicagosymphony.org/civicconcerts/index.htm

Domain name filename

http => Hypertext Transfer Protocol


URI \ URL
• A Uniform Resource Locator, or URL is the
address of a document you'll find on the WWW.
• The elements in a URL:
Protocol://server's address/filename
• Example : http://www.example.com/index.html
Search Engines

• A search engine is a web-based tool that enables users to locate information on the
World Wide Web. Popular examples of search engines are Google, Yahoo!, and MSN
Search.
• The first tools used for searching in the internet was Archie in 1990 by Alan Emtage.
• In 1991, two new search programs Verconica and Jughead.
• In septermer 1993, W3Catalog was the first web primitive search engines.
Searching Guidelines
• Specify the words clearly (+, -)
• Use Advanced Search when necessary Provide as many particular terms as
possible
• If looking for a company, institution, or organization, try: www.name [.com | .edu
| .org | .gov | country code]
• If the user use broad queries, try to use Web directories as starting points
Search Engine Types

1.Crawler based search engines


2. Search directories or indexes
3. Hybrid search engines
4. Meta search engines
Crawler Based Search Engines
• These types of search engines use a "spider" or a "crawler" to search the
Internet.
• The crawler digs through individual web pages, pulls out keywords and then
adds the pages to the search engine's database.
• Google and Yahoo are examples of crawler search engines.

• Crawler-based search engines are good when you have a specific search topic
• Google was founded by Larry Page and Sergey Brin.
• When a user types in a search and clicks on the button, the user is directly taken to the
first search result.
• Google has various “special features” which include weather, unit conversion, currency
conversion, time , calculator, maps etc.

• Yahoo was founded by David Filo and Jerry Yang


• Yahoo operates a portal that provides the latest news, entertainment, and sports
information. The portal also gives users access to other Yahoo services like Yahoo
Mail, Yahoo Maps, Yahoo Finance, Yahoo Groups and Yahoo Messenger.
• Directories depend on human editors to create their listings or the
database. Yahoo Directory, Open Directory and Look Smart are few
examples
• Hybrid search engines are search engines that use both crawler based
searches and directory searches to obtain their results .
Example:- Yahoo.com- Google.com
• MSEs Searches more hits for a single query, Meta Search Engine (Mamma,
Dogpile, etc.) sends the user query to different search
engines simultaneously and the top results shown as output to the user.

To know more information

https://www.quora.com/What-is-the-difference-between-
a-meta-search-engine-and-a-web-search-engine
• For more information:

• https://moz.com/beginners-guide-to-seo/
how-search-engines-operate

You might also like