You are on page 1of 38

BY YASHIKA KAIMWAAL

FUNDAMENTALS
OF WEB
TECHNOLOGY
S.No. INTERNET WWW

WWW stands for World


1 Internet is a global network of networks.
wide Web.

World Wide Web which is a


Internet is a means of connecting a computer
2 collection of information which
to any other computer anywhere in the world.
is accessed via the Internet.

WWW is service on top of


3 Internet is infrastructure.
that infrastructure.

Web can be viewed as


4 Internet can be viewed as a big book-store. collection of books on that
store.

At some advanced level, to


At some advanced level, to understand we
5 understand we can think of
can think of the Internet as hardware.
the WWW as software.

WWW is more software-


6 Internet is primarily hardware-based. oriented as compared to the
Internet.

English scientist Tim Berners-


7 It is originated sometimes in late 1960s. Lee invented the World Wide
Web in 1989.
WWW is a subset of the
8 Internet is superset of WWW.
Internet.

The first version of the Internet was known In the beginning WWW was
9
as ARPANET. known as NSFNET.

10 Internet uses IP address. WWW uses HTTP.

WWW vs Internet: Difference Between Internet and WWW

Basis of
Internet World Wide Web (WWW)
Comparison
The World Wide Web or
The internet is a global network of WWW refers to online content
interconnected computer networks that that is formatted in HTML
Definition
use the TCP/IP protocol to connect and accessible via the
devices worldwide. HTTP/HTTPS protocol.

In 1989, Tim Berners-Lee, an


Internet was founded in the late English scientist, created the
Founded
1960s. World Wide Web.

The internet’s architecture is


The World Wide Web is based
decentralized and follows a peer-to-
on a client-server architecture.
peer model. It is a network of
Architecture Clients (typically web browsers)
networks, meaning that it connects
send requests to web servers,
various autonomous systems and
which then respond by serving
networks that use the TCP/IP
Basis of
Internet World Wide Web (WWW)
Comparison
protocol to communicate with each web pages and resources to the
other. clients.

The internet is a vast network that


World Wide Web is a service
Functionality connects computer networks all over
provided by the internet.
the world.
Browsing the World Wide
Web involves using web
The internet enables various means of
browsers (e.g., Chrome, Firefox,
Connectivity connectivity, including direct
Safari) to access websites and
and Browsing connections, wireless networks,
navigate through hyperlinks to
satellite links, and more.
view different web pages.

ARPANET was the first version of WWW was initially known as


First Version
the internet. NSFNET.
In comparison to the Internet,
The internet is primarily hardware-
Type WWW is more software-
based.
oriented.
The Internet utilizes the Transmission The World Wide Web utilizes
Protocol Control Protocol/Internet Protocol the Hyper Text Transfer
(TCP/IP). Protocol (HTTP).
The internet has a wide range of
The World Wide Web is
applications in entertainment,
Application utilized to access resources all
education, research, navigation,
across the world.
advertising, and banking.
HTTP

o HTTP stands for HyperText Transfer Protocol.


o It is a protocol used to access the data on the World Wide Web (www).
o The HTTP protocol can be used to transfer the data in the form of plain
text, hypertext, audio, video, and so on.
o This protocol is known as HyperText Transfer Protocol because of its
efficiency that allows us to use in a hypertext environment where there are
rapid jumps from one document to another document.
o HTTP is similar to the FTP as it also transfers the files from one host to
another host. But HTTP is simpler than FTP as HTTP uses only one
connection, i.e., no control connection to transfer the files.
o HTTP is used to carry the data in the form of MIME-like format.
o HTTP is similar to SMTP as the data is transferred between client and
server. The HTTP differs from the SMTP in the way the messages are sent
from the client to the server and from server to the client. SMTP messages
are stored and forwarded while HTTP messages are delivered immediately.

Features of HTTP:

o Connectionless protocol: HTTP is a connectionless protocol. HTTP client


initiates a request and waits for a response from the server. When the server
receives the request, the server processes the request and sends back the
response to the HTTP client after which the client disconnects the
connection. The connection between client and server exist only during the
current request and response time only.
o Media independent: HTTP protocol is a media independent as data can be
sent as long as both the client and server know how to handle the data
content. It is required for both the client and server to specify the content
type in MIME-type header.
o Stateless: HTTP is a stateless protocol as both the client and server know
each other only during the current request. Due to this nature of the protocol,
both the client and server do not retain the information between various
requests of the web pages.

AD
HTTP Transactions
The above figure shows the HTTP transaction between client and server. The client
initiates a transaction by sending a request message to the server. The server replies
to the request message by sending a response message.

Messages
HTTP messages are of two types: request and response. Both the message types
follow the same message format.
Response Message: The response message is sent by the server to the client that
consists of a status line, headers, and sometimes a body.

Working of HTTP
First of all, whenever we want to open any website then first open a web browser
after that we will type the URL of that website (e.g., www.facebook.com ). This
URL is now sent to Domain Name Server (DNS). Then DNS first check records
for this URL in their database, then DNS will return the IP address to the
web browser corresponding to this URL. Now the browser is able to send
requests to the actual server.
After the server sends data to the client, the connection will be closed. If we
want something else from the server we should have to re-establish the
connection between the client and the server.
HTTP Connection
HTTP Request
HTTP request is simply termed as the information or data that is needed by
Internet browsers for loading a website. This is simply known as HTTP Request.
There is some common information that is generally present in all HTTP
requests. These are mentioned below.
• HTTP Version
• URL
• HTTP Method
• HTTP Request Headers
• HTTP Body

HTTP Request Headers


HTTP Request Headers generally store information in the form of key-value
and must be present in each HTTP Request. The use of this Request Header is to
provide core information about the client’s information, etc.
HTTP Request Body
HTTP Request Body simply contains the information that has to be transferred.
HTTP Request has the information or data to be sent to these browsers.
HTTP Method
HTTP Methods are simply HTTP Verb. In spite of being present so many HTTP
Methods, the most common HTTP Methods are HTTP GET and HTTP POST.
These two are generally used in HTTP cases. In HTTP GET, the information is
received in the form of a website.
For more, refer to the Difference Between HTTP GET and HTTP POST.
HTTP Response
HTTP Response is simply the answer to what a Server gets when the request is
raised. There are various things contained in HTTP Response, some of them are
listed below.
• HTTP Status Code
• HTTP Headers
• HTTP Body

HTTP Response
HTTP Response Headers
HTTP Response headers are simply like an HTTP Request where it has that
work to send some important files and data to the HTTP Response Body.
HTTP Response Body
HTTP Responses are the responses that are received successfully upon the
request. Generally, it comes under the requests generated by the web. In most
cases, the request is of transferring the HTML data into a webpage.
HTTP Status Code
HTTP Status Codes are the 3-Digit codes that tell the message or simply tell us
about the HTTP Request whether it has been completed or not. There are simply
5 types of status codes.
• Informational
• Successful
• Re-directional
• Client-Error
• Server-Error
.
History of HTTP
Tim Berners Lee and his team at CERN get credit for inventing original HTTP
and associated technologies.
• HTTP version 0.9: This was the first version of HTTP which was
introduced in 1991.
• HTTP version 1.0: In 1996, RFC 1945 (Request For Comments) was
introduced in HTTP version 1.0.
• HTTP version 1.1: In January 1997, RFC 2068 was introduced in
HTTP version 1.1. Improvements and updates to the HTTP version 1.1
standard were released under RFC 2616 in June 1999.
• HTTP version 2.0: The HTTP version 2.0 specification was published as
RFC 7540 on May 14, 2015.
• HTTP version 3.0: HTTP version 3.0 is based on the previous RFC
draft. It is renamed as Hyper-Text Transfer Protocol QUIC which is a
transport layer network protocol developed by Google.
Characteristics of HTTP
HTTP is IP based communication protocol that is used to deliver data from
server to client or vice-versa.
• The server processes a request, which is raised by the client, and also
server and client know each other only during the current bid and
response period.
• Any type of content can be exchanged as long as the server and client are
compatible with it.
• Once data is exchanged, servers and clients are no longer connected.
• It is a request and response protocol based on client and server
requirements.
• It is a connection-less protocol because after the connection is closed, the
server does not remember anything about the client and the client does not
remember anything about the server.
• It is a stateless protocol because both client and server do not expect
anything from each other but they are still able to communicate.
Advantages of HTTP
• Memory usage and CPU usage are low because of fewer simultaneous
connections.
• Since there are few TCP connections hence network congestion is less.
• Since handshaking is done at the initial connection stage, then latency is
reduced because there is no further need for handshaking for subsequent
requests.
• The error can be reported without closing the connection.
• HTTP allows HTTP pipe-lining of requests or responses.
Disadvantages of HTTP
• HTTP requires high power to establish communication and transfer data.
• HTTP is less secure because it does not use any encryption method like
HTTPS and use TLS to encrypt regular HTTP requests and response.
• HTTP is not optimized for cellular phones and it is too gabby.
• HTTP does not offer a genuine exchange of data because it is less secure.
• The client does not close the connection until it receives complete data from
the server; hence, the server needs to wait for data completion and cannot
be available for other clients during this time.
Web browser and web server
The terms web browser and web server are very common in the field of computer
science and Internet, however people often get confused between the two. The most
basic difference between a web browser and a web server is that a web browser is
an application software which is used to browse and display webpages available
over the Internet, whereas a web server is a software hosted on a dedicated
computer which provides these documents when requested by web browsers.
Read through this article to know more about web browsers and web servers and
how they are different from each other.

What is a Web Browser?


A web browser is an application software that can process and display a web page
on the internet. The web browser is capable to make a request for web services and
documents to web server. It acts an interface between the server and the client.
As web browser is a software, thus it is to be installed on the client computer and
used to surf the internet for websites and web pages. Popular examples of web
browsers include Google Chrome, Microsoft Internet Explorer, Microsoft Edge,
Mozilla Firefox, Opera Mini, etc.

What is a Web Server?


A web server is a dedicated computer that sends web based documents to the
client’s computer when request through the web browser. A web server accepts
HTTP request from the client’s browser, processes it to find the required document,
and then sends a suitable response to client machine.
A web server is a basically a set of hardware and software whose primary
function is to serve web-based material through the internet on demand.
Difference between Web Browser and Web Server
The following table highlights the important differences
between a Web Browser and a Web Server −
Factor Web Browser Web Server
Purpose Web Browser is a software which Web server is a software which
is used to browse and display provides these documents when
pages available over internet. requested by web browsers.
Function A web browser sends request to Web server sees and approves
server for web based documents those requests made by web
and services. browsers and sends the
document in response

Process Web browsers send HTTP Request Web servers receive HTTP
and receive HTTP Response Request and send HTTP
Response.
Processing Web browser has no processing Web servers follow three major
Model model. processing models: process based,
thread based or hybrid.
Data Storage Web browsers stores user data in Web servers provide an area to
cookies in local machine. store the website.
Installation Web Browser is installed on user's Web servers can be installed
machine. anywhere but it need to be on a
network or on local computer.
Acts as Web browser acts as an interface Web servers act as the sender of
between the web server and client. web resources like web pages.
Responsibility Web browser is responsible to Web server is responsible for
request for a website or webpage hosting websites, processing web
located on the internet. requests, and sending the
demanded document to the
client.
Example Examples of web browsers include Apache Server is an example of
Google Chrome, Internet explorer, a web server.
Mozilla Firefox, etc.
Web 1.0 Web 2.0 Web 3.0

Read-only web Read-write web Read-write-execute web

The first stage of the The second stage of the The third stage of the
internet internet internet

The purpose is
It is about interaction It aims at immersion
information sharing

Content will be
The content was
Shared content collectively owned and
owned
shared

More of a simple and


More of a social web It is a semantic web
passive web

Focuses on connecting Focuses on connecting Revolves around


information people connecting knowledge

Web-based intelligent
Introduction of web
Static websites functionalities and
applications
applications

Better interaction Designed to deliver a


No or little interaction
between server and personalized web
between server and user
user experience to the users
Technologies related to Associated technologies Technologies related to
Web 1.0 include Web include Web 3.0 include
and File Servers, HTML, Ajax, JavaScript, CSS, Blockchain, AI,
and Portals and HTML5 decentralized protocols

Conclusion
The most significant difference between a web browser and a web server is that a
web browser is an application software developed to display a web-based material
like a webpage, while a web server is a huge computer which serves the requested
material through the Internet to the client computer.
Web 1.0
Web 1 is the first phase of the World Wide Web. It was created as a
hyperlinked information system. It had a huge library of data sourced on a
screen from computer systems in the network for users to browse.
This phase was dominated by static websites which were used to display some
information. These websites had no or very little interaction capabilities.
In this stage, there were only a few content creators. The majority of users of
web 1.0 were consumers of content. An example of a web 1.0 website is CNN.
Here are some features of Web 1.0:
• Read-only web
• Static web pages
• One-way publishing medium
• Page hyperlinking and bookmarking
• Content served from the server’s file system
• HTML forms are sent via email
• Only text mails could be written and sent – no option to attach images
• Use of frames and tables place and align the elements on a webpage
• Content comes from filesystem of the server instead of an RDBMS
Web 2.0
Web 2.0 or participative social web started coming into the picture in the late
1990s. It is an improved version of Web 1.0. It focuses on modifying the way
web pages are designed and used.
The differentiating factor of Web 2.0 compared to Web 1.0 is that Web 2.0
allows users to create content. It also enables social media kind of interactions.
Users can interact in the form of likes, comments, sharing their photos or videos,
etc.
In web 2.0 or the current iteration of the web, tech giants such as Facebook
(Meta), Twitter, Google, Apple, Microsoft, and Amazon, control how our personal
data will be used. These companies use algorithms that decide the information
that we consume.
A few ways how internet users in web 2.0 interact and share their thoughts,
opinions, and experiences include:
• Social media
• Social networking
• Tagging
• Blogs
• Podcasts
• Web content voting
Here are some features of Web 2.0:
• Read-write web
• Also known as participative web and social web
• Dynamic content with high responsiveness to user inputs
• User-generated content
• Interoperability for end-users
• User-friendly
Web 3.0
It is the third generation of the internet that uses blockchain, cryptocurrencies,
and metaverses. It focuses on taking back the control of the internet and our
data from the giant tech companies.
In web 2.0, our data is under the control of centralized organizations. With
web3, the data will reside on blockchain networks. Therefore, users will be the
owners of their own data. The owners get to decide the ways in which they want
to share it.
Web 3.0 leverages encryption and distributed ledger technology to address the
challenges related to trust. It also focuses on more privacy and security for users.
Here are some features of Web 3.0:
• Read-write-interact web
• Powered by blockchain
• The use of decentralized network provides data control of owners
• 3D visuals and graphics
• Uses artificial intelligence (AI) to provide fast results with accurate real-
time insights
• Support for semantic web that understands the meaning of words
• Use of advanced authorization mechanisms for the protection of user data
and identity
Difference between Web 1.0, Web 2.0 and Web 3.0
Let us talk about the differences between Web 1.0, Web 2.0, and Web 3.0.

Parameters Web 3.0 Web 2.0 Web 1.0

Basics Portable and Wildly Read- Mostly Read-


Personal Write Only

Primary Focus Individual Focus Community Focus Company


Focus

Example of Content Live-streams/Waves Blogs/Wikis Home Pages

Focus of Content Consolidating Content Sharing Content Owning


Content

Example of Smart Applications Web Applications WebForms


Interaction

Interaction Based User Behaviour Tagging Directories


On

Monetization User Engagement Cost Per Click Page Views


Strategy

Type of Advertising Behavioural Interactive Banner


Advertising Advertising Advertising
Website Example The Semantic Web Wikipedia Britannica
Online

Languages and RDF/RDFS/OWL XML/RSS HTML/Portals


Backlinks

Web Terminology - Definitions


Viewing a Website
Domain name
A domain name is a unique name (e.g. google.com) used to identify the location of
a website on a web server. When you access a website through a web browser
like Google Chrome, the domain name is translated to an Internet Protocol (IP)
address (e.g. 172.217.12.206) which represents the server on which the website
is hosted. This translation is dynamically performed by a domain name server
(DNS).
DNS
Domain Name Servers are the Internet’s equivalent of a phone book or directory.
They keep an updated list of domain names and translate them back into IP
addresses. This is necessary because, although domain names are easy for
humans to remember, computers (which include all devices that can connect to
the internet) use IP addresses to access websites.
URL
A Uniform Resource Locator (URL) is the file address of a resource on the
Internet. A URL can represent a web page, an image, a video, a style sheet, and
much more. A URL typically consists of several components, including but not
limited to:
Web Server
Web Servers are used to store, process and deliver web pages to clients (e.g. web
browsers like Google Chrome). Although you could purchase a server to run your
website, most businesses use cloud storage services such as Amazon Web Services
(AWS) to host websites due to the cost savings and added security.
Web Browser
A web browser, often referred to as a “browser” is a software application used
for accessing information on the Web. Some of the most popular web browsers
include Chrome, Firefox, Safari, Internet Explorer, Edge, and Opera. As with all
software applications, web browsers have different features and functionality,
appealing to different user types.IP Address
An Internet Protocol address is a unique numerical label assigned to each device
connected to the Internet, including computers, phones, tablets, servers, cars, and
more. A quick Google search for “what is my IP address” will bring up the IP
address associated with your internet connection.
HTML, CSS, JavaScript
Websites consist of an array of content (like images and text) as well as
interactive elements like forms, games, and animations. To render the desired
results, your web browser relies on programming languages, primarily HTML,
CSS and Javascript, to accomplish what is required.
HTML, which stands for HyperText Markup Language, is used to identify the
different elements on a page such as paragraphs, titles, images, and links.
CSS, also known as Cascading Style Sheet, is used to give your website its style.
Fonts, colors, sizes, spacing, borders, backgrounds, and shadows are just a few
examples of what can be adjusted using CSS.
Javascript is a client side programming language which means the source code is
processed by the client’s web browser rather than on the Web server. This allows
the language to run actions on a page after a page has been fully loaded. An
example would be sending an error message if there are missing form fields
without having to submit the request to the Web Server.
HTTP/HTTPS
HyperText Transfer Protocol Secure (HTTPS) is the secure version of HTTP, the
protocol over which data is sent between your browser and the website you’re
connected to. To ensure a safe connection for your website, you will need to get an
HTTPS certificate.
Web Development
Web development refers to the creating, building, and maintaining of websites.
It includes aspects such as web design, web publishing, web programming, and
database management. It is the creation of an application that works over the
internet i.e. websites.
Web Development
The word Web Development is made up of two words, that is:
• Web: It refers to websites, web pages or anything that works over the
internet.
• Development: It refers to building the application from scratch.
Web Development can be classified into two ways:
• Frontend Development
• Backend Development

Frontend Development
The part of a website where the user interacts directly is termed as front end. It
is also referred to as the ‘client side’ of the application.
Frontend Roadmap
Frontend Development Roadmap
Popular Frontend Technologies
• HTML: HTML stands for HyperText Markup Language. It is used to
design the front end portion of web pages using markup language. It acts
as a skeleton for a website since it is used to make the structure of a
website.
• CSS: Cascading Style Sheets fondly referred to as CSS is a simply
designed language intended to simplify the process of making web pages
presentable. It is used to style our website.
• JavaScript: JavaScript is a scripting language used to provide a dynamic
behavior to our website.
• Bootstrap: Bootstrap is a free and open-source tool collection for creating
responsive websites and web applications. It is the most popular CSS
framework for developing responsive, mobile-first websites. Nowadays, the
websites are perfect for all browsers (IE, Firefox, and Chrome) and for all
sizes of screens (Desktop, Tablets, Phablets, and Phones).
• Bootstrap 4
• Bootstrap 5
Backend Development
Backend is the server side of a website. It is part of the website that users
cannot see and interact with. It is the portion of software that does not come in
direct contact with the users. It is used to store and arrange data.
Backend Roadmap

Backend Development Roadmap


Popular Backend Technologies
• PHP: PHP is a server-side scripting language designed specifically for
web development.
• Java: Java is one of the most popular and widely used programming
languages. It is highly scalable.
• Python: Python is a programming language that lets you work quickly and
integrate systems more efficiently.
• Node.js: Node.js is an open source and cross-platform runtime
environment for executing JavaScript code outside a browser.

What is website maintenance?


Website maintenance is the practice or activity of consistently keeping a website
up-to-date and operationally sound in coherence with the client's business
standards regardless of type, whether a personal website or business, to ensure
website security, maximize traffic growth, enhance website performance, provide
a better user experience, improve search engine optimization, and gain better
search engine results for specific target keywords.
Everyday site maintenance tasks include:
• Adding new website content, updating existing content, and removing
outdated content.
• Updating essential codes, plugins, integrations, or software, such as its
content management system, to ensure compatibility and security.
• Monitoring the website for security threats and promptly fixing
vulnerabilities when they occur.
• Checking web hosting providers for possible outages.
• Optimizing the website's loading speed and site's performance.
• Improving the website's usability, accessibility, and overall user experience.
• Optimizing the website's content and structure to improve ranking on
search engines.
Factors That Make Web Maintenance Essential
Web maintenance is an essential part of every business, regardless of size. It can
impact every single element of your business efforts, whether it's functionality,
sales, security, or SEO. No matter how big or small your business is, if your
website needs to be fixed or updated, you may miss out on essential opportunities,
including gaining potential customers and leads.
Website maintenance can be helpful on the following factors:
First Impression. Data research shows that 94% of users' first impressions are
related to website design. An outdated website can create a negative first
impression on your brand and overall business.
Brand Reputation. A well-maintained website can reinforce your brand
reputation and help you establish trust with your target audience.
Security. More frequent maintenance helps protect against security threats, such
as hacking attempts and malware infections. This is crucial if you collect
sensitive information from your visitors or customers.
Performance. A regularly updated website is anticipated to run smoothly at
absolute capacity. This can enhance the user experience and lower the likelihood
of users growing frustrated and leaving your site.
Compatibility. New technologies and integrations can break your website if not
given enough attention. Implementing website maintenance services can help
ensure that your site remains accessible and is compatible across multiple
technology channels.
Search Engine Optimization (SEO). Popular search engines such as Google and
Bing have continuously evolving algorithms, and a website that needs to be
regularly updated may fall behind in search engine results pages. Performing
regular website maintenance to your website content and keeping everything up-
to-date can help improve its SEO and attract more organic traffic.
Types of Website Maintenance
Website deployment doesn't mean the end of the project cycle. Websites need
routine maintenance to prevent any security, server outages, or performance
issues to ensure that the site's performance produces high-quality metrics and
visibility on search engines at all times.
A post-launch website maintenance plan can consist of challenging tasks that
are essential to ensure that the website runs smoothly 24/7. Having said that,
there are different types of website maintenance that you need to be aware of to
understand better the maintenance services and workflow that needs to be
implemented:
Regular updates
This can include creating and storing backups of the website's data and files for
future recovery and updating the website's software, plugins, and other technical
components. In addition, checking for possible outages on hosting providers and
renewing name records can also be a part of regular updates.
Security maintenance
Security is an integral part of the daily maintenance tasks as it addresses
vulnerabilities and the potential external threat that could break the entire
website operation. This can include installing patches or updates and checking
for compatibility issues when they occur.
Performance optimization
Monitoring and optimizing website performance can ensure that the website is in
its best state and running as expected. Performance optimization can help
improve a website's speed, responsiveness, and overall website's SEO. Tasks may
include minifying HTML, CSS, and JS files, optimizing images for faster
loading, and removing web elements that can disrupt the website's loading state.
Content updates
Website content must be kept up-to-date and relevant in order to be effective.
This is an essential maintenance plan factor, reflecting your website's relevancy
for a specific time or season. This includes adding fresh content or updating text,
images, or other media and removing outdated content on the website. In
addition, it may also involve refreshing the website's visual design, layout, and
branding to make your website look modern and professional to attract more
visitors and potential customers.
Web hosting and web publishing

Web Hosting: Web Hosting, as the name suggests, is a type of internet hosting
that allows one to make their website available to users through WWW using
the internet. It simply provides space to the website on a web server so that files
or data of the website can be stored. It is the best way to increase the growth of
business and user experience.
Note: There are few popular free Web Hosting Platforms are Hostinger,
Bluehost, etc.
Web Publishing: Web Publishing, as the name suggests, is the process of creating
a website and placing it on the web server, and published content may include
text, images, videos, and other types of media. Its main aim is to facilitate
communication simply by adding context through style, emotion, and space. It is
also known as online publishing.
Web Hosting vs Web Publishing

Web Hosting Web Publishing

It is a process of publishing or
It is a process of using server to host
uploading original content on
website.
Internet.
Web Hosting Web Publishing

Its process includes uploading


Its process includes proving space in
files, updating web pages, posting
server to store web pages, make it
blogs, posting content to web
available for viewing online, etc.
pages, etc.

It allows customization from


It allows for advanced level of
beginner to advanced levels
customization.
depending on platform.

It makes website available to


It provides space in internet for website.
view on your domain.

It provides ready-made themes


It does not come with premade website.
to develop website.

It allows for one-click installs and It is limited to WordPress and


endless customization of self-hosted NYU-approved theme and
WordPress, Scalar, and Omeka. plugins.

Its main aim is to provide storage space


Its main aim is to communicate
for website or application on server on
ideas and concepts between
internet so that it can be accessed by other
people.
computers connected to internet.
Web Hosting Web Publishing

Its component includes domain


Its component includes Web Browser,
name planning and registration,
FTP Client, Database Server, FTP Server,
web hosting, web design and
etc.
development, etc.

It allows one to store website online, It allows one to include colorful


provide technical support, data designs, photographs, and text in
management, more security, etc. webpage for less cost.

What are search engines and how do they work?

What is a search engine?

A search engine is an online tool that is designed to search for websites on the
internet based on the user’s search query.

Every time a web crawler finds a new webpage through a link, it scans and passes
its content for further processing (called indexing) and continues in the discovery of
new webpages.

How do Search Engines Work?


Search engines are generally working on three parts that are crawling, indexing,
and ranking
1. Crawling: Search engines have a number of computers programs that are
responsible for finding information that is publicly available on the internet.
These programs scan the web and create a list of all available websites. Then
they visit each website and by reading HTML code they try to understand the
structure of the page, the type of the content, the meaning of the content, and
when it was created or updated. Why crawling is important? Because your first
concern when optimizing your website for search engines is to make sure that
they can access it correctly. If they cannot find your content you won’t get any
ranking or search engine traffic.
2. Indexing: Information identified by the crawler needs to be organized,
Sorted, and Stored so that it can be processed later by the ranking algorithm.
Search engines don’t store all the information in your index, but they keep things
like the Title and description of the page, The type of content, Associated
keywords Number of incoming and outgoing links, and a lot of other parameters
that are needed by the ranking algorithm. Why indexing is important? Because
if your website is not in their index it will not appear for any searches this also
means that if you have any pages indexed you have more chances of appearing in
the search results for a related query.
3. Ranking: Ranking is the position by which your website is listed in any
Search Engine. (There are three steps in which ranking works).
• Step 1: Analyze user query – This step is to understand what kind of
information the user is looking for. To do that analyze the user’s query by
breaking it down into a number of meaningful keywords. A keyword is a
word that has a specific meaning and purpose, for example when you type
how to make a chocolate cupcake search engines know that you are looking
for specific information so the results will contain recipes and step-by-
step instructions. They can also understand the meaning of how to change
a light bulb is the same as how to replace a light bulb search engines are
clever enough to interpret spelling mistakes also.
• Step 2: Finding matching pages – This step is to look into their index and
find the best matching pages, for example, if you search dark wallpaper
then it gives you the result of images, not text.
• Step 3: Present the results to the users – A typical search results page
includes ten organic results in most cases it is enriched with other elements
like paid Ads, direct answers for specific queries, etc.

Performance of Search Engine


The performance of search engine is determined by 2 requirements. They are:
• Effectiveness (quality of result).
• Efficiency (Response time & through put).
Components of Search Engine
There three components in search engine. They are web crawler, data base, and
search interface:
• Web crawler: A search engine uses multiple web crawlers to crawl through
world wide web and gather information. It is basically a software which
is also known bat or spider.
• Data base: The information which is gathered by web crawler by crawling
through internet is stored on the database.
• Search Interface: Search interface is just an interface to the data base
which is employed by the user to search through the data base.
Basic building blocks of search engine:
There are basically two building blocks which perform various activities.
• Indexing
• Querying
1. Indexing: Indexing Indexing performs mainly 3 activities text acquisition,
text transformation index creation.
i)Text acquisition: Text acquisition basically identifies and stores documents
into data base for indexing.
• It convert variety of documents into a consistent data Format.
• It also stores text meta data and other related information of document.
ii) Text transformation: It transforms document into indexed terms.
• Parser: It recognizes the “words’ in the text with the help of tokenizer
and process the sequence of text tokens to recognize structural pattern.
• Stopping: Removes stop words like “and”, “or”, “the”.
• Stemming: It groups together all the words derived from same stem.
• Link analysis: It is used identify the popularity page. It uses links & of &
anchor text from web pages.
• Information extraction: Information extraction identifies classes of index
terms which are important for some application.
• Classifier: Identifies class related data of document.
iii) Index creation:
• Document statistics: It collects the features like position & count of
words.
• Weighing: Calculates weights of index terms.
• Inversion: As the format of inverted files is fast for query processing it
converts document term information to term document information
2. Querying: It consists following three tasks
• User interaction: User interaction provides a query input which gives an
interface and parser for query language. Then it transforms the query by
improving query. Then it shows the output by Constructing the display of
ranked documents for a query.
• Ranking: It first calculates the score of document by using ranking
algorithms. It processes query in distributed
environment.
• Score: qi*di , Where qj & di are term weights for term i query and
document
• Evaluation: It this step it logs user queries & interaction for improving
search engines efficiency & effectiveness.
Usage of Search Engine
Search engines have so many usages and some of them are:
• Searching for information: People use a search engine to search for any
kind of information present on the internet. For example, Rohit wants to
buy a mobile phone but he does not know which one is the best mobile
phone. So he searches “best mobile phones in 2021” in the search engine
and gets the list of best mobile phones along with their features, reviews,
and prices.
• Searching images and videos: Search engines are also used to search
images and videos. There are so many videos and images available on the
internet in different categories like plants, animals, flowers, etc., you can
search them according to your need.
• Searching location: Search engines are also used to find locations. For
example, Seema is on a Goa trip but she doesn’t know the location of
Palolem beach. So she searches “Palolem beach” on the search engine and
then the search engine gives the best route to reach Palolem beach.
• Searching people: Search engines are also used to find people on the
internet around the world.
• Shopping: Search engines are also used for shopping. Search engines
optimize the pages to meet the needs of the user and give the lists of all the
websites that contain the specified product according to the best price,
reviews, free shipping, etc.
• Entertainment: Search engines are also used for entertainment purposes.
It is used to search videos, movies, games, movie trailers, reviews of
movies, social networking sites, etc. For example, Rohan wants to watch a
movie named “Ram”, then he searches this movie on a search engine and
the search engine returns a list of links (of the websites) that contain the
Ram movie.
• Education: Search engines are also used for education. With the help of
search engines, people can learn anything they wanted to learn like
cooking, programming languages, home decorations, etc. It is like an open
school where you can learn anything for free.

You might also like