You are on page 1of 10

CSE2015 Class Material

Introduction To Internet
Internet Overview- Networks – WWW –Web Protocols ––
Web Organization and Addressing – Internet Service
Module 1 Providers, DNS Servers, Connection Types, Internet
Addresses – Web Browsers and Web Servers -Security and
Dr. MareeswariV Vulnerability-Web System Architecture – URL – Domain
Assistant Professor(Sr) Name – Web Content Authoring - Webserver Administration
SITE, VIT, Vellore – Search Engines
Cabin No: 210-A30

2 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

The Internet
 The Internet is a global network comprised of smaller networks that Web Organization
are interconnected using standardized communication protocols. The
 No one person, company, organization or government runs
Internet standards describe a framework known as the Internet the Internet. It is a globally distributed network comprising many
protocol suite. This model divides methods into a layered system of voluntarily interconnected autonomous networks.
protocols.  ISOC is a voluntary membership organization whose purpose is to
promote global information exchange through Internet technology.
 It is a network of networks, that consists of millions of private, public,
 ISOC appoints the IAB- Internet Architecture Board. They meet
academic, business and government networks of local to global regularly to review standards and allocate resources, like addresses.
scope…  IETF- Internet Engineering Task Force. Another volunteer
organization that meets regularly to discuss operational and technical
problems.
 Internet services were launched in India on 15th August, 1995 by Videsh
Sanchar Nigam Limited. In November, 1998, the Government opened
up the sector for providing Internet services by private operators.

3 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 4 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Dr. Mareeswari V, SITE, VIT University, Vellore 1


CSE2015 Class Material

Web Protocols
MODERN USES OF INTERNET  A protocol is simply a standard for enabling the connection, communication,
and data transfer between two places on a network. Here are some of the key
protocols that are used for transferring data across the Internet.
 The internet can be accessed almost anywhere  HTTP : Hypertext Transfer Protocol. It is the standard protocol for
by numerous means including mobile internet transferring web pages (and their content) across the Internet.
 HTTPS : Hypertext Transfer Protocol over Secure Socket Layer (SSL). For a
services… website to use HTTPS it needs to have an SSL certificate installed on the
server. These are usually issued by a trusted 3rd party, referred to as a
Certificate Authority (CA). When you browse a web page using HTTPS, you
can check the details of the SSL certificate. For example, you could check the
 The Internet allows computer users validity of it.
 FTP : File Transfer Protocol. It is used to transfer files across the Internet.
to remotely access other computers and FTP is commonly used by web developers to publish updates to a website
(i.e. to upload a new version of the website).
information stores easily, wherever they may  Every HTTP request also uses TCP and IP. The Web is just one of the
be. applications built on top of the Internet protocols

5 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 6 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

World Wide Web Web addresses


 TheWeb is a only way to access information through the Internet.  Just like every house has a postal code, each Web page has an address describing
where it can be found. On the Web these addresses are called URLs (Uniform
 The world wide web is a sophisticated system for universal
Resource Locator).
information capture and delivery. http://www.google.com/services/index.htm
 The Web is a large number of computer documents or "Web
pages" that are stored on computers around the world and are  http:// - Protocol - This part of the address indicates that it is aWeb page.
connected to one another using hyperlinks. These Web pages can be  www - This indicates that the Web page you are looking at is part of the World
seen by anyone through their computer's "Web Browser," which is WideWeb. Many Web sites do not use www but are still part of theWeb.
the program you are using now.  google.com - This part of the address is the domain name and indicates the
 A group of Web pages that follow the same theme and are connected unique address of aWeb site.
together with hyperlinks is called a "Web site." Web sites and Web  /services/ - The "/" symbol indicates you have moved into a specific directory in
pages are written in a coding language that makes it possible to add the Web sites. Directories are like the folders on your computer and help to
organizeWeb pages in aWeb sites.
pictures, sound and interactivity to plain old text, making people's
 index.htm - A word with ".htm" or "html" following it indicates the name of the
reading experience more exciting. specific page in theWeb site you are looking at.

7 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 8 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Dr. Mareeswari V, SITE, VIT University, Vellore 2


CSE2015 Class Material

Domain Affiliations IP Address


Domain Affiliations  IP addresses are how computers identify other computers on the
arts cultural and entertainment activities Internet.
com business organizations
edu educational sites  The number of IP addresses provided by a 32-bit code (IPv4) is
firm businesses and firms insufficient for the Internet’s current growth trajectory
gov government sites  How many different addresses does a 32-bit number provide?
info information service providers
mil military sites
232 = 4,294,967,296
nom individuals  IPv6 is going to increase the address space to 128 bits, represented as eight
net networking organizations groups, separated by colons, of four hexadecimal digits.
org organizations
Eg: 2001:0db8:0000:0000:0000:8a2e:0370:7334
rec recreational activities
store businesses offering goods for purchase
web entities related to World Wide Web activities
net networking organizations
9 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 10 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

DNS
 The domain name system (DNS) gives us humans an easy way to DNS
identify where we want to go on the Internet.
 Consider the website:
 Behind the scenes, each domain name maps to an IP address. When
we type a URL in the address bar of our browser, the computer has  blog.gardeningknowhow.com
to figure out its IP address.  That URL leads to the blog of a gardening tips & tricks
 Find the IP address our VIT university website, google and website.
popular websites … refer ipaddress.com  First level domain – com
 Third level domain – blog

11 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 12 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Dr. Mareeswari V, SITE, VIT University, Vellore 3


CSE2015 Class Material

Connect to the internet


Web browsers
 Web browsers are programs used to explore the
Internet. Software that creates a unique hypermedia-
based menu on your computer screen and provides a
graphical interface to the Web.
 There are many Web browser programs available
including Netscape Navigator, Internet Explorer and
Opera, Google Chrome, Mozilla Firefox.

Mobiles and PDA’s 3G USB Dongle 3G


13 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 14 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Internet Service Provider


 ISP is a company that offers services to us to connect with
the internet.The following table shows the top 10 ISPs in
India by total subscriber base as of 31 March
2020. Broadband is defined as "an always-on Internet
connection with download speed of 512 kbit/s or above."

Now this file is converted to binary code by the browser and it is sent down the wires if we are
connected through Ethernet and if we are using WiFi, first it converts it to radio signal which is
decoded by router in a very low level. It is converted to binary and then sent to the servers.

15 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 16 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Dr. Mareeswari V, SITE, VIT University, Vellore 4


CSE2015 Class Material

RISK PREVENTION
RISKS !!! What do you need to do to minimise some or all of the risk
when accessing the internet ?

Viruses, attack, hacking, identity theft, fraud, FIREWALL


VIRUS SCANNER
getting passwords, stealing bank details,
INSTALL ADWARE AND MALWARE SCANNERS
intercepting emails, theft of data, posting
BE AWARE OF EMAIL SCAMS
unwanted videos, posting unwanted BE CAREFUL WHEN USING PAYMENT SITES
text……….. BE CAREFUL DOWNLOADING AND UPLOADING
USE COMPLEX PASSWORDS

17 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 18 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Web Servers How the WWW Works


 A web server is a computer with special software to host web
pages and web applications.
 A web server serves web pages to clients across the Internet or
an Intranet. The web server hosts the pages, scripts, programs,
and multimedia files and serves them using HTTP, a protocol
designed to send files to web browsers and other protocols.
 A number of server-side technologies can be used to increase
the power of the server beyond its ability to deliver standard
HTML pages.
 These include CGI scripts, server-side includes, SSL security,
and Active Server Pages(ASPs)
19 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 20 01-03-2021
Dr.Mareeswari V/ AP(Sr) / SITE / VIT

Dr. Mareeswari V, SITE, VIT University, Vellore 5


CSE2015 Class Material

Example of an HTTP Request from a HTTP response from a Web server


Web browser
HTTP version Status code Reason
Command URL HTTP version HTTP/1.1 200 OK ]- Response Status
Date: Mon 06 Aug 2001 17:35:46 GMT ]- Date
Response
Server: NCSA/1.3 ]- Web server
GET http://www.kelley.indiana.edu/ardennis/home.htm HTTP/1.1 ]- Request Header
Location: http:// www.kelley.indiana.edu/adennis/home.htm ]- URL
Date: Mon 06 Aug 2001 17:35:46 GMT Line
Content-type: text/html ]- Type of file
User-Agent: Mozilla/6.0 ]-Web browser (this is Netscape) <html>
Referer: http://www.indiana.edu/~aisdept/faculty.htm Request Header
<head>
<title>Allen R. Dennis</title>
</head>
URL that contained the link to the requested URL Response
<body>
<H2> Allen R. Dennis </H2> Body
<P>Welcome to the home page of Allen R. Dennis</P>

</body>
</html>
21 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 22 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

HTTP Request Types System Architecture


 Multi-tier application (n-tier application)
 GET (default) and POST do basically the same thing: Send data
 Information tier (data or bottom tier)
from the client to the server. However, they have some
differences:  Maintains data for the application
 GET  Stores data in a relational database management system
 Appends form data directly to the end of the URL—visible (RDBMS)
to users (not suitable for sending passwords)  Middle tier
 Limited to 2,048 characters for the entire URL  Implements business logic and presentation logic
 Result page can be bookmarked and cached  Control interactions between application clients and
 POST application data
 Sends form data in the HTTP request—invisible to users  Client tier (top tier)
 Virtually no limit (but check your specific configuration)  Application’s user interface
 Results are not cacheable or bookmarkable
 Users interact directly with the application through the client
23 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 24 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021
tier

Dr. Mareeswari V, SITE, VIT University, Vellore 6


CSE2015 Class Material

N-tier Client-Server Architecture


3 tier Architecture

25 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 26 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Hosting a website: Self hosting Evolution of Web


 Install a web server on a computer
 Local access
 Using domain <localhost>
 or IP address 127.0.0.1
 Necessary for server-side programming development
 Global access
 Register a human-readable domain name
 Obtain IP address
 Static: Costs more
 Dynamic: Needs dynamic DNS system, e.g.
http://www.dyndns.com/

27 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 28 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Dr. Mareeswari V, SITE, VIT University, Vellore 7


CSE2015 Class Material

WEB 1.0 – WEB2.0 – WEB 3.0


READ – WRITE - EXECUTE
Web 2.0 Web 3.0
1. communication - Interactive 1. Engaged/Invested
2. Information- Dynamic 2. Portable & Personal
3. Focuses on community 3. Individual
4. Personal Interest – Blog/wikis/social media 4. Lifestreams
5. Content- Through sharing 5. Through curation
6. Interaction – With applications 6. With smart applications
7. Searching – keywords / tags 7. Contexts/relevances
8. Metrix – cost per click 8. User engagement
9. Advertising – interactive 9. Behavioral
10. Research - wikipedia 10. Semantic web
11. Technologies – flash, Java, XML 11. RDF, OWL

29 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 30 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Web content authoring


 Web content authoring is the process of creating page-
level content for your content management websites.
 Web content authoring comprises the following activities:
Defining the content records that populate the content areas of
your website pages, optionally using workflow and Easy Edit
features to help with this task.

31 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 32 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Dr. Mareeswari V, SITE, VIT University, Vellore 8


CSE2015 Class Material

Web content authoring entails the following Web site Testing


actions, performed as needed:  Save the sample invalid web page on to your local drive.
 Now try testing this file using the W3C HTML Validator.
 Content Designer: Managing content records  Now try testing this file using the W3C CSS Validator. Use the "Validate by File
 Managing images for use in content records Upload" method and upload the file that now resides on your local drive.
 Note: By default the CSS validator checks for valid CSS using the CSS 2.1
 Tracking your current and available authoring work specification. If you've used any CSS3 properties such as text-shadow, box-
 Content Approver : Approving content records for publication or shadow, border-radius, etc. these will trigger errors in the report. To
deletion correct for this, select More Options on the CSS validator home page, then select
"CSS Level 3" as your Profile.
 Content Editor : Revising website content  WAVE (Web Accessibility Evaluation Tool) : It is a suite of evaluation tools that
 Website Manager: Improving performance for remote servers
helps authors make their web content more accessible to individuals with
disabilities.
 Functional Accessibility Evaluator : The Functional Accessibility Evaluator
(FAE) evaluates a website or a single web page based on the W3C Web Content
Accessibility Guidelines (WCAG) 2.0 Level A and AA requirements.

33 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 34 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Search engine
Web hosting service
 A search engine is a software system that is designed to carry
 A web hosting service (often shortened to web host) is a type out web searches (Internet searches), which means to search
of Internet hosting service that allows individuals and organizations the World Wide Web in a systematic way for particular information
to make their website accessible via the World Wide Web. Web specified in a textual web search query.
hosts are companies that provide space on a server owned or leased  The search results are generally presented in a line of results, often
for use by clients, as well as providing Internet connectivity, referred to as search engine results pages (SERPs)
typically in a data center.  The information may be a mix of links to web pages, images,
videos, infographics, articles, research papers, and other types of files.
Some search engines also mine data available in databases or open
 Refer more in:
directories. Unlike web directories, which are maintained only by
https://en.wikipedia.org/wiki/Web_hosting_service human editors, search engines also maintain real-time information by
running an algorithm on a web crawler.
 Internet content that is not capable of being searched by a web search
engine is generally described as the deep web.

35 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 36 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Dr. Mareeswari V, SITE, VIT University, Vellore 9


CSE2015 Class Material

 A search engine maintains the following processes in near real time:  Crawling: Google searches the web with automated programs called crawlers,
 Web crawling looking for pages that are new or updated. Google stores those page addresses
(or page URLs) in a big list to look at later. We find pages by many different
 Indexing methods, but the main method is following links from pages that we already
 Searching know about.
 Indexing: Google visits the pages that it has learned about by crawling, and
 Web search engines get their information by web crawling from site to
tries to analyze what each page is about. Google analyzes the content, images,
site. The "spider" checks for the standard filename robots.txt, addressed to and video files in the page, trying to understand what the page is about. This
it. The robots.txt file contains directives for search spiders, telling it information is stored in the Google index, a huge database that is stored on many
which pages to crawl. After checking for robots.txt and either finding it computers.
or not, the spider sends certain information back to  Serving search results: When a user performs a Google search, Google
be indexed depending on many factors, such as the titles, page tries to determine the highest quality results. The "best" results have many
content, JavaScript, Cascading Style Sheets (CSS), headings, or its factors, including things such as the user's location, language, device (desktop
or phone), and previous queries. For example, searching for "bicycle repair
metadata in HTML meta tags. After a certain number of pages crawled, shops" would show different answers to a user in Paris than it would to a user
amount of data indexed, or time spent on the website, the spider stops in Hong Kong. Google doesn't accept payment to rank pages higher, and
crawling and moves on ranking is done algorithmically.
37 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 38 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

 As of January 2021,Google is the world's most used search engine,


with a market share of 91.86%, and the world's other most used
search engines were: References
 https://www.youtube.com/watch?v=HoVT7qIyarE
 https://developers.google.com/search/docs/basics/how-
search-works
 https://www.youtube.com/watch?v=BNHR6IQJGZs&feat
ure=youtu.be
 Find it? Deep web, Dark web, Darknet, DNS spoofing,

39 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021 40 Dr.Mareeswari V/ AP(Sr) / SITE / VIT 01-03-2021

Dr. Mareeswari V, SITE, VIT University, Vellore 10

You might also like