Professional Documents
Culture Documents
Internet
⚫Historyof the Internet
⚫Protocols
⚫Computer Networks
⚫DNS
⚫Web Servers, E-mail etc.
The “Victorian” Internet
⚫ Invented in the 1840s.
⚫ Signals sent over wires that
were established over vast
distances
⚫ Used extensively by the U.S.
Government during the
American Civil War, 1861 -
1865
⚫ Morse Code was dots and
dashes, or short signals and
long signals
⚫ The electronic signal standard
of +/- 15 v. is still used in
network interface cards today.
What is the Internet?
⚫ A large computer network that joins together many
organizations
⚫ It provides the infrastructure for e-mail, file archives,
hypertext documents, databases etc
⚫ The vast collection of computer networks which form
and act as a single huge network for transport of
data and messages across distances which can be
anywhere from the same office to anywhere in the
world.
(Written by William F. Slater, III, 1996, President of the
Chicago Chapter of the Internet Society)
A Short History…
⚫ 1968 - DARPA (Defense Advanced Research Projects Agency)
contracts with BBN (Bolt, Beranek & Newman) to create
ARPAnet
⚫ 1970 - First five nodes:
⚫ UCLA
⚫ Stanford
⚫ UC Santa Barbara
⚫ U of Utah, and
⚫ BBN
⚫ 1974 - TCP specification by Vint Cerf
⚫ 1984 – On January 1, the Internet with its 1000 hosts
converts en masse to using TCP/IP for its messaging
A Brief Summary of the
Evolution of the Internet Mosaic
Age of
eCommerce
Begins
WWW Created 1995
Internet Created 1993
Named 1989
and
Goes
TCP/IP TCP/IP
Created 1984
ARPANET 1972
1969
Hypertext
Invented
Packet 1965
Switching
First Vast Invented
Computer 1964
Network
Silicon Envisioned
A Chip 1962
Mathematical 1958
Theory of
Memex Communication
Conceived 1948
1945
1945 1995
Ideas from
1940s to 1969
1970 1995
Main documents for the
Internet
⚫ RFC (Request For Comments)
⚫ http://www.ietf.org/rfc/rfcXXXX.txt
Leased line
Point-to-Point
connection
between two PPP, HDLC,
Most secure Expensive
computers or SDLC, HNAS
Local Area
Networks (LANs)
Circuit switching
A dedicated
circuit path is
created between
28 Kb/s - 144
end points. Best Less Expensive Call Setup PPP, ISDN
Kb/s
example is
dialup
connections
Description Bandwidth Sample
Advantages Disadvantages
range protocols used
Packet switching Devices transport
packets via a
shared single
point-to-point or
point-to-
multipoint link
across a carrier
internetwork. Shared media X.25 Frame-
Variable length across link Relay
packets are
transmitted over
Permanent
Virtual Circuits
(PVC) or
Switched Virtual
Circuits (SVC)
Cell relay Similar to packet
switching, but
uses fixed length
cells instead of
variable length best for
Overhead can be
packets. Data is simultaneous use ATM
considerable
divided into fixed- of Voice and data
length cells and
then transported
across virtual
circuits
Network topologies
⚫ Bus
⚫ Ring, token ring
⚫ Star
⚫ Switch
⚫ Mesh
⚫ Hierarchical
⚫ Complex
Bus
Ring, token ring
This type of topology suffers from the same centralization flaw as the
Star Topology. If the device that is on top of the chain fails, consider the
entire network down. Obviously this is impractical and not used a great
deal in real applications.
Mesh
⚫ The Full-Mesh Topology connects
every single node together. This will
create the most redundant and
reliable network around- especially
for large networks
com Comercial
edu Education
Gov Government (www.odci.gov
=CIA)
Mil Military (www.army.mil = US
Army)
Org Non for profit organizations
Domain Names -
categorization
⚫ Top Level Domain:
⚫ Country name
Ro Romania
DE Germania
IT Italia
Ca Canada
US Statele Unite
Uk Marea Britanie (www.ox.ac.uk)
Jp Japonia
Vanity ccTLDs
TLDs which are used for various purposes outside their home countries, because of their name:
ad is a ccTLD for Andorra, but has recently been increasingly used by advertising agencies.
ag is a ccTLD for Antigua and Barbuda and is sometimes used for agricultural sites. In Germany, AG (short for Aktiengesellschaft) is
appended to the name of a stock-based company, similar to Inc. in USA.
am is a ccTLD for Armenia, but is often used for AM radio stations. → http://aprs.fi/
as is a ccTLD for American Samoa. In Denmark and Norway, AS is appended to the name of a stock-based company, similar to Inc.
in USA.
be is a ccTLD for Belgium. Widely used by small Bulgarian websites because it's cheaper than a bg ccTLD.
cc is a ccTLD for Cocos (Keeling) Islands but is used for a wide variety of sites.
cd is a ccTLD for Democratic Republic of Congo but is used for CD merchants and file sharing sites.
dj is a ccTLD for Djibouti but is used for CD merchants and disc jockeys.
fm is a ccTLD for the Federated States of Micronesia but it is often used for FM radio stations.
gg is a ccTLD for Guernsey but it is often used by the gaming and gambling industry (with "gg" being the abbreviation for "good
game"), particularly in relation to horse racing gee-gee.
im is a ccTLD for Island of Man but is often used by instant messaging programs and services.
in is a ccTLD for India but is widely used in the internet industry.
je is a ccTLD for Jersey but is often used as a diminutive in Dutch (e.g. "huis.je"), as "you" ("zoek.je" = "search ye!"), or as "I" in
French (e.g. "moi.je")
la is a ccTLD for Laos but is marketed as the TLD for Los Angeles.
li is a ccTLD for Liechtenstein but is marketed as the TLD for Long Island.
md is a ccTLD for Moldova, but is marketed exclusively to the medical industry (as in "medical domain" or "medical doctor").
mu is a ccTLD for Mauritius, but is used within the music industry.
nu is a ccTLD for Niue but marketed as resembling "new" in English and "now" in Nordic/Dutch. Also meaning "nude" in
French/Portuguese.
sc is a ccTLD for Seychelles but is often used as .Source
to is a ccTLD for Tonga but is often used as the English word "to", like "go.to"
tv is a ccTLD for Tuvalu but it is used for the television ("tv")/entertainment industry purposes.
ws is a ccTLD for Samoa (earlier Western Samoa) is marketed as .Website
vu is a ccTLD for Vanuatu but means "seen" in French.
How DNS works
⚫ DNS:
⚫ Recursive (DNS Server gives the full answer);
⚫ Non-recursive (DNS Server gives partial answer).
⚫ Finding IP address for www.unitbv.ro:
⚫ the client connects to its first configured DNS
server → IP for the host that hosts TLD .ro →
IP for one of the DNS servers holding
unitbv.ro information → www.unitbv.ro =
193.254.231.8 → connection to IP;
List of DNS record types
⚫ A record (1): address record, maps a hostname to a 32-bit IPv4 address.
⚫ AAAA record (28): IPv6 address record maps a hostname to a 128-bit
IPv6 address.
⚫ CNAME record (5): canonical name record is an alias of one name to
another. The A record to which the alias points can be either local or remote
- on a foreign name server. This is useful when running multiple services
(like an FTP and a webserver) from a single IP address. Each service can
then have its own entry in DNS (like ftp.example.com. and
www.example.com.). It is also used when running multiple HTTP servers,
with different names, on the same physical host.
⚫ MX record (15): mail exchange record maps a domain name to a list of
mail exchange servers for that domain.
⚫ ETC….
URL vs. Domain
⚫ URL: http://www.example.net/index.html
⚫ For finding resources in a computer
⚫ Domain name: example.net
⚫ For finding a host/alias in a domain
⚫ Registered domain name: example.net
⚫ For finding an organization on the Internet
Server Web/HTTP
⚫ The Web is a Client/Server Architecture
⚫ The Application that sends pages “over” the
World Wide Web is called HTTP Server
⚫ Default port for the HTTP server is 80
⚫ Retrieving the pages is made through an
URL of the following type
http://server.ext:port/path/.../file.ext
Server Web/HTTP
⚫ Default pages
⚫ They are configured at the installation of the web
server
⚫ Exemples:
⚫ Default.htm, default.html,
⚫ Index.htm, index.html
⚫ Default.aspx, index.aspx;
⚫ Default.php, index.php4
⚫ Etc……
How HTTP works
1. The browser connects to the remote computer at
the IP and port from the address/URL;
2. It sends (through HTTP) GET
/students/8000/note_IT.htm
3. The server sends back:
1. Status resource code (200 OK, 404 = not found, etc.);
2. File type indicator (html, image, etc);
3. File content
How HTTP works
⚫ If the file contains HTML, the browser:
⚫ Parses the file
⚫ For each URL found in the file, it connects back
to the server for getting the resources
⚫ The Procedure repeats
HTTP answer example
HTTP/1.0 200 OK
Server: Microsoft-IIS/4.0
Connection: keep-alive
Date: Fri, 09 Feb 2001 22:41:10 GMT
Content-Type: text/html
Accept-Ranges: bytes
Last-Modified: Fri, 09 Feb 2001 03:50:15 GMT
Content-Length: 5574
Folders in a Web Server
⚫ There is a root folder for the web server web
(c:\Inetpub\wwwroot);
⚫ At a request
GET /students/8000/note_TI.html
⚫ The Server sends back to the client
c:\Inetpub\wwwroot\students\8000\note_TI.html
Virtual Folders
⚫ E.G.:
⚫ Virtual Folder: News
⚫ Phisical Folder: E:\LocalNews
⚫ Root Web Server: c:\inetpub\wwwroot
⚫ At a Request for
⚫ GET /news/default.html
⚫ The Server sends back
⚫ E:\LocalNews\default.html
Virtual Web Servers
⚫ On ONE computer, a server may answer to
requests for multiple IP addresses
⚫ On ONE server/computer:
⚫ www.server.ro
⚫ www.server.com, etc.
⚫ Each with its own IP address
⚫ Or according to Host Header.
Logging visits
⚫ IN log that are generated:
⚫ Every hour;
⚫ daily;
⚫ weekly;
⚫ monthly, etc.
⚫ Log files may keep:
⚫ Client IP address;
⚫ Resource status;
Logging visits
⚫ Log files may keep :
⚫ Referrer;
⚫ server IP / web farms;
⚫ Port server;
⚫ Access URL;
⚫ Agent name (browser or…)
⚫ Date and hour for access…….
Applications for log Analysis
(Web Log Analyzer)
⚫ Traffic analysis from web and FTP server
logs;
⚫ Various reports;
⚫ ….
E-mail
⚫ Message exchange by means of remote
distance communication;
⚫ Messages may be encoded in:
⚫ Text ASCII (American Standard Code for
Information Interchange)
⚫ HTML
⚫ XML (eXtensible Markup Language)
E-mail
⚫ Protocols:
⚫ SMTP – Simple Mail Transfer Protocol
⚫ POP3 – Post Office Protocol
⚫ IMAP – Internet Message Access Protocol
E-mail - SMTP
⚫ It controls the transport to a destination
server; it is used for sending and receiving
messages between servers;
⚫ It is used for sending messages from the
client to the server;
E-mail – POP3
⚫ Standard protocol for retrieving and
downloading+moving email messages;
⚫ It controls a connection between a POP3
client and a server used for storing
messages;
⚫ It has 3 states:
⚫ Authentication – the POP3 client is connected to
server must be authenticated before users
downloading messages.
E-mail – POP3
⚫ Transaction state - the client sends POP3
commands and the server receives and
executes them according to the POP3
protocol
⚫ Update state – closes the connection
between the client and server, being the last
command sent by the client.
SMTP and POP3
E-mail Operations
POP 3 System Components
⚫ POP3 client– is an application used for
reading, composing and managing e-mail
messages;
⚫ SMTP – transfers the messages between the
client and the server;
⚫ POP3 protocol – email protocol used for
controlling the connection between the client
and the server.
Organization level e-mail
services
⚫ e-mail servers – a computer that has services for
SMTP, POP3 or IMAP; the users may connect to it
for downloading, sending and managing through a
network
⚫ Email domains –
⚫ Domain name
⚫ Mail eXchanger (MX) registration in DNS
⚫ Mail boxes – a mailbox it is used by one user who is
member of an email domain
IMAP
⚫ A method used for accessing messages
stored on a e-mail server;
⚫ Support for operations like creating, deleting
and renaming folders, checking for new
messages, setting flags for messages,
searching for messages ON the server.
Proxy
Secured HTTP (financial transactions)
Useful addresses
⚫ http://www.whatis.com – dictionar de termeni
legati de calculatoare
⚫ http://www.webopedia.com
⚫ http://www.howstuffworks.com
⚫ http://www.wikipedia.com
⚫ www.warriorsofthe.net
Personalization
Overview
• Profile-based personalization
• Behavior-based personalization
• Campaign-based personalization
– Campaigns have defined start and finish dates and a defined
audience
Personalization and the
Audience
• Without some notion of who a person is, personalizing for
that person is not possible
• By using an audience analysis, you can come up with the
set of traits and trait values that each audience type must
exhibit
• Each cluster of these traits is a user profile that you can
collect, store, and access to decide what kind of person you
are dealing with
Personalization and the
Audience (Cont.)
• How to collect audience traits
– You can ask
– You can infer/deduce
– You can buy
• If you decide to ask for information from your audience
members, keep the following points in mind
– Trust
– Value
– Time and effort
– Context – make sure that you explain why you need the answer to
the question you ask, or position the questions in a context where an
explanation isn’t necessary
Personalization and Components
1
Definitions
Search –
To examining a file in a computer, disk,
database or network for finding certain
information
Engine
Something that gives force or energy for moving a system
Search engine
An application that looks for certain key words and returns a list of
documents where these words were found. Esp. commercial
services that scan documents from the Internet.
2
Using a search engine…
3
How S.E. work
Crawler
URL1
URL2
Indexer Web
URL3 URL4
4
How S.E. work (2)
7
How S.E. work (5)
• further:
– There are many S.E having only national covering
• orientation
– There are many specialized or domain S.E
• Coverage oriented to a subject of interest
9
WWW coverage (2)
• According to a 2001 study, there were massively more than 550 billion
documents on the Web, mostly in the invisible Web, or deep Web.[27]
• A 2002 survey of 2,024 million Web pages[28] determined that by far the
most Web content was in English: 56.4%; next were pages in German
(7.7%), French (5.6%), and Japanese (4.9%).
• A more recent study, which used Web searches in 75 different languages
to sample the Web, determined that there were over 11.5 billion Web
pages in the publicly indexable Web as of the end of January 2005.[29]
• As of March 2009[update], the indexable web contains at least 26 billion
pages.[38]
• On July 25, 2008, Google software engineers Jesse Alpert and Nissan Hajaj
announced that Google Search had discovered one trillion unique URLs.[31]
• Over 100.1 million websites operated as of March 2008.[14] Of these 74%
were commercial or other sites operating in the .com generic top-level
domain.[14]
10
The size of the World Wide Web
GYWA = Sorted on Google, Yahoo!, Windows Live Search (Msn Search) and Ask
YGWA = Sorted on Yahoo!, Google, Windows Live Search (Msn Search) and Ask
“From the sum of these estimations, an estimated overlap between these search engines is
subtracted. The overlap is an overestimation; hence, the total estimated size of the indexed World
Wide Web is an underestimation.” 11
Estimated size of Google's index
12
Differences between S.E.
• Size
• Search options
• Speed
• Update frequency
• Relevance of results
• Ease of use
• http://www.searchengineshowdown.com/features/
• http://www.searchengineshowdown.com/features/byfeat
ure.shtml
13
14
Business Models
• Public good – independent budget
– PubMed (http://en.wikipedia.org/wiki/PubMed) –biomedical
research
– Librarians’ Index to Internet - http://lii.org/
15
Sponsored links
• “Sponsored links are links to websites that pay for
placement next to Google Maps search results. These
advertisements are always clearly labeled as 'Sponsored
Links', and are targeted to the topic and location of a
search. For example, search results for 'hotels near LAX'
will show sponsored links from certain hotels in this area.
Sponsored links are simply another way to find websites
that contain the information that you're searching for.”
• “Sponsored links that appear in AOL Search results or on
AOL channels are listings that have been purchased by
companies to have their businesses or Web sites appear
for specific search terms related to their services”
• Payment for crawling/indexing a site more often
16
Limits
• Each S.E. has limitation regarding
– Coverage
– Search features
– Finding quality info
• Some SE combined search with economics
becoming more than advertisers
• SE may become victims of spamindexing
– It affects the included content and its
classification
17
spamindexing
• Content spam
– Keyword stuffing
– Hidden or invisible unrelated text
– Meta tag stuffing
– "Gateway" or doorway pages
• Link spam
– Link farms
– Hidden links
– "Sybil attack"
– Spam blogs
– Page hijacking
• Using world-writable pages
– Spam in blogs
– Comment spam
• http://en.wikipedia.org/wiki/Spamdexing
18
Meta search engines
• meta engines search multiple engines
– getting combined results from a variety of
engines
• do not have their own databases
– but have their own business models affecting
results
• a number of techniques used
– interesting ones: clustering, statistical analyses
19
Examples of meta engines
- with organized results
Dogpile
results from a number of leading search engines; gives source,
so overlap can be compared; has SearchSpy -listing searches
that were performed
Surfwax
gives text sources & linking to sources; for some terms gives
related terms to focus
Turbo10
provides results in clusters; engines searched can be edited
Clusty
results grouped by topics or clusters for further sources
20
21
Examples of meta engines
- with organized results (2)
• large directory
– Complete Planet
directory of over 70,000 databases & specialty engines; classified
27
Information about S.E.
Search Engine Watch
• ratings, news, statistics, charts, explanations, tutorials
Search Engine Showdown
• “The users’ guide to web searching” - run by a librarian, news links, ratings
Virtual Chase
a site about “Teaching Legal Professionals How To Do Research;,” this section has
very good tips and links for consideration of quality on the web
SiteLines
a blog, written by Rita Vine, a professional librarian, & web search
trainer; many evaluations in archive
ResourceShelf
“Resources and News for Information Professionals,” edited by
Gary Price, a librarian & author of Invisible Web – has extensive
archive
WebsearchAbout
not evaluative, but provides news, capabilities, sources, articles
about web searching
28
A few tips for Web searching, including
the invisible kind
29
A SELECTION OF A FEW (OF GREAT
MANY) SOURCES FOR
INVISIBLE WEB
30
Characteristics
• Many oriented toward scholarly, research &
professional, technical & related information
– include sources mostly not covered by general
search engines
• majority of these are trustworthy
• quality much higher, some carefully selected, some
edited
– origins vary widely
• from commercial to voluntary to government
sponsored
• Popular in many disciplines31
Large scholarly search engines &
directories - sample
• Infomine - a comprehensive virtual library and reference tool for
academic and scholarly Internet resources, including Web sites, databases
– covers a wide range of scholarly resources by fields
• Scirus – “it allows researchers to search for not only journal content but
also scientists' homepages, courseware, pre-print server material, patents and
institutional repository and website information. “
– by Elsevier, run in conjunction with Scopus and Science Direct, but this one free
32
Large edited sites
Open Directory Project
• large edited catalog of the web – global, run by
volunteers
BUBL LINK
• selected Internet resources covering all academic
subject areas; organized by Dewey Decimal System –
from UK
33
Science, scholarship engines, not free – a
sample
• In a specific domain
ACM Portal
Asoc. for Computing Machinery: access to ACM Digital Library & Guide to Computing
[available at RUL]
34
Domain engines
Cover specific subjects & topics
from sciences, arts, humanities, to various media &
interests – you name it
• Important tool for subject searches
– particularly for subject specialist
– valued by professional searchers
36
in science …
Ocean Planet NASA
presentation of earth & its vast oceans
ArXiv Cornell U, National Science Foundation
e-print service in the fields of physics, mathematics, computer
science, and quantitative biology
large, non-reviewed contribution by authors, comments later
Athena Earth Sciences Resources
not a search engine but a large well organized directory
37
in education …
Intute
“Intute is a free online service providing you with a
database of hand selected Web resources for education
and research.”
Think Quest – Oracle Education Foundation
education resources, programs; web sites created by
students
Resource Discovery Network – UK
“UK's free national gateway to Internet resources for the
learning, teaching and research community”
38
in images, movies, video …
Internet Movie Database
• treasure trove of movies
Picsearch
picture searching
Blinkx
claims to be word largest search engine for videos; it has indexed over 32
million hours worth of video footage, made searchable by automatically
transcribing the speech content.
39
in humanities …
Shakespeare & Internet Search Tools & Resources
great fun to navigate
KIRKE - Katalog der Internetressourcen für die Klassische Philologie aus
Erlangen
• German; a variety of resources for classics
Diotima
Materials for study of women and gender in the Ancient World
40
in music …
Musipedia
Not everything is text. This is “a searchable, editable, and expandable collection
of tunes, melodies, and musical themes.” Great fun!
41
governments …
U Mich Document Center
official documents from all over the world
– US government official web portal
“Whatever you want or need from the U.S. government”
US State Department
• about the U.S & other countries
FirstGov
the US government official web portal
42
Enterprise search
• used to describe the application of search technology to information within an organization
• major challenge faced by Enterprise search is the need to index documents from a variety of
sources such as: file systems, intranets, document management systems, e-mail, and
databases and present a consolidated list of relevance ranked documents from these various
sources
• many applications require the integration of structured data as part of the search criteria and
when presenting results back to the users
• Differences from web search
– Adapters to index content from a variety of repositories, such as databases and content
management systems
– Federated search
• transforming a query and broadcasting it to a group of disparate databases with the appropriate syntax,
• merging the results collected from the databases,
• presenting them in a succinct and unified format with minimal duplication,
• providing a means, performed either automatically or by the portal user, to sort the merged result set.
– Entity extraction that seeks to locate and classify elements in text into predefined
categories such as the names of persons, organizations, locations, expressions of times,
quantities, monetary values, percentages, etc.
– Faceted search, a technique for accessing a collection of information represented using
a faceted classification, allowing users to explore by filtering available information.
– Access control, usually in the form of an Access control list (ACL), is often required to
restrict access to documents based on individual user identities.
43
Magic Quadrant for Information
Access Technology (2008)
• http://mediaproducts.gartner.com/reprints/microsoft/vol6/articl
e4/article4.html
• Included in Information Access Technology:
– document management, Web content management and relational
database management systems to provide users with insight into their
contents
– expected to include results from enterprise applications, such as
customer relationship management (CRM) and legacy systems
• it increasingly looks outside enterprises as well, to premium
sources of information, Web sites and elements of the social
Web
• Portal, ECM, business application and other vendors
frequently include enterprise search as part of their products
44
• most mature information access
technology is search engine technology
(from 1994 +), applied to unstructured
data in document repositories
• added to this category: auto
categorization, creative visualization,
content analytics and taxonomy support
technologies
• “Total software revenue in the world's
enterprise search market in 2007 was
$860.3 million. We forecast it to grow to
$1.5 billion by 2012, for a compound
annual growth rate of 11.4%
(see "Dataquest Insight: Technology and
Vendor Consolidation Will Drive the
Enterprise Search Market Through 2012").
“, Gartner
45
Mind map
Web 2.0
“Definition”
• Technology trend that enables collaboration
and sharing between users
Web 1.0 vs Web 2.0
• DoubleClick --> Google
• AdSense Ofoto --> Flickr
• Akamai --> BitTorrent
• mp3.com --> Napster
• Britannica Online --> Wikipedia
• personal websites --> blogging
• evite --> upcoming.org and EVDB
• domain name speculation --> search engine optimization
• page views --> cost per click
• screen scraping --> web services
• publishing --> participation
• content management systems --> wikis
• directories (taxonomy) --> tagging ("folksonomy")
• stickiness --> syndication
Web 1.0 vs Web 2.0
• Document Collaboration
Napster Bittorrent
Web 1.0 vs Web 2.0
• Music and Entertainment
RapidShare MediaFire
MediaFire does not enforce waiting
times for downloads, require
CAPTCHAs, limit simultaneous
downloading, or set bandwidth
limits.
Web 1.0 vs Web 2.0
• Video Sharing
•Google Sites
Adobe Acrobat Alternatives
• PDFreDirect - view, edit, create and protect
PDFs. The free version offers almost all of the
same functionalities as Adobe Acrobat, and
the paid version is a nice complete substitute
• PDFescape - read, fill out and print PDFs
online directly in the browser + PDFtypewriter
for creating and editing the PDF files using
Java
• PDFedit - edit, read and script PDF files online
Also see
• Open Source Alternative to Commercial
Software - http://www.osalt.com/
• http://alternativeto.net/
Q/A
• Questions, please?
Nupur Choudhury / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 5 (6) , 2014, 8096-8100
Abstract- The fast lane toward the development of Web is eventually become the World Wide Web [1]. The 1989
coined to be as an outright phenomenon in the today’s society proposal was meant for a more effective CERN
with incorporated use of modern innovative technology and communication system but Berners- Lee eventually realised
redefining the way of organizing, communicating and the concept could be implemented throughout the world.
collaborating with individual which in terms lead us to
mixture of spectacular successes and failures. The purpose of
Berners- Lee and Belgian computer scientist Robert
this paper is to understand and conceptualize the evolution of Cailliau proposed in 1990 to use hypertext “to link and
Web from the scratch to the upcoming trends in the field of access information of various kinds as a web of nodes in
Web Technology. which the user can browse at will" [22]. In these ways the
first web service was designed and tested and latterly
Keywords: Web 1.0, Web 2.0, Web 3.0, Web 4.0, confined as Word Wide Web.
characteristics, Limitation, Architecture.
III. WEB 1.0
I. INTRODUCTION Web 1.0 was first implementation of the web and
In today’s era Web Technology can be easily defined it lasted from 1989 to 2005. It was define as web of
by the user in different descriptive way. But matter in fact information connections. According to the innovator of
many user are quite unknown to the information that from World Wide Web, Tim Berners-Lee considers the Web as
where the WWW was coined first. As this paper state the “read-only” Web [1]. It provides very little interaction
evolution of Web so it is important to initiate the story from where consumer can exchange the information together but
the beginning where it was stated first. it was not possible to interact with the website. The role of
Web was introduced by Tim Burners-Lee in late the web was very passive in nature.
1989[9][10]. He view of the capabilities of the World Wide Web 1.0 was referred as the first generation of World
Web was expressed by three innovations, typically Wide Web which was basically defined as
associated with three phases: namely, the Web of ” It is an information space in which the items of
documents (Web 1.0), the Web of people (Web 2.0) and the interest referred to as resources are identified by global
Web of data (the still-to-be-realised Web 3.0) [11].Through identifier called as Uniform Resources Identifiers (URIs) “.
its life cycle, the World Wide Web has been through First generation Web was era static pages and content
various phases of development. Going by the trend of delivery purpose only. In other words, the
constant evolution, the Web is now slowly but surely early web allowed us to search for information and read it.
transiting to more data centric phase in the context of Web There was very little in the way of user interaction or
version 3.0[7]. content contribution.
This paper is structured in such a way that, classifying A. CHARATERISTICS
obtaining nature of Web 1.0 and projecting prospective Web 1.0 Technologies includes core web
characteristics of Web 2.0 with added different dimensions protocols: HTML, HTTP and URI. The major
of the Web 3.0 semantic frameworks, whilst its scope is characteristics of Web 1.0 are as follow:
directed to explore a stronger appreciation into architectural They have read only content.
foundations of the next generation of Web 4.0 of Web Establish an online presence and make their
applications. This paper would attempt to build a user information available to anyone at any time.
centric view of the composition of features that would be It includes static web pages and use basic
expected to be incorporated in future generations of Web Hypertext Mark-up Language.
technology. In sum, the paper presents a holistic view of B. LIMITATION
the World Wide Web. The major limitations of Web 1.0 are as follow:
II. WORLD WIDE WEB The Web 1.0 pages can only be understood by
The World Wide Web is a system of interlinked humans (web readers) they do not have
hypertext documents accessed via the Internet [21]. With a machine compatible content.
web browser, one can view web pages that may contain The web master is solely responsible for
text, images, videos, and other multimedia and navigate updating users and managing the content of
between them via hyperlinks. On March 12, 1989, Tim website.
Berners- Lee, a British computer scientist and former
CERN employee, wrote a proposal for what would
www.ijcsit.com 8096
Nupur Choudhury / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 5 (6) , 2014, 8096-8100
Lack of Dynamic representation i.e., to acquire Wide Web to a new phase of use and service
only static information, no web console were development [17]. The categorization can be used to
available to performing dynamic events. elaborate on the understanding of Web 2.0 achieved
through varied definitions.
IV. WEB 2.0
Web 2.0 is the second generation of web. It was Technology Centric Definition:
defined by Dale Dougherty in 2004 as a read-write web [1]. Web has become a platform with
The concept began with a conference brainstorming session software above the level of a single
between O’Reilly and Media live International. The device. Technology that is associated
technologies of web 2.0 allow assembling and managing with blogs, wikis, podcasts, RSS feeds
large global crowds with common interests in social etc.
interactions. Business Centric Definitions:
Tim O’Reilly defines web 2.0 on his website as follows A way of architecting software and
[8]: businesses. The business revolution in the
“Web 2.0 is the business revolution in the computer industry caused by the move to
computer industry caused by the move to the internet as platform and an attempt to
internet as platform, and an attempt to understand the rules for success on that of
understand the rules for success on that new new platform.
platform. Chief among those rules is this: Build User Centric Definitions:
applications that harness network effects to get The Social Web is often used to
better the more people use them.” characterize sites that consist of
Web 2.0 facilitates major properties like participatory, communities. It is all about content
collaborative, and distributed practices which enable formal management and new ways of
and in-formal spheres of daily activities on going on web. communication and interaction between
In other terms it resemble major distinct characteristics of users. Web applications that facilitate
Web 2.0 include “relationship” technologies, participatory collective knowledge production, social
media and a social digital technology which in term can networking and increases user to user
also defined as the wisdom web. People-centric web and information exchange.
participative web is taken into concern and which facilities B. LIMITATION
reading and writing on the web which makes the web Sometimes it may happen that if the new technology
transaction bi-directional. meets expectations of the mass user at large, there may
Web 2.0 is a web as a platform where users can leave be a chance that these technologies may face lot of
many of the controls they have used in web 2.0. In other consequences from external environment which may
words, the user of web 2.0 has more interaction with less supress or limit the flow of technology in presenting
control. Web 2.0 is not only a new version of web 1.0 results which might not be feasible and may lead to
but it also implies to flexible web design, creative reuse, degrade the performance of the technology as a whole.
updates, collaborative content creation and modification Constant iteration cycle of Change and
in web 2.0 that should be considered as one of the Updates to services [11].
outstanding feature of the web 2.0 is to support
Ethical issues concerning build and usage of
collaboration and to help gather collective intelligence
Web 2.0 [11].
rather Web 1.0.
Interconnectivity and knowledge sharing
between platforms across community
boundaries are still limited [12] [15].
V. WEB 3.0
Web 3.0 is one of modern and evolutionary topics
associated with the following initiatives of Web 2.0. Web
3.0 was first coined by John Markoff of the New York
Times and he suggested web 3.0 as third generation of the
web in 2006 [18]. Web 3.0 can be also stated as
“executable Web”.
The basic idea of web 3.0 is to define structure data and
link them in order to more effective discovery, automation,
integration, and reuse across various applications [6]. It is
able to improve data management, support accessibility
of mobile internet, simulate creativity and innovation,
Fig. 1 Comparison Web1.0 & Web 2.0 [28]
encourage factor of globalization phenomena, enhance
A. CHARATERISTICS customers’ satisfaction and help to organize collaboration
Web 2.0 is instead a label coined by Tim O’Reilly and in social web.
associates to reference the transition of the World
www.ijcsit.com 8097
Nupur Choudhury / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 5 (6) , 2014, 8096-8100
Web 3.0 is also known as semantic web. Semantic web A. SEMANTIC WEB
was thought up by Tim Berners-Lee, inventor of the World The Semantic Web is a collaborative movement
Wide Web [1]. There is a dedicated team at the World led by international standards body the World Wide
Wide Web consortium Web Consortium. According to the W3C [4],
(W3C) working to improve, extend and standardize the “The Semantic Web provides a common
system, languages, publications and tools have already framework that allows data to be shared and reused
been developed [3]. Web 3.0 is a web where the concept of across application, enterprise, and community
website or webpage disappears, where data isn’t owned but boundaries “.
instead shared, where services show different views for the
same web or the same data. Those services can be The main purpose of the Semantic Web is driving
applications (like browsers, virtual worlds or anything the evolution of the current Web by enabling users to
else), devices or other, and have to be focused on context find, share and combine in formation more easily. The
and personalization, and both will be reached by using Semantic Web, as originally envisioned, is a system that
vertical search [13]. enables machines to “understand” and respond to
Web3.0 supports world wide database and web complex human requests based on their meaning. Such
oriented architecture which in earlier stage was described an “understanding” requires that the relevant
as a web of document. It deals mainly with static HTML information sources be semantically structured.
documents, but dynamically rendered pages and alternative Tim Berners- Lee originally expressed the Semantic
formats should follow the same conceptual layout standards Web as follows [2]:
whenever possible and links are between documents or part “If HTML and the Web made all the online
of them. The web of documents was designed for human documents look like one huge book, RDF, schema, and
Consumption in which primary objects are documents and inference languages will make all the data in the world
links are between documents (or parts of them). Semantics look like one huge database”.
of content and links are implicit and the degree of structure Tim Berners-Lee proposed a layered architecture
between objects is fairly low [19]. Figure 2 represents the for semantic web that often represented using a
structure of web of documents in simple [19]. diagram, with many variations since.
www.ijcsit.com 8098
Nupur Choudhury / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 5 (6) , 2014, 8096-8100
information on the Web. RDF provides the Vagueness: This arises from the vagueness of
technology for expressing the meaning of terms user queries, of concepts represented by
and concepts in a form that computers can content providers, of matching query terms to
readily process. provider terms and of trying to combine
RDF Schema: It provides a predefined, basic different knowledge bases with overlapping
type system for RDF models. RDF Schema but subtly different concepts.
provides modeling primitives for organizing Inconsistency: These are logical
Web objects into hierarchies. Key primitives are contradictions which will inevitably arise
classes and properties, subclass and sub property during the development of large
relationships, and domain and range restrictions. ontologies, and when ontologies from
Ontology: The ontology layer described separate sources are combined.
properties and the relation between properties Deceit: This is when the producer of the
and different. Ontology can be defined as a information is intentionally misleading the
collection of terms used to describe a specific consumer of the information.
domain with the ability of inference.
Logic layer: It is used to enhance the ontology
VI. COMPARISION
language further and to allow the writing of
The main difference between Web 1.0, Web 2.0 and
application-specific declarative knowledge.
Web 3.0 is that web 1.0 is consider as read-only web
Proof layer: It involves the actual deductive targets on content creativity of producer web 2.0 targets
process as well as the representation of proofs on content creativity of users and producers while web 3.0
in Web languages (from lower levels) and targets on linked data sets. The very few comparative
proof validation. differences between Web 1.0, Web 2.0 and Web 3.0 are
Trust layer: It will emerge through the use of given below:
digital signatures and other kinds of knowledge WEB 1.0 WEB 2.0 WEB 3.0
based on recommendations by trusted agents or
1996 – 2004 2004 -2016 2016+
on rating and certification agencies and
The Hypertext
consumer bodies. The Social Web The Semantic Web
Web
Semantic web is not limited to publish data on the Tim Berners Tim O’Reilly, Dale
web. It is about making links to connect related data. Tim Berners Lee
Lee Dougherty
Berners-Lee introduced a set of rules have become Read and Write
known as the Linked Data principles to publish and Read Only Executable Web
Web
connect data on the web in 2007 [16]: Millions of
Use URIs as names for things Billions of User Trillions+ of Users
User
Use HTTP URIs to look up those names Participation and
Provide useful information, using the Echo System Understanding self
Interaction
standards (RDF) by look up a URI One Multi-user Virtual
Include links to other URIs to discover more Bi-Directional
Directional environment
things People build
Data providers can add their data to a single Companies application though
global data space by publishing data on the web People Publish
Publish which people
according to the Linked Data principles. Content
Content interact and
B. CHARACTERISTICS publish content.
The major characteristics of Web 3.0 as marked by Web 3.0 is
Nova Spivack are [18]: curiously
SaaS Business Model. Static content. Dynamic content. undefined.
Open Source Software Platform. AI and 3D,The
Distributed Database –or what called as “The web learning
World Wide Database”. Personal Blog and Social SemiBlog,
Web Personalization. Websites Profile Haystack.
Resource Pooling Message
Community portals Semantic Forums
Intelligent Web. Board
Buddy List, Online Social Semantic Social
C. CHALLENGES Address Book networks. Information
Semantic Web faces several challenging issue Table 1. Comparison of Web 1.0, Web 2.0 and Web 3.0
like:
Vastness: The World Wide Web contains VII. WEB 4.0 AND FUTURE WEB
many billions of pages. Redundancy in Data Web 4.0 can be considered as an Ultra-Intelligent
may occur which has not yet been able to Electronic Agent, symbiotic web and Ubiquitous web
eliminate all semantically duplicated terms. [25]. Interaction between humans and machines in
www.ijcsit.com 8099
Nupur Choudhury / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 5 (6) , 2014, 8096-8100
symbiosis was motive behind of the symbiotic web. [10] Maged, N. Kamel Boulos & Steve, Wheeler, “The emerging Web
2.0 social software: an enabling suite of sociable technologies in
Powerful as human brain, progress in the development of health and health care education”, Health Information and Libraries
telecommunications, advancement on nanotechnology in Journal, pp: 2 -23, 2007
the world and controlled interfaces using web 4.0. In [11] Anderson, P. ”`All That Glisters Is Not Gold' -- Web 2.0 And
simple words, machines would be clever on reading the The Librarian”, Journal of Librarianship and Information Science,
39 (4), pp. 195–198, 2007.
contents of the web, and react in the form of executing and [12] Abel, F., Frank, M., Henze, N., Krause, D., Plappert, D., & Siehndel,
deciding what to execute first to load the websites fast with P., “Group Me! - Where Semantic Web meets Web 2.0”, 2007.
superior quality and performance and build more [13] Mind Booster, Noori, “WhatisWeb3.0?”
commanding interfaces [24]. http://mindboosternoori.blogspot.com/2007/08/what-is-web-30.html,
2007.
Web 4.0 will be read write concurrency web [23]. It [14] Tim, Berners-Lee & Christian, Bizer & Tom, Heath & Kingsley,
ensures global transparency, governance, distribution, Idehen, “Linked Data on the Web”, 17th International World Wide
participation, collaboration into key communities such as Web Conference, 2008.
industry, political, social and other communities. WebOS [15] Chan, C. K., Lee, Y. C., & Lin, V., “Harnessing Web 2.0 for
Collaborative Learning”, Springerlink, 2009.
will be such as a middleware in which will start functioning [16] Christian, Bizer & Tom, Heath & Tim, Berners-Lee, “Linked
like an operating system [26]. WebOS will be parallel to Data - The Story So Far”, Journal Semantic Web and Information
the human brain and implies a massive web of highly Systems, 2009.
intelligent interactions [27]. [17] Harrisom, T. M., & Barthel, B., “ Wielding new media in Web
2.0: exploring the history of engagement with the collaborative
construction of media products ”. New media & Society, 11(1&2),
VIII. CONCLUSION pp. 155–178, 2009.
[18] Nova Spivack, “Web 3.0: The Third Generation Web is Coming”
This paper provided an overview from the evolution of http://lifeboat.com/ex/web.3.0, 2011.
the web. Web 1.0, web 2.0, web 3.0 and web 4.0 were [19] Sareh Aghaei, Mohammad Ali Nematbakhsh and Hadi Khosravi
described as four generations of the web. The Farsani, “Evolution of the World Wide Web: From Web 1.0 to Web
characteristics of the generations are introduced and 4.0”,Computer Engineering Department, University of Isfahan,
Isfahan, Iran, International Journal of Web & Semantic Technology
compared. It is concluded web as an information space has (IJWesT) Vol.3, No.1,pp. 1-10, 2012
had much progress since 1989 and it is moving toward [20] Patel et al., International Journal of Advanced Research in Computer
using artificial intelligent techniques to be as a massive Science and Software Engineering 3(10), pp. 410-417, 2013.
web of highly intelligent interactions in close future. [21] W3C “World Wide Web Consortium”, http://www.w3.org.
[22] World Wide Web: Proposal for a HyperText Project
(http://www.w3.org/Proposal.html).
REFERENCES [23] “Web 4.0 - A New Web Technology”, http://website-
[1] Tim Berners-Lee, “The World Wide Web: A very short personal quality.blogspot.com/2010/01/web-40-new-webtechnology.html/,
history”, http://www.w3.org/People/Berners-Lee/ShortHistory.html , Hemnath (2010)
1998. [24] On2broker: Lessons Learned from Applying AI to the Web Dieter
[2] Berners-Lee, Tim; Fischetti, Mark,” Weaving the Web”, Harper San Fensel, Jürgen Angele, Stefan Decker, Michael Erdmann, Hans-
Francisco, chapter 12, ISBN 978-0-06-251587-2, 1999. Peter Schnurr, Rudi Studer and Andreas Witt Institute AIFB,
[3] Sean B, Palmer, “The Semantic Web: An Introduction”, University of Karlsruhe, D -76128 Karlsruhe, Germany
http://infomesh.net/2001/swintro/, 2001. dfe@aifb.uni-karlsruhe.de,http://www.aifb.uni-karlsruhe.de/~dfe.
[4] W3C Semantic Web Activity “http://www.w3.org/2001/sw/”, World [25] JONATHAN FOWLER AND ELIZABETH RODD(2013) Web
Wide Web Consortium, 2001. 4.0: The Ultra-Intelligent Electronic Agent is
[5] Jane, Greenberg & Stuart, Sutton & D. Grant, Campbell , http://bigthink.com/big-think-tv/web-40-the-ultra-intelligent-
“Metadata: A Fundamental Component of the Semantic Web”, electronic-agent-is-coming.
Bulletin of the American Society for Information Science and [26] Ron, Callari , “Web 4.0,Trip Down the Rabbit Hole or Brave
Technology Volume 29, Issue 4, pages 16–18,2003. New World?”, http://www.zmogo.com/web/web-40trip-down-the-
[6] Ossi, Nykänen, “Semantic Web: Definition”, rabbit-hole-or-brave-new-world/ .
http://www.w3c.tut.fi/talks/2003/0331umediaon/slide6-0.html, 2003. [27] Dan, Farber (2007), “From semantic Web (3.0) to the WebOS (4.0)”,
[7] Motta, E., & Sabou, M, “Next Generation Semantic Web http://www.zdnet.com/blog/btl/from-semantic-web-30-to-the-webos-
Applications”, Heidelberg, Springer-Verlag Berlin, pp. 24-29, 2006. 40/4499/ .
[8] O’Reilly, Definition of Web 2.0. [28] Flat Word Business “Web 1.0 vs Web 2.0 vs Web 3.0 vs Web 4.0 –
http://radar.oreilly.com/archives/2006/12/web-20-compact- A bird’s eye on the evolution and definition”
definition-tryi.html, 2006. http://flatworldbusiness.wordpress.com/flat-
[9] Brian, Getting, “Basic Definitions: Web 1.0, Web 2.0, Web 3.0”, education/previously/web-1-0-vs-web-2-0-vs-web-3-0-a-bird-eye-
2007, http://www.practicalecommerce.com/articles/464-Basic- on-the-definition/
Definitions-Web-1-0-Web-2-0-Web-3-0
www.ijcsit.com 8100
Intranet, Extranet, Portal
Intranet
• An intranet is a network within an
organization that links together users by
means of Internet technologies.
• An intranet limits the Internet territory by
establishing access controlled zones where
users may communicate and interact freely.
• These networks are based on WWW --> users
may communicate in real time between
platforms.
Intranet (cont.)
• An intranet is useful for organizations that:
– Are geographically dispersed;
– Share common business objectives;
– Have common information needs;
– Value collaboration.
• An Intranet may have 3 functionality levels:
– It Displays general, static information;
– Sharing data – used for managing dynamic data within an
organization;
– Interactive communications – real-time collaboration and
creating a secured platform for interactive communication
within an organization.
An Intranet may be used for:
• Displaying the goal and scope of the organization;
• on-line manuals and procedures;
• Creating internal forums and bulletin boards;
• Displaying a digital phone book and a personnel catalogue;
• Event calendars for the events within the organization;
• Search engine for documents;
• Displaying news from within the organization and from the
outside world;
• Lists of articles written by partners;
• List of clients and contact information database;
• Marketing and price information for products together with
their catalogue;
Advantages for using an Intranet
• Workforce productivity: Intranets can help employees to quickly find and view information
and applications relevant to their roles and responsibilities. Via a simple-to-use web browser
interface, users can access data held in any database the organization wants to make
available, anytime and - subject to security provisions - from anywhere, increasing
employees' ability to perform their jobs faster, more accurately, and with confidence that
they have the right information.
• Time: With intranets, organizations can make more information available to employees on a
"pull" basis (ie: employees can link to relevant information at a time which suits them) rather
than being deluged indiscriminately by emails.
• Communication: Intranets can serve as powerful tools for communication within an
organization, vertically and horizontally.
• Web publishing allows 'cumbersome' corporate knowledge to be maintained and easily
accessed throughout the company using hypermedia and Web technologies. Examples
include: employee manuals, benefits documents, company policies, business standards,
newsfeeds, and even training, can be accessed using common Internet standards (Acrobat
files, Flash files, CGI applications). Because each business unit can update the online copy of a
document, the most recent version is always available to employees using the intranet.
• Business operations and management: Intranets are also being used as a platform for
developing and deploying applications to support business operations and decisions across
the internetworked enterprise.
Disadvantages
• Publication of information must be controlled
to ensure only correct and appropriate
information is provided in the intranet – poate
fi combinat cu solutii de workflow pentru
rezolvarea problemei.
• Appropriate security permissions must be in
place to ensure there are no concerns over
who accesses the intranet or abuse of the
intranet by users.
Exemples
• KPMG moved all of its information assets to an intranet called
KWorld.
• “The success of Cisco Systems has been largely attributed to its
innovative corporate intranet”
• Ford Motor Co has more than 175,000 employees in 950 locations
worldwide, each of whom had access to the company’s intranet,
called Myford.com. The intranet gave employees information about
benefits, demographics, salary history, general company news and
human resources forms.
• ShoreBank's branch, affiliate, and consulting service employees
around the world communicate and collaborate using SIREN. SIREN
is an intranet, extranet, and knowledge management solution
implemented in 2006 using Intranet DASHBOARD.
• The Australian National University uses an Intranet called
Claromentis to maintain one of its external sites.
Links
• http://www.intranetjournal.com
• http://www.intranetblog.com
• http://www.steptwo.com.au/columntwo/
• http://www.eaber.org
• http://b-r-ent.com
• http://www.sorce.biz/whitepaperindex.asp
• http://www.s-development.net/blogs/
• http://www.intranetmaturity.com/
• http://www.theworkplaceblog.com/
Extranet
• It is an Web site with controlled access where the visitors may come
from the outside of the organization
• Exemples:
– Sales extranets allow the owners to publish special content for
important clients or for those that [prospect the market.
– B2B/e-commerce/virtual stores extranets for selected clients.
– Extranets for project management or collaborative extranets allow
sharing documents, plans and electronic goods within partners.
– In 2003 in the United Kingdom, several of the leading vendors formed
the Network of Construction Collaboration Technology Providers, or
NCCTP, to promote the technologies and to establish data exchange
standards between the different systems
• An extranet uses the features and the goals of an intranet,
extending them beyond the borders of an organization
Extranet usage
• Sharing up-to-date documents, files and images with
suppliers, partners and clients from different locations;
• Working in collaboration for editing, revision, updating,
versioning and storing documents and digital goods;
• Managing projects among partners from a single location;
• Sharing updated versions of frequently updated
documents: sales reports, stock summaries, product
specifications, design documents, production planning etc.
• Access for partners to back-office functions such as stock
management, warranty information, dates for new
products, shared sales etc.
Advantages
• Extranets can improve organization productivity by automating
processes that were previously done manually (e.g.: reordering of
inventory from suppliers). Automation can also reduce the margin
of error of these processes.
• Extranets allow organization or project information to be viewed at
times convenient for business partners, customers, employees,
suppliers and other stake-holders. This cuts down on meeting times
and is an advantage when doing business with partners in different
time zones.
• Information on an extranet can be updated, edited and changed
instantly. All authorised users therefore have immediate access to
the most up-to-date information.
• Extranets can improve relationships with key customers, providing
them with accurate and updated information
Disadvantages
• Extranets can be expensive to implement and maintain
within an organisation (e.g.: hardware, software, employee
training costs) — if hosted internally instead of via an
Application Service Provider;
• Security of extranets can be a big concern when dealing
with valuable information. System access needs to be
carefully controlled to avoid sensitive information falling
into the wrong hands.
• Extranets can reduce personal contact (face-to-face
meetings) with customers and business partners. This could
cause a lack of connections made between people and a
company, which hurts the business when it comes to
loyalty of its business partners and customers.
Portals
• Definition: Portals are single point of access to information which is:
– from various logically linked internet based applications and
– is of interest to various type of users
• Site on the World Wide Web that typically provides personalized
capabilities to its visitors, providing a pathway to other content.
– It is designed to use distributed applications,
– different numbers and types of middleware and hardware to provide
services from a number of different sources.
– In addition, business portals are designed to share collaboration in
workplaces.
– A further business-driven requirement of portals is that the content be
able to work on multiple platforms such as personal computers,
personal digital assistants (PDAs), and cell phones.
Advantages
• advantages of using portals:
– intelligent integration and access to enterprise
content, applications and processes
– improved communication and collaboration
among customers, partners, and employees
– unified, real-time access to information held in
disparate systems
– personalized user interactions
– rapid, easy modification and maintenance of the
website presentation
Types of portals
Types of portals
Types of Portals
• International portals (Yahoo!)
• Regional portals [(MswPower.Com),China (Sina.com), Italy (Webplace.it)]
– local information such as weather forecasts, street maps and local business
information
• Government portals:
– USA.gov,
– DisabilityInfo.gov
– Directgov (UK) – pentru cetateni
– businesslink.gov.uk (UK) – pentru persoane juridice
• Corporate/Enterprise portals
• Hosted web portals
– corporate portals gained popularity a number of companies began offering
them as a hosted service
– Hyperoffice.com, OFFICEHQ.com
• Domain specific portals – pentru toate tipurile de afarceri
Other types of portals
• Entertainment Portals: Often all members of an entertainment portal are responsible for its
content and direct the type of entertainment that is available to visitors to the site. An
example of one such portal is the South African Music and Entertainment portal Overtone.
These can be an essential part of community based networking and collaboration
• Environmental Portals: In recent years, many Environmental Portals have been developed in
order to raise awareness about Environmental Indicators. Such an Example is the EUSOILS
• Investment Portals: These are an excellent resource when researching global and industry
specific markets
• B2B and B2C Portals: B2B or Business to Business Portals have become a very important
resource for Global business. They provide buyer and seller details for different commodities
and products and help in connecting businesses across the globe. A B2B portal that
specializes in a single industry is called a Vertical B2B Portal or a Vortal. B2C or Business to
Consumer portals are used to directly sell products to consumers
• Mini Portals: Some localized portals are based on local interests, and edited and maintained
by individuals. While they do not provide the same levels of services as major portals, they
are a good place for collaboration of ideas, for commonly interested people
• Voice Portals: In addition to standard web sites accessed through web browsers, people can
also access "voice sites" through "voice browsers". Destinations accessed in this way by
standard telephones are often called Voice Portals.
Enterprise portal
• Definition:
– Enterprise Information Portals are one of the most
popular ways in which enterprises can allow their
employees and customers to search and access
corporate information.
– It is a single gateway for users, such as employees,
customers and company’s partners to log into and
retrieve corporate information, company history
and other services or resources
– is a web portal for use within an organization.
Example of an EIP Architecture
Example of an EIP Architecture 2
Portal features - 1
1. Web interface;
2. Presentation services (user interface
management);
3. External data access mechanisms;
4. Data access management;
5. security, authentication and personalization;
6. Tools for portal development;
7. Portal administrative and management tools.
Portal features - 2
• Content and document management — services that support the full life cycle of content and document
creation and provide mechanisms for authoring, approval, version control and scheduled publishing. Some
portal solutions providers aim to remove the need for a third-party content management system.
• Collaboration — portal members can communicate synchronously (through chat or messaging) or
asynchronously through threaded discussion and email digests (forums) and blogs.
• Search & Navigation — Content is meant to be read, so on the usage side of the equation, being able to
find and retrieve targeted content is the essential task. As more content is added to repositories, the more
valuable those repositories become. Unfortunately, retrieving useful information becomes more difficult
as the volume of information grows unless effective search and navigation methods are employed.
• Personalization — the ability for portal members to subscribe to specific types of content and services.
Users can customize the look and feel of their environment.Customers who are using EIPs can edit and
design their own web sites which are full of their own personality and own style; they can also choose the
specific content and services they prefer. Like My Yahoo. MSN.
• Entitlement /Securitate — the ability for portal administrators to limit specific types of content and
services users have access to. For example, a company's proprietary information can be entitled for only
company employee access.
• Integration — the connection of functions and data from multiple systems into new components/portlets.
• Single sign-on (SSO) — many enterprise portals provide single sign-on capabilities between their users and
various other systems. This requires a user to authenticate only once. Access control lists manage the
mapping between portal content and services over the portal user base.
EIP Advantages
• Centralization: EIPs provide a centralized system that may contain a wide
range of a company’s corporate information and access to online
applications. This centralized information system enables customers or
employees to easily access information such as reports, application forms
or policy documents. Furthermore, it is easy for the individuals within the
company to update or edit content.
• Increase productivity and profit: information and time is money. A
Centralized and well organized information system provided by EIP can
help employees get quick response and information that increase
employees’ productivity. In addition, it can offer customers easy access to
resources that may increase the company’s sources of customers.
• Provide security area: EIP has one significant feature which is providing a
security area that for team or a specific partner to access, which means
only authorized users can access restriction information.
EIP Disavantages
• High cost: To maintain several Web and portal sites for employees, customers and
partners is an expensive process, companies can spend a huge amount of money
on an EIP system in the hope that it will provide a stable portal for them.
• Conflict: To keep the current infrastructures or introduce more advanced system?
Many businesses required a single, integrated Web environment to cover all the
information and applications that is easy for employees, partners and customers to
view and find information. However, no company would like to spend huge cost on
replacing their existing infrastructures.
• outmoded platform: Many companies used to have an outmoded development
platform. Also they pay unequal attention on their information system. Especially
for external, customers or employees can not find enough information or
applications that they wanted. Therefore, the company may lose lots of chances to
attract potential customers.
• Ignoring Importance of Information Systems: An information repository is one of
the basic requirements for a company to keep providing information to employees,
partners or customers which they wish to view as web pages over their intranet. In
contrast, a portal that does not contain all pertinent information resources can
decrease a company’s market share and competitive advantage.
Data presentation techniques within
portals
• Portlet (pluggable user interface components that are managed and
displayed in a web portal. Portlets produce fragments of markup code that
are aggregated into a portal page. Typically, a portal page is displayed as a
collection of non-overlapping portlet windows, where each portlet
window displays a portlet. Hence a portlet (or collection of portlets)
resembles a web-based application that is hosted in a portal. Portlet
applications include email, weather reports, discussion forums, and news.
• Web-parts: Web Part is an add-on ASP.NET technology to Windows
SharePoint Services. Web Parts are an integrated set of controls for
creating Web sites that enable end users to modify the content,
appearance, and behavior of Web pages directly from a browser
• Digital dashboards: also known as an enterprise dashboard or executive
dashboard, is a business management tool used to visually ascertain the
status (or "health") of a business enterprise via key business indicators.
Digital dashboards use visual, at-a-glance displays of data pulled from
disparate business systems to provide warnings, action notices, next steps,
and summaries of business conditions
Digital Dashboard
http://www.businessweek.com/magazine/content/06_07/b3971083.htm
http://informationr.net/ir/9-3/paper181.html
http://www.clickz.com/showPage.html?page=910251
Rules versus collaborative filtering
• When complex filtering is required, a rule-based system may work better
than collaborative filtering, and vice versa. The following table details
examples where one type of personalization is better than the other.
Scenario Which Reason
filtering type
to use
f the number of items offered and users who Rules Very little room to compute user similarity necessary for
purchase them are rather low. collaborative filtering.
If price points are high or purchasing Rules Finite, limited arenas - collaborative filtering fails because
frequency is low. of the inherent lack of diversity.
If there is a pre-existing dependency between Rules Recommending a disability policy just because
items. Example: Disability policy required for collaborative filtering says many others "like this user" also
homeowner bought a policy is incorrect--one must have the
homeowner policy first.
If number of items offered and users who Collaborative Cannot write rules covering all items.
purchase them are rather high.
If price points are low, all quite dissimilar, or Collaborative The wide variance fits the collaborative filtering approach.
the products offered have a wide range of Collaborative filtering also lowers the risk of making "bad"
user appeal. recommendations.
When not much information is gathered about Collaborative In this case, user attributes on which to base rules may be
the user, but the user can be identified, lacking. Collaborative filtering can compare the user's
possibly by a login or cookie. experiences on the site to other users.
Customization VS. Personalization