You are on page 1of 5



The project entitled as Offline Web Browser is to design and develop the application package
for well secured dynamic application. This package is originated using front end as Java with
MYSQL Server.

This project aims to investigate Implementing Security and Authentication as Aspects within the
frame work of an offline web browser. In this project presents Preprocessing, web construction
components User process. They exclusively update the details of nominees and the browser.

According to the customer use and needs the browser can perform the following function:
Bookmark managing: Internet Bookmarks are stored as web pages locations (URLs) that can be
retrieved. As a feature of all modern internet web browsers, their primary purpose is to easily
catalogue and access web pages that a user has visited and chosen to save.

A download manager is a computer program dedicated to the task of downloading and

sometimes uploading possibly unrelated standalone files and sometimes to the internet for
storage. Project provides web construction, Preprocessing Module their profile with their full
details which has full control for offline web browser system from the application.

The bar along the bottom of the browsers window shows you what is being loaded into the
browser window at the moment or the URL of the link your mouse is over.
Use this button if you want to stop loading the contents of a page.

This project describes the capabilities that will be provided by the software applications offline
web browser. Its purpose is to display the information on the website in a net and organized
way; it also helps to navigate the web easily. These programs interpret code (HTML) and display
information from WWW.

1. Preprocessing Module
2 .Web Directory construction
3. Web fragment generation and classification
4) Web unit construction and classification
5) Home page construction :


1. Preprocessing Module

Protocol check

Web server is a "server" which is specifically used for displaying web pages or web sites.
Web server store pages on it's directory and serve when requested by any web client (Browser)
via HTTP Protocol. Every web server has their unique IP address(Internet Protocol Address)
which help the Browser to locate your web server in Internet.

HTTP Protocol is used to distribute information on world wide web, which we call
website, web page etc. The main aim of HTTP Protocol is to distribute information over Internet.

Log check
*Admin log level
Administration log level which specified by web server .
*User log level
User log level which specified by user.

This module checks the user log level against Admin log level specified by web server.
This consists of simple logging of common problems that can be fixed or tracked locally,
such as running out of resources, security failures, and simple configuration errors.

When a problem occurs in the field, it may be necessary to return the captured logging
information to the users for diagnosis the problem. This logging information may be extremely
detailed and fairly inscrutable.

2 .Web Directory construction

To mine Web units, other than using links among pages, we need to construct web
directory which is a first and important module in my project. We also derive the Web site
structure from the URLs of Web pages. Such a structure is known as Web directory. The URLs of
homepages share the common format: protocol type : ://hostname [:port number] [/path] [label
From the URLs, we can determine a set of Web folders from the elements of the host-
name component separated by the delimiter \." and the elements of path component separated by
the delimiter. Given a set of pages, a Web directory is therefore a tree consisting of Web folders
and pages as nodes, and the inclusion relationship among them as edges.

3. Web fragment generation and classification

A Web fragment is a Web page or a set of Web pages that can be considered as a potential
Web unit or a portion of a Web unit. In the Web fragment generation phase, we take the input
collection of Web pages and construct a Web directory representing the folder
structure of the Web site. Once the Web directory is built, the Web folders likely to contain
homepages of Web units are determined. From the selected Web folders, candidate homepages
are identified and their Web fragments are generated. The generated Web fragments are
then classified and assigned appropriate labels by the classifiers constructed from the training
Web units.
4) Web unit construction and classification

In this phase, Web units are constructed from the classified Web fragments based on some
heuristic rules. With these constructed Web units, the information about the Web site are
Web units of the same concept located together under a common parent folder .
hub page linking to all homepages are grouped together.
Such information is used as features to construct Web unit

classifiers that re-classify the Web units and assign them updated concept labels. This Web unit
construction and classification process repeats itself until there are no changes or only very
minor changes to the Web unit concept labels.

5) Home page construction :

This module constructs one homepage and zero or more support pages. The
homepage represents an entry point to reach all support pages of a Web unit. The support pages
of a Web unit are all reachable from the homepage through links.

A support page can only be part of one Web unit although it might be reachable from
multiple homepages through links. Whether a page should be a support page of a Web unit
depends on whether the page provides supplementary information for the logical entity described
by the Web unit. Web pages are manually examined and are classified into homepages, support
pages and other pages .Web graph is produced by performing a breadth first search starting from
the Web site home page
System Specification:
Hardware & software requirements
Hardware requirements
Processor : Pentium IV
Speed : Above 500 MHz
RAM capacity : 2 GB
Hard disk drive : 80 GB
Key Board : Samsung 108 keys
Mouse : Logitech Optical Mouse
Printer : DeskJet HP
Motherboard : Intel
Cabinet : ATX
Monitor : 17 Samsung
Software configuration
Operating System : Windows XP and above
Front end used : Java
Back End : MYSQL