You are on page 1of 34

RATING BASED FILE RETRIEVAL 2019-2020

1. Introduction to the area


In today’s competitive world it becomes important to service client more appropriately and
quickly. File exchange or information exchange or knowledge exchange or document
exchange is one of often process we see among employees of organization, faculty of
university or college, students of university or college, research personnel of different
department, etc.

Most common way of file exchange is emailing, blog or websites, etc. These way have
certain issues like information value of file (rating level may be high or low), security,
availability, etc.

The purpose of this project will manage file sharing. The main objective of our project
“Rating Based File Retrieval” is to develop a high performance network, whose primary
purpose is to build a storage devices which stores files and allow users to access, share and
modify it from anywhere through LAN.

Generally users upload files and save them on their RBFR Userfolder and if they want to
share that file with any other user then personally they have to send their files using some
services which is tedious work. The other factor is security issue i.e. while sharing the file,
user is not sure that whether it reaches to the correct destination without tempering file. If
user wanted to modify the file which it had saved on its Userfolder, then either he has to
access same computer or has to take manually that file in any other storage.

This system is designed for LAN based file sharing.


 Following are few key features of this application:
 File Server maintains user information and respective user file uploads.
 Server also records the rating on each user file uploaded (the rating is average score
out of 5).
 Each Client registers with server by providing user_id, name, address/department
details and password.
 Each user will be able to upload the Document files, PowerPoint Presentation(ppt) file
and Portable Document Format(pdf) file in the server, server stores the user file in
userfolder and filename in Masterfile along with user_id, file_type, accession number,
total rating and average rating are stored.

Department of MCA BEC, BGK Page 1


RATING BASED FILE RETRIEVAL 2019-2020

 User can request file to the server by sending filename “string” and server searches
matching filename in “master-file” and selects top rated file and sends to the
requested user after consulting to file owner.

Functional Requirements
 Following are modules of this application:
 Client module
i. Client Registration
ii. Client Login
iii. File Upload
iv. File Download
 Server module
i. Records Client Registration
ii. Processes Client Login Request
iii. Handles Download Process
iv. Support Rating Mechanism

A file is an object on a computer that stores data, information, settings, or commands used
with a computer program. In a GUI (graphical user interface), such as Microsoft Windows,
files display as icons that relate to the program that opens the file.
A file is created using a software program on the computer. For example, to create a text
file you would use a text editor, to create an image file you would use an image editor, and to
create a document you would use a word processor.
File sharing is the practice of distributing or providing access to digital media, such as
documents, pdf, ppt, images, audio, etc.
File sharing has three major types. In a network, one file-sharing method or any combination
of these three can be used. They are as below:
i. System-native
Our application is constructed by this type of file sharing. It is usually intended for sharing
files in our local network with other users. Sometimes it allows printers also. But it is not
intended for sharing files with others across the LAN.
ii. Client- server
Client-server file sharing model covers different types of fields such as web pages, email,
FTP, cloud storage services and so on. Sometimes the data is stored in a central location. It is
called the server. Often the data is sent separately to each user who requests it. They are called
the clients. This model is mainly used for file sharing through internet.

Department of MCA BEC, BGK Page 2


RATING BASED FILE RETRIEVAL 2019-2020

iii. Peer-to-peer
This model of file sharing doesn't rely on a single central server. Any two users can exchange
Data, information or resources with each other directly where one user can upload files to the
Server and another user can download files from the server. We are using client-server file
sharing type and the concept of peer-to-peer within local network.

A local area network (LAN) is a computer network that interconnects computers within a
limited area such as a residence, school, laboratory, university campus or office building.
Ethernet and Wi-Fi are the two most common technologies in use for local area networks.
The TCP/IP-based LAN successfully supported Telnet, FTP, and a Defense Department
teleconferencing application.
A document is a written, drawn, presented, or memorialized representation of thought. In the
past, the word was usually used to denote a written proof useful as evidence of a truth or fact.
In the computer age, "document" usually denotes a primarily textual computer file, including
its structure and format, e.g. fonts, colors, and images. Contemporarily, "document" is not
defined by its transmission medium, e.g., paper, given the existence of electronic documents.
Documents are also distinguished from "realia", which are three-dimensional objects that
would otherwise satisfy the definition of "document" because they memorialize or represent
thought; documents are considered more as 2 dimensional representations. While documents
are able to have large varieties of customization, all documents are able to be shared freely,
and have the right to do so, creativity can be represented by documents, also. History, events,
examples, opinion, etc. all can be expressed in documents.

Following is discussion on Faculty and students information sharing in a


College/University :
This system allows college faculty to share important data as notifications with students. It
consists of a faculty login along with student login. Since college faculty operate through pc
and document uploading is simpler through a pc, the faculty pc and document uploading is
simpler through a pc, the faculty login is to be performed through a computer. Faculty may
upload documents of subject syllabus, timetable document, notifications, e notes etc through

Department of MCA BEC, BGK Page 3


RATING BASED FILE RETRIEVAL 2019-2020

their provided login. The documents are uploaded by faculty to different corresponding
departments. We propose to build this system on a LAN based server that allows faculty to
upload data and students may view and download required documents through their device.
Here students only see and download data of their particular semester. Rest data is hidden.
Faculty may access and upload/edit documents to any semester or add any notice as desired.

The authorized user generates the secure request for the interested search word and gives it to
the RBFR server. After receiving the request, the RBFR server uses the search word to get the
matched file IDs. If the match occurs, the RBFR server retrieves the file IDs of the most
relevant files to a search word instead of retrieving all the documents relevant to a search
word. The ranked order retrieved files then sent back to the user. In this study we concentrate
on single word search. Since the score is always same for the given search word, the result is
always accurate.

Department of MCA BEC, BGK Page 4


RATING BASED FILE RETRIEVAL 2019-2020

2. Literature survey.
2.1 Existing System
When the existing system was studied, it was found having some problems, existing system
was very time consuming and was not very efficient. The drawback of the existing system
has resulted in to the development of new system, which is very user friendly and efficient.
Existing system was also very low in performance.
In existing system user cannot share the files with multiple users at a time. And the files do
not have security as the user who received the file can share to a number of users.

2.2 Problems in the existing system.


 The system does not maintain the list of active user.

 Cannot Upload and Download the latest updates.

 Risk of mismanagement and of data when the project is under development.

 Less Security.

 No proper coordination between dissimilar Applications and Users.

 Fewer Users – Friendly.

 Manual system need man power a lot.

 Communication between customer and owner is not directly.

 The complete hierarchy is doesn’t implemented in manually.

 In big organization it is time consuming process.

2.3 Available solutions and their features.


2.3.1 Google Drive
Google Drive is like a server or place where you can store all your files and data into it
safely. You can store all your files such as videos, photos, Google Docs, PDFs and etc. It can
Department of MCA BEC, BGK Page 5
RATING BASED FILE RETRIEVAL 2019-2020

be managed by either individual or joint venture, both parties can manage the files in a joint
project. As an individual user, you will get to store and exchange 5 GB of data for free, which
is pretty much enough to never have to upgrade your account.

Drawbacks of Google Drive:


 The uploading and downloading speed, when millions of user are uploading and
downloading together, the speed will be slow down.
 It might happen will be the hackers who hack or remove your important data, or they
install virus into your server and your files are gone.

2.3.2 Box
Box turns creating, editing, and reviewing files into team actions, and keeps a close eye on
your information for unparalleled security. Box offers “enterprise grade security and
scalability”. Box will provide you with centralized security controls and reporting, customer-
managed encryption, rights management, and mobile security; and integrate with all popular
apps for day-to-day use to make data transfers painless.
Drawbacks:

Department of MCA BEC, BGK Page 6


RATING BASED FILE RETRIEVAL 2019-2020

Box doesn’t offer an easy way to preview files, this makes it difficult to use Box as a
repository of marketing materials, such as our image library. “Under the free plan, files larger
than 250MB can't be uploaded,” says Joanan Hernandes, founder of mobile
platform Mollejuo. “So you have to split the files when you need to upload larger files.”

Department of MCA BEC, BGK Page 7


RATING BASED FILE RETRIEVAL 2019-2020

2.3.3 Apple iCloud


Apple iCloud is a cloud-based file-syncing and storage solution that enables users to store
files in the cloud. The stored files are automatically synced to all devices in the account,
including both Windows and Mac systems. The service comes with offerings such as the
iCloud Photo Library and iCloud Drive, where you can keep all your files stored securely and
updated everywhere. The Family Sharing feature lets you easily share photos, movies, music,
and more with your family members. Find My iPhone helps you find your Apple device if
you lose it.Apple iCloud lets you make folders for your files and provides online storage
for productivity suite software such as Keynote, Numbers, and Pages. Setup is easy on all
devices, including PC, Mac, iPod touch, iPhone, and iPad.

Drawbacks: When you want to backup iPhone data, you must be connected to Wi-Fi
network. Without it, back up would not be possible. Security issue: The fact you are storing
your data to a third party opens the possibility that someone who is not authorized will access
your files.
Not all files are included in iCloud backup: For example, media files, you didn’t buy at the
iTunes Store, will not be included. Slow transfer of large files: Many users noticed the large
files are transferring really slowly.

Department of MCA BEC, BGK Page 8


RATING BASED FILE RETRIEVAL 2019-2020

2.3.4 OneDrive For Business


Microsoft’s OneDrive for Business is one of the best-regarded file sharing applications
in the market, leveraging Microsoft’s vast experience and resources to provide customers
with a stable, robust, and well-protected system to collaborate with shared files.As one of
the top file sharing software solutions today, OneDrive for Business is covered by one
of the toughest, most stringent security protocols available in the market. This means that
your most important files stay in the server and not ending up in the hands of online
predators. The program protects you from the headache of accidentally losing files by
allowing easy recovery even in the case of malicious attacks. Your business can start
OneDrive for just $5 per user/month.

Drawbacks:

 Limited document size support. Microsoft does not support files over 15 GB in
OneDrive for Business. For larger files, you may need to use a compression utility.
 Lack of third-party app integrations. Currently, OneDrive for Business does not
enable third-party app integrations as compared to OneDrive for personal use. You
cannot sync your business account with apps such as calendars or message boards.
 Limited offline access. Although OneDrive for Business allows offline access once a
user downloads the app, that is about as far as offline access goes. If your team needs to
conduct a great deal of work offline, you may want a more robust system.

Department of MCA BEC, BGK Page 9


RATING BASED FILE RETRIEVAL 2019-2020

2.3.5 Synology Drive

Synology Drive is the leading file-sharing software that provides fast and secure ways to
share and store your files. The platform allows you to explore, add, and organize your files
using a mobile phone or tablet. It is compatible with Android, iOS, and Windows Phone,
making it easier for you to access your files from any device of your choice.

It gives you an effortless cloud-storage experience at over 112 MBps of sequential reading.
All your files are also protected by Synology Drive’s robust multi-version backup solution.
This way, your digital assets are safe from malicious attacks and ransomware threats. Aside
from this, the system has sync functionalities so you can easily see all the changes in your
storage and access files throughout different offices.

Real-time file synchronization: With Synology Drive’s Client desktop application, you can
sync your files between Windows, macOS, Ubuntu computer, and Synology Drive Server.
The on-demand sync feature also lets you see all the files without consuming PC storage.

Department of MCA BEC, BGK Page 10


RATING BASED FILE RETRIEVAL 2019-2020

2.3.6 Share File


A platform that enables secure content collaboration and document exchange,
Citrix’s ShareFile is a powerful file sharing alternative for small, medium, and large
businesses. Using the platform, your team members will be able to share confidential
information regardless of size or format and access their files in secure mode from any device
or operating system. At the same time, you will have the peace of mind that all files, emails,
and attachments are encrypted in line with bank-grade standards, both when in transit and at
rest. ShareFile is also the industry master of co-editing and co-authoring, as it allows team
members to make notes and work simultaneously on a single task. Another reason to consider
it is the use of 10-plus industry-specific modules that provide users with specific features and
personalized branding.

Drawbacks:

 When you download files from people that you don’t know or from the untrusted
source, there is a risk of downloading viruses, Trojans and other types of malware.
 Some end-users are confident in illegally downloading files if an antivirus program
has support to scan the connections and the files, using a P2P application.
 They don’t realize that their private information is at risk, in addition to facing the law
if their ISP has determined that their customer is using the internet connection
illegally.

Department of MCA BEC, BGK Page 11


RATING BASED FILE RETRIEVAL 2019-2020

2.4 Proposed solution.


The proposed system should have the following features. The uploading/sharing of
files take place in a secured format between various clients in the rbfr server. It
provides flexibility to the user to upload the files through the local area network very
easily by compressing the large amount of file. It should also identify the user and
provide the communication according to the prescribed level of security with transfer
of the file requested and run the required process at the server if necessary. In this
application the data will be send through the local area network as a doc file or pdf
file or image file or ppt file. The user who received the best rated file from the rbfr
server will do the operations like de embedding, decryption, and decompress in their
level of hierarchy etc.

2.5 Advantages of proposed solution.


 Rating based file retrieval application is secure to file exchange or sharing.
 There is no need for third parties like website, email and blog to do file exchange.
 You can store, track, and share files through your rating based file retrieval, you don’t
have to invest in anther file sharing solution.
 Every file is stored within your software, everyone with the proper security rights can
access it no more waiting on someone else to dig it up for you.
Department of MCA BEC, BGK Page 12
RATING BASED FILE RETRIEVAL 2019-2020

 Increasing the speed and reducing the cost.

Department of MCA BEC, BGK Page 13


RATING BASED FILE RETRIEVAL 2019-2020

3. Problem Definition
3.1 Problem definition
 Understanding the file formats which can be transferred from the clients (format like a
jpg, docx, and pdf).
 Identifying the particular client at where the file has been uploaded providing the feature
file along with client name.
 Providing an appropriate platform for the file which can be easily opened.
 Searching the file in a machine whether file is not available in a machine, file will be send
by server to other clients for searching.

3.2 Proposed solution


 Reducing the paper work, sending and receiving the file very easily.
 Proper and appropriate way of communication. Mistakes cannot be occurred during time
of file uploading or downloading from any client.
 Any files can be uploaded and downloaded and view according to the file formats. Every
file is stored within you software, everyone with the proper security can access it.

Department of MCA BEC, BGK Page 14


RATING BASED FILE RETRIEVAL 2019-2020

3.3 Development process

RBFR Client 1
RBFR Server
Register
Userfolder
Register
RBFR Client 2 L RBFR
A Userfile 1
N
Master
File upload RBFR
file
Userfile 2
File request

RBFR
Userfile n

RBFR Client n

File download

Department of MCA BEC, BGK Page 15


RATING BASED FILE RETRIEVAL 2019-2020

Software requirement specification


4.1 Purpose, scope
Purpose:
Purpose of this is to provide rating based file exchange or file sharing for users. Each user
will be able to upload or download the Document files, PowerPoint Presentation (ppt) file
and Portable Document Format (pdf) file in the server, and server stores the user file in
userfolder.

Scope:
The software developed is very helpful to users for file sharing or file exchange securely.
Users can store, track, and share files through our rating based file retrieval application, we
don’t have to invest in another file sharing solution. It indicates software that is capable for
providing the solution for all existing problem.

4.2 Product overview


In the “Rating Based File Retrieval” application n number of clients are allowed to register
and are allowed to upload files which are necessary for other clients, such files are stored in a
Master File in which each uploaded file is saved in different user folders, whenever a client
requests a file from the server, the server checks for the similar file in it’s master file if
present then the server will provide the list of identical files along with rating to the client
which he/she can download based on the highest rating(highest downloaded) file and can use
based on their purpose.

Department of MCA BEC, BGK Page 16


RATING BASED FILE RETRIEVAL 2019-2020

4.3 Functional requirement


Following are modules of this application:
 Client module is responsible for providing following facility for user:
i. Client Registration: Here Client registration form can filled out by user to register
through providing their personal and contact information, business details and
additional information and specific requirements if any. The form is fully
customizable where user can add/change/remove fields, add visual and informative
content, change the fonts, colors, and background and either embed it to website or
use it as standalone form. Client registers with server by providing user_id, name,
address/department details and password.

ii. Client Login: Here Client should login with server by providing username and
password. If both inputs are correct, the homepage will be displayed otherwise it shows
an error message (invalid inputs). Password is case sensitive.

iii. File Upload: It provides all rbfr users, without a necessity to upload files of type pdf,
ppt, and docs and to store them or share them with their friends easily. It encourages all
users to use the services as they want, without any hassle.

iv. File Download: Users can download files by requesting to the server. After completion of
an upload, a download link will be generated. User can share this link with anyone and when
they visit this link will see a download button and get these files easily. And the server will
send files which are well rated.

 Server module is responsible for providing following facility for client:


i. Records Client Registration: Server always does user authentication and records or
stores user information like user_id, name, address or department details and password.
Server first validates that the user requesting for the service is a registered user (valid) or
not.
ii. Processes Client Login Request: The server provides a function or service to one or
many clients, which initiate requests for such services. The client opens a connection to
the server and sends a request. The server
processes the request by generating following response like, addlog(adding log entries:
username and password), the client should enter correct inputs to get the next page, if an
error occurs in inputs, the server logs an error message and aborts the process.

Department of MCA BEC, BGK Page 17


RATING BASED FILE RETRIEVAL 2019-2020

iii. Handles Upload Process: User sends file upload request with upload command,
server receives file upload request and checks that service requesting user has enough
space available or not on the basis of the file size user wants to upload.
If server finds that user has enough available space it puts user’s file named “xyz.pdf”
in the userfolder named as “user_id” of this user. Otherwise server sends a message to
user that you don’t have enough storage available to upload the file.
After uploading the file server updates the available storage of the user by subtracting
the size of the uploaded file from the current available storage of the user.

iv. Handles Download Process: User sends file download request with download
command, and gives filename and a path where user wants to save the file, server
receives the download request from the user and checks in the userfolder named as
user_id for downloading.
If server finds the requesting file in the userfolder, it sends the file to the requesting
client and then client saves the received file in the place specified by the user.
If server doesn’t find requesting file in the userfolder, it sends a message to other clients
for asking to share the file (which is well rated), if they send message “yes” , then the
server sends the file to the requested client.

v. Support Rating Mechanism: Once all the clients uploads their files to server, server
stores them in a userfolder which are initially rated as 1. The rating mechanism is done,
when a client requests for a file which is not available in his userfile, server asks other
clients to share the file, if they share the fileserver sends the file to requested client and
asks him to rate it. Server records the rating on each user file uploaded (the rating is
average score out of 5).

4.4Performance requirement
4.5Exception handling
4.6Acceptance criteria
4.7Design hints and guidelines
4.8Glossary of terms
TCP – Transmission Control Protocol is one the main protocols of the Internet protocol suit.

Department of MCA BEC, BGK Page 18


RATING BASED FILE RETRIEVAL 2019-2020

UDP – User Datagram Protocol, which provides a connectionless datagram service that
prioritizes time over reliability.
Socket – A socket is one endpoint of a two-way communication link between two programs
running on the network
Swing – Swing is a GUI widget toolkit for Java. It is part of Oracle's Java Foundation
Classes (JFC) – an API for providing a graphical user interface (GUI) for Java programs.
IP – An Internet Protocol address is a numerical label assigned to each device connected to a
computer network that uses the Internet Protocol for communication. An IP address serves
two main functions: host or network interface identification and location addressing
Port – A port is a communication endpoint. At the software level, within an operating system,
a port is a logical construct that identifies a specific process or a type of network service.
LAN – A local area network is a computer network that interconnects computers within a
limited area such as a residence, school, laboratory, university campus or office building.
HTTP – The Hypertext Transfer Protocol is an application protocol for distributed,
collaborative, hypermedia information systems.
FTP – The File Transfer Protocol is a standard network protocol used for the transfer of
computer files between a client and server on a computer network.

Department of MCA BEC, BGK Page 19


RATING BASED FILE RETRIEVAL 2019-2020

4.9 Technology requirement.

Hardware Requirements

Processor Core i3 or above


RAM 1 GB or above
Hard-Disk 250 GB or above
LAN 100mbps or above

Software Requirements

Operating System Windows 10


Development programming language JDK 8
Front End Java Swings
IDE NetBeans 8.0.1

Department of MCA BEC, BGK Page 20


RATING BASED FILE RETRIEVAL 2019-2020

5. System design
5.1 Use Case Model

Register to the
application

Login to the
application

View profile

Upload Files
Client

Download Files

Rate’s the
downloaded file

Department of MCA BEC, BGK Page 21


RATING BASED FILE RETRIEVAL 2019-2020

5.2 Sequence Model

Department of MCA BEC, BGK Page 22


RATING BASED FILE RETRIEVAL 2019-2020

6. Detailed design
6.1 High level design
6.1.1 ER model/Object model
Object Model: - An Object is an instance of a class. A specific thing that has a
specific value of the attributes and behavior.

1. Object Model for Client.


Client1: Client

2. Object Model for Server.


server: Server

3. The Relationship among Server and Clients can be represented using the
object model.

Client
2

Client Client
1 3

Server

Client Client

n 4

….

6.1.2 DFD/State diagram


Department of MCA BEC, BGK Page 23
RATING BASED FILE RETRIEVAL 2019-2020

State Diagram: -
At any given time an object is in a particular state. State Transition
Diagrams are used to represent the states of an object at any particular instance.

Client Server
Initiated Sends Host Address Initiated

Server

Client gets updated Updated

Exit

Client Exited Server


Removes
Client

Server in
Listening
State

Department of MCA BEC, BGK Page 24


RATING BASED FILE RETRIEVAL 2019-2020

6.2 Low level design


62.1 Relational model/Flowchart and Pseudocode/Source code

Department of MCA BEC, BGK Page 25


RATING BASED FILE RETRIEVAL 2019-2020

7. Implementation

Department of MCA BEC, BGK Page 26


RATING BASED FILE RETRIEVAL 2019-2020

8. Testing & Results

Department of MCA BEC, BGK Page 27


RATING BASED FILE RETRIEVAL 2019-2020

9. Conclusion.

Department of MCA BEC, BGK Page 28


RATING BASED FILE RETRIEVAL 2019-2020

10. Future Enhancement

Department of MCA BEC, BGK Page 29


RATING BASED FILE RETRIEVAL 2019-2020

11. Reference books, sites and other resources.


Text Books
 Herbert Schildt, “Java The Complete Reference” 7 th Edition, Tata McGraw-Hill
Education 2001.
 E Balagurusamy, “Programming with Java: A primer” 4th Edition, Tata McGraw-Hill
Education 2009.

Websites
 https://www.tutorialpoint.com
 https://www.javatpoint.com

Department of MCA BEC, BGK Page 30


RATING BASED FILE RETRIEVAL 2019-2020

12. Index

Department of MCA BEC, BGK Page 31


RATING BASED FILE RETRIEVAL 2019-2020

13. Appendices
13.1 User manual

Department of MCA BEC, BGK Page 32


RATING BASED FILE RETRIEVAL 2019-2020

13.2 About tools/technology.

Department of MCA BEC, BGK Page 33


RATING BASED FILE RETRIEVAL 2019-2020

Prepare document with the consultation of your guide covering all above points. Content in
the document should be with following formatting
Font style times new roman for entire document.

Topic heading (center alignment, 16 font size, and bold)


Sub heading (left justified, 14 font size, bold)
Other text (justified, 12 font size, 1.5 line spacing).

Header (font size 10, bold, Italics)


Project title Page No./ Total
Pages

Footer (font size 10, bold, Italics)


Department of MCA BEC, BGK Academic Year

All the textbook references should be in IEEE (A, B, E, P, Y) format, complete URL of
websites visited should be present and detail of other resources should be included.

Cover page and certificate should be color printed on the card sheet.
3 copies (1 guide, 1 department and 1 self copy) of Project report must be prepared and
should be spiral bound using white colored translucent cover sheet and navy blue
opaque back sheet.

Department of MCA BEC, BGK Page 34

You might also like