Professional Documents
Culture Documents
Project Submitted to
KAVAYITRI BAHINABAI CHAUDHARI NORTH MAHARASHTRA
UNIVERSITY,JALGAON
In fulfillment of requirement for the award of degree of
BACHELOR OF ENGINEERING
Under the
Faculty of Engineering and Technology
In the discipline
Computer Engineering
By
Nayan S. Patil
Puja M. Badhe
Shubham D. Mahale
Monika V. More
Narendra P. Marathe
B.E. Computer
Guide
Mr. Rahul Gaikwad
Assistant Professor
CERTIFICATE
This is to certify that the Project titled, " CrowdSourcing" submitted by Nayan S.
Patil, Puja M. Badhe, Shubham D. Mahale, Monika V. More, Narendra P. Marathe.
In fulfillment of the degree of BACHELORS OF ENGINEERING in Department of
Computer Engineering, Godavari College of Engineering, Kavayitri Bahinabai Chaud-
hari North Maharashtra University, Jalgaon is a bonafide record of work carried out
by his/her in the Department of Computer Engineering, Kavayitri Bahinabai Chaud-
hari North Maharashtra University, Jalgaon under my guidance and supervision. In my
opinion this work has attained the standard fulfilling the requirements of the regulations
of the University.
Date:
Place:Jalgaon
Dr. V. H. PATIL
PRINCIPAL
Godavari Foundation’s
Godavari College of Engineering, Jalgaon
DECLARATION / UNDERTAKING
We hereby declare that the work presented in this Project "CrowdSourcing" was carried
out by us under the supervision of guide from July-2019 to May-2020.
This Work or any part of this work is based on original research and has not been sub-
mitted by us to any University/Institution for the award of any degree.
Date:
Place: Jalgaon
Last but not least; we are thankful to the God, my dearly beloved Parents, all Faculty
Members, my Friends and all who directly or indirectly supported for completion.̇..
List of Figures iv
Abstract 1
1 INTRODUCTION 2
1.1 Motivation of Project . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
3 LITERATURE REVIEW 6
3.1 Existing System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.2 Proposed System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.3 Feasibility Study . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
4 ANALYSIS 9
4.1 Software requirement specification . . . . . . . . . . . . . . . . . . . . 9
4.1.1 Purpose and Scope of Document . . . . . . . . . . . . . . . . . 9
4.1.2 Overview of responsibilities of Developer . . . . . . . . . . . . 9
4.2 System requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
4.2.1 Hardware Requirment . . . . . . . . . . . . . . . . . . . . . . 10
4.2.2 Software Requirements . . . . . . . . . . . . . . . . . . . . . . 10
4.3 Functional Requirement . . . . . . . . . . . . . . . . . . . . . . . . . . 10
i
4.3.1 Performance Requirement . . . . . . . . . . . . . . . . . . . . 10
4.3.2 Safety Requirement . . . . . . . . . . . . . . . . . . . . . . . . 10
4.3.3 Security Requirement . . . . . . . . . . . . . . . . . . . . . . . 11
4.4 Non-Functional Requirement . . . . . . . . . . . . . . . . . . . . . . . 11
4.4.1 Performance Requirement . . . . . . . . . . . . . . . . . . . . 11
4.4.2 Safety Requirement . . . . . . . . . . . . . . . . . . . . . . . . 11
4.4.3 Security Requirement . . . . . . . . . . . . . . . . . . . . . . . 11
4.4.4 Software Quality Attributes . . . . . . . . . . . . . . . . . . . . 11
4.5 Software Interface Description . . . . . . . . . . . . . . . . . . . . . . 12
4.5.1 What is Java . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
5 DESIGN 16
5.1 System Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
5.1.1 Project Idea . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
5.1.2 Project Concept . . . . . . . . . . . . . . . . . . . . . . . . . . 17
5.2 System Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
5.3 Data Flow Diagram . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
5.4 UML Diagram . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
5.4.1 Activity Diagram . . . . . . . . . . . . . . . . . . . . . . . . . 19
5.4.2 Use Case Diagram . . . . . . . . . . . . . . . . . . . . . . . . 20
5.4.3 Package Diagram . . . . . . . . . . . . . . . . . . . . . . . . . 21
5.4.4 Class Diagram . . . . . . . . . . . . . . . . . . . . . . . . . . 22
5.4.5 Sequence Diagram . . . . . . . . . . . . . . . . . . . . . . . . 23
6 CODING 24
6.1 Front-End Languages . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
6.1.1 Java . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
6.1.2 HTML . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
6.1.3 CSS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
6.1.4 Apache Tomcat . . . . . . . . . . . . . . . . . . . . . . . . . . 25
ii
7 TESTING 26
7.1 Type of Testing Used . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
7.2 Test Cases and Test Results . . . . . . . . . . . . . . . . . . . . . . . . 29
iii
List of Figures
iv
Abstract
CrowdSourcing is a platform which may contain opinions, thoughts, facts and other in-
formation. Within it, many and various communities are originated by users with com-
mon interests, or with similar ways to feel part of the community. This paper presents
a possible combined approach between Social Network Analysis and Sentiment Anal-
ysis. In particular, we have tried to associate a sentiment to the nodes of the graphs
showing the social connec- tions, and this may highlight the potential correlations. The
idea behind it is that, on the one hand, the network topology can contextualize and then,
in part, unmask some incorrect results of the Sentiment Analysis; on the other hand, the
polarity of the feeling on the network can highlight the role of semantic connections
in the hierarchy of the communities that are present in the network. In this work, we
illustrate the approach to the issue, together with the system architecture and, then, we
discuss our first results.
Chapter 1
INTRODUCTION
Crowdsourcing is one of the most widely studied applications of Natural Language Pro-
cessing (NLP) and Machine Learning (ML). This field has grown tremendously with the
advent of the Web 2.0. The Internet has provided a platform for people to express their
views, emotions and senti- ments towards products, people and life in general. Thus, the
Internet is now a vast resource of opinion rich textual data. The goal of crowdsourcing
is to harness this data in order to obtain important information regarding public opinion,
that would help make smarter business decisions, political campaigns and better prod-
uct consumption. Sentiment Analysis focuses on identifying whether a given piece of
text is subjective or objective and if it is subjective, then whether it is negative or posi-
tive. Sentiment Analysis deals with the computational treatment of opinion, sentiment,
and subjectivity of texts. Sentiment Analysis starts with a small question: What other
people think?, and finally convert into billions of dollars of commercial deal. After the
great success of Web-2.0, Sentiment Analysis became a demanding and commercially
supported research field.
2
CROWDSOURCING
Alert users from the emergency while getting the notification from the system
the project objective comprises of 3 parts
1.Performing crowdsource sentiment analysis on posts for 2019 indian election using
twitter public stream comparing two parties.
2.creating dummy account and then performing analysis on it which will enable us to
fetch the location of users.
3.hosting it through AWS Cloud platform.
CrowdSourcing 3
Chapter 2
4
CROWDSOURCING
CrowdSourcing 5
Chapter 3
LITERATURE REVIEW
6
CROWDSOURCING
CrowdSourcing 7
CROWDSOURCING
CrowdSourcing 8
Chapter 4
ANALYSIS
9
CROWDSOURCING
The application is designed in modules where errors can be detected and fixed easily.
This makes it easier to install and update new functionality if required.
CrowdSourcing 10
CROWDSOURCING
Users all details are confidentiality. Data is modified by authorized person in authorized
manner. User need to go throw authentication process. Permissions to be assigned to
All Authenticated entities
The application is designed in modules where errors can be detected and fixed easily.
This makes it easier to install and update new functionality if required.
All data will be encrypted using strong encryption algorithm and according to location
encryption is done.
Our software has many quality attribute that are given below:-
• Availability: This software is freely available to all users. The availability of the
software is easy for everyone.
CrowdSourcing 11
CROWDSOURCING
• Maintainability: After the deployment of the project if any error occurs then it can be
easily maintained by the software developer.
• Reliability: The performance of the software is better which will increase the reliabil-
ity of the Software.
• User Friendliness: Since, the software is a GUI application; the output generated is
much user friendly in its behavior.
Java is a programming language and a platform. Java is a high level, robust, secured
and object-oriented programming language. Platform: Any hardware or software en-
vironment in which a program runs, is known as a platform. Since Java has its own
runtime environment (JRE) and API, it is called platform.
Where it is used? According to Sun, 3 billion devices run java. There are many devices
where java is currently used. Some of them are as follows:
1. Desktop Applications such as acrobat reader, media player, antivirus etc.
2. Web Applications such as irctc.co.in, javatpoint.com etc.
3. Enterprise Applications such as banking applications.
4. Mobile
5. Embedded System
6. Smart Card
7. Robotics
8. Games etc.
CrowdSourcing 12
CROWDSOURCING
2) Web Application
An application that runs on the server side and creates dynamic page, is called web
application. Currently, servlet, jsp, struts, jsf etc. technologies are used for creating
web applications in java.
3) Enterprise Application
An application that is distributed in nature, such as banking applications etc. It has
the advantage of high level security, load balancing and clustering. In java, EJB is used
for creating enterprise applications.
4) Mobile Application
An application that is created for mobile devices. Currently Android and Java ME
are use for creating mobile applications.
Features of Java
There is given many features of java. They are also known as java buzzwords. The
Java Features given below are simple and easy to understand.
Simple
CrowdSourcing 13
CROWDSOURCING
1. Object
2. Class
3. Inheritance
4. Polymorphism
5. Abstraction
6. Encapsulation
What does Java Development Kit (JDK) mean?
The Java Development Kit (JDK) is a software development environment used for
developing Java applications and applets. It includes the Java Runtime Environment
(JRE), an interpreter/loader (java), a compiler (javac), an archiver (jar), a documentation
generator (javadoc) and other tools needed in Java development. People new to Java
may be confused about whether to use the JRE or the JDK. To run Java applications and
applets, simply download the JRE. However, to develop Java applications and applets
as well as run them, the JDK is needed.
Java developers are initially presented with two JDK tools, java and javac. Both
are run from the command prompt. Java source files are simple text files saved with
an extension of .java. After writing and saving Java source code, the javac compiler is
CrowdSourcing 14
CROWDSOURCING
invoked to create .class files. Once the .class files are created, the ’java’ command can
be used to run the java program.
For developers who wish to work in an integrated development environment (IDE),
a JDK bundled with Netbeans can be downloaded from the Oracle website. Such IDEs
speed up the development process by introducing point-and-click and drag-and-drop
features for creating an application.
There are different JDKs for various platforms. The supported platforms include
Windows, Linux and Solaris. Mac users need a different software development kit,
which includes adaptations of some tools found in the JDK
SQLite Database
SQLite is an embedded SQL database engine. Unlike most other SQL databases,
SQLite does not have a separate server process. SQLite reads and writes directly to
ordinary disk files. A complete SQL database with multiple tables, indices, triggers,
and views, is contained in a single disk file. The database file format is cross-platform
- you can freely copy a database between 32-bit and 64-bit systems or between big-
endian and little-endian architectures. These features make SQLite a popular choice as
an Application File Format. Think of SQLite not as a replacement for Oracle but as a
replacement for fopen()
SQLite is a compact library. With all features enabled, the library size can be less
than 500KiB, depending on the target platform and compiler optimization settings. (64-
bit code is larger. And some compiler optimizations such as aggressive function inlining
and loop unrolling can cause the object code to be much larger.) If optional features are
omitted, the size of the SQLite library can be reduced below 300KiB. SQLite can also
be made to run in minimal stack space (4KiB) and very little heap (100KiB), making
SQLite a popular database engine choice on memory constrained gadgets such as cell-
phones, PDAs, and MP3 players. There is a tradeoff between memory usage and speed.
SQLite generally runs faster the more memory you give it. Nevertheless, performance
is usually quite good even in low-memory environments.
CrowdSourcing 15
Chapter 5
DESIGN
This stage is the preliminary, informal conceptualization and vetting of a project idea
among Colleagues within, and perhaps also outside.
16
CROWDSOURCING
Once a project idea has been agreed upon, a project concept paper should be developed
in order to flesh out the idea, and enable those responsible for appraising the concept to
determine whether the proposed undertaking is viable, fits with perceived conservation
needs. A project concept paper is essentially a brief conceptualization of a project inter-
vention prepared prior to a Feasibility study and ultimately a full, participatory project
design undertaking. Developing a project concept involves the initial formulation of
project goals, targets, outputs,and activities in Summary form within the framework of
the WWF Global Priorities.
CrowdSourcing 17
CROWDSOURCING
CrowdSourcing 18
CROWDSOURCING
CrowdSourcing 19
CROWDSOURCING
CrowdSourcing 20
CROWDSOURCING
CrowdSourcing 21
CROWDSOURCING
CrowdSourcing 22
CROWDSOURCING
CrowdSourcing 23
Chapter 6
CODING
Front-End means a language that is used for user interface designing and coding. We
used our front end Java, HTML and CSS and use the Apache Tomcat server.par
Java is a programming language and a platform. Java is a high level, robust, secured
and object-oriented programming language. HTML (HyperText Markup Language) is
the most basic building block of the Web.
Cascading Style Sheets (CSS) is a simple mechanism for adding style (e.g., fonts,
colors, spacing) to Web documents. Apache Tomcat (sometimes simply "Tomcat") is
an open-source implementation of the Java Servlet, JavaServer Pages, Java Expression
Language and WebSocket technologies.
6.1.1 Java
Java is a programming language and a platform. Java is a high level, robust, secured
and object-oriented programming language. Platform: Any hardware or software en-
vironment in which a program runs, is known as a platform. Since Java has its own
runtime environment (JRE) and API, it is called platform.
24
CROWDSOURCING
6.1.2 HTML
HTML (HyperText Markup Language) is the most basic building block of the Web. It
defines the meaning and structure of web content. Other technologies besides HTML
are generally used to describe a web page’s appearance/presentation (CSS) or function-
ality/behavior (JavaScript).
6.1.3 CSS
Cascading Style Sheets (CSS) is a simple mechanism for adding style (e.g., fonts, col-
ors, spacing) to Web documents.These pages contain information on how to learn and
use CSS and on available software. They also contain news from the CSS working
group.
6.2 Snapshots
CrowdSourcing 25
CROWDSOURCING
authentic, giving the user can access the restricted page. If a user is not able to provide
authentic credentials they will not be able to proceed past the login form.
**FIGURE***
After entering the correct user name and password we get a message of Login suc-
cessful, welcome admin and we can enter in the system. It is for security purpose.
The home page is the first page which shows what our system does and it shows 3
different choices such as Item Master in which we can add different items f different-
categories, Inventory that contains sales in which we can create a bill and add different
items for creating bill with GST and Exit option for exiting from the system.
CrowdSourcing 26
Chapter 7
TESTING
27
CROWDSOURCING
adopted. Different software development models will focus the test effort at different
points in the development process. Newer development models, such as Agile, often
employ test driven development and place an increased portion of the testing in the
hands of the developer, before it reaches a formal team of testers. In a more traditional
model, most of the test execution occurs after the requirements have been defined and
the coding process has been completed.
Test Plan
To test this application we are going with proper sequencing of testing like unit, inte-
gration, validation, GUI, Low level and High level test cases, major scenarios likewise.
We will go with the GUI testing first and then integration testing. After integration test-
ing performs the high level test cases and major scenarios which can affect the working
on the application. We will perform the testing on the data transmitted using the various
inputs and outputs and validate the results. It also intends to cover any deviations that
the project might take from the initially agreed Test Strategy in terms of scope, testing
methodology, tools, etc.. This test plan covers details of testing activities for this project
and scope.
GUI Testing :
Graphical User Interface (GUI) testing is the one of the mechanism in which user
interface developed System Under some graphical rules. GUI testing includes check-
ing various controls- menus, buttons, icons, dialog boxes and windows etc. Proposed
system is tested for user inputs against different mod- ules, validations are done. GUI
is tested for appearance of different controls, visibility graphs is tested. GUI testing
involves following actions:
1. Check all elements for size, position, width, length and acceptance of characters or
numbers. For instance, you must be able to provide inputs to the input fields.
2. Overall functionality related with performance of users graphical interface are checked.
3. Check Error Messages are displayed correctly
4. Check the font, layout details, style and display warning messages if it is false.
5. check the positioning of GUI elements.
CrowdSourcing 28
CROWDSOURCING
Unit Testing :
It is the testing of individual software units of the application .it is done after the
complexion of an individual unit before integration. Unit testing involves the design
of test cases that validate that the internal program logic is functioning properly, and
that program inputs produce valid outputs. All decision branches and internal code
flow should be validated. This is a struc- tural testing, that relies on knowledge of its
construction and is invasive. Unit tests perform basic tests at component level and test
a specific business pro- cess,application, and/or system configuration. Unit tests ensure
that each unique path of a business process performs accurately to the documented
specifications and contains clearly defined inputs and expected results.
Integration Testing :
Integration tests are designed to test integrated software components to de- termine
if they actually run as one program. Testing is event driven and is more concerned with
the basic outcome of screens or fields. Integration tests demonstrate that although the
components were individually satisfaction, as shown by successfully unit testing, the
combination of components is cor- rect and consistent. Integration testing is specifically
aimed at exposing the problems that arise from the combination of components.
Testing Strategy
Software testing methods are traditionally divided into white- and black-box testing.
These two approaches are used to describe the point of view that a test engineer takes
when designing test cases.
White-box testing :
In white-box testing an internal perspective of the system, as well as pro- gramming
skills, are used to design test cases.
Black-box testing :
Black-box testing treats the software as a and quot;black box and quot;, examining
functionality without any knowledge of internal implementation. The testers are only
aware of what the software is supposed to do, not how it does it.
Grey-box testing
CrowdSourcing 29
CROWDSOURCING
Grey-box testing involves having knowledge of internal data structures and algo-
rithms for purposes of designing tests, while executing those tests at the user, or black-
box level. The tester is not required to have full access to the software.
CrowdSourcing 30
CROWDSOURCING
CrowdSourcing 31
CROWDSOURCING
CrowdSourcing 32
Chapter 8
8.1 Conclusion
The rise of social network has fuelled interest in sentiment classification. Promptly and
correctly classifying sentiment from the text has become an important task for individ-
uals and companies. In the development of prediction models to classify the reviews,
more reliable approaches are expected to reduce the misclassifications. In this study,
the results of various hybrid methods are empirically evaluated on datasets of different
size for use in sentiment mining. Among the methods used, hybrid ensemble method
(HEM1) is highly robust in nature for balanced data models I, II and III, which is studied
through various quality parameters. The analysis also shows that the compound com-
bination of unigram, bigram and trigram performs better for almost all the prediction
methods. To handle imbalance data distribution in real time applications, it is observed
from the results that using SVMs for class prediction can be influenced by the data im-
balance, although SVMs can adjust itself well to some degree of data imbalance. To
cope with the problem, rebalancing the data is chosen as a promising direction, but both
under sampling and over sampling have limitations.
Through extensive experiments with benchmark and real application datasets, the
33
CROWDSOURCING
CrowdSourcing 34
Bibliography
1 B. Liu, Sentiment analysis and opinion mining, Synthesis Lectures on Human Lan-
guage Technologies, vol. 5, no. 1, pp. 1-167, May 2012.
3 V. Ng, S. Das gupta, and S. M. N. Arifin, Examining the role of linguistic knowledge
sources in the automatic identification and clas- sification of reviews, in Proceedings
of the COLING/ACL on Main Conference Poster Sessions, ser. COLING-ACL 06.
Stroudsburg,PA, USA: Association for Computational Linguistics, 2006,pp. 611-618.
4 J. Zhao, K. Liu, and G. Wang, Adding redundant features for crfs- based sentence
sentiment classification, in Proceedings of the Confer- ence on Empirical Methods in
Natural Language Processing
35