You are on page 1of 9

Deception-

Detection-
of Amazon
reviews
Project Guide and Team

Project Guide:
Ms. Anuranjana

Project Team:
Saurav Goyal (A2305316024)
Yavar Khan (A2305316026)
Hardik Sawhney (A2305316054)
Project Summary And Methodology

Project Summary Project Methodology


 In this project, we will be using NLP toolkit to pre-
 Natural Language Processing
process the training data.
(NLP) :
 These pre-processed data will be used in the
 NLP, is an attempt to make a
classifier (algorithm) to make machine understand
the pattern of raw data. computer understand human
language. Computers can easily
 They will help us to identify whether the reviews
understand programming languages.
are authentic or fake.
Classification Algorithm

There are two forms of data analysis that can be used for extracting models describing
important classes or to predict future data trends. These two forms are as follows −
 Classification
 Prediction
Classification models predict categorical class labels; and prediction models predict
continuous valued functions. For example, we can build a classification model to
categorize bank loan applications as either safe or risky, or a prediction model to
predict the expenditures in dollars of potential customers on computer equipment
given their income and occupation.
 In our project we’ll be focusing on the classification algorithm.
How Does
Classification Works?

The Data Classification process includes two


steps −
 Building the Classifier or Model
 Using Classifier for Classification

In the end we use the predictive model to


find the accuracy of the classifier model.
Building the Classifier
or Model

•This step is the learning step or


the learning phase.
•In this step the classification
algorithms build the classifier.
•The classifier is built from the
training set made up of database
tuples and their associated class
labels.
•Each tuple that constitutes the
training set is referred to as a
category or class. These tuples
can also be referred to as sample,
object or data points.
Using Classifier for
Classification

In this step, the classifier is


used for classification. Here
the test data is used to
estimate the accuracy of
classification rules. The
classification rules can be
applied to the new data
tuples if the accuracy is
considered acceptable.
 Comparative analysis of different
classifiers and their performances.
Future  Plotting of comparative analysis
scope on a GUI webpage.
 Google assistant integration for
inputs.
References/Books

• Advanced Data Analytics Using Python: With Machine Learning, Deep Learning and
NLP Examples by Sayan Mukhopadhyay
• https://www.geeksforgeeks.org/machine-learning/
• https://machinelearningmastery.com/natural-language-processing/

You might also like