Professional Documents
Culture Documents
I hereby declare that this report submission is my own work and that, to
the best of my knowledge and belief, it contains no material previously published
or written by another person nor material which has been accepted for the
award of any other degree or diploma of the university or other institute of higher
learning, except where due acknowledgment has been made in the text.
Place:
Date
CERTIFICATE
Certified that this thesis entitled Sentiment Analysis Using Hybrid Cluster
and Predict Model is the bonafide work of Mr. KAMAL SINGH who carried out
project work under my supervision. Certified further, that to the best of my
knowledge the work reported herein does not form part of any other project
report or dissertation on the basis of which a degree or award was conferred on
an earlier occasion on this or any other candidate.
Signature of Supervisor
Mr. Mukul Varshney
ii
ABSTRACT
Over the past decade humans have experienced exponential growth in the use
of online resources, in particular social media and microblogging websites such
as Facebook, Twitter, YouTube and also mobile applications such as WhatsApp,
Line, etc. Many companies have identified these resources as a rich mine of
marketing knowledge. This knowledge provides valuable feedback which allows
them to further develop the next generation of their product. In this report
sentiment analysis about apple product have been performed by extracting
tweets about that product and classifying the tweets showing it as positive and
negative feedback for apple product. We propose a hybrid approach which uses
k medoid clustering to form the clusters and uses a supervised learning
technique known as CART method to make the predictions on those clusters.
iii
ACKNOWLEDGEMENT
I take this opportunity to acknowledge to Mr Mukul Varshney, my project guide
whose valuable inputs helped us to complete this report.
With profound sense of gratitude and sincere thanks to Prof. Ishan Ranjan
(Head of the Department), Department of Computer Science and Engineering,
Sharda University, Greater Noida, U.P., INDIA. It was very inspiring and
knowledgeable for me to work with enlightened and disciplined personality.
I also want to express sincere thanks to Dr. Manoj Kumar Gupta (Program
Coordinator) for his continuing sincere helps and supports to complete this
report. Last but not the least, I wish to thank my friends for their continuous
support.
KAMAL SINGH
iv
LIST OF TABLES
Table 2.1 Performance of lexical approach variants
16
17
Table 2.3
19
34
LIST OF FIGURES
12
14
23
32
33
34
TABLE OF CONTENTS
Declaration
Certificate
ii
Abstract
iii
Acknowledgement
iv
List of Tables
List of Figures
CHAPTER 1: INTRODUCTION
1.1 Background
1.2 Objective
10
12
13
14
14
15
16
17
vi
CHAPTER 3: METHODOLOGY
3.1 R Studio
20
20
20
20
21
22
22
22
23
24
25
3.9 Evaluation
29
CHAPTER 4: Experiment
4.1 Data Sets
30
30
CHAPTER 5: Conclusion
5.1 Conclusion
33
37
References
38
vii
viii