PROJECT SYNOPSIS
OF MINI PROJECT
BACHELOR OF TECHNOLOGY
CSE(DS)
SUBMITTED BY GUIDED BY
SARANSH SHARMA Ms. Mimansha Singh
2000321540052 Assistant Professor
DEEP SHARAN
2000321540025
AFFILIATED TO
DR. A.P.J. ABDUL KALAM TECHNICAL
UNIVERSITY U.P., LUCKNOW
STUDENT’S DECLARATION
I/we hereby declare that the work being presented in this report entitled “Twitter
Sentimental Analysis” is an authentic record of my/our own work carried out under the
supervision of Mr. Prabhat Singh, Assistant Professor, CSE-DS. The matter embodied
in this report has not been submitted by us for the award of any other degree.
Date:
This is to certify that the above statement made by the candidate(s) is correct to the best
of my knowledge.
Date: CSE-DS
ACKNOWLEDGEMENT
We would like to convey our sincere thanks to Ms. Mimansha Singh for providing
motivation, knowledge and support throughout the course of the project. Her continuous
support helped us complete the project successfully, and the knowledge she shared was
very useful to us.
We would also like to give special thanks to the Department of Information and Technology
for its continuous support and for the opportunities it gave us to carry out our project.
We would also like to extend our sincere gratitude to Mr. Prabhat Singh, Head
of Department, CSE(DS), for providing this opportunity to us.
2000321540052 2000321540025
ABSTRACT
CHAPTER 1
INTRODUCTION
Microblogging sites have become a vast source of data that analysts
can draw on, because a large share of people today use such
platforms to express their views on a wide range of topics. In effect,
anyone with access to these sites has a place to speak freely: people
from all over the world can talk, comment, and share their thoughts on
any subject of their choice in real time. These posts consist largely
of complaints or expressions of appreciation about whatever issue the
author chooses. Organisations benefit from getting a candid assessment
of their business or product, which helps them understand consumer
demand and the changes needed to provide better goods in the future.
Applying sentiment analysis to these microblogging sites could
therefore benefit a variety of organisations, both public and
private. Sentiment analysis, also known as opinion mining, is an
effective tool for examining websites where individuals publish their
opinions on a topic of interest. With this type of analysis,
businesses can learn what consumers think about a specific entity or
product by analysing their comments, tweets, or reviews.
CHAPTER 2
RELATED WORK
To do the sentiment analysis of twitter data using python and find the
pandemic [9]
To find the frequency of each word and do the sentiment analysis of the
In existing projects, the words with positive or negative polarity are obtained,
whereas in our project we obtain the polarity of the overall data set.
Existing projects do not specify which machine learning model is best
for sentiment analysis; our project will determine that as well.
CHAPTER 3
PROJECT OBJECTIVE
negative or neutral.
This project will analyze different algorithms and find the one with the best
accuracy.
CHAPTER 4
PROPOSED METHODOLOGY
Step 1: Identify the hashtags that were popular on Twitter in India during the pandemic.
Tweets under those hashtags are extracted through the Twitter API using the Tweepy library.
Step 2: Preprocess the dataset. This involves the following steps:
Removal of hashtags.
Removal of links, GIFs, emoji, images and special characters.
Removal of stop words.
Removal of non-English words.
Lemmatization.
Step 3: Analyze the polarity of the dataset.
Step 4: Feed the output of Step 3 into different machine learning algorithms and analyze
the results to find the algorithm with the best accuracy.
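The cleaning in Step 2 can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: it uses only Python's standard `re` module, the `STOP_WORDS` set is a tiny hypothetical stand-in for a full stop-word list such as the one shipped with NLTK, and the function name `preprocess` is an assumption. Removal of non-English words and lemmatization are left out because they need external language resources.

```python
import re
from typing import List

# Tiny illustrative stop-word set (an assumption, not the project's list).
STOP_WORDS = {"a", "an", "the", "is", "are", "to", "of", "in", "and", "for", "on", "at", "this"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")   # links
HASHTAG_RE = re.compile(r"#\w+")                # hashtags such as #lockdown
NON_LETTER_RE = re.compile(r"[^a-z\s]")         # anything except ASCII letters and whitespace


def preprocess(tweet: str) -> List[str]:
    """Return the cleaned, lower-cased tokens of a single tweet."""
    text = tweet.lower()
    text = URL_RE.sub(" ", text)        # remove links first so their '#' fragments go too
    text = HASHTAG_RE.sub(" ", text)    # remove hashtags
    text = NON_LETTER_RE.sub(" ", text) # drops emoji, digits, punctuation, special characters
    return [token for token in text.split() if token not in STOP_WORDS]


if __name__ == "__main__":
    print(preprocess("Stay safe at home! #lockdown https://t.co/abc"))
```

Links are stripped before hashtags so that a `#` inside a URL is not treated as a hashtag, and stop words are filtered last, once the text has been reduced to plain words.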
Pre-processing of the data to remove special characters, punctuation, stop words and images
Applying the algorithms to find which fits best for performing sentiment analysis
Results
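Steps 3 and 4 can be illustrated with a small sketch under stated assumptions. It assumes that each tweet already has a polarity score in the range [-1, 1] from whichever scorer is used, and that each candidate model has produced predicted labels for the same set of tweets. The names `label_from_polarity`, `accuracy` and `best_model`, and the zero threshold, are hypothetical choices for illustration only.

```python
from typing import Dict, List


def label_from_polarity(score: float, threshold: float = 0.0) -> str:
    """Map a polarity score in [-1, 1] to positive, negative or neutral."""
    if score > threshold:
        return "positive"
    if score < -threshold:
        return "negative"
    return "neutral"


def accuracy(predicted: List[str], actual: List[str]) -> float:
    """Fraction of predictions that match the reference labels."""
    if not actual or len(predicted) != len(actual):
        raise ValueError("predicted and actual must be non-empty and of equal length")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


def best_model(predictions: Dict[str, List[str]], actual: List[str]) -> str:
    """Return the name of the model whose predictions have the highest accuracy."""
    return max(predictions, key=lambda name: accuracy(predictions[name], actual))


if __name__ == "__main__":
    # Toy labels for illustration only; these are not results from the project.
    actual = ["positive", "negative", "neutral", "positive"]
    predictions = {
        "model_a": ["positive", "negative", "neutral", "negative"],
        "model_b": ["neutral", "negative", "positive", "negative"],
    }
    print(best_model(predictions, actual))
```

Raising the threshold above zero widens the band of scores treated as neutral, which may be worth tuning if many tweets score close to zero.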
CHAPTER 5
python. The Python library NumPy is used for numerical computation and pandas is
used for data manipulation. The Natural Language Toolkit (NLTK) is used for
preprocessing the dataset. The TextBlob library is used for spelling checks and
CHAPTER 6
The result of analyzing the tweets is shown in Fig. 3.
Fig. 3 shows that 46% of the total tweets are neutral, about 36.5% of the tweets are positive
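Percentages of this kind are shares of the per-tweet sentiment labels. A minimal sketch of that computation is given here, assuming the labels are available as a plain list of strings; the function name `label_shares` is hypothetical, and the sample labels in the usage lines are toy values, not the project's data.

```python
from collections import Counter
from typing import Dict, List


def label_shares(labels: List[str]) -> Dict[str, float]:
    """Return the percentage of tweets carrying each sentiment label."""
    total = len(labels)
    if total == 0:
        return {}
    counts = Counter(labels)
    return {label: 100.0 * count / total for label, count in counts.items()}


if __name__ == "__main__":
    # Toy labels for illustration only.
    sample = ["neutral", "neutral", "positive", "negative"]
    for label, share in label_shares(sample).items():
        print(f"{label}: {share:.1f}%")
```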
CHAPTER 7
The project will give the overall polarity score of Tweets and will find which is the
From the analyses of the tweets, we observe that most of the people feel neutral,
positive or negative.
platforms Instagram, Facebook, etc. and also try to further classify the sentiments.
REFERENCES
[1] Medford, R. J., Saleh, S. N., Sumarsono, A., Perl, T. M., & Lehmann, C. U. (2020). An
"Infodemic": Leveraging High-Volume Twitter Data to Understand Public Sentiment
Outbreak. medRxiv.
[2] Rajput, N. K., Grover, B. A., & Rathi, V. K. (2020). Word frequency and sentiment
analysis of Twitter messages. arXiv preprint arXiv:2004.03925.
[3] Samuel, J., Ali, G. G., Rahman, M., Esawi, E., & Samuel, Y. (2020). Covid-19 public
sentiment insights and machine learning for tweets classification. Information, 11(6), 314.
[4] Kumar, A., Khan, S. U., & Kalra, A. (2020): a sentiment analysis. European
Heart Journal.
[5] Ahuja, S., & Dubey, G. (2017, August). Sentiment analysis on Twitter data. In 2017
2nd International Conference on Telecommunication and Networks (TEL-NET) (pp. 1-5).
IEEE.
[6] Suman, C., Saha, S., Bhattacharyya, P., & Chaudhari, R. S. (2020). Emoji Helps! A
Multi-modal Siamese Architecture for Tweet User Verification. Cognitive Computation,
1-16.