I would also like to thank all my teammates for the work and
effort they put into this project; without them, it would not
have been a success.
2. Abstract
Social networking sites such as Facebook, Twitter, and
Instagram have become key sources of information.
By extracting and analyzing data from these sites, a business
can benefit in its product marketing. Twitter is one of the most
popular sites where people express their feelings about, and
reviews of, a particular product. In our work, we use Twitter
data to analyze public views towards a product. In this project,
we collect tweets from Twitter and use a set of predefined words,
each mapped to the emotion it expresses. We then analyze each
tweet with our NLP-based program and plot the sentiment
analysis graph from the results.
3. Introduction
Twitter allows businesses to engage personally with consumers.
However, there’s so much data on Twitter that it can be hard for
brands to prioritize which tweets or mentions to respond to first.
In this guide, learn how you can use sentiment analysis tools to
listen to your customers on Twitter, and follow our tutorial on
how to perform sentiment analysis in just a few simple steps.
4. Basic Concepts
4.1 Cleaning the Text:
Cleaning is done in two main steps: first, we convert all
characters to lowercase, and second, we remove unwanted
characters such as punctuation.
Lowercasing is needed because, in NLP, words like “Data” and
“data” are treated as different tokens by the computer.
Unwanted characters such as !#@$%^& do not convey
any sentiment and hence are removed.
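The two cleaning steps above can be sketched in a few lines of Python. This is a minimal illustration, not the project's actual code: the function name `clean_text` is hypothetical, and it assumes that "unwanted characters" means the ASCII punctuation listed in the standard library's `string.punctuation`.

```python
import string


def clean_text(text):
    """Lowercase the text and delete ASCII punctuation characters."""
    lowered = text.lower()
    # The third argument of str.maketrans lists characters to delete,
    # so translate() removes every punctuation mark in a single pass.
    return lowered.translate(str.maketrans("", "", string.punctuation))


print(clean_text("I LOVE this Data!!! #great @product"))
# i love this data great product
```

Note that `string.punctuation` covers only ASCII symbols, so typographic quotes or emoji would need separate handling.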
4.2 Tokenization:
The sentence is split into words, which are stored
in a word list.
Stop words such as I, am, with, me, and myself are then
removed from this word list (as these words do not
add any sentiment to a sentence) to form the
final word list.
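As a rough sketch of this step, the snippet below splits an already cleaned sentence on whitespace and filters out stop words. The names `tokenize`, `remove_stop_words`, and `STOP_WORDS` are hypothetical, and the stop-word set is a small illustrative subset rather than the list used in the project.

```python
# Illustrative subset only; a real stop-word list would be much longer.
STOP_WORDS = {"i", "am", "with", "me", "myself", "the", "a", "is"}


def tokenize(cleaned_text):
    """Split cleaned (lowercase, punctuation-free) text on whitespace."""
    return cleaned_text.split()


def remove_stop_words(words, stop_words=STOP_WORDS):
    """Keep only the words that are not in the stop-word set."""
    return [word for word in words if word not in stop_words]


words = tokenize("i am happy with the new phone")
final_words = remove_stop_words(words)
print(final_words)  # ['happy', 'new', 'phone']
```

A set is used for the stop words so that each membership check takes constant time, which matters once thousands of tweets are processed.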
5.2 Project Analysis:
1. Sorting Data at Scale:
Manually sorting through thousands of tweets, customer
support conversations, or surveys would be just too much
business data to process. Sentiment analysis helps
businesses process huge amounts of unstructured data in
an efficient and cost-effective way.
2. Real-Time Analysis:
Sentiment analysis can identify critical issues in real-time,
for example:
Is an election crisis on social media escalating?
Is an angry customer about to churn?
Sentiment analysis models can help us immediately
identify these kinds of situations, so that we can take
action right away.
3. Consistent criteria:
It’s estimated that people agree only around 60-65% of the
time when determining the sentiment of a particular text.
Tagging text by sentiment is highly subjective, influenced
by personal experiences, thoughts, and beliefs. An automated
model applies the same criteria to every text, which makes
the results more consistent.
6. Implementation
Methodology
To achieve this, we follow the steps below.
Testing
First, for testing purposes, I created two text files:
emotions.txt and read.txt. read.txt contains random text
or paragraphs for testing, and emotions.txt contains the
keywords for emotions that a person can feel, such as sad,
angry, happy, and bored, which our model is meant to
identify.
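The report does not show the layout of emotions.txt, so the sketch below assumes one `word: emotion` pair per line. Both that format and the function name `load_emotion_lexicon` are assumptions made for illustration.

```python
def load_emotion_lexicon(lines):
    """Parse 'word: emotion' lines into a dict, skipping blank or malformed lines."""
    lexicon = {}
    for line in lines:
        line = line.strip()
        if not line or ":" not in line:
            continue  # ignore empty lines and lines without a separator
        word, emotion = line.split(":", 1)
        lexicon[word.strip().lower()] = emotion.strip().lower()
    return lexicon


# In-memory stand-in for the file contents (assumed format).
sample_lines = ["cheerful: happy", "furious: angry", "", "gloomy: sad"]
lexicon = load_emotion_lexicon(sample_lines)
print(lexicon["furious"])  # angry
```

Because the function accepts any iterable of lines, an open file object can be passed in directly, for example `load_emotion_lexicon(open("emotions.txt", encoding="utf-8"))`.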
After we run our algorithm over a set of statements, we get
a graph as output that classifies the words present in each
statement by emotion. From it we can tell whether a statement
contains more negative or more positive words, and use that
to predict the sentiment of the person.
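The counting behind such a graph might look like the sketch below. It is not the project's implementation: the lexicon entries, the grouping of emotions into `POSITIVE` and `NEGATIVE`, and the function names are all hypothetical choices made for the example.

```python
from collections import Counter

# Hypothetical lexicon and polarity groups, for illustration only.
LEXICON = {"cheerful": "happy", "delighted": "happy",
           "furious": "angry", "gloomy": "sad"}
POSITIVE = {"happy"}
NEGATIVE = {"angry", "sad"}


def count_emotions(words, lexicon=LEXICON):
    """Count how often each emotion appears among the given words."""
    return Counter(lexicon[word] for word in words if word in lexicon)


def overall_sentiment(counts):
    """Compare the totals of positive and negative emotion words."""
    positive = sum(n for emotion, n in counts.items() if emotion in POSITIVE)
    negative = sum(n for emotion, n in counts.items() if emotion in NEGATIVE)
    if positive > negative:
        return "positive"
    if negative > positive:
        return "negative"
    return "neutral"


counts = count_emotions(["cheerful", "delighted", "phone", "gloomy"])
print(counts)                     # Counter({'happy': 2, 'sad': 1})
print(overall_sentiment(counts))  # positive
```

The resulting `Counter` maps each emotion to its frequency, so its keys and values can be handed to a plotting library to draw the bar chart.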
We first test on a small data set to verify that the model
works properly. The sentiments are classified and the graphical
output is presented.
INPUT:
OUTPUT:
Result Analysis
INPUT: A large data set.
OUTPUT:
As this example shows, after running our algorithm over
a statement, we can clearly see the classification of the words
present in it. The statement mostly contains happy
words, followed by an equal number of hatred and entitled words.
8. References
https://docs.python.org/3/
https://numpy.org/doc/
https://pandas.pydata.org/docs/
https://scikit-learn.org/
9. Individual Contribution