
Problem Statement

• To analyze women's safety based on tweets.


• Twitter is a platform widely used by people to express their
opinions and display sentiments on different occasions.
Preprocessing of Tweets

• Preprocessing of the data is a very important step, as
it determines the efficiency of the steps that follow.
• Steps involved in preprocessing of tweets:
• Removal of retweets.
• Conversion of uppercase to lowercase.
• Stop word removal.
• Twitter Feature removal.
• Special character and digit removal.
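The preprocessing steps listed above could be sketched in Python roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `preprocess`, the tiny `STOP_WORDS` set, and the rule that a retweet starts with "RT @" are all assumptions made for the example. Stop words are removed last here so that they are matched against already cleaned tokens.

```python
import re

# Hypothetical, deliberately tiny stop-word list (assumption for this sketch);
# a real list would be much larger.
STOP_WORDS = {"a", "an", "the", "is", "are", "in", "on", "at", "to", "of", "and", "for", "i"}

def preprocess(tweets):
    """Return one list of cleaned tokens per non-retweet tweet."""
    cleaned = []
    for text in tweets:
        # Removal of retweets: assume a retweet starts with "RT @".
        if text.startswith("RT @"):
            continue
        # Conversion of uppercase to lowercase.
        text = text.lower()
        # Twitter feature removal: URLs, @mentions, and the '#' of hashtags.
        text = re.sub(r"https?://\S+", " ", text)
        text = re.sub(r"@\w+", " ", text)
        text = text.replace("#", " ")
        # Special character and digit removal: keep only letters and whitespace.
        text = re.sub(r"[^a-z\s]", " ", text)
        # Stop word removal on the whitespace-separated tokens.
        tokens = [t for t in text.split() if t not in STOP_WORDS]
        cleaned.append(tokens)
    return cleaned
```

Calling `preprocess` on a list of raw tweet strings returns a list of token lists, with retweets dropped entirely.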
Feature extraction

• Feature extraction is the selection of useful words from
tweets.
• Unigram feature: one word is considered at a time, and it is
decided whether that word is capable of being a feature.
• N-gram feature: more than one word is considered at a
time.
• External lexicon: use of a list of words with predefined
positive and negative expressions.
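The three kinds of features above can be illustrated with a short Python sketch. It assumes tweets have already been split into lowercase tokens. The helper names `ngrams` and `lexicon_score` and the small `POSITIVE` and `NEGATIVE` word sets are hypothetical and stand in for a real external lexicon. The "positive count minus negative count" score is one simple way to use such a lexicon, not necessarily the one the authors used.

```python
def ngrams(tokens, n):
    """Return all contiguous n-word sequences; n=1 gives unigrams."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

# Hypothetical miniature lexicon (assumption for this sketch).
POSITIVE = {"safe", "secure", "happy"}
NEGATIVE = {"unsafe", "harassment", "afraid"}

def lexicon_score(tokens):
    """Count positive words minus negative words in a token list."""
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    return pos - neg
```

For example, `ngrams(tokens, 1)` yields the unigram candidates and `ngrams(tokens, 2)` the two-word candidates, while a negative `lexicon_score` suggests the tweet leans toward negative expressions.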
By
T.Reeshi 245116733146
K.Pradeep 245116733150
B.Rohith 245116733152
