Professional Documents
Culture Documents
+ It would be helpful to have some insight for the users to know about his
behavior, how he conducts himself and how people perceive him online.
Objective
Building a platform which tries to give some insights about the user
online social behavior and user can easily create his gig and make it
available online.
Our Platform can perform analysis of his social media history (right
now only twitter data is being analyzed) and give them some insights
such as their personality, his interests based on his tweets, who he
follows, sentiment analysis of his tweets.
Front-End
+ We used handlebars for front-end of
application.
+ Handlebars is a templating engine for
web applications.
+ We used material UI and bootstrap
classes for making the interface of the
application.
+ We used canvas.js for creating the
charts.
Tweepy
Interest
Analysis EXTRACTING DATA PRE-PROCESSING
Remove stop
Input Text Lower Text Remove URLS Tokenize
words
Remove spaces,
punctuation, Remove single Return final list
Lemmatise
numbers, nouns character token token
and emails
Personality Analysis
EXTRACTING RETWEET TF-IDF VECTORIZER TO EXTRACT
PRE-PROCESSING
TEXT FEATURES
Algolia
and Stripe Stripe -> Stripe is a payment service
provider that accepts credit cards, digital
wallets and many other payment
methods.
Results
+ Interest Classifier –> Used Standford label
Political, Entertainment, Environment, Sports
and Technology labelled tweets as train and
test data and running Multinomial Naïve
Bayes Classifier gave 89.7% accuracy
+ Sentiment Classifier -> Used Stanford label
Positive, Negative tweets as train and test
data, running Multinomial Naïve Bayes
Classifier gave 80.09% accuracy
+ Personality Classifier -> An open Dataset
from Kaggle for IE, PJ, SN, TF csv, running
Gaussian Naïve Classifier gave 86.2%
accuracy.
Future Scope
A better LSTM recurrent neural
We need more social media data,
network model can give
to get more accurate results we Separating all the services to
way results and can be trained as
can perform data mining on make it as a microservice to scale
the new data comes which can
Facebook, reddit and other social the platform operation.
make it more resilient and much
media Platforms.
fast.