Data Analytics Project

This document contains code to extract data from the Facebook and Twitter APIs and to build predictive models for diabetes classification. It includes code to:
1. Extract comments from a public Facebook page post using the Graph API.
2. Extract the most recent tweets matching a keyword search using the Twitter API.
3. Build logistic regression, SVM, random forest, and decision tree models on a diabetes dataset to classify patients and compare their performance.

Uploaded by vishal.gahlot14

Exercise 1: Extract Facebook data available on any public page, such as Amazon.

Code:
import requests

access_token = 'EAADraRiwnasBALJgEL4vbyvv2DTJvAYjBlLfk1iO0xgL56Vf70mE1MYlvdv2A5RupQZBOctpcE8Qdu1COESmobBxTwC6DFTOrbaXCRWcBzsZB6wlZBuzFSx5AgvXZAfLnp9etZBBTHwCL9U5klw4Q9sBFpmfVAEiJCZBFMD2CXCXyS5sPepoEqCDfY32DeUUoZD'

post_id = "1973749942700563"

URL = 'https://graph.facebook.com/v3.2/' + post_id + '/comments'

PARAMS = {'access_token': access_token}

# send the GET request and save the response as a response object
r = requests.get(url=URL, params=PARAMS)

# parse the response body as JSON
data = r.json()

# print the id, creation time, and text of each comment on the post
for comment in data['data']:
    print("----------------------------------------------------------\n\n")
    print("id:", comment['id'], "\n", "created_time:", comment['created_time'], "\n", "message:", comment['message'])
    print("----------------------------------------------------------\n\n")
Output:
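
The request above returns only the first page of comments. Graph API responses for edges such as /comments generally carry a `paging` object whose `next` field, when present, holds the URL of the following page. A minimal sketch of following those links, assuming that response shape: `iter_comments` and `fetch_json` are hypothetical helper names (not part of `requests` or the Graph API), and the canned pages in the usage lines are made-up data standing in for live responses.

```python
def iter_comments(first_url, params, fetch_json):
    """Yield every comment dict, following 'paging.next' links.

    fetch_json(url, params) is a hypothetical callable that returns the
    decoded JSON body, for example:
        lambda u, p: requests.get(u, params=p).json()
    """
    url, query = first_url, params
    while url:
        page = fetch_json(url, query)
        for comment in page.get('data', []):
            yield comment
        # Assumption: the 'next' URL already embeds the token and cursor,
        # so no extra query parameters are sent with it.
        url = page.get('paging', {}).get('next')
        query = None


# Usage with canned pages instead of a live request (illustrative data only).
fake_pages = {
    'page1': {'data': [{'id': '1'}, {'id': '2'}], 'paging': {'next': 'page2'}},
    'page2': {'data': [{'id': '3'}]},
}
comment_ids = [c['id'] for c in iter_comments('page1', {}, lambda u, p: fake_pages[u])]
print(comment_ids)
```

Passing the fetch function in as an argument keeps the paging logic separate from the network call, so the loop can be checked without a valid access token.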
Exercise 2: Extract the 1000 latest tweets from Twitter using any keyword.

Code:
from twitter import Twitter, OAuth

ACCESS_TOKEN = '1064460892694241281-yHNHebYDMQgaoEjLD8BrcyVpDzIeGf'
ACCESS_SECRET = 'DQFUjh3TklipgH9dN6cGIlCW6KPXok2Q3oiN6HNJARxRM'
CONSUMER_KEY = '4mGaUsqkD2EyHkagZpHKOpBXF'
CONSUMER_SECRET = '5xI9anpq1O0F5CT5Gj5TC9tzz3s4pDfjxmtLIse88clK9E4REy'

oauth = OAuth(ACCESS_TOKEN, ACCESS_SECRET, CONSUMER_KEY, CONSUMER_SECRET)
twitter = Twitter(auth=oauth)

# search for recent English-language tweets matching the keyword
twt = twitter.search.tweets(q='machine learning', result_type='recent', lang='en', count=5)

# print a running count, the id, and the text of each returned tweet
for i, tweet in enumerate(twt['statuses']):
    print("Tweet_count: ", i)
    print("id:", tweet['id'], "\n", "text:", tweet['text'], "\n\n")
Output:
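
The call above asks for five tweets, while the exercise asks for 1000. The v1.1 search endpoint that this code targets caps `count` at 100 per request, so reaching a larger total means repeating the request with `max_id` set just below the smallest tweet id seen so far. A sketch of that loop, assuming `search` is a callable with the same keyword interface as `twitter.search.tweets`: `collect_tweets` and `fake_search` are hypothetical names, and the fake data stands in for live results.

```python
def collect_tweets(search, query, total=1000, page_size=100):
    """Gather up to `total` tweets by paging backwards with max_id."""
    tweets = []
    max_id = None
    while len(tweets) < total:
        kwargs = {'q': query, 'result_type': 'recent', 'lang': 'en',
                  'count': min(page_size, total - len(tweets))}
        if max_id is not None:
            kwargs['max_id'] = max_id
        statuses = search(**kwargs)['statuses']
        if not statuses:  # no older tweets are available
            break
        tweets.extend(statuses)
        # Next page: only tweets strictly older than the oldest one seen.
        max_id = min(t['id'] for t in statuses) - 1
    return tweets


# Stand-in for twitter.search.tweets: serves ids 250..1 in descending order.
def fake_search(q, count, max_id=None, **_ignored):
    newest = 250 if max_id is None else min(max_id, 250)
    ids = range(newest, max(newest - count, 0), -1)
    return {'statuses': [{'id': i, 'text': 'tweet %d' % i} for i in ids]}


result = collect_tweets(fake_search, 'machine learning', total=120, page_size=50)
print(len(result), result[0]['id'], result[-1]['id'])
```

With a live client the call would be `collect_tweets(twitter.search.tweets, 'machine learning')`. The loop stops early when the service has fewer matching tweets than requested.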
Exercise 3: Design a predictive model for diabetes on the given dataset of 535 patients using the following machine learning techniques:
1. Logistic Regression
2. SVM
3. Random Forest
4. Decision Tree

Code:
from sklearn import svm
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression

import numpy

# load the dataset, skipping the header row
A = numpy.loadtxt("data.csv", delimiter=",", skiprows=1)

# the first nine columns are features; columns from index 9 onward hold the target
X_features = A[:, :9]
y_targets = A[:, 9:]

# hold out 40% of the rows for testing
X_train, X_test, y_train, y_test = train_test_split(X_features, y_targets, test_size=0.4, random_state=0)

print("Support Vector Machine:")

svm_model = svm.SVC(kernel='linear', C=1).fit(X_train, y_train.ravel())
print("Score: ", svm_model.score(X_test, y_test.ravel()))

print("-----------------------------------------------------------------------")
print("Decision Tree:")
# try odd depths from 1 to 99 and keep the depth with the best test accuracy
max_score = ()
max_val = 0
for i in range(1, 100, 2):
    dtree_model = DecisionTreeClassifier(max_depth=i).fit(X_train, y_train.ravel())
    curr_score = dtree_model.score(X_test, y_test.ravel())
    if max_val < curr_score:
        max_val = curr_score
        max_score = (i, curr_score)
print("max_score: ", max_score[1], "max_depth: ", max_score[0])

print("-----------------------------------------------------------------------")

print("Random Forest:")

rf_model = RandomForestClassifier(n_estimators=100, n_jobs=-1).fit(X_train, y_train.ravel())
rf_score = rf_model.score(X_test, y_test.ravel())
print("Score: ", rf_score)

print("-----------------------------------------------------------------------")

print("Logistic Regression:")

# the L1 penalty needs a solver that supports it, such as liblinear
lr_model = LogisticRegression(penalty='l1', solver='liblinear', C=50).fit(X_train, y_train.ravel())

print("Score: ", lr_model.score(X_test, y_test.ravel()))

Output:
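
Each `score` call above reports mean accuracy, that is, the fraction of test rows whose predicted label equals the true label. Accuracy alone does not show which kind of error a model makes, so a confusion count is a common companion. A plain-Python sketch of both, assuming labels coded as 0 (no diabetes) and 1 (diabetes): that coding, the helper names `accuracy` and `confusion_counts`, and the toy label lists are illustrative assumptions and are not taken from the dataset.

```python
def accuracy(y_true, y_pred):
    """Fraction of positions where the prediction matches the true label."""
    matches = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return matches / len(y_true)


def confusion_counts(y_true, y_pred, positive=1):
    """Return (true_pos, false_pos, false_neg, true_neg) for a binary task."""
    tp = fp = fn = tn = 0
    for t, p in zip(y_true, y_pred):
        if p == positive and t == positive:
            tp += 1
        elif p == positive:
            fp += 1
        elif t == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


# Toy labels for illustration only (not results from the diabetes data).
y_true = [1, 0, 0, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 0, 0, 1, 0, 0]
acc = accuracy(y_true, y_pred)
counts = confusion_counts(y_true, y_pred)
print(acc)     # 0.75
print(counts)  # (2, 1, 1, 4)
```

For a fitted model from the code above, the same helpers could be applied to `y_test.ravel()` and `model.predict(X_test)` to see how many patients with diabetes each classifier misses.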
