Welcome to Scribd!

Classification Accuracy in R

Uploaded by

0% found this document useful (0 votes)

53 views4 pages

- The document uses a decision tree model (rpart) to classify spam and non-spam emails based on features in a dataset. - It trains the decision tree model on 2/3 of the data and tests it on the remaining 1/3, calculating accuracy and other performance metrics. - It explores visualizing the decision trees, adjusting tree complexity, and measuring stability by building trees on different random train/test splits.

Original Description:

Original Title

Classification Accuracy in r

Copyright

Available Formats

DOC, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Attribution Non-Commercial (BY-NC)

Available Formats

Download as DOC, PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

53 views4 pages

Classification Accuracy in R

Uploaded by

chitrapriyan

Copyright:

Attribution Non-Commercial (BY-NC)

Available Formats

Download as DOC, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 4

Search inside document

# package for trees library(rpart) # package including data from Elements of Statistical Learning library(ElemStatLearn) data(spam) # make response

a 0-1 outcome #spam$spam = ifelse(spam$spam=="spam",1,0) spam.sub = c(1:nrow(spam))[spam$spam == 'spam'] nospam.sub = c(1:nrow(spam))[spam$spam == 'email'] # use 2/3 for training, 1/3 for test train.spam = sample(spam.sub,floor(length(spam.sub)*2/3)) train.email = sample(nospam.sub,floor(length(nospam.sub)*2/3)) train = c(train.spam,train.email) train.set = spam[train,] test.set = spam[-train,] rpart.spam = rpart(spam ~ ., data=train.set, method="class", parms=list(split="gini")) # take a look at the decision rule print(summary(rpart.spam)) png("spam_tree.png", height=600, width=900) # visualize it (gets difficult for bigger trees) post(rpart.spam, filename='') dev.off() # predict the labels for the test set predict.spam = predict(rpart.spam, test.set) plabels.spam = colnames(predict.spam)[apply(predict.spam, 1, which.max)] # compute the various measures of accuracy classification.summary = function(plabels, tlabels) { # true positives: things we labelled spam that are spam

TP = sum((plabels.spam == 'spam') * (tlabels == 'spam')) # false positives: things we labelled spam that are email FP = sum((plabels.spam == 'spam') * (tlabels == 'email')) # true negatives: things we labelled email that are email TN = sum((plabels.spam == 'email') * (tlabels == 'email')) # false negatives: things we labelled email that are spam FN = sum((plabels.spam == 'email') * (tlabels == 'spam')) # accuracy A = (TP+TN) / (TP+TN+FP+FN) # sensitivity sens = TP / (TP+FN) # specificity spec = TN / (TN+FP) # precision prec = TP / (TP+FN) # confusion matrix C = matrix(c(TP,FP,FN,TN),2,2) colnames(C) = c('predicted spam', 'predicted email') rownames(C) = c('truly spam', 'truly email') return(list(A=A,TP=TP,FP=FP,TN=TN,FN=FN,C=C,sens=sens,spec=spec)) } s = classification.summary(plabels.spam, test.set$spam) print(s)

png("spam_cptree.png", height=1200, width=800) # you can control some aspects of the tree building process # with rpart.control rpart.spam.deeper = rpart(spam ~ ., data=train.set, method="class", parms=list(split="gini"), control=rpart.control(cp=0.00001, xval=20)) post(rpart.spam, filename='')

dev.off() # let's look at the stability of the tree png("spam_repeat0.png", height=600, width=600)

train.spam = sample(spam.sub,floor(length(spam.sub)*2/3)) train.email = sample(nospam.sub,floor(length(nospam.sub)*2/3)) train = c(train.spam,train.email) train.set = spam[train,] test.set = spam[-train,] rpart.spam = rpart(spam ~ ., data=train.set, method="class", parms=list(split="gini")) post(rpart.spam, filename='') dev.off() png("spamROC.png", height=600, width=600) predict.spam = predict(rpart.spam, test.set) l = sort(unique(predict.spam[,'spam'])) sens = c() spec = c() for (ll in l) { plabels.spam = rep('email', nrow(predict.spam)) plabels.spam[(predict.spam[,'spam'] >= ll)] = 'spam' s = classification.summary(plabels.spam, test.set$spam) sens = c(sens, s$sens) spec = c(spec, s$spec) } sens = c(1,sens,0) spec = c(0,spec,1) plot(1-spec, sens, type='l', col='red', lwd=2) abline(0,1,lwd=2, lty=2, col='blue') dev.off()

Office365 Security
Document1,824 pages
Office365 Security
RomanAuslaender
100% (1)
Rsm321 Course Outline Fall 2020
Document11 pages
Rsm321 Course Outline Fall 2020
Michelle Liu
No ratings yet
Ass
Document5 pages
Ass
Taqwa Elsayed
No ratings yet
Asep Purnama - 140710180027 - Praktik LSTM
Document9 pages
Asep Purnama - 140710180027 - Praktik LSTM
Asep
No ratings yet
Import Numpy As NP
Document6 pages
Import Numpy As NP
Maciej Wiśniewski
No ratings yet
Boxplot With Outlier Label R
Document5 pages
Boxplot With Outlier Label R
knapiko
No ratings yet
AIML Lab - Ws10
Document9 pages
AIML Lab - Ws10
lucky one
No ratings yet
QLSTMvs LSTM
Document7 pages
QLSTMvs LSTM
mohamedaligharbi20
No ratings yet
Forex Algo
Document90 pages
Forex Algo
Santosh prajapati
No ratings yet
Predict
Document3 pages
Predict
Venkatesh W
No ratings yet
Aiml Ex 4-7
Document8 pages
Aiml Ex 4-7
Lakshmi Dheeba K
No ratings yet
Aiml Lab
Document14 pages
Aiml Lab
1DT19IS146Triveni
No ratings yet
CS229 Python & Numpy: Jingbo Yang, Zhihan Xiong
Document40 pages
CS229 Python & Numpy: Jingbo Yang, Zhihan Xiong
sid s
100% (1)
ID3
Document3 pages
ID3
Kavyashree
No ratings yet
Sample Code For Twitter Processing in R
Document1 page
Sample Code For Twitter Processing in R
vinodnerella
No ratings yet
Assignment # 3
Document8 pages
Assignment # 3
Bushi Balooch
No ratings yet
SLK Software Python Interview Questions
Document4 pages
SLK Software Python Interview Questions
Srinimf
No ratings yet
Cheat Sheet - Gnuplot2
Document1 page
Cheat Sheet - Gnuplot2
Ambar Shukla
No ratings yet
CS229 Section: Python Tutorial: Maya Srikanth
Document39 pages
CS229 Section: Python Tutorial: Maya Srikanth
Trần Thành Long
No ratings yet
Daftar Lampiran: Music Signal Analysis
Document7 pages
Daftar Lampiran: Music Signal Analysis
jeremi kucing
No ratings yet
Prelab GMM 1
Document5 pages
Prelab GMM 1
lamtayazaissat
No ratings yet
Train Py
Document4 pages
Train Py
Karim dabbabi
No ratings yet
All 1
Document3 pages
All 1
araz arta
No ratings yet
Training Code
Document27 pages
Training Code
The Mind
No ratings yet
Machine Learning
Document54 pages
Machine Learning
Jacob
No ratings yet
BB Signal
Document2 pages
BB Signal
sukabumi junior
No ratings yet
Assignment 1700480105
Document34 pages
Assignment 1700480105
sowmeya veeraraghavan
No ratings yet
Compound Interest
Document11 pages
Compound Interest
abcd9661595653
No ratings yet
Jaycolpdf 1
Document5 pages
Jaycolpdf 1
P Samyutha 22107849101
No ratings yet
Momentum
Document3 pages
Momentum
parag Wadhai
No ratings yet
B2 40 Practical 5A
Document6 pages
B2 40 Practical 5A
Alex
No ratings yet
20bce1499 Cse3020 Lab-2
Document9 pages
20bce1499 Cse3020 Lab-2
Venkataraman Balanand
No ratings yet
Mod11 Textmining
Document4 pages
Mod11 Textmining
Sandhya Kuppala
No ratings yet
Python Lab Manual
Document19 pages
Python Lab Manual
Rahul Yadav
No ratings yet
7 - 201904121342. Lampiran Skripsi
Document65 pages
7 - 201904121342. Lampiran Skripsi
ilfisyafa
No ratings yet
A Short List of The Most Useful R Commands
Document11 pages
A Short List of The Most Useful R Commands
cristiansolomon1754
No ratings yet
Assignment 10 2
Document4 pages
Assignment 10 2
dash
No ratings yet
Tuple in Python
Document36 pages
Tuple in Python
Manan kansara
No ratings yet
Shoulda Matcher Reference
Document9 pages
Shoulda Matcher Reference
Mike Blyth
No ratings yet
Precision and Recall
Document13 pages
Precision and Recall
ssigold
No ratings yet
Cardio Screen RF
Document27 pages
Cardio Screen RF
The Mind
100% (1)
Python - 1 Year - Unit-3
Document72 pages
Python - 1 Year - Unit-3
sana
No ratings yet
Maximum Likelihood Estimate
Document3 pages
Maximum Likelihood Estimate
Xiao Xue
No ratings yet
Python 3 Functions and OOPs FP
Document10 pages
Python 3 Functions and OOPs FP
Priya Satheesh
No ratings yet
R语言基础入门指令 (tips)
Document14 pages
R语言基础入门指令 (tips)
s2000152
No ratings yet
Atomlib
Document7 pages
Atomlib
kandorp
No ratings yet
C121 Exp1
Document32 pages
C121 Exp1
Devanshu Maheshwari
No ratings yet
1.1 Objective: 2. Data Preparation and Exploratory Analysis
Document11 pages
1.1 Objective: 2. Data Preparation and Exploratory Analysis
k767
No ratings yet
PHP Unit-4
Document11 pages
PHP Unit-4
Nidhi Bhati
No ratings yet
Ajp 19 Code
Document3 pages
Ajp 19 Code
04 Tushar Bhadrike
No ratings yet
Introduction To Plyr: Garrett Grolemund
Document49 pages
Introduction To Plyr: Garrett Grolemund
Rose Widanti
No ratings yet
Pattern Recognition
Document26 pages
Pattern Recognition
Aryan Attri
No ratings yet
Crypto Slaya
Document12 pages
Crypto Slaya
Yudistira Waskito
No ratings yet
Text Mining KNN
Document2 pages
Text Mining KNN
vedavarshni
No ratings yet
Sorting HOW TO: Guido Van Rossum Fred L. Drake, JR., Editor
Document6 pages
Sorting HOW TO: Guido Van Rossum Fred L. Drake, JR., Editor
dalequenosvamos
100% (1)
22MCA1008 - Varun ML LAB ASSIGNMENTS
Document41 pages
22MCA1008 - Varun ML LAB ASSIGNMENTS
S Varun (RA1931241020133)
100% (1)
Spam Email Classification 3
Document8 pages
Spam Email Classification 3
Austin Kinion
No ratings yet
Exe 1
Document13 pages
Exe 1
jaya pandi
No ratings yet
Ba Notes
Document19 pages
Ba Notes
ANJALI SAHU
No ratings yet
Py Rar Crack
Document5 pages
Py Rar Crack
Mian Muhammad Muaz
No ratings yet
Is Lab Aman Agarwal PDF
Document8 pages
Is Lab Aman Agarwal PDF
Aman Bansal
No ratings yet
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Cyber Security - CS-503 (C) - Class Notes - 1563265709
Document29 pages
Cyber Security - CS-503 (C) - Class Notes - 1563265709
Anshul Aliwal
No ratings yet
CDDE Vendor Services Manual - 20180531 Single Sided Printing PDF
Document22 pages
CDDE Vendor Services Manual - 20180531 Single Sided Printing PDF
AhmadRadzali
No ratings yet
Understanding Social Engineering Based Scams PDF
Document135 pages
Understanding Social Engineering Based Scams PDF
yiho
No ratings yet
Principles of Information Systems, Thirteenth Edition: Cybercrime and Information System Security
Document48 pages
Principles of Information Systems, Thirteenth Edition: Cybercrime and Information System Security
Perfect Darkside
No ratings yet
23-27 April 2021 Daily Global Regional Local Rice E-Newsletter (Un-Edited Version)
Document411 pages
23-27 April 2021 Daily Global Regional Local Rice E-Newsletter (Un-Edited Version)
Mujahid Ali
No ratings yet
IP SPOOFING Documentation
Document18 pages
IP SPOOFING Documentation
Ancy Anas
No ratings yet
ProbabilisticLearning Bayesian
Document11 pages
ProbabilisticLearning Bayesian
suryansh
No ratings yet
Authorship Analysis:: Athira U
Document22 pages
Authorship Analysis:: Athira U
Tom Miralrio
No ratings yet
Top Grading Snapshot Users Manual
Document31 pages
Top Grading Snapshot Users Manual
Jessica Oakridge
No ratings yet
122 14211291439 13 PDF
Document5 pages
122 14211291439 13 PDF
Nancy Pareta
No ratings yet
Ip Notes XI
Document129 pages
Ip Notes XI
shivam jain
No ratings yet
E-Mail Work Book
Document45 pages
E-Mail Work Book
J.Gopala Krishna
No ratings yet
Read Me First: The ICDL® Qualification
Document76 pages
Read Me First: The ICDL® Qualification
Loïc JEAN-CHARLES
No ratings yet
Cisco Ironport Email Security Appliances
Document5 pages
Cisco Ironport Email Security Appliances
abebaw abex
No ratings yet
E-Mail Spam Detection Using Machine Learning KNN
Document5 pages
E-Mail Spam Detection Using Machine Learning KNN
mittakola shivaram
No ratings yet
Check Spam PDF
Document14 pages
Check Spam PDF
pavithra
No ratings yet
Setting Up External Mail Servers For G Suite: Part Three
Document14 pages
Setting Up External Mail Servers For G Suite: Part Three
Cristian G. Ciocoi
No ratings yet
Chapter 6 PDF
Document34 pages
Chapter 6 PDF
STEVE JHONSON Lepasana
No ratings yet
NSE1 Lesson Scripts-En
Document11 pages
NSE1 Lesson Scripts-En
zicoctgbd
No ratings yet
RCSbetfair Whitepaper
Document10 pages
RCSbetfair Whitepaper
Vishwjeet Kumar Choudhary
No ratings yet
AI Based E-Mail Scraper and Sending Tool
Document9 pages
AI Based E-Mail Scraper and Sending Tool
IJRASETPublications
No ratings yet
What Is Exchange Server
Document16 pages
What Is Exchange Server
kmrs_888
No ratings yet
Empowerment Technologies Module 2
Document28 pages
Empowerment Technologies Module 2
Anthony Gelig Montemayor
No ratings yet
Free Anti-Spam Policy: Cover
Document5 pages
Free Anti-Spam Policy: Cover
de_ghaywats
No ratings yet
Acceptable-Use Policy
Document12 pages
Acceptable-Use Policy
Vic Logic
No ratings yet
The Evolution of Cybersecurity - Secure Mail Gateway Quiz
Document2 pages
The Evolution of Cybersecurity - Secure Mail Gateway Quiz
chhun
25% (4)
FortiGate Antivirus Firewalls
Document56 pages
FortiGate Antivirus Firewalls
netmotshop
No ratings yet
Becoming A Problem Solving Genius 01-Sep-2022 20-59-28
Document20 pages
Becoming A Problem Solving Genius 01-Sep-2022 20-59-28
Gladson
No ratings yet