
CS504 Spring 2020 Homework 3

Due: 04/30/2020 by 11:59PM

Description:
Download WEKA from: http://www.cs.waikato.ac.nz/ml/weka/
Note that Weka assumes by default that the class attribute is the last column.

Datasets and corresponding descriptions are provided with this homework.

Dataset    Description
a.arff     a.png
b.arff     b.png
c.arff     c.png

For this assignment, you will use WEKA to evaluate 4 different classifiers
(DecisionStump, J48, IBk (k-nearest neighbors), and NaiveBayes) on three synthetic
datasets. This will be done in the following steps:
1. First, you will explore the datasets.
2. Next, you will perform a series of experiments using Weka Explorer. For each
experiment, you will be asked to answer a series of questions.
3. Compile your answers into a single PDF file.

Data Exploration (6 points)


• Visually explore the data sets and, for each data set, describe the following
(a scripted alternative using Weka's Java API is sketched after this list):
o Types of attributes
o Class distribution
o Any special structure you observe, if any
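
If you prefer scripting to the GUI, the same checks can be done with Weka's Java API. The sketch below is only one possible way to do this (it assumes the Weka jar is on the classpath and that a.arff is in the working directory); the GUI route described in the experiments is equally acceptable.

    // Inspect attribute types and the class distribution of an ARFF file
    // (shown for a.arff; repeat for b.arff and c.arff).
    import weka.core.Attribute;
    import weka.core.Instances;
    import weka.core.converters.ConverterUtils.DataSource;

    public class Explore {
        public static void main(String[] args) throws Exception {
            Instances data = new DataSource("a.arff").getDataSet();
            // Weka assumes the class attribute is the last column.
            data.setClassIndex(data.numAttributes() - 1);

            // Attribute types
            for (int i = 0; i < data.numAttributes(); i++) {
                Attribute a = data.attribute(i);
                System.out.println(a.name() + ": "
                        + (a.isNominal() ? "nominal" : a.isNumeric() ? "numeric" : "other"));
            }

            // Class distribution (instance counts per class value)
            int[] counts = data.attributeStats(data.classIndex()).nominalCounts;
            for (int i = 0; i < counts.length; i++) {
                System.out.println("class " + data.classAttribute().value(i)
                        + ": " + counts[i]);
            }
        }
    }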

Experiments (8 points each)


• Experiment 1: use 10-fold cross-validation to test and compare DecisionStump and
J48 on data set c. Here are the steps to do this:
o Click the Explorer button.
o Click the "Open file" button and load the c.arff file.
o With the Preprocess tab open, make sure the "Class (Nom)" field is
selected on the right side of the screen. On the left, select different
attributes and observe how the data are distributed over the two classes (1 and
2). In particular, note the distribution of the "class" attribute. You can also
click "Visualize All" to view all attribute distributions at once.
o Click on the Classify tab.
§ Click the Choose button, then select
• Trees->DecisionStump
§ Make sure (Nom) Class is selected as the attribute to predict.
§ Under Test options, select Cross-validation with Folds set to 10.
§ Click Start and review the output on the right. Note the correctly
classified instances (accuracy) and the root mean squared error (RMSE).
o REPEAT the above steps for:
§ Trees->J48
§ Trees->J48 unpruned (next to the Choose button where you
selected J48, click on the line showing the classifier's parameters; this
opens a window with its options. Set the unpruned option to True.)
o In a table, list the classification accuracy (correctly classified instance
percentage) and the RMSE for each classifier (one row for
DecisionStump, two rows for J48: pruned and unpruned).
o For DecisionStump, briefly explain the technique and list the attribute that
was used to make the decision. Compare the results of J48 (pruned and
unpruned) and explain why the pruned tree performs better. (An optional
Java sketch for reproducing these runs follows this list.)
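
As a cross-check on the Explorer output, the same 10-fold cross-validation can be run through Weka's Java API. This is an optional sketch, not the required procedure; the file name c.arff comes from the assignment, and the random seed 1 is Weka's default for cross-validation.

    // 10-fold cross-validation of DecisionStump, J48 (pruned) and J48 (unpruned) on c.arff
    import java.util.Random;
    import weka.classifiers.Classifier;
    import weka.classifiers.Evaluation;
    import weka.classifiers.trees.DecisionStump;
    import weka.classifiers.trees.J48;
    import weka.core.Instances;
    import weka.core.converters.ConverterUtils.DataSource;

    public class Experiment1 {
        public static void main(String[] args) throws Exception {
            Instances data = new DataSource("c.arff").getDataSet();
            data.setClassIndex(data.numAttributes() - 1);

            J48 unpruned = new J48();
            unpruned.setUnpruned(true);   // same as setting unpruned = True in the GUI

            Classifier[] models = { new DecisionStump(), new J48(), unpruned };
            String[] names = { "DecisionStump", "J48 (pruned)", "J48 (unpruned)" };

            for (int i = 0; i < models.length; i++) {
                Evaluation eval = new Evaluation(data);
                eval.crossValidateModel(models[i], data, 10, new Random(1));
                System.out.printf("%-15s accuracy = %.2f%%  RMSE = %.4f%n",
                        names[i], eval.pctCorrect(), eval.rootMeanSquaredError());
            }
        }
    }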

• Experiment 2: Run J48 (pruned), NaiveBayes, and IBk (k=1 and k=21) on data
sets a and b using default parameters (see the sketch after this list).
o For each classifier, use F-measure to compare its performance obtained on
data set a to its performance obtained on data set b.
o For data set a, compare the performance of the 4 classifiers using F-
measure.
o Give explanations for your observations above.
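
One possible way to collect the F-measures for this experiment in a single run is sketched below. It assumes the Explorer's default evaluation setup (10-fold cross-validation, seed 1); everything else follows directly from the experiment description.

    // Weighted F-measure of J48, NaiveBayes and IBk (k=1, k=21) on a.arff and b.arff
    import java.util.Random;
    import weka.classifiers.Classifier;
    import weka.classifiers.Evaluation;
    import weka.classifiers.bayes.NaiveBayes;
    import weka.classifiers.lazy.IBk;
    import weka.classifiers.trees.J48;
    import weka.core.Instances;
    import weka.core.converters.ConverterUtils.DataSource;

    public class Experiment2 {
        public static void main(String[] args) throws Exception {
            for (String file : new String[] { "a.arff", "b.arff" }) {
                Instances data = new DataSource(file).getDataSet();
                data.setClassIndex(data.numAttributes() - 1);

                Classifier[] models = { new J48(), new NaiveBayes(), new IBk(1), new IBk(21) };
                String[] names = { "J48 (pruned)", "NaiveBayes", "IBk k=1", "IBk k=21" };

                for (int i = 0; i < models.length; i++) {
                    Evaluation eval = new Evaluation(data);
                    eval.crossValidateModel(models[i], data, 10, new Random(1));
                    // weightedFMeasure() averages the per-class F-measures weighted by class size;
                    // eval.fMeasure(classIndex) gives the value for a single class.
                    System.out.printf("%s  %-12s F-measure = %.4f%n",
                            file, names[i], eval.weightedFMeasure());
                }
            }
        }
    }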

• Experiment 3: Run NaiveBayes and IBk (k=1 and k=10) on data set c using default
parameters (see the sketch after this list).
o Compare the performance of the 3 classifiers using F-measure.
§ Comment on the effect of k.
o Give explanations for your observations above.
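
The loop is the same as in the Experiment 2 sketch; the only new piece is setting k on IBk. A compact version (again assuming 10-fold cross-validation with seed 1, the Explorer default, and the two class values of data set c) might look like this:

    // NaiveBayes vs IBk (k=1 and k=10) on c.arff, compared by F-measure
    import java.util.Random;
    import weka.classifiers.Classifier;
    import weka.classifiers.Evaluation;
    import weka.classifiers.bayes.NaiveBayes;
    import weka.classifiers.lazy.IBk;
    import weka.core.Instances;
    import weka.core.converters.ConverterUtils.DataSource;

    public class Experiment3 {
        public static void main(String[] args) throws Exception {
            Instances data = new DataSource("c.arff").getDataSet();
            data.setClassIndex(data.numAttributes() - 1);

            IBk knn10 = new IBk();
            knn10.setKNN(10);   // same as the KNN field in IBk's options window

            Classifier[] models = { new NaiveBayes(), new IBk(1), knn10 };
            String[] names = { "NaiveBayes", "IBk k=1", "IBk k=10" };

            for (int i = 0; i < models.length; i++) {
                Evaluation eval = new Evaluation(data);
                eval.crossValidateModel(models[i], data, 10, new Random(1));
                // Per-class F-measures (indices 0 and 1) plus the weighted average.
                System.out.printf("%-10s F(class 1) = %.4f  F(class 2) = %.4f  weighted F = %.4f%n",
                        names[i], eval.fMeasure(0), eval.fMeasure(1), eval.weightedFMeasure());
            }
        }
    }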

Deliverables:
A single PDF document including all the answers for questions in data exploration and
experiments.
