Welcome to Scribd!

DM Coursework 1

Uploaded by

0% found this document useful (0 votes)

17 views2 pages

This document contains a data mining question that provides attributes for climate, soil type, market demand, and profitability for 8 farmers growing sweetcorn. It asks the reader to use different algorithms like 1R, Naive Bayes, ID3, and PRISM to derive rules and predictions for whether a farmer with attributes of warm climate, acid soil, and medium demand should grow sweetcorn next year based on the provided data. It also contains optional questions on linear models, the perceptron algorithm, and k-nearest neighbors.

Original Description:

Original Title

DM_Coursework_1 (3)

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

17 views2 pages

DM Coursework 1

Uploaded by

omer hassan

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 2

Search inside document

WSS552: Data Mining and Big Data Analytics

Data Mining Coursework 1

Question 1
A farmer needs to decide whether or not to grow sweetcorn next year. He has
a table of data based on the experience of eight other farmers showing
attributes for climate, soil type and market demand, along with the outcome:
whether or not the crop was profitable.

Climate Soil Demand Profitable?

Hot Acid High Yes

Hot Acid Medium Yes
Hot Acid Low No
Hot Acid Low No
Warm Acid Medium Yes
Warm Alkali Medium No
Warm Alkali Medium No
Cold Acid Low No

He has determined that the attribute values for next year are: {Warm, Acid,
Medium}.

(a) Use 1R to derive a set of rules from the above data. Explain all the steps
you follow and show your calculations.
(b) According to your 1R rules should he grow sweetcorn next year?
(c) Use Naïve Bayes to derive the probabilities about growing sweetcorn
being profitable or not. Show all your computations.
(d) Use the ID3 algorithm to derive a decision tree from the above data. Show
all your calculations.
(e) According to your decision tree will growing sweetcorn be profitable or
not?
(f) Use the PRISM algorithm to derive a set of rules from the above data.
Explain all the steps you follow and show your calculations.
(g) According to the rules generated by PRISM will growing sweetcorn be
profitable or not?
(h) Transform your decision tree from (d) to rules and compare them to the
ones generated by PRISM. What do you think is the reason for the
difference between them? Is there some instance for which the two rule
sets will disagree?
(i) Which of these four algorithms would you say is the least suitable for this
problem? Why?
Page 1 of 2
Question 2 - Optional

(a) Consider a two input linear model with inputs x1 and x2, whose decision
surface crosses the x1 axis at 5 and the x2 axis at -1. If the bias weight of
this model is w0 = 0.5, what are the values of its other two weights (w1 and
w2)?
(b) A 2-input perceprton is initialised with the weights w0 = -0.1, w1 = 0.3, and
w2 = 0.05. Carry out the Perceptron Algorithm once on the following set of
examples. For each example calculate the prediction made by the
perceptron and the corresponding weight update. Show all your
calculations.

x1 x2 Class
2 1 -1
0 3 -1
3 -1 -1
0 -1 1
-4 -4 1

(c) Consider the following training set:

x1 x2 x3 x4 x5 class
1 0 1 1 -2 1
0 0 -1 -1 1 0
1 -1 1 1 2 1
-2 0 0 0 0 0
1 0 0 3 -1 1

i. Give the prediction of the 1-Nearest Neighbours algorithm, with the

standard Euclidean distance, for the test example <-1, 1, 0, 0, 1>.
Show all your calculations.
ii. Give the prediction of the 3-Nearest Neighbours algorithm, with the
standard Euclidean distance, for the same test example. Show all your
calculations.

Page 2 of 2

STP1236 Eb.1415051 1 PDF
Document208 pages
STP1236 Eb.1415051 1 PDF
paola
No ratings yet
Olympiad Sample Paper 2: Useful for Olympiad conducted at School, National & International levels
From Everand
Olympiad Sample Paper 2: Useful for Olympiad conducted at School, National & International levels
EDITORIAL BOARD
Rating: 5 out of 5 stars
5/5 (4)
Load-Out and Sea-Fastening Procedure
Document17 pages
Load-Out and Sea-Fastening Procedure
Adaghara
100% (2)
CS230: Deep Learning: Winter Quarter 2018 Stanford University Midterm Examination 180 Minutes
Document36 pages
CS230: Deep Learning: Winter Quarter 2018 Stanford University Midterm Examination 180 Minutes
himanshu singh
No ratings yet
Physics Formulas and Symbols: Physics I Symbol Formula
Document5 pages
Physics Formulas and Symbols: Physics I Symbol Formula
kaparthy
100% (9)
Stereochemistry MSC
Document29 pages
Stereochemistry MSC
Bapu Thorat
50% (2)
Solutions To Deep Learning
Document25 pages
Solutions To Deep Learning
iqbal
No ratings yet
Understanding of AVO and Its Use in Interpretation
Document35 pages
Understanding of AVO and Its Use in Interpretation
brian_schulte_esp803
100% (1)
Ras Abu Aboud Stadium Daily Report 040 (20190613)
Document4 pages
Ras Abu Aboud Stadium Daily Report 040 (20190613)
tuan
50% (2)
Spesifikasi Siemens MRI AERA 1,5 T
Document2 pages
Spesifikasi Siemens MRI AERA 1,5 T
Dr.gendjut
No ratings yet
The Storage Handling and Transportation of Ammonium Nitrate Based Fertilisers 2015
Document58 pages
The Storage Handling and Transportation of Ammonium Nitrate Based Fertilisers 2015
Minh Đức Tạ
No ratings yet
FN 211 Self Test 4: Data Summary, Confidence Intervals, and Hypothesis Testing
Document10 pages
FN 211 Self Test 4: Data Summary, Confidence Intervals, and Hypothesis Testing
RaiNz Season
No ratings yet
ARIMA Forecasting and Autocorrelation
Document18 pages
ARIMA Forecasting and Autocorrelation
Barep Adji Widhi
No ratings yet
Stat 401B Final Exam Analysis
Document9 pages
Stat 401B Final Exam Analysis
juanEs2374p
No ratings yet
Logistic Regression - Exercises
Document8 pages
Logistic Regression - Exercises
Filbertha
No ratings yet
2326_EC2020_Main EQP v1_Final
Document19 pages
2326_EC2020_Main EQP v1_Final
Aryan Mittal
No ratings yet
Homework 7 Traffic Accident Data Analysis
Document5 pages
Homework 7 Traffic Accident Data Analysis
Ragini P
0% (1)
IIT Madras MS4610 Introduction to Data Analytics Final Exam
Document11 pages
IIT Madras MS4610 Introduction to Data Analytics Final Exam
Mohd Saud
No ratings yet
Program Analysis Algorithm Solutions
Document11 pages
Program Analysis Algorithm Solutions
Nour Allam
No ratings yet
Additional Problem Set Units I and II
Document9 pages
Additional Problem Set Units I and II
Spider Man
No ratings yet
13-Mca-Or-Probability & Statistics
Document3 pages
13-Mca-Or-Probability & Statistics
SRINIVASA RAO GANTA
No ratings yet
ST4250 23S1 Assignment 2
Document2 pages
ST4250 23S1 Assignment 2
Loo Guan Yee
No ratings yet
Final 2007 S
Document14 pages
Final 2007 S
Muhammad Murtaza
No ratings yet
Part A. Multiple-Choice Questions.: (12 Points) (4 Points)
Document12 pages
Part A. Multiple-Choice Questions.: (12 Points) (4 Points)
Jacob Schuiteman
No ratings yet
Math 1060 - Lecture 7
Document26 pages
Math 1060 - Lecture 7
John Lee
No ratings yet
Chi Square Test
Document32 pages
Chi Square Test
shubendu ghosh
No ratings yet
HW
Document5 pages
HW
Hà Nguyễn
No ratings yet
Workshop 2 PDF
Document2 pages
Workshop 2 PDF
Josie
No ratings yet
Final2017 Solution PDF
Document14 pages
Final2017 Solution PDF
Vikram Sharma
No ratings yet
MFC CO 1 Independent Learing Problems Sessions 1 12
Document6 pages
MFC CO 1 Independent Learing Problems Sessions 1 12
Narayanasetti Saranya
No ratings yet
PGT 202E Presentation Titles First Semester 2021/2022 Instructions: 1. All The Presentation Handouts Must Include
Document4 pages
PGT 202E Presentation Titles First Semester 2021/2022 Instructions: 1. All The Presentation Handouts Must Include
Ain Ismazatul Alia binti Murshid
No ratings yet
Allama Iqbal Open University, Islamabad (Department of Statistics) Warning
Document4 pages
Allama Iqbal Open University, Islamabad (Department of Statistics) Warning
samranaseem367
No ratings yet
Sapmle Exam2.solution
Document10 pages
Sapmle Exam2.solution
Rahul Bhatia
No ratings yet
Sample Questions Pattern Recognition
Document8 pages
Sample Questions Pattern Recognition
Debadutta Nayak
No ratings yet
Operation Research
Document21 pages
Operation Research
keshav kumar
No ratings yet
Tables 4, 5, 7, 8, 9, 10, 13 & 14 (New Cambridge) - Graph Paper
Document21 pages
Tables 4, 5, 7, 8, 9, 10, 13 & 14 (New Cambridge) - Graph Paper
cookieproductor
No ratings yet
Class XI English Holiday Assignment 2023
Document17 pages
Class XI English Holiday Assignment 2023
Gursidak Dahiya
No ratings yet
Introduction to Biostatistics Descriptive Statistics
Document5 pages
Introduction to Biostatistics Descriptive Statistics
Jinitha Babe
No ratings yet
Chapter 2 Regression and Forecasting
Document88 pages
Chapter 2 Regression and Forecasting
IslamSharaf
No ratings yet
Deep Learning Practical Assignment #1:: Instructions
Document5 pages
Deep Learning Practical Assignment #1:: Instructions
Gaith Belkacem
No ratings yet
Eecs 4750 F2018 HW1
Document2 pages
Eecs 4750 F2018 HW1
shubham97
No ratings yet
CO-1: Tutorials Tutorial-1
Document9 pages
CO-1: Tutorials Tutorial-1
K Balaji
No ratings yet
Econometrics exercises: Estimating LRM
Document24 pages
Econometrics exercises: Estimating LRM
nishit0157623637
No ratings yet
NR-220105 - Probability and Statistics
Document8 pages
NR-220105 - Probability and Statistics
Srinivasa Rao G
100% (1)
Yashahuja Report
Document10 pages
Yashahuja Report
wocim20084
No ratings yet
Section 1: Multiple Choice For Each Question in This Section, Circle The Correct Answer
Document4 pages
Section 1: Multiple Choice For Each Question in This Section, Circle The Correct Answer
Taha Wael Qandeel
No ratings yet
Chapter 9: Linear Regression and Correlation
Document6 pages
Chapter 9: Linear Regression and Correlation
Wong Veronica
No ratings yet
ECTX 2004: Econometrics - I: Section - I (1 Point Each)
Document3 pages
ECTX 2004: Econometrics - I: Section - I (1 Point Each)
Ajay Chiratla
No ratings yet
Statistics GIDP Ph.D. Qualifying Exam Methodology: January 10, 9:00am-1:00pm
Document20 pages
Statistics GIDP Ph.D. Qualifying Exam Methodology: January 10, 9:00am-1:00pm
Md. Mujahidul Islam
No ratings yet
Tutorial FST ENSET 2021
Document3 pages
Tutorial FST ENSET 2021
Atoh Courage
No ratings yet
2017 SA Feb-17
Document18 pages
2017 SA Feb-17
vishwas kumar
No ratings yet
Solutions To The Exercises: Solution
Document105 pages
Solutions To The Exercises: Solution
rizwan ghafoor
No ratings yet
Solutions To The Exercises: Solution
Document105 pages
Solutions To The Exercises: Solution
aanchal singh
No ratings yet
Probability Assignment 2
Document3 pages
Probability Assignment 2
Tathastu Vats
No ratings yet
BDA Assignment (Savi Bilandi)
Document10 pages
BDA Assignment (Savi Bilandi)
SAVI
No ratings yet
Coding 8
Document7 pages
Coding 8
Emperor'l Bill
No ratings yet
final2006
Document15 pages
final2006
유홍승
No ratings yet
BEA 242 Introduction To Econometrics Group Assignment (Updated On 10 May 2012: The Change in Highlighted)
Document4 pages
BEA 242 Introduction To Econometrics Group Assignment (Updated On 10 May 2012: The Change in Highlighted)
Reza Riantono Sukarno
No ratings yet
Genetic Algorithms Tutorials
Document29 pages
Genetic Algorithms Tutorials
Mustafamna Al Salam
No ratings yet
AB1202 Quiz 3 Prep Special R-Skills v1 Nov'20oubhjnl
Document2 pages
AB1202 Quiz 3 Prep Special R-Skills v1 Nov'20oubhjnl
Trash Bin
No ratings yet
ML Question Bank and Sol
Document12 pages
ML Question Bank and Sol
Prabhu Prasad Dev
No ratings yet
Tutorials - 4, 5, 6, 7
Document4 pages
Tutorials - 4, 5, 6, 7
chandra teja gudapati
No ratings yet
Estimating Population Variances
Document17 pages
Estimating Population Variances
Rossel Jane Campillo
No ratings yet
Stat 401B Homework
Document9 pages
Stat 401B Homework
juanEs2374p
No ratings yet
SS ZG568 EC 2M SECOND SEM 2020 2021 Solution 1617600765956
Document9 pages
SS ZG568 EC 2M SECOND SEM 2020 2021 Solution 1617600765956
amrasirah
No ratings yet
Econometrics
Document7 pages
Econometrics
mehrin.morshed1230
No ratings yet
Assignments
Document6 pages
Assignments
Peter Chenza
No ratings yet
2021, Obe, 12277502
Document6 pages
2021, Obe, 12277502
nikhiljoshisrcc
No ratings yet
G5 C11 Test
Document6 pages
G5 C11 Test
victoria
No ratings yet
Tables in National Plumbing Code
Document4 pages
Tables in National Plumbing Code
Martin Gragasin
No ratings yet
Deskripsi (Caffein)
Document4 pages
Deskripsi (Caffein)
jibefahla
No ratings yet
Lecture No.3 Part 1 (Fan)
Document6 pages
Lecture No.3 Part 1 (Fan)
Mohsen Hassan
No ratings yet
Chap 5. Beam Analysis and Design PDF
Document61 pages
Chap 5. Beam Analysis and Design PDF
Rafael Joshua Ledesma
No ratings yet
Top 21 Largest EMS Companies in World
Document22 pages
Top 21 Largest EMS Companies in World
jack
No ratings yet
FT Aeroterme GEA
Document15 pages
FT Aeroterme GEA
CrisTim
No ratings yet
Differences in Left Ventricular and Left Atrial Fu
Document10 pages
Differences in Left Ventricular and Left Atrial Fu
eugenia
No ratings yet
Pneumatic Pruning Equipment American Arborist Supplies, Tree Care, Climbing Equipment
Document1 page
Pneumatic Pruning Equipment American Arborist Supplies, Tree Care, Climbing Equipment
Salman Jo
No ratings yet
The Future - G&V
Document6 pages
The Future - G&V
ManuelHerreraMontoya
No ratings yet
CHS-WWW - Polsteel. TUBOS METALICOS PDF
Document3 pages
CHS-WWW - Polsteel. TUBOS METALICOS PDF
Eduardo Torre
No ratings yet
GROHE Specification Sheet 19443000-1
Document2 pages
GROHE Specification Sheet 19443000-1
Fred Prz
No ratings yet
Shat Karma Concise
Document4 pages
Shat Karma Concise
sarikaabhay
No ratings yet
Mobile Network Optimization Map
Document1 page
Mobile Network Optimization Map
Shahzad Farooq
100% (1)
Chapter 14 Chemical Equilibrium
Document29 pages
Chapter 14 Chemical Equilibrium
lynloe
No ratings yet
Foreign Body Airway Obstruction
Document6 pages
Foreign Body Airway Obstruction
Reeja Rajesh
No ratings yet
Binzel - Katalog MAG
Document64 pages
Binzel - Katalog MAG
Adrian Kustra
No ratings yet
Kyocera Fs-6900 Parts Manual
Document28 pages
Kyocera Fs-6900 Parts Manual
Nic Cowpe
No ratings yet
Management Foreign Body
Document6 pages
Management Foreign Body
Rahmatia Syukrina
No ratings yet
Computer Engineering Syllabus
Document47 pages
Computer Engineering Syllabus
Lily Chan
No ratings yet
Basics of Scientific Writing, Scientific Research, and Elementary Data Analysis
Document12 pages
Basics of Scientific Writing, Scientific Research, and Elementary Data Analysis
burhan sabir
No ratings yet
FIKE RD Combo With Relief Valves
Document11 pages
FIKE RD Combo With Relief Valves
Ankit Gandhi
No ratings yet