Welcome to Scribd!

Skip carousel

WQD7005 (Alternative Assessment)

Uploaded by

AdamZain788

100% found this document useful (1 vote)

150 views4 pages

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

100% found this document useful (1 vote)

150 views4 pages

WQD7005 (Alternative Assessment)

Uploaded by

AdamZain788

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 4

Search inside document

UNIVERSITI MALAYA

EXAMINATION FOR THE DEGREE OF MASTER OF DATA SCIENCE

ACADEMIC SESSION 2020/2021 : SEMESTER I

WQD7005: Data Mining

14th January 2020 from 8.00 am to 15th January 2020 5.00 pm

INSTRUCTIONS TO CANDIDATES:

Answer ALL questions (50 marks).

(This exam contains 4 pages including the first title page)

WQD7005

PART A (30 marks)

1) Define "Data Mining" in terms of Business Intelligence (keeping in mind the data
transformation from Online Transaction Process (OLTP) to Online Analytic Process
(OLAP)).
(5 marks)

2) Suppose that the data for analysis includes the attribute age. The age values for the data
tuples are (in increasing order) 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30, 33,
33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70. (5 marks)

a) What is the mean of the data? (1 mark)

b) What is the median? (1 mark)
c) What is the mode of the data? (1 mark)
d) Use smoothing by bin means to smooth the above data, using a bin depth of 3.
Illustrate your steps. (2 marks)

3) Suppose you have the following four Dimension Tables namely Time, Customer, Employee
and Product. Construct a snowflake scheme by developing "Sales" Fact Table. The linkage
attribute in the dimension tables can be used to split the table to form a snowflake scheme.
The aggregate variable of fact table can be "quantity" of products.

Time Customer
OrderID (primary key) CustID (primary key)
Order Date Name
Year Address
Quarter CityID (linkage attribute)
Month City Name
Zip Code
State
Country

Employee Product
EmpID (primary key) ProductID
Employee Name Product Name
DepartmentID (linkage attribute) Product Category
Region Product Description
Territory

(5 marks)

4) Suppose you have the following transactional database, construct an FP (frequent pattern)
tree from this transaction database.
(5 marks)

2/4
WQD7005

5) Let us consider the dataset of sales related to computer systems (e.g. hardware and software)
shown below. We are required to learn a decision tree which predicts the profit either up or
down based on certain features i.e. condition, upgradable and type.
(5 marks)

Condition Upgradable Type Profit

Old Yes S/W Down
Old No S/W Down
Old No H/W Down
Mid Yes S/W Down
Mid Yes H/W Down
Mid No H/W Up
Mid No S/W Up
New Yes S/W Up
New No H/W Up
New No S/W Up

Calculate the Information Gain of feature "Condition" based on,

Entropy (Profit)
Entropy (Old)
Entropy (Mid)
Entropy (New)
Entropy (Condition)

6) Write down the steps of DBScan algorithm.

(5 marks)

3/4
WQD7005

PART B (20 marks)

Instructions: Answer the following questions by using any data mining tool. Explain how you
do each step (include print screens). Download “Data(Exam).csv” from the Spectrum (You can
find the description of this data at https://archive.ics.uci.edu/ml/datasets/Zoo).

1) Select the best non-target features using one of statistical methods "correlation", "Chi-
square", or "ANOVA". Your solution should describe the relevant statistical findings.
(5 marks)

2) Experiment/simulate the classification algorithms (Naive Bayes, Random Forest, Support

Vector Machine) and identify the best algorithm among the three algorithms using 10-fold
cross validation. Justify your choice of algorithm in terms of classification accuracy and
false positive rate.
(10 marks)

3) Discuss the performance metric of all three algorithms in terms of Receiver Operator
Characteristic (ROC) curve.
(5 marks)

END

4/4

CCS592 - AZHT KSCP 2017marking Scheme USM
Document5 pages
CCS592 - AZHT KSCP 2017marking Scheme USM
DhavalChheda
No ratings yet
WQD7006 Exam Part2
Document3 pages
WQD7006 Exam Part2
AdamZain788
100% (1)
Physics 203p
Document9 pages
Physics 203p
lalslsal
No ratings yet
Pms 5003
Document3 pages
Pms 5003
LB
No ratings yet
SHA512
Document24 pages
SHA512
Aravind Sai
No ratings yet
Syllabus CEA201 Spring 2022
Document15 pages
Syllabus CEA201 Spring 2022
Dang Hoang Viet (K17 HCM)
No ratings yet
Panel Data Analysis Using EViews Chapter - 1 PDF
Document30 pages
Panel Data Analysis Using EViews Chapter - 1 PDF
imohamed2
No ratings yet
Chapter 4
Document34 pages
Chapter 4
ALBERTUS DE
No ratings yet
Student Answers
Document31 pages
Student Answers
davod_ir
0% (2)
Set-22 Mba I Semester Assign Questions
Document10 pages
Set-22 Mba I Semester Assign Questions
இந்துமதி வெங்கடரமணன்
No ratings yet
ECO4122Z - 2021 Examination Question
Document3 pages
ECO4122Z - 2021 Examination Question
Rico Bartman
No ratings yet
It 05106 DBMS
Document22 pages
It 05106 DBMS
sjognipa
No ratings yet
TCP/IP Configuration (7 Points) : NAME: Justin Quen
Document9 pages
TCP/IP Configuration (7 Points) : NAME: Justin Quen
Justin Quen
No ratings yet
Erdinger Block Diagram: Kabylake-U
Document94 pages
Erdinger Block Diagram: Kabylake-U
EduinMaracuchoFernandezChaparro
No ratings yet
Duane Group Project 2 Questions and Instructions FIN 5203 1D2 FA21
Document3 pages
Duane Group Project 2 Questions and Instructions FIN 5203 1D2 FA21
NarasimhaBadri
0% (1)
Bahria University Midterm ITB Paper Fall 2021 07052021 091547pm
Document1 page
Bahria University Midterm ITB Paper Fall 2021 07052021 091547pm
Umer Abid
100% (1)
Accounting 2
Document7 pages
Accounting 2
vietthui
No ratings yet
Institute and Faculty of Actuaries: Subject CM2A - Financial Engineering and Loss Reserving Core Principles
Document8 pages
Institute and Faculty of Actuaries: Subject CM2A - Financial Engineering and Loss Reserving Core Principles
A
No ratings yet
State Wise SDP 02 08 2021 11aug21
Document6 pages
State Wise SDP 02 08 2021 11aug21
Jahangir Seema Nizami
No ratings yet
Challenges of Agile
Document5 pages
Challenges of Agile
Andra Widana
No ratings yet
Microprocessor 8085 Practical Course Guide
Document79 pages
Microprocessor 8085 Practical Course Guide
SK Kashyap
No ratings yet
Final Mapi Lab Manual
Document44 pages
Final Mapi Lab Manual
Ramachandra Reddy
No ratings yet
DMDW-Solution For Unit 1-5
Document20 pages
DMDW-Solution For Unit 1-5
Manish Arya
50% (2)
Superbar 2019 PDF
Document32 pages
Superbar 2019 PDF
Ramsey Bolton
No ratings yet
Assignment 2
Document2 pages
Assignment 2
Zain ali
No ratings yet
Chapter 20
Document93 pages
Chapter 20
Irina Alexandra
No ratings yet
HDP e Drainage Products Brochure 10222114
Document20 pages
HDP e Drainage Products Brochure 10222114
realchic
No ratings yet
Quiz #9 (CH 11) - Siyao Guan
Document7 pages
Quiz #9 (CH 11) - Siyao Guan
balwantsharma1993
No ratings yet
DBMS Lab Assignment 5
Document11 pages
DBMS Lab Assignment 5
Akashdeep Balu
0% (1)
Codesolar Lorentz SM lc175-24m En-1 PDF
Document2 pages
Codesolar Lorentz SM lc175-24m En-1 PDF
walter_lopezv
No ratings yet
PS2
Document18 pages
PS2
Thanh Nguyen
No ratings yet
What-If Analysis and Sensitivity Analysis for Evaluating Decision Alternatives
Document8 pages
What-If Analysis and Sensitivity Analysis for Evaluating Decision Alternatives
Nadia Swais
No ratings yet
BITS Pilani Reliability Engineering Mid-Semester Test Questions
Document4 pages
BITS Pilani Reliability Engineering Mid-Semester Test Questions
Sooraj Dilip Kumar
100% (1)
BT6270 Assignment 1
Document1 page
BT6270 Assignment 1
NAGAVARSHINI
No ratings yet
Subject: PRF192-PFC Workshop 02: Objectives
Document5 pages
Subject: PRF192-PFC Workshop 02: Objectives
Linh Tí
0% (1)
Formula Information For Process Manufacturing
Document21 pages
Formula Information For Process Manufacturing
Dia Siri
No ratings yet
Study of Architecture of DSP TMS320C6748
Document9 pages
Study of Architecture of DSP TMS320C6748
Varssha B
No ratings yet
Tutorial 2 Solution 2018
Document3 pages
Tutorial 2 Solution 2018
Anit Kumar
50% (4)
Fault Finding
Document52 pages
Fault Finding
buzzboy67
No ratings yet
COMP 6651 Syllabus Fall 2022
Document6 pages
COMP 6651 Syllabus Fall 2022
Akshita Patel
No ratings yet
Midterm - Version A (For Posting)
Document21 pages
Midterm - Version A (For Posting)
Dennis Chow
No ratings yet
Microprocessor Lab1
Document3 pages
Microprocessor Lab1
kidu
100% (1)
Institute and Faculty of Actuaries: Subject CT5 - Contingencies Core Technical
Document19 pages
Institute and Faculty of Actuaries: Subject CT5 - Contingencies Core Technical
Nayaz NM
No ratings yet
CS1 and CS2 Guide Jan 20 Final PDF
Document9 pages
CS1 and CS2 Guide Jan 20 Final PDF
SC
No ratings yet
Homework: Linear and Integer Optimization: 1.1 Restaurant Stang
Document4 pages
Homework: Linear and Integer Optimization: 1.1 Restaurant Stang
Poojitha Pooji
No ratings yet
Distributed Databases - Vertical and Horizontal Fragmentation
Document40 pages
Distributed Databases - Vertical and Horizontal Fragmentation
RAKHEE YADAV
No ratings yet
Assignment 2
Document5 pages
Assignment 2
Marcus Goh
No ratings yet
L&T Infra bond redemption and trading options
Document3 pages
L&T Infra bond redemption and trading options
Anshuman Sharma
No ratings yet
FALLSEM2019-20 CSE2001 TH VL2019201007433 Reference Material I 18-Sep-2019 DA-2 PDF
Document2 pages
FALLSEM2019-20 CSE2001 TH VL2019201007433 Reference Material I 18-Sep-2019 DA-2 PDF
Subarna Lamsal
0% (1)
IEEE Xtreme Programming Challenge
Document63 pages
IEEE Xtreme Programming Challenge
d1560543
No ratings yet
MA252 - Combinatorial Optimisation
Document9 pages
MA252 - Combinatorial Optimisation
Rebecca Rumsey
No ratings yet
ct52010 2014
Document202 pages
ct52010 2014
Corry Carlton
No ratings yet
Mba ZG526 Ec-3r First Sem 2020-2021
Document2 pages
Mba ZG526 Ec-3r First Sem 2020-2021
Manu@777
100% (1)
WQD7005 (Alternative Assessment)
Document4 pages
WQD7005 (Alternative Assessment)
AdamZain788
No ratings yet
Data Mining Class Test
Document3 pages
Data Mining Class Test
mahbubur rahman
No ratings yet
Information2 Analysis and Decision Making Techniques PDF
Document47 pages
Information2 Analysis and Decision Making Techniques PDF
alkalkia
No ratings yet
Typical Interview Questions
Document11 pages
Typical Interview Questions
sudas14
No ratings yet
Typical Interview Questions PDF
Document9 pages
Typical Interview Questions PDF
Vinícius Pereira
No ratings yet
03-Project Definition & Scoping
Document13 pages
03-Project Definition & Scoping
Saumya Gunawardana
No ratings yet
Learning Highcharts
From Everand
Learning Highcharts
Joe Kuan
No ratings yet
WQD700D Test2 Input Data
Document1 page
WQD700D Test2 Input Data
AdamZain788
No ratings yet
WQD7005 Final Exam - 17219402
Document12 pages
WQD7005 Final Exam - 17219402
AdamZain788
100% (1)
WQD7005 Final Exam - 17219402
Document12 pages
WQD7005 Final Exam - 17219402
AdamZain788
No ratings yet
!any Year Calendar (1 Month Per Tab) 1
Document12 pages
!any Year Calendar (1 Month Per Tab) 1
tiendo
No ratings yet
WQD7005 Case Study - 17219402
Document21 pages
WQD7005 Case Study - 17219402
AdamZain788
No ratings yet
Assignment Step 3 - 17219402
Document2 pages
Assignment Step 3 - 17219402
AdamZain788
No ratings yet
WQD7005 (Alternative Assessment)
Document4 pages
WQD7005 (Alternative Assessment)
AdamZain788
No ratings yet
Age values and smoothed data from bins
Document2 pages
Age values and smoothed data from bins
AdamZain788
No ratings yet
Research Proposal Part 1 - 17219402
Document2 pages
Research Proposal Part 1 - 17219402
AdamZain788
No ratings yet
WQD7007 Big Data Management: Introduction To The Course
Document6 pages
WQD7007 Big Data Management: Introduction To The Course
AdamZain788
No ratings yet
Week 8 Analyzing and Interpreting Quantitative Data
Document1 page
Week 8 Analyzing and Interpreting Quantitative Data
AdamZain788
No ratings yet
WQD7005 Case Study - 17219402
Document21 pages
WQD7005 Case Study - 17219402
AdamZain788
No ratings yet
WQD7005 (Alternative Assessment)
Document4 pages
WQD7005 (Alternative Assessment)
AdamZain788
100% (1)
Concepts and Techniques: Data Mining
Document27 pages
Concepts and Techniques: Data Mining
AdamZain788
No ratings yet
Clustering Density Based
Document14 pages
Clustering Density Based
AdamZain788
No ratings yet
04 Big Data Concepts - Identification and Ontologies
Document32 pages
04 Big Data Concepts - Identification and Ontologies
AdamZain788
No ratings yet
Colgate Palmolive List of Mills As of June 2019h2
Document51 pages
Colgate Palmolive List of Mills As of June 2019h2
AdamZain788
No ratings yet
Lecture 13: Advanced Methods (Machine and Deep Learning) : Instructions
Document2 pages
Lecture 13: Advanced Methods (Machine and Deep Learning) : Instructions
AdamZain788
No ratings yet
List - of - Companies - 20211231 - Bursa Malaysia
Document34 pages
List - of - Companies - 20211231 - Bursa Malaysia
AdamZain788
No ratings yet
Bursa Data - ContactList-14January2020
Document19 pages
Bursa Data - ContactList-14January2020
AdamZain788
No ratings yet
Instructions For Lecture 10
Document1 page
Instructions For Lecture 10
AdamZain788
No ratings yet
List - of - Companies - 20211231 - Bursa Malaysia
Document34 pages
List - of - Companies - 20211231 - Bursa Malaysia
AdamZain788
No ratings yet
List of Industrial Partners With UniKL
Document559 pages
List of Industrial Partners With UniKL
AdamZain788
No ratings yet
Business Research Method Proposal
Document38 pages
Business Research Method Proposal
AdamZain788
No ratings yet
Top 100 Companies in Johor
Document2 pages
Top 100 Companies in Johor
AdamZain788
No ratings yet
Applied Research Project PDF
Document34 pages
Applied Research Project PDF
AdamZain788
No ratings yet
Leardership
Document7 pages
Leardership
AdamZain788
No ratings yet
Overall Portfolio - ASG
Document44 pages
Overall Portfolio - ASG
AdamZain788
No ratings yet
Web Intelligence Advanced
Document96 pages
Web Intelligence Advanced
Srikanth Tatipaka
No ratings yet
Einstein Analytics and Discovery Consultant Demo
Document5 pages
Einstein Analytics and Discovery Consultant Demo
Mano Dev
No ratings yet
Mastering Mongodb 7.0: Fourth Edition
Document447 pages
Mastering Mongodb 7.0: Fourth Edition
fpm89470
No ratings yet
Manage Student & Staff Records
Document19 pages
Manage Student & Staff Records
MUHAMMAD MUDASSAR TAHIR NCBA&E
No ratings yet
DBMS Fundamentals Quiz
Document23 pages
DBMS Fundamentals Quiz
Bogdan Tosa
50% (4)
Unit 1.2
Document11 pages
Unit 1.2
Ughrisha V
No ratings yet
Database Management System
Document33 pages
Database Management System
Winter Blossom
No ratings yet
DBMS Cie 2
Document3 pages
DBMS Cie 2
learn something
No ratings yet
Hospital Management System Database Design
Document8 pages
Hospital Management System Database Design
Malatesh Havanagi
83% (60)
Santosh Goud - Senior AWS Big Data Engineer
Document9 pages
Santosh Goud - Senior AWS Big Data Engineer
Pranay G
No ratings yet
Enforcing Data Quality
Document28 pages
Enforcing Data Quality
Richie Poo
No ratings yet
Data Analyst Roadmap Skills Resources
Document7 pages
Data Analyst Roadmap Skills Resources
sai maurya
No ratings yet
DWH
Document5 pages
DWH
chaitanya paruvada
No ratings yet
DATA MODELING FUNDAMENTALS
Document11 pages
DATA MODELING FUNDAMENTALS
Hassan Khan
No ratings yet
So Some of Us Are Afraid of AI - What AI
Document5 pages
So Some of Us Are Afraid of AI - What AI
dawudh
No ratings yet
D77758GC20 15947 Us
Document6 pages
D77758GC20 15947 Us
William Lee
0% (1)
DB Lab4
Document7 pages
DB Lab4
Muhammad Talha Qadri
No ratings yet
SQL Joins
Document7 pages
SQL Joins
Smaranika_Saho_651
No ratings yet
What Is The Syntax For Match, Vlookup and Offset?
Document4 pages
What Is The Syntax For Match, Vlookup and Offset?
Clark Domingo
No ratings yet
MCQ Database design and queries
Document9 pages
MCQ Database design and queries
Being Cringe
No ratings yet
Tutorial Qlik Errors in QV
Document18 pages
Tutorial Qlik Errors in QV
nassif.hassane
No ratings yet
D7.12 Data Management Plan Phase 3 v1.0
Document9 pages
D7.12 Data Management Plan Phase 3 v1.0
gkout
No ratings yet
10 Distributeddbms
Document56 pages
10 Distributeddbms
Krishna Kumar
No ratings yet
Entity-Relationship Model - Wikipedia, The Free Encyclopedia PDF
Document10 pages
Entity-Relationship Model - Wikipedia, The Free Encyclopedia PDF
Javier Garcia Rajoy
No ratings yet
Lab Manual Dbms
Document45 pages
Lab Manual Dbms
compiler&automata
No ratings yet
Fuzzy Keyword Search Over Encrypted Data in Cloud Computing
Document2 pages
Fuzzy Keyword Search Over Encrypted Data in Cloud Computing
Jubaira Samsudeen
No ratings yet
Chapter 6 Information Management Basics
Document55 pages
Chapter 6 Information Management Basics
SAMBIT HALDER PGP 2018-20 Batch
No ratings yet
Spend Analytics
Document14 pages
Spend Analytics
ijji
No ratings yet
Library Management System
Document44 pages
Library Management System
mswarna
77% (13)
Internshala
Document11 pages
Internshala
Bhavesh
No ratings yet