Welcome to Scribd!

Skip carousel

BIT Pilani Data Science Exam Questions on Data Mining, Normalization, Classification

Uploaded by

VIVEK JOHARI

0% found this document useful (0 votes)

10 views3 pages

Original Title

Mid Semester Make-up Data Mining Second Semester 2019-2020

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

10 views3 pages

BIT Pilani Data Science Exam Questions on Data Mining, Normalization, Classification

Uploaded by

VIVEK JOHARI

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 3

Search inside document

Birla Institute of Technology & Science, Pilani

Work Integrated Learning Programmes Division

Second Semester 2019-2020
M.Tech (Data Science and Engineering)
Mid-Semester Exam (EC-2 Make-up)

Course No. : DSECLZC415

Course Title : Data Mining
Nature of Exam : Open Book
Weightage : 30% No. of Pages =3
Duration : 90 minutes No. of Questions = 4
Date of Exam : 4/07/2020 (AN), 2:00 pm to 3:30 pm
Note to Students:
1. Please follow all the Instructions to Candidates given on the cover page of the answer book.
2. All parts of a question should be answered consecutively. Each answer should start from a fresh page.
3. Assumptions made if any, should be stated clearly at the beginning of your answer.

Q.1. Answer the following:

a) Consider the following ordered list of observations of a variable: [2+1+2+2=7 marks]

25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 41, 42, 42, 99

1) What is the five-number summary for the given data?

2) Draw boxplot.
3) Considering univariate normal distribution, estimate the range of values that helps in
estimating whether the given value is an outlier or not.
4) Explain, how do the outliers affect the measures of central tendency (Mean, and Median) of
data? Comment using the given data set.

b) As many data mining algorithms cannot handle missing values, analyst sometimes remove all
observations (rows) that contain missing values before the analysis. Give two potential
disadvantages of this procedure. [2 marks]
c) Consider the following marks details of some students
Name Marks
Abishek 20
Ramesh 30
Vinod 55
Rahul 75
Anu 95
Kavita 40
Latha 45
Pravin 57
Sonu 32
Sneha 65

Use min-max normalization to transform the above marks onto the range [50, 70]. [3 marks]

Q.2. The following table shows the house area(in sq meters) and house rent(in thousand rupees)
obtained for a metropolitan city. [4+1+1=6 marks]

Area 172 150 181 174 194

Rent 42 35 46 40 50

a) Assuming linear relationship, Use the method of least squares to get an equation for the
prediction of rent based on the area.
b) Predict the rent of a house with 180 sq. meter area.
c) Do you notice any limitations with the solution?

Q.3. Answer the following: [1.5*4=6 marks]

Suppose you are a Data scientist. You are building a Classifier that can predict whether a person is
likely to default or Not based on certain parameters/attribute values. Assume, the class variable is
“Default” and has two outcomes, {“yes”, “no”}
• Own_House = Yes, No
• Marital Status = Single, Married, Divorced
• Annual Income = Low, Medium, High
• Currently Employed = Yes, No

Suppose a rule-based classifier produces the following rules:

1. Own_House = Yes → Default = Yes
2. Marital Status = Single → Default = Yes
3. Annual Income = Low → Default = Yes
4. Annual Income = High, Currently Employed = No → Default = Yes
5. Annual Income = Medium, Currently Employed = Yes → Default = No
6. Own_House = No, Marital Status = Married → Default = No
7. Own_House = No, Marital Status = Single → Default = Yes
Answer the following questions. Make sure to provide a brief explanation or examples to
illustrate the answer.
a) Are the rules mutually exclusive?
b) Is the rule set exhaustive?
c) Is ordering needed for this set of rules?
d) Do you need a default class for the rule set?

Q.4. Based on the information given in the table below, find the most similar and most dissimilar
persons among them. Apply min-max normalization on income to obtain [0,1] range. Consider
Profession and City as nominal, and Age as an ordinal variable with ranking order of [Youth,
Middle-Aged, Senior]. Give equal weight to each attribute.
[6 marks]

Name Profession Age City Income

Sam Doctor Middle Aged Mumbai 80000

John Engineer Youth Delhi 50000

Mary Politician Senior Bangalore 70000

Sonam Doctor Middle Aged Delhi 89000

Sujoy Carpenter Middle-aged Gurgaon 20000

Let's Practise: Maths Workbook Coursebook 6
From Everand
Let's Practise: Maths Workbook Coursebook 6
ExcelSoft Technologies Pvt. Ltd.
No ratings yet
BIT Pilani Data Science Exam Questions on Data Mining, Normalization, Classification
Document3 pages
BIT Pilani Data Science Exam Questions on Data Mining, Normalization, Classification
VIVEK JOHARI
No ratings yet
Let's Practise: Maths Workbook Coursebook 5
From Everand
Let's Practise: Maths Workbook Coursebook 5
ExcelSoft Technologies Pvt. Ltd.
No ratings yet
DM Makeup Key
Document6 pages
DM Makeup Key
amrutha
No ratings yet
Practise Mathematics Grade 7 Book 8
From Everand
Practise Mathematics Grade 7 Book 8
Esther Chen
Rating: 5 out of 5 stars
5/5 (1)
PS Fall2020M
Document11 pages
PS Fall2020M
Talha Mansoor
No ratings yet
Data Mining Comprehensive Exam - Regular PDF
Document3 pages
Data Mining Comprehensive Exam - Regular PDF
srirams007
No ratings yet
BITS WILP Data Mining Exam 2017
Document29 pages
BITS WILP Data Mining Exam 2017
John Wick
No ratings yet
8661 - Tests For AS A Level Year 1 Edexcel Stats Mechanics Challenge Set A v1.2
Document9 pages
8661 - Tests For AS A Level Year 1 Edexcel Stats Mechanics Challenge Set A v1.2
Cherie Chow
No ratings yet
A-Level Math Monthly Test Analyzes Measures of Central Tendency and Dispersion
Document3 pages
A-Level Math Monthly Test Analyzes Measures of Central Tendency and Dispersion
saad
No ratings yet
STA404 Exam Booklet - 20.03.2023
Document153 pages
STA404 Exam Booklet - 20.03.2023
Nur Daniel
No ratings yet
RONGCALES - Math11n - Lesson 3.2 Assessmet (1) - 1sem22-23
Document2 pages
RONGCALES - Math11n - Lesson 3.2 Assessmet (1) - 1sem22-23
Travelers Family
No ratings yet
Department of Computer Science and Engineering
Document3 pages
Department of Computer Science and Engineering
Md.Ashiqur Rahman
No ratings yet
MGCR 271 Assignment #4 Forecast
Document10 pages
MGCR 271 Assignment #4 Forecast
karim saf
No ratings yet
CDS300 Assignment S22 Ques Qonita Qotrunnada
Document9 pages
CDS300 Assignment S22 Ques Qonita Qotrunnada
Annisa Angelita putri
No ratings yet
Dec 2020 Data
Document3 pages
Dec 2020 Data
Vanshika Khetan
No ratings yet
Relational Model
Document4 pages
Relational Model
Prateek Goyal
No ratings yet
Summative Assessment B 2020S1 MEMO PDF
Document6 pages
Summative Assessment B 2020S1 MEMO PDF
Letang Modisha
0% (1)
Discussion 1: Exhibit 1-1
Document7 pages
Discussion 1: Exhibit 1-1
Mohammed Kashoob
No ratings yet
DBSA AK-Regular Mid-Sem Question Papers
Document3 pages
DBSA AK-Regular Mid-Sem Question Papers
pvaid2208
No ratings yet
Study - Material - 1st Year 1 Sem All Paper (2020-21) 1619326582759
Document33 pages
Study - Material - 1st Year 1 Sem All Paper (2020-21) 1619326582759
khuship200102
No ratings yet
Assignment Booklet PGDAST 2016
Document29 pages
Assignment Booklet PGDAST 2016
Vijay P Singh
No ratings yet
Data Mining Questions Q&A
Document11 pages
Data Mining Questions Q&A
aaakandoh
No ratings yet
DL - Assignment 2 Solution
Document7 pages
DL - Assignment 2 Solution
swathisreejith6
No ratings yet
Mock Final Examination Model Answer: Faculty of Computer Studies TM351 Data Management and Analysis
Document9 pages
Mock Final Examination Model Answer: Faculty of Computer Studies TM351 Data Management and Analysis
Christina Fington
No ratings yet
Mu 7th Semester Question Papers December 22
Document7 pages
Mu 7th Semester Question Papers December 22
Movies Duniya
No ratings yet
Time: 3 Hours Total Marks: 100: MBA (Sem I) Theory Examination 2018-19 Business Statistics
Document2 pages
Time: 3 Hours Total Marks: 100: MBA (Sem I) Theory Examination 2018-19 Business Statistics
Eshu Goyal
No ratings yet
MST-01 To MSTL-02 Assignments 2021
Document31 pages
MST-01 To MSTL-02 Assignments 2021
mukul
No ratings yet
Cuny Data Science Challenge
Document8 pages
Cuny Data Science Challenge
deejay217
No ratings yet
Physics Test 1 Retake T1 Grade 9
Document5 pages
Physics Test 1 Retake T1 Grade 9
aaishah.variawa
No ratings yet
Mba ZC417 Ec-3m First Sem 2018-2019
Document6 pages
Mba ZC417 Ec-3m First Sem 2018-2019
shiintu
No ratings yet
B.Sc. Degree Examination, 2011
Document2 pages
B.Sc. Degree Examination, 2011
NIKHIL SINGH
No ratings yet
Differences between supervised and unsupervised learning
Document21 pages
Differences between supervised and unsupervised learning
sahil kumar
No ratings yet
Sample Paper For Class 12 Boards Ip Exam
Document7 pages
Sample Paper For Class 12 Boards Ip Exam
zahir khan
No ratings yet
Assignments 1
Document11 pages
Assignments 1
kiwanuka45
100% (1)
Important Instructions To The Candidates:: Part B
Document7 pages
Important Instructions To The Candidates:: Part B
Nuwan Chathuranga
No ratings yet
Page 1 of 4
Document4 pages
Page 1 of 4
ABDUL WARIS AFTAB
No ratings yet
RFN 202 - Draft Exam
Document4 pages
RFN 202 - Draft Exam
Ishak Ishak
No ratings yet
CourseWork Assessments S1 2019-2020
Document6 pages
CourseWork Assessments S1 2019-2020
jemima
No ratings yet
ECF630-FINAL EXAMINATION - MAY 2021
Document12 pages
ECF630-FINAL EXAMINATION - MAY 2021
Kalimanshi Nsakaza
No ratings yet
S1 C2 Test
Document3 pages
S1 C2 Test
Ruth Bittan
No ratings yet
2023S - IP - Old - Questions (ITPEC Test)
Document37 pages
2023S - IP - Old - Questions (ITPEC Test)
barbarian 520
No ratings yet
DEEP LEARNING IIT Kharagpur Assignment_2_2024__updated
Document6 pages
DEEP LEARNING IIT Kharagpur Assignment_2_2024__updated
Mangaiyarkarasi K
No ratings yet
MTH 302 Long Questions Solved by Pisces Girl "My Lord! Increase Me in Knowledge."
Document20 pages
MTH 302 Long Questions Solved by Pisces Girl "My Lord! Increase Me in Knowledge."
Fun N
No ratings yet
NCERT Class 6-12 Notes, Solutions, Sample Papers and Presentation of Data
Document4 pages
NCERT Class 6-12 Notes, Solutions, Sample Papers and Presentation of Data
raghu8215
No ratings yet
Graded Assignment 1 - Questions - Student
Document4 pages
Graded Assignment 1 - Questions - Student
Dane Sinclair
No ratings yet
IP Question Paper 2020-2021
Document9 pages
IP Question Paper 2020-2021
blessy thomas
No ratings yet
FALLSEM2022-23 BCSE101E ELA VL2022230107118 Reference Material I 29-09-2022 Final PPS1 22 23 Python
Document9 pages
FALLSEM2022-23 BCSE101E ELA VL2022230107118 Reference Material I 29-09-2022 Final PPS1 22 23 Python
Abhishek Singh
No ratings yet
Class 11 Economics Answer Key
Document9 pages
Class 11 Economics Answer Key
u0911588
No ratings yet
Cs403 Collection of Old Papers
Document19 pages
Cs403 Collection of Old Papers
cs619finalproject.com
100% (1)
Data Structures - Assignment 6 IDC, Spring 2022
Document3 pages
Data Structures - Assignment 6 IDC, Spring 2022
eleanor
No ratings yet
Saurav Exam
Document9 pages
Saurav Exam
Shrijan Basnet
No ratings yet
Statistics in Education
Document7 pages
Statistics in Education
Sharan Hans
No ratings yet
Ignou PGDAST Assignment Booklet Jan-Dec 2020
Document30 pages
Ignou PGDAST Assignment Booklet Jan-Dec 2020
Aditya Ojha
No ratings yet
CBSE Sample Papers For Class 11 Economics Set 2 W 3
Document3 pages
CBSE Sample Papers For Class 11 Economics Set 2 W 3
xpert shiksha
No ratings yet
Midterm Fall2011
Document13 pages
Midterm Fall2011
Patrick Benson
No ratings yet
Final PF Question
Document4 pages
Final PF Question
Muhammad Hassaan
No ratings yet
Chapter 3 - Organisation of Data: Question 1 (I)
Document8 pages
Chapter 3 - Organisation of Data: Question 1 (I)
Vaibhav Gour
No ratings yet
ECE 3231 Algorithms and Programming Mid-Term Exam
Document6 pages
ECE 3231 Algorithms and Programming Mid-Term Exam
কফি ওয়ান টু
No ratings yet
Trial Maths M Term 2 (2014) Smart
Document6 pages
Trial Maths M Term 2 (2014) Smart
Irenaeus Martin
No ratings yet
Dvi - Assignment1 - PS9 - Air Quality Index - Analysis
Document4 pages
Dvi - Assignment1 - PS9 - Air Quality Index - Analysis
VIVEK JOHARI
No ratings yet
DSAD Makeup Question PDF
Document3 pages
DSAD Makeup Question PDF
Jitamitra Behura
No ratings yet
Dsecl Zg519-Ec3r-1 PDF
Document3 pages
Dsecl Zg519-Ec3r-1 PDF
srirams007
No ratings yet
Mid Semester Regular-DM
Document3 pages
Mid Semester Regular-DM
RaviShankar
No ratings yet
BST Dictionary Implementation
Document5 pages
BST Dictionary Implementation
VIVEK JOHARI
No ratings yet
Mid Semester Regular-DM
Document3 pages
Mid Semester Regular-DM
RaviShankar
No ratings yet
MID SEM Makeup QP Solutions
Document5 pages
MID SEM Makeup QP Solutions
VIVEK JOHARI
No ratings yet
Bluesound POWERNODE N330 Owners Manual
Document4 pages
Bluesound POWERNODE N330 Owners Manual
alelendo
No ratings yet
8700iLT 1.0 Operating Manual
Document375 pages
8700iLT 1.0 Operating Manual
Edinaldo Gomes
No ratings yet
Unit 5
Document13 pages
Unit 5
lokgenie
No ratings yet
Key PA-4000 Series Next-Generation Firewall Features
Document3 pages
Key PA-4000 Series Next-Generation Firewall Features
Abel De Los Santos
No ratings yet
Compass and Divider
Document27 pages
Compass and Divider
Glydell Valeriano
No ratings yet
IP Issues & Cyberspace
Document13 pages
IP Issues & Cyberspace
1234tempo
No ratings yet
AHC15A U Manual
Document4 pages
AHC15A U Manual
fabryce68
No ratings yet
SLSU Lucban Campus Class Schedules
Document4 pages
SLSU Lucban Campus Class Schedules
Christian Paul Casido
No ratings yet
Product Data Sheet NETx BMS Server
Document89 pages
Product Data Sheet NETx BMS Server
Xozan
No ratings yet
College Algebra 7th Edition Blitzer Solutions Manual
Document26 pages
College Algebra 7th Edition Blitzer Solutions Manual
JuliaCrawfordejpf
100% (58)
Checkpoint Transcender 156-21581 Practice Test 2023-Jul-07 by Henry 219q Vce
Document26 pages
Checkpoint Transcender 156-21581 Practice Test 2023-Jul-07 by Henry 219q Vce
Lin Rassel
No ratings yet
Nived A S R Sum
Document2 pages
Nived A S R Sum
rocky kr
No ratings yet
Soxeffect
Document53 pages
Soxeffect
bubba
No ratings yet
Java interview questions on interfaces, date, time, SQL, random numbers
Document11 pages
Java interview questions on interfaces, date, time, SQL, random numbers
Kavitha
No ratings yet
Information and Communication Technology: Pearson Edexcel International GCSE
Document24 pages
Information and Communication Technology: Pearson Edexcel International GCSE
Eric TTL
No ratings yet
SKR PRO V1.2 User Manual
Document18 pages
SKR PRO V1.2 User Manual
Jorge Gómez del Castillo
No ratings yet
Hcia Datacom
Document198 pages
Hcia Datacom
VLAD
100% (4)
002 PLAXIS 2D 2023.1 Tutorial Manual
Document280 pages
002 PLAXIS 2D 2023.1 Tutorial Manual
Vaibhav Phalak
No ratings yet
TCS NQT 2021 Ready Reckoner
Document30 pages
TCS NQT 2021 Ready Reckoner
Naman Bairagi
No ratings yet
Ambisea - Electrocardigrafo - 1Ch - 3Ch - Folha A4.odg
Document1 page
Ambisea - Electrocardigrafo - 1Ch - 3Ch - Folha A4.odg
Paulo Dores
No ratings yet
Booth Algo and IEEE 754 Code
Document7 pages
Booth Algo and IEEE 754 Code
Surjya Kanta prusty
No ratings yet
Iec 62443 3 2 2020
Document12 pages
Iec 62443 3 2 2020
steve
No ratings yet
Fxpro mt4 Backtesting - Guide
Document9 pages
Fxpro mt4 Backtesting - Guide
Nizar Ramadhan
No ratings yet
Digital Marketing
Document2 pages
Digital Marketing
Taskeen Zafar
No ratings yet
Alfa R36A Instructions
Document2 pages
Alfa R36A Instructions
Mo
No ratings yet
SN e Reference PDF
Document1,269 pages
SN e Reference PDF
PM
No ratings yet
Procom Interview Questions
Document10 pages
Procom Interview Questions
Muhammad Izaan Sohail
No ratings yet
Digital Crime Money Loundering Simulation Trainings
Document1 page
Digital Crime Money Loundering Simulation Trainings
marijalaza
No ratings yet
SRAN21A Software Upgrade MOP 1.01
Document17 pages
SRAN21A Software Upgrade MOP 1.01
MOHANKUMAR.A
No ratings yet
S6050 - en - 0719 - NORD Cloud Solution - Industry 4.0 Ready!
Document2 pages
S6050 - en - 0719 - NORD Cloud Solution - Industry 4.0 Ready!
Fabio Souza
No ratings yet