Welcome to Scribd!

Skip carousel

Data Mining Quiz 1 Clustering

Uploaded by

Shripad H

100% found this document useful (2 votes)

617 views4 pages

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

100% found this document useful (2 votes)

617 views4 pages

Data Mining Quiz 1 Clustering

Uploaded by

Shripad H

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 4

Search inside document

Data Mining Quiz 1 Clustering

Type : Graded Quiz Questions : 8 Time : 45m

Marks: 10
Q No: 1

Correct Answer
Marks: 1/1

Silhouette Score is calculated using the following formula:

Silhouettescore = (p−q)/max(p,q)

What does p & q represent?

p = mean distance to the points in the nearest cluster & q = mean intra-cluster distance to all the
points.
You Selected
p = mean distance to the points in the farthest cluster & q = mean intra-cluster distance to all the
points.

p = mean distance to the points in the nearest cluster & q = sum of the intra-cluster distance of all the
points.

p = mean distance to the points in the farthest cluster & q = sum of the intra-cluster distance of all
the points.
Q No: 2

Correct Answer
Marks: 1/1
At p=2, the Minkowski distance will resemble which type of distance measure?

Euclidean Distance
You Selected
Manhattan Distance

Chebyshev Distance

None of the mentioned

d(x,y)= (Summation( xi - yi)p )1/p

for p=2, d(x,y) becomes (Summation( xi - yi)2 )1/2

Q No: 3

Correct Answer
Marks: 1/1
Calculate Euclidean Distance for between below points:
p1= [2,3]
p2= [4,5]

2.626

3.100

2.423

2.828
You Selected

Euclidean Distance:

dist((x, y), (a, b)) = √(x - a)² + (y - b)²

(2,3)

(4,5)

Find difference 2-4= -2 and 3-5 =-2

Square and add the values 4 + 4 =8

Take the Square Root of the value √8 = 2 x √2 = 2 x 1.414 =2.828

Q No: 4

Correct Answer
Marks: 1/1

Calculate the Silhouette Score for below:

np.random.seed(7)
array=np.array(np.random.rand(20)).reshape(10,2)
for n_clusters=2

[hint: scale the array using standard scalar]

0.4164

0.5478

0.4069
You Selected
0.3209
Q No: 5

Correct Answer
Marks: 1/1
Calculate the Manhattan distance between Point P1(4,4) and P2(9,9)?

10
You Selected
(5,5)

None of the Mentioned

Manhattan Distance:

(4,4) (9,9)

d= |(x2-x1)|+|(y2-y1)|

d= |(9-4)|+|(9-4)| = 5+5=10

Q No: 6

Correct Answer
Marks: 1/1
Agglomerative clustering algorithm is generating 2 different dendrograms. What among the following
could be the possibilities for it to occur?

All of the mentioned.

You Selected
Due to the proximity function

Due to the data points used

Due to the variables used

Q No: 7

Correct Answer
Marks: 1/1
Agglomerative Clustering will start by considering all points as part of one big cluster

True

False
You Selected
Agglomerative Clustering starts by considering all points as individual clusters
Q No: 8

Correct Answer
Marks: 3/3

Use the dataset provided in the instructions.

The within-cluster sum of squared for 4 clusters is:

[Hint: Use KMeans Clustering and keep random_state=0]

1102.32

1694.33

1895.25
You Selected
2123.10

kmeans = KMeans(n_clusters=4,random_state=0)
km=kmeans.fit(dataset_scaled)
print('The within sum of squared for 4 clusters is',round(km.inertia_,2))

The within sum of squared for 4 clusters is 1895.25

Points
Document1 page
Points
Being Indian
0% (6)
ANOVA
Document1 page
ANOVA
Being Indian
33% (3)
Advanced Statistics Project
Document2 pages
Advanced Statistics Project
Being Indian
17% (6)
Project - Advanced Statistics - Final-1
Document15 pages
Project - Advanced Statistics - Final-1
hemantaddal
100% (3)
Quiz 3 LDA Predictive Modeling Great Learning
Document7 pages
Quiz 3 LDA Predictive Modeling Great Learning
sonali Pradhan
100% (4)
Data Mining Quiz 2
Document8 pages
Data Mining Quiz 2
Shripad H
100% (2)
Weekly Quiz 1 (TSF) - Time Series Forecasting - Great Learning PDF
Document4 pages
Weekly Quiz 1 (TSF) - Time Series Forecasting - Great Learning PDF
Ankit
100% (1)
This Study Resource Was: Quiz 3
Document5 pages
This Study Resource Was: Quiz 3
Nagarajan Thandayutham
100% (1)
Project 2 SMDM
Document5 pages
Project 2 SMDM
shilpa
50% (2)
Project SMDM Kundan Sinha PDF
Document4 pages
Project SMDM Kundan Sinha PDF
Tanmay Iyer
0% (1)
Predictive Modeling PDF
Document49 pages
Predictive Modeling PDF
preeti
100% (2)
Ruhee Ansari - Advanced Statistic Project SCB
Document28 pages
Ruhee Ansari - Advanced Statistic Project SCB
Ruhee's Kitchen
100% (1)
Advanced Statistics - Project - 16052021
Document9 pages
Advanced Statistics - Project - 16052021
vansh gupta
No ratings yet
SMDM Assignment: Problem 1
Document16 pages
SMDM Assignment: Problem 1
manas vikram
0% (1)
PCA Project Advanced Statistics
Document24 pages
PCA Project Advanced Statistics
Ankit Sharma
67% (3)
Problem Statement 1
Document17 pages
Problem Statement 1
SHYAM VIVIN
100% (1)
Prob 3
Document2 pages
Prob 3
shilpa
No ratings yet
DM Gopala Satish Kumar Business Report G8 DSBA
Document26 pages
DM Gopala Satish Kumar Business Report G8 DSBA
Satish Kumar
100% (2)
SMDM Extended Project Report
Document9 pages
SMDM Extended Project Report
Leon D’Mello
No ratings yet
Business Report - Advanced Statistics - Great Learning
Document20 pages
Business Report - Advanced Statistics - Great Learning
Aditya Hajare
100% (1)
Advanced Statistics
Document16 pages
Advanced Statistics
diptidp
100% (1)
Advance Statistics - Buisness Report
Document26 pages
Advance Statistics - Buisness Report
Shefali Kaushik
100% (1)
Business Report: Pgpdsba Advanced Statistics Module Project
Document18 pages
Business Report: Pgpdsba Advanced Statistics Module Project
Prasad Mohan
100% (2)
Data Mining Graded Assignment: Problem 1: Clustering Analysis
Document39 pages
Data Mining Graded Assignment: Problem 1: Clustering Analysis
rakesh sandhyapogu
100% (3)
Statisitics Project 6
Document48 pages
Statisitics Project 6
AMAN PRAKASH
100% (2)
As Quiz 3 PCA Solution PDF
Document1 page
As Quiz 3 PCA Solution PDF
BhagyaSree J
100% (1)
Advanced Statistics - Graded Quiz 1 - Solution
Document4 pages
Advanced Statistics - Graded Quiz 1 - Solution
Punyaslok
No ratings yet
Advanced Statistics Jupyter File PDF
Document56 pages
Advanced Statistics Jupyter File PDF
Nagarajan Thandayutham
100% (2)
Which Year Has The Most Number of Records?: AS Quiz 2: Exploratory Data Analysis
Document5 pages
Which Year Has The Most Number of Records?: AS Quiz 2: Exploratory Data Analysis
BhagyaSree J
100% (2)
Project Report - Advanced - Stats - Final PDF
Document25 pages
Project Report - Advanced - Stats - Final PDF
Bibin Vadakkekara
No ratings yet
SMDM Project Gopala Satish Kumar Jupyter Notebook G8 DSBA
Document14 pages
SMDM Project Gopala Satish Kumar Jupyter Notebook G8 DSBA
Satish Kumar
100% (1)
Project Advance Stats - Abhishek
Document14 pages
Project Advance Stats - Abhishek
Abhishek Gautam
No ratings yet
Predictive Modelling Project Gloria Susan Raju 11 APR 2021 PDF
Document56 pages
Predictive Modelling Project Gloria Susan Raju 11 APR 2021 PDF
preeti
No ratings yet
QUIZ Week 2 CART Practice PDF
Document10 pages
QUIZ Week 2 CART Practice PDF
Nagarajan Thandayutham
No ratings yet
Data Mining - Project
Document11 pages
Data Mining - Project
ankitbhagat
100% (2)
Jupyter Notebook Project CART RF ANN
Document41 pages
Jupyter Notebook Project CART RF ANN
Nikita Chaturvedi
100% (1)
Predictive Modelling Project 1 PDF
Document38 pages
Predictive Modelling Project 1 PDF
preeti
50% (2)
PROJECT Advanced Statistics
Document58 pages
PROJECT Advanced Statistics
BhagyaSree J
No ratings yet
Dbms db03 2020 Assessment (Solved) : Find Study Resources
Document12 pages
Dbms db03 2020 Assessment (Solved) : Find Study Resources
Gupta Anacoolz
50% (2)
Business Analytics Report: Submitted To
Document32 pages
Business Analytics Report: Submitted To
SHYAM VIVIN
No ratings yet
Graded Project AS
Document14 pages
Graded Project AS
Jimmi Pranami
No ratings yet
Predictive Modeling
Document22 pages
Predictive Modeling
diptidp
100% (1)
Project 4 Data Mining Final v2
Document19 pages
Project 4 Data Mining Final v2
Tina
100% (1)
Advanced Statistics ANOVA PCA EDA Project Report 3 Great Lakes
Document28 pages
Advanced Statistics ANOVA PCA EDA Project Report 3 Great Lakes
Nikhil R K
No ratings yet
SMDM Project Report: Submitted By: Kratika Vijayvergiya
Document15 pages
SMDM Project Report: Submitted By: Kratika Vijayvergiya
Kratika Vijayvergiya
100% (1)
Weekly Quiz - 2 (TSF) - Time Series Forecasting - Great Learning PDF
Document4 pages
Weekly Quiz - 2 (TSF) - Time Series Forecasting - Great Learning PDF
Ankit
100% (2)
Assignment Report - Advanced Statistics
Document12 pages
Assignment Report - Advanced Statistics
Rahul
No ratings yet
Problem 1:: Readingcsv PD Read - Excel (Readingcsv) Readingcsv Head
Document18 pages
Problem 1:: Readingcsv PD Read - Excel (Readingcsv) Readingcsv Head
Pratigya pathak
No ratings yet
Data Mining Case Study PDF
Document21 pages
Data Mining Case Study PDF
Deepali Kumar
100% (1)
Predictive Modelling Sweta Kumari
Document35 pages
Predictive Modelling Sweta Kumari
sweta kumari
No ratings yet
Time Series Project
Document2 pages
Time Series Project
Tina
50% (4)
Answer
Document5 pages
Answer
sonali Pradhan
100% (3)
Predictive Modelling ALOK KUMAR
Document25 pages
Predictive Modelling ALOK KUMAR
Rv Group
No ratings yet
Vidhya - SMDM Project
Document21 pages
Vidhya - SMDM Project
vidyaramesh
50% (2)
Data Mining Project
Document20 pages
Data Mining Project
Bhuvanesh Singh
100% (2)
Cart-Rf-Ann: Prepared by Muralidharan N
Document33 pages
Cart-Rf-Ann: Prepared by Muralidharan N
rakesh sandhyapogu
50% (2)
Project Submission Predictive Modelling - Logistic Regression and LDA
Document29 pages
Project Submission Predictive Modelling - Logistic Regression and LDA
ankitbhagat
No ratings yet