You are on page 1of 2

Name: Naqqash Haider

Sap-Id: 1980
Assignment#: 5

Dataset I used from : https://raw.githubusercontent.com/mk-


gurucharan/Classification/master/DMVWrittenTests.csv

Scenario:
 In this, we DMV Test dataset which has three columns. The first two columns
consist of the two DMV written tests (DMV_Test_1 and DMV_Test_2) which
are the independent variables and the last column consists of the dependent
variable, Results which denote that the driver has got the license (1) or not (0).
In this, we have to build a KNN Classification model using this data to predict if
a driver who has taken the two DMV written tests will get the license or not
using those marks obtained in their written tests and classify the results.

Euclidean Distance
(i) For k=3 (Using Euclidean Distance)
Accuracy : 0.92

array([[11, 0],
[ 2, 12]], So it is sweet point where model will perform well

(ii) For k=5 (Using Euclidean Distance)


Accuracy : 0.84

array([[10, 1],
[ 3, 11]],

Minkowski Distance
(i) For k=3
Accuracy : 0.92
So it is sweet point where model will perform well

array([[11, 0],
[ 2, 12]]

(ii) For k=5


Accuracy : 0.84

array([[10, 1],
[ 3, 11]],

Manhattan Distance

(i) For k=3


Accuracy : 0.92

array([[11, 0],
[ 2, 12]], So it is sweet point where model will perform well

(ii) For k=5


Accuracy : 0.84
array([[10, 1],
[ 3, 11]],

You might also like