16/03/16
Instructions to Candidate
1. All questions are compulsory.
2. Neat diagrams must be drawn wherever necessary.
3. Figures to the right indicate full marks.
Q.1: (CO1) Study the data distribution given in Table no. 1 and answer the Questions –
Value                                              1   2   3   4   5   6   7   8
No. of data points with that value (frequency)     1   0   0   3   4  10  12   8
Table no. 1
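As an illustration of how a frequency table such as Table no. 1 can be summarized, the sketch below expands the frequencies into raw observations and computes the mean, median and mode with Python's standard library. The choice of these three statistics and all variable names are assumptions made for illustration; they are not taken from the question.

```python
from statistics import mean, median, mode

# Values and frequencies copied from Table no. 1.
values = [1, 2, 3, 4, 5, 6, 7, 8]
frequencies = [1, 0, 0, 3, 4, 10, 12, 8]

# Expand the table into one entry per data point.
observations = [v for v, f in zip(values, frequencies) for _ in range(f)]

n = len(observations)               # total number of data points
dist_mean = mean(observations)      # arithmetic mean
dist_median = median(observations)  # middle value of the sorted data
dist_mode = mode(observations)      # most frequent value

print(f"n={n}, mean={dist_mean:.3f}, median={dist_median}, mode={dist_mode}")
```

Since the mean falls below the median and mode, the distribution has its longer tail on the low side.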
Q.2: (CO2) What are Type I and Type II errors in hypothesis testing? Explain with the help of
suitable examples. ………. 4 Marks
Q.3: a. (CO3) What are the advantages and disadvantages of using the L1 norm? ………. 1 Mark
Q.3: b. (CO3) Draw a typical Hessian matrix. Indicate how it is used in optimization. ………. 3 Marks
Q.4: c. (CO4) What is meant by the interpretation of beta coefficients? Explain with
examples. ………. 5 Marks
OR
Q.4: c. (CO4) What do you understand by logistic regression? What are dichotomous variables
in the context of logistic regression? ………. 5 Marks
Q.5: a. (CO5) How does the KNN algorithm make predictions on unseen data? ……….
5 Marks
Vishwakarma Institute of Technology Issue 01 : Rev No. 0 : Dt. 16/03/16
OR
Q.5: a. (CO5) Why do we measure the impurity of a resulting node in a decision tree? List the
different measures of impurity used in decision trees. ………. 5 Marks
Q.5: b. (CO5) Is feature scaling required for the KNN algorithm? Explain with proper
justification. ………. 4 Marks
OR
Q.5: b. (CO5) There are 4 coins A, B, C and D, of which 3 are of equal weight and one
is heavier. Find the heavier coin using a decision tree. ………. 4 Marks
Q.5: c. (CO5) Compare and contrast divisive and agglomerative clustering algorithms.
………. 4 Marks
Q.6: a. (CO6) The confusion matrix for a certain classification activity is as shown in Table no. 2
Table no. 2
Find the following classifier performance measures –
1. Accuracy
2. Precision
3. Recall
4. Specificity
5. F-Score
6. Error rate …………… 6 Marks
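The six measures listed above follow directly from the four cells of a binary confusion matrix. The sketch below uses the standard textbook definitions and takes F-Score to mean the F1 score, which is an assumption. The function name `classifier_metrics` and the counts passed to it are hypothetical and are NOT the values of Table no. 2.

```python
def classifier_metrics(tp, fp, fn, tn):
    """Compute common measures from binary confusion-matrix counts."""
    total = tp + fp + fn + tn
    precision = tp / (tp + fp)   # correct among predicted positives
    recall = tp / (tp + fn)      # correct among actual positives
    return {
        "accuracy": (tp + tn) / total,
        "precision": precision,
        "recall": recall,
        "specificity": tn / (tn + fp),  # correct among actual negatives
        "f_score": 2 * precision * recall / (precision + recall),  # F1
        "error_rate": (fp + fn) / total,  # equals 1 - accuracy
    }

# Hypothetical counts for illustration only (not from Table no. 2).
metrics = classifier_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

The sketch assumes every denominator is non-zero; a classifier that never predicts the positive class would need a guard on the precision and F-score terms.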
Q.6: b. (CO6) Explain the following methods used for training and testing –
1. Resubstitution
2. K-fold cross-validation
3. Bootstrapping …………. 6 Marks
OR
Q.6: (CO6) Using the Naïve Bayes Classifier approach based on the training data set given in
Table no. 3, predict Class = Buy Laptop: Yes or No for the feature set: {Income = Low; Student
= No; Credit Rating = Excellent} ………. 12 Marks