You are on page 1of 5

Important Questions

Que 1: Define Machine Learning.


Que 2: Name some applications that require machine learning techniques.
Que 3: What are the different types of machine learning algorithms? Explain all in
detail.
Que 4: Explain machine learning with respect to designing a chess game by identifying
the tasks (T), experience (E), and performance measure (P).
Que 5: What is cluster analysis? Explain with suitable examples.
Que 6: Compute the determinant of the following matrix:

[ ]

Que 7: Find the inverse of the following matrix:

[ ]

Que 8: How is data integration accomplished? What are the problems associated with
data integration?
Que 9: What is the difference between covariance and correlation?
Que 10: How are mean deviation, standard deviation, and variance related to each
other?
Que 11: Explain the different types of binning methods used in data cleaning.
Que 12: What is PCA? When should you use the PCA? Compute PCA for following
data set:
X1 2 3 4 8 9 10 11
X2 5 8 2 24 3 72 45
X3 3 8 2 6 23 8 7
Que 13: Calculate the covariance matrix for following data:
Item No. 1 2 3 4
W 1 -1 4 2
X 2 2 1 3
Y 3 1 -1 5
Z 1 4 -4 3

Que 14: Why is normalization of variables necessary? Explain with proper examples.
Que 15: Do Haar wavelet transformation for following data set using Haar
transformation matrix:
Que 16. Write explanatory notes on following with proper examples:
1) Ensemble Models
2) Modelling Unbalanced Classes
3) Deep Learning
4) Reinforcement Learning
5) Feature Engineering
6) Exploratory Data Analysis
7) Machine Learning Work Flow
8) Error Measurement in Machine Learning
9) Bias vs. Variance in ML
10) Ensemble Techniques

Que 17. Compare Classification with regression with an example.


Que 18. Distinguish between overfitting and underfitting. How it can affect model
generalization?
Que 19. Explain the general MLE method for estimating the parameters of a
probability distribution.
Que 20. Compare Cross validation with Bootstrapping Techniques.
Que 21. Explain the different methods that can be used to handle the class imbalance
problem.
Que 22. Describe the significance of Kernal functions in SVM. List any two kernel
functions.
Que 23. Explain any two model combination scheme to improve the accuracy of a
classifier.
Que 24. Compare K means clustering with Hierarchical Clustering Techniques.
Que 25. Distinguish between supervised learning and Reinforcement learning.
Illustrate with an example.
Que 26. Discuss any four examples of machine learning applications.
Que 27. Point wise differentiate between Machine Learning, Artificial Intelligence and
Deep Learning.
Que 28. Explain the procedure for the computation of the principal components of the
data.
Que 29. Compare Feature Extraction and Feature Selection techniques. Explain how
dimensionality can be reduced using subset selection procedure.
Que 30. Explain the methods used to learn multiple classes for a K class Classification
Problem.
Que 31. Suppose 10000 patients get tested for flu; out of them, 9000 are actually
healthy and 1000 are actually sick. For the sick people, a test was positive for 620 and
negative for 380. For the healthy people, the same test was positive for 180 and
negative for 8820. Construct a confusion matrix for the data and compute the precision
and recall for the data.
Que 32. Consider the training data in the following table where Play is a class attribute.
In the table, the Humidity attribute has values “L” (for low) or “H” (for high), Sunny
has values “Y” (for yes) or “N” (for no), Wind has values “S” (for strong) or “W” (for
weak), and Play has values “Yes” or “No”. Construct Decision Tree.

Que 33. Given the set of values X = (3, 9, 11, 5, 2)T and Y = (1, 8, 11, 4, 3)T . Evaluate
the regression coefficients.
Que 34. Explain DBSCAN algorithm for density based clustering. List out its
advantages compared to K-means.
Que 35. Describe the significance of soft margin hyper plane and explain how they are
computed.
Que 36. Illustrate K means clustering algorithm with an example.
Que 37. State the mathematical formulation of the SVM problem. Give an outline of
the method for solving the problem.
Que 38. Define Artificial Neural Network and Mention what you can and cannot do
with an ANN.
Que 39. Differentiate between Supervised Learning, Unsupervised Learning and
Reinforcement Learning with proper examples.
Que 40. What do you understand by Data Preprocessing? Explain the following terms
with suitable examples:
a) Data Cleaning b) Data Integration c) Data Transformation d) Data Reduction
Que 41. What is the need for classification? Explain logistic regression with example
and also derive all the gradient descent equations for computing the parameters of logit
function.
Que 42. Explain the following terms with examples: a) Karl Pearson’s Coefficient of
Correlation b) Confusion Matrix c) R2
d) Standard error of estimates
Que 43. Construct decision tree for following data set:
Example Sky AirTemp Humidity Wind Water Forecast EnjoySport

1 Sunny Warm Normal Strong Warm Same Yes

2 Sunny Warm High Strong Warm Same Yes

3 Rainy Cold High Strong Warm Change No

4 Sunny Warm High Strong Cool Change Yes

Que 44. Differentiate between Training data and Testing Data


Que 45. What are the issues in Machine Learning
Que 46. Give decision trees to represent the following boolean functions:
a. A ˄˜B
b. A V [B ˄ C]
c. A XOR B
d. [A ˄ B] v [C ˄ D]
Que 47. Consider the following set of training examples:

Instance Classification a1 a2

1 + T T

2 + T T

3 - T F

4 + F F

5 - F T

6 - F T

a. What is the entropy of this collection of training examples with respect to the
target function classification?
b. What is the information gain of a2 relative to these training examples?
Que 48. What type of problems are best suited for decision tree learning
Que 49. What is Artificial Neural Network?
Que 50. What do you mean by Gradient Descent?
Que 51. Differentiate between Gradient Descent and Stochastic Gradient Descent
Que 52. Explain the k-Means Algorithm with an example.
Que 53. Discuss Maximum Likelihood and Least Square Error Hypothesis.
Que 54. Discuss the major drawbacks of K-nearest Neighbour learning Algorithm and
how it can be corrected
Que 55. Define the following terms

a. Sample error b. True error c. Random Variable

d. Expected value e. Variance f. standard Deviation

Que 56. Explain the Central Limit Theorem with an example.


Que 57. Create the relationship model for the data given in table below to find the
relationship between the strength of the mixture with quantity of material X, material Y
and Material Z and also predict the strength of the mixture for X=6, Y=2 and Z= 3
respectively.

Material Y Material Z Strength of the


Observations Material X (Kg)
(Kg) (Kg) Mixture (MPa)
1. 6 2 3 14
2. 6 2 3 15
3. 7 2 2 17
4. 7 1 2 19
5. 7 1 2 20
6. 7 1 1 21
Que 58. Compute the Principal Component of following data set:
Class 1: X=2, 1, 6, 5 and Y=2, 3, 4, 5
Class 2: X=3, 4, 7, 7 and Y=3, 9, 2, 8
Que 59. Explain the terms a) Recall b) Specificity c) Confusion
Matrix d) Accuracy
Que 60. Differentiate between boosting and bragging.

You might also like