Que 8: How is data integration accomplished? What are the problems associated with
data integration?
Que 9: What is the difference between covariance and correlation?
Que 10: How are mean deviation, standard deviation, and variance related to each
other?
Que 11: Explain the different types of binning methods used in data cleaning.
Que 12: What is PCA? When should you use PCA? Compute PCA for the following
data set:
X1 2 3 4 8 9 10 11
X2 5 8 2 24 3 72 45
X3 3 8 2 6 23 8 7
Que 13: Calculate the covariance matrix for the following data:
Item No. 1 2 3 4
W 1 -1 4 2
X 2 2 1 3
Y 3 1 -1 5
Z 1 4 -4 3
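As a way to check a hand calculation for Que 13, here is a minimal sketch in plain Python. It assumes that W, X, Y and Z are the variables and that the four items are the observations, and it uses the sample covariance (division by n - 1) by default; the function name `covariance_matrix` and the `ddof` parameter are illustrative choices, not part of the question. Pass `ddof=0` if the population covariance is intended.

```python
def covariance_matrix(variables, ddof=1):
    """Covariance matrix of several equal-length lists of observations.

    ddof=1 gives the sample covariance (divide by n - 1),
    ddof=0 gives the population covariance (divide by n).
    """
    n = len(variables[0])
    means = [sum(v) / n for v in variables]
    # Subtract each variable's mean from its observations.
    centered = [[x - m for x in v] for v, m in zip(variables, means)]
    return [
        [sum(a * b for a, b in zip(ci, cj)) / (n - ddof) for cj in centered]
        for ci in centered
    ]


# Data from Que 13 (assumed layout: one row per variable, one column per item).
data = {
    "W": [1, -1, 4, 2],
    "X": [2, 2, 1, 3],
    "Y": [3, 1, -1, 5],
    "Z": [1, 4, -4, 3],
}

cov = covariance_matrix(list(data.values()))
for name, row in zip(data, cov):
    print(name, [round(c, 4) for c in row])
```

The result is a symmetric 4 x 4 matrix whose diagonal holds the variances of W, X, Y and Z.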
Que 14: Why is normalization of variables necessary? Explain with proper examples.
Que 15: Perform the Haar wavelet transformation on the following data set using the Haar
transformation matrix:
Que 16. Write explanatory notes on the following with proper examples:
1) Ensemble Models
2) Modelling Unbalanced Classes
3) Deep Learning
4) Reinforcement Learning
5) Feature Engineering
6) Exploratory Data Analysis
7) Machine Learning Work Flow
8) Error Measurement in Machine Learning
9) Bias vs. Variance in ML
10) Ensemble Techniques
Que 33. Given the set of values X = (3, 9, 11, 5, 2)^T and Y = (1, 8, 11, 4, 3)^T, evaluate
the regression coefficients.
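One possible way to verify an answer to Que 33 is the closed-form least-squares fit sketched below. It assumes a simple linear regression of Y on X of the form Y = b0 + b1 X; the question does not say which variable is the response, so that direction is an assumption, and the function name `simple_linear_regression` is illustrative.

```python
def simple_linear_regression(xs, ys):
    """Least-squares intercept b0 and slope b1 for y = b0 + b1 * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and sum of squared deviations of x.
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    slope = s_xy / s_xx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Values from Que 33 (assumed: X is the predictor, Y is the response).
X = [3, 9, 11, 5, 2]
Y = [1, 8, 11, 4, 3]

b0, b1 = simple_linear_regression(X, Y)
print(f"b0 = {b0:.4f}, b1 = {b1:.4f}")
```

With these values both the cross-deviation sum and the squared-deviation sum of X equal 60, so the fitted line is approximately Y = -0.6 + 1.0 X.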
Que 34. Explain DBSCAN algorithm for density based clustering. List out its
advantages compared to K-means.
Que 35. Describe the significance of soft-margin hyperplanes and explain how they are
computed.
Que 36. Illustrate the K-means clustering algorithm with an example.
Que 37. State the mathematical formulation of the SVM problem. Give an outline of
the method for solving the problem.
Que 38. Define Artificial Neural Network and mention what you can and cannot do
with an ANN.
Que 39. Differentiate between Supervised Learning, Unsupervised Learning and
Reinforcement Learning with proper examples.
Que 40. What do you understand by Data Preprocessing? Explain the following terms
with suitable examples:
a) Data Cleaning b) Data Integration c) Data Transformation d) Data Reduction
Que 41. What is the need for classification? Explain logistic regression with an example
and derive all the gradient descent equations for computing the parameters of the logit
function.
Que 42. Explain the following terms with examples: a) Karl Pearson’s Coefficient of
Correlation b) Confusion Matrix c) R^2 d) Standard error of the estimate
Que 43. Construct a decision tree for the following data set:
Example Sky AirTemp Humidity Wind Water Forecast EnjoySport
Instance  Classification  a1  a2
1         +               T   T
2         +               T   T
3         -               T   F
4         +               F   F
5         -               F   T
6         -               F   T
a. What is the entropy of this collection of training examples with respect to the
target function classification?
b. What is the information gain of a2 relative to these training examples?
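Parts a and b can be checked numerically with the short sketch below. It assumes the usual definitions: entropy measured in bits (base-2 logarithm) and information gain as the entropy of the whole collection minus the weighted entropy of the subsets produced by splitting on an attribute. The function names `entropy` and `information_gain` are illustrative, and the data is copied from the six-instance table above.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())


def information_gain(labels, attribute_values):
    """Entropy of labels minus the weighted entropy after splitting on an attribute."""
    total = len(labels)
    remainder = 0.0
    for value in set(attribute_values):
        subset = [l for l, a in zip(labels, attribute_values) if a == value]
        remainder += len(subset) / total * entropy(subset)
    return entropy(labels) - remainder


# Six training instances from the table (classification, a1, a2).
classification = ["+", "+", "-", "+", "-", "-"]
a1 = ["T", "T", "T", "F", "F", "F"]
a2 = ["T", "T", "F", "F", "T", "T"]

print("Entropy of the collection:", round(entropy(classification), 4))
print("Gain(a1):", round(information_gain(classification, a1), 4))
print("Gain(a2):", round(information_gain(classification, a2), 4))
```

Because the collection holds three positive and three negative examples, its entropy is 1 bit. Splitting on a2 leaves both subsets evenly mixed, so its gain is 0, while a1 gives a small positive gain of roughly 0.082.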
Que 48. What types of problems are best suited for decision tree learning?
Que 49. What is Artificial Neural Network?
Que 50. What do you mean by Gradient Descent?
Que 51. Differentiate between Gradient Descent and Stochastic Gradient Descent.
Que 52. Explain the k-Means Algorithm with an example.
Que 53. Discuss Maximum Likelihood and Least Square Error Hypothesis.
Que 54. Discuss the major drawbacks of the K-nearest Neighbour learning algorithm and
how they can be corrected.
Que 55. Define the following terms