Professional Documents
Culture Documents
www.universityacademy.in, info@universityacademy.in
AKTU EXAM 19-20
Machine Learning Solved MCQ
Highlighted Option is Correct Answer
Note: Attempt all questions. The question paper contains 70 MCQ type questions. Each
question carries equal marks. Select the answer and Jill the bubble corresponding to
that question in the attached OMRsheet.
my
knowledge through the use of i and iii
manual programs (D) ii and iii
e
(C) The selective acquisition of 4. How do you handle missing or
ad
knowledge through the use of corrupted data in a dataset?
computer programs Drop missing rows or columns
(D) The selective acquisition of
Ac (B) Replace missing values with
knowledge through the use of mean/median/mode
manual programs Assign a unique category to
y
examples.
"Type-2" errors? (B) FIND-S algorithm ignores
U) Typel is known as false negative examples.
positive and Type2 is known FIND-S algorithm finds the
as false negative. most specific hypothesis
(ii) Type 1 is known as false within H that is consistent
negative and Type2 is known with the positive training
as false positive. examples.
www.universityacademy.co.in Page 1 of 10
(D All of the above (C) High estimation bias
6. Regarding bias and variance, which (D) Nonc of the above
of the followingstatementsare true? 9. Adding more basis functions in a
(Here 'high' and 'low' are relative to linear model... (pick the most
the idea model.) probably option)
odels which overfit have a Decreases model bias
high bias. (B) Decreases estimation bias
Models which overfit have a (C) Decreases variance
low bias. (D) Doesn't affect bias and
my
(C) Models which underfit have a variance
high variance. 10. Which of the following will be true
e
(D) None of these about k in k-NN in terms of Bias?
7. Which of the following sentence is
ad
When you increase the k the
FALSE regarding regression? bias will be increases
(A) It relates inputs to outputs. Ac (B) When you decrease the k the
(B) It is used for prediction. bias will be increases
(C) It may be used for (C) Can't say
interpretation.
ty
www.universityacademy.co.in Page 2 of 10
Your analysis is based on features 15. Choose the False Statement.
like author name, number of articles Gradient of a continuous and
written by the same author on differentiable function
Analytics Vidhya in past and a few (A) is zero at a minimum
other features. Which of the (B) is non-zero at a maximumw
following evaluation metric would (C) is zero at a saddle point
you choose in that case? decreases as you get closer to
my
Mean Square Error the minimum
Accuracy 16. Computational complexity of
Fl Score
e
Gradient descent is,
Only 1
ad
(A) linear in D
(B) only 2 (B) linear in N
(C) only 3
(D) 1 and 3b
13. At a certain university, 4% of men
Ac (C)
(D)
polynomial in D
dependent on the number of
iterations
are over 6 feet tall and 1% of women Let's say, you are using activation
ty
17.
are over 6 feet tall. The total student ftmction X in hidden layers of neural
population is divided in the ratio 3:2 network. At a neuron for any given
si
those over six feet tall, what is the activation function could X
probability that the student is a represent?
iv
(B) 36 SIGMOID
(C) 3/11 (D) None of these
W) moo 18. Which of the following hyper
14. Macromutation operator is also parameter(s), when increased may
known as cause random forest to over fit the
Headed Chicken data?
(B) Headless chicken Number of Trees
(C) SPX operator Depth of Tree
BLX operator Learning Rate
www.universityacademy.co.in Page 3 of 10
(A) Only I technique that adjusts weights in the
(B) only 2 neural network by propagating
(C) 1 and 2 weight changes.
(D) 2 and 3 Forward from source to sink
19. Which of the following is a Backward from sink to source
disadvantage of decision trees? (C) Forward from source to hidden
(A) Factor analysis nodes
(B) Decision trees are robust to (D) Backward from sink to hidden
outliers nodes
my
(C) Pecision trees are prone to be 23. Which of the following neural
verfit networks uses supervised learning?
e
(D) one of the above (A) Multilayer pcrceptron
ad
20. To find the minimum or the (B) Self organizing feature map
maximum of a function, we set the (C) Hopfield network
gradient to zero because:
Ac Choose the correct answer:
The value of the gradient at (A) A only
extrema of a function is (B) B only
y
A and C only
problem 24. Which of the following sentences is
(C) Both A and B incorrect in reference to Information
s
www.universityacademy.co.in Page 4 of 10
Optimistic pruning 30. Which of the following is true about
(B) Post-pruning and Pre-pruning Naive Bayes?
(C) Cost complexity pruning and Assumes that all the features
time complexity pruning in a dataset are equally
(D) None of the options important
26. Which one of these is not a tree- (B) Assumes that all the features
based learner? in a dataset are independent
(A) CART Both A and B
(B) 11)3 (D) None of the above options
my
Bayesian classifier 31. The method in which the previously
(D) Random Forest calculated probabilities are revised
e
27. What is tree-based classifiers? with new probabilities is classified as
ad
(A) Classifiers which form a tree (A) updating theorem
with each attribute at one level Ac (B) revised theorem
(B) Classifiers which perform Jef Bayes theorem
series of condition checking (D) dependencytheorem
with one attribute at a time 32. Which of the following is a widely
ty
www.universityacademy.co.in Page 5 of 10
operator in GA? high bias.
(B) Number of offspring used for (iii) Models which underfit have a
the operator high variance.
(C) Both a and b (iv) Models which underfit have a
(D) None low ariance
35. Which of the following statements ) (i) and (ii)
my
(A)
lambda can cause your (D) None of these
38. What is the purpose of restricting
e
hypothesis to underfit the data.
(B) Using too large a value of hypothesis space in machine
ad
lambda can cause your learning?
hypothesis to overfit the data. Ac (A) can be easier to search
(C) Using a very large value of (B) May avoid overfit since they
lambda cannot hurt the are usually simpler (e.g. linear
performance of your or low order decision surface)
ty
36. You are given reviews of movies 39. Suppose, you got a situation where
marked as positive, negative, and you find that your linear regression
er
[190537] [RCS080J
www.universityacademy.co.in Page 6 of 10
(X). The output variable is Y. The (D) None of the above
equation is : Y=aX+b, where a is the 44. Unsupervised learning is
slope and b is the intercept. If we (A) learning without computers
change the input variable (X) by 1 problem based learning
unit, by how much output variable learning from environment
(N) will change? (D) learning from teachers
(A) 1 unit 45. In supervised learning
classes are not predefined
my
(B) By (A)
(C) By intercept, JBf classes are predefined
classes are not required
e
vcDf None (C)
41. You have generated data from a 3- (D) classification is not done
ad
degree polynomial with some noise. 46. Mutating a strain is:
What do you expect of the model that (A) Changing all the genes in the
was trained on this data using a 5-
degree polynomial as ftrnction class?
Ac (B)
strain.
Removing one gene in the
( Low bias, high variance strain.
ty
(D) High bias, low variance. (D) Removing the strain from the
42. Genetic Algorithm are a part of population.
er
(C) are adaptive heuristic search (B) Search the solution space
algorithm based on the using the previous generation
evolutionary ideas of natural as a starting point.
selection and genetics (C) Have no knowledge of what
J) All of the above strains are contained in the
43. What are the 2 types of learning next generation.
(A) Improvised and un-improvised (D) Use random numbers.
supervised and unsupervised 48. The three gene operators we have
www.universityacademy.co.in Page 7 of 10
one is
Crossover: Receiving the best
52. Among the following, which
(A)
not "hyperparameter"?
genes from both parents.
(A) learning rate CL
(B) Mutation: Changing one gene
so that the child is almost like (B) number of layers L in the
neural network
the parent.
activation values all]
(C) Mirror: Changing a string of
genes in the child so it is like a (D) size of the hidden layers n[l]
my
A and B only complex
computing more
49. If a population contains only one features of the input than the
earlier layers.
e
strain, you can introduce new strains
(ii) The earlier layers of a neural
ad
by:
network are typically
Using the Crossover operator. computing more complex
Injecting random strains into features of the input than the
(A)
Ac
the population. deeper layers.
Which of the following option
(B) Using the Mutation operator.
is correct?
B only
y
(C)
(i) is correct and (ii) is
B and C only
it
(D)
incorrect
50. The efficiency of a Genetic
(B) (i) is incorrect while (ii) is
Algorithm (how quickly it arrives at
s
correct
the best solution) is dependent upon:
er
my
55. Factor Analysis involves: regression problem
(A) dimensionality reduction (B (ii) is classification while (i) is
e
technique regression problem
ad
(B) finding correlation among both are classification problem
variables (D) both are regression problem
(C) capturing maximum variance
Ac
59. what does fitness function represent
in the data with minimum to describe optimization problem?
number of variables Objective function
(D) All the above (B) Scaling function
y
www.universityacademy.co.in Page 9 of 10
is/are ü-ue?
is fi-ue about mimic
62. which of the following (A) Genetic
algorithm
bagging and boosting? process from natural
selection
learning
(A) Both are ensemble Chromosomesplay vital
roles
(B)
techniques
output of in GA
(B) Both combine the
weak learners to make
Jef Both a and b can't be
(D) Chromosomes
consistent predictions
encoded
(C) Both can be used to solve individual is
67. characteristics of
classification as well as
represented by
my
regression problems
(D) All of the above ) Chromosomes
(B) Gray Code
e
63. what causes underfitting?
(C) Initial population
ad
(A) Less number of features in the
data (D) None of the above
(B) Less number of observations 68. what is the main concept
Ac of
in the data Evolutionary computation?
Both a and b ( Survival of the fittest
(D) None of the above (B) Survival of the weakest
ty
65. which of the following are main (D) None of the above
components of evolutionary 70. Which selection strategy is
Un
[190537] [RCS080]
[Page- 12]
www.universityacademy.co.in Page 10 of 10