Big Data and Analytics
Machine Learning Multiple Choice Questions - Free Practice Test
6042 Tests taken
Machine learning is a revolutionary technology that’s changing how businesses and industries
function across the globe in a good way. This Machine Learning quiz, is a free practice test that
is focused to help people wanting to start their career in the Machine learning industry. This
Machine Learning Bootcamp helps you assess how prepared are you for the certification exam.
Here, you get Machine Learning MCQs that test your knowledge on the technology. These
Machine Learning Questions are prepared by subject matter experts and are in line with the
questions you can come across in certification exam. Take this test today!
Improve Your Skills with this Course
Machine Learning
6729 Ratings
Exclusive 30% OFF
EXPLORE COURSE
Explanations
20 9 45
Questions Correct Answers % Correct Answers
1. Which of the following is a widely used and effective machine learning algorithm based
on the idea of bagging?
Decision Tree
Regression
Classification
Random Forest
Explanations:
The Radom Forest algorithm builds an ensemble of Decision Trees, mostly trained with the bagging
method.
2. To find the minimum or the maximum of a function, we set the gradient to zero because:
The value of the gradient at extrema of a function is always zero
Depends on the type of problem
Both A and B
None of the above
Explanations:
The gradient of a multivariable function at a maximum point will be the zero vector of the function,
which is the single greatest value that the function can achieve.
3. The most widely used metrics and tools to assess a classification model are:
Confusion matrix
Cost-sensitive accuracy
Area under the ROC curve
All of the above
Explanations:
The model performance assessment for classification algorithms encorporates all of the above
techniques.
4. Which of the following is a good test dataset characterstic?
Large enough to yield meaningful results
Is representative of the dataset as a whole
Both A and B
None of the above
Explanations:
A good test dataset has a good amount of sample population and equal ratios of class representation.
5. Which of the following is a disadvantage of decision trees?
Factor analysis
Decision trees are robust to outliers
Decision trees are prone to be overfit
None of the above
Explanations:
Allowing a decision tree to split to a granular degree makes decision trees prone to learning every point
extremely well to the point of perfect classification that is overfitting.
6. How do you handle missing or corrupted data in a dataset?
Drop missing rows or columns
Drop missing rows or columns
Replace missing values with mean/median/mode
Assign a unique category to missing values
All of the above
Explanations:
All of the above techniques are different ways of imputing the missing values.
7. What is the purpose of performing cross-validation?
To assess the predictive performance of the models
To judge how the trained model performs outside the sample on test data
Both A and B
Explanations:
Cross-validation is a model validation technique for assessing how the results of a statistical analysis
will generalize to an independent data set.
8. Why is second order differencing in time series needed?
To remove stationarity
To find the maxima or minima at the local point
Both A and B
None of the above
Explanations:
If the second-order difference is positive, the time series will curve upward and if it is negative, the time
series will curve downward at that time.
9. When performing regression or classification, which of the following is the correct way
to preprocess the data?
to preprocess the data?
Normalize the data -> PCA -> training
PCA -> normalize PCA output -> training
Normalize the data -> PCA -> normalize PCA output -> training
None of the above
Explanations:
You need to always normalize the data first. If not, PCA or other techniques that are used to reduce
dimensions will give different results.
10. Which of the folllowing is an example of feature extraction?
Constructing bag of words vector from an email
Applying PCA projects to a large high-dimensional data
Removing stopwords in a sentence
All of the above
Explanations:
All of the above techniques transform raw data into features which can be used as inputs to machine
learning algorithms.
11. What is pca.components_ in Sklearn?
Set of all eigen vectors for the projection space
Matrix of principal components
Result of the multiplication matrix
None of the above options
Explanations:
pca.components_ is the set of all eigen vectors for the projection space.
12. Which of the following is true about Naive Bayes ?
Assumes that all the features in a dataset are equally important
Assumes that all the features in a dataset are independent
Both A and B
None of the above options
Explanations:
Naive Bayes assumes that all the features in a data set are equally important and independent.
13. Which of the following statements about regularization is not correct?
Using too large a value of lambda can cause your hypothesis to underfit the data.
Using too large a value of lambda can cause your hypothesis to overfit the data.
Using a very large value of lambda cannot hurt the performance of your hypothesis.
None of the above
Explanations:
A large value results in a large regularization penalty and therefore, a strong preference for simpler
models, which can underfit the data.
14. How can you prevent a clustering algorithm from getting stuck in bad local optima?
Set the same seed value for each run
Use multiple radom initializations
Both A and B
None of the above
Explanations:
K-Means clustering algorithm has the drawback of converging at local minima which can be prevented
by using multiple radom initializations.
15. Which of the following techniques can be used for normalization in text mining?
Stemming
Lemmatization
Stop Word Removal
Both A and B
Explanations:
Lemmatization and stemming are the techniques of keyword normalization.
16. In which of the following cases will K-means clustering fail to give good results?
1)
Data points with outliers
2) Data points with different densities
3) Data points with
nonconvex shapes
1 and 2
2 and 3
1, 2, and 3
1 and 3
Explanations:
K-means clustering algorithm fails to give good results when the data contains outliers, the density
spread of data points across the data space is different, and the data points follow nonconvex shapes.
17. Which of the following is a reasonable way to select the number of principal
components "k"?
Choose k to be the smallest value so that at least 99% of the varinace is retained.
Choose k to be 99% of m (k = 0.99*m, rounded to the nearest integer).
Choose k to be the largest value so that 99% of the variance is retained.
Use the elbow method.
Explanations:
This will maintain the structure of the data and also reduce its dimension.
18. You run gradient descent for 15 iterations with a=0.3 and compute J(theta) after each
iteration. You find that the value of J(Theta) decreases quickly and then levels off. Based
on this, which of the following conclusions seems most plausible?
Rather than using the current value of a, use a larger value of a (say a=1.0)
Rather than using the current value of a, use a smaller value of a (say a=0.1)
a=0.3 is an effective choice of learning rate
None of the above
Explanations:
You need the gradient descent to quickly converge to the minimum. So the current setting of a seems to
be good.
19. What is a sentence parser typically used for?
It is used to parse sentences to check if they are utf-8 compliant.
It is used to parse sentences to derive their most likely syntax tree structures.
It is used to parse sentences to assign POS tags to all tokens.
It is used to check if sentences can be parsed into meaningful tokens.
Explanations:
Sentence parsers analyze a sentence and automatically build a syntax tree.
20. Suppose you have trained a logistic regression classifier and it outputs a new example
x with a prediction ho(x) = 0.2. This means
Our estimate for P(y=1 | x
Our estimate for P(y=0 | x
Our estimate for P(y=1 | x
Our estimate for P(y=0 | x
Improve Your Score with this Course
Machine Learning
6729 Ratings
Exclusive 30% OFF
EXPLORE COURSE
course page fpt faqs
What is a Machine Learning practice test?
The Machine Learning free practice test is a simulation of the actual Machine Learning certification exam.
The test consists of 20 multiple choice questions that are likely to be faced in the actual exam. The test is
helpful in understanding whether you have the skills that are required to become a Machine Learning
engineer.
Who can take up this Machine Learning quiz?
This Machine Learning online test can be taken by anyone who is preparing to pass the Machine Learning
certification exam.
What will I learn from this mock exam?
The Machine Learning practice exam is designed to test your knowledge of machine learning concepts
and techniques. There are 25 multiple choice questions in the test which are helpful in analyzing your
strong and weak areas in topics like supervised and unsupervised learning, mathematical and heuristic
aspects, hands-on modeling, and more.
What is included in this machine learning exam simulator?
There are 20 machine learning practice test questions in the test that need to be answered in 25 minutes.
You can even pause the test and resume it later.
Will the Machine Learning quiz questions help in clearing the actual certification exam?
Yes, this Machine Learning mock test gives a complete overview of what you will face in the actual
certification exam. While we do not guarantee excellent results in the exam, you will definitely be able to
pass the exam with much ease.
Are these the same questions I'll see on the real exam?
The questions asked in this test are much like the questions expected in the actual certification exam.
Will the Practice Tests be updated frequently?
Yes, the Machine Learning MCQs are periodically updated and all the latest information related to
machine learning is incorporated.
Can I retake the Machine Learning certification dumps?
Yes, the Machine Learning objective questions can be taken repeatedly if you find it suitable. However,
you should ensure that your preparation is adequate so as to get a better test experience.
What are the requirements to take the Machine Learning dumps?
This Machine Learning exercise comes with no prior conditions.
I didn’t do well on this practice test. What should I do now?
If you are dissatisfied with your performance, you can retake the Machine Learning exam dumps multiple
times. You can also get a better grasp of all the machine learning concepts by taking our Machine
Learning Certification Course and then attempt the practice test.
Recommended Resources
Introduction to Machine Learning: A Beginner's Guide
How to Develop a Machine Learning Career
Machine Learning Career Guide: A complete playbook to becoming a
Machine Learning Engineer
How to Build a Career in AI and Machine Learning
© 2009 -2021- Simplilearn Solutions
Disclaimer
PMP, PMI, PMBOK, CAPM, PgMP, PfMP, ACP, PBA, RMP, SP, and OPM3 are registered marks of the Project Management Institute, Inc.