a. To predict future values
b. To classify data
c. To identify relationships between variables
d. To optimize parameters

2. What type of data is best suited for linear models?
a. Categorical data
b. Numerical data
c. Text data
d. Image data

3. What is the main advantage of linear models?
a. They are easy to interpret
b. They are computationally efficient
c. They can handle large datasets
d. They can handle non-linear data

4. What is the purpose of a hypothesis test in regression analysis?
a. To identify relationships between variables
b. To provide an estimate of the population parameter
c. To measure the strength of a relationship
d. To test the significance of a relationship

5. What is the relationship between two variables when one increases as the other also increases?
a. Positive correlation
b. Negative correlation
c. No correlation
d. Zero relation

6. What is the range of the logistic regression model's output?
a. 0 to 1
b. -1 to 1
c. 0 to infinity
d. -infinity to infinity

7. What is the most common type of sigmoid function?
a. Logistic
b. Hyperbolic
c. Exponential
d. Linear

8. What is the main assumption of logistic regression for multiclass classification?
a. The data is linearly separable
b. The data is normally distributed
c. The data is linearly related
d. The data is independent

9. What is the most important assumption of classification model building?
a. The data is normally distributed
b. The data is linearly separable
c. The data is labelled
d. The data is balanced

10. What is the second most important assumption of classification model building?
a. The data is normally distributed
b. The data is linearly separable
c. The data is labelled
d. The data is balanced

11. What is the third most important assumption of classification model building?
a. The data is normally distributed
b. The data is linearly separable
c. The data is labelled
d. The data is balanced

12. What is the fourth most important assumption of classification model building?
a. The data is normally distributed
b. The data is linearly separable
c. The data is labelled
d. The data is balanced

13. What are the assumptions of the Gini Index?
a. That all individuals are equal
b. That all individuals have the same income
c. That all individuals have the same wealth
d. That all individuals have the same access to resources

14. What is the name of the tree that can work on continuous data?
a. Classification Tree
b. Clustering Tree
c. Regression Tree
d. Node Tree

15. What is the main advantage of using Random Forests?
a. They are more accurate than other machine learning algorithms
b. They are faster to train than other machine learning algorithms
c. They are easier to interpret than other machine learning algorithms
d. They are more robust to outliers than other machine learning algorithms

16. Which of the following is a good technique to evaluate the performance of a machine learning algorithm?
a. Sampling
b. Parameter tuning
c. Cross-validation
d. Stratification

17. Any dataset in Naïve Bayes is divided into two parts. Which pair is correct?
a. Target variable & Dependent Matrix
b. Response variable & Feature model
c. Response vector & Feature Matrix
d. Response data & Matrix Data

18. What is the fundamental assumption of Naïve Bayes?
a. The fundamental Naïve Bayes assumption is that each feature makes a dependent and unequal contribution to the outcome.
b. The fundamental Naïve Bayes assumption is that each feature makes a dependent and equal contribution to the outcome.
c. The fundamental Naïve Bayes assumption is that each feature makes an independent and unequal contribution to the outcome.
d. The fundamental Naïve Bayes assumption is that each feature makes an independent and equal contribution to the outcome.

19. What is Laplace estimation?
a. A smoothing technique
b. An irregular technique
c. A cyclic technique
d. A regression technique

20. What is VCorpus?
a. Vector Corpus
b. Volatile Corpus
c. Voice Corpus
d. Volume Corpus

21. What is PCorpus?
a. Permanent Corpus
b. Plus Corpus
c. Part Corpus
d. Predictive Corpus

22. For more than 30 observations, factor analysis is the same as which analysis?
a. PCA
b. MCA
c. TCA
d. EFA

23. What is a neural network?
a. Neural networks are a set of mean data
b. Neural networks are a set of if rules
c. Neural networks are a set of algorithms
d. Neural networks are a set of classification methods only

24. What do we understand by an activation function?
a. Activation functions are multiple lines of code that give neural nets non-linearity and expressiveness
b. An activation function is a linear regression that gives neural nets non-linearity and expressiveness
c. Activation functions are a single line of code that gives neural nets non-linearity and expressiveness
d. Activation functions are a single line of code for layers

25. Which package do we use for neural networks in R?
a. NN
b. nn
c. neuralnet
d. neural network

26. Why does learning modify the agent's decision mechanisms?
a. Learning modifies the agent's decision mechanisms to improve performance
b. Learning modifies the agent's decision mechanisms to increase parameters
c. Learning modifies the agent's decision mechanisms to improve the quality of the model
d. Learning modifies the agent's decision mechanisms to merge two algorithms

27. What do we call the set of tuples used for model construction?
a. Training set
b. Testing set
c. Data frame
d. DF

28. What is Association Analysis?
a. A method of analyzing the performance of employees
b. A statistical technique used to identify relationships between variables
c. A way to determine the profitability of a company
d. A tool for measuring customer loyalty

29. What is a PR curve?
a. It is the line between precision and recall for various threshold values.
b. It is the curve between prediction and recall for various threshold values.
c. It is the curve between precision and ROC for various threshold values.
d. It is the curve between precision and recall for various threshold values.

30. What is the main cause of model overfitting?
a. Insufficient data
b. Too complex of a model
c. Too complex of a model
d. Too much data