You are on page 1of 4

Model Question Paper

Course: Data Mining for Intelligence

1. What is the purpose of linear models?


a. To predict future values
b. To classify data
c. To identify relationships between variables
d. To optimize parameters
2. What type of data is best suited for linear models?
a. Categorical data
b. Numerical data
c. Text data
d. Image data
3. What is the main advantage of linear models?
a. They are easy to interpret
b. They are computationally efficient
c. They can handle large datasets
d. They can handle non-linear data
4. What is the purpose of a hypothesis test in regression analysis?
a. To identify relationships between variables
b. To provide an estimate of the population parameter
c. To measure the strength of a relationship
d. To test the significance of a relationship
5. What is the relationship between two variables when one increases as the other also increases?
a. Positive correlation
b. Negative correlation
c. No correlation
d. Zero Relation
6. What is the range of the logistic regression model?
a. 0 to 1
b. . -1 to 1
c. 0 to infinity
d. -infinity to infinity
7. What is the most common type of sigmoid function?
a. Logistic
b. Hyperbolic
c. Exponential
d. Linear
8. What is the main assumption of logistic regression for multiclass classification?
a. The data is linearly separable
b. The data is normally distributed
c. The data is linearly related
d. The data is independent
9. What is the most important assumption of classification model building?
a. The data is normally distributed
b. The data is linearly separable
c. The data is labelled
d. The data is balanced
10. What is the second most important assumption of classification model building?
a. The data is normally distributed
b. The data is linearly separable
c. The data is labelled
d. The data is balanced
11. What is the third most important assumption of classification model building?
a. The data is normally distributed
b. The data is linearly separable
c. The data is labelled
d. The data is balanced
12. What is the fourth most important assumption of classification model building?
a. The data is normally distributed
b. The data is linearly separable
c. The data is labelled
d. The data is balanced
13. What are the assumptions of the Gini Index?
a. That all individuals are equal
b. That all individuals have the same income
c. That all individuals have the same wealth
d. That all individuals have the same access to resources
14. What is the name of the tree which can work on continuous data.
a. Classification Tree
b. Clustering Tree
c. Regression Tree
d. Node Tree
15. What is the main advantage of using Random Forests?
a. They are more accurate than other machine learning algorithms
b. They are faster to train than other machine learning algorithms
c. They are easier to interpret than other machine learning algorithms
d. They are more robust to outliers than other machine learning algorithms
16. Which of the following is a good technique to evaluate the performance of a machine learning
algorithm?
a. Sampling
b. Parameter tuning
c. Cross-validation
d. Stratification
17. Any dataset in Naïve Bayes is divided into two parts. Which pair is the correct one.
a. Target variable & Dependent Matrix
b. Response variable & Feature model
c. Response vector & Feature Matrix
d. Response data & Matrix Data
18. What is the fundamental assumption of Naïve Bayes.
a. The fundamental Naive Bayes assumption is that each feature makes an dependent and
unequal contribution to the outcome.
b. The fundamental Naive Bayes assumption is that each feature makes an dependent and
Equal contribution to the outcome.
c. The fundamental Naive Bayes assumption is that each feature makes an Independent
and unequal contribution to the outcome.
d. The fundamental Naive Bayes assumption is that each feature makes an Independent,
Equal contribution to the outcome.
19. What is Laplace estimation?
a. smoothing techniques
b. irregular techniques
c. Cyclic techniques
d. Regression techniques
20. What is Vcorpus?
a. Vector Corpus
b. Volatile Corpus
c. Voice Corpus
d. Volume Corpus
21. What is Pcorpus?
a. Permanent Corpus
b. Plus Corpus
c. Part Corpus
d. Predictive Corpus
22. For more than 30 observation factor analysis is same with what analysis.
a. PCA
b. MCA
c. TCA
d. EFA
23. What is Neural Network.
a. Neural networks are a set of mean data
b. Neural networks are a set of if rules
c. Neural networks are a set of algorithms
d. Neural networks are a set of only classification
24. What do we understand by activation function.
a. Activation functions are a multi lines of code that gives the neural nets non-linearity and
expressiveness
b. Activation function is linear regression that gives the neural nets non-linearity and
expressiveness
c. Activation functions are a single line of code that gives the neural nets non-linearity
and expressiveness
d. Activation functions are a single line of code for layers.
25. Which package we use for Neural Network in R
a. NN
b. nn
c. neuralnet
d. neural network
26. Why Learning modifies the agent's Decision.
a. Learning modifies the agent's decision mechanisms to improve performance
b. Learning modifies the agent's decision mechanisms to increase parameters
c. Learning modifies the agent's decision mechanisms to improve quality of model.
d. Learning modifies the agent's decision to merge two algorithm
27. What we call the set of tuples used for model construction.?
a. training set
b. Testing set
c. Data Frame
d. DF
28. What is Association Analysis?
a. A method of analyzing the performance of employees
b. A statistical technique used to identify relationships between variables
c. A way to determine the profitability of a company
d. A tool for measuring customer loyalty
29. What is PR curve?
a. It is the Line between precision and recall for various threshold values.
b. It is the curve between prediction and recall for various threshold values.
c. It is the curve between precision and ROC for various threshold values.
d. It is the curve between precision and recall for various threshold values.
30. What is the main cause of model overfitting?
a. Insufficient data
b. Too complex of a model
c. Too complex of a model
d. Too much data

You might also like