Professional Documents
Culture Documents
SEM:8 SEC: C
SUB: Big Data Analytics Name of the faculty: G. Nazia sulthana
Module- 3
Module-4
1. What are decision trees? Why are the decision trees the most popular
classification techniques?
2. What are Gini‟s coefficient and information gain?
3. What is Regression? Explain Scatter plots showing types of relationship among
two variables
4. What is a neural network? How does it work?
5. What makes a neural network versatile enough for supervised as well as non-
supervised learning tasks?
6. Explain the different steps for constructing the decision tree for the following
example.
7. Describe advantages and disadvantages of regression model.
8. Write the different steps involved in developing artificial neural networks.
9. Describe the advantages of using ANN.
10.For the following example describe the different steps of forming association
rules using Apriori algorithm.
14.What is splitting variable? Describe the criteria for choosing splitting
variable.
15.Create a decision tree for the following dataset
Then solve the following problem using the model
1. Define Text Mining and Explain the Text Mining Architecture with suitable
diagram.
2. Consider the following network . Compute the Rank values for the network and
which is the highest ranked node now?
Ra Rb Rc Rd
Ra 0 0.50 0 1.00
Rb 0.50 0 0 0
Rc 0.50 0.50 0 0
Rd 0 0 1.00 0
3. Explain SVM model with support vector machine classifiers with diagram.
4. Describe the difference between text mining and data mining.
5. Explain Naïve bayes model to classify the text data into right class using
following dataset.