Welcome to Scribd. Sign in or start your free trial to enjoy unlimited e-books, audiobooks & documents.Find out more
Download
Standard view
Full view
of .
Look up keyword
Like this
1Activity
0 of .
Results for:
No results containing your search query
P. 1
QB Students Dm

QB Students Dm

Ratings: (0)|Views: 176|Likes:
Published by vinsor1714
This Is the Final Year Question Bank
This Is the Final Year Question Bank

More info:

Published by: vinsor1714 on Dec 31, 2010
Copyright:Attribution Non-commercial

Availability:

Read on Scribd mobile: iPhone, iPad and Android.
download as DOC, PDF, TXT or read online from Scribd
See more
See less

12/31/2010

pdf

text

original

 
NMIT, BangaloreData Mining Question BankDept of ISE
NITTE MEENAKSHI INSTITUTE OF TECHNOLOGY
(AN AUTONOMOUS INSTITUTION)
(AFFILIATED TO VISVESVARAYA TECHNOLOGICAL UNIVERSITY, BELGAUM, APPROVED BYAICTE & GOVT.OF KARNATAKA)
DATA MINING (ISE751)
Sem: 7
th
Credits: 3Dept: ISE
UNIT-I
1.
What is Data Mining? Explain the process of Knowledge Discovery in Databases (KDD) with adiagram
2.
What are the different motivating challenges faced by Data Mining Algorithms? Explain each of them
3.
Explain the origins of Data Mining with diagram
4.
What is predictive modeling? Explain with example
5.
Discuss Association Analysis and Cluster Analysis with examples
6.
What are the different types of attributes? Explain with a table
7.
In case of record data, what is transaction / market based data, Data Matrix and Sparse Data Matrix?Explain with examples.
8.
In case of ordered data, Explain Sequential Data, Sequence Data, Time Series Data and Spatial Datawith examples
9.
What do you mean by Data Preprocessing? Explain Aggregation and Sampling in this respect
10.
Explain Dimensionality reduction in Data Preprocessing
11.
What are the different variations of Graph Data? Explain with diagrams
12.
What is Feature Subset Selection? What are the different approaches for doing this? Explain thearchitecture of Feature subset selection with a diagram
13.
In case of Feature Creation, Explain the following with examples:i)Feature Extractionii)Mapping Data to new space
1
 
NMIT, BangaloreData Mining Question BankDept of ISE
iii)Feature Construction
14.
What do you mean by Binarization? Explain the conversion of a Categorical Attribute to 3 binaryattributes? What is its drawback? How is it overcome?
15.
How is Discretization of Continuous Attributes done? In this regard, Explain unsupervised andsupervised Discretization.
16.
What is variable transformation? In this regard, explaini)Simple Functional Transformationii)Normalization/Standardization
17.
Explain the following terms:i)Outliers(ii) Precision(iii) Accuracy(iv) Bias
18.
Explain Data Mining Tasks in detail with examples
19.
Define and explain the terms:i)Attribute(ii) Measurement(iii) Data Set(iv) Sparsity
20.
What are Discrete and Continuous Attributes? Explain the term resolution.
21.
What is the curse of Dimensionality? Explain Data Quality issues related to applications
UNIT-II
1.
Give the formal definition of classification. What is classification model? Explain with diagram
2.
With a diagram, explain the general approach for building a classification model3.For the Nodes N1 & N2 given below, calculate the Gini Index, Entropy and Classification Error.Based on this, mention which node is suitable for splitting
 Node N1CountClass=00
2
 
NMIT, BangaloreData Mining Question BankDept of ISE
Class=16
4.
What is confusion matrix? Explain the confusion matrix for a 2-class problem with an example. Inthis regard, explain Accuracy and error rate of prediction with appropriate formula
5.
Write Hunt’s Algorithm. Explain it with an example
6.
Compare rule-based and class-ordering schemes with examples
7.
Explain different methods for expressing Attribute test conditions
8.
Explain in detail the characteristics of Decision Tree Induction.9.Write and explain the algorithm for Decision Tree Induction.10.What is Gain Ratio? Explain with formula.11.Calculate the Gini Index for Attributes A and B given below and specify which attribute is better for splitting.Where C0, C1 stand for Class 0 and Class 1 respectively.
12.
What is rule based classifier? Explain how it works with an example. In this regard, also defineaccuracy and coverage
13.
Consider a training set that contains 60 positive examples and 100 negative examples. Suppose tworules are given:R1: covers 50 positive examples and 5 negative examplesR2: covers 2 positive examples and no negative examplesFor the above two rules, calculate Laplace, accuracy, coverage and likelihood ratio.
3
 Node N2CountClass=01Class=15
A Node N1C0: 4C1: 3 Node N2C0: 2C1: 3
B
 Node N1C0: 1C1: 4 Node N2C0: 5C1: 2

You're Reading a Free Preview

Download
scribd
/*********** DO NOT ALTER ANYTHING BELOW THIS LINE ! ************/ var s_code=s.t();if(s_code)document.write(s_code)//-->