MCQ On Data Mining With Answers Set-1

Chapter (1) On DATA MINING
MCQ on Data Mining with Answers set-1
1. ...................... is an essential process where intelligent methods are applied to extract data patterns.
A) Data Warehousing
B) Data Mining
C) Data Base
D) Data Structure
2. Data mining can also be applied to other forms of data such as ................
i) Data streams
ii) Sequence data
iii) Networked data
iv) Text data
v) Spatial data
A) i, ii, iii and v only
B) ii, iii, iv and v only
C) i, iii, iv and v only
D) All i, ii, iii, iv and v

3. Which of the following is not a data mining functionality?
A) Characterization and Discrimination (classification)
B) Classification and regression
C) Selection and interpretation
D) Clustering and Analysis
4. ............................. is a summarization of the general characteristics or features of a target class of data.
A) Data Characterization
B) Data Classification
C) Data discrimination
D) Data selection
5. ............................. is a comparison of the general features of the target class data objects against the general features of objects from one or multiple contrasting classes.
A) Data Characterization
B) Data Classification
C) Data discrimination
D) Data selection
6. Strategic value of data mining is ......................
A) cost-sensitive
B) work-sensitive
C) time-sensitive
D) technical-sensitive
7. ............................. is the process of finding a model that describes and distinguishes data classes or concepts.
A) Data Characterization
B) Data Classification
C) Data discrimination
D) Data selection
8. The various aspects of data mining methodologies is/are ...................
i) Mining various and new kinds of knowledge
ii) Mining knowledge in multidimensional space
iii) Pattern evaluation and pattern or constraint-guided mining.
iv) Handling uncertainty, noise, or incompleteness of data
A) i, ii and iv only
B) ii, iii and iv only
C) i, ii and iii only
D) All i, ii, iii and iv
9. The full form of KDD is ..................
A) Knowledge Database
B) Knowledge Discovery in Databases
C) Knowledge Data House
D) Knowledge Data Definition
10. The output of KDD is .............
A) Data
B) Information
C) Query
D) Useful information
11) The _______ is a symbolic representation of facts or ideas from which information can potentially be extracted.
A. knowledge.
*B. data.
C. algorithm.
D. program.
12) The _________ refers to extracting knowledge from large amounts of data.
A. data abstraction.
B. data warehouse.
C. database.
*D. data mining.
13) Knowledge discovery in databases refers to _____.
*A. the whole process of extracting knowledge from data.
B. selection of data.
C. coding.
D. cleaning the data.
14) Data mining is used to refer to the ______ stage in knowledge discovery in databases.
A. selection.
B. retrieving.
*C. discovery.
D. coding.
15) __________________ refers to the process of deriving high-quality information from text.
*A. Text Mining.
B. Image Mining.
C. Database Mining.
D. Multimedia Mining.
16) _____________ is the application of data mining techniques to discover
patterns from the Web.
A. Text Mining.
B. Multimedia Mining.
*C. Web Mining.
D. Link Mining.
17) A collection of interesting and useful patterns in a database is called _______.
*A. knowledge.
B. information.
C. data.
D. algorithm
18) The first step in data mining project is ________.
19) ________ analysis divides data into groups that are meaningful, useful, or both.
*A. Cluster.
B. Association.
C. Classification.
D. Relation.
20) Strategic value of data mining is ___________.
A. cost-sensitive.
B. work-sensitive.
*C. time-sensitive.
D. technical-sensitive
21) Which of the following is a data mining tool?
A. Borland C.
*B. Weka.
C. Borland C++.
D. Visual C.
MCQ on Data Mining with Answers Set-2
1. It is the main technique employed for data selection.
a. Noise b. Sampling c. Clustering d. Histogram
2. Data mining is ------b--------
a) an extraction of explicit, known and potentially useful knowledge from information.
b) a non-trivial extraction of implicit, previously unknown and potentially useful information from data.
c) an essential process where intelligent methods are applied to extract data patterns, also referred to as a database.
d) an essential process where intelligent methods are applied to extract data, also referred to as data sets.
3. Two fundamental goals of Data Mining are ____c ____.
a) Analysis and Description
b) Data cleaning and organizing the data
c) Prediction and Description
d) Data cleaning and organizing the data
4. A transactional database consists of a file where each ------a--------- represents a transaction.
a. record b. record graph c. ordered record d. attribute
5. Cluster is------------------b-----------------------------
a) Group on a training data set to transform or simplify data in order to prepare it
for a machine-learning algorithm
b) Group of similar objects that differ significantly from other objects.
c) Symbolic representation of facts or ideas from which information can potentially
be extracted
d) Both a and b
6. ------d------ reduces the amount of time and memory required by data mining algorithms.
a) Target data b) Data sampling c) Data aggregation d) Data reduction
7. Data .........d........... is the process of finding a model that describes and distinguishes data classes or concepts.
a) Characterization b) Mining c) Clustering d) Classification
//////////////////////////////////////////////////////////////////////////////////
Complete the Following with Answers Set-3
//////////////////////////////////////////////////////////////////////////////////
Q(A) What are the steps in the data mining process? Draw a figure.
1- Selection (data -> target data)
2- Preprocessing (target data -> preprocessed data)
3- Transformation (preprocessed data -> transformed data)
4- Data mining (transformed data -> patterns)
5- Interpretation / evaluation (patterns -> knowledge)
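The five steps can be sketched as a chain of small functions. This is only an illustrative sketch: the records, the attribute names, the age bands and the `min_count` threshold are all invented for the example, and the "mining" step here is a simple frequency count standing in for a real algorithm.

```python
from collections import Counter

# Hypothetical raw records: (customer_id, age, city); None marks a missing value.
raw_data = [
    (1, 25, "Cairo"),
    (2, None, "Giza"),
    (3, 41, "Cairo"),
    (4, 38, "Cairo"),
    (5, 19, "Giza"),
]

def select(records):
    """Selection: keep only the attributes relevant to the task (age, city)."""
    return [(age, city) for _, age, city in records]

def preprocess(records):
    """Preprocessing: drop records that contain missing values."""
    return [r for r in records if None not in r]

def transform(records):
    """Transformation: discretize age into two coarse bands."""
    return [("young" if age < 30 else "adult", city) for age, city in records]

def mine(records):
    """Data mining: count how often each (age band, city) pair occurs."""
    return Counter(records)

def evaluate(patterns, min_count=2):
    """Interpretation/evaluation: keep patterns seen at least min_count times."""
    return {p: c for p, c in patterns.items() if c >= min_count}

knowledge = evaluate(mine(transform(preprocess(select(raw_data)))))
print(knowledge)
```

Each function consumes the output of the previous step, which mirrors the arrows in the figure.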
Q(B) What are the Prediction and Description methods in Data Mining tasks?
I) Prediction Methods: Use some variables to predict unknown or future values of other variables. Such as:
► Classification [Predictive]
► Regression [Predictive]
► Deviation Detection [Predictive]
II) Description Methods: Find human-interpretable patterns that describe the data. Such as:
► Clustering [Descriptive]
► Association Rule Discovery [Descriptive]
► Sequential Pattern Discovery [Descriptive]
Q ( C ) Given a set of data points, each having a set of attributes, and a similarity
measure among them, find clusters such that:
► Data points in one cluster are more similar to one another.
► Data points in separate clusters are less similar to one another.
This definition represents ----------------------
a) Cluster
b) Classification
c) Regression
d) Clustering
Q(D) Intracluster distances are ------------------------, while intercluster distances are -------------------------
a) Equalized
b) Maximized
c) Minimized
d) Minimaxized
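A good clustering keeps points within a cluster close together and keeps different clusters far apart. The sketch below measures both quantities for two toy clusters; the points and the function names are assumptions made for illustration, and the mean pairwise Euclidean distance is just one of several possible ways to summarize each quantity.

```python
from itertools import combinations, product
from math import dist  # Euclidean distance, available from Python 3.8

# Two hypothetical clusters of 2-D points.
cluster_a = [(1.0, 1.0), (1.0, 2.0), (2.0, 1.0)]
cluster_b = [(8.0, 8.0), (9.0, 8.0), (8.0, 9.0)]

def mean_intracluster_distance(cluster):
    """Average distance between every pair of points inside one cluster."""
    pairs = list(combinations(cluster, 2))
    return sum(dist(p, q) for p, q in pairs) / len(pairs)

def mean_intercluster_distance(c1, c2):
    """Average distance between every point of c1 and every point of c2."""
    pairs = list(product(c1, c2))
    return sum(dist(p, q) for p, q in pairs) / len(pairs)

intra = mean_intracluster_distance(cluster_a)
inter = mean_intercluster_distance(cluster_a, cluster_b)
print(f"intracluster: {intra:.2f}, intercluster: {inter:.2f}")
```

For these toy points the intracluster value is much smaller than the intercluster value, which is what a well-separated clustering looks like.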
Q(E) The approach of ---------------------- is: identify frequently occurring terms in each document, form a similarity measure based on the frequencies of different terms, and use it to cluster.
a) Document segmentations
b) Document Classification
c) Document Identification
d) Document Clustering
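The approach described in this question can be sketched in a few lines: count term frequencies per document, compare documents with a frequency-based similarity, and group similar documents. Everything specific here is an assumption for illustration: the four toy documents, cosine similarity as the measure, the 0.3 threshold, and the greedy single-pass grouping, which is a simplification rather than a standard clustering algorithm.

```python
from collections import Counter
from math import sqrt

# Hypothetical toy corpus.
documents = {
    "d1": "data mining finds patterns in data",
    "d2": "mining patterns from large data sets",
    "d3": "the football match ended in a draw",
    "d4": "the football team won the match",
}

def term_frequencies(text):
    """Count how often each term occurs in a document."""
    return Counter(text.lower().split())

def cosine_similarity(tf_a, tf_b):
    """Similarity of two term-frequency vectors (1.0 means same direction)."""
    dot = sum(tf_a[t] * tf_b[t] for t in tf_a.keys() & tf_b.keys())
    norm_a = sqrt(sum(v * v for v in tf_a.values()))
    norm_b = sqrt(sum(v * v for v in tf_b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def cluster_documents(docs, threshold=0.3):
    """Greedy grouping: a document joins the first cluster whose first
    member is at least `threshold` similar, otherwise starts a new cluster."""
    tfs = {name: term_frequencies(text) for name, text in docs.items()}
    clusters = []
    for name in docs:
        for cluster in clusters:
            if cosine_similarity(tfs[name], tfs[cluster[0]]) >= threshold:
                cluster.append(name)
                break
        else:
            clusters.append([name])
    return clusters

print(cluster_documents(documents))
```

On this corpus the two data mining sentences end up in one group and the two football sentences in another.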
Q(F) Produce dependency rules which will predict the occurrence of an item based on the occurrences of other items.
a) Sequential Pattern Discovery
b) Association Rule Discovery
c) Both a and b
d) Otherwise
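A dependency rule such as {diapers} -> {beer} is usually judged by its support and confidence. The sketch below computes both on a small set of market-basket transactions; the transactions and the helper names are invented for illustration, and a real miner would also search for the candidate rules rather than evaluate one given by hand.

```python
# Hypothetical market-basket transactions.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

def count(itemset, transactions):
    """Number of transactions that contain every item of the itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions)

def support(itemset, transactions):
    """Fraction of transactions that contain the itemset."""
    return count(itemset, transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """Of the transactions containing the antecedent, the fraction that
    also contain the consequent. Returns 0.0 if the antecedent never occurs."""
    base = count(antecedent, transactions)
    if base == 0:
        return 0.0
    return count(set(antecedent) | set(consequent), transactions) / base

print(support({"diapers", "beer"}, transactions))
print(confidence({"diapers"}, {"beer"}, transactions))
```

Here diapers and beer appear together in 3 of 5 transactions, and 3 of the 4 transactions with diapers also contain beer.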
Q(F) What is the gain of Document Clustering?
Information retrieval can utilize the clusters to relate a new document or search term to clustered documents.
Q(E) Define Data Mining.
– Non-trivial extraction of implicit, previously unknown and potentially useful information from data
– Exploration & analysis, by automatic or semi-automatic means, of large quantities of data in order to discover meaningful patterns
Q(D) What are the Data Mining tasks (or Data Mining methods)?
1. Classification [Predictive]
2. Clustering [Descriptive]
3. Association Rule Discovery [Descriptive]
4. Sequential Pattern Discovery [Descriptive]
5. Regression [Predictive]
6. Deviation Detection [Predictive]
Q(E) Define Data Classification.
Classification is a data mining function that assigns items in a collection to target categories
or classes. The goal of classification is to accurately predict the target class for each case in
the data. For example, a classification model could be used to identify loan applicants as
low, medium, or high credit risks.
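As a rough illustration of the loan example, the sketch below assigns a new applicant the risk class of the most similar labeled applicant (a 1-nearest-neighbor rule). The two features (income and existing debt, in thousands), the training records and the function name are all hypothetical, and nearest neighbor is only one of many classification techniques.

```python
from math import dist  # Euclidean distance, available from Python 3.8

# Hypothetical labeled applicants: ((income, existing debt), risk class).
training = [
    ((90.0, 5.0), "low"),
    ((85.0, 10.0), "low"),
    ((50.0, 20.0), "medium"),
    ((45.0, 25.0), "medium"),
    ((20.0, 40.0), "high"),
    ((15.0, 35.0), "high"),
]

def classify(applicant, training_set):
    """Return the class label of the closest training record."""
    _, label = min(training_set, key=lambda item: dist(item[0], applicant))
    return label

print(classify((88.0, 7.0), training))
print(classify((18.0, 38.0), training))
```

The model here is simply the stored training set; the target class of each new case is predicted from the attributes of cases whose class is already known.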
Q(F) What are the similarity measures used in clustering?
• Euclidean Distance
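The Euclidean distance between two points is the square root of the sum of squared differences of their attributes. A minimal sketch follows; the function name is chosen here for illustration, and the standard library's `math.dist` computes the same quantity.

```python
from math import sqrt

def euclidean_distance(p, q):
    """Square root of the summed squared attribute differences of p and q."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of attributes")
    return sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

print(euclidean_distance((0.0, 0.0), (3.0, 4.0)))  # 3-4-5 right triangle
```

A smaller distance means the two data points are more similar.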
Q(H) What are the challenges of Data Mining?
1. Scalability
2. Dimensionality
3. Complex and Heterogeneous Data
4. Data Quality
5. Data Ownership and Distribution
6. Privacy Preservation
7. Streaming Data
Q(I) What are the applications of Data Mining?
1. Classification applications:
► Direct Marketing
► Fraud Detection
► Customer Attrition/Churn
► Sky Survey Cataloging
► Classifying Galaxies
2. Clustering applications:
► Market Segmentation
► Document Clustering
3. Association Rule Discovery:
► Marketing and Sales Promotion
► Supermarket Shelf Management
► Inventory Management
4. Sequential Pattern Discovery:
► Telecommunications alarm logs
► Point-of-sale transaction sequences
5. Deviation/Anomaly Detection:
► Network intrusion detection
► Fraud deviation detection
6. Regression:
• Predicting sales amounts of a new product based on advertising expenditure.
• Predicting wind velocities as a function of temperature, humidity, air pressure, etc.
• Time series prediction of stock market indices.
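The first regression example, predicting sales from advertising expenditure, can be sketched with a least-squares straight line. The figures below are made up so that the fit is easy to check by hand, and the function names are chosen for illustration; real data would not fall exactly on a line.

```python
# Hypothetical observations: advertising spend and the resulting sales.
advertising = [1.0, 2.0, 3.0, 4.0, 5.0]
sales = [3.0, 5.0, 7.0, 9.0, 11.0]

def fit_line(xs, ys):
    """Least-squares slope and intercept for y = slope * x + intercept."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def predict(x, slope, intercept):
    """Predicted value of the target variable for a new input x."""
    return slope * x + intercept

slope, intercept = fit_line(advertising, sales)
print(predict(6.0, slope, intercept))
```

Unlike classification, the predicted value is a continuous number rather than a class label.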
Q(2) Why may traditional techniques be unsuitable for extracting information?
1. Enormity of data
2. High dimensionality of data
3. Heterogeneous, distributed nature of data
Q(3) How may data mining help scientists?
– in classifying and segmenting data
– in hypothesis formation