
Chapter (1) On DATA MINING

MCQ on Data Mining with Answers set-1

1. ...................... is an essential process where intelligent methods are applied to extract data patterns.

A) Data Warehousing
B) Data Mining
C) Data Base
D) Data Structure

2. Data mining can also be applied to other forms of data such as ................

i) Data streams
ii) Sequence data
iii) Networked data
iv) Text data
v) Spatial data

A) i, ii, iii and v only

B) ii, iii, iv and v only

C) i, iii, iv and v only

D) All i, ii, iii, iv and v


3. Which of the following is not a data mining functionality?

A) Characterization and Discrimination (classification)

B) Classification and regression

C) Selection and interpretation

D) Clustering and Analysis

4. ............................. is a summarization of the general characteristics or features of a target class of data.

A) Data Characterization

B) Data Classification

C) Data discrimination

D) Data selection

5. ............................. is a comparison of the general features of the target class data objects against the general features of objects from one or multiple contrasting classes.

A) Data Characterization

B) Data Classification

C) Data discrimination

D) Data selection

6. Strategic value of data mining is ......................

A) cost-sensitive
B) work-sensitive
C) time-sensitive
D) technical-sensitive

7. ............................. is the process of finding a model that describes and
distinguishes data classes or concepts.

A) Data Characterization

B) Data Classification

C) Data discrimination

D) Data selection

8. The various aspects of data mining methodologies is/are ...................

i) Mining various and new kinds of knowledge

ii) Mining knowledge in multidimensional space

iii) Pattern evaluation and pattern or constraint-guided mining.

iv) Handling uncertainty, noise, or incompleteness of data

A) i, ii and iv only

B) ii, iii and iv only

C) i, ii and iii only

D) All i, ii, iii and iv

9. The full form of KDD is ..................

A) Knowledge Database

B) Knowledge Discovery in Databases

C) Knowledge Data House

D) Knowledge Data Definition

10. The output of KDD is .............

A) Data
B) Information
C) Query
D) Useful information

11) The _______ is a symbolic representation of facts or ideas from which information can potentially be extracted.
A. knowledge.
*B. data.
C. algorithm.
D. program.

12) The _________ refers to extracting knowledge from large amounts of data.
A. data abstraction.
B. data warehouse.
C. database.
*D. data mining.

13) Knowledge discovery in databases refers to _____.

*A. the whole process of extracting knowledge from data.
B. selection of data.
C. coding.
D. cleaning the data.

14) Data mining is used to refer to the ______ stage in knowledge discovery in databases.

A. selection.
B. retrieving.
*C. discovery.
D. coding.

15) __________________ refers to the process of deriving high-quality information from text.
*A. Text Mining.
B. Image Mining.
C. Database Mining.
D. Multimedia Mining.

16) _____________ is the application of data mining techniques to discover
patterns from the Web.
A. Text Mining.
B. Multimedia Mining.
*C. Web Mining.
D. Link Mining.

17) A collection of interesting and useful patterns in a database is called _______.

*A. knowledge.
B. information.
C. data.
D. algorithm

18) The first step in a data mining project is ________.

19) ________analysis divides data into groups that are meaningful, useful, or both.
*A. Cluster.
B. Association.
C. Classification.
D. Relation

20) Strategic value of data mining is ___________.

A. cost-sensitive.
B. work-sensitive.
*C. time-sensitive.
D. technical-sensitive

21) Which of the following is the data mining tool?


A. Borland C.
*B. Weka.
C. Borland C++.
D. Visual C.

MCQ on Data Mining with Answers Set-2

1. It is the main technique employed for data selection.

a. Noise
b. Sampling
c. Clustering
d. Histogram

2. Data mining is ------b--------


a) an extraction of explicit, known and potentially useful knowledge from
information.
b) a non-trivial extraction of implicit, previously unknown and potentially useful
information from data.
c) an essential process where intelligent methods are applied to extract data
patterns that is also referred to database.
d) is an essential process where intelligent methods are applied to extract data
that is also referred to data sets.
3. Two fundamental goals of Data Mining are ____c ____.
a) Analysis and Description
b) Data cleaning and organizing the data
c) Prediction and Description
d) Data cleaning and organizing the data

4. A transactional database consists of a file where each ------a--------- represents a transaction.
a. record
b. record graph
c. ordered record
d. attribute

5. Cluster is------------------b-----------------------------
a) Group on a training data set to transform or simplify data in order to prepare it
for a machine-learning algorithm
b) Group of similar objects that differ significantly from other objects.
c) Symbolic representation of facts or ideas from which information can potentially
be extracted
d) Both a and b

6. -----d--- reduces the amount of time and memory required by data mining algorithms.

a) Target data
b) Data sampling
c) Data aggregation
d) Data reduction

7. Data .........d........... is the process of finding a model that describes and
distinguishes data classes or concepts.
a) Characterization
b) Mining
c) Clustering
d) Classification

//////////////////////////////////////////////////////////////////////////////////

Complete the Following with Answers Set-3


//////////////////////////////////////////////////////////////////////////////////

Q(A) What are the steps in the data mining process? Draw a figure.

1- Selection of the target data
2- Preprocessing of the target data
3- Transformation of the data
4- Data mining (task methods)
5- Interpretation and evaluation of the patterns
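
The five steps can be traced on a toy data set. The following is only a minimal sketch: the records, the attribute names, and the "split on the mean" mining step are hypothetical and chosen for illustration, not taken from any particular tool.

```python
# Hypothetical raw records: (customer_id, age, monthly_spend); None marks a missing value.
raw = [
    (1, 25, 120.0),
    (2, 31, None),
    (3, 47, 300.0),
    (4, 52, 340.0),
    (5, 29, 150.0),
]

# 1. Selection: keep only the attributes relevant to the task.
target = [(age, spend) for _, age, spend in raw]

# 2. Preprocessing: drop records with missing values.
clean = [(age, spend) for age, spend in target if spend is not None]

# 3. Transformation: rescale each attribute to the range [0, 1].
def min_max(values):
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

ages = min_max([a for a, _ in clean])
spends = min_max([s for _, s in clean])
transformed = list(zip(ages, spends))

# 4. Data mining: a deliberately trivial method, splitting customers on the mean spend.
mean_spend = sum(spends) / len(spends)
groups = ["high" if s > mean_spend else "low" for s in spends]

# 5. Interpretation and evaluation: summarize the pattern that was found.
summary = {g: groups.count(g) for g in set(groups)}
print(summary)
```

In a real project each step is far more involved, but the order of the stages is the same.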

… …………………………

Q(B) What are Prediction and Description methods in Data Mining Tasks?

I) Prediction Methods: Use some variables to predict unknown or future values of other variables. Such as:
• Classification [Predictive]
• Regression [Predictive]
• Deviation Detection [Predictive]

II) Description Methods: Find human-interpretable patterns that describe the data. Such as:
• Clustering [Descriptive]
• Association Rule Discovery [Descriptive]
• Sequential Pattern Discovery [Descriptive]

Q ( C ) Given a set of data points, each having a set of attributes, and a similarity

measure among them, find clusters such that:

► Data points in one cluster are more similar to one another.

► Data points in separate clusters are less similar to one another.

This definition represents ----------------------

a) Cluster
b) Classification
c) Regression
d) Clustering

Q ( D) Intracluster distances are ------------------------, while intercluster distances are -------------------------

a) Equalized
b) Maximized
c) Minimized
d) Minimaxized
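
To make the two quantities in this question concrete, the sketch below computes the mean intracluster and intercluster distance for two toy clusters. The points are hypothetical, and the mean pairwise Euclidean distance is only one of several possible ways to summarize these distances.

```python
import math

def mean_pairwise_distance(points_a, points_b=None):
    """Mean Euclidean distance over point pairs.

    With one argument: pairs inside a single cluster (intracluster).
    With two arguments: pairs across two clusters (intercluster).
    """
    if points_b is None:
        pairs = [(p, q) for i, p in enumerate(points_a) for q in points_a[i + 1:]]
    else:
        pairs = [(p, q) for p in points_a for q in points_b]
    return sum(math.dist(p, q) for p, q in pairs) / len(pairs)

# Hypothetical, well-separated clusters in two dimensions.
cluster_1 = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.5)]
cluster_2 = [(8.0, 8.0), (8.5, 9.0), (9.0, 8.5)]

intra_1 = mean_pairwise_distance(cluster_1)
intra_2 = mean_pairwise_distance(cluster_2)
inter = mean_pairwise_distance(cluster_1, cluster_2)

print(f"intracluster: {intra_1:.3f}, {intra_2:.3f}  intercluster: {inter:.3f}")
```

For a grouping like this one, the distances inside each cluster come out much smaller than the distance between the clusters.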

Q( E) The approach of ---------------------- is to identify frequently occurring terms in each document, form a similarity measure based on the frequencies of different terms, and use it to cluster.

a) Document segmentations
b) Document Classification
c) Document Identification
d) Document Clustering
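
The approach described in this question can be sketched in a few lines. The example below is an illustration under stated assumptions: the three documents are made up, whitespace splitting stands in for real tokenization, and cosine similarity is one common choice of frequency-based similarity measure, not necessarily the one the course material intends.

```python
import math
from collections import Counter

def term_frequencies(text):
    """Count how often each (lower-cased, whitespace-separated) term occurs."""
    return Counter(text.lower().split())

def cosine_similarity(tf_a, tf_b):
    """Cosine of the angle between two term-frequency vectors."""
    dot = sum(tf_a[t] * tf_b[t] for t in tf_a.keys() & tf_b.keys())
    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

# Hypothetical documents: two about finance, one about sport.
docs = {
    "d1": "stock market prices fall as market fears grow",
    "d2": "market prices rise and stock traders cheer",
    "d3": "team wins football match after late goal",
}
tfs = {name: term_frequencies(text) for name, text in docs.items()}

sim_12 = cosine_similarity(tfs["d1"], tfs["d2"])
sim_13 = cosine_similarity(tfs["d1"], tfs["d3"])
print(f"d1 vs d2: {sim_12:.3f}   d1 vs d3: {sim_13:.3f}")
```

A clustering algorithm would then place d1 and d2 in the same group, because they share frequent terms, and d3 in a different one.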

Q(F) Produce dependency rules which will predict occurrence of an item based
on occurrences of other items.

a) Sequential Pattern Discovery


b) Association Rule Discovery
c) Both a and b
d) Otherwise
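
A dependency rule of this kind is usually scored with two standard measures, support and confidence. Neither is named in this question, so treat the sketch below as supplementary: the market-basket transactions are hypothetical, and the rule {diaper} -> {beer} is just an example.

```python
# Hypothetical market-basket transactions.
transactions = [
    {"bread", "milk"},
    {"bread", "diaper", "beer", "eggs"},
    {"milk", "diaper", "beer", "coke"},
    {"bread", "milk", "diaper", "beer"},
    {"bread", "milk", "diaper", "coke"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in the itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(antecedent, consequent, transactions):
    """How often the consequent occurs in transactions that contain the antecedent."""
    joint = support(set(antecedent) | set(consequent), transactions)
    base = support(antecedent, transactions)
    return joint / base if base > 0 else 0.0

rule_support = support({"diaper", "beer"}, transactions)
rule_confidence = confidence({"diaper"}, {"beer"}, transactions)
print(f"support = {rule_support:.2f}, confidence = {rule_confidence:.2f}")
```

Here diapers and beer appear together in 3 of the 5 transactions, and beer appears in 3 of the 4 transactions that contain diapers.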

Q ( F) What is the gain of document clustering?

Information retrieval can utilize the clusters to relate a new document or search term to clustered documents.

Q( E ) Define Data Mining.

– Non-trivial extraction of implicit, previously unknown and potentially useful information from data
– Exploration and analysis, by automatic or semi-automatic means, of large quantities of data in order to discover meaningful patterns

Q(D) What are the data mining tasks (data mining methods)?

1. Classification [Predictive]
2. Clustering [Descriptive]
3. Association Rule Discovery [Descriptive]
4. Sequential Pattern Discovery [Descriptive]
5. Regression [Predictive]
6. Deviation Detection [Predictive]

Q ( E ) Define Data Classification ?


Classification is a data mining function that assigns items in a collection to target categories
or classes. The goal of classification is to accurately predict the target class for each case in
the data. For example, a classification model could be used to identify loan applicants as
low, medium, or high credit risks.
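
The loan example can be sketched with a very simple classifier. Everything below is an assumption made for illustration: the two features, the training cases, and the choice of a 1-nearest-neighbor rule, which is only one of many classification methods.

```python
import math

# Hypothetical training cases:
# (annual income in thousands, debt-to-income ratio) -> credit risk class.
training = [
    ((90.0, 0.10), "low"),
    ((85.0, 0.15), "low"),
    ((55.0, 0.35), "medium"),
    ((50.0, 0.40), "medium"),
    ((25.0, 0.70), "high"),
    ((20.0, 0.80), "high"),
]

def scale(features):
    """Put both features on a comparable scale before measuring distance."""
    income, ratio = features
    return (income / 100.0, ratio)

def predict(features):
    """Assign the class of the closest training case (1-nearest neighbor)."""
    x = scale(features)
    nearest = min(training, key=lambda case: math.dist(x, scale(case[0])))
    return nearest[1]

prediction = predict((52.0, 0.38))
print(prediction)
```

The model is "trained" on cases whose class is known and then used to predict the class of a new applicant, which is what distinguishes classification from clustering.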

Q ( F ) What are the similarity measures of clustering?

• Euclidean Distance
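
For two points with continuous attributes, the Euclidean distance is the square root of the sum of squared attribute differences. A minimal sketch, with hypothetical points:

```python
import math

def euclidean_distance(p, q):
    """Square root of the sum of squared differences between matching attributes."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of attributes")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

# Differences are 3 and 4, so the distance is sqrt(9 + 16).
d = euclidean_distance((1.0, 2.0), (4.0, 6.0))
print(d)
```

A smaller distance means the two data points are more similar.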

Q ( H) What are the challenges of data mining?
1. Scalability
2. Dimensionality
3. Complex and Heterogeneous Data
4. Data Quality
5. Data Ownership and Distribution
6. Privacy Preservation
7. Streaming Data

Q ( I ) What are Applications of Data Mining?

1. Classification applications:
• Direct Marketing
• Fraud Detection
• Customer Attrition/Churn
• Sky Survey Cataloging
• Classifying Galaxies

2. Clustering applications:
• Market Segmentation
• Document Clustering

3. Association Rule Discovery:
• Marketing and Sales Promotion
• Supermarket shelf management
• Inventory Management

4. Sequential Pattern Discovery:
• Telecommunications alarm logs
• Point-of-sale transaction sequences

5. Deviation/Anomaly Detection:
• Network intrusion detection
• Fraud detection

6. Regression:
• Predicting sales amounts of a new product based on advertising expenditure.
• Predicting wind velocities as a function of temperature, humidity, air pressure, etc.
• Time series prediction of stock market indices.
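
The first regression application, predicting sales from advertising expenditure, can be sketched with an ordinary least-squares line. The figures are hypothetical and were chosen to lie exactly on a line so the result is easy to check by hand; real data would not fit this cleanly.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: advertising spend and resulting sales (same arbitrary units).
advertising = [1.0, 2.0, 3.0, 4.0, 5.0]
sales = [12.0, 14.0, 16.0, 18.0, 20.0]

slope, intercept = fit_line(advertising, sales)
predicted = intercept + slope * 6.0  # predicted sales for a spend of 6.0
print(slope, intercept, predicted)
```

Unlike classification, the predicted value here is a continuous number, not a class label.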

Q(2) Why may traditional techniques be unsuitable for extracting information?

1. Enormity of data
2. High dimensionality of data
3. Heterogeneous, distributed nature of data

Q(3) How may data mining help scientists?

– in classifying and segmenting data

– in Hypothesis Formation

