Subject code: 80359 Subject Name: Data Warehousing and Data Mining
Common subject code (if any): _________________________________
9) An important aspect of the data warehouse environment is that the data found within the data
warehouse is _______.
a) subject-oriented
b) time-variant
c) integrated
d) All of these
15) Data that are not of interest to the data mining task are called ______ data.
a) missing
b) irrelevant
c) changing
d) noisy
16) Converting data from different sources into a common format for processing is called
________.
a) preprocessing
b) transformation
c) selection
d) interpretation
17) _______ refers to how often a given rule appears in the database being mined.
a) Confidence
b) Support
c) Count
d) None of these
19) The Apriori algorithm was proposed by R. Agrawal and R. Srikant in ______ for finding frequent
itemsets in a dataset.
a) 1990
b) 1991
c) 1992
d) 1994
28) A _____ simply stores the training data and waits until test data appears.
a) smart learner
b) lazy learner
c) active learner
d) passive learner
29) Rule-based classification algorithms generate ______ rules to perform the classification.
a) if-then
b) while
c) do-while
d) switch
30) A ______ algorithm is used to build a decision tree classifier from a given dataset of training
instances.
a) Greedy
b) Bayes
c) ETL
d) None of these
34) The goal of _____ is to discover both the dense and sparse regions of a data set.
a) Classification
b) Clustering
c) Association rule
d) Genetic Algorithm
36) In the ________ algorithm, each cluster is represented by the center of gravity of the cluster.
a) k-medoid
b) k-means
c) STIRR
d) ROCK
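For reference, the "center of gravity" mentioned in question 36 is the component-wise mean of the points in a cluster. A minimal Python sketch follows; the points and the helper name `centroid` are hypothetical and chosen only for illustration.

```python
# Hypothetical 2-D points assigned to one cluster (illustrative only).
cluster = [(1.0, 2.0), (3.0, 4.0), (5.0, 9.0)]

def centroid(points):
    """Return the component-wise mean of a non-empty list of equal-length tuples."""
    n = len(points)
    # zip(*points) groups the first coordinates together, then the second, etc.
    return tuple(sum(coords) / n for coords in zip(*points))

center = centroid(cluster)
```

Note that the resulting center need not coincide with any actual record in the cluster.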
37) ______ is the clustering technique that uses a merging approach.
a) Naive Bayes
b) Hierarchical
c) Partitioned
d) None of these
38) The _____ clustering technique starts with as many clusters as there are records, with each cluster
having only one record.
a) Agglomerative
b) Divisive
c) Partition
d) Numeric
39) In web mining, _______ is used to find natural groupings of users, pages, etc.
a) clustering
b) associations
c) sequential analysis
d) classification
40) Which of the following is used to examine data collected by search engines and web spiders?
a) Web content mining
b) Web usage mining
c) Web structure mining
d) None of these
28) A database has five transactions. Let min_sup = 60% and min_conf = 80%
29) A database has five transactions. Let min_sup = 30% and min_conf = 70%
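
Questions 28 and 29 both depend on minimum support (min_sup) and minimum confidence (min_conf) thresholds. As a minimal sketch of how such thresholds are applied, the Python snippet below computes support and confidence for one candidate rule. The five transactions, the item names, and the helper functions `support` and `confidence` are illustrative assumptions, not the data from the original questions.

```python
# Hypothetical transactions (NOT the data from the original questions).
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "butter"},
    {"bread", "milk", "butter"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)

def confidence(antecedent, consequent, transactions):
    """support(antecedent union consequent) divided by support(antecedent)."""
    both = support(set(antecedent) | set(consequent), transactions)
    return both / support(antecedent, transactions)

min_sup, min_conf = 0.60, 0.80  # thresholds as stated in question 28

# Candidate rule: bread -> milk
sup = support({"bread", "milk"}, transactions)
conf = confidence({"bread"}, {"milk"}, transactions)

is_frequent = sup >= min_sup                    # itemset meets min_sup
is_strong = is_frequent and conf >= min_conf    # rule also meets min_conf
```

With these made-up transactions, the itemset {bread, milk} appears in 3 of 5 transactions, so it just meets the 60% support threshold, while the rule bread -> milk has a confidence of 3/4 and therefore falls short of the 80% confidence threshold.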