MCQs
Unit – I
Q1. Which of the following is an essential process in which intelligent methods are applied
to extract data patterns?
a. Warehousing
b. Data Mining
c. Text Mining
d. Data Selection
Ans: B
Ans: D
Ans: D
Q4. Which one of the following statements about data cleaning is not correct?
Ans: D
Q5. Issues such as the efficiency and scalability of data mining algorithms come under _______
a. Performance issues
b. Diverse data type issues
c. Mining methodology and user interaction
d. All of the above
Ans: A
Ans: C
a. It is hidden within a database and can only be recovered if one is given certain clues (an
example is encrypted information).
b. An extremely complex molecule that occurs in human chromosomes and that carries
genetic information in the form of genes.
c. It is a kind of process of extracting implicit, previously unknown and potentially useful
information from data
d. None of the above
Ans: C
Q8. Which one of the following refers to modeling regularities or trends for objects whose
behavior changes over time?
a. Prediction
b. Evolution analysis
c. Classification
d. Both A and B
Ans: B
Q9. Issues such as "handling relational and complex types of data" come under which of the
following categories?
Ans: A
Q10. Which of the following is also used as the first step in the knowledge discovery process?
a. Data selection
b. Data cleaning
c. Data transformation
d. Data integration
Ans: B
Q11. Which of the following refers to the step of the knowledge discovery process in which
several data sources are combined?
a. Data selection
b. Data cleaning
c. Data transformation
d. Data integration
Ans: D
Q12. Which one of the following issues must be considered before investing in data mining?
a. Compatibility
b. Functionality
c. Vendor consideration
d. All of the above
Ans: D
Q13. Which one of the following issues must be considered before investing in data mining?
a. Compatibility
b. Functionality
c. Vendor consideration
d. All of the above
Ans: D
(a) Data Cleaning
(b) Data Transformation
(c) Data Reduction
(d) Data Integration
Ans: A
Q15. _____ studies the collection, analysis, interpretation or explanation, and presentation of data.
(a) Statistics
(b) Visualization
(c) Data Mining
(d) Clustering
Ans: A
Q16. _____ investigates how computers can learn (or improve their performance) based on data.
Ans: A
Q18. ____ is the science of searching for documents or information in documents.
(a) Data Mining
(b) Information Retrieval
(c) Text Mining
(d) Web Mining
Ans: B
(a) On Going
(b) Active
(c) Interactive
(d) Flexible
Ans: C
Q20. In the multidimensional view of data mining, the major dimensions are
(a) Methods
(b) Applications
(c) Tools
(d) Files
Ans: B
(a) Method
(b) Variable
(c) Task
(d) Attribute
Ans: D
(a) Ordinal
(b) Nominal
(c) Ratio
(d) Interval
Ans: B
(a) Information
(b) Database
(c) Metadata
(d) File
Ans: C
Q24. Patterns that can be discovered from a given database are which type…
Ans: B
Q30. The learning which is used for inferring a model from labeled
training data is called?
(A). Unsupervised learning
(B). Reinforcement learning
(C). Supervised learning
(D). Machine learning
Ans: C
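As a minimal sketch of what "inferring a model from labeled training data" can look like, the snippet below uses a 1-nearest-neighbor rule. The function name `nearest_neighbor_predict` and the toy data are illustrative assumptions, not part of the question bank; the point is only that every training example carries a label and the prediction is derived from those labels.

```python
def nearest_neighbor_predict(training, query):
    """Predict the label of `query` from labeled examples.

    training: list of (features, label) pairs, features being tuples of numbers.
    The label of the closest training example (squared Euclidean distance)
    is returned.
    """
    _, label = min(
        training,
        key=lambda pair: sum((a - b) ** 2 for a, b in zip(pair[0], query)),
    )
    return label


# Hypothetical labeled training data: (features, label)
training = [
    ((1.0, 1.0), "low"),
    ((1.2, 0.8), "low"),
    ((8.0, 9.0), "high"),
]
prediction = nearest_neighbor_predict(training, (7.5, 8.0))
```

Unsupervised methods such as clustering would receive only the feature tuples, without the labels.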
Unit – II
Ans: C
Q5. Which of the following is not a valid type of fact table?
1. Fact-less fact table
2. Transaction fact tables
3. Integration fact tables
4. Aggregate fact tables
Ans: C
Ans: B
Ans: D
Ans: D
Q13. The generic two-level data warehouse architecture includes which of the
following?
1. At least one data mart
2. Data that can be extracted from numerous internal and external sources
3. Near real-time updates
4. All of the above.
Ans: B
Ans: A
Ans: A
Q16. Which of the following is not true regarding characteristics of warehoused data?
1. Changed data will be added as new data
2. Data warehouse can contain historical data
3. Obsolete data are discarded
4. Users can change data once entered into the data warehouse
Ans: D
Q17. Which is the core of the multidimensional model that consists of a large
set of facts and a number of dimensions?
1. Multidimensional cube
2. Data model
3. Data cube
4. None of the above
Ans: C
Ans: B
Q19. Which of the following standard query techniques increases the granularity?
1. roll-up
2. drill-down
3. slicing
4. dicing
Ans: B
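The difference between roll-up and drill-down can be sketched with a small time hierarchy (year, quarter, month). The `sales` rows and the `aggregate` helper are hypothetical names chosen for illustration; this is not the API of any OLAP tool.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, quarter, month, sales_amount)
sales = [
    ("2023", "Q1", "Jan", 100),
    ("2023", "Q1", "Feb", 120),
    ("2023", "Q2", "Apr", 90),
    ("2024", "Q1", "Jan", 150),
]


def aggregate(rows, level):
    """Sum the measure, grouping by the first `level` columns of the hierarchy.

    level=1 groups by year, level=2 by (year, quarter),
    level=3 by (year, quarter, month).
    """
    totals = defaultdict(int)
    for row in rows:
        totals[row[:level]] += row[-1]
    return dict(totals)


by_quarter = aggregate(sales, 2)   # starting view
by_year = aggregate(sales, 1)      # roll-up: coarser, fewer cells
by_month = aggregate(sales, 3)     # drill-down: finer, more cells
```

Moving from `by_quarter` to `by_month` is a drill-down (more detail); moving from `by_quarter` to `by_year` is a roll-up (more aggregation).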
Ans: A
Q21. __ is a standard query technique that can be used within OLAP to zoom in
Ans: B
Q23. A __ combines facts from multiple processes into a single fact table and
eases the analytic burden on BI applications.
1. Aggregate fact table
2. Consolidated fact table
3. Transaction fact table
4. Accumulating snapshot fact table
Ans: B
Ans: A
Q25. Focusing on the modeling and analysis of data for decision makers,
not on daily operations or transaction processing, is known as
1. Integrated
2. Time-variant
3. Subject oriented
4. Non-volatile
Ans: C
Q26. Which one is not a type of fact?
1. Fully Additive
2. Cumulative Additive
3. Semi Additive
4. Non Additive
Ans: B
Q27. _____ refers to the currency and lineage of data in a data warehouse
1. Operational metadata
2. Business metadata
3. Technical metadata
4. End-User metadata
Ans: A
Q28. Which of the following correctly refers to the term "Data Independence"?
a. It means that the programs are not dependent on the logical attributes
b. It refers to that data that is defined separately, not included in the program
c. It means that the programs are not dependent on the physical attributes of data
d. Both A and C
Ans: D
Ans: C
B. A process to load the data in the data warehouse and to create the necessary indexes
C. A process to upgrade the quality of data after it is moved into a data warehouse
D. A process to upgrade the quality of data before it is moved into a data warehouse
Ans: B
Unit – III
Q1. Which of the following statements about hierarchical clustering is incorrect?
Ans: A
Q2. Which one of the following statements about the K-means clustering is incorrect?
a. The goal of the k-means clustering is to partition (n) observations into (k) clusters
b. K-means clustering can be defined as the method of quantization
c. The nearest neighbor is the same as the K-means
d. All of the above
Ans: C
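To make the partitioning goal concrete, here is a minimal sketch of k-means (Lloyd's iteration) in plain Python. The function name `kmeans`, the fixed random seed, and the sample points are assumptions made for illustration; production code would normally rely on an established library.

```python
import random


def kmeans(points, k, iterations=20, seed=0):
    """Partition `points` (tuples of numbers) into k clusters.

    Returns (centroids, clusters). Initial centroids are k distinct points
    drawn at random; the loop stops early once the centroids stop moving.
    """
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(
                range(k),
                key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centroids[i])),
            )
            clusters[nearest].append(p)
        # Update step: each centroid becomes the mean of its members.
        new_centroids = []
        for i, members in enumerate(clusters):
            if members:
                new_centroids.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members))
                )
            else:
                new_centroids.append(centroids[i])  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, clusters


points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
centroids, clusters = kmeans(points, k=2)
```

Unlike a nearest-neighbor classifier, this procedure uses no labels: the groups are discovered from the distances alone, which is why option c is the incorrect statement.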
Q3. Which one of the following can be considered as the final output of the hierarchical type of
clustering?
a. A tree that displays how close things are to each other
b. Assignment of each point to clusters
c. Finalize estimation of cluster centroids
d. None of the above
Ans: A
Q4. Which one of the following clustering techniques needs the merging approach?
a. Partitioned
b. Naïve Bayes
c. Hierarchical
d. Both A and C
Ans: C
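The merging approach of hierarchical (agglomerative) clustering can be sketched as follows. This is a simplified single-linkage version on one-dimensional numbers; the function name `agglomerative` and the sample values are illustrative assumptions. The recorded merge order is the information a dendrogram would display.

```python
def agglomerative(values):
    """Single-linkage agglomerative clustering of 1-D numbers.

    Starts with one cluster per value and repeatedly merges the two closest
    clusters. Returns the merges in order as (cluster_a, cluster_b, distance).
    """
    clusters = [[v] for v in values]
    merges = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Single linkage: distance between the closest pair of members.
                d = min(abs(a - b) for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((sorted(clusters[i]), sorted(clusters[j]), d))
        merged = clusters[i] + clusters[j]
        clusters = [c for idx, c in enumerate(clusters) if idx not in (i, j)]
        clusters.append(merged)
    return merges


merges = agglomerative([1, 2, 6, 7, 20])
```

Each entry in `merges` corresponds to one join in the tree, from the closest pair at the bottom to the final merge at the root.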
Q5. Self-organizing maps can also be considered as an instance of _________ type of learning.
a. Supervised learning
b. Unsupervised learning
c. Missing data imputation
d. Both A & C
Ans: B
Q6. The following given statement can be considered as the examples of_________
Suppose one wants to predict the number of newborns according to the size of storks'
Ans: C
Q7. In the example predicting the number of newborns, the final number of total newborns can be
considered as the _________
a. Features
b. Observation
c. Attribute
d. Outcome
Ans: D
a. It is a measure of accuracy
b. It is a subdivision of a set
c. It is the task of assigning a classification
d. None of the above
Ans: B
Q9. Which of the following can be considered as the classification or mapping of a set or class with
a. Data set
b. Data Characterization
c. Data Sub Structure
d. Data Discrimination
Ans: D
Q10. The analysis performed to uncover the interesting statistical correlation between
a. Mining of association
b. Mining of correlation
c. Mining of clusters
d. All of the above
Ans: B
Q11. Which one of the following can be defined as the data object which does not comply
a. Evaluation Analysis
b. Outlier Analysis
c. Classification
d. Prediction
Ans: B
Q12. Which one of the following correctly defines the term cluster?
Ans: A
a. This takes only two values. In general, these values will be 0 and 1, and they can be
coded as one bit
b. The natural environment of a certain species
c. Systems that can be used without knowledge of internal operations
d. All of the above
Ans: A
Q14. Which one of the following correctly refers to the task of the classification?
a. Approach to the design of learning algorithms that is structured along the lines of the
theory of evolution.
b. Decision support systems that contain an information base filled with the knowledge of
an expert formulated in terms of if-then rules.
c. Combining different types of method or information
d. None of these
Ans: C
a. The process of finding a solution for a problem simply by enumerating all possible
solutions according to some predefined order and then testing them
b. The distance between two points as calculated using the Pythagoras theorem
c. A stage of the KDD process in which new data is added to the existing selection.
d. All of the above
Ans: C
Q17. Which one of the following correctly refers to the class under study in data characterization?
a. Final class
b. Study class
c. Target class
d. Both A and C
Ans: C
Q18. Which of the following refers to a sequence of patterns that occurs frequently?
a. Frequent sub-sequence
b. Frequent sub-structure
c. Frequent sub-items
d. All of the above
Ans: A
Q19. Which one of the following refers to the model regularities or to the objects that trends or not
a. Prediction
b. Evolution analysis
c. Classification
d. Both A and B
Ans: B
Q26. Cluster is
A. Group of similar objects that differ significantly from other objects
B. Operations on a database to transform or simplify data in order to prepare it
for a machine-learning algorithm
C. Symbolic representation of facts or ideas from which information can
potentially be extracted
D. None of these
Ans: A
A) Data Characterization
B) Data Classification
C) Data discrimination
D) Data selection
Ans: A
Q29. ............................. is a comparison of the general features of the target class data objects
against the general features of objects from one or multiple contrasting classes.
A) Data Characterization
B) Data Classification
C) Data discrimination
D) Data selection
Ans: C
Q30. ............................. is the process of finding a model that describes and distinguishes data classes
or concepts.
A) Data Characterization
B) Data Classification
C) Data discrimination
D) Data selection
Ans: B
Unit – IV
Q1. Which one of the following can be considered as the correct application of the data mining?
a. Fraud detection
b. Corporate Analysis & Risk management
c. Management and market analysis
d. All of the above
Ans: D
Q2. Data mining can also be applied to other forms of data such as ................
i) Data streams
ii) Sequence data
iii) Networked data
iv) Text data
A) i, ii, iii
B) ii, iii, iv
C) i, iii, iv
D) All i, ii, iii, iv
Ans: D
Q3. ___________ is the application of data mining techniques to discover patterns from the Web.
A. Text Mining.
B. Multimedia Mining.
C. Web Mining.
D. Link Mining.
Ans: C
Q6. Which of the following is a private network used to access data through the web?
a. Internet
b. Extranet
c. Intranet
d. None of the above
Ans: C
Q7. Web-enabling the Data Warehouse uses the following as the information delivery mechanism.
a. Web technology
b. Grid computing
c. Artificial intelligence
d. None of these
Ans: A
Q11. The data set {brown, black, blue, green, red} is an example of:
a. Continuous attribute
b. Ordinal attribute
c. Numeric attribute
d. Nominal attribute
Ans: D
a. Photos
b. Graphs
c. Charts
d. Information Graphics
Ans: A
Q13. Dimensionality reduction reduces the data set size by removing _________:
a. composite attributes
b. derived attributes
c. relevant attributes
d. irrelevant attributes
Ans: D
a. weather forecast
b. data matrix
d. genomic data
Ans: D
Q15. To detect fraudulent usage of credit cards, the following data mining task should be used
a. Outlier analysis
b. prediction
c. association analysis
d. feature selection
Ans: A
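One simple way to picture outlier analysis on card transactions is a robust deviation score. The sketch below flags amounts that lie far from the median, measured in units of the median absolute deviation (MAD). The function name `flag_outliers`, the threshold of 5, and the amounts are assumptions for illustration; real fraud detection uses far richer features and models.

```python
from statistics import median


def flag_outliers(amounts, threshold=5.0):
    """Return the amounts whose distance from the median exceeds
    `threshold` times the median absolute deviation (MAD)."""
    center = median(amounts)
    mad = median(abs(x - center) for x in amounts)
    if mad == 0:
        # All typical values are identical; anything different is unusual.
        return [x for x in amounts if x != center]
    return [x for x in amounts if abs(x - center) / mad > threshold]


# Hypothetical transaction amounts for one card
amounts = [20, 25, 22, 19, 24, 21, 23, 500]
suspicious = flag_outliers(amounts)
```

The median and MAD are used instead of the mean and standard deviation because a single extreme value would otherwise inflate the very statistics used to detect it.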
a. Zip codes
b. Ordered numbers
c. Movie ratings
d. Military ranks
Ans: A
Q17. Which data mining task can be used for predicting wind velocities as a function of temperature,
a. Cluster Analysis
b. Regression
c. Classification
Ans: B
Ans: C
b. Salary
c. Mass
d. Gender
Ans: D
Q21. Nominal and ordinal attributes can be collectively referred to as_________ attributes
a. perfect
b. qualitative
c. consistent
d. optimized
Ans: B
c. Regression
Ans: A
Ans: B
Q24. In binning, we first sort data and partition into (equal-frequency) bins and then
Ans: D
d. eliminating noise
Ans: B
a. Missing values
b. Outlier records
c. Duplicate records
Ans: D
Q27. Which of the following is not a Data discretization Method?
a. Histogram analysis
b. Cluster Analysis
c. Data compression
d. Binning
Ans: C
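Binning, one of the discretization methods listed above, can be sketched as follows: sort the data, then partition it into equal-frequency bins. Smoothing by bin means is shown as one common follow-up step. The function names and the sample values are illustrative assumptions, and the sketch assumes there are at least as many values as bins.

```python
def equal_frequency_bins(values, n_bins):
    """Sort the values and split them into n_bins bins of (nearly) equal size.

    Assumes len(values) >= n_bins so that no bin is empty.
    """
    ordered = sorted(values)
    size, extra = divmod(len(ordered), n_bins)
    bins, start = [], 0
    for i in range(n_bins):
        end = start + size + (1 if i < extra else 0)  # spread any remainder
        bins.append(ordered[start:end])
        start = end
    return bins


def smooth_by_bin_means(bins):
    """Replace every value in a bin by that bin's mean."""
    return [[sum(b) / len(b)] * len(b) for b in bins]


prices = [15, 4, 21, 8, 28, 21, 34, 24, 25]
bins = equal_frequency_bins(prices, 3)
smoothed = smooth_by_bin_means(bins)
```

Replacing each value by its bin index, or by a bin label, gives a discretized version of the attribute.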
Q28. Which of the following data mining tasks is known as Market Basket Analysis?
a. Association Analysis
b. Regression
c. Classification
d. Outlier Analysis
Ans: A
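Association analysis on market baskets is usually described with two measures, support and confidence. The sketch below computes both for a toy set of transactions; the function names and the basket contents are assumptions chosen for illustration.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimated probability of `consequent` given `antecedent`:
    support(antecedent and consequent together) / support(antecedent)."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Hypothetical shopping baskets
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
s = support(transactions, {"bread", "milk"})
c = confidence(transactions, {"bread"}, {"milk"})
```

Here the rule "bread implies milk" has support 0.5 (two of four baskets contain both) and confidence of about 0.67 (two of the three bread baskets also contain milk).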
Ans: D
Q30. Firms that are engaged in sentiment mining are analyzing data collected from?
A. social media sites.
B. in-depth interviews.
C. focus groups.
D. experiments.
Ans: A