You are on page 1of 4

University of Mumbai

Examination 2020 under cluster 4 (PCE)


These are sample MCQs to indicate pattern, may or may not
appear in examination
G.M. Vedak Institute of Technology
Program: BE Computer Engineering
Curriculum Scheme: Rev2012
Examination: Fourth Year Semester VIII
Course Code: CPC801 and Course Name: Data Warehouse and Mining
Time: 1 hour Max. Marks: 50
=====================================================================
Note to the students:- All the Questions are compulsory and carry equal marks .
Q1. The data is stored, retrieved & updated in ___
Option A: OLAP
Option B: OLTP
Option C: SMTP
Option D: FTP

Q2. The terms equality and roll up are associated with ____
Option A: OLAP
Option B: visualization
Option C: data mart.
Option D: decision tree.

Q3. Exceptional reporting in data warehousing is otherwise called as ____


Option A: exception
Option B: alerts
Option C: errors
Option D: bugs

Q4. Which is NOT a basic conceptual schema in Data Modeling of Data Warehouses?
Option A: Star schema
Option B: Fact constellations
Option C: Tree schema
Option D: Snowflake schema

Q5. ___________ is a good alternative to the star schema.


Option A: Star-snowflake schema
Option B: Star schema
Option C: Snowflake schema
Option D: Fact constellation

Q6. In what case does snowflake schema perform better than star schema?
Option A: When there is less redundancy
Option B: When there is more redundancy

1 | Page
University of Mumbai
Examination 2020 under cluster 4 (PCE)
Option C: All cases
Option D: No case

Q7. MDDB stands for ___________.


Option A: multidimensional databases
Option B: multiple double dimension
Option C: multi-dimension doubling
Option D: multiple data doubling

Q8. Record cannot be updated in _____


Option A: OLTP
Option B: files
Option C: RDBMS
Option D: data warehouse

Q9. ____ is a good alternative to the star schema.


Option A: Star schema.
Option B: Snowflake schema.
Option C: Fact constellation.
Option D: Star-snowflake schema.

Q10. Strategic value of data mining is ____


Option A: cost-sensitive.
Option B: work-sensitive
Option C: time-sensitive.
Option D: technical-sensitive.

Q11. Which of the following is the other name of Data mining?


Option A: Exploratory data analysis.
Option B: Data driven discovery.
Option C: Deductive learning.
Option D: Deep learning.

Q12. Rule based classification algorithms generate ______ rule to perform the
classification.
Option A: if-then
Option B: while
Option C: do while
Option D: switch

Q13. With Bayes classifier, missing data items are


Option A: treated as equal compares
Option B: treated as unequal compares
Option C: replaced with a default value
Option D: ignored

2 | Page
University of Mumbai
Examination 2020 under cluster 4 (PCE)
Q14. __________ clustering techniques starts with all records in one cluster and then
try to split that cluster into small pieces
Option A: Agglomerative
Option B: Divisive
Option C: Partition
Option D: Numeric

Q15. The most commonly used measure of similarity is the _____ or its square
Option A: euclidean distance
Option B: city-block distance
Option C: Chebychev’s distance
Option D: Manhattan distance

Q16. The _____ method uses information on all pairs of distances, not merely the
minimum or maximum distances
Option A: single linkage
Option B: medium linkage
Option C: complete linkage
Option D: average linkage

Q17. Which method of analysis does not classify variables as dependent or


independent
Option A: regression analysis
Option B: discriminant analysis
Option C: analysis of variance
Option D: cluster analysis

Q18. The leaf nodes of a model tree are


Option A: averages of numeric output attribute values.
Option B: nonlinear regression equations
Option C: linear regression equations
Option D: sums of numeric output attribute values

Q19. Simple regression assumes a __________ relationship between the input attribute
and output attribute
Option A: linear
Option B: quadratic
Option C: reciprocal
Option D: inverse

Q20. The time horizon in operational environment is ___


Option A: 30-60 days.
Option B: 60-90 days.
Option C: 90-120 days.
Option D: 120-150 days.

3 | Page
University of Mumbai
Examination 2020 under cluster 4 (PCE)
Q21. The first International conference on KDD was held in the year ____
Option A: 1996
Option B: 1997
Option C: 1995
Option D: 1994

Q22. Which schema is best when dimensions are not normalized?


Option A: Star schema
Option B: Snowflake schema
Option C: Fact constellation
Option D: No schema is best when dimensions are not normalized

Q23. ___ is a comparison of the general features of the target class data objects against the
general features of objects from one or multiple contrasting classes.
Option A: Data Characterization
Option B: Data Classification
Option C: Data discrimination
Option D: Data selection

Q24. __________ are designed to overcome any limitations placed on the warehouse by the
nature of the relational data model.
Option A: Operational database
Option B: Relational database
Option C: Multidimensional database
Option D: Data repository

Q25. Which of the following algorithm is most sensitive to outliers


Option A: K-means clustering algorithm
Option B: K-medians clustering algorithm
Option C: K-modes clustering algorithm
Option D: K-medoids clustering algorithm

4 | Page

You might also like