1. What are the various strategies and techniques used in data mining?
2. What is data mining? Differentiate between data mining techniques and data mining strategies.
3. What is a data warehouse? How is it different from a database?
4. What do you mean by granularity? What is partitioning?
5. Explain the life cycle of a data warehouse.
6. Is data consolidation a data modeling activity? Justify your answer.
7. What is data mining? Define the major issues in data mining.
8. Explain data, information, knowledge and intelligence.
9. What are the different forms of data processing?
10. Explain data cleaning, data transformation, and data integration.
11. Distinguish between dimensionality reduction and numerosity reduction.
12. Explain concept hierarchy generation for categorical data.
13. Define KDD. Identify and describe the phases of KDD.
14. Explain attribute subset selection methods for data reduction, with an example.
15. Describe the differences between the following approaches for integrating a data mining system
with a database or data warehouse system: no coupling, loose coupling, semi-tight coupling, and tight
coupling.
16. Explain principal component analysis (PCA) in detail.
17. What are outliers? How can outlier analysis be done?
18. Describe in brief the important steps of data mining and data mining functionalities.
19. Describe the important types of difficulties in the data mining process.
20. Describe the process of data integration and transformation.
21. Explain the characteristics of operational data.
Assignment 2
Assignment 3
Cluster this dataset non-hierarchically and also provide the answer to the following:
i) Compute the matrix of Manhattan distances.
ii) Which two cases are closest together?
iii) Which are the two clusters?
12. i) Clustering relies on a similarity measure between data points. List all the measures used to cluster the
points.
ii) Explain different kinds of non-hierarchical clustering based on density and probability algorithms.
Assignment 4
1. Explain the three-tier architecture of a data warehouse. Also distinguish a data warehouse from a data
mart.
2. Explain all steps and guidelines for the implementation of a data warehouse.
3. What is multidimensional modeling? Explain the star, snowflake, and fact constellation schemas
for multidimensional databases. Also list their advantages and disadvantages.
4. Why is a data warehouse maintained separately from the database? Differentiate between OLTP and OLAP.
5. Write short notes on: i) Concept hierarchy ii) Data mart
6. Explain the important approaches to building a data warehouse system.
7. Explain various schemas for multidimensional modeling.
8. What are the differences between information processing, analytical processing, and data mining?
Assignment 5