General Objective:
This course is designed to introduce the core concepts of data warehousing and data mining, along
with their associated techniques, implementations, and benefits. The course also introduces the art
and techniques of knowledge management, and the different applications of data warehousing, data
mining, and knowledge management.
Learning outcomes:
On completion of the module, students will be able to:
1. Describe the key principles of data mining and knowledge management.
2. Identify data mining and data warehousing functionalities.
3. Describe and demonstrate basic data mining algorithms, methods, and tools.
4. Apply the practices and principles of data mining and knowledge management.
5. Differentiate between techniques of data abstraction and data mining.
6. Apply data pre-processing techniques: data cleaning, data integration and transformation, data
reduction, discretization, and concept hierarchy generation.
7. Identify business applications of data mining and warehousing.
Tutorial 1 15
Total 120
Assessment approach:
[Updated section: removed the assignment and one of the Term Tests, and added the self-reflection
of the Flipped Class session assessment from SS2023]
2.1 Data marts, types of data marts, loading a data mart, metadata, data model, maintenance,
nature of data
2.2 Software components; external data, reference data, performance issues, monitoring
requirements and security in a data mart.
3.1 OLTP and OLAP systems, Data Modelling, OLAP tools, State of the market
3.2 Arbor Essbase Web, MicroStrategy DSS Web, Brio Technology
3.3 Star schema for multi-dimensional view, snowflake schema; OLAP tools.
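As an illustration of the star schema named in 3.3, the following is a minimal sketch using Python's built-in sqlite3 module. The table names (fact_sales, dim_product, dim_date), the columns, and the sample rows are hypothetical and chosen only for this example; they are not taken from the prescribed texts. One central fact table references two dimension tables, and an OLAP-style roll-up is obtained by joining and grouping.

```python
import sqlite3


def build_star_schema(conn):
    """Create a tiny star schema: one fact table and two dimension tables."""
    conn.executescript(
        """
        CREATE TABLE dim_product (
            product_id INTEGER PRIMARY KEY, name TEXT, category TEXT);
        CREATE TABLE dim_date (
            date_id INTEGER PRIMARY KEY, year INTEGER, quarter INTEGER);
        -- The fact table holds the measure (amount) plus a key per dimension.
        CREATE TABLE fact_sales (
            product_id INTEGER REFERENCES dim_product(product_id),
            date_id INTEGER REFERENCES dim_date(date_id),
            amount REAL);
        """
    )
    # Hypothetical sample data, for illustration only.
    conn.executemany(
        "INSERT INTO dim_product VALUES (?, ?, ?)",
        [(1, "Pen", "Stationery"), (2, "Notebook", "Stationery"),
         (3, "Lamp", "Furniture")],
    )
    conn.executemany(
        "INSERT INTO dim_date VALUES (?, ?, ?)",
        [(1, 2023, 1), (2, 2023, 2)],
    )
    conn.executemany(
        "INSERT INTO fact_sales VALUES (?, ?, ?)",
        [(1, 1, 10.0), (2, 1, 20.0), (3, 1, 50.0), (1, 2, 15.0), (3, 2, 40.0)],
    )


def sales_by_category_and_quarter(conn):
    """Roll up the sales measure along the product and date dimensions."""
    return conn.execute(
        """
        SELECT p.category, d.quarter, SUM(f.amount)
        FROM fact_sales AS f
        JOIN dim_product AS p ON p.product_id = f.product_id
        JOIN dim_date AS d ON d.date_id = f.date_id
        GROUP BY p.category, d.quarter
        ORDER BY p.category, d.quarter
        """
    ).fetchall()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    build_star_schema(connection)
    for row in sales_by_category_and_quarter(connection):
        print(row)
```

A snowflake schema would further normalize a dimension, for example by moving category out of dim_product into its own table.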
7.1 Clustering paradigm, Partition algorithms, CLARA – Clustering LARge Applications, CLARANS
– Clustering Large Applications based on RANdomized Search
7.2 Hierarchical clustering, DBSCAN – Density-Based Spatial Clustering of Applications with Noise,
BIRCH – Balanced Iterative Reducing and Clustering using Hierarchies, CURE; Categorical
clustering, STIRR, ROCK – RObust Clustering using linKs, CACTUS – Clustering Categorical
Data Using Summaries.
8.1 Tree construction principle, Best split, Splitting indices, Splitting criteria
8.2 Decision tree construction with pre-sorting.
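As an illustration of the splitting indices and pre-sorting named in 8.1 and 8.2, the following is a minimal sketch that picks the best binary split on one numeric attribute using the Gini index. The Gini index is used here as one common splitting index, not necessarily the one emphasized in the prescribed texts, and the function names gini and best_split are hypothetical. The attribute values are sorted once, and each midpoint between distinct neighbouring values is evaluated as a candidate threshold.

```python
from collections import Counter


def gini(labels):
    """Gini index of a list of class labels: 1 - sum of squared class shares."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def best_split(values, labels):
    """Return (threshold, weighted_gini) for the best binary split.

    Records are pre-sorted by attribute value; every midpoint between two
    distinct neighbouring values is tried as a threshold. Returns
    (None, inf) if all attribute values are identical.
    """
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    best_threshold, best_score = None, float("inf")
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold can separate equal values
        left = [label for _, label in pairs[:i]]
        right = [label for _, label in pairs[i:]]
        # Size-weighted average impurity of the two partitions.
        score = (len(left) * gini(left) + len(right) * gini(right)) / n
        if score < best_score:
            best_threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
            best_score = score
    return best_threshold, best_score


if __name__ == "__main__":
    print(best_split([1, 2, 3, 10, 11, 12], ["a", "a", "a", "b", "b", "b"]))
```

For clarity this sketch recomputes the class counts at every candidate threshold. Practical pre-sorting implementations update running class counts while scanning the sorted list.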
Unit IX: Web Mining
9.1 Web content Mining, Web structure Mining, Web usage Mining, and Text Mining.
Reading List:
Essential reading
Prabhu, S. (2004). Data Warehousing: Concepts, Techniques, Products and Applications (2nd ed.). India:
PHI Learning Pvt. Ltd.
Pujari, A.K. (2013). Data Mining Techniques (3rd ed.). India: Universities Press.
Additional reading
Berson, A. & Smith, S. J. (1997). Data Warehousing, Data Mining and OLAP. New Delhi: McGraw
Hill.
Anahory, S. & Murray, D. (1997). Data Warehousing in the Real World (1st ed.). India: Addison Wesley
Longman Ltd.
Dunham, M. (2002). Data Mining: Introductory and Advanced Topics (1st ed.). India: Pearson Education.