Professional Documents
Culture Documents
Data Mining
Data Mining
DW Architecture
Supported by Ralph
Kimball
Strategic view of
Perceived ability
Constraints on the data warehouse Compatibility with
of the in-house IT
resources prior to existing systems
staff
implementation
Social/political
Technical issues
factors
Requires minimal
Frees up capacity on Makes powerful
investment in Frees up cash flow
in-house systems solutions affordable
infrastructure
The nontrivial process of identifying valid, novel, potentially useful, and ultimately
understandable patterns in data stored in structured databases. - Fayyad et al., (1996)
► Data warehouse is the primary source of data mining but other data sources can
also be used.
► The architecture for data mining is that of a client-server or web based
architecture.
► The nature of data can be unstructured or structured
► Data miner might be an individual with little or no programming skills.
Sophisticated tools are used to ease the data extraction process.
► Creative thinking helps to make sense out of the findings
► Data mining tools can be easily integrated with spreadsheets and other software
development tools
► Parallel processing is required to mine large data sets for analysis.
Source: Sharda et.al (2018)
Source: Sharda et.al (2018)
Data Mining process
1. CRISP-DM
Source:https://www.youtube.com/watch?v=ar2J0pX0T3M
2. SEMMA