data?
Lecture-4 Data mining functionality
Lecture-5 Classification of data mining
systems
Lecture-6 Major issues in data mining
Dr. C.NAGARAJU HEAD OF CSE YSREC of
YVU Proddatur
Unit-1 Data warehouse and OLAP
Lecture-7 What is a data warehouse?
• Late 1980s-present
– Advanced Data Analysis
• Data warehouse and OLAP
• Data mining and knowledge discovery
• Advanced data mining applications
• Data mining and society
• 1990s-present:
– XML-based database systems
– Integration with information retrieval
– Data and information integration
• Present-future:
– New generation of integrated data and information systems
[Figure: Databases → Data Cleaning and Data Integration → Data Warehouse → Selection & transformation → Task-relevant Data → Pattern evaluation]
Data Mining and Business Intelligence
[Figure: pyramid with increasing potential to support business decisions toward the top; labels shown: Making Decisions (End User); Data Exploration (Statistical Analysis, Querying and Reporting)]
Sequential patterns
Structured patterns
Similarity-based analysis
[Figure: Data Mining surrounded by related disciplines: Information Science, Machine Learning, Visualization, Other Disciplines]
Data Mining: Classification Schemes
General functionality
Descriptive data mining
Data warehousing:
The process of constructing and using data warehouses is
called data warehousing
3-D cuboids: (time, item, location), (time, item, supplier), (time, location, supplier), (item, location, supplier)
4-D (base) cuboid: (time, item, location, supplier)
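The lattice above can be enumerated mechanically: each cuboid corresponds to one subset of the dimensions. The following is a minimal sketch, assuming only the four dimension names shown in the lattice; the function name `cuboids_by_level` is hypothetical, and the lower levels (0-D to 2-D) are listed by the same rule even though only the 3-D and 4-D cuboids appear here.

```python
from itertools import combinations

# Dimension names taken from the lattice above.
DIMENSIONS = ("time", "item", "location", "supplier")


def cuboids_by_level(dimensions):
    """Map k to the list of k-D cuboids, each given as a tuple of dimension names."""
    return {
        k: list(combinations(dimensions, k))
        for k in range(len(dimensions) + 1)
    }


lattice = cuboids_by_level(DIMENSIONS)
for k, cuboids in lattice.items():
    print(f"{k}-D cuboids: {cuboids}")

# With n dimensions (and no concept hierarchies) there are 2**n cuboids.
total = sum(len(cuboids) for cuboids in lattice.values())
print("total cuboids:", total)
```

The 3-D level produced by this sketch contains exactly the four cuboids listed above, and the 4-D level contains only the base cuboid.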
Conceptual Modeling of Data Warehouses
Modeling data warehouses: dimensions & measures
Star schema: A fact table in the middle connected to a set
of dimension tables
Snowflake schema: A refinement of star schema where
some dimensional hierarchy is normalized into a set of
smaller dimension tables, forming a shape similar to
snowflake
Fact constellations: Multiple fact tables share dimension
tables, viewed as a collection of stars, therefore called
galaxy schema or fact constellation
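As a rough illustration of the star schema described above, the sketch below builds one fact table and two dimension tables in an in-memory SQLite database and then aggregates a measure by a dimension attribute. Column names such as `item_key`, `branch_key`, `units_sold` and `dollars_sold` follow the example schemas in these slides; the table name `sales`, the reduced set of columns, and all row values are made up for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Two dimension tables and one fact table in the middle (a very small star).
conn.executescript("""
CREATE TABLE item   (item_key INTEGER PRIMARY KEY, item_name TEXT, brand TEXT);
CREATE TABLE branch (branch_key INTEGER PRIMARY KEY, branch_name TEXT);
CREATE TABLE sales  (
    item_key     INTEGER REFERENCES item(item_key),
    branch_key   INTEGER REFERENCES branch(branch_key),
    units_sold   INTEGER,   -- measure
    dollars_sold REAL       -- measure
);
""")

# Made-up sample rows, for illustration only.
conn.executemany("INSERT INTO item VALUES (?, ?, ?)",
                 [(1, "laptop", "Acme"), (2, "phone", "Acme")])
conn.executemany("INSERT INTO branch VALUES (?, ?)",
                 [(10, "north"), (20, "south")])
conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?)",
                 [(1, 10, 2, 2000.0), (1, 20, 1, 1000.0), (2, 10, 5, 1500.0)])

# Join the fact table to a dimension table and aggregate the measures.
rows = conn.execute("""
    SELECT i.item_name, SUM(s.units_sold), SUM(s.dollars_sold)
    FROM sales AS s
    JOIN item AS i ON i.item_key = s.item_key
    GROUP BY i.item_name
    ORDER BY i.item_name
""").fetchall()
print(rows)
```

A snowflake variant would split a dimension further (for example, moving city attributes out of a location table into their own table), and a fact constellation would add a second fact table that reuses the same dimension tables.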
[Figure: snowflake schema (partial)]
Fact table (keys shown): branch_key, location_key
Measures: units_sold, dollars_sold, avg_sales
branch: branch_key, branch_name, branch_type
location: location_key, street, city_key
city: city_key, city, province_or_state, country
Example of Fact Constellation
[Figure: fact constellation (partial)]
time: time_key, day, day_of_the_week, month, quarter, year
item: item_key, item_name, brand, type, supplier_type
Sales Fact Table: time_key, item_key, branch_key
Shipping Fact Table: time_key, item_key, shipper_key, from_location
Other operations
drill across: involving (across) more than one fact table
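A drill-across can be sketched as rolling up two fact tables to a shared dimension and then lining up their measures side by side. The example below is a minimal sketch under stated assumptions: the two fact tables share `time_key` and `item_key` as in the fact constellation example, the measure `units_shipped` and the helper names `roll_up_by_item` and `drill_across` are hypothetical, and all rows are made up.

```python
from collections import defaultdict

# Made-up fact rows: (time_key, item_key, measure value).
sales_facts = [(1, 100, 5), (1, 101, 2), (2, 100, 3)]      # units_sold
shipping_facts = [(1, 100, 4), (2, 100, 3), (2, 101, 1)]   # units_shipped (hypothetical)


def roll_up_by_item(facts):
    """Sum the measure over all time keys, leaving one total per item_key."""
    totals = defaultdict(int)
    for _time_key, item_key, value in facts:
        totals[item_key] += value
    return dict(totals)


def drill_across(sales, shipping):
    """Combine measures from two fact tables along the shared item dimension."""
    sold = roll_up_by_item(sales)
    shipped = roll_up_by_item(shipping)
    item_keys = sorted(sold.keys() | shipped.keys())
    return {k: (sold.get(k, 0), shipped.get(k, 0)) for k in item_keys}


report = drill_across(sales_facts, shipping_facts)
print(report)
```

Each entry of `report` pairs the sold and shipped totals for one item, which is only meaningful because both fact tables use the same item dimension.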
Operational meta-data
performance
Business data
business terms and definitions, ownership of data, charging
policies
Multi-Tier Data Warehouse
Distributed Data Marts
mining
OLAP tools
OLAP-based exploratory data analysis
mining with drilling, dicing, pivoting, etc.
and tasks.
Architecture of OLAM
An OLAM Architecture
[Figure: four-layer OLAM architecture, top to bottom]
Layer 4, User Interface: mining query in, mining result out
User GUI API
Layer 3, OLAP/OLAM: OLAM Engine, OLAP Engine
Layer 2, MDDB: MDDB, Meta Data
Database API: Filtering & Integration, Filtering
Layer 1, Data Repository: Databases, Data Warehouse (data cleaning, data integration)
Lecture-11 & 12
[Figure: 3-D array with dimensions A (a0 to a3), B (b0 to b3) and C (c0 to c3), partitioned into 64 chunks numbered 1 to 64; chunks 1 to 4 lie along A at b0, c0, and chunks 61 to 64 lie along A at b3, c3]
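The chunk numbers visible in the figure (for example 1 to 4 along A at b0, c0; 13 to 16 at b3, c0; 29 to 32 at b3, c1; 61 to 64 at b3, c3) are consistent with numbering the chunks along A first, then B, then C. The sketch below encodes that ordering; it is an inference from the figure labels rather than a formula stated in the slides, and the function names `chunk_number` and `chunk_indices` are hypothetical.

```python
def chunk_number(a, b, c, parts=4):
    """1-based chunk number for partition indices a, b, c (each in 0..parts-1).

    Assumes chunks are numbered along A first, then B, then C.
    """
    for idx in (a, b, c):
        if not 0 <= idx < parts:
            raise ValueError(f"partition index {idx} out of range 0..{parts - 1}")
    return 1 + a + parts * b + parts * parts * c


def chunk_indices(n, parts=4):
    """Inverse of chunk_number: return (a, b, c) for a 1-based chunk number n."""
    if not 1 <= n <= parts ** 3:
        raise ValueError(f"chunk number {n} out of range 1..{parts ** 3}")
    c, rest = divmod(n - 1, parts * parts)
    b, a = divmod(rest, parts)
    return a, b, c


# Spot checks against labels visible in the figure.
print(chunk_number(0, 0, 0))   # chunk at a0, b0, c0
print(chunk_number(0, 3, 1))   # chunk at a0, b3, c1
print(chunk_indices(64))       # indices of the last chunk
```

Under this ordering, scanning chunks 1 to 64 in sequence sweeps the A dimension fastest and the C dimension slowest.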