Professional Documents
Culture Documents
Bilal Hussain
• Course Outlines:
1. Introduction & Background.
2. De-Normalization.
3. OLAP & Dimensional Modeling.
4. ETL and Data Quality Management (DQM).
5. Database Performance (Parallelism, Partitioning).
6. ETL Implementation using ODI.
7. Data Visualization using OBIEE.
8. Project (Design Data warehouse for any organization using any
ETL and BI Tool).
Course
Week #
plan: Assignment # Quiz No
1
2
3 Assign #1 Quiz # 1
4
5
6 Assign #2 Quiz # 2
7
8
9 Mid-Term
10 Assign #3 Quiz # 3
11
12 Assign #4 Quiz # 4
13
14
15
16 Final Exam
Recap:
• Dr.EF Code 12 Rules.
• Normlization.
• Constraints.
• De-Normalization.
• DM
ETL/ELT
The process of extracting data from source systems and bringing it into
the data warehouse is commonly called ETL. Which stands for Extract,
transformation and loading.
Why ETL?
• A Data Warehouse provides a common data repository.
• ETL provides a method of moving the data from various sources into a data
warehouse.
• As data sources change, the Data Warehouse will automatically updated.
• Well-designed and documented ETL system is almost essential to the success of a
Data Warehouse project.
• Allow verification of data transformation, aggregation and calculations rules.
• Perform complex transformations and requires extra space to store the data.
• Convert to the various formats and types to one consistent system.
• ETL is a predefined process for accessing and manipulating source data into the
target database.
• It helps to improve productivity.
ETL Process:
• Extract: Capture Data from source system.
• Full Extraction.
• Incremental Extraction(Timesatmp, UniqueID, Triggers).
• Efficient when changes are identified.
• Identification could be costly.
• Very challenging.
Platform OS DBMS
Accuracy Qualitatively assessing lack of error, high accuracy corresponding to small error.
Completeness The Degree to which values are present in the attributes that require them.
Reliability Reliability means piece of information doesn't contradict another piece of information.(DOB)
Interpretability the extent to which data is in appropriate language, symbols and units and the definition are clear.
Accessibility The extent to which data is available or easily and quickly retrievable.