You are on page 1of 7

ETL

The Extract-Transform-Load (ETL)


system is the foundation of the data
warehouse.
• Extracting data from outside sources

• Transforming it to fit operational needs (which


can include quality levels)

• Loading it into the end target (database or


data warehouse)
.
What is ETL
• ETL is both a simple and a complicated subject

• ETL system extracts data from the source systems, enforces


data quality and consistency standards, conforms data so that
separate sources can be used together, and finally delivers
data in a presentation-ready format.

• Although building the ETL system is a back room activity that


is not very visible to end users, it easily consumes 70 percent
of the resources needed for implementation and
maintenance of a typical data warehouse.
Extract
The first part of an ETL process involves extracting the data from
the source systems.

Extraction is the operation of extracting data from a source


system for further use in a data warehouse environment.

An essential part of the extraction involves the parsing of


extracted data, resulting in a check if the data meets an
expected pattern or structure.
Methods of Exraction
The extraction method you should choose is highly dependent on the source
system and also from the business needs in the target data warehouse
environment.

1. Logical Extraction methods


a) Full Extraction
b) Incremental Extraction
2. Physical Extraction Methods
a) Online Extraction
b) Offline Extraction
3. Change Data Capture
a) Timestamps
b) Partitioning
C) Triggers
Transform
• The transform stage applies a series of rules or
functions to the extracted data from the source
to derive the data for loading into the end target
• It comprises of steps such as
1. Cleansing
2. Summarization
3. Derivation
4. Aggregation
5. integration
loading
• The load phase loads the data into the end
target, usually the data warehouse (DW).
Depending on the requirements of the
organization, this process varies widely.
• There are two types of tables in the database
structure: fact tables and dimensions tables

You might also like