You are on page 1of 37

Introduction

-Ashu Mehta
Database systems
Informatica products
• Informatica data pocket(idq):Maintaing quality of
data throughout process
• Informatica cloud data integrator:Working with cloud
or Integrating data on cloud
• Informatica enterprise data integrator:Integrating
data between different organizations
• Informatica B2B data exchange:For exchanging data
• Informatica Master data management
PowerCenter overview
• Contents
– Purpose of powercenter
• Explian the purpose of datacenter
– Use of powercenter
• Define terms used in powercenter
– Components of powercenter
• Name the major power center components
The problem
Solution
Informatica PowerCenter
• Informatica power centre can bring data from
these different platforms and store it in a data
warehouse.
• This unified view of data is then used by the
organization to perform enterprise wide
analysis to govern, monitor and plan the
company’s growth.
What is the methodology used by
Informatica power centre to create a
unified view of data?
How power centre uses ETL methodolgy for data warehousing?
• ETL involves extracting data from multiple source
systems
– Type of data:Transactional data-stores data on real time basis
– Format: Normalized or denormalized
• After extraction, data is transformed as per business
needs.
– Aggreagated, cleansed or consolidated applying business
rules.
– Denormalized for increased performance
Denormalization:Process of retaining redundant data to avoid
large no. of joins
• Transformed data is placed in datawarehouse
for decision making purposes
– Contains historical data
– Aggregated data to meet user needs
Data integration possibilities
with powercenter and
supporting tools
Use case-
• Power center connects to a wide array of
sources
• Transforms data into analytical information
and then load the data into a decision support
structure
PowerCenter and Data profiling
• ETL jobs makes assumptions about characteristics of data.
• Profiling determines if those assumptions are valid or not
• For successful processing of ETL data and producing
correct results, nature of data needs to be understood.
• The informatica analyst tool can profile the source data
to provide the detailed information about the data
which includes uniqueness, presence of NULL data, and
lists of values and patterns
• Analysts can apply business rules to the data and
profiler will determine if the data confirms
• Profiling through analyst tool is included in a
powercenter SE license
PowerCenter and data cleansing
• In many cases, profiling the data derives
requirement for data cleansing
• Instead of expanding ETL logic, developer tool
to cleanse the data can be used so that it
confirms to etl expectations
• Data cleansing requires an additional license
PowerCenter and data validation
• In a warehouse it is important to validate that
the data was loaded correctly
• There are 2 ways to do this:
– Data validation option which requires a separate
license and compare source and target data.
– Analyst tool to profile datawarehouse data.
Test Data Management
• In datawarehouse, there may be a need for non
production copies of data.
• Application development, testing and demoing all
require non production data
• To avoid the cost of full sized copies and exposure of
sensitive data, you csn use test data mangement.
• Allows you to provide subset of data or mask the
data.
• Separate license is required
Metadata manager
• Metadata manger ingests metadata from databases,
modeling tools, reporting tools, powercenter &
developer
• It organizes the metadata into lineage which shows
how metadata are inter related across different
systems.
• Lineage is valuable for impact analysis and report
auditing.
• Metadata manger is included in powercenter advanced
edition and data governance edition licenses.

You might also like