Professional Documents
Culture Documents
PowerCenter Basic Concepts
PowerCenter Basic Concepts
Ale Ribeiro
June 6, 2006
Agenda
What is PowerCenter?
PowerCenter Client Applications
Demo
PowerCenter Designer, Workflow Manager, Workflow Monitor
PowerCenter Architecture
PowerCenter
Is a single, unified enterprise data integration
platform that allows companies and government
organizations of all sizes to access, discover,
and integrate data from virtually any business
system, in any format, and deliver that data
throughout the enterprise at any speed
An ETL Tool (Extract, Transform and Load)
Development
Administration
Console
Repository
Manager
Manage repository
connections
folders
objects
users and groups
Administration Console
(browser-based)
Designer
Create ETL
Create and
mappings start workflows
Workflow
Manager
Workflow
Monitor
Monitor and
control
workflows
Target
Transformation Mapplet
Source
Designer:
Developer:
Designer:
Analyzer:
create
create source create target create reusable
objects
transformations mapplets
objects
Mapping
Designer:
create
mappings
Mapping
Logically Defines the ETL Process:
Reads data from sources
Applies transformation logic to data
Writes transformed data to targets
Source
Transformations
Target
Note: Sources and targets can be flat files, relational tables, XML files,
application systems, message queues, etc
Unit 1
Mapping (contd)
Example
Give me an Excel file with Total Order Amount per
Customer. I also need to know when this data was
extracted (date) and the customer type initial ( first letter
of the customer type)
Define the sources
Orders
Customers
Transformations
Generate, modify, or pass data
Data passes into and out of
transformations through ports that
you link in a mapping
Passive transformations do not
change the number of rows received
Active transformations can change
the number of rows received
Unit 1
11
Mapplet
Input & Output
transformations
(pass data from
or to mapping)
Unit 14
12
Mapplet
Mapplet Input
transformation
Mapplet Output
transformation
Unit 14
13
Recap
1.
2.
3.
4.
5.
ETL
Designer
Mapping
Transformation
Mapplet
a.
b.
c.
d.
14
Create worklets
Create workflows
15
Task
An executable set of actions, functions or
commands
Examples:
Session task runs a mapping
Command task runs a shell script
Email task sends an email
Decision task branches workflow conditionally
Timer task waits for a specified period
16
Session
Task that executes a mapping
Define Log Options, Error handling, Connections
17
Decision Task
Tests for a condition during the workflow and sets a flag based on
the condition
Use a link condition (or a Control task) downstream to test the flag
and control execution flow
Can use workflow variables in condition
Options on all
tasks to fail parent
and disable
Treat inputs as
AND/OR
Unit 16
18
Email Task
Sends an email within a workflow
Note: emails can also be sent post-session in a Session task
Unit 16
19
Unit 17
20
21
Command Task
23
Add Cmd
Remove Cmd
24
Reusable Tasks
Session, Email and Command tasks can be reusable
Use the Task Developer to create reusable tasks
Reusable tasks appear in the Navigator Tasks node and can be
dragged and dropped into any workflow
Unit 17
25
Worklet
An object representing a set or grouping of Tasks
Can contain any Task available in the Workflow Manager
Worklets expand and execute inside a Workflow
A Workflow which contains a Worklet is called the parent
Workflow
Worklets CAN be nested
Reusable Worklets create in the Worklet Designer
Non-reusable Worklets create in the Workflow Designer
Unit 18
26
Workflow
A collection of ordered tasks
Tasks can be linked sequentially, concurrently and/or combined
Links can be conditional on previous tasks completing
Unit 1
27
Workflow Structure
Workflow 1
1
Session 1
Worklet A
1
2
3
Session A1
Session A2
Session A3
Worklet B
Session B1Session B2
Worklet C
Session C1
Session C2
3
4
28
Workflow Schedule
Workflow can be scheduled to run continuously, repeat at a given time or
interval, or start manually.
The Integration Service runs a workflow unless the prior workflow run fails.
When a workflow fails, the Integration Service removes the workflow from the
schedule, and you must reschedule it
29
Workflow Monitor
Check Workflow Status
Recover Workflow
Get session log
30
Recap
1.
2.
3.
4.
5.
Workflow
Worklet
Task
Workflow Manager
Workflow Monitor
a.
b.
c.
d.
e.
Unit 1
31
PowerCenter Architecture
Domain
Sources
Integration
Service
Repository
Service
Repository
Service Process
Targets
Administration
Console
PowerCenter Client
Repository
32
Architecture Components
The Integration Service reads mapping and session information from the repository.
It extracts data from the mapping sources and stores the data in memory while it
applies the transformation rules that you configure in the mapping. The Integration
Service loads the transformed data into the mapping targets.
Metadata
Defines data and processes
Examples:
Source and target definitions
Type (flat file, database table, XML file, etc)
Datatype (character string, integer, decimal, etc)
Other attributes (length, precision, etc.)
Mapping logic
Workflow logic
Repository
34
Recap
Match the terms and explanations:
1. Metadata
2. Repository
3. Repository Manager
4. Integration Service
Unit 1
35
36
Databases
and Flat
Files
Messaging and
Standards
Hierarchical*
Software as a
Service
(SaaS)
Hyperion Essbase
DB2
HTTP
Adabas
salesforce.com
Lotus Notes
Flat files
IBM MQSeries
C-ISAM
PeopleSoft
Informix
JMS
SAP Netweaver BW
Netezza
LDAP
Datacom
SAS
SQL Server
MSMQ
IDMS
Siebel
Sybase
ODBC
IMS
Teradata
TIBCO Rendezvous
VSAM
Web logs
webMethods
Web Services
XML
37
Questions?
38