Professional Documents
Culture Documents
• Centralized Services
• Administration
• Logging & Auditing
PowerCenter 7 Mission-Critical
Advanced Edition Enterprise Deployment
One Product,
Single Install
Continuous Innovation
0:37
“With PowerCenter continually Session On Grid
Adaptive Load Balancing
leapfrogging on performance and High Availability
scalability, we are never concerned about Dynamic Partitioning
Pushdown Optimization
our ability to handle increasingly large data Unstructured Data
volumes in our data integration 3:36 Data Federation
Mapping Generation
Data Profiling
PowerCenter 8 Updated Partitioning
Standard
Edition Real-Time
PowerCenter Connects
Broader
Metadata Exchange
Repository
Web Services Repository Database
Hub Server
PowerExchange
Machine
Client Tools
Repository Manager Workflow Manger
Core Services
PowerCenter
Connects Repository Service
Domain/Gateway Services Log Service
• Administration & Authorization
• Configuration
PowerExchange • Domain
• Licensing
• Services
• A service is a resource that provides specialized functions.
• PowerCenter has two types of services. Application and
Core Services.
• PowerCenter Application Services – represents server based functions such
as Repository, Integration, SAP BW, and WebService Hub services.
• PowerCenter Core Services – represents functions that manage and
maintain the environment in which PowerCenter operates.
• Node
• A node is a logical representation of a physical machine. It has
physical attributes such as a hostname and port number.
• Each node runs a Service Manager which is responsible for the
application and core services.
• Is started when you start “Informatica Services”
• Domain
• A domain is the fundamental unit of PowerCenter Services
administraion.
• A domain is a logical collection or set of nodes and services that
you can group in a “folder like” deployment.
• Service Manager
• On the gateway node, the Service Manager is responsible
for
• Controlling the domain
• Manage services running on the domain
• Provide service lookup
• On all nodes, the Service Manager
• Controls the core services and application services
Repository Check
Repository Service point
Manager
Workflow Logs
Manager Master
Gateway
(Domain
Controller) Domain
Monitor Metadata
Administration
Console
Integration
Service
• Failover
• Restart for data integration, repository and other services
• Primary and backup servers
• Recovery
• Workflow and sessions will be recovered on running servers on
the grid during server failure
• Checkpoint recovery
• Repository recovery
• Resilience
• PowerCenter jobs will sustain transient failure
• Network errors
• DB connection failures
• DB Connection Resilience
• When connecting/disconnecting from a DB
• Oracle, DB2, Sybase, SQL Server and Teradata
• Retry interval based on timeout setting
• FTP Resilience
• For connections to FTP server
• Read/write will recover if connection lost based on timeout
parameter
• Internal Resilience
• PowerCenter components (integration service, clients etc.)
resilient to Repository service failure
• Simple environment
• 1 Domain which consists of: Node01
(Int_Svc01)
• 2 nodes for Integration Services
• node01 - Primary
• node02 - Backup
• 1 server for repository.
Repository DB
Node02
(Int_Svc02)
Repository DB
node02
(Int_Svs02)
Automatic
Failover
Restart
Recovery
Services Configuration
Remember pmserver config file?
Domain
Example
Primary
& Backup
Repository Nodes
Service
Services
Automatically
recover, restart Failed
on live server Hardware
Server
PowerCenter
Domain
Controller
Distributed
processing
of sessions
PowerCenter
Domain on
Server Grid
PowerCenter 7
• ServerGrid is collection of
pmservers
• Work is directed to
individual pmservers
• Work distributed across Grid
in round-robin manner
• Session/task is lowest unit
of work
• Benefits
• Reduce movement of data when source and target are the
same database instance
• Utilize database-specific processing that may be more
optimal
• Maintain metadata and lineage in PowerCenter
• Partial Source:
• One or more transformations can be processed in source database
• Partial Target :
• One or more transformations can be processed in target database
• Generated SQL:
• INSERT INTO t (…) VALUES (?+1, SOUNDEX(?))
• Parameter Enhancements
• Table owner name for relational sources/targets
• E-mail address
• FTP remote file name
• Database Partitioning
• Partitioned Oracle and DB2 sources can be read in parallel
• No changes to targets. DB2 can be written to in parallel.
• Dynamic Partitioning
• Based on # of partitions in database
• Based on the # of nodes in a Grid
©Informatica
Informatica Corporation,
confidential.2006.
ForAll rights reserved.
discussion purposes only.
38
AutoCache Overview
• Cache in PowerCenter v7
• Default cache settings not adequate for all situations.
• Default settings can underestimate new chip technologies.
• Sometimes necessary to hand tune individual transformations.
• Development did not always scale when deployed to different
production machines.
• Calculate based on
the number of rows
and the ports going
into the object
• Value is propogated
into the Cache
value
• Custom Functions
• Extend the functionality of the Expression Transformation
via a C API
• All 20+ functions above were added via this API
Simply Right-Click
on an object…
Mapping Template
Template Inputs
PC 7.x PC 8.x
• 3 Tier Architecture • Services Oriented Architecture
• Basic Grid Deployment • Enhanced Grid Deployment
• Introduction to Profiling • High Availability
• Session on Grid
• Added Transformations
• Resilience
• Union
• XML • Enhanced Profiling
• Web Services • Added Transformations
• Team Based Development • Java
• SQL
• Enhanced Productivity
• Mapping Generation
• User Defined Functions