Informatica Corporation PowerCenter Version 8.

0 New Features and Enhancements
December 2005
Copyright 1998-2005 Informatica Corporation

Contents
Informatica Distributed Services Architecture Command Line Programs Datatypes Data Profiling Documentation Installation Integration Service (PowerCenter Server) Architecture Partitioning FTP Recovery Environment SQL Globalization Performance Improvements Repository Administration Architecture Meta Integration Model Bridge Object Queries Versioned Objects Deployment Security Transformation Language Transformations XML Usability PowerCenter Data Analyzer (PowerAnalyzer) PowerCenter Metadata Manager (SuperGlue) 2 2 3 3 3 4 4 4 4 5 5 5 5 5 6 6 6 6 6 7 7 7 7 7 8 8 8 8

1

PowerCenter Connect for IBM MQSeries 9 PowerCenter Connect for SAP NetWeaver mySAP Option (PowerCenter Connect for SAP R/3) 9 PowerCenter Connect for SAP NetWeaver BW Option (PowerCenter Connect for SAP BW) 9 PowerCenter Repository Reports (PowerCenter Metadata Reporter) 10

Informatica Distributed Services Architecture
Domains and nodes. PowerCenter 8.0 allows you to scale services and share resources across multiple machines. PowerCenter introduces a PowerCenter domain, the primary unit of administration for the PowerCenter environment, and a PowerCenter node, the logical representation of a machine in a domain. Services. PowerCenter provides application and core services. Application services represent PowerCenter server-based functionality such as the Repository Service, Integration Service, and SAP BW Service. Core services support the domain and application services. Core services include a Log Service, Licensing Service, and Domain Service. High availability. You can use the high availability option to eliminate single points of failure in the PowerCenter environment and reduce service interruptions in the event of failure. High availability provides resilience, failover, and recovery for services. Administration. PowerCenter provides centralized administration through the Administration Console. You perform all administration tasks for the domain and services in the Administration Console. This includes administration of the Repository Service. Authentication Service. PowerCenter uses the Authentication Service to authenticate users for the Administration Console and the repository. For repository authentication, you can use LDAP or native authentication. To use LDAP authentication, you configure authentication at the domain level and register the module with each repository you want to use LDAP authentication. Licensing Service. PowerCenter uses the Licensing Service to enable features, such as PowerCenter options and connectivity. You assign each application service to a license to enable the service. Log Service. The Log Service is a core service of the PowerCenter domain. It captures log events for all services within the domain. Log types include domain, services, workflows, and sessions. You can view logs in the Administration Console and in the Workflow Monitor.

Command Line Programs
infacmd. infacmd is a new command line program that allows you to administer PowerCenter domains and services. You can perform tasks such as enabling and disabling services and purging log events.

2

infasetup. infasetup is a new command line program that allows you to configure domains and nodes. Use infasetup to define domains and nodes, join domains, and configure node passwords, domain licenses, and domain service ports. pmcmd. pmcmd includes new commands to support Integration Service functionality and new syntax to connect to a domain. pmrep. pmrep now includes former pmrepagent commands and new syntax to connect to a domain.

Datatypes
Flat file definitions. You can configure flat file definitions to use the Integer or Double datatype.

Data Profiling
Functional Dependencies Analysis function. You can use the Functional Dependencies Analysis function to determine exact and approximate dependencies between columns in a source. Inter-Source Structure Analysis function. You can use the Inter-Source Structure Inference function to determine primary key-foreign key relationships among multiple sources. Candidate key and redundancy column analysis. Data Profiling can return column sets of up to seven columns for candidate key and redundancy analysis. Also, the column precision to evaluate candidate keys and analyze redundancies is increased. Domain Inference function. The Domain Inference function can infer the source datatype and categories of inferred patterns. Data Profiling reports show results for the new types of data in this function. Column Lookup domain. You can use a flat file or relational source column as a domain. Multiple join columns. You can specify up to six join conditions in the Referential Integrity Analysis and Join Complexity Evaluation functions. Reporting. Data Profiling provides reporting enhancements such as improved paging capabilities and a consistent layout between custom profile reports and auto profile reports. In reports for the column-level Business Rule Validation function, you can click the Drill Down link on the Verbose Report Summary to view the source rows.

Documentation
PowerCenter documentation includes the following new books: Administrator Guide. Provides information for administrators to manage the domain and services. Command Line Reference. Provides syntax and description for infacmd, infasetup, pmcmd, and pmrep. Performance Tuning Guide. Provides information about tuning the performance in PowerCenter.

3

Upgrade Guide. Provides information about new and changed functionality in PowerCenter, including tasks that you might need to perform after upgrading to a new version.

Installation
Installation packaging. The PowerCenter installation DVD installs Informatica Distributed Services and the PowerCenter Client. PowerCenter also includes a documentation CD that you can use to install PowerCenter documentation. PowerCenter Editions. Based on the license you have, you can install PowerCenter Standard Edition or PowerCenter Advanced Edition. Configuration. The installation program can create a domain, add a node to an existing domain, and create a Repository Service and Integration Service. For PowerCenter Services, Data Analyzer, and Metadata Manager, you can use an existing repository or create a new repository. It also provides configuration options. Upgrade. You can manually upgrade an existing PowerCenter Server or Repository Agent to an Integration Service or Repository Service.

Integration Service (PowerCenter Server)
Architecture
Service-oriented architecture. The Integration Service replaces the PowerCenter Server. If you have the high availability option, you can configure the Integration Service to fail over to another node if the current node becomes unavailable. Load Balancer. The Load Balancer dispatches tasks. If you run tasks on a grid, the Load Balancer dispatches Session and Command tasks to nodes that can run the tasks. Resources. PowerCenter requires resources to run tasks. Resources can include plug-ins, connection objects, file directories, and shared libraries. You define resources for nodes and tasks. The Load Balancer dispatches tasks to nodes where the resources are available. Grid object. The server grid is updated to incorporate a grid object into the PowerCenter domain. You create a grid object in the Administration Console, and you assign one or more nodes to run on the grid. Run workflow on a grid. You can configure a workflow to run on an Integration Service associated with the grid. When you run a workflow on a grid, the Integration Service distributes the workflow tasks to available nodes. Run session on a grid. You can configure a session to run on a grid if the parent workflow is associated with a grid. When you run a session on a grid, the Integration Service distributes reader, writer, and transformation threads to available nodes.

Partitioning
Database partitioning. You can use the database partitioning partition type with Oracle and DB2 sources.

4

Dynamic partitioning. The Integration Service can determine the number of partitions at run time based on the dynamic partitioning session property.

FTP
Partitioned FTP file targets. For partitioned target instances, the Integration Service writes to a remote merge file. For merged targets, the Integration Service creates a local merge file and transfers the merge file to the FTP host. Indirect FTP file sources. You can use an indirect file list for a flat file source instance located on a remote machine.

Recovery
Workflow recovery. If you have high availability, you can recover an interrupted workflow in the Workflow Manager or Workflow Monitor. When you recover a workflow, the Integration Service attempts to complete the original workflow run. The Integration Service restores the last good state of the workflow and restarts or resumes the interrupted tasks. Session task recovery. You can configure the recovery strategy to recover an interrupted session in a workflow. You can choose to resume, restart, or skip the Session task. When you recover the workflow, the Integration Service uses the recovery strategy to recover the session. Command task recovery. You can configure the recovery strategy to restart or skip the Command task when you recover a workflow.

Environment SQL
Transaction environment SQL. You can use transaction environment SQL to execute an SQL statement at the beginning of each transaction. This is in addition to the current connection environment SQL that executes each time the Integration Service connects to the database.

Globalization
Unicode repository. PowerCenter now supports a Unicode repository. You can choose the UTF-8 character set as the repository code page to store metadata from multiple languages. Code page validation. With expanded support for Unicode, PowerCenter code page validation rules are now less restrictive.

Performance Improvements
Pushdown optimization. The Integration Service can execute SQL against the source or target database instead of processing the transformation logic within the Integration Service.

5

Partitioned flat file targets. The Integration Service can create an individual target file for all partitions, a merge file for all partitioned flat file target instances, and a file list that references the individual target files for all target partitions. Append to flat file targets. The Integration Service can append data to existing flat file target and reject files. Flat file source and target commands. You can specify a command for any flat file source or target instance in a session. You can use a command to generate source data and you can use a command to process target data. Data compression. Repository data is compressed during communication among the Repository Service, Integration Service, and PowerCenter Client.

Repository
This section describes changes to the repository functionality.

Administration
Administration tool. You use the browser-based PowerCenter Administration Console to perform administrative tasks that were formerly done in the Repository Server Administration Console.

Architecture
Service-oriented architecture. In the Informatica distributed services architecture, each Repository Service manages a single repository. A Repository Service process is an instance of the Repository Service running on a particular node. If you have the high availability option, you can configure the Repository Service to fail over to another node if the current node becomes unavailable.

Meta Integration Model Bridge
Additional import/export functionality. You can use the Meta Integration Model Bridge from Meta Integration Technology, Inc. to exchange data with the following business intelligence and data modeling tools: CA AllFusion ERwin Data Modeler 4.x CA ERwin 3.x (ERX) and CA ERwin 3.5x (export only) Embarcadero ER/Studio Oracle Designer Sybase PowerDesigner

Object Queries
Non-versioned repositories. You can create and run queries against repositories that are not enabled for versioning.

6

Label parameter. The following operators are added to the label parameter: Contains, Does Not Contain, Starts With, Does Not Start With, Ends With, and Does Not End With.

Versioned Objects
Checking out and checking in objects. You can explicitly check out versioned repository objects. You can view checked-in objects in the workspace. Viewing older versions of objects. You can view older versions of objects in the workspace.

Deployment
Assign owner and group to a folder. You can assign an owner and owner group when you copy a folder or deployment group. Generate a deployment control file. You can generate a deployment control file when you copy a folder or deployment group with the Copy Wizard.

Security
Extended repository privileges. You can assign additional extended repository privileges to manage repository objects. Privileges include Manage Connection, Manage Label, and Manage Folder.

Transformation Language
User-defined functions. You can create user-defined functions in the Designer to use in transformations and workflow tasks. You can include the functions in expressions or other user-defined functions. Custom Functions SDK. You can develop functions written in C Language and integrate them using a set of API calls that come with PowerCenter. You can include the functions in mapping and workflow expressions. New functions. The transformation language has 23 new built-in functions such as COMPRESS, DECOMPRESS, and REVERSE.

Transformations
Custom transformation. The Custom transformation has the following enhancements: Procedures with thread-specific operations. You can include thread-specific operations in Custom transformation procedures and configure the transformation to process each partition using one thread. Partitioning enhancements. You can partition a Custom transformation or any transformation created from a Custom transformation locally or across a grid.

7

Java and C++ API. You can use Java and C++ to create the Custom transformation procedures. You can use the new API to access data in block mode and perform datatype conversions at the column level. Java transformation. You can use the Java transformation to create, compile, and debug Java code snippets that execute in-line Java code in the mapping. You can also use Java to call expressions in a Java transformation.

XML
Filter data with query predicate. You can filter XML source data in a session by including an XPath query predicate in an XML definition. You can use string, numeric, and boolean XPath functions to filter source data. Create a default namespace. You can use a default namespace to eliminate the namespace prefix in an XML target. Import part of an XML schema. You can import part of an XML schema when you import an XML file that uses the schema, but only contains some of the schema components. Use anySimpleType. You can import an XML schema that contains anySimpleType elements.

Usability
Propagate port description. In the Designer, you can edit a port description and propagate the description to other transformations in the mapping. Link ports in the workspace. In the Designer, you can link ports by dragging all ports from one mapping object to another mapping object in the workspace. Show background in iconic view. In the Designer and Workflow Manager, you can enable background colors for objects in iconic view.

PowerCenter Data Analyzer (PowerAnalyzer)
PowerAnalyzer renamed. PowerAnalyzer is renamed PowerCenter Data Analyzer.

PowerCenter Metadata Manager (SuperGlue)
SuperGlue renamed. SuperGlue is renamed PowerCenter Metadata Manager. 64-bit operating systems. Metadata Manager supports 64-bit operating systems. XConnects. The PowerCenter XConnect extracts additional metadata from the PowerCenter repository. PowerCenter XConnects now support parameterized connections, and data lineage is displayed for these connections. Data lineage diagrams. Data lineage diagrams display the following information: If the container of the data structure or transformation box is different from the actual parent, the actual parent name of the data structure or transformation displays in the lineage diagram.

8

Stored procedures are displayed as object structures. Public synonyms are displayed for all database tables across domains. Metadata Manager Configuration Console. Metadata Manager Configuration Console now works on the PowerCenter 8.0 platform. You can now register multiple source repository instances under the corresponding repository type.

PowerCenter Connect for IBM MQSeries
Improved real-time performance. The Integration Service can now commit source messages to the target at the end of a millisecond flush latency period. By default, the integer n in the FlushLatency(n) function represents seconds. However, you can configure an MQSeries session to change the value to milliseconds.

PowerCenter Connect for SAP NetWeaver mySAP Option (PowerCenter Connect for SAP R/3)
PowerCenter Connect for SAP R/3 renamed. PowerCenter Connect for SAP R/3 is renamed PowerCenter Connect for SAP NetWeaver mySAP Option. Improved performance. The Integration Service creates caches in memory for sessions for inbound IDoc mappings or DMI mappings. You can configure the total cache size and cache directory. Outbound IDoc syntax validation. You can configure an outbound IDoc session to check for invalid IDocs and write them to a relational or flat file target. Mandatory segments and groups in IDoc and DMI mappings. In the SAP/ALE IDoc Prepare transformation, SAP/ALE IDoc Interpreter transformation, and SAP DMI Prepare transformation, you can view mandatory segments and groups to help you understand which segments the transformations require in the IDoc or DMI message. Scheduling Business Content Integration. The listener workflow controls the scheduling of the processing workflows. The listener workflow receives DataSource data from SAP, loads the data to staging targets, and then starts the appropriate processing workflow for the received data.

PowerCenter Connect for SAP NetWeaver BW Option (PowerCenter Connect for SAP BW)
PowerCenter Connect for SAP BW renamed. PowerCenter Connect for SAP BW is renamed PowerCenter Connect for SAP NetWeaver BW Option. PowerCenter Integration Server for SAP BW (PCISBW) renamed. PowerCenter Integration Server for SAP BW is renamed to SAP BW Service. Loading by packet size. PowerCenter Connect for SAP NetWeaver BW Option loads BW data as a packet based on the packet size value you set in the session.

9

Load balancing. To support load balancing, the SAP BW Service records the BW host requesting data from PowerCenter. The Integration Service loads the data to the same requesting host. Process chains to load into SAP BW. You can create a BW Process chain to start a PowerCenter workflow that loads data into SAP BW. Improved logging. The SAP BW Service can capture messages from SAP BW when you run a BW session. You can view these log events in the PowerCenter Administration Console or in the SAP BW monitor. Recovery. You can enable a BW session for recovery. If the session fails, you can restart it in the Workflow Manager.

PowerCenter Repository Reports (PowerCenter Metadata Reporter)
PowerCenter Metadata Reporter renamed. PowerCenter Metadata Reporter is renamed PowerCenter Repository Reports. Server Grid List report. The Server Grid List report is replaced with the following reports: Integration Service List. Displays a list of all Integration Services that run on the grids and nodes configured in PowerCenter. Repository Information. Displays a list of all Repository Services that run on nodes configured in PowerCenter. It also provides information about whether a node is a primary node or a backup node. Metadata Extensions Usage report. The Metadata Extensions Usage report is replaced with the following reports: Metadata Extensions Usage (Mapping and Mapplet). Displays a list of all reusable metadata extensions. The report is organized by repository and folder and shows the number of mappings or mapplets that use each metadata extension. Metadata Extensions Usage (Source, Target and Transformation). Displays a list of all reusable metadata extensions. The report is organized by repository and folder and shows the number of sources, targets, or transformations that use each metadata extension. Metadata Extensions Usage (Workflow, Worklet and Session). Displays a list of all reusable metadata extensions. The report is organized by repository and folder and shows the number of workflows, worklets, or sessions that use each metadata extension.

10

Sign up to vote on this title
UsefulNot useful