Professional Documents
Culture Documents
Informatica 1
Informatica 1
Power Exchange
Power Center
Power Exchange
Power Channel
Metadata Exchange
Power Analyzer
Super Glue
In Informatica, all the Metadata information about source systems, target
systems and transformations are stored in the Informatica repository.
Informatica's Power Center Client and Repository Server access this
repository to store and retrieve metadata.
Source and Target:
Consider a Bank that has got many branches throughout the world. In
each branch data may be stored in different source systems like oracle,
sql server, terradata, etc. When the Bank decides to integrate its data
from several sources for its management decisions, it may choose one or
more systems like oracle, sql server, terradata, etc. as its data warehouse
target. Many organisations prefer Informatica to do that ETL process,
because Informatica is more powerful in designing and building data
warehouses. It can connect to several sources and targets to extract meta
data from sources and targets, transform and load the data into target
systems.
Guidelines to work with Informatica Power Center
and this is done during the mapping process from source to target.
Workflow Manager: Workflow helps to load the data from source to
target in a sequential manner. For example, if the fact tables are loaded
before the lookup tables, then the target system will pop up an error
message since the fact table is violating the foreign key validation. To
avoid this, workflows can be created to ensure the correct flow of data
Power Exchange:
Informatica Power Exchange as a stand alone service or along with Power
Center, helps organizations leverage data by avoiding manual coding of
data extraction programs. Power Exchange supports batch, real time and
changed data capture options in main frame(DB2, VSAM, IMS etc.,), mid
range (AS400 DB2 etc.,), and for relational databases (oracle, sql server,
db2 etc) and flat files in unix, linux and windows systems.
Power Channel:
This helps to transfer large amount of encrypted and compressed data
over LAN, WAN, through Firewalls, tranfer files over FTP, etc.
Meta Data Exchange:
Metadata Exchange enables organizations to take advantage of the time
and effort already invested in defining data structures within their IT
environment when used with Power Center. For example, an organization
may be using data modeling tools, such as Erwin, Embarcadero, Oracle
designer, Sybase Power Designer etc for developing data models.
Functional and technical team should have spent much time and effort in
creating the data model's data structures(tables, columns, data types,
procedures, functions, triggers etc). By using meta deta exchange, these
data structures can be imported into power center to identifiy source and
target mappings which leverages time and effort. There is no need for
informatica developer to create these data structures once again.
Power Analyzer:
Power Analyzer provides organizations with reporting facilities.
PowerAnalyzer makes accessing, analyzing, and sharing enterprise data
simple and easily available to decision makers. PowerAnalyzer enables to
gain insight into business processes and develop business intelligence.
With PowerAnalyzer, an organization can extract, filter, format, and
analyze corporate information from data stored in a data warehouse, data
mart, operational data store, or otherdata storage models. PowerAnalyzer
is best with a dimensional data warehouse in a relational database. It can
also run reports on data in any table in a relational database that do not
conform to the dimensional model.
Super Glue:
Superglue is used for loading metadata in a centralized place from several
sources. Reports can be run against this superglue to analyze meta data.
Power Mart:
Power Mart is a departmental version of Informatica for building,
deploying, and managing data warehouses and data marts. Power center
is used for corporate enterprise data warehouse and power mart is used
for departmental data warehouses like data marts. Power Center supports
global repositories and networked repositories and it can be connected to
several sources. Power Mart supports single repository and it can be
connected to fewer sources when compared to Power Center. Power Mart
can extensibily grow to an enterprise implementation and it is easy for
developer productivity through a codeless environment.
Note:This is not a complete tutorial on Informatica. We will add more Tips
and Guidelines on Informatica in near future. Please visit us soon to check
back. To know more about Informatica, contact its official website
Informatica - Transformations
In Informatica, Transformations help to transform the source data
according to the requirements of target system and it ensures the quality
of the data being loaded into target.
Transformations are of two types: Active and Passive.
Active Transformation
An active transformation can change the number of rows that pass
through it from source to target i.e it eliminates rows that do not meet the
condition in transformation.
Passive Transformation
A passive transformation does not change the number of rows that pass
through it i.e it passes all rows through the transformation.
Transformations can be Connected or UnConnected.
Connected Transformation
Connected transformation is connected to other transformations or
directly to target table in the mapping.
UnConnected Transformation
An unconnected transformation is not connected to other transformations
in the mapping. It is called within another transformation, and returns a
value to that transformation.
Following are the list of Transformations available in Informatica:
Aggregator Transformation
Expression Transformation
Filter Transformation
Joiner Transformation
Lookup Transformation
Normalizer Transformation
Rank Transformation
Router Transformation
Sorter Transformation
External Transformation
In the following pages, we will explain all the above Informatica
Transformations and their significances in the ETL process in detail.
Aggregator Transformation
Aggregator transformation is an Active and Connected transformation.
This transformation is useful to perform calculations such as averages and
sums (mainly to perform calculations on multiple rows or groups). For
example, to calculate total of daily sales or to calculate average of
monthly or yearly sales. Aggregate functions such as AVG, FIRST, COUNT,
PERCENTILE, MAX, SUM etc. can be used in aggregate transformation.
Expression Transformation
Expression transformation is a Passive and Connected transformation. This
can be used to calculate values in a single row before writing to the
target. For example, to calculate discount of each product or to
concatenate first and last names or to convert date to a string field.
Filter Transformation
Filter transformation is an Active and Connected transformation. This can
be used to filter rows in a mapping that do not meet the condition. For
example, to know all the employees who are working in Department 10 or
to find out the products that falls between the rate category $500 and
$1000.
Joiner Transformation
Joiner Transformation is an Active and Connected transformation. This can
be used to join two sources coming from two different locations or from
same location. For example, to join a flat file and a relational source or to
join two flat files or to join a relational source and a XML source.
In order to join two sources, there must be atleast one matching port. at
least one matching port. While joining two sources it is a must to specify
one source as master and the other as detail.
The Joiner transformation supports the following types of joins:
Normal
Master Outer
Detail Outer
Full Outer
Normal join discards all the rows of data from the master and detail
source that do not match, based on the condition.
Master outer join discards all the unmatched rows from the master
source and keeps all the rows from the detail source and the matching
rows from the master source.
Detail outer join keeps all rows of data from the master source and the
matching rows from the detail source. It discards the unmatched rows
from the detail source.
Full outer join keeps all rows of data from both the master and detail
sources.
Lookup Transformation
Lookup transformation is Passive and it can be both Connected and
UnConnected as well. It is used to look up data in a relational table, view,
or synonym. Lookup definition can be imported either from source or from
target tables.
For example, if we want to retrieve all the sales of a product with an
ID 10 and assume that the sales data resides in another table. Here
instead of using the sales table as one more source, use Lookup
transformation to lookup the data for the product, with ID 10 in sales
table.
Difference between Connected and UnConnected Lookup
Transformation:
Repository Login
Repository Manager
Folders main contain Nodes (subfolders) - Sessions, Batches, Sources,
Targets, Transformations, Mapplets (reusable sets of transformations) and
Mappings. They in turn may contain corresponding individual repository
objects - sessions, batches, sources, targets, transformations, mapplets and
mappings, as well as shortcuts, batches, and session logs.
Below the Navigator and the Main Window you may see two more windows:
Windows:
Mapping Designer
Example of a mapping
Overview window
Note: you can open several workspaces (workbooks) - choose Window - New
Window, and then select appropriate tool.
To make a mapping:
Note: Source has only Ouput ports, Source Qualifier has both input and
output ports.
Here are some transformations:
Chose Layout-Link Columns, drag needed fields from Source Qualifier to the
Transformation, double-click on the title bar of the transformation to edit the
transformation.
In the "Edit Transformations" dialog box you can check/uncheck necessary
options (I/O ports, Group-By), add new ports as necessary, edt the
expressions for each port (and validate them).
You can click on the Expression field - and edit expression in the Expression
Editor:
You can chain transformations. You can do joins between tables in different
databases using "Lookup" transformation to create local copy of the data. You
Sessions are sets of instructions for Informatica Server when and how to
move data from sources to targets.
Server Manager - a client application used to create and manage sessions
and batches, and to configure session connections. You can monitor multiple
Windows:
Select the session wit the mouse - and choose Server Requests Start (or click on the start button on the toolbar).
======================
Informatica Repository Manager
Q. What type of repositories can be created using Informatica
Repository Manager?
A. Informatica PowerCenter includeds following type of repositories :
IBM DB2
Informix
Microsoft Access
Microsoft Excel
Microsoft SQL Server
Oracle
Sybase
Teradata
IBM DB2
Informix
Oracle
Sybase
Teradata
Aggregator
Application Source Qualifier
Custom
Expression
External Procedure
Filter
Input
Joiner
Lookup
Normalizer
Output
Rank
Router
Sequence Generator
Sorter
Source Qualifier
Stored Procedure
Transaction Control
Union
Update Strategy
XML Generator
XML Parser
XML Source Qualifier
ports) to the source column values based on the lookup condition. Later
returned values can be passed to other transformations.
Q. Can a lookup be done on Flat Files?
A. Yes.
Q. What is the difference between a connected look up and
unconnected look up?
A. Connected lookup takes input values directly from other transformations in
the pipleline.
Unconnected lookup doesnt take inputs directly from any other
transformation, but it can be used in any transformation (like expression)
and can be invoked as a function using :LKP expression. So, an unconnected
lookup can be called multiple times in a mapping.
Q. What is a mapplet?
A. A mapplet is a reusable object that is created using mapplet designer. The
mapplet contains set of transformations and it allows us to reuse that
transformation logic in multiple mappings.
Q. What does reusable transformation mean?
A. Reusable transformations can be used multiple times in a mapping. The
reusable transformation is stored as a metadata separate from any other
mapping that uses the transformation. Whenever any changes to a reusable
transformation are made, all the mappings where the transformation is used
will be invalidated.
Q. What is update strategy and what are the options for update
strategy?
A. Informatica processes the source data row-by-row. By default every row is
marked to be inserted in the target table. If the row has to be
updated/inserted based on some logic Update Strategy transformation is
used. The condition can be specified in Update Strategy to mark the
processed row for update or insert.
Following options are available for update strategy :
DD_INSERT : If this is used the Update Strategy flags the row for
insertion. Equivalent numeric value of DD_INSERT is 0.
DD_UPDATE : If this is used the Update Strategy flags the row for
update. Equivalent numeric value of DD_UPDATE is 1.
DD_DELETE : If this is used the Update Strategy flags the row for
deletion. Equivalent numeric value of DD_DELETE is 2.
DD_REJECT : If this is used the Update Strategy flags the row for
rejection. Equivalent numeric value of DD_REJECT is 3.