Professional Documents
Culture Documents
Contents
• Introduction
• Clients
• Server(s)
• Source, Target, Repository
• Connectivity
What is Informatica?
• Allows you to load data into a centralized
location, such as a datamart or data warehouse.
• ETL Tool
– Extract data from multiple sources
– Transform the data according to business logic
and needs
– Load the transformed data into file and
relational targets
Example
EMPLOYEE EMP_DETAILS
EMP_ID EMP_ID
EMP_NAME EMP_NAME
EMP_CITY EMP_CITY
EMP_STATE EMP_STATE
EMP_COUNT EMP_COUNTRY
RY Transform
EMP_DATE_O Date of YRS_OF_SRV
Joining to
F_JOINING
Yrs of
Service
Data Warehousing
Developer End User
Metadata
Extract
Transform
Load
Operational Sources Data Warehouse
Informatica Architecture
Design Process
Client Tool Review
– Repository Manager
– Designer
– Server Manager
Informatica’s Architecture
Data Models
Designer
Repository Manager Server Manager
Repository
Server
Informatica Design Process
2.
1.
3.
Source Def
5.
Mapping
Sessions
4. Target Def
1. Create Repository
2. Import Source Definitions
3. Create Target Schema
4. Create Mappings
5. Load Data
Informatica Client
Repository Manager – Can view much of the metadata in
the Repository through the Repository Manager.
Dependency Window
Output Window
Folder Attributes
Navigator
Workspace
Workbook Tabs
Output Window
Status Bar
Designer Options
Navigator
Configure Window
Monitor Window
Output Window
Designer
• Source Analyzer
• Warehouse Designer
• Transformation Developer
• Mapplet Designer
• Mapping Designer
Source Analyzer
• Identify the sources used to build the warehouse.
• Create repository definitions for these sources
Analyzing Sources
Relational – Oracle, Sybase, Informix, IBM DB2,
Microsoft SQL Server, and Teradata
File – Fixed and delimited flat file, COBOL file, and XML
Other – Microsoft Excel, Microsoft Access
Extended – PeopleSoft, SAP R/3, Sieble, and IBM
MQSeries (need to purchase additional products for these
sources)
Oracle
Sybase
Flat
File
Transformation Developer
• Generates ,modifies, passes data through ports
• 12 objects for transforming data
Transformations Types
• Source Qualifier represents all data queried from the
source
• Normalizer normalizes records from VSAM or
relational sources
• Expression performs simple calculations
• Filter serves as a conditional filter
• Aggregator performs aggregate calculations
• Rank limits records to top or bottom range
Transformations Types contd…
• Update Strategy allows for logic to insert, update,
delete, or reject data
• Lookup looks up values and passes to other
objects
• Stored Procedure calls a stored procedure and
captures return values
• External Procedure calls a procedure defined in a
shared library
• Sequence Generator generates unique ID values
• Joiner allows for heterogeneous joins
SourceQualifier Transformation
• Represents records that Informatica server reads
when it runs a session
• Automatically attached when a Source is added to a
mapping
Use a Source Qualifier to:
Filter Records when the Informatica Server reads
source data
Specify sorted ports
– Order by clause
Select only distinct values from a source
Create a custom query for the Informatica Server
to read source data
Expression Transformation
EMP_NAME EMP_NAME
PROJECT_CO PROJECT_CODE
DE PROJECT_DESC
LOOK UP
PROJECT Transform
ation
PROJECT_CO Get
PROJECT.PROJECT_DESC
DE Where
PROJECT_DE PROJECT.PROJECT_CODE
SC =
NM_EMP_DETAILS.PROJE
CT_CODE
Update Strategy
Two Ways Of doing
Within a Session
Within a Mapping
Update Strategy
Within a Session
Within a Mapping
Mapping
Mapping Designer
Transformation Toolbar
Mapping List
Iconized
Status Bar
Validation
Three different levels of validation:
Connection validation
Expression validation
Mapping validation
Connection Validation
Connecting ports with mismatched datatypes
Connecting output ports to a source
Connecting a source to anything but a Source
Qualifier or Normalizer
Connecting an output to a output, or an input
to a input
Connecting more than one active
transformation to another transformation
Copying columns to a target definition
Expression Validation
Parse the current expression, with remote port
searching (references to a port in another
transformation are resolved)
Parse expression attributes such as filter
condition, lookup condition, SQL Query, etc.
Parse default values
Mapping Validation
Mapping validation will take place with menu
commands:
Mapping | Validate
Repository | Save
Mapping validation will:
Perform connection validation
Perform expression validation
Check the mapping flow validation
Data from Source Qualifier mapped to a target
Targets are connected to transformations
Informatica Server
• Reads information from the Repository
Session Metadata
Source Def source information
Mapping target information
Target Def mapping
scheduling
Session error handling
pre- / post-session scripts
Server Manager tuning parameters
output log information
transformation overrides
Server
Targets
Sources
Definitions
• Session - A set of instructions that tells the
Informatica Server how and when to move data from
sources to targets
Navigator
Configure Window
Monitor Window
Output Window
Process
Configure server
Create session
Run session
Monitor session
View logs
Tune session
Server Configuration
Server Variables
Target
E-mail
Data
.bad
.log Event
Error
Log
Log
Source Settings - Session Wizard
Source
Select source type
File, Relational, Heterogeneous Treat rows as:
Source Type: Relational, File
Tells server how to treat source rows
Insert, update, delete, data driven
Works in conjunction with Target Options
Source Options...
Source Type: Relational
Database name
Source Type: File
Source Database
fixed / delimited properties
Source Type: Relational
file list
Database connection
FTP properties
Target Settings - Session Wizard
Target
Select target type
File, Relational
Flat File Options:
File properties
FTP properties
Loader properties
Target Options:
Target type: Relational
Tell server how to load target
Target Database
Insert, Update Delete
Target Type: Relational
Truncate target
Database connection
Bulk, Normal, Test
Launching Sessions
Manual start - Manually launch a session
from within the Server Manager
Session Schedule - Schedule a session using
business cycle start, stop, and repeat intervals
Batching - Use batches to run session
concurrently or sequentially
Event based - Configure a session to launch
based upon the appearance of an indicator file
Command Line - Launch a session from the
command line prompt
After the Session Launches...
Read/Write throughput
Log Files...
Log file path and name
Reject file path and name
Session log archive options
Connectivity Overview