Informatica Course Material

TABLE OF CONTENTS

Introduction
  About this Module
  Target Audience
  Module Objectives
  Prerequisite

Session 1: ETL & Informatica
  Learning Objectives
  ETL Overview
  ETL Terms
  Extraction, Transformation, Loading
  ETL Process Flow
  Detailed ETL Process Flow
  Popular ETL Tools
  Informatica: An Overview
  Informatica Architecture and Components
  Informatica Services Connectivity
  New Features
  Summary
  Test Your Understanding

Session 2: Integration Service
  Learning Objectives
  Integration Service Overview
  Integration Service Process
  Load Balancer
  Data Transformation Manager (DTM) Process
  Integration Service and the Administration Console
  Summary
  Test Your Understanding

Session 3
  Learning Objectives
  PowerCenter Repository
  Administration Console
  Creating Repository Service
  Creating Repository Content
  Deleting Repository Content
  Backing up a Repository
  Restoring a Repository
  Licensing
  Summary
  Test Your Understanding

Session 4
  Learning Objectives
  Repository Manager Overview
  Understanding the Repository Manager Windows
  Connecting to a Repository
  Managing Groups
  Managing Users
  Managing Privileges
  Managing Folders
  Viewing User Connections
  Viewing Locks
  Viewing Dependencies
  Try It Out
  Summary
  Test Your Understanding

Session 5: Mapping Designer
  Learning Objectives
  Designer Overview
  Source Analyzer
  Warehouse Designer
  Transformation Developer
  Source Qualifier Transformation
  Expression Transformation
  Filter Transformation
  Joiner Transformation
  Rank Transformation
  Router Transformation
  Normalizer Transformation
  Lookup Transformation
  Sorter Transformation
  Aggregator Transformation
  Sequence Generator Transformation
  Update Strategy Transformation
  Stored Procedure Transformation
  Custom Transformation
  XML Transformation
  Mapplet Designer
  Mapping Designer
  Mapping Parameters and Variables
  Parameter Files
  Try It Out
  Summary
  Test Your Understanding

Session 6: Workflow Manager
  Learning Objectives
  Manager Overview
  Source and Target Connection
  Tasks
  Sessions
  Workflows
  Try It Out
  Summary
  Test Your Understanding

Session 7: Workflow Monitor
  Learning Objectives
  Monitor Overview
  Workflow Monitor Views
  Running and Monitoring Workflows
  Scheduling Workflows
  Workflow and Session Logs
  Monitor Configuration
  Try It Out
  Summary
  Test Your Understanding

Session 8: Reusable Objects
  Learning Objectives
  Reusable Transformations
  Mapplet
  Worklets
  Reusable Workflow Tasks
  Try It Out
  Summary
  Test Your Understanding

Session 9: Advanced Topics
  Learning Objectives
  Versioning
  Deployment Groups
  Exporting and Importing Objects
  Copying Objects
  Debugger
  Data Recovery
  Partitioning
  Performance Data Collection
  Bottleneck
  Summary
  Test Your Understanding

References
  Websites
  Books

Introduction

About this Module
This handout:
- Introduces the PowerCenter tools
- Explains the basic concepts in ETL and Informatica
- Touches on the advanced topics in Informatica

Target Audience
Entry Level Trainees

Module Objectives
After completing this module, you will be able to:
- Explain the ETL process in detail
- Describe the basic concepts in Integration and Repository Service
- Work with Informatica Repository Manager
- Explain all the Designer tools
- Describe the transformations in Informatica
- Work with Workflow Manager
- Track all the workflows in the Workflow Monitor
- Work with Reusable Objects

Prerequisite
The trainee needs to have basic knowledge of RDBMS and Data Warehousing concepts.

Session 1: ETL & Informatica

Learning Objectives
- ETL Overview
- ETL Terms
- Extraction, Transformation, Loading
- ETL Process Flow
- Detailed ETL Process Flow
- Popular ETL Tools
- Informatica Overview
- Informatica Architecture and Components
- Informatica Services Connectivity
- New Features

ETL Overview
ETL stands for Extraction, Transformation and Loading. ETL is a process that involves the following tasks:
- Extracting data from source operational or archive systems, which are the primary source of data for the data warehouse
- Transforming the data, which may involve cleaning, filtering, validating and applying business rules
- Loading the data into a data warehouse or any other database or application that houses data

Source System
A source system is a database, application, file, or other storage facility from which the data in a data warehouse is derived. Examples are flat files, Oracle tables, Microsoft SQL Server tables, COBOL sources and XML files.

Mapping
A mapping is the definition of the relationship and data flow between source and target objects. It is a pictorial representation of the flow of data from source to target.

Metadata
Metadata describes data and other structures, such as objects, business rules, and processes. For example, the schema design of a data warehouse is typically stored in a repository as metadata, which is used to generate the scripts that build and populate the data warehouse. Metadata contains all the information about the source tables, the target tables and the transformations, which makes it easier to perform transformations during the ETL process. A repository contains metadata.

Staging Area
A staging area is a place where you hold temporary tables on the data warehouse server. Staging tables are connected to the work area or fact tables. The staging area is needed to hold the data and to perform data cleansing and merging before loading the data into the warehouse.

Data Cleansing
Data cleansing is the process of resolving inconsistencies and fixing the anomalies in source data, typically as part of the ETL process. Data cleansing technology improves data quality by validating, correctly naming and standardizing data. For example, a person's address may not be the same in all source systems because of typos, and the postal code or city name may not match the address. These errors can be corrected by the data cleansing process, and standardized data can then be loaded into the target systems (the data warehouse).

Transformation
Transformation is the process of manipulating data. Any manipulation beyond copying is a transformation. Examples include cleansing, aggregating, and integrating data from multiple sources. A transformation is a repository object that generates, modifies, or passes data.

Target System
A target system is a database, application, file, or other storage facility to which the "transformed source data" is loaded in a data warehouse.

Extraction
The first part of an ETL process involves extracting the data from the source systems. Most data warehousing projects consolidate data from different source systems. Each separate system may also use a different data organization or format. Common data source formats are relational databases and flat files, but may include non-relational database structures such as Information Management System (IMS) or other data structures such as Virtual Storage Access Method (VSAM) or Indexed Sequential Access Method (ISAM), or even fetching from outside sources such as web spidering or screen-scraping. Extraction converts the data into a format for transformation processing.

[Figure: extraction from Oracle, Sybase and text file sources into the target]

An intrinsic part of the extraction involves the parsing of extracted data, resulting in a check whether the data meets an expected pattern or structure. If not, the data may be rejected entirely.

Transformation
The transformation stage applies rules or functions to the extracted data to derive the data for loading into the end target. Some data sources will require very little or even no manipulation of data.
In other cases, one or more of the following transformation types may be required to meet the business and technical needs of the end target:
- Selecting only certain columns to load (or selecting null columns not to load)
- Translating coded values and automated data cleansing
- Encoding free-form values
- Deriving a new calculated value
- Filtering
- Sorting
- Joining data from multiple sources
- Aggregation
- Generating surrogate-key values
- Transposing or pivoting columns
- Splitting a column into multiple columns
- Applying any form of simple or complex data validation. If validation fails, it may result in a full, partial or no rejection of the data, and thus none, some or all of the data is handed over to the next step, depending on the rule design and exception handling. Many of the above transformations may result in exceptions, for example, when a code translation parses an unknown code in the extracted data.

Source:
Emp ID | Last Name | First Name
10001  | Jones     | Indiana
10002  | Holmes    | Sherlock

Staging Area:
Indiana Jones
Sherlock Holmes

In the source, the data available is the first name and the last name. To get the full name, the transformation logic concatenates the first and last name. The place where these transformations take place is called the staging area.

Loading
In the loading stage, some data is loaded to the target directly without applying any transformation logic, and some data is loaded to the target after applying the logic or business rules. The load phase loads the data into the end target, usually the data warehouse (DW). Depending on the requirements of the organization, the process varies widely. Some data warehouses may overwrite existing information with cumulative, updated data every week, while other DWs (or even other parts of the same DW) may add new data in a historized form, for example, hourly. The timing and scope to replace or append are strategic design choices dependent on the time available and the business needs.
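
The choice between overwriting and appending can be illustrated with a small Python sketch. It is not Informatica code: the function names `load_overwrite` and `load_append`, the `emp_id` key, the sample rows and the in-memory target structures are assumptions made for this illustration only.

```python
from datetime import datetime


def load_overwrite(target: dict, rows: list) -> None:
    """Replace existing target rows with the latest version, keyed by emp_id."""
    for row in rows:
        target[row["emp_id"]] = dict(row)


def load_append(history: list, rows: list, loaded_at: datetime) -> None:
    """Append every incoming row with its load time, keeping earlier versions."""
    for row in rows:
        history.append({**row, "loaded_at": loaded_at})


# Hypothetical data: the same employee arrives in two consecutive loads.
first_load = [{"emp_id": 10001, "full_name": "Indiana Jones"}]
second_load = [{"emp_id": 10001, "full_name": "Dr. Indiana Jones"}]

current = {}   # overwrite-style target: one row per key
history = []   # historized target: one row per key per load

loads = [(first_load, datetime(2024, 1, 1)), (second_load, datetime(2024, 1, 8))]
for rows, when in loads:
    load_overwrite(current, rows)
    load_append(history, rows, when)

print(len(current), len(history))
```

In this sketch the overwrite target ends up holding only the latest row, while the historized target keeps both versions together with their load times.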

More complex systems maintain a history and audit trail of all changes to the data loaded in the DW. As the load phase interacts with a database, the constraints defined in the database schema, as well as in triggers activated upon data load, apply (for example, uniqueness, referential integrity, mandatory fields). These also contribute to the overall data quality performance of the ETL process.

ETL Process Flow
During extraction, the desired data is identified and extracted from many different sources, including database systems and applications. Very often, it is not possible to identify the specific subset of interest; therefore more data than necessary has to be extracted, and the identification of the relevant data is done at a later point in time. After extracting the data, it has to be physically transported to an intermediate system for further processing. Depending on the chosen way of transportation, some transformations can be done during this process, too. For example, a SQL statement which directly accesses a remote target through a gateway can concatenate two columns as part of the SELECT statement.

Based on the requirements, some transformations may take place during the Transformation and Execution phase. Through Informatica mappings, the necessary changes and updates of the data are made using transformations.

Then, in the Load phase, the data is loaded into the target. After all the transformations, it has to be physically transported to the target system for loading into the target.

Detailed ETL Process Flow
[Figure: detailed ETL process flow, starting from the operational systems]
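
The extraction, transformation and load phases can be sketched end to end in Python. This is only an illustration of the flow, not how Informatica implements it: the in-memory SQLite databases, the table names `emp` and `emp_dim`, the third sample row and the rule that rejects rows with a missing name are all assumptions.

```python
import sqlite3


def extract(source: sqlite3.Connection) -> list:
    """Extraction: pull the raw rows from the source system."""
    return source.execute(
        "SELECT emp_id, last_name, first_name FROM emp ORDER BY emp_id"
    ).fetchall()


def transform(rows: list) -> tuple:
    """Transformation: validate rows and derive the full name in staging."""
    staged, rejected = [], []
    for emp_id, last_name, first_name in rows:
        if not last_name or not first_name:
            rejected.append(emp_id)  # validation failed: reject the row
            continue
        staged.append((emp_id, f"{first_name} {last_name}"))
    return staged, rejected


def load(target: sqlite3.Connection, staged: list) -> None:
    """Loading: write the transformed rows to the target table."""
    target.executemany(
        "INSERT INTO emp_dim (emp_id, full_name) VALUES (?, ?)", staged
    )
    target.commit()


# Hypothetical source and target, both held in memory.
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE emp (emp_id INTEGER, last_name TEXT, first_name TEXT)")
source.executemany(
    "INSERT INTO emp VALUES (?, ?, ?)",
    [(10001, "Jones", "Indiana"), (10002, "Holmes", "Sherlock"), (10003, "Doe", None)],
)

target = sqlite3.connect(":memory:")
target.execute("CREATE TABLE emp_dim (emp_id INTEGER PRIMARY KEY, full_name TEXT)")

staged, rejected = transform(extract(source))
load(target, staged)
loaded = target.execute(
    "SELECT emp_id, full_name FROM emp_dim ORDER BY emp_id"
).fetchall()
print(loaded, rejected)
```

The concatenation could instead be pushed into the extraction query, for example `SELECT first_name || ' ' || last_name FROM emp`, which corresponds to transforming the data as part of the SELECT statement.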

Popular ETL Tools

Tool Name                                 | Company Name
Informatica                               | Informatica Corporation
DT/Studio                                 | Embarcadero Technologies
DataStage                                 | IBM
Ab Initio                                 | Ab Initio Software Corporation
Data Junction                             | Pervasive Software
Oracle Warehouse Builder                  | Oracle Corporation
Microsoft SQL Server Integration Services | Microsoft
TransformOnDemand                         | Solonde
Transformation Manager                    | ETL Solutions

Informatica: An Overview
Informatica provides a single enterprise data integration platform to help organizations access, transform, and integrate data from a large variety of systems and deliver that information to other transactional systems, real-time business processes, and users. Informatica supports the activities of the Business Integration Competency Center (ICC) and other integration experts by serving as the foundation for data warehousing, data migration, consolidation, "single-view," metadata management and synchronization.

Informatica: Key Benefits
With Informatica you can:
- Integrate data to provide business users holistic access to enterprise data: data that is comprehensive, accurate, and timely.
- Scale and respond to business needs for information: deliver data in a secure, scalable environment that provides immediate data access to all disparate sources.
- Simplify design, collaboration, and re-use to reduce developers' time to results: unique metadata management helps boost efficiency to meet changing market demands.

Informatica: Key Features
- Enterprise-level data integration. Informatica ensures accuracy of data through a single environment for transforming, profiling, integrating, cleansing, and reconciling data and managing metadata.
- Security. Informatica ensures security through complete user authentication, granular privacy management, and secure transport of your data.
- Visual interface. Informatica is an easy-to-use tool with a simple visual interface for designing the integration.
- Developer productivity.
  Informatica simplifies design processes by making it easy to search and profile data, reuse objects across teams and projects, and leverage metadata.
- Compatibility. Informatica can communicate with a wide range of data sources and move huge volumes of data between them effectively.

Informatica Architecture and Components
When a workflow starts, the Informatica Server retrieves mapping, workflow, and session metadata from the repository to extract data from the source, transform it, and load it into the target. It also runs the tasks in the workflow. The Informatica Server uses Load Manager and Data Transformation Manager (DTM) processes to run the workflow.

Server Components
- Integration Service. The Integration Service manages the scheduling and execution of workflows. It reads mapping and session information from the repository, extracts data from the mapping sources and stores the data in memory while it applies the transformation rules that you configure in the mapping. It loads the transformed data into the mapping targets. You can achieve high performance using symmetric multi-processing systems, start and run multiple workflows concurrently, and concurrently process partitions within a single session.
- Repository Service. The Repository Service manages the metadata in the repository database and repository connection requests from client applications. For each repository database registered with the Repository Server, it configures and manages a Repository Agent process. It monitors the status of running Repository Agents, and sends repository object notification messages to client applications. The Repository Agent is a separate, multi-threaded process that retrieves, inserts, and updates metadata in the repository database tables. The Repository Agent ensures the consistency of metadata in the repository by employing object locking.
- Web Services Hub. The Web Services Hub is a gateway that exposes PowerCenter functionality to external clients.
  It receives requests from web service clients and passes them to the Integration Service or Repository Service. The Integration Service or Repository Service processes the requests and sends a response to the web service client through the Web Services Hub.
- SAP BW Service. The SAP BW Service listens for RFC requests from SAP BW and initiates workflows to extract from or load to SAP BW. The SAP BW Service is not highly available. You can configure it to run on one node.

Client Components
- Repository Server Administration Console. The Repository Server Administration Console is used to add, edit, and remove repository configurations. It is also used to export and import repository configurations, and to create, promote, copy, delete, back up and restore repositories.
- Repository Manager. The Repository Manager is used for administering the repository and performing folder functions such as creating, editing and deleting folders. It is also used to copy a folder within a repository or to another repository, compare folders, and implement repository security. It also allows you to create, edit and delete repository users and user groups, assign and revoke repository privileges and folder permissions, and view locks and unlock objects. You can import and export repository connection information in the registry, analyze source/target and mapping dependencies, and view the properties of repository objects.
- Designer. The Designer provides five tools with which you can create mappings. The Source Analyzer is used to import or create source definitions for flat file, ERP, and relational sources. The Warehouse Designer is used to import or create target definitions. The Transformation Developer is used to create reusable transformations. The Mapplet Designer is used to create mapplets. The Mapping Designer is used to create mappings.
- Workflow Manager. The Workflow Manager is used to create a workflow, which is a set of instructions to execute mappings.
  Generally, a workflow contains a session and any other task you may want to perform when you execute a session. Tasks can include a Session, Email, or Scheduling information. You connect each task with links in the workflow. You can also create a worklet in the Workflow Manager. A worklet is an object that groups a set of tasks.
- Workflow Monitor. The Workflow Monitor is used to monitor workflows and tasks. You can view details about a workflow or task in either Gantt Chart view or Task view. You can run, stop, abort, and resume workflows. It displays workflows that have run at least once. It continuously receives information from the Integration Service and Repository Service, and fetches information from the repository to display historic information.

Informatica Services Connectivity
The Integration Service process connects to the source or target database using ODBC or native drivers. It connects to the Repository Service to retrieve workflow and mapping metadata from the repository database. The Informatica Client tools, such as the Workflow Manager, communicate with the Integration Service and the Repository Service process over a TCP/IP connection.

[Figure: connectivity between the Informatica Client (Designer, Workflow Manager), the Integration Service, the sources and targets (native/ODBC) and the repository database (native/ODBC) over a network protocol]

New Features
New enhancements have been introduced in Informatica 7.1 in the following components:
- Data Profiling
- Licensing
- PowerCenter Server
- PowerCenter Metadata Reporter
- Repository Server
- Repository
- Transformations
- Usability
- Version Control
- Web Services Provider
- Workflow Monitor
- XML Support
- SuSE Linux Support
- Oracle OS Authentication
- Flat File Lookup

New enhancements have been introduced in Informatica 8.1 in the following components:
- Command Line Programs
- Data Analyzer (PowerAnalyzer)
- Data Profiling
- Domains
- Integration Service (PowerCenter Server)
- Mappings
- Metadata Manager (SuperGlue)
- Repository and Repository Service
- Transformations and Caches
- Usability
- Web Services Provider
- Workflow Monitor
- XML
- PowerCenter Connects
- PowerExchange Client for PowerCenter

Summary
- ETL stands for extraction, transformation and loading.
  - Extraction is fetching data from source operational or archive systems, which are the primary source of data for the data warehouse.
  - Transformation of the data involves cleaning, filtering, validating and applying business rules.
  - Loading is the process of loading the data into a data warehouse or any other database or application that houses data.
- Informatica provides a single enterprise data integration platform.
- With Informatica, you can achieve:
  - Enterprise-level data integration
  - Scalability
  - Developer productivity
- The client tools of Informatica include:
  - Designer
  - Workflow Manager
  - Workflow Monitor
  - Repository Manager
  - Repository Administration Console

Test Your Understanding
1. Expand and explain ETL.
2. ________ is the definition of the relationship and data flow between source and target objects.
3. The process of resolving inconsistencies and fixing the anomalies in source data is called ________.
4. A database, application, file, or other storage facility to which the "transformed source data" is loaded in a data warehouse is called ________.
5. Name the server components in Informatica.
6. Name the client components in Informatica.
7. Which service manages the scheduling and execution of workflows?
8. Which service manages the metadata in the repository database?
9. Which tool is used to create, delete, back up and restore repositories?
10. Which tool is used in administering the repository?

Session 2: Integration Service

Learning Objectives
- Integration Service Overview
- Integration Service Process
- Load Balancer
- Data Transformation Manager (DTM) Process
- Integration Service and the Administration Console

Integration Service Overview
The Integration Service moves data from sources to targets based on workflow and mapping metadata stored in a repository. When a workflow starts, the Integration Service retrieves mapping, workflow, and session metadata from the repository. It extracts data from the mapping sources and stores the data in memory while it applies the transformation rules configured in the mapping. The Integration Service loads the transformed data into one or more targets. The figure below shows the processing path between the Integration Service, repository, source, and target.

[Figure: processing path between the Integration Service, repository, source, and target]

To move data from sources to targets, the Integration Service uses the following components:
- Integration Service process. The Integration Service starts one or more Integration Service processes to run and monitor workflows. When you run a workflow, the Integration Service process starts and locks the workflow, runs the workflow tasks, and starts the process to run sessions.
- Load Balancer. The Integration Service uses the Load Balancer to dispatch tasks. The Load Balancer dispatches tasks to achieve optimal performance. It may dispatch tasks to a single node or across the nodes in a grid.
- Data Transformation Manager (DTM) process. The Integration Service starts a DTM process to run each Session and Command task within a workflow. The DTM process performs session validations, creates threads to initialize the session, read, write, and transform data, and handles pre- and post-session operations.

The Integration Service can achieve high performance using symmetric multi-processing systems. It can start and run multiple tasks concurrently. It can also concurrently process partitions within a single session.
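
Concurrent processing of partitions can be pictured as several workers that each handle a separate key range of one source. The Python sketch below illustrates the idea only: the helper names, the `id` key, the thread pool and the upper-casing step are assumptions for this example and do not reflect PowerCenter internals.

```python
from concurrent.futures import ThreadPoolExecutor


def partition_ranges(low: int, high: int, partitions: int) -> list:
    """Split the half-open key range [low, high) into contiguous sub-ranges."""
    size = high - low
    step = -(-size // partitions)  # ceiling division
    ranges = []
    for i in range(partitions):
        start = low + i * step
        stop = min(start + step, high)
        if start < stop:
            ranges.append((start, stop))
    return ranges


def process_partition(rows: list, key_range: tuple) -> list:
    """Extract one key range and transform it (here: upper-case the name)."""
    start, stop = key_range
    return [
        {"id": r["id"], "name": r["name"].upper()}
        for r in rows
        if start <= r["id"] < stop
    ]


def run_partitioned(rows: list, low: int, high: int, partitions: int) -> list:
    """Process every partition in its own worker and concatenate the results."""
    ranges = partition_ranges(low, high, partitions)
    with ThreadPoolExecutor(max_workers=max(1, len(ranges))) as pool:
        parts = pool.map(lambda rng: process_partition(rows, rng), ranges)
        return [row for part in parts for row in part]


# Hypothetical source rows with keys 1..10, split into three partitions.
source_rows = [{"id": i, "name": f"row{i}"} for i in range(1, 11)]
result = run_partitioned(source_rows, 1, 11, 3)
print(partition_ranges(1, 11, 3))
print(len(result))
```

Because the ranges are contiguous and do not overlap, every source row is handled by exactly one worker.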
When you create multiple partitions within a session, the Integration Service creates multiple database connections to a single source and extracts a separate range of data for each connection. It also transforms and loads the data in parallel.

An Integration Service can be created on a Windows or UNIX server machine. The Integration Service can be configured using the Administration Console or the pmcmd command line program.

Integration Service Process
The Integration Service starts an Integration Service process to run and monitor workflows. The Integration Service process accepts requests from the PowerCenter Client and from pmcmd. It performs the following tasks:
- Manages workflow scheduling.
- Locks and reads the workflow.
- Reads the parameter file.
- Creates the workflow log.
- Runs workflow tasks and evaluates the conditional links connecting tasks.
- Starts the DTM process or processes to run the session.
- Writes historical run information to the repository.
- Sends post-session email in the event of a DTM failure.

Managing Workflow Scheduling
The Integration Service process manages workflow scheduling in the following situations:
- When you start the Integration Service. When you start the Integration Service, it queries the repository for a list of workflows configured to run on it.
- When you save a workflow. When you save a workflow assigned to an Integration Service to the repository, the Integration Service process adds the workflow to or removes the workflow from the schedule queue.

Locking and Reading the Workflow
When the Integration Service process starts a workflow, it requests an execute lock on the workflow from the repository. The execute lock allows it to run the workflow and prevents you from starting the workflow again until it completes. If the workflow is already locked, the Integration Service process cannot start the workflow. A workflow may be locked if it is already running.

The Integration Service process also reads the workflow from the repository at workflow run time.
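
The effect of the execute lock can be sketched in Python. The `Repository` class, its method names, the workflow name and the `WorkflowLockedError` exception are hypothetical stand-ins for this illustration and are not part of any PowerCenter API.

```python
class WorkflowLockedError(RuntimeError):
    """Raised when a workflow is started while it already holds an execute lock."""


class Repository:
    """Toy repository that only tracks which workflows are locked."""

    def __init__(self) -> None:
        self._locked = set()

    def acquire_execute_lock(self, workflow: str) -> None:
        if workflow in self._locked:
            raise WorkflowLockedError(f"{workflow} is already running")
        self._locked.add(workflow)

    def release_execute_lock(self, workflow: str) -> None:
        self._locked.discard(workflow)


def run_workflow(repo: Repository, workflow: str, tasks: list) -> list:
    """Lock the workflow, run its tasks in order, and always release the lock."""
    repo.acquire_execute_lock(workflow)
    try:
        return [task() for task in tasks]
    finally:
        repo.release_execute_lock(workflow)


repo = Repository()


def start_same_workflow_again() -> str:
    # A second start while the first run holds the lock must be refused.
    try:
        run_workflow(repo, "wf_load_customers", [])
    except WorkflowLockedError:
        return "refused"
    return "started"


attempts = run_workflow(repo, "wf_load_customers", [start_same_workflow_again])
rerun = run_workflow(repo, "wf_load_customers", [lambda: "ok"])
print(attempts, rerun)
```

The nested start is refused while the first run is in progress, and the workflow can be started again once the first run has completed and released its lock.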
When it reads the workflow, the Integration Service process reads all links and tasks in the workflow except sessions and worklet instances. The DTM retrieves the session and mapping from the repository at session run time. The Integration Service process reads the worklets from the repository when the worklet starts.

Load Balancer
The Load Balancer is a component of the Integration Service that dispatches tasks to achieve optimal performance and scalability. When you run a workflow, the Load Balancer dispatches the Session, Command, and predefined Event-Wait tasks within the workflow. The Load Balancer matches task requirements with resource availability to identify the best node to run a task. It dispatches the task to an Integration Service process running on the node. It may dispatch tasks to a single node or across nodes.

The Load Balancer dispatches tasks in the order it receives them. When the Load Balancer needs to dispatch more Session and Command tasks than the Integration Service can run, it places the tasks it cannot run in a queue. When nodes become available, the Load Balancer dispatches tasks from the queue in the order determined by the workflow service level.

The following concepts describe Load Balancer functionality:
- Dispatch process. The Load Balancer performs several steps to dispatch tasks.
- Resources. The Load Balancer can use PowerCenter resources to determine if it can dispatch a task to a node.
- Resource provision thresholds. The Load Balancer uses resource provision thresholds to determine whether it can start additional tasks on a node.
- Dispatch mode. The dispatch mode determines how the Load Balancer selects nodes for dispatch.
- Service levels. When multiple tasks are waiting in the dispatch queue, the Load Balancer uses service levels to determine the order in which to dispatch tasks from the queue.

Data Transformation Manager (DTM) Process
When the workflow reaches a session, the Integration Service process starts the DTM process. The DTM is the process associated with the session task.
The DTM process performs the following tasks:
- Retrieves and validates session information from the repository.
- Performs pushdown optimization when the session is configured for pushdown optimization.
- Adds partitions to the session when the session is configured for dynamic partitioning.
- Forms partition groups when the session is configured to run on a grid.
- Expands the service process variables, session parameters, and mapping variables and parameters.
- Creates the session log.
- Validates source and target code pages.
- Verifies connection object permissions.
- Runs pre-session shell commands, stored procedures, and SQL.
- Sends a request to start worker DTM processes on other nodes when the session is configured to run on a grid.
- Creates and runs mapping, reader, writer, and transformation threads to extract, transform, and load data.
- Runs post-session stored procedures, SQL, and shell commands.
- Sends post-session email.

Integration Service and the Administration Console
The Integration Service is an application service that runs sessions and workflows. You install the Integration Service when you install PowerCenter Services. After you install the PowerCenter Services, you can use the Administration Console to manage the Integration Service. To access an Integration Service, you must have permission on the service in the domain.

You can use the Administration Console to complete the following Integration Service tasks:
- Create an Integration Service. You can create an Integration Service to replace an existing Integration Service or to use multiple Integration Services.
- Enable or disable the Integration Service. You enable the Integration Service to run sessions and workflows. You might disable the Integration Service to prevent users from running sessions and workflows while performing maintenance on the machine or modifying the repository.
- Configure normal or safe mode.
  Configure the Integration Service to run in normal or safe mode.
- Configure the Integration Service properties. You might need to configure the Integration Service properties to change the behavior of the Integration Service.
- Configure the associated repository. You must associate a repository with an Integration Service. The Integration Service uses the mappings in the repository to run sessions and workflows.
- Configure the Integration Service processes. Configure service process properties for each node, such as the code page and service process variables.
- Configure permissions on the Integration Service. You must have permission on an Integration Service to view it and full privileges to configure it.
- Remove an Integration Service. You may need to remove an Integration Service if it becomes obsolete. Use the Administration Console to remove an Integration Service.

Creating an Integration Service
You can create an Integration Service when you install PowerCenter. However, you may need to create an additional Integration Service to replace an existing one or to create multiple Integration Services. You can assign the repository when you create the Integration Service or after you create it. You must assign a repository before you can run the Integration Service. The repository that you assign to the Integration Service is called the associated repository. The Integration Service retrieves metadata, such as workflows and mappings, from the associated repository.

After you create an Integration Service, you must assign a code page for each Integration Service process. The code page for each Integration Service process must be a subset of the code page of the associated repository. You must select the associated repository before you can select the code page for an Integration Service process. The Repository Service must be enabled to set up a code page for an Integration Service process.
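
The subset requirement can be illustrated with Python's built-in codecs. This is a rough sketch under the assumption that "subset" means every character encodable in one code page is also encodable in the other. The function names are hypothetical, Python codec names stand in for PowerCenter code page names, and the check is limited to the Basic Multilingual Plane.

```python
def can_encode(char: str, code_page: str) -> bool:
    """Return True if the character exists in the given code page."""
    try:
        char.encode(code_page)
        return True
    except UnicodeEncodeError:
        return False


def is_code_page_subset(candidate: str, reference: str) -> bool:
    """Check that every character of `candidate` is also in `reference`."""
    for code_point in range(0x10000):
        if 0xD800 <= code_point <= 0xDFFF:
            continue  # surrogates are not characters
        char = chr(code_point)
        if can_encode(char, candidate) and not can_encode(char, reference):
            return False
    return True


# Hypothetical pairings: service process code page vs. repository code page.
print(is_code_page_subset("ascii", "latin-1"))
print(is_code_page_subset("latin-1", "utf-8"))
print(is_code_page_subset("cp1252", "latin-1"))
```

Under this assumption, a process code page of `cp1252` would not be accepted for a `latin-1` repository, because it contains characters such as the euro sign that `latin-1` cannot represent.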
To create an Integration Service, you must have full privileges and permission on the domain or folder where you want to create the service. You must also have permission on the grid or node where the service runs.

Note: If you configure an Integration Service to run on a node that is unavailable, you must start the node and configure $PMRootDir for the service process before you run workflows with the Integration Service.

To create an Integration Service:
1. In the Administration Console, click Create > Integration Service. The Create New Integration Service page appears.
2. Enter values for the Integration Service options. Click OK.
   [Figure: Create New Integration Service page]
3. If you do not specify an associated repository, the following message appears:
   No Repository Service is associated with this Integration Service. Select an associated Repository Service to view a list of available codepages.
   You must specify a Repository Service before you can enable the Integration Service.
4. If you specify an associated repository, the Create New Integration Service page appears.
   [Figure: Create New Integration Service page with code page options]
5. You can specify the code page for each Integration Service process node and select the Enable Service option to enable the service. If you do not specify the code page information now, you can specify it later. You cannot enable the Integration Service until you assign the code page for each Integration Service process node.

Enabling and Disabling the Integration Service
You can enable and disable an Integration Service process or the entire Integration Service. If you run the Integration Service on a grid or with the high availability option, you have one Integration Service process configured for each node. For a grid, the Integration Service runs all enabled Integration Service processes.
With high availability, the Integration Service runs the Integration Service process on the primary node.

Use the Administration Console to enable and disable a service process running the Integration Service. Each service process runs on one node. You must enable the service process if you want the node to perform Integration Service tasks. You may want to disable the service process on a node to perform maintenance on that node or to enable safe mode for the Integration Service. When you disable an Integration Service process, you must choose the mode to disable it in. You can choose one of the following options:
- Complete. Allows the sessions and workflows to complete before disabling the service process.
- Stop. Stops all sessions and workflows and then disables the service process.
- Abort. Attempts to stop all sessions and workflows before aborting them and disabling the service process.

To enable or disable an Integration Service process:
1. In the Navigator of the Administration Console, select the Integration Service.
2. Click the Processes tab. If you configure the Integration Service to run on a grid or on multiple nodes, the Processes tab displays one area for each node.
3. To enable a service process, click Enable for the applicable node.
4. To disable a service process, click Disable for the applicable node. The Disable Integration Service window appears.
5. Choose the disable mode and click OK.

When you enable the Integration Service, the service starts. The associated Repository Service must be started before you can enable the Integration Service. If you try to enable an Integration Service and the associated Repository Service is not running, the following error displays:

The Service Manager could not start the service due to the following error: [DOM_10076] Unable to enable service [ is enabled.
The service did not start. Please check the logs for more information.
To resolve the problem, review the logs for this Integration Service to determine the reason for the failure and fix the problem. After fixing the problem, you must disable and re-enable the Integration Service to start it.

Configuring the Integration Service Properties
Use the Administration Console to configure the following Integration Service properties:
- Grid and node assignments. Choose to run the Integration Service on a grid or node.
- General properties. Configure general properties, such as the data movement mode.
- Advanced properties. Configure advanced properties, such as the character set of the Integration Service logs and whether the Integration Service should check resources.
- Operating mode properties. Enable the Integration Service in normal or safe mode and configure the Integration Service to fail over in safe mode.
- Compatibility and database properties. Configure the source and target database properties, such as the maximum number of connections, and configure properties to enable compatibility with previous versions of PowerCenter.
- Configuration properties. Configure the configuration properties, such as the data display format.
- HTTP proxy properties. Configure the connection to the HTTP proxy server.
- Custom properties. Custom properties include properties that are unique to your PowerCenter environment or that apply in special cases. An Integration Service has no custom properties when you initially create it. Use custom properties only if Informatica Technical Support instructs you to.

To view and update properties, select the Integration Service in the Navigator. The Properties tab for the service appears.

Configuring the Associated Repository
When you create the Integration Service, you specify the repository associated with the Integration Service. However, you may need to change the repository connection information. For example, you need to update the connection information if the repository is moved to a new database.
You may need to choose a different repository when you move from a development repository to a production repository.

When you update or choose a new repository, you must specify the repository database user name and password, the Repository Service, and the associated domain. The Administration Console lists the Repository Services defined in the selected domain.

The Repository Service and Integration Service can be in different domains. If you want to choose a domain that is not listed in the Administration Console, you can add the connection information for the other domain. To select a Repository Service in a different domain, you must select the domain and Repository Service, and then enter the host and port number for the Repository Service.

To edit the associated repository properties, select the Integration Service in the Domain tab of the Administration Console, and then click the Properties tab > Associated Repository Properties > Edit.

Summary
- The Integration Service moves data from sources to targets based on workflow and mapping metadata stored in a repository.
- The components of the Integration Service are:
  - Integration Service Process
  - Load Balancer
  - Data Transformation Manager (DTM) Process
- The Administration Console is the tool used to create, configure, and remove an Integration Service.

Test your Understanding
1. True or False: The Integration Service can start and run multiple workflows concurrently.
2. Name the three components of the Integration Service.
3. Which tool is used to create, configure, and remove an Integration Service?
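As a recap of the three disable modes covered in this session, the following sketch models what happens to running workflows under each mode. It is only an illustration under simplifying assumptions: `DisableMode`, `disable_process`, and the `stoppable` set are hypothetical names invented here, not part of any Informatica API, and real services track far more state.

```python
from enum import Enum


class DisableMode(Enum):
    """Hypothetical labels for the three disable modes described in this session."""
    COMPLETE = "complete"
    STOP = "stop"
    ABORT = "abort"


def disable_process(mode, running, stoppable):
    """Return (finished, stopped, aborted) workflow names for a disable request.

    running   -- names of workflows currently running on the node (assumed input)
    stoppable -- subset of `running` that responds to a stop request (assumed input)
    """
    if mode is DisableMode.COMPLETE:
        # Complete: everything is allowed to finish before the process is disabled.
        return sorted(running), [], []
    if mode is DisableMode.STOP:
        # Stop: all sessions and workflows are stopped, then the process is disabled.
        return [], sorted(running), []
    # Abort: try to stop first, then abort whatever did not stop.
    stopped = sorted(w for w in running if w in stoppable)
    aborted = sorted(w for w in running if w not in stoppable)
    return [], stopped, aborted
```

For example, `disable_process(DisableMode.ABORT, {"wf_a", "wf_b"}, {"wf_a"})` reports `wf_a` as stopped and `wf_b` as aborted, which mirrors the "attempt to stop, then abort" wording of the Abort mode.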
Session 3: Repository Service

Learning Objectives
- PowerCenter Repository
- Administration Console
- Creating Repository Service
- Creating Repository Content
- Deleting Repository Content
- Backing up a Repository
- Restoring a Repository
- Licensing

PowerCenter Repository
The PowerCenter Repository is a relational database managed by the Repository Service. The repository consists of database tables that store metadata. Metadata describes different types of objects, such as mappings and transformations, that you can create or modify using the PowerCenter Client tools. The repository also stores administrative information such as user names, passwords, permissions, and privileges.

You can connect to and manage multiple repositories. A repository domain is a group of repositories that you can connect to simultaneously in the PowerCenter Client. They share metadata through a special type of repository called a global repository. When you configure shared folders in a repository, you can share the objects in the folder with other repositories in the repository domain. You share objects to reuse metadata.

The Repository Service manages repository metadata transaction requests from client applications. Each Repository Service manages a single repository. The Repository Service uses object locking to ensure the consistency of metadata in the repository. The Repository Service also notifies you when another user modifies or deletes repository objects you are using.

A Repository Service process is a multithreaded process that fetches, inserts, and updates metadata in the repository database tables. A Repository Service process is an instance of the Repository Service that runs on a particular machine, or node.

Administration Console
The Administration Console is the administration tool you use to administer the PowerCenter domain. If you have a user login to the domain, you can access the Administration Console. Use the Administration Console to perform administrative tasks such as managing logs, user accounts, and domain objects.
Domain objects include services, nodes, grids, folders, and licenses.

The domain administrator or any user with full privileges and permission on the domain can create a login to the domain. You must have a domain user account to log in to the Administration Console.

To log in to the Administration Console:
1. Open Microsoft Internet Explorer or Mozilla Firefox.
2. In the Address field, enter the following URL for the Administration Console login page:
   http://<host>:<port>/adminconsole
   In the URL, <host>:<port> represents the host name and port number of any gateway node. If you are not using the Internet Explorer Enhanced Security Configuration, you can enter the following URL, and the browser is directed to the full URL for the login page:
   http://<host>:<port>
   The Informatica PowerCenter Administration Console login page appears.
3. Enter the domain user name and password.
4. Click Login.
5. If this is the first time you log in with the user name and password provided by the domain administrator, change your password to maintain security.

Administration Console Tabs
The Administration Console has tabs that are accessible at all times when you are logged in. Use these tabs to perform tasks such as viewing log events or configuring service properties. The Administration Console has the following tabs:
- Domain. View and edit the properties of the domain and objects within the domain. The Domain tab has a Navigator and a Main window.
- Logs. View log events for the domain and services within the domain.
- Administration. Manage domain users in the Administration tab. This tab appears if you have permission on the domain.
- Upgrade. Upgrade repositories and Integration Services. This tab appears if you have permission on the domain.
- Manage Account. Manage your domain user profile. On this tab, you can change your domain user password and update your user profile.

Creating Repository Service
Create a Repository Service to manage the metadata in repository database tables. Each Repository Service manages a single repository.
You need to create a unique Repository Service for each repository in a PowerCenter domain.

To create a Repository Service:
1. In the Navigator of the Administration Console, select the folder where you want to create the Repository Service.
2. Click Create > Repository Service. The Create New Repository Service window appears.
3. Enter values for the Repository Service options.
4. Click OK.

Configuring Repository Service Properties
After you create a Repository Service, you can configure it. Use the Administration Console to configure the following types of Repository Service properties:
- General properties. Configure general properties, such as the operating mode.
- Node assignments. If you have the high availability option, configure the primary and backup nodes to run the service.
- Database properties. Configure repository database properties, such as the code page and connection string.
- Advanced properties. Configure advanced repository properties, such as the maximum connections and locks on the repository.
- Custom properties. Configure repository properties that are unique to your PowerCenter environment or that apply in special cases. Use custom properties only if Informatica Technical Support instructs you to do so.

To view and update properties, select the Repository Service in the Navigator. The Properties tab for the service appears.
Creating Repository Content
Repository content is the repository tables in the database. You can create repository content for a Repository Service if you did not create content when you created the service or if you deleted the repository content. You cannot create content for a Repository Service that already includes content.

To create content after creating a Repository Service:
1. In the Navigator, select a Repository Service that has no content associated with it.
2. In the Actions list, select Create Contents. The page displays the options to create content.
3. Optionally, choose to create a global repository. Select this option if you are certain you want to create a global repository. You can promote a local repository to a global repository at any time, but you cannot convert a global repository to a local repository.
4. Optionally, enable version control. You must have the team-based development option to enable version control. Enable version control if you are certain you want to use a versioned repository. You can convert a non-versioned repository to a versioned repository at any time, but you cannot convert a versioned repository to a non-versioned repository.
5. Click OK.

Deleting Repository Content
Delete repository content when you want to delete all metadata and repository database tables from the repository. You might delete the repository content if the metadata is obsolete. Deleting repository content is an irreversible action, so it is better to back up the repository before deleting it.

To delete repository content:
1. In the Navigator, select the repository from which you want to delete the content.
2. Change the operating mode of the Repository Service to exclusive.
3. In the Actions list, select Delete Contents.
4.
Enter a repository user name and password. The repository user must have the Admin Repository privilege to delete repository content.
5. If the repository is a global repository, choose to unregister local repositories when you delete the content. The delete operation does not proceed if it cannot unregister the local repositories. For example, if a Repository Service for one of the local repositories is running in exclusive mode, you may need to unregister that repository before you attempt to delete the global repository.
6. Click OK.

Backing up a Repository
Regularly back up repositories to prevent data loss due to hardware or software problems. When you back up a repository, the Repository Service saves the repository in a binary file, including the repository objects, connection information, and code page information. If you need to recover the repository, you can restore the content of the repository from this binary file.

To back up a repository:
1. In the Navigator, select the Repository Service for the repository you want to back up.
2. In the Actions list, select Back Up Contents.
3. Enter a repository user name and password. The repository user must have the Admin Repository or Super User privilege to perform backups.
4. Enter a file name and description for the repository backup file.
5. If you use the same file name that you used for a previous backup file, select whether or not to replace the existing file with the new backup file. To overwrite an existing repository backup file, select Replace Existing File. If you specify a file name that already exists in the repository backup directory and you do not choose to replace the existing file, the Repository Service does not back up the repository.
6. Choose to skip or back up workflow and session logs, deployment group history, and MX data. You might want to skip these operations to increase performance when you restore the repository.
7. Click OK.

Restoring a Repository
You can restore metadata from a repository binary backup file. When you restore a repository, you must have a database available for the repository. If a repository exists at the target database location, you must delete it before you restore a repository backup file.

PowerCenter restores repositories from the current version. If you have a backup file from an earlier version of PowerCenter or PowerMart, you can use the earlier version to restore the repository. Verify that the repository license includes the license keys necessary to restore the repository backup file.

To restore a repository:
1. In the Navigator, select the Repository Service that manages the repository content you want to restore.
2. In the Actions list, select Restore Contents. The Restore Contents options appear.
3. Select a backup file to restore. If the repository backup file is associated with a security module, the restore security options appear.
4. If the restore security options appear, you must choose whether to restore the repository with the external module or unregister the security module when you restore.
   If you choose to keep the external module, you must specify the repository administrator user name and password. Use the external module login that corresponds to the Administrator user for the repository.
   If you choose to unregister the security module, the Administrator account password becomes "Administrator." The other user names are disabled. You must register the security module again to activate the user names.
5. Select whether or not to restore the repository as new. When you restore a repository as new, the Repository Service creates internal folder IDs for the folders in the restored repository.
This enables you to copy folders and deployment groups between the original repository and the restored repository. If you do not restore the repository as new, you cannot copy folders and deployment groups between the original and restored repositories.
6. Optionally, choose to skip workflow and session logs, deployment group history, and MX data. Skipping the data improves performance.
7. Click OK.

Licensing
A PowerCenter license enables you to:
- Run application services. Application services include the Repository Service, Integration Service, SAP BW Service, and Web Services Hub.
- Use PowerCenter features. Features include connectivity, Metadata Exchange options, and other options, such as session on grid and high availability.

You can create a license object in a domain and assign the license to application services. You can create the license in the Administration Console using a license key file. The license key file contains an encrypted original key. You use the original key to create the license.

To create a license:
1. In the Administration Console, click Create > License. The Create License window appears.
2. Enter the options.
3. Click Create.

Summary
- The repository is a relational database with a unique Repository Service.
- The Repository Service is a separate, multithreaded process that retrieves, inserts, and updates metadata in the repository database tables.
- You can create, administer, back up, delete, and restore a repository.

Test your Understanding
1. What manages the connections to the PowerCenter Repository from client applications?
2. Where is a Repository Service created?
3. Where are the Informatica objects and the passwords and privileges saved?
4. The repository is a ____ database.
5. How many Repository Services can a repository have?
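As a recap of the backup file rule from this session (an existing backup file is overwritten only when Replace Existing File is selected), the check can be sketched as follows. This is a minimal illustration, assuming the backup directory is represented as a plain set of file names; `should_write_backup` is a hypothetical helper, not an Informatica function.

```python
def should_write_backup(existing_files, file_name, replace_existing):
    """Return True if a backup named `file_name` would be written.

    existing_files   -- file names already in the backup directory (assumed input)
    replace_existing -- whether the Replace Existing File option is selected
    """
    if file_name in existing_files and not replace_existing:
        # Same name, no permission to overwrite: the backup is skipped.
        return False
    return True
```

With `existing_files = {"repo_dev.rep"}`, a backup to `repo_dev.rep` is written only when `replace_existing` is true, while a backup to any new file name always proceeds.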
Session 4: Repository Manager

Learning Objectives
- Repository Manager Overview
- Understanding the Repository Manager Windows
- Connecting to a Repository
- Managing Groups
- Managing Users
- Managing Privileges
- Managing Folders
- Viewing User Connections
- Viewing Locks
- Viewing Dependencies

Repository Manager Overview
The Repository Manager is used to administer repositories. You can navigate through multiple folders and repositories, and complete the following tasks:
- Manage users and groups. Create, edit, and delete repository users and user groups. You can assign and revoke repository privileges and folder permissions.
- Perform folder functions. Create, edit, copy, and delete folders. Work you perform in the Designer and Workflow Manager is stored in folders. If you want to share metadata, you can configure a folder to be shared.
- View metadata. Analyze sources, targets, mappings, and shortcut dependencies, search by keyword, and view the properties of repository objects.

Use the Repository Manager to complete the following tasks:
- Add domain connection information. You can configure domain connection information.
- Add and connect to a repository. You can add repositories to the Navigator window and client registry and then connect to the repositories.
- Work with PowerCenter domain and repository connections. You can edit or remove domain connection information. You can connect to one repository or multiple repositories. You can export repository connection information from the client registry to a file. You can import the file on a different machine and add the repository connection information to the client registry.
- Search for repository objects or keywords. You can search for repository objects containing specified text. If you add keywords to target definitions, use a keyword to search for a target definition.
- View object dependencies. Before you remove or change an object, you can view dependencies to see the impact on other objects.
- Compare repository objects.
In the Repository Manager, you can compare two repository objects of the same type to identify differences between the objects.
- Truncate session and workflow log entries. You can truncate the list of session and workflow logs that the Integration Service writes to the repository. You can truncate all logs, or truncate all logs older than a specified date.
- View user connections. Use the Repository Manager to view and manage user connections.
- Release locks. Use the Repository Manager to release residual locks left by residual connections. Releasing a valid lock can result in repository inconsistencies, so release a lock only if the locked object is not in use.
- Exchange metadata with other business intelligence tools. You can export metadata to and import metadata from other business intelligence tools, such as Cognos ReportNet Framework Manager.

Understanding the Repository Manager Windows
The Repository Manager can display four main windows: Navigator window, Main window, Dependency window, and Output window. You can dock and undock the Navigator, Dependency, and Output windows. You can also hide and display the Navigator, Dependency, and Output windows.

Navigator Window
Use the Navigator window to connect to a repository and navigate through the folders and objects in the repository. The Navigator window displays the following types of objects:
- Repositories. PowerCenter repositories can be standalone, local, or global.
- Deployment groups. Deployment groups contain collections of objects for deployment to another repository in the repository domain.
- Folders. Folders can be shared or not shared.
- Nodes. Nodes contain sessions, sources, targets, transformations, mapplets, workflows, tasks, worklets, and mappings.
- Repository objects. Repository objects displayed in the Navigator can include sources, targets, transformations, mappings, mapplets, sessions, tasks, workflows, worklets, workflow logs, and session logs.
You can view the properties of the objects in the Navigator by selecting the object, and then clicking the View Properties button in the toolbar. You can also right-click the object in the Navigator and select Properties from the shortcut menu.

Main Window
The Main window of the Repository Manager displays information about the object selected in the Navigator. For example, if you select a repository in the Navigator, the Main window displays all the folders in the repository along with additional folder information, such as whether the folder is shared or in use.

Sorting and Organizing
You can sort items in the Main window by each of the columns. For example, to sort mappings by validity, select the mappings node, and then click the Valid column heading. Click the heading again to reverse the order in which the mappings are sorted.

You can also change the order in which the columns appear. For example, you might want the Valid column to appear first, on the left side of the Main window. To do this, drag the Valid column heading to the location. The Repository Manager displays the columns in the new order until you change the display.

Note: You can double-click an object in the Main window to view its properties.

Viewing Object Details
To view information about repository objects, select a node in the Navigator. Or, to view detailed information about a particular repository object, drill down on a node and select the object.

Dependency Window
The Dependency window appears when you configure the Repository Manager to display dependencies. You can view dependencies by using the menu items or the dependency buttons on the toolbar. You can also view dependencies using the Dependency dialog box.

When you view dependencies, the left pane of the Dependency window lists the object that has dependency information, and the dependency information appears in the right pane. The Dependency window can display the following types of dependencies:
- Source-target dependencies.
When you view source-target dependencies, the Dependency window lists all sources or targets related to the selected object, including relevant information about those sources or targets. For example, if you select a target, you view all sources that provide data for that target, along with information about each source.
- Mapping dependencies. When you view mapping dependencies, the Dependency window lists all mappings containing the selected object, and relevant information about those mappings. For example, if you select a reusable transformation, you view all mappings using that transformation and information about each mapping.
- Shortcut dependencies. When you view shortcut dependencies, the Dependency window lists all shortcuts to the selected object and relevant details, such as the repository and folder in which the shortcut exists. When you edit an object referenced by shortcuts, use the Repository Manager to see which folders contain the shortcut.

When you open the Dependency window, it displays dependencies for the object selected in the Navigator. You can also view mapping object dependencies in the Designer.

Output Window
The Repository Manager displays status messages in the status bar. For complex operations, the Repository Manager displays detailed information in the Output window. For example, when you connect to a repository, the status bar displays the following message:
   Accessing the repository.
After you connect to the repository, the status bar displays the word Ready.

When you perform a complex operation, such as copying a folder, the Repository Manager displays details about the operation in the Output window.

The Repository Manager receives notifications when the following objects are created, modified, or deleted:
- Folders
- Users
- Groups

You must be connected to the repository to receive notifications about folders. You need the Super User privilege to receive notifications about users and groups.
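The mapping dependencies described for the Dependency window amount to a reverse lookup: given an object, list every mapping that contains it. The sketch below illustrates that idea on made-up data. The `MAPPINGS` dictionary, the object names, and both functions are hypothetical and are not how the Repository Manager stores or queries metadata.

```python
from collections import defaultdict

# Hypothetical metadata: mapping name -> objects the mapping contains.
MAPPINGS = {
    "m_load_customers": ["src_customers", "exp_clean_names", "tgt_dim_customer"],
    "m_load_orders": ["src_orders", "exp_clean_names", "tgt_fact_orders"],
}


def build_dependency_index(mappings):
    """Invert mapping -> objects into object -> sorted list of mappings."""
    index = defaultdict(list)
    for mapping_name, objects in mappings.items():
        for obj in objects:
            index[obj].append(mapping_name)
    return {obj: sorted(names) for obj, names in index.items()}


def mappings_using(index, obj):
    """Return the mappings that contain `obj`, or an empty list if none do."""
    return index.get(obj, [])
```

Calling `mappings_using(build_dependency_index(MAPPINGS), "exp_clean_names")` lists both sample mappings, which is the kind of answer you check before changing a reusable transformation.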
Connecting to a Repository
Add a repository when a repository already exists but does not appear in the Navigator. You can add a repository in the Repository Manager, the Designer, the Workflow Manager, or the Workflow Monitor. After you add a repository in one of the PowerCenter Client tools, it appears in the Navigator window of all the tools.

To add a repository to the Navigator:
1. In any of the PowerCenter Client tools, click Repository > Add.
2. Enter the name of the repository and a valid repository user name.
3. Click OK.

The repository appears in the Navigator of the PowerCenter Client tools. Before you can connect to the repository for the first time, you must configure the connection information for the domain that the repository belongs to.

To connect to a repository:
1. Launch a PowerCenter Client tool.
2. Select the repository in the Navigator and click Repository > Connect, or double-click the repository. The Connect to Repository dialog box appears.
3. Enter a valid repository user name and password.
4. If you are connecting to a repository that you have connected to before, go to step 11.
   -or-
   If you want to select an existing domain connection for the repository, go to step 9.
   -or-
   If you need to enter the domain connection information for the repository, complete steps 5 through 8.
5. If the Connect to Repository dialog box is not expanded, click More to view the connection settings.
7. Enter the domain name, gateway host name, and gateway port number. Click OK.
8. If the connection settings of the Connect to Repository dialog box are hidden, click More to view the connection settings.
9. In the connection settings, select the appropriate domain connection from the list.
10. Click Connect.

Managing Groups
You can create custom user groups to manage users and repository privileges efficiently. After creating a new user group, you assign that group a set of privileges. Each repository user must be assigned to at least one user group. You can also assign users to multiple groups.
This grants the user the privileges of each group. When you assign a user to a group, the user:
- Receives all group privileges.
- Inherits any change to group privileges.
- Loses and gains privileges if you change the user group membership.

When you revoke a privilege from a group, you revoke the privilege from each user in the group. However, if a user has been granted the privilege individually (as with default privileges), you must revoke the privilege individually to restrict that user.

Default Groups
When you create a repository, the Repository Manager creates two repository user groups. These two groups exist so you can immediately create users and begin developing repository objects. However, you can create additional groups and assign privileges and permissions to those groups. The two default repository user groups are:
- Administrators. This group initially contains two users that are created by default. The default users are Administrator and the database user that created the repository. You cannot delete these users from the repository or remove them from the Administrators group.
- Public. The Repository Manager does not create any default users in the Public group.

You cannot delete these groups from the repository or change the default privileges.

To create a user group:
1. In the Repository Manager, connect to a repository.
2. Click Security > Manage Users and Privileges.
5. Enter the name of the group.
6. Click OK. The new group now appears in the list of all groups defined for the repository.
7. Click OK to return to the Repository Manager.

You can now assign privileges and users to the group.

Tip: Before creating repository users, create user groups, and assign the appropriate privileges to those groups.

Managing Users
In the Repository Manager, you create a user name for each user accessing the repository. Each user belongs to at least one user group. When you create a user, the Repository Manager assigns the user to the Public group.
To change the group, you must first add the user to another group and then remove the user from the Public group.

Each repository user needs a user name and password to access the repository. PowerCenter uses one of the following methods to authenticate users:
- PowerCenter default authentication
- Lightweight Directory Access Protocol (LDAP) authentication

Creating a User under Default Authentication
If you use default user authentication, enter user names in the repository and assign a password for each user name you create.

To create a user under default authentication:
1. In the Repository Manager, connect to a repository.
2. Click Security > Manage Users and Privileges. The Manage Users and Privileges dialog box appears.
3. Click Add. The New User dialog box appears.
4. Enter a user name and password between 1 and 75 characters long, with no leading or trailing spaces. Use the optional Description and Contact Info fields to further identify the user.
5. Click Group Memberships. The Member Of list displays the groups that the user belongs to.
6. To add the user to a group, select the group in the Not Member Of list, and click Add. The group appears in the Member Of list.
7. To remove the user from a group, select the group in the Member Of list, and click Remove.
   Note: You cannot remove a user from a group when the user belongs to only one group.
8. Click OK.

Creating a User using an External Directory Service
If you use an external directory service for maintaining repository users, you add repository users from a list of login names from the external directory. You do not create repository user passwords.

To add users with an external directory:
1. In the Repository Manager, connect to a repository. Click Security > Manage Users and Privileges.
2. Click Add. The Add Users dialog box displays a list of names from the external directory.
3. Select a login name from the external directory list and click Add. The login name appears at the bottom of the dialog box.
If you select more than one name, the names appear at the bottom separated by semicolons.
4. If any of the names you enter do not match a valid directory service login, you must correct the name or remove it from the selections on the Invalid Name dialog box. Click OK.
5. Click OK in the Add Names dialog box to add the selections to the user name-login name association. The Manage Users and Privileges window displays the user name, login name, and enabled status.

Editing a User
You can edit users to maintain their group memberships and change their login status. If you use an external directory service, you can change the user name-login association, so the repository user name maps to a new login name. If you use default authentication, you can change a user password.

The following exceptions apply when you edit users:
- You cannot edit passwords if you use an external directory for security.
- You cannot change group memberships for the Administrator user.
- You cannot edit any property for the Database user except contact information.
- You cannot change a user name. If a user name becomes obsolete, you can remove it from the repository.

Managing Privileges
You can assign and revoke repository privileges of a user or group. When you change privileges for a group, you change the privileges granted to every user in the group. However, users retain the individually granted privileges until you revoke them. You cannot change the privileges of the default user groups or the default repository users.

To assign a privilege to a user or group:
1. In the Repository Manager, connect to a repository.
2. Click Security > Manage Users and Privileges. Select the Privileges tab.
3. Select the privileges you want to assign to each user or group.
4. Click OK to save the changes.

If you assign Super User privileges to a user, you cannot change the other privileges for that user.
The Super User privilege enables all privileges for the user.

Managing Folders
Folders provide a way to organize and store all metadata in the repository, including mappings, schemas, and sessions. Folders are designed to be flexible and to help you logically organize the repository. Each folder has a set of configurable properties that enable you to define how users access the folder. For example, you can create a folder that allows all repository users to see objects within the folder, but not to edit them. Or, you can create a folder that allows users to share objects within the folder. You can create shared and non-shared folders.

Note: You must create a folder in a repository before you can connect to the repository using the Designer or Workflow Manager.

To create a folder:
1. In the Repository Manager, connect to the repository.
2. Click Folder > Create.
3. Enter the folder properties. Permissions (required): folder permissions for users in the repository.
4. Click OK. The folder appears in the Navigator. If the folder is shared, the folder displays an open hand icon.

Shared Folders
You can configure folders in global and local repositories to be shared. After you designate a folder as shared, you can create shortcuts to objects in that folder. Use shortcuts in any other folder in the repository. If the shared folder is in a global repository, use shortcuts to that folder in any repository in the repository domain.

If a folder is not shared, you cannot create shortcuts to objects in the folder. However, you can still create copies of objects in non-shared folders.

Shared folders are useful when you want to use the same repository objects in several projects within the same repository. For example, each folder within a repository might represent a different development project. However, every project in the repository needs to store bug reports in a consistent format, so you might put the source definition for the BUG_REPORTS table in a shared folder.

Folder Permissions
Permissions allow repository users to perform tasks within a folder.
With folder permissions, you control user access to the folder, and you control the tasks that users can perform. Folder permissions work closely with repository privileges. Privileges grant access to specific tasks, while permissions allow you to read, write, and execute tasks within a specific folder.

Folders have the following permission types:
• Read permission. You can view the folder and objects in the folder.
• Write permission. You can create or edit objects in the folder.
• Execute permission. You can execute or schedule a workflow in the folder.

Any user with the Super User privilege has read, write, and execute permission across all folders in the repository and can perform any task using the PowerCenter Client tools.

Permission Levels
You can grant three levels of security using folder permissions:
• Owner. The owner of the folder, as listed in the Owner menu.
• Owner's Group. Each member of the group to which the owner belongs, including the owner. If the owner belongs to more than one group, you must select one of those groups in the folder dialog box. This becomes the Owner's Group.
• Repository. All groups and users in the repository.

Each permission level includes the members in the level above it. If you restrict a permission from Owner or Owner's Group and then grant the same permission to Repository, Owner and Owner's Group retain the permission.

Note: All users in the Administrators group or with the Super User privilege have full access to all folders in the repository.

Edit Folder Permissions
You can edit folder permissions and properties at any time. To create, edit, and delete folders, you must have the Manage Folder privilege.

To edit folder permissions and properties:
1. In the Repository Manager, connect to a repository.
2. Select the folder in the Navigator. If the folder does not appear in the Navigator, you do not have read permission for the folder.
3. Click Folder > Edit.
4. If necessary, select a new owner from the Owner menu.
   If the owner belongs to more than one group, select a group from the Group menu.
5. Edit the folder permissions.
6. Click OK to save the changes.

Comparing Folders
Before you copy or replace a folder in a repository, you may want to verify the contents of a folder or compare it with other folders. The Repository Manager lets you quickly and accurately compare the objects in different folders using the Compare Folders Wizard. You must have read permission for each folder you want to compare. You must also have privileges to access the repositories containing the folders.

If you use a versioned repository, the Repository Manager uses the latest checked-in version of an object for comparison. In the Compare Folders Wizard, you can complete the following comparisons:
• Compare objects between two folders in the same repository.
• Compare objects between two folders in different repositories.

You can specify the following comparison criteria for each comparison:
• Object types to compare. You can specify the object types to compare and display between folders. The wizard compares objects based on specific object attributes.
• Direction of comparison. The wizard performs directional comparisons. A directional comparison checks the contents of one folder against the contents of the other. You can specify either one-way or two-way comparisons.

The wizard displays the following information:
• Similarities between objects
• Differences between objects
• Outdated objects

You can edit and save the result of the comparison. You can also compare two objects of the same type using the Compare Objects Wizard. The wizard compares the attributes of each object in the source folder with the attributes of each object in the target folder. You can choose to compare objects based on the following criteria:
• Different objects. Object name and type exist in one folder but not the other.
• Similar objects. Object name, type, and modification date are the same in both folders.
• Outdated objects.
  Object modification date is older than objects with the same name.

The wizard does not compare the field attributes of the objects in the folders when performing the comparison. For example, if two folders have matching source names and column or port names but differing port or column attributes, such as precision or datatype, the wizard does not note these as differences.

The Compare Folders Wizard displays similar objects in green text, unmatched objects denoting a difference in red text, and outdated objects in blue text. The Compare Folders Wizard always displays the total number of differences, similarities, and outdated objects found during the comparison, even if you do not choose to display differences or similarities in the edit field.

You can save the results of the comparison in the edit field to an .rtf or a .txt file. To retain the color and font attributes of the result, save it as an .rtf file. By default, the results file is saved in the My Documents directory. Connect to the repositories containing the folders in the wizard.

To compare folders:
1. In the Repository Manager, click Folder > Compare.
2. Click Next.
3. Connect to the repositories containing the folders you want to compare and select the folders for comparison. Click Next.
4. Select the object types you want to compare. Click Next.
6. Click Next. The wizard always displays the number of differences, similarities, and outdated objects.
7. View the results of the comparison.
9. If you chose to save the results to a file, specify the file type, name, and directory. Click Save.

Viewing User Connections
Use the Repository Manager to view connection details for users and applications connected to the repository. The following connection details are viewable in the Repository Manager:
• User. User name associated with the connection.
• Connection ID. Identification number assigned to the repository connection.
• Application.
  Repository client application associated with the connection.
• Host Name. Name of the machine running the application.
• Host Address. TCP/IP address of the machine associated with the connection.
• Host Port. Port number the machine hosting the repository client application uses to communicate with the repository.
• Login Time. Time the user connected to the repository.
• Last Active Time. Time of the last metadata transaction between the repository client application and the repository.

To view user connection details:
1. Launch the Repository Manager and connect to the repository.
2. Click Edit > Show User Connections. The Repository Connections dialog box appears.
3. To sort the connections by column, click the column name.

Viewing Locks
You can view locks and identify residual locks in the Repository Manager or in the Administration Console. The following object lock information is available in both the Repository Manager and the Administration Console:
• User. User name locking the object.
• Connection ID. Identification number assigned to the repository connection.
• Folder. Folder in which the locked object is saved.
• Object Type. Type of object, such as folder, version, mapping, or source.
• Object Name. Name of the locked object.
• Lock Type. Type of lock: in-use, write-intent, or execute.
• Lock Time. Time the lock was created.
• Hostname. Name of the machine locking the object.
• Application. Application locking the object, such as Designer, Workflow Manager, Repository Manager, or Integration Service.

To show all repository locks:
1. In the Repository Manager, connect to a repository.
2. To view all locks in the repository, click Edit > Show Locks. The Object Locks dialog box appears.
3. To sort the view of the locks by column, click the column name.
4. To view updated lock information, click Refresh.

Viewing Dependencies
Before changing or deleting repository objects, you can view dependencies to see the impact on other objects.
For example, before you remove a session, you can find out which workflows use the session. You can view dependencies for repository objects in the Repository Manager, Workflow Manager, and Designer tools.

In the Repository Manager, Workflow Manager, and Designer, you can view dependencies when you perform the following tasks:
• View object history. You can view dependencies from the View History window when you view the history of an object.
• View checkouts. You can view dependencies from the View Checkouts window when you search for persistent checkouts.
• View query results. You can view dependencies from the Query Results window when you search for object dependencies or run an object query.

The Dependencies dialog box displays the object name, the object type, and a list of dependent objects and their properties. When you search for dependencies, you can filter the results by selecting options and object types:
• Primary/Foreign Key Dependencies. View primary and source object dependencies where there is a primary key-foreign key relationship.
• Global Shortcut Dependencies. View global shortcuts across repositories. You can select this option when you search for parents, children, or primary key-foreign key dependencies.
• All Children. View the objects that the selected object uses. For example, if you search for the child objects for a workflow, the results might include sessions and worklets.
• All Parents. View the objects that use the selected object. For example, if you search for the parent objects for a session, the results might include a workflow or worklet.

The View Dependencies window also displays output, comments, and label information associated with the object. The Output window displays validation information, and the Comments window displays text entered during object check in or check out.
The Labels window displays labels associated with the object and information associated with the label object. You can check in objects from the View Dependencies dialog box, but you cannot check in original objects for global shortcuts.

Try It Out
1. Create a User Group.
2. Assign the write and execute privileges to the created group.
3. Create Users and assign User Groups to them.
4. Create a folder without shortcut.
5. View locks.
6. View dependencies.

Summary
• The Repository Manager has 4 windows: Main, Navigator, Dependency, and Output.
• The Repository Manager allows the creation of groups.
• Groups can be assigned the applicable privileges.
• The Repository Manager allows the creation of users under the desired groups, and of folders with the applicable permissions and privileges.
• Two folders of the same or different repositories can be compared.
• The Repository Manager shows all the dependencies for the selected object.

Test your Understanding
• What are the 4 windows in the Repository Manager?
• What are the two types of folders?
• What are the types of dependencies available?
• What are the default available groups?
• What is the default group that a new user is assigned to when created in the Repository Manager?
• What are the default permissions when a new folder is created?

Session 5: Mapping Designer

Learning Objectives
• Designer Overview
• Designer Tools
• Source Analyzer
• Working with Mappings
• Various Types of Transformations
• Mapping Parameters and Variables

Designer Overview
The Designer has tools to help you build mappings and mapplets so you can specify how to move and transform data between sources and targets. The Designer helps you create source definitions, target definitions, and transformations to build the mappings.

The Designer lets you work with multiple tools at one time and work in multiple folders and repositories at the same time. It also includes windows so you can view folders, repository objects, and tasks.
These tools play an important role in creating the mapping and converting the business requirements into ETL code.

You can configure general Designer settings, such as background color and font. You can also configure specific tool settings for each Designer tool.

The Designer provides the following tools:
• Source Analyzer. Import or create source definitions for flat file, XML, COBOL, Application, and relational sources.
• Warehouse Designer. Import or create target definitions.
• Transformation Developer. Used to create reusable transformations.
• Mapplet Designer. Used to create mapplets.
• Mapping Designer. Used to create mappings.

The Designer consists of the following windows:
• Navigator. Use to connect to and work in multiple repositories and folders. You can also copy and delete objects and create shortcuts using the Navigator.
• Workspace. Use to view or edit sources, targets, mapplets, transformations, and mappings. You work with a single tool at a time in the workspace, which has two formats: default and workbook. You can view multiple versions of an object in the workspace.
• Status bar. Displays the status of the operation you perform.
• Output. Provides details when you perform certain tasks, such as saving work or validating a mapping. Right-click the Output window to access window options, such as printing output text, saving text to file, and changing the font size.
• Overview. An optional window to simplify viewing workbooks containing large mappings or a large number of objects. Outlines the visible area in the workspace and highlights selected objects in color. To open the Overview window, click View > Overview Window.
• Instance Data. View transformation data while you run the Debugger to debug a mapping.
• Target Data. View target data while you run the Debugger to debug a mapping.

You can complete the following tasks in each of the Designer tools:
• Add a repository.
• Print the workspace.
• View date and time an object was last saved.
• Open and close a folder.
• Create shortcuts.
• Check out and check in repository objects.
• Search for repository objects.
• Enter descriptions for repository objects.
• View older versions of objects in the workspace.
• Revert to a previously saved object version.
• Copy objects.
• Export and import repository objects.
• Work with multiple objects, ports, or columns.
• Rename ports.
• Use shortcut keys.

You can also view object dependencies in the Designer. Comparison of various objects can also be done here.

Source Analyzer
The Source Analyzer can import or create source definitions for flat file, XML, COBOL, ERP, and relational sources. A source definition specifies where the data is being extracted from. Sources can be database objects (tables, views, and synonyms), flat files, XML files, or COBOL files that provide source data. The Source Analyzer reads, analyzes, and "reverse engineers" schema information of operational databases and flat files. Sources store metadata information in the repository.

To extract data from a source, you must first define sources in the repository. Source definitions can be created or imported from the database. Flat file source definitions are mainly created in the Source Analyzer with the structure of the flat file.

You can import or create the following types of source definitions in the Source Analyzer:
• Relational tables, views, and synonyms
• Fixed-width and delimited flat files that do not contain binary data
• COBOL files
• XML files
• Data models using certain data modeling tools through Metadata Exchange for Data Models

To create the source definitions:
1. Click Sources > Create.
2. Select the database type and enter the name for the source.
3. Double-click the source, or right-click and choose Edit, to configure the properties of the source definition.
4. Select the Columns tab to define the field name, data type, and length of the field.
5. Click OK.

To import the source definitions of relational tables, views, or synonyms:
1. In the Designer, click Tools > Source Analyzer to open the Source Analyzer.
2.
   Click Sources > Import from Database.
3. Select the ODBC data source to access the database containing the source tables.
4. Enter the user name and password to connect to this database. Also, enter the name of the source table owner, if necessary.
5. Click Connect.
6. In the Select tables list, expand the database owner and the TABLES heading. If you click the All button, you can see all tables in the source database.
7. Select the tables you want to import into the repository.
8. Click OK to import the source definitions into the repository.
9. The Designer displays the newly imported sources in the workspace. Click Layout > Scale to Fit to fit all the definitions in the workspace.

A new database definition (DBD) node appears under the Sources node in the project folder. This new entry has the same name as the ODBC data source used to access the sources you just imported. If you double-click the DBD node, the list of all the imported sources is displayed.

You can import fixed-width and delimited flat file definitions that do not contain binary data. When importing the definition, the file must be in a directory local to the client machine. In addition, the Integration Service must be able to access all source files during the session.

When you create a file source, target, or lookup definition, you must define the properties of the file. The Flat File Wizard prompts you for the following file properties:
• File name and location
• File code page
• File type
• Column names and data types
• Number of header rows in the file
• Column size and null characters for fixed-width files
• Delimiter types, quote character, and escape character for delimited files

Every mapping requires at least one source qualifier that determines how the Integration Service reads the source data:
• Source Qualifier transformation. Used for relational and flat file sources.
• Normalizer transformation. Used for COBOL sources.
• Application Source Qualifier transformation. Used for Application sources.
• XML Source Qualifier transformation. Used for XML sources.

You can let the Designer create the source qualifier by default, so each time you drag a source into a mapping, the Designer adds a source qualifier and connects it to the source. Use automatic source qualifier creation when you want to create one source qualifier for each source in the mapping. You can disable automatic creation when you want to join data from different relational sources. You can then manually create a source qualifier and connect it to the sources.

When you edit the source in the Source Analyzer, all instances of the source in mappings inherit the changes. Some changes might invalidate the mappings using the source.

Warehouse Designer
The Warehouse Designer helps you import or create target definitions for file and relational sources. Targets are the output destinations where the data will be written. These can be database objects, flat files, or XML files that receive transformed data. During a session, the Informatica Server writes the transformed data to targets. Target definitions can be imported in the Warehouse Designer in the same way as source definitions.

You can create the following types of target definitions in the Warehouse Designer:
• Relational. Create a relational target for a particular database platform. Create a relational target definition when you want to use an external loader to the target database.
• Flat file. Create fixed-width and delimited flat file target definitions.
• XML file. Create an XML target definition to output data to an XML file.

You can create target definitions in the following ways:
• Manually create a target definition. Create a target definition in the Warehouse Designer.
• Import the definition for an existing target. Import the target definition from a relational target or a flat file. The Warehouse Designer uses a Flat File Wizard to import flat files.
• Create a target definition based on a source definition.
  Drag a source definition into the Warehouse Designer to make a target definition.
• Create a target definition based on a transformation or mapplet. Drag a transformation into the Warehouse Designer to make a target definition.
• Design several related targets. Create several related target definitions at the same time. You can create the overall relationship, called a schema, and the target definitions, through wizards in the Designer. The Cubes and Dimensions Wizards follow common principles of data warehouse design to simplify the process of designing related targets.

Transformation Developer
A transformation generates, modifies, or passes data through ports that you connect in a mapping or mapplet. When you build a mapping or mapplet, you add transformations and configure them to handle data according to your business purpose. The Designer provides a set of transformations that perform specific functions. Transformations in a mapping represent the operations the Informatica Server performs on data. Transformations can be active or passive.

Complete the following tasks to incorporate a transformation into a mapping:
• Create the transformation. Create it in the Mapping Designer as part of a mapping, in the Mapplet Designer as part of a mapplet, or in the Transformation Developer as a reusable transformation.
• Configure the transformation. Each type of transformation has a unique set of options that you can configure.
• Link the transformation to other transformations and target definitions. Drag one port to another to link them in the mapping or mapplet.

You can create transformations using the following Designer tools:
• Mapping Designer.
  Create transformations that connect sources to targets. Transformations created in a mapping cannot be used in other mappings unless you promote them to be reusable.
• Transformation Developer. Create reusable transformations that can be used in multiple mappings.
• Mapplet Designer. Create and configure a set of transformations, called mapplets, that you use in multiple mappings.

Use the same process to create a transformation in the Mapping Designer, Transformation Developer, and Mapplet Designer.

To create a transformation:
1. Open the appropriate Designer tool.
2. On the Transformations toolbar, click the button corresponding to the transformation you want to create.
   Or
   Click Transformation > Create and select the type of transformation you want to create.
3. Drag across the workspace where you want to place the transformation. The new transformation appears in the workspace.

Configuring Transformations
After you create a transformation, you can configure it. Every transformation contains the following common tabs:
• Transformation. Name the transformation or add a description.
• Ports. Add and configure ports.
• Properties. Configure properties that are unique to the transformation.
• Metadata Extensions. Extend the metadata in the repository by associating information with individual objects in the repository.

Some transformations might include other tabs, such as the Condition tab, where you enter conditions in a Joiner or Normalizer transformation.

When you configure transformations, you might complete the following tasks:
• Add ports. Define the columns of data that move into and out of the transformation.
• Add groups. In some transformations, define input or output groups that define a row of data entering or leaving the transformation.
• Enter expressions. Enter SQL-like expressions in some transformations that transform the data.
• Define local variables. Define local variables in some transformations that temporarily store data.
• Override default values.
  Configure default values for ports to handle input nulls and output transformation errors.
• Enter tracing levels. Choose the amount of detail the Integration Service writes in the session log about a transformation.

Source Qualifier Transformation
When you add a relational or a flat file source definition to a mapping, you need to connect it to a Source Qualifier transformation. The Source Qualifier transformation represents the rows that the Integration Service reads when it runs a session. Use the Source Qualifier transformation to complete the following tasks:
• Join data originating from the same source database. You can join two or more tables with primary key-foreign key relationships by linking the sources to one Source Qualifier transformation.
• Filter rows when the Integration Service reads source data. If you include a filter condition, the Integration Service adds a WHERE clause to the default query.
• Specify an outer join rather than the default inner join. If you include a user-defined join, the Integration Service replaces the join information specified by the metadata in the SQL query.
• Specify sorted ports. If you specify a number for sorted ports, the Integration Service adds an ORDER BY clause to the default SQL query.
• Select only distinct values from the source. If you choose Select Distinct, the Integration Service adds a SELECT DISTINCT statement to the default SQL query.
• Create a custom query to issue a special SELECT statement for the Integration Service to read source data. For example, you might use a custom query to perform aggregate calculations.

Expression Transformation
The Expression transformation is used to calculate values in a single row before you write to the target. For example, you might need to adjust employee salaries, concatenate first and last names, or convert strings to numbers.
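The single-row calculations mentioned above (adjusting a salary, concatenating first and last names, converting a string to a number) can be sketched in Python as a function applied to one row at a time. This is only an illustrative model, not PowerCenter code: the port names (SALARY, FIRST_NAME, LAST_NAME, DEPT_NO_TEXT), the 10% raise, and the function name are all assumptions made for the example.

```python
# Illustrative model of an Expression transformation: every output value is
# computed from the ports of a single input row. All names are hypothetical.

def apply_expressions(row: dict) -> dict:
    """Return a copy of the row with three derived output ports added."""
    out = dict(row)  # input/output ports pass through unchanged
    # Adjust the salary (an assumed 10% raise).
    out["NEW_SALARY"] = row["SALARY"] * 110 / 100
    # Concatenate first and last names.
    out["FULL_NAME"] = row["FIRST_NAME"] + " " + row["LAST_NAME"]
    # Convert a string to a number.
    out["DEPT_NO"] = int(row["DEPT_NO_TEXT"])
    return out


source_rows = [
    {"FIRST_NAME": "Asha", "LAST_NAME": "Rao", "SALARY": 50000, "DEPT_NO_TEXT": "10"},
    {"FIRST_NAME": "Ben", "LAST_NAME": "Lee", "SALARY": 30000, "DEPT_NO_TEXT": "20"},
]

# One output row is produced per input row, so the row count does not change.
target_rows = [apply_expressions(r) for r in source_rows]
```

Each derived key plays the role of one output port with its own expression, and no value depends on any other row.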
You use the Expression transformation to perform any non-aggregate calculations. You can also use the Expression transformation to test conditional statements before you output the results to target tables or other transformations.

The following types of ports are used to perform single-row functions:
• Input ports or input/output ports for each value used in the calculation. For example, when calculating the total price for an order, determined by multiplying the unit price by the quantity ordered, one port provides the unit price and the other provides the quantity ordered.
• Output port for the expression. You enter the expression as a configuration option for the output port. The return value for the output port needs to match the return value of the expression. Expressions use the transformation language, which includes SQL-like functions, to perform calculations.
• Local variable ports. You can reference variables in an expression or use them to temporarily store data. Variables are an easy way to improve performance.

You might use variables to complete the following tasks:
• Temporarily store data.
• Simplify complex expressions.
• Store values from prior rows.
• Capture multiple return values from a stored procedure.
• Compare values.
• Store the results of an unconnected Lookup transformation.

Adding Multiple Calculations
You can enter multiple expressions in a single Expression transformation. As long as you enter only one expression for each output port, you can create any number of output ports in the transformation. In this way, use one Expression transformation rather than creating separate transformations for each calculation that requires the same set of data.

For example, you might want to calculate several types of withholding taxes from each employee paycheck, such as local and federal income tax, Social Security, and Medicare. Since
all of these calculations require the employee salary, the withholding category, and/or the corresponding tax rate, you can create one Expression transformation with the salary and withholding category as input/output ports and a separate output port for each necessary calculation.

Filter Transformation
Use the Filter transformation to filter out rows in a mapping. As an active transformation, the Filter transformation may change the number of rows passed through it. The Filter transformation allows rows that meet the specified filter condition to pass through. It drops rows that do not meet the condition. You can filter data based on one or more conditions.

A filter condition returns TRUE or FALSE for each row that the Integration Service evaluates, depending on whether a row meets the specified condition. For each row that returns TRUE, the Integration Service passes the row through the transformation. For each row that returns FALSE, the Integration Service drops the row and writes a message to the session log.

You cannot concatenate ports from more than one transformation into the Filter transformation. The input ports for the filter must come from a single transformation.

A Filter transformation contains the following tabs:
• Transformation. Enter the name and description of the transformation. The naming convention for a Filter transformation is FIL_TransformationName. You can also make the transformation reusable.
• Ports. Create and configure ports.
• Properties. Configure the filter condition to filter rows. Use the Expression Editor to enter the filter condition. You can also configure the tracing level to determine the amount of transaction detail reported in the session log file.
• Metadata Extensions. Create a non-reusable metadata extension to extend the metadata of the transformation. Configure the extension name, datatype, precision, and value. You can also promote metadata extensions to reusable extensions if you want to make them available to all transformations.
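The row-by-row behaviour described above can be sketched in Python. This is a simplified model rather than PowerCenter code: the function name, the sample rows, and the example condition are assumptions, and the list of dropped rows merely stands in for the messages written to the session log.

```python
# Illustrative model of a Filter transformation. Names and data are hypothetical.
from typing import Callable


def filter_rows(rows: list, condition: Callable[[dict], bool]):
    """Split rows into those that pass the condition and those that are dropped."""
    passed, dropped = [], []
    for row in rows:
        if condition(row):   # condition returned TRUE: the row passes through
            passed.append(row)
        else:                # condition returned FALSE: the row is dropped
            dropped.append(row)
    return passed, dropped


employees = [
    {"EMP_ID": 1, "SALARY": 25000},
    {"EMP_ID": 2, "SALARY": 45000},
    {"EMP_ID": 3, "SALARY": 120000},
]

# Assumed example condition: keep rows whose salary is above 30000.
passed, dropped = filter_rows(employees, lambda r: r["SALARY"] > 30000)
```

Because the output can hold fewer rows than the input, this models an active transformation: the condition is evaluated once per row and decides only whether that row continues.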
The Filter transformation used in the mapping below allows rows for employees that make salaries of $30,000 or higher to pass through.

Note: Place the Filter transformation as close to the sources in the mapping as possible to maximize session performance. Rather than passing rows you plan to discard through the mapping, you can filter out unwanted data early in the flow of data from sources to targets.

Specifying a Filter Condition in a Filter Transformation
The filter condition is an expression that returns TRUE or FALSE. Enter conditions using the Expression Editor available on the Properties tab. Any expression that returns a single value can be used as a filter.

For example, if you want to filter out rows for employees whose salary is less than $30,000, you enter the following condition:

   SALARY > 30000

You can specify multiple components of the condition, using the AND and OR logical operators. If you want to filter out employees who make less than $30,000 or more than $100,000, you enter the following condition:

   SALARY > 30000 AND SALARY < 100000

You can also enter a constant for the filter condition. The numeric equivalent of FALSE is zero (0). Any nonzero value is the equivalent of TRUE. For example, a transformation contains a port named NUMBER_OF_UNITS with a numeric datatype. You configure a filter condition to return FALSE if the value of NUMBER_OF_UNITS equals zero. Otherwise, the condition returns TRUE.

You do not need to specify TRUE or FALSE as values in the expression. TRUE and FALSE are implicit return values from any condition you set. If the filter condition evaluates to NULL, the row is treated as FALSE.

Note: The filter condition is case sensitive.

Filtering Rows with Null Values
To filter rows containing null values or spaces, use the ISNULL and IS_SPACES functions to test the value of the port.
For example, if you want to filter out rows that contain a NULL value in the FIRST_NAME port, use the following condition:

IIF(ISNULL(FIRST_NAME), FALSE, TRUE)

This condition states that if the FIRST_NAME port is NULL, the return value is FALSE and the row should be discarded. Otherwise, the row passes through to the next transformation.

Use the following guidelines to use the Filter transformation efficiently:
• Use the Filter transformation early in the mapping. To maximize session performance, keep the Filter transformation as close as possible to the sources in the mapping. Rather than passing rows that you plan to discard through the mapping, you can filter out unwanted data early in the flow of data from sources to targets.
• Use the Source Qualifier transformation to filter. The Source Qualifier transformation provides an alternate way to filter rows. Rather than filtering rows from within a mapping, the Source Qualifier transformation filters rows when they are read from a source. The main difference is that the source qualifier limits the row set extracted from a source, while the Filter transformation limits the row set sent to a target. Since a source qualifier reduces the number of rows used throughout the mapping, it provides better performance. However, the Source Qualifier transformation only lets you filter rows from relational sources, while the Filter transformation filters rows from any type of source. Also, note that since it runs in the database, you must make sure that the filter condition in the Source Qualifier transformation only uses standard SQL. The Filter transformation can define a condition using any statement or transformation function that returns either a TRUE or FALSE value.

Joiner Transformation

Use the Joiner transformation to join source data from two related heterogeneous sources residing in different locations or file systems. You can also join data from the same source.
The Joiner transformation joins sources with at least one matching column. The Joiner transformation uses a condition that matches one or more pairs of columns between the two sources.

The two input pipelines include a master pipeline and a detail pipeline, or a master and a detail branch. The master pipeline ends at the Joiner transformation, while the detail pipeline continues to the target.

[Figure: mapping showing the master pipeline and the detail pipeline]

To join more than two sources in a mapping, join the output from the Joiner transformation with another source pipeline. Add Joiner transformations to the mapping until you have joined all the source pipelines. The Joiner transformation accepts input from most transformations. However, consider the following limitations on the pipelines you connect to the Joiner transformation:
• You cannot use a Joiner transformation when either input pipeline contains an Update Strategy transformation.
• You cannot use a Joiner transformation if you connect a Sequence Generator transformation directly before the Joiner transformation.

Working with the Joiner Transformation

When you work with the Joiner transformation, you must configure the transformation properties, join type, and join condition. You can configure the Joiner transformation for sorted input to improve Integration Service performance. You can also configure the transformation scope to control how the Integration Service applies transformation logic. To work with the Joiner transformation, complete the following tasks:
• Configure the Joiner transformation properties. Properties for the Joiner transformation identify the location of the cache directory, how the Integration Service processes the transformation, and how the Integration Service handles caching.
• Configure the join condition. The join condition contains ports from both input sources that must match for the Integration Service to join two rows.
Depending on the type of join selected, the Integration Service either adds the row to the result set or discards the row.
• Configure the join type. A join is a relational operator that combines data from multiple tables in different databases or flat files into a single result set. You can configure the Joiner transformation to use a Normal, Master Outer, Detail Outer, or Full Outer join type.
• Configure the session for sorted or unsorted input. You can improve session performance by configuring the Joiner transformation to use sorted input. To configure a mapping to use sorted data, you establish and maintain a sort order in the mapping so that the Integration Service can use the sorted data when it processes the Joiner transformation.
• Configure the transaction scope. When the Integration Service processes a Joiner transformation, it can apply transformation logic to all data in a transaction, all incoming data, or one row of data at a time.

Defining a Join Condition

The join condition contains ports from both input sources that must match for the Integration Service to join two rows. Depending on the type of join selected, the Integration Service either adds the row to the result set or discards the row. The Joiner transformation produces result sets based on the join type, condition, and input data sources.

During a session, the Integration Service compares each row of the master source against the detail source. To improve performance for an unsorted Joiner transformation, use the source with fewer rows as the master source. To improve performance for a sorted Joiner transformation, use the source with fewer duplicate key values as the master.

You define one or more conditions based on equality between the specified master and detail sources.
For example, if two sources with tables called EMPLOYEE_AGE and EMPLOYEE_POSITION both contain employee ID numbers, the following condition matches rows with employees listed in both sources:

EMP_ID1 = EMP_ID2

The order of the ports in the condition can impact the performance of the Joiner transformation. If you use multiple ports in the join condition, the Integration Service compares the ports in the order you specify.

The Designer validates datatypes in a condition. Both ports in a condition must have the same datatype. If you need to use two ports in the condition with non-matching datatypes, convert the datatypes so they match.

Use the following guidelines when you define join conditions:
• The ports you use in the join condition must match the ports at the sort origin.
• When you configure multiple join conditions, the ports in the first join condition must match the first ports at the sort origin.
• When you configure multiple conditions, the order of the conditions must match the order of the ports at the sort origin, and you must not skip any ports.
• The number of sorted ports in the sort origin can be greater than or equal to the number of ports at the join condition.

Null Handling

The Joiner transformation does not match null values. For example, if both EMP_ID1 and EMP_ID2 contain a row with a null value, the Integration Service does not consider them a match and does not join the two rows. To join rows with null values, replace null input with default values, and then join on the default values.

If a result set includes fields that do not contain data in either of the sources, the Joiner transformation populates the empty fields with null values. If you know that a field will return a NULL and you do not want to insert NULLs in the target, you can set a default value on the Ports tab for the corresponding port.

Join Type

You define the join type on the Properties tab in the transformation. The Joiner transformation supports the following types of joins:
• Normal Join.
With a normal join, the Integration Service discards all rows of data from the master and detail source that do not match, based on the condition.
• Master Outer Join. A master outer join keeps all rows of data from the detail source and the matching rows from the master source. It discards the unmatched rows from the master source.
• Detail Outer Join. A detail outer join keeps all rows of data from the master source and the matching rows from the detail source. It discards the unmatched rows from the detail source.
• Full Outer Join. A full outer join keeps all rows of data from both the master and detail sources.

Note: A normal or master outer join performs faster than a full outer or detail outer join.
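The four join types and the null-handling rule can be compared side by side with a small Python sketch. It is a simplified illustration only: the function join_rows, the join-type strings, and the sample rows are assumptions made for this example, and the nested-loop matching says nothing about how the Integration Service caches or processes data. The port names EMP_ID1 and EMP_ID2 follow the join condition used earlier.

```python
def join_rows(master, detail, master_key, detail_key, join_type="normal"):
    """Join two lists of dict rows on master_key = detail_key.

    join_type is one of: "normal", "master_outer", "detail_outer",
    "full_outer". Null (None) keys never match each other.
    """
    # Rows of nulls used to pad the side that has no matching row.
    null_master = dict.fromkeys(master[0]) if master else {}
    null_detail = dict.fromkeys(detail[0]) if detail else {}

    result = []
    matched_master = set()
    for d in detail:
        found = False
        for i, m in enumerate(master):
            # A null key on either side is never treated as a match.
            if m[master_key] is not None and m[master_key] == d[detail_key]:
                result.append({**m, **d})
                matched_master.add(i)
                found = True
        # Master outer and full outer keep every detail row.
        if not found and join_type in ("master_outer", "full_outer"):
            result.append({**null_master, **d})

    # Detail outer and full outer keep every master row.
    if join_type in ("detail_outer", "full_outer"):
        for i, m in enumerate(master):
            if i not in matched_master:
                result.append({**m, **null_detail})
    return result


# Hypothetical sample rows for EMPLOYEE_AGE (master) and EMPLOYEE_POSITION (detail).
master = [
    {"EMP_ID1": 1, "AGE": 34},
    {"EMP_ID1": 2, "AGE": 51},
    {"EMP_ID1": None, "AGE": 29},
]
detail = [
    {"EMP_ID2": 1, "POSITION": "Analyst"},
    {"EMP_ID2": 3, "POSITION": "Manager"},
    {"EMP_ID2": None, "POSITION": "Clerk"},
]

counts = {
    jt: len(join_rows(master, detail, "EMP_ID1", "EMP_ID2", jt))
    for jt in ("normal", "master_outer", "detail_outer", "full_outer")
}
print(counts)
```

In this sample only employee 1 appears in both sources, so the normal join returns a single row, and the two rows with null IDs are never joined to each other. The outer joins keep the unmatched rows from one or both sides and fill the missing fields with nulls.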
