You are on page 1of 10

Informatica Senarios-1

by PenchalaRaju.Yanamala

1. I have a flat file, in which I have two fields, emp_id, emp_name. The data is like this- emp_id, emp_name soha101, ali101, kahn101 khan102, Siva102, shanker102, reddy.how to merge the names so that my output is like this Emp_id Emp_name 101 Soha ali Kahn? A:- In expression transformation use an output port ...in expression window write emp_id||empname 2. How to join a Flat and Relational Source without using (Joiner, Update, Lookup) transformations... is it possible? If yes i would like to know how? A:- not possible 3. I have a source which relational, I am trying to populate to target flat file with one column for daily date which is sysdate, I want to populate the sysdate coulmn with DD/MM/YYYY format. Kindly provide a solution for this. My clear that my target is flat file. A: - In expression transformation create one out put port, and write like: TO_DATE ('SYSDATE'.'DD/MM/YYYY').Connect this port to target. 4. If the source has duplicate records as id and name columns, values: 1 a, 1 b, 1 c, 2 a, 2 b, the target should be loaded as 1 a+b+c or 1 a||b||c, what transformations should be used for this? A: - We need to use sorter, Expression and aggregator transformations to do this….. 1. Sort by ID 2.Take 2 variable ports one for id and one for name and store the values of id and keep on comparing with current id i.e. variable is having previous id so if previous id=current id then (variable name)||name otherwise only name. Assign the variable name to output port 3. Use aggregator and use last or max (len (name)) to get the result. 5. How many repositories can you create in informatica? A-In Informatica 8.6.0 multiple repositories can be created under node. The domain can have multiple nodes. 6. Router T/R is active but some people are saying some times passive what is Reason behind that? A: - First of all Every Active transformation is a Passive transformation, But every passive not Active. In Router Transformation there is a special feature with Default group. Because of Default Group its passive. We can avoid this Default group by some transformation Settings, Now It’s Active.

This will Reduce the complexity of the Power center mappings and increases the performance. How to deal with it without changing mapping? A: . VSAM. I want to run an informatica workflow after completion of oracle procedure.7.Dont go Single Source qualifier transformation. I need to find the details that the record belongs to which source? A: . Probably in Data warehousing. So.This can be possible with UNIX. If no.I know 4 ways in INFORMATICA ? . relational) will be loading into only one target. I do not agree with this scenario. 12. As far as DWH is concerned.After every Source qualifier transformation just keep an Expression with flag. Informatica Senarios-2 8. you won't find such a design.transform) it finally loads the data in some targets. it says that the no. the question itself arises for the OLTP design.dynamic cache ` 13.If I understand this question properly. Create a shell scripts which first has to Execute the Stored procedure or Package and we have command to check the completion or procedure After that use Pmcmd command in the same UNIX to start the workflow. T. That Procedure is not running through informatica and can be run at any Time in Database. as per the question. more than 500 sources (legacy. Is it possible? If yes please Explain? A: . No any OLTP (or database design principal) system suggests a varying number of columns. Diff b/w Shortcut and reusable Object? A: . please do not get confused by such trivial kind of questions. Please go through the concepts first.DWH is a much disciplined subject & it follows a very good standards. "Change cache" in Informatica? A: . of "Source" columns are changing. Informatica is in windows environment. Can we create index and drop index in existing table while using informatica A: . it takes the data from the OLTP systems & after performing some operations (E-extract.A shortcut is created by assigning 'Shared' status to a folder within the Repository Manager and then dragging objects from this folder into another open folder 10. Keep 1 source qualifier Trans for one table. 11. of source columns is changing every time (First time it is 10 next time it is 20 so on). You will get a clear picture of DWH then.Pushdown optimization is used to push the complex logic to the database level. What is 'Power Center Pushdown Optimization Option' in INFORMATICA?(IMP) A: . 9. Here. in a single mapping. Whenever I retrieve the data (any record) from target.

DELETE.Use source type as Indirect File Type and source file name as a file having the names of All the n flat files to be read.(source table. 17. UPDATE. If we are using an aggregator but forget to mention the group by port .We can create any No of Mapplets for 1 mapping. 2) Source qualifier Trans-(Sql override) 3) Target override 4) Pre sql. Post sql 14. Using Update Trans we can create SCD mapping 16. There is no limit for logics. Informatica Senarios-3 15.. 18.as per my understanding. There are n numbers of flat file of exactly same format are placed in a folder.SCD applicable for Insert. Update at a Time using Update Strategy Trans only..what will be the Output? A: . What is Target Update Override? What is the Use? . Can we load these flat file’s data one by one to a single relational table by a Single session? A: . Lkp is always behave like left outer join. There is no limit for Mapplets... 15. LOOKUP Condition is nothing but a Join condition? What type of join condition it.. Using Session Properties we can do single flow only . Why do we go for update strategy TR in SCD rather using the session Properties? A : . It will give you all Matched records as well as unmatched records which are not present in base table. Using key ports (enable. Every mapplet can have a Logic or logics..and Those unmatched records are will be null in case of unconnected lkp trn..1) Source Analyzer window. REJECT. the aggregator will write the lat row of the table to the next transformation. By Default? Using the Lookup Condition How many types of relational conditions We can make? A: .If we miss to enable any of the port as GROUP BY....Session Properties like pre Source Rows INSERT. How many mapplets u have created? And what is the logic used? A: .. disable)).

with Agg Transformation. on target there are 5 m rows which options is better to match data 1. if u connect sequence generator with joiner the outcome sequence will not be proper. Rank Trans).Here we will use joiner for better performance. Main reason is joiner is an Active transformation. How can I generate Sequence Numbers to Target Table (with out using Sequence Gen Trans. First Group by ID. How I can Schedule the Informatica job in "Unix Corn scheduling tool”? A: .So. This way we can update the table. How to create a mapping? id date 101 2/4/2008 101 4/4/2008 102 6/4/2008 102 4/4/2008 103 4/4/2008 104 8/4/2008 O/P . What are set operators in Oracle? A: . Why sequence generator should not directly connected to joiner transformation? A:. So only 100 comparisons will be done. or we can use "at" command in UNIX to schedule the job. We will join the two sources making source table as master source.A: . We can not join this number against any column in other tables. We can not connect sequence generator with joiner. Then go with min (date) in Same Agg Transformation. .. We need to define keys at informatica level and use update Override in target property. So. Static 4.I think its Simple. So caching will take more time. 21. Dynamic? A: . Joiner 2 No cache 3.should have only one id with the min (date) How to create a mapping for this? A: . MINUS and INTERSECT 23. Informatica Senarios-4 22. 20. Whereas in static and dynamic we have to look up on the target which is very large 5m rows.we can do this by using crontab file in UNIX. And also. UNION ALL. So it will be very faster.When we don't have primary keys defined on database level.Mainly sequence generator is used to generate a unique id dynamically. Means it can alter the number of rows.UNION. from Source 100 rows are coming. A nd still we need update on This target from Informatica. 24.. for this we need to schedule the Power centre job. 19.

A: . How can we load first and last record from a flat file source to target? A: .A: . 26. 2 Router Trans Agg1-gorup by col count=1 to router Trans Agg2-group by col count<>1 to router Trans (I think “it will help u”) From 2nd router Transformation we have a Separate Target Table. overwriting existing rows. This Inserts. I want want to know is it possible to get only date(MMDDYYYY) in the date data type. Then after using the Union transformation you will get 17 Records. Compares source columns against corresponding target columns if key columns match. Updates only slowly changing Dimension technique.I have table name called Team and I have name and DOJ in that table in oracle. It capture all the unique records from both the source.. Inserts new rows to the target.2 for last record. in which 3 records Are same in both the tables. 1 for 1st record.The simple logic of Union is that. Compares logical key columns in the source against corresponding columns in the target lookup table.I have an idea after sql transformation go thruogh 2 Agg Trans. Is this slowly changing Dimension technique? Change data capture (CDC) Mean newly Inserts. Can any one explain me step by step how scd will work? Selects all rows. Creates two data flows: one for new rows. 29. as the records which are getting duplicated will not be in the output. How Union Transformation is an Active Trans? A: . if u using the UNIONtrans one Target table is enough. 'MMDDYYYY') 28. Flags new rows and changed rows. Updates changed rows in the target.After sql Trans. Updates Based on Data loading time. go with AGG..TO_CHAR (DOJ_port. in RankTrans Properties set the rank 1 only---1 row returns. In AggTrans Don’t do any column for Group--Last row returns. 27. Informatica Senarios-5 . RANK transformation. When we load flat files into target tables how do we identify duplicates? And where do load the duplicate records for further reference? How do we do chage data capture? Is this slowly changing Dimension technique? A: . We need 2 Target tables. Suppose if you have 10 records in table A and 10 records in table B. when I retrieve the table in Informatica DOJ shows with date and time . Caches the existing target as a lookup table. one for changed rows. Create two ports one is variable And assign it to 0 and another one is output port and Write the logic to increment it (o_seq=v_Seq+1) 25.Use database Sequence generator call this from stored procedure or dummy lookup Query Or You can also use expression transformation. Generates a primary key for new rows.

because there are no delimiters to check so the performance will increase.30. such as database connections for DEV. etc. How to extract original records at one target & Duplicate records at one target? A:.Source -> SQ -> Sorter ->Expression ->Router(or 2 filters) ->Targets 36. Which is costliest transformation? Costly means occupying more memory? A:. Diff B/W MAP Parameter. without using Rank Transmission? A: . without using Lookup & Sequence Generator. Session parameter is set at the session level for values that can change from sesion to session. How to list Top 10 salary. its going to maintain existing data also in cache memory 33.If you absolutely want to ensure that the second wf starts only after graceful completion of wf1 then simply add a command task for the pmcmd and use the piple to validate that the previous task is completed properly. How to run two workflow (not a sessions) sequentially. SESS ION Parameter. 3) filter the value sequence number greater than 10 35. 31. It is possible to have multiple parameters at a time? The order of execution is wf/s/m. 34. what is the process? A: -The best way is obviously to run WF1 and then call WF2 using PMCMD command in the last session of WF1 (as a post session task).Look up Transformation only.fixed width. The database connection session parameters can be created for all input fields to connection objects. How to generate Sequence? .for that we need to write a ksh shell scripts or batch command and use cmd command 32. QA and PRD environments. password.we can pass all these three types of parameters by using Perameterfile.single mapping we can have 5 sources and 5 target and we need to arrange target load Plan if dependency exists. Database connection session parameters? It’s possible to create 3parameters at a time? If Possible which one will fire FIRST? A: . A mapping parameter is set at the mapping level for values that do not change from session to session for example tax rates.use sorter--> expression-->filter 1) Sorter descend 2) use sequence generator connected expression to generator Sequence. Which gives the more performance when compare to fixed width and delimited file? And why? A: . For example. username. Is it possible to have "5 source & 5 Target" in single mapping? A:.we can declare all in one parameter file. 37. Or We can run the workflow sequentially .

How to join the two flat files using the joiner t/r if there is no matching port? A:. 38. How to run the batch using pmcmd command? A:. For both source and use that column in Join condition.Using Command task in the workflow 43. For efficiency result both are impotent. How will you remove the duplicate records from flat file without using sorter? A: -Use aggregator transformation and group by all ports and create one port for checking Count. 44 . 39... How to join 2 tables. Without source how to insert record to target? A: -Without source you can not create mapping. 41. then using that port connects the joiner Tran.. Informatica Senarios-6 39. 42.Add dummy column in expression or Source Qul. What is the difference between Oracle performance and Informatica Performance? Which performance is better? A:.oracle performance deals with the source &targets. ..What is the target load order? A:-You specify the target load order based on source qualifiers in a maping. Informatica performance deals with the Transformations. without using any condition? A:.If you have the multiple source qualifiers connected to the multiple targets. 40.Connect the source Qualifier of two different flat files to two different Exp Trans. Suppose you have 2000 records in one table and 12000 in another which one you will consider as master and detail? A:. you can designate the order in which informatica server loads data into the targets.and pass the results accordingly to target tables.A:.We will consider the one with lesser number of records as master as with this approach the Data to be cached would have to be less and hence the performance can be improved.. Create a Dummy output port in both the exp trans..using set count variable in expression transformation.

45.10 A. Explain use of update strategy transformation? A:-To flag source records as INSERT.Select distinct option 2. What is parameter file ? A:-When you start a workflow. The Informatica Server runs the workflow using the parameters in the file you specify. UPDATE or REJECT for target database.30 D.Source qualifier – Target Check. DELETE. Difference between Rank and Dense Rank? Rank: 1 2<--2nd position 2<--3rd position 4 5 Same Rank is assigned to same totals/numbers. delete or reject.sorter – Target Check. We can set or flag the records by using these two levels.Select distinct option 4. you can optionally enter the directory and name of a parameter file. 2) Within a mapping:-within a mapping we use update strategy transformation to flag the records like insert. T1 T2 10 A 10 A 20 C 20 C 30 D 40 E which transformation can be used to load data into target? Informatica Senarios-7 A:. This is a scenario in which the source has 2 cols -10 A . enclose the parameter file name in single quotes: -paramfile '$PMRootDir/myfile. Position.Source -Source qualifier -Rank-Expression -Target 47.Sorce . Or This is the important transformation. you can instruct the informatica server to either treat all the records in the same way. This is must for Incremental Data Loading.Source --Source qualifier. 1) Within a session :-When you configure the session. 46. For UNIX shell users. This is usually a Gold Ranking.txt 48.20 C. Default flag is Inserting.aggregator – Target group by-empno 3.Source --Source qualifier.'N') 5. Rank is followed by the game ususally Ranks this way.1. is used to maintain the history data or just most recent changes into the target table. Golf . update.Source -Source qualifier -Expression –Target -Source qualifier: check sorted by empno Expression:port expression in_empno var-empno=var_duplicate var_duplicate=in_emp flag= IIF(var_duplicate=var_emp.40 E.'Y'.20 C And there should be 2 targets one to show the duplicate values and another target for distinct rows.

.. the next rank follows the serial number.. . Suppose session is configured with commit interval of 10.. Assume appropriate value wherever required.000 rows it commits the data...e. Explain the commit points for Source based commit and Target based commit.i. 49. ……….000 rows.000 rows and source has 50.so.@.Dense Rank: 1 2<--2nd position 2<--3rd position 3 4 Same ranks are assigned to same totals/numbers/names. Source based commit will commit the data into target based on commit interval.000. What is the method of loading 5 flat files of having same structure to a single target and which transformations I can use? Two Methods..Let us assume that the buffer size is 6. . Target based commit will commit the data into target based on buffer size of the target. 50 ..write all files in one directory then use file repository concept(dont forget to type source file type as indirect in the session). 1.So..for every 6. 2..use union t/r to combine multiple input files into a single target.for every 10.000 rows it will commit into target. it commits the data into target when ever the buffer fills..