You are on page 1of 37

1 1. What was the complex transformation u created a.

Normalizer transformation to create multiple rows out of single row manipulating the occurrences of the Key block of the row. e.g. input employee_id 112 output " 112 112 112 dept tasks HR admin inter!iew payroll HR admin HR inter!iew HR pay roll

b. #sed $nformatica %etadata &'change (%)* (Rep_+ession_tbl_log Rep_targ_tbls Rep_targ_%apping Rep_+rc_%apping etc*!iews to e'tract source target mapping session information,s c. #sed parameter file ( .t't file stored in the ser!er file system * with input !alues for the -atch e.g. .s_m_%ap1/ 00122_3&1R42556 00122_7&R$89412 .s_m_%ap2/ 00122_3&1R42556 2. How did u implement the Update Strategy multiple Not sure " $s it : ;arget #pdate 8!erride -y default the $nformatica +er!er updates targets based on key !alues. Howe!er you can o!erride the default #791;& statement for each target in a mapping. 3ou might want to update the target based on non<key columns. =or a mapping without an #pdate +trategy transformation configure the session to mark source records as update. $f your mapping includes an #pdate +trategy transformation the ;arget #pdate option only affects source records marked as update. ;he $nformatica +er!er processes all records marked as insert delete or re>ect normally. ?hen you configure the session mark source records as data<dri!en. ;he ;arget #pdate 8!erride only affects source rows marked as update by the #pdate +trategy transformation. 8!erriding the ?H&R& 2lause 3ou can o!erride the ?H&R& clause to include non<key columns. =or e'ample you might want to update records for employees named %ike +mith only. ;o do this you edit the ?H&R& clause as follows" #791;& ;_+1@&+ +&; 91;&_+H$77&9 4 ";#.91;&_+H$77&9 ;8;1@_+1@&+ 4 ";#.;8;1@_+1@&+ ?H&R& ";#.&%7_N1%& 4 &%7_N1%& and

2 &%7_N1%& 4 ,%$K& +%$;H, &ntering a ;arget #pdate +tatement =ollow these instructions to create an update statement. ;o enter a target update statement" 1. 9ouble<click the title bar of a target instance. 2. 2lick 7roperties. A. 2lick the arrow button in the #pdate 8!erride field. B. ;he +C@ &ditor displays. B. +elect Denerate +C@. ;he default #791;& statement appears. E. %odify the update statement. 3ou can o!erride the ?H&R& clause to include non<key columns. F. 2lick 8K. 6. How did u handle error handling #se of filter to remo!e the e'isting records so only the non<e'isting will =low through. -asic error handling can be done using e'p transformation to check for the known possible errors if found gi!e appropriate label to the rows then pass them through router transformation and direct the rows to the error tables or the target. +ome of the generic checks which are common to most of the mapping e.g. check for zero null length etc can be defined as reusable e'pression transformation which then can used in different mappings. A. Strength and weakness of informatica 1. Rows are processed one at a time so calculation and checks among the rows is difficult e!en though to some e'tent !alues can be stored in !ariables for further processing. ;his causes increase in the steps reGuired to accomplish the >ob. &.g. first load the rows in a temporary table with some calculation and then make this temp table the source for another tables for further row manipulations etc. 1. Hery good for load and e'traction howe!er not so fle'ible to implement procedural programming logic. 2. 2. &!ery mapping needs a source and target so sometimes Iust end up using dummy source and target (dual tables* to build the logic. 6. No Replace character function in B.1 (E.5 has it* A. &ase of use graphical representation of the &@; process %aplets and reusable e'p transformation helps to standardize the &;@ process across the organization.

6 B. &asy for knowledge transfer and maintenance. Give an example of a complex transformation u created using Informatica

ET !rocess in recent pro"ect and error handling How #any #apping u has in Ur recent !r"$ What are %ash&oards in '( $ . like 2&8 want to see his company performance in one page or two page report like consolidated data this Guarter re!enues and form last Guarter to this Guarter growth rate and couple of graphs etc.. ?hat are my top ten customers how many new customers you got this Guarter or year etc.... What is a repository o 1 Repository is a %eta data which is stored in a centralized location can be imported and e'ported to !arious systems stores information about all the components of informatica like information about sessions mappings transformation mapplets. +ession parameters mapping parameters local and global !ariable reusable transformations What are session !arameters ) %ifference &etween pre and post session parameters !re session parameters &'"< like e'ecuting a stored procedure or a sGl Guery dropping inde'es to impro!e performance !ost Session !arameters &'"< Recreating the inde'es &%1$@ Explain the ET *rchitecture %ata Staging *rea +, ;emporary like also called operational data store one le!el down to data mart %ifference &etween ! -S. and informatica/ Why do u need informatica "ustify 7lJ+Cl is free with oracle $nformatica is e'pensi!e &asily u can load the data in a more sophisticated way using informatica # can monitor the load process +chedule +essions etc in informatica What are the %ifferences 'etween informatica 0/1 and 2/3 and 4/5 o Router ;ranformation is a!ialble from B.5 onwards o 9ebugging in 9esigner o 7artition of session in session manager o $n E.5 complete heterogeneous targets one in oracle one in db2 into multiple targets

A o 9ata partitioning run as multiple sessions in informatica E.5 o Repository +er!er (New 2omponent* o ?orkflow %anager (New 2omponent* o ?orkflow %onitor (New 2omponent* Explain the process of moving ur data to production o 1. ;hrough )ml way of mo!ing into production o 2. ;h scripts o +ystem testing en!ironment" < u will ha!e already repository in test en!ironment by 'ml way or th scripts o 7repare the scripts for ;ables !iews with constraints and inde'es. 2hange the session connections What are the tools used in ur environment for %ata #ining and %ata #odeling o 9ata %odeling "< &rwin from platinum Hisio from %icrosoft o 9ata %ining" < 1 component of 9ata ?arehousing +olution used to disco!er patterns and relationships in ur data in order to help u make better business decisions. 1lso called knowledge disco!ery in databases (K99* 8racleKs sol to this business pr of data mining is known as 8racle 9arwin or as oracleKs 9ata mining +uite. #sed to help u build predicti!e models 9ata %ining is a disco!ery process that allows users to understand the substance of and the relationships between their data &'" < ?hat is likely to happen to the -oston +ales ne't month and why: 2haracteristics of 9ata %ining are 7rospecti!e 7roacti!e information deli!ery o ;o access remote uni' ser!ers tiny tern o How did u go to oracle o #sing tiny tern %iff &etween lkp cache and unlkp cache o # can disable the option use lkp cache when u are creating a lkp tranfor o 7erformance issue if lkp is too large in size lkp cache also takes lot of space What is diff &etween data warehousing and enterprise data warehousing ?idely used by whole enterprise data warehousing ;otal D& warehouse is called enterprise data warehouse 9ata warehouse build for all business in an enterprise is enterprise data warehousing and for a single business is data warehouse and module in a data warehouse is data mart &'"< D& capital D& appliances D& electrical each has their own data data warehouse What was the challenging task u felt when u are working with informatica

What is s6l override 8!erriding a +C@ in source Gualifier or lookup for additional logic. How do u fine tune a session * mapping is running very slow how do u handle +, ?hile creating a mapping in the de!elopment process $ followed all the rules to fine tune like >oiner transformation in the beginning reducing no of e'pressions $n the beginning only $ check how much time its taking to retri!e 1t mapping le!el "< +ource sGl o!erride (e'plain plan statistics *

B ;arget we write to flat file then check the time againt to rdbms then u will know bottle neck is in the db or in the mapping ;hen we check network by calling network admin ;hen data cache inde' cache increase o How did u implement update strategy 7omplex Session scheduling in ur experience Have u used a normali8er transformation where 9o Explain the difference &etween oracle normali8ation and data warehouse denormali8ation why do we need to denormali8e in datawarehousing Re!erse engineering is for the performance i.e. remo!ing additional >oins and decreasing the load on the oltp system How did u handle error handling &;@<Row<&rrors 1lways $ used ha!e problems when $ am retrie!ing data from flat files the data is not formatted arranged properly what $ used to do is #sing sGl L loader $ pulled flat file data into data ;hey r temporarily tables then $ used these temporary tables as source then applied transformations and loaded into target then deleted the temporary tables Why did u choose this method informatica taking lot of time to load data into source using sGl L loader the >ob can be done in minutes What are the different kinds of repots u have created in '( in the recent pro"ect

?hat are the different kinds of repots u ha!e created in -8 in the recent pro>ect ;abular cross tab %aster detail crosstab master detail table multi master detail. 2an we ha!e multiple conditions in a @ookup: yes 2an we ha!e multiple conditions in a =ilter: 3&+ How the flags are called in #pdate strategy: $$= 5 < 99_$N+&R; 1< 99_#791;& 2< 99_9&@&;& 6< 99_R&I&2; $s it possible to run the session other than +er!er manager: $f so how: 3&+ #+$ND 7%2%9 ?hat is the use of power plug: =or 6rd party connectors to sap mainframe 7eoplesoft 1. ?hat are all the !ersions of 7ower %artJ7ower 2enter M 1.FJA.FJB.5JB.1 <:: 2. -usiness 8b>ects" 1. ?hen # are installing -8 with what profile you will enter:

What is Short 7ut "oin What is load &alancing $ :ail (ver$ What is clustering $ 9iff between lkp cache and unlkp cache @ook up cache $ndicates whether the @ookup transformation caches lookup !alues during the session. ?hen lookup caching is enabled the $nformatica +er!er Gueries the lookup table once caches the !alues and looks up !alues in the cache during the session. ;his can impro!e session performance. ?hen you disable caching each time a row passes into the transformation the $nformatica +er!er issues a select statement to the lookup table for lookup !alues. ?hen # are installing -8 with what profile you will enter:

?hat is the difference between inner and outer >oin $nner is self >oin and outer >oin retrie!es the information for the gi!en condition and the information that dose not satisfy the condition ?hat is the synta' difference for using outer >oin in +C@ +er!er J 8racle $n 8racle u use (N* symbol and in +C@ +er!er u use L How do you do reGuirement analysis: ;his is a big Guestion we start with learning the e'isting system and interact with the end users for their reGuirements and then decide the steps that need to go in modeling &;@ and reporting ?hat is &'ception: +ame as e'ception handler in 8racle ?hat is slowly changing 9imension: $ guess you know the ;37& 1 2 6 ?hat is +lowly Drowing 9imension: ;he dimension which dose not ha!e many changes for the warehouse e.g. Region dimension dose not ha!e many changes it may add a row in a year. 3ou ha!e a wizard in $nformatica >ust go through ;he ;arget mapping filters source rows based on user<defined comparisons and then inserts only those found to be new to the target. #se the ;arget mapping to determine which source rows are new and to load them to an e'isting target table. $n the ;arget mapping all rows are current. #se the ;arget mapping to load a fact or dimension table one in which e'isting data does not reGuire updates. =or e'ample you ha!e a site code dimension table that contains only a store name and a corresponding site code that you update only after your company opens a new store. 1lthough listed stores might closeO you want to keep the store code and name in the dimension for historical analysis. ?ith the ;arget mapping you can load new source rows to the site code dimension table without deleting historical sites. ?hat kind of ;est plan: ?hat kind of !alidation you do: $n $nformatica we create some test +C@ to compare the number or records and !alidate scripts if the data in the warehouse is loaded for the logic incorporated. ?hat is the usage of unconnectedJconnected look up:

F ?e use a lookup for connecting to a table in the source or a target. ;here are 2 ways in which a lookup can be configured i.e. connected or unconnected

3ou can configure a connected @ookup transformation to recei!e input directly from the mapping pipeline or you can configure an unconnected @ookup transformation to recei!e input from the result of an e'pression in another transformation 2onnected @ookup #nconnected @ookup

Recei!es input !alues directly from Recei!es input !alues from the result of a the pipeline. "@K7 e'pression in another transformation 3ou can use a dynamic or static cache. 3ou can use a static cache. 2ache includes all lookup columns used in the mapping (that is lookup 2ache includes all lookupJoutput ports in table columns included in the lookup the lookup condition and the lookupJreturn condition and lookup table columns port. linked as output ports to other transformations*. 2an return multiple columns from the ;he dynamic lookup cache. same row or insert into the dynamic 9esignate one return port (R*. Returns one lookup cache. column from each row $f there is no match for the lookup condition the $nformatica +er!er $f there is no match for the lookup returns the default !alue for all output condition the $nformatica +er!er returns ports. $f you configure dynamic N#@@. caching the $nformatica +er!er inserts rows into the cache. 7ass multiple output !alues to another transformation. @ink lookupJoutput 7ass one output !alue to another ports to another transformation. transformation. ;he lookupJoutputJreturn port passes the !alue to the transformation calling" @K7 e'pression. +upports user<defined default !alues. 9oes not support user<defined default !alues

?hat is comple' mapping and e'plain: ?e can say a mapping in!ol!ing many lookupsO Ioiners and comple' calculation may be called as a comple' mapping $f you canKt find what you are looking for in lookup table how do you handle: $f u has data coming from diff. sources what transformation will u use in your designer: Ioiner ;ransformation How many transformations are there: 1B types of transformations 2an u stop a session in a concurrent batch: Nope ?hat happens when u uses the delete or update or re>ect or insert statement in #r update strategy: How many sessions u can run: ;he sessions those are configured in ser!er. %a' you can is B5<155(not sure* ?hat are parameter !ariables and where do u use it: #apping !arameters 1 mapping parameter represents a constant !alue that you can define before running a session. 1 mapping parameter retains the same !alue throughout the entire session. ?hen you use a mapping parameter you declare and use the parameter in a mapping or mapplet. ;hen define the !alue of the parameter in a parameter file for the session. 9uring the session the $nformatica +er!er e!aluates all references to the parameter to that !alue. #apping ;aria&les #nlike a mapping parameter a mapping !ariable represents a !alue that can change through the session. ;he $nformatica +er!er sa!es the !alue of a mapping !ariable to the repository at the end of each successful session run and uses that !alue the ne't time you run the session. ocal varia&le" 3ou can use local that you create within a mapping in any transformation e'pression. =or e'ample if you use a comple' ta' calculation throughout a mapping you might want to write the e'pression once and designate it as a. you thereby increase performance since the $nformatica +er!er performs the calculation only once. @ocal are especially useful when used with stored procedure e'pressions to capture multiple return !alues. System varia&le 000+ess+tart;ime these are constant through out the mapping and cannot be changed. ?hat are counters: ;he performance details pro!ide that help you understand the session and mapping efficiency. &ach +ource Cualifier target definition and indi!idual transformation appears in the performance details along with that display performance information about each transformation

Q Understanding !erformance 7ounters 1ll transformations ha!e some basic that indicates the number of input rows output rows and error rows. +ource Cualifiers Normalizes and targets ha!e additional that indicates the efficiency of data mo!ing into and out of buffers. 3ou can use these to locate performance bottlenecks. +ome transformations ha!e specific to their functionality. =or e'ample each @ookup transformation has an indicates the number of rows stored in the lookup cache. ?hen you read performance details the first column displays the transformation name as it appears in the mapping the second column contains the name and the third column holds the resulting number or efficiency percentage. ?hen you partition a source the $nformatica +er!er generates one set of for each partition. ;he following performance illustrate two partitions for an &'pression transformation" Transformation &)7;R1N+ .1/ 7ounter &'pression_input rows &'pression_output rows &)7;R1N+ .2/ &'pression_input rows ;alue P P 1E

&'pression_output rows

1E

9ote+ ?hen you partition a session the number of aggregate or rank input rows may be different from the number of output rows from the pre!ious transformation. How do u set the size of block buffer: How do u pass parameters to unconn. +tored proc: 2onfigure the e'pression to send any input parameters and capture any output parameters or return !alue 3ou must know whether the parameters shown in the &'pression &ditor are input or output parameters. 3ou insert !ariables or port names between the parentheses in the e'act order that they appear in the stored procedure itself. ;he datatypes of the ports and !ariables must match those of the parameters passed to the stored procedure. =or e'ample when you click the stored procedure something similar to the following appears" "+7.D&;_N1%&_=R8%_$9(* ;his particular stored procedure reGuires an integer !alue as an input parameter and returns a string !alue as an output parameter. How the output parameter or return !alue is captured depends on the number of output parameters and whether the return !alue needs to be captured. $f the stored procedure returns a single output parameter or a return !alue (but not both* you should use the reser!ed !ariable 7R82_R&+#@; as the output !ariable. $n the pre!ious e'ample the e'pression would appear as" "+7.D&;_N1%&_=R8%_$9(in$9 7R82_R&+#@;*

15 $n$9 can be either an input port for the transformation or a !ariable in the transformation. ;he !alue of 7R82_R&+#@; is applied to the output port for the e'pression. $f the stored procedure returns multiple output parameters you must create !ariables for each output parameter. =or e'ample if you created a port called !ar8#;7#;2 for the stored procedure e'pression and a !ariable called !ar8#;7#;1 the e'pression would appears as" "+7.D&;_N1%&_=R8%_$9 (in$9 !ar8#;7#;1 7R82_R&+#@;* ;he !alue of the second output port is applied to the output port for the e'pression and the !alue of the first output port is applied to !ar8#;7#;1. ;he output parameters are returned in the order they are declared in the stored procedure itself. ?ith all these e'pressions the datatypes for the ports and !ariables must match the datatypes for the inputJoutput !ariables and return !alue. How do u implement unconn. +tored proc. $n a mapping: 2an u access a repository created in pre!ious !ersion of $nformatica: ?hat happens if the info. +er!er doesnKt find the session parameter in the parameter file:

How does run session from command line: 7%2%9 command ?hat is diff. ;hings u can do using 7%2%9: +tart +top and abort the session

3/ What are different ports in Informatica$ 1ns" $nput 8utput Hariable ReturnJRank @ookup and %aster. </ What is a ;aria&le port$ Why it is used$ 1ns" Hariable port is used to store intermediate results. Hariable ports can reference input ports and !ariable ports but not output ports. =/ What are the different connectivity Informatica uses to connect to sources) targets and the repository$ 1ns" 7owermart and 7ower2enter uses 1. Network 7rotocol 2. Nati!e 9ri!ers 6. 89-2 ;he +er!er %anager and the $nformatica +er!er use ;27J$7 or $7)J+7) to communicate to each other. 0/ %ifference &etween %ata cleansing and %ata scru&&ing$ 1ns" 9ata cleansing is a process of remo!ing errors and resol!ing inconsistencies in source data before loading data into targets. 9ata scrubbing is a process of filtering merging decoding and translating the source data to create the !alidation data for data warehouse. 2/ What are reusa&le transformations$ 7an we roll&ack this option$ 1ns" Reusable transformations can be used in multiple transformations. 8nly one transformation can be used in reusable transformation. 3ou can rollback this option. %iff &etween *ctive and passive transormation $

11 ;ransf can be acti!e or passi!e acti! tranf can change the no of rec passed th it a passi!e tranf can ne!er change the rec cnt) *ctive trsnf that might change the rec cnt are ad!an e't proc aggrega filter >oiner normalizer rank updte strategy source Gualifier) if u use powerconnect to access erp sources) erp source 6uail is also an active tranfor !assive tranf "< lookup e'pression e'ternal procedure seG generator stored procedure U can connect only 1 acti!e tranf to the same tranf or target can connect any no of pass tranf 4/ What are #applet$ 1ns" 1 mapplet is a reusable ob>ect that represents a set of transformations. $t allows you to reuse transformation logic and can contain as many transformation as you need. 1/ What are mapping parameters and varia&les$ 1ns" 1 mapping parameter represents a constants !alue that you can define before running a session. 1 mapping parameter retains the same !alue throughout the entire session. #nlike a mapping parameter a mapping !ariable represents a !alue that can change through the session. >/ How many transformations have you used$ ?/ What is *ggregate transformation 1ns" 1n aggregator transformation allows you to perform aggregate calculations such as a!erage and sums. ;he 1ggregator transformation is unlike the &'pression transformation in that you can use the 1ggregator transformation to perform calculations on groups. 35/ What is @outer Transformation$ How is it different from :ilter transformation$ 1ns" 1 Router transformation is similar to a =ilter transformation because both transformations allow you to use a condition to test data. 1 =ilter transformation tests data for one condition and drops the rows of data that do not meet the condition. Howe!er a router transformation tests data for one or more conditions and gi!es you the option to route rows of data that do not meet any of the conditions to default output group. 33/ What are connected and unconnected transformations$ 1ns" 2onnected transformations are the transformation which are in the data flow whereas unconnected transformation will not be in the data flow. ;hese are dealt in @ookup and +tored procedure transformations. 3</ What is 9ormali8er transformation$ 1ns" Normalizer transformation normalizes records from 28-8@ and relational sources allowing you to organize the data according to your needs. 1 normalizer transformation can appear anywhere in a data flow when you normalize a relational source. 3=/ How to use a se6uence created in (racle in Informatica$ 1ns" -y using +tored procedure transformation. 30/ What are source 6ualifier transformations$ 1ns" ;he source Gualifier represents the records that the $nformatica +er!er reads when it runs a session. 32/ What is a dimension$ 1ns" 1 set of le!el properties that describe a specific aspect of a business used for analyzing the factual measures of one or more cubes which use that dimension. &'" geography time customer and product. 34/ What is a A# source 6ualifier$ 1ns" ;he )%@ source Gualifier represents the data elements that the $nformatica ser!er reads when it runs a session with )%@ sources. 31/ What is a cu&e$

12 1ns" 1 set of related factual measures aggregates and dimensions for a specific dimensional analysis problem. &'" regional product sales. 3>/ What is oad #anager process$ 1ns" @oad manager is the primary $nformatica ser!er process. $t performs the following tasks" a. %anages sessions and batch scheduling. b. @ocks the sessions and reads properties. c. Reads parameter files. d. &'pands the ser!er and session !ariables and parameters. e. Herifies permissions and pri!ileges. f. Halidates sources and targets code pages. g. 2reates session log files. h. 2reates 9ata ;ransformation %anager (9;%* process which e'ecutes the session. 3?/ %efine a session$ 1ns" 1 session is a set of instructions that describes how and when to mo!e data from source to targets <5/ What are pmcmd commands$ 1ns" pmcmd is a command line program to communicate with the $nformatica ser!er. ;his does not replace the ser!er manager since there are many tasks that you can perform only with ser!er %anager. <3/ What are cache and their types in Informatica$ 1ns" ;he $nformatica ser!er creates index and data cache for aggregator Rank >oiner and @ookup transformations in a mapping. ;he $nformatica ser!er stores key !alues in the inde' cache and output !alues in the data cache. <</ What are minimum and maximum values for index and data cache$ 1ns" $nde' cache" %in" 12%- %a'" 2A %9ata cache" %in" 12%%a'" 2A%<=/ What is default &lock &uffer si8e$ 1ns" EAK <0/ What is default # shared memory si8e$ 1ns" 2%<2/ What is an incremental aggregation$ 1ns" $n $ncremental aggregation you apply captured changes in the source to aggregate calculations in a session. $f the source changes only incrementally and you can capture changes you can configure the session to process only those changes. ;his allows the $nformatica ser!er to update your target incrementally rather than forcing it to process the entire source and recalculate the same calculation each time you run the session. <4/ What is @e"ect loading$ 1ns" 9uring a session the $nformatica ser!er creates a re>ect file for each target instance in the mapping. $f the writer or the target re>ects data the $nformatica ser!er writes the re>ected row into re>ect file. ;he re>ect file and session log contain information that helps you determine the cause of the re>ect. 3ou can correct re>ect files and load them to relational targets using the $nformatica re>ect load utility. ;he re>ect loader also creates another re>ect file for the data that the writer or target re>ect during the re>ect loading. <1/ How many servers instances can you register to a repository$ 1ns"

16 2P. ?hat was the size of your warehouse 2Q. ?hat is +tar +chema 65. ?hat is the difference between inner and outer >oin 61. How big was your fact table 62. How did you handle performance issues $f you ha!e data coming in from multiple sources >ust walk thru the process of loading it into the target 66. How will u con!ert rows into columns or columns into rows 6A. 9o you ha!e any e'perience in 887 6B. ?hat is the synta' difference for using outer >oin in +C@ +er!er J 8racle 6E. ?hat is a repository ;he place where u store the metadata is called a repository 1. ?hat is the difference between 7ower2enter and 7ower%art : 7ower 2enter and 7ower mart call the instructions to get data change it where to write the information 9ata %art 7ower2enter 2an create @ocal Repository which could be upgraded to a Dlobal Repository. 2an 2onnect to &nterprise 9ata (+17JR6 +$&-&@ H1N;$H& $-% %C +eriesRR 1bility to register multiple ser!ers share metadata across repositories and partition data 7ower%art =eature is not a!ailable =eature is not a!ailable 1ll features e'cept distributed metadata multiple registered ser!ers and data partitioning

</ What are the max num&er of sessions that you could configure in Informatica Server $ 3ou can set three parameters in the $nformatica +er!er configuration that control how the @oad %anager allocates memory to sessions"

#axSessions/ ;he ma'imum sessions parameter indicates the ma'imum number of session slots a!ailable to the @oad %anager at one time for running or repeating sessions. =or e'ample if you select the default %a'+essions of 15 the @oad %anager allocates 15 session slots. ;his parameter helps you control the number of sessions the $nformatica +er!er can run simultaneously. #Shared#emory/ ;he @oad %anager shared memory parameter is set in con>unction with the ma'imum sessions parameter to ensure that the @oad %anager has enough memory for each session. ;he @oad %anager reGuires appro'imately 255 555 bytes of shared memory for each session slot. ;he default setting is 2 555 555 bytes. =or each increase of 15 sessions in the %a'+essions setting you need to increase @%+hared%emory by 2 555 555 bytes. Beep@epeatSessionInShm/ ;he Keep Repeating +essions in +hared %emory option determines how the @oad %anager beha!es when there are no a!ailable slots for sessions to run. -y default this option is disabled and the @oad %anager swaps repeating sessions out of session slots to run new session reGuests. $f all session slots are filled with running sessions the

1A @oad %anager places new session reGuests in a waiting state until it can open a session slot. $f you enable this option the @oad %anager retains repeating sessions in shared memory and fails to run new session reGuests. 6. ?hat is unconnected transformation: transformation: ?hat is the ad!ance of using #nconnected

#nconnected transformation is not part of the data flow ;he $nformatica +er!er Gueries the @ookup table based on the logic of the e'pression calling the lookup. #nconnected @ookup ;ransformation to check the record whether it e'ists in the target database or not.

A. ?hat are the steps in!ol!ed in the migration from older !ersion to newer !ersion of $nformatica +er!er: B. Ha!e you worked in con>unction with 8racle 9-1(s* and #N$) administrators: E. Ha!e you worked with 8racle front<end tools: F. ?hat are the main features of 8racle Pi with conte't to datawarehouse: P. 1re you comfortable with -usiness 8b>ects: Q. ?hat kind of analysis you did in the e'isting 9ata warehouse en!ironment: 15. ?hat problem you faced and what were your recommendations (based on your last pro>ect*: 11. How many dimension tables and fact tables were there in your datawarehouse: $f you canKt find what you are looking for in lookup table how do you handle: ?hat is +tar schema J +now =lake schema: 1.What are Target Types on the Server? 1ns" ;arget ;ypes are =ile Relational and &R7 2.What are Target Options on the Servers? 1ns" ;arget 8ptions for =ile ;arget type are =;7 =ile @oader and %C ;here are no target options for &R7 target type ;arget 8ptions for Relational are $nsert #pdate (as #pdate* #pdate (as $nsert* #pdate (else $nsert* 9elete and ;runcate;able. 3.How do you identify existing rows of data in the target table using loo up transfor!ation? 1ns" 2an identify e'isting rows of data using unconnected lookup transformation. ".What are #ggregate transfor!ation? 1ns" 1ggregator transformation allows you to perform aggregate calculations such as a!erages and sums. $.What are various types of #ggregation? 1ns" Harious types of aggregation are +#% 1HD 28#N; %1) %$N =$R+; @1+; %&9$1N 7&R2&N;$@& +;99&H and H1R$1N2&. %.What are &i!ensions and various types of &i!ensions? 1 set of le!el properties that describe a specific aspect of a business used for analyzing the factual measures of one or more cubes which use that dimension. &gs. Deography time customer and product.

1B '.What are 2 !odes of data !ove!ent in (nfor!ati)a Server? Ans: The data movement mode depends on whether Informatica Server should process single byte or multi-byte character data. This mode selection can affect the enforcement of code page relationships and code page validation in the Informatica Client and Server. a* #nicode M $+ allows 2 bytes for each character and uses additional byte for each non<ascii character (such as Iapanese characters* b* 1+2$$ M $+ holds all data in a single byte ;he $+ data mo!ement mode can be changed in the $nformatica +er!er configuration parameters. ;his comes into effect once you restart the $nformatica +er!er. *.What is +ode ,age +o!patibility? 1ns" 2ompatibility between code pages is used for accurate data mo!ement when the $nformatica +e!er runs in the #nicode data mo!ement mode. $f the code pages are identical then there will not be any data loss. 8ne code page can be a subset or superset of another. =or accurate data mo!ement the target code page must be a superset of the source code page. Superset - A code page is a superset of another code page when it contains the character encoded in the other code page. It also contains additional characters not contained in the other code page. Subset - A code page is a subset of another code page when all characters in the code page are encoded in the other code page. ..What is +ode ,age used for? 1ns" 2ode 7age is used to identify characters that might be in different languages. $f you are importing Iapanese data into mapping then u must select the Iapanese code page for the source data. 1/.What is 0outer transfor!ation? 1ns" Router transformation allows you to use a condition to test data. $t is similar to filter transformation. $t allows the testing to be done on one or more conditions. 11. What is 1oad 2anager : 1ns. ;he load %anager is the 7rimary informatica +er!er 7rocess. $t 7erforms the following tasks < %anages session and batch scheduling. @ocks the session and read session properties. Reads the parameter file. &'pand the ser!er and session !ariables and parameters. Herify permissions and pri!ileges. Halidate source and target code pages. 2reate the session log file. 2reate the 9ata ;ransformation %anager which e'ecute the session. 12. What is &ata Transfor!ation 2anager? 1ns. 1fter the load manager performs !alidations for the session it creates the 9;% process. ;he 9;% process is the second process associated with the session run. ;he primary purpose of the 9;% process is to create and manage threads that carry out the session tasks. ;he 9;% allocates process memory for the session and di!ide it into buffers. ;his is also known as buffer memory. $t creates the main thread which is called the master thread. ;he master thread creates and manages all other threads. $f we partition a session the 9;% creates a set of threads for each partition to allow concurrent processing.. ?hen $nformatica ser!er writes messages to the session log it includes thread type and thread $9. =ollowing are the types of threads that 9;% creates"

1E

#*STE@ TH@E*% < %ain thread of the 9;% process. 2reates and manages all other threads. #*!!I9G TH@E*% < 8ne ;hread to &ach +ession. =etches +ession and %apping $nformation. !re *nd !ost Session Thread < 8ne ;hread &ach ;o 7erform 7re 1nd 7ost +ession 8perations. @E*%E@ TH@E*% < 8ne ;hread for &ach 7artition for &ach +ource 7ipeline. W@ITE@ TH@E*% < 8ne ;hread for &ach 7artition $f ;arget &'ist $n ;he +ource pipeline ?rite ;o ;he ;arget. T@*9S:(@#*TI(9 TH@E*% < 8ne or %ore ;ransformation ;hread =or &ach 7artition.

13. WH#T (S S3SS(O4 #4& 5#T+H3S? 1ns. SESSI(9 < 1 +ession $s 1 set of instructions that tells the $nformatica +er!er How 1nd ?hen ;o %o!e 9ata =rom +ources ;o ;argets. 1fter creating the session we can use either the ser!er manager or the command line program pmcmd to start or stop the session. '*T7HES < $t 7ro!ides 1 ?ay to Droup +essions =or &ither +erial 8r 7arallel &'ecution -y ;he $nformatica +er!er. ;here 1re ;wo ;ypes 8f -atches " 1. SE.UE9TI* < Run +ession 8ne after the 8ther. 2. 7(97U@@E9T < Run +ession 1t ;he +ame ;ime. 1". What is a sour)e 6ualifier? 1ns" $t represents all data Gueried from the source. 1$. Why we use loo up transfor!ations? 1ns" @ookup ;ransformations can access data from relational tables that are not sources in mapping. ?ith @ookup transformation we can accomplish the following tasks" a* Get a related value < Det the &mployee Name from the &mployee table based on the &mployee $9 &C !erform 7alculation c* Update slowly changing dimension ta&les , ?e can use unconnected lookup transformation to determine whether the records already e'ist in the target or not. Informatica + 1. 9ifference between 7ower 2enter and %art: 2. ?hat are the new features of 7ower 2enter B.5: 6. How to run a session which contains mapplet: A. 9ifferentiate between @oad %anager and 9;%: 'usiness (&"ects+ 1. ?hat is #ni!erse : 2. 9efine @oops and how to resol!e them : 6. ?hat were the errors which you encountered most while running -8 reports : A. How to schedule -8 Reports : B. ?hat are the methods a!ailable in 9ocument class of the ?eb<$ +9K : E. ?hat is the ob>ecti!e behind 7artitioning in 8racle : 7(%E !*GE (;E@;IEW 1 code page contains the encoding to specify characters in a set of one or more languages. 1n encoding is the assignment of a number to a character in the character set. 3ou use code pages to

1F identify data that might be in different languages. =or e'ample if you are importing Iapanese data into a mapping you must select a Iapanese code page for the source data. ;o change the language to &nglish and reGuire the system to use the @atin1 code page in #N$) e'ecute the following command. setenv *9G enDUS/iso>>2?3 If Eou *re Using !ower7enter ?ith 7ower2enter you recei!e all product functionality including the ability to register multiple ser!ers share metadata across repositories and partition data. 1 7ower2enter license lets you create a single repository that you can configure as a global repository the core component of a data warehouse. ?hen this guide mentions a 7ower2enter +er!er it is referring to an $nformatica +er!er with a 7ower2enter license. If Eou *re Using !ower#art ;his !ersion of 7ower%art includes all features e'cept distributed metadata multiple registered ser!ers and data partitioning. 1lso the !arious options a!ailable with 7ower2enter (such as 7ower2enter $ntegration +er!er for -? 7ower2onnect for $-% 9-2 7ower2onnect for $-% %C+eries 7ower2onnect for +17 RJ6 7ower2onnect for +iebel and 7ower2onnect for 7eople+oft* are not a!ailable with 7ower%art. ?hen this guide mentions a 7ower%art +er!er it is referring to an $nformatica +er!er with a 7ower%art license. 9ormali8er Transformation+ Normalization protects the data and makes the database more fle'ible by eliminating redundancy and inconsistent dependencies. #sed mainly with 28-8@ sources. ?ith relational sources used to create multiple rows from a single row of data. #se a single Normalizer ; to handle multiple le!els of denormalization in the same record. =or e'ample a single record might contain two different detail record sets. Rather than using two Normalizer ; to handle the two different detail record sets you handle both normalizations in the same transformation. 7i!oting can be done. 2hanging columns to rows and !ice !ersa. 1 Normalizer column id is created for 822#R+ 7i!oting used to identify the 2olumns. ?hen we are importing 28-8@ and Normalizer ; automatically created and we canFt modify the field definitions in the 9ormali8er T. ?e ha!e to modify only in the +ource 1nalyzer.

1P Setting Reset Restart ;racing le!el %escription $f selected the $nformatica +er!er resets the generated key !alue after the session finishes to its original !alue. $f selected the $nformatica +er!er restarts the generated key !alues from 1 e!ery time the session runs. 9etermines the amount of information about this transformation that the $nformatica +er!er writes to the session log when it runs the session. 3ou can o!erride this tracing le!el when you configure a session.

7an I use my existing !ower#art 0/5-!ower7enter 3/5 mappings that contain 7('( sources$ $n 7ower%art A.5 and 7ower2enter 1.5 the 9esigner did not support R&9&=$N& statements in 28-8@ sources or copybooks. $f your 28-8@ sources included R&9&=$N& statements in older mappings and you implemented workarounds you do not need to edit your e'isting mappings or sessions. Howe!er if you want to impro!e session performance you can either create a new mapping or edit the e'isting mapping by reimporting the 28-8@ sources and modifying any filter transformations or target as needed. 3ou can then create one session to load the data from the 28-8@ source. 7annot I edit the ports in my 9ormali8er transformation when using a relational source$ ?hen you create ports manually you must do so on the Normalizer tab in the transformation not the 7orts tab. Source .ualifier Transformation+ ?hen you add a relational or a flat file source definition to a mapping you need to connect it to a +ource Cualifier transformation. The Source .ualifier represents the records that the Informatica Server reads when it runs a session. ;o >oin data originating from the same 9-. =ilter records in the +ource itself. ;o specify an outer >oin instead of a default inner >oin. ;o specify sorter ports. ;o select distinct !alues from the source. ;o create a custom Guery to issue a special select statement for the $nformatica ser!er to read source data. =or e'ample we might use a custom Guery to perform aggregate calculations or e'ecute a stored procedure.

1Q %onFt alter the data types in the Source .ualifier. $f we ha!e connected multiple +C to multiple targets we can designate the order in which the targets are loaded. $f we connect one +C to multiple targets we can enable constraint<based loading in a session to ha!e the $+ load data based on target table primary and foreign key relationships. ;he $+ reads only the columns in +C that are connected to other transformations. $nformatica +er!er Informatica server moves data from source to target &ased on mapping and session metadata stored in a repository/ * mapping is a set of source and target definitions linked &y transformation o&"ects that define the rules for data transformation/ * session is a set of instructions that descri&es how and when to move data from source to targets/ Session ,ro)ess The Informatica server uses &oth process memory and system shared memory to perform these tasks/ It runs as a daemon on unix and as a service on Windows 9T-<555/ ;he load %anager process. +tart the session creates the 9;% process and sends post<session email when the session completed. ;he 9;% process (9ata ;ransformation %anager*. 2reates threads to initialize the session read write and transform data and handle pre< and post<session operations.

7artitioning 9ata ;he $nformatica ser!er can achie!e high performance by partitioning source data and performing the e'tract transformation and load for each partition in parallel.

2onfigure the session to partition source data. $nstall the $nformatica ser!er on a machine with multiple 27#s.

?hen you partition a session configure partitioning based on source Gualifiers in a mapping. =or relational source the $nformatica ser!er creates multiple database connections to a single source and e'tracts a separate range of data for each connection. =or )%@ or file sources the $nformatica ser!er reads multiple files concurrently. ?hen the $nformatica +er!er loads relational data it loads relational dataO it creates multiple database connections to the target and loads partitions of data concurrently. ?hen the $nformatica ser!er loads data to file targets it creates a separate file for each partition.

25 Update strategy ?hen you design the data warehouse you need to decide what type of information to store in targets. 1s part of your target table design you need to determine whether to maintain all the historic data or >ust the most recent changes. $n 7ower%art and 7ower 2enter you set your update strategy at two different le!els" ?ithin a session. ?hen you configure a session you can instruct the ser!er to either treat all records in the same way (for e'ample treat all records as inserts* or use instructions coded into the session mapping to flag records for different database operations. ?ithin a mapping. ?ithin a mapping you use the #pdate +trategy transformation to flag records for insert delete update or re>ect. =ollow these steps to define an update strategy" 1. ;o control how records are flagged for insert update delete or re>ect within a mapping add an #pdate +trategy transformation to the mapping. #pdate +trategy transformations are essential if you want to flag records destined for the same target for different database operations or if you want to re>ect records. 2. 9efine how to flag records when you configure a session. 3ou can flag all records for insert delete or update or select the 9ata 9ri!en option where the $nformatica +er!er follows instructions code into #pdate +trategy transformations within the session mapping. 6. 9efine insert update and delete options for each target when you configure a session. 8n a target<by<target basis you can allow or disallow inserts and deletes and you can choose three different ways to handle updates. ?hen you configure a session you ha!e se!eral options for handling specific database operations including updates. +pecifying an operation for all rows +etting $nsert 9elete #pdate 9ata 9ri!en 9escription ;reat all records as inserts. $f inserting the record !iolates a primary or foreign key constraint in database the ser!er re>ect the records ;reat all records as deletes. =or each record if the ser!er finds a corresponding record in the target table the ser!er deletes it. ;reat all records as updates. =or each e'isting records ser!er updates the record. ;he informatica ser!er follows instructions code into update strategy transformations within the session mapping to determine how to flag records for insert delete update or re>ect.

+pecifying operations for indi!idual ;arget ;ables. $nsert. +elect this option to insert a row into a target table. 9elete. +elect this option to delete a record from a table. #pdate. 3ou ha!e three different options.

21 1. #pdate as update. #pdate each record flagged for update if e'ists in the target table. 2. #pdate as insert. $nsert each record flagged for update. 6. #pdate else insert. #pdate the record if it e'ist. 8therwise insert it. ;runcate ;able. +elect this option to truncated the target table before loading data. =lagging Records within a %apping. =ollowing are database operation 8peration $nsert #pdate 9elete Re>ect 2onstant 99_$N+&R; 99_#791;& 99_9&@&;& 99_R&I&2; Numeric Halue 5 1 2 6

;he ser!er treats any other !alue as an insert. #pdate strategy transformation is freGuently the first transformation in a mapping before data reaches a target table. 3ou can use the #pdate strategy transformation to determine how to flag that record. @ater when you configure a session based on this transformation you can determine what to do with records flagged for insert delete or update. =orwarding Re>ected Rows 3ou can configure the #pdate strategy transformation to either pass re>ected rows to the ne't transformation or drop them. -y default the informatica ser!er forwards re>ected rows to the ne't transformation. #pdate +trategy &'pression. =reGuently the update strategy e'pression uses the $$= or 9&289&+ function from the transformation language to test each record to see if it meets a particular condition. 8n the condition you can assign a numeric code to flag it for a particular database operation. 1ggregator and update strategy transformation !osition the *ggregator &efore the Update Strategy Transformation. $n this case you perform the aggregate calculation and then used the update strategy transformation to flag records that contain the results of this calculation for insert delete or update. !osition the aggregator after the update strategy transformation. Here you flag records for insert delete update or re>ect before you perform the aggregate calculation. How you flag a particular record determines how the aggregator transformation treats any !alues in that record used in the calculation.

22 ;his beha!ior has changed since 7ower%art 6.B. $n !ersion 6.B the informatica ser!er performed all aggregate calculations before it flagged records for insert update delete or re>ect through a data dri!en e'pression

1.What are Target Types on the Server? 1ns" ;arget ;ypes are =ile Relational and &R7 2.What are Target Options on the Servers? 1ns" ;arget 8ptions for =ile ;arget type are =;7 =ile @oader and %C ;here are no target options for &R7 target type ;arget 8ptions for Relational are $nsert #pdate (as #pdate* #pdate (as $nsert* #pdate (else $nsert* 9elete and ;runcate ;able. 3.How do you identify existing rows of data in the target table using loo up transfor!ation? 1ns" 2an identify e'isting rows of data using unconnected lookup transformation. ".What are #ggregate transfor!ation? 1ns" 1ggregator transformation allows you to perform aggregate calculations such as a!erages and sums. $.What are various types of #ggregation? 1ns" Harious types of aggregation are +#% 1HD 28#N; %1) %$N =$R+; @1+; %&9$1N 7&R2&N;$@& +;99&H and H1R$1N2&. %.What are &i!ensions and various types of &i!ensions? 1 set of le!el properties that describe a specific aspect of a business used for analyzing the factual measures of one or more cubes which use that dimension. &gs. Deography time customer and product. '.What are 2 !odes of data !ove!ent in (nfor!ati)a Server? Ans: The data movement mode depends on whether Informatica Server should process single byte or multi-byte character data. This mode selection can affect the enforcement of code page relationships and code page validation in the Informatica Client and Server. c* #nicode M $+ allows 2 bytes for each character and uses additional byte for each non<ascii character (such as Iapanese characters* d* 1+2$$ M $+ holds all data in a single byte ;he $+ data mo!ement mode can be changed in the $nformatica +er!er configuration parameters. ;his comes into effect once you restart the $nformatica +er!er.

26

*.What is +ode ,age +o!patibility? 1ns" 2ompatibility between code pages is used for accurate data mo!ement when the $nformatica +e!er runs in the #nicode data mo!ement mode. $f the code pages are identical then there will not be any data loss. 8ne code page can be a subset or superset of another. =or accurate data mo!ement the target code page must be a superset of the source code page. Superset - A code page is a superset of another code page when it contains the character encoded in the other code page. It also contains additional characters not contained in the other code page. Subset - A code page is a subset of another code page when all characters in the code page are encoded in the other code page. ..What is +ode ,age used for? 1ns" 2ode 7age is used to identify characters that might be in different languages. $f you are importing Iapanese data into mapping then u must select the Iapanese code page for the source data. 1/.What is 0outer transfor!ation? 1ns" Router transformation allows you to use a condition to test data. transformation. $t allows the testing to be done on one or more conditions. $t is similar to filter

11. What is 1oad 2anager : 1ns. ;he load %anager is the 7rimary informatica +er!er 7rocess. $t 7erforms the following tasks < %anages session and batch scheduling. @ocks the session and read session properties. Reads the parameter file. &'pand the ser!er and session !ariables and parameters. Herify permissions and pri!ileges. Halidate source and target code pages. 2reate the session log file. 2reate the 9ata ;ransformation %anager which e'ecute the session. 12. What is &ata Transfor!ation 2anager? 1ns. 1fter the load manager performs !alidations for the session it creates the 9;% process. ;he 9;% process is the second process associated with the session run. ;he primary purpose of the 9;% process is to create and manage threads that carry out the session tasks. ;he 9;% allocates process memory for the session and di!ide it into buffers. ;his is also known as buffer memory. $t creates the main thread which is called the master thread. ;he master thread creates and manages all other threads.

2A $f we partition a session the 9;% creates a set of threads for each partition to allow concurrent processing.. ?hen $nformatica ser!er writes messages to the session log it includes thread type and thread $9. =ollowing are the types of threads that 9;% creates" #*STE@ TH@E*% < %ain thread of the 9;% process. 2reates and manages all other threads. #*!!I9G TH@E*% < 8ne ;hread to &ach +ession. =etches +ession and %apping $nformation. !re *nd !ost Session Thread < 8ne ;hread &ach ;o 7erform 7re 1nd 7ost +ession 8perations. @E*%E@ TH@E*% < 8ne ;hread for &ach 7artition for &ach +ource 7ipeline. W@ITE@ TH@E*% < 8ne ;hread for &ach 7artition $f ;arget &'ist $n ;he +ource pipeline ?rite ;o ;he ;arget. T@*9S:(@#*TI(9 TH@E*% < 8ne or %ore ;ransformation ;hread =or &ach 7artition.

13. WH#T (S S3SS(O4 #4& 5#T+H3S? 1ns. SESSI(9 < 1 +ession $s 1 set of instructions that tells the $nformatica +er!er How 1nd ?hen ;o %o!e 9ata =rom +ources ;o ;argets. 1fter creating the session we can use either the ser!er manager or the command line program pmcmd to start or stop the session. '*T7HES < $t 7ro!ides 1 ?ay to Droup +essions =or &ither +erial 8r 7arallel &'ecution -y ;he $nformatica +er!er. ;here 1re ;wo ;ypes 8f -atches " 6. SE.UE9TI* < Run +ession 8ne after the 8ther. A. 7(97U@@E9T < Run +ession 1t ;he +ame ;ime. 1". What is a sour)e 6ualifier? 1ns" $t represents all data Gueried from the source. 1$. Why we use loo up transfor!ations? 1ns" @ookup ;ransformations can access data from relational tables that are not sources in mapping. ?ith @ookup transformation we can accomplish the following tasks" d* Get a related value < Det the &mployee Name from the &mployee table based on the &mployee $9 eC !erform 7alculation f* Update slowly changing dimension ta&les , ?e can use unconnected lookup transformation to determine whether the records already e'ist in the target or not.

7ontrasting a %ata Warehouse with an ( T! System =igure 1M1 illustrates some of the key differences between a data warehouseKs model and an 8@;7 systemKs.

2B 7igure 181 +ontrasting O1T, and &ata Warehousing 3nviron!ents

8ne ma>or difference between the types of system is that data warehouses are not usually in third< normal form. 9ata warehouses and 8@;7 systems ha!e !astly different reGuirements. Here are some e'amples of the notable differences between typical data warehouses and 8@;7 systems" ?orkload 9ata warehouses are designed to accommodate ad hoc Gueries. ;he workload of a data warehouse may not be completely understood in ad!ance and the data warehouse is optimized to perform well for a wide !ariety of possible Guery operations. 8@;7 systems support only predefined operations. ;he application may be specifically tuned or designed to support only these operations. 9ata %odifications ;he data in a data warehouse is updated on a regular basis by the &;; process (often e!ery night or e!ery week* using bulk data<modification techniGues. ;he end users of a data warehouse do not directly update the data warehouse. $n an 8@;7 system end users routinely issue indi!idual data<modification statements in the database. ;he 8@;7 database is always up< to<date and reflects the current state of each business transaction. +chema 9esign 9ata warehouses often use denormalized or partially denormalized schemas (such as a star schema* to optimize Guery performance. 8@;7 systems often use fully normalized schemas to optimize updateJinsertJdelete performance and guarantee data consistency. ;ypical 8perations 1 typical data warehouse Guery may scan thousands or millions of rows. =or e'ample S=ind the total sales for all customers last month.S 1 typical 8@;7 operation may access only a handful of records. =or e'ample SRetrie!e the current order for a gi!en customer.S Historical 9ata 9ata warehouses usually store many months or years of historical data. ;his is to support historical analysis of business data. 8@;7 systems usually store only a few weeks, or months, worth of data.

2E ;he 8@;7 system only stores as much historical data as is necessary to successfully meet the current transactional reGuirements. Typical %ata Warehouse *rchitectures 1s you might e'pect data warehouses and their architectures can !ary depending upon the specifics of each organization,s situation. =igure 1M2 shows the most basic architecture for a data warehouse. $n it a data warehouse is fed from one or more source systems and end users directly access the data warehouse. :igure 3G< Typical *rchitecture for a %ata Warehouse

=igure 1M6 illustrates a more comple' data warehouse en!ironment. $n addition to a central database there is a staging system used to cleanse and integrate data as well as multiple data marts which are systems designed for a particular line of business.

2F

What are confirmed dimensions 2onfirmed dimensions are linked to multiple fact tables 91 =;1 96 9A 92 91 =;2 96 92 9B =;6

9ata modeling tool "< &rwin by computer associates Re!erse engineering "< %odifying e'isting tables =orwarding "< creating tables ( u create the sGl scripts and e'ecute* Explain the %ifference &etween ( *!) #( *!)@( *! )H( *! 8@;7 (online transaction processing system* stores day to day transactions 9? stores historical 9ata. 8@;7 data is in normalized form multiple tables >oins where u run a Guery comple' >oins takes a lot of time 9? is a denormalized one or more fact tables each connected to multiple dimension tables (+tar +chema* +now =lake +chema"< ?hen u ha!e multiple lookup tables u can go for snowflake schema (7artial Normalized * when u ha!e large !olume of data in dimension tables &'"< ?hen u ha!e a country table to be linked to both customers resot u can create a lkp country table so when u make any changes to ctry table u donKt need to change in multiple dimension tables

2P R8@17"< +tar +now =lake +chema %8@17"<%ultidimensional data is stored in hirarechical way (2ustomer<T Region<T2ountry< T2ontinent* 9ata is stored in the form of cubes . %8@17 is !ery e'pensi!e and comple' design ?hen u need results !ery !ery Guickly response time is more in %8@17 H8@17 also called as Hybrid 8@17 combination of both R8@17 and %8@17 ?hat is U1-8(Uero based administration* ?hat is 1$+ (177lication $ntegration +oftware* Name +ome 2leansing ;ools $nformatica can be used as a cleansing tool 1!inisha 9ata stage ?hat is +licing and dicing &'plain linking uni!erses ;o link the uni!erses u must ha!e e'ported to where the kernel uni!erse at lease once otherwise the designer does not allow the link %ata Warehousing What is %ata warehousing 1 9? is a 9- used for Guery analysis and reporting . -y definition 9? is a sub>ect oriented intergrated non !olatile and time !ariant +ub>ect 8riented"< Represents a sub>ect 1read like sales %ktg $ntegrated "< 9ata 2olleted from multiple source systems integrated into a user readable uniGue format &'"< male female 5 1 % = ; = Non Holatile "< 9w stores historical data ;ime Hariant "< +tores data timewise like weekly monthly Guarterly yearly (racle .uery (ptimi8ation H!erformance tuning S. C 8racle currently (as of 8racleQi Release 2* pro!ides 2 different optimization approaches. ;hese are 2ost -ased 8ptimization (2-8* and Rule -ased 8ptimization (R-8*. ;he 2-8 bases optimization choices on pre<gathered table and inde' statistics while the R-8 makes it,s decisions based on a set of ,best practice, rules and does not rely on any statistical information. 2-8,s reliance on statistics makes it !astly more fle'ible than the R-8 since as long as up to date statistics are maintained it will accurately reflect real data !olumes. ;he R-8 is 8racle,s legacy optimizer and is to be desupported in 8racle15i. What is dynamic caching 8ne $mportant feature of $nformatica E.2 is dynamic lookup cache dynamic means.......for e'ample if you are using lookup transformation and lookup cache for a particular table

2Q so this lookupcache will read the table at a time and will keep in the memory in the form of lookup cache but this is generally under assumption you the lookup table is not getting changed during that time let,s if the lookup table !alues getting changed during that time this lookup cache will not hold most recent !alues ?hile creating aggregator transformation ha!e u enabled sorted input option what e'actly it does &'plain the process so instead of standard lookup cache you use dynamic cache in that case ?hat is the size of ur repository : 15 DHow many source tables u ha!e 65 to A5 (;arget database E5 D-* How many mapping u ha!e 25 How many sessions -atches "< A<B nested +essions "< 25<2B $ndi!idual "< depencies $ put in seGuential (e'"< first one session ne't session ne't a batch* How do u sechedule sessions 8n 9emand (%anual* ;ime based &!ent based How do u handle session failure $ run from reco!ery mode 7%2%9 run (if error occurs session stops where!er it is stopped it continues* 9ashboards"< D& =rom one pr> how much re!enue generated how well u did a process (Draphical representation* -o Reporting (#se graphs* What does sorted input option in aggregator transformation does $mpro!es session performance but the data has to be passed into aggregator transformation must be sorted before (if use sorted input option and data is not sorted before session fails* ?hen u pass presorted data e'"< 151 if A records whene!er it finds new no 152 it performs the calculation ?hen the sorted input is not selected the $nformatica ser!er performs aggregate calculations as it reads the $nformatica ser!er stores data for each group until it reads the entire source to ensure all aggregate calculations are accurate ?hen sorted input option is selected $nformatica assumes all data is sorted by group as it reads rows for a group performs aggregate calculations as it reads

65 How did u do increment load What is 7lick Stream In %ata Warehousing 2lick stream is basically web based data warehousing analysis basically e<web intelligence When to index a particular column on what percentage Denerally if you are creating ordinary b<tree inde'.........if you know that that columns will be mostly reference in select statement if your select statement is retrie!ing more than 65 V rows then...it,s better not to use inde' inde' will be used out of the table when you are retrie!ing smaller no. of rows if you retrie!ing more no. of rows. you can read the table directly rather than using the inde'(=ull table scan* ST*@ .UE@E transformer generally used to impro!e the Guery performance in generally star schema models ;he star transformation is a cost<based Guery transformation aimed at e'ecuting star Gueries efficiently. ?hereas the star optimization works well for schemas with a small number of dimensions and dense fact tables the star transformation may be considered as an alternati!e if any of the following holds true" $n order to get the best possible performance for star Gueries it is important to follow some basic guidelines" n 1 bitmap inde' should be built on each of the foreign<key columns of the fact table(s*. n ;he initialization parameter +;1R_;R1N+=8R%1;$8N_&N1-@&9 should be set to ;R#&. ;his enables an important optimizer feature for star<GueriesO it is set to =1@+& by default for backwards<compatibility. n ;he cost<based optimizer should be used. .;his does not apply solely to star schemas" all data warehouses should always use the cost<based optimizer/. If u have data coming from diff/ Sources what transformation will u use in your designer$ Ioiner ;ransformation What is a reusa&le transf// What is a mapplet / Explain diff/ 'et them Reusable tranformation"< if u want to create tranfor that perform common tasks such as a!g sal in a dept %applet"< $s a reusuable ob>ect that represents a set of transformations How many transformations are there. +tandard and reusable +tandard"< 1ggregator e'pression e'ternal procedure ad!anced e'ternal procedure filter >oiner lookup normalizer seGuence source Gualifer stored procedure update strategy 7an u stop a session in a concurrent &atch$

61 $f the session u want to stop is part of a batch u must stop the batch to stop the session. $f the session is part of a batch that is nested in a series of batches u must stop the outermost batch to stop the session What happens when u use the delete or update or re"ect or insert statement in ur update strategy$ $nserts"< treats all records as inserts while inserting if the record !iolates primary foreign key or foreign key in the database it re>ects the record What are varia&les and where do u use it$ #se !ariable to simplify comple' e'pressions temporarily store date store !alues from prior rows compare !alues capture multiple return !alues from a stored procedure store the results of an unconnected lookup What is conn/ *nd unconn/ ookup$ 1 connected lookup transformation is part of the mapping flow . ?ith connected lookup !alues u can ha!e multiple return !alues +upport 9efault !alues #nconnected lookup transformation e'ists separate from the data flow in the mapping # write an e'pr using "lkp reference Gualifier to call the lookup within another transformation #ses"< testing the results of a lookup in a transformation =iltering records based on the lookup results 2alling the same lookup multiple times in one mapping ;o update slowly changing dimensions 9oes not +upport 9efault !alues 7an u access a repository created in previous version of Informatica$ What are the diff/ports availa&le$ $nput 7ort "< ?hich recei!es data (;arget* 8utput 7ort"< ?hich pro!ides data (+ource* $nputJ8utput 7ort"< ?hich pass data through them (%applets* Hariable 7ort "< #sed to store components of e'pressions What happens if the info/ Server doesnFt find the session parameter in the parameter file$ How do run session from command line$ pmcmd What are diff/ Things u can do using !#7#% $ # can stop or abort a session What is pmrep command Have u created parallel sessions How do u create parallel sessions : # can impro!e performace by creating a concurrent batch to run se!eral sessions in parallel on one informatic ser!er if u ha!e se!eral independent sessions using separate sources and separate mapping to populate diff targets u can place them in a concurrent batch and run them at the same time if u ha!e a comple' mapping with multiple sources u can separate the mapping into se!eral simpler mappings with separate sources. +imilarly if u ha!e session performing a minimal no of transformations on large amounts of data like mo!ing flat files to staging area u can separate the session into multiple sessions and run them concurrently in a batch cutting the total run time dramatically

62

What is dyna!i) insert What are diff types of )a)he diff between persistent and stati) )a)he +tatic cache "< Read only will be deleted after use 7ersistent cache "< Reused sa!ed on the ser!er What are session parameters $ How do u set them $ When do u use mapping parameters $ HIn which transformations* What is a parameter When and where do u them when does the value will &e created %esing time) run time / If u donFt create parameter what will happen How do you do error handling in Informatica $ &rror handling is !ery primiti!e. @og files can be generated which contain error details and code. ;he error code can be checked from troubleshooting guide and correcti!e action taken.;he log file can be increased by gi!ing appropriate tracing le!el in the session properties. 1lso we can gi!e that one +ession can stop after 1 2 or n number of errors. How do you implement configuration management in Informatica$ ;here are se!eral methods to do this .+ome of them are "< ;aking a back up of the repository as a binary file and treat it as a configurable item. $mplement =older Hersioning utility in $nformatica. Scenario +,/ 1 mapping contains +ource ;able +_;ime ( +tart_3ear &nd_3ear * ;arget ;able ;ime_9im ( 9ate 9ay %onth 3ear Cuarter * +tored procedure transformation " 1 procedure has two input parameters $_+tart_3ear $_&nd_3ear and output parameter as 8_9ate 9ay %onth 3ear Cuarter. $f this session is running how many rows will be a!ailable in the target and why :. 8nly one row the last date of the &nd_3ear. 1ll the subseGuent rows are o!erriding the pre!ious rows. What is the difference &etween lookup cache and lookup index/ @ook up 2ache contains $nde' cache and data cache $nde' cache"2ontains columns used in condition 9ata cache" "2ontains other output columns than the condition columns. %iscuss two approaches for updation of target ta&le in informatica and how they are different. #pdate strategy transformation" ?e can write our own code .$t is fle'ible. Normal insert J update Jdelete (with proper !ariation of the update option* " $t can be configured in the +ession properties. 1ny change in the row will cause an update.$nfle'ible. How do you handle performance issues in Informatica/Where can you monitor the performance ;here are se!eral aspects to the performance handling .+ome of them are "<

66 +ource tuning ;arget tuning Repository tuning +ession performance tuning $ncremental 2hange identification in source side. +oftware hardware(#se multiple ser!ers* and network tuning. -ulk @oading #se the appropriate transformation. ;o monitor this +et performance detail criteria &nable performance monitoring %onitor session at runtime WJ or 2heck the performance monitor file . What is a suggested method for validating fields - marking them with errors$/ 8ne of the successful methods is to create an e'pression ob>ect which contains !ariables.T 8ne !ariable per port that is to be checked.T +et the error XflagY for that field then at the bottom of the e'pression trap each of the error fields.T =rom this port you can choose to set flags based on each indi!idual error which occurred or feed them out as a combination of concatenated field names M to be inserted in to the database as an error row in an error tracking table. Where is the cache Hlookup) indexC created and how can you see it/ ;he cache is created in the ser!er.+ome default memory is allocated for it. 8nce that memory is e'ceeded than these files can be seen in the 2ache directory in the +e!er not before that. When do you use S. override in ook up Transformation. #se +Cl o!erride when you ha!e more than one look up table ;o use where condition to reduce records in cache. Explain how Iconstraint &ased load orderingI works$/ 2onstraint based load ordering in 7ower%art J 7ower2enter works like this" it controls the order in which the target tables are committed to a relational database. $t is of no use when sending information to a flat file. ;o construct the proper constraint order" links between the ;1RD&; tables in $nformatica need to be constructed. +imply turning on Sconstraint based load orderingS has no effect on the operation itself. $nformatica does N8; read constraints from the database when this switch is turned on. 1gain to take ad!antage of this switch you must construct primary J foreign key relationships in the ;1RD&; ;1-@&+ in the designer of $nformatica. 2reating primary J foreign key relationships is difficult < you are only allowed to link a single port (field* to a single table as a primary J foreign key. What is the difference &etween !ower mart and !ower 7entre/ !ower 7enter , has all the functionality . distributed metadata(repository*. global repository and can register multiple $nformatica ser!ers. 8ne can share metadata across repositories. 2an connect to Haried sources like 7eoplesoft +17 etc. Has bridges which can transport meta data from opther tools (like &rwin* 2ost around 255K #+ 0.

6A !ower #art G Su&set of !ower centre. 8ne repository and can register only one $nformatica ser!er. 2annot connect to Haried sources like 7eoplesoft +17 etc 2ost around B5K #+ 0. What is the difference &etween (racle Se6uence and Informatica Se6uence and which is &etter . 8racle seGuence can be used in a 7lJ+Gl stored procedure which in turn can be used with stored procedure transformation of $nformatica. $nformatica seGuence is generated through seGuence generator transformation of $nformatica. $t depends upon the user needs but 8racle seGuence pro!ides greater control. How do you execute a set of S6l commands &efore running as session and after completion of session in Informatica/Explain/ +Gl commands can be put in stored procedures. ;wo #nconnected +tored procedure ;ransformations are created pointing to respecti!e procedures one pre session other post session. ?hen the +ession is run these two procedures are e'ecuted before the session and after the session. How can you utili8e 7(# components in Informatica. -y writing 2N H- H2NN code in &'ternal +tored 7rocedure ;ransformation What is an indicator file and how it can &e used/ $ndicator file is used for &!ent -ased +cheduling when you donKt know when the +ource 9ata is a!ailaible. 1 shell command script or a batch file creates and send this indicator file to the directory local to the $nformatica +er!er.+er!er waits for the indicator file to appear before running the session. What persistent cache$ When it should &e used/ ?hen @ookup cache is sa!ed in @ook up ;ransformation $t is called persistent cache. ;he first time session runs it is sa!ed on the disk and utilized in subseGuent running of the +ession. $t is used when the look up table is +tatic i.e doesKnt change freGuently What is Incremental *ggregation and how it should &e used / $f the source changes only incrementally and you can capture changes you can configure the session to process only those changes. ;his allows the $nformatica +er!er to update your target incrementally rather than forcing it to process the entire source and recalculate the same calculations each time you run the session. ;herefore only use incremental aggregation if" 3our mapping includes an aggregate function. ;he source changes only incrementally. 3ou can capture incremental changes. 3ou might do this by filtering source data by timestamp. -efore implementing incremental aggregation consider the following issues" ?hether it is appropriate for the session ?hat to do before enabling incremental aggregation ?hen to reinitialize the aggregate caches Scenario +,$nformatica +er!er and 2lient are in different machines. 3ou run a session from the ser!er manager by specifying the source and target databases. $t displays an error. 3ou are confident that e!erything is correct. ;hen why it is displaying the error:

6B ;he connect strings for source and target databases are not configured on the ?orkstation conatining the ser!er though they may be on the client mJc. Informatica 9uration " 1 Hr. 1. 2. 6. A. B. %a'. %arks " 155 (2* (2*

?here e'actly the sources and target information stored : $nformatica Repository ;ables ?hat is the difference between power mart and power centre. &laborate. 7ower center is a global repository ?hat are !ariable ports and list two situations when they can be used: (2*

E. ?hat are the parts of $nformatica +er!er: (2* F. How does the ser!er recognise the source and target databases. &laborate on this. (2* P. @ist the transformation used for the following" (15* a* Heterogeneous +ources b* Homogeneous +ources c* =ind the B highest paid employees within a dept. d* 2reate a +ummary table e* Denerate surrogate keys Q. What is the difference &etween se6uential &atch and concurrent &atch and which is recommended and why $ (2* 15. 9esigner is used for ____________________ (1* 11. Repository %anager is used for __________________ (1* 12. +er!er %anager is used for ______________________ (1* 16. +er!er is used for _____________________________ (1* 1A. 1 session +_%171 is in Repository 1. ?hile running the session error message has displayed Z ser!er hot<ws2F5 is connect to Repository - Z. ?hat does it mean : (2* 1B. How do you do error handling in $nformatica : (2* 1E. How do you implement scheduling in $nformatica : (2* 1F. ?hat is the meaning of upgradation of repository : (2* 1P. How can you run a session without using ser!er manager : (2* 1Q. What is indicator file and where it is used : (2* <5/ What are pre and post session stored procedures $ Write a suita&le example/ H<C 21. 2onsider two cases " 1. 7ower 2enter +er!er and 2lient on the same machine 2. 7ower 2enter +e!er and 2lient on the different machines ?hat is the basic difference in these two setups and which is recommended: (2* 22. $nformatica +er!er and 2lient are in different machines. 3ou run a session from the ser!er manager by specifying the source and target databases. $t displays an error. 3ou are confident that e!erything is correct. ;hen why it is displaying the error: (2* 26. ?hen you connect to repository for the first time it asks you for user name W password of repository and database both. -ut subseGuent times it asks only repository password. ?hy : (2* 2A. ?hat is the difference between normal and bulk loading.

6E ?hich one is recommended: (2* 2B. ?hat is a test load : (2* 2E. ?hat is an incremental aggregation and when it should be implemented: (2* 2F. How can you use an 8racle seGuences in $nformatica: 3ou ha!e an $nformatica seGuence generator transformation also. ?hich one is better to use: (2* 2P. ?hat is the difference between a shortcut of an ob>ect and copy of an ob>ect: 2ompare them. (2* 2Q. ?hat is mapplet and a reusable transformation: (2* =5/ How do you implement configuration management in Informatica$ H=C 61. ?hat are -usiness 2omponents in $nformatica: (2* 62. 9imension 8b>ect created in 8racle can be imported in 9esigner ( ;J =* (1* 66. 2ubes contain measures ( ; J = * (1* 6A. 28% components can be used in $nformatica ( ; J = * (1* 6B. @ookup is an 1cti!e ;ransformation (;J=* (1* =4/ What is the advantage of persistent cache$ When it should &e used/ H3C 6F. ?hen will you use +C@ o!erride in a lookup transformation: (2* 6P. ;wo different admin users created for repository are ______ and_______ (1* 6Q. ;wo 9efault #ser groups created in the repository are ____ and ______ (1* A5. 1 mapping contains +ource ;able +_;ime ( +tart_3ear &nd_3ear * ;arget ;able ;im_9im ( 9ate 9ay %onth 3ear Cuarter * +tored procedure transformation " 1 procedure has two input parameters $_+tart_3ear $_&nd_3ear and output parameter as 8_9ate 9ay %onth 3ear Cuarter. $f this session is running how many rows will be a!ailable in the target and why : (B* 6Q. ;wo +ources +1 +2 containing measures %1 %2 %6 A 9imensions 91 92 96 9A 1 =act =1 containing measures %1 %2 %6 and 9imension +urrogate keys K1 K2 K6 KA (a* ?rite a +C@ statement to populate =act table =1 (b* 9esign a mapping in $nformatica for loading of =act table =1. (B* A5. ?hat is the difference between connected lookup and unconnected lookup. (2* A1. ?hat is the difference between datacahe and linde' cahe. (2* A2. ?hen should one create a lookup transformation: (2* 0=/ How do you handle performance issues in Informatica/Where can you monitor the performance $ H=C AA. @ist and 9iscuss two approaches for updation of target table in informatica and how they are different. (6* AB. 3ou ha!e created a lookup tansformation for a certain condition which if true returns multiple rows .?hen you go to the target and see only one row has come and not all. ?hy is it so and how it can be corrected. (2* AE. ?here are the log files generally stored. 2an you change the path of the file. ?hat can the path be: (2* AF. ?here is the cache (lookup inde'* created and how can you see it. (2*

6F

You might also like