Professional Documents
Culture Documents
1 Training
IDQ 9.1 Labs
Lab 1 - Content Management Service......................................................................................2
.....................................................................................................................................................5
Lab 2 - New Reference Table Capabilities...............................................................................6
Lab - Content Sets...................................................................................................................!
Lab " - Tags...............................................................................................................................1#
Lab 5 - Matc$ %n$ancements..................................................................................................12
Lab 6 & New %'ception Transform.........................................................................................1
Lab ! - (ata )*alit+ for MS %'cel...........................................................................................15
Lab , - -rofiling Labs ..............................................................................................................16
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
Lab 1 - Content Management Service
Create and Configure CMS
Objective: Configure AddressDoctor options and check AV reference file status from Developer
Steps
Open Administrator Console
Select Action/Ne/Content !anagement Service
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
"ollo #i$ard to create and start the C!S
Open C!S% processes tab
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
&dit AV Options
AV 'icence: S0PCF4MN94L7ZXEZ635NCSM90NZKR0NJUTWA
o Set No (re)'oad to A'' for all t*pes
o Set AV +eference data path to:
C:,-nformatica,./0/1,services,D2Content,-N"A3Content,av,default
+ec*cle C!S Service
+ec*cle D-S Service
Open Developer
o Select #indo / (references
o Select Content Status
o Check Status is displa*ed correctl* 4e5pected vie belo6
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
Lab 2 - New Reference Table Capabilities
Create Managed Reference Table from database
Objective: Create ne reference table using a database source
Steps
Open Anal*st 7ool
Select 8Create Ne +eference 7able9
Select 8Connect to a +elational 7able9
Select 8D237ables9 Connection
Select fname table
Select Column0 as valid value
Save as fname in :our (roject
Create Unmanaged Reference Table from database
Objective: Create ne unmanaged reference table
Steps
Open Anal*st 7ool
Select 8Create Ne +eference 7able9
Select 8Connect to a +elational 7able9
!ake sure 8;nmanaged 7able9 is ticked
Select 8D237ables9 Connection
Select us3states table
Select Column0 as valid value
Save as us3states in :our (roject
End of Exercise
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
Lab - Content Sets
Create and configure new Content Set
Objective: Create ne content set and content set e5pressions
Steps
Open Developer
Select "ile / Ne / Content Set
Create ne Content Set called 8ContentSet3.09 in :our project
Open *our content set
Add a ne
Add ne Character Set:
o Name: Char
o 'abel : C
o +ange: a)$ and A)<
Add a ne +egular &5pression:
o Name: num
o Number of Outputs: 0
o +eg&5: =>1).?@A
Add a ne 7oken Sets 4+eg&56:
o Name: date
o 'abel: date
o +eg&5: =,dB0%CD,/,dB0%CD,/,dBEDA
o Description: matches dates of the form FF/FF/:::: here FF can be 0 or C digits long
and :::: is ala*s E digits long/
Save Content Set
Use a Cotet Set
Parse, Cleanse and Standardize Data
Objective: (repare data source for upload to #arehouse and matching scenarios
Steps
Create Ne !apping: m3process3customer3data
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
Add customer "lat file source from c:,D23DA7A director*
Add (arser G 7oken (arser
o Add -nput (ort contact3name from source
o Create ne strateg*
Name: parse3names
Operation 0:
Operation: (arse ;sing +eference 7able
Name: parse3fname
+eference 7able: fnames 4&nablement3.0 project6
Output: fname% string% CH
Operation C:
Operation: (arse ;sing +eference 7able
Name: parse3sname
+eference 7able: usa3surnames3infa
4-nformatica3D23Content/Dictionaries/North America/;SA6
Output: sname% string% CH
o Add -nput port address0 from source
o Create ne strateg*
Name: parse3housenum
Operation 0:
Operation: (arse ;sing 7oken Set
Select +egular &5pression
o Choose +eg&5 8num9 from ContentSet3.0
Create ne output house3num
o +un data vieer and e5amine *our results
Add 'abeler
o Add -nput (ort addressI
o Create Ne Strateg*
Name: label3state
!ode: 7oken
Operation: 'abel ith +eference 7able
+eference 7able: us3states 4:our project6
'abel: state?
Add 'abeler 4 or use e5iting one6
o Add -nput (ort cust3start3date from source
o Create ne strateg*
Name: lbl3date
!ode: 7oken
Operation 0:
Operation: 'abel 7okens ith 7oken Set
Name: lbl3date
Select 7oken Set 8date9 from ContentSet3.0
o Add -nput (ort currenc* from source
o Create ne strateg*
Name: lbl3currenc*
!ode: 7oken
Operation 0:
Operation: 'abel 7okens ith reference table
-nformatica3D23Content/dictionaries/general/currenc*3codes3infa
Name: lbl3currenc*
o +un the data vieer and e5amine *our results
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
Should look something like this:
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
Lab ! - Tags
Create and associate new tags - Developer
Objective: Create ne tags and associate to objects in Developer
C!eate Ta" Steps
Open Developer
Open #indo / (references
Select 7ags
Vie 8Out of the Jo59 7ags (These will appear when you install 9.1 accelerators, which this
image does not have)
Create ne tags:
o Customer
o (roduct
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
o Content
Asso#$ate Ta" Steps
Open Developer
Appl* 7ags to Data Sources
o Open Source
o Navigate to 7ags Vie
o Select &dit
o Appl* 7ag
Appl* 7ags to content set e5pressions
o Open Content Set
o Navigate to 7ags Vie
o Select &dit
o Appl* Content 7ag to all elements
Create and associate new tags - nal!st
Objective: Create ne tags and associate to objects in Anal*st
C!eate Ta" Steps
Open Anal*st
Select Actions / Sho 7ags
Create ne tag:
o Order
o Address
Asso#$ate Ta" Steps
Open (rofile3order
4:ou probabl* have not alread* profiled the order table/ Create a data object using the flat file
8order9 in the c:,D23DA7A director* and profile it% columns onl*/6
Appl* 7ags to data columns
o Sho 7ags vie
o Select Address related columns
Appl* Address 7ag
o Select Order related columns
Appl* Order 7ag
o Select Customer name related columns
Appl* Customer 7ag
Ko to project vie
o Select +7! fnames
Appl* Customer and Content 7ags
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
Lab " - Matc# En#ancements
Pre-re" for image
Cop* I /*sp files from C:,-nformatica,./0/1,services,D2Content,-N"A3Content,identit*,
7o a nel* created folder called 8default9 at
C:,-nformatica,./0/1,services,D2Content,-N"A3Content,identit*,default
#e! $en and Matc% nal!sis
Objective: -dentif* potential duplicates
Steps
Create Ne !apping: m3match3customer
Add c:,D23Data,aml3demo3data source
Add Le* Kenerator
o ;se String strateg* on iso3ctr*3code
right click on the ke* generator and Select Anal*se Detail from the menu
o +evie the folloing information:
&stimated processing time
Kroups above the recommended threshold
o &dit the desired throughput value and observe ho estimated processing time changes
o &dit min and ma5 group si$e values
o Select groups above the threshold from the dropdon list and drilldon to the record
level
+e)configure Le* Kenerator to $ip3or3postcode port
+un KroupLe* Anal*sis again and observe the results
Add !atch transform and configure as follos
o "ield !atching 4Single Source6
o &dit Distance on contact3name
o 7hreshold 1/M
Select +untime Anal*sis b* !atch7*pe from the right click menu
o +evie results
Select Output Anal*sis of Clusters from the right click menu
o +evie results (There is a bug in this Beta release so you may get funny results)
+epeat the above steps for a !atch transform configured for -dentit* !atching
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
Lab $ % New Exception Transform
Objective: -dentif* and manuall* correct e5ception records
7hink of this as further don in a mapping here *ou have alread* run data cleansing and address
validation/ :ou have also run a data Nualit* check on the phone field/ No that *ouOve done that% *ou
need to decide hat records pass and hich need manual intervention/
Steps
Create Ne !apping: m3e5ception3records
Add c:,D23data,cleansed3customer source
Add Decision
o Assign a score of M1 to records ith AddressStatus P-ncomplete AddressO or P-nvalid
Address 'ineO and (hone Status P-ncomplete (honeO
o Assign a score of .1 to all remaining records
$% A&&!essStat's( )*#o+p,ete A&&!ess)
o! A&&!essStat's()*-a,$& A&&!ess L$e)
o! P.oeStat's()*#o+p,ete P.oe)
t.e
s#o!e/(60
e,se
s#o!e/(90
e&$%
Add &5ception transform and configure as follos
o Jad +ecords &5ception
o 7able PJAD +&CO+DSO in Staging DJ
o Connect data ports
o Add AddressStatus and (honeStatus ports to 'abels input
o Connect the score port to -nputs QQ Control QQ Score/
o +ecords ith a score beteen E1 and .1 to be revieed manuall*
o Send good records to standard output and bad records to Jad +ecords table
o !ap AddressStatus and (honeStatus ports to respective issues in the priorit* tab
+un the mapping (Data iewer)
Informatica Data Quality 9.1
DATA QUALITY 9.1 Training
o !un the data viewer on the "#ception transform
-n Anal*st% add the nel* created table to the project vie
+evie available filters