
Top 10 Productivity Tips/Tricks with Information Steward and Data Services

Lynne Lintelman, EIM Product Management


July 2014
Agenda

© 2011 SAP AG. All rights reserved. 2


#1 – Website – Community Network

http://wiki.scn.sap.com/wiki/display/EIM/Video+Tutorials



#1 – Website – Community Network
Installation and Configuration tutorials



#1 – Website – Community Network
Product Tutorials – Data Services
SAP Data Services Product Tutorials:
http://scn.sap.com/docs/DOC-54115





#1 – Website – Community Network
Product Tutorials – Information Steward
SAP Information Steward Product Tutorials:
http://scn.sap.com/docs/DOC-8751?rid=/webcontent/uuid/b040a79b-c816-2e10-69a9-d199bf09d23f





#1 – Website – Community Network
Common Questions



#2 Directory Update Assistant (DUA)

https://support.sap.com/software/address-directories.html



#3 Use Content Types for Validation and Cleansing Rules
Content Type Identification

• Profile any input source supported within Data Insight
• SAP applications will have data-driven analysis and metadata analysis performed
• Data-driven analysis suggests the context of a column that contains data for person, firm, and address
• Supports more than 20 content types out of the box
• Ability to create custom content types
• Content type is used to drive data cleansing solutions and can be assigned to a validation rule



#3 Use Content Types for Validation and Cleansing Rules
Data Validation Advisor

Helps quickly create DQ validation rules by automatically suggesting rules based on
• Statistical outlier analysis of column profiling for completeness and distributions

Helps quickly bind DQ validation rules by automatically suggesting bindings based on
• Existing rules with the same value and pattern distributions
• Existing rules with the same content type parameters



#3 Use Content Types for Validation and Cleansing Rules
Example of Proposed Rules
• Rule proposal based on value distribution
– Outlier values in the column are 01, 02 and 03
– Proposed rule checks for the value being 00 to be valid

• Rule proposal based on blank values
– Outlier value is <blank>, found in only two records
– Proposed rule checks for the length of the string to be more than zero to be valid
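
The two proposals above can be pictured with a small sketch. It is not the Data Validation Advisor's algorithm, which the source does not describe. The function names and the 90% dominance threshold are assumptions for illustration.

```python
from collections import Counter


def propose_value_rule(values, min_share=0.9):
    """Return the dominant value if it covers at least min_share of the rows."""
    counts = Counter(values)
    if not counts:
        return None
    value, freq = counts.most_common(1)[0]
    return value if freq / len(values) >= min_share else None


def is_valid_value(value, expected="00"):
    """Value-distribution rule: only the dominant value is valid."""
    return value == expected


def is_not_blank(value):
    """Blank-value rule: the string length must be more than zero."""
    return value is not None and len(value) > 0


# Hypothetical column: mostly "00", with the outliers 01, 02 and 03.
column = ["00"] * 97 + ["01", "02", "03"]
expected = propose_value_rule(column)
failed = [v for v in column if not is_valid_value(v, expected)]
```

Here `failed` collects the outlier rows that the proposed rule would flag for review. Whitespace-only strings pass `is_not_blank` as written, so a real rule might trim first.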



#3 Use Content Types for Validation and Cleansing Rules
Accepting a Rule Proposal
• “Accept” the proposed rule if you think it’s applicable
• Accept it even if it’s only mostly accurate; you can edit it later
• Opens Rule Editor with pre-populated properties
• Edit the properties to be more meaningful
• Add required parameters like Approver
• Edit the rule expression as needed
• For the input parameter, optionally add the Content Type if known. This will be helpful for rule binding suggestions
• Validate, Test, Save, and Close



#3 Use Content Types for Validation and Cleansing Rules
Publishing Validation Rules for Use in Data Services



#4 Scorecards
Monitor quality of data & key information governance metrics

Screenshot callouts:
• Drill into Scorecard Details
• Scorecard to Measure DQ from a Data Steward’s Perspective
• Improve quality of data sources
• Latest Quality Score
• Key Quality Dimensions (KPI for data)
• Data Quality Score Metrics
• Business Value of Data Rules
• Quality Trend and Business Value Trend



#4 Scorecards
Drill into scorecard metrics
Identify root cause or impact
• Drill down to root causes of poor data quality as it exists in the source system
• Quickly isolate data anomalies through preview of failed data

Reduce risk of project delays or cost overruns
• Establish the impact of bad data on critical business analytics and processes, and identify who it affects



#4 Scorecards
Increase productivity by enabling business & IT collaboration

Increase end user productivity

IT can easily share data quality metrics with business users and involve them in owning the data problem

Business users can easily see how their information measures up against information governance rules and standards



#5 Financial Impact
Identify business value opportunities for information assets

Screenshot callouts:
• Total financial cost
• Switch to What-If financial analysis
• Improve quality of data sources
• Cost per business rule
• Financial trend over time



#6 Data Cleansing Advisor

Guides Data Stewards to rapidly develop cleansing and matching rules to improve the quality of their information assets

By automatically suggesting business rules for…
• Parsing and Standardization
To ensure consistent data
• Correction and Enrichment
To ensure correct and complete data
• Matching
To determine which records refer to the same entity

Example input record:
Bob e. oldstead
175 Riviington avenue suite 2
Manhatten, new yourk
800-555-9875

Cleansed record:
First Name: Bob
Middle Name: E.
Last Name: Oldstead
Primary Address: 175 Rivington Ave (corrected from "Riviington Ave")
Secondary Address: Ste 2
City: Manhattan (corrected from "Manhatten")
State: New York (corrected from "Yourk")
Postal Code: 10002
Phone: (800) 555-9875

Match:
First Name: Robert
Last Name: Oldstead
Phone: (800) 555-9875
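
The parsing and standardization step in the example record (the raw name "Bob e. oldstead" and the raw phone number 800-555-9875) can be sketched in a few lines. This is a simplified illustration, not the Data Cleanse transform's logic. The function names are hypothetical, and the sketch assumes 10-digit US phone numbers and names of the form "first [middle] last".

```python
import re


def standardize_phone(raw):
    """Format a 10-digit US phone number as (AAA) PPP-NNNN."""
    digits = re.sub(r"\D", "", raw)
    if len(digits) != 10:
        # Leave anything unexpected untouched rather than guess.
        return raw
    return f"({digits[:3]}) {digits[3:6]}-{digits[6:]}"


def parse_person_name(raw):
    """Split 'first [middle] last' into name parts and fix the casing."""
    parts = [p.capitalize() for p in raw.split()]
    if len(parts) == 3:
        first, middle, last = parts
    elif len(parts) == 2:
        first, last = parts
        middle = ""
    else:
        return None
    return {"First Name": first, "Middle Name": middle, "Last Name": last}
```

Correction and enrichment (fixing "Riviington" or adding the postal code) need reference data such as address directories, so they are out of scope for a sketch like this.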



#6 Data Cleansing Advisor

Guides Data Stewards to rapidly develop cleansing and matching rules to improve the
quality of their information assets
By visualizing cleansing and matching rules with an intuitive, data-driven interface
• Easily adjust standardization options
• View potential duplicates, including near matches
• Adjust match criteria thresholds with what-if analysis



#6 Data Cleansing Advisor
Data Quality Assessment for Party Data

• DCA can identify, cleanse and assess the following party data entities:
• Address, person, firm, title, phone, email, date and SSN

• Drill-down into the details of each entity to discover the impact of the cleanse rules

• Create filters to easily review data issues and understand how the data has changed



#6 Data Cleansing Advisor
Match Statistics and Interactive Graphs



#6 Data Cleansing Advisor
“What-If” Analysis

• Records/Groups affected

• Before/After Statistics and charts

• Summary of change(s)
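
The idea behind a what-if analysis on a match threshold can be sketched as follows: compute matches under the current and the proposed threshold, then report what changed. The similarity measure (`difflib.SequenceMatcher`) and all function names are stand-ins chosen for this example. The source does not say how Data Cleansing Advisor scores matches.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similarity(a, b):
    """Case-insensitive similarity ratio between two strings (0.0 to 1.0)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def matched_pairs(records, threshold):
    """Index pairs of records whose similarity meets the threshold."""
    return {
        (i, j)
        for (i, a), (j, b) in combinations(enumerate(records), 2)
        if similarity(a, b) >= threshold
    }


def what_if(records, current, proposed):
    """Compare matches before and after a threshold change."""
    before = matched_pairs(records, current)
    after = matched_pairs(records, proposed)
    return {
        "pairs_before": len(before),
        "pairs_after": len(after),
        "gained": sorted(after - before),
        "lost": sorted(before - after),
    }


# Hypothetical sample data for the comparison.
records = ["Bob Oldstead", "Robert Oldstead", "Alice Smith"]
result = what_if(records, current=0.9, proposed=0.75)
```

Lowering the threshold in this sample brings the two Oldstead records together as a new match, which is the kind of before/after summary the slide lists.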



#7 Match Review

• Publish DCA solution to Workbench and create an ETL dataflow without any hassle

• Create a match review task within Information Steward within minutes



#7 Match Review
Reviewing Match Groups

• Leverage access to filtered data to find match groups critical to review

• Use the data review UI to understand match impact and modify match strategy

• Tools: Near matches, review indicator, show columns and record comparison



#7 Match Review
Match Review and Best Record Results



#8 Validate Person and Firm Cleansing Package Changes
View Input and Output Records

Validate cleansing package changes to your data
• Ability to see incoming data and add new input data
• View how input data will be parsed and standardized on output



#8 Validate Person and Firm Cleansing Package Changes
Edit Input Record

Confirm changes of sample record

Verify sample data (‘Tick Tock’) is not parsing as a Firm
• Results as expected, matching Data Services Data Cleanse transform’s results
• Data Steward notes the Domain Sequence setting



#8 Validate Person and Firm Cleansing Package Changes
Manage SAP-Supplied Person and Firm Cleansing Package

Data Steward searches the SAP-supplied Person and Firm cleansing package to identify how ‘Tick’ and ‘Tock’ are classified



#8 Validate Person and Firm Cleansing Package Changes
Manage SAP-Supplied Person and Firm Cleansing Package

Manage classifications for ‘Tick’ and ‘Tock’
• Add classification of FIRM_NAME
• Remove NAME_WEAK_FAMILY_NAME classification



#8 Validate Person and Firm Cleansing Package Changes
Validate SAP-Supplied Person and Firm Cleansing Package Changes

Validate tab instantly shows the updated results

Data Steward verifies that ‘Tick Tock’ now parses and standardizes as a Firm name
• Data Steward reviews all sample data
• Data Steward publishes updated results for Data Services developers to use



#9 Best Practices
Data Services Pre-Configured Transforms
Data Services contains pre-configured
Data Quality transforms
• Best Practice defined



#9 Best Practices
Data Services Blueprints
Downloadable blueprint samples (http://scn.sap.com/docs/DOC-8820)

Data Services provides blueprints that include sample data
• Best Practice defined
• A complete end-to-end dataflow



#10 Schedule Jobs
Data Services
Schedule jobs and tasks to run during non-peak times
• Data Services > Management Console
• Information Steward > Central Management Console (CMC)



#10 Schedule Jobs
Data Services
In the Data Services Management Console, define:
• Occurrence
• Date(s)
• Time(s)



#10 Schedule Jobs
Information Steward

Schedule jobs and tasks to run during non-peak times
• Central Management Console (CMC) > Information Steward
• Locate task to schedule





(BONUS) #11 – Enterprise Information Management with SAP

• Understand the big picture of SAP’s enterprise information management offerings
• Explore step-by-step instructions for working with SAP solutions for Information Management
• Learn how to perform the most important tasks with SAP Information Steward, SAP Data Services, SAP Master Data Governance, SAP NetWeaver Information Lifecycle Management, and more
• All royalties donated to Doctors Without Borders

Reviews are in! A consistent Top 10 Best-seller with SAP Press!

Order at http://www.sappress.com
Thank you!

Contact Information:
Lynne Lintelman, Product Manager
lynne.lintelman@sap.com
