Document Control
Revision History
Technical Design Guideline Document
Table of Contents
1 OVERVIEW
4 SAP DS JOBS
4.1 Job Hierarchy
4.2 DataStores
1 Overview
The purpose of this document is to provide guidelines for the technical design of the ETL
process for object conversion using the Enterprise Hub (EHub) platform. This includes guidelines for:
Strategy for Extract and Transform in EHub
Process orchestration of extract and transform using HANA SDI/SDQ flowgraphs
Information views (Calculation Views) for reporting
Load of data using SAP Data Services Jobs
Data quality and transformation activities are shown in the diagram below. It shows the flow
of information and the enrichment of data at each step of the Extract and Transform process of
the Data Migration activity.
Each of the data layers described below is linked to a schema in EHub:
Landing Layer – This corresponds to the schemas holding data from source systems.
At present, EHub has two schemas for holding data from source systems:
ZSLT_SHE and ZST_ZT1
Staging Layer – This corresponds to schema EHUB_STG. This layer is used for
consolidating data from multiple source systems. Usually, the output of this layer
is a single table.
Alignment Layer – This corresponds to schema EHUB_ALG. This layer is used
for storing data produced by data cleansing, data transformation and matching.
Usually, the outputs of the transforms are:
o Data Cleansing
  Table for storing records that pass the completeness check (valid records)
  Table for storing records that fail the completeness check (invalid records)
o Data Matching
  Table for storing the output of the match process
o Data De-Duplication
  Table for storing the unique records
  Table for storing the duplicate records
Preload Layer – This corresponds to schema EHUB_PLD. This is the layer that
holds data for SAP DS to consume. Usually, the output of data validation is stored in
two tables:
o Table for storing the records that have been validated successfully
o Table for storing the records that have failed validation
Transform Process    Input                      Output                     Schema of Output Table
Data Consolidation   Source Tables              <TableName>_Extract        EHUB_STG
Data Cleansing       <TableName>_Extract        <TableName>_Complete       EHUB_ALG
                                                <TableName>_InComplete     EHUB_ALG
Data Convert         <TableName>_Complete       <TableName>_Transformed    EHUB_ALG
Data Matching        <TableName>_Transformed    <TableName>_Matched        EHUB_ALG
Data De-Dup          <TableName>_Matched        <TableName>_Survived       EHUB_ALG
                                                <TableName>_Duplicates     EHUB_ALG
Data Validation      <TableName>_Survived       <TableName>_PLD            EHUB_PLD
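The chain of inputs and outputs in the table above can be sketched in a few lines of Python. This is only an illustration of the naming pattern, not part of the EHub tooling: the helper name output_tables and the sample table name MARA are assumptions made for the example.

```python
# Illustrative sketch only: STEPS mirrors the table above; the helper
# name `output_tables` is hypothetical and not part of EHub.
STEPS = [
    # (transform process, input suffix, output suffixes, schema of output table)
    ("Data Consolidation", None, ["Extract"], "EHUB_STG"),
    ("Data Cleansing", "Extract", ["Complete", "InComplete"], "EHUB_ALG"),
    ("Data Convert", "Complete", ["Transformed"], "EHUB_ALG"),
    ("Data Matching", "Transformed", ["Matched"], "EHUB_ALG"),
    ("Data De-Dup", "Matched", ["Survived", "Duplicates"], "EHUB_ALG"),
    ("Data Validation", "Survived", ["PLD"], "EHUB_PLD"),
]


def output_tables(table_name):
    """Return (process, input, output, schema) rows for one table name."""
    rows = []
    for process, in_suffix, out_suffixes, schema in STEPS:
        # The first step reads the source tables directly.
        source = "Source Tables" if in_suffix is None else f"{table_name}_{in_suffix}"
        for out_suffix in out_suffixes:
            rows.append((process, source, f"{table_name}_{out_suffix}", schema))
    return rows


if __name__ == "__main__":
    # "MARA" is an assumed sample value for <TableName>.
    for row in output_tables("MARA"):
        print(row)
```

Each step consumes an output of the previous step, so a table name only has to be supplied once to derive every intermediate table and its schema.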
Note –
For the mock project, please create sub-packages under the package assigned to your
HANA user ID (EHUB_TRAINING_USERXX).
No flowgraphs will be created for Data Validation, for the reason mentioned earlier.
3.2.1 Flowgraph
The naming guideline for flowgraphs is shown below:
<ConversionObjectName>_<SourceTable/ViewName>_<DataTransformProcess>
For example:
MaterialMaster_MARA_Extract
MaterialMaster_MARC_Extract
MaterialMaster_MARA_Cleanse
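As a minimal sketch, the flowgraph naming guideline above can be expressed as a small Python helper. The function name flowgraph_name and its parameter names are hypothetical; only the three-part pattern and the example names come from this document.

```python
def flowgraph_name(conversion_object: str, source: str, process: str) -> str:
    """Join the three name parts with underscores, per the guideline above.

    Hypothetical helper for illustration; not an EHub or HANA API.
    """
    parts = (conversion_object, source, process)
    if any(not part.strip() for part in parts):
        raise ValueError("all three name parts are required")
    return "_".join(part.strip() for part in parts)


if __name__ == "__main__":
    # Reproduces the examples listed above.
    print(flowgraph_name("MaterialMaster", "MARA", "Extract"))
    print(flowgraph_name("MaterialMaster", "MARC", "Extract"))
    print(flowgraph_name("MaterialMaster", "MARA", "Cleanse"))
```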
4 SAP DS Jobs
4.2 DataStores
The Data Stores created for the training user in SAP DS in the Training environment are given below:
HANA Data Store: EHUB_SOURCE
S4H Data Store: EHUB_TARGET
Job: JOB_EHUB_<ObjectName>
Workflow/Conditionals: WF_EHUB_<ObjectName>
DataFlow: DF_EHUB_<ObjectName>
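The object name patterns listed above can be derived from a single object name. The Python sketch below is illustrative only: the helper ds_object_names is hypothetical, and using MaterialMaster as the value of <ObjectName> is an assumption for the example.

```python
# Prefixes taken from the patterns above; the dictionary keys are
# labels chosen for this sketch, not SAP DS terminology.
PREFIXES = {
    "job": "JOB_EHUB_",
    "workflow": "WF_EHUB_",
    "dataflow": "DF_EHUB_",
}


def ds_object_names(object_name: str) -> dict:
    """Return the Job, Workflow and DataFlow names for one object name."""
    if not object_name:
        raise ValueError("object_name is required")
    return {kind: prefix + object_name for kind, prefix in PREFIXES.items()}


if __name__ == "__main__":
    # "MaterialMaster" is an assumed sample value for <ObjectName>.
    for kind, name in ds_object_names("MaterialMaster").items():
        print(kind, name)
```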