You are on page 1of 20

IBM Client Innovation Centre

DATA MIGRATION

Created On : 05.12.2017
Created By : IBM SAP DATA MIGRATION SME
IBM Client Innovation Center

What is data migration?


Data migration is the process of transferring data from one system to another, while changing the storage,
database, or application. In reference to the ETL (Extract-Transform-Load) process, data migration always
requires at least Extract and Load steps.
Typically, data migration occurs during an upgrade of existing hardware or transfer to a completely new
system. Examples include:
Migration to or from hardware platform
Upgrading a database or migrating to new software
Company-mergers when the parallel systems in the two companies need to be merged into one

There are three main options to accomplish data migration:


Merge the systems from the two companies into a brand new one
Migrate one of the systems to the other one
Leave the systems as they are but create a common view on top of them - a data warehouse

2 © Copyright IBM Corporation 2017


IBM Client Innovation Center

Phases in data migration


Every data migration activity would involve a series of phases with defined activities that ensures to identify
the data migration strategy. There is no exception in case of S/4 HANA Data Migration as well.
Main considerations in a typical Data Cycle

3 © Copyright IBM Corporation 2017


0
4
9
/
1
1
IBM Client Innovation Center
/
2
2

Characteristics of a typical data cycle and tools used


Data Data Profiling Data Extraction Data Data Loading Data Quality
Assessment Phase Phase Transformation Phase Maintenance
Phase Phase Phase

System System System


Typical Consulting Consulting Integration In-house COE
Integration Integration
Engagement Engagement Engagement Engagement Engagement Engagement
Engagement
Type

• IBM-specific • IBM • Informatica • Informatica • Informatica • SAP MDG


Typical Tools
templates Information
Used • IBM Infosphere • IBM Infosphere • IBM Infosphere • SAP MDM
Analyser
• SAP BODS • SAP BODS • SAP BODS • SAS Dataflux
• Talend
• BackOffice • BackOffice • BackOffice • Oracle MDM
• Trillium
• Oracle Data • Oracle Data • Oracle Data • Web Methods
• Datiris
Integration Integration Integration One data
• Pervasisve
• Others • Others • Others • Hybris
• SAP
• S/4 HANA Data • S/4 HANA Data • S/4 HANA Data • Tibco
Information
Migration Migration Migration
Steward • SAP BODS
Cockpit Cockpit Cockpit
• SAP HANA EIM
• SAP HANA EIM • SAP HANA EIM • SAP HANA EIM
Services
Services Services Services
• Information
Steward

4 © Copyright IBM Corporation 2017


IBM Client Innovation Center

CONTD….
For this meeting we will consider SAP provided tools which are used in data migration projects.
SAP BODS
SAP LT
SAP HANA SDI / SDQ

SAP provided tool for Data Governance:


SAP Information Steward

5 © Copyright IBM Corporation 2017


IBM Client Innovation Center

SAP BODS(BUSINESS OBJECTS DATA SERVICES)

6 © Copyright IBM Corporation 2017


IBM Client Innovation Center

SAP BODS
• Data migration is a major task and key for a successful SAP implementation. SAP BODS has capabilities
to connect to non-SAP and SAP sources and extraction, cleanse , transform and load data to non-SAP
and SAP targets.
• We can use SAP Data Services to connect to multiple source systems simultaneously , implement
customized transformation logics on data sets classified on various characteristics like Geography ,
Business Unit etc. and load to target system.
• SAP BO Data Services is the preferred tool of data migration, if the volume of data is substantial and more
customized, validation rules and data conversion logics needs to be implemented.

 Role of SAP BODS in Data Migration


• SAP Data Services has various connectors and adapters available to connect to RDBMS, ERP,
Mainframe systems etc.
• SAP Data Services can be used to merge multiple data sets to be loaded to one system as a part of the
data migration
• Highly Customizable to cater to various business requirements or processing requirements making it more
flexible for data migration

7 © Copyright IBM Corporation 2017


IBM Client Innovation Center

Data Migration using SAP BODS

• The legacy data environment can


be SAP or Non-SAP

• Data Staging is usually an


intermediate database where the
source data is staged in raw format
as well as cleansed format before
loading to target

• Loading can be done as flat files or


direct loading to RDBMS and use
SAP loading interfaces like LSMW,
BAPI/RFC, IDoc

• Target Environment can be SAP or


Non-SAP
8 © Copyright IBM Corporation 2017
IBM Client Innovation Center

FEATURES OF SAP BODS


 Connects with most SAP and non-SAP Source systems including major RDBMS, Mainframe , Hadoop,
Google Big Query, PeopleSoft , Oracle Applications etc.
 Access and integrate SAP and non-SAP data sources and targets.
 Easily standardize, correct, and match data to reduce duplicates and identify relationships.
 Transform all types of data, regardless of industry or data domain, and leverage a centralized business
rule repository and object  reuse.
  Meet high-volume needs with support for parallel processing, grid computing, and bulk data loading.
 It supports multi-users.
 It provides high-performance parallel transformations.
 It allows scripting language with rich sets of functions.

9 © Copyright IBM Corporation 2017


IBM Client Innovation Center

SAP SDI/SDQ( SMART DATA INTEGRATION / SMART DATA


QUALITY)

10 © Copyright IBM Corporation 2017


IBM Client Innovation Center

SAP SDI/SDQ
The SAP HANA smart data integration and SAP HANA smart data quality options provide tools to access source
data, and provision, replicate, and transform that data in SAP HANA on-premise or in the cloud.
The smart data integration and smart data quality options let you enhance, cleanse, and transform data to make it more
accurate and useful. These options let you efficiently connect to any source to provision and cleanse data for loading
into SAP HANA on premise or in the cloud, and for supported systems, write back to the original source.
Capabilities include:
A simplified landscape, that is, one environment in which to provision and consume data.
Access to more data formats including an open framework for new data sources.
In-memory performance, which means increased speed and decreased latency.

11 © Copyright IBM Corporation 2017


IBM Client Innovation Center

Data Migration using SAP SDI/SDQ


The native integration capability of
SAP HANA smart data integration is
architected for on premise, cloud or
hybrid deployments, and supports all
styles of data delivery including

Federated
Batch
Real Time (not all data sources)

12 © Copyright IBM Corporation 2017


IBM Client Innovation Center

FEATURES OF SAP SDI / SDQ


 Connects with most SAP and non-SAP Source systems using Data Provisioning Adapters.
 There is no data latency as it provides real time replication for all kind of source systems.
 Robust Error handling capabilities.
 Monitoring and scheduling are well defined.
 ETL Capabilities are well defined with possibilities of custom scripting.
 Navigations are easy to use.
 Performance tuning is much faster as it has its own optimization techniques.
 In-built adapters available to establish connection within HANA

13 © Copyright IBM Corporation 2017


IBM Client Innovation Center

SAP SDI / SDQ : Where it can be used

14 © Copyright IBM Corporation 2017


IBM Client Innovation Center

SAP LT (SAP Landscape Transformation)

15 © Copyright IBM Corporation 2017


IBM Client Innovation Center

What is SAP LT ?

SLT stands for SAP Landscape Transformation Replication Server (SLT) running on the
NetWeaver Platform. SLT is the ideal solution for all HANA customers who need real-time
(and non-real-time) data replication sourcing from SAP ERP or non-SAP systems into HANA

 SAP LT uses trigger based approach. Trigger-based approach has no


measureable performance impact in source system.
 It provides transformation and filtering capability.

 It allows real-time (and scheduled) data replication, replicating only


relevant data into HANA from SAP and non-SAP source systems.
 It is fully integrated with HANA Studio.

 Replication from multiple source systems to one HANA system is


allowed, also from one source system to multiple HANA systems.

16 © Copyright IBM Corporation 2017


IBM Client Innovation Center

SAP LT Architecture Diagram

17 © Copyright IBM Corporation 2017


IBM Client Innovation Center

Features of Sap LT
 Allows real-time (and scheduled) data replication, replicating only relevant
data into HANA
 Ability to migrate data into HANA while replicating data in real-time.
 Unlimited release coverage (from SAP R/3 4.6C onwards)sourcing data
from SAP ERP(and other ABAP based SAP application)
 Handling of cluster and pool tables
 Automatically non-Unicode to Unicode conversion during load/replication.
 Simple and fast set-up of SAP LT Replication Server (initial installation and
configuration in less than 1day) and fully integrated with HANA Studio.

18 © Copyright IBM Corporation 2017


IBM Client Innovation Center

Use Case & Challenges


 Following Are Some Of The Cases Where These Tools Were Used :

SAP BODS SAP LT SAP SDI / SDQ


BIO-RAD UPM ROQUETTE
BNM National Grid (ng)
Schlumberger J&J
Yorkshire Water
Bell Canada
Veolia

Use Case 1 Use Case 2


Masking Fields in Table Table Enhancement and
Cluster Data Population
SLT - Masking SLT - Table
Fields in Table Enhancement and Data Populati

19 © Copyright IBM Corporation 2017


IBM Client Innovation Center

20 © Copyright IBM Corporation 2017

You might also like