Modern data runs in cloud-native environments. Migration of legacy on-premises data to the cloud is becoming necessary because enterprises are facing several critical pain points with their on-premises deployments, including siloed, on-premises data stores that prevent unified rollup of data and analytics, limited technical support staff to manage on-premises data, and scalability constraints resulting from inflexible architectures of on-premises data platforms.

Legacy on-premises data warehouses (DWs) are increasingly showing their age. Over the past decade, enterprises everywhere have been migrating to cloud-based DWs that offer greater scalability, performance, flexibility, and cost-effectiveness. Typically accessed on a fully managed, pay-as-you-go basis, the modern cloud DW can support both structured and unstructured data assets. In addition to supporting conventional ETL-based data integration, the modern cloud DW can also provide real-time and low-latency analytics, automated data onboarding and integration, orchestrated data pipelines, data engineering and preparation, self-service data delivery, embedded analytics models, and other sophisticated features.

This TDWI Checklist shares five best practices for migrating your enterprise DW to the cloud and presents recommendations for aligning your DW migrations with your overall cloud modernization strategies and business imperatives.

Five best practices for migrating your data warehouse to the cloud:

1 Modernize enterprise data warehousing environments around cloud-native principles
2 Modularize enterprise data warehousing assets for maximum reuse
3 Convert legacy enterprise data warehousing environments
4 Automate enterprise migration to modern environments
5 Augment enterprise teamwork of data warehousing professionals to operationalize migrated assets

1 Modernize enterprise data warehousing environments around cloud-native principles

Migrating to a modern cloud-based data platform can accelerate business transformation. It can expand data accessibility; scale data ingestion, storage, processing, and delivery; and help your organization harness new data types for innovative applications.

Migrations involve transferring the capabilities, services, and assets of legacy data platforms partially or entirely to newer, more modern platforms. Usually, the target platform will replace the legacy platform, with the latter being turned off or repurposed after the migration is complete.

• Convergence. Migrate one or more DWs to support unified data governance and deliver a “single version of truth” to all downstream business intelligence, reporting, and other analytics applications.

• Enhancement. Migrate one or more DWs to a fully managed cloud computing environment to reduce infrastructure and operating costs, boost performance and scalability, enable usage-based pricing, and improve reliability and availability.

• Evolution. Migrate one or more DWs to a cloud-based platform with the flexibility to evolve into a larger multicloud, virtualized, mesh, or cloud-to-edge data architecture. The target cloud DW should be able to ingest and store new types of structured, semistructured, and unstructured data; support streaming, low-latency, and real-time data applications; and enable unification of data integration and machine learning pipelines.

Migration is often far more than “lifting and shifting” existing DW system footprints to the new environment. Beyond such essential tasks as copying data from legacy databases and loading it onto the target platform, there are numerous details to iron out before switching over to the target cloud platform. These necessary migration chores often include:

• Revising data schemas

• Rewriting stored procedures and user-defined functions
2 Modularize enterprise data warehousing assets for maximum reuse

Migration projects are smoother and less disruptive if the legacy DW platform’s assets are transferred as modular capabilities to the new modern cloud architecture.

Modularity enables enterprises to start small with well-defined use cases for the cloud DW migration. Organizations can selectively and iteratively migrate specific business domains (along with associated data objects, integration code, ETL mappings, stored procedures, sessions, and other DW and data integration assets) in a fashion that minimizes disruption to the business.

At the very least, you should migrate DW data sets only after they’ve been made correct, consistent, and compliant. To ensure a smoothly modular migration of legacy DW assets, the target cloud DW architecture should conform to these tenets:

• Visibility. The target cloud data environment should support discovery, search, and management of all assets through a searchable data catalog. This catalog should provide a central point of management of all DW and data integration assets.

• Openness. The target DW should present open computing APIs for access, programming, manipulation, monitoring, and management of all DW and data integration functions.

• Containerization. The target environment should enable orchestration of DW capabilities as microservices within a containerized cloud computing fabric.

• Elasticity. The target environment should support elastic provisioning of computing, storage, memory, and network resources to support dynamic changes to DW workloads and usage patterns.

3 Convert legacy enterprise data warehousing environments

Migration may involve conversion of data, integration code, and other artifacts to new formats that can be more efficiently processed by the target platform.

Conversion is one of the central requirements for any enterprise trying to leverage its substantial investment in data, mappings, ETL code, and other assets from legacy DWs. Enterprises may choose to move all or only a portion of legacy data over to their new modern cloud DW. Whatever their overarching business requirements may be, this presents an opportunity to ensure that whatever data is migrated remains trustworthy, clean, consistent, and compliant with all relevant mandates.
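
As a rough illustration of what converting a legacy schema to a new format can involve, the sketch below maps legacy column types to target types. The type names, the `LEGACY_TO_TARGET` table, and the `convert_schema` helper are all hypothetical and do not refer to any specific product; a real migration would use the type systems of the actual source and target platforms.

```python
# Hypothetical mapping from legacy column types to target cloud DW types.
# These names are illustrative only and must be adapted to real platforms.
LEGACY_TO_TARGET = {
    "NUMBER": "NUMERIC",
    "VARCHAR2": "STRING",
    "CHAR": "STRING",
    "DATE": "TIMESTAMP",
    "CLOB": "STRING",
}


def convert_schema(columns):
    """Return (converted, unmapped) for a list of (name, legacy_type) pairs.

    Columns without a known mapping are reported rather than guessed,
    so that a person can review them before the table is migrated.
    """
    converted, unmapped = [], []
    for name, legacy_type in columns:
        # Drop any length/precision suffix, e.g. "NUMBER(10)" -> "NUMBER".
        base_type = legacy_type.split("(")[0].strip().upper()
        target_type = LEGACY_TO_TARGET.get(base_type)
        if target_type is None:
            unmapped.append((name, legacy_type))
        else:
            converted.append((name, target_type))
    return converted, unmapped


legacy_columns = [
    ("customer_id", "NUMBER(10)"),
    ("full_name", "VARCHAR2(200)"),
    ("signup_date", "DATE"),
    ("location", "GEOMETRY"),
]
converted, unmapped = convert_schema(legacy_columns)
print("converted:", converted)
print("needs review:", unmapped)
```

Reporting unmapped columns separately, instead of silently defaulting them to a string type, is one way to keep the converted data trustworthy and consistent.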
conversion of assets, they should validate and test that the assets are being converted and/or rewritten with full fidelity for migration and processing within the target cloud environment.

• Select appropriate ETL transformations

• Anticipate and proactively resolve data quality and noncompliance issues
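
One simple way to test that migrated data retains full fidelity is to compare a row count and an order-independent checksum between the source and target tables. The sketch below is a minimal illustration under stated assumptions: rows are already available as Python tuples, values have been normalized to the same Python types on both sides, and the helper names `table_fingerprint` and `same_content` are hypothetical.

```python
import hashlib


def table_fingerprint(rows):
    """Return (row_count, checksum) for an iterable of row tuples.

    Per-row SHA-256 digests are added modulo 2**256, so the result does
    not depend on row order, and duplicate rows still change the checksum.
    """
    count = 0
    combined = 0
    for row in rows:
        # Join values with a separator that is unlikely to occur in data.
        encoded = "\x1f".join(repr(value) for value in row).encode("utf-8")
        digest = hashlib.sha256(encoded).digest()
        combined = (combined + int.from_bytes(digest, "big")) % (2 ** 256)
        count += 1
    return count, combined


def same_content(source_rows, target_rows):
    """True when both tables have equal row counts and checksums."""
    return table_fingerprint(source_rows) == table_fingerprint(target_rows)


source = [(1, "Ada", "2020-01-01"), (2, "Grace", "2021-06-30")]
target_ok = [(2, "Grace", "2021-06-30"), (1, "Ada", "2020-01-01")]
target_bad = [(1, "Ada", "2020-01-01"), (2, "Grace", "2021-06-03")]

print(same_content(source, target_ok))
print(same_content(source, target_bad))
```

A check like this only flags that a difference exists; locating the offending rows would need a finer-grained comparison, for example per-partition fingerprints.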
Augmentation of the technical staff’s collaborative productivity is a key benefit of the automation features built into DW and data integration migration tools. When used effectively, these tools can deliver such benefits as:

• Accelerated data integration and DW migrations with fewer scheduling challenges

• Greater data integration and DW asset reusability with less conversion effort

• Improved data integration and DW developer and administrator productivity

• Faster data integration and DW code generation with fewer errors

• Higher-quality data integration and DW code generation, testing, and validation with less rework

• Contextual recommendations on monitoring, management, and troubleshooting pipeline issues, as well as profiling, matching, merging, cleansing, exception handling, issue resolution, and other stewardship functions

cloud data platform capable of ingesting and storing new types of structured, semistructured, and unstructured data, and providing a scalable platform for unification of data integration and MLOps pipelines

• Adopt data engineering, validation, security, protection, and governance tools optimized for the target cloud DW

• Make necessary investments in enabling cloud infrastructure for DW modernization, especially AI/ML-automated data engineering pipelines and searchable data catalogs

• Deploy a cloud DW that boosts the density and capacity utilization of CPU, memory, storage, and other hardware on the back-end cloud platforms

• Deliver consumable cloud DW functions at low latency via a consistent, secure, web-native API that can be called from any client application on a pay-as-you-go basis

When undertaking the migration of a legacy on-premises DW to this target cloud DW, your enterprise should:
Product and company names mentioned herein may be trademarks and/or registered trademarks of their respective companies. Inclusion of a vendor, product, or service in TDWI research does not constitute an endorsement by TDWI or its management. Sponsorship of a publication should not be construed as an endorsement of the sponsor organization or validation of its claims.

E info@tdwi.org
tdwi.org