You are on page 1of 11

A Data-Led Approach

to Digital Innovation

Authors
Meaghan McKeown
Susan Meldahl
Pankaj Srivastava

Key Contributor
Kelsey Radwick

Contents
Introduction...................................................................................................................................................................................................... 2
Section I: The Data-Led Approach........................................................................................................................................................... 3
Section II. The Modern Data Estate......................................................................................................................................................... 5
Section III. Opportunity for Partners........................................................................................................................................................ 7
Services Opportunity..................................................................................................................................................................................... 7
Monetization Opportunity.......................................................................................................................................................................... 8
Conclusion......................................................................................................................................................................................................... 9
Appendix I: Visualizing the Modern Data Estate..............................................................................................................................10
Appendix II: Managed Services Categories........................................................................................................................................10
Foundational...................................................................................................................................................................................................10
Data Availability.............................................................................................................................................................................................10
Intelligence......................................................................................................................................................................................................11
Experiences......................................................................................................................................................................................................11

© 2020 Microsoft. All rights reserved. Microsoft.com 1


Introduction partners. Thus, considering the data challenges and
opportunities facing customers and partners, this
The fourth industrial revolution is thought piece aims to accomplish three things.
upon us, arriving in the form of
digital transformation. The leaders The first is to propose a data-led approach as
– and laggards – of this new era will fundamental to the digital transformation journey.
in large part be determined by a single factor: the Unlike the application-led approach, which brings
effectiveness of their approach to data management. data along with applications, thereby leading
Leveraging data to build better products and higher to piecemealing of data, a data-led approach
value services as a point of competitive advantage ‘futureproofs’ the business by taking a holistic view of
is nothing new. What is new is the volume, velocity, data across the organization from source to usage. As
and variety of data that has been enabled by such, Section I serves to clearly define what a “data-
cloud computing and the corresponding need for led” approach looks like and how it differs from other
companies to quickly process and make sense of approaches.
their data. The world’s data is expected to reach Once customers understand the value of a data-led
175 zettabytes in 2025 with a 30 percent real-time approach the job of the partner is to help facilitate
consumption rate1. To put that into perspective, the this journey. The second key aim of this thought piece
world’s collective internet usage reached just one is therefore to underscore the core design principles
zettabyte in 20162. of a modern data estate as outlined in Section II.
The companies that can unlock and activate this And finally, the third key aim is to outline the
vast amount of data will see several topline benefits. opportunity for partners to (a) expand the surface
According to recent research from McKinsey, data- area of their engagements with customers and (b)
driven companies are 23 times more likely to acquire introduce new models for monetization. With the era
a customer, six times more likely to retain that of data-powered digitization upon us, the imperative
customer, and 19 times more likely to be profitable for companies to modernize their data estate and
as a result. To this end, leaders are recognizing data bring advanced capabilities to their businesses has
as the new basis of competition and as an underlying amplified. Partners must move quickly to build rich
force behind innovation, productivity, customer service offerings that meet this need. Section III
loyalty, and growth. describes four categories of managed services in the
However, building a ‘modern’ data estate is not realm of data, for partners to leverage as a framework
an easy endeavor and, in some cases, will require a for building out their offers.
complete overhaul of data and business strategy. As
a result, companies are turning to partners for their
expertise and guidance on how to make data a central
tenet of their cloud journey, opening a range of new 1
Seagate: Data Age 2025, The Digitalization of the World
and untapped revenue-generating opportunities for 2
Bernard Marr & Company: How Much Data Is There In The World

© 2020 Microsoft. All rights reserved. Microsoft.com 2


Section I: The Data-Led on data estate modernization. Once this door has
been opened, the data journey can be customized
Approach to fit the needs and appetite of the customer. For
What is the ‘data-led’ approach and some, the data journey will occur separate from
how does it differ from application- their application journey, accelerating the pace of
led approach? data estate modernization. For others, the data
and application journeys will run in parallel – often
Most partners begin customer engagements with a converging to build intelligent tools and processes
‘cloud envisioning session’, a standard methodology that unleash the possibilities of digital innovation (see
for educating customers and helping them make Figure I).
key decisions regarding their move to the cloud. The
envisioning session is typically followed by a cloud When partners implement a data-led approach to
assessment to gather insights on the customer’s customer digital transformation, smaller data projects
application landscape and to identify application that would otherwise be executed in siloes materialize
dependencies which are later used to design a into a connected, holistic approach to modernizing
cloud migration and modernization plan. Typically, the data estate. During the data discovery process,
these plans are heavily pivoted on lift-and-shift partners may initially start with a relatively small
migration, application modernization, and related business challenge. Examples could include financial
managed services. Data is often a secondary thought reporting challenges, data quality issues, security
in this approach, usually piecemealed into different incidents, or a business need such as point-of-sale
application/workload silos without a view of how data consolidation, IoT project integration, improved
to ensure trusted, connected data across the entire efficiencies, reduced costs, and business insights and
application portfolio. visualization. Once value/ROI is proven for the initial
business challenge, some customers will be hungry for
At the same time, partners who have invested radical, large scale data estate transformation. Others
in building AI capabilities (supported by data may take a more phased approach, continuing with
science talent) and are executing AI projects at a smaller use cases and over time, evolving the broader
certain volume (100+/year) are struggling to find data estate.
repeatability and opportunity for automation across
the AI project portfolio. Lack of scalability severely As customers embark on the data estate
limits operating margin expansion. modernization journey, there are two best practices
partners should leverage in order to maximize value
By contrast, a data-led approach is grounded in data creation for the customer while solidifying themselves
discovery and realizing value from the beginning, as critical allies in the digital innovation process.
very quickly opening the door to broader discussions

Data migration

Data modernization
Data Journey

Digital Innovation
Advanced analytics
Data intelligence
(AI/ML)

AND Convergence
Application Journey

Intelligent apps
(cloud native)

Repeatability
Smart apps (infused with pre-built AI)
Scalability
App modernization Margin Expansion
App migration

Foundational Innovation
Strategic value & capabilities

Figure I. Journey to Unlock Digital Innovation

© 2020 Microsoft. All rights reserved. Microsoft.com 3


The first is continuous and iterative design – Operating efficiencies that increase profitability. Three assets
in short, one to four-week sprints based on Agile in particular are key to building a repeatable and
methodology accelerates the discovery process, scalable date estate practice. The first is a solution
proves value early, and validates the investment. In library comprised of algorithms, machine learning/
the early stages, ROI is best demonstrated by focusing analytical models, and data patterns collected
on a single use case and the impact on a set of across customer projects to be leveraged for future
prioritized success indicators. This helps to achieve projects. The second is a continually refreshed
customer buy-in early before scaling the customer centralized knowledge repository containing reference
project to larger parts of the estate. As the process architectures, solution templates, use cases, DevOps
of discovery, agile deployment, and value realization process templates, solution runbooks, deployment
is executed over additional use cases, the value guides, governance repositories, analytics practice
multiplies. The data builds upon itself, creating new libraries, data connector repositories, and frameworks
value and encouraging new sources which in turn for assessments, scripts, and security. And finally,
creates more value. the third key asset is a standardized pre-sales
methodology to showcase the “art of the possible”
The second best practice is repeatability and to potential customers. This methodology should
scalability – Execution across multiple use cases help customers visualize specific use cases and the
enables partners to catalyze value realization for their business value that can be unlocked by modernizing
customers in the process, establishing structure and their data estate. Appendix I is one such visual to help
governance while realizing operational efficiencies customers image a “before and after” state. Invariably,
that increase profitability. Execution across multiple this conversation leads to discussion and identification
use cases enables partners to become increasingly of additional use cases.
programmatic in their engagements with customers,
not only providing valuable structure for customers,
but enabling partners to realize operational

© 2020 Microsoft. All rights reserved. Microsoft.com 4


Section II. The Modern II. Integrate relational data sources with other
unstructured datasets with the use of big data
Data Estate processing technologies.
What are the core design principles of III. Use semantic modeling and powerful visualization
a ‘Modern Data Estate’? tools for simpler data analysis.
A ‘Modern Data Estate’ leverages the power of the
cloud to ingest, process, store, serve, and visualize Figure II illustrates the interaction of these principles.
structured and unstructured data from multiple Data from a variety of unstructured, structured, and
different sources. Generally, the litmus test for a semi-structured sources is published to a virtual
Modern Data Estate is whether it delivers on the lake that users centrally access to generate reports,
following four tenets: (1) reduced costs, (2) improved APIs, and dashboards. To eliminate duplication and
performance, (3) reduced risk, and (4) advanced optimize efficiency, compute is brought to the data
analytics. These are achieved through adherence to lake as opposed to data being extracted, copied
the following fundamental principles: locally, and then compute applied. A layer of shared
I. Establish an enterprise-wide data hub consisting of managed services and analysis tools are also made
a data warehouse for structured data and operational available to maintain the integrity of the estate and
analytics and a data lake or lakes for semi-structured help users. Finally, the entire system is contained
and unstructured data. This data hub becomes within a single security context to ensure data is
the single source of truth for data and facilitates protected and secure.
implementation of data governance, security and
quality.

Data Platform Services


Single Security Context (AAD)

Dashboards Reports Cubes Collaboration &


APIs Modern Workplace

Data Availability Intelligence Experiences Reports

Ingestion for Data Catalogue Merge


Service Reconciliation
Compute Service Service Innovation Lab
Service

Handshaking Control File Data Quality ML/AI Models APIs


Service Service Service (Simulation, etc.) Conversation Agents

ELT
ELT Dashboards
Bring compute
compute data
Publish post

to data lake

Azure Azure
HR Finance Supply Chain Product Sales HDInsight Synapse Databricks
Analytics

Publishers publish to
Compute to Lake pattern
Azure Data Lake (no pull)

Figure II. Data Platform Services

© 2020 Microsoft. All rights reserved. Microsoft.com 5


Figure III brings this framework to life with an example and stored in a centralized, raw storage location
scenario of a Modern Data Estate architecture before being transformed from its source format
in Microsoft Azure. The solution architecture to the format required for analysis. Processes will
demonstrates the ability of a Modern Data Estate vary, such as for streaming data, through which near
to ingest a variety of complex data from multiple real time insights can be unlocked, this data can be
sources, including high velocity data from IoT devices, transformed in the streaming pipelines themselves,
sensors, and other gadgets as well as high variety as opposed to being stored first being transformed.
and high volume data from images, video, audio, Finally, the data is ready to ‘model and serve’,
csv, xml, log, and more. This data is ingested through ultimately powering data-driven applications and
streaming or batch processing and then processed supplying rich data visualizations to users.

Modern Data Estate Example


Single Security Context (AAD)

Data Sources Ingest Store Prep & train Model & serve Business
1 Messages
In-stream Processing
3 User
Stream Analytics Data science & AI
Streaming Event Hubs/
Kafka
3 4
IoT Scenarios
2 Power BI

Analytics
IoT Hub Azure Azure
Raw storage options Machine Learning Databricks

3 Azure
Cosmos DB
Applications
Azure Blob Data Lake
HDInsight Azure Synapse
Storage Store Analytics

1 5
Data engineering 3
4
Batch
processing
2 Cosmos DB SQL 3
Analytics
-DB
-MI Azure SQL Data Azure Analysis
Azure Data Factory -IaaS Warehouse Services

1 Identify Data Sources, Schema, Scale, Volume, Authentication etc 2 Load with either ELT or ETL 3 Transform, Process and Prep data for Model training and Visualization

4 Deploy Model for scoring and inferencing and serve data to down stream applications 5 Create semantic models and Visualization

Figure III. Modern Data Estate Sample Scenario

© 2020 Microsoft. All rights reserved. Microsoft.com 6


Section III. Opportunity for As partners develop their portfolio of offers, there are
four categories of data managed services to consider:
Partners foundational services, data availability services,
intelligence services, and experience services. The
A data-led approach to digital
first, foundational services, are those that transcend
transformation presents two
the underlying data estate to maintain overall health
opportunities for partners: (a)
and integrity while the remaining three categories
increased surface area for professional and managed
build upon each other to enhance the estate. Data
services and (b) new approaches to monetization for
availability is the first of these building blocks, and
improved profitability.
helps to ensure customer data is the quality needed
discoverable, and fit for the intended use case.
Services Opportunity Intelligence builds on data availability and helps
As customers grow to understand the value of the users derive deep and meaningful insights from
modernizing their data estate, the surface area for their data. And finally, the third tier of data services
partners to deploy professional and managed services offer users personalized, interactive, and automated
also expands. Not only is the partner able to build experiences via advanced analytics and powerful BI
and sell net new services, but feedback from partners reporting. Real-time data is consumed seamlessly
indicates that data estate modernization projects are across devices and mediums, empowering users to
“stickier”, quicker to market, and have faster time find hidden stories within their data and make smart,
to value realization compared to heavy enterprise proactive decisions to move their business forward.
line of business applications that may take years to Appendix II provides just a small sampling of relevant
modernize and move to the cloud. managed services across each of these categories.

Project services are an important point of entry for 3


Processing methods will vary based on the type of data, use case and if using
partners. However, supporting customers through ELT or ETL process.
the initial transformation of their data estate is only
part of the opportunity for partners. Rather, managed
services offer a clear path to long-term profitability
through higher margins and increased customer
lifetime value. Further, the current lack of managed
services for data estate management means partners
who move quickly to grow their portfolio will see first-
mover advantages.

© 2020 Microsoft. All rights reserved. Microsoft.com 7


Monetization Opportunity total control over their own environment. If executed
in the partner subscription, the customer will not
When leveraging an application-centric approach, control their own environment, but can tap into
most partners monetize their services by charging a insights and learnings from a multitude of “like” use
certain percentage or tiered fixed fee on top of the cases spanning various customers and industries. By
customer’s cloud consumption. Margins are healthy, adopting the architectural considerations of a Modern
often exceeding 50 percent for applications and Data Estate outlined in the paper and accelerating the
infrastructure. A data-led approach neither replaces customer’s journey to rich data intelligence through
nor diminishes this existing revenue/margin. Rather, managed services, partners will be well-positioned to
a data-led approach expands the surface area for grow the number of subscriptions per customer. By
customer engagement (as discussed above) while investing in both models outlined above, partners will
introducing new monetization opportunities akin to be well positioned to support customers with diverse
IP provider models. This means that rather than a needs and use cases.
percentage or tiered fixed fee approach, partners can
monetize their data services using royalties or a fee- Because this is a relatively novel monetization
per-user/fee-per-instance licensing model. approach for managed service providers, they
may need to retool their commerce and Cloud
Data managed services – especially data availability, Management Platforms to effectively invoice under
intelligence, and experiences – are essentially IP these models. However, the margin realization on
and can be monetized as such. For example, a these services is directly proportional to the value
managed service such as Data Quality Services or a delivered to the customer and far exceeds that of
Merge Service can be executed in a customer’s own infrastructure and application projects/services. For
subscription and licensed to customers for a fee or example, IP services that have been enriched with
executed in a partners own subscription in Azure automation and machine learning can lead to margins
and accessed by customer users/publishers for an exceeding 70 percent.
access fee. The first model provides customers with

© 2020 Microsoft. All rights reserved. Microsoft.com 8


Conclusion II. When defining success with the customer,
always refer back to the Modern Data
The pivot to becoming a data-driven Estate litmus test (i.e. whether the
organization is fundamental for investment delivers on the following
businesses and thus represents an core tenants: reduced costs, improved
enormous opportunity for partners performance, reduced risk, and advanced
to become catalysts for digital analytics).
innovation. To fully capture this opportunity, partners III. To maximize customer value while also
must rethink the way they engage customers on their expanding partner margins, build and
cloud journey by shifting from an application only attach managed service offers across the
approach to an application and data-led approach. four data services catagories: foundational
This means conducting early discovery efforts and services, data availability services,
assessing the data estate as a whole, apart from intelligence services, and experience
any singular application, to ensure repeatability and services.
scalability across current and future customer use
cases that impact business outcomes. For partners These tenets offer a roadmap for partners looking
looking to adopt a data-led approach and thus to become the “go to” trusted advisor for navigating
capitalize on a proposition not yet offered at scale in customers through their data journey. In so doing,
the market, key tenants are: they become the linchpin for companies in a time
where data may very well determine the winners and
I. A data-led approach should be designed losers of the fourth industrial age.
around two best practices: continuous and
iterative design to prove value early and Data managed services fall into one of four categories:
repeatability and scalability to catalyze foundational, data availability, intelligence, and
value realization for customers and experiences.
increase partner profitability.

© 2020 Microsoft. All rights reserved. Microsoft.com 9


Appendix I: Visualizing the Modern Data Estate

Legacy Data Estate Modern Data Estate

Source Staging ETL


Types of
data

Database
Replication—
Oracle, SQL,
Financial etc.
Reporting Sources Reporting Tools
Account
Hierarchy
Data Marts
Customer

Cost Hierarchy Power BI


Order Data Customer Learner/
Invoice Data Success Data Product/
XGL Data - Call Center License Data
HR & Campaign

Account Group
Maps

Product

Gainsight Opportunities Leads


Inbound/ Data Mart Data Mart Data Mart
Call Center Outbound Files

Mobile Apps
Sales

Data Integrator

Appendix II: Managed Services Categories


Foundational Data Availability
Underlying services to maintain the health and Services to ensure customer data operates at the
integrity of the estate, including: highest level of quality and efficiency in line with best
practices.
I. Governance – Set standards and policies that
define data availability, usability, integrity, and I. Data Lake as a Service – Ensure lake
security. consistently conforms to standards such as
II. Security – Minimize risk to business operations schema, data availability schedule, type of
and protect data at both data lake and file format available (csv, parquet, json), and
consumer layers. access protocols (REST, NFS, HDFS).
III. Lineage – Track data’s origins and movement I. Ingest for Compute Service – Support
over time. publication to data lake across multiple fabrics.
IV. Alert/Monitoring – Monitor the implementation II. Handshaking Service – Understand what data is
of each data lake to ensure compatibility with published to the data lake and when.
standards. III. Control File Service - Enables users’ ‘compute’
V. Data Catalog – Make data easy to find to read data files.
by automatically populating to a central IV. Data Catalog Service – Enables discoverability
catalogue. of what data is available in the data lake.
VI. Common Data Model Service – Provide a V. Data Quality Service - Ensure data is fit
shared data language for business and for intended usage in operations, decision
analytics applications. making, and planning.

© 2020 Microsoft. All rights reserved. Microsoft.com 10


Intelligence Experiences
Services to support user ability to derive deep and Services to provide users personalized, interactive, and
meaningful business intelligence. automated experiences.
I. Merge Service – Merge data based on user(s) I. AI/ML – Leverage automation to streamline
configuration. processes and improve data driven decision
II. Reconciliation Service (Recon) – Compare making.
transformed data against original source II. PowerBI as a Service – Make customer data
data to reduce duplication. and avoid risk of actionable through visual analytics.
differing results across users leveraging the III. Azure Virtual Assistant – Deliver a
same data set. conversational assistant personalized to the
III. AI/ML Models – Provide insights to users based user.
on most commonly needed information and
queries.

Special Thanks To Our Reviewers


Moritz Berger, Anil Dwarakanath, Andy Gore, Phil Harvey, Barnaby Jeans, Robert Venable

© 2020 Microsoft. All rights reserved. Microsoft.com 11

You might also like