by Noel Yuhanna
with Gene Leganza, Daniel Weber, and Peggy Dostie
March 24, 2020 | Updated: March 27, 2020
›› Data virtualization that can quickly and easily integrate data from silos. Data virtualization
creates a data abstraction layer by connecting, gathering, and transforming data from various
sources to support real-time dashboards and insights (see Figure 1). This technology evolved
primarily because of limitations in the traditional batch-oriented ETL technology that failed to move
data quickly to support the growing demand for real-time analytics. With data virtualization, you
can directly access transactional and operational systems and perform real-time integration and
transformations to support BI and analytical needs.1 The primary use cases for data virtualization
are real-time reporting, ad hoc queries, and search across disparate data sources.
›› Data fabric that can support end-to-end data management to enable more use cases.
Data fabric focuses on addressing broader business use cases such as customer 360, customer
intelligence, and IoT analytics. It includes many more components such as data catalog, data
transformation, data preparation, data discovery, data governance, and data modeling, thus
providing the ability to support end-to-end data management capabilities (see Figure 2).2 Forrester
often encounters configurations that include data virtualization as a data source to the data fabric,
enabling firms to leverage the best of both worlds (see Figure 3). Unlike data virtualization, data
fabric architecture is still evolving, largely because the stack components in the fabric need to work
together to support the business use cases. Forrester finds that organizations often
evolve their information architectures by starting with data virtualization and then transitioning
toward a data fabric architecture over time.
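The data abstraction layer described for data virtualization can be illustrated with a small sketch. It is not any vendor's implementation: it assumes two in-memory SQLite databases stand in for separate silos, and the names `crm`, `sales`, `customer_revenue`, `build_virtual_layer`, and `query_revenue` are hypothetical. A temporary view joins and transforms the data at query time, so no data is copied into a separate store.

```python
import sqlite3


def build_virtual_layer() -> sqlite3.Connection:
    """Attach two stand-in silos and expose one virtual view over them."""
    conn = sqlite3.connect(":memory:")
    # Each ATTACH of ':memory:' creates a separate in-memory database (a "silo").
    conn.execute("ATTACH DATABASE ':memory:' AS crm")
    conn.execute("ATTACH DATABASE ':memory:' AS sales")
    conn.execute("CREATE TABLE crm.customers (id INTEGER PRIMARY KEY, name TEXT)")
    conn.execute("CREATE TABLE sales.orders (customer_id INTEGER, amount_cents INTEGER)")
    conn.executemany(
        "INSERT INTO crm.customers VALUES (?, ?)", [(1, "Acme"), (2, "Globex")]
    )
    conn.executemany(
        "INSERT INTO sales.orders VALUES (?, ?)", [(1, 12000), (1, 3000), (2, 4550)]
    )
    # A TEMP view may reference tables in any attached database. It stores only
    # the query, so the join and the cents-to-units transformation run on read.
    conn.execute(
        """
        CREATE TEMP VIEW customer_revenue AS
        SELECT c.name AS customer, SUM(o.amount_cents) / 100.0 AS revenue
        FROM crm.customers AS c
        JOIN sales.orders AS o ON o.customer_id = c.id
        GROUP BY c.id, c.name
        """
    )
    return conn


def query_revenue(conn: sqlite3.Connection) -> list:
    """Query the virtual view as a dashboard or ad hoc report would."""
    return conn.execute(
        "SELECT customer, revenue FROM customer_revenue ORDER BY customer"
    ).fetchall()


if __name__ == "__main__":
    layer = build_virtual_layer()
    print(query_revenue(layer))
    # A new row in the source system is visible on the next query, with no batch load.
    layer.execute("INSERT INTO sales.orders VALUES (2, 450)")
    print(query_revenue(layer))
```

Because the view holds only a query definition, consumers always see the current state of the underlying sources, which is the property that separates this approach from batch-oriented ETL.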
[Figure 1: labels recovered from the image: interface/protocol (ODBC/JDBC/REST/API); data virtualization; cache; metadata; data security, governance, monitoring; data transformation, data integration; connectors, adapters.]
© 2020 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. 2
Citations@forrester.com or +1 866-367-7378
For Enterprise Architecture Professionals March 24, 2020 | Updated: March 27, 2020
Data Virtualization Or Data Fabric: Which Is Right For You?
[Figure 2: labels recovered from the image: data discovery; data modeling, preparation, curation, graph engine; data orchestration (transformation, integration, cleansing); data processing/persistence (data platform: Hadoop, Spark, NoSQL, data lake, EDW/BDW); data ingestion/streaming (ingestion, streaming, data movement); data sources (cloud, on-premises); AI/ML at each layer; data management (metadata/catalog, data security, data governance, data processing, data quality, data lineage, policies).]
[Figure 3: labels recovered from the image: data fabric (distributed in-memory, catalog, AI/ML, data pipeline); data lake/object store (repositories); push-down processing; data lake; object store; data virtualization (DV), a data abstraction layer that connects, gathers, and transforms data.]
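The configuration in which data virtualization acts as one data source to the data fabric can be sketched in a few lines. This is only an illustration under stated assumptions: `FabricCatalog`, `virtual_customers`, `lake_events`, and `build_customer_360` are hypothetical names, and plain Python functions stand in for a real virtualization layer and a real data lake.

```python
from typing import Any, Callable, Dict, Iterable, List, Tuple

Row = Dict[str, Any]
Source = Callable[[], Iterable[Row]]


class FabricCatalog:
    """Hypothetical catalog that maps dataset names to a source and its kind."""

    def __init__(self) -> None:
        self._entries: Dict[str, Tuple[str, Source]] = {}

    def register(self, name: str, kind: str, source: Source) -> None:
        self._entries[name] = (kind, source)

    def read(self, name: str) -> List[Row]:
        _kind, source = self._entries[name]
        return list(source())

    def kinds(self) -> Dict[str, str]:
        return {name: kind for name, (kind, _source) in self._entries.items()}


def virtual_customers() -> Iterable[Row]:
    # Stand-in for a query against a data virtualization layer (assumption).
    return [{"customer_id": 1, "name": "Acme"}, {"customer_id": 2, "name": "Globex"}]


def lake_events() -> Iterable[Row]:
    # Stand-in for records read from a data lake or object store (assumption).
    return [
        {"customer_id": 1, "event": "login"},
        {"customer_id": 1, "event": "purchase"},
        {"customer_id": 2, "event": "login"},
    ]


def build_customer_360(catalog: FabricCatalog) -> Dict[str, List[str]]:
    """Toy pipeline step: combine both catalog entries into one profile per customer."""
    names = {row["customer_id"]: row["name"] for row in catalog.read("customers")}
    profile: Dict[str, List[str]] = {name: [] for name in names.values()}
    for event in catalog.read("events"):
        profile[names[event["customer_id"]]].append(event["event"])
    return profile


catalog = FabricCatalog()
catalog.register("customers", "data virtualization", virtual_customers)
catalog.register("events", "data lake", lake_events)

if __name__ == "__main__":
    print(catalog.kinds())
    print(build_customer_360(catalog))
```

The pipeline step only asks the catalog for a dataset by name, so the virtualization layer can be swapped for another source later without changing the code that consumes it.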
Choose Between Data Virtualization And Data Fabric, Depending On The Use Case
While data virtualization and data fabric provide some similar benefits, the technologies — and the effort
needed to implement them — are quite different. To choose between them, consider these key factors:
›› Time-to-value. Organizations often find data virtualization to be the fastest way to integrate disparate
data sources, whether on-premises or in the cloud. Data virtualization offers many connectors to various
data sources, and can transform and curate data for visualization, reports, and dashboards. On the
other hand, success with data fabric requires more elaborate planning; a team composed
of enterprise architects, data architects, developers, data security professionals, and business
analysts; and an initial target business use case.
›› Availability of skilled data architects and data engineers. Today, most large and complex data
fabric deployments are done by SI/consulting organizations, primarily because several components
within a fabric need to work together to deliver an outcome. However, with intelligent and highly
automated data fabrics emerging, the need for consulting services will decline in the coming years.3
›› Target use cases and the target future state architecture. The key differences between data
virtualization and data fabric lie largely in the use cases and the end-to-end data management
capabilities that the fabric provides (see Figure 4). Data virtualization can serve as a data source
to the data fabric, enabling organizations to extend their platforms easily. Prioritize your architecture
build-out based on business need. For example, Forrester recommends leveraging data
virtualization in one of the “enhance/harden/re-architect” BI governance process steps as either a
target architecture or a transition point to a data-fabric-based architecture.4
›› Investment needed to get started. Data virtualization is a low-cost integration solution that
does not require huge investments, especially when the need is simple, such as federating and
transforming a few data sources to deliver real-time or near-real-time insights. Organizations often
find that data fabric takes significantly longer to deliver ROI than data virtualization.
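The two quantitative factors above, time-to-value and initial cost, can be turned into a rough shortlist check. The sketch below uses the ranges that Figure 4 reports for a simple use case. Converting months to weeks at 4 weeks per month is an assumption, as are the names `Option`, `OPTIONS`, and `shortlist`. Costs are in thousands, in whatever currency the report intends. Use-case fit and staff skills are not modeled, so this illustrates the arithmetic rather than a selection method.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Option:
    name: str
    weeks_min: int
    weeks_max: int
    cost_min_k: int
    cost_max_k: int


# Ranges for a simple use case as reported in Figure 4.
# Assumption: the data fabric's "1-3 months" is converted at 4 weeks per month.
OPTIONS = (
    Option("data virtualization", weeks_min=1, weeks_max=4, cost_min_k=50, cost_max_k=100),
    Option("data fabric", weeks_min=4, weeks_max=12, cost_min_k=200, cost_max_k=400),
)


def shortlist(budget_k: int, deadline_weeks: int) -> List[str]:
    """Return options whose upper-bound cost and duration fit the constraints.

    Using the upper bounds is a deliberately conservative choice (assumption).
    """
    return [
        option.name
        for option in OPTIONS
        if option.cost_max_k <= budget_k and option.weeks_max <= deadline_weeks
    ]


if __name__ == "__main__":
    print(shortlist(budget_k=150, deadline_weeks=6))
    print(shortlist(budget_k=500, deadline_weeks=16))
```

A team with a budget of 150K and six weeks would see only data virtualization on the shortlist, which matches the pattern of starting with virtualization and moving toward a fabric later.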
Business functionality | Description | Data virtualization | Data fabric
Time-to-value | The time it takes to deploy a simple use case from start to finish | 1-4 weeks for simple use case | 1-3 months for simple use case
Initial cost | Initial investment to support a simple use case | Typically 50K-100K | Typically 200K-400K
Development functionality | Description | Data virtualization | Data fabric
Data pipeline (integrated) | The platform supports data pipeline for ingesting data and also performs transformations | Limited | Yes
Data modeling | Data modeling to model business data, and prebuilt data models | Limited | Yes
Data catalog | Support for data catalog within the platform | No | Yes
Data type | Type of data supported such as structured, unstructured, or semi-structured | Mostly structured data | All kinds of data types
API access | Offers the ability to access data using various API programming interfaces | Limited | Comprehensive
Deployment functionality | Description | Data virtualization | Data fabric
Data persistence | Can persist data from various sources when needed | Limited | Yes
Endnotes
1 Data virtualization enables real-time integration of disparate data sources. See the Forrester report “Create A
2 Data fabric is the orchestration of disparate data sources intelligently and securely in a self-service manner. See the
3 With AI/machine learning (ML) functionality, big data fabric enables a higher degree of automation to support
advanced data intelligence for new insights and simplified data sharing across users. Data fabric also learns from
data and automatically identifies data patterns and connected data to support more adaptive intelligence, driving
accelerated actionable insights as well as automatic recommendations and alerts. See the Forrester report “Big Data
Fabric 2.0 Drives Data Democratization.”
4 See the Forrester report “Divide (BI Governance From Data Governance) And Conquer.”