SlideShare a Scribd company logo
I’m building a data
mesh, so I don't need
data virtualization
WEBINAR SERIES
SPEAKERS
Paul Moxon
SVP Data Architectures & Chief
Evangelist, Denodo
3
MYTH #7:
I’m building a data
mesh, so I don't need
data virtualization
AGENDA
1. What is a Data Mesh?
2. Origins of the Myth
3. Just the Facts, Ma’am
4. The Proof is in the Pudding
5. Summary and Conclusions
6. Q&A
7. Next Steps
What is a Data Mesh?
6
Traditional Approach to Data Management
Illustration: Zhamak Dehghani, Data Mesh Principles and Logical Architecture, December 2020
7
Traditional Approach to Data Management
8
The ‘Knowledge Gap’ Challenge
Domain experts who
understand the domain
data
Domain experts who
understand the domain
processes
Tool experts who
understand how to
develop data pipelines
Illustration: Zhamak Dehghani, How to Move Beyond a Monolithic Data Lake to a Distributed Data Mesh, May 2019
9
What is a Data Mesh?
• The Data Mesh is a new organizational paradigm for
data management
• It moves from a centralized data infrastructure
managed by a single team to a distributed
organizational paradigm
• Several autonomous units (domains) are in charge
of managing and exposing their own “Data
Products” to the rest of the organization
• Data Products should be easily discoverable,
understandable and accessible to the rest of the
organization
10
Data Mesh – 1. Domain based organization
• Organizational units (domains) are responsible for
managing and exposing their own data
• Domains understand better how the data they own
should be processed and used
• Gives them autonomy to use the best tools to deal with
their data, and to evolve them when needed
• Results in shorter and fewer iterations until business
needs are met
11
Data Mesh – 2. Data as a Product
• To ensure that domains do not become isolated data
silos, the data exposed by the different domains
must be:
• Easily discoverable
• Understandable
• Secured
• Usable by other domains
• The processes and pipelines to generate the product
(e.g. cleansing and deduplication) are internal
implementation details and hidden to consumers
12
Data Mesh – 3. Self-serve Data Platform
• Self-Serve: while operated by a global data
infrastructure team, it allows the domains to create and
manage the data products themselves
• The platform should be able to automate or simplify
tasks such as:
• Data integration and transformation
• Security policies and identity management
• Exposure of data APIs
• Publish and document in a global catalog
13
• Data products created by the different domains need
to interoperate with each other and be combined to
solve new needs
• e.g. to be joined, aggregated, correlated, etc.
• This requires agreement about the semantics of
common entities (e.g. customer, product), about the
formats of field types (e.g. SINs, entity identifiers,...),
about addressability of data APIs, etc.
• Managed globally and, when possible, automatically
enforced
• Security must be enforced globally according to the
applicable regulations and policies
Data Mesh – 4. Federated Computational Governance
14
TRADITIONAL ATTRIBUTE DATA MESH
Centralized ORGANIZATION
Decentralized – Domain
Ownership
Monolithic & Physical ARCHITECTURE Distributed & Logical
Top-Down Design OPERATIONALLY
Federated Computational
Governance
Data is physically collected MINDSET
Data Products – Connect
and Share
Analytical separate from
Operational
INFRASTRUCTURE Integrated
How is it different from existing data delivery models?
Origins of the Myth
16
Nascent Technologies = Confusion!
• The fine details are not clearly defined
• Lots of vendors jump on to the bandwagon
• Original definition is distorted to fit existing product
capabilities
“I’m perhaps writing this book a bit earlier than I would have
liked. We are still in the early years of a fundamentally different
approach in sharing and creating data for analytical and machine
learning (ML) use cases. But our industry has the tendency to
percolate new concepts and buzzwords beyond recognition.”
Zhamak Dehghani, Data Mesh – Delivering Data-Driven Value at Scale, O’Reilly
17
Nascent Technologies = Confusion (Cont’d)
Image: Sven Balnojan, There’s More Than One Kind of Data Mesh — Three Types of Data Meshes, Towards Data Science
Just the Facts, Ma’am
19
TRADITIONAL ATTRIBUTE DATA MESH
Centralized ORGANIZATION
Decentralized – Domain
Ownership
Monolithic & Physical ARCHITECTURE Distributed & Logical
Top-Down Design OPERATIONALLY
Federated Computational
Governance
Data is physically collected MINDSET
Data Products – Connect
and Share
Analytical separate from
Operational
INFRASTRUCTURE Integrated
How is it different from existing data delivery models?
20
Centralized vs. Decentralized Data Mesh
DATA MESH ‘Centralized’ Data Mesh ‘Decentralized’ Data Mesh
Decentralized – Domain
Ownership ✓ ✓
Distributed & Logical  ✓
Federated Computational
Governance ? ✓
Data Products – Connect
and Share ✓ ✓
Integrated ✓ ✓
21
Centralized Data Mesh – Still ‘Business As Usual’?
Illustration: Zhamak Dehghani, Data Mesh Principles and Logical Architecture, December 2020
The Proof is in
the Pudding
23
Data Mesh + Data Virtualization: Decentralized Organization – Centralized Control
Domains create
virtual models in
separate
schemas.
Execution servers
can be scaled
independently
1
Domains can
choose and
autonomously
evolve their data
sources
2
Domains can share
standardized
definitions.
Products can be
used to define other
products
3
A central team can
set guidelines and
enforce global
security, quality and
governance policies
in the virtual models
4
24
Data Mesh + Data Virtualization – Distributed Topology
25
Other Data Mesh Reference Architectures
Image: Data Mesh: Just another buzzword or the next-generation enterprise data platform, PwC Germany, May 2022
26
1. Reusability: Data Virtualization platforms include strong capabilities to create and manage rich,
layered semantic layers which foster reuse and expose data to each type of consumer in the form
most suitable for them
2. Polyglot consumption: Data Virtualization allows data consumers to access data using any
technology, not only SQL. For instance, self-describing REST, GraphQL and OData APIs can be
created with a single click. Multidimensional access based on MDX is also possible
3. Top-down modelling: you can create ‘interface data views’ which set ‘schema contracts’ which
developers of data products need to comply with
• This helps to implement the concept of federated computational governance
4. Data marketplace: Ready-to-use data catalog which can act as a data marketplace for the data
products created by the different domains
Additional Benefits of a Data Virtualization approach to Data Mesh
Summary &
Conclusions
28
Summary and Conclusions
• Data Mesh is a new paradigm for data management and
analytics
• It’s getting a lot of interest for diverse organizations
• How to implement a Data Mesh is surrounded by hype and
confusion
• Data Virtualization offers a solid foundation to implement
this new paradigm
• Easy to use for business domain personas
• Can leverage domain infrastructure or direct them
towards a centralize repository
• Simple yet advanced graphical modeling tools to define
new products
• Full governance and security controls
Domain
Domain
Domain
Domain
Data
Platform
Domain
Domain
Domain
Domain
29
MYTH #7:
I’m building a data
mesh, so I don't need
data virtualization
Q&A
31
Get Started Today
Try Denodo with the 30-day free trial
in the cloud marketplaces
CHOICE
Under your cloud account
SUPPORT
Community forum AND remote
sales engineer
OPPORTUNITY
30 minutes free consultation with
Denodo Cloud specialist
denodo.com/free-trials
Thanks!
www.denodo.com info@denodo.com
© Copyright Denodo Technologies. All rights reserved
Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying
and microfilm, without prior the written authorization from Denodo Technologies.

More Related Content

PDF
PDF
Why Data Mesh Needs Data Virtualization (ASEAN)
PDF
Enabling a Data Mesh Architecture with Data Virtualization
PDF
Data Mesh in Action (MEAP V04) Jacek Majchrzak
PDF
Modernizing Integration with Data Virtualization
PDF
Data Mesh 101
DOCX
A Practical Guide to Data Mesh Architecture Blog.docx
PDF
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Why Data Mesh Needs Data Virtualization (ASEAN)
Enabling a Data Mesh Architecture with Data Virtualization
Data Mesh in Action (MEAP V04) Jacek Majchrzak
Modernizing Integration with Data Virtualization
Data Mesh 101
A Practical Guide to Data Mesh Architecture Blog.docx
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop

Similar to Myth Busters VII: I’m building a data mesh, so I don’t need data virtualization (20)

PDF
data-mesh_whitepaper_dec2021.pdf
PPTX
ANIn Pune July 2024 | Bootstrapping Data Mesh for a Complex Enterprise by Bal...
PPTX
Data Mesh using Microsoft Fabric
PDF
Agile Data Management with Enterprise Data Fabric (Middle East)
PDF
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
PPTX
Data Domain-Driven Design
PDF
Agile Data Management with Enterprise Data Fabric (ASEAN)
PPTX
Streamlining Legacy Complexity Through Modernization
PDF
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
PDF
Data Virtualization: An Introduction
PPTX
Data Mesh in Azure using Cloud Scale Analytics (WAF)
PDF
NetApp CTO Predictions 2018
PDF
Implementing Data Mesh WP LTIMindtree White Paper
PDF
pwc-data-mesh.pdf
PDF
The Journey to Data Mesh with Confluent
PDF
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
PDF
Boston Data Engineering: Designing and Implementing Data Mesh at Your Company...
PPTX
Fast Data Strategy Houston Roadshow Presentation
PDF
Five Things to Consider About Data Mesh and Data Governance
PDF
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
data-mesh_whitepaper_dec2021.pdf
ANIn Pune July 2024 | Bootstrapping Data Mesh for a Complex Enterprise by Bal...
Data Mesh using Microsoft Fabric
Agile Data Management with Enterprise Data Fabric (Middle East)
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Data Domain-Driven Design
Agile Data Management with Enterprise Data Fabric (ASEAN)
Streamlining Legacy Complexity Through Modernization
How to Swiftly Operationalize the Data Lake for Advanced Analytics Using a Lo...
Data Virtualization: An Introduction
Data Mesh in Azure using Cloud Scale Analytics (WAF)
NetApp CTO Predictions 2018
Implementing Data Mesh WP LTIMindtree White Paper
pwc-data-mesh.pdf
The Journey to Data Mesh with Confluent
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Boston Data Engineering: Designing and Implementing Data Mesh at Your Company...
Fast Data Strategy Houston Roadshow Presentation
Five Things to Consider About Data Mesh and Data Governance
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Ad

More from Denodo (20)

PDF
Enterprise Monitoring and Auditing in Denodo
PDF
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
PDF
What you need to know about Generative AI and Data Management?
PDF
Mastering Data Compliance in a Dynamic Business Landscape
PDF
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
PDF
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
PDF
Drive Data Privacy Regulatory Compliance
PDF
Знакомство с виртуализацией данных для профессионалов в области данных
PDF
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
PDF
Denodo Partner Connect - Technical Webinar - Ask Me Anything
PDF
Lunch and Learn ANZ: Key Takeaways for 2023!
PDF
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
PDF
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
PDF
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
PDF
How to Build Your Data Marketplace with Data Virtualization?
PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
PDF
Enabling Data Catalog users with advanced usability
PDF
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
PDF
GenAI y el futuro de la gestión de datos: mitos y realidades
Enterprise Monitoring and Auditing in Denodo
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Achieving Self-Service Analytics with a Governed Data Services Layer
What you need to know about Generative AI and Data Management?
Mastering Data Compliance in a Dynamic Business Landscape
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Drive Data Privacy Regulatory Compliance
Знакомство с виртуализацией данных для профессионалов в области данных
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Lunch and Learn ANZ: Key Takeaways for 2023!
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
How to Build Your Data Marketplace with Data Virtualization?
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Enabling Data Catalog users with advanced usability
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
GenAI y el futuro de la gestión de datos: mitos y realidades
Ad

Recently uploaded (20)

PDF
[EN] Industrial Machine Downtime Prediction
PDF
Introduction to Data Science and Data Analysis
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
Business Analytics and business intelligence.pdf
PDF
Fluorescence-microscope_Botany_detailed content
PDF
Introduction to the R Programming Language
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Computer network topology notes for revision
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Supervised vs unsupervised machine learning algorithms
[EN] Industrial Machine Downtime Prediction
Introduction to Data Science and Data Analysis
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Data_Analytics_and_PowerBI_Presentation.pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Business Analytics and business intelligence.pdf
Fluorescence-microscope_Botany_detailed content
Introduction to the R Programming Language
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Mega Projects Data Mega Projects Data
Computer network topology notes for revision
1_Introduction to advance data techniques.pptx
Database Infoormation System (DBIS).pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
.pdf is not working space design for the following data for the following dat...
Acceptance and paychological effects of mandatory extra coach I classes.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Supervised vs unsupervised machine learning algorithms

Myth Busters VII: I’m building a data mesh, so I don’t need data virtualization

  • 1. I’m building a data mesh, so I don't need data virtualization WEBINAR SERIES
  • 2. SPEAKERS Paul Moxon SVP Data Architectures & Chief Evangelist, Denodo
  • 3. 3 MYTH #7: I’m building a data mesh, so I don't need data virtualization
  • 4. AGENDA 1. What is a Data Mesh? 2. Origins of the Myth 3. Just the Facts, Ma’am 4. The Proof is in the Pudding 5. Summary and Conclusions 6. Q&A 7. Next Steps
  • 5. What is a Data Mesh?
  • 6. 6 Traditional Approach to Data Management Illustration: Zhamak Dehghani, Data Mesh Principles and Logical Architecture, December 2020
  • 7. 7 Traditional Approach to Data Management
  • 8. 8 The ‘Knowledge Gap’ Challenge Domain experts who understand the domain data Domain experts who understand the domain processes Tool experts who understand how to develop data pipelines Illustration: Zhamak Dehghani, How to Move Beyond a Monolithic Data Lake to a Distributed Data Mesh, May 2019
  • 9. 9 What is a Data Mesh? • The Data Mesh is a new organizational paradigm for data management • It moves from a centralized data infrastructure managed by a single team to a distributed organizational paradigm • Several autonomous units (domains) are in charge of managing and exposing their own “Data Products” to the rest of the organization • Data Products should be easily discoverable, understandable and accessible to the rest of the organization
  • 10. 10 Data Mesh – 1. Domain based organization • Organizational units (domains) are responsible for managing and exposing their own data • Domains understand better how the data they own should be processed and used • Gives them autonomy to use the best tools to deal with their data, and to evolve them when needed • Results in shorter and fewer iterations until business needs are met
  • 11. 11 Data Mesh – 2. Data as a Product • To ensure that domains do not become isolated data silos, the data exposed by the different domains must be: • Easily discoverable • Understandable • Secured • Usable by other domains • The processes and pipelines to generate the product (e.g. cleansing and deduplication) are internal implementation details and hidden to consumers
  • 12. 12 Data Mesh – 3. Self-serve Data Platform • Self-Serve: while operated by a global data infrastructure team, it allows the domains to create and manage the data products themselves • The platform should be able to automate or simplify tasks such as: • Data integration and transformation • Security policies and identity management • Exposure of data APIs • Publish and document in a global catalog
  • 13. 13 • Data products created by the different domains need to interoperate with each other and be combined to solve new needs • e.g. to be joined, aggregated, correlated, etc. • This requires agreement about the semantics of common entities (e.g. customer, product), about the formats of field types (e.g. SINs, entity identifiers,...), about addressability of data APIs, etc. • Managed globally and, when possible, automatically enforced • Security must be enforced globally according to the applicable regulations and policies Data Mesh – 4. Federated Computational Governance
  • 14. 14 TRADITIONAL ATTRIBUTE DATA MESH Centralized ORGANIZATION Decentralized – Domain Ownership Monolithic & Physical ARCHITECTURE Distributed & Logical Top-Down Design OPERATIONALLY Federated Computational Governance Data is physically collected MINDSET Data Products – Connect and Share Analytical separate from Operational INFRASTRUCTURE Integrated How is it different from existing data delivery models?
  • 16. 16 Nascent Technologies = Confusion! • The fine details are not clearly defined • Lots of vendors jump on to the bandwagon • Original definition is distorted to fit existing product capabilities “I’m perhaps writing this book a bit earlier than I would have liked. We are still in the early years of a fundamentally different approach in sharing and creating data for analytical and machine learning (ML) use cases. But our industry has the tendency to percolate new concepts and buzzwords beyond recognition.” Zhamak Dehghani, Data Mesh – Delivering Data-Driven Value at Scale, O’Reilly
  • 17. 17 Nascent Technologies = Confusion (Cont’d) Image: Sven Balnojan, There’s More Than One Kind of Data Mesh — Three Types of Data Meshes, Towards Data Science
  • 18. Just the Facts, Ma’am
  • 19. 19 TRADITIONAL ATTRIBUTE DATA MESH Centralized ORGANIZATION Decentralized – Domain Ownership Monolithic & Physical ARCHITECTURE Distributed & Logical Top-Down Design OPERATIONALLY Federated Computational Governance Data is physically collected MINDSET Data Products – Connect and Share Analytical separate from Operational INFRASTRUCTURE Integrated How is it different from existing data delivery models?
  • 20. 20 Centralized vs. Decentralized Data Mesh DATA MESH ‘Centralized’ Data Mesh ‘Decentralized’ Data Mesh Decentralized – Domain Ownership ✓ ✓ Distributed & Logical  ✓ Federated Computational Governance ? ✓ Data Products – Connect and Share ✓ ✓ Integrated ✓ ✓
  • 21. 21 Centralized Data Mesh – Still ‘Business As Usual’? Illustration: Zhamak Dehghani, Data Mesh Principles and Logical Architecture, December 2020
  • 22. The Proof is in the Pudding
  • 23. 23 Data Mesh + Data Virtualization: Decentralized Organization – Centralized Control Domains create virtual models in separate schemas. Execution servers can be scaled independently 1 Domains can choose and autonomously evolve their data sources 2 Domains can share standardized definitions. Products can be used to define other products 3 A central team can set guidelines and enforce global security, quality and governance policies in the virtual models 4
  • 24. 24 Data Mesh + Data Virtualization – Distributed Topology
  • 25. 25 Other Data Mesh Reference Architectures Image: Data Mesh: Just another buzzword or the next-generation enterprise data platform, PwC Germany, May 2022
  • 26. 26 1. Reusability: Data Virtualization platforms include strong capabilities to create and manage rich, layered semantic layers which foster reuse and expose data to each type of consumer in the form most suitable for them 2. Polyglot consumption: Data Virtualization allows data consumers to access data using any technology, not only SQL. For instance, self-describing REST, GraphQL and OData APIs can be created with a single click. Multidimensional access based on MDX is also possible 3. Top-down modelling: you can create ‘interface data views’ which set ‘schema contracts’ which developers of data products need to comply with • This helps to implement the concept of federated computational governance 4. Data marketplace: Ready-to-use data catalog which can act as a data marketplace for the data products created by the different domains Additional Benefits of a Data Virtualization approach to Data Mesh
  • 28. 28 Summary and Conclusions • Data Mesh is a new paradigm for data management and analytics • It’s getting a lot of interest for diverse organizations • How to implement a Data Mesh is surrounded by hype and confusion • Data Virtualization offers a solid foundation to implement this new paradigm • Easy to use for business domain personas • Can leverage domain infrastructure or direct them towards a centralize repository • Simple yet advanced graphical modeling tools to define new products • Full governance and security controls Domain Domain Domain Domain Data Platform Domain Domain Domain Domain
  • 29. 29 MYTH #7: I’m building a data mesh, so I don't need data virtualization
  • 30. Q&A
  • 31. 31 Get Started Today Try Denodo with the 30-day free trial in the cloud marketplaces CHOICE Under your cloud account SUPPORT Community forum AND remote sales engineer OPPORTUNITY 30 minutes free consultation with Denodo Cloud specialist denodo.com/free-trials
  • 32. Thanks! www.denodo.com info@denodo.com © Copyright Denodo Technologies. All rights reserved Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm, without prior the written authorization from Denodo Technologies.