SlideShare a Scribd company logo
By Bhaven Chavan
bhaven2001@yahoo.com
6/23/2016
Data Virtualization
6/23/2016
Confidential | 2016
DISCLAIMER
Note: It is understood that the material in this presentation is intended for general information only and should
not be used in relation to any specific application without independent examination and verification of its
applicability and suitability by professionally qualified personnel. Those making use thereof or relying thereon
assume all risk and liability arising from such use or reliance.
Agenda
• High level walkthrough of the Data Virtualization concepts and its possible
utilization:
• What is Data Virtualization?
• Why use Data Virtualization?
• When not to use Data Virtualization?
• What functionality it provides?
• Data Virtualization Overview
• Data Virtualization and Big Data/NoSQL Overview
• *Drawbacks*
• Q&A
6/23/20166/23/2016
Confidential | 2016
What is Data Virtualization?
• Data Virtualization is an umbrella term used to describe any
approach to data management that allows an application to retrieve
and manipulate data without requiring technical details about the
data, such as how it is formatted or where it is physically located.
• Data virtualization is a technique to deliver the data by consuming
many desperate data sources (internal/external) with a simplified,
integrated view of trusted data within enterprise using real-time or
near real time mechanism to achieve the business goals that support
business transactions, analytics, predictive analytics, and other
workloads and pattern.
6/23/20166/23/2016
Confidential | 2016
Why use Data Virtualization?
• Today’s complex world with so much data, business is looking for
instant access to all the complex data irrespective of the location to
meet the immediate market needs with an Agile manner.
• It helps in reducing the cost in data replication and data
consolidation.
• It adds value in Data Governance.
• Improves the Data Quality.
• It reduce data storage required.
6/23/20166/23/2016
Confidential | 2016
When not to use Data Virtualization?
• Data Virtualization is not the solution to every data integration
problems. Such as, persisting need of the data in a warehouse
(UDL/ODS) or data-mart, along with E-T-L or E-L-T is better solution for
specific use case. Sometimes a hybrid solution is the right answer.
6/23/20166/23/2016
Confidential | 2016
What functionality it provides?
• Virtualized Data Access
• It connects to the different data sources and make them accessible from a common data
access point.
• Data Transformation
• It transforms improved data quality and it reformats the source data the way consumer
needs.
• Data Federation
• It combines results set from across the multiple heterogeneous source systems.
• Data Delivery
• It publishes result sets as views and/or data services executed by client application or users
when requested.
6/23/20166/23/2016
Confidential | 2016
Data Virtualization Overview
Data Virtualization Server
OLTP
Databases
Data
Warehouse &
Data Marts
Applications
ASSET JMS SQL
Unstructured
Data
XSLT
ESB
SOAP EXCEL
Big Data
Store
Social
Media
Data
HIVE JSON
Private
Data
External
Data
Prop.
OLTP
Application
Analytics &
Reporting
ODBC/
JDBC/SQL JDBC/SQL
Service API
XML/SOAP REST/JSON
Mobile App Website
XQuery DAX/MDX
Dashboard
6/23/2016
Confidential | 2016
Denodo: Data Virtualization Overview
6/23/2016
Confidential | 2016
Data Virtualization and Big Data/NoSQL
Overview
6/23/20166/23/2016
Confidential | 2016
Data Virtualization and Big Data/NoSQL
• It unleashes the full value of Big Data for
analytics
• It speeds up development on Big Data
sources
• It offers an evolutionary adoption of Big
Data
• It makes Big Data available to everyone
• Higher Big Data ROI
6/23/20166/23/2016
Confidential | 2016
NoSQL as Sand Box
6/23/2016
OLTP
Databases Reporting &
Analytics
SQL SQL SQL
SQL
NoSQL
Data Staging
Area
Data
Warehouse
Data Marts
Data
Virtualization
Server
6/23/2016
Confidential | 2016
NoSQL for Storing Cold Data
6/23/2016
OLTP
Databases
SQL SQL SQL
SQL
NoSQL
Data Staging
Area
Data
Warehouse
Data Marts
Reporting &
Analytics
Data
Virtualization
Server
6/23/2016
Confidential | 2016
NoSQL as Staging Area
6/23/2016
Data
Virtualization
Server
OLTP
Databases
SQL SQL
SQL
NoSQL
Data Staging
Area
Data
Warehouse
Data Marts
Reporting &
Analytics
6/23/2016
Confidential | 2016
NoSQL as Extra Data Warehouse Database
6/23/2016
OLTP
Databases
SQL SQL
SQL
SQL
NoSQL
Data Staging
Area
Data
Warehouse
Data Marts
Reporting &
Analytics
Data
Virtualization
Server
6/23/2016
Confidential | 2016
NoSQL ETL Processing
6/23/2016
Data
Virtualization
Server
Data
Warehouse
Reporting &
Analytics
OLTP
Databases
SQL SQL SQL
SQL
NoSQL
Data Staging
Area
Data Marts
6/23/2016
Confidential | 2016
Drawbacks
• Another/A new DataStore in production to take care of.
• May impact Operational systems response time, particularly if under-
scaled to cope with unanticipated user queries or not tuned early on.
• Does not impose heterogeneous data model, meaning the user has to
interpret the data, unless combined with Data Federation and business
understanding of the data.
• Requires a defined Governance approach to avoid budgeting issues with
the shared services.
• Not suitable for recording the historic snapshots of data. Data
Warehouse is better for this.
• Change management “ is a huge overhead, as any changes need to be
accepted by all applications and users sharing the same virtualization kit.
6/23/20166/23/2016
Confidential | 2016
Q&A
6/23/20166/23/2016
Confidential | 2016
THANK YOU!
6/23/20166/23/2016
Confidential | 2016

More Related Content

PDF
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
PPTX
ironSource Atom BigData Berlin
PDF
Gianluigi Vigano, Senior Architect and Fouad Teban, Regional Presales Manager...
PDF
Denodo Data Virtualization Platform Architecture: Performance (session 2 from...
PDF
Denodo Data Virtualization Platform architecture: Data Discovery and Data Gov...
PPTX
MongoDB at Agilysys: A Case Study
PPTX
Dsc 2021 presentation_radovan_bacovic
PPT
Why Data Virtualization? An Introduction by Denodo
Anne-Sophie Roessler, International Business Developer, Dataiku - "3 ways to ...
ironSource Atom BigData Berlin
Gianluigi Vigano, Senior Architect and Fouad Teban, Regional Presales Manager...
Denodo Data Virtualization Platform Architecture: Performance (session 2 from...
Denodo Data Virtualization Platform architecture: Data Discovery and Data Gov...
MongoDB at Agilysys: A Case Study
Dsc 2021 presentation_radovan_bacovic
Why Data Virtualization? An Introduction by Denodo

What's hot (19)

PDF
ROI in Linking Content to CRM by Applying the Linked Data Stack
PDF
Tim scottkoenverheyenpresentation
PDF
An Introduction to Data Virtualization in 2018
PPTX
3 Ways Tableau Improves Predictive Analytics
PPTX
AzureDay - Introduction Big Data Analytics.
PPTX
Delivering digital transformation and business impact with io t, machine lear...
PDF
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
PDF
Data Virtualization Reference Architectures: Correctly Architecting your Solu...
PDF
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
PDF
FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data
PDF
Cortana Analytics Workshop: Azure Data Catalog
PDF
Presentation by Kasper Kisjes (Rijkswaterstaat) and Christoph Balduck (Data T...
PDF
Why Marketing Should Consider Agile Modern Data Delivery Platform
PDF
Building A Self Service Analytics Platform on Hadoop
PPTX
Zsolt Várnai, Principal Software Engineer at Skyscanner - "The advantages of...
PDF
MongoDB Case Study in Healthcare
PDF
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
PDF
Uwe Seiler, Data Architect and Trainer at codecentric AG - "Hadoop & Germany ...
PDF
Microsoft Big Data
ROI in Linking Content to CRM by Applying the Linked Data Stack
Tim scottkoenverheyenpresentation
An Introduction to Data Virtualization in 2018
3 Ways Tableau Improves Predictive Analytics
AzureDay - Introduction Big Data Analytics.
Delivering digital transformation and business impact with io t, machine lear...
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
Data Virtualization Reference Architectures: Correctly Architecting your Solu...
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
FAME.Q – A Formal approach to Master Quality in Enterprise Linked Data
Cortana Analytics Workshop: Azure Data Catalog
Presentation by Kasper Kisjes (Rijkswaterstaat) and Christoph Balduck (Data T...
Why Marketing Should Consider Agile Modern Data Delivery Platform
Building A Self Service Analytics Platform on Hadoop
Zsolt Várnai, Principal Software Engineer at Skyscanner - "The advantages of...
MongoDB Case Study in Healthcare
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Uwe Seiler, Data Architect and Trainer at codecentric AG - "Hadoop & Germany ...
Microsoft Big Data
Ad

Viewers also liked (18)

PDF
Infografía gaby
PDF
Lookbook-pdf-combined-pages-extra-klein
DOCX
Biodigestor
PPTX
Programas de desarrollo sustentable de Jalisco
PPTX
Event Design Group Highlights
PDF
TargetStateFutureArchitect - DV
DOCX
Sy ti 2016a-alvarez santiago y perez gabriela-guia para slidecasts
PDF
Workable Enteprise Data Governance
PDF
Planejamento e Estudo Preliminar_ API 5
PDF
Enterprise Data Management
DOCX
Guia para slidecasts
PDF
Understanding SOAP and REST basics and differences
PPTX
Compiled testimonials
PPTX
Choice life
PDF
PIOGG_Chapter_two_s
PDF
PIOGG_Chapter_two_s
PPT
A More Interactive Trinity
PDF
Negarkhalaj CV March 2016
Infografía gaby
Lookbook-pdf-combined-pages-extra-klein
Biodigestor
Programas de desarrollo sustentable de Jalisco
Event Design Group Highlights
TargetStateFutureArchitect - DV
Sy ti 2016a-alvarez santiago y perez gabriela-guia para slidecasts
Workable Enteprise Data Governance
Planejamento e Estudo Preliminar_ API 5
Enterprise Data Management
Guia para slidecasts
Understanding SOAP and REST basics and differences
Compiled testimonials
Choice life
PIOGG_Chapter_two_s
PIOGG_Chapter_two_s
A More Interactive Trinity
Negarkhalaj CV March 2016
Ad

Similar to DataVirtulization (20)

PPTX
Big and fast data strategy 2017 jr
PDF
PLOTCON NYC: Interactive Visual Statistics on Massive Datasets
PPTX
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
PDF
Self-Service Analytics with Guard Rails
PDF
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
PPTX
Opportunity: Data, Analytic & Azure
PPTX
SQL Server 2016 - Always On.pptx
PDF
The Shifting Landscape of Data Integration
PDF
Presentación Paco Bermejo - La Noche del Sector Financiero
PDF
Oracle Data Integration - Overview
PDF
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
PDF
Transforming Business in a Digital Era with Big Data and Microsoft
PDF
LinkedInSaxoBankDataWorkbench
PDF
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
PDF
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
PDF
Bridging the Last Mile: Getting Data to the People Who Need It
PDF
SQL Server 2019 Data Virtualization
PDF
Seminaire bigdata23102014
PPTX
SoftServe BI/BigData Workshop in Utah
PDF
Revolution in Business Analytics-Zika Virus Example
Big and fast data strategy 2017 jr
PLOTCON NYC: Interactive Visual Statistics on Massive Datasets
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Self-Service Analytics with Guard Rails
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
Opportunity: Data, Analytic & Azure
SQL Server 2016 - Always On.pptx
The Shifting Landscape of Data Integration
Presentación Paco Bermejo - La Noche del Sector Financiero
Oracle Data Integration - Overview
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Transforming Business in a Digital Era with Big Data and Microsoft
LinkedInSaxoBankDataWorkbench
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
Bridging the Last Mile: Getting Data to the People Who Need It
SQL Server 2019 Data Virtualization
Seminaire bigdata23102014
SoftServe BI/BigData Workshop in Utah
Revolution in Business Analytics-Zika Virus Example

DataVirtulization

  • 1. By Bhaven Chavan bhaven2001@yahoo.com 6/23/2016 Data Virtualization 6/23/2016 Confidential | 2016 DISCLAIMER Note: It is understood that the material in this presentation is intended for general information only and should not be used in relation to any specific application without independent examination and verification of its applicability and suitability by professionally qualified personnel. Those making use thereof or relying thereon assume all risk and liability arising from such use or reliance.
  • 2. Agenda • High level walkthrough of the Data Virtualization concepts and its possible utilization: • What is Data Virtualization? • Why use Data Virtualization? • When not to use Data Virtualization? • What functionality it provides? • Data Virtualization Overview • Data Virtualization and Big Data/NoSQL Overview • *Drawbacks* • Q&A 6/23/20166/23/2016 Confidential | 2016
  • 3. What is Data Virtualization? • Data Virtualization is an umbrella term used to describe any approach to data management that allows an application to retrieve and manipulate data without requiring technical details about the data, such as how it is formatted or where it is physically located. • Data virtualization is a technique to deliver the data by consuming many desperate data sources (internal/external) with a simplified, integrated view of trusted data within enterprise using real-time or near real time mechanism to achieve the business goals that support business transactions, analytics, predictive analytics, and other workloads and pattern. 6/23/20166/23/2016 Confidential | 2016
  • 4. Why use Data Virtualization? • Today’s complex world with so much data, business is looking for instant access to all the complex data irrespective of the location to meet the immediate market needs with an Agile manner. • It helps in reducing the cost in data replication and data consolidation. • It adds value in Data Governance. • Improves the Data Quality. • It reduce data storage required. 6/23/20166/23/2016 Confidential | 2016
  • 5. When not to use Data Virtualization? • Data Virtualization is not the solution to every data integration problems. Such as, persisting need of the data in a warehouse (UDL/ODS) or data-mart, along with E-T-L or E-L-T is better solution for specific use case. Sometimes a hybrid solution is the right answer. 6/23/20166/23/2016 Confidential | 2016
  • 6. What functionality it provides? • Virtualized Data Access • It connects to the different data sources and make them accessible from a common data access point. • Data Transformation • It transforms improved data quality and it reformats the source data the way consumer needs. • Data Federation • It combines results set from across the multiple heterogeneous source systems. • Data Delivery • It publishes result sets as views and/or data services executed by client application or users when requested. 6/23/20166/23/2016 Confidential | 2016
  • 7. Data Virtualization Overview Data Virtualization Server OLTP Databases Data Warehouse & Data Marts Applications ASSET JMS SQL Unstructured Data XSLT ESB SOAP EXCEL Big Data Store Social Media Data HIVE JSON Private Data External Data Prop. OLTP Application Analytics & Reporting ODBC/ JDBC/SQL JDBC/SQL Service API XML/SOAP REST/JSON Mobile App Website XQuery DAX/MDX Dashboard 6/23/2016 Confidential | 2016
  • 8. Denodo: Data Virtualization Overview 6/23/2016 Confidential | 2016
  • 9. Data Virtualization and Big Data/NoSQL Overview 6/23/20166/23/2016 Confidential | 2016
  • 10. Data Virtualization and Big Data/NoSQL • It unleashes the full value of Big Data for analytics • It speeds up development on Big Data sources • It offers an evolutionary adoption of Big Data • It makes Big Data available to everyone • Higher Big Data ROI 6/23/20166/23/2016 Confidential | 2016
  • 11. NoSQL as Sand Box 6/23/2016 OLTP Databases Reporting & Analytics SQL SQL SQL SQL NoSQL Data Staging Area Data Warehouse Data Marts Data Virtualization Server 6/23/2016 Confidential | 2016
  • 12. NoSQL for Storing Cold Data 6/23/2016 OLTP Databases SQL SQL SQL SQL NoSQL Data Staging Area Data Warehouse Data Marts Reporting & Analytics Data Virtualization Server 6/23/2016 Confidential | 2016
  • 13. NoSQL as Staging Area 6/23/2016 Data Virtualization Server OLTP Databases SQL SQL SQL NoSQL Data Staging Area Data Warehouse Data Marts Reporting & Analytics 6/23/2016 Confidential | 2016
  • 14. NoSQL as Extra Data Warehouse Database 6/23/2016 OLTP Databases SQL SQL SQL SQL NoSQL Data Staging Area Data Warehouse Data Marts Reporting & Analytics Data Virtualization Server 6/23/2016 Confidential | 2016
  • 15. NoSQL ETL Processing 6/23/2016 Data Virtualization Server Data Warehouse Reporting & Analytics OLTP Databases SQL SQL SQL SQL NoSQL Data Staging Area Data Marts 6/23/2016 Confidential | 2016
  • 16. Drawbacks • Another/A new DataStore in production to take care of. • May impact Operational systems response time, particularly if under- scaled to cope with unanticipated user queries or not tuned early on. • Does not impose heterogeneous data model, meaning the user has to interpret the data, unless combined with Data Federation and business understanding of the data. • Requires a defined Governance approach to avoid budgeting issues with the shared services. • Not suitable for recording the historic snapshots of data. Data Warehouse is better for this. • Change management “ is a huge overhead, as any changes need to be accepted by all applications and users sharing the same virtualization kit. 6/23/20166/23/2016 Confidential | 2016