SlideShare a Scribd company logo
Powering Self Service Business
Intelligence with Data Virtualization
Sean Roberts, Hortonworks
Mark Pritchard, Denodo
January 2017
2 © Hortonworks Inc. 2011 – 2017. All Rights Reserved2 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Sean Roberts
Partner Engineering
3 © Hortonworks Inc. 2011 – 2017. All Rights Reserved3 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Data Doubles
Every Two Years
44ZB By 2020
3 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
4 © Hortonworks Inc. 2011 – 2017. All Rights Reserved4 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
The Old Way
System Centric
Procedural
Hierarchical
Scheduled
Homogeneous
The New Way
User Centric
Agile
Dynamic
Real-Time
Contextual
4 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Data Doubles
Every Two Years
44ZB By 2020
5 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Capture
streaming data
Deliver
perishable insights
Combine
new & old data
Store
data forever
Access
a multi-tenant data lake
Model
with more data
DATA AT RESTDATA IN MOTION
ACTIONABLE
INTELLIGENCE
Perishable Insights Historical Insights
6 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Secure
Real-time
Streaming
Integrated
Hortonworks DataFlow for Data in Motion
Powered by Apache NiFi, Kafka, and Storm
7 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Hortonworks Data Platform for Data at Rest
Powered by Open Enterprise Hadoop
Open
Interoperable
Ready
Central
8 © Hortonworks Inc. 2011 – 2017. All Rights Reserved8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Claims
Optimization
Social
Sentiment
Cohort
Selection
Bill Shock
Physician
Notes
Device
Monitoring
R & D
Quality
Benchmarks
Patient
Experience
Seasonal
Staffing
Net
Promoter
Score
Supply
Chain
Sentiment
Analysis
Patient
Outreach
360°
Patient
View
Patient
Throughput
Customer
Churn
Analysis
STARS
Ratings
Genomics
Remote
Monitoring
Drug
Diversion
Census
Proactive
Maintenance
Preventative
Medicine
Inventory
Medication
Safety
OPEX
Reduction
Lab Notes
Archive
Mainframe
Offloads
Device
Data
Ingest
Rapid
Reporting
Digital
Protection
Data
as a
Service
Fraud
Prevention
Real-time
Decision
Support
INNOVATE
RENOVATE
E X PLO R E O PTIMIZE TR A NS FO R M
ACTIVE
ARCHIVE
ETL
ONBOARD
DATA
ENRICHMENT
DATA
DISCOVERY
SINGLE
VIEW
PREDICTIVE
ANALYTICS
HEALTHCARE
Care-path
Best
Practices
OR
Optimization
HCAHPS
Scores
Staffing
Predictions
Proactive
Outreach
Legacy
System
Data
Imaging
Archive
Historical
Patient
Records
Improved
Drug
Yields
9 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Actionable Intelligence Makes Healthcare Precise and Personal
Patient
Records
Lab Data
Pharmacy
Data
Patient
Locations
Wearables
Intra-Network
Data
Sensor
Data
Claims
Data
Social
Media Physician
Notes
Patient
Satisfaction Data
Clinical
(EMR) Data
SINGLE VIEW OF
PATIENT
REAL-TIME VITAL
SIGN MONITORING
BILLING &
REIMBURSEMENTS
EMR
OPTIMIZATION
SUPPLY CHAIN
OPTIMIZATION
10 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Mercy’s Journey
Mercy Medical System Sought a Data Lake for a Single View of its Patients –
“One Patient, One Record”
Existing platform impeded goal of enriching Epic data for 1 million patients
over 35 Hospitals and 500 clinics
Moving Epic EMR data to Clarity EDW took 24 hours and was “never going
to enable real-time analytics”. Now that takes 3-5 minutes with HDP.
Improved billing processes resulted in $1M additional annual revenue
from newly documented secondary diagnoses and care
11 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Better Health Through “One Patient, One Record”
ACTIVE
ARCHIVE
Lab Notes
DATA
DISCOVERY
OPEX Efficiency
SINGLE
VIEW
Billing
DATA
DISCOVERY
Vital Signs
SINGLE VIEW
Single Patient
Record
ACTIVE
ARCHIVE
Epic EMR
Replication
ACTIVE
ARCHIVE
Privacy
Database
DATA
ENRICHMENT
Epic Enrichment
PREDICTIVE
ANALYTICS
Device Data
Ingest
Move to Clarity wouldn’t
enable real-time analytics
Existing platform
impeded goals
Data enrichment needed
for 1 million patients
Move off Epic
took over 24 hours
SITUATION
3-5
Minutes
$1M Additional
Annual Revenue
From “Never”
to “Seconds”
900x Faster
move data off Epic to
Clarity with HDP
from improved
billing process
accelerated
researcher insight
ingest of ICU
vital signs
PREDICTIVE
ANALYTICS
Preventive Care
ETL OFFLOAD
Medical Decision
Support
12 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Interoperable with Leading Technologies
Partners
13 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Information spread across different
systems
IT responds with point-to-point data
integration
Takes too long to get answers to
business users
The Self-Service Challenge
 Data Is Siloed Across Disparate Systems
MarketingSales ExecutiveSupport
Database
Apps
Warehouse Cloud
Big Data
Documents AppsNo SQL
“Data bottlenecks create business
bottlenecks.”
– Create a Road Map For A Real-time, Agile, Self-Service Data
Platform, Forrester Research, Dec 16, 2015
14 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
Denodo and Modern Data Architecture
Powering Self Service Business
Intelligence with Data Virtualization
Mark Pritchard
January 2017
Agenda1.Data Virtualization and Benefits
2.Case in Point – Vizient
3.Modern Data Architecture
4.Self-Service BI on Distributed Datasets
5.Denodo Platform for Data Virtualization
6.Q&A
Data Virtualization and Benefits
18
-Source: “Gartner Market Guide for data virtualization – 2016”
Data virtualization technology can be used to create virtualized and
integrated views of data in memory (rather than executing data movement
and physically storing integrated views in a target data structure), and
provides a layer of abstraction above the physical implementation of data.”
Data Virtualization – Definition
19
Data Virtualization
Real-time Data Integration
“Data virtualization integrates disparate data sources in real time or near-real time
to meet demands for analytics and transactional data.”
– Create a Road Map For A Real-time, Agile, Self-Service Data Platform, Forrester Research, Dec 16, 2015
Publishes
the data to applications
Combines
related data into views
Connects
to disparate data sources
2
3
1
20
Denodo and Modern Data Architecture
21
Benefits of Data Virtualization
“Get it Real-time and Get it Fast!”
Better Data Integration
Lower integration costs by 80%.
Flexibility to change.
Real-time (on-demand) data services.
Complete Information
Focus on business information needs.
Include web / cloud, big data,
unstructured, streaming.
Bigger volumes, richer/easier access to
data.
Better Business Outcome
Projects in 4-6 weeks.
ROI in <6 months.
Adds new IT and business capabilities
“Benefits of Data Virtualization: get it real-time and get it fast!”
– William McKnight, President, McKnight Consulting Group
Case in Point - Vizient
23
Who is Vizient?
Network for non-profit hospitals and alliance of academic medical centers
Network of not-for-
profit healthcare
organizations to
improve
performance and
efficiency in clinical,
financial and
operational
management
Combination of
VHA, University
HealthSystem
Consortium,
Novation,
MedAssets Spend,
Clinical Resources
Management and
SG2
Experts with
purchasing power,
insights and
connections that
accelerate
performance for
members
24
Purpose, Mission and Strategic Aspirations
Mission
To connect
members with the knowledge, solutions
and expertise that accelerate performance
Strategic Aspirations To become an indispensable partner to
healthcare organizations
Purpose
To ensure
members deliver exceptional, cost
effective care
25
Vizient delivers brilliant, data-driven resources and
insights — from benchmarking and predictive
analytics to cost-savings — to where they’re needed
most.
Empowering Brilliant Connections
Modern Data Architecture
27
Modern Data Architecture
HTTP
FTP
HL7
Flat File
Share
Subscribe
Attach
Initial
Event
Share
Persist
Event
Share
Ongoing
Event
Usage
Purchase
Data
Open
Data
RAW CLEAN AGGREGATED ENRICHED
RDBMS
RDBMS
ODS
DW
STAGESOURCE PROCESS PERSIST SERVE
Rules
Batch
Human
Process
Hadoop
Hadoop
Lake
Machine
Learning
Analytics
Consulting
28
Modern Data Architecture
HTTP
FTP
HL7
Flat File
Share
Subscribe
Attach
Initial
Event
Share
Persist
Event
Share
Purchase
Data
Open
Data
RAW CLEAN AGGREGATED ENRICHED
RDBMS
RDBMS
ODS
DW
STAGESOURCE PROCESS PERSIST
Ongoing
Event
Usage
SERVE
Rules
Batch
Human
Process
Hadoop
Hadoop
Lake
Machine
Learning
Analytics
Consulting
DataVirtualization
Powering Self-Service Discovery with Data
Virtualization
30
Financial Data Mart
Primary Use Case
o Unify disparate accounting and finance data marts across various legacy
organizations into a shared repository
Secondary Use Cases
o Provide a unified source for key BI initiatives like the GPO Dashboard
o Support reporting needs as legacy systems are migrated or replaced during
integration of Vizient and L-MDAS (dbVision, etc.)
o Provide a final resting place for archived legacy sources like Solomon, Epicor,
etc.
VHA
MedAssets
UHC
Vizient
31
Financial Data Mart
Architectural Approach
Denodo was selected as the data platform in order to utilize the following
features of the software:
o Data Virtualization allows sources in various mediums and locations to be
integrated without physically moving the data
o Data Abstraction allows data to be represented consistently within the
DataMart while data sources are moved or replaced behind the scenes
o Data Integration allows for a single seamless view to be created across a
subject area (e.g. “Supplier Sales”) with varied data transformation rules
for each data source within the subject area (PRS, dbVision).
32
GPO Dashboard
Primary Use Case
o Provide a consolidated view of supplier sales data across all customers of
legacy Vizient & Med Assets organizations.
Architectural Approach
o Financial DataMart (on Denodo) for data source
o Denodo TDE Exporter Tool for daily data extracts to Tableau:
 Report Data
 Report User Security
o Tableau for report development and distribution
33
GPO Dashboard
Key Challenges
o Balance between data timeliness and report performance
 Tableau reports performed best utilizing the TDE format (cached/extracted dataset) as
opposed to a live connection
 This meant that the report caches required daily refreshes, and data extraction had to
be appropriately tuned
 Denodo features such as dataset statistics and indexing greatly contributed to this
performance tuning
o Provisioning user security at cell level
 The requirement for some internal report users to be restricted to the
members/customers to which they are assigned meant that a new report security
approach was needed
 Reliance on TDEs for report data necessitated the integration of security in the reporting
layer
 Tableau’s “data blending” feature allows user security to be specified within a separate
dataset
 This also supports reuse of the security view in other reporting environments.
34
Contract Sales Actualizer Dashboard
Primary Use Case
o Integrate Member Spend and Supplier Sales data from all Vizient organizations to identify
opportunities for increasing contract utilization
Other Use Cases
o Maintain consistency (Single Source Of Truth) with GPO dashboard regarding:
 Supplier Sales Data
 Dimension Data
 User Security
Architectural Approach
o Data source utilizes Denodo to reuse overlapping datasets (sales, dimensions, security) while
allowing separate virtualized views to be created for new datasets (member spend) which can
be also be reused by future projects
o Reporting components match approach used by GPO Dashboard
35
Contract Sales Actualizer Dashboard
Key Challenges
o Successful integration of Exadata RDM as a data source for Denodo.
 Approach utilizes the strength of Exadata RDBMS for aggregating large
quantities of data quickly
 Denodo to integrate the data with similar legacy SQL Server data sources to
create a comprehensive view of Vizient member spend
o Scalability/Configuration Management
 Advances were made to support parallel development of this project and
continued efforts on GPO dashboard
 Compartmentalization features within Denodo allow for code changes in each
project to be version controlled and assessed for dependencies
 Process guidelines are being authored to allow for multiple development efforts
on the same datasets
Denodo Platform for Data Virtualization
37
Accelerate Your Fast Data Strategy with Denodo Platform 6.0
New Release of Denodo Platform Delivers Breakthrough Performance, Accelerates
Adoption, and Expedites Business Use of Data
Breakthrough
Performance
Dynamic Query Optimizer
delivers breakthrough performance
for big data, logical data
warehouse, and operational
scenarios.
Data Virtualization
In the Cloud
Denodo Platform for AWS
accelerates adoption of data
virtualization.
Self-service Data
Discovery and Search
Self-service data discovery
and search expedites use of
data by business users.
“Very happy with Denodo version 6. Well done!”
– Claudia Imhoff, President, Intelligent Solutions
38
Common Data Virtualization Use Cases
Data Virtualization
BIG DATA, CLOUD INTEGRATION
 Advanced Analytics
 Data Warehouse Offloading
 Big Data for Enterprise
 Cloud / SaaS Integration
AGILE BUSINESS INTELLIGENCE
 Logical Data Warehouse
 Virtual Data Marts
 Self-Service BI
 Operational BI / Analytics
SINGLE VIEW APPLICATIONS
 Single Customer View - Call Centers, Portals
 Single Product View - Catalogs
 Single Inventory View - Inventory Reconciliation
 Vertical Specific - Single View of Wells
DATA SERVICES
 Unified Data Services Layer
 Logical Data Abstraction
 Agile Application Development
 Linked Data Services
39
Why Denodo Data Virtualization?
Data Virtualization
Exceptional pre-sales service and
customer support
 Technical expertise and response time rated best
by costumers.
 Enhancement requests typically made available in
days / weeks.
Lower overall TCO and ROI  Address more use cases with a single platform.
 Aggressive “all-in-one” pricing for entire platform;
flexible pricing models.
Broad functionality and winning
innovation in a fully integrated
platform
 Flexible and broad use cases.
 Integrated purpose-built for data virtualization –
very easy to develop / deploy / change models.
Q&A
Thanks!
www.denodo.com info@denodo.com
© Copyright Denodo Technologies. All rights reserved
Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical,
including photocopying and microfilm, without prior the written authorization from Denodo Technologies.

More Related Content

PDF
Data Virtualization: From Zero to Hero (Middle East)
PDF
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
PDF
Denodo DataFest 2016: The Role of Data Virtualization in IoT Integration
PDF
Data Virtualization: The Agile Delivery Platform
PDF
Big Data Fabric: A Recipe for Big Data Initiatives
PDF
Denodo DataFest 2017: Conquering the Edge with Data Virtualization
PDF
Maximizing Data Lake ROI with Data Virtualization: A Technical Demonstration
PDF
In Memory Parallel Processing for Big Data Scenarios
Data Virtualization: From Zero to Hero (Middle East)
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Denodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Data Virtualization: The Agile Delivery Platform
Big Data Fabric: A Recipe for Big Data Initiatives
Denodo DataFest 2017: Conquering the Edge with Data Virtualization
Maximizing Data Lake ROI with Data Virtualization: A Technical Demonstration
In Memory Parallel Processing for Big Data Scenarios

What's hot (20)

PDF
Data Virtualization - Enabling Next Generation Analytics
PDF
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
PDF
Secure Your Data with Virtual Data Fabric (ASEAN)
PDF
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
PDF
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
PPTX
Take your Data Management Practice to the Next Level with Denodo 7
PDF
Data Virtualization: From Zero to Hero
PDF
Best Practices: Data Virtualization Perspectives and Best Practices
PDF
Why Data Virtualization? An Introduction
PPTX
Fast Data Strategy Houston Roadshow Presentation
PDF
Denodo DataFest 2017: Business Needs for a Fast Data Strategy
PDF
Minimizing the Complexities of Machine Learning with Data Virtualization
PDF
Agile Data Management with Enterprise Data Fabric (ASEAN)
PDF
Denodo DataFest 2016: Big Data Virtualization in the Cloud
PDF
Unlock Your Data for ML & AI using Data Virtualization
PPTX
Delivering Quality Open Data by Chelsea Ursaner
PDF
Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)
PDF
3 Reasons Data Virtualization Matters in Your Portfolio
PDF
Performance Acceleration: Summaries, Recommendation, MPP and more
PPTX
Data Virtualization: An Introduction
Data Virtualization - Enabling Next Generation Analytics
The Rise of Logical Data Architecture - Breaking the Data Gravity Notion (Mid...
Secure Your Data with Virtual Data Fabric (ASEAN)
Analyst Keynote: Delivering Faster Insights with a Logical Data Fabric in a H...
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
Take your Data Management Practice to the Next Level with Denodo 7
Data Virtualization: From Zero to Hero
Best Practices: Data Virtualization Perspectives and Best Practices
Why Data Virtualization? An Introduction
Fast Data Strategy Houston Roadshow Presentation
Denodo DataFest 2017: Business Needs for a Fast Data Strategy
Minimizing the Complexities of Machine Learning with Data Virtualization
Agile Data Management with Enterprise Data Fabric (ASEAN)
Denodo DataFest 2016: Big Data Virtualization in the Cloud
Unlock Your Data for ML & AI using Data Virtualization
Delivering Quality Open Data by Chelsea Ursaner
Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)
3 Reasons Data Virtualization Matters in Your Portfolio
Performance Acceleration: Summaries, Recommendation, MPP and more
Data Virtualization: An Introduction
Ad

Viewers also liked (20)

PPTX
Scaling self service on Hadoop
PPTX
Self-Service Provisioning and Hadoop Management with Apache Ambari
PDF
Denodo DataFest 2016: ROI Justification in Data Virtualization
PDF
Business Intelligence on Hadoop Benchmark
PPTX
Office 365 Saturday Europe - Self-Service Business Intelligence with Power BI
PPTX
SQL In/On/Around Hadoop
PDF
Data Virtualization Deployments: How to Manage Very Large Deployments
PPTX
Etat de l art business intelligence
PPTX
Self-service BI for SAP and HANA – Dream or Reality?
PDF
Data Virtualization Reference Architectures: Correctly Architecting your Solu...
PDF
Getting Started with Data Virtualization – What problems DV solves
PPT
Ab initio training Ab-initio Architecture
PDF
Teradata - Presentation at Hortonworks Booth - Strata 2014
PDF
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
PDF
Datix Handler Training manual
PDF
Présentation bi 1.0
PPTX
Comparison of MPP Data Warehouse Platforms
PPTX
Hybrid Data Warehouse Hadoop Implementations
PDF
Netezza vs Teradata vs Exadata
PDF
14 Banking Facts to Help You Master the New Digital Economy
Scaling self service on Hadoop
Self-Service Provisioning and Hadoop Management with Apache Ambari
Denodo DataFest 2016: ROI Justification in Data Virtualization
Business Intelligence on Hadoop Benchmark
Office 365 Saturday Europe - Self-Service Business Intelligence with Power BI
SQL In/On/Around Hadoop
Data Virtualization Deployments: How to Manage Very Large Deployments
Etat de l art business intelligence
Self-service BI for SAP and HANA – Dream or Reality?
Data Virtualization Reference Architectures: Correctly Architecting your Solu...
Getting Started with Data Virtualization – What problems DV solves
Ab initio training Ab-initio Architecture
Teradata - Presentation at Hortonworks Booth - Strata 2014
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Datix Handler Training manual
Présentation bi 1.0
Comparison of MPP Data Warehouse Platforms
Hybrid Data Warehouse Hadoop Implementations
Netezza vs Teradata vs Exadata
14 Banking Facts to Help You Master the New Digital Economy
Ad

Similar to Powering Self Service Business Intelligence with Hadoop and Data Virtualization (20)

PPTX
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
PDF
Self-Service Analytics with Guard Rails
PDF
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
PDF
Reinvent Your Data Management Strategy for Successful Digital Transformation
PDF
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
PDF
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
PDF
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
PDF
Data Virtualization: An Introduction
PDF
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
PDF
Data Virtualization: An Introduction
PDF
Data virtualization an introduction
PPTX
HPE and Hortonworks join forces to Deliver Healthcare Transformation
PDF
Introduction to Modern Data Virtualization (US)
PDF
Introduction to Modern Data Virtualization 2021 (APAC)
PDF
Down to Business: Taking Action Quickly with Linked Data Services
PDF
CIO priorities and Data Virtualization: Balancing the Yin and Yang of the IT
PDF
A Key to Real-time Insights in a Post-COVID World (ASEAN)
PDF
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
PDF
Data Virtualization: An Introduction
PDF
Accelerate Cloud Migrations and Architecture with Data Virtualization
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
Self-Service Analytics with Guard Rails
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Reinvent Your Data Management Strategy for Successful Digital Transformation
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Data Virtualization: An Introduction
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
Data Virtualization: An Introduction
Data virtualization an introduction
HPE and Hortonworks join forces to Deliver Healthcare Transformation
Introduction to Modern Data Virtualization (US)
Introduction to Modern Data Virtualization 2021 (APAC)
Down to Business: Taking Action Quickly with Linked Data Services
CIO priorities and Data Virtualization: Balancing the Yin and Yang of the IT
A Key to Real-time Insights in a Post-COVID World (ASEAN)
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Data Virtualization: An Introduction
Accelerate Cloud Migrations and Architecture with Data Virtualization

More from Denodo (20)

PDF
Enterprise Monitoring and Auditing in Denodo
PDF
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
PDF
What you need to know about Generative AI and Data Management?
PDF
Mastering Data Compliance in a Dynamic Business Landscape
PDF
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
PDF
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
PDF
Drive Data Privacy Regulatory Compliance
PDF
Знакомство с виртуализацией данных для профессионалов в области данных
PDF
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
PDF
Denodo Partner Connect - Technical Webinar - Ask Me Anything
PDF
Lunch and Learn ANZ: Key Takeaways for 2023!
PDF
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
PDF
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
PDF
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
PDF
How to Build Your Data Marketplace with Data Virtualization?
PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
PDF
Enabling Data Catalog users with advanced usability
PDF
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
PDF
GenAI y el futuro de la gestión de datos: mitos y realidades
Enterprise Monitoring and Auditing in Denodo
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Achieving Self-Service Analytics with a Governed Data Services Layer
What you need to know about Generative AI and Data Management?
Mastering Data Compliance in a Dynamic Business Landscape
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Drive Data Privacy Regulatory Compliance
Знакомство с виртуализацией данных для профессионалов в области данных
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Lunch and Learn ANZ: Key Takeaways for 2023!
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
How to Build Your Data Marketplace with Data Virtualization?
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Enabling Data Catalog users with advanced usability
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
GenAI y el futuro de la gestión de datos: mitos y realidades

Recently uploaded (20)

PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PDF
Introduction to Business Data Analytics.
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Database Infoormation System (DBIS).pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
Major-Components-ofNKJNNKNKNKNKronment.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
.pdf is not working space design for the following data for the following dat...
PDF
Foundation of Data Science unit number two notes
PDF
Launch Your Data Science Career in Kochi – 2025
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
1_Introduction to advance data techniques.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
Introduction to Business Data Analytics.
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Database Infoormation System (DBIS).pptx
Mega Projects Data Mega Projects Data
Supervised vs unsupervised machine learning algorithms
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Major-Components-ofNKJNNKNKNKNKronment.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
.pdf is not working space design for the following data for the following dat...
Foundation of Data Science unit number two notes
Launch Your Data Science Career in Kochi – 2025
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Introduction-to-Cloud-ComputingFinal.pptx
Fluorescence-microscope_Botany_detailed content
oil_refinery_comprehensive_20250804084928 (1).pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
1_Introduction to advance data techniques.pptx

Powering Self Service Business Intelligence with Hadoop and Data Virtualization

  • 1. Powering Self Service Business Intelligence with Data Virtualization Sean Roberts, Hortonworks Mark Pritchard, Denodo January 2017
  • 2. 2 © Hortonworks Inc. 2011 – 2017. All Rights Reserved2 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Sean Roberts Partner Engineering
  • 3. 3 © Hortonworks Inc. 2011 – 2017. All Rights Reserved3 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Data Doubles Every Two Years 44ZB By 2020 3 © Hortonworks Inc. 2011 – 2017. All Rights Reserved
  • 4. 4 © Hortonworks Inc. 2011 – 2017. All Rights Reserved4 © Hortonworks Inc. 2011 – 2017. All Rights Reserved The Old Way System Centric Procedural Hierarchical Scheduled Homogeneous The New Way User Centric Agile Dynamic Real-Time Contextual 4 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Data Doubles Every Two Years 44ZB By 2020
  • 5. 5 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Capture streaming data Deliver perishable insights Combine new & old data Store data forever Access a multi-tenant data lake Model with more data DATA AT RESTDATA IN MOTION ACTIONABLE INTELLIGENCE Perishable Insights Historical Insights
  • 6. 6 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Secure Real-time Streaming Integrated Hortonworks DataFlow for Data in Motion Powered by Apache NiFi, Kafka, and Storm
  • 7. 7 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Hortonworks Data Platform for Data at Rest Powered by Open Enterprise Hadoop Open Interoperable Ready Central
  • 8. 8 © Hortonworks Inc. 2011 – 2017. All Rights Reserved8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Claims Optimization Social Sentiment Cohort Selection Bill Shock Physician Notes Device Monitoring R & D Quality Benchmarks Patient Experience Seasonal Staffing Net Promoter Score Supply Chain Sentiment Analysis Patient Outreach 360° Patient View Patient Throughput Customer Churn Analysis STARS Ratings Genomics Remote Monitoring Drug Diversion Census Proactive Maintenance Preventative Medicine Inventory Medication Safety OPEX Reduction Lab Notes Archive Mainframe Offloads Device Data Ingest Rapid Reporting Digital Protection Data as a Service Fraud Prevention Real-time Decision Support INNOVATE RENOVATE E X PLO R E O PTIMIZE TR A NS FO R M ACTIVE ARCHIVE ETL ONBOARD DATA ENRICHMENT DATA DISCOVERY SINGLE VIEW PREDICTIVE ANALYTICS HEALTHCARE Care-path Best Practices OR Optimization HCAHPS Scores Staffing Predictions Proactive Outreach Legacy System Data Imaging Archive Historical Patient Records Improved Drug Yields
  • 9. 9 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Actionable Intelligence Makes Healthcare Precise and Personal Patient Records Lab Data Pharmacy Data Patient Locations Wearables Intra-Network Data Sensor Data Claims Data Social Media Physician Notes Patient Satisfaction Data Clinical (EMR) Data SINGLE VIEW OF PATIENT REAL-TIME VITAL SIGN MONITORING BILLING & REIMBURSEMENTS EMR OPTIMIZATION SUPPLY CHAIN OPTIMIZATION
  • 10. 10 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Mercy’s Journey Mercy Medical System Sought a Data Lake for a Single View of its Patients – “One Patient, One Record” Existing platform impeded goal of enriching Epic data for 1 million patients over 35 Hospitals and 500 clinics Moving Epic EMR data to Clarity EDW took 24 hours and was “never going to enable real-time analytics”. Now that takes 3-5 minutes with HDP. Improved billing processes resulted in $1M additional annual revenue from newly documented secondary diagnoses and care
  • 11. 11 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Better Health Through “One Patient, One Record” ACTIVE ARCHIVE Lab Notes DATA DISCOVERY OPEX Efficiency SINGLE VIEW Billing DATA DISCOVERY Vital Signs SINGLE VIEW Single Patient Record ACTIVE ARCHIVE Epic EMR Replication ACTIVE ARCHIVE Privacy Database DATA ENRICHMENT Epic Enrichment PREDICTIVE ANALYTICS Device Data Ingest Move to Clarity wouldn’t enable real-time analytics Existing platform impeded goals Data enrichment needed for 1 million patients Move off Epic took over 24 hours SITUATION 3-5 Minutes $1M Additional Annual Revenue From “Never” to “Seconds” 900x Faster move data off Epic to Clarity with HDP from improved billing process accelerated researcher insight ingest of ICU vital signs PREDICTIVE ANALYTICS Preventive Care ETL OFFLOAD Medical Decision Support
  • 12. 12 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Interoperable with Leading Technologies Partners
  • 13. 13 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Information spread across different systems IT responds with point-to-point data integration Takes too long to get answers to business users The Self-Service Challenge  Data Is Siloed Across Disparate Systems MarketingSales ExecutiveSupport Database Apps Warehouse Cloud Big Data Documents AppsNo SQL “Data bottlenecks create business bottlenecks.” – Create a Road Map For A Real-time, Agile, Self-Service Data Platform, Forrester Research, Dec 16, 2015
  • 14. 14 © Hortonworks Inc. 2011 – 2017. All Rights Reserved Denodo and Modern Data Architecture
  • 15. Powering Self Service Business Intelligence with Data Virtualization Mark Pritchard January 2017
  • 16. Agenda1.Data Virtualization and Benefits 2.Case in Point – Vizient 3.Modern Data Architecture 4.Self-Service BI on Distributed Datasets 5.Denodo Platform for Data Virtualization 6.Q&A
  • 18. 18 -Source: “Gartner Market Guide for data virtualization – 2016” Data virtualization technology can be used to create virtualized and integrated views of data in memory (rather than executing data movement and physically storing integrated views in a target data structure), and provides a layer of abstraction above the physical implementation of data.” Data Virtualization – Definition
  • 19. 19 Data Virtualization Real-time Data Integration “Data virtualization integrates disparate data sources in real time or near-real time to meet demands for analytics and transactional data.” – Create a Road Map For A Real-time, Agile, Self-Service Data Platform, Forrester Research, Dec 16, 2015 Publishes the data to applications Combines related data into views Connects to disparate data sources 2 3 1
  • 20. 20 Denodo and Modern Data Architecture
  • 21. 21 Benefits of Data Virtualization “Get it Real-time and Get it Fast!” Better Data Integration Lower integration costs by 80%. Flexibility to change. Real-time (on-demand) data services. Complete Information Focus on business information needs. Include web / cloud, big data, unstructured, streaming. Bigger volumes, richer/easier access to data. Better Business Outcome Projects in 4-6 weeks. ROI in <6 months. Adds new IT and business capabilities “Benefits of Data Virtualization: get it real-time and get it fast!” – William McKnight, President, McKnight Consulting Group
  • 22. Case in Point - Vizient
  • 23. 23 Who is Vizient? Network for non-profit hospitals and alliance of academic medical centers Network of not-for- profit healthcare organizations to improve performance and efficiency in clinical, financial and operational management Combination of VHA, University HealthSystem Consortium, Novation, MedAssets Spend, Clinical Resources Management and SG2 Experts with purchasing power, insights and connections that accelerate performance for members
  • 24. 24 Purpose, Mission and Strategic Aspirations Mission To connect members with the knowledge, solutions and expertise that accelerate performance Strategic Aspirations To become an indispensable partner to healthcare organizations Purpose To ensure members deliver exceptional, cost effective care
  • 25. 25 Vizient delivers brilliant, data-driven resources and insights — from benchmarking and predictive analytics to cost-savings — to where they’re needed most. Empowering Brilliant Connections
  • 27. 27 Modern Data Architecture HTTP FTP HL7 Flat File Share Subscribe Attach Initial Event Share Persist Event Share Ongoing Event Usage Purchase Data Open Data RAW CLEAN AGGREGATED ENRICHED RDBMS RDBMS ODS DW STAGESOURCE PROCESS PERSIST SERVE Rules Batch Human Process Hadoop Hadoop Lake Machine Learning Analytics Consulting
  • 28. 28 Modern Data Architecture HTTP FTP HL7 Flat File Share Subscribe Attach Initial Event Share Persist Event Share Purchase Data Open Data RAW CLEAN AGGREGATED ENRICHED RDBMS RDBMS ODS DW STAGESOURCE PROCESS PERSIST Ongoing Event Usage SERVE Rules Batch Human Process Hadoop Hadoop Lake Machine Learning Analytics Consulting DataVirtualization
  • 29. Powering Self-Service Discovery with Data Virtualization
  • 30. 30 Financial Data Mart Primary Use Case o Unify disparate accounting and finance data marts across various legacy organizations into a shared repository Secondary Use Cases o Provide a unified source for key BI initiatives like the GPO Dashboard o Support reporting needs as legacy systems are migrated or replaced during integration of Vizient and L-MDAS (dbVision, etc.) o Provide a final resting place for archived legacy sources like Solomon, Epicor, etc. VHA MedAssets UHC Vizient
  • 31. 31 Financial Data Mart Architectural Approach Denodo was selected as the data platform in order to utilize the following features of the software: o Data Virtualization allows sources in various mediums and locations to be integrated without physically moving the data o Data Abstraction allows data to be represented consistently within the DataMart while data sources are moved or replaced behind the scenes o Data Integration allows for a single seamless view to be created across a subject area (e.g. “Supplier Sales”) with varied data transformation rules for each data source within the subject area (PRS, dbVision).
  • 32. 32 GPO Dashboard Primary Use Case o Provide a consolidated view of supplier sales data across all customers of legacy Vizient & Med Assets organizations. Architectural Approach o Financial DataMart (on Denodo) for data source o Denodo TDE Exporter Tool for daily data extracts to Tableau:  Report Data  Report User Security o Tableau for report development and distribution
  • 33. 33 GPO Dashboard Key Challenges o Balance between data timeliness and report performance  Tableau reports performed best utilizing the TDE format (cached/extracted dataset) as opposed to a live connection  This meant that the report caches required daily refreshes, and data extraction had to be appropriately tuned  Denodo features such as dataset statistics and indexing greatly contributed to this performance tuning o Provisioning user security at cell level  The requirement for some internal report users to be restricted to the members/customers to which they are assigned meant that a new report security approach was needed  Reliance on TDEs for report data necessitated the integration of security in the reporting layer  Tableau’s “data blending” feature allows user security to be specified within a separate dataset  This also supports reuse of the security view in other reporting environments.
  • 34. 34 Contract Sales Actualizer Dashboard Primary Use Case o Integrate Member Spend and Supplier Sales data from all Vizient organizations to identify opportunities for increasing contract utilization Other Use Cases o Maintain consistency (Single Source Of Truth) with GPO dashboard regarding:  Supplier Sales Data  Dimension Data  User Security Architectural Approach o Data source utilizes Denodo to reuse overlapping datasets (sales, dimensions, security) while allowing separate virtualized views to be created for new datasets (member spend) which can be also be reused by future projects o Reporting components match approach used by GPO Dashboard
  • 35. 35 Contract Sales Actualizer Dashboard Key Challenges o Successful integration of Exadata RDM as a data source for Denodo.  Approach utilizes the strength of Exadata RDBMS for aggregating large quantities of data quickly  Denodo to integrate the data with similar legacy SQL Server data sources to create a comprehensive view of Vizient member spend o Scalability/Configuration Management  Advances were made to support parallel development of this project and continued efforts on GPO dashboard  Compartmentalization features within Denodo allow for code changes in each project to be version controlled and assessed for dependencies  Process guidelines are being authored to allow for multiple development efforts on the same datasets
  • 36. Denodo Platform for Data Virtualization
  • 37. 37 Accelerate Your Fast Data Strategy with Denodo Platform 6.0 New Release of Denodo Platform Delivers Breakthrough Performance, Accelerates Adoption, and Expedites Business Use of Data Breakthrough Performance Dynamic Query Optimizer delivers breakthrough performance for big data, logical data warehouse, and operational scenarios. Data Virtualization In the Cloud Denodo Platform for AWS accelerates adoption of data virtualization. Self-service Data Discovery and Search Self-service data discovery and search expedites use of data by business users. “Very happy with Denodo version 6. Well done!” – Claudia Imhoff, President, Intelligent Solutions
  • 38. 38 Common Data Virtualization Use Cases Data Virtualization BIG DATA, CLOUD INTEGRATION  Advanced Analytics  Data Warehouse Offloading  Big Data for Enterprise  Cloud / SaaS Integration AGILE BUSINESS INTELLIGENCE  Logical Data Warehouse  Virtual Data Marts  Self-Service BI  Operational BI / Analytics SINGLE VIEW APPLICATIONS  Single Customer View - Call Centers, Portals  Single Product View - Catalogs  Single Inventory View - Inventory Reconciliation  Vertical Specific - Single View of Wells DATA SERVICES  Unified Data Services Layer  Logical Data Abstraction  Agile Application Development  Linked Data Services
  • 39. 39 Why Denodo Data Virtualization? Data Virtualization Exceptional pre-sales service and customer support  Technical expertise and response time rated best by costumers.  Enhancement requests typically made available in days / weeks. Lower overall TCO and ROI  Address more use cases with a single platform.  Aggressive “all-in-one” pricing for entire platform; flexible pricing models. Broad functionality and winning innovation in a fully integrated platform  Flexible and broad use cases.  Integrated purpose-built for data virtualization – very easy to develop / deploy / change models.
  • 40. Q&A
  • 41. Thanks! www.denodo.com info@denodo.com © Copyright Denodo Technologies. All rights reserved Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm, without prior the written authorization from Denodo Technologies.