SlideShare a Scribd company logo
www.dlt.com
Red Hat JBoss Data Virtualization
July, 2016
Rick Stewart, Middleware SA
Herndon, VA
7/19/16 DLT Solutions LLC – Proprietary & Confidential 2
“Kiss” “Whitesnake” “Poison”
“Bad Company”
Data
Warehouse
“Bad Company”
7/19/16 DLT Solutions LLC – Proprietary & Confidential 3
“Kiss” “Whitesnake” “Poison”
Data
WarehouseData Virtualization Server
What does Data Virtualization software do?
7/19/16 DLT Solutions LLC – Proprietary & Confidential 4
Virtual Consolidated Data Source
BI Reports
Data Virtualization Software
•Consume
•Compose
•Connect
SAP Salesforce.comOracle DW XML, CSV
& Excel files
Siloed &
Complex
Virtualize
Abstract
Federate
Easy,
Real-time
Information
Access
Applications
DATA CONSUMERS
DATA SOURCES
“Bad Company”
7/19/16 DLT Solutions LLC – Proprietary & Confidential 5
“Kiss” “Whitesnake” “Poison”
Data
WarehouseData Virtualization Server
“Bad Company”
7/19/16 DLT Solutions LLC – Proprietary & Confidential 6
“Kiss” “Whitesnake” “Poison”
Data
WarehouseData Virtualization Server
Data Challenges Getting Bigger
7/19/16 DLT Solutions LLC – Proprietary & Confidential 7
BI Reports
Operational
Reports
Enterprise
Applications
Cloud Native
Applications
Mobile
Applications
Hadoop NoSQL Cloud Apps Data Warehouse
& Databases
Mainframe XML, CSV
& Excel Files
Enterprise Apps
Integration Complexity
Consumption & Creation
Siloed
How to Integrate?
Improve Access to Your Data
7/19/16 DLT Solutions LLC – Proprietary & Confidential 8
BI Reports
Operational
Reports
Enterprise
Applications
Cloud Native
Applications
Mobile
Applications
Hadoop NoSQL Cloud Apps Data Warehouse
& Databases
Mainframe XML, CSV
& Excel Files
Enterprise Apps
Broad & Streamlined
Adaptable & Secure
Federated & MeaningfulData Virtualization Server
Simplify Access to Your Data
7/19/16 DLT Solutions LLC – Proprietary & Confidential 9
streaming
databases
social
media data
production
application
big data
stores
website
ESB
analytics
& reporting
unstructured
data
mobile
App
data
warehouse
& data marts
internal
portal dashboard
external
data
private
data
ODBC/SQL JDBC/SQL XML/SOAP REST/JSON OData SQL
JMS SQL JDBC OData Hive RSS Excel JSONREST SOAP
JMS message SQL statement SOAP messageData Virtualization Server
production
databases
applications
Turn Siloed Data into Actionable Information
7/19/16 DLT Solutions LLC – Proprietary & Confidential 10
Connect
Compose
Consume
BI Reports & Analytics
Mobile Applications
Applications & PortalsESB, ETL
Native Data Connectivity
Standard based Data Provisioning
JDBC, ODBC, SOAP, REST, OData
JBoss
Data
Virtual-
ization
Data
Consumers
Data
Sources
Design Tools
Dashboard
Optimization
Caching
Security
Metadata
Hadoop NoSQL Cloud Apps Data Warehouse
& Databases
Mainframe
XML, CSV
& Excel Files
Enterprise Apps
Siloed &
Complex
Virtualize
Transform
Federate
Easy,
Real-time
Information
Access
Unified Virtual Database / Common Data Model
Data Transformations
Supported Data Sources
7/19/16
DLT Solutions LLC – Proprietary & Confidential 11
Enterprise RDBMS:
•Oracle
•IBM DB2
•Microsoft SQL Server
•Sybase ASE
•MySQL
•MariaDB
•PostgreSQL
•Ingres
Enterprise EDW:
•Teradata
•Netezza
•Greenplum
Search:
•Apache SOLR
Hadoop:
•Apache
•HortonWorks
•Cloudera
•More coming…
Office Productivity:
•Microsoft Excel
•Microsoft Access
•Google Spreadsheets
Specialty Data
Sources:
•ModeShape Repository
•Mondrian
•MetaMatrix
•LDAP
•Apache POI for Excel
NoSQL:
•JBoss Data Grid
•MongoDB
•Cassandra
•More coming…
Enterprise & Cloud
Applications:
•Salesforce.com
•SAP
Technology
Connectors:
•Flat Files, XML Files,
XML over HTTP
•SOAP Web Services
•REST Web Services
•OData Services
7/19/16
Data As A Service
DLT Solutions LLC – Proprietary & Confidential 127/19/16
Contextual view of disparate
source data
Single point of access
Standard based interfaces
Shareable integration and
transformation logic
Reusable data services
But you cannot achieve this by
writing more application code…
Hadoop NoSQL Cloud Apps Data Warehouse
& Databases
Mainframe XML, CSV
& Excel Files
Enterprise Apps
JBoss Data Virtualization
BI Dashboard
& Reports
Analytical
Applications
ESB/SOA
Integration
BPM
Applications
Mobile
Applications
SQL Statement SOAP MessageREST Message
REST Request
JSON Result
SQL Request
SQL Result
Logical Architecture
7/19/16 DLT Solutions LLC – Proprietary & Confidential 13
Data Consumers
Data Sources
Teiid Data Virtualization Designer
7/19/16 DLT Solutions LLC – Proprietary & Confidential 14
7/19/16 DLT Solutions LLC – Proprietary & Confidential 15
Tooling VirtualDB Engine Server
7/19/16 DLT Solutions LLC – Proprietary & Confidential 16
Tooling VirtualDB Engine Server
Users create data models
based on metadata:
•Imported from data
sources
•Supplied via DDL
•Provided by Engine
•Specified by user
Models are packaged in a
Virtual Database (VDB)
Physical Models representing actual data sources
Logical Models
7/19/16 DLT Solutions LLC – Proprietary & Confidential 17
Tooling VirtualDB Engine Server
Build XML Document
models from XML Schemas
Map XML Document
models to other data models
Enable data access via
XML
7/19/16 DLT Solutions LLC – Proprietary & Confidential 18
Tooling VirtualDB Engine Server
Virtual Databases (VDBs) are deployment
archives similar to .WAR.
VDBs contain
•Source metadata and models
•View metadata and models
•System metadata
•Connection information, which is bound to
sources at deployment time
VDBs are deployed to the query engine
VDB Internals
Source Models
Connector
Binding
Properties
View Models
Manifesto Info
7/19/16
19
Tooling VirtualDB Engine Server
JBoss Data Virtualization can offer finer-grained
security control:
Authentication: Kerberos, LDAP, WS-UsernameToken,
HTTP Basic, SAML
Authorization: Virtual data views, Role based access
control
Administration: Centralized management of Virtual DB
privileges
Audit: Centralized audit logging and dashboard
Protection:
Row and column masking
SSL encryption (ODBC and JDBC)
DLT Solutions LLC – Proprietary & Confidential
7/19/16 DLT Solutions LLC – Proprietary & Confidential 20
Tooling VirtualDB Engine Server
Query Engine
JDBC API
VDB
Connector
Binding (1)
Connector
Binding (2)
C1 C2
DB
Oracle
DB
SQL Server
Data Consumer Apps
Query Engine is core data
virtualization functionality: Federating
relational query engine. Rule and cost
based optimizer, advanced query
planner, caching, hint processing.
Query Engine hosts VDBs, binds to
data sources, performs query
execution and results processing.
7/19/16
21
Tooling VirtualDB Engine Server
The Teiid Query engine is hosted in
JBoss EAP and uses key container-
provided services:
•Transaction manager
•JAAS security framework
•Container managed data sources
•EAP management infrastructure
•EAP deployment
The Server exposes views /services
to consumers and managed
connections and connection pools for
data sources.
DLT Solutions LLC – Proprietary & Confidential
JBoss EAP
Applications
Security
JAAS
Transaction
Manager
JDV Runtime Engine
BufferMgr
Threading
Local Caches
etc.
VDB
VDBs
ODBC Socket
Transport
Admin Socket
Transport
JDBC Socket
Transport
Profile
Service
ODBC
JDBC
Admin /
AdminShell
JON
DS
DS
DS
DS
JCA
Translators
Embedded DS
xxx-ds.xml
yyy-ds.xml
zzz-ds.xml
7/19/16
22
Tooling VirtualDB Engine Server
DLT Solutions LLC – Proprietary & Confidential
CACHING & MATERIALIZATION
Multiple levels of caching to meet
performance requirements and manage load
on source systems:
Materialized Views
–External or Internal materialized views
–Ability to override use of materialized
views
Result set Caching
–Applied to results return from user queries
and virtual procedure calls
–Configurable time to live and max. number
of entries
Code Table Caching
–Suited for integrating reference data with
transaction/operational data e.g. Country
code, State Code etc.
QUERY
Access Patterns – criteria requirements on
pushdown queries
Pushdown – decompose user query into
source queries
–Projection minimization to remove unused
select items
–Decompose aggregates over joins/unions
–Generating SQL matching Teiid system
functions
Dependent Joins (can use hints) – feed equi-
join values from one side of the join to the
other
Partition aware aggregation and joins
Copy Criteria – uses criteria transitivity to
minimize join tuples.
PERFORMANCE OPTIMIZATION
Business Dashboard
7/19/16 DLT Solutions LLC – Proprietary & Confidential 23
Bring It All Together
7/19/16 DLT Solutions LLC – Proprietary & Confidential 24
Hadoop
Data Integration
JBoss Data Virtualization
In-memory Cache
JBoss Data Grid
BI Analytics
(historical, operational, predictive)
Composite Applications
Messaging and Event Processing
JBoss A-MQ and JBoss BRMS
J
Structured Data
Streaming
Data
Semi-Structured
Data
Capture&ProcessIntegrate&Analyze
Red Hat Storage
25
Questions
?
Bring It All Together
7/19/16 DLT Solutions LLC – Proprietary & Confidential 26
27
Thank
You!
JBoss Data Virtualization – Use Cases
7/19/16 DLT Solutions LLC – Proprietary & Confidential 28
Self-Service
Business
Intelligence
The virtual, reusable data model provides business-friendly representation of data,
allowing the user to interact with their data without having to know the complexities of
their database or where the data is stored and allowing multiple BI tools to acquire data
from centralized data layer. Gain better insights from Big Data using JBoss Data
Virtualization to integrate with existing information sources.
360◦
Unified
View
Deliver a complete view of master & transactional data in real-time. The virtual data layer
serves as a unified, enterprise-wide view of business information that improves users’ ability
to understand and leverage enterprise data.
Agile SOA
Data
Services
A data virtualization layer deliver the missing data services layer to SOA applications. JBoss
Data Virtualization increases agility and loose coupling with virtual data stores without the
need to touch underlying sources and creation of data services that encapsulate the data
access logic and allowing multiple business service to acquire data from centralized data
layer.
Regulatory
Compliance
Data Virtualization layer deliver the data firewall functionality. JBoss Data Virtualization
improves data quality via centralized access control, robust security infrastructure and
reduction in physical copies of data thus reducing risk. Furthermore, the metadata
repository catalogs enterprise data locations and the relationships between the data in
various data stores, enabling transparency and visibility.
7/19/16 DLT Solutions LLC – Proprietary & Confidential 29
BA C D
JBoss Data Virtualization
Leveraged TPC-H like schema, data and queries
Use 4 different commercial enterprise RDBMS
Each database with 1 TB data representing
•150 million customers, with over
•600 million order records, and
•6 billion order line items.
•Total 4 TB of data
Findings:
•No measurable JDV queries overhead vs. direct queries
•Queries to federated data from four data sources ran
61.7 percent faster vs. baseline
•Scaling queries workload by 2x resulted in <10% impact
on response time
Download Benchmark Study @ http://www.redhat.com/en/resources/jboss-data-virtualization-query-performance-benchmark-study

More Related Content

ODP
JBoss Enterprise Data Services (Data Virtualization)
PDF
JDV for Codemotion Rome 2017
PPTX
Data virtualization, Data Federation & IaaS with Jboss Teiid
PDF
Data Virtualization Primer - Introduction
PDF
Enabling Data as a Service with the JBoss Enterprise Data Services Platform
PPT
Big data insights with Red Hat JBoss Data Virtualization
PPTX
Data Virtualization and ETL
PDF
Red Hat JBoss Data Virtualization
JBoss Enterprise Data Services (Data Virtualization)
JDV for Codemotion Rome 2017
Data virtualization, Data Federation & IaaS with Jboss Teiid
Data Virtualization Primer - Introduction
Enabling Data as a Service with the JBoss Enterprise Data Services Platform
Big data insights with Red Hat JBoss Data Virtualization
Data Virtualization and ETL
Red Hat JBoss Data Virtualization

What's hot (20)

PDF
Data Integration through Data Virtualization (SQL Server Konferenz 2019)
PDF
Encompassing Information Integration
PDF
Jboss Teiid - The data you have on the place you need
PDF
Enabling digital transformation api ecosystems and data virtualization
PDF
dvprimer-architecture
PDF
Myth Busters II: BI Tools and Data Virtualization are Interchangeable
PPTX
Information Virtualization: Query Federation on Data Lakes
PDF
A Comparison of EDB Postgres to Self-Supported PostgreSQL
 
PDF
An introduction to data virtualization in business intelligence
PDF
Data Platform Overview
PDF
IOOF Mongodb Australia
PDF
An Introduction to Data Virtualization in 2018
PPTX
The Double win business transformation and in-year ROI and TCO reduction
PDF
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
PPTX
Building Modern Data Platform with Microsoft Azure
PDF
Data Mesh for Dinner
PDF
Open Development
PDF
DQS & MDS in SQL Server 2016
PDF
A Brief Introduction: MongoDB
PDF
Worst Practices in Data Warehouse Design
Data Integration through Data Virtualization (SQL Server Konferenz 2019)
Encompassing Information Integration
Jboss Teiid - The data you have on the place you need
Enabling digital transformation api ecosystems and data virtualization
dvprimer-architecture
Myth Busters II: BI Tools and Data Virtualization are Interchangeable
Information Virtualization: Query Federation on Data Lakes
A Comparison of EDB Postgres to Self-Supported PostgreSQL
 
An introduction to data virtualization in business intelligence
Data Platform Overview
IOOF Mongodb Australia
An Introduction to Data Virtualization in 2018
The Double win business transformation and in-year ROI and TCO reduction
Agile Data Warehousing: Using SDDM to Build a Virtualized ODS
Building Modern Data Platform with Microsoft Azure
Data Mesh for Dinner
Open Development
DQS & MDS in SQL Server 2016
A Brief Introduction: MongoDB
Worst Practices in Data Warehouse Design
Ad

Viewers also liked (20)

DOCX
Swati Dubey QA 6 Yrs
PDF
ERRP=Addendum to resettlement action plan - package - 2 & 3
PDF
Frederick County Office of Economic Development 2015 Annual Report
PPTX
GTRI Splunk Overview - Splunk Tech Day
PPTX
World Wide Technology Tec37 Webinar - Deploy and Manage Windows 10 at Scale v1
PPT
GFS Chemicals Introduction
PDF
PPT D-DataSource
PPT
Websphere - Introduction to jdbc
PPT
PDF
Odroid Magazine March 2014
PDF
reliance steel & aluminum 2006_AnnualReport
PDF
People as sensors - mining social media for meaningful information
PDF
nextcomputing-radius-edge
DOC
McLister, John- Resume Updated_03.16-2
PDF
ssd vs hdd infographic
PDF
SOW for Ironbow ACS project
DOC
Prueba corta de estequiometria de reacciones
PPT
Carahsoft technology interview questions and answers
PDF
Snr Systems Engineer ArthitK_CVMar2016
PDF
Puppet Camp Atlanta 2014: DEV Toolsets for Ops (Beginner) -
Swati Dubey QA 6 Yrs
ERRP=Addendum to resettlement action plan - package - 2 & 3
Frederick County Office of Economic Development 2015 Annual Report
GTRI Splunk Overview - Splunk Tech Day
World Wide Technology Tec37 Webinar - Deploy and Manage Windows 10 at Scale v1
GFS Chemicals Introduction
PPT D-DataSource
Websphere - Introduction to jdbc
Odroid Magazine March 2014
reliance steel & aluminum 2006_AnnualReport
People as sensors - mining social media for meaningful information
nextcomputing-radius-edge
McLister, John- Resume Updated_03.16-2
ssd vs hdd infographic
SOW for Ironbow ACS project
Prueba corta de estequiometria de reacciones
Carahsoft technology interview questions and answers
Snr Systems Engineer ArthitK_CVMar2016
Puppet Camp Atlanta 2014: DEV Toolsets for Ops (Beginner) -
Ad

Similar to Red Hat JBOSS Data Virtualization (20)

PDF
Data virtualization
PPTX
A Guide to Modern BPM Data Integration
PDF
SOA Summit 2014
PDF
Getting Started with Data Virtualization – What problems DV solves
PDF
Mow2012 data services
PDF
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
PDF
Powerplay: Postgres and Lenovo for the Best Performance & Savings
 
PDF
From Database to Strategy - Sandor Klein
PDF
Where does Fast Data Strategy Fit within IT Projects
PPTX
Transform your DBMS to drive engagement innovation with Big Data
PDF
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
PPTX
JDV Big Data Webinar v2
PPT
Why Data Virtualization? An Introduction by Denodo
PDF
EDB Executive Presentation 101515
PDF
Denodo DataFest 2017: The Need for Speed and Agility in Business
PPTX
The Most Trusted In-Memory database in the world- Altibase
PDF
Single View of Well, Production and Assets
PPTX
The Evolution of the Oracle Database - Then, Now and Later (Fontys Hogeschool...
PDF
The Evolution of Open Source Databases
PDF
Postgres Integrates Effectively in the "Enterprise Sandbox"
 
Data virtualization
A Guide to Modern BPM Data Integration
SOA Summit 2014
Getting Started with Data Virtualization – What problems DV solves
Mow2012 data services
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Powerplay: Postgres and Lenovo for the Best Performance & Savings
 
From Database to Strategy - Sandor Klein
Where does Fast Data Strategy Fit within IT Projects
Transform your DBMS to drive engagement innovation with Big Data
OPEN'17_4_Postgres: The Centerpiece for Modernising IT Infrastructures
JDV Big Data Webinar v2
Why Data Virtualization? An Introduction by Denodo
EDB Executive Presentation 101515
Denodo DataFest 2017: The Need for Speed and Agility in Business
The Most Trusted In-Memory database in the world- Altibase
Single View of Well, Production and Assets
The Evolution of the Oracle Database - Then, Now and Later (Fontys Hogeschool...
The Evolution of Open Source Databases
Postgres Integrates Effectively in the "Enterprise Sandbox"
 

More from DLT Solutions (20)

PDF
WebLogic 12c & WebLogic Mgmt Pack
PDF
Oracle Identity & Access Management
PDF
Oracle Key Vault Data Subsetting and Masking
PDF
AV/DF Advanced Security Option
PDF
Replicate data between environments
PDF
Streamline it management
PDF
Consolidate and prepare for cloud efficiencies
PPTX
Red Hat Software Defined Storage
ODP
Openshift Container Platform
PDF
How to Upgrade Hundreds or Thousands of Databases
PPTX
Why Upgrade to Oracle Database 12c?
PPTX
Cross Domain Solutions for SolarWinds from Sterling Computers
PPTX
Making Sense of Threat Reports
PDF
DLT Portal
PPTX
Symantec and ForeScout Delivering a Unified Cyber Security Solution
PPTX
Deploying and Managing Red Hat Enterprise Linux in Amazon Web Services
PDF
Implementing BIM for Owners
PDF
Autodesk Infrastructure Solutions for Government Agencies
PPTX
Is Your Agency Data Challenged?
PPTX
Simplify and Streamline with Oracle Solutions Contract
WebLogic 12c & WebLogic Mgmt Pack
Oracle Identity & Access Management
Oracle Key Vault Data Subsetting and Masking
AV/DF Advanced Security Option
Replicate data between environments
Streamline it management
Consolidate and prepare for cloud efficiencies
Red Hat Software Defined Storage
Openshift Container Platform
How to Upgrade Hundreds or Thousands of Databases
Why Upgrade to Oracle Database 12c?
Cross Domain Solutions for SolarWinds from Sterling Computers
Making Sense of Threat Reports
DLT Portal
Symantec and ForeScout Delivering a Unified Cyber Security Solution
Deploying and Managing Red Hat Enterprise Linux in Amazon Web Services
Implementing BIM for Owners
Autodesk Infrastructure Solutions for Government Agencies
Is Your Agency Data Challenged?
Simplify and Streamline with Oracle Solutions Contract

Recently uploaded (20)

PDF
NAV to Microsoft Dynamics 365 Business Central Upgrade in London UK (1).pdf
PDF
Choosing an Entrepreneurial Path Based on Your Personality.pdf
PDF
The Role of Testing and QA in Successful Mobile App Development_Spinx Infotec...
PDF
Risk Assessment Survey of the Esarbica 2025.pdf
PPTX
Enhancing Wastewater Treatment Efficiency with GO2™ Water Treatment Chlorine ...
PDF
2025 Electrician Marketing Trends Report | Destiny Marketing Solutions
PDF
Effective Bad Luck Removal In Sydney.pdf
PDF
How to Inspect Exterior Paint for Early Signs of Summer Damage.pdf
PDF
Best Private Bba Colleges | Galgotias University
PDF
Understanding LA's Zero Waste Initiative
PPTX
The Rise of Work-from-Home Internships.pptx
PDF
Smart Plumbing Solutions Every Property Owner and Developer Should Know
PDF
Secure Your World with Acme Enterprises PDF Sharing.pdf
PDF
The New Drive_ How the Transportation Business is Reinventing Itself by Ednei...
PDF
Legacy Application Modernisation Services.pdf
PPTX
How After-School Art Classes Enhance Social Skills.pptx
PDF
Leveraging Earth Observation Data to Improve Wildfire Prevention and Manageme...
PPTX
Unlocking-Business-Potential-Power-BI-Development-Services.pptx
PDF
Bisleri vs Coca Cola.pdf intellectual property rights
PDF
Optimize Freight, Fleet, and Fulfillment with Scalable Logistics Solutions.pdf
NAV to Microsoft Dynamics 365 Business Central Upgrade in London UK (1).pdf
Choosing an Entrepreneurial Path Based on Your Personality.pdf
The Role of Testing and QA in Successful Mobile App Development_Spinx Infotec...
Risk Assessment Survey of the Esarbica 2025.pdf
Enhancing Wastewater Treatment Efficiency with GO2™ Water Treatment Chlorine ...
2025 Electrician Marketing Trends Report | Destiny Marketing Solutions
Effective Bad Luck Removal In Sydney.pdf
How to Inspect Exterior Paint for Early Signs of Summer Damage.pdf
Best Private Bba Colleges | Galgotias University
Understanding LA's Zero Waste Initiative
The Rise of Work-from-Home Internships.pptx
Smart Plumbing Solutions Every Property Owner and Developer Should Know
Secure Your World with Acme Enterprises PDF Sharing.pdf
The New Drive_ How the Transportation Business is Reinventing Itself by Ednei...
Legacy Application Modernisation Services.pdf
How After-School Art Classes Enhance Social Skills.pptx
Leveraging Earth Observation Data to Improve Wildfire Prevention and Manageme...
Unlocking-Business-Potential-Power-BI-Development-Services.pptx
Bisleri vs Coca Cola.pdf intellectual property rights
Optimize Freight, Fleet, and Fulfillment with Scalable Logistics Solutions.pdf

Red Hat JBOSS Data Virtualization

  • 1. www.dlt.com Red Hat JBoss Data Virtualization July, 2016 Rick Stewart, Middleware SA Herndon, VA
  • 2. 7/19/16 DLT Solutions LLC – Proprietary & Confidential 2 “Kiss” “Whitesnake” “Poison” “Bad Company” Data Warehouse
  • 3. “Bad Company” 7/19/16 DLT Solutions LLC – Proprietary & Confidential 3 “Kiss” “Whitesnake” “Poison” Data WarehouseData Virtualization Server
  • 4. What does Data Virtualization software do? 7/19/16 DLT Solutions LLC – Proprietary & Confidential 4 Virtual Consolidated Data Source BI Reports Data Virtualization Software •Consume •Compose •Connect SAP Salesforce.comOracle DW XML, CSV & Excel files Siloed & Complex Virtualize Abstract Federate Easy, Real-time Information Access Applications DATA CONSUMERS DATA SOURCES
  • 5. “Bad Company” 7/19/16 DLT Solutions LLC – Proprietary & Confidential 5 “Kiss” “Whitesnake” “Poison” Data WarehouseData Virtualization Server
  • 6. “Bad Company” 7/19/16 DLT Solutions LLC – Proprietary & Confidential 6 “Kiss” “Whitesnake” “Poison” Data WarehouseData Virtualization Server
  • 7. Data Challenges Getting Bigger 7/19/16 DLT Solutions LLC – Proprietary & Confidential 7 BI Reports Operational Reports Enterprise Applications Cloud Native Applications Mobile Applications Hadoop NoSQL Cloud Apps Data Warehouse & Databases Mainframe XML, CSV & Excel Files Enterprise Apps Integration Complexity Consumption & Creation Siloed How to Integrate?
  • 8. Improve Access to Your Data 7/19/16 DLT Solutions LLC – Proprietary & Confidential 8 BI Reports Operational Reports Enterprise Applications Cloud Native Applications Mobile Applications Hadoop NoSQL Cloud Apps Data Warehouse & Databases Mainframe XML, CSV & Excel Files Enterprise Apps Broad & Streamlined Adaptable & Secure Federated & MeaningfulData Virtualization Server
  • 9. Simplify Access to Your Data 7/19/16 DLT Solutions LLC – Proprietary & Confidential 9 streaming databases social media data production application big data stores website ESB analytics & reporting unstructured data mobile App data warehouse & data marts internal portal dashboard external data private data ODBC/SQL JDBC/SQL XML/SOAP REST/JSON OData SQL JMS SQL JDBC OData Hive RSS Excel JSONREST SOAP JMS message SQL statement SOAP messageData Virtualization Server production databases applications
  • 10. Turn Siloed Data into Actionable Information 7/19/16 DLT Solutions LLC – Proprietary & Confidential 10 Connect Compose Consume BI Reports & Analytics Mobile Applications Applications & PortalsESB, ETL Native Data Connectivity Standard based Data Provisioning JDBC, ODBC, SOAP, REST, OData JBoss Data Virtual- ization Data Consumers Data Sources Design Tools Dashboard Optimization Caching Security Metadata Hadoop NoSQL Cloud Apps Data Warehouse & Databases Mainframe XML, CSV & Excel Files Enterprise Apps Siloed & Complex Virtualize Transform Federate Easy, Real-time Information Access Unified Virtual Database / Common Data Model Data Transformations
  • 11. Supported Data Sources 7/19/16 DLT Solutions LLC – Proprietary & Confidential 11 Enterprise RDBMS: •Oracle •IBM DB2 •Microsoft SQL Server •Sybase ASE •MySQL •MariaDB •PostgreSQL •Ingres Enterprise EDW: •Teradata •Netezza •Greenplum Search: •Apache SOLR Hadoop: •Apache •HortonWorks •Cloudera •More coming… Office Productivity: •Microsoft Excel •Microsoft Access •Google Spreadsheets Specialty Data Sources: •ModeShape Repository •Mondrian •MetaMatrix •LDAP •Apache POI for Excel NoSQL: •JBoss Data Grid •MongoDB •Cassandra •More coming… Enterprise & Cloud Applications: •Salesforce.com •SAP Technology Connectors: •Flat Files, XML Files, XML over HTTP •SOAP Web Services •REST Web Services •OData Services 7/19/16
  • 12. Data As A Service DLT Solutions LLC – Proprietary & Confidential 127/19/16 Contextual view of disparate source data Single point of access Standard based interfaces Shareable integration and transformation logic Reusable data services But you cannot achieve this by writing more application code… Hadoop NoSQL Cloud Apps Data Warehouse & Databases Mainframe XML, CSV & Excel Files Enterprise Apps JBoss Data Virtualization BI Dashboard & Reports Analytical Applications ESB/SOA Integration BPM Applications Mobile Applications SQL Statement SOAP MessageREST Message REST Request JSON Result SQL Request SQL Result
  • 13. Logical Architecture 7/19/16 DLT Solutions LLC – Proprietary & Confidential 13 Data Consumers Data Sources
  • 14. Teiid Data Virtualization Designer 7/19/16 DLT Solutions LLC – Proprietary & Confidential 14
  • 15. 7/19/16 DLT Solutions LLC – Proprietary & Confidential 15 Tooling VirtualDB Engine Server
  • 16. 7/19/16 DLT Solutions LLC – Proprietary & Confidential 16 Tooling VirtualDB Engine Server Users create data models based on metadata: •Imported from data sources •Supplied via DDL •Provided by Engine •Specified by user Models are packaged in a Virtual Database (VDB) Physical Models representing actual data sources Logical Models
  • 17. 7/19/16 DLT Solutions LLC – Proprietary & Confidential 17 Tooling VirtualDB Engine Server Build XML Document models from XML Schemas Map XML Document models to other data models Enable data access via XML
  • 18. 7/19/16 DLT Solutions LLC – Proprietary & Confidential 18 Tooling VirtualDB Engine Server Virtual Databases (VDBs) are deployment archives similar to .WAR. VDBs contain •Source metadata and models •View metadata and models •System metadata •Connection information, which is bound to sources at deployment time VDBs are deployed to the query engine VDB Internals Source Models Connector Binding Properties View Models Manifesto Info
  • 19. 7/19/16 19 Tooling VirtualDB Engine Server JBoss Data Virtualization can offer finer-grained security control: Authentication: Kerberos, LDAP, WS-UsernameToken, HTTP Basic, SAML Authorization: Virtual data views, Role based access control Administration: Centralized management of Virtual DB privileges Audit: Centralized audit logging and dashboard Protection: Row and column masking SSL encryption (ODBC and JDBC) DLT Solutions LLC – Proprietary & Confidential
  • 20. 7/19/16 DLT Solutions LLC – Proprietary & Confidential 20 Tooling VirtualDB Engine Server Query Engine JDBC API VDB Connector Binding (1) Connector Binding (2) C1 C2 DB Oracle DB SQL Server Data Consumer Apps Query Engine is core data virtualization functionality: Federating relational query engine. Rule and cost based optimizer, advanced query planner, caching, hint processing. Query Engine hosts VDBs, binds to data sources, performs query execution and results processing.
  • 21. 7/19/16 21 Tooling VirtualDB Engine Server The Teiid Query engine is hosted in JBoss EAP and uses key container- provided services: •Transaction manager •JAAS security framework •Container managed data sources •EAP management infrastructure •EAP deployment The Server exposes views /services to consumers and managed connections and connection pools for data sources. DLT Solutions LLC – Proprietary & Confidential JBoss EAP Applications Security JAAS Transaction Manager JDV Runtime Engine BufferMgr Threading Local Caches etc. VDB VDBs ODBC Socket Transport Admin Socket Transport JDBC Socket Transport Profile Service ODBC JDBC Admin / AdminShell JON DS DS DS DS JCA Translators Embedded DS xxx-ds.xml yyy-ds.xml zzz-ds.xml
  • 22. 7/19/16 22 Tooling VirtualDB Engine Server DLT Solutions LLC – Proprietary & Confidential CACHING & MATERIALIZATION Multiple levels of caching to meet performance requirements and manage load on source systems: Materialized Views –External or Internal materialized views –Ability to override use of materialized views Result set Caching –Applied to results return from user queries and virtual procedure calls –Configurable time to live and max. number of entries Code Table Caching –Suited for integrating reference data with transaction/operational data e.g. Country code, State Code etc. QUERY Access Patterns – criteria requirements on pushdown queries Pushdown – decompose user query into source queries –Projection minimization to remove unused select items –Decompose aggregates over joins/unions –Generating SQL matching Teiid system functions Dependent Joins (can use hints) – feed equi- join values from one side of the join to the other Partition aware aggregation and joins Copy Criteria – uses criteria transitivity to minimize join tuples. PERFORMANCE OPTIMIZATION
  • 23. Business Dashboard 7/19/16 DLT Solutions LLC – Proprietary & Confidential 23
  • 24. Bring It All Together 7/19/16 DLT Solutions LLC – Proprietary & Confidential 24 Hadoop Data Integration JBoss Data Virtualization In-memory Cache JBoss Data Grid BI Analytics (historical, operational, predictive) Composite Applications Messaging and Event Processing JBoss A-MQ and JBoss BRMS J Structured Data Streaming Data Semi-Structured Data Capture&ProcessIntegrate&Analyze Red Hat Storage
  • 26. Bring It All Together 7/19/16 DLT Solutions LLC – Proprietary & Confidential 26
  • 28. JBoss Data Virtualization – Use Cases 7/19/16 DLT Solutions LLC – Proprietary & Confidential 28 Self-Service Business Intelligence The virtual, reusable data model provides business-friendly representation of data, allowing the user to interact with their data without having to know the complexities of their database or where the data is stored and allowing multiple BI tools to acquire data from centralized data layer. Gain better insights from Big Data using JBoss Data Virtualization to integrate with existing information sources. 360◦ Unified View Deliver a complete view of master & transactional data in real-time. The virtual data layer serves as a unified, enterprise-wide view of business information that improves users’ ability to understand and leverage enterprise data. Agile SOA Data Services A data virtualization layer deliver the missing data services layer to SOA applications. JBoss Data Virtualization increases agility and loose coupling with virtual data stores without the need to touch underlying sources and creation of data services that encapsulate the data access logic and allowing multiple business service to acquire data from centralized data layer. Regulatory Compliance Data Virtualization layer deliver the data firewall functionality. JBoss Data Virtualization improves data quality via centralized access control, robust security infrastructure and reduction in physical copies of data thus reducing risk. Furthermore, the metadata repository catalogs enterprise data locations and the relationships between the data in various data stores, enabling transparency and visibility.
  • 29. 7/19/16 DLT Solutions LLC – Proprietary & Confidential 29 BA C D JBoss Data Virtualization Leveraged TPC-H like schema, data and queries Use 4 different commercial enterprise RDBMS Each database with 1 TB data representing •150 million customers, with over •600 million order records, and •6 billion order line items. •Total 4 TB of data Findings: •No measurable JDV queries overhead vs. direct queries •Queries to federated data from four data sources ran 61.7 percent faster vs. baseline •Scaling queries workload by 2x resulted in <10% impact on response time Download Benchmark Study @ http://www.redhat.com/en/resources/jboss-data-virtualization-query-performance-benchmark-study