SlideShare a Scribd company logo
www.synerzip.comwww.synerzip.com
Data Virtualization
&
Information As A Service (IaaS)
By Anil Allewar
Senior Solutions Architect - Synerzip
1
www.synerzip.comwww.synerzip.com
About Me!!
Confidential 2
Anil Allewar
Senior Solutions Architect @
Synerzip
Technology Evangelist &
speaker
Core interests: JEE, EAI, EII
www.synerzip.comwww.synerzip.com
• Use cases
Agenda
• What does it mean?
• Implementation Frameworks
• Demo
• Questions?
3
• Architecture explained
www.synerzip.comwww.synerzip.com
Why it makes sense?
Confidential 4
www.synerzip.comwww.synerzip.com
Use Cases
Data
Warehouse
ETL
Financial
Data
OLTP
Data
ETL
3rd Party
Data
Data
Mart
ETL
Web
Service 1
Web
Service 2
Legacy
Data
Custom
Program
Excel
files
Confidential 5
www.synerzip.comwww.synerzip.com
Traditional Data Integration
Enterprise Information System
ETL
Source
System
Source
System
ETL
Confidential 6
Business Applications
www.synerzip.comwww.synerzip.com
Problems with ETL
Confidential 7
More than 1 copy of
data for staging
Intermediate data =>
Errors
Lead time to add new
source
Domain knowledge for
mapping
Batch Process => No
real time data
www.synerzip.comwww.synerzip.com
Problems with DBMS consolidation
Confidential 8
Alternate approach =>
Single EIS (say RDBMS)
Extensive changes to
existing apps
Might not satisfy
everyone’s requiremets
www.synerzip.comwww.synerzip.com
• Use cases
Agenda
• What does it mean?
• Implementation Frameworks
• Demo
• Questions?
9
• Architecture explained
www.synerzip.comwww.synerzip.com
Data Virtualization & Federation
Confidential 10
Single API to access
data
Only metadata stored
at virtualization layer
Real time access
without
copying/moving data
Federate data across
hetero/homogenous
sources
www.synerzip.comwww.synerzip.com
Data Virtualization
Confidential 11
www.synerzip.comwww.synerzip.com
• Use cases
Agenda
• What does it mean?
• Implementation Frameworks
• Demo
• Questions?
12
• Architecture explained
www.synerzip.comwww.synerzip.com
Architecture
Confidential 13
User
Application
CommonAccess
API
Connector 1
Connector 2
RUNTIME &
QUERY ENGINE
Virtual
Database
Translator
1
Translator
2
www.synerzip.comwww.synerzip.com
• Use cases
Agenda
• What does it mean?
• Implementation Frameworks
• Demo
• Questions?
14
• Architecture explained
www.synerzip.comwww.synerzip.com
Vendors
• Commercial Products
– Composite Software
• http://www.compositesw.com/data-virtualization/
– Denodo
• http://www.denodo.com/en/product/overview.php?n=h
– IBM
• http://www-03.ibm.com/software/products/en/ibminfofedeserv
– Informatica
• http://www.informatica.com/us/data-virtualization/
– Red Hat
• http://www.redhat.com/products/jbossenterprisemiddleware/data-virtualization/
• Open Source
– Jboss Teiid
• http://teiid.jboss.org/
Confidential 15
www.synerzip.comwww.synerzip.com
Selected Platform – JBoss Teiid
Confidential 16
Open Source
Number of
relational/NoSQL/E
RP/CRM data stores
JEE standards
Add custom EIS
support using JEE
components
Active & responsive
community Synerzip contribution: Defect
discovery, root cause analysis,
feature verification
www.synerzip.comwww.synerzip.com
Teiid Components
• Virtual Database
– container for components used to integrate data from
multiple data sources
• Source Models
– structure and characteristics of physical data sources
• View Models
– structure and characteristics of abstract structures you want to expose to your
applications
• Teiid Designer
– Eclipse based UI to dynamically discover data source
objects and apply data federation
– Generate virtual database from 1 or more sources
Confidential 17
www.synerzip.comwww.synerzip.com
Teiid Components
• Translator
– Provides abstraction later between Teiid Query
Engine and source system
– Convert Teiid SQL commands to source specific
execution commands
– Convert result data from source system to Teiid
specific format
• Resource Adapter
– Provides connectivity to the physical data source
– Integration provided through Java Connector
Architecture (JCA) API
Confidential 18
www.synerzip.comwww.synerzip.com
Teiid – Supported EIS
• Amazon SimpleDB
• Apache Accumulo
• Apache SOLR
• Cassandra
• File
• Google Spreadsheet
• JPA
• LDAP
• Excel – as file
• SalesForce
• JDBC
– MS access, DB2, derby, excel-
odbc, greenplum, h2 ,
hive(for accessing Hadoop),
oracle, teradata and most
RDBMS
• MongoDB
• Object
• OData
• OLAP
• Web Services
• SAP Netweaver Gateway
Confidential 19
www.synerzip.comwww.synerzip.com
Performance Characteristics
• Access same data using Oracle and Teiid drivers
– Retrieval times comparable when accessing tables
having no Blobs
Confidential 20
0
5,000
10,000
15,000
20,000
25,000
No. of rows Vs Time: No Blobs
Oracle-JDBC
Teiid-JDBC
No. of rows
ms
www.synerzip.comwww.synerzip.com
Performance Characteristics
– Teiid slower when accessing Blob data
• Can be tuned
Confidential 21
0
5,000
10,000
15,000
20,000
25,000
30,000
0 0 2 42 21,804 32,531 185,454
No. of rows Vs Time: Blobs
Oracle-JDBC
Teiid-JDBC
ms
No. of rows
www.synerzip.comwww.synerzip.com
• Use cases
Agenda
• What does it mean?
• Implementation Frameworks
• Demo
• Questions?
22
• Architecture explained
www.synerzip.comwww.synerzip.com
Demo
Confidential 23
JDBC
Client
JDBC
API
RDBMS
Resource
Adapter
MongoDB
Resource
Adapter
TEIID RUNTIME
& QUERY
ENGINE
Federated
VDB
mySQL
Translator
MongoDB
Translator
mySQL
www.synerzip.comwww.synerzip.com
Demo-Steps
• Pre-requisites
– mySQL server 5.5+ installed
– MongoDB 2.4.x+ installed
• Steps
– Load the mySql and MongoDB database with sample data
– Setup environment – JBoss, Eclipse
– Create Teiid project in Eclipse using Teiid designer
• Import source model using JDBC
• Create the virtual model and federate data from the source
model
• Create a virtual database (VDB) and deploy to JBoss
– Access data using JDBC client or through browser using OData
Confidential 24
www.synerzip.comwww.synerzip.com
Demo – Scenario
Confidential 25
Federated
Data
www.synerzip.comwww.synerzip.com
Demo – Connection Profile
Confidential 26
www.synerzip.comwww.synerzip.com
Demo – Source Model
Confidential 27
www.synerzip.comwww.synerzip.com
Demo - Source Model Generation
Confidential 28
www.synerzip.comwww.synerzip.com
Demo – Map Source To View
Confidential 29
www.synerzip.comwww.synerzip.com
Demo - Association
Confidential 30
www.synerzip.comwww.synerzip.com
Demo – Data Federation
Confidential 31
www.synerzip.comwww.synerzip.com
Demo – Source Code
• Source code
–https://github.com/Synerzip/JBoss-
Teiid
–Contains
• Configuration files
• Instructions
• “How-to” videos
• VDBs, source models and view models
Confidential 32
www.synerzip.comwww.synerzip.com
Conclusion
• Data Virtualization and Federation is
a rapidly emerging technology that
solves traditional BI/ETL problems.
• It provides lower time to market,
distributes data across the enterprise
as a service and provides real time
access to enterprise data.
Confidential 33
www.synerzip.comwww.synerzip.com
• Use cases
Agenda
• What does it mean?
• Implementation Frameworks
• Demo
• Questions?
34
• Architecture explained
www.synerzip.comwww.synerzip.com
Contact Me
• anil.allewar@synerzip.com
Confidential 35
www.synerzip.comwww.synerzip.com
•3684
Questions?
www.synerzip.com
Hemant Elhence
hemant@synerzip.com
469.322.0349
www.synerzip.comwww.synerzip.com
Synerzip in a Nutshell
1. Software product development partner for small/mid-sized
technology companies
• Exclusive focus on small/mid-sized technology companies, typically
venture-backed companies in growth phase
• By definition, all Synerzip work is the IP of its respective clients
• Deep experience in full SDLC – design, dev, QA/testing, deployment
2. Dedicated team of high caliber software professionals for
each client
• Seamlessly extends client’s local team, offering full transparency
• Stable teams with very low turn-over
• NOT just “staff augmentation”, but provide full mgmt support
3. Actually reduces risk of development/delivery
• Experienced team - uses appropriate level of engineering discipline
• Practices Agile development – responsive, yet disciplined
4. Reduces cost – dual-shore team, 50% cost advantage
5. Offers long term flexibility – allows (facilitates) taking
offshore team captive – aka “BOT” option
www.synerzip.comwww.synerzip.com
Our Clients
www.synerzip.comwww.synerzip.com
Call Us for a Free Consultation!
Hemant Elhence
hemant@synerzip.com
469.322.0349
Thanks!

More Related Content

PPTX
OAC - From Cloud Entry to Data Engineering to Data Science
PPTX
Office 365 – Power BI and the Data Management Gateway
PPTX
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
PPTX
SQL Analytics for Search Engineers - Timothy Potter, Lucidworksngineers
PPTX
Gateways to Power BI, Connect PowerBI.com to your On-Prem Data
PPTX
Tableau API
PPTX
Azure enterprise integration platform
PDF
Presto: Fast SQL on Everything
OAC - From Cloud Entry to Data Engineering to Data Science
Office 365 – Power BI and the Data Management Gateway
Solr Under the Hood at S&P Global- Sumit Vadhera, S&P Global
SQL Analytics for Search Engineers - Timothy Potter, Lucidworksngineers
Gateways to Power BI, Connect PowerBI.com to your On-Prem Data
Tableau API
Azure enterprise integration platform
Presto: Fast SQL on Everything

What's hot (20)

PPTX
Adobe Spark Meetup - 9/19/2018 - San Jose, CA
PPTX
ECS19 - Mike Ammerlaan - Microsoft Graph Data Connect
PDF
Accelerate Data Science Initiatives: Databricks & Privacera
PDF
5 Amazing Reasons DBAs Need to Love Extended Events
PDF
Effective AIOps with Open Source Software in a Week
PDF
Accelerate Your ML Pipeline with AutoML and MLflow
PPTX
Create Salesforce online IDE in 30 minutes
PDF
SQL Analytics Powering Telemetry Analysis at Comcast
PPTX
10 Big Data Technologies you Didn't Know About
PDF
Marketing vs Technology
PPTX
02 integrate highchart
PDF
Machine Learning Data Lineage with MLflow and Delta Lake
PPTX
Microservices in the Enterprise
PDF
Moving to Databricks & Delta
PDF
10 Things Learned Releasing Databricks Enterprise Wide
PPTX
6Reinventing Oracle Systems in a Cloudy World (Sangam20, December 2020)
PPTX
Data Management Gateway - Deep Dive
PPTX
Automated Data Synchronization: Data Loader, Data Mirror & Beyond
PPTX
Rev Your Engines: SharePoint Performance Best Practices
PPTX
Spark - Migration Story
Adobe Spark Meetup - 9/19/2018 - San Jose, CA
ECS19 - Mike Ammerlaan - Microsoft Graph Data Connect
Accelerate Data Science Initiatives: Databricks & Privacera
5 Amazing Reasons DBAs Need to Love Extended Events
Effective AIOps with Open Source Software in a Week
Accelerate Your ML Pipeline with AutoML and MLflow
Create Salesforce online IDE in 30 minutes
SQL Analytics Powering Telemetry Analysis at Comcast
10 Big Data Technologies you Didn't Know About
Marketing vs Technology
02 integrate highchart
Machine Learning Data Lineage with MLflow and Delta Lake
Microservices in the Enterprise
Moving to Databricks & Delta
10 Things Learned Releasing Databricks Enterprise Wide
6Reinventing Oracle Systems in a Cloudy World (Sangam20, December 2020)
Data Management Gateway - Deep Dive
Automated Data Synchronization: Data Loader, Data Mirror & Beyond
Rev Your Engines: SharePoint Performance Best Practices
Spark - Migration Story
Ad

Similar to Data Virtualization And Information As A Service (IaaS) (20)

PPTX
Data virtualization, Data Federation & IaaS with Jboss Teiid
PDF
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
PDF
Myth Busters II: BI Tools and Data Virtualization are Interchangeable
PDF
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
PDF
Ibm db2update2019 icp4 data
PDF
Converged Everything, Converged Infrastructure delivering business value and ...
PPTX
Linked Data as a Service
PPT
IBM Software Capabilities
PDF
How Cloud Based Market Data Enables Innovation
PDF
Enterprise Data Sources PowerPoint Presentation Slides
PDF
2020 – A Decade of Change
PPTX
Unlock the Power of Mainframe Data for Democratized Cloud Analytics
PPT
Technology Fundamentals
PPT
Technology Fundamentals
PDF
Integrating Structure and Analytics with Unstructured Data
PDF
Converged Everything, Converged Infrastructure Delivering Business Value and ...
PPTX
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
PPTX
Klarna Tech Talk - Mind the Data!
PDF
Why an AI-Powered Data Catalog Tool is Critical to Business Success
PPTX
Introducing Net App
Data virtualization, Data Federation & IaaS with Jboss Teiid
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
Myth Busters II: BI Tools and Data Virtualization are Interchangeable
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
Ibm db2update2019 icp4 data
Converged Everything, Converged Infrastructure delivering business value and ...
Linked Data as a Service
IBM Software Capabilities
How Cloud Based Market Data Enables Innovation
Enterprise Data Sources PowerPoint Presentation Slides
2020 – A Decade of Change
Unlock the Power of Mainframe Data for Democratized Cloud Analytics
Technology Fundamentals
Technology Fundamentals
Integrating Structure and Analytics with Unstructured Data
Converged Everything, Converged Infrastructure Delivering Business Value and ...
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
Klarna Tech Talk - Mind the Data!
Why an AI-Powered Data Catalog Tool is Critical to Business Success
Introducing Net App
Ad

More from Synerzip (20)

PDF
HOW VOCERA LEVERAGES SYNERZIP FOR ENHANCEMENT OF VOCERA PLATFORM & ITS USER E...
PPT
The QA/Testing Process
PPT
Test Driven Development – What Works And What Doesn’t
PDF
Distributed/Dual-Shore Agile Software Development – Is It Effective?
PPT
Using Agile Approach with Fixed Budget Projects
PDF
QA Role in Agile Teams
PDF
Agile For Mobile App Development
PDF
Using Agile in Non-Ideal Situations
PDF
Accelerating Agile Transformations - Ravi Verma
PDF
Agile Product Management Basics
PDF
Product Portfolio Kanban - by Erik Huddleston
PDF
Modern Software Practices - by Damon Poole
PPT
Context Driven Agile Leadership
PDF
Adopting TDD - by Don McGreal
PDF
Pragmatics of Agility - by Venkat Subramaniam
PPT
Cross Platform Mobile App Development
PPT
Agile2011 Conference – Key Take Aways
PPT
Performance Evaluation in Agile
PDF
Scrum And Kanban (for better agile teams)
PPT
Managing Technical Debt - by Michael Hall
HOW VOCERA LEVERAGES SYNERZIP FOR ENHANCEMENT OF VOCERA PLATFORM & ITS USER E...
The QA/Testing Process
Test Driven Development – What Works And What Doesn’t
Distributed/Dual-Shore Agile Software Development – Is It Effective?
Using Agile Approach with Fixed Budget Projects
QA Role in Agile Teams
Agile For Mobile App Development
Using Agile in Non-Ideal Situations
Accelerating Agile Transformations - Ravi Verma
Agile Product Management Basics
Product Portfolio Kanban - by Erik Huddleston
Modern Software Practices - by Damon Poole
Context Driven Agile Leadership
Adopting TDD - by Don McGreal
Pragmatics of Agility - by Venkat Subramaniam
Cross Platform Mobile App Development
Agile2011 Conference – Key Take Aways
Performance Evaluation in Agile
Scrum And Kanban (for better agile teams)
Managing Technical Debt - by Michael Hall

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
Electronic commerce courselecture one. Pdf
PDF
cuic standard and advanced reporting.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
sap open course for s4hana steps from ECC to s4
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Machine learning based COVID-19 study performance prediction
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
Empathic Computing: Creating Shared Understanding
Electronic commerce courselecture one. Pdf
cuic standard and advanced reporting.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Understanding_Digital_Forensics_Presentation.pptx
MIND Revenue Release Quarter 2 2025 Press Release
sap open course for s4hana steps from ECC to s4
The AUB Centre for AI in Media Proposal.docx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Programs and apps: productivity, graphics, security and other tools
Unlocking AI with Model Context Protocol (MCP)
Mobile App Security Testing_ A Comprehensive Guide.pdf
Big Data Technologies - Introduction.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Review of recent advances in non-invasive hemoglobin estimation
Machine learning based COVID-19 study performance prediction
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx

Data Virtualization And Information As A Service (IaaS)

Editor's Notes

  • #8: Require more than 1 copy of data for staging Creating, storing and manipulating this intermediate data can lead to errors in data quality Lead time required to add data from new sources Depends on domain knowledge of mapping entities between different data sources Batch processing – information lagging behind real time data
  • #9: Alternate approach is to move all enterprise data to a common Enterprise Information System (typically RDBMS) Extensive changes to existing applications resulting in end user impact Might not satisfy every group’s requirements – say group 1 has partitioned data but the target RDBMS doesn’t support partitioning
  • #11: Single API to access data from heterogeneous sources Only metadata stored at virtualization layer Real time access of data without copying/moving data from the source Enterprise Information System (EIS) Federate data across multiple heterogeneous/homogenous sources An enterprise information system (EIS) is any kind of information system which improves the functions of an enterprise business processes by integration. An EIS could use a database/web service/flat files or any other custom system for storing this information.
  • #17: Jboss Teiid Open Source  Supports number of relational and non relational data sources Integrated with the JBoss Application Server and JEE architecture Ability to add custom data sources using standard JEE components Very active and responsive community
  • #20: Amazon SimpleDB - web service for running queries on structured data in real time Apache Accumulo - sorted, distributed key value store Apache SOLR - search system for indexing data/services Cassandra - NoSQL database File - exposes stored procedures to leverage file system resources JPA - reverse a JPA object model into a relational model LDAP - exposes an LDAP directory tree relationally MongoDB - NoSQL database Object - reading java objects from external sources (i.e., Infinispan Cache or Map cache) OData - Consume OData web services and also act as web server to expose VDB as an OData service OLAP - online analytical processing exposing data as 3-D arrays called cubes SalesForce - CRM product SAP Netweaver Gateway - Web service calls to SAP Web Services - exposes stored procedures for calling web services
  • #38: 37
  • #39: 38
  • #40: 39