SlideShare a Scribd company logo
New York City
9th June, 2016
Logical Data Warehouse,
Data Lakes, and Data
Services Marketplaces
Agenda1.Introductions
2.Logical Data Warehouse and Data Lakes
3.Coffee Break
4.Data Services Marketplaces
5.Q&A
Data Services Marketplace
New York City
June 2016
Agenda1.Data Services Marketplace
2.Data Services Demo
3.Addressing the Challenges
4.Customer Success Stories
5.Q&A
Data, Data, Everywhere…
• Organizations are awash with data, but…
• How do I know what data is available?
• What’s its structure?
• How do I know how good it is?
• How do I access the data?
• Data Services Marketplaces address these
questions
• Provide a mechanism for end users and
developers to find and access data
• For reports, applications, analytics, etc.
And not a drop of it to read!
5
What is a Data Services Marketplace?
A single place where consumers of data –
developers or end users – can search for, find,
and access data, that is available to them, as a
service.
6
Data Services Marketplace
7
Enterprise Apps
SQL (JDBC/ODBC), RESTful Web Services, SOAP, JMS, etc.
Operational
Systems
Analytical
Systems
Big Data External/SaaS
Systems
Virtual
Data Marts Virtual ODS
Reusable
Data Services
Metadata Scheduling & Delivery Usage Stats
Enterprise Data
Service Registry
Data Services
Layer
Enterprise Data Service Registry
• Catalog of data available to consumers
• Metadata for data ‘services’
• Format and structure of data, description of data and attributes
• Data lineage information – where does the data come from?
• Access permissions for data services
• Enforcing privacy policies for PII
• Monitoring and auditing of data usage
• Monitoring and managing QoS/SLA
• Knowing who is access data, when and how…
8
Virtual Data Services Layer
A data access layer that abstracts underlying data sources and
exposes them as discrete services to form a ‘data API’
 Different users and developers across the enterprise can access data in a
secure and managed fashion and share a common data ‘model’
 Provides secure and managed access to data across the enterprise
 Provides consistency of data
 Hides complexity, format, and location of actual data sources
 Supports many consumption protocols and patterns
Example: Single data access layer for all development teams to avoid
‘hunting down and interpreting data differently by project’
9
Data Services Layer
10
Enterprise Apps
SQL (JDBC/ODBC), RESTful Web Services, SOAP, JMS, etc.
Operational
Systems
Analytical
Systems
Big Data External/SaaS
Systems
Benefits of Data Services
• Agility
• Rapid development, service reuse, quicker time-to-value
• Data Integration
• Combine data to provide data ‘as needed’ not ‘as stored’
• Aligned with logical data models
• Data Quality
• Data consistency, common ‘model’
• Single Point of Interaction
• Users don’t need direct access to data sources, better management and
security
11
Challenges of Data Services
• Security
• How secure is the data? How is access controlled?
• Privacy
• How is PII protected? How can you audit access compliance?
• Performance/QoS
• Does the data services layer ‘get in the way’? How does it impact
performance? And QoS/SLAs?
• Data Governance and Veracity
• How do you know that the data is ‘good’?
12
13
Implementing Data Services
• Data services can be implemented using a
number of different technologies:
1. ESB/SOA
2. ETL
3. MDM
4. Data Virtualization
• Typically it will be one or more of the above
Different Technologies
14
Data Services with Data Virtualization
• Optimized for data services
• Configuration and not coding
• Rapid development and time-to-value
• Supports multiple delivery styles
• Real-time/right-time, batch/file, etc.
• Multiple protocols – SQL (JDBC/ODBC), Web Services (REST/SOAP), …
• Complements other technologies
• MDM exposed as services through data virtualization
• Combined with an ESB for process flows
The Foundation for the Data Services Marketplace
Data Services Demo
Addressing the Challenges
Challenges of Data Services
• Security & Privacy
• How secure is the data? How is access controlled?
• How is PII protected? How can you audit access compliance?
• Performance & QoS
• Does the data services layer ‘get in the way’? How does it impact
performance?
• How can we control the resources to comply with SLAs?
• Data Governance & Veracity
• How do we know that the data is ‘good’?
17
Security & Privacy
Challenges of Data Services
18
19
Security in Denodo
Overview
Authentication
• Pass-through authentication
• Kerberos and Windows SSO
• OAuth, SPNEGO
Authentication
• Standard JDBC/ODBC security
• Kerberos and Windows SSO
• Web Service security
LDAP
Active Directory
Role based Authentication
Guest, employee, corporate
Schema-wide Permissions
Data Specific Permissions
(Row, Column level, Masking)
Policy Based Security
Data in motion
• SSL/TLS
Data in motion
• SSL/TLS
Encrypted
data at rest
• Cache
• Swap
20
Security in Denodo
Data in Motion – secure channels
 Using SSL/TLS
 Client-to-Denodo and Denodo-to-source
 Available for all protocols (JDBC, ODBC, ADO.NET and WS)
 WS security: Basic, Digest, SPNEGO (Kerberos), integration with LDAP
Data at Rest – secure storage
 Cache: third party database. Can leverage its own encryption mechanism
 Swapping to disk: serialized temporarily stored in a configurable folder that can be
encrypted by the OS
Encryption/Decryption
 Support for custom decryption for files and web services
 Transparent integration with RDBMs encryption
Securing data
21
Security in Denodo
Authentication
 Native and LDAP/Active Directory based
 Support for Kerberos and Windows SSO
Authorization
 Virtual Database
 View
 Row and Column level authorization
 Masking
 Custom policies for specific security constrains and integration with external policy servers
Roles
 Integration with LDAP/AD groups
 Role hierarchies supported
Pass-through session credentials
 Leverage existing source privileges
Authentication and Authorization
Role-Based Granular Privileges
22
Security In Denodo
Advanced Selective Data Masking
23
Security In Denodo
Advanced Selective Data Masking
24
Security In Denodo
25
Custom
Policy
Conditions satisfied
Security: applies custom security
policies
• If person accessing data has role of
'Supervisor' and location is 'New
York', then show compensation
information for employees in the
New York office only.
Enforcement: rejects/filters
queries by specified criteria like
user priority, cost, time of day etc.
• If the production batch window runs
from 3 am - 6 am, there is
increased load on production
servers at this time. So, all queries
on these servers can be blocked
during this time to prevent failure of
a process.
Data consuming users, Apps
Query
Accept / add filters
Reject
Security - Custom Policies
Interception of queries before they are executed
Performance & QoS
Challenges of Data Services
26
27
Resource Manager
Apply resource restrictions based on a set of rules
 Rules classify sessions into groups
 By user, role, application, IP, time of the day, etc.
 E.g. Connections from application ‘app1’ coming from users with role
‘reporting’ are assigned to a group
 Apply restrictions for each group.
 Change priority, change concurrency settings, change max timeouts, etc
Controlled Resource Allocation
28
Resource Manager
Controlled Resource Allocation
1 Defines a rule that will be
triggered for “app1” and users
with the role “reporting”
2 For those request that fulfill the rule, if the
CPU usage is greater than 85%, will apply the
following:
• Reduce thread priority
• Reduce the number of concurrent requests
• Limit the number of queued queries
29
Performance Features
Data Provisioning Layer
Selective Materialization
Intelligent Caching of only the most relevant and often used
information
Streaming & pagination
Operate on data in streaming mode for a low memory
footprint. Paginate responses to control the size of datasets
Parallelism
Parallel access to disparate sources to minimize latency
NESTED JOINs for concurrent access to sources with
restricted query capabilities
Optimized Resource Management
Smart allocation of resources to handle high concurrency
Throttling to control and mitigate source impact
Resource plans based on rules
30
Quality of Service in Real Scenarios
• Multinational insurance & reinsurance company
• Average response time of 80-100ms
• 200+ concurrent queries
• 2 nodes – 4 cores each
• Global semiconductor chip manufacturer
• Enterprise-wide data access layer
• 200+ developers trained in Denodo
• ~50 data sources, +90 data services published
• Response times under 120ms, well in compliance with their internal SLAs
(200-300ms)
• 128+ cores in production
Data Provisioning Layer
Data Governance & Veracity
Challenges of Data Services
31
32
Enterprise Data Governance
Understand the “source of truth” and transformations of every piece of data in the
model
Data lineage
33
Enterprise Data Governance
Understand the “source of truth” and transformations of every piece of data in the
model
Data lineage
Customer Success Stories
35
DrillingInfo
• SaaS-based platform that provides business intelligence and
decision support technology
• Facilitates faster, smarter decisions for the oil and gas upstream
E&P industry
• HQs in Austin, Texas. More than 400 employees on 5 continents
• Services 3,000+ companies globally
Overview
36
DrillingInfo
Architecture
37
-Jay Heydt, Manager, Drillinginfo
As a data and business intelligence provider, one of our biggest
challenges is the need to rapidly sell the data that we acquire. The
Denodo Platform enables us to build and deliver data services to our
internal and external consumers within 3–4 hours instead of the 1–2
weeks that would take with ETL”
40
Guardian Life
• Large mutual life insurer with $7.3 billion in capital and $1.5 billion in operating
income in 2015.
• Founded in 1860, the company has paid dividends to policyholders every year
since 1868.
• ~8,000 employees and a over 3,000 financial representatives in 70+ agencies
nationwide.
• Offerings:
• Life insurance
• Disability income insurance
• Annuities
• Investments to dental, vision, and 401(k) plans.
Overview
Enterprise Data Marketplace
41
Enterprise Data Marketplace
42
Enterprise Data Marketplace
43
Enterprise Data Marketplace
44
Q&A
Thanks!
www.denodo.com info@denodo.com
© Copyright Denodo Technologies. All rights reserved
Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical,
including photocopying and microfilm, without prior the written authorization from Denodo Technologies.

More Related Content

PPTX
Databricks Platform.pptx
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r1)
PDF
Modernizing to a Cloud Data Architecture
PDF
Data Marketplace and the Role of Data Virtualization
PPTX
Modernize & Automate Analytics Data Pipelines
PDF
Introducing Databricks Delta
PDF
Moving to Databricks & Delta
PPTX
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
Databricks Platform.pptx
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Modernizing to a Cloud Data Architecture
Data Marketplace and the Role of Data Virtualization
Modernize & Automate Analytics Data Pipelines
Introducing Databricks Delta
Moving to Databricks & Delta
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...

What's hot (20)

PPTX
Big data architectures and the data lake
PPTX
Snowflake + Power BI: Cloud Analytics for Everyone
PDF
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r2)
PPTX
Zero to Snowflake Presentation
PDF
Five Things to Consider About Data Mesh and Data Governance
PDF
Introduction SQL Analytics on Lakehouse Architecture
PPTX
Introduction to Data Engineering
PDF
Enabling a Data Mesh Architecture with Data Virtualization
PPTX
Data Lakehouse Symposium | Day 4
PPTX
Free Training: How to Build a Lakehouse
PPT
MDM Strategy & Roadmap
PPT
Gartner: Master Data Management Functionality
PDF
Data Architecture Best Practices for Advanced Analytics
PDF
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
PDF
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
PDF
Building Lakehouses on Delta Lake with SQL Analytics Primer
PPTX
How to Build & Sustain a Data Governance Operating Model
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PDF
DMBOK and Data Governance
Big data architectures and the data lake
Snowflake + Power BI: Cloud Analytics for Everyone
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Zero to Snowflake Presentation
Five Things to Consider About Data Mesh and Data Governance
Introduction SQL Analytics on Lakehouse Architecture
Introduction to Data Engineering
Enabling a Data Mesh Architecture with Data Virtualization
Data Lakehouse Symposium | Day 4
Free Training: How to Build a Lakehouse
MDM Strategy & Roadmap
Gartner: Master Data Management Functionality
Data Architecture Best Practices for Advanced Analytics
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Building Lakehouses on Delta Lake with SQL Analytics Primer
How to Build & Sustain a Data Governance Operating Model
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DMBOK and Data Governance
Ad

Viewers also liked (12)

PDF
BioStorage Technologies Case Study: How to build an informatics platform usin...
PDF
Hadoop and Data Virtualization - A Case Study by VHA
PDF
Data Virtualization Modernizes Biobanking
PDF
Data-Driven is Passé: Transform Into An Insights-Driven Enterprise
PDF
Supporting Data Services Marketplace using Data Virtualization
PDF
Big Data Fabric: A Recipe for Big Data Initiatives
PDF
The 3-Speed Chief Data Officer
PDF
Start Your E-marketplace Today
PPTX
E marketplace
PDF
Building a Marketplace: A Checklist for Online Disruption
PPT
E-marketplace
PDF
A Guide to Marketplaces
BioStorage Technologies Case Study: How to build an informatics platform usin...
Hadoop and Data Virtualization - A Case Study by VHA
Data Virtualization Modernizes Biobanking
Data-Driven is Passé: Transform Into An Insights-Driven Enterprise
Supporting Data Services Marketplace using Data Virtualization
Big Data Fabric: A Recipe for Big Data Initiatives
The 3-Speed Chief Data Officer
Start Your E-marketplace Today
E marketplace
Building a Marketplace: A Checklist for Online Disruption
E-marketplace
A Guide to Marketplaces
Ad

Similar to Data Services Marketplace (20)

PDF
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
PDF
How to Achieve Self-Service Analytics with a Governed Data Services Layer (UK)
PDF
Self-Service Analytics with Guard Rails
PDF
Data Virtualization: An Essential Component of a Cloud Data Lake
PDF
Data Services and the Modern Data Ecosystem (ASEAN)
PDF
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
PDF
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
PDF
Connecting Silos in Real Time with Data Virtualization
PDF
6 Solution Patterns for Accelerating Self-Service BI, Cloud, Big Data, and Ot...
PDF
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
PDF
How to Place Data at the Center of Digital Transformation in BFSI
PDF
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
PDF
Data Lakes: A Logical Approach for Faster Unified Insights
PDF
GDPR Noncompliance: Avoid the Risk with Data Virtualization
PDF
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
PDF
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
PDF
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
PDF
Three Dimensions of Data as a Service
PDF
Introduction to Modern Data Virtualization 2021 (APAC)
Why a Data Services Marketplace is Critical for a Successful Data-Driven Ente...
How to Achieve Self-Service Analytics with a Governed Data Services Layer (UK)
Self-Service Analytics with Guard Rails
Data Virtualization: An Essential Component of a Cloud Data Lake
Data Services and the Modern Data Ecosystem (ASEAN)
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Connecting Silos in Real Time with Data Virtualization
6 Solution Patterns for Accelerating Self-Service BI, Cloud, Big Data, and Ot...
Delivering Self-Service Analytics using Big Data and Data Virtualization on t...
How to Place Data at the Center of Digital Transformation in BFSI
Empowering your Enterprise with a Self-Service Data Marketplace (EMEA)
Data Lakes: A Logical Approach for Faster Unified Insights
GDPR Noncompliance: Avoid the Risk with Data Virtualization
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
Empowering your Enterprise with a Self-Service Data Marketplace (ASEAN)
Achieving Self-Service Analytics with a Governed Data Services Layer
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Three Dimensions of Data as a Service
Introduction to Modern Data Virtualization 2021 (APAC)

More from Denodo (20)

PDF
Enterprise Monitoring and Auditing in Denodo
PDF
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
PDF
What you need to know about Generative AI and Data Management?
PDF
Mastering Data Compliance in a Dynamic Business Landscape
PDF
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
PDF
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
PDF
Drive Data Privacy Regulatory Compliance
PDF
Знакомство с виртуализацией данных для профессионалов в области данных
PDF
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
PDF
Denodo Partner Connect - Technical Webinar - Ask Me Anything
PDF
Lunch and Learn ANZ: Key Takeaways for 2023!
PDF
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
PDF
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
PDF
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
PDF
How to Build Your Data Marketplace with Data Virtualization?
PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
PDF
Enabling Data Catalog users with advanced usability
PDF
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
PDF
GenAI y el futuro de la gestión de datos: mitos y realidades
Enterprise Monitoring and Auditing in Denodo
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Achieving Self-Service Analytics with a Governed Data Services Layer
What you need to know about Generative AI and Data Management?
Mastering Data Compliance in a Dynamic Business Landscape
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Drive Data Privacy Regulatory Compliance
Знакомство с виртуализацией данных для профессионалов в области данных
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Lunch and Learn ANZ: Key Takeaways for 2023!
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
How to Build Your Data Marketplace with Data Virtualization?
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Enabling Data Catalog users with advanced usability
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
GenAI y el futuro de la gestión de datos: mitos y realidades

Recently uploaded (20)

PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Machine learning based COVID-19 study performance prediction
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Modernizing your data center with Dell and AMD
PDF
cuic standard and advanced reporting.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Cloud computing and distributed systems.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
NewMind AI Weekly Chronicles - August'25 Week I
20250228 LYD VKU AI Blended-Learning.pptx
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
The Rise and Fall of 3GPP – Time for a Sabbatical?
Spectral efficient network and resource selection model in 5G networks
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Building Integrated photovoltaic BIPV_UPV.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Machine learning based COVID-19 study performance prediction
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Diabetes mellitus diagnosis method based random forest with bat algorithm
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Modernizing your data center with Dell and AMD
cuic standard and advanced reporting.pdf
Unlocking AI with Model Context Protocol (MCP)
Review of recent advances in non-invasive hemoglobin estimation
Cloud computing and distributed systems.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
NewMind AI Monthly Chronicles - July 2025
NewMind AI Weekly Chronicles - August'25 Week I

Data Services Marketplace

  • 1. New York City 9th June, 2016 Logical Data Warehouse, Data Lakes, and Data Services Marketplaces
  • 2. Agenda1.Introductions 2.Logical Data Warehouse and Data Lakes 3.Coffee Break 4.Data Services Marketplaces 5.Q&A
  • 3. Data Services Marketplace New York City June 2016
  • 4. Agenda1.Data Services Marketplace 2.Data Services Demo 3.Addressing the Challenges 4.Customer Success Stories 5.Q&A
  • 5. Data, Data, Everywhere… • Organizations are awash with data, but… • How do I know what data is available? • What’s its structure? • How do I know how good it is? • How do I access the data? • Data Services Marketplaces address these questions • Provide a mechanism for end users and developers to find and access data • For reports, applications, analytics, etc. And not a drop of it to read! 5
  • 6. What is a Data Services Marketplace? A single place where consumers of data – developers or end users – can search for, find, and access data, that is available to them, as a service. 6
  • 7. Data Services Marketplace 7 Enterprise Apps SQL (JDBC/ODBC), RESTful Web Services, SOAP, JMS, etc. Operational Systems Analytical Systems Big Data External/SaaS Systems Virtual Data Marts Virtual ODS Reusable Data Services Metadata Scheduling & Delivery Usage Stats Enterprise Data Service Registry Data Services Layer
  • 8. Enterprise Data Service Registry • Catalog of data available to consumers • Metadata for data ‘services’ • Format and structure of data, description of data and attributes • Data lineage information – where does the data come from? • Access permissions for data services • Enforcing privacy policies for PII • Monitoring and auditing of data usage • Monitoring and managing QoS/SLA • Knowing who is access data, when and how… 8
  • 9. Virtual Data Services Layer A data access layer that abstracts underlying data sources and exposes them as discrete services to form a ‘data API’  Different users and developers across the enterprise can access data in a secure and managed fashion and share a common data ‘model’  Provides secure and managed access to data across the enterprise  Provides consistency of data  Hides complexity, format, and location of actual data sources  Supports many consumption protocols and patterns Example: Single data access layer for all development teams to avoid ‘hunting down and interpreting data differently by project’ 9
  • 10. Data Services Layer 10 Enterprise Apps SQL (JDBC/ODBC), RESTful Web Services, SOAP, JMS, etc. Operational Systems Analytical Systems Big Data External/SaaS Systems
  • 11. Benefits of Data Services • Agility • Rapid development, service reuse, quicker time-to-value • Data Integration • Combine data to provide data ‘as needed’ not ‘as stored’ • Aligned with logical data models • Data Quality • Data consistency, common ‘model’ • Single Point of Interaction • Users don’t need direct access to data sources, better management and security 11
  • 12. Challenges of Data Services • Security • How secure is the data? How is access controlled? • Privacy • How is PII protected? How can you audit access compliance? • Performance/QoS • Does the data services layer ‘get in the way’? How does it impact performance? And QoS/SLAs? • Data Governance and Veracity • How do you know that the data is ‘good’? 12
  • 13. 13 Implementing Data Services • Data services can be implemented using a number of different technologies: 1. ESB/SOA 2. ETL 3. MDM 4. Data Virtualization • Typically it will be one or more of the above Different Technologies
  • 14. 14 Data Services with Data Virtualization • Optimized for data services • Configuration and not coding • Rapid development and time-to-value • Supports multiple delivery styles • Real-time/right-time, batch/file, etc. • Multiple protocols – SQL (JDBC/ODBC), Web Services (REST/SOAP), … • Complements other technologies • MDM exposed as services through data virtualization • Combined with an ESB for process flows The Foundation for the Data Services Marketplace
  • 17. Challenges of Data Services • Security & Privacy • How secure is the data? How is access controlled? • How is PII protected? How can you audit access compliance? • Performance & QoS • Does the data services layer ‘get in the way’? How does it impact performance? • How can we control the resources to comply with SLAs? • Data Governance & Veracity • How do we know that the data is ‘good’? 17
  • 18. Security & Privacy Challenges of Data Services 18
  • 19. 19 Security in Denodo Overview Authentication • Pass-through authentication • Kerberos and Windows SSO • OAuth, SPNEGO Authentication • Standard JDBC/ODBC security • Kerberos and Windows SSO • Web Service security LDAP Active Directory Role based Authentication Guest, employee, corporate Schema-wide Permissions Data Specific Permissions (Row, Column level, Masking) Policy Based Security Data in motion • SSL/TLS Data in motion • SSL/TLS Encrypted data at rest • Cache • Swap
  • 20. 20 Security in Denodo Data in Motion – secure channels  Using SSL/TLS  Client-to-Denodo and Denodo-to-source  Available for all protocols (JDBC, ODBC, ADO.NET and WS)  WS security: Basic, Digest, SPNEGO (Kerberos), integration with LDAP Data at Rest – secure storage  Cache: third party database. Can leverage its own encryption mechanism  Swapping to disk: serialized temporarily stored in a configurable folder that can be encrypted by the OS Encryption/Decryption  Support for custom decryption for files and web services  Transparent integration with RDBMs encryption Securing data
  • 21. 21 Security in Denodo Authentication  Native and LDAP/Active Directory based  Support for Kerberos and Windows SSO Authorization  Virtual Database  View  Row and Column level authorization  Masking  Custom policies for specific security constrains and integration with external policy servers Roles  Integration with LDAP/AD groups  Role hierarchies supported Pass-through session credentials  Leverage existing source privileges Authentication and Authorization
  • 23. Advanced Selective Data Masking 23 Security In Denodo
  • 24. Advanced Selective Data Masking 24 Security In Denodo
  • 25. 25 Custom Policy Conditions satisfied Security: applies custom security policies • If person accessing data has role of 'Supervisor' and location is 'New York', then show compensation information for employees in the New York office only. Enforcement: rejects/filters queries by specified criteria like user priority, cost, time of day etc. • If the production batch window runs from 3 am - 6 am, there is increased load on production servers at this time. So, all queries on these servers can be blocked during this time to prevent failure of a process. Data consuming users, Apps Query Accept / add filters Reject Security - Custom Policies Interception of queries before they are executed
  • 26. Performance & QoS Challenges of Data Services 26
  • 27. 27 Resource Manager Apply resource restrictions based on a set of rules  Rules classify sessions into groups  By user, role, application, IP, time of the day, etc.  E.g. Connections from application ‘app1’ coming from users with role ‘reporting’ are assigned to a group  Apply restrictions for each group.  Change priority, change concurrency settings, change max timeouts, etc Controlled Resource Allocation
  • 28. 28 Resource Manager Controlled Resource Allocation 1 Defines a rule that will be triggered for “app1” and users with the role “reporting” 2 For those request that fulfill the rule, if the CPU usage is greater than 85%, will apply the following: • Reduce thread priority • Reduce the number of concurrent requests • Limit the number of queued queries
  • 29. 29 Performance Features Data Provisioning Layer Selective Materialization Intelligent Caching of only the most relevant and often used information Streaming & pagination Operate on data in streaming mode for a low memory footprint. Paginate responses to control the size of datasets Parallelism Parallel access to disparate sources to minimize latency NESTED JOINs for concurrent access to sources with restricted query capabilities Optimized Resource Management Smart allocation of resources to handle high concurrency Throttling to control and mitigate source impact Resource plans based on rules
  • 30. 30 Quality of Service in Real Scenarios • Multinational insurance & reinsurance company • Average response time of 80-100ms • 200+ concurrent queries • 2 nodes – 4 cores each • Global semiconductor chip manufacturer • Enterprise-wide data access layer • 200+ developers trained in Denodo • ~50 data sources, +90 data services published • Response times under 120ms, well in compliance with their internal SLAs (200-300ms) • 128+ cores in production Data Provisioning Layer
  • 31. Data Governance & Veracity Challenges of Data Services 31
  • 32. 32 Enterprise Data Governance Understand the “source of truth” and transformations of every piece of data in the model Data lineage
  • 33. 33 Enterprise Data Governance Understand the “source of truth” and transformations of every piece of data in the model Data lineage
  • 35. 35 DrillingInfo • SaaS-based platform that provides business intelligence and decision support technology • Facilitates faster, smarter decisions for the oil and gas upstream E&P industry • HQs in Austin, Texas. More than 400 employees on 5 continents • Services 3,000+ companies globally Overview
  • 37. 37 -Jay Heydt, Manager, Drillinginfo As a data and business intelligence provider, one of our biggest challenges is the need to rapidly sell the data that we acquire. The Denodo Platform enables us to build and deliver data services to our internal and external consumers within 3–4 hours instead of the 1–2 weeks that would take with ETL”
  • 38. 40 Guardian Life • Large mutual life insurer with $7.3 billion in capital and $1.5 billion in operating income in 2015. • Founded in 1860, the company has paid dividends to policyholders every year since 1868. • ~8,000 employees and a over 3,000 financial representatives in 70+ agencies nationwide. • Offerings: • Life insurance • Disability income insurance • Annuities • Investments to dental, vision, and 401(k) plans. Overview
  • 43. Q&A
  • 44. Thanks! www.denodo.com info@denodo.com © Copyright Denodo Technologies. All rights reserved Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm, without prior the written authorization from Denodo Technologies.