SlideShare a Scribd company logo
Realizing the Promise of
Data Lakes
webinars
Data Ninja Webinar Series
Sessions covering data virtualization solutions for driving business value
2
Data Ninja Webinars
Five webinars over the next few months…
Speakers
Head of Product Marketing
Lakshmi Randall
Agenda
1.Big Data Challenges
2.Limitations of Physical Data Lakes
3.The Governed Logical Data Lake
4.Benefits
5.Customer Case Studies
6.Q&A
5
Big Data Challenges
6
SOMEBODY BOUGHT SOMETHING
“BACK IN THE DAY”
• WE HAVE TO DEAL WITH
LEGACY
• HOMOGENEITY ISN’T
REALISTIC
• ALL DATA SOURCES ARE
NO LONGER EQUAL
© 2016 Autodesk | Enterprise Information Services 7
The Data Lake alone is not a panacea
Easy to put data
in
Wait! What about
my old data
warehouse?
Harder to access
and secure the
data
8
Duplicate sets of data in source system and data lake
 Creates data quality challenges because data must be updated in two
places.
 Expensive since the data needs to be stored and maintained twice
Governance of such large amounts of data can be challenging
 Restrictions on data access must be maintained as data is brought
into the data lake as well as on new data created within the data lake.
Data lakes themselves can become silos
 Often built for a specific department, data from the data lake must be
integrated with other enterprise data to create a complete picture
Limitations of Physical Data Lakes
8
9
Overcoming the Limitations of Physical Data Lakes
Integrate your Data Lake with other Enterprise Data Architectures
Provides a way to
access data from
separate systems
through an
abstraction layer
that makes it
appear as if the
data were in a
single data lake
Improves the
enterprise func-
tionality of data
lakes by
combining one or
more physical
data lakes with
other enterprise
data
Improves an
organization’s
ability to govern
and extract more
value from its
data lakes by
extending them
as logical data
lakes
9
Implement a Single Logical Data Lake Using Data Virtualization
Marketing
Data Lakes
Research
Logical Data Lake/Big Data Fabric
Healthcare
Self-Service
Analytics
Operational
Apps
A Single Governed Logical Data Lake
Data Virtualization combines one or more physical data lakes with other enterprise data to create a
“virtual” or “logical” data lake.
Other Data Sources
MDM Cloud Apps
BI/Analytical
Tools
Excel
Reports
DATA VIRTUALIZATION
Semantic
Model
Data
Discovery
Metadata
Catalog
Data
Governance
Denodo Platform Bridges Distinct Data Architectures
10
 Discover
 Prepare
 Curate
 Orchestrate
 Integrate
 Publish
“Rely on Data Integration
Infrastructure to make the
Data Lake Work.”
Philip Russom, Analyst, TDWI
11
Data Governance
 Data Lineage
 Structure and organization to your data lake
 Data Masking
 Data Quality
 Functions for validating, cleansing, enriching
and standardizing data.
 SDK to integrate with external DQ tools and Big
Data systems
Enterprise Access Point
 Enterprise-level access controls – table, row, column
 Authentication/Authroization
 Roles
 Audit all access
 Encryption/Decryption
 Universal Semantic Model
The Governed Data Lake
12
Data Lineage
Understand the “source of truth” and transformations of every piece of data in the
model
Selective Data Masking (Full and Partial)
13
Role-based Granular Security
14
Performance & Scalability
16
Mature Data Virtualization
Operational
Data
EDW
SQL
Integrated
Security
Other
Sources
Cache
In-memory
Fabric
Big
Data
SaaS
REST
OData
Catalog &
Data Exploration
Monitoring
Auditing
17
Performance
Denodo’s unique query optimizer
Denodo’s optimizer borrows many techniques from traditional RDBMs
 Query plans based on statistics and indexes
 Multiple JOIN methods
 Query rewriting to generate more optimal SQL
However, given the distributed execution of a query in a processing
fabric, Denodo has designed unique techniques to maximize
performance in this environment
 Dynamic rewriting focused on maximizing execution at source and reduction of
network traffic
 Cost estimates also factor-in:
 Processing power of the sources (e.g. number of nodes in a Hadoop cluster)
 Network and transfer rates
18
1. A logical data lake prevents the data lake from becoming
a silo and provides access to all the information an
organization needs to power its analytics.
2. Data virtualization improves agility in big data activities.
Users can quickly combine sources of information
without spending time installing and configuring new
databases or clusters to store the consolidated
information.
3. This ease of use encourages exploration of data since the
cost or effort to access the information is lower.
4. Data virtualization eliminates the cost of storing
information twice and the need to update information in
multiple places since information is not duplicated.
Managing Data Lakes
18
Data virtualization is a practical strategy for managing data lakes
19
Why do Enterprises need Denodo’s Big Data Fabric
to succeed?
New actionable insights with minimal effort
Information Self-service for business users Secures big data end-to-end
Real-time integrated data
across the business
Ability to aggregate, transform, cleanse,
and integrate data from multiple big data
sources, which can then be presented in
dashboards, reporting tools, and web
applications.
Allows any application, process,
dashboard, tool, or user to access
any integrated data, regardless of
where the data is physically or
logically located and regardless of
the data format.
Offers consistent, timely, and
trusted data for internal and
external users.
Enables centralized data access
and control, and supports data-at-
rest and data-in-motion security
measures.
Remediates security risks with
masking, auditing, and encryption
across the fabric.
Provides self-service data discovery
and search capabilities.
Virtual Sandboxing for Citizen Users
Benefits
21
-Source: “Predicts 2017: Data Distribution and Complexity Drive Information Infrastructure
Modernization”
By 2018, organizations with data virtualization capabilities will
spend 40% less on building and managing data integration
processes for connecting distributed data assets.”
Data Virtualization bridges the skills gap
 Developers and users don’t need
to learn special languages.
 They can leverage Denodo
graphical user interface to
model, unify and deliver the data
to multiple consumers.
Data Virtualization combines
new and legacy data sources
24
How the data goes in… How it gets back out…
Denodo’s Big data
fabric provides
easy access to big
data without
having to decipher
various data
formats.
Data Virtualization provides ease of use
25
-Source: “Forrester Wave™: Big Data Fabric Q4 2016”
Denodo’s key strength is delivering a unified and centralized
data services fabric with security and real-time integration
across multiple traditional and big data sources, including
Hadoop, NoSQL, cloud, and software-as-a-service (SaaS).”
26
-Source: “Forrester Wave™: Big Data Fabric Q4 2016”
Today, several enterprises are leveraging Denodo to support big data fabric
deployments — such as virtual big data marts, big data analytics, realtime
analytics, and iot data processing — in various vertical industries. Customers
like its easy-to-use, simple yet sophisticated data modeling capabilities,
search, and support for various big data sources.
Customer Case Studies
© 2016 Autodesk | Enterprise Information Services 28
Why Build the Logical Data Lake Data virtualization can be used
throughout your data pipeline!
29
Leading Construction Manufacturer - Telematics &
Predictive Maintenance
Dealer
Maintenance
Parts Inventory
OSI PI Hadoop Cluster
Tableau: Dealer / Customer Dashboard
30
Enrich Machine Data and Combine with Other Data
Ingest, Integrate & Deliver
Persisted
(In-memory, Hadoop)
Streams
(specific time window)
Message Queue
Machine-generated/Event data Alerts
Workflows
Operational
Processes
Analytical
Processes
Consumers
Visualization
Data Virtualization
Enrich and Combine IoT
Data with Other Data
Historians
Streams
ERP/SCM
DW
Analytical
DB
MDM
Apps
Data Marts
Hadoop NoSQL
31
Business Benefits
 Improved asset performance and proactive maintenance.
 Reduced warranty costs due to proactive maintenance of parts
preventing parts failure.
 Optimized pricing for services and parts among global service
providers.
 New Business Model opportunities based on real-time analysis
of detailed sensor data.
Q&A
Next Steps
Forrester Wave™: Big Data Fabric Q4 2016
http://www.denodo.com/en/page/forrester-wave-big-data-fabric-
q4-2016
Get Started!
Download Denodo Express: www.denodoexpress.com
Access Denodo Platform on AWS: www.denodo.com/en/denodo-
platform/denodo-platform-for-aws
Data Ninja Webinar Series
Sessions covering data virtualization solutions for driving business value
Next Series: Packed Lunch Series
January - 2017
Thanks!
www.denodo.com info@denodo.com
© Copyright Denodo Technologies. All rights reserved
Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical,
including photocopying and microfilm, without prior the written authorization from Denodo Technologies.

More Related Content

PDF
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
PPT
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r2)
PPTX
SQL Server Disaster Recovery Implementation
PPT
DW 101
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r1)
PPTX
From Hadoop to Enterprise Data Warehouse
PPTX
Technical Demonstration - Denodo Platform 7.0
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Data Lakehouse, Data Mesh, and Data Fabric (r2)
SQL Server Disaster Recovery Implementation
DW 101
Data Lakehouse, Data Mesh, and Data Fabric (r1)
From Hadoop to Enterprise Data Warehouse
Technical Demonstration - Denodo Platform 7.0

What's hot (20)

PDF
Performance Acceleration: Summaries, Recommendation, MPP and more
PDF
Company report xinglian
PDF
Enabling Cloud Data Integration (EMEA)
PPTX
Applying Big Data Superpowers to Healthcare
PDF
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
PPT
Data Federation
PDF
Accelerate Self-Service Analytics with Data Virtualization and Visualization
PDF
How to select a modern data warehouse and get the most out of it?
PPTX
2022 02 Integration Bootcamp
PDF
Cloud Storage Spring Cleaning: A Treasure Hunt
PDF
Enabling a Data Mesh Architecture with Data Virtualization
PDF
Building a Logical Data Fabric using Data Virtualization (ASEAN)
PDF
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
PDF
Data Virtualization: From Zero to Hero
PDF
Are You Killing the Benefits of Your Data Lake?
PDF
(BI Advanced) Hiram Fleitas - SQL Server Machine Learning Predict Sentiment O...
PDF
GigaOm-sector-roadmap-cloud-analytic-databases-2017
PDF
GDPR Noncompliance: Avoid the Risk with Data Virtualization
PDF
The technology of the business data lake
PDF
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Performance Acceleration: Summaries, Recommendation, MPP and more
Company report xinglian
Enabling Cloud Data Integration (EMEA)
Applying Big Data Superpowers to Healthcare
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Data Federation
Accelerate Self-Service Analytics with Data Virtualization and Visualization
How to select a modern data warehouse and get the most out of it?
2022 02 Integration Bootcamp
Cloud Storage Spring Cleaning: A Treasure Hunt
Enabling a Data Mesh Architecture with Data Virtualization
Building a Logical Data Fabric using Data Virtualization (ASEAN)
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
Data Virtualization: From Zero to Hero
Are You Killing the Benefits of Your Data Lake?
(BI Advanced) Hiram Fleitas - SQL Server Machine Learning Predict Sentiment O...
GigaOm-sector-roadmap-cloud-analytic-databases-2017
GDPR Noncompliance: Avoid the Risk with Data Virtualization
The technology of the business data lake
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Ad

Viewers also liked (9)

PDF
Data-driven teacher effectiveness: Where to begin?
PPT
Data Driven Instruction
PDF
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
PDF
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
PPTX
Practical guide to architecting data lakes - Avinash Ramineni - Phoenix Data...
PDF
Data Driven Design Research Personas
PDF
Data Lake: A simple introduction
PPTX
Christopher Penn - Build a Data-Driven Customer Journey
PDF
Utilities Digital Data Driven Innovation
Data-driven teacher effectiveness: Where to begin?
Data Driven Instruction
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
Practical guide to architecting data lakes - Avinash Ramineni - Phoenix Data...
Data Driven Design Research Personas
Data Lake: A simple introduction
Christopher Penn - Build a Data-Driven Customer Journey
Utilities Digital Data Driven Innovation
Ad

Similar to Data Ninja Webinar Series: Realizing the Promise of Data Lakes (20)

PDF
Data Virtualization: An Introduction
PPTX
Fast Data Strategy Houston Roadshow Presentation
PDF
Data Virtualization: Introduction and Business Value (UK)
PDF
Modern Data Management for Federal Modernization
PDF
Data Virtualization. An Introduction (ASEAN)
PDF
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
PDF
Why Data Mesh Needs Data Virtualization (ASEAN)
PDF
Data Virtualization: An Introduction
PDF
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
PDF
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
PDF
A Key to Real-time Insights in a Post-COVID World (ASEAN)
PDF
A Logical Architecture is Always a Flexible Architecture (ASEAN)
PDF
Introduction to Modern Data Virtualization 2021 (APAC)
PDF
Bridging the Last Mile: Getting Data to the People Who Need It
PDF
Best Practices in the Cloud for Data Management (US)
PDF
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
PDF
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
PDF
Data Virtualization: An Introduction
PDF
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
PPTX
Opportunity: Data, Analytic & Azure
Data Virtualization: An Introduction
Fast Data Strategy Houston Roadshow Presentation
Data Virtualization: Introduction and Business Value (UK)
Modern Data Management for Federal Modernization
Data Virtualization. An Introduction (ASEAN)
KASHTECH AND DENODO: ROI and Economic Value of Data Virtualization
Why Data Mesh Needs Data Virtualization (ASEAN)
Data Virtualization: An Introduction
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Logical Architecture is Always a Flexible Architecture (ASEAN)
Introduction to Modern Data Virtualization 2021 (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It
Best Practices in the Cloud for Data Management (US)
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Data Virtualization: An Introduction
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
Opportunity: Data, Analytic & Azure

More from Denodo (20)

PDF
Enterprise Monitoring and Auditing in Denodo
PDF
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
PDF
What you need to know about Generative AI and Data Management?
PDF
Mastering Data Compliance in a Dynamic Business Landscape
PDF
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
PDF
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
PDF
Drive Data Privacy Regulatory Compliance
PDF
Знакомство с виртуализацией данных для профессионалов в области данных
PDF
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
PDF
Denodo Partner Connect - Technical Webinar - Ask Me Anything
PDF
Lunch and Learn ANZ: Key Takeaways for 2023!
PDF
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
PDF
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
PDF
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
PDF
How to Build Your Data Marketplace with Data Virtualization?
PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
PDF
Enabling Data Catalog users with advanced usability
PDF
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
PDF
GenAI y el futuro de la gestión de datos: mitos y realidades
Enterprise Monitoring and Auditing in Denodo
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Achieving Self-Service Analytics with a Governed Data Services Layer
What you need to know about Generative AI and Data Management?
Mastering Data Compliance in a Dynamic Business Landscape
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Drive Data Privacy Regulatory Compliance
Знакомство с виртуализацией данных для профессионалов в области данных
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Lunch and Learn ANZ: Key Takeaways for 2023!
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
How to Build Your Data Marketplace with Data Virtualization?
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Enabling Data Catalog users with advanced usability
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
GenAI y el futuro de la gestión de datos: mitos y realidades

Recently uploaded (20)

PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPT
Quality review (1)_presentation of this 21
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PDF
Lecture1 pattern recognition............
PPTX
Business Acumen Training GuidePresentation.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
Mega Projects Data Mega Projects Data
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
Foundation of Data Science unit number two notes
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
STUDY DESIGN details- Lt Col Maksud (21).pptx
Quality review (1)_presentation of this 21
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
Lecture1 pattern recognition............
Business Acumen Training GuidePresentation.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Mega Projects Data Mega Projects Data
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Foundation of Data Science unit number two notes
1_Introduction to advance data techniques.pptx
Supervised vs unsupervised machine learning algorithms
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Clinical guidelines as a resource for EBP(1).pdf
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb

Data Ninja Webinar Series: Realizing the Promise of Data Lakes

  • 1. Realizing the Promise of Data Lakes webinars Data Ninja Webinar Series Sessions covering data virtualization solutions for driving business value
  • 2. 2 Data Ninja Webinars Five webinars over the next few months…
  • 3. Speakers Head of Product Marketing Lakshmi Randall
  • 4. Agenda 1.Big Data Challenges 2.Limitations of Physical Data Lakes 3.The Governed Logical Data Lake 4.Benefits 5.Customer Case Studies 6.Q&A
  • 6. 6 SOMEBODY BOUGHT SOMETHING “BACK IN THE DAY” • WE HAVE TO DEAL WITH LEGACY • HOMOGENEITY ISN’T REALISTIC • ALL DATA SOURCES ARE NO LONGER EQUAL
  • 7. © 2016 Autodesk | Enterprise Information Services 7 The Data Lake alone is not a panacea Easy to put data in Wait! What about my old data warehouse? Harder to access and secure the data
  • 8. 8 Duplicate sets of data in source system and data lake  Creates data quality challenges because data must be updated in two places.  Expensive since the data needs to be stored and maintained twice Governance of such large amounts of data can be challenging  Restrictions on data access must be maintained as data is brought into the data lake as well as on new data created within the data lake. Data lakes themselves can become silos  Often built for a specific department, data from the data lake must be integrated with other enterprise data to create a complete picture Limitations of Physical Data Lakes 8
  • 9. 9 Overcoming the Limitations of Physical Data Lakes Integrate your Data Lake with other Enterprise Data Architectures Provides a way to access data from separate systems through an abstraction layer that makes it appear as if the data were in a single data lake Improves the enterprise func- tionality of data lakes by combining one or more physical data lakes with other enterprise data Improves an organization’s ability to govern and extract more value from its data lakes by extending them as logical data lakes 9 Implement a Single Logical Data Lake Using Data Virtualization
  • 10. Marketing Data Lakes Research Logical Data Lake/Big Data Fabric Healthcare Self-Service Analytics Operational Apps A Single Governed Logical Data Lake Data Virtualization combines one or more physical data lakes with other enterprise data to create a “virtual” or “logical” data lake. Other Data Sources MDM Cloud Apps BI/Analytical Tools Excel Reports DATA VIRTUALIZATION Semantic Model Data Discovery Metadata Catalog Data Governance Denodo Platform Bridges Distinct Data Architectures 10  Discover  Prepare  Curate  Orchestrate  Integrate  Publish “Rely on Data Integration Infrastructure to make the Data Lake Work.” Philip Russom, Analyst, TDWI
  • 11. 11 Data Governance  Data Lineage  Structure and organization to your data lake  Data Masking  Data Quality  Functions for validating, cleansing, enriching and standardizing data.  SDK to integrate with external DQ tools and Big Data systems Enterprise Access Point  Enterprise-level access controls – table, row, column  Authentication/Authroization  Roles  Audit all access  Encryption/Decryption  Universal Semantic Model The Governed Data Lake
  • 12. 12 Data Lineage Understand the “source of truth” and transformations of every piece of data in the model
  • 13. Selective Data Masking (Full and Partial) 13
  • 17. 17 Performance Denodo’s unique query optimizer Denodo’s optimizer borrows many techniques from traditional RDBMs  Query plans based on statistics and indexes  Multiple JOIN methods  Query rewriting to generate more optimal SQL However, given the distributed execution of a query in a processing fabric, Denodo has designed unique techniques to maximize performance in this environment  Dynamic rewriting focused on maximizing execution at source and reduction of network traffic  Cost estimates also factor-in:  Processing power of the sources (e.g. number of nodes in a Hadoop cluster)  Network and transfer rates
  • 18. 18 1. A logical data lake prevents the data lake from becoming a silo and provides access to all the information an organization needs to power its analytics. 2. Data virtualization improves agility in big data activities. Users can quickly combine sources of information without spending time installing and configuring new databases or clusters to store the consolidated information. 3. This ease of use encourages exploration of data since the cost or effort to access the information is lower. 4. Data virtualization eliminates the cost of storing information twice and the need to update information in multiple places since information is not duplicated. Managing Data Lakes 18 Data virtualization is a practical strategy for managing data lakes
  • 19. 19 Why do Enterprises need Denodo’s Big Data Fabric to succeed? New actionable insights with minimal effort Information Self-service for business users Secures big data end-to-end Real-time integrated data across the business Ability to aggregate, transform, cleanse, and integrate data from multiple big data sources, which can then be presented in dashboards, reporting tools, and web applications. Allows any application, process, dashboard, tool, or user to access any integrated data, regardless of where the data is physically or logically located and regardless of the data format. Offers consistent, timely, and trusted data for internal and external users. Enables centralized data access and control, and supports data-at- rest and data-in-motion security measures. Remediates security risks with masking, auditing, and encryption across the fabric. Provides self-service data discovery and search capabilities. Virtual Sandboxing for Citizen Users
  • 21. 21 -Source: “Predicts 2017: Data Distribution and Complexity Drive Information Infrastructure Modernization” By 2018, organizations with data virtualization capabilities will spend 40% less on building and managing data integration processes for connecting distributed data assets.”
  • 22. Data Virtualization bridges the skills gap
  • 23.  Developers and users don’t need to learn special languages.  They can leverage Denodo graphical user interface to model, unify and deliver the data to multiple consumers. Data Virtualization combines new and legacy data sources
  • 24. 24 How the data goes in… How it gets back out… Denodo’s Big data fabric provides easy access to big data without having to decipher various data formats. Data Virtualization provides ease of use
  • 25. 25 -Source: “Forrester Wave™: Big Data Fabric Q4 2016” Denodo’s key strength is delivering a unified and centralized data services fabric with security and real-time integration across multiple traditional and big data sources, including Hadoop, NoSQL, cloud, and software-as-a-service (SaaS).”
  • 26. 26 -Source: “Forrester Wave™: Big Data Fabric Q4 2016” Today, several enterprises are leveraging Denodo to support big data fabric deployments — such as virtual big data marts, big data analytics, realtime analytics, and iot data processing — in various vertical industries. Customers like its easy-to-use, simple yet sophisticated data modeling capabilities, search, and support for various big data sources.
  • 28. © 2016 Autodesk | Enterprise Information Services 28 Why Build the Logical Data Lake Data virtualization can be used throughout your data pipeline!
  • 29. 29 Leading Construction Manufacturer - Telematics & Predictive Maintenance Dealer Maintenance Parts Inventory OSI PI Hadoop Cluster Tableau: Dealer / Customer Dashboard
  • 30. 30 Enrich Machine Data and Combine with Other Data Ingest, Integrate & Deliver Persisted (In-memory, Hadoop) Streams (specific time window) Message Queue Machine-generated/Event data Alerts Workflows Operational Processes Analytical Processes Consumers Visualization Data Virtualization Enrich and Combine IoT Data with Other Data Historians Streams ERP/SCM DW Analytical DB MDM Apps Data Marts Hadoop NoSQL
  • 31. 31 Business Benefits  Improved asset performance and proactive maintenance.  Reduced warranty costs due to proactive maintenance of parts preventing parts failure.  Optimized pricing for services and parts among global service providers.  New Business Model opportunities based on real-time analysis of detailed sensor data.
  • 32. Q&A
  • 33. Next Steps Forrester Wave™: Big Data Fabric Q4 2016 http://www.denodo.com/en/page/forrester-wave-big-data-fabric- q4-2016 Get Started! Download Denodo Express: www.denodoexpress.com Access Denodo Platform on AWS: www.denodo.com/en/denodo- platform/denodo-platform-for-aws
  • 34. Data Ninja Webinar Series Sessions covering data virtualization solutions for driving business value Next Series: Packed Lunch Series January - 2017
  • 35. Thanks! www.denodo.com info@denodo.com © Copyright Denodo Technologies. All rights reserved Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm, without prior the written authorization from Denodo Technologies.