1© Cloudera, Inc. All rights reserved.
Turning Data into Business Value
with a Modern Data Platform
Alex Gutow | Cloudera
Data is Now a Strategic Asset
Instrumentation: Today, everything that can be measured will be measured.
Consumerization: Today, data IS the application.
Experimentation: Today, becoming data-driven is a business imperative.
Data is Transforming Business
Drive Customer Insights
Improve Product & Services Efficiency
Lower Business Risk
Modernize IT
Using Deeper Customer Insight to Personalize Customer Solutions
• Provides 360-degree customer view to improve marketing effectiveness and efficiency
• Enables real-time quoting of premiums based on risk
• Analyzed customer data against 210 million U.S. survey database records in 15 seconds
CUSTOMER 360
Saving Lives by Detecting Sepsis Early Enough for Successful Treatment
• Builds a more complete picture of patients, conditions, and trends
• Has saved hundreds of lives already
• Reduces hospital readmissions
• 2PB+ in multi-tenant environment supporting hundreds of clients
• Secure yet explorable
Protecting the Global Finance System
• Compiles and links billions of signals to create detailed consumer view
• Distinguishes good customers from fraudsters and reduces false positives
• Fuels new products with competitive features through new data assets
• Delivers 10X faster performance, using ⅓ the storage
CUSTOMER 360
LOWER BUSINESS RISKS
But how do you get there?
The New Analytics Paradigm
Understand why it happened
Change what happens next
Determine what happened
Make it happen consistently
Leverage Common Workload Patterns
Data Engineering & Data Science: “Programmatic data processing and machine learning”
Store and process unlimited data fast and cost-effectively.
Analytic Database: “Fast, flexible, open source parallel database”
Explore, analyze, and understand all your data.
Operational Database: “Online applications, lambda/kappa architectures”
Build data-driven applications to deliver real-time insights.
Packaged Together for Applications & Users
Data Science & Engineering: Programmatic data processing and data pipeline creation.
Analytic Database: Explore, analyze, and understand all your data.
Operational Database: Data-driven applications to deliver real-time insights.
One platform. Many applications.
Data Science & Engineering
Analytic Database
Operational Database
Drive Customer Insights
Improve Product & Services Efficiency
Lower Business Risks
MODERN DATA PLATFORM
Business Value
Technology
Use Cases
It all started with Hadoop
Evolution of the Hadoop Platform
Continually growing & adapting
Components added to the platform, by year (each year builds on the previous stack):
2006: Core Hadoop (HDFS, MapReduce)
2007: Solr, Pig
2008: HBase, ZooKeeper
2009: Hive, Mahout
2010: Sqoop, Avro
2011: Flume, Bigtop, Oozie, HCatalog, Hue, YARN
2012: Spark, Tez, Impala, Kafka, Drill
2013: Parquet, Sentry
2014: Knox, Flink
Present: Kudu, RecordService, Ibis, Falcon
2 PB of data / car / year
1–2 TB of data / day
1–5 TB of data / day
It’s all kind of meaningless unless you can make sense of all that data
Cloudera Enterprise
Making Hadoop Fast, Easy, and Secure
A new kind of data platform
• One place for unlimited data
• Unified, multi-framework data access
Cloudera makes it
• Fast for business
• Easy to manage
• Secure without compromise
Hybrid Deployment Flexibility: Public Cloud, Private Cloud, Hybrid Environments
PROCESS, ANALYZE, SERVE: BATCH, STREAM, SQL, SEARCH, OTHER
UNIFIED SERVICES: RESOURCE MANAGEMENT, SECURITY
STORE: FILESYSTEM, RELATIONAL, NoSQL, OTHER
INTEGRATE: STRUCTURED, UNSTRUCTURED
OPERATIONS
DATA MANAGEMENT
Key Capabilities
Data Engineering
Fast
• Optimized performance for machine learning & data processing
Easy
• Transient workload automation
• Hybrid manageability & resource management
Secure
• Compliance-Ready
• Fine-grained authorization of Spark/MR
Analytic Database
Fast
• High-performance SQL with multi-user concurrency
Easy
• Elasticity on-prem & in cloud
• Recommendations for offload & optimizations
Secure
• Compliance-Ready
• Data management for stewardship
Operational Database
Fast
• Real-time model serving (<15ms) with limitless concurrency
Easy
• End-to-end Lambda/real-time streaming in one platform
• Cloud & on-prem automations & BDR
Secure
• Compliance-Ready
• Unified encryption & RBAC
Partner focused. Partner engineered.
Data Systems
Applications
Operational Tools
Infrastructure
System Integration
CLOUDERA ENTERPRISE
OPERATIONS
DATA MANAGEMENT
UNIFIED SERVICES
PROCESS, ANALYZE, SERVE
STORE
INTEGRATE
Built for Today
Ready for Tomorrow
What’s Driving Hadoop to the Cloud?
Enterprise customers using cloud for big data analytics
Hadoop deployments in cloud are accelerating:
● Executive mandate: minimize on-prem datacenter footprint
● Perceived lower overall TCO
● Increased agility: end-user self-service
● Elasticity: optimize infrastructure usage
The Future is Hybrid & Multi-Cloud
76% of companies will embrace hybrid cloud¹
82% of enterprises will have a multi-cloud strategy²
¹ Gartner, Market Trends: Cloud Adoption Trends Favor Public Cloud With a Hybrid Twist, 2015
² RightScale 2016 State of the Cloud Report
Patterns of Cloud-Native Applications
Flexibility, Self-Service Models, and New Cost Dynamics
• Embrace Transience for Lower Costs
• Decoupled Storage and Compute for Elastic Scale
• Compartmentalize for Greater Isolation
[Diagrams: COMPUTE over an Object Store; SPIN UP, run 1 hr, SPIN DOWN]
Run Concurrent Workloads on Shared Data
ETL/Modeling (Data Engineering): Reduce Operating Costs
Only pay for what you need, when you need it
▪ Transient clusters
▪ Elastic workload
▪ Object storage centric
BI/Analytics (Analytic Database): New Insights, New Revenue
Explore and analyze all data, wherever it lives
▪ Transient or persistent clusters
▪ Sized to demand
▪ Local or object storage
App Delivery (Operational Database): Run Without Risk
Enterprise-grade to protect your business, no matter what
▪ Persistent production-critical clusters
▪ Periodic sync
▪ All local storage
Shared data: Cloud Object Store and Local Hadoop Storage, with backup to cloud
Cloudera in the Cloud
Elastic
Size compute and storage independently, grow and shrink clusters dynamically, and pay only for what you use on ad-hoc, transient workloads.
Hybrid/Multi-Cloud
Preserve business flexibility and data portability and minimize cloud lock-in by running in any one of the three major public cloud providers or in private cloud.
Enterprise Grade
Reduce risk with comprehensive manageability, availability, security, and governance required for production big data workloads.
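The "pay only for what you use" point under Elastic can be illustrated with simple arithmetic. The sketch below is not from the slides: the node count, the hourly rate, and the two-hours-per-day schedule are all hypothetical values chosen for illustration, and storage cost is left out entirely.

```python
# Hypothetical figures for illustration only; none of them come from the slides.
NODES = 10                    # assumed cluster size
HOURLY_RATE_PER_NODE = 0.50   # assumed price in dollars per node-hour
HOURS_PER_MONTH = 730         # approximate number of hours in a month
TRANSIENT_HOURS = 2 * 30      # assumed: a 2-hour job run once a day for 30 days


def compute_cost(nodes: int, rate: float, hours: float) -> float:
    """Compute-only cost: nodes x rate x hours the cluster is running."""
    return nodes * rate * hours


# Always-on cluster: billed for every hour of the month.
persistent = compute_cost(NODES, HOURLY_RATE_PER_NODE, HOURS_PER_MONTH)

# Transient cluster: spun up for the job, then spun down again.
transient = compute_cost(NODES, HOURLY_RATE_PER_NODE, TRANSIENT_HOURS)

print(f"persistent: ${persistent:.2f}")  # persistent: $3650.00
print(f"transient:  ${transient:.2f}")   # transient:  $300.00
```

Under these assumed numbers the transient cluster costs a fraction of the always-on one, because compute is billed only while the job runs. A real comparison would also need object storage, data transfer, and cluster startup time.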
Providing a complete view of consumer watching and buying habits
• Helps customers optimize their ad spend for greater campaign ROI
• Improves processing performance as data volumes double
• Boosts agility and flexibility and reduces risk with hybrid and multi-cloud strategy
CUSTOMER 360
DRIVE CUSTOMER INSIGHTS
Adopt an Agile Approach
Successful projects start small, fail often, and iterate to success
1. Get data you already have, or create new data. (Collect, Create, Manage unlimited data)
2. Explore and analyze, quickly. (Explore, Analyze data in many ways)
3. Deploy your application. (Operationalize insights to drive action)
…and repeat
Add: new data sources, more users, more use cases, more complex analytics, go real-time
Getting Started is Easy
① Download or Deploy in the Cloud
② Sign up for Training
③ Contact us or a Partner to Start a POC
Thanks!

