SlideShare a Scribd company logo
The Car of the Future
Autonomous, Connected and Data-Centric
DWS Melbourne, Australia 2019
Robert Hryniewicz
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Agenda
1. Autonomous Vehicles
2. Autonomous Vehicle Learning Lifecycle
3. Data Management Challenges
4. Next-Generation Data Management
5. Architecture for Success
6. Solution in Action
7. Q & A
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes HereThe Autonomous Car – It’s Happening
“Nearly all new vehicles will be capable of
full autonomy within 10 years”
Elon Musk, 2017
10 million self-driving cars will be on the
road by 2020 BI Intelligence, 2016
12 million fully autonomous vehicles by
2035 BCG, 2017
Varying predictions, but the future is clear
McKinsey, 2016
Up to 15 percent of new cars in 2030
could be fully autonomous
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes Here
DATA EXPLOSION
Annual Connected Vehicle Data to
Grow Almost 10X By 2020*
* Source: Cowen and Company, Gartner
And Autonomous Vehicles Will
Increase this by a further 25X**
** Source: Cowen and Company, Datameer
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes HereAutonomous Driving Learning Lifecycle
Training Data for Autonomous
Driving Model Development
Deploy Autonomous Driving
Model To Vehicle
In-Vehicle
Processing Unit
• Camera
• Radar
• Lidar
• GPS
• IMU
• Other
Sensors
AUTOMONOMOUS
VEHICLE
In-Vehicle
Storage Device
DATA
AUTONOMOUS
DRIVING LEARNING
LIFECYCLE
Data
Storage
2
Petabyte
Scale
Data Pre-
Processing
3
Labeling, etc.
Machine
Learning
4
Auto-
Labeling, etc.
Rules
Definition
5
Manual or
Automatic
Testing &
Simulation
6
Using
Training Data
Data
Ingestion
1
From
Vehicle
Model
Deployment
7
To
Vehicle
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes Here
COUNTRY 1
Autonomous Test Cars
INGEST1
NAS
Storage
STORE2
MOVE3
WorkstationsCOMPUTE4
Engineer Engineer
Traditional Data Management Approach
COUNTRY 2…N
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes HereData Management Challenges
New Deployment
Models
Difficulty
managing both….
Excessive Data
Movement
Movement
between…
STORE
COMPUTE
Data Volume
and Variety
NAS not
optimized
for…
VOLUME
VARIETY
Data Governance
& Security
Fragmented
management
across….
PROGRAMS
PARK ASSIST
LANE DEPARTURE
ADAPTIVE CRUISE
LEVEL 4, 5
GEOGRAPHIES
Data Intensive
Computing
Workstations &
CPUs not
optimized for ML
X
Workstation
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes HereA Better Way
SECURED &
GOVERNED
Centralized Governance
- Across Geographies
and Teams
DEEP
LEARNING
& GPU
Deep Learning
frameworks
(TensorFlow, PyTorch)
GPU Pooling/Isolation
“In Place”
Data
Processing
DON’T
MOVE
DATA
HYBRID &
MULTI
CLOUD
Unified Management
of Clusters - whether
on Cloud, On-Premise
or Hybrid
UNIVERSAL
DATA
STORAGE
Autonomous Vehicle
Research Data Lake
Infinitely Scalable (Billions
of files, Exabytes)
Low TCO
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes Here“In-Place” Data Storage
TO
ü CREATE ALGORITHM
ü STORE + COMPUTE
AUTONOMOUS
CARS
• Data stored and processed on Hadoop
• Elimination of data movement
• Massively parallel computing
• Dramatically faster computing times
FROM
ü CREATE ALGORITHM
ü STORE
ü COMPUTE
AUTONOMOUS
CARS
• Data moved to workstation
• Data processed on workstation
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes HereSmarter Decisions Made Based on
Support for Deep Learning Workloads
Why GPU support?
à Enhances the performance of computations
needed for enterprise ML/DL apps
à DL requires intense computational algorithms
à Containerized software powered by GPUs helps
data processing at scale
Result: Data Scientists can run DL models in days
vs months, hours vs days, minutes vs hours
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes HereSecurity and Governance
Full chain of custody of data across the
Hadoop ecosystem
Auditing of events for fine grained and detailed info
Tag propagation allows auditors to see where the
data is going across the enterprise & retain
context of sensitive data
Time-based policies to allow temporary access
to users
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes Here
SOLUTION IN ACTION
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes Here
lidars (front)
radar
GPS
GPS
lidar (side)
lidar (rear)
lidar
stereo
camera
remote control
antenna
2007 DARPA Urban Challenge
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes Here
lidar
stereo camera
Nvidia Jetson Tx2
”the brain”
power supply (computer)
power supply (car)
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes HereTraining workflow
HDP Cluster for Training
Model
Data (30K images+
Steering angles)
5 Servers
6 GPU cards
Dockerized TensorFlow
GPU Pooling
Zeppelin
Inference
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes Here
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Title Goes HereIn Conclusion
Autonomous
Vehicles
Coming Soon
Data-Centric Learning Based on Data!
Data Challenges
Volume, Variety, Storage, Compute,
Learning, Governance and Security
Crossing the Chasm
Data Management Innovation
Foundational to Success
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Questions?
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Thanks!
Robert Hryniewicz

More Related Content

PDF
Data in the Cloud Crash Course
PDF
Running Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
PDF
Containers and Big Data
PDF
10 Lessons Learned from Meeting with 150 Banks Across the Globe
PDF
Curing the Kafka Blindness – Streams Messaging Manager
PDF
Hadoop Operations – Past, Present, and Future
PDF
Running Enterprise Workloads with an open source Hybrid Cloud Data Architecture
PDF
Deep learning 101
Data in the Cloud Crash Course
Running Enterprise Workloads with an Open Source Hybrid Cloud Data Architecture
Containers and Big Data
10 Lessons Learned from Meeting with 150 Banks Across the Globe
Curing the Kafka Blindness – Streams Messaging Manager
Hadoop Operations – Past, Present, and Future
Running Enterprise Workloads with an open source Hybrid Cloud Data Architecture
Deep learning 101

What's hot (20)

PPTX
Overcoming the AI hype — and what enterprises should really focus on
PDF
Your Self-Driving Car - How Did it Get So Smart?
PPTX
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
PDF
Data Centric Transformation in Telecom
PDF
Deep Learning 101
PDF
Fast SQL on Hadoop, really?
PDF
Hortonworks HDP, Is it goog enough ?
PDF
Hadoop: The Unintended Benefits
PPTX
Modernize Your Existing EDW with IBM Big SQL & Hortonworks Data Platform
PDF
IoT Story: From Edge to HDP
PPTX
Containers and Big Data
PPTX
The Destiny of Data
PDF
Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshop
PDF
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
PDF
What's New in Apache Hive 3.0?
PPTX
Enabling the Real Time Analytical Enterprise
PPTX
Dancing Elephants - Efficiently Working with Object Stories from Apache Spark...
PDF
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
PPTX
Modernise your EDW - Data Lake
PDF
Containers and Big Data
Overcoming the AI hype — and what enterprises should really focus on
Your Self-Driving Car - How Did it Get So Smart?
Big Data Platform Processes Daily Healthcare Data for Clinic Use at Mayo Clinic
Data Centric Transformation in Telecom
Deep Learning 101
Fast SQL on Hadoop, really?
Hortonworks HDP, Is it goog enough ?
Hadoop: The Unintended Benefits
Modernize Your Existing EDW with IBM Big SQL & Hortonworks Data Platform
IoT Story: From Edge to HDP
Containers and Big Data
The Destiny of Data
Deep learning with Hortonworks and Apache Spark - Hortonworks technical workshop
Integrating and Analyzing Data from Multiple Manufacturing Sites using Apache...
What's New in Apache Hive 3.0?
Enabling the Real Time Analytical Enterprise
Dancing Elephants - Efficiently Working with Object Stories from Apache Spark...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Modernise your EDW - Data Lake
Containers and Big Data
Ad

Similar to The Car of the Future - Autonomous, Connected, and Data Centric (20)

PPTX
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
PPTX
Hortonworks - IBM - Cloud Event
PDF
Hortonworks - IBM Cognitive - The Future of Data Science
PDF
Powering the Future of Data  
PDF
IBM Cloud Paris meetup 20180213 - Hortonworks
PPTX
Unlocking insights in streaming data
PPTX
Data Science Crash Course
PPTX
The Elephant in the Clouds
PDF
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
PPTX
Smart Cities: An APAC Necessity
PPTX
Spark-Zeppelin-ML on HWX
PDF
Hortonworks Hybrid Cloud - Putting you back in control of your data
PPTX
The Implacable advance of the data
PDF
Apache Spark Crash Course
PDF
HDF 3.2 - What's New
PPTX
Edw Optimization Solution
PPTX
Hortonworks Open Connected Data Platforms for IoT and Predictive Big Data Ana...
PDF
Apache Hadoop Crash Course
PPTX
Couchbase & HPCC Systems – A complete mobile & data platform in the enterprise
PDF
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)
[Hortonworks] Future Of Data: Madrid - HDF & Data in motion
Hortonworks - IBM - Cloud Event
Hortonworks - IBM Cognitive - The Future of Data Science
Powering the Future of Data  
IBM Cloud Paris meetup 20180213 - Hortonworks
Unlocking insights in streaming data
Data Science Crash Course
The Elephant in the Clouds
Data in Motion - Data at Rest - Hortonworks a Modern Architecture
Smart Cities: An APAC Necessity
Spark-Zeppelin-ML on HWX
Hortonworks Hybrid Cloud - Putting you back in control of your data
The Implacable advance of the data
Apache Spark Crash Course
HDF 3.2 - What's New
Edw Optimization Solution
Hortonworks Open Connected Data Platforms for IoT and Predictive Big Data Ana...
Apache Hadoop Crash Course
Couchbase & HPCC Systems – A complete mobile & data platform in the enterprise
FOD Paris Meetup - Global Data Management with DataPlane Services (DPS)
Ad

More from DataWorks Summit (20)

PPTX
Data Science Crash Course
PPTX
Floating on a RAFT: HBase Durability with Apache Ratis
PPTX
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
PDF
HBase Tales From the Trenches - Short stories about most common HBase operati...
PPTX
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
PPTX
Managing the Dewey Decimal System
PPTX
Practical NoSQL: Accumulo's dirlist Example
PPTX
HBase Global Indexing to support large-scale data ingestion at Uber
PPTX
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
PPTX
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
PPTX
Supporting Apache HBase : Troubleshooting and Supportability Improvements
PPTX
Security Framework for Multitenant Architecture
PDF
Presto: Optimizing Performance of SQL-on-Anything Engine
PPTX
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
PPTX
Extending Twitter's Data Platform to Google Cloud
PPTX
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
PPTX
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
PPTX
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
PDF
Computer Vision: Coming to a Store Near You
PPTX
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Data Science Crash Course
Floating on a RAFT: HBase Durability with Apache Ratis
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
HBase Tales From the Trenches - Short stories about most common HBase operati...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Managing the Dewey Decimal System
Practical NoSQL: Accumulo's dirlist Example
HBase Global Indexing to support large-scale data ingestion at Uber
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Security Framework for Multitenant Architecture
Presto: Optimizing Performance of SQL-on-Anything Engine
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Extending Twitter's Data Platform to Google Cloud
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Computer Vision: Coming to a Store Near You
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark

Recently uploaded (20)

PDF
Machine learning based COVID-19 study performance prediction
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Cloud computing and distributed systems.
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
KodekX | Application Modernization Development
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPT
Teaching material agriculture food technology
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
Machine learning based COVID-19 study performance prediction
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Encapsulation_ Review paper, used for researhc scholars
Digital-Transformation-Roadmap-for-Companies.pptx
The AUB Centre for AI in Media Proposal.docx
Cloud computing and distributed systems.
Unlocking AI with Model Context Protocol (MCP)
Agricultural_Statistics_at_a_Glance_2022_0.pdf
KodekX | Application Modernization Development
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Teaching material agriculture food technology
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Advanced methodologies resolving dimensionality complications for autism neur...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Network Security Unit 5.pdf for BCA BBA.
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
NewMind AI Monthly Chronicles - July 2025
The Rise and Fall of 3GPP – Time for a Sabbatical?

The Car of the Future - Autonomous, Connected, and Data Centric

  • 1. The Car of the Future Autonomous, Connected and Data-Centric DWS Melbourne, Australia 2019 Robert Hryniewicz
  • 2. 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Agenda 1. Autonomous Vehicles 2. Autonomous Vehicle Learning Lifecycle 3. Data Management Challenges 4. Next-Generation Data Management 5. Architecture for Success 6. Solution in Action 7. Q & A
  • 3. 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes HereThe Autonomous Car – It’s Happening “Nearly all new vehicles will be capable of full autonomy within 10 years” Elon Musk, 2017 10 million self-driving cars will be on the road by 2020 BI Intelligence, 2016 12 million fully autonomous vehicles by 2035 BCG, 2017 Varying predictions, but the future is clear McKinsey, 2016 Up to 15 percent of new cars in 2030 could be fully autonomous
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes Here DATA EXPLOSION Annual Connected Vehicle Data to Grow Almost 10X By 2020* * Source: Cowen and Company, Gartner And Autonomous Vehicles Will Increase this by a further 25X** ** Source: Cowen and Company, Datameer
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes HereAutonomous Driving Learning Lifecycle Training Data for Autonomous Driving Model Development Deploy Autonomous Driving Model To Vehicle In-Vehicle Processing Unit • Camera • Radar • Lidar • GPS • IMU • Other Sensors AUTOMONOMOUS VEHICLE In-Vehicle Storage Device DATA AUTONOMOUS DRIVING LEARNING LIFECYCLE Data Storage 2 Petabyte Scale Data Pre- Processing 3 Labeling, etc. Machine Learning 4 Auto- Labeling, etc. Rules Definition 5 Manual or Automatic Testing & Simulation 6 Using Training Data Data Ingestion 1 From Vehicle Model Deployment 7 To Vehicle
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes Here COUNTRY 1 Autonomous Test Cars INGEST1 NAS Storage STORE2 MOVE3 WorkstationsCOMPUTE4 Engineer Engineer Traditional Data Management Approach COUNTRY 2…N
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes HereData Management Challenges New Deployment Models Difficulty managing both…. Excessive Data Movement Movement between… STORE COMPUTE Data Volume and Variety NAS not optimized for… VOLUME VARIETY Data Governance & Security Fragmented management across…. PROGRAMS PARK ASSIST LANE DEPARTURE ADAPTIVE CRUISE LEVEL 4, 5 GEOGRAPHIES Data Intensive Computing Workstations & CPUs not optimized for ML X Workstation
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes HereA Better Way SECURED & GOVERNED Centralized Governance - Across Geographies and Teams DEEP LEARNING & GPU Deep Learning frameworks (TensorFlow, PyTorch) GPU Pooling/Isolation “In Place” Data Processing DON’T MOVE DATA HYBRID & MULTI CLOUD Unified Management of Clusters - whether on Cloud, On-Premise or Hybrid UNIVERSAL DATA STORAGE Autonomous Vehicle Research Data Lake Infinitely Scalable (Billions of files, Exabytes) Low TCO
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes Here“In-Place” Data Storage TO ü CREATE ALGORITHM ü STORE + COMPUTE AUTONOMOUS CARS • Data stored and processed on Hadoop • Elimination of data movement • Massively parallel computing • Dramatically faster computing times FROM ü CREATE ALGORITHM ü STORE ü COMPUTE AUTONOMOUS CARS • Data moved to workstation • Data processed on workstation
  • 10. 10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes HereSmarter Decisions Made Based on Support for Deep Learning Workloads Why GPU support? à Enhances the performance of computations needed for enterprise ML/DL apps à DL requires intense computational algorithms à Containerized software powered by GPUs helps data processing at scale Result: Data Scientists can run DL models in days vs months, hours vs days, minutes vs hours
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes HereSecurity and Governance Full chain of custody of data across the Hadoop ecosystem Auditing of events for fine grained and detailed info Tag propagation allows auditors to see where the data is going across the enterprise & retain context of sensitive data Time-based policies to allow temporary access to users
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes Here SOLUTION IN ACTION
  • 13. 13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes Here lidars (front) radar GPS GPS lidar (side) lidar (rear) lidar stereo camera remote control antenna 2007 DARPA Urban Challenge
  • 14. 14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes Here lidar stereo camera Nvidia Jetson Tx2 ”the brain” power supply (computer) power supply (car)
  • 15. 15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes HereTraining workflow HDP Cluster for Training Model Data (30K images+ Steering angles) 5 Servers 6 GPU cards Dockerized TensorFlow GPU Pooling Zeppelin Inference
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes Here
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Title Goes HereIn Conclusion Autonomous Vehicles Coming Soon Data-Centric Learning Based on Data! Data Challenges Volume, Variety, Storage, Compute, Learning, Governance and Security Crossing the Chasm Data Management Innovation Foundational to Success
  • 18. 18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Questions?
  • 19. 19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Thanks! Robert Hryniewicz