DATA PIPELINE AND DATA LAKE IN
AUTONOMOUS DRIVING
YU HUANG
SUNNYVALE, CALIFORNIA
YU.HUANG07@GMAIL.COM
OUTLINE
• TESLA
• GOOGLE WAYMO
• PLUSAI
• ALIBABA CLOUD
• NVIDIA
• NETAPP
• AMAZON-AWS
• AMAZON-TRI
• AMAZON-MOMENTA
• ECKERSON DATAOPS
• IBM
Tesla
Nvidia
NetApp
AWS Reference Architecture
Autonomous Driving Data Lake
Build an MDF4/Rosbag-based data ingestion and processing pipeline for Autonomous Driving and Advanced Driver Assistance Systems (ADAS).
© 2020, Amazon Web Services, Inc. or its affiliates. All rights reserved.
1. Ingest data from autonomous fleet with AWS Outposts for local data processing.
2. Ingest vehicle telemetry data in real time using AWS IoT Core and Amazon Kinesis Data Firehose.
3. Remove and transform low quality data.
4. Schedule the extract, transform, load (ETL) jobs using Apache Airflow.
5. Enrich data with weather conditions based on GPS location and timestamp.
6. Extract metadata using ASAM OpenSCENARIO and store data in Amazon DynamoDB and Amazon Elasticsearch Service.
7. Store data lineage in Amazon Neptune and catalog data using AWS Glue Data Catalog.
8. Process drive data and perform deep signal validation.
9. Perform automated labeling using Amazon SageMaker Ground Truth.
10. Provide a search function for particular scenarios using AWS AppSync.
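Three of the steps above (removing low quality data, enriching with weather by GPS location and timestamp, and searching for particular scenarios) can be sketched in plain Python. This is only an illustration under stated assumptions: the `Frame` record, its field names, the quality thresholds, and the `fake_weather` lookup are all hypothetical and are not part of the AWS reference architecture, which uses managed services for these steps.

```python
from dataclasses import dataclass, replace
from datetime import datetime, timezone
from typing import Callable, Iterable, List, Optional


@dataclass(frozen=True)
class Frame:
    """One telemetry sample; the field names are illustrative assumptions."""
    timestamp: datetime
    lat: float
    lon: float
    speed_mps: Optional[float]  # None models a dropped or corrupt signal
    weather: Optional[str] = None


def drop_low_quality(frames: Iterable[Frame]) -> List[Frame]:
    """Remove frames with a missing or negative speed or an impossible position."""
    return [
        f for f in frames
        if f.speed_mps is not None
        and f.speed_mps >= 0.0
        and -90.0 <= f.lat <= 90.0
        and -180.0 <= f.lon <= 180.0
    ]


def enrich_with_weather(
    frames: Iterable[Frame],
    lookup: Callable[[float, float, datetime], str],
) -> List[Frame]:
    """Attach a weather label looked up by GPS position and timestamp."""
    return [replace(f, weather=lookup(f.lat, f.lon, f.timestamp)) for f in frames]


def find_scenarios(frames: Iterable[Frame], weather: str) -> List[Frame]:
    """Trivial in-memory scenario search over enriched frames."""
    return [f for f in frames if f.weather == weather]


def fake_weather(lat: float, lon: float, ts: datetime) -> str:
    """Hypothetical stand-in for a weather service: 'rain' before noon UTC."""
    return "rain" if ts.hour < 12 else "clear"


raw = [
    Frame(datetime(2020, 6, 1, 9, 0, tzinfo=timezone.utc), 37.37, -122.04, 12.5),
    Frame(datetime(2020, 6, 1, 9, 1, tzinfo=timezone.utc), 37.37, -122.04, None),
    Frame(datetime(2020, 6, 1, 15, 0, tzinfo=timezone.utc), 95.0, -122.04, 8.0),
    Frame(datetime(2020, 6, 1, 16, 0, tzinfo=timezone.utc), 37.38, -122.03, 3.0),
]

clean = drop_low_quality(raw)          # drops the None-speed and lat=95 frames
enriched = enrich_with_weather(clean, fake_weather)
rainy = find_scenarios(enriched, "rain")
```

Passing the weather lookup in as a function keeps the enrichment step testable without a network call; a real pipeline would swap in a client for whichever weather data source it uses.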
Radar and video data processing in MDF4 format achieves the highest scalability by leveraging AWS Fargate for Amazon ECS.
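One plausible reading of this scalability claim is that each recording can be handled as an independent job, so capacity grows by running more jobs side by side. The sketch below illustrates that fan-out pattern only. It uses a local thread pool as a stand-in for container tasks and calls no AWS API; the `process_recording` job and the file names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List


def process_recording(name: str) -> Dict[str, str]:
    """Hypothetical per-file job; a real task would decode MDF4 radar/video data."""
    return {"recording": name, "status": "processed"}


def process_all(names: List[str], workers: int = 4) -> List[Dict[str, str]]:
    """Run one independent job per recording; results keep the input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(process_recording, names))


recordings = ["drive_001.mf4", "drive_002.mf4", "drive_003.mf4"]
results = process_all(recordings)
```

Because the jobs share no state, the same structure maps onto one container task per recording, which is what lets the work scale horizontally.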
Autonomous driving at AWS
TRI
Momenta
Next Generation Data Pipelines
Data Pipeline Complexity
Eckerson DataOps
Analytic Models Failure to Launch
Eckerson DataOps
Streaming Data Pipelines
Eckerson DataOps
Requirements for Autonomous Navigation
Eckerson DataOps
DataOps for Model and Pipeline Development and Operationalization
Continuous Integration / Continuous Delivery (CI/CD)
Eckerson DataOps
Components and Complexities of Analytics Ecosystems
Eckerson DataOps
IBM Enterprise Data Pipeline Marketecture
IBM Autonomous driving development use cases
Thanks
