KAFKA +
Building the World's Realtime Transit Infrastructure
For Illustration only
Kafka + Uber- The World’s Realtime Transit Infrastructure, Aaron Schildkrout
SURGE - CIRCA 2013
SURGE - CIRCA 2016
DATA CONSUMERS
Real-time, Fast
Analytics
BATCH PIPELINE
Storm
Applications
Data Science
Analytics
Reporting
KAFKA
VERTICA
RIDER APP
DRIVER APP
API / SERVICES
DISPATCH
(gps logs)
Mapping &
Logistic Ad-hoc exploration
ELK
Samza
Alerts,
Dashboards
Debugging
REAL-TIME PIPELINE
HADOOP
Surge Mobile App
DATA
PRODUCERS
KAFKA 8 ECOSYSTEM @UBER
Product
Features
Predictive
Models
Operational
Analytics
Business
Intelligence
INFRASTRUCTURE ECOSYSTEM
NEAR REALTIME
PRICE SURGING
PRODUCT FEATURES
FRAUD -
ANOMALY
DETECTION
PREDICTIVE MODELS
PREDICTIVE MODELS
ETA
OPERATIONAL ANALYTICS
UberEATs
OPERATIONAL ANALYTICS
XP
OPERATIONAL ANALYTICS
BUSINESS INTELLIGENCE
KAFKA 8
KAFKA 7 MIGRATOR
Limited Availability
Difficult to Scale
Not multi-DC
Multi-lang incompatibility
Multi-DC, multi-language support
2013 2014 2015 - 2016
KAFKA 7 WORLD
Difficult to Operate
Producer Scale Issues
High Availability
High Scalability
Kafka 7 + Mirrormaker
Deployed everywhere
Kafka 7 migrator
Deployed everywhere
New Kafka 8
pipeline
Kafka 7
Mirrormaker 2.0
REST architecture
Data Audit
Automated Topic Mgmt
Logs Business events
Async REST library
Data Audit
Local spooling
High throughput
custom protocol
REST ARCHITECTURE
Rest Proxy
Automated Schema and Topic Management
Mirrormaker 2.0
Robust
Data Audit
Dynamic topics
MIRROR MAKER 2.0
Destination DC
Source DC
Msg counts across multiple DCs
End-to-end latencies across multiple DCs
DATA AUDIT FOR KAFKA MESSAGES
Mirrormaker 2.0
REST architecture
Data Audit
Kafka 8
Automated Topic Mgmt
A ROBUST FUTURE
0 data loss messaging system
Data discovery and lineage
Quota management
Self-correcting brokers
Active active data pipelines
Real-time Data
Dynamic SQL(ish)
Real-time decision
THE FUTURE
Real-time Data
Custom Application
Real-time decision
THE PRESENT
TELEMATICS
SELF DRIVING CAR
Thank you, Kafka Community!


Editor's Notes

  • #2: Duration: Keynote is 15 mins long. Good morning! My name is Aaron Schildkrout. I run Data and Marketing at Uber. I’m here today to talk to you about our realtime journey at Uber - and particularly the critical and hugely empowering role Kafka (including Confluent and the whole Kafka community) has played in this journey.
  • #3: Uber is realtime transit infrastructure for the globe. We’ve stated many times that we want this infrastructure to be as reliable as running water. A utility. A right even. A project that started out as a cool app to get you black cars on demand - is quickly becoming among the largest global infrastructure inventions of all time. And - like the cars moving on the streets outside right now - it is all taking place now and now and now. It is real time.
  • #4: We’re not the only ones. The internet is quite literally penetrating our lives: our cities, our relationships, our bodies. This is a known story. But it’s getting more radical by the day. And as this penetration increases - in volume, in immediacy, in depth - there is an unbelievable increase in the need for systems that facilitate the flow of information, in real time, between our lives and our machines and back again. That’s why we’re all here.
  • #5: Compressing time and space - is...a non-trivial technical problem. Uber, for instance, has always sought to provide this kind of truly responsive, realtime infrastructure. But in the beginning we were...just starting. This is surge circa 2012/3 in our driver app. Our first version of surge, v1, used data it queried directly from our dispatch service. There was only one Node.js process per city. The geofences were very big and not granular at all (causing a lot of problems and huge inefficiency).
  • #6: This is surge today - with the addition of much more granular geo-temporal surge targeting. We are updating - in real-time - our understanding of supply and demand in highly specific geographies to allow us to calculate surge in the hexagons shown in this screen. This system now runs on Kafka - as opposed to our janky node query - and while it took us a bit of time to make this truly work at our exponentially exploding global scale...we’ve gotten...at least closer. That’s the story I’ll tell today.
  • #7: To get the obvious architectural diagram out of the way - here’s how Kafka 8 is currently used @ Uber.
  • #8: The Real-time infrastructure ecosystem - which includes Kafka - at Uber powers many key pieces of our business. I think of this in this topology...
  • #9: Surge - as noted earlier..
  • #10: FRAUD MODELS
  • #11: ETA - real-time system
  • #12: Cities use real-time operational analytics to actively manage their cities - making adjustments in dispatch, messaging, etc. - to optimize city functioning. Much of Uber’s success has to do with the amazing speed and agility of our on-the-ground global city teams - and much of this comes from empowering them with realtime tools.
  • #13: We’ve recently applied this same type of infrastructure to our Uber Eats business, which is rapidly scaling now and involves significant operational complexity.
  • #14: Internally, analytics on our experimentation pipeline - which now powers the creation of hundreds of new experiments weekly, and on which our teams act daily based on rapid data feedback loops - is a real-time system.
  • #16: Pretty awesome. But it took a long journey to get there. 2013 - we first launched Kafka 7; each application essentially ran its own Kafka cluster. 2014 - we started a transition to K8, moving all our K7 data to K8 through the K7 migrator. 2015 to today - we deployed a fully functional K8 pipeline - stable, with scalable producers and consumers and multi-DC, multi-language support.
  • #17: Along the way we ran into some significant limitations…and we did a bunch of work that I’ll work through now to complete our migration to Kafka 8 - and, more fundamentally, to make Kafka work at our scale.
  • #18: We implemented REST proxy improvements, adding a new binary protocol for high throughput. By building REST client libraries, we facilitated multi-language support (which was important given our 4-language environment)
  • #19: We automated schema and topic management. In a world with many thousands of topics and hundreds of engineers and teams producing data, the absence of strong tools around schema inference, enforcement and management was a huge pain point.
  • #20: We built Mirrormaker 2.0, which we’ll soon be open sourcing… It’s more robust, easier to operate, and allows for dynamic topic addition.
  • #21: And… we built a series of data auditing tools - allowing us to track data loss and latency spikes at different points in the Kafka pipeline, which at scale became critical for triaging and solving problems at a rapid pace.
  • #22: All Kafka data producers at Uber are now running Kafka 8. The project has been a huge success and is now powering much of Uber’s data infrastructure. It is...mission critical.
  • #24: The goal is to shrink the barrier between real time Infra and analytical usage.
  • #25: We’re currently capturing accelerometer data from the driver’s / rider’s phone via Kafka. This data is then used for: detecting traffic / road conditions (need to confirm); 1) we use our motionstash data to generate safety models and safety scores for all our drivers (supervised machine learning and classification algorithms); 2) we do per-trip ad hoc analysis for safety by computing safety scores per driver, using the models generated in 1) to predict in realtime and alert a driver about their unsafe driving.
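
The slides for the REST architecture list an async REST library with local spooling. The deck does not show how that library works, so the following is only a minimal sketch of the general idea: buffer messages locally and retry delivery when the proxy is unreachable. The class name `SpoolingProducer`, the `send` callable and the `max_spool` limit are hypothetical names introduced for illustration, and the spool here lives in memory, whereas a production client would more likely spool to disk.

```python
from collections import deque


class SpoolingProducer:
    """Hypothetical client: spools messages locally until the proxy accepts them."""

    def __init__(self, send, max_spool=1000):
        # `send` is an assumed callable(topic, payload) -> bool that posts one
        # message to a REST proxy and returns True when it was accepted.
        self._send = send
        # Bounded spool: once full, the OLDEST message is dropped (data loss),
        # which is a simplification chosen for this sketch.
        self._spool = deque(maxlen=max_spool)

    def produce(self, topic, payload):
        """Queue a message, then try to drain the spool in order."""
        self._spool.append((topic, payload))
        self.flush()

    def flush(self):
        """Deliver spooled messages oldest-first; stop at the first failure."""
        delivered = 0
        while self._spool:
            topic, payload = self._spool[0]
            try:
                ok = self._send(topic, payload)
            except OSError:
                ok = False  # treat network errors as "proxy unavailable"
            if not ok:
                break
            self._spool.popleft()
            delivered += 1
        return delivered

    def pending(self):
        """Number of messages still waiting in the local spool."""
        return len(self._spool)
```

Draining strictly from the head of the spool preserves per-producer ordering, at the cost of one slow message blocking everything behind it.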
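
The data audit slide mentions message counts and end-to-end latencies across multiple DCs, and note #21 describes tracking data loss and latency spikes at different points in the pipeline. The deck gives no implementation details, so the following is only a sketch of one way such a check could work, assuming each tier records the producer's event time and its own observation time for every message. `AuditRecord`, `BUCKET_SECONDS`, `audit` and `max_latency` are hypothetical names, not Uber's tooling.

```python
from collections import Counter
from dataclasses import dataclass

BUCKET_SECONDS = 60  # assumed audit window size


@dataclass(frozen=True)
class AuditRecord:
    topic: str
    event_time: float  # seconds since epoch, stamped by the producer
    seen_time: float   # seconds since epoch, when this tier saw the message


def bucket_counts(records):
    """Count messages per (topic, window), bucketing by producer event time."""
    counts = Counter()
    for r in records:
        bucket = int(r.event_time // BUCKET_SECONDS) * BUCKET_SECONDS
        counts[(r.topic, bucket)] += 1
    return counts


def audit(source_records, destination_records):
    """Return {(topic, window): missing_count} for windows whose counts differ."""
    src = bucket_counts(source_records)
    dst = bucket_counts(destination_records)
    report = {}
    for key, sent in src.items():
        received = dst.get(key, 0)
        if received != sent:
            report[key] = sent - received
    return report


def max_latency(records):
    """Largest gap between producer event time and observation at this tier."""
    return max((r.seen_time - r.event_time for r in records), default=0.0)
```

Bucketing by the producer's event time rather than arrival time means the same message lands in the same window at every tier, so counts are directly comparable between the source and destination DCs.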