3 Kafka Patterns to Deliver Streaming Machine Learning Models
Kafka Summit 2022, Europe
Andrea Spina, CTO @Radicalbit
3 Kafka patterns to deliver Streaming Machine Learning models with Andrea Spina | Kafka Summit London 2022
Introducing Streaming ML
Machine learning models that are able to update one sample at a time [1]. Traditionally, machine learning has been a batch task.
Synonyms: online, incremental, sequential.
Many algorithms already exhibit online behavior.
Benefits: low computational requirements (green AI [2]), robust to concept drift.
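To make "update one sample at a time" concrete, here is a minimal sketch of an online logistic regression in plain Python. It is a standalone toy, not code from the talk or from River; the method names learn_one and predict_proba_one only echo River's naming convention, and the learning rate and the sample stream are assumptions chosen for illustration.

```python
import math


class OnlineLogisticRegression:
    """Toy online classifier: one SGD step per incoming sample."""

    def __init__(self, lr=0.1):
        self.lr = lr          # learning rate (assumed value, tune per use case)
        self.weights = {}     # feature name -> weight, grows as features appear
        self.bias = 0.0

    def predict_proba_one(self, x):
        """Return P(y=1 | x) for a single feature dict."""
        z = self.bias + sum(self.weights.get(k, 0.0) * v for k, v in x.items())
        z = max(-35.0, min(35.0, z))  # clamp to avoid overflow in exp
        return 1.0 / (1.0 + math.exp(-z))

    def learn_one(self, x, y):
        """Update the model with one labelled sample, then discard the sample."""
        error = self.predict_proba_one(x) - y
        for k, v in x.items():
            self.weights[k] = self.weights.get(k, 0.0) - self.lr * error * v
        self.bias -= self.lr * error
        return self


# Hypothetical stream: positive x means class 1, negative x means class 0.
model = OnlineLogisticRegression(lr=0.5)
stream = [({"x": 1.0}, 1), ({"x": -1.0}, 0)] * 50
for features, label in stream:
    model.learn_one(features, label)
```

No batch is ever materialised: memory use stays constant in the number of samples seen, which is where the low computational requirements come from.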
River ML: the “sklearn project” for streaming machine learning
Seldon Core: enables k8s-based MLOps
1. Dynamic Models Serving
i.e. adding, removing, and replacing ML models at runtime
[Diagram: join of streams S and C; fields shown: model_ID, location, schema]
Performing the wrong (regular) join
Performing the right (map-side) join with a GlobalKTable
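The map-side join idea can be sketched without a broker. In Kafka Streams, a GlobalKTable is fully replicated to every application instance, so a stream record can be joined against it by a key derived from the record, with no co-partitioning or repartitioning step. The plain-Python simulation below assumes that C is a control stream carrying model metadata (model_ID, location, schema) and that each event on S names the model it targets; the field names, model IDs, and paths are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator, Optional, Tuple


@dataclass(frozen=True)
class ModelInfo:
    model_id: str
    location: str  # where the serialized model lives (hypothetical path)
    schema: str    # expected input schema (hypothetical format)


class LocalModelTable:
    """Stand-in for a GlobalKTable: a full local copy of the control stream."""

    def __init__(self) -> None:
        self._models: Dict[str, ModelInfo] = {}

    def apply(self, model_id: str, info: Optional[ModelInfo]) -> None:
        """Upsert a model; a None value acts as a tombstone and removes it."""
        if info is None:
            self._models.pop(model_id, None)
        else:
            self._models[model_id] = info

    def get(self, model_id: str) -> Optional[ModelInfo]:
        return self._models.get(model_id)


def map_side_join(
    events: Iterable[dict], table: LocalModelTable
) -> Iterator[Tuple[dict, ModelInfo]]:
    """Inner join: look each event up locally, drop events for unknown models."""
    for event in events:
        info = table.get(event["model_id"])
        if info is not None:
            yield event, info


table = LocalModelTable()
table.apply("fraud-v1", ModelInfo("fraud-v1", "models/fraud-v1.bin", "amount:float"))
table.apply("churn-v1", ModelInfo("churn-v1", "models/churn-v1.bin", "tenure:int"))
table.apply("churn-v1", None)  # model removed at runtime

events = [
    {"model_id": "fraud-v1", "amount": 12.5},
    {"model_id": "churn-v1", "tenure": 3},
]
joined = list(map_side_join(events, table))
```

Adding, removing, or replacing a model is then just another record on the control stream; the data stream's keys and partitioning are left untouched.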
2. Scoring API with Seldon Core
Made with Kafka
Seldon Core enables async scoring
Seldon offers out-of-the-box Kafka support
k8s pod: pod belonging to a SeldonDeployment instance
seldon sidecar (seldon-container-engine): it proxies all the in/out communication
Async scoring delivers a 3x scoring rate [3]
[Chart: Kafka vs REST, #KS replicas to hit max. throughput; plotted values 1, 3, 2]
Benchmark setup:
● GKE, 1 node, e2-standard-8
● Kafka Streams job, single-threaded
● DummyModel on a 1-replica SeldonDeployment
● max throughput 450 req/sec
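The asynchronous scoring flow can be illustrated with a small simulation: requests are written to an input topic without waiting for a reply, scorers consume them, and results land on an output topic keyed by request ID. The sketch below uses asyncio queues as stand-ins for Kafka topics; it does not call Seldon Core or Kafka, does not reproduce the benchmark above, and the worker count, request IDs, and dummy model are assumptions.

```python
import asyncio


async def dummy_model(features):
    """Stand-in for model inference; the sleep mimics scoring latency."""
    await asyncio.sleep(0.001)
    return sum(features)


async def scorer(in_topic, out_topic):
    """Consume requests forever, publishing (request_id, score) results."""
    while True:
        request_id, features = await in_topic.get()
        out_topic.put_nowait((request_id, await dummy_model(features)))
        in_topic.task_done()


async def score_async(requests, workers=4):
    in_topic, out_topic = asyncio.Queue(), asyncio.Queue()
    tasks = [asyncio.create_task(scorer(in_topic, out_topic)) for _ in range(workers)]
    for item in requests.items():
        in_topic.put_nowait(item)  # fire and forget: the producer never blocks on a reply
    await in_topic.join()          # wait until every request has been scored
    for task in tasks:
        task.cancel()
    await asyncio.gather(*tasks, return_exceptions=True)
    # Correlate results back to requests by ID; arrival order is not guaranteed.
    return dict(out_topic.get_nowait() for _ in range(out_topic.qsize()))


results = asyncio.run(score_async({"r1": [1.0, 2.0], "r2": [0.5, 0.5], "r3": [3.0]}))
```

Because the caller never waits on an individual response, several requests can be in flight at once, which is the general reason an asynchronous path can sustain a higher scoring rate than a blocking request/response call.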
3. Feedback API, explained
DATA STRUCTURE
● feature vector
● true class
FEEDBACK SOURCES
● applications (browser, backends)
● files
How do we handle feedback with Kafka?
Online predictor operator
From the user's perspective, it is a binary operator taking:
● the stream to be predicted, and
● a feedback stream
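A binary operator of this kind can be sketched as a single function over the interleaved events of both streams: a feedback record (feature vector plus true class) updates the model, while a record from the stream to be predicted produces an output. The code below is a plain-Python illustration rather than the operator shown in the talk; the event tagging ("predict" / "feedback") and the nearest-centroid model are assumptions picked to keep the example short.

```python
from collections import defaultdict


class OnlineCentroidClassifier:
    """Toy online model: keeps a running mean (centroid) per class."""

    def __init__(self):
        self.counts = defaultdict(int)
        self.centroids = {}

    def learn_one(self, x, y):
        self.counts[y] += 1
        centroid = self.centroids.setdefault(y, [0.0] * len(x))
        n = self.counts[y]
        for i, value in enumerate(x):
            centroid[i] += (value - centroid[i]) / n  # incremental mean update

    def predict_one(self, x):
        if not self.centroids:
            return None  # nothing learned yet
        return min(
            self.centroids,
            key=lambda y: sum((a - b) ** 2 for a, b in zip(x, self.centroids[y])),
        )


def online_predictor(events, model):
    """Binary operator over two logical streams merged in arrival order."""
    for kind, payload in events:
        if kind == "feedback":
            features, true_class = payload
            model.learn_one(features, true_class)
        elif kind == "predict":
            yield payload, model.predict_one(payload)


events = [
    ("predict", [0.0, 0.0]),                # arrives before any feedback
    ("feedback", ([0.0, 0.0], "a")),
    ("feedback", ([10.0, 10.0], "b")),
    ("predict", [1.0, 1.0]),
    ("predict", [9.0, 9.0]),
]
predictions = list(online_predictor(events, OnlineCentroidClassifier()))
```

The first prediction is None because no feedback has arrived yet; later predictions reflect whatever feedback preceded them in the merged order.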
Streaming ML with Kafka: feedback
Streaming ML with Kafka: dynamic serving
Streaming ML with Kafka: async scoring
@Spina89
andrea-spina
andrea.spina@radicalbit.io
Thank you!
References
[1] Online Machine Learning
[2] Green AI
[3] https://github.com/radicalbit/kafkasummit2022.git
