SlideShare a Scribd company logo
Kafka deployment to steel thread
Sam Julian
Chief Cloud Engineer, E.On SE
MÜNCHEN - 09. OKTOBER 2018
Agenda
• The What
• Rickshaw
• Deployment
• Data connectors
What is all about?
• Configuration
• Data
• Function
Rickshaw
https://upload.wikimedia.org/wikipedia/commons/6/64/Hanoi_conducteur_de_pp.jpg
Plug & play data connection
• ready-to-use data connectors and
Libraries help to rapidly getting
started
• versatile templates for custom
data sources can fit to any data
Adaptive capacity
• storage sizes of 50GB up to
50TB, extend or downsize to fit to
your needs or save costs
• worker node counts of 1 up to
100 with a choice of node sizes,
ready to serve any data
processing needs
Self-healing & auto-patching
• worker nodes automatically apply
security patches
• probing and self-healing
capabilities for worker nodes and
deployed workers actively protect
against downtime
Distributed failover components
• distributed and redundant
setup of computing and storage
resources for increased
stability
Emergency restoration pipelines
• rebuild or update configuration in
case of an emergency recovery
scenario
• recover whole cluster
configuration, or precisely update
affected parts only for least
intrusive operation
• fast and reliable guided transport
through pipelines
Threat protection
• cloud-native threat advisory help to
detect and prevent intrusion
• secure private network setup behind
firewall with configuration and rules as
code walls off most network breaches
• code and container vulnerabilities
scans with self-updated CVE*
databases contributes to strong
governance of threats from inside
• encrypted storage provide defence
against data leak
* https://cve.mitre.org
Seamless developer experience
• improved developer productivity
with seamlessly integrated
developer tools including
repositories, CI/CD and code
quality assistance
• always up to date with latest
contemporary tool versions
Realtime traffic information
• integral dashboards to monitor
data traffic, like latency, cpu load,
storage or memory usage
• adjustable alerts keep you up to
date on important metrics and
help to prevent downtime
Rickshaw
Deployment
Deployment
• ARM-Packer
• ARM-AKS
• Helm
• Terraform
Kafka Deployment to Steel Thread
ARM-Packer by HashiCorp
ARM-AKS
AKS-GKE-Helm
• fast and customisable via the values.yaml
• official top-level CNFN project with large
community
• difficult to know exactly what the chart is
changing/deploying without
• finding the chart source code
• making sense of or rendering all the
template to k8s resource files, investigating
the docker files etc etc
• added complexity, manage and secure the Tiller
component in the cluster (local tiller
https://github.com/adamreese/helm-local or Helm
3.0 tillerless)
• another tool is still required for managing the
cluster outside of helm
Apps cluster
Control Center
Schema registry
Rest Proxy
service
Prometheus
Ingress
controller
Let’s encrypt
controller
Kubernetes Cluster
(namespace per service)
OAuth2
proxy
Rest proxy
Producers
(Kafka connect)
Confluent
replicator
Consumers
(Kafka connect)
config
and
secrets
KSQL
microservices
Kafka Deployment to Steel Thread
Kafka cluster
Zookeeper
(Stateful set,
3 or 5 instances)
Kafka
(Statefule set,
3 - n instances)
Zookeeper
service
Kafka
service
coreDNS
Prometheus
Ingress
controller
Let’s encrypt
controller
Kubernetes Cluster
(namespace cds)
Confluent auto data
rebalancer
(kubernetes cron
job)
CA (certificates for
brokers and clients)
Kafka Deployment to Steel Thread
Terraform
• cloud agnostic
• shareable modules (kafka
cluster in the box)
• template all the cloud and
supporting infra (clusters,
nsgs, dns, networking etc)
Data connectors
Replicator
• data is copied from specified topics
from Kafka 1 to Kafka 2 via the
Replicator (one or more instances)
• the topic configuration (number of
partitions, replication factor) is
preserved from source todestination
• there must be at least as many
brokers in the destination cluster as
the maximum replication factor used
• serialization and deserialization is
done via SchemaRegistry, located on
Kafka 2
HDFS to GCS
HDFS to GCS
Kafka Deployment to Steel Thread

More Related Content

PPTX
Real-Time Analytics Visualized w/ Kafka + Streamliner + MemSQL + ZoomData, An...
PDF
Give Your Confluent Platform Superpowers! (Sandeep Togrika, Intel and Bert Ha...
PDF
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
PDF
Introducing Events and Stream Processing into Nationwide Building Society (Ro...
PDF
Confluent & Attunity: Mainframe Data Modern Analytics
PDF
Supercharge Your Real-time Event Processing with Neo4j's Streams Kafka Connec...
PDF
Building Event-Driven Services with Apache Kafka
PDF
Enhancing Apache Kafka for Large Scale Real-Time Data Pipeline at Tencent | K...
Real-Time Analytics Visualized w/ Kafka + Streamliner + MemSQL + ZoomData, An...
Give Your Confluent Platform Superpowers! (Sandeep Togrika, Intel and Bert Ha...
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
Introducing Events and Stream Processing into Nationwide Building Society (Ro...
Confluent & Attunity: Mainframe Data Modern Analytics
Supercharge Your Real-time Event Processing with Neo4j's Streams Kafka Connec...
Building Event-Driven Services with Apache Kafka
Enhancing Apache Kafka for Large Scale Real-Time Data Pipeline at Tencent | K...

What's hot (20)

PDF
3 Ways to Deliver an Elastic, Cost-Effective Cloud Architecture
PDF
Streaming all over the world Real life use cases with Kafka Streams
PDF
Couchbase Cloud No Equal (Rick Jacobs, Couchbase) Kafka Summit 2020
PDF
How Yelp Leapt to Microservices with More than a Message Queue
PDF
The Bridge to Cloud (Peter Gustafsson, Confluent) London 2019 Confluent Strea...
PDF
Kafka in Context, Cloud, & Community (Simon Elliston Ball, Cloudera) Kafka Su...
PPTX
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
PDF
Set your Data in Motion with Confluent & Apache Kafka Tech Talk Series LME
PPTX
Introduction to ksqlDB and stream processing (Vish Srinivasan - Confluent)
PPTX
One Click Streaming Data Pipelines & Flows | Leveraging Kafka & Spark | Ido F...
PDF
Mind the App: How to Monitor Your Kafka Streams Applications | Bruno Cadonna,...
PDF
Data integration with Apache Kafka
PDF
Kafka & Hadoop in Rakuten
PDF
Why Kafka Works the Way It Does (And Not Some Other Way) | Tim Berglund, Conf...
PDF
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
PDF
Distributed Data Storage & Streaming for Real-time Decisioning Using Kafka, S...
PDF
Bravo Six, Going Realtime. Transitioning Activision Data Pipeline to Streamin...
PPTX
Kafka error handling patterns and best practices | Hemant Desale and Aruna Ka...
PDF
Achieving scale and performance using cloud native environment
PDF
Introduction to Apache Kafka and Confluent... and why they matter!
3 Ways to Deliver an Elastic, Cost-Effective Cloud Architecture
Streaming all over the world Real life use cases with Kafka Streams
Couchbase Cloud No Equal (Rick Jacobs, Couchbase) Kafka Summit 2020
How Yelp Leapt to Microservices with More than a Message Queue
The Bridge to Cloud (Peter Gustafsson, Confluent) London 2019 Confluent Strea...
Kafka in Context, Cloud, & Community (Simon Elliston Ball, Cloudera) Kafka Su...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
Set your Data in Motion with Confluent & Apache Kafka Tech Talk Series LME
Introduction to ksqlDB and stream processing (Vish Srinivasan - Confluent)
One Click Streaming Data Pipelines & Flows | Leveraging Kafka & Spark | Ido F...
Mind the App: How to Monitor Your Kafka Streams Applications | Bruno Cadonna,...
Data integration with Apache Kafka
Kafka & Hadoop in Rakuten
Why Kafka Works the Way It Does (And Not Some Other Way) | Tim Berglund, Conf...
Lessons from the field: Catalog of Kafka Deployments | Joseph Niemiec, Cloudera
Distributed Data Storage & Streaming for Real-time Decisioning Using Kafka, S...
Bravo Six, Going Realtime. Transitioning Activision Data Pipeline to Streamin...
Kafka error handling patterns and best practices | Hemant Desale and Aruna Ka...
Achieving scale and performance using cloud native environment
Introduction to Apache Kafka and Confluent... and why they matter!
Ad

Similar to Kafka Deployment to Steel Thread (20)

PPTX
Kubernetes at NU.nl (Kubernetes meetup 2019-09-05)
PPTX
Kubernetes Manchester - 6th December 2018
PDF
KubeCon 2019 Recap (Parts 1-3)
PDF
Kubernetes: My BFF
PDF
Elastic Kubernetes Services (EKS)
PDF
OSO Confluent GitOps Demo
PDF
kubernetes on awsjourneryssdddddddddddddd
PDF
Kuby, ActiveDeployment for Rails Apps
PDF
Running Kubernetes
PDF
Adam Hamsik - Kubernetes
PDF
Kubernetes on AWS @ Zalando Tech
PDF
5 - Hands-on Kubernetes Workshop:
PDF
Deploying on Kubernetes - An intro
PDF
AWS in Practice
PDF
Lessons learned migrating 100+ services to Kubernetes
PDF
prodops.io k8s presentation
PDF
Kubernetes/ EKS - 김광영 (AWS 솔루션즈 아키텍트)
PDF
Kubernetes - Starting with 1.2
PDF
Kubernetes @ pixel
PDF
Deploying PostgreSQL on Kubernetes
Kubernetes at NU.nl (Kubernetes meetup 2019-09-05)
Kubernetes Manchester - 6th December 2018
KubeCon 2019 Recap (Parts 1-3)
Kubernetes: My BFF
Elastic Kubernetes Services (EKS)
OSO Confluent GitOps Demo
kubernetes on awsjourneryssdddddddddddddd
Kuby, ActiveDeployment for Rails Apps
Running Kubernetes
Adam Hamsik - Kubernetes
Kubernetes on AWS @ Zalando Tech
5 - Hands-on Kubernetes Workshop:
Deploying on Kubernetes - An intro
AWS in Practice
Lessons learned migrating 100+ services to Kubernetes
prodops.io k8s presentation
Kubernetes/ EKS - 김광영 (AWS 솔루션즈 아키텍트)
Kubernetes - Starting with 1.2
Kubernetes @ pixel
Deploying PostgreSQL on Kubernetes
Ad

More from confluent (20)

PDF
Stream Processing Handson Workshop - Flink SQL Hands-on Workshop (Korean)
PPTX
Webinar Think Right - Shift Left - 19-03-2025.pptx
PDF
Migration, backup and restore made easy using Kannika
PDF
Five Things You Need to Know About Data Streaming in 2025
PDF
Data in Motion Tour Seoul 2024 - Keynote
PDF
Data in Motion Tour Seoul 2024 - Roadmap Demo
PDF
From Stream to Screen: Real-Time Data Streaming to Web Frontends with Conflue...
PDF
Confluent per il settore FSI: Accelerare l'Innovazione con il Data Streaming...
PDF
Data in Motion Tour 2024 Riyadh, Saudi Arabia
PDF
Build a Real-Time Decision Support Application for Financial Market Traders w...
PDF
Strumenti e Strategie di Stream Governance con Confluent Platform
PDF
Compose Gen-AI Apps With Real-Time Data - In Minutes, Not Weeks
PDF
Building Real-Time Gen AI Applications with SingleStore and Confluent
PDF
Unlocking value with event-driven architecture by Confluent
PDF
Il Data Streaming per un’AI real-time di nuova generazione
PDF
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
PDF
Break data silos with real-time connectivity using Confluent Cloud Connectors
PDF
Building API data products on top of your real-time data infrastructure
PDF
Speed Wins: From Kafka to APIs in Minutes
PDF
Evolving Data Governance for the Real-time Streaming and AI Era
Stream Processing Handson Workshop - Flink SQL Hands-on Workshop (Korean)
Webinar Think Right - Shift Left - 19-03-2025.pptx
Migration, backup and restore made easy using Kannika
Five Things You Need to Know About Data Streaming in 2025
Data in Motion Tour Seoul 2024 - Keynote
Data in Motion Tour Seoul 2024 - Roadmap Demo
From Stream to Screen: Real-Time Data Streaming to Web Frontends with Conflue...
Confluent per il settore FSI: Accelerare l'Innovazione con il Data Streaming...
Data in Motion Tour 2024 Riyadh, Saudi Arabia
Build a Real-Time Decision Support Application for Financial Market Traders w...
Strumenti e Strategie di Stream Governance con Confluent Platform
Compose Gen-AI Apps With Real-Time Data - In Minutes, Not Weeks
Building Real-Time Gen AI Applications with SingleStore and Confluent
Unlocking value with event-driven architecture by Confluent
Il Data Streaming per un’AI real-time di nuova generazione
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
Break data silos with real-time connectivity using Confluent Cloud Connectors
Building API data products on top of your real-time data infrastructure
Speed Wins: From Kafka to APIs in Minutes
Evolving Data Governance for the Real-time Streaming and AI Era

Recently uploaded (20)

PPT
Teaching material agriculture food technology
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Electronic commerce courselecture one. Pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Teaching material agriculture food technology
The Rise and Fall of 3GPP – Time for a Sabbatical?
sap open course for s4hana steps from ECC to s4
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Electronic commerce courselecture one. Pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Understanding_Digital_Forensics_Presentation.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Reach Out and Touch Someone: Haptics and Empathic Computing
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
NewMind AI Weekly Chronicles - August'25 Week I
The AUB Centre for AI in Media Proposal.docx
Spectral efficient network and resource selection model in 5G networks
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Review of recent advances in non-invasive hemoglobin estimation
“AI and Expert System Decision Support & Business Intelligence Systems”
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...

Kafka Deployment to Steel Thread

  • 1. Kafka deployment to steel thread Sam Julian Chief Cloud Engineer, E.On SE MÜNCHEN - 09. OKTOBER 2018
  • 2. Agenda • The What • Rickshaw • Deployment • Data connectors
  • 3. What is all about? • Configuration • Data • Function
  • 5. Plug & play data connection • ready-to-use data connectors and Libraries help to rapidly getting started • versatile templates for custom data sources can fit to any data
  • 6. Adaptive capacity • storage sizes of 50GB up to 50TB, extend or downsize to fit to your needs or save costs • worker node counts of 1 up to 100 with a choice of node sizes, ready to serve any data processing needs
  • 7. Self-healing & auto-patching • worker nodes automatically apply security patches • probing and self-healing capabilities for worker nodes and deployed workers actively protect against downtime
  • 8. Distributed failover components • distributed and redundant setup of computing and storage resources for increased stability
  • 9. Emergency restoration pipelines • rebuild or update configuration in case of an emergency recovery scenario • recover whole cluster configuration, or precisely update affected parts only for least intrusive operation • fast and reliable guided transport through pipelines
  • 10. Threat protection • cloud-native threat advisory help to detect and prevent intrusion • secure private network setup behind firewall with configuration and rules as code walls off most network breaches • code and container vulnerabilities scans with self-updated CVE* databases contributes to strong governance of threats from inside • encrypted storage provide defence against data leak * https://cve.mitre.org
  • 11. Seamless developer experience • improved developer productivity with seamlessly integrated developer tools including repositories, CI/CD and code quality assistance • always up to date with latest contemporary tool versions
  • 12. Realtime traffic information • integral dashboards to monitor data traffic, like latency, cpu load, storage or memory usage • adjustable alerts keep you up to date on important metrics and help to prevent downtime
  • 18. AKS-GKE-Helm • fast and customisable via the values.yaml • official top-level CNFN project with large community • difficult to know exactly what the chart is changing/deploying without • finding the chart source code • making sense of or rendering all the template to k8s resource files, investigating the docker files etc etc • added complexity, manage and secure the Tiller component in the cluster (local tiller https://github.com/adamreese/helm-local or Helm 3.0 tillerless) • another tool is still required for managing the cluster outside of helm
  • 19. Apps cluster Control Center Schema registry Rest Proxy service Prometheus Ingress controller Let’s encrypt controller Kubernetes Cluster (namespace per service) OAuth2 proxy Rest proxy Producers (Kafka connect) Confluent replicator Consumers (Kafka connect) config and secrets KSQL microservices
  • 21. Kafka cluster Zookeeper (Stateful set, 3 or 5 instances) Kafka (Statefule set, 3 - n instances) Zookeeper service Kafka service coreDNS Prometheus Ingress controller Let’s encrypt controller Kubernetes Cluster (namespace cds) Confluent auto data rebalancer (kubernetes cron job) CA (certificates for brokers and clients)
  • 23. Terraform • cloud agnostic • shareable modules (kafka cluster in the box) • template all the cloud and supporting infra (clusters, nsgs, dns, networking etc)
  • 25. Replicator • data is copied from specified topics from Kafka 1 to Kafka 2 via the Replicator (one or more instances) • the topic configuration (number of partitions, replication factor) is preserved from source todestination • there must be at least as many brokers in the destination cluster as the maximum replication factor used • serialization and deserialization is done via SchemaRegistry, located on Kafka 2