Today's Menu

What is Kafka?

Kafka basic concepts

Installing & Running Kafka

Configuration concerns

Launching a cluster

Poking around with the command line

Adding metrics

Adding topics

Top considerations for topic configuration

Simple producing and consuming

Running a simple load test

Breaking things and reviving

Creating a dashboard

Troubleshooting
Quick Kafka
Kafka Overview
“An open source, distributed, partitioned and
replicated commit-log based publish-subscribe
messaging system”
Kafka Overview

Topic: Category in which messages are published

Broker: Kafka server process (usually one per node)

Partition: Each topic is split into partitions. A partition is an
ordered, immutable sequence of messages. Each message in a partition
is assigned a sequential ID, unique within that partition, called the offset
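
To make the partition and offset idea concrete, here is a minimal sketch of a partition as an append-only log. It is a toy in-memory model, not Kafka code: the `Partition` class and its `append` and `read` methods are hypothetical names invented for illustration, and a real broker also persists, replicates and expires messages, so real offsets are not simple list indexes.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Partition:
    """Toy model of one partition: an append-only list of messages."""
    messages: List[bytes] = field(default_factory=list)

    def append(self, message: bytes) -> int:
        """Append a message and return the offset assigned to it."""
        offset = len(self.messages)  # next sequential position in the log
        self.messages.append(message)
        return offset

    def read(self, offset: int) -> bytes:
        """Return the message stored at the given offset."""
        return self.messages[offset]


partition = Partition()
first = partition.append(b"order-created")
second = partition.append(b"order-paid")
```

Messages are only ever added at the end, so a consumer can track its position with a single number per partition.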



Installing and running
Not much of an installation: just download and extract a .tgz file
Apache:
https://kafka.apache.org/downloads
OR
Confluent:
https://www.confluent.io/download/
Running:
Starting ZooKeeper: (see example in demo)
Starting Kafka:
bin/kafka-server-start.sh config/server.properties
Configuration concerns
Let's deep dive into the configuration file...
Launching a cluster
Poking around with the command line
Adding Metrics
Top considerations for topic configuration

Minimize the replication factor as much as possible to avoid extra load on the leaders

Balance the number of partitions to support parallelism

Make sure the leader count is well balanced across brokers

Time-based retention should be long enough to recover from failures

Keep spare disk space so retention can be increased if needed
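
The retention and spare-disk points can be turned into a rough sizing calculation. The sketch below is a back-of-the-envelope estimate only: the function name, the `headroom` parameter and the example numbers are assumptions made up for illustration, and it ignores compression, index files and uneven partition placement.

```python
def required_disk_bytes(
    write_bytes_per_sec: float,
    retention_hours: float,
    replication_factor: int,
    headroom: float = 0.3,
) -> float:
    """Estimate cluster-wide disk needed to hold one topic's retained data.

    headroom is a hypothetical fraction of disk kept free so that
    retention can be raised later without adding hardware.
    """
    retained = write_bytes_per_sec * retention_hours * 3600  # bytes written during the window
    replicated = retained * replication_factor               # every replica stores a full copy
    return replicated / (1.0 - headroom)                     # leave the spare fraction unused


# Made-up numbers: 10 MB/s of writes, 48 h retention, 2 replicas, half the disk kept spare.
estimate = required_disk_bytes(10e6, 48, 2, headroom=0.5)
```

With these made-up inputs the estimate comes to about 6.9 TB across the cluster, which shows how quickly the replication factor and retention window multiply disk needs.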
Live Demo

Producing and consuming

Running a simple load test

Breaking and reviving

Troubleshooting
Creating a dashboard
Kafka ops-new
Lessons learned - I

Minimize the replication factor as much as possible to avoid extra load on the leaders

Make sure the leader count is well balanced across brokers

Balance the number of partitions to support parallelism

Split clusters logically by traffic and business importance

Time-based retention should be long enough to recover from failures

Keep spare disk space so retention can be increased if needed
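
One way to check the leader-balance point is to count leaders per broker from a snapshot of partition leadership. This is a sketch under assumptions: the snapshot dictionary, the function names and the `max_spread` threshold are hypothetical, and in practice the data would have to be collected from the cluster first.

```python
from collections import Counter
from typing import Dict, List, Tuple


def leader_counts(leaders: Dict[Tuple[str, int], int], brokers: List[int]) -> Dict[int, int]:
    """Count how many partitions each broker currently leads."""
    counts = Counter(leaders.values())
    # Include brokers that lead nothing, since those are the interesting ones.
    return {broker: counts.get(broker, 0) for broker in brokers}


def is_balanced(counts: Dict[int, int], max_spread: int = 1) -> bool:
    """True when the busiest and idlest brokers differ by at most max_spread leaders."""
    return max(counts.values()) - min(counts.values()) <= max_spread


# Hypothetical snapshot: (topic, partition) -> id of the broker leading it.
snapshot = {
    ("events", 0): 1,
    ("events", 1): 1,
    ("events", 2): 1,
    ("events", 3): 2,
}
counts = leader_counts(snapshot, brokers=[1, 2, 3])
balanced = is_balanced(counts)
```

In this snapshot broker 1 leads three partitions and broker 3 leads none, so the check reports the cluster as unbalanced.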
Lessons learned - II

Make sure brokers run with an adequate file descriptor (FD) limit (we use 64K)

In AWS, consider spreading the cluster across availability zones (AZs)

Make sure clients support dynamic cluster changes

Automate partition reassignment

Save the cluster-reassignment.json of each topic for future use

Provision enough IOPS for the ZooKeeper nodes as well

Check that your client version is compatible with the message format
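
As a starting point for reassignment automation, the sketch below generates a reassignment plan in the JSON layout that, to the best of my knowledge, Kafka's kafka-reassign-partitions.sh tool accepts (a `version` field plus a list of topic, partition and replicas entries). The function name and the round-robin placement are assumptions for illustration, not the speaker's tooling, and a real plan should also consider racks or AZs and current data placement.

```python
import json
from typing import Dict, List


def build_reassignment(topic: str, partitions: int, brokers: List[int], replication_factor: int) -> Dict:
    """Spread replicas round-robin over brokers; the first replica is the preferred leader."""
    if replication_factor > len(brokers):
        raise ValueError("replication factor cannot exceed the number of brokers")
    entries = []
    for p in range(partitions):
        # Start each partition on the next broker so preferred leaders rotate.
        replicas = [brokers[(p + i) % len(brokers)] for i in range(replication_factor)]
        entries.append({"topic": topic, "partition": p, "replicas": replicas})
    return {"version": 1, "partitions": entries}


plan = build_reassignment("events", partitions=3, brokers=[1, 2, 3], replication_factor=2)
plan_json = json.dumps(plan, indent=2)  # text to save as the topic's reassignment file
```

Keeping the generated file per topic gives a record of the intended layout that can be reapplied or compared later.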
Lessons learned - III

Be careful when adopting new instances

Automate your cluster migration

Back up your messages (we use Secor)

Monitor: ISR, leader elections, iowait, network bandwidth, message count, disk usage (df)

Control broker recovery

Take public stress tests with a grain of salt
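
For the ISR item in the monitoring list, a simple check is to flag partitions whose in-sync replica set is smaller than the assigned replica set. The sketch below assumes the partition state has already been collected somehow. The `PartitionState` type and the sample data are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PartitionState:
    topic: str
    partition: int
    replicas: List[int]  # broker ids assigned to hold this partition
    isr: List[int]       # broker ids currently in sync with the leader


def under_replicated(states: List[PartitionState]) -> List[PartitionState]:
    """Return partitions whose ISR is smaller than the assigned replica set."""
    return [s for s in states if len(s.isr) < len(s.replicas)]


# Hypothetical snapshot: partition 1 has lost one of its two replicas from the ISR.
states = [
    PartitionState("events", 0, replicas=[1, 2], isr=[1, 2]),
    PartitionState("events", 1, replicas=[2, 3], isr=[2]),
]
alerts = under_replicated(states)
```

A non-empty result is worth alerting on, since it means some partitions have fewer up-to-date copies than intended.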
