Apache Kafka
Veysel Gündüzalp - Senior Software Engineer @Trendyol
twitter.com/v_gunduzalp
linkedin.com/in/vgunduzalp
Why we need Apache Kafka
- Is it a good technology?
- Hype?
- Over-Engineering?
Communication Types
- request-response
- callback
- push
- pull
- poll
Disadvantages of Request-Response
- Timeout
- Client has to wait
- Insufficient server resources
- Data is lost if the application shuts down
- Network problems
Why Apache Kafka?
- Low latency
- Active-active working
- High availability
- Scalability and extensibility
- Multiple consumers
- Async processing
- Creating language-independent applications
Broker
- A Kafka broker acts as a server or a node in a Kafka cluster, and is
responsible for managing the storage and replication of messages across
multiple nodes
- How does it differ from a database?
Topic
- A topic is a category or feed name to which producers publish messages
- Message order is guaranteed only within a partition (FIFO)
- You can't delete an individual message
- How should topics be named?
Topic Partitions
- A topic partition is a logical division of a topic's data
into multiple, ordered segments that can be
distributed across multiple brokers in a Kafka
cluster.
- Each partition is a sequence of immutable,
ordered messages that can be processed
independently from other partitions.
- Each partition can be processed independently,
allowing for parallel processing of messages
across multiple consumers.
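The parallelism described above can be sketched as a mapping from partitions to the consumers of one group. This is a simplified round-robin illustration, not Kafka's actual assignor; the function name `assign_partitions` and the consumer names are hypothetical.

```python
from collections import defaultdict

def assign_partitions(partitions, consumers):
    """Spread partition ids over consumers round-robin (illustrative only)."""
    assignment = defaultdict(list)
    for i, partition in enumerate(sorted(partitions)):
        assignment[consumers[i % len(consumers)]].append(partition)
    return dict(assignment)

# A hypothetical topic with 6 partitions read by 3 consumers in one group.
assignment = assign_partitions(range(6), ["consumer-a", "consumer-b", "consumer-c"])
print(assignment)

# With more consumers than partitions, the extra consumer gets nothing to read.
small = assign_partitions(range(2), ["consumer-a", "consumer-b", "consumer-c"])
print(small)
```

Each partition has exactly one reader in this sketch, so the partition count caps how many consumers of a group can do useful work.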
Topic Replication
- Topic replication is a key feature of Kafka that provides high
availability, fault tolerance, and scalability for message storage and
processing in distributed systems.
- The replication factor should be at least 2 (3 is recommended) and at most 4.
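To see why a higher replication factor helps availability, here is a toy model that places partition copies on brokers and checks which partitions survive broker failures. The placement rule and the names `place_replicas` and `available_partitions` are assumptions for illustration; the model ignores leader election and in-sync replica tracking.

```python
def place_replicas(num_partitions, brokers, replication_factor):
    """Put each partition's copies on consecutive brokers (toy placement rule)."""
    if replication_factor > len(brokers):
        raise ValueError("replication factor cannot exceed the number of brokers")
    return {
        p: [brokers[(p + r) % len(brokers)] for r in range(replication_factor)]
        for p in range(num_partitions)
    }

def available_partitions(placement, failed_brokers):
    """A partition stays available while at least one of its copies is on a live broker."""
    return [
        p for p, replicas in placement.items()
        if any(broker not in failed_brokers for broker in replicas)
    ]

brokers = ["b1", "b2", "b3"]
# Replication factor 3: every partition survives the loss of two brokers.
rf3 = available_partitions(place_replicas(3, brokers, 3), {"b1", "b2"})
# Replication factor 1: losing one broker takes its partition offline.
rf1 = available_partitions(place_replicas(3, brokers, 1), {"b1"})
print(rf3, rf1)
```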
Message
Message Key
- The message key is an optional
attribute that controls how
messages are partitioned and
distributed across the topic's
partitions
- Use a key when messages must stay
ordered: with the default partitioner,
messages with the same key go to the
same partition
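A minimal sketch of key-based partitioning: hash the key and take the remainder by the partition count. Kafka's default partitioner uses a murmur2 hash; `zlib.crc32` stands in for it here, and the function name `partition_for` and the sample keys are hypothetical.

```python
import zlib

def partition_for(key: bytes, num_partitions: int) -> int:
    """Map a key to a partition. crc32 is a stand-in for Kafka's murmur2 hash."""
    return zlib.crc32(key) % num_partitions

NUM_PARTITIONS = 6
events = [(b"order-1", "created"), (b"order-2", "created"), (b"order-1", "paid")]

# Every event for the same order lands in the same partition, so its order is kept.
routed = [(key, partition_for(key, NUM_PARTITIONS), event) for key, event in events]
for key, partition, event in routed:
    print(key, partition, event)
```

Because the result depends on the partition count, changing the number of partitions in this scheme changes where existing keys map.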
Message Header
Message header is a set of key-value pairs that provide metadata
and context information for a message
- X-CorrelationId
- X-AgentName
- RetryCount
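As an illustration, headers can be modeled as a small key-value map carried next to the key and value. The `Message` class and `next_retry` helper are hypothetical, and treating `RetryCount` as a counter that grows on each retry is an assumption about how such a header might be used.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

@dataclass
class Message:
    key: Optional[bytes]
    value: bytes
    headers: Dict[str, bytes] = field(default_factory=dict)

def next_retry(message: Message) -> Message:
    """Return a copy of the message with RetryCount incremented (assumed usage)."""
    count = int(message.headers.get("RetryCount", b"0").decode())
    headers = dict(message.headers)
    headers["RetryCount"] = str(count + 1).encode()
    return Message(message.key, message.value, headers)

original = Message(b"order-1", b"payload", {"X-CorrelationId": b"abc-123"})
retried = next_retry(original)
print(retried.headers)
```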
Producer
Producer is a component that sends messages to a Kafka topic
Producer Delivery Semantics
Consumer
- Consumer is an application or process
that reads messages from a Kafka topic.
- The consumer periodically polls Kafka for
new messages and processes them one at
a time
- Batch?
- Async?
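The poll loop above, including the batch question, can be sketched against an in-memory list standing in for one partition. `InMemoryConsumer` is a hypothetical stand-in, not a Kafka client; a real consumer loops indefinitely, while this one stops when the list is exhausted.

```python
class InMemoryConsumer:
    """Reads from a Python list that stands in for one partition's log."""

    def __init__(self, log):
        self.log = log
        self.position = 0  # offset of the next record to read

    def poll(self, max_records=2):
        """Return up to max_records records and advance the position."""
        batch = self.log[self.position:self.position + max_records]
        self.position += len(batch)
        return batch

consumer = InMemoryConsumer(["m0", "m1", "m2", "m3", "m4"])
processed = []
while True:
    batch = consumer.poll(max_records=2)  # one poll may return several records
    if not batch:
        break
    for record in batch:                  # records are still handled one at a time
        processed.append(record.upper())
print(processed)
```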
Consumer Delivery Semantics
At most once (0, 1)
Consumer Delivery Semantics
At least once (1, n)
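The difference between at most once and at least once comes down to whether the offset is committed before or after processing. The simulation below is a sketch under that assumption; `run` and `run_with_restart` are hypothetical helpers, and the crash is placed between the two steps for one record.

```python
def run(log, committed, commit_first, crash_at=None):
    """Process records from the committed offset; crash between commit and processing."""
    processed = []
    for offset in range(committed, len(log)):
        if commit_first:
            committed = offset + 1          # at most once: commit, then process
            if offset == crash_at:
                break                       # committed but never processed: lost
            processed.append(log[offset])
        else:
            processed.append(log[offset])   # at least once: process, then commit
            if offset == crash_at:
                break                       # processed but not committed: redelivered
            committed = offset + 1
    return processed, committed

def run_with_restart(log, commit_first, crash_at):
    """Crash once at crash_at, then restart from the last committed offset."""
    first, committed = run(log, 0, commit_first, crash_at)
    second, _ = run(log, committed, commit_first)
    return first + second

at_most_once = run_with_restart(["a", "b", "c"], commit_first=True, crash_at=1)
at_least_once = run_with_restart(["a", "b", "c"], commit_first=False, crash_at=1)
print(at_most_once)   # "b" is missing
print(at_least_once)  # "b" appears twice
```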
Consumer Delivery Semantics
Exactly once
- Consuming from a Kafka topic and producing to another topic
(as in a Kafka Streams application)
- Uses a transactional producer and consumer
- Uses a two-phase commit protocol for both producers and
consumers.
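One way to picture the consume-and-produce case is that the output record and the consumed offset become visible together or not at all. The toy model below assumes exactly that and nothing more: it is an in-memory illustration, not Kafka's transaction protocol, and `consume_transform_produce` and the `state` dictionary are hypothetical.

```python
def consume_transform_produce(input_log, state, crash_at=None):
    """Transform each record; output and offset are committed as one step (assumed atomic)."""
    for offset in range(state["offset"], len(input_log)):
        staged = input_log[offset].upper()   # work done inside the "transaction"
        if offset == crash_at:
            return False                     # crash before commit: staged work is discarded
        # Commit step: in this toy model both updates count as a single atomic action.
        state["output"].append(staged)
        state["offset"] = offset + 1
    return True

state = {"offset": 0, "output": []}
consume_transform_produce(["a", "b", "c"], state, crash_at=1)  # crashes while handling "b"
consume_transform_produce(["a", "b", "c"], state)              # restart from committed offset
print(state)
```

After the restart each input record appears in the output once, with no loss and no duplicate.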
Consumer
auto.offset.reset: earliest, latest
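A small sketch of what `auto.offset.reset` decides: where a consumer group starts reading when it has no committed offset for a partition. The function `starting_offset` and its parameters are hypothetical names for illustration.

```python
def starting_offset(committed, log_start, log_end, auto_offset_reset="latest"):
    """Pick the first offset to read for a partition."""
    if committed is not None:
        return committed          # a committed offset always wins
    if auto_offset_reset == "earliest":
        return log_start          # read everything still retained in the partition
    if auto_offset_reset == "latest":
        return log_end            # read only messages that arrive from now on
    raise ValueError(f"unsupported value in this sketch: {auto_offset_reset}")

# A partition that currently holds offsets 100..499 (next write goes to 500).
print(starting_offset(None, 100, 500, "earliest"))
print(starting_offset(None, 100, 500, "latest"))
print(starting_offset(250, 100, 500, "earliest"))
```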
Cleanup Policy (Delete-Compaction)
- Log compaction is a feature in Apache
Kafka that selectively removes older
records that share a key, keeping at
least the latest record for each key
in a topic's log
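The effect of compaction can be sketched as "keep the newest record per key". In this simplified model a record with a `None` value is treated as a tombstone that removes the key; the function `compact` is hypothetical, and details such as how long tombstones are retained are left out.

```python
def compact(log):
    """Keep only the latest record for each key; a None value deletes the key."""
    latest = {}
    for offset, (key, value) in enumerate(log):
        latest[key] = (offset, value)        # later records overwrite earlier ones
    survivors = sorted(
        (offset, key, value)
        for key, (offset, value) in latest.items()
        if value is not None                 # drop keys whose last record is a tombstone
    )
    return [(key, value) for _, key, value in survivors]

log = [
    ("user-1", "a@example.com"),
    ("user-2", "b@example.com"),
    ("user-1", "c@example.com"),  # newer value for user-1
    ("user-2", None),             # tombstone for user-2
    ("user-3", "d@example.com"),
]
compacted = compact(log)
print(compacted)
```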
What is lag?
- Lag is the gap between the newest offset in a partition and the offset a consumer group has committed
- Kafka stores and manages consumer group offsets in the __consumer_offsets topic
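Lag can be computed per partition as the log end offset minus the committed offset of the consumer group. The sketch below uses made-up numbers, a hypothetical `consumer_lag` function, and assumes a partition without a committed offset counts from 0.

```python
def consumer_lag(log_end_offsets, committed_offsets):
    """Per-partition lag: how many records the group has not yet committed."""
    return {
        partition: end - committed_offsets.get(partition, 0)
        for partition, end in log_end_offsets.items()
    }

log_end_offsets = {0: 100, 1: 50, 2: 75}
committed_offsets = {0: 90, 1: 50}   # partition 2 has no committed offset yet

lag = consumer_lag(log_end_offsets, committed_offsets)
total_lag = sum(lag.values())
print(lag, total_lag)
```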
Why is Kafka fast?
- Zero copy is a shortcut that avoids
multiple data copies between application
context and kernel context.
- This approach brings down the time
by approximately 65%.
https://twitter.com/alexxubyte/status/1506663791961919488
https://medium.com/free-code-camp/what-makes-apache-kafka-so-fast-a8d4f94ab145
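For a feel of zero copy outside Kafka, Python's `socket.sendfile` hands a file to a socket and uses the operating system's `sendfile` call where the platform supports it, so the bytes do not pass through an application buffer. This is only an analogy: the temporary file stands in for a log segment, and the payload and variable names are made up.

```python
import socket
import tempfile

payload = b"kafka-log-segment-bytes" * 40  # small stand-in for a log segment

with tempfile.TemporaryFile() as segment:
    segment.write(payload)
    segment.flush()

    reader, writer = socket.socketpair()
    with reader, writer:
        # The kernel moves file data to the socket; falls back to send() if unsupported.
        sent = writer.sendfile(segment)

        received = b""
        while len(received) < sent:
            received += reader.recv(4096)

print(sent, received == payload)
```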
Questions?
