SlideShare a Scribd company logo
Apache Kafka
Degendra Sivakoti
MSc IT | Islington College
Apache Kafka
Apache Kafka® is an open sourced distributed streaming platform.
History
- Developed by LinkedIn / Microsoft
- Open sourced in early 2011
- Donated to Apache Software Foundation on 2012
- Latest release and stable version 2.5.0
- Written in Scala/ Java
Why Kafka?
Apache kafka
Apache kafka
Apache kafka
Major Features
- Kafka as Messaging System
- Kafka for Stream Processing
Core Components
- Zookeeper
- Core dependency as of version 2.5.0
- Broker
- Kafka server
- Producer
- Produces message to topic partition
- Consumer
- Listens to topic partition
- Topic
- Partition
Topic
- A topic can have one or more partitions
- Topics are always multi-subscriber
- Partitions (logs):
- One leader and zero or more followers
- Distributed
Producer
- Produce message to a topic
- Select topic partition
- Normally, uses round-robin technique to balance load
Consumer
- Consumer listens to one or more topics
- Consumer belongs to a consumer group
- Only on consumer from a consumer group
can consume message from a topic
partition.
- Rebalancing
- More consumers in a group than partition
means idle consumers
Example
Kafka and BigData
- Process existing big data
- Sitting on HDFS
- Sitting in a database
- How does new data get into your cluster?
- New log from web servers
- New sensor data from IoT systems
- New stock trades
- Use for data ingestion in Hadoop system
Kafka and Hadoop Ecosystem
Questions?
Thank you!

More Related Content

PPTX
Schema registry
PDF
Kafka as Message Broker
PPT
An introduction to Apache Camel
PPTX
Mule 4 vanrish
PPSX
Apache kafka introduction
PDF
Spring Integration: from XML to Java DSL
PDF
intro-kafka
PPTX
Real time Messages at Scale with Apache Kafka and Couchbase
Schema registry
Kafka as Message Broker
An introduction to Apache Camel
Mule 4 vanrish
Apache kafka introduction
Spring Integration: from XML to Java DSL
intro-kafka
Real time Messages at Scale with Apache Kafka and Couchbase

What's hot (20)

PDF
Apache Kafka Introduction
PPTX
Mule Hyderabad Meetup (Mule 4)
PDF
Building Kafka-powered Activity Stream
PDF
Apache kafka
PPTX
Koha System Architecture
PPTX
Efficient Schemas in Motion with Kafka and Schema Registry
PDF
Secure Kafka at Salesforce.com
PPTX
Introduction to Kafka
PDF
SpringPeople Introduction to JAVA Web Services
PDF
Apache Kafka - Free Friday
ODP
Design considerations in MULE ESB
PPTX
Kafka connect
PPTX
Building an Event Bus at Scale
PPT
Mule overview
PPTX
Introduction to Total Library Solution- TLS
PDF
Introduction To Perl - SpringPeople
PDF
Apache ActiveMQ
PPTX
File connector
PPTX
Mule esb
PDF
Mule ESB Fundamentals
Apache Kafka Introduction
Mule Hyderabad Meetup (Mule 4)
Building Kafka-powered Activity Stream
Apache kafka
Koha System Architecture
Efficient Schemas in Motion with Kafka and Schema Registry
Secure Kafka at Salesforce.com
Introduction to Kafka
SpringPeople Introduction to JAVA Web Services
Apache Kafka - Free Friday
Design considerations in MULE ESB
Kafka connect
Building an Event Bus at Scale
Mule overview
Introduction to Total Library Solution- TLS
Introduction To Perl - SpringPeople
Apache ActiveMQ
File connector
Mule esb
Mule ESB Fundamentals
Ad

Similar to Apache kafka (20)

PPTX
Intoduction to Apache Kafka
PPTX
Session 23 - Kafka and Zookeeper
PPTX
Kafka and ibm event streams basics
PPTX
Understanding kafka
PPTX
Columbus mule soft_meetup_aug2021_Kafka_Integration
PPTX
kafka_session1_basics_1.pptx kafka_session1_basics_1.pptx
PPTX
Kafka Presentation.pptx
PPTX
Kafka Presentation.pptx
PDF
Introduction_to_Kafka - A brief Overview.pdf
PDF
Unlocking the Power of Apache Kafka: How Kafka Listeners Facilitate Real-time...
PPTX
Kafka overview
PPTX
kafka_session_updated.pptx
PDF
PDF
Kafka 10000 feet view
PPTX
Kafka.pptx (uploaded from MyFiles SomnathDeb_PC)
PPTX
Copy of Kafka-Camus
PPTX
Apache kafka
PPTX
Real time Analytics with Apache Kafka and Apache Spark
PDF
Apache Kafka - Strakin Technologies Pvt Ltd
PDF
Python Kafka Integration: Developers Guide
Intoduction to Apache Kafka
Session 23 - Kafka and Zookeeper
Kafka and ibm event streams basics
Understanding kafka
Columbus mule soft_meetup_aug2021_Kafka_Integration
kafka_session1_basics_1.pptx kafka_session1_basics_1.pptx
Kafka Presentation.pptx
Kafka Presentation.pptx
Introduction_to_Kafka - A brief Overview.pdf
Unlocking the Power of Apache Kafka: How Kafka Listeners Facilitate Real-time...
Kafka overview
kafka_session_updated.pptx
Kafka 10000 feet view
Kafka.pptx (uploaded from MyFiles SomnathDeb_PC)
Copy of Kafka-Camus
Apache kafka
Real time Analytics with Apache Kafka and Apache Spark
Apache Kafka - Strakin Technologies Pvt Ltd
Python Kafka Integration: Developers Guide
Ad

Recently uploaded (20)

PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Electronic commerce courselecture one. Pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
Tartificialntelligence_presentation.pptx
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Spectroscopy.pptx food analysis technology
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
1. Introduction to Computer Programming.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Electronic commerce courselecture one. Pdf
MYSQL Presentation for SQL database connectivity
Network Security Unit 5.pdf for BCA BBA.
Assigned Numbers - 2025 - Bluetooth® Document
Tartificialntelligence_presentation.pptx
SOPHOS-XG Firewall Administrator PPT.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Empathic Computing: Creating Shared Understanding
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Spectroscopy.pptx food analysis technology
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
1. Introduction to Computer Programming.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Encapsulation_ Review paper, used for researhc scholars
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf

Apache kafka

  • 1. Apache Kafka Degendra Sivakoti MSc IT | Islington College
  • 2. Apache Kafka Apache Kafka® is an open sourced distributed streaming platform.
  • 3. History - Developed by LinkedIn / Microsoft - Open sourced in early 2011 - Donated to Apache Software Foundation on 2012 - Latest release and stable version 2.5.0 - Written in Scala/ Java
  • 8. Major Features - Kafka as Messaging System - Kafka for Stream Processing
  • 9. Core Components - Zookeeper - Core dependency as of version 2.5.0 - Broker - Kafka server - Producer - Produces message to topic partition - Consumer - Listens to topic partition - Topic - Partition
  • 10. Topic - A topic can have one or more partitions - Topics are always multi-subscriber - Partitions (logs): - One leader and zero or more followers - Distributed
  • 11. Producer - Produce message to a topic - Select topic partition - Normally, uses round-robin technique to balance load
  • 12. Consumer - Consumer listens to one or more topics - Consumer belongs to a consumer group - Only on consumer from a consumer group can consume message from a topic partition. - Rebalancing - More consumers in a group than partition means idle consumers
  • 14. Kafka and BigData - Process existing big data - Sitting on HDFS - Sitting in a database - How does new data get into your cluster? - New log from web servers - New sensor data from IoT systems - New stock trades - Use for data ingestion in Hadoop system
  • 15. Kafka and Hadoop Ecosystem

Editor's Notes

  • #6: Objective
  • #7: Problematic Solution
  • #8: Solution