SlideShare a Scribd company logo
Learn Kafka and event-driven
architecture
Vinay Kumar
@vinaykuma201
• ORACLE ACE
• Global Integration Architect
• Author of “Beginning Oracle WebCenter portal 12c”
• Blogger- http://www.techartifact.com/blogs
• https://www.linkedin.com/in/vinaykumar2/
2
About me
Kafka and event driven architecture -apacoug20
Agenda
4
• Interaction Style
• Traditional SOA approach
• Event driven Architecture
• Event with Microservices
• Event streaming with event Hub
• Integration with legacy.
• Kafka
• Kafka internals
• Async APIs
Communication Styles
5
Type of Interaction Initiator Participants
Time-driven Time The specifice system
Request-driven Client Client & Server
Event-driven event Open-ended
Time Driven
6
Store System
Run in every 30 min
to check the
inventory
Request-driven
7
Event-driven
8
Store System
“Inventory updated”
“Inventory is low”
9Document Title - Name - Function - Business Unit DD/MM/YYYY
Event Notification
• Anything happened (or didnt happen).
• A change in the state.
• An event is always named in the past tense and is immutabled
• A condition that triggers a notification.
CustomerAddressChanged
InventoryUpdated
SalesOrderCreated
PurschaseOrderCreated
10
Events
• “Real-time” events as they happen at the producer
• Push notifications
• One-way “fire-and-forget”
• Immediate action at the consumers
• Informational (“someone logged in”), not commands (“audit this”)
11
Characterstics of Events
Typical EDA Architecture
Event Bus
System SystemSystem
System System System
Event Producers
Event Transport
Event Consumer
• Supports the business demands for better service (no
batch, less waiting)
• No point-to-point integrations (fire & forget)
• Fault tolerance, scalability, versatility, and other benefits of
loose coupling.
• Powerful real-time response and analytics.
• Greater operational efficiencies
13
benefits of EDA
Where we do come from?
1
4
15Document Title - Name - Function - Business Unit DD/MM/YYYY
Monolithic
Shop
Customer
Customer
Inventory
Payment
Might be individual bounded
context in new world
16Document Title - Name - Function - Business Unit DD/MM/YYYY
Traditional Archiecture (Point to Point)
Shop
Customer
MarketingInventory
Payment Reporting
Inventory
17Document Title - Name - Function - Business Unit DD/MM/YYYY
Traditional Archiecture- ESB
Shop
Customer
MarketingInventory
Payment Reporting
Enterprise Service Bus
18Document Title - Name - Function - Business Unit DD/MM/YYYY
Traditional Archiecture- ESB
Shop
Customer
MarketingInventory
Payment Reporting
Lets add
some fraud
check and
new
version
V1
V2
Enterprise Service Bus
fraud
fraud
19Document Title - Name - Function - Business Unit DD/MM/YYYY
Lets add loyalty features
Shop
Customer
MarketingInventory
Payment Reporting
V1
V2
Enterprise Service Bus
fraud
fraud
Loyalty
20Document Title - Name - Function - Business Unit DD/MM/YYYY
When we scale up and see Integration ripple effect
Shop
Customer
MarketingInventory
Payment Reporting
V1
V2
Enterprise Service Bus
fraud
fraud
Loyalty
Loyalty
• SOA is all about dividing domain logic into separate systems and
exposing it as services
• In some cases business logic will, by its very nature, be spread out
over many systems or across various domain (cross domain).
• The result is domain pollution and bloat in the SOA systems.
Whats the problem
22Document Title - Name - Function - Business Unit DD/MM/YYYY
• Domain driven design promote the business logic to expose as a
service and focus should be on domain and domain logic.
• Domain event is an activity happened that domain expert is concerned.
• By exposing relevant Domain Events on a shared event bus we can
isolate cross cutting functions to separate systems
SOA and Domain events
SOA + Domain events
Shop
Customer
Marketing
Reporting
Inventory
Payment
Event
Bus
Loyalty
Fraud
BAM
“You make me complete”
SOA EDA
Event Driven (Async) in Microservices
Shop DBShop logicShop API
Customer DBCustomer logicCustomer API
Payment DBPayment logicPayment API
Event Hub
Shop Microservices
Payment Microservices
Customer Microservices
Producer,
Consumer
Consumer
Consumer
27Document Title - Name - Function - Business Unit DD/MM/YYYY
Event Streaming Data Source
Shop DBShop logicShop API
Customer DBCustomer logicCustomer API
Payment DBPayment logicPayment API
Event Hub
Shop Microservices
Payment Microservices
Customer Microservices
Producer,
Consumer
Consumer
Consumer
Mobile AppsSocial Media
Stocks Blockchain
Location IOT
Events Streaming
28Document Title - Name - Function - Business Unit DD/MM/YYYY
Microservice events and Streaming processing
StateMicroserviceAPI
Microservices Cluster
Mobile AppsSocial Media
Stocks Blockchain
Location IOT
Events Stream
Event
Hub
Events Stream
Stream Processing
Cluster
Stream
Analytics
dashboard
Events Stream
Reference
Model
Results
BI tools
Search/Discover
Mobile &
online apps
SQL
Service
Service
Service
• Domain event- In domain-driven design, domain events are described as something that happens in the domain and is important
to domain experts.
- A user has registered
- An order has been cancelled.
- The payment has been received
Domain events are relevant both within a bounded context and across bounded contexts for implementing processes within the
domain.
Best for communication between bounded context.
29Document Title - Name - Function - Business Unit DD/MM/YYYY
Domain event and event sourcing
■ Event Sourcing - Event Sourcing ensures that all changes to application state are stored as a sequence of events. It store
the events that lead to specific state and state too.
- MobileNumberProvided (MobileNumber)
- VerificationCodeGenerated (VerificationCode)
- MobileNumberValidated (no additional state)
- UserDetailsProvided (FullName, Address, …)
These events are sufficient to reconstruct the current state of the UserRegistration aggregate at any time.
Event Sourcing is for persistent strategy. Event Sourcing makes it easier to fix inconsistencies. Event Sourcing is local for a
domain.
Why Kafka for event-driven?
3
0
31Document Title - Name - Function - Business Unit DD/MM/YYYY
32Document Title - Name - Function - Business Unit DD/MM/YYYY
Kafka Overview
• Distributed publish-subscribe messaging system.
• Designed for processing of real time activity stream data (log,
metrics, collections, social media streams,…..)
• Does not use JMS API and standards
• Kafka maintains feeds of message in topics
• Initially developed at Linkedin, now part of Apache.
33Document Title - Name - Function - Business Unit DD/MM/YYYY
Kafka History
• Reliability. Kafka is distributed, partitioned, replicated, and fault tolerant. Kafka
replicates data and is able to support multiple subscribers. Additionally, it
automatically balances consumers in the event of failure.
• Scalability. Kafka is a distributed system that scales quickly and easily without
incurring any downtime.
• Durability. Kafka uses a distributed commit log, which means messages persists
on disk as fast as possible providing intra-cluster replication, hence it is durable.
• Performance. Kafka has high throughput for both publishing and subscribing
messages. It maintains stable performance even when dealing with many terabytes
of stored messages.
34Document Title - Name - Function - Business Unit DD/MM/YYYY
Benefits of Kafka
• Kafka is a messaging system that is designed to be fast, scalable, and durable.
• A producer is an entity/application that publishes data to a Kafka cluster, which is made up
of brokers.
• A Broker is responsible for receiving and storing the data when a producer publishes.
• A consumer then consumes data from a broker at a specified offset, i.e. position.
• A Topic is a category/feed name to which records are stored and published. Topics have partitions and
order guaranteed per partitions
• All Kafka records are organized into topics. Producer applications write data to topics and consumer
applications read from topics.
35Document Title - Name - Function - Business Unit DD/MM/YYYY
What is kafka
36Document Title - Name - Function - Business Unit DD/MM/YYYY
37Document Title - Name - Function - Business Unit DD/MM/YYYY
Kafka Architecture.
• Topic is divided in partitions.
• The message order is only guarantee inside a partition
• Consumer offsets are persisted by Kafka with a commit/auto-commit mechanism.
• Consumers subscribes to topics
• Consumers with different group-id receives all messages of the topics they subscribe.
They consume the messages at their own speed.
• Consumers sharing the same group-id will be assigned to one (or several) partition of the
topics they subscribe. They only receive messages from their partitions. So a constraint
appears here: the number of partitions in a topic gives the maximum number of parallel
consumers.
• The assignment of partitions to consumer can be automatic and performed by Kafka
(through Zookeeper). If a consumer stops polling or is too slow, a process call “re-
balancing” is performed and the partitions are re-assigned to other consumers.
38Document Title - Name - Function - Business Unit DD/MM/YYYY
Key Concepts of Kafka
• Kafka normally divides topic in multiply partitions.
• Each partition is an ordered, immutable sequence of messages that is continually appended to.
• A message in a partition is identified by a sequence number called offset.
• The FIFO is only guarantee inside a partition.
• When a topic is created, the number of partitions should be given
• The producer can choose which partition will get the message or let Kafka decides for him
based on a hash of the message key (recommended). So the message key is important and will
be the used to ensure the message order.
• Moreover, as the consumer will be assigned to one or several partition, the key will also “group”
messages to a same consumer.
39Document Title - Name - Function - Business Unit DD/MM/YYYY
Key Concepts of Kafka - continued
• A data source writes messages to
the log and one or more
consumers reads from the log at
the point in time they choose.
• In the diagram below a data source
is writing to the log and consumers
A and B are reading from the log at
different offsets.
40Document Title - Name - Function - Business Unit DD/MM/YYYY
Log Anatomy
• We have a broker with three topics, where each topic has 8 partitions.
• The producer sends a record to partition 1 in topic 1 and since the partition is empty
the record ends up at offset 0.
41Document Title - Name - Function - Business Unit DD/MM/YYYY
Record flow in Apache Kafka
• Next record is added to partition 1 will and up at offset 1, and the next record at offset
2 and so on.
• This is a commit log, each record is appended to the log and there is no way to
change the existing records in the log(immutable). This is also the same offset that
the consumer uses to specify where to start reading.
42Document Title - Name - Function - Business Unit DD/MM/YYYY
Record flow in Apache Kafka - continued
43Document Title - Name - Function - Business Unit DD/MM/YYYY
Apache Kafka Architecture
• Each broker holds a number of partitions and each of these partitions can be either a
leader or a replica for a topic.
• All writes and reads to a topic go through the leader and the leader coordinates updating
replicas with new data. If a leader fails, a replica takes over as the new leader.
44Document Title - Name - Function - Business Unit DD/MM/YYYY
Kafka - Partitions and Brokers
• Producers write to a single leader, this provides a means of load balancing production
so that each write can be serviced by a separate broker and machine.
• In the image, the producer is writing to partition 0 of the topic and partition 0
replicates that write to the available replicas.
45Document Title - Name - Function - Business Unit DD/MM/YYYY
Kafka – Producers writing to broker
46Document Title - Name - Function - Business Unit DD/MM/YYYY
Kafka Architecture- Topic Replication factor
• Producer is process that can publish a
message to a topic.
• Consumer is a process that can subscribe
to one or more topics and consume
messages published to topics.
• Topic category is the name of the feed to
which messages are published.
• Broker is a process running on single
machine
• Cluster is a group of brokers working
together.
• Broker management done by Zookeeper.
47Document Title - Name - Function - Business Unit DD/MM/YYYY
Flow of a record in Kafka
• Auto Scalable infrastructure.
• Multi language support (SDKs)
• Event Streaming Database
• GUI driven management and monitoring
• Enterprise Grade security
• Diagnostic Logs
• Data Monitor
• Global Resilience
• Disaster Recovery
• Connector with legacy application
• Retention management
• Flexible DevOps
• …..
Document Title - Name - Function - Business Unit DD/MM/YYYY
What are capabilites of events hub
• Traditional message broker (not really event driven)
49Document Title - Name - Function - Business Unit DD/MM/YYYY
Alternatives of event-hub or kafka?
50Document Title - Name - Function - Business Unit DD/MM/YYYY
What about legacy App?
RDBMS
Existing App
Event
Hub
New APPChange Data capture
• Attunity Replicate
• Debezium (open source)
• IBM IIDR
• Oracle GoldenGate for Big Data
• SQ Data
51Document Title - Name - Function - Business Unit DD/MM/YYYY
Change data capture tools
52Document Title - Name - Function - Business Unit DD/MM/YYYY
Microservice events and Streaming processing and legacy application
StateMicroserviceAPI
Microservices Cluster
Mobile AppsSocial Media
Stocks Blockchain
Location IOT
Events Stream
Event
Hub
Events Stream
Stream Processing
Cluster
Stream
Analytics
dashboard
Events Stream
Reference
Model
Results
BI tools
Search/Discover
Mobile &
online apps
SQL
Service
Service
Service
State
State
Finance Sales Audit
Change Data capture
Big Data
53Document Title - Name - Function - Business Unit DD/MM/YYYY
Integrate kafka with legacy
• JDBC connector for Kafka connect
• Use CDC (Change data capture) tool which integrates with kafka
connect.
• Kafka Connect is a tool for scalably and reliably streaming data between
Apache Kafka and other data systems.
Runs separately from Kafka brokers.
55Document Title - Name - Function - Business Unit DD/MM/YYYY
Kafka connect
• How do manage the event lifecycle?
• We have API management platform for APIs.
56Document Title - Name - Function - Business Unit DD/MM/YYYY
Async API - https://www.asyncapi.com/
Event lifecycle
- Design
- Documentation
- Code generation
- Event management
- Test
- Monitoring
An Async API document is a file that defines and annotates
the different components of a specific Event-Driven API.
57Document Title - Name - Function - Business Unit DD/MM/YYYY
OpenAPI- AsyncAPI comparison
Summary
5
8
• Make the split right – Bounded context.
• Events are the communication between bounded context.
• Event can be Async communication b/w microsevices.
• Kafka is great source for messaging.
• Event hub is key in new enterprise integration world.
• Use CDC for legacy integrationn.
• Try Async API for event documentation.
• https://www.slideshare.net/gschmutz/building-event-driven-microservices-with-apache-
kafka-208145957
• https://www.slideshare.net/jeppec/soa-and-event-driven-architecture-soa-
20?qid=604f3115-642b-48d4-b7ef-66ce11ab9b0b&v=&b=&from_search=65
• https://docs.confluent.io/current/connect/index.html
• https://data-flair.training/blogs/kafka-architecture/
• https://martinfowler.com/bliki/BoundedContext.html
• https://insidebigdata.com/2018/04/12/developing-deeper-understanding-apache-kafka-
architecture/
• https://www.asyncapi.com/
59Document Title - Name - Function - Business Unit DD/MM/YYYY
References
Questions?
6
0

More Related Content

PDF
Digital integration hub: Why, what and how?
PPTX
IBM Cloud Direct Link 2.0
PDF
API Days Singapore
PDF
The Big Picture: Monitoring and Orchestration of Your Microservices Landscape...
PDF
Confluent Messaging Modernization Forum
PPSX
Cloud Architecture - Multi Cloud, Edge, On-Premise
PDF
apidays LIVE Australia 2020 - Building an Enterprise Eventing Platform by Gna...
PDF
APAC Confluent Consumer Data Right the Lowdown and the Lessons
Digital integration hub: Why, what and how?
IBM Cloud Direct Link 2.0
API Days Singapore
The Big Picture: Monitoring and Orchestration of Your Microservices Landscape...
Confluent Messaging Modernization Forum
Cloud Architecture - Multi Cloud, Edge, On-Premise
apidays LIVE Australia 2020 - Building an Enterprise Eventing Platform by Gna...
APAC Confluent Consumer Data Right the Lowdown and the Lessons

What's hot (20)

PPTX
From legacy systems to microservices and back | Andera Gioia, Quantyca
PDF
Transform Your Mainframe and IBM i Data for the Cloud with Precisely and Apac...
PDF
Risk Management in Retail with Stream Processing (Daniel Jagielski, Virtuslab...
PDF
Battle Tested Event-Driven Patterns for your Microservices Architecture
PDF
Transforming Financial Services with Event Streaming Data
PDF
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
PDF
Digital Transformation: Highly Resilient Streaming Architecture and Strategies
PPTX
Should we manage events like APIs? | Kim Clark, IBM
PPTX
Transform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
PDF
Caching for Microservices Architectures: Session II - Caching Patterns
PDF
The Bridge to Cloud (Peter Gustafsson, Confluent) London 2019 Confluent Strea...
PPTX
Scaling DevOps of Microservices at Uber (Code Conf 2018)
PDF
IDC Multicloud 2019 - Conference Milano , Oracle speech
PDF
Hybrid Streaming Analytics for Apache Kafka Users | Firat Tekiner, Google
PPTX
Modernizing your Application Architecture with Microservices
PPTX
Stream me to the Cloud (and back) with Confluent & MongoDB
PDF
Real-time Adaptation of Financial Market Events with Kafka | Cliff Cheng and ...
PPTX
Event Driven Architecture
PDF
Building event-driven Microservices with Kafka Ecosystem
PDF
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
From legacy systems to microservices and back | Andera Gioia, Quantyca
Transform Your Mainframe and IBM i Data for the Cloud with Precisely and Apac...
Risk Management in Retail with Stream Processing (Daniel Jagielski, Virtuslab...
Battle Tested Event-Driven Patterns for your Microservices Architecture
Transforming Financial Services with Event Streaming Data
Event Streaming CTO Roundtable for Cloud-native Kafka Architectures
Digital Transformation: Highly Resilient Streaming Architecture and Strategies
Should we manage events like APIs? | Kim Clark, IBM
Transform Your Mainframe Data for the Cloud with Precisely and Apache Kafka
Caching for Microservices Architectures: Session II - Caching Patterns
The Bridge to Cloud (Peter Gustafsson, Confluent) London 2019 Confluent Strea...
Scaling DevOps of Microservices at Uber (Code Conf 2018)
IDC Multicloud 2019 - Conference Milano , Oracle speech
Hybrid Streaming Analytics for Apache Kafka Users | Firat Tekiner, Google
Modernizing your Application Architecture with Microservices
Stream me to the Cloud (and back) with Confluent & MongoDB
Real-time Adaptation of Financial Market Events with Kafka | Cliff Cheng and ...
Event Driven Architecture
Building event-driven Microservices with Kafka Ecosystem
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Ad

Similar to Kafka and event driven architecture -apacoug20 (20)

PPT
Kafka-and-event-driven-architecture-OGYatra20.ppt
PPTX
Event Driven Architectures with Apache Kafka
PDF
Apache Kafka - Scalable Message-Processing and more !
PPTX
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
PPTX
Leveraging the power of the unbundled database
PPTX
apidays LIVE Jakarta - Building an Event-Driven Architecture by Harin Honesty...
PDF
Apache Kafka Introduction
PDF
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...
PPTX
Removing dependencies between services: Messaging and Apache Kafka
PPTX
Evolutionary Systems - Kafka Microservices
PDF
Database@Home : Data Driven Apps - Data-driven Microservices Architecture wit...
PDF
Data Transformations on Ops Metrics using Kafka Streams (Srividhya Ramachandr...
PPT
Service Oriented Architecture
PDF
Implementing Domain Events with Kafka
PDF
IoT & Azure
PPTX
Reduce Risk with End to End Monitoring of Middleware-based Applications
PDF
Salesforce Winter 23 Release Webinar Slide Deck
PPTX
High throughput data streaming in Azure
PDF
Removing performance bottlenecks with Kafka Monitoring and topic configuration
PDF
OnPrem Monitoring.pdf
Kafka-and-event-driven-architecture-OGYatra20.ppt
Event Driven Architectures with Apache Kafka
Apache Kafka - Scalable Message-Processing and more !
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Leveraging the power of the unbundled database
apidays LIVE Jakarta - Building an Event-Driven Architecture by Harin Honesty...
Apache Kafka Introduction
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with K...
Removing dependencies between services: Messaging and Apache Kafka
Evolutionary Systems - Kafka Microservices
Database@Home : Data Driven Apps - Data-driven Microservices Architecture wit...
Data Transformations on Ops Metrics using Kafka Streams (Srividhya Ramachandr...
Service Oriented Architecture
Implementing Domain Events with Kafka
IoT & Azure
Reduce Risk with End to End Monitoring of Middleware-based Applications
Salesforce Winter 23 Release Webinar Slide Deck
High throughput data streaming in Azure
Removing performance bottlenecks with Kafka Monitoring and topic configuration
OnPrem Monitoring.pdf
Ad

More from Vinay Kumar (20)

PDF
Modernizing the monolithic architecture to container based architecture apaco...
PPTX
Kafka and event driven architecture -og yatra20
PDF
Extend soa with api management Sangam18
PDF
Extend soa with api management Doag18
PDF
Roaring with elastic search sangam2018
PPTX
Extend soa with api management spoug- Madrid
PPTX
Expose your data as an api is with oracle rest data services -spoug Madrid
PPTX
Modern application development with oracle cloud sangam17
PDF
award-3b07c32b-b116-3a75-8974-d814d37026ca
PDF
award-3b07c32b-b116-3a75-8974-d814d37026ca
PPTX
Adf spotlight-webcenter task flow-customzation
PDF
Personalization in webcenter portal
PPTX
Custom audit rules in Jdeveloper extension
PDF
File upload in oracle adf mobile
PDF
Webcenter application performance tuning guide
PDF
Tuning and optimizing webcenter spaces application white paper
PDF
Oracle adf performance tips
PPTX
JSR 168 Portal - Overview
PPTX
Spring framework in depth
PPTX
Oracle Fusion Architecture
Modernizing the monolithic architecture to container based architecture apaco...
Kafka and event driven architecture -og yatra20
Extend soa with api management Sangam18
Extend soa with api management Doag18
Roaring with elastic search sangam2018
Extend soa with api management spoug- Madrid
Expose your data as an api is with oracle rest data services -spoug Madrid
Modern application development with oracle cloud sangam17
award-3b07c32b-b116-3a75-8974-d814d37026ca
award-3b07c32b-b116-3a75-8974-d814d37026ca
Adf spotlight-webcenter task flow-customzation
Personalization in webcenter portal
Custom audit rules in Jdeveloper extension
File upload in oracle adf mobile
Webcenter application performance tuning guide
Tuning and optimizing webcenter spaces application white paper
Oracle adf performance tips
JSR 168 Portal - Overview
Spring framework in depth
Oracle Fusion Architecture

Recently uploaded (20)

PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPT
Teaching material agriculture food technology
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Big Data Technologies - Introduction.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Network Security Unit 5.pdf for BCA BBA.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
CIFDAQ's Market Insight: SEC Turns Pro Crypto
20250228 LYD VKU AI Blended-Learning.pptx
Teaching material agriculture food technology
Mobile App Security Testing_ A Comprehensive Guide.pdf
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Chapter 3 Spatial Domain Image Processing.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Dropbox Q2 2025 Financial Results & Investor Presentation
The Rise and Fall of 3GPP – Time for a Sabbatical?
Digital-Transformation-Roadmap-for-Companies.pptx
Big Data Technologies - Introduction.pptx
A Presentation on Artificial Intelligence
Spectral efficient network and resource selection model in 5G networks
Reach Out and Touch Someone: Haptics and Empathic Computing
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows

Kafka and event driven architecture -apacoug20

  • 1. Learn Kafka and event-driven architecture Vinay Kumar @vinaykuma201
  • 2. • ORACLE ACE • Global Integration Architect • Author of “Beginning Oracle WebCenter portal 12c” • Blogger- http://www.techartifact.com/blogs • https://www.linkedin.com/in/vinaykumar2/ 2 About me
  • 4. Agenda 4 • Interaction Style • Traditional SOA approach • Event driven Architecture • Event with Microservices • Event streaming with event Hub • Integration with legacy. • Kafka • Kafka internals • Async APIs
  • 5. Communication Styles 5 Type of Interaction Initiator Participants Time-driven Time The specifice system Request-driven Client Client & Server Event-driven event Open-ended
  • 6. Time Driven 6 Store System Run in every 30 min to check the inventory
  • 9. 9Document Title - Name - Function - Business Unit DD/MM/YYYY Event Notification
  • 10. • Anything happened (or didnt happen). • A change in the state. • An event is always named in the past tense and is immutabled • A condition that triggers a notification. CustomerAddressChanged InventoryUpdated SalesOrderCreated PurschaseOrderCreated 10 Events
  • 11. • “Real-time” events as they happen at the producer • Push notifications • One-way “fire-and-forget” • Immediate action at the consumers • Informational (“someone logged in”), not commands (“audit this”) 11 Characterstics of Events
  • 12. Typical EDA Architecture Event Bus System SystemSystem System System System Event Producers Event Transport Event Consumer
  • 13. • Supports the business demands for better service (no batch, less waiting) • No point-to-point integrations (fire & forget) • Fault tolerance, scalability, versatility, and other benefits of loose coupling. • Powerful real-time response and analytics. • Greater operational efficiencies 13 benefits of EDA
  • 14. Where we do come from? 1 4
  • 15. 15Document Title - Name - Function - Business Unit DD/MM/YYYY Monolithic Shop Customer Customer Inventory Payment Might be individual bounded context in new world
  • 16. 16Document Title - Name - Function - Business Unit DD/MM/YYYY Traditional Archiecture (Point to Point) Shop Customer MarketingInventory Payment Reporting Inventory
  • 17. 17Document Title - Name - Function - Business Unit DD/MM/YYYY Traditional Archiecture- ESB Shop Customer MarketingInventory Payment Reporting Enterprise Service Bus
  • 18. 18Document Title - Name - Function - Business Unit DD/MM/YYYY Traditional Archiecture- ESB Shop Customer MarketingInventory Payment Reporting Lets add some fraud check and new version V1 V2 Enterprise Service Bus fraud fraud
  • 19. 19Document Title - Name - Function - Business Unit DD/MM/YYYY Lets add loyalty features Shop Customer MarketingInventory Payment Reporting V1 V2 Enterprise Service Bus fraud fraud Loyalty
  • 20. 20Document Title - Name - Function - Business Unit DD/MM/YYYY When we scale up and see Integration ripple effect Shop Customer MarketingInventory Payment Reporting V1 V2 Enterprise Service Bus fraud fraud Loyalty Loyalty
  • 21. • SOA is all about dividing domain logic into separate systems and exposing it as services • In some cases business logic will, by its very nature, be spread out over many systems or across various domain (cross domain). • The result is domain pollution and bloat in the SOA systems. Whats the problem
  • 22. 22Document Title - Name - Function - Business Unit DD/MM/YYYY
  • 23. • Domain driven design promote the business logic to expose as a service and focus should be on domain and domain logic. • Domain event is an activity happened that domain expert is concerned. • By exposing relevant Domain Events on a shared event bus we can isolate cross cutting functions to separate systems SOA and Domain events
  • 24. SOA + Domain events Shop Customer Marketing Reporting Inventory Payment Event Bus Loyalty Fraud BAM
  • 25. “You make me complete” SOA EDA
  • 26. Event Driven (Async) in Microservices Shop DBShop logicShop API Customer DBCustomer logicCustomer API Payment DBPayment logicPayment API Event Hub Shop Microservices Payment Microservices Customer Microservices Producer, Consumer Consumer Consumer
  • 27. 27Document Title - Name - Function - Business Unit DD/MM/YYYY Event Streaming Data Source Shop DBShop logicShop API Customer DBCustomer logicCustomer API Payment DBPayment logicPayment API Event Hub Shop Microservices Payment Microservices Customer Microservices Producer, Consumer Consumer Consumer Mobile AppsSocial Media Stocks Blockchain Location IOT Events Streaming
  • 28. 28Document Title - Name - Function - Business Unit DD/MM/YYYY Microservice events and Streaming processing StateMicroserviceAPI Microservices Cluster Mobile AppsSocial Media Stocks Blockchain Location IOT Events Stream Event Hub Events Stream Stream Processing Cluster Stream Analytics dashboard Events Stream Reference Model Results BI tools Search/Discover Mobile & online apps SQL Service Service Service
  • 29. • Domain event- In domain-driven design, domain events are described as something that happens in the domain and is important to domain experts. - A user has registered - An order has been cancelled. - The payment has been received Domain events are relevant both within a bounded context and across bounded contexts for implementing processes within the domain. Best for communication between bounded context. 29Document Title - Name - Function - Business Unit DD/MM/YYYY Domain event and event sourcing ■ Event Sourcing - Event Sourcing ensures that all changes to application state are stored as a sequence of events. It store the events that lead to specific state and state too. - MobileNumberProvided (MobileNumber) - VerificationCodeGenerated (VerificationCode) - MobileNumberValidated (no additional state) - UserDetailsProvided (FullName, Address, …) These events are sufficient to reconstruct the current state of the UserRegistration aggregate at any time. Event Sourcing is for persistent strategy. Event Sourcing makes it easier to fix inconsistencies. Event Sourcing is local for a domain.
  • 30. Why Kafka for event-driven? 3 0
  • 31. 31Document Title - Name - Function - Business Unit DD/MM/YYYY
  • 32. 32Document Title - Name - Function - Business Unit DD/MM/YYYY Kafka Overview • Distributed publish-subscribe messaging system. • Designed for processing of real time activity stream data (log, metrics, collections, social media streams,…..) • Does not use JMS API and standards • Kafka maintains feeds of message in topics • Initially developed at Linkedin, now part of Apache.
  • 33. 33Document Title - Name - Function - Business Unit DD/MM/YYYY Kafka History
  • 34. • Reliability. Kafka is distributed, partitioned, replicated, and fault tolerant. Kafka replicates data and is able to support multiple subscribers. Additionally, it automatically balances consumers in the event of failure. • Scalability. Kafka is a distributed system that scales quickly and easily without incurring any downtime. • Durability. Kafka uses a distributed commit log, which means messages persists on disk as fast as possible providing intra-cluster replication, hence it is durable. • Performance. Kafka has high throughput for both publishing and subscribing messages. It maintains stable performance even when dealing with many terabytes of stored messages. 34Document Title - Name - Function - Business Unit DD/MM/YYYY Benefits of Kafka
  • 35. • Kafka is a messaging system that is designed to be fast, scalable, and durable. • A producer is an entity/application that publishes data to a Kafka cluster, which is made up of brokers. • A Broker is responsible for receiving and storing the data when a producer publishes. • A consumer then consumes data from a broker at a specified offset, i.e. position. • A Topic is a category/feed name to which records are stored and published. Topics have partitions and order guaranteed per partitions • All Kafka records are organized into topics. Producer applications write data to topics and consumer applications read from topics. 35Document Title - Name - Function - Business Unit DD/MM/YYYY What is kafka
  • 36. 36Document Title - Name - Function - Business Unit DD/MM/YYYY
  • 37. 37Document Title - Name - Function - Business Unit DD/MM/YYYY Kafka Architecture.
  • 38. • Topic is divided in partitions. • The message order is only guarantee inside a partition • Consumer offsets are persisted by Kafka with a commit/auto-commit mechanism. • Consumers subscribes to topics • Consumers with different group-id receives all messages of the topics they subscribe. They consume the messages at their own speed. • Consumers sharing the same group-id will be assigned to one (or several) partition of the topics they subscribe. They only receive messages from their partitions. So a constraint appears here: the number of partitions in a topic gives the maximum number of parallel consumers. • The assignment of partitions to consumer can be automatic and performed by Kafka (through Zookeeper). If a consumer stops polling or is too slow, a process call “re- balancing” is performed and the partitions are re-assigned to other consumers. 38Document Title - Name - Function - Business Unit DD/MM/YYYY Key Concepts of Kafka
  • 39. • Kafka normally divides topic in multiply partitions. • Each partition is an ordered, immutable sequence of messages that is continually appended to. • A message in a partition is identified by a sequence number called offset. • The FIFO is only guarantee inside a partition. • When a topic is created, the number of partitions should be given • The producer can choose which partition will get the message or let Kafka decides for him based on a hash of the message key (recommended). So the message key is important and will be the used to ensure the message order. • Moreover, as the consumer will be assigned to one or several partition, the key will also “group” messages to a same consumer. 39Document Title - Name - Function - Business Unit DD/MM/YYYY Key Concepts of Kafka - continued
  • 40. • A data source writes messages to the log and one or more consumers reads from the log at the point in time they choose. • In the diagram below a data source is writing to the log and consumers A and B are reading from the log at different offsets. 40Document Title - Name - Function - Business Unit DD/MM/YYYY Log Anatomy
  • 41. • We have a broker with three topics, where each topic has 8 partitions. • The producer sends a record to partition 1 in topic 1 and since the partition is empty the record ends up at offset 0. 41Document Title - Name - Function - Business Unit DD/MM/YYYY Record flow in Apache Kafka
  • 42. • Next record is added to partition 1 will and up at offset 1, and the next record at offset 2 and so on. • This is a commit log, each record is appended to the log and there is no way to change the existing records in the log(immutable). This is also the same offset that the consumer uses to specify where to start reading. 42Document Title - Name - Function - Business Unit DD/MM/YYYY Record flow in Apache Kafka - continued
  • 43. 43Document Title - Name - Function - Business Unit DD/MM/YYYY Apache Kafka Architecture
  • 44. • Each broker holds a number of partitions and each of these partitions can be either a leader or a replica for a topic. • All writes and reads to a topic go through the leader and the leader coordinates updating replicas with new data. If a leader fails, a replica takes over as the new leader. 44Document Title - Name - Function - Business Unit DD/MM/YYYY Kafka - Partitions and Brokers
  • 45. • Producers write to a single leader, this provides a means of load balancing production so that each write can be serviced by a separate broker and machine. • In the image, the producer is writing to partition 0 of the topic and partition 0 replicates that write to the available replicas. 45Document Title - Name - Function - Business Unit DD/MM/YYYY Kafka – Producers writing to broker
  • 46. 46Document Title - Name - Function - Business Unit DD/MM/YYYY Kafka Architecture- Topic Replication factor
  • 47. • Producer is process that can publish a message to a topic. • Consumer is a process that can subscribe to one or more topics and consume messages published to topics. • Topic category is the name of the feed to which messages are published. • Broker is a process running on single machine • Cluster is a group of brokers working together. • Broker management done by Zookeeper. 47Document Title - Name - Function - Business Unit DD/MM/YYYY Flow of a record in Kafka
  • 48. • Auto Scalable infrastructure. • Multi language support (SDKs) • Event Streaming Database • GUI driven management and monitoring • Enterprise Grade security • Diagnostic Logs • Data Monitor • Global Resilience • Disaster Recovery • Connector with legacy application • Retention management • Flexible DevOps • ….. Document Title - Name - Function - Business Unit DD/MM/YYYY What are capabilites of events hub
  • 49. • Traditional message broker (not really event driven) 49Document Title - Name - Function - Business Unit DD/MM/YYYY Alternatives of event-hub or kafka?
  • 50. 50Document Title - Name - Function - Business Unit DD/MM/YYYY What about legacy App? RDBMS Existing App Event Hub New APPChange Data capture
  • 51. • Attunity Replicate • Debezium (open source) • IBM IIDR • Oracle GoldenGate for Big Data • SQ Data 51Document Title - Name - Function - Business Unit DD/MM/YYYY Change data capture tools
  • 52. 52Document Title - Name - Function - Business Unit DD/MM/YYYY Microservice events and Streaming processing and legacy application StateMicroserviceAPI Microservices Cluster Mobile AppsSocial Media Stocks Blockchain Location IOT Events Stream Event Hub Events Stream Stream Processing Cluster Stream Analytics dashboard Events Stream Reference Model Results BI tools Search/Discover Mobile & online apps SQL Service Service Service State State Finance Sales Audit Change Data capture Big Data
  • 53. 53Document Title - Name - Function - Business Unit DD/MM/YYYY
  • 54. Integrate kafka with legacy • JDBC connector for Kafka connect • Use CDC (Change data capture) tool which integrates with kafka connect.
  • 55. • Kafka Connect is a tool for scalably and reliably streaming data between Apache Kafka and other data systems. Runs separately from Kafka brokers. 55Document Title - Name - Function - Business Unit DD/MM/YYYY Kafka connect
  • 56. • How do manage the event lifecycle? • We have API management platform for APIs. 56Document Title - Name - Function - Business Unit DD/MM/YYYY Async API - https://www.asyncapi.com/ Event lifecycle - Design - Documentation - Code generation - Event management - Test - Monitoring An Async API document is a file that defines and annotates the different components of a specific Event-Driven API.
  • 57. 57Document Title - Name - Function - Business Unit DD/MM/YYYY OpenAPI- AsyncAPI comparison
  • 58. Summary 5 8 • Make the split right – Bounded context. • Events are the communication between bounded context. • Event can be Async communication b/w microsevices. • Kafka is great source for messaging. • Event hub is key in new enterprise integration world. • Use CDC for legacy integrationn. • Try Async API for event documentation.
  • 59. • https://www.slideshare.net/gschmutz/building-event-driven-microservices-with-apache- kafka-208145957 • https://www.slideshare.net/jeppec/soa-and-event-driven-architecture-soa- 20?qid=604f3115-642b-48d4-b7ef-66ce11ab9b0b&v=&b=&from_search=65 • https://docs.confluent.io/current/connect/index.html • https://data-flair.training/blogs/kafka-architecture/ • https://martinfowler.com/bliki/BoundedContext.html • https://insidebigdata.com/2018/04/12/developing-deeper-understanding-apache-kafka- architecture/ • https://www.asyncapi.com/ 59Document Title - Name - Function - Business Unit DD/MM/YYYY References