SlideShare a Scribd company logo
Actors or Not
Async Event Architectures
Yaroslav Tkachenko
Senior Software Engineer at Demonware (Activision)
Background
• 10 years in the industry
• ~1 year at Demonware/Activision, 5 years at Bench Accounting
• Mostly web, back-end, platform, infrastructure and data things
• @sap1ens / sap1ens.com
• Talk to me about data pipelines, stream processing and the Premier League ;-)
Two stories
Context: sync vs async communication
Service A Service B
POST /foo
service-b.example.com
“Easy” way – HTTP (RPC) API
Context: sync vs async communication
• Destination – where to send request?
• Service discovery
• Tight coupling
• Time – expect reply right away?
• Failure – always expect success?
• Retries
• Back-pressure
• Circuit breakers
You cannot make
synchronous requests
over the network
behave like local ones
Context: async communication styles
• Point-to-Point Channel
• One sender
• One receiver
• Publish-Subscribe Channel (Broadcast)
• One publisher
• Multiple subscribers
Context: Events vs Commands
• Event
• Simply a notification that something happened in the past
• Command
• Request to invoke some functionality (“RPC over messaging”)
Actors or Not: Async Event Architectures
Actors or Not: Async Event Architectures
Actors or Not: Async Event Architectures
• 469+ million gamers
• 3.2+ million concurrent online gamers
• 100+ games
• 300,000 requests per second at peak
• Average query response time of <.02 second
• 630,000+ metrics a minute
• 132 billion+ API calls per month
Demonware by the numbers
• Core game services including:
• Auth
• Matchmaking
• Leaderboards
• Marketplace
• Loot & Rewards
• Storage
• Etc.
• Erlang for networking layer, Python for application layer
• Still have a big application monolith, but slowly migrating to independent
services (SOA)
Demonware Back-end Services
• Lots of synchronous request/response communication between the monolith
and the services using:
• HTTP
• RPC
• The requesting process:
• conceptually knows which service it wants to call into
• is aware of the action that it is requesting, and its effects
• generally needs to be notified of the request’s completion and any
associated information before proceeding with its business logic
DW Services: Synchronous communication
• Using Domain Events
• Communication model assumes the following:
• The event may need to be handled by zero or more service processes,
each with different use cases; the process that generates the event does
not need to be aware of them
• The process that generates the event does not need to be aware of
what actions will be triggered, and what their effects might be
• The process that generates the event does not need to be notified of
the handlers’ completion before proceeding with its business logic
• Seamless integration with the Data Pipeline / Warehouse
DW Services: Asynchronous communication*
Domain Driven Design
Application
Core
Event
Adapter
Events
Commands
Events
HTTP
Adapter
CLI
Adapter
Kafka
Service
Kafka
Kafka
Publish-Subscribe OR Point-to-Point is a decision made by consumers
Kafka
• Service name is used as a topic name in Kafka
• Services have to explicitly subscribe to interested topics on startup (some
extra filtering is also supported)
• All messages are typically partitioned by a user ID to preserve order
Event Dispatcher
Application
Core
Event
Dispatcher
Kafka topic
Partitions
Kafka Python
Consumer
(librdkafka)
Local
buffer
queue
queue
queue
Tornado
Queues
Event Dispatcher
1 @demonata.event.source(
2 name='events_from_service_a'
3 )
4 class ServiceAEventsDispatcher (object):
5 def __init__(self, my_app_service):
6 self._app = my_app_service
7
8 @demonata.event.schema(
9 name='service.UserUpdated' ,
10 ge_version= '1.2.3',
11 event_dto=UserUpdated
12 )
13 def on_user_updated (self, message, event):
14 assert isinstance(message, DwPublishedEvent)
15 # ...
Publishing Events
The following reliability modes are supported:
• Fire and forget, relying on Kafka producer (acks = 0, 1, all)
• At least once (guaranteed), using remote EventStore backed by a DB
• At least once (intermediate), using local EventStore
Event Publisher
Application
Core
Event
Publisher
Kafka topic
PartitionsKafka Python
Producer
(librdkafka)
Event
Store Event
Producer
Publishing Events
1 @demonata.coroutine
2 def handle_event_atomically (self, event_to_process):
3 entity_key = self. determine_entity_key (event_to_process)
4 entity = self.db. read(entity_key)
5
6 some_data = yield self.perform_some_async_io_read ()
7 new_entity, new_event = self. apply_business_logic (
8 entity, event_to_process, some_data
9 )
10
11 # single-shard MySQL transaction:
12 with self.db. trans(shard_key=entity_key):
13 db.save(new_entity)
14 self.publisher. publish(new_event)
15 commit()
Event Framework in Demonware
• Decorator-driven consumers using callbacks
• Reliable producers
• Non-blocking IO using Tornado
• Apache Kafka as a transport
But still…
Can we do better?
1 @demonata.event.source(
2 name='events_from_service_a'
3 )
4 class ServiceAEventsDispatcher (object):
5 def __init__(self, my_app_service):
6 self._app = my_app_service
7
8 @demonata.event.schema(
9 name='service.UserUpdated' ,
10 ge_version= '1.2.3',
11 event_dto=UserUpdated
12 )
13 def on_user_updated (self, message, event):
14 assert isinstance(message, DwPublishedEvent)
15 # ...
Event Dispatcher
This is just
a boilerplate
Callback that
should pass
an event to
the actual
application
Can we create
producers and
consumers that support
message-passing
natively?
Actors
• Communicate with asynchronous messages instead of method invocations
• Manage their own state
• When responding to a message, can:
• Create other (child) actors
• Send messages to other actors
• Stop (child) actors or themselves
Actors
Actors: Erlang
1 loop() ->
2 receive
3 {From, Msg} ->
4 io:format("received ~p~n" , [Msg]),
5
6 From ! "got it";
7 end.
Actors: Akka
1 class MyActor extends Actor with ActorLogging {
2 def receive = {
3 case msg => {
4 log.info(s"received $msg" )
5
6 sender() ! "got it"
7 }
8 }
9 }
Actor-to-Actor communication
• Asynchronous and non-blocking message-passing
• Doesn’t mean senders must wait indefinitely - timeouts can be used
• Location transparency
• Enterprise Integration Patterns!
Bench Accounting
Bench Accounting Online Services
• Classic SAAS application used by the customers and internal bookkeepers:
• Double-entry bookkeeping with sophisticated reconciliation engine
and reporting [no external software]
• Receipt collection and OCR
• Integrations with banks, statement providers, Stripe, Shopify, etc.
• Enterprise Java monolith transitioning to Scala microservices (with Akka)
• Legacy event-based system built for notifications
Bench Accounting Legacy Eventing
• Multiple issues:
• Designed for a few specific use-cases, schema is not extendable
• Wasn’t built for microservices
• Tight coupling
• New requirements:
• Introduce real-time messaging (web & mobile)
• Add a framework for producing and consuming Domain Events and
Commands (both point-to-point and broadcasts)
• Otherwise very similar to the Demonware’s async communication
model
Bench Accounting Eventing System
ActiveMQ
Eventing
service
Service
A
Service
B
queue
queue
or
topic
IntegrationsEvent
store
ActiveMQ
Point-to-Point Publish-Subscribe
ActiveMQ
• Service name is used as a queue or topic name in ActiveMQ, but there is a also a
topic for global events
• Services can subscribe to interested queues or topics any time a new actor is
created
• Supports 3 modes of operations:
• Point-to-Point channel using a queue (perfect for Commands)
• Publish-Subscribe channel with guaranteed delivery using a Virtual topic
• Global Publish-Subscribe channel with guaranteed delivery using a Virtual
topic
Secret sauce: Apache Camel
• Integration framework that implements Enterprise Integration Patterns
• akka-camel is an official Akka library (now deprecated, Alpakka is a modern
alternative)
• Can be used with any JVM language
• “The most unknown coolest library out there”: JM (c)
Event Listener
akka-camel
ActiveMQ
queue or
topic
ActiveMQ
Consumer
prefetch
buffer
Actor
Event Listener
1 class CustomerService extends EventingConsumer {
2 def endpointUri = "activemq:Consumer.CustomerService.VirtualTopic.events"
3
4 def receive = {
5 case e: CamelMessage if e.isEvent && e.name == “some.event.name” => {
6 self ! DeleteAccount(e.clientId, sender())
7 }
8
9 case DeleteAccount(clientId, originalSender) => {
10 // ...
11 }
12 }
13 }
Event Sender
akka-camel ActiveMQ
queue or
topic
ActiveMQ
ProducerActor
Event Sender
1 // Broadcast
2 EventingClient
3 .buildSystemEvent (Event.BankError, userId, Component.ServiceA)
4 .send(true)
5
6 // Direct
7 EventingClient
8 .buildSystemEventWithAsset (Event.BankError, userId, Component.ServiceB)
9 .buildUrlAsset("http://example.com" )
10 .sendDirect("reporting")
Eventing Service
Event
Recorder
Event
Receiver
Event
Forwarder
Event
Reader
HTTP
API
Events
DAO
Event
Store
Integrations
ActiveMQqueue
ACK
Send
Receive
Eventing Service
So, we do we need this “router” service?
• Routing is handled in one place
• Lightweight consumers and producers
• The same Event Store is used for all services
Event framework in Bench Accounting
• Actor-based consumers and producers using Apache Camel
• Producer with ACKs
• Non-blocking IO
• Apache ActiveMQ as a transport
Lessons learned
So, Actors
• Semantics is important! Natural message-passing in Actors is a huge
advantage
• Asynchronous communication and location transparency by default makes it
easy to move actors between service boundaries
• We could also talk about supervision hierarchies and “Let it crash”
philosophy, excellent concurrency, networking features, etc… next time! You
can start with basics
Recommendations
• Domain Driven Design and Enterprise Integration Patterns are great!
• Understand your Domain space and choose the concepts you need to
support: Events, Commands, Documents or all of them
• Explicitly handle all possible failures. They will happen eventually
• Event Stores can be used for so many things! Tracing and debugging,
auditing, data analytics, etc.
• Actors or not? It really depends. It’s possible to build asynchronous,
non-blocking event frameworks in Java, Python, Node.js or a lot of the other
languages, but actors are asynchronous and message-based by default
Recommendations
• Carefully choose the transport layer. Apache Kafka can handle an impressive
scale, but many messaging features are missing / support just introduced
• Understand what you need to optimize: latency or throughput. You might
need to introduce multiple channels with different characteristics
• Do you really need exactly-once semantics?
• Message formats and schemas are extremely important! Choose binary
formats (Protobuf, Avro) AND/OR make sure to use a schema registry and
design a schema evolution strategy
• Consider splitting your messages into an envelope (metadata) and a payload.
Events and Commands could use the same envelope
Challenges
• We’re too attached to the synchronous request/response paradigm. It’s
everywhere - in the libraries, frameworks, standards. It takes time to learn
how to live in the asynchronous world
• High coupling will kill you. Routing is not a problem when you have a
handful of services (producers/consumers), but things get really complicated
with 10+ services. Try to avoid coupling by using Events as much as possible
and stay away from Commands unless you really need them
• Managing a properly partitioned, replicated and monitored message broker
cluster is still a non-trivial problem. Consider using managed services if your
Ops resources are limited
Challenges
• It’s very straightforward to implement event-based communication for
writes, but harder for reads. You’ll probably end up with some sort of DB
denormalization, in-memory hash join tables, caching or all of the above
• When you have dozens of producers and consumer scattered across the
service it becomes challenging to see the full picture. State and sequence
diagrams can help with capturing business use-cases, distributed tracing
becomes almost a must-have
• When things break you won’t notice them immediately without a proper
monitoring and alerting. Considering covering all critical business use-cases
first
That signup page...
Thanks
Davide Romani (Demonware)
Pavel Rodionov (Bench Accounting)
Questions?
@sap1ens | sap1ens.com

More Related Content

PDF
Apache Kafka: New Features That You Might Not Know About
PDF
Building Scalable and Extendable Data Pipeline for Call of Duty Games: Lesson...
PDF
Kafka Streams: the easiest way to start with stream processing
PDF
Designing Scalable and Extendable Data Pipeline for Call Of Duty Games
PDF
KSQL: Streaming SQL for Kafka
PDF
Real Time Streaming Data with Kafka and TensorFlow (Yong Tang, MobileIron) Ka...
PDF
Kafka Summit SF 2017 - Exactly-once Stream Processing with Kafka Streams
PPTX
Kafka Summit NYC 2017 - Easy, Scalable, Fault-tolerant Stream Processing with...
Apache Kafka: New Features That You Might Not Know About
Building Scalable and Extendable Data Pipeline for Call of Duty Games: Lesson...
Kafka Streams: the easiest way to start with stream processing
Designing Scalable and Extendable Data Pipeline for Call Of Duty Games
KSQL: Streaming SQL for Kafka
Real Time Streaming Data with Kafka and TensorFlow (Yong Tang, MobileIron) Ka...
Kafka Summit SF 2017 - Exactly-once Stream Processing with Kafka Streams
Kafka Summit NYC 2017 - Easy, Scalable, Fault-tolerant Stream Processing with...

What's hot (20)

PDF
Exactly-once Data Processing with Kafka Streams - July 27, 2017
PDF
What is the State of my Kafka Streams Application? Unleashing Metrics. | Neil...
PDF
Kafka Summit SF 2017 - Kafka Stream Processing for Everyone with KSQL
PDF
ksqlDB: A Stream-Relational Database System
PPTX
Akka Microservices Architecture And Design
PDF
KSQL Intro
PDF
A Tour of Apache Kafka
PPTX
Exactly-once Stream Processing with Kafka Streams
PDF
Webinar: Deep Dive on Apache Flink State - Seth Wiesman
PDF
KSQL Performance Tuning for Fun and Profit ( Nick Dearden, Confluent) Kafka S...
PDF
Exactly-Once Made Easy: Transactional Messaging Improvement for Usability and...
PDF
Production Ready Kafka on Kubernetes (Devandra Tagare, Lyft) Kafka Summit SF ...
PDF
Achieving a 50% Reduction in Cross-AZ Network Costs from Kafka (Uday Sagar Si...
PDF
What's the time? ...and why? (Mattias Sax, Confluent) Kafka Summit SF 2019
PDF
Apache kafka meet_up_zurich_at_swissre_from_zero_to_hero_with_kafka_connect_2...
PDF
Event sourcing - what could possibly go wrong ? Devoxx PL 2021
PPTX
Top Ten Kafka® Configs
PDF
KSQL and Security: The Current State of Affairs (Victoria Xia, Confluent) Kaf...
PDF
Introduction to Kafka Streams
PDF
UDF/UDAF: the extensibility framework for KSQL (Hojjat Jafapour, Confluent) K...
Exactly-once Data Processing with Kafka Streams - July 27, 2017
What is the State of my Kafka Streams Application? Unleashing Metrics. | Neil...
Kafka Summit SF 2017 - Kafka Stream Processing for Everyone with KSQL
ksqlDB: A Stream-Relational Database System
Akka Microservices Architecture And Design
KSQL Intro
A Tour of Apache Kafka
Exactly-once Stream Processing with Kafka Streams
Webinar: Deep Dive on Apache Flink State - Seth Wiesman
KSQL Performance Tuning for Fun and Profit ( Nick Dearden, Confluent) Kafka S...
Exactly-Once Made Easy: Transactional Messaging Improvement for Usability and...
Production Ready Kafka on Kubernetes (Devandra Tagare, Lyft) Kafka Summit SF ...
Achieving a 50% Reduction in Cross-AZ Network Costs from Kafka (Uday Sagar Si...
What's the time? ...and why? (Mattias Sax, Confluent) Kafka Summit SF 2019
Apache kafka meet_up_zurich_at_swissre_from_zero_to_hero_with_kafka_connect_2...
Event sourcing - what could possibly go wrong ? Devoxx PL 2021
Top Ten Kafka® Configs
KSQL and Security: The Current State of Affairs (Victoria Xia, Confluent) Kaf...
Introduction to Kafka Streams
UDF/UDAF: the extensibility framework for KSQL (Hojjat Jafapour, Confluent) K...
Ad

Similar to Actors or Not: Async Event Architectures (20)

PDF
Event driven-arch
PPTX
Software Architectures, Week 4 - Message-based Architectures, Message Bus
PDF
Message Driven and Event Sourcing
PDF
An eventful tour from enterprise integration to serverless and functions
PPTX
Using Event Streams in Serverless Applications
PDF
apidays LIVE Australia 2020 - Building an Enterprise Eventing Platform by Gna...
PDF
API Days Australia
PDF
Event Driven-Architecture from a Scalability perspective
PPTX
Event Driven Architectures with Apache Kafka
PPTX
Event Driven Architectures - Phoenix Java Users Group 2013
PPTX
Kafka for data scientists
DOCX
RabbitMQ in Microservice Architecture.docx
PDF
SOA Pattern Event Driven Messaging
PDF
Events and microservices
KEY
Event Driven Architecture
PDF
Evolving from Messaging to Event Streaming
PDF
Patterns and anti patterns of streaming
PPTX
Event-driven Infrastructure - Mike Place, SaltStack - DevOpsDays Tel Aviv 2016
PDF
Event Driven Services Part 2: Building Event-Driven Services with Apache Kafka
PPTX
Event mesh APIDays melbourne September 2019
Event driven-arch
Software Architectures, Week 4 - Message-based Architectures, Message Bus
Message Driven and Event Sourcing
An eventful tour from enterprise integration to serverless and functions
Using Event Streams in Serverless Applications
apidays LIVE Australia 2020 - Building an Enterprise Eventing Platform by Gna...
API Days Australia
Event Driven-Architecture from a Scalability perspective
Event Driven Architectures with Apache Kafka
Event Driven Architectures - Phoenix Java Users Group 2013
Kafka for data scientists
RabbitMQ in Microservice Architecture.docx
SOA Pattern Event Driven Messaging
Events and microservices
Event Driven Architecture
Evolving from Messaging to Event Streaming
Patterns and anti patterns of streaming
Event-driven Infrastructure - Mike Place, SaltStack - DevOpsDays Tel Aviv 2016
Event Driven Services Part 2: Building Event-Driven Services with Apache Kafka
Event mesh APIDays melbourne September 2019
Ad

More from Yaroslav Tkachenko (13)

PDF
Dynamic Change Data Capture with Flink CDC and Consistent Hashing
PDF
Streaming SQL for Data Engineers: The Next Big Thing?
PDF
Apache Flink Adoption at Shopify
PDF
Storing State Forever: Why It Can Be Good For Your Analytics
PDF
It's Time To Stop Using Lambda Architecture
PDF
Bravo Six, Going Realtime. Transitioning Activision Data Pipeline to Streaming
PPTX
10 tips for making Bash a sane programming language
PDF
Building Stateful Microservices With Akka
PDF
Querying Data Pipeline with AWS Athena
PDF
Why Actor-Based Systems Are The Best For Microservices
PPTX
Why actor-based systems are the best for microservices
PPTX
Building Eventing Systems for Microservice Architecture
PPTX
Быстрая и безболезненная разработка клиентской части веб-приложений
Dynamic Change Data Capture with Flink CDC and Consistent Hashing
Streaming SQL for Data Engineers: The Next Big Thing?
Apache Flink Adoption at Shopify
Storing State Forever: Why It Can Be Good For Your Analytics
It's Time To Stop Using Lambda Architecture
Bravo Six, Going Realtime. Transitioning Activision Data Pipeline to Streaming
10 tips for making Bash a sane programming language
Building Stateful Microservices With Akka
Querying Data Pipeline with AWS Athena
Why Actor-Based Systems Are The Best For Microservices
Why actor-based systems are the best for microservices
Building Eventing Systems for Microservice Architecture
Быстрая и безболезненная разработка клиентской части веб-приложений

Recently uploaded (20)

PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PPTX
Introduction to Artificial Intelligence
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PDF
top salesforce developer skills in 2025.pdf
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PPTX
ManageIQ - Sprint 268 Review - Slide Deck
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PPTX
history of c programming in notes for students .pptx
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
System and Network Administraation Chapter 3
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
AI in Product Development-omnex systems
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
Introduction to Artificial Intelligence
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
Design an Analysis of Algorithms I-SECS-1021-03
top salesforce developer skills in 2025.pdf
Operating system designcfffgfgggggggvggggggggg
2025 Textile ERP Trends: SAP, Odoo & Oracle
ManageIQ - Sprint 268 Review - Slide Deck
VVF-Customer-Presentation2025-Ver1.9.pptx
Which alternative to Crystal Reports is best for small or large businesses.pdf
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
history of c programming in notes for students .pptx
Design an Analysis of Algorithms II-SECS-1021-03
Softaken Excel to vCard Converter Software.pdf
System and Network Administraation Chapter 3
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
CHAPTER 2 - PM Management and IT Context
AI in Product Development-omnex systems

Actors or Not: Async Event Architectures

  • 1. Actors or Not Async Event Architectures Yaroslav Tkachenko Senior Software Engineer at Demonware (Activision)
  • 2. Background • 10 years in the industry • ~1 year at Demonware/Activision, 5 years at Bench Accounting • Mostly web, back-end, platform, infrastructure and data things • @sap1ens / sap1ens.com • Talk to me about data pipelines, stream processing and the Premier League ;-)
  • 4. Context: sync vs async communication Service A Service B POST /foo service-b.example.com “Easy” way – HTTP (RPC) API
  • 5. Context: sync vs async communication • Destination – where to send request? • Service discovery • Tight coupling • Time – expect reply right away? • Failure – always expect success? • Retries • Back-pressure • Circuit breakers
  • 6. You cannot make synchronous requests over the network behave like local ones
  • 7. Context: async communication styles • Point-to-Point Channel • One sender • One receiver • Publish-Subscribe Channel (Broadcast) • One publisher • Multiple subscribers
  • 8. Context: Events vs Commands • Event • Simply a notification that something happened in the past • Command • Request to invoke some functionality (“RPC over messaging”)
  • 12. • 469+ million gamers • 3.2+ million concurrent online gamers • 100+ games • 300,000 requests per second at peak • Average query response time of <.02 second • 630,000+ metrics a minute • 132 billion+ API calls per month Demonware by the numbers
  • 13. • Core game services including: • Auth • Matchmaking • Leaderboards • Marketplace • Loot & Rewards • Storage • Etc. • Erlang for networking layer, Python for application layer • Still have a big application monolith, but slowly migrating to independent services (SOA) Demonware Back-end Services
  • 14. • Lots of synchronous request/response communication between the monolith and the services using: • HTTP • RPC • The requesting process: • conceptually knows which service it wants to call into • is aware of the action that it is requesting, and its effects • generally needs to be notified of the request’s completion and any associated information before proceeding with its business logic DW Services: Synchronous communication
  • 15. • Using Domain Events • Communication model assumes the following: • The event may need to be handled by zero or more service processes, each with different use cases; the process that generates the event does not need to be aware of them • The process that generates the event does not need to be aware of what actions will be triggered, and what their effects might be • The process that generates the event does not need to be notified of the handlers’ completion before proceeding with its business logic • Seamless integration with the Data Pipeline / Warehouse DW Services: Asynchronous communication*
  • 17. Kafka
  • 18. Kafka Publish-Subscribe OR Point-to-Point is a decision made by consumers
  • 19. Kafka • Service name is used as a topic name in Kafka • Services have to explicitly subscribe to interested topics on startup (some extra filtering is also supported) • All messages are typically partitioned by a user ID to preserve order
  • 20. Event Dispatcher Application Core Event Dispatcher Kafka topic Partitions Kafka Python Consumer (librdkafka) Local buffer queue queue queue Tornado Queues
  • 21. Event Dispatcher 1 @demonata.event.source( 2 name='events_from_service_a' 3 ) 4 class ServiceAEventsDispatcher (object): 5 def __init__(self, my_app_service): 6 self._app = my_app_service 7 8 @demonata.event.schema( 9 name='service.UserUpdated' , 10 ge_version= '1.2.3', 11 event_dto=UserUpdated 12 ) 13 def on_user_updated (self, message, event): 14 assert isinstance(message, DwPublishedEvent) 15 # ...
  • 22. Publishing Events The following reliability modes are supported: • Fire and forget, relying on Kafka producer (acks = 0, 1, all) • At least once (guaranteed), using remote EventStore backed by a DB • At least once (intermediate), using local EventStore
  • 23. Event Publisher Application Core Event Publisher Kafka topic PartitionsKafka Python Producer (librdkafka) Event Store Event Producer
  • 24. Publishing Events 1 @demonata.coroutine 2 def handle_event_atomically (self, event_to_process): 3 entity_key = self. determine_entity_key (event_to_process) 4 entity = self.db. read(entity_key) 5 6 some_data = yield self.perform_some_async_io_read () 7 new_entity, new_event = self. apply_business_logic ( 8 entity, event_to_process, some_data 9 ) 10 11 # single-shard MySQL transaction: 12 with self.db. trans(shard_key=entity_key): 13 db.save(new_entity) 14 self.publisher. publish(new_event) 15 commit()
  • 25. Event Framework in Demonware • Decorator-driven consumers using callbacks • Reliable producers • Non-blocking IO using Tornado • Apache Kafka as a transport
  • 26. But still… Can we do better?
  • 27. 1 @demonata.event.source( 2 name='events_from_service_a' 3 ) 4 class ServiceAEventsDispatcher (object): 5 def __init__(self, my_app_service): 6 self._app = my_app_service 7 8 @demonata.event.schema( 9 name='service.UserUpdated' , 10 ge_version= '1.2.3', 11 event_dto=UserUpdated 12 ) 13 def on_user_updated (self, message, event): 14 assert isinstance(message, DwPublishedEvent) 15 # ... Event Dispatcher This is just a boilerplate Callback that should pass an event to the actual application
  • 28. Can we create producers and consumers that support message-passing natively?
  • 29. Actors • Communicate with asynchronous messages instead of method invocations • Manage their own state • When responding to a message, can: • Create other (child) actors • Send messages to other actors • Stop (child) actors or themselves
  • 31. Actors: Erlang 1 loop() -> 2 receive 3 {From, Msg} -> 4 io:format("received ~p~n" , [Msg]), 5 6 From ! "got it"; 7 end.
  • 32. Actors: Akka 1 class MyActor extends Actor with ActorLogging { 2 def receive = { 3 case msg => { 4 log.info(s"received $msg" ) 5 6 sender() ! "got it" 7 } 8 } 9 }
  • 33. Actor-to-Actor communication • Asynchronous and non-blocking message-passing • Doesn’t mean senders must wait indefinitely - timeouts can be used • Location transparency • Enterprise Integration Patterns!
  • 35. Bench Accounting Online Services • Classic SAAS application used by the customers and internal bookkeepers: • Double-entry bookkeeping with sophisticated reconciliation engine and reporting [no external software] • Receipt collection and OCR • Integrations with banks, statement providers, Stripe, Shopify, etc. • Enterprise Java monolith transitioning to Scala microservices (with Akka) • Legacy event-based system built for notifications
  • 36. Bench Accounting Legacy Eventing • Multiple issues: • Designed for a few specific use-cases, schema is not extendable • Wasn’t built for microservices • Tight coupling • New requirements: • Introduce real-time messaging (web & mobile) • Add a framework for producing and consuming Domain Events and Commands (both point-to-point and broadcasts) • Otherwise very similar to the Demonware’s async communication model
  • 37. Bench Accounting Eventing System ActiveMQ Eventing service Service A Service B queue queue or topic IntegrationsEvent store
  • 39. ActiveMQ • Service name is used as a queue or topic name in ActiveMQ, but there is a also a topic for global events • Services can subscribe to interested queues or topics any time a new actor is created • Supports 3 modes of operations: • Point-to-Point channel using a queue (perfect for Commands) • Publish-Subscribe channel with guaranteed delivery using a Virtual topic • Global Publish-Subscribe channel with guaranteed delivery using a Virtual topic
  • 40. Secret sauce: Apache Camel • Integration framework that implements Enterprise Integration Patterns • akka-camel is an official Akka library (now deprecated, Alpakka is a modern alternative) • Can be used with any JVM language • “The most unknown coolest library out there”: JM (c)
  • 42. Event Listener 1 class CustomerService extends EventingConsumer { 2 def endpointUri = "activemq:Consumer.CustomerService.VirtualTopic.events" 3 4 def receive = { 5 case e: CamelMessage if e.isEvent && e.name == “some.event.name” => { 6 self ! DeleteAccount(e.clientId, sender()) 7 } 8 9 case DeleteAccount(clientId, originalSender) => { 10 // ... 11 } 12 } 13 }
  • 43. Event Sender akka-camel ActiveMQ queue or topic ActiveMQ ProducerActor
  • 44. Event Sender 1 // Broadcast 2 EventingClient 3 .buildSystemEvent (Event.BankError, userId, Component.ServiceA) 4 .send(true) 5 6 // Direct 7 EventingClient 8 .buildSystemEventWithAsset (Event.BankError, userId, Component.ServiceB) 9 .buildUrlAsset("http://example.com" ) 10 .sendDirect("reporting")
  • 46. Eventing Service So, we do we need this “router” service? • Routing is handled in one place • Lightweight consumers and producers • The same Event Store is used for all services
  • 47. Event framework in Bench Accounting • Actor-based consumers and producers using Apache Camel • Producer with ACKs • Non-blocking IO • Apache ActiveMQ as a transport
  • 49. So, Actors • Semantics is important! Natural message-passing in Actors is a huge advantage • Asynchronous communication and location transparency by default makes it easy to move actors between service boundaries • We could also talk about supervision hierarchies and “Let it crash” philosophy, excellent concurrency, networking features, etc… next time! You can start with basics
  • 50. Recommendations • Domain Driven Design and Enterprise Integration Patterns are great! • Understand your Domain space and choose the concepts you need to support: Events, Commands, Documents or all of them • Explicitly handle all possible failures. They will happen eventually • Event Stores can be used for so many things! Tracing and debugging, auditing, data analytics, etc. • Actors or not? It really depends. It’s possible to build asynchronous, non-blocking event frameworks in Java, Python, Node.js or a lot of the other languages, but actors are asynchronous and message-based by default
  • 51. Recommendations • Carefully choose the transport layer. Apache Kafka can handle an impressive scale, but many messaging features are missing / support just introduced • Understand what you need to optimize: latency or throughput. You might need to introduce multiple channels with different characteristics • Do you really need exactly-once semantics? • Message formats and schemas are extremely important! Choose binary formats (Protobuf, Avro) AND/OR make sure to use a schema registry and design a schema evolution strategy • Consider splitting your messages into an envelope (metadata) and a payload. Events and Commands could use the same envelope
  • 52. Challenges • We’re too attached to the synchronous request/response paradigm. It’s everywhere - in the libraries, frameworks, standards. It takes time to learn how to live in the asynchronous world • High coupling will kill you. Routing is not a problem when you have a handful of services (producers/consumers), but things get really complicated with 10+ services. Try to avoid coupling by using Events as much as possible and stay away from Commands unless you really need them • Managing a properly partitioned, replicated and monitored message broker cluster is still a non-trivial problem. Consider using managed services if your Ops resources are limited
  • 53. Challenges • It’s very straightforward to implement event-based communication for writes, but harder for reads. You’ll probably end up with some sort of DB denormalization, in-memory hash join tables, caching or all of the above • When you have dozens of producers and consumer scattered across the service it becomes challenging to see the full picture. State and sequence diagrams can help with capturing business use-cases, distributed tracing becomes almost a must-have • When things break you won’t notice them immediately without a proper monitoring and alerting. Considering covering all critical business use-cases first
  • 55. Thanks Davide Romani (Demonware) Pavel Rodionov (Bench Accounting)