SlideShare a Scribd company logo
Using Hazelcast as the Serving Layer in the
Kappa Architecture
Presented by Oliver Buckley-Salmon
June 1st 2017
Twitter: @SalmonOliver
Github: https://github.com/oliversalmon
LinkedIn: Oliver Buckley-Salmon
Introduction
• Many Industries have a combined need to view and process big and fast data
• Previously tools such as Hadoop allowed the processing of large data sets but at high latency and stream processing systems processed small amounts
of data very fast
• Recently new architectures have been suggested to combine both of these to provide a single solution for big and fast data, with a couple of the most
well known below
• Lambda Architecture
• Nathan Marz came up with the term Lambda Architecture (LA) for a generic, scalable and fault-tolerant data processing architecture, based on his experience working on
distributed data processing systems at Backtype and Twitter
• The LA aims to satisfy the needs for a robust system that is fault-tolerant, both against hardware failures and human mistakes, being able to serve a wide range of
workloads and use cases, and in which low-latency reads and updates are required
• The resulting system should be linearly scalable, and it should scale out rather than up
• Kappa Architecture
• Kappa Architecture is a simplification of Lambda Architecture
• A Kappa Architecture system is like a Lambda Architecture system with the batch processing system removed
• To replace batch processing, data is simply fed through the streaming system quickly
Lambda Architecture Overview
Benefits & Challenges with Lambda Architecture
Benefit Challenge
Real-time view & analysis of latest data Synchronisation between Speed & Batch layers
Support historical data queries & analytics Analytics only, not operational/transactional
Horizontally scalable speed layer 2 separate sub systems for microservices to read from depending on life cycle
Horizontally scalable batch layer Heavy focus on HDFS / Storage / format optimization
Allows use of Hadoop ecosystem for batch processing & analytics Unpredictable latency of batch layer
Fault tolerant
Kappa Architecture Overview
Benefits & Challenges with Kappa Architecture
Benefit Challenge
Real-time view & analysis of latest data Allowing serving layer to replay from Kafka on demand to support historical
queries on demand
Single view of data (serving layer only) Latency and reprocessing required for historical queries
Horizontally scalable serving layer Analytics only, not operational/transactional
Horizontally scalable distributed log layer (Kafka) Hard sell to convince management that Kafka log is the database!
Horizontally Stream Processing layer Kafka log sizes
Fault tolerant Doesn’t leverage Hadoop ecosystem for large scale analytics
Support historical data queries & analytics through Kafka replays into Serving
layer
Allows the Stream Computation layer to do the heavy lifting
Fewer moving parts than Lambda Architecture
Simpler programming model – everything is a stream
Hazelcast in the Kappa Architecture
Introduction to Mu Architecture
• Many Industries have a combined need to view and process big and fast data. The Lambda & Kappa architectures solve the Big & Fast data problem but only for
analytics
• Traditionally there would be two separate architectures, one for OLTP one for OLAP
• Modern software allows us to combine the two into a single platform
• No need for complex ETL or ELT
• No delay for transactional data to be available for analytics
• Real-time reactive microservices and transaction processing
• Massively horizontally scalable
• Cloud ready
• By combining big data technology with in-memory technology the Mu architecture offers all of the above in an architecture that fits on one slide
Mu Architecture Overview
Mu Architecture Demo – Work In Progress
Follow progress or join it at https://github.com/oliversalmon/imcs-demo
Summary
• Many Industries have a combined need to view and process big and fast data
• Previously tools such as Hadoop allowed the processing of large data sets but at high latency and stream processing systems processed small amounts of data
very fast
• The Lambda & Kappa architectures allow real-time analytics
• In memory computing technology such as Hazelcast IMDG & Hazelcast Jet, combined with big data technologies, allows us to process vast volumes of
unbounded data fast
• The Mu architecture takes the best of both of the Kappa & Lambda architectures to produce a combined real-time OLTP & OLAP solution

More Related Content

PDF
ASPgems - kappa architecture
PPTX
Speed layer : Real time views in LAMBDA architecture
PPTX
Realtime streaming architecture in INFINARIO
PDF
Lambda architecture @ Indix
PDF
Apache spark y cómo lo usamos en nuestros proyectos
PDF
Continuous delivery for machine learning
PDF
Cloud Lambda Architecture Patterns
PPTX
Kappa Architecture on Apache Kafka and Querona: datamass.io
ASPgems - kappa architecture
Speed layer : Real time views in LAMBDA architecture
Realtime streaming architecture in INFINARIO
Lambda architecture @ Indix
Apache spark y cómo lo usamos en nuestros proyectos
Continuous delivery for machine learning
Cloud Lambda Architecture Patterns
Kappa Architecture on Apache Kafka and Querona: datamass.io

What's hot (20)

PPTX
The evolution of the big data platform @ Netflix (OSCON 2015)
PDF
Extracting Insights from Data at Twitter
PPTX
Lambda architecture: from zero to One
PDF
Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, Vectorized
PDF
Modern ETL Pipelines with Change Data Capture
PDF
Lambda architecture
PPTX
Implementing the Lambda Architecture efficiently with Apache Spark
PPTX
Lambda architecture with Spark
PPTX
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
PDF
It's Time To Stop Using Lambda Architecture | Yaroslav Tkachenko, Shopify
PPTX
Taboola Road To Scale With Apache Spark
ODP
Javantura v3 - Real-time BigData ingestion and querying of aggregated data – ...
PPTX
Cloud native data platform
PDF
Case Study: Stream Processing on AWS using Kappa Architecture
PPTX
Gluent Extending Enterprise Applications with Hadoop
ODP
Kick-Start with SMACK Stack
PDF
Introduction to Data Engineer and Data Pipeline at Credit OK
PPTX
Netflix Big Data Paris 2017
PDF
An End-to-End Spark-Based Machine Learning Stack in the Hybrid Cloud with Far...
PPTX
Getting It Right Exactly Once: Principles for Streaming Architectures
The evolution of the big data platform @ Netflix (OSCON 2015)
Extracting Insights from Data at Twitter
Lambda architecture: from zero to One
Data Policies for the Kafka-API with WebAssembly | Alexander Gallego, Vectorized
Modern ETL Pipelines with Change Data Capture
Lambda architecture
Implementing the Lambda Architecture efficiently with Apache Spark
Lambda architecture with Spark
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
It's Time To Stop Using Lambda Architecture | Yaroslav Tkachenko, Shopify
Taboola Road To Scale With Apache Spark
Javantura v3 - Real-time BigData ingestion and querying of aggregated data – ...
Cloud native data platform
Case Study: Stream Processing on AWS using Kappa Architecture
Gluent Extending Enterprise Applications with Hadoop
Kick-Start with SMACK Stack
Introduction to Data Engineer and Data Pipeline at Credit OK
Netflix Big Data Paris 2017
An End-to-End Spark-Based Machine Learning Stack in the Hybrid Cloud with Far...
Getting It Right Exactly Once: Principles for Streaming Architectures
Ad

Similar to Using Hazelcast in the Kappa architecture (20)

PPTX
Big Data_Architecture.pptx
PDF
Kappa vs Lambda Architectures and Technology Comparison
PDF
Big Data Computing Architecture
PDF
Big data real time architectures
PPTX
Pacemaker hadoop infrastructure and soft serve experience
PPTX
Hadoop Infrastructure and SoftServe Experience by Vitaliy Bashun, Data Architect
PDF
An overview of modern scalable web development
PPTX
2014 09-12 lambda-architecture-at-indix
PDF
Streaming Analytics with Spark, Kafka, Cassandra and Akka
PDF
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena Edelson
PDF
Architecting Agile Data Applications for Scale
PDF
Creating a Modern Data Architecture for Digital Transformation
PDF
The Web Scale
PDF
Big Data Architecture
PDF
AI&BigData Lab 2016. Сарапин Виктор: Размер имеет значение: анализ по требова...
PPTX
The Evolution of Data Engineering Emerging Trends and Scalable Architecture D...
PDF
Hadoop Ecosystem and Low Latency Streaming Architecture
PPTX
Hadoop as data refinery
PPTX
Hadoop as Data Refinery - Steve Loughran
PPTX
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
Big Data_Architecture.pptx
Kappa vs Lambda Architectures and Technology Comparison
Big Data Computing Architecture
Big data real time architectures
Pacemaker hadoop infrastructure and soft serve experience
Hadoop Infrastructure and SoftServe Experience by Vitaliy Bashun, Data Architect
An overview of modern scalable web development
2014 09-12 lambda-architecture-at-indix
Streaming Analytics with Spark, Kafka, Cassandra and Akka
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena Edelson
Architecting Agile Data Applications for Scale
Creating a Modern Data Architecture for Digital Transformation
The Web Scale
Big Data Architecture
AI&BigData Lab 2016. Сарапин Виктор: Размер имеет значение: анализ по требова...
The Evolution of Data Engineering Emerging Trends and Scalable Architecture D...
Hadoop Ecosystem and Low Latency Streaming Architecture
Hadoop as data refinery
Hadoop as Data Refinery - Steve Loughran
From Insights to Value - Building a Modern Logical Data Lake to Drive User Ad...
Ad

Recently uploaded (20)

PDF
medical staffing services at VALiNTRY
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
Nekopoi APK 2025 free lastest update
PPTX
Odoo POS Development Services by CandidRoot Solutions
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PPTX
L1 - Introduction to python Backend.pptx
PPTX
Operating system designcfffgfgggggggvggggggggg
PPTX
history of c programming in notes for students .pptx
PPTX
Introduction to Artificial Intelligence
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
How Creative Agencies Leverage Project Management Software.pdf
PPTX
ai tools demonstartion for schools and inter college
PDF
System and Network Administration Chapter 2
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PDF
top salesforce developer skills in 2025.pdf
PPTX
Essential Infomation Tech presentation.pptx
PDF
PTS Company Brochure 2025 (1).pdf.......
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
medical staffing services at VALiNTRY
Adobe Illustrator 28.6 Crack My Vision of Vector Design
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Nekopoi APK 2025 free lastest update
Odoo POS Development Services by CandidRoot Solutions
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
L1 - Introduction to python Backend.pptx
Operating system designcfffgfgggggggvggggggggg
history of c programming in notes for students .pptx
Introduction to Artificial Intelligence
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
How Creative Agencies Leverage Project Management Software.pdf
ai tools demonstartion for schools and inter college
System and Network Administration Chapter 2
Upgrade and Innovation Strategies for SAP ERP Customers
top salesforce developer skills in 2025.pdf
Essential Infomation Tech presentation.pptx
PTS Company Brochure 2025 (1).pdf.......
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises

Using Hazelcast in the Kappa architecture

  • 1. Using Hazelcast as the Serving Layer in the Kappa Architecture Presented by Oliver Buckley-Salmon June 1st 2017 Twitter: @SalmonOliver Github: https://github.com/oliversalmon LinkedIn: Oliver Buckley-Salmon
  • 2. Introduction • Many Industries have a combined need to view and process big and fast data • Previously tools such as Hadoop allowed the processing of large data sets but at high latency and stream processing systems processed small amounts of data very fast • Recently new architectures have been suggested to combine both of these to provide a single solution for big and fast data, with a couple of the most well known below • Lambda Architecture • Nathan Marz came up with the term Lambda Architecture (LA) for a generic, scalable and fault-tolerant data processing architecture, based on his experience working on distributed data processing systems at Backtype and Twitter • The LA aims to satisfy the needs for a robust system that is fault-tolerant, both against hardware failures and human mistakes, being able to serve a wide range of workloads and use cases, and in which low-latency reads and updates are required • The resulting system should be linearly scalable, and it should scale out rather than up • Kappa Architecture • Kappa Architecture is a simplification of Lambda Architecture • A Kappa Architecture system is like a Lambda Architecture system with the batch processing system removed • To replace batch processing, data is simply fed through the streaming system quickly
  • 4. Benefits & Challenges with Lambda Architecture Benefit Challenge Real-time view & analysis of latest data Synchronisation between Speed & Batch layers Support historical data queries & analytics Analytics only, not operational/transactional Horizontally scalable speed layer 2 separate sub systems for microservices to read from depending on life cycle Horizontally scalable batch layer Heavy focus on HDFS / Storage / format optimization Allows use of Hadoop ecosystem for batch processing & analytics Unpredictable latency of batch layer Fault tolerant
  • 6. Benefits & Challenges with Kappa Architecture Benefit Challenge Real-time view & analysis of latest data Allowing serving layer to replay from Kafka on demand to support historical queries on demand Single view of data (serving layer only) Latency and reprocessing required for historical queries Horizontally scalable serving layer Analytics only, not operational/transactional Horizontally scalable distributed log layer (Kafka) Hard sell to convince management that Kafka log is the database! Horizontally Stream Processing layer Kafka log sizes Fault tolerant Doesn’t leverage Hadoop ecosystem for large scale analytics Support historical data queries & analytics through Kafka replays into Serving layer Allows the Stream Computation layer to do the heavy lifting Fewer moving parts than Lambda Architecture Simpler programming model – everything is a stream
  • 7. Hazelcast in the Kappa Architecture
  • 8. Introduction to Mu Architecture • Many Industries have a combined need to view and process big and fast data. The Lambda & Kappa architectures solve the Big & Fast data problem but only for analytics • Traditionally there would be two separate architectures, one for OLTP one for OLAP • Modern software allows us to combine the two into a single platform • No need for complex ETL or ELT • No delay for transactional data to be available for analytics • Real-time reactive microservices and transaction processing • Massively horizontally scalable • Cloud ready • By combining big data technology with in-memory technology the Mu architecture offers all of the above in an architecture that fits on one slide
  • 10. Mu Architecture Demo – Work In Progress Follow progress or join it at https://github.com/oliversalmon/imcs-demo
  • 11. Summary • Many Industries have a combined need to view and process big and fast data • Previously tools such as Hadoop allowed the processing of large data sets but at high latency and stream processing systems processed small amounts of data very fast • The Lambda & Kappa architectures allow real-time analytics • In memory computing technology such as Hazelcast IMDG & Hazelcast Jet, combined with big data technologies, allows us to process vast volumes of unbounded data fast • The Mu architecture takes the best of both of the Kappa & Lambda architectures to produce a combined real-time OLTP & OLAP solution