SlideShare a Scribd company logo
Tim Spann
“Serverless
Streaming”
“Architecting Your First Event Driven Serverless
Streaming Application on K8”
Tim Spann
“Serverless
Streaming” Tim Spann DoK Day North America 2022 @ KubeCon
“Serverless
Streaming”
Timothy Spann
Developer Advocate
FLiP(N) Stack = Flink, Pulsar and NiFi Stack
Streaming Systems & Data Architecture Expert
Experience:
15+ years of experience with streaming technologies including Pulsar, Flink, Spark,
NiFi, Kafka, Big Data, Cloud, MXNet, IoT and more.
Today, he helps to grow the Pulsar community sharing rich technical knowledge and
experience at both global conferences and through individual conversations.
Tim Spann
“Serverless
Streaming” Tim Spann DoK Day North America 2022 @ KubeCon
“Serverless
Streaming”
FLiP Stack Weekly
This week in Apache Flink, Apache Pulsar, Apache
NiFi, Apache Spark, Elasticsearch and open source
friends.
https://bit.ly/32dAJft
Tim Spann
“Serverless
Streaming” Tim Spann
“Serverless
Streaming”
Apache Pulsar is a Cloud-Native Messaging
and Event-Streaming Platform.
DoK Day North America 2022 @ KubeCon
Tim Spann
“Serverless
Streaming”
Apache Pulsar adoption is
being driven by
organizations seeking
cloud-native architectures
and new uses cases.
Microservices
Apache Pulsar - Built for Containers / Modern Cloud
Cloud Native
Hybrid & Multi-Cloud Containers
DoK Day North America 2022 @ KubeCon
Tim Spann
“Serverless
Streaming”
Key Pulsar Concepts: Messaging vs
Streaming
Message Queueing - Queueing
systems are ideal for work queues
that do not require tasks to be
performed in a particular order.
Streaming - Streaming works
best in situations where the
order of messages is important.
DoK Day North America 2022 @ KubeCon
Tim Spann
“Serverless
Streaming”
Pulsar Cluster
CLUSTER
Global ZooKeeper
Configuration Store
ZK3
ZK2
ZK1
Local
ZooKeeper
Quorum
Bookie 0 Bookie 1 Bookie 2
BookKeeper
Bookie
Ensemble
Pulsar Broker 0 Pulsar Broker 1 Pulsar Broker 2
DoK Day North America 2022 @ KubeCon
Tim Spann
“Serverless
Streaming”
● Consume messages from one
or more Pulsar topics.
● Apply user-supplied
processing logic to each
message.
● Publish the results of the
computation to another topic.
● Support multiple
programming languages
(Java, Python, Go)
● Can leverage 3rd-party
libraries
Pulsar Functions
DoK Day North America 2022 @ KubeCon
Tim Spann
“Serverless
Streaming”
import java.util.function.Function;
public class MyFunction implements Function<String, String> {
public String apply(String input) {
return doBusinessLogic(input);
}
}
Entire Function
Why Pulsar
Functions?
Tim Spann
“Serverless
Streaming”
from pulsar import Function
from vaderSentiment.vaderSentiment import SentimentIntensityAnalyzer
import json
class Chat(Function):
def __init__(self):
pass
def process(self, input, context):
fields = json.loads(input)
sid = SentimentIntensityAnalyzer()
ss = sid.polarity_scores(fields["comment"])
row = { }
row['id'] = str(msg_id)
if ss['compound'] < 0.00:
row['sentiment'] = 'Negative'
else:
row['sentiment'] = 'Positive'
row['comment'] = str(fields["comment"])
json_string = json.dumps(row)
return json_string
Entire Function
Pulsar Python
NLP Function
https://github.com/tspannhw/pulsar-pychat-function
Tim Spann
“Serverless
Streaming”
Function Mesh
Pulsar Functions, along with Pulsar IO/Connectors, provide a powerful API for ingesting,
transforming, and outputting data.
Function Mesh, another StreamNative project, makes it easier for developers to create entire
applications built from sources, functions, and sinks all through a declarative API.
Tim Spann
“Serverless
Streaming”
K8 Deploy
Tim Spann
“Serverless
Streaming”
Function Execution
DoK Day North America 2022 @ KubeCon
Tim Spann
“Serverless
Streaming”
Apache Pulsar Kafka K8
https://docs.streamnative.io/platform/v1.3.0/quickstart
DoK Day North America 2022 @ KubeCon
Tim Spann
“Serverless
Streaming”
Webinar Series:
Building Microservices
with Pulsar
Watch now Read more
Learn how Pulsar Functions, can be
leveraged to build a message bus for
event-driven microservices
DoK Day North America 2022 @ KubeCon
Tim Spann
“Serverless
Streaming”
Apache Pulsar Resources
● https://pulsar.apache.org/docs/en/kubernetes-helm/
● https://pulsar.apache.org/charts
● https://streamnative.io/streamnativecloud/
● https://medium.com/@_oleksii_/how-to-deploy-apache-puls
ar-cluster-in-kubernetes-808ecdfd87
Academy.StreamNative.io
Tim Spann
“Serverless
Streaming”
Thank you!
DoK Day North America 2022 @ KubeCon
Tim Spann
“Serverless
Streaming”

More Related Content

PDF
Conf42 Python_ ML Enhanced Event Streaming Apps with Python Microservices
PDF
Apache Pulsar Development 101 with Python
PDF
Designing Event-Driven Applications with Apache NiFi, Apache Flink, Apache Sp...
PDF
Serverless Event Streaming Applications as Functionson K8
PPT
Apache Spark™ + IBM Watson + Twitter DataPalooza SF 2015
PDF
Fast Streaming into Clickhouse with Apache Pulsar
PDF
OSA Con 2022: Streaming Data Made Easy
PDF
OSA Con 2022 - Streaming Data Made Easy - Tim Spann & David Kjerrumgaard - St...
Conf42 Python_ ML Enhanced Event Streaming Apps with Python Microservices
Apache Pulsar Development 101 with Python
Designing Event-Driven Applications with Apache NiFi, Apache Flink, Apache Sp...
Serverless Event Streaming Applications as Functionson K8
Apache Spark™ + IBM Watson + Twitter DataPalooza SF 2015
Fast Streaming into Clickhouse with Apache Pulsar
OSA Con 2022: Streaming Data Made Easy
OSA Con 2022 - Streaming Data Made Easy - Tim Spann & David Kjerrumgaard - St...

Similar to [DoKDayNA2022] - Architecting Your First Event Driven Serverless Streaming Applications on K8 (20)

PDF
[Conf42-KubeNative] Building Real-time Pulsar Apps on K8
PDF
Internationalizing The New York Times
PDF
Using FLiP with influxdb for edgeai iot at scale 2022
PDF
Using FLiP with InfluxDB for EdgeAI IoT at Scale 2022
PDF
SpringBoot and Spring Cloud Service for MSA
PDF
Using Data Science & Serverless Python to find apartment in Toronto
PPTX
Microservices with kubernetes @190316
PDF
CODEONTHEBEACH_Streaming Applications with Apache Pulsar
PDF
Python Streaming Pipelines on Flink - Beam Meetup at Lyft 2019
PDF
Taylor Wicksell and Tom Gianos at SpringOne Platform 2019
PPTX
Bringing the Power and Familiarity of .NET, C# and F# to Big Data Processing ...
PDF
Sink Your Teeth into Streaming at Any Scale
PDF
Sink Your Teeth into Streaming at Any Scale
PDF
ApacheCon 2021 Apache Deep Learning 302
PDF
Software Delivery at Warp Speed: Five Essential Techniques
PPTX
How APIs are Transforming Cisco Solutions and Catalyzing an Innovation Ecosystem
PPTX
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
PDF
Apache Spark Streaming in K8s with ArgoCD & Spark Operator
PPTX
Big Data Processing with .NET and Spark (SQLBits 2020)
PDF
Big mountain data and dev conference apache pulsar with mqtt for edge compu...
[Conf42-KubeNative] Building Real-time Pulsar Apps on K8
Internationalizing The New York Times
Using FLiP with influxdb for edgeai iot at scale 2022
Using FLiP with InfluxDB for EdgeAI IoT at Scale 2022
SpringBoot and Spring Cloud Service for MSA
Using Data Science & Serverless Python to find apartment in Toronto
Microservices with kubernetes @190316
CODEONTHEBEACH_Streaming Applications with Apache Pulsar
Python Streaming Pipelines on Flink - Beam Meetup at Lyft 2019
Taylor Wicksell and Tom Gianos at SpringOne Platform 2019
Bringing the Power and Familiarity of .NET, C# and F# to Big Data Processing ...
Sink Your Teeth into Streaming at Any Scale
Sink Your Teeth into Streaming at Any Scale
ApacheCon 2021 Apache Deep Learning 302
Software Delivery at Warp Speed: Five Essential Techniques
How APIs are Transforming Cisco Solutions and Catalyzing an Innovation Ecosystem
Exploring the Final Frontier of Data Center Orchestration: Network Elements -...
Apache Spark Streaming in K8s with ArgoCD & Spark Operator
Big Data Processing with .NET and Spark (SQLBits 2020)
Big mountain data and dev conference apache pulsar with mqtt for edge compu...

More from Timothy Spann (20)

PDF
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
PDF
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
PDF
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
PDF
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
PDF
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
PDF
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
PDF
TSPANN-2024-Nov-CloudX-Adding Generative AI to Real-Time Streaming Pipelines
PDF
2024-Nov-BuildStuff-Adding Generative AI to Real-Time Streaming Pipelines
PDF
14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipe...
PDF
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
PPTX
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
PDF
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
PDF
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
PDF
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
PDF
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
PDF
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
PDF
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
PDF
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
PDF
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
PDF
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
TSPANN-2024-Nov-CloudX-Adding Generative AI to Real-Time Streaming Pipelines
2024-Nov-BuildStuff-Adding Generative AI to Real-Time Streaming Pipelines
14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipe...
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
01-Oct-2024_PES-VectorDatabasesAndAI.pdf

Recently uploaded (20)

PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
Current and future trends in Computer Vision.pptx
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
Safety Seminar civil to be ensured for safe working.
PPTX
Construction Project Organization Group 2.pptx
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Artificial Intelligence
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
Foundation to blockchain - A guide to Blockchain Tech
CH1 Production IntroductoryConcepts.pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Current and future trends in Computer Vision.pptx
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Safety Seminar civil to be ensured for safe working.
Construction Project Organization Group 2.pptx
Model Code of Practice - Construction Work - 21102022 .pdf
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
Operating System & Kernel Study Guide-1 - converted.pdf
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Artificial Intelligence
Automation-in-Manufacturing-Chapter-Introduction.pdf
CYBER-CRIMES AND SECURITY A guide to understanding
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Foundation to blockchain - A guide to Blockchain Tech

[DoKDayNA2022] - Architecting Your First Event Driven Serverless Streaming Applications on K8

  • 1. Tim Spann “Serverless Streaming” “Architecting Your First Event Driven Serverless Streaming Application on K8”
  • 2. Tim Spann “Serverless Streaming” Tim Spann DoK Day North America 2022 @ KubeCon “Serverless Streaming” Timothy Spann Developer Advocate FLiP(N) Stack = Flink, Pulsar and NiFi Stack Streaming Systems & Data Architecture Expert Experience: 15+ years of experience with streaming technologies including Pulsar, Flink, Spark, NiFi, Kafka, Big Data, Cloud, MXNet, IoT and more. Today, he helps to grow the Pulsar community sharing rich technical knowledge and experience at both global conferences and through individual conversations.
  • 3. Tim Spann “Serverless Streaming” Tim Spann DoK Day North America 2022 @ KubeCon “Serverless Streaming” FLiP Stack Weekly This week in Apache Flink, Apache Pulsar, Apache NiFi, Apache Spark, Elasticsearch and open source friends. https://bit.ly/32dAJft
  • 4. Tim Spann “Serverless Streaming” Tim Spann “Serverless Streaming” Apache Pulsar is a Cloud-Native Messaging and Event-Streaming Platform. DoK Day North America 2022 @ KubeCon
  • 5. Tim Spann “Serverless Streaming” Apache Pulsar adoption is being driven by organizations seeking cloud-native architectures and new uses cases. Microservices Apache Pulsar - Built for Containers / Modern Cloud Cloud Native Hybrid & Multi-Cloud Containers DoK Day North America 2022 @ KubeCon
  • 6. Tim Spann “Serverless Streaming” Key Pulsar Concepts: Messaging vs Streaming Message Queueing - Queueing systems are ideal for work queues that do not require tasks to be performed in a particular order. Streaming - Streaming works best in situations where the order of messages is important. DoK Day North America 2022 @ KubeCon
  • 7. Tim Spann “Serverless Streaming” Pulsar Cluster CLUSTER Global ZooKeeper Configuration Store ZK3 ZK2 ZK1 Local ZooKeeper Quorum Bookie 0 Bookie 1 Bookie 2 BookKeeper Bookie Ensemble Pulsar Broker 0 Pulsar Broker 1 Pulsar Broker 2 DoK Day North America 2022 @ KubeCon
  • 8. Tim Spann “Serverless Streaming” ● Consume messages from one or more Pulsar topics. ● Apply user-supplied processing logic to each message. ● Publish the results of the computation to another topic. ● Support multiple programming languages (Java, Python, Go) ● Can leverage 3rd-party libraries Pulsar Functions DoK Day North America 2022 @ KubeCon
  • 9. Tim Spann “Serverless Streaming” import java.util.function.Function; public class MyFunction implements Function<String, String> { public String apply(String input) { return doBusinessLogic(input); } } Entire Function Why Pulsar Functions?
  • 10. Tim Spann “Serverless Streaming” from pulsar import Function from vaderSentiment.vaderSentiment import SentimentIntensityAnalyzer import json class Chat(Function): def __init__(self): pass def process(self, input, context): fields = json.loads(input) sid = SentimentIntensityAnalyzer() ss = sid.polarity_scores(fields["comment"]) row = { } row['id'] = str(msg_id) if ss['compound'] < 0.00: row['sentiment'] = 'Negative' else: row['sentiment'] = 'Positive' row['comment'] = str(fields["comment"]) json_string = json.dumps(row) return json_string Entire Function Pulsar Python NLP Function https://github.com/tspannhw/pulsar-pychat-function
  • 11. Tim Spann “Serverless Streaming” Function Mesh Pulsar Functions, along with Pulsar IO/Connectors, provide a powerful API for ingesting, transforming, and outputting data. Function Mesh, another StreamNative project, makes it easier for developers to create entire applications built from sources, functions, and sinks all through a declarative API.
  • 14. Tim Spann “Serverless Streaming” Apache Pulsar Kafka K8 https://docs.streamnative.io/platform/v1.3.0/quickstart DoK Day North America 2022 @ KubeCon
  • 15. Tim Spann “Serverless Streaming” Webinar Series: Building Microservices with Pulsar Watch now Read more Learn how Pulsar Functions, can be leveraged to build a message bus for event-driven microservices DoK Day North America 2022 @ KubeCon
  • 16. Tim Spann “Serverless Streaming” Apache Pulsar Resources ● https://pulsar.apache.org/docs/en/kubernetes-helm/ ● https://pulsar.apache.org/charts ● https://streamnative.io/streamnativecloud/ ● https://medium.com/@_oleksii_/how-to-deploy-apache-puls ar-cluster-in-kubernetes-808ecdfd87 Academy.StreamNative.io
  • 17. Tim Spann “Serverless Streaming” Thank you! DoK Day North America 2022 @ KubeCon Tim Spann “Serverless Streaming”