SlideShare a Scribd company logo
streamnative.io
Who am I?
● Sijie Guo (@sijieg)
● Co-Founder & CEO, StreamNative
● PMC Member of Pulsar/BookKeeper
● Ex Co-Founder, Streamlio
● Ex-Twitter, Ex-Yahoo
● Work on messaging and streaming data
technologies for many years
streamnative.io
About StreamNative
● San Francisco based company
● Founded by the original developers of
Apache Pulsar and Apache BookKeeper
● Offer a cloud-native event streaming
platform powered by Pulsar
● Provide fully managed Pulsar services from
public cloud to on-premises
streamnative.io
Event Streaming with
Apache Pulsar
Sijie Guo (@sijie)
streamnative.io
streamnative.io
Event Streaming Platform
● Connect
○ Publish and subscribe to event streams
● Store
○ Store infinite event streams reliably
● Process
○ Process event streams in real-time at scale
streamnative.io
Why Pulsar?
streamnative.io
Apache Pulsar
● Publish-subscribe: unified messaging model (streaming + queueing)
● Infinite event stream storage: Apache BookKeeper + Tiered Storage
● Connectors: ingest events without writing code
● Process events in real-time
○ Pulsar Functions for serverless / lightweight computation
○ Spark / Flink for unified data processing
○ Presto for interactive queries
streamnative.io
Event Streaming Platform
streamnative.io
● Connect
○ Protocol Handler
■ Kafka-on-Pulsar
■ AMQP-on-Pulsar
● Process
○ Unified data processing
■ Spark-Pulsar Integration
■ Flink-Pulsar Integration
StreamNative Extensibilities
streamnative.io
Why Kafka-on-Pulsar
● Kafka - Rich ecosystem of connectors and integrations
● Pulsar
○ Infinite stream storage
○ Pulsar Functions
● Adoption & Migration
● Real-time data pipelines
○ Ingesting data using Kafka Connect
○ Processing data using Pulsar Functions
streamnative.io
Kafka-on-Pulsar Protocol Handler
streamnative.io
Kafka-on-Pulsar
● Support Kafka 2.0+ protocols
● Support official java client, librdkafka, and non-java clients
● Multi-tenancy support
● SSL + SASL support
● It is a plugin and you can install it to your existing Pulsar
cluster!
streamnative.io
Demo time!
streamnative.io
Demo
● K/P-Producer -> K/P-Consumer
○ TLS & SASL-PLAIN
● Kafka Connect -> Pulsar Functions
* All demos run with TLS and SASL-PLAIN
https://hackmd.io/nLj5M9BEQIacKcZsNrDxmQ
streamnative.io
Demo 1: K-Producer -> K-Consumer
streamnative.io
Demo 2: P-Producer -> K-Consumer
streamnative.io
Demo 3: K-Producer -> P-Consumer
streamnative.io
Demo 4: Kafka Connect
streamnative.io
Demo 5: Pulsar Functions
streamnative.io
2020 Pulsar User Survey Report
https://streamnative.io/whitepaper/sn-apache-pulsar-user-survey-report-2020/
streamnative.io
TGI Pulsar Weekly Live Stream
https://www.youtube.com/channel/UCywxUI5HlIyc0VEKYR4X9Pg/live
streamnative.io
Follow us!
● KoP - https://github.com/streamnative/kop
● Webinar - https://www.youtube.com/watch?v=gL6hzRtij8M
● Follow us at Twitter
○ Sijie Guo (@sijieg)
○ Apache Pulsar (@apache_pulsar)
○ StreamNative (@streamnativeio)
● Join us at Pulsar slack!

More Related Content

PDF
When apache pulsar meets apache flink
PDF
Elastic Data Processing with Apache Flink and Apache Pulsar
PDF
Integrating Apache Pulsar with Big Data Ecosystem
PDF
A Unified Platform for Real-time Storage and Processing
PDF
Query Pulsar Streams using Apache Flink
PDF
Unify Storage Backend for Batch and Streaming Computation with Apache Pulsar_...
PDF
Serverless Event Streaming with Pulsar Functions
PDF
Introducing Kafka-on-Pulsar: bring native Kafka protocol support to Apache Pu...
When apache pulsar meets apache flink
Elastic Data Processing with Apache Flink and Apache Pulsar
Integrating Apache Pulsar with Big Data Ecosystem
A Unified Platform for Real-time Storage and Processing
Query Pulsar Streams using Apache Flink
Unify Storage Backend for Batch and Streaming Computation with Apache Pulsar_...
Serverless Event Streaming with Pulsar Functions
Introducing Kafka-on-Pulsar: bring native Kafka protocol support to Apache Pu...

What's hot (20)

PDF
Stream or segment : what is the best way to access your events in Pulsar_Neng
PDF
Code the docs-yu liu
PDF
Apache Pulsar at Yahoo! Japan
PDF
How Orange Financial combat financial frauds over 50M transactions a day usin...
PDF
Transaction preview of Apache Pulsar
PDF
Apache Pulsar: A borderless community
PPTX
Architecture of a Kafka camus infrastructure
PDF
Select Star: Flink SQL for Pulsar Folks - Pulsar Summit NA 2021
PDF
Open keynote_carolyn&matteo&sijie
PDF
Hail hydrate! from stream to lake using open source
PDF
Big data conference europe real-time streaming in any and all clouds, hybri...
PDF
Five years of operating a large scale globally replicated Pulsar installation...
PDF
Real time cloud native open source streaming of any data to apache solr
PPTX
Interactive Analytics on Pulsar with Pulsar SQL - Pulsar Virtual Summit Europ...
PDF
Kafka on Pulsar
PDF
Building a distributed Key-Value store with Cassandra
PPTX
Matt Franklin - Apache Software (Geekfest)
PDF
Using the FLiPN Stack for Edge AI (Flink, NiFi, Pulsar) - Pulsar Summit Asia ...
PDF
Cloud lunch and learn real-time streaming in azure
PDF
Big mountain data and dev conference apache pulsar with mqtt for edge compu...
Stream or segment : what is the best way to access your events in Pulsar_Neng
Code the docs-yu liu
Apache Pulsar at Yahoo! Japan
How Orange Financial combat financial frauds over 50M transactions a day usin...
Transaction preview of Apache Pulsar
Apache Pulsar: A borderless community
Architecture of a Kafka camus infrastructure
Select Star: Flink SQL for Pulsar Folks - Pulsar Summit NA 2021
Open keynote_carolyn&matteo&sijie
Hail hydrate! from stream to lake using open source
Big data conference europe real-time streaming in any and all clouds, hybri...
Five years of operating a large scale globally replicated Pulsar installation...
Real time cloud native open source streaming of any data to apache solr
Interactive Analytics on Pulsar with Pulsar SQL - Pulsar Virtual Summit Europ...
Kafka on Pulsar
Building a distributed Key-Value store with Cassandra
Matt Franklin - Apache Software (Geekfest)
Using the FLiPN Stack for Edge AI (Flink, NiFi, Pulsar) - Pulsar Summit Asia ...
Cloud lunch and learn real-time streaming in azure
Big mountain data and dev conference apache pulsar with mqtt for edge compu...
Ad

Similar to Building event streaming pipelines using Apache Pulsar (20)

PDF
Designing Event-Driven Applications with Apache NiFi, Apache Flink, Apache Sp...
PDF
Using FLiP with influxdb for edgeai iot at scale 2022
PDF
Using FLiP with InfluxDB for EdgeAI IoT at Scale 2022
PDF
DBCC 2021 - FLiP Stack for Cloud Data Lakes
PDF
[AI Dev World 2022] Build ML Enhanced Event Streaming
PDF
Data science online camp using the flipn stack for edge ai (flink, nifi, pu...
PDF
Timothy Spann [StreamNative] | Using FLaNK with InfluxDB for EdgeAI IoT at Sc...
PDF
Using FLiP with influxdb for EdgeAI IoT at Scale
PDF
Pulsar summit asia 2021 apache pulsar with mqtt for edge computing
PDF
FLiP Into Trino
PDF
Devfest uk & ireland using apache nifi with apache pulsar for fast data on-r...
PDF
Kafka on Pulsar:bringing native Kafka protocol support to Pulsar_Sijie&Pierre
PDF
Automation + dev ops summit hail hydrate! from stream to lake
PPTX
Current and Future of Apache Kafka
PDF
Activity feeds (and more) at mate1
PDF
Apache Pulsar with MQTT for Edge Computing - Pulsar Summit Asia 2021
PDF
The Netflix Way to deal with Big Data Problems
PPTX
Building an Event Streaming Architecture with Apache Pulsar
PDF
Ai dev world utilizing apache pulsar, apache ni fi and minifi for edgeai io...
PDF
Netflix keystone streaming data pipeline @scale in the cloud-dbtb-2016
Designing Event-Driven Applications with Apache NiFi, Apache Flink, Apache Sp...
Using FLiP with influxdb for edgeai iot at scale 2022
Using FLiP with InfluxDB for EdgeAI IoT at Scale 2022
DBCC 2021 - FLiP Stack for Cloud Data Lakes
[AI Dev World 2022] Build ML Enhanced Event Streaming
Data science online camp using the flipn stack for edge ai (flink, nifi, pu...
Timothy Spann [StreamNative] | Using FLaNK with InfluxDB for EdgeAI IoT at Sc...
Using FLiP with influxdb for EdgeAI IoT at Scale
Pulsar summit asia 2021 apache pulsar with mqtt for edge computing
FLiP Into Trino
Devfest uk & ireland using apache nifi with apache pulsar for fast data on-r...
Kafka on Pulsar:bringing native Kafka protocol support to Pulsar_Sijie&Pierre
Automation + dev ops summit hail hydrate! from stream to lake
Current and Future of Apache Kafka
Activity feeds (and more) at mate1
Apache Pulsar with MQTT for Edge Computing - Pulsar Summit Asia 2021
The Netflix Way to deal with Big Data Problems
Building an Event Streaming Architecture with Apache Pulsar
Ai dev world utilizing apache pulsar, apache ni fi and minifi for edgeai io...
Netflix keystone streaming data pipeline @scale in the cloud-dbtb-2016
Ad

More from StreamNative (20)

PDF
Is Using KoP (Kafka-on-Pulsar) a Good Idea? - Pulsar Summit SF 2022
PDF
Building an Asynchronous Application Framework with Python and Pulsar - Pulsa...
PDF
Blue-green deploys with Pulsar & Envoy in an event-driven microservice ecosys...
PDF
Distributed Database Design Decisions to Support High Performance Event Strea...
PDF
Simplify Pulsar Functions Development with SQL - Pulsar Summit SF 2022
PDF
Towards a ZooKeeper-less Pulsar, etcd, etcd, etcd. - Pulsar Summit SF 2022
PDF
Validating Apache Pulsar’s Behavior under Failure Conditions - Pulsar Summit ...
PDF
Cross the Streams! Creating Streaming Data Pipelines with Apache Flink + Apac...
PDF
Message Redelivery: An Unexpected Journey - Pulsar Summit SF 2022
PDF
Unlocking the Power of Lakehouse Architectures with Apache Pulsar and Apache ...
PDF
Understanding Broker Load Balancing - Pulsar Summit SF 2022
PDF
Building an Asynchronous Application Framework with Python and Pulsar - Pulsa...
PDF
Pulsar's Journey in Yahoo!: On-prem, Cloud and Hybrid - Pulsar Summit SF 2022
PDF
Event-Driven Applications Done Right - Pulsar Summit SF 2022
PDF
Pulsar @ Scale. 200M RPM and 1K instances - Pulsar Summit SF 2022
PDF
Data Democracy: Journey to User-Facing Analytics - Pulsar Summit SF 2022
PDF
Beam + Pulsar: Powerful Stream Processing at Scale - Pulsar Summit SF 2022
PDF
Welcome and Opening Remarks - Pulsar Summit SF 2022
PDF
Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
PDF
MoP(MQTT on Pulsar) - a Powerful Tool for Apache Pulsar in IoT - Pulsar Summi...
Is Using KoP (Kafka-on-Pulsar) a Good Idea? - Pulsar Summit SF 2022
Building an Asynchronous Application Framework with Python and Pulsar - Pulsa...
Blue-green deploys with Pulsar & Envoy in an event-driven microservice ecosys...
Distributed Database Design Decisions to Support High Performance Event Strea...
Simplify Pulsar Functions Development with SQL - Pulsar Summit SF 2022
Towards a ZooKeeper-less Pulsar, etcd, etcd, etcd. - Pulsar Summit SF 2022
Validating Apache Pulsar’s Behavior under Failure Conditions - Pulsar Summit ...
Cross the Streams! Creating Streaming Data Pipelines with Apache Flink + Apac...
Message Redelivery: An Unexpected Journey - Pulsar Summit SF 2022
Unlocking the Power of Lakehouse Architectures with Apache Pulsar and Apache ...
Understanding Broker Load Balancing - Pulsar Summit SF 2022
Building an Asynchronous Application Framework with Python and Pulsar - Pulsa...
Pulsar's Journey in Yahoo!: On-prem, Cloud and Hybrid - Pulsar Summit SF 2022
Event-Driven Applications Done Right - Pulsar Summit SF 2022
Pulsar @ Scale. 200M RPM and 1K instances - Pulsar Summit SF 2022
Data Democracy: Journey to User-Facing Analytics - Pulsar Summit SF 2022
Beam + Pulsar: Powerful Stream Processing at Scale - Pulsar Summit SF 2022
Welcome and Opening Remarks - Pulsar Summit SF 2022
Log System As Backbone – How We Built the World’s Most Advanced Vector Databa...
MoP(MQTT on Pulsar) - a Powerful Tool for Apache Pulsar in IoT - Pulsar Summi...

Recently uploaded (20)

PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Introduction to Knowledge Engineering Part 1
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Database Infoormation System (DBIS).pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
annual-report-2024-2025 original latest.
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Qualitative Qantitative and Mixed Methods.pptx
Mega Projects Data Mega Projects Data
Introduction to Knowledge Engineering Part 1
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Fluorescence-microscope_Botany_detailed content
Database Infoormation System (DBIS).pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
.pdf is not working space design for the following data for the following dat...
IBA_Chapter_11_Slides_Final_Accessible.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Introduction-to-Cloud-ComputingFinal.pptx
Business Acumen Training GuidePresentation.pptx
ISS -ESG Data flows What is ESG and HowHow
Acceptance and paychological effects of mandatory extra coach I classes.pptx
annual-report-2024-2025 original latest.
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...

Building event streaming pipelines using Apache Pulsar