SlideShare a Scribd company logo
Building a Self-Service Stream Processing Portal
: How And Why
Jaeseung Park
270,000 EPS
35 GB / sec
0
500
1000
1500
2000
2500
3000
0
50000
100000
150000
200000
250000
300000
2019 2020 2021 2022 2023
Event Equipment
https://www.youtube.com/watch?v=vFshGQ2ndeg&list=WL&index=3&t=231s
Data Platform
https://www.youtube.com/watch?v=vFshGQ2ndeg&list=WL&index=3&t=231s
Data Platform
Data Hub
Data Hub
Operation Automation Data Governance
Storage
Ingestion Delivery
Data
Pipeline
Hub DB
Data Mart
Processing Streaming Batch
• CDC Pipeline
• Event Pipeline
Hub Queue
h-STREAM
Pub/Sub
h-CONNECT
ETL
Single
Access
Point
Target
…
Fab
Fab
Source
Data
Lakehouse
Analysis
Biz. App
…
Fab
Hub
Fab
Hub
Legacy
Real-time
Real-time support
Real-time
Real-time
Real-time
Real-time
Real-time
Real-time Data Platform
Data Hub
Operation Automation Data Governance
Storage
Ingestion Delivery
Data
Pipeline
Hub DB
Data Mart
Processing Streaming Batch
• CDC Pipeline
• Event Pipeline
Hub Queue
h-STREAM
Pub/Sub
h-CONNECT
ETL
Single
Access
Point
Target
…
Fab
Fab
Source
Data
Lakehouse
Analysis
Biz. App
…
Fab
Hub
Fab
Hub
Legacy
Real-time
Real-time support
Real-time
Real-time
Real-time
Real-time
Real-time
Real-time Data Platform
Data Hub
Operation Automation Data Governance
Storage
Ingestion Delivery
Data
Pipeline
Hub DB
Data Mart
Processing Streaming Batch
• CDC Pipeline
• Event Pipeline
Hub Queue
h-STREAM
Pub/Sub
h-CONNECT
ETL
Single
Access
Point
Target
…
Fab
Fab
Source
Data
Lakehouse
Analysis
Biz. App
…
Fab
Hub
Fab
Hub
Legacy
Real-time
Real-time support
Real-time
Real-time
Real-time
Real-time
Real-time
Real-time Data Platform
Data Hub
Operation Automation Data Governance
Storage
Ingestion Delivery
Data
Pipeline
Hub DB
Data Mart
Processing Streaming Batch
• CDC Pipeline
• Event Pipeline
Hub Queue
h-STREAM
Pub/Sub
h-CONNECT
ETL
Single
Access
Point
Target
…
Fab
Fab
Source
Data
Lakehouse
Analysis
Biz. App
…
Fab
Hub
Fab
Hub
Legacy
Real-time
Real-time support
Real-time
Real-time
Real-time
Real-time
Real-time
Real-time Data Platform
Data Hub
Operation Automation Data Governance
Storage
Ingestion Delivery
Data
Pipeline
Hub DB
Data Mart
Processing Streaming Batch
• CDC Pipeline
• Event Pipeline
Hub Queue
h-STREAM
Pub/Sub
h-CONNECT
ETL
Single
Access
Point
Target
…
Fab
Fab
Source
Data
Lakehouse
Analysis
Biz. App
…
Fab
Hub
Fab
Hub
Legacy
Real-time
Real-time support
Real-time
Real-time
Real-time
Real-time
Real-time
Real-time Data Platform
Why
User
"I want to easily check real-time data."
"I want to manipulatereal-time data to create my own."
"I lackseparate computing resources, so I can't do Stream Processing."
"I want to integrate stable Stream Processing logic into the operating system."
"I would like to receiveadvice on Stream Processing know-how."
h-STREAM Architecture
ksqlDB
Deploy
Replicate
API
Pub
Dev (Sand-box) Ops
Sub
Brower
Pub/Sub Pub/Sub
API
Master DB
Kafka Connector
Event
h-STREAM Portal
ksqlDB
ksqlDB
User
1
2
3
4
5
h-STREAM Architecture
ksqlDB
Deploy
Replicate
API
Pub
Dev (Sand-box) Ops
Sub
Brower
Pub/Sub Pub/Sub
API
Master DB
Kafka Connector
Event
h-STREAM Portal
ksqlDB
ksqlDB
User
1
2
3
4
5
h-STREAM Architecture
ksqlDB
Deploy
Replicate
API
Pub
Dev (Sand-box) Ops
Sub
Brower
Pub/Sub Pub/Sub
API
Master DB
Kafka Connector
Event
h-STREAM Portal
ksqlDB
ksqlDB
User
1
2
3
4
5
h-STREAM Features
CDC/Event
Data Pipeline
Data Hub
AI Ops
Process
h-STREAM
Data
Source
BI
AI Product
System
Log
Event
Output
Key
Features
Explore Topic ·Stream User Self Service
Deploy to Ops Process
Monitoring Log
Management Server
Select Topic&Stream Work on Sandbox Deploy to Ops
Target
A B C
Topic 1
Topic 2
Topic …
Mirroring to Sandbox
Ops Dev
Topic
Stream
User Choice
Stream Data
Topic
Stream
User Make
Stream Data
Stream
Designer
Query
User
Stream
Ops Stream
Data
Ops Environment
User
Stream
Log
Monitoring
h-STREAM Demo
jseung.park@sk.com
Thank you!

More Related Content

PDF
[WSO2Con EU 2018] The Rise of Streaming SQL
PPTX
Потоковая обработка больших данных
PPTX
In-Stream Processing Service Blueprint, Reference architecture for real-time ...
PDF
Real-Time Analytics with Confluent and MemSQL
PDF
Hadoop Ecosystem and Low Latency Streaming Architecture
PDF
Stream Processing
PPTX
Realtime Detection of DDOS attacks using Apache Spark and MLLib
PDF
Streaming architecture patterns
[WSO2Con EU 2018] The Rise of Streaming SQL
Потоковая обработка больших данных
In-Stream Processing Service Blueprint, Reference architecture for real-time ...
Real-Time Analytics with Confluent and MemSQL
Hadoop Ecosystem and Low Latency Streaming Architecture
Stream Processing
Realtime Detection of DDOS attacks using Apache Spark and MLLib
Streaming architecture patterns

Similar to Building a Self-Service Stream Processing Portal: How And Why (17)

PDF
Complex Er[jl]ang Processing with StreamBase
PDF
Processing Real-Time Data at Scale: A streaming platform as a central nervous...
PPT
Survey of Real-time Processing Systems for Big Data
PDF
Streaming Data Into Your Lakehouse With Frank Munz | Current 2022
PDF
Data Ingestion in Big Data and IoT platforms
PDF
Architecting applications with Hadoop - Fraud Detection
PDF
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
PDF
Inside Kafka Streams—Monitoring Comcast’s Outside Plant
PDF
Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022
PDF
The Rise of Streaming SQL
PDF
[WSO2Con USA 2018] The Rise of Streaming SQL
PDF
Data Streaming Technology Overview
PDF
Building data pipelines at Shopee with DEC
PDF
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
PPTX
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
PDF
Real-time processing of large amounts of data
PDF
Introduction to Stream Processing
Complex Er[jl]ang Processing with StreamBase
Processing Real-Time Data at Scale: A streaming platform as a central nervous...
Survey of Real-time Processing Systems for Big Data
Streaming Data Into Your Lakehouse With Frank Munz | Current 2022
Data Ingestion in Big Data and IoT platforms
Architecting applications with Hadoop - Fraud Detection
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Inside Kafka Streams—Monitoring Comcast’s Outside Plant
Unbundling the Modern Streaming Stack With Dunith Dhanushka | Current 2022
The Rise of Streaming SQL
[WSO2Con USA 2018] The Rise of Streaming SQL
Data Streaming Technology Overview
Building data pipelines at Shopee with DEC
Apache Kafka With Spark Structured Streaming With Emma Liu, Nitin Saksena, Ra...
Smart Enterprise Big Data Bus for the Modern Responsive Enterprise- StreamAna...
Real-time processing of large amounts of data
Introduction to Stream Processing
Ad

More from HostedbyConfluent (20)

PDF
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
PDF
Renaming a Kafka Topic | Kafka Summit London
PDF
Evolution of NRT Data Ingestion Pipeline at Trendyol
PDF
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
PDF
Exactly-once Stream Processing with Arroyo and Kafka
PDF
Fish Plays Pokemon | Kafka Summit London
PDF
Tiered Storage 101 | Kafla Summit London
PDF
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
PDF
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
PDF
Navigating Private Network Connectivity Options for Kafka Clusters
PDF
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
PDF
Explaining How Real-Time GenAI Works in a Noisy Pub
PDF
TL;DR Kafka Metrics | Kafka Summit London
PDF
A Window Into Your Kafka Streams Tasks | KSL
PDF
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
PDF
Data Contracts Management: Schema Registry and Beyond
PDF
Code-First Approach: Crafting Efficient Flink Apps
PDF
Debezium vs. the World: An Overview of the CDC Ecosystem
PDF
Beyond Tiered Storage: Serverless Kafka with No Local Disks
PDF
Automating Speed: A Proven Approach to Preventing Performance Regressions in ...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Renaming a Kafka Topic | Kafka Summit London
Evolution of NRT Data Ingestion Pipeline at Trendyol
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
Exactly-once Stream Processing with Arroyo and Kafka
Fish Plays Pokemon | Kafka Summit London
Tiered Storage 101 | Kafla Summit London
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
Navigating Private Network Connectivity Options for Kafka Clusters
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
Explaining How Real-Time GenAI Works in a Noisy Pub
TL;DR Kafka Metrics | Kafka Summit London
A Window Into Your Kafka Streams Tasks | KSL
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
Data Contracts Management: Schema Registry and Beyond
Code-First Approach: Crafting Efficient Flink Apps
Debezium vs. the World: An Overview of the CDC Ecosystem
Beyond Tiered Storage: Serverless Kafka with No Local Disks
Automating Speed: A Proven Approach to Preventing Performance Regressions in ...
Ad

Recently uploaded (20)

PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Encapsulation theory and applications.pdf
PDF
KodekX | Application Modernization Development
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Cloud computing and distributed systems.
PDF
cuic standard and advanced reporting.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Approach and Philosophy of On baking technology
PPTX
Big Data Technologies - Introduction.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Encapsulation theory and applications.pdf
KodekX | Application Modernization Development
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Unlocking AI with Model Context Protocol (MCP)
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
The AUB Centre for AI in Media Proposal.docx
Spectral efficient network and resource selection model in 5G networks
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
“AI and Expert System Decision Support & Business Intelligence Systems”
Dropbox Q2 2025 Financial Results & Investor Presentation
Chapter 3 Spatial Domain Image Processing.pdf
Cloud computing and distributed systems.
cuic standard and advanced reporting.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
Approach and Philosophy of On baking technology
Big Data Technologies - Introduction.pptx
Network Security Unit 5.pdf for BCA BBA.

Building a Self-Service Stream Processing Portal: How And Why

  • 1. Building a Self-Service Stream Processing Portal : How And Why Jaeseung Park
  • 2. 270,000 EPS 35 GB / sec 0 500 1000 1500 2000 2500 3000 0 50000 100000 150000 200000 250000 300000 2019 2020 2021 2022 2023 Event Equipment
  • 5. Data Hub Operation Automation Data Governance Storage Ingestion Delivery Data Pipeline Hub DB Data Mart Processing Streaming Batch • CDC Pipeline • Event Pipeline Hub Queue h-STREAM Pub/Sub h-CONNECT ETL Single Access Point Target … Fab Fab Source Data Lakehouse Analysis Biz. App … Fab Hub Fab Hub Legacy Real-time Real-time support Real-time Real-time Real-time Real-time Real-time Real-time Data Platform
  • 6. Data Hub Operation Automation Data Governance Storage Ingestion Delivery Data Pipeline Hub DB Data Mart Processing Streaming Batch • CDC Pipeline • Event Pipeline Hub Queue h-STREAM Pub/Sub h-CONNECT ETL Single Access Point Target … Fab Fab Source Data Lakehouse Analysis Biz. App … Fab Hub Fab Hub Legacy Real-time Real-time support Real-time Real-time Real-time Real-time Real-time Real-time Data Platform
  • 7. Data Hub Operation Automation Data Governance Storage Ingestion Delivery Data Pipeline Hub DB Data Mart Processing Streaming Batch • CDC Pipeline • Event Pipeline Hub Queue h-STREAM Pub/Sub h-CONNECT ETL Single Access Point Target … Fab Fab Source Data Lakehouse Analysis Biz. App … Fab Hub Fab Hub Legacy Real-time Real-time support Real-time Real-time Real-time Real-time Real-time Real-time Data Platform
  • 8. Data Hub Operation Automation Data Governance Storage Ingestion Delivery Data Pipeline Hub DB Data Mart Processing Streaming Batch • CDC Pipeline • Event Pipeline Hub Queue h-STREAM Pub/Sub h-CONNECT ETL Single Access Point Target … Fab Fab Source Data Lakehouse Analysis Biz. App … Fab Hub Fab Hub Legacy Real-time Real-time support Real-time Real-time Real-time Real-time Real-time Real-time Data Platform
  • 9. Data Hub Operation Automation Data Governance Storage Ingestion Delivery Data Pipeline Hub DB Data Mart Processing Streaming Batch • CDC Pipeline • Event Pipeline Hub Queue h-STREAM Pub/Sub h-CONNECT ETL Single Access Point Target … Fab Fab Source Data Lakehouse Analysis Biz. App … Fab Hub Fab Hub Legacy Real-time Real-time support Real-time Real-time Real-time Real-time Real-time Real-time Data Platform
  • 10. Why User "I want to easily check real-time data." "I want to manipulatereal-time data to create my own." "I lackseparate computing resources, so I can't do Stream Processing." "I want to integrate stable Stream Processing logic into the operating system." "I would like to receiveadvice on Stream Processing know-how."
  • 11. h-STREAM Architecture ksqlDB Deploy Replicate API Pub Dev (Sand-box) Ops Sub Brower Pub/Sub Pub/Sub API Master DB Kafka Connector Event h-STREAM Portal ksqlDB ksqlDB User 1 2 3 4 5
  • 12. h-STREAM Architecture ksqlDB Deploy Replicate API Pub Dev (Sand-box) Ops Sub Brower Pub/Sub Pub/Sub API Master DB Kafka Connector Event h-STREAM Portal ksqlDB ksqlDB User 1 2 3 4 5
  • 13. h-STREAM Architecture ksqlDB Deploy Replicate API Pub Dev (Sand-box) Ops Sub Brower Pub/Sub Pub/Sub API Master DB Kafka Connector Event h-STREAM Portal ksqlDB ksqlDB User 1 2 3 4 5
  • 14. h-STREAM Features CDC/Event Data Pipeline Data Hub AI Ops Process h-STREAM Data Source BI AI Product System Log Event Output Key Features Explore Topic ·Stream User Self Service Deploy to Ops Process Monitoring Log Management Server Select Topic&Stream Work on Sandbox Deploy to Ops Target A B C Topic 1 Topic 2 Topic … Mirroring to Sandbox Ops Dev Topic Stream User Choice Stream Data Topic Stream User Make Stream Data Stream Designer Query User Stream Ops Stream Data Ops Environment User Stream Log Monitoring