SlideShare a Scribd company logo
Feed Your SIEM Smart
with Kafka Connect
Vitalii Rudenskyi
Security Architect, McKesson Corp
Motivation and Background
Why We Started
It’s not a mistake
to make a mistake,
but it’s a mistake
to repeat the
same mistake
Requirements
★ Logs collection to be vendor agnostic
★ Feed different analytics tools
★ Data filtering and cleaning
★ Existing data retention requirements
★ Scalable and highly available
Anticipated Challenges
★ Amount of data and number of sources
★ Variety of different formats
★ SaaS/PaaS applications and public Clouds
★ Cloud SIEM
Kafka Connect
The Three Keys That Open The Door
How to Collect Data
The Three Keys
‘Push’ Connector ‘Pull’ Connector
Transformations Library
Push - NettySource Connector
★ Unified: “all-in-one” different transport and protocols
★ Scalable: multiple tasks supported
★ Available: works behind a LB (health checks supported)
★ Configurable: multiple protocols (plain text, syslog, http, snmp,
netflow)
★ Customizable: custom implementations supported
NettySource Connector
Common deployment model
Pull - PollableAPIClient Connector
★ Simple: easy new connectors development
★ Scalable: multiple tasks/partitions supported
★ Configurable: interval and scheduled (cron-style) polls,
retry/backoff, resettable offsets
★ Customizable: custom implementations supported
PollableAPIClient Connector
PollableAPIClient Connector
25+ ApiClient implementations
Transformations Library
★ Can transform different parts of a kafka message
★ Supports “if” conditions
Takeaways
Highly Available NettySource
Connector
Headers All The Way
★ Track origin of the data
★ Conditional routing
★ Tagging in ‘chained’ transformations
★ SIEM specific metadata
Syslog: Take Port 514 Under Control
Multi-rules RegexRouter config
Data archiving solution
★ Pair of source and sink connectors
★ One connector for all topics
★ Compressed
★ Easy to restore
★ Monitoring is essential
★ Keep original data
★ Be cautious when using “heavy” transformations in
source connectors
5 kafka clusters
20+ kafka connect clusters
530+ deployed connectors
7+TB of data daily
What is the next...
We share!
★ NettySource Connector
https://github.com/vrudenskyi/kafka-connect-netty-source
★ PollableAPIClient Connector
https://github.com/vrudenskyi/kafka-connect-pollable-source
https://github.com/vrudenskyi/kafka-connect-api-clients
★ Transformations Library
https://github.com/vrudenskyi/kafka-connect-transform
Q & Avrudenskyi@gmail.com
https://www.linkedin.com/in/vrudenskyi/

More Related Content

PDF
Introduction to Apache Beam
PDF
Advanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan Ewen
PDF
Media Handling in FreeSWITCH
PDF
Altinity Quickstart for ClickHouse
PDF
Apache kafka performance(throughput) - without data loss and guaranteeing dat...
PDF
Apache Cassandra at Macys
PDF
The Rise of ZStandard: Apache Spark/Parquet/ORC/Avro
PDF
Multi Master PostgreSQL Cluster on Kubernetes
Introduction to Apache Beam
Advanced Streaming Analytics with Apache Flink and Apache Kafka, Stephan Ewen
Media Handling in FreeSWITCH
Altinity Quickstart for ClickHouse
Apache kafka performance(throughput) - without data loss and guaranteeing dat...
Apache Cassandra at Macys
The Rise of ZStandard: Apache Spark/Parquet/ORC/Avro
Multi Master PostgreSQL Cluster on Kubernetes

What's hot (20)

PDF
Care and Feeding of Catalyst Optimizer
PDF
Creating Beautiful Dashboards with Grafana and ClickHouse
PPTX
Logstash
PPTX
HBase and HDFS: Understanding FileSystem Usage in HBase
PDF
What Is ELK Stack | ELK Tutorial For Beginners | Elasticsearch Kibana | ELK S...
PDF
Kafka Security 101 and Real-World Tips
PPTX
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
PDF
ksqlDB: A Stream-Relational Database System
PPTX
Elk
PPTX
Integrating Apache Spark and NiFi for Data Lakes
PPTX
Using Wildcards with rsyslog's File Monitor imfile
PPTX
Elastic - ELK, Logstash & Kibana
PDF
A Practical Introduction to Handling Log Data in ClickHouse, by Robert Hodges...
PDF
Netflix conductor
PPTX
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
PPTX
SSL/TLS Introduction with Practical Examples Including Wireshark Captures
PPTX
Log management with ELK
PPTX
Apache Beam: A unified model for batch and stream processing data
PDF
A Day in the Life of a ClickHouse Query Webinar Slides
PDF
Vitess: Scalable Database Architecture - Kubernetes Community Days Africa Ap...
Care and Feeding of Catalyst Optimizer
Creating Beautiful Dashboards with Grafana and ClickHouse
Logstash
HBase and HDFS: Understanding FileSystem Usage in HBase
What Is ELK Stack | ELK Tutorial For Beginners | Elasticsearch Kibana | ELK S...
Kafka Security 101 and Real-World Tips
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
ksqlDB: A Stream-Relational Database System
Elk
Integrating Apache Spark and NiFi for Data Lakes
Using Wildcards with rsyslog's File Monitor imfile
Elastic - ELK, Logstash & Kibana
A Practical Introduction to Handling Log Data in ClickHouse, by Robert Hodges...
Netflix conductor
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
SSL/TLS Introduction with Practical Examples Including Wireshark Captures
Log management with ELK
Apache Beam: A unified model for batch and stream processing data
A Day in the Life of a ClickHouse Query Webinar Slides
Vitess: Scalable Database Architecture - Kubernetes Community Days Africa Ap...
Ad

Similar to Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Kafka Summit 2020 (20)

PDF
Big data conference europe real-time streaming in any and all clouds, hybri...
PDF
Scenic City Summit (2021): Real-Time Streaming in any and all clouds, hybrid...
PDF
Introduction to Apache Mesos and DC/OS
PDF
DevOps and Continuous Delivery with CloudFoundry
PDF
Streaming Time Series Data With Kenny Gorman and Elena Cuevas | Current 2022
PPTX
DEVNET-1140 InterCloud Mapreduce and Spark Workload Migration and Sharing: Fi...
PPTX
OS for AI: Elastic Microservices & the Next Gen of ML
PDF
.NET Cloud-Native Bootcamp- Los Angeles
PDF
Microsoft Azure Explained - Hitesh D Kesharia
PDF
Continuous delivery and DevOps with CloudFoundry
PPTX
Gs08 modernize your data platform with sql technologies wash dc
PDF
Music city data Hail Hydrate! from stream to lake
PDF
Service fabric and azure service fabric mesh
PDF
The Never Landing Stream with HTAP and Streaming
PDF
Developing Enterprise Applications for the Cloud, from Monolith to Microservice
PPTX
Developing Enterprise Applications for the Cloud, from Monolith to Microservices
PDF
.NET Cloud-Native Bootcamp
PDF
Zero to 1000+ Applications - Large Scale CD Adoption at Cisco with Spinnaker ...
PPTX
Sqlviking
PDF
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Big data conference europe real-time streaming in any and all clouds, hybri...
Scenic City Summit (2021): Real-Time Streaming in any and all clouds, hybrid...
Introduction to Apache Mesos and DC/OS
DevOps and Continuous Delivery with CloudFoundry
Streaming Time Series Data With Kenny Gorman and Elena Cuevas | Current 2022
DEVNET-1140 InterCloud Mapreduce and Spark Workload Migration and Sharing: Fi...
OS for AI: Elastic Microservices & the Next Gen of ML
.NET Cloud-Native Bootcamp- Los Angeles
Microsoft Azure Explained - Hitesh D Kesharia
Continuous delivery and DevOps with CloudFoundry
Gs08 modernize your data platform with sql technologies wash dc
Music city data Hail Hydrate! from stream to lake
Service fabric and azure service fabric mesh
The Never Landing Stream with HTAP and Streaming
Developing Enterprise Applications for the Cloud, from Monolith to Microservice
Developing Enterprise Applications for the Cloud, from Monolith to Microservices
.NET Cloud-Native Bootcamp
Zero to 1000+ Applications - Large Scale CD Adoption at Cisco with Spinnaker ...
Sqlviking
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Ad

More from HostedbyConfluent (20)

PDF
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
PDF
Renaming a Kafka Topic | Kafka Summit London
PDF
Evolution of NRT Data Ingestion Pipeline at Trendyol
PDF
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
PDF
Exactly-once Stream Processing with Arroyo and Kafka
PDF
Fish Plays Pokemon | Kafka Summit London
PDF
Tiered Storage 101 | Kafla Summit London
PDF
Building a Self-Service Stream Processing Portal: How And Why
PDF
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
PDF
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
PDF
Navigating Private Network Connectivity Options for Kafka Clusters
PDF
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
PDF
Explaining How Real-Time GenAI Works in a Noisy Pub
PDF
TL;DR Kafka Metrics | Kafka Summit London
PDF
A Window Into Your Kafka Streams Tasks | KSL
PDF
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
PDF
Data Contracts Management: Schema Registry and Beyond
PDF
Code-First Approach: Crafting Efficient Flink Apps
PDF
Debezium vs. the World: An Overview of the CDC Ecosystem
PDF
Beyond Tiered Storage: Serverless Kafka with No Local Disks
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Renaming a Kafka Topic | Kafka Summit London
Evolution of NRT Data Ingestion Pipeline at Trendyol
Ensuring Kafka Service Resilience: A Dive into Health-Checking Techniques
Exactly-once Stream Processing with Arroyo and Kafka
Fish Plays Pokemon | Kafka Summit London
Tiered Storage 101 | Kafla Summit London
Building a Self-Service Stream Processing Portal: How And Why
From the Trenches: Improving Kafka Connect Source Connector Ingestion from 7 ...
Future with Zero Down-Time: End-to-end Resiliency with Chaos Engineering and ...
Navigating Private Network Connectivity Options for Kafka Clusters
Apache Flink: Building a Company-wide Self-service Streaming Data Platform
Explaining How Real-Time GenAI Works in a Noisy Pub
TL;DR Kafka Metrics | Kafka Summit London
A Window Into Your Kafka Streams Tasks | KSL
Mastering Kafka Producer Configs: A Guide to Optimizing Performance
Data Contracts Management: Schema Registry and Beyond
Code-First Approach: Crafting Efficient Flink Apps
Debezium vs. the World: An Overview of the CDC Ecosystem
Beyond Tiered Storage: Serverless Kafka with No Local Disks

Recently uploaded (20)

PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
cuic standard and advanced reporting.pdf
PDF
Advanced IT Governance
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
 
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
 
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Empathic Computing: Creating Shared Understanding
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
cuic standard and advanced reporting.pdf
Advanced IT Governance
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
Unlocking AI with Model Context Protocol (MCP)
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Advanced Soft Computing BINUS July 2025.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
The Rise and Fall of 3GPP – Time for a Sabbatical?
 
20250228 LYD VKU AI Blended-Learning.pptx
Chapter 3 Spatial Domain Image Processing.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
CIFDAQ's Market Insight: SEC Turns Pro Crypto
 
Reach Out and Touch Someone: Haptics and Empathic Computing
Empathic Computing: Creating Shared Understanding
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication

Feed Your SIEM Smart with Kafka Connect (Vitalii Rudenskyi, McKesson Corp) Kafka Summit 2020