Introduction to Kafka Connect
Presented By:
Anuradha Kumari (Software Consultant)
KnolX Etiquettes
Lack of etiquette and manners is a huge turn off.
Punctuality
Respect KnolX session timings. Please do not join a session
more than 5 minutes after its start time.
Feedback
Make sure to submit constructive feedback for all
sessions, as it is very helpful
for the presenter.
Silent Mode
Please keep your window on
mute.
Avoid Disturbance
Avoid leaving your window
unmuted after asking a
question.
Agenda
01 What is Kafka Connect
02 Architecture of Kafka Connect
03 Core Concepts
04 Features of Kafka Connect
05 Demo
What is Kafka Connect?
Apache Kafka is a distributed, resilient, fault-tolerant platform and a
well-known name in the world of Big Data. It is one of the most widely used distributed
streaming platforms.
It is a framework for storing, reading, and analyzing streaming data: a durable
publish-subscribe messaging system that exchanges data between processes,
applications, and servers.
Kafka Connect is the part of Kafka that moves data between Kafka and other systems
through source and sink connectors.
Architecture of Kafka Connect
Kafka Connect Terminologies
Some important terms that help in understanding Kafka Connect:
● Connectors
● Tasks
● Workers
● Transforms
● Converters
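To show where these terms surface in practice, here is a minimal sketch of a connector definition written as a Python dictionary. The configuration keys and class names are standard Kafka Connect ones, but the file path, topic, and transform alias are made-up values for illustration. Workers do not appear in the definition: they are the processes that load it and run its tasks.

```python
import json

# Hypothetical connector definition; the file path, topic, and alias are made up.
connector_config = {
    # Connector: the plugin class that knows how to talk to the external system.
    "connector.class": "org.apache.kafka.connect.file.FileStreamSourceConnector",
    # Tasks: upper bound on the parallel units of work the connector may create.
    "tasks.max": "1",
    "file": "/tmp/demo-input.txt",
    "topic": "demo-topic",
    # Converter: how record values are serialized before they reach Kafka.
    "value.converter": "org.apache.kafka.connect.json.JsonConverter",
    "value.converter.schemas.enable": "false",
    # Transform: a single message transform applied to each record in flight.
    "transforms": "addSource",
    "transforms.addSource.type": "org.apache.kafka.connect.transforms.InsertField$Value",
    "transforms.addSource.static.field": "origin",
    "transforms.addSource.static.value": "demo-file",
}


def describe(config):
    """Map each term from the slide to the config keys that express it."""
    return {
        "connector": config["connector.class"],
        "tasks": int(config["tasks.max"]),
        "converter": config["value.converter"],
        "transforms": config["transforms"].split(","),
    }


print(json.dumps(describe(connector_config), indent=2))
```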
Standalone vs Distributed Mode
Standalone
● A single process runs both connectors and tasks.
● Configuration uses .properties files.
● Very easy to get started with; useful for development and testing.
● Not fault tolerant, not scalable, hard to monitor.
Distributed
● Multiple workers run connectors and tasks.
● Configuration is performed through a REST API.
● Easy to scale and fault tolerant (tasks are rebalanced if a worker dies).
● Useful for production deployment of connectors.
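The difference in configuration style can be sketched as follows: the same connector is rendered once as .properties content for standalone mode and once as a REST request for distributed mode. The connector name, file path, and topic are hypothetical, and the base URL assumes a worker listening on the default REST port 8083 on localhost. The request is only built, never sent.

```python
import json
import urllib.request

# Hypothetical connector; every value here is an illustrative assumption.
NAME = "demo-file-source"
CONFIG = {
    "connector.class": "org.apache.kafka.connect.file.FileStreamSourceConnector",
    "tasks.max": "1",
    "file": "/tmp/demo-input.txt",
    "topic": "demo-topic",
}


def to_properties(name, config):
    """Standalone mode: render the connector as .properties file content."""
    lines = [f"name={name}"]
    lines += [f"{key}={value}" for key, value in config.items()]
    return "\n".join(lines) + "\n"


def to_rest_request(name, config, base_url="http://localhost:8083"):
    """Distributed mode: build (but do not send) a POST to the workers' REST API."""
    body = json.dumps({"name": name, "config": config}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/connectors",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


print(to_properties(NAME, CONFIG))
request = to_rest_request(NAME, CONFIG)
print(request.get_method(), request.full_url)
# To submit it to a running Connect cluster: urllib.request.urlopen(request)
```

In standalone mode the rendered text would be saved to a file and passed to the worker at startup; in distributed mode any worker in the cluster can accept the request.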
Different types of Kafka Connectors
Source Connector:
A source connector collects data from a system. Source systems can be entire databases, stream tables, or
message brokers.
A source connector could also collect metrics from application servers into Kafka topics, making the data
available for stream processing with low latency.
Sink Connector:
A sink connector delivers data from Kafka topics into other systems, which might be indexes such as
Elasticsearch, batch systems such as Hadoop, or any kind of database.
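The direction of data flow for the two connector types can be illustrated with a toy model. This is not the Kafka Connect API: a topic is represented by a plain Python list, and the table, index, and function names are all hypothetical.

```python
from collections import defaultdict

# Toy stand-ins, not the Kafka Connect API: a "topic" is just a list of records.
topics = defaultdict(list)


def run_source(rows, topic):
    """Source side: copy rows from an external system into a topic."""
    for row in rows:
        topics[topic].append(row)
    return len(rows)


def run_sink(topic, store, key_field):
    """Sink side: deliver every record from a topic into an external store."""
    for record in topics[topic]:
        store[record[key_field]] = record
    return len(topics[topic])


# Hypothetical source table and target index.
orders_table = [{"id": 1, "item": "book"}, {"id": 2, "item": "pen"}]
search_index = {}

run_source(orders_table, "orders")
run_sink("orders", search_index, key_field="id")
print(search_index)
```

The source never talks to the sink directly; both only touch the topic, which is what lets the two sides be deployed and scaled independently.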
Kafka Connect - High level
● Source connectors get data from common data sources.
● Sink connectors publish that data to common data stores.
● Makes it easy for non-experienced developers to quickly get their data reliably into
Kafka.
● Part of your ETL pipeline.
● Scaling made easy, from small pipelines to company-wide pipelines.
● Re-usable code!
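The scaling point can be pictured as the same set of tasks being spread over however many workers are available. The sketch below uses a simple round-robin assignment, which is an assumption made for illustration and not the assignment strategy Kafka Connect itself uses; the task and worker names are hypothetical.

```python
def assign(tasks, workers):
    """Spread tasks over workers round-robin (an assumption, not Connect's real assignor)."""
    assignment = {worker: [] for worker in workers}
    for index, task in enumerate(tasks):
        assignment[workers[index % len(workers)]].append(task)
    return assignment


tasks = [f"jdbc-source-{i}" for i in range(6)]  # hypothetical task ids

small = assign(tasks, ["worker-1"])
scaled = assign(tasks, ["worker-1", "worker-2", "worker-3"])
after_failure = assign(tasks, ["worker-1", "worker-3"])  # worker-2 died

plans = [("one worker", small), ("three workers", scaled), ("worker-2 lost", after_failure)]
for label, plan in plans:
    print(label, {worker: len(owned) for worker, owned in plan.items()})
```

Going from one worker to three needs no change to the connector definition; only the assignment changes, and the same recomputation covers the case where a worker is lost.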
Kafka Connect Use Cases
Here are a few common ways Kafka Connect is used:
● Streaming data pipelines
● Writing to datastores from an application
● Evolving processing from old systems to new
Features
● It simplifies the development, deployment, and management of connectors.
● It scales from large clusters, which leverage the distributed nature of Kafka, down to
setups for development, testing, and small production deployments.
● Kafka Connect handles the offset commit process for us.
● Kafka Connect uses Kafka's existing group management protocol; we can add more workers to
scale up a Kafka Connect cluster.
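The offset commit feature is easiest to see with a restart. The sketch below is a toy model under stated assumptions: `ToyOffsetStore` and `run_task` are hypothetical names standing in for the offset tracking the framework does on the connector's behalf, and the source is a plain list of lines.

```python
class ToyOffsetStore:
    """Stand-in for framework-managed offset storage (an assumption, not the real API)."""

    def __init__(self):
        self._offsets = {}

    def commit(self, partition, offset):
        self._offsets[partition] = offset

    def committed(self, partition):
        return self._offsets.get(partition, 0)


def run_task(lines, partition, store, max_records):
    """Read up to max_records lines, starting from the last committed offset."""
    start = store.committed(partition)
    batch = lines[start:start + max_records]
    # In Kafka Connect the framework performs this step for the connector.
    store.commit(partition, start + len(batch))
    return batch


source_lines = ["a", "b", "c", "d", "e"]  # hypothetical source file contents
store = ToyOffsetStore()

first_run = run_task(source_lines, "demo-file", store, max_records=3)
# Simulate a restart: a new task instance picks up where the old one stopped.
second_run = run_task(source_lines, "demo-file", store, max_records=3)
print(first_run, second_run)
```

Because the position is stored outside the task, the second run neither re-reads nor skips records.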
DEMO
Resources
https://docs.confluent.io/platform/current/connect/userguide.html
https://data-flair.training/blogs/kafka-connect/
https://www.baeldung.com/kafka-connectors-guide
https://www.javatpoint.com/kafka-connect
Thank You!
