2
Most read
3
Most read
4
Most read
Streamsets
Author:-
Swapnil S Hampi
March 17th, 2022
What is Streamsets?
•Platform for data integration
• Multi Cloud Architecture
•Easy connections for various Source/
Target(Data Collector)
Streamsets Value Proposition
StreamSets Control Hub, introduced in 2017, provided a single software-as-a-
service platform to design, deploy, monitor, and manage smart data pipelines at
scale on any cloud and on-premises.
Why
Streamsets?
Minimize
Adoption time for
technologies
Smart modern
option for
changing data
source
Minimal
intervention for
developers for
data drifts
Increased visibility
for monitoring
loads
Reduced TCO
Designed to
handle data drifts
Combined
capabilities of ETL
and data
integration
Informatica vs Streamsets
Informatica Streamsets
• Cost intensive
• In business from 20+ years
• Proven high performance
• Less adaptive for new Source /
target connections
o Required to pay license cost
for additional connections
• Requires high Servers
• More clients compared to
Streamsets
o Designer
o Workflow manager
o Repo Manager
o Admin console
• Cost effective
• Launched in 2015 and still on the
path to be adaptive
• Based on Apache spark which is an
open-source platform
• Ease of adapting to new
connections (highly flexible)
• Lightweight application
• All functionality is managed under
Control Hub
THANK YOU

More Related Content

PPTX
Building Data Pipelines with Spark and StreamSets
PPTX
Introduction to Modern Software Architecture
PDF
INTERFACE by apidays 2023 - How APIs are fueling the growth of 5G and MEC
PPTX
Big data architectures and the data lake
PPTX
Virtualization Vs. Containers
PPTX
Vmware ppt
PDF
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
PDF
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Building Data Pipelines with Spark and StreamSets
Introduction to Modern Software Architecture
INTERFACE by apidays 2023 - How APIs are fueling the growth of 5G and MEC
Big data architectures and the data lake
Virtualization Vs. Containers
Vmware ppt
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka

What's hot (20)

PDF
Data Ingestion in Big Data and IoT platforms
PDF
Introduction to OpenStack
PDF
NATS Streaming - an alternative to Apache Kafka?
PPTX
Commvault Presentation
PPTX
Journey to Cloud: Fast Track to Azure
PPTX
Introduction to Apache Kafka
PDF
Cloud Computing Using OpenStack
PDF
The Best Storage Solution For CloudStack: LINSTOR
PDF
Convergence of Integration and Application Development
PDF
A brief history of cloud computing
PPTX
PDF
Building spatial applications with Google Cloud SQL and Google Maps API
PPTX
Commvault Story - CVTSP_1.pptx
PPTX
Microsoft: Multi-tenant SaaS with Azure
PDF
Migrating to Cloud - A Step by Step
PPTX
What is the Citrix?
PDF
Infrastructure as Code for Beginners
PDF
Monitor every app, in every stage, with free and open Elastic APM
PPTX
HBase Tutorial For Beginners | HBase Architecture | HBase Tutorial | Hadoop T...
PPTX
Azure vidyapeeth -Introduction to Azure Container Service & Registry Service
Data Ingestion in Big Data and IoT platforms
Introduction to OpenStack
NATS Streaming - an alternative to Apache Kafka?
Commvault Presentation
Journey to Cloud: Fast Track to Azure
Introduction to Apache Kafka
Cloud Computing Using OpenStack
The Best Storage Solution For CloudStack: LINSTOR
Convergence of Integration and Application Development
A brief history of cloud computing
Building spatial applications with Google Cloud SQL and Google Maps API
Commvault Story - CVTSP_1.pptx
Microsoft: Multi-tenant SaaS with Azure
Migrating to Cloud - A Step by Step
What is the Citrix?
Infrastructure as Code for Beginners
Monitor every app, in every stage, with free and open Elastic APM
HBase Tutorial For Beginners | HBase Architecture | HBase Tutorial | Hadoop T...
Azure vidyapeeth -Introduction to Azure Container Service & Registry Service
Ad

Similar to StreamSet ETL tool (20)

PPTX
Webinar: The Modern Streaming Data Stack with Kinetica & StreamSets
PPTX
Streamsets Training.pptx
PDF
Stream analytics
PDF
Logging infrastructure for Microservices using StreamSets Data Collector
PPTX
Data Engineer's Lunch #57: StreamSets for Data Engineering
PPTX
StructuredStreaming webinar slides.pptx
PDF
Streaming analytics
PDF
AI-Powered Streaming Analytics for Real-Time Customer Experience
PDF
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
PDF
Streaming vs batching (conundrum ai internal meetup)
PPTX
StructuredStreaming webinar slides.pptx
PDF
Streaming Data Pipelines with Kafka (MEAP) Stefan Sprenger
PDF
Streaming analytics state of the art
PDF
Architectural Patterns for Streaming Applications
PPTX
Let's add Power BI to your IT-Systems or Apps
PDF
Buy ebook Streaming Data Pipelines with Kafka (MEAP) Stefan Sprenger cheap price
PPTX
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
PDF
Building a data pipeline to ingest data into Hadoop in minutes using Streamse...
PPTX
Lego-like building blocks of Storm and Spark Streaming Pipelines
PPT
Stream, Stream, Stream: Different Streaming Methods with Spark and Kafka
Webinar: The Modern Streaming Data Stack with Kinetica & StreamSets
Streamsets Training.pptx
Stream analytics
Logging infrastructure for Microservices using StreamSets Data Collector
Data Engineer's Lunch #57: StreamSets for Data Engineering
StructuredStreaming webinar slides.pptx
Streaming analytics
AI-Powered Streaming Analytics for Real-Time Customer Experience
Streaming Data Analytics with ksqlDB and Superset | Robert Stolz, Preset
Streaming vs batching (conundrum ai internal meetup)
StructuredStreaming webinar slides.pptx
Streaming Data Pipelines with Kafka (MEAP) Stefan Sprenger
Streaming analytics state of the art
Architectural Patterns for Streaming Applications
Let's add Power BI to your IT-Systems or Apps
Buy ebook Streaming Data Pipelines with Kafka (MEAP) Stefan Sprenger cheap price
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Building a data pipeline to ingest data into Hadoop in minutes using Streamse...
Lego-like building blocks of Storm and Spark Streaming Pipelines
Stream, Stream, Stream: Different Streaming Methods with Spark and Kafka
Ad

Recently uploaded (20)

PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
CloudStack 4.21: First Look Webinar slides
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PPT
Geologic Time for studying geology for geologist
PDF
Getting started with AI Agents and Multi-Agent Systems
PPTX
Custom Battery Pack Design Considerations for Performance and Safety
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Five Habits of High-Impact Board Members
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PPTX
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PPTX
Modernising the Digital Integration Hub
PDF
Flame analysis and combustion estimation using large language and vision assi...
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
Two-dimensional Klein-Gordon and Sine-Gordon numerical solutions based on dee...
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
A comparative study of natural language inference in Swahili using monolingua...
Developing a website for English-speaking practice to English as a foreign la...
CloudStack 4.21: First Look Webinar slides
sustainability-14-14877-v2.pddhzftheheeeee
Geologic Time for studying geology for geologist
Getting started with AI Agents and Multi-Agent Systems
Custom Battery Pack Design Considerations for Performance and Safety
Enhancing emotion recognition model for a student engagement use case through...
Five Habits of High-Impact Board Members
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
Zenith AI: Advanced Artificial Intelligence
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Modernising the Digital Integration Hub
Flame analysis and combustion estimation using large language and vision assi...
Final SEM Unit 1 for mit wpu at pune .pptx
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Two-dimensional Klein-Gordon and Sine-Gordon numerical solutions based on dee...
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
A comparative study of natural language inference in Swahili using monolingua...

StreamSet ETL tool

  • 2. What is Streamsets? •Platform for data integration • Multi Cloud Architecture •Easy connections for various Source/ Target(Data Collector)
  • 3. Streamsets Value Proposition StreamSets Control Hub, introduced in 2017, provided a single software-as-a- service platform to design, deploy, monitor, and manage smart data pipelines at scale on any cloud and on-premises. Why Streamsets? Minimize Adoption time for technologies Smart modern option for changing data source Minimal intervention for developers for data drifts Increased visibility for monitoring loads Reduced TCO Designed to handle data drifts Combined capabilities of ETL and data integration
  • 4. Informatica vs Streamsets Informatica Streamsets • Cost intensive • In business from 20+ years • Proven high performance • Less adaptive for new Source / target connections o Required to pay license cost for additional connections • Requires high Servers • More clients compared to Streamsets o Designer o Workflow manager o Repo Manager o Admin console • Cost effective • Launched in 2015 and still on the path to be adaptive • Based on Apache spark which is an open-source platform • Ease of adapting to new connections (highly flexible) • Lightweight application • All functionality is managed under Control Hub