SlideShare a Scribd company logo
SPRINGONE2GX
WASHINGTON, DC
Unless otherwise indicated, these slides are © 2013-2015 Pivotal Software, Inc. and licensed under a Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
Who Needs Batch?
By Michael Minella and Gunnar Hillert
@michaelminella, @ghillert
s2gx2015 who needs batch
Michael Minella
Twitter: @michaelminella
Podcast: http://javaOffHeap.com 
or @OffHeap
Website: http://spring.io
Gunnar Hillert
Twitter: @ghillert
Website: http://blog.hillert.com
Conference: http://devnexus.com
s2gx2015 who needs batch
https://github.com/ghillert/s1-2gx2015-who-needs-batch
W E H AV E A N 
ANNOUNCEMENT
WE HAVE A 
DISEASE
OBJECTIVIUS SHINIUM
SYNDROMUS
Shiny Object Syndrom
SHINY OBJECT
SYNDROME
IT MAKES US
INNOVATE
ROAD TO 
RECOVERY
COOL KIDS ARE DOING
STREAM PROCESSING
Stream Analytics
Cloud Dataflow
s2gx2015 who needs batch
s2gx2015 who needs batch
LET’S TAKE A
CLOSER LOOK
s2gx2015 who needs batch
BATCH AND STREAM
DEFINITIONS
LOOK AT COMMON
USE CASES
CONCLUSIONS
DEFINITION:
STREAM PROCESSING
A system for processing an unbounded 
set of data in an asynchronous manor.
DEFINITION:
BATCH PROCESSING
DEPENDS ON 
WHO YOU ASK
“Batch … applications run … as special 
cases of stream processing applications”
- Apache Flink Documentation
Batch processing is the processing of
a bounded data set without interruption
or interaction
KEY
DIFFERENCES
1
BOUNDED VS
UNBOUNDED
2
SYNCHRONOUS VS
ASYNCHRONOUS
3
STATEFUL VS
STATELESS
STATE OF
STREAMING
Stream Analytics
Cloud Dataflow
Stream Analytics
Cloud Dataflow
STATE OF
STREAMING
JSR-352
WHY THE 
END OF BATCH?
LATENCY
WHAT DOES THE 
END OF BATCH MEAN?
LIMIT 
LATENCY
“ENTERPRISE GRADE”
JSR-352
LOOK AT COMMON
USE CASES
E.T.L.
DATABASE TO HDFS
DEMO
STREAMING CAN BE USEFUL
IN INGESTION USE CASES
TWITTER TO HDFS
DATA SCIENCE
BATCH TRAINING
PREDICTIVE MODELS
CONNECTED
CAR
transformerhttp filter hdfs
type-transformer
python
Gemfire
REST
Hadoop
Gemfire
STREAMING 
APROXIMIATIONS EXIST
≈
BREADTH AND ACCURACY
DON’T MATCH BATCH
≠
WORKFLOW
ORCHESTRATION
s2gx2015 who needs batch
forEachDir
start
getDirectoryInfo
join
dirSubProcss dirSubProcss…
Condition 1
start
getDirAgeAndNumberOfFiles
Ingest
Condition 2
sendReminderEmail
ArchiveEnd
DEMO
HOW IS THIS DONE
VIA STREAMING?
NON-INTERACTIVE
PROCESSING
SAME AS ETL
PROCESSING
PORT SCANNER
DEMO
STREAMING 
DOESN’T MAKE SENSE
STREAMING 
DOES MAKE SENSE
WHERE LATENCY
IS A PRIORITY
DATA LOSS
IS ACCEPTABLE
UNBOUNDED
DATASET
BUT DOES NOT
IN OTHER USE CASES
HIGHLY
COMPLEX
ERROR HANDLING
NOT AS ROBUST
DATA GUARANTEES
NOT AS ROBUST
WHERE BATCH 
IS BETTER
FINITE
DATASET
ROBUST 
ERROR HANDLING
RESOURCES
ARE A CONCERN
NOT EITHER OR
COMBINING THE TWO 
PROVIDES THE MOST POWER
E.T.L.v2
mailhttp mysql2hdfs
Database to HDFS v2
DEMO
s2gx2015 who needs batch
BATCH HAS LASTED
BECAUSE IT MAKSE SENSE
SPRING XD/DATA FLOW
BEST OF BOTH WORLDS
ENTERPRISE GRADE
STREAM PROCESSING
BEST BATCH OPTION
ON THE JVM
s2gx2015 who needs batch
Unless otherwise indicated, these slides are © 2013-2015 Pivotal Software, Inc. and licensed under a
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 88
Check out Spring Cloud Data Flow on https://spring.io
Apache Spark for Big Data Processing – Salon N-P Up next!
High Performance Stream Processing – Salon N-P 8:30AM Tomorrow
Spring Integration Extensions Ecosystem – Here! 12:45 Tomorrow
Learn More. Stay Connected.
@springcentral Spring.io/video

More Related Content

PPTX
Caso Clínico: Emergências Hipertensivas
PDF
One does not simply builds her own Heroku
PDF
Webinar - Data driven postmortems - Jason Yee
PPS
PDF
diploma_online_ajax
PDF
Own Your Front-end Performance: Tools, Not Rules
PDF
Epoch-Suite :: Preparar Ajustamento
PDF
Land For Sale OR
Caso Clínico: Emergências Hipertensivas
One does not simply builds her own Heroku
Webinar - Data driven postmortems - Jason Yee
diploma_online_ajax
Own Your Front-end Performance: Tools, Not Rules
Epoch-Suite :: Preparar Ajustamento
Land For Sale OR

Viewers also liked (11)

PPTX
141208 AER Public Forum - TransGrid Revenue Proposal
PPTX
Simbolos patrios
PDF
TransGrid's Renewable Energy Hub
PPTX
4 de noviembre
PDF
Electricity Storage Forum 2016
PPTX
NSW City Rail Model Model
DOC
Protocolo de toxoplasmose
 
PPT
Toxoplasmose congênita
PDF
2015 TAPR Forum Presentation
PPT
Amigdalas e vegetações adenóides
PDF
Annual Employee Performance Eval.
141208 AER Public Forum - TransGrid Revenue Proposal
Simbolos patrios
TransGrid's Renewable Energy Hub
4 de noviembre
Electricity Storage Forum 2016
NSW City Rail Model Model
Protocolo de toxoplasmose
 
Toxoplasmose congênita
2015 TAPR Forum Presentation
Amigdalas e vegetações adenóides
Annual Employee Performance Eval.
Ad

Similar to s2gx2015 who needs batch (20)

PDF
Don't Cross The Streams - Data Streaming And Apache Flink
PPTX
Introduction to Real-Time Data Processing
PPTX
Gcp dataflow
PDF
When Streaming Needs Batch With Konstantin Knauf | Current 2022
PDF
William Vambenepe – Google Cloud Dataflow and Flink , Stream Processing by De...
PDF
Spark meetup stream processing use cases
PPTX
Spark Streaming Recipes and "Exactly Once" Semantics Revised
PPTX
Trivento summercamp fast data 9/9/2016
PPTX
Sweet Streams (Are made of this)
PPTX
Apache Flink Meetup Munich (November 2015): Flink Overview, Architecture, Int...
PPTX
Introduction to Apache Flink at Vienna Meet Up
PDF
Spring Batch Performance Tuning
PPTX
Trivento summercamp masterclass 9/9/2016
PPTX
Batch Processing vs Stream Processing Difference
PDF
Introduction to Stream Processing
PDF
Spring batch overivew
PPTX
Continuous Processing with Apache Flink - Strata London 2016
PDF
Introduction to Real-Time Data Processing
PDF
Introduction to Real-time data processing
PDF
Self-Service Data Ingestion Using NiFi, StreamSets & Kafka
Don't Cross The Streams - Data Streaming And Apache Flink
Introduction to Real-Time Data Processing
Gcp dataflow
When Streaming Needs Batch With Konstantin Knauf | Current 2022
William Vambenepe – Google Cloud Dataflow and Flink , Stream Processing by De...
Spark meetup stream processing use cases
Spark Streaming Recipes and "Exactly Once" Semantics Revised
Trivento summercamp fast data 9/9/2016
Sweet Streams (Are made of this)
Apache Flink Meetup Munich (November 2015): Flink Overview, Architecture, Int...
Introduction to Apache Flink at Vienna Meet Up
Spring Batch Performance Tuning
Trivento summercamp masterclass 9/9/2016
Batch Processing vs Stream Processing Difference
Introduction to Stream Processing
Spring batch overivew
Continuous Processing with Apache Flink - Strata London 2016
Introduction to Real-Time Data Processing
Introduction to Real-time data processing
Self-Service Data Ingestion Using NiFi, StreamSets & Kafka
Ad

More from Gunnar Hillert (15)

PDF
High Precision GPS Positioning for Spring Developers
PDF
Migrating to Angular 5 for Spring Developers
PDF
The Spring Update
PDF
Ajug - The Spring Update
PDF
Creating Modular Test-Driven SPAs with Spring and AngularJS
PDF
Modular Test-driven SPAs with Spring and AngularJS
PDF
Atlanta JUG - Integrating Spring Batch and Spring Integration
PDF
DevNexus 2013 - Introduction to WebSockets
PDF
Introduction to WebSockets
KEY
S2GX 2012 - Spring Projects Infrastructure
KEY
S2GX 2012 - What's New in Spring Integration
KEY
S2GX 2012 - Introduction to Spring Integration and Spring Batch
PPTX
Spring Projects Infrastructure
PDF
Cloud Foundry for Spring Developers
KEY
jRecruiter - The AJUG Job Posting Service
High Precision GPS Positioning for Spring Developers
Migrating to Angular 5 for Spring Developers
The Spring Update
Ajug - The Spring Update
Creating Modular Test-Driven SPAs with Spring and AngularJS
Modular Test-driven SPAs with Spring and AngularJS
Atlanta JUG - Integrating Spring Batch and Spring Integration
DevNexus 2013 - Introduction to WebSockets
Introduction to WebSockets
S2GX 2012 - Spring Projects Infrastructure
S2GX 2012 - What's New in Spring Integration
S2GX 2012 - Introduction to Spring Integration and Spring Batch
Spring Projects Infrastructure
Cloud Foundry for Spring Developers
jRecruiter - The AJUG Job Posting Service

Recently uploaded (20)

PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Encapsulation theory and applications.pdf
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PPTX
Chapter 5: Probability Theory and Statistics
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
August Patch Tuesday
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
Hybrid model detection and classification of lung cancer
PPTX
A Presentation on Artificial Intelligence
PPTX
TLE Review Electricity (Electricity).pptx
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
project resource management chapter-09.pdf
Assigned Numbers - 2025 - Bluetooth® Document
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
NewMind AI Weekly Chronicles - August'25-Week II
Encapsulation theory and applications.pdf
SOPHOS-XG Firewall Administrator PPT.pptx
Chapter 5: Probability Theory and Statistics
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Enhancing emotion recognition model for a student engagement use case through...
August Patch Tuesday
Building Integrated photovoltaic BIPV_UPV.pdf
Zenith AI: Advanced Artificial Intelligence
Hybrid model detection and classification of lung cancer
A Presentation on Artificial Intelligence
TLE Review Electricity (Electricity).pptx
1 - Historical Antecedents, Social Consideration.pdf
Approach and Philosophy of On baking technology
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Hindi spoken digit analysis for native and non-native speakers
project resource management chapter-09.pdf

s2gx2015 who needs batch