SlideShare a Scribd company logo
SPARK STREAMING
... IN 10 MINUTES
sven.tessmann@comsysto.com
@setema
WHY STREAMING?
FROM "DATA AT REST"
TOWARDS "DATA IN MOTION"
LinkedIn processes 1.1 Trillion Messages a day
(2015, using Kafka)
Microsoft processes 10TB of O ce 365 Event Data a day
(2015, using Spark)
"FAST DATA"
GENERATE
"BUSINESS REALTIME"
INSIGHTS
WHAT ABOUT SPARK
STREAMING?
"scalable, high-throughput, fault-tolerant
stream processing framework"
Spark Ecosystem
DISCRETIZED STREAM (DSTREAM)
a sequence of distributed datasets (RDDs) representing a
continuous stream of data
SPARK STREAMING  Spark Hadoop User Group Munich Meetup 2016
DStream Transformations
Execution Model
LET'S SEE SOME CODE
THANK YOU!

More Related Content

PPTX
Azure Spring Cloud
PPTX
SkillPages Engineering Presents at AWS Lean Cloud in London
PDF
SquareScale Munich Cloud Native Night
PPTX
Moving Your Data to The Cloud
PPTX
Sql Azure - St. Louis Day of .NET
PDF
vRealize Operations (vROps) Management Pack for Citrix NetScaler
PPTX
Extending on premise applications to the cloud
PPTX
Developing for the Cloud
Azure Spring Cloud
SkillPages Engineering Presents at AWS Lean Cloud in London
SquareScale Munich Cloud Native Night
Moving Your Data to The Cloud
Sql Azure - St. Louis Day of .NET
vRealize Operations (vROps) Management Pack for Citrix NetScaler
Extending on premise applications to the cloud
Developing for the Cloud

More from Comsysto Reply GmbH (20)

PDF
Architectural Decisions: Smoothly and Consistently
PDF
ljug-meetup-2023-03-hexagonal-architecture.pdf
PDF
Software Architecture and Architectors: useless VS valuable
PDF
Invited-Talk_PredAnalytics_München (2).pdf
PDF
MicroFrontends für Microservices
PDF
Alles offen = gut(ai)
PDF
Bable on Smart City Munich Meetup: How cities are leveraging innovative partn...
PDF
Smart City Munich Kickoff Meetup
PDF
Data Reliability Challenges with Spark by Henning Kropp (Spark & Hadoop User ...
PDF
"Hadoop Data Lake vs classical Data Warehouse: How to utilize best of both wo...
PDF
Data lake vs Data Warehouse: Hybrid Architectures
PPTX
Java 9 Modularity and Project Jigsaw
PDF
Distributed Computing and Caching in the Cloud: Hazelcast and Microsoft
PDF
Grundlegende Konzepte von Elm, React und AngularDart 2 im Vergleich
PDF
Building a fully-automated Fast Data Platform
PPTX
Apache Apex: Stream Processing Architecture and Applications
PPTX
Ein Prozess lernt laufen: LEGO Mindstorms Steuerung mit BPMN
PDF
Geospatial applications created using java script(and nosql)
PDF
Java cro 2016 - From.... to Scrum by Jurica Krizanic
PDF
21.04.2016 Meetup: Spark vs. Flink
Architectural Decisions: Smoothly and Consistently
ljug-meetup-2023-03-hexagonal-architecture.pdf
Software Architecture and Architectors: useless VS valuable
Invited-Talk_PredAnalytics_München (2).pdf
MicroFrontends für Microservices
Alles offen = gut(ai)
Bable on Smart City Munich Meetup: How cities are leveraging innovative partn...
Smart City Munich Kickoff Meetup
Data Reliability Challenges with Spark by Henning Kropp (Spark & Hadoop User ...
"Hadoop Data Lake vs classical Data Warehouse: How to utilize best of both wo...
Data lake vs Data Warehouse: Hybrid Architectures
Java 9 Modularity and Project Jigsaw
Distributed Computing and Caching in the Cloud: Hazelcast and Microsoft
Grundlegende Konzepte von Elm, React und AngularDart 2 im Vergleich
Building a fully-automated Fast Data Platform
Apache Apex: Stream Processing Architecture and Applications
Ein Prozess lernt laufen: LEGO Mindstorms Steuerung mit BPMN
Geospatial applications created using java script(and nosql)
Java cro 2016 - From.... to Scrum by Jurica Krizanic
21.04.2016 Meetup: Spark vs. Flink
Ad

Recently uploaded (20)

PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPT
Quality review (1)_presentation of this 21
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
Foundation of Data Science unit number two notes
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
Introduction to machine learning and Linear Models
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
Lecture1 pattern recognition............
PPTX
Qualitative Qantitative and Mixed Methods.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Quality review (1)_presentation of this 21
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Foundation of Data Science unit number two notes
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
IBA_Chapter_11_Slides_Final_Accessible.pptx
Database Infoormation System (DBIS).pptx
Introduction to machine learning and Linear Models
oil_refinery_comprehensive_20250804084928 (1).pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Galatica Smart Energy Infrastructure Startup Pitch Deck
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Lecture1 pattern recognition............
Qualitative Qantitative and Mixed Methods.pptx
Ad

SPARK STREAMING Spark Hadoop User Group Munich Meetup 2016