Image from: wikipedia.org/wiki/Systematic_review,2017
Data Stream Processing
Data stream processing (SP) is the real-time processing of data
continuously, concurrently, and record by record. SP treats
data as a continuous, infinite stream integrated from its
sources.
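As a rough illustration of record-by-record processing, the sketch below (Python, not taken from the slides) consumes an unbounded source one record at a time and emits an updated result after every record. The source `sensor_stream` and the running-mean operator are hypothetical stand-ins chosen only for illustration.

```python
import itertools
from typing import Iterable, Iterator


def sensor_stream() -> Iterator[float]:
    """Hypothetical unbounded source: yields readings forever."""
    for i in itertools.count():
        yield 20.0 + (i % 5)


def running_mean(stream: Iterable[float]) -> Iterator[float]:
    """Emit an updated mean after each record, keeping only constant state."""
    count = 0
    mean = 0.0
    for value in stream:
        count += 1
        mean += (value - mean) / count  # incremental update, no buffering
        yield mean


# The stream never ends, so the consumer decides how much of it to read.
first_five = list(itertools.islice(running_mean(sensor_stream()), 5))
print(first_five)
```

The operator never sees the whole data set; it holds only a count and the current mean, which is what allows it to run over an infinite stream.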
BIG DATA STREAM PROCESSING
SP
Stream Processing
Devices, social media, web content, … generate
massive stream signals denoted as “Big Data
Streams”.
BD
Big Data
This is in contrast to traditional big data approaches,
where the constraints of real-time responsiveness,
mobility, and energy availability aren’t
considered.
Mohammed Alayyoub, Ali Yazici, and Ziya
Karakaya. (2016). A Systematic Mapping Study
for Big Data Stream Processing Frameworks.
JADI - Brazil, vol. 2, pp. 4-11.
Technology Seminar 11: Research paper titled "A Systematic Mapping Study for Big Data Stream Processing Framework"
summarizing the results of the included studies
RESULTS OF SYSTEMATIC REVIEW
RQ 1. What types of contributions are made by the papers?
A Systematic Mapping Study for Big Data Stream Processing Frameworks [Mohammed Alayyoub et al, 2016]
Contributions
Method/Technique/Approach: 35
Framework: 11
Comparison: 11
Analysis: 10
Other: 7
Model: 6
Tool: 5
Platform: 5
Overview: 4
Architecture: 4
Empirical Study: 3
The study posed nine research questions (RQs).
451 candidate studies came from the selected sources.
91 studies conducted between 2010 and 2015 were evaluated.
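The figures on this slide can be checked with a few lines of arithmetic. The sketch below uses only the numbers stated above. Note that the contribution counts add up to more than the 91 evaluated studies; one possible reading, which the slide does not state and is only an assumption here, is that some papers were counted under more than one contribution type.

```python
# Contribution counts as listed on the slide.
contributions = {
    "Method/Technique/Approach": 35,
    "Framework": 11,
    "Comparison": 11,
    "Analysis": 10,
    "Other": 7,
    "Model": 6,
    "Tool": 5,
    "Platform": 5,
    "Overview": 4,
    "Architecture": 4,
    "Empirical Study": 3,
}

candidate_studies = 451  # candidate studies from the selected sources
evaluated_studies = 91   # studies evaluated (2010 to 2015)

total_labels = sum(contributions.values())
inclusion_rate = evaluated_studies / candidate_studies

print(f"contribution labels assigned: {total_labels}")
print(f"labels beyond the 91 studies: {total_labels - evaluated_studies}")
print(f"share of candidates evaluated: {inclusion_rate:.1%}")
```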
RESULTS OF SYSTEMATIC REVIEW
RQ 2. What types of research methods are used in the papers?
Solution Proposal: A solution for a problem is proposed.
Validation Research: Techniques investigated have not yet been
implemented in practice.
Evaluation Research: Techniques are implemented in practice
and an evaluation of the technique is conducted.
Experience Papers: Explain what has been done in practice
and how.
Research Methods
Solution Proposal: 20
Validation Research: 39
Evaluation Research: 31
Experience Papers: 1
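To make the distribution easier to read, the counts above can be turned into shares. This is a minimal sketch using only the four numbers on the slide; the variable names are illustrative.

```python
# Research-method counts as listed on the slide.
research_methods = {
    "Solution Proposal": 20,
    "Validation Research": 39,
    "Evaluation Research": 31,
    "Experience Papers": 1,
}

total = sum(research_methods.values())  # matches the 91 evaluated studies

# Percentage share of each research method, largest first.
shares = {
    name: 100 * count / total
    for name, count in sorted(
        research_methods.items(), key=lambda item: item[1], reverse=True
    )
}

for name, share in shares.items():
    print(f"{name:<20} {share:5.1f}%")
```

Unlike the contribution counts, these four categories sum exactly to 91, so each evaluated study appears to fall under a single research method.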
RESULTS OF SYSTEMATIC REVIEW
RQ 3. What types of research methods are used for each framework in the papers?
[Figure: Research methods for each SP. Panels: Spark, Storm, Flink, InfoSphere, S4. Series: Solution Proposal, Validation Research, Evaluation Research. Axis ticks: 0 to 15.]
RESULTS OF SYSTEMATIC REVIEW
RQ 9. What type(s) of data is used most for each Big Data stream processing framework?
[Figure: Data types used. Panels: Sensors, Social Media, Graphical, Geospatial, Log data, Web Content. Axis ticks: 0 to 8.]
RESULTS OF SYSTEMATIC REVIEW
RQ 5. What is the ratio of experimentation type (batch only, stream only or both) used for
each Big Data stream processing framework in the papers?
[Figure: Experimentation forms. Panels: Spark, Storm, Flink, InfoSphere, S4. Series: Batch, Streaming, Both. Axis ticks: 0 to 25.]
RESULTS OF SYSTEMATIC REVIEW
RQ 4. What is the annual number of publications for each Big Data stream processing framework?
[Figure: Annual number of publications, 2009 to 2016. Series: Spark, Storm, Flink, InfoSphere, S4. Axis ticks: 0 to 15.]
RESULTS OF SYSTEMATIC REVIEW
RQ 6. What is the ratio of contribution purposes (usage enhancement, performance
enhancement or both) for each Big Data stream processing framework in the papers?
[Figure: Contribution purposes. Panels: Spark, Storm, Flink, InfoSphere, S4. Series: Usage enhancement, Performance enhancement, Both. Axis ticks: 0 to 15.]
RESULTS OF SYSTEMATIC REVIEW
RQ 7. Which data ingestion internal source/tool is used most for each framework?
[Figure: Data ingestion sources/tools. Panels: Kafka, RabbitMQ, ZeroMQ (0MQ), Network Socket, Twitter Streaming API. Axis ticks: 0 to 5. Slide annotations: "Third party tool to ingest data from external sources", "asynchronous message queue", "Client library to build SP apps.", "Streams API Libraries".]
RESULTS OF SYSTEMATIC REVIEW
RQ 8. What is the most preferred range for the number of nodes used in experimentation for
each Big Data stream processing framework?
[Figure: Number of nodes used in experimentation. Panels: Spark, Storm, Flink, InfoSphere, S4. Series: 1 – 5 nodes, 6 – 20 nodes, 20+ nodes. Axis ticks: 0 to 15.]
Questions
