SlideShare a Scribd company logo
Independent of the source of data, the integration of event streams into an
Enterprise Architecture gets more and more important in the world of
sensors, social media streams and Internet of Things. Events have to be
accepted quickly and reliably, they have to be distributed and analyzed, often
with many consumers or systems interested in all or part of the events.
Storing such huge event streams into HDFS or a NoSQL datastore is feasible
and not such a challenge anymore. But if you want to be able to react fast, with
minimal latency, you can not afford to first store the data and doing the
analysis/analytics later. You have to be able to include part of your analytics
right after you consume the data streams. Products for doing event
processing, such as Oracle Event Processing or Esper, are available for quite a
long time and used to be called Complex Event Processing (CEP). In the past
few years, anotherfamily of products appeared, mostly out of the Big Data
Technology space, called Stream Processing or Streaming Analytics. These are
mostly open source products/frameworks such as Apache Storm, Spark
Streaming, Flink, Kafka Streams as well as supporting infrastructures such as
Apache Kafka. In this talk I will present the theoretical foundations for Stream
Processing, discuss the core properties a Stream Processing platform should
provide and highlight what differences you might find between the more
traditional CEP and the more modern Stream Processing solutions.

More Related Content

PPTX
Overview of Apache Fink: The 4G of Big Data Analytics Frameworks
PPTX
Overview of Apache Fink: the 4 G of Big Data Analytics Frameworks
PPTX
Overview of Apache Flink: the 4G of Big Data Analytics Frameworks
PPTX
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
PPTX
Analysis of Major Trends in Big Data Analytics
PPTX
Analysis of Major Trends in Big Data Analytics
PDF
About Streaming Data Solutions for Hadoop
PDF
Apache Spark and future of advanced analytics
Overview of Apache Fink: The 4G of Big Data Analytics Frameworks
Overview of Apache Fink: the 4 G of Big Data Analytics Frameworks
Overview of Apache Flink: the 4G of Big Data Analytics Frameworks
Analysis-of-Major-Trends-in-big-data-analytics-slim-baltagi-hadoop-summit
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
About Streaming Data Solutions for Hadoop
Apache Spark and future of advanced analytics

Similar to Apache Big D-4.docx (20)

PDF
Enabling SQL Access to Data Lakes
PDF
Mastering Kafka Streams and ksqlDB: Building Real-Time Data Systems by Exampl...
DOCX
INFO491FinalPaper
PDF
IoT and the pervasive nature of fast data and apache spark
PDF
IoT and the Pervasive Nature of Fast Data and Apache Spark
PDF
ManMachine&Mathematics_Arup_Ray_Ext
PDF
Started with-apache-spark
PDF
Apache Kafka Use Cases_ When To Use It_ When Not To Use_.pdf
PDF
10 things you need to know about Spark
PDF
Real Time Stream Data Management Push Based Data in Research Practice Wolfram...
PDF
Big Data to SMART Data : Process Scenario
PDF
Real Time Stream Data Management Push Based Data in Research Practice Wolfram...
PDF
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
PDF
Scaling up with Cisco Big Data: Data + Science = Data Science
PDF
Big data + cloud computing glossary for community
PPTX
Machine Learning and Hadoop
PDF
Apache Phoenix with Actor Model (Akka.io) for real-time Big Data Programming...
PDF
Event Driven Architecture with a RESTful Microservices Architecture (Kyle Ben...
PPT
A Look into the Apache OODT Ecosystem
PDF
Solving the Really Big Tech Problems with IoT
Enabling SQL Access to Data Lakes
Mastering Kafka Streams and ksqlDB: Building Real-Time Data Systems by Exampl...
INFO491FinalPaper
IoT and the pervasive nature of fast data and apache spark
IoT and the Pervasive Nature of Fast Data and Apache Spark
ManMachine&Mathematics_Arup_Ray_Ext
Started with-apache-spark
Apache Kafka Use Cases_ When To Use It_ When Not To Use_.pdf
10 things you need to know about Spark
Real Time Stream Data Management Push Based Data in Research Practice Wolfram...
Big Data to SMART Data : Process Scenario
Real Time Stream Data Management Push Based Data in Research Practice Wolfram...
NoLambda: Combining Streaming, Ad-Hoc, Machine Learning and Batch Analysis
Scaling up with Cisco Big Data: Data + Science = Data Science
Big data + cloud computing glossary for community
Machine Learning and Hadoop
Apache Phoenix with Actor Model (Akka.io) for real-time Big Data Programming...
Event Driven Architecture with a RESTful Microservices Architecture (Kyle Ben...
A Look into the Apache OODT Ecosystem
Solving the Really Big Tech Problems with IoT
Ad

Recently uploaded (20)

PPTX
QA PROCESS FLOW CHART (1).pptxaaaaaaaaaaaa
PDF
15901922083_PQA.pdf................................
PDF
Music-and-Arts_jwkskwjsjsjsjsjsjsjdisiaiajsjjzjz
PPTX
668819271-A Relibility CCEPTANCE-SAMPLING.pptx
PPTX
Slides-Archival-Moment-FGCCT-6Feb23.pptx
PPTX
WEEK-3_TOPIC_Photographic_Rays__Its_Nature_and_Characteristics.pptx
PPTX
QA PROCESS FLOW CHART (1).pptxbbbbbbbbbnnnn
PPTX
level measurement foe tttttttttttttttttttttttttttttttttt
PPTX
This is about the usage of color in universities design
PPTX
PPT 1 - Preamble - SPI PPT 2024.bfghfghfhfhfghfggfdgd
PDF
15901922083_ph.cology3.pdf..................................................
PPTX
GREEN BUILDINGS are the ecofriendly buildings
PDF
2025_Mohammad Mahbub KxXxáacscascsacabir.pdf
PPTX
mineralsshow-160112142010.pptxkuygyu buybub
PPTX
GREEN BUILDINGS are eco friendly for environment
PPTX
opp research this is good for field research
PPT
huyfuygkgkugi iyugib jiygi uyuyguygv uyguyv
PPTX
LESSON 2 PUBLIC SPEAKING IS VERY FUN I LOVE IT
PPTX
Operational Research check it out. I like this it is pretty good
PDF
Annah, his young mistress, had ransacked his apartment Morehead on Gauguin an...
QA PROCESS FLOW CHART (1).pptxaaaaaaaaaaaa
15901922083_PQA.pdf................................
Music-and-Arts_jwkskwjsjsjsjsjsjsjdisiaiajsjjzjz
668819271-A Relibility CCEPTANCE-SAMPLING.pptx
Slides-Archival-Moment-FGCCT-6Feb23.pptx
WEEK-3_TOPIC_Photographic_Rays__Its_Nature_and_Characteristics.pptx
QA PROCESS FLOW CHART (1).pptxbbbbbbbbbnnnn
level measurement foe tttttttttttttttttttttttttttttttttt
This is about the usage of color in universities design
PPT 1 - Preamble - SPI PPT 2024.bfghfghfhfhfghfggfdgd
15901922083_ph.cology3.pdf..................................................
GREEN BUILDINGS are the ecofriendly buildings
2025_Mohammad Mahbub KxXxáacscascsacabir.pdf
mineralsshow-160112142010.pptxkuygyu buybub
GREEN BUILDINGS are eco friendly for environment
opp research this is good for field research
huyfuygkgkugi iyugib jiygi uyuyguygv uyguyv
LESSON 2 PUBLIC SPEAKING IS VERY FUN I LOVE IT
Operational Research check it out. I like this it is pretty good
Annah, his young mistress, had ransacked his apartment Morehead on Gauguin an...
Ad

Apache Big D-4.docx

  • 1. Independent of the source of data, the integration of event streams into an Enterprise Architecture gets more and more important in the world of sensors, social media streams and Internet of Things. Events have to be accepted quickly and reliably, they have to be distributed and analyzed, often with many consumers or systems interested in all or part of the events. Storing such huge event streams into HDFS or a NoSQL datastore is feasible and not such a challenge anymore. But if you want to be able to react fast, with minimal latency, you can not afford to first store the data and doing the analysis/analytics later. You have to be able to include part of your analytics right after you consume the data streams. Products for doing event processing, such as Oracle Event Processing or Esper, are available for quite a long time and used to be called Complex Event Processing (CEP). In the past few years, anotherfamily of products appeared, mostly out of the Big Data Technology space, called Stream Processing or Streaming Analytics. These are mostly open source products/frameworks such as Apache Storm, Spark Streaming, Flink, Kafka Streams as well as supporting infrastructures such as Apache Kafka. In this talk I will present the theoretical foundations for Stream Processing, discuss the core properties a Stream Processing platform should provide and highlight what differences you might find between the more traditional CEP and the more modern Stream Processing solutions.