SlideShare a Scribd company logo
1
Is ETL Now a 4‐Letter 
Word? Preparing for 
Streaming Analytics
Analyst commentary
October, 2015
Mark Madsen
Third Nature
@markmadsen
#!$@*ETL!
%*ELT#&*!
Copyright Third Nature, Inc.
In a mostly‐connected world, events occur in 
different time frames, follow different cycles of use
2
Source: Noumenal
Disconnected
Milliseconds Minutes Hours+
Copyright Third Nature, Inc.
In a mostly‐connected world, events occur in 
different time frames, follow different cycles of use
3
Source: Noumenal
Disconnected
Every event is 
persisted for some 
period of time before 
it is forgotten or 
forwarded
Milliseconds Minutes Hours+
Copyright Third Nature, Inc.
In a mostly‐connected world, events occur in 
different time frames, follow different cycles of use
4
Source: Noumenal
Disconnected
Local context 
and control, 
local decisions, 
local latency
Milliseconds Minutes Hours+
Copyright Third Nature, Inc.
In a mostly‐connected world, events occur in 
different time frames, follow different cycles of use
5
Disconnected
Source: Noumenal
Milliseconds Minutes Hours+
Bigger context, 
likely correlated, 
more complex 
rules, external 
monitoring
Copyright Third Nature, Inc.
In a mostly‐connected world, events occur in 
different time frames, follow different cycles of use
6
Disconnected
Source: Noumenal
Milliseconds Minutes Hours+
Broad context, human 
intervention, diagnosis 
and analytical tasks that 
have to be coordinated.
Copyright Third Nature, Inc.
Source: Noumenal
Milliseconds Minutes Hours+
In a mostly‐connected world, events occur in 
different time frames, follow different cycles of use
7
Disconnected
Data lives in multiple places, 
at multiple levels of detail, for 
differing durations. Unlikely to 
all be in one place.
Nor should it be.
Copyright Third Nature, Inc.
We have a model for the persisted portion only
The DW can’t handle real time ingest
▪ One of the original DW design assumptions: solve for 
conflicting workloads by using a different database
▪ Workload management has limits
▪ Scalability problem for event streams
▪ Spiky flow patterns and dynamic scaling
Static schema:
▪ Reaction time ‐ shapes, holes, dropped packets
▪ What happens first, upstream change or data model change?
Polling architectures do not work well for streaming
▪ Introduces latency
▪ Polling creates performance and scaling problems
Copyright Third Nature, Inc.
Activities and functions based on the data flow cycle
Capture
Sensors
Machine 
Data
Logs
Events
Transctions
Table 
changes
Propagate
Filter
Transform
Correlate
Aggregate
Analyze
Classify
Detect 
anomalies
Detect 
patterns
Correlate
Elect
Rules
Algorithms
Select
Coordinate
Effect
Notify
Publish
Approve
Execute
Persist
Database, NoSQL, Files, Hadoop
Copyright Third Nature, Inc.
Flowing Persisted
Sliding window
of “now”
Persisted but not yet
loaded into a platform
Queryable history
Managed history
Streaming isn’t either‐or, it’s part of core architecture
A DB or ETL can get you to within
minutes (at large scale) but it
won’t be easy or cheap; mainly
lives in the realm of history
Event streams, in-mem
stores, CEP streaming
SQL can be used for these
Real time monitoring doesn’t use only real time data: windows, restarts,
detecting deviation, so the above boundaries are crossed.
ESB Cache/Queue Database / platform
Copyright Third Nature, Inc.
Stream
If you want to do realtime and still manage your data 
effectively then you need to think about data architecture
Collect Refine Manage Deliver
Flowing Managed historyPersisted
Microservices Metadata
Metadata & reuse?
Flow, persisted, managed define different access, 
processing, storage and retrieval requirements
Copyright Third Nature, Inc.
About Third Nature
Third Nature is a research and consulting firm focused on new and emerging technology 
and practices in analytics, business intelligence, and performance management. If your 
question is related to data, analytics, information strategy and technology infrastructure 
then you‘re at the right place.
Our goal is to help companies take advantage of information‐driven management 
practices and applications. We offer education, consulting and research services to 
support business and IT organizations as well as technology vendors.
We fill the gap between what the industry analyst firms cover and what IT needs. We 
specialize in product and technology analysis, so we look at emerging technologies and 
markets, evaluating technology and hw it is applied rather than vendor market positions.

More Related Content

PDF
Briefing room: An alternative for streaming data collection
PDF
5 Factors Impacting Your Big Data Project's Performance
PPTX
Security issues in big data
PPSX
Big datarevealed hadoop catalog
PPTX
2016 09 cxo forum
PDF
Big data analysis concepts and references
PDF
IRJET- Systematic Review: Progression Study on BIG DATA articles
PPTX
Big data
Briefing room: An alternative for streaming data collection
5 Factors Impacting Your Big Data Project's Performance
Security issues in big data
Big datarevealed hadoop catalog
2016 09 cxo forum
Big data analysis concepts and references
IRJET- Systematic Review: Progression Study on BIG DATA articles
Big data

What's hot (20)

PPTX
What Is A Datacenter?
PPTX
Big Data & Data Mining
PDF
The Central Hub: Defining the Data Lake
PDF
Stream Meets Batch for Smarter Analytics- Impetus White Paper
PPT
Activity Streaming as Information X-Docking
PPTX
Big data - What is It?
PDF
Lessons from building a stream-first metadata platform | Shirshanka Das, Stealth
PPTX
Ehr challenges [bigdata]
PDF
IoT - Be Open or Miss Out
PDF
Data Science London - Meetup, 28/05/15
PPTX
Big data and data mining
PPT
Big Data
PDF
Big Data and Health Care
PPTX
SQL Server 2008 R2 StreamInsight
PDF
What the IoT should learn from the life sciences
PPTX
Big Data Driven Solutions to Combat Covid' 19
PPTX
PPTX
No big data without small data
PDF
Big data and open access: a collision course for science
PDF
Expanded top ten_big_data_security_and_privacy_challenges
What Is A Datacenter?
Big Data & Data Mining
The Central Hub: Defining the Data Lake
Stream Meets Batch for Smarter Analytics- Impetus White Paper
Activity Streaming as Information X-Docking
Big data - What is It?
Lessons from building a stream-first metadata platform | Shirshanka Das, Stealth
Ehr challenges [bigdata]
IoT - Be Open or Miss Out
Data Science London - Meetup, 28/05/15
Big data and data mining
Big Data
Big Data and Health Care
SQL Server 2008 R2 StreamInsight
What the IoT should learn from the life sciences
Big Data Driven Solutions to Combat Covid' 19
No big data without small data
Big data and open access: a collision course for science
Expanded top ten_big_data_security_and_privacy_challenges
Ad

Viewers also liked (10)

PDF
A Pragmatic Approach to Analyzing Customers
PDF
Crossing the chasm with a high performance dynamically scalable open source p...
PDF
Everything has changed except us
PDF
Determine the Right Analytic Database: A Survey of New Data Technologies
PDF
The State of Open Source BI Adoption
PDF
Bi isn't big data and big data isn't BI (updated)
PDF
On the edge: analytics for the modern enterprise (analyst comments)
PDF
Disruptive Innovation: how do you use these theories to manage your IT?
PDF
Building the Enterprise Data Lake: A look at architecture
PPT
Third Nature - Open Source Data Warehousing
A Pragmatic Approach to Analyzing Customers
Crossing the chasm with a high performance dynamically scalable open source p...
Everything has changed except us
Determine the Right Analytic Database: A Survey of New Data Technologies
The State of Open Source BI Adoption
Bi isn't big data and big data isn't BI (updated)
On the edge: analytics for the modern enterprise (analyst comments)
Disruptive Innovation: how do you use these theories to manage your IT?
Building the Enterprise Data Lake: A look at architecture
Third Nature - Open Source Data Warehousing
Ad

Similar to Briefing Room analyst comments - streaming analytics (17)

PDF
Christmas Border Writing Paper - Thejudgereport674.
PPTX
DevOps Enterprise Summit 2019 - How Swarming Enables Enterprise Support to wo...
PPTX
Apache Pulsar, Supporting the Entire Lifecycle of Streaming Data
PDF
The History Of Computers Essay
PDF
Essay On Crackers Should Be Banned
PPTX
Dealing with delayed events in Splunk
PPTX
What does "monitoring" mean? (FOSDEM 2017)
PDF
Article Writer Proposal Sample For Freelancer
PDF
SRECon Coherent Performance
PDF
Descriptive Essay Colleges Without Supplemental Essa
PDF
Writing A Response Paper
PDF
300 Word Essay Example Telegraph. Online assignment writing service.
PDF
Evolutionary architecture: What can we learn from Nature?
PDF
AdhearsionConf 2013 Keynote
PDF
How To Write A Conclusion To A Research Paper
PDF
Utopoll Whitepaper.pdf
PDF
Agilists4Planet 2.0 - Leverage Points: Most impactful places to intervene in ...
Christmas Border Writing Paper - Thejudgereport674.
DevOps Enterprise Summit 2019 - How Swarming Enables Enterprise Support to wo...
Apache Pulsar, Supporting the Entire Lifecycle of Streaming Data
The History Of Computers Essay
Essay On Crackers Should Be Banned
Dealing with delayed events in Splunk
What does "monitoring" mean? (FOSDEM 2017)
Article Writer Proposal Sample For Freelancer
SRECon Coherent Performance
Descriptive Essay Colleges Without Supplemental Essa
Writing A Response Paper
300 Word Essay Example Telegraph. Online assignment writing service.
Evolutionary architecture: What can we learn from Nature?
AdhearsionConf 2013 Keynote
How To Write A Conclusion To A Research Paper
Utopoll Whitepaper.pdf
Agilists4Planet 2.0 - Leverage Points: Most impactful places to intervene in ...

More from mark madsen (20)

PDF
Data Architecture: OMG It’s Made of People
PDF
Solve User Problems: Data Architecture for Humans
PDF
The Black Box: Interpretability, Reproducibility, and Data Management
PDF
Operationalizing Machine Learning in the Enterprise
PDF
Building a Data Platform Strata SF 2019
PDF
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
PDF
Architecting a Platform for Enterprise Use - Strata London 2018
PDF
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
PDF
How to understand trends in the data & software market
PDF
Pay no attention to the man behind the curtain - the unseen work behind data ...
PDF
Assumptions about Data and Analysis: Briefing room webcast slides
PDF
Everything Has Changed Except Us: Modernizing the Data Warehouse
PDF
Don't let data get in the way of a good story
PDF
Big Data and Bad Analogies
PDF
Don't follow the followers
PDF
Exploring cloud for data warehousing
PDF
Open Data: Free Data Isn't the Same as Freeing Data
PDF
Exploring cloud for data warehousing
PDF
Wake up and smell the data
PDF
Big Data Wonderland: Two Views on the Big Data Revolution
Data Architecture: OMG It’s Made of People
Solve User Problems: Data Architecture for Humans
The Black Box: Interpretability, Reproducibility, and Data Management
Operationalizing Machine Learning in the Enterprise
Building a Data Platform Strata SF 2019
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Platform for Enterprise Use - Strata London 2018
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
How to understand trends in the data & software market
Pay no attention to the man behind the curtain - the unseen work behind data ...
Assumptions about Data and Analysis: Briefing room webcast slides
Everything Has Changed Except Us: Modernizing the Data Warehouse
Don't let data get in the way of a good story
Big Data and Bad Analogies
Don't follow the followers
Exploring cloud for data warehousing
Open Data: Free Data Isn't the Same as Freeing Data
Exploring cloud for data warehousing
Wake up and smell the data
Big Data Wonderland: Two Views on the Big Data Revolution

Recently uploaded (20)

PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
Introduction to Knowledge Engineering Part 1
PDF
Introduction to Business Data Analytics.
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
Fluorescence-microscope_Botany_detailed content
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Global journeys: estimating international migration
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
Supervised vs unsupervised machine learning algorithms
Introduction to Knowledge Engineering Part 1
Introduction to Business Data Analytics.
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
1_Introduction to advance data techniques.pptx
Business Acumen Training GuidePresentation.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Fluorescence-microscope_Botany_detailed content
Miokarditis (Inflamasi pada Otot Jantung)
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Global journeys: estimating international migration
STUDY DESIGN details- Lt Col Maksud (21).pptx
.pdf is not working space design for the following data for the following dat...
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Introduction-to-Cloud-ComputingFinal.pptx

Briefing Room analyst comments - streaming analytics