SlideShare a Scribd company logo
Stream Processing
Key Driver for Enabling Instant Insights on Big Data
Mohit Jotwani
Product Manager, DataTorrent
Why is Stream Processing Vital?
SOURCE DATA
MS Queue’s
Events
XML Files
Databases
Sensor data
Social
Enterprise
Repositories
RDBMS
EDW
NoSQL
Feed m
Feed 2
Feed 1
Load
(Optional)
Staging Area
Traditional Analytics – Data at Rest
Business Analytics
Business Intelligence
Visualization Tools
Visualize
Analyze
Extract Transform
Feed n
Feed 2
Feed 1
Visualize
Next Generation – Data in Motion
• Organizations need to react to changing business conditions in real time
• Faster decision making across all industries
• Few companies outside of financial markets, telecom & utilities have experience with
streaming
• Newer data sources – like sensors, social media feeds
• Higher Volume and Greater Velocity
• More unstructured and semi-structured data
• Democratization of technologies
• Open Source Projects
• Large Scale Compute & Storage – Hadoop, NoSQL
• Streaming Technologies – Apex, Spark, Storm etc.
• Real-time dashboards and alert notification systems
• Beyond niche use cases
• Broad applicability but needs more adoption
Stream vs. Batch Processing Pipelines
Transform Analyze Action
Visualize/
Persist
Ingest
Extract Transform Load Analyze Action
Stream Processing
•Continuous processing on data as it flows through a
system
•Allows users to act on events instantaneously via
alerts
•Processing related to time (event time vs. processing
time)
•Real-Time – diff between event time and processing
time is negligible
Enables your Data In Motion Architecture
Big Data Application Types
IoT
Fraud
CDR
CDC
Reporting
SQL
Operations
Data Discovery
SQL on
Streams
Streaming
Disovery
Sample Streaming Analytics Patterns
Preprocessing
• Filtering events
• Transforming
attributes
Alerts & Thresholds
• Based on complex
conditions
Computing within
Windows
• Aggregations
Combining Event
Streams
• Correlation
• Error detection
Enrichment
• Looking up database,
reference data
Temporal Events
• Detecting events
within time windows
Tracking
• Tracking events over
space & time
Trend Detection
• Rise, Fall
• Outliers
Source: https://iwringer.wordpress.com/2015/08/03/patterns-for-streaming-realtime-analytics/
Stream Processing Use Cases
Financial Services
• Detect fraudulent activity in real-time
• Risk Analysis
• Deliver personalized products and
offerings
• Make decisions in real-time for trading
and transactional platforms
Financial services big data fabric
Telecom
• Real-time network monitoring and
protection
• Quality of service and Customer
Satisfaction
• Take action based on users’ location
• Automatic resource allocation and load
balancing
Online Advertising
• Dynamic bidding
• Real-time targeting & personalization
• Maximize click-through and
conversion rates.
• Reporting that can be updated
continuously
Online advertising dynamic inventory purchases
Internet of Things
• Environment monitoring
• Infrastructure management
• Manufacturing
• Energy management
• Public Building & Home automation
• Transportation
IoT secure ingestion and predictive analysis
High performance, multi-
customer secure, data ingestion.
Complex event processing with
historical data for predictive
maintenance
Sensor 2
Sensor 1
Sensor N
Application n
Application 1
Persistent
Data
Governance
Complex
Event Process
Predictive
maintenance
Stream Processing:
Conclusion
•Lots of untapped potential!
•Gives your business a competitive edge!
•Open Source and Big Data
technologies
•Built to address the scale and latency
demands
•Broad use cases
•Across industries and verticals
Spark meetup   stream processing use cases
Spark meetup   stream processing use cases
Hadoop Ingestion Made Easy
https://www.brighttalk.com/webcast/13685/194937/hadoop-ingestion-made-easy-with-
datatorrent-dtingest
•
•
•
•
•
•
•
https://www.datatorrent.com/careers/
indiajobs@datatorrent.com
Thank You

More Related Content

PPTX
PDF
IT-Analytics: Screen your IT processes with BI Technology
PDF
Under the Hood of Totango's Award Winning Technology
PDF
When Heroes Become Superheroes Using Apps
PPTX
ParStream - Big Data for Business Users
PPTX
2016 DSG Webinar Azure HDInsight 2 V4
PDF
DCiM_BIG_DATA
PDF
Partner Transformation for Hybrid Cloud Management
IT-Analytics: Screen your IT processes with BI Technology
Under the Hood of Totango's Award Winning Technology
When Heroes Become Superheroes Using Apps
ParStream - Big Data for Business Users
2016 DSG Webinar Azure HDInsight 2 V4
DCiM_BIG_DATA
Partner Transformation for Hybrid Cloud Management

What's hot (19)

PPTX
How Internet of Things Works
PPT
Emergence of ITOA: An Evolution in IT Monitoring and Management
PPTX
Big Data and Semantic Web in Manufacturing
PPTX
Measuring the Success of Cloud-Based Services
PPTX
Cloud Computing Introduction and Awareness
PPTX
Key Data Management Requirements for the IoT
PDF
DWS17 - Plenary Session : Big technological bets - Anukool LAKIHINA - Guavus
PPTX
Why You Should Be Using IoT Technologies for More Than Just IoT
PDF
RS2014_Perth_Synergy_FMevis_AVesselForChangeInSynergy
PDF
Modernizing Data Architecture using Data Virtualization for Agile Data Delivery
PDF
02 a holistic approach to big data
PPTX
Eliminate Data Entry with Document Scanning, Data Capture and Extraction - PS...
PDF
A Real-Time Version of the Truth
PDF
Why the Cloud?
PDF
Datenvirtualisierung: Wie Sie Ihre Datenarchitektur agiler machen (German)
PDF
Web Analytics Wednesday Melbourne Meet Up
PDF
Big Data and Implications on Platform Architecture
PPTX
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
PPTX
Big data lab as a service
How Internet of Things Works
Emergence of ITOA: An Evolution in IT Monitoring and Management
Big Data and Semantic Web in Manufacturing
Measuring the Success of Cloud-Based Services
Cloud Computing Introduction and Awareness
Key Data Management Requirements for the IoT
DWS17 - Plenary Session : Big technological bets - Anukool LAKIHINA - Guavus
Why You Should Be Using IoT Technologies for More Than Just IoT
RS2014_Perth_Synergy_FMevis_AVesselForChangeInSynergy
Modernizing Data Architecture using Data Virtualization for Agile Data Delivery
02 a holistic approach to big data
Eliminate Data Entry with Document Scanning, Data Capture and Extraction - PS...
A Real-Time Version of the Truth
Why the Cloud?
Datenvirtualisierung: Wie Sie Ihre Datenarchitektur agiler machen (German)
Web Analytics Wednesday Melbourne Meet Up
Big Data and Implications on Platform Architecture
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
Big data lab as a service
Ad

Viewers also liked (10)

PPTX
Spark streaming high level overview
PDF
Reactive Streams 1.0 and Akka Streams
PDF
Stream Processing in SmartNews #jawsdays
POTX
Apache Spark Streaming: Architecture and Fault Tolerance
PDF
Building a Sustainable Data Platform on AWS
PPT
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
PPTX
Apache Flink: Real-World Use Cases for Streaming Analytics
PDF
Fault Tolerance in Spark: Lessons Learned from Production: Spark Summit East ...
PDF
Stream all the things
PDF
Building Realtime Data Pipelines with Kafka Connect and Spark Streaming: Spar...
Spark streaming high level overview
Reactive Streams 1.0 and Akka Streams
Stream Processing in SmartNews #jawsdays
Apache Spark Streaming: Architecture and Fault Tolerance
Building a Sustainable Data Platform on AWS
Real-time Streaming Analytics: Business Value, Use Cases and Architectural Co...
Apache Flink: Real-World Use Cases for Streaming Analytics
Fault Tolerance in Spark: Lessons Learned from Production: Spark Summit East ...
Stream all the things
Building Realtime Data Pipelines with Kafka Connect and Spark Streaming: Spar...
Ad

Similar to Spark meetup stream processing use cases (20)

PPTX
Real time data integration best practices and architecture
PPTX
2016 DSG Webinar Azure HDInsight 2 V4
PPTX
Assessing New Databases– Translytical Use Cases
PPTX
WebAction-Sami Abkay
PPTX
How to Use Big Data to Transform IT Operations
PPTX
Hadoop in the Cloud: Common Architectural Patterns
PDF
Apache Kafka® Use Cases for Financial Services
PPTX
Big Data and Analytics
PPTX
Big Data and Analytics
PPTX
WebAction In-Memory Computing Summit 2015
PDF
Moving Targets: Harnessing Real-time Value from Data in Motion
PPTX
Big data streaming with Apache Spark on Azure
PDF
Real Time Business Platform by Ivan Novick from Pivotal
PPTX
Wikibon #IoT #HyperConvergence Presentation via @theCUBE
PPTX
Hyper-Convergence CrowdChat
PDF
Kafka and Stream Processing, Taking Analytics Real-time, Mike Spicer
PPTX
4th Industrial Revolution
PDF
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
PDF
Machine Data Analytics
PDF
Innovating With Data and Analytics
Real time data integration best practices and architecture
2016 DSG Webinar Azure HDInsight 2 V4
Assessing New Databases– Translytical Use Cases
WebAction-Sami Abkay
How to Use Big Data to Transform IT Operations
Hadoop in the Cloud: Common Architectural Patterns
Apache Kafka® Use Cases for Financial Services
Big Data and Analytics
Big Data and Analytics
WebAction In-Memory Computing Summit 2015
Moving Targets: Harnessing Real-time Value from Data in Motion
Big data streaming with Apache Spark on Azure
Real Time Business Platform by Ivan Novick from Pivotal
Wikibon #IoT #HyperConvergence Presentation via @theCUBE
Hyper-Convergence CrowdChat
Kafka and Stream Processing, Taking Analytics Real-time, Mike Spicer
4th Industrial Revolution
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
Machine Data Analytics
Innovating With Data and Analytics

Recently uploaded (20)

PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Encapsulation theory and applications.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Electronic commerce courselecture one. Pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PPT
Teaching material agriculture food technology
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Programs and apps: productivity, graphics, security and other tools
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Network Security Unit 5.pdf for BCA BBA.
The Rise and Fall of 3GPP – Time for a Sabbatical?
Encapsulation theory and applications.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Review of recent advances in non-invasive hemoglobin estimation
Mobile App Security Testing_ A Comprehensive Guide.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Electronic commerce courselecture one. Pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Digital-Transformation-Roadmap-for-Companies.pptx
MYSQL Presentation for SQL database connectivity
20250228 LYD VKU AI Blended-Learning.pptx
Spectral efficient network and resource selection model in 5G networks
Teaching material agriculture food technology
The AUB Centre for AI in Media Proposal.docx
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx

Spark meetup stream processing use cases

  • 1. Stream Processing Key Driver for Enabling Instant Insights on Big Data Mohit Jotwani Product Manager, DataTorrent
  • 2. Why is Stream Processing Vital?
  • 3. SOURCE DATA MS Queue’s Events XML Files Databases Sensor data Social Enterprise Repositories RDBMS EDW NoSQL Feed m Feed 2 Feed 1 Load (Optional) Staging Area Traditional Analytics – Data at Rest Business Analytics Business Intelligence Visualization Tools Visualize Analyze Extract Transform Feed n Feed 2 Feed 1 Visualize
  • 4. Next Generation – Data in Motion • Organizations need to react to changing business conditions in real time • Faster decision making across all industries • Few companies outside of financial markets, telecom & utilities have experience with streaming • Newer data sources – like sensors, social media feeds • Higher Volume and Greater Velocity • More unstructured and semi-structured data • Democratization of technologies • Open Source Projects • Large Scale Compute & Storage – Hadoop, NoSQL • Streaming Technologies – Apex, Spark, Storm etc. • Real-time dashboards and alert notification systems • Beyond niche use cases • Broad applicability but needs more adoption
  • 5. Stream vs. Batch Processing Pipelines Transform Analyze Action Visualize/ Persist Ingest Extract Transform Load Analyze Action
  • 6. Stream Processing •Continuous processing on data as it flows through a system •Allows users to act on events instantaneously via alerts •Processing related to time (event time vs. processing time) •Real-Time – diff between event time and processing time is negligible Enables your Data In Motion Architecture
  • 7. Big Data Application Types IoT Fraud CDR CDC Reporting SQL Operations Data Discovery SQL on Streams Streaming Disovery
  • 8. Sample Streaming Analytics Patterns Preprocessing • Filtering events • Transforming attributes Alerts & Thresholds • Based on complex conditions Computing within Windows • Aggregations Combining Event Streams • Correlation • Error detection Enrichment • Looking up database, reference data Temporal Events • Detecting events within time windows Tracking • Tracking events over space & time Trend Detection • Rise, Fall • Outliers Source: https://iwringer.wordpress.com/2015/08/03/patterns-for-streaming-realtime-analytics/
  • 10. Financial Services • Detect fraudulent activity in real-time • Risk Analysis • Deliver personalized products and offerings • Make decisions in real-time for trading and transactional platforms
  • 11. Financial services big data fabric
  • 12. Telecom • Real-time network monitoring and protection • Quality of service and Customer Satisfaction • Take action based on users’ location • Automatic resource allocation and load balancing
  • 13. Online Advertising • Dynamic bidding • Real-time targeting & personalization • Maximize click-through and conversion rates. • Reporting that can be updated continuously
  • 14. Online advertising dynamic inventory purchases
  • 15. Internet of Things • Environment monitoring • Infrastructure management • Manufacturing • Energy management • Public Building & Home automation • Transportation
  • 16. IoT secure ingestion and predictive analysis High performance, multi- customer secure, data ingestion. Complex event processing with historical data for predictive maintenance Sensor 2 Sensor 1 Sensor N Application n Application 1 Persistent Data Governance Complex Event Process Predictive maintenance
  • 17. Stream Processing: Conclusion •Lots of untapped potential! •Gives your business a competitive edge! •Open Source and Big Data technologies •Built to address the scale and latency demands •Broad use cases •Across industries and verticals
  • 20. Hadoop Ingestion Made Easy https://www.brighttalk.com/webcast/13685/194937/hadoop-ingestion-made-easy-with- datatorrent-dtingest