Analytics in Your Enterprise
Dakshitha Ratnayake
Lead Solutions Engineer
What is Analytics?
• Organizations have more data than ever at their disposal.
• Actually deriving meaningful insights from that data, and converting knowledge into action, is easier said than done.
• No single technology encompasses big data analytics.
• Several types of technology work together to help organizations get the most value from their information.
Big Data Analytics
Real-World Applications
Portfolio analysis, and prediction of the impact of global events on financial markets.
Customer experience management, and network capacity planning and optimization.
Music recommendations based on user data.
Predict what customers want to see before they know it themselves!
Song identification, and prediction of the artists and genres that will get attention in the coming years.
Monitor financial market activity and catch illegal insider trading.
Track patient signs using sensor data.
Reduce claims costs through better fraud detection.
Detect and prevent cyber-attacks and criminal activity.
Predict trends and prepare plans to meet future demand.
Measure player efficiency and defensive effectiveness.
Source - http://upxacademy.com/2016/05/31/big-data-and-analytics-use-cases-in-8-industries/
WSO2 Analytics Platform
• A single platform to address all analytics styles
• We deliver:
• Batch Analytics
• Real-time Analytics
• Interactive Analytics
• Predictive Analytics
• WSO2 Analytics Platform uniquely combines the above styles to turn data from IoT, mobile and Web apps into actionable insights.
WSO2 Analytics Platform
“Publish once, process any way you like.”
Data Analysis
• High-level, SQL-like query languages
• Client applications are agnostic of the analytics components
• Common set of receivers/publishers for all analytics types
• Common format for events
• Leverage leading open source projects (e.g. Storm and Spark) and contribute back (e.g. Siddhi).
Analytics Strategy
• Open Source
• Rich, extensible, SQL-like configuration language
• Rich set of data connectors, which can be easily extended
• Events only need to be published once from applications to the platform, and can then be consumed by the batch or the real-time pipeline.
• Part of the overall WSO2 platform
Key Differentiators
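The "publish once" idea can be illustrated with a small fan-out sketch. This is plain Java written for illustration, not WSO2 code: `EventHub`, `register` and `run` are hypothetical names, the list stands in for persisted event data, and the counter stands in for a real-time rule.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/** Hypothetical sketch: one publish call, several independent consumers. */
final class PublishOnceSketch {

    /** Fans each published event out to every registered pipeline. */
    static final class EventHub {
        private final List<Consumer<String>> pipelines = new ArrayList<>();

        void register(Consumer<String> pipeline) {
            pipelines.add(pipeline);
        }

        void publish(String event) {
            for (Consumer<String> pipeline : pipelines) {
                pipeline.accept(event);
            }
        }
    }

    /** Publishes each log line once; returns {lines stored for batch, real-time alerts}. */
    static int[] run(List<String> logLines) {
        List<String> batchStore = new ArrayList<>(); // stands in for persisted event data
        int[] alerts = {0};                          // stands in for a real-time rule

        EventHub hub = new EventHub();
        hub.register(batchStore::add);               // batch pipeline: keep everything
        hub.register(line -> {                       // real-time pipeline: react immediately
            if (line.endsWith(" 500")) {
                alerts[0]++;
            }
        });

        for (String line : logLines) {
            hub.publish(line);                       // the application publishes only once
        }
        return new int[] {batchStore.size(), alerts[0]};
    }

    public static void main(String[] args) {
        int[] result = run(List.of("GET / 200", "GET /api 500", "GET /img 200"));
        System.out.println("stored=" + result[0] + ", alerts=" + result[1]);
    }
}
```

The publishing application never learns which pipelines exist, which is the sense in which clients stay agnostic of the analytics components.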
Data Collection and Publishing
Collecting Data
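// Point the data agent at its configuration file
AgentHolder.setConfigPath(getDataAgentConfigPath());
// Connect to the receiver server (WSO2 DAS/CEP)
DataPublisher dataPublisher = new DataPublisher(url, username, password);
// Build the stream ID from the stream name and version
String streamId = DataBridgeCommonsUtils.generateStreamId(HTTPD_LOG_STREAM, VERSION);
// Arguments after the timestamp: meta data, correlation data (none here), payload data
Event event = new Event(streamId, System.currentTimeMillis(),
        new Object[]{"external"}, null, new Object[]{aLog});
dataPublisher.publish(event);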
Collecting Data: Example
Initialize the data publisher
Generate the stream ID for the stream to which the event will be published
Create and publish the event
As a prerequisite, the streams must be defined in the receiver server (WSO2 DAS/CEP).
• Events are the lifeline of WSO2 CEP/DAS.
• They not only process data as events, but also interact with external systems using events.
• An event is a unit of data.
• An event stream is a sequence of events of a particular type.
• The type of the events in a stream is defined by an event stream definition.
Events, Streams and Event Stream Definitions
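The relationship between the three terms can be sketched in plain Java. The classes below are hypothetical stand-ins, not the WSO2 data bridge classes; the `name:version` form of the stream ID and the attribute-count check are assumptions made for this illustration.

```java
import java.util.List;

/** Hypothetical stand-ins for events and event stream definitions. */
final class EventStreamSketch {

    /** A unit of data that belongs to exactly one stream. */
    static final class Event {
        final String streamId;
        final long timestamp;
        final Object[] payload;

        Event(String streamId, long timestamp, Object[] payload) {
            this.streamId = streamId;
            this.timestamp = timestamp;
            this.payload = payload;
        }
    }

    /** Describes the type of events a stream carries. */
    static final class StreamDefinition {
        final String name;
        final String version;
        final List<String> payloadAttributes;

        StreamDefinition(String name, String version, List<String> payloadAttributes) {
            this.name = name;
            this.version = version;
            this.payloadAttributes = payloadAttributes;
        }

        /** Assumption: a stream is identified by its name and version together. */
        String streamId() {
            return name + ":" + version;
        }

        /** An event fits when it targets this stream and has one value per attribute. */
        boolean accepts(Event event) {
            return streamId().equals(event.streamId)
                    && event.payload.length == payloadAttributes.size();
        }
    }

    public static void main(String[] args) {
        StreamDefinition httpdLogs =
                new StreamDefinition("org.example.httpd.log", "1.0.0", List.of("log"));
        Event event = new Event(httpdLogs.streamId(), System.currentTimeMillis(),
                new Object[] {"GET / 200"});
        System.out.println(httpdLogs.streamId() + " accepts event: " + httpdLogs.accepts(event));
    }
}
```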
Publishing Data
Data Analysis
Batch Analytics
Generating insight by processing large amounts of stored data
● KPI Statistics
○ Application Statistics Monitoring
○ Network / Service Statistics
○ Sensor Data Aggregation
● Solving Optimization Problems
○ Urban Planning
○ Revenue Distribution Analysis
Source: www.e-deal.com
• Batch analytics reads data from disk (or some other storage) and processes it record by record
• “MapReduce” is the most widely used technology for batch analytics
- Apache Hadoop
- Apache Spark: up to 30x faster and much more flexible
• Analytics (min, max, average, correlation, histograms; may join or group data in many ways)
• Key performance indicators (KPIs)
- e.g. profit per square foot for retail
• Presented as a dashboard
Batch Analytics
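As a rough illustration of the record-by-record style described above, the sketch below computes summary statistics and the retail KPI (profit per square foot) over a small in-memory list. It is plain Java, not Hadoop or Spark; the `Sale` class, the store names and the floor areas are invented for the example, and every store is assumed to have a known floor area.

```java
import java.util.DoubleSummaryStatistics;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical batch job over stored sales records. */
final class BatchKpiSketch {

    /** One stored record: the profit from a single sale in a store. */
    static final class Sale {
        final String store;
        final double profit;

        Sale(String store, double profit) {
            this.store = store;
            this.profit = profit;
        }
    }

    /** Min, max, average and count of profit across all records. */
    static DoubleSummaryStatistics profitStats(List<Sale> sales) {
        return sales.stream().mapToDouble(s -> s.profit).summaryStatistics();
    }

    /** Groups records by store, sums the profit, then divides by floor area. */
    static Map<String, Double> profitPerSquareFoot(List<Sale> sales,
                                                   Map<String, Double> floorAreaSqFt) {
        Map<String, Double> totalProfit = new TreeMap<>();
        for (Sale s : sales) {                       // scan record by record
            totalProfit.merge(s.store, s.profit, Double::sum);
        }
        Map<String, Double> kpi = new TreeMap<>();
        for (Map.Entry<String, Double> e : totalProfit.entrySet()) {
            // Assumes floorAreaSqFt has an entry for every store in the data.
            kpi.put(e.getKey(), e.getValue() / floorAreaSqFt.get(e.getKey()));
        }
        return kpi;
    }

    public static void main(String[] args) {
        List<Sale> sales = List.of(
                new Sale("A", 100.0), new Sale("A", 200.0), new Sale("B", 50.0));
        Map<String, Double> area = Map.of("A", 100.0, "B", 25.0);
        System.out.println(profitStats(sales));
        System.out.println(profitPerSquareFoot(sales, area));
    }
}
```

The resulting per-store values are the kind of figures a dashboard would then present.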
• Powered by Apache Spark
• Up to 30x higher performance than Hadoop
• Script-based analytics powered by Spark SQL
• Persist data in a database (RDBMS or non-RDBMS), process it using Spark queries, and persist the analyzed data in an RDBMS
WSO2 Data Analytics Server
Batch Analytics With DAS
WSO2 DAS In Action: API Statistics
DAS In Action: API Statistics
DAS In Action: HTTP Monitoring
Real-Time Analytics
Making sense of fast-moving data
● Sports
○ Real-time Analysis of Player Performance
○ Real-time Match Analysis
● Geo-Spatial
○ Traffic Monitoring and Alerting
○ Geo-fencing
● Finance
○ Stock Market Monitoring
● Anomaly Detection
○ Fraud Detection
○ Network Intrusion Detection
○ Server Health Monitoring
Source: www.promojam.com
• For some use cases, the value of insights degrades very quickly with time.
• We need technology that can produce outputs fast.
• Static queries that need very fast output (alerts, real-time control)
• Dynamic and interactive queries (data exploration)
Real-Time Analytics
• WSO2 CEP facilitates
• Real-time event detection
• Correlation
• Notifications/alerts, visualization tools
• Siddhi: a high-performance stream processing engine
• WSO2 CEP is configured using the Siddhi query language
• Suited for complex queries involving time windows, as well as pattern and sequence detection.
• CEP queries can be changed dynamically at runtime using templates.
WSO2 Complex Event Processor
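To make the idea of a time-window query concrete, here is a plain-Java sketch of a sliding window that raises an alert when too many events arrive within a given period. It is not Siddhi and does not use any WSO2 API; the class name, the threshold rule, and the assumption that timestamps arrive in non-decreasing order are all choices made for this illustration.

```java
import java.util.ArrayDeque;
import java.util.Deque;

/** Hypothetical sliding time window: alert when too many events fall inside it. */
final class SlidingWindowSketch {
    private final long windowMillis;
    private final int threshold;
    private final Deque<Long> timestamps = new ArrayDeque<>();

    SlidingWindowSketch(long windowMillis, int threshold) {
        this.windowMillis = windowMillis;
        this.threshold = threshold;
    }

    /**
     * Records one event and returns true when the window now holds more than
     * {@code threshold} events. Assumes timestamps arrive in non-decreasing order.
     */
    boolean onEvent(long timestampMillis) {
        timestamps.addLast(timestampMillis);
        // Evict events that are at least one full window older than the newest one.
        while (timestamps.peekFirst() <= timestampMillis - windowMillis) {
            timestamps.removeFirst();
        }
        return timestamps.size() > threshold;
    }

    public static void main(String[] args) {
        // Alert when more than 2 failed logins occur within 1 second.
        SlidingWindowSketch failedLogins = new SlidingWindowSketch(1000, 2);
        long[] arrivals = {0, 100, 200, 1500};
        for (long t : arrivals) {
            System.out.println("t=" + t + " alert=" + failedLogins.onEvent(t));
        }
    }
}
```

A stream processing engine expresses the same rule declaratively as a query, so the window bookkeeping shown here does not have to be hand-written.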
Real-Time Analytics With WSO2 CEP
Real-time Analytics In Action
Interactive Analytics
Near Real-time Indexed Data Search
● Log Analysis
○ Application / System Logs
● Activity Monitoring
○ Tracking Message Flows
● Fraud Detection
○ Executing queries to look up related data when fraud is detected
● HL7 Data Exploration
○ ESB HL7 Transport Interfaced with DAS
Source: befoundonline.com
• The best way to explore data is by asking ad-hoc questions
• Interactive analytics (search) lets you query the system and receive fast results (<10s)
• Shows data in context (e.g. by grouping events from the same transaction together)
• Built using Lucene-based indexes.
Interactive Analytics with WSO2 DAS
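Indexed search is fast because queries consult a prebuilt index instead of scanning every record. The sketch below shows the underlying idea with a tiny inverted index in plain Java. It does not use Lucene or DAS; the class, its tokenization rule (split on non-word characters) and the AND semantics of `search` are simplifying assumptions for this illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Hypothetical inverted index: maps each term to the ids of the lines containing it. */
final class InvertedIndexSketch {
    private final Map<String, Set<Integer>> postings = new HashMap<>();
    private final List<String> documents = new ArrayList<>();

    /** Stores a log line, indexes its terms, and returns the id assigned to it. */
    int add(String text) {
        int id = documents.size();
        documents.add(text);
        for (String term : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!term.isEmpty()) {
                postings.computeIfAbsent(term, k -> new TreeSet<>()).add(id);
            }
        }
        return id;
    }

    /** Returns the ids of lines that contain every term of the query. */
    Set<Integer> search(String query) {
        Set<Integer> result = null;
        for (String term : query.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (term.isEmpty()) {
                continue;
            }
            Set<Integer> ids = postings.getOrDefault(term, Collections.emptySet());
            if (result == null) {
                result = new TreeSet<>(ids);   // copy so the index itself is not modified
            } else {
                result.retainAll(ids);         // intersect with the next term's lines
            }
        }
        return result == null ? Collections.emptySet() : result;
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.add("ERROR payment timeout tx=42");
        index.add("INFO payment ok tx=42");
        index.add("ERROR login failed");
        System.out.println(index.search("error payment"));
        System.out.println(index.search("tx 42"));
    }
}
```

Searching for a transaction id returns every line from that transaction, which is one simple way to show data in context.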
Interactive Analytics In Action: WSO2 DAS
Predictive Analytics
Analyze Existing Data to Predict Future Events
● Next Value Prediction
○ Sales Forecasts
○ Electricity Loads
● Classification
○ Product Categorization
○ Customer Segmentation
● Anomaly Detection
○ Fraud Detection
○ Preventive Maintenance
● Other
○ Handwriting recognition
• Machine learning
• Takes in a lot of examples, and builds a program that matches those examples.
• Specifically, that program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.
• We call that program a “model”
• A lot of machine learning tools
• R (statistical language)
• scikit-learn (Python)
• Apache Spark’s MLlib and Apache Mahout (Java)
Predictive Analytics
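A minimal example of "building a program that matches the examples" is a straight-line fit used for next-value prediction. The sketch below is plain Java using ordinary least squares; it is not Spark MLlib or any of the tools listed above, and the monthly sales figures are invented.

```java
/** Hypothetical next-value predictor: fits a straight line to past observations. */
final class NextValueSketch {

    /** Fits y = slope * x + intercept by ordinary least squares; returns {slope, intercept}. */
    static double[] fit(double[] x, double[] y) {
        int n = x.length;
        double meanX = 0.0;
        double meanY = 0.0;
        for (int i = 0; i < n; i++) {
            meanX += x[i];
            meanY += y[i];
        }
        meanX /= n;
        meanY /= n;

        double sxy = 0.0;
        double sxx = 0.0;
        for (int i = 0; i < n; i++) {
            sxy += (x[i] - meanX) * (y[i] - meanY);
            sxx += (x[i] - meanX) * (x[i] - meanX);
        }
        double slope = sxy / sxx;   // assumes the x values are not all identical
        return new double[] {slope, meanY - slope * meanX};
    }

    /** Applies the fitted model to a new input. */
    static double predict(double[] model, double x) {
        return model[0] * x + model[1];
    }

    public static void main(String[] args) {
        double[] month = {1, 2, 3, 4};
        double[] sales = {10, 20, 30, 40};   // made-up training examples
        double[] model = fit(month, sales);  // the "model" learned from the examples
        System.out.println("forecast for month 5: " + predict(model, 5));
    }
}
```

In the terms of the definition above, the examples are the experience E, forecasting is the task T, and the squared error minimized by the fit plays the role of the performance measure P.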
• Powered by Apache Spark MLlib
• Analyze data using machine learning algorithms
• Build machine learning models
• Compare and manage generated machine learning models
• Predict using the built models
Predictive Analytics with WSO2 Machine Learner
Predictive Analytics With WSO2 ML
Predictive Analytics In Action: WSO2 ML
Home-Grown Solutions
WSO2 Solutions Based on the Analytics Platform
● WSO2 Fraud Detection Solution
○ Built for detecting credit card fraud
○ Rules are extensible with customized Siddhi execution plans for any type of fraud detection
○ Currently uses Real-time and Interactive Analytics features
● WSO2 Log Analytics Solution
○ Distributed indexing and searching of any type of logs stored in the system
○ Notification support with Real-time event processing features
○ Application / Server health prediction with Machine Learning
○ Uses Interactive + Real-time Analytics + Machine Learning features
Source: www.retrospective.centeractive.com
Source: multichannelmerchant.com
Deployment
Minimum HA Deployment for DAS
2-node deployment
Use an RDBMS to store data
To scale higher, use HBase/Cassandra
Minimum HA Deployment for CEP
Minimum 2 nodes
Maximum throughput equals the throughput of one node
Minimum HA Deployment for ML
Minimum 1 node
Questions?
