Privileged and confidential
Open Blueprint for Real-Time Analytics
with In-Stream Processing
Victoria Livschitz, Founder & CTO, Grid Dynamics
09/28/2017
Business Need
About the speaker: Victoria Livschitz
CTO @Grid Dynamics: present
Founder and CEO @Grid Dynamics: 2006 – 2013
Principal engineer @Sun: 1997 – 2006
About Grid Dynamics:
Engineering IT services company focused on digital transformation through cloud, big data & open source for Fortune 500 clients.
Pioneer in real-time processing from company’s inception in 2006.
Architected 3 out of top-10 busiest e-commerce sites. Never had production outage in peak season.
Frequent contributor to open source projects: Hadoop, Solr, Lucene, Storm, others.
Agenda
• What is “real-time” in analytics, and why it matters
• In-Stream Processing: emerging platform for real-time processing
• Open ISP blueprint: reference architecture, reference implementation
• Take ISP for a spin: reference demo of real-time Twitter sentiment analysis
What is “real-time”, anyways?
What is “real-time” in analytics, machine learning, data sciences & AI?
[Diagram: an Event enters a cycle of Receive event, Analyze event, Act on event (producing a Response) and Augment model; the cycle is labeled Learning and Analysis and placed on a time scale of Weeks, Days, Hours, Seconds]
How long is the cycle?
What is done online vs. offline?
Value of “real-time”
1. Offline learning/analytics, online response
• Online path (a few seconds): Event, Receive event, Act on event, Response
• Offline path (a day): Receive event, Analyze event, Augment model, Modify reaction
2. Offline learning, real-time analytics, online response
• Online path (a few seconds): Event, Receive event, Analyze event, Act on event, Response
• Offline path (a day): Receive event, Augment model
3. Real-time learning/analytics, online response
• Online path (a few seconds): Event, Receive event, Analyze event, Act on event, Augment model, Response
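The practical difference between offline and real-time learning is when the model update happens relative to the response. The sketch below is only an illustration under stated assumptions: `RunningMeanModel` is a hypothetical toy model (a running average), not anything from the blueprint, and events are plain numbers.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RunningMeanModel:
    """Hypothetical toy model: remembers the running mean of the values seen."""
    count: int = 0
    mean: float = 0.0

    def update(self, value: float) -> None:
        # Incremental mean update; no need to keep past events around.
        self.count += 1
        self.mean += (value - self.mean) / self.count

    def score(self, value: float) -> float:
        # "Analysis": how far the event is from what the model has learned.
        return value - self.mean


def offline_learning(events: List[float], model: RunningMeanModel) -> List[float]:
    """Respond to every event with a frozen model, then learn in one batch."""
    responses = [model.score(v) for v in events]
    for v in events:  # stands in for the once-a-day batch job
        model.update(v)
    return responses


def realtime_learning(events: List[float], model: RunningMeanModel) -> List[float]:
    """Respond to each event and fold it into the model immediately."""
    responses = []
    for v in events:
        responses.append(model.score(v))
        model.update(v)  # the next event already sees this one
    return responses
```

With the same three events, the offline variant scores all of them against the initial model, while the real-time variant scores each event against everything seen so far.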
Where real-time matters in retail?
Classification of retail use cases relative to “real-timeness”
Example: Personalized Offers
Level 1: Segmented historic context: data on what happened to all such customers before. From time to time, send a coupon based on a segment.
Level 2: Individualized historic context: 360-degree view across personal data. On a birthday, offer a coupon based on personal history.
Level 3: Situational context: where customer is, what she wants, or might buy, right now. Right now, offer a product based on what’s in her hands.
Level 4: Supply chain dynamics: demand surge, product availability, competitive pricing. During a storm, deliver trending umbrella/poncho combo.
Classification of retail use cases relative to “real-timeness”
Level 1: Segmented historic context: data on what happened to all such customers before. Data: historic aggregated data.
Level 2: Individualized historic context: 360-degree view across individual’s data. Data: historic individual’s data.
Level 3: Situational context: where customer is, what she wants, or might buy, right now. Data: real-time individual’s data.
Level 4: Supply/demand dynamics: impact of demand surge, shortage, competitive actions... Data: real-time everything.
The levels range from “suited for offline ML” (historic data) to “requires real-time ML” (real-time data).
Top 6 drivers of real-time applications in retail
#1. Personalized search
Augment search hits and relevancy ranking based on personal context & history
#2. Personalized offers
Motivate “buy now” behavior by offering deals based on personal context & history
#3. Dynamic pricing
Determine “right price” for products based on availability, trending, personal context & competitive price
#4. Dynamic inventory
Predict inventory needs & re-stock products in stores based on fluctuations in inventory & demand
#5. Intelligent sourcing
Determine what order to source from what store to optimize delivery SLAs & shipment costs
#6. Real-time alerts
Detect unusual patterns: fraud, surge in demand, weather changes, shift in brand sentiment. Respond right away
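As an illustration of the real-time alerts driver, a surge in demand can be flagged by comparing each new measurement with a short sliding window of recent ones. This is a minimal sketch, not the method used in the blueprint: the `SurgeDetector` name, the window size and the z-score threshold are all assumptions chosen for the example.

```python
from collections import deque
from statistics import mean, pstdev


class SurgeDetector:
    """Hypothetical detector: flags values far above the recent average."""

    def __init__(self, window: int = 5, threshold: float = 3.0):
        self.history = deque(maxlen=window)  # most recent measurements only
        self.threshold = threshold           # z-score that counts as a surge

    def observe(self, value: float) -> bool:
        """Return True if `value` is a surge relative to the current window."""
        alert = False
        if len(self.history) == self.history.maxlen:
            mu = mean(self.history)
            sigma = pstdev(self.history)
            # A perfectly flat window (sigma == 0) never raises an alert here.
            if sigma > 0 and (value - mu) / sigma > self.threshold:
                alert = True
        self.history.append(value)
        return alert


if __name__ == "__main__":
    detector = SurgeDetector(window=5, threshold=3.0)
    orders_per_minute = [10, 12, 11, 9, 10, 11, 50]
    print([detector.observe(v) for v in orders_per_minute])
```

The detector stays silent until the window is full, so the first alerts can only appear after `window` observations.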
Emerging platform for real-time analytics: In-Stream Processing (ISP)
In a complex landscape of Big Data systems, an In-Stream Processing (ISP) service is an approach to building real-time extensions of Big Data applications.
[Diagram: landscape of Big Data systems, with a callout labeled “Today’s focus”]
ISP is ideal for:
• Real-time data ingress to replace batch ETLs
• Real-time identification of one-in-a-million “actionable insights”
• Real-time response to actionable insights
• Real-time learning from new data
Conceptual architecture
ISP pipelines: complex behavior with simple steps
Easy to write, change or add a step
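The idea of composing complex behavior from simple steps can be sketched as a list of small functions applied to each event in turn. This is only an illustration, assuming events are plain dictionaries; `run_pipeline` and the three steps are hypothetical names, not part of the blueprint.

```python
from typing import Callable, Dict, Iterable, Iterator, Optional, Sequence

Event = Dict[str, object]
# A step transforms an event, or returns None to drop it from the stream.
Step = Callable[[Event], Optional[Event]]


def run_pipeline(events: Iterable[Event], steps: Sequence[Step]) -> Iterator[Event]:
    """Push each event through every step, in order, one event at a time."""
    for event in events:
        current: Optional[Event] = event
        for step in steps:
            current = step(current)
            if current is None:  # a step filtered the event out
                break
        if current is not None:
            yield current


def normalize(event: Event) -> Event:
    return {**event, "text": str(event["raw"]).strip().lower()}


def drop_empty(event: Event) -> Optional[Event]:
    return event if event["text"] else None


def tag_length(event: Event) -> Event:
    return {**event, "length": len(str(event["text"]))}


if __name__ == "__main__":
    stream = [{"raw": "  Hello "}, {"raw": "   "}]
    for out in run_pipeline(stream, [normalize, drop_empty, tag_length]):
        print(out)
```

Adding, removing or reordering a step only changes the list passed to `run_pipeline`; the steps themselves stay independent of each other.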
ISP marketplace: build vs. buy
Grid Dynamics open blueprint for ISP
Blueprint goals
• Scalable to 100,000+/second
• Real-time streaming; real-time ML
• Cloud-ready
• Proven for mission-critical use
• Open source (and built 100% with open source)
• Production-ready
• Portable across clouds
• Extendable
Selected stack for ISP blueprint
• REST API
• Message Queue
• HDFS
• Other
Designed as a complete platform
• No single points of failure
• No bottlenecks
• Built-in scaling
• Dockerized
• Deployable to any cloud
• Reference implementation for AWS (open source)
• Reference demo: real-time Twitter sentiment analytics for new movie reviews
ISP reference implementation: fully-automated DevOps stack for running ISP on any modern cloud
How to achieve cloud portability?
• Phase 1: bootstrap management cluster
  • [manual] Choose a cloud. Get a set of VMs (6) to host the management cluster
  • [automated] Deploy & configure Mesos/Marathon cluster on available VMs
• Phase 2: use management cluster to provision ISP environments
  • [automated] Deploy all ISP components as Docker containers
  • [automated] Deploy analytics application components (like Twitter API)
  • [automated] Configure all dependencies
  • [automated] Scale on-demand
  • [automated] Shut down when done
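For Phase 2, each Dockerized component could be described to Marathon as a JSON app definition. The sketch below only builds such a definition and does not contact a cluster. It assumes Marathon's app definition fields (`id`, `instances`, `cpus`, `mem`, `container`); the app id and image name are hypothetical, and the slides do not show the blueprint's actual automation scripts.

```python
import json


def marathon_app(app_id: str, image: str, instances: int = 1,
                 cpus: float = 0.5, mem_mb: int = 512) -> dict:
    """Build a Marathon app definition for one Docker-packaged component."""
    return {
        "id": app_id,
        "instances": instances,
        "cpus": cpus,
        "mem": mem_mb,
        "container": {
            "type": "DOCKER",
            "docker": {"image": image},
        },
    }


def scale(app: dict, instances: int) -> dict:
    """Return a copy of the definition with a new instance count."""
    if instances < 0:
        raise ValueError("instances must be non-negative")
    return {**app, "instances": instances}


def to_json(app: dict) -> str:
    """Serialize the definition, e.g. as the body of a request to Marathon."""
    return json.dumps(app, indent=2, sort_keys=True)


if __name__ == "__main__":
    # Hypothetical component name and image, for illustration only.
    app = marathon_app("/isp/stream-processor", "example/stream-processor:latest")
    print(to_json(scale(app, 3)))
```

Scaling on demand then amounts to submitting the same definition with a different `instances` value, and shutting down to scaling to zero or deleting the app.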
Topology with Twitter data analytics demo
“Take ISP for a spin” demo: Real-time Twitter sentiment analytics for new movie reviews
Real-time demo, a.k.a. “Data Science Kitchen”
• Provide reference example on how to use ISP platform…
• ... and learn the basics of data science along the way
• Gets actual Twitter data via streaming API
• Analyses & visualizes what people think about latest movies
• Exposes data science “kitchen”: models, training sets, dictionaries
• Provides nice web UI to play with data
• Uses our ISP RI (reference implementation)
• Demo is running on AWS as a public service
• Everything is open sourced
• Documentation on our Tech Blog
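To give a feel for what a dictionary-based part of such a “kitchen” might look like, here is a toy lexicon scorer. It is a sketch under loud assumptions: the word lists, the `sentiment` and `tally` functions and the sample tweets are all invented for illustration, and the demo's real models and dictionaries are not described in these slides.

```python
import re
from collections import Counter
from typing import Dict, List

# Hypothetical toy dictionaries; a real lexicon would be far larger.
POSITIVE = {"great", "love", "amazing", "brilliant"}
NEGATIVE = {"boring", "awful", "hate", "terrible"}


def sentiment(text: str) -> str:
    """Label a tweet by counting positive and negative dictionary words."""
    words = re.findall(r"[a-z']+", text.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def tally(tweets_by_movie: Dict[str, List[str]]) -> Dict[str, Counter]:
    """Count sentiment labels per movie, ready to be charted."""
    return {
        movie: Counter(sentiment(tweet) for tweet in tweets)
        for movie, tweets in tweets_by_movie.items()
    }


if __name__ == "__main__":
    sample = {"Movie A": ["I LOVE this movie, amazing!", "So boring and awful"]}
    print(tally(sample))
```

A scorer like this ignores negation and sarcasm, which is one reason a production pipeline would rely on trained models as well as dictionaries.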
Demo app: pick movies you want to monitor
Compare different views on data
Compare trending between different movies
Examples of positive & negative Carrie Fisher tweets
Where to learn more
1. Read our blog: blog.griddynamics.com
• 7-part blog series on ISP
• 7-part blog series on Data Science Kitchen
2. Play with our demo
• http://apps.griddynamics.com/realtime-twitter-sentiment-analysis-example
3. Connect
• Twitter: @griddynamics
• Subscribe to our blog
• Drop email: info@griddynamics.com
