SlideShare a Scribd company logo
11
Analytics with NoSQL: why? for what? and when?
Edouard Servan-Schreiber, Ph.D.
Director for Solution Architecture
10gen
2
What is Analytics?
2
• Alerting
– Let me know when a cell tower has failed
• Getting insights - Strategic Analytics
– Churn rates, Customer segment distribution
• Transforming, Enriching, Aggregating
– Identifying faces in videos and images
– Identifying voices in recordings
• Operating smarter
– Having a pre-approved offer for a customer who calls after he expressed
interest on the web
• Analytics-driven actions in real-time
– Smart modeling integrating real time context
– This customer has lower status but suffered multiple delays in past
month, and should have priority over this higher status customer right
now on this flight
3
Why is this hard?
• Lots of data
– but few eyes and slow brains
• Lots of data
– just as many formats
• Lots of data
– many owners with unaligned interests and concerns
• Can you get your analysis in a useful timeframe?
• Can you make improvements in a useful timeframe?
• When you get new data, how fast can you do something with it?
• The more DATA you have, the easier it is to get lost in it...
• Data is useful only if it allows you to CHANGE the way you run your activity
– this is a surprisingly useful litmus test
• Any change requires measurement to make sure it helps
– this is a remarkably effective test to identify analytical organizations
3
4
Seven vital success areas
CRISP-DM methodology
Data
4
Data
Many Data Sources and Schemas
Hard to Integrate
Keeps evolving
Acting on “real time”
data
Is particularly hard
5
Collaborative Filtering
“Those who saw this also liked this….”
• Real time continuous updates of the user-product matrix
to make up-to-date predictions
5
6
Credit Card Fraud
Complex Event Processing
• Each transaction must be approved in a
matter of seconds. Each step, the relevant
authority must decide in real-time whether
the transaction is suspicious enough to
warrant an alert, refuting the transaction
6
Approaches to Model Scoring
88
Once you have built insights, the hard part is turning those insights into
money making actions through a multitude of field systems
Actions are taken in field systems….
DWHSensor Store
Order Store
Inventory Mgmt
Warranty Mgmt
Customer Portal
Analytical Store
Data is built here and action is taken here
Long running batch
analysis
Development of
Stats Models
Integration of
Enterprise Data
99
• Once you have built insights, the hard part is turning those insights
into money making actions through a multitude of field systems
Actions are taken in field systems….
DWHSensor Store
Order Store
Inventory Mgmt
Warranty Mgmt
Customer Portal
Analytical Store
Data is built here and action is taken here
BIG
ETL
Mess
1010
Once you have built insights, the hard part is turning those insights into
money making actions through a multitude of field systems
Actions are taken in field systems….
DWH
Sensor Store
Order Store
Inventory Mgmt
Warranty Mgmt
Customer Portal
Analytical Store
Operational Pre-aggregation
BIG Moveable
Normal
ETL
Mess
1111
MongoDB Strategic Advantages
Horizontally Scalable
-Sharding
Agile
Flexible
High Performance
Strong Consistency
Application
Highly
Available
-Replica Sets
{ author: “roger”,
date: new Date(),
text: “Spirited Away”,
tags: [“Tezuka”, “Manga”]}
+Aggregation
Framework
+MapReduce
Framework
1212
Document-oriented data model (JSON-Style)
{
_id : ObjectId("4c4ba5c0672c685e5e8aabf3"),
model: ”101 jet engine",
date : ISODate(“24-07-2010”),
purchaser: “Emirates”,
aircraft: {
type: “Boeing 747-400”,
first_flight: ISODate(“01-11-2010”)
registration: 3467892
}
manufacturing_plant: 8374
parts : [
{ partid: 132467589648762348765,
description: “blade”,
source: “some vendor”,
.....
},
{ partid: 9584352845569846,
description: “injector”,
source: “some vendor”,
.....
},
sensor_list: [sensorid1, sensoriid2, sensorid3,....]
}
www.bsonspec.org
13
Use Cases
• Retail:
– Price Optimization
• Utilities and Manufacturing:
– Using smart meter data, optimizing the flow of
electrical power to maximize yield and usage
– Sensor data from vehicles to build truck fleet analytics
in real time
• Telco:
– Geo-based advertising, delivering relevant ads based
on interest and locality
– Smart call routing taking into account saturated cell
towers and customer value
13
14
Use Cases
• Gov: City of Chicago (WindyGrid)
– Based on reports of maintenance needs (e.g.
broken streetlights), dispatching police in
targeted ways to reduce crime
• Financial Services: MetLife (The Wall)
– Moving from a policy centric view to a
customer centric view, enabling informed
upsell and cross sell offers based on historical
analysis and recent activity
14
15
How does MongoDB help for these?
• Agility to compute and aggregate in place
– All
• Agility to add new data to existing schema
– Price Optimization
• High scalable performance to ingest
operational data
– Sensor data
• High scalable performance to serve
operational analytics
– Metlife, Telco 15
16
NoSQL and Analytics
16
Tech Dev Time
Exec
latency
Exec
Power
Data
Transfer
Functional
Depth
Hadoop * * ***** ** *****
MongoDB ***** ***** *** ***** **
Cassandra
with
Hadoop
* * ***** ***** *****
DWH *** ***** ***** ** ****
SAS ***** ***** ** * *****
17
Conclusions
• Analytics are no longer just batch
• Analytics requires integrating the real time
context
• Big Data is putting pressure to process
data where it lands
• New sources and forms of data are making
it difficult to stick to RDBMS rigidity
• MongoDB can help you
17

More Related Content

PPTX
geniSIGHTS Mini
PDF
geniSIGHTS offerings on Retail- etail Latest
PDF
geniSIGHTS offerings on Travel - Latest
PPTX
Yellowbrick MicroStrategy webcast
PDF
Next-Gen Cloud Analytics with AWS, Big Data and Data Virtualization
PDF
Intelie's Overview - How much could your company lose in a matter of minutes?
PDF
Tom Martens - Cube Ware - The big data challenge - bo
PDF
Demystifying AI-chatbots Just add CUI to your business apps
geniSIGHTS Mini
geniSIGHTS offerings on Retail- etail Latest
geniSIGHTS offerings on Travel - Latest
Yellowbrick MicroStrategy webcast
Next-Gen Cloud Analytics with AWS, Big Data and Data Virtualization
Intelie's Overview - How much could your company lose in a matter of minutes?
Tom Martens - Cube Ware - The big data challenge - bo
Demystifying AI-chatbots Just add CUI to your business apps

What's hot (20)

PDF
Data Science for Finance
PPTX
Data science in finance industry
PPTX
Text analytics opportunities in the Insurance domain
PDF
DWS17 - Plenary Session : Big technological bets - Anukool LAKIHINA - Guavus
PDF
Denodo Datafest 2017 London Tekin Mentes Logitech
PDF
Gopalakrishna: big data consultant
PDF
QuanTemplate-Underwriting-Performance
PDF
Transformation of Sales and Marketing by Rene van der Laan
PDF
EVAM_Streaming Analytics_v1.5
PDF
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
PDF
QuanTemplate-data-managment
PPTX
Internet of things & predictive analytics
PPTX
#gaucbe - Closing the loop between your Analytics and marketing tools
PDF
16h00 globant - aws globant-big-data_summit2012
PDF
Big Data Analytics - GTech Seminar
PDF
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
PPTX
Impact of big data on DCMI market
PDF
G&S QUOTIENT
PPTX
Daten getriebene Service Intelligence mit Splunk ITSI
PPTX
Why MicroStrategy
Data Science for Finance
Data science in finance industry
Text analytics opportunities in the Insurance domain
DWS17 - Plenary Session : Big technological bets - Anukool LAKIHINA - Guavus
Denodo Datafest 2017 London Tekin Mentes Logitech
Gopalakrishna: big data consultant
QuanTemplate-Underwriting-Performance
Transformation of Sales and Marketing by Rene van der Laan
EVAM_Streaming Analytics_v1.5
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
QuanTemplate-data-managment
Internet of things & predictive analytics
#gaucbe - Closing the loop between your Analytics and marketing tools
16h00 globant - aws globant-big-data_summit2012
Big Data Analytics - GTech Seminar
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
Impact of big data on DCMI market
G&S QUOTIENT
Daten getriebene Service Intelligence mit Splunk ITSI
Why MicroStrategy
Ad

Viewers also liked (6)

PDF
Webinar: NoSQL as the New Normal
PPTX
Big data webinar-series-pt5 v2
PPT
Tricks
KEY
Scaling with MongoDB
PPTX
Why mongo db was created - Dwight Merriman - MongoSF 2011
KEY
2011 mongo sf-scaling
Webinar: NoSQL as the New Normal
Big data webinar-series-pt5 v2
Tricks
Scaling with MongoDB
Why mongo db was created - Dwight Merriman - MongoSF 2011
2011 mongo sf-scaling
Ad

Similar to Webinar: Analytics with NoSQL: Why, for What, and When? (20)

PDF
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
PDF
The Age of Big Data: A New Class of Economic Asset
PDF
A technical Introduction to Big Data Analytics
PPTX
Assessing New Databases– Translytical Use Cases
PPTX
Finance and Accounting BPM
PDF
Big data beyond the hype may 2014
PPTX
INTERNET OF THINGS On data acquisition m2m systems
PDF
Analytics&IoT
PPTX
Bangalore Executive Seminar 2015: MongoDB - Your database of choice for real ...
PPTX
Big data analytics and machine intelligence v5.0
PPT
CS8091_BDA_Unit_I_Analytical_Architecture
PPTX
TCS Point of View Session - Analyze by Dr. Gautam Shroff, VP and Chief Scient...
PPTX
000 introduction to big data analytics 2021
PDF
The sensor data challenge - Innovations (not only) for the Internet of Things
PDF
SuanIct-Bigdata desktop-final
PPTX
Real Time Analytics
PPTX
WebAction In-Memory Computing Summit 2015
PDF
Big Data : Risks and Opportunities
PDF
How Can Analytics Improve Business?
PDF
Data Analysis by Multimedia University
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
The Age of Big Data: A New Class of Economic Asset
A technical Introduction to Big Data Analytics
Assessing New Databases– Translytical Use Cases
Finance and Accounting BPM
Big data beyond the hype may 2014
INTERNET OF THINGS On data acquisition m2m systems
Analytics&IoT
Bangalore Executive Seminar 2015: MongoDB - Your database of choice for real ...
Big data analytics and machine intelligence v5.0
CS8091_BDA_Unit_I_Analytical_Architecture
TCS Point of View Session - Analyze by Dr. Gautam Shroff, VP and Chief Scient...
000 introduction to big data analytics 2021
The sensor data challenge - Innovations (not only) for the Internet of Things
SuanIct-Bigdata desktop-final
Real Time Analytics
WebAction In-Memory Computing Summit 2015
Big Data : Risks and Opportunities
How Can Analytics Improve Business?
Data Analysis by Multimedia University

More from MongoDB (20)

PDF
MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
PDF
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
PDF
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
PDF
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
PDF
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
PDF
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
PDF
MongoDB SoCal 2020: MongoDB Atlas Jump Start
PDF
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
PDF
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
PDF
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
PDF
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
PDF
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
PDF
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
PDF
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
PDF
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
PDF
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
PDF
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
PDF
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
PDF
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
PDF
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...
MongoDB SoCal 2020: Migrate Anything* to MongoDB Atlas
MongoDB SoCal 2020: Go on a Data Safari with MongoDB Charts!
MongoDB SoCal 2020: Using MongoDB Services in Kubernetes: Any Platform, Devel...
MongoDB SoCal 2020: A Complete Methodology of Data Modeling for MongoDB
MongoDB SoCal 2020: From Pharmacist to Analyst: Leveraging MongoDB for Real-T...
MongoDB SoCal 2020: Best Practices for Working with IoT and Time-series Data
MongoDB SoCal 2020: MongoDB Atlas Jump Start
MongoDB .local San Francisco 2020: Powering the new age data demands [Infosys]
MongoDB .local San Francisco 2020: Using Client Side Encryption in MongoDB 4.2
MongoDB .local San Francisco 2020: Using MongoDB Services in Kubernetes: any ...
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
MongoDB .local San Francisco 2020: From SQL to NoSQL -- Changing Your Mindset
MongoDB .local San Francisco 2020: MongoDB Atlas Jumpstart
MongoDB .local San Francisco 2020: Tips and Tricks++ for Querying and Indexin...
MongoDB .local San Francisco 2020: Aggregation Pipeline Power++
MongoDB .local San Francisco 2020: A Complete Methodology of Data Modeling fo...
MongoDB .local San Francisco 2020: MongoDB Atlas Data Lake Technical Deep Dive
MongoDB .local San Francisco 2020: Developing Alexa Skills with MongoDB & Golang
MongoDB .local Paris 2020: Realm : l'ingrédient secret pour de meilleures app...
MongoDB .local Paris 2020: Upply @MongoDB : Upply : Quand le Machine Learning...

Recently uploaded (20)

PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Encapsulation theory and applications.pdf
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
Machine Learning_overview_presentation.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Electronic commerce courselecture one. Pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Big Data Technologies - Introduction.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
gpt5_lecture_notes_comprehensive_20250812015547.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
The Rise and Fall of 3GPP – Time for a Sabbatical?
Encapsulation theory and applications.pdf
A comparative analysis of optical character recognition models for extracting...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Machine Learning_overview_presentation.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
Electronic commerce courselecture one. Pdf
Network Security Unit 5.pdf for BCA BBA.
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Big Data Technologies - Introduction.pptx
A Presentation on Artificial Intelligence
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton

Webinar: Analytics with NoSQL: Why, for What, and When?

  • 1. 11 Analytics with NoSQL: why? for what? and when? Edouard Servan-Schreiber, Ph.D. Director for Solution Architecture 10gen
  • 2. 2 What is Analytics? 2 • Alerting – Let me know when a cell tower has failed • Getting insights - Strategic Analytics – Churn rates, Customer segment distribution • Transforming, Enriching, Aggregating – Identifying faces in videos and images – Identifying voices in recordings • Operating smarter – Having a pre-approved offer for a customer who calls after he expressed interest on the web • Analytics-driven actions in real-time – Smart modeling integrating real time context – This customer has lower status but suffered multiple delays in past month, and should have priority over this higher status customer right now on this flight
  • 3. 3 Why is this hard? • Lots of data – but few eyes and slow brains • Lots of data – just as many formats • Lots of data – many owners with unaligned interests and concerns • Can you get your analysis in a useful timeframe? • Can you make improvements in a useful timeframe? • When you get new data, how fast can you do something with it? • The more DATA you have, the easier it is to get lost in it... • Data is useful only if it allows you to CHANGE the way you run your activity – this is a surprisingly useful litmus test • Any change requires measurement to make sure it helps – this is a remarkably effective test to identify analytical organizations 3
  • 4. 4 Seven vital success areas CRISP-DM methodology Data 4 Data Many Data Sources and Schemas Hard to Integrate Keeps evolving Acting on “real time” data Is particularly hard
  • 5. 5 Collaborative Filtering “Those who saw this also liked this….” • Real time continuous updates of the user-product matrix to make up-to-date predictions 5
  • 6. 6 Credit Card Fraud Complex Event Processing • Each transaction must be approved in a matter of seconds. Each step, the relevant authority must decide in real-time whether the transaction is suspicious enough to warrant an alert, refuting the transaction 6
  • 8. 88 Once you have built insights, the hard part is turning those insights into money making actions through a multitude of field systems Actions are taken in field systems…. DWHSensor Store Order Store Inventory Mgmt Warranty Mgmt Customer Portal Analytical Store Data is built here and action is taken here Long running batch analysis Development of Stats Models Integration of Enterprise Data
  • 9. 99 • Once you have built insights, the hard part is turning those insights into money making actions through a multitude of field systems Actions are taken in field systems…. DWHSensor Store Order Store Inventory Mgmt Warranty Mgmt Customer Portal Analytical Store Data is built here and action is taken here BIG ETL Mess
  • 10. 1010 Once you have built insights, the hard part is turning those insights into money making actions through a multitude of field systems Actions are taken in field systems…. DWH Sensor Store Order Store Inventory Mgmt Warranty Mgmt Customer Portal Analytical Store Operational Pre-aggregation BIG Moveable Normal ETL Mess
  • 11. 1111 MongoDB Strategic Advantages Horizontally Scalable -Sharding Agile Flexible High Performance Strong Consistency Application Highly Available -Replica Sets { author: “roger”, date: new Date(), text: “Spirited Away”, tags: [“Tezuka”, “Manga”]} +Aggregation Framework +MapReduce Framework
  • 12. 1212 Document-oriented data model (JSON-Style) { _id : ObjectId("4c4ba5c0672c685e5e8aabf3"), model: ”101 jet engine", date : ISODate(“24-07-2010”), purchaser: “Emirates”, aircraft: { type: “Boeing 747-400”, first_flight: ISODate(“01-11-2010”) registration: 3467892 } manufacturing_plant: 8374 parts : [ { partid: 132467589648762348765, description: “blade”, source: “some vendor”, ..... }, { partid: 9584352845569846, description: “injector”, source: “some vendor”, ..... }, sensor_list: [sensorid1, sensoriid2, sensorid3,....] } www.bsonspec.org
  • 13. 13 Use Cases • Retail: – Price Optimization • Utilities and Manufacturing: – Using smart meter data, optimizing the flow of electrical power to maximize yield and usage – Sensor data from vehicles to build truck fleet analytics in real time • Telco: – Geo-based advertising, delivering relevant ads based on interest and locality – Smart call routing taking into account saturated cell towers and customer value 13
  • 14. 14 Use Cases • Gov: City of Chicago (WindyGrid) – Based on reports of maintenance needs (e.g. broken streetlights), dispatching police in targeted ways to reduce crime • Financial Services: MetLife (The Wall) – Moving from a policy centric view to a customer centric view, enabling informed upsell and cross sell offers based on historical analysis and recent activity 14
  • 15. 15 How does MongoDB help for these? • Agility to compute and aggregate in place – All • Agility to add new data to existing schema – Price Optimization • High scalable performance to ingest operational data – Sensor data • High scalable performance to serve operational analytics – Metlife, Telco 15
  • 16. 16 NoSQL and Analytics 16 Tech Dev Time Exec latency Exec Power Data Transfer Functional Depth Hadoop * * ***** ** ***** MongoDB ***** ***** *** ***** ** Cassandra with Hadoop * * ***** ***** ***** DWH *** ***** ***** ** **** SAS ***** ***** ** * *****
  • 17. 17 Conclusions • Analytics are no longer just batch • Analytics requires integrating the real time context • Big Data is putting pressure to process data where it lands • New sources and forms of data are making it difficult to stick to RDBMS rigidity • MongoDB can help you 17