#MDBlocal
DATA ANALYTICS /
UNDERSTANDING YOUR DATA
SEATTLE
WIFI: MONGODB PW: mdb
SAM WEAVER
PRODUCT MANAGER, MONGODB
@samuel_weaver
AGENDA
1. Background
2. The importance of data visualization
3. Methods for data visualization in MongoDB
4. Lunch
WHERE ARE WE NOW?
TERMINOLOGY
“Business Intelligence”
“Business Analytics”
ANALYTICS
DATA GROWTH IS EXPLOSIVE
• More data was created in the last 2 years than in the entire previous history of the human race
• By 2020: 1.7 MB per person every second
THE STATE OF ANALYTICS
• Analytics is big $!
• $130B in 2016
• $200B+ in 2020
• Less than 0.5% of data is analyzed and
used – imagine the potential!
EVOLUTION OF ANALYTICS
Timeline: 2012, 2015, 2016, 2018 (today)
2012:
• Dedicated reporting team
• Desktop access
• Hadoop
• Batch analytics
• On prem only
• Monthly reports
Today:
• Self service
• Mobile access
• Spark
• Real time analytics
• On-prem and cloud
• On demand reporting
IMPORTANCE OF DATA VIZ
EARLY DATA VISUALIZATIONS
Charles Minard (1869): Napoleon’s march on and retreat from Moscow in 1812.
                             I             II           III            IV
                            X      Y      X      Y      X      Y      X      Y
                           10   8.04     10   9.14     10   7.46      8   6.58
                            8   6.95      8   8.14      8   6.77      8   5.76
                           13   7.58     13   8.74     13  12.74      8   7.71
                            9   8.81      9   8.77      9   7.11      8   8.84
                           11   8.33     11   9.26     11   7.81      8   8.47
                           14   9.96     14   8.10     14   8.84      8   7.04
                            6   7.24      6   6.13      6   6.08      8   5.25
                            4   4.26      4   3.10      4   5.39     19  12.50
                           12  10.84     12   9.13     12   8.15      8   5.56
                            7   4.82      7   7.26      7   6.42      8   7.91
                            5   5.68      5   4.74      5   5.73      8   6.89
Mean                     9.00   7.50   9.00   7.50   9.00   7.50   9.00   7.50
Variance (population)   10.00   3.75  10.00   3.75  10.00   3.75  10.00   3.75
Correlation (Pearson)          0.816         0.816         0.816         0.817
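The summary rows can be checked with a few lines of Python. The sketch below recomputes the mean, population variance, and Pearson correlation for each of the four data sets (commonly known as Anscombe's quartet) using only the standard library. The names `DATASETS`, `pearson`, and `summarize` are illustrative and do not come from the slides.

```python
from math import sqrt
from statistics import mean, pvariance

# The four (x, y) data sets from the table. Sets I-III share the same x values.
X_COMMON = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
DATASETS = {
    "I": (X_COMMON, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X_COMMON, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X_COMMON, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    spread_x = sum((x - mx) ** 2 for x in xs)
    spread_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(spread_x * spread_y)


def summarize(xs, ys):
    """Return the summary statistics shown in the table's last three rows."""
    return {
        "mean_x": mean(xs),
        "mean_y": mean(ys),
        "var_x": pvariance(xs),  # population variance, as labelled in the table
        "var_y": pvariance(ys),
        "corr": pearson(xs, ys),
    }


if __name__ == "__main__":
    for name, (xs, ys) in DATASETS.items():
        s = summarize(xs, ys)
        print(f"{name:>3}: mean=({s['mean_x']:.2f}, {s['mean_y']:.2f}) "
              f"var=({s['var_x']:.2f}, {s['var_y']:.2f}) r={s['corr']:.3f}")
```

The four sets are numerically almost indistinguishable, so only a plot reveals how different they are.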
The Path to Truly Understanding Your MongoDB Data
SO YOU WANT TO VISUALIZE?
THINGS TO THINK ABOUT
• Use the correct architecture
• Determine what your needs are
• Multiple data sources?
• Huge amounts of complex data?
• Quick self service?
• Choose the right solution for you
ARCHITECTURE FOR ANALYTICS
HIDDEN REPLICAS
• Hidden secondaries maintain a copy of the primary's data set
• Hidden secondaries are used for workloads with different access patterns
• Cannot become primary
[Diagram: a client talks to a replica set with one primary and four secondaries; one secondary is configured with priority 0 (P=0) and Hidden=True and serves the analytics workload]
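As a rough illustration of the P=0, Hidden=True setup, the sketch below builds a replica set configuration document with one extra hidden member. The `priority` and `hidden` member fields are real MongoDB replica set settings. The host names, the set name `rs0`, and the helper `add_hidden_analytics_member` are made up for this example. The function only manipulates a dictionary; applying the result to a live cluster (for example with `rs.reconfig()` in the shell) is left out.

```python
import copy


def add_hidden_analytics_member(config, host):
    """Return a copy of a replica set config with `host` added as a hidden member.

    The input config is not modified.
    """
    new_config = copy.deepcopy(config)
    next_id = max(member["_id"] for member in new_config["members"]) + 1
    new_config["members"].append({
        "_id": next_id,
        "host": host,
        "priority": 0,   # priority 0: this member can never be elected primary
        "hidden": True,  # hidden: not advertised to application drivers
    })
    # A reconfiguration is normally submitted with a higher version number.
    new_config["version"] = new_config.get("version", 1) + 1
    return new_config


if __name__ == "__main__":
    # Hypothetical starting configuration with three data-bearing members.
    current = {
        "_id": "rs0",
        "version": 1,
        "members": [
            {"_id": 0, "host": "db1.example.net:27017"},
            {"_id": 1, "host": "db2.example.net:27017"},
            {"_id": 2, "host": "db3.example.net:27017"},
        ],
    }
    updated = add_hidden_analytics_member(current, "analytics1.example.net:27017")
    for member in updated["members"]:
        print(member)
```

Because drivers do not route replica set reads to hidden members, an analytics tool would typically connect to that member directly, which keeps heavy reporting queries away from the application's nodes.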
TOOLING
BUILD YOUR OWN
• Pros
• Custom tailored solution: fits exactly as required!
• Cons
• High investment
• Maintenance
• Requires a deep understanding of the underlying tech and its language(s)
USE THE TOOLS WE GIVE YOU
MONGODB COMPASS
• Developer tool
• Data management and
manipulation
• Interesting schema analysis
• Used daily: a good first place to
start
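To give a feel for what schema analysis on a document collection involves, here is a small sketch that samples documents and reports which fields occur, how often, and with which types. It is not Compass's implementation, which the slides do not describe. The function name `analyze_schema` and the sample documents are invented for illustration.

```python
from collections import Counter, defaultdict


def analyze_schema(documents):
    """Summarize top-level fields across a sample of documents.

    Returns {field: {"presence": fraction of documents containing the field,
                     "types": {python type name: count}}}.
    """
    type_counts = defaultdict(Counter)
    total = 0
    for doc in documents:
        total += 1
        for field, value in doc.items():
            type_counts[field][type(value).__name__] += 1
    return {
        field: {
            "presence": sum(counts.values()) / total,
            "types": dict(counts),
        }
        for field, counts in type_counts.items()
    }


if __name__ == "__main__":
    # Hypothetical sample: "age" is stored as both int and str, "tags" is optional.
    sample = [
        {"name": "Ada", "age": 36, "tags": ["admin"]},
        {"name": "Bob", "age": "41"},
        {"name": "Cy", "age": 29},
        {"name": "Di", "age": 52, "tags": []},
    ]
    for field, info in analyze_schema(sample).items():
        print(field, info)
```

A report like this quickly surfaces mixed types and sparsely populated fields, which is the kind of question a schema view helps answer before building charts.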
WHEN TO USE
• Day-to-day development/operations
• Adding indexes
• Viewing server stats
• Data manipulation
• 10,000 ft to 1 ft view of data
BI CONNECTOR
• Visualize and explore MongoDB
data in SQL-based BI tools:
• Automatically discovers the schema
• Translates complex SQL statements
issued by the BI tool into MongoDB
aggregation queries
• Converts the results into a tabular
format for rendering inside the BI
tool
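The translate-then-tabulate idea can be sketched as follows. The aggregation pipeline is hand-written to mirror a simple SQL statement, using the standard correspondence of WHERE to `$match` and GROUP BY to `$group`. It is not actual BI Connector output, and the table and column names are hypothetical. The `flatten` and `to_table` helpers are likewise an assumption about what "tabular format" means here: nested sub-documents become dotted column names, and arrays are left untouched.

```python
# SQL a BI tool might issue (hypothetical table and column names):
#   SELECT status, COUNT(*) AS n FROM orders WHERE total > 100 GROUP BY status
# A hand-written aggregation pipeline with the same meaning:
EQUIVALENT_PIPELINE = [
    {"$match": {"total": {"$gt": 100}}},               # WHERE total > 100
    {"$group": {"_id": "$status", "n": {"$sum": 1}}},  # GROUP BY status, COUNT(*)
]


def flatten(doc, prefix=""):
    """Flatten nested sub-documents into a single-level dict with dotted keys."""
    row = {}
    for key, value in doc.items():
        name = f"{prefix}{key}"
        if isinstance(value, dict):
            row.update(flatten(value, prefix=f"{name}."))
        else:
            row[name] = value
    return row


def to_table(docs):
    """Turn documents into (columns, rows); missing fields become None."""
    flat = [flatten(doc) for doc in docs]
    columns = sorted({column for row in flat for column in row})
    rows = [[row.get(column) for column in columns] for row in flat]
    return columns, rows


if __name__ == "__main__":
    docs = [
        {"_id": 1, "customer": {"name": "Ada", "address": {"city": "Seattle"}}, "total": 120},
        {"_id": 2, "customer": {"name": "Bob"}, "total": 80},
    ]
    columns, rows = to_table(docs)
    print(columns)
    for row in rows:
        print(row)
```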
WHEN TO USE
• Multiple data sources (not just MongoDB)
• Business analysts
• Extremely powerful, but with a long ramp-up
MONGODB CHARTS
• Lightweight
• Intuitive
• Build visualizations on
MongoDB data (nested,
polymorphic)
• Share content in a dashboard
WHEN TO USE
• When you want quick answers
• No need to flatten / ETL your MongoDB data
• Self service for the technical audience
DEMO
LIFE CYCLE
1. Acquire
2. Prep
- Calcs
- Groups
- Data types
3. Visualize
- Bar
- Pie
- Line
4. Explore
- Dashboards
5. Share
- Export
- Collaborate
- Embed
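The first three stages of this life cycle can be mimicked in plain Python. This is a toy sketch, not how MongoDB Charts works internally. The records, field names, and the helpers `prep` and `bar_chart` are all invented for illustration. "Prep" covers a data type fix (strings to numbers), a group, and a calculation (sum); "visualize" renders a text bar chart.

```python
from collections import defaultdict


def prep(records, group_field, value_field):
    """Group records by one field and sum another, coercing values to float."""
    totals = defaultdict(float)
    for record in records:
        # float() handles values stored as strings (the "data types" step).
        totals[record[group_field]] += float(record[value_field])
    return dict(totals)


def bar_chart(totals, width=20):
    """Render totals as a horizontal text bar chart, scaled to the largest value."""
    peak = max(totals.values()) or 1  # avoid dividing by zero when all totals are 0
    lines = []
    for label, value in sorted(totals.items()):
        bar = "#" * round(width * value / peak)
        lines.append(f"{label:<8} {bar} {value:g}")
    return "\n".join(lines)


if __name__ == "__main__":
    # 1. Acquire: hypothetical records, with "sales" stored inconsistently.
    records = [
        {"region": "east", "sales": "10"},
        {"region": "west", "sales": 5},
        {"region": "east", "sales": 10},
    ]
    # 2. Prep, then 3. Visualize.
    print(bar_chart(prep(records, "region", "sales")))
```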
THANK YOU

Editor's Notes

  • #9: 96 DVDs per person per day
  • #14: One of the best statistical drawings ever made. It tells of an army of 400,000 marching on Moscow and returning with 10,000. It shows time and loss of life, routes, river crossings, etc.
  • #20: The eye can process 10 million bits per second, roughly the same as Ethernet.