SlideShare a Scribd company logo
simon@metabase.com

@sbelak
Exploratory analysis:
automation, augmentation,
and building tools for thought
Goal: answer 80% of
questions stemming from
data in <20min
The analytics chasm
2 min 20 min project
Ideal. Almost real-time. Can be
done during brainstorming
without disrupting the flow.
:(
Added to roadmapSqueeze in
somewhere
in the day
Doing our job better is
often a matter of speed
and quantity
Metabase ❤
github.com/metabase/metabase
• Open source BI/analytics tool

• Runs on-premise, data-agnostic

• 21k+ companies use us daily

• Focus on UX and friendliness

• Building a “data scientist in a box”
Metabase ❤
github.com/metabase/metabase
• Open source BI/analytics tool

• Runs on-premise, data-agnostic

• 21k+ companies use us daily

• Focus on UX and friendliness
• Building a “data scientist in a box”
From artificial intelligence
to augmenting human
intelligence
Flow
Affordances of our tools
shape how we approach
problems
Think in
distributions
http://www.onyxplatform.org/jekyll/update/2017/02/08/Pyroclast-Preview-Simulation.html
Flow v. Hub-and-spoke
Exploratory analysis
Factor out the
mechanical part of the
analysis
Nils, wrong encodings,
outliers, input errors
Every aggregated
metric is a mixture
of all your segments
A lot of problems are also
time series problems
(we just don’t treat them
as such)
Changes in distribution
a r e h i n t s a b o u t
underlying changes in
segment mixture
Declarative analysis:
Directing focus, rather
than specifying the steps
Building intelligence
Semantic + structural
model
REPL + notebook
+ Metabase?
Questions
P.S. We’re hiring!

More Related Content

PDF
Doing data science with clojure
PDF
Tools for building the future
PDF
Levelling up your data infrastructure
PPTX
DataCanvas: Big Data Analytic Flow in Cloud
PDF
Software Analytics for Pragmatists [DevOps Camp 2017]
PPTX
Beyond Data Discovery: The Value Unlocked by Modern Data Modeling
PPTX
Improving Data Modeling Workflow
PDF
Join 2017_Deep Dive_Integrating Looker with R and Python
Doing data science with clojure
Tools for building the future
Levelling up your data infrastructure
DataCanvas: Big Data Analytic Flow in Cloud
Software Analytics for Pragmatists [DevOps Camp 2017]
Beyond Data Discovery: The Value Unlocked by Modern Data Modeling
Improving Data Modeling Workflow
Join 2017_Deep Dive_Integrating Looker with R and Python

What's hot (20)

PDF
The 3 Insights Defining Modern Analytics
PDF
Applied Data Science Course Part 1: Concepts & your first ML model
PDF
Data Science: Good, Bad and Ugly by Irina Kukuyeva
PDF
Bridging the Gap: Analyzing Data in and Below the Cloud
PDF
Dataiku productive application to production - pap is may 2015
PDF
Data-driven design (UX Antwerp 24/09/19)
PPTX
When and Where to Embed Business Intelligence
PDF
PASS Summit Data Storytelling with R Power BI and AzureML
PPTX
Development Productivity for IBM i - Build an Efficient IT Department with AB...
PDF
Data Modeling in Looker
PDF
Neo4j on Microsoft Azure
PDF
Webinar - Patient Readmission Risk
PDF
Let's analyze how world reacts to road traffic by sentiment analysis final
PPTX
Webinar - Fraud Detection - Palombo (20160428)
PDF
Data science team (new version)
PPTX
Machine Learning with GraphLab Create
PPTX
Synapse NanoApps
PDF
Top BI trends and predictions for 2017
PPTX
Yellowbrick MicroStrategy webcast
PPTX
Krish Swamy + Balaji Gopalakrishnan, Wells Fargo - Building a World Class Dat...
The 3 Insights Defining Modern Analytics
Applied Data Science Course Part 1: Concepts & your first ML model
Data Science: Good, Bad and Ugly by Irina Kukuyeva
Bridging the Gap: Analyzing Data in and Below the Cloud
Dataiku productive application to production - pap is may 2015
Data-driven design (UX Antwerp 24/09/19)
When and Where to Embed Business Intelligence
PASS Summit Data Storytelling with R Power BI and AzureML
Development Productivity for IBM i - Build an Efficient IT Department with AB...
Data Modeling in Looker
Neo4j on Microsoft Azure
Webinar - Patient Readmission Risk
Let's analyze how world reacts to road traffic by sentiment analysis final
Webinar - Fraud Detection - Palombo (20160428)
Data science team (new version)
Machine Learning with GraphLab Create
Synapse NanoApps
Top BI trends and predictions for 2017
Yellowbrick MicroStrategy webcast
Krish Swamy + Balaji Gopalakrishnan, Wells Fargo - Building a World Class Dat...
Ad

Similar to Exploratory analysis (20)

PPTX
Into the Big Data Future with Watson Analytics
PDF
Continuum Analytics and Python
PPTX
Partner webinar presentation aws pebble_treasure_data
PDF
Rapid Product Design in the Wild, Agile 2013
PDF
DataOps - The Foundation for Your Agile Data Architecture
PDF
CWIN17 san francisco-ai implementation-pub
PPTX
M Chambers and RapidMiner Overview for Babson class
PPTX
Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...
PPTX
Data Engineering @ Patistic Innovations
PDF
How to make your data scientists happy
PDF
Scaling organic growth by building products - Turing Fest 2018
PPTX
Latest ppt work
PDF
The Right Data Warehouse: Automation Now, Business Value Thereafter
PPTX
Fabrizio Ballarini — Scaling Organic Growth by Building Products (Turing Fest...
PPTX
AI, The Enterprise, and You
PDF
Unlock the Future of Web Design with AI - Daniel Birch
PDF
Introducción al Machine Learning Automático
PPTX
Architecting for Big Data: Trends, Tips, and Deployment Options
PDF
Rapid Product Design In The Wild
Into the Big Data Future with Watson Analytics
Continuum Analytics and Python
Partner webinar presentation aws pebble_treasure_data
Rapid Product Design in the Wild, Agile 2013
DataOps - The Foundation for Your Agile Data Architecture
CWIN17 san francisco-ai implementation-pub
M Chambers and RapidMiner Overview for Babson class
Webinar with SnagAJob, HP Vertica and Looker - Data at the speed of busines s...
Data Engineering @ Patistic Innovations
How to make your data scientists happy
Scaling organic growth by building products - Turing Fest 2018
Latest ppt work
The Right Data Warehouse: Automation Now, Business Value Thereafter
Fabrizio Ballarini — Scaling Organic Growth by Building Products (Turing Fest...
AI, The Enterprise, and You
Unlock the Future of Web Design with AI - Daniel Birch
Introducción al Machine Learning Automático
Architecting for Big Data: Trends, Tips, and Deployment Options
Rapid Product Design In The Wild
Ad

More from Simon Belak (20)

PDF
The subtle art of recommendation
PDF
Metabase Ljubljana Meetup #2
PDF
Metabase lj meetup
PDF
Sketch algorithms
PDF
Transducing for fun and profit
PDF
Your metrics are wrong
PDF
Writing smart contracts the sane way
PDF
Online statistical analysis using transducers and sketch algorithms
PDF
Save the princess
PDF
Data driven going to market strategy
PDF
Spec: a lisp-flavoured type system
PDF
A data layer in clojure
PDF
Odkrivanje segmentov iz podatkov
PDF
Using Onyx in anger
PDF
Spec + onyx
PDF
Dao of lisp
PDF
Predicting the future with goopti
PDF
Living with-spec
PDF
Living with-spec
PDF
Doing data science with Clojure
The subtle art of recommendation
Metabase Ljubljana Meetup #2
Metabase lj meetup
Sketch algorithms
Transducing for fun and profit
Your metrics are wrong
Writing smart contracts the sane way
Online statistical analysis using transducers and sketch algorithms
Save the princess
Data driven going to market strategy
Spec: a lisp-flavoured type system
A data layer in clojure
Odkrivanje segmentov iz podatkov
Using Onyx in anger
Spec + onyx
Dao of lisp
Predicting the future with goopti
Living with-spec
Living with-spec
Doing data science with Clojure

Recently uploaded (20)

PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PPTX
A Complete Guide to Streamlining Business Processes
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
DOCX
Factor Analysis Word Document Presentation
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PPTX
Leprosy and NLEP programme community medicine
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
importance of Data-Visualization-in-Data-Science. for mba studnts
PPTX
IMPACT OF LANDSLIDE.....................
PDF
annual-report-2024-2025 original latest.
PPTX
Introduction to Inferential Statistics.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
Business Analytics and business intelligence.pdf
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPTX
Managing Community Partner Relationships
PPTX
Database Infoormation System (DBIS).pptx
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
STERILIZATION AND DISINFECTION-1.ppthhhbx
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
A Complete Guide to Streamlining Business Processes
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Factor Analysis Word Document Presentation
ISS -ESG Data flows What is ESG and HowHow
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
Leprosy and NLEP programme community medicine
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
importance of Data-Visualization-in-Data-Science. for mba studnts
IMPACT OF LANDSLIDE.....................
annual-report-2024-2025 original latest.
Introduction to Inferential Statistics.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Business Analytics and business intelligence.pdf
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
Managing Community Partner Relationships
Database Infoormation System (DBIS).pptx

Exploratory analysis