SlideShare a Scribd company logo
How to Use Spatial Data
Science in Your Site Planning
Process
FOLLOW @CARTO ON TWITTER
The Sum of Our Parts
Today’s Speakers
Giulia Carella Steve Isaac
Data Scientist Content Marketing Manager
CARTO — Turn Location Data into Business Outcomes
CARTO is the platform to build
powerful Location Intelligence apps
with the best data streams available.
CARTO
Customers
Pioneers in Location Intelligence
1,200 End-users
300K Team members
100+
CARTO — Turn Location Data into Business Outcomes
The Complete Journey
1. Data Ingestion & Management
2. Enrichment
3. Analysis
4. Solutions & Visualization
5. Integration
CARTO — Turn Location Data into Business Outcomes
The Complete Journey
1. Data Ingestion & Management
2. Enrichment
3. Analysis
4. Solutions & Visualization
5. Integration
Enrichment
● Save time in gathering spatial data,
augmenting your existing data with
demographics from across the globe
● Create locations from addresses and
understand travel time all from within
CARTO
● Develop robust ETL processes and update
mechanisms so your data is always enriched
● Premium data to understand and analyze
deeper trends and behavior
Data
Observatory
ETL
Processing
CARTO
Grid
Data Services
API
Routing &
Traffic
Geocoding
Analysis
● Bring maps and data into your Data Science
workflows and the Python data science
ecosystem with CARTOframes
● Machine learning embedded in CARTO as
simple SQL calls for clustering, outliers analysis,
time series predictions, and geospatial
weighted regression
● Use the power of PostGIS and our APIs to
productionalize analysis workflows in your
CARTO platform
CARTO Frames Analysis
API
SQL
API
Python
SDK
Spatial Data Science for Site Planning
Financial
Housing
Human Mobility
Road Traffic Points of Interest
Demographics
Merchant and ATM transaction
data from leading banks and
credit card companies
Mobile device and GPS data
provide insight into human
movement patterns
The most recent census data
including: age, income, household
types and more
Property statistics, prices, and
history to drive decisions in
investment portfolios
Data from routing apps and GPS
to analyse traffic patterns and
commuter behaviour
Location data for business
establishments, restaurants,
schools, attractions, and more
CARTO — Turn Location Data into Business Outcomes
The Age of Data Abundance?
AND ITS HIDDEN PITFALLS
Sampling Bias
Data may not be collected using
random samples, e.g. need
extrapolation to the total
population
AND ITS HIDDEN PITFALLS
Sampling Bias
Data may not be collected using
random samples, e.g. need
extrapolation to the total
population
Anonymisation
Data needs to be anonymised
to meet regulations, and
vendors have different
approaches for that
AND ITS HIDDEN PITFALLS
Sampling Bias
Data may not be collected using
random samples, e.g. need
extrapolation to the total
population
Anonymisation
Data need to be anonymised to
meet regulations, and vendors
have different approaches for
that
Different Aggregations
Data comes in different spatial
aggregations such as grid cells
of different sizes or
administrative boundaries
Financial
Grid 110x110m
POI
Points aggregated on a 70x70m grid
Demographics
Census tracts
Building a Common
Reference Grid
Which spatial scale is correct?
How do we change from one spatial scale to another?
THE CHANGE OF SUPPORT PROBLEM
Statistical downscale/upscale model to
DISAGGREGATE/AGGREGATE
the data at different spatial resolutions
A PRELIMINARY SOLUTION
AREA WEIGHTENING
Which spatial scale is correct?
How do we change from one spatial scale to another?
Exploring the available data:
CARTO DATA OBSERVATORY
Viz using vector maps
Connector to CARTO platform
WHAT IS CARTOframes?
● Python package
● To be used in Jupyter Notebooks
● Built for Data Scientists
● Part of CARTO Analysis stack
CARTOFrames Analysis API SQL API Python SDK
How to Use Spatial Data Science in your Site Planning Process? [CARTOframes]
Quadkeys
https://docs.microsoft.com/en-us/bingmaps/articles/bing-maps-tile-system
How to Use Spatial Data Science in your Site Planning Process? [CARTOframes]
How to Use Spatial Data Science in your Site Planning Process? [CARTOframes]
How to Use Spatial Data Science in your Site Planning Process? [CARTOframes]
Defining Similarity
for Site Planning
CARTO — Turn Location Data into Business Outcomes
WITH SOME CAVEATS:
1. Different variances?
2. Correlated variables?
3. Missing data?
4. When is a distance small enough? Or how to define
similarity?
TWIN AREA MODEL
DIFFERENT VARIANCES
CORRELATED VARIABLES
CORRELATED VARIABLES
1. Eigen-decomposition of the sample covariance matrix
2. Rearrange the columns in the eigenvector matrix in order of decreasing eigenvalue
3. Keep only the eigenvectors that correspond to the p-largest eigenvalues
4. Compute the principal components (PC)
5. Reconstruct the original data
How many PCs? Let’s use an ensemble!
MISSING DATA
1. PCA can also be described as the ML solution of a probabilistic latent variable model (PPCA)
2. Find the ML estimate for the model parameters using the EM algorithm
2.1. E-step:
2.2. M-step
Similarity Score
HOW TO DEFINE SIMILARITY
So far we have only computed distances in the variable space
0 1
Actually since we are computing an K-ensemble of distances...
Let’s compare instead the score for each target location to the score from the mean vector data
Takeaways
CARTO Data Observatory
(DO) for data enrichment
CARTOframes as a connector
to the DO and for powerful
vector visualizations
Site-planning applications
require various sources of
location data streams
Easily derive data-driven
insights when opening,
relocating or consolidating
location sites
Thanks for listening! Any
questions?
Request a demo at CARTO.COM
Giulia Carella
Data Scientist // giulia@carto.com
Steve Isaac
Content Marketing Manager // sisaac@carto.com

More Related Content

PDF
Location Intelligence: The Secret Sauce for OOH Advertising
PDF
The Ultimate Guide to Location Data: New Datasets & Methods
PDF
Spatial Analytics in the Cloud Using Snowflake & CARTO
PDF
CARTO BUILDER: from visualization to geospatial analysis
PDF
Understanding Retail Catchment Areas with Human Mobility Data
PDF
Powering the Micromobility Revolution with Spatial Analysis
PDF
Applying Spatial Analysis to Real Estate Decision-Making
PDF
Unlocking Geospatial Analytics Use Cases with CARTO and Databricks
Location Intelligence: The Secret Sauce for OOH Advertising
The Ultimate Guide to Location Data: New Datasets & Methods
Spatial Analytics in the Cloud Using Snowflake & CARTO
CARTO BUILDER: from visualization to geospatial analysis
Understanding Retail Catchment Areas with Human Mobility Data
Powering the Micromobility Revolution with Spatial Analysis
Applying Spatial Analysis to Real Estate Decision-Making
Unlocking Geospatial Analytics Use Cases with CARTO and Databricks

What's hot (20)

PDF
Spatial Analysis: the Countervirus in OOH?
PDF
How retail analytics help monitor big box stores performance
PDF
How to Analyze & Optimize Mobility with Geospatial Data (Snowflake).pdf
PDF
7 Reasons Why CPG Marketers Are Turning To Location Analytics
PPTX
Data Visualization
PDF
Developing Spatial Applications with CARTO for React v1.1
PPT
Applications of GIS to Logistics and Transportation
PDF
Indoor Mapping: Why Precision Matters
PPTX
Data Visualization & Data Storytelling
PPTX
Data Visualization Design Best Practices Workshop
PDF
Data Visualization
PPS
Spatial Analysis Using GIS
PPTX
Gis in urban
PPTX
Data Visualization
PPTX
R vs python. Which one is best for data science
PDF
The Role of Data Science in Real Estate
PPT
GIS - lecture-1.ppt
PDF
4 v's of Big Data | The Knowledge Academy
PDF
"Introduction to Data Visualization" Workshop for General Assembly by Hunter ...
PDF
Data Visualization in Data Science
Spatial Analysis: the Countervirus in OOH?
How retail analytics help monitor big box stores performance
How to Analyze & Optimize Mobility with Geospatial Data (Snowflake).pdf
7 Reasons Why CPG Marketers Are Turning To Location Analytics
Data Visualization
Developing Spatial Applications with CARTO for React v1.1
Applications of GIS to Logistics and Transportation
Indoor Mapping: Why Precision Matters
Data Visualization & Data Storytelling
Data Visualization Design Best Practices Workshop
Data Visualization
Spatial Analysis Using GIS
Gis in urban
Data Visualization
R vs python. Which one is best for data science
The Role of Data Science in Real Estate
GIS - lecture-1.ppt
4 v's of Big Data | The Knowledge Academy
"Introduction to Data Visualization" Workshop for General Assembly by Hunter ...
Data Visualization in Data Science
Ad

Similar to How to Use Spatial Data Science in your Site Planning Process? [CARTOframes] (20)

PDF
Le rôle de l’intelligence géospatiale dans la reprise économique
PDF
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]
PDF
Unlock the power of spatial analysis using CARTO and python [CARTOframes]
PDF
Market analysis through Consumer Behavior Pattern Insights
PDF
Supercharging Site Planning in Retail & Real Estate [CARTO Reveal]
PPTX
Location Intelligence - the Next Evolution of Business Applications
PDF
CARTO for Retail: Driving Site Selection Decisions with Advanced Spatial Anal...
PPTX
Hedge Fund case study solution - Credit default swaps execution system and Gr...
PDF
Deploying Enterprise Scale Deep Learning in Actuarial Modeling at Nationwide
PDF
Tips and tricks for Working with Demographic Data [CARTOframes & Python]
PDF
AGIT 2015 - Hans Viehmann: "Big Data and Smart Cities"
PPTX
Market pulse
PDF
Overview of business intelligence
PDF
Everyday Data Science
PDF
Dista Insight Data Sheet.pdf
PPT
QlikView and Location / Map Intelligence
PDF
Improve Store Expansion (Territory Management Featuring)
PPTX
Big data solutions on cloud – the way forward
PPTX
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
PPTX
Don't Get Lost in Data Without a Map
Le rôle de l’intelligence géospatiale dans la reprise économique
CARTO en 5 Pasos: del Dato a la Toma de Decisiones [CARTO]
Unlock the power of spatial analysis using CARTO and python [CARTOframes]
Market analysis through Consumer Behavior Pattern Insights
Supercharging Site Planning in Retail & Real Estate [CARTO Reveal]
Location Intelligence - the Next Evolution of Business Applications
CARTO for Retail: Driving Site Selection Decisions with Advanced Spatial Anal...
Hedge Fund case study solution - Credit default swaps execution system and Gr...
Deploying Enterprise Scale Deep Learning in Actuarial Modeling at Nationwide
Tips and tricks for Working with Demographic Data [CARTOframes & Python]
AGIT 2015 - Hans Viehmann: "Big Data and Smart Cities"
Market pulse
Overview of business intelligence
Everyday Data Science
Dista Insight Data Sheet.pdf
QlikView and Location / Map Intelligence
Improve Store Expansion (Territory Management Featuring)
Big data solutions on cloud – the way forward
Big Data Solutions on Cloud – The Way Forward by Kiththi Perera SLT
Don't Get Lost in Data Without a Map
Ad

More from CARTO (20)

PDF
4 Ways Telecoms are Using GIS & Location Intelligence.pdf
PDF
Understanding Residential Energy Usage with CARTO & Doorda.pdf
PDF
How to Use Spatial Data to Create a Wildfire Risk Index.pdf
PDF
Winning Market Expansion Strategies for CPG brands, Using Spatial Data and An...
PPTX
Advancing Spatial Analysis in BigQuery using CARTO Analytics Toolbox
PDF
Can Kanye West Save Gap? Real-Time Consumer Social Media Segmentation On CARTO
PDF
Developing Spatial Applications with Google Maps and CARTO
PDF
Scaling Spatial Analytics with Google Cloud & CARTO
PDF
Sentiment, Popularity & Potentiality: 3 Unique KPIs to add to your Site Selec...
PDF
CARTO Cloud Native – An Introduction to the Spatial Extension for BigQuery
PDF
What Spatial Analytics Tells Us About the Future of the UK High Street
PDF
Using Spatial Analysis to Drive Post-Pandemic Site Selection in Retail
PDF
6 Ways CPG Brands are Using Location Data to Prepare for the "Post-Pandemic"
PDF
Using Places (POI) Data for QSR Site Selection
PDF
5 Ways to Strategize for Emerging Short-Term Rental Trends
PDF
How to Use Geospatial Data to Identify CPG Demnd Hotspots
PDF
Using Location Data to Adapt to the New normal
PDF
Analyzing the Rise of the Staycation during COVID-19
PDF
Why High-Resolution Spatial Data on Population Matters
PDF
Google Analytics location data visualised with CARTO & BigQuery
4 Ways Telecoms are Using GIS & Location Intelligence.pdf
Understanding Residential Energy Usage with CARTO & Doorda.pdf
How to Use Spatial Data to Create a Wildfire Risk Index.pdf
Winning Market Expansion Strategies for CPG brands, Using Spatial Data and An...
Advancing Spatial Analysis in BigQuery using CARTO Analytics Toolbox
Can Kanye West Save Gap? Real-Time Consumer Social Media Segmentation On CARTO
Developing Spatial Applications with Google Maps and CARTO
Scaling Spatial Analytics with Google Cloud & CARTO
Sentiment, Popularity & Potentiality: 3 Unique KPIs to add to your Site Selec...
CARTO Cloud Native – An Introduction to the Spatial Extension for BigQuery
What Spatial Analytics Tells Us About the Future of the UK High Street
Using Spatial Analysis to Drive Post-Pandemic Site Selection in Retail
6 Ways CPG Brands are Using Location Data to Prepare for the "Post-Pandemic"
Using Places (POI) Data for QSR Site Selection
5 Ways to Strategize for Emerging Short-Term Rental Trends
How to Use Geospatial Data to Identify CPG Demnd Hotspots
Using Location Data to Adapt to the New normal
Analyzing the Rise of the Staycation during COVID-19
Why High-Resolution Spatial Data on Population Matters
Google Analytics location data visualised with CARTO & BigQuery

Recently uploaded (20)

PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
1_Introduction to advance data techniques.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Computer network topology notes for revision
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
Mega Projects Data Mega Projects Data
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PDF
annual-report-2024-2025 original latest.
PPT
Quality review (1)_presentation of this 21
PPTX
Database Infoormation System (DBIS).pptx
ISS -ESG Data flows What is ESG and HowHow
1_Introduction to advance data techniques.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Computer network topology notes for revision
climate analysis of Dhaka ,Banglades.pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Introduction-to-Cloud-ComputingFinal.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Clinical guidelines as a resource for EBP(1).pdf
oil_refinery_comprehensive_20250804084928 (1).pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Galatica Smart Energy Infrastructure Startup Pitch Deck
Mega Projects Data Mega Projects Data
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
IB Computer Science - Internal Assessment.pptx
annual-report-2024-2025 original latest.
Quality review (1)_presentation of this 21
Database Infoormation System (DBIS).pptx

How to Use Spatial Data Science in your Site Planning Process? [CARTOframes]

  • 1. How to Use Spatial Data Science in Your Site Planning Process FOLLOW @CARTO ON TWITTER
  • 2. The Sum of Our Parts Today’s Speakers Giulia Carella Steve Isaac Data Scientist Content Marketing Manager
  • 3. CARTO — Turn Location Data into Business Outcomes CARTO is the platform to build powerful Location Intelligence apps with the best data streams available.
  • 4. CARTO Customers Pioneers in Location Intelligence 1,200 End-users 300K Team members 100+
  • 5. CARTO — Turn Location Data into Business Outcomes The Complete Journey 1. Data Ingestion & Management 2. Enrichment 3. Analysis 4. Solutions & Visualization 5. Integration
  • 6. CARTO — Turn Location Data into Business Outcomes The Complete Journey 1. Data Ingestion & Management 2. Enrichment 3. Analysis 4. Solutions & Visualization 5. Integration
  • 7. Enrichment ● Save time in gathering spatial data, augmenting your existing data with demographics from across the globe ● Create locations from addresses and understand travel time all from within CARTO ● Develop robust ETL processes and update mechanisms so your data is always enriched ● Premium data to understand and analyze deeper trends and behavior Data Observatory ETL Processing CARTO Grid Data Services API Routing & Traffic Geocoding
  • 8. Analysis ● Bring maps and data into your Data Science workflows and the Python data science ecosystem with CARTOframes ● Machine learning embedded in CARTO as simple SQL calls for clustering, outliers analysis, time series predictions, and geospatial weighted regression ● Use the power of PostGIS and our APIs to productionalize analysis workflows in your CARTO platform CARTO Frames Analysis API SQL API Python SDK
  • 9. Spatial Data Science for Site Planning
  • 10. Financial Housing Human Mobility Road Traffic Points of Interest Demographics Merchant and ATM transaction data from leading banks and credit card companies Mobile device and GPS data provide insight into human movement patterns The most recent census data including: age, income, household types and more Property statistics, prices, and history to drive decisions in investment portfolios Data from routing apps and GPS to analyse traffic patterns and commuter behaviour Location data for business establishments, restaurants, schools, attractions, and more
  • 11. CARTO — Turn Location Data into Business Outcomes The Age of Data Abundance?
  • 12. AND ITS HIDDEN PITFALLS Sampling Bias Data may not be collected using random samples, e.g. need extrapolation to the total population
  • 13. AND ITS HIDDEN PITFALLS Sampling Bias Data may not be collected using random samples, e.g. need extrapolation to the total population Anonymisation Data needs to be anonymised to meet regulations, and vendors have different approaches for that
  • 14. AND ITS HIDDEN PITFALLS Sampling Bias Data may not be collected using random samples, e.g. need extrapolation to the total population Anonymisation Data need to be anonymised to meet regulations, and vendors have different approaches for that Different Aggregations Data comes in different spatial aggregations such as grid cells of different sizes or administrative boundaries
  • 15. Financial Grid 110x110m POI Points aggregated on a 70x70m grid Demographics Census tracts
  • 17. Which spatial scale is correct? How do we change from one spatial scale to another? THE CHANGE OF SUPPORT PROBLEM Statistical downscale/upscale model to DISAGGREGATE/AGGREGATE the data at different spatial resolutions
  • 18. A PRELIMINARY SOLUTION AREA WEIGHTENING Which spatial scale is correct? How do we change from one spatial scale to another?
  • 19. Exploring the available data: CARTO DATA OBSERVATORY
  • 20. Viz using vector maps Connector to CARTO platform WHAT IS CARTOframes? ● Python package ● To be used in Jupyter Notebooks ● Built for Data Scientists ● Part of CARTO Analysis stack CARTOFrames Analysis API SQL API Python SDK
  • 27. CARTO — Turn Location Data into Business Outcomes WITH SOME CAVEATS: 1. Different variances? 2. Correlated variables? 3. Missing data? 4. When is a distance small enough? Or how to define similarity? TWIN AREA MODEL
  • 31. 1. Eigen-decomposition of the sample covariance matrix 2. Rearrange the columns in the eigenvector matrix in order of decreasing eigenvalue 3. Keep only the eigenvectors that correspond to the p-largest eigenvalues 4. Compute the principal components (PC) 5. Reconstruct the original data How many PCs? Let’s use an ensemble!
  • 33. 1. PCA can also be described as the ML solution of a probabilistic latent variable model (PPCA) 2. Find the ML estimate for the model parameters using the EM algorithm 2.1. E-step: 2.2. M-step
  • 34. Similarity Score HOW TO DEFINE SIMILARITY So far we have only computed distances in the variable space 0 1 Actually since we are computing an K-ensemble of distances... Let’s compare instead the score for each target location to the score from the mean vector data
  • 35. Takeaways CARTO Data Observatory (DO) for data enrichment CARTOframes as a connector to the DO and for powerful vector visualizations Site-planning applications require various sources of location data streams Easily derive data-driven insights when opening, relocating or consolidating location sites
  • 36. Thanks for listening! Any questions? Request a demo at CARTO.COM Giulia Carella Data Scientist // giulia@carto.com Steve Isaac Content Marketing Manager // sisaac@carto.com