SlideShare a Scribd company logo
Geo-spatial Research for Impact:
Beyond Analysis  Towards Synthesis
Ramesh Raskar
Assoc. Professor
MIT Media Lab
with
Nikhil Naik
Ilke Demir
Poverty, Pop-density, Crop, Methane .. maps
Google Street View
Online User Photos: Social voting in the real world
Phototourism by Seitz at al
Analysis: Perceived Safety
Naik, Raskar, Hidalgo 2016
Analysis  Synthesis
Street Address: Assign | Adopt
• 75% of the world population without street addresses
• Timely ambulance delivery
• Stimulate digital economy via eCommerce
• Protect property rights of the marginalized
• Crisis response (Haiti, 48 hrs to coord aid)
Beyond Analysis  Towards Synthesis
Actionable Insights
Traffic Nudge
Crisis Response
Street Addr
Govt Policies
Large Scale Visual Study
Visual Census
Socio-eco observations
Sentiment maps
Occasional Frequent
Low
Quality
Census
High
Quality
Satellite,
Street view
Phone, Mobility,
Social, Financial
AV,
PlanetLabs etc
Sense
Process
Respond
Capture Analyze Act
Low
Level
Mid
Level
High
Level
Capture Analyze Act
Low
Level
Street-level
Measure
•Visual Census
•Counting/Density
Alert
•Traffic Interventions
Mid
Level
Aerial
Understand
•Geolocalization
•Crowds
•3D Modeling
Assist
•Crisis Response
•Agriculture
•Insurance/Hedge
Funds
High
Level
Satellite
Predict
• Economic activity
Change
•Street Addr
•Government planning
Capture  Present  Future
Low
Level
Street-level
Street View
Crowdsourcing
(mapillary)
Traffic Cameras
Autonomous
Vehicles
Cloud Dashcams
Mid
Level
Aerial
Commercial airplanes
Amateur Drones
Autonomous
Drones
High
Level
Satellite
Meter-resolution
Limited coverage for
high-res
Sub-meter, real-
time, entire Earth
(Planet Lab,
OneWeb)
Ack: Nikhil Naik
Analyze Present  Future
Low
Level
Measure
•Visual Census
•Counting/Density
Streetscore,
Streetchange,
Visual Census
(Fei-Fei Li)
Generative
modeling of
cities
Mid
Level
Understand
•Geolocalization
•Crowds
•3D Modeling
DeepRoadMap
per (Urtasun et
al.)
Design
Suggestions
High
Level
Aggregate and
Predict
• Economic activity
Jean and
Ermon
Jayachandran
Deforestation
Study
Real-time data
analysis.
Actionable
information
Socio-Economic Inference from Digital Patterns
Satellite ImageryStreet-level Imagery Aerial Imagery
Phone Mobility Data
Social Networks, Photos
Device
Activity
Built
Environment
Human
Activity
84 countries, no data
Ack: Nikhil Naik
400K Google StreetViews
Socio Economic and Physical environment
4k Images, 200K pairwise comparisons
Place Pulse 1.0
Naik, Raskar, Hidalgo 2016
1.7K Ranked 400K Predicted
Naik, Raskar, Hidalgo 2016
New York Boston
Detroit
Chicago
Validated with Crime Stats
streetscore.media.mit.edu
105 Kent Avenue, Brooklyn, NY
August 2007
Streetscore = 1.8/10
105 Kent Avenue, Brooklyn, NY
September 2014
Streetscore = 7.2/10
Urban Growth in New York
2007 - 2014
Raskar, Camera Culture, MIT Media Lab
http://cameraculture.info
MIT Media Lab
Ilke Demir, Nikhil Naik
Capture Analyze Act
Low
Level
Street-level
Measure
•Visual Census
•Counting/Density
Alert
•Traffic Interventions
Mid
Level
Aerial
Understand
•Geolocalization
•Crowds
•3D Modeling
Assist
•Crisis Response
•Agriculture
•Insurance/Hedge
Funds
High
Level
Satellite
Aggregate and Predict
• Economic activity
Change
•Street Addr
•Government planning
Example: Congestion Pricing
(Nikhil Naik et al.)
• Jakarta: Response of vehicles by types
• CV + traffic cameras across the city to detect types + number of vehicles
• Understand traffic flows before/after congestion pricing is introduced
• Adjust rates for different types of vehicles/ in different areas
Street Address: Assign | Adopt
• 75% of the world population
without street addresses
• Timely ambulance delivery
• Stimulate digital economy via eCommerce
• Protect property rights of the marginalized
• Crisis response (Haiti, 48 hrs to coord aid)
Street Addresses
from Satellite Imagery
İlke Demir, Forest Hughes,
Aman Raj, Kaunil Dhruv,
Suryanarayana M. Muddala,
Sanyam Garg, Barrett Doo,
Ramesh Raskar
IJCG 2018
Thanks
Nikhil Naik Tristan Swedish Praneeth Vepakomma Ilke Demir
Forest Hughes Jatin Malhotra SuryaNarayana Murthy
Kavnil Dhruv Aman Raj Barrett Doo Praveen Gedam Anna Roy
Cesar A. Hidalgo Guan Pang Jing Huang Daniel Aliaga
Manohar Paluri Pierre Roux Yael Maguire Leo Tsourides Divyaa
Ravichandran Sanyam Garg Sai Sri Sathya Grace Kermani
Tobias Tiecke Andreas Gros Santanu Bhattacharya
Kabir Rustogi Will Marshall
Pilot 1:
Will people use street names?
In use After 6 monthsCommunity assigned names + Signs
Pilot 2
Will SMEs use the labeling scheme?
Measured relative efficiency for Pizza delivery
Pilot 3
Suitable for E-commerce work-flow?
What3words
parrot.casino.failed
Robocodes
75D. Road NE27.
Dhule.MhIn
Landmarks
Green Park
Poor Coverage
Lat/lon
Goal: Memorable Geocoding
Physical
1050 Market St,
Fresno, CA, US
.
What3words:
A: parrot.casino.failed
B: issuer.lollipop.ripe
- Irrelevant words
based on lat/lon.
Robocodes:
75D.NE27.Dhule.MhIn
76C.NE27.Dhule.MhIn
- Hierarchical and
linear addresses.
Google Maps:
Near Green Park
Near Green Park
- No street names or numbers
Point vs Edges for Geometric Queries
Addressing Schemes Around the World
London postal code system:
Radial regions based on orientation and distance
South Korea streets:
Meter markers
Japan block system:
Hard to decipher
Dubai addressing:
Uses districts
Berlin numbering:
Zigzag house pattern
Robocode Scheme
• 5 alphanumeric fields
• Hierarchical and linear descriptors
• To close the gap between physical
addresses and automated geocoding
Road naming scheme:
- distance from the center
- orientation in odd parity
i.e. WB14
Region naming scheme:
- orientation wrt downtown
- distance from downtown
i.e. WB
House numbering scheme:
- meter markers on the road
- block letters from the road
i.e. 38K WB14
“I7 Hacker Way, Menlo Park, CA, US”
Design Choices
Linear: similar addresses stored in a linear fashion
Hierarchical: top-down structure for spatial encapsulation
Compressible: 5x4 max (chars x words)
Universal: independent of local language
Inquirable: useful for geometric, proximity-based, and type-ahead queries
Extendible: dynamically modifiable for new places
Robust: flexible for overestimation and noise
StructuralDesignParameters
forefficientcomputerimplementation
Linear: closer addresses are given related names
Hierarchical: top-down subdivision of the world
Memorable: short and alphanumeric, easily convertible
Intuitive: with a sense of direction and distance
Topological: consistent with road topology
Inclusive: with local names (city, state)
Physical: consistent with natural boundaries
SemanticDesignParameters
foruserfriendliness
Machine
Needs
Human
Needs
Our Pipeline
Satellite Images Predictions Road Segments
Clustering
Segmentation
RegionsRoad IDsMarkers + Blocks
Labeling
Satellite Images
ç
• Irregular urban structure
• Illumination/weather/country
• Different road types
Segmentation
• Binary road masks
• 19K*19K, 0.5m/pixel
• SegNet
Extracting Road Segments
• Orientation based median filtering
• Road segments by orientation bucketing
NF
NH
NE
Region Creation
• Road graph: Node=intersection,
edge=road, weight=length
• Partition for max inter, min intra
connectivity, using normalized min-cut.
• 𝑛 𝑚𝑎𝑥 = 𝑐𝑒𝑖𝑙
𝑟𝑜𝑎𝑑𝑠
88
Region and Road Naming
• Cmax
𝑟𝑜𝑎𝑑𝑠
𝐴
→ 𝐶𝐴 (downtown)
• Orientation bucketing into N, S, W, E
• Trace regions based on distance to CA
• Orientation bucketing into major axes
• Trace roads based on order
Offsetting and Meter Marking
• 5 meter marker along the road
• Odd/even based on RHR
• Distance field of roads: block offset
Unmapped Developing Country
Regions follow natural boundaries
Street Address with Robocodes
• From Satellite Imagery to Deployed Street Addresses
• Generative address : linear, hierarchical, and intuitive
• Human friendly rather than machine friendly
Occasional Frequent
Low
Quality
Census
High
Quality
Satellite,
Street view
Phone, Mobility,
Social, Financial
AV,
PlanetLabs etc
Sense
Process
Respond
Capture Analyze  Act
Low
Level
Street-level
Measure
•Visual Census
•Counting/Density
Alert
•Traffic Interventions
Mid
Level
Aerial
Understand
•Geolocalization
•Crowds
•3D Modeling
Assist
•Crisis Response
•Agriculture
•Insurance/Hedge
Funds
High
Level
Satellite
Aggregate and Predict
• Economic activity
Change
•Street Addr
•Government planning
Pervasive
Recording,
Incentives,
Distributed
ownership,
Privacy,
Full proof
authenticity,
Equality
Geo-spatial Research: Transition from Analysis to Synthesis
Act Alert Assist Change
Low/Mi
d/High
Level
Alert about the
state of
people/economy/b
uilt environment
(e.g., predict crop
yield from satellite
imagery, predict
insurance price
from street view)
Assist in acting on
information by
providing
suggestions based
on data
(e.g., design
optimal congestion
pricing based on
detected cars,
design crisis
response in
hurricanes)
?
Inaccessible Areas
• To extend our format to cover areas that are not accessible by
streets, we explored different implementations to cover such
areas, which are 26*5 m away from any street.
• Geocoding as a function (excluding the version field):
f (info, lat, lon) = x.y.z.t
• For places with roads, info={road network, city, country}
f (R, C) = x.y.city.country
• Extreme case: only reliable information is latitude/longitude!
52
f(C,lat,lon) = hash(round(lat,3)) + dir(lat) .
hash(round(lon,3)) +dir(lon) . C
L-A-T-dir.L-O-N-dir.name.area
Inaccessible Areas: Blackholes!
• Linear hashing:
• 26 letters + 10 digits
• 100m x 100 m granularity
• Last letter is the hemisphere
• Range: 359.999, longitude: 7PRZ W
• Hierarchical hashing:
• Enlarge the grid from to 1 km x 1 km
• Using two floating points = three letters
• Within each cell, re-hash it to a 36 x 36 grid = one letter
• New resolution: 30m, represented by five letters
53
f(C,lat,lon) = hash(round(lat,2)) + hash(lat - round(lat,2)) + dir(lat) .
hash(round(lon,2)) + hash(lon - round(lon,2)) + dir(lon) . C
LlatLlatHlatDlat .LlonLlonHlonDlon . name . Ocean /Continent /etc
Thanks!
What next?
Today
Tomorrow
Friday
A month
• Robocode.info
• Join our presentation in CVPR WiCV. Friday 10am
• Join us with your new ideas at SIGGRAPH 2018 Maps & Urban Data session.
Code: https://github.com/facebookresearch/street-addresses
Paper: https://research.fb.com/publications/generative-street-
addresses-from-satellite-imagery/
Bonus: Blackholes!
Main aim: f(<place>)=robocode
Base case:
<place> = <house, street, city, country>
“12C.NA14.PALO.CAUS”
No street:
<place> = <lat, lon, city, country>
“F12.HN3.PALO.CAUS”
No city/country:
<place> = <lat, lon, other info (ocean, dessert, etc.)>
“JK3.3DF.PAC.OCEA”
Region Experiments
• Experimented with (a) normalized min-cut, (b) Newman-
Girvan, (c) modularity based partitioning.
• Experimented with image based methods (superpixels,
region growing) and the dual of the road graph.
• Evaluated with urban rules (geography, population,
road distribution)
Output Maps and Tools
• .osm maps with roads (meter marking and offsetting on the fly)
• ID-tool of MapBox for on-demand inverse/forward geocoding
• rtree extension for efficient spatial querying
• Experimental mobile app for self navigation
• 21.7% decrease in arrival time using Robocodes
Analyze: Three Types of Outputs
1. Semantic
2. Objective
Population Density
76000/sq. mile
3. Qualitative
Assign a semantic
label to each pixel
Label road quality
as
“Bad” or “Good”

More Related Content

PDF
Semantic Integration at large scale
PDF
The right to the smart city
PDF
GIS and Remote Sensing Training at Pitney Bowes Software
PPTX
transitFinal
PPTX
Enhanced Urban Planning through Disruptive Technologies for more Age Friendl...
PDF
Lindemann arc gis forlocalgovt
PDF
Engaging the Public in Urban Solutions - GeoSUMR Project. C. Terborgh, M. Gol...
PDF
Planning in an era of smart urbanism
Semantic Integration at large scale
The right to the smart city
GIS and Remote Sensing Training at Pitney Bowes Software
transitFinal
Enhanced Urban Planning through Disruptive Technologies for more Age Friendl...
Lindemann arc gis forlocalgovt
Engaging the Public in Urban Solutions - GeoSUMR Project. C. Terborgh, M. Gol...
Planning in an era of smart urbanism

Similar to Geo-spatial Research: Transition from Analysis to Synthesis (20)

PPTX
Improve site selection and network planning with Dynamic Demographics
PDF
Introduction to Geographic Information System (GIS)
PDF
Data-driven urbanism (Amsterdam, Jan 2017)
PPTX
The Smart City Opportunity - The CTO Perspective
PPTX
Gis dr rahul
PDF
BDVe Webinar Series - QROWD: The Human Factor in Big Data
PDF
BDVe Webinar Series - QROWD: The Human Factor in Big Data
PPTX
The Real-Time City? Data-driven, networked urbanism and the production of sm...
PDF
Understanding Human Mobility
PPTX
RS and GIS-Fundamentals.pptx cskhpkv palampur
PDF
Smart cities: realising the promises while minimizing the perils
PDF
PDF
Human factor in big data qrowd bdve
PDF
Basic Gis
PPTX
Geographical information system : GIS and Social Media
PDF
UrbanFootprint at NPC 2018
PDF
Access to markets, technologies, and services (Carlo Azzarri, IFPRI)
PPT
Introduction to GIS for basic student and other IT staff
PPT
Intro_GIS.ppt
PPTX
CKX: Wellbeing Toronto - More Than Just a Map
Improve site selection and network planning with Dynamic Demographics
Introduction to Geographic Information System (GIS)
Data-driven urbanism (Amsterdam, Jan 2017)
The Smart City Opportunity - The CTO Perspective
Gis dr rahul
BDVe Webinar Series - QROWD: The Human Factor in Big Data
BDVe Webinar Series - QROWD: The Human Factor in Big Data
The Real-Time City? Data-driven, networked urbanism and the production of sm...
Understanding Human Mobility
RS and GIS-Fundamentals.pptx cskhpkv palampur
Smart cities: realising the promises while minimizing the perils
Human factor in big data qrowd bdve
Basic Gis
Geographical information system : GIS and Social Media
UrbanFootprint at NPC 2018
Access to markets, technologies, and services (Carlo Azzarri, IFPRI)
Introduction to GIS for basic student and other IT staff
Intro_GIS.ppt
CKX: Wellbeing Toronto - More Than Just a Map
Ad

More from Camera Culture Group, MIT Media Lab (20)

PPTX
Raskar Sig2017 Siggraph Achievement Award Talk
PPTX
Lost Decade of Computational Photography
PPTX
God’s Eye View: Will global AI empower us or destroy us? | Ramesh Raskar
PPTX
Dont follow the rainbow: How to avoid career traps that can lead you to fail,...
PDF
Raskar PhD and MS Thesis Guidance
PDF
Making Invisible Visible, Ramesh Raskar Keynote at Embedded Vision 2019
PPTX
Augmented Surgeons: AI AR for Anatome, Raskar Aria 2019
PDF
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
PPTX
Unspoken Challenges in AR and XR
PPTX
Raskar stanfordextremecompuimagingapr2016
PDF
What is SIGGRAPH NEXT? Intro by Ramesh Raskar
PPTX
What is Media in MIT Media Lab, Why 'Camera Culture'
PDF
Raskar UIST Keynote 2015 November
PPTX
Multiview Imaging HW Overview
PPTX
Time of Flight Cameras - Refael Whyte
PPTX
Leap Motion Development (Rohan Puri)
PPTX
Compressed Sensing - Achuta Kadambi
PPTX
Coded Photography - Ramesh Raskar
PDF
Stereo and 3D Displays - Matt Hirsch
Raskar Sig2017 Siggraph Achievement Award Talk
Lost Decade of Computational Photography
God’s Eye View: Will global AI empower us or destroy us? | Ramesh Raskar
Dont follow the rainbow: How to avoid career traps that can lead you to fail,...
Raskar PhD and MS Thesis Guidance
Making Invisible Visible, Ramesh Raskar Keynote at Embedded Vision 2019
Augmented Surgeons: AI AR for Anatome, Raskar Aria 2019
Split Learning versus Federated Learning for Data Transparent ML, Camera Cult...
Unspoken Challenges in AR and XR
Raskar stanfordextremecompuimagingapr2016
What is SIGGRAPH NEXT? Intro by Ramesh Raskar
What is Media in MIT Media Lab, Why 'Camera Culture'
Raskar UIST Keynote 2015 November
Multiview Imaging HW Overview
Time of Flight Cameras - Refael Whyte
Leap Motion Development (Rohan Puri)
Compressed Sensing - Achuta Kadambi
Coded Photography - Ramesh Raskar
Stereo and 3D Displays - Matt Hirsch
Ad

Recently uploaded (20)

PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Cloud computing and distributed systems.
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
KodekX | Application Modernization Development
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Mobile App Security Testing_ A Comprehensive Guide.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
MYSQL Presentation for SQL database connectivity
Network Security Unit 5.pdf for BCA BBA.
20250228 LYD VKU AI Blended-Learning.pptx
Machine learning based COVID-19 study performance prediction
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
The AUB Centre for AI in Media Proposal.docx
Cloud computing and distributed systems.
Dropbox Q2 2025 Financial Results & Investor Presentation
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
KodekX | Application Modernization Development
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11

Geo-spatial Research: Transition from Analysis to Synthesis

  • 1. Geo-spatial Research for Impact: Beyond Analysis  Towards Synthesis Ramesh Raskar Assoc. Professor MIT Media Lab with Nikhil Naik Ilke Demir
  • 2. Poverty, Pop-density, Crop, Methane .. maps
  • 4. Online User Photos: Social voting in the real world Phototourism by Seitz at al
  • 5. Analysis: Perceived Safety Naik, Raskar, Hidalgo 2016
  • 7. Street Address: Assign | Adopt • 75% of the world population without street addresses • Timely ambulance delivery • Stimulate digital economy via eCommerce • Protect property rights of the marginalized • Crisis response (Haiti, 48 hrs to coord aid)
  • 8. Beyond Analysis  Towards Synthesis Actionable Insights Traffic Nudge Crisis Response Street Addr Govt Policies Large Scale Visual Study Visual Census Socio-eco observations Sentiment maps
  • 9. Occasional Frequent Low Quality Census High Quality Satellite, Street view Phone, Mobility, Social, Financial AV, PlanetLabs etc Sense Process Respond
  • 11. Capture Analyze Act Low Level Street-level Measure •Visual Census •Counting/Density Alert •Traffic Interventions Mid Level Aerial Understand •Geolocalization •Crowds •3D Modeling Assist •Crisis Response •Agriculture •Insurance/Hedge Funds High Level Satellite Predict • Economic activity Change •Street Addr •Government planning
  • 12. Capture  Present  Future Low Level Street-level Street View Crowdsourcing (mapillary) Traffic Cameras Autonomous Vehicles Cloud Dashcams Mid Level Aerial Commercial airplanes Amateur Drones Autonomous Drones High Level Satellite Meter-resolution Limited coverage for high-res Sub-meter, real- time, entire Earth (Planet Lab, OneWeb) Ack: Nikhil Naik
  • 13. Analyze Present  Future Low Level Measure •Visual Census •Counting/Density Streetscore, Streetchange, Visual Census (Fei-Fei Li) Generative modeling of cities Mid Level Understand •Geolocalization •Crowds •3D Modeling DeepRoadMap per (Urtasun et al.) Design Suggestions High Level Aggregate and Predict • Economic activity Jean and Ermon Jayachandran Deforestation Study Real-time data analysis. Actionable information
  • 14. Socio-Economic Inference from Digital Patterns Satellite ImageryStreet-level Imagery Aerial Imagery Phone Mobility Data Social Networks, Photos Device Activity Built Environment Human Activity 84 countries, no data Ack: Nikhil Naik
  • 15. 400K Google StreetViews Socio Economic and Physical environment
  • 16. 4k Images, 200K pairwise comparisons Place Pulse 1.0 Naik, Raskar, Hidalgo 2016
  • 17. 1.7K Ranked 400K Predicted
  • 19. New York Boston Detroit Chicago Validated with Crime Stats streetscore.media.mit.edu
  • 20. 105 Kent Avenue, Brooklyn, NY August 2007 Streetscore = 1.8/10
  • 21. 105 Kent Avenue, Brooklyn, NY September 2014 Streetscore = 7.2/10
  • 22. Urban Growth in New York 2007 - 2014
  • 23. Raskar, Camera Culture, MIT Media Lab http://cameraculture.info MIT Media Lab Ilke Demir, Nikhil Naik
  • 24. Capture Analyze Act Low Level Street-level Measure •Visual Census •Counting/Density Alert •Traffic Interventions Mid Level Aerial Understand •Geolocalization •Crowds •3D Modeling Assist •Crisis Response •Agriculture •Insurance/Hedge Funds High Level Satellite Aggregate and Predict • Economic activity Change •Street Addr •Government planning
  • 25. Example: Congestion Pricing (Nikhil Naik et al.) • Jakarta: Response of vehicles by types • CV + traffic cameras across the city to detect types + number of vehicles • Understand traffic flows before/after congestion pricing is introduced • Adjust rates for different types of vehicles/ in different areas
  • 26. Street Address: Assign | Adopt • 75% of the world population without street addresses • Timely ambulance delivery • Stimulate digital economy via eCommerce • Protect property rights of the marginalized • Crisis response (Haiti, 48 hrs to coord aid)
  • 27. Street Addresses from Satellite Imagery İlke Demir, Forest Hughes, Aman Raj, Kaunil Dhruv, Suryanarayana M. Muddala, Sanyam Garg, Barrett Doo, Ramesh Raskar IJCG 2018
  • 28. Thanks Nikhil Naik Tristan Swedish Praneeth Vepakomma Ilke Demir Forest Hughes Jatin Malhotra SuryaNarayana Murthy Kavnil Dhruv Aman Raj Barrett Doo Praveen Gedam Anna Roy Cesar A. Hidalgo Guan Pang Jing Huang Daniel Aliaga Manohar Paluri Pierre Roux Yael Maguire Leo Tsourides Divyaa Ravichandran Sanyam Garg Sai Sri Sathya Grace Kermani Tobias Tiecke Andreas Gros Santanu Bhattacharya Kabir Rustogi Will Marshall
  • 29. Pilot 1: Will people use street names? In use After 6 monthsCommunity assigned names + Signs
  • 30. Pilot 2 Will SMEs use the labeling scheme? Measured relative efficiency for Pizza delivery
  • 31. Pilot 3 Suitable for E-commerce work-flow?
  • 32. What3words parrot.casino.failed Robocodes 75D. Road NE27. Dhule.MhIn Landmarks Green Park Poor Coverage Lat/lon Goal: Memorable Geocoding Physical 1050 Market St, Fresno, CA, US .
  • 33. What3words: A: parrot.casino.failed B: issuer.lollipop.ripe - Irrelevant words based on lat/lon. Robocodes: 75D.NE27.Dhule.MhIn 76C.NE27.Dhule.MhIn - Hierarchical and linear addresses. Google Maps: Near Green Park Near Green Park - No street names or numbers Point vs Edges for Geometric Queries
  • 34. Addressing Schemes Around the World London postal code system: Radial regions based on orientation and distance South Korea streets: Meter markers Japan block system: Hard to decipher Dubai addressing: Uses districts Berlin numbering: Zigzag house pattern
  • 35. Robocode Scheme • 5 alphanumeric fields • Hierarchical and linear descriptors • To close the gap between physical addresses and automated geocoding Road naming scheme: - distance from the center - orientation in odd parity i.e. WB14 Region naming scheme: - orientation wrt downtown - distance from downtown i.e. WB House numbering scheme: - meter markers on the road - block letters from the road i.e. 38K WB14 “I7 Hacker Way, Menlo Park, CA, US”
  • 36. Design Choices Linear: similar addresses stored in a linear fashion Hierarchical: top-down structure for spatial encapsulation Compressible: 5x4 max (chars x words) Universal: independent of local language Inquirable: useful for geometric, proximity-based, and type-ahead queries Extendible: dynamically modifiable for new places Robust: flexible for overestimation and noise StructuralDesignParameters forefficientcomputerimplementation Linear: closer addresses are given related names Hierarchical: top-down subdivision of the world Memorable: short and alphanumeric, easily convertible Intuitive: with a sense of direction and distance Topological: consistent with road topology Inclusive: with local names (city, state) Physical: consistent with natural boundaries SemanticDesignParameters foruserfriendliness Machine Needs Human Needs
  • 37. Our Pipeline Satellite Images Predictions Road Segments Clustering Segmentation RegionsRoad IDsMarkers + Blocks Labeling
  • 38. Satellite Images ç • Irregular urban structure • Illumination/weather/country • Different road types
  • 39. Segmentation • Binary road masks • 19K*19K, 0.5m/pixel • SegNet
  • 40. Extracting Road Segments • Orientation based median filtering • Road segments by orientation bucketing
  • 41. NF NH NE Region Creation • Road graph: Node=intersection, edge=road, weight=length • Partition for max inter, min intra connectivity, using normalized min-cut. • 𝑛 𝑚𝑎𝑥 = 𝑐𝑒𝑖𝑙 𝑟𝑜𝑎𝑑𝑠 88
  • 42. Region and Road Naming • Cmax 𝑟𝑜𝑎𝑑𝑠 𝐴 → 𝐶𝐴 (downtown) • Orientation bucketing into N, S, W, E • Trace regions based on distance to CA • Orientation bucketing into major axes • Trace roads based on order
  • 43. Offsetting and Meter Marking • 5 meter marker along the road • Odd/even based on RHR • Distance field of roads: block offset
  • 44. Unmapped Developing Country Regions follow natural boundaries
  • 45. Street Address with Robocodes • From Satellite Imagery to Deployed Street Addresses • Generative address : linear, hierarchical, and intuitive • Human friendly rather than machine friendly
  • 46. Occasional Frequent Low Quality Census High Quality Satellite, Street view Phone, Mobility, Social, Financial AV, PlanetLabs etc Sense Process Respond
  • 47. Capture Analyze  Act Low Level Street-level Measure •Visual Census •Counting/Density Alert •Traffic Interventions Mid Level Aerial Understand •Geolocalization •Crowds •3D Modeling Assist •Crisis Response •Agriculture •Insurance/Hedge Funds High Level Satellite Aggregate and Predict • Economic activity Change •Street Addr •Government planning Pervasive Recording, Incentives, Distributed ownership, Privacy, Full proof authenticity, Equality
  • 49. Act Alert Assist Change Low/Mi d/High Level Alert about the state of people/economy/b uilt environment (e.g., predict crop yield from satellite imagery, predict insurance price from street view) Assist in acting on information by providing suggestions based on data (e.g., design optimal congestion pricing based on detected cars, design crisis response in hurricanes) ?
  • 50. Inaccessible Areas • To extend our format to cover areas that are not accessible by streets, we explored different implementations to cover such areas, which are 26*5 m away from any street. • Geocoding as a function (excluding the version field): f (info, lat, lon) = x.y.z.t • For places with roads, info={road network, city, country} f (R, C) = x.y.city.country • Extreme case: only reliable information is latitude/longitude! 52
  • 51. f(C,lat,lon) = hash(round(lat,3)) + dir(lat) . hash(round(lon,3)) +dir(lon) . C L-A-T-dir.L-O-N-dir.name.area Inaccessible Areas: Blackholes! • Linear hashing: • 26 letters + 10 digits • 100m x 100 m granularity • Last letter is the hemisphere • Range: 359.999, longitude: 7PRZ W • Hierarchical hashing: • Enlarge the grid from to 1 km x 1 km • Using two floating points = three letters • Within each cell, re-hash it to a 36 x 36 grid = one letter • New resolution: 30m, represented by five letters 53 f(C,lat,lon) = hash(round(lat,2)) + hash(lat - round(lat,2)) + dir(lat) . hash(round(lon,2)) + hash(lon - round(lon,2)) + dir(lon) . C LlatLlatHlatDlat .LlonLlonHlonDlon . name . Ocean /Continent /etc
  • 52. Thanks! What next? Today Tomorrow Friday A month • Robocode.info • Join our presentation in CVPR WiCV. Friday 10am • Join us with your new ideas at SIGGRAPH 2018 Maps & Urban Data session. Code: https://github.com/facebookresearch/street-addresses Paper: https://research.fb.com/publications/generative-street- addresses-from-satellite-imagery/
  • 53. Bonus: Blackholes! Main aim: f(<place>)=robocode Base case: <place> = <house, street, city, country> “12C.NA14.PALO.CAUS” No street: <place> = <lat, lon, city, country> “F12.HN3.PALO.CAUS” No city/country: <place> = <lat, lon, other info (ocean, dessert, etc.)> “JK3.3DF.PAC.OCEA”
  • 54. Region Experiments • Experimented with (a) normalized min-cut, (b) Newman- Girvan, (c) modularity based partitioning. • Experimented with image based methods (superpixels, region growing) and the dual of the road graph. • Evaluated with urban rules (geography, population, road distribution)
  • 55. Output Maps and Tools • .osm maps with roads (meter marking and offsetting on the fly) • ID-tool of MapBox for on-demand inverse/forward geocoding • rtree extension for efficient spatial querying • Experimental mobile app for self navigation • 21.7% decrease in arrival time using Robocodes
  • 56. Analyze: Three Types of Outputs 1. Semantic 2. Objective Population Density 76000/sq. mile 3. Qualitative Assign a semantic label to each pixel Label road quality as “Bad” or “Good”

Editor's Notes

  • #4: Various Street View datasets are now available online. And perhaps the most popular one is Google Street View, which has covered more than a hundred countries to date. And interestingly Street View provides researchers with a new way to observe neighborhood.
  • #5: Check Steve Seitz and U of Washington Phototourism Page
  • #8: So why street addresses are important, why we need adequate mapping. Let me ask you, how many of you had a unique address up to your house or flat number, back home? According to geocoding companies, 75% of the world is unmapped and UN says 4 billion people are invisible because of that. This lack of addressing is even worse in disaster zones, for example in Haiti Earthquake, Humanatarian Openstreetmap community started remotely mapping the disaster area in 48 hours, and mapped adequately in 6 months for NGOs and aid agencies to use. But are curves on a plane enough to be a map? How do we refer to places, how do we define locations? Are those maps complete without any labels?
  • #10: Capture, analyze, act
  • #11: Capture, analyze, act
  • #12: Capture, analyze, act
  • #13: Capture, analyze, act
  • #17: Socio Economic and Physical environment,
  • #20: Cities are physical too: Using computer vision to measure the quality and impact of urban appearance N Naik, R Raskar, CA Hidalgo - American Economic Review, 2016 - aeaweb.org Streetscore-predicting the perceived safety of one million streetscapes N Naik, J Philipoom, R Raskar… - Proceedings of the …, 2014 - openaccess.thecvf.com
  • #27: Capture, analyze, act
  • #29: So why street addresses are important, why we need adequate mapping. Let me ask you, how many of you had a unique address up to your house or flat number, back home? According to geocoding companies, 75% of the world is unmapped and UN says 4 billion people are invisible because of that. This lack of addressing is even worse in disaster zones, for example in Haiti Earthquake, Humanatarian Openstreetmap community started remotely mapping the disaster area in 48 hours, and mapped adequately in 6 months for NGOs and aid agencies to use. But are curves on a plane enough to be a map? How do we refer to places, how do we define locations? Are those maps complete without any labels?
  • #30: Welcome to our presentation of Robocodes! Hopefully we’ll jump-start the conference with an impactful presentation to keep you pumped up for the rest of the week. Today I will introduce you our approach for generative street addressing from satellite imagery. This project is developed at Facebook and MIT Media Lab.
  • #32: Will people use street Signs $40 per sign City counsel used approved Naming streets is politically unsavvy 20% of first time deliveries undelivered $300M/year loss because of inadequate addressing Need an addressing system for emergency, businesses, and residents
  • #37: On the bright side, we don’t need to re-invent addresses, as there are many addressing schemes already organically being developed and experimented throughout the history. Within these addressing schemes, London postal code system caught our attention with its linear and hierarchical features. Also other addressing schemes guided us for some design choices that I will discuss next.
  • #38: Our addresses consist of four alphanumeric fields, hierarchically designating larger areas in order. It resembles real world addresses, containing house number, street name, city, state and country. The region naming scheme is inspired from the London scheme in the previous slide. The first letter is dedicated for orientation from the downtown (north, south, west, east) and the second letter cues the distance from the downtown. So the yellow house on the left is in region WB. Then, within a region, the roads are named according to their orientation and order. Finally, the house numbering is based on the distances along and from the road.
  • #39: We followed some design principles for making our system both efficient, and user friendly. I won’t go through all, but for example, the storage should be supporting forward and inverse geoqueries, should be extendible for address changes, should be compressible, etc. On the other hand, the linear and hierarchical properties of the scheme supports user integration, intuitiveness and memorability.
  • #40: Back to our pipeline, we start by extracting road masks from satellite images by deep learning, then we find the individual road segments from those predictions. We create regions from the road graph, and label the parcels by distance fields from the roads.
  • #41: Focusing on developing countries, the urban structure in those satellite images are not easy to detect and organize. Also there are different weather and illumination conditions per country.
  • #42: Our annotaters created binary road masks from satellite images, on zoom level 19 satellite images. Then we train a SegNet model on those images, and we are able to learn road predictions as shown here, with 72.6% precision and 57.2% recall.
  • #43: After we have the predictions, we threshold and thin the roads to find the skeleton. Then we use orientation bucketing to find individual road segments, which creates the base for streets.
  • #44: After we have the road segments we create a road graph where nodes represent intersections and edges represents road weighted by their length. We partition the graph for maximum inter minimum intra connectivity of clusters, using normalized min cut. We also use some urban rules to limit the clusters.
  • #45: After the regions are created, we mark the densest are as the downtown, and bucket other regions based on their orientation with respect to the downtown, in south, north, east, and west directions. After the regions are named, we find the two major axes of the roads within each region, and name the roads in each region based on their orientation and order.
  • #46: Finally, we put meter markers on the roads by pixel distance, and we compute distance fields to create the block offsets for the house numbering. The gradient in the left image shows those offsets
  • #47: As defined in our motivation, we want to locate and connect the invisible 4 billion, right? So we tested our system on unmapped developing countries. The map coverage improved up to 80% in some areas,
  • #48: In conclusion, we presented a robust addressing scheme, a deep learning and graph partitioning approach for street extraction, and ready to deploy maps and tools for easy geoqueries.
  • #49: Capture, analyze, act
  • #50: Capture, analyze, act
  • #52: Capture, analyze, act
  • #57: To partition the road graph we experimented with different algorithms as newman girvan and modularity based partitioning. Since there is no clear definition of regions, our domain experts evaluated the success of region creation by using urban rules as geography, population, and road distribution. We also experimented with partitioning the dual of the road graph and traditional image based approaches as mean shift and super pixels
  • #58: As an output, our system generates .osm files with roads. However since it does not make sense to output addresses for each five by five area, we compute the meter marking and offsetting on the fly. We modified the id-tool of MapBox, which is also the tool that openstreetmap uses, to integrate that last mile computation. We also developed an rtree extension for efficient querying. Lastly, as you can imagine, our addresses needs to be evaluated by real users: so we developed a mobile app to enable self navigation with our robocodes. In our user studies, we observed that using our smart addresses decreased the arrival time of the agents by 21.7 percent.