SlideShare a Scribd company logo
ETA Prediction with Graph Neural Networks in
Google Maps
Austin Derrow-Pinion, Jennifer She, David Wong, Oliver Lange,
Todd Hester, Luis Perez
Archived
2021. 09. 10
Hyunwook Lee
Contents
• Overview
• Preliminaries
• Methods
• Experiment
• Conclusion
2
Overview: What is ETA?
• ETA: Estimated time of arrival  Equivalent to travel time estimation
3
Overview
4
Preliminaries
• Goal: ETA for the supersegment S and segment s in variety of horizon T
 Supersegment: 𝑺 = 𝑆, 𝐸 , 𝑠 ∈ 𝑆 & 𝑒𝑖𝑗 ∈ 𝐸
• Input features
 speed, supersegment travel time
• Real-time information is provided for 17 2-minutes windows
• Historical data is provided for 5 8-minutes windows (average across the past 17 weeks)
 Segment-wise features – including road classifications, length, and priority
• Embeddings for supersegment/segment are given
5
Methods: What is Graph Network?
• φ: update function
• ρ: aggregation function
• u: global attributes
6
Methods: What is Graph Network?
• Well-known GN block configurations
• In this work, they utilize 3 Full GN blocks – each of
them operates as encoder, processor, and decoder
• Simple Spatial Attention, GCNs can be seen as one
kind of (d)
7
Methods: MetaGradients
• First used in Reinforcement Learning
• Set learning rate as hyperparameter
 update both learning rate and model parameter simultaneously
• Symbols
 𝜏: training examples
 𝜃: model parameters
 η: meta-parameter (in this work, learning rate)
8
Methods: Others
• To reduce the variance
• Huber Loss
 More robust for the outlier than MSE
 More sensitive/exact in range(strongly convex)
• Exponential Moving Average(EMA) of Parameters
 𝜃EMA = α 𝜃EMA + (1 - α) 𝜃
 Can utilize more stable parameters
 Note: not applied directly in training
9
Methods: Model Training
• Note: We should train individual model for each h-minutes prediction
• Utilize combination of various loss functions
 Loss functions in supersegment/segment/cumulative-segment-level
 𝐿 = 𝑙𝑠𝑠 + 𝜆𝑠𝑙𝑠 + 𝜆𝑠𝑐𝑙𝑠𝑐
 𝑦 𝑐,𝑗,𝑡+ℎ
(𝑖)
= 𝑗
𝑦𝑗,𝑡+ℎ
𝑖
 𝑓𝑘
(𝑖)
is free flow time
10
Experiment Results
11
Experiment Results: Variance
12
Experiment Results: Analysis
13
Conclusion & Discussion
• Simple GNN w/ other methods can achieve remarkable performance
• Embedding & other costs can be reduced
 Meta-Learning approaches
• Pre-defined supersegment can be helpful for the ETA
 Prediction for N routes(or segments) is much expansive than prediction for M
supersegment – M << N
• Where they utilize updated edge representation?
14
Appendix
• Graph Networks
 https://arxiv.org/pdf/1806.01261.pdf
• MetaGradients
 https://arxiv.org/pdf/1805.09801.pdf
• Author’s Explanation Video
 https://www.youtube.com/watch?v=a6WCZn7kOhk
ETA Prediction with Graph Neural Networks in Google Maps

More Related Content

PPTX
Anomaly Detection
PDF
Principal Component Analysis
PDF
Back propagation using sigmoid & ReLU function
PDF
Gonzalez, rafael,c.digitalimageprocessingusing matlab
PPTX
Lecture 18 simplified memory bound a star algorithm
PPTX
Activation functions and Training Algorithms for Deep Neural network
PDF
Anomaly detection
PPTX
Image enhancement
Anomaly Detection
Principal Component Analysis
Back propagation using sigmoid & ReLU function
Gonzalez, rafael,c.digitalimageprocessingusing matlab
Lecture 18 simplified memory bound a star algorithm
Activation functions and Training Algorithms for Deep Neural network
Anomaly detection
Image enhancement

What's hot (20)

PDF
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
PPTX
Scale Invariant Feature Transform
PPT
Support Vector Machines
PDF
Content based video retrieval system
PPTX
Neuro-fuzzy systems
PDF
Cluster analysis
PPTX
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
PPT
image processing intensity transformation
PDF
Introduction to Few shot learning
PDF
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...
PPTX
Edge detection
PPTX
Lect 02 first portion
PDF
Introduction of slam
PPT
Hebbian Learning
PPTX
Regularization in deep learning
PPTX
Lecture 19 sma star algorithm
PPTX
Region filling
PPTX
LDM_ImageSythesis.pptx
PPTX
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
Restricted Boltzmann Machine | Neural Network Tutorial | Deep Learning Tutori...
Scale Invariant Feature Transform
Support Vector Machines
Content based video retrieval system
Neuro-fuzzy systems
Cluster analysis
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
image processing intensity transformation
Introduction to Few shot learning
“Introduction to Simultaneous Localization and Mapping (SLAM),” a Presentatio...
Edge detection
Lect 02 first portion
Introduction of slam
Hebbian Learning
Regularization in deep learning
Lecture 19 sma star algorithm
Region filling
LDM_ImageSythesis.pptx
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
Ad

Similar to ETA Prediction with Graph Neural Networks in Google Maps (20)

PPTX
[20240527_LabSeminar_Huy]Meta-Graph.pptx
PPTX
[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx
PPTX
[20240703_LabSeminar_Huy]MakeGNNGreatAgain.pptx
PPTX
[20240624_LabSeminar_Huy]Towards Dynamic Spatial-Temporal Graph Learning: A D...
PPTX
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
PPTX
NS-CUK Seminar: S.T.Nguyen, Review on "Do We Really Need Complicated Model Ar...
PPTX
A survey on methods and applications of meta-learning with GNNs
PPTX
[20240715_LabSeminar_Huy]M2G4RTP: A Multi-Level and Multi-Task Graph Model fo...
PDF
Driving Behavior for ADAS and Autonomous Driving X
PPTX
[20240520_LabSeminar_Huy]DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural...
PPTX
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
PPTX
[20240408_LabSeminar_Huy]PivotalSTGNN.pptx
PPTX
250630_JW_labseminar[Does GNN Pretraining Help Molecular].pptx
PPTX
[20240805_LabSeminar_Huy]GPT-ST: Generative Pre-Training of Spatio-Temporal G...
PPTX
[20240710_LabSeminar_Huy]PDFormer: Propagation Delay-Aware Dynamic Long-Range...
PPTX
[20240603_LabSeminar_Huy]TransMOT: Spatial-Temporal Graph Transformer for Mul...
PPTX
[20240701_LabSeminar_Huy]TelTrans: Applying Multi-Type Telecom Data to Transp...
PPTX
Chapter 3.pptx
PPTX
Traffic Prediction from Street Network images.pptx
PDF
More Than Routing: Joint GPS and Route Modeling for Refine Trajectory Represe...
[20240527_LabSeminar_Huy]Meta-Graph.pptx
[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx
[20240703_LabSeminar_Huy]MakeGNNGreatAgain.pptx
[20240624_LabSeminar_Huy]Towards Dynamic Spatial-Temporal Graph Learning: A D...
240513_Thanh_LabSeminar[Learning and Aggregating Lane Graphs for Urban Automa...
NS-CUK Seminar: S.T.Nguyen, Review on "Do We Really Need Complicated Model Ar...
A survey on methods and applications of meta-learning with GNNs
[20240715_LabSeminar_Huy]M2G4RTP: A Multi-Level and Multi-Task Graph Model fo...
Driving Behavior for ADAS and Autonomous Driving X
[20240520_LabSeminar_Huy]DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural...
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
[20240408_LabSeminar_Huy]PivotalSTGNN.pptx
250630_JW_labseminar[Does GNN Pretraining Help Molecular].pptx
[20240805_LabSeminar_Huy]GPT-ST: Generative Pre-Training of Spatio-Temporal G...
[20240710_LabSeminar_Huy]PDFormer: Propagation Delay-Aware Dynamic Long-Range...
[20240603_LabSeminar_Huy]TransMOT: Spatial-Temporal Graph Transformer for Mul...
[20240701_LabSeminar_Huy]TelTrans: Applying Multi-Type Telecom Data to Transp...
Chapter 3.pptx
Traffic Prediction from Street Network images.pptx
More Than Routing: Joint GPS and Route Modeling for Refine Trajectory Represe...
Ad

More from ivaderivader (20)

PPTX
Argument Mining
PPTX
Papers at CHI23
PPTX
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
PPTX
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
PPTX
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
PPTX
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
PPTX
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
PPTX
A Style-Based Generator Architecture for Generative Adversarial Networks
PPTX
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
PPTX
Perception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
PPTX
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
PPTX
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
PPTX
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
PPTX
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
PPTX
Invertible Denoising Network: A Light Solution for Real Noise Removal
PPTX
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
PPTX
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
PPTX
Screen2Vec: Semantic Embedding of GUI Screens and GUI Components
PPTX
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
PPTX
Natural Language to Visualization by Neural Machine Translation
Argument Mining
Papers at CHI23
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
A Style-Based Generator Architecture for Generative Adversarial Networks
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
Perception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
Invertible Denoising Network: A Light Solution for Real Noise Removal
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
Screen2Vec: Semantic Embedding of GUI Screens and GUI Components
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Natural Language to Visualization by Neural Machine Translation

Recently uploaded (20)

PDF
Machine learning based COVID-19 study performance prediction
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
cuic standard and advanced reporting.pdf
PDF
Electronic commerce courselecture one. Pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Spectroscopy.pptx food analysis technology
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Cloud computing and distributed systems.
Machine learning based COVID-19 study performance prediction
Encapsulation_ Review paper, used for researhc scholars
Dropbox Q2 2025 Financial Results & Investor Presentation
cuic standard and advanced reporting.pdf
Electronic commerce courselecture one. Pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
A comparative analysis of optical character recognition models for extracting...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Network Security Unit 5.pdf for BCA BBA.
Programs and apps: productivity, graphics, security and other tools
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
The AUB Centre for AI in Media Proposal.docx
Reach Out and Touch Someone: Haptics and Empathic Computing
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
“AI and Expert System Decision Support & Business Intelligence Systems”
The Rise and Fall of 3GPP – Time for a Sabbatical?
Spectroscopy.pptx food analysis technology
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Cloud computing and distributed systems.

ETA Prediction with Graph Neural Networks in Google Maps

  • 1. ETA Prediction with Graph Neural Networks in Google Maps Austin Derrow-Pinion, Jennifer She, David Wong, Oliver Lange, Todd Hester, Luis Perez Archived 2021. 09. 10 Hyunwook Lee
  • 2. Contents • Overview • Preliminaries • Methods • Experiment • Conclusion
  • 3. 2 Overview: What is ETA? • ETA: Estimated time of arrival  Equivalent to travel time estimation
  • 5. 4 Preliminaries • Goal: ETA for the supersegment S and segment s in variety of horizon T  Supersegment: 𝑺 = 𝑆, 𝐸 , 𝑠 ∈ 𝑆 & 𝑒𝑖𝑗 ∈ 𝐸 • Input features  speed, supersegment travel time • Real-time information is provided for 17 2-minutes windows • Historical data is provided for 5 8-minutes windows (average across the past 17 weeks)  Segment-wise features – including road classifications, length, and priority • Embeddings for supersegment/segment are given
  • 6. 5 Methods: What is Graph Network? • φ: update function • ρ: aggregation function • u: global attributes
  • 7. 6 Methods: What is Graph Network? • Well-known GN block configurations • In this work, they utilize 3 Full GN blocks – each of them operates as encoder, processor, and decoder • Simple Spatial Attention, GCNs can be seen as one kind of (d)
  • 8. 7 Methods: MetaGradients • First used in Reinforcement Learning • Set learning rate as hyperparameter  update both learning rate and model parameter simultaneously • Symbols  𝜏: training examples  𝜃: model parameters  η: meta-parameter (in this work, learning rate)
  • 9. 8 Methods: Others • To reduce the variance • Huber Loss  More robust for the outlier than MSE  More sensitive/exact in range(strongly convex) • Exponential Moving Average(EMA) of Parameters  𝜃EMA = α 𝜃EMA + (1 - α) 𝜃  Can utilize more stable parameters  Note: not applied directly in training
  • 10. 9 Methods: Model Training • Note: We should train individual model for each h-minutes prediction • Utilize combination of various loss functions  Loss functions in supersegment/segment/cumulative-segment-level  𝐿 = 𝑙𝑠𝑠 + 𝜆𝑠𝑙𝑠 + 𝜆𝑠𝑐𝑙𝑠𝑐  𝑦 𝑐,𝑗,𝑡+ℎ (𝑖) = 𝑗 𝑦𝑗,𝑡+ℎ 𝑖  𝑓𝑘 (𝑖) is free flow time
  • 14. 13 Conclusion & Discussion • Simple GNN w/ other methods can achieve remarkable performance • Embedding & other costs can be reduced  Meta-Learning approaches • Pre-defined supersegment can be helpful for the ETA  Prediction for N routes(or segments) is much expansive than prediction for M supersegment – M << N • Where they utilize updated edge representation?
  • 15. 14 Appendix • Graph Networks  https://arxiv.org/pdf/1806.01261.pdf • MetaGradients  https://arxiv.org/pdf/1805.09801.pdf • Author’s Explanation Video  https://www.youtube.com/watch?v=a6WCZn7kOhk

Editor's Notes

  • #3: 크게 내용이 많지는 않음. 어려운 내용도 없기때문에 방법론 위주의 설명을 진행할 것임 이 논문에 크게 특이한 점은 없지만, 여러가지 방법론을 통해 학습을 stabilize했음  이러한 방법론을 알아두시면 좋을 것.
  • #4: 3, 45
  • #5: Strong benefits in Google-Map application 기본적인 간단한 모델인 GNN에 추가적인 학습 방법론(MetaGradients, EMA)을 통한 성능 향상
  • #6: Note: supersegment는 route 전체를 의미하는 것이 아니라, Typical route를 따라 연결된 도로구간입니다.
  • #7: 파이는 단순 MLP로 이루어지는 경우가 많고, 로는 summation, min/max 등 여러 aggregation 방법론이 있다. Note: Graph-XX라는 식의 거의 모든 모델들은 사실 Graph-structure를 외부에서/prior로써 받아서 다룬다
  • #10: Huber loss  에러가 큰 outlier에 대해서는 constant update & 에러가 작은 case에 대해서는 squared update  에러가 클 때 MSE의 단점을 커버, 에러가 작을 떄에 MSE의 장점을 가짐
  • #13: 이 외에도 aggregation functio에 대한 실험, Unsupervised auxiliary loss function을 이용했을 때에 대한 실험 등이 있지만 스킵.
  • #16: 3, 45