SlideShare a Scribd company logo
Nguyen Thanh Sang
Network Science Lab
Dept. of Artificial Intelligence
The Catholic University of Korea
E-mail: sang.ngt99@gmail.com
2023-03-24
1
 Paper
 Introduction
 Problem
 Contributions
 Framework
 Experiment
 Conclusion
2
Autonomous driving cars
• Predicting future trajectories of surrounding obstacles
is a crucial task for autonomous driving cars to achieve
a high degree of road safety.
3
Problems
 Challenges in trajectory prediction in
real-world traffic scenarios:
• Considering surrounding traffic
environments.
• Dealing with social interactions.
• Handling traffic of multi-class
movement,
• Predicting multi-modal
trajectories with probability.
• Probability awareness.
4
Contributions
• Multi-modal predictions with considering traffic environments, dealing with social interactions,
and predicting multi-class movement patterns with probability values, simultaneously.
• Dynamic Graph Attention Network (DGAN), Dynamic Attention Zone and GAT are combined
to model the intention and habit of human driving in heterogeneous traffic scenarios.
• To capture complex social interactions among road agents, the authors combine a semantic
HD map, observed trajectories of road agents, and the current status of the traffic.
5
Dynamic Attention Zone and Graph Modelling
• A dynamic attention zone is designed to capture
the normal ability of people when interacting with
others in traffic.
• Human choose which surrounding moving agents:
distances, headings, velocities, and sizes.
 an attention circle
6
Overview
7
Feature Extraction
• Semantic Map: semantic HD map contains valuable traffic rule
information.
• The middle-layer output estimated by the CNN is extracted
as the visual feature:
• Observed Trajectory: An LSTM is used to extract joint features
from the observed trajectories of all involved agents.
• Traffic state: S is very important for capturing extra information
to predict the future trajectories:
• The final embedding feature:
8
Graph Attention Network
• GAT: multiple stacked graph attention layers.
• Hierarchical classification to calculate the probabilities belonging to class c and
anchor 𝑘𝑐:
9
Multi-modal Trajectory Prediction
• Predicting multiple possible future trajectories with corresponding probability
using pre-defined anchor trajectories.
• The final loss consists of anchor classification loss and trajectory offset loss:
• represents the single-mode loss L of the 𝑖𝑡ℎ agent’s 𝑘𝑐 anchor, where:
• Hierarchical classification loss:
10
Logistic Delivery Dataset
• Left: Logistic delivery dataset example,
consisting of three-dimensional cloud points
with manually labeled information, front
camera image, and semantic map.
• Middle: observed in dashed yellow and
future ground truth trajectories in red.
• Right: Prediction results using our proposed
DGAN method showing up the two most
likely future trajectories, and corresponding
probabilities encoded in a color map to the
right.
 handling complex situations at traffic
intersections.
 the predicted trajectory with the maximum
probability value is more likely to follow
center lines of lanes guiding by the semantic
map.
11
Stanford Drone Dataset
• The model achieves state-of-the-art performance.
12
ETH and UCY Datasets
• Datasets for pedestrian trajectory prediction only, include 5 scenes in total, including ETH, HOTEL,
ZARA1, ZARA2, and UNIV.
13
Conclusions
• A dynamic social interaction-aware model that predicts the future trajectories of agents in
real-world settings to solve several challenges.
• An encoded semantic map, the observed history trajectories, and the current status of
agents as the input of the GAT.
• To generate the graph at the current time step, we use the dynamic attention zone to
simulate the intuitive ability of people to navigate roads in real-world traffic.
• The experiment results demonstrate the potential ability of the proposed method for
trajectory prediction in a real-world setting and achieves good performance.
14

More Related Content

PDF
Driving Behavior for ADAS and Autonomous Driving X
PDF
Pedestrian behavior/intention modeling for autonomous driving IV
PDF
Pedestrian behavior/intention modeling for autonomous driving V
PDF
Pedestrian behavior/intention modeling for autonomous driving II
PPTX
Project group 12 ppt.pptx Project group 12 ppt.pptx
PPTX
Application of a Markov chain traffic model to the Greater Philadelphia Region
PDF
Driving Behavior for ADAS and Autonomous Driving VIII
PDF
Pedestrian behavior/intention modeling for autonomous driving III
Driving Behavior for ADAS and Autonomous Driving X
Pedestrian behavior/intention modeling for autonomous driving IV
Pedestrian behavior/intention modeling for autonomous driving V
Pedestrian behavior/intention modeling for autonomous driving II
Project group 12 ppt.pptx Project group 12 ppt.pptx
Application of a Markov chain traffic model to the Greater Philadelphia Region
Driving Behavior for ADAS and Autonomous Driving VIII
Pedestrian behavior/intention modeling for autonomous driving III

Similar to NS-CUK Seminar: S.T.Nguyen, Review on "Multi-modal Trajectory Prediction for Autonomous Driving with Semantic Map and Dynamic Graph Attention Network", NIPS 2020 (20)

PDF
Driving behaviors for adas and autonomous driving XII
PPTX
TRAFFIC MANAGEMENT THROUGH SATELLITE IMAGING-- Part 3
PDF
IRJET - A Review on Pedestrian Behavior Prediction for Intelligent Transport ...
PPTX
[2024107_LabSeminar_Huy]MFTraj: Map-Free, Behavior-Driven Trajectory Predicti...
PDF
Driving Behavior for ADAS and Autonomous Driving III
PPTX
TRAFFIC MANAGEMENT THROUGH SATELLITE IMAGING -- Part 1
PPTX
Traffic assignment of motorized private transport in OmniTRANS transport plan...
PDF
Driving behaviors for adas and autonomous driving XI
PPTX
[20240712_LabSeminar_Huy]Spatio-Temporal Neural Structural Causal Models for ...
PDF
Learning to Recognize Distance to Stop Signs Using the Virtual World of Grand...
PDF
mathematics-10-01599-v2.pdf
PPTX
[NS][Lab_Seminar_240909]Sparse Multi-Relational Graph Convolutional Network f...
PDF
Adaptive traffic lights based on traffic flow prediction using machine learni...
PDF
IRJET- Road Recognition from Remote Sensing Imagery using Machine Learning
PPTX
Project Phase 2 ppt.pptx
PPTX
[20240710_LabSeminar_Huy]PDFormer: Propagation Delay-Aware Dynamic Long-Range...
PDF
2006.11583.pdf
PPTX
Smart Mobility
PPTX
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
PDF
Pedestrian Behavior/Intention Modeling for Autonomous Driving VI
Driving behaviors for adas and autonomous driving XII
TRAFFIC MANAGEMENT THROUGH SATELLITE IMAGING-- Part 3
IRJET - A Review on Pedestrian Behavior Prediction for Intelligent Transport ...
[2024107_LabSeminar_Huy]MFTraj: Map-Free, Behavior-Driven Trajectory Predicti...
Driving Behavior for ADAS and Autonomous Driving III
TRAFFIC MANAGEMENT THROUGH SATELLITE IMAGING -- Part 1
Traffic assignment of motorized private transport in OmniTRANS transport plan...
Driving behaviors for adas and autonomous driving XI
[20240712_LabSeminar_Huy]Spatio-Temporal Neural Structural Causal Models for ...
Learning to Recognize Distance to Stop Signs Using the Virtual World of Grand...
mathematics-10-01599-v2.pdf
[NS][Lab_Seminar_240909]Sparse Multi-Relational Graph Convolutional Network f...
Adaptive traffic lights based on traffic flow prediction using machine learni...
IRJET- Road Recognition from Remote Sensing Imagery using Machine Learning
Project Phase 2 ppt.pptx
[20240710_LabSeminar_Huy]PDFormer: Propagation Delay-Aware Dynamic Long-Range...
2006.11583.pdf
Smart Mobility
[20240318_LabSeminar_Huy]GSTNet: Global Spatial-Temporal Network for Traffic ...
Pedestrian Behavior/Intention Modeling for Autonomous Driving VI
Ad

More from ssuser4b1f48 (20)

PPTX
NS-CUK Seminar: V.T.Hoang, Review on "GOAT: A Global Transformer on Large-sca...
PPTX
NS-CUK Seminar: J.H.Lee, Review on "Graph Propagation Transformer for Graph R...
PPTX
NS-CUK Seminar: H.B.Kim, Review on "Cluster-GCN: An Efficient Algorithm for ...
PPTX
NS-CUK Seminar: H.E.Lee, Review on "Weisfeiler and Leman Go Neural: Higher-O...
PPTX
NS-CUK Seminar:V.T.Hoang, Review on "GRPE: Relative Positional Encoding for G...
PPTX
NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...
PDF
Aug 22nd, 2023: Case Studies - The Art and Science of Animation Production)
PDF
Aug 17th, 2023: Case Studies - Examining Gamification through Virtual/Augment...
PDF
Aug 10th, 2023: Case Studies - The Power of eXtended Reality (XR) with 360°
PDF
Aug 8th, 2023: Case Studies - Utilizing eXtended Reality (XR) in Drones)
PPTX
NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...
PPTX
NS-CUK Seminar: H.E.Lee, Review on "Gated Graph Sequence Neural Networks", I...
PPTX
NS-CUK Seminar:V.T.Hoang, Review on "Augmentation-Free Self-Supervised Learni...
PPTX
NS-CUK Journal club: H.E.Lee, Review on " A biomedical knowledge graph-based ...
PPTX
NS-CUK Seminar: H.E.Lee, Review on "PTE: Predictive Text Embedding through L...
PPTX
NS-CUK Seminar: H.B.Kim, Review on "Inductive Representation Learning on Lar...
PPTX
NS-CUK Seminar: H.E.Lee, Review on "PTE: Predictive Text Embedding through L...
PPTX
NS-CUK Seminar: J.H.Lee, Review on "Relational Self-Supervised Learning on Gr...
PPTX
NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...
PPTX
NS-CUK Seminar: H.E.Lee, Review on "Graph Star Net for Generalized Multi-Tas...
NS-CUK Seminar: V.T.Hoang, Review on "GOAT: A Global Transformer on Large-sca...
NS-CUK Seminar: J.H.Lee, Review on "Graph Propagation Transformer for Graph R...
NS-CUK Seminar: H.B.Kim, Review on "Cluster-GCN: An Efficient Algorithm for ...
NS-CUK Seminar: H.E.Lee, Review on "Weisfeiler and Leman Go Neural: Higher-O...
NS-CUK Seminar:V.T.Hoang, Review on "GRPE: Relative Positional Encoding for G...
NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...
Aug 22nd, 2023: Case Studies - The Art and Science of Animation Production)
Aug 17th, 2023: Case Studies - Examining Gamification through Virtual/Augment...
Aug 10th, 2023: Case Studies - The Power of eXtended Reality (XR) with 360°
Aug 8th, 2023: Case Studies - Utilizing eXtended Reality (XR) in Drones)
NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...
NS-CUK Seminar: H.E.Lee, Review on "Gated Graph Sequence Neural Networks", I...
NS-CUK Seminar:V.T.Hoang, Review on "Augmentation-Free Self-Supervised Learni...
NS-CUK Journal club: H.E.Lee, Review on " A biomedical knowledge graph-based ...
NS-CUK Seminar: H.E.Lee, Review on "PTE: Predictive Text Embedding through L...
NS-CUK Seminar: H.B.Kim, Review on "Inductive Representation Learning on Lar...
NS-CUK Seminar: H.E.Lee, Review on "PTE: Predictive Text Embedding through L...
NS-CUK Seminar: J.H.Lee, Review on "Relational Self-Supervised Learning on Gr...
NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...
NS-CUK Seminar: H.E.Lee, Review on "Graph Star Net for Generalized Multi-Tas...
Ad

Recently uploaded (20)

PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Machine learning based COVID-19 study performance prediction
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Approach and Philosophy of On baking technology
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
Chapter 3 Spatial Domain Image Processing.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Dropbox Q2 2025 Financial Results & Investor Presentation
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Machine learning based COVID-19 study performance prediction
Building Integrated photovoltaic BIPV_UPV.pdf
Encapsulation theory and applications.pdf
Review of recent advances in non-invasive hemoglobin estimation
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
MYSQL Presentation for SQL database connectivity
Digital-Transformation-Roadmap-for-Companies.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
20250228 LYD VKU AI Blended-Learning.pptx
Understanding_Digital_Forensics_Presentation.pptx
Approach and Philosophy of On baking technology
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
The AUB Centre for AI in Media Proposal.docx
Mobile App Security Testing_ A Comprehensive Guide.pdf

NS-CUK Seminar: S.T.Nguyen, Review on "Multi-modal Trajectory Prediction for Autonomous Driving with Semantic Map and Dynamic Graph Attention Network", NIPS 2020

  • 1. Nguyen Thanh Sang Network Science Lab Dept. of Artificial Intelligence The Catholic University of Korea E-mail: sang.ngt99@gmail.com 2023-03-24
  • 2. 1  Paper  Introduction  Problem  Contributions  Framework  Experiment  Conclusion
  • 3. 2 Autonomous driving cars • Predicting future trajectories of surrounding obstacles is a crucial task for autonomous driving cars to achieve a high degree of road safety.
  • 4. 3 Problems  Challenges in trajectory prediction in real-world traffic scenarios: • Considering surrounding traffic environments. • Dealing with social interactions. • Handling traffic of multi-class movement, • Predicting multi-modal trajectories with probability. • Probability awareness.
  • 5. 4 Contributions • Multi-modal predictions with considering traffic environments, dealing with social interactions, and predicting multi-class movement patterns with probability values, simultaneously. • Dynamic Graph Attention Network (DGAN), Dynamic Attention Zone and GAT are combined to model the intention and habit of human driving in heterogeneous traffic scenarios. • To capture complex social interactions among road agents, the authors combine a semantic HD map, observed trajectories of road agents, and the current status of the traffic.
  • 6. 5 Dynamic Attention Zone and Graph Modelling • A dynamic attention zone is designed to capture the normal ability of people when interacting with others in traffic. • Human choose which surrounding moving agents: distances, headings, velocities, and sizes.  an attention circle
  • 8. 7 Feature Extraction • Semantic Map: semantic HD map contains valuable traffic rule information. • The middle-layer output estimated by the CNN is extracted as the visual feature: • Observed Trajectory: An LSTM is used to extract joint features from the observed trajectories of all involved agents. • Traffic state: S is very important for capturing extra information to predict the future trajectories: • The final embedding feature:
  • 9. 8 Graph Attention Network • GAT: multiple stacked graph attention layers. • Hierarchical classification to calculate the probabilities belonging to class c and anchor 𝑘𝑐:
  • 10. 9 Multi-modal Trajectory Prediction • Predicting multiple possible future trajectories with corresponding probability using pre-defined anchor trajectories. • The final loss consists of anchor classification loss and trajectory offset loss: • represents the single-mode loss L of the 𝑖𝑡ℎ agent’s 𝑘𝑐 anchor, where: • Hierarchical classification loss:
  • 11. 10 Logistic Delivery Dataset • Left: Logistic delivery dataset example, consisting of three-dimensional cloud points with manually labeled information, front camera image, and semantic map. • Middle: observed in dashed yellow and future ground truth trajectories in red. • Right: Prediction results using our proposed DGAN method showing up the two most likely future trajectories, and corresponding probabilities encoded in a color map to the right.  handling complex situations at traffic intersections.  the predicted trajectory with the maximum probability value is more likely to follow center lines of lanes guiding by the semantic map.
  • 12. 11 Stanford Drone Dataset • The model achieves state-of-the-art performance.
  • 13. 12 ETH and UCY Datasets • Datasets for pedestrian trajectory prediction only, include 5 scenes in total, including ETH, HOTEL, ZARA1, ZARA2, and UNIV.
  • 14. 13 Conclusions • A dynamic social interaction-aware model that predicts the future trajectories of agents in real-world settings to solve several challenges. • An encoded semantic map, the observed history trajectories, and the current status of agents as the input of the GAT. • To generate the graph at the current time step, we use the dynamic attention zone to simulate the intuitive ability of people to navigate roads in real-world traffic. • The experiment results demonstrate the potential ability of the proposed method for trajectory prediction in a real-world setting and achieves good performance.
  • 15. 14