SlideShare a Scribd company logo
■ Low-quality observation prediction by world model W
given current observation yt
and action ut
■ Fine-tuning of
foundation models
■ Additional dynamical
states like speed and
angular velocity
■ Apply multimodal
foundation models
■ Case studies on
physical robots
■ Unknown states and dynamical models
Preprint: Mao, Z.; Dai, S.; Geng, Y and
Ruchkin, I. "Zero-shot Safety Prediction for
Autonomous Robots with Foundation
World Models" arXiv (2024).
Zhenjiang Mao, Siqi Dai, Yuang Geng, Ivan Ruchkin
Trustworthy Engineered Autonomy (TEA) Lab
Department of Electrical and Computer Engineering, University of Florida
Zero-shot Safety Prediction for Autonomous Robots
with Foundation World Models
APPROACH
FUTURE WORK
RESULTS: LUNAR LANDER
CHALLENGES & CONTRIBUTIONS
PROBLEM
REFERENCES
─ yt+1
= W (yt
,ut
) ≠ yt+1
^
■ How to predict/evaluate the observation at object level?
─ MSE( yt+1
,yt+1
)
^
Challenges:
Contributions:
■ A training-free world model combines foundation models
and overcomes distribution shift in existing world models
■ A segmentation-based metric to measure the learned
surrogate dynamics by comparing object-level error
■ An interpretable latent representation that can evaluate
safety directly and improve safety predictions
■ Need to train specialized safety evaluator NN: y→ {T, F}
■ Out-of-distribution (OOD) shift in predictions
World models: Ha, D.; and Schmidhuber,
J. 2018. Recurrent World Models
Facilitate Policy Evolution. In Advances in
Neural Information Processing Systems,
volume 31.
Safety prediction: Mao, Z.; Sobolewski, C.;
and Ruchkin, I. "How Safe Am I Given
What I See? Calibrated Prediction of
Safety Chances for Image-Controlled
Autonomy." In Learning for Dynamics and
Control Conference 2024.
Image Rebuilding:
Move the objects from previous position wt
into the
predicted position wt+1
Coarse evaluation:
Object duplication
Object loss
Ground truth
Ground truth
─ Object Centroid Distance:
CD = ||( w i
t+1
)C
-( w i
t+1
)C
||
Fine-grained evaluation:
Observation: y Segmented
objects: w 1
-w 4
^
F1 score and False Positive Rate (FPR) of safety prediction
SSIM and MSE of predicted observation
10 20 30 40 50 60
Prediction Horizon
10 20 30 40 50 60
Prediction Horizon
10 20 30 40 50 60
Prediction Horizon
10 20 30 40 50 60
Prediction Horizon
10 20 30 40 50 60
Prediction Horizon
10 20 30 40 50 60
Prediction Horizon
1.0
0.8
0.6
0.4
0.2
0.0
Standard World Model Segmentation + Supervised World Model Foundation World Model
VAE + MD-LSTM SAM + LSTM SAM + GPT3.5
SAM + Gemma
Dashed lines indicate the use of supervised training data.
1.00
0.95
0.90
0.85
0.80
0.75
0.70
0.200
0.175
0.150
0.125
0.100
0.075
0.050
0.025
0.000
5
4
3
2
1
0
7
6
5
4
3
2
1
0
Before After Before After
^
1.0
0.8
0.6
0.4
0.2
0.0
Horizontal and vertical position error

More Related Content

PDF
How Safe Will I Be Given What I See? Calibrated Visual Safety Chance Predict...
PDF
Poster: How Safe Am I Given What I See? Calibrated Prediction of Safety Chanc...
PDF
Poster: How Safe Am I Given What I See? Calibrated Prediction of Safety Chanc...
PPTX
Object Detection and Tracking using Statistical and Stochastic Techniques
PDF
Overcoming Heterogeneity in Autonomous Cyber-Physical Systems
PPTX
Wang midterm-defence
PDF
Verify-then-Monitor: Calibration Guarantees for Safety Confidence
PPTX
Learning Object Detectors From Weakly Supervised Image Data
How Safe Will I Be Given What I See? Calibrated Visual Safety Chance Predict...
Poster: How Safe Am I Given What I See? Calibrated Prediction of Safety Chanc...
Poster: How Safe Am I Given What I See? Calibrated Prediction of Safety Chanc...
Object Detection and Tracking using Statistical and Stochastic Techniques
Overcoming Heterogeneity in Autonomous Cyber-Physical Systems
Wang midterm-defence
Verify-then-Monitor: Calibration Guarantees for Safety Confidence
Learning Object Detectors From Weakly Supervised Image Data

Similar to ​Poster: Zero-shot Safety Prediction for Autonomous Robots with Foundation World Models (20)

PPTX
Introduction to object detection
PPTX
Research proposal 610
PPTX
Yolos you only look one sequence
PPTX
Crystallization classification semisupervised
PPTX
Resilience Metrics and Approaches for Quantification, Igor LINKOV
PPTX
JATINSAHU_21CE02010_PPT_SPRINGSEMESTER_1.pptx
PDF
Presentation European Actuarial Journal conference 2016
PDF
Presentation by CZH
PPTX
Computer Vision and GenAI for Geoscientists.pptx
PPTX
Computer Vision and GenAI for Geoscientists.pptx
PPTX
22PCOAM11 Session 22 Acting under uncertainty.pptx
PDF
“Structures as Sensors: Smaller-Data Learning in the Physical World,” a Prese...
PDF
Not Enough Measurements, Too Many Measurements
PDF
ICIF19_Garg_job_talk_portfolio_modification.pdf
PDF
A NOVEL BACKGROUND SUBTRACTION ALGORITHM FOR PERSON TRACKING BASED ON K-NN
PDF
A Novel Background Subtraction Algorithm for Person Tracking Based on K-NN
PPTX
SEMINAR_3 powerpoint presentation, University of Buea, FET, Sample for superv...
PDF
Integration of poses to enhance the shape of the object tracking from a singl...
PPTX
VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...
PDF
Masking preprocessing in transfer learning for damage building detection
Introduction to object detection
Research proposal 610
Yolos you only look one sequence
Crystallization classification semisupervised
Resilience Metrics and Approaches for Quantification, Igor LINKOV
JATINSAHU_21CE02010_PPT_SPRINGSEMESTER_1.pptx
Presentation European Actuarial Journal conference 2016
Presentation by CZH
Computer Vision and GenAI for Geoscientists.pptx
Computer Vision and GenAI for Geoscientists.pptx
22PCOAM11 Session 22 Acting under uncertainty.pptx
“Structures as Sensors: Smaller-Data Learning in the Physical World,” a Prese...
Not Enough Measurements, Too Many Measurements
ICIF19_Garg_job_talk_portfolio_modification.pdf
A NOVEL BACKGROUND SUBTRACTION ALGORITHM FOR PERSON TRACKING BASED ON K-NN
A Novel Background Subtraction Algorithm for Person Tracking Based on K-NN
SEMINAR_3 powerpoint presentation, University of Buea, FET, Sample for superv...
Integration of poses to enhance the shape of the object tracking from a singl...
VERIFICATION_&_VALIDATION_OF_A_SEMANTIC_IMAGE_TAGGING_FRAMEWORK_VIA_GENERATIO...
Masking preprocessing in transfer learning for damage building detection
Ad

More from Ivan Ruchkin (20)

PDF
Neural Approximation of Vision-Controlled Systems for Reachability Analysis
PDF
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
PPTX
NeuroStrata: Harnessing Neuro-Symbolic Paradigms for Improved Testability and...
PDF
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification o...
PDF
Four Principles for Physically Interpretable World Models
PDF
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification o...
PDF
Four Principles for Physically Interpretable World Models (poster)
PDF
Accelerating Neural Policy Repair with Preservation via Stability-Plasticity ...
PDF
Distributionally Robust Statistical Verification with Imprecise Neural Networks
PDF
Autonomous Drift Detection and Online Road Friction Estimation
PDF
Neuro-Symbolic Bridge: From Perception to Estimation & Control
PDF
Towards Physically Interpretable World Models: Meaningful Weakly Supervised R...
PDF
Bridging Dimensions: Confident Reachability for High-Dimensional Controllers...
PDF
Poster: Bridging Dimensions: Confident Reachability for High-Dimensional Cont...
PDF
Bridging Dimensions: Confident Reachability for High-Dimensional Controllers
PDF
Language-Enhanced Latent Representations for Out-of-Distribution Detection in...
PDF
Curating Naturally Adversarial Datasets for Learning-Enabled Medical Cyber-Ph...
PDF
Repairing Learning-Enabled Controllers While Preserving What Works
PDF
Poster: Conservative Safety Monitors of Stochastic Dynamical Systems
PDF
Causal Repair of Learning-Enabled Cyber-physical Systems
Neural Approximation of Vision-Controlled Systems for Reachability Analysis
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
NeuroStrata: Harnessing Neuro-Symbolic Paradigms for Improved Testability and...
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification o...
Four Principles for Physically Interpretable World Models
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification o...
Four Principles for Physically Interpretable World Models (poster)
Accelerating Neural Policy Repair with Preservation via Stability-Plasticity ...
Distributionally Robust Statistical Verification with Imprecise Neural Networks
Autonomous Drift Detection and Online Road Friction Estimation
Neuro-Symbolic Bridge: From Perception to Estimation & Control
Towards Physically Interpretable World Models: Meaningful Weakly Supervised R...
Bridging Dimensions: Confident Reachability for High-Dimensional Controllers...
Poster: Bridging Dimensions: Confident Reachability for High-Dimensional Cont...
Bridging Dimensions: Confident Reachability for High-Dimensional Controllers
Language-Enhanced Latent Representations for Out-of-Distribution Detection in...
Curating Naturally Adversarial Datasets for Learning-Enabled Medical Cyber-Ph...
Repairing Learning-Enabled Controllers While Preserving What Works
Poster: Conservative Safety Monitors of Stochastic Dynamical Systems
Causal Repair of Learning-Enabled Cyber-physical Systems
Ad

Recently uploaded (20)

PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPT
Teaching material agriculture food technology
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Approach and Philosophy of On baking technology
PPTX
Big Data Technologies - Introduction.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Cloud computing and distributed systems.
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
A comparative analysis of optical character recognition models for extracting...
MIND Revenue Release Quarter 2 2025 Press Release
Teaching material agriculture food technology
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
NewMind AI Weekly Chronicles - August'25-Week II
Approach and Philosophy of On baking technology
Big Data Technologies - Introduction.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
The AUB Centre for AI in Media Proposal.docx
MYSQL Presentation for SQL database connectivity
Cloud computing and distributed systems.
Programs and apps: productivity, graphics, security and other tools
Mobile App Security Testing_ A Comprehensive Guide.pdf
Chapter 3 Spatial Domain Image Processing.pdf
A comparative analysis of optical character recognition models for extracting...

​Poster: Zero-shot Safety Prediction for Autonomous Robots with Foundation World Models

  • 1. ■ Low-quality observation prediction by world model W given current observation yt and action ut ■ Fine-tuning of foundation models ■ Additional dynamical states like speed and angular velocity ■ Apply multimodal foundation models ■ Case studies on physical robots ■ Unknown states and dynamical models Preprint: Mao, Z.; Dai, S.; Geng, Y and Ruchkin, I. "Zero-shot Safety Prediction for Autonomous Robots with Foundation World Models" arXiv (2024). Zhenjiang Mao, Siqi Dai, Yuang Geng, Ivan Ruchkin Trustworthy Engineered Autonomy (TEA) Lab Department of Electrical and Computer Engineering, University of Florida Zero-shot Safety Prediction for Autonomous Robots with Foundation World Models APPROACH FUTURE WORK RESULTS: LUNAR LANDER CHALLENGES & CONTRIBUTIONS PROBLEM REFERENCES ─ yt+1 = W (yt ,ut ) ≠ yt+1 ^ ■ How to predict/evaluate the observation at object level? ─ MSE( yt+1 ,yt+1 ) ^ Challenges: Contributions: ■ A training-free world model combines foundation models and overcomes distribution shift in existing world models ■ A segmentation-based metric to measure the learned surrogate dynamics by comparing object-level error ■ An interpretable latent representation that can evaluate safety directly and improve safety predictions ■ Need to train specialized safety evaluator NN: y→ {T, F} ■ Out-of-distribution (OOD) shift in predictions World models: Ha, D.; and Schmidhuber, J. 2018. Recurrent World Models Facilitate Policy Evolution. In Advances in Neural Information Processing Systems, volume 31. Safety prediction: Mao, Z.; Sobolewski, C.; and Ruchkin, I. "How Safe Am I Given What I See? Calibrated Prediction of Safety Chances for Image-Controlled Autonomy." In Learning for Dynamics and Control Conference 2024. Image Rebuilding: Move the objects from previous position wt into the predicted position wt+1 Coarse evaluation: Object duplication Object loss Ground truth Ground truth ─ Object Centroid Distance: CD = ||( w i t+1 )C -( w i t+1 )C || Fine-grained evaluation: Observation: y Segmented objects: w 1 -w 4 ^ F1 score and False Positive Rate (FPR) of safety prediction SSIM and MSE of predicted observation 10 20 30 40 50 60 Prediction Horizon 10 20 30 40 50 60 Prediction Horizon 10 20 30 40 50 60 Prediction Horizon 10 20 30 40 50 60 Prediction Horizon 10 20 30 40 50 60 Prediction Horizon 10 20 30 40 50 60 Prediction Horizon 1.0 0.8 0.6 0.4 0.2 0.0 Standard World Model Segmentation + Supervised World Model Foundation World Model VAE + MD-LSTM SAM + LSTM SAM + GPT3.5 SAM + Gemma Dashed lines indicate the use of supervised training data. 1.00 0.95 0.90 0.85 0.80 0.75 0.70 0.200 0.175 0.150 0.125 0.100 0.075 0.050 0.025 0.000 5 4 3 2 1 0 7 6 5 4 3 2 1 0 Before After Before After ^ 1.0 0.8 0.6 0.4 0.2 0.0 Horizontal and vertical position error