SlideShare a Scribd company logo
Zhenjiang Mao, Carson Sobolewski, Ivan Ruchkin
Trustworthy Engineered Autonomy (TEA) Lab
Department of Electrical and Computer Engineering, University of Florida
How Safe Am I Given What I See? Calibrated Prediction
of Safety Chances for Image-Controlled Autonomy
APPROACH
FUTURE WORK
RESULTS: PREDICTION
CHALLENGES & CONTRIBUTIONS
PROBLEM
REFERENCES
■ Given a system with state x, observation y, and a safety property φ(x):
■ Is the system safe now? φ(xi
| yi
)
■ What will be the observation after k steps given past observations? yi+k
| yi-j:i
■ Will the system be safe after k steps given past observations? φ( xi+k
| yi-j:i
)
■ What’s the probability of safety after k steps given a sequence of observations? Pr(φ(xi+k
) | yi-j:i
)
Conformal confidence calibration:
Draw N observation-state pairs {b1
, b2
, ..., bN
}
M times from calibration datasets to get M
datasets {B1
, B2
, ..., BM
}. By implementing
Conformal Calibration Algorithm on predictor
g given miscoverage level 𝜶 and M datasets,
we can get an upper calibration error bound c
for a newly drawn BM+1
that:
Challenges:
■ Unknown states and dynamical models
■ High-dimensional (1000+) observations
■ Difficult to reliably quantify confidence
Contributions:
■ Modular family of online safety predictors
■ Conformal confidence calibration
■ Evaluation on a racing car and a cart pole
■ Conclusion 1: Monolithic (mon) predictors outperform composite (comp) ones.
■ Conclusion 2: Controller-independent (ind) and controller-specific (c-sp) predictors show
comparable performance.
■ Adding physical constraints
to latent states similar to
Neural ODEs (Wen, Wang,
and Metaxas 2022)
■ Jointly learning forecasters
and evaluators to overcome
distribution shifts
■ Using world-model
transformer architectures
■ Applying predictors to
physical systems
Conformal prediction: Vovk, V.; Gammerman, A.; and
Shafer, G. 2005. Algorithmic Learning in a Random World.
New York: Springer, 2005. ISBN 978-0-387-00152-4.
World models: Ha, D.; and Schmidhuber, J. 2018. Recurrent
World Models Facilitate Policy Evolution. In Advances in
Neural Information Processing Systems, volume 31.
Confidence calibration: Guo, C.; Pleiss, G.; Sun, Y.; and
Weinberger, K. Q. 2017. On calibration of modern neural
networks. In Proceedings of the 34th International
Conference on Machine Learning (ICML), Sydney, Australia.
RESULTS: CALIBRATION
■ Conclusion 3: Calibrated predictors are superior to uncalibrated ones.
■ Conclusion 4: Conformal calibration coverage is reliable.
L to R: (1) uncalibrated; (2): calibrated w/ isotonic regression, conformal bounds for 𝛼=0.05;
(3) ECE/MCE over varied k before/after calibration;
(4) our conformal bounds for 𝛼=0.05 (blue) contain more than 95% of the true
calibration errors (box and whisker plots).
L to R: (1) controller-specific (`c-sp') monolithic (`mon') vs. latent composite (`comp') for the racing car;
(2) controller-independent (`ind') monolithic vs. latent composite for the racing car;
(3) controller-specific monolithic vs. latent composite for the cart pole;
(4) controller-independent monolithic vs. latent composite for the cart pole.
Performance of safety label predictors over varied horizons
Shaded uncertainty shows standard deviation due to different controllers and resampling.
where q is the mean true safety and p is the
mean safety chance prediction scores:
Pr( |qM+1
− pM+1
| ≤ c ) ≥ 1 − α

More Related Content

PDF
Poster: How Safe Am I Given What I See? Calibrated Prediction of Safety Chanc...
PDF
How Safe Will I Be Given What I See? Calibrated Visual Safety Chance Predict...
PDF
Overcoming Heterogeneity in Autonomous Cyber-Physical Systems
PDF
​Poster: Zero-shot Safety Prediction for Autonomous Robots with Foundation Wo...
PDF
Verify-then-Monitor: Calibration Guarantees for Safety Confidence
PDF
Generalizing Scientific Machine Learning and Differentiable Simulation Beyond...
PPTX
Mx net image segmentation to predict and diagnose the cardiac diseases karp...
PPTX
OOD_PPT.pptx
Poster: How Safe Am I Given What I See? Calibrated Prediction of Safety Chanc...
How Safe Will I Be Given What I See? Calibrated Visual Safety Chance Predict...
Overcoming Heterogeneity in Autonomous Cyber-Physical Systems
​Poster: Zero-shot Safety Prediction for Autonomous Robots with Foundation Wo...
Verify-then-Monitor: Calibration Guarantees for Safety Confidence
Generalizing Scientific Machine Learning and Differentiable Simulation Beyond...
Mx net image segmentation to predict and diagnose the cardiac diseases karp...
OOD_PPT.pptx

Similar to Poster: How Safe Am I Given What I See? Calibrated Prediction of Safety Chances for Image-Controlled Autonomy (2024) (20)

PPT
tracking.ppt
PDF
(eBook PDF) Compressed Sensing in Radar Signal Processing
PPS
Probability Forecasting - a Machine Learning Perspective
PPT
P1121133727
PPTX
Research Presentation for those who are dong M.Tech
PDF
Autonomous Systems: How to Address the Dilemma between Autonomy and Safety
PPTX
Object Detection and Tracking using Statistical and Stochastic Techniques
PDF
Adobe Audition Crack FRESH Version 2025 FREE
PDF
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
PPT
Tetiana Bogodorova "Data Science for Electric Power Systems"
PDF
(eBook PDF) Compressed Sensing in Radar Signal Processing
PDF
(eBook PDF) Compressed Sensing in Radar Signal Processing
PPTX
Miccai2018 paperlist
PPTX
Crystallization classification semisupervised
PDF
Confidence Composition for Monitors of Verification Assumptions
PDF
(eBook PDF) Compressed Sensing in Radar Signal Processing
PPTX
Resilience Metrics and Approaches for Quantification, Igor LINKOV
PDF
Endobronchial Ultrasound Image Diagnosis Using Convolutional Neural Network (...
PDF
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
PPTX
ntroduction of Signal such as sinosoidal signals, definition of signals
tracking.ppt
(eBook PDF) Compressed Sensing in Radar Signal Processing
Probability Forecasting - a Machine Learning Perspective
P1121133727
Research Presentation for those who are dong M.Tech
Autonomous Systems: How to Address the Dilemma between Autonomy and Safety
Object Detection and Tracking using Statistical and Stochastic Techniques
Adobe Audition Crack FRESH Version 2025 FREE
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Tetiana Bogodorova "Data Science for Electric Power Systems"
(eBook PDF) Compressed Sensing in Radar Signal Processing
(eBook PDF) Compressed Sensing in Radar Signal Processing
Miccai2018 paperlist
Crystallization classification semisupervised
Confidence Composition for Monitors of Verification Assumptions
(eBook PDF) Compressed Sensing in Radar Signal Processing
Resilience Metrics and Approaches for Quantification, Igor LINKOV
Endobronchial Ultrasound Image Diagnosis Using Convolutional Neural Network (...
Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs
ntroduction of Signal such as sinosoidal signals, definition of signals
Ad

More from Ivan Ruchkin (20)

PDF
Neural Approximation of Vision-Controlled Systems for Reachability Analysis
PDF
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
PPTX
NeuroStrata: Harnessing Neuro-Symbolic Paradigms for Improved Testability and...
PDF
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification o...
PDF
Four Principles for Physically Interpretable World Models
PDF
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification o...
PDF
Four Principles for Physically Interpretable World Models (poster)
PDF
Accelerating Neural Policy Repair with Preservation via Stability-Plasticity ...
PDF
Distributionally Robust Statistical Verification with Imprecise Neural Networks
PDF
Autonomous Drift Detection and Online Road Friction Estimation
PDF
Neuro-Symbolic Bridge: From Perception to Estimation & Control
PDF
Towards Physically Interpretable World Models: Meaningful Weakly Supervised R...
PDF
Bridging Dimensions: Confident Reachability for High-Dimensional Controllers...
PDF
Poster: Bridging Dimensions: Confident Reachability for High-Dimensional Cont...
PDF
Bridging Dimensions: Confident Reachability for High-Dimensional Controllers
PDF
Language-Enhanced Latent Representations for Out-of-Distribution Detection in...
PDF
Curating Naturally Adversarial Datasets for Learning-Enabled Medical Cyber-Ph...
PDF
Repairing Learning-Enabled Controllers While Preserving What Works
PDF
Poster: Conservative Safety Monitors of Stochastic Dynamical Systems
PDF
Causal Repair of Learning-Enabled Cyber-physical Systems
Neural Approximation of Vision-Controlled Systems for Reachability Analysis
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
NeuroStrata: Harnessing Neuro-Symbolic Paradigms for Improved Testability and...
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification o...
Four Principles for Physically Interpretable World Models
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification o...
Four Principles for Physically Interpretable World Models (poster)
Accelerating Neural Policy Repair with Preservation via Stability-Plasticity ...
Distributionally Robust Statistical Verification with Imprecise Neural Networks
Autonomous Drift Detection and Online Road Friction Estimation
Neuro-Symbolic Bridge: From Perception to Estimation & Control
Towards Physically Interpretable World Models: Meaningful Weakly Supervised R...
Bridging Dimensions: Confident Reachability for High-Dimensional Controllers...
Poster: Bridging Dimensions: Confident Reachability for High-Dimensional Cont...
Bridging Dimensions: Confident Reachability for High-Dimensional Controllers
Language-Enhanced Latent Representations for Out-of-Distribution Detection in...
Curating Naturally Adversarial Datasets for Learning-Enabled Medical Cyber-Ph...
Repairing Learning-Enabled Controllers While Preserving What Works
Poster: Conservative Safety Monitors of Stochastic Dynamical Systems
Causal Repair of Learning-Enabled Cyber-physical Systems
Ad

Recently uploaded (20)

PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
OMC Textile Division Presentation 2021.pptx
PDF
Getting Started with Data Integration: FME Form 101
PDF
Mushroom cultivation and it's methods.pdf
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PPTX
Tartificialntelligence_presentation.pptx
PDF
A novel scalable deep ensemble learning framework for big data classification...
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PPTX
A Presentation on Touch Screen Technology
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Encapsulation theory and applications.pdf
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
SOPHOS-XG Firewall Administrator PPT.pptx
WOOl fibre morphology and structure.pdf for textiles
Building Integrated photovoltaic BIPV_UPV.pdf
OMC Textile Division Presentation 2021.pptx
Getting Started with Data Integration: FME Form 101
Mushroom cultivation and it's methods.pdf
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
Tartificialntelligence_presentation.pptx
A novel scalable deep ensemble learning framework for big data classification...
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
A Presentation on Touch Screen Technology
DP Operators-handbook-extract for the Mautical Institute
NewMind AI Weekly Chronicles - August'25-Week II
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Encapsulation theory and applications.pdf
Heart disease approach using modified random forest and particle swarm optimi...
Hindi spoken digit analysis for native and non-native speakers
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Digital-Transformation-Roadmap-for-Companies.pptx

Poster: How Safe Am I Given What I See? Calibrated Prediction of Safety Chances for Image-Controlled Autonomy (2024)

  • 1. Zhenjiang Mao, Carson Sobolewski, Ivan Ruchkin Trustworthy Engineered Autonomy (TEA) Lab Department of Electrical and Computer Engineering, University of Florida How Safe Am I Given What I See? Calibrated Prediction of Safety Chances for Image-Controlled Autonomy APPROACH FUTURE WORK RESULTS: PREDICTION CHALLENGES & CONTRIBUTIONS PROBLEM REFERENCES ■ Given a system with state x, observation y, and a safety property φ(x): ■ Is the system safe now? φ(xi | yi ) ■ What will be the observation after k steps given past observations? yi+k | yi-j:i ■ Will the system be safe after k steps given past observations? φ( xi+k | yi-j:i ) ■ What’s the probability of safety after k steps given a sequence of observations? Pr(φ(xi+k ) | yi-j:i ) Conformal confidence calibration: Draw N observation-state pairs {b1 , b2 , ..., bN } M times from calibration datasets to get M datasets {B1 , B2 , ..., BM }. By implementing Conformal Calibration Algorithm on predictor g given miscoverage level 𝜶 and M datasets, we can get an upper calibration error bound c for a newly drawn BM+1 that: Challenges: ■ Unknown states and dynamical models ■ High-dimensional (1000+) observations ■ Difficult to reliably quantify confidence Contributions: ■ Modular family of online safety predictors ■ Conformal confidence calibration ■ Evaluation on a racing car and a cart pole ■ Conclusion 1: Monolithic (mon) predictors outperform composite (comp) ones. ■ Conclusion 2: Controller-independent (ind) and controller-specific (c-sp) predictors show comparable performance. ■ Adding physical constraints to latent states similar to Neural ODEs (Wen, Wang, and Metaxas 2022) ■ Jointly learning forecasters and evaluators to overcome distribution shifts ■ Using world-model transformer architectures ■ Applying predictors to physical systems Conformal prediction: Vovk, V.; Gammerman, A.; and Shafer, G. 2005. Algorithmic Learning in a Random World. New York: Springer, 2005. ISBN 978-0-387-00152-4. World models: Ha, D.; and Schmidhuber, J. 2018. Recurrent World Models Facilitate Policy Evolution. In Advances in Neural Information Processing Systems, volume 31. Confidence calibration: Guo, C.; Pleiss, G.; Sun, Y.; and Weinberger, K. Q. 2017. On calibration of modern neural networks. In Proceedings of the 34th International Conference on Machine Learning (ICML), Sydney, Australia. RESULTS: CALIBRATION ■ Conclusion 3: Calibrated predictors are superior to uncalibrated ones. ■ Conclusion 4: Conformal calibration coverage is reliable. L to R: (1) uncalibrated; (2): calibrated w/ isotonic regression, conformal bounds for 𝛼=0.05; (3) ECE/MCE over varied k before/after calibration; (4) our conformal bounds for 𝛼=0.05 (blue) contain more than 95% of the true calibration errors (box and whisker plots). L to R: (1) controller-specific (`c-sp') monolithic (`mon') vs. latent composite (`comp') for the racing car; (2) controller-independent (`ind') monolithic vs. latent composite for the racing car; (3) controller-specific monolithic vs. latent composite for the cart pole; (4) controller-independent monolithic vs. latent composite for the cart pole. Performance of safety label predictors over varied horizons Shaded uncertainty shows standard deviation due to different controllers and resampling. where q is the mean true safety and p is the mean safety chance prediction scores: Pr( |qM+1 − pM+1 | ≤ c ) ≥ 1 − α