SlideShare a Scribd company logo
UPDATE THIS PRESENTATION HEADER IN SLIDE MASTER
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Verifiable Explainability for Financial
Services Workflows with Automated
Reasoning
Andy Hall, Sr. Solutions Architect
hllaah@amazon.com
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda What’s the Problem?
Automated Reasoning
Example
Resources
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
3
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
What’s the Problem?
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Hallucinations can be subtle
Ground truth
My friend Sam and I enjoy
solving advent of code puzzles.
We spend hours on Slack
discussing the trade offs
between different algorithms to
solve the problem. Our passion
for this activity brings us
closer as good friends.
LLM Summary
Ben and I love solving
advent of code puzzles,
and this makes us good
friends.
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Hallucinations can be subtle
Ground truth
My friend Sam and I enjoy
solving advent of code puzzles.
We spend hours on Slack
discussing the trade offs
between different algorithms to
solve the problem. Our passion
for this activity brings us
closer as good friends.
LLM Summary
Ben and I love solving
advent of code puzzles,
and this makes us good
friends.
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The concern of hallucinations that result
in reasoning errors was the top-rated
potential risk (59%), followed by bad
actors creating misinformation (48%) and
privacy assurances (44%).
Gartner
2024 Gartner CIO Gen AI Survey
See https://www.gartner.com/en/documents/5705151
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Hallucinations are not a bug. It’s a feature.
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
It’s creativity.
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
National Association of Insurance Commissioners
(NAIC)
Controls and processes that an Insurer adopts and implements shall consider:
• The nature of the decisions being made, informed, or supported using the AI System
• The type and mitigate the Degree of Potential Harm to Consumers resulting from the use of AI Systems
• The extent to which humans are involved in the final decision-making process
• The transparency and explainability of outcomes to the impacted consumer
• The extent and scope of the insurer’s use or reliance on data, Predictive Models, and AI Systems from
third parties.
• Governance and explainability controls and processes should be commensurate with both the risk of
Adverse Consumer Outcomes and the Degree of Potential Harm to Consumers.
See https://content.naic.org/sites/default/files/inline-files/2023-12-4%20Model%20Bulletin_Adopted_0.pdf for the bulletin
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Guaranteed Safe AI
See https://arxiv.org/abs/2405.06624 for paper that includes details on this
topic
Safety Specification: cover complex and muti-domain behaviors, for
example:
• Agentic systems shall not take direct actions that can
negatively impact consumers (unfairly denied claims, denied
policy application, etc.)
• Agentic systems shall not take actions that violates security
policies (invoke APIs that can share database content, delete
databases, etc.)
• Agentic systems shall not take action that may allow
invocation of API for money movement (payments, transfers,
etc.)
Verifier: Provides explainability and deterministic mathematical proof
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Automated Theorem Proving
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
12
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Automated Reasoning
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Three objectives
Accurate
Identifies and suggests
corrections for
inaccurate factual
claims on support
knowledge
Sound
When it says something is
incorrect – it is. If we cannot
make a claim one way or
another, we’ll tell you
Transparent
We can explain exactly
why we believe a claim is
accurate, or not
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon
Bedrock
Guardrails
Evaluate prompts and model responses for agents, knowledge bases,
FMs in Amazon Bedrock, and self-managed or third-party FMs
Configure thresholds to filter harmful content, jailbreaks, and prompt
injection attacks
Define and disallow denied topics with short natural
language descriptions
Remove personally identifiable information (PII) and sensitive
information in generative AI applications
Filter hallucinations by detecting groundedness and relevance of
model responses based on context
Identify, correct, and explain factual claims in responses based on
ground truth formal logic
Automated
Reasoning
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
When does it work best
Policies
Laws and regulation
Operational workflows
Marketing messaging
best practices
Probabilistic calculations
(what are the chances
of?)
Qualitative descriptions
It’s not suited for
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
1 2
3
4
5
7
6
Source Documents
Automated Reasoning
Checks
Automated Reasoning
Policy
Amazon Bedrock
Guardrails
User providing input and
FM response
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Intent
Description
Upload content
Policy Name
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Semantics of Propositional Logic
propositional
formulas are intended
to represent
assertions that may
be true or false
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Truth Table Example
Is_acceptable_risk = (Chargeable_accident_Count<=3) AND
(DUI_count<=0) AND
(Vehicle_type_acceptable)
Chargeable_accident_Count<=3 DUI_count<=0 Vehicle_type_acceptable
TRUE TRUE TRUE
TRUE TRUE FALSE
TRUE FALSE TRUE
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
21
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Example
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Homeowners Insurance Policy Claims Processing
Example
Document
Generated Rules and Variables
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Guardrail Creation
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Testing
I had a fire in my house and the fire
department was here to put it out. What do I
need to do to file a claim?
To file a fire damage insurance claim, follow these steps:
1. Contact your insurance company immediately
- Call their 24/7 claims number
- Get a claim number
- Ask about immediate living expenses coverage if needed
2. Document everything
- Take photos/videos of all damage
- Make a list of damaged items
- Keep all receipts for emergency repairs or temporary housing
- Save any communication with the fire department
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Boto3 Example
Sample code available at https://github.com/aws-samples/amazon-bedrock-samples/tree/main/responsible_ai/bedrock-automated-reasoning-checks
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon
Bedrock
Guardrails
Evaluate prompts and model responses for agents, knowledge bases,
FMs in Amazon Bedrock, and self-managed or third-party FMs
Configure thresholds to filter harmful content, jailbreaks, and prompt
injection attacks
Define and disallow denied topics with short natural
language descriptions
Remove personally identifiable information (PII) and sensitive
information in generative AI applications
Filter hallucinations by detecting groundedness and relevance of
model responses based on context
Identify, correct, and explain factual claims in responses based on
ground truth formal logic
Automated
Reasoning
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Andy Hall
hllaah@amazon.com
https://www.linkedin.com/in/andythall
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
28
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Resources
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AWS Resources
• https://www.amazon.science/blog/a-gentle-introduction-to-automated-reas
oning
• https://aws.amazon.com/blogs/aws/prevent-factual-errors-from-llm-hallucin
ations-with-mathematically-sound-automated-reasoning-checks-preview/
• https://aws.amazon.com/blogs/machine-learning/minimize-generative-ai-ha
llucinations-with-amazon-bedrock-automated-reasoning-checks/
• https://aws.amazon.com/about-aws/whats-new/2024/12/amazon-bedrock-g
uardrails-automated-reasoning-checks-preview/
• https://github.com/aws-samples/amazon-bedrock-samples/tree/main/respo
nsible_ai/bedrock-automated-reasoning-checks
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
References
• https://kwarc.info/teaching/sWuV/harrison_handbook-of-p
ractical-logic.pdf
• https://www.gartner.com/en/documents/5705151
• https://content.naic.org/sites/default/files/inline-files/2023-
12-4%20Model%20Bulletin_Adopted_0.pdf
• https://arxiv.org/abs/2405.06624
• https://en.wikipedia.org/wiki/Satisfiability_modulo_theorie
s

More Related Content

PDF
stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...
PPTX
Amazon SageMaker Clarify
PPTX
Threat Modeling a Batch Job Framework - Teri Radichel - AWS re:Inforce 2025
PPTX
Exploring_agents_with_Amazon_Bedrock.pptx
PDF
Automated Workflows and AI Agents with Amazon Bedrock
PDF
Innovate 7 Principles for effective and cost-efficient generative AI apps.pdf
PPTX
2018 re:Invent - Safeguard the Integrity of Your Code for Fast and Secure Dep...
PDF
Evaluating Large Language Models for Your Applications and Why It Matters
stackconf 2024 | Generative AI Security — A Practical Guide to Securing Your ...
Amazon SageMaker Clarify
Threat Modeling a Batch Job Framework - Teri Radichel - AWS re:Inforce 2025
Exploring_agents_with_Amazon_Bedrock.pptx
Automated Workflows and AI Agents with Amazon Bedrock
Innovate 7 Principles for effective and cost-efficient generative AI apps.pdf
2018 re:Invent - Safeguard the Integrity of Your Code for Fast and Secure Dep...
Evaluating Large Language Models for Your Applications and Why It Matters

More from AWS Chicago (20)

PPTX
Kathie Kinde Clark - Elevate Your Professional Footprint: LinkedIn Masterclass
PDF
Jason Anderson From Dirt Roads to Highways: Simplifying DevOps and Cloud Inf...
PDF
Aman Sardana and Vijay Kumar Soni - Navigating Hybrid Cloud Challenges for ...
PDF
Ben Blair Operating Safely in a Vibe Coding World
PPTX
Joseph Morotti Enhancing customer experience through Amazon Connect and Gene...
PPTX
Craig Johnson When VPCs Attack: Real-Life Cloud Networking Fails (and Fixes)
PDF
Peter Sankauskas Access Denied: Understanding & Debugging AWS IAM
PDF
Shuen Mei Parth Sharma Boost Productivity, Innovation and Efficiency wit...
PDF
Bob Fornal The Impact of Testing on a DevOps Pipeline
PDF
Jason Butz Chaos Engineering with FIS and Lambda Functions
PPTX
Automated VPC migration into centralized inspection architecture with AWS Gat...
PDF
Julia Furst Morgado The Lazy Guide to Kubernetes with EKS Auto Mode + Karpenter
PDF
Bob Fornal - Active Career Management AWS Community Day Midwest 2025
PDF
Edwin Moedano Monitoring and Observability of Lambdas with Cloudwatch and Po...
PPTX
Darren Mills The Migration Modernization Balancing Act: Navigating Risks and...
PPTX
Nathan Hiscock Architecting secure, scalable, cost-efficient computer vision...
PDF
AWS Community Day Midwest 2025 Julia Furst Morgado The Lazy Guide to Kuberne...
PDF
Steven Seaney - Simplifying and Streamlining AWS Control Tower Deployments
PDF
Timothy Rottach - Ramp up on AI Use Cases, from Vector Search to AI Agents wi...
PPTX
Paul Chin Jr. Data Gone in 60 Seconds: A Serverless ETL Heist
Kathie Kinde Clark - Elevate Your Professional Footprint: LinkedIn Masterclass
Jason Anderson From Dirt Roads to Highways: Simplifying DevOps and Cloud Inf...
Aman Sardana and Vijay Kumar Soni - Navigating Hybrid Cloud Challenges for ...
Ben Blair Operating Safely in a Vibe Coding World
Joseph Morotti Enhancing customer experience through Amazon Connect and Gene...
Craig Johnson When VPCs Attack: Real-Life Cloud Networking Fails (and Fixes)
Peter Sankauskas Access Denied: Understanding & Debugging AWS IAM
Shuen Mei Parth Sharma Boost Productivity, Innovation and Efficiency wit...
Bob Fornal The Impact of Testing on a DevOps Pipeline
Jason Butz Chaos Engineering with FIS and Lambda Functions
Automated VPC migration into centralized inspection architecture with AWS Gat...
Julia Furst Morgado The Lazy Guide to Kubernetes with EKS Auto Mode + Karpenter
Bob Fornal - Active Career Management AWS Community Day Midwest 2025
Edwin Moedano Monitoring and Observability of Lambdas with Cloudwatch and Po...
Darren Mills The Migration Modernization Balancing Act: Navigating Risks and...
Nathan Hiscock Architecting secure, scalable, cost-efficient computer vision...
AWS Community Day Midwest 2025 Julia Furst Morgado The Lazy Guide to Kuberne...
Steven Seaney - Simplifying and Streamlining AWS Control Tower Deployments
Timothy Rottach - Ramp up on AI Use Cases, from Vector Search to AI Agents wi...
Paul Chin Jr. Data Gone in 60 Seconds: A Serverless ETL Heist
Ad

Recently uploaded (20)

PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
TLE Review Electricity (Electricity).pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Mushroom cultivation and it's methods.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
1. Introduction to Computer Programming.pptx
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PPTX
cloud_computing_Infrastucture_as_cloud_p
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PPTX
Spectroscopy.pptx food analysis technology
PDF
A comparative study of natural language inference in Swahili using monolingua...
Accuracy of neural networks in brain wave diagnosis of schizophrenia
MIND Revenue Release Quarter 2 2025 Press Release
TLE Review Electricity (Electricity).pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Empathic Computing: Creating Shared Understanding
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Assigned Numbers - 2025 - Bluetooth® Document
Network Security Unit 5.pdf for BCA BBA.
Digital-Transformation-Roadmap-for-Companies.pptx
Heart disease approach using modified random forest and particle swarm optimi...
A comparative analysis of optical character recognition models for extracting...
Mushroom cultivation and it's methods.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
1. Introduction to Computer Programming.pptx
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
cloud_computing_Infrastucture_as_cloud_p
SOPHOS-XG Firewall Administrator PPT.pptx
Spectroscopy.pptx food analysis technology
A comparative study of natural language inference in Swahili using monolingua...
Ad

Andy Hall - Build verifiable explainability into financial services workflows with Automated Reasoning checks

  • 1. UPDATE THIS PRESENTATION HEADER IN SLIDE MASTER © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Verifiable Explainability for Financial Services Workflows with Automated Reasoning Andy Hall, Sr. Solutions Architect hllaah@amazon.com
  • 2. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Agenda What’s the Problem? Automated Reasoning Example Resources
  • 3. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. 3 © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. What’s the Problem?
  • 4. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Hallucinations can be subtle Ground truth My friend Sam and I enjoy solving advent of code puzzles. We spend hours on Slack discussing the trade offs between different algorithms to solve the problem. Our passion for this activity brings us closer as good friends. LLM Summary Ben and I love solving advent of code puzzles, and this makes us good friends.
  • 5. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Hallucinations can be subtle Ground truth My friend Sam and I enjoy solving advent of code puzzles. We spend hours on Slack discussing the trade offs between different algorithms to solve the problem. Our passion for this activity brings us closer as good friends. LLM Summary Ben and I love solving advent of code puzzles, and this makes us good friends.
  • 6. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. The concern of hallucinations that result in reasoning errors was the top-rated potential risk (59%), followed by bad actors creating misinformation (48%) and privacy assurances (44%). Gartner 2024 Gartner CIO Gen AI Survey See https://www.gartner.com/en/documents/5705151
  • 7. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Hallucinations are not a bug. It’s a feature.
  • 8. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. It’s creativity.
  • 9. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. National Association of Insurance Commissioners (NAIC) Controls and processes that an Insurer adopts and implements shall consider: • The nature of the decisions being made, informed, or supported using the AI System • The type and mitigate the Degree of Potential Harm to Consumers resulting from the use of AI Systems • The extent to which humans are involved in the final decision-making process • The transparency and explainability of outcomes to the impacted consumer • The extent and scope of the insurer’s use or reliance on data, Predictive Models, and AI Systems from third parties. • Governance and explainability controls and processes should be commensurate with both the risk of Adverse Consumer Outcomes and the Degree of Potential Harm to Consumers. See https://content.naic.org/sites/default/files/inline-files/2023-12-4%20Model%20Bulletin_Adopted_0.pdf for the bulletin
  • 10. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Guaranteed Safe AI See https://arxiv.org/abs/2405.06624 for paper that includes details on this topic Safety Specification: cover complex and muti-domain behaviors, for example: • Agentic systems shall not take direct actions that can negatively impact consumers (unfairly denied claims, denied policy application, etc.) • Agentic systems shall not take actions that violates security policies (invoke APIs that can share database content, delete databases, etc.) • Agentic systems shall not take action that may allow invocation of API for money movement (payments, transfers, etc.) Verifier: Provides explainability and deterministic mathematical proof
  • 11. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Automated Theorem Proving
  • 12. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. 12 © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Automated Reasoning
  • 13. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Three objectives Accurate Identifies and suggests corrections for inaccurate factual claims on support knowledge Sound When it says something is incorrect – it is. If we cannot make a claim one way or another, we’ll tell you Transparent We can explain exactly why we believe a claim is accurate, or not
  • 14. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Bedrock Guardrails Evaluate prompts and model responses for agents, knowledge bases, FMs in Amazon Bedrock, and self-managed or third-party FMs Configure thresholds to filter harmful content, jailbreaks, and prompt injection attacks Define and disallow denied topics with short natural language descriptions Remove personally identifiable information (PII) and sensitive information in generative AI applications Filter hallucinations by detecting groundedness and relevance of model responses based on context Identify, correct, and explain factual claims in responses based on ground truth formal logic Automated Reasoning
  • 15. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. When does it work best Policies Laws and regulation Operational workflows Marketing messaging best practices Probabilistic calculations (what are the chances of?) Qualitative descriptions It’s not suited for
  • 16. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. 1 2 3 4 5 7 6 Source Documents Automated Reasoning Checks Automated Reasoning Policy Amazon Bedrock Guardrails User providing input and FM response
  • 17. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Intent Description Upload content Policy Name
  • 18. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 19. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Semantics of Propositional Logic propositional formulas are intended to represent assertions that may be true or false
  • 20. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Truth Table Example Is_acceptable_risk = (Chargeable_accident_Count<=3) AND (DUI_count<=0) AND (Vehicle_type_acceptable) Chargeable_accident_Count<=3 DUI_count<=0 Vehicle_type_acceptable TRUE TRUE TRUE TRUE TRUE FALSE TRUE FALSE TRUE
  • 21. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. 21 © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Example
  • 22. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Homeowners Insurance Policy Claims Processing Example Document Generated Rules and Variables
  • 23. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Guardrail Creation
  • 24. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Testing I had a fire in my house and the fire department was here to put it out. What do I need to do to file a claim? To file a fire damage insurance claim, follow these steps: 1. Contact your insurance company immediately - Call their 24/7 claims number - Get a claim number - Ask about immediate living expenses coverage if needed 2. Document everything - Take photos/videos of all damage - Make a list of damaged items - Keep all receipts for emergency repairs or temporary housing - Save any communication with the fire department
  • 25. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Boto3 Example Sample code available at https://github.com/aws-samples/amazon-bedrock-samples/tree/main/responsible_ai/bedrock-automated-reasoning-checks
  • 26. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Bedrock Guardrails Evaluate prompts and model responses for agents, knowledge bases, FMs in Amazon Bedrock, and self-managed or third-party FMs Configure thresholds to filter harmful content, jailbreaks, and prompt injection attacks Define and disallow denied topics with short natural language descriptions Remove personally identifiable information (PII) and sensitive information in generative AI applications Filter hallucinations by detecting groundedness and relevance of model responses based on context Identify, correct, and explain factual claims in responses based on ground truth formal logic Automated Reasoning
  • 27. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Thank you! © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Andy Hall hllaah@amazon.com https://www.linkedin.com/in/andythall
  • 28. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. 28 © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark. Resources
  • 29. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. AWS Resources • https://www.amazon.science/blog/a-gentle-introduction-to-automated-reas oning • https://aws.amazon.com/blogs/aws/prevent-factual-errors-from-llm-hallucin ations-with-mathematically-sound-automated-reasoning-checks-preview/ • https://aws.amazon.com/blogs/machine-learning/minimize-generative-ai-ha llucinations-with-amazon-bedrock-automated-reasoning-checks/ • https://aws.amazon.com/about-aws/whats-new/2024/12/amazon-bedrock-g uardrails-automated-reasoning-checks-preview/ • https://github.com/aws-samples/amazon-bedrock-samples/tree/main/respo nsible_ai/bedrock-automated-reasoning-checks
  • 30. © 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. References • https://kwarc.info/teaching/sWuV/harrison_handbook-of-p ractical-logic.pdf • https://www.gartner.com/en/documents/5705151 • https://content.naic.org/sites/default/files/inline-files/2023- 12-4%20Model%20Bulletin_Adopted_0.pdf • https://arxiv.org/abs/2405.06624 • https://en.wikipedia.org/wiki/Satisfiability_modulo_theorie s

Editor's Notes

  • #4: * Sometimes the hallucination is big * Sometimes it’s subtle. Can you find it here? Take your time
  • #5: * Surprise
  • #6: * Back to our Gartner friends
  • #7: I don’t want you to read me as bashing LLMs We want LLMs to hallucinate – it’s their super-power When it matters, we need to find a way to feel comfortable with using them
  • #8: I don’t want you to read me as bashing LLMs We want LLMs to hallucinate – it’s their super-power When it matters, we need to find a way to feel comfortable with using them
  • #9: The NAIC operates similarly to a trade association. It is a federation of insurance commissioners with a founding goal of education and identify leading practices. It does no regulatory authority. The NAIC can issue non-binding guidance, but it does not issue laws, rules, or regulation. There is no obligation to operate according to NAIC guidance. Dec 4, 2023 bulletin to align state insurance commissioners on the usage and controls needed to agentic systems.
  • #10: Guaranteed Safe AI defined in May 10th 2024 Paper calls out the usage of a Safety Specification Read points on the slide here Verifier – provides the ability to explain actions and provides a deterministic mathematical proof
  • #11: Automated Theorem Proving is a specialized branch of computer science that uses mathematical proof techniques and formal logical deduction to verify compliance with rules and requirements with absolute certainty under given assumptions. SMT Solvers (Satisfiability Modulo Theories) is a tool that aims to determine if a mathematical formula is satisfiable using a practical subset of inputs, and it generalizes SAT (Boolean Satisfiability problem) to more complex formulas involving real numbers and various data structures.
  • #13: We want to be accurate – clearly identify factual claims that are incorrect Soundness to us means that we won’t draw false conclusion from true premises. We’d rather tell you we are not sure and you should clarify than tell you something is correct when it’s not We can be transparent. We can logically and verifiable explain why we believe something is correct
  • #14: Unlike probabilistic approaches prevalent in machine learning, Automated Reasoning relies on formal mathematical logic to provide definitive guarantees about what can and can’t be proven. This approach mirrors the rigors of auditors verifying financial statements or compliance officers validating regulatory requirements, but with mathematical precision. By using rigorous logical frameworks and theorem-proving methodologies, Automated Reasoning can conclusively determine whether statements are true or false under given assumptions. This makes it exceptionally valuable for applications that demand high assurance and need to deliver unambiguous conclusions to their users.
  • #15: Works well to validated factual claims and procedures Things that boil down to a yes/no, always/never answer
  • #16: Step 1: Source Document, along with Intent Instructions are passed to the Automated Reasoning Service to build the Rules and Variables and create an AR policy Step 2: AR Policy is Created and Versioned Step 3: An AR Policy and version is associated with a Bedrock Guardrail Step 4: An ApplyGuardrail API call is made with the question and FM Response to the associated Bedrock Guardrail Step 5: The Automated Reasoning model is triggered with the inputs from the ApplyGuardrail API building logical representations of the input and FM response Step 6: An Automated Reasoning check is completed based on the created rules and variables from the source document, and the logical representation of the inputs Step 7: The results of the AR are shared with the user along with what rules, variables, and variable values where used in it’s determination, along with suggestions on what would make the assertion valid
  • #25: Step 1: Source Document, along with Intent Instructions are passed to the Automated Reasoning Service to build the Rules and Variables and create an AR policy Step 2: AR Policy is Created and Versioned Step 3: An AR Policy and version is associated with a Bedrock Guardrail Step 4: An ApplyGuardrail API call is made with the question and FM Response to the associated Bedrock Guardrail Step 5: The Automated Reasoning model is triggered with the inputs from the ApplyGuardrail API building logical representations of the input and FM response Step 6: An Automated Reasoning check is completed based on the created rules and variables from the source document, and the logical representation of the inputs Step 7: The results of the AR are shared with the user along with what rules, variables, and variable values where used in it’s determination, along with suggestions on what would make the assertion valid
  • #26: Together, avengers assemble, right?