Andy Hall - Build verifiable explainability into financial services workflows with Automated Reasoning checks

UPDATE THIS PRESENTATION HEADER IN SLIDE MASTER
© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
Verifiable Explainability for Financial
Services Workflows with Automated
Reasoning
Andy Hall, Sr. Solutions Architect
hllaah@amazon.com

© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda What’s the Problem?
Automated Reasoning
Example
Resources

3
What’s the Problem?

Hallucinations can be subtle
Ground truth
My friend Sam and I enjoy
solving advent of code puzzles.
We spend hours on Slack
discussing the trade offs
between different algorithms to
solve the problem. Our passion
for this activity brings us
closer as good friends.
LLM Summary
Ben and I love solving
advent of code puzzles,
and this makes us good
friends.

The concern of hallucinations that result
in reasoning errors was the top-rated
potential risk (59%), followed by bad
actors creating misinformation (48%) and
privacy assurances (44%).
Gartner
2024 Gartner CIO Gen AI Survey
See https://www.gartner.com/en/documents/5705151

Hallucinations are not a bug. It’s a feature.

It’s creativity.

National Association of Insurance Commissioners
(NAIC)
Controls and processes that an Insurer adopts and implements shall consider:
• The nature of the decisions being made, informed, or supported using the AI System
• The type and mitigate the Degree of Potential Harm to Consumers resulting from the use of AI Systems
• The extent to which humans are involved in the final decision-making process
• The transparency and explainability of outcomes to the impacted consumer
• The extent and scope of the insurer’s use or reliance on data, Predictive Models, and AI Systems from
third parties.
• Governance and explainability controls and processes should be commensurate with both the risk of
Adverse Consumer Outcomes and the Degree of Potential Harm to Consumers.
See https://content.naic.org/sites/default/files/inline-files/2023-12-4%20Model%20Bulletin_Adopted_0.pdf for the bulletin

Guaranteed Safe AI
See https://arxiv.org/abs/2405.06624 for paper that includes details on this
topic
Safety Specification: cover complex and muti-domain behaviors, for
example:
• Agentic systems shall not take direct actions that can
negatively impact consumers (unfairly denied claims, denied
policy application, etc.)
• Agentic systems shall not take actions that violates security
policies (invoke APIs that can share database content, delete
databases, etc.)
• Agentic systems shall not take action that may allow
invocation of API for money movement (payments, transfers,
etc.)
Verifier: Provides explainability and deterministic mathematical proof

Automated Theorem Proving

12
Automated Reasoning

Three objectives
Accurate
Identifies and suggests
corrections for
inaccurate factual
claims on support
knowledge
Sound
When it says something is
incorrect – it is. If we cannot
make a claim one way or
another, we’ll tell you
Transparent
We can explain exactly
why we believe a claim is
accurate, or not

Amazon
Bedrock
Guardrails
Evaluate prompts and model responses for agents, knowledge bases,
FMs in Amazon Bedrock, and self-managed or third-party FMs
Configure thresholds to filter harmful content, jailbreaks, and prompt
injection attacks
Define and disallow denied topics with short natural
language descriptions
Remove personally identifiable information (PII) and sensitive
information in generative AI applications
Filter hallucinations by detecting groundedness and relevance of
model responses based on context
Identify, correct, and explain factual claims in responses based on
ground truth formal logic
Automated
Reasoning

When does it work best
Policies
Laws and regulation
Operational workflows
Marketing messaging
best practices
Probabilistic calculations
(what are the chances
of?)
Qualitative descriptions
It’s not suited for

1 2
3
4
5
7
6
Source Documents
Automated Reasoning
Checks
Automated Reasoning
Policy
Amazon Bedrock
Guardrails
User providing input and
FM response

Intent
Description
Upload content
Policy Name

Semantics of Propositional Logic
propositional
formulas are intended
to represent
assertions that may
be true or false

Truth Table Example
Is_acceptable_risk = (Chargeable_accident_Count<=3) AND
(DUI_count<=0) AND
(Vehicle_type_acceptable)
Chargeable_accident_Count<=3 DUI_count<=0 Vehicle_type_acceptable
TRUE TRUE TRUE
TRUE TRUE FALSE
TRUE FALSE TRUE

21
Example

Homeowners Insurance Policy Claims Processing
Example
Document
Generated Rules and Variables

Guardrail Creation

Testing
I had a fire in my house and the fire
department was here to put it out. What do I
need to do to file a claim?
To file a fire damage insurance claim, follow these steps:
1. Contact your insurance company immediately
- Call their 24/7 claims number
- Get a claim number
- Ask about immediate living expenses coverage if needed
2. Document everything
- Take photos/videos of all damage
- Make a list of damaged items
- Keep all receipts for emergency repairs or temporary housing
- Save any communication with the fire department

Boto3 Example
Sample code available at https://github.com/aws-samples/amazon-bedrock-samples/tree/main/responsible_ai/bedrock-automated-reasoning-checks

Thank you!
Andy Hall
hllaah@amazon.com
https://www.linkedin.com/in/andythall

28
Resources

AWS Resources
• https://www.amazon.science/blog/a-gentle-introduction-to-automated-reas
oning
• https://aws.amazon.com/blogs/aws/prevent-factual-errors-from-llm-hallucin
ations-with-mathematically-sound-automated-reasoning-checks-preview/
• https://aws.amazon.com/blogs/machine-learning/minimize-generative-ai-ha
llucinations-with-amazon-bedrock-automated-reasoning-checks/
• https://aws.amazon.com/about-aws/whats-new/2024/12/amazon-bedrock-g
uardrails-automated-reasoning-checks-preview/
• https://github.com/aws-samples/amazon-bedrock-samples/tree/main/respo
nsible_ai/bedrock-automated-reasoning-checks

References
• https://kwarc.info/teaching/sWuV/harrison_handbook-of-p
ractical-logic.pdf
• https://www.gartner.com/en/documents/5705151
• https://content.naic.org/sites/default/files/inline-files/2023-
12-4%20Model%20Bulletin_Adopted_0.pdf
• https://arxiv.org/abs/2405.06624
• https://en.wikipedia.org/wiki/Satisfiability_modulo_theorie
s

Andy Hall - Build verifiable explainability into financial services workflows with Automated Reasoning checks

More Related Content

More from AWS Chicago (20)

Recently uploaded (20)

Andy Hall - Build verifiable explainability into financial services workflows with Automated Reasoning checks

Editor's Notes