SlideShare a Scribd company logo
3
Most read
10
Most read
16
Most read
Stateful Monitoring and
Responsible Deployment
of AI Agents
Debmalya Biswas
Wipro AI
Agentic AI Evolution
(Complex) Agentic AI Task Decomposition
A high-level approach to solving complex tasks:
• - decomposition of the given complex task into
a hierarchy or workflow of) simple tasks,
followed by
• - composition of agents able to execute the
simpler tasks.
This can be achieved in a dynamic or static manner.
• Dynamic: given a complex user task, the system
comes up with a plan to fulfil the request
depending on the capabilities of available
agents at run-time.
• Static: given a set of agents, composite agents
are defined manually at design-time combining
their capabilities. * D. Biswas. Constraints Enabled Autonomous Agent Marketplace:
Discovery and Matchmaking. 16th International Conference on Agents
and Artificial Intelligence (ICAART), 2024 (link)
Agentic AI Platform Reference Architecture
The future where enterprises
will be able to develop new
Enterprise AI Apps by
orchestrating / composing
multiple existing AI Agents.
Observability for AI
Agents
Non-determinism in Agentic AI Systems
There are two non-deterministic
operators in the execution plan:
‘Check Credit’ and ‘Delivery Mode’.
The choice ‘Delivery Mode’ indicates
that the user can either pick-up the
order directly from the store or have
it shipped to his address.
Given this, shipping is a non-
deterministic choice and may not be
invoked during the actual execution.
Observability Challenges for Agentic AI
Observability for AI Agents is
challenging:
- No global observer: Due to their
distributed nature, we cannot assume
the existence of an entity having
visibility over the entire execution. In
fact, due to their privacy and
autonomy requirements, even the
composite agent may not have
visibility over the internal processing
of its component agents.
- Parallelism: AI agents allow parallel
composition of processes.
- Dynamic configuration: The agents
are selected incrementally as the
execution progresses (dynamic
binding). Thus, the “components” of
the distributed system may not be
known in advance.
Stateful execution for AI Agents
AgentOps monitoring is critical given the
complexity and long running nature of AI
agents. We define observability as the
ability to find out where in the process the
execution is and whether any
unanticipated glitches have appeared.
- Local queries: Queries which can be
answered based on the local state
information of an agent.
- Composite queries: Queries expressed
over the states of several agents.
- Historical queries: Queries related to the
execution history of the composition.
- Relationship queries: Queries based on
the relationship between states.
Responsible AI
Agents
Data Quality Issues with respect to LLMs, esp.
Vector DBs
From a data quality point of view,
we see the following challenges
w.r.t. LLMs, esp. Vector DBs:
- Accuracy of the encodings in vector
stores, measures in terms of
correctness and groundedness of
the generated LLM responses.
- Incorrect and/or inconsistent
vectors: Due to issues in the
embedding process, some vectors
may end up getting corrupted, be
incomplete, or getting generated
with a different dimensionality.
- Missing data can be in the form of
missing vectors or metadata.
- Timeliness issues w.r.t. outdated
documents impacting the vector
store.
* D. Biswas. Long-term Memory for AI Agents. AI Advances, 2024 (link)
* D. Biswas. Long-term Memory for AI Agents. AI
Advances, 2024 (link)
Explainability
Explainable AI is an umbrella term for
a range of tools, algorithms and
methods; which accompany AI model
predictions with explanations.
- Explainability of AI models ranks
high among the list of ‘non-
functional’ AI features to be
considered by enterprises.
- For example, this implies having
to explain why an ML model
profiled a user to be in a specific
segment — which led him/her to
receiving an advertisement.
(Labeled)
Data
Train ML
Model
Predictions
Explanation
Model
Explainable
Predictions
Fairness & Bias
Bias creeps into AI models, primarily
due to the inherent bias already
present in the training data.
So the ‘data’ part of AI model
development is key to addressing
bias.
- Historical Bias: arises due to
historical inequality of human
decisions captured in the training
data
- Representation Bias: arises due to
training data that is not
representative of the actual
population.
*H. Suresh, J. V. Guttag. A Framework for Understanding Unintended Consequences of Machine Learning,
2020 (link)
*H. Suresh, J. V. Guttag. A Framework for Understanding
Unintended Consequences of Machine Learning, 2020 (link)
ML Privacy Risks
Two broad categories of
privacy inference attacks:
• Membership inference (if a
specific user data item was
present in the training
dataset) and
• Property inference
(reconstruct properties of a
participant’s dataset)
attacks.
Black box attacks are still
possible when the attacker
only has access to the APIs:
invoke the model and observe
the relationships between
inputs and outputs.
Training
dataset
wants access to
ML Model
(Classification,
Prediction)
Inference
API
has access to
Attacker
* D. Biswas. Privacy Preserving Chatbot Conversations. IEEE AIKE 2020: 179-182 (link)
*D. Biswas, K. Vidyasankar. A Privacy Framework for Hierarchical Federated Learning. CIKM Workshops 2021 (link)
Gen AI Privacy Risks – novel challenges
From a privacy point of view, we
need to consider the following
additional / different LLM privacy
risks:
- Membership and property
leakage from pre-training data
- Model features leakage from
pre-trained LLM
- Privacy leakage from
conversations (history) with
LLMs
- Compliance with privacy intent
of users
* D. Biswas. Privacy Risks of Large Language Models. AI Advances, 2024 (link)
* D. Biswas. Privacy Risks of Large Language Models.
AI Advances, 2024 (link)
Responsible deployment of AI Agents
* D. Biswas. Stateful Monitoring and Responsible Deployment of AI Agents. 17th International Conference on Agents and Artificial Intelligence (ICAART), 2025 (link)
Thanks
&
Questions
Debmalya Biswas
https://www.linkedin.com/in/debmalya-
biswas-3975261/
https://medium.com/@debmalyabiswas

More Related Content

PPTX
AI Agents and their implications for Enterprise AI Use-cases
PDF
A comprehensive guide to Agentic AI Systems
PPTX
A Privacy Framework for Hierarchical Federated Learning
PDF
Mastering Agentic AI in Production: Architectures, Deployment Strategies, and...
PDF
What’s Next in GenAI Deployment step-by-step.pdf
PDF
Agentic AI: Scalable & Responsible Deployment of AI Agents in the Enterprise
PDF
Responsible LLMOps presentation at Webit 2024
DOCX
Security Design Considerations In Robotic Process Automation.docx
AI Agents and their implications for Enterprise AI Use-cases
A comprehensive guide to Agentic AI Systems
A Privacy Framework for Hierarchical Federated Learning
Mastering Agentic AI in Production: Architectures, Deployment Strategies, and...
What’s Next in GenAI Deployment step-by-step.pdf
Agentic AI: Scalable & Responsible Deployment of AI Agents in the Enterprise
Responsible LLMOps presentation at Webit 2024
Security Design Considerations In Robotic Process Automation.docx

Similar to ICAART 2025 presentation on Stateful Monitoring and Responsible Deployment of AI Agents (20)

PPTX
Ethical AI - Open Compliance Summit 2020
PDF
Why Agentops requires an artificial Intelligence developer.pdf
DOC
Saipraveen_Cirrculum_Vitae
PDF
Sustainable & Composable Generative AI
PPTX
Compositional AI: Fusion of AI/ML Services
DOCX
Query Pattern Access and Fuzzy Clustering Based Intrusion Detection System
PDF
apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...
PDF
AI model security.pdf
PDF
How to build AI agents with ZBrain: Introduction, agent types, development an...
PDF
Authentication and Authorization for User Roles and Device for Attack Detecti...
PDF
AI AGENTS Generative AI Cognitive architecture
PDF
AI AGENTS...............................
PDF
Break Loose Acting To Forestall Emulation Blast
PDF
Cybersecurity update 12
PPTX
Agent Technology
PPTX
Agent Technology Presentation
PDF
Requirement Based Intrusion Detection in Addition to Prevention Via Advanced ...
PDF
Requirement Based Intrusion Detection in Addition to Prevention Via Advanced ...
PPTX
[DSC Adria 23] Muthu Ramachandran AI Ethics Framework for Generative AI such ...
PDF
Solving Industrial Data Integration with Machine Intelligence
Ethical AI - Open Compliance Summit 2020
Why Agentops requires an artificial Intelligence developer.pdf
Saipraveen_Cirrculum_Vitae
Sustainable & Composable Generative AI
Compositional AI: Fusion of AI/ML Services
Query Pattern Access and Fuzzy Clustering Based Intrusion Detection System
apidays Helsinki & North 2023 - API Security in the era of Generative AI, Mat...
AI model security.pdf
How to build AI agents with ZBrain: Introduction, agent types, development an...
Authentication and Authorization for User Roles and Device for Attack Detecti...
AI AGENTS Generative AI Cognitive architecture
AI AGENTS...............................
Break Loose Acting To Forestall Emulation Blast
Cybersecurity update 12
Agent Technology
Agent Technology Presentation
Requirement Based Intrusion Detection in Addition to Prevention Via Advanced ...
Requirement Based Intrusion Detection in Addition to Prevention Via Advanced ...
[DSC Adria 23] Muthu Ramachandran AI Ethics Framework for Generative AI such ...
Solving Industrial Data Integration with Machine Intelligence
Ad

More from Debmalya Biswas (14)

PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
PPTX
Gen AI: Privacy Risks of Large Language Models (LLMs)
PPTX
Constraints Enabled Autonomous Agent Marketplace: Discovery and Matchmaking
PDF
Responsible Generative AI Design Patterns
PPTX
Data-Driven (Reinforcement Learning-Based) Control
PPTX
Regulating Generative AI - LLMOps pipelines with Transparency
PPTX
MLOps for Compositional AI
PPTX
Edge AI Framework for Healthcare Applications
PPTX
Privacy Preserving Chatbot Conversations
PPTX
Reinforcement Learning based HVAC Optimization in Factories
PPTX
Delayed Rewards in the context of Reinforcement Learning based Recommender ...
PPTX
Building an enterprise Natural Language Search Engine with ElasticSearch and ...
PDF
Privacy-Preserving Outsourced Profiling
PDF
Privacy Policies Change Management for Smartphones
Agentic AI lifecycle for Enterprise Hyper-Automation
Gen AI: Privacy Risks of Large Language Models (LLMs)
Constraints Enabled Autonomous Agent Marketplace: Discovery and Matchmaking
Responsible Generative AI Design Patterns
Data-Driven (Reinforcement Learning-Based) Control
Regulating Generative AI - LLMOps pipelines with Transparency
MLOps for Compositional AI
Edge AI Framework for Healthcare Applications
Privacy Preserving Chatbot Conversations
Reinforcement Learning based HVAC Optimization in Factories
Delayed Rewards in the context of Reinforcement Learning based Recommender ...
Building an enterprise Natural Language Search Engine with ElasticSearch and ...
Privacy-Preserving Outsourced Profiling
Privacy Policies Change Management for Smartphones
Ad

Recently uploaded (20)

PPTX
Cloud computing and distributed systems.
PDF
Encapsulation theory and applications.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Spectroscopy.pptx food analysis technology
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Approach and Philosophy of On baking technology
PDF
Electronic commerce courselecture one. Pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Cloud computing and distributed systems.
Encapsulation theory and applications.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Review of recent advances in non-invasive hemoglobin estimation
Building Integrated photovoltaic BIPV_UPV.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
NewMind AI Weekly Chronicles - August'25 Week I
sap open course for s4hana steps from ECC to s4
Spectroscopy.pptx food analysis technology
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Encapsulation_ Review paper, used for researhc scholars
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Approach and Philosophy of On baking technology
Electronic commerce courselecture one. Pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
The Rise and Fall of 3GPP – Time for a Sabbatical?
Per capita expenditure prediction using model stacking based on satellite ima...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy

ICAART 2025 presentation on Stateful Monitoring and Responsible Deployment of AI Agents

  • 1. Stateful Monitoring and Responsible Deployment of AI Agents Debmalya Biswas Wipro AI
  • 3. (Complex) Agentic AI Task Decomposition A high-level approach to solving complex tasks: • - decomposition of the given complex task into a hierarchy or workflow of) simple tasks, followed by • - composition of agents able to execute the simpler tasks. This can be achieved in a dynamic or static manner. • Dynamic: given a complex user task, the system comes up with a plan to fulfil the request depending on the capabilities of available agents at run-time. • Static: given a set of agents, composite agents are defined manually at design-time combining their capabilities. * D. Biswas. Constraints Enabled Autonomous Agent Marketplace: Discovery and Matchmaking. 16th International Conference on Agents and Artificial Intelligence (ICAART), 2024 (link)
  • 4. Agentic AI Platform Reference Architecture The future where enterprises will be able to develop new Enterprise AI Apps by orchestrating / composing multiple existing AI Agents.
  • 6. Non-determinism in Agentic AI Systems There are two non-deterministic operators in the execution plan: ‘Check Credit’ and ‘Delivery Mode’. The choice ‘Delivery Mode’ indicates that the user can either pick-up the order directly from the store or have it shipped to his address. Given this, shipping is a non- deterministic choice and may not be invoked during the actual execution.
  • 7. Observability Challenges for Agentic AI Observability for AI Agents is challenging: - No global observer: Due to their distributed nature, we cannot assume the existence of an entity having visibility over the entire execution. In fact, due to their privacy and autonomy requirements, even the composite agent may not have visibility over the internal processing of its component agents. - Parallelism: AI agents allow parallel composition of processes. - Dynamic configuration: The agents are selected incrementally as the execution progresses (dynamic binding). Thus, the “components” of the distributed system may not be known in advance.
  • 8. Stateful execution for AI Agents AgentOps monitoring is critical given the complexity and long running nature of AI agents. We define observability as the ability to find out where in the process the execution is and whether any unanticipated glitches have appeared. - Local queries: Queries which can be answered based on the local state information of an agent. - Composite queries: Queries expressed over the states of several agents. - Historical queries: Queries related to the execution history of the composition. - Relationship queries: Queries based on the relationship between states.
  • 10. Data Quality Issues with respect to LLMs, esp. Vector DBs From a data quality point of view, we see the following challenges w.r.t. LLMs, esp. Vector DBs: - Accuracy of the encodings in vector stores, measures in terms of correctness and groundedness of the generated LLM responses. - Incorrect and/or inconsistent vectors: Due to issues in the embedding process, some vectors may end up getting corrupted, be incomplete, or getting generated with a different dimensionality. - Missing data can be in the form of missing vectors or metadata. - Timeliness issues w.r.t. outdated documents impacting the vector store. * D. Biswas. Long-term Memory for AI Agents. AI Advances, 2024 (link) * D. Biswas. Long-term Memory for AI Agents. AI Advances, 2024 (link)
  • 11. Explainability Explainable AI is an umbrella term for a range of tools, algorithms and methods; which accompany AI model predictions with explanations. - Explainability of AI models ranks high among the list of ‘non- functional’ AI features to be considered by enterprises. - For example, this implies having to explain why an ML model profiled a user to be in a specific segment — which led him/her to receiving an advertisement. (Labeled) Data Train ML Model Predictions Explanation Model Explainable Predictions
  • 12. Fairness & Bias Bias creeps into AI models, primarily due to the inherent bias already present in the training data. So the ‘data’ part of AI model development is key to addressing bias. - Historical Bias: arises due to historical inequality of human decisions captured in the training data - Representation Bias: arises due to training data that is not representative of the actual population. *H. Suresh, J. V. Guttag. A Framework for Understanding Unintended Consequences of Machine Learning, 2020 (link) *H. Suresh, J. V. Guttag. A Framework for Understanding Unintended Consequences of Machine Learning, 2020 (link)
  • 13. ML Privacy Risks Two broad categories of privacy inference attacks: • Membership inference (if a specific user data item was present in the training dataset) and • Property inference (reconstruct properties of a participant’s dataset) attacks. Black box attacks are still possible when the attacker only has access to the APIs: invoke the model and observe the relationships between inputs and outputs. Training dataset wants access to ML Model (Classification, Prediction) Inference API has access to Attacker * D. Biswas. Privacy Preserving Chatbot Conversations. IEEE AIKE 2020: 179-182 (link) *D. Biswas, K. Vidyasankar. A Privacy Framework for Hierarchical Federated Learning. CIKM Workshops 2021 (link)
  • 14. Gen AI Privacy Risks – novel challenges From a privacy point of view, we need to consider the following additional / different LLM privacy risks: - Membership and property leakage from pre-training data - Model features leakage from pre-trained LLM - Privacy leakage from conversations (history) with LLMs - Compliance with privacy intent of users * D. Biswas. Privacy Risks of Large Language Models. AI Advances, 2024 (link) * D. Biswas. Privacy Risks of Large Language Models. AI Advances, 2024 (link)
  • 15. Responsible deployment of AI Agents * D. Biswas. Stateful Monitoring and Responsible Deployment of AI Agents. 17th International Conference on Agents and Artificial Intelligence (ICAART), 2025 (link)