GenAI Agents: Major Applications (Part1)

GenAI Agents: Major Applications (Part1)
Vladimir Kanchev, PhD

Contents
1. Introduction. History and Basic Definitions.
2. Types and Structures of GenAI Agents.
3. Applications of GenAI Agents.
4. Evaluation of GenAI Agents.
5. Types of GenAI Frameworks and Technologies.
6. Challenges and Future of GenAI.

Introduction
GenAI Agents
Def: An autonomous system that uses advanced generative AI
models to create and interact through a human-like text, images,
audio, or other media. They often use text interfaces, enabling users
to input prompts or queries and receive context-aware,
conversational responses. (ChatGPT)
a

Introduction
General AI Agents Properties:
●
autonomy: independently make decisions
●
perception: gather information about their environment
●
decision-making: select appropriate actions
●
action: their actions change the state of their environment
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.

History
AI agents:
• Basic concept of computations
• Theory of mind
• Society of Mind (Minsky)
• Symbolic AI

History
Types of AI agents:
●
reflex agents: independently make a decision
●
goal-based agents: gather information about their
environment
●
utility-based agents: select appropriate actions
●
reinforcement learning (RL) agents
●
GenAI agents
●
AGI agents

History

Present
WANG, Yuntao, et al. Large Model Agents: State-of-the-Art, Cooperation Paradigms, Security and Privacy, and Future Trends. arXiv preprint
arXiv:2409.14457, 2024.

Basic Definitions
The ChatGPT era – November, 2022.
Properties of LLM models:
●
extensive knowledge base: the entire internet
●
adaptability: context learning by zero-shot, few-shot
learning
●
human-computer communication
●
scalability

Basic Definitions
Disadvantages of LLM models:
●
context length constraints
●
prolonged knowledge update
●
no direct tool support – LLM’s cannot employ code
interpreters, calculators, etc.
●
potential for biased or inaccurate output (hallucinations)
●
dependence on the training of data

Types of GenAI Agents
Different types of GenAI depending on:
●
number of LLM agents in the system: single or multi-agent
●
the type of LLM model: single-modal or multi-modal
●
the type of task specialization: general purpose and task-
specific

Single GenAI Agents
5 major properties of a single GenAI agent:
●
LLM model: serves as an agent brain, making decisions
●
objective: terminal state the agent must achieve
●
action: set of executable tasks that change the environment
●
memory: stores the environment feedback and task history
to improve performance
●
reflection (rethink): evaluates previous actions and feedback

Single GenAI Agents
2 external parts of a single GenAI agent:
●
tool: extensions of the agent's actions, such as calculators,
code interpreters, or robotic arms.
●
environment: provides input, constraints, and feedback that
influence the agent's behavior and decision-making

Single GenAI Agents
Planning
Def: Building an action sequence based on set objectives and
adapting it to the environment constraints to secure goal
achievement. It is built on the reasoning of an LLM model not
on a learned policy (RL agent).

Single GenAI Agents
Memory in a GenAI agent:
●
preserves knowledge and data from experience
●
represents the current state of the GenAI agent
●
can be textual, a vector/graph database, procedural
●
supports adaptation
●
facilitates personalization

Single GenAI Agents

Single GenAI Agents
Rethinking the ability or retrospection in a GenAI agent:
●
evaluates prior decisions
●
enhances the GenAI agent’s decision-making and learning
●
improves the GenAI agent’s adaptability

Single GenAI Agents
Environment of the GenAI agent:
●
is of a specific type
●
is influenced by the agent itself
●
helps dynamic adaptation
●
helps interactive learning

Single GenAI Agents
Action
Def: An agent extends its functionality by using external tools,
such as APIs, calculators, code interpreters, or specialized
software, to perform complex actions beyond its internal
reasoning capabilities. Thus, it interacts with its environment
and achieves the objectives more efficiently.
Types: single-tool, multi-tool, task-oriented, generalist,
environment-specific tool users.

Multi-agent Systems (MAS)
Def: A collaborative framework of multiple interacting
intelligent agents, each one with specialized roles or
capabilities, working to achieve complex objectives. Their
tasks usually span multiple domains, require distributed
problem-solving, or have parallelized workflows.

Types of multi-agent systems:
●
multi-role coordination - cooperative, competitive, mixed,
hierarchical
●
planning Type - Centralized Planning Decentralized
Execution (CPDE) and Decentralized Planning Decentralized
Execution (DPDE)

Contents
1. Introduction. History and Basic Definitions.
2. Types and Structures of GenAI Agents.
3. Applications of GenAI Agents.
4. Evaluation of GenAI Agents.
5. Types of GenAI Agent Frameworks and Technologies.
6. Challenges and Future of GenAI.

Applications of GenAI Agents
Here we provide a few applications:
●
a single task – React/Reflexion agents, content/code
creation, documentation/test cases generation
●
multiple agents – software development/maintenance,
collaborative coding, world simulations

REACT Single GenAI Agent
●
combines Reasoning and Acting
●
iterative feedback loop: Reason Act Observe Refine
→ → →
●
core features: LLM-based reasoning, contextual memory,
and tool integration (e.g. APIs, search engines)
●
broad applicability: from customer support and healthcare
to autonomous vehicles and R&D.
YAO, Shunyu, et al. ReAct: Synergizing Reasoning and Acting in Language Models. arXiv e-prints, 2022, arXiv: 2210.03629.

RIGAKI, Maria, et al. Out of the cage: How stochastic parrots win in cyber security environments. arXiv preprint arXiv:2308.12086, 2023.

YAO, Shunyu, et al. ReAct: Synergizing Reasoning and Acting in Language Models. arXiv e-prints, 2022, arXiv: 2210.03629.

REFLEXION Single GenAI Agent
●
learns through reflection
●
self-corrects through feedback
●
combines memory and adaptability.
●
has real-world applications
SHINN, Noah, et al. Reflexion: Language agents with verbal reinforcement learning.(2023). arXiv preprint cs.AI/2303.11366, 2023.

SHINN, Noah, et al. Reflexion: Language agents with verbal reinforcement learning.(2023). arXiv preprint cs.AI/2303.11366, 2023.

Agents for Software Engineering
GenAI agents can be oriented to:
●
a single SE task (single agent) – code generation, code
quality assurance (testing)
●
end-to-end SE task (multiple agents) – software
development/maintenance

Agents for Software Engineering
LIU, Junwei, et al. Large language model-based agents for software engineering: A survey. arXiv preprint arXiv:2409.02977, 2024.

End-to-end software development (SD) agent system:
●
covers the whole SD life-cycle
●
aligns with software process models: waterfall or agile
●
has a role-specific specialization
●
features a collaborative interaction
●
relies on more advanced communication

Multi-agent SD System

Мulti-agent for World Simulation
Here we have:
●
scenario-specific simulations
●
social and environmental interactions
●
embodied agents with diverse roles
●
advanced agent interactions
●
controlled sandbox environments

MP5 – A Multi-modal Open-ended
Embodied System in Minecraft
●
Minecraft ecosystem
●
MineDojo framework
●
multi-modal and open-ended tasks
●
task handling: context dependent and process-dependent
●
the MP5 System Architecture consists of: Parser,
Percipient, Planner, Performer, Patroller
QIN, Yiran, et al. Mp5: A multi-modal open-ended embodied system in minecraft via active perception. In: 2024 IEEE/CVF Conference on
Computer Vision and Pattern Recognition (CVPR). IEEE, 2024. p. 16307-16316.

MP5 – A multi-modal open-ended
embedded system in Minecraft
QIN, Yiran, et al. Mp5: A multi-modal open-ended embodied system in minecraft via active perception. In: 2024 IEEE/CVF Conference on
Computer Vision and Pattern Recognition (CVPR). IEEE, 2024. p. 16307-16316.

Evaluation of GenAI Agents
A single GenAI agent evaluation is characterized by:
• performance of narrow tasks
• adaptability to perform unseen tasks
• alignment with the user’s goals
• business metrics
Metrics: Task-specific accuracy, reasoning and adaptability,
user interaction quality, robustness and reliability, efficiency

Task-oriented benchmarks for single GenAI agents:
●
NLP-specific benchmarks (e.g. HELM)
●
multi-modal benchmarks (e.g. SEED-Bench)
●
agentic benchmarks (e.g. AgentBench)
●
autonomous decision-making (e.g. ALFWorld)

Human-AI interaction has:
●
response quality (e.g. relevance, coherence, accuracy)
●
user satisfaction and trust levels (user satisfaction score)
●
efficiency in task completion (task completion time,
reduction in user queries)
●
ability to take feedback and improve over time (retrain
latency)

A multiple GenAI agents evaluation is oriented to:
●
coordination and communication
●
emergent behaviors
●
scalability and resource efficiency
●
business metrics

Evaluation metrics of multiple GenAI agents for:
●
coordination efficiency (inter-agent communication success
rate)
●
emergent behavior analysis (bias detection score)
●
scalability and resource utilization (bandwidth efficiency)
●
inter-agent communication (message latency)
●
competitive and cooperative performance (cooperation
score)

GenAI Agents: Major Applications (Part1)

More Related Content

Similar to GenAI Agents: Major Applications (Part1) (20)

More from Vladimir Kanchev (8)

Recently uploaded (20)

GenAI Agents: Major Applications (Part1)