SlideShare a Scribd company logo
GenAI Agents: Major Applications (Part1)
Vladimir Kanchev, PhD
Contents
1. Introduction. History and Basic Definitions.
2. Types and Structures of GenAI Agents.
3. Applications of GenAI Agents.
4. Evaluation of GenAI Agents.
5. Types of GenAI Frameworks and Technologies.
6. Challenges and Future of GenAI.
Introduction
GenAI Agents
Def: An autonomous system that uses advanced generative AI
models to create and interact through a human-like text, images,
audio, or other media. They often use text interfaces, enabling users
to input prompts or queries and receive context-aware,
conversational responses. (ChatGPT)
a
Introduction
General AI Agents Properties:
ā—
autonomy: independently make decisions
ā—
perception: gather information about their environment
ā—
decision-making: select appropriate actions
ā—
action: their actions change the state of their environment
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
History
AI agents:
• Basic concept of computations
• Theory of mind
• Society of Mind (Minsky)
• Symbolic AI
History
Types of AI agents:
ā—
reflex agents: independently make a decision
ā—
goal-based agents: gather information about their
environment
ā—
utility-based agents: select appropriate actions
ā—
reinforcement learning (RL) agents
ā—
GenAI agents
ā—
AGI agents
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
History
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Present
WANG, Yuntao, et al. Large Model Agents: State-of-the-Art, Cooperation Paradigms, Security and Privacy, and Future Trends. arXiv preprint
arXiv:2409.14457, 2024.
Basic Definitions
The ChatGPT era – November, 2022.
Properties of LLM models:
ā—
extensive knowledge base: the entire internet
ā—
adaptability: context learning by zero-shot, few-shot
learning
ā—
human-computer communication
ā—
scalability
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Basic Definitions
Disadvantages of LLM models:
ā—
context length constraints
ā—
prolonged knowledge update
ā—
no direct tool support – LLM’s cannot employ code
interpreters, calculators, etc.
ā—
potential for biased or inaccurate output (hallucinations)
ā—
dependence on the training of data
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Contents
1. Introduction. History and Basic Definitions.
2. Types and Structures of GenAI Agents.
3. Applications of GenAI Agents.
4. Evaluation of GenAI Agents.
5. Types of GenAI Frameworks and Technologies.
6. Challenges and Future of GenAI.
Types of GenAI Agents
Different types of GenAI depending on:
ā—
number of LLM agents in the system: single or multi-agent
ā—
the type of LLM model: single-modal or multi-modal
ā—
the type of task specialization: general purpose and task-
specific
Single GenAI Agents
5 major properties of a single GenAI agent:
ā—
LLM model: serves as an agent brain, making decisions
ā—
objective: terminal state the agent must achieve
ā—
action: set of executable tasks that change the environment
ā—
memory: stores the environment feedback and task history
to improve performance
ā—
reflection (rethink): evaluates previous actions and feedback
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Single GenAI Agents
2 external parts of a single GenAI agent:
ā—
tool: extensions of the agent's actions, such as calculators,
code interpreters, or robotic arms.
ā—
environment: provides input, constraints, and feedback that
influence the agent's behavior and decision-making
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Single GenAI Agents
Planning
Def: Building an action sequence based on set objectives and
adapting it to the environment constraints to secure goal
achievement. It is built on the reasoning of an LLM model not
on a learned policy (RL agent).
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Single GenAI Agents
Memory in a GenAI agent:
ā—
preserves knowledge and data from experience
ā—
represents the current state of the GenAI agent
ā—
can be textual, a vector/graph database, procedural
ā—
supports adaptation
ā—
facilitates personalization
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Single GenAI Agents
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Single GenAI Agents
Rethinking the ability or retrospection in a GenAI agent:
ā—
evaluates prior decisions
ā—
enhances the GenAI agent’s decision-making and learning
ā—
improves the GenAI agent’s adaptability
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Single GenAI Agents
Environment of the GenAI agent:
ā—
is of a specific type
ā—
is influenced by the agent itself
ā—
helps dynamic adaptation
ā—
helps interactive learning
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Single GenAI Agents
Action
Def: An agent extends its functionality by using external tools,
such as APIs, calculators, code interpreters, or specialized
software, to perform complex actions beyond its internal
reasoning capabilities. Thus, it interacts with its environment
and achieves the objectives more efficiently.
Types: single-tool, multi-tool, task-oriented, generalist,
environment-specific tool users.
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Multi-agent Systems (MAS)
Def: A collaborative framework of multiple interacting
intelligent agents, each one with specialized roles or
capabilities, working to achieve complex objectives. Their
tasks usually span multiple domains, require distributed
problem-solving, or have parallelized workflows.
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Multi-agent Systems (MAS)
Types of multi-agent systems:
ā—
multi-role coordination - cooperative, competitive, mixed,
hierarchical
ā—
planning Type - Centralized Planning Decentralized
Execution (CPDE) and Decentralized Planning Decentralized
Execution (DPDE)
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Multi-agent Systems (MAS)
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Multi-agent Systems (MAS)
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
Contents
1. Introduction. History and Basic Definitions.
2. Types and Structures of GenAI Agents.
3. Applications of GenAI Agents.
4. Evaluation of GenAI Agents.
5. Types of GenAI Agent Frameworks and Technologies.
6. Challenges and Future of GenAI.
Applications of GenAI Agents
Here we provide a few applications:
ā—
a single task – React/Reflexion agents, content/code
creation, documentation/test cases generation
ā—
multiple agents – software development/maintenance,
collaborative coding, world simulations
CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects.
arXiv e-prints, 2024, arXiv: 2401.03428.
REACT Single GenAI Agent
ā—
combines Reasoning and Acting
ā—
iterative feedback loop: Reason Act Observe Refine
→ → →
ā—
core features: LLM-based reasoning, contextual memory,
and tool integration (e.g. APIs, search engines)
ā—
broad applicability: from customer support and healthcare
to autonomous vehicles and R&D.
YAO, Shunyu, et al. ReAct: Synergizing Reasoning and Acting in Language Models. arXiv e-prints, 2022, arXiv: 2210.03629.
RIGAKI, Maria, et al. Out of the cage: How stochastic parrots win in cyber security environments. arXiv preprint arXiv:2308.12086, 2023.
YAO, Shunyu, et al. ReAct: Synergizing Reasoning and Acting in Language Models. arXiv e-prints, 2022, arXiv: 2210.03629.
REFLEXION Single GenAI Agent
ā—
learns through reflection
ā—
self-corrects through feedback
ā—
combines memory and adaptability.
ā—
has real-world applications
SHINN, Noah, et al. Reflexion: Language agents with verbal reinforcement learning.(2023). arXiv preprint cs.AI/2303.11366, 2023.
SHINN, Noah, et al. Reflexion: Language agents with verbal reinforcement learning.(2023). arXiv preprint cs.AI/2303.11366, 2023.
SHINN, Noah, et al. Reflexion: Language agents with verbal reinforcement learning.(2023). arXiv preprint cs.AI/2303.11366, 2023.
Agents for Software Engineering
GenAI agents can be oriented to:
ā—
a single SE task (single agent) – code generation, code
quality assurance (testing)
ā—
end-to-end SE task (multiple agents) – software
development/maintenance
Agents for Software Engineering
LIU, Junwei, et al. Large language model-based agents for software engineering: A survey. arXiv preprint arXiv:2409.02977, 2024.
Multi-agent Systems (MAS)
End-to-end software development (SD) agent system:
ā—
covers the whole SD life-cycle
ā—
aligns with software process models: waterfall or agile
ā—
has a role-specific specialization
ā—
features a collaborative interaction
ā—
relies on more advanced communication
LIU, Junwei, et al. Large language model-based agents for software engineering: A survey. arXiv preprint arXiv:2409.02977, 2024.
Multi-agent SD System
LIU, Junwei, et al. Large language model-based agents for software engineering: A survey. arXiv preprint arXiv:2409.02977, 2024.
Мulti-agent for World Simulation
Here we have:
ā—
scenario-specific simulations
ā—
social and environmental interactions
ā—
embodied agents with diverse roles
ā—
advanced agent interactions
ā—
controlled sandbox environments
MP5 – A Multi-modal Open-ended
Embodied System in Minecraft
ā—
Minecraft ecosystem
ā—
MineDojo framework
ā—
multi-modal and open-ended tasks
ā—
task handling: context dependent and process-dependent
ā—
the MP5 System Architecture consists of: Parser,
Percipient, Planner, Performer, Patroller
QIN, Yiran, et al. Mp5: A multi-modal open-ended embodied system in minecraft via active perception. In: 2024 IEEE/CVF Conference on
Computer Vision and Pattern Recognition (CVPR). IEEE, 2024. p. 16307-16316.
MP5 – A multi-modal open-ended
embedded system in Minecraft
QIN, Yiran, et al. Mp5: A multi-modal open-ended embodied system in minecraft via active perception. In: 2024 IEEE/CVF Conference on
Computer Vision and Pattern Recognition (CVPR). IEEE, 2024. p. 16307-16316.
Contents
1. Introduction. History and Basic Definitions.
2. Types and Structures of GenAI Agents.
3. Applications of GenAI Agents.
4. Evaluation of GenAI Agents.
5. Types of GenAI Frameworks and Technologies.
6. Challenges and Future of GenAI.
Evaluation of GenAI Agents
A single GenAI agent evaluation is characterized by:
• performance of narrow tasks
• adaptability to perform unseen tasks
• alignment with the user’s goals
• business metrics
Metrics: Task-specific accuracy, reasoning and adaptability,
user interaction quality, robustness and reliability, efficiency
Evaluation of GenAI Agents
Task-oriented benchmarks for single GenAI agents:
ā—
NLP-specific benchmarks (e.g. HELM)
ā—
multi-modal benchmarks (e.g. SEED-Bench)
ā—
agentic benchmarks (e.g. AgentBench)
ā—
autonomous decision-making (e.g. ALFWorld)
Evaluation of GenAI Agents
Human-AI interaction has:
ā—
response quality (e.g. relevance, coherence, accuracy)
ā—
user satisfaction and trust levels (user satisfaction score)
ā—
efficiency in task completion (task completion time,
reduction in user queries)
ā—
ability to take feedback and improve over time (retrain
latency)
Evaluation of GenAI Agents
A multiple GenAI agents evaluation is oriented to:
ā—
coordination and communication
ā—
emergent behaviors
ā—
scalability and resource efficiency
ā—
business metrics
Evaluation of GenAI Agents
Evaluation metrics of multiple GenAI agents for:
ā—
coordination efficiency (inter-agent communication success
rate)
ā—
emergent behavior analysis (bias detection score)
ā—
scalability and resource utilization (bandwidth efficiency)
ā—
inter-agent communication (message latency)
ā—
competitive and cooperative performance (cooperation
score)

More Related Content

PPTX
AI Agents and their implications for Enterprise AI Use-cases
PDF
Devoxx Morocco 2024 - The Future Beyond LLMs: Exploring Agentic AI
PDF
From Assistants to Autopilots_ The Rise of AI Agents.pdf
PPTX
Unlocking the Future of AI Agents with Large Language Models
PDF
Prensentation_on_AI_Agents_and_their_classification
PDF
AGI Part 1.pdf
PDF
AI AGENTS...............................
Ā 
PDF
AI AGENTS Generative AI Cognitive architecture
Ā 
AI Agents and their implications for Enterprise AI Use-cases
Devoxx Morocco 2024 - The Future Beyond LLMs: Exploring Agentic AI
From Assistants to Autopilots_ The Rise of AI Agents.pdf
Unlocking the Future of AI Agents with Large Language Models
Prensentation_on_AI_Agents_and_their_classification
AGI Part 1.pdf
AI AGENTS...............................
Ā 
AI AGENTS Generative AI Cognitive architecture
Ā 

Similar to GenAI Agents: Major Applications (Part1) (20)

PDF
Kono.IntelCraft.Weekly.AI.LLM.Landscape.2024.02.28.pdf
PDF
PDF
LLM-based Multi-Agent Systems to Replace Traditional Software
PDF
Building Your Own AI Agent System: A Comprehensive Guide
PDF
A comprehensive guide to Agentic AI Systems
PDF
Introduction to GenAI - Past, Present, Future
PPTX
Scalable Multi-Agent AI with AutoGen by Udai
PPTX
UNIT I - AI.pptx
PDF
Adobe XD 50.0.12 for MacOS Crack Ā  Free
PDF
Vivaldi Web Browser 7.3.3635.7 Free Download
PDF
Markdown Monster 3.6.9 Free key Download
PDF
Adobe XD 50.0.12 for MacOS CrackĀ  Free Download
PDF
lanamalic-aiagents-250212223710-84219c4c-250408115702-2f9e4f0e.pdf
PDF
IObit Driver Booster Pro 12.3.0.557 Free
PPTX
AgenticAI Architecture In Feature 2025.pptx
PPTX
[DSC Europe 24] Lana Malic - AI Agents: The Future of Autonomous Decision-Mak...
PDF
TunesKit Video Cutter 3.0.0.54 Download
PDF
Wondershare Dr.Fone toolkit for free crack
PDF
Agentic architectures and workflows @ AIware Bootcamp 2024
PPTX
From-LLM-Powered-Chatbots-to-AI-Agents.pptx
Kono.IntelCraft.Weekly.AI.LLM.Landscape.2024.02.28.pdf
LLM-based Multi-Agent Systems to Replace Traditional Software
Building Your Own AI Agent System: A Comprehensive Guide
A comprehensive guide to Agentic AI Systems
Introduction to GenAI - Past, Present, Future
Scalable Multi-Agent AI with AutoGen by Udai
UNIT I - AI.pptx
Adobe XD 50.0.12 for MacOS Crack Ā  Free
Vivaldi Web Browser 7.3.3635.7 Free Download
Markdown Monster 3.6.9 Free key Download
Adobe XD 50.0.12 for MacOS CrackĀ  Free Download
lanamalic-aiagents-250212223710-84219c4c-250408115702-2f9e4f0e.pdf
IObit Driver Booster Pro 12.3.0.557 Free
AgenticAI Architecture In Feature 2025.pptx
[DSC Europe 24] Lana Malic - AI Agents: The Future of Autonomous Decision-Mak...
TunesKit Video Cutter 3.0.0.54 Download
Wondershare Dr.Fone toolkit for free crack
Agentic architectures and workflows @ AIware Bootcamp 2024
From-LLM-Powered-Chatbots-to-AI-Agents.pptx
Ad

More from Vladimir Kanchev (8)

PPTX
Ethical Issues in Machine Learning Algorithms. (Part 3)
PPTX
Ethical Issues in Machine Learning Algorithms (Part 2)
PPTX
Ethical Issues in Machine Learning Algorithms. (Part 1)
PPTX
Tissue segmentation methods using 2D histogram matching in a sequence of mr b...
PPTX
Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...
PPTX
Tissue Segmentation Methods using 2D Hiistogram Matching in a Sequence of MR ...
PPTX
Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...
PPT
Importance and Challenges of Reproducible Research
Ethical Issues in Machine Learning Algorithms. (Part 3)
Ethical Issues in Machine Learning Algorithms (Part 2)
Ethical Issues in Machine Learning Algorithms. (Part 1)
Tissue segmentation methods using 2D histogram matching in a sequence of mr b...
Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...
Tissue Segmentation Methods using 2D Hiistogram Matching in a Sequence of MR ...
Tissue Segmentation Methods Using 2D Histogram Matching in a Sequence of MR B...
Importance and Challenges of Reproducible Research
Ad

Recently uploaded (20)

PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Web App vs Mobile App What Should You Build First.pdf
Ā 
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
A novel scalable deep ensemble learning framework for big data classification...
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PPTX
TLE Review Electricity (Electricity).pptx
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PPTX
Tartificialntelligence_presentation.pptx
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
observCloud-Native Containerability and monitoring.pptx
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Ā 
PPTX
Modernising the Digital Integration Hub
PDF
Transform Your ITILĀ® 4 & ITSM Strategy with AI in 2025.pdf
NewMind AI Weekly Chronicles - August'25-Week II
A contest of sentiment analysis: k-nearest neighbor versus neural network
Programs and apps: productivity, graphics, security and other tools
Web App vs Mobile App What Should You Build First.pdf
Ā 
1 - Historical Antecedents, Social Consideration.pdf
Getting started with AI Agents and Multi-Agent Systems
WOOl fibre morphology and structure.pdf for textiles
A novel scalable deep ensemble learning framework for big data classification...
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
TLE Review Electricity (Electricity).pptx
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
Tartificialntelligence_presentation.pptx
A comparative study of natural language inference in Swahili using monolingua...
observCloud-Native Containerability and monitoring.pptx
O2C Customer Invoices to Receipt V15A.pptx
Chapter 5: Probability Theory and Statistics
Enhancing emotion recognition model for a student engagement use case through...
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Ā 
Modernising the Digital Integration Hub
Transform Your ITILĀ® 4 & ITSM Strategy with AI in 2025.pdf

GenAI Agents: Major Applications (Part1)

  • 1. GenAI Agents: Major Applications (Part1) Vladimir Kanchev, PhD
  • 2. Contents 1. Introduction. History and Basic Definitions. 2. Types and Structures of GenAI Agents. 3. Applications of GenAI Agents. 4. Evaluation of GenAI Agents. 5. Types of GenAI Frameworks and Technologies. 6. Challenges and Future of GenAI.
  • 3. Introduction GenAI Agents Def: An autonomous system that uses advanced generative AI models to create and interact through a human-like text, images, audio, or other media. They often use text interfaces, enabling users to input prompts or queries and receive context-aware, conversational responses. (ChatGPT) a
  • 4. Introduction General AI Agents Properties: ā— autonomy: independently make decisions ā— perception: gather information about their environment ā— decision-making: select appropriate actions ā— action: their actions change the state of their environment CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 5. History AI agents: • Basic concept of computations • Theory of mind • Society of Mind (Minsky) • Symbolic AI
  • 6. History Types of AI agents: ā— reflex agents: independently make a decision ā— goal-based agents: gather information about their environment ā— utility-based agents: select appropriate actions ā— reinforcement learning (RL) agents ā— GenAI agents ā— AGI agents CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 7. History CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 8. Present WANG, Yuntao, et al. Large Model Agents: State-of-the-Art, Cooperation Paradigms, Security and Privacy, and Future Trends. arXiv preprint arXiv:2409.14457, 2024.
  • 9. Basic Definitions The ChatGPT era – November, 2022. Properties of LLM models: ā— extensive knowledge base: the entire internet ā— adaptability: context learning by zero-shot, few-shot learning ā— human-computer communication ā— scalability CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 10. Basic Definitions Disadvantages of LLM models: ā— context length constraints ā— prolonged knowledge update ā— no direct tool support – LLM’s cannot employ code interpreters, calculators, etc. ā— potential for biased or inaccurate output (hallucinations) ā— dependence on the training of data CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 11. Contents 1. Introduction. History and Basic Definitions. 2. Types and Structures of GenAI Agents. 3. Applications of GenAI Agents. 4. Evaluation of GenAI Agents. 5. Types of GenAI Frameworks and Technologies. 6. Challenges and Future of GenAI.
  • 12. Types of GenAI Agents Different types of GenAI depending on: ā— number of LLM agents in the system: single or multi-agent ā— the type of LLM model: single-modal or multi-modal ā— the type of task specialization: general purpose and task- specific
  • 13. Single GenAI Agents 5 major properties of a single GenAI agent: ā— LLM model: serves as an agent brain, making decisions ā— objective: terminal state the agent must achieve ā— action: set of executable tasks that change the environment ā— memory: stores the environment feedback and task history to improve performance ā— reflection (rethink): evaluates previous actions and feedback CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 14. Single GenAI Agents 2 external parts of a single GenAI agent: ā— tool: extensions of the agent's actions, such as calculators, code interpreters, or robotic arms. ā— environment: provides input, constraints, and feedback that influence the agent's behavior and decision-making CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 15. CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 16. Single GenAI Agents Planning Def: Building an action sequence based on set objectives and adapting it to the environment constraints to secure goal achievement. It is built on the reasoning of an LLM model not on a learned policy (RL agent). CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 17. CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 18. Single GenAI Agents Memory in a GenAI agent: ā— preserves knowledge and data from experience ā— represents the current state of the GenAI agent ā— can be textual, a vector/graph database, procedural ā— supports adaptation ā— facilitates personalization CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 19. Single GenAI Agents CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 20. Single GenAI Agents Rethinking the ability or retrospection in a GenAI agent: ā— evaluates prior decisions ā— enhances the GenAI agent’s decision-making and learning ā— improves the GenAI agent’s adaptability CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 21. CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 22. Single GenAI Agents Environment of the GenAI agent: ā— is of a specific type ā— is influenced by the agent itself ā— helps dynamic adaptation ā— helps interactive learning CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 23. CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 24. Single GenAI Agents Action Def: An agent extends its functionality by using external tools, such as APIs, calculators, code interpreters, or specialized software, to perform complex actions beyond its internal reasoning capabilities. Thus, it interacts with its environment and achieves the objectives more efficiently. Types: single-tool, multi-tool, task-oriented, generalist, environment-specific tool users. CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 25. CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 26. Multi-agent Systems (MAS) Def: A collaborative framework of multiple interacting intelligent agents, each one with specialized roles or capabilities, working to achieve complex objectives. Their tasks usually span multiple domains, require distributed problem-solving, or have parallelized workflows. CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 27. Multi-agent Systems (MAS) Types of multi-agent systems: ā— multi-role coordination - cooperative, competitive, mixed, hierarchical ā— planning Type - Centralized Planning Decentralized Execution (CPDE) and Decentralized Planning Decentralized Execution (DPDE) CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 28. CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 29. Multi-agent Systems (MAS) CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 30. Multi-agent Systems (MAS) CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 31. CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 32. Contents 1. Introduction. History and Basic Definitions. 2. Types and Structures of GenAI Agents. 3. Applications of GenAI Agents. 4. Evaluation of GenAI Agents. 5. Types of GenAI Agent Frameworks and Technologies. 6. Challenges and Future of GenAI.
  • 33. Applications of GenAI Agents Here we provide a few applications: ā— a single task – React/Reflexion agents, content/code creation, documentation/test cases generation ā— multiple agents – software development/maintenance, collaborative coding, world simulations CHENG, Yuheng, et al. Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects. arXiv e-prints, 2024, arXiv: 2401.03428.
  • 34. REACT Single GenAI Agent ā— combines Reasoning and Acting ā— iterative feedback loop: Reason Act Observe Refine → → → ā— core features: LLM-based reasoning, contextual memory, and tool integration (e.g. APIs, search engines) ā— broad applicability: from customer support and healthcare to autonomous vehicles and R&D. YAO, Shunyu, et al. ReAct: Synergizing Reasoning and Acting in Language Models. arXiv e-prints, 2022, arXiv: 2210.03629.
  • 35. RIGAKI, Maria, et al. Out of the cage: How stochastic parrots win in cyber security environments. arXiv preprint arXiv:2308.12086, 2023.
  • 36. YAO, Shunyu, et al. ReAct: Synergizing Reasoning and Acting in Language Models. arXiv e-prints, 2022, arXiv: 2210.03629.
  • 37. REFLEXION Single GenAI Agent ā— learns through reflection ā— self-corrects through feedback ā— combines memory and adaptability. ā— has real-world applications SHINN, Noah, et al. Reflexion: Language agents with verbal reinforcement learning.(2023). arXiv preprint cs.AI/2303.11366, 2023.
  • 38. SHINN, Noah, et al. Reflexion: Language agents with verbal reinforcement learning.(2023). arXiv preprint cs.AI/2303.11366, 2023.
  • 39. SHINN, Noah, et al. Reflexion: Language agents with verbal reinforcement learning.(2023). arXiv preprint cs.AI/2303.11366, 2023.
  • 40. Agents for Software Engineering GenAI agents can be oriented to: ā— a single SE task (single agent) – code generation, code quality assurance (testing) ā— end-to-end SE task (multiple agents) – software development/maintenance
  • 41. Agents for Software Engineering LIU, Junwei, et al. Large language model-based agents for software engineering: A survey. arXiv preprint arXiv:2409.02977, 2024.
  • 42. Multi-agent Systems (MAS) End-to-end software development (SD) agent system: ā— covers the whole SD life-cycle ā— aligns with software process models: waterfall or agile ā— has a role-specific specialization ā— features a collaborative interaction ā— relies on more advanced communication LIU, Junwei, et al. Large language model-based agents for software engineering: A survey. arXiv preprint arXiv:2409.02977, 2024.
  • 43. Multi-agent SD System LIU, Junwei, et al. Large language model-based agents for software engineering: A survey. arXiv preprint arXiv:2409.02977, 2024.
  • 44. Мulti-agent for World Simulation Here we have: ā— scenario-specific simulations ā— social and environmental interactions ā— embodied agents with diverse roles ā— advanced agent interactions ā— controlled sandbox environments
  • 45. MP5 – A Multi-modal Open-ended Embodied System in Minecraft ā— Minecraft ecosystem ā— MineDojo framework ā— multi-modal and open-ended tasks ā— task handling: context dependent and process-dependent ā— the MP5 System Architecture consists of: Parser, Percipient, Planner, Performer, Patroller QIN, Yiran, et al. Mp5: A multi-modal open-ended embodied system in minecraft via active perception. In: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2024. p. 16307-16316.
  • 46. MP5 – A multi-modal open-ended embedded system in Minecraft QIN, Yiran, et al. Mp5: A multi-modal open-ended embodied system in minecraft via active perception. In: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2024. p. 16307-16316.
  • 47. Contents 1. Introduction. History and Basic Definitions. 2. Types and Structures of GenAI Agents. 3. Applications of GenAI Agents. 4. Evaluation of GenAI Agents. 5. Types of GenAI Frameworks and Technologies. 6. Challenges and Future of GenAI.
  • 48. Evaluation of GenAI Agents A single GenAI agent evaluation is characterized by: • performance of narrow tasks • adaptability to perform unseen tasks • alignment with the user’s goals • business metrics Metrics: Task-specific accuracy, reasoning and adaptability, user interaction quality, robustness and reliability, efficiency
  • 49. Evaluation of GenAI Agents Task-oriented benchmarks for single GenAI agents: ā— NLP-specific benchmarks (e.g. HELM) ā— multi-modal benchmarks (e.g. SEED-Bench) ā— agentic benchmarks (e.g. AgentBench) ā— autonomous decision-making (e.g. ALFWorld)
  • 50. Evaluation of GenAI Agents Human-AI interaction has: ā— response quality (e.g. relevance, coherence, accuracy) ā— user satisfaction and trust levels (user satisfaction score) ā— efficiency in task completion (task completion time, reduction in user queries) ā— ability to take feedback and improve over time (retrain latency)
  • 51. Evaluation of GenAI Agents A multiple GenAI agents evaluation is oriented to: ā— coordination and communication ā— emergent behaviors ā— scalability and resource efficiency ā— business metrics
  • 52. Evaluation of GenAI Agents Evaluation metrics of multiple GenAI agents for: ā— coordination efficiency (inter-agent communication success rate) ā— emergent behavior analysis (bias detection score) ā— scalability and resource utilization (bandwidth efficiency) ā— inter-agent communication (message latency) ā— competitive and cooperative performance (cooperation score)