SlideShare a Scribd company logo
1 | © Copyright 8/16/23 Zilliz
1 | © Copyright 8/16/23 Zilliz
Stephen Batifol | Zilliz
Unstructured Data Meetup, Sept 5th
Multi-agent Systems with
Mistral AI, Milvus and
Llama-agents
2 | © Copyright 8/16/23 Zilliz
2 | © Copyright 8/16/23 Zilliz
Stephen Batifol
Developer Advocate, Zilliz/ Milvus
stephen.batifol@zilliz.com
linkedin.com/in/stephen-batifol/
@stephenbtl
Speaker
3 | © Copyright 8/16/23 Zilliz
3 | © Copyright 8/16/23 Zilliz
28K
GitHub
Stars
25M
Downloads
250
Contributors
2,600
+
Forks
Milvus is an open-source vector database for GenAI projects. pip install on your
laptop, plug into popular AI dev tools, and push to production with a single line of
code.
Easy Setup
Pip-install to start
coding in a notebook
within seconds.
Reusable Code
Write once, and
deploy with one line
of code into the
production
environment
Integration
Plug into OpenAI,
Langchain,
LlmaIndex, and
many more
Feature-rich
Dense & sparse
embeddings,
filtering, reranking
and beyond
4 | © Copyright 8/16/23 Zilliz
4 | © Copyright 8/16/23 Zilliz
Seamless integration with all popular AI toolkits
5 | © Copyright 8/16/23 Zilliz
5 | © Copyright 8/16/23 Zilliz
5 | © Copyright 8/16/23 Zilliz
5 | © Copyright 8/16/23 Zilliz
01 Key Components
6 | © Copyright 8/16/23 Zilliz
6 | © Copyright 8/16/23 Zilliz
pip install pymilvus
Milvus Lite
7 | © Copyright 8/16/23 Zilliz
7 | © Copyright 8/16/23 Zilliz
• Framework for building LLM Applications
• Focus on retrieving data and integrating with
LLMs
• Integrations with most AI popular tools
🦙 llama-index
8 | © Copyright 8/16/23 Zilliz
8 | © Copyright 8/16/23 Zilliz
🦙 llama-agents 🤖 by llama-index
• Build Stateful apps with LLMs
and Multi-Agents workflow
• Cycles and Branching
• Human-in-the-Loop
• Persistence
9 | © Copyright 8/16/23 Zilliz
9 | © Copyright 8/16/23 Zilliz
Mistral AI
• Mistral Embed
• Embedding Model focused on Retrieval, very useful
for RAG
• English only
• Mistral Nemo
• 12B model with 128k context length
• Strong Function Calling and Retrieval for its size
• Run Locally
• Mistral Large 2
• 123 Billions parameters with 128K context length
• Very strong Function Calling and Retrieval skills
10 | © Copyright 8/16/23 Zilliz
10 | © Copyright 8/16/23 Zilliz
Tavily
● Web search API for up-to-date information
● Tavily Search API is a search engine
optimized for LLMs
● Expands agent's knowledge base
11 | © Copyright 8/16/23 Zilliz
11 | © Copyright 8/16/23 Zilliz
| © Copyright 8/16/23 Zilliz
11
RAG
Retrieval Augmented Generation)
12 | © Copyright 8/16/23 Zilliz
12 | © Copyright 8/16/23 Zilliz
Basic Idea
Use RAG to force the LLM to work with your data
by injecting it via a vector database like Milvus
13 | © Copyright 8/16/23 Zilliz
13 | © Copyright 8/16/23 Zilliz
Basic RAG Architecture
14 | © Copyright 8/16/23 Zilliz
14 | © Copyright 8/16/23 Zilliz
5 lines starter
15 | © Copyright 8/16/23 Zilliz
15 | © Copyright 8/16/23 Zilliz
Naive RAG is limited
16 | © Copyright 8/16/23 Zilliz
16 | © Copyright 8/16/23 Zilliz
Naive RAG Pipeline
⚠ Single-shot
⚠ No query understanding/planning
⚠ No tool use
⚠ No reflection, error correction
⚠ No memory (stateless)
17 | © Copyright 8/16/23 Zilliz
17 | © Copyright 8/16/23 Zilliz
Naive RAG failure mode
Summarization
18 | © Copyright 8/16/23 Zilliz
18 | © Copyright 8/16/23 Zilliz
Naive RAG failure mode
Implicit data
19 | © Copyright 8/16/23 Zilliz
19 | © Copyright 8/16/23 Zilliz
Naive RAG failure mode
Multi-part questions
20 | © Copyright 8/16/23 Zilliz
20 | © Copyright 8/16/23 Zilliz
20 | © Copyright 8/16/23 Zilliz
RAG is necessary but
not sufficient
21 | © Copyright 8/16/23 Zilliz
21 | © Copyright 8/16/23 Zilliz
21 | © Copyright 8/16/23 Zilliz
21 | © Copyright 8/16/23 Zilliz
01 Agentic RAG
22 | © Copyright 8/16/23 Zilliz
22 | © Copyright 8/16/23 Zilliz
Agentic RAG
✅ Multi-turn
✅ Query / task planning layer
✅ Tool interface for external environment
✅ Reflection
✅ Memory for personalization
23 | © Copyright 8/16/23 Zilliz
23 | © Copyright 8/16/23 Zilliz
● Self-Reflection 🛜
○ Check the internet to verify
● Query Routing
○ Whether to check our RAG system
● Query Routing with Subquery
○ Whether to check our RAG system for different queries
● Conversation Memory
● Tool Use
Agentic RAG
24 | © Copyright 8/16/23 Zilliz
24 | © Copyright 8/16/23 Zilliz
| © Copyright 8/16/23 Zilliz
24
Demo!
25 | © Copyright 8/16/23 Zilliz
25 | © Copyright 8/16/23 Zilliz
milvus.io
github.com/milvus-io/
@milvusio
@stephenbtl
/in/stephen-batifol
Thank you
26 | © Copyright 8/16/23 Zilliz
26 | © Copyright 8/16/23 Zilliz
Meta Storage
Root Query Data Index
Coordinator Service
Proxy
Proxy
etcd
Log Broker
SDK
Load Balancer
DDL/DCL
DML
NOTIFICATION
CONTROL SIGNAL
Object Storage
Minio / S3 / AzureBlob
Log Snapshot Delta File Index File
Worker Node QUERY DATA DATA
Message Storage
VECTOR
DATABASE
Access Layer
Query Node Data Node Index Node
Milvus Architecture

More Related Content

PDF
Building an Agentic RAG locally with Ollama and Milvus
PDF
GraphRAG Agents with Neo4j, Milvus and GPT4
PDF
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
PDF
Using LLM Agents with Llama 3.2, LangGraph and Milvus
PDF
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
PDF
MultiModal RAG using vLLM and Pixtral - Stephen Batifol
PDF
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
PDF
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
Building an Agentic RAG locally with Ollama and Milvus
GraphRAG Agents with Neo4j, Milvus and GPT4
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Using LLM Agents with Llama 3.2, LangGraph and Milvus
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
MultiModal RAG using vLLM and Pixtral - Stephen Batifol
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
17-October-2024 NYC AI Camp - Step-by-Step RAG 101

Similar to Multi-agent Systems with Mistral AI, Milvus and Llama-agents (20)

PDF
Using LLM Agents with Llama 3, LangGraph and Milvus
PDF
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
PDF
Advanced Retrieval Augmented Generation Techniques
PDF
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
PDF
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
PDF
06-18-2024-Princeton Meetup-Introduction to Milvus
PDF
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
PDF
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
PDF
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
PDF
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
PDF
Building Production Ready Search Pipelines with Spark and Milvus
PDF
09-12-2024 - Milvus, Vector database used for Sensor Data RAG
PDF
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
PDF
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
PDF
Scaling Vector Search: How Milvus Handles Billions+
PDF
2025-02-24 - AWS meetup - Zilliz presentation.pdf
PDF
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
PDF
Evaluating Retrieval-Augmented Generation - Webinar
PDF
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
PDF
Milvus: Scaling Vector Data Solutions for Gen AI
Using LLM Agents with Llama 3, LangGraph and Milvus
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Advanced Retrieval Augmented Generation Techniques
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-18-2024-Princeton Meetup-Introduction to Milvus
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Building Production Ready Search Pipelines with Spark and Milvus
09-12-2024 - Milvus, Vector database used for Sensor Data RAG
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
Scaling Vector Search: How Milvus Handles Billions+
2025-02-24 - AWS meetup - Zilliz presentation.pdf
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
Evaluating Retrieval-Augmented Generation - Webinar
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
Milvus: Scaling Vector Data Solutions for Gen AI
Ad

More from Zilliz (20)

PDF
Build Fast, Scale Faster: Milvus vs. Zilliz Cloud for Production-Ready AI
PDF
Zilliz Cloud Demo for performance and scale
PDF
Open Source Milvus Vector Database v 2.6
PDF
Zilliz Cloud Monthly Technical Review: May 2025
PDF
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
PDF
Webinar - Zilliz Cloud Monthly Demo - March 2025
PDF
What Makes "Deep Research"? A Dive into AI Agents
PDF
Combining Lexical and Semantic Search with Milvus 2.5
PDF
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
PDF
February Product Demo: Discover the Power of Zilliz Cloud
PDF
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
PDF
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
PDF
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
PDF
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
PDF
1 Table = 1000 Words? Foundation Models for Tabular Data
PDF
How Milvus allows you to run Full Text Search
PDF
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
PDF
Keeping Data Fresh: Mastering Updates in Vector Databases
PDF
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
PDF
Vector Databases for Enhanced Classification
Build Fast, Scale Faster: Milvus vs. Zilliz Cloud for Production-Ready AI
Zilliz Cloud Demo for performance and scale
Open Source Milvus Vector Database v 2.6
Zilliz Cloud Monthly Technical Review: May 2025
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Webinar - Zilliz Cloud Monthly Demo - March 2025
What Makes "Deep Research"? A Dive into AI Agents
Combining Lexical and Semantic Search with Milvus 2.5
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
February Product Demo: Discover the Power of Zilliz Cloud
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
1 Table = 1000 Words? Foundation Models for Tabular Data
How Milvus allows you to run Full Text Search
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
Keeping Data Fresh: Mastering Updates in Vector Databases
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Vector Databases for Enhanced Classification
Ad

Recently uploaded (20)

PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Approach and Philosophy of On baking technology
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
KodekX | Application Modernization Development
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPT
Teaching material agriculture food technology
PPTX
Spectroscopy.pptx food analysis technology
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Approach and Philosophy of On baking technology
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
The AUB Centre for AI in Media Proposal.docx
KodekX | Application Modernization Development
“AI and Expert System Decision Support & Business Intelligence Systems”
The Rise and Fall of 3GPP – Time for a Sabbatical?
Mobile App Security Testing_ A Comprehensive Guide.pdf
Review of recent advances in non-invasive hemoglobin estimation
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Teaching material agriculture food technology
Spectroscopy.pptx food analysis technology
Unlocking AI with Model Context Protocol (MCP)
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Building Integrated photovoltaic BIPV_UPV.pdf
Programs and apps: productivity, graphics, security and other tools

Multi-agent Systems with Mistral AI, Milvus and Llama-agents

  • 1. 1 | © Copyright 8/16/23 Zilliz 1 | © Copyright 8/16/23 Zilliz Stephen Batifol | Zilliz Unstructured Data Meetup, Sept 5th Multi-agent Systems with Mistral AI, Milvus and Llama-agents
  • 2. 2 | © Copyright 8/16/23 Zilliz 2 | © Copyright 8/16/23 Zilliz Stephen Batifol Developer Advocate, Zilliz/ Milvus stephen.batifol@zilliz.com linkedin.com/in/stephen-batifol/ @stephenbtl Speaker
  • 3. 3 | © Copyright 8/16/23 Zilliz 3 | © Copyright 8/16/23 Zilliz 28K GitHub Stars 25M Downloads 250 Contributors 2,600 + Forks Milvus is an open-source vector database for GenAI projects. pip install on your laptop, plug into popular AI dev tools, and push to production with a single line of code. Easy Setup Pip-install to start coding in a notebook within seconds. Reusable Code Write once, and deploy with one line of code into the production environment Integration Plug into OpenAI, Langchain, LlmaIndex, and many more Feature-rich Dense & sparse embeddings, filtering, reranking and beyond
  • 4. 4 | © Copyright 8/16/23 Zilliz 4 | © Copyright 8/16/23 Zilliz Seamless integration with all popular AI toolkits
  • 5. 5 | © Copyright 8/16/23 Zilliz 5 | © Copyright 8/16/23 Zilliz 5 | © Copyright 8/16/23 Zilliz 5 | © Copyright 8/16/23 Zilliz 01 Key Components
  • 6. 6 | © Copyright 8/16/23 Zilliz 6 | © Copyright 8/16/23 Zilliz pip install pymilvus Milvus Lite
  • 7. 7 | © Copyright 8/16/23 Zilliz 7 | © Copyright 8/16/23 Zilliz • Framework for building LLM Applications • Focus on retrieving data and integrating with LLMs • Integrations with most AI popular tools 🦙 llama-index
  • 8. 8 | © Copyright 8/16/23 Zilliz 8 | © Copyright 8/16/23 Zilliz 🦙 llama-agents 🤖 by llama-index • Build Stateful apps with LLMs and Multi-Agents workflow • Cycles and Branching • Human-in-the-Loop • Persistence
  • 9. 9 | © Copyright 8/16/23 Zilliz 9 | © Copyright 8/16/23 Zilliz Mistral AI • Mistral Embed • Embedding Model focused on Retrieval, very useful for RAG • English only • Mistral Nemo • 12B model with 128k context length • Strong Function Calling and Retrieval for its size • Run Locally • Mistral Large 2 • 123 Billions parameters with 128K context length • Very strong Function Calling and Retrieval skills
  • 10. 10 | © Copyright 8/16/23 Zilliz 10 | © Copyright 8/16/23 Zilliz Tavily ● Web search API for up-to-date information ● Tavily Search API is a search engine optimized for LLMs ● Expands agent's knowledge base
  • 11. 11 | © Copyright 8/16/23 Zilliz 11 | © Copyright 8/16/23 Zilliz | © Copyright 8/16/23 Zilliz 11 RAG Retrieval Augmented Generation)
  • 12. 12 | © Copyright 8/16/23 Zilliz 12 | © Copyright 8/16/23 Zilliz Basic Idea Use RAG to force the LLM to work with your data by injecting it via a vector database like Milvus
  • 13. 13 | © Copyright 8/16/23 Zilliz 13 | © Copyright 8/16/23 Zilliz Basic RAG Architecture
  • 14. 14 | © Copyright 8/16/23 Zilliz 14 | © Copyright 8/16/23 Zilliz 5 lines starter
  • 15. 15 | © Copyright 8/16/23 Zilliz 15 | © Copyright 8/16/23 Zilliz Naive RAG is limited
  • 16. 16 | © Copyright 8/16/23 Zilliz 16 | © Copyright 8/16/23 Zilliz Naive RAG Pipeline ⚠ Single-shot ⚠ No query understanding/planning ⚠ No tool use ⚠ No reflection, error correction ⚠ No memory (stateless)
  • 17. 17 | © Copyright 8/16/23 Zilliz 17 | © Copyright 8/16/23 Zilliz Naive RAG failure mode Summarization
  • 18. 18 | © Copyright 8/16/23 Zilliz 18 | © Copyright 8/16/23 Zilliz Naive RAG failure mode Implicit data
  • 19. 19 | © Copyright 8/16/23 Zilliz 19 | © Copyright 8/16/23 Zilliz Naive RAG failure mode Multi-part questions
  • 20. 20 | © Copyright 8/16/23 Zilliz 20 | © Copyright 8/16/23 Zilliz 20 | © Copyright 8/16/23 Zilliz RAG is necessary but not sufficient
  • 21. 21 | © Copyright 8/16/23 Zilliz 21 | © Copyright 8/16/23 Zilliz 21 | © Copyright 8/16/23 Zilliz 21 | © Copyright 8/16/23 Zilliz 01 Agentic RAG
  • 22. 22 | © Copyright 8/16/23 Zilliz 22 | © Copyright 8/16/23 Zilliz Agentic RAG ✅ Multi-turn ✅ Query / task planning layer ✅ Tool interface for external environment ✅ Reflection ✅ Memory for personalization
  • 23. 23 | © Copyright 8/16/23 Zilliz 23 | © Copyright 8/16/23 Zilliz ● Self-Reflection 🛜 ○ Check the internet to verify ● Query Routing ○ Whether to check our RAG system ● Query Routing with Subquery ○ Whether to check our RAG system for different queries ● Conversation Memory ● Tool Use Agentic RAG
  • 24. 24 | © Copyright 8/16/23 Zilliz 24 | © Copyright 8/16/23 Zilliz | © Copyright 8/16/23 Zilliz 24 Demo!
  • 25. 25 | © Copyright 8/16/23 Zilliz 25 | © Copyright 8/16/23 Zilliz milvus.io github.com/milvus-io/ @milvusio @stephenbtl /in/stephen-batifol Thank you
  • 26. 26 | © Copyright 8/16/23 Zilliz 26 | © Copyright 8/16/23 Zilliz Meta Storage Root Query Data Index Coordinator Service Proxy Proxy etcd Log Broker SDK Load Balancer DDL/DCL DML NOTIFICATION CONTROL SIGNAL Object Storage Minio / S3 / AzureBlob Log Snapshot Delta File Index File Worker Node QUERY DATA DATA Message Storage VECTOR DATABASE Access Layer Query Node Data Node Index Node Milvus Architecture