SlideShare a Scribd company logo
1 | © Copyright 8/16/23 Zilliz
1 | © Copyright 8/16/23 Zilliz
Stephen Batifol | Zilliz
Unstructured Data Meetup, July 16th
Scaling Vector Search: How
Milvus Handles Billions+
2 | © Copyright 8/16/23 Zilliz
2 | © Copyright 8/16/23 Zilliz
Stephen Batifol
Developer Advocate, Zilliz/ Milvus
stephen.batifol@zilliz.com
linkedin.com/in/stephen-batifol/
@stephenbtl
Speaker
3 | © Copyright 8/16/23 Zilliz
3 | © Copyright 8/16/23 Zilliz
27K+
GitHub
Stars
25M+
Downloads
250+
Contributors
2,600
+
Forks
Milvus is an open-source vector database for GenAI projects. pip install on your
laptop, plug into popular AI dev tools, and push to production with a single line of
code.
Easy Setup
pip install
pymilvus to start
coding in a notebook
within seconds.
Reusable Code
Write once, and
deploy with one line
of code into the
production
environment
Integration
Plug into OpenAI,
Langchain,
LlamaIndex, and
many more
Feature-rich
Dense & sparse
embeddings,
filtering, reranking
and beyond
4 | © Copyright 8/16/23 Zilliz
4 | © Copyright 8/16/23 Zilliz
Seamless integration with all popular AI toolkits
5 | © Copyright 8/16/23 Zilliz
5 | © Copyright 8/16/23 Zilliz
Well-connected in LLM infrastructure to enable RAG
use cases
Framework
Hardware
Infrastructure
Embedding Models LLMs
Software Infrastructure
Vector Database
6 | © Copyright 8/16/23 Zilliz
6 | © Copyright 8/16/23 Zilliz
Retrieval Augmented
Generation (RAG)
Expand LLMs' knowledge by
incorporating external data sources
into LLMs and your AI applications.
Match user behavior or content
features with other similar ones to
make effective recommendations.
Recommender System
Search for semantically similar
texts across vast amounts of
natural language documents.
Text/ Semantic Search
Image Similarity Search
Identify and search for visually
similar images or objects from a
vast collection of image libraries.
Video Similarity Search
Search for similar videos, scenes,
or objects from extensive
collections of video libraries.
Audio Similarity Search
Find similar audios in large datasets
for tasks like genre classification or
speech recognition
Molecular Similarity Search
Search for similar substructures,
superstructures, and other
structures for a specific molecule.
Anomaly Detection
Detect data points, events, and
observations that deviate
significantly from the usual pattern
Multimodal Similarity Search
Search over multiple types of data
simultaneously, e.g. text and
images
Common AI Use Cases
7 | © Copyright 8/16/23 Zilliz
7 | © Copyright 8/16/23 Zilliz
7 | © Copyright 8/16/23 Zilliz
7 | © Copyright 8/16/23 Zilliz
01
Introduction to Vector DB
and Vector Search
8 | © Copyright 8/16/23 Zilliz
8 | © Copyright 8/16/23 Zilliz
Traditional database was built upon exact search
9 | © Copyright 8/16/23 Zilliz
9 | © Copyright 8/16/23 Zilliz
…which misses context, semantic meaning, and user intent
VS.
Apple
VS.
Rising dough
VS.
Change car tire
Rising Dough
Proofing Bread
✔
❌
10 | © Copyright 8/16/23 Zilliz
10 | © Copyright 8/16/23 Zilliz
…and cannot process increasingly growing unstructured data
*Data Source: The Digitization of the World by IDC
20%
Other
newly generated data in 2025
will be unstructured data
80%
11 | © Copyright 8/16/23 Zilliz
11 | © Copyright 8/16/23 Zilliz
Vector
Databases
Where do Vectors Come From?
12 | © Copyright 8/16/23 Zilliz
12 | © Copyright 8/16/23 Zilliz
Embeddings Models
13 | © Copyright 8/16/23 Zilliz
13 | © Copyright 8/16/23 Zilliz
Vector Embedding
14 | © Copyright 8/16/23 Zilliz
14 | © Copyright 8/16/23 Zilliz
Vector Space
15 | © Copyright 8/16/23 Zilliz
15 | © Copyright 8/16/23 Zilliz
15 | © Copyright 8/16/23 Zilliz
15 | © Copyright 8/16/23 Zilliz
02
How do Vector Databases
Work?
16 | © Copyright 8/16/23 Zilliz
16 | © Copyright 8/16/23 Zilliz
How Similarity Search Works
Vn, 1
…
…
…
1
2
3
4
5
Transform into
Vectors
Unstructured Data
Images
User Generated
Content
Video
Documents
Audio
Vector Embeddings
Perform Approximate
Nearest Neighbor
Similarity Search
Perform Query
Get Results
Store in Vector Database
17 | © Copyright 8/16/23 Zilliz
17 | © Copyright 8/16/23 Zilliz
17 | © Copyright 8/16/23 Zilliz
17 | © Copyright 8/16/23 Zilliz
02
How does Milvus do it at
Billions+ Scale?
18 | © Copyright 8/16/23 Zilliz
18 | © Copyright 8/16/23 Zilliz
Design Principles
• Disaggregate storage and computation
• Fully depends on mature storage
systems
• Micro Service - scale by functionality
• Separate Streaming and historical data
• Pluggable engine, storage and index
• Log As data
19 | © Copyright 8/16/23 Zilliz
19 | © Copyright 8/16/23 Zilliz
Meta Storage
Root Query Data Index
Coordinator Service
Proxy
Proxy
etcd
Log Broker
SDK
Load Balancer
DDL/DCL
DML
NOTIFICATION
CONTROL SIGNAL
Object Storage
Minio / S3 / AzureBlob
Log Snapshot Delta File Index File
Worker Node QUERY DATA DATA
Message Storage
VECTOR
DATABASE
Access Layer
Query Node Data Node Index Node
Fully Distributed Architecture
20 | © Copyright 8/16/23 Zilliz
20 | © Copyright 8/16/23 Zilliz
Each shard is managed by a supervisor
(shard leader). This supervisor is
responsible for:
• Adding new information to the shard.
• Regularly storing the data in a safe
place (object storage).
• Serving the latest information for
search requests.
• Forwarding historical data requests to
other cabinets (query nodes) if
needed.
Milvus Data Layout - Shard
21 | © Copyright 8/16/23 Zilliz
21 | © Copyright 8/16/23 Zilliz
Growing Segment:
• In-memory segment replaying data
from the Log Broker.
• Uses a FLAT index to ensure data is
fresh and appendable.
Sealed Segment:
• Immutable segment using
alternative indexing methods for
efficiency.
Milvus Data Layout - Segments
22 | © Copyright 8/16/23 Zilliz
22 | © Copyright 8/16/23 Zilliz
Behind the Scenes: How Data Gets Added and
Accessed
• Sharding: Large datasets are
divided into smaller,
manageable sections called
shards. Each shard is handled
by a dedicated datanode.
• Write-Ahead Log (WAL):
When you add new data, a
proxy service writes it to a
temporary log called a WAL
(e.g., Kafka, Pulsar). Think of it
as a to-do list for the
datanodes.
23 | © Copyright 8/16/23 Zilliz
23 | © Copyright 8/16/23 Zilliz
Behind the Scenes: How Data Gets Added and
Accessed
• Datanodes subscribe to the WAL
and:
• Add new data to their assigned shard.
• Remove outdated data (if needed)
• Flush accumulated data to permanent
storage.
• Query Nodes also subscribe to the
WAL but focus on:
• Creating and managing Segments
within each shard for fast searching.
• Ensuring searches access the latest
information.
24 | © Copyright 8/16/23 Zilliz
24 | © Copyright 8/16/23 Zilliz
Index Building
To avoid frequent index building
for data updates.
A collection in Milvus is divided
further into segments, each with
its own index.
25 | © Copyright 8/16/23 Zilliz
25 | © Copyright 8/16/23 Zilliz
Data query refers to:
• Searching a specified
collection for k number of
vectors nearest to a target
vector or for all vectors within
a specified distance range to
the vector.
Data query
26 | © Copyright 8/16/23 Zilliz
26 | © Copyright 8/16/23 Zilliz
Some customers with Millions/ Billions Scale
27 | © Copyright 8/16/23 Zilliz
27 | © Copyright 8/16/23 Zilliz
| © Copyright 8/16/23 Zilliz
27
Demo!
28 | © Copyright 8/16/23 Zilliz
28 | © Copyright 8/16/23 Zilliz
28 | © Copyright 8/16/23 Zilliz
28 | © Copyright 8/16/23 Zilliz
09
RAG in action with Milvus
Lite
29 | © Copyright 8/16/23 Zilliz
29 | © Copyright 8/16/23 Zilliz
milvus.io
github.com/milvus-io/
@milvusio
@stephenbtl
/in/stephen-batifol
Thank you
30 | © Copyright 8/16/23 Zilliz
30 | © Copyright 8/16/23 Zilliz

More Related Content

PDF
Milvus: Scaling Vector Data Solutions for Gen AI
PDF
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
PDF
Keeping Data Fresh: Mastering Updates in Vector Databases
PDF
2025-02-24 - AWS meetup - Zilliz presentation.pdf
PDF
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
PDF
MultiModal RAG using vLLM and Pixtral - Stephen Batifol
PDF
MultiModal RAG using vLLM and Pixtral - Stephen Batifol
PDF
Open Source Milvus Vector Database v 2.6
Milvus: Scaling Vector Data Solutions for Gen AI
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Keeping Data Fresh: Mastering Updates in Vector Databases
2025-02-24 - AWS meetup - Zilliz presentation.pdf
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
MultiModal RAG using vLLM and Pixtral - Stephen Batifol
MultiModal RAG using vLLM and Pixtral - Stephen Batifol
Open Source Milvus Vector Database v 2.6

Similar to Scaling Vector Search: How Milvus Handles Billions+ (20)

PDF
09-25-2024 NJX Venture Summit Introduction to Unstructured Data
PDF
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
PDF
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
PDF
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
PDF
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
PDF
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
PDF
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
PDF
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
PDF
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
PDF
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
PDF
GraphRAG Agents with Neo4j, Milvus and GPT4
PDF
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
PDF
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
PDF
A Beginners Guide to Building a RAG App Using Open Source Milvus
PDF
09-26-2024 Conf 42 Kube Native: Unleashing the Potential of Cloud Native Open...
PDF
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
PDF
Unstructured Data Processing from Cloud to Edge Webinar
PDF
Unstructured Data Processing from Cloud to Edge Webinar
PDF
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
PDF
Vector Databases 101 - An introduction to the world of Vector Databases
09-25-2024 NJX Venture Summit Introduction to Unstructured Data
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
GraphRAG Agents with Neo4j, Milvus and GPT4
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
A Beginners Guide to Building a RAG App Using Open Source Milvus
09-26-2024 Conf 42 Kube Native: Unleashing the Potential of Cloud Native Open...
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Unstructured Data Processing from Cloud to Edge Webinar
Unstructured Data Processing from Cloud to Edge Webinar
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
Vector Databases 101 - An introduction to the world of Vector Databases
Ad

More from Zilliz (20)

PDF
Build Fast, Scale Faster: Milvus vs. Zilliz Cloud for Production-Ready AI
PDF
Zilliz Cloud Demo for performance and scale
PDF
Zilliz Cloud Monthly Technical Review: May 2025
PDF
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
PDF
Webinar - Zilliz Cloud Monthly Demo - March 2025
PDF
What Makes "Deep Research"? A Dive into AI Agents
PDF
Combining Lexical and Semantic Search with Milvus 2.5
PDF
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
PDF
February Product Demo: Discover the Power of Zilliz Cloud
PDF
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
PDF
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
PDF
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
PDF
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
PDF
1 Table = 1000 Words? Foundation Models for Tabular Data
PDF
How Milvus allows you to run Full Text Search
PDF
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
PDF
Using LLM Agents with Llama 3.2, LangGraph and Milvus
PDF
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
PDF
Vector Databases for Enhanced Classification
PDF
Multimodal Retrieval-Augmented Generation (RAG) with Vector Database
Build Fast, Scale Faster: Milvus vs. Zilliz Cloud for Production-Ready AI
Zilliz Cloud Demo for performance and scale
Zilliz Cloud Monthly Technical Review: May 2025
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Webinar - Zilliz Cloud Monthly Demo - March 2025
What Makes "Deep Research"? A Dive into AI Agents
Combining Lexical and Semantic Search with Milvus 2.5
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
February Product Demo: Discover the Power of Zilliz Cloud
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
1 Table = 1000 Words? Foundation Models for Tabular Data
How Milvus allows you to run Full Text Search
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
Using LLM Agents with Llama 3.2, LangGraph and Milvus
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Vector Databases for Enhanced Classification
Multimodal Retrieval-Augmented Generation (RAG) with Vector Database
Ad

Recently uploaded (20)

PDF
Electronic commerce courselecture one. Pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Machine Learning_overview_presentation.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Big Data Technologies - Introduction.pptx
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Electronic commerce courselecture one. Pdf
Encapsulation_ Review paper, used for researhc scholars
Programs and apps: productivity, graphics, security and other tools
MYSQL Presentation for SQL database connectivity
Machine Learning_overview_presentation.pptx
Machine learning based COVID-19 study performance prediction
Diabetes mellitus diagnosis method based random forest with bat algorithm
Big Data Technologies - Introduction.pptx
SOPHOS-XG Firewall Administrator PPT.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Unlocking AI with Model Context Protocol (MCP)
Building Integrated photovoltaic BIPV_UPV.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
NewMind AI Weekly Chronicles - August'25-Week II
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
A Presentation on Artificial Intelligence
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...

Scaling Vector Search: How Milvus Handles Billions+

  • 1. 1 | © Copyright 8/16/23 Zilliz 1 | © Copyright 8/16/23 Zilliz Stephen Batifol | Zilliz Unstructured Data Meetup, July 16th Scaling Vector Search: How Milvus Handles Billions+
  • 2. 2 | © Copyright 8/16/23 Zilliz 2 | © Copyright 8/16/23 Zilliz Stephen Batifol Developer Advocate, Zilliz/ Milvus stephen.batifol@zilliz.com linkedin.com/in/stephen-batifol/ @stephenbtl Speaker
  • 3. 3 | © Copyright 8/16/23 Zilliz 3 | © Copyright 8/16/23 Zilliz 27K+ GitHub Stars 25M+ Downloads 250+ Contributors 2,600 + Forks Milvus is an open-source vector database for GenAI projects. pip install on your laptop, plug into popular AI dev tools, and push to production with a single line of code. Easy Setup pip install pymilvus to start coding in a notebook within seconds. Reusable Code Write once, and deploy with one line of code into the production environment Integration Plug into OpenAI, Langchain, LlamaIndex, and many more Feature-rich Dense & sparse embeddings, filtering, reranking and beyond
  • 4. 4 | © Copyright 8/16/23 Zilliz 4 | © Copyright 8/16/23 Zilliz Seamless integration with all popular AI toolkits
  • 5. 5 | © Copyright 8/16/23 Zilliz 5 | © Copyright 8/16/23 Zilliz Well-connected in LLM infrastructure to enable RAG use cases Framework Hardware Infrastructure Embedding Models LLMs Software Infrastructure Vector Database
  • 6. 6 | © Copyright 8/16/23 Zilliz 6 | © Copyright 8/16/23 Zilliz Retrieval Augmented Generation (RAG) Expand LLMs' knowledge by incorporating external data sources into LLMs and your AI applications. Match user behavior or content features with other similar ones to make effective recommendations. Recommender System Search for semantically similar texts across vast amounts of natural language documents. Text/ Semantic Search Image Similarity Search Identify and search for visually similar images or objects from a vast collection of image libraries. Video Similarity Search Search for similar videos, scenes, or objects from extensive collections of video libraries. Audio Similarity Search Find similar audios in large datasets for tasks like genre classification or speech recognition Molecular Similarity Search Search for similar substructures, superstructures, and other structures for a specific molecule. Anomaly Detection Detect data points, events, and observations that deviate significantly from the usual pattern Multimodal Similarity Search Search over multiple types of data simultaneously, e.g. text and images Common AI Use Cases
  • 7. 7 | © Copyright 8/16/23 Zilliz 7 | © Copyright 8/16/23 Zilliz 7 | © Copyright 8/16/23 Zilliz 7 | © Copyright 8/16/23 Zilliz 01 Introduction to Vector DB and Vector Search
  • 8. 8 | © Copyright 8/16/23 Zilliz 8 | © Copyright 8/16/23 Zilliz Traditional database was built upon exact search
  • 9. 9 | © Copyright 8/16/23 Zilliz 9 | © Copyright 8/16/23 Zilliz …which misses context, semantic meaning, and user intent VS. Apple VS. Rising dough VS. Change car tire Rising Dough Proofing Bread ✔ ❌
  • 10. 10 | © Copyright 8/16/23 Zilliz 10 | © Copyright 8/16/23 Zilliz …and cannot process increasingly growing unstructured data *Data Source: The Digitization of the World by IDC 20% Other newly generated data in 2025 will be unstructured data 80%
  • 11. 11 | © Copyright 8/16/23 Zilliz 11 | © Copyright 8/16/23 Zilliz Vector Databases Where do Vectors Come From?
  • 12. 12 | © Copyright 8/16/23 Zilliz 12 | © Copyright 8/16/23 Zilliz Embeddings Models
  • 13. 13 | © Copyright 8/16/23 Zilliz 13 | © Copyright 8/16/23 Zilliz Vector Embedding
  • 14. 14 | © Copyright 8/16/23 Zilliz 14 | © Copyright 8/16/23 Zilliz Vector Space
  • 15. 15 | © Copyright 8/16/23 Zilliz 15 | © Copyright 8/16/23 Zilliz 15 | © Copyright 8/16/23 Zilliz 15 | © Copyright 8/16/23 Zilliz 02 How do Vector Databases Work?
  • 16. 16 | © Copyright 8/16/23 Zilliz 16 | © Copyright 8/16/23 Zilliz How Similarity Search Works Vn, 1 … … … 1 2 3 4 5 Transform into Vectors Unstructured Data Images User Generated Content Video Documents Audio Vector Embeddings Perform Approximate Nearest Neighbor Similarity Search Perform Query Get Results Store in Vector Database
  • 17. 17 | © Copyright 8/16/23 Zilliz 17 | © Copyright 8/16/23 Zilliz 17 | © Copyright 8/16/23 Zilliz 17 | © Copyright 8/16/23 Zilliz 02 How does Milvus do it at Billions+ Scale?
  • 18. 18 | © Copyright 8/16/23 Zilliz 18 | © Copyright 8/16/23 Zilliz Design Principles • Disaggregate storage and computation • Fully depends on mature storage systems • Micro Service - scale by functionality • Separate Streaming and historical data • Pluggable engine, storage and index • Log As data
  • 19. 19 | © Copyright 8/16/23 Zilliz 19 | © Copyright 8/16/23 Zilliz Meta Storage Root Query Data Index Coordinator Service Proxy Proxy etcd Log Broker SDK Load Balancer DDL/DCL DML NOTIFICATION CONTROL SIGNAL Object Storage Minio / S3 / AzureBlob Log Snapshot Delta File Index File Worker Node QUERY DATA DATA Message Storage VECTOR DATABASE Access Layer Query Node Data Node Index Node Fully Distributed Architecture
  • 20. 20 | © Copyright 8/16/23 Zilliz 20 | © Copyright 8/16/23 Zilliz Each shard is managed by a supervisor (shard leader). This supervisor is responsible for: • Adding new information to the shard. • Regularly storing the data in a safe place (object storage). • Serving the latest information for search requests. • Forwarding historical data requests to other cabinets (query nodes) if needed. Milvus Data Layout - Shard
  • 21. 21 | © Copyright 8/16/23 Zilliz 21 | © Copyright 8/16/23 Zilliz Growing Segment: • In-memory segment replaying data from the Log Broker. • Uses a FLAT index to ensure data is fresh and appendable. Sealed Segment: • Immutable segment using alternative indexing methods for efficiency. Milvus Data Layout - Segments
  • 22. 22 | © Copyright 8/16/23 Zilliz 22 | © Copyright 8/16/23 Zilliz Behind the Scenes: How Data Gets Added and Accessed • Sharding: Large datasets are divided into smaller, manageable sections called shards. Each shard is handled by a dedicated datanode. • Write-Ahead Log (WAL): When you add new data, a proxy service writes it to a temporary log called a WAL (e.g., Kafka, Pulsar). Think of it as a to-do list for the datanodes.
  • 23. 23 | © Copyright 8/16/23 Zilliz 23 | © Copyright 8/16/23 Zilliz Behind the Scenes: How Data Gets Added and Accessed • Datanodes subscribe to the WAL and: • Add new data to their assigned shard. • Remove outdated data (if needed) • Flush accumulated data to permanent storage. • Query Nodes also subscribe to the WAL but focus on: • Creating and managing Segments within each shard for fast searching. • Ensuring searches access the latest information.
  • 24. 24 | © Copyright 8/16/23 Zilliz 24 | © Copyright 8/16/23 Zilliz Index Building To avoid frequent index building for data updates. A collection in Milvus is divided further into segments, each with its own index.
  • 25. 25 | © Copyright 8/16/23 Zilliz 25 | © Copyright 8/16/23 Zilliz Data query refers to: • Searching a specified collection for k number of vectors nearest to a target vector or for all vectors within a specified distance range to the vector. Data query
  • 26. 26 | © Copyright 8/16/23 Zilliz 26 | © Copyright 8/16/23 Zilliz Some customers with Millions/ Billions Scale
  • 27. 27 | © Copyright 8/16/23 Zilliz 27 | © Copyright 8/16/23 Zilliz | © Copyright 8/16/23 Zilliz 27 Demo!
  • 28. 28 | © Copyright 8/16/23 Zilliz 28 | © Copyright 8/16/23 Zilliz 28 | © Copyright 8/16/23 Zilliz 28 | © Copyright 8/16/23 Zilliz 09 RAG in action with Milvus Lite
  • 29. 29 | © Copyright 8/16/23 Zilliz 29 | © Copyright 8/16/23 Zilliz milvus.io github.com/milvus-io/ @milvusio @stephenbtl /in/stephen-batifol Thank you
  • 30. 30 | © Copyright 8/16/23 Zilliz 30 | © Copyright 8/16/23 Zilliz