1 | © Copyright 10/22/23 Zilliz
From Dev to Prod,
Vector Database Made Easy
Presented by: Charles Xie
Charles Xie
Founder and CEO of Zilliz,
Founder of the Milvus Project
● Board member of the LF AI & Data Foundation and its chairperson from 2020 to 2021
● Founding engineer of the Oracle 12c cloud database
● Master's in Computer Science, University of Wisconsin-Madison
● Zilliz means "zillions of zillions", pronounced /ʼzilis/
● The company behind the Milvus project, since 2018
● Series B, $113M funding
Vector Database: making sense of unstructured data
2024
A vector database stores embedding vectors and allows for
semantic retrieval of various types of unstructured data.
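As a rough illustration of semantic retrieval, the sketch below ranks stored items by cosine similarity to a query vector. The three-dimensional embeddings and item names are made up for the example, and a real vector database would use an approximate index rather than this brute-force scan.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def top_k(query, store, k=2):
    """Return the keys of the k stored vectors most similar to the query."""
    scored = [(cosine_similarity(query, vec), key) for key, vec in store.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [key for _, key in scored[:k]]

# Hypothetical toy embeddings; real ones come from an embedding model.
store = {
    "cat photo": [0.9, 0.1, 0.0],
    "dog photo": [0.8, 0.3, 0.1],
    "tax form": [0.0, 0.2, 0.9],
}
query = [1.0, 0.2, 0.0]
print(top_k(query, store))
```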
Milvus, OSS vector database since 2019
Originally created by Zilliz, hosted by the Linux Foundation
● 28K GitHub Stars
● 10,000 Enterprise users
● 70M Downloads
● 270 Contributors
Vector database made easy
for companies at different stages
GenAI company life cycle
Bootstrapping stage
● First to market
● Easy to use
● Prepare for future growth
Easy to start
● pip install on your laptop
● Plug into your favorite AI dev tools
● Push to production with a single line of code
Prepare for future growth
● Write your code once and run it everywhere, at any scale
○ The API and SDK are the same everywhere
Milvus Lite
● Embedded
● No server installation
● Low footprint
Milvus
● Dedicated server
● High performance
● Easy to maintain
○ K8S
○ Docker
Zilliz Cloud
● Multi-tenancy
● High availability
● Data security
● Compliance
○ SOC 2
○ GDPR
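One way to picture "write once, run everywhere" is that only the connection settings change between the three deployments. The sketch below is an assumption-laden illustration: the function name, the local file path, and the environment variable names are hypothetical, and the returned values are meant to be passed to a client such as pymilvus's `MilvusClient(uri=..., token=...)`.

```python
import os

def connection_args(target, env=None):
    """Return client keyword arguments for a given deployment target."""
    env = os.environ if env is None else env
    if target == "lite":
        # Milvus Lite: embedded, data kept in a local file (hypothetical path).
        return {"uri": "./milvus_demo.db"}
    if target == "server":
        # Self-hosted Milvus on Docker or K8s; 19530 is the default port.
        return {"uri": env.get("MILVUS_URI", "http://localhost:19530")}
    if target == "cloud":
        # Zilliz Cloud: endpoint and API key from hypothetical env variables.
        return {"uri": env["ZILLIZ_URI"], "token": env["ZILLIZ_TOKEN"]}
    raise ValueError(f"unknown deployment target: {target!r}")

print(connection_args("lite"))
```

With pymilvus installed, the rest of the application code would stay the same and only the target string would change, for example `MilvusClient(**connection_args("server"))`.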
Growth stage
● ROI, Cost optimization
● Scalability
Status quo: expensive, not scalable
In memory
● Most other VDBs
● HNSW
● Expensive
Lower cost
In memory
● Most VDBs
● HNSW
● Expensive
On Disk
● Up to 10x lower cost
Object store
● Up to 50x lower cost
Storage and computation separation
Milvus: decoupling computation and storage
Hierarchical storage and caching
Data caching
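The general idea of caching hot data in front of a cheaper, slower tier can be sketched as a read-through LRU cache. This is a generic illustration, not Milvus's actual implementation; the class name, the dictionary standing in for disk or object storage, and the capacity are all assumptions.

```python
from collections import OrderedDict

class TieredStore:
    """Read-through cache: a small in-memory LRU in front of a slower store."""

    def __init__(self, backing, capacity):
        self.backing = backing      # stands in for disk or object storage
        self.capacity = capacity    # maximum number of cached entries
        self.cache = OrderedDict()  # key order doubles as recency order
        self.hits = 0
        self.misses = 0

    def get(self, key):
        if key in self.cache:
            self.cache.move_to_end(key)  # mark as most recently used
            self.hits += 1
            return self.cache[key]
        self.misses += 1
        value = self.backing[key]        # slow path: read the lower tier
        self.cache[key] = value
        if len(self.cache) > self.capacity:
            self.cache.popitem(last=False)  # evict the least recently used
        return value

tiers = TieredStore({"a": 1, "b": 2, "c": 3}, capacity=2)
for key in ["a", "b", "a", "c"]:
    tiers.get(key)
print(tiers.hits, tiers.misses, list(tiers.cache))
```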
Higher scalability
● 10B vectors of 1536 dimensions in a single Milvus/Zilliz Cloud instance
● 100B vectors in one of the largest deployments
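To give a sense of scale, the raw size of 10B vectors of 1536 dimensions can be estimated with simple arithmetic. The sketch assumes 4-byte float32 values and ignores index overhead, replication, and compression, none of which are stated on the slide.

```python
def raw_vector_bytes(num_vectors, dim, bytes_per_value=4):
    """Raw storage for dense vectors, assuming float32 values by default."""
    return num_vectors * dim * bytes_per_value

total = raw_vector_bytes(10_000_000_000, 1536)
print(f"{total / 1e12:.2f} TB")  # 61.44 TB of raw vector data
```

At this size, keeping everything in RAM is the expensive option, which is the motivation for the disk and object store tiers listed earlier.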
Milvus architecture
Expansion stage
● Performance
● Avoid vendor lock-in
○ Move data when you want
● Multi-cloud
● Global availability
VectorDBBench: OSS framework for VDB benchmarking
https://github.com/zilliztech/VectorDBBench
Performance
Multi-cloud: Zilliz Cloud is built atop OSS Milvus
AWS, GCP, Azure
Global availability: Zilliz Cloud has 20 availability zones
NA, EMEA, APAC
Vector database applications
Retrieval-Augmented Generation RAG
2024
A technique that combines the
strength of retrieval-based and
generative models:
● Improve accuracy and relevance
● Eliminate hallucination
● Provide domain-specific
knowledge
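The retrieve-then-generate flow can be sketched in a few lines. This is a toy under stated assumptions: word overlap stands in for embedding similarity, the two documents are made up for the example, and the resulting prompt would be sent to a generative model, which is left out here.

```python
def score(query, doc):
    """Stand-in for embedding similarity: share of query words found in doc."""
    q = set(query.lower().split())
    d = set(doc.lower().split())
    return len(q & d) / len(q) if q else 0.0

def retrieve(query, docs, k=1):
    """Return the k documents that best match the query."""
    return sorted(docs, key=lambda d: score(query, d), reverse=True)[:k]

def build_prompt(query, docs, k=1):
    """Put the retrieved context in front of the question for the model."""
    context = "\n".join(f"- {d}" for d in retrieve(query, docs, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

# Hypothetical private documents.
docs = [
    "Milvus is an open source vector database",
    "HNSW is an in-memory graph index",
]
prompt = build_prompt("what is milvus", docs)
print(prompt)
```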
RAG: an economic perspective
A business model that bridges public data and private data
● Data sovereignty
● You can't and shouldn't give your private data to others
RAG Evolution (2024)
RAG 1.0, last year
● Text
● LLMs
○ GPT-3.5, GPT-4
RAG 2.0, this year
● Image, video
● Multimodal models
○ GPT-4o
RAG 3.0, next year?
● User behavior
● Customized recommendation systems
○ Merlin
More application scenarios of vector databases
Thank You!
Milvus
Open Source Self-Managed
github.com/milvus-io/milvus
Zilliz Cloud
SaaS Fully-Managed
zilliz.com/cloud
