SlideShare a Scribd company logo
Intro to Vec Search
Uri Goren
Chief Data Scientist @ Argmax
Agenda 1
2
3
4
5
6
About us
RecSys & Search
Approx Neighbor
search
Similarity Search
stores
A new standard
Q&A
About Uri
BestPractix
■ Leading argmax.ml
■ Our expertise:
○ Natural Language
○ Recommendation Sys
■ ML Community
○ Podcast
○ Meetup
○ Conferences
e-Commerce
Why argmax cares about search ?
We develop recommendation systems and
personalized search solutions
Creator Economy
Ad Tech
Direct
Insurance
Recommendation and Search
Recommendations Systems
Recommended for you
Introduction to Vector search - Argmx talk
3%
42%
49%
23% 47%
2%
13%
48%
89%
99% 51% 71%
77%
87% 99% 63% 97% 51%
0
1
0
0
1
0
1
0
1
1
0
1
1
0
1
0
1
0
0
0
1
0
1
0
0
1
1
1
1
1
0
0
0
0
0
1
0
1
0
0
1
0
1
0
1
1
1
0
0
0
1
1
0
1
(0, 1, 0, 0, 0 ,0 ,1 ,1, 1)
(0, 0, 1, 0 ,0 ,1 ,0, 0, 0)
(1, 0, 0, 0, 1 ,0 ,1 ,0, 1)
0.03 1
0.9
-0.7
0.88
-0.8
Item vector and User vectors
User2Item recommendation Item2Item recommendation
How to learn the Vectors ?
Collaborative filtering Content filtering
0
1
0
0
1
0
1
0
1
1
0
0
1
0
1
0
1
1
Vegan?
Keto?
Low fat
Dairy?
Pastry?
Snack?
….
Shared
Features
Search is essentially the same thing
Query Query
Encoder
Different
Encoder
0.8
score
Personalized search is almost the same thing
Query
Query
Query
Encoder
Item
Encoder
0.8
score
User
Encoder
Query
Introduction to Vector search - Argmx talk
Approximation Algorithms
Isn’t it just KNN ?
Problem solved !?
How does Naive KNN work ?
m Users / Queries n Items
m times n times d2
(vector dim)
Sort
and
take
top k
Sort complexity
Flat search (Naive) pros and cons
Pros
✓ Extremely simple to implement
✓ Does not depend on metric
✓ Accurate results
✓ Best when search space is small
Cons
✘ Holds vectors in memory
✘ Not scalable
✘ Does not take advantage of the search
space topology
Hierarchical search
John is from Seattle, find his k nearest neighbors
No need to compare to the entire world What if John is
near the border of
2 states ?
Inverted Indices
1 0 0 1 0
0 0 100 1 0
1 0 1 0 0
a b c d e
When looking for the nearest
neighbors of
We can filter only records in which the
c component is positive (and maybe above a certain
threshold)
Locality sensitive hashing
(-0.55,+0.73,+1.7)
(-0.7,-0.52,+1.7)
(-2.05,-0.1,-1.11)
(-,+,+)
(-,-,+)
(-,-,-)
Key Ordered list
h(-,+,+)
h(-,-,+)
h(-,-,-)
Tradeoffs of the various algorithms
Flat HNSW IVF LSH
Accuracy Most Least
Memory
footprint
High Medium High Depends on
sparsity
Low
Metric Any L2/Cosine L2/Cosine Any
Vector Size Small Small Large if sparse Large
Similarity Search
Vector
Similarity
Server
Vector search workflow
User
Encoder
How pre-filters are implemented
US
EU
How post-filters are implemented
US
EU
Is item
available
for sale?
Is item
available
for sale?
Vector Similarity Providers
Google
ScaNN
And many
more …
Ann-benchmarks.con
Consideration when choosing a VecSim server
● Is the vector sparse or dense ?
○ Typically, one hot embeddings work best
with IVF and servers such as
Elastic,Solr,lucene
○ Hybrid queries?
● Should the vectors be stored ? or just the IDs
● Is replication needed ?
● Is GPU acceleration needed ?
● Filtering in Query time ?
● Are items dynamically added and removed
often?
● Batch vs online queries
○ E.g. building a mailing list vs live
recommendation
Is vector search a product or a
feature?
The VecSim API
Essentially, regardless of the implementation, the
API is pretty much the same
Taken from the
Vecsim pypi module
Using LangChain API
Suggestion
Summary
VecSim
A new standard
for querying
items for search
and recsys.
Control the Tradeoff
Between accuracy and
performance
Depends on use-case
Sparse or dense, batch or
online, static vs dynamic
items
Ready to get started ?
● Better experimentation, with
faster convergence.
● Take context into account and
personalize
THANK YOU!
Uri Goren
Chief Data Scientist @ Argmax

More Related Content

PDF
How Vector Search Transforms Information Retrieval?
PPTX
Haystack 2019 - Search with Vectors - Simon Hughes
PPTX
Searching with vectors
PPTX
Vectors in Search - Towards More Semantic Matching
PPTX
Vectors in Search – Towards More Semantic Matching - Simon Hughes, Dice.com
PPTX
RAG Patterns and Vector Search in Generative AI
PDF
An introduction to Elasticsearch's advanced relevance ranking toolbox
PDF
Machine learning @ Spotify - Madison Big Data Meetup
How Vector Search Transforms Information Retrieval?
Haystack 2019 - Search with Vectors - Simon Hughes
Searching with vectors
Vectors in Search - Towards More Semantic Matching
Vectors in Search – Towards More Semantic Matching - Simon Hughes, Dice.com
RAG Patterns and Vector Search in Generative AI
An introduction to Elasticsearch's advanced relevance ranking toolbox
Machine learning @ Spotify - Madison Big Data Meetup

Similar to Introduction to Vector search - Argmx talk (20)

PDF
Interactive Questions and Answers - London Information Retrieval Meetup
PDF
London IR Meetup - Players in Vector Search_ algorithms, software and use cases
PDF
Poster present at the CAIM workshop NYC, Feb 15 2018
PDF
ML+Hadoop at NYC Predictive Analytics
PDF
Searching in metric spaces
PDF
An introduction to Elasticsearch's advanced relevance ranking toolbox
PDF
Embedded based retrieval in modern search ranking system
PPT
Slides
PDF
Thought Vectors and Knowledge Graphs in AI-powered Search
PDF
Introduction to Open Source RAG and RAG Evaluation
PPT
Ir models
PDF
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...
PDF
Dense Retrieval with Apache Solr Neural Search.pdf
PPTX
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
PPTX
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
PDF
Red Hat Summit Connect 2023 - Redis Enterprise, the engine of Generative AI
PDF
Information retrieval to recommender systems
PDF
SVD and the Netflix Dataset
PDF
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
PDF
5 efficient-matching.ppt
Interactive Questions and Answers - London Information Retrieval Meetup
London IR Meetup - Players in Vector Search_ algorithms, software and use cases
Poster present at the CAIM workshop NYC, Feb 15 2018
ML+Hadoop at NYC Predictive Analytics
Searching in metric spaces
An introduction to Elasticsearch's advanced relevance ranking toolbox
Embedded based retrieval in modern search ranking system
Slides
Thought Vectors and Knowledge Graphs in AI-powered Search
Introduction to Open Source RAG and RAG Evaluation
Ir models
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...
Dense Retrieval with Apache Solr Neural Search.pdf
RecSys 2015 Tutorial – Scalable Recommender Systems: Where Machine Learning...
RecSys 2015 Tutorial - Scalable Recommender Systems: Where Machine Learning m...
Red Hat Summit Connect 2023 - Redis Enterprise, the engine of Generative AI
Information retrieval to recommender systems
SVD and the Netflix Dataset
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
5 efficient-matching.ppt
Ad

More from Zilliz (20)

PDF
Build Fast, Scale Faster: Milvus vs. Zilliz Cloud for Production-Ready AI
PDF
Zilliz Cloud Demo for performance and scale
PDF
Open Source Milvus Vector Database v 2.6
PDF
Zilliz Cloud Monthly Technical Review: May 2025
PDF
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
PDF
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
PDF
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
PDF
Webinar - Zilliz Cloud Monthly Demo - March 2025
PDF
What Makes "Deep Research"? A Dive into AI Agents
PDF
Combining Lexical and Semantic Search with Milvus 2.5
PDF
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
PDF
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
PDF
February Product Demo: Discover the Power of Zilliz Cloud
PDF
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
PDF
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
PDF
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
PDF
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
PDF
1 Table = 1000 Words? Foundation Models for Tabular Data
PDF
How Milvus allows you to run Full Text Search
PDF
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
Build Fast, Scale Faster: Milvus vs. Zilliz Cloud for Production-Ready AI
Zilliz Cloud Demo for performance and scale
Open Source Milvus Vector Database v 2.6
Zilliz Cloud Monthly Technical Review: May 2025
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Webinar - Zilliz Cloud Monthly Demo - March 2025
What Makes "Deep Research"? A Dive into AI Agents
Combining Lexical and Semantic Search with Milvus 2.5
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
February Product Demo: Discover the Power of Zilliz Cloud
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
1 Table = 1000 Words? Foundation Models for Tabular Data
How Milvus allows you to run Full Text Search
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
Ad

Recently uploaded (20)

PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPT
Teaching material agriculture food technology
PDF
KodekX | Application Modernization Development
PDF
Approach and Philosophy of On baking technology
PPTX
Cloud computing and distributed systems.
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Electronic commerce courselecture one. Pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
MYSQL Presentation for SQL database connectivity
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Unlocking AI with Model Context Protocol (MCP)
The AUB Centre for AI in Media Proposal.docx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Dropbox Q2 2025 Financial Results & Investor Presentation
Big Data Technologies - Introduction.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
Machine learning based COVID-19 study performance prediction
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Teaching material agriculture food technology
KodekX | Application Modernization Development
Approach and Philosophy of On baking technology
Cloud computing and distributed systems.
Advanced methodologies resolving dimensionality complications for autism neur...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Electronic commerce courselecture one. Pdf

Introduction to Vector search - Argmx talk