SlideShare a Scribd company logo
CVPR 2018 Review
In Perspective of Image Retrieval
Naver
Vision / Insik Kim
Vision / YuKyung Choi
More complete review (internal only): https://oss.navercorp.com/insikk/public/issues/2
Contents
• Background on Image Retrieval
• Interesting “CVPR 2018” papers
• Image Embedding (Metric Learning)
• Similarity Search
Background
Image Retrieval
• Application: Naver SmartLens (스마트렌즈)
• Search Query as Image
• Find relevant information
• Products
• Flowers, Plants
• Clothes
• Lotto
• Food
What people do
Similarity Search System
Approximate Nearest Neighbor Search
PQ Encoding
Query Expansion and Reranking (RANSAC)
Image Embedding
Representation Learning
Metric Learning
What people do
• State-of-the-art Image Retrieval Models use end-to-end image
representation learning
What people do
• Representation Learning
• Feature Aggregation
• Or, Learn everything in
End-to-end
Who are they?
• Diane Larlus (Naver Labs Europe)
• Hervé Jégou (Facebook AI Research)
• Ondrej Chum (VRG Group, Czech Technical University in Prague)
• Other groups
• Inria
• Alibaba
• SenseTime
Task & Dataset
Task Dataset
Instance Retrieval (Retail Product) INSTRE
Instance Retrieval (Building, Landmark Retrieval) ROxford, RParis,
Google Landmark (DeLF paper)
Instance Retrieval (Fashion Items) DeepFashion
Person Re-identification Market-1501, DukeMTMC
• We can also get ideas from
• Face verification, recognition
• Fine grained visual classification (FGVC)
• Of course from general computer vision fields
• Detection, Classification, Segmentation
• Image Matching
Image Embedding
• Image -> n-dim feature vector
• 𝐼 → 𝜙 𝐼
• Answer to how similar given two
images are?
• 𝑆 𝜙 𝐼 𝑝 , 𝜙 𝐼 𝑞
• Cosine similarity
• Learned similarity function
• Let’s apply deep learning
• Deep Metric Learning
ImageNet Feature from AlexNet, tSNE visualization
Metric Learning
Metric Learning
Idea:
• Additional constraint
• Exploit mini-batch
• CRF to model group consistency
• Estimate a pair of image
similarity from group
consistency
Effect:
• Achieve SOTA (State-of-the-art)
on Market-1501, DukeMTMC
Metric Learning
Idea:
• Triplet Loss with latent
examples
• Learning latent example
Effect:
• Robust to noise
• Faster training
Metric Learning
Idea:
• Hard positive, negative image mining
• Density information (Random Walk)
Effect:
• Good for triplet loss training
Metric Learning
Idea:
• Unsupervised feature learning
with discrimination
• N training example, N – way
softmax classification
• Direct comparison with dot-
product
Effect:
• Semi-Supervised learning
(use less labeled data)
Metric Learning
Idea:
• Unsupervised feature learning
with discrimination
• N training example, N – way
softmax classification
• Direct comparison with dot-
product
Effect:
• Semi-Supervised learning
(use less labeled data)
Similarity Search
Finding nearest neighbors from 1,000,000,000 images (1B scale)
What people do
Inverted File, Multi Index, Product Quantization
TPAMI 2010 CVPR 2012
Similarity Search
Idea:
• Learning coarse quantizer
in supervised manner
• Training requires
classification label
Effect:
• Learned binning works
better than unsupervised
one (e.g. IMI)
Similarity Search
Idea:
• Learning coarse quantizer
in supervised manner
• Training requires
classification label
Effect:
• Learned binning works
better than unsupervised
one (e.g. IMI)
Similarity Search
https://github.com/facebookresearch/faiss
CoRR 2016
Idea:
• Local descriptor. 1000 features / image. 8 bytes / feature.
=> Global descriptor. 1 feature / image. 128 bytes / feature
• HNSW (1M scale)
• Find nn with less comparison
• Compact representation with novel regression
Effects:
• HNSW 1M -> 1B scale
Similarity Search
https://github.com/facebookresearch/faiss
Idea:
• Local descriptor. 1000 features / image. 8 bytes / feature.
=> Global descriptor. 1 feature / image. 128 bytes / feature
• HNSW (1M scale)
• Find nn with less comparison
• Compact representation with novel regression
Effects:
• HNSW 1M -> 1B scale

More Related Content

PDF
[ADMA 2017] Identification of Grey Sheep Users By Histogram Intersection In R...
PPTX
Discovering Common Motifs in Cursor Movement Data
PPTX
Big Data 101 - An introduction
PPTX
Big Data Real Time Training in Chennai
PPSX
Image Search: Then and Now
PPT
Örüntü tanıma - Pattern Recognition
PPTX
machine learning
PPTX
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
[ADMA 2017] Identification of Grey Sheep Users By Histogram Intersection In R...
Discovering Common Motifs in Cursor Movement Data
Big Data 101 - An introduction
Big Data Real Time Training in Chennai
Image Search: Then and Now
Örüntü tanıma - Pattern Recognition
machine learning
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com

Similar to [CVPR 2018] Visual Search (Image Retrieval) and Metric Learning (20)

PDF
H2O with Erin LeDell at Portland R User Group
PPTX
Олександр Обєдніков “Рекомендательные системы”
PPTX
Learning a Joint Embedding Representation for Image Search using Self-supervi...
PDF
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
PPTX
Choosing the right crowd. Expert finding in social networks. edbt 2013
PPTX
Evolving a Medical Image Similarity Search
PDF
Natural Intelligence the human factor in AI
PDF
Advances in Image Search and Retrieval
PDF
Survey Research in Software Engineering
PPTX
L 8 introduction to machine learning final kirti.pptx
PDF
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
PPS
MLforIR.pps
PPTX
Demystifying Machine Learning and Artificial Intelligence
PDF
Introduction to Machine Learning
PPT
Machine learning introduction to unit 1.ppt
PDF
Hacking Predictive Modeling - RoadSec 2018
PDF
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
PPT
Lecture: introduction to Machine Learning.ppt
PPTX
Building High Available and Scalable Machine Learning Applications
PPTX
Machine Learning 2 deep Learning: An Intro
H2O with Erin LeDell at Portland R User Group
Олександр Обєдніков “Рекомендательные системы”
Learning a Joint Embedding Representation for Image Search using Self-supervi...
لموعد الإثنين 03 يناير 2022 143 مبادرة #تواصل_تطوير المحاضرة ال 143 من المباد...
Choosing the right crowd. Expert finding in social networks. edbt 2013
Evolving a Medical Image Similarity Search
Natural Intelligence the human factor in AI
Advances in Image Search and Retrieval
Survey Research in Software Engineering
L 8 introduction to machine learning final kirti.pptx
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
MLforIR.pps
Demystifying Machine Learning and Artificial Intelligence
Introduction to Machine Learning
Machine learning introduction to unit 1.ppt
Hacking Predictive Modeling - RoadSec 2018
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
Lecture: introduction to Machine Learning.ppt
Building High Available and Scalable Machine Learning Applications
Machine Learning 2 deep Learning: An Intro
Ad

More from NAVER Engineering (20)

PDF
React vac pattern
PDF
디자인 시스템에 직방 ZUIX
PDF
진화하는 디자인 시스템(걸음마 편)
PDF
서비스 운영을 위한 디자인시스템 프로젝트
PDF
BPL(Banksalad Product Language) 무야호
PDF
이번 생에 디자인 시스템은 처음이라
PDF
날고 있는 여러 비행기 넘나 들며 정비하기
PDF
쏘카프레임 구축 배경과 과정
PDF
플랫폼 디자이너 없이 디자인 시스템을 구축하는 프로덕트 디자이너의 우당탕탕 고통 연대기
PDF
200820 NAVER TECH CONCERT 15_Code Review is Horse(코드리뷰는 말이야)(feat.Latte)
PDF
200819 NAVER TECH CONCERT 03_화려한 코루틴이 내 앱을 감싸네! 코루틴으로 작성해보는 깔끔한 비동기 코드
PDF
200819 NAVER TECH CONCERT 10_맥북에서도 아이맥프로에서 빌드하는 것처럼 빌드 속도 빠르게 하기
PDF
200819 NAVER TECH CONCERT 08_성능을 고민하는 슬기로운 개발자 생활
PDF
200819 NAVER TECH CONCERT 05_모르면 손해보는 Android 디버깅/분석 꿀팁 대방출
PDF
200819 NAVER TECH CONCERT 09_Case.xcodeproj - 좋은 동료로 거듭나기 위한 노하우
PDF
200820 NAVER TECH CONCERT 14_야 너두 할 수 있어. 비전공자, COBOL 개발자를 거쳐 네이버에서 FE 개발하게 된...
PDF
200820 NAVER TECH CONCERT 13_네이버에서 오픈 소스 개발을 통해 성장하는 방법
PDF
200820 NAVER TECH CONCERT 12_상반기 네이버 인턴을 돌아보며
PDF
200820 NAVER TECH CONCERT 11_빠르게 성장하는 슈퍼루키로 거듭나기
PDF
200819 NAVER TECH CONCERT 07_신입 iOS 개발자 개발업무 적응기
React vac pattern
디자인 시스템에 직방 ZUIX
진화하는 디자인 시스템(걸음마 편)
서비스 운영을 위한 디자인시스템 프로젝트
BPL(Banksalad Product Language) 무야호
이번 생에 디자인 시스템은 처음이라
날고 있는 여러 비행기 넘나 들며 정비하기
쏘카프레임 구축 배경과 과정
플랫폼 디자이너 없이 디자인 시스템을 구축하는 프로덕트 디자이너의 우당탕탕 고통 연대기
200820 NAVER TECH CONCERT 15_Code Review is Horse(코드리뷰는 말이야)(feat.Latte)
200819 NAVER TECH CONCERT 03_화려한 코루틴이 내 앱을 감싸네! 코루틴으로 작성해보는 깔끔한 비동기 코드
200819 NAVER TECH CONCERT 10_맥북에서도 아이맥프로에서 빌드하는 것처럼 빌드 속도 빠르게 하기
200819 NAVER TECH CONCERT 08_성능을 고민하는 슬기로운 개발자 생활
200819 NAVER TECH CONCERT 05_모르면 손해보는 Android 디버깅/분석 꿀팁 대방출
200819 NAVER TECH CONCERT 09_Case.xcodeproj - 좋은 동료로 거듭나기 위한 노하우
200820 NAVER TECH CONCERT 14_야 너두 할 수 있어. 비전공자, COBOL 개발자를 거쳐 네이버에서 FE 개발하게 된...
200820 NAVER TECH CONCERT 13_네이버에서 오픈 소스 개발을 통해 성장하는 방법
200820 NAVER TECH CONCERT 12_상반기 네이버 인턴을 돌아보며
200820 NAVER TECH CONCERT 11_빠르게 성장하는 슈퍼루키로 거듭나기
200819 NAVER TECH CONCERT 07_신입 iOS 개발자 개발업무 적응기
Ad

Recently uploaded (20)

PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Machine Learning_overview_presentation.pptx
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
Cloud computing and distributed systems.
PPTX
Spectroscopy.pptx food analysis technology
PDF
Empathic Computing: Creating Shared Understanding
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Machine learning based COVID-19 study performance prediction
PDF
A comparative analysis of optical character recognition models for extracting...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Machine Learning_overview_presentation.pptx
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Spectral efficient network and resource selection model in 5G networks
Cloud computing and distributed systems.
Spectroscopy.pptx food analysis technology
Empathic Computing: Creating Shared Understanding
Review of recent advances in non-invasive hemoglobin estimation
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Big Data Technologies - Introduction.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
The AUB Centre for AI in Media Proposal.docx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Building Integrated photovoltaic BIPV_UPV.pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
MYSQL Presentation for SQL database connectivity
Machine learning based COVID-19 study performance prediction
A comparative analysis of optical character recognition models for extracting...

[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning

  • 1. CVPR 2018 Review In Perspective of Image Retrieval Naver Vision / Insik Kim Vision / YuKyung Choi More complete review (internal only): https://oss.navercorp.com/insikk/public/issues/2
  • 2. Contents • Background on Image Retrieval • Interesting “CVPR 2018” papers • Image Embedding (Metric Learning) • Similarity Search
  • 4. Image Retrieval • Application: Naver SmartLens (스마트렌즈) • Search Query as Image • Find relevant information • Products • Flowers, Plants • Clothes • Lotto • Food
  • 5. What people do Similarity Search System Approximate Nearest Neighbor Search PQ Encoding Query Expansion and Reranking (RANSAC) Image Embedding Representation Learning Metric Learning
  • 6. What people do • State-of-the-art Image Retrieval Models use end-to-end image representation learning
  • 7. What people do • Representation Learning • Feature Aggregation • Or, Learn everything in End-to-end
  • 8. Who are they? • Diane Larlus (Naver Labs Europe) • Hervé Jégou (Facebook AI Research) • Ondrej Chum (VRG Group, Czech Technical University in Prague) • Other groups • Inria • Alibaba • SenseTime
  • 9. Task & Dataset Task Dataset Instance Retrieval (Retail Product) INSTRE Instance Retrieval (Building, Landmark Retrieval) ROxford, RParis, Google Landmark (DeLF paper) Instance Retrieval (Fashion Items) DeepFashion Person Re-identification Market-1501, DukeMTMC • We can also get ideas from • Face verification, recognition • Fine grained visual classification (FGVC) • Of course from general computer vision fields • Detection, Classification, Segmentation • Image Matching
  • 10. Image Embedding • Image -> n-dim feature vector • 𝐼 → 𝜙 𝐼 • Answer to how similar given two images are? • 𝑆 𝜙 𝐼 𝑝 , 𝜙 𝐼 𝑞 • Cosine similarity • Learned similarity function • Let’s apply deep learning • Deep Metric Learning ImageNet Feature from AlexNet, tSNE visualization
  • 12. Metric Learning Idea: • Additional constraint • Exploit mini-batch • CRF to model group consistency • Estimate a pair of image similarity from group consistency Effect: • Achieve SOTA (State-of-the-art) on Market-1501, DukeMTMC
  • 13. Metric Learning Idea: • Triplet Loss with latent examples • Learning latent example Effect: • Robust to noise • Faster training
  • 14. Metric Learning Idea: • Hard positive, negative image mining • Density information (Random Walk) Effect: • Good for triplet loss training
  • 15. Metric Learning Idea: • Unsupervised feature learning with discrimination • N training example, N – way softmax classification • Direct comparison with dot- product Effect: • Semi-Supervised learning (use less labeled data)
  • 16. Metric Learning Idea: • Unsupervised feature learning with discrimination • N training example, N – way softmax classification • Direct comparison with dot- product Effect: • Semi-Supervised learning (use less labeled data)
  • 17. Similarity Search Finding nearest neighbors from 1,000,000,000 images (1B scale)
  • 19. Inverted File, Multi Index, Product Quantization TPAMI 2010 CVPR 2012
  • 20. Similarity Search Idea: • Learning coarse quantizer in supervised manner • Training requires classification label Effect: • Learned binning works better than unsupervised one (e.g. IMI)
  • 21. Similarity Search Idea: • Learning coarse quantizer in supervised manner • Training requires classification label Effect: • Learned binning works better than unsupervised one (e.g. IMI)
  • 22. Similarity Search https://github.com/facebookresearch/faiss CoRR 2016 Idea: • Local descriptor. 1000 features / image. 8 bytes / feature. => Global descriptor. 1 feature / image. 128 bytes / feature • HNSW (1M scale) • Find nn with less comparison • Compact representation with novel regression Effects: • HNSW 1M -> 1B scale
  • 23. Similarity Search https://github.com/facebookresearch/faiss Idea: • Local descriptor. 1000 features / image. 8 bytes / feature. => Global descriptor. 1 feature / image. 128 bytes / feature • HNSW (1M scale) • Find nn with less comparison • Compact representation with novel regression Effects: • HNSW 1M -> 1B scale