SlideShare a Scribd company logo
Exploiting User Interaction and Object Candidates for Instance Retrieval and Object Segmentation 
Amaia Salvador Aguilera 
Advisors: Kevin McGuinness, Xavier Giró
Motivation 
Object segmentation 
Instance Search 
1
Motivation 
From rectangles to regions 
Exhaustive search 
Regions 
Object proposals 
2
Motivation 
From rectangles to regions 
Exhaustive search 
Regions 
Object proposals 
Arbeláez, P., Pont-Tuset, J., Barron, J. T., Marques, F., & Malik, J. (2014). Multiscale Combinatorial Grouping. CVPR. 3
Motivation 
4
Motivation 
5
Motivation 
6
Outline 
●Motivation 
●Interactive Object Segmentation 
oSegmentation algorithm 
oUser Interface 
oExperiments and Results 
oConclusions 
●Instance Retrieval 
7
Interactive Object Segmentation 
Amaia Salvador Xavier GiróAxel Carlier Vincent Charvillat Oge Marques 
Collaborationsponsoredbya scholarshipfromtheSpanishMinistry 8
Outline 
●Motivation 
●Interactive Object Segmentation 
oSegmentation algorithm 
oUser Interface 
oExperiments and Results 
oConclusions 
●Instance Retrieval 
9
Object Candidates 
Algorithm 1: Finding the best mask 
Limitation: sometimes there is no mask with no errors! 10
Object Candidates 
Algorithm 2: Combination of masks 
Union of masks containing at least one foreground point and no conflicting background points 
11
Outline 
●Motivation 
●Interactive Object Segmentation 
oSegmentation algorithm 
oUser Interface 
oExperiments and Results 
oConclusions 
●Instance Retrieval 
12
Click’n’Cut 
13
Outline 
●Motivation 
●Interactive Object Segmentation 
oSegmentation algorithm 
oUser Interface 
oExperiments and Results 
oConclusions 
●Instance Retrieval 
14
Dataset 
100 objects from BSDS500 Dataset 
with their corresponding binary masks 
5 control images from Pascal VOC2012 
(for gold standard test) 
15
Click’n’Cut and Object Candidates 
Crowd users 
Expert volunteers 
16
Jaccard Index VS Time 
(*) BPT and GrabCut curves show the results obtained by McGuinness, K., & O’connor, N. E. in A comparative evaluation of interactive segmentation algorithms. Pattern Recognition, 43(2), 434-444. 
VS 
McGuinness(BPT) 
McGuinness(GrabCut) 
17
Outline 
●Motivation 
●Interactive Object Segmentation 
oSegmentation algorithm 
oUser Interface 
oExperiments and Results 
oConclusions 
●Instance Retrieval 
18
Conclusions 
19
Conclusions 
e.g. GrabCut 
20
Conclusions 
21
Outline 
●Motivation 
●Interactive Object Segmentation 
●Instance Retrieval 
oFramework 
oUser interface and Relevance Feedback 
oExperiments and Results 
oConclusions 
22
TRECVID Instance Search 2014 
Amaia Salvador 
Xavier Giró 
Carles Ventura 
Eva Mohedano 
Kevin McGuinness 
Noel O’Connor 
ContributionsponsoredbytheCatalanGovernment 
23
Outline 
●Motivation 
●Interactive Object Segmentation 
●Instance Retrieval 
oFramework 
oUser interface 
oExperiments and Results 
oConclusions 
24
Framework 
query / query set 
Representation 
target database 
Representation 
Feature Matching 
Rank List 
ID 
score 
query images 
masks 
25
Framework 
query / query set 
Representation 
target database 
Representation 
Feature Matching 
Rank List 
ID 
score 
query images 
masks 
26
Query Set 
27
Framework 
query / query set 
Representation 
target database 
Representation 
Feature Matching 
Rank List 
ID 
score 
query images 
masks 
28
Target database 
●Collection of 244 videos from BBC EastEnders 
29
Target database 
Full dataset 
647,628 keyframes 
Ground Truth Subset 
23,614 keyframes 
30
Framework 
query / query set 
Representation 
target database 
Representation 
Feature Matching 
Rank List 
ID 
score 
query images 
masks 
31
Convolutional Neural Networks 
Caffe: a CNN implementation. http://caffe.berkeleyvision.org/ 
Figure Source: Babenko, A., Slesarev, A., Chigorin, A., & Lempitsky, V. (2014). Neural CodesforImageRetrieval.arXivpreprintarXiv:1404.1777. 
32
Framework 
query / query set 
Representation 
target database 
Representation 
Feature Matching 
Rank List 
ID 
score 
query images 
masks 
33
Outline 
●Motivation 
●Interactive Object Segmentation 
●Instance Retrieval 
oFramework 
oUser interface 
oExperiments and Results 
oConclusions 
34
User Interface 
35
Outline 
●Motivation 
●Interactive Object Segmentation 
●Instance Retrieval 
oFramework 
oUser interface 
oExperiments and Results 
oConclusions 
36
1.Local Features 
Global CNN features are nice, but… 
...what if we use binary masks to compute local CNN features? 
Problem: we don’t have binary masks for the target keyframes! 
37
Local Features: Object Candidates 
Arbeláez, P., Pont-Tuset, J., Barron, J. T., Marques, F., & Malik, J. (2014). Multiscale Combinatorial Grouping. CVPR. 38
Local features 
Global 
Local Only(square) 
Local Only(square+ padding) 
Local only(crop) 
Global Only 
Crop 
Square 
Square 
+ padding 
Resultsontoysubset 
39
2.Combination of features 
Results on Ground Truth Datasetwith N = 20 40
2.Combination of features 
Problems: 
~100 object candidates/frame 
~ 600,000 keyframes 
41
2.Combination of features 
Solution: 
42
3.Relevance Feedback 
Simulation of users interaction with the UI using the Ground Truth Subset 
Annotations from the ranking are used with Relevance Feedback techniques taking different percentages of the ranking. 
43
Ranking fromannotations 
Positive annotationsare usedto createthenew ranking 
44
Ranking fromannotations 
Addresultsto theranking byre-queryingusingpositive annotations 
45
Ranking fromannotations 
Linear SVM withpositive/negativeannotations 
46
4.Performance differences 
9074: a cigarette 
9075: a Vodka bottle 
9096: this woman 
Bad performance for topics containing small objects or people 
47
4.Performance differences 
9090: this wooden bench 
9097: these spheres 
9081: a black taxi 
Good performance for topics for which context is important 
48
Outline 
●Motivation 
●Interactive Object Segmentation 
●Instance Retrieval 
oFramework 
oUser interface 
oExperiments and Results 
oConclusions 
49
Conclusions 
50
Conclusions 
51
General Conclusions 
ObjectSegmentation 
InstanceSearch 
52
Contributions 
ObjectSegmentation 
InstanceSearch 
53
Exploiting User Interaction and Object Candidates for Instance Retrieval and Object Segmentation
Exploiting User Interaction and Object Candidates for Instance Retrieval and Object Segmentation

More Related Content

PPTX
James m Bockarie presentation
PPTX
Wang midterm-defence
PDF
OpenCV DNN module vs. Ours method
PDF
Crowdsourced Object Segmentation with a Game
PDF
Content-Based Image Retrieval Features: A Survey
PDF
Comparative Study of Object Detection Algorithms
PDF
Finding the best solution for Image Processing
PPTX
User Interface for an Image Retrieval Engine System
James m Bockarie presentation
Wang midterm-defence
OpenCV DNN module vs. Ours method
Crowdsourced Object Segmentation with a Game
Content-Based Image Retrieval Features: A Survey
Comparative Study of Object Detection Algorithms
Finding the best solution for Image Processing
User Interface for an Image Retrieval Engine System

Similar to Exploiting User Interaction and Object Candidates for Instance Retrieval and Object Segmentation (20)

PPTX
Image Segmentation Using Deep Learning : A survey
PDF
Content Based Image Retrieval
PDF
Region-oriented Convolutional Networks for Object Retrieval
PDF
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
PPTX
cbir and other trends like iot, virtual reality etc
PPTX
Deep image retrieval - learning global representations for image search - ub ...
PPTX
Graph R-CNN for Scene Graph Generation
PDF
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
PDF
Co-filtering human interaction and object segmentation
PDF
Deep image retrieval learning global representations for image search
PPTX
IntroComputerVision23.pptx
PDF
2016 MediaEval - Interestingness Task Overview
PDF
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
PDF
Interactive Video Search: Where is the User in the Age of Deep Learning?
PPTX
Content Based Image and Video Retrieval Algorithm
PPTX
Evolving a Medical Image Similarity Search
PDF
Content Based Image Retrieval An Assessment
PDF
Convolutional Features for Instance Search
PDF
META-HEURISTICS BASED ARF OPTIMIZATION FOR IMAGE RETRIEVAL
PDF
Content-based Image Retrieval - Eva Mohedano - UPC Barcelona 2018
Image Segmentation Using Deep Learning : A survey
Content Based Image Retrieval
Region-oriented Convolutional Networks for Object Retrieval
IRJET- Identification of Scene Images using Convolutional Neural Networks - A...
cbir and other trends like iot, virtual reality etc
Deep image retrieval - learning global representations for image search - ub ...
Graph R-CNN for Scene Graph Generation
Content-Based Image Retrieval (D2L6 Insight@DCU Machine Learning Workshop 2017)
Co-filtering human interaction and object segmentation
Deep image retrieval learning global representations for image search
IntroComputerVision23.pptx
2016 MediaEval - Interestingness Task Overview
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Interactive Video Search: Where is the User in the Age of Deep Learning?
Content Based Image and Video Retrieval Algorithm
Evolving a Medical Image Similarity Search
Content Based Image Retrieval An Assessment
Convolutional Features for Instance Search
META-HEURISTICS BASED ARF OPTIMIZATION FOR IMAGE RETRIEVAL
Content-based Image Retrieval - Eva Mohedano - UPC Barcelona 2018
Ad

More from Universitat Politècnica de Catalunya (20)

PDF
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
PDF
Deep Generative Learning for All
PDF
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
PDF
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
PDF
The Transformer - Xavier Giró - UPC Barcelona 2021
PDF
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
PDF
Open challenges in sign language translation and production
PPTX
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
PPTX
Discovery and Learning of Navigation Goals from Pixels in Minecraft
PDF
Learn2Sign : Sign language recognition and translation using human keypoint e...
PDF
Intepretability / Explainable AI for Deep Neural Networks
PDF
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
PDF
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
PDF
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
PDF
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
PDF
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
PDF
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
PDF
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
PDF
Curriculum Learning for Recurrent Video Object Segmentation
PDF
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
The Transformer - Xavier Giró - UPC Barcelona 2021
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Open challenges in sign language translation and production
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Learn2Sign : Sign language recognition and translation using human keypoint e...
Intepretability / Explainable AI for Deep Neural Networks
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Curriculum Learning for Recurrent Video Object Segmentation
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Ad

Recently uploaded (20)

PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Machine learning based COVID-19 study performance prediction
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Machine Learning_overview_presentation.pptx
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Approach and Philosophy of On baking technology
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
NewMind AI Weekly Chronicles - August'25-Week II
MIND Revenue Release Quarter 2 2025 Press Release
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Per capita expenditure prediction using model stacking based on satellite ima...
MYSQL Presentation for SQL database connectivity
Machine learning based COVID-19 study performance prediction
Unlocking AI with Model Context Protocol (MCP)
Building Integrated photovoltaic BIPV_UPV.pdf
Machine Learning_overview_presentation.pptx
Diabetes mellitus diagnosis method based random forest with bat algorithm
Approach and Philosophy of On baking technology
20250228 LYD VKU AI Blended-Learning.pptx
A comparative analysis of optical character recognition models for extracting...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Digital-Transformation-Roadmap-for-Companies.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
The AUB Centre for AI in Media Proposal.docx
Dropbox Q2 2025 Financial Results & Investor Presentation
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf

Exploiting User Interaction and Object Candidates for Instance Retrieval and Object Segmentation