SlideShare a Scribd company logo
Crowdsourced object
segmentation with a game

Vincent Charvillat
Axel Carlier

Amaia Salvador Aguilera
Xavier Giró i Nieto

Oge Marques
Outline
• Motivation
• Object Segmentation
• Experiments
• Results
• Conclusions
• Ongoing work

2
Motivation

?

3
Motivation

?

4
Semi-Supervised object segmentation

Rough segmentation

• B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman. Labelme:
A database and web-based tool for image annotation. IJCV, 2008
5
Semi-Supervised object segmentation

•

•

P. Arbelaez and L. Cohen. Constrained image segmentation from hierarchical boundaries. In
CVPR'08, 2008.
2) K. McGuinness and N. E. O'Connor. A comparative evaluation of interactive segmentation
algorithms.

6
Semi-Supervised object segmentation

Boring task for users!

7
8
Games with a purpose

• J. Steggink and C. Snoek. Adding semantics to image-region annotations with the name-itgame. Multimedia Systems, 2011.
• L. von Ahn, R. Liu, and M. Blum. Peekaboom: a game for locating objects in images. In CHI'06,
2006.
9
Ask’nSeek

A. Carlier, O. Marques, and V. Charvillat. Ask'nseek: A new game for object detection and labeling. In
ECCV'12 Workshops 2012.
10
Motivation
?

11
Outline
• Motivation
• Object Segmentation
• Experiments
• Results
• Conclusions
• Next steps

12
Constrained parametric min-cuts for
automatic object segmentation (CPMC)

J. Carreira and C. Sminchisescu. Constrained parametric min-cuts for automatic object
segmentation. In CVPR'10, 2010.

13
Constrained parametric min-cuts for
automatic object segmentation

14
Motivation
CPMC

15
Outline
• Motivation
• Object Segmentation
• Experiments
• Results
• Conclusions
• Ongoing Work

16
Experiments
How many clicks do we need to achieve a certain quality in the
segmentation?

Test the algorithm for a large image dataset

17
Pascal VOC2010

1928 images divided in:
Train (964)
Validation (964)

18
Problem

Simulator

19
Simulator
• The simulator generates points using the ground truth of the image.

20
Simulator: Location of clicks

S. Goferman, L. Zelnik-Manor, and
A. Tal. Context-aware saliency
detection. PAMI, 2012.

21
Simulator: Foreground/Background ratio

22
Outline
• Motivation
• Object Segmentation
• Experiments
• Results
• Conclusions
• Ongoing Work

23
Jaccard index

Measure of similarity between the segmentation result and the ground truth mask
24
Results

Using Pascal
VOC2010
(Validation)

25
Results

Using Pascal
VOC2010
(Validation)

26
Outline
• Motivation
• Object Segmentation
• Experiments
• Results
• Conclusions
• Ongoing Work

27
Conclusions
• Realistic simulator to process large amounts of data.
• Estimation of the expected AVERAGE Jaccard index by clicks.
• Inter-class variance of results.

28
Ongoing Work

29
Ongoing Work
• Image segmentation
• CPMC candidates

●

●

30

Label propagation through
hierarchical partitions (eg. UCM,
BPT…)
Grabcut + Superpixels
Ongoing Work
• Data collection
• Awarded with $250 in CrowdMM Competition (ACM MM Barcelona 2013).

• Already more than 1500 games collected with 100 users

More on that in our poster!
31
Questions, suggestions…
Thank you for your attention

• Motivation
• Object Segmentation
• Experiments
• Results
• Conclusions
• Ongoing Work
32

More Related Content

PDF
G. Park, J.-Y. Yang, et. al., NeurIPS 2020, MLILAB, KAIST AI
PPTX
MediaEval 2016 - HUCVL Predicting Interesting Key Frames with Deep Models
PDF
Brian Mac Namee - Predict Webinar 3 - Short Intro to Deep Learing
PDF
When robots open their eyes
PDF
Where should saliency models look next ? (UPC Reading Group)
PPTX
Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications i...
PDF
Advanced iris recognition technology
G. Park, J.-Y. Yang, et. al., NeurIPS 2020, MLILAB, KAIST AI
MediaEval 2016 - HUCVL Predicting Interesting Key Frames with Deep Models
Brian Mac Namee - Predict Webinar 3 - Short Intro to Deep Learing
When robots open their eyes
Where should saliency models look next ? (UPC Reading Group)
Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications i...
Advanced iris recognition technology

Viewers also liked (19)

PDF
Interactive Image Processing Demos for the Web
PPTX
Visual instance mining of news videos using a graph-based approach
PDF
Photo Clustering of Social Events by Extending PhotoTOC to a Rich Context
PPT
Semantic HTML
PDF
Bundling interest points for object classification
PPT
A Presentation During Vacation Term Activities At Afro Caribbean Achievement ...
PDF
Describing videos by exploiting temporal structure
PPTX
Automatic Keyframe Selection based on Mutual Reinforcement Algorithm
PDF
Multimedia annotation (DCU 2016)
PDF
Rich Internet Application for Semi-Automatic Annotation of Semantic Shots on ...
PDF
Adaptive object detection using adjacency and zoom prediction
PDF
Recurrent Instance Segmentation (UPC Reading Group)
PDF
Part-based Object Retrieval with Binary Partition Trees
PDF
Improving Spatial Codification in Semantic Segmentation
PDF
Visual7W Grounded Question Answering in Images
PDF
Faces in Places: Compound Query Retrieval
PDF
Semantic and Diverse Summarization of Egocentric Photo Events
PPTX
The First Seminar
PDF
Dynamic memory networks for visual and textual question answering
Interactive Image Processing Demos for the Web
Visual instance mining of news videos using a graph-based approach
Photo Clustering of Social Events by Extending PhotoTOC to a Rich Context
Semantic HTML
Bundling interest points for object classification
A Presentation During Vacation Term Activities At Afro Caribbean Achievement ...
Describing videos by exploiting temporal structure
Automatic Keyframe Selection based on Mutual Reinforcement Algorithm
Multimedia annotation (DCU 2016)
Rich Internet Application for Semi-Automatic Annotation of Semantic Shots on ...
Adaptive object detection using adjacency and zoom prediction
Recurrent Instance Segmentation (UPC Reading Group)
Part-based Object Retrieval with Binary Partition Trees
Improving Spatial Codification in Semantic Segmentation
Visual7W Grounded Question Answering in Images
Faces in Places: Compound Query Retrieval
Semantic and Diverse Summarization of Egocentric Photo Events
The First Seminar
Dynamic memory networks for visual and textual question answering
Ad

Similar to Crowdsourced Object Segmentation with a Game (20)

PDF
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
PDF
Overview of ImageCLEF 2014
PPSX
The effects of visual realism on search tasks in mixed reality simulations-IE...
PPTX
[RSS2023] Local Object Crop Collision Network for Efficient Simulation
PDF
Salient KeypointSelection for Object Representation
PPTX
Using Deep Learning to Derive 3D Cities from Satellite Imagery
PDF
Exploiting User Interaction and Object Candidates for Instance Retrieval and ...
PPTX
Ml - A shallow dive
PPTX
[DL輪読会]ClearGrasp
PPTX
Transformer in Vision
PDF
Introduction talk to Computer Vision
PDF
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
PPTX
ExplainableAI.pptx
PDF
Predictive uncertainty of deep models and its applications
PPTX
Object Recognition
PDF
[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning
PDF
thesis
PPTX
Moving object detection in complex scene
PPTX
Cahall Final Intern Presentation
PPTX
Microsoft COCO: Common Objects in Context
HiPEAC 2019 Workshop - Real-Time Modelling Visual Scenes with Biological Insp...
Overview of ImageCLEF 2014
The effects of visual realism on search tasks in mixed reality simulations-IE...
[RSS2023] Local Object Crop Collision Network for Efficient Simulation
Salient KeypointSelection for Object Representation
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Exploiting User Interaction and Object Candidates for Instance Retrieval and ...
Ml - A shallow dive
[DL輪読会]ClearGrasp
Transformer in Vision
Introduction talk to Computer Vision
NVIDIA 深度學習教育機構 (DLI): Medical image segmentation using digits
ExplainableAI.pptx
Predictive uncertainty of deep models and its applications
Object Recognition
[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning
thesis
Moving object detection in complex scene
Cahall Final Intern Presentation
Microsoft COCO: Common Objects in Context
Ad

More from Universitat Politècnica de Catalunya (20)

PDF
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
PDF
Deep Generative Learning for All
PDF
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
PDF
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
PDF
The Transformer - Xavier Giró - UPC Barcelona 2021
PDF
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
PDF
Open challenges in sign language translation and production
PPTX
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
PPTX
Discovery and Learning of Navigation Goals from Pixels in Minecraft
PDF
Learn2Sign : Sign language recognition and translation using human keypoint e...
PDF
Intepretability / Explainable AI for Deep Neural Networks
PDF
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
PDF
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
PDF
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
PDF
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
PDF
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
PDF
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
PDF
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
PDF
Curriculum Learning for Recurrent Video Object Segmentation
PDF
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
The Transformer - Xavier Giró - UPC Barcelona 2021
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Open challenges in sign language translation and production
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Learn2Sign : Sign language recognition and translation using human keypoint e...
Intepretability / Explainable AI for Deep Neural Networks
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Curriculum Learning for Recurrent Video Object Segmentation
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020

Recently uploaded (20)

PDF
Hindi spoken digit analysis for native and non-native speakers
PPT
What is a Computer? Input Devices /output devices
PPTX
Tartificialntelligence_presentation.pptx
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPT
Geologic Time for studying geology for geologist
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
Hybrid model detection and classification of lung cancer
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PPTX
Benefits of Physical activity for teenagers.pptx
PDF
Zenith AI: Advanced Artificial Intelligence
Hindi spoken digit analysis for native and non-native speakers
What is a Computer? Input Devices /output devices
Tartificialntelligence_presentation.pptx
sustainability-14-14877-v2.pddhzftheheeeee
NewMind AI Weekly Chronicles – August ’25 Week III
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
O2C Customer Invoices to Receipt V15A.pptx
A comparative study of natural language inference in Swahili using monolingua...
Geologic Time for studying geology for geologist
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Hybrid model detection and classification of lung cancer
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Enhancing emotion recognition model for a student engagement use case through...
Univ-Connecticut-ChatGPT-Presentaion.pdf
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Group 1 Presentation -Planning and Decision Making .pptx
Benefits of Physical activity for teenagers.pptx
Zenith AI: Advanced Artificial Intelligence

Crowdsourced Object Segmentation with a Game