SlideShare a Scribd company logo
Exploiting disagreement
through open- ended tasks for
capturing interpretation spaces
Doctoral Consortium
By /Benjamin Timmermans @8w
Outline
Introduction
State of the Art
Problem Statement
Methodology
Preliminary Results
Conclusions
Introduction
Exploiting disagreement through open ended tasks for capturing interpretation spaces
How many dogs were in the picture?
There is no universal "truth"
For the training, testing and evaluation
of machines we rely on a...
ground "truth"
State of the Art
Crowdsourcing Approach
1-3 annotators
Evaluate workers
Inner-annotator agreement
Use test questions
Predefined answer choices
Exploiting disagreement through open ended tasks for capturing interpretation spaces
The CrowdTruth Approach
10-15 annotators
Evaluate the input, annotations and workers
Disagreement-based analytics
Problem Statement
Problems with multimedia annotations
Are sparse
Are homogeneous
Do not represent everything that can be heard or seen
Problems with crowdsourcing tasks
Are designed to stimulate agreement
Assumes answers are right or wrong
Closed task
How many beams do you see?
1
2
3
4
5
1 1
2
3
4
5
Open- ended tasks
How many beams do you see?
Exploiting disagreement through open ended tasks for capturing interpretation spaces
Gathering the interpretation space of multimedia through
open-ended crowdsourcing tasks
Goal
More efficient crowdsourcing
Higher quality ground truth data
Improved search and discovery of multimedia
Are open-ended crowdsourcing tasks a feasible method for
capturing the interpretation space of multimedia?
Research Question
Methodology
1. Improving quality evaluation
Comparing Closed and open-ended tasks
Measure worker confidence
2. Improving open- ended task design
Combine constrains with open-ended designs
Showing known annotations
Detecting the distribution of answers
3. Applying the ground "truth"
Compare different contexts
Improve indexing of multimedia
Preliminary Results
Gathering training data
for IBM Watson
Range of tasks
Passage Justification
Passage Alignment
Distributional disambiguation
Sound Interpretations
2.133 short sounds
Top 5000 search terms = 11 mil. searches
Sound tag overlap
Conclusions
There is no ultimate "truth"
Do not stimulate agreement
Capture the interpretation space
Use open-ended crowdsourcing tasks
Evaluation more difficult
Who we are
Lora Aroyo Robert-Jan Sips Chris Welty
Oana Inel Anca Dumitrache Benjamin
Timmermans
Acknowledgements
Supervisor: Dr. Lora Aroyo
Mentor: Dr. Matteo Palmonari
CrowdTruth.org
Benjamin Timmermans
btimmermans.com
b.timmermans@vu.nl
 @8w

More Related Content

PDF
Truth is a Lie: 7 Myths about Human Annotation @CogComputing Forum 2014
PDF
Crowds & Niches Teaching Machines to Diagnose: NLeSC Kick off eHumanities pr...
PDF
WebSci2013 Harnessing Disagreement in Crowdsourcing
PDF
(Presentation Chris) Crowdsourcing & Semantic Web: Dagstuhl 2014
PDF
Crowdsourcing & Semantic Web: Dagstuhl 2014 (Presentation Lora)
DOCX
Chapter 5 – quiz 5 instructions   in most cases the topic area
DOCX
Chapter 8 comparing and contrasting computers and technology
PPTX
Buttons on forms and surveys: a look at some research 2012
Truth is a Lie: 7 Myths about Human Annotation @CogComputing Forum 2014
Crowds & Niches Teaching Machines to Diagnose: NLeSC Kick off eHumanities pr...
WebSci2013 Harnessing Disagreement in Crowdsourcing
(Presentation Chris) Crowdsourcing & Semantic Web: Dagstuhl 2014
Crowdsourcing & Semantic Web: Dagstuhl 2014 (Presentation Lora)
Chapter 5 – quiz 5 instructions   in most cases the topic area
Chapter 8 comparing and contrasting computers and technology
Buttons on forms and surveys: a look at some research 2012

Viewers also liked (20)

PPTX
ESWC - PhD Symposium 2016
PPT
Visualization of Disagreement-based Quality Metrics of Crowdsourcing Data
PDF
Crowdsourcing Disagreement on Open-Domain Questions
PDF
Utilizing Social Health Websites for Cognitive Computing and Clinical Decisio...
PDF
Gamification of crowdsourcing tasks: What motivates a medical expert?
PDF
Truth is a Lie: Rules & Semantics from Crowd Perspectives (RR'2015 Keynote)
PDF
Towards Better Media Understanding and Searchability
PDF
Dive+@ICTOpen2017
PPTX
Dive+ NL eScience symposium 2015
PDF
CrowdTruth Games @NLeSc eHumanities day 2015
PDF
Boosting Named Entity Extraction through Crowdsourcing
PDF
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
PPTX
DIVE Semantic Web Challenge Presentation
PDF
Genuine semantic publishing
PDF
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
PDF
Truth is a Lie - 7 Myths of Human Annotation
PDF
Defining spacial representations for the meaning of sounds
PPT
Harnessing the Power of Machines & Crowds for Event Extraction
PDF
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
ESWC - PhD Symposium 2016
Visualization of Disagreement-based Quality Metrics of Crowdsourcing Data
Crowdsourcing Disagreement on Open-Domain Questions
Utilizing Social Health Websites for Cognitive Computing and Clinical Decisio...
Gamification of crowdsourcing tasks: What motivates a medical expert?
Truth is a Lie: Rules & Semantics from Crowd Perspectives (RR'2015 Keynote)
Towards Better Media Understanding and Searchability
Dive+@ICTOpen2017
Dive+ NL eScience symposium 2015
CrowdTruth Games @NLeSc eHumanities day 2015
Boosting Named Entity Extraction through Crowdsourcing
SXSW2017 @NewDutchMedia Talk: Exploration is the New Search
DIVE Semantic Web Challenge Presentation
Genuine semantic publishing
Europeana GA 2016: Harnessing Crowds, Niches & Professionals in the Digital Age
Truth is a Lie - 7 Myths of Human Annotation
Defining spacial representations for the meaning of sounds
Harnessing the Power of Machines & Crowds for Event Extraction
Stitch by Stitch: Annotating Fashion at the Rijksmuseum
Ad

Similar to Exploiting disagreement through open ended tasks for capturing interpretation spaces (20)

PDF
2 Studies UX types should know about (Straub UXPA unconference13)
PPTX
Manual Testing is Dead. Long Live Manual Testing
PPT
Ofquals reliability of results programme
PDF
What does collecting better data mean, and how to achieve it?
PDF
Startup Weekend - Validate Your Idea, Crash Course in User Research
PPTX
L7 Usability testing lecture of usability
PPT
7027203.ppt
PPT
Seeing the Forest for the Trees: Visual Problem Solving as a Design and Usabi...
PPTX
Principles of survey research
PPTX
survey research
PDF
Investigating Crowdsourcing as an Evaluation Method for (TEL) Recommender Sy...
PDF
Audience Research on a Dime - Nonprofit of Influence
PPTX
IODA - The Promise & Perils of Narrative Research
PDF
Rinse and Repeat : The Spiral of Applied Machine Learning
PDF
UI/UX Foundations - Research
PPT
Research Writing Survey
PPTX
Validation and mechanism: exploring the limits of evaluation
PDF
Writing surveys that work
PPTX
Writing surveys that work
PDF
Peer reviews
2 Studies UX types should know about (Straub UXPA unconference13)
Manual Testing is Dead. Long Live Manual Testing
Ofquals reliability of results programme
What does collecting better data mean, and how to achieve it?
Startup Weekend - Validate Your Idea, Crash Course in User Research
L7 Usability testing lecture of usability
7027203.ppt
Seeing the Forest for the Trees: Visual Problem Solving as a Design and Usabi...
Principles of survey research
survey research
Investigating Crowdsourcing as an Evaluation Method for (TEL) Recommender Sy...
Audience Research on a Dime - Nonprofit of Influence
IODA - The Promise & Perils of Narrative Research
Rinse and Repeat : The Spiral of Applied Machine Learning
UI/UX Foundations - Research
Research Writing Survey
Validation and mechanism: exploring the limits of evaluation
Writing surveys that work
Writing surveys that work
Peer reviews
Ad

Recently uploaded (20)

PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PPTX
Cell Membrane: Structure, Composition & Functions
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
Microbiology with diagram medical studies .pptx
PDF
. Radiology Case Scenariosssssssssssssss
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
The scientific heritage No 166 (166) (2025)
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
BIOMOLECULES PPT........................
PDF
Sciences of Europe No 170 (2025)
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
2. Earth - The Living Planet earth and life
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
The KM-GBF monitoring framework – status & key messages.pptx
Cell Membrane: Structure, Composition & Functions
Phytochemical Investigation of Miliusa longipes.pdf
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
7. General Toxicologyfor clinical phrmacy.pptx
ECG_Course_Presentation د.محمد صقران ppt
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
Taita Taveta Laboratory Technician Workshop Presentation.pptx
Microbiology with diagram medical studies .pptx
. Radiology Case Scenariosssssssssssssss
Biophysics 2.pdffffffffffffffffffffffffff
The scientific heritage No 166 (166) (2025)
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
BIOMOLECULES PPT........................
Sciences of Europe No 170 (2025)
2. Earth - The Living Planet Module 2ELS
2. Earth - The Living Planet earth and life

Exploiting disagreement through open ended tasks for capturing interpretation spaces