SlideShare a Scribd company logo
Digital Enterprise Research Institute

www.deri.ie

A CAPABILITY REQUIREMENTS APPROACH FOR
PREDICTING WORKER PERFORMANCE IN
CROWDSOURCING
Umair ul Hassan, Edward Curry
Digital Enterprise Research Institute
National University of Ireland, Galway

9th IEEE International Conference on Collaborative Computing:
Networking, Applications and Worksharing
Austin, Texas, United States
October 20–23, 2013
Copyright 2010 Digital Enterprise Research Institute. All rights reserved.
Agenda
Digital Enterprise Research Institute





Motivation
Background
Task Modelling





Capability Requirements





www.deri.ie

Capabilities Taxonomy

Capability Tracing
Experiment
Summary

2
Motivation: Heterogeneity
Digital Enterprise Research Institute

www.deri.ie

3
Motivation: Task Routing
Digital Enterprise Research Institute



www.deri.ie

Assigning heterogeneous tasks to heterogeneous workers

TASK MODELLING
Models
Models
Models

TASK ROUTING

WORKER PROFILING

Matching

Profiles
Profiles
Profiles

Task↔Worker

4
Proposal: Performance Prediction
Digital Enterprise Research Institute



www.deri.ie

Predict performance of workers on new tasks based on the
capabilities required for tasks and assign tasks accordingly
TASK MODELLING
Models
Models
Models
Capability
Requirements
Approach

TASK ROUTING

WORKER PROFILING

Matching

Profiles
Profiles
Profiles

Task↔Worker
Performance
Prediction

Capability Tracing
Model

5
Background: Micro tasks
Digital Enterprise Research Institute



www.deri.ie

When micro tasks are crowd sourced



Single person cannot do the task





Computers cannot do the task
Work can be split into smaller tasks

Some online microtask platforms

6
Background: Micro tasks
Digital Enterprise Research Institute



www.deri.ie

Most common tasks in Amazon Mechanical Turk (AMT)
and CrowdFlower (CFL)

7
Background: Micro tasks
Digital Enterprise Research Institute



www.deri.ie

Example of information extraction task in AMT

8
Background: Micro tasks
Digital Enterprise Research Institute



www.deri.ie

Example of video transcription task in AMT

9
Task Modelling
Digital Enterprise Research Institute

www.deri.ie



Appropriate models are needed to compare and contrast
micro tasks.



Capability Requirements approach


Capability is defined as the ability of humans to do things in
terms of both the capacity and the opportunity.



Four types of capabilities
–
–
–
–

Knowledge,
Skill,
Ability,
Other characteristics (e.g. motivation, price, etc)

10
Capability Requirements
Digital Enterprise Research Institute



www.deri.ie

Taxonomies have be used to study human task
performance, e.g.



Bloom’s taxonomy of classification of learning objectives





Fleishman’s taxonomy of human abilities
O*NET-SOC taxonomy of occupational classification

We are interested in taxonomy that


Describes tasks in terms of human capabilities



Helps in comparing tasks in terms of differences and similarities
of capabilities

11
Capabilities Taxonomy
Digital Enterprise Research Institute




www.deri.ie

Based on Fleishman’s abilities taxonomy
Selected abilities relevant to micro tasks


Comprehension (C): The ability to understand the meaning or importance of
something



Bilingualism (B): The ability to speak and understand two languages



Writing (W): The ability or capacity to write text in a given language



Comparison (M): The ability or capacity to compare things based on some
criteria



Judgment (J): The act or process of judging; the formation of an opinion after
consideration



Perception (P): The ability or capacity to perceive items visually or phonetically



Identification (I): The process of recognizing something



Reasoning (R): The ability to draw conclusions from
facts, evidence, relationships, etc.

12
Requirements of Micro Tasks
Digital Enterprise Research Institute

www.deri.ie

13
Capability Tracing
Digital Enterprise Research Institute




www.deri.ie

How to model worker’s capabilities?
Capability tracing





Inspired by Knowledge Tracing*
Estimates probability of a worker knowing a capability given
worker’s responses to test tasks

Worker Profile constrains


Set of binary variables representing capabilities



Probability estimates of each variable being in a state

* A. T. Corbett and J. R. Anderson, “Knowledge tracing: Modeling the acquisition of procedural knowledge,” User Modeling and User-Adapted
Interaction, vol. 4, no. 4, pp. 253–278, 1994.

14
Capability Tracing
Digital Enterprise Research Institute



www.deri.ie

Probabilistic network of a capability and four parameters of
capability tracing model

States of
Capability
Variable

Not
Learned

p(T): Probability of
transition between states

p(T)
Learned

p(L)

p(G)
Values of
Response
Variable

p(L): Probability of a
worker learning to
employ the capability

p(S)
p(G): Probability of
guess

Correct

Incorrect

15

p(S): Probability of slip
Experiment
Digital Enterprise Research Institute



www.deri.ie

Objective





Solicit capability requirements of tasks from crowds
Evaluation of capability tracing for performance prediction

Three types of micro tasks with manually created ground
truth data



Image comparison





Fact verification
Information Extraction

37 crowd workers including


University students



Workers from Shorttask.com

16
Crowdsourcing
Digital Enterprise Research Institute




www.deri.ie

Custom web application for gathering data
Example of fact verification task

17
Capability Requirements of Tasks
Digital Enterprise Research Institute



www.deri.ie

Objective 1: Solicit capability requirements of tasks
from crowds

(a) fact verification

(b) image comparison

18

(c) information extraction
Crowd Performance
Digital Enterprise Research Institute



www.deri.ie

How the crowd performed on each type of task?

Fact Verification task
• 37 workers
• Best workers perform with
both precision and recall
above 0.8
• More variation in recall means
some workers were could not
spot the incorrect facts
• Ideally tasks should be
assigned to workers that lie in
the top-right quadrant of the
plot

19
Crowd Performance
Digital Enterprise Research Institute



www.deri.ie

Image Comparison (20 workers) and Information Extraction (17 workers)

20
Performance Prediction
Digital Enterprise Research Institute




www.deri.ie

Objective 2: Evaluation of capability tracing for
performance prediction
Two phases


Build model with observation tasks



Predict performance on new tasks

AC: Consider previous
Accuracy as prediction
of future performance

CT: Capability Tracing

21
Summary
Digital Enterprise Research Institute




Capabilities taxonomy is first steps towards modelling of
micro tasks based on human factors
Capability tracing is effective in predicting future
performance





www.deri.ie

Even across tasks if there are similar capabilities

Predicted performance can be used to make right task
routing decisions
Future Work


Evaluate on more types of tasks



Evaluate capabilities such as domain knowledge and skills



Define standard tests for measuring worker capabilities

22
Further Reading
Digital Enterprise Research Institute

www.deri.ie

9th IEEE International Conference on Collaborative Computing:
Networking, Applications and Worksharing
Austin, Texas, United States
October 20–23, 2013

U. Ul Hassan and E. Curry, “A Capability Requirements Approach for Predicting
Worker Performance in Crowdsourcing,” in 9th IEEE International Conference on
Collaborative Computing: Networking, Applications and Worksharing, 2013.
http://deri.ie/users/umair-ul-hassan

23
Capability Tracing
Digital Enterprise Research Institute

www.deri.ie



Conditional probability of worker learning to employ
capability p(Ln|On) is calculated



When evidence On is positive



When evidence On is negative

24
Capability Tracing
Digital Enterprise Research Institute

www.deri.ie



Probability of worker learning to employ capability



Performance of worker on next task

25

More Related Content

PDF
Constructing Knowledge Graph for Social Networks in a Deep and Holistic Way
PDF
Novel character segmentation reconstruction approach for license plate recogn...
PDF
Ai in project management Karen Blay
PDF
Pankaj rajanresume2014
PDF
Do Personality Profiles Differ in the Pakistani Software Industry and Academi...
DOC
Kislaya resume latest
PDF
Face and facial expressions recognition for blind people
PPTX
Ai open powermeetupmarch25th_latest
Constructing Knowledge Graph for Social Networks in a Deep and Holistic Way
Novel character segmentation reconstruction approach for license plate recogn...
Ai in project management Karen Blay
Pankaj rajanresume2014
Do Personality Profiles Differ in the Pakistani Software Industry and Academi...
Kislaya resume latest
Face and facial expressions recognition for blind people
Ai open powermeetupmarch25th_latest

Similar to A Capability Requirements Approach for Predicting Worker Performance in Crowdsourcing (20)

PPTX
Effects of Expertise Assessment on the Quality of Task Routing in Human Compu...
PPTX
Web Service Capability Meta Model
PDF
Agile Network India | Collaborative Intelligence – (Human + AI) and Ethical C...
PPTX
COBI 2014 - An Empirical Evaluation of Capability Modelling using Design Rati...
PDF
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
PDF
How do you fast track Agentic automation use cases discovery?
PDF
Organizing Capabilities using Formal Concept Analysis
PPTX
Improving AI Development - Dave Litwiller - Jan 11 2022 - Public
PDF
Workera Introduction
PPTX
Crowdsourcing and Learning from Crowd Data (Tutorial @ PSB2015)
PDF
Business Capability-centric Management of Services and Business Process Models
PPT
Project management
PPTX
Paper sharing_The interplay of digital transformation and employee competency
PPT
Negotiated Studies - A semantic social network based expert recommender system
PDF
Collaborative Data Management: How Crowdsourcing Can Help To Manage Data
PDF
AWS Summit London 2024 - Cognizant Partner Spotlight - Cognitive Architecture...
PPTX
The Need for Explainable AI - Dorothea Wisemann
PDF
AIM102-S_Cognizant_CognizantCognitive
PPTX
019 Arcurve Skills and Staffing Recommender - NODES2022 AMERICAS Advanced 5 -...
PDF
Eszter Debreczeni: The Future of Work and the Augmented Enterprise: How to pr...
Effects of Expertise Assessment on the Quality of Task Routing in Human Compu...
Web Service Capability Meta Model
Agile Network India | Collaborative Intelligence – (Human + AI) and Ethical C...
COBI 2014 - An Empirical Evaluation of Capability Modelling using Design Rati...
One does not simply crowdsource the Semantic Web: 10 years with people, URIs,...
How do you fast track Agentic automation use cases discovery?
Organizing Capabilities using Formal Concept Analysis
Improving AI Development - Dave Litwiller - Jan 11 2022 - Public
Workera Introduction
Crowdsourcing and Learning from Crowd Data (Tutorial @ PSB2015)
Business Capability-centric Management of Services and Business Process Models
Project management
Paper sharing_The interplay of digital transformation and employee competency
Negotiated Studies - A semantic social network based expert recommender system
Collaborative Data Management: How Crowdsourcing Can Help To Manage Data
AWS Summit London 2024 - Cognizant Partner Spotlight - Cognitive Architecture...
The Need for Explainable AI - Dorothea Wisemann
AIM102-S_Cognizant_CognizantCognitive
019 Arcurve Skills and Staffing Recommender - NODES2022 AMERICAS Advanced 5 -...
Eszter Debreczeni: The Future of Work and the Augmented Enterprise: How to pr...
Ad

More from Umair ul Hassan (7)

PPTX
Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality Assessment
PPTX
A Multi-armed Bandit Approach to Online Spatial Task Assignment
PPTX
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
PPTX
A Collaborative Approach for Metadata Management for Internet of Things
PPTX
Researh toolbox - Data analysis with python
PPTX
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...
PPTX
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
Leveraging DBpedia for Adaptive Crowdsourcing in Linked Data Quality Assessment
A Multi-armed Bandit Approach to Online Spatial Task Assignment
SLUA: Towards Semantic Linking of Users with Actions in Crowdsourcing
A Collaborative Approach for Metadata Management for Internet of Things
Researh toolbox - Data analysis with python
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...
Leveraging Matching Dependencies for Guided User Feedback in Linked Data Appl...
Ad

Recently uploaded (20)

PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPT
Quality review (1)_presentation of this 21
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
annual-report-2024-2025 original latest.
PDF
Foundation of Data Science unit number two notes
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PDF
Fluorescence-microscope_Botany_detailed content
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
1_Introduction to advance data techniques.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Quality review (1)_presentation of this 21
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Supervised vs unsupervised machine learning algorithms
Introduction-to-Cloud-ComputingFinal.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
annual-report-2024-2025 original latest.
Foundation of Data Science unit number two notes
.pdf is not working space design for the following data for the following dat...
Data_Analytics_and_PowerBI_Presentation.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
ISS -ESG Data flows What is ESG and HowHow
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Fluorescence-microscope_Botany_detailed content
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
climate analysis of Dhaka ,Banglades.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
1_Introduction to advance data techniques.pptx

A Capability Requirements Approach for Predicting Worker Performance in Crowdsourcing

  • 1. Digital Enterprise Research Institute www.deri.ie A CAPABILITY REQUIREMENTS APPROACH FOR PREDICTING WORKER PERFORMANCE IN CROWDSOURCING Umair ul Hassan, Edward Curry Digital Enterprise Research Institute National University of Ireland, Galway 9th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing Austin, Texas, United States October 20–23, 2013 Copyright 2010 Digital Enterprise Research Institute. All rights reserved.
  • 2. Agenda Digital Enterprise Research Institute    Motivation Background Task Modelling    Capability Requirements   www.deri.ie Capabilities Taxonomy Capability Tracing Experiment Summary 2
  • 3. Motivation: Heterogeneity Digital Enterprise Research Institute www.deri.ie 3
  • 4. Motivation: Task Routing Digital Enterprise Research Institute  www.deri.ie Assigning heterogeneous tasks to heterogeneous workers TASK MODELLING Models Models Models TASK ROUTING WORKER PROFILING Matching Profiles Profiles Profiles Task↔Worker 4
  • 5. Proposal: Performance Prediction Digital Enterprise Research Institute  www.deri.ie Predict performance of workers on new tasks based on the capabilities required for tasks and assign tasks accordingly TASK MODELLING Models Models Models Capability Requirements Approach TASK ROUTING WORKER PROFILING Matching Profiles Profiles Profiles Task↔Worker Performance Prediction Capability Tracing Model 5
  • 6. Background: Micro tasks Digital Enterprise Research Institute  www.deri.ie When micro tasks are crowd sourced   Single person cannot do the task   Computers cannot do the task Work can be split into smaller tasks Some online microtask platforms 6
  • 7. Background: Micro tasks Digital Enterprise Research Institute  www.deri.ie Most common tasks in Amazon Mechanical Turk (AMT) and CrowdFlower (CFL) 7
  • 8. Background: Micro tasks Digital Enterprise Research Institute  www.deri.ie Example of information extraction task in AMT 8
  • 9. Background: Micro tasks Digital Enterprise Research Institute  www.deri.ie Example of video transcription task in AMT 9
  • 10. Task Modelling Digital Enterprise Research Institute www.deri.ie  Appropriate models are needed to compare and contrast micro tasks.  Capability Requirements approach  Capability is defined as the ability of humans to do things in terms of both the capacity and the opportunity.  Four types of capabilities – – – – Knowledge, Skill, Ability, Other characteristics (e.g. motivation, price, etc) 10
  • 11. Capability Requirements Digital Enterprise Research Institute  www.deri.ie Taxonomies have be used to study human task performance, e.g.   Bloom’s taxonomy of classification of learning objectives   Fleishman’s taxonomy of human abilities O*NET-SOC taxonomy of occupational classification We are interested in taxonomy that  Describes tasks in terms of human capabilities  Helps in comparing tasks in terms of differences and similarities of capabilities 11
  • 12. Capabilities Taxonomy Digital Enterprise Research Institute   www.deri.ie Based on Fleishman’s abilities taxonomy Selected abilities relevant to micro tasks  Comprehension (C): The ability to understand the meaning or importance of something  Bilingualism (B): The ability to speak and understand two languages  Writing (W): The ability or capacity to write text in a given language  Comparison (M): The ability or capacity to compare things based on some criteria  Judgment (J): The act or process of judging; the formation of an opinion after consideration  Perception (P): The ability or capacity to perceive items visually or phonetically  Identification (I): The process of recognizing something  Reasoning (R): The ability to draw conclusions from facts, evidence, relationships, etc. 12
  • 13. Requirements of Micro Tasks Digital Enterprise Research Institute www.deri.ie 13
  • 14. Capability Tracing Digital Enterprise Research Institute   www.deri.ie How to model worker’s capabilities? Capability tracing    Inspired by Knowledge Tracing* Estimates probability of a worker knowing a capability given worker’s responses to test tasks Worker Profile constrains  Set of binary variables representing capabilities  Probability estimates of each variable being in a state * A. T. Corbett and J. R. Anderson, “Knowledge tracing: Modeling the acquisition of procedural knowledge,” User Modeling and User-Adapted Interaction, vol. 4, no. 4, pp. 253–278, 1994. 14
  • 15. Capability Tracing Digital Enterprise Research Institute  www.deri.ie Probabilistic network of a capability and four parameters of capability tracing model States of Capability Variable Not Learned p(T): Probability of transition between states p(T) Learned p(L) p(G) Values of Response Variable p(L): Probability of a worker learning to employ the capability p(S) p(G): Probability of guess Correct Incorrect 15 p(S): Probability of slip
  • 16. Experiment Digital Enterprise Research Institute  www.deri.ie Objective    Solicit capability requirements of tasks from crowds Evaluation of capability tracing for performance prediction Three types of micro tasks with manually created ground truth data   Image comparison   Fact verification Information Extraction 37 crowd workers including  University students  Workers from Shorttask.com 16
  • 17. Crowdsourcing Digital Enterprise Research Institute   www.deri.ie Custom web application for gathering data Example of fact verification task 17
  • 18. Capability Requirements of Tasks Digital Enterprise Research Institute  www.deri.ie Objective 1: Solicit capability requirements of tasks from crowds (a) fact verification (b) image comparison 18 (c) information extraction
  • 19. Crowd Performance Digital Enterprise Research Institute  www.deri.ie How the crowd performed on each type of task? Fact Verification task • 37 workers • Best workers perform with both precision and recall above 0.8 • More variation in recall means some workers were could not spot the incorrect facts • Ideally tasks should be assigned to workers that lie in the top-right quadrant of the plot 19
  • 20. Crowd Performance Digital Enterprise Research Institute  www.deri.ie Image Comparison (20 workers) and Information Extraction (17 workers) 20
  • 21. Performance Prediction Digital Enterprise Research Institute   www.deri.ie Objective 2: Evaluation of capability tracing for performance prediction Two phases  Build model with observation tasks  Predict performance on new tasks AC: Consider previous Accuracy as prediction of future performance CT: Capability Tracing 21
  • 22. Summary Digital Enterprise Research Institute   Capabilities taxonomy is first steps towards modelling of micro tasks based on human factors Capability tracing is effective in predicting future performance    www.deri.ie Even across tasks if there are similar capabilities Predicted performance can be used to make right task routing decisions Future Work  Evaluate on more types of tasks  Evaluate capabilities such as domain knowledge and skills  Define standard tests for measuring worker capabilities 22
  • 23. Further Reading Digital Enterprise Research Institute www.deri.ie 9th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing Austin, Texas, United States October 20–23, 2013 U. Ul Hassan and E. Curry, “A Capability Requirements Approach for Predicting Worker Performance in Crowdsourcing,” in 9th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing, 2013. http://deri.ie/users/umair-ul-hassan 23
  • 24. Capability Tracing Digital Enterprise Research Institute www.deri.ie  Conditional probability of worker learning to employ capability p(Ln|On) is calculated  When evidence On is positive  When evidence On is negative 24
  • 25. Capability Tracing Digital Enterprise Research Institute www.deri.ie  Probability of worker learning to employ capability  Performance of worker on next task 25

Editor's Notes

  • #7: AMT is amazonmechincalturk
  • #8: AMT is amazonmechincalturk
  • #9: AMT is amazonmechincalturk
  • #10: AMT is amazon mechanical turk
  • #15: Probability estimates are generated while learning the model through capability tracing
  • #19: As can be seen, more than majority of workers believed that identification capability is essential for all three types of tasks. Majority of workers also agreed that the judgment and comprehension capabilities are important for the Fact Verification task. In the case of the Image Comparison task most workers specified that comparison and perception are important as well. There is general consensus between workers that comprehension, judgment and reasoning capabilities are also useful for Information Extraction tasks. We selected top-3 capabilities for each type of task for building capability tracing models.
  • #20: Interestingly, no worker achieved the highest recall for the information extraction task, which highlights the difference between workers and ground truth in terms of the entities extracted from the Wikipedia articles. Nevertheless, these distributions emphasize that in order to achieve high accuracy tasks should be assigned to workers that lie in the top-right quadrant of the plots.
  • #22: Results show that the capability tracing approach is comparable to the baseline approach in general and achieves better accuracy of prediction between similar tasks. The Fact Verification and Information Extraction tasks have similar capabilities requirements, therefore capability tracingcan better predict the performance of workers between them. The drop in prediction quality of capability tracing for Image Comparison task can be attributed to the little variation is the performance of workers on this task.
  • #25: Hide these
  • #26: Hide these