‘How <del>not</del> to run a
crowdsourcing project: lessons from
Transcribe Bentham’
Dr Valerie Wallace, History Programme, VUW
(valerie.wallace@vuw.ac.nz)
The Bentham Project
• Established in 1959
• Produces The Collected Works of Jeremy
Bentham (1748-1832), the influential jurist,
reformer, and philosopher.
• First two volumes published in 1968 and to
date, 28 of a proposed 70 have been
published, including 12 of the proposed 14
vols of Correspondence.
Challenges facing the Bentham Project:
How to speed up editorial production and
create a searchable, accessible digital
resource?
Answer: Crowdsource transcription
In 2010, 40,000 of 72,500
manuscripts were
untranscribed.
The Transcription Desk
http://www.transcribe-bentham.da.ulcc.ac.uk/td/Transcribe_Bentham and
http://www.ucl.ac.uk/transcribe-bentham
Lessons from Transcribe Bentham
Process of checking volunteer transcripts
[Diagram: Transcribe Bentham workflow. Courtesy of Martin Moyle, UCL Library Services.]
Labels in the diagram: Legacy transcripts; Retro-conversion to TEI; Quality assurance; Manuscripts; Project website; Project editors; Digital repository; Images; Metadata; TEI transcripts; Training materials; Registration; Discussion forum; Transcription tool; Ideas bank; Blog; Web pages; Folio catalogue; Transcription wiki; TEI transcripts; Collected Works.
Volunteer transcript stored in UCL’s Bentham Papers digital
repository (www.ucl.ac.uk/library/bentham)
Volunteer motivations
Some results from Transcribe Bentham
• 2410 registered accounts
• 362 active transcribers
• 15 super transcribers
• 5205 transcribed manuscripts (c.2.6 million
words)
(As of 8 March 2013)
Manuscripts worked on by volunteers, 8 September 2010 to 8 March 2013

Number of manuscripts worked on    Number of volunteers (percentage)
0                                  2048 (84.9)
1                                  225 (9.3)
2                                  66 (2.7)
3                                  24 (0.9)
4                                  6 (0.2)
5 to 20                            25 (1)
21 to 50                           3 (0.1)
51 to 100                          5 (0.2)
101 to 200                         3 (0.1)
201 to 500                         2 (<0.1)
501 to 999                         1 (<0.1)
1000+                              2 (<0.1)
Total                              2410 (100)
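The participation figures can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the project's own tooling: the counts are copied from the table on this slide, while the names `DISTRIBUTION` and `summarize` are hypothetical and introduced just for this example. It assumes that an "active transcriber" means a registered account that worked on at least one manuscript.

```python
# Counts copied from the slide's table: (manuscripts worked on, number of volunteers).
# The variable and function names are hypothetical, chosen for this illustration.
DISTRIBUTION = [
    ("0", 2048),
    ("1", 225),
    ("2", 66),
    ("3", 24),
    ("4", 6),
    ("5 to 20", 25),
    ("21 to 50", 3),
    ("51 to 100", 5),
    ("101 to 200", 3),
    ("201 to 500", 2),
    ("501 to 999", 1),
    ("1000+", 2),
]


def summarize(distribution):
    """Return total accounts, accounts with at least one manuscript, and their share in percent."""
    total = sum(count for _, count in distribution)
    inactive = sum(count for label, count in distribution if label == "0")
    active = total - inactive
    return total, active, 100 * active / total


total, active, active_pct = summarize(DISTRIBUTION)
print(f"registered accounts: {total}")
print(f"worked on at least one manuscript: {active} ({active_pct:.1f}%)")
```

Under that assumption the rows sum to the 2410 registered accounts, and the accounts outside the "0" row match the 362 active transcribers reported on the results slide.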
Some tips on project management:
• Don’t get lost in translation. The team must
communicate effectively.
• Don’t underestimate the time it takes to manage
volunteers.
• Simplify the task as much as possible.
• Funding runs out fast. Think ahead.
• Tools are quickly rendered obsolete. Be ready to
adapt.
• Secure the right publicity!
• Code for Transcribe Bentham MediaWiki plugins:
http://code.google.com/p/tb-transcription-desk/,
last accessed 15 June 2012.
Public Records Office of Victoria
For more information see:
• Tim Causer, Justin Tonra, and Valerie Wallace, ‘Transcription
Maximized; Expense Minimized? Crowdsourcing and Editing The
Collected Works of Jeremy Bentham’, Literary and Linguistic
Computing, 27/2 (2012)
• Tim Causer and Valerie Wallace, ‘Building a Volunteer Community:
Results and Findings from Transcribe Bentham’, Digital Humanities Quarterly, 6/2 (2012),
http://www.digitalhumanities.org/dhq/vol/6/2/000125/000125.html
