Copyright 2011 Digital Enterprise Research Institute. All rights reserved.
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Arguments about deleting Wikipedia content
Jodi Schneider
jschneider@pobox.com
Friday 19th April 2013
Télécom ParisTech
Is Wikipedia Sustainable?
Deletion threatens Wikipedia
• 1 in 4 new Wikipedia articles is deleted –
within minutes or hours
• Demotivating!
– 1 in 3 newcomers start by writing a new article
– 7X less likely to stay if their article is deleted!
• Can we support editor retention?
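As a rough check on the "7X" figure, the retention rates quoted from the Wikipedia Signpost in the editor's notes (0.6 percent of newcomers whose article was deleted kept editing, against 4.4 percent of those whose article remained) can be compared directly. This is only an arithmetic sketch; the variable names are illustrative and not taken from the study.

```python
# Retention rates as quoted in the editor's notes (Wikipedia Signpost, 2011-04-04).
retained_if_deleted = 0.6 / 100  # share still editing after their article was deleted
retained_if_kept = 4.4 / 100     # share still editing when their article remained

# Ratio of the two rates: how many times more likely a newcomer is to stay
# when the article is kept rather than deleted.
ratio = retained_if_kept / retained_if_deleted
print(f"{ratio:.1f} times as likely to stay when the article is kept")
```

The ratio comes out a little above 7, which matches the rounded "7X" on the slide.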
Ph.D. case study: argumentative dialogues
about deleting Wikipedia articles
• Goals:
  – Understand collaboration & coordination
  – Identify “pain points” & new IT support opportunities
• Approaches:
  – Net-ethnography
    • Interviews of community members
    • Embedded participation
    • Reading essays, policies, & written dialogues
    • Analysing article history, user contributions
  – Content analysis
    • Departure point: grounded theory or existing categories. With multiple annotators, iteratively refined annotation manual to achieve strong interannotator agreement.
    • Decision factors (WikiSym 2012)
    • Walton’s argumentation schemes (CSCW 2013)
  – Prototyping & iterative design
    • Design (WikiSym 2012 demo)
    • User study (reported in dissertation)
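The slides do not say which statistic was used to measure interannotator agreement. As one common option, the sketch below computes Cohen's kappa for two annotators who label the same comments. The function name and the six example labels are hypothetical and are not data from the study; with more than two annotators a multi-rater statistic such as Fleiss' kappa would be the usual choice.

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equal-length, non-empty label lists")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: derived from each annotator's own label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (observed - expected) / (1 - expected)

# Hypothetical annotations of six comments (not data from the study).
annotator_1 = ["Notability", "Sources", "Sources", "Bias", "Other", "Notability"]
annotator_2 = ["Notability", "Sources", "Maintenance", "Bias", "Other", "Notability"]
kappa = cohens_kappa(annotator_1, annotator_2)
print(f"kappa = {kappa:.2f}")
```

In an iterative refinement loop of the kind described above, a statistic like this would be recomputed after each revision of the annotation manual until agreement is judged strong enough.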
Corpus
• Article deletion dialogues
from English Wikipedia
started on a typical-volume day
• 72 dialogues (94 A4 pages)
Findings: pain points of
article deletion
• Article creators
• Novices visiting or newly joining Wikipedia
• No-consensus dialogues
Article creators
• Misunderstand policy
– “I do understand that articles on wikipedia need to be
sourced… it is due to have two [sources] once [our
website goes] live”
• Express high levels of emotion
– “To be honest it's been a real turn off adding articles
to WP and I don't think I will add articles again. So
smile and enjoy.”
• Learn from discussions
– “much as it would break my heart … it is perhaps
sensible that the piece is deleted.”
Net-ethnography
in 8th International Symposium on Wikis and Open Collaboration
(WikiSym 2012)
Novices’ arguments
• Structurally different to experts’ arguments
• More problematic arguments from novices
– Personal preference
– Requesting a favor
– Analogy to other cases
– No harm in keeping an article
– Large number of search engine hits
Argumentation schemes content analysis
in 16th ACM Conference on Computer-Supported
Cooperative Work and Social Computing (CSCW 2013)
No consensus discussions
“What works well is simply the community
agreeing on a verdict.”
Otherwise:
• Time-consuming & difficult to judge a case
• Same case may get raised repeatedly
• Emotional upset is more likely
– “messy”, “full of hate and pain” when overturned
Net-ethnography & interviews
in 8th International Symposium on Wikis and
Open Collaboration (WikiSym 2012)
Articulate criteria
Decision factors content analysis
in 8th International Symposium on Wikis and Open Collaboration
(WikiSym 2012)
4 factors cover
– 91% of comments
– 70% of discussions
Factor: Example (used to justify 'keep')
Notability: Anyone covered by another encyclopedic reference is considered notable enough for inclusion in Wikipedia.
Sources: Basic information about this album at a minimum is certainly verifiable, it's a major label release, and a highly notable band.
Maintenance: …this article is savable but at its current state, needs a lot of improvement.
Bias: It is by no means spam (it does not promote the products).
Other: I'm advocating a blanket "hangon" for all articles on newly-drafted players
Use criteria to augment interface
Prototype design (RDFa; custom ontology based on FOAF, SIOC)
in WikiSym 2012 Demos
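The prototype itself was built with RDFa and a custom ontology, which this deck does not show. The sketch below does not reproduce that design; it only illustrates, in plain Python, the underlying idea of grouping comments by decision factor so that a reader can see how many comments address each one. The data structure and the function name are assumptions made for illustration, and the comment texts are the 'keep' examples from the decision factors slide.

```python
from collections import defaultdict

# (factor, comment) pairs; the texts are the examples quoted in the factor table.
comments = [
    ("Notability", "Anyone covered by another encyclopedic reference is "
                   "considered notable enough for inclusion in Wikipedia."),
    ("Sources", "Basic information about this album at a minimum is certainly "
                "verifiable, it's a major label release, and a highly notable band."),
    ("Maintenance", "…this article is savable but at its current state, "
                    "needs a lot of improvement."),
    ("Bias", "It is by no means spam (it does not promote the products)."),
]

def group_by_factor(tagged_comments):
    """Return a dict mapping each factor to the list of comments tagged with it."""
    grouped = defaultdict(list)
    for factor, text in tagged_comments:
        grouped[factor].append(text)
    return dict(grouped)

grouped = group_by_factor(comments)
for factor, texts in grouped.items():
    # A per-factor count gives the quick overview that study participants mention.
    print(f"{factor} ({len(texts)} comment(s))")
    for text in texts:
        print(f"  - {text}")
```

Keeping the grouping as a plain dictionary makes it easy to render each factor as its own section of an interface, with the comment count shown alongside the factor name.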
84% prefer our system
“Information is structured and I can quickly get an
overview of the key arguments.”
“The ability to navigate the comments made it a bit easier
to filter my mind set and to come to a conclusion.”
“It offers the structure needed to consider each factor
separately, thus making the decision easier. Also, the
number of comments per factor offers a quick indication
of the relevance and the deepness of the decision.”
Based on a formative evaluation user study with 20 novice users
in dissertation “Enabling reuse of arguments and opinions from online social disputes”


Editor's Notes

  • #3: Wikipedia editors are leaving faster than they can be replaced. 1 in 3 editors begin by creating a new article. They are 7 times as likely to stay if their article is kept. (Felipe Ortega via http://www.businessinsider.com/chart-of-the-day-wikipedia-editors-2009-11)
  • #4: “only 0.6 percent of those whose articles are met with deletion stayed editing, compared to 4.4 percent of the users whose articles remained”, http://enwp.org/Wikipedia:Wikipedia_Signpost/2011-04-04/Editor_retention
  • #5: Interviews via various means (Skype, IRC, in person). The story: understand the problem (analysis / survey), solve it (define method / tools + analysed criteria), evaluate it (prototype).
  • #7: By typical we mean average volume: there are consistently ~500 discussions per week about deleting borderline articles, see our WikiSym paper.
  • #9: Mentoring in discussions is effective. Article creators who receive mentoring seem to: make more edits to the article; continue editing; increase their understanding of policy.
  • #10: Experts argue from precedent. Novices: values, analogy, cause to effect. Jodi Schneider, Krystian Samp, Alexandre Passant, and Stefan Decker. Arguments about Deletion: How Experience Improves the Acceptability of Arguments in Ad-hoc Online Task Groups. Computer-Supported Cooperative Work and Social Computing (CSCW 2013).
  • #12: 3 student annotators (besides me). Iterative refinement of annotation manual. Good interannotator agreement.
  • #14: 20 novice participants used both systems. “The ability to navigate the comments made it a bit easier to filter my mind set and to come to a conclusion.” “summarise and, at the same time, evaluate which factor should be considered determinant for the final decision”