A Content Analysis:
            How Wikipedia Talk Pages Are Used
                      Jodi Schneider, Alexandre Passant & John G. Breslin
  Motivation                                     Content Analysis                                                                          Semantic Web
  Wikipedia’s coordination costs—the             We used 15 comment types;                                                                 Opportunities
  number of Talk page edits for each             a comment could have multiple types.                                                      We propose structured, meaningful
  article edit—have increased                    We started with Viégas’ 11 types [2]:                                                     annotations: the type of comment.
  dramatically [1]:                              1. Requests for editing coordination                                                      Comment types could enable new
                                                                                                                                           ways to browse Talk pages, using
                                                 2. Requests for information
                                                                                                                                           Semantic Web technologies. We
                                                 3. References to vandalism                                                                could instantaneously gather and
                                                 4. References to guidelines/policies                                                      show all comments of a certain type.

                                                 5. References to internal resources
                                                 6. Off-topic remarks                                                                      We have created a lightweight
                                                                                                                                           ontology, based on SIOC, where
                                                 7. Polls                                                                                  classes in the ontology correspond
                                                 8. Requests for peer review                                                               to common comment types we
                                                                                                                                           identified in the content analysis [4]:
                                                 9. Information boxes
                                                                                                                                           http://rdfs.org/sioc/wikitalk
  We are analyzing Talk pages to                 10. Images
  suggest how Semantic Web                                                                                                                 Users would tick checkboxes to
                                                 11. Other
  technologies (like structured                                                                                                            indicate a comment’s type(s).
  annotations) could improve                     We added 4 new types:
  coordination.                                  1. References to external sources
                                                                                                                                           A JavaScript plugin could then
 A typical discussion in a Wikipedia Talk page   2. Discussing reverts/removed                                                             highlight only certain comment types
                                                 material/controversial edits                                                              —for instance all “References to
                                                 3. Reference to edits made oneself                                                        external sources”. With SPARQL, we
                                                                                                                                           could show all “help requests” from a
                                                 4. Recruiting help for another article/                                                   group of pages.
                                                 portal




                                                                                                                                                                Talk page postings by type.
                                                                                                                                                                ‘Coordination’ is the most
                                                                                                                                                                common type of comment.
                                                                                                                                                                Comment types depend on
                                                                                                                                                                the page type. Discussions
                                                                                                                                                                of ‘reverts/removed
                                                                                                                                                                material/controversial
                                                                                                                                                                edits’ are three times as
                                                                                                                                                                likely on Talk pages of
                                                                                                                                                                controversial articles.
Method                                                                                                                                                          ‘Guidelines’ and ‘sources’
                                                                                                                                                                are commonly discussed.
We are examining 100 Talk pages, 20
                                                                                                                                                                Info boxes are common in
from each of these categories:
                                                                                                                                                                “most views” and
1.  Articles with the most contributors                                                                                                                         “controversial” samples.
2.  Most-viewed articles
3.  Controversial articles
4.  Featured Articles
5.  Random sample
This will help us to identify the types of
conversations and the variance between                References                                                                            Acknowledgements
pages. Existing studies focus on 1 or 2               [1] B. Stvilia, M.B. Twidale, L.C. Smith, and L. Gasser, “Information Quality Work
                                                      Organization in Wikipedia,” JASIST, vol. 59, 2008, pp. 983-1001.                      The work presented in this paper has
article types and use small sample                    [2] F.B. Viegas, M. Wattenberg, J. Kriss, and F.V. Ham, “Talk Before You Type:
                                                                                                                                            been funded by Science Foundation
sizes of 6 to 60 articles.                            Coordination in Wikipedia,” HICSS 2007, pp. 78-87.
                                                      [3] J. Schneider, A. Passant, and Breslin, John G., “A Content Analysis: How
                                                      Wikipedia Talk Pages Are Used,” WebScience 2010, Raleigh, North Carolina.
                                                                                                                                            Ireland under Grant No. SFI/08/CE/
                                                      [4] ibid, “Enhancing MediaWiki Talk pages with Semantics for Better Coordination      I1380 (Líon-2).
                                                      - A Proposal,” The Fifth Workshop on Semantic Wikis: Linking Data and People at
                                                      the 7th Extended Semantic Web Conference (ESWC), Crete, Greece: 2010.

More Related Content

PDF
PPT
Understanding and improving Wikipedia article discussion spaces SAC2011
PPT
Moodle Presentation
PDF
2010-03-10 PARC Augmented Social Cognition Research Overview
PDF
Enhancing the Social Web through Augmented Social Cognition Research
PPTX
Social Semantic Web (Social Activity and Facebook)
PDF
P2PU pm4e seminar
PPTX
Get Cookin' with Digital Curation
Understanding and improving Wikipedia article discussion spaces SAC2011
Moodle Presentation
2010-03-10 PARC Augmented Social Cognition Research Overview
Enhancing the Social Web through Augmented Social Cognition Research
Social Semantic Web (Social Activity and Facebook)
P2PU pm4e seminar
Get Cookin' with Digital Curation

Similar to A Content Analysis: How Wikipedia Talk Pages Are Used (WebSci2010 poster) (20)

PDF
CDSI Game - Tools
PDF
Social media, Web 2.0 & language teaching (Foresite, Sèvres, July 2011)
PDF
Collaborative Technologies, PLNs: New Literacies for the 21st Century Teacher
PDF
Cloudworks Overview
PDF
Community Detection in Social Media
PDF
Why the social web is here to stay (and what to do about it)
PPTX
iCurate: How to Become a Curation Rock Star
PDF
Wiki One Page Guide
PDF
Blog Comments Organizer
PPT
18 interesting ways_to_use_a_wiki_in_the_class
PDF
SparkCanada Founding Conference Report
PDF
Dicole DocReview - Product Sheet
PDF
WordLift 2.0 presented on the Semantic Web Meetup in Rome
PDF
ASC Research given at the PARC Forum on 2008-05-01
PDF
Building community inside the enterprise
PDF
33 Sites Every Journalist Should Know - Handout
PDF
Web 2.0 and e-elearning
PDF
Using Web 2.0 Technologies in Computer Science Classes
PDF
Design social interface
PPT
Moving beyond service applications to build a social ecosystem v1.1
CDSI Game - Tools
Social media, Web 2.0 & language teaching (Foresite, Sèvres, July 2011)
Collaborative Technologies, PLNs: New Literacies for the 21st Century Teacher
Cloudworks Overview
Community Detection in Social Media
Why the social web is here to stay (and what to do about it)
iCurate: How to Become a Curation Rock Star
Wiki One Page Guide
Blog Comments Organizer
18 interesting ways_to_use_a_wiki_in_the_class
SparkCanada Founding Conference Report
Dicole DocReview - Product Sheet
WordLift 2.0 presented on the Semantic Web Meetup in Rome
ASC Research given at the PARC Forum on 2008-05-01
Building community inside the enterprise
33 Sites Every Journalist Should Know - Handout
Web 2.0 and e-elearning
Using Web 2.0 Technologies in Computer Science Classes
Design social interface
Moving beyond service applications to build a social ecosystem v1.1
Ad

More from jodischneider (20)

PPTX
Continued citation of bad science and what we can do about it--2021-04-20
PPTX
Continued citation of bad science and what we can do about it--2021-02-19
PPTX
The problems of post retraction citation - and mitigation strategies that wor...
PPTX
Towards knowledge maintenance in scientific digital libraries with the keysto...
PPTX
Methods Pyramids as an Organizing Structure for Evidence-Based Medicine--SIGC...
PPTX
Annotation examples--Fribourg--2019-09-03
PPTX
Argumentation mining--an introduction for linguists--Fribourg--2019-09-02
PPTX
Beyond Randomized Clinical Trials: emerging innovations in reasoning about he...
PPTX
Problem-citations--CrossrefLive18--2018-11-13
PPTX
Problematic citations--Workshop-on-Open-Citations--2018-09-03
PPTX
Modeling Alzheimer’s Disease research claims, evidence, and arguments from a ...
PPTX
Innovations in reasoning about health: the case of the Randomized Clinical Tr...
PPTX
Viewing universities as landscapes of scholarship, VIVO keynote, 2017-08-04
PPTX
Rhetorical moves and audience considerations in the discussion sections of ra...
PPTX
Citation practices and the construction of scientific fact--ECA-facts-preconf...
PPTX
What WikiCite can learn from biomedical citation networks--Wikicite2017--2017...
PPTX
Medication safety as a use case for argumentation mining, Dagstuhl seminar 16...
PPTX
Acquiring and representing drug-drug interaction knowledge and evidence, Litm...
PPTX
Acquiring and representing drug-drug interaction knowledge and evidence, TRIA...
PPTX
Persons, documents, models: organising and structuring information for the We...
Continued citation of bad science and what we can do about it--2021-04-20
Continued citation of bad science and what we can do about it--2021-02-19
The problems of post retraction citation - and mitigation strategies that wor...
Towards knowledge maintenance in scientific digital libraries with the keysto...
Methods Pyramids as an Organizing Structure for Evidence-Based Medicine--SIGC...
Annotation examples--Fribourg--2019-09-03
Argumentation mining--an introduction for linguists--Fribourg--2019-09-02
Beyond Randomized Clinical Trials: emerging innovations in reasoning about he...
Problem-citations--CrossrefLive18--2018-11-13
Problematic citations--Workshop-on-Open-Citations--2018-09-03
Modeling Alzheimer’s Disease research claims, evidence, and arguments from a ...
Innovations in reasoning about health: the case of the Randomized Clinical Tr...
Viewing universities as landscapes of scholarship, VIVO keynote, 2017-08-04
Rhetorical moves and audience considerations in the discussion sections of ra...
Citation practices and the construction of scientific fact--ECA-facts-preconf...
What WikiCite can learn from biomedical citation networks--Wikicite2017--2017...
Medication safety as a use case for argumentation mining, Dagstuhl seminar 16...
Acquiring and representing drug-drug interaction knowledge and evidence, Litm...
Acquiring and representing drug-drug interaction knowledge and evidence, TRIA...
Persons, documents, models: organising and structuring information for the We...
Ad

Recently uploaded (20)

PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
Developing a website for English-speaking practice to English as a foreign la...
PPTX
Benefits of Physical activity for teenagers.pptx
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
observCloud-Native Containerability and monitoring.pptx
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
A novel scalable deep ensemble learning framework for big data classification...
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
August Patch Tuesday
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
DOCX
search engine optimization ppt fir known well about this
PDF
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
PDF
STKI Israel Market Study 2025 version august
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
A review of recent deep learning applications in wood surface defect identifi...
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Developing a website for English-speaking practice to English as a foreign la...
Benefits of Physical activity for teenagers.pptx
A comparative study of natural language inference in Swahili using monolingua...
observCloud-Native Containerability and monitoring.pptx
WOOl fibre morphology and structure.pdf for textiles
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
A novel scalable deep ensemble learning framework for big data classification...
Group 1 Presentation -Planning and Decision Making .pptx
1 - Historical Antecedents, Social Consideration.pdf
August Patch Tuesday
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Hindi spoken digit analysis for native and non-native speakers
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
search engine optimization ppt fir known well about this
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
STKI Israel Market Study 2025 version august
Univ-Connecticut-ChatGPT-Presentaion.pdf
O2C Customer Invoices to Receipt V15A.pptx
A review of recent deep learning applications in wood surface defect identifi...

A Content Analysis: How Wikipedia Talk Pages Are Used (WebSci2010 poster)

  • 1. A Content Analysis: How Wikipedia Talk Pages Are Used Jodi Schneider, Alexandre Passant & John G. Breslin Motivation Content Analysis Semantic Web Wikipedia’s coordination costs—the We used 15 comment types; Opportunities number of Talk page edits for each a comment could have multiple types. We propose structured, meaningful article edit—have increased We started with Viégas’ 11 types [2]: annotations: the type of comment. dramatically [1]: 1. Requests for editing coordination Comment types could enable new ways to browse Talk pages, using 2. Requests for information Semantic Web technologies. We 3. References to vandalism could instantaneously gather and 4. References to guidelines/policies show all comments of a certain type. 5. References to internal resources 6. Off-topic remarks We have created a lightweight ontology, based on SIOC, where 7. Polls classes in the ontology correspond 8. Requests for peer review to common comment types we identified in the content analysis [4]: 9. Information boxes http://rdfs.org/sioc/wikitalk We are analyzing Talk pages to 10. Images suggest how Semantic Web Users would tick checkboxes to 11. Other technologies (like structured indicate a comment’s type(s). annotations) could improve We added 4 new types: coordination. 1. References to external sources A JavaScript plugin could then A typical discussion in a Wikipedia Talk page 2. Discussing reverts/removed highlight only certain comment types material/controversial edits —for instance all “References to 3. Reference to edits made oneself external sources”. With SPARQL, we could show all “help requests” from a 4. Recruiting help for another article/ group of pages. portal Talk page postings by type. ‘Coordination’ is the most common type of comment. Comment types depend on the page type. Discussions of ‘reverts/removed material/controversial edits’ are three times as likely on Talk pages of controversial articles. Method ‘Guidelines’ and ‘sources’ are commonly discussed. We are examining 100 Talk pages, 20 Info boxes are common in from each of these categories: “most views” and 1.  Articles with the most contributors “controversial” samples. 2.  Most-viewed articles 3.  Controversial articles 4.  Featured Articles 5.  Random sample This will help us to identify the types of conversations and the variance between References Acknowledgements pages. Existing studies focus on 1 or 2 [1] B. Stvilia, M.B. Twidale, L.C. Smith, and L. Gasser, “Information Quality Work Organization in Wikipedia,” JASIST, vol. 59, 2008, pp. 983-1001. The work presented in this paper has article types and use small sample [2] F.B. Viegas, M. Wattenberg, J. Kriss, and F.V. Ham, “Talk Before You Type: been funded by Science Foundation sizes of 6 to 60 articles. Coordination in Wikipedia,” HICSS 2007, pp. 78-87. [3] J. Schneider, A. Passant, and Breslin, John G., “A Content Analysis: How Wikipedia Talk Pages Are Used,” WebScience 2010, Raleigh, North Carolina. Ireland under Grant No. SFI/08/CE/ [4] ibid, “Enhancing MediaWiki Talk pages with Semantics for Better Coordination I1380 (Líon-2). - A Proposal,” The Fifth Workshop on Semantic Wikis: Linking Data and People at the 7th Extended Semantic Web Conference (ESWC), Crete, Greece: 2010.