SlideShare a Scribd company logo
Collective Cognition with Semantic
Mediawiki: Lessons and Experiences

               Jie Bao, Li Ding and James Hendler

                    Tetherless World Constellation,
                   Department of Computer Science
                   Rensselaer Polytechnic Institute,
                             Troy, NY, USA
                   {baojie,dingl,hendler}@cs.rpi.edu

    Network-Enabled Cognition workshop, ACITA, 2009 Sep 22, 2009, UMUC, Maryland
Goal

• To identify a few common pitfalls and
  limitations of Semantic Mediawiki in
  – knowledge modeling,
  – Knowledge organization and context, and
  – Collaboration protocols
• To examine some potential approaches to
  solve these problems.


                                              2
Wiki and Recognition

• Wiki is among the most prominent of forms on
  the Web that harness the distributed, collective
  efforts of users to create content online.
   – Ideas are formed, indentified or evolved and
   – Facts are discovered, refined or removed (for
     “wrong” ones)
   In the course of a never-ending editing process.




                                                      3
Collective Cognition

• A hypothetical example: write an outline
  for a new South Park episode.




                            Picture courtesy of Wikipedia



                                                            4
Collective Cognition
•   9:00pm: show starts
•   9:01pm: Wikipedia page for the episode is created
•   9:01-9:08pm: 10 active users are adding summary when they watch
•   9:08-9:10pm: ad time, some minor reorganization and typo fixing.
•   9:10-9:30pm: repeat the above
•   9:30-10:30pm: a user A is doing major refining, e.g. adding culture
    references
•   10:31pm: a user B disagrees with A, undo A’s edit
•   10:32pm: A undos B’s edit
•   10:33pm: B leaves a message on A’s user page, to avoid further edit wars
•   10:43pm: the two agrees to compromise with wording such that “It is implied
    that Chef is dead at the end of the episode; however, some others believe
    that it is not the case”.
•   The next day: user C adds a citation to this South Park episode’s page on a
    film’s page with that “South Park episode X is a parody of the this film”.

                                                                                  5
Key elements

• Simplicity: least training required to
  contribute.

• AAA: anybody can say anything anywhere

• NPOV: neutral point of view (among other
  collaboration protocols of Wikipedia)



                                             6
Semantic Wiki

• Extensions to Wikis with some Semantic
  Web support
  – Example: Semantic Mediawiki

   Eric Cartman
       [[friend of::Butters]]                  Butters
       [[Category:Boy]]

                                   friend of


                                                 Cartman
  Eric_Cartman friend_of Butters
  Eric_Cartman rdf:type Boy
        (RDF triple)
                                                           7
Semantic Wiki

• Fast-growing adoption
  –   Healthcare
  –   E-Government
  –   Entertainment
  –   Consulting
  –   Database
  –   …
• Inside ITA
  – OWL modeling with controlled natural language
  – Rule modeling

                                                    8
Semantic Wiki

• Can Semantic wiki reproduces the success of
  wiki to be among the most prominent of forms on
  the Web that harness the distributed, collective
  efforts of users to create content knowledge
  online?
• We have seen encouraging success in quite a
  few projects
• However, some issues are identified in our real-
  world experiences.


                                                     9
Knowledge Modeling

• Myth: users can do RDF-style (triple-
  based) modeling on SMW

• Fact: few is able to do this (at least without
  substantial training)




                                                   10
“Big Fat Page” effects
We gave a 3 hour training on SMW with a group of
 undergraduate students (most with no knowledge of
 RDF), and let them do a collective annotation task on TV
 shows. However, the result is not fully satisfactory
• Difference between categories and properties is not that easy to
  understand (see a lot misuse like Category:hug)
• To describe a thing with triples requires “thinking in RDF”, which
  needs some experiences.
• It is a big headache to choose the right vocabulary and it is hard to
  know what vocabulary to reuse.

As a result, many of the testees simply use the wiki as a notepad,
   without adding much semantic annotations, resulting in a long single
   “usual” wiki page.
                                                                          11
Schema or not schema?
• Two common knowledge models on a semantic wiki,
   – “Schema”-based modeling, often represented in the form of pre-
     defined wiki templates, that are used by “common” users of the
     wiki to access data via forms or prebuilt queries.
      • c.f. “infobox” in Wikipedia
      • =>stable, shared knowledge
   – Arbitrary RDF-style semantic markup - heavily used by a
     selected few elite group
      • => less structured, less shared knowledge


• A carefully pre-populated wiki “schema” (template), is as
  important as a schema in a database project.


                                                                      12
Template Example




Template as Schema    Form for the template


                                              13
Organization and Context

• Myth: semantic wiki, like wiki, allows you
  to write things freely.

• Fact: SMW does not support AAA
  – Every “triple” has to be on its subject’s page.
     • E.g., “South Park episode X is a parody of the this
       film” can only be said on X’s page.
  – Each subject and property of a triple must be
    a local page name.

                                                             14
Organization and Context

Why it may be problematic?

• May require the creation of many trivial, small pages.
• Is troublesome to describe things (e.g., an external URL)
  that have no corresponding wiki pages.
• Discourages users due to the difficulty of determining
  where to write knowledge (i.e., the best “subject” pages).
• Many users are confused of query-based pages: they do
  not know how to track the source of the queried results
  when they want to change a query-based page.


                                                               15
Organization and Context

Potential Solution

• Extending the SMW syntax
  – [[Cartman::friend of::Butters]]


• Introducing a context model to SMW
  – Context: Where, Who, When
  – No more need to use the subject to locate a triple



                                                         16
Collaboration Protocol

• Myth: semantic wiki, as wiki always does,
  allows compromises between different
  points of view.

• Fact: Semantic wiki only allows one
  version of the (semantic) “truth”.
  – A triple can not be both true and not true



                                                 17
Ontology War

                                   No! Cartman is
                                   only a Fictional
                                     Character


Cartman is a
    Boy




               http://www.gambling911.com/files/publisher/cat-fight-032609L.jpg



           Collaboration Protocol Support Needed!

                                                                                  18
Collaboration Protocol

• Avoid edit wars in Wikipedia
  – NPOV: allows multiple points
    of view co-exist on one page
    verifiable sources.
  – natural language text can
    accommodate and explain
    multiple points of view on a
    single page



                                   19
Collaboration Protocol
Two possible approaches

• To have categories and typed links optionally
  contextualized by authors, similar to the tag
  contextualizing mechanism in delicious and flickr.
   – http://example.com/author/term (contextualized name)
   – http://example.com/term (non-contextualized name)


• To introduce a context model of SMW knowledge
  statements, so that different versions of truth may be
  formally represented with explicitly given sources.

                                                            20
Conclusions
• Modeling in SMW can be regarded as an evolving
  cognition process and schema-based modeling is useful.

• We showed that a context/provenance model is needed
  for SMW to support better knowledge organization

• Collaboration protocols: to accommodate two versions of
  a fact, provenance of a term and/or triple should be
  traceable.




                                                            21
Solution Summary

• Simplicity: improve user interaction using
  forms and templates (schema).

• AAA: Context model
  – On-going work: “Semantic History”


• NPOV: Enabled by the context model


                                               22

More Related Content

PDF
"What is left to do?", Dublin Core 2012 Keynote
PDF
Blogs And Wikis In Academia
PPTX
Walz "Impacts of OER Flexibility: Understanding, Navigating, and Leveraging t...
PDF
We Need to Talk About Paid Editing: Sorting Out Wikipedia's Most Enduring Arg...
PPTX
Open Knowledge: Wikipedia and Beyond
PPTX
Developing an Ethical Approach to Using Wikipedia as the Front Matter to all ...
PPTX
The Wiki and the Wikipedia: A Comparative Study
PPTX
Editing Wikipedia: Why You Should and How You Can Support Your Users
"What is left to do?", Dublin Core 2012 Keynote
Blogs And Wikis In Academia
Walz "Impacts of OER Flexibility: Understanding, Navigating, and Leveraging t...
We Need to Talk About Paid Editing: Sorting Out Wikipedia's Most Enduring Arg...
Open Knowledge: Wikipedia and Beyond
Developing an Ethical Approach to Using Wikipedia as the Front Matter to all ...
The Wiki and the Wikipedia: A Comparative Study
Editing Wikipedia: Why You Should and How You Can Support Your Users

What's hot (20)

PPTX
ARLIS 2010 RLG Partnership Round Table
PPTX
Ashok "Creating Resilient OER in Times of Crisis"
PPTX
Brian kelly: Open practices for the connected researcher
PDF
Bridging the Web and Digital Publishing: EPUBWEB
PPTX
From Public Pedagogy to Critical Digital Praxis: Learning/Writing as Reflecti...
PDF
Undue Diligence: Seeking Low-risk Strategies for Making Collections of Unpubl...
PDF
Wollongong 090408232854-phpapp01
PDF
Metadata is a Love Note to the Future
PDF
From Frenemies to Friends: Embracing Wikipedia
PDF
[[edit]] this GLAM
PDF
Labour of Love: Why do people contribute, or not contribute, to Wikipedia art...
PPTX
Forms of Innovation: Collaboration, Attribution, Access
PPT
A Survey of the Landscape and State-of-Art in Semantic Wiki
PDF
Khmer Wikipedia's WikiMeetupPP2 - PPT Deck 201100430
PPTX
Cr4 moderna corrigido
PPTX
Open Practices for the Connected Researcher
PPT
Fys presentation 12_aug_2010
PPT
Beyond the Silos of the LAMs - Library, Archive, Museum Collaboration
PPTX
Introduction to social media for researchers
PPTX
Using social media to build your academic career
ARLIS 2010 RLG Partnership Round Table
Ashok "Creating Resilient OER in Times of Crisis"
Brian kelly: Open practices for the connected researcher
Bridging the Web and Digital Publishing: EPUBWEB
From Public Pedagogy to Critical Digital Praxis: Learning/Writing as Reflecti...
Undue Diligence: Seeking Low-risk Strategies for Making Collections of Unpubl...
Wollongong 090408232854-phpapp01
Metadata is a Love Note to the Future
From Frenemies to Friends: Embracing Wikipedia
[[edit]] this GLAM
Labour of Love: Why do people contribute, or not contribute, to Wikipedia art...
Forms of Innovation: Collaboration, Attribution, Access
A Survey of the Landscape and State-of-Art in Semantic Wiki
Khmer Wikipedia's WikiMeetupPP2 - PPT Deck 201100430
Cr4 moderna corrigido
Open Practices for the Connected Researcher
Fys presentation 12_aug_2010
Beyond the Silos of the LAMs - Library, Archive, Museum Collaboration
Introduction to social media for researchers
Using social media to build your academic career
Ad

Viewers also liked (7)

PDF
Walk the Talk: Analyzing the relation between implicit and explicit feedback ...
PPTX
The Triplex Approach for Recognizing Semantic Relations from Noun Phrases, Ap...
PPT
PDF
Semantic Relations
PPT
446 cognitivecategories-2017
PPTX
Lexical Relations in Semantic
PPTX
Database : Relational Data Model
Walk the Talk: Analyzing the relation between implicit and explicit feedback ...
The Triplex Approach for Recognizing Semantic Relations from Noun Phrases, Ap...
Semantic Relations
446 cognitivecategories-2017
Lexical Relations in Semantic
Database : Relational Data Model
Ad

Similar to Collective Cognition with Semantic Mediawiki: Lessons and Experiences (20)

PDF
The Unbearable Lightness of Wiking
PPTX
Semantic Wiki: Social Semantic Web In Action:
PPTX
Jist tutorial semantic wikis and applications
PDF
A Semantic Wiki To Support Knowledge Sharing In Innovation Activities
PPTX
Pre-SMWCon Spring 2012 meetup (short)
PDF
Applying And Extending Semantic Wikis For Semantic Web Courses
PPTX
Aswc2009 Smw Tutorial Part 1 Intro And Examples
PPTX
Semantic Wiki: Social Semantic Web in Use
PPTX
Msra talk smw+apps
PDF
A Controlled Natural Language Interface for Semantic MediaWiki
PDF
Development of a Controlled Natural Language Interface for Semantic MediaWiki
PDF
Lecture 3: Social Web Data Formats (2012)
PPT
Web 2
PDF
Lecture 3: Data Formats on the Social Web (2013)
PPTX
Semantic Wikis - Social Semantic Web in Action
PPT
A Short Tutorial to Semantic Media Wiki (SMW)
PDF
Bratsas Web Science Semantic Wiki
PPTX
Wiki in Teaching and Learning
PPTX
Tutorial semantic wikis and applications
PPTX
SMWCon Spring 2012 SMW+ Team Dev Update
The Unbearable Lightness of Wiking
Semantic Wiki: Social Semantic Web In Action:
Jist tutorial semantic wikis and applications
A Semantic Wiki To Support Knowledge Sharing In Innovation Activities
Pre-SMWCon Spring 2012 meetup (short)
Applying And Extending Semantic Wikis For Semantic Web Courses
Aswc2009 Smw Tutorial Part 1 Intro And Examples
Semantic Wiki: Social Semantic Web in Use
Msra talk smw+apps
A Controlled Natural Language Interface for Semantic MediaWiki
Development of a Controlled Natural Language Interface for Semantic MediaWiki
Lecture 3: Social Web Data Formats (2012)
Web 2
Lecture 3: Data Formats on the Social Web (2013)
Semantic Wikis - Social Semantic Web in Action
A Short Tutorial to Semantic Media Wiki (SMW)
Bratsas Web Science Semantic Wiki
Wiki in Teaching and Learning
Tutorial semantic wikis and applications
SMWCon Spring 2012 SMW+ Team Dev Update

More from Jie Bao (20)

PDF
python-graph-lovestory
PDF
unix toolbox 中文版
PDF
unixtoolbox.book
PDF
Lean startup 精益创业 新创企业的成长思维
PPT
Towards social webtops using semantic wiki
PPT
Semantic information theory in 20 minutes
PPT
Towards a theory of semantic communication
PPTX
Expressive Query Answering For Semantic Wikis (20min)
PDF
Startup best practices
PDF
Owl 2 quick reference card a4 size
PDF
ISWC 2010 Metadata Work Summary
PPTX
Expressive Query Answering For Semantic Wikis
PDF
PDF
24 Ways to Explore ISWC 2010 Data
PPT
Semantic Web: In Quest for the Next Generation Killer Apps
PDF
Representing financial reports on the semantic web a faithful translation f...
PDF
XACML 3.0 (Partial) Concept Map
PDF
Digital image self-adaptive acquisition in medical x-ray imaging
PPT
Privacy-Preserving Reasoning on the Semantic Web (Poster)
PPT
Privacy-Preserving Reasoning on the Semantic Web
python-graph-lovestory
unix toolbox 中文版
unixtoolbox.book
Lean startup 精益创业 新创企业的成长思维
Towards social webtops using semantic wiki
Semantic information theory in 20 minutes
Towards a theory of semantic communication
Expressive Query Answering For Semantic Wikis (20min)
Startup best practices
Owl 2 quick reference card a4 size
ISWC 2010 Metadata Work Summary
Expressive Query Answering For Semantic Wikis
24 Ways to Explore ISWC 2010 Data
Semantic Web: In Quest for the Next Generation Killer Apps
Representing financial reports on the semantic web a faithful translation f...
XACML 3.0 (Partial) Concept Map
Digital image self-adaptive acquisition in medical x-ray imaging
Privacy-Preserving Reasoning on the Semantic Web (Poster)
Privacy-Preserving Reasoning on the Semantic Web

Recently uploaded (20)

PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Machine learning based COVID-19 study performance prediction
PDF
Approach and Philosophy of On baking technology
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Empathic Computing: Creating Shared Understanding
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Network Security Unit 5.pdf for BCA BBA.
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPT
Teaching material agriculture food technology
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Encapsulation_ Review paper, used for researhc scholars
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Machine learning based COVID-19 study performance prediction
Approach and Philosophy of On baking technology
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Empathic Computing: Creating Shared Understanding
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Network Security Unit 5.pdf for BCA BBA.
The AUB Centre for AI in Media Proposal.docx
Advanced methodologies resolving dimensionality complications for autism neur...
Chapter 3 Spatial Domain Image Processing.pdf
Review of recent advances in non-invasive hemoglobin estimation
Understanding_Digital_Forensics_Presentation.pptx
Teaching material agriculture food technology
Programs and apps: productivity, graphics, security and other tools
Reach Out and Touch Someone: Haptics and Empathic Computing

Collective Cognition with Semantic Mediawiki: Lessons and Experiences

  • 1. Collective Cognition with Semantic Mediawiki: Lessons and Experiences Jie Bao, Li Ding and James Hendler Tetherless World Constellation, Department of Computer Science Rensselaer Polytechnic Institute, Troy, NY, USA {baojie,dingl,hendler}@cs.rpi.edu Network-Enabled Cognition workshop, ACITA, 2009 Sep 22, 2009, UMUC, Maryland
  • 2. Goal • To identify a few common pitfalls and limitations of Semantic Mediawiki in – knowledge modeling, – Knowledge organization and context, and – Collaboration protocols • To examine some potential approaches to solve these problems. 2
  • 3. Wiki and Recognition • Wiki is among the most prominent of forms on the Web that harness the distributed, collective efforts of users to create content online. – Ideas are formed, indentified or evolved and – Facts are discovered, refined or removed (for “wrong” ones) In the course of a never-ending editing process. 3
  • 4. Collective Cognition • A hypothetical example: write an outline for a new South Park episode. Picture courtesy of Wikipedia 4
  • 5. Collective Cognition • 9:00pm: show starts • 9:01pm: Wikipedia page for the episode is created • 9:01-9:08pm: 10 active users are adding summary when they watch • 9:08-9:10pm: ad time, some minor reorganization and typo fixing. • 9:10-9:30pm: repeat the above • 9:30-10:30pm: a user A is doing major refining, e.g. adding culture references • 10:31pm: a user B disagrees with A, undo A’s edit • 10:32pm: A undos B’s edit • 10:33pm: B leaves a message on A’s user page, to avoid further edit wars • 10:43pm: the two agrees to compromise with wording such that “It is implied that Chef is dead at the end of the episode; however, some others believe that it is not the case”. • The next day: user C adds a citation to this South Park episode’s page on a film’s page with that “South Park episode X is a parody of the this film”. 5
  • 6. Key elements • Simplicity: least training required to contribute. • AAA: anybody can say anything anywhere • NPOV: neutral point of view (among other collaboration protocols of Wikipedia) 6
  • 7. Semantic Wiki • Extensions to Wikis with some Semantic Web support – Example: Semantic Mediawiki Eric Cartman [[friend of::Butters]] Butters [[Category:Boy]] friend of Cartman Eric_Cartman friend_of Butters Eric_Cartman rdf:type Boy (RDF triple) 7
  • 8. Semantic Wiki • Fast-growing adoption – Healthcare – E-Government – Entertainment – Consulting – Database – … • Inside ITA – OWL modeling with controlled natural language – Rule modeling 8
  • 9. Semantic Wiki • Can Semantic wiki reproduces the success of wiki to be among the most prominent of forms on the Web that harness the distributed, collective efforts of users to create content knowledge online? • We have seen encouraging success in quite a few projects • However, some issues are identified in our real- world experiences. 9
  • 10. Knowledge Modeling • Myth: users can do RDF-style (triple- based) modeling on SMW • Fact: few is able to do this (at least without substantial training) 10
  • 11. “Big Fat Page” effects We gave a 3 hour training on SMW with a group of undergraduate students (most with no knowledge of RDF), and let them do a collective annotation task on TV shows. However, the result is not fully satisfactory • Difference between categories and properties is not that easy to understand (see a lot misuse like Category:hug) • To describe a thing with triples requires “thinking in RDF”, which needs some experiences. • It is a big headache to choose the right vocabulary and it is hard to know what vocabulary to reuse. As a result, many of the testees simply use the wiki as a notepad, without adding much semantic annotations, resulting in a long single “usual” wiki page. 11
  • 12. Schema or not schema? • Two common knowledge models on a semantic wiki, – “Schema”-based modeling, often represented in the form of pre- defined wiki templates, that are used by “common” users of the wiki to access data via forms or prebuilt queries. • c.f. “infobox” in Wikipedia • =>stable, shared knowledge – Arbitrary RDF-style semantic markup - heavily used by a selected few elite group • => less structured, less shared knowledge • A carefully pre-populated wiki “schema” (template), is as important as a schema in a database project. 12
  • 13. Template Example Template as Schema Form for the template 13
  • 14. Organization and Context • Myth: semantic wiki, like wiki, allows you to write things freely. • Fact: SMW does not support AAA – Every “triple” has to be on its subject’s page. • E.g., “South Park episode X is a parody of the this film” can only be said on X’s page. – Each subject and property of a triple must be a local page name. 14
  • 15. Organization and Context Why it may be problematic? • May require the creation of many trivial, small pages. • Is troublesome to describe things (e.g., an external URL) that have no corresponding wiki pages. • Discourages users due to the difficulty of determining where to write knowledge (i.e., the best “subject” pages). • Many users are confused of query-based pages: they do not know how to track the source of the queried results when they want to change a query-based page. 15
  • 16. Organization and Context Potential Solution • Extending the SMW syntax – [[Cartman::friend of::Butters]] • Introducing a context model to SMW – Context: Where, Who, When – No more need to use the subject to locate a triple 16
  • 17. Collaboration Protocol • Myth: semantic wiki, as wiki always does, allows compromises between different points of view. • Fact: Semantic wiki only allows one version of the (semantic) “truth”. – A triple can not be both true and not true 17
  • 18. Ontology War No! Cartman is only a Fictional Character Cartman is a Boy http://www.gambling911.com/files/publisher/cat-fight-032609L.jpg Collaboration Protocol Support Needed! 18
  • 19. Collaboration Protocol • Avoid edit wars in Wikipedia – NPOV: allows multiple points of view co-exist on one page verifiable sources. – natural language text can accommodate and explain multiple points of view on a single page 19
  • 20. Collaboration Protocol Two possible approaches • To have categories and typed links optionally contextualized by authors, similar to the tag contextualizing mechanism in delicious and flickr. – http://example.com/author/term (contextualized name) – http://example.com/term (non-contextualized name) • To introduce a context model of SMW knowledge statements, so that different versions of truth may be formally represented with explicitly given sources. 20
  • 21. Conclusions • Modeling in SMW can be regarded as an evolving cognition process and schema-based modeling is useful. • We showed that a context/provenance model is needed for SMW to support better knowledge organization • Collaboration protocols: to accommodate two versions of a fact, provenance of a term and/or triple should be traceable. 21
  • 22. Solution Summary • Simplicity: improve user interaction using forms and templates (schema). • AAA: Context model – On-going work: “Semantic History” • NPOV: Enabled by the context model 22