SlideShare a Scribd company logo
SKOS, RDFa & Co,
Linked Data Related Examples
INFO 4302 - April 20, 2011
Bernhard Haslhofer - Cornell University
Today we talk about...


• Simple Knowledge Organization System (SKOS)
• Machine-readable data in (X)HTML(5)
 (RDFa, Microformats, Microdata)
• Search Engines and machine-readable data
• Facebook Graph API & Open Graph Protocol
Linked Data Recap
Why Linked Data?

• There is lots of information on the Web
• ... valuable information that can be (re-)used
• Problem
 • information is usually expressed in the form of
     HTML documents
 •   the underlying raw data are locked in closed data
     silos (mostly DBMS)
Why Linked Data?

• The Web is successful because it provides
 • Uniform encoding (HTML)
 • Uniform addressing (URI)
 • Uniform transportation (HTTP)
 for the exchange of documents.
• Why not apply the same mechanism to the
 underlying data?
What is Linked Data?

• A method to build a Web of Data
• Architectural style, set of standards



                        Web
Publishing Data

• Distinguish between non-information and
 information resource
• Sample non-information resource
 • http://dbpedia.org/resource/The_Shining_(film)
• Sample information resource
 • http://dbpedia.org/page/The_Shining_(film) - HTML
 • http://dbpedia.org/data/The_Shining_(film) - RDF
Publishing Data

       GET http://dbpedia.org/resource/The_Shining_(film)
       Accept: application/rdf+xml



       303 See Other
       Location: http://dbpedia.org/data/The_Shining_(film)



       GET http://dbpedia.org/data/The_Shining_(film)
       Accept: application/rdf+xml



       200 OK
       ...
       <?xml version="1.0" encoding="utf-8"?>
       <rdf:RDF ...
Simple Knowledge Organization System (SKOS)

• A language for describing controlled vocabularies
      (taxonomies, thesauri, classification schemes)


                                              http://dbpedia.org/resource/
                                              Category:1980s_horror_films

                               skos:subject                                   rdf:type

http://dbpedia.org/resource/                         skos:broader
                                                                                         http://www.w3.org/2004/02/
     The_Shining_(film)                                                                       skos/core#Concept

                                                                             rdf:type
                                              http://dbpedia.org/resource/
                                                 Category:1980s_films
SKOS, RDFa, Microformats, Microdata
SKOS, RDFa, Microformats, Microdata
Excursus: The Semantic Spectrum
Knowledge Organization Systems (KOS)
semantic
expressiveness
                                                     Ontology
                                                                  DL, FOL, etc
                                                                OWL
                                        Data Model
                                                          UML
                                                     ER
                                  Thesaurus

                            Taxonomy              XML/S
                                           SKOS
                   Folksonomy

                 Glossary       Term/Tag list




                                                          complexity
Glossary
• A simple (alphabetical) list of terms and their
 definitions for a particular domain of
 knowledge
• Useful for creating clear and unambiguous
 term definitions
Folksonomy

• Collaborative tagging systems
• Web 2.0., social-media, etc...
Controlled Vocabulary
• A simple list of terms, definitions and naming
 conventions
• Some process “controls” adding and removing
 term definitions to ensure consistency
• Terms are often defined in relationship to
 each other
• Taxonomies, Thesauri, and Ontologies are
 special forms of controlled vocabularies
Taxonomy
• Classification of entities in a hierarchy
• A taxonomy node stands for a real-world entity
                                        animate object




                            agent


                                                         subclass of


                 person              organization




       manager            employee
source: http://upload.wikimedia.org/wikipedia/commons/b/bb/Linnaeus_-_Regnum_Animale_%281735%29.png
SKOS, RDFa, Microformats, Microdata
Thesaurus
• Associates the meaning of a term with the
 meaning of another term
• hypernym, hyponym, synonym, antonym
                                                            Imagery




                                           Aerial imagery             Infrared imagery




                        Radar imagery                                                                Combat support
                                                                                                       equipment



        Moving target                      Radar                                         Intelligence and electronic
         indicators                     photography                                          warfare equipment




                                                                                 Imaging systems




                              narrower than                                                     Infrared imaging
                                                                       Imaging radar
                                                                                                     systems
                              related to
SKOS, RDFa, Microformats, Microdata
SKOS, RDFa, Microformats, Microdata
Ontology

• Defines concepts that represent an area of
 knowledge
• Machine-read and -interpretable
 representation
• Classes, instances, relationships, properties,
 functions, constraints, etc.
http://www.bbc.co.uk/ontologies/programmes/2009-09-07.shtml
Simple Knowledge
Organization System (SKOS)
What is SKOS?

• A model for expressing the basic structure
 and content of concept schemes such as
 thesauri, classification schemes, taxonomies,
 folksonomies, and other similar types of
 controlled vocabularies
• Allows concepts to be composed and
 published as Linked Data on the Web
• Hides the complexity of OWL - easy to use
semantic
expressiveness
                                                     Ontology
                                                                  DL, FOL, etc
                                                                OWL
                                        Data Model
                                                          UML
                                                     ER
                                  Thesaurus

                            Taxonomy              XML/S
                                           SKOS
                   Folksonomy

                 Glossary       Term/Tag list




                                                          complexity
SKOS Concepts are...

• ... identified with URIs
• ... labeled with 1..* natural language strings
• ... documented with various types of notes
• ... semantically linked to each other
• ... aggregated into concept schemes
Example SKOS Concept
skos:Concept
• Concepts are
 • the units of thought: ideas, meanings, categories of
     objects, etc.
 •   abstract entities which are independent of the terms
     used to label them
                                skos:Concept




                                   rdf:type




                          lcsh:sh2007025344#concept




           @prefix skos: <http://www.w3.org/2004/02/skos/core#> .
           @prefix lcsh: <http://id.loc.gov/authorities/> .
skos:(pref|alt|hidden)Label
• Labels refer to concepts’ natural language(s)
 • skos:prefLabel: the preferred lexical label
 • skos:altLabel: alternative lexical labels (e.g., synonyms)
 • skos:hiddenLabel: labels useful for indexing
                                                     "Parody films"@en


                                   skos:prefLabel



       lcsh:sh2007025344#concept   skos:altLabel    "Movie parodies"@en


                                   skos:altLabel

                                                    "Send-up films"@en
SKOS Semantic Relationships
• The meaning of a concept is also defined by its
 links to other concepts
 •   skos:broader: hierarchical link to a more general concept
 •   skos:narrower: hierarchical link to a more specific concept
 •   skos:related: associative (non-hierarchical) link
                               skos:broader



              lcsh:sh2007025038#concept           skos:prefLabel   "Comedy films"@en




              skos:narrower        skos:broader




              lcsh:sh2007025344#concept           skos:prefLabel   "Parody films"@en




                   skos:narrower
SKOS Documentary Notes
• Add further human-readable documentation
 • skos:scopeNote: info about intended meaning
 • skos:definition: complete explanation of meaning
 • skos:example: example concept use
                                                     “This heading is used as a
                                                     genre/form heading for
                                                     films that comically imitate
   lcsh:sh2007025038#concept        skos:scopeNote
                                                     another work or group of
                                                     works of a more serious
                                                     nature.”@en

                   skos:prefLabel

                                     "Comedy films"@en
skos:ConceptScheme
• Allow the organization of skos:Concepts in
 some Knowledge Organization Scheme (KOS)

                          skos:ConceptScheme
                         lcsh:#genreFormTerms



                   skos:inScheme      skos:inScheme



           skos:Concept                         skos:Concept
    lcsh:sh2007025038#concept                        ...
Example


• Dereference and analyze
 “Jack Nicholson” @ New York Times

 • http://data.nytimes.com
 • http://data.nytimes.com/N5761411277431266513
SKOS, RDFa, Microformats, Microdata
RDFa, Microformats &
Microdata
Why RDFa?

     GET http://dbpedia.org/resource/The_Shining_(film)
     Accept: application/rdf+xml



     303 See Other
     Location: http://dbpedia.org/data/The_Shining_(film)



    Oh dear!
     GET http://dbpedia.org/data/The_Shining_(film)
     Accept: application/rdf+xml



     200 OK
     ...
     <?xml version="1.0" encoding="utf-8"?>
     <rdf:RDF ...
What is RDFa?

• A mechanism for embedding RDF metadata
 within XHTML 1.1 Web documents
• Defines a set of attributes to augment visual
 data with machine-readable hints
• User agents can extract triples from RDFa-
 enables Web pages
What is RDFa?

• Markup human-readable data (HTML) with
 machine-readable indicators
RDFa Example
      XHTML
...
All content on this site is licensed under
<a href="http://creativecommons.org/licenses/by/3.0/">
    a Creative Commons License
</a>.

      XHTML + RDFa
...
All content on this site is licensed under
<a rel=”license” href="http://creativecommons.org/licenses/by/3.0/">
    a Creative Commons License
</a>.
RDFa Attributes

• about and src: the resource the metadata is about
• rel and rev: (reverse) relationship between resources
• href and resource: the partner resource
• property: a property for the content of an element
• content: override content of an element
• datatype: specify the datatype of text
• typeof: specifies the RDF type(s) or a subject
RDFa Example
      XHTML
<div>
   <h2>The trouble with Bob</h2>
   <h3>Alice</h3>
   ...
</div>

      XHTML + RDFa
<div xmlns:dc="http://purl.org/dc/elements/1.1/">
   <h2 property="dc:title">The trouble with Bob</h2>
   <h3 property="dc:creator">Alice</h3>
   ...
</div>
RDFa Example
XHTML + RDFa
<div xmlns:dc="http://purl.org/dc/elements/1.1/">
! ! !
   <div about="/alice/posts/trouble_with_bob">
      <h2 property="dc:title">The trouble with Bob</h2>
      <h3 property="dc:creator">Alice</h3>
      ...
   </div>

   <div about="/alice/posts/jos_barbecue">
      <h2 property="dc:title">Jo's Barbecue</h2>
      <h3 property="dc:creator">Eve</h3>
      ...
   </div>

   ...

</div>
RDFa Example
XHTML + RDFa

<div typeof="foaf:Person" xmlns:foaf="http://xmlns.com/foaf/0.1/">
   <p property="foaf:name">
     Alice Birpemswick
   </p>

   <p>
     Email: <a rel="foaf:mbox"
href="mailto:alice@example.com">alice@example.com</a>
   </p>

   <p>
     Phone: <a rel="foaf:phone" href="tel:+1-617-555-7332">+1
617.555.7332</a>
   </p>
</div>
RDFa Examples


• OReilly.com: http://oreilly.com/catalog/
 9780596520694/
• Rottentomatoes: http://
 www.rottentomatoes.com/m/shining/
SKOS, RDFa, Microformats, Microdata
Microformats

• A Microformat extends conventional HTML
 tags with semantic information
• Started by Technorati, Inc.; now community-
 driven (IRC, mailing list, blogs)
• Make use of the following (X)HTML attributes
 • class
 • rel
Microformats Example
HTML
<div>
  <div>Joe Doe</div>
  <div>Jo</div>
  <div>The Example Company</div>
  <div>604-555-1234</div>
  <a href="http://example.com/">http://example.com/</a>
</div>


XHTML + Microformats
<head profile="http://www.w3.org/2006/03/hcard">
...
</head>
...
<div class="vcard">
  <div class="fn">Joe Doe</div>
  <div class="nickname">Jo</div>
  <div class="org">The Example Company</div>
  <div class="tel">604-555-1234</div>
  <a class="url" href="http://example.com/">http://example.com/</a>
</div>
Microformats Example
Available Microformats
Microformats vs. RDFa
  <div xmlns="http://www.w3.org/1999/xhtml"
       xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
       xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
       xmlns:v="http://www.w3.org/2006/vcard/ns#">

     <div about="http://example.com/me/behas" typeof="v:VCard">
       <span property="v:fn">Bernhard Haslhofer</span>
       <span property="v:nickname">behas</span>
       <div rel="v:adr">
            <div typeof="v:Address v:Work">
                <span property="v:street-address">301 College Avenue</span>
                <span property="v:locality">Ithaca</span>,
                <span property="v:postal-code">14850</span>,
                <span property="v:country-name">United States</span>.
            </div>
         </div>
         <a rel="v:email"
  href="mailto:bernhard.haslhofer@cornell.edu">bernhard.haslhofer@cornell.edu</a>.
     </div>
  </div>



Sample vCard Object in RDFa (see http://www.w3.org/Submission/vcard-rdf/)
Microformats vs. RDFa
   <div class="vcard">

     <span class="fn">Bernhard Haslhofer</span>

     <div class="adr">
       <div class="street-address">301 College Avenue</div>
       <span class="locality">Ithaca</span>
       <span class="postal-code">14850</span>
       <span class="country-name">United States</span>
     </div>

     <a class="email"
     href="mailto:bernhard.haslhofer@cornell.edu">bernhard.haslhofer@cornell.edu</a>

   </div>




hCard sample created with http://microformats.org/code/hcard/creator
Microformats                                  RDFa
    flat namespace                            XML namespaces

    support HTML4, XHTML 1.1, and
                                             support for XHTML 1.1
    HTML 5

    use latent HTML attributes               introduces new metadata attributes

    vocabulary defined by one
                                             open to any RDF-based vocabulary
    organization/community




Also see: http://evan.prodromou.name/RDFa_vs_microformats
Microdata (HTML5)
• A very young HTML 5 proposition that
 extends Microformats and addresses its
 shortcomings
• Items are created within an itemscope
• Ever item is assigned an arbitrary number of
 properties (itemprop)
• Uses global identifiers for typing and naming
 items
Microdata Example

<div itemscope itemtype="http://data-vocabulary.org/Person">

  <span itemprop="name">Bernhard Haslhofer</span>,
  <span itemprop="nickname">behas</span>.

  <div itemprop="address" itemscope itemtype="http://data-vocabulary.org/Address">
    <span itemprop="street-address">301 College Avenue</span>
    <span itemprop="locality">Ithaca</span>
    <span itemprop="country-name">United States</span>
  </div>

</div>
Search engines and
machine-readable data
SKOS, RDFa, Microformats, Microdata
Google Rich Snippets

• Reviews
• People
• Products
• Businesses and organizations
• Recipes
• Events
Google Rich Snippets
HTML

<div>
L’Amourita Pizza
Reviewed by Ulysses Grant on Jan 6.
Delicious, tasty pizza on Eastlake!
L'Amourita serves up traditional wood-fired Neapolitan-style pizza,
brought to your table promptly and without fuss. An ideal neighborhood pizza joint.
Rating: 4.5
</div>
Google Rich Snippets
Microdata
<div>
  <div itemscope itemtype="http://data-vocabulary.org/Review">
    <span itemprop="itemreviewed">L’Amourita Pizza</span>
    Reviewed by <span itemprop="reviewer">Ulysses Grant</span> on
    <time itemprop="dtreviewed" datetime="2009-01-06">Jan 6</time>.
    <span itemprop="summary">Delicious, tasty pizza in Eastlake!</span>
    <span itemprop="description">L'Amourita serves up traditional wood-fired
Neapolitan-style pizza,
       brought to your table promptly and without fuss. An ideal neighborhood pizza
joint.</span>
    Rating: <span itemprop="rating">4.5</span>
  </div>
</div>
Google Rich Snippets
Microformats
<div class="hreview">
   <span class="item">
       <span class="fn">L’Amourita Pizza</span>
   </span>
   Reviewed by <span class="reviewer">Ulysses Grant</span> on
   <span class="dtreviewed">
       Jan 6<span class="value-title" title="2009-01-06"></span>
   </span>.
   <span class="summary">Delicious, tasty pizza on Eastlake!</span>
   <span class="description">L'Amourita serves up traditional wood-fired
   Neapolitan-style pizza, brought to your table promptly and without fuss.
   An ideal neighborhood pizza joint.</span>
   Rating:
   <span class="rating">4.5</span>
</div>
Google Rich Snippets
RDFa
<div xmlns:v="http://rdf.data-vocabulary.org/#" typeof="v:Review">
   <span property="v:itemreviewed">L’Amourita Pizza</span>
   Reviewed by
   <span property="v:reviewer">Ulysses Grant</span> on
   <span property="v:dtreviewed" content="2009-01-06">Jan 6</span>.
   <span property="v:summary">Delicious, tasty pizza on Eastlake!</span>
   <span property="v:description">L'Amourita serves up traditional wood-fired
   Neapolitan-style pizza, brought to your table promptly and without fuss.
   An ideal neighborhood pizza joint.</span>
   Rating:
   <span property="v:rating">4.5</span>
</div>
Facebook Graph API &
Open Graph Protocol
SKOS, RDFa, Microformats, Microdata
SKOS, RDFa, Microformats, Microdata
Readings
Required Reading


• SKOS Primer: http://www.w3.org/TR/skos-primer/
• RDFa Primer: Bridging the Human and Data Webs.
 http://www.w3.org/TR/xhtml-rdfa-primer/
• HTML Microdata: http://www.w3.org/TR/microdata/
• Microformats: http://microformats.org/
Recommended Readings


• SKOS Vocabularies Overview:
• RDFa.info: http://rdfa.info/wiki/RDFa_Wiki
• CC REL: Creative Commons Rights Expression
 Language (CC REL)
Dream




                                                                              RDFa
                                                                                     HTML 5
Reality                                                                              Microdata

          ~2000                                                                         2011


source: http://upload.wikimedia.org/wikipedia/commons/a/a7/Wollmilchsau.png

More Related Content

PDF
Listerine - Up Close & Personal pitch
PDF
Market structure and powerful setups
DOC
37279739 bausch-and-lomb-case-study
PPTX
RDFa: an introduction
PDF
RDFa: introduction, comparison with microdata and microformats and how to use it
PDF
RDFa: putting RDF on the Web
PDF
semantic markup using schema.org
Listerine - Up Close & Personal pitch
Market structure and powerful setups
37279739 bausch-and-lomb-case-study
RDFa: an introduction
RDFa: introduction, comparison with microdata and microformats and how to use it
RDFa: putting RDF on the Web
semantic markup using schema.org

Similar to SKOS, RDFa, Microformats, Microdata (20)

PPTX
The state of KOS in the Linked Data movement
PDF
Knowledge Organization Systems
PPTX
Taxonomy, ontology, folksonomies & SKOS.
PDF
Introduction to SKOS - Simple Knowledge Organization System
PDF
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
PPT
SKOS - 2007 Open Forum on Metadata Registries - NYC
PPT
Pratt Sils LIS653 4 Fall 2007
PPT
Linking KOS Data [using SKOS and OWL2]
PPT
The Myth of Topic Maps
PDF
Knowledge Organisation Systems in Digital Libraries: A Comparative Study
PPT
Progress in semantic mapping - NKOS
PPTX
Knowledge organization
PPTX
Knowledge organization system
PDF
TEDDY - Thesaurus Editor: Design and Definition Yarn
PDF
Introduction to the Semantic Web
KEY
What's Next for the Web?
PPTX
Libraries and Linked Data: Looking to the Future (3)
PPT
It's all semantics! -The premises and promises of the semantic web
PPT
SKOS, Past, Present and Future
PDF
Supporting Sensemaking by Modelling Discourse as Hypermedia Networks
The state of KOS in the Linked Data movement
Knowledge Organization Systems
Taxonomy, ontology, folksonomies & SKOS.
Introduction to SKOS - Simple Knowledge Organization System
LoCloud Vocabulary Services: Thesaurus management introduction, Walter Koch a...
SKOS - 2007 Open Forum on Metadata Registries - NYC
Pratt Sils LIS653 4 Fall 2007
Linking KOS Data [using SKOS and OWL2]
The Myth of Topic Maps
Knowledge Organisation Systems in Digital Libraries: A Comparative Study
Progress in semantic mapping - NKOS
Knowledge organization
Knowledge organization system
TEDDY - Thesaurus Editor: Design and Definition Yarn
Introduction to the Semantic Web
What's Next for the Web?
Libraries and Linked Data: Looking to the Future (3)
It's all semantics! -The premises and promises of the semantic web
SKOS, Past, Present and Future
Supporting Sensemaking by Modelling Discourse as Hypermedia Networks
Ad

More from Bernhard Haslhofer (20)

PDF
Decentralized Finance (DeFi) - Understanding Risks in an Emerging Financial P...
PDF
Token Systems, Payment Channels, and Corporate Currencies
PDF
Can a blockchain solve the trust problem?
PDF
Measurements in Cryptocurrency Networks
PDF
Post-Bitcoin Cryptocurrencies, Off-Chain Transaction Channels, and Cryptocur...
PDF
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
PDF
O Bitcoin Where Art Thou? An Introduction to Cryptocurrency Analytics
PDF
Mind the Gap - Data Science Meets Software Engineering
PDF
GraphSense - Real-time Insight into Virtual Currency Ecosystems
PDF
BITCOIN - De-anonymization and Money Laundering Detection Strategies
PDF
Bitcoin - Introduction, Technical Aspects and Ongoing Developments
PDF
Maphub und Pelagios: Anwendung von Linked Data in den Digitalen Geisteswissen...
PDF
The value of open data and the OpenGLAM network
PDF
Things, not Strings
PDF
Offene Daten im Kulturbereich - Die pragmatische Perspektive
PDF
Open Data - Principles and Techniques
PDF
Semantic Tagging on Historical Maps
PDF
The Story behind Maphub
PDF
OpenGLAM Intro @ OKFN.AT Meetup Graz
PDF
Semantic Tagging for old maps...and other things on the Web
Decentralized Finance (DeFi) - Understanding Risks in an Emerging Financial P...
Token Systems, Payment Channels, and Corporate Currencies
Can a blockchain solve the trust problem?
Measurements in Cryptocurrency Networks
Post-Bitcoin Cryptocurrencies, Off-Chain Transaction Channels, and Cryptocur...
Insight Into Cryptocurrencies - Methods and Tools for Analyzing Blockchain-ba...
O Bitcoin Where Art Thou? An Introduction to Cryptocurrency Analytics
Mind the Gap - Data Science Meets Software Engineering
GraphSense - Real-time Insight into Virtual Currency Ecosystems
BITCOIN - De-anonymization and Money Laundering Detection Strategies
Bitcoin - Introduction, Technical Aspects and Ongoing Developments
Maphub und Pelagios: Anwendung von Linked Data in den Digitalen Geisteswissen...
The value of open data and the OpenGLAM network
Things, not Strings
Offene Daten im Kulturbereich - Die pragmatische Perspektive
Open Data - Principles and Techniques
Semantic Tagging on Historical Maps
The Story behind Maphub
OpenGLAM Intro @ OKFN.AT Meetup Graz
Semantic Tagging for old maps...and other things on the Web
Ad

SKOS, RDFa, Microformats, Microdata

  • 1. SKOS, RDFa & Co, Linked Data Related Examples INFO 4302 - April 20, 2011 Bernhard Haslhofer - Cornell University
  • 2. Today we talk about... • Simple Knowledge Organization System (SKOS) • Machine-readable data in (X)HTML(5) (RDFa, Microformats, Microdata) • Search Engines and machine-readable data • Facebook Graph API & Open Graph Protocol
  • 4. Why Linked Data? • There is lots of information on the Web • ... valuable information that can be (re-)used • Problem • information is usually expressed in the form of HTML documents • the underlying raw data are locked in closed data silos (mostly DBMS)
  • 5. Why Linked Data? • The Web is successful because it provides • Uniform encoding (HTML) • Uniform addressing (URI) • Uniform transportation (HTTP) for the exchange of documents. • Why not apply the same mechanism to the underlying data?
  • 6. What is Linked Data? • A method to build a Web of Data • Architectural style, set of standards Web
  • 7. Publishing Data • Distinguish between non-information and information resource • Sample non-information resource • http://dbpedia.org/resource/The_Shining_(film) • Sample information resource • http://dbpedia.org/page/The_Shining_(film) - HTML • http://dbpedia.org/data/The_Shining_(film) - RDF
  • 8. Publishing Data GET http://dbpedia.org/resource/The_Shining_(film) Accept: application/rdf+xml 303 See Other Location: http://dbpedia.org/data/The_Shining_(film) GET http://dbpedia.org/data/The_Shining_(film) Accept: application/rdf+xml 200 OK ... <?xml version="1.0" encoding="utf-8"?> <rdf:RDF ...
  • 9. Simple Knowledge Organization System (SKOS) • A language for describing controlled vocabularies (taxonomies, thesauri, classification schemes) http://dbpedia.org/resource/ Category:1980s_horror_films skos:subject rdf:type http://dbpedia.org/resource/ skos:broader http://www.w3.org/2004/02/ The_Shining_(film) skos/core#Concept rdf:type http://dbpedia.org/resource/ Category:1980s_films
  • 13. Knowledge Organization Systems (KOS) semantic expressiveness Ontology DL, FOL, etc OWL Data Model UML ER Thesaurus Taxonomy XML/S SKOS Folksonomy Glossary Term/Tag list complexity
  • 14. Glossary • A simple (alphabetical) list of terms and their definitions for a particular domain of knowledge • Useful for creating clear and unambiguous term definitions
  • 15. Folksonomy • Collaborative tagging systems • Web 2.0., social-media, etc...
  • 16. Controlled Vocabulary • A simple list of terms, definitions and naming conventions • Some process “controls” adding and removing term definitions to ensure consistency • Terms are often defined in relationship to each other • Taxonomies, Thesauri, and Ontologies are special forms of controlled vocabularies
  • 17. Taxonomy • Classification of entities in a hierarchy • A taxonomy node stands for a real-world entity animate object agent subclass of person organization manager employee
  • 20. Thesaurus • Associates the meaning of a term with the meaning of another term • hypernym, hyponym, synonym, antonym Imagery Aerial imagery Infrared imagery Radar imagery Combat support equipment Moving target Radar Intelligence and electronic indicators photography warfare equipment Imaging systems narrower than Infrared imaging Imaging radar systems related to
  • 23. Ontology • Defines concepts that represent an area of knowledge • Machine-read and -interpretable representation • Classes, instances, relationships, properties, functions, constraints, etc.
  • 26. What is SKOS? • A model for expressing the basic structure and content of concept schemes such as thesauri, classification schemes, taxonomies, folksonomies, and other similar types of controlled vocabularies • Allows concepts to be composed and published as Linked Data on the Web • Hides the complexity of OWL - easy to use
  • 27. semantic expressiveness Ontology DL, FOL, etc OWL Data Model UML ER Thesaurus Taxonomy XML/S SKOS Folksonomy Glossary Term/Tag list complexity
  • 28. SKOS Concepts are... • ... identified with URIs • ... labeled with 1..* natural language strings • ... documented with various types of notes • ... semantically linked to each other • ... aggregated into concept schemes
  • 30. skos:Concept • Concepts are • the units of thought: ideas, meanings, categories of objects, etc. • abstract entities which are independent of the terms used to label them skos:Concept rdf:type lcsh:sh2007025344#concept @prefix skos: <http://www.w3.org/2004/02/skos/core#> . @prefix lcsh: <http://id.loc.gov/authorities/> .
  • 31. skos:(pref|alt|hidden)Label • Labels refer to concepts’ natural language(s) • skos:prefLabel: the preferred lexical label • skos:altLabel: alternative lexical labels (e.g., synonyms) • skos:hiddenLabel: labels useful for indexing "Parody films"@en skos:prefLabel lcsh:sh2007025344#concept skos:altLabel "Movie parodies"@en skos:altLabel "Send-up films"@en
  • 32. SKOS Semantic Relationships • The meaning of a concept is also defined by its links to other concepts • skos:broader: hierarchical link to a more general concept • skos:narrower: hierarchical link to a more specific concept • skos:related: associative (non-hierarchical) link skos:broader lcsh:sh2007025038#concept skos:prefLabel "Comedy films"@en skos:narrower skos:broader lcsh:sh2007025344#concept skos:prefLabel "Parody films"@en skos:narrower
  • 33. SKOS Documentary Notes • Add further human-readable documentation • skos:scopeNote: info about intended meaning • skos:definition: complete explanation of meaning • skos:example: example concept use “This heading is used as a genre/form heading for films that comically imitate lcsh:sh2007025038#concept skos:scopeNote another work or group of works of a more serious nature.”@en skos:prefLabel "Comedy films"@en
  • 34. skos:ConceptScheme • Allow the organization of skos:Concepts in some Knowledge Organization Scheme (KOS) skos:ConceptScheme lcsh:#genreFormTerms skos:inScheme skos:inScheme skos:Concept skos:Concept lcsh:sh2007025038#concept ...
  • 35. Example • Dereference and analyze “Jack Nicholson” @ New York Times • http://data.nytimes.com • http://data.nytimes.com/N5761411277431266513
  • 38. Why RDFa? GET http://dbpedia.org/resource/The_Shining_(film) Accept: application/rdf+xml 303 See Other Location: http://dbpedia.org/data/The_Shining_(film) Oh dear! GET http://dbpedia.org/data/The_Shining_(film) Accept: application/rdf+xml 200 OK ... <?xml version="1.0" encoding="utf-8"?> <rdf:RDF ...
  • 39. What is RDFa? • A mechanism for embedding RDF metadata within XHTML 1.1 Web documents • Defines a set of attributes to augment visual data with machine-readable hints • User agents can extract triples from RDFa- enables Web pages
  • 40. What is RDFa? • Markup human-readable data (HTML) with machine-readable indicators
  • 41. RDFa Example XHTML ... All content on this site is licensed under <a href="http://creativecommons.org/licenses/by/3.0/"> a Creative Commons License </a>. XHTML + RDFa ... All content on this site is licensed under <a rel=”license” href="http://creativecommons.org/licenses/by/3.0/"> a Creative Commons License </a>.
  • 42. RDFa Attributes • about and src: the resource the metadata is about • rel and rev: (reverse) relationship between resources • href and resource: the partner resource • property: a property for the content of an element • content: override content of an element • datatype: specify the datatype of text • typeof: specifies the RDF type(s) or a subject
  • 43. RDFa Example XHTML <div> <h2>The trouble with Bob</h2> <h3>Alice</h3> ... </div> XHTML + RDFa <div xmlns:dc="http://purl.org/dc/elements/1.1/"> <h2 property="dc:title">The trouble with Bob</h2> <h3 property="dc:creator">Alice</h3> ... </div>
  • 44. RDFa Example XHTML + RDFa <div xmlns:dc="http://purl.org/dc/elements/1.1/"> ! ! ! <div about="/alice/posts/trouble_with_bob"> <h2 property="dc:title">The trouble with Bob</h2> <h3 property="dc:creator">Alice</h3> ... </div> <div about="/alice/posts/jos_barbecue"> <h2 property="dc:title">Jo's Barbecue</h2> <h3 property="dc:creator">Eve</h3> ... </div> ... </div>
  • 45. RDFa Example XHTML + RDFa <div typeof="foaf:Person" xmlns:foaf="http://xmlns.com/foaf/0.1/"> <p property="foaf:name"> Alice Birpemswick </p> <p> Email: <a rel="foaf:mbox" href="mailto:alice@example.com">alice@example.com</a> </p> <p> Phone: <a rel="foaf:phone" href="tel:+1-617-555-7332">+1 617.555.7332</a> </p> </div>
  • 46. RDFa Examples • OReilly.com: http://oreilly.com/catalog/ 9780596520694/ • Rottentomatoes: http:// www.rottentomatoes.com/m/shining/
  • 48. Microformats • A Microformat extends conventional HTML tags with semantic information • Started by Technorati, Inc.; now community- driven (IRC, mailing list, blogs) • Make use of the following (X)HTML attributes • class • rel
  • 49. Microformats Example HTML <div> <div>Joe Doe</div> <div>Jo</div> <div>The Example Company</div> <div>604-555-1234</div> <a href="http://example.com/">http://example.com/</a> </div> XHTML + Microformats <head profile="http://www.w3.org/2006/03/hcard"> ... </head> ... <div class="vcard"> <div class="fn">Joe Doe</div> <div class="nickname">Jo</div> <div class="org">The Example Company</div> <div class="tel">604-555-1234</div> <a class="url" href="http://example.com/">http://example.com/</a> </div>
  • 52. Microformats vs. RDFa <div xmlns="http://www.w3.org/1999/xhtml" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:v="http://www.w3.org/2006/vcard/ns#"> <div about="http://example.com/me/behas" typeof="v:VCard"> <span property="v:fn">Bernhard Haslhofer</span> <span property="v:nickname">behas</span> <div rel="v:adr"> <div typeof="v:Address v:Work"> <span property="v:street-address">301 College Avenue</span> <span property="v:locality">Ithaca</span>, <span property="v:postal-code">14850</span>, <span property="v:country-name">United States</span>. </div> </div> <a rel="v:email" href="mailto:bernhard.haslhofer@cornell.edu">bernhard.haslhofer@cornell.edu</a>. </div> </div> Sample vCard Object in RDFa (see http://www.w3.org/Submission/vcard-rdf/)
  • 53. Microformats vs. RDFa <div class="vcard"> <span class="fn">Bernhard Haslhofer</span> <div class="adr"> <div class="street-address">301 College Avenue</div> <span class="locality">Ithaca</span> <span class="postal-code">14850</span> <span class="country-name">United States</span> </div> <a class="email" href="mailto:bernhard.haslhofer@cornell.edu">bernhard.haslhofer@cornell.edu</a> </div> hCard sample created with http://microformats.org/code/hcard/creator
  • 54. Microformats RDFa flat namespace XML namespaces support HTML4, XHTML 1.1, and support for XHTML 1.1 HTML 5 use latent HTML attributes introduces new metadata attributes vocabulary defined by one open to any RDF-based vocabulary organization/community Also see: http://evan.prodromou.name/RDFa_vs_microformats
  • 55. Microdata (HTML5) • A very young HTML 5 proposition that extends Microformats and addresses its shortcomings • Items are created within an itemscope • Ever item is assigned an arbitrary number of properties (itemprop) • Uses global identifiers for typing and naming items
  • 56. Microdata Example <div itemscope itemtype="http://data-vocabulary.org/Person"> <span itemprop="name">Bernhard Haslhofer</span>, <span itemprop="nickname">behas</span>. <div itemprop="address" itemscope itemtype="http://data-vocabulary.org/Address"> <span itemprop="street-address">301 College Avenue</span> <span itemprop="locality">Ithaca</span> <span itemprop="country-name">United States</span> </div> </div>
  • 59. Google Rich Snippets • Reviews • People • Products • Businesses and organizations • Recipes • Events
  • 60. Google Rich Snippets HTML <div> L’Amourita Pizza Reviewed by Ulysses Grant on Jan 6. Delicious, tasty pizza on Eastlake! L'Amourita serves up traditional wood-fired Neapolitan-style pizza, brought to your table promptly and without fuss. An ideal neighborhood pizza joint. Rating: 4.5 </div>
  • 61. Google Rich Snippets Microdata <div> <div itemscope itemtype="http://data-vocabulary.org/Review"> <span itemprop="itemreviewed">L’Amourita Pizza</span> Reviewed by <span itemprop="reviewer">Ulysses Grant</span> on <time itemprop="dtreviewed" datetime="2009-01-06">Jan 6</time>. <span itemprop="summary">Delicious, tasty pizza in Eastlake!</span> <span itemprop="description">L'Amourita serves up traditional wood-fired Neapolitan-style pizza, brought to your table promptly and without fuss. An ideal neighborhood pizza joint.</span> Rating: <span itemprop="rating">4.5</span> </div> </div>
  • 62. Google Rich Snippets Microformats <div class="hreview"> <span class="item"> <span class="fn">L’Amourita Pizza</span> </span> Reviewed by <span class="reviewer">Ulysses Grant</span> on <span class="dtreviewed"> Jan 6<span class="value-title" title="2009-01-06"></span> </span>. <span class="summary">Delicious, tasty pizza on Eastlake!</span> <span class="description">L'Amourita serves up traditional wood-fired Neapolitan-style pizza, brought to your table promptly and without fuss. An ideal neighborhood pizza joint.</span> Rating: <span class="rating">4.5</span> </div>
  • 63. Google Rich Snippets RDFa <div xmlns:v="http://rdf.data-vocabulary.org/#" typeof="v:Review"> <span property="v:itemreviewed">L’Amourita Pizza</span> Reviewed by <span property="v:reviewer">Ulysses Grant</span> on <span property="v:dtreviewed" content="2009-01-06">Jan 6</span>. <span property="v:summary">Delicious, tasty pizza on Eastlake!</span> <span property="v:description">L'Amourita serves up traditional wood-fired Neapolitan-style pizza, brought to your table promptly and without fuss. An ideal neighborhood pizza joint.</span> Rating: <span property="v:rating">4.5</span> </div>
  • 64. Facebook Graph API & Open Graph Protocol
  • 68. Required Reading • SKOS Primer: http://www.w3.org/TR/skos-primer/ • RDFa Primer: Bridging the Human and Data Webs. http://www.w3.org/TR/xhtml-rdfa-primer/ • HTML Microdata: http://www.w3.org/TR/microdata/ • Microformats: http://microformats.org/
  • 69. Recommended Readings • SKOS Vocabularies Overview: • RDFa.info: http://rdfa.info/wiki/RDFa_Wiki • CC REL: Creative Commons Rights Expression Language (CC REL)
  • 70. Dream RDFa HTML 5 Reality Microdata ~2000 2011 source: http://upload.wikimedia.org/wikipedia/commons/a/a7/Wollmilchsau.png