SlideShare a Scribd company logo
Getting started with the web of
              data
             Bill Roberts

                bill@swirrl.com
            http://www.swirrl.com
          http://webofdatablog.com
What is the web of data?
Tom Coates:

 “A web of data sources, services for exploring and
 manipulating data, and ways that users can connect
 them together”
Webofdata
Linked data

   “a method of exposing, sharing, and
   connecting data via dereferenceable URIs on
   the Web”

http://wikipedia.org/Linked_data
Linked data principles
From Sir TimBL:

1. Use URIs as names for things
2. Use HTTP URIs so that people can look up those
   names
3. When someone looks up a URI, provide useful
   information, using the standards
4. Include links to other URIs so that they can discover
   more things
http://www.readwriteweb.com/archives/interview_with_tim_b
                  erners-lee_part_1.php
Why publish linked data on the web?

•   Be found by your customers
•   Because it’s your mission
•   As part of academic publishing
•   As a paid-for service
•   Complement your main product or service
•   Build a reputation or brand
Who’s doing it?
How do you find it?
•   Sindice
•   Yahoo SearchMonkey
•   Google Rich Snippets
•   Following links
•   Data mashups/aggregators
•   SPARQL
How to get started?
Tom Scott of the BBC:

 ”...for those considering the Linked Data approach
 we’d say that 95% of the work is work you should be
 doing just to build for the (non-semantic) web. Get
 the fundamentals right and the leap to the Semantic
 Web is really more of a hop.”
Resource Description Framework

  Subject/predicate/object (= Thing/attribute/value)


 “Bill’s e-mail address is bill@swirrl.com”


       Bill                               bill@swirrl.com
                        e-mail address
Everything gets a URI

  http://swirrl.com/
                                                           mailto:bill@swirrl.com
      bill.rdf#me
                        http://xmlns.com/foaf/0.1/mbox



<rdf:RDF xmlns:foaf="http://xmlns.com/foaf/0.1/"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#”>
<foaf:Person rdf:about="http://swirrl.com/bill.rdf#me">
    <foaf:mbox rdf:resource=“mailto:bill@swirrl.com”>
</foaf:Person>



Don’t worry about the XML stuff: it’s just circles and arrows!
Everything gets a URI

http://swirrl.com/
                                                         foaf:Person
    bill.rdf#me
                             rdf:type




    <rdf:RDF xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#”>
    <foaf:Person rdf:about="http://swirrl.com/bill.rdf#me">
         <foaf:mbox rdf:resource=“mailto:bill@swirrl.com”>
    </foaf:Person>
Ontologies
• Use existing ones where possible
• Extend if you have to
• Make your own if you really have to
  –   People: Friend of a Friend (FOAF)
  –   People/companies: vCard
  –   Dates: iCal
  –   Authors, titles, etc: Dublin Core
  –   E-Commerce: GoodRelations
  –   Places: Geonames
  –   Projects: Description of a Project (DOAP)
Publish for people AND machines

URI for a real thing: http://dbpedia.org/resource/Edinburgh

HTML representation: http://dbpedia.org/page/Edinburgh

RDF representation: http://dbpedia.org/data/Edinburgh.xml
Content negotiation

                                             DBPedia
Tell me about resource/Edinburgh (in HTML)


      303 redirect to …/page/Edinburgh


      Ok give me …/page/Edinburgh

                HTML file
Content negotiation
curl –v –H “Accept: application/rdf+xml” http://dbpedia.org/resource/Edinburgh




                    Tell me about Edinburgh (in RDF)                DBPedia


                    303 redirect to …/data/Edinburgh.xml


                       Ok give me the data


                                RDF+XML file
RDFa
• Mix the HTML and RDF markup in one file.

+ve: simpler as only one file
-ve: sometimes you want to include data that
        doesn’t really fit the HTML structure
+ve: Google and Yahoo can use it (or at least
        some of it)
Tools
•   Validators – w3.org/RDF/Validator
•   Client libraries and frameworks - Jena
•   RDF databases - Sesame
•   Platforms – Talis
Looking for data publishing pilot studies…

                                 <rdf:Description
                                 rdf:about="http://dbpedia.org/resourc
                                 e/Guinea_pig_breed">
                                 <skos:subject
                                 rdf:resource="http://dbpedia.org/reso
                                 urce/Category:Guinea_pigs"/>
                                 </rdf:Description>




                            http://www.flickr.com/photos/mjjames/2767817164/
Useful links
Tom Coates “Native to a web of data” http://www.plasticbag.org/files/native/
Linked Data: http://www.linkeddata.org
RDF primer http://www.w3.org/TR/rdf-primer/
RDFa primer http://www.w3.org/TR/xhtml-rdfa-primer/
Berners-Lee “Linked data – design issues” http://www.w3.org/DesignIssues/LinkedData.html
Berners-Lee “Putting government data online” http://www.w3.org/DesignIssues/GovData.html
Tom Scott “Building coherence at bbc.co.uk” http://blogs.talis.com/nodalities/2009/01/building-coherence-at-bbccouk.php
RDF Validator http://www.w3.org/RDF/Validator/
Sesame RDF database http://www.openrdf.org
Jena – Semantic web framework for Java http://jena.sourceforge.net/
Talis Platform http://www.talis.com/platform/
Yahoo SearchMonkey http://developer.yahoo.com/searchmonkey/
Google Rich Snippets http://www.google.com/support/webmasters/bin/answer.py?answer=99170
Thanks!


http://www.slideshare.net/billroberts/webofdata

More Related Content

PPTX
Introduction to Linked Data
PPTX
Consuming Linked Data SemTech2010
PPT
Linked Data Tutorial
PPT
Library Linked Data and the Future of Bibliographic Control
PPT
Semantic Technologies: Representing Semantic Data
KEY
LIBRIS - Linked Library Data
PPTX
Get on the Linked Data Web!
ODP
Linked Data
Introduction to Linked Data
Consuming Linked Data SemTech2010
Linked Data Tutorial
Library Linked Data and the Future of Bibliographic Control
Semantic Technologies: Representing Semantic Data
LIBRIS - Linked Library Data
Get on the Linked Data Web!
Linked Data

What's hot (20)

PDF
Publishing and Using Linked Data
ODP
Linked Data
PDF
DHWI Linked Open Data - Show and Tell
PDF
Linked Data - Radical Change?
PPTX
Reminiscing about interoperability
PDF
Cataloger 3.0: Competencies and Education for the BIBFRAME Catalog
PPTX
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...
PDF
Semantic Web Applications in Libraries: The Road to BIBFRAME
PPT
Linked Data - the Future for Open Repositories?
KEY
It's not rocket surgery - Linked In: ALA 2011
PPT
Semantic Web
PPT
A Semantic Data Model for Web Applications
PPTX
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
PDF
Linked Data - Exposing what we have
PPT
Deep Web Presentation April 25
PPT
Deep Web
PDF
Archives & the Semantic Web
PPT
Exploring and using the Semantic Web - SSSW09 tutorial
ODP
Building a semantic website
Publishing and Using Linked Data
Linked Data
DHWI Linked Open Data - Show and Tell
Linked Data - Radical Change?
Reminiscing about interoperability
Cataloger 3.0: Competencies and Education for the BIBFRAME Catalog
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...
Semantic Web Applications in Libraries: The Road to BIBFRAME
Linked Data - the Future for Open Repositories?
It's not rocket surgery - Linked In: ALA 2011
Semantic Web
A Semantic Data Model for Web Applications
IFLA LIDASIG Open Session 2017: Introduction to Linked Data
Linked Data - Exposing what we have
Deep Web Presentation April 25
Deep Web
Archives & the Semantic Web
Exploring and using the Semantic Web - SSSW09 tutorial
Building a semantic website
Ad

Similar to Webofdata (20)

PPT
Publishing data on the Semantic Web
KEY
Transmission6 - Publishing Linked Data
PPTX
Hack U Barcelona 2011
PPT
Year of the Monkey: Lessons from the first year of SearchMonkey
PPT
Semantic Web Austin Yahoo
PPTX
Madrid Building blocks of Linked Data
ODP
Web of data
PDF
Linked Data Basics
PDF
ISWC GoodRelations Tutorial Part 2
PDF
GoodRelations Tutorial Part 2
PPTX
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
PDF
Deploying PHP applications using Virtuoso as Application Server
PPTX
Linked data HHS 2015
PDF
ISWC GoodRelations Tutorial Part 4
PDF
GoodRelations Tutorial Part 4
PPTX
Introduction to RDF Data Model
PPSX
The Web of data and web data commons
PPTX
Hacking with Semantic Web
PDF
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...
PDF
Introduction to RDF
Publishing data on the Semantic Web
Transmission6 - Publishing Linked Data
Hack U Barcelona 2011
Year of the Monkey: Lessons from the first year of SearchMonkey
Semantic Web Austin Yahoo
Madrid Building blocks of Linked Data
Web of data
Linked Data Basics
ISWC GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
Deploying PHP applications using Virtuoso as Application Server
Linked data HHS 2015
ISWC GoodRelations Tutorial Part 4
GoodRelations Tutorial Part 4
Introduction to RDF Data Model
The Web of data and web data commons
Hacking with Semantic Web
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...
Introduction to RDF
Ad

Recently uploaded (20)

PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
KodekX | Application Modernization Development
PPTX
Cloud computing and distributed systems.
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Electronic commerce courselecture one. Pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
The AUB Centre for AI in Media Proposal.docx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Understanding_Digital_Forensics_Presentation.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Dropbox Q2 2025 Financial Results & Investor Presentation
Review of recent advances in non-invasive hemoglobin estimation
Agricultural_Statistics_at_a_Glance_2022_0.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
KodekX | Application Modernization Development
Cloud computing and distributed systems.
sap open course for s4hana steps from ECC to s4
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Empathic Computing: Creating Shared Understanding
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Electronic commerce courselecture one. Pdf

Webofdata

  • 1. Getting started with the web of data Bill Roberts bill@swirrl.com http://www.swirrl.com http://webofdatablog.com
  • 2. What is the web of data? Tom Coates: “A web of data sources, services for exploring and manipulating data, and ways that users can connect them together”
  • 4. Linked data “a method of exposing, sharing, and connecting data via dereferenceable URIs on the Web” http://wikipedia.org/Linked_data
  • 5. Linked data principles From Sir TimBL: 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names 3. When someone looks up a URI, provide useful information, using the standards 4. Include links to other URIs so that they can discover more things
  • 7. Why publish linked data on the web? • Be found by your customers • Because it’s your mission • As part of academic publishing • As a paid-for service • Complement your main product or service • Build a reputation or brand
  • 9. How do you find it? • Sindice • Yahoo SearchMonkey • Google Rich Snippets • Following links • Data mashups/aggregators • SPARQL
  • 10. How to get started? Tom Scott of the BBC: ”...for those considering the Linked Data approach we’d say that 95% of the work is work you should be doing just to build for the (non-semantic) web. Get the fundamentals right and the leap to the Semantic Web is really more of a hop.”
  • 11. Resource Description Framework Subject/predicate/object (= Thing/attribute/value) “Bill’s e-mail address is bill@swirrl.com” Bill bill@swirrl.com e-mail address
  • 12. Everything gets a URI http://swirrl.com/ mailto:bill@swirrl.com bill.rdf#me http://xmlns.com/foaf/0.1/mbox <rdf:RDF xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#”> <foaf:Person rdf:about="http://swirrl.com/bill.rdf#me"> <foaf:mbox rdf:resource=“mailto:bill@swirrl.com”> </foaf:Person> Don’t worry about the XML stuff: it’s just circles and arrows!
  • 13. Everything gets a URI http://swirrl.com/ foaf:Person bill.rdf#me rdf:type <rdf:RDF xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#”> <foaf:Person rdf:about="http://swirrl.com/bill.rdf#me"> <foaf:mbox rdf:resource=“mailto:bill@swirrl.com”> </foaf:Person>
  • 14. Ontologies • Use existing ones where possible • Extend if you have to • Make your own if you really have to – People: Friend of a Friend (FOAF) – People/companies: vCard – Dates: iCal – Authors, titles, etc: Dublin Core – E-Commerce: GoodRelations – Places: Geonames – Projects: Description of a Project (DOAP)
  • 15. Publish for people AND machines URI for a real thing: http://dbpedia.org/resource/Edinburgh HTML representation: http://dbpedia.org/page/Edinburgh RDF representation: http://dbpedia.org/data/Edinburgh.xml
  • 16. Content negotiation DBPedia Tell me about resource/Edinburgh (in HTML) 303 redirect to …/page/Edinburgh Ok give me …/page/Edinburgh HTML file
  • 17. Content negotiation curl –v –H “Accept: application/rdf+xml” http://dbpedia.org/resource/Edinburgh Tell me about Edinburgh (in RDF) DBPedia 303 redirect to …/data/Edinburgh.xml Ok give me the data RDF+XML file
  • 18. RDFa • Mix the HTML and RDF markup in one file. +ve: simpler as only one file -ve: sometimes you want to include data that doesn’t really fit the HTML structure +ve: Google and Yahoo can use it (or at least some of it)
  • 19. Tools • Validators – w3.org/RDF/Validator • Client libraries and frameworks - Jena • RDF databases - Sesame • Platforms – Talis
  • 20. Looking for data publishing pilot studies… <rdf:Description rdf:about="http://dbpedia.org/resourc e/Guinea_pig_breed"> <skos:subject rdf:resource="http://dbpedia.org/reso urce/Category:Guinea_pigs"/> </rdf:Description> http://www.flickr.com/photos/mjjames/2767817164/
  • 21. Useful links Tom Coates “Native to a web of data” http://www.plasticbag.org/files/native/ Linked Data: http://www.linkeddata.org RDF primer http://www.w3.org/TR/rdf-primer/ RDFa primer http://www.w3.org/TR/xhtml-rdfa-primer/ Berners-Lee “Linked data – design issues” http://www.w3.org/DesignIssues/LinkedData.html Berners-Lee “Putting government data online” http://www.w3.org/DesignIssues/GovData.html Tom Scott “Building coherence at bbc.co.uk” http://blogs.talis.com/nodalities/2009/01/building-coherence-at-bbccouk.php RDF Validator http://www.w3.org/RDF/Validator/ Sesame RDF database http://www.openrdf.org Jena – Semantic web framework for Java http://jena.sourceforge.net/ Talis Platform http://www.talis.com/platform/ Yahoo SearchMonkey http://developer.yahoo.com/searchmonkey/ Google Rich Snippets http://www.google.com/support/webmasters/bin/answer.py?answer=99170