SlideShare a Scribd company logo
UKOLN is supported  by: Do the LOCAH-Motion How to Make Archival and Bibliographic Linked Data 16 th  February 2011 Dev8D, University of London, UK Adrian Stevenson LOCAH Project Manager
What is the LOCAH Project? L inked  O pen  C opac and  A rchives  H ub Funded by #JiscEXPO 2/10 ‘Expose’ call 1 year project. Started August 2010 Partners & Consultants: UKOLN  – Adrian Stevenson, Julian Cheal Mimas  – Jane Stevenson, Bethan Ruddock, Yogesh Patel Eduserv  – Pete Johnston Talis  – Leigh Dodds, Tim Hodson OCLC  - Ralph LeVan, Thom Hickey Ed Summers http://blogs.ukoln.ac.uk/locah/  tag: #locah
What are the Archives Hub and Copac? The Archives Hub is an aggregation of archival descriptions from archive repositories across the UK -  http://archiveshub.ac.uk Copac provides access to the merged library catalogues of libraries throughout the UK, including all national libraries - http://copac.ac.uk
What is Linked Data? URIs LD Design Issues Triples http://www.w3.org/DesignIssues/LinkedData.html
What does Linked Data Offer? Haven’t we been putting linked data on the web for years? In CSV , relational databases, XML etc? Well yes, but these approaches are not easy to integrate Web 2.0 mashups work against a fixed set of data sources Linked Data applications operate on top of an unbound, global data space.
What is LOCAH Doing? Part 1: Exposing the Linked Data Part 2: Creating a prototype visualisation Part 3: Reporting on opportunities and barriers
How are we Exposing the LOCAH Linked Data? Model our ‘things’ into RDF Transform the existing data into RDF/XML  Enhance the data Load the RDF/XML into a triple store Create Linked Data Views Document the process, opportunities and barriers on LOCAH Blog
1. Modelling ‘things’ into RDF Archives Hub data in ‘Encoded Archival Description’ EAD XML form Copac data in ‘Metadata Object Description Schema’ MODS XML form Take a step back from the data format Think about your ‘things’ What is EAD document “saying” about “things in the world”? What questions do we want to answer about those “things”? Can help make data more user-centric http://www.loc.gov/ead/  http://www.loc.gov/standards/mods/
Triples Thinking falls naturally into ‘triple’ statements ‘ Things’ have ‘properties’ with ‘values’ Subject – Predicate - Object Triples are basis of RDF More on all this at http://blogs.ukoln.ac.uk/locah/2010/09/28/model-a-first-cut/ Archival Resource Repository Provides Access To
Data Modelling Challenges Archival description is hierarchical and multi-level Information is provided about aggregation of records, and then about component parts Multi-level approach gives a strong sense of “context” “ lower level” units interpreted in context of the higher levels of description Arguably “incomplete” without the contextual data Linked Data involves ‘bounded descriptions Relations are asserted, e.g. member-of/component-of But there is  no requirement or expectation that data consumers will follow the links describing the relations
Data Modelling Challenges Hub: inconsistencies in data and lack of standardisation there's actually no content standard in the UK Copac: not a standard library catalogue merged catalogues with de-duplication to an extent but cannot be done entirely
1. Modelling ‘things’ into RDF Decide on patterns for URIs we generate Following guidance from W3C ‘ Cool URIs for the Semantic Web ’ and UK Cabinet Office ‘ Designing URI Sets for the UK Public Sector ’ E.g. http://example.ac.uk/id/findingaid/gb1086skinner ‘thing’ URI Use HTTP 303 ‘See Other’ to redirect to … E.g. http://example.ac.uk/doc/id/findingaid/gb1086skinner doc URI Content negotiates to … http://example.ac.uk/doc/…/doc.rdf , …/doc.html for documents about things More info at http://blogs.ukoln.ac.uk/locah/2010/11/16/identifying-the-things-uri-patterns-for-the-hub-linked-data/ http://www.w3.org/TR/cooluris/ http://www.cabinetoffice.gov.uk/resource-library/designing-uri-sets-uk-public-sector
1. Modelling ‘things’ into RDF Using existing RDF vocabularies: DC, SKOS, FOAF, BIBO, WGS84 Geo, Lexvo, ORE, LODE, Event and Time Ontologies Define additional RDF terms where required FindingAid ArchivalResource maintenanceAgency It can be hard to know where to look for vocabs and ontologies Decide on license – CC0, ODC PDD
Archival Resource Finding  Aid Agent  Family  Person  Place  Concept  Genre  Function  Organisation  maintainedBy/ maintains origination associatedWith accessProvidedBy/ providesAccessTo topic/ page hasPart/ partOf Repository (Agent) Book foaf:focus Is-a associatedWith Is-a Concept Scheme inScheme Place  administeredBy/ administers
Archives Hub Model (as at 14/2/2011) Archival Resource Finding  Aid EAD  Document Biographical  History Agent  Family  Person  Place  Concept  Genre  Function  Organisation  maintainedBy/ maintains origination associatedWith accessProvidedBy/ providesAccessTo topic/ page hasPart/ partOf hasPart/ partOf encodedAs/ encodes Repository (Agent) Book Place  topic/ page Language Level administeredBy/ administers hasBiogHist/ isBiogHistFor foaf:focus Is-a associatedWith level Is-a language Concept Scheme inScheme Object representedBy Postcode Unit Extent Creation Birth Death extent participates in Temporal Entity Temporal Entity at time at time product of in
Copac Model (as at November 2010 – work in progress)
Feedback Requested! We would like feedback on the model Appreciate this will be easier when the data available Via blog  http://blogs.ukoln.ac.uk/locah/2010/09/28/model-a-first-cut/ http://blogs.ukoln.ac.uk/locah/2010/11/08/some-more-things-some-extensions-to-the-hub-model/ http://blogs.ukoln.ac.uk/locah/2010/10/07/modelling-copac-data/ Via email, twitter, in person at Dev8d
2. Transforming in RDF/XML Need to transform data in EAD and MODS to RDF/XML, based on our models For Hub data created XSLT Stylesheet and used Saxon parser http://saxon.sourceforge.net/ Saxon runs the XSLT against a set of EAD files and creates a set of RDF/XML files For Copac data created in-house Java transformation program
3. Enhancing our data Already have some links: lexvo.org URIs for languages of archival materials reference.data.gov.uk URIs for time periods URIs for postcodes, using both UK Postcodes URIs and Ordnance Survey URIs  Currently also looking at: Virtual International Authority File Matches and links widely-used authority files - http://viaf.org/ Library Congress Subject Headings DBPedia
4. Load the RDF/XML into a triple store Using the Talis Platform triple store RDF/XML is HTTP POSTed We’re using Pynappl  Python client for the Talis Platform http://code.google.com/p/pynappl/ Store provides us with a SPARQL query interface
5. Create Linked Data Views Expose ‘bounded’ descriptions from the triple store over the Web Make available as documents in both human-readable HTML and RDF formats (also JSON, Turtle, CSV) Using Paget ‘Linked Data Publishing Framework’ http://code.google.com/p/paget/ PHP scripts query Sparql endpoint
‘ Out-of-the-box’ Paget view Linkedhub.ac.uk domain just given as example
Other Stuff We Might Try Linked Data API APIs, data formats and supporting tools to aid the adoption of linked data http://code.google.com/p/linked-data-api/ Entity extraction from free text Open Calais “ creates rich semantic metadata for the content you submit”  - http://www.opencalais.com/ DBPedia Spotlight (announced yesterday) “ solution for linking unstructured information sources to the Linked Open Data” http://dbpedia.org/spotlight
 
Can I Access the Locah Linked Data? Not quite yet … Hoping to release the Hub data by end February 2011 Copac data end March 2011 Release will include Linked Data views, Sparql endpoint details, example queries and supporting documentation
How are we creating the Visualisation Prototype? Based on researcher use cases Data queried from Sparql endpoint Use tools such as Simile, Many Eyes, Google Charts
Visualisation Protoype Using Timemap –  Googlemaps and Simile http://code.google.com/p/timemap / Early stages with this Will give location and ‘extent’ of archive. Will link through to Archives Hub
How are we reporting on opportunities and barriers? Recording these as we go along on the blog (tags: ‘opportunities’ ‘barriers’) Feed into #JiscEXPO synthesis work Not time to go into these today More at: http://blogs.ukoln.ac.uk/locah/2010/09/22/creating-linked-data-more-reflections-from-the-coal-face/ http://blogs.ukoln.ac.uk/locah/2010/12/01/assessing-linked-data
Questions? Contacts: Ade Stevenson @adrianstevenson Jane Stevenson @janestevenson Pete Johnston @ppetej Bethan Ruddock @bethanar Julian Cheal @juliancheal Yogesh Patel  http://mimas.ac.uk/staff/
Attribution and CC License  Sections of this presentation adapted from materials created by other members of the LOCAH Project This presentation available under creative commons   Non Commercial-Share Alike: http://creativecommons.org/licenses/by-nc/2.0/uk/

More Related Content

PPTX
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...
PDF
WWW2014 Overview of W3C Linked Data Platform 20140410
PPT
Getting Started With The Talis Platform
PPT
Library Linked Data and the Future of Bibliographic Control
PPTX
Usage of Linked Data: Introduction and Application Scenarios
PPT
Linked Data Tutorial
PDF
Introduction to RDF & SPARQL
ODP
Linked Data
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...
WWW2014 Overview of W3C Linked Data Platform 20140410
Getting Started With The Talis Platform
Library Linked Data and the Future of Bibliographic Control
Usage of Linked Data: Introduction and Application Scenarios
Linked Data Tutorial
Introduction to RDF & SPARQL
Linked Data

What's hot (20)

PPTX
Linked Data (in low-resource) Platforms: a mapping for Constrained Applicatio...
PPTX
Consuming Linked Data SemTech2010
PPT
Linked Data - the Future for Open Repositories?
PPT
Webofdata
PPTX
Get on the Linked Data Web!
PPTX
Introduction to Linked Data
PPT
Semantic Technologies: Representing Semantic Data
PPTX
Consuming Linked Data 4/5 Semtech2011
PPTX
Saveface - Save your Facebook content as RDF data
PPTX
ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...
PDF
Publishing and Using Linked Data
PDF
30° Nexa Lunch Seminar - Linked Data Platform vs real world
PPTX
Data.dcs: Converting Legacy Data into Linked Data
PPT
Semantic Web
PPT
A Semantic Data Model for Web Applications
ZIP
Intro to Linked Open Data in Libraries, Archives & Museums
PPTX
RDFa Tutorial
PDF
Drupal and Apache Stanbol. What if you could reliably do autotagging?
PPTX
Linked Data in Libraries
PPTX
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked Data (in low-resource) Platforms: a mapping for Constrained Applicatio...
Consuming Linked Data SemTech2010
Linked Data - the Future for Open Repositories?
Webofdata
Get on the Linked Data Web!
Introduction to Linked Data
Semantic Technologies: Representing Semantic Data
Consuming Linked Data 4/5 Semtech2011
Saveface - Save your Facebook content as RDF data
ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...
Publishing and Using Linked Data
30° Nexa Lunch Seminar - Linked Data Platform vs real world
Data.dcs: Converting Legacy Data into Linked Data
Semantic Web
A Semantic Data Model for Web Applications
Intro to Linked Open Data in Libraries, Archives & Museums
RDFa Tutorial
Drupal and Apache Stanbol. What if you could reliably do autotagging?
Linked Data in Libraries
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Ad

Similar to Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data (20)

PPTX
Linked Data and Locah, UKSG2011
PPT
Lifting the Lid on Linked Data
PPTX
High and Lows of Library Linked Data
PPT
Aggregation Using Linked Data – LOCAH Project Experiences
PPT
LOCAH Project and Considerations of Linked Data Approaches
PPT
Linked data and voyager
PPTX
Linked dataresearch
PPT
Locah Project Show and Tell
PDF
Adventures in Linked Data Land (presentation by Richard Light)
PPT
RDFa From Theory to Practice
PPT
Linked Data Overview - AGI Technical SIG
PPTX
It19 20140721 linked data personal perspective
PDF
Culture Geeks Feb talk: Adventures in Linked Data Land
PPTX
Linked Open Data: Opportunities & Barriers for Archives
PDF
Linked Data
PPT
Of Cataloging & Context
PPTX
TPDL2013 tutorial linked data for digital libraries 2013-10-22
PDF
Introduction to linked data
PPSX
Linked Data to Improve the OER Experience
PDF
Using Linked Data Resources to generate web pages based on a BBC case study
Linked Data and Locah, UKSG2011
Lifting the Lid on Linked Data
High and Lows of Library Linked Data
Aggregation Using Linked Data – LOCAH Project Experiences
LOCAH Project and Considerations of Linked Data Approaches
Linked data and voyager
Linked dataresearch
Locah Project Show and Tell
Adventures in Linked Data Land (presentation by Richard Light)
RDFa From Theory to Practice
Linked Data Overview - AGI Technical SIG
It19 20140721 linked data personal perspective
Culture Geeks Feb talk: Adventures in Linked Data Land
Linked Open Data: Opportunities & Barriers for Archives
Linked Data
Of Cataloging & Context
TPDL2013 tutorial linked data for digital libraries 2013-10-22
Introduction to linked data
Linked Data to Improve the OER Experience
Using Linked Data Resources to generate web pages based on a BBC case study
Ad

More from Adrian Stevenson (20)

PPTX
Tools for Data Manipulation - UKAD Open Refine Workshop
PPTX
Exploring British Design
PPTX
SEO Matters
PPTX
Linking Data with sameAs: Challenges and Solutions - Workshop
PPTX
“Il n’y a pas de hors-texte” - Challenges for Archival Linked Data
PPTX
Wrapping and Unwrapping History: What’s Gained and What’s Lost
PPTX
Very Gentle Linked Data Workshop
PPTX
Digital Humanities and the First World War
PPTX
Lessons from ‘Linking Lives’ and ‘WW1 Discovery’ Projects
PPTX
The Winner Takes it All? -APIs and Linked Data Battle It Out
PPTX
Introduction to APIs and Linked Data
PPTX
GLAM Rocks! London Semantic Web Meetup
PPTX
Linked Data - the Future for Open Repositories. Kultivate Workshop
PPTX
2 minutes on LOCAH Linking Lives at Europeana Tech 2011
PPTX
Report on the International Linked Open Data for Libraries, Archives and Muse...
PPT
Linked Data and the Semantic Web - Mimas Seminar
PPT
Semantic Technologies: Which Way Now? – UKOLN Response
PPT
SWORD 3 Kick-off Meeting
PPT
Linked Data and the Semantic Web: What Are They and Should I Care?
PPT
Making Repository Easier With SWORD
Tools for Data Manipulation - UKAD Open Refine Workshop
Exploring British Design
SEO Matters
Linking Data with sameAs: Challenges and Solutions - Workshop
“Il n’y a pas de hors-texte” - Challenges for Archival Linked Data
Wrapping and Unwrapping History: What’s Gained and What’s Lost
Very Gentle Linked Data Workshop
Digital Humanities and the First World War
Lessons from ‘Linking Lives’ and ‘WW1 Discovery’ Projects
The Winner Takes it All? -APIs and Linked Data Battle It Out
Introduction to APIs and Linked Data
GLAM Rocks! London Semantic Web Meetup
Linked Data - the Future for Open Repositories. Kultivate Workshop
2 minutes on LOCAH Linking Lives at Europeana Tech 2011
Report on the International Linked Open Data for Libraries, Archives and Muse...
Linked Data and the Semantic Web - Mimas Seminar
Semantic Technologies: Which Way Now? – UKOLN Response
SWORD 3 Kick-off Meeting
Linked Data and the Semantic Web: What Are They and Should I Care?
Making Repository Easier With SWORD

Recently uploaded (20)

PPTX
Machine Learning_overview_presentation.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Encapsulation theory and applications.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Tartificialntelligence_presentation.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Empathic Computing: Creating Shared Understanding
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Getting Started with Data Integration: FME Form 101
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
cuic standard and advanced reporting.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Machine Learning_overview_presentation.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
NewMind AI Weekly Chronicles - August'25-Week II
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
A comparative analysis of optical character recognition models for extracting...
Encapsulation theory and applications.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Tartificialntelligence_presentation.pptx
MYSQL Presentation for SQL database connectivity
Programs and apps: productivity, graphics, security and other tools
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Empathic Computing: Creating Shared Understanding
Spectral efficient network and resource selection model in 5G networks
MIND Revenue Release Quarter 2 2025 Press Release
Getting Started with Data Integration: FME Form 101
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
cuic standard and advanced reporting.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...

Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data

  • 1. UKOLN is supported by: Do the LOCAH-Motion How to Make Archival and Bibliographic Linked Data 16 th February 2011 Dev8D, University of London, UK Adrian Stevenson LOCAH Project Manager
  • 2. What is the LOCAH Project? L inked O pen C opac and A rchives H ub Funded by #JiscEXPO 2/10 ‘Expose’ call 1 year project. Started August 2010 Partners & Consultants: UKOLN – Adrian Stevenson, Julian Cheal Mimas – Jane Stevenson, Bethan Ruddock, Yogesh Patel Eduserv – Pete Johnston Talis – Leigh Dodds, Tim Hodson OCLC - Ralph LeVan, Thom Hickey Ed Summers http://blogs.ukoln.ac.uk/locah/ tag: #locah
  • 3. What are the Archives Hub and Copac? The Archives Hub is an aggregation of archival descriptions from archive repositories across the UK - http://archiveshub.ac.uk Copac provides access to the merged library catalogues of libraries throughout the UK, including all national libraries - http://copac.ac.uk
  • 4. What is Linked Data? URIs LD Design Issues Triples http://www.w3.org/DesignIssues/LinkedData.html
  • 5. What does Linked Data Offer? Haven’t we been putting linked data on the web for years? In CSV , relational databases, XML etc? Well yes, but these approaches are not easy to integrate Web 2.0 mashups work against a fixed set of data sources Linked Data applications operate on top of an unbound, global data space.
  • 6. What is LOCAH Doing? Part 1: Exposing the Linked Data Part 2: Creating a prototype visualisation Part 3: Reporting on opportunities and barriers
  • 7. How are we Exposing the LOCAH Linked Data? Model our ‘things’ into RDF Transform the existing data into RDF/XML Enhance the data Load the RDF/XML into a triple store Create Linked Data Views Document the process, opportunities and barriers on LOCAH Blog
  • 8. 1. Modelling ‘things’ into RDF Archives Hub data in ‘Encoded Archival Description’ EAD XML form Copac data in ‘Metadata Object Description Schema’ MODS XML form Take a step back from the data format Think about your ‘things’ What is EAD document “saying” about “things in the world”? What questions do we want to answer about those “things”? Can help make data more user-centric http://www.loc.gov/ead/ http://www.loc.gov/standards/mods/
  • 9. Triples Thinking falls naturally into ‘triple’ statements ‘ Things’ have ‘properties’ with ‘values’ Subject – Predicate - Object Triples are basis of RDF More on all this at http://blogs.ukoln.ac.uk/locah/2010/09/28/model-a-first-cut/ Archival Resource Repository Provides Access To
  • 10. Data Modelling Challenges Archival description is hierarchical and multi-level Information is provided about aggregation of records, and then about component parts Multi-level approach gives a strong sense of “context” “ lower level” units interpreted in context of the higher levels of description Arguably “incomplete” without the contextual data Linked Data involves ‘bounded descriptions Relations are asserted, e.g. member-of/component-of But there is no requirement or expectation that data consumers will follow the links describing the relations
  • 11. Data Modelling Challenges Hub: inconsistencies in data and lack of standardisation there's actually no content standard in the UK Copac: not a standard library catalogue merged catalogues with de-duplication to an extent but cannot be done entirely
  • 12. 1. Modelling ‘things’ into RDF Decide on patterns for URIs we generate Following guidance from W3C ‘ Cool URIs for the Semantic Web ’ and UK Cabinet Office ‘ Designing URI Sets for the UK Public Sector ’ E.g. http://example.ac.uk/id/findingaid/gb1086skinner ‘thing’ URI Use HTTP 303 ‘See Other’ to redirect to … E.g. http://example.ac.uk/doc/id/findingaid/gb1086skinner doc URI Content negotiates to … http://example.ac.uk/doc/…/doc.rdf , …/doc.html for documents about things More info at http://blogs.ukoln.ac.uk/locah/2010/11/16/identifying-the-things-uri-patterns-for-the-hub-linked-data/ http://www.w3.org/TR/cooluris/ http://www.cabinetoffice.gov.uk/resource-library/designing-uri-sets-uk-public-sector
  • 13. 1. Modelling ‘things’ into RDF Using existing RDF vocabularies: DC, SKOS, FOAF, BIBO, WGS84 Geo, Lexvo, ORE, LODE, Event and Time Ontologies Define additional RDF terms where required FindingAid ArchivalResource maintenanceAgency It can be hard to know where to look for vocabs and ontologies Decide on license – CC0, ODC PDD
  • 14. Archival Resource Finding Aid Agent Family Person Place Concept Genre Function Organisation maintainedBy/ maintains origination associatedWith accessProvidedBy/ providesAccessTo topic/ page hasPart/ partOf Repository (Agent) Book foaf:focus Is-a associatedWith Is-a Concept Scheme inScheme Place administeredBy/ administers
  • 15. Archives Hub Model (as at 14/2/2011) Archival Resource Finding Aid EAD Document Biographical History Agent Family Person Place Concept Genre Function Organisation maintainedBy/ maintains origination associatedWith accessProvidedBy/ providesAccessTo topic/ page hasPart/ partOf hasPart/ partOf encodedAs/ encodes Repository (Agent) Book Place topic/ page Language Level administeredBy/ administers hasBiogHist/ isBiogHistFor foaf:focus Is-a associatedWith level Is-a language Concept Scheme inScheme Object representedBy Postcode Unit Extent Creation Birth Death extent participates in Temporal Entity Temporal Entity at time at time product of in
  • 16. Copac Model (as at November 2010 – work in progress)
  • 17. Feedback Requested! We would like feedback on the model Appreciate this will be easier when the data available Via blog http://blogs.ukoln.ac.uk/locah/2010/09/28/model-a-first-cut/ http://blogs.ukoln.ac.uk/locah/2010/11/08/some-more-things-some-extensions-to-the-hub-model/ http://blogs.ukoln.ac.uk/locah/2010/10/07/modelling-copac-data/ Via email, twitter, in person at Dev8d
  • 18. 2. Transforming in RDF/XML Need to transform data in EAD and MODS to RDF/XML, based on our models For Hub data created XSLT Stylesheet and used Saxon parser http://saxon.sourceforge.net/ Saxon runs the XSLT against a set of EAD files and creates a set of RDF/XML files For Copac data created in-house Java transformation program
  • 19. 3. Enhancing our data Already have some links: lexvo.org URIs for languages of archival materials reference.data.gov.uk URIs for time periods URIs for postcodes, using both UK Postcodes URIs and Ordnance Survey URIs Currently also looking at: Virtual International Authority File Matches and links widely-used authority files - http://viaf.org/ Library Congress Subject Headings DBPedia
  • 20. 4. Load the RDF/XML into a triple store Using the Talis Platform triple store RDF/XML is HTTP POSTed We’re using Pynappl Python client for the Talis Platform http://code.google.com/p/pynappl/ Store provides us with a SPARQL query interface
  • 21. 5. Create Linked Data Views Expose ‘bounded’ descriptions from the triple store over the Web Make available as documents in both human-readable HTML and RDF formats (also JSON, Turtle, CSV) Using Paget ‘Linked Data Publishing Framework’ http://code.google.com/p/paget/ PHP scripts query Sparql endpoint
  • 22. ‘ Out-of-the-box’ Paget view Linkedhub.ac.uk domain just given as example
  • 23. Other Stuff We Might Try Linked Data API APIs, data formats and supporting tools to aid the adoption of linked data http://code.google.com/p/linked-data-api/ Entity extraction from free text Open Calais “ creates rich semantic metadata for the content you submit” - http://www.opencalais.com/ DBPedia Spotlight (announced yesterday) “ solution for linking unstructured information sources to the Linked Open Data” http://dbpedia.org/spotlight
  • 24.  
  • 25. Can I Access the Locah Linked Data? Not quite yet … Hoping to release the Hub data by end February 2011 Copac data end March 2011 Release will include Linked Data views, Sparql endpoint details, example queries and supporting documentation
  • 26. How are we creating the Visualisation Prototype? Based on researcher use cases Data queried from Sparql endpoint Use tools such as Simile, Many Eyes, Google Charts
  • 27. Visualisation Protoype Using Timemap – Googlemaps and Simile http://code.google.com/p/timemap / Early stages with this Will give location and ‘extent’ of archive. Will link through to Archives Hub
  • 28. How are we reporting on opportunities and barriers? Recording these as we go along on the blog (tags: ‘opportunities’ ‘barriers’) Feed into #JiscEXPO synthesis work Not time to go into these today More at: http://blogs.ukoln.ac.uk/locah/2010/09/22/creating-linked-data-more-reflections-from-the-coal-face/ http://blogs.ukoln.ac.uk/locah/2010/12/01/assessing-linked-data
  • 29. Questions? Contacts: Ade Stevenson @adrianstevenson Jane Stevenson @janestevenson Pete Johnston @ppetej Bethan Ruddock @bethanar Julian Cheal @juliancheal Yogesh Patel http://mimas.ac.uk/staff/
  • 30. Attribution and CC License Sections of this presentation adapted from materials created by other members of the LOCAH Project This presentation available under creative commons Non Commercial-Share Alike: http://creativecommons.org/licenses/by-nc/2.0/uk/

Editor's Notes

  • #4: Copac a union catalogue Both successful JISC services running for many years now Locah is a research project – will have to see if go into service with LD interface
  • #5: http://www.w3.org/DesignIssues/LinkedData.html
  • #9: Encoded Archival Description is an XML standard for encoding archival finding aids The Object Description Schema (MODS) is an XML -based bibliographic description schema MODS - Metadata Object Description Schema (MODS) is a schema for a bibliographic element set that may be used for a variety of purposes, and particularly for library applications. EAD - Things” include concepts and abstractions as well as material objects We want location – archives physical things so location important Also wanted event data, partly steered by the visualisation prototype Also ‘extent’ data – number of boxes
  • #13: 303 and Content Neg from ‘Cool URIs for the Semantic Web’
  • #14: Open Data Commons Public Domain Dedication Creative Commons CC0 license