SlideShare a Scribd company logo
Semantic Web and Linked Data for cultural heritage materials  Approaches in Europeana Antoine Isaac Vrije Universiteit Amsterdam Europeana DANS Linked Data and RDF workshop, Den Haag, July 28 th  2010
A web of cultural heritage data? ?
?
The current portal
 
Towards semantic search: facets
Building a search engine on top of metadata is difficult Intrinsic quality problems: correctness, coverage Especially when data is so heterogeneous 100s of formats From flat 5-fields records to 100-nodes XML trees Language issue! We currently use a simple interoperability format Quick-win showing quickly its limits
We can better use institutions’ original metadata Accommodate their different practices Data structures and semantics Access objects via a semantic layer of vocabularies for subjects, persons, places… Semantic ThoughtLab:  experimenting solutions
Towards semantics-enabled search Building a "semantic layer" to help accessing content
Towards semantics-enabled search Enhance access to Europeana content by semantics Query expansion, clustering of results Exploiting various types of relations "located in", "lived in", "is more specific concept"… Semantics are already there, in metadata and "controlled vocabularies" used in metadata Thesauri, classifications… Requires to make it properly machine-accessible
Prototype: Europeana Thought Lab http://europeana.eu/portal/thought-lab.html
Semantic auto-completion
Clustering of results
Baseline: matching concepts' label Controlled place name from a vocabulary at the Rijskmuseum Metadata for the object
A "more specific Egypte"?
A "more specific Egypte"? Metadata for the object
A place more specific than the Egypt one Semantic information on the Giza place in the Rijskmuseum Vocabulary
Following other relations
Following other relations - creator Metadata for the object Controlled person name from a vocabulary at the Rijskmuseum
Following other relations - match Information on Gustave Le Gray from the Rijskmuseum Vocabulary Matched to a "Gustave Le Gray" from another Vocabulary
Following other relations – death place Information on Gustave Le Gray from the Union List of Artist Names (Getty)
Following other relations – death place Information on Cairo from the Thesaurus of Geographic Names (Getty) Matched to "Cairo" from another vocabulary…
A hell of relations? Well, they were in the original data, we just had to make them  explicit! Cultural Heritage institution often have a wealth of metadata to share and exploit
Enabling bits & pieces Exploiting semantic links in CH vocabularies Rijksmuseum thesaurus:  Concept “Giza” narrower than concept “Egypte” Mapping/alignment between CH vocabularies Louvre’s “Égypte” equivalent to Rijksmuseum’s “Egypte” Enrichment of existing metadata The string “Egypt” in a metadata record indicates the concept of Egypt defined in Rijksmuseum thesaurus
SKOS, Knowledge Organization Systems and Linked Data SKOS allows representing (simple) KOS data as RDF animals NT cats cats UF domestic cats RT wildcats BT animals SN used only for domestic cats domestic cats USE cats wildcats
SKOS, KOSs and LD SKOS allows bridging across KOSs from different contexts http://www.w3.org/2004/02/skos/
SKOS is used! Many Libraries – not a surprise! Swedish National Library’s Libris catalogue and thesaurus  http://libris.kb.se/   Library of Congress’ vocabularies, including LCSH  http://id.loc.gov/   DNB’s Gemeinsame Normdatei (incl. SWD subject headings)  http://d-nb.info/gnd/   Documentation at  https://wiki.d-nb.de/display/LDS   BnF’s RAMEAU subject headings  http://stitch.cs.vu.nl/   OCLC’s DDC classification  http://dewey.info/  and VIAF  http://viaf.org/   STW economy thesaurus  http:// zbw.eu/stw   National Library of Hungary’s catalogue and thesauri  http:// oszkdk.oszk.hu/resource/DRJ/404  (example) Other fields Wikipedia categories through Dbpedia  http://dbpedia.org/   New York Times subject headings  http://data.nytimes.com/   IVOA astronomy vocabularies  http://www.ivoa.net/Documents/latest/Vocabularies.html GEMET environmental thesaurus  http://eionet.europa.eu/gemet   UMTHES Agrovoc  http://aims.fao.org/   Linked Life Data  http://linkedlifedata.com/   Taxonconcept  http://www.taxonconcept.org/   UK Public sector vocabularies  http://standards.esd.org.uk/   (e.g.,  http://id.esd.org.uk/lifeEvent/7  )
KOS Alignments? Quite many of them are linked to some other resource LCSH, SWD and RAMEAU interlinked through MACS mappings GND linked to DBpedia and VIAF Libris linked to LCSH Agrovoc to CAT, NAL, SWD, GEMET NYT to freebase, DBpedia, Geonames dbPedia links are overwhelming Hungary, STW, TaxonConcept, GND…
Enabling bits & pieces (c’ed) Appropriate data model for objects Generic constructs for creation, title, subject, etc. that are useful for querying Flexible data model SW ontology linking features allow to keep close to original data while having the generic notions above
Formal semantics, metadata schemas and querying The query: The existing description: Why is there a match? For the Europeana ontology, every rma:depicts statement implies a vra:subject statement rma: gezicht_in_cairo rma:Cairo rma:depicts rma:Egypt skos:broader ?x ?y vra:subject rma:Egypt skos:broader
Where are the challenges? Semantic conversion of data Using appropriate data models Enriching legacy metadata Semantic alignments Between description ontologies vra:depicts   rdfs:subPropertyOf   dc:subject Between concepts in controlled vocabularies iconclass:bird skos:closeMatch ddc:bird
Alignment of semantic references
Where are the challenges? Semantic alignment (c'ed) Find correspondences between large vocabularies In a multilingual context Scalability Plugging the semantic features into the Europeana production environment
The Europeana Data Model (EDM)  with input from Carlo Meghini, Guus Schreiber, Stefan Gradmann, Maxx Dekkers, Steffen Hennicke, Viktor de Boer et al. from Europeana V1
Rationale of EDM Precursor: ESE (Europeana Semantic Elements) represents lowest common denominator for object metadata convert datasets to Dublin-Core like standard forces interoperability major drawback: original metadata is lost most  values are simple strings EDM goals preserve original data while still allowing for interoperability Semantic Web representation A community-driven effort C ore experts, validation by representatives of various CH domains
EDM requirements & principles Distinction between “provided object” (painting, book, program) and digital representation Distinction between object and metadata record describing an object  Allow for multiple records for  same object, containing potentially contradictory statements about an object  Support for objects that are composed of other objects Standard metadata format that can be specialized Standard vocabulary format that can be specialized EDM should be based on existing standards
EDM basics OAI ORE for organization of metadata about an object Dublin Core for metadata representation SKOS for vocabulary representation + Links to CIDOC-CRM and other shared ontologies
Dublin Core EDM uses the latest version of  DCMI Metadata Terms for a core of semantically interoperable properties And for backward compatibility, cf. ESE Specified with an RDF model Specialization of 15 original DC elements Can be specialized itself see requirement -> this is a crucial distinction with ESE Used in the richest way possible Pointers to resources
SKOS: vocabulary publication on the Web Already seen…
OAI ORE Specification: http://www.openarchives.org/ore/1.0/toc.html  Specified with an RDF model Four key notions (RDF classes) Object : the book/painting/program being described Aggregation : organizes object information from a particular provider (museum, archive, library)  Proxy : the object as viewed in a metadata record Digital representation : some digital form of the object with a Web address
The Example - 1
The Example - 2
Aggregation organizes data of a provider  aggregation digital representation object provenance metadata
Proxy: metadata record for an object proxy object metadata
Multiple aggregations = multiple providers aggregation  of DMF aggregation  of Louvre
Multiple aggregations = multiple providers DMF proxy Louvre P roxy Louvre title DMF title The “real” painting
Europeana is “just” a special provider with processed/enriched metadata Europeana aggregation enriched metadata Europeana landing page
A flexible model: different semantic grains Cf. goal: “preserve original data while still allowing for interoperability” Keep data expressed as close as possible to original model Using mappings to more interoperable level
A flexible model: objects, events and the rest Preserving and exploiting original data also means being compatible with descriptions beyond simple object level Also crucial for semantic enrichment
A flexible model: object and events (2) Classes and Properties for event-, agent-, place-centric modeling Instances of (local) vocabularies using skos:Concept Using RDF, EDM allows any kind of network to be attached to a provided object.
A flexible model: object and events (3)
Advanced modeling in EDM Relations between provided objects Part-whole links for complex (hierarchical) objects  Derivation and versioning relations Relations between provided objects, for instance artistic derivation between works;  ens:isRepresentationOf ens:isNextInSequence
Linked data and cultural heritage?
The case for linked data in cultural heritage Not just a more sophisticated way to represent data! Ease of getting data from external sources Just going to the URI and fetch the RDF there Ease of publishing data Linked data as a dissemination channel for Europeana data Ease of linking across datasets Linked data as a dissemination channel for Europeana data Object identification as cornerstone Records are just a side feature!
From a movement supported by researchers To much wider awareness Open government initiatives, libraries… Continuing effort: show benefits of collaborating to a  cultural heritage data web Library Linked Data W3C incubator http://www.w3.org/2005/Incubator/lld Encouraging open linked data adoption
Linked Library Cloud beginning 2008 [Ross Singer, Code4Lib2010] http://code4lib.org/conference/2010/singer
Linked Library Cloud mid-2010 Plus: Germany NL Hungary NL STW GEMET NYT Agrovoc [Ross Singer, Code4Lib2010] http://code4lib.org/conference/2010/singer
Is that a surprise? Not really, let’s have a look at a real-world case…
Johan Stapel, Koninklijke Bibliotheek KOS & collection environment @KB
A broad range of datasets That describe the same  objects Or  related  objects Which are about similar  subjects Which were made by the same  persons Or related  persons In the same  places Etc…
Thanks! [email_address] Europeana.eu team Web and Media lab @ Vrije Universiteit Amsterdam http://wiki.cs.vu.nl/web-media EuropeanaConnect project http://www.europeanaconnect.eu/

More Related Content

PDF
Linked (Open) Data
PPTX
Presentation of the INVENiT Expert Meeting on Monday 16 February 2015
PPTX
Hack U Barcelona 2011
PPTX
Semantic Cartography: Using ontologies to create adaptable tools for text exp...
PPTX
One day workshop Linked Data and Semantic Web
PDF
Digital Humanities and Linked Data
PPTX
Linked Data: principles and examples
PPT
SKOS, Past, Present and Future
Linked (Open) Data
Presentation of the INVENiT Expert Meeting on Monday 16 February 2015
Hack U Barcelona 2011
Semantic Cartography: Using ontologies to create adaptable tools for text exp...
One day workshop Linked Data and Semantic Web
Digital Humanities and Linked Data
Linked Data: principles and examples
SKOS, Past, Present and Future

What's hot (20)

PPT
Tutorial on Semantic Digital Libraries (ESWC'2007)
PDF
Scalable Cross-lingual Document Similarity through Language-specific Concept ...
PPT
Corrib.org - OpenSource and Research
PDF
Lotus: Linked Open Text UnleaShed - ISWC COLD '15
ODP
20110929 tpdl2011 dl-research-humboldt
ODP
Riding the Semantic Web
PPTX
Towards digitizing scholarly communication
PPT
Structured Dynamics' Semantic Technologies Product Stack
PDF
Distributing Text Mining tasks with librAIry
PDF
Semantically-enabled Browsing of Large Multilingual Document Collections
PPTX
SWT Lecture Session 9 - RDB2RDF direct mapping
PPTX
SWT Lecture Session 10 R2RML Part 1
PDF
The web of interlinked data and knowledge stripped
PPTX
Sem webmaubeuge
PPTX
SWT Lecture Session 11 - R2RML part 2
PPTX
NLP2RDF Wortschatz and Linguistic LOD draft
PDF
Verifying Integrity Constraints of a RDF-based WordNet
PPTX
Linked Data for Czech Legislation
PDF
Open hpi semweb-06-part8
PPT
Digital Libraries of the Future: Use of Semantic Web and Social Bookmarking t...
Tutorial on Semantic Digital Libraries (ESWC'2007)
Scalable Cross-lingual Document Similarity through Language-specific Concept ...
Corrib.org - OpenSource and Research
Lotus: Linked Open Text UnleaShed - ISWC COLD '15
20110929 tpdl2011 dl-research-humboldt
Riding the Semantic Web
Towards digitizing scholarly communication
Structured Dynamics' Semantic Technologies Product Stack
Distributing Text Mining tasks with librAIry
Semantically-enabled Browsing of Large Multilingual Document Collections
SWT Lecture Session 9 - RDB2RDF direct mapping
SWT Lecture Session 10 R2RML Part 1
The web of interlinked data and knowledge stripped
Sem webmaubeuge
SWT Lecture Session 11 - R2RML part 2
NLP2RDF Wortschatz and Linguistic LOD draft
Verifying Integrity Constraints of a RDF-based WordNet
Linked Data for Czech Legislation
Open hpi semweb-06-part8
Digital Libraries of the Future: Use of Semantic Web and Social Bookmarking t...
Ad

Similar to Semantic Web and Linked Data for cultural heritage materials - Approaches in Europeana (20)

PPT
Tutorial on Semantic Digital Libraries (WWW'2007)
PDF
Mapping cross-­domain metadata to the Europeana Data Model (EDM) - EDM introd...
PPT
Digital Libraries of the Future
PPT
Mapping the European(a) metadata landscape
PPT
Semantic Web and Cultural Heritage Collections
ODP
Wikipedia as source of collaboratively created Knowledge Organization Systems
PPT
SKOS and Linked Data
PDF
Vocabularies as Linked Data: SENESCHAL & HeritageData.org
PPTX
DLF 2015 Presentation, "RDF in the Real World."
PPT
Linking data for Europeana
PDF
Europeana and linked cultural heritage data
PPT
Rdf and open linked data a first approach
PPT
Linked Data Tutorial
PPTX
Semantic web
PPT
Porting terminologies to the Semantic Web
ODP
20110324 linked openeuropeanahumanities
PPT
Convergence and Interoperability (IFLA 2011)
PPTX
Connecting Heterogeneous Collections using Linked Data
PPT
Irish Digital Libraries Summit
PPT
Linked Data - the Future for Open Repositories?
Tutorial on Semantic Digital Libraries (WWW'2007)
Mapping cross-­domain metadata to the Europeana Data Model (EDM) - EDM introd...
Digital Libraries of the Future
Mapping the European(a) metadata landscape
Semantic Web and Cultural Heritage Collections
Wikipedia as source of collaboratively created Knowledge Organization Systems
SKOS and Linked Data
Vocabularies as Linked Data: SENESCHAL & HeritageData.org
DLF 2015 Presentation, "RDF in the Real World."
Linking data for Europeana
Europeana and linked cultural heritage data
Rdf and open linked data a first approach
Linked Data Tutorial
Semantic web
Porting terminologies to the Semantic Web
20110324 linked openeuropeanahumanities
Convergence and Interoperability (IFLA 2011)
Connecting Heterogeneous Collections using Linked Data
Irish Digital Libraries Summit
Linked Data - the Future for Open Repositories?
Ad

More from Antoine Isaac (20)

PDF
Addressing multilingual challenges at Europeana: An update - DCMI 2021
PDF
Entity Management at Europeana - DCMI 2021
PPTX
Le Cadre de publication d'Europeana
PPTX
The Europeana Data Model Principles, community and innovation
PPTX
Europeana as a Linked Data (Quality) case
PPTX
Metadata aggregation of IIIF Resources at Europeana: status and plans
PPTX
IIIF and the Europeana mission
PPTX
Multilingual challenges and ongoing work to tackle them at Europeana
PPTX
Semantic Interoperability at Europeana - MultilingualDSIs2018
PDF
Lightweight rights modeling and linked data publication for online cultural h...
PDF
Designing a multilingual knowledge graph - DCMI2018
PDF
The Europeana Data Model - TPDL2018
PPT
Europeana et IIIF
PPT
Data scale and diversity issues at Europeana
PPTX
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
PPTX
Europeana APIs
PPTX
Enriching Cultural Heritage Data with DBpedia
PPTX
Modelling and exchanging annotations
PPTX
EuropeanaTech update - Europeana AGM 2015
PPTX
Modelling annotations for Europeana and related projects - DARIAH-EU WS
Addressing multilingual challenges at Europeana: An update - DCMI 2021
Entity Management at Europeana - DCMI 2021
Le Cadre de publication d'Europeana
The Europeana Data Model Principles, community and innovation
Europeana as a Linked Data (Quality) case
Metadata aggregation of IIIF Resources at Europeana: status and plans
IIIF and the Europeana mission
Multilingual challenges and ongoing work to tackle them at Europeana
Semantic Interoperability at Europeana - MultilingualDSIs2018
Lightweight rights modeling and linked data publication for online cultural h...
Designing a multilingual knowledge graph - DCMI2018
The Europeana Data Model - TPDL2018
Europeana et IIIF
Data scale and diversity issues at Europeana
Isaac - W3C Data on the Web Best Practices - Data Vocabularies
Europeana APIs
Enriching Cultural Heritage Data with DBpedia
Modelling and exchanging annotations
EuropeanaTech update - Europeana AGM 2015
Modelling annotations for Europeana and related projects - DARIAH-EU WS

Recently uploaded (20)

PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPT
Teaching material agriculture food technology
PDF
Encapsulation theory and applications.pdf
PDF
KodekX | Application Modernization Development
PDF
Electronic commerce courselecture one. Pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Diabetes mellitus diagnosis method based random forest with bat algorithm
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Teaching material agriculture food technology
Encapsulation theory and applications.pdf
KodekX | Application Modernization Development
Electronic commerce courselecture one. Pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
20250228 LYD VKU AI Blended-Learning.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
MYSQL Presentation for SQL database connectivity
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Big Data Technologies - Introduction.pptx
Unlocking AI with Model Context Protocol (MCP)
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf

Semantic Web and Linked Data for cultural heritage materials - Approaches in Europeana

  • 1. Semantic Web and Linked Data for cultural heritage materials Approaches in Europeana Antoine Isaac Vrije Universiteit Amsterdam Europeana DANS Linked Data and RDF workshop, Den Haag, July 28 th 2010
  • 2. A web of cultural heritage data? ?
  • 3. ?
  • 5.  
  • 7. Building a search engine on top of metadata is difficult Intrinsic quality problems: correctness, coverage Especially when data is so heterogeneous 100s of formats From flat 5-fields records to 100-nodes XML trees Language issue! We currently use a simple interoperability format Quick-win showing quickly its limits
  • 8. We can better use institutions’ original metadata Accommodate their different practices Data structures and semantics Access objects via a semantic layer of vocabularies for subjects, persons, places… Semantic ThoughtLab: experimenting solutions
  • 9. Towards semantics-enabled search Building a "semantic layer" to help accessing content
  • 10. Towards semantics-enabled search Enhance access to Europeana content by semantics Query expansion, clustering of results Exploiting various types of relations "located in", "lived in", "is more specific concept"… Semantics are already there, in metadata and "controlled vocabularies" used in metadata Thesauri, classifications… Requires to make it properly machine-accessible
  • 11. Prototype: Europeana Thought Lab http://europeana.eu/portal/thought-lab.html
  • 14. Baseline: matching concepts' label Controlled place name from a vocabulary at the Rijskmuseum Metadata for the object
  • 15. A "more specific Egypte"?
  • 16. A "more specific Egypte"? Metadata for the object
  • 17. A place more specific than the Egypt one Semantic information on the Giza place in the Rijskmuseum Vocabulary
  • 19. Following other relations - creator Metadata for the object Controlled person name from a vocabulary at the Rijskmuseum
  • 20. Following other relations - match Information on Gustave Le Gray from the Rijskmuseum Vocabulary Matched to a "Gustave Le Gray" from another Vocabulary
  • 21. Following other relations – death place Information on Gustave Le Gray from the Union List of Artist Names (Getty)
  • 22. Following other relations – death place Information on Cairo from the Thesaurus of Geographic Names (Getty) Matched to "Cairo" from another vocabulary…
  • 23. A hell of relations? Well, they were in the original data, we just had to make them explicit! Cultural Heritage institution often have a wealth of metadata to share and exploit
  • 24. Enabling bits & pieces Exploiting semantic links in CH vocabularies Rijksmuseum thesaurus: Concept “Giza” narrower than concept “Egypte” Mapping/alignment between CH vocabularies Louvre’s “Égypte” equivalent to Rijksmuseum’s “Egypte” Enrichment of existing metadata The string “Egypt” in a metadata record indicates the concept of Egypt defined in Rijksmuseum thesaurus
  • 25. SKOS, Knowledge Organization Systems and Linked Data SKOS allows representing (simple) KOS data as RDF animals NT cats cats UF domestic cats RT wildcats BT animals SN used only for domestic cats domestic cats USE cats wildcats
  • 26. SKOS, KOSs and LD SKOS allows bridging across KOSs from different contexts http://www.w3.org/2004/02/skos/
  • 27. SKOS is used! Many Libraries – not a surprise! Swedish National Library’s Libris catalogue and thesaurus http://libris.kb.se/ Library of Congress’ vocabularies, including LCSH http://id.loc.gov/ DNB’s Gemeinsame Normdatei (incl. SWD subject headings) http://d-nb.info/gnd/ Documentation at https://wiki.d-nb.de/display/LDS BnF’s RAMEAU subject headings http://stitch.cs.vu.nl/ OCLC’s DDC classification http://dewey.info/ and VIAF http://viaf.org/ STW economy thesaurus http:// zbw.eu/stw National Library of Hungary’s catalogue and thesauri http:// oszkdk.oszk.hu/resource/DRJ/404 (example) Other fields Wikipedia categories through Dbpedia http://dbpedia.org/ New York Times subject headings http://data.nytimes.com/ IVOA astronomy vocabularies http://www.ivoa.net/Documents/latest/Vocabularies.html GEMET environmental thesaurus http://eionet.europa.eu/gemet UMTHES Agrovoc http://aims.fao.org/ Linked Life Data http://linkedlifedata.com/ Taxonconcept http://www.taxonconcept.org/ UK Public sector vocabularies http://standards.esd.org.uk/ (e.g., http://id.esd.org.uk/lifeEvent/7 )
  • 28. KOS Alignments? Quite many of them are linked to some other resource LCSH, SWD and RAMEAU interlinked through MACS mappings GND linked to DBpedia and VIAF Libris linked to LCSH Agrovoc to CAT, NAL, SWD, GEMET NYT to freebase, DBpedia, Geonames dbPedia links are overwhelming Hungary, STW, TaxonConcept, GND…
  • 29. Enabling bits & pieces (c’ed) Appropriate data model for objects Generic constructs for creation, title, subject, etc. that are useful for querying Flexible data model SW ontology linking features allow to keep close to original data while having the generic notions above
  • 30. Formal semantics, metadata schemas and querying The query: The existing description: Why is there a match? For the Europeana ontology, every rma:depicts statement implies a vra:subject statement rma: gezicht_in_cairo rma:Cairo rma:depicts rma:Egypt skos:broader ?x ?y vra:subject rma:Egypt skos:broader
  • 31. Where are the challenges? Semantic conversion of data Using appropriate data models Enriching legacy metadata Semantic alignments Between description ontologies vra:depicts rdfs:subPropertyOf dc:subject Between concepts in controlled vocabularies iconclass:bird skos:closeMatch ddc:bird
  • 32. Alignment of semantic references
  • 33. Where are the challenges? Semantic alignment (c'ed) Find correspondences between large vocabularies In a multilingual context Scalability Plugging the semantic features into the Europeana production environment
  • 34. The Europeana Data Model (EDM) with input from Carlo Meghini, Guus Schreiber, Stefan Gradmann, Maxx Dekkers, Steffen Hennicke, Viktor de Boer et al. from Europeana V1
  • 35. Rationale of EDM Precursor: ESE (Europeana Semantic Elements) represents lowest common denominator for object metadata convert datasets to Dublin-Core like standard forces interoperability major drawback: original metadata is lost most values are simple strings EDM goals preserve original data while still allowing for interoperability Semantic Web representation A community-driven effort C ore experts, validation by representatives of various CH domains
  • 36. EDM requirements & principles Distinction between “provided object” (painting, book, program) and digital representation Distinction between object and metadata record describing an object Allow for multiple records for same object, containing potentially contradictory statements about an object Support for objects that are composed of other objects Standard metadata format that can be specialized Standard vocabulary format that can be specialized EDM should be based on existing standards
  • 37. EDM basics OAI ORE for organization of metadata about an object Dublin Core for metadata representation SKOS for vocabulary representation + Links to CIDOC-CRM and other shared ontologies
  • 38. Dublin Core EDM uses the latest version of DCMI Metadata Terms for a core of semantically interoperable properties And for backward compatibility, cf. ESE Specified with an RDF model Specialization of 15 original DC elements Can be specialized itself see requirement -> this is a crucial distinction with ESE Used in the richest way possible Pointers to resources
  • 39. SKOS: vocabulary publication on the Web Already seen…
  • 40. OAI ORE Specification: http://www.openarchives.org/ore/1.0/toc.html Specified with an RDF model Four key notions (RDF classes) Object : the book/painting/program being described Aggregation : organizes object information from a particular provider (museum, archive, library) Proxy : the object as viewed in a metadata record Digital representation : some digital form of the object with a Web address
  • 43. Aggregation organizes data of a provider aggregation digital representation object provenance metadata
  • 44. Proxy: metadata record for an object proxy object metadata
  • 45. Multiple aggregations = multiple providers aggregation of DMF aggregation of Louvre
  • 46. Multiple aggregations = multiple providers DMF proxy Louvre P roxy Louvre title DMF title The “real” painting
  • 47. Europeana is “just” a special provider with processed/enriched metadata Europeana aggregation enriched metadata Europeana landing page
  • 48. A flexible model: different semantic grains Cf. goal: “preserve original data while still allowing for interoperability” Keep data expressed as close as possible to original model Using mappings to more interoperable level
  • 49. A flexible model: objects, events and the rest Preserving and exploiting original data also means being compatible with descriptions beyond simple object level Also crucial for semantic enrichment
  • 50. A flexible model: object and events (2) Classes and Properties for event-, agent-, place-centric modeling Instances of (local) vocabularies using skos:Concept Using RDF, EDM allows any kind of network to be attached to a provided object.
  • 51. A flexible model: object and events (3)
  • 52. Advanced modeling in EDM Relations between provided objects Part-whole links for complex (hierarchical) objects Derivation and versioning relations Relations between provided objects, for instance artistic derivation between works; ens:isRepresentationOf ens:isNextInSequence
  • 53. Linked data and cultural heritage?
  • 54. The case for linked data in cultural heritage Not just a more sophisticated way to represent data! Ease of getting data from external sources Just going to the URI and fetch the RDF there Ease of publishing data Linked data as a dissemination channel for Europeana data Ease of linking across datasets Linked data as a dissemination channel for Europeana data Object identification as cornerstone Records are just a side feature!
  • 55. From a movement supported by researchers To much wider awareness Open government initiatives, libraries… Continuing effort: show benefits of collaborating to a cultural heritage data web Library Linked Data W3C incubator http://www.w3.org/2005/Incubator/lld Encouraging open linked data adoption
  • 56. Linked Library Cloud beginning 2008 [Ross Singer, Code4Lib2010] http://code4lib.org/conference/2010/singer
  • 57. Linked Library Cloud mid-2010 Plus: Germany NL Hungary NL STW GEMET NYT Agrovoc [Ross Singer, Code4Lib2010] http://code4lib.org/conference/2010/singer
  • 58. Is that a surprise? Not really, let’s have a look at a real-world case…
  • 59. Johan Stapel, Koninklijke Bibliotheek KOS & collection environment @KB
  • 60. A broad range of datasets That describe the same objects Or related objects Which are about similar subjects Which were made by the same persons Or related persons In the same places Etc…
  • 61. Thanks! [email_address] Europeana.eu team Web and Media lab @ Vrije Universiteit Amsterdam http://wiki.cs.vu.nl/web-media EuropeanaConnect project http://www.europeanaconnect.eu/