Digital Enterprise Research Institute                                                      www.deri.ie




                                                VoID – Metadata for
                                                   RDF datasets
                                           Richard Cyganiak, Linked Data Research Centre




 Stefan.Decker@deri.org
 http://www.StefanDecker.org/

 Copyright 2010 Digital Enterprise Research Institute. All rights reserved.




                            VoID
                    Vocabulary of Interlinked Datasets
W3C Interest Group note
                                            http://www.w3.org/TR/void/




       “What business-related datasets are
        in the LOD Cloud?”
          “Which datasets deal with politics
           and transparency in the EU?”
          “We have some DERI data. What
           could we link it to?”
Read …
                 http://esw.w3.org/TaskForces/CommunityProjects/LinkingOpenData/DataSets
Click …
Sindice …
Google …
And even if we find a dataset …
Standard questions




        What kind of data is there?
        Examples?
        Is it up to date?
        Who publishes it?
        Where is the SPARQL endpoint?
        Is there a download?
        How big is it?
        What’s the license?
Datasets




            A dataset is a set of RDF triples that are published,
             maintained or aggregated by a single provider
Linksets




            An RDF link is an RDF triple whose subject and object
             are described in different datasets
            A linkset is a collection of such RDF links between two
             datasets
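
The two definitions above can be illustrated with a small Python sketch. It assumes that "described in a dataset" can be approximated by a shared URI prefix, which is a simplification and not something the slides state. The function name `extract_linkset` and all sample URIs are hypothetical.

```python
from typing import Iterable, List, Tuple

# A triple is (subject, predicate, object); URIs and literals are plain strings here.
Triple = Tuple[str, str, str]


def extract_linkset(
    triples: Iterable[Triple], subject_space: str, object_space: str
) -> List[Triple]:
    """Keep triples whose subject lies in one URI space and whose object lies in another.

    Assumption: a shared URI prefix stands in for "described in this dataset".
    """
    return [
        (s, p, o)
        for s, p, o in triples
        if s.startswith(subject_space) and o.startswith(object_space)
    ]


# Hypothetical sample data; none of these URIs come from the slides.
SAMPLE_TRIPLES: List[Triple] = [
    ("http://example.org/a/alice", "http://www.w3.org/2002/07/owl#sameAs",
     "http://example.net/b/person42"),
    ("http://example.org/a/alice", "http://xmlns.com/foaf/0.1/name", "Alice"),
    ("http://example.org/a/alice", "http://xmlns.com/foaf/0.1/knows",
     "http://example.org/a/bob"),
]

LINKS = extract_linkset(SAMPLE_TRIPLES, "http://example.org/a/", "http://example.net/b/")
print(len(LINKS), "RDF link(s) in the linkset")
```

Only the `owl:sameAs` triple qualifies as an RDF link here: the name triple has a literal object, and the `foaf:knows` triple stays inside one dataset.
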
voiD schema
            [Diagram of the VoID schema, with areas labelled General metadata, Interlinking, and Statistics]
General dataset metadata




            Leveraging Dublin Core:
                   Dataset homepage
                   Publisher
                   Title and description
                   Categorisation
                   Licensing
                   Technical features
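
As a rough illustration of the items above, the following Python sketch prints a Turtle description of a dataset. The mapping of bullets to properties (`foaf:homepage`, `dcterms:publisher`, `dcterms:title`, `dcterms:description`, `dcterms:subject`, `dcterms:license`, `void:feature`) follows the VoID note as far as I recall it; the helper names and every example URI are hypothetical, and literals are assumed to be single-line strings.

```python
from typing import Dict

PREFIXES: Dict[str, str] = {
    "void": "http://rdfs.org/ns/void#",
    "dcterms": "http://purl.org/dc/terms/",
    "foaf": "http://xmlns.com/foaf/0.1/",
}


def literal(text: str) -> str:
    """Quote a single-line string as a Turtle literal, escaping backslashes and quotes."""
    return '"' + text.replace("\\", "\\\\").replace('"', '\\"') + '"'


def general_metadata(dataset_uri: str, title: str, description: str, homepage: str,
                     publisher: str, subject: str, license_uri: str, feature: str) -> str:
    """Build a Turtle document with one property per bullet on the slide."""
    header = [f"@prefix {prefix}: <{ns}> ." for prefix, ns in PREFIXES.items()]
    body = [
        f"<{dataset_uri}> a void:Dataset ;",
        f"    foaf:homepage <{homepage}> ;",           # dataset homepage
        f"    dcterms:publisher <{publisher}> ;",      # publisher
        f"    dcterms:title {literal(title)} ;",       # title
        f"    dcterms:description {literal(description)} ;",
        f"    dcterms:subject <{subject}> ;",          # categorisation
        f"    dcterms:license <{license_uri}> ;",      # licensing
        f"    void:feature <{feature}> .",             # technical features
    ]
    return "\n".join(header + [""] + body)


# All values below are placeholders chosen for illustration.
TURTLE = general_metadata(
    dataset_uri="http://example.org/dataset",
    title="Example dataset",
    description="A small dataset used to illustrate VoID.",
    homepage="http://example.org/",
    publisher="http://example.org/about#org",
    subject="http://dbpedia.org/resource/Politics",
    license_uri="http://creativecommons.org/licenses/by/3.0/",
    feature="http://www.w3.org/ns/formats/Turtle",
)
print(TURTLE)
```
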
Access metadata




            How to access the actual RDF triples:
                   SPARQL endpoints
                   RDF data dumps
                   Root resources
                   URI lookup endpoints
                   OpenSearch description documents
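
Each access method listed above corresponds, to the best of my knowledge, to one VoID property. The sketch below maps hypothetical keyword names to those properties and emits Turtle; the function `access_metadata`, the keyword names, and the example URLs are assumptions made for illustration.

```python
from typing import Dict

# Keyword name (hypothetical) -> VoID access property.
ACCESS_PROPERTIES: Dict[str, str] = {
    "sparql_endpoint": "void:sparqlEndpoint",
    "data_dump": "void:dataDump",
    "root_resource": "void:rootResource",
    "uri_lookup_endpoint": "void:uriLookupEndpoint",
    "opensearch_description": "void:openSearchDescription",
}


def access_metadata(dataset_uri: str, **access: str) -> str:
    """Return a Turtle description listing the given access points of a dataset."""
    unknown = sorted(set(access) - set(ACCESS_PROPERTIES))
    if unknown:
        raise ValueError(f"unknown access method(s): {', '.join(unknown)}")
    lines = [f"<{dataset_uri}> a void:Dataset"]
    lines += [f"    {ACCESS_PROPERTIES[name]} <{uri}>" for name, uri in access.items()]
    return "@prefix void: <http://rdfs.org/ns/void#> .\n\n" + " ;\n".join(lines) + " ."


# Placeholder URLs; a real description would point at the publisher's own services.
ACCESS_TURTLE = access_metadata(
    "http://example.org/dataset",
    sparql_endpoint="http://example.org/sparql",
    data_dump="http://example.org/dump.nt",
)
print(ACCESS_TURTLE)
```
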
Structural metadata




            High-level information about schema and internal
             structure of a dataset
            Can be helpful when exploring or querying datasets
                   Example resources
                   Patterns for resource URIs
                   Vocabularies
                   Dataset partitions
                   Statistics
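
For the statistics item, a minimal Python sketch of how a few counts could be derived from a dataset held in memory. It covers only `void:triples`, `void:distinctSubjects`, and `void:properties`; the function name and sample triples are hypothetical, and a real dataset would more likely be counted with a SPARQL query or an RDF library.

```python
from typing import Dict, Iterable, Tuple

Triple = Tuple[str, str, str]


def basic_statistics(triples: Iterable[Triple]) -> Dict[str, int]:
    """Count triples, distinct subjects, and distinct predicates.

    The input is turned into a set first, since a dataset is a set of triples
    and repeated triples should not be counted twice.
    """
    rows = set(triples)
    return {
        "void:triples": len(rows),
        "void:distinctSubjects": len({s for s, _, _ in rows}),
        "void:properties": len({p for _, p, _ in rows}),
    }


# Hypothetical sample data, including one repeated triple.
SAMPLE: Tuple[Triple, ...] = (
    ("http://example.org/alice", "http://xmlns.com/foaf/0.1/name", "Alice"),
    ("http://example.org/alice", "http://xmlns.com/foaf/0.1/knows", "http://example.org/bob"),
    ("http://example.org/bob", "http://xmlns.com/foaf/0.1/name", "Bob"),
    ("http://example.org/bob", "http://xmlns.com/foaf/0.1/name", "Bob"),
)

STATS = basic_statistics(SAMPLE)
for prop, count in STATS.items():
    print(f"{prop} {count}")
```
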
Describing linksets




                   Deployment and Discovery
Alongside a dataset




            Publishing a VoID file alongside a dataset
                   Turtle
                   RDFa
            Discovery (well-known URI)
                   http://yoursite/.well-known/void
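
A client that knows any URL on a site can derive the well-known location with standard URL resolution. This is a small sketch using Python's `urllib.parse.urljoin`; the function name and example URL are hypothetical, and the sketch only builds the URL without fetching it.

```python
from urllib.parse import urljoin


def well_known_void_url(any_url_on_site: str) -> str:
    """Resolve /.well-known/void against the host of the given URL."""
    return urljoin(any_url_on_site, "/.well-known/void")


VOID_URL = well_known_void_url("http://example.org/data/people?page=2")
print(VOID_URL)
```

Because the path starts with a slash, the result depends only on the scheme and host of the input URL, not on its path or query string.
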
Users




            Used by DBpedia, OpenLink, data.gov.uk, …
            30% of LOD datasets have VoID metadata
            The entire LOD Cloud described in VoID:
                   semantic.ckan.net
Applications
Ed Summers’ LOD Graph
Summary




            Metadata for linked datasets
            For the 4-5 star datasets
            W3C Interest Group note (VoID 2)
             http://www.w3.org/TR/void/
            Leverages Dublin Core, FOAF, etc.
            Used by DBpedia, OpenLink, data.gov.uk, …
            Used to generate the LOD Cloud diagram
            The entire LOD Cloud described in VoID:
                   semantic.ckan.net



