SlideShare a Scribd company logo
Provenance Requirements for the
               Next Version of RDF

                          Christian Bizer         Yolanda Gil
     Jun Zhao            Freie Universität     ISI, University of
University of Oxford          Berlin          Southern California

                                       Satya Sahoo
          Paolo Missier              Kno.e.sis Center,
     University of Manchester      Wright State University



                       W3C Provenance Incubator Group
Outline
●   Background
●   Gathering provenance requirements
    ●   From both user and technical perspectives
    ●   From three dimensions: content, management and use
●   Requirement to RDF
●   Further information
The W3C Provenance Incubator Group
●   http://www.w3.org/2005/Incubator/prov/
●   Formed in September 2009 as part of the W3C
    Semantic Web Activity
●   Aim to provide
    ●   A state-of-the art understanding, and
    ●   A roadmap in the area of provenance for Semantic Web
        technologies, development, and possible
        standardization
A Definition of Web Provenance

The initial sources of information used as well
as any entity and process involved in
producing a data item

The data can be any web resource: a document,
an image, a dataset, an RDF statement or a set
of RDF graphs, ....
The Importance of Provenance
The Importance of Provenance
The Key Idea
●   We require additional capabilities that the current
    standard RDF model does not offer
    ●   Identity management of RDF statements
    ●   Annotation framework
    ●   ....
●   For interoperability we require standardized
    vocabularies and best practices for provenance
    descriptions
Where do our requirements come from?
Activities So Far
●   Collected >30 provenance use cases
●   Defined provenance dimensions
    ●   Content: attribution, evolution, process, entailment, etc
    ●   Management: publication, access, scalability, etc
    ●   Use: interoperability, trust, understanding, debugging, etc
●   A provenance requirement document
    ●   Three flagship use cases
    ●   http://www.w3.org/2005/Incubator/prov/wiki/User_Requirements


                         http://www.w3.org/2005/Incubator/prov/wiki/Provenance_Dimensions
What are the requirements?
Requirements from Provenance Content:
       What Needs to Be Represented
Requirement 1: Identity
●   Ability to refer to the resource being described
    ●   An area of an image, an RDF graph, a set of RDF graphs...
●   Resolving equality
Requirement 2: Evolution
●   How different versions are related
●   What transformations were applied
●   Best practices for minting new URIs
Requirements from Provenance Content:
       What Needs to Be Represented

Requirement 3: Entailment
●   Represent the distinction between asserted versus
    inferred provenance
Requirements from Provenance
                Management
Requirement 4: Publication
●   Linking provenance assertions with the resource
    ●   How to publish provenance: embed or link?
●   Associate publisher’s identification (e.g., digital
    signature)
Requirement 5: Querying
●   Query formulation: may mix references to the resource
    and to its provenance
●   Efficient query execution
Requirements from Provenance Use
●   No requirements were uncovered
State of the Art
●   Extension/alternatives to RDF models
    ●   RDF reification
        –   Querying is cumbersome
        –   Others ...
    ●   Named Graphs
    ●   OWL annotations
    ●   RDF molecules, Temporal RDF, PaCE Model ....




                 http://www.w3.org/2005/Incubator/prov/wiki/Relevant_Technologies
State of the Art (Cont.)
●   Vocabularies/ontologies to express provenance
    information
    ●   The Open Provenance Model (OPM)
    ●   Inference Web - Open Proof Language (PML)
    ●   The Provenance Vocabulary
    ●   Dublin Core
    ●   Open Archives Initiative - Object Reuse and Exchange (OAI-ORE)
    ●   Semantic Web Publishing Vocabulary
    ●   The SWAN-SIOC alignment
    ●   The Changeset Vocabulary
    ●   .......    http://www.w3.org/2005/Incubator/prov/wiki/Relevant_Technologies
Provenance Requirements to the
            RDF Community
●   Identification
    ●   Of any artifact, be a resource, a single RDF statement,
        a set of RDF statements or Web resources
    ●   Identity management
●   Annotations of RDF graphs
●   Standardized schemata, ontologies and vocabularies
Activities Ongoing
●   Mapping key terms from various provenance-related
    vocabularies
●   Report on the state-of-the-art in the area of
    provenance
See Also
●   The incubator group:
    http://www.w3.org/2005/Incubator/prov/
●   Provenance requirement document:
    http://www.w3.org/2005/Incubator/prov/wiki/User_Req
    uirements
●   Mapping provenance-related vocabularies:
    http://www.w3.org/2005/Incubator/prov/wiki/Provenanc
    e_Vocabulary_Mappings
Special thanks to members and invited experts of the
 W3C Provenance Incubator Group and UK EPSRC


           This work is licensed under a
Creative Commons Attribution-Share Alike 3.0 License
  (http://creativecommons.org/licenses/by-sa/3.0/)

More Related Content

PDF
Sparql a simple knowledge query
PPTX
Multilingual issues in the representation of international bibliographic stan...
PDF
Resource description framework
PPTX
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
PPT
Ontology Web services for Semantic Applications
PDF
PDF
Linked Open Vocabularies
PDF
Who and What Links to the Internet Archive
Sparql a simple knowledge query
Multilingual issues in the representation of international bibliographic stan...
Resource description framework
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
Ontology Web services for Semantic Applications
Linked Open Vocabularies
Who and What Links to the Internet Archive

What's hot (16)

PDF
Ontology, Semantic Web and DBpedia
PPTX
Web Archiving Profile - WADL 2013
PPTX
Semantic Application for Healthcare
PDF
Ontologies and semantic web
PPTX
Using OWL for the RESO Data Dictionary
PPTX
Naming things isn't that hard
PDF
Embedding Linked Data Invisibly into Web Pages: Strategies and Workflows for ...
PPT
Understanding RDF: the Resource Description Framework in Context (1999)
PPTX
The Dublin Core 1:1 Principle in the Age of Linked Data
PPT
NCBO SPARQL Endpoint
PPT
PPTX
Resource description framework
PDF
Annotations as Linked Data with Fedora4 and Triannon
PDF
Converting GHO to RDF
PPT
Introduction To RDF and RDFS
Ontology, Semantic Web and DBpedia
Web Archiving Profile - WADL 2013
Semantic Application for Healthcare
Ontologies and semantic web
Using OWL for the RESO Data Dictionary
Naming things isn't that hard
Embedding Linked Data Invisibly into Web Pages: Strategies and Workflows for ...
Understanding RDF: the Resource Description Framework in Context (1999)
The Dublin Core 1:1 Principle in the Age of Linked Data
NCBO SPARQL Endpoint
Resource description framework
Annotations as Linked Data with Fedora4 and Triannon
Converting GHO to RDF
Introduction To RDF and RDFS
Ad

Similar to 2010 06 rdf_next (20)

PPTX
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
PPTX
Semantic Web use cases in outcomes research
PPTX
The Rhetoric of Research Objects
PDF
Extending DCAM for Metadata Provenance
PDF
Engaging Information Professionals in the Process of Authoritative Interlinki...
PPTX
Publishing and Using Linked Open Data - Day 4
PPTX
Linked Open Data for Cultural Heritage
PDF
ISWC GoodRelations Tutorial Part 2
PDF
GoodRelations Tutorial Part 2
PPTX
Linked Energy Data Generation
PPTX
Usage of Linked Data: Introduction and Application Scenarios
PPT
Freire model api
PDF
Getting Started with Knowledge Graphs
PDF
Build Knowledge Graphs with Oracle RDF to Extract More Value from Your Data
PPTX
Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021
PDF
Research Shared: researchobject.org
PDF
The WorldCat Search API
PPTX
NISO access related projects (presented at the Charleston conference 2016)
PDF
Introduction to RDF
PPTX
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
Semantic Web use cases in outcomes research
The Rhetoric of Research Objects
Extending DCAM for Metadata Provenance
Engaging Information Professionals in the Process of Authoritative Interlinki...
Publishing and Using Linked Open Data - Day 4
Linked Open Data for Cultural Heritage
ISWC GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2
Linked Energy Data Generation
Usage of Linked Data: Introduction and Application Scenarios
Freire model api
Getting Started with Knowledge Graphs
Build Knowledge Graphs with Oracle RDF to Extract More Value from Your Data
Crossref LIVE Indonesia: An Introduction to Crossref, CRLIVE-ID 13 July 2021
Research Shared: researchobject.org
The WorldCat Search API
NISO access related projects (presented at the Charleston conference 2016)
Introduction to RDF
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
Ad

More from Jun Zhao (20)

PDF
Www sociam-2016-policy-reviews
PPTX
Query-generation-for-provo-data-201406
PDF
2012 05-swpm-provo
PDF
2012 04-ldow-prov
PDF
2011 03-provenance-workshop-edingurgh
ODP
2011 03-provenance-workshop-edingurgh
PDF
2010 10 provxg_datagovuk
PDF
2010 09 opm_tutorial_02-jun-opmv
PPT
2010 09 opm_tutorial_01-jun-usecase-datagovuk
ODP
2010 06 ipaw_prv
PDF
2010 05 edinburgh
PPT
2010 03 Lodoxf Openflydata
PPT
2009 09 Lod London
ODP
2009 0807 Lod Gmod
PPT
2009 Dils Flyweb
PPT
Talk_linked_data_for_hcls_at_iswc2009
PPT
myExperiment and AIDA
PPT
2008 11 13 Hcls Call
PPT
2008 Jun Zhao Eswc
PDF
2008 04 22 Jun Zhao Ldow
Www sociam-2016-policy-reviews
Query-generation-for-provo-data-201406
2012 05-swpm-provo
2012 04-ldow-prov
2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurgh
2010 10 provxg_datagovuk
2010 09 opm_tutorial_02-jun-opmv
2010 09 opm_tutorial_01-jun-usecase-datagovuk
2010 06 ipaw_prv
2010 05 edinburgh
2010 03 Lodoxf Openflydata
2009 09 Lod London
2009 0807 Lod Gmod
2009 Dils Flyweb
Talk_linked_data_for_hcls_at_iswc2009
myExperiment and AIDA
2008 11 13 Hcls Call
2008 Jun Zhao Eswc
2008 04 22 Jun Zhao Ldow

Recently uploaded (20)

PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
VCE English Exam - Section C Student Revision Booklet
PPTX
Cell Types and Its function , kingdom of life
PDF
Sports Quiz easy sports quiz sports quiz
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
Computing-Curriculum for Schools in Ghana
PPTX
Institutional Correction lecture only . . .
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
01-Introduction-to-Information-Management.pdf
PDF
Complications of Minimal Access Surgery at WLH
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PPTX
Cell Structure & Organelles in detailed.
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
VCE English Exam - Section C Student Revision Booklet
Cell Types and Its function , kingdom of life
Sports Quiz easy sports quiz sports quiz
PPH.pptx obstetrics and gynecology in nursing
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Supply Chain Operations Speaking Notes -ICLT Program
O5-L3 Freight Transport Ops (International) V1.pdf
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Computing-Curriculum for Schools in Ghana
Institutional Correction lecture only . . .
Abdominal Access Techniques with Prof. Dr. R K Mishra
01-Introduction-to-Information-Management.pdf
Complications of Minimal Access Surgery at WLH
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Microbial diseases, their pathogenesis and prophylaxis
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Cell Structure & Organelles in detailed.

2010 06 rdf_next

  • 1. Provenance Requirements for the Next Version of RDF Christian Bizer Yolanda Gil Jun Zhao Freie Universität ISI, University of University of Oxford Berlin Southern California Satya Sahoo Paolo Missier Kno.e.sis Center, University of Manchester Wright State University W3C Provenance Incubator Group
  • 2. Outline ● Background ● Gathering provenance requirements ● From both user and technical perspectives ● From three dimensions: content, management and use ● Requirement to RDF ● Further information
  • 3. The W3C Provenance Incubator Group ● http://www.w3.org/2005/Incubator/prov/ ● Formed in September 2009 as part of the W3C Semantic Web Activity ● Aim to provide ● A state-of-the art understanding, and ● A roadmap in the area of provenance for Semantic Web technologies, development, and possible standardization
  • 4. A Definition of Web Provenance The initial sources of information used as well as any entity and process involved in producing a data item The data can be any web resource: a document, an image, a dataset, an RDF statement or a set of RDF graphs, ....
  • 5. The Importance of Provenance
  • 6. The Importance of Provenance
  • 7. The Key Idea ● We require additional capabilities that the current standard RDF model does not offer ● Identity management of RDF statements ● Annotation framework ● .... ● For interoperability we require standardized vocabularies and best practices for provenance descriptions
  • 8. Where do our requirements come from?
  • 9. Activities So Far ● Collected >30 provenance use cases ● Defined provenance dimensions ● Content: attribution, evolution, process, entailment, etc ● Management: publication, access, scalability, etc ● Use: interoperability, trust, understanding, debugging, etc ● A provenance requirement document ● Three flagship use cases ● http://www.w3.org/2005/Incubator/prov/wiki/User_Requirements http://www.w3.org/2005/Incubator/prov/wiki/Provenance_Dimensions
  • 10. What are the requirements?
  • 11. Requirements from Provenance Content: What Needs to Be Represented Requirement 1: Identity ● Ability to refer to the resource being described ● An area of an image, an RDF graph, a set of RDF graphs... ● Resolving equality Requirement 2: Evolution ● How different versions are related ● What transformations were applied ● Best practices for minting new URIs
  • 12. Requirements from Provenance Content: What Needs to Be Represented Requirement 3: Entailment ● Represent the distinction between asserted versus inferred provenance
  • 13. Requirements from Provenance Management Requirement 4: Publication ● Linking provenance assertions with the resource ● How to publish provenance: embed or link? ● Associate publisher’s identification (e.g., digital signature) Requirement 5: Querying ● Query formulation: may mix references to the resource and to its provenance ● Efficient query execution
  • 14. Requirements from Provenance Use ● No requirements were uncovered
  • 15. State of the Art ● Extension/alternatives to RDF models ● RDF reification – Querying is cumbersome – Others ... ● Named Graphs ● OWL annotations ● RDF molecules, Temporal RDF, PaCE Model .... http://www.w3.org/2005/Incubator/prov/wiki/Relevant_Technologies
  • 16. State of the Art (Cont.) ● Vocabularies/ontologies to express provenance information ● The Open Provenance Model (OPM) ● Inference Web - Open Proof Language (PML) ● The Provenance Vocabulary ● Dublin Core ● Open Archives Initiative - Object Reuse and Exchange (OAI-ORE) ● Semantic Web Publishing Vocabulary ● The SWAN-SIOC alignment ● The Changeset Vocabulary ● ....... http://www.w3.org/2005/Incubator/prov/wiki/Relevant_Technologies
  • 17. Provenance Requirements to the RDF Community ● Identification ● Of any artifact, be a resource, a single RDF statement, a set of RDF statements or Web resources ● Identity management ● Annotations of RDF graphs ● Standardized schemata, ontologies and vocabularies
  • 18. Activities Ongoing ● Mapping key terms from various provenance-related vocabularies ● Report on the state-of-the-art in the area of provenance
  • 19. See Also ● The incubator group: http://www.w3.org/2005/Incubator/prov/ ● Provenance requirement document: http://www.w3.org/2005/Incubator/prov/wiki/User_Req uirements ● Mapping provenance-related vocabularies: http://www.w3.org/2005/Incubator/prov/wiki/Provenanc e_Vocabulary_Mappings
  • 20. Special thanks to members and invited experts of the W3C Provenance Incubator Group and UK EPSRC This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License (http://creativecommons.org/licenses/by-sa/3.0/)

Editor's Notes

  • #12: When you want to say something about a set of RDF triples. Not only the content but also their publication and querying. - interoperability - standard way
  • #13: When you want to say something about a set of RDF triples. Not only the content but also their publication and querying. - interoperability - standard way
  • #14: When you want to say something about a set of RDF triples. Not only the content but also their publication and querying. - interoperability - standard way