SlideShare a Scribd company logo
Open Annotation:
Annotating Scholarly Communication on the Web

                                      Robert Sanderson
                                           rsanderson@lanl.gov
                                           Los Alamos National Laboratory
                                           @azaroth42

                                      Herbert Van de Sompel
                                           herbertv@lanl.gov
                                           Los Alamos National Laboratory
                                           @hvdsomp


                                          http://www.openannotation.org/

                                      This research is funded, in part, by the
                                      Andrew W. Mellon Foundation



                             Open Annotation                                     1
            AAHEP5,   22nd   of September 2011, Ithaca, NY
Overview


•  Motivation

•  Open Annotation Model
    •  Basics
    •  Segments
    •  Resources Changing over Time
    •  Machine Annotations

•  Network Model

•  Quick Demo



                                    Open Annotation                 2
                   AAHEP5,   22nd   of September 2011, Ithaca, NY
Scholarly Communication

Scholarly Communication is increasingly:
   •  Online
   •  Open
   •  Distributed
   •  Collaborative
   •  Data-Oriented

Annotation is a scholarly primitive, spanning discipline and level.

Need to ensure that Digital Annotations fall under these headings!

    •  Apply the standards and architecture of the World Wide Web to the
        Annotation use case.
    •  Even if scholar doesn’t share annotations with others, she will want
        to access them from different tools and environments.


                                        Open Annotation                  3
                       AAHEP5,   22nd   of September 2011, Ithaca, NY
Open Annotation Collaboration

                     http://www.openannotation.org/

Focus on interoperable sharing of annotations
   •  Web-centric and open, not locked down silos
   •  Create, consume and interact in different environments
   •  Build from a simple model for simple cases,
          to more detailed for complex scholarly annotation requirements

The Collaboration:
   •  Los Alamos National Laboratory
   •  University of Illinois at Urbana-Champaign
   •  University of Queensland
   •  University of Maryland
   •  George Mason University
   •  … plus many adopters and partners


                                       Open Annotation                 4
                      AAHEP5,   22nd   of September 2011, Ithaca, NY
Open Annotation Data Model


Design Guidelines:
   •  Based on the Architecture of the World Wide Web
   •  … and on Linked Open Data
   •  Should be general enough for ease of adoption
   •  … and rich enough to cover scholarly use cases

Status: Beta, with 9 ongoing funded experiments to inform 1.0

Hardest part: Define what an Annotation is!
   •  "Aboutness" is key to distinguish from general metadata


         A document that describes how one resource is about
            one or more other resources, or part(s) thereof.


                                       Open Annotation                 5
                      AAHEP5,   22nd   of September 2011, Ithaca, NY
Basic Model

The basic model has three resources:
   •  Annotation   (an RDF document)
   •  Body         (the ‘comment’ of the annotation)
   •  Target       (the resource the Body is ‘about’)




                                     Open Annotation                 6
                    AAHEP5,   22nd   of September 2011, Ithaca, NY
Basic Model Example




                 Open Annotation                 7
AAHEP5,   22nd   of September 2011, Ithaca, NY
Additional Relationships and Properties

Any of the resources can have additional information attached,
such as creator, date of creation, title, etc.




                                     Open Annotation                 8
                    AAHEP5,   22nd   of September 2011, Ithaca, NY
Additional Properties Example




                    Open Annotation                 9
   AAHEP5,   22nd   of September 2011, Ithaca, NY
Annotation Types

There can be further types of Annotation, such as a Reply.
Example: Replies are Annotations on Annotations.




                                     Open Annotation                 10
                    AAHEP5,   22nd   of September 2011, Ithaca, NY
Annotation Types Example




                   Open Annotation                 11
  AAHEP5,   22nd   of September 2011, Ithaca, NY
Inline Information


It is important to be able to have content contained within the
Annotation document for Client Autonomy:

    •  Clients may be unable to mint new URIs for every resource
    •  Clients may wish to transmit only a single document
    •  Third parties can generate new URIs if the client does not



The W3C has a Content in RDF specification:
   •  http://www.w3.org/TR/Content-in-RDF10/




                                      Open Annotation                 12
                     AAHEP5,   22nd   of September 2011, Ithaca, NY
Inline Information: Body

•  We introduce a resource identified by a non resolvable URI,
   such as a UUID URN, as the Body.
•  We then embed the data within the Annotation document using
  'chars’ from Content in RDF.




                                    Open Annotation                 13
                   AAHEP5,   22nd   of September 2011, Ithaca, NY
Inline Body Example




                 Open Annotation                 14
AAHEP5,   22nd   of September 2011, Ithaca, NY
Multiple Targets

There are many use cases for multiple targets for an Annotation:
   •  Comparison of two or more resources
   •  Making a statement that applies to all of the resources
   •  Making a statement about multiple parts of a resource

The OAC Data Model allows for multiple targets by simply having
more than one hasTarget relationship.




                                     Open Annotation                 15
                    AAHEP5,   22nd   of September 2011, Ithaca, NY
Multiple Targets Example




                  Open Annotation                 16
 AAHEP5,   22nd   of September 2011, Ithaca, NY
Segments of Resources


Most annotations are about part of a resource


Different segments for different media types:

   •  Text: paragraph, arbitrary span of words
   •  Image: rectangular or arbitrary shaped area
   •  Audio: start and end time points, track name/number
   •  Video: area and time points
   •  Other: slice of a data set, volume in a 3d object, …




                                     Open Annotation                 17
                    AAHEP5,   22nd   of September 2011, Ithaca, NY
Segments of Resources


Web Architecture Segmentation:

   •  A URI with a Fragment identifies part of the resource:
   •  IETF Mime-type fragment identifiers; eg xpointer
   •  W3C Media Fragments URI specification for simple
      segments of media: http://www.w3.org/TR/media-frags/

We introduce a method of constraining resources:

   •  Introduce an approach for arbitrarily complex segments that
   cannot be expressed using Fragments
   •  Can be applied to Body or Target resource



                                      Open Annotation                 18
                     AAHEP5,   22nd   of September 2011, Ithaca, NY
Segments of Resources: IETF Fragment URIs


URI Fragments are a syntax for creating subsidiary URIs that
identify part of the main resource

The syntax is defined per media type:

    •  X/HTML:   The named anchor or identified element

    •  XML:       An XPointer to the element(s)

    •  PDF:       Many options, especially page and viewrect

    •  Plain Text: Either by character position or line position




                                      Open Annotation                 19
                     AAHEP5,   22nd   of September 2011, Ithaca, NY
Segments of Resources: W3C Media Fragments

Media Fragments allow anyone to create URIs that identify part of
an image, audio or video resource.

The most common case is for rectangular areas of images:
    •  http://www.example.org/image.jpg#xywh=50,100,640,480

Link to the full resource as well, for all Fragment URIs




                                      Open Annotation                 20
                     AAHEP5,   22nd   of September 2011, Ithaca, NY
Media Fragments Example




                  Open Annotation                 21
 AAHEP5,   22nd   of September 2011, Ithaca, NY
Complex Constraints
Fragment URIs are not always possible
    •  Introduce a Constraint that describes the segment of interest
    •  And a ConstrainedTarget that identifies the segment of interest
    •  Constraints are entire resources, so can be more expressive
    •  Constraints may also describe 'contextual' information




                                         Open Annotation                 22
                        AAHEP5,   22nd   of September 2011, Ithaca, NY
Constraint Example




                 Open Annotation                 23
AAHEP5,   22nd   of September 2011, Ithaca, NY
Constrained Body

The Body may also be constrained in the same way as Targets




                                    Open Annotation                 24
                   AAHEP5,   22nd   of September 2011, Ithaca, NY
Web-Centric Annotation: No Persistence




Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14

                                     Open Annotation                     25



                                                                     2
                    AAHEP5,   22nd   of September 2011, Ithaca, NY
Web-Centric Annotation: No Annotations




                       Archived page from:
http://www.dracos.co.uk/work/bbc-news-archive/2010/03/08/07.05.html
                                   Open Annotation                     26



                                                                   2
                  AAHEP5,   22nd   of September 2011, Ithaca, NY
Web-Centric Annotation: Desired Cross-Linking




                            Open Annotation                     27



                                                            2
           AAHEP5,   22nd   of September 2011, Ithaca, NY
Annotations and Time

There are three different types of Annotation with respect to Time:

   •  Timeless Annotations
        •  These are always relevant, regardless of the current
        state of the resources. This is the base model.

   •  Uniform Annotations
        •  There is a single timestamp at which all of the resources
        should be considered.

   •  Varied Annotations
        •  Each of the resources (Body, Targets) should be
        considered at a different time.




                                     Open Annotation                     28



                                                                     2
                    AAHEP5,   22nd   of September 2011, Ithaca, NY
Uniform Annotations

If the same time is applicable to all resources, we attach it to the
Annotation using the oac:when predicate.




                                      Open Annotation                     29



                                                                      2
                     AAHEP5,   22nd   of September 2011, Ithaca, NY
Uniform Annotation Example




                   Open Annotation                     30



                                                   3
  AAHEP5,   22nd   of September 2011, Ithaca, NY
Varied Annotations

If different timestamps are required for each resource, we use
oac:when from an oac:WebTimeConstraint.




                                     Open Annotation                     31



                                                                     3
                    AAHEP5,   22nd   of September 2011, Ithaca, NY
Varied Annotation Example




                   Open Annotation                     32



                                                   3
  AAHEP5,   22nd   of September 2011, Ithaca, NY
Finding Archived Resources with Memento




                          Open Annotation                     33



                                                          3
         AAHEP5,   22nd   of September 2011, Ithaca, NY
Annotations for/by Machines?


•  The Body consists of one or more Statements
    •  Human understandable: Text, Image, Video, Symbols, …
    •  Machine understandable: Data

•  Humans can infer relationships and context, Machines cannot
    •  Need to be as explicit as possible
    •  Need structured information

•  When would we do this?
   •  Attaching data to another resource
   •  Nano-Publications: publication of data for further processing
   •  Semantic Annotations


                                        Open Annotation                     34



                                                                        3
                       AAHEP5,   22nd   of September 2011, Ithaca, NY
Semantic Annotation: Single Named Entity




                          Open Annotation                     35



                                                          3
         AAHEP5,   22nd   of September 2011, Ithaca, NY
Semantic Annotation: Entity Relationships




                          Open Annotation                     36



                                                          3
         AAHEP5,   22nd   of September 2011, Ithaca, NY
Advantages of this Approach

•  Only uses existing OAC constructions
    •  No new ‘isTopicOf’ or similar shortcut relationships
    •  Creator of Body is not confused with Creator of Annotation!

•  Aligns very closely with human annotation practices

•  Consistent model that scales from resource to part of resource
    •  Can annotate data extracted at most appropriate level
    •  Could extract from sentence/paragraph/section/entire text

•  Consistent model that allows association of any amount of data:
    •  From Single Entity
    •  To scholarly discourse extraction from entire document

                                        Open Annotation                     37



                                                                        3
                       AAHEP5,   22nd   of September 2011, Ithaca, NY
Annotation Protocols

Unlike previous systems, Open
Annotation does not mandate a
protocol.

No reliance on a client/server
combination gives the client
autonomy.

Instead we promote a publish/
subscribe methodology, where
annotations may be stored and
consumed from anywhere.



                              Protocol: publish, subscribe, consume tied together

                                         Open Annotation                     38
                        AAHEP5,   22nd   of September 2011, Ithaca, NY
Publish/Subscribe Method




          We don’t specify how this transfer should occur




publish
                                        Open Annotation                     39



                                                                        3
                       AAHEP5,   22nd   of September 2011, Ithaca, NY
Publish/Subscribe Method




             Nor this.




publish     subscribe
                            Open Annotation                     40



                                                            4
           AAHEP5,   22nd   of September 2011, Ithaca, NY
Publish/Subscribe Method




                                                            Nor this.




publish     subscribe                                       consume
                            Open Annotation                             41



                                                                4
           AAHEP5,   22nd   of September 2011, Ithaca, NY
Publish/Subscribe Advantages

•  Client can use most appropriate method for transferring annotation to
   storage service
    •  May already be specified in certain domains
    •  Can use existing services without requiring direct adoption

•  Annotations are web resources in their own right
    •  Can be protected for limited access using existing technology
    •  Have their own URIs by necessity, not good-will of service

•  Promotes a market-place of services
    •  Archiving Annotations and resources for preservation
    •  Enriching with additional metadata and information
    •  Aggregation and curation to provide trusted annotation feeds

                                        Open Annotation                     42



                                                                        4
                       AAHEP5,   22nd   of September 2011, Ithaca, NY
Demo!

               http://www.shared-canvas.org/impl/demo3/

•  Took the PDF of Dirac’s thesis on Quantum Mechanics and split into
   individual page images
•  Allow transcription by annotation, and commentary by annotation
•  Allows storage at different services, both public and private




                                       Open Annotation                     43



                                                                       4
                      AAHEP5,   22nd   of September 2011, Ithaca, NY
Thank You



                        Robert Sanderson
                            rsanderson@lanl.gov
                            azaroth42@gmail.com
                            @azaroth42




                      Web:            http://www.openannotation.org/
                      Paper:           http://arxiv.org/abs/1003.2643

                      Slides:                    http://slidesha.re/qolpwI




                 Open Annotation                                      44
AAHEP5,   22nd   of September 2011, Ithaca, NY

More Related Content

PDF
British Library Seminar: Shared Canvas (September 2011)
PDF
Update on Memento (IIPC 2011 Plenary)
PDF
SharedCanvas: Collaborative Digital Facsimiles of Medieval Manuscripts
PDF
Multiplicity and Publishing in Open Annotation (tutorial)
PDF
Open Annotation Overview
PDF
Open Annotation; NISO digital bookmarking
PDF
Open Annotation: Bridging the Divide?
PDF
NISO/Internet Archive Meeting on Social Bookmarking and Annotation
British Library Seminar: Shared Canvas (September 2011)
Update on Memento (IIPC 2011 Plenary)
SharedCanvas: Collaborative Digital Facsimiles of Medieval Manuscripts
Multiplicity and Publishing in Open Annotation (tutorial)
Open Annotation Overview
Open Annotation; NISO digital bookmarking
Open Annotation: Bridging the Divide?
NISO/Internet Archive Meeting on Social Bookmarking and Annotation

Viewers also liked (20)

PDF
Python Web Tutorial
PDF
OAC Technical Summary
PDF
Linked Data and Images: Building Blocks for Cultural Heritage
PDF
Open Annotation Core Data Model (tutorial)
PDF
Transactional Archiving (Web Archive Globalization Workshop)
PDF
Parker Keio 2011: Interoperable Manuscript Framework
PPT
8 Panduan Silabus
PDF
Open Repositories 2014: Crowdsourced Transcription via IIIF
PPS
Espolon Oeste
PDF
IIIF: Shared Canvas 2.0
PDF
W3C Web Annotation WG Update (I Annotate 2016)
PDF
IIIF and JSON-LD: LODLAM Training Day
PPTX
Managing Annotations (OR2016)
PPTX
IIIF: The Advantages of APIs
PDF
Annotations as Linked Data with Fedora4 and Triannon
PDF
IIIF: Discovery of Resources
PDF
Annotating Scholarly Works - the W3C Open Annotation Model
PPTX
IIIF Foundational Specifications
PDF
Introduction to IIIF
PDF
OAC Presentation at CNI 09 Fall Forum
Python Web Tutorial
OAC Technical Summary
Linked Data and Images: Building Blocks for Cultural Heritage
Open Annotation Core Data Model (tutorial)
Transactional Archiving (Web Archive Globalization Workshop)
Parker Keio 2011: Interoperable Manuscript Framework
8 Panduan Silabus
Open Repositories 2014: Crowdsourced Transcription via IIIF
Espolon Oeste
IIIF: Shared Canvas 2.0
W3C Web Annotation WG Update (I Annotate 2016)
IIIF and JSON-LD: LODLAM Training Day
Managing Annotations (OR2016)
IIIF: The Advantages of APIs
Annotations as Linked Data with Fedora4 and Triannon
IIIF: Discovery of Resources
Annotating Scholarly Works - the W3C Open Annotation Model
IIIF Foundational Specifications
Introduction to IIIF
OAC Presentation at CNI 09 Fall Forum
Ad

Similar to Open Annotation: Annotating High Energy Physics on the Web (20)

PDF
NISO Annotation Meeting (San Francisco)
PPT
Open Annotation Collaboration Briefing
PDF
Linked Data in Scholarly Communication
PPT
Open Annotation Collaboration Introduction
PPT
Cole using oa-intro-dlf2012
PDF
The aDORe Federation Architecture
PDF
140328 전자출판 기술표준세미나 epub 주석 표준 open_annotation_최두립_v2
PDF
Annotating Scholarly Resources
PDF
Building OBO Foundry ontology using semantic web tools
PDF
20110728 datalift-rpi-troy
PDF
NLLC 2011: Memento, Open Annotation, SharedCanvas
ZIP
Linked Open Data in Libraries, Archives & Museums
PDF
Open Science - Free Science for a free Society
PDF
Semantic Web - Ontology 101
PPT
Collaborative Ontology Building Project
PDF
SHARE Update for CNI, April 2015
PPTX
Evidence of OER Impact
PPTX
Article Impact (ALM and altmetrics)
PDF
FOOPS!: An Ontology Pitfall Scanner for the FAIR principles
PDF
OOR--Open-Ontology-Repository--jun2010
NISO Annotation Meeting (San Francisco)
Open Annotation Collaboration Briefing
Linked Data in Scholarly Communication
Open Annotation Collaboration Introduction
Cole using oa-intro-dlf2012
The aDORe Federation Architecture
140328 전자출판 기술표준세미나 epub 주석 표준 open_annotation_최두립_v2
Annotating Scholarly Resources
Building OBO Foundry ontology using semantic web tools
20110728 datalift-rpi-troy
NLLC 2011: Memento, Open Annotation, SharedCanvas
Linked Open Data in Libraries, Archives & Museums
Open Science - Free Science for a free Society
Semantic Web - Ontology 101
Collaborative Ontology Building Project
SHARE Update for CNI, April 2015
Evidence of OER Impact
Article Impact (ALM and altmetrics)
FOOPS!: An Ontology Pitfall Scanner for the FAIR principles
OOR--Open-Ontology-Repository--jun2010
Ad

More from Robert Sanderson (20)

PPTX
Understanding Linked Art
PDF
LUX - Cross Collections Cultural Heritage at Yale
PDF
Zoom as a Paradigm for Linked Open Usable Data
PPTX
Provenance and Uncertainty in Linked Art
PPTX
Data is our Product: Thoughts on LOD Sustainability
PPTX
A Perspective on Wikidata: Ecosystems, Trust, and Usability
PPTX
Linked Art: Sustainable Cultural Knowledge through Linked Open Usable Data
PPTX
Illusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open Data
PPTX
Structural Metadata in RDF (IS575)
PPTX
Sanderson CNI 2020 Keynote - Cultural Heritage Research Data Ecosystem
PPTX
Tiers of Abstraction and Audience in Cultural Heritage Data Modeling
PPTX
The Importance of being LOUD
PPTX
Introduction to Linked Art Model
PPTX
Standards and Communities: Connected People, Consistent Data, Usable Applicat...
PDF
Strong Opinions, Weakly Held
PDF
IIIF Discovery Walkthrough
PPTX
Linked Art: An Art Museum Profile for CIDOC-CRM
PDF
Euromed2018 Keynote: Usability over Completeness, Community over Committee
PDF
Linked Art - Our Linked Open Usable Data Model
PPTX
EuropeanaTech Keynote: Shout it out LOUD
Understanding Linked Art
LUX - Cross Collections Cultural Heritage at Yale
Zoom as a Paradigm for Linked Open Usable Data
Provenance and Uncertainty in Linked Art
Data is our Product: Thoughts on LOD Sustainability
A Perspective on Wikidata: Ecosystems, Trust, and Usability
Linked Art: Sustainable Cultural Knowledge through Linked Open Usable Data
Illusions of Grandeur: Trust and Belief in Cultural Heritage Linked Open Data
Structural Metadata in RDF (IS575)
Sanderson CNI 2020 Keynote - Cultural Heritage Research Data Ecosystem
Tiers of Abstraction and Audience in Cultural Heritage Data Modeling
The Importance of being LOUD
Introduction to Linked Art Model
Standards and Communities: Connected People, Consistent Data, Usable Applicat...
Strong Opinions, Weakly Held
IIIF Discovery Walkthrough
Linked Art: An Art Museum Profile for CIDOC-CRM
Euromed2018 Keynote: Usability over Completeness, Community over Committee
Linked Art - Our Linked Open Usable Data Model
EuropeanaTech Keynote: Shout it out LOUD

Recently uploaded (20)

PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
cuic standard and advanced reporting.pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
Encapsulation theory and applications.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
Electronic commerce courselecture one. Pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
KodekX | Application Modernization Development
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
20250228 LYD VKU AI Blended-Learning.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
MYSQL Presentation for SQL database connectivity
cuic standard and advanced reporting.pdf
Machine learning based COVID-19 study performance prediction
Encapsulation theory and applications.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
NewMind AI Monthly Chronicles - July 2025
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
A Presentation on Artificial Intelligence
Electronic commerce courselecture one. Pdf
Empathic Computing: Creating Shared Understanding
NewMind AI Weekly Chronicles - August'25 Week I
KodekX | Application Modernization Development
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Encapsulation_ Review paper, used for researhc scholars
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?

Open Annotation: Annotating High Energy Physics on the Web

  • 1. Open Annotation: Annotating Scholarly Communication on the Web Robert Sanderson rsanderson@lanl.gov Los Alamos National Laboratory @azaroth42 Herbert Van de Sompel herbertv@lanl.gov Los Alamos National Laboratory @hvdsomp http://www.openannotation.org/ This research is funded, in part, by the Andrew W. Mellon Foundation Open Annotation 1 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 2. Overview •  Motivation •  Open Annotation Model •  Basics •  Segments •  Resources Changing over Time •  Machine Annotations •  Network Model •  Quick Demo Open Annotation 2 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 3. Scholarly Communication Scholarly Communication is increasingly: •  Online •  Open •  Distributed •  Collaborative •  Data-Oriented Annotation is a scholarly primitive, spanning discipline and level. Need to ensure that Digital Annotations fall under these headings! •  Apply the standards and architecture of the World Wide Web to the Annotation use case. •  Even if scholar doesn’t share annotations with others, she will want to access them from different tools and environments. Open Annotation 3 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 4. Open Annotation Collaboration http://www.openannotation.org/ Focus on interoperable sharing of annotations •  Web-centric and open, not locked down silos •  Create, consume and interact in different environments •  Build from a simple model for simple cases, to more detailed for complex scholarly annotation requirements The Collaboration: •  Los Alamos National Laboratory •  University of Illinois at Urbana-Champaign •  University of Queensland •  University of Maryland •  George Mason University •  … plus many adopters and partners Open Annotation 4 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 5. Open Annotation Data Model Design Guidelines: •  Based on the Architecture of the World Wide Web •  … and on Linked Open Data •  Should be general enough for ease of adoption •  … and rich enough to cover scholarly use cases Status: Beta, with 9 ongoing funded experiments to inform 1.0 Hardest part: Define what an Annotation is! •  "Aboutness" is key to distinguish from general metadata A document that describes how one resource is about one or more other resources, or part(s) thereof. Open Annotation 5 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 6. Basic Model The basic model has three resources: •  Annotation (an RDF document) •  Body (the ‘comment’ of the annotation) •  Target (the resource the Body is ‘about’) Open Annotation 6 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 7. Basic Model Example Open Annotation 7 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 8. Additional Relationships and Properties Any of the resources can have additional information attached, such as creator, date of creation, title, etc. Open Annotation 8 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 9. Additional Properties Example Open Annotation 9 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 10. Annotation Types There can be further types of Annotation, such as a Reply. Example: Replies are Annotations on Annotations. Open Annotation 10 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 11. Annotation Types Example Open Annotation 11 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 12. Inline Information It is important to be able to have content contained within the Annotation document for Client Autonomy: •  Clients may be unable to mint new URIs for every resource •  Clients may wish to transmit only a single document •  Third parties can generate new URIs if the client does not The W3C has a Content in RDF specification: •  http://www.w3.org/TR/Content-in-RDF10/ Open Annotation 12 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 13. Inline Information: Body •  We introduce a resource identified by a non resolvable URI, such as a UUID URN, as the Body. •  We then embed the data within the Annotation document using 'chars’ from Content in RDF. Open Annotation 13 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 14. Inline Body Example Open Annotation 14 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 15. Multiple Targets There are many use cases for multiple targets for an Annotation: •  Comparison of two or more resources •  Making a statement that applies to all of the resources •  Making a statement about multiple parts of a resource The OAC Data Model allows for multiple targets by simply having more than one hasTarget relationship. Open Annotation 15 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 16. Multiple Targets Example Open Annotation 16 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 17. Segments of Resources Most annotations are about part of a resource Different segments for different media types: •  Text: paragraph, arbitrary span of words •  Image: rectangular or arbitrary shaped area •  Audio: start and end time points, track name/number •  Video: area and time points •  Other: slice of a data set, volume in a 3d object, … Open Annotation 17 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 18. Segments of Resources Web Architecture Segmentation: •  A URI with a Fragment identifies part of the resource: •  IETF Mime-type fragment identifiers; eg xpointer •  W3C Media Fragments URI specification for simple segments of media: http://www.w3.org/TR/media-frags/ We introduce a method of constraining resources: •  Introduce an approach for arbitrarily complex segments that cannot be expressed using Fragments •  Can be applied to Body or Target resource Open Annotation 18 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 19. Segments of Resources: IETF Fragment URIs URI Fragments are a syntax for creating subsidiary URIs that identify part of the main resource The syntax is defined per media type: •  X/HTML: The named anchor or identified element •  XML: An XPointer to the element(s) •  PDF: Many options, especially page and viewrect •  Plain Text: Either by character position or line position Open Annotation 19 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 20. Segments of Resources: W3C Media Fragments Media Fragments allow anyone to create URIs that identify part of an image, audio or video resource. The most common case is for rectangular areas of images: •  http://www.example.org/image.jpg#xywh=50,100,640,480 Link to the full resource as well, for all Fragment URIs Open Annotation 20 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 21. Media Fragments Example Open Annotation 21 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 22. Complex Constraints Fragment URIs are not always possible •  Introduce a Constraint that describes the segment of interest •  And a ConstrainedTarget that identifies the segment of interest •  Constraints are entire resources, so can be more expressive •  Constraints may also describe 'contextual' information Open Annotation 22 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 23. Constraint Example Open Annotation 23 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 24. Constrained Body The Body may also be constrained in the same way as Targets Open Annotation 24 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 25. Web-Centric Annotation: No Persistence Google Sidewiki Annotation on http://news.bbc.co.uk/ as of 2010-06-14 Open Annotation 25 2 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 26. Web-Centric Annotation: No Annotations Archived page from: http://www.dracos.co.uk/work/bbc-news-archive/2010/03/08/07.05.html Open Annotation 26 2 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 27. Web-Centric Annotation: Desired Cross-Linking Open Annotation 27 2 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 28. Annotations and Time There are three different types of Annotation with respect to Time: •  Timeless Annotations •  These are always relevant, regardless of the current state of the resources. This is the base model. •  Uniform Annotations •  There is a single timestamp at which all of the resources should be considered. •  Varied Annotations •  Each of the resources (Body, Targets) should be considered at a different time. Open Annotation 28 2 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 29. Uniform Annotations If the same time is applicable to all resources, we attach it to the Annotation using the oac:when predicate. Open Annotation 29 2 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 30. Uniform Annotation Example Open Annotation 30 3 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 31. Varied Annotations If different timestamps are required for each resource, we use oac:when from an oac:WebTimeConstraint. Open Annotation 31 3 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 32. Varied Annotation Example Open Annotation 32 3 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 33. Finding Archived Resources with Memento Open Annotation 33 3 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 34. Annotations for/by Machines? •  The Body consists of one or more Statements •  Human understandable: Text, Image, Video, Symbols, … •  Machine understandable: Data •  Humans can infer relationships and context, Machines cannot •  Need to be as explicit as possible •  Need structured information •  When would we do this? •  Attaching data to another resource •  Nano-Publications: publication of data for further processing •  Semantic Annotations Open Annotation 34 3 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 35. Semantic Annotation: Single Named Entity Open Annotation 35 3 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 36. Semantic Annotation: Entity Relationships Open Annotation 36 3 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 37. Advantages of this Approach •  Only uses existing OAC constructions •  No new ‘isTopicOf’ or similar shortcut relationships •  Creator of Body is not confused with Creator of Annotation! •  Aligns very closely with human annotation practices •  Consistent model that scales from resource to part of resource •  Can annotate data extracted at most appropriate level •  Could extract from sentence/paragraph/section/entire text •  Consistent model that allows association of any amount of data: •  From Single Entity •  To scholarly discourse extraction from entire document Open Annotation 37 3 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 38. Annotation Protocols Unlike previous systems, Open Annotation does not mandate a protocol. No reliance on a client/server combination gives the client autonomy. Instead we promote a publish/ subscribe methodology, where annotations may be stored and consumed from anywhere. Protocol: publish, subscribe, consume tied together Open Annotation 38 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 39. Publish/Subscribe Method We don’t specify how this transfer should occur publish Open Annotation 39 3 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 40. Publish/Subscribe Method Nor this. publish subscribe Open Annotation 40 4 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 41. Publish/Subscribe Method Nor this. publish subscribe consume Open Annotation 41 4 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 42. Publish/Subscribe Advantages •  Client can use most appropriate method for transferring annotation to storage service •  May already be specified in certain domains •  Can use existing services without requiring direct adoption •  Annotations are web resources in their own right •  Can be protected for limited access using existing technology •  Have their own URIs by necessity, not good-will of service •  Promotes a market-place of services •  Archiving Annotations and resources for preservation •  Enriching with additional metadata and information •  Aggregation and curation to provide trusted annotation feeds Open Annotation 42 4 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 43. Demo! http://www.shared-canvas.org/impl/demo3/ •  Took the PDF of Dirac’s thesis on Quantum Mechanics and split into individual page images •  Allow transcription by annotation, and commentary by annotation •  Allows storage at different services, both public and private Open Annotation 43 4 AAHEP5, 22nd of September 2011, Ithaca, NY
  • 44. Thank You Robert Sanderson rsanderson@lanl.gov azaroth42@gmail.com @azaroth42 Web: http://www.openannotation.org/ Paper: http://arxiv.org/abs/1003.2643 Slides: http://slidesha.re/qolpwI Open Annotation 44 AAHEP5, 22nd of September 2011, Ithaca, NY