SlideShare a Scribd company logo
data sharing and the
polar information commons
                    kaitlin thaney
              program manager, science
                 creative commons


    This presentation is licensed under the CreativeCommons-Attribution-3.0 license.
access is step one

content needs to be legally and
    technically accessible
knowledge?

    journal articles
          data
       ontologies
      annotations
plasmids and cell lines
knowledge?

             journal articles
                 data
               ontologies
              annotations
         plasmids and cell lines

... how to treat? like content? software?
as a means to achieve Open Access
data? not necessarily.

  (it’s complicated)
copyright and databases

  what’s protected? is it legal?

        facts are free

to what extent is there creative
         expression?
the data “rights” conundrum...
©
“creative expression”
is it creative?
is it creative?
is it creative?
category errors
the problem of...
   Non-Commercial


   for data
Non-Commercial


what’s a commercial use
   of the data web?
the problem of...
  Share Alike


   for data
1854
issue of license proliferation

   whatever you do to the least of the
databases, you do to the integrated system

       (the most restrictive wins)

    risk for unintended consequences
the problem of...
   Attribution


   for data
Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information Commons
the problem of...
  any license

   for data
national law / jurisdiction-based
            hurdles

             sui generis,
        “sweat of the brow”
          Crown copyright
           “level of skill”

how internat’l data sharing efforts
          are affected?
attribution vs. citation

which one applies? which is best fit?
      what’s the difference?


 “credit where credit is due”
attribution:
             (legal entity)

   “triggered by making of a copy”
         does it apply to facts?
how to attribute? (papers, ontologies, data)

      “in a manner specified by ...”
           attribution stacking
citation:
(gentle(wo)man’s club)

    legal requirement?
     interoperability?
credit where credit is due
entrenched scientific norm
we shouldn’t use the law to make it
   hard to do the wrong thing ...
need for a legally accurate and
              simple solution

reducing or eliminating the need to make the
       distinction of what’s protected

requires modular, standards based approach
                  to licensing
Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information Commons
... must promote legal predictability and certainty.

             ... must be easy to use and understand.

... must impose the lowest possible transaction costs on
                         users.

full text:
http://sciencecommons.org/projects/publishing/open-access-data-protocol/
norms approach

  set of principles (not license)

open, accessible, interoperable

  create legal zones of certainty
calls for data providers to waive all rights
necessary for data extraction and re-use

  requires provider place no additional
    obligations (like share-alike) to limit
              downstream use

 request behavior (like attribution) through
        norms and terms of use
Creating norms for polar data

1. How to preserve the source information? How should the user or
   copier preserve the provenance of the data set. What can be required
   by PIC that is locally relevant and acceptable? DOIs? Something like a
   notice inside the data? Ping to a URL at PIC? RDFa inside a section of
   every database that is provided by PIC?
2. How to cite the data set? Many examples out there including
   http://ipydis.org/data/citations.html
3. How to preserve quality standards? Perhaps we leave it up to the
   users?
4. How to note and release user contributions, mashups, repurposing?
   Do we need release guidelines of contributions, annotations, etc. to
   data sets. How to reward and track individual contributions to a
   collective - trackback, user accounts, etc.? A simple “share alike”
   request?
Some draft norms of appropriate scientific
behavior when using PIC data

• Acknowledge the source of the data in accordance with the wishes of the provider,
  and explicitly cite the data when they are used in formal scientific publication (http://
  ipydis.org/data/citations.html).
• Maintain a link to the original information in any derived products, ideally through a
  persistent identifier, such as a Digital Object Identifier.
• Understanding that the data are made available “as is” and the accuracy of the data
  or documentation are not guaranteed. The provider assumes no responsibility for
  misuse or misinterpretation.
• Notify the data provider in the manner they describe on how you plan to use the
  data. For projects integrally dependent on the data consider requesting
  collaboration and/or co-authorship from the provider.
• Share any derived products in the PIC.
• Agree to IPY Data Policy



                                                                                             37
others?
Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information Commons
5.4 million bibliographic records
Data Sharing and the Polar Information Commons
at best, we’re partially right.

at worst, we’re really wrong.
data without structure and annotation is a
            lost opportunity.

data should flow in an open, public, and
        extensible infrastructure

support recombination and reconfiguration
into computer models, queryable by search
                engine

        treated as public good
resist the temptation to treat
              as property

embrace the potential to treat instead
      as a network resource

More Related Content

PPT
Sible 09
PDF
Channeling insights to the right people
PDF
Information Architecture Primer - Integrating search,tagging, taxonomy and us...
PPTX
Linked Data and Users in Library - Does the library communicate efficiently?
PDF
The open semantic enterprise enterprise data meets web data
PDF
Findability Primer by Information Architected - the IA Primer Series
PDF
SemTech West 2011 - Digital Provenance
PPTX
Information Security in Big Data : Privacy and Data Mining
Sible 09
Channeling insights to the right people
Information Architecture Primer - Integrating search,tagging, taxonomy and us...
Linked Data and Users in Library - Does the library communicate efficiently?
The open semantic enterprise enterprise data meets web data
Findability Primer by Information Architected - the IA Primer Series
SemTech West 2011 - Digital Provenance
Information Security in Big Data : Privacy and Data Mining

What's hot (20)

PDF
Transcript #4 fair -R for Reusable
PPTX
VoID: Metadata for RDF Datasets
PPT
Information security in big data -privacy and data mining
PPT
Digital library and metadata
PPTX
A Return on Investment: Making the data work harder
PDF
Modeling and Detection of Data Leakage Fraud
PPTX
PPTX
A Look at CESSDA and Data Re-use Licenses
PDF
Big Data Meets Privacy:De-identification Maturity Model for Benchmarking and ...
PPTX
Security issues associated with big data in cloud
PPTX
Fair data principles for AOASG
DOC
Transcript - DOIs to support citation of grey literature
PDF
"Cool" metadata for FAIR data
PDF
Ai and applications in the legal domain studium generale maastricht 20191101
PPTX
LIBER Webinar: Are the FAIR Data Principles really fair?
PPTX
Komatsoulis internet2 executive track
PDF
Legal tech Alliance Workshop 20191029
PPT
CESSI Digital Library Case Study Eng
PPT
Technical Developments within the UK Access Management Federation
PDF
Why should semantic technologies pay more attention to privacy... and vice-ve...
Transcript #4 fair -R for Reusable
VoID: Metadata for RDF Datasets
Information security in big data -privacy and data mining
Digital library and metadata
A Return on Investment: Making the data work harder
Modeling and Detection of Data Leakage Fraud
A Look at CESSDA and Data Re-use Licenses
Big Data Meets Privacy:De-identification Maturity Model for Benchmarking and ...
Security issues associated with big data in cloud
Fair data principles for AOASG
Transcript - DOIs to support citation of grey literature
"Cool" metadata for FAIR data
Ai and applications in the legal domain studium generale maastricht 20191101
LIBER Webinar: Are the FAIR Data Principles really fair?
Komatsoulis internet2 executive track
Legal tech Alliance Workshop 20191029
CESSI Digital Library Case Study Eng
Technical Developments within the UK Access Management Federation
Why should semantic technologies pay more attention to privacy... and vice-ve...
Ad

Similar to Data Sharing and the Polar Information Commons (20)

PDF
Data Sharing: Social and Normative - ISWC
PDF
Knowledge Sharing in the Sciences
PDF
Knowledge Sharing in the Sciences - 8JPL
PPT
Data Sharing & Data Citation
PDF
Dataverse as a FAIR Data Repository (Mercè Crosas)
PDF
08.03.17 licensing research data for reuse
PPTX
Data Exchange, Data Citation: An overview of some community work
PPTX
Data Exchange, Data Citation: An overview of some community work
PDF
Sharing Scientific Data: Legal, Normative and Social Issues
PDF
Data Policy for Open Science
PDF
Data Policy for Open Science
PDF
Levine - Data Curation; Ethics and Legal Considerations
PDF
Open Research Data: Licensing | Standards | Future
PDF
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
PPT
Overview of Emerging Requirements for Data Management of Federally Funded Res...
PDF
Dataverse in the Universe of Data by Christine L. Borgman
PDF
Data accessibilityandchallenges
PDF
Cal Poly - Data Management: Who knew it was a hot topic?
PPT
What does open science mean? A stakeholder perspective
PPTX
Shareable by Design: Making Better Use of your Research
Data Sharing: Social and Normative - ISWC
Knowledge Sharing in the Sciences
Knowledge Sharing in the Sciences - 8JPL
Data Sharing & Data Citation
Dataverse as a FAIR Data Repository (Mercè Crosas)
08.03.17 licensing research data for reuse
Data Exchange, Data Citation: An overview of some community work
Data Exchange, Data Citation: An overview of some community work
Sharing Scientific Data: Legal, Normative and Social Issues
Data Policy for Open Science
Data Policy for Open Science
Levine - Data Curation; Ethics and Legal Considerations
Open Research Data: Licensing | Standards | Future
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Overview of Emerging Requirements for Data Management of Federally Funded Res...
Dataverse in the Universe of Data by Christine L. Borgman
Data accessibilityandchallenges
Cal Poly - Data Management: Who knew it was a hot topic?
What does open science mean? A stakeholder perspective
Shareable by Design: Making Better Use of your Research
Ad

More from Kaitlin Thaney (20)

PDF
Megaphones to (No)where: On Sustaining Change
PDF
Lessons in Resilience - International Women's Day Keynote @ Brooklyn College
PDF
Building Capacity for Open Science
PDF
Fueling the Open Movement - Compute Midwest
PDF
Shifting Scientific Practice - ORCID 2015
PDF
Mozilla Science Lab 101
PDF
Building capacity for open science - COASP Meeting
PDF
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
PDF
Leveraging the power of the web - Open Repositories 2015
PDF
Building capacity for open, data-driven science - Grand Rounds
PDF
National Data Integrity Conference - Making the web work for science
PDF
Capturing Contribution - ARCS
PDF
Making the web work for science - RIT Dean's Lecture Series
PDF
Piloting Contributorship Badges for Science
PDF
"Designing for Truth, Scale and Sustainability" - WSSSPE2 Keynote
PDF
"Making the Web Work for Science" - NCI CBIIT
PDF
"Building Capacity for Open Research" - AAMC
PDF
Making the web work for science - eResearch nz
PDF
Making the web work for science - University of Queensland
PDF
Discoverability and Web-Enabled Science - #ScholarAfrica
Megaphones to (No)where: On Sustaining Change
Lessons in Resilience - International Women's Day Keynote @ Brooklyn College
Building Capacity for Open Science
Fueling the Open Movement - Compute Midwest
Shifting Scientific Practice - ORCID 2015
Mozilla Science Lab 101
Building capacity for open science - COASP Meeting
Leveraging the power of the web - Rocky Mountain Advanced Computing Conference
Leveraging the power of the web - Open Repositories 2015
Building capacity for open, data-driven science - Grand Rounds
National Data Integrity Conference - Making the web work for science
Capturing Contribution - ARCS
Making the web work for science - RIT Dean's Lecture Series
Piloting Contributorship Badges for Science
"Designing for Truth, Scale and Sustainability" - WSSSPE2 Keynote
"Making the Web Work for Science" - NCI CBIIT
"Building Capacity for Open Research" - AAMC
Making the web work for science - eResearch nz
Making the web work for science - University of Queensland
Discoverability and Web-Enabled Science - #ScholarAfrica

Recently uploaded (20)

PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Complications of Minimal Access Surgery at WLH
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Basic Mud Logging Guide for educational purpose
PPTX
Cell Types and Its function , kingdom of life
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PPTX
Pharma ospi slides which help in ospi learning
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Renaissance Architecture: A Journey from Faith to Humanism
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Complications of Minimal Access Surgery at WLH
102 student loan defaulters named and shamed – Is someone you know on the list?
Basic Mud Logging Guide for educational purpose
Cell Types and Its function , kingdom of life
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
Module 4: Burden of Disease Tutorial Slides S2 2025
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Final Presentation General Medicine 03-08-2024.pptx
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Pharma ospi slides which help in ospi learning
Week 4 Term 3 Study Techniques revisited.pptx
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Anesthesia in Laparoscopic Surgery in India
O7-L3 Supply Chain Operations - ICLT Program
FourierSeries-QuestionsWithAnswers(Part-A).pdf

Data Sharing and the Polar Information Commons

  • 1. data sharing and the polar information commons kaitlin thaney program manager, science creative commons This presentation is licensed under the CreativeCommons-Attribution-3.0 license.
  • 2. access is step one content needs to be legally and technically accessible
  • 3. knowledge? journal articles data ontologies annotations plasmids and cell lines
  • 4. knowledge? journal articles data ontologies annotations plasmids and cell lines ... how to treat? like content? software?
  • 5. as a means to achieve Open Access
  • 6. data? not necessarily. (it’s complicated)
  • 7. copyright and databases what’s protected? is it legal? facts are free to what extent is there creative expression?
  • 8. the data “rights” conundrum...
  • 14. the problem of... Non-Commercial for data
  • 15. Non-Commercial what’s a commercial use of the data web?
  • 16. the problem of... Share Alike for data
  • 17. 1854
  • 18. issue of license proliferation whatever you do to the least of the databases, you do to the integrated system (the most restrictive wins) risk for unintended consequences
  • 19. the problem of... Attribution for data
  • 22. the problem of... any license for data
  • 23. national law / jurisdiction-based hurdles sui generis, “sweat of the brow” Crown copyright “level of skill” how internat’l data sharing efforts are affected?
  • 24. attribution vs. citation which one applies? which is best fit? what’s the difference? “credit where credit is due”
  • 25. attribution: (legal entity) “triggered by making of a copy” does it apply to facts? how to attribute? (papers, ontologies, data) “in a manner specified by ...” attribution stacking
  • 26. citation: (gentle(wo)man’s club) legal requirement? interoperability? credit where credit is due entrenched scientific norm
  • 27. we shouldn’t use the law to make it hard to do the wrong thing ...
  • 28. need for a legally accurate and simple solution reducing or eliminating the need to make the distinction of what’s protected requires modular, standards based approach to licensing
  • 33. ... must promote legal predictability and certainty. ... must be easy to use and understand. ... must impose the lowest possible transaction costs on users. full text: http://sciencecommons.org/projects/publishing/open-access-data-protocol/
  • 34. norms approach set of principles (not license) open, accessible, interoperable create legal zones of certainty
  • 35. calls for data providers to waive all rights necessary for data extraction and re-use requires provider place no additional obligations (like share-alike) to limit downstream use request behavior (like attribution) through norms and terms of use
  • 36. Creating norms for polar data 1. How to preserve the source information? How should the user or copier preserve the provenance of the data set. What can be required by PIC that is locally relevant and acceptable? DOIs? Something like a notice inside the data? Ping to a URL at PIC? RDFa inside a section of every database that is provided by PIC? 2. How to cite the data set? Many examples out there including http://ipydis.org/data/citations.html 3. How to preserve quality standards? Perhaps we leave it up to the users? 4. How to note and release user contributions, mashups, repurposing? Do we need release guidelines of contributions, annotations, etc. to data sets. How to reward and track individual contributions to a collective - trackback, user accounts, etc.? A simple “share alike” request?
  • 37. Some draft norms of appropriate scientific behavior when using PIC data • Acknowledge the source of the data in accordance with the wishes of the provider, and explicitly cite the data when they are used in formal scientific publication (http:// ipydis.org/data/citations.html). • Maintain a link to the original information in any derived products, ideally through a persistent identifier, such as a Digital Object Identifier. • Understanding that the data are made available “as is” and the accuracy of the data or documentation are not guaranteed. The provider assumes no responsibility for misuse or misinterpretation. • Notify the data provider in the manner they describe on how you plan to use the data. For projects integrally dependent on the data consider requesting collaboration and/or co-authorship from the provider. • Share any derived products in the PIC. • Agree to IPY Data Policy 37
  • 45. at best, we’re partially right. at worst, we’re really wrong.
  • 46. data without structure and annotation is a lost opportunity. data should flow in an open, public, and extensible infrastructure support recombination and reconfiguration into computer models, queryable by search engine treated as public good
  • 47. resist the temptation to treat as property embrace the potential to treat instead as a network resource