SlideShare a Scribd company logo
UKOLN is supported  by: Mind the Gap: Reflections on Data Policies and Practice Dr Liz Lyon,  Director, UKOLN, University of Bath, UK Associate Director, UK Digital Curation Centre JISC/CNI Conference, Edinburgh, July 2010 . This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0
Overview UK Data Policy Context Institutions & open science Data practice today Future landscape Scale and complexity Open and personal Drivers and incentives Challenges & Actions Planning tools Policy Gaps
1. Current Practice Scale, Complexity, Predictive Potential  Continuum of Openness Citizen Science Credentials, Incentives, Rewards Institutional Readiness & Response Data Informatics Capacity & Capability http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/publications.html#november-2009 Open Science at Web-Scale Report
INCREMENTAL Project Scoping study : institution perspective Creating & organising data Storage and access Back-up Preservation Sharing and re-use
“ Departments don’t have  guidelines or norms for personal back-up and researcher procedure, knowledge and diligence varies tremendously. Many have experienced moderate to catastrophic data loss” Incremental Project Report, June 2010 http://www.flickr.com/photos/mattimattila/3003324844/
“ While many researchers are positive about sharing data in principle, they are almost universally reluctant in practice. ..... using these data to publish results before anyone else is the primary way of gaining prestige in nearly all disciplines.” INCREMENTAL Project “ Data sharing was more readily discussed by early career researchers.”
Heather Piwowar … but many researchers don’t share… … and are reluctant to re-use data…
“ They found the documents ....to be dense, wordy, theoretical, ambiguous and un-engaging.” “ Interviewees were often unaware of existing guidance, resources.... and policy documents.” Incremental Project Report, June 2010
“ Many people are suspicious of ‘policies’ which sound like hollow mandates, but are receptive to ‘procedures’ or ‘advice’  which may be essentially the same thing, but convey a sense of purpose and assistance rather than requirement.” Incremental Project Report, June 2010 The majority of people felt that some form of policy or guidance was needed....
2. Future Data Landscape ? Genomics exemplar
...Next next generation technology race to market $1000 genome in <15 minutes ....by 2013?
Researchers need.... Large-scale data storage that is: Cost-effective (rent on-demand) Secure (privacy and IPR) Robust and resilient Low entry barrier / ease-of-use Has data-handling / transfer / analysis capability Cloud services? “ .... analyse an entire human genome in a single day sitting with a laptop at your local Starbucks. ”
The “new” genome informatics ecosystem  The case for cloud computing in genome informatics.  Lincoln D Stein, May 2010 Data storage policy?
Post-genome decade Human genomes: >24  published & almost 200 unpublished
They have shared their data….
Share  my  data Data sharing policy?
“ P4 medicine : Predictive, Personalised, Preventive, Participatory.” Leroy Hood –  Institute for Systems Biology ...“medicine is going to become an information science”... Image from Scientific American
P4 medicine Each patient’s genome sequenced Your genome is basis of your medical record New method to anonymise medical records for genomics research at Vanderbilt Univ (April ‘10) New Predictive models of health and disease Personalised treatments focus on Preventative therapies Genome scale network biology Genomic data as a commodity
Sage Bionetworks : Integrative genomics Open data in the Sage Commons repository Human and mouse: clinical and genetics data Develop predictive models of disease: liver / breast / colon cancer, diabetes, obesity Crowd-sourced effort : global scope Stephen Friend
Participatory medicine : share data & empower the patient... Sage Congress  San Francisco April 2010
“ You have zero privacy anyway. Get over it” Scott McNealy, CEO Sun Microsystems, 1999 Data Ethics & Privacy Policy?  Significant implications for Faculty Awareness of wider societal benefits University Ethics Committee
Results data : validate in professional press Public participation, citizen science
Data policy for public engagement? Faculty attitude & culture Professional : amateur
Calls for action, new metrics Incentives?
Journal  Article Workflow Visualisation Model Data Annotation Concept Macro Attribution granularity Complexity : what are we citing? Micro / Nano
Large-scale predictive network models of disease Multiple datasets Visualise: Cytoscape  Workflow: Taverna Data citation policy?
3. Policy guidance, planning tools, Code of Conduct
State-of-the-Art Report :  Models & Tools  (Alex Ball, June 2010) Data Lifecycles Data Policies (UK) incl DMP Standards & tools Data Asset Framework (DAF)  DANS Seal of Approval Preservation metadata Archive management tools Cost / benefit tools
Data types, formats, standards, capture Ethics and Intellectual Property Access, sharing and re-use Short-term storage & data management Deposit & long-term preservation Adherence and review
http://www.dcc.ac.uk/dmponline   DMP Online Currently updating Version 2.0 Version 3.0 summer 2010
Making DMPs work : the start of a long process… Embed DMPs  in funder policies & research lifecycles as  the norm Code of Conduct for Research Assess & review DMPs  (not just the science content of proposals) Educate reviewers  (DCC guidance  for social science in prep) Manage compliance  of researchers Infrastructure to share  DMPs Analyse cost-benefits  for UK HE
Take homes... Practice is disconnected from policy Policy Gaps Data Storage (& Appraisal: DCC guidance in prep) Data Sharing (& Licensing: DCC guidance in prep) Ethics and Privacy  Citizen Science & Public Engagement Data Citation and Attribution Collaborate with funders to make DMPs work Digital Curation Centre DMP tool & resources www.dcc.ac.uk
Chicago Mart Plaza, 6-8 December 2010 Thank you…

More Related Content

PPT
Acting as Advocate? Seven steps for libraries in the data decade
PPT
Evolution or revolution? The changing data landscape
PPTX
LEARN Conference - How to cost
PPTX
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
PDF
Research Data Management, Challenges and Tools - Per Öster
PPTX
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
PPTX
Open science, open data - FOSTER training, Potsdam
PPTX
The Future of Open Science
Acting as Advocate? Seven steps for libraries in the data decade
Evolution or revolution? The changing data landscape
LEARN Conference - How to cost
LEARN Final Conference: Tutorial Group | Implementing the LEARN RDM Toolkit
Research Data Management, Challenges and Tools - Per Öster
LEARN Final Conference: Tutorial Group | Using the LEARN Model RDM Policy
Open science, open data - FOSTER training, Potsdam
The Future of Open Science

What's hot (20)

PPTX
Why science needs open data – Jisc and CNI conference 10 July 2014
PPTX
Jisc's new shared data centre
PPTX
20160719 23 Research Data Things
PPTX
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
PPTX
20160523 23 Research Data Things
PPTX
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
PPTX
RDM & ELNs @ Edinburgh
PPTX
Tijerina-RDA-NISO-Task Groups-sept11
PPTX
RDM policy and recovering costs
PPTX
Managing and sharing data
PPT
Libraries and Research Data Management – What Works? Lessons Learned from the...
PDF
Poster: Very Open Data Project
PDF
Levine - Data Curation; Ethics and Legal Considerations
PDF
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
PPTX
The Horizon 2020 Open Data Pilot
PDF
Open Data - strategies for research data management & impact of best practices
PDF
Connected health cities
PDF
Open Science Governance and Regulation/Simon Hodson
PPTX
RDM LIASA webinar
PPTX
Data management: The new frontier for libraries
Why science needs open data – Jisc and CNI conference 10 July 2014
Jisc's new shared data centre
20160719 23 Research Data Things
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
20160523 23 Research Data Things
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
RDM & ELNs @ Edinburgh
Tijerina-RDA-NISO-Task Groups-sept11
RDM policy and recovering costs
Managing and sharing data
Libraries and Research Data Management – What Works? Lessons Learned from the...
Poster: Very Open Data Project
Levine - Data Curation; Ethics and Legal Considerations
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
The Horizon 2020 Open Data Pilot
Open Data - strategies for research data management & impact of best practices
Connected health cities
Open Science Governance and Regulation/Simon Hodson
RDM LIASA webinar
Data management: The new frontier for libraries
Ad

Viewers also liked (7)

PPTX
Social Media Association for Business Presentation
ODP
כנסת פתוחה באוגוסט פינגווין
PPT
UK Digital Curation Centre: enabling research data management at the coalface
PPTX
Journalism and Social Media
PPTX
Hybinar hybrid events & cloud video notes
PPT
Introduction to Social Media
PPT
Onwebinar презентация для инвесторов
Social Media Association for Business Presentation
כנסת פתוחה באוגוסט פינגווין
UK Digital Curation Centre: enabling research data management at the coalface
Journalism and Social Media
Hybinar hybrid events & cloud video notes
Introduction to Social Media
Onwebinar презентация для инвесторов
Ad

Similar to Mind the Gap: Reflections on Data Policies and Practice (20)

PPT
Codes, Clouds & Constellations: Open Science in the Data Decade
PDF
Open Access Week - Oxford, 20-24 Oct 2014
PPTX
The purpose, practicalities, pitfalls and policies of managing and sharing da...
PPTX
ischools future of data managemente dec2017
PPTX
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PPT
Human Genome and Big Data Challenges
PPT
Informatics Transform : Re-engineering Libraries for the Data Decade
PPT
Bloomsbury Conference
PDF
Anthony J brookes
PPTX
Managing Your Research Data
PPT
PPTX
Introduction to research data management
PPT
AMIA 2014
PDF
A Data Biosphere for Biomedical Research
PPTX
PSB2014 A Vision for Biomedical Research
PPTX
Managing and Sharing Research Data
PPTX
Digital curation for postgraduate students
PPTX
Research Data Management: a gentle introduction
PPTX
International perspective for sharing publicly funded medical research data
PPT
Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...
Codes, Clouds & Constellations: Open Science in the Data Decade
Open Access Week - Oxford, 20-24 Oct 2014
The purpose, practicalities, pitfalls and policies of managing and sharing da...
ischools future of data managemente dec2017
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
Human Genome and Big Data Challenges
Informatics Transform : Re-engineering Libraries for the Data Decade
Bloomsbury Conference
Anthony J brookes
Managing Your Research Data
Introduction to research data management
AMIA 2014
A Data Biosphere for Biomedical Research
PSB2014 A Vision for Biomedical Research
Managing and Sharing Research Data
Digital curation for postgraduate students
Research Data Management: a gentle introduction
International perspective for sharing publicly funded medical research data
Data Science at NIH and its Relationship to Social Computing, Behavioral-Cult...

Recently uploaded (20)

PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Machine learning based COVID-19 study performance prediction
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Spectroscopy.pptx food analysis technology
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
cuic standard and advanced reporting.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Electronic commerce courselecture one. Pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Empathic Computing: Creating Shared Understanding
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
20250228 LYD VKU AI Blended-Learning.pptx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Understanding_Digital_Forensics_Presentation.pptx
Spectral efficient network and resource selection model in 5G networks
Machine learning based COVID-19 study performance prediction
Advanced methodologies resolving dimensionality complications for autism neur...
Spectroscopy.pptx food analysis technology
Dropbox Q2 2025 Financial Results & Investor Presentation
cuic standard and advanced reporting.pdf
Unlocking AI with Model Context Protocol (MCP)
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Electronic commerce courselecture one. Pdf
Encapsulation_ Review paper, used for researhc scholars
Network Security Unit 5.pdf for BCA BBA.
Building Integrated photovoltaic BIPV_UPV.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Empathic Computing: Creating Shared Understanding
Per capita expenditure prediction using model stacking based on satellite ima...
Big Data Technologies - Introduction.pptx
Digital-Transformation-Roadmap-for-Companies.pptx

Mind the Gap: Reflections on Data Policies and Practice

  • 1. UKOLN is supported by: Mind the Gap: Reflections on Data Policies and Practice Dr Liz Lyon, Director, UKOLN, University of Bath, UK Associate Director, UK Digital Curation Centre JISC/CNI Conference, Edinburgh, July 2010 . This work is licensed under a Creative Commons Licence Attribution-ShareAlike 2.0
  • 2. Overview UK Data Policy Context Institutions & open science Data practice today Future landscape Scale and complexity Open and personal Drivers and incentives Challenges & Actions Planning tools Policy Gaps
  • 3. 1. Current Practice Scale, Complexity, Predictive Potential Continuum of Openness Citizen Science Credentials, Incentives, Rewards Institutional Readiness & Response Data Informatics Capacity & Capability http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/publications.html#november-2009 Open Science at Web-Scale Report
  • 4. INCREMENTAL Project Scoping study : institution perspective Creating & organising data Storage and access Back-up Preservation Sharing and re-use
  • 5. “ Departments don’t have guidelines or norms for personal back-up and researcher procedure, knowledge and diligence varies tremendously. Many have experienced moderate to catastrophic data loss” Incremental Project Report, June 2010 http://www.flickr.com/photos/mattimattila/3003324844/
  • 6. “ While many researchers are positive about sharing data in principle, they are almost universally reluctant in practice. ..... using these data to publish results before anyone else is the primary way of gaining prestige in nearly all disciplines.” INCREMENTAL Project “ Data sharing was more readily discussed by early career researchers.”
  • 7. Heather Piwowar … but many researchers don’t share… … and are reluctant to re-use data…
  • 8. “ They found the documents ....to be dense, wordy, theoretical, ambiguous and un-engaging.” “ Interviewees were often unaware of existing guidance, resources.... and policy documents.” Incremental Project Report, June 2010
  • 9. “ Many people are suspicious of ‘policies’ which sound like hollow mandates, but are receptive to ‘procedures’ or ‘advice’ which may be essentially the same thing, but convey a sense of purpose and assistance rather than requirement.” Incremental Project Report, June 2010 The majority of people felt that some form of policy or guidance was needed....
  • 10. 2. Future Data Landscape ? Genomics exemplar
  • 11. ...Next next generation technology race to market $1000 genome in <15 minutes ....by 2013?
  • 12. Researchers need.... Large-scale data storage that is: Cost-effective (rent on-demand) Secure (privacy and IPR) Robust and resilient Low entry barrier / ease-of-use Has data-handling / transfer / analysis capability Cloud services? “ .... analyse an entire human genome in a single day sitting with a laptop at your local Starbucks. ”
  • 13. The “new” genome informatics ecosystem The case for cloud computing in genome informatics. Lincoln D Stein, May 2010 Data storage policy?
  • 14. Post-genome decade Human genomes: >24 published & almost 200 unpublished
  • 15. They have shared their data….
  • 16. Share my data Data sharing policy?
  • 17. “ P4 medicine : Predictive, Personalised, Preventive, Participatory.” Leroy Hood – Institute for Systems Biology ...“medicine is going to become an information science”... Image from Scientific American
  • 18. P4 medicine Each patient’s genome sequenced Your genome is basis of your medical record New method to anonymise medical records for genomics research at Vanderbilt Univ (April ‘10) New Predictive models of health and disease Personalised treatments focus on Preventative therapies Genome scale network biology Genomic data as a commodity
  • 19. Sage Bionetworks : Integrative genomics Open data in the Sage Commons repository Human and mouse: clinical and genetics data Develop predictive models of disease: liver / breast / colon cancer, diabetes, obesity Crowd-sourced effort : global scope Stephen Friend
  • 20. Participatory medicine : share data & empower the patient... Sage Congress San Francisco April 2010
  • 21. “ You have zero privacy anyway. Get over it” Scott McNealy, CEO Sun Microsystems, 1999 Data Ethics & Privacy Policy? Significant implications for Faculty Awareness of wider societal benefits University Ethics Committee
  • 22. Results data : validate in professional press Public participation, citizen science
  • 23. Data policy for public engagement? Faculty attitude & culture Professional : amateur
  • 24. Calls for action, new metrics Incentives?
  • 25. Journal Article Workflow Visualisation Model Data Annotation Concept Macro Attribution granularity Complexity : what are we citing? Micro / Nano
  • 26. Large-scale predictive network models of disease Multiple datasets Visualise: Cytoscape Workflow: Taverna Data citation policy?
  • 27. 3. Policy guidance, planning tools, Code of Conduct
  • 28. State-of-the-Art Report : Models & Tools (Alex Ball, June 2010) Data Lifecycles Data Policies (UK) incl DMP Standards & tools Data Asset Framework (DAF) DANS Seal of Approval Preservation metadata Archive management tools Cost / benefit tools
  • 29. Data types, formats, standards, capture Ethics and Intellectual Property Access, sharing and re-use Short-term storage & data management Deposit & long-term preservation Adherence and review
  • 30. http://www.dcc.ac.uk/dmponline DMP Online Currently updating Version 2.0 Version 3.0 summer 2010
  • 31. Making DMPs work : the start of a long process… Embed DMPs in funder policies & research lifecycles as the norm Code of Conduct for Research Assess & review DMPs (not just the science content of proposals) Educate reviewers (DCC guidance for social science in prep) Manage compliance of researchers Infrastructure to share DMPs Analyse cost-benefits for UK HE
  • 32. Take homes... Practice is disconnected from policy Policy Gaps Data Storage (& Appraisal: DCC guidance in prep) Data Sharing (& Licensing: DCC guidance in prep) Ethics and Privacy Citizen Science & Public Engagement Data Citation and Attribution Collaborate with funders to make DMPs work Digital Curation Centre DMP tool & resources www.dcc.ac.uk
  • 33. Chicago Mart Plaza, 6-8 December 2010 Thank you…