The purpose, practicalities, pitfalls
and policies of managing and
sharing data in the UK
AAMG-CICAG Measurement,
Information and Innovation meeting
20 October 2015
Dr Danny Kingsley
Can we cover this in 15 minutes
(allowing 5 min for questions)?
• UK policy landscape
• Places to share data
• What are we trying to achieve?
• Let’s start at the beginning
• Basics of Research Data Management
• Issues with sharing (or not) data
The data policy landscape
Lots of slightly different rules in the UK
Policies
• Funder
– RCUK Common Principles on Data Policy
• Government
– Draft Concordat on Open Research Data released by the RCUK
for consultation, which ended on 28 September
• http://www.rcuk.ac.uk/research/opendata/
– Cambridge coordinated a joint response with other universities
• https://unlockingresearch.blog.lib.cam.ac.uk/?p=285
• Publishers
• Institutional
– Cambridge University Research Data Management Policy
Framework. http://www.data.cam.ac.uk/university-policy
RCUK Common Principles on Data
–“Publicly funded research data are
a public good (…), which should be
made openly available with as few
restrictions as possible”
–http://www.rcuk.ac.uk/research/datapolicy/
The principles might be common…
What the researcher hears
From Bill Hubbard Getting the rights right: when policies collide
http://www.slideshare.net/UKSG/hubbard-uksg-may2015-public
Places to share data
There are lots of options
Open repositories
• (some are free, some charge)
Discipline-specific repositories
• Gene Expression Omnibus
– Public functional genomics data repository
• http://www.ncbi.nlm.nih.gov/geo/
• arXiv
– e-prints in Physics, Mathematics, Computer Science, Quantitative
Biology, Quantitative Finance and Statistics
• http://arxiv.org/
• Oxford Text Archive
– Literary and linguistic texts for higher education
• http://ota.ox.ac.uk/
• UK Data Service
– Social science data
• http://ukdataservice.ac.uk/
• Natural Environment Research Council (NERC) run 7 repositories
• http://www.nerc.ac.uk/research/sites/data/
Journals
• Either as supplementary data, or in data-only
journals
– PLOS data sharing policy (Dec 2013)
• https://www.plos.org/plos-data-policy-faq/
– Nature’s journal Scientific Data
• http://www.nature.com/sdata/about
We are a long way from there
So what’s it all about then?
What are we actually trying to
achieve with open data policies?
In conversation with Ben Ryan EPSRC
• Please share:
– the data that underpins publications
– the data that validates research findings
– the data that is worth keeping
• The default position is ‘data should be open’
• Published research findings should be testable
• Maximise the impact of publicly funded research
• Maintain public trust in science and research
• They are trying to create a new research culture
• https://unlockingresearch.blog.lib.cam.ac.uk/?p=151
Responses to data sharing policies
• What’s the minimum we can get away with?
• This is crap
• ‘They’ are just doing this because ‘they’ can
• But it will take a huge effort to get the data in
a useable form
• No-one will look at it
• What a waste of time
Data excuse bingo
We are trying to start at the end
We should begin at the beginning - a
stitch in time and all that…
In conversation with Michael Ball BBSRC
• Disciplines themselves must establish ways of
dealing with data
– This is the beginning of an ongoing process
• Researchers need to consider how to deal
with data from the beginning of a research
project
• You can ask for money to manage data in the
grant application
• https://unlockingresearch.blog.lib.cam.ac.uk/?p=337
Research data management
• The practice of sharing data requires the data
to be:
– Accessible
– Intelligible
– Assessable
– Reusable
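As a rough illustration of the four properties above, a dataset description could be checked for gaps before deposit. This is only a sketch: the mapping from each property to record fields, the field names and the example record are all hypothetical and are not taken from any funder or repository requirement.

```python
# Hypothetical mapping from each property to the record fields that support it.
# The field names are assumptions for illustration, not a repository schema.
REQUIRED_FIELDS = {
    "accessible": ["identifier", "location"],
    "intelligible": ["title", "description"],
    "assessable": ["methods", "creators"],
    "reusable": ["licence", "file_format"],
}


def missing_fields(record):
    """Return {property: [absent or empty field names]} for a dataset record."""
    gaps = {}
    for prop, fields in REQUIRED_FIELDS.items():
        absent = [f for f in fields if not str(record.get(f, "")).strip()]
        if absent:
            gaps[prop] = absent
    return gaps


# Made-up example record with an empty description and no licence.
example = {
    "identifier": "doi:10.0000/example",  # placeholder, not a real DOI
    "location": "institutional repository",
    "title": "Example measurements",
    "description": "",
    "methods": "See README.txt",
    "creators": "A. Researcher",
    "file_format": "CSV",
}

print(missing_fields(example))
# {'intelligible': ['description'], 'reusable': ['licence']}
```

A check like this only shows that something has been written in each field; whether the data really are intelligible or assessable still needs a human reader.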
Some of it is really obvious
• How many of you:
– Use a file naming protocol?
– Ensure all your laptops are backed up?
– Have written a data management plan for your
current project?
– Have determined who in the team owns the data?
• PS: this last one REALLY matters
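For the first question, a file naming protocol can be as simple as one agreed pattern plus a helper that applies it. The sketch below assumes a hypothetical pattern of date, project, description and version (YYYYMMDD_project_description_vNN.ext); the pattern and the function names are illustrative, not a recommended standard.

```python
import re
from datetime import date

# Hypothetical pattern: YYYYMMDD_project_description_vNN.ext
NAME_PATTERN = re.compile(r"^\d{8}_[a-z0-9]+_[a-z0-9-]+_v\d{2}\.[a-z0-9]+$")


def slug(text):
    """Lower-case text and replace runs of other characters with a hyphen."""
    return re.sub(r"[^a-z0-9]+", "-", text.lower()).strip("-")


def build_filename(day, project, description, version, extension):
    """Compose a file name following the hypothetical pattern above."""
    return "{}_{}_{}_v{:02d}.{}".format(
        day.strftime("%Y%m%d"),
        slug(project).replace("-", ""),
        slug(description),
        version,
        extension.lower().lstrip("."),
    )


def follows_protocol(name):
    """Report whether an existing file name matches the pattern."""
    return NAME_PATTERN.match(name) is not None


name = build_filename(date(2015, 10, 20), "Spectra", "Raw readings batch 2", 1, "csv")
print(name)                                            # 20151020_spectra_raw-readings-batch-2_v01.csv
print(follows_protocol(name))                          # True
print(follows_protocol("final_FINAL_v2 (copy).xlsx"))  # False
```

Putting the date first in YYYYMMDD form means an alphabetical listing is also a chronological one, and an explicit version number avoids names like "final_FINAL".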
Skillsets required for managing and
curating data
http://www.dcc.ac.uk/sites/default/files/documents/RDMF/RDMF2/coreSkillsDiagram.gif
Lots of jobs…
Issues with sharing data
Both with sharing and not sharing
Issues raised by researchers
• There is a very real concern that the UK will
become unattractive for collaborations
• Researchers are discussing changing the type of
research being done to reduce the amount of
data being produced
• There is discussion in some circles about whether
applying for EPSRC funding is worth the hassle
Consequences of not sharing data
• Medicine
– Having the data publicly available in two trials of deworming pills
demonstrated that a population-wide deworming program did not improve
school performance
– http://www.buzzfeed.com/bengoldacre/deworming-trials
• Economics
– A study widely cited to justify budget cutting in the US had a mistake in the
calculations which was only revealed when the Excel file was released
– http://www.bloomberg.com/bw/articles/2013-04-18/faq-reinhart-rogoff-and-the-excel-error-that-changed-history
• Physics
– It took 12.5 years to withdraw Jan Hendrik Schön’s work on ‘organic
semiconductors’ because the reviewers were unable to replicate the results
without access to the original data or lab books
– http://www.science20.com/science_20/jan_hendrik_sch%C3%B6n_world_class_physics_fraud_gets_last_laugh_whole_book_about_himself
Questions?
Dr Danny Kingsley
Head of Scholarly Communication
University of Cambridge
Email: dak45@cam.ac.uk
Blog: https://unlockingresearch.blog.lib.cam.ac.uk/
Website: http://osc.cam.ac.uk
Twitter: @dannykay68

