User engagement in research data curation
Stuart Macdonald, EDINA National Data Centre, University of Edinburgh
Luis Martinez-Uribe, Oxford e-Research Centre, University of Oxford
ECDL, Corfu, 30 September 2009
Data deluge
  • An updated IDC white paper* reported that the digital universe was 281 exabytes in 2007 and forecast that it would reach 1,800 exabytes in 2011 (10 times the amount produced in 2006).
  • BBSRC strategic plan (2010-2015) consultation document
  * “The Diverse and Exploding Digital Universe - an updated forecast of worldwide information growth through 2011” (Mar. 2008) - http://www.emc.com/collateral/analyst-reports/diverse-exploding-digital-universe.pdf
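The growth implied by the IDC figures can be worked out directly. The sketch below is an illustration only: it assumes steady compound growth between 2007 and 2011, which the white paper does not claim, and the function and variable names are invented for this example. Under that assumption the digital universe grows by roughly 59% a year, and the "10 times" comparison implies about 180 exabytes for 2006.

```python
def implied_annual_growth(start_eb: float, end_eb: float, years: int) -> float:
    """Compound annual growth rate between two sizes given in exabytes.

    Assumes steady year-on-year growth, which is a simplification.
    """
    return (end_eb / start_eb) ** (1 / years) - 1


# Figures quoted on the slide (IDC white paper, Mar. 2008).
SIZE_2007_EB = 281
FORECAST_2011_EB = 1800

growth = implied_annual_growth(SIZE_2007_EB, FORECAST_2011_EB, 2011 - 2007)

# "10 times that produced in 2006" implies this 2006 figure.
implied_2006_eb = FORECAST_2011_EB / 10

print(f"Implied annual growth 2007-2011: {growth:.0%}")
print(f"Implied 2006 figure: {implied_2006_eb:.0f} exabytes")
```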
Research data definitions
  • US Office of Management and Budget defines research data as “the recorded factual material commonly accepted in the scientific community as necessary to validate research findings”
  • Words, pictures, numbers, sounds
  • Workflows, methodologies, protocols, standard operating procedures, instrumentation, models, questionnaires, code books, set-up files, algorithms, transcripts
Growing importance of curating research data
  • Research methods experiencing a radical transformation
  • New tools & infrastructures generating research data
  • New ways to use, share and re-use
  “it is becoming increasingly clear that effective and efficient management and reuse of research data will be a key component in the UK knowledge economy in years to come, essential for the efficient conduct of research ….”*
  * JISC (2008) “Identifying the benefits of curating and sharing research data” - http://www.jisc.ac.uk/whatwedo/programmes/digitalrepositories2007/databenefits.aspx
Data deposition and publication
  • Departmental websites
  • Domain-specific repositories
  • Centralised data repositories (UKDA, NERC, MRC)
  • Libraries and computing/IT services within academic institutions working together to develop and customise institutional repositories to curate research data
Institutional Repositories:
  • Open access, built for academic publications
  • Technology led
  • No formal requirements analysis procedures
  • User engagement required to develop systems that will meet researchers’ needs
  • Bottom-up approach to inform top-down thinking
  • Researchers – key user community overlooked
DISC-UK DataShare - legal, cultural, technical issues surrounding the sharing of research data in institutional settings
Barriers to sharing:
  • time taken to prepare datasets for deposit
  • concerns over making data available before full academic exploitation
  • misuse / misinterpretation (journalists, non-academics)
  • loss of ownership, loss of commercial or competitive advantage
  • repositories will cease to exist
  • unwillingness to change working practices
  • uncertainty about IPR and confidentiality
Open data – realism versus altruism
RIN-funded disciplinary case studies
  • Charting individual researchers’ information practices across 7 sub-disciplines of the life sciences - http://www.rin.ac.uk/case-studies
  • DCC / ISSTI (University of Edinburgh)
  • Deployed a range of methodologies and tools, including short-term ethnographic techniques and semi-structured instruments:
      Diaries (x55)
      Face-to-face interviews (x24)
      Cognitive mapping (1 per case)
      Focus groups (1 per case)
Some findings from the RIN disciplinary case studies project:
  • Some disciplines lend themselves more than others to ‘open’ data sharing
  • Research data are varied, specific and complex
  • Data curation and/or sharing only becomes crucial at certain stages of the research lifecycle
  • Feeling that only researchers have the subject knowledge to curate their own data
  • Keen sense of ‘ownership’ and protectiveness towards data
Scoping digital repository services for research data management - http://www.ict.ox.ac.uk/odit/projects/digitalrepository/
Scope requirements for services to manage research data generated by Oxford researchers from a variety of disciplines:
  • Interviews (x37) conducted to learn about data management practices and identify top requirements
  • Workshop (x46) held to complement findings and to gather examples of good practice regarding use of repository services for research data management
  • Consultation with service units (ORA, Data Library, NGS, Oxford Digital Library) to identify gaps in service and validate researchers’ requirements
Scoping digital repository services - top requirements
  • Advice on practical issues related to managing data across their life cycle, incl. data management plans and assistance with formatting
  • Secure storage required for large datasets generated by high-throughput instruments
  • Sustainable & authenticated infrastructure that allows publication and long-term preservation of research data
This work is now followed up by the intra-institutional, JISC-funded Embedding Institutional Data Curation Services in Research (EIDCSR) project - http://eidcsr.oucs.ox.ac.uk/
Tools – Data Audit Framework (DAF) - http://www.data-audit.eu/
  • DAF helps to establish relationships with research communities around the issues of data curation
  • Allows institutions to identify, locate, describe and assess how they are managing their research data
  • Provides information specialists who wish to extend support for research data with a vehicle for engaging with researchers, e.g. through local research data management training
  "staff had numerous comments and suggestions for improvement of data management at different levels indicating an awareness of the issues, even where it had not been made a priority to address" - Edinburgh Data Audit Implementation Project
Summary
  • Repository development distant from current research needs - due to lack of iterative requirements analysis with researchers
  • Open data ethos detached from disciplinary research needs
  • Trusted relationships - dialogue with researchers early in research process
Thank you
stuart.macdonald@ed.ac.uk
[email_address]
All images - Creative Commons, courtesy of Flickr

