SlideShare a Scribd company logo
Service Integration to Enhance RDM:
RSpace electronic laboratory notebook (ELN) case
study
Stuart Macdonald (RDM Services Coordinator, University of
Edinburgh)
stuart.macdonald@ed.ac.uk
Rory Macneil (CEO, Research Space)
rmacneil@researchspace.com
10th International Digital Curation Conference, London, 10 February 2014
University of Edinburgh RDM Policy
 University of Edinburgh is one of the
first Universities in UK to adopt a
policy for managing research data:
http://www.ed.ac.uk/is/research-
data-policy
 The policy was approved by the
University Court on 16 May 2011.
 It’s acknowledged that this is an
aspirational policy and that
implementation will take some
years.
Policy implementation:
RDM Roadmap Colleagues in IS developed a roadmap to
deliver a suite of services that will meet:
• RDM policy objectives
• the needs of our researchers
Cross-divisional collaboration
 3 Phases (Aug. 2012 – May 2015)
Services already in place:
o Data management planning
o Active working file space = DataStore
o Data publication repository = DataShare
Services in development:
o Long term data archive = DataVault
o Data Asset Register (DAR)
RDM support: Awareness raising, training &
consultancy
http://edin.ac/1u3sKqy
Before research During research After research
Research Data Management Planning
Customised instance of DCC’s DMPonline toolkit for Univ. of Edinburgh use:
A range of funders and local (non-funder) DMP templates
Institutional guidance (storage, services, support)
Tailored DMP assistance for researchers submitting research proposals (F-2-F)
DataStore
NAS facility to store data that are actively used in current research activities
0.5 TB (500GB) allocated per researcher (incl. PGRs)
Up to 0.25TB can be used for “shared” group storage
Extra storage costs: £200 per TB per year incl. back-up and DR copies
Infrastructure in place. Allocation of space devolved to School IT departments
overseen by Heads of IT from each College.
DataShare
 Edinburgh DataShare is the University’s OA multi-disciplinary data repository hosted bt the
Data Library : http://datashare.is.ed.ac.uk
 Assists researchers who want to share their data, get credit for data publication, and
preserve their data for the long-term (DOI, licence, citation)
Data Vault
 Safe, private and secure long-term data archive
 Current focus on front-end application requirements (authorisation, retention & deletion,
file structure, file transfer, integration)
Data Asset Register (DAR)
 Using PURE as the catalogue of data assets produced by Edinburgh researchers for
discovery, access, and re-use as appropriate.
Interoperation
 Systems are more likely to be used if some or all of the components are integrated and
developed to minimise ‘duplication’ of effort
RDM Support
• RDM team work with Research Administrators , Academic Support Librarians and IT
staff in each of the 22 Schools.
• Queries can be sent to the IS Helpline who will direct them to appropriate RDM staff via
CMS.
• Introductory sessions on local RDM services and support for researchers and research
administration staff in Schools / Institutes
• RDM website: http://www.ed.ac.uk/is/data-management
• RDM blog: http://datablog.is.ed.ac.uk
• RDM wiki:
https://www.wiki.ed.ac.uk/display/RDM/Research+Data+Management+Wiki
Training: Tailored Courses
Formal and bespoke training in the form of workshops, seminars and drop in sessions to
help researchers with RDM issues.
 Creating a data management plan for your grant application
 Handling data using SPSS
 Managing your research data: why it is important and what should you do? NEW
 Publishing and sharing sensitive data (pilot) NEW

MANTRA - http://datalib.edina.ac.uk/mantra
 An internationally recognized free online RDM training course for researchers -
developed by the Data Library
 Software-specific data handling exercises
 CC License & embed units in VLE’s e.g. Moodle
Service Integration examples
• DataShare is a customised DSpace instance with OAI-PMH
compliant DCMI metadata fields for data discovery through Google and
other search engines
• Records are harvested by Thomson-Reuters Data Citation Index
• SWORD API utilised for batch deposit of large and/or many files from
computers (‘Push using http’)
• Internal batch ingest of many/large files to circumvent 2.1GB limit via
interface (‘Pull via command line interface’)
• checksums determine that delivered object mirrors deposited object
• DSpace GITHUB plugin* - allows software used in research to be
from GitHub (or similar) source code repository into DataShare, which
then be assigned a DOI to facilitate citation - using the SWORD deposit
protocol
DataSync – a secure dropbox-like facility for synchronising data on DataStore with
desktop and mobile machines:
• uses open source ‘ownCloud’ technology
Refresh of ECDF Computing Cluster (‘Eddie’) complete with ‘Data Centric
Computing’ business model – integrate Eddie storage & HPC, parallel and cloud
computing layers with DataStore for data sharing i.e. data transferred from DataStore
for analysis run on Eddie and then data ported back to DataStore (DataVault)
Linking of SDA toolkit with numeric ASCII data held in DataShare for the purposes of
analysis (re-use)
Facility to embargo variables within numeric files (in statistical analysis package
formats) for subsequent open deposit into DataShare of de-sensitised version
Research data deposit directly from RSpace Electronic Lab Notebook (ELN) interface
into DataShare and Datastore (& Data Vault) using SWORD protocol
Who and what is driving demand for ELNs?
● Researchers
– Utility and convenience of paper lab book + online capabilities
– On multiple devices
– File management/integration
● Groups/PIs
– Controlled sharing
– Collaboration
– Group management
– File management/integration
● Institutions: data librarians, research admins, IT, commercialisation offices
– Enterprise features: Scalable deployment, Single Sign On
– IP protection: audit trail, signing
– Publishing
– Archiving
– Repository integration
– File management/integration
RSpace
RSpace
First electronic notebook for research
institutions
Business Model
● Free public cloud for labs and individuals
● Institutional deployments @$100/user/year
● Seamless movement of groups and data between different RSpaces
Researchers Institutions Funders
Value
Edinburgh
Public
Cloud
Stanford
Lab
LabLab
Convenience
Productivity
Portability
Control
Compliance
Data mining
Data mining
RSpace at Edinburgh
– Linking to files in Edinburgh DataStore
– Depositing content in Edinburgh DataShare
– Archiving in Edinburgh DataVault
Linking to DataStore
“My plan for workflow would be generally to deposit
my data in DataStore either from the wet lab
instruments (gel photos, elisa data, etc, and also
possibly directly from an iPad) or from in silico data
analysis I’ve been doing, and then link to it from within
RSpace.”
Linking to DataStore
Experiment
Procedure
~~~~~~~~~~
~~~~~~~~~~
Results
~~~~~~~~~~
Results.xls
ELN UoE DataStore
Exposing DataStore File Roots
Linking to a DataStore File
Linking to a DataStore File
Linking to a DataStore File
Exporting to DataShare
RSpace
UoE DataShare
Adding metadata
Archiving in Edinburgh DataVault
● DataVault functionality/API not yet specified
● Anticipate use of XML zip archive
● Many requirements to be determined
– e.g., searching, restoration
RSpace and Edinburgh RDM
RSpace
server
DataShareDataStore
DataVault User / Browser
Thanks!
Acknowledgements:
Sunny Yang, Richard Adams, Nigel Goddard
Robin Rice, Kevin Tomlinson, George Hamilton,
Orlando Richards

More Related Content

PPTX
Service integration to Enhance RDM: RSpace electronic lab notebook at the Uni...
PDF
Addressing Institutional Research Data Management - University of Edinburgh R...
PPT
RDM Training Initiatives @ Edinburgh – DIY RDM Training Kit for Librarians
PPT
Northumbria University Geospatial Metadata Workshop 20110505
PPT
The WSTIERIA Project – A Web of Services
PPT
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
PPT
Investigation into Private LOCKSS Networks
PPTX
RJ Broker: Automating Delivery of Research Output to Repositories
Service integration to Enhance RDM: RSpace electronic lab notebook at the Uni...
Addressing Institutional Research Data Management - University of Edinburgh R...
RDM Training Initiatives @ Edinburgh – DIY RDM Training Kit for Librarians
Northumbria University Geospatial Metadata Workshop 20110505
The WSTIERIA Project – A Web of Services
Collaboration to Curation: The High Rise Project meets Edinburgh DataShare
Investigation into Private LOCKSS Networks
RJ Broker: Automating Delivery of Research Output to Repositories

What's hot (20)

PPT
The UK Federation Helpdesk
PPT
Organising and Documenting Data
PPTX
The University of Edinburgh Research Data Management Service Suite
PPT
An On-line Collaborative Data Management System
PPTX
The University of Edinburgh Research Data Management Service Suite
PPT
Open Access Repository Junction
PDF
Delivering Postgraduate Training - MANTRA
PPTX
Certifying CISER! A Data Seal of Approval Case Study
PPTX
Exploiting the value of Dublin Core through pragmatic development
PPTX
EUDAT Research Data Management | www.eudat.eu |
PPT
UKLA Update On Activities
PPT
Archiving The Worlds E-Journals:The Keepers Registry As Global Monitor
PPTX
Edin casestudy-ou-rr-2011
PPT
Open Data and Institutional Repositories
PPTX
RDM for trainee physicians
PPTX
Hughes RDAP11 Data Publication Repositories
PPTX
Metadata harvesting
PPT
Crowdsourcing the Past with AddressingHistory
PPTX
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
PDF
Dataset Citation and Identification
The UK Federation Helpdesk
Organising and Documenting Data
The University of Edinburgh Research Data Management Service Suite
An On-line Collaborative Data Management System
The University of Edinburgh Research Data Management Service Suite
Open Access Repository Junction
Delivering Postgraduate Training - MANTRA
Certifying CISER! A Data Seal of Approval Case Study
Exploiting the value of Dublin Core through pragmatic development
EUDAT Research Data Management | www.eudat.eu |
UKLA Update On Activities
Archiving The Worlds E-Journals:The Keepers Registry As Global Monitor
Edin casestudy-ou-rr-2011
Open Data and Institutional Repositories
RDM for trainee physicians
Hughes RDAP11 Data Publication Repositories
Metadata harvesting
Crowdsourcing the Past with AddressingHistory
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 14, 2016...
Dataset Citation and Identification
Ad

Viewers also liked (7)

PPT
Aggregation as tactic sm new
PPTX
Supporting the development of a national Research Data Discovery Service - A ...
PPTX
RDM through a UK lens - New Roles for Librarians?
PPTX
RDM Programme @ Edinburgh
PPTX
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
PPTX
EPSRC research data expectations and research software management
Aggregation as tactic sm new
Supporting the development of a national Research Data Discovery Service - A ...
RDM through a UK lens - New Roles for Librarians?
RDM Programme @ Edinburgh
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
EPSRC research data expectations and research software management
Ad

Similar to RDM@Edinburgh_interoperation_IDCC2015 (20)

PDF
Service Integration to Enhance RDM
PDF
RDM Programme @ Edinburgh - Service Interoperation
PDF
RDM programme @ Edinburgh an institutional approach
PPTX
Integrating an electronic lab notebook with a data repository; American Chemi...
PDF
Elns and repositories, American Chemical Society, Dallas, March 2014
PDF
Looking After Your Data: RDM @ Edinburgh
PPTX
RDM & ELNs @ Edinburgh
PPTX
Making research data more resourceful - Jisc digital festival 2015
PPTX
Research Data Management at the University of Edinburgh
PPTX
Introduction to RDM for Geoscience PhD Students
PPTX
Research Data Management at The University of Edinburgh
PPTX
RDM Programme at University of Edinburgh
PPTX
Research Data Management: Why is it important?
PDF
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
PPTX
Integrating an electronic lab notebook with a university it environment rdmf ...
PPTX
AKVS - Edinburgh Data Repository Experiences June 2016
PPTX
Research Data Service at the University of Edinburgh
PPTX
RDM @ Edinburgh - Arkivum Workshop
Service Integration to Enhance RDM
RDM Programme @ Edinburgh - Service Interoperation
RDM programme @ Edinburgh an institutional approach
Integrating an electronic lab notebook with a data repository; American Chemi...
Elns and repositories, American Chemical Society, Dallas, March 2014
Looking After Your Data: RDM @ Edinburgh
RDM & ELNs @ Edinburgh
Making research data more resourceful - Jisc digital festival 2015
Research Data Management at the University of Edinburgh
Introduction to RDM for Geoscience PhD Students
Research Data Management at The University of Edinburgh
RDM Programme at University of Edinburgh
Research Data Management: Why is it important?
On being a cog rather than inventing the wheel: Edinburgh DataShare as a key ...
Integrating an electronic lab notebook with a university it environment rdmf ...
AKVS - Edinburgh Data Repository Experiences June 2016
Research Data Service at the University of Edinburgh
RDM @ Edinburgh - Arkivum Workshop

More from Historic Environment Scotland (18)

PPTX
Digital Archiving for Archaeological Units at Historic Environment Scotland
PPTX
Archives & Records Association summer seminar Edinburgh 7 June 2019
PPT
Bonares presentation oct2016v2
PPTX
Introduction to RDM for trainee physicians
PPTX
Introduction to data support services and resources for public policy
PPTX
EPSRC Policy Compliance: What researchers need to know
PPTX
Creating a Data Management Plan for your Grant Application
PPTX
Good Practice in Research Data Management
PPTX
RDM Programme@Edinburgh
PPT
Edinburgh DataShare - DSpace for Data
PPT
Rdm slides march 2014
PPT
RDM Priorities, Stakeholders, Practice
PPTX
CISER & the Data Reference Interview
PPT
AddressingHistory: crowdsourcing the past
PPT
Research Data Management at Edinburgh: Effecting Culture Change
PPT
Harnessing Collective Intelligence For Sustainable Development
PPT
Seminario Sobre Datasets Consorcio Madrono
Digital Archiving for Archaeological Units at Historic Environment Scotland
Archives & Records Association summer seminar Edinburgh 7 June 2019
Bonares presentation oct2016v2
Introduction to RDM for trainee physicians
Introduction to data support services and resources for public policy
EPSRC Policy Compliance: What researchers need to know
Creating a Data Management Plan for your Grant Application
Good Practice in Research Data Management
RDM Programme@Edinburgh
Edinburgh DataShare - DSpace for Data
Rdm slides march 2014
RDM Priorities, Stakeholders, Practice
CISER & the Data Reference Interview
AddressingHistory: crowdsourcing the past
Research Data Management at Edinburgh: Effecting Culture Change
Harnessing Collective Intelligence For Sustainable Development
Seminario Sobre Datasets Consorcio Madrono

RDM@Edinburgh_interoperation_IDCC2015

  • 1. Service Integration to Enhance RDM: RSpace electronic laboratory notebook (ELN) case study Stuart Macdonald (RDM Services Coordinator, University of Edinburgh) stuart.macdonald@ed.ac.uk Rory Macneil (CEO, Research Space) rmacneil@researchspace.com 10th International Digital Curation Conference, London, 10 February 2014
  • 2. University of Edinburgh RDM Policy  University of Edinburgh is one of the first Universities in UK to adopt a policy for managing research data: http://www.ed.ac.uk/is/research- data-policy  The policy was approved by the University Court on 16 May 2011.  It’s acknowledged that this is an aspirational policy and that implementation will take some years.
  • 3. Policy implementation: RDM Roadmap Colleagues in IS developed a roadmap to deliver a suite of services that will meet: • RDM policy objectives • the needs of our researchers Cross-divisional collaboration  3 Phases (Aug. 2012 – May 2015) Services already in place: o Data management planning o Active working file space = DataStore o Data publication repository = DataShare Services in development: o Long term data archive = DataVault o Data Asset Register (DAR) RDM support: Awareness raising, training & consultancy http://edin.ac/1u3sKqy Before research During research After research
  • 4. Research Data Management Planning Customised instance of DCC’s DMPonline toolkit for Univ. of Edinburgh use: A range of funders and local (non-funder) DMP templates Institutional guidance (storage, services, support) Tailored DMP assistance for researchers submitting research proposals (F-2-F) DataStore NAS facility to store data that are actively used in current research activities 0.5 TB (500GB) allocated per researcher (incl. PGRs) Up to 0.25TB can be used for “shared” group storage Extra storage costs: £200 per TB per year incl. back-up and DR copies Infrastructure in place. Allocation of space devolved to School IT departments overseen by Heads of IT from each College.
  • 5. DataShare  Edinburgh DataShare is the University’s OA multi-disciplinary data repository hosted bt the Data Library : http://datashare.is.ed.ac.uk  Assists researchers who want to share their data, get credit for data publication, and preserve their data for the long-term (DOI, licence, citation) Data Vault  Safe, private and secure long-term data archive  Current focus on front-end application requirements (authorisation, retention & deletion, file structure, file transfer, integration) Data Asset Register (DAR)  Using PURE as the catalogue of data assets produced by Edinburgh researchers for discovery, access, and re-use as appropriate. Interoperation  Systems are more likely to be used if some or all of the components are integrated and developed to minimise ‘duplication’ of effort
  • 6. RDM Support • RDM team work with Research Administrators , Academic Support Librarians and IT staff in each of the 22 Schools. • Queries can be sent to the IS Helpline who will direct them to appropriate RDM staff via CMS. • Introductory sessions on local RDM services and support for researchers and research administration staff in Schools / Institutes • RDM website: http://www.ed.ac.uk/is/data-management • RDM blog: http://datablog.is.ed.ac.uk • RDM wiki: https://www.wiki.ed.ac.uk/display/RDM/Research+Data+Management+Wiki
  • 7. Training: Tailored Courses Formal and bespoke training in the form of workshops, seminars and drop in sessions to help researchers with RDM issues.  Creating a data management plan for your grant application  Handling data using SPSS  Managing your research data: why it is important and what should you do? NEW  Publishing and sharing sensitive data (pilot) NEW  MANTRA - http://datalib.edina.ac.uk/mantra  An internationally recognized free online RDM training course for researchers - developed by the Data Library  Software-specific data handling exercises  CC License & embed units in VLE’s e.g. Moodle
  • 8. Service Integration examples • DataShare is a customised DSpace instance with OAI-PMH compliant DCMI metadata fields for data discovery through Google and other search engines • Records are harvested by Thomson-Reuters Data Citation Index • SWORD API utilised for batch deposit of large and/or many files from computers (‘Push using http’) • Internal batch ingest of many/large files to circumvent 2.1GB limit via interface (‘Pull via command line interface’) • checksums determine that delivered object mirrors deposited object • DSpace GITHUB plugin* - allows software used in research to be from GitHub (or similar) source code repository into DataShare, which then be assigned a DOI to facilitate citation - using the SWORD deposit protocol
  • 9. DataSync – a secure dropbox-like facility for synchronising data on DataStore with desktop and mobile machines: • uses open source ‘ownCloud’ technology Refresh of ECDF Computing Cluster (‘Eddie’) complete with ‘Data Centric Computing’ business model – integrate Eddie storage & HPC, parallel and cloud computing layers with DataStore for data sharing i.e. data transferred from DataStore for analysis run on Eddie and then data ported back to DataStore (DataVault) Linking of SDA toolkit with numeric ASCII data held in DataShare for the purposes of analysis (re-use) Facility to embargo variables within numeric files (in statistical analysis package formats) for subsequent open deposit into DataShare of de-sensitised version Research data deposit directly from RSpace Electronic Lab Notebook (ELN) interface into DataShare and Datastore (& Data Vault) using SWORD protocol
  • 10. Who and what is driving demand for ELNs? ● Researchers – Utility and convenience of paper lab book + online capabilities – On multiple devices – File management/integration ● Groups/PIs – Controlled sharing – Collaboration – Group management – File management/integration ● Institutions: data librarians, research admins, IT, commercialisation offices – Enterprise features: Scalable deployment, Single Sign On – IP protection: audit trail, signing – Publishing – Archiving – Repository integration – File management/integration
  • 11. RSpace RSpace First electronic notebook for research institutions
  • 12. Business Model ● Free public cloud for labs and individuals ● Institutional deployments @$100/user/year ● Seamless movement of groups and data between different RSpaces Researchers Institutions Funders Value Edinburgh Public Cloud Stanford Lab LabLab Convenience Productivity Portability Control Compliance Data mining Data mining
  • 13. RSpace at Edinburgh – Linking to files in Edinburgh DataStore – Depositing content in Edinburgh DataShare – Archiving in Edinburgh DataVault
  • 14. Linking to DataStore “My plan for workflow would be generally to deposit my data in DataStore either from the wet lab instruments (gel photos, elisa data, etc, and also possibly directly from an iPad) or from in silico data analysis I’ve been doing, and then link to it from within RSpace.”
  • 17. Linking to a DataStore File
  • 18. Linking to a DataStore File
  • 19. Linking to a DataStore File
  • 22. Archiving in Edinburgh DataVault ● DataVault functionality/API not yet specified ● Anticipate use of XML zip archive ● Many requirements to be determined – e.g., searching, restoration
  • 23. RSpace and Edinburgh RDM RSpace server DataShareDataStore DataVault User / Browser
  • 24. Thanks! Acknowledgements: Sunny Yang, Richard Adams, Nigel Goddard Robin Rice, Kevin Tomlinson, George Hamilton, Orlando Richards

Editor's Notes

  • #10: Edinburgh Data Science Institute Centre for Doctoral Training in Data Science (School of Informatics) Data Lab Innovation centre
  • #12: Designed and developed over three years with teams at three leading global research institutions The first and only ELN developed to meet the needs of research institutions – third generation supplanting second generation lab-focused ELNs. Comparison with major competitor, Lab Archives, shows dominance of RSpace for institutional requirements. Individual features not remarkable; advantage comes from the combination. Sustainable advantage from relationships with three partners, long head start and difficulty/impossibility of discovering and implementing detailed requirements of this customer set. Result: no real contest – like Man U vs. Cambridge United!