SlideShare a Scribd company logo
The Research Object Initiative:
Frameworks and Use Cases
Professor Carole Goble
The University of Manchester, UK
carole.goble@manchester.ac.uk
NIH BD2K BioCADDIE webinar, 11 June 2015
From Manuscripts to
Research Objects
“An article about computational science in a scientific publication is not the
scholarship itself, it is merely advertising of the scholarship. The actual
scholarship is the complete software development environment, [the
complete data] and the complete set of instructions which generated the
figures.” David Donoho, “Wavelab and Reproducible Research,” 1995
Datasets, Data collections
Standard operating procedures
Software, algorithms
Configurations,
Tools and apps, services
Codes, code libraries
Workflows, scripts
System software
Infrastructure
Compilers, hardware
Scattered Assets
The Research Object Initiative:Frameworks and Use Cases
Concept
Drivers for Research Objects (1)
• Computational Workflows /
Scripts
– Multi-step, nested.
– Data, executable codes, services
(remote and local), libraries
– Preservation, Repair
– Reproducibility
• Systems Biology
– Models, data (construction, validation,
predicted), SOPs, samples
– Structured around Investigations,
Studies, Assays
– Exchange
– Reproducibility
Drivers for Research Objects (2)
• ComputationalWorkflows
Commons
– Projects and individuals
– myExperiment.org
• Systems Biology Commons
– Modellers and experimentalists
– Projects and Programs
– Catalogue of research assets
– Fairdomhub.org
– Fair-dom.org
– Seek4science.org
"Mapping present and future predicted distribution patterns for a
meso-grazer guild in the Baltic Sea" by Sonja Leidenberger et al
Workflow Commons
https://doi.org/10.15490/seek.1.investigation.56
The Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use Cases
[Snoep, 2015]
https://doi.org/10.15490/seek.1.investigation.56
Penkler et al (2015) FEBSJ 282:1481-1511.
https://sems.uni-rostock.de/reproducible-and-citable-data-and-models/
Local
Repositories
LIMS
Public
Repositories
Central repositories
Funding
Agencies
Catalogue
Search
Index
Tools
Research
Infrastructures
execute
companion site
CRIS
results
gateway
catalogue
Standards
metadata
Consumers
Producers
Publishers
haven
platform
Commons
Research Objects
1. Multi-various, citable research products
Research Objects
2. Compound, nested, scattered, yet interconnected
research products, structured investigations
Research Objects
3. Preserved, Portable research products,
inter-platform exchange, reproducibility
Pop-up projects
Dynamic groups
Internal / external visibility
Commons
Research Objects
4. Active research products: evolving. executable.
• Fork.
• Merge.
• Version.
• Cite
• Snapshot.
• Live.
[Martin Scharm]
Haus et al, BMC Systems Biology, 2011, 5:10
Solvent production by Clostridium acetobutylicum
Bigger on the inside than the outside
cite? resolve? steward?
closed
embed
fixed
local
open
alien
refer
fluid
Content
TARDIS Time and Relative Dimension
in Space Scholarship
Multi Span
type
steward
site
author
research
researchers
platforms
time
Contributions
Bigger on the inside than the outside
cite? resolve? steward?
closed
embed
fixed
local
open
alien
refer
fluid
Content
TARDIS Time and Relative Dimension
in Space Scholarship
Multi Span
type
steward
site
author
research
researchers
platforms
time
Contributions
Goble, De Roure, Bechhofer, Accelerating KnowledgeTurns, I3CK, 2013
Knowledge
Turning
interpret
Commons
FAIR
Research
Products
Reproducibility
Interpretation
Comparison
Preservation
Portability
Release
Active
Research
http://ccrtypewriter.blogspot.co.uk/
Research Objectmeans
ends
driver
Framework
Multi-various products, platforms, resources
First class citizens - id, manage, credit, track,
profile, focus
A Framework to Bundle, Port and Link (scattered) resources, related
experiments. Metadata Objects that carry Research Context.
Units of exchange.
Research Objects
http://www.researchobject.org
The Research Object Framework
Desiderata
Technology Independent.
The least possible.
The simplest feasible.
Graceful degradation.
Research Object Framework
Principles & Conventions
API specificationMetadata formats
RO Core
model
using
standards
Annotation
profiles
progressive
extensionsAdobe
UCF
ORE
ODF
OADM/
PROV
Research Object Framework
Principles & Conventions
API specification
Platform Profiles using legacy &
commodity platforms
Metadata formats
Policies Services
Tools
Lifecycle
Steward
Ship
Training
…
Commodity
Native
RO Core
model
using
standards
Annotation
profiles
progressive
extensionsAdobe
UCF
ORE
ODF
OADM/
PROV
Identity
Aggregation
Interpretation:
The objects
How they are
linked together
RO Core Model
manifest
Refer to
aggregations
and their
contents
Describe group
& constituents
External ids
Local files
Attribution:
Who , when,
where, why?
Metadata
Description
RO Core Model
Aggregations
Resource maps
Proxies
Annotation first
class and stand-off
Identity persistence and
resolution, Names
Citation
Identity
Annotation
Aggregation
DOIs
URIs
Handles
ORCID
W3C
OADM
OAI-
ORE
manifest
Point of
extendability
Identity
Annotation
Aggregation
RO Core Platform Profiles
DOIs
URIs
Handles
ORCID
Data Citation
Implementation
OAI-
ORE
W3C
OADM
RO Model Ontology
http://w3id.org/ro/
Defines core concepts of research objects, identity,
aggregation, annotation. Used in the manifest
Metadata Objects
Manifest
The Container Manifest content and the
relationships between the content
• RO metadata- id, title, creator, status….
• Aggregates – list of ids/links to resources
• Annotations – list of annotations about
resources
The Objects
• Remote,
through links
• Locally,
embedded
Manifest – remote and local
on my machine
Container Machinery
Manifest
The Container
Packaging:
Zip files, DOCKER Images…
Catalogues & Commons:
FAIRDOM SEEK, Farr Commons
CKAN, myExperiment…
The Container Manifest
content and the relationships
between the content
Export, archive, publish and transfer ROs.
File format for storage and distribution of
ROs as a ZIP archive
Includes an RO’s manifest, annotations and
some or all of its aggregated resources
Basis for more specific file formats
Backwards compatible: its zip
Programmatic access: JSON and JSON-LD
manifest, API
https://researchobject.github.io/specifications/bundle/
https://w3id.org/bundle/ doi:10.5281/zenodo.10440
https://researchobject.github.io/specifications/bundle/
https://w3id.org/bundle/ doi:10.5281/zenodo.10440
http://www.cnri.reston.va.us/papers/OverviewDigitalObjectArchit
ecture.pdf
RO Lifecycles,
Resolution, Citation
• Defend it (snapshot)
• Locate it (most recent)
• Reuse it (a version, a
component)
• Credit it (contributory
authorship)
• Cross link it (connections)
PURL
Checklists
Versioning
Provenance
Dependencies
Annotation
Profiles
.
Depth: how deeply
described
Coverage: how
much is covered.
Progression levels
Semantic Framework
PID
The Manifest
The Object Metadata
PAV
VoID
VIVO-ISF
PAV
Mim Ontology
Puppet, Makefile
Less detail,
more stakeholders
Checklists
Gamble M, Goble CA, Klyne G, Zhao J
Mim: A minimum information model vocabulary and
framework for scientific linked data IEEE 8th Intl
Conf on eScience pp: 1-8
Zhao J, Klyne G, Gamble M, Goble CA - A Checklist-
Based Approach for Quality Assessment of Scientific
Information Proc Third Linked Science Workshop
2013, co-located ISWC2013.
Library
Publishers
Experiments
Type specific
PID
Citation
NISO-
JATS
Dublin Core
ISA
MIAME
Wf-Desc
Checklist
Annotation
Profiles
.
OBI
SBML,
SED-ML
JERM
EXPO
Wf-prov
Gamble M, Goble CA, Klyne G, Zhao J
Mim: A minimum information model vocabulary
and framework for scientific linked data IEEE 8th
Intl Conf on eScience pp: 1-8
Use Cases
Use case
• SEEK Commons
for Systems
Biology
• Natively RO
• Export/Import
RO bundles
SEEK Metadata framework
link studies and link assets
Describes
common
elements and
relationships
between things
produced and
used in
experiments.
Structured
descriptions for
consistency and
comparison
Just Enough
Results Model
Snapshots
& Living
Living ROs
Snapshot RO of
investigation
and all its parts
Community Sys Bio Models
metadata + packaging
Bergmann, Rodriguez, Le Novère.
COMBINE archive specification.
<http://identifiers.org/combine.specifications/o
mex.version-1> (2014)
Bergman et al COMBINE archive and OMEX
format: one file to share all information to
reproduce a modeling project, BMC
Bioinformatics 2014, 15:369
Combine with RO.
Standardised metadata & API
http://co.mbine.org/documents/archive
https://github.com/stain/ro-combine-archive
doi:10.5281/zenodo.10439
Bridge from Research to FAIR publishing
Deposit
Run
RO Unzip
The Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use Cases
RO Query
Use Case: Taverna Workflows
Workflow Results
workflowrun.prov.ttl
(RDF)
outputA.txt
outputC.jpg
outputB/
https://w3id.org/bundle
intermediates/
1.txt
2.txt
3.txt
de/def2e58b-50e2-4949-9980-fd310166621a.txt
inputA.txt
workflow
URI
references
attribution
execution
environment
Aggregating in Research Object
ZIP folder structure (RO Bundle)
mimetype
application/vnd.wf4ever.robundle+zip
.ro/manifest.json
Workflow Specification
Example data and
config.
Components.
Plug-ins,Versions
Workflow System
Software package
Workflow Runs
Data and
configs
Provenance
logs
Study
Asset specific Commons
Personal Notebook
Community Registry
General Publishing Repository
Use case: ATLAS Collider
Data Analytics
Portable, lightweight
application runtime
and packaging tool.
Image
ATLAS and CMS detector data
CharlesVardeman,
Da Huo
All data and files
of the execution
+ Instructions
convert
bundle
manifest
Relate files
and layers
Add provenance
and annotations
Link in other
content
Use case:
The Farr Institute Commons
safe use of patient and research
data for medical research
clinical study cohorts
Research Objects:
scripts, data, samples…
different e-Labs, legacy data
http://www.farrinstitute.org/
Use case:
The Farr Institute Commons
The open source data portal software
exchange
catalogue
deposit
Use case:
The Farr Institute Commons
The open source data portal software
exchange
catalogue
deposit
Uses “code as a
research object”
functionality
Baking RO Infrastructure
make, import, export,
inspect, render, version, process, check, …
• Libraries
– Create and inspect RO Bundles and their metadata
– Java, Ruby and Python
• User tools
– RO Manager: command line tool to make ROs
– ROHUB: a prototype web app to manage ROs
• Platforms
– SEEK
– CKAN plug-in to build, import and export ROs
http://www.researchobject.org/specifications/
NIH BD2K + Research Objects
Metadata Profiles
RO Model API
Community IDs*
RO Model Manifest Profile
Implementation Profiles
*BioMedBridges 10 Rules for Identifiers.
Summary
FAIR Research Objects:
• Concept, model, framework, use cases
• Lightweight, Incremental
Challenges
• Multi-stewarding and lifecycles (OAIS)
• Policy, governance
Partnerships
• Figshare, Oxford Bodliean, Farr Institute
• BioCADDIE?
Acknowledgements & Links
Stian Soiland-Reyes
Matt Gamble
Rob Haines
Sean Bechhofer
Norman Morrison
Phil Crouch
Finn Bacall
Stuart Owen
Carole Goble
Khalid Belhajjame
Graham Klyne
Jun Zhao
Daniel Garijo,
Oscar Corcho
Esteban García
Cuesta
University of
Manchester
University of Oxford
Lancaster University
UPM
http://researchobject.org
http://fair-dom.org
http://www.seek4science.org
http://www.farrinstitute.org
http://www.wf4ever-project.org
http://myexperiment.org
Raul Palma
iSOCO
PSNC
Paris 6

More Related Content

PPTX
FAIRer Research
PPTX
Research Objects, SEEK and FAIRDOM
PPTX
The Rhetoric of Research Objects
PPTX
Research Objects: more than the sum of the parts
PPTX
Mtsr2015 goble-keynote
PPTX
Being FAIR: Enabling Reproducible Data Science
PPTX
Being Reproducible: SSBSS Summer School 2017
PPTX
FAIRy Stories
FAIRer Research
Research Objects, SEEK and FAIRDOM
The Rhetoric of Research Objects
Research Objects: more than the sum of the parts
Mtsr2015 goble-keynote
Being FAIR: Enabling Reproducible Data Science
Being Reproducible: SSBSS Summer School 2017
FAIRy Stories

What's hot (20)

PPTX
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
PPT
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
PPTX
Crediting informatics and data folks in life science teams
PPTX
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
PPTX
Reproducibility, Research Objects and Reality, Leiden 2016
PPTX
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
PPTX
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
PPTX
Introduction to FAIRDOM
PPTX
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
PPTX
ROHub
PPTX
Reproducibility (and the R*) of Science: motivations, challenges and trends
PPTX
Aspects of Reproducibility in Earth Science
PPTX
FAIR Data and Model Management for Systems Biology (and SOPs too!)
PPTX
Let’s go on a FAIR safari!
PPTX
Advances in Scientific Workflow Environments
PPTX
RARE and FAIR Science: Reproducibility and Research Objects
PPTX
Citing data in research articles: principles, implementation, challenges - an...
PPTX
SEEK for Science: A Data and Model Management Platform to support Open and Re...
PPTX
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
PDF
Reproducible research: First steps.
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
Crediting informatics and data folks in life science teams
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
Reproducibility, Research Objects and Reality, Leiden 2016
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
Introduction to FAIRDOM
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
ROHub
Reproducibility (and the R*) of Science: motivations, challenges and trends
Aspects of Reproducibility in Earth Science
FAIR Data and Model Management for Systems Biology (and SOPs too!)
Let’s go on a FAIR safari!
Advances in Scientific Workflow Environments
RARE and FAIR Science: Reproducibility and Research Objects
Citing data in research articles: principles, implementation, challenges - an...
SEEK for Science: A Data and Model Management Platform to support Open and Re...
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Reproducible research: First steps.
Ad

Similar to The Research Object Initiative: Frameworks and Use Cases (20)

PPTX
Research Object Community Update
PPTX
FAIR data and model management for systems biology (and SOPs too!)
PPT
UK Digital Curation Centre: enabling research data management at the coalface
PPT
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
PPTX
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
PDF
A Clean Slate?
PPTX
Metadata for Research Objects
PPT
Results may vary: Collaborations Workshop, Oxford 2014
PPTX
Conservation of Scientific Workflow Infrastructures by Using Semantics - 2012
PPTX
Keynote speech - Carole Goble - Jisc Digital Festival 2015
PPTX
Preserving the Inputs and Outputs of Scholarship
PPTX
Research Objects for FAIRer Science
PDF
Research Shared: researchobject.org
PPTX
From Scientific Workflows to Research Objects: Publication and Abstraction of...
PPT
The eCrystals Federation
PDF
Sharing massive data analysis: from provenance to linked experiment reports
PPTX
The FAIRDOM Commons for Systems Biology
PDF
Standards and tools for model management in biomedical research
PDF
FAIR BioData Management
PPTX
Reproducible and citable data and models: an introduction.
Research Object Community Update
FAIR data and model management for systems biology (and SOPs too!)
UK Digital Curation Centre: enabling research data management at the coalface
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
A Clean Slate?
Metadata for Research Objects
Results may vary: Collaborations Workshop, Oxford 2014
Conservation of Scientific Workflow Infrastructures by Using Semantics - 2012
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Preserving the Inputs and Outputs of Scholarship
Research Objects for FAIRer Science
Research Shared: researchobject.org
From Scientific Workflows to Research Objects: Publication and Abstraction of...
The eCrystals Federation
Sharing massive data analysis: from provenance to linked experiment reports
The FAIRDOM Commons for Systems Biology
Standards and tools for model management in biomedical research
FAIR BioData Management
Reproducible and citable data and models: an introduction.
Ad

More from Carole Goble (20)

PPTX
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
PPTX
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
PPTX
RO-Crate: packaging metadata love notes into FAIR Digital Objects
PPTX
Research Software Sustainability takes a Village
PPTX
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
PPTX
FAIR Computational Workflows
PPTX
Open Research: Manchester leading and learning
PPTX
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
PPTX
FAIR Computational Workflows
PPTX
FAIR Computational Workflows
PPTX
EOSC-Life Workflow Collaboratory
PPTX
FAIR Computational Workflows
PPTX
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
PPTX
FAIR Computational Workflows
PPTX
FAIR Workflows and Research Objects get a Workout
PPTX
FAIRy stories: the FAIR Data principles in theory and in practice
PPTX
RO-Crate: A framework for packaging research products into FAIR Research Objects
PPTX
The swings and roundabouts of a decade of fun and games with Research Objects
PPTX
How are we Faring with FAIR? (and what FAIR is not)
PPTX
What is Reproducibility? The R* brouhaha and how Research Objects can help
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
RO-Crate: packaging metadata love notes into FAIR Digital Objects
Research Software Sustainability takes a Village
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
FAIR Computational Workflows
Open Research: Manchester leading and learning
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
FAIR Computational Workflows
FAIR Computational Workflows
EOSC-Life Workflow Collaboratory
FAIR Computational Workflows
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Computational Workflows
FAIR Workflows and Research Objects get a Workout
FAIRy stories: the FAIR Data principles in theory and in practice
RO-Crate: A framework for packaging research products into FAIR Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
How are we Faring with FAIR? (and what FAIR is not)
What is Reproducibility? The R* brouhaha and how Research Objects can help

Recently uploaded (20)

PPT
Chemical bonding and molecular structure
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PPTX
famous lake in india and its disturibution and importance
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PDF
diccionario toefl examen de ingles para principiante
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
Microbiology with diagram medical studies .pptx
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PPTX
2. Earth - The Living Planet Module 2ELS
PDF
The scientific heritage No 166 (166) (2025)
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
Chemical bonding and molecular structure
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
The KM-GBF monitoring framework – status & key messages.pptx
Phytochemical Investigation of Miliusa longipes.pdf
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
famous lake in india and its disturibution and importance
7. General Toxicologyfor clinical phrmacy.pptx
diccionario toefl examen de ingles para principiante
Taita Taveta Laboratory Technician Workshop Presentation.pptx
TOTAL hIP ARTHROPLASTY Presentation.pptx
Classification Systems_TAXONOMY_SCIENCE8.pptx
neck nodes and dissection types and lymph nodes levels
Microbiology with diagram medical studies .pptx
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
2. Earth - The Living Planet Module 2ELS
The scientific heritage No 166 (166) (2025)
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.

The Research Object Initiative: Frameworks and Use Cases