www.postersession.com
SEAD Member Node is a Multi-Repository node that:
• Serves sustainability science community
• Shares and preserves data via any number of repositories
• Gives focused exposure through DataONE to sustainability science
products
• Advocates for and supports distributed and interoperable data
sharing
SEAD 2.0 Multi-Repository Member Node
Charitha Madurangi, Inna Kouper, Yu Luo, Isuru Suriarachchi,
Kunalan Ratharanjan, and Beth Plale
Indiana University SEAD
Curbee is a lightweight Java publishing pipeline that:
• Accepts Research Objects formed as a single ORE package
• Validates contents of Research Object
• Generates and preserves metadata
• Notifies recommended repository of prepared submission
• Synchronizes metadata with DataONE for discoverability
A recommendation engine that:
• Responds to requests from Project Spaces and independent
clients for recommendations for their Research Object
• Utilizes information from profiles about People, Data, and
Things (Repositories) to recommend the most appropriate
repository for a research object
Matchmaker profiles People, Data, and Things (Repositories):
• People: profiles are one of three kinds: ORCID ID, Google
identity, or Clowder identity (NCSA)
• Data: profiles are in JSON-LD format as defined by SEAD
• Repositories: definitions inspired by profiles used in the
Registry of Research Data Repositories, re3data.org
SEAD 2.1 will support fuzzy reasoning
Plale, B., Kouper, I., Goodwell, A., & Suriarachchi, I. (in press).
Trust Threads: Minimal Provenance for Data Publishing and
Reuse. In C. R. Sugimoto, H. Ekbia, & M. Mattioli (Eds.), Big
Data is Not a Monolith: Policies, Practices and Problems. MIT
Press.
• 2.0 release Jul 2016; 2.1 release Oct 2016
• Long term commitment to research objects published in IU SEAD
Cloud
• Make publishing tools standalone
• Embed SEAD publishing into existing analysis frameworks
• Extend Curbee to handle external submissions
A popular but basic repository of SEAD that is:
• Large scale replicated storage server at Indiana University
• Highly available and ingests large scale objects easily
• Pulls new Research Objects from SEAD
• Uses BagIt to describe the research objects
• Harvests minimal metadata and creates landing page per object
• Stores objects as zipped archives, but exposes the contents from
the landing page
• Assigns DOI
Core principles:
1. Offer curation as early in research process as possible
2. Provide choice to researchers in where to publish their data
3. Avoid duplication of services when partnering with existing and
emerging repositories
4. Support minimal data provenance by tracking and distinguishing
between revised, derived, and replicated datasets
CurBeeSubmissionAPI
Independent
Research Object
Submissions
Komadu
Provenance Store
Data Storage
MongoDB
object profile and
state store
DataONE
web file-space
CurBee
Persist
Library of Micro-services
Validate
Record provenance
CurBee-Service
DataONE MN API
Prepare for DataONE
Generate metadata
Curbee Architecture
Research
Object
Unique ID
Agents
StatesRelationships
Content
• Data creator
• Curator
• Data re-use scientist
• Live
• Curated
• Published
• Aggregates
• Related to
• Describes
• Derived from
• Versioned from
• Files
• Bitstreams
• Pointers
• Annotations

More Related Content

PPTX
Ways for researchers to store, share, discover, and use data_Cousijn
PDF
Open Data for Research
PPTX
The Economics of Data Sharing
PDF
Data Citation Implementation Guidelines By Tim Clark
PPTX
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
PPTX
THOR Workshop - Data Publishing Elsevier
PPTX
Open Access: funders' policies and recent updates
PPTX
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Ways for researchers to store, share, discover, and use data_Cousijn
Open Data for Research
The Economics of Data Sharing
Data Citation Implementation Guidelines By Tim Clark
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
THOR Workshop - Data Publishing Elsevier
Open Access: funders' policies and recent updates
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum

What's hot (20)

PPTX
Data curation
PPT
Altman RDAP11 Policy-based Data Management
PPT
Smith RDAP11 NSF Data Management Plan Case Studies
PPTX
Publishing the Full Research Data Lifecycle
PDF
Maureen C Kelly Managing Access in New World of Scholarly Research
PPTX
RDA-WDS Publishing Data Interest Group
PPTX
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
PPTX
Why would a publisher care about open data?
PPTX
EPSRC Policy Compliance: What researchers need to know
PPTX
Research Data Management: Why is it important?
PPT
Identifiers for Researchers and Data: Increasing Attribution and Discovery– J...
PPTX
Introduction to research data management
PDF
Introduction to the Environmental Data Initiative (EDI)
PPT
An Open Context for Zooarchaeology: Publishing Research Data on the Web
PPTX
The challenge of sharing data well, how publishers can help
PDF
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
PPT
Rots RDAP11 Data Archives in Federal Agencies
PPTX
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
PPTX
EDI Training Module 12: An Introduction to Metadata and Data Repositories
PPTX
Establishing a UQ Research Data Management Service
Data curation
Altman RDAP11 Policy-based Data Management
Smith RDAP11 NSF Data Management Plan Case Studies
Publishing the Full Research Data Lifecycle
Maureen C Kelly Managing Access in New World of Scholarly Research
RDA-WDS Publishing Data Interest Group
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Why would a publisher care about open data?
EPSRC Policy Compliance: What researchers need to know
Research Data Management: Why is it important?
Identifiers for Researchers and Data: Increasing Attribution and Discovery– J...
Introduction to research data management
Introduction to the Environmental Data Initiative (EDI)
An Open Context for Zooarchaeology: Publishing Research Data on the Web
The challenge of sharing data well, how publishers can help
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
Rots RDAP11 Data Archives in Federal Agencies
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
EDI Training Module 12: An Introduction to Metadata and Data Repositories
Establishing a UQ Research Data Management Service
Ad

Similar to SEAD 2.0 Multi-Repository Member Node (20)

PDF
SEAD: Anatomy of a multi-repository member node
PDF
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
PPTX
Data Literacy: Creating and Managing Reserach Data
PDF
How can we ensure research data is re-usable? The role of Publishers in Resea...
PPTX
Networked Science, And Integrating with Dataverse
PPTX
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
PPT
Getting to grips with Research Data Management
PPTX
CNI 2018: A Research Object Authoring Tool for the Data Commons
PPTX
Supporting Good Practice in Research Data Management: Edinburgh’s Experience
PPT
BioMed Central's open data initiatives
PPT
Getting to grips with research data management
PPTX
UWA Research Week 2016
PPTX
IASSIST40: Data management & curation workshop
PPTX
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PPTX
Scholze liber 2015-06-25_final
PPTX
Digital Repositories: Essential Information for Academic Librarians
PDF
Scientific Data and peer review session at Dryad event, May 2015
PPTX
Steven McEachern - ADA, DDI (metadata standard) and the Data Lifecycle
PDF
ADA, DDI and the data lifecycle - Steve McEachern - 7 April 2017
PPT
The Rise of the Data Journal
SEAD: Anatomy of a multi-repository member node
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Data Literacy: Creating and Managing Reserach Data
How can we ensure research data is re-usable? The role of Publishers in Resea...
Networked Science, And Integrating with Dataverse
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Getting to grips with Research Data Management
CNI 2018: A Research Object Authoring Tool for the Data Commons
Supporting Good Practice in Research Data Management: Edinburgh’s Experience
BioMed Central's open data initiatives
Getting to grips with research data management
UWA Research Week 2016
IASSIST40: Data management & curation workshop
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
Scholze liber 2015-06-25_final
Digital Repositories: Essential Information for Academic Librarians
Scientific Data and peer review session at Dryad event, May 2015
Steven McEachern - ADA, DDI (metadata standard) and the Data Lifecycle
ADA, DDI and the data lifecycle - Steve McEachern - 7 April 2017
The Rise of the Data Journal
Ad

Recently uploaded (20)

PDF
Transcultural that can help you someday.
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PDF
Microsoft 365 products and services descrption
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
Steganography Project Steganography Project .pptx
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPT
Predictive modeling basics in data cleaning process
PPT
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PPTX
modul_python (1).pptx for professional and student
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PDF
Microsoft Core Cloud Services powerpoint
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PPTX
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
PPTX
SET 1 Compulsory MNH machine learning intro
PPTX
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
PDF
Introduction to Data Science and Data Analysis
PDF
Introduction to the R Programming Language
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
PPTX
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Transcultural that can help you someday.
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Microsoft 365 products and services descrption
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
Steganography Project Steganography Project .pptx
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
Predictive modeling basics in data cleaning process
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
modul_python (1).pptx for professional and student
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
Microsoft Core Cloud Services powerpoint
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
SET 1 Compulsory MNH machine learning intro
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
Introduction to Data Science and Data Analysis
Introduction to the R Programming Language
retention in jsjsksksksnbsndjddjdnFPD.pptx
Phase1_final PPTuwhefoegfohwfoiehfoegg.pptx
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}

SEAD 2.0 Multi-Repository Member Node

  • 1. www.postersession.com SEAD Member Node is a Multi-Repository node that: • Serves sustainability science community • Shares and preserves data via any number of repositories • Gives focused exposure through DataONE to sustainability science products • Advocates for and supports distributed and interoperable data sharing SEAD 2.0 Multi-Repository Member Node Charitha Madurangi, Inna Kouper, Yu Luo, Isuru Suriarachchi, Kunalan Ratharanjan, and Beth Plale Indiana University SEAD Curbee is a lightweight Java publishing pipeline that: • Accepts Research Objects formed as a single ORE package • Validates contents of Research Object • Generates and preserves metadata • Notifies recommended repository of prepared submission • Synchronizes metadata with DataONE for discoverability A recommendation engine that: • Responds to requests from Project Spaces and independent clients for recommendations for their Research Object • Utilizes information from profiles about People, Data, and Things (Repositories) to recommend the most appropriate repository for a research object Matchmaker profiles People, Data, and Things (Repositories): • People: profiles are one of three kinds: ORCID ID, Google identity, or Clowder identity (NCSA) • Data: profiles are in JSON-LD format as defined by SEAD • Repositories: definitions inspired by profiles used in the Registry of Research Data Repositories, re3data.org SEAD 2.1 will support fuzzy reasoning Plale, B., Kouper, I., Goodwell, A., & Suriarachchi, I. (in press). Trust Threads: Minimal Provenance for Data Publishing and Reuse. In C. R. Sugimoto, H. Ekbia, & M. Mattioli (Eds.), Big Data is Not a Monolith: Policies, Practices and Problems. MIT Press. • 2.0 release Jul 2016; 2.1 release Oct 2016 • Long term commitment to research objects published in IU SEAD Cloud • Make publishing tools standalone • Embed SEAD publishing into existing analysis frameworks • Extend Curbee to handle external submissions A popular but basic repository of SEAD that is: • Large scale replicated storage server at Indiana University • Highly available and ingests large scale objects easily • Pulls new Research Objects from SEAD • Uses BagIt to describe the research objects • Harvests minimal metadata and creates landing page per object • Stores objects as zipped archives, but exposes the contents from the landing page • Assigns DOI Core principles: 1. Offer curation as early in research process as possible 2. Provide choice to researchers in where to publish their data 3. Avoid duplication of services when partnering with existing and emerging repositories 4. Support minimal data provenance by tracking and distinguishing between revised, derived, and replicated datasets CurBeeSubmissionAPI Independent Research Object Submissions Komadu Provenance Store Data Storage MongoDB object profile and state store DataONE web file-space CurBee Persist Library of Micro-services Validate Record provenance CurBee-Service DataONE MN API Prepare for DataONE Generate metadata Curbee Architecture Research Object Unique ID Agents StatesRelationships Content • Data creator • Curator • Data re-use scientist • Live • Curated • Published • Aggregates • Related to • Describes • Derived from • Versioned from • Files • Bitstreams • Pointers • Annotations