SlideShare a Scribd company logo
Consultant,
Honorary Academic Editor
Associate Director,
Principal Investigator
!
Susanna-Assunta Sansone, PhD!
!
!
Alan Turing Institute Symposium
Oxford, 6-7 April, 2016
A Data Discovery Index prototype that:!
•  Helps users find and access shared data !
•  Interoperates in the NIH Commons (biomedical
digital assets) !
NIH BD2K bioCADDIE DataMed: Data Discovery Index
Repositories
Metadata Ingestion
ElasticSearch
Terminology server
User Interface
Online datasets
PublishersFunding
Agencies
Data producers
DataSources
Ingestion Indexing
Searching
prototype!
aggregator'
A'
B C
A
aggregator'
Data'Discovery'Index'
data'
Organizing framework and
portal for data
Dashed lines:
mapping of metadata standards,
links to aggregators, data
Aggregators:
repositories or various indices
Data:
digital research objects
Pilot projects*Core
development team
* There is work for everyone (and more)
Designed as an element of
the ecosystem!
Use cases- community-driven effort!
The ‘right’ level of metadata elements!!
Examples of competency questions, derived from the use cases
The ‘appropriate’ metadata standards!!
Mapping the landscape of standards and databases in the life sciences
mapped a variety of metadata standards and database schemas
Generic schemas:!
•  schema.org!
•  DataCite!
•  RIF-CS!
•  DCAT!
•  PROV!
•  VOID!
•  Dublin Core !
•  etc…!
Life/biomedical schemas:!
•  BioProject!
•  BioSample!
•  MiNIML!
•  PRIDE-ml!
•  GA4GH metadata schema!
•  SRA xml!
•  CDISC SDM / BRIDGE model !
•  etc…!
We have aimed to have maximum
coverage of use cases with
minimal number of data elements
We do foresee that not all
questions can be answered in full
From to!
Prototype, model, mappings, documentation and more at!
https://biocaddie.org and https://github.com/biocaddie !
Supported by the NIH grant 1U24 AI117966-01 to the University of California, San Diego

More Related Content

PDF
The FAIR Cookbook in a nutshell
PDF
NIH BD2K DataMed metadata model - Force11, 2016
PDF
Data publication: Discover, Explore, Visualise
PDF
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
PDF
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
PDF
The FAIR Principles and FAIRsharing
PDF
Metadata for Interoperable Bioscience
PDF
EnablingFAIR - Open research data in the UK
The FAIR Cookbook in a nutshell
NIH BD2K DataMed metadata model - Force11, 2016
Data publication: Discover, Explore, Visualise
Metadata challenges research and re-usable data - BioSharing, ISA and STATO
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
The FAIR Principles and FAIRsharing
Metadata for Interoperable Bioscience
EnablingFAIR - Open research data in the UK

What's hot (20)

PPTX
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
PPTX
Research information management: making sense of it all
PDF
FAIR Data Management and FAIR Data Sharing
PDF
FAIRsharing poster
PPTX
Creating impact with accessible data in agriculture and nutrition: sharing da...
PPTX
Why would a publisher care about open data?
PDF
All Things Biocuration
PDF
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domain
PDF
FAIR resources, selected examples from ELIXIR-related projects
PPTX
Navigating the data management ecosystem - John Kratz
PDF
The FAIR Cookbook poster
PDF
The FAIR Principles and the IMI FAIRplus project
PDF
FAIR, FAIRplus and the FAIR Cookbook
PPTX
Burton - Security, Privacy and Trust
PDF
Behind the FAIR brand: Thinkers, Doers and Dreamers
PDF
FAIRsharing for RDA Funders Forum
PPTX
NISO Training Thursday Crafting a Scientific Data Management Plan
PDF
Valen Metadata and the [Data] Repository
PPTX
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
PDF
The FAIR movement - Oxford Open Data Week
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
Research information management: making sense of it all
FAIR Data Management and FAIR Data Sharing
FAIRsharing poster
Creating impact with accessible data in agriculture and nutrition: sharing da...
Why would a publisher care about open data?
All Things Biocuration
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domain
FAIR resources, selected examples from ELIXIR-related projects
Navigating the data management ecosystem - John Kratz
The FAIR Cookbook poster
The FAIR Principles and the IMI FAIRplus project
FAIR, FAIRplus and the FAIR Cookbook
Burton - Security, Privacy and Trust
Behind the FAIR brand: Thinkers, Doers and Dreamers
FAIRsharing for RDA Funders Forum
NISO Training Thursday Crafting a Scientific Data Management Plan
Valen Metadata and the [Data] Repository
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
The FAIR movement - Oxford Open Data Week
Ad

Similar to NIH BD2K bioCADDIE DataMed: Data Discovery Index (20)

PDF
The DATS model: datasets descriptions for data discovery in DataMed
PDF
Datasets with bioschemas
PDF
Introduction to DATS v2.2 - NIH May 2017
PPTX
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
PDF
NIH BD2K DataMed data index - DATS model
PPTX
Life Science Database Cross Search and Metadata
PDF
NIH BD2K DataMed model, DATS
PPTX
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
PPT
Data integration
PDF
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
PDF
ISMB Workshop 2014
PDF
Sandusky, "Deep Indexing and Discover of Tables and Figures"
PDF
FAIR and metadata standards - FAIRsharing and Neuroscience
PDF
INSERM - Data Management & Reuse of Health Data - May 2017
PPTX
Bio db core-mockup-v1
PPTX
No Free Lunch: Metadata in the life sciences
PDF
Overview of the NIH BD2K CEDAR centre, on metadata and standards
PDF
Research data catalogues and data interoperability in life sciences
PDF
PDF
BioCADDIE: Descriptive Metadata for Datasets WG3 - ELIXIR All Hands
The DATS model: datasets descriptions for data discovery in DataMed
Datasets with bioschemas
Introduction to DATS v2.2 - NIH May 2017
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH BD2K DataMed data index - DATS model
Life Science Database Cross Search and Metadata
NIH BD2K DataMed model, DATS
NISO Working Group Connection Live! Research Data Metrics Landscape: An Updat...
Data integration
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
ISMB Workshop 2014
Sandusky, "Deep Indexing and Discover of Tables and Figures"
FAIR and metadata standards - FAIRsharing and Neuroscience
INSERM - Data Management & Reuse of Health Data - May 2017
Bio db core-mockup-v1
No Free Lunch: Metadata in the life sciences
Overview of the NIH BD2K CEDAR centre, on metadata and standards
Research data catalogues and data interoperability in life sciences
BioCADDIE: Descriptive Metadata for Datasets WG3 - ELIXIR All Hands
Ad

More from Susanna-Assunta Sansone (20)

PDF
FAIR and Reproducible - GSC, Tucson, Aug 2024
PDF
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
PDF
FAIRsharing-Standards-4-GSC-Aug23.pdf
PDF
FAIR-4-GSC-Sansone-Aug23.pdf
PDF
FAIRsharing & FAIRcookbook at RDA 2023
PDF
NFDI Physical Sciences Colloquium - FAIR
PDF
Metadata Standards
PDF
FAIRcookbook: GSRS22-Singapore
PDF
FAIR Cookbook
PDF
FAIR, community standards and data FAIRification: components and recipes
PDF
FAIRsharing and the FAIR Cookbook
PDF
FAIRsharing for EOSC
PDF
FAIR: standards and services
PDF
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
PDF
FAIRsharing: what we do for policies
PDF
FAIRsharing: how we assist with FAIRness
PDF
ELIXIR FAIR Activities - Examplars
PDF
FAIRsharing - focus on standards and new features
PDF
FAIR data and standards for a coordinated COVID-19 response
PDF
FAIRsharing COVID-19 Collection for The Global Health Network
FAIR and Reproducible - GSC, Tucson, Aug 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIRsharing-Standards-4-GSC-Aug23.pdf
FAIR-4-GSC-Sansone-Aug23.pdf
FAIRsharing & FAIRcookbook at RDA 2023
NFDI Physical Sciences Colloquium - FAIR
Metadata Standards
FAIRcookbook: GSRS22-Singapore
FAIR Cookbook
FAIR, community standards and data FAIRification: components and recipes
FAIRsharing and the FAIR Cookbook
FAIRsharing for EOSC
FAIR: standards and services
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRsharing: what we do for policies
FAIRsharing: how we assist with FAIRness
ELIXIR FAIR Activities - Examplars
FAIRsharing - focus on standards and new features
FAIR data and standards for a coordinated COVID-19 response
FAIRsharing COVID-19 Collection for The Global Health Network

Recently uploaded (20)

PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PDF
Fluorescence-microscope_Botany_detailed content
PPT
Quality review (1)_presentation of this 21
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
1_Introduction to advance data techniques.pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
Clinical guidelines as a resource for EBP(1).pdf
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Database Infoormation System (DBIS).pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Global journeys: estimating international migration
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
Foundation of Data Science unit number two notes
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Fluorescence-microscope_Botany_detailed content
Quality review (1)_presentation of this 21
STUDY DESIGN details- Lt Col Maksud (21).pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Data_Analytics_and_PowerBI_Presentation.pptx
1_Introduction to advance data techniques.pptx
Miokarditis (Inflamasi pada Otot Jantung)
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Clinical guidelines as a resource for EBP(1).pdf
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Database Infoormation System (DBIS).pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
Global journeys: estimating international migration
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Introduction-to-Cloud-ComputingFinal.pptx
Foundation of Data Science unit number two notes

NIH BD2K bioCADDIE DataMed: Data Discovery Index

  • 1. Consultant, Honorary Academic Editor Associate Director, Principal Investigator ! Susanna-Assunta Sansone, PhD! ! ! Alan Turing Institute Symposium Oxford, 6-7 April, 2016
  • 2. A Data Discovery Index prototype that:! •  Helps users find and access shared data ! •  Interoperates in the NIH Commons (biomedical digital assets) !
  • 4. Repositories Metadata Ingestion ElasticSearch Terminology server User Interface Online datasets PublishersFunding Agencies Data producers DataSources Ingestion Indexing Searching prototype!
  • 5. aggregator' A' B C A aggregator' Data'Discovery'Index' data' Organizing framework and portal for data Dashed lines: mapping of metadata standards, links to aggregators, data Aggregators: repositories or various indices Data: digital research objects Pilot projects*Core development team * There is work for everyone (and more) Designed as an element of the ecosystem!
  • 7. The ‘right’ level of metadata elements!! Examples of competency questions, derived from the use cases
  • 8. The ‘appropriate’ metadata standards!! Mapping the landscape of standards and databases in the life sciences
  • 9. mapped a variety of metadata standards and database schemas Generic schemas:! •  schema.org! •  DataCite! •  RIF-CS! •  DCAT! •  PROV! •  VOID! •  Dublin Core ! •  etc…! Life/biomedical schemas:! •  BioProject! •  BioSample! •  MiNIML! •  PRIDE-ml! •  GA4GH metadata schema! •  SRA xml! •  CDISC SDM / BRIDGE model ! •  etc…! We have aimed to have maximum coverage of use cases with minimal number of data elements We do foresee that not all questions can be answered in full From to!
  • 10. Prototype, model, mappings, documentation and more at! https://biocaddie.org and https://github.com/biocaddie ! Supported by the NIH grant 1U24 AI117966-01 to the University of California, San Diego