SlideShare a Scribd company logo
Interoperable Data for KnetMiner and
DFW Use Cases
Elixir BioHackathon 2021
Marco Brandizi <marco.brandizi@rothamsted.ac.uk>
Find this presentation on SlideShare
background source: https://alimentaciosostenible.barcelona/en/protecting-planet/urban-agriculture
Typical KnetMiner Searches
Based on publications, which genes are related to the yellow rust disease?
In which biological processes are their encoded proteins involved?
Typical KnetMiner Data
schema.org, Bioschemas, AgriSchemas
• Ideal for:
• Heterogeneous data, sources, formats
• Informal data
• Exploratory research (including AI)
• Integration/sharing advantages
• Simple and informal, but it’s easy to integrate
• other data (eg, OBO ontologies)
• FAIR-oriented support (eg, Google Dataset
Search)
• The AgriSchemas Project
• A set of use cases modelled with
schema.org, bioschemas
• Reusable data ETL tools
• bioschemas additions and extensions
AgriSchemas: Molecular Biology Use Case
Live: http://knetminer.org/data/rdf/resources/gene_traescs1d02g156000
AgriSchemas: Molecular Biology Use Case
AgriSchemas: Information Artifacts
Live: http://knetminer.org/data/rdf/resources/publication_23105158
AgriSchemas: Gene Expression (and EBI/GXA Data)
Based on publications, which genes are related to the yellow rust disease?
In which biological processes are their encoded proteins involved?
In which tissues are the genes expressed?
AgriSchemas: Gene Expression (and EBI/GXA Data)
Live here.
AgriSchemas: Gene Expression (and EBI/GXA Data)
Live here.
AgriSchemas: Ontologies and Ontology Annotations
Live: http://knetminer.org/data/rdf/resources/cond_outer_pericarp
AgriSchemas: Gene Expression (and EBI/GXA Data)
Based on publications, which genes are related to the yellow rust disease?
In which biological processes are their encoded proteins involved?
In which tissues are the genes expressed?
What is the experimental evidence (ie, Field Trials) for the gene ex?
AgriSchemas: Studies/Experiments/Field Trials
Live: http://knetminer.org/data/rdf/resources/exp_E-MTAB-3103
Detailed modelling about Field Trials:
https://github.com/Rothamsted/agri-schemas/tree/master/doc/miappe-use-case
Use case (includes study, samples, assays):
https://github.com/Rothamsted/agri-schemas/blob/master/doc/miappe-use-case.ttl
Ongoing Work
Use case Data Types Data Sources Status
Molecular Biology Gene, Protein, Pathway
encodes, participates
Via Knetminer: ENSEMBL, UniProt,
TILLING, wheat-expression.com, KEGG
Done.
Ontology Annotations Ontology Term (schema:DefinedTerm)
dc:type, schema:additionalType
Via Knetminer: GO, PO, CROP-Onto Done.
Experiments Study, agri:StudyFactor, PropertyValue EBI/GXA, GLTen, MIAPPE/BrAPI
sources, ?
GXA Done
GLTen use case drafted
MIAPPE, use case drafted
Literature agri:ScholarlyPublication
mentions
Via Knetminer: PubMed Done
Gene Expression bioschema:expressedIn, reified
statements, agri:evidence, agri:pvalue,
agri:baseCondition
EBI/GXA, Via Knetminer: wheat-
expression.com
GXA
Host-pathogen interaction Gene, Phenotype,
agri:ScholarlyPublication
agri:HostPathogenInteraction
agri:evidence
PHI-Base Use case drafted
Weather ? ? TO DO
Dataset metadata Dataset, DataCatalog
license, distribution
knetminer.org/data ongoing
References
• AgriSchemas
• https://github.com/Rothamsted/agri-schemas
• Use cases: https://github.com/Rothamsted/agri-schemas/tree/master/drafts/201904-dfw-
hackathon
• Real data & ETL tools: https://github.com/Rothamsted/agri-schemas/tree/master/dfw-dataset
• Knetminer
• Web site: http://knetminer.org
• Publication: https://doi.org/10.1111/pbi.13583
• Case study about FAIR data:
• https://knetminer.com/cases/the-power-of-standardised-and-fair-knowledge-graphs.html
• FAIR data infrastructure: https://doi.org/10.1515/jib-2018-0023
• Data endpoint: http://knetminer.org/data
• DFW
• AgriSchemas and DFW:
• https://designingfuturewheat.org.uk/dfw-and-fair-agriculture-data-the-knetminer-
experience/
• Me
• https://www.slideshare.net/mbrandizi, https://marcobrandizi.info/about-me/

More Related Content

PPTX
KnetMiner - Knowledge Network Miner
PPTX
Introducing the KnetMiner Knowledge Graph: things, not strings
PPTX
KnetMiner - EBI Workshop 2017
PPTX
AgriSchemas: Sharing Agrifood data with Bioschemas
PPTX
FAIR Agronomy, where are we? The KnetMiner Use Case
PDF
Better Data for a Better World
PPTX
Application of bioinformatics
PDF
Multi-Omics Bioinformatics across Application Domains
KnetMiner - Knowledge Network Miner
Introducing the KnetMiner Knowledge Graph: things, not strings
KnetMiner - EBI Workshop 2017
AgriSchemas: Sharing Agrifood data with Bioschemas
FAIR Agronomy, where are we? The KnetMiner Use Case
Better Data for a Better World
Application of bioinformatics
Multi-Omics Bioinformatics across Application Domains

What's hot (20)

PPT
Intro bioinformatics
PPTX
Introduction to Gene Mining Part A: BLASTn-off!
PDF
Introduction to Bioinformatics
PPTX
Bioinformatics Final Presentation
PPTX
Careers in bioinformatics
PPT
Bioinformatics - Discovering the Bio Logic Of Nature
PPTX
Career oppurtunities in the field of Bioinformatics
DOCX
Bioinformatics, Its Usage and Advantages
PPT
The Seven Deadly Sins of Bioinformatics
PPTX
Introduction to bioinformatics
PPT
B.sc biochem i bobi u-1 introduction to bioinformatics
PDF
Bioinformatics databases: Current Trends and Future Perspectives
PPTX
Bioinformatics
PPT
Role of bioinformatics in life sciences research
PPTX
Introduction to Bioinformatics
PPTX
Database technologies in bioinformatics
PPTX
Model Organism Linked Data
PDF
Introduction to Bioinformatics
PPTX
Careers in bioinformatics, Scope, Skills and Jobs
PDF
Basics of Data Analysis in Bioinformatics
Intro bioinformatics
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Bioinformatics
Bioinformatics Final Presentation
Careers in bioinformatics
Bioinformatics - Discovering the Bio Logic Of Nature
Career oppurtunities in the field of Bioinformatics
Bioinformatics, Its Usage and Advantages
The Seven Deadly Sins of Bioinformatics
Introduction to bioinformatics
B.sc biochem i bobi u-1 introduction to bioinformatics
Bioinformatics databases: Current Trends and Future Perspectives
Bioinformatics
Role of bioinformatics in life sciences research
Introduction to Bioinformatics
Database technologies in bioinformatics
Model Organism Linked Data
Introduction to Bioinformatics
Careers in bioinformatics, Scope, Skills and Jobs
Basics of Data Analysis in Bioinformatics
Ad

Similar to Interoperable Data for KnetMiner and DFW Use Cases (20)

PDF
Bioinformatics data mining
PPTX
Data analysis & integration challenges in genomics
PDF
Investigating plant systems using data integration and network analysis
PPTX
Bioinformatics .pptx
PPTX
Eccmid meet the expert 2015
PDF
Bioinformatics
PDF
bioinformatics enabling knowledge generation from agricultural omics data
PDF
Introduction to Bioinformatics for Molecular Studies
PPT
Data management, data sharing: the SysMO-SEEK Story
PPT
Data sharing - Data management - The SysMO-SEEK Story
PDF
Get Data Mining for Systems Biology Methods and Protocols 1st Edition Koji Ts...
PPTX
Genomics and Bioinformatics
PPT
2011-10-11 Open PHACTS at BioIT World Europe
PPT
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
PDF
Bioinformatics-2009-Moura-1096-8
PDF
call for papers, research paper publishing, where to publish research paper, ...
PPTX
Ondex: Data integration and visualisation
PPTX
Bioinformatica 29-09-2011-t1-bioinformatics
PPTX
Computing on the shoulders of giants
PPTX
ECCMID 2016 - How to build actionable virulome databases
Bioinformatics data mining
Data analysis & integration challenges in genomics
Investigating plant systems using data integration and network analysis
Bioinformatics .pptx
Eccmid meet the expert 2015
Bioinformatics
bioinformatics enabling knowledge generation from agricultural omics data
Introduction to Bioinformatics for Molecular Studies
Data management, data sharing: the SysMO-SEEK Story
Data sharing - Data management - The SysMO-SEEK Story
Get Data Mining for Systems Biology Methods and Protocols 1st Edition Koji Ts...
Genomics and Bioinformatics
2011-10-11 Open PHACTS at BioIT World Europe
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Bioinformatics-2009-Moura-1096-8
call for papers, research paper publishing, where to publish research paper, ...
Ondex: Data integration and visualisation
Bioinformatica 29-09-2011-t1-bioinformatics
Computing on the shoulders of giants
ECCMID 2016 - How to build actionable virulome databases
Ad

More from Rothamsted Research, UK (20)

PPTX
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
PPTX
Continuos Integration @Knetminer
PPTX
AgriSchemas Progress Report
PPTX
AgriFood Data, Models, Standards, Tools, Use Cases
PDF
Notes about SWAT4LS 2018
PPTX
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
PPTX
Knetminer Backend Training, Nov 2018
PPTX
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
PDF
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
PDF
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
ODP
graph2tab, a library to convert experimental workflow graphs into tabular for...
PDF
Interoperable Open Data: Which Recipes?
PDF
Linked Data with the EBI RDF Platform
PDF
BioSD Linked Data: Lessons Learned
PDF
BioSD Tutorial 2014 Editition
PDF
myEquivalents, aka a new cross-reference service
PDF
Dev 2014 LOD tutorial
PDF
BioSamples Database Linked Data, SWAT4LS Tutorial
PDF
Uk onto net_2013_notes_brandizi
Publishing and Consuming FAIR Data A Case in the Agri-Food Domain
Continuos Integration @Knetminer
AgriSchemas Progress Report
AgriFood Data, Models, Standards, Tools, Use Cases
Notes about SWAT4LS 2018
Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMine...
Knetminer Backend Training, Nov 2018
A Preliminary survey of RDF/Neo4j as backends for KnetMiner
Towards FAIRer Biological Knowledge Networks 
Using a Hybrid Linked Data 
and...
Behind the Scenes of KnetMiner: Towards Standardised and Interoperable Knowle...
graph2tab, a library to convert experimental workflow graphs into tabular for...
Interoperable Open Data: Which Recipes?
Linked Data with the EBI RDF Platform
BioSD Linked Data: Lessons Learned
BioSD Tutorial 2014 Editition
myEquivalents, aka a new cross-reference service
Dev 2014 LOD tutorial
BioSamples Database Linked Data, SWAT4LS Tutorial
Uk onto net_2013_notes_brandizi

Recently uploaded (20)

PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
. Radiology Case Scenariosssssssssssssss
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
PPTX
ECG_Course_Presentation د.محمد صقران ppt
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PDF
diccionario toefl examen de ingles para principiante
PDF
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PPTX
Cell Membrane: Structure, Composition & Functions
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
Derivatives of integument scales, beaks, horns,.pptx
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
The KM-GBF monitoring framework – status & key messages.pptx
Biophysics 2.pdffffffffffffffffffffffffff
. Radiology Case Scenariosssssssssssssss
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
bbec55_b34400a7914c42429908233dbd381773.pdf
ECG_Course_Presentation د.محمد صقران ppt
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
diccionario toefl examen de ingles para principiante
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
INTRODUCTION TO EVS | Concept of sustainability
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
Cell Membrane: Structure, Composition & Functions
Classification Systems_TAXONOMY_SCIENCE8.pptx
Derivatives of integument scales, beaks, horns,.pptx

Interoperable Data for KnetMiner and DFW Use Cases

  • 1. Interoperable Data for KnetMiner and DFW Use Cases Elixir BioHackathon 2021 Marco Brandizi <marco.brandizi@rothamsted.ac.uk> Find this presentation on SlideShare background source: https://alimentaciosostenible.barcelona/en/protecting-planet/urban-agriculture
  • 2. Typical KnetMiner Searches Based on publications, which genes are related to the yellow rust disease? In which biological processes are their encoded proteins involved?
  • 4. schema.org, Bioschemas, AgriSchemas • Ideal for: • Heterogeneous data, sources, formats • Informal data • Exploratory research (including AI) • Integration/sharing advantages • Simple and informal, but it’s easy to integrate • other data (eg, OBO ontologies) • FAIR-oriented support (eg, Google Dataset Search) • The AgriSchemas Project • A set of use cases modelled with schema.org, bioschemas • Reusable data ETL tools • bioschemas additions and extensions
  • 5. AgriSchemas: Molecular Biology Use Case Live: http://knetminer.org/data/rdf/resources/gene_traescs1d02g156000
  • 7. AgriSchemas: Information Artifacts Live: http://knetminer.org/data/rdf/resources/publication_23105158
  • 8. AgriSchemas: Gene Expression (and EBI/GXA Data) Based on publications, which genes are related to the yellow rust disease? In which biological processes are their encoded proteins involved? In which tissues are the genes expressed?
  • 9. AgriSchemas: Gene Expression (and EBI/GXA Data) Live here.
  • 10. AgriSchemas: Gene Expression (and EBI/GXA Data) Live here.
  • 11. AgriSchemas: Ontologies and Ontology Annotations Live: http://knetminer.org/data/rdf/resources/cond_outer_pericarp
  • 12. AgriSchemas: Gene Expression (and EBI/GXA Data) Based on publications, which genes are related to the yellow rust disease? In which biological processes are their encoded proteins involved? In which tissues are the genes expressed? What is the experimental evidence (ie, Field Trials) for the gene ex?
  • 13. AgriSchemas: Studies/Experiments/Field Trials Live: http://knetminer.org/data/rdf/resources/exp_E-MTAB-3103 Detailed modelling about Field Trials: https://github.com/Rothamsted/agri-schemas/tree/master/doc/miappe-use-case Use case (includes study, samples, assays): https://github.com/Rothamsted/agri-schemas/blob/master/doc/miappe-use-case.ttl
  • 14. Ongoing Work Use case Data Types Data Sources Status Molecular Biology Gene, Protein, Pathway encodes, participates Via Knetminer: ENSEMBL, UniProt, TILLING, wheat-expression.com, KEGG Done. Ontology Annotations Ontology Term (schema:DefinedTerm) dc:type, schema:additionalType Via Knetminer: GO, PO, CROP-Onto Done. Experiments Study, agri:StudyFactor, PropertyValue EBI/GXA, GLTen, MIAPPE/BrAPI sources, ? GXA Done GLTen use case drafted MIAPPE, use case drafted Literature agri:ScholarlyPublication mentions Via Knetminer: PubMed Done Gene Expression bioschema:expressedIn, reified statements, agri:evidence, agri:pvalue, agri:baseCondition EBI/GXA, Via Knetminer: wheat- expression.com GXA Host-pathogen interaction Gene, Phenotype, agri:ScholarlyPublication agri:HostPathogenInteraction agri:evidence PHI-Base Use case drafted Weather ? ? TO DO Dataset metadata Dataset, DataCatalog license, distribution knetminer.org/data ongoing
  • 15. References • AgriSchemas • https://github.com/Rothamsted/agri-schemas • Use cases: https://github.com/Rothamsted/agri-schemas/tree/master/drafts/201904-dfw- hackathon • Real data & ETL tools: https://github.com/Rothamsted/agri-schemas/tree/master/dfw-dataset • Knetminer • Web site: http://knetminer.org • Publication: https://doi.org/10.1111/pbi.13583 • Case study about FAIR data: • https://knetminer.com/cases/the-power-of-standardised-and-fair-knowledge-graphs.html • FAIR data infrastructure: https://doi.org/10.1515/jib-2018-0023 • Data endpoint: http://knetminer.org/data • DFW • AgriSchemas and DFW: • https://designingfuturewheat.org.uk/dfw-and-fair-agriculture-data-the-knetminer- experience/ • Me • https://www.slideshare.net/mbrandizi, https://marcobrandizi.info/about-me/