SlideShare a Scribd company logo
ONTO-ToolKit: enabling bio-ontology engineering via Galaxy Aravind Venkatesan,  ONTO-ToolKit: enabling bio-ontology engineering via Galaxy Aravind Venkatesan Systems Biology group, Department of Biology NTNU, Trondheim [email_address]
Overview Galaxy Ontology for Life Sciences ONTO-Toolkit Use  Cases Conclusion Future   Directions Acknowledgment References
Web application that allows flexible retrieval and analyses of the data. Integrated with other resources such the UCSC Genome browsers, BioMart. Galaxy environment aids biologists to manipulate, analyse and build workflows.  Is an open-source scalable framework for tool and data integration suitable for tool developers.
Tool pane – provides various functionality to handle data Data display area History pane –manipulate  uploaded data and build workflow Visit Galaxy!!   http:// galaxy.psu.edu /
Ontology for Life Sciences Ontologies aid in knowledge formalisation and machine interoperability The success of ontologies in the Life Sciences is marked by the wide spread use of Gene Ontology 1   (GO) Application ontologies such as the Cell Cycle Ontology 2 The OBO flat file format 3  (OBOF) and the Web Ontology Language 4   (OWL) have gained wide acceptance as knowledge representation languages.
ONTO-Toolkit Is a collection of tools to manage ontologies represented in the OBO file format within Galaxy environment The tools are wrappers for commonly used functions provided by  ONTO-PERL 5 ONTO-PERL was developed as part of the Semantic Systems Biology 6  (SSB) initiative ONTO-PERL (OBOF-centered PERL API) comprises of extensible set of (Object-oriented) PERL modules  These have an organised set of subroutines to deal with ontologies and is fully compatible with the current OBO specifications (ver. 1.2) The latest version (ver.1.22) of ONTO-PERL can be directly downloaded from CPAN,  http://search.cpan.org/dist/ONTO-PERL/  ONTO-PERL: An API supporting the development and analysis of bio-ontologies . Antezana E, Egana M, De Baets B, Kuiper M, Mironov V. Bioinformatics 2008; doi: 10.1093/bioinformatics/btn042
Examples of ONTO-PERL functionalities Scripts Functionality get_ancestor_terms.pl  Collects the ancestor terms (list of IDs) from a given term (existing ID) in the given OBO ontology. get_child_terms.pl Collects the child terms (list of term IDs and their names) from a given term (existing ID) in the given OBO ontology. get_descendent_terms.pl  Collects the descendent terms (list of IDs) from a given term (existing ID) in the given OBO ontology. get_subontology_from.pl  Extracts a subontology (in OBO format) of a given ontology having the given term ID as the root. get_intersection_ontology.pl Provides an intersection of the given ontologies (in OBO format) obo2owl.pl  OBO to OWL translator. obo2rdf.pl  OBO to RDF translator. obo_trimming.pl  This script trims a given branch of an OBO ontology.
ONTO-Toolkit - GALAXY Define arguments
ONTO-Toolkit - GALAXY
Use Cases To investigate similarities between given molecular functions Collecting all the upstream terms (ancestors) of two given molecular function terms and to identify common ancestors terms. Motivation: To demonstrate the functionality of ONTO-Toolkit in GALAXY To demonstrate the usefulness of ontology engineering in biological domain Use Case I : Chosen Ontology: Cell Cycle Ontology Chosen Terms: Term 1:  id: CCO:F0000004 name: trans-hexaprenyltranstransferase activity Term 2: id: CCO:F0000820 name: homogentisate 1,2-dioxygenase activity Term ID 1 Term ID 2
Use Case I  Uploading an obo ontology file – e.g.: cco_S_pombe
Conti… Molecular function Term ID: CCO:F0000004
This step is repeated for the second term - CCO:F0000820 List of ancestor terms for the given Molecular function Term 1 List of ancestor terms for Term 2
Common ancestor terms Gets the overlapping ancestor terms
Use Case II Identifying overlapping annotations for a given pair of distinct biological process terms Chosen Ontology: Cell Cycle Ontology Chosen Terms: Term 1 :  id: CCO:P0000005 name: cell cycle checkpoint Term 2 : id: CCO:P0000069 name: mitosis Term ID 1 Term ID 2
Use Case II Gets the sub-ontology for the given terms
Generated sub-ontology of  Term 1 : CCO:P0000005 Generated sub-ontology of  Term 2 : CCO:P0000069
Gets the intersection of the two sub-ontologies
Conclusion Use Case I – the results provides evidence  that the two molecular functions are unrelated as only the high level terms are shared by them. Use Case II – the results suggests the possibility  of an overlap between two distinct biological processes ONTO-Toolkit functionalities provides rich-ontology driven solutions within the Galaxy framework Future Directions Provide interface to perform SPARQL queries within Galaxy Provide visualisation module
Acknowledgment Dr. Erick Antezana, NTNU Dr. Vladimir Mironov, NTNU Dr. Martin Kuiper, NTNU References M. Ashburner, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet, 25:25– 29, May 2000. The Cell Cycle Ontology,  http://www.semantic-systems-biology.org/cco The OBO Flat File Format Specification (ver.1.2),  http://www.geneontology.org/GO.format.obo-1_2.shtml OWL Web Ontology Language,  http://www.w3.org/TR/owl-semantics/ ONTO-PERL: An API supporting the development and analysis of bio-ontologies. Antezana E, Egana M, De Baets B, Kuiper M, Mironov V. Bioinformatics 2008; doi: 10.1093/bioinformatics/btn042  Semantic Systems Biology,  http://www.semantic-systems-biology.org/

More Related Content

PPT
Swertz bosc2010 molgenis
PPTX
Mercer bosc2010 microsoft_framework
PPT
The beauty of workflows and models
PPT
Rice bosc2010 emboss
PPTX
Kanterakis bosc2010 molgenis
PDF
E-Utilities
PDF
ISMB Workshop 2014
Swertz bosc2010 molgenis
Mercer bosc2010 microsoft_framework
The beauty of workflows and models
Rice bosc2010 emboss
Kanterakis bosc2010 molgenis
E-Utilities
ISMB Workshop 2014

What's hot (20)

PDF
Initial steps towards a production platform for DNA sequence analysis on the ...
PPT
Introduction to Ontologies for Environmental Biology
PDF
Metadata-based tools at the ENCODE Portal
PPTX
ContentMine (TDM) at JISC Digifest
PPTX
ContentMine + EPMC: Finding Zika!
PDF
From peer-reviewed to peer-reproduced: a role for research objects in scholar...
PDF
Standarization in Proteomics: From raw data to metadata files
PDF
Ontologies for life sciences: examples from the gene ontology
PPTX
Aspects of Reproducibility in Earth Science
PPTX
From Laboratory to e-Laboratory
PDF
Using Neo4j technologies for the management of systems biology models
PPT
Services For Science April 2009
PPTX
Closing the gap between chemistry and biology: Joining between text tombs and...
PPTX
Fairport domain specific metadata using w3 c dcat & skos w ontology views
PPT
exFrame: a Semantic Web Platform for Genomics Experiments
PPT
eXframe: A Semantic Web Platform for Genomic Experiments
PPTX
Data Integration vs Transparency: Tackling the tension
PPTX
VariantSpark a library for genomics by Lynn Langit
PPT
UniProt-GOA
 
Initial steps towards a production platform for DNA sequence analysis on the ...
Introduction to Ontologies for Environmental Biology
Metadata-based tools at the ENCODE Portal
ContentMine (TDM) at JISC Digifest
ContentMine + EPMC: Finding Zika!
From peer-reviewed to peer-reproduced: a role for research objects in scholar...
Standarization in Proteomics: From raw data to metadata files
Ontologies for life sciences: examples from the gene ontology
Aspects of Reproducibility in Earth Science
From Laboratory to e-Laboratory
Using Neo4j technologies for the management of systems biology models
Services For Science April 2009
Closing the gap between chemistry and biology: Joining between text tombs and...
Fairport domain specific metadata using w3 c dcat & skos w ontology views
exFrame: a Semantic Web Platform for Genomics Experiments
eXframe: A Semantic Web Platform for Genomic Experiments
Data Integration vs Transparency: Tackling the tension
VariantSpark a library for genomics by Lynn Langit
UniProt-GOA
 
Ad

Viewers also liked (20)

PDF
Automating JFC UI application testing with Jemmy
PPTX
Web. 2.0 Why Should I Care
PPT
Organisational development innocent agaba
PPTX
PPTX
White balance
PDF
Proton ds userguide
PPT
Kallio bosc2010 chipster-cloud
PDF
Recursos educativos y de formación para docentes de programas bilingües
DOC
Benjamín Arditi (Democracia postliberal participativa)
PDF
World class ink coupon
PDF
Pes Product Life Cycle Storyboard
PDF
Usb may coi truong
PDF
Market Leadership by Scientific Online Community and Open Access
PPTX
Lesdag 3 ouderfactoren
PDF
Ct total eng_110822
PPTX
Releituras Romero Britto
DOC
Benjamín arditi (democracia postliberal participativa)
PPT
Installation of sensor wires and loggers
PPT
RefWorks for DEPARTMENT OF FAMILY MEDICINE - Faculty Development
Automating JFC UI application testing with Jemmy
Web. 2.0 Why Should I Care
Organisational development innocent agaba
White balance
Proton ds userguide
Kallio bosc2010 chipster-cloud
Recursos educativos y de formación para docentes de programas bilingües
Benjamín Arditi (Democracia postliberal participativa)
World class ink coupon
Pes Product Life Cycle Storyboard
Usb may coi truong
Market Leadership by Scientific Online Community and Open Access
Lesdag 3 ouderfactoren
Ct total eng_110822
Releituras Romero Britto
Benjamín arditi (democracia postliberal participativa)
Installation of sensor wires and loggers
RefWorks for DEPARTMENT OF FAMILY MEDICINE - Faculty Development
Ad

Similar to Venkatesan bosc2010 onto-toolkit (20)

PPTX
Ontology Development Kit: Bio-Ontologies 2019
PPTX
Experiences in the biosciences with the open biological ontologies foundry an...
PPT
NCBO Technology Overview
PPTX
Collaboratively Creating the Knowledge Graph of Life
PPTX
All together now: piecing together the knowledge graph of life
PPT
NCBO Technology
PPT
Web services and the Development of Semantic Applications
PPT
G03-SemanticWeb-OntoCAT
PPT
NCBO Tools and Web services
PPT
Enabling Semantically Aware Software Applications
PPTX
Scaling up semantics; lessons learned across the life sciences
PPTX
OntoCAT - integrated programming toolkit for common ontology application task...
PDF
Tutorial: “How to use ontology repositories and ontology–based services”
PPT
Building and Using Ontologies to do biology
PPTX
FAIR data requires FAIR ontologies, how do we do?
PDF
A Cell-Cycle Knowledge Integration Framework
PPTX
Mungall keynote-biocurator-2017
PPTX
Ontologies: Necessary, but not sufficient
PDF
Sense and Similarity: making sense of similarity for ontologies
PPT
Web Services for Semantic Applications in Healthcare and Life Sciences
Ontology Development Kit: Bio-Ontologies 2019
Experiences in the biosciences with the open biological ontologies foundry an...
NCBO Technology Overview
Collaboratively Creating the Knowledge Graph of Life
All together now: piecing together the knowledge graph of life
NCBO Technology
Web services and the Development of Semantic Applications
G03-SemanticWeb-OntoCAT
NCBO Tools and Web services
Enabling Semantically Aware Software Applications
Scaling up semantics; lessons learned across the life sciences
OntoCAT - integrated programming toolkit for common ontology application task...
Tutorial: “How to use ontology repositories and ontology–based services”
Building and Using Ontologies to do biology
FAIR data requires FAIR ontologies, how do we do?
A Cell-Cycle Knowledge Integration Framework
Mungall keynote-biocurator-2017
Ontologies: Necessary, but not sufficient
Sense and Similarity: making sense of similarity for ontologies
Web Services for Semantic Applications in Healthcare and Life Sciences

More from BOSC 2010 (20)

PPT
Langmead bosc2010 cloud-genomics
PDF
Schultheiss bosc2010 persistance-web-services
PDF
Morris bosc2010 evoker
PPT
Kono bosc2010 pathway_projector
PDF
Gautier bosc2010 pythonbioconductor
PDF
Gardler bosc2010 community_developmentattheasf
PDF
Friedberg bosc2010 iprstats
PDF
Fields bosc2010 bio_perl
PDF
Chapman bosc2010 biopython
PDF
Bonnal bosc2010 bio_ruby
PDF
Puton bosc2010 bio_python-modules-rna
PPT
Bader bosc2010 cytoweb
PDF
Talevich bosc2010 bio-phylo
PPTX
Zmasek bosc2010 aptx
PPTX
Wilkinson bosc2010 moby-to-sadi
PPT
Taylor bosc2010
PPTX
Robinson bosc2010 bio_hdf
PPTX
Qiu bosc2010
PPT
Owen bosc2010 taverna2.2-cows
PDF
O connor bosc2010
Langmead bosc2010 cloud-genomics
Schultheiss bosc2010 persistance-web-services
Morris bosc2010 evoker
Kono bosc2010 pathway_projector
Gautier bosc2010 pythonbioconductor
Gardler bosc2010 community_developmentattheasf
Friedberg bosc2010 iprstats
Fields bosc2010 bio_perl
Chapman bosc2010 biopython
Bonnal bosc2010 bio_ruby
Puton bosc2010 bio_python-modules-rna
Bader bosc2010 cytoweb
Talevich bosc2010 bio-phylo
Zmasek bosc2010 aptx
Wilkinson bosc2010 moby-to-sadi
Taylor bosc2010
Robinson bosc2010 bio_hdf
Qiu bosc2010
Owen bosc2010 taverna2.2-cows
O connor bosc2010

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Approach and Philosophy of On baking technology
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Empathic Computing: Creating Shared Understanding
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
KodekX | Application Modernization Development
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
Big Data Technologies - Introduction.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PPTX
Understanding_Digital_Forensics_Presentation.pptx
Spectral efficient network and resource selection model in 5G networks
Network Security Unit 5.pdf for BCA BBA.
Approach and Philosophy of On baking technology
Review of recent advances in non-invasive hemoglobin estimation
Empathic Computing: Creating Shared Understanding
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
NewMind AI Monthly Chronicles - July 2025
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
KodekX | Application Modernization Development
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Big Data Technologies - Introduction.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
“AI and Expert System Decision Support & Business Intelligence Systems”
20250228 LYD VKU AI Blended-Learning.pptx
The AUB Centre for AI in Media Proposal.docx
GamePlan Trading System Review: Professional Trader's Honest Take
Per capita expenditure prediction using model stacking based on satellite ima...
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
Understanding_Digital_Forensics_Presentation.pptx

Venkatesan bosc2010 onto-toolkit

  • 1. ONTO-ToolKit: enabling bio-ontology engineering via Galaxy Aravind Venkatesan, ONTO-ToolKit: enabling bio-ontology engineering via Galaxy Aravind Venkatesan Systems Biology group, Department of Biology NTNU, Trondheim [email_address]
  • 2. Overview Galaxy Ontology for Life Sciences ONTO-Toolkit Use Cases Conclusion Future Directions Acknowledgment References
  • 3. Web application that allows flexible retrieval and analyses of the data. Integrated with other resources such the UCSC Genome browsers, BioMart. Galaxy environment aids biologists to manipulate, analyse and build workflows. Is an open-source scalable framework for tool and data integration suitable for tool developers.
  • 4. Tool pane – provides various functionality to handle data Data display area History pane –manipulate uploaded data and build workflow Visit Galaxy!! http:// galaxy.psu.edu /
  • 5. Ontology for Life Sciences Ontologies aid in knowledge formalisation and machine interoperability The success of ontologies in the Life Sciences is marked by the wide spread use of Gene Ontology 1 (GO) Application ontologies such as the Cell Cycle Ontology 2 The OBO flat file format 3 (OBOF) and the Web Ontology Language 4 (OWL) have gained wide acceptance as knowledge representation languages.
  • 6. ONTO-Toolkit Is a collection of tools to manage ontologies represented in the OBO file format within Galaxy environment The tools are wrappers for commonly used functions provided by ONTO-PERL 5 ONTO-PERL was developed as part of the Semantic Systems Biology 6 (SSB) initiative ONTO-PERL (OBOF-centered PERL API) comprises of extensible set of (Object-oriented) PERL modules These have an organised set of subroutines to deal with ontologies and is fully compatible with the current OBO specifications (ver. 1.2) The latest version (ver.1.22) of ONTO-PERL can be directly downloaded from CPAN, http://search.cpan.org/dist/ONTO-PERL/ ONTO-PERL: An API supporting the development and analysis of bio-ontologies . Antezana E, Egana M, De Baets B, Kuiper M, Mironov V. Bioinformatics 2008; doi: 10.1093/bioinformatics/btn042
  • 7. Examples of ONTO-PERL functionalities Scripts Functionality get_ancestor_terms.pl Collects the ancestor terms (list of IDs) from a given term (existing ID) in the given OBO ontology. get_child_terms.pl Collects the child terms (list of term IDs and their names) from a given term (existing ID) in the given OBO ontology. get_descendent_terms.pl Collects the descendent terms (list of IDs) from a given term (existing ID) in the given OBO ontology. get_subontology_from.pl Extracts a subontology (in OBO format) of a given ontology having the given term ID as the root. get_intersection_ontology.pl Provides an intersection of the given ontologies (in OBO format) obo2owl.pl OBO to OWL translator. obo2rdf.pl OBO to RDF translator. obo_trimming.pl This script trims a given branch of an OBO ontology.
  • 8. ONTO-Toolkit - GALAXY Define arguments
  • 10. Use Cases To investigate similarities between given molecular functions Collecting all the upstream terms (ancestors) of two given molecular function terms and to identify common ancestors terms. Motivation: To demonstrate the functionality of ONTO-Toolkit in GALAXY To demonstrate the usefulness of ontology engineering in biological domain Use Case I : Chosen Ontology: Cell Cycle Ontology Chosen Terms: Term 1: id: CCO:F0000004 name: trans-hexaprenyltranstransferase activity Term 2: id: CCO:F0000820 name: homogentisate 1,2-dioxygenase activity Term ID 1 Term ID 2
  • 11. Use Case I Uploading an obo ontology file – e.g.: cco_S_pombe
  • 12. Conti… Molecular function Term ID: CCO:F0000004
  • 13. This step is repeated for the second term - CCO:F0000820 List of ancestor terms for the given Molecular function Term 1 List of ancestor terms for Term 2
  • 14. Common ancestor terms Gets the overlapping ancestor terms
  • 15. Use Case II Identifying overlapping annotations for a given pair of distinct biological process terms Chosen Ontology: Cell Cycle Ontology Chosen Terms: Term 1 : id: CCO:P0000005 name: cell cycle checkpoint Term 2 : id: CCO:P0000069 name: mitosis Term ID 1 Term ID 2
  • 16. Use Case II Gets the sub-ontology for the given terms
  • 17. Generated sub-ontology of Term 1 : CCO:P0000005 Generated sub-ontology of Term 2 : CCO:P0000069
  • 18. Gets the intersection of the two sub-ontologies
  • 19. Conclusion Use Case I – the results provides evidence that the two molecular functions are unrelated as only the high level terms are shared by them. Use Case II – the results suggests the possibility of an overlap between two distinct biological processes ONTO-Toolkit functionalities provides rich-ontology driven solutions within the Galaxy framework Future Directions Provide interface to perform SPARQL queries within Galaxy Provide visualisation module
  • 20. Acknowledgment Dr. Erick Antezana, NTNU Dr. Vladimir Mironov, NTNU Dr. Martin Kuiper, NTNU References M. Ashburner, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet, 25:25– 29, May 2000. The Cell Cycle Ontology, http://www.semantic-systems-biology.org/cco The OBO Flat File Format Specification (ver.1.2), http://www.geneontology.org/GO.format.obo-1_2.shtml OWL Web Ontology Language, http://www.w3.org/TR/owl-semantics/ ONTO-PERL: An API supporting the development and analysis of bio-ontologies. Antezana E, Egana M, De Baets B, Kuiper M, Mironov V. Bioinformatics 2008; doi: 10.1093/bioinformatics/btn042 Semantic Systems Biology, http://www.semantic-systems-biology.org/

Editor's Notes

  • #4: Explains the motivation of Galaxy – services it provides
  • #5: Explains the basic features of Galaxy
  • #13: The step is repeated for the second OBO term