SlideShare a Scribd company logo
Formalizing Systems Biology
Models with Biomedical Ontologies
                  work by:
     Robert Hoehndorf, Michel Dumontier,
    John H. Gennari, Sarala Wimalaratne,
 Bernard de Bono, Dan Cook, George Gkoutos

  Cambridge University, Carleton University,
      EBI, University of Washington
        Harmony @ NYC. April 21,2011
Systems Biology

We create and simulate models to :
   gain biological insight into the structure
   and function of biochemical networks
   reveal capabilities and predict
   phenotypes
   undertake metabolic
   engineering maximize desired product

To do this, we need
   to manage our data & knowledge in a
   coherent, scalable and machine
   understandable manner
   use efficient software to execute
   simulations
Computational Knowledge Discovery

 Terminological resources increasingly being used to
 annotate biomolecular models
    easier to explore or find models

 converting models into formal representations of knowledge
    check the annotation consistency
    infer knowledge explicit in terminological resources
    discover biological implications inherent in the models.
SBML

XML-based representation of biochemical models, their
components (compartments, species, reactions, events),
descriptors (rules, constraints, functions, units)

Consider the following enzymatic reaction:
SBML is an XML-based format

<?xml version="1.0" encoding="UTF-8"?>
<sbml level="2" version="3" xmlns="http://www.sbml.org/sbml/level2/version3">
  <model name="EnzymaticReaction">
    <listOfUnitDefinitions>
       <unitDefinition id="per_second">
          <listOfUnits>
             <unit kind="second" exponent="-1"/>
          </listOfUnits>
       </unitDefinition>
       <unitDefinition id="litre_per_mole_per_second">
          <listOfUnits>
             <unit kind="mole" exponent="-1"/>
             <unit kind="litre" exponent="1"/>
             <unit kind="second" exponent="-1"/>
          </listOfUnits>
       </unitDefinition>
    </listOfUnitDefinitions>
    <listOfCompartments>
       <compartment id="cytosol" size="1e-14"/>
    </listOfCompartments>
    <listOfSpecies>
       <species compartment="cytosol" id="ES" initialAmount="0" name="ES"/>
       <species compartment="cytosol" id="P" initialAmount="0" name="P"/>
       <species compartment="cytosol" id="S" initialAmount="1e-20" name="S"/>
       <species compartment="cytosol" id="E" initialAmount="5e-21" name="E"/>
    </listOfSpecies>
<listOfReactions>
   <reaction id="veq">
     <listOfReactants>
        <speciesReference species="E"/>
        <speciesReference species="S"/>
     </listOfReactants>
     <listOfProducts>
        <speciesReference species="ES"/>
     </listOfProducts>
     <kineticLaw>
        <math xmlns="http://www.w3.org/1998/Math/MathML">
           <apply>
              <times/>
              <ci>cytosol</ci>
              <apply>
                 <minus/>
                 <apply>
                   <times/>
                   <ci>kon</ci>
                   <ci>E</ci>
                   <ci>S</ci>
                 </apply>
                 <apply>
                   <times/>
                   <ci>koff</ci>
                   <ci>ES</ci>
                 </apply>
              </apply>
           </apply>
        </math>
        <listOfParameters>
           <parameter id="kon" value="1000000" units="litre_per_mole_per_second"/>
           <parameter id="koff" value="0.2" units="per_second"/>
        </listOfParameters>
     </kineticLaw>
   </reaction>
<reaction id="vcat" reversible="false">
          <listOfReactants>
             <speciesReference species="ES"/>
          </listOfReactants>
          <listOfProducts>
             <speciesReference species="E"/>
             <speciesReference species="P"/>
          </listOfProducts>
          <kineticLaw>
             <math xmlns="http://www.w3.org/1998/Math/MathML">
                <apply>
                   <times/>
                   <ci>cytosol</ci>
                   <ci>kcat</ci>
                   <ci>ES</ci>
                </apply>
             </math>
             <listOfParameters>
                <parameter id="kcat" value="0.1" units="per_second"/>
             </listOfParameters>
          </kineticLaw>
       </reaction>
    </listOfReactions>
  </model>
</sbml>
SBML conceptualization
SBML specifies what models can have
as attributes
Biomodels are semantically annotated
SBML models
  EBI managed
  resource
  600 + models
  available as SBML
  269 models are
  curated with GO
  process, function and
  component terms,
  and has links to
  protein databases.
  Possible to browse
  by GO terms:            http://www.ebi.ac.uk/biomodels-main/
Biomodels are semantically annotated
 SBML models
 <?xml version="1.0" encoding="UTF-8"?>
 <sbml xmlns="http://www.sbml.org/sbml/level2" metaid="metaid_0000001" level="2" version="1">
  <model metaid="metaid_0000002" id="Proctor2005_Hsp90" name="Hsp90model_basis510">
   <annotation>
   <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
     xmlns:bqbiol="http://biomodels.net/biology-qualifiers/"
     xmlns:bqmodel="http://biomodels.net/model-qualifiers/">
     <bqbiol:isVersionOf>
       <rdf:Bag>
        <rdf:li rdf:resource="urn:miriam:obo.go:GO%3A0051085"/>
        <rdf:li rdf:resource="urn:miriam:obo.go:GO%3A0007569"/>
        <rdf:li rdf:resource="urn:miriam:obo.go:GO%3A0009408"/>
       </rdf:Bag>
      </bqbiol:isVersionOf>
     </rdf:RDF>
   </annotation>
   </model>
 </sbml>




GO:0051085                                       GO:0007569                             GO:0009408
chaperone mediated protein                       cell aging                             response to heat
folding requiring cofactor
Gene Ontology

 nearly 30,000 terms
 covers
     biological processes
     molecular functions
     cellular components
 terms organized around "is
 a" hierarchy
 terms further described with
 'has part'/'part of'; 'regulates'
 and '+ regulates', '-
 regulates'
Harmony 2011: Formalization of SBML models as OWL ontologies
Species may be annotated with
UniProt, KEGG, ChEBI terms
  <species metaid="metaid_0000038" id="ROS" name="ROS" compartment="
compartment" initialAmount="100" hasOnlySubstanceUnits="true">
   <annotation>
    <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
      xmlns:bqbiol="http://biomodels.net/biology-qualifiers/"
      xmlns:bqmodel="http://biomodels.net/model-qualifiers/">
      <rdf:Description rdf:about="#metaid_0000038">
       <bqbiol:isVersionOf>
         <rdf:Bag>
          <rdf:li rdf:resource="urn:miriam:obo.chebi:CHEBI%3A26523"/>
         </rdf:Bag>
       </bqbiol:isVersionOf>
      </rdf:Description>                                   CHEBI:26523 reactive
    </rdf:RDF>
   </annotation>                                           oxygen species
  </species>
Chemical Entities of Biological Interest
(ChEBI)
recently refactored to be in line with formal
(reasoning capable) ontology

scope includes chemical entities (atoms,
substances, groups, molecules), roles and
subatomic particles

large numbers of curated molecules
Approach

The idea is to create sophisticated OWL ontologies from
biomolecular models represented using the Systems Biology
Markup Language (SBML).

Features
   ontological commitment: terms in a vocabulary become
   formally defined classes and relations in an ontology
   upper level ontology to distinguish and constrain model
   entities the spatio-temporal entities they represent
   basic relations are used to describe (constrain) entities in
   terms of the attributes and relationships they hold

code: http://code.google.com/p/sbmlharvester/
Conceptualization (SBML)

 2 kinds of entities:
     in silico: model components
     in vivo: the entities represented by a model
An Upper Level Ontology distinguishes
models from the entities they represent
Ontological commitment

Assumption 1: Every model represents a
material entity (Model SubClassOf: represents
some MaterialEntity)
a Model annotated with class C represents
a C that is a subclass of MaterialEntity
Model entities (models and model
  components) are distinguished from the
  entities they represent




every element E of the SBML language represents a class Rep(E) and we
assert that E subClassOf: represents some Rep(E)
Ontological commitment

In addition to annotation to physical entities, we note that
annotations include functions and processes
Ontological commitment

 Assumption 1: Every model represents a material
 entity (Model SubClassOf: represents some
 MaterialEntity)
 a Model annotated with class C represents a
 C that is a subclass of MaterialEntity or a Thing
 that has-function some C or a Thing that has a
 function that is realized by only C's
Model annotations are converted into
 ontology axioms


model annotation:                           ontology axioms:
 'is' | 'isVersionOf' | 'isVariantOf'       subClassOf
  * physical entity -------------------->    has-part some physical-entity
  * molecular function --------------->      has-part some (has-function some function)
  * biological process --------------->      has-part some (
                                              has-function some (realized-by only
                                            process))
Harmony 2011: Formalization of SBML models as OWL ontologies
3. Relations impose additional
constraints, such that inconsistencies
arise when incorrectly used
Ontological commitment

Assertion:
M SubClassOf: represents some C or represents
some (has-function some C) or represents some
(has-function some (realized-by only C))

C SubClassOf: MaterialEntity
Then:
  represents some C is satisfiable
  represents some (has-function some
  C) and represents some (has-function some
  (realized-by only C)) are unsatisfiable
Ontological commitment

Assertion:
M SubClassOf: represents some C or represents
some (has-function some C) or represents some
(has-function some (realized-by only C))

C SubClassOf: Function
Then:
  represents some (has-function some C) is
  satisfiable
  represents some C and represents some (has-
  function some (realized-by only C)) are
  unsatisfiable
Ontological commitment

Assertion:
M SubClassOf: represents some C or represents
some (has-function some C) or represents some
(has-function some (realized-by only C))

C SubClassOf: Process
Then:
  represents some (has-function some (realized-by
  only C)) is satisfiable
  represents some C and represents some (has-
  function some C) are unsatisfiable
SBML2OWL: Implementation

 Combine libSBML and OWLAPI
 Use libSBML to access model structure
    extract MIRIAM annotations in RDF
    use Jena RDF API to parse RDF annotations in SBML
    models
    use OWLAPI to perform conversion of SBML structure
    combine with top-level ontology
SBML2OWL: Implementation

Application to BioModels repository yields:
   OWL ontology with more than 800,000 axioms
   includes all referenced ontologies
       GO
       ChEBI
       Celltype
       FMA
       PATO
Model verification

After reasoning, we found 27 models to be inconsistent

reasons
 1. our representation - functions sometimes found in the place
    of physical entities (e.g. entities that secrete insulin). better
    to constrain with appropriate relations
 2. SBML abused - species used as a measure of time
 3. constraints in the ontologies themselves mean that the
    annotation is simply not possible
Finding inconsistencies with
axiomatically enhanced ontologies
recent work treats function as process and axioms state that an
ATPase activity (GO:0004002) is a Catalytic activity that has
Water and ATP as input, ADP and phosphate as output and is
a part of an ATP catabolic process.
Tho this, we add:
   GO: ATP + Water the only inputs (universal quantification)
   ChEBI: Water, ATP, alpha-D-glucose 6-phosphate are all
   different (disjointness)
BIOMD0000000176 and BIOMD0000000177 models of
anaerobic glycolysis in yeast.
   “ATP” input to “ATPase” reaction, which is annotated with
   ATPase activity. The species “ATP”, however, is mis-
   annotated with Alpha-D-glucose 6-phosphate (CHEBI:
   17665), not with ATP.
Answering questions
Outcomes

The SBML-derived ontologies can be

 i) checked for their consistency, thereby uncovering erroneous
curations

 ii) infer attributes and relations of the substances,
compartments and reactions beyond what was originally
described in the models

iii) answer sophisticated questions across a model knowledge
base
Current work

 Add more of SBML annotations (aka qualifiers) to the
 ontology
     Specify the role of species in the processes that they
     participate in by extracting the roles from the
     semantically annotated kinetic expressions
 Simultaneously query knowledge and simulation results
     currently time course
 Increase performance
     Fit the transformation into one of the more
     computationally efficient OWL2 profiles
Questions?

More Related Content

PPT
Guerrilla Readers - výroční prezentace
PDF
Ipsos store brands muscle in May 2012
PPT
Proekt Kaladina L
PDF
Design thinking in efl context
PPT
Juan Barreto
PDF
Dka Presentation Feb 25
PPTX
Divizare digitală și subdezvoltare tehnologică: sociologia românească pe inte...
PPT
Bob Dylan Y Serrat En Hebreo
Guerrilla Readers - výroční prezentace
Ipsos store brands muscle in May 2012
Proekt Kaladina L
Design thinking in efl context
Juan Barreto
Dka Presentation Feb 25
Divizare digitală și subdezvoltare tehnologică: sociologia românească pe inte...
Bob Dylan Y Serrat En Hebreo

Viewers also liked (20)

PPT
Multicultural health standards around the world
PPSX
Innovation Benefits Realization for Industrial Research (Part-1)
PPT
Positive Thinking
PPT
Report z Vyškov
KEY
IT for Nursing @ RIC - 5
PDF
Russian & English: Nastas Ltrs of Confirmation-Contributions
PDF
Rational Risk and Crisis Management Course
PPT
Camp It, June 2012, How To Design Your Bi Architecture To Capitalize on New T...
PPS
The Economics of Grid-Connected Hybrid Distributed Generation
PDF
Riskope 5 days course on Risk and Crisis Management for top managers and key ...
PPT
Web Configurator
PDF
Warmadge
PDF
Pride Law Fund Auction Catalog 2009
PPT
Health-e-cITi NJ
PPT
F Business Presentatie 180609 Final
PDF
Socialmediastenden
PPTX
7+1 myths about presentation
PDF
pebbleのお話し
PPT
ABTCP-Tappi Presentation 2010
PDF
The Globalized Patient
Multicultural health standards around the world
Innovation Benefits Realization for Industrial Research (Part-1)
Positive Thinking
Report z Vyškov
IT for Nursing @ RIC - 5
Russian & English: Nastas Ltrs of Confirmation-Contributions
Rational Risk and Crisis Management Course
Camp It, June 2012, How To Design Your Bi Architecture To Capitalize on New T...
The Economics of Grid-Connected Hybrid Distributed Generation
Riskope 5 days course on Risk and Crisis Management for top managers and key ...
Web Configurator
Warmadge
Pride Law Fund Auction Catalog 2009
Health-e-cITi NJ
F Business Presentatie 180609 Final
Socialmediastenden
7+1 myths about presentation
pebbleのお話し
ABTCP-Tappi Presentation 2010
The Globalized Patient
Ad

Similar to Harmony 2011: Formalization of SBML models as OWL ontologies (20)

PDF
ISMB2011 Tutorial: Biomedical Ontologies for data integration and verification
PDF
Computational Tools For The Annotation of SBML Models
PPTX
Formal representation of models in systems biology
PPTX
Real World Applications of OWL
PDF
Enriching the Gene Ontology via the Dissection of Labels using the Ontology P...
PDF
Lucian Smith - Modular Modeling: Standards and Tools (IMAG)
PDF
Neuroinformatics conference 2012
PDF
Towards integration of systems biology and biomedical ontologies
PPT
Increasingly Accurate Representation of Biochemistry (v2)
PDF
Ontology Services for the Biomedical Sciences
PPT
libAnnotationSBML
PPT
Informatics In The Manchester Centre For Integrative Systems Biology
PPTX
Modularity requirements in bio-ontologies: a case study of ChEBI
PDF
Linked Data for integrating life-science databases
PPT
Michael Hucka.ppt
PPT
ChEBI and genome scale metabolic reconstructions
PDF
SBML (the Systems Biology Markup Language), model databases, and other resources
PDF
Subliminal: exploiting semantic annotations in the reconstruction of metaboli...
PDF
Connecting life sciences data at the European Bioinformatics Institute
PPT
20090511 Manchester Biochemistry
ISMB2011 Tutorial: Biomedical Ontologies for data integration and verification
Computational Tools For The Annotation of SBML Models
Formal representation of models in systems biology
Real World Applications of OWL
Enriching the Gene Ontology via the Dissection of Labels using the Ontology P...
Lucian Smith - Modular Modeling: Standards and Tools (IMAG)
Neuroinformatics conference 2012
Towards integration of systems biology and biomedical ontologies
Increasingly Accurate Representation of Biochemistry (v2)
Ontology Services for the Biomedical Sciences
libAnnotationSBML
Informatics In The Manchester Centre For Integrative Systems Biology
Modularity requirements in bio-ontologies: a case study of ChEBI
Linked Data for integrating life-science databases
Michael Hucka.ppt
ChEBI and genome scale metabolic reconstructions
SBML (the Systems Biology Markup Language), model databases, and other resources
Subliminal: exploiting semantic annotations in the reconstruction of metaboli...
Connecting life sciences data at the European Bioinformatics Institute
20090511 Manchester Biochemistry
Ad

More from Michel Dumontier (20)

PPTX
Generating (useful) synthetic data for medical research and AI application
PDF
FAIR & AI Ready KGs for Explainable Predictions.pdf
PPTX
FAIR & AI Ready KGs for Explainable Predictions
PPTX
A metadata standard for Knowledge Graphs
PPTX
Data-Driven Discovery Science with FAIR Knowledge Graphs
PDF
Evaluating FAIRness
PPTX
The Role of the FAIR Guiding Principles for an effective Learning Health System
PPTX
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
PPTX
The role of the FAIR Guiding Principles in a Learning Health System
PPTX
Acclerating biomedical discovery with an internet of FAIR data and services -...
PPTX
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
PPTX
Are we FAIR yet? And will it be worth it?
PPTX
The Future of FAIR Data: An international social, legal and technological inf...
PDF
Keynote at the 2018 Maastricht University Dinner
PPTX
The future of science and business - a UM Star Lecture
PPTX
Are we FAIR yet?
PPTX
Developing and assessing FAIR digital resources
PPTX
Advancing Biomedical Knowledge Reuse with FAIR
PPTX
A Framework to develop the FAIR Metrics
PPTX
FAIR principles and metrics for evaluation
Generating (useful) synthetic data for medical research and AI application
FAIR & AI Ready KGs for Explainable Predictions.pdf
FAIR & AI Ready KGs for Explainable Predictions
A metadata standard for Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
Evaluating FAIRness
The Role of the FAIR Guiding Principles for an effective Learning Health System
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
The role of the FAIR Guiding Principles in a Learning Health System
Acclerating biomedical discovery with an internet of FAIR data and services -...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Are we FAIR yet? And will it be worth it?
The Future of FAIR Data: An international social, legal and technological inf...
Keynote at the 2018 Maastricht University Dinner
The future of science and business - a UM Star Lecture
Are we FAIR yet?
Developing and assessing FAIR digital resources
Advancing Biomedical Knowledge Reuse with FAIR
A Framework to develop the FAIR Metrics
FAIR principles and metrics for evaluation

Recently uploaded (20)

PDF
Therapeutic Potential of Citrus Flavonoids in Metabolic Inflammation and Ins...
PPTX
ca esophagus molecula biology detailaed molecular biology of tumors of esophagus
PPTX
Pathophysiology And Clinical Features Of Peripheral Nervous System .pptx
PPTX
Important Obstetric Emergency that must be recognised
PPT
Management of Acute Kidney Injury at LAUTECH
PDF
Intl J Gynecology Obste - 2021 - Melamed - FIGO International Federation o...
PPTX
POLYCYSTIC OVARIAN SYNDROME.pptx by Dr( med) Charles Amoateng
PPTX
neonatal infection(7392992y282939y5.pptx
PDF
Khadir.pdf Acacia catechu drug Ayurvedic medicine
PDF
CT Anatomy for Radiotherapy.pdf eryuioooop
PPTX
Neuropathic pain.ppt treatment managment
PPTX
Electromyography (EMG) in Physiotherapy: Principles, Procedure & Clinical App...
DOC
Adobe Premiere Pro CC Crack With Serial Key Full Free Download 2025
DOCX
RUHS II MBBS Microbiology Paper-II with Answer Key | 6th August 2025 (New Sch...
PPT
MENTAL HEALTH - NOTES.ppt for nursing students
PDF
Medical Evidence in the Criminal Justice Delivery System in.pdf
PPTX
1 General Principles of Radiotherapy.pptx
PPTX
Acid Base Disorders educational power point.pptx
PPT
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
PPTX
Uterus anatomy embryology, and clinical aspects
Therapeutic Potential of Citrus Flavonoids in Metabolic Inflammation and Ins...
ca esophagus molecula biology detailaed molecular biology of tumors of esophagus
Pathophysiology And Clinical Features Of Peripheral Nervous System .pptx
Important Obstetric Emergency that must be recognised
Management of Acute Kidney Injury at LAUTECH
Intl J Gynecology Obste - 2021 - Melamed - FIGO International Federation o...
POLYCYSTIC OVARIAN SYNDROME.pptx by Dr( med) Charles Amoateng
neonatal infection(7392992y282939y5.pptx
Khadir.pdf Acacia catechu drug Ayurvedic medicine
CT Anatomy for Radiotherapy.pdf eryuioooop
Neuropathic pain.ppt treatment managment
Electromyography (EMG) in Physiotherapy: Principles, Procedure & Clinical App...
Adobe Premiere Pro CC Crack With Serial Key Full Free Download 2025
RUHS II MBBS Microbiology Paper-II with Answer Key | 6th August 2025 (New Sch...
MENTAL HEALTH - NOTES.ppt for nursing students
Medical Evidence in the Criminal Justice Delivery System in.pdf
1 General Principles of Radiotherapy.pptx
Acid Base Disorders educational power point.pptx
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
Uterus anatomy embryology, and clinical aspects

Harmony 2011: Formalization of SBML models as OWL ontologies

  • 1. Formalizing Systems Biology Models with Biomedical Ontologies work by: Robert Hoehndorf, Michel Dumontier, John H. Gennari, Sarala Wimalaratne, Bernard de Bono, Dan Cook, George Gkoutos Cambridge University, Carleton University, EBI, University of Washington Harmony @ NYC. April 21,2011
  • 2. Systems Biology We create and simulate models to : gain biological insight into the structure and function of biochemical networks reveal capabilities and predict phenotypes undertake metabolic engineering maximize desired product To do this, we need to manage our data & knowledge in a coherent, scalable and machine understandable manner use efficient software to execute simulations
  • 3. Computational Knowledge Discovery Terminological resources increasingly being used to annotate biomolecular models easier to explore or find models converting models into formal representations of knowledge check the annotation consistency infer knowledge explicit in terminological resources discover biological implications inherent in the models.
  • 4. SBML XML-based representation of biochemical models, their components (compartments, species, reactions, events), descriptors (rules, constraints, functions, units) Consider the following enzymatic reaction:
  • 5. SBML is an XML-based format <?xml version="1.0" encoding="UTF-8"?> <sbml level="2" version="3" xmlns="http://www.sbml.org/sbml/level2/version3"> <model name="EnzymaticReaction"> <listOfUnitDefinitions> <unitDefinition id="per_second"> <listOfUnits> <unit kind="second" exponent="-1"/> </listOfUnits> </unitDefinition> <unitDefinition id="litre_per_mole_per_second"> <listOfUnits> <unit kind="mole" exponent="-1"/> <unit kind="litre" exponent="1"/> <unit kind="second" exponent="-1"/> </listOfUnits> </unitDefinition> </listOfUnitDefinitions> <listOfCompartments> <compartment id="cytosol" size="1e-14"/> </listOfCompartments> <listOfSpecies> <species compartment="cytosol" id="ES" initialAmount="0" name="ES"/> <species compartment="cytosol" id="P" initialAmount="0" name="P"/> <species compartment="cytosol" id="S" initialAmount="1e-20" name="S"/> <species compartment="cytosol" id="E" initialAmount="5e-21" name="E"/> </listOfSpecies>
  • 6. <listOfReactions> <reaction id="veq"> <listOfReactants> <speciesReference species="E"/> <speciesReference species="S"/> </listOfReactants> <listOfProducts> <speciesReference species="ES"/> </listOfProducts> <kineticLaw> <math xmlns="http://www.w3.org/1998/Math/MathML"> <apply> <times/> <ci>cytosol</ci> <apply> <minus/> <apply> <times/> <ci>kon</ci> <ci>E</ci> <ci>S</ci> </apply> <apply> <times/> <ci>koff</ci> <ci>ES</ci> </apply> </apply> </apply> </math> <listOfParameters> <parameter id="kon" value="1000000" units="litre_per_mole_per_second"/> <parameter id="koff" value="0.2" units="per_second"/> </listOfParameters> </kineticLaw> </reaction>
  • 7. <reaction id="vcat" reversible="false"> <listOfReactants> <speciesReference species="ES"/> </listOfReactants> <listOfProducts> <speciesReference species="E"/> <speciesReference species="P"/> </listOfProducts> <kineticLaw> <math xmlns="http://www.w3.org/1998/Math/MathML"> <apply> <times/> <ci>cytosol</ci> <ci>kcat</ci> <ci>ES</ci> </apply> </math> <listOfParameters> <parameter id="kcat" value="0.1" units="per_second"/> </listOfParameters> </kineticLaw> </reaction> </listOfReactions> </model> </sbml>
  • 9. SBML specifies what models can have as attributes
  • 10. Biomodels are semantically annotated SBML models EBI managed resource 600 + models available as SBML 269 models are curated with GO process, function and component terms, and has links to protein databases. Possible to browse by GO terms: http://www.ebi.ac.uk/biomodels-main/
  • 11. Biomodels are semantically annotated SBML models <?xml version="1.0" encoding="UTF-8"?> <sbml xmlns="http://www.sbml.org/sbml/level2" metaid="metaid_0000001" level="2" version="1"> <model metaid="metaid_0000002" id="Proctor2005_Hsp90" name="Hsp90model_basis510"> <annotation> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bqbiol="http://biomodels.net/biology-qualifiers/" xmlns:bqmodel="http://biomodels.net/model-qualifiers/"> <bqbiol:isVersionOf> <rdf:Bag> <rdf:li rdf:resource="urn:miriam:obo.go:GO%3A0051085"/> <rdf:li rdf:resource="urn:miriam:obo.go:GO%3A0007569"/> <rdf:li rdf:resource="urn:miriam:obo.go:GO%3A0009408"/> </rdf:Bag> </bqbiol:isVersionOf> </rdf:RDF> </annotation> </model> </sbml> GO:0051085 GO:0007569 GO:0009408 chaperone mediated protein cell aging response to heat folding requiring cofactor
  • 12. Gene Ontology nearly 30,000 terms covers biological processes molecular functions cellular components terms organized around "is a" hierarchy terms further described with 'has part'/'part of'; 'regulates' and '+ regulates', '- regulates'
  • 14. Species may be annotated with UniProt, KEGG, ChEBI terms <species metaid="metaid_0000038" id="ROS" name="ROS" compartment=" compartment" initialAmount="100" hasOnlySubstanceUnits="true"> <annotation> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:bqbiol="http://biomodels.net/biology-qualifiers/" xmlns:bqmodel="http://biomodels.net/model-qualifiers/"> <rdf:Description rdf:about="#metaid_0000038"> <bqbiol:isVersionOf> <rdf:Bag> <rdf:li rdf:resource="urn:miriam:obo.chebi:CHEBI%3A26523"/> </rdf:Bag> </bqbiol:isVersionOf> </rdf:Description> CHEBI:26523 reactive </rdf:RDF> </annotation> oxygen species </species>
  • 15. Chemical Entities of Biological Interest (ChEBI) recently refactored to be in line with formal (reasoning capable) ontology scope includes chemical entities (atoms, substances, groups, molecules), roles and subatomic particles large numbers of curated molecules
  • 16. Approach The idea is to create sophisticated OWL ontologies from biomolecular models represented using the Systems Biology Markup Language (SBML). Features ontological commitment: terms in a vocabulary become formally defined classes and relations in an ontology upper level ontology to distinguish and constrain model entities the spatio-temporal entities they represent basic relations are used to describe (constrain) entities in terms of the attributes and relationships they hold code: http://code.google.com/p/sbmlharvester/
  • 17. Conceptualization (SBML) 2 kinds of entities: in silico: model components in vivo: the entities represented by a model
  • 18. An Upper Level Ontology distinguishes models from the entities they represent
  • 19. Ontological commitment Assumption 1: Every model represents a material entity (Model SubClassOf: represents some MaterialEntity) a Model annotated with class C represents a C that is a subclass of MaterialEntity
  • 20. Model entities (models and model components) are distinguished from the entities they represent every element E of the SBML language represents a class Rep(E) and we assert that E subClassOf: represents some Rep(E)
  • 21. Ontological commitment In addition to annotation to physical entities, we note that annotations include functions and processes
  • 22. Ontological commitment Assumption 1: Every model represents a material entity (Model SubClassOf: represents some MaterialEntity) a Model annotated with class C represents a C that is a subclass of MaterialEntity or a Thing that has-function some C or a Thing that has a function that is realized by only C's
  • 23. Model annotations are converted into ontology axioms model annotation: ontology axioms: 'is' | 'isVersionOf' | 'isVariantOf' subClassOf * physical entity --------------------> has-part some physical-entity * molecular function ---------------> has-part some (has-function some function) * biological process ---------------> has-part some ( has-function some (realized-by only process))
  • 25. 3. Relations impose additional constraints, such that inconsistencies arise when incorrectly used
  • 26. Ontological commitment Assertion: M SubClassOf: represents some C or represents some (has-function some C) or represents some (has-function some (realized-by only C)) C SubClassOf: MaterialEntity Then: represents some C is satisfiable represents some (has-function some C) and represents some (has-function some (realized-by only C)) are unsatisfiable
  • 27. Ontological commitment Assertion: M SubClassOf: represents some C or represents some (has-function some C) or represents some (has-function some (realized-by only C)) C SubClassOf: Function Then: represents some (has-function some C) is satisfiable represents some C and represents some (has- function some (realized-by only C)) are unsatisfiable
  • 28. Ontological commitment Assertion: M SubClassOf: represents some C or represents some (has-function some C) or represents some (has-function some (realized-by only C)) C SubClassOf: Process Then: represents some (has-function some (realized-by only C)) is satisfiable represents some C and represents some (has- function some C) are unsatisfiable
  • 29. SBML2OWL: Implementation Combine libSBML and OWLAPI Use libSBML to access model structure extract MIRIAM annotations in RDF use Jena RDF API to parse RDF annotations in SBML models use OWLAPI to perform conversion of SBML structure combine with top-level ontology
  • 30. SBML2OWL: Implementation Application to BioModels repository yields: OWL ontology with more than 800,000 axioms includes all referenced ontologies GO ChEBI Celltype FMA PATO
  • 31. Model verification After reasoning, we found 27 models to be inconsistent reasons 1. our representation - functions sometimes found in the place of physical entities (e.g. entities that secrete insulin). better to constrain with appropriate relations 2. SBML abused - species used as a measure of time 3. constraints in the ontologies themselves mean that the annotation is simply not possible
  • 32. Finding inconsistencies with axiomatically enhanced ontologies recent work treats function as process and axioms state that an ATPase activity (GO:0004002) is a Catalytic activity that has Water and ATP as input, ADP and phosphate as output and is a part of an ATP catabolic process. Tho this, we add: GO: ATP + Water the only inputs (universal quantification) ChEBI: Water, ATP, alpha-D-glucose 6-phosphate are all different (disjointness) BIOMD0000000176 and BIOMD0000000177 models of anaerobic glycolysis in yeast. “ATP” input to “ATPase” reaction, which is annotated with ATPase activity. The species “ATP”, however, is mis- annotated with Alpha-D-glucose 6-phosphate (CHEBI: 17665), not with ATP.
  • 34. Outcomes The SBML-derived ontologies can be i) checked for their consistency, thereby uncovering erroneous curations ii) infer attributes and relations of the substances, compartments and reactions beyond what was originally described in the models iii) answer sophisticated questions across a model knowledge base
  • 35. Current work Add more of SBML annotations (aka qualifiers) to the ontology Specify the role of species in the processes that they participate in by extracting the roles from the semantically annotated kinetic expressions Simultaneously query knowledge and simulation results currently time course Increase performance Fit the transformation into one of the more computationally efficient OWL2 profiles