SlideShare a Scribd company logo
The Subliminal Toolbox: automating steps in the reconstruction of metabolic networks  Neil Swainston Manchester Centre for Integrative Systems Biology 1st Conference on Constraint-based Reconstruction and Analysis Reykjavik, Iceland 24 June2011
Automation Many of these steps can be  automated Subliminal Toolbox How much of the reconstruction process can be aided by  publicly available  resources? Manual curation  still necessary BUT  reduce  what needs to be done Relies on  semantic model annotations e.g.  ChEBI ,  KEGG  terms for metabolites UniProt  terms for enzymes
Model annotations Model annotations provide unambiguous,  unique identifiers  for model components Not very interesting But – also  provide link  to chem/bioinformatics resources via  web services Automatically  extract data  through web services Potentially more interesting / useful
Publicly available resources Formula, Charge, Structure GPR relationships Compartment data Sequence, Metadata Web services
Subliminal Toolbox
Protonate ChEBI provides  web service access  to  structural data InChI ,  SMILES  strings Cheminformatics  software (ChemAxon) can be used to  predict charge state  at given pH Updates  FORMULA ,  CHARGE , metabolite  name  and  annotation Consistency – facilitates merging ✓ ✗
Balance Reaction  elemental  and  charge balancing Prevents  mass violation  and stoichiometric inconsistencies Reactions in resources don ’ t always balance Incorrect stoichiometry Missing  protons,  water, etc. Solution: extract formulae and charges and apply  linear programming
Balance carbon dioxide + 2-Acetolacetate    Pyruvate Ab = 0 A = b represents a vector of stoichiometries CO 2  + C 5 H 7 O 4 -      C 3 H 3 O 3 - Reactants Products Optional reactants Optional products CO2 C5H7O4 C3H3O3 H+ H20 H+ H20 CO2 C 1 5 -3 0 0 0 0 -1 O 2 4 -3 0 1 0 -1 -2 H 0 7 -3 1 2 -1 -2 0 charge 0 -1 1 1 0 -1 0 0 b min T 1 1 1 0 0 0 0 0
Balance Linear programming solver solves Ab = 0 carbon dioxide + 2-Acetolacetate     2   Pyruvate +  H + CO 2  + C 5 H 7 O 4 -      2   C 3 H 3 O 3 -  +  H + Reactants Products Optional reactants Optional products CO2 C5H7O4 C3H3O3 H+ H20 H+ H20 CO2 C 1 5 -3 0 0 0 0 -1 O 2 4 -3 0 1 0 -1 -2 H 0 7 -3 1 2 -1 -2 0 charge 0 -1 1 1 0 -1 0 0 b min T 1 1 1 0 0 0 0 0 b T 1 1 2 0 0 1 0 0
Merge Works on simple premises Metabolites / enzymes are the same if the share the same  annotation  and  compartment Reactions are the same if the have the same  stoichiometry Hence balancing Isoenzymes collated  ( “ OR ”  relationship in GPR definition)
“ Fuzzy ”  Merge Sources can be inconsistent in their definition of  stereochemical precision Considered different: apparently minor but can cause  gaps  in the network beta-D-glucose D-glucose
“ Fuzzy ”  Merge ChEBI is an  ontology  and contains  relationships  between metabolites Edges  not just  nodes
Compartment Determination of  intracellular compartment  in which enzymes operate Two approaches: Extract  curated information  from UniProt annotation Extract  protein sequence  from UniProt and pipe to WoLF PSORT localisation prediction algorithm Infer reaction localisation  from enzyme localisation Hypothetical transport reactions  added to be validated / deleted in manual curation phase
Transporters and biomass Extracellular  and  intracellular  transporters are required TransportDB  and  Transporter Classification DB  are sources of transporter proteins BUT  not comprehensive  enough to automatically assign these to individual reactions Biomass reaction  should be added  manually Automation breaks down…
Shorthand Converted to  SBML, protonated, balanced  and  merged  as before
Usage and availability Java  programming  API Simple  command line  interface SBML file in / SBML file out Has some  dependencies  on third-party software Free  to  academic  users http://www.mcisb.org/subliminal/
Thanks…
The Subliminal Toolbox: automating steps in the reconstruction of metabolic networks  Neil Swainston Manchester Centre for Integrative Systems Biology 1st Conference on Constraint-based Reconstruction and Analysis Reykjavik, Iceland 24 June 2011
UniProt compartmentalisation
UniProt compartmentalisation UniProt curated information
PSORT compartmentalisation
PSORT compartmentalisation WoLF PSORT prediction

More Related Content

PDF
ElogPoct: A Tool for Lipophilicity Determination in Drug Discovery
PPTX
Go pathway-interaction-integration
PDF
Pittcon 3 5-2014
PPTX
Retrosynthetic analysis in organic synthesis
PDF
ElogDoct: A Tool for Lipophilicity Determination in Drug Discovery. 2. Basic ...
PPT
VLife SCOPE for Lead Optimization
PDF
sugar - 1
PPT
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
ElogPoct: A Tool for Lipophilicity Determination in Drug Discovery
Go pathway-interaction-integration
Pittcon 3 5-2014
Retrosynthetic analysis in organic synthesis
ElogDoct: A Tool for Lipophilicity Determination in Drug Discovery. 2. Basic ...
VLife SCOPE for Lead Optimization
sugar - 1
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...

Similar to The Subliminal Toolbox: automating steps in the reconstruction of metabolic networks (20)

PPT
ChEBI and genome scale metabolic reconstructions
PDF
Subliminal: exploiting semantic annotations in the reconstruction of metaboli...
PDF
Reactor Metabolizer virtual reactions for discovery: US UGM 2008
PPTX
Chapter 1 fundamentals of biochemical engineering
PDF
Predicting Novel Metabolic Pathways through Subgraph Mining
PDF
Network cheminformatics: gap filling and identifying new reactions in metabol...
PDF
Pah res-potentia-netsci emailable-stagebuild
PDF
AI and Machine Learning for Secondary Metabolite Prediction
PDF
soutenance
PPTX
Modularity in metabolic networks
PPT
Informatics In The Manchester Centre For Integrative Systems Biology
PDF
Continued development of ChEBI towards better usability for the systems biolo...
PDF
A new algorithm for Predicting Metabolic Pathways
PDF
Building a Model Organism Metabolome Database
PDF
art%3A10.1186%2F1471-2105-13-93
PPTX
Metabolomics
PDF
Building an efficient infrastructure, standards and data flow for metabolomics
PPTX
Flux balance analysis
PPTX
Research Application of MOLECULAR DOCKING
PPTX
Cobra phylogeny paper slides
ChEBI and genome scale metabolic reconstructions
Subliminal: exploiting semantic annotations in the reconstruction of metaboli...
Reactor Metabolizer virtual reactions for discovery: US UGM 2008
Chapter 1 fundamentals of biochemical engineering
Predicting Novel Metabolic Pathways through Subgraph Mining
Network cheminformatics: gap filling and identifying new reactions in metabol...
Pah res-potentia-netsci emailable-stagebuild
AI and Machine Learning for Secondary Metabolite Prediction
soutenance
Modularity in metabolic networks
Informatics In The Manchester Centre For Integrative Systems Biology
Continued development of ChEBI towards better usability for the systems biolo...
A new algorithm for Predicting Metabolic Pathways
Building a Model Organism Metabolome Database
art%3A10.1186%2F1471-2105-13-93
Metabolomics
Building an efficient infrastructure, standards and data flow for metabolomics
Flux balance analysis
Research Application of MOLECULAR DOCKING
Cobra phylogeny paper slides
Ad

More from Neil Swainston (10)

PPT
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
PPT
Data Integration, Mass Spectrometry Proteomics Software Development
PPT
Data standards for systems biology
PPT
Data standards for systems biology
PPT
Integrative information management for systems biology
PDF
SBML Browse
PPT
iQconCAT: quantitative proteomics from instrument to browser
PPT
Quantitative Proteomics: From Instrument To Browser
PPT
QconCat: From Instrument To Browser
PPT
libAnnotationSBML
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
Data Integration, Mass Spectrometry Proteomics Software Development
Data standards for systems biology
Data standards for systems biology
Integrative information management for systems biology
SBML Browse
iQconCAT: quantitative proteomics from instrument to browser
Quantitative Proteomics: From Instrument To Browser
QconCat: From Instrument To Browser
libAnnotationSBML
Ad

Recently uploaded (20)

PDF
Getting Started with Data Integration: FME Form 101
PPTX
Big Data Technologies - Introduction.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Electronic commerce courselecture one. Pdf
PDF
Approach and Philosophy of On baking technology
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Machine Learning_overview_presentation.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
Getting Started with Data Integration: FME Form 101
Big Data Technologies - Introduction.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
MIND Revenue Release Quarter 2 2025 Press Release
Electronic commerce courselecture one. Pdf
Approach and Philosophy of On baking technology
Spectral efficient network and resource selection model in 5G networks
Building Integrated photovoltaic BIPV_UPV.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Digital-Transformation-Roadmap-for-Companies.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
Empathic Computing: Creating Shared Understanding
Machine Learning_overview_presentation.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
SOPHOS-XG Firewall Administrator PPT.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
Network Security Unit 5.pdf for BCA BBA.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing

The Subliminal Toolbox: automating steps in the reconstruction of metabolic networks

  • 1. The Subliminal Toolbox: automating steps in the reconstruction of metabolic networks Neil Swainston Manchester Centre for Integrative Systems Biology 1st Conference on Constraint-based Reconstruction and Analysis Reykjavik, Iceland 24 June2011
  • 2. Automation Many of these steps can be automated Subliminal Toolbox How much of the reconstruction process can be aided by publicly available resources? Manual curation still necessary BUT reduce what needs to be done Relies on semantic model annotations e.g. ChEBI , KEGG terms for metabolites UniProt terms for enzymes
  • 3. Model annotations Model annotations provide unambiguous, unique identifiers for model components Not very interesting But – also provide link to chem/bioinformatics resources via web services Automatically extract data through web services Potentially more interesting / useful
  • 4. Publicly available resources Formula, Charge, Structure GPR relationships Compartment data Sequence, Metadata Web services
  • 6. Protonate ChEBI provides web service access to structural data InChI , SMILES strings Cheminformatics software (ChemAxon) can be used to predict charge state at given pH Updates FORMULA , CHARGE , metabolite name and annotation Consistency – facilitates merging ✓ ✗
  • 7. Balance Reaction elemental and charge balancing Prevents mass violation and stoichiometric inconsistencies Reactions in resources don ’ t always balance Incorrect stoichiometry Missing protons, water, etc. Solution: extract formulae and charges and apply linear programming
  • 8. Balance carbon dioxide + 2-Acetolacetate  Pyruvate Ab = 0 A = b represents a vector of stoichiometries CO 2 + C 5 H 7 O 4 -  C 3 H 3 O 3 - Reactants Products Optional reactants Optional products CO2 C5H7O4 C3H3O3 H+ H20 H+ H20 CO2 C 1 5 -3 0 0 0 0 -1 O 2 4 -3 0 1 0 -1 -2 H 0 7 -3 1 2 -1 -2 0 charge 0 -1 1 1 0 -1 0 0 b min T 1 1 1 0 0 0 0 0
  • 9. Balance Linear programming solver solves Ab = 0 carbon dioxide + 2-Acetolacetate  2 Pyruvate + H + CO 2 + C 5 H 7 O 4 -  2 C 3 H 3 O 3 - + H + Reactants Products Optional reactants Optional products CO2 C5H7O4 C3H3O3 H+ H20 H+ H20 CO2 C 1 5 -3 0 0 0 0 -1 O 2 4 -3 0 1 0 -1 -2 H 0 7 -3 1 2 -1 -2 0 charge 0 -1 1 1 0 -1 0 0 b min T 1 1 1 0 0 0 0 0 b T 1 1 2 0 0 1 0 0
  • 10. Merge Works on simple premises Metabolites / enzymes are the same if the share the same annotation and compartment Reactions are the same if the have the same stoichiometry Hence balancing Isoenzymes collated ( “ OR ” relationship in GPR definition)
  • 11. “ Fuzzy ” Merge Sources can be inconsistent in their definition of stereochemical precision Considered different: apparently minor but can cause gaps in the network beta-D-glucose D-glucose
  • 12. “ Fuzzy ” Merge ChEBI is an ontology and contains relationships between metabolites Edges not just nodes
  • 13. Compartment Determination of intracellular compartment in which enzymes operate Two approaches: Extract curated information from UniProt annotation Extract protein sequence from UniProt and pipe to WoLF PSORT localisation prediction algorithm Infer reaction localisation from enzyme localisation Hypothetical transport reactions added to be validated / deleted in manual curation phase
  • 14. Transporters and biomass Extracellular and intracellular transporters are required TransportDB and Transporter Classification DB are sources of transporter proteins BUT not comprehensive enough to automatically assign these to individual reactions Biomass reaction should be added manually Automation breaks down…
  • 15. Shorthand Converted to SBML, protonated, balanced and merged as before
  • 16. Usage and availability Java programming API Simple command line interface SBML file in / SBML file out Has some dependencies on third-party software Free to academic users http://www.mcisb.org/subliminal/
  • 18. The Subliminal Toolbox: automating steps in the reconstruction of metabolic networks Neil Swainston Manchester Centre for Integrative Systems Biology 1st Conference on Constraint-based Reconstruction and Analysis Reykjavik, Iceland 24 June 2011