SlideShare a Scribd company logo
Geometric Algorithms for Sampling
the Flux Space of Metabolic Networks
Apostolos Chalkis 1, 2, 3
Vissarion Fisikopoulos (NKUA), Elias Tsigaridas (INRIA),
Haris Zafeiropoulos (IMBBC-HCMR)
1National & Kapodistrian U. Athens, Greece
2ATHENA Research & Innovation Center, Greece
3GeomScale org
June 11, 2021
In our cells...
We call both the inputs (reactants) and the outputs (products)
of a chemical reaction, metabolites.
In every cell of our body
thousands of chemical reactions are taking place!
The reactions interact
A small fragment of the human metabolic network
Q: How can we model all the interactions between chemical
sdf reactions in an organism?
A: Computational geometry can help!
It is a complex problem
The complete human metabolic network Recon1 [Palsson et al.’07]
A small example of a metabolic network
A small example of a metabolic network
Key Concept of a metabolic network:
Reaction Fluxes
The i-th reaction has a flux (rate) vi that is flowing.
vi multiplies each metabolite in the i-th reaction.
Matrix representation of a metabolic network
11 metabolites and 4 reactions
Use S ∈ Rm×n and flux vector v ∈ Rn to express the
change of the mass of each metabolite over time [Palsson’15],
dr
dt
= Sv = S




v1
v2
v3
v4



 = v1S(·,1) + v2S(·,2) + v3S(·,3) + v4S(·,4)
Steady states: The network in balance
When for each metabolite, the rate of production equals to the
rate of consumption, the reactions exactly balance each other.
When a flux vector v balances the network,
Sv = 0,
v is a steady state.
The region of steady states
As a low dimensional polytope.
Sv = 0,
vlb ≤ v ≤ vub ←→
v=Nx
S ∈ Rm×n
, v ∈ Rn
As a full dimensional polytope
P := {x ∈ Rd | Ax ≤ b}
A =

InN
−InN

and b =

vub
vlb

N
N ∈ Rn×d the matrix of the right nullspace of S.
Sampling steady states
Sampling could lead to important biological insights! [Palsson’15]
Explore the flux space [Schellenberger,Palsson’09].
We introduce a
Multiphase Monte Carlo Sampling algorithm
based on Billiard Walk
Billiard walk
BW(P, pi ) [Gryazina,Polyak’14]
1 Generate the length of the trajectory L ∼ D.
Existing work on
uniform sampling
1. Hit-and-run
[Lovász et al.’06]
2. Geodesic walk
[Lee et al.’17]
3. Ball walk
[Lee et al.’17]
4. Vaidya walk
[Chen et al.’17]
5. Riemmanian HMC
[Lee et al.’18]
6. Dikin walk
[Laddha et al.’20]
7. Coordinate
Hit-and-Run
[Laddha et al.’21]
8. HMC  reflections
[Chevallier et al.’21]
Billiard walk
BW(P, pi ) [Gryazina,Polyak’14]
1 Generate the length of the trajectory L ∼ D.
2 Pick a uniform direction v to define the trajectory.
Existing work on
uniform sampling
1. Hit-and-run
[Lovász et al.’06]
2. Geodesic walk
[Lee et al.’17]
3. Ball walk
[Lee et al.’17]
4. Vaidya walk
[Chen et al.’17]
5. Riemmanian HMC
[Lee et al.’18]
6. Dikin walk
[Laddha et al.’20]
7. Coordinate
Hit-and-Run
[Laddha et al.’21]
8. HMC  reflections
[Chevallier et al.’21]
Billiard walk
BW(P, pi ) [Gryazina,Polyak’14]
1 Generate the length of the trajectory L ∼ D.
2 Pick a uniform direction v to define the trajectory.
Existing work on
uniform sampling
1. Hit-and-run
[Lovász et al.’06]
2. Geodesic walk
[Lee et al.’17]
3. Ball walk
[Lee et al.’17]
4. Vaidya walk
[Chen et al.’17]
5. Riemmanian HMC
[Lee et al.’18]
6. Dikin walk
[Laddha et al.’20]
7. Coordinate
Hit-and-Run
[Laddha et al.’21]
8. HMC  reflections
[Chevallier et al.’21]
Billiard walk
BW(P, pi ) [Gryazina,Polyak’14]
1 Generate the length of the trajectory L ∼ D.
2 Pick a uniform direction v to define the trajectory.
3 The trajectory reflects on the boundary if necessary.
Existing work on
uniform sampling
1. Hit-and-run
[Lovász et al.’06]
2. Geodesic walk
[Lee et al.’17]
3. Ball walk
[Lee et al.’17]
4. Vaidya walk
[Chen et al.’17]
5. Riemmanian HMC
[Lee et al.’18]
6. Dikin walk
[Laddha et al.’20]
7. Coordinate
Hit-and-Run
[Laddha et al.’21]
8. HMC  reflections
[Chevallier et al.’21]
Billiard walk
BW(P, pi ) [Gryazina,Polyak’14]
1 Generate the length of the trajectory L ∼ D.
2 Pick a uniform direction v to define the trajectory.
3 The trajectory reflects on the boundary if necessary.
Existing work on
uniform sampling
1. Hit-and-run
[Lovász et al.’06]
2. Geodesic walk
[Lee et al.’17]
3. Ball walk
[Lee et al.’17]
4. Vaidya walk
[Chen et al.’17]
5. Riemmanian HMC
[Lee et al.’18]
6. Dikin walk
[Laddha et al.’20]
7. Coordinate
Hit-and-Run
[Laddha et al.’21]
8. HMC  reflections
[Chevallier et al.’21]
Billiard walk
BW(P, pi ) [Gryazina,Polyak’14]
1 Generate the length of the trajectory L ∼ D.
2 Pick a uniform direction v to define the trajectory.
3 The trajectory reflects on the boundary if necessary.
4 return the end of the trajectory as pi+1.
Existing work on
uniform sampling
1. Hit-and-run
[Lovász et al.’06]
2. Geodesic walk
[Lee et al.’17]
3. Ball walk
[Lee et al.’17]
4. Vaidya walk
[Chen et al.’17]
5. Riemmanian HMC
[Lee et al.’18]
6. Dikin walk
[Laddha et al.’20]
7. Coordinate
Hit-and-Run
[Laddha et al.’21]
8. HMC  reflections
[Chevallier et al.’21]
Billiard walk
p
q
pi pi+1
Mixing time: O∗(`2d2) well-rounded convex bodies
[C,Papachristou,Fisikopoulos,Tsigaridas,’21],
Improved cost per point: O(kd) operations [This paper].
`: upper bound on the number of reflections, k: number of facets.
Difficulties
When the polytope is skinny:
The average number of reflections increases.
The mixing rate decreases.
Multiphase Monte Carlo Sampling
MMCS(P0, p, N, set i = 0)
1 Sample O(d) points from Pi with Billiard walk.
2 Estimate the Effective Sample Size (ESS) of the sample
in Pi .
3 Map the sampled points to an isotropic position and
apply the same transformation Ti to Pi , set
Pi+1 = Ti (Pi ).
4 i = i + 1; goto 1.
5 Stop when the sum of ESS  N and PSRF  1.1.
Related work
(optimization)
1. [Bertsimas et al.’04]
2. [Kalai et al.’06]
Experiments
MMCS cobra
model (d) Time (sec) (Steps) Time (sec) (Steps)
iAB RBC 283 130 5.20e+01 1.07e+04 7.85e+03 4.05e+08
iAT PLT 636 289 3.25e+02 1.04e+04 1.73e+04 6.68e+08
iML1515 633 4.65e+03 5.65e+04 1.15e+05 3.21e+09
Recon1 931 8.09e+03 1.94e+04 3.20e+05 6.93e+09
Recon2D 2430 2.48e+04 5.44e+04 ∼ 140 days 1.57e+11
Recon3D 5335 1.03e+05 1.44e+05 – –
Package cobra [Palsson, Thiele, Fleming et al.’19]:
state-of-the-art software to analyze metabolic networks,
Coordinate Directions Hit-and-Run [Cousins, Vempala et al.’17].
MMCS outperforms existing software.
Estimate flux distributions
Reaction GLGNS1 is crucial for the
growth rate of an organism. Either slow Human biomass production
and small GLGNS1 flux or fast biomass
production and large GLGNS1 flux.
Find possible anti-COVID19 targets
Not an anti-viral target Possible anti-viral target
Sample steady states when,
the growth rate of COVID-19 is optimized,
the host biomass production is optimized.
Check if the flux distribution of a reaction changes
[Renz,Widerspick,Dräger’20,’21].
Open code - open science
MMCS implemented in package volesti of Geomscale Org.
github.com/GeomScale/volume_approximation.
geomscale.github.io/.
Thank you!

More Related Content

PDF
The slides of my Ph.D. defense
PDF
dingo: a python package to analyzes metabolic networks
PDF
Presentation of volesti in eRum 2020
PDF
Practical Volume Estimation of Zonotopes by a new Annealing Schedule for Cool...
PDF
Sampling Spectrahedra: Volume Approximation and Optimization
PDF
Presentation socg
PDF
Practical volume estimation of polytopes by billiard trajectories and a new a...
PDF
14th Athens Colloquium on Algorithms and Complexity (ACAC19)
The slides of my Ph.D. defense
dingo: a python package to analyzes metabolic networks
Presentation of volesti in eRum 2020
Practical Volume Estimation of Zonotopes by a new Annealing Schedule for Cool...
Sampling Spectrahedra: Volume Approximation and Optimization
Presentation socg
Practical volume estimation of polytopes by billiard trajectories and a new a...
14th Athens Colloquium on Algorithms and Complexity (ACAC19)

What's hot (13)

PDF
Derivatives Lesson Oct 5
PDF
On complementarity in qec and quantum cryptography
PDF
RuleML 2015 Constraint Handling Rules - What Else?
PDF
Volume and edge skeleton computation in high dimensions
PDF
Nevenko Bilić "Tachyon inflation on the holographic braneworld"
PDF
Semi-automatic ABC: a discussion
PDF
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
PDF
Code of the multidimensional fractional pseudo-Newton method using recursive ...
PDF
Pres_Zurich14
PPTX
Engineering Economics
PDF
Bagging-Clustering Methods to Forecast Time Series
PDF
Efficient Random-Walk Methods forApproximating Polytope Volume
PDF
Time distatnce
Derivatives Lesson Oct 5
On complementarity in qec and quantum cryptography
RuleML 2015 Constraint Handling Rules - What Else?
Volume and edge skeleton computation in high dimensions
Nevenko Bilić "Tachyon inflation on the holographic braneworld"
Semi-automatic ABC: a discussion
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
Code of the multidimensional fractional pseudo-Newton method using recursive ...
Pres_Zurich14
Engineering Economics
Bagging-Clustering Methods to Forecast Time Series
Efficient Random-Walk Methods forApproximating Polytope Volume
Time distatnce
Ad

Similar to Symposium of Computational Geometry 2021 (20)

PDF
volesti: sampling efficiently from high dimensional distributions
PDF
Efficient sampling of constraint spaces in practice
PDF
High-dimensional sampling and volume computation
PDF
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
PDF
High-dimensional sampling and volume computation
PPTX
Monte Carlo Berkeley.pptx
PDF
Sampling and Markov Chain Monte Carlo Techniques
PDF
master-thesis
PDF
Andreas Eberle
PDF
Poster for Bayesian Statistics in the Big Data Era conference
PDF
A discussion on sampling graphs to approximate network classification functions
PDF
Sampling strategies for Sequential Monte Carlo (SMC) methods
PPTX
PRML Chapter 11
PPT
Randomizing genome-scale metabolic networks
PDF
Multiple estimators for Monte Carlo approximations
PDF
Metodo Monte Carlo -Wang Landau
PDF
thesis_final_draft
PDF
Deal_Final_Paper_PY525f15
PDF
Community Finding with Applications on Phylogenetic Networks [Presentation]
PDF
A bit about мcmc
volesti: sampling efficiently from high dimensional distributions
Efficient sampling of constraint spaces in practice
High-dimensional sampling and volume computation
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
High-dimensional sampling and volume computation
Monte Carlo Berkeley.pptx
Sampling and Markov Chain Monte Carlo Techniques
master-thesis
Andreas Eberle
Poster for Bayesian Statistics in the Big Data Era conference
A discussion on sampling graphs to approximate network classification functions
Sampling strategies for Sequential Monte Carlo (SMC) methods
PRML Chapter 11
Randomizing genome-scale metabolic networks
Multiple estimators for Monte Carlo approximations
Metodo Monte Carlo -Wang Landau
thesis_final_draft
Deal_Final_Paper_PY525f15
Community Finding with Applications on Phylogenetic Networks [Presentation]
A bit about мcmc
Ad

Recently uploaded (20)

DOCX
Action plan to easily understanding okey
DOC
EVC毕业证学历认证,北密歇根大学毕业证留学硕士毕业证
PPTX
Research Process - Research Methods course
PPTX
ANICK 6 BIRTHDAY....................................................
PDF
Module 7 guard mounting of security pers
PPTX
Copy- of-Lesson-6-Digestive-System.pptx
PPT
Lessons from Presentation Zen_ how to craft your story visually
PDF
6.-propertise of noble gases, uses and isolation in noble gases
PPTX
Module_4_Updated_Presentation CORRUPTION AND GRAFT IN THE PHILIPPINES.pptx
PDF
Unnecessary information is required for the
PPTX
Unit 8#Concept of teaching and learning.pptx
PPTX
Shizophrnia ppt for clinical psychology students of AS
PDF
_Nature and dynamics of communities and community development .pdf
PDF
MODULE 3 BASIC SECURITY DUTIES AND ROLES.pdf
PPTX
Sustainable Forest Management ..SFM.pptx
PPTX
Lesson-7-Gas. -Exchange_074636.pptx
PDF
PM Narendra Modi's speech from Red Fort on 79th Independence Day.pdf
PPTX
CAPE CARIBBEAN STUDIES- Integration-1.pptx
PDF
Presentation on cloud computing and ppt..
PDF
Yusen Logistics Group Sustainability Report 2024.pdf
Action plan to easily understanding okey
EVC毕业证学历认证,北密歇根大学毕业证留学硕士毕业证
Research Process - Research Methods course
ANICK 6 BIRTHDAY....................................................
Module 7 guard mounting of security pers
Copy- of-Lesson-6-Digestive-System.pptx
Lessons from Presentation Zen_ how to craft your story visually
6.-propertise of noble gases, uses and isolation in noble gases
Module_4_Updated_Presentation CORRUPTION AND GRAFT IN THE PHILIPPINES.pptx
Unnecessary information is required for the
Unit 8#Concept of teaching and learning.pptx
Shizophrnia ppt for clinical psychology students of AS
_Nature and dynamics of communities and community development .pdf
MODULE 3 BASIC SECURITY DUTIES AND ROLES.pdf
Sustainable Forest Management ..SFM.pptx
Lesson-7-Gas. -Exchange_074636.pptx
PM Narendra Modi's speech from Red Fort on 79th Independence Day.pdf
CAPE CARIBBEAN STUDIES- Integration-1.pptx
Presentation on cloud computing and ppt..
Yusen Logistics Group Sustainability Report 2024.pdf

Symposium of Computational Geometry 2021

  • 1. Geometric Algorithms for Sampling the Flux Space of Metabolic Networks Apostolos Chalkis 1, 2, 3 Vissarion Fisikopoulos (NKUA), Elias Tsigaridas (INRIA), Haris Zafeiropoulos (IMBBC-HCMR) 1National & Kapodistrian U. Athens, Greece 2ATHENA Research & Innovation Center, Greece 3GeomScale org June 11, 2021
  • 2. In our cells... We call both the inputs (reactants) and the outputs (products) of a chemical reaction, metabolites. In every cell of our body thousands of chemical reactions are taking place!
  • 3. The reactions interact A small fragment of the human metabolic network Q: How can we model all the interactions between chemical sdf reactions in an organism? A: Computational geometry can help!
  • 4. It is a complex problem The complete human metabolic network Recon1 [Palsson et al.’07]
  • 5. A small example of a metabolic network
  • 6. A small example of a metabolic network
  • 7. Key Concept of a metabolic network: Reaction Fluxes The i-th reaction has a flux (rate) vi that is flowing. vi multiplies each metabolite in the i-th reaction.
  • 8. Matrix representation of a metabolic network 11 metabolites and 4 reactions Use S ∈ Rm×n and flux vector v ∈ Rn to express the change of the mass of each metabolite over time [Palsson’15], dr dt = Sv = S     v1 v2 v3 v4     = v1S(·,1) + v2S(·,2) + v3S(·,3) + v4S(·,4)
  • 9. Steady states: The network in balance When for each metabolite, the rate of production equals to the rate of consumption, the reactions exactly balance each other. When a flux vector v balances the network, Sv = 0, v is a steady state.
  • 10. The region of steady states As a low dimensional polytope. Sv = 0, vlb ≤ v ≤ vub ←→ v=Nx S ∈ Rm×n , v ∈ Rn As a full dimensional polytope P := {x ∈ Rd | Ax ≤ b} A = InN −InN and b = vub vlb N N ∈ Rn×d the matrix of the right nullspace of S.
  • 11. Sampling steady states Sampling could lead to important biological insights! [Palsson’15] Explore the flux space [Schellenberger,Palsson’09]. We introduce a Multiphase Monte Carlo Sampling algorithm based on Billiard Walk
  • 12. Billiard walk BW(P, pi ) [Gryazina,Polyak’14] 1 Generate the length of the trajectory L ∼ D. Existing work on uniform sampling 1. Hit-and-run [Lovász et al.’06] 2. Geodesic walk [Lee et al.’17] 3. Ball walk [Lee et al.’17] 4. Vaidya walk [Chen et al.’17] 5. Riemmanian HMC [Lee et al.’18] 6. Dikin walk [Laddha et al.’20] 7. Coordinate Hit-and-Run [Laddha et al.’21] 8. HMC reflections [Chevallier et al.’21]
  • 13. Billiard walk BW(P, pi ) [Gryazina,Polyak’14] 1 Generate the length of the trajectory L ∼ D. 2 Pick a uniform direction v to define the trajectory. Existing work on uniform sampling 1. Hit-and-run [Lovász et al.’06] 2. Geodesic walk [Lee et al.’17] 3. Ball walk [Lee et al.’17] 4. Vaidya walk [Chen et al.’17] 5. Riemmanian HMC [Lee et al.’18] 6. Dikin walk [Laddha et al.’20] 7. Coordinate Hit-and-Run [Laddha et al.’21] 8. HMC reflections [Chevallier et al.’21]
  • 14. Billiard walk BW(P, pi ) [Gryazina,Polyak’14] 1 Generate the length of the trajectory L ∼ D. 2 Pick a uniform direction v to define the trajectory. Existing work on uniform sampling 1. Hit-and-run [Lovász et al.’06] 2. Geodesic walk [Lee et al.’17] 3. Ball walk [Lee et al.’17] 4. Vaidya walk [Chen et al.’17] 5. Riemmanian HMC [Lee et al.’18] 6. Dikin walk [Laddha et al.’20] 7. Coordinate Hit-and-Run [Laddha et al.’21] 8. HMC reflections [Chevallier et al.’21]
  • 15. Billiard walk BW(P, pi ) [Gryazina,Polyak’14] 1 Generate the length of the trajectory L ∼ D. 2 Pick a uniform direction v to define the trajectory. 3 The trajectory reflects on the boundary if necessary. Existing work on uniform sampling 1. Hit-and-run [Lovász et al.’06] 2. Geodesic walk [Lee et al.’17] 3. Ball walk [Lee et al.’17] 4. Vaidya walk [Chen et al.’17] 5. Riemmanian HMC [Lee et al.’18] 6. Dikin walk [Laddha et al.’20] 7. Coordinate Hit-and-Run [Laddha et al.’21] 8. HMC reflections [Chevallier et al.’21]
  • 16. Billiard walk BW(P, pi ) [Gryazina,Polyak’14] 1 Generate the length of the trajectory L ∼ D. 2 Pick a uniform direction v to define the trajectory. 3 The trajectory reflects on the boundary if necessary. Existing work on uniform sampling 1. Hit-and-run [Lovász et al.’06] 2. Geodesic walk [Lee et al.’17] 3. Ball walk [Lee et al.’17] 4. Vaidya walk [Chen et al.’17] 5. Riemmanian HMC [Lee et al.’18] 6. Dikin walk [Laddha et al.’20] 7. Coordinate Hit-and-Run [Laddha et al.’21] 8. HMC reflections [Chevallier et al.’21]
  • 17. Billiard walk BW(P, pi ) [Gryazina,Polyak’14] 1 Generate the length of the trajectory L ∼ D. 2 Pick a uniform direction v to define the trajectory. 3 The trajectory reflects on the boundary if necessary. 4 return the end of the trajectory as pi+1. Existing work on uniform sampling 1. Hit-and-run [Lovász et al.’06] 2. Geodesic walk [Lee et al.’17] 3. Ball walk [Lee et al.’17] 4. Vaidya walk [Chen et al.’17] 5. Riemmanian HMC [Lee et al.’18] 6. Dikin walk [Laddha et al.’20] 7. Coordinate Hit-and-Run [Laddha et al.’21] 8. HMC reflections [Chevallier et al.’21]
  • 18. Billiard walk p q pi pi+1 Mixing time: O∗(`2d2) well-rounded convex bodies [C,Papachristou,Fisikopoulos,Tsigaridas,’21], Improved cost per point: O(kd) operations [This paper]. `: upper bound on the number of reflections, k: number of facets.
  • 19. Difficulties When the polytope is skinny: The average number of reflections increases. The mixing rate decreases.
  • 20. Multiphase Monte Carlo Sampling MMCS(P0, p, N, set i = 0) 1 Sample O(d) points from Pi with Billiard walk. 2 Estimate the Effective Sample Size (ESS) of the sample in Pi . 3 Map the sampled points to an isotropic position and apply the same transformation Ti to Pi , set Pi+1 = Ti (Pi ). 4 i = i + 1; goto 1. 5 Stop when the sum of ESS N and PSRF 1.1. Related work (optimization) 1. [Bertsimas et al.’04] 2. [Kalai et al.’06]
  • 21. Experiments MMCS cobra model (d) Time (sec) (Steps) Time (sec) (Steps) iAB RBC 283 130 5.20e+01 1.07e+04 7.85e+03 4.05e+08 iAT PLT 636 289 3.25e+02 1.04e+04 1.73e+04 6.68e+08 iML1515 633 4.65e+03 5.65e+04 1.15e+05 3.21e+09 Recon1 931 8.09e+03 1.94e+04 3.20e+05 6.93e+09 Recon2D 2430 2.48e+04 5.44e+04 ∼ 140 days 1.57e+11 Recon3D 5335 1.03e+05 1.44e+05 – – Package cobra [Palsson, Thiele, Fleming et al.’19]: state-of-the-art software to analyze metabolic networks, Coordinate Directions Hit-and-Run [Cousins, Vempala et al.’17]. MMCS outperforms existing software.
  • 22. Estimate flux distributions Reaction GLGNS1 is crucial for the growth rate of an organism. Either slow Human biomass production and small GLGNS1 flux or fast biomass production and large GLGNS1 flux.
  • 23. Find possible anti-COVID19 targets Not an anti-viral target Possible anti-viral target Sample steady states when, the growth rate of COVID-19 is optimized, the host biomass production is optimized. Check if the flux distribution of a reaction changes [Renz,Widerspick,Dräger’20,’21].
  • 24. Open code - open science MMCS implemented in package volesti of Geomscale Org. github.com/GeomScale/volume_approximation. geomscale.github.io/. Thank you!