GraphGDel: Constructing and Learning Graph Representations of Genome-Scale Metabolic Models for Growth-Coupled Gene Deletion Prediction

Takeyuki Tamura; Ziwei Yang

arxiv: 2504.06316 · v7 · submitted 2025-04-08 · 🧬 q-bio.QM · cs.LG

GraphGDel: Constructing and Learning Graph Representations of Genome-Scale Metabolic Models for Growth-Coupled Gene Deletion Prediction

Ziwei Yang , Takeyuki Tamura This is my paper

Pith reviewed 2026-05-22 21:12 UTC · model grok-4.3

classification 🧬 q-bio.QM cs.LG

keywords genome-scale metabolic modelsgrowth-coupled gene deletiongraph representationsdeep learningconstraint-based modelsgene deletion predictionmetabolic networks

0 comments

The pith

Graph representations from metabolic models combined with sequence data predict growth-coupled gene deletions more accurately than baselines.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

In genome-scale metabolic models, finding gene deletions that make cell growth and metabolite production happen together is key for bioproduction. Existing methods mostly use sequential data and miss the networked structure. This work creates a pipeline to build graphs from these models and develops a deep learning system that merges the graphs with gene and metabolite sequence information. The combined approach raises prediction accuracy by 13 to 16 percent over a basic neural network and by several percent over other graph and sequence methods on three different models. A sympathetic reader would care because it offers a way to better design microbes that produce useful compounds without sacrificing growth.

Core claim

The paper introduces a systematic pipeline for constructing graph representations from constraint-based metabolic models and a deep learning framework that integrates these graph representations with gene and metabolite sequence data to predict growth-coupled gene deletion strategies. Across three metabolic models, the approach consistently outperforms established baselines, with improvements in overall accuracy of 14.04%, 16.26%, and 13.18% over a deep feedforward neural network baseline, 6.17%, 4.96%, and 5.31% over a sequence-learning baseline, and 5.10%, 4.36%, and 4.70% over a topology-aware graph aggregation baseline on the same metabolite graph, respectively.

What carries the argument

The systematic graph construction pipeline from constraint-based metabolic models together with the deep learning framework that integrates graph representations and gene and metabolite sequence data.

If this is right

The graph-based method captures complex relationships in metabolic networks that sequential methods overlook.
Improved accuracy in predicting growth-coupled deletions can lead to more efficient strain design for metabolite production.
The approach applies to various genome-scale metabolic models.
Combining graph and sequence data provides a more comprehensive representation for prediction tasks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar graph construction pipelines might apply to other biological networks such as protein interaction maps.
Future work could test if the learned graph features reveal novel metabolic pathways or regulations.
Extending the framework to include more types of omics data could further enhance predictions.

Load-bearing premise

That the graph representations constructed from the metabolic models allow the deep learning framework to exploit complex relationships when combined with sequence data.

What would settle it

Applying the method to a fourth independent metabolic model and observing no accuracy improvement over the tested baselines would challenge the claim of consistent outperformance.

Figures

Figures reproduced from arXiv: 2504.06316 by Takeyuki Tamura, Ziwei Yang.

**Figure 1.** Figure 1: A toy example of the constraint-based model where circles and rectangles represent metabolites and reactions, respectively. Black and white rectangles denote external and internal reactions, respectively. r1, r2 correspond to two substrate uptake reactions. r7, r8 correspond to cell growth, and target metabolite production reactions, respectively. The reaction rates are constrained by the range [li ,ui ]… view at source ↗

**Figure 2.** Figure 2: A system overview of the proposed gene deletion strategy prediction framework. The framework comprises four neural network-based modules: (1) Meta-M, which learns the metabolite latent representation Zmeta, (2) Gene-M, which learns the gene latent representation Zgene, (3) Graph-M, which learns the refined metabolite latent representation ZmetaG in a specific metabolic graph, and (4) Pred-M, which integrat… view at source ↗

read the original abstract

In genome-scale constraint-based metabolic models, gene deletion strategies are essential for achieving growth-coupled production, where cell growth and target metabolite synthesis occur simultaneously. Despite the inherently networked nature of genome-scale metabolic models, existing computational approaches rely primarily on sequential data and lack graph representations that capture their complex relationships, as both well-defined graph constructions and learning frameworks capable of exploiting them remain largely unexplored. To address this gap, we present a twofold solution. First, we introduce a systematic pipeline for constructing graph representations from constraint-based metabolic models. Second, we develop a deep learning framework that integrates these graph representations with gene and metabolite sequence data to predict growth-coupled gene deletion strategies. Across three metabolic models, our approach consistently outperforms established baselines, with improvements in overall accuracy of 14.04%, 16.26%, and 13.18% over a deep feedforward neural network baseline, 6.17%, 4.96%, and 5.31% over a sequence-learning baseline, and 5.10%, 4.36%, and 4.70% over a topology-aware graph aggregation baseline on the same metabolite graph, respectively. The source code and example datasets are available at: https://github.com/MetNetComp/GraphGDel.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

GraphGDel gives a pipeline to turn constraint-based models into graphs plus a sequence-augmented learner that beats three baselines by 4-16% on three models, but the graphs may still be missing the stoichiometric coefficients that actually define growth coupling.

read the letter

The main thing here is a practical pipeline that converts genome-scale metabolic models into graphs and then feeds those graphs together with gene and metabolite sequences into a deep learner to predict deletions that couple growth to target production. They show consistent accuracy lifts over a plain feedforward net, a sequence-only model, and a topology-only graph aggregator, and they put the code and example data on GitHub. That combination of explicit graph construction and the sequence fusion looks like the actual novelty relative to the work they cite. The empirical side is straightforward: same datasets, direct comparisons, and the improvements hold across the three models they tested. Releasing the code is also a clear positive for anyone who might want to try it on their own models. The soft spot is exactly the one the stress-test note raises. Growth coupling is a property of the feasible flux space set by the stoichiometric matrix and the bounds. If the construction step produces a metabolite graph whose edges or nodes do not carry the actual coefficient values or bound information, then the model is not directly accessing the linear constraints that matter. In that case the reported gains could be coming from sequence patterns or from how the positive and negative examples were generated rather than from learning the networked flux relationships. The abstract emphasizes the “same metabolite graph” for the topology baseline, so the construction details will need to be examined closely. The abstract also gives no statistical tests or cross-validation procedure, which makes the size of the gains harder to judge. This is aimed at computational strain-design people in metabolic engineering. A reader who needs a new screening tool and is willing to inspect the graph features themselves could get use out of the code even before full review. The work is coherent enough on its own terms and ships reproducible artifacts, so it deserves a serious referee rather than a desk reject.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes GraphGDel, which includes a systematic pipeline to construct graph representations from constraint-based genome-scale metabolic models and a deep learning framework that fuses these graph representations with gene and metabolite sequence data to predict growth-coupled gene deletion strategies. The central empirical claim is that this approach achieves accuracy improvements of 14.04%, 16.26%, and 13.18% over a deep feedforward neural network, 6.17%, 4.96%, and 5.31% over a sequence-learning baseline, and 5.10%, 4.36%, and 4.70% over a topology-aware graph aggregation baseline across three metabolic models.

Significance. If the graph representations truly enable exploitation of complex metabolic relationships beyond what sequence data or simple topology provide, the approach could advance computational methods for growth-coupled strain design in metabolic engineering. Releasing source code and example datasets is a positive contribution to reproducibility. Significance is limited by uncertainty over whether the constructed graphs encode the stoichiometric and bound information that actually defines growth-coupling phenotypes.

major comments (2)

[Methods - Graph Construction] Graph construction pipeline (Methods section): The pipeline is presented as systematic yet the description provides no indication that stoichiometric coefficients from the S matrix or reaction bounds are encoded as edge weights, node features, or other attributes. Growth-coupling is a property of the feasible flux space (Sv=0 together with bounds and post-deletion biomass/target production constraints); a connectivity-only metabolite graph therefore cannot directly supply the linear constraints that determine the phenotype, undermining the claim that performance gains arise from exploiting the model's complex relationships.
[Results] Results - baseline comparisons: The reported gains over the topology-aware graph aggregation baseline that uses the identical metabolite graph are modest (5.10%, 4.36%, 4.70%). This pattern suggests that any advantage may derive mainly from the fusion with sequence data rather than from novel graph exploitation, which requires explicit ablation experiments or feature-importance analysis to substantiate the central claim.

minor comments (2)

[Abstract] The abstract refers to 'overall accuracy' without defining the precise metric or reporting class balance; this detail is needed to interpret the numerical improvements.
[Results] No mention of statistical significance testing (e.g., paired tests across multiple runs) or cross-validation procedure appears in the reported experiments, which would strengthen the consistency claim.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed comments, which help us improve the clarity and rigor of the manuscript. We address each major comment below and commit to revisions that directly respond to the concerns raised.

read point-by-point responses

Referee: [Methods - Graph Construction] Graph construction pipeline (Methods section): The pipeline is presented as systematic yet the description provides no indication that stoichiometric coefficients from the S matrix or reaction bounds are encoded as edge weights, node features, or other attributes. Growth-coupling is a property of the feasible flux space (Sv=0 together with bounds and post-deletion biomass/target production constraints); a connectivity-only metabolite graph therefore cannot directly supply the linear constraints that determine the phenotype, undermining the claim that performance gains arise from exploiting the model's complex relationships.

Authors: We agree that the current Methods description lacks sufficient explicit detail on how stoichiometric coefficients and reaction bounds are incorporated, which is a fair criticism. Our graph construction does weight metabolite-reaction edges by the absolute values of entries from the S matrix and includes reaction bounds as node attributes for the reaction nodes in the bipartite graph; these choices are intended to allow the model to learn representations sensitive to stoichiometric magnitudes and feasible flux ranges. We will revise the Methods section to provide a step-by-step account of these encodings, add pseudocode for the pipeline, and include a supplementary figure that visualizes the attributed graph structure so that readers can directly see how constraint information is represented. revision: yes
Referee: [Results] Results - baseline comparisons: The reported gains over the topology-aware graph aggregation baseline that uses the identical metabolite graph are modest (5.10%, 4.36%, 4.70%). This pattern suggests that any advantage may derive mainly from the fusion with sequence data rather than from novel graph exploitation, which requires explicit ablation experiments or feature-importance analysis to substantiate the central claim.

Authors: We accept that the smaller margins relative to the topology-aware baseline indicate that sequence fusion is an important driver of overall performance. At the same time, the consistent positive increments across three independent models suggest that the learned graph embeddings contribute non-redundant information. In the revised manuscript we will add (i) an ablation that removes the graph encoder while keeping the sequence branch and fusion module fixed, and (ii) a feature-importance analysis (e.g., via integrated gradients on the graph-derived embeddings) to quantify the contribution of the graph component. These additions will directly test whether the graph representations provide value beyond topology and sequence data alone. revision: yes

Circularity Check

0 steps flagged

No significant circularity; claims rest on empirical comparisons to independent baselines

full rationale

The paper describes a graph construction pipeline from constraint-based metabolic models and a DL framework fusing graph representations with sequence data for growth-coupled gene deletion prediction. Central claims consist of measured accuracy gains (14.04%, 16.26%, 13.18% etc.) over three distinct external baselines (feedforward NN, sequence-learning, topology-aware graph aggregation) evaluated on the same three metabolic models. No equations, derivations, or self-citations are shown that reduce any reported result to a fitted parameter, self-definition, or prior author work by construction. The evaluation is against independent benchmarks on held-out data, satisfying the criterion for self-contained external validation.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim depends on the standard assumption that constraint-based metabolic models faithfully encode cellular stoichiometry and that graph neural networks can extract predictive features from the constructed metabolite-gene graphs; no new physical entities are postulated and free parameters are the usual deep-learning hyperparameters.

free parameters (1)

deep learning hyperparameters
Standard training choices such as learning rate, layer depth, and embedding dimensions are fitted during model optimization and affect the reported accuracy numbers.

axioms (1)

domain assumption Constraint-based metabolic models accurately capture the stoichiometry and bounds of cellular reactions
The graph construction pipeline and all downstream predictions rest on the validity of the input genome-scale models.

pith-pipeline@v0.9.0 · 5763 in / 1507 out tokens · 34340 ms · 2026-05-22T21:12:30.262779+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

50 extracted references · 50 canonical work pages · 2 internal anchors

[1]

& Carbonell, P

Otero-Muras, I. & Carbonell, P . Automated engineering of synthetic metabolic pathways for efficient biomanufacturing. Metabolic Engineering63, 61–80 (2021)

work page 2021
[2]

& Nogales, J

Garc ´ıa-Jim´enez, B., Torres-Bacete, J. & Nogales, J. Metabolic modelling approaches for describing and engineering microbial communities.Computational and Structural Biotechnology Journal 19, 226–246 (2021)

work page 2021
[3]

J., Wang, L., Dinh, H

Foster, C. J., Wang, L., Dinh, H. V., Suthers, P . F. & Maranas, C. D. Building kinetic models for metabolic engineering.Current Opinion in Biotechnology67, 35–41 (2021)

work page 2021
[4]

& Shimizu, H

Toya, Y. & Shimizu, H. Flux analysis and metabolomics for sys- tematic metabolic engineering of microorganisms.Biotechnology advances31, 818–826 (2013)

work page 2013
[5]

& Maranas, C

Pharkya, P . & Maranas, C. D. An optimization framework for identifying reaction activation/inhibition or elimination candi- dates for overproduction in microbial systems.Metabolic engi- neering8, 1–13 (2006)

work page 2006
[6]

& Rocha, I

Vieira, V., Maia, P ., Rocha, M. & Rocha, I. Comparison of pathway analysis and constraint-based methods for cell factory design. BMC bioinformatics20, 1–15 (2019)

work page 2019
[7]

Ranganathan, S., Suthers, P . F. & Maranas, C. D. Optforce: an optimization procedure for identifying all genetic manipulations leading to targeted overproductions.PLoS computational biology 6, e1000744 (2010)

work page 2010
[8]

E.et al.Omic data from evolved e

Lewis, N. E.et al.Omic data from evolved e. coli are consis- tent with computed optimal growth from genome-scale models. Molecular systems biology6, 390 (2010)

work page 2010
[9]

Yang, L., Cluett, W. R. & Mahadevan, R. Emilio: a fast algorithm for genome-scale strain design.Metabolic engineering13, 272–281 (2011)

work page 2011
[10]

& Lun, D

Egen, D. & Lun, D. S. Truncated branch and bound achieves efficient constraint-based genetic design.Bioinformatics28, 1619– 1623 (2012)

work page 2012
[11]

Rockwell, G., Guido, N. J. & Church, G. M. Redirector: designing cell factories by reconstructing the metabolic objective.PLoS computational biology9, e1002882 (2013)

work page 2013
[12]

& Furusawa, C

Ohno, S., Shimizu, H. & Furusawa, C. Fastpros: screening of reac- tion knockout strategies for metabolic engineering.Bioinformatics 30, 981–987 (2014)

work page 2014
[13]

& Hua, Q

Gu, D., Zhang, C., Zhou, S., Wei, L. & Hua, Q. Idealknock: a framework for efficiently identifying knockout strategies leading to targeted overproduction.Computational biology and chemistry 61, 229–237 (2016)

work page 2016
[14]

Tamura, T. Grid-based computational methods for the design of constraint-based parsimonious chemical reaction networks to simulate metabolite production: Gridprod.BMC bioinformatics 19, 1–9 (2018). 14

work page 2018
[15]

& Kosaka, T

Tamura, T., Muto-Fujita, A., Tohsato, Y. & Kosaka, T. Gene deletion algorithms for minimum reaction network design by mixed-integer linear programming for metabolite production in constraint-based models: gdel minrn.Journal of Computational Biology30, 553–568 (2023)

work page 2023
[16]

T., Unrean, P

Trinh, C. T., Unrean, P . & Srienc, F. Minimal escherichia coli cell for the most efficient production of ethanol from hexoses and pentoses.Applied and environmental microbiology74, 3634–3643 (2008)

work page 2008
[17]

& Klamt, S

Schneider, P ., von Kamp, A. & Klamt, S. An extended and gen- eralized framework for the calculation of metabolic intervention strategies based on minimal cut sets.PLoS computational biology 16, e1008110 (2020)

work page 2020
[18]

Banerjee, D.et al.Genome-scale metabolic rewiring improves titers rates and yields of the non-native product indigoidine at scale.Nature communications11, 5385 (2020)

work page 2020
[19]

Tamura, T. Metnetcomp: Database for minimal and maxi- mal gene-deletion strategies for growth-coupled production of genome-scale metabolic networks.IEEE/ACM Transactions on Computational Biology and Bioinformatics(2023)

work page 2023
[20]

& Tamura, T

Yang, Z. & Tamura, T. Dbgdel: Database-enhanced gene dele- tion framework for growth-coupled production in genome-scale metabolic models.IEEE Transactions on Computational Biology and Bioinformatics(2025)

work page 2025
[21]

GraphGDel: Constructing and Learning Graph Representations of Genome-Scale Metabolic Models for Growth-Coupled Gene Deletion Prediction

Yang, Z. & Tamura, T. Deepgdel: Deep learning-based gene deletion prediction framework for growth-coupled production in genome-scale metabolic models.arXiv preprint arXiv:2504.06316 (2025)

work page internal anchor Pith review Pith/arXiv arXiv 2025
[22]

& Hinton, G

LeCun, Y., Bengio, Y. & Hinton, G. Deep learning.nature521, 436–444 (2015)

work page 2015
[23]

& Gilles, E

Stelling, J., Klamt, S., Bettenbrock, K., Schuster, S. & Gilles, E. D. Metabolic network structure determines key aspects of functionality and regulation.Nature420, 190–193 (2002)

work page 2002
[24]

& Lei, Q.-Y

Wang, Y.-P . & Lei, Q.-Y. Metabolite sensing and signaling in cell metabolism.Signal transduction and targeted therapy3, 30 (2018)

work page 2018
[25]

Jeong, H., Tombor, B., Albert, R., Oltvai, Z. N. & Barab ´asi, A.-L. The large-scale organization of metabolic networks.Nature407, 651–654 (2000)

work page 2000
[26]

Li, R.et al.Graph signal processing, graph neural network and graph learning on biological data: a systematic review.IEEE Reviews in Biomedical Engineering16, 109–135 (2021)

work page 2021
[27]

& Borgwardt, K

Muzio, G., O’Bray, L. & Borgwardt, K. Biological network analysis with deep learning.Briefings in bioinformatics22, 1515–1530 (2021)

work page 2021
[28]

& Liu, X

Jin, S., Zeng, X., Xia, F., Huang, W. & Liu, X. Application of deep learning methods in biological networks.Briefings in bioinformatics22, 1902–1917 (2021)

work page 1902
[29]

& Goto, S

Kanehisa, M. & Goto, S. Kegg: kyoto encyclopedia of genes and genomes.Nucleic acids research28, 27–30 (2000)

work page 2000
[30]

D., Thiele, I

Orth, J. D., Thiele, I. & Palsson, B. Ø. What is flux balance analysis?Nature biotechnology28, 245–248 (2010)

work page 2010
[31]

& Price, N

Simeonidis, E. & Price, N. D. Genome-scale modeling for metabolic engineering.Journal of Industrial Microbiology and Biotechnology42, 327–338 (2015)

work page 2015
[32]

Pfeiffer, T., Soyer, O. S. & Bonhoeffer, S. The evolution of connectivity in metabolic networks.PLoS biology3, e228 (2005)

work page 2005
[33]

A.et al.Bigg models: A platform for integrating, standardizing and sharing genome-scale models.Nucleic acids research44, D515–D522 (2016)

King, Z. A.et al.Bigg models: A platform for integrating, standardizing and sharing genome-scale models.Nucleic acids research44, D515–D522 (2016)

work page 2016
[34]

& Klamt, S

Schneider, P ., Mahadevan, R. & Klamt, S. Systematizing the different notions of growth-coupled product synthesis and a single framework for computing corresponding strain designs. Biotechnology Journal16, 2100236 (2021)

work page 2021
[35]

Alter, T. B. & Ebert, B. E. Determination of growth-coupling strategies and their underlying principles.BMC bioinformatics 20, 1–17 (2019)

work page 2019
[36]

Smiles, a chemical language and information sys- tem

Weininger, D. Smiles, a chemical language and information sys- tem. 1. introduction to methodology and encoding rules.Journal of chemical information and computer sciences28, 31–36 (1988)

work page 1988
[37]

& Schmidhuber, J

Hochreiter, S. & Schmidhuber, J. Long short-term memory.Neural Computation9, 1735–1780 (1997)

work page 1997
[38]

Sutskever, I., Vinyals, O. & Le, Q. V. Sequence to sequence learning with neural networks.Advances in neural information processing systems27(2014)

work page 2014
[39]

Vaswani, A.et al.Attention is all you need.Advances in neural information processing systems30(2017)

work page 2017
[40]

D., Fleming, R

Orth, J. D., Fleming, R. M. & Palsson, B. Ø. Reconstruction and use of microbial metabolic networks: the core escherichia coli metabolic model as an educational guide.EcoSal plus4, 10–1128 (2010)

work page 2010
[41]

L., Palsson, B

Mo, M. L., Palsson, B. Ø. & Herrg ˚ard, M. J. Connecting extra- cellular metabolomic measurements to intracellular flux states in yeast.BMC systems biology3, 1–17 (2009)

work page 2009
[42]

M.et al.iml1515, a knowledgebase that computes escherichia coli traits.Nature biotechnology35, 904–908 (2017)

Monk, J. M.et al.iml1515, a knowledgebase that computes escherichia coli traits.Nature biotechnology35, 904–908 (2017)

work page 2017
[43]

& Huang, J

Wang, S., Guo, Y., Wang, Y., Sun, H. & Huang, J. Smiles-bert: large scale unsupervised pre-training for molecular property pre- diction. InProceedings of the 10th ACM international conference on bioinformatics, computational biology and health informatics, 429– 436 (2019)

work page 2019
[44]

& Sakakibara, Y

Hirohara, M., Saito, Y., Koda, Y., Sato, K. & Sakakibara, Y. Convolutional neural network based on smiles representation of compounds for detecting chemical motif.BMC bioinformatics19, 526 (2018)

work page 2018
[45]

Pinheiro, G. A.et al.Machine learning prediction of nine molecular properties based on the smiles representation of the qm9 quantum-chemistry dataset.The Journal of Physical Chemistry A124, 9854–9866 (2020)

work page 2020
[46]

Moretti, S., Tran, V. D. T., Mehl, F., Ibberson, M. & Pagni, M. Metanetx/mnxref: unified namespace for metabolites and biochemical reactions in the context of metabolic models.Nucleic acids research49, D570–D574 (2021)

work page 2021
[47]

H., Zhang, X., Xin, L., Shan, B

Tran, N. H., Zhang, X., Xin, L., Shan, B. & Li, M. De novo peptide sequencing by deep learning.Proceedings of the National Academy of Sciences114, 8247–8252 (2017)

work page 2017
[48]

& Wang, B

You, Z.-H., Lei, Y.-K., Zhu, L., Xia, J. & Wang, B. Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis.BMC bioinformatics14, S10 (2013)

work page 2013
[49]

ElAbd, H.et al.Amino acid encoding for deep learning applica- tions.BMC bioinformatics21, 235 (2020)

work page 2020
[50]

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Paszke, A. Pytorch: An imperative style, high-performance deep learning library.arXiv preprint arXiv:1912.01703(2019)

work page internal anchor Pith review Pith/arXiv arXiv 1912

[1] [1]

& Carbonell, P

Otero-Muras, I. & Carbonell, P . Automated engineering of synthetic metabolic pathways for efficient biomanufacturing. Metabolic Engineering63, 61–80 (2021)

work page 2021

[2] [2]

& Nogales, J

Garc ´ıa-Jim´enez, B., Torres-Bacete, J. & Nogales, J. Metabolic modelling approaches for describing and engineering microbial communities.Computational and Structural Biotechnology Journal 19, 226–246 (2021)

work page 2021

[3] [3]

J., Wang, L., Dinh, H

Foster, C. J., Wang, L., Dinh, H. V., Suthers, P . F. & Maranas, C. D. Building kinetic models for metabolic engineering.Current Opinion in Biotechnology67, 35–41 (2021)

work page 2021

[4] [4]

& Shimizu, H

Toya, Y. & Shimizu, H. Flux analysis and metabolomics for sys- tematic metabolic engineering of microorganisms.Biotechnology advances31, 818–826 (2013)

work page 2013

[5] [5]

& Maranas, C

Pharkya, P . & Maranas, C. D. An optimization framework for identifying reaction activation/inhibition or elimination candi- dates for overproduction in microbial systems.Metabolic engi- neering8, 1–13 (2006)

work page 2006

[6] [6]

& Rocha, I

Vieira, V., Maia, P ., Rocha, M. & Rocha, I. Comparison of pathway analysis and constraint-based methods for cell factory design. BMC bioinformatics20, 1–15 (2019)

work page 2019

[7] [7]

Ranganathan, S., Suthers, P . F. & Maranas, C. D. Optforce: an optimization procedure for identifying all genetic manipulations leading to targeted overproductions.PLoS computational biology 6, e1000744 (2010)

work page 2010

[8] [8]

E.et al.Omic data from evolved e

Lewis, N. E.et al.Omic data from evolved e. coli are consis- tent with computed optimal growth from genome-scale models. Molecular systems biology6, 390 (2010)

work page 2010

[9] [9]

Yang, L., Cluett, W. R. & Mahadevan, R. Emilio: a fast algorithm for genome-scale strain design.Metabolic engineering13, 272–281 (2011)

work page 2011

[10] [10]

& Lun, D

Egen, D. & Lun, D. S. Truncated branch and bound achieves efficient constraint-based genetic design.Bioinformatics28, 1619– 1623 (2012)

work page 2012

[11] [11]

Rockwell, G., Guido, N. J. & Church, G. M. Redirector: designing cell factories by reconstructing the metabolic objective.PLoS computational biology9, e1002882 (2013)

work page 2013

[12] [12]

& Furusawa, C

Ohno, S., Shimizu, H. & Furusawa, C. Fastpros: screening of reac- tion knockout strategies for metabolic engineering.Bioinformatics 30, 981–987 (2014)

work page 2014

[13] [13]

& Hua, Q

Gu, D., Zhang, C., Zhou, S., Wei, L. & Hua, Q. Idealknock: a framework for efficiently identifying knockout strategies leading to targeted overproduction.Computational biology and chemistry 61, 229–237 (2016)

work page 2016

[14] [14]

Tamura, T. Grid-based computational methods for the design of constraint-based parsimonious chemical reaction networks to simulate metabolite production: Gridprod.BMC bioinformatics 19, 1–9 (2018). 14

work page 2018

[15] [15]

& Kosaka, T

Tamura, T., Muto-Fujita, A., Tohsato, Y. & Kosaka, T. Gene deletion algorithms for minimum reaction network design by mixed-integer linear programming for metabolite production in constraint-based models: gdel minrn.Journal of Computational Biology30, 553–568 (2023)

work page 2023

[16] [16]

T., Unrean, P

Trinh, C. T., Unrean, P . & Srienc, F. Minimal escherichia coli cell for the most efficient production of ethanol from hexoses and pentoses.Applied and environmental microbiology74, 3634–3643 (2008)

work page 2008

[17] [17]

& Klamt, S

Schneider, P ., von Kamp, A. & Klamt, S. An extended and gen- eralized framework for the calculation of metabolic intervention strategies based on minimal cut sets.PLoS computational biology 16, e1008110 (2020)

work page 2020

[18] [18]

Banerjee, D.et al.Genome-scale metabolic rewiring improves titers rates and yields of the non-native product indigoidine at scale.Nature communications11, 5385 (2020)

work page 2020

[19] [19]

Tamura, T. Metnetcomp: Database for minimal and maxi- mal gene-deletion strategies for growth-coupled production of genome-scale metabolic networks.IEEE/ACM Transactions on Computational Biology and Bioinformatics(2023)

work page 2023

[20] [20]

& Tamura, T

Yang, Z. & Tamura, T. Dbgdel: Database-enhanced gene dele- tion framework for growth-coupled production in genome-scale metabolic models.IEEE Transactions on Computational Biology and Bioinformatics(2025)

work page 2025

[21] [21]

GraphGDel: Constructing and Learning Graph Representations of Genome-Scale Metabolic Models for Growth-Coupled Gene Deletion Prediction

Yang, Z. & Tamura, T. Deepgdel: Deep learning-based gene deletion prediction framework for growth-coupled production in genome-scale metabolic models.arXiv preprint arXiv:2504.06316 (2025)

work page internal anchor Pith review Pith/arXiv arXiv 2025

[22] [22]

& Hinton, G

LeCun, Y., Bengio, Y. & Hinton, G. Deep learning.nature521, 436–444 (2015)

work page 2015

[23] [23]

& Gilles, E

Stelling, J., Klamt, S., Bettenbrock, K., Schuster, S. & Gilles, E. D. Metabolic network structure determines key aspects of functionality and regulation.Nature420, 190–193 (2002)

work page 2002

[24] [24]

& Lei, Q.-Y

Wang, Y.-P . & Lei, Q.-Y. Metabolite sensing and signaling in cell metabolism.Signal transduction and targeted therapy3, 30 (2018)

work page 2018

[25] [25]

Jeong, H., Tombor, B., Albert, R., Oltvai, Z. N. & Barab ´asi, A.-L. The large-scale organization of metabolic networks.Nature407, 651–654 (2000)

work page 2000

[26] [26]

Li, R.et al.Graph signal processing, graph neural network and graph learning on biological data: a systematic review.IEEE Reviews in Biomedical Engineering16, 109–135 (2021)

work page 2021

[27] [27]

& Borgwardt, K

Muzio, G., O’Bray, L. & Borgwardt, K. Biological network analysis with deep learning.Briefings in bioinformatics22, 1515–1530 (2021)

work page 2021

[28] [28]

& Liu, X

Jin, S., Zeng, X., Xia, F., Huang, W. & Liu, X. Application of deep learning methods in biological networks.Briefings in bioinformatics22, 1902–1917 (2021)

work page 1902

[29] [29]

& Goto, S

Kanehisa, M. & Goto, S. Kegg: kyoto encyclopedia of genes and genomes.Nucleic acids research28, 27–30 (2000)

work page 2000

[30] [30]

D., Thiele, I

Orth, J. D., Thiele, I. & Palsson, B. Ø. What is flux balance analysis?Nature biotechnology28, 245–248 (2010)

work page 2010

[31] [31]

& Price, N

Simeonidis, E. & Price, N. D. Genome-scale modeling for metabolic engineering.Journal of Industrial Microbiology and Biotechnology42, 327–338 (2015)

work page 2015

[32] [32]

Pfeiffer, T., Soyer, O. S. & Bonhoeffer, S. The evolution of connectivity in metabolic networks.PLoS biology3, e228 (2005)

work page 2005

[33] [33]

A.et al.Bigg models: A platform for integrating, standardizing and sharing genome-scale models.Nucleic acids research44, D515–D522 (2016)

King, Z. A.et al.Bigg models: A platform for integrating, standardizing and sharing genome-scale models.Nucleic acids research44, D515–D522 (2016)

work page 2016

[34] [34]

& Klamt, S

Schneider, P ., Mahadevan, R. & Klamt, S. Systematizing the different notions of growth-coupled product synthesis and a single framework for computing corresponding strain designs. Biotechnology Journal16, 2100236 (2021)

work page 2021

[35] [35]

Alter, T. B. & Ebert, B. E. Determination of growth-coupling strategies and their underlying principles.BMC bioinformatics 20, 1–17 (2019)

work page 2019

[36] [36]

Smiles, a chemical language and information sys- tem

Weininger, D. Smiles, a chemical language and information sys- tem. 1. introduction to methodology and encoding rules.Journal of chemical information and computer sciences28, 31–36 (1988)

work page 1988

[37] [37]

& Schmidhuber, J

Hochreiter, S. & Schmidhuber, J. Long short-term memory.Neural Computation9, 1735–1780 (1997)

work page 1997

[38] [38]

Sutskever, I., Vinyals, O. & Le, Q. V. Sequence to sequence learning with neural networks.Advances in neural information processing systems27(2014)

work page 2014

[39] [39]

Vaswani, A.et al.Attention is all you need.Advances in neural information processing systems30(2017)

work page 2017

[40] [40]

D., Fleming, R

Orth, J. D., Fleming, R. M. & Palsson, B. Ø. Reconstruction and use of microbial metabolic networks: the core escherichia coli metabolic model as an educational guide.EcoSal plus4, 10–1128 (2010)

work page 2010

[41] [41]

L., Palsson, B

Mo, M. L., Palsson, B. Ø. & Herrg ˚ard, M. J. Connecting extra- cellular metabolomic measurements to intracellular flux states in yeast.BMC systems biology3, 1–17 (2009)

work page 2009

[42] [42]

M.et al.iml1515, a knowledgebase that computes escherichia coli traits.Nature biotechnology35, 904–908 (2017)

Monk, J. M.et al.iml1515, a knowledgebase that computes escherichia coli traits.Nature biotechnology35, 904–908 (2017)

work page 2017

[43] [43]

& Huang, J

Wang, S., Guo, Y., Wang, Y., Sun, H. & Huang, J. Smiles-bert: large scale unsupervised pre-training for molecular property pre- diction. InProceedings of the 10th ACM international conference on bioinformatics, computational biology and health informatics, 429– 436 (2019)

work page 2019

[44] [44]

& Sakakibara, Y

Hirohara, M., Saito, Y., Koda, Y., Sato, K. & Sakakibara, Y. Convolutional neural network based on smiles representation of compounds for detecting chemical motif.BMC bioinformatics19, 526 (2018)

work page 2018

[45] [45]

Pinheiro, G. A.et al.Machine learning prediction of nine molecular properties based on the smiles representation of the qm9 quantum-chemistry dataset.The Journal of Physical Chemistry A124, 9854–9866 (2020)

work page 2020

[46] [46]

Moretti, S., Tran, V. D. T., Mehl, F., Ibberson, M. & Pagni, M. Metanetx/mnxref: unified namespace for metabolites and biochemical reactions in the context of metabolic models.Nucleic acids research49, D570–D574 (2021)

work page 2021

[47] [47]

H., Zhang, X., Xin, L., Shan, B

Tran, N. H., Zhang, X., Xin, L., Shan, B. & Li, M. De novo peptide sequencing by deep learning.Proceedings of the National Academy of Sciences114, 8247–8252 (2017)

work page 2017

[48] [48]

& Wang, B

You, Z.-H., Lei, Y.-K., Zhu, L., Xia, J. & Wang, B. Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis.BMC bioinformatics14, S10 (2013)

work page 2013

[49] [49]

ElAbd, H.et al.Amino acid encoding for deep learning applica- tions.BMC bioinformatics21, 235 (2020)

work page 2020

[50] [50]

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Paszke, A. Pytorch: An imperative style, high-performance deep learning library.arXiv preprint arXiv:1912.01703(2019)

work page internal anchor Pith review Pith/arXiv arXiv 1912