Graph-Theoretic Models for the Prediction of Molecular Measurements

Anna Niane; Prudence Djagba

arxiv: 2604.19840 · v1 · submitted 2026-04-21 · 💻 cs.LG · q-bio.QM

Graph-Theoretic Models for the Prediction of Molecular Measurements

Anna Niane , Prudence Djagba This is my paper

Pith reviewed 2026-05-10 02:41 UTC · model grok-4.3

classification 💻 cs.LG q-bio.QM

keywords graph-theoretic modelsmolecular property predictionmachine learning enhancementsgraph convolutional networksR-squaredMoleculeNettopological indicesMorgan fingerprints

0 comments

The pith

Adding standard ML tools to graph indices lets classical models match deep learning accuracy on molecular properties.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether a graph-theoretic model using external activity D(G) and internal activity ζ(G) indices, originally successful on small flavonoid sets, can generalize to five large MoleculeNet benchmarks covering biological activity, lipophilicity, solubility, and free energy. The baseline polynomial achieves only average R² of 0.24, showing limited transfer. By layering Ridge regularization, extra graph and physicochemical descriptors, Gradient Boosting ensembles, Lasso selection, and Morgan fingerprint hybrids, the authors lift average best R² to 0.79 with gains of 165 to 274 percent that remain significant at p less than 0.001. These enhanced models equal or exceed a graph convolutional network on every dataset while training in minutes on a CPU using only open-source code.

Core claim

The authors establish that the baseline D(G)-ζ(G) polynomial generalizes poorly across chemically diverse datasets but that a systematic enhancement framework—progressively adding Ridge regularization, additional descriptors, Gradient Boosting, Lasso feature selection, and Morgan fingerprint hybrids—produces large, statistically significant accuracy gains. The resulting models reach an average best R² of 0.79, match or outperform a graph convolutional network under identical conditions on all five tasks, and require no GPU.

What carries the argument

The progressive enhancement framework that begins with the D(G)-ζ(G) polynomial and successively incorporates Ridge regularization, graph and physicochemical descriptors, Gradient Boosting ensembles, Lasso selection, and Morgan fingerprint hybrids.

If this is right

The enhanced models can deliver competitive molecular predictions in environments without GPU access or deep-learning expertise.
Classical graph methods remain viable for property prediction when interpretability and low computational cost matter.
The same stepwise addition of regularization, ensembles, and fingerprints can be applied to other topological indices for similar gains.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the pattern holds, hybrid classical-plus-ML models could become standard baselines that future deep-learning methods must demonstrably surpass.
The work suggests testing whether the same enhancement sequence improves graph indices on tasks outside MoleculeNet, such as reaction prediction or materials properties.

Load-bearing premise

That layering these standard ML components yields generalizable gains across datasets rather than overfitting to the chosen benchmarks, and that the p less than 0.001 results survive correction for multiple comparisons.

What would settle it

Reproducing the full pipeline on an independent collection of molecular datasets and finding that the enhanced models no longer improve on the baseline or match the GCN performance would falsify the claim.

Figures

Figures reproduced from arXiv: 2604.19840 by Anna Niane, Prudence Djagba.

**Figure 2.** Figure 2: R2 performance progression across enhancement approaches for four benchmark datasets [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: Comparison of baseline, best enhanced classical model, and GCN across all five benchmark datasets. [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Top 15 most important features in the hybrid model for BACE prediction. [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

read the original abstract

Graph-theoretic approaches offer simplicity, interpretability, and low computational cost for molecular property prediction. Among these, the model proposed by Mukwembi and Nyabadza, based on the external activity $D(G)$ and internal activity $\zeta(G)$ indices, achieved strong results on a small flavonoid dataset. However, its ability to generalize to larger and chemically diverse datasets has not been tested. This study evaluates the baseline $D(G)$-$\zeta(G)$ polynomial model on five benchmark datasets from MoleculeNet, covering biological activity (BACE, 1,513 molecules), lipophilicity (LogP synthetic, 14,610 molecules; LogP experimental, 753 molecules), aqueous solubility (ESOL, 1,128 molecules), and hydration free energy (SAMPL, 642 molecules). The baseline model achieves an average $R^2 = 0.24$, confirming limited transferability. To address this, a systematic enhancement framework is proposed, progressively incorporating Ridge regularization, additional graph descriptors, physicochemical properties, ensemble learning with Gradient Boosting, Lasso feature selection, and a hybrid approach combining topological indices with Morgan fingerprints. The enhanced models raise the average best $R^2$ to 0.79, with individual improvements ranging from 165\% to 274\%. All improvements are statistically significant ($p < 0.001$). A direct comparison with a Graph Convolutional Network under identical experimental conditions shows that the enhanced classical models match or outperform deep learning on all five datasets. Comparison with the recent GNN+PGM hybrid of Djagba et al.\ further confirms competitiveness, with the enhanced models achieving the best results on two datasets and tying on one. The entire framework requires no GPU, trains in under five minutes, and uses only open-source tools, making it accessible for researchers in resource-limited settings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper layers standard ML add-ons onto an old graph index model to match GCN performance on MoleculeNet tasks, but the p<0.001 claims are likely overstated without multiple-comparison correction.

read the letter

Colleague, the main point is that a basic D(G)-zeta(G) graph index model from prior work, when you stack on Ridge, gradient boosting, Lasso, physicochemical features, and Morgan fingerprints, ends up matching or beating a GCN baseline on five MoleculeNet datasets while running entirely on CPU in a few minutes. They show the plain version averages R² of 0.24 across BACE, the two LogP sets, ESOL, and SAMPL, then the enhancements lift the best results to 0.79 average with 165-274% gains, all reported at p<0.001. Direct comparison under the same conditions and a side-by-side with the recent GNN+PGM hybrid round out the empirical case. The accessibility angle for low-resource labs is the practical takeaway they emphasize. What the paper does well is the step-by-step breakdown of which additions actually help and the head-to-head with deep learning on public benchmarks; that makes the gains traceable rather than a black-box claim. The citation pattern is straightforward and builds directly on the Mukwembi-Nyabadza baseline without overclaiming novelty. The soft spot is the statistics. With five datasets and multiple intermediate variants per dataset, the family-wise error rate is uncontrolled, and nothing indicates they applied Bonferroni or FDR adjustment. On the smaller sets like SAMPL (642 molecules) this raises the chance that some nominal significance is selection over noise. The abstract also leaves exact splits, hyperparameter search, and full methods opaque, which weakens in how generalizable the pipeline is. If the full text supplies reproducible code and shows the corrections were handled, that concern shrinks. This is for computational chemists who need lightweight, interpretable alternatives rather than people chasing new architectures. It has enough concrete benchmarking and a clear use case to deserve a serious referee, even with the stats issue. I would send it out but flag the multiple-testing point and request code release in the review.

Referee Report

2 major / 1 minor

Summary. The paper claims that a baseline graph-theoretic model using D(G) and ζ(G) indices achieves only average R²=0.24 on five MoleculeNet datasets, but systematic enhancements with Ridge, Gradient Boosting, Lasso, physicochemical properties, and Morgan fingerprints raise the average best R² to 0.79 (165-274% improvements, p<0.001), matching or outperforming a GCN baseline on all datasets while being computationally efficient.

Significance. If validated, this work highlights the potential of enhanced classical graph-theoretic methods as accessible, interpretable alternatives to deep learning for molecular property prediction, particularly in settings without GPU resources. The empirical comparisons on public benchmarks provide a useful reference point for the field.

major comments (2)

[Abstract and Results] The assertion that 'all improvements are statistically significant (p < 0.001)' does not address multiple comparisons. Given five datasets and several model variants per dataset, the family-wise error rate requires correction (e.g., Bonferroni or FDR). Without this, the significance on smaller datasets like SAMPL (n=642) and ESOL (n=1128) may not hold, undermining the central claim of reliable gains.
[Methods and Experimental Setup] Details on data splits (e.g., train/test ratios, random seeds), hyperparameter optimization procedures for Ridge/Lasso/GB, and the exact GCN architecture and training protocol are not provided. These are essential to verify that the reported outperformance over GCN occurs under truly identical conditions and to assess generalizability.

minor comments (1)

[Abstract] The citation to 'Mukwembi and Nyabadza' and 'Djagba et al.' should include full references in the bibliography for completeness.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which help improve the clarity and rigor of our work. We address each major comment point by point below.

read point-by-point responses

Referee: [Abstract and Results] The assertion that 'all improvements are statistically significant (p < 0.001)' does not address multiple comparisons. Given five datasets and several model variants per dataset, the family-wise error rate requires correction (e.g., Bonferroni or FDR). Without this, the significance on smaller datasets like SAMPL (n=642) and ESOL (n=1128) may not hold, undermining the central claim of reliable gains.

Authors: We acknowledge that multiple comparisons were not explicitly corrected in the original submission. In the revised manuscript, we will apply the Benjamini-Hochberg FDR procedure to adjust all p-values across the five datasets and model variants. Given that the reported p-values are below 0.001, the adjusted values are expected to remain below conventional significance thresholds (e.g., 0.05) even after correction for a modest number of tests; we will report both original and adjusted p-values to substantiate the claims. revision: yes
Referee: [Methods and Experimental Setup] Details on data splits (e.g., train/test ratios, random seeds), hyperparameter optimization procedures for Ridge/Lasso/GB, and the exact GCN architecture and training protocol are not provided. These are essential to verify that the reported outperformance over GCN occurs under truly identical conditions and to assess generalizability.

Authors: We agree these experimental details are necessary for reproducibility. The revised manuscript will expand the Methods section to specify: the exact train/test split ratios and random seeds for each dataset; the hyperparameter search grids and cross-validation procedure used for Ridge, Lasso, and Gradient Boosting; and the complete GCN architecture (layers, dimensions, activations) together with the training protocol (optimizer, learning rate, epochs, batch size). This will confirm all models, including the GCN baseline, were evaluated under identical conditions. revision: yes

Circularity Check

0 steps flagged

Minor self-citation in comparison; main results are empirical evaluations on public benchmarks with no circular derivations

full rationale

The paper evaluates a baseline graph-theoretic model and its enhancements (Ridge, GB, Lasso, Morgan fingerprints, etc.) directly on five external MoleculeNet datasets, reporting R² values computed from model fits and predictions on those benchmarks. No equations or derivations are presented that reduce the reported performance metrics to quantities defined solely by parameters fitted inside the paper. A single comparison to prior GNN+PGM work by Djagba et al. (self-citation due to author overlap) is included for context but is not load-bearing for the central claims of improvement over baseline or matching GCN performance. The derivation chain is therefore self-contained against external data and off-the-shelf methods.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The central claim rests on empirical performance gains from standard machine-learning additions applied to an existing graph-index model; no new theoretical entities or derivations are introduced.

free parameters (2)

Ridge regularization strength
Hyperparameter tuned during model fitting to control overfitting on each dataset.
Lasso feature selection threshold
Hyperparameter controlling which additional descriptors are retained.

axioms (1)

domain assumption Molecules can be faithfully represented as undirected graphs for topological index calculation
Invoked when applying D(G) and zeta(G) indices to the benchmark molecules.

pith-pipeline@v0.9.0 · 5636 in / 1356 out tokens · 48448 ms · 2026-05-10T02:41:03.742850+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

12 extracted references · 12 canonical work pages

[1]

Hybrid deep learning models for the prediction of some experimental quantities of molecules

P Djagba, V Rakotonarivo, and A Zeleke. Hybrid deep learning models for the prediction of some experimental quantities of molecules. In2025 International Conference on Machine Learning and Applications (ICMLA), pages 820–825. IEEE, 2025

work page 2025
[2]

Schoenholz, Patrick F

Justin Gilmer, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, and George E. Dahl. Neural message passing for quantum chemistry. InInternational Conference on Machine Learning, pages 1263–1272. PMLR, 2017

work page 2017
[3]

Graph theory and molecular orbitals

Ivan Gutman and Nenad Trinajstić. Graph theory and molecular orbitals. Totalπ-electron energy of alternant hydrocarbons.Chemical Physics Letters, 17(4):535–538, 1972

work page 1972
[4]

A method for the correlation of biological activity and chemical structure.Journal of the American Chemical Society, 86(8):1616–1626, 1964

Corwin Hansch and Toshio Fujita.ρ-σ-πanalysis. A method for the correlation of biological activity and chemical structure.Journal of the American Chemical Society, 86(8):1616–1626, 1964

work page 1964
[5]

Kipf and Max Welling

Thomas N. Kipf and Max Welling. Semi-supervised classification with graph convolutional networks.Proceed- ings of the International Conference on Learning Representations (ICLR), 2017

work page 2017
[6]

RDKit: Open-source cheminformatics.http://www.rdkit.org, 2013

Greg Landrum. RDKit: Open-source cheminformatics.http://www.rdkit.org, 2013. Accessed 2024

work page 2013
[7]

Mordred: a molecular descriptor calculator.Journal of Cheminformatics, 10(1):4, 2018

Hirotomo Moriwaki, Yu-Shi Tian, Norihito Kawashita, and Tatsuya Takagi. Mordred: a molecular descriptor calculator.Journal of Cheminformatics, 10(1):4, 2018

work page 2018
[8]

A graph-theoretic model for predicting tyrosinase inhibition activity of flavonoids.Journal of Mathematical Chemistry, 61(8):1789–1805, 2023

Simon Mukwembi and Farai Nyabadza. A graph-theoretic model for predicting tyrosinase inhibition activity of flavonoids.Journal of Mathematical Chemistry, 61(8):1789–1805, 2023

work page 2023
[9]

Characterization of molecular branching.Journal of the American Chemical Society, 97(23):6609–6615, 1975

Milan Randić. Characterization of molecular branching.Journal of the American Chemical Society, 97(23):6609–6615, 1975

work page 1975
[10]

Extended-connectivity fingerprints.Journal of Chemical Information and Modeling, 50(5):742–754, 2010

David Rogers and Mathew Hahn. Extended-connectivity fingerprints.Journal of Chemical Information and Modeling, 50(5):742–754, 2010

work page 2010
[11]

Structural determination of paraffin boiling points.Journal of the American Chemical Society, 69(1):17–20, 1947

Harry Wiener. Structural determination of paraffin boiling points.Journal of the American Chemical Society, 69(1):17–20, 1947

work page 1947
[12]

Feinberg, Joseph Gomes, Caleb Geniesse, Aneesh S

Zhenqin Wu, Bharath Ramsundar, Evan N. Feinberg, Joseph Gomes, Caleb Geniesse, Aneesh S. Pappu, Karl Leswing, and Vijay Pande. MoleculeNet: a benchmark for molecular machine learning.Chemical Science, 9(2):513–530, 2018. 9

work page 2018

[1] [1]

Hybrid deep learning models for the prediction of some experimental quantities of molecules

P Djagba, V Rakotonarivo, and A Zeleke. Hybrid deep learning models for the prediction of some experimental quantities of molecules. In2025 International Conference on Machine Learning and Applications (ICMLA), pages 820–825. IEEE, 2025

work page 2025

[2] [2]

Schoenholz, Patrick F

Justin Gilmer, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, and George E. Dahl. Neural message passing for quantum chemistry. InInternational Conference on Machine Learning, pages 1263–1272. PMLR, 2017

work page 2017

[3] [3]

Graph theory and molecular orbitals

Ivan Gutman and Nenad Trinajstić. Graph theory and molecular orbitals. Totalπ-electron energy of alternant hydrocarbons.Chemical Physics Letters, 17(4):535–538, 1972

work page 1972

[4] [4]

A method for the correlation of biological activity and chemical structure.Journal of the American Chemical Society, 86(8):1616–1626, 1964

Corwin Hansch and Toshio Fujita.ρ-σ-πanalysis. A method for the correlation of biological activity and chemical structure.Journal of the American Chemical Society, 86(8):1616–1626, 1964

work page 1964

[5] [5]

Kipf and Max Welling

Thomas N. Kipf and Max Welling. Semi-supervised classification with graph convolutional networks.Proceed- ings of the International Conference on Learning Representations (ICLR), 2017

work page 2017

[6] [6]

RDKit: Open-source cheminformatics.http://www.rdkit.org, 2013

Greg Landrum. RDKit: Open-source cheminformatics.http://www.rdkit.org, 2013. Accessed 2024

work page 2013

[7] [7]

Mordred: a molecular descriptor calculator.Journal of Cheminformatics, 10(1):4, 2018

Hirotomo Moriwaki, Yu-Shi Tian, Norihito Kawashita, and Tatsuya Takagi. Mordred: a molecular descriptor calculator.Journal of Cheminformatics, 10(1):4, 2018

work page 2018

[8] [8]

A graph-theoretic model for predicting tyrosinase inhibition activity of flavonoids.Journal of Mathematical Chemistry, 61(8):1789–1805, 2023

Simon Mukwembi and Farai Nyabadza. A graph-theoretic model for predicting tyrosinase inhibition activity of flavonoids.Journal of Mathematical Chemistry, 61(8):1789–1805, 2023

work page 2023

[9] [9]

Characterization of molecular branching.Journal of the American Chemical Society, 97(23):6609–6615, 1975

Milan Randić. Characterization of molecular branching.Journal of the American Chemical Society, 97(23):6609–6615, 1975

work page 1975

[10] [10]

Extended-connectivity fingerprints.Journal of Chemical Information and Modeling, 50(5):742–754, 2010

David Rogers and Mathew Hahn. Extended-connectivity fingerprints.Journal of Chemical Information and Modeling, 50(5):742–754, 2010

work page 2010

[11] [11]

Structural determination of paraffin boiling points.Journal of the American Chemical Society, 69(1):17–20, 1947

Harry Wiener. Structural determination of paraffin boiling points.Journal of the American Chemical Society, 69(1):17–20, 1947

work page 1947

[12] [12]

Feinberg, Joseph Gomes, Caleb Geniesse, Aneesh S

Zhenqin Wu, Bharath Ramsundar, Evan N. Feinberg, Joseph Gomes, Caleb Geniesse, Aneesh S. Pappu, Karl Leswing, and Vijay Pande. MoleculeNet: a benchmark for molecular machine learning.Chemical Science, 9(2):513–530, 2018. 9

work page 2018