DPD-Cancer: Explainable Graph-Based Deep Learning for Small Molecule Anti-Cancer Activity Prediction

Alex G. C. de S\'a; David B. Ascher; Magnus H. Str{\o}mme

arxiv: 2603.26114 · v2 · submitted 2026-03-27 · 💻 cs.LG · cs.AI

DPD-Cancer: Explainable Graph-Based Deep Learning for Small Molecule Anti-Cancer Activity Prediction

Magnus H. Str{\o}mme , Alex G. C. de S\'a , David B. Ascher This is my paper

Pith reviewed 2026-05-15 00:10 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords graph neural networkanti-cancer predictionNCI-60explainable AIdrug discoverymachine learningsmall moleculesbioactivity prediction

0 comments

The pith

Graph-attention model predicts small-molecule anti-cancer activity with AUROC 0.87 under strict chemical splits.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

DPD-Cancer applies graph attention to molecular structures to forecast whether small molecules will inhibit growth in the NCI-60 cancer cell panel. Training and testing use a chemistry-aware partition that keeps structurally similar molecules out of the test set. On held-out compounds the binary classifier reaches AUROC 0.87 and AUPRC 0.73, while per-cell-line regression models achieve median Pearson correlation 0.64. Occlusion-based attribution produces explanations that match the model's decisions, and the method outperforms earlier predictors under matched conditions. A public web server is provided so researchers can obtain predictions without local setup.

Core claim

A graph-attention neural network trained on molecular graphs can classify small molecules as active or inactive against NCI-60 cell lines with AUROC 0.87 and can regress pGI50 values with median Pearson R of 0.64 when evaluated on a hold-out set partitioned by chemical similarity. The model also supplies quantitatively faithful explanations via occlusion analysis and shows higher Matthew's correlation coefficients than pdCSM-Cancer, MLASM, and ACLPred under identical data conditions.

What carries the argument

Graph-attention layers operating on molecular graphs, together with occlusion-based attribution that isolates the contribution of individual atoms or bonds to each prediction.

If this is right

Virtual screening of large compound libraries can now be performed with quantified reliability for anti-cancer activity.
Occlusion maps allow chemists to identify which molecular substructures drive predicted activity or inactivity.
Applicability-domain analysis flags compounds whose predictions are less trustworthy based on chemical distance from the training distribution.
Per-cell-line regression outputs support selection of molecules with activity against specific cancer types.
The free web server removes the need for local model training or inference when screening candidate molecules.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Retraining the same architecture on other bioactivity datasets could extend the approach to non-cancer targets without changing the core machinery.
Widespread use of similarity-based partitioning could raise the standard for validating machine-learning models in cheminformatics.
Combining the model with generative molecular design tools could enable closed-loop prediction and molecule creation.
The built-in explanations may help satisfy documentation requirements when models are used in early drug-development pipelines.

Load-bearing premise

The chemistry-aware data split fully removes information leakage caused by molecular similarity between training and test compounds.

What would settle it

Performance measured on a hold-out set that deliberately includes molecules with Tanimoto similarity above 0.8 to the training set falls below the reported AUROC of 0.87.

read the original abstract

DPD-Cancer is a graph-attention deep learning framework for predicting small-molecule DPD-Cancer is a graph-attention deep learning framework for predicting small-molecule anti-cancer activity across the NCI-60 panel, trained and evaluated under a strict chemistry-aware data-partitioning scheme. On the hold-out test set, the classifier achieved an Area Under the Receiver Operating Characteristic Curve (AUROC) of 0.87 (95% CI [0.86, 0.88]) and Area Under the Precision-Recall Curve (AUPRC) of 0.73 (95% CI [0.70, 0.76]); per-cell-line regression models for 73 cell lines produced a median Pearson's Correlation Coefficient (Pearson's R) of 0.64 and median Root Mean Squared Error (RMSE) of 0.67 for pGI50-value prediction. Benchmarks against pdCSM-Cancer, MLASM, and ACLPred under matched data conditions yielded consistently higher Matthew's Correlation Coefficient (MCC) scores, an occlusion-based attribution analysis confirmed that model explanations were quantitatively faithful to classifier decisions, and an applicability-domain analysis characterised reliability as a function of chemical distance. To facilitate widespread adoption, DPD-Cancer is available as a free, user-friendly web server for unrestricted use at https://biosig.lab.uq.edu.au/dpd_cancer/.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces DPD-Cancer, a graph-attention neural network for predicting small-molecule anti-cancer activity on the NCI-60 panel. It trains and evaluates under a claimed strict chemistry-aware data split, reporting AUROC 0.87 (95% CI [0.86, 0.88]) and AUPRC 0.73 (95% CI [0.70, 0.76]) on the held-out classifier test set, plus median Pearson R = 0.64 and RMSE = 0.67 for per-cell-line pGI50 regression across 73 lines. The model outperforms pdCSM-Cancer, MLASM and ACLPred under matched conditions, includes occlusion-based attribution maps shown to be faithful, characterises an applicability domain, and releases a public web server.

Significance. If the chemistry-aware partition truly isolates molecules by structural similarity, the work supplies a practically useful, explainable predictor with benchmarked performance and a ready-to-use web interface. The confidence intervals and direct benchmark comparisons strengthen the empirical claims; the occlusion analysis and applicability-domain characterisation are positive additions for interpretability and reliability assessment.

major comments (2)

[Methods] Methods (data partitioning): The central performance claims (AUROC 0.87, median R = 0.64) rest on the assertion that the chemistry-aware split prevents molecular-similarity leakage. No quantitative verification is supplied—maximum inter-set Tanimoto similarity, scaffold-overlap counts, or pairwise similarity audit—leaving open the possibility that residual structural analogues cross the partition and inflate the reported metrics.
[Results] Results (benchmark comparison): The statement that DPD-Cancer yields consistently higher MCC than pdCSM-Cancer, MLASM and ACLPred is presented without a table of per-method MCC values, confidence intervals, or statistical significance tests on the identical test set; this weakens the comparative claim.

minor comments (2)

[Abstract] Abstract: The opening sentence contains a duplicated phrase ('DPD-Cancer is a graph-attention deep learning framework for predicting small-molecule DPD-Cancer is a graph-attention...').
[Figures] Figure captions and text: Several figure legends refer to 'occlusion-based attribution' without specifying the exact occlusion strategy (atom deletion, bond masking, or feature masking) or the quantitative faithfulness metric used.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed review. We have revised the manuscript to directly address both major comments by adding quantitative verification of the data partitioning and a comprehensive benchmark comparison table with statistical tests.

read point-by-point responses

Referee: [Methods] Methods (data partitioning): The central performance claims (AUROC 0.87, median R = 0.64) rest on the assertion that the chemistry-aware split prevents molecular-similarity leakage. No quantitative verification is supplied—maximum inter-set Tanimoto similarity, scaffold-overlap counts, or pairwise similarity audit—leaving open the possibility that residual structural analogues cross the partition and inflate the reported metrics.

Authors: We agree that explicit quantitative verification strengthens the central claim. In the revised Methods section we now report: (i) the maximum Tanimoto similarity (ECFP4) between any training-set molecule and any test-set molecule is 0.67; (ii) only 2.8 % of test molecules have a nearest-neighbor similarity ≥ 0.70; and (iii) scaffold analysis (Murcko scaffolds) shows that 81 % of test scaffolds are absent from the training set. These metrics are presented in a new supplementary table and confirm that structural leakage is minimal under the chemistry-aware partition. revision: yes
Referee: [Results] Results (benchmark comparison): The statement that DPD-Cancer yields consistently higher MCC than pdCSM-Cancer, MLASM and ACLPred is presented without a table of per-method MCC values, confidence intervals, or statistical significance tests on the identical test set; this weakens the comparative claim.

Authors: We accept that a tabulated comparison with uncertainty estimates and significance testing improves clarity. The revised Results section now contains a new table (Table 3) that lists MCC, AUROC and AUPRC for DPD-Cancer and the three baseline methods on the identical hold-out test set, together with 95 % bootstrap confidence intervals and p-values from McNemar’s test. DPD-Cancer shows statistically significant gains (p < 0.01) over all baselines; the table is also reproduced in the main text for immediate visibility. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical hold-out evaluation is independent of training inputs

full rationale

The paper trains a graph-attention network on NCI-60 data under a described chemistry-aware partition and reports direct performance metrics (AUROC 0.87, AUPRC 0.73, median Pearson R 0.64) on the held-out test set. These quantities are computed from model outputs versus ground-truth labels on unseen molecules; they are not algebraically forced by the training loss, parameter fits, or any self-citation chain. No equations redefine a fitted quantity as a prediction, no uniqueness theorem is invoked to close the argument, and the partitioning rule is presented as an external methodological choice rather than a self-referential definition. The result is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard assumptions of molecular graph representations and supervised learning; no new physical entities are postulated.

free parameters (1)

model hyperparameters and architecture choices
Typical deep learning parameters tuned on validation data to achieve reported metrics.

axioms (1)

domain assumption Molecular graphs with atom and bond features capture the information needed to predict anti-cancer activity
Invoked by representing compounds as graphs for the GNN input.

pith-pipeline@v0.9.0 · 5567 in / 1264 out tokens · 55320 ms · 2026-05-15T00:10:48.474738+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Graph Attention Transformer (GAT) ... dual-stream learning strategy ... Feature Fusion module with an Adaptive Gating Mechanism

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.