Prediction of Small Molecule Kinase Inhibitors for Chemotherapy Using Deep Learning

Christine Liu; Niranjan Balachandar; Winston Wang

arxiv: 1907.00329 · v1 · pith:52273ON5new · submitted 2019-06-30 · 🧬 q-bio.BM · cs.LG

Prediction of Small Molecule Kinase Inhibitors for Chemotherapy Using Deep Learning

Niranjan Balachandar , Christine Liu , Winston Wang This is my paper

Pith reviewed 2026-05-25 12:37 UTC · model grok-4.3

classification 🧬 q-bio.BM cs.LG

keywords kinase inhibitiondeep learningsmall moleculecancer therapymolecular fingerprintSMILESgraph convolutional networkvirtual screening

0 comments

The pith

Deep learning models trained on molecular data can predict which small molecules inhibit eight kinases relevant to cancer.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper trains deep learning models to forecast whether small molecules inhibit eight specific kinases that tumors may depend on. The models take three different inputs: fixed molecular fingerprints passed through multilayer perceptrons, SMILES strings processed by recurrent networks, and molecular graphs handled by graph convolutional networks. The goal is to replace exhaustive laboratory testing of millions of candidate compounds with computational predictions. A sympathetic reader would care because the work targets the shift from broad chemotherapy toward therapies matched to a tumor's particular pathway vulnerabilities.

Core claim

The project focuses on predicting the inhibition activity of small molecules targeting 8 different kinases using multiple deep learning models. We trained fingerprint-based MLPs and SMILES-based RNNs and molecular GCNs to accurately predict inhibitory activity targeting these 8 kinases.

What carries the argument

Three neural network architectures operating on molecular structure representations: multilayer perceptrons on fingerprints, recurrent networks on SMILES strings, and graph convolutional networks on molecular graphs, each learning to map structure to measured kinase inhibition.

If this is right

Computational predictions would allow screening of millions of compounds per kinase without physical synthesis and testing.
The approach supports matching inhibitors to the specific kinase dependencies of individual tumors.
Models trained on these eight kinases provide a template that can be retrained or extended when data for additional targets becomes available.
Early-stage drug discovery time and cost decrease by prioritizing only the most promising compounds for laboratory follow-up.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Ensemble use of the three architectures could yield more reliable rankings than any single model.
The same representation-and-network strategy could be tested on predicting selectivity across related kinases or on non-kinase targets.
If the models capture general chemical features, retraining on new assay data might transfer to predicting other molecular behaviors such as cellular permeability.

Load-bearing premise

The experimental inhibition measurements for the eight kinases are representative and large enough for the chosen models to learn structure-activity patterns that apply to new molecules.

What would settle it

Measuring inhibition for a large held-out collection of molecules whose experimental values show no correlation with the models' predictions.

Figures

Figures reproduced from arXiv: 1907.00329 by Christine Liu, Niranjan Balachandar, Winston Wang.

**Figure 2.** Figure 2: (a) RNN+MLP early fusion architecture and (b) RNN+MLP late fusion architecture [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗

**Figure 3.** Figure 3: After each convolution, the feature vectors for each atom represent a weighted average of [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: Graphic displaying the operations in a Weave module [11] [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: Testing set ROCs for each model for each of the 8 kinases using random splits. [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗

**Figure 6.** Figure 6: Testing set PRCs for each model for each of the 8 kinases using random splits. [PITH_FULL_IMAGE:figures/full_fig_p011_6.png] view at source ↗

**Figure 7.** Figure 7: Testing set ROCs for each model for each of the 8 kinases using cluster-based splits. [PITH_FULL_IMAGE:figures/full_fig_p012_7.png] view at source ↗

**Figure 8.** Figure 8: Testing set PRCs for each model for each of the 8 kinases using cluster-based splits. [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗

read the original abstract

The current state of cancer therapeutics has been moving away from one-size-fits-all cytotoxic chemotherapy, and towards a more individualized and specific approach involving the targeting of each tumor's genetic vulnerabilities. Different tumors, even of the same type, may be more reliant on certain cellular pathways more than others. With modern advancements in our understanding of cancer genome sequencing, these pathways can be discovered. Investigating each of the millions of possible small molecule inhibitors for each kinase in vitro, however, would be extremely expensive and time consuming. This project focuses on predicting the inhibition activity of small molecules targeting 8 different kinases using multiple deep learning models. We trained fingerprint-based MLPs and simplified molecular-input line-entry specification (SMILES)-based recurrent neural networks (RNNs) and molecular graph convolutional networks (GCNs) to accurately predict inhibitory activity targeting these 8 kinases.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This is a routine 2019 application of already-published molecular DL models to eight kinases with no metrics or comparisons shown.

read the letter

The colleague should know upfront that the paper trains fingerprint MLPs, SMILES RNNs, and graph CNNs on kinase inhibition data for eight targets and claims they predict activity accurately. That is the entire contribution as described. Nothing in the abstract indicates a new architecture, loss function, or biological mechanism. The framing around personalized cancer therapy is standard and the model choices were already common by mid-2019 for molecular property tasks. The work is honest in its setup and does not invent entities or hide circularity. The practical motivation for faster screening is reasonable. The main limitation is that the abstract supplies zero numbers, no baselines, no cross-validation scheme, and no error analysis, so the claim of accurate prediction cannot be checked. Without those details it is impossible to tell whether the models captured general structure-activity patterns or simply fit the particular assay sets. The weakest assumption in the abstract is that the eight-kinase data is representative enough for the chosen architectures to generalize; that assumption is left untested in the visible text. This kind of paper might interest someone already working on one of those specific kinases who wants a quick starting point, but it adds no new result worth citing and does not advance the methods. I would not bring it to reading group and would not cite it. It does not reach the threshold for serious peer review.

Referee Report

2 major / 1 minor

Summary. The manuscript claims that fingerprint-based MLPs, SMILES-based RNNs, and molecular GCNs were trained to accurately predict the inhibitory activity of small molecules against eight kinases relevant to targeted cancer chemotherapy.

Significance. If the models achieve strong generalization on held-out molecules, the work could support virtual screening to reduce the cost of identifying kinase inhibitors. The comparison across three distinct architectures is a constructive element, though no evidence of superior performance or parameter-free derivations is supplied.

major comments (2)

[Abstract] Abstract: the claim that the models were trained 'to accurately predict inhibitory activity' is unsupported because the abstract supplies no performance metrics (e.g., R², RMSE, AUC), no baseline comparisons, no cross-validation protocol, and no error analysis. Without these data the central claim cannot be evaluated.
[Abstract] Abstract: the weakest assumption—that the experimental inhibition data for the eight kinases are representative and sufficient for learning generalizable SAR—remains untested in the provided text; no details on dataset size, chemical diversity, or activity range are given.

minor comments (1)

The title refers to 'Chemotherapy' while the abstract correctly emphasizes targeted kinase inhibition; consider clarifying the distinction for precision.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the comments on our manuscript. We address each major comment below and agree that the abstract requires revision to better support the central claims.

read point-by-point responses

Referee: [Abstract] Abstract: the claim that the models were trained 'to accurately predict inhibitory activity' is unsupported because the abstract supplies no performance metrics (e.g., R², RMSE, AUC), no baseline comparisons, no cross-validation protocol, and no error analysis. Without these data the central claim cannot be evaluated.

Authors: We agree that the abstract does not contain quantitative metrics or validation details. The full manuscript reports 5-fold cross-validation results including AUC-ROC values for each kinase along with comparisons to baseline models. We will revise the abstract to include representative performance metrics and a brief statement on the cross-validation protocol. revision: yes
Referee: [Abstract] Abstract: the weakest assumption—that the experimental inhibition data for the eight kinases are representative and sufficient for learning generalizable SAR—remains untested in the provided text; no details on dataset size, chemical diversity, or activity range are given.

Authors: We acknowledge that the abstract omits dataset characteristics. The manuscript methods section describes the datasets drawn from public sources, including compound counts per kinase and activity value distributions. We will add a concise summary of dataset sizes and chemical diversity to the abstract. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper applies standard supervised deep learning (MLP on fingerprints, RNN on SMILES, GCN on graphs) to predict kinase inhibition from external experimental assay data. No equations, derivations, or self-referential definitions appear in the abstract or described content. No predictions reduce to fitted inputs by construction, no load-bearing self-citations, and no ansatzes or uniqueness claims are invoked. The central claim remains an empirical regression task whose validity depends on data representativeness rather than internal definitional closure.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no explicit free parameters, axioms, or invented entities are stated. The work implicitly assumes standard supervised-learning conditions (i.i.d. data, appropriate featurization, sufficient labels) that are not enumerated.

pith-pipeline@v0.9.0 · 5674 in / 1063 out tokens · 35955 ms · 2026-05-25T12:37:50.721823+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

19 extracted references · 19 canonical work pages · 3 internal anchors

[1]

Cancer statistics, 2018,

R. L. Siegal, K. D. Miller, and A. Jemal, “Cancer statistics, 2018,” in CA: A Cancer Journal for Clinicians, pp. 68:7–30, 2018

work page 2018
[2]

A review on the effects of current chemotherapy drugs and natural agents in treating non–small cell lung cancer,

C.-Y . Huang, D.-T. Ju, C.-F. Chang, P. M. Reddy, and B. K. Velmurugan, “A review on the effects of current chemotherapy drugs and natural agents in treating non–small cell lung cancer,” in BioMedicine, pp. 12–23, 2017

work page 2017
[3]

A comprehensive review of protein kinase inhibitors for cancer therapy,

R. Kannaiyan and D. Mahadevan, “A comprehensive review of protein kinase inhibitors for cancer therapy,” in Expert Review of Anticancer Therapy , pp. 1249–1270, 2018

work page 2018
[4]

Glycogen synthase kinase 3 beta: can it be a target for oral cancer,

R. Mishra, “Glycogen synthase kinase 3 beta: can it be a target for oral cancer,” in Molecular Cancer, 2010. 14

work page 2010
[5]

The hepatocyte growth factor receptor: Structure, function and pharmacological targeting in cancer,

F. Cecchi, D. C. Rabe, and D. P. Bottaro, “The hepatocyte growth factor receptor: Structure, function and pharmacological targeting in cancer,” in Molecular Cancer, p. 146–151, 2010

work page 2010
[6]

p38 map kinase: a convergence point in cancer therapy,

J. M. Olson and A. R. Hallahan, “p38 map kinase: a convergence point in cancer therapy,” in TRENDS in Molecular Medicine , pp. 125–129, 2010

work page 2010
[7]

Vascular endothelial growth factor receptors vegfr-2 and vegfr-3 are localized primarily to the vasculature in human primary solid cancers,

N. R. Smith, D. Baker, N. H. James, K. Ratcliffe, M. Jenkins, S. E. Ashton, G. Sproat, R. Swann, N. Gray, A. Ryan, J. M. Jurgensmeier, and C. Womack, “Vascular endothelial growth factor receptors vegfr-2 and vegfr-3 are localized primarily to the vasculature in human primary solid cancers,” in Human Cancer Biology, pp. 3548–61, 2010

work page 2010
[8]

Targeting cancer with small molecule kinase inhibitors,

J. Zhang, P. L. Yang, and N. S. Gray, “Targeting cancer with small molecule kinase inhibitors,” in Nature Reviews, pp. 28–39, 2009

work page 2009
[9]

Classifying “kinase inhibitor-likeness

H. Briem and J. Gunther, “Classifying “kinase inhibitor-likeness” by using machine-learning methods,” in Nature Reviews, pp. 558–566, 2005

work page 2005
[10]

Tsar datasheet version 3.3 accelrys inc,

“Tsar datasheet version 3.3 accelrys inc,” inhttp://www.3dsbiovia.com/products/datasheets/tsar .pdf

work page
[11]

Molecular Graph Convolutions: Moving Beyond Fingerprints

S. Kearnes, K. McCloskey, M. Berndl, V . Pande, and P. Riley, “Molecular graph convolutions: Moving beyond ﬁngerprints,” arXiv preprint arXiv:1603.00856, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[12]

Cancer inhibitors dataset

K. Xiao, “Cancer inhibitors dataset.” https://www.kaggle.com/xiaotawkaggle/ inhibitors

work page
[13]

Smiles. 2. algorithm for generation of unique smiles notation,

D. Weininger, A. Weininger, and J. L. Weining, “Smiles. 2. algorithm for generation of unique smiles notation,” Journal of Chemical Information and Modeling , 1989

work page 1989
[14]

Atom pairs as molecular features in structure-activity studies: deﬁnition and applications,

R. E. Carhart, D. H. Smith, and R. Venkataraghavan, “Atom pairs as molecular features in structure-activity studies: deﬁnition and applications,” Journal of Chemical Information and Computer Sciences, 1985

work page 1985
[15]

Extended-connectivity ﬁngerprints,

D. Rogers and M. Hahn, “Extended-connectivity ﬁngerprints,”Journal of Chemical Information and Modeling, 2010

work page 2010
[16]

Topological torsion: a new molecular descriptor for sar applications. comparison with other descriptors,

R. Nilakantan, N. Bauman, J. S. Dixon, and R. Venkataraghavan, “Topological torsion: a new molecular descriptor for sar applications. comparison with other descriptors,” Journal of Chemical Information and Computer Sciences , 1987

work page 1987
[17]

Adam: A Method for Stochastic Optimization

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014
[18]

Long short-term memory,

S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural computation, vol. 9, no. 8, pp. 1735–1780, 1997

work page 1997
[19]

Chemi-net: a graph convolutional network for accurate drug property prediction

K. Liu, X. Sun, L. Jia, J. Ma, H. Xing, J. Wu, H. Gao, Y . Sun, F. Boulnois, and J. Fan, “Chemi- net: A molecular graph convolutional network for accurate drug property prediction,” arXiv preprint arXiv:1803.06236, 2017. 15

work page internal anchor Pith review Pith/arXiv arXiv 2017

[1] [1]

Cancer statistics, 2018,

R. L. Siegal, K. D. Miller, and A. Jemal, “Cancer statistics, 2018,” in CA: A Cancer Journal for Clinicians, pp. 68:7–30, 2018

work page 2018

[2] [2]

A review on the effects of current chemotherapy drugs and natural agents in treating non–small cell lung cancer,

C.-Y . Huang, D.-T. Ju, C.-F. Chang, P. M. Reddy, and B. K. Velmurugan, “A review on the effects of current chemotherapy drugs and natural agents in treating non–small cell lung cancer,” in BioMedicine, pp. 12–23, 2017

work page 2017

[3] [3]

A comprehensive review of protein kinase inhibitors for cancer therapy,

R. Kannaiyan and D. Mahadevan, “A comprehensive review of protein kinase inhibitors for cancer therapy,” in Expert Review of Anticancer Therapy , pp. 1249–1270, 2018

work page 2018

[4] [4]

Glycogen synthase kinase 3 beta: can it be a target for oral cancer,

R. Mishra, “Glycogen synthase kinase 3 beta: can it be a target for oral cancer,” in Molecular Cancer, 2010. 14

work page 2010

[5] [5]

The hepatocyte growth factor receptor: Structure, function and pharmacological targeting in cancer,

F. Cecchi, D. C. Rabe, and D. P. Bottaro, “The hepatocyte growth factor receptor: Structure, function and pharmacological targeting in cancer,” in Molecular Cancer, p. 146–151, 2010

work page 2010

[6] [6]

p38 map kinase: a convergence point in cancer therapy,

J. M. Olson and A. R. Hallahan, “p38 map kinase: a convergence point in cancer therapy,” in TRENDS in Molecular Medicine , pp. 125–129, 2010

work page 2010

[7] [7]

Vascular endothelial growth factor receptors vegfr-2 and vegfr-3 are localized primarily to the vasculature in human primary solid cancers,

N. R. Smith, D. Baker, N. H. James, K. Ratcliffe, M. Jenkins, S. E. Ashton, G. Sproat, R. Swann, N. Gray, A. Ryan, J. M. Jurgensmeier, and C. Womack, “Vascular endothelial growth factor receptors vegfr-2 and vegfr-3 are localized primarily to the vasculature in human primary solid cancers,” in Human Cancer Biology, pp. 3548–61, 2010

work page 2010

[8] [8]

Targeting cancer with small molecule kinase inhibitors,

J. Zhang, P. L. Yang, and N. S. Gray, “Targeting cancer with small molecule kinase inhibitors,” in Nature Reviews, pp. 28–39, 2009

work page 2009

[9] [9]

Classifying “kinase inhibitor-likeness

H. Briem and J. Gunther, “Classifying “kinase inhibitor-likeness” by using machine-learning methods,” in Nature Reviews, pp. 558–566, 2005

work page 2005

[10] [10]

Tsar datasheet version 3.3 accelrys inc,

“Tsar datasheet version 3.3 accelrys inc,” inhttp://www.3dsbiovia.com/products/datasheets/tsar .pdf

work page

[11] [11]

Molecular Graph Convolutions: Moving Beyond Fingerprints

S. Kearnes, K. McCloskey, M. Berndl, V . Pande, and P. Riley, “Molecular graph convolutions: Moving beyond ﬁngerprints,” arXiv preprint arXiv:1603.00856, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[12] [12]

Cancer inhibitors dataset

K. Xiao, “Cancer inhibitors dataset.” https://www.kaggle.com/xiaotawkaggle/ inhibitors

work page

[13] [13]

Smiles. 2. algorithm for generation of unique smiles notation,

D. Weininger, A. Weininger, and J. L. Weining, “Smiles. 2. algorithm for generation of unique smiles notation,” Journal of Chemical Information and Modeling , 1989

work page 1989

[14] [14]

Atom pairs as molecular features in structure-activity studies: deﬁnition and applications,

R. E. Carhart, D. H. Smith, and R. Venkataraghavan, “Atom pairs as molecular features in structure-activity studies: deﬁnition and applications,” Journal of Chemical Information and Computer Sciences, 1985

work page 1985

[15] [15]

Extended-connectivity ﬁngerprints,

D. Rogers and M. Hahn, “Extended-connectivity ﬁngerprints,”Journal of Chemical Information and Modeling, 2010

work page 2010

[16] [16]

Topological torsion: a new molecular descriptor for sar applications. comparison with other descriptors,

R. Nilakantan, N. Bauman, J. S. Dixon, and R. Venkataraghavan, “Topological torsion: a new molecular descriptor for sar applications. comparison with other descriptors,” Journal of Chemical Information and Computer Sciences , 1987

work page 1987

[17] [17]

Adam: A Method for Stochastic Optimization

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014

work page internal anchor Pith review Pith/arXiv arXiv 2014

[18] [18]

Long short-term memory,

S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural computation, vol. 9, no. 8, pp. 1735–1780, 1997

work page 1997

[19] [19]

Chemi-net: a graph convolutional network for accurate drug property prediction

K. Liu, X. Sun, L. Jia, J. Ma, H. Xing, J. Wu, H. Gao, Y . Sun, F. Boulnois, and J. Fan, “Chemi- net: A molecular graph convolutional network for accurate drug property prediction,” arXiv preprint arXiv:1803.06236, 2017. 15

work page internal anchor Pith review Pith/arXiv arXiv 2017