OncoTraj: a public benchmark for longitudinal resistance prediction in EGFR-mutant non-small-cell lung cancer on osimertinib

Aarchi Singh Thakur; Abhijoy Sarkar

arxiv: 2606.11144 · v1 · pith:D6DK5Y3Gnew · submitted 2026-06-09 · 💻 cs.LG · q-bio.GN· q-bio.QM· stat.AP

OncoTraj: a public benchmark for longitudinal resistance prediction in EGFR-mutant non-small-cell lung cancer on osimertinib

Abhijoy Sarkar , Aarchi Singh Thakur This is my paper

Pith reviewed 2026-06-27 13:44 UTC · model grok-4.3

classification 💻 cs.LG q-bio.GNq-bio.QMstat.AP

keywords OncoTrajEGFR-mutant NSCLCosimertinib resistancelongitudinal predictionbenchmark datasetsingle-timepoint NGSTP53 co-mutationresistance mechanisms

0 comments

The pith

Single-timepoint tissue NGS data fails to predict osimertinib resistance above chance in EGFR-mutant NSCLC.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

OncoTraj introduces the first public benchmark for predicting resistance to first-line osimertinib in EGFR-mutant non-small-cell lung cancer, harmonizing 813 patients from three clinical-genomic sources into locked tasks. The tasks cover binary progression at a 12-month landmark, regression of time-to-progression, and six-class classification of the dominant resistance mechanism, all supplied with audited no-leakage splits. Every reference model using only single-timepoint snapshot features performs at chance levels on within-source evaluation, with the uniform ceiling across model classes pointing to the input modality itself as the limit. The benchmark still recovers a literature-consistent TP53 co-mutation signal and supplies an open harness to test future serial-ctDNA-enriched versions.

Core claim

With v1's single-timepoint snapshot features, no task clears chance on clean within-source evaluation: the uniformity of this ceiling across every model class localizes the limit to the input modality (single-snapshot tissue NGS rather than serial ctDNA), not the algorithm. The benchmark does recover a reproducible literature-consistent association: TP53 co-mutation raises the 12-month progression rate from 29% to 59% cohort-wide. OncoTraj establishes a reproducible, leakage-audited baseline and converts the modality limit into concrete design requirements for a serial-ctDNA-enriched v2.

What carries the argument

OncoTraj benchmark of harmonized multi-source patient records with three locked tasks and audited no-leakage train/validation/test splits.

If this is right

Single-timepoint tissue NGS inputs will keep all models at chance on the three resistance tasks.
Serial ctDNA enrichment is required to move beyond the current performance ceiling in v2.
The TP53 co-mutation association serves as a positive control confirming the harmonized data preserves known biology.
The released splits and evaluation harness create a fixed standard for comparing new algorithms.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Clinical tools for guiding osimertinib treatment may need repeated liquid-biopsy sampling rather than one-time tissue sequencing.
The same benchmarking approach could be applied to other targeted therapies that face predictable clonal evolution.
Adding imaging or routine clinical variables to the feature set might surface additional signals even before serial ctDNA arrives.

Load-bearing premise

The three source datasets can be accurately harmonized into patient-level records with audited no-leakage train/validation/test splits and correctly labeled resistance mechanisms.

What would settle it

A model that exceeds chance performance on any of the three within-source test sets when restricted to the v1 single-timepoint features would falsify the claim that the performance ceiling is set by the input modality.

Figures

Figures reproduced from arXiv: 2606.11144 by Aarchi Singh Thakur, Abhijoy Sarkar.

**Figure 1.** Figure 1: OncoTraj v1 study design: three real-world clinical-genomic sources are harmonized into a unified four-table schema, producing the 813-patient first-line-osimertinib cohort. Patient-level 70/15/15 splits feed the three locked tasks (A, B, C) against six reference baselines. the ceiling of the single-snapshot regime; v2 is where methods built for the serial regime can demonstrate they beat it. We are delibe… view at source ↗

**Figure 2.** Figure 2: OncoTraj v1 cohort composition. (a) Patient counts by source and EGFR variant class. (b) Source-level totals across the 813-patient v1 cohort. past day 365 without progression), and the patient is excluded if censored before day 365 (insufficient follow-up to determine the label). This converts the degenerate “ever-progresses” form, near-useless on a cohort in which essentially every patient eventually pro… view at source ↗

**Figure 3.** Figure 3: Task A (12-month landmark) test-set discrimination and calibration error per baseline, source flags removed. (a) ROC-AUC; (b) Brier score. n=110. 0.0 0.5 1.0 Predicted probability 0.0 0.5 1.0 Observed frequency Logistic ECE = 0.071 0.0 0.5 1.0 Predicted probability Random forest ECE = 0.041 0.0 0.5 1.0 Predicted probability XGBoost ECE = 0.159 0.0 0.5 1.0 Predicted probability Majority ECE = 0.017 Task A r… view at source ↗

**Figure 4.** Figure 4: Task A (12-month landmark) reliability diagrams per baseline on the test split. ECE is annotated per panel. Bins with zero patients are omitted; marker area encodes patient count. (logistic AUC 0.680 [0.581, 0.781], random forest 0.678, Brier 0.214, ECE 0.041) is partly cross-source structure rather than within-cohort discrimination, and should be read as an upper bound. The associated covariate is TP53 co… view at source ↗

**Figure 5.** Figure 5: TP53 co-mutation lifts the 12-month progression rate consistently across the cohort and within each source. The gradient is largest in the overall cohort, attenuates but remains directional within MSKCHORD, and is preserved in the smaller GENIE BPC slice. central scientific content of Task A: a fixed-horizon resistance label on this cohort is learnable to a modest but real degree, and what it learns is th… view at source ↗

**Figure 6.** Figure 6: Task A discrimination versus calibration trade-off across the four classical baselines. The ‘ideal corner’ (top-left) is high ROC-AUC, low ECE. Majority sits at AUC 0.5 by construction but is well-calibrated; logistic and random forest reach AUC ~0.68 with random forest also the best-calibrated discriminating model (ECE 0.041), while XGBoost trades discrimination for markedly worse calibration. 5.4 Source-… view at source ↗

read the original abstract

Resistance to first-line osimertinib in EGFR-mutant non-small-cell lung cancer (NSCLC) is the canonical example of predictable clonal evolution under therapeutic pressure, yet no public benchmark exists for training or evaluating computational models on the corresponding longitudinal patient trajectories. We introduce OncoTraj, a public benchmark of 813 EGFR-mutant NSCLC patients receiving first-line osimertinib, harmonized from three real-world clinical-genomic sources: MSK-CHORD (672 patients), AACR Project GENIE BPC NSCLC (34 patients), and the FLAURA molecular-resistance supplement (107 patients). OncoTraj defines three locked tasks: (A) binary classification of progression by a fixed 12-month landmark, (B) regression of time-to-first-progression in days, and (C) six-class classification of the dominant resistance mechanism. We release the harmonized dataset, patient-level train/validation/test splits with an audited no-leakage guarantee, an open-source evaluation harness, and six reference baselines spanning a majority-class predictor, logistic regression, random forest, XGBoost, an LSTM, and a multi-task transformer. With v1's single-timepoint snapshot features, no task clears chance on clean within-source evaluation: the uniformity of this ceiling across every model class localizes the limit to the input modality (single-snapshot tissue NGS rather than serial ctDNA), not the algorithm. The benchmark does recover a reproducible literature-consistent association: TP53 co-mutation raises the 12-month progression rate from 29% to 59% cohort-wide. OncoTraj establishes a reproducible, leakage-audited baseline and converts the modality limit into concrete design requirements for a serial-ctDNA-enriched v2.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

OncoTraj releases a usable public benchmark with locked tasks and audited splits for osimertinib resistance, but the claim that single-timepoint NGS is the binding limit still needs the harmonization details to hold up.

read the letter

The main takeaway is that this paper puts out OncoTraj, a harmonized set of 813 EGFR-mutant NSCLC patients on first-line osimertinib drawn from MSK-CHORD, GENIE BPC, and FLAURA. It defines three concrete tasks (12-month progression, time-to-progression, and resistance mechanism classification), releases the patient-level splits with a no-leakage audit, and ships an evaluation harness plus six baselines.

What works is the release itself. The data, splits, and code are public, the TP53 co-mutation signal comes through as expected, and every model class (including LSTM and transformer) hits the same ceiling on the single-snapshot features. That pattern makes the modality point concrete rather than hand-wavy.

The soft spot is the step that turns the uniform failure into a claim about input modality. The abstract states the harmonization and audit happened, but the concrete mapping rules, conflict resolution, and audit evidence are not visible here. If label noise or cross-source leakage crept in during patient-level alignment, the ceiling could be an artifact of the supervision rather than the single-timepoint NGS. The stress-test note flags exactly this, and the abstract alone does not close it.

This is for groups building or benchmarking longitudinal predictors in clinical oncology who need a reproducible starting point. A reader focused on serial ctDNA or multi-timepoint modeling would get the design requirements it lays out. It deserves peer review because the benchmark and tasks are well-specified and the release is real, even if the data-prep section will need extra scrutiny.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces OncoTraj, a public benchmark of 813 EGFR-mutant NSCLC patients on first-line osimertinib, harmonized from MSK-CHORD (672), AACR GENIE BPC (34), and FLAURA (107). It defines three locked tasks—(A) 12-month progression binary classification, (B) time-to-first-progression regression, (C) six-class resistance mechanism classification—releases the harmonized data, audited no-leakage patient-level splits, and an open evaluation harness, and reports six baselines (majority, LR, RF, XGBoost, LSTM, multi-task transformer). With v1 single-timepoint snapshot features, no task exceeds chance on within-source evaluation; this uniform ceiling is attributed to the input modality rather than the algorithms. The benchmark recovers a literature-consistent TP53 co-mutation association (12-month progression 29% to 59%).

Significance. If the harmonization and splits hold, OncoTraj supplies a valuable, reproducible public resource that converts an empirical modality limit into concrete design requirements for serial-ctDNA v2. The explicit release of the dataset, splits, and harness is a clear strength that enables community follow-up.

major comments (2)

[Abstract and data harmonization description] The central claim that uniform failure across all model classes localizes the performance ceiling to single-snapshot tissue NGS (rather than algorithm) is load-bearing on the correctness of resistance-mechanism labels and the no-leakage property of the splits. The manuscript asserts an 'audited no-leakage guarantee' and states the three sources but supplies no concrete mapping rules, conflict-resolution procedure for resistance labels, or audit evidence in the provided text (Abstract and Data harmonization description).
[Results] § on results: the claim that 'no task clears chance' is presented without reported error bars, exact chance baselines per task, or within-source vs. cross-source breakdowns, which are required to substantiate the modality-limit interpretation.

minor comments (1)

[Abstract] The abstract states the patient counts per source but does not tabulate the final per-task label distributions or missingness rates after harmonization; a supplementary table would improve reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive review and for recognizing the potential value of OncoTraj as a public resource. We address each major comment below and will revise the manuscript accordingly.

read point-by-point responses

Referee: [Abstract and data harmonization description] The central claim that uniform failure across all model classes localizes the performance ceiling to single-snapshot tissue NGS (rather than algorithm) is load-bearing on the correctness of resistance-mechanism labels and the no-leakage property of the splits. The manuscript asserts an 'audited no-leakage guarantee' and states the three sources but supplies no concrete mapping rules, conflict-resolution procedure for resistance labels, or audit evidence in the provided text (Abstract and Data harmonization description).

Authors: We agree that the main text does not currently contain the requested concrete details. The full mapping rules, conflict-resolution procedures for resistance labels, and audit documentation are present in the supplementary materials and the public data-release repository. In revision we will expand the Data harmonization section to include explicit mapping rules, representative examples of label conflicts and their resolution, and a concise summary of the audit steps performed, thereby placing the supporting evidence directly in the manuscript. revision: yes
Referee: [Results] § on results: the claim that 'no task clears chance' is presented without reported error bars, exact chance baselines per task, or within-source vs. cross-source breakdowns, which are required to substantiate the modality-limit interpretation.

Authors: We acknowledge the omission. The revised manuscript will add 95% confidence intervals (via bootstrapping) to all reported metrics, explicit chance-level baselines computed per task (majority-class accuracy for the two classification tasks and mean-value prediction for regression), and within-source versus cross-source performance tables. These additions will be placed in the Results section and associated supplementary tables to strengthen the modality-limit interpretation. revision: yes

Circularity Check

0 steps flagged

Empirical benchmark release with no derivation chain or self-referential reductions

full rationale

The paper presents a harmonized dataset, locked tasks, and baseline evaluations on released splits. All claims rest on empirical performance measurements rather than any derivation, equation, or parameter fit that reduces to its own inputs by construction. No self-citations, ansatzes, or uniqueness theorems are invoked as load-bearing steps. The uniformity of baseline failure is reported as an observation on the provided data, not a mathematical necessity derived from the inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The benchmark rests on the assumption that real-world clinical-genomic records from three sources can be merged without introducing systematic bias or leakage; no free parameters or invented entities are introduced.

axioms (1)

domain assumption Data from MSK-CHORD, AACR Project GENIE BPC NSCLC, and FLAURA can be harmonized into consistent patient-level records with accurate resistance mechanism labels.
The abstract states the dataset is harmonized from these three sources and defines the six-class resistance task.

pith-pipeline@v0.9.1-grok · 5871 in / 1294 out tokens · 24239 ms · 2026-06-27T13:44:27.398906+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

20 extracted references · 17 canonical work pages

[1]

Siegel, Kimberly D

Rebecca L. Siegel, Kimberly D. Miller, Nikita Sandeep Wagle, and Ahmedin Jemal. Cancer statistics, 2023.CA: A Cancer Journal for Clinicians, 73(1):17–48, 2023. doi: 10.3322/caac. 21763. PMID: 36633525

work page doi:10.3322/caac 2023
[2]

Gray, Si-Min Lee, Rachel Hodge, Marcelo Marotti, Yuri Rukazenkov, and Suresh S

Jean-Charles Soria, Yuichiro Ohe, Johan Vansteenkiste, Thanyanan Reungwetwattana, Busya- mas Chewaskulyong, Ki Hyeong Lee, Arunee Dechaphunkul, Fumio Imamura, Naoyuki Nogami, Takayasu Kurata, Isamu Okamoto, Caicun Zhou, Byoung Chul Cho, Ying Cheng, Eun Kyung Cho, Pei Jye Voon, David Planchard, Wu-Chou Su, Jhanelle E. Gray, Si-Min Lee, Rachel Hodge, Marcel...

work page doi:10.1056/nejmoa1713137 2018
[3]

Ramalingam, Johan Vansteenkiste, David Planchard, Byoung Chul Cho, Jhanelle E

Suresh S. Ramalingam, Johan Vansteenkiste, David Planchard, Byoung Chul Cho, Jhanelle E. Gray, Yuichiro Ohe, Caicun Zhou, Thanyanan Reungwetwattana, Ying Cheng, Busyamas Chewaskulyong, Riyaz Shah, Manuel Cobo, Ki Hyeong Lee, Parneet Cheema, Marcello Tiseo, Thomas John, Meng-Chih Lin, Fumio Imamura, Takayasu Kurata, Alexander Todd, Rachel Hodge, Matilde Sa...

work page doi:10.1056/nejmoa1913662 2020
[4]

Gray, Ying Cheng, Yuichiro Ohe, Fumio Imamura, Byoung Chul Cho, Meng-Chih Lin, Margarita Majem, Riyaz Shah, Yuri Rukazenkov, Alexander Todd, Alek- sandra Markovets, J

Juliann Chmielecki, Jhanelle E. Gray, Ying Cheng, Yuichiro Ohe, Fumio Imamura, Byoung Chul Cho, Meng-Chih Lin, Margarita Majem, Riyaz Shah, Yuri Rukazenkov, Alexander Todd, Alek- sandra Markovets, J. Carl Barrett, Juliann Chmielecki, and Suresh S. Ramalingam. Candidate mechanisms of acquired resistance to first-line osimertinib in EGFR-mutated advanced no...

work page doi:10.1038/s41467-023-35961-y 2023
[5]

Mok, Yi-Long Wu, Myung-Ju Ahn, Marina C

Tony S. Mok, Yi-Long Wu, Myung-Ju Ahn, Marina C. Garassino, Hye Ryun Kim, Suresh S. Ramalingam, Frances A. Shepherd, Yuanbin He, Hiroaki Akamatsu, Willemijn S.M.E. Theelen, Chee Khoon Lee, Martin Sebastian, Arnoud Templeton, Helen Mann, Marcelo Marotti, Ser- ban Ghiorghiu, and Vassiliki A. Papadimitrakopoulou. Osimertinib or platinum-pemetrexed in EGFR T7...
[6]

PMID: 27959700

doi: 10.1056/NEJMoa1612674. PMID: 27959700. AURA3. ClinicalTrials.gov identifier: NCT02151981

work page doi:10.1056/nejmoa1612674
[7]

Gray, Myung-Ju Ahn, Geoffrey R

Jhanelle E. Gray, Myung-Ju Ahn, Geoffrey R. Oxnard, Frances A. Shepherd, Fumio Imamura, Ying Cheng, Isamu Okamoto, Byoung Chul Cho, Meng-Chih Lin, Yi-Long Wu, Marcelo Marotti, Alexander Todd, Tarjinder Sahota, Ryan Hartmaier, Ji-Youn Han, Tony Mok, and Suresh S. Ramalingam. Early clearance of plasma EGFR mutations as a predictor of outcome on osimertinib ...

work page doi:10.1158/1078-0432 2023
[8]

Wilson, Nicholas McGranahan, Nicolai J

Mariam Jamal-Hanjani, Gareth A. Wilson, Nicholas McGranahan, Nicolai J. Birkbak, Thomas B.K. Watkins, Selvaraju Veeriah, Seema Shafi, Diana H. Johnson, Richard Mit- ter, Rachel Rosenthal, Maximilian Salm, Stuart Horswell, Mickael Escudero, Nik Matthews, Andrew Rowan, Tim Chambers, David A. Moore, Samra Turajlic, Hang Xu, Siow-Ming Lee, Martin D. Forster, ...

work page doi:10.1056/nejmoa1616288 2017
[9]

Frankell, Michelle Dietzen, Maise Al Bakir, Emilia L

Alexander M. Frankell, Michelle Dietzen, Maise Al Bakir, Emilia L. Lim, Takahiro Karasaki, Sophia Ward, Selvaraju Veeriah, Emma Colliver, Ariana Huebner, Abigail Bunkum, et al. The evolution of lung cancer and impact of subclonal selection in TRACERx.Nature, 616(7957): 525–533, 2023. doi: 10.1038/s41586-023-05783-5. PMID: 37046096. TRACERx evolution analysis

work page doi:10.1038/s41586-023-05783-5 2023
[10]

Maron, Mohamed Ahmed, Susie Kim, Mono Pirun, Walid K

Justin Jee, Christopher Fong, Karl Pichotta, Thinh Ngoc Tran, Anisha Luthra, Michele Waters, Chenlian Fu, Mirella Altoe, Si-Yang Liu, Steven B. Maron, Mohamed Ahmed, Susie Kim, Mono Pirun, Walid K. Chatila, Caroline Bourque, Larisa Magoc, Pier Bose, Helena A. Yu, Mark T.A. Donoghue, Matthew D. Hellmann, Nikolaus Schultz, Michael F. Berger, Pedram Razavi, ...

work page doi:10.1038/s41586-024-08167-5 2024
[11]

Choudhury, Jessica A

Noura J. Choudhury, Jessica A. Lavery, Samantha Brown, Ino de Bruijn, Justin Jee, Thinh Ngoc Tran, Hira Rizvi, Kathryn C. Arbour, Karissa Whiting, Gregory J. Riely, Philippe L. Bedard, Lillian M. Smyth, Mary Mahler, Helena A. Yu, Wungki Tan, Nikolaus Schultz, Aaron Bell, et al. The GENIE BPC NSCLC cohort: a real-world repository integrating standardized c...

work page doi:10.1158/1078-0432.ccr-23-0580 2023
[12]

Ross A. Soo, Urania Dafni, Ji-Youn Han, Byoung Chul Cho, Ernest Nadal, Chong Ming Yeo, Enric Carcereny, Javier de Castro, Maria Angeles Sala, Linda Coate, Mariano Provencio, Christian Britschgi, Patrick Vagenknecht, Georgia Dimopoulou, Roswitha Kammler, Stephen P. Finn, Solange Peters, and Rolf A. Stahel. ctDNA dynamics and mechanisms of acquired resistan...

work page doi:10.1158/1078-0432.ccr-24-0932 2024
[13]

Benthe Muntinghe-Wagenaar, Pim Rozendal, Adrianus J

Fenneke Zwierenga, M. Benthe Muntinghe-Wagenaar, Pim Rozendal, Adrianus J. de Langen, Lizza E. L. Hendriks, Michel van den Heuvel, Cor van der Leest, Sayed M. S. Hashemi, Paul van der Leest, T. Jeroen N. Hiltermann, Ed Schuuring, and Anthonie J. van der Wekken. Circulating tumor DNA in advanced EGFRex20+ NSCLC: Concordance with tissue biopsy, monitoring o...

work page doi:10.1007/s11523-025-01153-5 2025
[14]

Maoxin Ran, Shao-Lin Zhang, and Kin Yip Tam. Identifying meaningful drug response biomarkers from public pharmacogenomic datasets with biologically informed interpretable neural networks.Computational Biology and Chemistry, 120(Pt 1):108669, 2025. doi: 10.1016/j. compbiolchem.2025.108669. PMID: 40914994. KEGG-informed sparse neural network identifies TP53...

work page doi:10.1016/j 2025
[15]

, author Dong, W

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. ImageNet: A large-scale hierarchical image database. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 248–255, 2009. doi: 10.1109/CVPR.2009.5206848

work page doi:10.1109/cvpr.2009.5206848 2009
[16]

Pedersen, Richard Judson, and Krzysztof Fidelis

John Moult, Jan T. Pedersen, Richard Judson, and Krzysztof Fidelis. A large-scale experiment to assess protein structure prediction methods.Proteins: Structure, Function, and Genetics, 23(3):ii–v, 1995. doi: 10.1002/prot.340230303. PMID: 8710822. CASP founding paper

work page doi:10.1002/prot.340230303 1995
[17]

Alistair E. W. Johnson, Lucas Bulgarelli, Lu Shen, Alvin Gayles, Ayad Shammout, Steven Horng, Tom J. Pollard, Sicheng Hao, Benjamin Moody, Brian Gow, Li-Wei H. Lehman, Leo Anthony Celi, and Roger G. Mark. MIMIC-IV, a freely accessible electronic health record dataset.Scientific Data, 10(1):1, 2023. doi: 10.1038/s41597-022-01899-x. PMID: 36596836

work page doi:10.1038/s41597-022-01899-x 2023
[18]

Stewart, and Jimeng Sun

Edward Choi, Mohammad Taha Bahadori, Andy Schuetz, Walter F. Stewart, and Jimeng Sun. Doctor AI: Predicting clinical events via recurrent neural networks. InProceedings of the Machine Learning for Healthcare Conference (MLHC), volume 56, pages 301–318, 2016

2016
[19]

New therapeutic approaches for EGFR-mutated non-small cell lung cancer in the osimertinib era.Cancer Treatment and Research Communications, 44:100945, 2025

Jaime Rubio-Pérez, Rocío Hernández, Cecilia Santolaya, et al. New therapeutic approaches for EGFR-mutated non-small cell lung cancer in the osimertinib era.Cancer Treatment and Research Communications, 44:100945, 2025. doi: 10.1016/j.ctarc.2025.100945. PMID: 40414016. TP53 co-mutation associated with reduced osimertinib PFS. 23

work page doi:10.1016/j.ctarc.2025.100945 2025
[20]

Improving reproducibility in machine learning research (a report from the NeurIPS 2019 reproducibility program).Journal of Machine Learning Research, 22(164):1–20, 2021

Joelle Pineau, Philippe Vincent-Lamarre, Koustuv Sinha, Vincent Larivière, Alina Beygelzimer, Florence d’Alché Buc, Emily Fox, and Hugo Larochelle. Improving reproducibility in machine learning research (a report from the NeurIPS 2019 reproducibility program).Journal of Machine Learning Research, 22(164):1–20, 2021. 24

2019

[1] [1]

Siegel, Kimberly D

Rebecca L. Siegel, Kimberly D. Miller, Nikita Sandeep Wagle, and Ahmedin Jemal. Cancer statistics, 2023.CA: A Cancer Journal for Clinicians, 73(1):17–48, 2023. doi: 10.3322/caac. 21763. PMID: 36633525

work page doi:10.3322/caac 2023

[2] [2]

Gray, Si-Min Lee, Rachel Hodge, Marcelo Marotti, Yuri Rukazenkov, and Suresh S

Jean-Charles Soria, Yuichiro Ohe, Johan Vansteenkiste, Thanyanan Reungwetwattana, Busya- mas Chewaskulyong, Ki Hyeong Lee, Arunee Dechaphunkul, Fumio Imamura, Naoyuki Nogami, Takayasu Kurata, Isamu Okamoto, Caicun Zhou, Byoung Chul Cho, Ying Cheng, Eun Kyung Cho, Pei Jye Voon, David Planchard, Wu-Chou Su, Jhanelle E. Gray, Si-Min Lee, Rachel Hodge, Marcel...

work page doi:10.1056/nejmoa1713137 2018

[3] [3]

Ramalingam, Johan Vansteenkiste, David Planchard, Byoung Chul Cho, Jhanelle E

Suresh S. Ramalingam, Johan Vansteenkiste, David Planchard, Byoung Chul Cho, Jhanelle E. Gray, Yuichiro Ohe, Caicun Zhou, Thanyanan Reungwetwattana, Ying Cheng, Busyamas Chewaskulyong, Riyaz Shah, Manuel Cobo, Ki Hyeong Lee, Parneet Cheema, Marcello Tiseo, Thomas John, Meng-Chih Lin, Fumio Imamura, Takayasu Kurata, Alexander Todd, Rachel Hodge, Matilde Sa...

work page doi:10.1056/nejmoa1913662 2020

[4] [4]

Gray, Ying Cheng, Yuichiro Ohe, Fumio Imamura, Byoung Chul Cho, Meng-Chih Lin, Margarita Majem, Riyaz Shah, Yuri Rukazenkov, Alexander Todd, Alek- sandra Markovets, J

Juliann Chmielecki, Jhanelle E. Gray, Ying Cheng, Yuichiro Ohe, Fumio Imamura, Byoung Chul Cho, Meng-Chih Lin, Margarita Majem, Riyaz Shah, Yuri Rukazenkov, Alexander Todd, Alek- sandra Markovets, J. Carl Barrett, Juliann Chmielecki, and Suresh S. Ramalingam. Candidate mechanisms of acquired resistance to first-line osimertinib in EGFR-mutated advanced no...

work page doi:10.1038/s41467-023-35961-y 2023

[5] [5]

Mok, Yi-Long Wu, Myung-Ju Ahn, Marina C

Tony S. Mok, Yi-Long Wu, Myung-Ju Ahn, Marina C. Garassino, Hye Ryun Kim, Suresh S. Ramalingam, Frances A. Shepherd, Yuanbin He, Hiroaki Akamatsu, Willemijn S.M.E. Theelen, Chee Khoon Lee, Martin Sebastian, Arnoud Templeton, Helen Mann, Marcelo Marotti, Ser- ban Ghiorghiu, and Vassiliki A. Papadimitrakopoulou. Osimertinib or platinum-pemetrexed in EGFR T7...

[6] [6]

PMID: 27959700

doi: 10.1056/NEJMoa1612674. PMID: 27959700. AURA3. ClinicalTrials.gov identifier: NCT02151981

work page doi:10.1056/nejmoa1612674

[7] [7]

Gray, Myung-Ju Ahn, Geoffrey R

Jhanelle E. Gray, Myung-Ju Ahn, Geoffrey R. Oxnard, Frances A. Shepherd, Fumio Imamura, Ying Cheng, Isamu Okamoto, Byoung Chul Cho, Meng-Chih Lin, Yi-Long Wu, Marcelo Marotti, Alexander Todd, Tarjinder Sahota, Ryan Hartmaier, Ji-Youn Han, Tony Mok, and Suresh S. Ramalingam. Early clearance of plasma EGFR mutations as a predictor of outcome on osimertinib ...

work page doi:10.1158/1078-0432 2023

[8] [8]

Wilson, Nicholas McGranahan, Nicolai J

Mariam Jamal-Hanjani, Gareth A. Wilson, Nicholas McGranahan, Nicolai J. Birkbak, Thomas B.K. Watkins, Selvaraju Veeriah, Seema Shafi, Diana H. Johnson, Richard Mit- ter, Rachel Rosenthal, Maximilian Salm, Stuart Horswell, Mickael Escudero, Nik Matthews, Andrew Rowan, Tim Chambers, David A. Moore, Samra Turajlic, Hang Xu, Siow-Ming Lee, Martin D. Forster, ...

work page doi:10.1056/nejmoa1616288 2017

[9] [9]

Frankell, Michelle Dietzen, Maise Al Bakir, Emilia L

Alexander M. Frankell, Michelle Dietzen, Maise Al Bakir, Emilia L. Lim, Takahiro Karasaki, Sophia Ward, Selvaraju Veeriah, Emma Colliver, Ariana Huebner, Abigail Bunkum, et al. The evolution of lung cancer and impact of subclonal selection in TRACERx.Nature, 616(7957): 525–533, 2023. doi: 10.1038/s41586-023-05783-5. PMID: 37046096. TRACERx evolution analysis

work page doi:10.1038/s41586-023-05783-5 2023

[10] [10]

Maron, Mohamed Ahmed, Susie Kim, Mono Pirun, Walid K

Justin Jee, Christopher Fong, Karl Pichotta, Thinh Ngoc Tran, Anisha Luthra, Michele Waters, Chenlian Fu, Mirella Altoe, Si-Yang Liu, Steven B. Maron, Mohamed Ahmed, Susie Kim, Mono Pirun, Walid K. Chatila, Caroline Bourque, Larisa Magoc, Pier Bose, Helena A. Yu, Mark T.A. Donoghue, Matthew D. Hellmann, Nikolaus Schultz, Michael F. Berger, Pedram Razavi, ...

work page doi:10.1038/s41586-024-08167-5 2024

[11] [11]

Choudhury, Jessica A

Noura J. Choudhury, Jessica A. Lavery, Samantha Brown, Ino de Bruijn, Justin Jee, Thinh Ngoc Tran, Hira Rizvi, Kathryn C. Arbour, Karissa Whiting, Gregory J. Riely, Philippe L. Bedard, Lillian M. Smyth, Mary Mahler, Helena A. Yu, Wungki Tan, Nikolaus Schultz, Aaron Bell, et al. The GENIE BPC NSCLC cohort: a real-world repository integrating standardized c...

work page doi:10.1158/1078-0432.ccr-23-0580 2023

[12] [12]

Ross A. Soo, Urania Dafni, Ji-Youn Han, Byoung Chul Cho, Ernest Nadal, Chong Ming Yeo, Enric Carcereny, Javier de Castro, Maria Angeles Sala, Linda Coate, Mariano Provencio, Christian Britschgi, Patrick Vagenknecht, Georgia Dimopoulou, Roswitha Kammler, Stephen P. Finn, Solange Peters, and Rolf A. Stahel. ctDNA dynamics and mechanisms of acquired resistan...

work page doi:10.1158/1078-0432.ccr-24-0932 2024

[13] [13]

Benthe Muntinghe-Wagenaar, Pim Rozendal, Adrianus J

Fenneke Zwierenga, M. Benthe Muntinghe-Wagenaar, Pim Rozendal, Adrianus J. de Langen, Lizza E. L. Hendriks, Michel van den Heuvel, Cor van der Leest, Sayed M. S. Hashemi, Paul van der Leest, T. Jeroen N. Hiltermann, Ed Schuuring, and Anthonie J. van der Wekken. Circulating tumor DNA in advanced EGFRex20+ NSCLC: Concordance with tissue biopsy, monitoring o...

work page doi:10.1007/s11523-025-01153-5 2025

[14] [14]

Maoxin Ran, Shao-Lin Zhang, and Kin Yip Tam. Identifying meaningful drug response biomarkers from public pharmacogenomic datasets with biologically informed interpretable neural networks.Computational Biology and Chemistry, 120(Pt 1):108669, 2025. doi: 10.1016/j. compbiolchem.2025.108669. PMID: 40914994. KEGG-informed sparse neural network identifies TP53...

work page doi:10.1016/j 2025

[15] [15]

, author Dong, W

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. ImageNet: A large-scale hierarchical image database. InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 248–255, 2009. doi: 10.1109/CVPR.2009.5206848

work page doi:10.1109/cvpr.2009.5206848 2009

[16] [16]

Pedersen, Richard Judson, and Krzysztof Fidelis

John Moult, Jan T. Pedersen, Richard Judson, and Krzysztof Fidelis. A large-scale experiment to assess protein structure prediction methods.Proteins: Structure, Function, and Genetics, 23(3):ii–v, 1995. doi: 10.1002/prot.340230303. PMID: 8710822. CASP founding paper

work page doi:10.1002/prot.340230303 1995

[17] [17]

Alistair E. W. Johnson, Lucas Bulgarelli, Lu Shen, Alvin Gayles, Ayad Shammout, Steven Horng, Tom J. Pollard, Sicheng Hao, Benjamin Moody, Brian Gow, Li-Wei H. Lehman, Leo Anthony Celi, and Roger G. Mark. MIMIC-IV, a freely accessible electronic health record dataset.Scientific Data, 10(1):1, 2023. doi: 10.1038/s41597-022-01899-x. PMID: 36596836

work page doi:10.1038/s41597-022-01899-x 2023

[18] [18]

Stewart, and Jimeng Sun

Edward Choi, Mohammad Taha Bahadori, Andy Schuetz, Walter F. Stewart, and Jimeng Sun. Doctor AI: Predicting clinical events via recurrent neural networks. InProceedings of the Machine Learning for Healthcare Conference (MLHC), volume 56, pages 301–318, 2016

2016

[19] [19]

New therapeutic approaches for EGFR-mutated non-small cell lung cancer in the osimertinib era.Cancer Treatment and Research Communications, 44:100945, 2025

Jaime Rubio-Pérez, Rocío Hernández, Cecilia Santolaya, et al. New therapeutic approaches for EGFR-mutated non-small cell lung cancer in the osimertinib era.Cancer Treatment and Research Communications, 44:100945, 2025. doi: 10.1016/j.ctarc.2025.100945. PMID: 40414016. TP53 co-mutation associated with reduced osimertinib PFS. 23

work page doi:10.1016/j.ctarc.2025.100945 2025

[20] [20]

Improving reproducibility in machine learning research (a report from the NeurIPS 2019 reproducibility program).Journal of Machine Learning Research, 22(164):1–20, 2021

Joelle Pineau, Philippe Vincent-Lamarre, Koustuv Sinha, Vincent Larivière, Alina Beygelzimer, Florence d’Alché Buc, Emily Fox, and Hugo Larochelle. Improving reproducibility in machine learning research (a report from the NeurIPS 2019 reproducibility program).Journal of Machine Learning Research, 22(164):1–20, 2021. 24

2019