CNN-based Survival Model for Pancreatic Ductal Adenocarcinoma in Medical Imaging
Pith reviewed 2026-05-25 15:38 UTC · model grok-4.3
The pith
A convolutional neural network survival model using preoperative CT images outperforms traditional Cox proportional hazards radiomics models by 22% in concordance index for pancreatic ductal adenocarcinoma patients.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Using transfer learning, a convolutional neural network based survival model was built and tested on preoperative CT images of resectable Pancreatic Ductal Adenocarcinoma patients. The proposed CNN-based survival model outperformed the traditional CPH-based radiomics approach in terms of concordance index by 22%, providing a better fit for patients' survival patterns. The proposed CNN-based survival model outperforms CPH-based radiomics pipeline in PDAC prognosis. This approach offers a better fit for survival patterns based on CT images and overcomes the limitations of conventional survival models.
What carries the argument
Transfer learning CNN for direct survival ranking from CT images, replacing radiomic feature extraction and CPH modeling.
If this is right
- CNN provides better fit for survival patterns from CT images.
- Overcomes linear assumption and multicollinearity issues in CPH models.
- Improved prognostic performance for PDAC using preoperative imaging.
Where Pith is reading between the lines
- The method could be applied to other cancers with available CT data for survival prediction.
- Further testing on external validation sets is implied to ensure the model is not overfit.
- This could lead to revised clinical guidelines for using imaging in survival estimation.
Load-bearing premise
The CNN trained via transfer learning on the available preoperative CT cohort produces a generalizable ranking of survival times without overfitting or dataset-specific artifacts.
What would settle it
Performance evaluation on an independent external validation cohort of PDAC patients from different institutions showing the concordance index improvement does not hold would falsify the superiority claim.
read the original abstract
Cox proportional hazard model (CPH) is commonly used in clinical research for survival analysis. In quantitative medical imaging (radiomics) studies, CPH plays an important role in feature reduction and modeling. However, the underlying linear assumption of CPH model limits the prognostic performance. In addition, the multicollinearity of radiomic features and multiple testing problem further impedes the CPH models performance. In this work, using transfer learning, a convolutional neural network (CNN) based survival model was built and tested on preoperative CT images of resectable Pancreatic Ductal Adenocarcinoma (PDAC) patients. The proposed CNN-based survival model outperformed the traditional CPH-based radiomics approach in terms of concordance index by 22%, providing a better fit for patients' survival patterns. The proposed CNN-based survival model outperforms CPH-based radiomics pipeline in PDAC prognosis. This approach offers a better fit for survival patterns based on CT images and overcomes the limitations of conventional survival models.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a CNN-based survival model trained via transfer learning on preoperative CT images of resectable PDAC patients. It claims this model outperforms a traditional CPH-based radiomics pipeline by 22% in concordance index, addressing the linear assumption, multicollinearity, and multiple-testing limitations of CPH while providing a better fit to survival patterns.
Significance. If the performance gain is shown to be robust under proper held-out validation on a sufficiently large cohort, the work would demonstrate that deep learning can usefully relax the restrictive assumptions of CPH in radiomics survival modeling, offering a practical alternative for PDAC prognosis from imaging data.
major comments (3)
- [Abstract] Abstract: the headline claim of a 22% C-index improvement is presented without any report of cohort size (n), cross-validation protocol, confidence intervals, or statistical testing; without these quantities the result cannot be evaluated for stability or generalizability.
- [Abstract] Abstract and Methods (implied): the description of how the CNN output is converted into a survival ranking (e.g., whether a Cox partial-likelihood loss, ranking loss, or discrete-time hazard is used) is absent, leaving the precise modeling innovation unspecified.
- [Abstract] Abstract: no comparison to non-radiomics baselines (e.g., clinical variables alone or standard image CNNs without radiomics) is supplied, so it is unclear whether the reported gain is attributable to the CNN architecture or simply to the use of imaging features at all.
minor comments (1)
- [Abstract] Abstract: the phrase 'providing a better fit for patients' survival patterns' is vague; a quantitative statement of calibration or time-dependent AUC would be more informative.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive comments on our manuscript. We provide point-by-point responses to the major comments below. We will revise the abstract to address the first two points.
read point-by-point responses
-
Referee: [Abstract] Abstract: the headline claim of a 22% C-index improvement is presented without any report of cohort size (n), cross-validation protocol, confidence intervals, or statistical testing; without these quantities the result cannot be evaluated for stability or generalizability.
Authors: We agree with the referee that the abstract should provide these essential details to allow proper evaluation of the results. Although the full manuscript describes the cohort size, cross-validation protocol, and includes confidence intervals and statistical comparisons in the Results section, we will revise the abstract to concisely include this information for improved clarity and to highlight the robustness of the findings. revision: yes
-
Referee: [Abstract] Abstract and Methods (implied): the description of how the CNN output is converted into a survival ranking (e.g., whether a Cox partial-likelihood loss, ranking loss, or discrete-time hazard is used) is absent, leaving the precise modeling innovation unspecified.
Authors: We acknowledge that the abstract does not specify the exact survival modeling approach used with the CNN output. We will update the abstract to briefly describe the method by which the CNN produces survival rankings, including the loss function or ranking approach employed. This is elaborated in the Methods section of the manuscript. revision: yes
-
Referee: [Abstract] Abstract: no comparison to non-radiomics baselines (e.g., clinical variables alone or standard image CNNs without radiomics) is supplied, so it is unclear whether the reported gain is attributable to the CNN architecture or simply to the use of imaging features at all.
Authors: The comparison presented is between the proposed CNN-based survival model and a traditional CPH-based radiomics pipeline, both of which utilize features derived from the preoperative CT images. This design isolates the effect of the modeling approach (CNN vs. CPH with radiomics) rather than the use of imaging versus non-imaging data. Therefore, the 22% improvement can be attributed to the CNN architecture relaxing the assumptions of CPH. We do not believe additional non-radiomics baselines are necessary to support this specific claim, though we can add a discussion clarifying this point if the editor deems it helpful. revision: no
Circularity Check
No circularity: empirical head-to-head comparison on imaging data
full rationale
The paper reports an empirical performance comparison between a transfer-learned CNN survival model and a CPH radiomics baseline on preoperative CT scans for PDAC. The 22% C-index gain is a measured metric on the cohort; no equations, ansatzes, or uniqueness theorems are present that could reduce the reported result to a fitted parameter or self-citation by construction. The central claim rests on data-driven evaluation rather than any self-referential derivation chain.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Input images have dimensions of 1401401 (grey scale), which contain the CT images within the manual contours of the tumors (example shown in Figure 2). All convolutional layers have kernel size of 33 with 32 filters following by Batch Normalization layers (BN). The first Max Pool layer has pool size of 2×2, and the latter two Max Pool layers have pool ...
- [2]
-
[3]
Katzman, J. et al. DeepSurv: Personalized Treatment Recommender System Using A Cox Proportional Hazards Deep Neural Network. (2016). doi:10.1186/s12874-018-0482-1
-
[4]
Gensheimer, M. F. & Narasimhan, B. A Scalable Discrete-Time Survival Model for Neural Networks
-
[5]
Ching, T., Zhu, X. & Garmire, L. X. Cox-nnet: An artificial neural network method for prognosis prediction of high-throughput omics data. PLoS Comput. Biol. 14, e1006076 (2018)
work page 2018
-
[6]
Cox, D. R. Regression Models and Life-Tables. Journal of the Royal Statistical Society. Series B (Methodological) 34, 187–220 (1972)
work page 1972
-
[7]
Katzman, J. L. et al. DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC Med. Res. Methodol. 18, 24 (2018)
work page 2018
- [8]
-
[9]
Hearst, M. A., Dumais, S. T., Osman, E., Platt, J. & Scholkopf, B. Support vector machines. IEEE Intell. Syst. 13, 18–28 (1998)
work page 1998
-
[10]
Adamska, A., Domenichini, A. & Falasca, M. Pancreatic Ductal Adenocarcinoma: Current and Evolving Therapies. Int. J. Mol. Sci. 18, (2017)
work page 2017
-
[11]
Foucher, E. D. et al. Pancreatic Ductal Adenocarcinoma: A Strong Imbalance of Good and Bad Immunological Cops in the Tumor Microenvironment. Front. Immunol. 9, 1044 (2018)
work page 2018
-
[12]
Stark, A. P. et al. Long-term survival in patients with pancreatic ductal adenocarcinoma. Surgery 159, 1520–1527 (2016)
work page 2016
-
[13]
Mariani, L. et al. Prognostic factors for metachronous contralateral breast cancer: a comparison of the linear Cox regression model and its artificial neural network extension. Breast Cancer Res. Treat. 44, 167–78 (1997)
work page 1997
- [14]
-
[15]
Kumar, V. et al. Radiomics: the process and the challenges. Magn. Reson. Imaging 30, 1234–1248 (2012)
work page 2012
-
[16]
Keek, S. A., Leijenaar, R. T., Jochems, A. & Woodruff, H. C. A review on radiomics and the future of theranostics for patient selection in precision medicine. Br. J. Radiol. 91, 20170926 (2018)
work page 2018
-
[17]
Zhang, Y., Oikonomou, A., Wong, A., Haider, M. A. & Khalvati, F. Radiomics-based Prognosis Analysis for Non-Small Cell Lung Cancer. Nat. Sci. Reports 7, (2017)
work page 2017
-
[18]
Aerts, H. J. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun 5, 4006 (2014)
work page 2014
-
[19]
Lambin, P. et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur. J. Cancer 48, 441–446 (2012)
work page 2012
-
[20]
Aerts, H. J. W. L. The Potential of Radiomic-Based Phenotyping in Precision Medicine. JAMA Oncol. 2, 1636 (2016)
work page 2016
-
[21]
van Griethuysen, J. J. M. et al. Computational Radiomics System to Decode the Radiographic Phenotype. Cancer Res. 77, e104–e107 (2017)
work page 2017
-
[22]
Khalvati, F. et al. Prognostic Value of CT Radiomic Features in Resectable Pancreatic Ductal Adenocarcinoma. Sci. Rep. 9, 5449 (2019)
work page 2019
-
[23]
Huang, Y. et al. Radiomics Signature: A Potential Biomarker for the Prediction of Disease-Free Survival in Early-Stage (I or II) Non-Small Cell Lung Cancer. Radiology 152234 (2016). doi:10.1148/radiol.2016152234
-
[24]
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015)
work page 2015
-
[25]
Krizhevsky, A., Sutskever, I. & Hinton, G. E. ImageNet Classification with Deep Convolutional Neural Networks. 1097–1105 (2012)
work page 2012
-
[26]
Shin, H.-C. et al. Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning. IEEE Trans. Med. Imaging 35, 1285–1298 (2016)
work page 2016
-
[27]
Tajbakhsh, N. et al. Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning? IEEE Trans. Med. Imaging 35, 1299–1312 (2017)
work page 2017
-
[28]
Yasaka, K., Akai, H., Abe, O. & Kiryu, S. Deep Learning with Convolutional Neural Network for Differentiation of Liver Masses at Dynamic Contrast-enhanced CT: A Preliminary Study. Radiology 286, 887–896 (2018)
work page 2018
-
[29]
Yamashita, R., Nishio, M., Do, R. K. G. & Togashi, K. Convolutional neural networks: an overview and application in radiology. Insights Imaging 9, 611–629 (2018)
work page 2018
-
[30]
Aerts, H. J. W. L. et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 5, 4006 (2014)
work page 2014
-
[31]
C., Steck, H., Krishnapuram, B., Dehing-Oberije, C
Raykar, V. C., Steck, H., Krishnapuram, B., Dehing-Oberije, C. & Lambin, P. On Ranking in Survival Analysis: Bounds on the Concordance Index
-
[32]
Haibe-Kains, B. et al. survcomp: a package for performance assessment and comparison for survival analysis. (2019)
work page 2019
-
[33]
Schmid, M., Wright, M. N. & Ziegler, A. On the use of Harrell’s C for clinical risk prediction via random survival forests. (2016)
work page 2016
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.