Automatic Calcium Scoring in Cardiac and Chest CT Using DenseRAUnet
Pith reviewed 2026-05-24 15:33 UTC · model grok-4.3
The pith
DenseRAUnet trained only on chest CT segments coronary calcium on both chest and cardiac CT with F1-score 0.75.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
DenseRAUnet integrates Dense U-Net, ResNet residual connections, and atrous convolutions into a 2.5D network trained exclusively on chest CT. It segments coronary calcium pixels on both chest and cardiac CT scans from a private dataset of 144 scans, reaching an F1-score of 0.75. The resulting calcium measures support cardiovascular disease risk prediction at 0.83 accuracy. A bootstrap-IoU loss balances the sparse foreground calcium class against background tissue.
What carries the argument
DenseRAUnet, a 2.5D convolutional network that fuses dense skip connections, residual blocks, and atrous convolutions, trained with a bootstrap-enhanced IoU loss to segment sparse calcium regions.
If this is right
- Routine chest CT scans can be used for CAC quantification and CVD risk assessment.
- No need for separate models or ECG-gated cardiac CT for calcium scoring.
- The architecture allows generalization from chest to cardiac CT without fine-tuning.
- Accurate segmentation supports reliable risk prediction at 0.83 level.
Where Pith is reading between the lines
- Extending the approach to multi-vendor public datasets could validate broader clinical deployment.
- The loss function design may apply to other medical segmentation tasks with severe class imbalance.
- 2.5D processing could reduce computational demands for similar 3D medical imaging problems.
Load-bearing premise
The private dataset of 144 scans captures the full range of calcium appearances, patient demographics, and scanner characteristics found in real-world clinical use.
What would settle it
Evaluating the model on a larger, publicly available dataset of chest and cardiac CT scans acquired from different scanners and patient groups would reveal if the reported F1-score and accuracy hold.
Figures
read the original abstract
Cardiovascular disease (CVD) is a common and strong threat to human beings, featuring high prevalence, disability and mortality. The amount of coronary artery calcification (CAC) is an effective factor for CVD risk evaluation. Conventionally, CAC is quantified using ECG-synchronized cardiac CT but rarely from general chest CT scans. However, compared with ECG-synchronized cardiac CT, chest CT is more prevalent and economical in clinical practice. To address this, we propose an automatic method based on Dense U-Net to segment coronary calcium pixels on both types of CT scans. Our contribution is two-fold. First, we propose a novel network called DenseRAUnet, which takes advantage of Dense U-net, ResNet and atrous convolutions. We prove the robustness and generalizability of our model by training it exclusively on chest CT while test on both types of CT scans. Second, we design a loss function combining bootstrap with IoU function to balance foreground and background classes. DenseRAUnet is trained in a 2.5D fashion and tested on a private dataset consisting of 144 scans. Results show an F1-score of 0.75, with 0.83 accuracy of predicting cardiovascular disease risk.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes DenseRAUnet, a 2.5D network combining Dense U-Net, ResNet blocks, and atrous convolutions, for segmenting coronary artery calcium on CT. Trained exclusively on chest CT scans using a bootstrap-IoU loss, the model is evaluated on a private dataset of 144 scans (both chest and cardiac CT) and reports an F1-score of 0.75 for calcium segmentation together with 0.83 accuracy for CVD risk prediction.
Significance. If the reported performance holds under proper validation, the work would be significant because it targets CAC quantification from the far more prevalent and economical chest CT rather than ECG-gated cardiac CT. The architectural combination and class-imbalance loss are sensible design choices that could be reusable; however, the absence of any public benchmark, external validation, or baseline comparison prevents assessment of whether the numbers advance the state of the art.
major comments (3)
- [Abstract] Abstract: the central performance claims (F1 = 0.75, risk-prediction accuracy = 0.83) are presented without any baseline method, without cross-validation procedure, without error bars or confidence intervals, and without stating how the 144 scans were partitioned or selected. These omissions make the numerical results impossible to interpret or reproduce and directly undermine the generalizability assertion.
- [Methods / Dataset] Dataset description (Methods): no information is supplied on scanner vendors, reconstruction kernels, slice thickness, patient demographics, or the proportion of chest versus cardiac cases within the 144-scan private set. Because the central claim is that training on chest CT alone yields robust performance on both modalities, the lack of acquisition-parameter diversity documentation is load-bearing.
- [Results / Evaluation] Evaluation protocol: the paper states that the model was “tested on both types of CT scans” yet provides neither a per-modality breakdown of the F1 score nor any statistical test comparing chest-only versus mixed test performance. Without this disaggregation the robustness claim cannot be verified.
minor comments (2)
- [Abstract / Introduction] The abstract and introduction contain several minor grammatical issues and inconsistent capitalization of “DenseRAUnet” versus “Dense U-Net.”
- [Figures] Figure captions and axis labels should explicitly state the number of test cases and whether the displayed examples are from chest or cardiac CT.
Simulated Author's Rebuttal
We thank the referee for the constructive comments. We address each major comment below and indicate the revisions planned for the manuscript.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central performance claims (F1 = 0.75, risk-prediction accuracy = 0.83) are presented without any baseline method, without cross-validation procedure, without error bars or confidence intervals, and without stating how the 144 scans were partitioned or selected. These omissions make the numerical results impossible to interpret or reproduce and directly undermine the generalizability assertion.
Authors: We agree additional context is needed for interpretability. In revision we will expand the abstract to note that the model was trained exclusively on chest CT and evaluated on a private set of 144 scans (both modalities) using a single random train-test split. Bootstrap-derived confidence intervals will be added to the reported metrics in the results. Cross-validation was omitted to maximize training data given the limited private set size. No direct baseline comparisons were performed in the original submission; we will add discussion of related methods but note that public implementations for this exact task were unavailable at submission time. revision: partial
-
Referee: [Methods / Dataset] Dataset description (Methods): no information is supplied on scanner vendors, reconstruction kernels, slice thickness, patient demographics, or the proportion of chest versus cardiac cases within the 144-scan private set. Because the central claim is that training on chest CT alone yields robust performance on both modalities, the lack of acquisition-parameter diversity documentation is load-bearing.
Authors: We will revise the Methods section to document all recorded acquisition parameters (scanner vendors, kernels, slice thickness), patient demographics, and the precise proportion of chest versus cardiac cases within the 144-scan private set. This will directly support the cross-modality robustness claim. revision: yes
-
Referee: [Results / Evaluation] Evaluation protocol: the paper states that the model was “tested on both types of CT scans” yet provides neither a per-modality breakdown of the F1 score nor any statistical test comparing chest-only versus mixed test performance. Without this disaggregation the robustness claim cannot be verified.
Authors: We will add a per-modality F1-score breakdown in the revised Results section. Statistical comparisons (e.g., appropriate tests for the subset sizes) between chest-CT and cardiac-CT performance will also be reported to allow verification of the robustness claim. revision: yes
Circularity Check
No circularity: empirical ML metrics on held-out private data with no derivations
full rationale
The paper reports direct empirical results from training DenseRAUnet on chest CT and evaluating F1-score and risk-prediction accuracy on a private 144-scan test set. No equations, parameter-fitting steps, self-citations, or derivations are described that could reduce claims to inputs by construction. The generalizability statement is an empirical claim about the chosen dataset split, not a mathematical reduction. This is the standard non-circular outcome for a methods/results paper without theoretical derivations.
Axiom & Free-Parameter Ledger
free parameters (2)
- network weights and hyperparameters
- bootstrap sampling ratio in loss
axioms (1)
- domain assumption 2.5D slice stacking captures adequate 3D spatial context for accurate calcium segmentation
Reference graph
Works this paper leans on
-
[1]
John A Rumberger, Bruce H Brundage, Daniel J Rader, and George Kondos. Electron beam computed tomographic coronary calcium scanning: a review and guidelines for use in asymptomatic persons. In Mayo Clinic Proceedings, volume 74, pages 243–252. Elsevier, 1999
work page 1999
-
[2]
Felix Durlak, Michael Wels, Chris Schwemmer, Michael S¨ uhling, Stefan Steidl, and Andreas Maier. Growing a random forest with fuzzy spatial features for fully automatic artery-specific coronary calcium scoring. In International Workshop on Machine Learning in Medical Imaging , pages 27–35. Springer, 2017
work page 2017
-
[3]
Automatic coronary calcium scoring in low-dose chest computed tomography
Ivana Isgum, Mathias Prokop, Meindert Niemeijer, Max A Viergever, and Bram Van Ginneken. Automatic coronary calcium scoring in low-dose chest computed tomography. IEEE transactions on medical imaging , 31(12):2322–2334, 2012. 8 Jiechao Ma and Rongguo Zhang
work page 2012
-
[4]
A supervised classification-based method for coronary calcium de- tection in non-contrast ct
Uday Kurkure, Deepak R Chittajallu, Gerd Brunner, Yen H Le, and Ioannis A Kakadiaris. A supervised classification-based method for coronary calcium de- tection in non-contrast ct. The international journal of cardiovascular imaging , 26(7):817–828, 2010
work page 2010
-
[5]
Vessel specific coronary artery calcium scoring: an automatic system
Rahil Shahzad, Theo van Walsum, Michiel Schaap, Alexia Rossi, Stefan Klein, Annick C Weustink, Pim J de Feyter, Lucas J van Vliet, and Wiro J Niessen. Vessel specific coronary artery calcium scoring: an automatic system. Academic radiology, 20(1):1–9, 2013
work page 2013
-
[6]
Jelmer M Wolterink, Tim Leiner, Richard AP Takx, Max A Viergever, and Ivana Iˇ sgum. An automatic machine learning system for coronary calcium scoring in clinical non-contrast enhanced, ecg-triggered cardiac ct. In Medical Imaging 2014: Computer-Aided Diagnosis , volume 9035, page 90350E. International Society for Optics and Photonics, 2014
work page 2014
-
[7]
Automatic coronary calcium scoring in cardiac ct angiography using convolutional neural net- works
Jelmer M Wolterink, Tim Leiner, Max A Viergever, and Ivana Iˇ sgum. Automatic coronary calcium scoring in cardiac ct angiography using convolutional neural net- works. In International Conference on Medical Image Computing and Computer- Assisted Intervention, pages 589–596. Springer, 2015
work page 2015
-
[8]
Automatic calcium scoring in low-dose chest ct using deep neural networks with dilated convolutions
Nikolas Lessmann, Bram van Ginneken, Majd Zreik, Pim A de Jong, Bob D de Vos, Max A Viergever, and Ivana Iˇ sgum. Automatic calcium scoring in low-dose chest ct using deep neural networks with dilated convolutions. IEEE transactions on medical imaging, 37(2):615–625, 2018
work page 2018
-
[9]
Jelmer M Wolterink, Tim Leiner, Bob D de Vos, Robbert W van Hamersvelt, Max A Viergever, and Ivana Iˇ sgum. Automatic coronary artery calcium scoring in cardiac ct angiography using paired convolutional neural networks. Medical image analysis, 34:123–136, 2016
work page 2016
-
[10]
Ran Shadmi, Victoria Mazo, Orna Bregman-Amitai, and Eldad Elnekave. Fully- convolutional deep-learning based system for coronary calcium score prediction from non-contrast chest ct. In 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018) , pages 24–28. IEEE, 2018
work page 2018
-
[11]
Densely connected convolutional networks
Gao Huang, Zhuang Liu, Laurens Van Der Maaten, and Kilian Q Weinberger. Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition , pages 4700–4708, 2017
work page 2017
-
[12]
Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L Yuille. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence , 40(4):834–848, 2018
work page 2018
-
[13]
Going deeper with convolutions
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition , pages 1–9, 2015
work page 2015
-
[14]
Concurrent spatial and channel squeeze & excitationin fully convolutional networks
Abhijit Guha Roy, Nassir Navab, and Christian Wachinger. Concurrent spatial and channel squeeze & excitationin fully convolutional networks. In International Con- ference on Medical Image Computing and Computer-Assisted Intervention , pages 421–429. Springer, 2018
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.