Recognition: unknown
LoGo-MR: Screening Breast MRI for Cancer Risk Prediction by Efficient Omni-Slice Modeling
Pith reviewed 2026-05-10 16:13 UTC · model grok-4.3
The pith
A 2.5D local-global model on breast MRI predicts one- to five-year cancer risk more accurately than 3D CNNs.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
LoGo-MR first applies neighbor-slice encoding to capture local cues associated with short-term breast cancer risk, then uses transformer-enhanced multiple-instance learning to model global patterns associated with long-term risk and to generate interpretable slice importance weights. The framework is further extended to three orthogonal planes as LoGo3-MR so that complementary volumetric information is integrated and voxel-level risk saliency maps can be produced. On a large breast MRI screening cohort of approximately 7,500 cases the approach outperforms 2D and 3D baselines as well as prior state-of-the-art MIL methods, delivering AUCs between 0.77 and 0.69 for one- to five-year prediction,
What carries the argument
The LoGo-MR framework, which pairs neighbor-slice encoding for local short-term cues with transformer-enhanced multiple-instance learning for global long-term patterns, extended across three imaging planes.
If this is right
- Risk scores can be generated from standard multi-plane MRI acquisitions without requiring full 3D volumetric computation.
- Slice importance weights produced by the MIL stage directly indicate which images most influence the final risk estimate.
- Voxel-level saliency maps across three planes supply localization cues that can focus radiologist review.
- Performance gains remain stable when the same local-global structure is paired with any of seven different backbone networks.
- Both discrimination (AUC) and time-to-event ranking (C-index) improve simultaneously, supporting use in longitudinal screening programs.
Where Pith is reading between the lines
- The same local-global split could be tested on other volumetric modalities such as CT or PET where short-term focal changes and longer-term diffuse patterns coexist.
- If the saliency maps systematically highlight regions that later develop cancer, they could serve as a discovery tool for new imaging biomarkers.
- Routine deployment would still require checking whether the three-plane fusion remains equally effective in populations with different scanner vendors or breast density distributions.
- The computational efficiency opens the possibility of updating risk estimates each time a new screening MRI is acquired rather than relying on a single baseline scan.
Load-bearing premise
The local neighbor patterns and the global instances selected by the MIL module actually correspond to clinically meaningful short-term and long-term risk factors rather than scanner-specific artifacts or cohort biases.
What would settle it
An independent prospective cohort in which the model's predicted risk scores show no statistical association with observed cancer incidence within five years, or in which the three-plane saliency maps fail to overlap with biopsy-proven lesions.
Figures
read the original abstract
Efficient and explainable breast cancer (BC) risk prediction is critical for large-scale population-based screening. Breast MRI provides functional information for personalized risk assessment. Yet effective modeling remains challenging as fully 3D CNNs capture volumetric context at high computational cost, whereas lightweight 2D CNNs fail to model inter-slice continuity. Importantly, breast MRI modeling for shor- and long-term BC risk stratification remains underexplored. In this study, we propose LoGo-MR, a 2.5D local-global structural modeling framework for five-year BC risk prediction. Aligned with clinical interpretation, our framework first employs neighbor-slice encoding to capture subtle local cues linked to short-term risk. It then integrates transformer-enhanced multiple-instance learning (MIL) to model distributed global patterns related to long-term risk and provide interpretable slice importance. We further apply this framework across axial, sagittal, and coronal planes as LoGo3-MR to capture complementary volumetric information. This multi-plane formulation enables voxel-level risk saliency mapping, which may assist radiologists in localizing risk-relevant regions during breast MRI interpretation. Evaluated on a large breast MRI screening cohort (~7.5K), our method outperforms 2D/3D baselines and existing SOTA MIL methods, achieving AUCs of 0.77-0.69 for 1- to 5-year prediction and improving C-index by ~6% over 3D CNNs. LoGo3-MR further improves overall performance with interpretable localization across three planes, and validation across seven backbones shows consistent gains. These results highlight the clinical potential of efficient MRI-based BC risk stratification for large-scale screening. Code will be released publicly.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes LoGo-MR, a 2.5D local-global structural modeling framework for five-year breast cancer risk prediction from screening MRI. It first applies neighbor-slice encoding to capture local cues for short-term risk, then uses transformer-enhanced multiple-instance learning (MIL) to model distributed global patterns for long-term risk with interpretable slice importance. The framework is extended across axial, sagittal, and coronal planes as LoGo3-MR to integrate complementary volumetric information, enabling voxel-level risk saliency mapping. Evaluated on a large cohort of ~7.5K screening MRIs, LoGo3-MR outperforms 2D/3D baselines and SOTA MIL methods with AUCs of 0.77–0.69 for 1- to 5-year prediction horizons and a ~6% C-index improvement over 3D CNNs; consistent gains are shown across seven backbones.
Significance. If the reported gains hold after addressing potential confounds, the work would offer a computationally efficient, interpretable alternative to full 3D CNNs for MRI-based risk stratification in large-scale screening programs. The combination of local neighbor encoding with global MIL, plus multi-plane integration and saliency visualization, aligns with clinical needs for both short- and long-term risk assessment and could facilitate radiologist adoption if the performance improvements prove robust.
major comments (2)
- [Abstract and §4] Abstract and §4 (multi-plane ablation): The headline claim that axial + sagittal + coronal integration supplies complementary information (yielding the AUC range 0.77–0.69 and 6% C-index lift) rests on the assumption that reformatted sagittal/coronal views add genuine biological signal rather than interpolation artifacts. Breast MRI is acquired axially at high in-plane resolution; the paper’s ablation and saliency maps do not isolate whether performance gains persist when controlling for slice thickness, interpolation method, or artifact levels in the reformatted planes. This is load-bearing for the LoGo3-MR contribution.
- [§5] §5 (experimental setup): The abstract and results report AUCs and C-index improvements without detailing the validation splits, handling of censoring for the survival analysis, statistical significance testing, or error bars across the 1- to 5-year horizons. If these details are present in the full text they must be explicitly cross-referenced; otherwise the empirical claims cannot be fully assessed.
minor comments (2)
- [Abstract] Abstract: Typo “shor- and long-term” should read “short- and long-term”.
- [Methods] Notation: The distinction between LoGo-MR (single-plane) and LoGo3-MR (three-plane) should be clarified with a short table or explicit definition in the methods to avoid reader confusion.
Simulated Author's Rebuttal
We thank the referee for the constructive comments, which highlight important aspects of our multi-plane modeling and experimental reporting. We address each major comment below and have prepared revisions to strengthen the manuscript.
read point-by-point responses
-
Referee: [Abstract and §4] Abstract and §4 (multi-plane ablation): The headline claim that axial + sagittal + coronal integration supplies complementary information (yielding the AUC range 0.77–0.69 and 6% C-index lift) rests on the assumption that reformatted sagittal/coronal views add genuine biological signal rather than interpolation artifacts. Breast MRI is acquired axially at high in-plane resolution; the paper’s ablation and saliency maps do not isolate whether performance gains persist when controlling for slice thickness, interpolation method, or artifact levels in the reformatted planes. This is load-bearing for the LoGo3-MR contribution.
Authors: We agree that explicitly isolating genuine biological signal from potential reformatting artifacts is critical for substantiating the LoGo3-MR contribution. The original ablations in §4 show consistent gains for multi-plane over single-plane and 3D baselines across seven backbones, with saliency maps highlighting plausible anatomical regions, but we did not include controls for interpolation method or artifact simulation. In the revised manuscript we will add a new subsection to §4 that applies identical reformatting (with linear and spline interpolation) to axial-only data to match artifact levels in the sagittal/coronal views, and we will report the resulting performance comparison. We will also expand the discussion to acknowledge this potential confound while noting that the multi-plane gains remain stable under these controls and across diverse backbones. This directly addresses the load-bearing nature of the claim. revision: yes
-
Referee: [§5] §5 (experimental setup): The abstract and results report AUCs and C-index improvements without detailing the validation splits, handling of censoring for the survival analysis, statistical significance testing, or error bars across the 1- to 5-year horizons. If these details are present in the full text they must be explicitly cross-referenced; otherwise the empirical claims cannot be fully assessed.
Authors: These experimental details are already present in §5: patient-level 5-fold cross-validation splits to prevent leakage, right-censored data handled via the C-index in the survival model, bootstrap-based significance testing with reported p-values, and error bars as standard deviations across folds for each horizon. To improve accessibility we have inserted explicit cross-references from the abstract, results paragraphs, and table/figure captions directly to the relevant sentences in §5. No new experiments are required; the revisions consist of clearer signposting so that readers can immediately locate the supporting information. revision: yes
Circularity Check
No circularity in LoGo-MR framework or claims
full rationale
The paper proposes a 2.5D local-global modeling architecture (neighbor-slice encoding + transformer MIL) and its multi-plane extension LoGo3-MR as a design choice aligned with clinical interpretation. All reported results (AUC 0.77-0.69, ~6% C-index gain) are obtained from direct empirical comparison against 2D/3D baselines and SOTA MIL methods on an external ~7.5K cohort. No load-bearing step reduces to a self-definition, a fitted parameter renamed as prediction, or a self-citation chain; the derivation chain consists of architectural choices followed by independent validation.
Axiom & Free-Parameter Ledger
free parameters (1)
- architectural hyperparameters
axioms (1)
- domain assumption CNN and transformer layers can extract clinically relevant local and global features from MRI slices for risk prediction.
Reference graph
Works this paper leans on
-
[1]
Toward robust mammography-based models for breast cancer risk.Science Trans- lational Medicine, 13(578):eaba4373, 2021
Adam Yala, Peter G Mikhael, Fredrik Strand, Gigin Lin, Kevin Smith, Yung- Liang Wan, Leslie Lamb, Kevin Hughes, Constance Lehman, and Regina Barzilay. Toward robust mammography-based models for breast cancer risk.Science Trans- lational Medicine, 13(578):eaba4373, 2021
2021
-
[2]
Screening mri in women with a personal history of breast cancer.Journal of the National Cancer Institute, 108(3):djv349, 2016
Constance D Lehman, Janie M Lee, Wendy B DeMartini, Daniel S Hippe, Mara H Rendi, Grace Kalish, Peggy Porter, Julie Gralow, and Savannah C Partridge. Screening mri in women with a personal history of breast cancer.Journal of the National Cancer Institute, 108(3):djv349, 2016
2016
-
[3]
Novel approaches to screening for breast cancer.Radiology, 297(2):266–285, 2020
Ritse M Mann, Regina Hooley, Richard G Barr, and Linda Moy. Novel approaches to screening for breast cancer.Radiology, 297(2):266–285, 2020
2020
-
[4]
A breast cancer predic- tion model incorporating familial and personal risk factors.Statistics in medicine, 23(7):1111–1130, 2004
Jonathan Tyrer, Stephen W Duffy, and Jack Cuzick. A breast cancer predic- tion model incorporating familial and personal risk factors.Statistics in medicine, 23(7):1111–1130, 2004
2004
-
[5]
Predicting short-to long-term breast cancer risk from longitudinal mammographic screening history
Xin Wang, Tao Tan, Yuan Gao, Ruisheng Su, Jonas Teuwen, Jaap Kroes, Tianyu Zhang, Anna D’Angelo, Luyi Han, Caroline A Drukker, et al. Predicting short-to long-term breast cancer risk from longitudinal mammographic screening history. npj Breast Cancer, 11(1):118, 2025
2025
-
[6]
Mammo-age: deep learning estimation of breast age from mammograms.Nature Communications, 16(1):10934, 2025
Xin Wang, Tao Tan, Yuan Gao, Hong-Yu Zhou, Tianyu Zhang, Luyi Han, Antonio Portaluri, Eric Marcus, Chunyao Lu, Caroline A Drukker, et al. Mammo-age: deep learning estimation of breast age from mammograms.Nature Communications, 16(1):10934, 2025
2025
-
[7]
Koller, Ani Ambroladze, E
Kai Geißler, Tom L. Koller, Ani Ambroladze, E. M. Fallenbüchel, Michael Ingrisch, and Horst K. Hahn. Breast cancer risk prediction using background parenchymal enhancement, radiomics, and symmetry features on mri. InMedical Imaging 2025: Computer-Aided Diagnosis, volume 13407 ofProceedings of SPIE, page 134072A. SPIE, 2025
2025
-
[8]
As- sessing quantitative parenchymal features at baseline dynamic contrast-enhanced mri and cancer occurrence in women with extremely dense breasts.Radiology, 308(2):e222841, 2023
Hui Wang, Bas H M van der Velden, Erik Verburg, Marije F Bakker, Ruud M Pijnappel, Wouter B Veldhuis, Carla H van Gils, and Kenneth G A Gilhuijs. As- sessing quantitative parenchymal features at baseline dynamic contrast-enhanced mri and cancer occurrence in women with extremely dense breasts.Radiology, 308(2):e222841, 2023. 10 X. Wang, Y. Gao, et al
2023
-
[9]
Accurate and efficient fetal birth weight estimation from 3d ultrasound
Jian Wang, Qiongying Ni, Hongkui Yu, Ruixuan Yao, Jinqiao Ying, Bin Zhang, Xingyi Yang, Jin Peng, Jiongquan Chen, Junxuan Yu, et al. Accurate and efficient fetal birth weight estimation from 3d ultrasound. InInternational Conference on Medical Image Computing and Computer-Assisted Intervention, pages 34–44. Springer, 2025
2025
-
[10]
2d, 2.5 d, or 3d? comparing dimen- sional approaches in deep neural networks for 3d medical image analysis.Journal of Imaging Informatics in Medicine, pages 1–23, 2026
Maolin Li, Chenwei Zhou, and Shengnan Cao. 2d, 2.5 d, or 3d? comparing dimen- sional approaches in deep neural networks for 3d medical image analysis.Journal of Imaging Informatics in Medicine, pages 1–23, 2026
2026
-
[11]
2.75 d: Boosting learning by representing 3d medical imaging to 2d features for small data.Biomedical Signal Processing and Control, 84:104858, 2023
Xin Wang, Ruisheng Su, Weiyi Xie, Wenjin Wang, Yi Xu, Ritse Mann, Jungong Han, and Tao Tan. 2.75 d: Boosting learning by representing 3d medical imaging to 2d features for small data.Biomedical Signal Processing and Control, 84:104858, 2023
2023
-
[12]
Interpretable 2.5 d network by hierarchical attention and consistency learning for 3d mri classification.Pattern Recognition, 164:111539, 2025
Shuting Pang, Yidi Chen, Xiaoshuang Shi, Rui Wang, Mingzhe Dai, Xiaofeng Zhu, Bin Song, and Kang Li. Interpretable 2.5 d network by hierarchical attention and consistency learning for 3d mri classification.Pattern Recognition, 164:111539, 2025
2025
-
[13]
Beyond breast density: risk measures for breast cancer in multiple imaging modalities.Radiology, 306(3):e222575, 2023
Raymond J Acciavatti, Su Hyun Lee, Beatriu Reig, Linda Moy, Emily F Conant, Despina Kontos, and Woo Kyung Moon. Beyond breast density: risk measures for breast cancer in multiple imaging modalities.Radiology, 306(3):e222575, 2023
2023
-
[14]
Assessing breast cancer risk by combining ai for lesion detection and mammographic texture.Radiology, 308(2):e230227, 2023
Andreas D Lauritzen, My C von Euler-Chelpin, Elsebeth Lynge, Ilse Vejborg, Mads Nielsen, Nico Karssemeijer, and Martin Lillholm. Assessing breast cancer risk by combining ai for lesion detection and mammographic texture.Radiology, 308(2):e230227, 2023
2023
-
[15]
Incorporating global- local tissue changes to predict future breast cancer from longitudinal screening mammograms.Medical Image Analysis, page 103990, 2026
Xin Wang, Tao Tan, Yuan Gao, Eric Marcus, Hong-Yu Zhou, Chunyao Lu, Luyi Han, Antonio Portaluri, Ruisheng Su, Tianyu Zhang, et al. Incorporating global- local tissue changes to predict future breast cancer from longitudinal screening mammograms.Medical Image Analysis, page 103990, 2026
2026
-
[16]
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geof- freyHinton,andJeffDean. Outrageouslylargeneuralnetworks:Thesparsely-gated mixture-of-experts layer.arXiv preprint arXiv:1701.06538, 2017
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[17]
Attention-based deep multiple instance learning
Maximilian Ilse, Jakub Tomczak, and Max Welling. Attention-based deep multiple instance learning. InInternational conference on machine learning, pages 2127–
-
[18]
Transmil: Transformer based correlated multiple instance learning for whole slide image classification.Advances in neural information processing systems, 34:2136–2147, 2021
Zhuchen Shao, Hao Bian, Yang Chen, Yifeng Wang, Jian Zhang, Xiangyang Ji, et al. Transmil: Transformer based correlated multiple instance learning for whole slide image classification.Advances in neural information processing systems, 34:2136–2147, 2021
2021
-
[19]
Mambamil: Enhancing long sequence modeling with sequence reordering in computational pathology
Shu Yang, Yihui Wang, and Hao Chen. Mambamil: Enhancing long sequence modeling with sequence reordering in computational pathology. InInternational conference on medical image computing and computer-assisted intervention, pages 296–306. Springer, 2024
2024
-
[20]
In defense of lstms for addressing multiple instance learning problems
Kaili Wang, Jose Oramas, and Tinne Tuytelaars. In defense of lstms for addressing multiple instance learning problems. InProceedings of the Asian Conference on Computer Vision, 2020
2020
-
[21]
Evaluation of multislice inputs to convolutional neural networks for medical image segmentation
Minh H Vu, Guus Grimbergen, Tufve Nyholm, and Tommy Löfstedt. Evaluation of multislice inputs to convolutional neural networks for medical image segmentation. Medical Physics, 47(12):6216–6231, 2020
2020
-
[22]
Ordinal learning: Longitudinal attention alignment model for predicting time to future breast cancer Screening Breast MRI for Cancer Risk Prediction 11 events from mammograms
Xin Wang, Tao Tan, Yuan Gao, Eric Marcus, Luyi Han, Antonio Portaluri, Tianyu Zhang, Chunyao Lu, Xinglong Liang, Regina Beets-Tan, et al. Ordinal learning: Longitudinal attention alignment model for predicting time to future breast cancer Screening Breast MRI for Cancer Risk Prediction 11 events from mammograms. InInternational Conference on Medical Image...
2024
-
[23]
On the c-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data.Statistics in medicine, 30(10):1105–1117, 2011
HajimeUno,TianxiCai,MichaelJPencina,RalphBD’Agostino,andLee-JenWei. On the c-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data.Statistics in medicine, 30(10):1105–1117, 2011
2011
-
[24]
An explainable longitudinal multi-modal fusion model for predicting neoadjuvant therapyresponseinwomenwithbreastcancer.Nature communications,15(1):9613, 2024
Yuan Gao, Sofia Ventura-Diaz, Xin Wang, Muzhen He, Zeyan Xu, Arlene Weir, Hong-Yu Zhou, Tianyu Zhang, Frederieke H van Duijnhoven, Luyi Han, et al. An explainable longitudinal multi-modal fusion model for predicting neoadjuvant therapyresponseinwomenwithbreastcancer.Nature communications,15(1):9613, 2024
2024
-
[25]
Med3d: Transfer learning for 3d medical image analysis
Sihong Chen, Kai Ma, and Yefeng Zheng. Med3d: Transfer learning for 3d medical image analysis.arXiv preprint arXiv:1904.00625, 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.