Modeling Local, Global, and Cross-Modal Context in Multimodal 3D MRI

Benjamin Wild; Minh Duc Do; Noel Kronenberg; Roland Eils; Tillmann Rheude

arxiv: 2606.26894 · v1 · pith:43BX5SA3new · submitted 2026-06-25 · 💻 cs.CV

Modeling Local, Global, and Cross-Modal Context in Multimodal 3D MRI

Minh Duc Do , Tillmann Rheude , Noel Kronenberg , Roland Eils , Benjamin Wild This is my paper

Pith reviewed 2026-06-26 05:26 UTC · model grok-4.3

classification 💻 cs.CV

keywords multimodal brain MRI3D vision transformercross-modal attentionbrain age predictionintra-modal contextneuroimagingmedical image analysis

0 comments

The pith

MICViT uses four attention mechanisms to model both intra-modal and cross-modal local and global contexts in 3D multimodal brain MRI.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Brain MRI studies face high-dimensional 3D data across multiple modalities but are limited by small sample sizes. MICViT addresses this by adding modality-specific local and global attention for features within each scan type alongside cross-modal local and global attention to link information across modalities such as T1 and FLAIR. Evaluated on brain age prediction using three datasets totaling over 43,000 scans, the model outperforms CNN and transformer baselines and shows larger gains precisely when more modalities are included. The results indicate that direct modeling of these interactions improves representation learning from complementary MRI signals.

Core claim

MICViT is a 3D vision transformer that combines modality-specific local attention, modality-specific global attention, cross-modal local attention, and cross-modal global attention to learn both within-modality representations and between-modality interactions, producing stronger performance on brain age prediction that increases with the number of input modalities.

What carries the argument

The four attention mechanisms in MICViT that separately process modality-specific local/global contexts and cross-modal local/global contexts.

If this is right

Adding modalities produces larger accuracy improvements on brain age prediction than in baseline models.
Explicit separation of intra- and cross-modal attention outperforms standard multimodal fusion approaches in 3D settings.
Representation learning benefits when local and global scales are modeled for both within-modality and between-modality interactions.
Neuroimaging models can better exploit complementary information from heterogeneous MRI acquisitions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same split-attention design could transfer to other 3D multimodal medical imaging tasks such as combined CT-MRI analysis.
Clinical pipelines might achieve higher accuracy by routinely including more MRI contrasts without requiring proportionally larger training sets.
Training protocols could be simplified if cross-modal attention reduces the need for deeper or wider networks to capture modality interactions.

Load-bearing premise

Larger gains from adding modalities are caused by the cross-modal attention rather than increased model capacity or uncontrolled training or dataset factors.

What would settle it

An ablation that removes or replaces the cross-modal attention layers while matching total parameters, training procedure, and data shows the multimodal performance advantage disappears.

Figures

Figures reproduced from arXiv: 2606.26894 by Benjamin Wild, Minh Duc Do, Noel Kronenberg, Roland Eils, Tillmann Rheude.

**Figure 1.** Figure 1: Scaling behavior on UKBB as a function of model size using MRI modalities (T1, FLAIR, SWI, DWI). MICViT consistently achieves the lowest error across datasets. Late fusion variants marked with ‡ Recent advances in transformer-based architectures have demonstrated strong potential for modeling longrange dependencies in medical imaging [4]. In particular, Vision Transformers (ViTs) enable flexible contex… view at source ↗

**Figure 2.** Figure 2: Architecture of the proposed MICViT. Multiple imaging modalities ( [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Illustration of the four attention types operating on latent 3D representations: (a) SL per [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: Scaling with model capacity and input resolution. Full-resolution MRI (2253 ) reduces MAE across model sizes relative to downsampled inputs (643 ). Consistent scaling gains from model capacity and input resolution Finally, we analyze scaling behavior across model capacity and input resolution. We show that MICViT maintains superior performance across all scaling configurations, from small to large models… view at source ↗

**Figure 5.** Figure 5: Grad-CAM [63] visualizations for SL, SG, CL, and CG attention across T1, FLAIR, SWI, and DWI, averaged over participants in the UKBB test set. Local and global variants highlight complementary spatial detail and distributed multimodal context. produces largely uniform activation across the brain, indicating aggregation of distributed multimodal information, consistent with the view that brain aging manif… view at source ↗

read the original abstract

Brain MRI poses a fundamental challenge for machine learning: models must learn from high-dimensional 3D data spanning multiple co-registered modalities, despite the limited sample sizes typical of neuroimaging studies relative to the diversity in anatomy, pathology, and acquisition conditions. While multimodal imaging provides complementary information critical for clinical interpretation, effectively integrating these signals remains difficult. We propose Multimodal Intra- and Cross-Context Vision Transformer (MICViT), a 3D vision transformer that explicitly models both modality-specific representations and cross-modal interactions across local and global contexts. Concretely, MICViT combines four attention mechanisms: modality-specific local and global attention for intra-modal feature learning, and cross-modal local and global attention to capture interactions between modalities. We evaluate MICViT on brain age prediction across three heterogeneous datasets (UK Biobank, n=41,404; SOOP, n=1,062; Cam-CAN, n=613) using multiple MRI modalities (e.g. T1, FLAIR, DWI, SWI). MICViT consistently outperforms state-of-the-art CNN and transformer baselines in 3D settings. Notably, it benefits more strongly from multimodal inputs, yielding larger performance gains as additional modalities are incorporated. These results demonstrate that explicitly modeling intra- and cross-modal interactions is key to unlocking the full potential of multimodal brain MRI, highlighting a promising direction for representation learning in neuroimaging.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MICViT adds four explicit attention streams to a 3D transformer for multimodal MRI but the abstract supplies no ablations or numbers to show the cross-modal blocks are what drive the larger multimodal gains.

read the letter

The core point is that this paper introduces MICViT, a 3D vision transformer that splits attention into modality-specific local, modality-specific global, cross-modal local, and cross-modal global streams. It tests the model on brain age prediction with T1, FLAIR, DWI, and SWI across UK Biobank (41k subjects), SOOP, and Cam-CAN.

The work does a few things cleanly. It targets a real practical problem in neuroimaging where multiple contrasts exist but fusion is still crude. The evaluation uses three external datasets rather than a single small one, and the claim that performance improves more when modalities are added is at least stated clearly.

The soft spots sit right at the central claim. The abstract says the model benefits more strongly from multimodal inputs and that explicitly modeling intra- and cross-modal interactions is key, yet it gives no numerical results, no error bars, and no ablation that removes only the cross-modal terms while holding total capacity fixed. Without those controls it is impossible to separate the architecture from simple increases in parameters or changes in training. The stress-test concern therefore stands on the information provided.

This paper is for people already working on 3D transformers or multimodal fusion in medical imaging. A reader who needs a concrete starting architecture for handling multiple MRI contrasts would find the four-stream design useful to try, even if they later modify it.

It deserves a serious referee. The datasets are large enough and the task is standard, so the idea is worth checking in full. I would send it to review rather than desk reject, with the expectation that the authors supply the missing ablations and metrics.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes MICViT, a 3D vision transformer for multimodal brain MRI that combines modality-specific local/global attention with cross-modal local/global attention to model intra- and inter-modal context. It reports evaluation on brain-age prediction across UK Biobank (n=41,404), SOOP (n=1,062) and Cam-CAN (n=613) using T1, FLAIR, DWI and SWI, claiming consistent superiority over CNN and transformer baselines together with larger gains as more modalities are added.

Significance. If the performance advantages can be shown to arise specifically from the four explicit attention blocks rather than capacity or training differences, the work would supply concrete evidence that explicit intra- and cross-modal context modeling improves multimodal 3D representation learning in neuroimaging, a direction of clear practical interest given the small-sample, high-dimensional nature of the domain.

major comments (2)

[Abstract] Abstract: the central claim that MICViT 'benefits more strongly from multimodal inputs, yielding larger performance gains' is stated without any numerical deltas, error bars, baseline implementations, or ablation tables, so the magnitude and reliability of the reported multimodal scaling cannot be assessed from the provided text.
[Evaluation] Evaluation section (implied by the abstract's results paragraph): no ablation is described that removes only the cross-modal local and global attention blocks while keeping total parameter count, optimizer schedule and data augmentation identical; without such a capacity-matched control the attribution of larger multimodal gains to the cross-modal mechanisms remains unisolated.

minor comments (1)

[Abstract] Abstract: the phrase 'e.g. T1, FLAIR, DWI, SWI' leaves the exact modality combinations used in each reported experiment unspecified.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment below and outline the revisions we will make to improve the clarity and rigor of the claims.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that MICViT 'benefits more strongly from multimodal inputs, yielding larger performance gains' is stated without any numerical deltas, error bars, baseline implementations, or ablation tables, so the magnitude and reliability of the reported multimodal scaling cannot be assessed from the provided text.

Authors: We agree that the abstract's brevity makes the claim harder to evaluate in isolation. The full manuscript reports the supporting results (including MAE values with standard deviations, baseline comparisons, and modality scaling trends) in the evaluation section and associated tables. To address the concern directly, we will revise the abstract to incorporate specific numerical deltas and error bars drawn from those experiments while remaining within length limits. revision: yes
Referee: [Evaluation] Evaluation section (implied by the abstract's results paragraph): no ablation is described that removes only the cross-modal local and global attention blocks while keeping total parameter count, optimizer schedule and data augmentation identical; without such a capacity-matched control the attribution of larger multimodal gains to the cross-modal mechanisms remains unisolated.

Authors: This is a fair and important point for isolating the contribution of the cross-modal blocks. Our existing comparisons are against CNN and transformer baselines that lack these blocks, and we observe larger gains with additional modalities. However, a dedicated capacity-matched ablation that disables only the cross-modal local/global attention (while preserving parameter count and all training details) is not currently present. We will add this controlled ablation to the revised manuscript to strengthen the attribution. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical architecture proposal evaluated on external data

full rationale

The paper introduces MICViT as a 3D vision transformer with four explicit attention blocks (modality-specific local/global and cross-modal local/global) and reports empirical performance on brain age prediction across three independent external datasets (UK Biobank, SOOP, Cam-CAN). No derivation chain, equations, or fitted parameters are presented that reduce by construction to quantities defined from the same data or prior self-citations. The central claim rests on observed outperformance and multimodal scaling in held-out evaluations rather than any self-referential definition or prediction step. This is a standard empirical comparison with no load-bearing self-referential elements.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the empirical observation that the four-attention design yields larger gains with more modalities; this depends on the domain assumption that attention can capture the relevant interactions and on the representativeness of the three evaluation datasets. No free parameters beyond standard transformer training are introduced in the abstract, and no new physical or mathematical entities are postulated.

axioms (1)

domain assumption Attention mechanisms can effectively model both intra-modal and cross-modal interactions in 3D multimodal MRI volumes
This premise underpins the proposal that the four attention streams are the key to better performance.

pith-pipeline@v0.9.1-grok · 5794 in / 1340 out tokens · 25927 ms · 2026-06-26T05:26:20.075246+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

67 extracted references · 37 canonical work pages

[1]

Gillies, Paul E

Robert J. Gillies, Paul E. Kinahan, and Hedvig Hricak. Radiomics: Images Are More than Pictures, They Are Data.Radiology, 278(2):563–577, February 2016. ISSN 0033-8419. doi: 10.1148/radiol.2015151169. URL https://pubs.rsna.org/doi/10.1148/radiol. 2015151169

work page doi:10.1148/radiol.2015151169 2016
[2]

Mark Haacke, Robert W

E. Mark Haacke, Robert W. Brown, Michael R. Thompson, and Ramesh Venkatesan, editors. Magnetic resonance imaging: physical principles and sequence design. Wiley-Liss, New York, NY , 1999. ISBN 978-0-471-35128-3

1999
[3]

Performance reserves in brain-imaging-based phenotype prediction.Cell Reports, 43 (1):113597, January 2024

Marc-Andre Schulz, Danilo Bzdok, Stefan Haufe, John-Dylan Haynes, and Kerstin Rit- ter. Performance reserves in brain-imaging-based phenotype prediction.Cell Reports, 43 (1):113597, January 2024. ISSN 22111247. doi: 10.1016/j.celrep.2023.113597. URL https://linkinghub.elsevier.com/retrieve/pii/S2211124723016091

work page doi:10.1016/j.celrep.2023.113597 2024
[4]

Yuille, and Yuyin Zhou

Jieneng Chen, Yongyi Lu, Qihang Yu, Xiangde Luo, Ehsan Adeli, Yan Wang, Le Lu, Alan L. Yuille, and Yuyin Zhou. TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation, February 2021. URL http://arxiv.org/abs/2102.04306. arXiv:2102.04306 [cs]

Pith/arXiv arXiv 2021
[5]

Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review.Journal of Medical Systems, 48(1):84, September 2024

Satoshi Takahashi, Yusuke Sakaguchi, Nobuji Kouno, Ken Takasawa, Kenichi Ishizu, Yu Akagi, Rina Aoyama, Naoki Teraya, Amina Bolatkan, Norio Shinkai, Hidenori Machino, Kazuma Kobayashi, Ken Asada, Masaaki Komatsu, Syuzo Kaneko, Masashi Sugiyama, and Ryuji Hamamoto. Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysi...

work page doi:10.1007/s10916-024-02105-8 2024
[6]

Shih-Cheng Huang, Anuj Pareek, Saeed Seyyedi, Imon Banerjee, and Matthew P. Lungren. Fusion of medical imaging and electronic health records using deep learning: a systematic review and implementation guidelines.npj Digital Medicine, 3(1):136, October 2020. ISSN 2398-

2020
[7]

URL https://www.nature.com/articles/ s41746-020-00341-z

doi: 10.1038/s41746-020-00341-z. URL https://www.nature.com/articles/ s41746-020-00341-z

work page doi:10.1038/s41746-020-00341-z
[8]

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows, August 2021

Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows, August 2021. URLhttp://arxiv.org/abs/2103.14030. arXiv:2103.14030 [cs]

Pith/arXiv arXiv 2021
[9]

Global Context Vision Transformers, June 2023

Ali Hatamizadeh, Hongxu Yin, Greg Heinrich, Jan Kautz, and Pavlo Molchanov. Global Context Vision Transformers, June 2023. URL http://arxiv.org/abs/2206.09959. arXiv:2206.09959 [cs]

arXiv 2023
[10]

Multi-view 3D Object Detection Network for Autonomous Driving

Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, and Tian Xia. Multi-view 3D Object Detection Network for Autonomous Driving. In2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 6526–6534, Honolulu, HI, July 2017. IEEE. ISBN 978-1-5386- 0457-1. doi: 10.1109/CVPR.2017.691. URL http://ieeexplore.ieee.org/document/ 8100174/

work page doi:10.1109/cvpr.2017.691 2017
[11]

Jiquan Ngiam, Aditya Khosla, Mingyu Kim, Juhan Nam, Honglak Lee, and Andrew Y . Ng. Multimodal deep learning. InProceedings of the 28th International Conference on International Conference on Machine Learning, ICML’11, pages 689–696, Madison, WI, USA, June 2011. Omnipress. ISBN 978-1-4503-0619-5. 10

2011
[12]

Multimodal Machine Learning: A Survey and Taxonomy, August 2017

Tadas Baltrušaitis, Chaitanya Ahuja, and Louis-Philippe Morency. Multimodal Machine Learning: A Survey and Taxonomy, August 2017. URL http://arxiv.org/abs/1705. 09406. arXiv:1705.09406 [cs]

Pith/arXiv arXiv 2017
[13]

Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Chal- lenges, and Future Directions, December 2024

Kai Sun, Siyan Xue, Fuchun Sun, Haoran Sun, Yu Luo, Ling Wang, Siyuan Wang, Na Guo, Lei Liu, Tian Zhao, Xinzhou Wang, Lei Yang, Shuo Jin, Jun Yan, and Jiahong Dong. Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Chal- lenges, and Future Directions, December 2024. URL http://arxiv.org/abs/2412.02621. arXiv:2412.02621 [cs]

arXiv 2024
[14]

Multimodal Fusion of Brain Imaging Data: Methods and Applications.Machine Intelligence Research, 21(1): 136–152, February 2024

Na Luo, Weiyang Shi, Zhengyi Yang, Ming Song, and Tianzi Jiang. Multimodal Fusion of Brain Imaging Data: Methods and Applications.Machine Intelligence Research, 21(1): 136–152, February 2024. ISSN 2731-5398. doi: 10.1007/s11633-023-1442-8. URL https: //doi.org/10.1007/s11633-023-1442-8

work page doi:10.1007/s11633-023-1442-8 2024
[15]

A review of deep learning-based information fusion techniques for multimodal medical image classification, April 2024

Yihao Li, Mostafa El Habib Daho, Pierre-Henri Conze, Rachid Zeghlache, Hugo Le Boité, Ramin Tadayoni, Béatrice Cochener, Mathieu Lamard, and Gwenolé Quellec. A review of deep learning-based information fusion techniques for multimodal medical image classification, April 2024. URLhttp://arxiv.org/abs/2404.15022. arXiv:2404.15022 [cs]

arXiv 2024
[16]

Huntenburg, Leonie Lampe, Mehdi Rahim, Alexandre Abraham, R

Franziskus Liem, Gaël Varoquaux, Jana Kynast, Frauke Beyer, Shahrzad Kharabian Masouleh, Julia M. Huntenburg, Leonie Lampe, Mehdi Rahim, Alexandre Abraham, R. Cameron Craddock, Steffi Riedel-Heller, Tobias Luck, Markus Loeffler, Matthias L. Schroeter, Anja Veronica Witte, Arno Villringer, and Daniel S. Margulies. Predicting brain-age from multimodal imagi...

work page doi:10.1016/j.neuroimage.2016.11.005 2017
[17]

Quintana, Dag Alnaes, Genevieve Richard, Ann-Marie G

Jaroslav Rokicki, Thomas Wolfers, Wibeke Nordhøy, Natalia Tesli, Daniel S. Quintana, Dag Alnaes, Genevieve Richard, Ann-Marie G. de Lange, Martina J. Lund, Linn Norbom, Ingrid Agartz, Ingrid Melle, Terje Naerland, Geir Selbaek, Karin Persson, Jan Egil Nordvik, Emanuel Schwarz, Ole A. Andreassen, Tobias Kaufmann, and Lars T. Westlye. Multimodal imaging imp...

work page doi:10.1002/hbm.25323 2021
[18]

Jirsaraie, Aaron J

Robert J. Jirsaraie, Aaron J. Gorelik, Martins M. Gatavins, Denis A. Engemann, Ryan Bogdan, Deanna M. Barch, and Aristeidis Sotiras. A systematic review of multimodal brain age studies: Uncovering a divergence between model accuracy and utility.Patterns, 4(4):100712, April
[19]

doi: 10.1016/j.patter.2023.100712

ISSN 2666-3899. doi: 10.1016/j.patter.2023.100712

work page doi:10.1016/j.patter.2023.100712 2023
[20]

Moona Mazher, Geoff J. M. Parker, and Daniel C. Alexander. Towards Generalisable Foun- dation Models for Brain MRI, December 2025. URL http://arxiv.org/abs/2510.23415. arXiv:2510.23415 [cs]

arXiv 2025
[21]

Multimodal MRI V olumetric Data Fusion With Convolutional Neural Networks.IEEE Transactions on Instrumentation and Measurement, 71:1–15, 2022

Yu Liu, Yu Shi, Fuhao Mu, Juan Cheng, Chang Li, and Xun Chen. Multimodal MRI V olumetric Data Fusion With Convolutional Neural Networks.IEEE Transactions on Instrumentation and Measurement, 71:1–15, 2022. ISSN 1557-9662. doi: 10.1109/TIM.2022.3184360. URL https://ieeexplore.ieee.org/document/9800914

work page doi:10.1109/tim.2022.3184360 2022
[22]

Garomsa, Anna Zapaishchykova, Tafadzwa L

Divyanshu Tak, Biniam A. Garomsa, Anna Zapaishchykova, Tafadzwa L. Chaunzwa, Juan Car- los Climent Pardo, Zezhong Ye, John Zielke, Yashwanth Ravipati, Suraj Pai, Sri Vajapeyam, Maryam Mahootiha, Mitchell Parker, Luke R. G. Pike, Ceilidh Smith, Ariana M. Familiar, Kevin X. Liu, Sanjay Prabhu, Omar Arnaout, Pratiti Bandopadhayay, Ali Nabavizadeh, Sabine Mue...

work page doi:10.1038/s41593-026-02202-6 2026
[23]

van Gemert

Osman Semih Kayhan and Jan C. van Gemert. On Translation Invariance in CNNs: Convolu- tional Layers can Exploit Absolute Spatial Location, May 2020. URL http://arxiv.org/ abs/2003.07064. arXiv:2003.07064 [cs]. 11

arXiv 2020
[24]

Swart, George Obaido, Matt Jordan, and Philip Ilono

Ibomoiye Domor Mienye, Theo G. Swart, George Obaido, Matt Jordan, and Philip Ilono. Deep Convolutional Neural Networks in Medical Image Analysis: A Review.Information, 16(3):195, March 2025. ISSN 2078-2489. doi: 10.3390/info16030195. URL https://www.mdpi.com/ 2078-2489/16/3/195

work page doi:10.3390/info16030195 2025
[25]

Lam, Greg Ver Steeg, and Paul M

Umang Gupta, Pradeep K. Lam, Greg Ver Steeg, and Paul M. Thompson. Improved Brain Age Estimation with Slice-based Set Networks, February 2021. URL http://arxiv.org/abs/ 2102.04438. arXiv:2102.04438 [eess]

arXiv 2021
[26]

MRI signatures of brain age and disease over the lifespan based on a deep brain network and 14 468 individuals worldwide

Vishnu M Bashyam, Guray Erus, Jimit Doshi, Mohamad Habes, Ilya M Nasrallah, Monica Truelove-Hill, Dhivya Srinivasan, Liz Mamourian, Raymond Pomponio, Yong Fan, Lenore J Launer, Colin L Masters, Paul Maruff, Chuanjun Zhuo, Henry Völzke, Sterling C Johnson, Jurgen Fripp, Nikolaos Koutsouleris, Theodore D Satterthwaite, Daniel Wolf, Raquel E Gur, Ruben C Gur...

work page doi:10.1093/brain/awaa160 2020
[27]

Paul Hamilton, and Anders Eklund

Johan Jönemo, Muhammad Usman Akbar, Robin Kämpe, J. Paul Hamilton, and Anders Eklund. Efficient Brain Age Prediction from 3D MRI V olumes Using 2D Projections.Brain Sciences, 13(9):1329, September 2023. ISSN 2076-3425. doi: 10.3390/brainsci13091329. URL https://www.mdpi.com/2076-3425/13/9/1329

work page doi:10.3390/brainsci13091329 2023
[28]

Beckmann, Andrea Vedaldi, and Stephen M

Han Peng, Weikang Gong, Christian F. Beckmann, Andrea Vedaldi, and Stephen M. Smith. Accurate brain age prediction with lightweight deep neural networks.Medical Image Analysis, 68:101871, February 2021. ISSN 13618415. doi: 10.1016/j.media.2020.101871. URL https://linkinghub.elsevier.com/retrieve/pii/S1361841520302358

work page doi:10.1016/j.media.2020.101871 2021
[29]

Dinsdale, Emma Bluemke, Stephen M

Nicola K. Dinsdale, Emma Bluemke, Stephen M. Smith, Zobair Arya, Diego Vidaurre, Mark Jenkinson, and Ana I. L. Namburete. Learning patterns of the ageing brain in MRI using deep convolutional networks.NeuroImage, 224:117401, January 2021. ISSN 1053-8119. doi: 10.1016/j.neuroimage.2020.117401. URL https://www.sciencedirect.com/science/ article/pii/S1053811...

work page doi:10.1016/j.neuroimage.2020.117401 2021
[30]

Burkett, Hoon-Ki Min, Matthew L

Jeyeon Lee, Brian J. Burkett, Hoon-Ki Min, Matthew L. Senjem, Emily S. Lundt, Hugo Botha, Jonathan Graff-Radford, Leland R. Barnard, Jeffrey L. Gunter, Christopher G. Schwarz, Kejal Kantarci, David S. Knopman, Bradley F. Boeve, Val J. Lowe, Ronald C. Petersen, Clifford R. Jack, and David T. Jones. Deep learning-based brain age prediction in normal aging a...

2022
[31]

Leonardsen, Han Peng, Tobias Kaufmann, Ingrid Agartz, Ole A

Esten H. Leonardsen, Han Peng, Tobias Kaufmann, Ingrid Agartz, Ole A. Andreassen, Elisa- beth Gulowsen Celius, Thomas Espeseth, Hanne F. Harbo, Einar A. Høgestøl, Ann-Marie de Lange, Andre F. Marquand, Didac Vidal-Piñeiro, James M. Roe, Geir Selbæk, Øystein Sørensen, Stephen M. Smith, Lars T. Westlye, Thomas Wolfers, and Yunpeng Wang. Deep neural networks...

work page doi:10.1016/j.neuroimage.2022.119210 2022
[32]

Gollub, Shawn N

Sheng He, Diana Pereira, Juan David Perez, Randy L. Gollub, Shawn N. Murphy, Sanjay Prabhu, Rudolph Pienaar, Richard L. Robertson, P. Ellen Grant, and Yangming Ou. Multi- channel attention-fusion neural network for brain age estimation: Accuracy, generality, and interpretation with 16,705 healthy MRIs across lifespan.Medical Image Analysis, 72:102091, Aug...

work page doi:10.1016/j.media.2021.102091 2021
[33]

Gomez, Lukasz Kaiser, and Illia Polosukhin

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention Is All You Need, August 2023. URL http: //arxiv.org/abs/1706.03762. arXiv:1706.03762 [cs]

Pith/arXiv arXiv 2023
[34]

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale, June 2021

Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, 12 Jakob Uszkoreit, and Neil Houlsby. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale, June 2021. URL http://arxiv.org/abs/2010.11929. arXiv:2010.11929 [cs]

Pith/arXiv arXiv 2021
[35]

Landman, and S

Jun Li, Junyu Chen, Yucheng Tang, Ce Wang, Bennett A. Landman, and S. Kevin Zhou. Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives, June 2022. URL https://arxiv.org/abs/2206. 01136v3

2022
[36]

Ellen Grant, and Yangming Ou

Sheng He, P. Ellen Grant, and Yangming Ou. Global-Local Transformer for Brain Age Estimation.IEEE Transactions on Medical Imaging, 41(1):213–224, January 2022. ISSN 0278-0062, 1558-254X. doi: 10.1109/TMI.2021.3108910. URL https://ieeexplore.ieee. org/document/9525077/

work page doi:10.1109/tmi.2021.3108910 2022
[37]

Triamese-ViT: A 3D-Aware Method for Robust Brain Age Estimation from MRIs, January 2024

Zhaonian Zhang and Richard Jiang. Triamese-ViT: A 3D-Aware Method for Robust Brain Age Estimation from MRIs, January 2024. URL http://arxiv.org/abs/2401.09475. arXiv:2401.09475 [cs] version: 1

arXiv 2024
[38]

Transformer based multi-modal MRI fusion for prediction of post-menstrual age and neonatal brain development analysis.Medical Image Analysis, 94:103140, May 2024

Haiyan Zhao, Hongjie Cai, and Manhua Liu. Transformer based multi-modal MRI fusion for prediction of post-menstrual age and neonatal brain development analysis.Medical Image Analysis, 94:103140, May 2024. ISSN 13618415. doi: 10.1016/j.media.2024.103140. URL https://linkinghub.elsevier.com/retrieve/pii/S1361841524000653

work page doi:10.1016/j.media.2024.103140 2024
[39]

SwiFT: Swin 4D fMRI Transformer, October

Peter Yongho Kim, Junbeom Kwon, Sunghwan Joo, Sangyoon Bae, Donggyu Lee, Yoonho Jung, Shinjae Yoo, Jiook Cha, and Taesup Moon. SwiFT: Swin 4D fMRI Transformer, October
[40]

arXiv:2307.05916 [cs]

URLhttp://arxiv.org/abs/2307.05916. arXiv:2307.05916 [cs]

arXiv
[41]

Multimodal population brain imaging in the UK Biobank prospective epidemiological study.Nature neuroscience, 19(11):1523–1536, November 2016

Karla L Miller, Fidel Alfaro-Almagro, Neal K Bangerter, David L Thomas, Essa Yacoub, Junqian Xu, Andreas J Bartsch, Saad Jbabdi, Stamatios N Sotiropoulos, Jesper LR Andersson, Ludovica Griffanti, Gwenaëlle Douaud, Thomas W Okell, Peter Weale, Iulius Dragonu, Steve Garratt, Sarah Hudson, Rory Collins, Mark Jenkinson, Paul M Matthews, and Stephen M Smith. M...

work page doi:10.1038/nn.4393 2016
[42]

Littlejohns, Jo Holliday, Lorna M

Thomas J. Littlejohns, Jo Holliday, Lorna M. Gibson, Steve Garratt, Niels Oesingmann, Fidel Alfaro-Almagro, Jimmy D. Bell, Chris Boultwood, Rory Collins, Megan C. Conroy, Nicola Crabtree, Nicola Doherty, Alejandro F. Frangi, Nicholas C. Harvey, Paul Leeson, Karla L. Miller, Stefan Neubauer, Steffen E. Petersen, Jonathan Sellors, Simon Sheard, Stephen M. S...

work page doi:10.1038/s41467-020-15948-9 2020
[43]

Data11, 839, DOI: 10.1038/s41597-024-03667-5 (2024)

John Absher, Sarah Goncher, Roger Newman-Norlund, Nicholas Perkins, Grigori Yourganov, Jan Vargas, Sanjeev Sivakumar, Naveen Parti, Shannon Sternberg, Alex Teghipco, Makayla Gibson, Sarah Wilson, Leonardo Bonilha, and Chris Rorden. The stroke outcome optimization project: Acute ischemic strokes from a comprehensive stroke center.Scientific Data, 11 (1):83...

work page doi:10.1038/s41597-024-03667-5 2024
[44]

Meredith A Shafto, Lorraine K Tyler, Marie Dixon, Jason R Taylor, James B Rowe, Rho- dri Cusack, Andrew J Calder, William D Marslen-Wilson, John Duncan, Tim Dalgleish, Richard N Henson, Carol Brayne, and Fiona E Matthews. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) study protocol: a cross-sectional, lifespan, multidisciplinary examination o...

work page doi:10.1186/s12883-014-0204-1 2014
[45]

Taylor, Nitin Williams, Rhodri Cusack, Tibor Auer, Meredith A

Jason R. Taylor, Nitin Williams, Rhodri Cusack, Tibor Auer, Meredith A. Shafto, Marie Dixon, Lorraine K. Tyler, null Cam-Can, and Richard N. Henson. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and 13 cognitive data from a cross-sectional adult lifespan sample.NeuroImage, 144(Pt B):262–269...

work page doi:10.1016/j.neuroimage.2015.09.018 2017
[46]

Fusion or Confusion? Multimodal Com- plexity Is Not All You Need, 2025

Tillmann Rheude, Roland Eils, and Benjamin Wild. Fusion or Confusion? Multimodal Com- plexity Is Not All You Need, 2025. URL https://arxiv.org/abs/2512.22991. Version Number: 2

Pith/arXiv arXiv 2025
[47]

The Road Less Scheduled, October 2024

Aaron Defazio, Xingyu Alice Yang, Harsh Mehta, Konstantin Mishchenko, Ahmed Khaled, and Ashok Cutkosky. The Road Less Scheduled, October 2024. URL http://arxiv.org/abs/ 2405.15682. arXiv:2405.15682 [cs]

arXiv 2024
[48]

Surrogate Gap Minimization Improves Sharpness-Aware Training, March 2022

Juntang Zhuang, Boqing Gong, Liangzhe Yuan, Yin Cui, Hartwig Adam, Nicha Dvornek, Sekhar Tatikonda, James Duncan, and Ting Liu. Surrogate Gap Minimization Improves Sharpness-Aware Training, March 2022. URL http://arxiv.org/abs/2203.08065. arXiv:2203.08065 [cs]

arXiv 2022
[49]

Zico Kolter

Asher Trockman and J. Zico Kolter. Mimetic Initialization of Self-Attention Layers, May 2023. URLhttp://arxiv.org/abs/2305.09828. arXiv:2305.09828 [cs]

arXiv 2023
[50]

On Layer Normalization in the Transformer Architecture, June 2020

Ruibin Xiong, Yunchang Yang, Di He, Kai Zheng, Shuxin Zheng, Chen Xing, Huishuai Zhang, Yanyan Lan, Liwei Wang, and Tie-Yan Liu. On Layer Normalization in the Transformer Architecture, June 2020. URL http://arxiv.org/abs/2002.04745. arXiv:2002.04745 [cs]

arXiv 2020
[51]

A ConvNet for the 2020s, March 2022

Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, and Saining Xie. A ConvNet for the 2020s, March 2022. URL http://arxiv.org/abs/2201.03545. arXiv:2201.03545 [cs]

arXiv 2022
[52]

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design, July 2018

Ningning Ma, Xiangyu Zhang, Hai-Tao Zheng, and Jian Sun. ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design, July 2018. URL http://arxiv.org/abs/ 1807.11164. arXiv:1807.11164 [cs]

Pith/arXiv arXiv 2018
[53]

Fu, Stefano Ermon, Atri Rudra, and Christopher Ré

Tri Dao, Daniel Y . Fu, Stefano Ermon, Atri Rudra, and Christopher Ré. FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness, 2022. URL https://arxiv.org/ abs/2205.14135. Version Number: 2

Pith/arXiv arXiv 2022
[54]

Time Matters: Scaling Laws for Any Budget, October 2024

Itay Inbar and Luke Sernau. Time Matters: Scaling Laws for Any Budget, October 2024. URL http://arxiv.org/abs/2406.18922. arXiv:2406.18922 [cs]

arXiv 2024
[55]

Neuroimaging standards for research into small vessel disease and its contribution to ageing and neurodegeneration.Lancet Neurology, 12(8):822–838, August 2013

Joanna M Wardlaw, Eric E Smith, Geert J Biessels, Charlotte Cordonnier, Franz Fazekas, Richard Frayne, Richard I Lindley, John T O’Brien, Frederik Barkhof, Oscar R Benavente, Sandra E Black, Carol Brayne, Monique Breteler, Hugues Chabriat, Charles DeCarli, Frank- Erik de Leeuw, Fergus Doubal, Marco Duering, Nick C Fox, Steven Greenberg, Vladimir Hachinski...

work page doi:10.1016/s1474-4422(13)70124-8 2013
[56]

Charles DeCarli, Joseph Massaro, Danielle Harvey, John Hald, Mats Tullberg, Rhoda Au, Alexa Beiser, Ralph D’Agostino, and Philip A. Wolf. Measures of brain morphology and infarction in the framingham heart study: establishing what is normal.Neurobiology of Aging, 26(4): 491–510, April 2005. ISSN 0197-4580. doi: 10.1016/j.neurobiolaging.2004.05.004. URL ht...

work page doi:10.1016/j.neurobiolaging.2004.05.004 2005
[57]

Rodrigue, Kristen M

Naftali Raz, Ulman Lindenberger, Karen M. Rodrigue, Kristen M. Kennedy, Denise Head, Adrienne Williamson, Cheryl Dahle, Denis Gerstorf, and James D. Acker. Regional Brain Changes in Aging Healthy Adults: General Trends, Individual Differences and Modifiers. Cerebral Cortex, 15(11):1676–1689, November 2005. ISSN 1047-3211. doi: 10.1093/cercor/ bhi044. URLh...

work page doi:10.1093/cercor/ 2005
[58]

Cox, Stuart J

Simon R. Cox, Stuart J. Ritchie, Elliot M. Tucker-Drob, David C. Liewald, Saskia P. Hagenaars, Gail Davies, Joanna M. Wardlaw, Catharine R. Gale, Mark E. Bastin, and Ian J. Deary. Ageing and brain white matter structure in 3,513 UK Biobank participants.Nature Communications, 7(1):13629, December 2016. ISSN 2041-1723. doi: 10.1038/ncomms13629. URL https: /...

work page doi:10.1038/ncomms13629 2016
[59]

Electronic Com- merce Research and Applications73, 101527 (2025) https://doi.org/10.1016/j

Dani Beck, Ann-Marie G. de Lange, Ivan I. Maximov, Geneviève Richard, Ole A. Andreassen, Jan E. Nordvik, and Lars T. Westlye. White matter microstructure across the adult lifespan: A mixed longitudinal and cross-sectional study using advanced diffusion models and brain- age prediction.NeuroImage, 224:117441, January 2021. ISSN 1053-8119. doi: 10.1016/j. n...

work page doi:10.1016/j 2021
[60]

Age- Related Gray and White Matter Changes in Normal Adult Brains.Aging and Disease, 8(6): 899–909, December 2017

Farnaz Farokhian, Chunlan Yang, Iman Beheshti, Hiroshi Matsuda, and Shuicai Wu. Age- Related Gray and White Matter Changes in Normal Adult Brains.Aging and Disease, 8(6): 899–909, December 2017. ISSN 2152-5250. doi: 10.14336/AD.2017.0502. URL https: //pmc.ncbi.nlm.nih.gov/articles/PMC5758357/

work page doi:10.14336/ad.2017.0502 2017
[61]

Toledo, Tianhao Zhang, Nick Bryan, Lenore J

Mohamad Habes, Guray Erus, Jon B. Toledo, Tianhao Zhang, Nick Bryan, Lenore J. Launer, Yves Rosseel, Deborah Janowitz, Jimit Doshi, Sandra Van der Auwera, Bettina von Sarnowski, Katrin Hegenscheid, Norbert Hosten, Georg Homuth, Henry Völzke, Ulf Schminke, Wolfgang Hoffmann, Hans J. Grabe, and Christos Davatzikos. White matter hyperintensities and imaging ...

work page doi:10.1093/brain/aww008 2016
[62]

Cortina, Nikkita Khattar, Abinand C

Mustapha Bouhrara, Luis E. Cortina, Nikkita Khattar, Abinand C. Rejimon, Samuel Ajamu, Defne S. Cezayirli, and Richard G. Spencer. Maturation and degeneration of the human brainstem across the adult lifespan.Aging, 13(11):14862–14891, June 2021. ISSN 1945-4589. doi: 10.18632/aging.203183. URL https://www.aging-us.com/lookup/doi/10.18632/ aging.203183

work page doi:10.18632/aging.203183 2021
[63]

Morphological changes in the cerebellum during aging: evidence from convolutional neural networks and shape analysis.Frontiers in Ag- ing Neuroscience, 16, April 2024

Yu Wang, Ye Teng, Tianci Liu, Yuchun Tang, Wenjia Liang, Wenjun Wang, Zhuoran Li, Qing Xia, Feifei Xu, and Shuwei Liu. Morphological changes in the cerebellum during aging: evidence from convolutional neural networks and shape analysis.Frontiers in Ag- ing Neuroscience, 16, April 2024. ISSN 1663-4365. doi: 10.3389/fnagi.2024.1359320. URL https://www.front...

work page doi:10.3389/fnagi.2024.1359320 2024
[64]

Multimodal quantitative MRI reveals age-related biophysical alterations in the human brain across the adult lifespan.NeuroIm- age, 327:121742, February 2026

Xiang Chen, Zhiyuan Yuan, Jie Zhang, and Xiao-Yong Zhang. Multimodal quantitative MRI reveals age-related biophysical alterations in the human brain across the adult lifespan.NeuroIm- age, 327:121742, February 2026. ISSN 1053-8119. doi: 10.1016/j.neuroimage.2026.121742. URLhttps://www.sciencedirect.com/science/article/pii/S1053811926000601

work page doi:10.1016/j.neuroimage.2026.121742 2026
[65]

Mapping heterogeneous region- and tissue-specific brain ageing patterns using quantitative MRI.Brain Communications, 8(1):fcag010, February

Xinjie Chen, Mario Ocampo-Pineda, Po-Jui Lu, Michelle G Jansen, Kwok-Shing Chan, Marcel Zwiers, Joukje M Oosterman, David G Norris, Andre F Marquand, Lester Melie-Garcia, Cristina Granziera, and José P Marques. Mapping heterogeneous region- and tissue-specific brain ageing patterns using quantitative MRI.Brain Communications, 8(1):fcag010, February
[66]

doi: 10.1093/braincomms/fcag010

ISSN 2632-1297. doi: 10.1093/braincomms/fcag010. URL https://doi.org/10. 1093/braincomms/fcag010

work page doi:10.1093/braincomms/fcag010
[67]

International Journal of Computer Vision , author =

Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. Grad-CAM: Visual Explanations from Deep Networks via Gradient- based Localization.International Journal of Computer Vision, 128(2):336–359, February 2020. ISSN 0920-5691, 1573-1405. doi: 10.1007/s11263-019-01228-7. URL http://arxiv.org/ abs/1610.0...

work page doi:10.1007/s11263-019-01228-7 2020

[1] [1]

Gillies, Paul E

Robert J. Gillies, Paul E. Kinahan, and Hedvig Hricak. Radiomics: Images Are More than Pictures, They Are Data.Radiology, 278(2):563–577, February 2016. ISSN 0033-8419. doi: 10.1148/radiol.2015151169. URL https://pubs.rsna.org/doi/10.1148/radiol. 2015151169

work page doi:10.1148/radiol.2015151169 2016

[2] [2]

Mark Haacke, Robert W

E. Mark Haacke, Robert W. Brown, Michael R. Thompson, and Ramesh Venkatesan, editors. Magnetic resonance imaging: physical principles and sequence design. Wiley-Liss, New York, NY , 1999. ISBN 978-0-471-35128-3

1999

[3] [3]

Performance reserves in brain-imaging-based phenotype prediction.Cell Reports, 43 (1):113597, January 2024

Marc-Andre Schulz, Danilo Bzdok, Stefan Haufe, John-Dylan Haynes, and Kerstin Rit- ter. Performance reserves in brain-imaging-based phenotype prediction.Cell Reports, 43 (1):113597, January 2024. ISSN 22111247. doi: 10.1016/j.celrep.2023.113597. URL https://linkinghub.elsevier.com/retrieve/pii/S2211124723016091

work page doi:10.1016/j.celrep.2023.113597 2024

[4] [4]

Yuille, and Yuyin Zhou

Jieneng Chen, Yongyi Lu, Qihang Yu, Xiangde Luo, Ehsan Adeli, Yan Wang, Le Lu, Alan L. Yuille, and Yuyin Zhou. TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation, February 2021. URL http://arxiv.org/abs/2102.04306. arXiv:2102.04306 [cs]

Pith/arXiv arXiv 2021

[5] [5]

Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review.Journal of Medical Systems, 48(1):84, September 2024

Satoshi Takahashi, Yusuke Sakaguchi, Nobuji Kouno, Ken Takasawa, Kenichi Ishizu, Yu Akagi, Rina Aoyama, Naoki Teraya, Amina Bolatkan, Norio Shinkai, Hidenori Machino, Kazuma Kobayashi, Ken Asada, Masaaki Komatsu, Syuzo Kaneko, Masashi Sugiyama, and Ryuji Hamamoto. Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysi...

work page doi:10.1007/s10916-024-02105-8 2024

[6] [6]

Shih-Cheng Huang, Anuj Pareek, Saeed Seyyedi, Imon Banerjee, and Matthew P. Lungren. Fusion of medical imaging and electronic health records using deep learning: a systematic review and implementation guidelines.npj Digital Medicine, 3(1):136, October 2020. ISSN 2398-

2020

[7] [7]

URL https://www.nature.com/articles/ s41746-020-00341-z

doi: 10.1038/s41746-020-00341-z. URL https://www.nature.com/articles/ s41746-020-00341-z

work page doi:10.1038/s41746-020-00341-z

[8] [8]

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows, August 2021

Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows, August 2021. URLhttp://arxiv.org/abs/2103.14030. arXiv:2103.14030 [cs]

Pith/arXiv arXiv 2021

[9] [9]

Global Context Vision Transformers, June 2023

Ali Hatamizadeh, Hongxu Yin, Greg Heinrich, Jan Kautz, and Pavlo Molchanov. Global Context Vision Transformers, June 2023. URL http://arxiv.org/abs/2206.09959. arXiv:2206.09959 [cs]

arXiv 2023

[10] [10]

Multi-view 3D Object Detection Network for Autonomous Driving

Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, and Tian Xia. Multi-view 3D Object Detection Network for Autonomous Driving. In2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 6526–6534, Honolulu, HI, July 2017. IEEE. ISBN 978-1-5386- 0457-1. doi: 10.1109/CVPR.2017.691. URL http://ieeexplore.ieee.org/document/ 8100174/

work page doi:10.1109/cvpr.2017.691 2017

[11] [11]

Jiquan Ngiam, Aditya Khosla, Mingyu Kim, Juhan Nam, Honglak Lee, and Andrew Y . Ng. Multimodal deep learning. InProceedings of the 28th International Conference on International Conference on Machine Learning, ICML’11, pages 689–696, Madison, WI, USA, June 2011. Omnipress. ISBN 978-1-4503-0619-5. 10

2011

[12] [12]

Multimodal Machine Learning: A Survey and Taxonomy, August 2017

Tadas Baltrušaitis, Chaitanya Ahuja, and Louis-Philippe Morency. Multimodal Machine Learning: A Survey and Taxonomy, August 2017. URL http://arxiv.org/abs/1705. 09406. arXiv:1705.09406 [cs]

Pith/arXiv arXiv 2017

[13] [13]

Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Chal- lenges, and Future Directions, December 2024

Kai Sun, Siyan Xue, Fuchun Sun, Haoran Sun, Yu Luo, Ling Wang, Siyuan Wang, Na Guo, Lei Liu, Tian Zhao, Xinzhou Wang, Lei Yang, Shuo Jin, Jun Yan, and Jiahong Dong. Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Chal- lenges, and Future Directions, December 2024. URL http://arxiv.org/abs/2412.02621. arXiv:2412.02621 [cs]

arXiv 2024

[14] [14]

Multimodal Fusion of Brain Imaging Data: Methods and Applications.Machine Intelligence Research, 21(1): 136–152, February 2024

Na Luo, Weiyang Shi, Zhengyi Yang, Ming Song, and Tianzi Jiang. Multimodal Fusion of Brain Imaging Data: Methods and Applications.Machine Intelligence Research, 21(1): 136–152, February 2024. ISSN 2731-5398. doi: 10.1007/s11633-023-1442-8. URL https: //doi.org/10.1007/s11633-023-1442-8

work page doi:10.1007/s11633-023-1442-8 2024

[15] [15]

A review of deep learning-based information fusion techniques for multimodal medical image classification, April 2024

Yihao Li, Mostafa El Habib Daho, Pierre-Henri Conze, Rachid Zeghlache, Hugo Le Boité, Ramin Tadayoni, Béatrice Cochener, Mathieu Lamard, and Gwenolé Quellec. A review of deep learning-based information fusion techniques for multimodal medical image classification, April 2024. URLhttp://arxiv.org/abs/2404.15022. arXiv:2404.15022 [cs]

arXiv 2024

[16] [16]

Huntenburg, Leonie Lampe, Mehdi Rahim, Alexandre Abraham, R

Franziskus Liem, Gaël Varoquaux, Jana Kynast, Frauke Beyer, Shahrzad Kharabian Masouleh, Julia M. Huntenburg, Leonie Lampe, Mehdi Rahim, Alexandre Abraham, R. Cameron Craddock, Steffi Riedel-Heller, Tobias Luck, Markus Loeffler, Matthias L. Schroeter, Anja Veronica Witte, Arno Villringer, and Daniel S. Margulies. Predicting brain-age from multimodal imagi...

work page doi:10.1016/j.neuroimage.2016.11.005 2017

[17] [17]

Quintana, Dag Alnaes, Genevieve Richard, Ann-Marie G

Jaroslav Rokicki, Thomas Wolfers, Wibeke Nordhøy, Natalia Tesli, Daniel S. Quintana, Dag Alnaes, Genevieve Richard, Ann-Marie G. de Lange, Martina J. Lund, Linn Norbom, Ingrid Agartz, Ingrid Melle, Terje Naerland, Geir Selbaek, Karin Persson, Jan Egil Nordvik, Emanuel Schwarz, Ole A. Andreassen, Tobias Kaufmann, and Lars T. Westlye. Multimodal imaging imp...

work page doi:10.1002/hbm.25323 2021

[18] [18]

Jirsaraie, Aaron J

Robert J. Jirsaraie, Aaron J. Gorelik, Martins M. Gatavins, Denis A. Engemann, Ryan Bogdan, Deanna M. Barch, and Aristeidis Sotiras. A systematic review of multimodal brain age studies: Uncovering a divergence between model accuracy and utility.Patterns, 4(4):100712, April

[19] [19]

doi: 10.1016/j.patter.2023.100712

ISSN 2666-3899. doi: 10.1016/j.patter.2023.100712

work page doi:10.1016/j.patter.2023.100712 2023

[20] [20]

Moona Mazher, Geoff J. M. Parker, and Daniel C. Alexander. Towards Generalisable Foun- dation Models for Brain MRI, December 2025. URL http://arxiv.org/abs/2510.23415. arXiv:2510.23415 [cs]

arXiv 2025

[21] [21]

Multimodal MRI V olumetric Data Fusion With Convolutional Neural Networks.IEEE Transactions on Instrumentation and Measurement, 71:1–15, 2022

Yu Liu, Yu Shi, Fuhao Mu, Juan Cheng, Chang Li, and Xun Chen. Multimodal MRI V olumetric Data Fusion With Convolutional Neural Networks.IEEE Transactions on Instrumentation and Measurement, 71:1–15, 2022. ISSN 1557-9662. doi: 10.1109/TIM.2022.3184360. URL https://ieeexplore.ieee.org/document/9800914

work page doi:10.1109/tim.2022.3184360 2022

[22] [22]

Garomsa, Anna Zapaishchykova, Tafadzwa L

Divyanshu Tak, Biniam A. Garomsa, Anna Zapaishchykova, Tafadzwa L. Chaunzwa, Juan Car- los Climent Pardo, Zezhong Ye, John Zielke, Yashwanth Ravipati, Suraj Pai, Sri Vajapeyam, Maryam Mahootiha, Mitchell Parker, Luke R. G. Pike, Ceilidh Smith, Ariana M. Familiar, Kevin X. Liu, Sanjay Prabhu, Omar Arnaout, Pratiti Bandopadhayay, Ali Nabavizadeh, Sabine Mue...

work page doi:10.1038/s41593-026-02202-6 2026

[23] [23]

van Gemert

Osman Semih Kayhan and Jan C. van Gemert. On Translation Invariance in CNNs: Convolu- tional Layers can Exploit Absolute Spatial Location, May 2020. URL http://arxiv.org/ abs/2003.07064. arXiv:2003.07064 [cs]. 11

arXiv 2020

[24] [24]

Swart, George Obaido, Matt Jordan, and Philip Ilono

Ibomoiye Domor Mienye, Theo G. Swart, George Obaido, Matt Jordan, and Philip Ilono. Deep Convolutional Neural Networks in Medical Image Analysis: A Review.Information, 16(3):195, March 2025. ISSN 2078-2489. doi: 10.3390/info16030195. URL https://www.mdpi.com/ 2078-2489/16/3/195

work page doi:10.3390/info16030195 2025

[25] [25]

Lam, Greg Ver Steeg, and Paul M

Umang Gupta, Pradeep K. Lam, Greg Ver Steeg, and Paul M. Thompson. Improved Brain Age Estimation with Slice-based Set Networks, February 2021. URL http://arxiv.org/abs/ 2102.04438. arXiv:2102.04438 [eess]

arXiv 2021

[26] [26]

MRI signatures of brain age and disease over the lifespan based on a deep brain network and 14 468 individuals worldwide

Vishnu M Bashyam, Guray Erus, Jimit Doshi, Mohamad Habes, Ilya M Nasrallah, Monica Truelove-Hill, Dhivya Srinivasan, Liz Mamourian, Raymond Pomponio, Yong Fan, Lenore J Launer, Colin L Masters, Paul Maruff, Chuanjun Zhuo, Henry Völzke, Sterling C Johnson, Jurgen Fripp, Nikolaos Koutsouleris, Theodore D Satterthwaite, Daniel Wolf, Raquel E Gur, Ruben C Gur...

work page doi:10.1093/brain/awaa160 2020

[27] [27]

Paul Hamilton, and Anders Eklund

Johan Jönemo, Muhammad Usman Akbar, Robin Kämpe, J. Paul Hamilton, and Anders Eklund. Efficient Brain Age Prediction from 3D MRI V olumes Using 2D Projections.Brain Sciences, 13(9):1329, September 2023. ISSN 2076-3425. doi: 10.3390/brainsci13091329. URL https://www.mdpi.com/2076-3425/13/9/1329

work page doi:10.3390/brainsci13091329 2023

[28] [28]

Beckmann, Andrea Vedaldi, and Stephen M

Han Peng, Weikang Gong, Christian F. Beckmann, Andrea Vedaldi, and Stephen M. Smith. Accurate brain age prediction with lightweight deep neural networks.Medical Image Analysis, 68:101871, February 2021. ISSN 13618415. doi: 10.1016/j.media.2020.101871. URL https://linkinghub.elsevier.com/retrieve/pii/S1361841520302358

work page doi:10.1016/j.media.2020.101871 2021

[29] [29]

Dinsdale, Emma Bluemke, Stephen M

Nicola K. Dinsdale, Emma Bluemke, Stephen M. Smith, Zobair Arya, Diego Vidaurre, Mark Jenkinson, and Ana I. L. Namburete. Learning patterns of the ageing brain in MRI using deep convolutional networks.NeuroImage, 224:117401, January 2021. ISSN 1053-8119. doi: 10.1016/j.neuroimage.2020.117401. URL https://www.sciencedirect.com/science/ article/pii/S1053811...

work page doi:10.1016/j.neuroimage.2020.117401 2021

[30] [30]

Burkett, Hoon-Ki Min, Matthew L

Jeyeon Lee, Brian J. Burkett, Hoon-Ki Min, Matthew L. Senjem, Emily S. Lundt, Hugo Botha, Jonathan Graff-Radford, Leland R. Barnard, Jeffrey L. Gunter, Christopher G. Schwarz, Kejal Kantarci, David S. Knopman, Bradley F. Boeve, Val J. Lowe, Ronald C. Petersen, Clifford R. Jack, and David T. Jones. Deep learning-based brain age prediction in normal aging a...

2022

[31] [31]

Leonardsen, Han Peng, Tobias Kaufmann, Ingrid Agartz, Ole A

Esten H. Leonardsen, Han Peng, Tobias Kaufmann, Ingrid Agartz, Ole A. Andreassen, Elisa- beth Gulowsen Celius, Thomas Espeseth, Hanne F. Harbo, Einar A. Høgestøl, Ann-Marie de Lange, Andre F. Marquand, Didac Vidal-Piñeiro, James M. Roe, Geir Selbæk, Øystein Sørensen, Stephen M. Smith, Lars T. Westlye, Thomas Wolfers, and Yunpeng Wang. Deep neural networks...

work page doi:10.1016/j.neuroimage.2022.119210 2022

[32] [32]

Gollub, Shawn N

Sheng He, Diana Pereira, Juan David Perez, Randy L. Gollub, Shawn N. Murphy, Sanjay Prabhu, Rudolph Pienaar, Richard L. Robertson, P. Ellen Grant, and Yangming Ou. Multi- channel attention-fusion neural network for brain age estimation: Accuracy, generality, and interpretation with 16,705 healthy MRIs across lifespan.Medical Image Analysis, 72:102091, Aug...

work page doi:10.1016/j.media.2021.102091 2021

[33] [33]

Gomez, Lukasz Kaiser, and Illia Polosukhin

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention Is All You Need, August 2023. URL http: //arxiv.org/abs/1706.03762. arXiv:1706.03762 [cs]

Pith/arXiv arXiv 2023

[34] [34]

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale, June 2021

Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, 12 Jakob Uszkoreit, and Neil Houlsby. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale, June 2021. URL http://arxiv.org/abs/2010.11929. arXiv:2010.11929 [cs]

Pith/arXiv arXiv 2021

[35] [35]

Landman, and S

Jun Li, Junyu Chen, Yucheng Tang, Ce Wang, Bennett A. Landman, and S. Kevin Zhou. Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives, June 2022. URL https://arxiv.org/abs/2206. 01136v3

2022

[36] [36]

Ellen Grant, and Yangming Ou

Sheng He, P. Ellen Grant, and Yangming Ou. Global-Local Transformer for Brain Age Estimation.IEEE Transactions on Medical Imaging, 41(1):213–224, January 2022. ISSN 0278-0062, 1558-254X. doi: 10.1109/TMI.2021.3108910. URL https://ieeexplore.ieee. org/document/9525077/

work page doi:10.1109/tmi.2021.3108910 2022

[37] [37]

Triamese-ViT: A 3D-Aware Method for Robust Brain Age Estimation from MRIs, January 2024

Zhaonian Zhang and Richard Jiang. Triamese-ViT: A 3D-Aware Method for Robust Brain Age Estimation from MRIs, January 2024. URL http://arxiv.org/abs/2401.09475. arXiv:2401.09475 [cs] version: 1

arXiv 2024

[38] [38]

Transformer based multi-modal MRI fusion for prediction of post-menstrual age and neonatal brain development analysis.Medical Image Analysis, 94:103140, May 2024

Haiyan Zhao, Hongjie Cai, and Manhua Liu. Transformer based multi-modal MRI fusion for prediction of post-menstrual age and neonatal brain development analysis.Medical Image Analysis, 94:103140, May 2024. ISSN 13618415. doi: 10.1016/j.media.2024.103140. URL https://linkinghub.elsevier.com/retrieve/pii/S1361841524000653

work page doi:10.1016/j.media.2024.103140 2024

[39] [39]

SwiFT: Swin 4D fMRI Transformer, October

Peter Yongho Kim, Junbeom Kwon, Sunghwan Joo, Sangyoon Bae, Donggyu Lee, Yoonho Jung, Shinjae Yoo, Jiook Cha, and Taesup Moon. SwiFT: Swin 4D fMRI Transformer, October

[40] [40]

arXiv:2307.05916 [cs]

URLhttp://arxiv.org/abs/2307.05916. arXiv:2307.05916 [cs]

arXiv

[41] [41]

Multimodal population brain imaging in the UK Biobank prospective epidemiological study.Nature neuroscience, 19(11):1523–1536, November 2016

Karla L Miller, Fidel Alfaro-Almagro, Neal K Bangerter, David L Thomas, Essa Yacoub, Junqian Xu, Andreas J Bartsch, Saad Jbabdi, Stamatios N Sotiropoulos, Jesper LR Andersson, Ludovica Griffanti, Gwenaëlle Douaud, Thomas W Okell, Peter Weale, Iulius Dragonu, Steve Garratt, Sarah Hudson, Rory Collins, Mark Jenkinson, Paul M Matthews, and Stephen M Smith. M...

work page doi:10.1038/nn.4393 2016

[42] [42]

Littlejohns, Jo Holliday, Lorna M

Thomas J. Littlejohns, Jo Holliday, Lorna M. Gibson, Steve Garratt, Niels Oesingmann, Fidel Alfaro-Almagro, Jimmy D. Bell, Chris Boultwood, Rory Collins, Megan C. Conroy, Nicola Crabtree, Nicola Doherty, Alejandro F. Frangi, Nicholas C. Harvey, Paul Leeson, Karla L. Miller, Stefan Neubauer, Steffen E. Petersen, Jonathan Sellors, Simon Sheard, Stephen M. S...

work page doi:10.1038/s41467-020-15948-9 2020

[43] [43]

Data11, 839, DOI: 10.1038/s41597-024-03667-5 (2024)

John Absher, Sarah Goncher, Roger Newman-Norlund, Nicholas Perkins, Grigori Yourganov, Jan Vargas, Sanjeev Sivakumar, Naveen Parti, Shannon Sternberg, Alex Teghipco, Makayla Gibson, Sarah Wilson, Leonardo Bonilha, and Chris Rorden. The stroke outcome optimization project: Acute ischemic strokes from a comprehensive stroke center.Scientific Data, 11 (1):83...

work page doi:10.1038/s41597-024-03667-5 2024

[44] [44]

Meredith A Shafto, Lorraine K Tyler, Marie Dixon, Jason R Taylor, James B Rowe, Rho- dri Cusack, Andrew J Calder, William D Marslen-Wilson, John Duncan, Tim Dalgleish, Richard N Henson, Carol Brayne, and Fiona E Matthews. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) study protocol: a cross-sectional, lifespan, multidisciplinary examination o...

work page doi:10.1186/s12883-014-0204-1 2014

[45] [45]

Taylor, Nitin Williams, Rhodri Cusack, Tibor Auer, Meredith A

Jason R. Taylor, Nitin Williams, Rhodri Cusack, Tibor Auer, Meredith A. Shafto, Marie Dixon, Lorraine K. Tyler, null Cam-Can, and Richard N. Henson. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and 13 cognitive data from a cross-sectional adult lifespan sample.NeuroImage, 144(Pt B):262–269...

work page doi:10.1016/j.neuroimage.2015.09.018 2017

[46] [46]

Fusion or Confusion? Multimodal Com- plexity Is Not All You Need, 2025

Tillmann Rheude, Roland Eils, and Benjamin Wild. Fusion or Confusion? Multimodal Com- plexity Is Not All You Need, 2025. URL https://arxiv.org/abs/2512.22991. Version Number: 2

Pith/arXiv arXiv 2025

[47] [47]

The Road Less Scheduled, October 2024

Aaron Defazio, Xingyu Alice Yang, Harsh Mehta, Konstantin Mishchenko, Ahmed Khaled, and Ashok Cutkosky. The Road Less Scheduled, October 2024. URL http://arxiv.org/abs/ 2405.15682. arXiv:2405.15682 [cs]

arXiv 2024

[48] [48]

Surrogate Gap Minimization Improves Sharpness-Aware Training, March 2022

Juntang Zhuang, Boqing Gong, Liangzhe Yuan, Yin Cui, Hartwig Adam, Nicha Dvornek, Sekhar Tatikonda, James Duncan, and Ting Liu. Surrogate Gap Minimization Improves Sharpness-Aware Training, March 2022. URL http://arxiv.org/abs/2203.08065. arXiv:2203.08065 [cs]

arXiv 2022

[49] [49]

Zico Kolter

Asher Trockman and J. Zico Kolter. Mimetic Initialization of Self-Attention Layers, May 2023. URLhttp://arxiv.org/abs/2305.09828. arXiv:2305.09828 [cs]

arXiv 2023

[50] [50]

On Layer Normalization in the Transformer Architecture, June 2020

Ruibin Xiong, Yunchang Yang, Di He, Kai Zheng, Shuxin Zheng, Chen Xing, Huishuai Zhang, Yanyan Lan, Liwei Wang, and Tie-Yan Liu. On Layer Normalization in the Transformer Architecture, June 2020. URL http://arxiv.org/abs/2002.04745. arXiv:2002.04745 [cs]

arXiv 2020

[51] [51]

A ConvNet for the 2020s, March 2022

Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, and Saining Xie. A ConvNet for the 2020s, March 2022. URL http://arxiv.org/abs/2201.03545. arXiv:2201.03545 [cs]

arXiv 2022

[52] [52]

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design, July 2018

Ningning Ma, Xiangyu Zhang, Hai-Tao Zheng, and Jian Sun. ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design, July 2018. URL http://arxiv.org/abs/ 1807.11164. arXiv:1807.11164 [cs]

Pith/arXiv arXiv 2018

[53] [53]

Fu, Stefano Ermon, Atri Rudra, and Christopher Ré

Tri Dao, Daniel Y . Fu, Stefano Ermon, Atri Rudra, and Christopher Ré. FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness, 2022. URL https://arxiv.org/ abs/2205.14135. Version Number: 2

Pith/arXiv arXiv 2022

[54] [54]

Time Matters: Scaling Laws for Any Budget, October 2024

Itay Inbar and Luke Sernau. Time Matters: Scaling Laws for Any Budget, October 2024. URL http://arxiv.org/abs/2406.18922. arXiv:2406.18922 [cs]

arXiv 2024

[55] [55]

Neuroimaging standards for research into small vessel disease and its contribution to ageing and neurodegeneration.Lancet Neurology, 12(8):822–838, August 2013

Joanna M Wardlaw, Eric E Smith, Geert J Biessels, Charlotte Cordonnier, Franz Fazekas, Richard Frayne, Richard I Lindley, John T O’Brien, Frederik Barkhof, Oscar R Benavente, Sandra E Black, Carol Brayne, Monique Breteler, Hugues Chabriat, Charles DeCarli, Frank- Erik de Leeuw, Fergus Doubal, Marco Duering, Nick C Fox, Steven Greenberg, Vladimir Hachinski...

work page doi:10.1016/s1474-4422(13)70124-8 2013

[56] [56]

Charles DeCarli, Joseph Massaro, Danielle Harvey, John Hald, Mats Tullberg, Rhoda Au, Alexa Beiser, Ralph D’Agostino, and Philip A. Wolf. Measures of brain morphology and infarction in the framingham heart study: establishing what is normal.Neurobiology of Aging, 26(4): 491–510, April 2005. ISSN 0197-4580. doi: 10.1016/j.neurobiolaging.2004.05.004. URL ht...

work page doi:10.1016/j.neurobiolaging.2004.05.004 2005

[57] [57]

Rodrigue, Kristen M

Naftali Raz, Ulman Lindenberger, Karen M. Rodrigue, Kristen M. Kennedy, Denise Head, Adrienne Williamson, Cheryl Dahle, Denis Gerstorf, and James D. Acker. Regional Brain Changes in Aging Healthy Adults: General Trends, Individual Differences and Modifiers. Cerebral Cortex, 15(11):1676–1689, November 2005. ISSN 1047-3211. doi: 10.1093/cercor/ bhi044. URLh...

work page doi:10.1093/cercor/ 2005

[58] [58]

Cox, Stuart J

Simon R. Cox, Stuart J. Ritchie, Elliot M. Tucker-Drob, David C. Liewald, Saskia P. Hagenaars, Gail Davies, Joanna M. Wardlaw, Catharine R. Gale, Mark E. Bastin, and Ian J. Deary. Ageing and brain white matter structure in 3,513 UK Biobank participants.Nature Communications, 7(1):13629, December 2016. ISSN 2041-1723. doi: 10.1038/ncomms13629. URL https: /...

work page doi:10.1038/ncomms13629 2016

[59] [59]

Electronic Com- merce Research and Applications73, 101527 (2025) https://doi.org/10.1016/j

Dani Beck, Ann-Marie G. de Lange, Ivan I. Maximov, Geneviève Richard, Ole A. Andreassen, Jan E. Nordvik, and Lars T. Westlye. White matter microstructure across the adult lifespan: A mixed longitudinal and cross-sectional study using advanced diffusion models and brain- age prediction.NeuroImage, 224:117441, January 2021. ISSN 1053-8119. doi: 10.1016/j. n...

work page doi:10.1016/j 2021

[60] [60]

Age- Related Gray and White Matter Changes in Normal Adult Brains.Aging and Disease, 8(6): 899–909, December 2017

Farnaz Farokhian, Chunlan Yang, Iman Beheshti, Hiroshi Matsuda, and Shuicai Wu. Age- Related Gray and White Matter Changes in Normal Adult Brains.Aging and Disease, 8(6): 899–909, December 2017. ISSN 2152-5250. doi: 10.14336/AD.2017.0502. URL https: //pmc.ncbi.nlm.nih.gov/articles/PMC5758357/

work page doi:10.14336/ad.2017.0502 2017

[61] [61]

Toledo, Tianhao Zhang, Nick Bryan, Lenore J

Mohamad Habes, Guray Erus, Jon B. Toledo, Tianhao Zhang, Nick Bryan, Lenore J. Launer, Yves Rosseel, Deborah Janowitz, Jimit Doshi, Sandra Van der Auwera, Bettina von Sarnowski, Katrin Hegenscheid, Norbert Hosten, Georg Homuth, Henry Völzke, Ulf Schminke, Wolfgang Hoffmann, Hans J. Grabe, and Christos Davatzikos. White matter hyperintensities and imaging ...

work page doi:10.1093/brain/aww008 2016

[62] [62]

Cortina, Nikkita Khattar, Abinand C

Mustapha Bouhrara, Luis E. Cortina, Nikkita Khattar, Abinand C. Rejimon, Samuel Ajamu, Defne S. Cezayirli, and Richard G. Spencer. Maturation and degeneration of the human brainstem across the adult lifespan.Aging, 13(11):14862–14891, June 2021. ISSN 1945-4589. doi: 10.18632/aging.203183. URL https://www.aging-us.com/lookup/doi/10.18632/ aging.203183

work page doi:10.18632/aging.203183 2021

[63] [63]

Morphological changes in the cerebellum during aging: evidence from convolutional neural networks and shape analysis.Frontiers in Ag- ing Neuroscience, 16, April 2024

Yu Wang, Ye Teng, Tianci Liu, Yuchun Tang, Wenjia Liang, Wenjun Wang, Zhuoran Li, Qing Xia, Feifei Xu, and Shuwei Liu. Morphological changes in the cerebellum during aging: evidence from convolutional neural networks and shape analysis.Frontiers in Ag- ing Neuroscience, 16, April 2024. ISSN 1663-4365. doi: 10.3389/fnagi.2024.1359320. URL https://www.front...

work page doi:10.3389/fnagi.2024.1359320 2024

[64] [64]

Multimodal quantitative MRI reveals age-related biophysical alterations in the human brain across the adult lifespan.NeuroIm- age, 327:121742, February 2026

Xiang Chen, Zhiyuan Yuan, Jie Zhang, and Xiao-Yong Zhang. Multimodal quantitative MRI reveals age-related biophysical alterations in the human brain across the adult lifespan.NeuroIm- age, 327:121742, February 2026. ISSN 1053-8119. doi: 10.1016/j.neuroimage.2026.121742. URLhttps://www.sciencedirect.com/science/article/pii/S1053811926000601

work page doi:10.1016/j.neuroimage.2026.121742 2026

[65] [65]

Mapping heterogeneous region- and tissue-specific brain ageing patterns using quantitative MRI.Brain Communications, 8(1):fcag010, February

Xinjie Chen, Mario Ocampo-Pineda, Po-Jui Lu, Michelle G Jansen, Kwok-Shing Chan, Marcel Zwiers, Joukje M Oosterman, David G Norris, Andre F Marquand, Lester Melie-Garcia, Cristina Granziera, and José P Marques. Mapping heterogeneous region- and tissue-specific brain ageing patterns using quantitative MRI.Brain Communications, 8(1):fcag010, February

[66] [66]

doi: 10.1093/braincomms/fcag010

ISSN 2632-1297. doi: 10.1093/braincomms/fcag010. URL https://doi.org/10. 1093/braincomms/fcag010

work page doi:10.1093/braincomms/fcag010

[67] [67]

International Journal of Computer Vision , author =

Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, and Dhruv Batra. Grad-CAM: Visual Explanations from Deep Networks via Gradient- based Localization.International Journal of Computer Vision, 128(2):336–359, February 2020. ISSN 0920-5691, 1573-1405. doi: 10.1007/s11263-019-01228-7. URL http://arxiv.org/ abs/1610.0...

work page doi:10.1007/s11263-019-01228-7 2020