DALight-3D: A Lightweight 3D U-Net for Brain Tumor Segmentation from Multi-Modal MRI

Dhruv Mishra; Dr Manu Pratap Singh; Nand Kumar Mishra

arxiv: 2605.04518 · v1 · submitted 2026-05-06 · 💻 cs.CV · cs.LG· cs.NE

DALight-3D: A Lightweight 3D U-Net for Brain Tumor Segmentation from Multi-Modal MRI

Nand Kumar Mishra , Dhruv Mishra , Dr Manu Pratap Singh This is my paper

Pith reviewed 2026-05-08 17:32 UTC · model grok-4.3

classification 💻 cs.CV cs.LGcs.NE

keywords brain tumor segmentation3D U-Netlightweight modelmulti-modal MRIDice scoredepthwise separable convolutioncross-slice attention

0 comments

The pith

DALight-3D reaches a mean Dice of 0.727 on brain tumor MRI using 2.22 million parameters.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents DALight-3D as a compact variant of the 3D U-Net designed to lower the computational cost of automatic brain tumor segmentation from multi-modal MRI scans. It incorporates depthwise separable 3D convolutions, identifier-conditioned normalization, cross-slice attention, and adaptive skip fusion. On the Medical Segmentation Decathlon Task01 BrainTumour benchmark and under matched optimization settings, the model records a mean Dice score of 0.727 with 2.22 million parameters. This exceeds the 0.710 Dice score obtained by the Residual 3D U-Net, which requires 3.20 million parameters. Ablation experiments show that performance drops when any of the four added components is removed.

Core claim

DALight-3D is a lightweight 3D U-Net that combines depthwise separable 3D convolutions, identifier-conditioned normalization, cross-slice attention, and adaptive skip fusion. Evaluated on the Medical Segmentation Decathlon Task01 BrainTumour benchmark with matched optimization settings, it attains a mean Dice of 0.727 using 2.22 million parameters, compared with 0.710 Dice and 3.20 million parameters for the Residual 3D U-Net baseline. Removing any one of the four added components produces consistent performance degradation in the reported ablations.

What carries the argument

The integration of depthwise separable 3D convolutions, identifier-conditioned normalization, cross-slice attention, and adaptive skip fusion inside a 3D U-Net backbone to reduce parameter count while preserving segmentation accuracy.

If this is right

The model achieves higher segmentation accuracy than the Residual 3D U-Net while using roughly 30 percent fewer parameters.
Each of the four added components contributes measurably to final performance according to the ablation results.
The architecture maintains its reported accuracy-efficiency balance on the Medical Segmentation Decathlon brain tumor task under the stated training protocol.
The approach provides a concrete example of trading model size for Dice score in volumetric medical image segmentation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same combination of separable convolutions and light attention could be tested on other 3D medical segmentation tasks such as organ or lesion delineation.
Identifier-conditioned normalization may offer a general way to stabilize training when input modalities vary in intensity distribution.
Further reductions in parameter count might be possible by replacing the remaining standard convolutions with additional separable layers.

Load-bearing premise

The reported Dice gains arise from the four architectural additions rather than from any unstated differences in training schedules, data preprocessing, or random seeds.

What would settle it

Retrain the DALight-3D model and the Residual 3D U-Net baseline on the same data splits using identical random seeds, augmentation pipelines, and optimization hyperparameters, then compare the resulting mean Dice scores.

read the original abstract

Automatic brain tumor segmentation from multi-modal MRI remains challenging because volumetric models often incur substantial computational cost. This paper presents DALight-3D, a compact 3D U-Net variant that combines depthwise separable 3D convolutions, identifier-conditioned normalization, cross-slice attention, and adaptive skip fusion. The method is evaluated on the Medical Segmentation Decathlon Task01 BrainTumour benchmark under matched optimization settings against standard 3D U-Net, Attention U-Net, Residual 3D U-Net, and V-Net baselines. In the reported 50-epoch comparison, DALight-3D achieves a mean Dice of 0.727 with 2.22M parameters, compared with 0.710 Dice and 3.20M parameters for Residual 3D U-Net. Component-wise ablations show consistent performance degradation when SepConv, identifier-conditioned normalization, CSA, or SSFB is removed. These results indicate that DALight-3D offers a favorable accuracy-efficiency trade-off within the present benchmark setting.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

DALight-3D recombines known efficiency modules into a 3D U-Net and reports a small Dice edge over Residual U-Net on one benchmark, but the gain is hard to credit cleanly to the architecture.

read the letter

The main thing to know is that DALight-3D assembles depthwise separable 3D convolutions, identifier-conditioned normalization, cross-slice attention, and adaptive skip fusion into a U-Net variant and reports a 0.017 higher mean Dice score than Residual 3D U-Net while using fewer parameters on the MSD brain tumor dataset. The paper does a solid job with its internal ablations. Removing any one of the new modules leads to lower performance, which shows the components are contributing rather than just adding noise. The soft spot is the external comparison. The abstract mentions matched optimization settings, but without reported variance across runs, explicit confirmation of identical data handling for the baselines, or full hyperparameter details, it is difficult to be sure the gain is due to the architecture. The concern about unstated training differences is reasonable given the information provided. This work is aimed at people who need practical, lightweight models for multi-modal brain tumor segmentation in settings where compute is limited. A reader working on clinical deployment or resource-constrained environments would find the parameter counts and Dice numbers useful as a reference point. I would send this to peer review. The claims are not overstated, the methods are standard, and the ablations provide a starting point for evaluation. Referees can request the additional experimental controls that would make the results more convincing.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces DALight-3D, a lightweight 3D U-Net variant for brain tumor segmentation from multi-modal MRI. It combines depthwise separable 3D convolutions, identifier-conditioned normalization, cross-slice attention (CSA), and adaptive skip fusion (SSFB). Evaluated on the Medical Segmentation Decathlon Task01 BrainTumour benchmark under matched optimization settings, DALight-3D reports a mean Dice of 0.727 with 2.22M parameters, outperforming Residual 3D U-Net (0.710 Dice, 3.20M parameters) and other baselines, with component ablations showing performance drops upon module removal.

Significance. If the performance gains are verifiably due to the architectural modules rather than training differences, the work offers a practical accuracy-efficiency trade-off for 3D medical segmentation, potentially aiding deployment in compute-limited clinical settings. The internal ablations provide direct evidence for each proposed component's contribution.

major comments (2)

[Abstract / Experimental Results] Abstract and Experimental Results section: The central claim of a 0.017 mean Dice improvement under 'matched optimization settings' lacks supporting details on whether identical data splits, preprocessing, augmentation pipelines, optimizer, learning-rate schedule, and random seeds were applied to the Residual 3D U-Net and other baselines. Without this or reported variance over multiple runs, the efficiency-accuracy trade-off cannot be confidently attributed to depthwise separable convolutions, identifier-conditioned normalization, CSA, or SSFB.
[Ablation studies] Ablation studies (component-wise results): While removals of SepConv, identifier-conditioned normalization, CSA, or SSFB show consistent degradation, the absence of error bars, statistical significance tests, or multi-seed averaging leaves the magnitude of each module's contribution unquantified, weakening support for the overall performance claim.

minor comments (2)

[Abstract / Methods] The 50-epoch comparison is mentioned in the abstract; clarify whether this represents full convergence or a fixed budget, and provide training curves or convergence criteria.
[Methods] Define acronyms CSA and SSFB at first use in the Methods section and ensure consistent notation for all modules throughout.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major comment below and indicate the revisions planned for the manuscript.

read point-by-point responses

Referee: [Abstract / Experimental Results] Abstract and Experimental Results section: The central claim of a 0.017 mean Dice improvement under 'matched optimization settings' lacks supporting details on whether identical data splits, preprocessing, augmentation pipelines, optimizer, learning-rate schedule, and random seeds were applied to the Residual 3D U-Net and other baselines. Without this or reported variance over multiple runs, the efficiency-accuracy trade-off cannot be confidently attributed to depthwise separable convolutions, identifier-conditioned normalization, CSA, or SSFB.

Authors: We agree that the manuscript would benefit from more explicit confirmation of the matched settings. In the revised version, we will expand the Experimental Results section to state that all models (including Residual 3D U-Net and other baselines) were trained using identical data splits from the Medical Segmentation Decathlon benchmark, the same preprocessing and augmentation pipelines, the same optimizer and learning-rate schedule, and the same random seed. These settings are already described in the Experimental Setup section and were applied uniformly. Regarding variance over multiple runs, our experiments used fixed seeds for reproducibility; due to the high computational cost of 3D volumetric training, multiple independent runs were not performed. We will add a note acknowledging this limitation in the revised manuscript. revision: partial
Referee: [Ablation studies] Ablation studies (component-wise results): While removals of SepConv, identifier-conditioned normalization, CSA, or SSFB show consistent degradation, the absence of error bars, statistical significance tests, or multi-seed averaging leaves the magnitude of each module's contribution unquantified, weakening support for the overall performance claim.

Authors: We acknowledge that error bars and statistical tests would strengthen the quantification of each module's contribution. The ablations demonstrate consistent performance degradation upon removal of each component under identical training conditions. In the revision, we will update the Ablation Studies section to explicitly note that all results are from single training runs and to discuss this as a limitation. We will also consider adding a brief statement on the potential value of multi-seed experiments for future work. The observed degradations remain supportive of the modules' contributions within the controlled experimental framework. revision: partial

Circularity Check

0 steps flagged

No circularity: empirical architecture proposal with external benchmark evaluation

full rationale

The paper proposes DALight-3D as a lightweight 3D U-Net variant by combining depthwise separable convolutions with new modules (identifier-conditioned normalization, cross-slice attention, adaptive skip fusion) and evaluates the design empirically on the public Medical Segmentation Decathlon Task01 benchmark. Component ablations and comparisons to baselines (including Residual 3D U-Net) are reported under claimed matched settings. No mathematical derivation chain, first-principles prediction, uniqueness theorem, or fitted parameter is presented that reduces to the inputs by construction. All performance claims rest on external data and are independently falsifiable by re-running the public benchmark; no self-citation or self-definition is load-bearing for the central accuracy-efficiency result.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Based on abstract only; no explicit free parameters, axioms, or invented entities are stated beyond standard deep-learning training assumptions such as the Dice metric being appropriate and the benchmark being representative.

pith-pipeline@v0.9.0 · 5494 in / 1184 out tokens · 80015 ms · 2026-05-08T17:32:15.613612+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

35 extracted references · 35 canonical work pages

[1]

Parcham, M

Akter A, Nosheen N, Ahmed S, et al (2024) Robust clinical applicable CNN and U-Net based algorithm for MRI classification and segmentation for brain tumor. Expert Systems with Applications 238:122347. https://doi.org/10.1016/j.eswa. 2023.122347

work page doi:10.1016/j.eswa 2024
[2]

In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop)

Andermatt S, Pezold S, Cattin PC (2018) Multi-planar deep segmentation networks for 3d brain tumor segmentation. In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop)

work page 2018
[3]

European Conference on Computer Vision

Cao H, Wang Y, Chen J, et al (2022) Swin-unet: Unet-like pure transformer for medical image segmentation. European Conference on Computer Vision

work page 2022
[4]

In: Medical Image Computing and Computer-Assisted Intervention (MICCAI)

Cicek O, Abdulkadir A, Lienkamp SS, et al (2016) 3d u-net: Learning dense volumetric segmentation from sparse annotation. In: Medical Image Computing and Computer-Assisted Intervention (MICCAI). Springer, pp 424–432

work page 2016
[5]

Expert Systems with Applications 224:119963

Farajzadeh N, Sadeghzadeh N, Hashemzadeh M (2023) Brain tumor segmentation and classification on MRI via deep hybrid representation learning. Expert Systems with Applications 224:119963. https://doi.org/10.1016/j.eswa.2023.119963

work page doi:10.1016/j.eswa.2023.119963 2023
[6]

In: Journal of Machine Learning Research, pp 1–35

Ganin Y, Lempitsky V (2016) Domain-adversarial training of neural networks. In: Journal of Machine Learning Research, pp 1–35

work page 2016
[7]

Magnetic Resonance Imaging 31(8):1426–1438

Gordillo N, Montseny E, Sobrevilla P (2013) State of the art survey on MRI brain tumor segmentation. Magnetic Resonance Imaging 31(8):1426–1438. https: //doi.org/10.1016/j.mri.2013.05.002

work page doi:10.1016/j.mri.2013.05.002 2013
[8]

In: International Conference on Machine Learning (ICML), pp 1321–1330

Guo C, Pleiss G, Sun Y, et al (2017) On calibration of modern neural networks. In: International Conference on Machine Learning (ICML), pp 1321–1330

work page 2017
[9]

IEEE Winter Conference on Applications of Computer Vision (WACV) 24

Hatamizadeh A, Tang Y, Nath V, et al (2022) Unetr: Transformers for 3d med- ical image segmentation. IEEE Winter Conference on Applications of Computer Vision (WACV) 24

work page 2022
[10]

Medical Image Analysis 35:18–31

Havaei M, Davy A, Warde-Farley D, et al (2017) Brain tumor segmentation with deep neural networks. Medical Image Analysis 35:18–31. https://doi.org/10.1016/ j.media.2016.05.004

work page 2017
[11]

In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 770–778

He K, Zhang X, Ren S, et al (2016) Deep residual learning for image recogni- tion. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 770–778

work page 2016
[12]

arXiv preprint arXiv:160608415

Hendrycks D, Gimpel K (2016) Gaussian error linear units (GELUs). arXiv preprint arXiv:160608415

work page 2016
[13]

In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Trau- matic Brain Injuries (MICCAI Workshop), pp 327–339, https://doi.org/10.1007/ 978-3-030-72084-1 30

Henry T, Carr´ e A, Lerousseau M, et al (2021) Brain tumor segmentation with self-ensembled, deeply-supervised 3d u-net neural networks: A BraTS 2020 challenge solution. In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Trau- matic Brain Injuries (MICCAI Workshop), pp 327–339, https://doi.org/10.1007/ 978-3-030-72084-1 30

work page 2021
[14]

arXiv preprint arXiv:170404861

Howard AG, Zhu M, Chen B, et al (2017) Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:170404861

work page 2017
[15]

IEEE Transactions on Pattern Analysis and Machine Intelligence 42(8):2011–2023

Hu J, Shen L, Sun G (2020) Squeeze-and-excitation networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 42(8):2011–2023

work page 2020
[16]

In: International Conference on Machine Learning, pp 448–456

Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network train- ing by reducing internal covariate shift. In: International Conference on Machine Learning, pp 448–456

work page 2015
[17]

In: arXiv preprint arXiv:1809.10483

Isensee F, Petersen J, Kohl SAA, et al (2018) No new-net. In: arXiv preprint arXiv:1809.10483

work page arXiv 2018
[18]

nnU-Net: A Self-Configuring Method for Deep Learning-Based Biomedical Image Segmentation,

Isensee F, Jaeger PF, Kohl SAA, et al (2021) nnu-net: A self-configuring method for deep learning-based biomedical image segmentation. Nature Methods 18(2):203–211. https://doi.org/10.1038/s41592-020-01008-z

work page doi:10.1038/s41592-020-01008-z 2021
[19]

and Simpson, Joanna P

Kamnitsas K, Ledig C, Newcombe VFJ, et al (2017) Efficient multi-scale 3d cnn with fully connected crf for accurate brain lesion segmentation. Medical Image Analysis 36:61–78. https://doi.org/10.1016/j.media.2016.10.004

work page doi:10.1016/j.media.2016.10.004 2017
[20]

Information Processing in Medical Imaging pp 597–609

Kamnitsas K, et al (2017) Unsupervised domain adaptation in brain lesion seg- mentation with adversarial networks. Information Processing in Medical Imaging pp 597–609

work page 2017
[21]

In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Liu Z, Mao H, Wu CY, et al (2022) A convnet for the 2020s. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

work page 2022
[22]

25 Technologies 14(1):34

Lopez-Ramirez JL, Hernandez-Gutierrez FD, Avina-Ortiz JR, et al (2026) Multi- scale ConvNeXt for robust brain tumor segmentation in multimodal MRI. 25 Technologies 14(1):34. https://doi.org/10.3390/technologies14010034

work page doi:10.3390/technologies14010034 2026
[23]

International Conference on Learning Representations (ICLR)

Loshchilov I, Hutter F (2017) SGDR: Stochastic gradient descent with warm restarts. International Conference on Learning Representations (ICLR)

work page 2017
[24]

In: Inter- national Conference on Learning Representations (ICLR)

Loshchilov I, Hutter F (2019) Decoupled weight decay regularization. In: Inter- national Conference on Learning Representations (ICLR)

work page 2019
[25]

Elvis Nava, Seijin Kobayashi, Yifei Yin, Robert K Katzschmann, and Benjamin F Grewe

Menze BH, Jakab A, Bauer S, et al (2015) The multimodal brain tumor image segmentation benchmark (brats). IEEE Transactions on Medical Imaging 34(10):1993–2024. https://doi.org/10.1109/TMI.2014.2377694

work page doi:10.1109/tmi.2014.2377694 2015
[26]

Milletari, N

Milletari F, Navab N, Ahmadi SA (2016) V-net: Fully convolutional neural networks for volumetric medical image segmentation. In: Proceedings of the Fourth International Conference on 3D Vision (3DV). IEEE, pp 565–571, https: //doi.org/10.1109/3DV.2016.79

work page doi:10.1109/3dv.2016.79 2016
[27]

In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop)

Myronenko A (2019) 3d MRI brain tumor segmentation using autoencoder reg- ularization. In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop)

work page 2019
[28]

In: Medical Imaging with Deep Learning (MIDL)

Oktay O, Schlemper J, Le Folgoc L, et al (2018) Attention u-net: Learning where to look for the pancreas. In: Medical Imaging with Deep Learning (MIDL)

work page 2018
[29]

In: International Conference on Computational Intel- ligence in Data Science, pp 124–135, https://doi.org/10.1007/978-3-031-16364-7 10

Rawat A, Kumar R (2022) Assessing layer normalization with BraTS MRI data in a convolutional neural net. In: International Conference on Computational Intel- ligence in Data Science, pp 124–135, https://doi.org/10.1007/978-3-031-16364-7 10

work page doi:10.1007/978-3-031-16364-7 2022
[30]

In: Medical Image Computing and Computer- Assisted Intervention (MICCAI)

Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer- Assisted Intervention (MICCAI). Springer, pp 234–241

work page 2015
[31]

Current Medical Imaging 21

Satushe V, Vyas V, Metkar S, et al (2025) Advanced cnn architecture for brain tumor segmentation and classification using brats-goat 2024 dataset. Current Medical Imaging 21. https://doi.org/10.2174/0115734056344235241217155930

work page doi:10.2174/0115734056344235241217155930 2025
[32]

arXiv preprint arXiv:190209063

Simpson AL, Antonelli M, Bakas S, et al (2019) A large annotated medical image dataset for the development and evaluation of segmentation algorithms. arXiv preprint arXiv:190209063

work page 2019
[33]

In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop) 26

Wang G, Li W, Ourselin S, et al (2019) Automatic brain tumor segmenta- tion using cascaded anisotropic convolutional neural networks. In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop) 26

work page 2019
[34]

In: European Conference on Computer Vision, pp 3–19

Woo S, Park J, Lee JY, et al (2018) Cbam: Convolutional block attention module. In: European Conference on Computer Vision, pp 3–19

work page 2018
[35]

In: European Conference on Computer Vision (ECCV), pp 3–19 27

Wu Y, He K (2018) Group normalization. In: European Conference on Computer Vision (ECCV), pp 3–19 27

work page 2018

[1] [1]

Parcham, M

Akter A, Nosheen N, Ahmed S, et al (2024) Robust clinical applicable CNN and U-Net based algorithm for MRI classification and segmentation for brain tumor. Expert Systems with Applications 238:122347. https://doi.org/10.1016/j.eswa. 2023.122347

work page doi:10.1016/j.eswa 2024

[2] [2]

In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop)

Andermatt S, Pezold S, Cattin PC (2018) Multi-planar deep segmentation networks for 3d brain tumor segmentation. In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop)

work page 2018

[3] [3]

European Conference on Computer Vision

Cao H, Wang Y, Chen J, et al (2022) Swin-unet: Unet-like pure transformer for medical image segmentation. European Conference on Computer Vision

work page 2022

[4] [4]

In: Medical Image Computing and Computer-Assisted Intervention (MICCAI)

Cicek O, Abdulkadir A, Lienkamp SS, et al (2016) 3d u-net: Learning dense volumetric segmentation from sparse annotation. In: Medical Image Computing and Computer-Assisted Intervention (MICCAI). Springer, pp 424–432

work page 2016

[5] [5]

Expert Systems with Applications 224:119963

Farajzadeh N, Sadeghzadeh N, Hashemzadeh M (2023) Brain tumor segmentation and classification on MRI via deep hybrid representation learning. Expert Systems with Applications 224:119963. https://doi.org/10.1016/j.eswa.2023.119963

work page doi:10.1016/j.eswa.2023.119963 2023

[6] [6]

In: Journal of Machine Learning Research, pp 1–35

Ganin Y, Lempitsky V (2016) Domain-adversarial training of neural networks. In: Journal of Machine Learning Research, pp 1–35

work page 2016

[7] [7]

Magnetic Resonance Imaging 31(8):1426–1438

Gordillo N, Montseny E, Sobrevilla P (2013) State of the art survey on MRI brain tumor segmentation. Magnetic Resonance Imaging 31(8):1426–1438. https: //doi.org/10.1016/j.mri.2013.05.002

work page doi:10.1016/j.mri.2013.05.002 2013

[8] [8]

In: International Conference on Machine Learning (ICML), pp 1321–1330

Guo C, Pleiss G, Sun Y, et al (2017) On calibration of modern neural networks. In: International Conference on Machine Learning (ICML), pp 1321–1330

work page 2017

[9] [9]

IEEE Winter Conference on Applications of Computer Vision (WACV) 24

Hatamizadeh A, Tang Y, Nath V, et al (2022) Unetr: Transformers for 3d med- ical image segmentation. IEEE Winter Conference on Applications of Computer Vision (WACV) 24

work page 2022

[10] [10]

Medical Image Analysis 35:18–31

Havaei M, Davy A, Warde-Farley D, et al (2017) Brain tumor segmentation with deep neural networks. Medical Image Analysis 35:18–31. https://doi.org/10.1016/ j.media.2016.05.004

work page 2017

[11] [11]

In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 770–778

He K, Zhang X, Ren S, et al (2016) Deep residual learning for image recogni- tion. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 770–778

work page 2016

[12] [12]

arXiv preprint arXiv:160608415

Hendrycks D, Gimpel K (2016) Gaussian error linear units (GELUs). arXiv preprint arXiv:160608415

work page 2016

[13] [13]

In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Trau- matic Brain Injuries (MICCAI Workshop), pp 327–339, https://doi.org/10.1007/ 978-3-030-72084-1 30

Henry T, Carr´ e A, Lerousseau M, et al (2021) Brain tumor segmentation with self-ensembled, deeply-supervised 3d u-net neural networks: A BraTS 2020 challenge solution. In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Trau- matic Brain Injuries (MICCAI Workshop), pp 327–339, https://doi.org/10.1007/ 978-3-030-72084-1 30

work page 2021

[14] [14]

arXiv preprint arXiv:170404861

Howard AG, Zhu M, Chen B, et al (2017) Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:170404861

work page 2017

[15] [15]

IEEE Transactions on Pattern Analysis and Machine Intelligence 42(8):2011–2023

Hu J, Shen L, Sun G (2020) Squeeze-and-excitation networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 42(8):2011–2023

work page 2020

[16] [16]

In: International Conference on Machine Learning, pp 448–456

Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network train- ing by reducing internal covariate shift. In: International Conference on Machine Learning, pp 448–456

work page 2015

[17] [17]

In: arXiv preprint arXiv:1809.10483

Isensee F, Petersen J, Kohl SAA, et al (2018) No new-net. In: arXiv preprint arXiv:1809.10483

work page arXiv 2018

[18] [18]

nnU-Net: A Self-Configuring Method for Deep Learning-Based Biomedical Image Segmentation,

Isensee F, Jaeger PF, Kohl SAA, et al (2021) nnu-net: A self-configuring method for deep learning-based biomedical image segmentation. Nature Methods 18(2):203–211. https://doi.org/10.1038/s41592-020-01008-z

work page doi:10.1038/s41592-020-01008-z 2021

[19] [19]

and Simpson, Joanna P

Kamnitsas K, Ledig C, Newcombe VFJ, et al (2017) Efficient multi-scale 3d cnn with fully connected crf for accurate brain lesion segmentation. Medical Image Analysis 36:61–78. https://doi.org/10.1016/j.media.2016.10.004

work page doi:10.1016/j.media.2016.10.004 2017

[20] [20]

Information Processing in Medical Imaging pp 597–609

Kamnitsas K, et al (2017) Unsupervised domain adaptation in brain lesion seg- mentation with adversarial networks. Information Processing in Medical Imaging pp 597–609

work page 2017

[21] [21]

In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Liu Z, Mao H, Wu CY, et al (2022) A convnet for the 2020s. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

work page 2022

[22] [22]

25 Technologies 14(1):34

Lopez-Ramirez JL, Hernandez-Gutierrez FD, Avina-Ortiz JR, et al (2026) Multi- scale ConvNeXt for robust brain tumor segmentation in multimodal MRI. 25 Technologies 14(1):34. https://doi.org/10.3390/technologies14010034

work page doi:10.3390/technologies14010034 2026

[23] [23]

International Conference on Learning Representations (ICLR)

Loshchilov I, Hutter F (2017) SGDR: Stochastic gradient descent with warm restarts. International Conference on Learning Representations (ICLR)

work page 2017

[24] [24]

In: Inter- national Conference on Learning Representations (ICLR)

Loshchilov I, Hutter F (2019) Decoupled weight decay regularization. In: Inter- national Conference on Learning Representations (ICLR)

work page 2019

[25] [25]

Elvis Nava, Seijin Kobayashi, Yifei Yin, Robert K Katzschmann, and Benjamin F Grewe

Menze BH, Jakab A, Bauer S, et al (2015) The multimodal brain tumor image segmentation benchmark (brats). IEEE Transactions on Medical Imaging 34(10):1993–2024. https://doi.org/10.1109/TMI.2014.2377694

work page doi:10.1109/tmi.2014.2377694 2015

[26] [26]

Milletari, N

Milletari F, Navab N, Ahmadi SA (2016) V-net: Fully convolutional neural networks for volumetric medical image segmentation. In: Proceedings of the Fourth International Conference on 3D Vision (3DV). IEEE, pp 565–571, https: //doi.org/10.1109/3DV.2016.79

work page doi:10.1109/3dv.2016.79 2016

[27] [27]

In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop)

Myronenko A (2019) 3d MRI brain tumor segmentation using autoencoder reg- ularization. In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop)

work page 2019

[28] [28]

In: Medical Imaging with Deep Learning (MIDL)

Oktay O, Schlemper J, Le Folgoc L, et al (2018) Attention u-net: Learning where to look for the pancreas. In: Medical Imaging with Deep Learning (MIDL)

work page 2018

[29] [29]

In: International Conference on Computational Intel- ligence in Data Science, pp 124–135, https://doi.org/10.1007/978-3-031-16364-7 10

Rawat A, Kumar R (2022) Assessing layer normalization with BraTS MRI data in a convolutional neural net. In: International Conference on Computational Intel- ligence in Data Science, pp 124–135, https://doi.org/10.1007/978-3-031-16364-7 10

work page doi:10.1007/978-3-031-16364-7 2022

[30] [30]

In: Medical Image Computing and Computer- Assisted Intervention (MICCAI)

Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. In: Medical Image Computing and Computer- Assisted Intervention (MICCAI). Springer, pp 234–241

work page 2015

[31] [31]

Current Medical Imaging 21

Satushe V, Vyas V, Metkar S, et al (2025) Advanced cnn architecture for brain tumor segmentation and classification using brats-goat 2024 dataset. Current Medical Imaging 21. https://doi.org/10.2174/0115734056344235241217155930

work page doi:10.2174/0115734056344235241217155930 2025

[32] [32]

arXiv preprint arXiv:190209063

Simpson AL, Antonelli M, Bakas S, et al (2019) A large annotated medical image dataset for the development and evaluation of segmentation algorithms. arXiv preprint arXiv:190209063

work page 2019

[33] [33]

In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop) 26

Wang G, Li W, Ourselin S, et al (2019) Automatic brain tumor segmenta- tion using cascaded anisotropic convolutional neural networks. In: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries (MICCAI Workshop) 26

work page 2019

[34] [34]

In: European Conference on Computer Vision, pp 3–19

Woo S, Park J, Lee JY, et al (2018) Cbam: Convolutional block attention module. In: European Conference on Computer Vision, pp 3–19

work page 2018

[35] [35]

In: European Conference on Computer Vision (ECCV), pp 3–19 27

Wu Y, He K (2018) Group normalization. In: European Conference on Computer Vision (ECCV), pp 3–19 27

work page 2018