arxiv: 2211.02701 · v1 · submitted 2022-11-04 · 💻 cs.LG · cs.AI· cs.CV

Recognition: 2 theorem links

· Lean Theorem

MONAI: An open-source framework for deep learning in healthcare

M. Jorge Cardoso , Wenqi Li , Richard Brown , Nic Ma , Eric Kerfoot , Yiheng Wang , Benjamin Murrey , Andriy Myronenko

show 48 more authors

Can Zhao Dong Yang Vishwesh Nath Yufan He Ziyue Xu Ali Hatamizadeh Wentao Zhu Yun Liu Mingxin Zheng Yucheng Tang Isaac Yang Michael Zephyr Behrooz Hashemian Sachidanand Alle Mohammad Zalbagi Darestani Charlie Budd Marc Modat Tom Vercauteren Guotai Wang Yiwen Li Yipeng Hu Yunguan Fu Benjamin Gorman Hans Johnson Brad Genereaux Barbaros S. Erdal Vikash Gupta Andres Diaz-Pinto Andre Dourson Lena Maier-Hein Paul F. Jaeger Michael Baumgartner Jayashree Kalpathy-Cramer Mona Flores Justin Kirby Lee A.D. Cooper Holger R. Roth Daguang Xu David Bericat Ralf Floca S. Kevin Zhou Haris Shuaib Keyvan Farahani Klaus H. Maier-Hein Stephen Aylward Prerna Dogra Sebastien Ourselin Andrew Feng

Authors on Pith no claims yet

Pith reviewed 2026-05-13 22:50 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.CV

keywords MONAIdeep learninghealthcaremedical imagingPyTorchopen-source frameworkAI in medicine

0 comments

The pith

MONAI extends PyTorch with medical-specific components to support deep learning in healthcare.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces MONAI as a freely available, community-supported framework built on PyTorch for deep learning applications in healthcare. It adds purpose-built architectures, transformations, and utilities that account for the geometry, physiology, and physics of medical data, especially imaging. The design preserves PyTorch's simple and compositional style while incorporating software best practices for documentation, testing, and ease of use. By making these tools open and extensible, MONAI aims to help research, clinical, and industrial teams create more reproducible and robust models. Adoption across global teams working on nearly every aspect of healthcare illustrates the intended reach of this infrastructure.

Core claim

MONAI extends PyTorch to support medical data, with a particular focus on imaging, and provide purpose-specific AI model architectures, transformations and utilities that streamline the development and deployment of medical AI models.

What carries the argument

The MONAI library, which supplies medical-aware extensions to PyTorch including data handling, transforms, networks, and utilities while enforcing best-practice software development.

If this is right

Medical AI model development becomes more standardized through shared, tested components that respect data geometry and physiology.
Reproducibility improves when teams follow the documented transforms and utilities instead of ad-hoc pipelines.
Clinical translation is eased because the framework already encodes awareness of medical data peculiarities such as imaging physics.
Global collaboration accelerates as research, clinical, and industrial groups contribute and reuse the same open codebase.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Researchers already familiar with PyTorch can adopt medical workflows with minimal new syntax.
A shared library of this type could gradually reduce duplication of effort across medical imaging studies.
As usage grows, community-driven additions may expand coverage to non-imaging modalities such as signals or genomics.

Load-bearing premise

The provided components and best-practice guidelines will in practice produce safer, more reproducible, and clinically deployable models without additional domain-specific validation.

What would settle it

A head-to-head study in which models built with MONAI components show no measurable gains in reproducibility, robustness, or clinical safety metrics compared with equivalent custom PyTorch implementations.

read the original abstract

Artificial Intelligence (AI) is having a tremendous impact across most areas of science. Applications of AI in healthcare have the potential to improve our ability to detect, diagnose, prognose, and intervene on human disease. For AI models to be used clinically, they need to be made safe, reproducible and robust, and the underlying software framework must be aware of the particularities (e.g. geometry, physiology, physics) of medical data being processed. This work introduces MONAI, a freely available, community-supported, and consortium-led PyTorch-based framework for deep learning in healthcare. MONAI extends PyTorch to support medical data, with a particular focus on imaging, and provide purpose-specific AI model architectures, transformations and utilities that streamline the development and deployment of medical AI models. MONAI follows best practices for software-development, providing an easy-to-use, robust, well-documented, and well-tested software framework. MONAI preserves the simple, additive, and compositional approach of its underlying PyTorch libraries. MONAI is being used by and receiving contributions from research, clinical and industrial teams from around the world, who are pursuing applications spanning nearly every aspect of healthcare.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MONAI is a straightforward PyTorch extension for medical imaging that packages existing patterns into a maintained, consortium-backed library.

read the letter

The paper's main point is the release of MONAI itself: a PyTorch-based framework that adds medical-specific transforms, models, and utilities while keeping the original library's simple compositional style. It is already seeing external contributions and use across research, clinical, and industry groups, which is the clearest sign of value so far. The design choices around testing, documentation, and governance are described clearly and follow standard open-source practices for this kind of project. That is the actual contribution here—an engineering and infrastructure one rather than a new algorithm or empirical result. The manuscript stays consistent with what it claims: it describes an implemented artifact without hidden derivations or performance guarantees. The long author list reflects the consortium model, and the text gives a reasonable overview of the components without overreaching. One limitation is the absence of any usage benchmarks or before-after comparisons that would show whether the framework actually improves reproducibility or deployment speed in practice; the paper treats those benefits as likely outcomes rather than demonstrated ones. This is common for framework releases but still leaves the central assumption untested in the document. The work is aimed at developers and researchers who build medical imaging models and want standardized, tested tools instead of reinventing common pipelines. It is solid enough on its own terms to warrant peer review, mainly so reviewers can check adoption data, API stability, and whether the best-practice claims hold up under real use. I would send it forward.

Referee Report

0 major / 2 minor

Summary. The manuscript introduces MONAI, a freely available, community-supported, and consortium-led PyTorch-based framework for deep learning in healthcare. It extends PyTorch to support medical data (with emphasis on imaging) by supplying purpose-specific model architectures, transformations, and utilities. The paper describes the framework's design choices, including a compositional API that preserves PyTorch's additive style, adherence to software best practices (robustness, documentation, testing), and its current adoption and external contributions from research, clinical, and industrial teams.

Significance. If the descriptive claims hold, the work is significant because it supplies a standardized, extensible, open-source platform that directly addresses the geometry, physiology, and physics particularities of medical data. By following documented best practices and operating under consortium governance with active external contributions, MONAI can accelerate reproducible research and lower barriers to clinically deployable models across healthcare applications.

minor comments (2)

[Design and Implementation] The manuscript would benefit from a concise table (perhaps in §3 or §4) that enumerates the core MONAI modules (transforms, networks, losses, metrics) alongside their PyTorch counterparts to make the extension points immediately visible to readers.
[Software Engineering Practices] A short paragraph on the testing and continuous-integration strategy (mentioned in the abstract) would strengthen the 'well-tested' claim; currently the description remains high-level.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive assessment of the MONAI manuscript and for recommending acceptance. We appreciate the recognition that the framework addresses key challenges in medical imaging and deep learning through its design, governance, and community contributions.

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The manuscript is a purely descriptive introduction to the MONAI software framework, outlining its PyTorch-based design, medical-imaging transforms, model architectures, and community governance without any equations, derivations, fitted parameters, or quantitative predictions. No self-definitional loops, fitted-input predictions, or load-bearing self-citations appear; the central claim is the existence and features of the released artifact itself, which stands independently of any internal reduction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

This is a software framework announcement rather than a theoretical derivation. No free parameters, invented physical entities, or ad-hoc axioms are required beyond standard assumptions about PyTorch extensibility and medical data characteristics.

axioms (1)

domain assumption PyTorch's compositional design can be extended to medical imaging without breaking existing workflows
Invoked in the description of preserving the simple, additive approach of PyTorch.

pith-pipeline@v0.9.0 · 5761 in / 1057 out tokens · 26912 ms · 2026-05-13T22:50:54.468025+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith.Foundation.RealityFromDistinction reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

This work introduces MONAI, a freely available, community-supported, and consortium-led PyTorch-based framework for deep learning in healthcare. MONAI extends PyTorch to support medical data, with a particular focus on imaging, and provide purpose-specific AI model architectures, transformations and utilities
IndisputableMonolith.Cost.FunctionalEquation washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

MONAI follows best practices for software-development, providing an easy-to-use, robust, well-documented, and well-tested software framework. MONAI preserves the simple, additive, and compositional approach of its underlying PyTorch libraries.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 24 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MedOpenClaw and MedFlowBench: Auditing Medical Agents in Full-Study Workflows
cs.CV 2026-03 conditional novelty 8.0

MedFlowBench evaluates VLM agents on full radiology and pathology studies by requiring both task answers and verifiable evidence like key slices and regions of interest, revealing that answer-only scores overestimate ...
SegWithU: Uncertainty as Perturbation Energy for Single-Forward-Pass Risk-Aware Medical Image Segmentation
cs.CV 2026-04 unverdicted novelty 7.0

SegWithU treats uncertainty as perturbation energy via rank-1 probes in a post-hoc head for efficient single-pass risk-aware medical image segmentation, outperforming other single-forward-pass methods on ACDC, BraTS20...
AbdomenGen: Sequential Volume-Conditioned Diffusion Framework for Abdominal Anatomy Generation
cs.CV 2026-04 unverdicted novelty 7.0

A sequential diffusion framework generates controllable abdominal anatomies with a Volume Control Scalar that decouples organ size from body habitus, achieving Dice scores around 0.83 and reducing distributional misma...
Camyla: Scaling Autonomous Research in Medical Image Segmentation
cs.AI 2026-04 unverdicted novelty 7.0

Camyla autonomously generates research proposals, experiments, and manuscripts in medical image segmentation, outperforming baselines on 24 of 31 recent datasets while producing 40 human-reviewed papers.
Cross Modality Image Translation In Medical Imaging Using Generative Frameworks
cs.CV 2026-05 accept novelty 6.0

A uniform benchmark across 77 experiments finds SRGAN superior to latent diffusion models for 3D medical image translation, with synthetic volumes indistinguishable from real ones in a 17-physician Turing test.
Tumor-aware augmentation with task-guided attention analysis improves rectal cancer segmentation from magnetic resonance images
eess.IV 2026-05 unverdicted novelty 6.0

Tumor-aware augmentation and anisotropic cropping improve CT-to-MRI transfer for rectal cancer segmentation in hierarchical transformers by reducing attention dilution from padding and enhancing feature adaptation.
SIAM: Head and Brain MRI Segmentation from Few High-Quality Templates via Synthetic Training
cs.CV 2026-05 unverdicted novelty 6.0

SIAM achieves state-of-the-art whole-head MRI segmentation of 16 structures including extra-cerebral tissues by training on synthetic data from just six manual templates, matching or exceeding prior methods on 301 sca...
GeoSAE: Geometric Prior-Guided Layer-Wise Sparse Autoencoder Annotation of Brain MRI Foundation Models
cs.CV 2026-05 unverdicted novelty 6.0

GeoSAE extracts a compact, interpretable feature set from frozen brain MRI foundation models that predicts MCI-to-AD conversion (AUC 0.746) with age-deconfounded annotations and replicates across cohorts.
ESICA: A Scalable Framework for Text-Guided 3D Medical Image Segmentation
cs.CV 2026-04 unverdicted novelty 6.0

ESICA delivers state-of-the-art accuracy on a five-modality 3D medical segmentation benchmark while offering a compact variant with far fewer parameters.
Generative Modeling of Neurodegenerative Brain Anatomy with 4D Longitudinal Diffusion Model
cs.CV 2026-04 unverdicted novelty 6.0

A 4D diffusion generative model learns topology-preserving spatiotemporal deformations to synthesize realistic longitudinal brain anatomy trajectories in neurodegenerative diseases from sparse follow-up scans.
Co-distilled attention guided masked image modeling with noisy teacher for self-supervised learning on medical images
cs.CV 2026-04 unverdicted novelty 6.0

DAGMaN uses co-distilled attention-guided masked image modeling with a noisy teacher to enable effective self-supervised pretraining on medical images by selective masking of co-occurring patches and maintenance of at...
Neuro-Oracle: A Trajectory-Aware Agentic RAG Framework for Interpretable Epilepsy Surgical Prognosis
cs.MM 2026-04 unverdicted novelty 6.0

Neuro-Oracle distills longitudinal MRI changes into trajectory vectors via a 3D Siamese encoder, retrieves similar cases, and generates LLM-based prognoses, achieving AUC 0.834-0.905 on a resection-type proxy task ver...
Distilling Photon-Counting CT into Routine Chest CT through Clinically Validated Degradation Modeling
cs.CV 2026-04 unverdicted novelty 6.0

SUMI distills photon-counting CT quality into routine chest CT by learning to reverse clinically validated acquisition degradations, yielding 15-20% gains in image metrics, better radiologist utility, and up to 15% hi...
NeuroAgent: LLM Agents for Multimodal Neuroimaging Analysis and Research
cs.AI 2026-05 unverdicted novelty 5.0

NeuroAgent uses a hierarchical LLM agent framework with Generate-Execute-Validate loops to automate neuroimaging preprocessing, reaching 84.8% end-to-end correctness and 0.9518 AUC for Alzheimer's classification on 14...
The autoPET3 Challenge: Automated Lesion Segmentation in Whole-Body PET/CT $\unicode{x2013}$ Multitracer Multicenter Generalization
cs.CV 2026-05 unverdicted novelty 5.0

The autoPET3 challenge finds that leading AI models reach a mean Dice score of 0.66 for multitracer PET/CT lesion segmentation, with compositional generalization to unseen tracer-center pairs remaining an open problem...
Multimodal synthesis of MRI and tabular data with diffusion in a joint latent space via cross-attention
eess.IV 2026-05 unverdicted novelty 5.0

A latent diffusion model jointly synthesizes MRI volumes and mixed-type tabular clinical data in a shared space via cross-attention and separate decoders after VAE fusion.
Architecture-Agnostic Modality-Isolated Gated Fusion for Robust Multi-Modal Prostate MRI Segmentation
cs.CV 2026-04 unverdicted novelty 5.0

MIGF improves multi-modal prostate MRI segmentation robustness via modality-isolated streams and dropout training, yielding ranking score gains of 2.8-13.4% across backbones and better tolerance to degraded diffusion ...
Why Invariance is Not Enough for Biomedical Domain Generalization and How to Fix It
eess.IV 2026-04 unverdicted novelty 5.0

MaskGen improves domain generalization for biomedical image segmentation by using source intensities plus domain-stable foundation model representations with minimal added complexity.
The autoPET3 Challenge: Automated Lesion Segmentation in Whole-Body PET/CT $\unicode{x2013}$ Multitracer Multicenter Generalization
cs.CV 2026-05 unverdicted novelty 4.0

The autoPET3 challenge finds good in-domain lesion segmentation performance in multitracer PET/CT but identifies compositional generalization to unseen tracer-center combinations as an open problem driven by volume ov...
One Sequence to Segment Them All: Efficient Data Augmentation for CT and MRI Cross-Domain 3D Spine Segmentation
cs.CV 2026-05 unverdicted novelty 4.0

Targeted data augmentations let single-sequence 3D spine segmentation models generalize to seven unseen CT and MRI datasets with 155% average Dice gain and almost no in-domain loss.
AMO-ENE: Attention-based Multi-Omics Fusion Model for Outcome Prediction in Extra Nodal Extension and HPV-associated Oropharyngeal Cancer
eess.IV 2026-04 unverdicted novelty 4.0

An attention-based fusion model combining semi-supervised CT segmentation, radiomics, and clinical features predicts metastatic recurrence, overall survival, and disease-free survival in HPV+ oropharyngeal cancer with...
GPAFormer: Graph-guided Patch Aggregation Transformer for Efficient 3D Medical Image Segmentation
cs.CV 2026-04 unverdicted novelty 4.0

GPAFormer with 1.81M parameters reports top Dice scores on BTCV (75.70%), Synapse (81.20%), ACDC (89.32%), and BraTS (82.74%) while running inference in under one second on consumer GPUs.
PR3DICTR: A modular AI framework for medical 3D image-based detection and outcome prediction
cs.CV 2026-04 unverdicted novelty 4.0

PR3DICTR is a new open-access modular framework for 3D medical image classification and outcome prediction that works with as little as two lines of code.
Dante: An Open Source Model Pre-Training and Fine-Tuning Tool for the Dafne Federated Framework for Medical Image Segmentation
eess.IV 2026-05 unverdicted novelty 3.0

Dante is a new open-source backend for the Dafne ecosystem that implements configurable training from scratch, layer freezing, and channel-wise LoRA for medical image segmentation, with validation showing faster conve...

Reference graph

Works this paper leans on

36 extracted references · 36 canonical work pages · cited by 23 Pith papers · 2 internal anchors

[1]

Kraljevic, Z. et al. Multi-domain clinical natural language processing with medcat: The medical concept annotation toolkit. Artiﬁcial Intelligence in Medicine 117, 102083 (2021). URL https://www.sciencedirect.com/ science/article/pii/S0933365721000762. https://doi.org/https://doi.org/ 10.1016/j.artmed.2021.102083

work page doi:10.1016/j.artmed.2021.102083 2021
[2]

Brzezicki, M. A. et al. Artiﬁcial intelligence outperforms human stu- dents in conducting neurosurgical audits. Clinical Neurology and Neu- rosurgery 192, 105732 (2020). URL https://www.sciencedirect.com/ science/article/pii/S0303846720300755. https://doi.org/https://doi.org/ 10.1016/j.clineuro.2020.105732

work page doi:10.1016/j.clineuro.2020.105732 2020
[3]

& Nachev, P

Nelson, A., Herron, D., Rees, G. & Nachev, P. Predicting scheduled hospital attendance with artiﬁcial intelligence. npj Digital Medicine 2 (1), 26 (2019). URL https://doi.org/10.1038/s41746-019-0103-3. https: //doi.org/10.1038/s41746-019-0103-3

work page doi:10.1038/s41746-019-0103-3 2019
[4]

inconvenient truth

Panch, T., Mattie, H. & Celi, L. A. The “inconvenient truth”about ai in healthcare. npj Digital Medicine 2 (1), 77 (2019). URL https://doi.org/ 10.1038/s41746-019-0155-4. https://doi.org/10.1038/s41746-019-0155-4

work page doi:10.1038/s41746-019-0155-4 2019
[5]

Abadi, M. et al. Tensorﬂow: A system for large-scale machine learning (2016). URL https://www.usenix.org/system/ﬁles/conference/osdi16/ osdi16-abadi.pdf

work page 2016
[6]

Paszke, A. et al. Pytorch: An imperative style, high-performance deep learning library (2019). URL https://arxiv.org/abs/1912.01703

work page internal anchor Pith review Pith/arXiv arXiv 2019
[7]

Chen, T. et al. Mxnet: A ﬂexible and eﬃcient machine learning library for heterogeneous distributed systems (2015). URL https://arxiv.org/abs/ Springer Nature 2021 LATEX template MONAI: An open-source framework for deep learning in healthcare 23 1512.01274

work page arXiv 2015
[8]

Gibson, E. et al. Niftynet: a deep-learning platform for medical imaging. Computer Methods and Programs in Biomedicine 158, 113– 122 (2018). URL https://www.sciencedirect.com/science/article/pii/ S0169260717311823. https://doi.org/https://doi.org/10.1016/j.cmpb. 2018.01.025

work page doi:10.1016/j.cmpb 2018
[9]

Pawlowski, N. et al. Dltk: State of the art reference implementations for deep learning on medical images (2017). URL https://arxiv.org/abs/ 1711.06853

work page arXiv 2017
[10]

Beers, A. et al. Deepneuro: an open-source deep learning toolbox for neuroimaging. Neuroinformatics 19 (1), 127–140 (2021). https://doi.org/ 10.1007/s12021-020-09477-5

work page doi:10.1007/s12021-020-09477-5 2021
[11]

The state of machine learning frameworks in 2019

He, H. The state of machine learning frameworks in 2019. The Gradient (2019)

work page 2019
[12]

Diaz-Pinto, A. et al. Monai label: A framework for ai-assisted interactive labeling of 3d medical images (2022). URL https://arxiv.org/abs/2203. 12362

work page 2022
[13]

& Marion, C

McCormick, M., Liu, X., Ibanez, L., Jomier, J. & Marion, C. ITK: enabling reproducible research and open science. Frontiers in Neuroinfor- matics 8, 13 (2014). URL https://www.frontiersin.org/article/10.3389/ fninf.2014.00013. https://doi.org/10.3389/fninf.2014.00013

work page doi:10.3389/fninf.2014.00013 2014
[14]

Brett, M. et al. nipy/nibabel: 3.2.1 (2020). URL https://doi.org/10.5281/ zenodo.4295521

work page 2020
[15]

van Kemenade, H. et al. python-pillow/pillow: 8.4.0 (2021). URL https: //doi.org/10.5281/zenodo.5571504

work page doi:10.5281/zenodo.5571504 2021
[16]

Wang, G. et al. Aleatoric uncertainty estimation with test-time aug- mentation for medical image segmentation with convolutional neural networks. Neurocomputing 338, 34–45 (2019). URL https://www. sciencedirect.com/science/article/pii/S0925231219301961. https://doi. org/https://doi.org/10.1016/j.neucom.2019.01.103

work page doi:10.1016/j.neucom.2019.01.103 2019
[17]

& Ourselin, S

P´ erez-Garc´ ıa, F., Sparks, R. & Ourselin, S. Torchio: a python library for eﬃcient loading, preprocessing, augmentation and patch-based sampling of medical images in deep learning. Computer Methods and Programs in Biomedicine 106236 (2021). URL https://www.sciencedirect.com/ science/article/pii/S0169260721003102. https://doi.org/https://doi.org/ 10.101...

work page doi:10.1016/j.cmpb.2021.106236 2021
[18]

& Bradski, G

Riba, E., Mishkin, D., Ponsa, D., Rublee, E. & Bradski, G. Kornia: an open source diﬀerentiable computer vision library for pytorch (2020). URL https://arxiv.org/pdf/1910.02190.pdf

work page arXiv 2020
[19]

Isensee, F. et al. batchgenerators - a python framework for data augmentation (2020). URL https://doi.org/10.5281/zenodo.3632567

work page doi:10.5281/zenodo.3632567 2020
[20]

& Weninger, L

Schock, J., Baumgartner, M. & Weninger, L. Phoenixdl/rising: High- performance diﬀerentiable medical data augmentation. https://github. com/PhoenixDL/rising. Accessed: 2021-12-20

work page 2021
[21]

& Choudhury, R

Lee, G., Bae, G., Zaitlen, B., Kirkham, J. & Choudhury, R. cucim - a gpu image i/o and processing library (2021). URL https://doi.org/10.25080/ majora-1b6fd038-022

work page 2021
[22]

H., Li, W., Vercauteren, T., Ourselin, S

Sudre, C. H., Li, W., Vercauteren, T., Ourselin, S. & Jorge Cardoso, M. Cardoso, M. J. et al. (eds) Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations . (eds Cardoso, M. J. et al.) Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, 240–248 (Springer International Publis...

work page 2017
[23]

Maier-Hein, L. et al. Metrics reloaded: Pitfalls and recommendations for image analysis validation (2022). URL https://arxiv.org/abs/2206.01653

work page arXiv 2022
[24]

Reinke, A. et al. Common limitations of image processing metrics: A picture story (2021). URL https://arxiv.org/abs/2104.05642

work page arXiv 2021
[25]

& Brox, T

Ronneberger, O., Fischer, P. & Brox, T. in U-Net: Convolutional Net- works for Biomedical Image Segmentation (eds Navab, N., Hornegger, J., Wells III, W. M. & Frangi, A. F.) MICCAI 2015 , Vol. 9351 of LNCS 234–241 (Springer, 2015)

work page 2015
[26]

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

He, K., Zhang, X., Ren, S. & Sun, J. Delving deep into rectiﬁers: Sur- passing human-level performance on imagenet classiﬁcation (2015). URL https://arxiv.org/abs/1502.01852

work page Pith review arXiv 2015
[27]

Gaussian Error Linear Units (GELUs)

Hendrycks, D. & Gimpel, K. Gaussian error linear units (gelus) (2016). URL https://arxiv.org/abs/1606.08415

work page internal anchor Pith review Pith/arXiv arXiv 2016
[28]

& Hochreiter, S

Klambauer, G., Unterthiner, T., Mayr, A. & Hochreiter, S. Self- normalizing neural networks (2017). URL https://arxiv.org/abs/1706. 02515

work page 2017
[29]

& Burges, C

LeCun, Y., Cortes, C. & Burges, C. Mnist handwritten digit database. ATT Labs [Online]. Available: http://yann.lecun.com/exdb/mnist 2 (2010) . Springer Nature 2021 LATEX template MONAI: An open-source framework for deep learning in healthcare 25

work page 2010
[30]

Antonelli, M. et al. The medical segmentation decathlon (2021). URL https://arxiv.org/abs/2106.05735

work page arXiv 2021
[31]

Abadi, M. et al. Tensorﬂow: Large-scale machine learning on heteroge- neous distributed systems (2016). 1603.04467

work page Pith review arXiv 2016
[32]

& Aylward, S

Major, B., McCormick, M. & Aylward, S. Tensorboardplu- gin3d: 3d tensor visualization. https://github.com/KitwareMedical/ tensorboard-plugin-3d (2022)

work page 2022
[33]

Zeiler, M. D. & Fergus, R. Fleet, D., Pajdla, T., Schiele, B. & Tuytelaars, T. (eds) Visualizing and Understanding Convolutional Networks . (eds Fleet, D., Pajdla, T., Schiele, B. & Tuytelaars, T.) Computer Vision – ECCV 2014, 818–833 (Springer International Publishing, Cham, 2014)

work page 2014
[34]

Selvaraju, R. R. et al. Grad-cam: Visual explanations from deep networks via gradient-based localization (2017)

work page 2017
[35]

SmoothGrad: removing noise by adding noise

Smilkov, D., Thorat, N., Kim, B., Vi´ egas, F. & Wattenberg, M. Smooth- grad: removing noise by adding noise (2017). URL https://arxiv.org/abs/ 1706.03825

work page Pith review arXiv 2017
[36]

Fu, Y. et al. DeepReg: a deep learning toolkit for medical image registra- tion. Journal of Open Source Software 5 (55), 2705 (2020). URL https: //doi.org/10.21105/joss.02705. https://doi.org/10.21105/joss.02705

work page doi:10.21105/joss.02705 2020