pith. machine review for the scientific record. sign in

arxiv: 2506.14432 · v3 · submitted 2025-06-17 · 📡 eess.IV · cs.CV

Recognition: unknown

A large-scale heterogeneous 3D magnetic resonance brain imaging dataset for self-supervised learning

Authors on Pith no claims yet
classification 📡 eess.IV cs.CV
keywords braindatasetimagingself-supervisedfomo260kheterogeneouslarge-scalelearning
0
0 comments X
read the original abstract

We present FOMO260K, a large-scale, heterogeneous dataset of 260,927 brain Magnetic Resonance Imaging (MRI) scans from 77,589 MRI sessions and 55,378 subjects, aggregated from 910 publicly available sources. The dataset includes both clinical- and research-grade images, multiple MRI sequences, and a wide range of anatomical and pathological variability, including scans with large brain anomalies. Minimal preprocessing was applied to preserve the original image characteristics while reducing entry barriers for new users. Companion code for self-supervised pretraining and finetuning is provided, along with pretrained models. FOMO260K is intended to support the development and benchmarking of self-supervised learning methods in medical imaging at scale.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Towards Brain MRI Foundation Models for the Clinic: Findings from the FOMO25 Challenge

    cs.CV 2026-04 conditional novelty 6.0

    Self-supervised pretraining on 60K clinical-style brain MRIs improves out-of-domain generalization on classification, segmentation, and regression tasks, with hybrid objectives and small models showing strong results.