Scaling self-supervised and cross-modal pretraining for volumetric CT transformers

Cris Claessens, Christiaan Viviers, Giacomo D’Amicantonio, Egor Bondarev, Fons van der Sommen · arXiv 2511.17209

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

Universal CT Representations from Anatomy to Disease Phenotype through Agglomerative Pretraining

cs.CV · 2026-05-21 · unverdicted · novelty 5.0 · 2 refs

FlexiCT provides CT foundation models, built via agglomerative pretraining on 266,227 volumes from 56 datasets, that match or exceed task-specific models on five task families while organizing embeddings along tumor-stage gradients.

MultiMedVision: Multi-Modal Medical Vision Framework

cs.CV · 2026-05-09 · unverdicted · novelty 5.0

A unified Sparse Vision Transformer learns joint 2D/3D medical image representations via self-supervision and achieves competitive AUROC on chest X-ray and CT benchmarks with 5x less data than modality-specific models.

