RadGenome-Anatomy: A Large-Scale Anatomy-Labeled Chest Radiograph Dataset via Physically Grounded Volumetric Projection

Hao Wang; Jinman Kim; Mingyuan Meng; Shuchang Ye; Usman Naseem

arxiv: 2605.17368 · v1 · pith:TKWLUL75new · submitted 2026-05-17 · 💻 cs.CV

RadGenome-Anatomy: A Large-Scale Anatomy-Labeled Chest Radiograph Dataset via Physically Grounded Volumetric Projection

Shuchang Ye , Mingyuan Meng , Hao Wang , Usman Naseem , Jinman Kim This is my paper

Pith reviewed 2026-05-20 13:17 UTC · model grok-4.3

classification 💻 cs.CV

keywords chest radiograph datasetanatomy segmentationvolumetric projectionCT to X-ray projectionmedical image labelssegmentation masksanatomical structuresdiagnostic AI

0 comments

The pith

Projecting 3D CT anatomy masks into 2D creates over 10 million reliable chest radiograph labels across 210 structures.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a large dataset of anatomy labels for chest radiographs by using a projection method from 3D CT scans. Instead of labeling directly on ambiguous 2D X-rays where structures overlap or hide, the authors define structures in 3D space and project their masks onto the 2D plane using standard radiographic geometry. This produces physically accurate 2D masks even for complex cases. A reader would care because such scale and reliability can support better AI models for medical diagnosis and measurement extraction from routine X-rays. The work shows an example by training a model that measures heart enlargement, spine curvature, and scoliosis with accuracies above 89 percent.

Core claim

RadGenome-Anatomy is constructed by projecting large-scale 3D anatomical masks from CT volumes into 2D radiographic space through canonical radiographic geometry, resulting in over 10 million segmentation masks across 210 anatomical structures in 25,692 studies. Each 2D mask represents the physically grounded projected footprint of a volumetrically defined structure, allowing structures that overlap or become partially invisible in radiographs to remain spatially separable in the annotation process.

What carries the argument

Physically grounded volumetric projection of 3D anatomical masks from CT into 2D radiographic space, which allows annotation in volumetric space where overlapping structures stay separable and produces accurate 2D footprints.

If this is right

The dataset enables research on geometric measurements as explicit evidence for chest radiograph interpretation.
Models trained on it can predict structure-specific masks and derive clinical measurements.
It supports high diagnostic accuracies such as 96.4% for cardiomegaly, 95.6% for kyphosis, and 89.2% for scoliosis.
Labels cover structures that are overlapping, partially visible, or difficult to delineate directly on 2D images.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This projection technique might extend to other imaging modalities or body parts where 3D data is available to generate 2D labels.
Large labeled datasets like this could accelerate development of automated diagnostic tools in radiology.
Validation against independent expert annotations on projected images would strengthen confidence in the method's accuracy.

Load-bearing premise

The canonical radiographic geometry used for projection from CT volumes accurately reproduces the true 2D appearance of 3D anatomical structures in chest radiographs, including overlaps and partial visibilities.

What would settle it

A side-by-side comparison of the generated 2D masks with expert-drawn annotations on a held-out set of real chest radiographs, checking for mismatches in boundary placement or missed occluded structures.

Figures

Figures reproduced from arXiv: 2605.17368 by Hao Wang, Jinman Kim, Mingyuan Meng, Shuchang Ye, Usman Naseem.

**Figure 2.** Figure 2: Hierarchical label space and anatomy-mask scale in RadGenome-Anatomy. a. Fourlevel sunburst of the 210 canonical anatomy classes. The innermost ring shows the 10 body-system groups, while the outer rings progressively expand the hierarchy into organ families, anatomical sub-parts, and the final canonical classes (Detailed label hierarchy is provided in Figure A4.). b. Anatomy-mask area for each body-syste… view at source ↗

**Figure 3.** Figure 3: Overview of the volumetric-to-radiographic projection pipeline. A preprocessed 3D CT [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: Per-class Dice performance for selected L3 anatomical classes in PA and LL views (see [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 6.** Figure 6: Quantitative evaluation of mask-derived measurement-based screening for cardiomegaly, [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗

**Figure 5.** Figure 5: Ordinal severity agreement among medical VLMs for cardiomegaly, scoliosis, and kyphosis, measured by quadratic-weighted κ. RadGenome-Anatomy supports fine-grained anatomybased measurement on chest radiographs by providing masks for overlapping structures that are difficult to delineate in 2D. The projected masks convert anatomical geometry into measurable quantities such as structure size, position, curv… view at source ↗

**Figure 7.** Figure 7: Qualitative examples of diagnostic measurements derived from [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗

read the original abstract

Anatomical structure labels for chest radiographs are essential for medical image segmentation and a broad range of downstream diagnostic tasks. However, annotating anatomy directly on 2D chest radiographs is labor-intensive and intrinsically ambiguous, as 3D anatomical structures are projected onto a single 2D plane where boundaries may overlap, be occluded, or appear only partially visible. Consequently, existing anatomy-labeled chest radiograph datasets remain limited in scale, anatomy coverage, and label reliability. To address these limitations, we introduce RadGenome-Anatomy, the largest anatomy-labeled chest radiograph dataset, containing over 10 million segmentation masks across 210 anatomical structures in 25,692 studies. It is constructed by projecting large-scale 3D anatomical masks from CT volumes into 2D radiographic space through canonical radiographic geometry. This shifts annotation from directly tracing uncertain 2D boundaries to defining anatomy in volumetric space, where structures that overlap or become partially invisible in radiographs remain spatially separable. As a result, each 2D mask represents the physically grounded projected footprint of a volumetrically defined structure. The scale and broad anatomical coverage of RadGenome-Anatomy, including structures that are overlapping, partially visible, or difficult to delineate directly, enable research on geometric measurements as explicit evidence for chest radiograph interpretation. We demonstrate this by training XAnatomy to predict structure-specific masks and derive clinically relevant measurements, achieving diagnostic accuracies of 96.4%, 95.6%, and 89.2% for cardiomegaly, kyphosis, and scoliosis, respectively.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper builds a large chest X-ray anatomy dataset by projecting CT masks into 2D but provides no quantitative check on whether those projections match real radiographs.

read the letter

The main point is a new dataset called RadGenome-Anatomy with over 10 million masks across 210 structures from 25k studies, created by taking 3D CT anatomy labels and projecting them down using standard radiographic geometry. This moves the hard labeling work into volumetric space where overlapping or hidden structures stay separable, then produces 2D footprints that are claimed to be physically grounded. They also train a model called XAnatomy on it and report decent numbers for deriving measurements like cardiomegaly at 96 percent accuracy. That downstream demo is useful to see. The scale and coverage of tricky anatomy is the clear advance over earlier smaller datasets. The construction pipeline itself reads as coherent and practical for anyone who needs lots of labels without manual 2D tracing. The soft spot is validation. The paper describes the projection step but gives no error metrics, no comparison against expert-drawn masks on real X-rays, and no analysis of boundary accuracy. The stress-test concern lands: CT volumes are typically acquired supine with different breathing and arm positions than upright PA chest radiographs, and the abstract does not mention any pose normalization or registration to close that gap. Systematic viewpoint mismatches could shift projected boundaries for structures whose 3D extent changes under real acquisition geometry. This leaves the reliability of the 10 million labels as an open question rather than a demonstrated fact. The work is aimed at medical imaging researchers who build segmentation or measurement models for chest radiography. Readers who need large training resources for overlapping anatomy would get the most out of it, provided the labels hold up under scrutiny. It deserves peer review because the resource is substantial and the method is straightforward, even though referees will almost certainly ask for validation experiments and clearer details on data availability.

Referee Report

3 major / 2 minor

Summary. The manuscript introduces RadGenome-Anatomy, the largest anatomy-labeled chest radiograph dataset, containing over 10 million segmentation masks across 210 anatomical structures in 25,692 studies. It is constructed by projecting large-scale 3D anatomical masks from CT volumes into 2D radiographic space through canonical radiographic geometry, shifting annotation to volumetric space to handle overlaps and partial visibility. The authors demonstrate utility by training XAnatomy to predict structure-specific masks and derive clinical measurements, reporting diagnostic accuracies of 96.4%, 95.6%, and 89.2% for cardiomegaly, kyphosis, and scoliosis respectively.

Significance. If the projected 2D labels prove accurate, the dataset would represent a substantial advance by providing unprecedented scale and coverage for training segmentation models and enabling geometric measurements as evidence for radiograph interpretation. The volumetric-to-2D projection approach addresses a key limitation of direct 2D annotation for overlapping or partially visible structures.

major comments (3)

[Abstract] Abstract and dataset construction description: The central claim that the 2D masks are 'physically grounded' and reliably represent structures (including overlaps and partial visibility) is not supported by any quantitative validation of projection accuracy, error metrics, or comparison to real 2D annotations or existing datasets.
[Dataset construction] Dataset construction pipeline: The projection uses 'canonical radiographic geometry' without any described registration, pose normalization, or adjustment for systematic differences between CT acquisitions (typically supine) and chest X-ray protocols (upright PA, breathing phase, arm position, source-to-detector distance), which risks systematic boundary errors in the projected masks for structures whose 3D extent varies with viewpoint.
[Experiments / Results] Demonstration experiments: The reported accuracies (96.4% for cardiomegaly etc.) are obtained by training on the projected labels, but without independent validation of label fidelity (e.g., against manual 2D annotations or multi-rater agreement), it is unclear whether performance reflects true anatomical correspondence or projection-induced artifacts.

minor comments (2)

[Abstract] Clarify whether the 25,692 studies correspond to unique patients or include follow-up scans, and provide basic demographics or acquisition parameter statistics for the source CT volumes.
Add a figure or diagram explicitly showing the projection geometry parameters and an example of how overlapping 3D structures map to 2D masks to improve readability of the method.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the thoughtful and constructive comments, which highlight important aspects of validation and methodological detail. We address each major comment point by point below and will incorporate revisions to strengthen the manuscript.

read point-by-point responses

Referee: [Abstract] Abstract and dataset construction description: The central claim that the 2D masks are 'physically grounded' and reliably represent structures (including overlaps and partial visibility) is not supported by any quantitative validation of projection accuracy, error metrics, or comparison to real 2D annotations or existing datasets.

Authors: The physical grounding of the 2D masks follows directly from applying a standard radiographic projection model to 3D volumetric segmentations, which inherently encodes overlaps and partial visibility as the projected footprint of each structure. We acknowledge that the current manuscript does not include quantitative error metrics or direct comparisons to manual 2D annotations. In the revised version we will add a dedicated validation subsection that reports projection accuracy on a subset of cases with available expert 2D annotations, including Dice coefficients and boundary error statistics. revision: yes
Referee: [Dataset construction] Dataset construction pipeline: The projection uses 'canonical radiographic geometry' without any described registration, pose normalization, or adjustment for systematic differences between CT acquisitions (typically supine) and chest X-ray protocols (upright PA, breathing phase, arm position, source-to-detector distance), which risks systematic boundary errors in the projected masks for structures whose 3D extent varies with viewpoint.

Authors: The pipeline employs a canonical radiographic geometry with standard source-to-detector distance and projection parameters as described in the methods. We agree that unaccounted differences in patient pose and breathing phase between CT and upright chest radiographs represent a potential source of systematic error. We will expand the dataset construction section with explicit parameter values and add a limitations paragraph discussing these acquisition differences and their possible impact on boundary accuracy for deformable structures. revision: partial
Referee: [Experiments / Results] Demonstration experiments: The reported accuracies (96.4% for cardiomegaly etc.) are obtained by training on the projected labels, but without independent validation of label fidelity (e.g., against manual 2D annotations or multi-rater agreement), it is unclear whether performance reflects true anatomical correspondence or projection-induced artifacts.

Authors: The reported diagnostic accuracies demonstrate downstream utility of the projected labels for deriving geometric measurements. We concur that an independent check against manual 2D annotations would help separate true anatomical fidelity from projection artifacts. In the revision we will include a limited validation experiment on a held-out subset, comparing model predictions and projected labels against radiologist-drawn 2D masks and reporting agreement metrics. revision: yes

Circularity Check

0 steps flagged

Dataset construction via external CT projection is self-contained with no circular reduction

full rationale

The paper's central claim is the generation of 2D anatomy masks by projecting 3D CT-derived masks into radiographic space using canonical geometry. This is a direct forward transformation from independent volumetric inputs; no equations, fitted parameters, or self-citations are shown that would make the output equivalent to the inputs by construction. The method relies on external CT volumes and standard projection operators rather than any self-referential definition or renamed empirical fit. The derivation chain is therefore non-circular and externally grounded.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The approach rests on standard assumptions from medical imaging physics and CT segmentation quality; no new free parameters or invented entities are introduced in the abstract description.

axioms (1)

domain assumption Canonical radiographic geometry accurately maps 3D CT structures to 2D projections while preserving separability for overlapping anatomy.
Invoked when shifting annotation from uncertain 2D boundaries to volumetric definitions.

pith-pipeline@v0.9.0 · 5826 in / 1335 out tokens · 68311 ms · 2026-05-20T13:17:39.302264+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

51 extracted references · 51 canonical work pages · 4 internal anchors

[1]

Hybrid intelligence in medical image segmentation.Scientific Reports, 15(1):41200, 2025

Namia Mohamed Ali, Solomon Sunday Oyelere, Nitya Jitani, Rosy Sarmah, and Simon Andrew. Hybrid intelligence in medical image segmentation.Scientific Reports, 15(1):41200, 2025

work page 2025
[2]

MAIRA-2: grounded radiology report generation.arXiv preprint arXiv:2406.04449,

Shruthi Bannur, Kenza Bouzid, Daniel C Castro, Anton Schwaighofer, Anja Thieme, Sam Bond-Taylor, Maximilian Ilse, Fernando Pérez-García, Valentina Salvatelli, Harshita Sharma, et al. Maira-2: Grounded radiology report generation.arXiv preprint arXiv:2406.04449, 2024

work page arXiv 2024
[3]

Towards robust computation of cardiothoracic ratio from chest x-ray

Matilde Bodritti, Adriyana Danudibroto, and Jan Aelterman. Towards robust computation of cardiothoracic ratio from chest x-ray. InMedical Imaging with Deep Learning (MIDL), 2023

work page 2023
[4]

Swin-unet: Unet-like pure transformer for medical image segmentation

Hu Cao, Yueyue Wang, Joy Chen, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian, and Manning Wang. Swin-unet: Unet-like pure transformer for medical image segmentation. InEuropean conference on computer vision, pages 205–218. Springer, 2022

work page 2022
[5]

TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

Jieneng Chen, Yongyi Lu, Qihang Yu, Xiangde Luo, Ehsan Adeli, Yan Wang, Le Lu, Alan L Yuille, and Yuyin Zhou. Transunet: Transformers make strong encoders for medical image segmentation.arXiv preprint arXiv:2102.04306, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2021
[6]

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Junying Chen, Zhenyang Cai, Ke Ji, Xidong Wang, Wanlong Liu, Rongsheng Wang, Jianye Hou, and Benyou Wang. Huatuogpt-o1, towards medical complex reasoning with llms.arXiv preprint arXiv:2412.18925, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[7]

Encoder-decoder with atrous separable convolution for semantic image segmentation

Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, and Hartwig Adam. Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European conference on computer vision (ECCV), pages 801–818, 2018

work page 2018
[8]

Masked-attention mask transformer for universal image segmentation

Bowen Cheng, Ishan Misra, Alexander G Schwing, Alexander Kirillov, and Rohit Girdhar. Masked-attention mask transformer for universal image segmentation. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 1290–1299, 2022

work page 2022
[9]

Monai label: A framework for ai-assisted interactive labeling of 3d medical images.Medical Image Analysis, 95:103207, 2024

Andres Diaz-Pinto, Sachidanand Alle, Vishwesh Nath, Yucheng Tang, Alvin Ihsani, Muhammad Asad, Fernando Pérez-García, Pritesh Mehta, Wenqi Li, Mona Flores, et al. Monai label: A framework for ai-assisted interactive labeling of 3d medical images.Medical Image Analysis, 95:103207, 2024

work page 2024
[10]

Measures of the amount of ecologic association between species.Ecology, 26:297–302, 1945

Lee Raymond Dice. Measures of the amount of ecologic association between species.Ecology, 26:297–302, 1945. URLhttps://api.semanticscholar.org/CorpusID:53335638

work page 1945
[11]

Milone, and Enzo Ferrante

Nicolás Gaggion, Candelaria Mosquera, Lucas Mansilla, Julia Mariel Saidman, Martina Aineseder, Diego H. Milone, and Enzo Ferrante. Chexmask: A large-scale dataset of anatomical segmentation masks for multi-center chest x-ray images.Scientific Data, 11(1):511, 2024. doi: 10.1038/s41597-024-03358-1

work page doi:10.1038/s41597-024-03358-1 2024
[12]

Chexmask: a large-scale dataset of anatomical segmentation masks for multi-center chest x-ray images.Scientific Data, 11(1):511, 2024

Nicolás Gaggion, Candelaria Mosquera, Lucas Mansilla, Julia Mariel Saidman, Martina Aineseder, Diego H Milone, and Enzo Ferrante. Chexmask: a large-scale dataset of anatomical segmentation masks for multi-center chest x-ray images.Scientific Data, 11(1):511, 2024

work page 2024
[13]

arXiv preprint arXiv:2403.17834 , year=

Ibrahim Ethem Hamamci, Sezgin Er, Chenyu Wang, Furkan Almas, Ayse Gulnihan Simsek, Sevval Nil Esirgun, Irem Dogan, Omer Faruk Durugol, Benjamin Hou, Suprosanna Shit, et al. Developing generalist foundation models from a multimodal dataset for 3d computed tomography.arXiv preprint arXiv:2403.17834, 2024

work page arXiv 2024
[14]

Sánchez, Pim A

Laurens Hogeweg, Clara I. Sánchez, Pim A. de Jong, and Mathilde van Ginneken. Clavicle segmentation in chest radiographs.Medical Image Analysis, 16(8):1490–1502, 2012. doi: 10.1016/j.media.2012.06.009

work page doi:10.1016/j.media.2012.06.009 2012
[15]

nnu-net: a self-configuring method for deep learning-based biomedical image segmentation.Nature methods, 18(2):203–211, 2021

Fabian Isensee, Paul F Jaeger, Simon AA Kohl, Jens Petersen, and Klaus H Maier-Hein. nnu-net: a self-configuring method for deep learning-based biomedical image segmentation.Nature methods, 18(2):203–211, 2021

work page 2021
[16]

Distribution de la flore alpine dans le bassin des dranses et dans quelques régions voisines.Bull Soc Vaudoise Sci Nat, 37:241–272, 1901

Paul Jaccard. Distribution de la flore alpine dans le bassin des dranses et dans quelques régions voisines.Bull Soc Vaudoise Sci Nat, 37:241–272, 1901. 10

work page 1901
[17]

Two public chest x-ray datasets for computer-aided screening of pulmonary diseases

Stefan Jaeger, Sema Candemir, Sameer Antani, Yì-Xiáng J Wáng, Pu-Xuan Lu, and George Thoma. Two public chest x-ray datasets for computer-aided screening of pulmonary diseases. Quantitative imaging in medicine and surgery, 4(6):475, 2014

work page 2014
[18]

Cardionet: Automatic semantic segmentation to calculate the cardiothoracic ratio for cardiomegaly and other chest diseases.Journal of Personalized Medicine, 12(6):988,

Abbas Jafar, Muhammad Talha Hameed, Nadeem Akram, Umer Waqas, Hyung Seok Kim, and Rizwan Ali Naqvi. Cardionet: Automatic semantic segmentation to calculate the cardiothoracic ratio for cardiomegaly and other chest diseases.Journal of Personalized Medicine, 12(6):988,

work page
[19]

doi: 10.3390/jpm12060988

work page doi:10.3390/jpm12060988
[20]

Alistair E. W. Johnson, Tom J. Pollard, Nathaniel R. Greenbaum, Matthew P. Lungren, Chih ying Deng, Yifan Peng, Zhiyong Lu, Roger G. Mark, Seth J. Berkowitz, and Steven Horng. Mimic-cxr, a de-identified publicly available database of chest radiographs with free-text reports. Scientific Data, 6:317, 2019. doi: 10.1038/s41597-019-0322-0

work page doi:10.1038/s41597-019-0322-0 2019
[21]

Pointrend: Image segmentation as rendering

Alexander Kirillov, Yuxin Wu, Kaiming He, and Ross Girshick. Pointrend: Image segmentation as rendering. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 9799–9808, 2020

work page 2020
[22]

Segment anything

Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C Berg, Wan-Yen Lo, et al. Segment anything. In Proceedings of the IEEE/CVF international conference on computer vision, pages 4015–4026, 2023

work page 2023
[23]

Med-r1: Reinforcement learning for generalizable medical reasoning in vision-language models.IEEE Transactions on Medical Imaging, 2026

Yuxiang Lai, Jike Zhong, Ming Li, Shitian Zhao, Yuheng Li, Konstantinos Psounis, and Xiaofeng Yang. Med-r1: Reinforcement learning for generalizable medical reasoning in vision-language models.IEEE Transactions on Medical Imaging, 2026

work page 2026
[24]

U-kan makes strong backbone for medical image segmentation and generation

Chenxin Li, Xinyu Liu, Wuyang Li, Cheng Wang, Hengyu Liu, Yifan Liu, Zhen Chen, and Yixuan Yuan. U-kan makes strong backbone for medical image segmentation and generation. InProceedings of the AAAI conference on artificial intelligence, volume 39, pages 4652–4660, 2025

work page 2025
[25]

Llava-med: Training a large language-and-vision assistant for biomedicine in one day.Advances in Neural Information Processing Systems, 36: 28541–28564, 2023

Chunyuan Li, Cliff Wong, Sheng Zhang, Naoto Usuyama, Haotian Liu, Jianwei Yang, Tristan Naumann, Hoifung Poon, and Jianfeng Gao. Llava-med: Training a large language-and-vision assistant for biomedicine in one day.Advances in Neural Information Processing Systems, 36: 28541–28564, 2023

work page 2023
[26]

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

Jun Ma, Feifei Li, and Bo Wang. U-mamba: Enhancing long-range dependency for biomedical image segmentation.arXiv preprint arXiv:2401.04722, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[27]

Anatomy-driven pathology detection on chest x-rays

Philip Müller, Felix Meissen, Johannes Brandt, Georgios Kaissis, and Daniel Rueckert. Anatomy-driven pathology detection on chest x-rays. InMedical Image Computing and Computer Assisted Intervention – MICCAI 2023, pages 57–66, 2023

work page 2023
[28]

Vindr-ribcxr: A benchmark dataset for automatic segmentation and labeling of individual ribs on chest x-rays.arXiv preprint arXiv:2107.01327, 2021

Hoang C Nguyen, Tung T Le, Hieu H Pham, and Ha Q Nguyen. Vindr-ribcxr: A benchmark dataset for automatic segmentation and labeling of individual ribs on chest x-rays.arXiv preprint arXiv:2107.01327, 2021

work page arXiv 2021
[29]

Error metrics for quantitative evaluation of medical image segmentation

Wiro J Niessen, Carolien J Bouma, Koen L Vincken, and Max A Viergever. Error metrics for quantitative evaluation of medical image segmentation. InPerformance characterization in computer vision, pages 275–284. Springer, 2000

work page 2000
[30]

Deep learning to achieve clinically applicable segmentation of head and neck anatomy for radiotherapy.arXiv preprint arXiv:1809.04430, 2018

Stanislav Nikolov, Sam Blackwell, Alexei Zverovitch, Ruheena Mendes, Michelle Livne, Jeffrey De Fauw, Yojan Patel, Clemens Meyer, Harry Askham, Bernardino Romera-Paredes, et al. Deep learning to achieve clinically applicable segmentation of head and neck anatomy for radiotherapy.arXiv preprint arXiv:1809.04430, 2018

work page arXiv 2018
[31]

Beer–lambert law for optical tissue diagnostics: current state of the art and the main limitations.Journal of biomedical optics, 26(10):100901–100901, 2021

Ilze Oshina and Janis Spigulis. Beer–lambert law for optical tissue diagnostics: current state of the art and the main limitations.Journal of biomedical optics, 26(10):100901–100901, 2021

work page 2021
[32]

S-sam: Svd-based fine-tuning of segment anything model for medical image segmentation

Jay N Paranjape, Shameema Sikder, S Swaroop Vedula, and Vishal M Patel. S-sam: Svd-based fine-tuning of segment anything model for medical image segmentation. InInternational Conference on Medical Image Computing and Computer-Assisted Intervention, pages 720–730. Springer, 2024. 11

work page 2024
[33]

U-net: Convolutional networks for biomedical image segmentation

Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional networks for biomedical image segmentation. InInternational Conference on Medical image computing and computer-assisted intervention, pages 234–241. Springer, 2015

work page 2015
[34]

Vm-unet: Vision mamba unet for medical image segmentation.ACM Transactions on Multimedia Computing, Communications and Applications, 2024

Jiacheng Ruan, Jincheng Li, and Suncheng Xiang. Vm-unet: Vision mamba unet for medical image segmentation.ACM Transactions on Multimedia Computing, Communications and Applications, 2024

work page 2024
[35]

Saquib Sarfraz, Matthias A

Constantin Seibold, Simon Reiss, M. Saquib Sarfraz, Matthias A. Fink, Victoria Mayer, Jan Sellner, Moon Sung Kim, Klaus H. Maier-Hein, Jens Kleesiek, and Rainer Stiefelhagen. De- tailed annotations of chest x-rays via ct projection for report understanding.arXiv preprint arXiv:2210.03416, 2022

work page arXiv 2022
[36]

Fink, Moon Kim, Simon Reiss, Ken Herrmann, Jens Kleesiek, and Rainer Stiefelhagen

Constantin Seibold, Alexander Jaus, Matthias A. Fink, Moon Kim, Simon Reiss, Ken Herrmann, Jens Kleesiek, and Rainer Stiefelhagen. Accurate fine-grained segmentation of human anatomy in radiographs via volumetric pseudo-labeling.arXiv preprint arXiv:2306.03934, 2023

work page arXiv 2023
[37]

MedGemma Technical Report

Andrew Sellergren, Sahar Kazemzadeh, Tiam Jaroensri, Atilla Kiraly, Madeleine Traverse, Timo Kohlberger, Shawn Xu, Fayaz Jamil, Cían Hughes, Charles Lau, et al. Medgemma technical report.arXiv preprint arXiv:2507.05201, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025
[38]

Junji Shiraishi, Shigehiko Katsuragawa, Junpei Ikezoe, Tsuneo Matsumoto, Takeshi Kobayashi, Ken-ichi Komatsu, Mitate Matsui, Hiroshi Fujita, Yoshie Kodera, and Kunio Doi. Development of a digital image database for chest radiographs with and without a lung nodule: receiver operating characteristic analysis of radiologists’ detection of pulmonary nodules.A...

work page 2000
[39]

Metrics for evaluating 3d medical image segmentation: analysis, selection, and tool.BMC medical imaging, 15(1):29, 2015

Abdel Aziz Taha and Allan Hanbury. Metrics for evaluating 3d medical image segmentation: analysis, selection, and tool.BMC medical imaging, 15(1):29, 2015

work page 2015
[40]

Interactive and explainable region-guided radiology report generation

Tim Tanida, Philip Müller, Georgios Kaissis, and Daniel Rueckert. Interactive and explainable region-guided radiology report generation. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023

work page 2023
[41]

Unext: Mlp-based rapid medical image segmentation network

Jeya Maria Jose Valanarasu and Vishal M Patel. Unext: Mlp-based rapid medical image segmentation network. InInternational conference on medical image computing and computer- assisted intervention, pages 23–33. Springer, 2022

work page 2022
[42]

Stegmann, and Marco Loog

Bram van Ginneken, Mikkel B. Stegmann, and Marco Loog. Segmentation of anatomical structures in chest radiographs using supervised methods: A comparative study on a public database.Medical Image Analysis, 10(1):19–40, 2006. doi: 10.1016/j.media.2005.02.002

work page doi:10.1016/j.media.2005.02.002 2006
[43]

Seaformer: Squeeze-enhanced axial transformer for mobile semantic segmentation

Qiang Wan, Zilong Huang, Jiachen Lu, Gang Yu, and Li Zhang. Seaformer: Squeeze-enhanced axial transformer for mobile semantic segmentation. InThe eleventh international conference on learning representations, 2023

work page 2023
[44]

Medical sam adapter: Adapting segment anything model for medical image segmentation

Junde Wu, Ziyue Wang, Mingxuan Hong, Wei Ji, Huazhu Fu, Yanwu Xu, Min Xu, and Yueming Jin. Medical sam adapter: Adapting segment anything model for medical image segmentation. Medical image analysis, 102:103547, 2025

work page 2025
[45]

Unified perceptual parsing for scene understanding

Tete Xiao, Yingcheng Liu, Bolei Zhou, Yuning Jiang, and Jian Sun. Unified perceptual parsing for scene understanding. InProceedings of the European conference on computer vision (ECCV), pages 418–434, 2018

work page 2018
[46]

Segformer: Simple and efficient design for semantic segmentation with transformers.Advances in neural information processing systems, 34:12077–12090, 2021

Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M Alvarez, and Ping Luo. Segformer: Simple and efficient design for semantic segmentation with transformers.Advances in neural information processing systems, 34:12077–12090, 2021

work page 2021
[47]

Pidnet: A real-time semantic segmentation network inspired by pid controllers

Jiacong Xu, Zixiang Xiong, and Shankar P Bhattacharyya. Pidnet: A real-time semantic segmentation network inspired by pid controllers. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 19529–19539, 2023. 12

work page 2023
[48]

Topformer: Token pyramid transformer for mobile semantic segmentation

Wenqiang Zhang, Zilong Huang, Guozhong Luo, Tao Chen, Xinggang Wang, Wenyu Liu, Gang Yu, and Chunhua Shen. Topformer: Token pyramid transformer for mobile semantic segmentation. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12083–12093, 2022

work page 2022
[49]

Radgenome-chest ct: A grounded vision-language dataset for chest ct analysis.arXiv preprint arXiv:2404.16754, 2024

Xiaoman Zhang, Chaoyi Wu, Ziheng Zhao, Jiayu Lei, Ya Zhang, Yanfeng Wang, and Weidi Xie. Radgenome-chest ct: A grounded vision-language dataset for chest ct analysis.arXiv preprint arXiv:2404.16754, 2024

work page arXiv 2024
[50]

Large-vocabulary segmentation for medical images with text prompts.NPJ Digital Medicine, 8(1):566, 2025

Ziheng Zhao, Yao Zhang, Chaoyi Wu, Xiaoman Zhang, Xiao Zhou, Ya Zhang, Yanfeng Wang, and Weidi Xie. Large-vocabulary segmentation for medical images with text prompts.NPJ Digital Medicine, 8(1):566, 2025

work page 2025
[51]

# !# $"# #$ $! $ !$

Zongwei Zhou, Md Mahfuzur Rahman Siddiquee, Nima Tajbakhsh, and Jianming Liang. Unet++: A nested u-net architecture for medical image segmentation. InInternational workshop on deep learning in medical image analysis, pages 3–11. Springer, 2018. 13 A Radiographic Transferability and Label Quality We assess whether anatomy supervision fromRadGenome-Anatomyt...

work page 2018

[1] [1]

Hybrid intelligence in medical image segmentation.Scientific Reports, 15(1):41200, 2025

Namia Mohamed Ali, Solomon Sunday Oyelere, Nitya Jitani, Rosy Sarmah, and Simon Andrew. Hybrid intelligence in medical image segmentation.Scientific Reports, 15(1):41200, 2025

work page 2025

[2] [2]

MAIRA-2: grounded radiology report generation.arXiv preprint arXiv:2406.04449,

Shruthi Bannur, Kenza Bouzid, Daniel C Castro, Anton Schwaighofer, Anja Thieme, Sam Bond-Taylor, Maximilian Ilse, Fernando Pérez-García, Valentina Salvatelli, Harshita Sharma, et al. Maira-2: Grounded radiology report generation.arXiv preprint arXiv:2406.04449, 2024

work page arXiv 2024

[3] [3]

Towards robust computation of cardiothoracic ratio from chest x-ray

Matilde Bodritti, Adriyana Danudibroto, and Jan Aelterman. Towards robust computation of cardiothoracic ratio from chest x-ray. InMedical Imaging with Deep Learning (MIDL), 2023

work page 2023

[4] [4]

Swin-unet: Unet-like pure transformer for medical image segmentation

Hu Cao, Yueyue Wang, Joy Chen, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian, and Manning Wang. Swin-unet: Unet-like pure transformer for medical image segmentation. InEuropean conference on computer vision, pages 205–218. Springer, 2022

work page 2022

[5] [5]

TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

Jieneng Chen, Yongyi Lu, Qihang Yu, Xiangde Luo, Ehsan Adeli, Yan Wang, Le Lu, Alan L Yuille, and Yuyin Zhou. Transunet: Transformers make strong encoders for medical image segmentation.arXiv preprint arXiv:2102.04306, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2021

[6] [6]

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Junying Chen, Zhenyang Cai, Ke Ji, Xidong Wang, Wanlong Liu, Rongsheng Wang, Jianye Hou, and Benyou Wang. Huatuogpt-o1, towards medical complex reasoning with llms.arXiv preprint arXiv:2412.18925, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[7] [7]

Encoder-decoder with atrous separable convolution for semantic image segmentation

Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, and Hartwig Adam. Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European conference on computer vision (ECCV), pages 801–818, 2018

work page 2018

[8] [8]

Masked-attention mask transformer for universal image segmentation

Bowen Cheng, Ishan Misra, Alexander G Schwing, Alexander Kirillov, and Rohit Girdhar. Masked-attention mask transformer for universal image segmentation. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 1290–1299, 2022

work page 2022

[9] [9]

Monai label: A framework for ai-assisted interactive labeling of 3d medical images.Medical Image Analysis, 95:103207, 2024

Andres Diaz-Pinto, Sachidanand Alle, Vishwesh Nath, Yucheng Tang, Alvin Ihsani, Muhammad Asad, Fernando Pérez-García, Pritesh Mehta, Wenqi Li, Mona Flores, et al. Monai label: A framework for ai-assisted interactive labeling of 3d medical images.Medical Image Analysis, 95:103207, 2024

work page 2024

[10] [10]

Measures of the amount of ecologic association between species.Ecology, 26:297–302, 1945

Lee Raymond Dice. Measures of the amount of ecologic association between species.Ecology, 26:297–302, 1945. URLhttps://api.semanticscholar.org/CorpusID:53335638

work page 1945

[11] [11]

Milone, and Enzo Ferrante

Nicolás Gaggion, Candelaria Mosquera, Lucas Mansilla, Julia Mariel Saidman, Martina Aineseder, Diego H. Milone, and Enzo Ferrante. Chexmask: A large-scale dataset of anatomical segmentation masks for multi-center chest x-ray images.Scientific Data, 11(1):511, 2024. doi: 10.1038/s41597-024-03358-1

work page doi:10.1038/s41597-024-03358-1 2024

[12] [12]

Chexmask: a large-scale dataset of anatomical segmentation masks for multi-center chest x-ray images.Scientific Data, 11(1):511, 2024

Nicolás Gaggion, Candelaria Mosquera, Lucas Mansilla, Julia Mariel Saidman, Martina Aineseder, Diego H Milone, and Enzo Ferrante. Chexmask: a large-scale dataset of anatomical segmentation masks for multi-center chest x-ray images.Scientific Data, 11(1):511, 2024

work page 2024

[13] [13]

arXiv preprint arXiv:2403.17834 , year=

Ibrahim Ethem Hamamci, Sezgin Er, Chenyu Wang, Furkan Almas, Ayse Gulnihan Simsek, Sevval Nil Esirgun, Irem Dogan, Omer Faruk Durugol, Benjamin Hou, Suprosanna Shit, et al. Developing generalist foundation models from a multimodal dataset for 3d computed tomography.arXiv preprint arXiv:2403.17834, 2024

work page arXiv 2024

[14] [14]

Sánchez, Pim A

Laurens Hogeweg, Clara I. Sánchez, Pim A. de Jong, and Mathilde van Ginneken. Clavicle segmentation in chest radiographs.Medical Image Analysis, 16(8):1490–1502, 2012. doi: 10.1016/j.media.2012.06.009

work page doi:10.1016/j.media.2012.06.009 2012

[15] [15]

nnu-net: a self-configuring method for deep learning-based biomedical image segmentation.Nature methods, 18(2):203–211, 2021

Fabian Isensee, Paul F Jaeger, Simon AA Kohl, Jens Petersen, and Klaus H Maier-Hein. nnu-net: a self-configuring method for deep learning-based biomedical image segmentation.Nature methods, 18(2):203–211, 2021

work page 2021

[16] [16]

Distribution de la flore alpine dans le bassin des dranses et dans quelques régions voisines.Bull Soc Vaudoise Sci Nat, 37:241–272, 1901

Paul Jaccard. Distribution de la flore alpine dans le bassin des dranses et dans quelques régions voisines.Bull Soc Vaudoise Sci Nat, 37:241–272, 1901. 10

work page 1901

[17] [17]

Two public chest x-ray datasets for computer-aided screening of pulmonary diseases

Stefan Jaeger, Sema Candemir, Sameer Antani, Yì-Xiáng J Wáng, Pu-Xuan Lu, and George Thoma. Two public chest x-ray datasets for computer-aided screening of pulmonary diseases. Quantitative imaging in medicine and surgery, 4(6):475, 2014

work page 2014

[18] [18]

Cardionet: Automatic semantic segmentation to calculate the cardiothoracic ratio for cardiomegaly and other chest diseases.Journal of Personalized Medicine, 12(6):988,

Abbas Jafar, Muhammad Talha Hameed, Nadeem Akram, Umer Waqas, Hyung Seok Kim, and Rizwan Ali Naqvi. Cardionet: Automatic semantic segmentation to calculate the cardiothoracic ratio for cardiomegaly and other chest diseases.Journal of Personalized Medicine, 12(6):988,

work page

[19] [19]

doi: 10.3390/jpm12060988

work page doi:10.3390/jpm12060988

[20] [20]

Alistair E. W. Johnson, Tom J. Pollard, Nathaniel R. Greenbaum, Matthew P. Lungren, Chih ying Deng, Yifan Peng, Zhiyong Lu, Roger G. Mark, Seth J. Berkowitz, and Steven Horng. Mimic-cxr, a de-identified publicly available database of chest radiographs with free-text reports. Scientific Data, 6:317, 2019. doi: 10.1038/s41597-019-0322-0

work page doi:10.1038/s41597-019-0322-0 2019

[21] [21]

Pointrend: Image segmentation as rendering

Alexander Kirillov, Yuxin Wu, Kaiming He, and Ross Girshick. Pointrend: Image segmentation as rendering. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 9799–9808, 2020

work page 2020

[22] [22]

Segment anything

Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C Berg, Wan-Yen Lo, et al. Segment anything. In Proceedings of the IEEE/CVF international conference on computer vision, pages 4015–4026, 2023

work page 2023

[23] [23]

Med-r1: Reinforcement learning for generalizable medical reasoning in vision-language models.IEEE Transactions on Medical Imaging, 2026

Yuxiang Lai, Jike Zhong, Ming Li, Shitian Zhao, Yuheng Li, Konstantinos Psounis, and Xiaofeng Yang. Med-r1: Reinforcement learning for generalizable medical reasoning in vision-language models.IEEE Transactions on Medical Imaging, 2026

work page 2026

[24] [24]

U-kan makes strong backbone for medical image segmentation and generation

Chenxin Li, Xinyu Liu, Wuyang Li, Cheng Wang, Hengyu Liu, Yifan Liu, Zhen Chen, and Yixuan Yuan. U-kan makes strong backbone for medical image segmentation and generation. InProceedings of the AAAI conference on artificial intelligence, volume 39, pages 4652–4660, 2025

work page 2025

[25] [25]

Llava-med: Training a large language-and-vision assistant for biomedicine in one day.Advances in Neural Information Processing Systems, 36: 28541–28564, 2023

Chunyuan Li, Cliff Wong, Sheng Zhang, Naoto Usuyama, Haotian Liu, Jianwei Yang, Tristan Naumann, Hoifung Poon, and Jianfeng Gao. Llava-med: Training a large language-and-vision assistant for biomedicine in one day.Advances in Neural Information Processing Systems, 36: 28541–28564, 2023

work page 2023

[26] [26]

U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation

Jun Ma, Feifei Li, and Bo Wang. U-mamba: Enhancing long-range dependency for biomedical image segmentation.arXiv preprint arXiv:2401.04722, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[27] [27]

Anatomy-driven pathology detection on chest x-rays

Philip Müller, Felix Meissen, Johannes Brandt, Georgios Kaissis, and Daniel Rueckert. Anatomy-driven pathology detection on chest x-rays. InMedical Image Computing and Computer Assisted Intervention – MICCAI 2023, pages 57–66, 2023

work page 2023

[28] [28]

Vindr-ribcxr: A benchmark dataset for automatic segmentation and labeling of individual ribs on chest x-rays.arXiv preprint arXiv:2107.01327, 2021

Hoang C Nguyen, Tung T Le, Hieu H Pham, and Ha Q Nguyen. Vindr-ribcxr: A benchmark dataset for automatic segmentation and labeling of individual ribs on chest x-rays.arXiv preprint arXiv:2107.01327, 2021

work page arXiv 2021

[29] [29]

Error metrics for quantitative evaluation of medical image segmentation

Wiro J Niessen, Carolien J Bouma, Koen L Vincken, and Max A Viergever. Error metrics for quantitative evaluation of medical image segmentation. InPerformance characterization in computer vision, pages 275–284. Springer, 2000

work page 2000

[30] [30]

Deep learning to achieve clinically applicable segmentation of head and neck anatomy for radiotherapy.arXiv preprint arXiv:1809.04430, 2018

Stanislav Nikolov, Sam Blackwell, Alexei Zverovitch, Ruheena Mendes, Michelle Livne, Jeffrey De Fauw, Yojan Patel, Clemens Meyer, Harry Askham, Bernardino Romera-Paredes, et al. Deep learning to achieve clinically applicable segmentation of head and neck anatomy for radiotherapy.arXiv preprint arXiv:1809.04430, 2018

work page arXiv 2018

[31] [31]

Beer–lambert law for optical tissue diagnostics: current state of the art and the main limitations.Journal of biomedical optics, 26(10):100901–100901, 2021

Ilze Oshina and Janis Spigulis. Beer–lambert law for optical tissue diagnostics: current state of the art and the main limitations.Journal of biomedical optics, 26(10):100901–100901, 2021

work page 2021

[32] [32]

S-sam: Svd-based fine-tuning of segment anything model for medical image segmentation

Jay N Paranjape, Shameema Sikder, S Swaroop Vedula, and Vishal M Patel. S-sam: Svd-based fine-tuning of segment anything model for medical image segmentation. InInternational Conference on Medical Image Computing and Computer-Assisted Intervention, pages 720–730. Springer, 2024. 11

work page 2024

[33] [33]

U-net: Convolutional networks for biomedical image segmentation

Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional networks for biomedical image segmentation. InInternational Conference on Medical image computing and computer-assisted intervention, pages 234–241. Springer, 2015

work page 2015

[34] [34]

Vm-unet: Vision mamba unet for medical image segmentation.ACM Transactions on Multimedia Computing, Communications and Applications, 2024

Jiacheng Ruan, Jincheng Li, and Suncheng Xiang. Vm-unet: Vision mamba unet for medical image segmentation.ACM Transactions on Multimedia Computing, Communications and Applications, 2024

work page 2024

[35] [35]

Saquib Sarfraz, Matthias A

Constantin Seibold, Simon Reiss, M. Saquib Sarfraz, Matthias A. Fink, Victoria Mayer, Jan Sellner, Moon Sung Kim, Klaus H. Maier-Hein, Jens Kleesiek, and Rainer Stiefelhagen. De- tailed annotations of chest x-rays via ct projection for report understanding.arXiv preprint arXiv:2210.03416, 2022

work page arXiv 2022

[36] [36]

Fink, Moon Kim, Simon Reiss, Ken Herrmann, Jens Kleesiek, and Rainer Stiefelhagen

Constantin Seibold, Alexander Jaus, Matthias A. Fink, Moon Kim, Simon Reiss, Ken Herrmann, Jens Kleesiek, and Rainer Stiefelhagen. Accurate fine-grained segmentation of human anatomy in radiographs via volumetric pseudo-labeling.arXiv preprint arXiv:2306.03934, 2023

work page arXiv 2023

[37] [37]

MedGemma Technical Report

Andrew Sellergren, Sahar Kazemzadeh, Tiam Jaroensri, Atilla Kiraly, Madeleine Traverse, Timo Kohlberger, Shawn Xu, Fayaz Jamil, Cían Hughes, Charles Lau, et al. Medgemma technical report.arXiv preprint arXiv:2507.05201, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025

[38] [38]

Junji Shiraishi, Shigehiko Katsuragawa, Junpei Ikezoe, Tsuneo Matsumoto, Takeshi Kobayashi, Ken-ichi Komatsu, Mitate Matsui, Hiroshi Fujita, Yoshie Kodera, and Kunio Doi. Development of a digital image database for chest radiographs with and without a lung nodule: receiver operating characteristic analysis of radiologists’ detection of pulmonary nodules.A...

work page 2000

[39] [39]

Metrics for evaluating 3d medical image segmentation: analysis, selection, and tool.BMC medical imaging, 15(1):29, 2015

Abdel Aziz Taha and Allan Hanbury. Metrics for evaluating 3d medical image segmentation: analysis, selection, and tool.BMC medical imaging, 15(1):29, 2015

work page 2015

[40] [40]

Interactive and explainable region-guided radiology report generation

Tim Tanida, Philip Müller, Georgios Kaissis, and Daniel Rueckert. Interactive and explainable region-guided radiology report generation. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023

work page 2023

[41] [41]

Unext: Mlp-based rapid medical image segmentation network

Jeya Maria Jose Valanarasu and Vishal M Patel. Unext: Mlp-based rapid medical image segmentation network. InInternational conference on medical image computing and computer- assisted intervention, pages 23–33. Springer, 2022

work page 2022

[42] [42]

Stegmann, and Marco Loog

Bram van Ginneken, Mikkel B. Stegmann, and Marco Loog. Segmentation of anatomical structures in chest radiographs using supervised methods: A comparative study on a public database.Medical Image Analysis, 10(1):19–40, 2006. doi: 10.1016/j.media.2005.02.002

work page doi:10.1016/j.media.2005.02.002 2006

[43] [43]

Seaformer: Squeeze-enhanced axial transformer for mobile semantic segmentation

Qiang Wan, Zilong Huang, Jiachen Lu, Gang Yu, and Li Zhang. Seaformer: Squeeze-enhanced axial transformer for mobile semantic segmentation. InThe eleventh international conference on learning representations, 2023

work page 2023

[44] [44]

Medical sam adapter: Adapting segment anything model for medical image segmentation

Junde Wu, Ziyue Wang, Mingxuan Hong, Wei Ji, Huazhu Fu, Yanwu Xu, Min Xu, and Yueming Jin. Medical sam adapter: Adapting segment anything model for medical image segmentation. Medical image analysis, 102:103547, 2025

work page 2025

[45] [45]

Unified perceptual parsing for scene understanding

Tete Xiao, Yingcheng Liu, Bolei Zhou, Yuning Jiang, and Jian Sun. Unified perceptual parsing for scene understanding. InProceedings of the European conference on computer vision (ECCV), pages 418–434, 2018

work page 2018

[46] [46]

Segformer: Simple and efficient design for semantic segmentation with transformers.Advances in neural information processing systems, 34:12077–12090, 2021

Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M Alvarez, and Ping Luo. Segformer: Simple and efficient design for semantic segmentation with transformers.Advances in neural information processing systems, 34:12077–12090, 2021

work page 2021

[47] [47]

Pidnet: A real-time semantic segmentation network inspired by pid controllers

Jiacong Xu, Zixiang Xiong, and Shankar P Bhattacharyya. Pidnet: A real-time semantic segmentation network inspired by pid controllers. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 19529–19539, 2023. 12

work page 2023

[48] [48]

Topformer: Token pyramid transformer for mobile semantic segmentation

Wenqiang Zhang, Zilong Huang, Guozhong Luo, Tao Chen, Xinggang Wang, Wenyu Liu, Gang Yu, and Chunhua Shen. Topformer: Token pyramid transformer for mobile semantic segmentation. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12083–12093, 2022

work page 2022

[49] [49]

Radgenome-chest ct: A grounded vision-language dataset for chest ct analysis.arXiv preprint arXiv:2404.16754, 2024

Xiaoman Zhang, Chaoyi Wu, Ziheng Zhao, Jiayu Lei, Ya Zhang, Yanfeng Wang, and Weidi Xie. Radgenome-chest ct: A grounded vision-language dataset for chest ct analysis.arXiv preprint arXiv:2404.16754, 2024

work page arXiv 2024

[50] [50]

Large-vocabulary segmentation for medical images with text prompts.NPJ Digital Medicine, 8(1):566, 2025

Ziheng Zhao, Yao Zhang, Chaoyi Wu, Xiaoman Zhang, Xiao Zhou, Ya Zhang, Yanfeng Wang, and Weidi Xie. Large-vocabulary segmentation for medical images with text prompts.NPJ Digital Medicine, 8(1):566, 2025

work page 2025

[51] [51]

# !# $"# #$ $! $ !$

Zongwei Zhou, Md Mahfuzur Rahman Siddiquee, Nima Tajbakhsh, and Jianming Liang. Unet++: A nested u-net architecture for medical image segmentation. InInternational workshop on deep learning in medical image analysis, pages 3–11. Springer, 2018. 13 A Radiographic Transferability and Label Quality We assess whether anatomy supervision fromRadGenome-Anatomyt...

work page 2018