SemEnrich: Self-Supervised Semantic Enrichment of Radiology Reports for Vision-Language Learning

· 2026 · cs.LG · arXiv 2604.09887

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Medical vision-language datasets are often limited in size and biased toward negative findings, as clinicians report abnormalities mostly but might omit some positive/neutral findings because they might be considered as irrelevant to the patient's condition. We propose a self-supervised data enrichment method that leverages semantic clustering of report sentences. Then we enrich the findings in the medical reports in the training set by adding positive/neutral observations from different clusters in a self-supervised manner. Our approach yields consistent gains in supervised fine-tuning (5.63%, 3.04%, 7.40%, 5.30%, 7.47% average gains on COMET score, Bert score, Sentence Bleu, CheXbert-F1 and RadGraph-F1 scores respectively). Ablation studies confirm that improvements stem from semantic clustering rather than random augmentation. Furthermore, we introduce a way to incorporate semantic cluster information into the reward design for GRPO training, which leads to further performance gains (2.78%, 3.14%, 12.80% average gains on COMET score, Bert score and Sentence Bleu scores respectively). We share our code at https://anonymous.4open.science/r/SemEnrich-75CF

representative citing papers

Discrete Diffusion Language Models for Interactive Radiology Report Drafting

cs.AI · 2026-07-01 · unverdicted · novelty 6.0

Diffusion LM matches AR performance on medical VQA, runs 3.5-4.4x faster, and enables bidirectional infilling for interactive radiology report drafting.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Discrete Diffusion Language Models for Interactive Radiology Report Drafting cs.AI · 2026-07-01 · unverdicted · none · ref 7 · internal anchor
Diffusion LM matches AR performance on medical VQA, runs 3.5-4.4x faster, and enables bidirectional infilling for interactive radiology report drafting.

SemEnrich: Self-Supervised Semantic Enrichment of Radiology Reports for Vision-Language Learning

fields

years

verdicts

representative citing papers

citing papers explorer