C.; Schwaighofer, A.; Lungren, M

Pérez-García, F · 2024 · arXiv 2401.10815

3 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

baseline 1

citation-polarity summary

baseline 1

representative citing papers

Domain-Specific Latent Representations Improve the Fidelity of Diffusion-Based Medical Image Super-Resolution

cs.CV · 2026-04-14 · accept · novelty 6.0

Replacing the generic Stable Diffusion VAE with the domain-specific MedVAE, pretrained on 1.6M medical images, improves diffusion-based SR PSNR by 2.91-3.29 dB on knee/brain MRI and chest X-ray. The gains appear in fine details, and VAE quality predicts SR performance (R²=0.67).

RA-RRG: Multimodal Retrieval-Augmented Radiology Report Generation with Key Phrase Extraction

cs.CV · 2025-04-10 · unverdicted · novelty 6.0

RA-RRG extracts key phrases with LLMs, retrieves them via multimodal similarity, and conditions report generation on them to achieve SOTA CheXbert scores and competitive RadGraph F1 on MIMIC-CXR and IU X-ray while supporting multi-view inputs.

M4CXR: Exploring Multi-task Potentials of Multi-modal Large Language Models for Chest X-ray Interpretation

cs.CV · 2024-08-29 · unverdicted · novelty 5.0

M4CXR is a multi-modal large language model that performs multiple chest X-ray analysis tasks, including report generation, for which it claims SOTA clinical accuracy using chain-of-thought prompting.

citing papers explorer

Showing 3 of 3 citing papers.

Domain-Specific Latent Representations Improve the Fidelity of Diffusion-Based Medical Image Super-Resolution
cs.CV · 2026-04-14 · accept · none · ref 43

RA-RRG: Multimodal Retrieval-Augmented Radiology Report Generation with Key Phrase Extraction
cs.CV · 2025-04-10 · unverdicted · none · ref 37

M4CXR: Exploring Multi-task Potentials of Multi-modal Large Language Models for Chest X-ray Interpretation
cs.CV · 2024-08-29 · unverdicted · none · ref 38
