MedCLIP: Contrastive Learning from Unpaired Medical Images and Text.Proc Conf Empir Methods Nat Lang Process

Wang Z, Wu Z, Agarwal D, Sun J · 2022 · DOI 10.18653/v1/2022.emnlp-main.256

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

REVEAL: Multimodal Vision-Language Alignment of Retinal Morphometry and Clinical Risks for Incident AD and Dementia Prediction

cs.CV · 2026-04-20 · unverdicted · novelty 6.0

REVEAL uses vision-language alignment of retinal morphometry and clinical risk narratives plus group contrastive learning to predict AD and dementia about 8 years early.

Representation geometry shapes task performance in vision-language modeling for CT enterography

cs.CV · 2026-04-14 · unverdicted · novelty 6.0

Mean pooling and multi-window RGB encoding optimize vision-language performance on CT enterography, with retrieval-augmented generation substantially improving automated report severity accuracy over fine-tuning alone.

MIRAGE: Retrieval and Generation of Multimodal Images and Texts for Medical Education

cs.CV · 2026-05-06 · unverdicted · novelty 3.0

MIRAGE combines a medical CLIP model, a diffusion generator, and an LLM into an accessible interface for retrieving and creating educational medical images and texts.

citing papers explorer

Showing 3 of 3 citing papers.

REVEAL: Multimodal Vision-Language Alignment of Retinal Morphometry and Clinical Risks for Incident AD and Dementia Prediction cs.CV · 2026-04-20 · unverdicted · none · ref 5
REVEAL uses vision-language alignment of retinal morphometry and clinical risk narratives plus group contrastive learning to predict AD and dementia about 8 years early.
Representation geometry shapes task performance in vision-language modeling for CT enterography cs.CV · 2026-04-14 · unverdicted · none · ref 26
Mean pooling and multi-window RGB encoding optimize vision-language performance on CT enterography, with retrieval-augmented generation substantially improving automated report severity accuracy over fine-tuning alone.
MIRAGE: Retrieval and Generation of Multimodal Images and Texts for Medical Education cs.CV · 2026-05-06 · unverdicted · none · ref 5
MIRAGE combines a medical CLIP model, a diffusion generator, and an LLM into an accessible interface for retrieving and creating educational medical images and texts.

MedCLIP: Contrastive Learning from Unpaired Medical Images and Text.Proc Conf Empir Methods Nat Lang Process

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer