Guid- ing medical vision-language models with diverse visual prompts: Framework design and comprehensive explo- ration of prompt variations,

Kangyu Zhu, Ziyuan Qin, Huahui Yi, Zekun Jiang, Qicheng Lao, Shaoting Zhang, Kang Li, “Guiding medical vision-language models with diverse visual prompts: Framework design, comprehensive exploration of prompt variations,” inProce · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

RoiMAM: Region-of-Interest Medical Attention Model for Efficient Vision-Language Understanding

cs.CV · 2026-05-15 · unverdicted · novelty 5.0

RoiMAM integrates a training-free ROI Generation Module with Semantic Selective Suppression and a Text Prompt Enhancer to produce a compact VLM that reports 2 percent and 4.6 percent accuracy gains on SLAKE and PMC-VQA at less than 20 percent the size of MedVInT-TD.

citing papers explorer

Showing 1 of 1 citing paper.

RoiMAM: Region-of-Interest Medical Attention Model for Efficient Vision-Language Understanding cs.CV · 2026-05-15 · unverdicted · none · ref 9
RoiMAM integrates a training-free ROI Generation Module with Semantic Selective Suppression and a Text Prompt Enhancer to produce a compact VLM that reports 2 percent and 4.6 percent accuracy gains on SLAKE and PMC-VQA at less than 20 percent the size of MedVInT-TD.

Guid- ing medical vision-language models with diverse visual prompts: Framework design and comprehensive explo- ration of prompt variations,

fields

years

verdicts

representative citing papers

citing papers explorer