Applies PEFT to Florence-2 for GI endoscopy VQA and LoRA-adapted Stable Diffusion 2.1 for synthetic image generation, reporting ROUGE/BLEU gains and image quality metrics on Kvasir-VQA.
Yanet al., ”Vision-language large learning model, GPT4V , accu- rately classifies the Boston Bowel Preparation Scale score,”BMJ Open Gastroenterology, vol
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Parameter-Efficient VLMs for Gastrointestinal Endoscopy: Medical Image Generation and Clinical Visual Question Answering
Applies PEFT to Florence-2 for GI endoscopy VQA and LoRA-adapted Stable Diffusion 2.1 for synthetic image generation, reporting ROUGE/BLEU gains and image quality metrics on Kvasir-VQA.