Personalized soft prompts steer VLM attention to match user-specific gaze patterns, yielding better attention alignment and click prediction in recommendation simulations.
Zhuet al., ”Guiding medical vision-language models with explicit visual prompts: Framework design and comprehensive exploration of prompt variations,” inProc
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Applies PEFT to Florence-2 for GI endoscopy VQA and LoRA-adapted Stable Diffusion 2.1 for synthetic image generation, reporting ROUGE/BLEU gains and image quality metrics on Kvasir-VQA.
citing papers explorer
-
Through Their Eyes: Fixation-aligned Tuning for Personalized User Emulation
Personalized soft prompts steer VLM attention to match user-specific gaze patterns, yielding better attention alignment and click prediction in recommendation simulations.
-
Parameter-Efficient VLMs for Gastrointestinal Endoscopy: Medical Image Generation and Clinical Visual Question Answering
Applies PEFT to Florence-2 for GI endoscopy VQA and LoRA-adapted Stable Diffusion 2.1 for synthetic image generation, reporting ROUGE/BLEU gains and image quality metrics on Kvasir-VQA.