A new UML class diagram VQA benchmark and 16k dataset enable LoRA fine-tuning to outperform Qwen 3.5 27B.
MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2representative citing papers
Data curation alone raises VLM accuracy by more than 11 points on average across many benchmarks while cutting required training compute by up to 87 times.
citing papers explorer
-
Unlocking UML Class Diagram Understanding in Vision Language Models
A new UML class diagram VQA benchmark and 16k dataset enable LoRA fine-tuning to outperform Qwen 3.5 27B.
-
20/20 Vision Language Models: A Prescription for Better VLMs through Data Curation Alone
Data curation alone raises VLM accuracy by more than 11 points on average across many benchmarks while cutting required training compute by up to 87 times.