← back to paper
arxiv: 2604.18486 · 2 revisions
Xiaomi OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation