← back to paper
arxiv: 2605.20246 · 2 revisions
GROW: Aligning GRPO with State-Action Modeling for Open-World VLM Agents