SeeTraceAct: Visibility-Aware Latent Planning from Cross-Embodiment Demonstration Videos

Chris Dongjoo Kim; Dieter Fox; Jaehyeon Son; Jaemin Cho; Jeremiah Coholich; Jinhoo Kim; Junhyun Kim; Kyle Kam; Seok Joon Kim; Zsolt Kira

arxiv: 2606.02745 · v1 · pith:QI3N4ZZ2new · submitted 2026-06-01 · 💻 cs.RO · cs.LG

SeeTraceAct: Visibility-Aware Latent Planning from Cross-Embodiment Demonstration Videos

Jaehyeon Son , Junhyun Kim , Kyle Kam , Jeremiah Coholich , Seok Joon Kim , Jinhoo Kim , Chris Dongjoo Kim , Jaemin Cho

show 2 more authors

Dieter Fox Zsolt Kira

This is my paper

classification 💻 cs.RO cs.LG

keywords demo-conditionedrobocasa-dcseetraceactconditionedcross-embodimentdemonstrationdemonstrationsreal-world

0 comments

read the original abstract

Vision-language-action models (VLAs) are promising general-purpose robot policies, but adapting them to new tasks typically requires costly task-specific teleoperation data. As an alternative, we study one-shot demo-conditioned VLAs, where a robot policy is conditioned on a single demonstration video of an unseen task. We find that existing end-to-end approaches often struggle when successful execution requires precisely localizing small target regions. To address this limitation, we propose SeeTraceAct, a demo-conditioned VLA framework that encourages precise spatial grounding through visibility-aware prediction of future end-effector traces. To enable reproducible evaluation with cross-embodiment demonstrations, we introduce and release RoboCasa-DC, a demo-conditioned extension of RoboCasa with episode-paired humanoid videos. Experiments on RoboCasa-DC and a real-world benchmark, where a Franka Panda arm is conditioned on human demonstrations, show that SeeTraceAct outperforms baselines, achieving the best success rate across all four RoboCasa-DC settings and improving real-world average success by 12.5 percentage points.

This paper has not been read by Pith yet.

SeeTraceAct: Visibility-Aware Latent Planning from Cross-Embodiment Demonstration Videos

discussion (0)