hub Canonical reference

Videovla: Video generators can be generalizable robot manipulators. arXiv preprint arXiv:2512.06963

Canonical reference. 86% of citing Pith papers cite this work as background.

12 Pith papers citing it
Background 86% of classified citations

citation-role summary: background 6, other 1

citation-polarity summary

fields: cs.RO 8, cs.CV 4

years: 2026 (12)

polarities: background 6, unclear 1

representative citing papers

Causal World Modeling for Robot Control

cs.CV · 2026-01-29 · unverdicted · novelty 5.0

LingBot-VA combines video world modeling with policy learning via Mixture-of-Transformers, closed-loop rollouts, and asynchronous inference to improve robot manipulation in simulation and real settings.
