pith. sign in

VideoVLA: Video generators can be generalizable robot manipulators

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.RO 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

PhysBrain 1.0 Technical Report

cs.RO · 2026-05-14 · unverdicted · novelty 5.0

PhysBrain 1.0 extracts scene elements, spatial dynamics, actions and depth relations from human egocentric video to create QA supervision for VLMs, then transfers the resulting physical priors to VLA policies via capability-preserving adaptation.

citing papers explorer

Showing 1 of 1 citing paper.

  • PhysBrain 1.0 Technical Report cs.RO · 2026-05-14 · unverdicted · none · ref 32

    PhysBrain 1.0 extracts scene elements, spatial dynamics, actions and depth relations from human egocentric video to create QA supervision for VLMs, then transfers the resulting physical priors to VLA policies via capability-preserving adaptation.