Latent action pretraining through world modeling

Bahey Tharwat, Yara Nasser, Ali Abouzeid, Ian Reid · 2025 · arXiv 2509.18428

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 1

citation-polarity summary

background: 1

representative citing papers

Learning Visual Feature-Based World Models via Residual Latent Action

cs.CV · 2026-05-08 · unverdicted · novelty 7.0

RLA-WM predicts residual latent actions via flow matching to create visual feature world models that outperform prior feature-based and diffusion approaches while enabling offline video-based robot RL.

LA-Pose: Latent Action Pretraining Meets Pose Estimation

cs.CV · 2026-04-30 · unverdicted · novelty 6.0

LA-Pose achieves over 10% higher pose accuracy than recent feed-forward methods on Waymo and PandaSet benchmarks by repurposing latent actions from self-supervised inverse-dynamics pretraining while using orders of magnitude less labeled 3D data.

OrbiSim: World Models as Differentiable Physics Engines for Embodied Intelligence

cs.RO · 2026-05-12 · unverdicted · novelty 5.0

OrbiSim builds a differentiable physics engine from world models to support gradient-based policy optimization and contact modeling in robotics.

citing papers explorer

Showing 3 of 3 citing papers.

Learning Visual Feature-Based World Models via Residual Latent Action

cs.CV · 2026-05-08 · unverdicted · none · ref 43

LA-Pose: Latent Action Pretraining Meets Pose Estimation

cs.CV · 2026-04-30 · unverdicted · none · ref 28

OrbiSim: World Models as Differentiable Physics Engines for Embodied Intelligence

cs.RO · 2026-05-12 · unverdicted · none · ref 37
