Dexterous world models.arXiv preprint arXiv:2512.17907, 2025

· 2025 · arXiv 2512.17907

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Wh0: Generative World Models as Scalable Sources of Egocentric Human Hand Manipulation Data

cs.RO · 2026-06-20 · unverdicted · novelty 6.0

Wh0 generates scalable egocentric human manipulation videos with world models and converts them to boost pretrained VLA models' zero-shot dexterous task success from 8.3% to 38.9% on 18 real-world tasks.

DexSIM: Real-time Dexterous Simulation with Unified Causal Video Diffusion

cs.CV · 2026-05-23 · unverdicted · novelty 6.0

DexSIM is a bi-directional video diffusion model with hand trajectory embedding and spatial memory cache for real-time dexterous hand-object simulation at 15 FPS.

DeWorldSG: Depth-Aware 3D Semantic Scene Graph Generation via World-Model Priors

cs.CV · 2026-07-01 · unverdicted · novelty 5.0

DeWorldSG improves 3D scene graph generation from RGB-D sequences by using depth-guided 3D Gaussian object nodes and V-JEPA 2 world-model priors for spatiotemporal relation refinement, reporting large recall gains on 3DSSG and ReplicaSSG.

AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization

cs.CV · 2026-06-05 · unverdicted · novelty 5.0

AnchorWorld proposes a simulation framework that adds exogenous viewpoint supervision for full-body grounding and anchor-view text customization for dynamic world evolution in egocentric settings.

citing papers explorer

Showing 4 of 4 citing papers after filters.

Wh0: Generative World Models as Scalable Sources of Egocentric Human Hand Manipulation Data cs.RO · 2026-06-20 · unverdicted · none · ref 34
Wh0 generates scalable egocentric human manipulation videos with world models and converts them to boost pretrained VLA models' zero-shot dexterous task success from 8.3% to 38.9% on 18 real-world tasks.
DexSIM: Real-time Dexterous Simulation with Unified Causal Video Diffusion cs.CV · 2026-05-23 · unverdicted · none · ref 7
DexSIM is a bi-directional video diffusion model with hand trajectory embedding and spatial memory cache for real-time dexterous hand-object simulation at 15 FPS.
DeWorldSG: Depth-Aware 3D Semantic Scene Graph Generation via World-Model Priors cs.CV · 2026-07-01 · unverdicted · none · ref 18
DeWorldSG improves 3D scene graph generation from RGB-D sequences by using depth-guided 3D Gaussian object nodes and V-JEPA 2 world-model priors for spatiotemporal relation refinement, reporting large recall gains on 3DSSG and ReplicaSSG.
AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization cs.CV · 2026-06-05 · unverdicted · none · ref 24
AnchorWorld proposes a simulation framework that adds exogenous viewpoint supervision for full-body grounding and anchor-view text customization for dynamic world evolution in egocentric settings.

Dexterous world models.arXiv preprint arXiv:2512.17907, 2025

fields

years

verdicts

representative citing papers

citing papers explorer