Gradient-based planning with world models. arXiv:2312.17227

Jyothir S V, Siddhartha Jalagam, Yann LeCun, Vlad Sobal · 2023 · arXiv 2312.17227

6 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Learning Visual Feature-Based World Models via Residual Latent Action

cs.CV · 2026-05-08 · unverdicted · novelty 7.0

RLA-WM predicts residual latent actions via flow matching to create visual feature world models that outperform prior feature-based and diffusion approaches while enabling offline video-based robot RL.

Gradient-Based Join Ordering

cs.DB · 2025-11-18 · unverdicted · novelty 7.0

Relaxing join orders to a differentiable soft adjacency matrix and optimizing with gradients plus a GNN cost model yields plans that match or beat discrete search while scaling better on graph datasets.

Slot-MPC: Goal-Conditioned Model Predictive Control with Object-Centric Representations

cs.LG · 2026-05-14 · unverdicted · novelty 6.0

Slot-MPC learns slot representations to build a differentiable object-centric dynamics model that supports efficient gradient-based MPC for robotic manipulation in novel situations.

Dream-MPC: Gradient-Based Model Predictive Control with Latent Imagination

cs.LG · 2026-05-06 · unverdicted · novelty 6.0 · 2 refs

Dream-MPC refines policy-generated trajectories by gradient ascent in a latent world model with uncertainty regularization and temporal amortization, improving base policy performance and beating gradient-free MPC on 24 continuous control tasks.

Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization

cs.LG · 2026-05-25 · unverdicted · novelty 5.0

MBDPO reformulates policy optimization as a diffusion process over searched trajectories in latent world models to reduce misalignment between search and value learning.

Emotion-Conditioned Short-Horizon Human Pose Forecasting with a Lightweight Predictive World Model

cs.CV · 2026-04-26 · unverdicted · novelty 3.0

Facial emotion embeddings improve short-term pose forecasting accuracy for emotion-driven motions when fused via normalized gating in a lightweight LSTM world model, but not with simple multimodal fusion.

citing papers explorer

Learning Visual Feature-Based World Models via Residual Latent Action · cs.CV · 2026-05-08 · unverdicted · none · ref 32
Gradient-Based Join Ordering · cs.DB · 2025-11-18 · unverdicted · none · ref 26
Slot-MPC: Goal-Conditioned Model Predictive Control with Object-Centric Representations · cs.LG · 2026-05-14 · unverdicted · none · ref 10
Dream-MPC: Gradient-Based Model Predictive Control with Latent Imagination · cs.LG · 2026-05-06 · unverdicted · none · ref 5 · 2 links
Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization · cs.LG · 2026-05-25 · unverdicted · none · ref 8
Emotion-Conditioned Short-Horizon Human Pose Forecasting with a Lightweight Predictive World Model · cs.CV · 2026-04-26 · unverdicted · none · ref 15
