Bisimulation metric for model predictive control

Yutaka Shimizu, Masayoshi Tomizuka · 2024 · arXiv 2410.04553

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization

cs.LG · 2026-05-25 · unverdicted · novelty 5.0

MBDPO reformulates policy optimization as a diffusion process over searched trajectories in latent world models to reduce misalignment between search and value learning.

citing papers explorer

Showing 1 of 1 citing paper.

Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization cs.LG · 2026-05-25 · unverdicted · none · ref 31
MBDPO reformulates policy optimization as a diffusion process over searched trajectories in latent world models to reduce misalignment between search and value learning.

Bisimulation metric for model predictive control

fields

years

verdicts

representative citing papers

citing papers explorer