Goal-conditioned reinforcement learning with disentanglement-based reachability planning.arXiv preprint arXiv:2307.10846, 2023

Zhifeng Qian, Mingyu You, Hongjun Zhou, Xuanhui Xu, Bin He · 2023 · arXiv 2307.10846

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Beyond Euclidean Proximity: Repairing Latent World Models with Horizon-Matched Trajectory Reachability Metrics

cs.LG · 2026-05-21 · unverdicted · novelty 5.0

TRM trains a small horizon-matched pairwise head on trajectory data to improve terminal-state ranking in latent MPC, raising success from 7% to 97% on TwoRoom and 32.7% to 84% on PLDM without changing the encoder or dynamics.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond Euclidean Proximity: Repairing Latent World Models with Horizon-Matched Trajectory Reachability Metrics cs.LG · 2026-05-21 · unverdicted · none · ref 13
TRM trains a small horizon-matched pairwise head on trajectory data to improve terminal-state ranking in latent MPC, raising success from 7% to 97% on TwoRoom and 32.7% to 84% on PLDM without changing the encoder or dynamics.

Goal-conditioned reinforcement learning with disentanglement-based reachability planning.arXiv preprint arXiv:2307.10846, 2023

fields

years

verdicts

representative citing papers

citing papers explorer