Laplacian Representations for Decision-Time Planning

Dikshant Shehmar; Marlos C. Machado; Matthew E. Taylor; Matthew Schlegel

arxiv: 2602.05031 · v2 · pith:UJ7PIITDnew · submitted 2026-02-04 · 💻 cs.LG

Laplacian Representations for Decision-Time Planning

Dikshant Shehmar , Matthew Schlegel , Matthew E. Taylor , Marlos C. Machado This is my paper

classification 💻 cs.LG

keywords planningdecision-timedistanceslaplacianlong-horizonrepresentationrepresentationsalgorithm

0 comments

read the original abstract

Planning with a learned model remains a key challenge in model-based reinforcement learning (RL). In decision-time planning, state representations are critical as they must support local cost computation while preserving long-horizon structure. In this paper, we show that the Laplacian representation provides an effective latent space for planning by capturing state-space distances at multiple time scales. This representation preserves meaningful distances and naturally decomposes long-horizon problems into subgoals, also mitigating the compounding errors that arise over long prediction horizons. Building on these properties, we introduce ALPS, a hierarchical planning algorithm, and demonstrate that it outperforms commonly used baselines on a selection of offline goal-conditioned RL tasks from OGBench, a benchmark previously dominated by model-free methods.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Spectral Alignment in Forward-Backward Representations via Temporal Abstraction
cs.LG 2026-03 unverdicted novelty 4.0

Temporal abstraction functions as a low-pass filter on transition dynamics to lower the effective rank of successor representations while bounding value function error in forward-backward learning.