Horizon Reduction Makes RL Scalable

8 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

baseline 1

citation-polarity summary

years

2026: 7 · 2025: 1

roles

baseline 1

polarities

baseline 1

representative citing papers

Goal-Conditioned Agents that Learn Everything All at Once

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

LEO enables efficient all-goals learning in goal-conditioned RL by jointly predicting for all goals in one network pass, yielding a >250x speedup over relabelling and better performance on Craftax.

Hierarchical Behaviour Spaces

cs.AI · 2026-04-27 · unverdicted · novelty 6.0

Hierarchical Behaviour Spaces uses linear combinations of reward functions to induce expressive behavior spaces in hierarchical RL, yielding strong performance on NetHack primarily through better exploration rather than long-term planning.

Scalable Option Learning in High-Throughput Environments

cs.LG · 2025-08-30 · unverdicted · novelty 6.0

SOL is a new hierarchical RL algorithm that reaches 35x higher throughput and outperforms flat agents when trained on 30 billion frames in NetHack while showing positive scaling.
