pith. sign in

Adams, Tyler Cody, and Peter A

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

fields

cs.AI 2 cs.LG 2

years

2026 3 2024 1

verdicts

UNVERDICTED 4

roles

background 1

polarities

support 1

clear filters

representative citing papers

Understanding Goal Generalisation in Sequential Reinforcement Learning

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

Empirical analysis of over 100 sequential RL training pipelines across 250+ OOD environments finds salient features drive generalization and early goals persist, with latent policy gradients simulating latent variable evolution to predict OOD behavior from training history.

citing papers explorer

Showing 1 of 1 citing paper after filters.