pith. sign in

Alignment Pretraining : AI Discourse Causes Self - Fulfilling ( Mis )alignment, January 2026

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

citation-role summary

dataset 1

citation-polarity summary

fields

cs.CL 1 cs.LG 1

years

2026 2

roles

dataset 1

polarities

use dataset 1

representative citing papers

Understanding Goal Generalisation in Sequential Reinforcement Learning

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

Empirical analysis of over 100 sequential RL training pipelines across 250+ OOD environments finds salient features drive generalization and early goals persist, with latent policy gradients simulating latent variable evolution to predict OOD behavior from training history.

citing papers explorer

Showing 2 of 2 citing papers.