pith. sign in

Foster, and Alexander Rakhlin

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

The Role of Generator Access in Autoregressive Post-Training

cs.LG · 2026-04-06 · unverdicted · novelty 5.0

Limited generator access in autoregressive post-training confines learners to root-start rollouts whose value is bounded by on-policy prefix probabilities, while weak prefix control unlocks richer observations and produces an exponential gap in KL-regularized outcome-reward training.

citing papers explorer

Showing 1 of 1 citing paper.

  • The Role of Generator Access in Autoregressive Post-Training cs.LG · 2026-04-06 · unverdicted · none · ref 14

    Limited generator access in autoregressive post-training confines learners to root-start rollouts whose value is bounded by on-policy prefix probabilities, while weak prefix control unlocks richer observations and produces an exponential gap in KL-regularized outcome-reward training.