← back to paper
arxiv: 2605.11775 · 2 revisions
Entropy Polarity in Reinforcement Fine-Tuning: Direction, Asymmetry, and Control