Is One Layer Enough? Training A Single Transformer Layer Can Match Full-Parameter RL Training cs.LG · 2026-07-01