pith. machine review for the scientific record. sign in

Lu Pan

Identifiers

No identifiers captured yet.

Papers (3)

  1. From $\log \pi$ to $\pi$: Taming Divergence in Soft Clipping via Bilateral Decoupled Decay of Probability Gradient Weight cs.LG · 2026 · author #8
  2. How to Allocate, How to Learn? Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization cs.LG · 2026 · author #7
  3. MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning cs.LG · 2026 · author #8

Mentions

No mention provenance yet.

Frequent Coauthors