pith. sign in

Minxuan Lv

Identifiers

  • name variant Minxuan Lv 0.60 · backfill

Papers (5)

  1. GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment cs.CL · 2026 · author #1
  2. Kwai Summary Attention Technical Report cs.CL · 2026 · author #29
  3. Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning cs.LG · 2026 · author #2
  4. Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning cs.LG · 2025 · author #3
  5. CE-GPPO: Coordinating Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning cs.LG · 2025 · author #3

Mentions

  • 2603.09803 #2 · arxiv_oai · confidence 0.70 Minxuan Lv
  • 2605.19577 #1 · arxiv_oai · confidence 0.70 Minxuan Lv

Frequent Coauthors