Shuguang Ma

Identifiers

  • name variant Shuguang Ma 0.60 · backfill

Papers (3)

  1. ADaPT: Token-Level Decoupling for Efficient Large Reasoning Models · cs.LG · 2026 · author #8
  2. Don't Tell the Answer, Truly Guide the Reasoning During RL Rollouts · cs.LG · 2025 · author #8
  3. Selective Expert Guidance for Effective and Diverse Exploration in Reinforcement Learning of LLMs · cs.AI · 2025 · author #8

Mentions

  • 2510.04140 · #8 · arxiv_oai · confidence 0.70 · Shuguang Ma
  • 2510.09388 · #8 · arxiv_oai · confidence 0.70 · Shuguang Ma
  • 2606.19919 · #8 · arxiv_oai · confidence 0.70 · Shuguang Ma

Frequent Coauthors