Shang Qu

Identifiers

  • name variant Shang Qu 0.60 · backfill

Papers (3)

  1. A Survey of Reinforcement Learning for Large Reasoning Models · cs.CL · 2025 · author #15
  2. TTRL: Test-Time Reinforcement Learning · cs.CL · 2025 · author #4
  3. MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding · cs.AI · 2025 · author #2

Mentions

  • 2509.08827 #15 · arxiv_oai · confidence 0.70 Shang Qu
  • 2501.18362 #2 · arxiv_oai · confidence 0.70 Shang Qu

Frequent Coauthors