Shang Qu
Identifiers
- name variant Shang Qu 0.60 · backfill
Papers (3)
- A Survey of Reinforcement Learning for Large Reasoning Models cs.CL · 2025 · author #15
- TTRL: Test-Time Reinforcement Learning cs.CL · 2025 · author #4
- MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding cs.AI · 2025 · author #2
Mentions
- 2509.08827 #15 · arxiv_oai · confidence 0.70 Shang Qu
- 2501.18362 #2 · arxiv_oai · confidence 0.70 Shang Qu
Frequent Coauthors
- Bowen Zhou 3 shared papers
- Ermo Hua 3 shared papers
- Kaiyan Zhang 3 shared papers
- Ning Ding 3 shared papers
- Xuekai Zhu 3 shared papers
- Yuxin Zuo 3 shared papers
- Biqing Qi 2 shared papers
- Ganqu Cui 2 shared papers
- Haozhan Li 2 shared papers
- Xinwei Long 2 shared papers
- Youbang Sun 2 shared papers
- Yuchen Zhang 2 shared papers
- Zhiyuan Ma 2 shared papers
- Bingxiang He 1 shared paper
- Che Jiang 1 shared paper
- Dong Li 1 shared paper
- Fangfu Liu 1 shared paper
- Guoli Jia 1 shared paper
- Huayu Chen 1 shared paper
- Jiaze Ma 1 shared paper