Kaiyan Zhang
Identifiers
- name variant Kaiyan Zhang 0.60 · backfill
Papers (8)
- Post-Trained MoE Can Skip Half Experts via Self-Distillation cs.LG · 2026 · author #3
- Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future cs.CL · 2026 · author #6
- MARS$^2$: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation cs.AI · 2026 · author #6
- SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning cs.RO · 2025 · author #6
- A Survey of Reinforcement Learning for Large Reasoning Models cs.CL · 2025 · author #1
- TTRL: Test-Time Reinforcement Learning cs.CL · 2025 · author #2
- Process Reinforcement through Implicit Rewards cs.LG · 2025 · author #15
- MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding cs.AI · 2025 · author #7
Mentions
- 2605.18643 #3 · arxiv_oai · confidence 0.70 Kaiyan Zhang
- 2509.08827 #1 · arxiv_oai · confidence 0.70 Kaiyan Zhang
- 2501.18362 #7 · arxiv_oai · confidence 0.70 Kaiyan Zhang
Frequent Coauthors
- Bowen Zhou 7 shared papers
- Ning Ding 6 shared papers
- Ganqu Cui 5 shared papers
- Yuxin Zuo 5 shared papers
- Xuekai Zhu 4 shared papers
- Youbang Sun 4 shared papers
- Yuchen Fan 4 shared papers
- Yuchen Zhang 4 shared papers
- Biqing Qi 3 shared papers
- Ermo Hua 3 shared papers
- Haozhan Li 3 shared papers
- Shang Qu 3 shared papers
- Xingtai Lv 3 shared papers
- Bingxiang He 2 shared papers
- Huayu Chen 2 shared papers
- Lifan Yuan 2 shared papers
- Li Sheng 2 shared papers
- Pengfei Li 2 shared papers
- Shijie Wang 2 shared papers
- Weize Chen 2 shared papers