Kaiyan Zhang

Identifiers

  • name variant Kaiyan Zhang 0.60 · backfill

Papers (8)

  1. Post-Trained MoE Can Skip Half Experts via Self-Distillation cs.LG · 2026 · author #3
  2. Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future cs.CL · 2026 · author #6
  3. MARS$^2$: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation cs.AI · 2026 · author #6
  4. SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning cs.RO · 2025 · author #6
  5. A Survey of Reinforcement Learning for Large Reasoning Models cs.CL · 2025 · author #1
  6. TTRL: Test-Time Reinforcement Learning cs.CL · 2025 · author #2
  7. Process Reinforcement through Implicit Rewards cs.LG · 2025 · author #15
  8. MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding cs.AI · 2025 · author #7

Mentions

  • 2605.18643 #3 · arxiv_oai · confidence 0.70 Kaiyan Zhang
  • 2509.08827 #1 · arxiv_oai · confidence 0.70 Kaiyan Zhang
  • 2501.18362 #7 · arxiv_oai · confidence 0.70 Kaiyan Zhang

Frequent Coauthors