pith. sign in

Yuxin Zuo

Identifiers

  • name variant Yuxin Zuo 0.60 · backfill

Papers (9)

  1. Post-Trained MoE Can Skip Half Experts via Self-Distillation cs.LG · 2026 · author #7
  2. Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning cs.CL · 2026 · author #4
  3. Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe cs.LG · 2026 · author #2
  4. Towards Knowledgeable Deep Research: Framework and Benchmark cs.AI · 2026 · author #8
  5. SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning cs.RO · 2025 · author #2
  6. A Survey of Reinforcement Learning for Large Reasoning Models cs.CL · 2025 · author #2
  7. The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models cs.LG · 2025 · author #6
  8. TTRL: Test-Time Reinforcement Learning cs.CL · 2025 · author #1
  9. MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding cs.AI · 2025 · author #1

Mentions

  • 2605.18643 #7 · arxiv_oai · confidence 0.70 Yuxin Zuo
  • 2509.08827 #2 · arxiv_oai · confidence 0.70 Yuxin Zuo
  • 2501.18362 #1 · arxiv_oai · confidence 0.70 Yuxin Zuo

Frequent Coauthors