Ganqu Cui

Identifiers

  • name variant Ganqu Cui 0.60 · backfill

Papers (15)

  1. Post-Trained MoE Can Skip Half Experts via Self-Distillation cs.LG · 2026 · author #10
  2. Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling cs.AI · 2026 · author #21
  3. Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning cs.CL · 2026 · author #12
  4. TEMPO: Scaling Test-time Training for Large Reasoning Models cs.LG · 2026 · author #9
  5. MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe cs.LG · 2025 · author #29
  6. SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning cs.RO · 2025 · author #10
  7. A Survey of Reinforcement Learning for Large Reasoning Models cs.CL · 2025 · author #35
  8. InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling cs.CL · 2025 · author #8
  9. The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models cs.LG · 2025 · author #1
  10. TTRL: Test-Time Reinforcement Learning cs.CL · 2025 · author #5
  11. Learning to Reason under Off-Policy Guidance cs.LG · 2025 · author #5
  12. Process Reinforcement through Implicit Rewards cs.LG · 2025 · author #1
  13. MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies cs.CL · 2024 · author #5
  14. UltraFeedback: Boosting Language Models with Scaled AI Feedback cs.CL · 2023 · author #1
  15. Tool Learning with Foundation Models cs.CL · 2023 · author #6

Mentions

  • 2508.08636 #8 · arxiv_oai · confidence 0.70 Ganqu Cui
  • 2605.18643 #10 · arxiv_oai · confidence 0.70 Ganqu Cui
  • 2509.08827 #35 · arxiv_oai · confidence 0.70 Ganqu Cui
  • 2310.01377 #1 · arxiv_oai · confidence 0.70 Ganqu Cui
  • 2304.08354 #6 · arxiv_oai · confidence 0.70 Ganqu Cui
  • 2504.14945 #5 · arxiv_oai · confidence 0.70 Ganqu Cui
  • 2509.18154 #29 · arxiv_oai · confidence 0.70 Ganqu Cui

Frequent Coauthors