Xiangpeng Wei

Identifiers

No identifiers captured yet.

Papers (2)

  1. VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks · cs.AI · 2025 · author #11
  2. DAPO: An Open-Source LLM Reinforcement Learning System at Scale · cs.LG · 2025 · author #27

Mentions

No mention provenance yet.

Frequent Coauthors