Yun Qu
Identifiers
- name variant Yun Qu 0.60 · backfill
Papers (2)
- Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex cs.LG · 2026 · author #1
- Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models cs.AI · 2026 · author #1
Mentions
- 2605.06139 #1 · arxiv_oai · confidence 0.70 Yun Qu
- 2602.01970 #1 · arxiv_oai · confidence 0.70 Yun Qu
Frequent Coauthors
- Clive Bai 2 shared papers
- Heming Zou 2 shared papers
- Kai Yang 2 shared papers
- Qi Wang 2 shared papers
- Saiyong Yang 2 shared papers
- Weijie Liu 2 shared papers
- Xiangyang Ji 2 shared papers
- Yangkun Chen 2 shared papers
- Yixiu Mao 2 shared papers
- Yuhang Jiang 2 shared papers
- Lizhou Cai 1 shared papers
- Wutong Xu 1 shared papers
- Yingyue Li 1 shared papers