Yun Qu

Identifiers

  • name variant Yun Qu 0.60 · backfill

Papers (2)

  1. Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex · cs.LG · 2026 · author #1
  2. Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models · cs.AI · 2026 · author #1

Mentions

  • 2605.06139 #1 · arxiv_oai · confidence 0.70 Yun Qu
  • 2602.01970 #1 · arxiv_oai · confidence 0.70 Yun Qu

Frequent Coauthors