pith. sign in

Yangkun Chen

Identifiers

  • name variant Yangkun Chen 0.60 · backfill

Papers (5)

  1. RLVR Datasets and Where to Find Them: Tracing Data Lineage for Better Training Data cs.LG · 2026 · author #6
  2. Debiased Model-based Representations for Sample-efficient Continuous Control cs.LG · 2026 · author #5
  3. Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex cs.LG · 2026 · author #12
  4. Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models cs.CL · 2026 · author #5
  5. Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models cs.AI · 2026 · author #9

Mentions

  • 2605.26971 #6 · arxiv_oai · confidence 0.70 Yangkun Chen
  • 2605.06139 #12 · arxiv_oai · confidence 0.70 Yangkun Chen
  • 2602.01970 #9 · arxiv_oai · confidence 0.70 Yangkun Chen

Frequent Coauthors