Runpeng Dai
Identifiers
- name variant Runpeng Dai 0.60 · backfill
Papers (5)
- Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling cs.CL · 2026 · author #1
- G-Zero: Self-Play for Open-Ended Generation from Zero Data cs.LG · 2026 · author #4
- DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification cs.CL · 2026 · author #6
- Reinforcing Multimodal Reasoning Against Visual Degradation cs.CV · 2026 · author #6
- LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling cs.CL · 2026 · author #7
Mentions
- 2606.03102 #1 · arxiv_oai · confidence 0.70 Runpeng Dai
Frequent Coauthors
- Tong Zheng 5 shared papers
- Rui Liu 4 shared papers
- Chengsong Huang 3 shared papers
- Haolin Liu 3 shared papers
- Dian Yu 2 shared papers
- Haitao Mi 2 shared papers
- Leoweiliang 2 shared papers
- Pratap Tokekar 2 shared papers
- Yucheng Shi 2 shared papers
- Chenxi Liu 1 shared paper
- Heng Huang 1 shared paper
- Hongming Zhang 1 shared paper
- Hongtu Zhu 1 shared paper
- Huiwen Bao 1 shared paper
- Jiaxin Huang 1 shared paper
- Jinyuan Li 1 shared paper
- Langlin Huang 1 shared paper
- Ruibo Chen 1 shared paper
- Sheng Zhang 1 shared paper
- Tianyi Xiong 1 shared paper