Runpeng Dai

Identifiers

  • name variant Runpeng Dai 0.60 · backfill

Papers (5)

  1. Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling · cs.CL · 2026 · author #1
  2. G-Zero: Self-Play for Open-Ended Generation from Zero Data · cs.LG · 2026 · author #4
  3. DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification · cs.CL · 2026 · author #6
  4. Reinforcing Multimodal Reasoning Against Visual Degradation · cs.CV · 2026 · author #6
  5. LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling · cs.CL · 2026 · author #7

Mentions

  • 2606.03102 #1 · arxiv_oai · confidence 0.70 · Runpeng Dai

Frequent Coauthors