
Chengsong Huang

Identifiers

  • name variant: Chengsong Huang · 0.60 · backfill

Papers (9)

  1. Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling · cs.CL · 2026 · author #4
  2. You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories · cs.LG · 2026 · author #4
  3. Process Rewards with Learned Reliability · cs.CL · 2026 · author #3
  4. G-Zero: Self-Play for Open-Ended Generation from Zero Data · cs.LG · 2026 · author #1
  5. LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling · cs.CL · 2026 · author #3
  6. Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration · cs.AI · 2026 · author #2
  7. MoCo: A One-Stop Shop for Model Collaboration Research · cs.CL · 2026 · author #14
  8. Self-Rewarding Vision-Language Model via Reasoning Decomposition · cs.CV · 2025 · author #3
  9. R-Zero: Self-Evolving Reasoning LLM from Zero Data · cs.LG · 2025 · author #1

Mentions

  • 2606.03102 #4 · arxiv_oai · confidence 0.70 · Chengsong Huang
  • 2605.21468 #4 · arxiv_oai · confidence 0.70 · Chengsong Huang
  • 2605.15529 #3 · arxiv_oai · confidence 0.70 · Chengsong Huang

Frequent Coauthors