Zhouhao Sun
Identifiers
No identifiers captured yet.
Papers (4)
- GR-Ben: A General Reasoning Benchmark for Evaluating Process Reward Models cs.AI · 2026 · author #1
- Consolidation or Adaptation? PRISM: Disentangling SFT and RL Data via Gradient Concentration cs.AI · 2026 · author #8
- MAESTRO: Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization cs.LG · 2026 · author #8
- Large Language Models Are Still Misled by Simple Bias Ensembles cs.CL · 2025 · author #1
Mentions
No mention provenance yet.
Frequent Coauthors
- Bibo Cai 4 shared papers
- Bing Qin 4 shared papers
- Li Du 4 shared papers
- Ting Liu 4 shared papers
- Xiao Ding 4 shared papers
- Yang Zhao 4 shared papers
- Kai Xiong 3 shared papers
- Hepeng Wang 2 shared papers
- Jinglong Gao 2 shared papers
- Yangou Ouyang 2 shared papers
- Zhiyuan Kan 2 shared papers
- Fei Zhang 1 shared papers
- weidi tang 1 shared papers
- Xinran Dai 1 shared papers
- Xuan Zhang 1 shared papers