Binghai Wang
Identifiers
- name variant Binghai Wang 0.60 · backfill
Papers (3)
- EVPO: Explained Variance Policy Optimization for Adaptive Critic Utilization in LLM Post-Training cs.LG · 2026 · author #9
- MM-Doc-R1: Training Agents for Long Document Visual Question Answering through Multi-turn Reinforcement Learning cs.CL · 2026 · author #3
- Secrets of RLHF in Large Language Models Part I: PPO cs.CL · 2023 · author #6
Mentions
- 2307.04964 #6 · arxiv_oai · confidence 0.70 Binghai Wang
Frequent Coauthors
- Shihan Dou 3 shared papers
- Tao Gui 3 shared papers
- Xuanjing Huang 3 shared papers
- Hang Yan 2 shared papers
- Jiahang Lin 2 shared papers
- Qi Zhang 2 shared papers
- Rui Zheng 2 shared papers
- Shichun Liu 2 shared papers
- Songyang Gao 2 shared papers
- Yuhao Zhou 2 shared papers
- Zhenhua Han 2 shared papers
- Zhiheng Xi 2 shared papers
- Cheng Chang 1 shared papers
- Chengjun Pan 1 shared papers
- Dingwei Zhu 1 shared papers
- Enyu Zhou 1 shared papers
- Haoran Huang 1 shared papers
- Honglin Guo 1 shared papers
- Jiazheng Zhang 1 shared papers
- Junzhe Wang 1 shared papers