Guowei Rong
Identifiers
- name variant Guowei Rong 0.60 · backfill
Papers (1)
- Mitigating Reward Hacking in RLHF via Bayesian Non-negative Reward Modeling · cs.LG · 2026 · author #2
Mentions
- 2602.10623 #2 · arxiv_oai · confidence 0.70 Guowei Rong
Frequent Coauthors
- Bo Chen 1 shared paper
- Dandan Guo 1 shared paper
- Mingyuan Zhou 1 shared paper
- Zhibin Duan 1 shared paper
- Zhuo Li 1 shared paper