Bibo Cai
Identifiers
- name variant Bibo Cai 0.60 · backfill
Papers (7)
- DeepTool: Scaling Interleaved Deliberation in Tool-Integrated Reasoning via Process-Supervised Reinforcement Learning · cs.AI · 2026 · author #3
- GR-Ben: A General Reasoning Benchmark for Evaluating Process Reward Models · cs.AI · 2026 · author #4
- TinyJudge: Unverifiable Constraint Alignment via Lightweight Specialist Ensembles · cs.CL · 2026 · author #11
- The Tool-Overuse Illusion: Why Does LLM Prefer External Tools over Internal Knowledge? · cs.AI · 2026 · author #11
- Consolidation or Adaptation? PRISM: Disentangling SFT and RL Data via Gradient Concentration · cs.AI · 2026 · author #5
- MAESTRO: Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization · cs.LG · 2026 · author #5
- Large Language Models Are Still Misled by Simple Bias Ensembles · cs.CL · 2025 · author #5
Mentions
- 2606.07520 #11 · arxiv_oai · confidence 0.70 · Bibo Cai
- 2605.29568 #3 · arxiv_oai · confidence 0.70 · Bibo Cai
Frequent Coauthors
- Ting Liu 7 shared papers
- Xiao Ding 7 shared papers
- Bing Qin 5 shared papers
- Zhouhao Sun 5 shared papers
- Kai Xiong 4 shared papers
- Li Du 4 shared papers
- Yang Zhao 4 shared papers
- Dandan Tu 2 shared papers
- Haonan Song 2 shared papers
- Hepeng Wang 2 shared papers
- Jinglong Gao 2 shared papers
- Wu Ning 2 shared papers
- Yangou Ouyang 2 shared papers
- Yirong Zeng 2 shared papers
- Yufei Liu 2 shared papers
- Yutai Hou 2 shared papers
- Yuxian Wang 2 shared papers
- Zhiyuan Kan 2 shared papers
- Fei Zhang 1 shared paper
- Qixun Zhang 1 shared paper