Caishuang Huang
Identifiers
No identifiers captured yet.
Papers (3)
- DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training cs.LG · 2026 · author #14
- DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training cs.LG · 2025 · author #15
- MulDimIF: A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models cs.CL · 2025 · author #2
Mentions
No mention provenance yet.
Frequent Coauthors
- Junjie Ye 3 shared papers
- Tao Gui 3 shared papers
- Chenhao Huang 2 shared papers
- Dingwei Zhu 2 shared papers
- Jiazheng Zhang 2 shared papers
- Ming Zhang 2 shared papers
- Qi Zhang 2 shared papers
- Shichun Liu 2 shared papers
- Shihan Dou 2 shared papers
- Sixian Li 2 shared papers
- Xuanjing Huang 2 shared papers
- Yajie Yang 2 shared papers
- Yuhui Wang 2 shared papers
- Yunke Zhang 2 shared papers
- Yuran Wang 2 shared papers
- Zhiheng Xi 2 shared papers
- Chenyuan Yang 1 shared papers
- Honglin Guo 1 shared papers
- Jiahan Li 1 shared papers
- Jianping Fan 1 shared papers