Yajie Yang
Identifiers
- name variant Yajie Yang 0.60 · backfill
Papers (4)
- Enhancing LLM-based Search Agents via Contribution Weighted Group Relative Policy Optimization cs.LG · 2026 · author #3
- SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents cs.CL · 2026 · author #2
- DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training cs.LG · 2026 · author #10
- DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training cs.LG · 2025 · author #10
Mentions
- 2602.12984 #2 · arxiv_oai · confidence 0.70 Yajie Yang
Frequent Coauthors
- Shihan Dou 4 shared papers
- Tao Gui 4 shared papers
- Zhiheng Xi 4 shared papers
- Jiazheng Zhang 3 shared papers
- Ming Zhang 3 shared papers
- Qi Zhang 3 shared papers
- Caishuang Huang 2 shared papers
- Chenhao Huang 2 shared papers
- Dingwei Zhu 2 shared papers
- Junjie Ye 2 shared papers
- Junlin Shang 2 shared papers
- Shichun Liu 2 shared papers
- Sixian Li 2 shared papers
- Xuanjing Huang 2 shared papers
- Yuhui Wang 2 shared papers
- Yunke Zhang 2 shared papers
- Yuran Wang 2 shared papers
- Binze Hu 1 shared papers
- Hao Luo 1 shared papers
- Honglin Guo 1 shared papers