pith. sign in

Yajie Yang

Identifiers

  • name variant Yajie Yang 0.60 · backfill

Papers (4)

  1. Enhancing LLM-based Search Agents via Contribution Weighted Group Relative Policy Optimization cs.LG · 2026 · author #3
  2. SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents cs.CL · 2026 · author #2
  3. DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training cs.LG · 2026 · author #10
  4. DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training cs.LG · 2025 · author #10

Mentions

  • 2602.12984 #2 · arxiv_oai · confidence 0.70 Yajie Yang

Frequent Coauthors