pith. sign in

Guochao Jiang

Identifiers

  • name variant Guochao Jiang 0.60 · backfill

Papers (2)

  1. DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning cs.CL · 2026 · author #1
  2. Beyond Stochastic Exploration: What Makes Training Data Valuable for Agentic Search cs.AI · 2026 · author #3

Mentions

  • 2605.25604 #1 · arxiv_oai · confidence 0.70 Guochao Jiang

Frequent Coauthors