pith. sign in

Guofeng Quan

Identifiers

  • name variant Guofeng Quan 0.60 · backfill

Papers (2)

  1. DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning cs.CL · 2026 · author #3
  2. Beyond Stochastic Exploration: What Makes Training Data Valuable for Agentic Search cs.AI · 2026 · author #4

Mentions

  • 2605.25604 #3 · arxiv_oai · confidence 0.70 Guofeng Quan

Frequent Coauthors