Deqing Yang

Identifiers

name variant Deqing Yang 0.60 · backfill

Papers (7)

Deep Research as Rubric for Reinforcement Learning · cs.CL · 2026 · author #12
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation · cs.LG · 2026 · author #8
M3D-Stereo: A Multiple-Medium and Multiple-Degradation Dataset for Stereo Image Restoration · cs.CV · 2026 · author #1
Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning · cs.LG · 2026 · author #8
Outcome-Grounded Advantage Reshaping for Fine-Grained Credit Assignment in Mathematical Reasoning · cs.CL · 2026 · author #8
Confidence Estimation for LLMs in Multi-turn Interactions · cs.CL · 2026 · author #7
What Makes an Ideal Quote? Recommending "Unexpected yet Rational" Quotations via Novelty · cs.IR · 2025 · author #6

Mentions

2603.09803 #8 · arxiv_oai · confidence 0.70 Deqing Yang
2601.07408 #8 · arxiv_oai · confidence 0.70 Deqing Yang
2606.01091 #12 · arxiv_oai · confidence 0.70 Deqing Yang
2605.28293 #8 · arxiv_oai · confidence 0.70 Deqing Yang

Frequent Coauthors

Jiaqing Liang 3 shared papers
Ao Xu 2 shared papers
Hengrui Chen 2 shared papers
Hongru Hou 2 shared papers
Tiehua Mei 2 shared papers
Yanghua Xiao 2 shared papers
Bo Chen 1 shared paper
Bowei Zhang 1 shared paper
Caiqi Zhang 1 shared paper
Chengzu Li 1 shared paper
Dajiang Lu 1 shared paper
Denghui Geng 1 shared paper
Feng Xiao 1 shared paper
Guanglei Yue 1 shared paper
Hongcheng Guo 1 shared paper
Jinhui Huang 1 shared paper
Jin Xiao 1 shared paper
Lefan Zhang 1 shared paper
Leiyu Pan 1 shared paper
Liu Kang 1 shared paper