Shangdong Yang

Identifiers

  • name variant Shangdong Yang 0.60 · backfill

Papers (3)

  1. Behavior-Aware Auxiliary Corrections for Off-Policy Temporal-Difference Prediction cs.AI · 2026 · author #4
  2. Behavior-Induced Mirror-Prox Temporal-Difference Learning for Faster Off-Policy Prediction cs.AI · 2026 · author #3
  3. Regularized Centered Emphatic Temporal Difference Learning cs.AI · 2026 · author #5

Mentions

  • 2605.28855 #4 · arxiv_oai · confidence 0.70 Shangdong Yang
  • 2605.28849 #3 · arxiv_oai · confidence 0.70 Shangdong Yang

Frequent Coauthors