Nigel Tao
Identifiers
- name variant Nigel Tao 0.60 · backfill
Papers (1)
- The Optimal Reward Baseline for Gradient-Based Reinforcement Learning cs.LG · 2013 · author #2
Mentions
- 1301.2315 #2 · backfill · confidence 0.70 Nigel Tao
Frequent Coauthors
- Lex Weaver 1 shared paper