pith. sign in

Nigel Tao

Identifiers

  • name variant Nigel Tao 0.60 · backfill

Papers (1)

  1. The Optimal Reward Baseline for Gradient-Based Reinforcement Learning cs.LG · 2013 · author #2

Mentions

  • 1301.2315 #2 · backfill · confidence 0.70 Nigel Tao

Frequent Coauthors