pith. sign in

Richard S. Sutton

Identifiers

  • name variant Richard S. Sutton 0.60 · backfill

Papers (34)

  1. Intentional Updates for Streaming Reinforcement Learning cs.LG · 2026 · author #5
  2. Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning cs.LG · 2019 · author #5
  3. Should All Temporal Difference Learning Use Emphasis? cs.AI · 2019 · author #3
  4. Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target cs.LG · 2019 · author #2
  5. Online Off-policy Prediction cs.LG · 2018 · author #4
  6. Predicting Periodicity with Temporal Difference Learning cs.LG · 2018 · author #3
  7. Per-decision Multi-step Temporal Difference Learning with Control Variates cs.LG · 2018 · author #2
  8. Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling cs.LG · 2018 · author #2
  9. Two geometric input transformation methods for fast online reinforcement learning with neural nets cs.LG · 2018 · author #4
  10. TIDBD: Adapting Temporal-difference Step-sizes Through Stochastic Meta-descent cs.LG · 2018 · author #4
  11. Reactive Reinforcement Learning in Asynchronous Environments cs.AI · 2018 · author #3
  12. Directly Estimating the Variance of the {\lambda}-Return Using Temporal-Difference Methods cs.AI · 2018 · author #7
  13. A Deeper Look at Experience Replay cs.LG · 2017 · author #2
  14. Communicative Capital for Prosthetic Agents cs.AI · 2017 · author #2
  15. A First Empirical Study of Emphatic Temporal Difference Learning cs.AI · 2017 · author #3
  16. GQ($\lambda$) Quick Reference and Implementation Guide cs.LG · 2017 · author #2
  17. On Generalized Bellman Equations and Temporal-Difference Learning cs.LG · 2017 · author #3
  18. Multi-step Reinforcement Learning: A Unifying Algorithm cs.AI · 2017 · author #4
  19. Multi-step Off-policy Learning Without Importance Sampling Ratios cs.LG · 2017 · author #3
  20. Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks cs.LG · 2016 · author #3
  21. Face valuing: Training user interfaces with facial expressions and reinforcement learning cs.HC · 2016 · author #3
  22. True Online Temporal-Difference Learning cs.AI · 2015 · author #5
  23. Learning to Predict Independent of Span cs.LG · 2015 · author #2
  24. True Online Emphatic TD($\lambda$): Quick Reference and Implementation Guide cs.LG · 2015 · author #1
  25. Emphatic Temporal-Difference Learning cs.LG · 2015 · author #4
  26. An Empirical Evaluation of True Online TD({\lambda}) cs.AI · 2015 · author #4
  27. Temporal-Difference Networks cs.LG · 2015 · author #1
  28. An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning cs.LG · 2015 · author #1
  29. Temporal-Difference Learning to Assist Human Decision Making during the Control of an Artificial Limb cs.AI · 2013 · author #4
  30. Planning by Prioritized Sweeping with Small Backups cs.AI · 2013 · author #2
  31. Scaling Life-long Off-policy Learning cs.AI · 2012 · author #3
  32. Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping cs.AI · 2012 · author #1
  33. Off-Policy Actor-Critic cs.LG · 2012 · author #3
  34. Multi-timescale Nexting in a Reinforcement Learning Robot cs.LG · 2011 · author #3

Mentions

  • 1309.4714 #4 · backfill · confidence 0.70 Richard S. Sutton
  • 1301.2343 #2 · backfill · confidence 0.70 Richard S. Sutton
  • 1206.6262 #3 · backfill · confidence 0.70 Richard S. Sutton
  • 1206.3285 #1 · backfill · confidence 0.70 Richard S. Sutton
  • 1205.4839 #3 · backfill · confidence 0.70 Richard S. Sutton
  • 1112.1133 #3 · backfill · confidence 0.70 Richard S. Sutton

Frequent Coauthors