Richard S. Sutton
Identifiers
- name variant Richard S. Sutton 0.60 · backfill
Papers (34)
- Intentional Updates for Streaming Reinforcement Learning cs.LG · 2026 · author #5
- Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning cs.LG · 2019 · author #5
- Should All Temporal Difference Learning Use Emphasis? cs.AI · 2019 · author #3
- Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target cs.LG · 2019 · author #2
- Online Off-policy Prediction cs.LG · 2018 · author #4
- Predicting Periodicity with Temporal Difference Learning cs.LG · 2018 · author #3
- Per-decision Multi-step Temporal Difference Learning with Control Variates cs.LG · 2018 · author #2
- Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling cs.LG · 2018 · author #2
- Two geometric input transformation methods for fast online reinforcement learning with neural nets cs.LG · 2018 · author #4
- TIDBD: Adapting Temporal-difference Step-sizes Through Stochastic Meta-descent cs.LG · 2018 · author #4
- Reactive Reinforcement Learning in Asynchronous Environments cs.AI · 2018 · author #3
- Directly Estimating the Variance of the $\lambda$-Return Using Temporal-Difference Methods cs.AI · 2018 · author #7
- A Deeper Look at Experience Replay cs.LG · 2017 · author #2
- Communicative Capital for Prosthetic Agents cs.AI · 2017 · author #2
- A First Empirical Study of Emphatic Temporal Difference Learning cs.AI · 2017 · author #3
- GQ($\lambda$) Quick Reference and Implementation Guide cs.LG · 2017 · author #2
- On Generalized Bellman Equations and Temporal-Difference Learning cs.LG · 2017 · author #3
- Multi-step Reinforcement Learning: A Unifying Algorithm cs.AI · 2017 · author #4
- Multi-step Off-policy Learning Without Importance Sampling Ratios cs.LG · 2017 · author #3
- Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks cs.LG · 2016 · author #3
- Face valuing: Training user interfaces with facial expressions and reinforcement learning cs.HC · 2016 · author #3
- True Online Temporal-Difference Learning cs.AI · 2015 · author #5
- Learning to Predict Independent of Span cs.LG · 2015 · author #2
- True Online Emphatic TD($\lambda$): Quick Reference and Implementation Guide cs.LG · 2015 · author #1
- Emphatic Temporal-Difference Learning cs.LG · 2015 · author #4
- An Empirical Evaluation of True Online TD($\lambda$) cs.AI · 2015 · author #4
- Temporal-Difference Networks cs.LG · 2015 · author #1
- An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning cs.LG · 2015 · author #1
- Temporal-Difference Learning to Assist Human Decision Making during the Control of an Artificial Limb cs.AI · 2013 · author #4
- Planning by Prioritized Sweeping with Small Backups cs.AI · 2013 · author #2
- Scaling Life-long Off-policy Learning cs.AI · 2012 · author #3
- Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping cs.AI · 2012 · author #1
- Off-Policy Actor-Critic cs.LG · 2012 · author #3
- Multi-timescale Nexting in a Reinforcement Learning Robot cs.LG · 2011 · author #3
Mentions
- 1309.4714 #4 · backfill · confidence 0.70 Richard S. Sutton
- 1301.2343 #2 · backfill · confidence 0.70 Richard S. Sutton
- 1206.6262 #3 · backfill · confidence 0.70 Richard S. Sutton
- 1206.3285 #1 · backfill · confidence 0.70 Richard S. Sutton
- 1205.4839 #3 · backfill · confidence 0.70 Richard S. Sutton
- 1112.1133 #3 · backfill · confidence 0.70 Richard S. Sutton
Frequent Coauthors
- Patrick M. Pilarski 8 shared papers
- A. Rupam Mahmood 6 shared papers
- Adam White 5 shared papers
- Martha White 5 shared papers
- Huizhen Yu 4 shared papers
- Sina Ghiassian 4 shared papers
- Vivek Veeriah 4 shared papers
- Harm van Seijen 3 shared papers
- Kristopher De Asis 3 shared papers
- Alex Kearney 2 shared papers
- Ann L. Edwards 2 shared papers
- Banafsheh Rafiee 2 shared papers
- Brendan Bennett 2 shared papers
- Craig Sherstan 2 shared papers
- Jaden B. Travnik 2 shared papers
- J. Fernando Hernandez-Garcia 2 shared papers
- Joseph Modayil 2 shared papers
- Kory W. Mathewson 2 shared papers
- Shangtong Zhang 2 shared papers
- Adam S. R. Parker 1 shared paper