Philip S. Thomas

Identifiers

name variant Philip S. Thomas 0.60 · backfill

Papers (15)

Classical Policy Gradient: Preserving Bellman's Principle of Optimality cs.LG · 2019 · author #1
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning cs.LG · 2019 · author #2
Learning Action Representations for Reinforcement Learning cs.LG · 2019 · author #5
Privacy Preserving Off-Policy Evaluation cs.LG · 2019 · author #2
Natural Option Critic cs.LG · 2018 · author #2
A Compression-Inspired Framework for Macro Discovery cs.AI · 2017 · author #3
On Ensuring that Intelligent Machines Are Well-Behaved cs.AI · 2017 · author #1
Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines cs.AI · 2017 · author #1
Data-Efficient Policy Evaluation Through Behavior Policy Search cs.AI · 2017 · author #2
Decoupling Learning Rules from Representations cs.AI · 2017 · author #1
Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation cs.AI · 2017 · author #2
Importance Sampling with Unequal Support cs.LG · 2016 · author #1
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning cs.LG · 2016 · author #1
A Notation for Markov Decision Processes cs.AI · 2015 · author #1
Increasing the Action Gap: New Operators for Reinforcement Learning cs.AI · 2015 · author #4

Mentions

1711.09048 #3 · arxiv_oai · confidence 0.70 Philip S. Thomas

Frequent Coauthors

Emma Brunskill 6 shared papers
Francisco M. Garcia 2 shared papers
James Kostas 2 shared papers
Yash Chandak 2 shared papers
Andrew G. Barto 1 shared papers
Arthur Guez 1 shared papers
Billy Okal 1 shared papers
Bruno Castro da Silva 1 shared papers
Bruno C. da Silva 1 shared papers
Chris Nota 1 shared papers
Christoph Dann 1 shared papers
Georgios Theocharous 1 shared papers
Georg Ostrovski 1 shared papers
Gerome Miklau 1 shared papers
Josiah P. Hanna 1 shared papers
Marc G. Bellemare 1 shared papers
Peter Stone 1 shared papers
R\'emi Munos 1 shared papers
Saket Tiwari 1 shared papers
Scott Jordan 1 shared papers