pith. sign in

arxiv: 1705.03967 · v1 · pith:FGBYWXQ7new · submitted 2017-05-10 · 💻 cs.LG

GQ(λ) Quick Reference and Implementation Guide

classification 💻 cs.LG
keywords adamalgorithmcodedocumentguideimplementationlambdamaei
0
0 comments X
read the original abstract

This document should serve as a quick reference for and guide to the implementation of linear GQ($\lambda$), a gradient-based off-policy temporal-difference learning algorithm. Explanation of the intuition and theory behind the algorithm are provided elsewhere (e.g., Maei & Sutton 2010, Maei 2011). If you questions or concerns about the content in this document or the attached java code please email Adam White (adam.white@ualberta.ca). The code is provided as part of the source files in the arXiv submission.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.