
arxiv: 1809.09501 · v1 · pith:HJEMFWEDnew · submitted 2018-09-25 · 💻 cs.LG · stat.ML

Anderson Acceleration for Reinforcement Learning

classification 💻 cs.LG stat.ML
keywords acceleration · anderson · applied · learning · reinforcement · discuss · accelerating · been

Anderson acceleration is an old and simple method for accelerating the computation of a fixed point. However, as far as we know and quite surprisingly, it has never been applied to dynamic programming or reinforcement learning. In this paper, we briefly explain what Anderson acceleration is and how it can be applied to value iteration. Preliminary experiments, which we discuss critically, show a significant speed-up of convergence. We also discuss how this idea could be applied more generally to (deep) reinforcement learning.
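
To make the idea in the abstract concrete, here is a minimal sketch of Anderson acceleration wrapped around value iteration. It is not the paper's implementation: it uses a generic textbook form of Anderson mixing (combine the last `m` Bellman images with weights that minimize the combined residual, subject to the weights summing to 1). The function names, the random test MDP, the small regularization term, and the safeguard that falls back to a plain Bellman update are all assumptions added for illustration.

```python
import numpy as np

def bellman(v, P, R, gamma):
    """Bellman optimality operator. P has shape (A, S, S), R has shape (A, S)."""
    return np.max(R + gamma * (P @ v), axis=0)

def plain_vi(P, R, gamma, tol=1e-8, max_iter=100_000):
    """Standard value iteration; returns (value, number of Bellman evaluations)."""
    v = np.zeros(P.shape[1])
    for k in range(1, max_iter + 1):
        g = bellman(v, P, R, gamma)
        if np.max(np.abs(g - v)) < tol:
            return g, k
        v = g
    return v, max_iter

def anderson_vi(P, R, gamma, m=5, tol=1e-8, max_iter=10_000):
    """Value iteration with Anderson mixing over the last m iterates.

    Returns (value, number of Bellman evaluations).
    """
    v = np.zeros(P.shape[1])
    g = bellman(v, P, R, gamma)
    evals = 1
    gs, fs = [], []  # history of Bellman images g_i and residuals f_i = g_i - v_i
    for _ in range(max_iter):
        res = np.max(np.abs(g - v))
        if res < tol:
            return g, evals
        gs, fs = (gs + [g])[-m:], (fs + [g - v])[-m:]
        # Minimize ||F @ alpha|| subject to sum(alpha) = 1 via the normal equations.
        F = np.stack(fs, axis=1)
        M = F.T @ F
        M += 1e-10 * np.trace(M) * np.eye(len(fs))  # assumed regularization
        w = np.linalg.solve(M, np.ones(len(fs)))
        alpha = w / w.sum()
        cand = np.stack(gs, axis=1) @ alpha
        g_cand = bellman(cand, P, R, gamma)
        evals += 1
        # Assumed safeguard: keep the accelerated iterate only if its residual
        # shrinks at least as fast as a plain update guarantees.
        if np.max(np.abs(g_cand - cand)) <= gamma * res:
            v, g = cand, g_cand
        else:
            v, g = g, bellman(g, P, R, gamma)
            evals += 1
    return g, evals

# Hypothetical random MDP used only to exercise the two solvers.
rng = np.random.default_rng(0)
A, S, gamma = 3, 20, 0.9
P = rng.random((A, S, S))
P /= P.sum(axis=2, keepdims=True)  # make each row a probability distribution
R = rng.random((A, S))

v_aa, n_aa = anderson_vi(P, R, gamma)
v_vi, n_vi = plain_vi(P, R, gamma)
print("Bellman evaluations (Anderson, plain):", n_aa, n_vi)
print("max difference between solutions:", np.max(np.abs(v_aa - v_vi)))
```

Because the Bellman optimality operator is not smooth, unguarded Anderson mixing is not guaranteed to converge. The fallback above keeps the residual shrinking by at least a factor `gamma` per iteration, at the cost of an extra Bellman evaluation whenever the accelerated candidate is rejected.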

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Fault Tolerance of Accelerated Asynchronous Fixed-Point Iterations on Flexible Computing Infrastructure

    cs.DC · 2026-05 · unverdicted · novelty 6.0

    Asynchronous execution yields 2.9x-16.9x speedups across Jacobi, value iteration, and SCF methods; Anderson acceleration succeeds only under evaluation-level perturbation, not iterate-level corruption.