Accuracy-based Curriculum Learning in Deep Reinforcement Learning
In this paper, we investigate a new form of automated curriculum learning based on adaptive selection of accuracy requirements, called accuracy-based curriculum learning. Using a reinforcement learning agent based on the Deep Deterministic Policy Gradient algorithm in the Reacher environment, we first show that an agent trained with accuracy requirements sampled at random learns more efficiently than one asked to be very accurate at all times. We then show that adaptive selection of accuracy requirements, based on a local measure of competence progress, automatically generates a curriculum of progressively increasing difficulty, which yields better learning efficiency than random sampling.
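The adaptive selection described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class name `AccuracyCurriculum`, the windowed success-rate estimate of competence progress, the `explore` mixing weight, and the example accuracy values are all assumptions made for illustration. The sketch keeps a short history of episode outcomes per accuracy requirement and samples more often the requirements whose success rate has changed the most recently.

```python
import random
from collections import deque


class AccuracyCurriculum:
    """Hypothetical sampler over accuracy requirements (illustrative only)."""

    def __init__(self, epsilons, window=20, explore=0.2, seed=0):
        self.epsilons = list(epsilons)   # candidate accuracy requirements
        self.window = window             # outcomes remembered per requirement
        self.explore = explore           # share of uniform random sampling
        self.rng = random.Random(seed)
        self.history = {eps: deque(maxlen=window) for eps in self.epsilons}

    def record(self, eps, success):
        """Store whether an episode run under requirement eps succeeded."""
        self.history[eps].append(1.0 if success else 0.0)

    def competence_progress(self, eps):
        """Absolute change in success rate between the older and the
        newer half of the window (0.0 until the window is full)."""
        outcomes = list(self.history[eps])
        if len(outcomes) < self.window:
            return 0.0
        half = len(outcomes) // 2
        older = sum(outcomes[:half]) / half
        recent = sum(outcomes[half:]) / (len(outcomes) - half)
        return abs(recent - older)

    def probabilities(self):
        """Mix a uniform distribution with one proportional to progress."""
        progress = [self.competence_progress(e) for e in self.epsilons]
        total = sum(progress)
        uniform = 1.0 / len(self.epsilons)
        if total == 0.0:
            # No measurable progress yet: fall back to random sampling.
            return [uniform] * len(self.epsilons)
        return [
            self.explore * uniform + (1.0 - self.explore) * p / total
            for p in progress
        ]

    def sample(self):
        """Draw the accuracy requirement for the next episode."""
        weights = self.probabilities()
        return self.rng.choices(self.epsilons, weights=weights, k=1)[0]
```

In a training loop, one would call `sample()` at the start of each episode, count the episode as a success if the agent met the sampled requirement (for a reaching task, presumably ending within that distance of the target), and pass the outcome to `record()`. With no measurable progress the sampler reduces to the random-sampling baseline mentioned in the abstract.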
Forward citations
Cited by 1 Pith paper
- Scenario Generation for Risk-Aware Reinforcement Learning with Probably Approximately Safe Guarantees
Approximates the distribution of encountered states with a VAE and constructs dual bound barrier certificates to provide probably approximately safe guarantees in RL by optimizing the non-robust region.