pith. sign in

arxiv: 1805.09692 · v2 · pith:4RQZGDREnew · submitted 2018-05-24 · 📊 stat.ML · cs.AI· cs.LG· cs.NE

Been There, Done That: Meta-Learning with Episodic Recall

classification 📊 stat.ML cs.AIcs.LGcs.NE
keywords environmentsmeta-learningagentsepisodictaskstheyarchitectureexplore
0
0 comments X
read the original abstract

Meta-learning agents excel at rapidly learning new tasks from open-ended task distributions; yet, they forget what they learn about each task as soon as the next begins. When tasks reoccur - as they do in natural environments - metalearning agents must explore again instead of immediately exploiting previously discovered solutions. We propose a formalism for generating open-ended yet repetitious environments, then develop a meta-learning architecture for solving these environments. This architecture melds the standard LSTM working memory with a differentiable neural episodic memory. We explore the capabilities of agents with this episodic LSTM in five meta-learning environments with reoccurring tasks, ranging from bandits to navigation and stochastic sequential decision problems.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.