
arXiv: 1807.01736 · v1 · pith:JITAOUYInew · submitted 2018-07-04 · 💻 cs.LG · cs.AI · stat.ML

Transfer with Model Features in Reinforcement Learning

classification 💻 cs.LG · cs.AI · stat.ML
keywords: features, learning, model, equivalent, model-reduction, representation, successor, tasks

A key question in Reinforcement Learning is which representation an agent can learn to efficiently reuse knowledge between different tasks. Recently the Successor Representation was shown to have empirical benefits for transferring knowledge between tasks with shared transition dynamics. This paper presents Model Features: a feature representation that clusters behaviourally equivalent states and that is equivalent to a Model-Reduction. Further, we present a Successor Feature model which shows that learning Successor Features is equivalent to learning a Model-Reduction. A novel optimization objective is developed and we provide bounds showing that minimizing this objective results in an increasingly improved approximation of a Model-Reduction. Further, we provide transfer experiments on randomly generated MDPs which vary in their transition and reward functions but approximately preserve behavioural equivalence between states. These results demonstrate that Model Features are suitable for transfer between tasks with varying transition and reward functions.
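To make the Successor Representation mentioned in the abstract concrete, the following is a minimal sketch, not the paper's Model Features method. It assumes a small tabular MDP under a fixed policy and uses the standard closed form Psi = (I - gamma * P_pi)^{-1}. The three-state chain, the reward vectors, and the function name `successor_representation` are hypothetical and chosen only for illustration.

```python
import numpy as np


def successor_representation(P_pi, gamma):
    """Closed-form tabular SR for a fixed policy: Psi = (I - gamma * P_pi)^{-1}."""
    n = P_pi.shape[0]
    return np.linalg.inv(np.eye(n) - gamma * P_pi)


# Hypothetical 3-state chain under a fixed policy (each row sums to 1).
P_pi = np.array([
    [0.9, 0.1, 0.0],
    [0.0, 0.9, 0.1],
    [0.0, 0.0, 1.0],
])
gamma = 0.9

# The SR depends only on the dynamics and the policy, not on the reward.
psi = successor_representation(P_pi, gamma)

# Two hypothetical tasks that share dynamics but differ in reward.
r_task_a = np.array([0.0, 0.0, 1.0])
r_task_b = np.array([1.0, 0.0, 0.0])

# Value functions for both tasks reuse the same SR: v = Psi @ r.
v_task_a = psi @ r_task_a
v_task_b = psi @ r_task_b
```

Because `psi` is computed once and reused for both reward vectors, the sketch shows why the Successor Representation transfers across tasks with shared transition dynamics. Per the abstract, the paper's contribution targets the harder setting where transition functions also vary, which this sketch does not cover.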



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Smaller Abstract State Spaces Enable Cross-Scale Generalization in Reinforcement Learning

    cs.LG 2026-05 unverdicted novelty 5.0

    A bound on OOD test performance in POMDPs decomposes loss into approximation and estimation errors, indicating that smaller abstract state spaces improve generalization in RL agents.