Towards a Simple Approach to Multi-step Model-based Reinforcement Learning

Dipendra Misra; Evan Cater; Kavosh Asadi; Michael L. Littman

arxiv: 1811.00128 · v1 · pith:E4ENK52Enew · submitted 2018-10-31 · 💻 cs.LG · cs.AI· stat.ML

Towards a Simple Approach to Multi-step Model-based Reinforcement Learning

Kavosh Asadi , Evan Cater , Dipendra Misra , Michael L. Littman This is my paper

classification 💻 cs.LG cs.AIstat.ML

keywords modelmodel-basedmulti-steplearnlearningreinforcementactionadvantage

0 comments

read the original abstract

When environmental interaction is expensive, model-based reinforcement learning offers a solution by planning ahead and avoiding costly mistakes. Model-based agents typically learn a single-step transition model. In this paper, we propose a multi-step model that predicts the outcome of an action sequence with variable length. We show that this model is easy to learn, and that the model can make policy-conditional predictions. We report preliminary results that show a clear advantage for the multi-step model compared to its one-step counterpart.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Is Conditional Generative Modeling all you need for Decision-Making?
cs.LG 2022-11 unverdicted novelty 6.0

Return-conditional diffusion models for policies outperform offline RL on benchmarks by circumventing dynamic programming and enable constraint or skill composition.