Guided cost learning: Deep inverse optimal control via policy optimization

Chelsea Finn, Sergey Levine, Pieter Abbeel · 2016

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Benchmarking Model-Based Reinforcement Learning

cs.LG · 2019-07-03 · accept · novelty 7.0

Introduces a benchmark suite of over 18 MBRL environments, evaluates multiple algorithms under consistent settings, and identifies three core challenges: dynamics bottleneck, planning horizon dilemma, and early-termination dilemma.

Learning Reward Functions by Integrating Human Demonstrations and Preferences

cs.RO · 2019-06-21 · unverdicted · novelty 6.0

DemPref uses demonstrations to form a coarse reward prior and ground active preference queries, achieving higher efficiency than pure preference learning and higher user preference than IRL in experiments.

citing papers explorer

Showing 2 of 2 citing papers.

Benchmarking Model-Based Reinforcement Learning cs.LG · 2019-07-03 · accept · none · ref 17
Introduces a benchmark suite of over 18 MBRL environments, evaluates multiple algorithms under consistent settings, and identifies three core challenges: dynamics bottleneck, planning horizon dilemma, and early-termination dilemma.
Learning Reward Functions by Integrating Human Demonstrations and Preferences cs.RO · 2019-06-21 · unverdicted · none · ref 18
DemPref uses demonstrations to form a coarse reward prior and ground active preference queries, achieving higher efficiency than pure preference learning and higher user preference than IRL in experiments.

Guided cost learning: Deep inverse optimal control via policy optimization

fields

years

verdicts

representative citing papers

citing papers explorer