Learning Reward Functions by Integrating Human Demonstrations and Preferences

Brian D Ziebart, Andrew L Maas, J Andrew Bagnell, Anind K Dey · 2008

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Learning Reward Functions by Integrating Human Demonstrations and Preferences

cs.RO · 2019-06-21 · unverdicted · novelty 6.0

DemPref uses demonstrations to form a coarse reward prior and ground active preference queries, achieving higher efficiency than pure preference learning and higher user preference than IRL in experiments.

citing papers explorer

Showing 1 of 1 citing paper.

Learning Reward Functions by Integrating Human Demonstrations and Preferences cs.RO · 2019-06-21 · unverdicted · none · ref 46
DemPref uses demonstrations to form a coarse reward prior and ground active preference queries, achieving higher efficiency than pure preference learning and higher user preference than IRL in experiments.

Learning Reward Functions by Integrating Human Demonstrations and Preferences

fields

years

verdicts

representative citing papers

citing papers explorer