pith. sign in

Introducing machine learning within an interactive evolutionary design environment

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

stat.ML 1

years

2017 1

verdicts

ACCEPT 1

representative citing papers

Deep reinforcement learning from human preferences

stat.ML · 2017-06-12 · accept · novelty 7.0

Reinforcement learning agents solve complex tasks without access to the reward function by training a reward predictor from human comparisons of trajectory segments, requiring feedback on less than 1% of interactions.

citing papers explorer

Showing 1 of 1 citing paper.

  • Deep reinforcement learning from human preferences stat.ML · 2017-06-12 · accept · none · ref 8

    Reinforcement learning agents solve complex tasks without access to the reward function by training a reward predictor from human comparisons of trajectory segments, requiring feedback on less than 1% of interactions.