arXiv preprint arXiv:2309.15257 , year=

STARC: A general framework for quantifying differences between reward functions , author= · arXiv 2309.15257

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Learning the Preferences of a Learning Agent

cs.AI · 2026-05-09 · unverdicted · novelty 6.0

Formalizes preference learning from a no-regret or Boltzmann-converging learner with theoretical guarantees or impossibility results for IRL algorithms.

citing papers explorer

Showing 1 of 1 citing paper.

Learning the Preferences of a Learning Agent cs.AI · 2026-05-09 · unverdicted · none · ref 16
Formalizes preference learning from a no-regret or Boltzmann-converging learner with theoretical guarantees or impossibility results for IRL algorithms.

arXiv preprint arXiv:2309.15257 , year=

fields

years

verdicts

representative citing papers

citing papers explorer