pith. machine review for the scientific record.

Online intrinsic rewards for decision making agents from large language model feedback. arXiv preprint arXiv:2410.23022

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.AI 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Hierarchical Behaviour Spaces

cs.AI · 2026-04-27 · unverdicted · novelty 6.0

Hierarchical Behaviour Spaces uses linear combinations of reward functions to induce expressive behavior spaces in hierarchical RL, yielding strong performance on NetHack primarily through better exploration rather than long-term planning.

citing papers explorer

Showing 1 of 1 citing paper.

  • Hierarchical Behaviour Spaces cs.AI · 2026-04-27 · unverdicted · none · ref 20
