Expressing arbitrary reward functions as potential- based advice

Anna Harutyunyan, Sam Devlin, Peter Vrancx, Ann Nowé · 2015

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning

cs.LG · 2026-03-25 · unverdicted · novelty 7.0

LLM-guided synthesis of reward programs yields higher task returns in cooperative multi-agent RL across Overcooked layouts with interaction bottlenecks.

citing papers explorer

Showing 1 of 1 citing paper.

Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning cs.LG · 2026-03-25 · unverdicted · none · ref 17
LLM-guided synthesis of reward programs yields higher task returns in cooperative multi-agent RL across Overcooked layouts with interaction bottlenecks.

Expressing arbitrary reward functions as potential- based advice

fields

years

verdicts

representative citing papers

citing papers explorer