Proximal Evolutionary Strategy: Improving deep reinforcement learning through evolutionary policy optimization,
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.LG (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers:
Evolutionary Discovery of Developmental Reward Schedules in Deep Reinforcement Learning
Evolutionary optimization discovers developmental reward schedules that improve performance over extrinsic-only baselines on some MiniGrid tasks, with novelty emerging as the dominant early signal.