Predictive auxiliary objectives in deep rl mimic learning in the brain.arXiv preprint arXiv:2310.06089, 2023

Ching Fang, Kimberly L Stachenfeld · 2023 · arXiv 2310.06089

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Task-Induced Representational Invariances Depend on Learning Objective in Deep RL

cs.LG · 2026-06-01 · unverdicted · novelty 7.0

In navigation tasks, DQN learns MDP-homomorphism-invariant representations while PPO learns action-symmetric ones despite comparable performance, with effects on transfer and in LLMs.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Task-Induced Representational Invariances Depend on Learning Objective in Deep RL cs.LG · 2026-06-01 · unverdicted · none · ref 19
In navigation tasks, DQN learns MDP-homomorphism-invariant representations while PPO learns action-symmetric ones despite comparable performance, with effects on transfer and in LLMs.

Predictive auxiliary objectives in deep rl mimic learning in the brain.arXiv preprint arXiv:2310.06089, 2023

fields

years

verdicts

representative citing papers

citing papers explorer