Hierarchical reinforcement learning with the maxq value function decomposition

Thomas G Dietterich · 2000

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives

cs.LG · 2019-06-25 · unverdicted · novelty 6.0

RL policies decompose into information-regularized primitives that compete by requesting state information amounts, with the greediest one acting, yielding better generalization than flat or hierarchical baselines.

citing papers explorer

Showing 1 of 1 citing paper.

Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives cs.LG · 2019-06-25 · unverdicted · none · ref 10
RL policies decompose into information-regularized primitives that compete by requesting state information amounts, with the greediest one acting, yielding better generalization than flat or hierarchical baselines.

Hierarchical reinforcement learning with the maxq value function decomposition

fields

years

verdicts

representative citing papers

citing papers explorer