On the role of information structure in reinforcement learning for partially-observable sequential teams and games

· 2024 · arXiv 2403.00993

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Approximations and Learning for Decentralized Stochastic Control and Near Optimal Finite Window Policies

math.OC · 2026-04-30 · unverdicted · novelty 7.0

Finite sliding window policies achieve near-optimality and Q-learning converges to them for decentralized stochastic control under OSDISP and KSPISP information structures when a predictor stability condition holds in expected total variation.

citing papers explorer

Showing 1 of 1 citing paper.

Approximations and Learning for Decentralized Stochastic Control and Near Optimal Finite Window Policies · math.OC · 2026-04-30 · unverdicted · none · ref 1