Monte-carlo expectation maximization for decentralized pomdps.Proccedings of the International joint conference on artificial intelligence, 2013

Feng Wu, Shlomo Zilberstein, Nicholas R Jennings · 2013

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Approximations and Learning for Decentralized Stochastic Control and Near Optimal Finite Window Policies

math.OC · 2026-04-30 · unverdicted · novelty 7.0

Finite sliding window policies achieve near-optimality and Q-learning converges to them for decentralized stochastic control under OSDISP and KSPISP information structures when a predictor stability condition holds in expected total variation.

citing papers explorer

Showing 1 of 1 citing paper.

Approximations and Learning for Decentralized Stochastic Control and Near Optimal Finite Window Policies math.OC · 2026-04-30 · unverdicted · none · ref 54
Finite sliding window policies achieve near-optimality and Q-learning converges to them for decentralized stochastic control under OSDISP and KSPISP information structures when a predictor stability condition holds in expected total variation.

Monte-carlo expectation maximization for decentralized pomdps.Proccedings of the International joint conference on artificial intelligence, 2013

fields

years

verdicts

representative citing papers

citing papers explorer