2014.Markov decision processes: discrete stochastic dynamic programming

Martin L Puterman · 2014

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

representative citing papers

Robustness Analysis of POMDP Policies to Observation Perturbations

cs.AI · 2026-04-23 · unverdicted · novelty 7.0

POMDP policies can be checked for robustness to observation model changes by solving a bi-level optimization via root-finding with the Robust Interval Search algorithm, which runs in polynomial time for non-sticky history-independent deviations when using finite-state controllers.

Generative Auto-Bidding with Unified Modeling and Exploration

cs.AI · 2026-05-19 · unverdicted · novelty 6.0

GUIDE integrates a Decision Transformer for joint modeling of bidding actions and states with Q-value regularization for exploration and an IDM for safe policy fallback, outperforming baselines in simulations and real Taobao deployment with gains in GMV, clicks, cost, and ROI.

Co-Investment in Mobile Edge Computing with Infrastructure Update and Dynamic Participation

cs.GT · 2025-10-17 · unverdicted · novelty 5.0

A coalitional game model for MEC co-investment that incorporates resource updates and dynamic player participation to increase total payoffs and strengthen investment incentives.

citing papers explorer

Showing 3 of 3 citing papers.

Robustness Analysis of POMDP Policies to Observation Perturbations cs.AI · 2026-04-23 · unverdicted · none · ref 50
POMDP policies can be checked for robustness to observation model changes by solving a bi-level optimization via root-finding with the Robust Interval Search algorithm, which runs in polynomial time for non-sticky history-independent deviations when using finite-state controllers.
Generative Auto-Bidding with Unified Modeling and Exploration cs.AI · 2026-05-19 · unverdicted · none · ref 32
GUIDE integrates a Decision Transformer for joint modeling of bidding actions and states with Q-value regularization for exploration and an IDM for safe policy fallback, outperforming baselines in simulations and real Taobao deployment with gains in GMV, clicks, cost, and ROI.
Co-Investment in Mobile Edge Computing with Infrastructure Update and Dynamic Participation cs.GT · 2025-10-17 · unverdicted · none · ref 31
A coalitional game model for MEC co-investment that incorporates resource updates and dynamic player participation to increase total payoffs and strengthen investment incentives.

2014.Markov decision processes: discrete stochastic dynamic programming

fields

years

verdicts

representative citing papers

citing papers explorer