Max-policy iteration is solved via terminating value iteration for integers/floats and min-policy iteration for rationals, with termination and optimality proofs.
In: Conference Record of the 4th ACM Symposium on Principles of Programming Languages
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.PL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Max-Policy Iteration, Revisited
Max-policy iteration is solved via terminating value iteration for integers/floats and min-policy iteration for rationals, with termination and optimality proofs.