The following result establishes the convergence of the algorithm under the above assumptions

The initial sublevel setL c :={x:f(x)≤c}withc=f(x 0)is bounded (hence compact)

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

cs.LG · 2026-01-26 · unverdicted · novelty 7.0

Foundational geometric, stationarity, and convergence results are derived for Bellman residual minimization applied to policy optimization in MDPs.

Showing 1 of 1 citing paper.

Bellman Residual Minimization for Control: Geometry, Stationarity, and Convergence cs.LG · 2026-01-26 · unverdicted · none · ref 2
Foundational geometric, stationarity, and convergence results are derived for Bellman residual minimization applied to policy optimization in MDPs.