The Effect of Quadrature on the Convergence of Policy Iteration for Hamilton-Jacobi-Bellman Equations
Pith reviewed 2026-06-25 22:38 UTC · model grok-4.3
The pith
Enforcing matching quadrature restores superlinear convergence of policy iteration for Hamilton-Jacobi-Bellman equations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
When finite element discretizations of Hamilton-Jacobi-Bellman equations employ automatic quadrature, the resulting non-matching rules between variational terms cause policy iteration to lose its superlinear convergence. Enforcing matching quadrature across those terms recovers the superlinear rate predicted by theory.
What carries the argument
Matching quadrature rules across the integral terms in the discrete variational formulation of the Hamilton-Jacobi-Bellman equation, which maintains the consistency required for superlinear policy iteration convergence.
If this is right
- Policy iteration exhibits superlinear convergence once quadrature rules are forced to match.
- Automatic quadrature selection in finite element libraries can eliminate the theoretical superlinear rate without user intervention.
- Explicit control over quadrature consistency is necessary to retain expected convergence behavior in variational discretizations of Hamilton-Jacobi-Bellman equations.
- The convergence loss is tied directly to quadrature mismatch in the variational terms rather than to the underlying discretization order.
Where Pith is reading between the lines
- The same quadrature-consistency requirement may affect iterative solvers for other nonlinear variational problems.
- Finite element library designers could add automatic detection or enforcement of quadrature matching for Hamilton-Jacobi-Bellman-type equations.
- Convergence theory for policy iteration on these equations implicitly assumes consistent numerical integration across all terms.
Load-bearing premise
That any observed loss of superlinear convergence stems specifically from the non-matching quadratures rather than from other discretization choices or implementation details.
What would settle it
A controlled numerical test showing superlinear convergence despite deliberately non-matching quadratures, or loss of superlinear convergence even after enforcing matching rules, would contradict the central claim.
Figures
read the original abstract
Modern finite element libraries allow users to express partial differential equations directly in variational form, with the added convenience of automatic quadrature selection. In the context of Hamilton-Jacobi-Bellman (HJB) equations, automatic quadrature selection can result in nonmatching quadratures between different terms that may lead to loss of convergence of the policy iteration, which is otherwise expected from theory to converge superlinearly. The simple remedy of enforcing matching quadrature recovers the expected superlinear convergence.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript examines how automatic quadrature selection in modern finite-element libraries can produce non-matching quadrature rules across terms in the variational formulation of Hamilton-Jacobi-Bellman equations. It reports that this mismatch degrades the discrete operators inside policy iteration, destroying the superlinear convergence that theory predicts, and shows that simply enforcing identical quadrature rules across all terms restores the expected rate.
Significance. If the numerical evidence and implementation details hold, the observation supplies a concrete, low-cost safeguard for practitioners who rely on high-level FEM libraries to discretize HJB problems. It converts an otherwise opaque implementation choice into an explicit requirement for preserving theoretical convergence guarantees.
minor comments (1)
- The abstract states the central observation but supplies neither the precise variational form, the quadrature rules employed, nor any convergence tables; a short methods paragraph or reference to a representative experiment would strengthen the claim even at the abstract level.
Simulated Author's Rebuttal
We thank the referee for their careful summary of our manuscript and for highlighting its potential practical value as a low-cost safeguard for FEM discretizations of HJB equations. The recommendation of 'uncertain' appears to stem from the absence of detailed major comments; we provide the following responses to the referee summary itself and stand ready to supply further implementation details or additional numerical tests if requested.
read point-by-point responses
-
Referee: The manuscript examines how automatic quadrature selection in modern finite-element libraries can produce non-matching quadrature rules across terms in the variational formulation of Hamilton-Jacobi-Bellman equations. It reports that this mismatch degrades the discrete operators inside policy iteration, destroying the superlinear convergence that theory predicts, and shows that simply enforcing identical quadrature rules across all terms restores the expected rate.
Authors: This is an accurate encapsulation of our central observation and numerical demonstration. The paper shows both the degradation under mismatched automatic quadrature and the recovery under enforced matching, consistent with the theoretical superlinear convergence of policy iteration when the discrete operators are consistent. revision: no
-
Referee: If the numerical evidence and implementation details hold, the observation supplies a concrete, low-cost safeguard for practitioners who rely on high-level FEM libraries to discretize HJB problems.
Authors: We agree that the practical takeaway is the explicit requirement to match quadrature rules. The manuscript already includes code snippets and library-specific instructions (FEniCS/DOLFINx) demonstrating how to enforce matching; we can expand the appendix with a minimal reproducible example if the referee or editor desires. revision: partial
Circularity Check
No significant circularity identified
full rationale
The abstract and description present a numerical observation about quadrature mismatch affecting policy iteration convergence rates for HJB equations, with the remedy of matching quadrature restoring expected superlinear behavior. No equations, fitted parameters, self-citations, or derivation steps are supplied that reduce by construction to the paper's own inputs. The claim rests on standard numerical analysis expectations rather than any self-referential mechanism, making the argument self-contained.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Bellman , Dynamic programming , Princeton University Press, Princeton, NJ, 1957
R. Bellman , Dynamic programming , Princeton University Press, Princeton, NJ, 1957
1957
-
[2]
O. Bokanowski, S. Maroso, and H. Zidani , Some convergence results for H oward's algorithm , SIAM J. Numer. Anal., 47 (2009), pp. 3001--3026, https://doi.org/10.1137/08073041X
-
[3]
T. Hall, I. Smears, E. S\"uli, and H. Wells , Code and data for `` The Effect of Quadrature on the Convergence of Policy Iteration for Hamilton--Jacobi--Bellman Equations '' . https://github.com/tomhall-git/hjb-policy-iteration-quadrature, 2026
2026
-
[4]
D. A. Ham, P. H. J. Kelly, L. Mitchell, C. J. Cotter, R. C. Kirby, K. Sagiyama, N. Bouziani, S. Vorderwuelbecke, T. J. Gregory, J. Betteridge, D. R. Shapero, R. W. Nixon-Hill, C. J. Ward, P. E. Farrell, P. D. Brubeck, I. Marsden, T. H. Gibson, M. Homolya, T. Sun, A. T. T. McRae, F. Luporini, A. Gregory, M. Lange, S. W. Funke, F. Rathgeber, G.-T. Bercea, a...
-
[5]
R. A. Howard , Dynamic programming and M arkov processes , Technology Press of M.I.T., Cambridge, MA; John Wiley & Sons, Inc., New York-London, 1960
1960
-
[6]
M. L. Puterman and S. L. Brumelle , On the convergence of policy iteration in stationary dynamic programming , Math. Oper. Res., 4 (1979), pp. 60--69, https://doi.org/10.1287/moor.4.1.60
-
[7]
J. Schoeberl, M. Hochsteger, C. Lackner, M. Neunteufel, C. Lehrenfeld, P. L. Lederer, M. Rambausek, J. Gopalakrishnan, C. Wintersteiger, A. Pechstein, P. Stocker, F. Orlandini, T. Danczul, A. Schlüter, U. Zerbinati, B. Schwarzenbacher, Dies-Das, D. Drake, F. Heimann, F. Eckhofer, F. Ballarin, J. Zimmermann, and L. Kogler , Ngsolve/ngsolve: v6.2.2604 , Apr...
-
[8]
I. Smears and E. S\"uli , Discontinuous G alerkin finite element approximation of H amilton- J acobi- B ellman equations with C ordes coefficients , SIAM J. Numer. Anal., 52 (2014), pp. 993--1016, https://doi.org/10.1137/130909536
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.