pith. sign in

arxiv: 2604.05882 · v1 · submitted 2026-04-07 · 🧮 math.OC · math.AP

Lecture Note for Bounded Controls in Continuous-Time and Control of Several Variables

Pith reviewed 2026-05-10 18:50 UTC · model grok-4.3

classification 🧮 math.OC math.AP
keywords optimal controlbounded controlsPontryagin maximum principlebox constraintsprojection formulaforward-backward sweepadjoint systemscontinuous-time control
0
0 comments X

The pith

Optimal control problems with box constraints on the control require a modified Pontryagin maximum principle that includes intrinsic projection onto the admissible set.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This lecture note develops the first-order necessary conditions for continuous-time optimal control when the control variable is restricted to a compact box. It shows how Pontryagin's maximum principle must be adjusted so that the control is projected or clamped directly within the optimality system. The note stresses the difference between applying this projection inside the coupled forward-backward equations and simply truncating an unconstrained solution after the fact. Correct implementation via forward-backward sweep ensures the state and adjoint remain consistent with the bounds throughout. This handling matters for any applied problem in which controls represent physical quantities that cannot exceed given limits.

Core claim

The paper establishes the precise modification of Pontryagin's maximum principle for compact admissible control sets, supplies the projection or clamping formula for scalar quadratic Hamiltonians, and demonstrates the distinction between intrinsic projection inside the optimality system and post-hoc truncation of an unconstrained solution, together with the corresponding forward-backward sweep procedure.

What carries the argument

The projection or clamping formula that maps the candidate control derived from the Hamiltonian back into the box bounds, applied inside the coupled optimality system rather than after solution.

If this is right

  • The optimality system now contains the projected control at each instant along the trajectory.
  • Forward-backward sweep iterations apply the projection at every step to keep state and adjoint consistent.
  • Post-hoc truncation of an unconstrained solution can produce incorrect adjoint variables and higher costs.
  • The same projection extends componentwise to problems with several control variables.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Numerical implementations that embed the projection inside the iteration will avoid the inconsistencies that arise from external clipping.
  • Biological models with bounded inputs, as referenced in the source book, would be natural test cases to observe whether the two approaches yield measurably different trajectories.
  • The distinction suggests that some existing numerical codes for bounded optimal control may need re-examination if they rely on post-processing truncation.

Load-bearing premise

Readers already understand variational arguments, adjoint systems, and basic nonlinear analysis for unconstrained problems.

What would settle it

Solve a simple scalar linear-quadratic optimal control problem with box bounds once by clamping inside the forward-backward system and once by solving unconstrained then truncating afterward; different final costs or trajectories would confirm the distinction is required.

Figures

Figures reproduced from arXiv: 2604.05882 by Louis Shuo Wang.

Figure 1
Figure 1. Figure 1: The projection formula in action. The unconstrained ideal control [PITH_FULL_IMAGE:figures/full_fig_p007_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Geometric interpretation of the variational inequality [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗
read the original abstract

In this note, we develop the first-order theory of optimal control problems with box constraints on the control. We emphasize the precise modification of Pontryagin's maximum principle when the admissible control set is compact, the projection/clamping formula for scalar quadratic Hamiltonians, the distinction between intrinsic projection inside the optimality system and post hoc truncation of an unconstrained solution, and the corresponding forward-backward sweep implementation. The presentation is pitched at senior PhD students who are already comfortable with variational arguments, adjoint systems, and basic nonlinear analysis. These notes are mainly based on the book ``optimal control applied to biological models'' of Suzanne Lenhart and John T. Workman.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 3 minor

Summary. This lecture note develops the first-order theory of optimal control problems with box constraints on the control. It emphasizes the precise modification of Pontryagin's maximum principle for compact admissible control sets, the projection/clamping formula for scalar quadratic Hamiltonians, the distinction between intrinsic projection inside the optimality system versus post-hoc truncation of an unconstrained solution, and the corresponding forward-backward sweep implementation. The presentation is based primarily on the book by Lenhart and Workman and is pitched at senior PhD students already familiar with variational arguments, adjoint systems, and basic nonlinear analysis.

Significance. If the exposition accurately captures the standard theory, the note provides a focused pedagogical resource that clarifies subtle but practically important distinctions in applying Pontryagin's principle to bounded controls. The explicit treatment of intrinsic projection within the optimality system versus post-hoc truncation, together with the forward-backward sweep implementation, strengthens understanding of how to derive and solve the necessary conditions correctly; this is a useful supplement for students and applied researchers even though no new theorems or derivations are introduced.

minor comments (3)
  1. [Title] The title appears truncated (ending with 'and Control of Several Variables'); a complete title would improve discoverability and clarity.
  2. [Introduction] While the abstract states the notes are 'mainly based on' the Lenhart-Workman book, a short paragraph in the introduction citing the specific chapters or theorems being re-presented would help readers cross-reference the source material.
  3. [Preliminaries] Notation for the admissible control set (e.g., the precise definition of the box constraints) should be introduced once in a dedicated preliminary section rather than assumed from the referenced book, to make the note more self-contained.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive and constructive report. The assessment that the lecture notes clarify subtle distinctions in applying Pontryagin's principle to box-constrained controls, while remaining accessible to senior PhD students, aligns with our intent. We appreciate the recommendation for minor revision and have used the opportunity to improve readability and precision in several places.

Circularity Check

0 steps flagged

Expository lecture note with no circular derivations or self-referential claims

full rationale

The manuscript is explicitly positioned as a pedagogical clarification of standard first-order necessary conditions (modified PMP for compact control sets, clamping formulas, intrinsic vs. post-hoc projection) drawn from the external book by Lenhart and Workman. No novel theorems, parameter fits, or derivations are asserted; the note contains no self-citations, no fitted inputs renamed as predictions, and no load-bearing steps that reduce to the paper's own inputs by construction. All content is self-contained against the cited external reference.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The lecture note relies on established domain assumptions from optimal control theory without introducing new free parameters, axioms beyond standard ones, or invented entities.

axioms (1)
  • domain assumption Standard assumptions of optimal control theory including existence of solutions, differentiability of the Hamiltonian, and validity of Pontryagin's maximum principle for unconstrained cases.
    Invoked as the foundation for the modifications described when adding box constraints.

pith-pipeline@v0.9.0 · 5399 in / 1352 out tokens · 62409 ms · 2026-05-10T18:50:30.732731+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. A Diagnostic Framework for Implementation Risk in Bilevel Decision Problems: The Ambiguity Premium and the Robustness--Efficiency Frontier

    math.OC 2026-05 unverdicted novelty 6.0

    The paper defines the ambiguity premium Δ_ε(x) as the gap between pessimistic and optimistic upper-level values over ε-optimal follower responses and provides bounds plus a screening workflow to trace robustness-effic...

  2. Optimization Workshop Notes for Mathematical Programming with Equilibrium Constraints Algorithms: Penalty Interior-Point, Implicit-Programming, and Piecewise SQP

    math.OC 2026-04 unverdicted novelty 2.0

    Workshop notes explain models, subproblems, globalization, and convergence assumptions for PIPA, monotone-LCP PIPA, implicit-programming, and PSQP algorithms applied to MPECs.

Reference graph

Works this paper leans on

71 extracted references · 71 canonical work pages · cited by 2 Pith papers

  1. [1]

    King and J

    D. King and J. Roughgarden. Graded allocation between vegetative and reproductive growth for annual plants in growing seasons of random length.Theoretical Population Biology, 22(1): 1–16, 1982. doi: 10.1016/0040-5809(82)90032-6

  2. [2]

    D. Cohen. Maximizing final yield when growth is limited by time or by limiting resources. Journal of Theoretical Biology, 33(2):299–307, 1971. doi: 10.1016/0022-5193(71)90068-3

  3. [3]

    Barbu, editor.Analysis and control of nonlinear infinite dimensional systems

    V. Barbu, editor.Analysis and control of nonlinear infinite dimensional systems. Number v. 190 in Mathematics in science and engineering. Academic Press, Boston, 1993. ISBN 978-0-08-095876-7. 20

  4. [4]

    Barbu.Mathematical methods in optimization of differential systems

    V. Barbu.Mathematical methods in optimization of differential systems. Number 310 in Mathematics and Its Applications. Springer, Dordrecht, 1994. ISBN 978-94-010-4327-4 978-94- 011-0760-0. doi: 10.1007/978-94-011-0760-0

  5. [5]

    Li and J

    X. Li and J. Yong.Optimal control theory for infinite dimensional systems. Birkhäuser Boston, Boston, MA, 1995. ISBN 978-1-4612-8712-4 978-1-4612-4260-4. doi: 10.1007/978-1-4612-4260-4

  6. [6]

    Z. Liu, L. S. Wang, J. Yu, J. Zhang, E. Martel, and S. Li. Bidirectional endothelial feedback drives turing-vascular patterning and drug-resistance niches: a hybrid PDE-agent-based study. Bioengineering, 12(10):1097, 2025. doi: 10.3390/bioengineering12101097

  7. [7]

    D. H. Jacobson, D. H. Martin, M. Pachter, and T. Geveci, editors.Extensions of linear-quadratic control theory, volume 27 ofLecture Notes in Control and Information Sciences. Springer-Verlag, Berlin/Heidelberg, 1980. ISBN 978-3-540-10069-0. doi: 10.1007/BFb0004370

  8. [8]

    Yong and X

    J. Yong and X. Y. Zhou.Stochastic controls. Springer New York, New York, NY, 1999. ISBN 978-1-4612-7154-3 978-1-4612-1466-3. doi: 10.1007/978-1-4612-1466-3

  9. [9]

    Z. Wang, D. Wang, and J. Yu. Multi-strategy Hybrid Improved Intelligent Algorithm for Solving UAV-MTSP.Information Technology and Control, 54(2):413–438, 2025. doi: 10.5755/ j01.itc.54.2.40640

  10. [10]

    W. R. Wade.An introduction to analysis. Pearson/Prentice Hall, Upper Saddle River, N.J, 4th ed edition, 2010. ISBN 978-0-13-229638-0

  11. [11]

    D. D. Mooney and R. J. Swift.A course in mathematical modeling. Classroom resource materials. Mathematical Association of America, Washington, DC, 1999. ISBN 978-0-88385-712-0

  12. [12]

    Edelstein-Keshet.Mathematical models in biology

    L. Edelstein-Keshet.Mathematical models in biology. Society for Industrial and Applied Mathematics, 2005. ISBN 978-0-89871-554-5 978-0-89871-914-7. doi: 10.1137/1.9780898719147

  13. [13]

    D. S. Jones, M. Plank, and B. D. Sleeman.Differential equations and mathematical biology. Chapman and Hall/CRC, 0 edition, 2009. ISBN 978-0-429-13659-7. doi: 10.1201/9781420083583

  14. [14]

    Kot.Elements of mathematical ecology

    M. Kot.Elements of mathematical ecology. Cambridge University Press, 1 edition, 2001. ISBN 978-0-521-00150-2 978-0-521-80213-0 978-0-511-60852-0. doi: 10.1017/CBO9780511608520

  15. [15]

    J. D. Murray, editor.Mathematical biology: II: Spatial models and biomedical applications, volume 18 ofInterdisciplinary Applied Mathematics. Springer New York, New York, NY, 2003. ISBN 978-0-387-95228-4 978-0-387-22438-1. doi: 10.1007/b98869

  16. [16]

    L. S. Pontryagin.Mathematical theory of optimal processes. Routledge, 1 edition, 2018. ISBN 978-0-203-74931-9. doi: 10.1201/9780203749319

  17. [17]

    J. Yu, L. S. Wang, Z. Liu, and J. Liu. Pattern suppression and recovery under one-way versus two-way chemotactic coupling in hybrid partial differential equation–ordinary differential equation models.Transport Phenomena, 2026. doi: 10.1515/tp-2026-0023

  18. [18]

    Rudin.Real and complex analysis

    W. Rudin.Real and complex analysis. McGraw-Hill, New York, 3rd ed edition, 1987. ISBN 978-0-07-054234-1

  19. [19]

    E. M. Stein and R. Shakarchi.Real analysis: Measure theory, integration, and Hilbert spaces. Number v. 3 in Princeton lectures in analysis. Princeton University Press, Princeton, N.J, 2005. ISBN 978-0-691-11386-9. 21

  20. [20]

    L. S. Wang and J. Yu. Analysis framework for stochastic predator–prey model with demographic noise.Results in Applied Mathematics, 27:100621, 2025. doi: 10.1016/j.rinam.2025.100621

  21. [21]

    F. H. Clarke.Optimization and nonsmooth analysis. Society for Industrial and Applied Mathematics, January 1990. ISBN 978-0-89871-256-8 978-1-61197-130-9. doi: 10.1137/1. 9781611971309

  22. [22]

    SpringerNewYork, New York, NY, 1975

    W.FlemingandR.Rishel.Deterministic and stochastic optimal control. SpringerNewYork, New York, NY, 1975. ISBN 978-1-4612-6382-1 978-1-4612-6380-7. doi: 10.1007/978-1-4612-6380-7

  23. [23]

    Y. Gao, L. Li, and J. Yu. Rolling prediction model of closing price based on EEMD data noise reduction and HGS-DELM. In2022 International Conference on Data Analytics, Computing and Artificial Intelligence (ICDACAI), pages 255–260, Zakopane, Poland, 2022. IEEE. ISBN 978-1-6654-5470-4. doi: 10.1109/ICDACAI57211.2022.00059

  24. [24]

    M. I. Kamien and N. L. Schwartz.Dynamic optimization: The calculus of variations and optimal control in economics and management. Number 31 in Advanced textbooks in economics. Elsevier, Amsterdam Heidelberg, 2. ed., 7. impr edition, 2003. ISBN 978-0-444-01609-6

  25. [25]

    Macki and A

    J. Macki and A. Strauss.Introduction to optimal control theory. Undergraduate Texts in Mathematics. Springer New York, New York, NY, 1982. ISBN 978-1-4612-5673-1 978-1-4612- 5671-7. doi: 10.1007/978-1-4612-5671-7

  26. [26]

    Cesari.Optimization—theory and applications

    L. Cesari.Optimization—theory and applications. Springer New York, New York, NY, 1983. ISBN 978-1-4613-8167-9 978-1-4613-8165-5. doi: 10.1007/978-1-4613-8165-5

  27. [27]

    D. L. Lukes.Differential equations: Classical to controlled. Number v. 162 in Mathematics in science and engineering. Academic Press, New York, 1982. ISBN 978-0-08-095668-8

  28. [28]

    A. F. Filippov. On certain questions in the theory of optimal control.Journal of the Society for Industrial and Applied Mathematics Series A Control, 1(1):76–84, 1962. doi: 10.1137/0301006

  29. [29]

    Edelstein-Keshet

    L. Edelstein-Keshet. Mathematical methods and models in the biological sciences vol. 1 (martin eisen).SIAM Review, 33(1):139–141, 1991. doi: 10.1137/1033030

  30. [30]

    L. D. Berkovitz. Optimal control theory.The American Mathematical Monthly, 83(4):225–239,

  31. [31]

    doi: 10.1080/00029890.1976.11994086

  32. [32]

    Sethi.Optimal control theory

    S. Sethi.Optimal control theory. Springer-Verlag, New York, 2000. ISBN 978-0-387-28092-9. doi: 10.1007/0-387-29903-3

  33. [33]

    F. L. Lewis.Optimal control. Wiley, New York, 1986. ISBN 978-0-471-81240-1

  34. [34]

    R. L. Burden and J. D. Faires.Numerical analysis. Brooks/Cole Publ. Co, Pacific Grove, Calif. Bonn, 6. ed., [nachdr.] edition, 1997. ISBN 978-0-534-95532-8

  35. [35]

    L. S. Wang and J. Yu. Algebraic–spectral thresholds and discrete–continuous stability transfer in Leslie–Gower systems.Electronic Research Archive, 34(1):251–290, 2026. doi: 10.3934/era. 2026013

  36. [36]

    De Feo, S

    F. De Feo, S. Federico, and A. Swiech. Optimal control of stochastic delay differential equations and applications to path-dependent financial and economic models.SIAM Journal on Control and Optimization, 62(3):1490–1520, 2024. doi: 10.1137/23M1553960. 22

  37. [37]

    H. Wang, J. Yong, and J. Zhang. Path dependent Feynman–Kac formula for forward backward stochastic Volterra integral equations.Annales de l’Institut Henri Poincaré, Probabilités et Statistiques, 58(2), 2022. doi: 10.1214/21-AIHP1158

  38. [38]

    Federico, D

    S. Federico, D. Ghilli, and F. Gozzi. Linear-quadratic mean field games in hilbert spaces.SIAM Journal on Mathematical Analysis, 57(6):5821–5853, 2025. doi: 10.1137/24M1642895

  39. [39]

    Lin and J

    P. Lin and J. Yong. Controlled singular Volterra integral equations and Pontryagin maximum principle.SIAM Journal on Control and Optimization, 58(1):136–164, 2020. ISSN 0363-0129, 1095-7138. doi: 10.1137/19M124602X

  40. [40]

    S.-F. Wang, L. Hu, and L.-F. Nie. Global dynamics and optimal control of an age-structure Malaria transmission model with vaccination and relapse.Chaos, Solitons & Fractals, 150: 111216, 2021. doi: 10.1016/j.chaos.2021.111216

  41. [41]

    Bayen, A

    T. Bayen, A. Bouali, and L. Bourdin. The hybrid maximum principle for optimal control problems with spatially heterogeneous dynamics is a consequence of a pontryagin maximum principle for L1-local solutions.SIAM Journal on Control and Optimization, 62(4):2412–2432,

  42. [42]

    doi: 10.1137/23M155311X

  43. [43]

    H. Wang, J. Yong, and C. Zhou. Linear-quadratic optimal controls for stochastic volterra integral equations: Causal state feedback and path-dependent riccati equations.SIAM Journal on Control and Optimization, 61(4):2595–2629, 2023. doi: 10.1137/22M1492696

  44. [44]

    Huang and H.-C

    J.-P. Huang and H.-C. Zhou. Infinite horizon linear quadratic optimal control problems for singular Volterra integral equations.SIAM Journal on Control and Optimization, 63(1):57–85,

  45. [45]

    doi: 10.1137/24M1641713

  46. [46]

    Dou and Q

    F. Dou and Q. Lü. Time-inconsistent linear quadratic optimal control problems for stochastic evolution equations.SIAM Journal on Control and Optimization, 58(1):485–509, 2020. doi: 10.1137/19M1250339

  47. [47]

    H. Liu, G. Wang, Y. Xu, and H. Yu. Characterizations of complete stabilizability.SIAM Journal on Control and Optimization, 60(4):2040–2069, 2022. doi: 10.1137/20M1386761

  48. [48]

    Liang, L

    Y. Liang, L. S. Wang, J. Yu, and Z. Liu. Global well-posedness and stability of nonlocal damage-structured lineage model with feedback and dedifferentiation.Mathematics, 13(22): 3583, 2025. doi: 10.3390/math13223583

  49. [49]

    Wang and J

    T. Wang and J. Yong. Spike variations for stochastic volterra integral equations.SIAM Journal on Control and Optimization, 61(6):3608–3634, 2023. doi: 10.1137/22M1522097

  50. [50]

    Trélat, X

    E. Trélat, X. Zeng, and C. Zhang. The exponential turnpike property for periodic linear quadratic optimal control problems in infinite dimension.SIAM Journal on Control and Optimization, 63(4):2524–2546, 2025. doi: 10.1137/24M1638975

  51. [51]

    Calvia, S

    A. Calvia, S. Federico, and F. Gozzi. State constrained control problems in banach lattices and applications.SIAM Journal on Control and Optimization, 59(6):4481–4510, 2021. doi: 10.1137/20M1376959

  52. [52]

    Hasenohr, C

    I. Hasenohr, C. Pouchol, Y. Privat, and C. Zhang. Computer-assisted proofs of nonreachability for finite-dimensional linear control systems.SIAM Journal on Control and Optimization, 63 (5):3272–3296, 2025. doi: 10.1137/24M1658711. 23

  53. [53]

    L. S. Wang, J. Yu, S. Li, and Z. Liu. Analysis and mean-field limit of a hybrid PDE-ABM modeling angiogenesis-regulated resistance evolution.Mathematics, 13(17):2898, 2025. doi: 10.3390/math13172898

  54. [54]

    Conforti, A

    G. Conforti, A. Durmus, and M. G. Silveri. KL convergence guarantees for score diffusion models under minimal data assumptions.SIAM Journal on Mathematics of Data Science, 7(1): 86–109, 2025. doi: 10.1137/23M1613670

  55. [55]

    Szpruch, T

    L. Szpruch, T. Treetanthiploet, and Y. Zhang. Optimal scheduling of entropy regularizer for continuous-time linear-quadratic reinforcement learning.SIAM Journal on Control and Optimization, 62(1):135–166, 2024. ISSN 0363-0129, 1095-7138. doi: 10.1137/22M1515744

  56. [56]

    X. Li, D. Verma, and L. Ruthotto. A neural network approach for stochastic optimal control. SIAM Journal on Scientific Computing, 46(5):C535–C556, 2024. doi: 10.1137/23M155832X

  57. [57]

    Archibald, F

    R. Archibald, F. Bao, Y. Cao, and H. Sun. Numerical analysis for convergence of a sample-wise backpropagation method for training stochastic neural networks.SIAM Journal on Numerical Analysis, 62(2):593–621, 2024. doi: 10.1137/22M1523765

  58. [58]

    W. Tang, Y. P. Zhang, and X. Y. Zhou. Exploratory HJB equations and their convergence. SIAM Journal on Control and Optimization, 60(6):3191–3216, 2022. doi: 10.1137/21M1448185

  59. [59]

    Giegrich, C

    M. Giegrich, C. Reisinger, and Y. Zhang. Convergence of policy gradient methods for finite- horizon exploratory linear-quadratic control problems.SIAM Journal on Control and Opti- mization, 62(2):1060–1092, 2024. doi: 10.1137/22M1533517

  60. [60]

    J. Han, W. Hu, J. Long, and Y. Zhao. Deep picard iteration for high-dimensional nonlinear PDEs.SIAM Journal on Scientific Computing, 48(1):C1–C24, 2026. doi: 10.1137/24M169312X

  61. [61]

    Liu and D

    H. Liu and D. Firoozi. Hilbert space-valued LQ mean field games: an infinite-dimensional analysis.SIAM Journal on Control and Optimization, 63(5):3297–3327, 2025. doi: 10.1137/ 24M1675096

  62. [62]

    W. Cai, S. Fang, and T. Zhou. SOC-MartNet: A martingale neural network for the hamil- ton–jacobi–bellman equation without explicit inf H in stochastic optimal controls.SIAM Journal on Scientific Computing, 47(4):C795–C819, 2025. doi: 10.1137/24M1681033

  63. [63]

    Mayorga and A

    S. Mayorga and A. Swiech. Finite dimensional approximations of Hamilton–Jacobi–Bellman equations for stochastic particle systems with common noise.SIAM Journal on Control and Optimization, 61(2):820–851, 2023. doi: 10.1137/22M1489186

  64. [64]

    L. S. Wang, J. Yu, and Z. Liu. A damage-structured PDE model of stem cell hierarchies: The dual role of dedifferentiation in tissue homeostasis and aging.PLOS One, 21(2):e0335163, 2026. doi: 10.1371/journal.pone.0335163

  65. [65]

    Zhou and J

    M. Zhou and J. Lu. A policy gradient framework for stochastic optimal control problems with global convergence guarantee.SIAM Journal on Control and Optimization, 63(4):2605–2631,

  66. [66]

    doi: 10.1137/23M1570739

  67. [67]

    W. Meng, J. Shi, T. Wang, and J.-F. Zhang. A general maximum principle for optimal control of stochastic differential delay systems.SIAM Journal on Control and Optimization, 63(1): 175–205, 2025. doi: 10.1137/23M1552024. 24

  68. [68]

    T. Lew, R. Bonalli, and M. Pavone. Sample average approximation for stochastic programming with equality constraints.SIAM Journal on Optimization, 34(4):3506–3533, 2024. doi: 10. 1137/23M1573227

  69. [69]

    Huang, Y

    Y. Huang, Y. Jia, and X. Y. Zhou. Sublinear regret for a class of continuous-time linear- quadratic reinforcement learning problems.SIAM Journal on Control and Optimization, 63(5): 3452–3474, 2025. doi: 10.1137/24M1695075

  70. [70]

    Y. Wang, J. Liu, A. Bensoussan, K.-F. C. Yiu, and J. Wei. On stochastic control problems with higher-order moments.SIAM Journal on Control and Optimization, 63(3):1560–1589,

  71. [71]

    doi: 10.1137/23M1621058. 25