Numerical approximation of Markovian BSDEs in infinite horizon and elliptic PDEs

Adrien Richou (IMB); Charu Shardul (CMAP); Emmanuel Gobet (LPSM (UMR\_8001))

arxiv: 2604.24392 · v1 · submitted 2026-04-27 · 🧮 math.PR

Numerical approximation of Markovian BSDEs in infinite horizon and elliptic PDEs

Emmanuel Gobet (LPSM (UMR\_8001)) , Adrien Richou (IMB) , Charu Shardul (CMAP) This is my paper

Pith reviewed 2026-05-08 01:43 UTC · model grok-4.3

classification 🧮 math.PR

keywords backward stochastic differential equationsinfinite horizonnumerical approximationneural networksMarkovian BSDEselliptic partial differential equationsMalliavin derivative

0 comments

The pith

Numerical schemes based on Picard contraction and neural networks approximate infinite-horizon Markovian BSDEs with proven convergence.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper focuses on backward stochastic differential equations defined over an infinite time horizon with a Markovian driver. It establishes a representation of the solution via the Malliavin derivative and demonstrates contraction of the associated Picard iteration under appropriate conditions. Building on this, the authors propose a grid-based scheme suitable for low dimensions, a neural network scheme with a proven convergence rate that mitigates the curse of dimensionality, and an alternative neural network method that performs well even when the Lipschitz constant in the z-variable exceeds the contraction threshold.

Core claim

A probabilistic representation using the Malliavin derivative allows proving contraction of a Picard scheme for infinite-horizon BSDEs, which in turn justifies two numerical approximation schemes with error bounds and convergence results, while a third neural network scheme succeeds without relying on contraction.

What carries the argument

The Picard fixed-point iteration based on the contraction mapping for the infinite-horizon BSDE, approximated via space grids or neural networks.

If this is right

The grid-based scheme yields tight error bounds in low dimensions but suffers from exponential growth in computational time as dimension increases.
Neural network approximations prove convergent and maintain accuracy in very high dimensions by avoiding the curse of dimensionality.
The non-contractive neural scheme extends practical applicability to BSDEs with stronger dependence on the control variable.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

These methods could be tested on specific elliptic PDEs arising from the BSDE representation to verify accuracy against known analytical solutions.
Combining the schemes with variance reduction techniques might further improve performance in high-dimensional settings.
Extension to non-Markovian cases would require different representations beyond Malliavin calculus.

Load-bearing premise

The contraction property for the Picard scheme holds only under stronger assumptions on the Lipschitz constants than those required for mere existence and uniqueness of the BSDE solution.

What would settle it

A numerical test case where the error bounds for the grid or neural schemes do not hold as dimension increases or when the z-Lipschitz constant violates the contraction condition.

Figures

Figures reproduced from arXiv: 2604.24392 by Adrien Richou (IMB), Charu Shardul (CMAP), Emmanuel Gobet (LPSM (UMR\_8001)).

**Figure 1.** Figure 1: Box plot for errors in u(x) and ¯u(x) for p = 0 and 2 for Picard iterations n = 3, . . . , 10 plotted against the grid points. 2 4 6 8 10 12 Picard iteration n 6 5 4 3 2 1 su p x lo g( u n p (x)) Max log error in u(x) vs Picard iterations p = 0 p = 1 p = 2 p = 3 p = 4 p = 5 p = 6 2 4 6 8 10 12 Picard iteration n 5 4 3 2 1 su p x lo g( u n p (x)) Max log error in u(x) vs Picard iterations p = 0 p = 1 p = 2 … view at source ↗

**Figure 2.** Figure 2: Plot of the max log errors on the grid against Picard iterations for different values of boundary truncation p. view at source ↗

**Figure 3.** Figure 3: Box plot for the supremum of the log errors of view at source ↗

**Figure 4.** Figure 4: Picard iterations for the general SDE with view at source ↗

**Figure 5.** Figure 5: Picard iterations for Algorithm 1 for d = 1. follows: |u n d − u|L2 µ0 ≈ view at source ↗

**Figure 6.** Figure 6: Relative errors ∆u n d and ∆¯u n d (see (4.1)) for n = 5 and different values of d between 1 and 50 for 5 experiments. 0.0 0.4 0.8 1.2 1.6 2.0 2.4 2.8 3.2 3.6 4.0 4.4 4.8 5.2 Kz 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 u n Kz log of L 2 0 errors for u 0.0 0.4 0.8 1.2 1.6 2.0 2.4 2.8 3.2 3.6 4.0 4.4 4.8 5.2 Kz 4.0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 u n Kz log of L 2 0 errors for u view at source ↗

**Figure 7.** Figure 7: Logarithm of L 2 µ0 errors ∆u n Kz and ∆¯u n Kz for n = 5 with different values of Kz ∈ {0, 0.4, . . . , 5.2} for 5 experiments for the NN based iterative contraction scheme, Algorithm 1. Similarly to the previous scheme, view at source ↗

**Figure 8.** Figure 8: Picard iterations (Algorithm 1) for u n, ¯u 1,n and ¯u 2,n for d = 2 and n = 1, 3, 5 and 7 (from top to bottom). The wire-frame plots are used for the analytical solution. By using Proposition 2.1 and Young’s inequality, we have E Z +∞ 0 e −as|f(Xx s , Y x s , Zx s ) + aY x s |ds ⩽CE Z +∞ 0 e −(2a−λ)s + e −λs |Y x s | 2 + |Z x s | 2 + |Xx s | 2r ds < +∞, 26 view at source ↗

**Figure 9.** Figure 9: Solution u(x) and ¯u(x) for d = 2 for the NN based direct scheme (Algorithm 2). 0.0 0.4 0.8 1.2 1.6 2.0 2.4 2.8 3.2 3.6 4.0 4.4 4.8 5.2 Kz 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 u Kz log of L 2 0 errors for u 0.0 0.4 0.8 1.2 1.6 2.0 2.4 2.8 3.2 3.6 4.0 4.4 4.8 5.2 Kz 5.0 4.5 4.0 3.5 3.0 2.5 2.0 1.5 1.0 u Kz log of L 2 0 errors for u view at source ↗

**Figure 10.** Figure 10: Logarithm of L 2 µ0 errors ∆u n Kz and ∆¯u n Kz for n = 5, M = 3000 and Mx = 512 with different values of Kz ∈ {0, 0.4, . . . , 5.2} for 5 experiments for the NN based direct scheme, Algorithm 2. where we have set λ := 2µ − K2 f,z(1d′>1 ∨ 1r>0). The Lebesgue theorem gives us that the integral term in (A.1) tends to E Z +∞ 0 e −as(f(Xx s , Y x s , Zx s ) + aY x s )ds when T → +∞. Moreover, by Propositio… view at source ↗

read the original abstract

We study backward stochastic differential equations (BSDEs) in infinite horizon and design efficient numerical schemes for solving them. We establish a probabilistic representation of the solution of the BSDE using Malliavin derivative and prove results for contraction of a Picard scheme. We develop three numerical schemes, of which the first two are based on a fixed point argument using contraction, imposing additional assumptions compared to what is needed for existence and uniqueness of the solution. The first scheme is a space grid based approximation where we establish tight numerical error bounds using a growth truncation argument; it performs well in low dimensions but computational times increase exponentially with dimension. The second scheme uses neural network approximations for which we have proved a convergence result. Using neural networks alleviates the curse of dimensionality, giving good accuracy in very high dimensions. The third scheme also uses neural networks but does not rely on contraction arguments, showcasing good performance even for larger z-Lipschitz dependence outside the domain of contraction.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives three concrete numerical schemes for infinite-horizon Markovian BSDEs, with contraction-based proofs and error bounds only for the first two under extra Lipschitz restrictions, while the third relies on experiments alone.

read the letter

The main point is that the authors extend BSDE numerics to the infinite-horizon Markovian setting by using a Malliavin representation, then build a Picard contraction argument to justify two schemes and add a third neural-network scheme that drops the contraction step. The grid scheme gets explicit error bounds via growth truncation and works in low dimensions. The second scheme uses neural nets with a proved convergence result under the contraction regime, which helps in high dimensions. The third scheme is tested on cases with larger z-Lipschitz constants and shows decent accuracy in the reported runs. That combination is the actual new piece beyond standard finite-horizon methods. The stress-test note holds: the convergence theorem for the neural-network scheme is tied to the same restrictive contraction assumptions needed for the fixed-point argument, so it does not apply to the full range where the BSDE is known to exist. Scheme 3 has no theorem at all, only numerical evidence, which leaves its reliability dependent on the specific test cases. The extra assumptions for contraction are clearly stated but do narrow the practical scope. The elliptic-PDE link is mentioned but not developed in detail beyond the BSDE representation. This is aimed at people who need implementable solvers for long-run stochastic control or finance problems in moderate to high dimensions. A reader who wants methods with some analysis attached will find usable ideas, though they will have to check the Lipschitz condition case by case. The work is grounded enough in standard probabilistic tools and has concrete numerical claims to deserve a serious referee. I would send it to peer review; referees can verify the contraction proofs, ask for more on the scope of the theorems, and suggest additional tests or analysis for the unproven scheme.

Referee Report

2 major / 2 minor

Summary. The paper develops three numerical schemes for infinite-horizon Markovian BSDEs (linked to elliptic PDEs). Schemes 1 and 2 rely on a Picard fixed-point contraction mapping, with scheme 1 using a space-grid approximation that yields explicit error bounds via growth truncation, and scheme 2 using neural-network approximations for which a convergence result is proved. Scheme 3 employs neural networks without the contraction assumption and is supported by numerical experiments showing good performance for larger z-Lipschitz constants. The approach uses Malliavin calculus for the probabilistic representation and imposes additional assumptions for the contraction-based schemes beyond those needed for existence and uniqueness.

Significance. If the convergence results hold under the stated conditions, the work provides concrete, implementable methods that mitigate the curse of dimensionality in high-dimensional infinite-horizon BSDEs via neural networks, while the grid scheme offers rigorous error control in low dimensions. The explicit treatment of the non-contraction regime through numerical validation is a practical strength, though the theoretical guarantees remain confined to the contraction setting.

major comments (2)

[Abstract / scheme descriptions] Abstract and the description of the schemes: the headline statement that 'we have proved a convergence result' for the neural-network scheme (scheme 2) is accurate only under the additional contraction assumptions (sufficiently small z-Lipschitz constant). These assumptions are explicitly noted as stricter than those required for existence/uniqueness of the infinite-horizon BSDE, so the proved convergence does not extend to the full existence regime; this qualification should be stated more prominently to avoid overstating the scope of the theoretical result.
[Third scheme / numerical experiments] Section on the third scheme: no convergence theorem is provided for the neural-network scheme that operates outside the contraction domain. While numerical experiments demonstrate good performance for larger z-Lipschitz dependence, the absence of any error bound or consistency proof means the method lacks the theoretical backing claimed for the first two schemes; this gap is load-bearing for the paper's assertion of 'efficient numerical schemes … with proved convergence'.

minor comments (2)

[Introduction / theorem statements] Clarify the precise statement of the additional assumptions (e.g., the threshold on the z-Lipschitz constant) in the introduction and in the statements of the convergence theorems for schemes 1 and 2.
[PDE connection] The link between the BSDE schemes and the associated elliptic PDEs is mentioned but could be made more explicit, e.g., by stating the PDE form and the precise correspondence used for validation.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments. We address each major comment below and will revise the manuscript accordingly to improve precision and clarity.

read point-by-point responses

Referee: [Abstract / scheme descriptions] Abstract and the description of the schemes: the headline statement that 'we have proved a convergence result' for the neural-network scheme (scheme 2) is accurate only under the additional contraction assumptions (sufficiently small z-Lipschitz constant). These assumptions are explicitly noted as stricter than those required for existence/uniqueness of the infinite-horizon BSDE, so the proved convergence does not extend to the full existence regime; this qualification should be stated more prominently to avoid overstating the scope of the theoretical result.

Authors: We agree that the abstract and scheme descriptions should qualify the convergence result for Scheme 2 more prominently. The proved convergence holds only under the contraction assumptions (sufficiently small z-Lipschitz constant), which are stricter than the conditions for existence and uniqueness. We will revise the abstract to state explicitly that the convergence result for the neural-network scheme is established under these additional assumptions, thereby avoiding any overstatement of scope. revision: yes
Referee: [Third scheme / numerical experiments] Section on the third scheme: no convergence theorem is provided for the neural-network scheme that operates outside the contraction domain. While numerical experiments demonstrate good performance for larger z-Lipschitz dependence, the absence of any error bound or consistency proof means the method lacks the theoretical backing claimed for the first two schemes; this gap is load-bearing for the paper's assertion of 'efficient numerical schemes … with proved convergence'.

Authors: We acknowledge that Scheme 3 lacks a convergence theorem, as it is intended for the non-contraction regime where the Picard fixed-point argument does not apply. The manuscript presents Scheme 3 as a practical alternative validated by numerical experiments showing good performance for larger z-Lipschitz constants, without claiming a theoretical convergence result for it. We will revise the relevant sections (including any statements referring to 'proved convergence' for the schemes) to clearly restrict such claims to Schemes 1 and 2, and to emphasize the experimental nature of Scheme 3. We do not currently have a proof for Scheme 3 and do not assert one. revision: partial

Circularity Check

0 steps flagged

No significant circularity; derivations rely on external Malliavin calculus and fixed-point theory

full rationale

The paper derives a probabilistic representation via Malliavin derivatives and proves Picard contraction under explicit Lipschitz conditions, then constructs three schemes with error bounds or convergence statements that follow directly from those contraction results. Scheme 2's NN convergence is proved inside the same contraction regime already stated for existence, while scheme 3 is presented only numerically without a theorem; neither step reduces to self-definition, fitted inputs renamed as predictions, or load-bearing self-citations. All load-bearing steps cite standard external probabilistic tools rather than prior author work that would close a loop.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Relies on standard existence/uniqueness for BSDEs and contraction mapping principles; no explicit free parameters or invented entities detailed in abstract.

axioms (1)

domain assumption Existence and uniqueness of solution to the infinite-horizon BSDE under suitable conditions
Invoked as baseline before imposing additional assumptions for the contraction-based schemes.

pith-pipeline@v0.9.0 · 5477 in / 1101 out tokens · 67570 ms · 2026-05-08T01:43:25.724185+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

18 extracted references · 18 canonical work pages

[1]

C. Beck, L. Gonon, and A. Jentzen. Overcoming the curse of dimensionality in the numerical approximation of high-dimensional semilinear elliptic partial differential equations. Partial Differ. Equ. Appl. , 5(6):Paper No. 31, 47, 2024

work page 2024
[2]

Briand and Y

P. Briand and Y. Hu. Stability of BSDEs with random terminal time and homogenization of semilinear elliptic PDEs . Journal of Functional Analysis , 155(2):455--494, 1998

work page 1998
[3]

Briand and Y

P. Briand and Y. Hu. Quadratic BSDE s with convex generators and unbounded terminal conditions . Probab. Theory Related Fields , 141 (3-4):543--567, 2008

work page 2008
[4]

Chamakh, E

L. Chamakh, E. Gobet, and W. Liu. Orlicz norms and concentration inequalities for -heavy tailed random variables . hal-03175697

work page
[5]

Chessari, R

J. Chessari, R. Kawai, Y. Shinozaki, and T. Yamada. Numerical methods for backward stochastic differential equations: A survey . Probability Surveys , 20(none):486 -- 567, 2023

work page 2023
[6]

El Karoui, S

N. El Karoui, S. Peng, and M.-C. Quenez. Backward stochastic differential equations in finance. Math. Finance , 7 (1):1--71, 1997

work page 1997
[7]

Fuhrman and G

M. Fuhrman and G. Tessitore. The B ismut- E lworthy formula for backward SDE s and applications to nonlinear K olmogorov equations and control in infinite dimensional spaces. Stoch. Stoch. Rep. , 74 (1-2):429--464, 2002

work page 2002
[8]

Gobet, J

E. Gobet, J. G. L \'o pez-Salas, and C. V \'a zquez. Quasi-Regression Monte-Carlo scheme for semi-linear PDEs and BSDEs with large scale parallelization on GPUs . Archives of Computational Methods in Engineering , 27(3):889--921, April 2019

work page 2019
[9]

Gobet, A

E. Gobet, A. Richou, and L. Szpruch. Numerical approximation of ergodic BSDE s using non linear F eynman- K ac formulas. Stochastic Process. Appl. , 195:Paper No. 104871, 20, 2026

work page 2026
[10]

K. Hornik. Approximation capabilities of multilayer feedforward networks. Neural networks , 4(2):251--257, 1991

work page 1991
[11]

Kremsner, A

S. Kremsner, A. Steinicke, and M. Szölgyenyi. A deep neural network algorithm for semilinear elliptic PDEs with applications in insurance mathematics. Risks , 8(4), 2020

work page 2020
[12]

H. Kunita. Stochastic flows and stochastic differential equations . Cambridge Studies in Advanced Mathematics. 24. Cambridge: Cambridge University Press , 1997

work page 1997
[13]

Leshno, V

M. Leshno, V. Y. Lin, A. Pinkus, and S. Schocken. Multilayer feedforward networks with a nonpolynomial activation function can approximate any function. Neural networks , 6(6):861--867, 1993

work page 1993
[14]

Ma and J

J. Ma and J. Zhang. Representation theorems for backward stochastic differential equations. Ann. Appl. Probab. , 12(4):1390--1418, 2002

work page 2002
[15]

Nüsken and L

N. Nüsken and L. Richter. Interpolating between BSDE s and PINN s -- deep learning for elliptic and parabolic boundary value problems, 2021

work page 2021
[16]

Pardoux and A

E. Pardoux and A. R a s canu. Stochastic Differential Equations, Backward SDEs, Partial Differential Equations , volume 69 of Stochastic Modelling and Applied Probability . Springer-Verlag, 2014

work page 2014
[17]

Talagrand

M. Talagrand. Isoperimetry and integrability of the sum of independent B anach-space valued random variables. Ann. Probab. , 17(4):1546--1570, 1989

work page 1989
[18]

Van der Vaart and J

A W. Van der Vaart and J. A. Wellner. Weak Convergence and Empirical Processes: With Applications to Statistics . Springer Series in Statistics. Springer-Verlag, New York, 1996

work page 1996

[1] [1]

C. Beck, L. Gonon, and A. Jentzen. Overcoming the curse of dimensionality in the numerical approximation of high-dimensional semilinear elliptic partial differential equations. Partial Differ. Equ. Appl. , 5(6):Paper No. 31, 47, 2024

work page 2024

[2] [2]

Briand and Y

P. Briand and Y. Hu. Stability of BSDEs with random terminal time and homogenization of semilinear elliptic PDEs . Journal of Functional Analysis , 155(2):455--494, 1998

work page 1998

[3] [3]

Briand and Y

P. Briand and Y. Hu. Quadratic BSDE s with convex generators and unbounded terminal conditions . Probab. Theory Related Fields , 141 (3-4):543--567, 2008

work page 2008

[4] [4]

Chamakh, E

L. Chamakh, E. Gobet, and W. Liu. Orlicz norms and concentration inequalities for -heavy tailed random variables . hal-03175697

work page

[5] [5]

Chessari, R

J. Chessari, R. Kawai, Y. Shinozaki, and T. Yamada. Numerical methods for backward stochastic differential equations: A survey . Probability Surveys , 20(none):486 -- 567, 2023

work page 2023

[6] [6]

El Karoui, S

N. El Karoui, S. Peng, and M.-C. Quenez. Backward stochastic differential equations in finance. Math. Finance , 7 (1):1--71, 1997

work page 1997

[7] [7]

Fuhrman and G

M. Fuhrman and G. Tessitore. The B ismut- E lworthy formula for backward SDE s and applications to nonlinear K olmogorov equations and control in infinite dimensional spaces. Stoch. Stoch. Rep. , 74 (1-2):429--464, 2002

work page 2002

[8] [8]

Gobet, J

E. Gobet, J. G. L \'o pez-Salas, and C. V \'a zquez. Quasi-Regression Monte-Carlo scheme for semi-linear PDEs and BSDEs with large scale parallelization on GPUs . Archives of Computational Methods in Engineering , 27(3):889--921, April 2019

work page 2019

[9] [9]

Gobet, A

E. Gobet, A. Richou, and L. Szpruch. Numerical approximation of ergodic BSDE s using non linear F eynman- K ac formulas. Stochastic Process. Appl. , 195:Paper No. 104871, 20, 2026

work page 2026

[10] [10]

K. Hornik. Approximation capabilities of multilayer feedforward networks. Neural networks , 4(2):251--257, 1991

work page 1991

[11] [11]

Kremsner, A

S. Kremsner, A. Steinicke, and M. Szölgyenyi. A deep neural network algorithm for semilinear elliptic PDEs with applications in insurance mathematics. Risks , 8(4), 2020

work page 2020

[12] [12]

H. Kunita. Stochastic flows and stochastic differential equations . Cambridge Studies in Advanced Mathematics. 24. Cambridge: Cambridge University Press , 1997

work page 1997

[13] [13]

Leshno, V

M. Leshno, V. Y. Lin, A. Pinkus, and S. Schocken. Multilayer feedforward networks with a nonpolynomial activation function can approximate any function. Neural networks , 6(6):861--867, 1993

work page 1993

[14] [14]

Ma and J

J. Ma and J. Zhang. Representation theorems for backward stochastic differential equations. Ann. Appl. Probab. , 12(4):1390--1418, 2002

work page 2002

[15] [15]

Nüsken and L

N. Nüsken and L. Richter. Interpolating between BSDE s and PINN s -- deep learning for elliptic and parabolic boundary value problems, 2021

work page 2021

[16] [16]

Pardoux and A

E. Pardoux and A. R a s canu. Stochastic Differential Equations, Backward SDEs, Partial Differential Equations , volume 69 of Stochastic Modelling and Applied Probability . Springer-Verlag, 2014

work page 2014

[17] [17]

Talagrand

M. Talagrand. Isoperimetry and integrability of the sum of independent B anach-space valued random variables. Ann. Probab. , 17(4):1546--1570, 1989

work page 1989

[18] [18]

Van der Vaart and J

A W. Van der Vaart and J. A. Wellner. Weak Convergence and Empirical Processes: With Applications to Statistics . Springer Series in Statistics. Springer-Verlag, New York, 1996

work page 1996