Linearly Solvable Continuous-Time General-Sum Stochastic Differential Games

Monika Tomar; Takashi Tanaka

arxiv: 2604.07479 · v1 · submitted 2026-04-08 · 🧮 math.OC · cs.GT· cs.SY· econ.TH· eess.SY

Linearly Solvable Continuous-Time General-Sum Stochastic Differential Games

Monika Tomar , Takashi Tanaka This is my paper

Pith reviewed 2026-05-10 16:53 UTC · model grok-4.3

classification 🧮 math.OC cs.GTcs.SYecon.THeess.SY

keywords stochastic differential gamesgeneral-sum gamesCole-Hopf transformationHamilton-Jacobi-Bellman equationsNash equilibriapath integralsdistribution planning

0 comments

The pith

A class of continuous-time stochastic general-sum differential games admits exact solutions by transforming their nonlinear HJB equations into a decoupled linear PDE system.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that certain multi-agent stochastic games with conflicting interests can be solved exactly without grid discretization. By modeling them as distribution planning games where spatial conflicts are captured through cross-log-likelihood ratios, a generalized multivariate Cole-Hopf transformation turns the coupled nonlinear Hamilton-Jacobi-Bellman equations into independent linear partial differential equations. The linear system then yields feedback Nash equilibrium strategies via the Feynman-Kac path integral representation. A sympathetic reader would care because the approach sidesteps the exponential growth in computation that normally occurs when solving high-dimensional multi-agent problems directly.

Core claim

The authors introduce a class of continuous-time finite-player stochastic general-sum differential games that admit solutions through an exact linear PDE system. Formulating the game as a distribution planning problem with cross-log-likelihood ratio costs to model spatial conflicts such as congestion, they apply a generalized multivariate Cole-Hopf transformation that decouples the associated nonlinear Hamilton-Jacobi-Bellman equations into a system of linear partial differential equations. This reduction permits efficient grid-free computation of feedback Nash equilibrium strategies via the Feynman-Kac path integral method.

What carries the argument

The generalized multivariate Cole-Hopf transformation, which converts the nonlinear coupled HJB equations of the distribution planning game into a decoupled system of linear PDEs.

If this is right

Feedback Nash equilibrium strategies become computable via path integrals without state-space discretization.
The approach applies directly to multi-agent problems involving spatial conflicts such as congestion avoidance.
The linear PDE reduction removes the curse of dimensionality for this class of games.
Exact equilibrium solutions are obtained for any finite number of players satisfying the formulation conditions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar linearizing transformations could be sought for other cost structures or dynamics outside distribution planning games.
The method may enable real-time equilibrium computation in continuous-time multi-robot navigation tasks.
The linear PDE system might connect to existing path-integral techniques used in single-agent stochastic control.

Load-bearing premise

The games must be formulated specifically as distribution planning games using cross-log-likelihood ratios for costs, and the dynamics and cost structures must permit the Cole-Hopf transformation to exactly linearize the nonlinear HJB system.

What would settle it

For a low-dimensional instance of such a game with a known closed-form equilibrium, numerically solve the original nonlinear HJB system and compare the resulting strategies and values against those obtained from the transformed linear PDE system; mismatch would falsify the exact linearization claim.

Figures

Figures reproduced from arXiv: 2604.07479 by Monika Tomar, Takashi Tanaka.

**Figure 1.** Figure 1: Equilibrium Measures. −2 −1 0 1 2 State (x) 0 1 2 3 4 5 6 Terminal Density γ = −0.60 P1 Density P2 Density −2 −1 0 1 2 State (x) γ = +0.00 −2 −1 0 1 2 State (x) γ = +0.60 [PITH_FULL_IMAGE:figures/full_fig_p006_1.png] view at source ↗

**Figure 5.** Figure 5: Asymmetric Interaction based Equilibrium Measures [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗

read the original abstract

This paper introduces a class of continuous-time, finite-player stochastic general-sum differential games that admit solutions through an exact linear PDE system. We formulate a distribution planning game utilizing the cross-log-likelihood ratio to naturally model multi-agent spatial conflicts, such as congestion avoidance. By applying a generalized multivariate Cole-Hopf transformation, we decouple the associated non-linear Hamilton-Jacobi-Bellman (HJB) equations into a system of linear partial differential equations. This reduction enables the efficient, grid-free computation of feedback Nash equilibrium strategies via the Feynman-Kac path integral method, effectively overcoming the curse of dimensionality.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives an exact linear reduction for a narrow class of general-sum stochastic games via a multivariate Cole-Hopf transform on costs that use cross-log-likelihood ratios.

read the letter

The main point is that for distribution planning games where running costs include a cross-log-likelihood ratio to penalize spatial overlaps, the coupled nonlinear HJB equations decouple into independent linear PDEs. The authors apply a generalized multivariate Cole-Hopf transformation that cancels the nonlinear and interaction terms exactly, then recover the feedback Nash strategies from Feynman-Kac expectations. This yields a grid-free method that sidesteps the usual dimensionality issues for this setup.

Referee Report

0 major / 3 minor

Summary. The paper introduces a class of continuous-time finite-player stochastic general-sum differential games formulated as distribution planning games, where running costs incorporate a cross-log-likelihood ratio term to encode spatial interactions such as congestion. For this restricted class, a generalized multivariate Cole-Hopf transformation is applied to convert the coupled nonlinear Hamilton-Jacobi-Bellman (HJB) system into a set of independent linear PDEs, whose solutions are represented via Feynman-Kac expectations to yield feedback Nash equilibrium strategies in a grid-free manner.

Significance. If the claimed exact decoupling holds under the stated restrictions on dynamics and costs, the work offers a meaningful advance for a subclass of general-sum stochastic games by replacing coupled nonlinear PDEs with decoupled linear ones solvable via path integrals. This directly addresses the curse of dimensionality in continuous time and provides an exact, non-approximate route to equilibria for problems that are otherwise intractable. The internal consistency of the transformation for the chosen cost structure is a clear strength.

minor comments (3)

The abstract and introduction should explicitly list the precise conditions on the drift, diffusion, and cost functions that guarantee exact cancellation of nonlinear and cross terms under the Cole-Hopf map; without this, readers may overgeneralize the result beyond the distribution-planning class.
A low-dimensional illustrative example (e.g., two agents in 1D with explicit linear PDE solutions) would strengthen the exposition and allow immediate verification of the claimed decoupling.
Notation for the multivariate transformation (e.g., the precise form of the vector-valued log-transform and the resulting linear operators) should be introduced with a dedicated equation block early in the derivation section to improve readability.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive and accurate summary of our work, as well as the recommendation for minor revision. The referee correctly identifies the core contribution: the introduction of a restricted class of continuous-time general-sum stochastic differential games that become linearly solvable via a multivariate Cole-Hopf transformation, enabling grid-free Feynman-Kac computation of feedback Nash equilibria.

Circularity Check

0 steps flagged

No significant circularity; derivation applies known transformation to tailored game class

full rationale

The paper defines a restricted class of distribution planning games whose running costs use a cross-log-likelihood ratio term chosen precisely so that a generalized multivariate Cole-Hopf transformation exactly cancels the nonlinear and coupling terms in the HJB system, yielding independent linear PDEs solvable by Feynman-Kac. This is a standard mathematical reduction for the specially formulated class rather than a self-referential definition, fitted prediction, or load-bearing self-citation. No equations or steps reduce the claimed result to its own inputs by construction; the construction is self-contained once the cost structure is restricted as stated.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain-specific game formulation and the mathematical applicability of the Cole-Hopf transformation; no free parameters or invented entities are indicated in the abstract.

axioms (1)

domain assumption The stochastic differential game is formulated as a distribution planning game with costs defined via cross-log-likelihood ratio to capture multi-agent spatial conflicts.
This specific modeling choice is required for the transformation to decouple the HJB equations into linear PDEs.

pith-pipeline@v0.9.0 · 5398 in / 1183 out tokens · 61853 ms · 2026-05-10T16:53:46.472967+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

By applying a generalized multivariate Cole-Hopf transformation, we decouple the associated non-linear Hamilton-Jacobi-Bellman (HJB) equations into a system of linear partial differential equations.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

16 extracted references · 16 canonical work pages

[1]

Basar and G

T. Basar and G. J. Olsder,Dynamic Noncooperative Game Theory, 2nd ed., ser. Classics in Applied Mathematics. Philadelphia, PA: SIAM, 1998, vol. 23. 0 1 2 3 Time (t) −2 −1 0 1 2 State (x) γ = −0.60 P1 Mean P2 Mean Well Centers 0 1 2 3 Time (t) γ = +0.00 0 1 2 3 Time (t) γ = +0.60 Fig. 5. Asymmetric Interaction based Equilibrium Measures

work page 1998
[2]

Stochastic differential games and viscosity solutions of Hamilton–Jacobi–Bellman–Isaacs equations,

R. Buckdahn and J. Li, “Stochastic differential games and viscosity solutions of Hamilton–Jacobi–Bellman–Isaacs equations,”SIAM Jour- nal on Control and Optimization, vol. 47, no. 1, pp. 444–475, 2008

work page 2008
[3]

Nash equilibria for nonzero-sum ergodic stochastic differential games,

S. N. Cohen and V . Fedyashov, “Nash equilibria for nonzero-sum ergodic stochastic differential games,”Journal of Applied Probability, vol. 54, no. 4, pp. 977–994, 2017

work page 2017
[4]

Nonzero-sum risk- sensitive stochastic differential games: A multi-parameter eigenvalue problem approach,

M. K. Ghosh, K. S. Kumar, C. Pal, and S. Pradhan, “Nonzero-sum risk- sensitive stochastic differential games: A multi-parameter eigenvalue problem approach,”Systems & Control Letters, vol. 172, p. 105443, 2023

work page 2023
[5]

Linearly-solvable Markov decision problems,

E. Todorov, “Linearly-solvable Markov decision problems,”Advances in neural information processing systems, vol. 19, 2006

work page 2006
[6]

Linear theory for control of nonlinear stochastic systems,

H. J. Kappen, “Linear theory for control of nonlinear stochastic systems,”Physical review letters, vol. 95, no. 20, p. 200201, 2005

work page 2005
[7]

Relative entropy and free energy dualities: Connections to path integral and kl control,

E. A. Theodorou and E. Todorov, “Relative entropy and free energy dualities: Connections to path integral and kl control,” in2012 ieee 51st ieee conference on decision and control (cdc). IEEE, 2012, pp. 1466–1473

work page 2012
[8]

Linearly solvable Markov games,

K. Dvijotham and E. Todorov, “Linearly solvable Markov games,” in 2012 American Control Conference (ACC). IEEE, 2012, pp. 1845– 1850

work page 2012
[9]

Linearly solvable mean-field traffic routing games,

T. Tanaka, E. Nekouei, A. R. Pedram, and K. H. Johansson, “Linearly solvable mean-field traffic routing games,”IEEE Transactions on Automatic Control, vol. 66, no. 2, pp. 880–887, 2020

work page 2020
[10]

Linearly Solvable General- Sum Markov Games,

B. C. Soper, C. J. Miller, and D. M. Merl, “Linearly Solvable General- Sum Markov Games,” in2024 60th Annual Allerton Conference on Communication, Control, and Computing. IEEE, 2024, pp. 1–8

work page 2024
[11]

Risk-minimizing two-player zero-sum stochastic differential game via path integral control,

A. Patil, Y . Zhou, D. Fridovich-Keil, and T. Tanaka, “Risk-minimizing two-player zero-sum stochastic differential game via path integral control,” in2023 62nd IEEE Conference on Decision and Control (CDC). IEEE, 2023, pp. 3095–3101

work page 2023
[12]

Linearly-solvable mean-field ap- proximation for multi-team road traffic games,

A. R. Pedram and T. Tanaka, “Linearly-solvable mean-field ap- proximation for multi-team road traffic games,” in2019 IEEE 58th Conference on Decision and Control (CDC). IEEE, 2019, pp. 1243– 1248

work page 2019
[13]

A mean field game approach for multi-lane traffic management,

A. Festa and S. G ¨ottlich, “A mean field game approach for multi-lane traffic management,”IFAC-PapersOnLine, vol. 51, no. 32, pp. 793– 798, 2018

work page 2018
[14]

A differential game approach to multi-agent collision avoidance,

T. Mylvaganam, M. Sassano, and A. Astolfi, “A differential game approach to multi-agent collision avoidance,”IEEE Transactions on Automatic Control, vol. 62, no. 8, pp. 4229–4235, 2017

work page 2017
[15]

Decentralized safe multi-agent stochastic optimal control using deep fbsdes and admm,

M. A. Pereira, A. D. Saravanos, O. So, and E. A. Theodorou, “Decentralized safe multi-agent stochastic optimal control using deep FBSDEs and ADMM,”arXiv preprint arXiv:2202.10658, 2022

work page arXiv 2022
[16]

Stochastic differential equations,

B. Øksendal, “Stochastic differential equations,” inStochastic differ- ential equations: an introduction with applications. Springer, 2003, pp. 38–50

work page 2003

[1] [1]

Basar and G

T. Basar and G. J. Olsder,Dynamic Noncooperative Game Theory, 2nd ed., ser. Classics in Applied Mathematics. Philadelphia, PA: SIAM, 1998, vol. 23. 0 1 2 3 Time (t) −2 −1 0 1 2 State (x) γ = −0.60 P1 Mean P2 Mean Well Centers 0 1 2 3 Time (t) γ = +0.00 0 1 2 3 Time (t) γ = +0.60 Fig. 5. Asymmetric Interaction based Equilibrium Measures

work page 1998

[2] [2]

Stochastic differential games and viscosity solutions of Hamilton–Jacobi–Bellman–Isaacs equations,

R. Buckdahn and J. Li, “Stochastic differential games and viscosity solutions of Hamilton–Jacobi–Bellman–Isaacs equations,”SIAM Jour- nal on Control and Optimization, vol. 47, no. 1, pp. 444–475, 2008

work page 2008

[3] [3]

Nash equilibria for nonzero-sum ergodic stochastic differential games,

S. N. Cohen and V . Fedyashov, “Nash equilibria for nonzero-sum ergodic stochastic differential games,”Journal of Applied Probability, vol. 54, no. 4, pp. 977–994, 2017

work page 2017

[4] [4]

Nonzero-sum risk- sensitive stochastic differential games: A multi-parameter eigenvalue problem approach,

M. K. Ghosh, K. S. Kumar, C. Pal, and S. Pradhan, “Nonzero-sum risk- sensitive stochastic differential games: A multi-parameter eigenvalue problem approach,”Systems & Control Letters, vol. 172, p. 105443, 2023

work page 2023

[5] [5]

Linearly-solvable Markov decision problems,

E. Todorov, “Linearly-solvable Markov decision problems,”Advances in neural information processing systems, vol. 19, 2006

work page 2006

[6] [6]

Linear theory for control of nonlinear stochastic systems,

H. J. Kappen, “Linear theory for control of nonlinear stochastic systems,”Physical review letters, vol. 95, no. 20, p. 200201, 2005

work page 2005

[7] [7]

Relative entropy and free energy dualities: Connections to path integral and kl control,

E. A. Theodorou and E. Todorov, “Relative entropy and free energy dualities: Connections to path integral and kl control,” in2012 ieee 51st ieee conference on decision and control (cdc). IEEE, 2012, pp. 1466–1473

work page 2012

[8] [8]

Linearly solvable Markov games,

K. Dvijotham and E. Todorov, “Linearly solvable Markov games,” in 2012 American Control Conference (ACC). IEEE, 2012, pp. 1845– 1850

work page 2012

[9] [9]

Linearly solvable mean-field traffic routing games,

T. Tanaka, E. Nekouei, A. R. Pedram, and K. H. Johansson, “Linearly solvable mean-field traffic routing games,”IEEE Transactions on Automatic Control, vol. 66, no. 2, pp. 880–887, 2020

work page 2020

[10] [10]

Linearly Solvable General- Sum Markov Games,

B. C. Soper, C. J. Miller, and D. M. Merl, “Linearly Solvable General- Sum Markov Games,” in2024 60th Annual Allerton Conference on Communication, Control, and Computing. IEEE, 2024, pp. 1–8

work page 2024

[11] [11]

Risk-minimizing two-player zero-sum stochastic differential game via path integral control,

A. Patil, Y . Zhou, D. Fridovich-Keil, and T. Tanaka, “Risk-minimizing two-player zero-sum stochastic differential game via path integral control,” in2023 62nd IEEE Conference on Decision and Control (CDC). IEEE, 2023, pp. 3095–3101

work page 2023

[12] [12]

Linearly-solvable mean-field ap- proximation for multi-team road traffic games,

A. R. Pedram and T. Tanaka, “Linearly-solvable mean-field ap- proximation for multi-team road traffic games,” in2019 IEEE 58th Conference on Decision and Control (CDC). IEEE, 2019, pp. 1243– 1248

work page 2019

[13] [13]

A mean field game approach for multi-lane traffic management,

A. Festa and S. G ¨ottlich, “A mean field game approach for multi-lane traffic management,”IFAC-PapersOnLine, vol. 51, no. 32, pp. 793– 798, 2018

work page 2018

[14] [14]

A differential game approach to multi-agent collision avoidance,

T. Mylvaganam, M. Sassano, and A. Astolfi, “A differential game approach to multi-agent collision avoidance,”IEEE Transactions on Automatic Control, vol. 62, no. 8, pp. 4229–4235, 2017

work page 2017

[15] [15]

Decentralized safe multi-agent stochastic optimal control using deep fbsdes and admm,

M. A. Pereira, A. D. Saravanos, O. So, and E. A. Theodorou, “Decentralized safe multi-agent stochastic optimal control using deep FBSDEs and ADMM,”arXiv preprint arXiv:2202.10658, 2022

work page arXiv 2022

[16] [16]

Stochastic differential equations,

B. Øksendal, “Stochastic differential equations,” inStochastic differ- ential equations: an introduction with applications. Springer, 2003, pp. 38–50

work page 2003