Mean field social optimization: feedback person-by-person optimality and the dynamic programming equation

Li-Hsien Sun; Minyi Huang; Shuenn-Jyi Sheu

arxiv: 2508.14236 · v2 · pith:KAOUXHT5new · submitted 2025-08-19 · 🧮 math.OC

Mean field social optimization: feedback person-by-person optimality and the dynamic programming equation

Minyi Huang , Shuenn-Jyi Sheu , Li-Hsien Sun This is my paper

Pith reviewed 2026-05-21 23:16 UTC · model grok-4.3

classification 🧮 math.OC

keywords mean field social optimizationmaster equationperson-by-person optimalitydynamic programmingHamilton-Jacobi-Bellman equationnonlinear diffusion modelslinear-quadratic casesystemic risk

0 comments

The pith

Control laws from the master equation achieve epsilon-person-by-person optimality in mean field social optimization.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper derives a master equation, a new Hamilton-Jacobi-Bellman equation for the value function, by applying dynamic programming to a representative agent that chooses cooperative controls in nonlinear diffusion models. It establishes that the resulting feedback control laws satisfy epsilon-person-by-person optimality under regularity conditions, meaning no single agent can substantially improve the overall social cost by unilateral deviation. This optimality serves as a necessary condition for nearly attaining the social optimum in large populations. Tight error estimates of order O(1/N) for the social cost of order O(N) are obtained through multi-scale analysis that introduces two auxiliary master equations. The approach yields explicit solutions in the linear-quadratic case and applies to systemic risk models.

Core claim

By dynamic programming with a representative agent employing cooperative optimizer selection, we derive a new Hamilton--Jacobi--Bellman equation to be called the master equation of the value function. Under some regularity conditions, we establish epsilon-person-by-person optimality of the master equation-based control laws, which may be viewed as a necessary condition for nearly attaining the social optimum. A major challenge in the analysis is to obtain tight estimates, within an error of O(1/N), of the social cost having order O(N). This will be accomplished by multi-scale analysis via constructing two auxiliary master equations.

What carries the argument

The master equation, a new Hamilton-Jacobi-Bellman equation for the value function derived via dynamic programming on a representative cooperative agent, which generates the feedback control laws and enables both the epsilon-optimality proof and the O(1/N) social cost estimates.

If this is right

Explicit solutions for the master equations exist in the linear-quadratic case.
The framework extends directly to systemic risk problems.
The O(1/N) estimates guarantee that the social cost remains close to optimal when the population size N is large.
Epsilon-person-by-person optimality functions as a necessary condition for nearly reaching the social optimum.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same dynamic programming construction could be used to compare social optima against non-cooperative mean-field equilibria in the same diffusion models.
The multi-scale analysis with auxiliary equations offers a template for deriving higher-order approximations in other mean-field control settings.
Numerical evaluation of the 1/N rate in specific systemic risk examples would provide direct evidence of the error bound.

Load-bearing premise

The underlying models must satisfy regularity conditions that allow derivation of the master equation together with the tight O(1/N) error bounds obtained from multi-scale analysis using auxiliary equations.

What would settle it

In a concrete nonlinear diffusion model satisfying the regularity conditions, compute the social cost under the master-equation controls and verify whether the gap to the true social optimum exceeds order 1/N or whether a single agent's deviation improves the total social cost by more than epsilon.

read the original abstract

We consider mean field social optimization in nonlinear diffusion models. By dynamic programming with a representative agent employing cooperative optimizer selection, we derive a new Hamilton--Jacobi--Bellman (HJB) equation to be called the master equation of the value function. Under some regularity conditions, we establish $\epsilon$-person-by-person optimality of the master equation-based control laws, which may be viewed as a necessary condition for nearly attaining the social optimum. A major challenge in the analysis is to obtain tight estimates, within an error of $O(1/N)$, of the social cost having order $O(N)$. This will be accomplished by multi-scale analysis via constructing two auxiliary master equations. We illustrate explicit solutions of the master equations for the linear-quadratic (LQ) case, and give an application to systemic risk.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper derives a master equation via dynamic programming for cooperative mean-field control and proves epsilon person-by-person optimality with O(1/N) social-cost error using two auxiliary equations.

read the letter

The core contribution is a dynamic programming derivation of a master equation for the value function in mean-field social optimization, followed by a proof that the resulting feedback laws achieve epsilon-person-by-person optimality. They get the sharp O(1/N) gap on the total social cost of order O(N) by running multi-scale analysis on two auxiliary master equations. The LQ case is solved in closed form and they include a systemic-risk application, which makes the abstract claims more concrete than usual in this area.

Referee Report

1 major / 2 minor

Summary. The paper studies mean field social optimization for nonlinear diffusion models. Using dynamic programming for a representative cooperative agent, it derives a master HJB equation for the value function. Under regularity conditions, the authors establish ε-person-by-person optimality of the resulting feedback controls as a necessary condition for nearly attaining the social optimum. The central technical contribution is a multi-scale analysis that constructs two auxiliary master equations to obtain O(1/N) estimates for the social cost (of order O(N)). Explicit solutions are given for the linear-quadratic case, together with an application to systemic risk.

Significance. If the regularity assumptions can be verified to propagate through the auxiliary equations and the O(1/N) error bounds close rigorously, the work would supply a dynamic-programming route to decentralized nearly-optimal controls in cooperative mean-field settings. This would strengthen the link between person-by-person optimality and social optimality in large-population problems and provide a template for applications such as systemic-risk control.

major comments (1)

[§4] §4 (Multi-scale analysis and auxiliary master equations): The O(1/N) social-cost bound is obtained by constructing two auxiliary master equations and performing multi-scale analysis. The argument requires that the auxiliary value functions inherit the same C^{2,1} regularity, uniform ellipticity, and N-independent growth bounds used for the original master equation. The manuscript states these as assumptions but does not provide an explicit propagation argument (e.g., a priori estimates showing that second derivatives remain bounded independently of N after the first correction). Without this step the error may only be o(1), which would invalidate the claimed ε-person-by-person optimality.

minor comments (2)

Notation for the master equation and the two auxiliary equations should be made uniform across sections to avoid confusion between the original value function and the corrected ones.
The LQ section provides explicit solutions; a brief remark on how the general nonlinear proof reduces to the LQ case would help readers verify consistency.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the careful reading of the manuscript and for the constructive comment on the multi-scale analysis. We address the major comment below and will incorporate the necessary additions in the revised version.

read point-by-point responses

Referee: §4 (Multi-scale analysis and auxiliary master equations): The O(1/N) social-cost bound is obtained by constructing two auxiliary master equations and performing multi-scale analysis. The argument requires that the auxiliary value functions inherit the same C^{2,1} regularity, uniform ellipticity, and N-independent growth bounds used for the original master equation. The manuscript states these as assumptions but does not provide an explicit propagation argument (e.g., a priori estimates showing that second derivatives remain bounded independently of N after the first correction). Without this step the error may only be o(1), which would invalidate the claimed ε-person-by-person optimality.

Authors: We agree that the manuscript would benefit from an explicit propagation argument to confirm that the auxiliary value functions satisfy the same C^{2,1} regularity, uniform ellipticity, and N-independent growth bounds. In the revised version we will add a dedicated subsection (or appendix) containing a priori estimates for the first- and second-order correction terms. These estimates will be derived by differentiating the auxiliary master equations, applying the uniform ellipticity assumption on the diffusion coefficients, and using the Lipschitz and growth conditions already imposed on the running cost and drift to obtain N-independent bounds on the second derivatives. With this step the multi-scale expansion closes at order O(1/N) as claimed, rather than merely o(1). revision: yes

Circularity Check

0 steps flagged

Derivation chain self-contained via dynamic programming and auxiliary analysis

full rationale

The paper starts from standard dynamic programming applied to a representative agent under cooperative selection to derive the master HJB equation, then uses multi-scale analysis with two explicitly constructed auxiliary master equations to obtain the O(1/N) social-cost estimates needed for ε-person-by-person optimality. This chain relies on regularity assumptions and explicit LQ solutions rather than any self-definition, fitted-input renaming, or load-bearing self-citation that reduces the central claim to its own inputs. The auxiliary equations are introduced for error control and do not presuppose the optimality result they help establish. No quoted step equates a prediction or uniqueness claim directly to a prior fit or self-referential definition.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claims rest on regularity conditions for the diffusion models and value functions, which are standard domain assumptions in stochastic control but not independently verified here.

axioms (1)

domain assumption Regularity conditions on the nonlinear diffusion models and value functions
Invoked to derive the master equation and establish ε-person-by-person optimality with O(1/N) estimates.

pith-pipeline@v0.9.0 · 5671 in / 1225 out tokens · 50352 ms · 2026-05-21T23:16:26.290192+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We derive a new Hamilton–Jacobi–Bellman (HJB) equation to be called the master equation of the value function... multi-scale analysis via constructing two auxiliary master equations.
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Under some regularity conditions, we establish ε-person-by-person optimality of the master equation-based control laws

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

42 extracted references · 42 canonical work pages

[1]

Adler, M

R. Adler, M. Bazin, and M. Schiffer. Introduction to General Relativity, New York: McGraw–Hill, 1975

work page 1975
[2]

Ambrosio, N

L. Ambrosio, N. Gigli, and G. Savare. Gradient flows in metric spaces and the Wasserstein spaces of proba- bility measures, Lectures in Mathematics, ETH Zurich: Birkhauser, 2005

work page 2005
[3]

Arabneydi and A

J. Arabneydi and A. Mahajan. Team-optimal solution of finite number of mean-field coupled LQG subsys- tems. Proc. 54th IEEE CDC, Osaka, Japan, pp. 5308–5313, Dec. 2015

work page 2015
[4]

Balandat and C

M. Balandat and C. J. Tomlin. On efficiency in mean field differential games. Proc. American Control Con- ference, Washington, DC, pp. 2527–2532, June 2013

work page 2013
[5]

Bayraktar, A

E. Bayraktar, A. Cosso, and H. Pham. Randomized dynamic programming principle and Feynman–Kac representation for optimal control of Mckean–Vlasov dynamics. Trans. American Mathematical Society , 370(3):2115–2160, 2018

work page 2018
[6]

Bensoussan, J

A. Bensoussan, J. Frehse, and P. Yam. Mean Field Games and Mean Field Type Control Theory, New York: Springer, 2013

work page 2013
[7]

P. E. Caines, M. Huang, and R. P. Malham´e, Mean Field Games. In Handbook of Dynamic Game Theory, T. Basar and G. Zaccour Eds., pp. 345–372, Berlin: Springer, 2017

work page 2017
[8]

Cardaliaguet, S

P. Cardaliaguet, S. Daudin, J. Jackson, and P. Souganidis. An algebraic convergence rate for the optimal control of Mckean–Vlasov dynamics, arXiv:2203.14554, 2022

work page arXiv 2022
[9]

Cardaliaguet, F

P. Cardaliaguet, F. Delarue, J.-M. Lasry, and P.-L. Lions. The master equation and the convergence problem in mean field games, Princeton University Press, 2019

work page 2019
[10]

Cardaliaguet and C

P. Cardaliaguet and C. Rainer. On the (in)efficiency of MFG equilibria. SIAM Journal on Control and Opti- mization, 57(4):2292–2314, 2019

work page 2019
[11]

Cavallazzi

T. Cavallazzi. It ˆo–Krylov formula for a flow of measures. arXiv:2110.05251, 2021

work page arXiv 2021
[12]

Charalambous and N

C.D. Charalambous and N. U. Ahmed. Centralized versus decentralized optimization of distributed stochastic differential decision systems with different information structures Part II: applications. IEEE Trans. Autom. Control, vol. 63, no. 7, pp. 1913–1928, 2018

work page 1913
[13]

Carmona and F

R. Carmona and F. Delarue. Forward-backward stochastic differential equations and controlled McKean– Vlasov dynamics, Annals of Probability, 43(5): 2647–2700, 2015

work page 2015
[14]

Carmona and F

R. Carmona and F. Delarue. Probabilistic Theory of Mean Field Games with Applications, vol I and II, Cham: Springer, 2018

work page 2018
[15]

Carmona, J.-P

R. Carmona, J.-P. Fouque, and L.-H. Sun. Mean field games and systemic risk. Communications in Mathe- matical Sciences, vol. 13, no. 4, pp. 911–933, 2015

work page 2015
[16]

Chassagneux, D

J.-F. Chassagneux, D. Crisan, and F. Delarue. A probabilistic approach to classical solutions of the master equation for large population equilibria, Memoirs of AMS, 2022

work page 2022
[17]

Y . Chen, A. Busic, and S. P. Meyn. State estimation for the individual and the population in mean field control with application to demand dispatch. IEEE Trans. Autom. Control, vol. 62, pp. 1138–1149, Mar. 2017

work page 2017
[18]

R. A. d’Inverno. Introducing Einstein’s relativity. New York: Oxford University Press, 1992

work page 1992
[19]

M. F. Djete, D. Possamai, and X. Tan. McKean–Vlasov optimal control: the dynamic programming principle, Ann. Probab., vol. 50(2), pp. 791–833, 2022

work page 2022
[20]

M. F. Djete, D. Possamai, and X. Tan. McKean–Vlasov optimal control: limit theory and equivalence between different formulations, Math. Oper. Res., arXiv:2001.00925, 2020

work page arXiv 2001
[21]

X. Feng, Y . Hu, and J. Huang. A unified approach to linear-quadratic Gaussian mean-field team: homogene- ity, heterogeneity and quasi-exchangeability,Ann. Appl. Probab., 2024 (avail. online)

work page 2024
[22]

Hairer and G

E. Hairer and G. Wanner, Analysis by Its History, New York: Springer, 2008

work page 2008
[23]

M. H. Holmes. Introduction to perturbation methods, 2nd ed., New York: Springer, 2013

work page 2013
[24]

Huang, B.-C

J. Huang, B.-C. Wang, and J. Yong. Social optima in mean field linear-quadratic-Gaussian control with volatility uncertainty. SIAM J. Control Optim., vol. 59, no. 2, pp. 835-856, 2021

work page 2021
[25]

Huang, P.E

M. Huang, P.E. Caines, and R.P. Malham´e. Social optima in mean field LQG control: centralized and decen- tralized strategies. IEEE Trans. Autom. Control, vol. 57, no. 7, pp. 1736–1751, 2012

work page 2012
[26]

Huang and S.L

M. Huang and S.L. Nguyen. Linear-quadratic mean field teams with a major agent, Proc. 55th IEEE CDC, Las Vegas, Dec. 2016, pp. 6958–6963

work page 2016
[27]

Huang, S.-J

M. Huang, S.-J. Sheu, and L.-H. Sun. Mean field social optimization: feedback person-by-person optimality and the master equation. Proc. 59th IEEE CDC, Jeju Island, Korea, Dec. 2020, pp. 4921–4926

work page 2020
[28]

Huang and X

M. Huang and X. Yang. Linear quadratic mean field social optimization: asymptotic solvability and decen- tralized control. Applied Math. Optim., vol. 80, pp. 1969–2010, 2021

work page 1969
[29]

Y .-C. Ho. Team decision theory and information structures. Proc. IEEE, vol. 68, no. 6, pp. 644–654, June 1980. MEAN FIELD SOCIAL OPTIMIZATION: PERSON-BY-PERSON OPTIMALITY 27

work page 1980
[30]

D. Lacker. Limit theory for controlled McKean–Vlasov dynamics. SIAM J. Control Optim. , vol. 55, pp. 1641–1672, 2017

work page 2017
[31]

Nuno and B

G. Nuno and B. Moll. Social optima in economies with heterogeneous agents.Review of Economic Dynamics, vol. 28, pp. 150–180, 2018

work page 2018
[32]

Pham and X

H. Pham and X. Wei. Dynamic programming for optimal control of stochastic Mckean–Vlasov dynamics. SIAM Journal on Control and Optimization, 55(2):1069–1101, 2017

work page 2017
[33]

Piccoli, F

B. Piccoli, F. Rossi, and E. Trelat. Control to flocking of the kinetic Cucker–Smale model. SIAM J. Math. Anal., vol. 47, no. 6, pp. 4685–4719, 2015

work page 2015
[34]

Salhab, J

R. Salhab, J. L. Ny, and R. P. Malham ´e. Dynamic collective choice: social optima. IEEE Trans. Autom. Control, 63(10):3487–3494, Oct. 2018

work page 2018
[35]

W. E. Schmitendorf and G. Moriarty. A sufficiency condition for coalitive Pareto-optimal solutions.J. Optim. Theory Appl., vol. 18, no. 1, pp. 93–102, 1976

work page 1976
[36]

N. Sen, M. Huang, and R. P. Malham´e. Mean field social control with decentralized strategies and optimality characterization. Proc. the 55th IEEE CDC, Las Vegas, NV , pp. 6056–6061, Dec. 2016

work page 2016
[37]

Talbi, N

M. Talbi, N. Touzi, and J. Zhang. Dynamic programming equation for the mean field optimal stopping problem. arXiv:2103.05736, 2021

work page arXiv 2021
[38]

Villani, Optimal Transport: Old and New, Berlin: Springer, 2009

C. Villani, Optimal Transport: Old and New, Berlin: Springer, 2009

work page 2009
[39]

P. R. de Waal and J. H. van Schuppen. A class of team problems with discrete action spaces: optimality conditions based on multimodularity. SIAM J. Control Optim., vol. 38, no. 3, pp. 875–892, 2000

work page 2000
[40]

Wang and J.-F

B.-C. Wang and J.-F. Zhang. Social optima in mean field linear-quadratic-Gaussian models with Markov jump parameters, SIAM J. Control Optim., vol. 55, pp. 429–456, 2017

work page 2017
[41]

Wu and J

C. Wu and J. Zhang. Viscosity solutions to parabolic master equations and McKean–Vlasov SDEs with closed-loop controls, Ann. Appl. Probab., vol. 30, no. 2, pp. 936–986, 2020. APPENDIX A: A FORMAL DERIVATION OF THE MASTER EQUATION OF V This appendix considers a more general model with diffusion coefficientsσ(X i t , ui t, µ−i t ) and σ0(X i t , ui t, µ−i t...

work page 2020
[42]

This eliminates G1 3 as a common part of both sides of (C.3). Now we only need to show Ψ2 V = Ψ2 U ,(C.4) where Ψ2 V :=1 2 ⟨µ⊗2(dydz), Tr[∂2 z δµµV (t, y, µ; z, x)(Σ + Σ0)]⟩ + ⟨µ⊗2(dydz), Tr[∂yz δµµV (t, y, µ; z, x)Σ0]⟩ + 1 2 ⟨µ⊗3(dydzdw), Tr[∂zwδµµµV (t, y, µ; z, w, x)Σ0]⟩ − 1 2 ⟨µ(dy), Tr{[∂2 yδµV (t, y, µ; x)]|x=y(Σ + Σ0)}⟩ − 1 2 ⟨µ⊗2(dydz), Tr{∂2 z δµ...

work page

[1] [1]

Adler, M

R. Adler, M. Bazin, and M. Schiffer. Introduction to General Relativity, New York: McGraw–Hill, 1975

work page 1975

[2] [2]

Ambrosio, N

L. Ambrosio, N. Gigli, and G. Savare. Gradient flows in metric spaces and the Wasserstein spaces of proba- bility measures, Lectures in Mathematics, ETH Zurich: Birkhauser, 2005

work page 2005

[3] [3]

Arabneydi and A

J. Arabneydi and A. Mahajan. Team-optimal solution of finite number of mean-field coupled LQG subsys- tems. Proc. 54th IEEE CDC, Osaka, Japan, pp. 5308–5313, Dec. 2015

work page 2015

[4] [4]

Balandat and C

M. Balandat and C. J. Tomlin. On efficiency in mean field differential games. Proc. American Control Con- ference, Washington, DC, pp. 2527–2532, June 2013

work page 2013

[5] [5]

Bayraktar, A

E. Bayraktar, A. Cosso, and H. Pham. Randomized dynamic programming principle and Feynman–Kac representation for optimal control of Mckean–Vlasov dynamics. Trans. American Mathematical Society , 370(3):2115–2160, 2018

work page 2018

[6] [6]

Bensoussan, J

A. Bensoussan, J. Frehse, and P. Yam. Mean Field Games and Mean Field Type Control Theory, New York: Springer, 2013

work page 2013

[7] [7]

P. E. Caines, M. Huang, and R. P. Malham´e, Mean Field Games. In Handbook of Dynamic Game Theory, T. Basar and G. Zaccour Eds., pp. 345–372, Berlin: Springer, 2017

work page 2017

[8] [8]

Cardaliaguet, S

P. Cardaliaguet, S. Daudin, J. Jackson, and P. Souganidis. An algebraic convergence rate for the optimal control of Mckean–Vlasov dynamics, arXiv:2203.14554, 2022

work page arXiv 2022

[9] [9]

Cardaliaguet, F

P. Cardaliaguet, F. Delarue, J.-M. Lasry, and P.-L. Lions. The master equation and the convergence problem in mean field games, Princeton University Press, 2019

work page 2019

[10] [10]

Cardaliaguet and C

P. Cardaliaguet and C. Rainer. On the (in)efficiency of MFG equilibria. SIAM Journal on Control and Opti- mization, 57(4):2292–2314, 2019

work page 2019

[11] [11]

Cavallazzi

T. Cavallazzi. It ˆo–Krylov formula for a flow of measures. arXiv:2110.05251, 2021

work page arXiv 2021

[12] [12]

Charalambous and N

C.D. Charalambous and N. U. Ahmed. Centralized versus decentralized optimization of distributed stochastic differential decision systems with different information structures Part II: applications. IEEE Trans. Autom. Control, vol. 63, no. 7, pp. 1913–1928, 2018

work page 1913

[13] [13]

Carmona and F

R. Carmona and F. Delarue. Forward-backward stochastic differential equations and controlled McKean– Vlasov dynamics, Annals of Probability, 43(5): 2647–2700, 2015

work page 2015

[14] [14]

Carmona and F

R. Carmona and F. Delarue. Probabilistic Theory of Mean Field Games with Applications, vol I and II, Cham: Springer, 2018

work page 2018

[15] [15]

Carmona, J.-P

R. Carmona, J.-P. Fouque, and L.-H. Sun. Mean field games and systemic risk. Communications in Mathe- matical Sciences, vol. 13, no. 4, pp. 911–933, 2015

work page 2015

[16] [16]

Chassagneux, D

J.-F. Chassagneux, D. Crisan, and F. Delarue. A probabilistic approach to classical solutions of the master equation for large population equilibria, Memoirs of AMS, 2022

work page 2022

[17] [17]

Y . Chen, A. Busic, and S. P. Meyn. State estimation for the individual and the population in mean field control with application to demand dispatch. IEEE Trans. Autom. Control, vol. 62, pp. 1138–1149, Mar. 2017

work page 2017

[18] [18]

R. A. d’Inverno. Introducing Einstein’s relativity. New York: Oxford University Press, 1992

work page 1992

[19] [19]

M. F. Djete, D. Possamai, and X. Tan. McKean–Vlasov optimal control: the dynamic programming principle, Ann. Probab., vol. 50(2), pp. 791–833, 2022

work page 2022

[20] [20]

M. F. Djete, D. Possamai, and X. Tan. McKean–Vlasov optimal control: limit theory and equivalence between different formulations, Math. Oper. Res., arXiv:2001.00925, 2020

work page arXiv 2001

[21] [21]

X. Feng, Y . Hu, and J. Huang. A unified approach to linear-quadratic Gaussian mean-field team: homogene- ity, heterogeneity and quasi-exchangeability,Ann. Appl. Probab., 2024 (avail. online)

work page 2024

[22] [22]

Hairer and G

E. Hairer and G. Wanner, Analysis by Its History, New York: Springer, 2008

work page 2008

[23] [23]

M. H. Holmes. Introduction to perturbation methods, 2nd ed., New York: Springer, 2013

work page 2013

[24] [24]

Huang, B.-C

J. Huang, B.-C. Wang, and J. Yong. Social optima in mean field linear-quadratic-Gaussian control with volatility uncertainty. SIAM J. Control Optim., vol. 59, no. 2, pp. 835-856, 2021

work page 2021

[25] [25]

Huang, P.E

M. Huang, P.E. Caines, and R.P. Malham´e. Social optima in mean field LQG control: centralized and decen- tralized strategies. IEEE Trans. Autom. Control, vol. 57, no. 7, pp. 1736–1751, 2012

work page 2012

[26] [26]

Huang and S.L

M. Huang and S.L. Nguyen. Linear-quadratic mean field teams with a major agent, Proc. 55th IEEE CDC, Las Vegas, Dec. 2016, pp. 6958–6963

work page 2016

[27] [27]

Huang, S.-J

M. Huang, S.-J. Sheu, and L.-H. Sun. Mean field social optimization: feedback person-by-person optimality and the master equation. Proc. 59th IEEE CDC, Jeju Island, Korea, Dec. 2020, pp. 4921–4926

work page 2020

[28] [28]

Huang and X

M. Huang and X. Yang. Linear quadratic mean field social optimization: asymptotic solvability and decen- tralized control. Applied Math. Optim., vol. 80, pp. 1969–2010, 2021

work page 1969

[29] [29]

Y .-C. Ho. Team decision theory and information structures. Proc. IEEE, vol. 68, no. 6, pp. 644–654, June 1980. MEAN FIELD SOCIAL OPTIMIZATION: PERSON-BY-PERSON OPTIMALITY 27

work page 1980

[30] [30]

D. Lacker. Limit theory for controlled McKean–Vlasov dynamics. SIAM J. Control Optim. , vol. 55, pp. 1641–1672, 2017

work page 2017

[31] [31]

Nuno and B

G. Nuno and B. Moll. Social optima in economies with heterogeneous agents.Review of Economic Dynamics, vol. 28, pp. 150–180, 2018

work page 2018

[32] [32]

Pham and X

H. Pham and X. Wei. Dynamic programming for optimal control of stochastic Mckean–Vlasov dynamics. SIAM Journal on Control and Optimization, 55(2):1069–1101, 2017

work page 2017

[33] [33]

Piccoli, F

B. Piccoli, F. Rossi, and E. Trelat. Control to flocking of the kinetic Cucker–Smale model. SIAM J. Math. Anal., vol. 47, no. 6, pp. 4685–4719, 2015

work page 2015

[34] [34]

Salhab, J

R. Salhab, J. L. Ny, and R. P. Malham ´e. Dynamic collective choice: social optima. IEEE Trans. Autom. Control, 63(10):3487–3494, Oct. 2018

work page 2018

[35] [35]

W. E. Schmitendorf and G. Moriarty. A sufficiency condition for coalitive Pareto-optimal solutions.J. Optim. Theory Appl., vol. 18, no. 1, pp. 93–102, 1976

work page 1976

[36] [36]

N. Sen, M. Huang, and R. P. Malham´e. Mean field social control with decentralized strategies and optimality characterization. Proc. the 55th IEEE CDC, Las Vegas, NV , pp. 6056–6061, Dec. 2016

work page 2016

[37] [37]

Talbi, N

M. Talbi, N. Touzi, and J. Zhang. Dynamic programming equation for the mean field optimal stopping problem. arXiv:2103.05736, 2021

work page arXiv 2021

[38] [38]

Villani, Optimal Transport: Old and New, Berlin: Springer, 2009

C. Villani, Optimal Transport: Old and New, Berlin: Springer, 2009

work page 2009

[39] [39]

P. R. de Waal and J. H. van Schuppen. A class of team problems with discrete action spaces: optimality conditions based on multimodularity. SIAM J. Control Optim., vol. 38, no. 3, pp. 875–892, 2000

work page 2000

[40] [40]

Wang and J.-F

B.-C. Wang and J.-F. Zhang. Social optima in mean field linear-quadratic-Gaussian models with Markov jump parameters, SIAM J. Control Optim., vol. 55, pp. 429–456, 2017

work page 2017

[41] [41]

Wu and J

C. Wu and J. Zhang. Viscosity solutions to parabolic master equations and McKean–Vlasov SDEs with closed-loop controls, Ann. Appl. Probab., vol. 30, no. 2, pp. 936–986, 2020. APPENDIX A: A FORMAL DERIVATION OF THE MASTER EQUATION OF V This appendix considers a more general model with diffusion coefficientsσ(X i t , ui t, µ−i t ) and σ0(X i t , ui t, µ−i t...

work page 2020

[42] [42]

This eliminates G1 3 as a common part of both sides of (C.3). Now we only need to show Ψ2 V = Ψ2 U ,(C.4) where Ψ2 V :=1 2 ⟨µ⊗2(dydz), Tr[∂2 z δµµV (t, y, µ; z, x)(Σ + Σ0)]⟩ + ⟨µ⊗2(dydz), Tr[∂yz δµµV (t, y, µ; z, x)Σ0]⟩ + 1 2 ⟨µ⊗3(dydzdw), Tr[∂zwδµµµV (t, y, µ; z, w, x)Σ0]⟩ − 1 2 ⟨µ(dy), Tr{[∂2 yδµV (t, y, µ; x)]|x=y(Σ + Σ0)}⟩ − 1 2 ⟨µ⊗2(dydz), Tr{∂2 z δµ...

work page