Efficient Solution of Generalized Sylvester Equations via Preconditioned Alternating Anderson Acceleration

Chun-Hua Zhang; Hongjia Chen; Lei Du; Zhongming Teng

arxiv: 2604.15766 · v1 · submitted 2026-04-17 · 🧮 math.NA · cs.NA

Efficient Solution of Generalized Sylvester Equations via Preconditioned Alternating Anderson Acceleration

Hongjia Chen , Chun-Hua Zhang , Zhongming Teng , Lei Du This is my paper

Pith reviewed 2026-05-10 08:46 UTC · model grok-4.3

classification 🧮 math.NA cs.NA

keywords generalized Sylvester equationsAnderson accelerationpreconditioningNeumann seriesiterative methodsmatrix equationsnumerical linear algebra

0 comments

The pith

A preconditioned alternating Anderson acceleration method solves generalized Sylvester equations more efficiently by accelerating fixed-point iterations with a Neumann series approximation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces the preconditioned-alternating Anderson acceleration (P-aAA) method for generalized Sylvester matrix equations that arise in scientific applications. It alternates between preconditioned fixed-point iterations and Anderson acceleration updates using a first-order Neumann series approximation for the preconditioner. This targets cases where the spectral radius of the associated operator is large but at most 1, which slows standard iterations. The authors prove that the preconditioner improves the convergence rate without raising overall computational complexity. Numerical tests on medium- and large-scale problems show the method reduces iteration counts and run times compared with existing approaches.

Core claim

The central claim is that the P-aAA method, which combines a matrix-oriented variant of Anderson acceleration with a novel preconditioning operator based on a first-order Neumann series approximation, efficiently solves generalized Sylvester equations. By alternating between preconditioned fixed-point iterations and acceleration updates, the scheme reduces both iteration count and computational cost. Theoretical analysis establishes that the preconditioning accelerates convergence without increasing overall complexity, and extensive experiments confirm consistent outperformance over state-of-the-art methods on medium- and large-scale instances.

What carries the argument

The key machinery is the preconditioning operator derived from a first-order Neumann series approximation to the inverse of the fixed-point operator, which enhances the convergence rate of the alternating Anderson acceleration scheme for generalized Sylvester equations.

Load-bearing premise

The spectral radius of the associated operator is large but not greater than 1, allowing the fixed-point iteration base to converge and the Neumann preconditioner to be effective for general coefficient matrices.

What would settle it

A concrete counterexample would be a test problem with spectral radius at most 1 on which the preconditioned method requires the same or more iterations and time than the non-preconditioned alternating Anderson acceleration version.

Figures

Figures reproduced from arXiv: 2604.15766 by Chun-Hua Zhang, Hongjia Chen, Lei Du, Zhongming Teng.

**Figure 2.** Figure 2: The iteration numbers and relative residual error of the pro [PITH_FULL_IMAGE:figures/full_fig_p016_2.png] view at source ↗

**Figure 3.** Figure 3: The iteration numbers and relative residual error of the pro [PITH_FULL_IMAGE:figures/full_fig_p017_3.png] view at source ↗

read the original abstract

This paper considers the numerical solution of generalized Sylvester matrix equations, which arise in many scientific and engineering applications but remain challenging to solve efficiently, particularly when the coefficient matrices are general and the spectral radius of the associated operator is large but not greater than $1$. We propose a new iterative method, termed preconditioned-alternating Anderson acceleration (P-aAA), which combines a matrix-oriented variant of Anderson acceleration (AA) with a novel preconditioning strategy. The method alternates between preconditioned fixed-point iterations and Anderson acceleration updates, thereby reducing both computational cost and iteration count. A key contribution is the development of an efficient preconditioning operator based on a first-order Neumann series approximation, which avoids expensive operator inversions while enhancing convergence. We theoretically prove that the proposed preconditioning operator accelerates the convergence rate without increasing the overall computational complexity. Extensive numerical experiments further demonstrate that the proposed approach consistently outperforms existing state-of-the-art methods for both medium- and large-scale problems, achieving substantial reductions in computation time and iteration number.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a practical P-aAA method for generalized Sylvester equations that uses a cheap Neumann preconditioner inside an alternating Anderson scheme and shows it cuts iterations without raising dominant cost.

read the letter

The core idea is straightforward: alternate a first-order Neumann-preconditioned fixed-point step with Anderson acceleration updates on the generalized Sylvester operator. The preconditioner is built to avoid any matrix inversions, and the authors prove it improves the convergence rate when the spectral radius sits between zero and one but is not tiny. Numerical runs on medium and large test cases then report fewer iterations and lower wall-clock time than several existing solvers.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The ledger captures the key domain assumption on spectral radius and the tunable parameters implied by the method description; no new entities are postulated.

free parameters (2)

Anderson acceleration depth m
Number of previous iterates retained for the acceleration update; chosen by the user or tuned for performance.
Neumann series truncation order
Order at which the infinite series for the preconditioner is cut off; balances accuracy and cost.

axioms (1)

domain assumption Spectral radius of the fixed-point operator is at most 1
Invoked to guarantee convergence of the base iteration and effectiveness of the preconditioner.

pith-pipeline@v0.9.0 · 5479 in / 1259 out tokens · 45380 ms · 2026-05-10T08:46:14.225092+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

32 extracted references · 32 canonical work pages

[1]

H. An, X. Jia, and H. F. W alker , Anderson acceleration and application to the three- temperature energy equations, Journal of Computational Physics, 347 (2017), pp. 1–19

work page 2017
[2]

D. G. Anderson , Iterative procedures for nonlinear integral equations , Journal of the ACM (JACM), 12 (1965), pp. 547–560

work page 1965
[3]

Anderson acceleration, mixing and extrapolat ion

D. G. Anderson , Comments on “Anderson acceleration, mixing and extrapolat ion”, Numerical Algorithms, 80 (2019), pp. 135–234

work page 2019
[4]

Bai and D

Z. Bai and D. Skoogh , A projection method for model reduction of bilinear dynamic al systems , Linear Algebra and its Applications, 415 (2006), pp. 406–42 5

work page 2006
[5]

R. H. Bartels and G. W. Stew art , Algorithm 432: Solution of the matrix equation AX+ XB= C , Communications of the ACM, 15 (1972), pp. 820–826

work page 1972
[6]

Benner and T

P. Benner and T. Breiten , Low rank methods for a class of generalized Lyapunov equatio ns and related issues , Numerische Mathematik, 124 (2013), pp. 441–470

work page 2013
[7]

Benner and T

P. Benner and T. Damm , Lyapunov equations, energy functionals, and model order re duction of bilinear and stochastic systems , SIAM Journal on Control and Optimization, 49 (2011), pp. 686–711

work page 2011
[8]

Benner, R.-C

P. Benner, R.-C. Li, and N. Truhar , On the ADI method for Sylvester equations , Journal of Computational and Applied Mathematics, 233 (2009), pp. 1 035–1045

work page 2009
[9]

Bioli, D

I. Bioli, D. Kressner, and L. Robol , Preconditioned low-rank Riemannian optimization for symmetric positive deﬁnite linear matrix equations , SIAM Journal on Scientiﬁc Computing, 47 (2025), pp. A1091–A1116

work page 2025
[10]

Breiten and E

T. Breiten and E. Ringh , Residual-based iterations for the generalized Lyapunov eq uation, BIT Numerical Mathematics, 59 (2019), pp. 823–852

work page 2019
[11]

Damm , Direct methods and ADI-preconditioned Krylov subspace met hods for generalized Lyapunov equations , Numerical Linear Algebra with Applications, 15 (2008), pp

T. Damm , Direct methods and ADI-preconditioned Krylov subspace met hods for generalized Lyapunov equations , Numerical Linear Algebra with Applications, 15 (2008), pp . 853–871

work page 2008
[12]

J. W. Demmel , Applied Numerical Linear Algebra , SIAM, 1997

work page 1997
[13]

Ganine, N

V. Ganine, N. Hills, and B. Lapworth , Nonlinear acceleration of coupled ﬂuid–structure transient thermal problems by Anderson mixing , International Journal for Numerical Meth- ods in Fluids, 71 (2013), pp. 939–959

work page 2013
[14]

Hao and V

Y. Hao and V. Simoncini , The Sherman–Morrison–Woodbury formula for generalized li near matrix equations and applications , Numerical Linear Algebra with Applications, 28 (2021), p. e2384

work page 2021
[15]

Jarlebring, G

E. Jarlebring, G. Mele, D. Palitta, and E. Ringh , Krylov methods for low-rank commuting generalized Sylvester equations , Numerical Linear Algebra with Applications, 25 (2018), p. e2176

work page 2018
[16]

Kressner and P

D. Kressner and P. Sirkovi ´c, Truncated low-rank methods for solving general linear matr ix equations, Numerical Linear Algebra with Applications, 22 (2015), pp . 564–583

work page 2015
[17]

Y. Lin, L. Bao, and Y. Wei , Order reduction of bilinear MIMO dynamical systems using ne w block Krylov subspaces, Computers & Mathematics with Applications, 58 (2009), pp. 1093– 1102

work page 2009
[18]

Mai and M

V. Mai and M. Johansson , Anderson acceleration of proximal gradient methods , in Interna- tional Conference on Machine Learning, PMLR, 2020, pp. 6620 –6629

work page 2020
[19]

Palitta, M

D. Palitta, M. Iannacito, and V. Simoncini , A subspace-conjugate gradient method for linear matrix equations , SIAM Journal on Matrix Analysis and Applications, 46 (2025 ), pp. 2197–2225

work page 2025
[20]

Palitta and V

D. Palitta and V. Simoncini , Matrix-equation-based strategies for convection–diﬀusi on equa- tions, BIT Numerical Mathematics, 56 (2016), pp. 751–776

work page 2016
[21]

F. A. Potra and H. Engler , A characterization of the behavior of the Anderson accelera tion on linear problems , Linear Algebra and its Applications, 438 (2013), pp. 1002– 1011

work page 2013
[22]

Saad , Iterative methods for sparse linear systems , SIAM, 2003

Y. Saad , Iterative methods for sparse linear systems , SIAM, 2003

work page 2003
[23]

Saad , Acceleration methods for ﬁxed-point iterations , Acta Numerica, 34 (2025), pp

Y. Saad , Acceleration methods for ﬁxed-point iterations , Acta Numerica, 34 (2025), pp. 809– 890

work page 2025
[24]

S. D. Shank, V. Simoncini, and D. B. Szyld , Eﬃcient low-rank solution of generalized Lya- punov equations , Numerische Mathematik, 134 (2016), pp. 327–342

work page 2016
[25]

Simoncini , Computational methods for linear matrix equations , SIAM Review, 58 (2016), pp

V. Simoncini , Computational methods for linear matrix equations , SIAM Review, 58 (2016), pp. 377–441. 19

work page 2016
[26]

Simoncini and Y

V. Simoncini and Y. Hao , Analysis of the truncated conjugate gradient method for lin ear matrix equations, SIAM Journal on Matrix Analysis and Applications, 44 (2023 ), pp. 359– 381

work page 2023
[27]

H. D. Sterck and Y. He , On the asymptotic linear convergence speed of anderson acce leration, Nesterov acceleration, and nonlinear gmres , SIAM Journal on Scientiﬁc Computing, 43 (2021), pp. S21–S46

work page 2021
[28]

Toth and C

A. Toth and C. T. Kelley , Convergence analysis for Anderson acceleration , SIAM Journal on Numerical Analysis, 53 (2015), pp. 805–819

work page 2015
[29]

Voet , Preconditioning techniques for generalized Sylvester mat rix equations , Numerical Linear Algebra with Applications, 32 (2025), p

Y. Voet , Preconditioning techniques for generalized Sylvester mat rix equations , Numerical Linear Algebra with Applications, 32 (2025), p. e70020

work page 2025
[30]

H. F. W alker, Anderson acceleration: Algorithms and implementations , WPI Math. Sciences Dept. Report MS-6-15-50, (2011)

work page 2011
[31]

H. F. W alker and P. Ni , Anderson acceleration for ﬁxed-point iterations , SIAM Journal on Numerical Analysis, 49 (2011), pp. 1715–1735

work page 2011
[32]

F. Wei, C. Bao, and Y. Liu , Stochastic Anderson mixing for nonconvex stochastic optim iza- tion, Advances in Neural Information Processing Systems, 34 (20 21), pp. 22995–23008. 20

work page

[1] [1]

H. An, X. Jia, and H. F. W alker , Anderson acceleration and application to the three- temperature energy equations, Journal of Computational Physics, 347 (2017), pp. 1–19

work page 2017

[2] [2]

D. G. Anderson , Iterative procedures for nonlinear integral equations , Journal of the ACM (JACM), 12 (1965), pp. 547–560

work page 1965

[3] [3]

Anderson acceleration, mixing and extrapolat ion

D. G. Anderson , Comments on “Anderson acceleration, mixing and extrapolat ion”, Numerical Algorithms, 80 (2019), pp. 135–234

work page 2019

[4] [4]

Bai and D

Z. Bai and D. Skoogh , A projection method for model reduction of bilinear dynamic al systems , Linear Algebra and its Applications, 415 (2006), pp. 406–42 5

work page 2006

[5] [5]

R. H. Bartels and G. W. Stew art , Algorithm 432: Solution of the matrix equation AX+ XB= C , Communications of the ACM, 15 (1972), pp. 820–826

work page 1972

[6] [6]

Benner and T

P. Benner and T. Breiten , Low rank methods for a class of generalized Lyapunov equatio ns and related issues , Numerische Mathematik, 124 (2013), pp. 441–470

work page 2013

[7] [7]

Benner and T

P. Benner and T. Damm , Lyapunov equations, energy functionals, and model order re duction of bilinear and stochastic systems , SIAM Journal on Control and Optimization, 49 (2011), pp. 686–711

work page 2011

[8] [8]

Benner, R.-C

P. Benner, R.-C. Li, and N. Truhar , On the ADI method for Sylvester equations , Journal of Computational and Applied Mathematics, 233 (2009), pp. 1 035–1045

work page 2009

[9] [9]

Bioli, D

I. Bioli, D. Kressner, and L. Robol , Preconditioned low-rank Riemannian optimization for symmetric positive deﬁnite linear matrix equations , SIAM Journal on Scientiﬁc Computing, 47 (2025), pp. A1091–A1116

work page 2025

[10] [10]

Breiten and E

T. Breiten and E. Ringh , Residual-based iterations for the generalized Lyapunov eq uation, BIT Numerical Mathematics, 59 (2019), pp. 823–852

work page 2019

[11] [11]

Damm , Direct methods and ADI-preconditioned Krylov subspace met hods for generalized Lyapunov equations , Numerical Linear Algebra with Applications, 15 (2008), pp

T. Damm , Direct methods and ADI-preconditioned Krylov subspace met hods for generalized Lyapunov equations , Numerical Linear Algebra with Applications, 15 (2008), pp . 853–871

work page 2008

[12] [12]

J. W. Demmel , Applied Numerical Linear Algebra , SIAM, 1997

work page 1997

[13] [13]

Ganine, N

V. Ganine, N. Hills, and B. Lapworth , Nonlinear acceleration of coupled ﬂuid–structure transient thermal problems by Anderson mixing , International Journal for Numerical Meth- ods in Fluids, 71 (2013), pp. 939–959

work page 2013

[14] [14]

Hao and V

Y. Hao and V. Simoncini , The Sherman–Morrison–Woodbury formula for generalized li near matrix equations and applications , Numerical Linear Algebra with Applications, 28 (2021), p. e2384

work page 2021

[15] [15]

Jarlebring, G

E. Jarlebring, G. Mele, D. Palitta, and E. Ringh , Krylov methods for low-rank commuting generalized Sylvester equations , Numerical Linear Algebra with Applications, 25 (2018), p. e2176

work page 2018

[16] [16]

Kressner and P

D. Kressner and P. Sirkovi ´c, Truncated low-rank methods for solving general linear matr ix equations, Numerical Linear Algebra with Applications, 22 (2015), pp . 564–583

work page 2015

[17] [17]

Y. Lin, L. Bao, and Y. Wei , Order reduction of bilinear MIMO dynamical systems using ne w block Krylov subspaces, Computers & Mathematics with Applications, 58 (2009), pp. 1093– 1102

work page 2009

[18] [18]

Mai and M

V. Mai and M. Johansson , Anderson acceleration of proximal gradient methods , in Interna- tional Conference on Machine Learning, PMLR, 2020, pp. 6620 –6629

work page 2020

[19] [19]

Palitta, M

D. Palitta, M. Iannacito, and V. Simoncini , A subspace-conjugate gradient method for linear matrix equations , SIAM Journal on Matrix Analysis and Applications, 46 (2025 ), pp. 2197–2225

work page 2025

[20] [20]

Palitta and V

D. Palitta and V. Simoncini , Matrix-equation-based strategies for convection–diﬀusi on equa- tions, BIT Numerical Mathematics, 56 (2016), pp. 751–776

work page 2016

[21] [21]

F. A. Potra and H. Engler , A characterization of the behavior of the Anderson accelera tion on linear problems , Linear Algebra and its Applications, 438 (2013), pp. 1002– 1011

work page 2013

[22] [22]

Saad , Iterative methods for sparse linear systems , SIAM, 2003

Y. Saad , Iterative methods for sparse linear systems , SIAM, 2003

work page 2003

[23] [23]

Saad , Acceleration methods for ﬁxed-point iterations , Acta Numerica, 34 (2025), pp

Y. Saad , Acceleration methods for ﬁxed-point iterations , Acta Numerica, 34 (2025), pp. 809– 890

work page 2025

[24] [24]

S. D. Shank, V. Simoncini, and D. B. Szyld , Eﬃcient low-rank solution of generalized Lya- punov equations , Numerische Mathematik, 134 (2016), pp. 327–342

work page 2016

[25] [25]

Simoncini , Computational methods for linear matrix equations , SIAM Review, 58 (2016), pp

V. Simoncini , Computational methods for linear matrix equations , SIAM Review, 58 (2016), pp. 377–441. 19

work page 2016

[26] [26]

Simoncini and Y

V. Simoncini and Y. Hao , Analysis of the truncated conjugate gradient method for lin ear matrix equations, SIAM Journal on Matrix Analysis and Applications, 44 (2023 ), pp. 359– 381

work page 2023

[27] [27]

H. D. Sterck and Y. He , On the asymptotic linear convergence speed of anderson acce leration, Nesterov acceleration, and nonlinear gmres , SIAM Journal on Scientiﬁc Computing, 43 (2021), pp. S21–S46

work page 2021

[28] [28]

Toth and C

A. Toth and C. T. Kelley , Convergence analysis for Anderson acceleration , SIAM Journal on Numerical Analysis, 53 (2015), pp. 805–819

work page 2015

[29] [29]

Voet , Preconditioning techniques for generalized Sylvester mat rix equations , Numerical Linear Algebra with Applications, 32 (2025), p

Y. Voet , Preconditioning techniques for generalized Sylvester mat rix equations , Numerical Linear Algebra with Applications, 32 (2025), p. e70020

work page 2025

[30] [30]

H. F. W alker, Anderson acceleration: Algorithms and implementations , WPI Math. Sciences Dept. Report MS-6-15-50, (2011)

work page 2011

[31] [31]

H. F. W alker and P. Ni , Anderson acceleration for ﬁxed-point iterations , SIAM Journal on Numerical Analysis, 49 (2011), pp. 1715–1735

work page 2011

[32] [32]

F. Wei, C. Bao, and Y. Liu , Stochastic Anderson mixing for nonconvex stochastic optim iza- tion, Advances in Neural Information Processing Systems, 34 (20 21), pp. 22995–23008. 20

work page