pith. sign in

arxiv: 2604.15766 · v1 · submitted 2026-04-17 · 🧮 math.NA · cs.NA

Efficient Solution of Generalized Sylvester Equations via Preconditioned Alternating Anderson Acceleration

Pith reviewed 2026-05-10 08:46 UTC · model grok-4.3

classification 🧮 math.NA cs.NA
keywords generalized Sylvester equationsAnderson accelerationpreconditioningNeumann seriesiterative methodsmatrix equationsnumerical linear algebra
0
0 comments X

The pith

A preconditioned alternating Anderson acceleration method solves generalized Sylvester equations more efficiently by accelerating fixed-point iterations with a Neumann series approximation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces the preconditioned-alternating Anderson acceleration (P-aAA) method for generalized Sylvester matrix equations that arise in scientific applications. It alternates between preconditioned fixed-point iterations and Anderson acceleration updates using a first-order Neumann series approximation for the preconditioner. This targets cases where the spectral radius of the associated operator is large but at most 1, which slows standard iterations. The authors prove that the preconditioner improves the convergence rate without raising overall computational complexity. Numerical tests on medium- and large-scale problems show the method reduces iteration counts and run times compared with existing approaches.

Core claim

The central claim is that the P-aAA method, which combines a matrix-oriented variant of Anderson acceleration with a novel preconditioning operator based on a first-order Neumann series approximation, efficiently solves generalized Sylvester equations. By alternating between preconditioned fixed-point iterations and acceleration updates, the scheme reduces both iteration count and computational cost. Theoretical analysis establishes that the preconditioning accelerates convergence without increasing overall complexity, and extensive experiments confirm consistent outperformance over state-of-the-art methods on medium- and large-scale instances.

What carries the argument

The key machinery is the preconditioning operator derived from a first-order Neumann series approximation to the inverse of the fixed-point operator, which enhances the convergence rate of the alternating Anderson acceleration scheme for generalized Sylvester equations.

Load-bearing premise

The spectral radius of the associated operator is large but not greater than 1, allowing the fixed-point iteration base to converge and the Neumann preconditioner to be effective for general coefficient matrices.

What would settle it

A concrete counterexample would be a test problem with spectral radius at most 1 on which the preconditioned method requires the same or more iterations and time than the non-preconditioned alternating Anderson acceleration version.

Figures

Figures reproduced from arXiv: 2604.15766 by Chun-Hua Zhang, Hongjia Chen, Lei Du, Zhongming Teng.

Figure 1
Figure 1. Figure 1: The iteration numbers and relative residual error of the pro [PITH_FULL_IMAGE:figures/full_fig_p015_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: The iteration numbers and relative residual error of the pro [PITH_FULL_IMAGE:figures/full_fig_p016_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: The iteration numbers and relative residual error of the pro [PITH_FULL_IMAGE:figures/full_fig_p017_3.png] view at source ↗
read the original abstract

This paper considers the numerical solution of generalized Sylvester matrix equations, which arise in many scientific and engineering applications but remain challenging to solve efficiently, particularly when the coefficient matrices are general and the spectral radius of the associated operator is large but not greater than $1$. We propose a new iterative method, termed preconditioned-alternating Anderson acceleration (P-aAA), which combines a matrix-oriented variant of Anderson acceleration (AA) with a novel preconditioning strategy. The method alternates between preconditioned fixed-point iterations and Anderson acceleration updates, thereby reducing both computational cost and iteration count. A key contribution is the development of an efficient preconditioning operator based on a first-order Neumann series approximation, which avoids expensive operator inversions while enhancing convergence. We theoretically prove that the proposed preconditioning operator accelerates the convergence rate without increasing the overall computational complexity. Extensive numerical experiments further demonstrate that the proposed approach consistently outperforms existing state-of-the-art methods for both medium- and large-scale problems, achieving substantial reductions in computation time and iteration number.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The ledger captures the key domain assumption on spectral radius and the tunable parameters implied by the method description; no new entities are postulated.

free parameters (2)
  • Anderson acceleration depth m
    Number of previous iterates retained for the acceleration update; chosen by the user or tuned for performance.
  • Neumann series truncation order
    Order at which the infinite series for the preconditioner is cut off; balances accuracy and cost.
axioms (1)
  • domain assumption Spectral radius of the fixed-point operator is at most 1
    Invoked to guarantee convergence of the base iteration and effectiveness of the preconditioner.

pith-pipeline@v0.9.0 · 5479 in / 1259 out tokens · 45380 ms · 2026-05-10T08:46:14.225092+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

32 extracted references · 32 canonical work pages

  1. [1]

    H. An, X. Jia, and H. F. W alker , Anderson acceleration and application to the three- temperature energy equations, Journal of Computational Physics, 347 (2017), pp. 1–19

  2. [2]

    D. G. Anderson , Iterative procedures for nonlinear integral equations , Journal of the ACM (JACM), 12 (1965), pp. 547–560

  3. [3]

    Anderson acceleration, mixing and extrapolat ion

    D. G. Anderson , Comments on “Anderson acceleration, mixing and extrapolat ion”, Numerical Algorithms, 80 (2019), pp. 135–234

  4. [4]

    Bai and D

    Z. Bai and D. Skoogh , A projection method for model reduction of bilinear dynamic al systems , Linear Algebra and its Applications, 415 (2006), pp. 406–42 5

  5. [5]

    R. H. Bartels and G. W. Stew art , Algorithm 432: Solution of the matrix equation AX+ XB= C , Communications of the ACM, 15 (1972), pp. 820–826

  6. [6]

    Benner and T

    P. Benner and T. Breiten , Low rank methods for a class of generalized Lyapunov equatio ns and related issues , Numerische Mathematik, 124 (2013), pp. 441–470

  7. [7]

    Benner and T

    P. Benner and T. Damm , Lyapunov equations, energy functionals, and model order re duction of bilinear and stochastic systems , SIAM Journal on Control and Optimization, 49 (2011), pp. 686–711

  8. [8]

    Benner, R.-C

    P. Benner, R.-C. Li, and N. Truhar , On the ADI method for Sylvester equations , Journal of Computational and Applied Mathematics, 233 (2009), pp. 1 035–1045

  9. [9]

    Bioli, D

    I. Bioli, D. Kressner, and L. Robol , Preconditioned low-rank Riemannian optimization for symmetric positive definite linear matrix equations , SIAM Journal on Scientific Computing, 47 (2025), pp. A1091–A1116

  10. [10]

    Breiten and E

    T. Breiten and E. Ringh , Residual-based iterations for the generalized Lyapunov eq uation, BIT Numerical Mathematics, 59 (2019), pp. 823–852

  11. [11]

    Damm , Direct methods and ADI-preconditioned Krylov subspace met hods for generalized Lyapunov equations , Numerical Linear Algebra with Applications, 15 (2008), pp

    T. Damm , Direct methods and ADI-preconditioned Krylov subspace met hods for generalized Lyapunov equations , Numerical Linear Algebra with Applications, 15 (2008), pp . 853–871

  12. [12]

    J. W. Demmel , Applied Numerical Linear Algebra , SIAM, 1997

  13. [13]

    Ganine, N

    V. Ganine, N. Hills, and B. Lapworth , Nonlinear acceleration of coupled fluid–structure transient thermal problems by Anderson mixing , International Journal for Numerical Meth- ods in Fluids, 71 (2013), pp. 939–959

  14. [14]

    Hao and V

    Y. Hao and V. Simoncini , The Sherman–Morrison–Woodbury formula for generalized li near matrix equations and applications , Numerical Linear Algebra with Applications, 28 (2021), p. e2384

  15. [15]

    Jarlebring, G

    E. Jarlebring, G. Mele, D. Palitta, and E. Ringh , Krylov methods for low-rank commuting generalized Sylvester equations , Numerical Linear Algebra with Applications, 25 (2018), p. e2176

  16. [16]

    Kressner and P

    D. Kressner and P. Sirkovi ´c, Truncated low-rank methods for solving general linear matr ix equations, Numerical Linear Algebra with Applications, 22 (2015), pp . 564–583

  17. [17]

    Y. Lin, L. Bao, and Y. Wei , Order reduction of bilinear MIMO dynamical systems using ne w block Krylov subspaces, Computers & Mathematics with Applications, 58 (2009), pp. 1093– 1102

  18. [18]

    Mai and M

    V. Mai and M. Johansson , Anderson acceleration of proximal gradient methods , in Interna- tional Conference on Machine Learning, PMLR, 2020, pp. 6620 –6629

  19. [19]

    Palitta, M

    D. Palitta, M. Iannacito, and V. Simoncini , A subspace-conjugate gradient method for linear matrix equations , SIAM Journal on Matrix Analysis and Applications, 46 (2025 ), pp. 2197–2225

  20. [20]

    Palitta and V

    D. Palitta and V. Simoncini , Matrix-equation-based strategies for convection–diffusi on equa- tions, BIT Numerical Mathematics, 56 (2016), pp. 751–776

  21. [21]

    F. A. Potra and H. Engler , A characterization of the behavior of the Anderson accelera tion on linear problems , Linear Algebra and its Applications, 438 (2013), pp. 1002– 1011

  22. [22]

    Saad , Iterative methods for sparse linear systems , SIAM, 2003

    Y. Saad , Iterative methods for sparse linear systems , SIAM, 2003

  23. [23]

    Saad , Acceleration methods for fixed-point iterations , Acta Numerica, 34 (2025), pp

    Y. Saad , Acceleration methods for fixed-point iterations , Acta Numerica, 34 (2025), pp. 809– 890

  24. [24]

    S. D. Shank, V. Simoncini, and D. B. Szyld , Efficient low-rank solution of generalized Lya- punov equations , Numerische Mathematik, 134 (2016), pp. 327–342

  25. [25]

    Simoncini , Computational methods for linear matrix equations , SIAM Review, 58 (2016), pp

    V. Simoncini , Computational methods for linear matrix equations , SIAM Review, 58 (2016), pp. 377–441. 19

  26. [26]

    Simoncini and Y

    V. Simoncini and Y. Hao , Analysis of the truncated conjugate gradient method for lin ear matrix equations, SIAM Journal on Matrix Analysis and Applications, 44 (2023 ), pp. 359– 381

  27. [27]

    H. D. Sterck and Y. He , On the asymptotic linear convergence speed of anderson acce leration, Nesterov acceleration, and nonlinear gmres , SIAM Journal on Scientific Computing, 43 (2021), pp. S21–S46

  28. [28]

    Toth and C

    A. Toth and C. T. Kelley , Convergence analysis for Anderson acceleration , SIAM Journal on Numerical Analysis, 53 (2015), pp. 805–819

  29. [29]

    Voet , Preconditioning techniques for generalized Sylvester mat rix equations , Numerical Linear Algebra with Applications, 32 (2025), p

    Y. Voet , Preconditioning techniques for generalized Sylvester mat rix equations , Numerical Linear Algebra with Applications, 32 (2025), p. e70020

  30. [30]

    H. F. W alker, Anderson acceleration: Algorithms and implementations , WPI Math. Sciences Dept. Report MS-6-15-50, (2011)

  31. [31]

    H. F. W alker and P. Ni , Anderson acceleration for fixed-point iterations , SIAM Journal on Numerical Analysis, 49 (2011), pp. 1715–1735

  32. [32]

    F. Wei, C. Bao, and Y. Liu , Stochastic Anderson mixing for nonconvex stochastic optim iza- tion, Advances in Neural Information Processing Systems, 34 (20 21), pp. 22995–23008. 20