The Jordan canonical form of the Fr\'{e}chet derivative of a matrix function and the bivariate Jordan problem
Pith reviewed 2026-05-16 23:42 UTC · model grok-4.3
The pith
The Jordan canonical form of the Fréchet derivative of f(A) is determined by the Jordan form of A and the polynomial f.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Given an algebraically closed field F of characteristic zero, a square matrix A in F^{n x n} and a polynomial f in F[w], the Jordan canonical form of the formal Fréchet derivative of f(A) is determined in terms of the Jordan canonical form of A and of f. The result extends via Hermite interpolation to solve the corresponding problem over the complexes, and partial results are given for the Jordan form of linear combinations of the form sum a_{ij} (X^i ⊗ Y^j).
What carries the argument
The formal Fréchet derivative of the map sending a matrix X to f(X), whose Jordan canonical form is constructed from the Jordan blocks of A using f and its derivatives at the eigenvalues of A.
If this is right
- The Jordan form of the derivative operator can be constructed explicitly from the Jordan blocks of A and the derivatives of f.
- It solves Research Problem 3.11 from Higham's book on functions of matrices for the complex case.
- Bounds on the number and sizes of Jordan blocks are provided for the bivariate Kronecker product case.
- Partial explicit solutions are available for the bivariate problem under specific assumptions.
Where Pith is reading between the lines
- Numerical algorithms for matrix function derivatives can use this structure for efficiency.
- The approach may generalize to analytic matrix functions beyond polynomials.
- The bivariate results could help analyze stability in systems with multiple matrix variables.
Load-bearing premise
The field is algebraically closed of characteristic zero and f is a polynomial.
What would settle it
For a 2x2 Jordan block matrix A with a single eigenvalue and a quadratic polynomial f, explicitly compute the Fréchet derivative operator on 4x4 matrices and check whether its Jordan blocks match the sizes and numbers predicted from A's structure and f's derivatives.
Figures
read the original abstract
Let $\mathbb{F}$ be an algebraically closed field of characteristic $0$. Given a square matrix $A \in \mathbb{F}^{n \times n}$ and a polynomial $f \in \mathbb{F}[w]$, we determine the Jordan canonical form of the formal Fr\'{e}chet derivative of $f(A)$, in terms of that of $A$ and of $f$. When $\mathbb{F}\subseteq \mathbb{C}$, via Hermite interpolation, our result provides a solution to [N.J. Higham, \emph{Functions of Matrices: Theory and Computation}, Research Problem 3.11]. A generalization consists of finding the Jordan canonical form of linear combinations of Kronecker products of powers of two square matrices, i.e., $\sum_{i,j} a_{ij} (X^i \otimes Y^j)$. For this generalization, we provide some new partial results, including a partial solution under certain assumptions and general bounds on the number and the sizes of Jordan blocks.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript determines the Jordan canonical form of the formal Fréchet derivative of a polynomial matrix function f(A) for an n×n matrix A over an algebraically closed field of characteristic zero, expressing the result directly in terms of the Jordan structure of A and the divided differences induced by f. It reduces the problem to single Jordan blocks, applies the known action of the derivative on such blocks, and assembles the global form; when the field is a subfield of the complexes, Hermite interpolation yields a solution to Higham’s Research Problem 3.11. Partial results and bounds on block numbers and sizes are also given for the bivariate generalization consisting of linear combinations of Kronecker products of powers of two matrices.
Significance. If the explicit construction holds, the work supplies a complete, constructive answer to a long-standing open problem in matrix-function theory and supplies concrete data (Jordan block sizes and eigenvalues) that can be used in both theoretical proofs and numerical algorithms for Fréchet derivatives. The bivariate bounds, although limited to special coefficient and spectral regimes, are new and may serve as a starting point for further progress on the general Kronecker-sum Jordan problem.
minor comments (2)
- [Main theorem section] The reduction to a single Jordan block (presumably in the main theorem statement) would be easier to follow if the precise block-wise formula for the Fréchet derivative were displayed as a separate displayed equation before the global assembly argument.
- [Bivariate results] In the bivariate section the precise hypotheses on the coefficients a_{ij} and on the spectra of X and Y that permit the partial Jordan-form description should be stated as a numbered assumption or hypothesis immediately before the corresponding theorem.
Simulated Author's Rebuttal
We thank the referee for the positive assessment of our manuscript and the recommendation for minor revision. The report correctly summarizes our main contribution: an explicit determination of the Jordan canonical form of the Fréchet derivative of a polynomial matrix function f(A) in terms of the Jordan structure of A and the divided differences of f. We are pleased that the work is viewed as supplying a constructive answer to Higham's Research Problem 3.11 and providing new partial results for the bivariate Kronecker case. No specific major comments were raised in the report.
Circularity Check
No significant circularity detected
full rationale
The derivation reduces the problem to the action of the Fréchet derivative on individual Jordan blocks of A, applies the standard divided-difference formula for that action (a known result from matrix function theory), and assembles the global Jordan structure via classical similarity transformations and Hermite interpolation. No equation or step is defined in terms of its own output, no fitted parameter is relabeled as a prediction, and no load-bearing premise rests on a self-citation chain. The bivariate partial results are explicitly qualified by extra assumptions and do not claim a complete closed-form result. The central claim therefore remains an independent explicit construction built on external classical facts.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption F is an algebraically closed field of characteristic 0
Reference graph
Works this paper leans on
-
[1]
A. C. Aitken, The normal form of compound and induced matrices.Proc. London Math. Soc.38:354–376 (1934)
work page 1934
-
[2]
M. S. Berger,Nonlinearity and functional analysis: Lectures on nonlinear problems in mathematical analysis, Academic Press, New York City, NY, USA, 1977
work page 1977
-
[3]
M. Boij, J. Migliore, R.M. Mir´ o-Roig and U. Nagel, The non-Lefschetz locus,J. Algebra 505:288–320 (2018)
work page 2018
-
[4]
R. A. Brualdi, Combinatorial verification of the elementary divisors of tensor products. Linear Algebra Appl.71:31–47 (1985)
work page 1985
-
[5]
I. Gohberg and A. Semencul, On the inversion of finite Toeplitz matrices and their continuous analogs.Mat. Issled.2:201–233 (1972)
work page 1972
-
[6]
N. J. Higham,Functions of Matrices: Theory and Computation, SIAM, Philadelphia, PA, USA, 2008
work page 2008
-
[7]
R. A. Horn and C. R. Johnson,Matrix Analysis, Cambridge University Press, 2013 (Second edition). 27
work page 2013
-
[8]
R. A. Horn and C. R. Johnson,Topics in Matrix Analysis, Cambridge University Press, 1991
work page 1991
-
[9]
D. E. Littlewood, On induced and compound matrices.Proc. London Math. Soc. 39(2):370–381 (1935)
work page 1935
-
[10]
Kressner, Bivariate matrix functions.Oper
D. Kressner, Bivariate matrix functions.Oper. Matrices8(2):449–466 (2014)
work page 2014
-
[11]
M. Marcus and H. Robinson, Elementary divisors of tensor products.Commun. ACM 18:36–39 (1975)
work page 1975
-
[12]
C. W. Norman, On the Jordan form of a family of linear mappings.Linear Algebra Appl., 257:225—241 (1997)
work page 1997
-
[13]
W. E. Roth, On direct product matrices.Bull. Amer. Math. Soc.40:461–468 (1934)
work page 1934
-
[14]
R. P. Stanley,Enumerative Combinatorics, Volume 2, Cambridge University Press, 1999
work page 1999
-
[15]
Weyr, R´ epartition des matrices en esp` eces et formation de toutes les esp` eces.C
E. Weyr, R´ epartition des matrices en esp` eces et formation de toutes les esp` eces.C. R. Acad. Sci. Paris100:966–969 (1885). 28
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.