Self-Supervised Learning for Sparse Matrix Reordering

Fangfang Liu; Huiyuan Li; Shuzi Niu; Tao Yuan; Wenjia Wu; Ziwei Li

arxiv: 2605.17403 · v1 · pith:SDHR6CSUnew · submitted 2026-05-17 · 💻 cs.LG

Self-Supervised Learning for Sparse Matrix Reordering

Ziwei Li , Tao Yuan , Fangfang Liu , Shuzi Niu , Huiyuan Li , Wenjia Wu This is my paper

Pith reviewed 2026-05-20 14:03 UTC · model grok-4.3

classification 💻 cs.LG

keywords sparse matrix reorderingfill-in reductionself-supervised learningLU factorizationgraph neural networkpath triplet inequalitiesmatrix ordering

0 comments

The pith

Self-supervised learning reorders sparse matrices to cut fill-ins by enforcing path triplet inequalities from the Fill-Path Theorem.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to demonstrate that a self-supervised graph network can learn effective row and column permutations for sparse matrices, which matters because the problem of finding fill-in-minimizing orderings is NP-complete yet critical for lowering memory use and runtime during factorization. It starts from the Fill-Path Theorem's revelation that fill-ins arise directly from violations of path triplet inequalities in the matrix graph. A multigrid graph network first extracts structural features for each vertex, after which a sampling strategy draws triplets according to those inequalities and an end-max chain loss trains the model to reduce the number of violations. On the SuiteSparse collection the resulting orderings yield measurable reductions in fill-ins and corresponding speedups in LU factorization time.

Core claim

The Fill-Path Theorem supplies a direct and intrinsic relationship between fill-in generation and the sparse structure of the matrix as path triplet inequalities; these inequalities can be turned into a triplet sampling strategy and an end-max chain loss that, when applied to vertex embeddings from a multigrid graph network, produce orderings whose predicted scores satisfy the inequalities and thereby reduce fill-ins.

What carries the argument

Triplet sampling strategy derived from path triplet inequalities together with the end-max chain loss function that penalizes violations of those inequalities on multigrid graph network embeddings.

If this is right

Orderings generated by the method reduce the number of new nonzeros created during LU factorization on publicly available sparse matrices.
The reduction in fill-ins directly lowers both memory footprint and computation time for subsequent factorizations.
The self-supervised procedure outperforms prior graph-theoretic and deep-learning reorderers that rely on surrogate objectives lacking theoretical ties to fill-in generation.
Because the loss is built directly on the inequalities, the learned ordering respects the structural constraints identified by the Fill-Path Theorem.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same inequality-driven sampling and loss pattern could be tested on related reordering problems such as bandwidth or profile minimization.
Replacing the multigrid network with other graph encoders might reveal whether the performance gain comes mainly from the loss design or from the architecture.
The approach opens the possibility of online reordering during iterative solvers that repeatedly factor updated matrices.

Load-bearing premise

The path triplet inequalities supplied by the Fill-Path Theorem are sufficient, when used for sampling and loss design, to train orderings that actually minimize fill-ins during factorization.

What would settle it

Running the learned orderings on the SuiteSparse collection and finding that they produce more fill-ins or longer LU factorization times than established heuristics such as AMD or METIS would show the method does not deliver the claimed reductions.

Figures

Figures reproduced from arXiv: 2605.17403 by Fangfang Liu, Huiyuan Li, Shuzi Niu, Tao Yuan, Wenjia Wu, Ziwei Li.

**Figure 2.** Figure 2: The Combined Nonzero Pattern of the LU Factors [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗

**Figure 3.** Figure 3: Illustration of (a)sampling a path from the graph; (b) sampling a triplet [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗

**Figure 4.** Figure 4: Learning Curves of the Proposed Method with the MultigridSAGE back [PITH_FULL_IMAGE:figures/full_fig_p013_4.png] view at source ↗

read the original abstract

Rearranging the rows or columns of a sparse matrix using an appropriate ordering can significantly reduce fill-ins, i.e., new nonzeros introduced during matrix factorization, decreasing memory usage and runtime. However, finding an ordering that minimizes fill-ins is NP-complete. Existing approaches, including graph-theoretic and deep learning methods, rely on surrogate objectives without theoretical guarantees. The Fill-Path Theorem reveals a direct and intrinsic relationship between fill-in generation and the sparse structure of the matrix as path triplet inequalities. Here we first employ a multigrid graph network to capture structural information for each vertex. We then derive a triplet sampling strategy based on inequalities. Finally, we introduce an end-max chain loss function to reduce the number of triplets whose predicted scores satisfy these inequalities. Experimental evaluations on the publicly available SuiteSparse matrix collection demonstrate the superiority of the proposed method in terms of both fill-in reduction and speedup in LU factorization time.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper introduces a theorem-guided self-supervised pipeline for sparse matrix reordering that reports fill-in and LU speed gains on SuiteSparse, but the direct link from the loss to actual minimization stays lightly checked.

read the letter

The main thing here is a self-supervised method for reordering sparse matrices to cut fill-ins during LU factorization. It uses a multigrid graph network to extract vertex features, samples triplets from the Fill-Path Theorem inequalities, and trains with an end-max chain loss to reduce violations. On the SuiteSparse collection it shows lower fill-in counts and faster factorization times than earlier methods. What stands out as new is pulling the sampling strategy and loss directly from the theorem rather than generic surrogates. The multigrid network fits the hierarchical nature of sparse graphs, and evaluating on a public benchmark set is a clear plus for anyone wanting to reproduce or extend the work. The practical focus on runtime and memory metrics also aligns with needs in scientific computing. The softer spots are in the validation. Results are presented without error bars, ablations, or checks on how sensitive outcomes are to the sampling or loss details. The central assumption—that lowering violations of those theorem-derived inequalities produces orderings with genuinely fewer fill-ins—is used to justify the design but is not tested head-on, for example by comparing against known optimal orderings on small matrices. This makes it harder to credit the gains specifically to the theorem components versus the network or training procedure. Readers working on numerical linear algebra or ML for sparse solvers would find the most use in the method description and benchmark numbers. Someone looking for graph-based ideas applied to matrix problems could take pieces from it. The distinct combination and results on standard data are enough to justify a serious referee, even though more targeted experiments would strengthen the case. I recommend sending it to peer review rather than a desk reject.

Referee Report

2 major / 2 minor

Summary. The paper proposes a self-supervised learning framework for sparse matrix reordering to minimize fill-ins during LU factorization. It uses a multigrid graph network to encode vertex structural information, derives a triplet sampling strategy from the Fill-Path Theorem's path triplet inequalities, and introduces an end-max chain loss to penalize violations of those inequalities. Experiments on the SuiteSparse collection report superior fill-in reduction and LU factorization speedup relative to prior methods.

Significance. If the link between the theorem-derived loss and actual fill-in minimization holds, the work would provide a rare theoretically grounded surrogate for an NP-complete ordering problem, advancing the intersection of graph neural networks and numerical linear algebra. The self-supervised formulation avoids the need for optimal-ordering labels, which is a practical strength; however, the current evidence does not yet isolate the contribution of the Fill-Path components from the network architecture itself.

major comments (2)

[Derivation of triplet sampling and end-max chain loss (around the loss definition and sampling strategy)] The central claim that reducing violations of the Fill-Path Theorem inequalities produces orderings that minimize fill-ins is load-bearing for the entire pipeline, yet the manuscript provides no direct verification (e.g., comparison against known optimal orderings or lower bounds on fill-in) that the learned orderings approach the true minimum rather than a heuristic improvement.
[Experimental evaluations section] Experimental results claim superiority on SuiteSparse without error bars, ablation studies isolating the end-max chain loss versus the multigrid network alone, or sensitivity analysis on post-hoc hyperparameter choices; this leaves open whether the reported gains in fill-in reduction and LU speedup are robust or attributable to the theorem-derived components.

minor comments (2)

[Method section] The mathematical definition of the end-max chain loss should be stated explicitly with all variables and the precise form of the penalty term.
[Figures in experimental section] Figure captions and axis labels for fill-in and runtime plots should include the exact baseline methods and matrix dimensions for reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive feedback on our manuscript. We believe the comments will help improve the clarity and rigor of our work, particularly regarding the theoretical grounding and experimental validation. Below we provide point-by-point responses to the major comments.

read point-by-point responses

Referee: [Derivation of triplet sampling and end-max chain loss (around the loss definition and sampling strategy)] The central claim that reducing violations of the Fill-Path Theorem inequalities produces orderings that minimize fill-ins is load-bearing for the entire pipeline, yet the manuscript provides no direct verification (e.g., comparison against known optimal orderings or lower bounds on fill-in) that the learned orderings approach the true minimum rather than a heuristic improvement.

Authors: We agree that direct empirical verification against optimal orderings would strengthen the central claim. However, the minimum fill-in problem is NP-complete, making computation of optimal orderings infeasible for the majority of SuiteSparse matrices used in our experiments. The Fill-Path Theorem provides necessary conditions for fill-in generation, and our self-supervised loss is designed to minimize violations of these conditions as a surrogate. We demonstrate through experiments that this leads to reduced fill-ins and faster LU factorization compared to existing methods. To address this, we will add a new section or appendix with experiments on small matrices (e.g., from the Florida Sparse Matrix Collection with known optimal orderings or computable via branch-and-bound) to show that our method approaches optimal fill-in reduction where possible. We will also include a more detailed derivation and discussion of how the loss relates to fill-in minimization. revision: yes
Referee: [Experimental evaluations section] Experimental results claim superiority on SuiteSparse without error bars, ablation studies isolating the end-max chain loss versus the multigrid network alone, or sensitivity analysis on post-hoc hyperparameter choices; this leaves open whether the reported gains in fill-in reduction and LU speedup are robust or attributable to the theorem-derived components.

Authors: We acknowledge the lack of error bars, ablations, and sensitivity analysis in the current experimental section. The reported results are averages over the SuiteSparse collection, but to demonstrate robustness and isolate the contributions of the Fill-Path Theorem-derived components (triplet sampling and end-max chain loss), we will conduct additional experiments. Specifically, we will include: (1) error bars from multiple independent runs with different random seeds, (2) ablation studies comparing the full model against variants without the end-max chain loss or with random triplet sampling, and (3) sensitivity analysis on key hyperparameters such as the number of triplets sampled and the multigrid levels. These will be incorporated into the revised manuscript to better attribute the performance gains. revision: yes

Circularity Check

0 steps flagged

No significant circularity: external theorem grounds surrogate loss with independent empirical validation

full rationale

The derivation begins from the Fill-Path Theorem (presented as revealing an intrinsic relationship between fill-in and path triplet inequalities), uses it to define triplet sampling and the end-max chain loss, then trains a multigrid graph network to minimize violations of those inequalities. This is a standard surrogate-objective design rather than a reduction by construction. Reported gains are measured on the external, public SuiteSparse collection via fill-in counts and LU factorization times, with no evidence that the central claims or performance metrics are equivalent to the fitted parameters or to a self-citation chain. The method is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 1 invented entities

The central claim rests on the Fill-Path Theorem as the source of inequalities, the assumption that a multigrid graph network can embed the necessary structural information, and the introduction of a new loss function whose effectiveness is demonstrated only empirically.

free parameters (1)

Graph network weights and hyperparameters
Trained end-to-end via the end-max chain loss on sampled triplets.

axioms (1)

domain assumption Fill-Path Theorem reveals a direct and intrinsic relationship between fill-in generation and the sparse structure of the matrix as path triplet inequalities
Invoked to justify the triplet sampling strategy and loss design.

invented entities (1)

End-max chain loss function no independent evidence
purpose: Reduce the number of triplets whose predicted scores satisfy the path inequalities
New loss introduced to train the ordering model in a self-supervised manner.

pith-pipeline@v0.9.0 · 5688 in / 1364 out tokens · 46764 ms · 2026-05-20T14:03:21.745997+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

The Fill-Path Theorem reveals a direct and intrinsic relationship between fill-in generation and the sparse structure of the matrix as path triplet inequalities... we construct an end-max chain loss function to reduce the number of triplets whose predicted scores satisfy these inequalities.
IndisputableMonolith/Foundation/ArithmeticFromLogic.lean embed_injective unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

a fill-in occurs at position (i, j) ... if and only if there exists a path i=v0 → v1 → ⋯ → vk = j such that all intermediate vertices satisfy vm < min(i, j)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

28 extracted references · 28 canonical work pages

[1]

Springer, Berlin/Heidelberg (1971)

Wilkinson, J.H., Reinsch, C.: Handbook for Automatic Computation: Volume II: Linear Algebra. Springer, Berlin/Heidelberg (1971)

work page 1971
[2]

Claren- don Press, Oxford (1986)

Duff, I.S., Erisman, A.M., Reid, J.K.: Direct Methods for Sparse Matrices. Claren- don Press, Oxford (1986)

work page 1986
[3]

SIAM Journal on Algebraic Discrete Methods2(1), 77–79 (1981)

Yannakakis, M.: Computing the Minimum Fill-In is NP-Complete. SIAM Journal on Algebraic Discrete Methods2(1), 77–79 (1981). https://doi.org/10.1137/0602010

work page doi:10.1137/0602010 1981
[4]

SIAM (2003)

Saad, Y.: Iterative Methods for Sparse Linear Systems. SIAM (2003)

work page 2003
[5]

In: Proceedings of the 1969 24th National Conference, pp

Cuthill, E., McKee, J.: Reducing the Bandwidth of Sparse Symmetric Matrices. In: Proceedings of the 1969 24th National Conference, pp. 157–172. ACM (1969)

work page 1969
[6]

George, A.: Computer Implementation of the Finite Element Method. Ph.D. thesis, Stanford University (1971)

work page 1971
[7]

Management Science3(3), 255–269 (1957)

Markowitz, H.M.: The Elimination Form of the Inverse and Its Application to Linear Programming. Management Science3(3), 255–269 (1957)

work page 1957
[8]

In: Graph Theory and Computing, pp

Rose, D.J.: A Graph-Theoretic Study of the Numerical Solution of Sparse Positive Definite Systems of Linear Equations. In: Graph Theory and Computing, pp. 183–

work page
[9]

SIAM Journal on Scientific Computing20(1), 359–392 (1998)

Karypis, G., Kumar, V.: A Fast and High Quality Multilevel Scheme for Parti- tioning Irregular Graphs. SIAM Journal on Scientific Computing20(1), 359–392 (1998)

work page 1998
[10]

SIAM Journal on Numerical Analysis10(2), 345–363 (1973)

George, J.A.: Nested Dissection of a Regular Finite Element Mesh. SIAM Journal on Numerical Analysis10(2), 345–363 (1973)

work page 1973
[11]

ACM Transactions on Mathematical Software (1985) 16 Authors Suppressed Due to Excessive Length

Liu, J.W.H.: Modification of the Minimum-Degree Algorithm by Multiple Elimi- nation. ACM Transactions on Mathematical Software (1985) 16 Authors Suppressed Due to Excessive Length

work page 1985
[12]

SIAM Journal on Matrix Analysis and Applications17(4), 886–905 (1996)

Amestoy, P.R., Davis, T.A., Duff, I.S.: An Approximate Minimum Degree Ordering Algorithm. SIAM Journal on Matrix Analysis and Applications17(4), 886–905 (1996)

work page 1996
[13]

ACM Transactions on Mathematical Software 30(3), 377–380 (2004)

Davis, T.A., Gilbert, J.R., Larimore, S.I., Ng, E.G.: A Column Approximate Min- imum Degree Ordering Algorithm. ACM Transactions on Mathematical Software 30(3), 377–380 (2004)

work page 2004
[14]

Journal of Machine Learning Research (2022)

Gatti, A., Hu, Z., Smidt, T., Ghysels, P.: Graph Partitioning and Sparse Matrix Ordering Using Reinforcement Learning and Graph Neural Networks. Journal of Machine Learning Research (2022)

work page 2022
[15]

IEEE Access (2023)

Booth, J.D., Bolet, G.S.: Neural Acceleration of Graph Based Utility Functions for Sparse Matrices. IEEE Access (2023)

work page 2023
[16]

In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases

Dasgupta, A., Kumar, P.: Alpha Elimination: Using Deep Reinforcement Learn- ing to Reduce Fill-In During Sparse Matrix Decomposition. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer (2023)

work page 2023
[17]

ACM Transactions on Mathematical Software38(1), 1–25 (2011)

Davis, T.A., Hu, Y.: The University of Florida Sparse Matrix Collection. ACM Transactions on Mathematical Software38(1), 1–25 (2011)

work page 2011
[18]

In: Advances in Neural Information Processing Systems, pp

Hamilton, W., Ying, Z., Leskovec, J.: Inductive Representation Learning on Large Graphs. In: Advances in Neural Information Processing Systems, pp. 1024–1034 (2017)

work page 2017
[19]

Technical report, LaBRI, Université Bordeaux, Bordeaux, France (2020)

Pellegrini, F., Chevalier, C.: SCOTCH: Static Mapping, Graph, Mesh and Hyper- graph Partitioning, and Parallel and Sequential Sparse Matrix Ordering Package. Technical report, LaBRI, Université Bordeaux, Bordeaux, France (2020)

work page 2020
[20]

Technical report, University of Minnesota, Minneapolis, MN, USA (1998)

Karypis, G., Kumar, V.: METIS, a Software Package for Partitioning Unstructured Graphs, Partitioning Meshes, and Computing Fill-Reduced Orderings of Sparse Matrices. Technical report, University of Minnesota, Minneapolis, MN, USA (1998)

work page 1998
[21]

ACM Transactions on Mathematical Software31(3), 302–325 (2005)

Li, X.S.: An Overview of SuperLU: Algorithms, Implementation, and User Inter- face. ACM Transactions on Mathematical Software31(3), 302–325 (2005)

work page 2005
[22]

In: Proceedings of the 2000 International Workshop on Applied Parallel Computing

Amestoy, P.R., Duff, I.S., L’Excellent, J.Y., Koster, J.: MUMPS: A General Pur- pose Distributed Memory Sparse Solver. In: Proceedings of the 2000 International Workshop on Applied Parallel Computing. Springer, Berlin/Heidelberg (2000)

work page 2000
[23]

arXiv preprint arXiv:2203.06944 (2022)

Grementieri, L., Galeone, P.: Towards Neural Sparse Linear Solvers. arXiv preprint arXiv:2203.06944 (2022). https://arxiv.org/abs/2203.06944

work page arXiv 2022
[24]

arXiv preprint arXiv:2310.06630 (2023)

Zou, H., Xu, X., Zhang, C.-S.: A Survey on Intelligent Iterative Methods for Solv- ing Sparse Linear Algebraic Equations. arXiv preprint arXiv:2310.06630 (2023). https://arxiv.org/abs/2310.06630

work page arXiv 2023
[25]

SIAM Journal on Computing5(2), 266–283 (1976)

Rose, D.J., Tarjan, R.E., Lueker, G.S.: Algorithmic Aspects of Ver- tex Elimination. SIAM Journal on Computing5(2), 266–283 (1976). https://doi.org/10.1137/0205021

work page doi:10.1137/0205021 1976
[26]

T., Pothen, A., Simon, H

S. T. Barnard, A. Pothen, and H. D. Simon, “A spectral algorithm for en- velope reduction of sparse matrices,” inProceedings of Supercomputing ’93: The 1993 ACM/IEEE Conference on Supercomputing, 1993, pp. 493–502. doi: 10.1145/169627.169790

work page doi:10.1145/169627.169790 1993
[27]

G., Ghysels, P.: Deep Learning and Spectral Embedding for Graph Partitioning

Gatti, A., Hu, Z., Smidt, T., Ng, E. G., Ghysels, P.: Deep Learning and Spectral Embedding for Graph Partitioning. In: Proceedings of the 2021 SIAM Conference on Computational Science and Engineering (CSE21), pp. 1–12. SIAM, Philadel- phia, PA, USA (2021). https://doi.org/10.1137/1.9781611977141.3

work page doi:10.1137/1.9781611977141.3 2021
[28]

Bridging the Gap between Sparse Matrix Reordering and Factorization: A Deep Learning Framework for Fill-in Reduction,

Z. Li, T. Yuan, S. Niu, and H. Li, “Bridging the Gap between Sparse Matrix Reordering and Factorization: A Deep Learning Framework for Fill-in Reduction,” inDatabase Systems for Advanced Applications (DASF AA 2025), Springer, 2025

work page 2025

[1] [1]

Springer, Berlin/Heidelberg (1971)

Wilkinson, J.H., Reinsch, C.: Handbook for Automatic Computation: Volume II: Linear Algebra. Springer, Berlin/Heidelberg (1971)

work page 1971

[2] [2]

Claren- don Press, Oxford (1986)

Duff, I.S., Erisman, A.M., Reid, J.K.: Direct Methods for Sparse Matrices. Claren- don Press, Oxford (1986)

work page 1986

[3] [3]

SIAM Journal on Algebraic Discrete Methods2(1), 77–79 (1981)

Yannakakis, M.: Computing the Minimum Fill-In is NP-Complete. SIAM Journal on Algebraic Discrete Methods2(1), 77–79 (1981). https://doi.org/10.1137/0602010

work page doi:10.1137/0602010 1981

[4] [4]

SIAM (2003)

Saad, Y.: Iterative Methods for Sparse Linear Systems. SIAM (2003)

work page 2003

[5] [5]

In: Proceedings of the 1969 24th National Conference, pp

Cuthill, E., McKee, J.: Reducing the Bandwidth of Sparse Symmetric Matrices. In: Proceedings of the 1969 24th National Conference, pp. 157–172. ACM (1969)

work page 1969

[6] [6]

George, A.: Computer Implementation of the Finite Element Method. Ph.D. thesis, Stanford University (1971)

work page 1971

[7] [7]

Management Science3(3), 255–269 (1957)

Markowitz, H.M.: The Elimination Form of the Inverse and Its Application to Linear Programming. Management Science3(3), 255–269 (1957)

work page 1957

[8] [8]

In: Graph Theory and Computing, pp

Rose, D.J.: A Graph-Theoretic Study of the Numerical Solution of Sparse Positive Definite Systems of Linear Equations. In: Graph Theory and Computing, pp. 183–

work page

[9] [9]

SIAM Journal on Scientific Computing20(1), 359–392 (1998)

Karypis, G., Kumar, V.: A Fast and High Quality Multilevel Scheme for Parti- tioning Irregular Graphs. SIAM Journal on Scientific Computing20(1), 359–392 (1998)

work page 1998

[10] [10]

SIAM Journal on Numerical Analysis10(2), 345–363 (1973)

George, J.A.: Nested Dissection of a Regular Finite Element Mesh. SIAM Journal on Numerical Analysis10(2), 345–363 (1973)

work page 1973

[11] [11]

ACM Transactions on Mathematical Software (1985) 16 Authors Suppressed Due to Excessive Length

Liu, J.W.H.: Modification of the Minimum-Degree Algorithm by Multiple Elimi- nation. ACM Transactions on Mathematical Software (1985) 16 Authors Suppressed Due to Excessive Length

work page 1985

[12] [12]

SIAM Journal on Matrix Analysis and Applications17(4), 886–905 (1996)

Amestoy, P.R., Davis, T.A., Duff, I.S.: An Approximate Minimum Degree Ordering Algorithm. SIAM Journal on Matrix Analysis and Applications17(4), 886–905 (1996)

work page 1996

[13] [13]

ACM Transactions on Mathematical Software 30(3), 377–380 (2004)

Davis, T.A., Gilbert, J.R., Larimore, S.I., Ng, E.G.: A Column Approximate Min- imum Degree Ordering Algorithm. ACM Transactions on Mathematical Software 30(3), 377–380 (2004)

work page 2004

[14] [14]

Journal of Machine Learning Research (2022)

Gatti, A., Hu, Z., Smidt, T., Ghysels, P.: Graph Partitioning and Sparse Matrix Ordering Using Reinforcement Learning and Graph Neural Networks. Journal of Machine Learning Research (2022)

work page 2022

[15] [15]

IEEE Access (2023)

Booth, J.D., Bolet, G.S.: Neural Acceleration of Graph Based Utility Functions for Sparse Matrices. IEEE Access (2023)

work page 2023

[16] [16]

In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases

Dasgupta, A., Kumar, P.: Alpha Elimination: Using Deep Reinforcement Learn- ing to Reduce Fill-In During Sparse Matrix Decomposition. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer (2023)

work page 2023

[17] [17]

ACM Transactions on Mathematical Software38(1), 1–25 (2011)

Davis, T.A., Hu, Y.: The University of Florida Sparse Matrix Collection. ACM Transactions on Mathematical Software38(1), 1–25 (2011)

work page 2011

[18] [18]

In: Advances in Neural Information Processing Systems, pp

Hamilton, W., Ying, Z., Leskovec, J.: Inductive Representation Learning on Large Graphs. In: Advances in Neural Information Processing Systems, pp. 1024–1034 (2017)

work page 2017

[19] [19]

Technical report, LaBRI, Université Bordeaux, Bordeaux, France (2020)

Pellegrini, F., Chevalier, C.: SCOTCH: Static Mapping, Graph, Mesh and Hyper- graph Partitioning, and Parallel and Sequential Sparse Matrix Ordering Package. Technical report, LaBRI, Université Bordeaux, Bordeaux, France (2020)

work page 2020

[20] [20]

Technical report, University of Minnesota, Minneapolis, MN, USA (1998)

Karypis, G., Kumar, V.: METIS, a Software Package for Partitioning Unstructured Graphs, Partitioning Meshes, and Computing Fill-Reduced Orderings of Sparse Matrices. Technical report, University of Minnesota, Minneapolis, MN, USA (1998)

work page 1998

[21] [21]

ACM Transactions on Mathematical Software31(3), 302–325 (2005)

Li, X.S.: An Overview of SuperLU: Algorithms, Implementation, and User Inter- face. ACM Transactions on Mathematical Software31(3), 302–325 (2005)

work page 2005

[22] [22]

In: Proceedings of the 2000 International Workshop on Applied Parallel Computing

Amestoy, P.R., Duff, I.S., L’Excellent, J.Y., Koster, J.: MUMPS: A General Pur- pose Distributed Memory Sparse Solver. In: Proceedings of the 2000 International Workshop on Applied Parallel Computing. Springer, Berlin/Heidelberg (2000)

work page 2000

[23] [23]

arXiv preprint arXiv:2203.06944 (2022)

Grementieri, L., Galeone, P.: Towards Neural Sparse Linear Solvers. arXiv preprint arXiv:2203.06944 (2022). https://arxiv.org/abs/2203.06944

work page arXiv 2022

[24] [24]

arXiv preprint arXiv:2310.06630 (2023)

Zou, H., Xu, X., Zhang, C.-S.: A Survey on Intelligent Iterative Methods for Solv- ing Sparse Linear Algebraic Equations. arXiv preprint arXiv:2310.06630 (2023). https://arxiv.org/abs/2310.06630

work page arXiv 2023

[25] [25]

SIAM Journal on Computing5(2), 266–283 (1976)

Rose, D.J., Tarjan, R.E., Lueker, G.S.: Algorithmic Aspects of Ver- tex Elimination. SIAM Journal on Computing5(2), 266–283 (1976). https://doi.org/10.1137/0205021

work page doi:10.1137/0205021 1976

[26] [26]

T., Pothen, A., Simon, H

S. T. Barnard, A. Pothen, and H. D. Simon, “A spectral algorithm for en- velope reduction of sparse matrices,” inProceedings of Supercomputing ’93: The 1993 ACM/IEEE Conference on Supercomputing, 1993, pp. 493–502. doi: 10.1145/169627.169790

work page doi:10.1145/169627.169790 1993

[27] [27]

G., Ghysels, P.: Deep Learning and Spectral Embedding for Graph Partitioning

Gatti, A., Hu, Z., Smidt, T., Ng, E. G., Ghysels, P.: Deep Learning and Spectral Embedding for Graph Partitioning. In: Proceedings of the 2021 SIAM Conference on Computational Science and Engineering (CSE21), pp. 1–12. SIAM, Philadel- phia, PA, USA (2021). https://doi.org/10.1137/1.9781611977141.3

work page doi:10.1137/1.9781611977141.3 2021

[28] [28]

Bridging the Gap between Sparse Matrix Reordering and Factorization: A Deep Learning Framework for Fill-in Reduction,

Z. Li, T. Yuan, S. Niu, and H. Li, “Bridging the Gap between Sparse Matrix Reordering and Factorization: A Deep Learning Framework for Fill-in Reduction,” inDatabase Systems for Advanced Applications (DASF AA 2025), Springer, 2025

work page 2025