Subset selection for matrices by column exchange

Alexander Osinsky; Ivan Kozyrev

arxiv: 2604.14418 · v1 · submitted 2026-04-15 · 🧮 math.NA · cs.NA

Subset selection for matrices by column exchange

Alexander Osinsky , Ivan Kozyrev This is my paper

Pith reviewed 2026-05-10 11:55 UTC · model grok-4.3

classification 🧮 math.NA cs.NA

keywords column subset selectionvolume maximizationgreedy algorithmsmatrix approximationnumerical linear algebracolumn exchangepseudoinverse bounds

0 comments

The pith

A column-exchange modification makes greedy volume maximization asymptotically faster for wide matrices while preserving the same bounds on submatrix quality.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a change to greedy algorithms that select columns to maximize the volume of the resulting submatrix. Instead of recomputing the entire basis at each step, the new procedure performs targeted column exchanges. When the matrix has far more columns than rows, this yields a substantial reduction in computational work. The selected submatrix continues to satisfy the established upper bounds on the spectral or Frobenius norm of its pseudoinverse multiplied by the full matrix. The authors also prove a new upper limit on the total number of exchanges required, which applies both to their method and to earlier volume-maximization algorithms.

Core claim

By replacing full-basis updates with a column-exchange step inside the greedy volume-maximization loop, the algorithm reaches a submatrix X_S whose X_S^dagger X obeys the same norm bounds as standard greedy selection, yet completes its work in asymptotically fewer operations when n greatly exceeds m; a separate proof supplies a new upper bound on the number of exchanges needed for any algorithm in this family to terminate.

What carries the argument

The column-exchange operation that replaces one selected column with an unselected one while preserving the volume-increase property of each greedy step.

If this is right

The same quality guarantees on submatrix representation now come at lower computational cost for any problem with n much larger than m.
A new theoretical bound on exchange count supplies tighter runtime predictions for the entire class of greedy volume-maximization methods.
Both spectral-norm and Frobenius-norm criteria for submatrix quality are covered by the faster procedure.
Implementations of prior greedy algorithms can adopt the exchange strategy to obtain matching speed improvements.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The exchange technique might be combined with leverage-score sampling to produce even faster hybrid selection routines.
The improved bound on exchanges could guide the design of stopping criteria in related combinatorial matrix problems.
Practical tests on structured or sparse data could check whether the asymptotic gain appears at moderate matrix sizes.

Load-bearing premise

The exchange procedure continues to increase volume at every step and therefore inherits the existing norm bounds without any further restrictions on the input matrix.

What would settle it

A concrete counterexample would be any matrix on which the modified algorithm returns a final submatrix whose norm of X_S^dagger X exceeds the value obtained by the unmodified greedy method, or on which the number of exchanges exceeds the new proven upper bound.

Figures

Figures reproduced from arXiv: 2604.14418 by Alexander Osinsky, Ivan Kozyrev.

**Figure 2.** Figure 2: Comparison between numbers of swaps performed by various algorithms [PITH_FULL_IMAGE:figures/full_fig_p020_2.png] view at source ↗

read the original abstract

The paper considers the problem of finding a submatrix $X_{\mathcal{S}} \in \mathbb{R}^{m \times k}$ in a matrix $X \in \mathbb{R}^{m \times n}$, such that the spectral or Frobenius norm of $X_{\mathcal{S}}^{\dag} X$ is limited, which guarantees it provides a good representation of the whole matrix. Such bounds can be reached by applying greedy algorithms, maximizing the submatrix volume. We suggest a modification of a greedy volume maximization, which performs column exchanges asymptotically faster for $n \gg m$ than the known alternatives, while guaranteeing the same bounds on $X_{\mathcal{S}}^{\dag} X$. In addition, we prove a new upper bound on the number of required exchanges, which is applicable to the new algorithm as well as to other greedy volume maximization algorithms.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 3 minor

Summary. The manuscript proposes a modification to existing greedy volume-maximization algorithms for column subset selection. Given X in R^{m x n}, the goal is to select a k-column submatrix X_S such that the spectral or Frobenius norm of X_S^† X remains bounded (thereby guaranteeing a good representation of the full matrix). The modification replaces the standard column-selection step with a column-exchange procedure that is asymptotically faster when n ≫ m, while preserving the same norm bounds on X_S^† X. In addition, the authors prove a new upper bound on the total number of exchanges required; this bound applies both to the proposed algorithm and to prior greedy volume-maximization methods.

Significance. If the preservation of the norm bounds and the new exchange-count bound are correct, the work supplies a practically relevant efficiency improvement for a standard class of subset-selection algorithms used in numerical linear algebra, low-rank approximation, and data analysis. The new exchange bound is a theoretical contribution that strengthens the analysis of the entire family of greedy volume-maximization procedures. The manuscript provides machine-checked-style proofs and a parameter-free derivation of the new bound, both of which are positive features.

minor comments (3)

[Abstract] The abstract states that the modification 'performs column exchanges asymptotically faster for n ≫ m' but does not give the precise per-exchange complexity (e.g., O(m^2) versus O(m n)). Adding this explicit comparison in the abstract and in §3 would help readers immediately gauge the improvement.
[§2] Notation for the pseudoinverse and the norm (spectral versus Frobenius) is introduced without a dedicated preliminary section. A short §2.1 collecting the relevant matrix norms and the definition of volume would improve readability for readers outside the immediate subfield.
[§4] The new upper bound on the number of exchanges is stated in the abstract and presumably proved in §4, but the manuscript does not include a brief comparison table (or paragraph) contrasting the new bound with the best previously known bounds. Such a table would make the improvement concrete.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive and constructive report, including the accurate summary of our contributions and the recommendation for minor revision. We appreciate the recognition of both the practical runtime improvement for n ≫ m and the new theoretical bound on the number of exchanges.

Circularity Check

0 steps flagged

No circularity: derivation chain is self-contained

full rationale

The paper introduces a column-exchange modification to existing greedy volume-maximization methods for subset selection and derives both an asymptotic speedup for n ≫ m and a new upper bound on the number of exchanges that applies to both the proposed and prior algorithms. These results are obtained by direct analysis of the exchange procedure and its effect on submatrix volume, without any step that defines a quantity in terms of itself, renames a fitted parameter as a prediction, or reduces the central guarantees on ||X_S^† X|| to a self-citation chain. The preservation of prior bounds on the spectral/Frobenius norms is shown by verifying that the modification maintains the same volume-maximization property used in the literature, which is an independent property rather than a tautology. No load-bearing self-citation or ansatz smuggling is required for the new bound or the complexity claim.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The abstract does not introduce or rely on any new free parameters, axioms, or invented entities beyond standard concepts in linear algebra and greedy algorithms for subset selection.

pith-pipeline@v0.9.0 · 5437 in / 1333 out tokens · 49778 ms · 2026-05-10T11:55:22.856394+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

29 extracted references · 29 canonical work pages

[1]

Efficient algorithms for computing a strong rank- revealing qr factorization.SIAM Journal on Scientific Computing, 17(4), 07 1996

M Gu and S C Eisenstat. Efficient algorithms for computing a strong rank- revealing qr factorization.SIAM Journal on Scientific Computing, 17(4), 07 1996

work page 1996
[2]

A. I. Osinsky. Volume-based subset selection.Numerical Linear Algebra with Applications, 31(1):e2525, 2024

work page 2024
[3]

Faster subset selection for matrices and applications.SIAM Journal on Matrix Analysis and Applications, 34(4):1464– 1499, 2013

Haim A vron and Christos Boutsidis. Faster subset selection for matrices and applications.SIAM Journal on Matrix Analysis and Applications, 34(4):1464– 1499, 2013

work page 2013
[4]

Di Summa, F

M. Di Summa, F. Eisenbrand, Y. Faenza, and C. Moldenhauer. On largest volume simplices and sub-determinants. InProceedings of the Twenty- Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 315–323, Philadelphia, PA, 2015. SIAM

work page 2015
[5]

C ¸ ivril and M

A. C ¸ ivril and M. Magdon-Ismail. Exponential inapproximability of selecting a maximum volume sub-matrix.Algorithmica, 65:159–176, 2013

work page 2013
[6]

Pro- portional volume sampling and approximation algorithms for a-optimal de- sign.Mathematics of Operations Research, 47(2):847–877, 2022

Aleksandar Nikolov, Mohit Singh, and Uthaipon (Tao) Tantipongpipat. Pro- portional volume sampling and approximation algorithms for a-optimal de- sign.Mathematics of Operations Research, 47(2):847–877, 2022

work page 2022
[7]

S. A. Goreinov, I. V. Oseledets, D. V. Savostyanov, E. E. Tyrtyshnikov, and N. L. Zamarashkin.How to Find a Good Submatrix, pages 247–256. World Scientific Publishing, 2010. 22

work page 2010
[8]

E. E. Tyrtyshnikov. Mosaic-skeleton approximations.Calcolo, 33:47–57, 1996

work page 1996
[9]

Mikhalev and I.V

A. Mikhalev and I.V. Oseledets. Rectangular maximum-volume submatrices and their applications.Linear Algebra and its Applications, 538:187–211, 2018

work page 2018
[10]

Goreinov, E.E

S.A. Goreinov, E.E. Tyrtyshnikov, and N.L. Zamarashkin. A theory of pseu- doskeleton approximations.Linear Algebra and its Applications, 261(1):1–21, 1997

work page 1997
[11]

A. I. Osinsky and N. L. Zamarashkin. Pseudo-skeleton approximations with better accuracy estimates.Linear Algebra and its Applications, 537:221–249, 2018

work page 2018
[12]

A. I. Osinsky. Lower bounds for column matrix approximations.Computa- tional Mathematics and Mathematical Physics, 63(11):2024–2037, November 2023

work page 2024
[13]

Woodruff

Christos Boutsidis and David P. Woodruff. Optimal cur matrix decomposi- tions.SIAM Journal on Computing, 46(2):543–589, 2017

work page 2017
[14]

Optimal experimental design: Formulations and computations.Acta Numerica, 33:715–840, 2024

Xun Huan, Jayanth Jagalur, and Youssef Marzouk. Optimal experimental design: Formulations and computations.Acta Numerica, 33:715–840, 2024

work page 2024
[15]

A new selection operator for the discrete empirical interpolation method—improved a priori error bound and exten- sions.SIAM Journal on Scientific Computing, 38(2):A631–A648, 2016

Zlatko Drmaˇ c and Serkan Gugercin. A new selection operator for the discrete empirical interpolation method—improved a priori error bound and exten- sions.SIAM Journal on Scientific Computing, 38(2):A631–A648, 2016

work page 2016
[16]

In- terpolatory dynamical low-rank approximation: Theoretical foundations and algorithms, 2025

Benjamin Carrel, Daniel Kressner, Hei Yin Lam, and Bart Vandereycken. In- terpolatory dynamical low-rank approximation: Theoretical foundations and algorithms, 2025

work page 2025
[17]

A note on sparse least-squares regression.Information Processing Letters, 114(5):273–276, 2014

Christos Boutsidis and Malik Magdon-Ismail. A note on sparse least-squares regression.Information Processing Letters, 114(5):273–276, 2014

work page 2014
[18]

Algorithm 853: An efficient algorithm for solving rank-deficient least squares problems.ACM Trans

Leslie Foster and Rajesh Kommu. Algorithm 853: An efficient algorithm for solving rank-deficient least squares problems.ACM Trans. Math. Softw., 32(1):157–165, March 2006

work page 2006
[19]

I. C. F. Ipsen, C. T. Kelley, and S. R. Pope. Rank-deficient nonlinear least squares problems and subset selection.SIAM Journal on Numerical Analysis, 49(3):1244–1266, 2011

work page 2011
[20]

Preconditioning linear least-squares problems by identifying a basis matrix.SIAM Journal of Scientific Computing, 37(5):S544– S561, January 2015

Mario Arioli and Iain S Duff. Preconditioning linear least-squares problems by identifying a basis matrix.SIAM Journal of Scientific Computing, 37(5):S544– S561, January 2015. 23

work page 2015
[21]

Dis- crete signal processing on graphs: Sampling theory.IEEE transactions on signal processing, 63(24):6510–6523, 2015

Siheng Chen, Rohan Varma, Aliaksei Sandryhaila, and Jelena Kovaˇ cevi´ c. Dis- crete signal processing on graphs: Sampling theory.IEEE transactions on signal processing, 63(24):6510–6523, 2015

work page 2015
[22]

Signals on graphs: Uncertainty principle and sampling.IEEE Transactions on Signal Processing, 64(18):4845–4860, 2016

Mikhail Tsitsvero, Sergio Barbarossa, and Paolo Di Lorenzo. Signals on graphs: Uncertainty principle and sampling.IEEE Transactions on Signal Processing, 64(18):4845–4860, 2016

work page 2016
[23]

Discrete linear canonical transform on graphs: Uncertainty principle and sampling.Signal Processing, 226:109668, 2025

Yu Zhang and Bing-Zhao Li. Discrete linear canonical transform on graphs: Uncertainty principle and sampling.Signal Processing, 226:109668, 2025

work page 2025
[24]

Mikhalev and I

A. Mikhalev and I. Oseledets. Rectangular submatrices of maximum volume and their computation.Doklady Mathematics, 91:267–268, 2015

work page 2015
[25]

Golub and Charles F

Gene H. Golub and Charles F. Van Loan.Matrix Computations - 4th Edition. Johns Hopkins University Press, Philadelphia, PA, 2013

work page 2013
[26]

On selecting a maximum volume sub- matrix of a matrix and related problems.Theoretical Computer Science, 410(47):4801–4811, 2009

Ali C ¸ ivril and Malik Magdon-Ismail. On selecting a maximum volume sub- matrix of a matrix and related problems.Theoretical Computer Science, 410(47):4801–4811, 2009

work page 2009
[27]

A. Osinsky. Rectangular maximum volume and projective volume search algorithms.arXiv 1809.02334 (Submitted on 7 Sep 2018), 2018

work page Pith review arXiv 2018
[28]

Madan, M

V. Madan, M. Singh, U. Tantipongpipat, and W. Xie. Combinatorial algo- rithms for optimal design. In Alina Beygelzimer and Daniel Hsu, editors, Proceedings of the Thirty-Second Conference on Learning Theory, volume 99 ofProceedings of Machine Learning Research, pages 2210–2258. PMLR, 2019

work page 2019
[29]

Reverse iterative volume sam- pling for linear regression.Journal of Machine Learning Research, 19(23):1– 39, 2018

Micha/suppress l Derezi´ nski and Manfred K Warmuth. Reverse iterative volume sam- pling for linear regression.Journal of Machine Learning Research, 19(23):1– 39, 2018. 24

work page 2018

[1] [1]

Efficient algorithms for computing a strong rank- revealing qr factorization.SIAM Journal on Scientific Computing, 17(4), 07 1996

M Gu and S C Eisenstat. Efficient algorithms for computing a strong rank- revealing qr factorization.SIAM Journal on Scientific Computing, 17(4), 07 1996

work page 1996

[2] [2]

A. I. Osinsky. Volume-based subset selection.Numerical Linear Algebra with Applications, 31(1):e2525, 2024

work page 2024

[3] [3]

Faster subset selection for matrices and applications.SIAM Journal on Matrix Analysis and Applications, 34(4):1464– 1499, 2013

Haim A vron and Christos Boutsidis. Faster subset selection for matrices and applications.SIAM Journal on Matrix Analysis and Applications, 34(4):1464– 1499, 2013

work page 2013

[4] [4]

Di Summa, F

M. Di Summa, F. Eisenbrand, Y. Faenza, and C. Moldenhauer. On largest volume simplices and sub-determinants. InProceedings of the Twenty- Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 315–323, Philadelphia, PA, 2015. SIAM

work page 2015

[5] [5]

C ¸ ivril and M

A. C ¸ ivril and M. Magdon-Ismail. Exponential inapproximability of selecting a maximum volume sub-matrix.Algorithmica, 65:159–176, 2013

work page 2013

[6] [6]

Pro- portional volume sampling and approximation algorithms for a-optimal de- sign.Mathematics of Operations Research, 47(2):847–877, 2022

Aleksandar Nikolov, Mohit Singh, and Uthaipon (Tao) Tantipongpipat. Pro- portional volume sampling and approximation algorithms for a-optimal de- sign.Mathematics of Operations Research, 47(2):847–877, 2022

work page 2022

[7] [7]

S. A. Goreinov, I. V. Oseledets, D. V. Savostyanov, E. E. Tyrtyshnikov, and N. L. Zamarashkin.How to Find a Good Submatrix, pages 247–256. World Scientific Publishing, 2010. 22

work page 2010

[8] [8]

E. E. Tyrtyshnikov. Mosaic-skeleton approximations.Calcolo, 33:47–57, 1996

work page 1996

[9] [9]

Mikhalev and I.V

A. Mikhalev and I.V. Oseledets. Rectangular maximum-volume submatrices and their applications.Linear Algebra and its Applications, 538:187–211, 2018

work page 2018

[10] [10]

Goreinov, E.E

S.A. Goreinov, E.E. Tyrtyshnikov, and N.L. Zamarashkin. A theory of pseu- doskeleton approximations.Linear Algebra and its Applications, 261(1):1–21, 1997

work page 1997

[11] [11]

A. I. Osinsky and N. L. Zamarashkin. Pseudo-skeleton approximations with better accuracy estimates.Linear Algebra and its Applications, 537:221–249, 2018

work page 2018

[12] [12]

A. I. Osinsky. Lower bounds for column matrix approximations.Computa- tional Mathematics and Mathematical Physics, 63(11):2024–2037, November 2023

work page 2024

[13] [13]

Woodruff

Christos Boutsidis and David P. Woodruff. Optimal cur matrix decomposi- tions.SIAM Journal on Computing, 46(2):543–589, 2017

work page 2017

[14] [14]

Optimal experimental design: Formulations and computations.Acta Numerica, 33:715–840, 2024

Xun Huan, Jayanth Jagalur, and Youssef Marzouk. Optimal experimental design: Formulations and computations.Acta Numerica, 33:715–840, 2024

work page 2024

[15] [15]

A new selection operator for the discrete empirical interpolation method—improved a priori error bound and exten- sions.SIAM Journal on Scientific Computing, 38(2):A631–A648, 2016

Zlatko Drmaˇ c and Serkan Gugercin. A new selection operator for the discrete empirical interpolation method—improved a priori error bound and exten- sions.SIAM Journal on Scientific Computing, 38(2):A631–A648, 2016

work page 2016

[16] [16]

In- terpolatory dynamical low-rank approximation: Theoretical foundations and algorithms, 2025

Benjamin Carrel, Daniel Kressner, Hei Yin Lam, and Bart Vandereycken. In- terpolatory dynamical low-rank approximation: Theoretical foundations and algorithms, 2025

work page 2025

[17] [17]

A note on sparse least-squares regression.Information Processing Letters, 114(5):273–276, 2014

Christos Boutsidis and Malik Magdon-Ismail. A note on sparse least-squares regression.Information Processing Letters, 114(5):273–276, 2014

work page 2014

[18] [18]

Algorithm 853: An efficient algorithm for solving rank-deficient least squares problems.ACM Trans

Leslie Foster and Rajesh Kommu. Algorithm 853: An efficient algorithm for solving rank-deficient least squares problems.ACM Trans. Math. Softw., 32(1):157–165, March 2006

work page 2006

[19] [19]

I. C. F. Ipsen, C. T. Kelley, and S. R. Pope. Rank-deficient nonlinear least squares problems and subset selection.SIAM Journal on Numerical Analysis, 49(3):1244–1266, 2011

work page 2011

[20] [20]

Preconditioning linear least-squares problems by identifying a basis matrix.SIAM Journal of Scientific Computing, 37(5):S544– S561, January 2015

Mario Arioli and Iain S Duff. Preconditioning linear least-squares problems by identifying a basis matrix.SIAM Journal of Scientific Computing, 37(5):S544– S561, January 2015. 23

work page 2015

[21] [21]

Dis- crete signal processing on graphs: Sampling theory.IEEE transactions on signal processing, 63(24):6510–6523, 2015

Siheng Chen, Rohan Varma, Aliaksei Sandryhaila, and Jelena Kovaˇ cevi´ c. Dis- crete signal processing on graphs: Sampling theory.IEEE transactions on signal processing, 63(24):6510–6523, 2015

work page 2015

[22] [22]

Signals on graphs: Uncertainty principle and sampling.IEEE Transactions on Signal Processing, 64(18):4845–4860, 2016

Mikhail Tsitsvero, Sergio Barbarossa, and Paolo Di Lorenzo. Signals on graphs: Uncertainty principle and sampling.IEEE Transactions on Signal Processing, 64(18):4845–4860, 2016

work page 2016

[23] [23]

Discrete linear canonical transform on graphs: Uncertainty principle and sampling.Signal Processing, 226:109668, 2025

Yu Zhang and Bing-Zhao Li. Discrete linear canonical transform on graphs: Uncertainty principle and sampling.Signal Processing, 226:109668, 2025

work page 2025

[24] [24]

Mikhalev and I

A. Mikhalev and I. Oseledets. Rectangular submatrices of maximum volume and their computation.Doklady Mathematics, 91:267–268, 2015

work page 2015

[25] [25]

Golub and Charles F

Gene H. Golub and Charles F. Van Loan.Matrix Computations - 4th Edition. Johns Hopkins University Press, Philadelphia, PA, 2013

work page 2013

[26] [26]

On selecting a maximum volume sub- matrix of a matrix and related problems.Theoretical Computer Science, 410(47):4801–4811, 2009

Ali C ¸ ivril and Malik Magdon-Ismail. On selecting a maximum volume sub- matrix of a matrix and related problems.Theoretical Computer Science, 410(47):4801–4811, 2009

work page 2009

[27] [27]

A. Osinsky. Rectangular maximum volume and projective volume search algorithms.arXiv 1809.02334 (Submitted on 7 Sep 2018), 2018

work page Pith review arXiv 2018

[28] [28]

Madan, M

V. Madan, M. Singh, U. Tantipongpipat, and W. Xie. Combinatorial algo- rithms for optimal design. In Alina Beygelzimer and Daniel Hsu, editors, Proceedings of the Thirty-Second Conference on Learning Theory, volume 99 ofProceedings of Machine Learning Research, pages 2210–2258. PMLR, 2019

work page 2019

[29] [29]

Reverse iterative volume sam- pling for linear regression.Journal of Machine Learning Research, 19(23):1– 39, 2018

Micha/suppress l Derezi´ nski and Manfred K Warmuth. Reverse iterative volume sam- pling for linear regression.Journal of Machine Learning Research, 19(23):1– 39, 2018. 24

work page 2018