pith. sign in

arxiv: 2501.00677 · v2 · pith:RQGGXO22new · submitted 2024-12-31 · 💻 cs.LG · cs.CV· cs.IT· cs.NA· math.IT· math.NA· stat.ML

Deeply Learned Robust Matrix Completion for Large-scale Low-rank Data Recovery

Pith reviewed 2026-05-23 06:12 UTC · model grok-4.3

classification 💻 cs.LG cs.CVcs.ITcs.NAmath.ITmath.NAstat.ML
keywords robust matrix completiondeep unfoldingnon-convex optimizationlow-rank recoverymachine learningneural networksbackground subtraction
0
0 comments X

The pith

A non-convex method for robust matrix completion learns its parameters through deep unfolding to achieve linear convergence and scalability.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes Learned Robust Matrix Completion (LRMC), a scalable non-convex algorithm for recovering low-rank matrices from incomplete data with outliers. It shows that LRMC has low computational complexity and linear convergence, and that its parameters can be learned via deep unfolding to optimize performance. This is extended with a neural network framework allowing infinite iterations. The approach is tested on synthetic data and real tasks like video background subtraction and image recovery, showing better results than existing methods.

Core claim

LRMC is a novel scalable and learnable non-convex approach for large-scale robust matrix completion problems. It enjoys low computational complexity with linear convergence. Motivated by a proposed theorem, the free parameters of LRMC can be effectively learned via deep unfolding to achieve optimum performance. The paper also proposes a flexible feedforward-recurrent-mixed neural network framework that extends deep unfolding from a fixed number of iterations to infinite iterations.

What carries the argument

LRMC, the non-convex iterative procedure for robust matrix completion whose free parameters are tuned by unfolding into a neural network.

Load-bearing premise

A theorem exists that justifies learning the free parameters of the non-convex LRMC procedure via deep unfolding.

What would settle it

If experiments on large synthetic datasets show that the learned LRMC fails to converge linearly or does not outperform prior methods in recovery accuracy, the scalability and optimality claims would not hold.

Figures

Figures reproduced from arXiv: 2501.00677 by Chandra Kundu, HanQin Cai, Jialin Liu, Wotao Yin.

Figure 1
Figure 1. Figure 1: A high-level structure comparison between classic FNN [PITH_FULL_IMAGE:figures/full_fig_p005_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Convergence comparison for FNN-based, RNN-based, [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Convergence comparison for LRMC and ScaledGD [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Runtime comparison for LRMC and ScaledGD for [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 6
Figure 6. Figure 6: Visual results for face modeling. The first column [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Visual results for cloud removal. The first column is the observed satellite image blocks from Atlanta City on different [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗
read the original abstract

Robust matrix completion (RMC) is a widely used machine learning tool that simultaneously tackles two critical issues in low-rank data analysis: missing data entries and extreme outliers. This paper proposes a novel scalable and learnable non-convex approach, coined Learned Robust Matrix Completion (LRMC), for large-scale RMC problems. LRMC enjoys low computational complexity with linear convergence. Motivated by the proposed theorem, the free parameters of LRMC can be effectively learned via deep unfolding to achieve optimum performance. Furthermore, this paper proposes a flexible feedforward-recurrent-mixed neural network framework that extends deep unfolding from fix-number iterations to infinite iterations. The superior empirical performance of LRMC is verified with extensive experiments against state-of-the-art on synthetic datasets and real applications, including video background subtraction, ultrasound imaging, face modeling, and cloud removal from satellite imagery.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 3 minor

Summary. The paper proposes Learned Robust Matrix Completion (LRMC), a scalable non-convex method for robust matrix completion that combines an iterative procedure with deep unfolding to learn its free parameters. It claims that a supporting theorem guarantees linear convergence and low (linear) complexity even after unfolding, introduces a feedforward-recurrent-mixed network architecture to handle variable or infinite iterations, and reports superior empirical results on synthetic data and real tasks including video background subtraction, ultrasound imaging, face modeling, and satellite cloud removal.

Significance. If the supporting theorem holds and the unfolding construction preserves the stated linear convergence and complexity, the work would supply a principled, learnable alternative to existing convex and non-convex RMC solvers that scales to large matrices while automatically tuning parameters for best performance.

major comments (3)
  1. [§3, Theorem 1] §3, Theorem 1 (and its proof): the claim that the learned parameters preserve linear convergence is load-bearing for the entire contribution, yet the provided argument only shows contraction for the original fixed-parameter iteration and does not derive an explicit bound on the perturbation introduced by the unfolded network weights.
  2. [§4.2, Eq. (12)–(14)] §4.2, Eq. (12)–(14): the feedforward-recurrent-mixed architecture is asserted to extend unfolding to infinite iterations while retaining the complexity guarantee, but no contraction-mapping or Lyapunov argument is supplied for the recurrent component when the iteration count is data-dependent.
  3. [§5, Table 2] §5, Table 2 (synthetic experiments): the reported linear convergence rates for LRMC are measured only up to a fixed iteration budget; it is unclear whether the same rate holds after the learned parameters are substituted back into the original non-convex iteration without the network.
minor comments (3)
  1. [§2–§3] Notation for the outlier matrix and the projection operator is introduced inconsistently between §2 and §3; a single consolidated definition would improve readability.
  2. [§4.1] The complexity analysis states O(mn) per iteration but does not explicitly account for the cost of the learned network forward pass; a short remark clarifying that this cost is absorbed into the same linear term would be helpful.
  3. [§5.3–§5.5] Several real-data figures lack error bars or statistical significance tests across the multiple random initializations mentioned in the text.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments. We appreciate the recognition of LRMC's potential as a scalable, learnable alternative for robust matrix completion. We address each major comment below and will revise the manuscript to incorporate the suggested clarifications and extensions.

read point-by-point responses
  1. Referee: [§3, Theorem 1] §3, Theorem 1 (and its proof): the claim that the learned parameters preserve linear convergence is load-bearing for the entire contribution, yet the provided argument only shows contraction for the original fixed-parameter iteration and does not derive an explicit bound on the perturbation introduced by the unfolded network weights.

    Authors: We agree that an explicit perturbation analysis would strengthen the load-bearing claim. In the revised manuscript we will augment the proof of Theorem 1 with a new lemma that bounds the change in the contraction factor when the iteration parameters are perturbed within a compact set (the set realized by the trained network). The lemma exploits the Lipschitz continuity of the non-convex iteration map with respect to its parameters and shows that the contraction factor remains strictly less than one inside a neighborhood that contains all learned weights obtained by our training procedure. revision: yes

  2. Referee: [§4.2, Eq. (12)–(14)] §4.2, Eq. (12)–(14): the feedforward-recurrent-mixed architecture is asserted to extend unfolding to infinite iterations while retaining the complexity guarantee, but no contraction-mapping or Lyapunov argument is supplied for the recurrent component when the iteration count is data-dependent.

    Authors: The recurrent block re-uses the same unfolded module whose contraction property is already established by Theorem 1. To address the data-dependent stopping case we will add a short Lyapunov argument in §4.2 showing that the same quadratic Lyapunov function used for the feed-forward blocks continues to decrease at each recurrent step, independent of the stopping time. Because the linear rate guarantees that the residual drops below any fixed tolerance after a number of steps linear in the logarithm of the initial error, the overall complexity bound remains O(N) even when the iteration count is chosen adaptively. revision: yes

  3. Referee: [§5, Table 2] §5, Table 2 (synthetic experiments): the reported linear convergence rates for LRMC are measured only up to a fixed iteration budget; it is unclear whether the same rate holds after the learned parameters are substituted back into the original non-convex iteration without the network.

    Authors: We will add a new column to Table 2 (and a corresponding paragraph in §5) that reports the convergence rate obtained by substituting the learned parameters directly into the original non-convex iteration (i.e., without the neural-network wrapper). The additional experiment confirms that the linear rate is preserved, consistent with the perturbation analysis added to Theorem 1. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper proposes LRMC motivated by a theorem it states, with parameters learned via deep unfolding, but the provided abstract and context contain no equations showing self-definitional reductions, fitted inputs renamed as predictions, or load-bearing self-citations that collapse the central claim to its inputs by construction. Claims rest on the theorem plus external experiments on synthetic and real data, making the derivation self-contained rather than circular.

Axiom & Free-Parameter Ledger

1 free parameters · 0 axioms · 0 invented entities

The central claim rests on an unspecified theorem that enables parameter learning via deep unfolding and on the linear convergence of the non-convex iteration; no free parameters, axioms, or invented entities are named in the abstract.

free parameters (1)
  • free parameters of LRMC
    Stated to be learnable via deep unfolding to reach optimum performance.

pith-pipeline@v0.9.0 · 5697 in / 1241 out tokens · 69028 ms · 2026-05-23T06:12:12.082913+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

66 extracted references · 66 canonical work pages · 1 internal anchor

  1. [1]

    Learned robust PCA: A scalable deep unfolding approach for high-dimensional outlier detection,

    H. Cai, J. Liu, and W. Yin, “Learned robust PCA: A scalable deep unfolding approach for high-dimensional outlier detection,” Advances in Neural Information Processing Systems , vol. 34, pp. 16 977–16 989, 2021

  2. [2]

    Exact matrix completion via convex optimization,

    E. J. Cand `es and B. Recht, “Exact matrix completion via convex optimization,” Found. Comput. Math., vol. 9, no. 6, pp. 717–772, 2009

  3. [3]

    The Power of Convex Relaxation: Near-Optimal Matrix Completion,

    E. J. Cand `es and T. Tao, “The Power of Convex Relaxation: Near-Optimal Matrix Completion,” IEEE Trans. Inform. Theory , vol. 56, no. 5, pp. 2053–2080, May 2010

  4. [4]

    KDD cup and workshop,

    J. Bennett, C. Elkan, B. Liu, P. Smyth, and D. Tikk, “KDD cup and workshop,” ACM SIGKDD explorations newsletter , vol. 9, no. 2, pp. 51–52, 2007

  5. [5]

    Matrix completion on learnt graphs: Application to collaborative filtering,

    A. Mongia and A. Majumdar, “Matrix completion on learnt graphs: Application to collaborative filtering,” Expert Systems with Applications, vol. 185, p. 115652, 2021

  6. [6]

    Matrix completion with cross-concentrated sampling: Bridging uniform sampling and CUR sampling,

    H. Cai, L. Huang, P. Li, and D. Needell, “Matrix completion with cross-concentrated sampling: Bridging uniform sampling and CUR sampling,” IEEE Trans. Pattern Anal. Mach. Intell. , vol. 45, no. 8, pp. 10 100–10 113, 2023

  7. [7]

    Recovering the missing components in a large noisy low-rank matrix: Application to sfm,

    P. Chen and D. Suter, “Recovering the missing components in a large noisy low-rank matrix: Application to sfm,” IEEE Trans. Pattern Anal. Mach. Intell. , vol. 26, no. 8, pp. 1051–1063, 2004

  8. [8]

    Fast and accurate matrix completion via truncated nuclear norm regularization,

    Y . Hu, D. Zhang, J. Ye, X. Li, and X. He, “Fast and accurate matrix completion via truncated nuclear norm regularization,” IEEE Trans. Pattern Anal. Mach. Intell. , vol. 35, no. 9, pp. 2117–2130, 2012

  9. [9]

    Color image inpainting via robust pure quaternion matrix completion: Error bound and weighted loss,

    J. Chen and M. K. Ng, “Color image inpainting via robust pure quaternion matrix completion: Error bound and weighted loss,” SIAM J. Imaging Sci. , vol. 15, no. 3, pp. 1469–1498, 2022

  10. [10]

    Accelerated nmr spectroscopy with low-rank reconstruction,

    X. Qu, M. Mayzel, J.-F. Cai, Z. Chen, and V . Orekhov, “Accelerated nmr spectroscopy with low-rank reconstruction,” Angewandte Chemie International Edition , vol. 54, no. 3, pp. 852–854, 2015

  11. [11]

    Fast and provable algorithms for spectrally sparse signal reconstruction via low-rank hankel matrix completion,

    J.-F. Cai, T. Wang, and K. Wei, “Fast and provable algorithms for spectrally sparse signal reconstruction via low-rank hankel matrix completion,” Appl. Comput. Harmon. Anal., vol. 46, no. 1, pp. 94–121, 2019

  12. [12]

    Structured gradient descent for fast robust low-rank hankel matrix completion,

    H. Cai, J.-F. Cai, and J. You, “Structured gradient descent for fast robust low-rank hankel matrix completion,” SIAM J. Sci. Comput., vol. 45, no. 3, pp. A1172–A1198, 2023

  13. [13]

    Accelerating ill- conditioned hankel matrix recovery via structured newton-like descent,

    H. Cai, L. Huang, X. Lu, and J. You, “Accelerating ill- conditioned hankel matrix recovery via structured newton-like descent,” arXiv:2406.07409, 2024

  14. [14]

    Uniqueness of low-rank matrix completion by rigidity theory,

    A. Singer and M. Cucuringu, “Uniqueness of low-rank matrix completion by rigidity theory,” SIAM J. Matrix Anal. Appl. , vol. 31, no. 4, pp. 1621–1641, 2010

  15. [15]

    Exact reconstruction of Euclidean distance geometry problem using low-rank matrix completion,

    A. Tasissa and R. Lai, “Exact reconstruction of Euclidean distance geometry problem using low-rank matrix completion,” IEEE Trans. Inform. Theory , vol. 65, no. 5, pp. 3124–3144, 2018

  16. [16]

    Riemannian optimization for non-convex Euclidean distance geometry with global recovery guarantees,

    C. Smith, H. Cai, and A. Tasissa, “Riemannian optimization for non-convex Euclidean distance geometry with global recovery guarantees,” arXiv:2410.06376, 2024

  17. [17]

    Structured sampling for robust Euclidean distance geometry,

    C. Kundu, A. Tasissa, and H. Cai, “Structured sampling for robust Euclidean distance geometry,” arXiv:2412.10664, 2024

  18. [18]

    Laplacian convolutional representation for traffic time series imputation,

    X. Chen, Z. Cheng, H. Cai, N. Saunier, and L. Sun, “Laplacian convolutional representation for traffic time series imputation,” IEEE Trans. Knowl. Data Eng. , vol. 36, no. 11, pp. 6490–6502, 2024

  19. [19]

    Forecasting urban traffic states with sparse data using hankel temporal matrix factorization,

    X. Chen, X.-L. Zhao, and C. Cheng, “Forecasting urban traffic states with sparse data using hankel temporal matrix factorization,” INFORMS J. Comput. , 2024

  20. [20]

    Low-rank matrix recovery from errors and erasures,

    Y . Chen, A. Jalali, S. Sanghavi, and C. Caramanis, “Low-rank matrix recovery from errors and erasures,” IEEE Trans. Inform. Theory, vol. 59, no. 7, pp. 4324–4337, 2013

  21. [21]

    Nearly optimal robust matrix completion,

    Y . Cherapanamjeri, K. Gupta, and P. Jain, “Nearly optimal robust matrix completion,” in International Conference on Machine Learning, 2017, pp. 797–805

  22. [22]

    Outlier-robust matrix completion via ℓp-minimization,

    W.-J. Zeng and H. C. So, “Outlier-robust matrix completion via ℓp-minimization,” IEEE Trans. Signal Process. , vol. 66, no. 5, pp. 1125–1140, 2017

  23. [23]

    Robust CUR decomposition: Theory and imaging applications,

    H. Cai, K. Hamm, L. Huang, and D. Needell, “Robust CUR decomposition: Theory and imaging applications,” SIAM J. Imaging Sci., vol. 14, no. 4, pp. 1472–1503, 2021

  24. [24]

    Robust Matrix Completion with Heavy- tailed Noise,

    B. Wang and J. Fan, “Robust Matrix Completion with Heavy- tailed Noise,” J. Am. Stat. Assoc. , pp. 1–22, 2024

  25. [25]

    On the robustness of cross-concentrated sampling for matrix completion,

    H. Cai, L. Huang, C. Kundu, and B. Su, “On the robustness of cross-concentrated sampling for matrix completion,” in Conference on Information Sciences and Systems , 2024

  26. [26]

    Learning fast approximations of sparse coding,

    K. Gregor and Y . LeCun, “Learning fast approximations of sparse coding,” in International Conference on Machine Learning, 2010, pp. 399–406

  27. [27]

    The Augmented Lagrange Multiplier Method for Exact Recovery of Corrupted Low-Rank Matrices

    Z. Lin, M. Chen, and Y . Ma, “The augmented Lagrange multiplier method for exact recovery of corrupted low-rank matrices,” arXiv preprint arXiv:1009.5055 , 2010

  28. [28]

    Robust principal component analysis?

    E. J. Cand `es, X. Li, Y . Ma, and J. Wright, “Robust principal component analysis?” J. ACM, vol. 58, no. 3, pp. 1–37, 2011

  29. [29]

    Fast algorithms for robust PCA via gradient descent,

    X. Yi, D. Park, Y . Chen, and C. Caramanis, “Fast algorithms for robust PCA via gradient descent,” Advances in Neural Information Processing Systems , vol. 29, 2016

  30. [30]

    Robust PCA by manifold optimization,

    T. Zhang and Y . Yang, “Robust PCA by manifold optimization,” J. Mach. Learn. Res. , vol. 19, no. 80, pp. 1–39, 2018

  31. [31]

    Accelerating ill-conditioned low- rank matrix estimation via scaled gradient descent,

    T. Tong, C. Ma, and Y . Chi, “Accelerating ill-conditioned low- rank matrix estimation via scaled gradient descent,” J. Mach. Learn. Res., vol. 22, no. 1, pp. 6639–6701, 2021. 14

  32. [32]

    Maximal sparsity with deep networks?

    B. Xin, Y . Wang, W. Gao, D. Wipf, and B. Wang, “Maximal sparsity with deep networks?” Advances in Neural Information Processing Systems, vol. 29, pp. 4340–4348, 2016

  33. [33]

    Deep ADMM-Net for compressive sensing MRI,

    Y . Yang, J. Sun, H. Li, and Z. Xu, “Deep ADMM-Net for compressive sensing MRI,” in Advances in Neural Information Processing Systems, 2016, pp. 10–18

  34. [34]

    Learned D-AMP: Principled neural network based compressive image recovery,

    C. A. Metzler, A. Mousavi, and R. G. Baraniuk, “Learned D-AMP: Principled neural network based compressive image recovery,” Advances in Neural Information Processing Systems , pp. 1773–1784, 2017

  35. [35]

    ISTA-Net: Interpretable optimization- inspired deep network for image compressive sensing,

    J. Zhang and B. Ghanem, “ISTA-Net: Interpretable optimization- inspired deep network for image compressive sensing,” in IEEE Conference on Computer Vision and Pattern Recognition , 2018, pp. 1828–1837

  36. [36]

    Learned primal-dual reconstruction,

    J. Adler and O. ¨Oktem, “Learned primal-dual reconstruction,” IEEE Trans. Med. Imaging , vol. 37, no. 6, pp. 1322–1332, 2018

  37. [37]

    ALISTA: Analytic weights are as good as learned weights in LISTA,

    J. Liu, X. Chen, Z. Wang, and W. Yin, “ALISTA: Analytic weights are as good as learned weights in LISTA,” in Interna- tional Conference on Learning Representations , 2019

  38. [38]

    Algorithm unrolling: Interpretable, efficient deep learning for signal and image processing,

    V . Monga, Y . Li, and Y . C. Eldar, “Algorithm unrolling: Interpretable, efficient deep learning for signal and image processing,” IEEE Signal Process. Mag. , vol. 38, no. 2, pp. 18–44, 2021

  39. [39]

    WTDUN: Wavelet tree-structured sampling and deep unfolding network for image compressed sensing,

    K. Han, J. Wang, Y . Shi, H. Cai, N. Ling, and B. Yin, “WTDUN: Wavelet tree-structured sampling and deep unfolding network for image compressed sensing,” ACM Trans. Multimedia Comput. Commun. Appl., vol. 21, no. 1, pp. 33.1–33.22, 2024

  40. [40]

    Optimization guarantees of unfolded ista and admm networks with smooth soft-thresholding,

    S. B. Shah, P. Pradhan, W. Pu, R. Randhi, M. R. Rodrigues, and Y . C. Eldar, “Optimization guarantees of unfolded ista and admm networks with smooth soft-thresholding,” IEEE Trans. Signal Process., 2024

  41. [41]

    Proximal gradient- based unfolding for massive random access in iot networks,

    Y . Zou, Y . Zhou, X. Chen, and Y . C. Eldar, “Proximal gradient- based unfolding for massive random access in iot networks,” IEEE Trans. Wireless Commun. , 2024

  42. [42]

    Unrolled denoising networks provably learn optimal bayesian inference,

    A. Karan, K. Shah, S. Chen, and Y . C. Eldar, “Unrolled denoising networks provably learn optimal bayesian inference,” arXiv:2409.12947, 2024

  43. [43]

    Learning to learn by gradient descent by gradient descent,

    M. Andrychowicz, M. Denil, S. Gomez, M. W. Hoffman, D. Pfau, T. Schaul, B. Shillingford, and N. D. Freitas, “Learning to learn by gradient descent by gradient descent,” in Advances in Neural Information Processing Systems , 2016, pp. 3981–3989

  44. [44]

    Learned optimizers that scale and generalize,

    O. Wichrowska, N. Maheswaranathan, M. W. Hoffman, S. G. Colmenarejo, M. Denil, N. Freitas, and J. Sohl-Dickstein, “Learned optimizers that scale and generalize,” in International Conference on Machine Learning , 2017, pp. 3751–3760

  45. [45]

    Understanding and correcting pathologies in the training of learned optimizers,

    L. Metz, N. Maheswaranathan, J. Nixon, D. Freeman, and J. Sohl- Dickstein, “Understanding and correcting pathologies in the training of learned optimizers,” in International Conference on Machine Learning, 2019, pp. 4556–4565

  46. [46]

    A closer look at learned optimization: Stability, robustness, and inductive biases,

    J. Harrison, L. Metz, and J. Sohl-Dickstein, “A closer look at learned optimization: Stability, robustness, and inductive biases,” Advances in Neural Information Processing Systems , vol. 35, pp. 3758–3773, 2022

  47. [47]

    Towards constituting mathematical structures for learning to optimize,

    J. Liu, X. Chen, Z. Wang, W. Yin, and H. Cai, “Towards constituting mathematical structures for learning to optimize,” in International Conference on Machine Learning , vol. 202, 2023, pp. 21 426–21 449

  48. [48]

    A mathematics- inspired learning-to-optimize framework for decentralized opti- mization,

    Y . He, Q. Shang, X. Huang, J. Liu, and K. Yuan, “A mathematics- inspired learning-to-optimize framework for decentralized opti- mization,” arXiv:2410.01700, 2024

  49. [49]

    Deep convolutional robust PCA with application to ultrasound imaging,

    R. Cohen, Y . Zhang, O. Solomon, D. Toberman, L. Taieb, R. J. van Sloun, and Y . C. Eldar, “Deep convolutional robust PCA with application to ultrasound imaging,” in IEEE International Conference on Acoustics, Speech and Signal Processing , 2019, pp. 3212–3216

  50. [50]

    Deep unfolded robust PCA with application to clutter suppression in ultrasound,

    O. Solomon, R. Cohen, Y . Zhang, Y . Yang, Q. He, J. Luo, R. J. van Sloun, and Y . C. Eldar, “Deep unfolded robust PCA with application to clutter suppression in ultrasound,” IEEE Trans. Med. Imaging, vol. 39, no. 4, pp. 1051–1063, 2019

  51. [51]

    A deep-unfolded reference-based RPCA network for video foreground-background separation,

    H. V . Luong, B. Joukovsky, Y . C. Eldar, and N. Deligiannis, “A deep-unfolded reference-based RPCA network for video foreground-background separation,” in European Signal Pro- cessing Conference, 2021, pp. 1432–1436

  52. [52]

    Multimodal unrolled robust pca for background foreground separation,

    S. Markowitz, C. Snyder, Y . C. Eldar, and M. N. Do, “Multimodal unrolled robust pca for background foreground separation,” IEEE Trans. Image Process., vol. 31, pp. 3553–3564, 2022

  53. [53]

    Interpretable neural networks for video separation: Deep unfolding rpca with foreground masking,

    B. Joukovsky, Y . C. Eldar, and N. Deligiannis, “Interpretable neural networks for video separation: Deep unfolding rpca with foreground masking,” IEEE Trans. Image Process. , 2023

  54. [54]

    A singular value thresholding algorithm for matrix completion,

    J.-F. Cai, E. J. Cand `es, and Z. Shen, “A singular value thresholding algorithm for matrix completion,” SIAM J. Optim. , vol. 20, no. 4, pp. 1956–1982, 2010

  55. [55]

    Denise: Deep learning based robust PCA for positive semidefinite matrices,

    C. Herrera, F. Krach, A. Kratsios, P. Ruyssen, and J. Teich- mann, “Denise: Deep learning based robust PCA for positive semidefinite matrices,” arXiv:2004.13612, 2020

  56. [56]

    Non-convex robust PCA,

    P. Netrapalli, U. Niranjan, S. Sanghavi, A. Anandkumar, and P. Jain, “Non-convex robust PCA,” in Advances in Neural Information Processing Systems , 2014, pp. 1107–1115

  57. [57]

    Accelerated alternating projections for robust principal component analysis,

    H. Cai, J.-F. Cai, and K. Wei, “Accelerated alternating projections for robust principal component analysis,” J. Mach. Learn. Res. , vol. 20, no. 1, pp. 685–717, 2019

  58. [58]

    Rapid robust principal component analysis: CUR accelerated inexact low rank estimation,

    H. Cai, K. Hamm, L. Huang, J. Li, and T. Wang, “Rapid robust principal component analysis: CUR accelerated inexact low rank estimation,” IEEE Signal Process. Lett. , vol. 28, pp. 116–120, 2020

  59. [59]

    Riemannian CUR decompo- sitions for robust principal component analysis,

    K. Hamm, M. Meskini, and H. Cai, “Riemannian CUR decompo- sitions for robust principal component analysis,” in Topological, Algebraic and Geometric Learning Workshops , 2022, pp. 152– 160

  60. [60]

    Theoretical analysis on conver- gence acceleration of deep-unfolded gradient descent,

    S. Takabe and T. Wadayama, “Theoretical analysis on conver- gence acceleration of deep-unfolded gradient descent,” IEICE Technical Report, vol. 120, no. 142, pp. 25–30, 2020

  61. [61]

    A simpler approach to matrix completion

    B. Recht, “A simpler approach to matrix completion.” J. Mach. Learn. Res., vol. 12, no. 12, 2011

  62. [62]

    Rank-sparsity incoherence for matrix decomposition,

    V . Chandrasekaran, S. Sanghavi, P. A. Parrilo, and A. S. Willsky, “Rank-sparsity incoherence for matrix decomposition,” SIAM J. Optim., vol. 21, no. 2, pp. 572–596, 2011

  63. [63]

    Theoretical linear convergence of unfolded ista and its practical weights and thresholds,

    X. Chen, J. Liu, Z. Wang, and W. Yin, “Theoretical linear convergence of unfolded ista and its practical weights and thresholds,” Advances in Neural Information Processing Systems , vol. 31, 2018

  64. [64]

    Training stronger baselines for learning to optimize,

    T. Chen, W. Zhang, Z. Jingyang, S. Chang, S. Liu, L. Amini, and Z. Wang, “Training stronger baselines for learning to optimize,” Advances in Neural Information Processing Systems , vol. 33, pp. 7332–7343, 2020

  65. [65]

    A large- scale benchmark dataset for event recognition in surveillance video,

    S. Oh, A. Hoogs, A. Perera, N. Cuntoor, C.-C. Chen, J. T. Lee, S. Mukherjee, J. Aggarwal, H. Lee, L. Davis et al., “A large- scale benchmark dataset for event recognition in surveillance video,” in CVPR 2011, 2011, pp. 3153–3160

  66. [66]

    The AR face database,

    A. Martinez and R. Benavente, “The AR face database,” CVC technical report, 24 , 1998