On the resolution of $\ell_1$-norm minimization via a two-metric adaptive projection method

Hanju Wu; Yue Xie

arxiv: 2504.12260 · v3 · submitted 2025-04-16 · 🧮 math.OC

On the resolution of ell₁-norm minimization via a two-metric adaptive projection method

Hanju Wu , Yue Xie This is my paper

Pith reviewed 2026-05-22 19:59 UTC · model grok-4.3

classification 🧮 math.OC

keywords ℓ1-norm minimizationtwo-metric projectionactive-set methodmanifold identificationsuperlinear convergencelarge-scale optimizationerror bound condition

0 comments

The pith

A two-metric adaptive projection method solves ℓ1-norm minimization by identifying the underlying sparse manifold in finite iterations and then switching to unconstrained Newton steps on the reduced subspace.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces an algorithm for ℓ1-norm minimization that extends Bertsekas-style two-metric projection without first converting the problem into an equivalent bound-constrained form. It refines the index-set partition, applies an adaptive projection, and uses a custom linesearch to allow singular Hessians and inexact Newton solves. The method is shown to converge globally and, after a finite number of steps, to identify the active manifold so that subsequent iterations solve an unconstrained Newton problem on the identified subspace. Under an error-bound condition near the solution, the local convergence rate becomes superlinear, which is especially useful when the optimum is sparse.

Core claim

The central claim is that the proposed two-metric adaptive projection method for ℓ1-norm minimization achieves global convergence, finite-time manifold identification, and, under an error bound condition, locally superlinear convergence. After the manifold is identified the algorithm reduces exactly to an unconstrained Newton method on the corresponding low-dimensional subspace while still accommodating singular Hessians and inexact linear solves.

What carries the argument

The two-metric adaptive projection step that uses a refined partition of the index set together with a novel linesearch rule to drive manifold identification.

If this is right

The algorithm terminates in finite steps on the manifold for any sparse solution and thereafter behaves like an unconstrained Newton solver.
Global convergence is guaranteed without requiring the Hessian to be positive definite at every iteration.
Inexact Newton solves and singular Hessians are permitted without losing the theoretical guarantees.
The method remains practical for large-scale problems because each iteration has low cost until the manifold is identified.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The finite manifold-identification property could be exploited to warm-start other sparse solvers once the support is known.
Similar index-set partitioning ideas might extend the same guarantees to ℓ1-regularized problems with additional convex constraints.
The linesearch rule may be reusable in other active-set frameworks that must handle nondifferentiable points.

Load-bearing premise

An error bound condition holds in a neighborhood of the solution.

What would settle it

A numerical test on a sparse problem where the iterates fail to reach the exact support set in finitely many steps or where the error reduction after identification is only linear rather than superlinear.

read the original abstract

In this work, we propose an efficient two-metric adaptive projection method for solving the $\ell_1$-norm minimization problem. Our approach is inspired by the two-metric projection method, a simple yet elegant algorithm proposed by Bertsekas for bound/box-constrained optimization problems. The low per-iteration cost of this method, combined with the ability to incorporate Hessian information, makes it particularly attractive for large-scale problems, and our proposed method inherits these advantages. Previous attempts to extend the two-metric projection method to $\ell_1$-norm minimization rely on an intermediate reformulation as a bound-constrained problem, which can lead to numerical instabilities in practice, in sharp contrast to our approach. Our algorithm features a refined partition of the index set, an adaptive projection, and a novel linesearch rule. It can accommodate singular Hessians as well as inexact solutions to the Newton linear system for practical implementation. We show that our method is theoretically sound - it has global convergence. Moreover, it is an active-set method capable of manifold identification: the underlying low-dimensional structure can be identified in a finite number of iterations, after which the algorithm reduces to an unconstrained Newton method on the identified subspace. Under an Error Bound condition, the method attains a locally superlinear convergence rate. Hence, when the solution is sparse, it achieves superfast convergence in terms of iterations while maintaining scalability, making it well-suited for large-scale problems. We conduct extensive numerical experiments to demonstrate the practical advantages of the proposed algorithm over several competitive methods from the literature, particularly in large-scale settings. }

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Extends two-metric projection to l1 minimization without bound reformulation, adding refined partition, adaptive projection and linesearch, with global convergence and local superlinear rate under error bound.

read the letter

The paper gives a direct extension of Bertsekas' two-metric projection method to the l1-norm minimization problem. It skips the intermediate bound-constrained reformulation that earlier attempts used, and instead introduces a refined index partition, an adaptive projection step, and a new linesearch rule. The method is designed to handle singular Hessians and inexact Newton solves, which matters for practical large-scale use in compressed sensing or lasso-type problems. It claims global convergence, finite identification of the active manifold, reduction to unconstrained Newton on that subspace, and local superlinear convergence when an error bound holds near the solution. When the solution is sparse, this structure should deliver fast iteration counts while keeping per-iteration work modest. The direct l1 treatment and the practical allowances for inexact steps are the clearest points of difference from prior work. The manifold-identification analysis follows a familiar pattern for active-set methods but is applied here without the reformulation step, which is a reasonable incremental step if the details hold. The main soft spots are the error bound assumption required for the local rate (standard but not automatic) and the lack of visible proof sketches in the abstract for how the linesearch and inexact Newton steps preserve the claimed properties. Numerical experiments are mentioned but not described in enough detail to judge their strength. This is a paper for people who build or use scalable solvers for sparse convex problems. A reader working on active-set or projection methods for nonsmooth optimization would find the algorithmic choices and the manifold-identification claim worth examining. It deserves a serious referee to verify the proofs and the experiments.

Referee Report

2 major / 2 minor

Summary. The paper proposes a two-metric adaptive projection method for the ℓ₁-norm minimization problem. The algorithm uses a refined partition of the index set, an adaptive projection step, and a novel linesearch rule. It accommodates singular Hessians and inexact Newton solves. Theoretical claims include global convergence, finite-time manifold identification (after which the method reduces to unconstrained Newton on the identified subspace), and local superlinear convergence under an error-bound condition. Extensive numerical experiments compare the method to existing approaches on large-scale instances.

Significance. If the convergence analysis holds, the method provides a practical active-set framework that combines the low per-iteration cost of projection methods with rapid local convergence once the support is identified. Avoiding an intermediate bound-constrained reformulation reduces numerical instability, while tolerance for singular Hessians and inexact linear solves improves applicability to large-scale sparse problems in signal processing and machine learning.

major comments (2)

[§4] §4 (Convergence analysis): the global convergence argument must explicitly verify that the novel linesearch rule and the adaptive projection preserve a sufficient descent property when the Hessian is singular or the Newton step is inexact; the current sketch does not address how the two-metric framework is modified to accommodate these relaxations.
[§5] §5 (Manifold identification): the finite identification result relies on the refined index-set partition; the proof should quantify the number of iterations needed to identify the active set in terms of the error-bound constant, or provide a counter-example showing that identification can fail without the error bound.

minor comments (2)

[§2] The problem statement (minimize ||x||₁ subject to Ax = b or the regularized form) is never written explicitly; add the mathematical formulation in §2 with clear notation for the linear operator A.
[Numerical experiments] Table 1 and Figure 3: axis labels and legend entries are too small for readability; increase font size and add a caption explaining the meaning of the “iteration count to identification” column.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive report. The comments highlight areas where the convergence analysis can be strengthened for clarity. We address each major comment below and will incorporate revisions to improve the manuscript.

read point-by-point responses

Referee: [§4] §4 (Convergence analysis): the global convergence argument must explicitly verify that the novel linesearch rule and the adaptive projection preserve a sufficient descent property when the Hessian is singular or the Newton step is inexact; the current sketch does not address how the two-metric framework is modified to accommodate these relaxations.

Authors: We agree that the global convergence argument in Section 4 requires more explicit verification for the cases of singular Hessians and inexact Newton steps. Although the manuscript claims accommodation of these features via the adaptive projection and linesearch, the provided sketch does not fully detail the modifications to the two-metric framework. In the revised manuscript, we will expand Section 4 with additional arguments showing that the linesearch rule preserves a sufficient descent property by ensuring the projected step yields a negative directional derivative even under singularity (via a suitable regularization or safeguarding), and that inexact solves are handled by controlling the error in the Newton direction within the two-metric projection. This will include new technical details or lemmas to make the argument rigorous. revision: yes
Referee: [§5] §5 (Manifold identification): the finite identification result relies on the refined index-set partition; the proof should quantify the number of iterations needed to identify the active set in terms of the error-bound constant, or provide a counter-example showing that identification can fail without the error bound.

Authors: We clarify that the finite-time manifold identification in Section 5 is proven using the refined index-set partition and holds independently of the error-bound condition. The error bound is used exclusively for establishing the local superlinear convergence rate once the manifold is identified. The proof demonstrates that the active set is identified after a finite number of iterations under the standing assumptions of the problem (e.g., existence of a minimizer and continuity properties), without invoking the error bound. Therefore, quantifying iterations specifically in terms of the error-bound constant is not applicable, as the result does not depend on it. We will add an explicit remark in the revised Section 5 to separate these results and note that identification succeeds without the error bound. A counter-example is not needed, as the existing argument shows success in the absence of the error bound; however, we can include a brief illustrative discussion if it aids clarity. revision: partial

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The paper develops an algorithmic extension of Bertsekas' two-metric projection method to l1-minimization, featuring a refined index partition, adaptive projection, and linesearch. Global convergence, finite-time manifold identification, reduction to unconstrained Newton on the identified subspace, and local superlinear rate under an Error Bound condition are established via standard arguments from nonsmooth optimization theory. These steps invoke external references (Bertsekas, error-bound literature) rather than self-citations or internal redefinitions; no equation or claim reduces by construction to a fitted input, ansatz smuggled via prior work by the same authors, or renaming of known results. The derivation remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claims rest on standard convex optimization assumptions plus the error-bound condition; no new free parameters or invented entities are introduced in the abstract.

axioms (2)

domain assumption The objective is convex and the feasible set is convex (implicit for ℓ1 minimization).
Required for global convergence of projection-type methods.
domain assumption An error bound condition holds in a neighborhood of the solution.
Explicitly invoked for the locally superlinear convergence rate.

pith-pipeline@v0.9.0 · 5819 in / 1413 out tokens · 42708 ms · 2026-05-22T19:59:41.862516+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

two-metric adaptive projection... manifold identification... superlinear convergence under Error Bound and strict complementarity (Theorem 4.7)
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

J-cost uniqueness and φ-ladder constants

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Identifiability and Error Bound: Metric and Geometric Perspectives
math.OC 2026-05 unverdicted novelty 6.0

Local error bound in ambient Euclidean space is equivalent to local error bound on the identifiable manifold under mild assumptions, shown via metric and geometric arguments.

Reference graph

Works this paper leans on

37 extracted references · 37 canonical work pages · cited by 1 Pith paper · 1 internal anchor

[1]

Absil, R

P.-A. Absil, R. Mahony, and R. Sepulchre , Optimization algorithms on matrix manifolds , Princeton University Press, 2008

work page 2008
[2]

Attouch, J

H. Attouch, J. Bolte, P. Redont, and A. Soubeyran , Proximal alternating minimiza- tion and projection methods for nonconvex problems: An approach based on the Kurdyka- Lojasiewicz inequality, Mathematics of operations research, 35 (2010), pp. 438–457

work page 2010
[3]

Bareilles, F

G. Bareilles, F. Iutzeler, and J. Malick , Newton acceleration on manifolds identified by proximal gradient methods, Mathematical Programming, 200 (2023), pp. 37–70

work page 2023
[4]

H. H. Bauschke, P. L. Combettes, H. H. Bauschke, and P. L. Combettes , Correction to: convex analysis and monotone operator theory in Hilbert spaces , Springer, 2017

work page 2017
[5]

Beck, First-order methods in optimization , SIAM, 2017

A. Beck, First-order methods in optimization , SIAM, 2017

work page 2017
[6]

Beck and M

A. Beck and M. Teboulle, A fast iterative shrinkage-thresholding algorithm for linear inverse problems, SIAM journal on imaging sciences, 2 (2009), pp. 183–202

work page 2009
[7]

D. P. Bertsekas, Projected Newton methods for optimization problems with simple constraints, SIAM Journal on control and Optimization, 20 (1982), pp. 221–246

work page 1982
[8]

Cambridge University Press, Cambridge, UK (2023)

N. Boumal, An introduction to optimization on smooth manifolds, Cambridge University Press, 2023, https://doi.org/10.1017/9781009166164, https://www.nicolasboumal.net/book

work page doi:10.1017/9781009166164 2023
[9]

S. Boyd, N. Parikh, E. Chu, B. Peleato, J. Eckstein, et al., Distributed optimization and statistical learning via the alternating direction method of multipliers , Foundations and Trends® in Machine learning, 3 (2011), pp. 1–122

work page 2011
[10]

E. J. Cand `es, J. Romberg, and T. Tao , Robust uncertainty principles: Exact signal recon- struction from highly incomplete frequency information, IEEE Transactions on information theory, 52 (2006), pp. 489–509

work page 2006
[11]

Carmon, J

Y. Carmon, J. C. Duchi, O. Hinder, and A. Sidford , Lower bounds for finding stationary points i, Mathematical Programming, 184 (2020), pp. 71–120

work page 2020
[12]

Chang, Libsvm data: Classification, regression, and multi-label , http://www

C.-C. Chang, Libsvm data: Classification, regression, and multi-label , http://www. csie. ntu. edu. tw/˜ cjlin/libsvmtools/datasets/, (2008)

work page 2008
[13]

E. M. Gafni and D. P. Bertsekas, Two-metric projection methods for constrained optimiza- tion, SIAM Journal on Control and Optimization, 22 (1984), pp. 936–964

work page 1984
[14]

W. L. Hare and A. S. Lewis , Identifying active constraints via partial smoothness and prox- regularity, Journal of Convex Analysis, 11 (2004), pp. 251–266

work page 2004
[15]

J. Hu, T. Tian, S. Pan, and Z. Wen , On the local convergence of the semismooth Newton method for composite optimization , arXiv preprint arXiv:2211.01127, (2022)

work page arXiv 2022
[16]

Lee, Accelerating inexact successive quadratic approximation for regularized optimization through manifold identification, Mathematical Programming, 201 (2023), pp

C.-p. Lee, Accelerating inexact successive quadratic approximation for regularized optimization through manifold identification, Mathematical Programming, 201 (2023), pp. 599–633

work page 2023
[17]

Lewis and T

A. Lewis and T. Tian, Identifiability, the kl property in metric spaces, and subgradient curves, Foundations of Computational Mathematics, (2024), pp. 1–38

work page 2024
[18]

A. S. Lewis and J. Liang , Partial smoothness and constant rank , arXiv preprint arXiv:1807.03134, (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018
[19]

A. S. Lewis, J. Liang, and T. Tian , Partial smoothness and constant rank , SIAM Journal on Optimization, 32 (2022), pp. 276–291

work page 2022
[20]

X. Li, D. Sun, and K.-C. Toh , A highly efficient semismooth Newton augmented lagrangian method for solving lasso problems, SIAM Journal on Optimization, 28 (2018), pp. 433–458

work page 2018
[21]

Milzarek and M

A. Milzarek and M. Ulbrich , A semismooth Newton method with multidimensional filter globalization for l 1-optimization, SIAM Journal on Optimization, 24 (2014), pp. 298–333

work page 2014
[22]

B. S. Mordukhovich, X. Yuan, S. Zeng, and J. Zhang , A globally convergent proximal Newton-type method in nonsmooth convex optimization , Mathematical Programming, 198 (2023), pp. 899–936

work page 2023
[23]

Nesterov, Gradient methods for minimizing composite functions , Mathematical program- ming, 140 (2013), pp

Y. Nesterov, Gradient methods for minimizing composite functions , Mathematical program- ming, 140 (2013), pp. 125–161

work page 2013
[24]

Schmidt, Graphical model structure learning with l1-regularization , University of British Columbia, (2010), p

M. Schmidt, Graphical model structure learning with l1-regularization , University of British Columbia, (2010), p. 26. TWO-METRIC ADAPTIVE PROJECTION 29

work page 2010
[25]

Schmidt, G

M. Schmidt, G. Fung, and R. Rosales , Fast optimization methods for l1 regularization: A comparative study and two new approaches , in Machine Learning: ECML 2007: 18th European Conference on Machine Learning, Warsaw, Poland, September 17-21, 2007. Pro- ceedings 18, Springer, 2007, pp. 286–297

work page 2007
[26]

Tseng and S

P. Tseng and S. Yun , A coordinate gradient descent method for nonsmooth separable mini- mization, Mathematical Programming, 117 (2009), pp. 387–423

work page 2009
[27]

Z. Wen, W. Yin, D. Goldfarb, and Y. Zhang , A fast algorithm for sparse reconstruction based on shrinkage, subspace optimization, and continuation , SIAM Journal on Scientific Computing, 32 (2010), pp. 1832–1857

work page 2010
[28]

S. J. Wright, Identifiable surfaces in constrained optimization, SIAM Journal on Control and Optimization, 31 (1993), pp. 1063–1079

work page 1993
[29]

S. J. Wright, Coordinate descent algorithms, Mathematical programming, 151 (2015), pp. 3– 34

work page 2015
[30]

S. J. Wright, R. D. Nowak, and M. A. Figueiredo , Sparse reconstruction by separable approximation, IEEE Transactions on signal processing, 57 (2009), pp. 2479–2493

work page 2009
[31]

S. J. Wright and B. Recht , Optimization for data analysis , Cambridge University Press, 2022

work page 2022
[32]

Wu and Y

H. Wu and Y. Xie , A study on two-metric projection methods , arXiv preprint arXiv:2409.05321, (2024)

work page arXiv 2024
[33]

X. Xiao, Y. Li, Z. Wen, and L. Zhang , A regularized semi-smooth Newton method with projection steps for composite convex programs, Journal of Scientific Computing, 76 (2018), pp. 364–389

work page 2018
[34]

Yuan, C.-H

G.-X. Yuan, C.-H. Ho, and C.-J. Lin , An improved glmnet for l1-regularized logistic regres- sion, The Journal of Machine Learning Research, 13 (2012), pp. 1999–2030

work page 2012
[35]

M.-C. Yue, Z. Zhou, and A. M.-C. So , A family of inexact sqa methods for non-smooth convex minimization with provable convergence guarantees based on the luo–tseng error bound property, Mathematical Programming, 174 (2019), pp. 327–358

work page 2019
[36]

Yun and K.-C

S. Yun and K.-C. Toh , A coordinate gradient descent method for l1-regularized convex mini- mization, Computational Optimization and Applications, 48 (2011), pp. 273–307

work page 2011
[37]

Zhou and A

Z. Zhou and A. M.-C. So, A unified approach to error bounds for structured convex optimiza- tion problems, Mathematical Programming, 165 (2017), pp. 689–728

work page 2017

[1] [1]

Absil, R

P.-A. Absil, R. Mahony, and R. Sepulchre , Optimization algorithms on matrix manifolds , Princeton University Press, 2008

work page 2008

[2] [2]

Attouch, J

H. Attouch, J. Bolte, P. Redont, and A. Soubeyran , Proximal alternating minimiza- tion and projection methods for nonconvex problems: An approach based on the Kurdyka- Lojasiewicz inequality, Mathematics of operations research, 35 (2010), pp. 438–457

work page 2010

[3] [3]

Bareilles, F

G. Bareilles, F. Iutzeler, and J. Malick , Newton acceleration on manifolds identified by proximal gradient methods, Mathematical Programming, 200 (2023), pp. 37–70

work page 2023

[4] [4]

H. H. Bauschke, P. L. Combettes, H. H. Bauschke, and P. L. Combettes , Correction to: convex analysis and monotone operator theory in Hilbert spaces , Springer, 2017

work page 2017

[5] [5]

Beck, First-order methods in optimization , SIAM, 2017

A. Beck, First-order methods in optimization , SIAM, 2017

work page 2017

[6] [6]

Beck and M

A. Beck and M. Teboulle, A fast iterative shrinkage-thresholding algorithm for linear inverse problems, SIAM journal on imaging sciences, 2 (2009), pp. 183–202

work page 2009

[7] [7]

D. P. Bertsekas, Projected Newton methods for optimization problems with simple constraints, SIAM Journal on control and Optimization, 20 (1982), pp. 221–246

work page 1982

[8] [8]

Cambridge University Press, Cambridge, UK (2023)

N. Boumal, An introduction to optimization on smooth manifolds, Cambridge University Press, 2023, https://doi.org/10.1017/9781009166164, https://www.nicolasboumal.net/book

work page doi:10.1017/9781009166164 2023

[9] [9]

S. Boyd, N. Parikh, E. Chu, B. Peleato, J. Eckstein, et al., Distributed optimization and statistical learning via the alternating direction method of multipliers , Foundations and Trends® in Machine learning, 3 (2011), pp. 1–122

work page 2011

[10] [10]

E. J. Cand `es, J. Romberg, and T. Tao , Robust uncertainty principles: Exact signal recon- struction from highly incomplete frequency information, IEEE Transactions on information theory, 52 (2006), pp. 489–509

work page 2006

[11] [11]

Carmon, J

Y. Carmon, J. C. Duchi, O. Hinder, and A. Sidford , Lower bounds for finding stationary points i, Mathematical Programming, 184 (2020), pp. 71–120

work page 2020

[12] [12]

Chang, Libsvm data: Classification, regression, and multi-label , http://www

C.-C. Chang, Libsvm data: Classification, regression, and multi-label , http://www. csie. ntu. edu. tw/˜ cjlin/libsvmtools/datasets/, (2008)

work page 2008

[13] [13]

E. M. Gafni and D. P. Bertsekas, Two-metric projection methods for constrained optimiza- tion, SIAM Journal on Control and Optimization, 22 (1984), pp. 936–964

work page 1984

[14] [14]

W. L. Hare and A. S. Lewis , Identifying active constraints via partial smoothness and prox- regularity, Journal of Convex Analysis, 11 (2004), pp. 251–266

work page 2004

[15] [15]

J. Hu, T. Tian, S. Pan, and Z. Wen , On the local convergence of the semismooth Newton method for composite optimization , arXiv preprint arXiv:2211.01127, (2022)

work page arXiv 2022

[16] [16]

Lee, Accelerating inexact successive quadratic approximation for regularized optimization through manifold identification, Mathematical Programming, 201 (2023), pp

C.-p. Lee, Accelerating inexact successive quadratic approximation for regularized optimization through manifold identification, Mathematical Programming, 201 (2023), pp. 599–633

work page 2023

[17] [17]

Lewis and T

A. Lewis and T. Tian, Identifiability, the kl property in metric spaces, and subgradient curves, Foundations of Computational Mathematics, (2024), pp. 1–38

work page 2024

[18] [18]

A. S. Lewis and J. Liang , Partial smoothness and constant rank , arXiv preprint arXiv:1807.03134, (2018)

work page internal anchor Pith review Pith/arXiv arXiv 2018

[19] [19]

A. S. Lewis, J. Liang, and T. Tian , Partial smoothness and constant rank , SIAM Journal on Optimization, 32 (2022), pp. 276–291

work page 2022

[20] [20]

X. Li, D. Sun, and K.-C. Toh , A highly efficient semismooth Newton augmented lagrangian method for solving lasso problems, SIAM Journal on Optimization, 28 (2018), pp. 433–458

work page 2018

[21] [21]

Milzarek and M

A. Milzarek and M. Ulbrich , A semismooth Newton method with multidimensional filter globalization for l 1-optimization, SIAM Journal on Optimization, 24 (2014), pp. 298–333

work page 2014

[22] [22]

B. S. Mordukhovich, X. Yuan, S. Zeng, and J. Zhang , A globally convergent proximal Newton-type method in nonsmooth convex optimization , Mathematical Programming, 198 (2023), pp. 899–936

work page 2023

[23] [23]

Nesterov, Gradient methods for minimizing composite functions , Mathematical program- ming, 140 (2013), pp

Y. Nesterov, Gradient methods for minimizing composite functions , Mathematical program- ming, 140 (2013), pp. 125–161

work page 2013

[24] [24]

Schmidt, Graphical model structure learning with l1-regularization , University of British Columbia, (2010), p

M. Schmidt, Graphical model structure learning with l1-regularization , University of British Columbia, (2010), p. 26. TWO-METRIC ADAPTIVE PROJECTION 29

work page 2010

[25] [25]

Schmidt, G

M. Schmidt, G. Fung, and R. Rosales , Fast optimization methods for l1 regularization: A comparative study and two new approaches , in Machine Learning: ECML 2007: 18th European Conference on Machine Learning, Warsaw, Poland, September 17-21, 2007. Pro- ceedings 18, Springer, 2007, pp. 286–297

work page 2007

[26] [26]

Tseng and S

P. Tseng and S. Yun , A coordinate gradient descent method for nonsmooth separable mini- mization, Mathematical Programming, 117 (2009), pp. 387–423

work page 2009

[27] [27]

Z. Wen, W. Yin, D. Goldfarb, and Y. Zhang , A fast algorithm for sparse reconstruction based on shrinkage, subspace optimization, and continuation , SIAM Journal on Scientific Computing, 32 (2010), pp. 1832–1857

work page 2010

[28] [28]

S. J. Wright, Identifiable surfaces in constrained optimization, SIAM Journal on Control and Optimization, 31 (1993), pp. 1063–1079

work page 1993

[29] [29]

S. J. Wright, Coordinate descent algorithms, Mathematical programming, 151 (2015), pp. 3– 34

work page 2015

[30] [30]

S. J. Wright, R. D. Nowak, and M. A. Figueiredo , Sparse reconstruction by separable approximation, IEEE Transactions on signal processing, 57 (2009), pp. 2479–2493

work page 2009

[31] [31]

S. J. Wright and B. Recht , Optimization for data analysis , Cambridge University Press, 2022

work page 2022

[32] [32]

Wu and Y

H. Wu and Y. Xie , A study on two-metric projection methods , arXiv preprint arXiv:2409.05321, (2024)

work page arXiv 2024

[33] [33]

X. Xiao, Y. Li, Z. Wen, and L. Zhang , A regularized semi-smooth Newton method with projection steps for composite convex programs, Journal of Scientific Computing, 76 (2018), pp. 364–389

work page 2018

[34] [34]

Yuan, C.-H

G.-X. Yuan, C.-H. Ho, and C.-J. Lin , An improved glmnet for l1-regularized logistic regres- sion, The Journal of Machine Learning Research, 13 (2012), pp. 1999–2030

work page 2012

[35] [35]

M.-C. Yue, Z. Zhou, and A. M.-C. So , A family of inexact sqa methods for non-smooth convex minimization with provable convergence guarantees based on the luo–tseng error bound property, Mathematical Programming, 174 (2019), pp. 327–358

work page 2019

[36] [36]

Yun and K.-C

S. Yun and K.-C. Toh , A coordinate gradient descent method for l1-regularized convex mini- mization, Computational Optimization and Applications, 48 (2011), pp. 273–307

work page 2011

[37] [37]

Zhou and A

Z. Zhou and A. M.-C. So, A unified approach to error bounds for structured convex optimiza- tion problems, Mathematical Programming, 165 (2017), pp. 689–728

work page 2017