A operatorname{prox}-Based Semi-Smooth Newton Method for TV-Minimization
Pith reviewed 2026-05-22 03:08 UTC · model grok-4.3
The pith
A prox-based semi-smooth Newton method for total variation minimization is globally well-posed and locally superlinearly convergent after finite element discretization.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper establishes that the primal-dual optimality conditions for the total variation minimization problem can be rewritten as a nonlinear operator equation possessing Newton-type differentiability. For a conforming finite element discretization this yields a semi-smooth Newton method that is globally well-posed at every iteration and converges locally superlinearly to the discrete solution.
What carries the argument
The prox-based reformulation of the primal-dual optimality conditions as a Newton-differentiable nonlinear operator equation.
If this is right
- The approach extends directly to a large class of convex minimization problems.
- It coincides with established semi-smooth Newton methods for obstacle problems.
- The method satisfies a primal-dual invariance.
- Under suitable assumptions the scheme remains well-posed in the infinite-dimensional setting.
- Numerical performance shows reliable reduction of the discrete primal-dual gap to machine precision and robustness to proximity parameters.
Where Pith is reading between the lines
- The superlinear convergence rate could make the method preferable to first-order proximal algorithms for high-accuracy solutions on fine meshes.
- Primal-dual invariance might be used to construct efficient a posteriori error indicators for adaptive mesh refinement.
- The method could be combined with multigrid or other acceleration techniques to further improve efficiency on large-scale problems.
Load-bearing premise
The reformulation of the primal-dual optimality conditions must produce a nonlinear operator equation that possesses Newton-type differentiability.
What would settle it
Numerical computation of the discrete solutions on successively refined meshes for a problem with known exact solution, verifying that the number of Newton iterations remains bounded while the error decreases at a superlinear rate independent of the mesh size.
Figures
read the original abstract
In this paper, we devise a $\operatorname{prox}$-based semi-smooth Newton method for the non-differentiable TV-minimization problem. To this end, the primal-dual optimality conditions are reformulated as a nonlinear operator equation with Newton-(type-)differentiable structure. We investigate the question of well-posedness of the resulting semi-smooth Newton scheme in the infinite-dimensional setting and identify structural properties of the associated Newton-type derivatives. For a conforming finite element discretization, we prove that the resulting semi-smooth Newton method is globally well-posed and locally super-linearly convergent. The approach extends to a large class of convex minimization problems, coincides with established semi-smooth Newton methods for obstacle problems, satisfies a primal-dual invariance, and, under suitable additional assumptions, is well-posed in the infinite-dimensional setting. Numerical experiments indicate a robust practical performance of the proposed method, including reliable reduction of the discrete primal-dual gap estimator to machine precision, robustness with respect to the choice of proximity parameters, an improved convergence basin compared to a canonical primal semi-smooth Newton method, and effective performance even for quadratically graded meshes using only a mesh-independent initialization criterion.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a prox-based semi-smooth Newton method for total variation (TV) minimization. Primal-dual optimality conditions are reformulated as a nonlinear operator equation with Newton-type differentiable structure. Well-posedness is analyzed in infinite dimensions with structural properties of the Newton-type derivatives identified. For conforming finite element discretizations, global well-posedness and local superlinear convergence of the semi-smooth Newton scheme are proved. The approach extends to a broad class of convex minimization problems, coincides with known methods for obstacle problems, satisfies primal-dual invariance, and numerical experiments demonstrate robust performance including reliable primal-dual gap reduction to machine precision, robustness to proximity parameters, and improved convergence basin relative to a primal semi-smooth Newton method.
Significance. If the global well-posedness and convergence results hold, the work supplies a theoretically supported solver for TV-minimization with guaranteed iteration existence from arbitrary initial data and local superlinear rates, extending prior semi-smooth Newton schemes while offering practical robustness on graded meshes. The identification of structural properties of the prox-based derivatives and the primal-dual invariance are notable strengths that could benefit variational imaging and related optimization tasks.
major comments (2)
- [§5] §5 (Discrete well-posedness and convergence): The proof that the semi-smooth Newton iteration is globally well-posed (i.e., the Newton step exists at every iterate for arbitrary initial data) requires uniform invertibility (or a merit-function guarantee) of the generalized derivative at all generated points. The structural properties derived for the Jacobian of the proximal mapping of the dual ball constraint do not explicitly exclude singularity when discrete gradients align with the boundary of the constraint set or for certain proximity-parameter values; this must be addressed to support the global existence claim independent of mesh size and initialization.
- [§4.2] §4.2 (Infinite-dimensional analysis): The well-posedness investigation identifies Newton-type differentiability but leaves open whether the generalized derivatives satisfy the conditions for global existence of the iteration in the continuous setting; the 'suitable additional assumptions' mentioned for infinite-dimensional well-posedness need to be stated explicitly and verified to be non-vacuous for the TV case.
minor comments (3)
- [Abstract] The abstract states that the method 'coincides with established semi-smooth Newton methods for obstacle problems'; a brief remark or reference to the precise coincidence (e.g., via choice of prox) would clarify the connection.
- [Numerical experiments] Numerical section: The reported robustness with respect to proximity parameters is supported by experiments, but quantitative bounds or sensitivity plots would strengthen the claim that performance remains reliable across a wide range of parameter choices.
- [§3] Notation: The definition of the Newton-type derivative (likely in §3) uses a generalized Jacobian; ensuring consistent notation between the continuous and discrete settings would aid readability.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments on our manuscript. We address the major comments point by point below, indicating the revisions we plan to incorporate.
read point-by-point responses
-
Referee: [§5] §5 (Discrete well-posedness and convergence): The proof that the semi-smooth Newton iteration is globally well-posed (i.e., the Newton step exists at every iterate for arbitrary initial data) requires uniform invertibility (or a merit-function guarantee) of the generalized derivative at all generated points. The structural properties derived for the Jacobian of the proximal mapping of the dual ball constraint do not explicitly exclude singularity when discrete gradients align with the boundary of the constraint set or for certain proximity-parameter values; this must be addressed to support the global existence claim independent of mesh size and initialization.
Authors: We appreciate the referee highlighting the need for greater explicitness in the invertibility argument. The structural properties of the proximal mapping Jacobian, including its monotonicity and the positive contribution from the regularization term on the active set, do ensure nonsingularity even when discrete gradients align with the boundary or for admissible proximity parameters. To make this fully rigorous and independent of mesh size and initialization, we will revise §5 by adding a dedicated lemma with case analysis for boundary alignment scenarios. This will strengthen the global well-posedness proof without altering the main claims. revision: yes
-
Referee: [§4.2] §4.2 (Infinite-dimensional analysis): The well-posedness investigation identifies Newton-type differentiability but leaves open whether the generalized derivatives satisfy the conditions for global existence of the iteration in the continuous setting; the 'suitable additional assumptions' mentioned for infinite-dimensional well-posedness need to be stated explicitly and verified to be non-vacuous for the TV case.
Authors: We agree that the suitable additional assumptions for infinite-dimensional well-posedness should be stated explicitly. In the revision of §4.2, we will list them clearly (e.g., a positive lower bound on the proximity parameter and suitable integrability of the data) and verify that they are non-vacuous for the TV-minimization problem by confirming they hold under the standard assumptions on the domain and forcing term used throughout the paper and in the numerical section. revision: yes
Circularity Check
No circularity: convergence claims follow from structural analysis of Newton derivatives under standard semismooth theory
full rationale
The derivation begins with a reformulation of primal-dual optimality conditions into a nonlinear operator equation possessing Newton-type differentiability. Structural properties of the associated generalized derivatives are identified, after which standard semismooth Newton convergence results are applied to establish global well-posedness and local superlinear convergence for the conforming finite-element discretization. These steps rely on the explicit form of the proximal mapping Jacobian for the dual ball and on mesh-independent initialization, without reducing the well-posedness statement to a self-definition, a fitted parameter renamed as prediction, or a load-bearing self-citation whose content is itself unverified. The reported coincidence with established obstacle-problem schemes and the extension to broader convex problems supply independent content. No equation or theorem in the chain equates the target result to its own inputs by construction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Primal-dual optimality conditions admit a reformulation as a nonlinear operator equation with Newton-(type-)differentiable structure.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
the nonlinear mapping F_ε : A → B ... admits the root (z_ε, u_ε) ... Newton-type derivative J_Fε(y, v)(δy, δv) := [(1−J_prox( a ))∇δv − γ J_prox(a) δy; α δv − div δy]
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanabsolute_floor_iff_bare_distinguishability unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Theorem 3.2 ... Algorithm 3.1 is globally well-posed ... Newton derivative J_Fε_h is invertible with uniformly bounded inverse
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
H. Antil,S. Bartels,A. Kaltenbach, andR. Khandelwal, Variational problems with gradient constraints: A priori and a posteriori error identities,Math. Comput.(2025). doi:10.1090/mcom/4146
- [2]
-
[3]
doi:10.1137/1.9781611973488
-
[4]
A. AuslenderandM. Teboulle, Interior gradient and proximal methods for convex and conic optimization,SIAM J. Optim.16no. 3 (2006), 697–725. doi:10.1137/S1052623403427823
-
[5]
Balay et al.,PETSc Users Manual, Tech
S. Balay et al.,PETSc Users Manual, Tech. Report ANL-95/11 - Revision 3.6, Argonne National Laboratory, 2015
work page 2015
-
[6]
S. BartelsandA. Kaltenbach,A prox-Based Semi-Smooth Newton Method for Convex Varia- tional Problems, in preparation
-
[7]
S. BartelsandA. Kaltenbach, Explicit A Posteriori Error Representation for Variational Problems and Application to TV-Minimization,Found. Comput. Math.(2024). doi:10.1007/s10208- 024-09676-5
-
[8]
Bartels,Numerical methods for nonlinear partial differential equations,Springer Ser
S. Bartels,Numerical methods for nonlinear partial differential equations,Springer Ser. Comput. Math.47, Cham: Springer, 2015. doi:10.1007/978-3-319-13797-1 1
-
[9]
S. Bartels, Nonconforming discretizations of convex minimization problems and precise relations to mixed methods,Comput. Math. Appl.93(2021), 214–229. doi:10.1016/j.camwa.2021.04.014
-
[10]
S. Bartels,L. Diening, andR. H. Nochetto, Unconditional stability of semi-implicit discretiza- tions of singular flows,SIAM J. Numer. Anal.56no. 3 (2018), 1896–1914. doi:10.1137/17M1159166
-
[11]
S. Bartels,T. Gudi, andA. Kaltenbach, A priori and a posteriori error identities for the scalar Signorini problem,SIAM J. Numer. Anal.63no. 5 (2025), 2155–2186. doi:10.1137/24M1677691
-
[12]
S. BartelsandA. Kaltenbach, Error estimates for total-variation regularized minimization problems with singular dual solutions,Numer. Math.152no. 4 (2022), 881–906. doi:10.1007/s00211- 022-01324-w
-
[13]
S. BartelsandA. Kaltenbach, Explicit and efficient error estimation for convex minimization problems,Math. Comput.92no. 343 (2023), 2247–2279. doi:10.1090/mcom/3821
-
[14]
S. Bartels,R. Tovey, andF. Wassmer, Singular solutions, graded meshes, and adaptivity for total-variation regularized minimization problems,ESAIM, Math. Model. Numer. Anal.56no. 6 (2022), 1871–1888. doi:10.1051/m2an/2022056
-
[15]
H. H. BauschkeandP. L. Combettes,Convex analysis and monotone operator theory in Hilbert spaces, 2nd ed.,CMS Books Math./Ouvrages Math. SMC, Cham: Springer, 2017. doi:10.1007/978-3- 319-48311-5
-
[16]
Beck.First-order methods in optimization
A. Beck,First-order methods in optimization,MOS-SIAM Ser. Optim.25, SIAM; Philadelphia, MOS, 2017. doi:10.1137/1.9781611974997
-
[17]
A. BeckandM. Teboulle, Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems,IEEE Trans. Image Process.18no. 11 (2009), 2419–2434. doi:10.1109/TIP.2009.2028250
-
[18]
A fast iterative shrinkage-thresholding algorithm for linear inverse problems,
A. BeckandM. Teboulle, A fast iterative shrinkage-thresholding algorithm for linear inverse problems,SIAM J. Imaging Sci.2no. 1 (2009), 183–202. doi:10.1137/080716542
-
[19]
Y. BoykovandV. Kolmogorov, An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision, inEnergy Minimization Methods in Computer Vision and Pattern Recognition,Lecture Notes in Computer Science2134, Springer, 2001, pp. 359–374. doi:10.1007/3- 540-44745-8 24
work page doi:10.1007/3- 2001
-
[20]
H. Brezis,Functional analysis, Sobolev spaces and partial differential equations,Universitext, New York, NY: Springer, 2011. doi:10.1007/978-0-387-70914-7
-
[21]
Chambolle, An algorithm for total variation minimization and applications,J
A. Chambolle, An algorithm for total variation minimization and applications,J. Math. Imaging Vis.20no. 1-2 (2004), 89–97. doi:10.1023/B:JMIV.0000011325.36760.1e
-
[22]
A. ChambolleandT. Pock, A first-order primal-dual algorithm for convex problems with applications to imaging,J. Math. Imaging Vis.40no. 1 (2011), 120–145. doi:10.1007/s10851-010- 0251-1
-
[23]
A. ChambolleandT. Pock, Crouzeix-raviart approximation of the total variation on simplicial meshes,J. Math. Imaging Vis.62no. 6-7 (2020), 872–899. doi:10.1007/s10851-019-00939-3. prox-based semi-smooth Newton method for TV-minimization21
-
[24]
T. F. Chan,G. H. Golub, andP. Mulet, A nonlinear primal-dual method for to- tal variation-based image restoration,SIAM J. Sci. Comput.20no. 6 (1999), 1964–1977. doi:10.1137/S1064827596299767
-
[25]
T. F. Chan,H. M. Zhou, andR. H.-F. Chan, Continuation method for total variation denoising problems, inAdvanced Signal Processing Algorithms(F. T. Luk, ed.),2563, International Society for Optics and Photonics, SPIE, 1995, pp. 314–325. doi:10.1117/12.211408
-
[26]
X. Chen,Z. Nashed, andL. Qi, Smoothing methods and semismooth methods for nondifferentiable operator equations,SIAM J. Numer. Anal.38no. 4 (2000), 1200–1216. doi:10.1137/S0036142999356719
-
[27]
J. DarbonandM. Sigelle, A Fast and Exact Algorithm for Total Variation Minimization, in Pattern Recognition and Image Analysis(J. S. Marques,N. P ´erez de la Blanca, andP. Pina, eds.), Springer Berlin Heidelberg, Berlin, Heidelberg, 2005, pp. 351–359. doi:10.1007/11492429 43
-
[28]
L. DieningandJ. Storn, Guaranteed upper bounds for iteration errors and modified Kaˇ canov schemes via discrete duality,Comput. Methods Appl. Math.25no. 3 (2025), 587–600. doi:10.1515/cmam-2025-0017
-
[29]
D. C. DobsonandC. R. Vogel, Convergence of an iterative method for total variation denoising, SIAM J. Numer. Anal.34no. 5 (1997), 1779–1791. doi:10.1137/S003614299528701X
-
[30]
I. EkelandandR. T ´emam,Convex analysis and variational problems,Class. Appl. Math.28, SIAM, 1999. doi:10.1137/1.9781611971088
-
[31]
D. GilbargandN. S. Trudinger,Elliptic partial differential equations of second order, reprint of the 1998 ed.,Class. Math., Berlin: Springer, 2001
work page 1998
-
[32]
D. GoldfarbandW. Yin, Second-order cone programming methods for total variation-based image restoration,SIAM J. Sci. Comput.27no. 2 (2005), 622–645 (English). doi:10.1137/040608982
-
[33]
T. GoldsteinandS. Osher, The split Bregman method for ℓ1-regularized problems,SIAM J. Imaging Sci.2no. 2 (2009), 323–343 (English). doi:10.1137/080725891
-
[34]
R. GriesseandD. A. Lorenz, A semismooth Newton method for Tikhonov functionals with sparsity constraints,Inverse Probl.24no. 3 (2008), 19, Id/No 035007. doi:10.1088/0266-5611/24/3/035007
-
[35]
K. Gr¨oger, A W 1,p-estimate for solutions to mixed boundary value problems for second order elliptic differential equations,Math. Ann.283no. 4 (1989), 679–687. doi:10.1007/BF01442860
-
[36]
M. Hinterm¨uller,K. Ito, andK. Kunisch, The primal-dual active set strategy as a semismooth Newton method,SIAM J. Optim.13no. 3 (2003), 865–888. doi:10.1137/S1052623401383558
-
[37]
M. Hinterm ¨ullerandK. Kunisch, Total bounded variation regularization as a bilater- ally constrained optimization problem,SIAM J. Appl. Math.64no. 4 (2004), 1311–1333. doi:10.1137/S0036139903422784
-
[38]
M. Hinterm ¨ullerandG. Stadler, An infeasible primal-dual algorithm for total bounded variation–based Inf-convolution-type image restoration,SIAM J. Sci. Comput.28no. 1 (2006), 1–23. doi:10.1137/040613263
-
[39]
P. J. Huber, Robust estimation of a location parameter,Ann. Math. Stat.35(1964), 73–101. doi:10.1214/aoms/1177703732
-
[40]
J. D. Hunter, Matplotlib: A 2D Graphics Environment,Computing in Science & Engineering9 no. 3 (2007), 90–95. doi:10.1109/MCSE.2007.55
-
[41]
A. KaltenbachandT. Schreiber,Learning Proximity-Certified Warmstarts for TV-minimization, in preparation
- [42]
-
[43]
doi:10.1007/978-3-642-23099-8. [42]A. MarquinaandS. Osher, Explicit algorithms for a new time dependent model based on level set motion for nonlinear deblurring and noise removal,SIAM J. Sci. Comput.22no. 2 (2000), 387–405. doi:10.1137/S1064827599351751
-
[44]
M. K. Ng,L. Qi,Y.-f. Yang, andY.-m. Huang, On semismooth Newton’s methods for total variation minimization,J. Math. Imaging Vis.27no. 3 (2007), 265–276. doi:10.1007/s10851-007- 0650-0
-
[45]
H. Rademacher, ¨Uber partielle und totale Differenzierbarkeit von Funktionen mehrerer Vari- ablen und ¨ uber die Transformation der Doppelintegrale. I, II.,Math. Ann.79(1920), 340–359. doi:10.1007/BF01498415
-
[46]
L. I. Rudin,S. Osher, andE. Fatemi, Nonlinear total variation based noise removal algorithms, Physica D60no. 1-4 (1992), 259–268. doi:10.1016/0167-2789(92)90242-F. S. Bartels and A. Kaltenbach22
-
[47]
M. Souiai,Newton Methods for Total Variation Minimization, Master’s thesis, Rheinische Friedrich- Wilhelms-Universit¨ at Bonn, 2010
work page 2010
-
[48]
X.-C. Tai,R. Winther,X. Zhang, andW. Zheng, A uniform preconditioner for a Newton algorithm for total variation minimization and minimum-surface problems,SIAM J. Numer. Anal. 61no. 5 (2023), 2062–2083. doi:10.1137/22M1512776
-
[49]
Ulbrich, Semismooth Newton methods for operator equations in function spaces,SIAM J
M. Ulbrich, Semismooth Newton methods for operator equations in function spaces,SIAM J. Optim.13no. 3 (2003), 805–841. doi:10.1137/S1052623400371569
-
[50]
C. R. VogelandM. E. Oman, Iterative methods for total variation denoising,SIAM J. Sci. Comput.17no. 1 (1996), 227–238. doi:10.1137/0917016
-
[51]
C. WuandX.-C. Tai, Augmented Lagrangian method, dual methods, and split Bregman iteration for ROF, vectorial TV, and high order models,SIAM J. Imaging Sci.3no. 3 (2010), 300–339. doi:10.1137/090767558
-
[52]
Zeidler,Nonlinear functional analysis and its applications
E. Zeidler,Nonlinear functional analysis and its applications. II/B: Nonlinear monotone operators. Transl. from the German by the author and by Leo F. Boron, New York etc.: Springer-Verlag, 1990
work page 1990
-
[53]
M. ZhuandT. F. Chan,An Efficient Primal–Dual Hybrid Gradient Algorithm for Total Variation Image Restoration, CAM Report 08-34, UCLA, Department of Mathematics, 2008. Appendix In this appendix, we provide the proof of Theorem 3.6(3.26). Proof (of Theorem 3.6(3.26)). ad (3.26a).For every vh ∈V h, using the binomial formula and the discrete convex optimalit...
work page 2008
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.