$\gamma$-weakly $\theta$-up-concavity: A Unified Framework for Non-Convex Optimization Beyond DR-Submodular and OSS Functions

Mohammad Pedramfar; Vaneet Aggarwal

arxiv: 2602.13506 · v2 · submitted 2026-02-13 · 💻 cs.LG · cs.AI· math.OC

γ-weakly θ-up-concavity: A Unified Framework for Non-Convex Optimization Beyond DR-Submodular and OSS Functions

Mohammad Pedramfar , Vaneet Aggarwal This is my paper

Pith reviewed 2026-05-15 21:58 UTC · model grok-4.3

classification 💻 cs.LG cs.AImath.OC

keywords non-convex optimizationDR-submodular maximizationOSS functionsup-concavitylinear surrogatesapproximation algorithmsonline optimizationregret bounds

0 comments

The pith

γ-weakly θ-up-concave functions admit linear surrogates whose gains approximate the original objective with explicit coefficients.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper defines γ-weakly θ-up-concavity as a first-order condition on non-convex functions that strictly generalizes both DR-submodular and OSS functions while also covering scale-dependent curvature behaviors such as accumulating-then-diminishing returns and flat-start regimes. The central result shows that any function meeting this condition is upper-linearizable, so that at every feasible point a linear surrogate exists whose value gains track those of the original function up to a factor that depends on the curvature parameters and the feasible-set geometry. This linearizability reduces a range of non-convex problems to linear optimization oracles, immediately supplying approximation ratios for offline maximization and both static and dynamic regret bounds for online settings. The same machinery recovers the known optimal ratio for DR-submodular maximization and tightens existing ratios for OSS maximization under matroid constraints.

Core claim

γ-weakly θ-up-concave functions are upper-linearizable: for any feasible point, a linear surrogate can be constructed whose gains provably approximate the original objective. A nonuniform upper-linearization argument produces approximation coefficients that depend explicitly on the curvature parameters γ and θ together with the geometry of the feasible region. This property yields unified approximation guarantees for offline optimization as well as static and dynamic regret bounds in online settings via standard reductions to linear optimization.

What carries the argument

The γ-weakly θ-up-concavity first-order condition, which characterizes scale-dependent curvature and supports construction of a linear upper surrogate at each point.

If this is right

Standard linear-optimization algorithms immediately give approximation guarantees for offline maximization of any γ-weakly θ-up-concave objective.
Static regret bounds follow for online maximization of such functions by the same reduction.
Dynamic regret bounds likewise follow for online maximization.
The optimal known approximation ratio is recovered for DR-submodular maximization.
Tighter approximation ratios are obtained for OSS maximization, especially over matroid constraints.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same first-order condition can be verified on loss surfaces arising in deep learning to decide whether the guarantees apply directly.
Exploiting the specific geometry of a constraint set may produce tighter coefficients than the general bound.
Algorithms could adaptively estimate γ and θ from samples to improve practical performance beyond worst-case ratios.
Similar first-order curvature conditions might be derived for other families of non-convex problems to obtain analogous linearization results.

Load-bearing premise

The objective function satisfies the γ-weakly θ-up-concavity first-order inequality for some fixed parameters γ and θ.

What would settle it

A concrete function that obeys the γ-weakly θ-up-concavity condition yet for which the best linear surrogate at some feasible point achieves a strictly worse gain-approximation ratio than the coefficient derived from γ, θ, and the geometry.

read the original abstract

Optimizing non-convex functions is a fundamental challenge across machine learning and combinatorial optimization. We introduce and study $\gamma$-weakly $\theta$-up-concavity, a novel first-order condition that characterizes a broad class of such functions. This condition provides a powerful unifying framework, strictly generalizing both DR-submodular and One-Sided Smooth (OSS) functions while capturing broader forms of scale-dependent curvature, including accumulating-then-diminishing returns and flat-start behavior. Our central theoretical contribution demonstrates that $\gamma$-weakly $\theta$-up-concave functions are upper-linearizable: for any feasible point, we can construct a linear surrogate whose gains provably approximate the original non-linear objective. A key technical contribution is a nonuniform upper-linearization argument yielding approximation coefficients that depend explicitly on the curvature parameters and the geometry of the feasible region. This linearizability yields immediate and unified approximation guarantees for a wide range of problems. Specifically, we obtain unified approximation guarantees for offline optimization as well as static and dynamic regret bounds in online settings via standard reductions to linear optimization. Moreover, our framework recovers the optimal approximation coefficient for DR-submodular maximization and improves existing approximation coefficients for OSS optimization, particularly over matroid constraints.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The new γ-weakly θ-up-concavity condition unifies DR-submodular and OSS via nonuniform upper-linearization, but the error control on discrete or scale-varying domains still needs checking.

read the letter

This paper introduces γ-weakly θ-up-concavity as a first-order condition that strictly generalizes both DR-submodular and OSS functions. It claims these functions admit a nonuniform upper-linearization that produces explicit approximation coefficients depending on the curvature parameters and the geometry of the feasible set, which then yields unified offline approximation guarantees plus static and dynamic regret bounds through standard reductions to linear optimization. The abstract also states that the framework recovers the optimal DR-submodular ratio and improves the OSS coefficient over matroids.

Referee Report

2 major / 2 minor

Summary. The paper introduces γ-weakly θ-up-concavity as a first-order condition that strictly generalizes DR-submodular and OSS functions, capturing scale-dependent curvature such as accumulating-then-diminishing returns. It claims that such functions are upper-linearizable: for any feasible point a linear surrogate can be constructed whose gains approximate the original objective, with explicit coefficients depending on γ, θ and the geometry of the feasible region. This yields unified approximation guarantees for offline maximization and static/dynamic regret bounds for online settings via reductions to linear optimization, recovering the optimal DR-submodular coefficient and improving OSS coefficients over matroids.

Significance. If the upper-linearization holds with the stated explicit coefficients, the framework unifies several non-convex classes under a single first-order condition and supplies improved or optimal approximation guarantees across offline and online problems. The nonuniform argument and recovery of known optimal DR-submodular results are notable strengths; the explicit dependence on curvature parameters and geometry avoids parameter-free claims while still delivering concrete bounds.

major comments (2)

[Upper-linearization argument] The central upper-linearization claim (abstract and §3–4) asserts that the γ-weakly θ-up-concavity first-order condition alone produces a linear surrogate with approximation error controlled solely by γ, θ and geometry. No explicit derivation, error bound, or handling of residual higher-order terms is visible for domains with non-constant curvature or discrete structure (e.g., matroids); the nonuniform argument must be expanded to show uniform control when curvature accumulates then diminishes.
[Comparison with DR-submodular and OSS] Table 1 and the OSS improvement claim: the recovered DR-submodular coefficient is stated to be optimal, yet the dependence of the new OSS coefficient on θ must be shown to be strictly better than prior work without post-hoc fitting of θ to the instance; otherwise the guarantee reduces to a reparameterization of the input definition.

minor comments (2)

[Introduction and definitions] The abstract refers to 'flat-start behavior' and 'accumulating-then-diminishing returns' without a formal definition or concrete example function; add these in §2 to make the generalization claim verifiable.
[Notation] Notation for the linear surrogate (e.g., the gain function) should be introduced once and used consistently; several symbols appear only in the abstract and are not defined until later sections.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. We address each major comment below and indicate the revisions that will be incorporated.

read point-by-point responses

Referee: [Upper-linearization argument] The central upper-linearization claim (abstract and §3–4) asserts that the γ-weakly θ-up-concavity first-order condition alone produces a linear surrogate with approximation error controlled solely by γ, θ and geometry. No explicit derivation, error bound, or handling of residual higher-order terms is visible for domains with non-constant curvature or discrete structure (e.g., matroids); the nonuniform argument must be expanded to show uniform control when curvature accumulates then diminishes.

Authors: We appreciate the referee's suggestion to strengthen the presentation of the upper-linearization argument. The derivation begins from the first-order condition in Section 3 and constructs the linear surrogate with error bounds that depend explicitly on γ, θ, and the geometry of the feasible region via the nonuniform argument. To improve clarity for cases of non-constant curvature and discrete structures such as matroids, we will expand the proof with additional intermediate steps that explicitly bound residual higher-order terms and demonstrate uniform control. revision: yes
Referee: [Comparison with DR-submodular and OSS] Table 1 and the OSS improvement claim: the recovered DR-submodular coefficient is stated to be optimal, yet the dependence of the new OSS coefficient on θ must be shown to be strictly better than prior work without post-hoc fitting of θ to the instance; otherwise the guarantee reduces to a reparameterization of the input definition.

Authors: The DR-submodular coefficient is recovered exactly by substituting γ = 1 and θ = 1, matching the known optimal bound with no fitting involved. For OSS functions, θ is an intrinsic parameter of the function class arising from the curvature condition, not chosen post-hoc for any instance. The resulting coefficient is strictly tighter than prior OSS bounds whenever θ < 1. We will add a clarifying paragraph and direct comparison to prior OSS results in the revision. revision: partial

Circularity Check

0 steps flagged

No circularity: derivation proceeds directly from the introduced first-order condition

full rationale

The paper introduces γ-weakly θ-up-concavity explicitly as a first-order condition on the objective and then derives the upper-linearizability property, nonuniform approximation coefficients, and resulting offline/online guarantees as consequences of that condition. The dependence of coefficients on γ and θ is a parameterization of the function class rather than a fitted input renamed as a prediction. No self-citations are invoked as load-bearing premises, no uniqueness theorems are imported from prior author work, and no ansatz is smuggled via citation. The recovery of DR-submodular coefficients is presented as a special case of the new framework, not as a re-derivation of the input. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The entire framework rests on the newly defined γ-weakly θ-up-concavity property; all linearization and approximation results are derived from this single domain assumption.

free parameters (2)

γ
Curvature parameter that controls the strength of the weak up-concavity; appears in the definition and in the approximation coefficients.
θ
Curvature parameter that controls the up-concavity behavior; appears in the definition and in the approximation coefficients.

axioms (1)

domain assumption The objective satisfies the γ-weakly θ-up-concavity first-order condition for given γ and θ.
This is the central premise invoked to establish upper-linearizability and all subsequent guarantees.

pith-pipeline@v0.9.0 · 5535 in / 1413 out tokens · 27529 ms · 2026-05-15T21:58:03.368182+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel; dAlembert_to_ODE_general echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

γ-weakly θ-up-concave if γ θ(x)/θ(y) ⟨∇f(y),y-x⟩ ≤ f(y)-f(x) ≤ θ(y)/(γ θ(x)) ⟨∇f(x),y-x⟩ (Eq. 2); g(f,x) = ∫_0^1 e^{l(r,x)} ∇f(rx) dr with α_x = 1-exp(-γ/R_θ ∫_0^1 θ(sx) ds) (Theorem 4)
IndisputableMonolith/Foundation/ArithmeticFromLogic.lean embed_injective; embed_strictMono_of_one_lt echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

Restrict to maximal convex subset K* = conv(K_m) away from origin to bound α away from 0 (Corollary 6, Remark 10)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.