Towards Robust, Locally Linear Deep Networks

David Alvarez-Melis; Guang-He Lee; Tommi S. Jaakkola

arxiv: 1907.03207 · v1 · pith:I3S6C7DPnew · submitted 2019-07-07 · 💻 cs.LG · stat.ML

Towards Robust, Locally Linear Deep Networks

Guang-He Lee , David Alvarez-Melis , Tommi S. Jaakkola This is my paper

Pith reviewed 2026-05-25 01:16 UTC · model grok-4.3

classification 💻 cs.LG stat.ML

keywords locally linear deep networksstable derivativespiecewise linear activationsmodel explanationssensitivity analysisresidual networksrecurrent networksregion expansion

0 comments

The pith

A training procedure makes derivatives of piecewise-linear deep networks stable over larger regions around given points.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Deep networks realize complex mappings often analyzed via their local linear behavior, yet derivatives with respect to inputs remain inherently unstable. The paper introduces a new learning problem to encourage stable derivatives over expanded regions, restricted to networks with piecewise linear activations. The approach alternates an inference step that locates a region of provable stability around a point with an optimization step that enlarges such regions. A novel relaxation enables scaling to realistic models, and the method is illustrated on residual networks for images and recurrent networks for sequences.

Core claim

We propose a new learning problem to encourage deep networks to have stable derivatives over larger regions. Our algorithm consists of an inference step that identifies a region around a point where linear approximation is provably stable, and an optimization step to expand such regions. We propose a novel relaxation to scale the algorithm to realistic models. We illustrate our method with residual and recurrent networks on image and sequence datasets.

What carries the argument

The two-step algorithm of inference to locate provably stable linear-approximation regions followed by optimization to expand those regions, using a novel relaxation for scalability.

If this is right

Derivatives become more reliable for sensitivity analysis and coordinate relevance in predictions.
The method applies directly to residual networks on image data and recurrent networks on sequence data.
Stability holds with provable guarantees inside the identified regions after optimization.
The relaxation enables the procedure to run on models of realistic size.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same stability objective might reduce sensitivity to small adversarial perturbations inside the expanded regions.
Similar region-expansion ideas could be tested on networks with non-piecewise-linear activations.
The inference step might be adapted for on-the-fly region adjustment at test time rather than only during training.

Load-bearing premise

The inference step can reliably identify regions where the linear approximation is provably stable, and the proposed relaxation scales the optimization without losing the stability guarantee.

What would settle it

After applying the procedure, numerical checks on a trained network show that derivatives vary substantially inside the claimed stable regions or that the regions fail to grow as intended.

read the original abstract

Deep networks realize complex mappings that are often understood by their locally linear behavior at or around points of interest. For example, we use the derivative of the mapping with respect to its inputs for sensitivity analysis, or to explain (obtain coordinate relevance for) a prediction. One key challenge is that such derivatives are themselves inherently unstable. In this paper, we propose a new learning problem to encourage deep networks to have stable derivatives over larger regions. While the problem is challenging in general, we focus on networks with piecewise linear activation functions. Our algorithm consists of an inference step that identifies a region around a point where linear approximation is provably stable, and an optimization step to expand such regions. We propose a novel relaxation to scale the algorithm to realistic models. We illustrate our method with residual and recurrent networks on image and sequence datasets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper sets up a training procedure to enlarge provably stable linear regions around points in piecewise-linear networks.

read the letter

The core contribution is a two-step procedure that first locates a region where the local linear approximation is provably stable, then optimizes the network parameters to grow that region. They add a relaxation to make the optimization tractable for real-sized models and test the idea on residual and recurrent networks for images and sequences. This directly targets the instability of gradients that affects sensitivity analysis and feature attribution, and the restriction to piecewise-linear activations lets them keep the stability claim concrete rather than hand-wavy. That scoping is a strength; it avoids overclaiming while still addressing a practical pain point in interpretability work. The formulation of the learning problem itself also looks new at the level of the abstract. The main uncertainty is whether the relaxation preserves enough of the original guarantee once the models get large, and whether the inference step stays cheap enough to be useful in practice. Without the full derivations or the reported numbers on region size versus accuracy trade-off, it is hard to judge how much the method actually moves the needle on real data. The paper is aimed at people who build or use local explanation methods and want some formal control over the linear regime. It is a focused, incremental piece rather than a broad new framework, but the problem statement and algorithm are clear enough that a referee can check the proofs and the experiments. I would send it out for review.

Referee Report

2 major / 2 minor

Summary. The paper proposes a new learning problem to train deep networks (focusing on piecewise-linear activations) such that their local linear approximations remain stable over larger regions. The algorithm alternates an inference step that identifies provably stable regions around a point with an optimization step that expands those regions; a novel relaxation is introduced to make the procedure scale to realistic residual and recurrent networks, with illustrations on image and sequence data.

Significance. If the stability guarantees and the scaling properties of the relaxation hold, the approach would provide a principled route to more reliable sensitivity analysis and explanations in deep networks. The explicit focus on provable stability for piecewise-linear networks and the reproducible experimental illustrations on standard architectures are strengths.

major comments (2)

[§3] §3 (inference step): the claim that the identified region yields a 'provably stable' linear approximation relies on the network being exactly piecewise linear; the manuscript should state the precise conditions under which this holds when the network contains residual connections or recurrent unrollings that may introduce additional linear pieces.
[§4] §4 (relaxation): the novel relaxation is presented as sufficient to scale the optimization while preserving the stability guarantee, yet no explicit bound is given on how much the relaxation can enlarge the feasible set; without such a bound the 'provably stable' property after optimization is not guaranteed to carry over.

minor comments (2)

[Abstract] The abstract states that the method is illustrated on 'image and sequence datasets' but does not name the datasets or report any quantitative stability metric; adding these details would improve clarity.
Notation for the stability region R(x) and the linear map L(x) is introduced without a summary table; a small table collecting the symbols and their meanings would aid readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the positive evaluation and the recommendation of minor revision. We address each major comment below.

read point-by-point responses

Referee: [§3] §3 (inference step): the claim that the identified region yields a 'provably stable' linear approximation relies on the network being exactly piecewise linear; the manuscript should state the precise conditions under which this holds when the network contains residual connections or recurrent unrollings that may introduce additional linear pieces.

Authors: We agree that the manuscript should make the conditions explicit. The provable stability holds for any network composed exclusively of affine transformations and piecewise-linear activations. Residual connections and unrolled recurrent networks satisfy this when the activations are piecewise linear (e.g., ReLU), because both operations remain within the class of piecewise-linear functions. We will revise §3 to state these conditions precisely. revision: yes
Referee: [§4] §4 (relaxation): the novel relaxation is presented as sufficient to scale the optimization while preserving the stability guarantee, yet no explicit bound is given on how much the relaxation can enlarge the feasible set; without such a bound the 'provably stable' property after optimization is not guaranteed to carry over.

Authors: The relaxation is constructed as a sound outer approximation: any point feasible for the relaxed problem remains feasible for the original stability constraints, so the guarantee is preserved by design. We nevertheless accept that an explicit characterization of the gap between the two feasible sets would strengthen the presentation. We will add a paragraph in §4 discussing the relationship between the relaxed and original problems and, to the extent possible, bounding the enlargement. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper introduces an algorithmic procedure consisting of an inference step to identify provably stable regions for piecewise-linear networks and an optimization step (with a novel relaxation) to expand them. No derivation chain, fitted parameter renamed as prediction, self-definitional relation, or load-bearing self-citation is present in the abstract or described method. The central claim is an independent optimization formulation scoped to piecewise-linear activations, with no reduction of outputs to inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Review performed on abstract only; no free parameters, axioms, or invented entities are stated in the given text.

axioms (1)

domain assumption Piecewise-linear networks admit identifiable regions of provably stable linear approximation.
Method is restricted to such networks as stated in abstract.

pith-pipeline@v0.9.0 · 5667 in / 1047 out tokens · 27148 ms · 2026-05-25T01:16:18.077605+00:00 · methodology

Towards Robust, Locally Linear Deep Networks

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)