Bounds on Deep Neural Network Partial Derivatives with Respect to Parameters

Brandon C. Fallin; Cristian F. Nino; Omkar Sudhir Patil; Rebecca G. Hart; Warren E. Dixon

arxiv: 2503.21007 · v2 · pith:IUQN7TXTnew · submitted 2025-03-26 · 📡 eess.SY · cs.SY

Bounds on Deep Neural Network Partial Derivatives with Respect to Parameters

Omkar Sudhir Patil , Brandon C. Fallin , Cristian F. Nino , Rebecca G. Hart , Warren E. Dixon This is my paper

Pith reviewed 2026-05-22 21:41 UTC · model grok-4.3

classification 📡 eess.SY cs.SY

keywords deep neural networkspartial derivativesLyapunov stabilityboundscontrol systemsactivation functionsTaylor approximation

0 comments

The pith

Polynomial bounds on first and second partial derivatives of fully-connected DNNs with respect to parameters are derived in closed form.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes explicit polynomial bounds on the first and second partial derivatives of the outputs of fully-connected deep neural networks with respect to the network parameters. These bounds are obtained through lemmas that apply to common activation functions including sigmoidal and ReLU-like types. The closed-form expressions are developed to support Lyapunov-based stability analysis for neural network controllers and identifiers. The work also supplies bounds on the remainder terms of first-order Taylor expansions of the network mapping.

Core claim

For fully-connected DNNs the first and second partial derivatives of the network output with respect to its parameters admit polynomial upper bounds that can be written explicitly in terms of the weights, the activation function properties, and the network depth. The same layer-wise bounding technique produces explicit controls on the higher-order terms that appear when the network is replaced by its first-order Taylor approximation around a given parameter value.

What carries the argument

Closed-form polynomial bounds obtained by inductive layer-wise analysis of the partial derivatives of the DNN output with respect to parameters.

If this is right

Lyapunov-based stability certificates for DNN controllers can be written with explicit, computable expressions rather than assumed bounds.
Gradient-based training algorithms obtain rigorous remainder controls for first-order Taylor approximations of the network.
Safety-critical control applications gain concrete derivative limits that replace informal bounding arguments.
The same layer-wise technique yields bounds on both first- and second-order parameter sensitivities.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same inductive bounding argument may be adaptable to other feed-forward architectures if their layer recursions can be written in comparable form.
The explicit polynomial degree and coefficients could be used to derive quantitative robustness margins for parameter perturbations in learned controllers.
Numerical checks of bound tightness on trained networks would indicate how much conservatism is introduced by the polynomial expressions.

Load-bearing premise

The bounding lemmas apply only to fully-connected networks whose activation functions belong to the sigmoidal or ReLU-like classes.

What would settle it

A direct numerical evaluation on a small fully-connected network with a sigmoidal activation that finds any second partial derivative larger than the stated polynomial expression at some finite parameter vector would falsify the bound.

read the original abstract

Deep neural networks (DNNs) have emerged as a powerful tool with a growing body of literature exploring Lyapunov-based approaches for real-time system identification and control. These methods depend on establishing bounds for the second partial derivatives of DNNs with respect to their parameters, a requirement often assumed but rarely addressed explicitly. This paper provides rigorous mathematical formulations of polynomial bounds on both the first and second partial derivatives of DNNs with respect to their parameters. We present lemmas that characterize these bounds for fully-connected DNNs, while accommodating various classes of activation function including sigmoidal and ReLU-like functions. Our analysis yields closed-form expressions that enable precise stability guarantees for Lyapunov-based deep neural networks (Lb-DNNs). Furthermore, we extend our results to bound the higher-order terms in first-order Taylor approximations of DNNs, providing important tools for convergence analysis in gradient-based learning algorithms. The developed theoretical framework develops explicit, computable expressions, for previously assumed bounds, thereby strengthening the mathematical foundation of neural network applications in safety-critical control systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Only the abstract is available, so the claimed closed-form bounds on DNN partial derivatives cannot be checked or evaluated.

read the letter

The key point is that this paper claims to supply explicit polynomial bounds on the first and second partial derivatives of fully-connected DNNs with respect to parameters, plus higher-order terms in Taylor expansions, for use in Lyapunov stability proofs. It targets sigmoidal and ReLU-like activations. But the full text is missing, so none of the lemmas or expressions can be inspected for correctness, tightness, or scaling behavior with depth and width. Without those details the central contribution stays unverified. The work does address a real gap. Lyapunov-based neural control papers have long needed these bounds but often treated them as assumptions rather than derived results, so explicit closed forms could in principle tighten guarantees and help convergence analysis in gradient methods. That scope makes sense for the intended audience in safety-critical control. The obvious limitation is the lack of any derivation, example bound, or discussion of practical size. The abstract states the results hold for the listed activations and fully-connected layers, yet gives no indication of how loose the bounds become for realistic networks or whether they remain computable. This leaves open whether the expressions are actually usable or if hidden restrictions make them narrow. The paper is aimed at researchers who build stability arguments around neural controllers and need concrete bounds rather than hand-wavy ones. A reader in that niche might gain something if the math checks out, but right now there is no way to know. I would not bring this to a reading group or cite it. It does not look ready for peer review because the main claims cannot be assessed from the available material.

Referee Report

1 major / 0 minor

Summary. The paper claims to provide rigorous mathematical formulations of polynomial bounds on both the first and second partial derivatives of DNNs with respect to their parameters. It presents lemmas characterizing these bounds for fully-connected DNNs accommodating sigmoidal and ReLU-like activation functions, yielding closed-form expressions that enable precise stability guarantees for Lyapunov-based deep neural networks (Lb-DNNs). The work further extends the results to bound higher-order terms in first-order Taylor approximations of DNNs for convergence analysis in gradient-based learning algorithms.

Significance. If the claimed lemmas and closed-form expressions were verified to hold with the stated generality, the results would strengthen the foundations of Lyapunov-based DNN control by replacing implicit assumptions with explicit, computable bounds, which is relevant for safety-critical applications. However, with only the abstract available and no derivations, lemmas, or expressions provided, the significance cannot be assessed.

major comments (1)

Abstract: The manuscript asserts the existence of 'rigorous mathematical formulations,' 'lemmas,' and 'closed-form expressions' for the bounds, yet consists solely of the abstract with no derivations, no explicit polynomial expressions, no statements of the lemmas, and no details on network depth/width restrictions or activation function classes. This renders the central claims unverifiable and load-bearing for any evaluation of the work.

Simulated Author's Rebuttal

1 responses · 1 unresolved

We thank the referee for their assessment. We acknowledge that the provided manuscript consists solely of the abstract and does not contain the derivations, lemmas, or closed-form expressions referenced in the claims.

read point-by-point responses

Referee: Abstract: The manuscript asserts the existence of 'rigorous mathematical formulations,' 'lemmas,' and 'closed-form expressions' for the bounds, yet consists solely of the abstract with no derivations, no explicit polynomial expressions, no statements of the lemmas, and no details on network depth/width restrictions or activation function classes. This renders the central claims unverifiable and load-bearing for any evaluation of the work.

Authors: We agree that the referee's observation is accurate. The text supplied for review contains only the abstract, which summarizes the intended contributions but provides none of the supporting mathematical content. As a result, the lemmas characterizing the polynomial bounds on first- and second-order partial derivatives (for fully-connected networks with sigmoidal and ReLU-like activations) and the associated closed-form expressions cannot be verified from the available material. We will revise the submission to include the complete manuscript with all derivations, lemma statements, network architecture restrictions, and explicit expressions. revision: yes

standing simulated objections not resolved

The explicit statements of the lemmas, the polynomial bound expressions, and any proofs or derivations, none of which appear in the provided manuscript text.

Circularity Check

0 steps flagged

No circularity detectable; full derivations unavailable

full rationale

Only the abstract is supplied, which asserts the existence of lemmas and closed-form polynomial bounds on first- and second-order partial derivatives of fully-connected DNNs (sigmoidal/ReLU activations) without exhibiting any equations, derivation steps, or citations. No load-bearing claims, self-definitions, fitted inputs, or self-citation chains are present to inspect. The claimed results are therefore treated as self-contained mathematical contributions with no evidence of circular reduction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Insufficient information from abstract only to identify specific free parameters, axioms or invented entities; no details on derivations provided.

pith-pipeline@v0.9.0 · 5695 in / 1064 out tokens · 51355 ms · 2026-05-22T21:41:33.070240+00:00 · methodology

Bounds on Deep Neural Network Partial Derivatives with Respect to Parameters

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)