pith. sign in

arxiv: 2606.26446 · v1 · pith:DFI3S7KWnew · submitted 2026-06-24 · 🧮 math.OC · cs.SY· eess.SY

Input Convex Neural Network as a Surrogate in Stability-Constrained Optimization for IBR-dominated Power Systems

Pith reviewed 2026-06-26 00:57 UTC · model grok-4.3

classification 🧮 math.OC cs.SYeess.SY
keywords input convex neural networksstability-constrained optimizationpower systemsepigraph reformulationunit commitmentmixed-integer programming
0
0 comments X

The pith

ICNN stability surrogates in power-system optimization lose their convexity benefits under generic Big-M encodings or reversed inequalities.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper identifies two formulation pitfalls that can erase the convexity advantages of input convex neural networks when they surrogate stability indices inside power-system optimization. Generic Big-M mixed-integer reformulations introduce auxiliary binary variables that are unnecessary for enforcing ICNN sublevel constraints. Reversing the stability inequality converts a convex sublevel set into a generally nonconvex superlevel set, which invalidates the global-convergence guarantees of cut-based methods. The authors supply an exact LP epigraph reformulation for ReLU-ICNNs, an outer-approximation scheme that retains global guarantees under the sublevel convention, and a feasibility-preserving inner-approximation scheme for the superlevel convention, tested on IEEE 14- and 118-bus unit commitment instances.

Core claim

Input convex neural networks can serve as surrogates for stability indices in optimization only when the constraint is kept in sublevel-set form; generic Big-M encodings add superfluous binaries while sign reversal produces nonconvex superlevel sets that break cut-based global convergence. Exact LP-based epigraph reformulations and tailored outer- and inner-approximation schemes restore the lost properties.

What carries the argument

The epigraph reformulation of ReLU-ICNN sublevel constraints into linear inequalities, together with outer-approximation cuts (sublevel) and inner-approximation cuts (superlevel).

If this is right

  • Unit commitment problems with stability constraints become solvable by linear programming when the exact epigraph reformulation is used.
  • Cut-based methods recover global convergence guarantees only under the sublevel convention.
  • The inner-approximation scheme maintains feasibility even when engineering requirements force the superlevel convention.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same LP epigraph approach could be tested on other convex neural surrogates beyond ReLU-ICNNs.
  • Numerical experiments on systems larger than 118 buses would show whether the approximation gaps stay small under realistic operating conditions.
  • Alternative convex surrogate architectures might sidestep the superlevel nonconvexity issue without needing inner approximations.

Load-bearing premise

The stability index itself must be a convex function of the decision variables under the chosen input encoding.

What would settle it

A concrete instance in which a cut-based solver applied to a superlevel ICNN constraint either fails to converge to the known global optimum or declares infeasibility while a feasible point exists.

Figures

Figures reproduced from arXiv: 2606.26446 by Fei Teng, Hongyang Jia, Ning Zhang, Wangkun Xu, Yi Wang.

Figure 1
Figure 1. Figure 1: Illustration of ICNN constraint sets {z : ψ(z; θ) ≤ ϕc} and {z : ψ(z; θ) ≥ ϕc} as well as an example of cutting plane. For multiple ICNN constraints, one may add all violated cuts at each iteration, or only the cut associated with the most violated constraint. In addition, compared with the direct reformulation in Proposition 3, which introduces auxiliary variables and constraints proportional to the ICNN … view at source ↗
Figure 2
Figure 2. Figure 2: Illustration on the OA cuts for single time step case. [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Illustration on the IA cuts for single time step case. [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗
read the original abstract

Input convex neural networks (ICNNs) are increasingly used as surrogates for stability indices and embedded as constraints in power-system optimization. This letter clarifies two recurring formulation limitations that can negate ICNN convexity benefits: (i) applying generic Big-$M$ mixed-integer reformulations introduces auxiliary binaries that are unnecessary for enforcing ICNN sublevel constraints; and (ii) reversing the stability inequality transforms a convex sublevel set into a generally nonconvex superlevel set, invalidating global-convergence guarantees of cut-based methods. After clarifying the limitations, we provide (i) an exact LP-based epigraph reformulation for ReLU-ICNNs, (ii) an outer-approximation scheme with global guarantees under the sublevel convention, and (iii) a feasibility-preserving inner-approximation scheme for the superlevel convention, with simulations on IEEE 14- and 118-bus unit commitment instances.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper identifies two common pitfalls when embedding ICNN surrogates for stability indices in power-system optimization: generic Big-M mixed-integer reformulations that introduce unnecessary auxiliary binaries for sublevel constraints, and reversal of the stability inequality that converts a convex sublevel set into a generally nonconvex superlevel set. It supplies an exact LP epigraph reformulation for ReLU-ICNNs, an outer-approximation algorithm with global guarantees under the sublevel convention, a feasibility-preserving inner-approximation for the superlevel case, and numerical results on IEEE 14- and 118-bus unit commitment instances.

Significance. If the reformulations are exact and the global guarantees hold, the work is significant because it directly addresses formulation choices that can erase the convexity benefits of ICNN surrogates in stability-constrained optimization for IBR-dominated grids. Credit is due for the explicit LP epigraph reformulation, the approximation schemes with stated convergence properties, and the simulations on standard test systems.

major comments (2)
  1. [Abstract] Abstract and reformulation sections: the claim of an 'exact LP-based epigraph reformulation for ReLU-ICNNs' is central to the contribution, yet the manuscript supplies no derivation, proof sketch, or verification that the epigraph remains exact for arbitrary ReLU-ICNN architectures without hidden Big-M terms.
  2. [Main text] Outer-approximation scheme (global-guarantees paragraph): the global-convergence claim under the sublevel convention requires the underlying stability index (e.g., small-signal margin or critical eigenvalue) to be convex in the decision variables under the chosen input encoding; no argument, reference, or empirical check is provided for this property, which is load-bearing for both the convexity preservation and the outer-approximation guarantees.
minor comments (1)
  1. [Numerical results] Simulations section: the description of ICNN training data generation, architecture depth/width, and the precise mapping from decision variables to ICNN inputs is too brief for full reproducibility of the IEEE 14- and 118-bus results.

Simulated Author's Rebuttal

2 responses · 0 unresolved

Thank you for the constructive review. We address the two major comments point-by-point below, indicating planned revisions where appropriate.

read point-by-point responses
  1. Referee: [Abstract] Abstract and reformulation sections: the claim of an 'exact LP-based epigraph reformulation for ReLU-ICNNs' is central to the contribution, yet the manuscript supplies no derivation, proof sketch, or verification that the epigraph remains exact for arbitrary ReLU-ICNN architectures without hidden Big-M terms.

    Authors: We agree that an explicit derivation would strengthen the central claim. The LP epigraph is constructed recursively: each ReLU layer output z_l = max(0, W_l z_{l-1} + b_l) is replaced by the pair of linear inequalities z_l >= 0, z_l >= W_l z_{l-1} + b_l, which together define the epigraph of the layer without auxiliary binaries. Because each step preserves convexity and the overall ICNN is a composition of such layers, the final epigraph remains an exact LP representation for any finite depth and width. In the revised manuscript we will insert a short derivation subsection (with a two-layer verification example) immediately after the reformulation statement. revision: yes

  2. Referee: [Main text] Outer-approximation scheme (global-guarantees paragraph): the global-convergence claim under the sublevel convention requires the underlying stability index (e.g., small-signal margin or critical eigenvalue) to be convex in the decision variables under the chosen input encoding; no argument, reference, or empirical check is provided for this property, which is load-bearing for both the convexity preservation and the outer-approximation guarantees.

    Authors: The global-convergence guarantees of the outer-approximation scheme are stated for the optimization problem that employs the ICNN surrogate under the sublevel convention; they follow from standard cutting-plane theory applied to a convex feasible set defined by the surrogate and do not require the true stability index to be convex. The ICNN itself supplies the convexity by construction, so the algorithm converges to the global optimum of the surrogate-constrained problem regardless of the convexity properties of the underlying physical index. We will revise the global-guarantees paragraph to make this distinction explicit and to note that any mismatch between the true index and its convex surrogate is a modeling choice outside the scope of the algorithmic guarantees. revision: partial

Circularity Check

0 steps flagged

No significant circularity; derivations rely on standard ICNN and convex-optimization properties

full rationale

The paper identifies two formulation issues with ICNN surrogates in stability-constrained optimization and supplies LP epigraph reformulations plus outer/inner approximation schemes. These rest on known ReLU convexity and mixed-integer epigraph properties rather than any self-referential fit, self-citation chain, or renaming. No equation or claim reduces by construction to a fitted parameter or prior author result; the sublevel-convention guarantees follow directly from the cited convex-analysis facts. The convexity assumption on the underlying stability index is an external modeling choice, not a circular derivation step.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The work relies on standard properties of input-convex ReLU networks and epigraph reformulations from convex optimization; no new free parameters, axioms, or invented entities are introduced beyond those already present in the cited ICNN and power-system literature.

axioms (2)
  • standard math ReLU-ICNNs define convex functions when the weight matrices satisfy the input-convexity conditions stated in the referenced ICNN literature.
    Invoked when claiming that the sublevel set remains convex.
  • domain assumption The stability index can be represented as the sublevel set of the ICNN surrogate.
    Central modeling choice that enables the convexity benefit.

pith-pipeline@v0.9.1-grok · 5702 in / 1593 out tokens · 26829 ms · 2026-06-26T00:57:38.514859+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

16 extracted references

  1. [1]

    Data-driven security and stability rule in high renewable penetrated power system operation,

    N. Zhang, H. Jia, Q. Hou, Z. Zhang, T. Xia, X. Cai, and J. Wang, “Data-driven security and stability rule in high renewable penetrated power system operation,”Proceedings of the IEEE, vol. 111, no. 7, pp. 788–805, 2022

  2. [2]

    Learning-augmented power system operations: A unified optimization view,

    W. Xu, Z. Chu, and F. Teng, “Learning-augmented power system operations: A unified optimization view,” 2026. [Online]. Available: https://arxiv.org/abs/2505.05203

  3. [3]

    Input convex neural networks,

    B. Amos, L. Xu, and J. Z. Kolter, “Input convex neural networks,” in International conference on machine learning. PMLR, 2017, pp. 146– 155

  4. [4]

    Regional frequency-constrained planning for the optimal sizing of power systems via enhanced input convex neural networks,

    Y . Wang and G. Strbac, “Regional frequency-constrained planning for the optimal sizing of power systems via enhanced input convex neural networks,”IEEE Transactions on Sustainable Energy, 2025

  5. [5]

    Transient stability-constrained unit commitment using input convex neural network,

    T. Wu and J. Wang, “Transient stability-constrained unit commitment using input convex neural network,”IEEE Transactions on Neural Networks and Learning Systems, 2023

  6. [6]

    Small signal stability analysis of multi-infeed power electronic systems based on grid strength assessment,

    W. Dong, H. Xin, D. Wu, and L. Huang, “Small signal stability analysis of multi-infeed power electronic systems based on grid strength assessment,”IEEE Transactions on Power Systems, vol. 34, no. 2, pp. 1393–1403, 2019

  7. [7]

    Generalized operational short-circuit ratio for grid strength assessment in power systems with high renewable penetration,

    C. Liu, H. Xin, D. Wu, H. Gao, H. Yuan, and Y . Zhou, “Generalized operational short-circuit ratio for grid strength assessment in power systems with high renewable penetration,”IEEE Transactions on Power Systems, 2023

  8. [8]

    On the incorporation of stability constraints into sequential operational scheduling,

    W. Xu, Z. Chu, F. Capitanescu, and F. Teng, “On the incorporation of stability constraints into sequential operational scheduling,” in2025 IEEE Power & Energy Society General Meeting (PESGM), 2025, pp. 1–5

  9. [9]

    Boyd and L

    S. Boyd and L. Vandenberghe,Convex optimization. Cambridge university press, 2004

  10. [10]

    E. N. Pistikopoulos, M. C. Georgiadis, and V . Dua,Multi-parametric programming: theory, algorithms and applications. Wiley, 2007

  11. [11]

    A. J. Conejo and L. Baringo,Power system operations. Springer, 2018, vol. 11

  12. [12]

    Availability adversarial attack and countermeasures for deep learning-based load forecasting,

    W. Xu and F. Teng, “Availability adversarial attack and countermeasures for deep learning-based load forecasting,” in2023 IEEE Belgrade PowerTech, 2023, pp. 01–06

  13. [13]

    On the effectiveness of interval bound propagation for training verifiably robust models,

    S. Gowal, K. Dvijotham, R. Stanforth, R. Bunel, C. Qin, J. Uesato, R. Arandjelovic, T. Mann, and P. Kohli, “On the effectiveness of interval bound propagation for training verifiably robust models,”arXiv preprint arXiv:1810.12715, 2018

  14. [14]

    A. J. Conejo, E. Castillo, R. Minguez, and R. Garcia-Bertrand,De- composition techniques in mathematical programming: engineering and science applications. Springer, 2006

  15. [15]

    Pytorch: An imperative style, high-performance deep learning library,

    A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antigaet al., “Pytorch: An imperative style, high-performance deep learning library,”Advances in neural information processing systems, vol. 32, 2019

  16. [16]

    A synthetic texas power system with time-series weather-dependent spatiotemporal profiles,

    J. Lu, X. Li, H. Li, T. Chegini, C. Gamarra, Y . E. Yang, M. Cook, and G. Dillingham, “A synthetic texas power system with time-series weather-dependent spatiotemporal profiles,”Sustainable Energy, Grids and Networks, p. 101774, 2025