pith. sign in

arxiv: 2507.02636 · v2 · submitted 2025-07-03 · 🧮 math.OC · cs.SY· eess.SY

Online Convex Optimization for Coordinated Long-Term and Short-Term Isolated Microgrid Dispatch

Pith reviewed 2026-05-19 06:28 UTC · model grok-4.3

classification 🧮 math.OC cs.SYeess.SY
keywords microgrid dispatchonline convex optimizationlong-duration energy storagestate-of-charge trackingregret boundsconvex hull approximationisolated microgridhybrid energy storage
0
0 comments X

The pith

A non-anticipatory framework coordinates long-term and short-term dispatch in isolated microgrids by approximating long-duration storage dynamics with a convex hull, using kernel regression for state-of-charge targets, and applying adaptive

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes a dispatch method for isolated microgrids that pairs long-duration and short-duration energy storage without requiring forecasts of future conditions. It replaces the nonconvex electrochemical model of long-duration storage with a convex hull approximation to keep the optimization tractable, then generates hindsight-optimal state-of-charge trajectories offline and uses kernel regression to produce dynamic reference trajectories online. An adaptive online convex optimization routine with explicit state-of-charge tracking and expert tracking is shown to produce policies whose cumulative cost exceeds the best fixed policy in hindsight by an amount that grows sublinearly in time. Simulations indicate that the resulting schedule lowers total cost by 73.4 percent relative to prior methods and removes load loss entirely. A reader would care because the approach turns an otherwise intractable long-horizon planning problem into a sequence of online convex problems whose performance guarantees improve with longer storage duration and more training data.

Core claim

By replacing the nonconvex LDES dynamics with a convex hull approximation, training kernel regression on hindsight-optimal SoC and net-load trajectories, and running an adaptive OCO algorithm that penalizes deviation from the dynamic SoC reference while tracking expert advice, both the long-term contract and short-term power decisions achieve sublinear regret; the combined policy reduces operating cost by 73.4 percent and eliminates load shedding compared with state-of-the-art baselines, with further gains as LDES duration increases.

What carries the argument

Adaptive online convex optimization algorithm that augments the standard regret-minimizing update with an SoC-reference tracking term and an expert-tracking term, allowing the step-size to adapt while enforcing consistency with the long-term SoC target generated by kernel regression.

If this is right

  • Long-term and short-term decisions can be computed sequentially without future information while still guaranteeing sublinear regret for both horizons.
  • Cost savings and reliability improve as the physical duration of the long-duration storage increases.
  • The framework tolerates forecast errors and sudden component failures without inducing load loss.
  • Stronger penalties on SoC deviation and more regression training scenarios tighten the regret bounds.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same convex-hull-plus-tracking structure could be tested on other nonconvex storage technologies whose dynamics admit a tight convex outer approximation.
  • Regret bounds that improve with finer convex approximations suggest a practical trade-off between model fidelity and computational speed that could be quantified on larger networks.
  • Because the method is non-anticipatory, it may integrate directly with real-time market signals or fault-detection systems without requiring separate forecast modules.

Load-bearing premise

The convex hull approximation of the nonconvex LDES electrochemical dynamics is accurate enough that decisions computed on the approximate model remain near-optimal when executed on the true dynamics.

What would settle it

Apply the online decisions produced by the approximated model to a high-fidelity nonconvex simulation of the LDES and check whether the realized cost and load-loss statistics remain within a few percent of the values reported under the convex model.

Figures

Figures reproduced from arXiv: 2507.02636 by Bolun Xu, Ning Qi, Yousuf Baker.

Figure 1
Figure 1. Figure 1: Illustration of Convex Hull Approximation for the LDES Model. [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Non-anticipatory long-short-term coordinated dispatch framework. [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Diagram of the test microgrid system. respectively. The initial SoC and final SoC target of LDES are set to 0.2 and 0.5, respectively. The nonconvex model of LDES is adopted from the semi-empirical model [5]. The ground-truth data for renewable generation and load power from 1984 to 2024 are publicly available [31]. The optimization is performed hourly over an entire year and implemented in MATLAB with Gur… view at source ↗
Figure 4
Figure 4. Figure 4: Training process and testing performance of kernel regression. [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 6
Figure 6. Figure 6: Performance of the proposed method: (a) varying with penalty [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Comparison of operational performance across different methods: [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗
read the original abstract

This paper proposes a novel non-anticipatory long-short-term coordinated dispatch framework for isolated microgrid with hybrid short-long-duration energy storages (LDES). We introduce a convex hull approximation model for nonconvex LDES electrochemical dynamics, facilitating computational tractability and accuracy. To address temporal coupling in SoC dynamics and long-term contracts, we generate hindsight-optimal state-of-charge (SoC) trajectories of LDES and netloads for offline training. In the online stage, we employ kernel regression to dynamically update the SoC reference and propose an adaptive online convex optimization (OCO) algorithm with SoC reference tracking and expert tracking to mitigate myopia and enable adaptive step-size optimization. We rigorously prove that both long-term and short-term policies achieve sublinear regret bounds over time, which improves with more regression scenarios, stronger tracking penalties, and finer convex approximations. Simulation results show that the proposed method outperforms state-of-the-art methods, reducing costs by 73.4%, eliminating load loss via reference tracking, and achieving an additional 2.4% cost saving via the OCO algorithm. These benefits scale up with longer LDES durations, and the method demonstrates resilience to poor forecasts and unexpected system faults.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The paper proposes a non-anticipatory long-short-term coordinated dispatch framework for isolated microgrids with hybrid short- and long-duration energy storage. It introduces a convex hull approximation for nonconvex LDES electrochemical dynamics to enable tractability, generates hindsight-optimal SoC trajectories offline for kernel regression-based dynamic references, and develops an adaptive OCO algorithm incorporating SoC reference tracking and expert tracking. The authors prove sublinear regret bounds for the long-term and short-term policies (improving with more regression scenarios, stronger tracking penalties, and finer approximations) and report simulation results showing 73.4% cost reduction versus state-of-the-art methods, elimination of load loss, and an additional 2.4% saving from the OCO component.

Significance. If the convex approximation accuracy is sufficient for the regret guarantees to translate to the true nonconvex dynamics, the framework provides a practical, theoretically grounded online method for microgrid dispatch that addresses temporal coupling and myopia in long-duration storage planning. The rigorous sublinear regret analysis, the scaling of benefits with LDES duration, and the resilience claims to forecast errors are strengths that could advance reliable operation of isolated systems with high renewable penetration.

major comments (1)
  1. The sublinear regret bounds are derived for the convexified problem (see the long-term and short-term policy analyses). No quantitative bound is supplied on the approximation error (e.g., Hausdorff distance between the convex hull and true nonconvex LDES feasible set, or the resulting optimality gap when the convex decisions are applied to the actual electrochemical dynamics). This is load-bearing for the central claims of 73.4% cost reduction and zero load loss, as the simulations demonstrate performance on the approximated model but do not include a direct comparison against a nonconvex reference solver on identical instances.
minor comments (2)
  1. Clarify in the simulation section the precise data exclusion rules, forecast error models, and parameter tuning procedure for the tracking penalties and convex approximation granularity to support reproducibility of the 73.4% figure.
  2. Add a short discussion of how the kernel regression scenario count and approximation fineness were selected in the numerical experiments.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the detailed and constructive review. The major comment correctly identifies that our regret analysis applies to the convexified problem and that a quantitative characterization of the convex hull approximation error is not provided in the current manuscript. We address this point directly below and commit to revisions that strengthen the connection between the theoretical guarantees and the nonconvex dynamics.

read point-by-point responses
  1. Referee: The sublinear regret bounds are derived for the convexified problem (see the long-term and short-term policy analyses). No quantitative bound is supplied on the approximation error (e.g., Hausdorff distance between the convex hull and true nonconvex LDES feasible set, or the resulting optimality gap when the convex decisions are applied to the actual electrochemical dynamics). This is load-bearing for the central claims of 73.4% cost reduction and zero load loss, as the simulations demonstrate performance on the approximated model but do not include a direct comparison against a nonconvex reference solver on identical instances.

    Authors: We agree that the sublinear regret bounds hold for the convexified formulation, which is necessary for tractable online optimization and for applying standard OCO analysis tools. The convex hull is constructed as a tight outer approximation of the nonconvex LDES electrochemical feasible set, ensuring that every convex-feasible decision remains feasible (though possibly conservative) when applied to the true dynamics. We acknowledge that the manuscript does not supply an explicit quantitative bound such as the Hausdorff distance or a derived optimality gap. In the revision we will add a dedicated subsection that (i) computes the Hausdorff distance numerically for the LDES parameters used in the case studies, (ii) provides an a-posteriori bound on the optimality gap incurred by projecting convex decisions onto the nonconvex set, and (iii) reports offline comparisons against a nonconvex solver on representative small-scale instances where exact solution is feasible. These additions will clarify the conditions under which the reported cost reductions and zero load-loss results translate to the original nonconvex system. The 73.4 % figure and load-loss elimination are obtained by running the full proposed pipeline (including the convex model) against benchmark methods that themselves rely on approximations or heuristics; we will make this comparison basis explicit. revision: yes

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper derives sublinear regret bounds mathematically for the adaptive OCO algorithm applied to the convexified long-term and short-term problems. The dependence of the bound on hyperparameters (number of regression scenarios, tracking penalties, approximation fineness) is a standard analytic feature and does not constitute a reduction to fitted values or self-definition. The convex-hull approximation is introduced as an explicit modeling step for tractability; performance on the original nonconvex dynamics is evaluated via simulation rather than claimed by construction from the regret proof. No self-citations, ansatz smuggling, or renaming of known results appear as load-bearing steps in the abstract or described framework. The derivation remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

Only the abstract is available, so the ledger is necessarily incomplete and limited to elements explicitly named in the summary.

free parameters (2)
  • tracking penalty weights
    Used to balance SoC reference tracking against cost minimization in the OCO stage; their specific values affect both regret and reported savings.
  • convex approximation granularity
    Fineness of the hull model is stated to improve regret; chosen to trade accuracy for tractability.
axioms (2)
  • domain assumption Hindsight-optimal SoC trajectories generated offline are representative enough to train a kernel regressor that produces useful online references.
    Invoked to justify the offline-to-online transfer; location: abstract description of training stage.
  • domain assumption The convex hull approximation preserves the essential feasible region and cost behavior of the true nonconvex LDES dynamics.
    Central modeling choice enabling tractability; location: abstract introduction of the model.

pith-pipeline@v0.9.0 · 5750 in / 1641 out tokens · 58811 ms · 2026-05-19T06:28:55.207348+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

35 extracted references · 35 canonical work pages

  1. [1]

    Practical energy management systems for isolated microgrids,

    B. V . Solanki, C. A. Ca ˜nizares, and K. Bhattacharya, “Practical energy management systems for isolated microgrids,” IEEE Transactions on Smart Grid , vol. 10, no. 5, pp. 4762–4775, 2018

  2. [2]

    Feasibility, environmental, and economic analysis of alternative fuel distributed power systems for reliable off-grid energy supply,

    Z. Wang, Y . Lin, Y . Guoet al., “Feasibility, environmental, and economic analysis of alternative fuel distributed power systems for reliable off-grid energy supply,” Applied Energy, vol. 384, p. 125493, 2025

  3. [3]

    Long-term and short-term coordinated scheduling for wind-pv-hydro-storage hybrid energy system based on deep reinforcement learning,

    H. Zhang, K. Liao, J. Yang et al. , “Long-term and short-term coordinated scheduling for wind-pv-hydro-storage hybrid energy system based on deep reinforcement learning,” IEEE Transactions on Sustainable Energy, 2025

  4. [4]

    Transferable energy storage bidder,

    Y . Baker, N. Zheng, and B. Xu, “Transferable energy storage bidder,” IEEE Transactions on Power Systems , vol. 39, no. 2, pp. 4117–4126, 2023

  5. [5]

    Semi-empirical model and experimental validation for the performance evaluation of a 15 kw alkaline water electrolyzer,

    M. S ´anchez, E. Amores, L. Rodr ´ıguez et al. , “Semi-empirical model and experimental validation for the performance evaluation of a 15 kw alkaline water electrolyzer,” International Journal of Hydrogen Energy , vol. 43, no. 45, pp. 20 332–20 345, 2018

  6. [6]

    Two-layer robust optimization framework for resilience enhancement of microgrids con- sidering hydrogen and electrical energy storage systems,

    S. M. A. Hashemifar, M. Joorabian, and M. S. Javadi, “Two-layer robust optimization framework for resilience enhancement of microgrids con- sidering hydrogen and electrical energy storage systems,” International Journal of Hydrogen Energy , vol. 47, no. 79, pp. 33 597–33 618, 2022

  7. [7]

    Robust optimization of scale and revenue for integrated power-to-hydrogen systems within energy, ancillary services, and hydrogen markets,

    Z. Gu, G. Pan, W. Gu et al., “Robust optimization of scale and revenue for integrated power-to-hydrogen systems within energy, ancillary services, and hydrogen markets,” IEEE Transactions on Power Systems , vol. 39, no. 3, pp. 5008–5023, 2023

  8. [8]

    A data-driven stochastic optimization approach to the seasonal storage energy management,

    G. Darivianakis, A. Eichler, R. S. Smith et al., “A data-driven stochastic optimization approach to the seasonal storage energy management,” IEEE control systems letters , vol. 1, no. 2, pp. 394–399, 2017

  9. [9]

    Forecast-driven stochastic optimization scheduling of an energy management system for an isolated hydrogen microgrid,

    W. Dong, H. Sun, C. Mei et al., “Forecast-driven stochastic optimization scheduling of an energy management system for an isolated hydrogen microgrid,” Energy Conversion and Management , vol. 277, p. 116640, 2023

  10. [10]

    Distributionally robust optimal scheduling with heterogeneous uncertainty information: A framework for hydrogen systems,

    A. Zhou, M. E. Khodayar, and J. Wang, “Distributionally robust optimal scheduling with heterogeneous uncertainty information: A framework for hydrogen systems,” IEEE Transactions on Sustainable Energy , 2024

  11. [11]

    Long-term operation of isolated microgrids with renewables and hybrid seasonal-battery storage,

    Z. Guo, W. Wei, J. Bai et al. , “Long-term operation of isolated microgrids with renewables and hybrid seasonal-battery storage,” Applied Energy, vol. 349, p. 121628, 2023

  12. [12]

    Hierarchical model predictive control for islanded and grid-connected microgrids with wind generation and hydrogen energy storage systems,

    M. B. Abdelghany, V . Mariani, D. Liuzza et al. , “Hierarchical model predictive control for islanded and grid-connected microgrids with wind generation and hydrogen energy storage systems,” International Journal of Hydrogen Energy , vol. 51, pp. 595–610, 2024

  13. [13]

    A soft actor-critic deep reinforcement learning method for multi-timescale coordinated operation of microgrids,

    C. Hu, Z. Cai, Y . Zhang et al. , “A soft actor-critic deep reinforcement learning method for multi-timescale coordinated operation of microgrids,” Protection and Control of Modern Power Systems , vol. 7, no. 1, p. 29, 2022

  14. [14]

    A stochastic dynamic programming model for hydropower scheduling with state-dependent maximum discharge constraints,

    L. E. Sch ¨affer, A. Helseth, and M. Korp ˚as, “A stochastic dynamic programming model for hydropower scheduling with state-dependent maximum discharge constraints,” Renewable Energy , vol. 194, pp. 571–581, 2022

  15. [15]

    Multi-stage real-time operation of a multi-energy microgrid with electrical and thermal energy storage assets: A data-driven mpc-adp approach,

    Z. Li, L. Wu, Y . Xu et al. , “Multi-stage real-time operation of a multi-energy microgrid with electrical and thermal energy storage assets: A data-driven mpc-adp approach,” IEEE Transactions on Smart Grid, vol. 13, no. 1, pp. 213–226, 2021

  16. [16]

    Real-time energy management in microgrids,

    W. Shi, N. Li, C.-C. Chu et al. , “Real-time energy management in microgrids,” IEEE Transactions on Smart Grid , vol. 8, no. 1, pp. 228–238, 2015

  17. [17]

    Joint optimization and learning approach for smart operation of hydrogen-based building energy systems,

    L. Yu, Z. Xu, X. Guan et al., “Joint optimization and learning approach for smart operation of hydrogen-based building energy systems,” IEEE Transactions on Smart Grid , vol. 14, no. 1, pp. 199–216, 2022

  18. [18]

    Predicting residential energy consumption using cnn-lstm neural networks,

    T.-Y . Kim and S.-B. Cho, “Predicting residential energy consumption using cnn-lstm neural networks,” Energy, vol. 182, pp. 72–81, 2019

  19. [19]

    Online convex optimization of multi-energy building-to-grid ancillary services,

    A. Lesage-Landry, H. Wang, I. Shames et al. , “Online convex optimization of multi-energy building-to-grid ancillary services,” IEEE Transactions on Control Systems Technology , vol. 28, no. 6, pp. 2416–2431, 2019

  20. [20]

    Real-time feedback-based optimization of distribution grids: A unified approach,

    A. Bernstein and E. Dall’Anese, “Real-time feedback-based optimization of distribution grids: A unified approach,” IEEE Transactions on Control of Network Systems , vol. 6, no. 3, pp. 1197–1209, 2019

  21. [21]

    Real-time feedback based online aggregate ev power flexibility characterization,

    D. Yan, S. Huang, and Y . Chen, “Real-time feedback based online aggregate ev power flexibility characterization,” IEEE Transactions on Sustainable Energy, vol. 15, no. 1, pp. 658–673, 2023

  22. [22]

    A lagrangian-informed long-term dispatch policy for coupled hydropower and photovoltaic systems,

    E. Cohn, N. Qi, U. Lall et al., “A lagrangian-informed long-term dispatch policy for coupled hydropower and photovoltaic systems,” in 2025 IEEE Power & Energy Society General Meeting . IEEE, 2025, pp. 1–5

  23. [23]

    Long-term energy management for microgrid with hybrid hydrogen-battery energy storage: A prediction- free coordinated optimization framework,

    N. Qi, K. Huang, Z. Fan et al. , “Long-term energy management for microgrid with hybrid hydrogen-battery energy storage: A prediction- free coordinated optimization framework,” Applied Energy, vol. 377, p. 124485, 2025

  24. [24]

    Shrinking and receding horizon approaches for long-term operational planning of energy storage and supply systems,

    T. Wakui, K. Akai, and R. Yokoyama, “Shrinking and receding horizon approaches for long-term operational planning of energy storage and supply systems,” Energy, vol. 239, p. 122066, 2022

  25. [25]

    Securing long-term dispatch of isolated microgrids with high-penetration renewable generation: A controlled evolution-based framework,

    K. Kang, Y . Su, P. Yang et al., “Securing long-term dispatch of isolated microgrids with high-penetration renewable generation: A controlled evolution-based framework,”Applied Energy, vol. 381, p. 125140, 2025

  26. [26]

    Sufficient conditions for exact relaxation of complementarity constraints for storage-concerned economic dispatch,

    Z. Li, Q. Guo, H. Sun et al. , “Sufficient conditions for exact relaxation of complementarity constraints for storage-concerned economic dispatch,” IEEE Transactions on Power Systems , vol. 31, no. 2, pp. 1653–1654, 2015

  27. [27]

    A conic model for electrolyzer scheduling,

    E. Raheli, Y . Werner, and J. Kazempour, “A conic model for electrolyzer scheduling,” Computers & Chemical Engineering , vol. 179, p. 108450, 2023

  28. [28]

    Locational energy storage bid bounds for facilitating social welfare convergence,

    N. Qi and B. Xu, “Locational energy storage bid bounds for facilitating social welfare convergence,” IEEE Transactions on Energy Markets, Policy and Regulation , 2025

  29. [29]

    Long-term scenario generation of renewable energy generation using attention-based conditional generative adversarial networks,

    H. Li, H. Yu, Z. Liu et al. , “Long-term scenario generation of renewable energy generation using attention-based conditional generative adversarial networks,” Energy Conversion and Economics , vol. 5, no. 1, pp. 15–27, 2024

  30. [30]

    Knowledge-integrated gan model for stochastic time-series simulation of year-round weather for photovoltaic integration analysis,

    X. Fu, F. Chang, H. Sun et al. , “Knowledge-integrated gan model for stochastic time-series simulation of year-round weather for photovoltaic integration analysis,” IEEE Transactions on Power Systems , 2025

  31. [31]

    Data and proof for online convex optimization,

    N. Qi, “Data and proof for online convex optimization,” 2025, [Online]. Available: https://github.com/thuqining/online convex optimization.git

  32. [32]

    Online convex optimization using predictions,

    N. Chen, A. Agarwal, A. Wierman et al. , “Online convex optimization using predictions,” in Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2015, pp. 191–204

  33. [33]

    Convex hull based robust security region for electricity-gas integrated energy systems,

    S. Chen, Z. Wei, G. Sun et al. , “Convex hull based robust security region for electricity-gas integrated energy systems,” IEEE Transactions on Power Systems , vol. 34, no. 3, pp. 1740–1748, 2018

  34. [34]

    Distributed online convex optimization with time-varying coupled inequality constraints,

    X. Yi, X. Li, L. Xie et al. , “Distributed online convex optimization with time-varying coupled inequality constraints,” IEEE Transactions on Signal Processing , vol. 68, pp. 731–746, 2020

  35. [35]

    Adaptive online learning in dynamic environments,

    L. Zhang, S. Lu, and Z.-H. Zhou, “Adaptive online learning in dynamic environments,” Advances in neural information processing systems , vol. 31, 2018. IEEE TRANSACTIONS ON SMART GRID, VOL. X, NO. X, XX JULY 2025 10 APPENDIX A. Proof of Equivalence between Long-Term Trajectory Tracking and Explicit Time-Coupling Constraints On one hand, since the referenc...