Online Convex Optimization for Coordinated Long-Term and Short-Term Isolated Microgrid Dispatch
Pith reviewed 2026-05-19 06:28 UTC · model grok-4.3
The pith
A non-anticipatory framework coordinates long-term and short-term dispatch in isolated microgrids by approximating long-duration storage dynamics with a convex hull, using kernel regression for state-of-charge targets, and applying adaptive
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By replacing the nonconvex LDES dynamics with a convex hull approximation, training kernel regression on hindsight-optimal SoC and net-load trajectories, and running an adaptive OCO algorithm that penalizes deviation from the dynamic SoC reference while tracking expert advice, both the long-term contract and short-term power decisions achieve sublinear regret; the combined policy reduces operating cost by 73.4 percent and eliminates load shedding compared with state-of-the-art baselines, with further gains as LDES duration increases.
What carries the argument
Adaptive online convex optimization algorithm that augments the standard regret-minimizing update with an SoC-reference tracking term and an expert-tracking term, allowing the step-size to adapt while enforcing consistency with the long-term SoC target generated by kernel regression.
If this is right
- Long-term and short-term decisions can be computed sequentially without future information while still guaranteeing sublinear regret for both horizons.
- Cost savings and reliability improve as the physical duration of the long-duration storage increases.
- The framework tolerates forecast errors and sudden component failures without inducing load loss.
- Stronger penalties on SoC deviation and more regression training scenarios tighten the regret bounds.
Where Pith is reading between the lines
- The same convex-hull-plus-tracking structure could be tested on other nonconvex storage technologies whose dynamics admit a tight convex outer approximation.
- Regret bounds that improve with finer convex approximations suggest a practical trade-off between model fidelity and computational speed that could be quantified on larger networks.
- Because the method is non-anticipatory, it may integrate directly with real-time market signals or fault-detection systems without requiring separate forecast modules.
Load-bearing premise
The convex hull approximation of the nonconvex LDES electrochemical dynamics is accurate enough that decisions computed on the approximate model remain near-optimal when executed on the true dynamics.
What would settle it
Apply the online decisions produced by the approximated model to a high-fidelity nonconvex simulation of the LDES and check whether the realized cost and load-loss statistics remain within a few percent of the values reported under the convex model.
Figures
read the original abstract
This paper proposes a novel non-anticipatory long-short-term coordinated dispatch framework for isolated microgrid with hybrid short-long-duration energy storages (LDES). We introduce a convex hull approximation model for nonconvex LDES electrochemical dynamics, facilitating computational tractability and accuracy. To address temporal coupling in SoC dynamics and long-term contracts, we generate hindsight-optimal state-of-charge (SoC) trajectories of LDES and netloads for offline training. In the online stage, we employ kernel regression to dynamically update the SoC reference and propose an adaptive online convex optimization (OCO) algorithm with SoC reference tracking and expert tracking to mitigate myopia and enable adaptive step-size optimization. We rigorously prove that both long-term and short-term policies achieve sublinear regret bounds over time, which improves with more regression scenarios, stronger tracking penalties, and finer convex approximations. Simulation results show that the proposed method outperforms state-of-the-art methods, reducing costs by 73.4%, eliminating load loss via reference tracking, and achieving an additional 2.4% cost saving via the OCO algorithm. These benefits scale up with longer LDES durations, and the method demonstrates resilience to poor forecasts and unexpected system faults.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a non-anticipatory long-short-term coordinated dispatch framework for isolated microgrids with hybrid short- and long-duration energy storage. It introduces a convex hull approximation for nonconvex LDES electrochemical dynamics to enable tractability, generates hindsight-optimal SoC trajectories offline for kernel regression-based dynamic references, and develops an adaptive OCO algorithm incorporating SoC reference tracking and expert tracking. The authors prove sublinear regret bounds for the long-term and short-term policies (improving with more regression scenarios, stronger tracking penalties, and finer approximations) and report simulation results showing 73.4% cost reduction versus state-of-the-art methods, elimination of load loss, and an additional 2.4% saving from the OCO component.
Significance. If the convex approximation accuracy is sufficient for the regret guarantees to translate to the true nonconvex dynamics, the framework provides a practical, theoretically grounded online method for microgrid dispatch that addresses temporal coupling and myopia in long-duration storage planning. The rigorous sublinear regret analysis, the scaling of benefits with LDES duration, and the resilience claims to forecast errors are strengths that could advance reliable operation of isolated systems with high renewable penetration.
major comments (1)
- The sublinear regret bounds are derived for the convexified problem (see the long-term and short-term policy analyses). No quantitative bound is supplied on the approximation error (e.g., Hausdorff distance between the convex hull and true nonconvex LDES feasible set, or the resulting optimality gap when the convex decisions are applied to the actual electrochemical dynamics). This is load-bearing for the central claims of 73.4% cost reduction and zero load loss, as the simulations demonstrate performance on the approximated model but do not include a direct comparison against a nonconvex reference solver on identical instances.
minor comments (2)
- Clarify in the simulation section the precise data exclusion rules, forecast error models, and parameter tuning procedure for the tracking penalties and convex approximation granularity to support reproducibility of the 73.4% figure.
- Add a short discussion of how the kernel regression scenario count and approximation fineness were selected in the numerical experiments.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive review. The major comment correctly identifies that our regret analysis applies to the convexified problem and that a quantitative characterization of the convex hull approximation error is not provided in the current manuscript. We address this point directly below and commit to revisions that strengthen the connection between the theoretical guarantees and the nonconvex dynamics.
read point-by-point responses
-
Referee: The sublinear regret bounds are derived for the convexified problem (see the long-term and short-term policy analyses). No quantitative bound is supplied on the approximation error (e.g., Hausdorff distance between the convex hull and true nonconvex LDES feasible set, or the resulting optimality gap when the convex decisions are applied to the actual electrochemical dynamics). This is load-bearing for the central claims of 73.4% cost reduction and zero load loss, as the simulations demonstrate performance on the approximated model but do not include a direct comparison against a nonconvex reference solver on identical instances.
Authors: We agree that the sublinear regret bounds hold for the convexified formulation, which is necessary for tractable online optimization and for applying standard OCO analysis tools. The convex hull is constructed as a tight outer approximation of the nonconvex LDES electrochemical feasible set, ensuring that every convex-feasible decision remains feasible (though possibly conservative) when applied to the true dynamics. We acknowledge that the manuscript does not supply an explicit quantitative bound such as the Hausdorff distance or a derived optimality gap. In the revision we will add a dedicated subsection that (i) computes the Hausdorff distance numerically for the LDES parameters used in the case studies, (ii) provides an a-posteriori bound on the optimality gap incurred by projecting convex decisions onto the nonconvex set, and (iii) reports offline comparisons against a nonconvex solver on representative small-scale instances where exact solution is feasible. These additions will clarify the conditions under which the reported cost reductions and zero load-loss results translate to the original nonconvex system. The 73.4 % figure and load-loss elimination are obtained by running the full proposed pipeline (including the convex model) against benchmark methods that themselves rely on approximations or heuristics; we will make this comparison basis explicit. revision: yes
Circularity Check
No significant circularity in derivation chain
full rationale
The paper derives sublinear regret bounds mathematically for the adaptive OCO algorithm applied to the convexified long-term and short-term problems. The dependence of the bound on hyperparameters (number of regression scenarios, tracking penalties, approximation fineness) is a standard analytic feature and does not constitute a reduction to fitted values or self-definition. The convex-hull approximation is introduced as an explicit modeling step for tractability; performance on the original nonconvex dynamics is evaluated via simulation rather than claimed by construction from the regret proof. No self-citations, ansatz smuggling, or renaming of known results appear as load-bearing steps in the abstract or described framework. The derivation remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (2)
- tracking penalty weights
- convex approximation granularity
axioms (2)
- domain assumption Hindsight-optimal SoC trajectories generated offline are representative enough to train a kernel regressor that produces useful online references.
- domain assumption The convex hull approximation preserves the essential feasible region and cost behavior of the true nonconvex LDES dynamics.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We introduce a convex hull approximation model for nonconvex LDES electrochemical dynamics... adaptive online convex optimization (OCO) algorithm with SoC reference tracking and expert tracking... sublinear regret bounds
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We prove that both long-term and short-term policies achieve sublinear dynamic regret bounds over time, which improves with more regression scenarios, stronger tracking penalties, and finer convex approximations.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Practical energy management systems for isolated microgrids,
B. V . Solanki, C. A. Ca ˜nizares, and K. Bhattacharya, “Practical energy management systems for isolated microgrids,” IEEE Transactions on Smart Grid , vol. 10, no. 5, pp. 4762–4775, 2018
work page 2018
-
[2]
Z. Wang, Y . Lin, Y . Guoet al., “Feasibility, environmental, and economic analysis of alternative fuel distributed power systems for reliable off-grid energy supply,” Applied Energy, vol. 384, p. 125493, 2025
work page 2025
-
[3]
H. Zhang, K. Liao, J. Yang et al. , “Long-term and short-term coordinated scheduling for wind-pv-hydro-storage hybrid energy system based on deep reinforcement learning,” IEEE Transactions on Sustainable Energy, 2025
work page 2025
-
[4]
Transferable energy storage bidder,
Y . Baker, N. Zheng, and B. Xu, “Transferable energy storage bidder,” IEEE Transactions on Power Systems , vol. 39, no. 2, pp. 4117–4126, 2023
work page 2023
-
[5]
M. S ´anchez, E. Amores, L. Rodr ´ıguez et al. , “Semi-empirical model and experimental validation for the performance evaluation of a 15 kw alkaline water electrolyzer,” International Journal of Hydrogen Energy , vol. 43, no. 45, pp. 20 332–20 345, 2018
work page 2018
-
[6]
S. M. A. Hashemifar, M. Joorabian, and M. S. Javadi, “Two-layer robust optimization framework for resilience enhancement of microgrids con- sidering hydrogen and electrical energy storage systems,” International Journal of Hydrogen Energy , vol. 47, no. 79, pp. 33 597–33 618, 2022
work page 2022
-
[7]
Z. Gu, G. Pan, W. Gu et al., “Robust optimization of scale and revenue for integrated power-to-hydrogen systems within energy, ancillary services, and hydrogen markets,” IEEE Transactions on Power Systems , vol. 39, no. 3, pp. 5008–5023, 2023
work page 2023
-
[8]
A data-driven stochastic optimization approach to the seasonal storage energy management,
G. Darivianakis, A. Eichler, R. S. Smith et al., “A data-driven stochastic optimization approach to the seasonal storage energy management,” IEEE control systems letters , vol. 1, no. 2, pp. 394–399, 2017
work page 2017
-
[9]
W. Dong, H. Sun, C. Mei et al., “Forecast-driven stochastic optimization scheduling of an energy management system for an isolated hydrogen microgrid,” Energy Conversion and Management , vol. 277, p. 116640, 2023
work page 2023
-
[10]
A. Zhou, M. E. Khodayar, and J. Wang, “Distributionally robust optimal scheduling with heterogeneous uncertainty information: A framework for hydrogen systems,” IEEE Transactions on Sustainable Energy , 2024
work page 2024
-
[11]
Long-term operation of isolated microgrids with renewables and hybrid seasonal-battery storage,
Z. Guo, W. Wei, J. Bai et al. , “Long-term operation of isolated microgrids with renewables and hybrid seasonal-battery storage,” Applied Energy, vol. 349, p. 121628, 2023
work page 2023
-
[12]
M. B. Abdelghany, V . Mariani, D. Liuzza et al. , “Hierarchical model predictive control for islanded and grid-connected microgrids with wind generation and hydrogen energy storage systems,” International Journal of Hydrogen Energy , vol. 51, pp. 595–610, 2024
work page 2024
-
[13]
C. Hu, Z. Cai, Y . Zhang et al. , “A soft actor-critic deep reinforcement learning method for multi-timescale coordinated operation of microgrids,” Protection and Control of Modern Power Systems , vol. 7, no. 1, p. 29, 2022
work page 2022
-
[14]
L. E. Sch ¨affer, A. Helseth, and M. Korp ˚as, “A stochastic dynamic programming model for hydropower scheduling with state-dependent maximum discharge constraints,” Renewable Energy , vol. 194, pp. 571–581, 2022
work page 2022
-
[15]
Z. Li, L. Wu, Y . Xu et al. , “Multi-stage real-time operation of a multi-energy microgrid with electrical and thermal energy storage assets: A data-driven mpc-adp approach,” IEEE Transactions on Smart Grid, vol. 13, no. 1, pp. 213–226, 2021
work page 2021
-
[16]
Real-time energy management in microgrids,
W. Shi, N. Li, C.-C. Chu et al. , “Real-time energy management in microgrids,” IEEE Transactions on Smart Grid , vol. 8, no. 1, pp. 228–238, 2015
work page 2015
-
[17]
L. Yu, Z. Xu, X. Guan et al., “Joint optimization and learning approach for smart operation of hydrogen-based building energy systems,” IEEE Transactions on Smart Grid , vol. 14, no. 1, pp. 199–216, 2022
work page 2022
-
[18]
Predicting residential energy consumption using cnn-lstm neural networks,
T.-Y . Kim and S.-B. Cho, “Predicting residential energy consumption using cnn-lstm neural networks,” Energy, vol. 182, pp. 72–81, 2019
work page 2019
-
[19]
Online convex optimization of multi-energy building-to-grid ancillary services,
A. Lesage-Landry, H. Wang, I. Shames et al. , “Online convex optimization of multi-energy building-to-grid ancillary services,” IEEE Transactions on Control Systems Technology , vol. 28, no. 6, pp. 2416–2431, 2019
work page 2019
-
[20]
Real-time feedback-based optimization of distribution grids: A unified approach,
A. Bernstein and E. Dall’Anese, “Real-time feedback-based optimization of distribution grids: A unified approach,” IEEE Transactions on Control of Network Systems , vol. 6, no. 3, pp. 1197–1209, 2019
work page 2019
-
[21]
Real-time feedback based online aggregate ev power flexibility characterization,
D. Yan, S. Huang, and Y . Chen, “Real-time feedback based online aggregate ev power flexibility characterization,” IEEE Transactions on Sustainable Energy, vol. 15, no. 1, pp. 658–673, 2023
work page 2023
-
[22]
A lagrangian-informed long-term dispatch policy for coupled hydropower and photovoltaic systems,
E. Cohn, N. Qi, U. Lall et al., “A lagrangian-informed long-term dispatch policy for coupled hydropower and photovoltaic systems,” in 2025 IEEE Power & Energy Society General Meeting . IEEE, 2025, pp. 1–5
work page 2025
-
[23]
N. Qi, K. Huang, Z. Fan et al. , “Long-term energy management for microgrid with hybrid hydrogen-battery energy storage: A prediction- free coordinated optimization framework,” Applied Energy, vol. 377, p. 124485, 2025
work page 2025
-
[24]
T. Wakui, K. Akai, and R. Yokoyama, “Shrinking and receding horizon approaches for long-term operational planning of energy storage and supply systems,” Energy, vol. 239, p. 122066, 2022
work page 2022
-
[25]
K. Kang, Y . Su, P. Yang et al., “Securing long-term dispatch of isolated microgrids with high-penetration renewable generation: A controlled evolution-based framework,”Applied Energy, vol. 381, p. 125140, 2025
work page 2025
-
[26]
Z. Li, Q. Guo, H. Sun et al. , “Sufficient conditions for exact relaxation of complementarity constraints for storage-concerned economic dispatch,” IEEE Transactions on Power Systems , vol. 31, no. 2, pp. 1653–1654, 2015
work page 2015
-
[27]
A conic model for electrolyzer scheduling,
E. Raheli, Y . Werner, and J. Kazempour, “A conic model for electrolyzer scheduling,” Computers & Chemical Engineering , vol. 179, p. 108450, 2023
work page 2023
-
[28]
Locational energy storage bid bounds for facilitating social welfare convergence,
N. Qi and B. Xu, “Locational energy storage bid bounds for facilitating social welfare convergence,” IEEE Transactions on Energy Markets, Policy and Regulation , 2025
work page 2025
-
[29]
H. Li, H. Yu, Z. Liu et al. , “Long-term scenario generation of renewable energy generation using attention-based conditional generative adversarial networks,” Energy Conversion and Economics , vol. 5, no. 1, pp. 15–27, 2024
work page 2024
-
[30]
X. Fu, F. Chang, H. Sun et al. , “Knowledge-integrated gan model for stochastic time-series simulation of year-round weather for photovoltaic integration analysis,” IEEE Transactions on Power Systems , 2025
work page 2025
-
[31]
Data and proof for online convex optimization,
N. Qi, “Data and proof for online convex optimization,” 2025, [Online]. Available: https://github.com/thuqining/online convex optimization.git
work page 2025
-
[32]
Online convex optimization using predictions,
N. Chen, A. Agarwal, A. Wierman et al. , “Online convex optimization using predictions,” in Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2015, pp. 191–204
work page 2015
-
[33]
Convex hull based robust security region for electricity-gas integrated energy systems,
S. Chen, Z. Wei, G. Sun et al. , “Convex hull based robust security region for electricity-gas integrated energy systems,” IEEE Transactions on Power Systems , vol. 34, no. 3, pp. 1740–1748, 2018
work page 2018
-
[34]
Distributed online convex optimization with time-varying coupled inequality constraints,
X. Yi, X. Li, L. Xie et al. , “Distributed online convex optimization with time-varying coupled inequality constraints,” IEEE Transactions on Signal Processing , vol. 68, pp. 731–746, 2020
work page 2020
-
[35]
Adaptive online learning in dynamic environments,
L. Zhang, S. Lu, and Z.-H. Zhou, “Adaptive online learning in dynamic environments,” Advances in neural information processing systems , vol. 31, 2018. IEEE TRANSACTIONS ON SMART GRID, VOL. X, NO. X, XX JULY 2025 10 APPENDIX A. Proof of Equivalence between Long-Term Trajectory Tracking and Explicit Time-Coupling Constraints On one hand, since the referenc...
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.