A Markov Decision Process Framework for Enhancing Power System Resilience during Wildfires under Decision-Dependent Uncertainty
Pith reviewed 2026-05-10 17:58 UTC · model grok-4.3
The pith
A Markov Decision Process optimizes power switching actions during wildfires to minimize total operational costs.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Representing network topologies as Markov states with transitions driven by exogenous weather and endogenous power flow dynamics allows an MDP formulation that optimizes switching sequences to minimize total operational costs throughout a wildfire event; the resulting policies are computed efficiently via approximate dynamic programming on post-decision states.
What carries the argument
Markov Decision Process in which network topologies are states and transitions combine weather-driven exogenous changes with power-flow-driven endogenous changes, solved by approximate dynamic programming on post-decision states.
If this is right
- The framework produces time-varying shutoff schedules that trade off immediate load loss against long-term ignition and damage costs.
- Approximate dynamic programming on post-decision states renders the approach computationally feasible for systems up to at least 138 buses.
- The same state representation supports repeated re-optimization as new weather observations arrive.
- Cost-minimizing policies differ across grid topologies, showing the value of tailoring decisions to each network's configuration.
Where Pith is reading between the lines
- The same MDP structure could be adapted to other time-evolving hazards such as storms or heat waves that also alter line failure probabilities.
- Embedding real-time sensor data into the transition probabilities would allow the model to update policies without full re-solving.
- Extending the state space to include crew locations or repair resources could turn the framework into a joint resilience and restoration planner.
Load-bearing premise
That the uncertainty in network conditions during a wildfire can be captured sufficiently well by Markov state transitions that depend only on weather and power flows.
What would settle it
Deploy the computed shutoff policy on a live distribution feeder during an actual wildfire and compare the realized total costs and ignition events against the model's predicted minimum.
Figures
read the original abstract
Wildfires pose an increasing threat to the safety and reliability of power systems, particularly in distribution networks located in fire-prone regions. To mitigate ignition risk from electrical infrastructure, utilities often employ safety power shutoffs, which proactively de-energize high-risk lines during hazardous weather and restore them once conditions improve. While this strategy can result in temporary load loss, it helps prevent equipment damage and wildfire ignition development in the system. In this paper, we develop a state-based decision-making framework to optimize such switching actions over time, with the goal of minimizing total operational costs throughout a wildfire event. The model represents network topologies as Markov states, with transitions influenced by both exogenous weather conditions and endogenous power flow dynamics. To address the computational challenges posed by the large state and action spaces, we propose an approximate dynamic programming algorithm based on post-decision states. The effectiveness and scalability of the proposed approach are demonstrated through case studies on 54-bus and 138-bus distribution systems, showcasing its potential for enhancing wildfire resilience across different grid configurations.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper develops an MDP framework for optimizing safety power shutoff actions in distribution networks during wildfires to minimize total operational costs. Network topologies are represented as discrete Markov states whose transitions depend on exogenous weather conditions and endogenous power-flow quantities; an approximate dynamic programming algorithm using post-decision states is proposed to solve the resulting large-scale problem. Effectiveness is illustrated via case studies on 54-bus and 138-bus test systems.
Significance. If the central modeling assumptions hold, the work supplies a computationally tractable, state-based policy for utilities facing decision-dependent wildfire risk, directly addressing a growing operational challenge. The use of post-decision states to mitigate the curse of dimensionality is a constructive algorithmic contribution, and the demonstration across two differently sized distribution systems provides initial evidence of scalability.
major comments (1)
- Abstract and modeling section: The claim that the MDP correctly optimizes shutoff policies under decision-dependent uncertainty rests on the unvalidated premise that topology transitions are Markovian and fully captured by weather plus power-flow dynamics. The 54-bus and 138-bus case studies rely on synthetic transition probabilities with no reported calibration, sensitivity analysis, or comparison against physics-based fire-spread models; without such evidence the optimality guarantees and resilience improvements cannot be assessed.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive feedback. The concern about validating the Markovian transitions and the use of synthetic probabilities is well-taken; we clarify our modeling rationale and commit to targeted revisions that strengthen the presentation without altering the core contribution.
read point-by-point responses
-
Referee: Abstract and modeling section: The claim that the MDP correctly optimizes shutoff policies under decision-dependent uncertainty rests on the unvalidated premise that topology transitions are Markovian and fully captured by weather plus power-flow dynamics. The 54-bus and 138-bus case studies rely on synthetic transition probabilities with no reported calibration, sensitivity analysis, or comparison against physics-based fire-spread models; without such evidence the optimality guarantees and resilience improvements cannot be assessed.
Authors: The Markov property is a deliberate modeling choice: the state vector explicitly includes the current network topology (as a discrete Markov state) together with exogenous weather conditions, so that the transition kernel depends on both weather and the endogenous power-flow quantities that result from the chosen switching action. This structure directly encodes decision-dependent uncertainty. We acknowledge that the numerical case studies employ illustrative transition probabilities chosen to demonstrate scalability rather than calibrated from field data. In the revised manuscript we will (i) add an explicit subsection on calibration approaches that leverage historical wildfire, weather, and outage records, (ii) include a sensitivity analysis that perturbs the transition probabilities over plausible ranges and reports the resulting policy and cost variations, and (iii) cite representative physics-based fire-spread models (e.g., those based on Rothermel or cellular automata) while clarifying that our framework is intended to accept transition probabilities generated by such models. Because the paper focuses on the decision-making algorithm rather than fire physics, a full empirical comparison lies outside the present scope; the added discussion will make this boundary explicit. revision: partial
Circularity Check
MDP modeling framework for wildfire resilience contains no circular reductions
full rationale
The paper introduces an MDP in which network topologies are treated as discrete Markov states whose transitions depend on exogenous weather and endogenous power-flow quantities. This representation is an explicit modeling assumption rather than a derived result obtained by fitting parameters to data or by self-referential definition. The subsequent approximate dynamic programming algorithm is a standard solution technique for the resulting large-scale MDP and does not rely on any fitted prediction that is forced by construction. Case studies on the 54-bus and 138-bus systems serve only to illustrate computational scalability under synthetic transition probabilities; they do not close a loop in which an output is redefined as an input. Consequently the claimed optimization framework remains self-contained and independent of its own outputs.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Network topologies can be represented as Markov states with transitions influenced by both exogenous weather conditions and endogenous power flow dynamics.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The model represents network topologies as Markov states, with transitions influenced by both exogenous weather conditions and endogenous power flow dynamics.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Human exposure and sensitivity to globally extreme wildfire events,
D. M. Bowman, G. J. Williamson, J. T. Abatzoglou, C. A. Kolden, M. A. Cochrane, and A. M. Smith, “Human exposure and sensitivity to globally extreme wildfire events,”Nature ecology & evolution, vol. 1, no. 3, p. 0058, 2017
work page 2017
-
[2]
arXiv preprint arXiv:2501.17880 , year=
S. T. Seydi, “Assessment of the january 2025 los angeles county wildfires: A multi-modal analysis of impact, response, and population exposure,”arXiv preprint arXiv:2501.17880, 2025
-
[3]
Wildfire mitigation plans in power systems: A literature review,
D. A. Z. Vazquez, F. Qiu, N. Fan, and K. Sharp, “Wildfire mitigation plans in power systems: A literature review,”IEEE Transactions on Power Systems, vol. 37, no. 5, pp. 3540–3551, 2022
work page 2022
-
[4]
C. Huang, et al, “A review of public safety power shutoffs (psps) for wildfire mitigation: Policies, practices, models and data sources,”IEEE Transactions on Energy Markets, Policy and Regulation, vol. 1, no. 3, pp. 187–197, 2023
work page 2023
-
[5]
J. T. Abatzoglou, C. M. Smith, D. L. Swain, T. Ptak, and C. A. Kolden, “Population exposure to pre-emptive de-energization aimed at averting wildfires in northern california,”Environmental Research Letters, vol. 15, no. 9, p. 094046, 2020
work page 2020
-
[6]
Optimally scheduling public safety power shutoffs,
A. Lesage-Landry, F. Pellerin, D. S. Callaway, and J. A. Taylor, “Optimally scheduling public safety power shutoffs,”Stochastic Systems, vol. 13, no. 4, pp. 438–456, 2023
work page 2023
-
[7]
A. Moreira, F. Pianc ´o, B. Fanzeres, A. Street, R. Jiang, C. Zhao, and M. Heleno, “Distribution system operation amidst wildfire-prone climate conditions under decision-dependent line availability uncertainty,”IEEE Transactions on Power Systems, vol. 39, no. 5, pp. 6522–6538, 2024
work page 2024
-
[8]
Enhancing power system operational resilience against wildfires,
M. Abdelmalak and M. Benidris, “Enhancing power system operational resilience against wildfires,”IEEE Transactions on Industry Applica- tions, vol. 58, no. 2, pp. 1611–1621, 2022
work page 2022
-
[9]
Decision-dependent uncertainty-aware distribution system planning un- der wildfire risk,
F. Pianc ´o, A. Moreira, B. Fanzeres, R. Jiang, C. Zhao, and M. Heleno, “Decision-dependent uncertainty-aware distribution system planning un- der wildfire risk,”IEEE Transactions on Power Systems, 2025
work page 2025
-
[10]
C. Wang, P. Ju, S. Lei, Z. Wang, F. Wu, and Y . Hou, “Markov decision process-based resilience enhancement for distribution systems: An approximate dynamic programming approach,”IEEE Transactions on Smart Grid, vol. 11, no. 3, pp. 2498–2510, 2019
work page 2019
-
[11]
C. Wang, S. Lei, P. Ju, C. Chen, C. Peng, and Y . Hou, “Mdp-based dis- tribution network reconfiguration with renewable distributed generation: Approximate dynamic programming approach,”IEEE Transactions on Smart Grid, vol. 11, no. 4, pp. 3620–3631, 2020
work page 2020
-
[12]
A markov decision process to enhance power system operation resilience during hurricanes,
M. Abdelmalak and M. Benidris, “A markov decision process to enhance power system operation resilience during hurricanes,” in2021 IEEE Power & Energy Society General Meeting (PESGM). IEEE, 2021, pp. 01–05. JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 11
work page 2021
-
[13]
Distributionally robust distribution network configuration under random contingency,
S. Babaei, R. Jiang, and C. Zhao, “Distributionally robust distribution network configuration under random contingency,”IEEE Transactions on Power Systems, vol. 35, no. 5, pp. 3332–3341, 2020
work page 2020
-
[14]
A. Moreira, F. Pianc ´o, B. F. dos Santos, A. Street, R. Jiang, C. Zhao, and M. Heleno, “Distribution system operation amidst wildfire- prone climate conditions under decision-dependent line availability uncertainty - dataset,” 2023. [Online]. Available: https://dx.doi.org/10. 21227/318q-5k50
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.