pith. sign in

arxiv: 2507.05874 · v1 · submitted 2025-07-08 · 💻 cs.LG · cs.SY· eess.SY

Robust Power System State Estimation using Physics-Informed Neural Networks

Pith reviewed 2026-05-19 06:11 UTC · model grok-4.3

classification 💻 cs.LG cs.SYeess.SY
keywords physics-informed neural networkspower system state estimationrobustnessdata manipulation attackspower-flow equationsmachine learning for gridscybersecurity in power systems
0
0 comments X

The pith

Embedding power-flow equations into neural networks boosts state estimation accuracy in power systems by up to 83 percent on unseen data and 93 percent during attacks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a hybrid method that trains neural networks for power system state estimation by adding the governing power-flow equations directly to the loss function. This approach seeks to deliver more accurate voltage and angle estimates than conventional neural networks, both in normal operation and when measurements are faulty or deliberately altered. A sympathetic reader would care because reliable real-time state estimation underpins grid stability, especially as power systems incorporate more variable renewables and face rising cyber threats. The reported gains on held-out data and entirely new networks suggest the physics constraints help the model generalize and resist manipulation at critical points.

Core claim

By embedding the nonlinear power-flow equations into the neural-network loss function, the resulting physics-informed network produces state estimates whose accuracy exceeds that of an otherwise identical neural network by up to 83 percent on unseen subsets of the training distribution, 65 percent on entirely new system topologies, and 93 percent when a critical bus is subjected to data manipulation.

What carries the argument

Physics-informed neural network whose loss combines conventional data-fitting terms with the power-flow equations that relate bus voltages, angles, and injections.

If this is right

  • Accuracy remains higher on measurement patterns never encountered in training.
  • Performance advantage persists when the test system has a different topology from any training example.
  • Estimation error stays lower than a standard network when a critical bus measurement is falsified.
  • The same embedding yields gains under both normal operation and simple fault conditions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The method could reduce the number of sensors required for acceptable accuracy in real-time monitoring.
  • Similar physics embedding might improve state estimation in other networked physical systems such as gas pipelines or water networks.
  • Integration with existing SCADA systems could provide an additional layer of attack detection without extra hardware.
  • The approach may scale to larger transmission networks if the embedded equations are solved efficiently inside the loss.

Load-bearing premise

The power-flow equations are embedded exactly and without approximation error that would break under realistic attack or fault conditions.

What would settle it

Remove the physics loss term entirely and retrain; if the accuracy advantage over the plain neural network disappears on the same test sets and attack scenarios, the central claim is falsified.

Figures

Figures reproduced from arXiv: 2507.05874 by Charalambos Konstantinou, Maria K. Michael, Markos Asprou, Solon Falas.

Figure 1
Figure 1. Figure 1: Model training procedure for the power system state estimation use case. [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Scenario 1 (IEEE 14-bus System) - MAE for steady-state conditions using varying λd, λp, λc weights for data, physics, and constants, respectively, in the loss function. TABLE I: Computational Costs. Power System Training Time (s) Inference Time (ms) IEEE 14 34.375 0.03946 IEEE 118 39.709 0.03479 I. All measurements are derived from a single NVIDIA RTX 4000 Ada Generation GPU. B. Test cases and results A se… view at source ↗
Figure 3
Figure 3. Figure 3: Scenario 2 - Testing datasets with varying generator shutdowns. Models trained on a bus 2 shutdown are tested with load spikes at equal (S2.1) and double the initial shutdown magnitude (S2.2) [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Scenario 3 - Testing datasets with varying generation increases. Models trained on a bus 2 ramp-up are tested with scenarios simulating load drops at equal (S3.1) and double the initial increase (S3.2) [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Scenario 4 - Testing datasets for 3-phase fault, with different per-unit resistance and locations. Graphs represent the testing set, plus Scenario S4.1 to S4.5 from left to right. of the model to unknown states of the system [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Scenario 5 - Testing datasets, showing the local and global accuracy of the models during Data Manipulation Attacks. can act as a filter against such erroneous data. Such filtering properties enhance continuous and accurate state monitoring capabilities, even under data communication disruptions. To evaluate the robustness of the IEEE 14-bus system’s state estimation against data manipulation attacks, we t… view at source ↗
Figure 9
Figure 9. Figure 9: Scenario 8 - Testing datasets for 3-phase fault. Graphs represent the testing set, plus Scenario S8.1 [PITH_FULL_IMAGE:figures/full_fig_p009_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Scenario 9 - Testing datasets showing the local and global accuracy of the models during Data Manipulation Attacks. the increasing data manipulation magnitude on bus 54. The PINN shows considerable improvements of up to ∼ 76.11% in accurately predicting the system-wide state. 3) Experimental Results Discussion: The PINN has been comprehensively tested in a large variety of datasets and has shown adequate … view at source ↗
Figure 8
Figure 8. Figure 8: Scenario 7 - Testing datasets with generator shutdown. Models trained on a bus 26 shutdown are tested with load spikes at equal (S7.1) the initial shutdown magnitude. different location, bus 33. The PINN generates more accurate results when tested against both the unknown subset of the training dataset and the completely unknown dataset, with a ∼ 37.67% improvement on the testing dataset. Finally, Scenario… view at source ↗
read the original abstract

Modern power systems face significant challenges in state estimation and real-time monitoring, particularly regarding response speed and accuracy under faulty conditions or cyber-attacks. This paper proposes a hybrid approach using physics-informed neural networks (PINNs) to enhance the accuracy and robustness, of power system state estimation. By embedding physical laws into the neural network architecture, PINNs improve estimation accuracy for transmission grid applications under both normal and faulty conditions, while also showing potential in addressing security concerns such as data manipulation attacks. Experimental results show that the proposed approach outperforms traditional machine learning models, achieving up to 83% higher accuracy on unseen subsets of the training dataset and 65% better performance on entirely new, unrelated datasets. Experiments also show that during a data manipulation attack against a critical bus in a system, the PINN can be up to 93% more accurate than an equivalent neural network.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. This paper proposes a hybrid physics-informed neural network (PINN) approach for power system state estimation. Physical laws in the form of power-flow equations are embedded as a residual term in the neural network loss function to improve accuracy and robustness under normal operation, faults, and data manipulation attacks. The central empirical claims are that the method outperforms traditional machine learning models by up to 83% on unseen subsets of the training dataset, 65% on entirely new unrelated datasets, and 93% during a data manipulation attack on a critical bus.

Significance. If the reported gains can be reproduced with full methodological transparency, the work would demonstrate a practical benefit of soft physics constraints for generalization and attack resilience in power-grid monitoring. The approach follows the standard PINN template and does not introduce new theoretical machinery, so its primary value would lie in the empirical validation on realistic grid models and attack scenarios.

major comments (2)
  1. [§4] §4 (PINN formulation) and the loss-function description: the embedding of the power-flow equations is presented only as an additive residual penalty without specifying whether the weighting hyperparameter is fixed or tuned, whether the AC equations are used in full nonlinear form or linearized, and whether any hard constraints or projection steps are applied. This detail is load-bearing for the 93% attack-robustness claim, because a standard soft penalty permits the network to trade off small nonzero physics residuals against fitting corrupted measurements.
  2. [§5.3] §5.3 (attack experiments) and Table reporting attack results: the 93% accuracy improvement is stated without the exact attack model (magnitude and location of manipulated measurements), the architecture and regularization details of the “equivalent neural network” baseline, error bars, or statistical significance tests. Without these controls it is impossible to attribute the gain specifically to the physics term rather than to generic regularization or outlier handling.
minor comments (2)
  1. [Abstract] Abstract: the accuracy metric underlying the 83%, 65%, and 93% figures is never defined (MSE, MAE, voltage-angle error, etc.).
  2. [§2] Notation: the symbols for voltage magnitudes, angles, and power injections are introduced without a consistent table of definitions or reference to standard power-system notation.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which highlight areas where additional methodological detail will strengthen the paper. We respond point-by-point to the major comments and commit to revisions that improve transparency without altering the core claims or experimental design.

read point-by-point responses
  1. Referee: §4 (PINN formulation) and the loss-function description: the embedding of the power-flow equations is presented only as an additive residual penalty without specifying whether the weighting hyperparameter is fixed or tuned, whether the AC equations are used in full nonlinear form or linearized, and whether any hard constraints or projection steps are applied. This detail is load-bearing for the 93% attack-robustness claim, because a standard soft penalty permits the network to trade off small nonzero physics residuals against fitting corrupted measurements.

    Authors: We agree that these implementation details were insufficiently specified. In the revised manuscript we will expand Section 4 to state that the weighting hyperparameter is tuned via cross-validation on a validation split, that the residual term employs the full nonlinear AC power-flow equations, and that the formulation uses only a soft penalty with no hard constraints or projection steps. We will also report the specific tuned value and include a short sensitivity study showing how performance varies with the weight, thereby supporting the robustness results under data manipulation. revision: yes

  2. Referee: §5.3 (attack experiments) and Table reporting attack results: the 93% accuracy improvement is stated without the exact attack model (magnitude and location of manipulated measurements), the architecture and regularization details of the “equivalent neural network” baseline, error bars, or statistical significance tests. Without these controls it is impossible to attribute the gain specifically to the physics term rather than to generic regularization or outlier handling.

    Authors: We concur that the attack section requires greater specificity to allow readers to attribute improvements to the physics term. The revised version will specify the attack model (magnitude and exact bus location of the manipulated measurements), confirm that the baseline neural network uses an identical architecture and regularization schedule except for the absence of the physics residual, report standard deviations across repeated runs with different random seeds, and add statistical significance tests (e.g., paired t-tests) comparing the two models. These additions will be placed in Section 5.3 and the associated table. revision: yes

Circularity Check

0 steps flagged

No significant circularity in the proposed PINN method or experimental claims

full rationale

The paper presents a hybrid PINN for power system state estimation by embedding power-flow equations as a residual term in the network loss, following the standard PINN construction of data loss plus physics loss. Performance claims (83% higher accuracy on unseen subsets, 65% on new datasets, 93% under data manipulation attack) are presented as direct experimental outcomes from comparisons against baseline neural networks on specific test cases, not as quantities derived by algebraic reduction or by fitting parameters that are then relabeled as predictions. No equations or steps in the described approach reduce the robustness improvements to self-definitions, fitted inputs called predictions, or load-bearing self-citations; the physical embedding is an independent modeling choice whose effectiveness is assessed externally via held-out data and attack simulations. The derivation chain therefore remains self-contained and does not collapse to its inputs by construction.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The central claim rests on the assumption that standard neural-network training plus a physics residual term produces robust estimates. No new entities are introduced. Hyperparameters of the network and weighting of the physics term are free parameters whose values are not reported.

free parameters (2)
  • physics-loss weighting coefficient
    Controls the trade-off between data fit and satisfaction of power-flow equations; value not stated in abstract.
  • neural-network architecture hyperparameters
    Layer count, width, activation functions, and optimizer settings are chosen but not disclosed.
axioms (2)
  • domain assumption Power-flow equations can be evaluated pointwise inside the neural-network loss without significant discretization error.
    Invoked when the abstract states that physical laws are embedded into the architecture.
  • domain assumption Training and test distributions are sufficiently similar for the reported generalization claims to hold.
    Required for the 65% improvement on entirely new datasets.

pith-pipeline@v0.9.0 · 5690 in / 1473 out tokens · 40186 ms · 2026-05-19T06:11:53.895841+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

33 extracted references · 33 canonical work pages · 1 internal anchor

  1. [1]

    A comprehensive review of recent advances in smart grids: A sustainable future with renewable energy resources,

    I. Alotaibi et al., “A comprehensive review of recent advances in smart grids: A sustainable future with renewable energy resources,” Energies, vol. 13, no. 23, p. 6269, 2020

  2. [2]

    State estimation in electric power grids: Meeting new challenges presented by the requirements of the future grid,

    Y .-F. Huang et al. , “State estimation in electric power grids: Meeting new challenges presented by the requirements of the future grid,” IEEE Signal Processing Magazine , vol. 29, no. 5, pp. 33–43, 2012

  3. [3]

    Smart Grid Security: Threats, Challenges, and Solutions

    A. Sanjab et al. , “Smart grid security: Threats, challenges, and solu- tions,” arXiv preprint arXiv:1606.06992 , 2016

  4. [4]

    Malicious data attacks on smart grid state estimation: Attack strategies and countermeasures,

    O. Kosut et al., “Malicious data attacks on smart grid state estimation: Attack strategies and countermeasures,” in 2010 first IEEE international conference on smart grid communications . IEEE, 2010, pp. 220–225

  5. [5]

    State estimation of smart power grid: A literature survey,

    N. Khanam and M. Rihan, “State estimation of smart power grid: A literature survey,” in 2022 IEEE 2nd International Conference on Sustainable Energy and Future Electric Transportation (SeFeT). IEEE, 2022, pp. 1–6

  6. [6]

    Security of wide-area monitoring, protection, and control (wampac) systems of the smart grid: A survey on challenges and opportunities,

    S. Vahidi et al. , “Security of wide-area monitoring, protection, and control (wampac) systems of the smart grid: A survey on challenges and opportunities,” IEEE Communications Surveys & Tutorials , vol. 25, no. 2, pp. 1294–1335, 2023

  7. [7]

    A review of false data injection attacks against modern power systems,

    G. Liang et al., “A review of false data injection attacks against modern power systems,” IEEE Transactions on Smart Grid , vol. 8, no. 4, pp. 1630–1638, 2016

  8. [8]

    Analysis of the cyber attack on the ukrainian power grid,

    D. U. Case, “Analysis of the cyber attack on the ukrainian power grid,” Electricity information sharing and analysis center (E-ISAC) , vol. 388, no. 1-29, p. 3, 2016

  9. [9]

    {BlackIoT}:{IoT} botnet of high wattage devices can disrupt the power grid,

    S. Soltan, P. Mittal, and H. V . Poor, “ {BlackIoT}:{IoT} botnet of high wattage devices can disrupt the power grid,” in 27th USENIX Security Symposium (USENIX Security 18) , 2018, pp. 15–32

  10. [10]

    Robust data-driven state estimation for smart grid,

    Y . Weng et al. , “Robust data-driven state estimation for smart grid,” IEEE Transactions on Smart Grid , vol. 8, no. 4, pp. 1956–1967, 2016

  11. [11]

    Bayesian state estimation for unobservable distribution systems via deep learning,

    K. R. Mestav, J. Luengo-Rozas, and L. Tong, “Bayesian state estimation for unobservable distribution systems via deep learning,” IEEE Trans- actions on Power Systems , vol. 34, no. 6, pp. 4910–4920, 2019

  12. [12]

    Real-time power system state estimation and forecasting via deep unrolled neural networks,

    L. Zhang, G. Wang, and G. B. Giannakis, “Real-time power system state estimation and forecasting via deep unrolled neural networks,” IEEE Transactions on Signal Processing, vol. 67, no. 15, pp. 4069–4077, 2019

  13. [13]

    The old and the new: Can physics-informed deep-learning replace traditional linear solvers?

    S. Markidis, “The old and the new: Can physics-informed deep-learning replace traditional linear solvers?” Frontiers in big Data , vol. 4, p. 669097, 2021

  14. [14]

    Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations,

    M. Raissi, P. Perdikaris, and G. E. Karniadakis, “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations,” Journal of Computational physics , vol. 378, pp. 686–707, 2019

  15. [15]

    A neural network-aided functional model of photovoltaic arrays for a wide range of atmospheric conditions,

    A. Angulo, M. Huerta, and F. Mancilla-David, “A neural network-aided functional model of photovoltaic arrays for a wide range of atmospheric conditions,” IEEE Transactions on Industrial Informatics, vol. 20, no. 2, pp. 2487–2496, 2024

  16. [16]

    Online dynamic security assessment: Using hybrid physics-guided deep learning models,

    G. Lu and S. Bu, “Online dynamic security assessment: Using hybrid physics-guided deep learning models,” IEEE Transactions on Industrial Informatics, vol. 20, no. 11, pp. 13 190–13 200, 2024

  17. [17]

    Physics-informed neural networks for power systems,

    G. S. Misyris, A. Venzke, and S. Chatzivasileiadis, “Physics-informed neural networks for power systems,” in 2020 IEEE Power & Energy Society General Meeting (PESGM) , 2020, pp. 1–5

  18. [18]

    A survey on state estimation techniques and challenges in smart distribution systems,

    K. Dehghanpour et al. , “A survey on state estimation techniques and challenges in smart distribution systems,” IEEE Transactions on Smart Grid, vol. 10, no. 2, pp. 2312–2322, 2019

  19. [19]

    Power system dynamic state estimation: Motivations, definitions, methodologies, and future work,

    J. Zhao et al. , “Power system dynamic state estimation: Motivations, definitions, methodologies, and future work,” IEEE Transactions on Power Systems, vol. 34, no. 4, pp. 3188–3198, 2019

  20. [20]

    Applications of physics-informed neural net- works in power systems-a review,

    B. Huang and J. Wang, “Applications of physics-informed neural net- works in power systems-a review,”IEEE Transactions on Power Systems, 2022

  21. [21]

    Physics-informed neural networks for accelerating power system state estimation,

    S. Falas et al., “Physics-informed neural networks for accelerating power system state estimation,” in 2023 IEEE PES Innovative Smart Grid Technologies Europe (ISGT EUROPE). IEEE, 2023, pp. 1–5

  22. [22]

    A comprehensive review on cyber-attacks in power systems: Impact analysis, detection, and cyber security,

    N. Tatipatri and S. Arun, “A comprehensive review on cyber-attacks in power systems: Impact analysis, detection, and cyber security,” IEEE Access, vol. 12, pp. 18 147–18 167, 2024

  23. [23]

    Adversarial artificial intelligence in blind false data injection in smart grid ac state estimation,

    M. Rahman, J. Yan, and E. T. Fapi, “Adversarial artificial intelligence in blind false data injection in smart grid ac state estimation,” IEEE Transactions on Industrial Informatics , 2024

  24. [24]

    Artificial intelligence and physics-based anomaly detection in the smart grid: A survey,

    G. B. Gaggero, P. Girdinio, and M. Marchese, “Artificial intelligence and physics-based anomaly detection in the smart grid: A survey,” IEEE Access, 2025

  25. [25]

    Physics-aware neural networks for distribution system state estimation,

    A. S. Zamzam and N. D. Sidiropoulos, “Physics-aware neural networks for distribution system state estimation,” IEEE Transactions on Power Systems, vol. 35, no. 6, pp. 4347–4356, 2020

  26. [26]

    Physics-informed deep neural network method for limited observability state estimation,

    J. Ostrometzky et al. , “Physics-informed deep neural network method for limited observability state estimation,” 2020. [Online]. Available: https://arxiv.org/abs/1910.06401

  27. [27]

    Physics-informed graphical neural network for parameter & state estimations in power systems,

    L. Pagnier and M. Chertkov, “Physics-informed graphical neural network for parameter & state estimations in power systems,” 2021. [Online]. Available: https://arxiv.org/abs/2102.06349

  28. [28]

    Data-driven priors for robust psse via gauss-newton unrolled neural networks,

    Q. Yang, A. Sadeghi, and G. Wang, “Data-driven priors for robust psse via gauss-newton unrolled neural networks,” IEEE Journal on Emerging and Selected Topics in Circuits and Systems, vol. 12, no. 1, pp. 172–181, 2022

  29. [29]

    Embedding power flow into ma- chine learning for parameter and state estimation,

    L. Pagnier and M. Chertkov, “Embedding power flow into ma- chine learning for parameter and state estimation,” arXiv preprint arXiv:2103.14251, 2021

  30. [30]

    Ai-driven anomaly and intrusion detection in energy systems: Current trends and future direction,

    G. Andronikidis et al. , “Ai-driven anomaly and intrusion detection in energy systems: Current trends and future direction,” in 2024 IEEE International Conference on Cyber Security and Resilience (CSR) . IEEE, 2024, pp. 777–782

  31. [31]

    Standard test systems for modern power system analysis: An overview,

    S. Peyghami et al. , “Standard test systems for modern power system analysis: An overview,” IEEE industrial electronics magazine , vol. 13, no. 4, pp. 86–105, 2019

  32. [32]

    The effect of variable weights in a wls state estimator considering instrument transformer uncertainties,

    M. Asprou, E. Kyriakides, and M. Albu, “The effect of variable weights in a wls state estimator considering instrument transformer uncertainties,” IEEE Transactions on Instrumentation and Measurement, vol. 63, no. 6, pp. 1484–1495, 2013

  33. [33]

    Understanding the difficulty of training deep feedforward neural networks,

    X. Glorot and Y . Bengio, “Understanding the difficulty of training deep feedforward neural networks,” in Proceedings of the thirteenth international conference on artificial intelligence and statistics . JMLR Workshop and Conference Proceedings, 2010, pp. 249–256