pith. sign in

arxiv: 2508.16457 · v2 · submitted 2025-08-22 · 📡 eess.SY · cs.SY

Wide-Area Power System Oscillations from Large-Scale AI Workloads

Pith reviewed 2026-05-18 21:24 UTC · model grok-4.3

classification 📡 eess.SY cs.SY
keywords AI datacenter loadspower system oscillationsgrid stabilityGPU power fluctuationsdynamic load modelingWECC systemNPCC system
0
0 comments X

The pith

Large-scale AI workloads create periodic power fluctuations that can amplify wide-area oscillations in electric power grids.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper develops a method to profile the power demand of AI datacenters based on GPU architecture and training cycles. It shows that these loads produce sustained fluctuations unlike normal ramping loads. These fluctuations can interact with the natural oscillation modes of the power system, potentially making them larger. Using standard test systems representing western and eastern North America, the authors examine how factors like datacenter size and location affect the response. The work provides a way to include growing AI electricity use in studies of grid stability.

Core claim

The central claim is that the periodic stochastic power fluctuations inherent to large-scale AI workloads during training and fine-tuning, driven by GPU computing architecture design and distributed mini-batch processing cycles, act as persistent forcing inputs capable of interacting with and amplifying local and inter-area oscillation modes in power systems. This is demonstrated through numerical studies on the WECC 179-bus and NPCC 140-bus systems under various conditions including system strength, penetration level, fluctuation frequency range, individual datacenter size, geographical deployment, fluctuation suppression level, and workload ratio.

What carries the argument

dynamic power profiling approach that extracts periodic stochastic power fluctuations from GPU architecture and mini-batch cycles to serve as forcing inputs for grid oscillation analysis

If this is right

  • Narrower fluctuation frequency bands intensify oscillations across multiple modes.
  • Larger single-site datacenter capacities increase the amplitude of oscillatory responses.
  • Dispersed geographical siting of datacenters can intensify oscillations across the system.
  • Higher AI load penetration levels and weaker system strength alter the variability of responses to these fluctuations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Grid operators could develop targeted monitoring for frequency bands associated with AI workload cycles.
  • Transmission planners might evaluate datacenter siting policies to reduce risks of exciting inter-area modes.
  • Real-time control schemes could incorporate suppression of specific fluctuation bands as AI loads scale up.

Load-bearing premise

The periodic stochastic power fluctuations extracted from GPU architecture and mini-batch cycles are assumed to be representative forcing inputs that remain unchanged when scaled to the sizes and locations studied in the WECC and NPCC test systems.

What would settle it

Direct measurement of real-time power draw time series from an operating large GPU cluster during AI training, followed by spectral analysis to check if the frequency content and amplitude match the modeled fluctuations or produce the predicted amplification in a connected power system.

Figures

Figures reproduced from arXiv: 2508.16457 by Hao Zhu, Min-Seung Ko.

Figure 1
Figure 1. Figure 1: Example of a hierarchical AI workload composition. [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Example power consumption profiles for (a) training and (b) fine [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Example of AI workload modeling process. [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Topology of the modified WECC 179-bus system with datacenters. [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗
Figure 6
Figure 6. Figure 6: Eigen-analysis results of the WECC 179-bus system under different [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Simulation results according to the datacenter penetration level: (a) electric frequency trajectories, (b) FFT results, (c) Prony analysis results, and (d) [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Inter-area oscillation between GENROU 9, 12 and GENROU 10, 11: [PITH_FULL_IMAGE:figures/full_fig_p007_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Simulation results according to the individual datacenter size: (a) [PITH_FULL_IMAGE:figures/full_fig_p008_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Impact of datacenter fluctuation frequency range on system oscilla [PITH_FULL_IMAGE:figures/full_fig_p008_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Simulation results according to the geographical location of data [PITH_FULL_IMAGE:figures/full_fig_p009_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Stochastic variation of frequency trajectories and FFT results for [PITH_FULL_IMAGE:figures/full_fig_p009_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Example of simulation failure by the low voltage issue. [PITH_FULL_IMAGE:figures/full_fig_p010_13.png] view at source ↗
read the original abstract

This paper develops a new dynamic power profiling approach for modeling AI-centric datacenter loads and analyzing their impact on grid operations, particularly their potential to induce wide-area grid oscillations. We characterize the periodic stochastic power fluctuations inherent to large-scale AI workloads during both the training and fine-tuning stages, driven by the state-of-the-art graphics processing unit (GPU) computing architecture design. % and distributed mini-batch processing cycles. These sustained, large power fluctuations, unlike conventional load ramping, act as persistent forcing inputs capable of interacting with and amplifying local and inter-area oscillation modes. Using the WECC 179-bus system and the NPCC 140-bus system, we have numerically studied the amplitude and variability of oscillatory responses under different factors. These factors include system strength, penetration level, fluctuation frequency range, individual datacenter size, geographical deployment, fluctuation suppression level, and workload ratio. Simulation results show that, notably, narrower fluctuation bands, larger single-site capacities, or dispersed siting can intensify oscillations across multiple modes. Our models and numerical studies provide a quantitative basis for integrating AI-dominant electricity demand into grid oscillation studies and further support the development of new planning and operational measures to power the growth of AI/computing load demands.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper develops a dynamic power profiling approach for AI datacenter loads, characterizing periodic stochastic power fluctuations from GPU architectures and mini-batch cycles during training and fine-tuning. It claims these fluctuations, unlike conventional load changes, act as persistent forcing inputs that interact with and amplify local and inter-area oscillation modes. Numerical studies on the WECC 179-bus and NPCC 140-bus systems examine impacts from factors including system strength, penetration level, fluctuation frequency range, datacenter size, geographical deployment, suppression level, and workload ratio, with results indicating that narrower fluctuation bands, larger single-site capacities, or dispersed siting can intensify oscillations across multiple modes.

Significance. If the results hold, the work is significant for highlighting a potential new class of disturbances from the expanding AI sector and providing a quantitative modeling basis using standard test systems. The architecture-driven extraction of fluctuation characteristics offers a reproducible starting point for integrating computing loads into grid dynamics studies, which could inform future planning and operational measures as AI demand grows.

major comments (2)
  1. [Section on dynamic power profiling] Section on dynamic power profiling (likely §II or §III): The central claim requires that GPU-derived periodic stochastic fluctuations remain representative forcing inputs after amplitude scaling to large single-site datacenter capacities and dispersed locations in the WECC/NPCC studies. No analysis is provided on aggregation effects across many GPUs (e.g., phase randomization, averaging, or low-pass filtering) that could narrow or shift the frequency content away from the 0.1–2 Hz inter-area mode range, which is load-bearing for the reported mode intensification.
  2. [Numerical studies section] Numerical studies section (likely §IV): Simulation results on oscillation intensification under narrower fluctuation bands and dispersed siting are presented without validation against measured datacenter power traces or sensitivity checks on how fluctuation parameters were selected. This leaves open whether the observed interactions depend on post-hoc tuning of the free parameters (fluctuation frequency range and amplitude) rather than robust properties of the model.
minor comments (2)
  1. [Abstract] Abstract: A stray LaTeX comment marker '% and distributed mini-batch processing cycles' appears in the text and should be cleaned up for publication.
  2. [Results presentation] Figure captions and results presentation: The abstract and studies mention amplitude and variability of oscillatory responses but lack explicit mention of error bars, number of Monte Carlo runs, or statistical significance measures, which would improve clarity even if not altering the main findings.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed comments, which help strengthen the manuscript. We address each major comment below, proposing targeted revisions to improve clarity and robustness while preserving the core contributions.

read point-by-point responses
  1. Referee: The central claim requires that GPU-derived periodic stochastic fluctuations remain representative forcing inputs after amplitude scaling to large single-site datacenter capacities and dispersed locations in the WECC/NPCC studies. No analysis is provided on aggregation effects across many GPUs (e.g., phase randomization, averaging, or low-pass filtering) that could narrow or shift the frequency content away from the 0.1–2 Hz inter-area mode range.

    Authors: We agree that aggregation effects merit explicit discussion. Our dynamic power profiling in Section II models the datacenter as a single aggregated load whose fluctuation statistics are derived from GPU architecture and synchronized mini-batch cycles during training/fine-tuning; these cycles introduce temporal alignment that tends to preserve spectral content in the 0.1–2 Hz band even after scaling. Nevertheless, to address the referee’s concern directly, we will add a short subsection (or appendix paragraph) examining the impact of phase randomization and averaging, supported by references to existing power-system load-aggregation studies, and will note the conditions under which the reported frequency content remains representative. revision: partial

  2. Referee: Simulation results on oscillation intensification under narrower fluctuation bands and dispersed siting are presented without validation against measured datacenter power traces or sensitivity checks on how fluctuation parameters were selected. This leaves open whether the observed interactions depend on post-hoc tuning of the free parameters (fluctuation frequency range and amplitude) rather than robust properties of the model.

    Authors: The fluctuation frequency ranges and amplitudes are obtained from the GPU-level power measurements and workload-cycle timings described in Section II, not chosen post hoc to produce the reported effects. We acknowledge that proprietary measured traces from large-scale AI facilities are not publicly available for direct validation. In the revised manuscript we will therefore include a dedicated sensitivity study that systematically varies the frequency-band edges and amplitude scaling factors over physically plausible intervals (drawn from published GPU power traces) and will demonstrate that the mode-amplification trends persist across these ranges. revision: partial

Circularity Check

0 steps flagged

No circularity: simulation-driven analysis of modeled AI load fluctuations on standard test systems

full rationale

The paper extracts periodic stochastic power fluctuations from GPU architecture and mini-batch cycles, then injects scaled versions of these time series as forcing inputs into the WECC 179-bus and NPCC 140-bus models. All reported results are obtained from time-domain or modal simulations that vary amplitude, frequency band, penetration, siting, and suppression level; none of the oscillation metrics are obtained by fitting parameters to the target modes or by renaming an input quantity as a prediction. The derivation chain therefore remains self-contained and externally falsifiable against the chosen test-system dynamics.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

Abstract-only review prevents exhaustive extraction; the central claim rests on the unverified assumption that GPU-driven fluctuation statistics transfer directly to grid-scale forcing inputs and on the standard small-signal stability models of the two test systems.

free parameters (1)
  • fluctuation frequency range and amplitude
    Parameters characterizing periodic stochastic power swings are introduced to drive the oscillation studies; their specific values are not stated in the abstract.
axioms (1)
  • domain assumption Standard small-signal stability models of the WECC 179-bus and NPCC 140-bus systems accurately capture inter-area modes under new load inputs.
    The numerical studies rely on these established test systems without additional validation mentioned.

pith-pipeline@v0.9.0 · 5743 in / 1280 out tokens · 36386 ms · 2026-05-18T21:24:07.973525+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Dynamic Modeling of Data-Center Power Delivery for Power System Resonance Analysis

    eess.SY 2026-04 unverdicted novelty 7.0

    Derives an explicit component-informed dynamic model of data-center power-delivery chains in the positive-sequence domain to reveal resonance mechanisms from server-load fluctuations.

  2. Grid Integration of Gigawatt-Scale AI Data Centers under Connect-and-Manage

    eess.SY 2026-05 conditional novelty 6.0

    A hierarchical request-acceptance protocol with learning-based planning and robust TSO evaluation reduces curtailment for GW-scale AI data centers from 9.1% to 2.8% while preserving 98.1% of frontier training workload.

  3. Regime-Adaptive Weighted Ensemble Learning for Computing-Driven Dynamic Load Forecasting in AI Data Centers

    eess.SY 2026-04 unverdicted novelty 5.0

    A regime-adaptive ensemble with weight-learned neural network and incremental feature engineering reduces minute-scale AI data center load forecasting errors to below 1% on the MIT Supercloud dataset.

Reference graph

Works this paper leans on

38 extracted references · 38 canonical work pages · cited by 3 Pith papers

  1. [1]

    The unseen AI dis- ruptions for power grids: LLM-induced transients,

    Y . Li, M. Mughees, Y . Chen, and Y . R. Li, “The unseen AI dis- ruptions for power grids: LLM-induced transients,” arXiv preprint arXiv:2409.11416, 2024

  2. [2]

    Practical guidance and considerations for large load interconnections,

    R. Quint et al. , “Practical guidance and considerations for large load interconnections,” Elevate Energy Consulting, Tech. Rep., 2025

  3. [3]

    2024 United States data center energy usage report,

    A. Shehabi et al., “2024 United States data center energy usage report,” Lawrence Berkeley National Laboratory, Berkeley, CA, Tech. Rep., 2024

  4. [4]

    Powering intelligence: Analyzing artificial intelligence and data center energy consumption,

    J. Aljbour, T. Wilson, and P. Patel, “Powering intelligence: Analyzing artificial intelligence and data center energy consumption,” EPRI White Paper no. 3002028905, Tech. Rep., 2024

  5. [5]

    AI load dynamics–a power electronics perspective,

    Y . Li and Y . Li, “AI load dynamics–a power electronics perspective,” arXiv preprint arXiv:2502.01647 , 2025

  6. [6]

    Data center model for transient stability analysis of power systems,

    A. Jimenez-Ruiz and F. Milano, “Data center model for transient stability analysis of power systems,” arXiv preprint arXiv:2505.16575 , 2025

  7. [7]

    Event records showing data center response to faults,

    R. O’Keefe, “Event records showing data center response to faults,”

  8. [8]

    Available: https://www.nerc.com/comm/RSTC/LLTF/ LLTF April Meeting & Technical Workshop Presentations .pdf

    [Online]. Available: https://www.nerc.com/comm/RSTC/LLTF/ LLTF April Meeting & Technical Workshop Presentations .pdf

  9. [9]

    Unplanned data center load transfer update,

    M. Parker and B. Sterling, “Unplanned data center load transfer update,” 2025. [Online]. Available: https://www.nerc.com/comm/RSTC/ LLTF/LLTF June Workshop Presentations.pdf

  10. [10]

    Integration and interaction of next-generation AI-focused data centers with smart grids and district energy systems: The state-of-the-art, opportunities and challenges,

    Y . Zhang, H. Tang, H. Li, and S. Wang, “Integration and interaction of next-generation AI-focused data centers with smart grids and district energy systems: The state-of-the-art, opportunities and challenges,” Renewable and Sustainable Energy Reviews , vol. 224, p. 116097, 2025

  11. [11]

    Balance of power: A full-stack approach to power and thermal fluctuations in ml infrastructure,

    H. Gan and P. Ranganathan, “Balance of power: A full-stack approach to power and thermal fluctuations in ml infrastructure,”

  12. [12]

    Available: https://cloud.google.com/blog/topics/systems/ mitigating-power-and-thermal-fluctuations-in-ml-infrastructure?hl=en

    [Online]. Available: https://cloud.google.com/blog/topics/systems/ mitigating-power-and-thermal-fluctuations-in-ml-infrastructure?hl=en

  13. [13]

    Characteristics and risks of emerging large loads,

    NERC Large Load Task Force, “Characteristics and risks of emerging large loads,” North American Electric Reliability Corporation (NERC), Tech. Rep., 2025

  14. [14]

    Characterizing power management opportunities for LLMs in the cloud,

    P. Patel et al. , “Characterizing power management opportunities for LLMs in the cloud,” in Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3 , 2024, pp. 207–222

  15. [15]

    Energy and AI,

    D. D’Ambrosio et al. , “Energy and AI,” International Energy Agency (IEA), Tech. Rep., 2025

  16. [16]

    AI, data centers and energy demand: Reassessing and exploring the trends,

    L. de Roucy-Rochegonde and A. Buffard, “AI, data centers and energy demand: Reassessing and exploring the trends,” Ifri Papers, 2025

  17. [17]

    Artificial intelligence: Supply chain constraints and energy implications,

    A. de Vries-Gao, “Artificial intelligence: Supply chain constraints and energy implications,” Joule, vol. 9, no. 6, 2025

  18. [18]

    En- ergy and carbon considerations of fine-tuning BERT,

    X. Wang, C. Na, E. Strubell, S. Friedler, and S. Luccioni, “En- ergy and carbon considerations of fine-tuning BERT,” arXiv preprint arXiv:2311.10267, 2023

  19. [19]

    Characterization and prediction of deep learning workloads in large-scale GPU datacenters,

    Q. Hu, P. Sun, S. Yan, Y . Wen, and T. Zhang, “Characterization and prediction of deep learning workloads in large-scale GPU datacenters,” in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , 2021, pp. 1–15

  20. [20]

    Scaling intelligence: The exponential growth of AI’s power needs,

    J. You et al. , “Scaling intelligence: The exponential growth of AI’s power needs,” EPRI White Paper no. 3002033669, Tech. Rep., 2025

  21. [21]

    The AI disruption: Challenges and guidance for data center design,

    V . Avelar, P. Donovan, P. Lin, W. Torell, and M. A. T. Arango, “The AI disruption: Challenges and guidance for data center design,” Artificial Intelligence in Medicine , vol. 138, 2023

  22. [22]

    Inference zones: How data centers support real- time AI,

    B. Eichman, “Inference zones: How data centers support real- time AI,” 2024. [Online]. Available: https://www.coresite.com/blog/ inference-zones-how-data-centers-support-real-time-ai

  23. [23]

    Characterization of large language model development in the datacenter,

    Q. Hu et al. , “Characterization of large language model development in the datacenter,” in 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI 24) , 2024, pp. 709–729

  24. [24]

    Empirical measurements of AI training power demand on a GPU-accelerated node,

    I. Latif, A. C. Newkirk, M. R. Carbone, A. Munir, Y . Lin, J. Koomey, X. Yu, and Z. Dong, “Empirical measurements of AI training power demand on a GPU-accelerated node,” arXiv preprint arXiv:2412.08602, 2024

  25. [25]

    Methodology for fine- grain GPU power visibility and insights,

    V . Singhania, S. Aga, and M. Assem Ibrahim, “Methodology for fine- grain GPU power visibility and insights,”arXiv e-prints, pp. arXiv–2412, 2024

  26. [26]

    Understanding GPU power: A survey of profiling, modeling, and simulation methods,

    R. A. Bridges, N. Imam, and T. M. Mintz, “Understanding GPU power: A survey of profiling, modeling, and simulation methods,” ACM Computing Surveys (CSUR) , vol. 49, no. 3, pp. 1–27, 2016

  27. [27]

    Not all GPUs are created equal: characterizing variability in large-scale, accelerator-rich systems,

    P. Sinha, A. Guliani, R. Jain, B. Tran, M. D. Sinclair, and S. Venkatara- man, “Not all GPUs are created equal: characterizing variability in large-scale, accelerator-rich systems,” inSC22: International Conference for High Performance Computing, Networking, Storage and Analysis . IEEE, 2022, pp. 01–15. 11

  28. [28]

    Problems and opportunities in training deep learning software systems: An analysis of variance,

    H. V . Pham et al., “Problems and opportunities in training deep learning software systems: An analysis of variance,” in Proceedings of the 35th IEEE/ACM international conference on automated software engineering, 2020, pp. 771–783

  29. [29]

    PAL: A variability-aware policy for scheduling ML workloads in GPU clusters,

    R. Jain, B. Tran, K. Chen, M. D. Sinclair, and S. Venkataraman, “PAL: A variability-aware policy for scheduling ML workloads in GPU clusters,” in SC24: International Conference for High Performance Computing, Networking, Storage and Analysis . IEEE, 2024, pp. 1–18

  30. [30]

    FinGraV: Methodology for fine-grain gpu power visibility and insights,

    V . Singhania, S. Aga, and M. A. Ibrahim, “FinGraV: Methodology for fine-grain gpu power visibility and insights,” in 2025 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). IEEE, 2025, pp. 96–107

  31. [31]

    Battery storage applications at data centers,

    S. G. Vennelaganti and S. Jones, “Battery storage applications at data centers,” 2025. [Online]. Avail- able: https://www.nerc.com/comm/RSTC/LLTF/LLTF April Meeting & Technical Workshop Presentations .pdf

  32. [32]

    AI-enabling workloads on large-scale GPU-accelerated system: characterization, opportunities, and implications,

    B. Li, R. Arora, S. Samsi, T. Patel, W. Arcand, D. Bestor, C. Byun, R. B. Roy, B. Bergeron, J. Holodnak et al. , “AI-enabling workloads on large-scale GPU-accelerated system: characterization, opportunities, and implications,” in 2022 IEEE International Symposium on High- Performance Computer Architecture (HPCA) . IEEE, 2022, pp. 1224– 1237

  33. [33]

    Power modeling for effective datacenter planning and compute management,

    A. Radovanovic, B. Chen, S. Talukdar, B. Roy, A. Duarte, and M. Shah- bazi, “Power modeling for effective datacenter planning and compute management,” IEEE Transactions on Smart Grid , vol. 13, no. 2, pp. 1611–1621, 2021

  34. [34]

    AI datacenter energy dilemma - race for AI datacenter space,

    D. N. D. Patel and J. E. Ontiveros, “AI datacenter energy dilemma - race for AI datacenter space,” 2024. [Online]. Available: https: //semianalysis.com/2024/03/13/ai-datacenter-energy-dilemma-race

  35. [35]

    Hybrid symbolic-numeric framework for power system modeling and analysis,

    H. Cui, F. Li, and K. Tomsovic, “Hybrid symbolic-numeric framework for power system modeling and analysis,” IEEE Transactions on Power Systems, vol. 36, no. 2, pp. 1373–1384, 2020

  36. [36]

    AI’s power requirements under exponential growth,

    L. H. Konstantin F. Pilz, Yusuf Mahmood, “AI’s power requirements under exponential growth,” RAND, Tech. Rep., 2025. [Online]. Available: https://www.rand.org/pubs/research reports/RRA3572-1.html

  37. [37]

    Effects of decreasing syn- chronous inertia on power system dynamics—overview of recent ex- periences and marketisation of services,

    B. Hartmann, I. V okony, and I. T ´aczi, “Effects of decreasing syn- chronous inertia on power system dynamics—overview of recent ex- periences and marketisation of services,” International Transactions on Electrical Energy Systems , vol. 29, no. 12, p. e12128, 2019

  38. [38]

    Performance of three mode-meter block-processing algorithms for automated dynamic stability assessment,

    D. J. Trudnowski, J. W. Pierre, N. Zhou, J. F. Hauer, and M. Parashar, “Performance of three mode-meter block-processing algorithms for automated dynamic stability assessment,” IEEE Transactions on Power Systems, vol. 23, no. 2, pp. 680–690, 2008