pith. sign in

arxiv: 2601.22487 · v2 · submitted 2026-01-30 · 💻 cs.DC

Coordinating GPU Data Centers and Power Grid Regulation Service for Exogenous Carbon Benefits

Pith reviewed 2026-05-16 10:01 UTC · model grok-4.3

classification 💻 cs.DC
keywords data centersfrequency regulationcarbon emissionspower gridexogenous carbonGPU computingEcoCenterAI energy consumption
0
0 comments X

The pith

GPU data centers can provide frequency regulation to power grids and produce exogenous carbon savings that exceed their own operational emissions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that GPU data centers can coordinate their power draw with grid needs to supply frequency regulation services normally provided by fossil-fueled plants. It defines an Exogenous Carbon metric to capture the resulting grid-wide emission reductions and introduces the EcoCenter framework to maximize the regulation capacity data centers can offer. Demonstrations indicate these grid-side savings can outweigh the carbon emitted by the data centers themselves during operation. A reader would care because the approach turns high-energy computing facilities into tools for grid decarbonization rather than just sources of demand.

Core claim

GPU data centers can flexibly modulate their power consumption to deliver frequency regulation reserves to the power grid, which reduces the amount of fossil-fueled generation required for that service and thereby generates exogenous carbon savings that can exceed the data centers' operational carbon emissions.

What carries the argument

The Exogenous Carbon metric, which quantifies grid-side carbon reductions from data center regulation participation, together with the EcoCenter optimization framework that maximizes allowable regulation provision.

If this is right

  • Data centers can become net carbon reducers for the grid rather than pure consumers.
  • Fewer fossil peaker plants are needed to maintain frequency stability.
  • AI workloads can be scheduled to supply regulation during periods of high grid need.
  • Grid operators gain a flexible, low-carbon alternative to traditional reserves.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Operators may site future data centers near grids with large regulation requirements to amplify savings.
  • New contracts could let data centers sell regulation capacity as a service to utilities.
  • The same coordination logic could extend to non-GPU high-power facilities if power modulation is feasible.
  • Policy incentives might emerge that credit data centers for measured exogenous carbon reductions.

Load-bearing premise

GPU data centers can adjust power draw for regulation services without unacceptable impacts on computing performance or that grid carbon-intensity models accurately capture the displaced fossil generation.

What would settle it

Real-world measurements from a data center providing regulation service that show no measurable drop in fossil-plant output or that document computing performance degradation large enough to make participation impractical.

Figures

Figures reproduced from arXiv: 2601.22487 by Ali Jahanshahi, Daniel Wong, Nanpeng Yu, Osten Anderson, Sara Rashidi Golrouye.

Figure 1
Figure 1. Figure 1: The growth of renewable solar and wind energy in [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Various types of demand response (DR) services [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗
Figure 4
Figure 4. Figure 4: The “hidden emissions” of grid regulation reserves [PITH_FULL_IMAGE:figures/full_fig_p004_4.png] view at source ↗
Figure 6
Figure 6. Figure 6: EcoCenter resource management. Dashed blue boxes show ROCm modifications to provide power capping and core allocations APIs for the Controller to reshape server power. its power based on the regulation signal. At the end of each hour, the power grid credits the data centers with a monetary reward based on the performance score, which is taken into account in electricity cost savings on the data center. We … view at source ↗
Figure 5
Figure 5. Figure 5: EcoCenter overview. Red boxes show the inputs to the framework and the green boxes show the output. 4.1 System Overview EcoCenter overcomes GPU’s power modulation limitations by coordinating GPU DVFS, compute unit scaling, and multi-GPU load assignment as novel power modulation knobs [PITH_FULL_IMAGE:figures/full_fig_p005_5.png] view at source ↗
Figure 7
Figure 7. Figure 7: (a) GPT2 model training power model (b) GPT2 [PITH_FULL_IMAGE:figures/full_fig_p006_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: EcoCenter power reshaping policy with 8 GPUs. Workload Monitor monitors the LC resources, while Resource Manager (directly by Controller) reshapes the BE workload. to regulation provision. Unlike CPUs, which includes a plethora of low-power state knobs (DVFS, C-states, fine-grain power gating, preemption, etc.) to optimize non-peak utilization, GPU’s power knobs are limited to power capping and CU scaling,… view at source ↗
Figure 9
Figure 9. Figure 9: Regulation signals used in evaluation with varying [PITH_FULL_IMAGE:figures/full_fig_p009_9.png] view at source ↗
Figure 11
Figure 11. Figure 11: Comparison of Exogenous carbon savings with data center carbon emissions at different operational loads. [PITH_FULL_IMAGE:figures/full_fig_p010_11.png] view at source ↗
Figure 14
Figure 14. Figure 14: Normalized TCO w.r.t. baseline data center shows [PITH_FULL_IMAGE:figures/full_fig_p011_14.png] view at source ↗
read the original abstract

The rapid growth of AI/ML data centers has led to higher energy consumption and carbon emissions. The shift to renewable energy and growing data center energy demands can destabilize the power grid. Power grids rely on frequency regulation reserves, typically fossil-fueled power plants, to stabilize and balance the supply and demand of electricity. This paper sheds light on the hidden carbon emissions of frequency regulation service. Our work explores how modern GPU data centers can coordinate with power grids to reduce the need for fossil-fueled frequency regulation reserves. We first introduce a novel metric, Exogenous Carbon, to quantify grid-side carbon emission reductions resulting from data center participation in regulation service. We additionally introduce EcoCenter, a framework to maximize the amount of frequency regulation provision that GPU data centers can provide, and thus, reduce the amount of frequency regulation reserves necessary. We demonstrate that data center participation in frequency regulation can result in Exogenous carbon savings that can outweigh operational carbon emissions

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript introduces an 'Exogenous Carbon' metric to quantify grid-side carbon emission reductions achieved when GPU data centers supply frequency regulation reserves, proposes the EcoCenter framework to maximize such provision, and claims that the resulting savings can outweigh the data centers' operational carbon emissions.

Significance. If the displacement assumptions hold under realistic market conditions, the work could demonstrate a practical mechanism for data centers to contribute to grid stability while delivering net carbon benefits, offering a new angle on sustainable AI infrastructure.

major comments (3)
  1. [§4] §4: The definition of Exogenous Carbon uses a linear mapping from regulation provision to displaced reserves that assumes a displacement factor of exactly 1.0 with fossil marginal units. This assumption is load-bearing for the headline claim that savings outweigh emissions, yet the text provides no market simulation, sensitivity analysis on the factor, or validation against co-optimized energy/ancillary markets where batteries or hydro may be marginal.
  2. [Abstract and §5] Abstract and §5: The assertion that data-center participation yields net carbon savings is presented without accompanying methods, input data, error bars, or quantitative results, leaving the central quantitative demonstration unsupported in the available sections.
  3. [EcoCenter framework] EcoCenter framework description: No analysis is given of the performance or latency impact on GPU workloads when power draw is modulated for regulation services, which is essential to establish feasibility.
minor comments (2)
  1. [§4] Clarify the exact units, parameters, and any grid-specific assumptions in the Exogenous Carbon formula to improve reproducibility.
  2. [References] Add references to prior literature on data-center demand response for ancillary services and marginal-emission modeling.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback on our work. Below we respond to each major comment and indicate the revisions planned for the manuscript.

read point-by-point responses
  1. Referee: [§4] The definition of Exogenous Carbon uses a linear mapping from regulation provision to displaced reserves that assumes a displacement factor of exactly 1.0 with fossil marginal units. This assumption is load-bearing for the headline claim that savings outweigh emissions, yet the text provides no market simulation, sensitivity analysis on the factor, or validation against co-optimized energy/ancillary markets where batteries or hydro may be marginal.

    Authors: We acknowledge the importance of validating the displacement factor assumption. It is based on the observation that fossil-fueled plants are the primary providers of frequency regulation in many grids. In the revision, we will incorporate a sensitivity analysis on the displacement factor (ranging from 0.5 to 1.2) and discuss implications for markets with significant battery or hydro participation. A brief market simulation will be added to §4 to demonstrate robustness under varying conditions. revision: yes

  2. Referee: [Abstract and §5] The assertion that data-center participation yields net carbon savings is presented without accompanying methods, input data, error bars, or quantitative results, leaving the central quantitative demonstration unsupported in the available sections.

    Authors: The quantitative demonstration relies on trace-driven simulations detailed in §5. To make this fully transparent, we will revise §5 to include a dedicated methods subsection describing the input datasets (power traces from production GPU clusters, grid carbon intensity data), the simulation setup, and results with error bars from repeated experiments under different regulation signal profiles. This will explicitly show how exogenous savings exceed operational emissions. revision: yes

  3. Referee: [EcoCenter framework] No analysis is given of the performance or latency impact on GPU workloads when power draw is modulated for regulation services, which is essential to establish feasibility.

    Authors: We agree that workload impact is critical for feasibility. The EcoCenter framework is designed to modulate power within safe GPU limits without requiring workload changes, drawing on established power management techniques. We will add an analysis subsection with references to prior studies on GPU power capping showing negligible latency impact for batch workloads, and include a short discussion of potential overheads. Full empirical evaluation may be added if space permits in the revision. revision: partial

Circularity Check

0 steps flagged

No significant circularity; metric and framework defined independently of result

full rationale

The paper defines a novel Exogenous Carbon metric to quantify grid-side emission reductions from data-center regulation participation and introduces the EcoCenter framework to maximize regulation provision. The demonstration that these savings can outweigh operational emissions is obtained by applying the defined metric to the modeled coordination scenarios. No equations, self-citations, or fitted parameters are shown that reduce the headline claim to its own inputs by construction. The load-bearing modeling choice (displacement of fossil reserves) is an external assumption rather than a definitional tautology, and the derivation chain remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 1 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are detailed beyond the introduction of the new Exogenous Carbon metric.

invented entities (1)
  • Exogenous Carbon no independent evidence
    purpose: Quantify grid-side carbon emission reductions from data center frequency regulation participation
    Newly defined metric introduced to capture benefits beyond direct data center emissions.

pith-pipeline@v0.9.0 · 5472 in / 1111 out tokens · 61037 ms · 2026-05-16T10:01:39.757418+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

90 extracted references · 90 canonical work pages · 2 internal anchors

  1. [1]

    Powering intelligence: Analyzing artificial intelligence and data center energy consumption,

    J. Aljbour, T. Wilson, and P. Patel, “Powering intelligence: Analyzing artificial intelligence and data center energy consumption, ”EPRI White Paper, 2024

  2. [2]

    Stanley, 2023

    M. Stanley, 2023. [Online]. Available: https://www.datacenterdynamics.com/en/ news/morgan-stanley-data-center-industry-will-emit-25bn-tons-of-co2-by- 2030/

  3. [3]

    Environmental report,

    Google, “Environmental report, ” 2023. [Online]. Available: https://www.gstatic. com/gumdrop/sustainability/google-2023-environmental-report.pdf

  4. [4]

    Powering sustainable transformation,

    Microsoft, “Powering sustainable transformation, ” 2025. [On- line]. Available: https://datacenters.microsoft.com/globe/powering-sustainable- transformation/

  5. [5]

    Electricity data browser,

    U. E. I. A. (EIA), “Electricity data browser, ” https://www.eia.gov/electricity/data/ browser/, 2023

  6. [6]

    Powermorph: Qos-aware server power reshaping for data center regulation service,

    A. Jahanshahi, N. Yu, and D. Wong, “Powermorph: Qos-aware server power reshaping for data center regulation service, ”ACM Transactions on Architecture and Code Optimization (TACO), vol. 19, no. 3, pp. 1–27, 2022

  7. [7]

    Supporting power grids with demand response at google data centers,

    Google, “Supporting power grids with demand response at google data centers, ”

  8. [8]

    Available: https://cloud.google.com/blog/products/infrastructure/ using-demand-response-to-reduce-data-center-power-consumption

    [Online]. Available: https://cloud.google.com/blog/products/infrastructure/ using-demand-response-to-reduce-data-center-power-consumption

  9. [9]

    EnergyQARE: QoS-Aware Data Center Participation in Smart Grid Regulation Service Reserve Provision,

    H. Chen, Y. Zhang, M. C. Caramanis, and A. K. Coskun, “EnergyQARE: QoS-Aware Data Center Participation in Smart Grid Regulation Service Reserve Provision, ”ACM Trans. Model. Perform. Eval. Comput. Syst., vol. 4, no. 1, pp. 2:1–2:31, Jan. 2019. [Online]. Available: http://doi.acm.org/10.1145/3243172

  10. [10]

    Towards improved power management in cloud gpus,

    P. Patel, Z. Gong, S. Rizvi, E. Choukse, P. Misra, T. Anderson, and A. Srira- man, “Towards improved power management in cloud gpus, ”IEEE Computer Architecture Letters, vol. 22, no. 2, pp. 141–144, 2023

  11. [11]

    Clicking clean virginia the dirty energy powering data center alley,

    G. Cook, E. Jardim, and C. Craighill, “Clicking clean virginia the dirty energy powering data center alley, ”Greenpeace A vailable from: https://www. greenpeace. org/usa/reports/click-clean-virginia/[Accessed 21 Mar 2020], 2019

  12. [12]

    Ai is poised to drive 160% increase in data center power demand,

    “Ai is poised to drive 160% increase in data center power demand, ” 2024. [Online]. Available: https://www.goldmansachs.com/insights/articles/AI-poised- to-drive-160-increase-in-power-demand

  13. [13]

    Llama 2: Open Foundation and Fine-Tuned Chat Models

    H. Touvron, L. Martin, K. Stone, P. Albert, A. Almahairi, Y. Babaei, N. Bashlykov, S. Batra, P. Bhargava, S. Bhosaleet al., “Llama 2: Open foundation and fine-tuned chat models, ”arXiv preprint arXiv:2307.09288, 2023

  14. [14]

    Carbon explorer: A holistic framework for designing carbon aware datacenters,

    B. Acun, B. Lee, F. Kazhamiaka, K. Maeng, U. Gupta, M. Chakkaravarthy, D. Brooks, and C.-J. Wu, “Carbon explorer: A holistic framework for designing carbon aware datacenters, ” inProceedings of the 28th ACM International Confer- ence on Architectural Support for Programming Languages and Operating Systems, Volume 2, 2023, pp. 118–132

  15. [15]

    Carbon responder: Coordinating demand response for the datacenter fleet,

    J. Xing, B. Acun, A. Sundarrajan, D. Brooks, M. Chakkaravarthy, N. Avila, C.-J. Wu, and B. C. Lee, “Carbon responder: Coordinating demand response for the datacenter fleet, ” 2023. [Online]. Available: https://arxiv.org/abs/2311.08589

  16. [16]

    Carbon-aware electricity cost minimization for sustainable data centers,

    H. Dou, Y. Qi, W. Wei, and H. Song, “Carbon-aware electricity cost minimization for sustainable data centers, ”IEEE Transactions on Sustainable Computing, vol. 2, no. 2, pp. 211–223, 2017

  17. [17]

    Temporal load balancing with service delay guarantees for data center energy cost optimization,

    J. Luo, L. Rao, and X. Liu, “Temporal load balancing with service delay guarantees for data center energy cost optimization, ”IEEE Transactions on Parallel and Distributed Systems, vol. 25, no. 3, pp. 775–784, 2013

  18. [18]

    On the limitations of carbon-aware temporal and spatial workload shifting in the cloud,

    T. Sukprasert, A. Souza, N. Bashir, D. Irwin, and P. Shenoy, “On the limitations of carbon-aware temporal and spatial workload shifting in the cloud, ” in Proceedings of the Nineteenth European Conference on Computer Systems, ser. EuroSys ’24. New York, NY, USA: Association for Computing Machinery, 2024, p. 924–941. [Online]. Available: https://doi.org/10...

  19. [19]

    Clover: Toward sustainable ai with carbon-aware machine learning inference service,

    B. Li, S. Samsi, V. Gadepally, and D. Tiwari, “Clover: Toward sustainable ai with carbon-aware machine learning inference service, ” inProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2023, pp. 1–15

  20. [20]

    In california, solar and wind boost the price of frequency regulation,

    GreenTechMedia, “In california, solar and wind boost the price of frequency regulation, ” 2022. [Online]. Available: http://www.greentechmedia.com/articles/ read/in-california-solar-and-wind-boosts-the-price-for-frequency-regulation

  21. [21]

    2025 california demand response potential study-charting california’s demand response future. final report on phase 2 results,

    P. Alstone, J. Potter, M. A. Piette, P. Schwartz, M. A. Berger, L. N. Dunn, S. J. Smith, M. D. Sohn, A. Aghajanzadeh, S. Stenssonet al., “2025 california demand response potential study-charting california’s demand response future. final report on phase 2 results, ” Lawrence Berkeley National Lab.(LBNL), Berkeley, CA (United States), Tech. Rep., 2017

  22. [22]

    Data center participation in demand response programs with quality-of-service guarantees,

    Y. Zhang, I. C. Paschalidis, and A. K. Coskun, “Data center participation in demand response programs with quality-of-service guarantees, ” inProceedings of the Tenth ACM International Conference on Future Energy Systems, 2019, pp. 285–302

  23. [23]

    Hpc data center participation in demand response: An adaptive policy with qos assurance,

    Y. Zhang, D. C. Wilson, I. C. Paschalidis, and A. K. Coskun, “Hpc data center participation in demand response: An adaptive policy with qos assurance, ”IEEE Transactions on Sustainable Computing, vol. 7, no. 1, pp. 157–171, 2022

  24. [24]

    Opportunities and challenges for data center demand response,

    A. Wierman, Z. Liu, I. Liu, and H. Mohsenian-Rad, “Opportunities and challenges for data center demand response, ” inInternational Green Computing Conference, 2014, pp. 1–10

  25. [25]

    Iswitch: Coordinating and optimizing renewable energy powered server clusters,

    C. Li, A. Qouneh, and T. Li, “Iswitch: Coordinating and optimizing renewable energy powered server clusters, ” inProceedings of the 39th Annual International Symposium on Computer Architecture, ser. ISCA ’12. USA: IEEE Computer Society, 2012, p. 512–523

  26. [26]

    Greenslot: scheduling energy consumption in green datacenters,

    Í. Goiri, K. Le, M. E. Haque, R. Beauchea, T. D. Nguyen, J. Guitart, J. Torres, and R. Bianchini, “Greenslot: scheduling energy consumption in green datacenters, ” inInternational Conference for High Performance Computing, Networking, Storage and Analysis, 2011

  27. [27]

    Solarcore: Solar energy driven multi-core architecture power management,

    C. Li, W. Zhang, C. Cho, and T. Li, “Solarcore: Solar energy driven multi-core architecture power management, ” in2011 IEEE 17th International Symposium on High Performance Computer Architecture, Feb 2011, pp. 205–216

  28. [28]

    Chameleon: Adapting throughput server to time-varying green power budget using online learning,

    C. Li, X. Li, R. Wang, T. Li, N. Goswami, and D. Qian, “Chameleon: Adapting throughput server to time-varying green power budget using online learning, ” inInternational Symposium on Low Power Electronics and Design (ISLPED), Sep. 2013, pp. 100–105

  29. [29]

    Geographical load balancing with renewables,

    Z. Liu, M. Lin, A. Wierman, S. H. Low, and L. L. Andrew, “Geographical load balancing with renewables, ”SIGMETRICS Perform. Eval. Rev., vol. 39, no. 3, p. 62–66, Dec. 2011. [Online]. Available: https://doi.org/10.1145/2160803.2160862

  30. [30]

    Adapting datacenter capacity for greener datacenters and grid,

    L. Lin and A. A. Chien, “Adapting datacenter capacity for greener datacenters and grid, ” inProceedings of the 14th ACM International Conference on Future Energy Systems, 2023, pp. 200–213

  31. [31]

    Carbonscaler: Leveraging cloud workload elasticity for optimizing carbon-efficiency,

    W. A. Hanafy, Q. Liang, N. Bashir, D. Irwin, and P. Shenoy, “Carbonscaler: Leveraging cloud workload elasticity for optimizing carbon-efficiency, ”Proc. ACM Meas. Anal. Comput. Syst., vol. 7, no. 3, Dec. 2023. [Online]. Available: https://doi.org/10.1145/3626788

  32. [32]

    The sunk carbon fallacy: Rethinking carbon footprint metrics for effective carbon-aware scheduling,

    N. Bashir, V. Gohil, A. B. Subramanya, M. Shahrad, D. Irwin, E. Olivetti, and C. Delimitrou, “The sunk carbon fallacy: Rethinking carbon footprint metrics for effective carbon-aware scheduling, ” inProceedings of the 2024 ACM Symposium on Cloud Computing, ser. SoCC ’24. New York, NY, USA: Association for Computing Machinery, 2024, p. 542–551. [Online]. Av...

  33. [33]

    Greenhouse gas emission tracking methodology,

    California ISO, “Greenhouse gas emission tracking methodology, ” Tech. Rep., 2016

  34. [34]

    Greenhouse gas coordination stakeholder recommendations for policy development,

    ——, “Greenhouse gas coordination stakeholder recommendations for policy development, ” Tech. Rep., 2024

  35. [35]

    PJM Manual 12: Balancing Operations,

    PJM, “PJM Manual 12: Balancing Operations, ” https://www.pjm.com/-/media/ documents/manuals/m12.ashx, 2022

  36. [36]

    Performance accuracy scores in caiso and miso regulation markets: A comparison based on real data and mathematical analysis,

    A. Sadeghi-Mobarakeh and H. Mohsenian-Rad, “Performance accuracy scores in caiso and miso regulation markets: A comparison based on real data and mathematical analysis, ”IEEE Transactions on Power Systems, vol. 33, no. 3, pp. 3196–3198, 2018

  37. [37]

    The data center as a grid load stabilizer,

    H. Chen, M. C. Caramanis, and A. K. Coskun, “The data center as a grid load stabilizer, ” in19th Asia and South Pacific Design Automation Conference (ASP-DAC). Singapore: IEEE, Jan. 2014, pp. 105–112. [Online]. Available: http://ieeexplore.ieee.org/document/6742874/

  38. [38]

    Dynamic Server Power Capping for Enabling Data Center Participation in Power Markets,

    H. Chen, C. Hankendi, M. C. Caramanis, and A. K. Coskun, “Dynamic Server Power Capping for Enabling Data Center Participation in Power Markets, ” in Proceedings of the International Conference on Computer-Aided Design, ser. ICCAD ’13. Piscataway, NJ, USA: IEEE Press, 2013, pp. 122–129

  39. [39]

    Reducing the data center electric- ity costs through participation in smart grid programs,

    H. Chen, M. C. Caramanis, and A. K. Coskun, “Reducing the data center electric- ity costs through participation in smart grid programs, ” inInternational Green Computing Conference, Nov. 2014, pp. 1–10

  40. [40]

    Characterizing power management opportunities for llms in the cloud,

    P. Patel, E. Choukse, C. Zhang, I. n. Goiri, B. Warrier, N. Mahalingam, and R. Bianchini, “Characterizing power management opportunities for llms in the cloud, ” inProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, ser. ASPLOS ’24. New York, NY, USA: Association for Comp...

  41. [41]

    Toward sustainable hpc: Carbon footprint estimation and environmental implications of hpc systems,

    B. Li, R. Basu Roy, D. Wang, S. Samsi, V. Gadepally, and D. Tiwari, “Toward sustainable hpc: Carbon footprint estimation and environmental implications of hpc systems, ” inProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2023, pp. 1–15

  42. [42]

    A guide to reducing carbon emissions through data center geographical load shifting,

    J. Lindberg, Y. Abdennadher, J. Chen, B. C. Lesieutre, and L. Roald, “A guide to reducing carbon emissions through data center geographical load shifting, ” in Proceedings of the Twelfth ACM International Conference on Future Energy Systems, ser. e-Energy ’21, 2021

  43. [43]

    Quantifying the Carbon Emissions of Machine Learning

    A. Lacoste, A. Luccioni, V. Schmidt, and T. Dandres, “Quantifying the carbon emissions of machine learning, ” 2019. [Online]. Available: https: //arxiv.org/abs/1910.09700

  44. [44]

    The leading api for granular electricity data reduce carbon emissions with actionable electricity data

    “The leading api for granular electricity data reduce carbon emissions with actionable electricity data. ” 2024. [Online]. Available: https://www. electricitymaps.com

  45. [45]

    Electricityemissions.jl: A framework for the comparison of carbon intensity signals,

    J. Gorka, N. Rhodes, and L. Roald, “Electricityemissions.jl: A framework for the comparison of carbon intensity signals, ” 2024. [Online]. Available: https://arxiv.org/abs/2411.06560

  46. [46]

    On the limitations of carbon-aware temporal and spatial workload shifting in the cloud,

    T. Sukprasert, A. Souza, N. Bashir, D. Irwin, and P. Shenoy, “On the limitations of carbon-aware temporal and spatial workload shifting in the cloud, ” inProceedings Coordinating Power Grid Frequency Regulation Service with Data Center Load Flexibility of the Nineteenth European Conference on Computer Systems, ser. EuroSys ’24, 2024

  47. [47]

    Process control strategies for reducing the minimum load of fossil- fired plants,

    S. Seachman, “Process control strategies for reducing the minimum load of fossil- fired plants, ” 2025. [Online]. Available: https://www.powermag.com/process- control-strategies-for-reducing-the-minimum-load-of-fossil-fired-plants/

  48. [48]

    Evaluation of market rules using a multi-agent system method,

    N.-P. Yu, C.-C. Liu, and J. Price, “Evaluation of market rules using a multi-agent system method, ”IEEE Transactions on Power Systems, vol. 25, no. 1, pp. 470–479, 2010

  49. [49]

    Feasibility study of combined cycle gas turbine (ccgt) power plant integration with adiabatic compressed air energy storage (acaes),

    J. D. Wojcik and J. Wang, “Feasibility study of combined cycle gas turbine (ccgt) power plant integration with adiabatic compressed air energy storage (acaes), ” Applied Energy, vol. 221, pp. 477–489, 2018

  50. [50]

    Ge gas turbine performance characteristics,

    F. J. Brooks, “Ge gas turbine performance characteristics, ” GE Power Systems, Tech. Rep., 2000

  51. [51]

    Optimizing deep decarbonization path- ways in California with power system planning using surrogate level-based Lagrangian relaxation,

    O. Anderson, M. A. Bragin, and N. Yu, “Optimizing deep decarbonization path- ways in California with power system planning using surrogate level-based Lagrangian relaxation, ”Applied Energy, vol. 377, 2025

  52. [52]

    Nvidia a100 tensor core gpu architecture,

    NVIDIA, “Nvidia a100 tensor core gpu architecture, ” https://images.nvidia. com/aem-dam/en-zz/Solutions/data-center/nvidia-ampere-architecture- whitepaper.pdf, 2020

  53. [53]

    Nvidia h100 tensor core gpu architecture,

    ——, “Nvidia h100 tensor core gpu architecture, ” https://www.advancedclustering. com/wp-content/uploads/2022/03/gtc22-whitepaper-hopper.pdf, 2022

  54. [54]

    Nvidia blackwell architecture technical brief,

    ——, “Nvidia blackwell architecture technical brief, ” https://cdn.prod.website- files.com/61dda201f29b7efc52c5fbaf/6602ea9d0ce8cb73fb6de87f_nvidia- blackwell-architecture-technical-brief.pdf, 2024

  55. [55]

    Workload prediction using arima model and its impact on cloud applications’ qos,

    R. N. Calheiros, E. Masoumi, R. Ranjan, and R. Buyya, “Workload prediction using arima model and its impact on cloud applications’ qos, ”IEEE Transactions on Cloud Computing, vol. 3, no. 4, pp. 449–458, 2015

  56. [56]

    Ame-wpc: Advanced model for efficient workload prediction in the cloud,

    K. Cetinski and M. B. Juric, “Ame-wpc: Advanced model for efficient workload prediction in the cloud, ”Journal of Network and Computer Applications, vol. 55, pp. 191–201, 2015

  57. [57]

    Host load prediction in a google compute cloud with a bayesian model,

    S. Di, D. Kondo, and W. Cirne, “Host load prediction in a google compute cloud with a bayesian model, ” inSC’12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis. IEEE, 2012, pp. 1–11

  58. [58]

    Rpps: A novel resource prediction and provisioning scheme in cloud data center,

    W. Fang, Z. Lu, J. Wu, and Z. Cao, “Rpps: A novel resource prediction and provisioning scheme in cloud data center, ” in2012 IEEE Ninth International Conference on Services Computing, 2012, pp. 609–616

  59. [59]

    An adaptive prediction approach based on workload pattern discrimination in the cloud,

    C. Liu, C. Liu, Y. Shang, S. Chen, B. Cheng, and J. Chen, “An adaptive prediction approach based on workload pattern discrimination in the cloud, ”Journal of Network and Computer Applications, vol. 80, pp. 35–44, 2017

  60. [60]

    Cloud workload prediction and generation models,

    G. M. Wamba, Y. Li, A.-C. Orgerie, N. Beldiceanu, and J.-M. Menaud, “Cloud workload prediction and generation models, ” in2017 29th International Sympo- sium on Computer Architecture and High Performance Computing (SBAC-PAD), 2017, pp. 89–96

  61. [61]

    An efficient deep learning model to predict cloud workload for industry informatics,

    Q. Zhang, L. T. Yang, Z. Yan, Z. Chen, and P. Li, “An efficient deep learning model to predict cloud workload for industry informatics, ”IEEE Transactions on Industrial Informatics, vol. 14, no. 7, pp. 3170–3178, 2018

  62. [62]

    Wattwiser: Power & resource-efficient scheduling for multi-model multi-gpu inference servers,

    A. Jahanshahi, M. Rezvani, and D. Wong, “Wattwiser: Power & resource-efficient scheduling for multi-model multi-gpu inference servers, ” in2023 IEEE 14th Inter- national Green and Sustainable Computing Conference (IGSC), 2023

  63. [63]

    Green contexts - cuda driver api,

    NVIDIA, “Green contexts - cuda driver api, ” 2025, accessed: 2025-10-

  64. [64]

    Available: https://docs.nvidia.com/cuda/cuda-driver-api/group_ _CUDA__GREEN__CONTEXTS.html

    [Online]. Available: https://docs.nvidia.com/cuda/cuda-driver-api/group_ _CUDA__GREEN__CONTEXTS.html

  65. [65]

    Hardware compute partitioning on nvidia gpus for composable systems,

    J. Bakita and J. H. Anderson, “Hardware compute partitioning on nvidia gpus for composable systems, ” in37th Euromicro Conference on Real-Time Systems (ECRTS 2025), 2025. [Online]. Available: https://drops.dagstuhl.de/entities/document/10. 4230/LIPIcs.ECRTS.2025.21

  66. [66]

    Beware of fragmentation: Scheduling {GPU-Sharing } workloads with fragmentation gradient descent,

    Q. Weng, L. Yang, Y. Yu, W. Wang, X. Tang, G. Yang, and L. Zhang, “Beware of fragmentation: Scheduling {GPU-Sharing } workloads with fragmentation gradient descent, ” in2023 USENIX Annual Technical Conference (USENIX ATC 23), 2023, pp. 995–1008

  67. [67]

    Z3: An efficient smt solver,

    L. De Moura and N. Bjørner, “Z3: An efficient smt solver, ” inInternational con- ference on Tools and Algorithms for the Construction and Analysis of Systems. Springer, 2008, pp. 337–340

  68. [68]

    Rocm system management interface (rocm smi) library,

    AMD, “Rocm system management interface (rocm smi) library, ” https://github. com/ROCm/ROC-smi, 2024

  69. [69]

    Scaleserve: A scalable multi-gpu machine learning inference system and benchmarking suite,

    A. Jahanshahi, M. Chow, and D. Wong, “Scaleserve: A scalable multi-gpu machine learning inference system and benchmarking suite, ” inProceedings of the 14th Workshop on General Purpose Processing Using GPU, 2022, pp. 1–2

  70. [70]

    𝜇DPM: Dynamic power management for the microsecond era,

    C. Chou, L. N. Bhuyan, and D. Wong, “𝜇DPM: Dynamic power management for the microsecond era, ” in2019 IEEE International Symposium on High Performance Computer Architecture (HPCA), Feb. 2019, pp. 120–132

  71. [71]

    Elfen scheduling: Fine-grain principled borrowing from latency-critical workloads using simultaneous multi- threading,

    X. Yang, S. M. Blackburn, and K. S. McKinley, “Elfen scheduling: Fine-grain principled borrowing from latency-critical workloads using simultaneous multi- threading, ” in2016 USENIX Annual Technical Conference (USENIX ATC 16), Jun. 2016

  72. [72]

    Language models are unsupervised multitask learners,

    A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, I. Sutskeveret al., “Language models are unsupervised multitask learners, ”OpenAI blog, vol. 1, no. 8, p. 9, 2019

  73. [73]

    Swim project,

    SWIMProjectUCB, “Swim project, ” 2013. [Online]. Available: https://github.com/ SWIMProjectUCB/SWIM/wiki

  74. [74]

    Field Experience from Li-Ion BESS Delivering Primary Frequency Regulation in the Danish Energy Market,

    M. Świerczyński, D. I. Stroe, R. Lærke, A. I. Stan, P. C. Kjær, R. Teodorescu, and S. K. Kær, “Field Experience from Li-Ion BESS Delivering Primary Frequency Regulation in the Danish Energy Market, ”ECS Transactions, vol. 61, no. 37, Sep. 2014

  75. [75]

    Overall data center costs,

    J. Hamilton, “Overall data center costs, ” 2025. [Online]. Available: https: //perspectives.mvdirona.com/2010/09/overall-data-center-costs/

  76. [76]

    L. A. Barroso, J. Clidaras, and U. Hölzle,The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition, 2013

  77. [77]

    Google data center pue performance,

    Google, “Google data center pue performance, ” 2025. [Online]. Available: https://datacenters.google/efficiency/

  78. [78]

    Amd instinct mi300 series systems,

    E. Corporation, “Amd instinct mi300 series systems, ” 2025. [Online]. Available: https://www.exxactcorp.com/category/AMD-Radeon-Instinct-Solutions

  79. [79]

    Market performance report, april 2024, ifm procurement and prices,

    CAISO, “Market performance report, april 2024, ifm procurement and prices, ”

  80. [80]

    Available: https://www.caiso.com/content/monthly-market- performance/apr-2024/ancillary-services.html

    [Online]. Available: https://www.caiso.com/content/monthly-market- performance/apr-2024/ancillary-services.html

Showing first 80 references.