Coordinating GPU Data Centers and Power Grid Regulation Service for Exogenous Carbon Benefits
Pith reviewed 2026-05-16 10:01 UTC · model grok-4.3
The pith
GPU data centers can provide frequency regulation to power grids and produce exogenous carbon savings that exceed their own operational emissions.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
GPU data centers can flexibly modulate their power consumption to deliver frequency regulation reserves to the power grid, which reduces the amount of fossil-fueled generation required for that service and thereby generates exogenous carbon savings that can exceed the data centers' operational carbon emissions.
What carries the argument
The Exogenous Carbon metric, which quantifies grid-side carbon reductions from data center regulation participation, together with the EcoCenter optimization framework that maximizes allowable regulation provision.
If this is right
- Data centers can become net carbon reducers for the grid rather than pure consumers.
- Fewer fossil peaker plants are needed to maintain frequency stability.
- AI workloads can be scheduled to supply regulation during periods of high grid need.
- Grid operators gain a flexible, low-carbon alternative to traditional reserves.
Where Pith is reading between the lines
- Operators may site future data centers near grids with large regulation requirements to amplify savings.
- New contracts could let data centers sell regulation capacity as a service to utilities.
- The same coordination logic could extend to non-GPU high-power facilities if power modulation is feasible.
- Policy incentives might emerge that credit data centers for measured exogenous carbon reductions.
Load-bearing premise
GPU data centers can adjust power draw for regulation services without unacceptable impacts on computing performance or that grid carbon-intensity models accurately capture the displaced fossil generation.
What would settle it
Real-world measurements from a data center providing regulation service that show no measurable drop in fossil-plant output or that document computing performance degradation large enough to make participation impractical.
Figures
read the original abstract
The rapid growth of AI/ML data centers has led to higher energy consumption and carbon emissions. The shift to renewable energy and growing data center energy demands can destabilize the power grid. Power grids rely on frequency regulation reserves, typically fossil-fueled power plants, to stabilize and balance the supply and demand of electricity. This paper sheds light on the hidden carbon emissions of frequency regulation service. Our work explores how modern GPU data centers can coordinate with power grids to reduce the need for fossil-fueled frequency regulation reserves. We first introduce a novel metric, Exogenous Carbon, to quantify grid-side carbon emission reductions resulting from data center participation in regulation service. We additionally introduce EcoCenter, a framework to maximize the amount of frequency regulation provision that GPU data centers can provide, and thus, reduce the amount of frequency regulation reserves necessary. We demonstrate that data center participation in frequency regulation can result in Exogenous carbon savings that can outweigh operational carbon emissions
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces an 'Exogenous Carbon' metric to quantify grid-side carbon emission reductions achieved when GPU data centers supply frequency regulation reserves, proposes the EcoCenter framework to maximize such provision, and claims that the resulting savings can outweigh the data centers' operational carbon emissions.
Significance. If the displacement assumptions hold under realistic market conditions, the work could demonstrate a practical mechanism for data centers to contribute to grid stability while delivering net carbon benefits, offering a new angle on sustainable AI infrastructure.
major comments (3)
- [§4] §4: The definition of Exogenous Carbon uses a linear mapping from regulation provision to displaced reserves that assumes a displacement factor of exactly 1.0 with fossil marginal units. This assumption is load-bearing for the headline claim that savings outweigh emissions, yet the text provides no market simulation, sensitivity analysis on the factor, or validation against co-optimized energy/ancillary markets where batteries or hydro may be marginal.
- [Abstract and §5] Abstract and §5: The assertion that data-center participation yields net carbon savings is presented without accompanying methods, input data, error bars, or quantitative results, leaving the central quantitative demonstration unsupported in the available sections.
- [EcoCenter framework] EcoCenter framework description: No analysis is given of the performance or latency impact on GPU workloads when power draw is modulated for regulation services, which is essential to establish feasibility.
minor comments (2)
- [§4] Clarify the exact units, parameters, and any grid-specific assumptions in the Exogenous Carbon formula to improve reproducibility.
- [References] Add references to prior literature on data-center demand response for ancillary services and marginal-emission modeling.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our work. Below we respond to each major comment and indicate the revisions planned for the manuscript.
read point-by-point responses
-
Referee: [§4] The definition of Exogenous Carbon uses a linear mapping from regulation provision to displaced reserves that assumes a displacement factor of exactly 1.0 with fossil marginal units. This assumption is load-bearing for the headline claim that savings outweigh emissions, yet the text provides no market simulation, sensitivity analysis on the factor, or validation against co-optimized energy/ancillary markets where batteries or hydro may be marginal.
Authors: We acknowledge the importance of validating the displacement factor assumption. It is based on the observation that fossil-fueled plants are the primary providers of frequency regulation in many grids. In the revision, we will incorporate a sensitivity analysis on the displacement factor (ranging from 0.5 to 1.2) and discuss implications for markets with significant battery or hydro participation. A brief market simulation will be added to §4 to demonstrate robustness under varying conditions. revision: yes
-
Referee: [Abstract and §5] The assertion that data-center participation yields net carbon savings is presented without accompanying methods, input data, error bars, or quantitative results, leaving the central quantitative demonstration unsupported in the available sections.
Authors: The quantitative demonstration relies on trace-driven simulations detailed in §5. To make this fully transparent, we will revise §5 to include a dedicated methods subsection describing the input datasets (power traces from production GPU clusters, grid carbon intensity data), the simulation setup, and results with error bars from repeated experiments under different regulation signal profiles. This will explicitly show how exogenous savings exceed operational emissions. revision: yes
-
Referee: [EcoCenter framework] No analysis is given of the performance or latency impact on GPU workloads when power draw is modulated for regulation services, which is essential to establish feasibility.
Authors: We agree that workload impact is critical for feasibility. The EcoCenter framework is designed to modulate power within safe GPU limits without requiring workload changes, drawing on established power management techniques. We will add an analysis subsection with references to prior studies on GPU power capping showing negligible latency impact for batch workloads, and include a short discussion of potential overheads. Full empirical evaluation may be added if space permits in the revision. revision: partial
Circularity Check
No significant circularity; metric and framework defined independently of result
full rationale
The paper defines a novel Exogenous Carbon metric to quantify grid-side emission reductions from data-center regulation participation and introduces the EcoCenter framework to maximize regulation provision. The demonstration that these savings can outweigh operational emissions is obtained by applying the defined metric to the modeled coordination scenarios. No equations, self-citations, or fitted parameters are shown that reduce the headline claim to its own inputs by construction. The load-bearing modeling choice (displacement of fossil reserves) is an external assumption rather than a definitional tautology, and the derivation chain remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
invented entities (1)
-
Exogenous Carbon
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We first introduce a novel metric, Exogenous Carbon, to quantify grid-side carbon emission reductions resulting from data center participation in regulation service... Copw/RS = E_DC × CI_gen − (R_DC × MER_resv)
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanLogicNat recovery unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
EcoCenter... coordinates power capping, core allocation, and multi-GPU coordination... to follow the regulation signal at 2-second granularity
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Powering intelligence: Analyzing artificial intelligence and data center energy consumption,
J. Aljbour, T. Wilson, and P. Patel, “Powering intelligence: Analyzing artificial intelligence and data center energy consumption, ”EPRI White Paper, 2024
work page 2024
-
[2]
M. Stanley, 2023. [Online]. Available: https://www.datacenterdynamics.com/en/ news/morgan-stanley-data-center-industry-will-emit-25bn-tons-of-co2-by- 2030/
work page 2023
-
[3]
Google, “Environmental report, ” 2023. [Online]. Available: https://www.gstatic. com/gumdrop/sustainability/google-2023-environmental-report.pdf
work page 2023
-
[4]
Powering sustainable transformation,
Microsoft, “Powering sustainable transformation, ” 2025. [On- line]. Available: https://datacenters.microsoft.com/globe/powering-sustainable- transformation/
work page 2025
-
[5]
U. E. I. A. (EIA), “Electricity data browser, ” https://www.eia.gov/electricity/data/ browser/, 2023
work page 2023
-
[6]
Powermorph: Qos-aware server power reshaping for data center regulation service,
A. Jahanshahi, N. Yu, and D. Wong, “Powermorph: Qos-aware server power reshaping for data center regulation service, ”ACM Transactions on Architecture and Code Optimization (TACO), vol. 19, no. 3, pp. 1–27, 2022
work page 2022
-
[7]
Supporting power grids with demand response at google data centers,
Google, “Supporting power grids with demand response at google data centers, ”
-
[8]
[Online]. Available: https://cloud.google.com/blog/products/infrastructure/ using-demand-response-to-reduce-data-center-power-consumption
-
[9]
EnergyQARE: QoS-Aware Data Center Participation in Smart Grid Regulation Service Reserve Provision,
H. Chen, Y. Zhang, M. C. Caramanis, and A. K. Coskun, “EnergyQARE: QoS-Aware Data Center Participation in Smart Grid Regulation Service Reserve Provision, ”ACM Trans. Model. Perform. Eval. Comput. Syst., vol. 4, no. 1, pp. 2:1–2:31, Jan. 2019. [Online]. Available: http://doi.acm.org/10.1145/3243172
-
[10]
Towards improved power management in cloud gpus,
P. Patel, Z. Gong, S. Rizvi, E. Choukse, P. Misra, T. Anderson, and A. Srira- man, “Towards improved power management in cloud gpus, ”IEEE Computer Architecture Letters, vol. 22, no. 2, pp. 141–144, 2023
work page 2023
-
[11]
Clicking clean virginia the dirty energy powering data center alley,
G. Cook, E. Jardim, and C. Craighill, “Clicking clean virginia the dirty energy powering data center alley, ”Greenpeace A vailable from: https://www. greenpeace. org/usa/reports/click-clean-virginia/[Accessed 21 Mar 2020], 2019
work page 2020
-
[12]
Ai is poised to drive 160% increase in data center power demand,
“Ai is poised to drive 160% increase in data center power demand, ” 2024. [Online]. Available: https://www.goldmansachs.com/insights/articles/AI-poised- to-drive-160-increase-in-power-demand
work page 2024
-
[13]
Llama 2: Open Foundation and Fine-Tuned Chat Models
H. Touvron, L. Martin, K. Stone, P. Albert, A. Almahairi, Y. Babaei, N. Bashlykov, S. Batra, P. Bhargava, S. Bhosaleet al., “Llama 2: Open foundation and fine-tuned chat models, ”arXiv preprint arXiv:2307.09288, 2023
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[14]
Carbon explorer: A holistic framework for designing carbon aware datacenters,
B. Acun, B. Lee, F. Kazhamiaka, K. Maeng, U. Gupta, M. Chakkaravarthy, D. Brooks, and C.-J. Wu, “Carbon explorer: A holistic framework for designing carbon aware datacenters, ” inProceedings of the 28th ACM International Confer- ence on Architectural Support for Programming Languages and Operating Systems, Volume 2, 2023, pp. 118–132
work page 2023
-
[15]
Carbon responder: Coordinating demand response for the datacenter fleet,
J. Xing, B. Acun, A. Sundarrajan, D. Brooks, M. Chakkaravarthy, N. Avila, C.-J. Wu, and B. C. Lee, “Carbon responder: Coordinating demand response for the datacenter fleet, ” 2023. [Online]. Available: https://arxiv.org/abs/2311.08589
-
[16]
Carbon-aware electricity cost minimization for sustainable data centers,
H. Dou, Y. Qi, W. Wei, and H. Song, “Carbon-aware electricity cost minimization for sustainable data centers, ”IEEE Transactions on Sustainable Computing, vol. 2, no. 2, pp. 211–223, 2017
work page 2017
-
[17]
Temporal load balancing with service delay guarantees for data center energy cost optimization,
J. Luo, L. Rao, and X. Liu, “Temporal load balancing with service delay guarantees for data center energy cost optimization, ”IEEE Transactions on Parallel and Distributed Systems, vol. 25, no. 3, pp. 775–784, 2013
work page 2013
-
[18]
On the limitations of carbon-aware temporal and spatial workload shifting in the cloud,
T. Sukprasert, A. Souza, N. Bashir, D. Irwin, and P. Shenoy, “On the limitations of carbon-aware temporal and spatial workload shifting in the cloud, ” in Proceedings of the Nineteenth European Conference on Computer Systems, ser. EuroSys ’24. New York, NY, USA: Association for Computing Machinery, 2024, p. 924–941. [Online]. Available: https://doi.org/10...
-
[19]
Clover: Toward sustainable ai with carbon-aware machine learning inference service,
B. Li, S. Samsi, V. Gadepally, and D. Tiwari, “Clover: Toward sustainable ai with carbon-aware machine learning inference service, ” inProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2023, pp. 1–15
work page 2023
-
[20]
In california, solar and wind boost the price of frequency regulation,
GreenTechMedia, “In california, solar and wind boost the price of frequency regulation, ” 2022. [Online]. Available: http://www.greentechmedia.com/articles/ read/in-california-solar-and-wind-boosts-the-price-for-frequency-regulation
work page 2022
-
[21]
P. Alstone, J. Potter, M. A. Piette, P. Schwartz, M. A. Berger, L. N. Dunn, S. J. Smith, M. D. Sohn, A. Aghajanzadeh, S. Stenssonet al., “2025 california demand response potential study-charting california’s demand response future. final report on phase 2 results, ” Lawrence Berkeley National Lab.(LBNL), Berkeley, CA (United States), Tech. Rep., 2017
work page 2025
-
[22]
Data center participation in demand response programs with quality-of-service guarantees,
Y. Zhang, I. C. Paschalidis, and A. K. Coskun, “Data center participation in demand response programs with quality-of-service guarantees, ” inProceedings of the Tenth ACM International Conference on Future Energy Systems, 2019, pp. 285–302
work page 2019
-
[23]
Hpc data center participation in demand response: An adaptive policy with qos assurance,
Y. Zhang, D. C. Wilson, I. C. Paschalidis, and A. K. Coskun, “Hpc data center participation in demand response: An adaptive policy with qos assurance, ”IEEE Transactions on Sustainable Computing, vol. 7, no. 1, pp. 157–171, 2022
work page 2022
-
[24]
Opportunities and challenges for data center demand response,
A. Wierman, Z. Liu, I. Liu, and H. Mohsenian-Rad, “Opportunities and challenges for data center demand response, ” inInternational Green Computing Conference, 2014, pp. 1–10
work page 2014
-
[25]
Iswitch: Coordinating and optimizing renewable energy powered server clusters,
C. Li, A. Qouneh, and T. Li, “Iswitch: Coordinating and optimizing renewable energy powered server clusters, ” inProceedings of the 39th Annual International Symposium on Computer Architecture, ser. ISCA ’12. USA: IEEE Computer Society, 2012, p. 512–523
work page 2012
-
[26]
Greenslot: scheduling energy consumption in green datacenters,
Í. Goiri, K. Le, M. E. Haque, R. Beauchea, T. D. Nguyen, J. Guitart, J. Torres, and R. Bianchini, “Greenslot: scheduling energy consumption in green datacenters, ” inInternational Conference for High Performance Computing, Networking, Storage and Analysis, 2011
work page 2011
-
[27]
Solarcore: Solar energy driven multi-core architecture power management,
C. Li, W. Zhang, C. Cho, and T. Li, “Solarcore: Solar energy driven multi-core architecture power management, ” in2011 IEEE 17th International Symposium on High Performance Computer Architecture, Feb 2011, pp. 205–216
work page 2011
-
[28]
Chameleon: Adapting throughput server to time-varying green power budget using online learning,
C. Li, X. Li, R. Wang, T. Li, N. Goswami, and D. Qian, “Chameleon: Adapting throughput server to time-varying green power budget using online learning, ” inInternational Symposium on Low Power Electronics and Design (ISLPED), Sep. 2013, pp. 100–105
work page 2013
-
[29]
Geographical load balancing with renewables,
Z. Liu, M. Lin, A. Wierman, S. H. Low, and L. L. Andrew, “Geographical load balancing with renewables, ”SIGMETRICS Perform. Eval. Rev., vol. 39, no. 3, p. 62–66, Dec. 2011. [Online]. Available: https://doi.org/10.1145/2160803.2160862
-
[30]
Adapting datacenter capacity for greener datacenters and grid,
L. Lin and A. A. Chien, “Adapting datacenter capacity for greener datacenters and grid, ” inProceedings of the 14th ACM International Conference on Future Energy Systems, 2023, pp. 200–213
work page 2023
-
[31]
Carbonscaler: Leveraging cloud workload elasticity for optimizing carbon-efficiency,
W. A. Hanafy, Q. Liang, N. Bashir, D. Irwin, and P. Shenoy, “Carbonscaler: Leveraging cloud workload elasticity for optimizing carbon-efficiency, ”Proc. ACM Meas. Anal. Comput. Syst., vol. 7, no. 3, Dec. 2023. [Online]. Available: https://doi.org/10.1145/3626788
-
[32]
The sunk carbon fallacy: Rethinking carbon footprint metrics for effective carbon-aware scheduling,
N. Bashir, V. Gohil, A. B. Subramanya, M. Shahrad, D. Irwin, E. Olivetti, and C. Delimitrou, “The sunk carbon fallacy: Rethinking carbon footprint metrics for effective carbon-aware scheduling, ” inProceedings of the 2024 ACM Symposium on Cloud Computing, ser. SoCC ’24. New York, NY, USA: Association for Computing Machinery, 2024, p. 542–551. [Online]. Av...
-
[33]
Greenhouse gas emission tracking methodology,
California ISO, “Greenhouse gas emission tracking methodology, ” Tech. Rep., 2016
work page 2016
-
[34]
Greenhouse gas coordination stakeholder recommendations for policy development,
——, “Greenhouse gas coordination stakeholder recommendations for policy development, ” Tech. Rep., 2024
work page 2024
-
[35]
PJM Manual 12: Balancing Operations,
PJM, “PJM Manual 12: Balancing Operations, ” https://www.pjm.com/-/media/ documents/manuals/m12.ashx, 2022
work page 2022
-
[36]
A. Sadeghi-Mobarakeh and H. Mohsenian-Rad, “Performance accuracy scores in caiso and miso regulation markets: A comparison based on real data and mathematical analysis, ”IEEE Transactions on Power Systems, vol. 33, no. 3, pp. 3196–3198, 2018
work page 2018
-
[37]
The data center as a grid load stabilizer,
H. Chen, M. C. Caramanis, and A. K. Coskun, “The data center as a grid load stabilizer, ” in19th Asia and South Pacific Design Automation Conference (ASP-DAC). Singapore: IEEE, Jan. 2014, pp. 105–112. [Online]. Available: http://ieeexplore.ieee.org/document/6742874/
-
[38]
Dynamic Server Power Capping for Enabling Data Center Participation in Power Markets,
H. Chen, C. Hankendi, M. C. Caramanis, and A. K. Coskun, “Dynamic Server Power Capping for Enabling Data Center Participation in Power Markets, ” in Proceedings of the International Conference on Computer-Aided Design, ser. ICCAD ’13. Piscataway, NJ, USA: IEEE Press, 2013, pp. 122–129
work page 2013
-
[39]
Reducing the data center electric- ity costs through participation in smart grid programs,
H. Chen, M. C. Caramanis, and A. K. Coskun, “Reducing the data center electric- ity costs through participation in smart grid programs, ” inInternational Green Computing Conference, Nov. 2014, pp. 1–10
work page 2014
-
[40]
Characterizing power management opportunities for llms in the cloud,
P. Patel, E. Choukse, C. Zhang, I. n. Goiri, B. Warrier, N. Mahalingam, and R. Bianchini, “Characterizing power management opportunities for llms in the cloud, ” inProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, ser. ASPLOS ’24. New York, NY, USA: Association for Comp...
-
[41]
Toward sustainable hpc: Carbon footprint estimation and environmental implications of hpc systems,
B. Li, R. Basu Roy, D. Wang, S. Samsi, V. Gadepally, and D. Tiwari, “Toward sustainable hpc: Carbon footprint estimation and environmental implications of hpc systems, ” inProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, 2023, pp. 1–15
work page 2023
-
[42]
A guide to reducing carbon emissions through data center geographical load shifting,
J. Lindberg, Y. Abdennadher, J. Chen, B. C. Lesieutre, and L. Roald, “A guide to reducing carbon emissions through data center geographical load shifting, ” in Proceedings of the Twelfth ACM International Conference on Future Energy Systems, ser. e-Energy ’21, 2021
work page 2021
-
[43]
Quantifying the Carbon Emissions of Machine Learning
A. Lacoste, A. Luccioni, V. Schmidt, and T. Dandres, “Quantifying the carbon emissions of machine learning, ” 2019. [Online]. Available: https: //arxiv.org/abs/1910.09700
work page internal anchor Pith review Pith/arXiv arXiv 2019
-
[44]
“The leading api for granular electricity data reduce carbon emissions with actionable electricity data. ” 2024. [Online]. Available: https://www. electricitymaps.com
work page 2024
-
[45]
Electricityemissions.jl: A framework for the comparison of carbon intensity signals,
J. Gorka, N. Rhodes, and L. Roald, “Electricityemissions.jl: A framework for the comparison of carbon intensity signals, ” 2024. [Online]. Available: https://arxiv.org/abs/2411.06560
-
[46]
On the limitations of carbon-aware temporal and spatial workload shifting in the cloud,
T. Sukprasert, A. Souza, N. Bashir, D. Irwin, and P. Shenoy, “On the limitations of carbon-aware temporal and spatial workload shifting in the cloud, ” inProceedings Coordinating Power Grid Frequency Regulation Service with Data Center Load Flexibility of the Nineteenth European Conference on Computer Systems, ser. EuroSys ’24, 2024
work page 2024
-
[47]
Process control strategies for reducing the minimum load of fossil- fired plants,
S. Seachman, “Process control strategies for reducing the minimum load of fossil- fired plants, ” 2025. [Online]. Available: https://www.powermag.com/process- control-strategies-for-reducing-the-minimum-load-of-fossil-fired-plants/
work page 2025
-
[48]
Evaluation of market rules using a multi-agent system method,
N.-P. Yu, C.-C. Liu, and J. Price, “Evaluation of market rules using a multi-agent system method, ”IEEE Transactions on Power Systems, vol. 25, no. 1, pp. 470–479, 2010
work page 2010
-
[49]
J. D. Wojcik and J. Wang, “Feasibility study of combined cycle gas turbine (ccgt) power plant integration with adiabatic compressed air energy storage (acaes), ” Applied Energy, vol. 221, pp. 477–489, 2018
work page 2018
-
[50]
Ge gas turbine performance characteristics,
F. J. Brooks, “Ge gas turbine performance characteristics, ” GE Power Systems, Tech. Rep., 2000
work page 2000
-
[51]
O. Anderson, M. A. Bragin, and N. Yu, “Optimizing deep decarbonization path- ways in California with power system planning using surrogate level-based Lagrangian relaxation, ”Applied Energy, vol. 377, 2025
work page 2025
-
[52]
Nvidia a100 tensor core gpu architecture,
NVIDIA, “Nvidia a100 tensor core gpu architecture, ” https://images.nvidia. com/aem-dam/en-zz/Solutions/data-center/nvidia-ampere-architecture- whitepaper.pdf, 2020
work page 2020
-
[53]
Nvidia h100 tensor core gpu architecture,
——, “Nvidia h100 tensor core gpu architecture, ” https://www.advancedclustering. com/wp-content/uploads/2022/03/gtc22-whitepaper-hopper.pdf, 2022
work page 2022
-
[54]
Nvidia blackwell architecture technical brief,
——, “Nvidia blackwell architecture technical brief, ” https://cdn.prod.website- files.com/61dda201f29b7efc52c5fbaf/6602ea9d0ce8cb73fb6de87f_nvidia- blackwell-architecture-technical-brief.pdf, 2024
work page 2024
-
[55]
Workload prediction using arima model and its impact on cloud applications’ qos,
R. N. Calheiros, E. Masoumi, R. Ranjan, and R. Buyya, “Workload prediction using arima model and its impact on cloud applications’ qos, ”IEEE Transactions on Cloud Computing, vol. 3, no. 4, pp. 449–458, 2015
work page 2015
-
[56]
Ame-wpc: Advanced model for efficient workload prediction in the cloud,
K. Cetinski and M. B. Juric, “Ame-wpc: Advanced model for efficient workload prediction in the cloud, ”Journal of Network and Computer Applications, vol. 55, pp. 191–201, 2015
work page 2015
-
[57]
Host load prediction in a google compute cloud with a bayesian model,
S. Di, D. Kondo, and W. Cirne, “Host load prediction in a google compute cloud with a bayesian model, ” inSC’12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis. IEEE, 2012, pp. 1–11
work page 2012
-
[58]
Rpps: A novel resource prediction and provisioning scheme in cloud data center,
W. Fang, Z. Lu, J. Wu, and Z. Cao, “Rpps: A novel resource prediction and provisioning scheme in cloud data center, ” in2012 IEEE Ninth International Conference on Services Computing, 2012, pp. 609–616
work page 2012
-
[59]
An adaptive prediction approach based on workload pattern discrimination in the cloud,
C. Liu, C. Liu, Y. Shang, S. Chen, B. Cheng, and J. Chen, “An adaptive prediction approach based on workload pattern discrimination in the cloud, ”Journal of Network and Computer Applications, vol. 80, pp. 35–44, 2017
work page 2017
-
[60]
Cloud workload prediction and generation models,
G. M. Wamba, Y. Li, A.-C. Orgerie, N. Beldiceanu, and J.-M. Menaud, “Cloud workload prediction and generation models, ” in2017 29th International Sympo- sium on Computer Architecture and High Performance Computing (SBAC-PAD), 2017, pp. 89–96
work page 2017
-
[61]
An efficient deep learning model to predict cloud workload for industry informatics,
Q. Zhang, L. T. Yang, Z. Yan, Z. Chen, and P. Li, “An efficient deep learning model to predict cloud workload for industry informatics, ”IEEE Transactions on Industrial Informatics, vol. 14, no. 7, pp. 3170–3178, 2018
work page 2018
-
[62]
Wattwiser: Power & resource-efficient scheduling for multi-model multi-gpu inference servers,
A. Jahanshahi, M. Rezvani, and D. Wong, “Wattwiser: Power & resource-efficient scheduling for multi-model multi-gpu inference servers, ” in2023 IEEE 14th Inter- national Green and Sustainable Computing Conference (IGSC), 2023
work page 2023
-
[63]
Green contexts - cuda driver api,
NVIDIA, “Green contexts - cuda driver api, ” 2025, accessed: 2025-10-
work page 2025
-
[64]
Available: https://docs.nvidia.com/cuda/cuda-driver-api/group_ _CUDA__GREEN__CONTEXTS.html
[Online]. Available: https://docs.nvidia.com/cuda/cuda-driver-api/group_ _CUDA__GREEN__CONTEXTS.html
-
[65]
Hardware compute partitioning on nvidia gpus for composable systems,
J. Bakita and J. H. Anderson, “Hardware compute partitioning on nvidia gpus for composable systems, ” in37th Euromicro Conference on Real-Time Systems (ECRTS 2025), 2025. [Online]. Available: https://drops.dagstuhl.de/entities/document/10. 4230/LIPIcs.ECRTS.2025.21
work page 2025
-
[66]
Beware of fragmentation: Scheduling {GPU-Sharing } workloads with fragmentation gradient descent,
Q. Weng, L. Yang, Y. Yu, W. Wang, X. Tang, G. Yang, and L. Zhang, “Beware of fragmentation: Scheduling {GPU-Sharing } workloads with fragmentation gradient descent, ” in2023 USENIX Annual Technical Conference (USENIX ATC 23), 2023, pp. 995–1008
work page 2023
-
[67]
L. De Moura and N. Bjørner, “Z3: An efficient smt solver, ” inInternational con- ference on Tools and Algorithms for the Construction and Analysis of Systems. Springer, 2008, pp. 337–340
work page 2008
-
[68]
Rocm system management interface (rocm smi) library,
AMD, “Rocm system management interface (rocm smi) library, ” https://github. com/ROCm/ROC-smi, 2024
work page 2024
-
[69]
Scaleserve: A scalable multi-gpu machine learning inference system and benchmarking suite,
A. Jahanshahi, M. Chow, and D. Wong, “Scaleserve: A scalable multi-gpu machine learning inference system and benchmarking suite, ” inProceedings of the 14th Workshop on General Purpose Processing Using GPU, 2022, pp. 1–2
work page 2022
-
[70]
𝜇DPM: Dynamic power management for the microsecond era,
C. Chou, L. N. Bhuyan, and D. Wong, “𝜇DPM: Dynamic power management for the microsecond era, ” in2019 IEEE International Symposium on High Performance Computer Architecture (HPCA), Feb. 2019, pp. 120–132
work page 2019
-
[71]
X. Yang, S. M. Blackburn, and K. S. McKinley, “Elfen scheduling: Fine-grain principled borrowing from latency-critical workloads using simultaneous multi- threading, ” in2016 USENIX Annual Technical Conference (USENIX ATC 16), Jun. 2016
work page 2016
-
[72]
Language models are unsupervised multitask learners,
A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, I. Sutskeveret al., “Language models are unsupervised multitask learners, ”OpenAI blog, vol. 1, no. 8, p. 9, 2019
work page 2019
-
[73]
SWIMProjectUCB, “Swim project, ” 2013. [Online]. Available: https://github.com/ SWIMProjectUCB/SWIM/wiki
work page 2013
-
[74]
M. Świerczyński, D. I. Stroe, R. Lærke, A. I. Stan, P. C. Kjær, R. Teodorescu, and S. K. Kær, “Field Experience from Li-Ion BESS Delivering Primary Frequency Regulation in the Danish Energy Market, ”ECS Transactions, vol. 61, no. 37, Sep. 2014
work page 2014
-
[75]
J. Hamilton, “Overall data center costs, ” 2025. [Online]. Available: https: //perspectives.mvdirona.com/2010/09/overall-data-center-costs/
work page 2025
-
[76]
L. A. Barroso, J. Clidaras, and U. Hölzle,The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines, Second Edition, 2013
work page 2013
-
[77]
Google data center pue performance,
Google, “Google data center pue performance, ” 2025. [Online]. Available: https://datacenters.google/efficiency/
work page 2025
-
[78]
Amd instinct mi300 series systems,
E. Corporation, “Amd instinct mi300 series systems, ” 2025. [Online]. Available: https://www.exxactcorp.com/category/AMD-Radeon-Instinct-Solutions
work page 2025
-
[79]
Market performance report, april 2024, ifm procurement and prices,
CAISO, “Market performance report, april 2024, ifm procurement and prices, ”
work page 2024
-
[80]
[Online]. Available: https://www.caiso.com/content/monthly-market- performance/apr-2024/ancillary-services.html
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.