pith. sign in

arxiv: 2606.00457 · v1 · pith:65CHDNRVnew · submitted 2026-05-30 · 💻 cs.PF

Maximizing Compute Capacity in AI Data Centers through Cooling, Energy Storage, and Computing Adaptation

Pith reviewed 2026-06-28 18:14 UTC · model grok-4.3

classification 💻 cs.PF
keywords data center power managementAI compute capacitycooling systemsbattery energy storagedynamic adaptationpower cappingsite-level constraints
0
0 comments X

The pith

ComputeAmp uses dynamic coordination of cooling, batteries, and compute throttling to run more servers under a fixed site power limit.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper claims that AI data centers face recurring multi-hour spikes in cooling power during hot weather that force conservative sizing of compute hardware to avoid exceeding total power capacity. By jointly managing cooling loads, battery storage to shift energy across hours, and temporary compute reductions, ComputeAmp aims to support a larger compute installation that stays within the same power budget on average. The approach exploits the fact that cooling demand is lower on most days, allowing batteries to charge then and discharge during peaks while compute adapts as needed. If successful, this would increase usable compute capacity without new grid connections or larger water systems.

Core claim

ComputeAmp is a framework that maximizes compute capacity by jointly and dynamically leveraging cooling, battery energy storage, and computing-based adaptation, expanding usable compute capacity within local power and water resource limits.

What carries the argument

ComputeAmp framework for joint dynamic management of cooling power demand, battery energy storage, and compute adaptation.

If this is right

  • Compute systems can be sized larger than the conservative peak-cooling rule without new power infrastructure.
  • Battery discharge during hot periods offsets cooling spikes while compute throttling prevents overload.
  • Water consumption stays within limits because the method avoids increasing peak cooling equipment size.
  • Operational algorithms must solve a joint optimization over time-varying cooling, storage state, and compute load.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same joint-control idea could extend to sites with on-site solar by aligning battery charge cycles with both temperature and solar output.
  • If cooling systems include evaporative options, the framework might further reduce water draw by shifting load timing.
  • Longer battery duration would be needed if heat waves last multiple days rather than hours.

Load-bearing premise

Cooling power demand increases substantially and predictably with ambient temperature during recurring multi-hour periods, creating exploitable variation.

What would settle it

A site-level measurement showing that total power still exceeds capacity during summer peaks even after applying the joint cooling-storage-compute schedule, or that average daily compute throughput does not increase.

Figures

Figures reproduced from arXiv: 2606.00457 by Adam Wierman, Mohammad A. Islam, Shaolei Ren.

Figure 1
Figure 1. Figure 1: Simulated hourly PUE of a highly-optimized AI data center without adiabatic assistance in Virginia and Texas, respectively, reproduced from [3], indicating substantially higher cooling power in the summer. The seasonal variation in cooling power creates an important challenge for sizing compute capacity within a limited site-level power budget, particularly as large-scale data centers face increasing regio… view at source ↗
Figure 2
Figure 2. Figure 2: compares the PUE of waterless and evaporative heat rejection. Across quarterly, monthly, and hourly timescales, evaporative cooling can deliver substantial reductions in peak cooling power, thereby en￾abling greater compute capacity without additional grid capacity expansion. For example, Figure 2a shows that the air-cooled site in Storey County, NV, exhibits more spikier summer PUE than evaporatively-cool… view at source ↗
Figure 3
Figure 3. Figure 3: Illustrative results for an AI data center simulated under idealized settings in Austin, TX, from July 1 to September 30, 2023. the dry coolers. For the adiabatic-assisted case, we assume an idealized evaporative pre-cooling process in which water is used only to reduce the inlet air temperature required by the dry coolers. The water require￾ment is computed from the air flow rate and outdoor psychrometric… view at source ↗
read the original abstract

The deployment of artificial intelligence is increasingly constrained by limited site-level power capacity, which must support both compute systems and non-compute systems (primarily cooling) at all times. Cooling power demand, especially in non-evaporative cooling systems, can increase substantially with ambient temperature in the summer, producing recurring periods of elevated cooling power that often lasts for multiple hours per day. Therefore, maximizing compute capacity under a limited site-level power budget is an important planning and operational challenge. Sizing the compute system conservatively based on peak cooling power can leave part of the site-level power capacity underutilized when the cooling power is below its peak, particularly in cooler months. On the other hand, sizing the compute system aggressively based on low cooling power can cause the total site-level power demand to exceed the site-level power capacity during hot days in the summer. This paper proposes ComputeAmp (Compute Amplifier), a framework that maximizes the compute capacity by jointly and dynamically leveraging cooling, battery energy storage, and computing-based adaptation. We discuss the opportunities and limitations of ComputeAmp and illustrate its potential to significantly expand usable compute capacity within local power and water resource limits. We also present a problem formulation for ComputeAmp and highlight a few algorithmic and operational challenges.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper proposes ComputeAmp, a conceptual framework to maximize usable compute capacity in AI data centers under fixed site-level power limits. It identifies recurring multi-hour periods of elevated cooling power demand driven by ambient temperature in non-evaporative systems, notes the resulting under-utilization or risk of overload depending on conservative vs. aggressive compute sizing, and advocates joint dynamic management of cooling, battery energy storage, and compute adaptation. The manuscript discusses opportunities and limitations, presents a high-level optimization problem formulation, and flags algorithmic and operational challenges, while claiming potential for significant capacity expansion within local power and water constraints.

Significance. If the joint optimization approach can be shown to deliver net gains after battery round-trip losses, adaptation overhead, and water constraints, it would address a practical bottleneck in AI infrastructure scaling. The core insight—that temperature-driven cooling variation creates exploitable slack that storage and compute throttling can arbitrage—is plausible and timely given rising data-center power densities, but the manuscript supplies no quantitative evidence, solved instances, or sensitivity analysis to establish the magnitude of the claimed expansion.

major comments (3)
  1. [Abstract and opportunities/limitations discussion] The central claim that ComputeAmp can 'significantly expand usable compute capacity' rests on the unverified premise that multi-hour cooling-power variation is both predictable and large enough to be arbitraged by storage and adaptation without net loss; the manuscript provides no data, model, or bound on this variation (e.g., no temperature-to-power curves or site measurements).
  2. [Problem formulation section] The problem formulation is described only at a narrative level; no explicit objective function, decision variables, or constraint set (e.g., power-balance equations coupling cooling, battery SOC, and compute throttling) appears, preventing assessment of whether the stated joint optimization is well-posed or tractable.
  3. [Entire manuscript] No simulation, trace-driven evaluation, or even stylized numerical example is supplied to quantify capacity gain, battery cycling cost, or water-use impact, so the 'potential to significantly expand' assertion cannot be evaluated.
minor comments (2)
  1. [Framework description] Notation for the three levers (cooling, storage, compute adaptation) is introduced informally; consistent symbols and a small table of variables would improve readability.
  2. [Abstract] The abstract states that the approach operates 'within local power and water resource limits' yet the water constraint is never formalized or discussed quantitatively.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback. The comments correctly identify that the manuscript is a high-level conceptual proposal without quantitative backing or a fully specified optimization model. We agree these elements are needed to substantiate the claims and will revise the paper to address them.

read point-by-point responses
  1. Referee: [Abstract and opportunities/limitations discussion] The central claim that ComputeAmp can 'significantly expand usable compute capacity' rests on the unverified premise that multi-hour cooling-power variation is both predictable and large enough to be arbitraged by storage and adaptation without net loss; the manuscript provides no data, model, or bound on this variation (e.g., no temperature-to-power curves or site measurements).

    Authors: We agree that the manuscript supplies no site-specific measurements or explicit temperature-to-power curves. The discussion relies on the general, well-documented relationship between ambient temperature and cooling power in non-evaporative systems. In revision we will add representative curves drawn from public data-center cooling literature together with simple bounds on the magnitude and duration of the variation to make the premise explicit and falsifiable. revision: yes

  2. Referee: [Problem formulation section] The problem formulation is described only at a narrative level; no explicit objective function, decision variables, or constraint set (e.g., power-balance equations coupling cooling, battery SOC, and compute throttling) appears, preventing assessment of whether the stated joint optimization is well-posed or tractable.

    Authors: We accept that the formulation remains narrative. The revision will present an explicit optimization problem that defines the objective (maximize time-averaged compute capacity), the decision variables (cooling set-points, battery charge/discharge schedule, compute throttling factors), and the key constraints (site-level power balance, battery SOC dynamics, cooling-power dependence on temperature and compute load). revision: yes

  3. Referee: [Entire manuscript] No simulation, trace-driven evaluation, or even stylized numerical example is supplied to quantify capacity gain, battery cycling cost, or water-use impact, so the 'potential to significantly expand' assertion cannot be evaluated.

    Authors: We agree that no numerical evaluation of any kind is provided. The current manuscript is positioned as a framework proposal. In the revised version we will include a stylized numerical example that solves the formulated problem over a multi-day temperature trace, reports net capacity gain after round-trip battery losses and adaptation overhead, and notes water-use implications under the stated constraints. revision: yes

Circularity Check

0 steps flagged

No circularity: conceptual framework with optimization formulation only

full rationale

The paper presents a high-level proposal for ComputeAmp as a joint optimization framework over cooling, storage, and compute adaptation. It formulates an optimization problem and discusses opportunities/limitations but contains no equations, fitted parameters, predictions, or derivations that reduce to their own inputs by construction. No self-citations are load-bearing for any uniqueness claim, and the central premise rests on the observable physical variation in cooling power rather than any self-referential step. The derivation chain is therefore self-contained as a planning/operational model without circular reduction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract supplies no explicit free parameters, axioms, or invented entities; the framework is described at a conceptual level only.

pith-pipeline@v0.9.1-grok · 5755 in / 1178 out tokens · 28829 ms · 2026-06-28T18:14:57.518903+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

28 extracted references · 1 canonical work pages

  1. [1]

    Powering intelligence 2026: Updated scenarios of U.S

    EPRI. Powering intelligence 2026: Updated scenarios of U.S. data center electricity use and power strategies.Technical Report, 2026

  2. [2]

    Van Zetten, Michael E

    Jeffrey D. Van Zetten, Michael E. Cholette, and Keivan Bamdad. Energy and carbon savings in data cen- tres through liquid-to-chip cooling with differential temperature control.Advances in Applied Energy, 22:100269, 2026

  3. [3]

    Wedan Emmanuel Gnibga and Andrew A. Chien. Datacenter cooling with cold underground thermal energy storage: Environmental and economic values. Ine-Energy, 2026

  4. [4]

    Smith, Alex Hubbard, Alex Newkirk, Nuoa Lei, Md Abu Bakar Siddik, Billie Holecek, Jonathan Koomey, Eric Masanet, and Dale Sartor

    Arman Shehabi, Sarah J. Smith, Alex Hubbard, Alex Newkirk, Nuoa Lei, Md Abu Bakar Siddik, Billie Holecek, Jonathan Koomey, Eric Masanet, and Dale Sartor. 2024 United States data center energy usage report.Lawrence Berkeley National Laboratory LBNL-2001637, December 2024

  5. [5]

    M. A. Islam, Xiaoqi Ren, Shaolei Ren, Adam Wierman, and Xiaorui Wang. A market approach for handling power emergencies in multi-tenant data center. InHPCA, 2016

  6. [6]

    Flexible data centers: A faster, more affordable path to power

    Carlo Brancucci, Dylan Cutler, and Jesse Jenkins. Flexible data centers: A faster, more affordable path to power. Technical report, Camus, Encoord, and Princeton ZERO Lab, December 2025

  7. [7]

    The energy and water use impacts of building system design for data centers: Design considerations for Oregon and Washington.PAE Technical Report, February 2025

    Karina Hershberg, Ben Burnett, Jonathan Roraff, Paula Hopker, and Marc Brune. The energy and water use impacts of building system design for data centers: Design considerations for Oregon and Washington.PAE Technical Report, February 2025

  8. [8]

    Google. Google response to the consultation on the establishment of a common EU rating scheme for data centres.https://ec.europa.eu/info/law/better-regulation/have-your-say/ initiatives/16035-Energy-efficiency-rating-scheme-for-data-centres-in-Europe/ F33395555_en, April 2026

  9. [9]

    Beyond PUE: Rethinking efficient data center design in an era of power scarcity

    Verrus Data Centers. Beyond PUE: Rethinking efficient data center design in an era of power scarcity. Technical Report, 2025. White paper

  10. [10]

    Tran, Dai H

    Nguyen H. Tran, Dai H. Tran, Shaolei Ren, Zhu Han, and Eui-Nam Huh; Choong Seon Hong. How geo-distributed data centers do demand response: A game-theoretic approach.IEEE Transactions on Smart Grid, pp(99), May 2015

  11. [11]

    Drought Monitor.https://droughtmonitor.unl.edu/CurrentMap.aspx

    U.S. Drought Monitor.https://droughtmonitor.unl.edu/CurrentMap.aspx

  12. [12]

    Amazon to invest$12 billion in first data center campuses in Louisiana.https://www

    Amazon. Amazon to invest$12 billion in first data center campuses in Louisiana.https://www. aboutamazon.com/news/company-news/amazon-data-center-louisiana-new-jobs, February 2026

  13. [13]

    Environmental report.https://www.gstatic.com/gumdrop/sustainability/ google-2025-environmental-report.pdf, 2025

    Google. Environmental report.https://www.gstatic.com/gumdrop/sustainability/ google-2025-environmental-report.pdf, 2025

  14. [14]

    Small bottle, big pipe: Quantifying and addressing the impact of data centers on public water systems.arXiv:2603.02705, 2026

    Yuelin Han, Pengfei Li, Adam Wierman, and Shaolei Ren. Small bottle, big pipe: Quantifying and addressing the impact of data centers on public water systems.arXiv:2603.02705, 2026. 10

  15. [15]

    U.S. EPA. Water reuse action plan 2.0.https://www.epa.gov/waterreuse/ water-reuse-action-plan-20, 2026

  16. [16]

    Growing the Internet While Reducing Energy Consumption.https://datacenters.google/ efficiency/

    Google. Growing the Internet While Reducing Energy Consumption.https://datacenters.google/ efficiency/

  17. [17]

    Hickenbottom

    Leila Karimi, Leeann Yacuel, Joseph Degraft-Johnson, Jamie Ashby, Michael Green, Matt Renner, Aryn Bergman, Robert Norwood, and Kerri L. Hickenbottom. Water-energy tradeoffs in data centers: A case study in hot-arid climates.Resources, Conservation and Recycling, 181:106194, 2022

  18. [18]

    Esha Choukse, Brijesh Warrier, Scot Heath, Luz Belmont, April Zhao, Hassan Ali Khan, Brian Harry, Matthew Kappel, Russell J. Hewett, Kushal Datta, Yu Pei, Caroline Lichtenberger, John Siegler, David Lukofsky, Zaid Kahn, Gurpreet Sahota, Andy Sullivan, Charles Frederick, Hien Thai, Rebecca Naughton, Daniel Jurnove, Justin Harp, Reid Carper, Nithish Mahalin...

  19. [19]

    Power provisioning for a warehouse-sized computer

    Xiaobo Fan, Wolf-Dietrich Weber, and Luiz Andre Barroso. Power provisioning for a warehouse-sized computer. InISCA, 2007

  20. [20]

    Opportunities and challenges for data center demand response

    Adam Wierman, Zhenhua Liu, Iris Liu, and Hamed Mohsenian-Rad. Opportunities and challenges for data center demand response. InIGCC, 2014

  21. [21]

    A truthful incentive mechanism for emer- gency demand response in colocation data centers

    Linquan Zhang, Shaolei Ren, Chuan Wu, and Zongpeng Li. A truthful incentive mechanism for emer- gency demand response in colocation data centers. InINFOCOM, 2015

  22. [22]

    Impact of charging rates on electric vehicle battery life.Findings, 2021(March), 2021

    Sivapriya Mothilal Bhagavathy, Hannah Budnitz, Tim Schwanen, and Malcolm McCulloch. Impact of charging rates on electric vehicle battery life.Findings, 2021(March), 2021

  23. [23]

    Islam, Kishwar Ahmed, Hong Xu, Nguyen H

    Mohammad A. Islam, Kishwar Ahmed, Hong Xu, Nguyen H. Tran, Gang Quan, and Shaolei Ren. Exploiting spatio-temporal diversity for water saving in geo-distributed data centers.IEEE Transactions on Cloud Computing, 6(3):734–746, 2018

  24. [24]

    Flexible cooling for AI growth: How zonal architecture supports diverse hardware needs

    Microsoft. Flexible cooling for AI growth: How zonal architecture supports diverse hardware needs. Microsoft Azure Architecture Blog, April 2026

  25. [25]

    Robust learning for smoothed online convex optimization with feedback delay

    Pengfei Li, Jianyi Yang, Adam Wierman, and Shaolei Ren. Robust learning for smoothed online convex optimization with feedback delay. InNeurIPS, 2023

  26. [26]

    Optimal robustness-consistency trade-offs for learning-augmented online algorithms

    Alexander Wei and Fred Zhang. Optimal robustness-consistency trade-offs for learning-augmented online algorithms. InNeurIPS, 2020

  27. [27]

    A regression approach to learning- augmented online algorithms

    Keerti Anand, Rong Ge, Amit Kumar, and Debmalya Panigrahi. A regression approach to learning- augmented online algorithms. In A. Beygelzimer, Y. Dauphin, P. Liang, and J. Wortman Vaughan, editors,Advances in Neural Information Processing Systems, 2021

  28. [28]

    Sutton and Andrew G

    Richard S. Sutton and Andrew G. Barto.Reinforcement Learning: An Introduction. MIT Press, 2018. 11