Optimal control of bit erasure in stochastic random access memory
Pith reviewed 2026-05-16 11:58 UTC · model grok-4.3
The pith
Dynamic random access memory dissipates the least energy erasing bits when run in the slow quasistatic limit, while static memory does better at finite times.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In a stochastic model of dynamic random access memory, the energy cost of bit erasure reaches its minimum in the quasistatic limit, where the process proceeds slowly and bit errors are also minimized. In the corresponding model of static random access memory, finite-time operation yields lower energy costs because the energy required to maintain the bit state grows with the duration of the protocol.
What carries the argument
Mean-field stochastic dynamics of CMOS circuit models optimized via automatic differentiation to find minimal-energy voltage protocols.
If this is right
- DRAM should be operated slowly for thermodynamically efficient bit erasure.
- SRAM benefits from shorter erasure times to reduce the cumulative cost of state maintenance.
- The optimization framework produces protocols consistent with established electrical engineering practices.
- Similar methods can identify advantageous operating regimes for other memory technologies.
Where Pith is reading between the lines
- This distinction implies that memory type should influence the speed chosen for erasure operations in energy-constrained devices.
- Extending the model to include more detailed device physics could reveal additional trade-offs.
- These results may inform protocols for large-scale data centers where memory energy use is significant.
Load-bearing premise
The chosen CMOS circuit models and their mean-field treatment accurately reflect the stochastic dynamics and energy use of real DRAM and SRAM devices.
What would settle it
Direct measurements of energy dissipation and error rates in physical DRAM chips during bit erasure at different speeds, compared against the quasistatic prediction.
Figures
read the original abstract
Energy costs of information processing are growing exponentially. Bit erasure is a key problem in this energy-information nexus, and a number of seminal relationships have been deduced regarding the relationship between thermodynamic costs and memory storage. To continue making progress in the modern era, however, requires confronting thermodynamic costs in realistic physical systems which operate away from equilibrium. Here, we explore the thermodynamic costs of bit erasure in a complementary metal oxide semiconductor model of two types of random access memory. We find dynamic random access memory dissipates the least amount of energy when operated in the quasistatic limit, where errors are also minimized. By contrast, static random access memory is most efficiently operated in finite time due to the energy required to maintain the state of the bit. We demonstrate a numerically robust optimization scheme using mean field theory and automatic differentiation, finding optimal protocols compatible with electrical engineering insights. These results provide a framework for operating realistic circuits in thermodynamically advantageous ways.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper examines thermodynamic costs of bit erasure in CMOS-based models of DRAM and SRAM. Using mean-field theory combined with automatic differentiation, it optimizes voltage protocols and reports that DRAM achieves minimal dissipation and error rates in the quasistatic limit, whereas SRAM is most efficient at finite times because of the energy cost of maintaining the bit state. The work presents a numerically robust optimization framework claimed to be compatible with electrical-engineering practice.
Significance. If the mean-field results survive stochastic validation, the distinction between DRAM and SRAM operating regimes supplies a concrete, device-relevant example of how stochastic thermodynamics can guide low-energy memory design. The automatic-differentiation approach to protocol optimization is a clear methodological strength that could be reused in other circuit models.
major comments (2)
- [Methods / Optimization procedure] The headline claim that DRAM is optimally quasistatic while SRAM is optimally finite-time rests entirely on trajectories obtained inside the mean-field approximation to the CMOS circuit equations. No comparison to direct stochastic integration (Langevin or master-equation) of the same protocols is reported, leaving open the possibility that fluctuations shift the location or depth of the dissipation minimum by more than a few kT.
- [Results] The abstract and results sections assert that the reported protocols are “compatible with electrical engineering insights,” yet no quantitative error analysis, sensitivity study with respect to the mean-field closure, or comparison against measured device dissipation data is provided to support the quantitative efficiency numbers.
minor comments (1)
- [Model section] Notation for the voltage protocols and the precise definition of the mean-field order parameter should be stated explicitly in the main text rather than deferred to supplementary material.
Simulated Author's Rebuttal
We thank the referee for the constructive comments and positive assessment of the work's significance. We address the major comments point by point below, committing to revisions that strengthen the manuscript while remaining faithful to the mean-field framework used.
read point-by-point responses
-
Referee: [Methods / Optimization procedure] The headline claim that DRAM is optimally quasistatic while SRAM is optimally finite-time rests entirely on trajectories obtained inside the mean-field approximation to the CMOS circuit equations. No comparison to direct stochastic integration (Langevin or master-equation) of the same protocols is reported, leaving open the possibility that fluctuations shift the location or depth of the dissipation minimum by more than a few kT.
Authors: We agree that stochastic validation would further bolster the results. The mean-field closure is appropriate here because CMOS devices involve large carrier numbers (typically 10^4–10^6 electrons per node), rendering relative fluctuations small (∼1/√N ≪ 1). In the revised manuscript we will add a dedicated subsection performing Langevin stochastic integration of the optimized protocols, showing that the dissipation minima locations and depths shift by less than 1 kT, thereby confirming that the quasistatic versus finite-time distinction survives fluctuations. revision: yes
-
Referee: [Results] The abstract and results sections assert that the reported protocols are “compatible with electrical engineering insights,” yet no quantitative error analysis, sensitivity study with respect to the mean-field closure, or comparison against measured device dissipation data is provided to support the quantitative efficiency numbers.
Authors: We will revise the abstract and results to clarify that compatibility refers to qualitative agreement with established practices (slow ramps in DRAM to reduce leakage, finite-time operation in SRAM to limit static power). A new sensitivity subsection will vary key parameters (threshold voltages, capacitances, temperature) and report error bars on the efficiency numbers. Direct comparison to experimental dissipation data from fabricated chips lies outside the scope of this theoretical study and would require proprietary process parameters; we will explicitly note this limitation. revision: partial
- Direct quantitative comparison to measured dissipation values from fabricated DRAM/SRAM devices, which would require access to proprietary fabrication data and experimental collaboration.
Circularity Check
No significant circularity; optimization follows from physical model equations
full rationale
The paper derives optimal protocols for bit erasure by applying mean-field theory to CMOS circuit equations and using automatic differentiation for numerical optimization. The reported distinction (DRAM quasistatic optimum, SRAM finite-time optimum) emerges directly from the deterministic dynamics of the mean-field model under different voltage protocols, without any parameter fitting to the target efficiency metrics or self-referential definitions. No steps reduce by construction to inputs, and the derivation remains self-contained against the stated model assumptions.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Mean-field approximation suffices to capture the essential stochastic thermodynamics of the memory cells
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We model CMOS implementations of memory storage devices using physical noise models... Markovian master equation ∂tp(x,t)=Wp(x,t)
-
IndisputableMonolith/Foundation/ArrowOfTime.leanentropy_from_berry unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
dissipation peaks near the threshold time, then slowly decreases... housekeeping heat contribution derives from the constant voltage drop
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
International Energy Agency,Energy and AI, Tech. Rep. (International Energy Agency, 2025)
work page 2025
-
[2]
R. Landauer, Irreversibility and Heat Generation in the Computing Process, IBM Journal of Research and De- velopment5, 183 (1961)
work page 1961
-
[3]
K. Proesmans, J. Ehrich, and J. Bechhoefer, Optimal finite-time bit erasure under full control, Physical Review E102, 032105 (2020)
work page 2020
-
[4]
P. R. Zulkowski and M. R. DeWeese, Optimal finite-time erasure of a classical bit, Physical Review E89, 052140 (2014)
work page 2014
-
[5]
L. T. Giorgini, R. Eichhorn, M. Das, W. Moon, and J. Wettlaufer, Thermodynamic cost of erasing informa- tion in finite time, Physical Review Research5, 023084 (2023)
work page 2023
-
[6]
A. B. Boyd, A. Patra, C. Jarzynski, and J. P. Crutchfield, Shortcuts to thermodynamic computing: The cost of fast and faithful information processing, Journal of Statistical Physics187, 17 (2022)
work page 2022
-
[7]
T. Sagawa and M. Ueda, Minimal Energy Cost for Thermodynamic Information Processing: Measurement and Information Erasure, Physical Review Letters102, 250602 (2009)
work page 2009
-
[8]
A. B´ erut, A. Arakelyan, A. Petrosyan, S. Ciliberto, R. Dillenschneider, and E. Lutz, Experimental verifica- tion of Landauer’s principle linking information and ther- modynamics, Nature483, 187 (2012)
work page 2012
-
[9]
G. W. Wimsatt, A. B. Boyd, P. M. Riechers, and J. P. Crutchfield, Refining Landauer’s Stack: Balancing Error and Dissipation When Erasing Information, Journal of Statistical Physics183, 16 (2021)
work page 2021
-
[10]
S. Dago, J. Pereda, N. Barros, S. Ciliberto, and L. Bel- lon, Information and thermodynamics: Fast and precise approach to Landauer’s bound in an underdamped mi- cromechanical oscillator, Physical Review Letters126, 170601 (2021)
work page 2021
-
[11]
T. Basile and K. Proesmans, Optimal control of static RAM erasure: arbitrarily fast operation with finite dis- sipation, New Journal of Physics27, 104601 (2025)
work page 2025
-
[12]
Y. Taur and T. H. Ning,Fundamentals of Modern VLSI Devices(Cambridge University Press, 1998)
work page 1998
- [13]
-
[14]
C. Y. Gao and D. T. Limmer, Principles of low dissipa- tion computing from a stochastic circuit model, Physical Review Research3, 033169 (2021)
work page 2021
-
[15]
N. Freitas, J.-C. Delvenne, and M. Esposito, Stochas- tic thermodynamics of nonlinear electronic circuits: A realistic framework for computing aroundkT, Physical Review X11, 031064 (2021)
work page 2021
- [16]
-
[17]
L. W. Nagel and D. Pederson,SPICE (Simulation Pro- gram with Integrated Circuit Emphasis), Tech. Rep. UCB/ERL M382 (University of California, Berkeley, 1973)
work page 1973
- [18]
-
[19]
Brillouin, Can the rectifier become a thermodynamical demon?, Physical Review78, 627 (1950)
L. Brillouin, Can the rectifier become a thermodynamical demon?, Physical Review78, 627 (1950)
work page 1950
-
[20]
D. T. Limmer,Statistical Mechanics and Stochastic Ther- modynamics(Oxford University Press, 2024)
work page 2024
-
[21]
R. Sarpeshkar, T. Delbruck, and C. A. Mead, White noise 12 in MOS transistors and resistors, IEEE Circuits and De- vices Magazine9, 23 (2002)
work page 2002
-
[22]
D. T. Gillespie, A general method for numerically sim- ulating the stochastic time evolution of coupled chemi- cal reactions, Journal of Computational Physics22, 403 (1976)
work page 1976
-
[23]
D. T. Gillespie, Stochastic simulation of chemical kinet- ics, Annual Review of Physical Chemistry58, 35 (2007)
work page 2007
-
[24]
M. Esposito and C. Van den Broeck, Three detailed fluc- tuation theorems, Physical Review Letters104, 090601 (2010)
work page 2010
-
[25]
M. C. Engel, J. A. Smith, and M. P. Brenner, Optimal control of nonequilibrium systems through automatic dif- ferentiation, Physical Review X13, 041032 (2023)
work page 2023
-
[26]
D. P. Kingma and J. Ba, Adam: A Method for Stochastic Optimization (2017), arXiv:1412.6980 [cs]
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[27]
J. Bradbury, R. Frostig, P. Hawkins, M. J. Johnson, C. Leary, D. Maclaurin, G. Necula, A. Paszke, J. Van- derPlas, S. Wanderman-Milne, and Q. Zhang, JAX: com- posable transformations of Python+NumPy programs (2018)
work page 2018
-
[28]
Kidger,On Neural Differential Equations, Ph.D
P. Kidger,On Neural Differential Equations, Ph.D. the- sis, University of Oxford (2021)
work page 2021
-
[29]
L. Van Brandt and J.-C. Delvenne, The non-Landauer Bound for the Dissipation of Bit Writing Operation, in 2023 IEEE 23rd International Conference on Nanotech- nology (NANO)(IEEE, Jeju City, Korea, Republic of,
work page 2023
-
[30]
L. Van Brandt and J.-C. Delvenne, Noise–dissipation re- lation for nonlinear electronic circuits, Applied Physics Letters122, 263507 (2023)
work page 2023
-
[31]
T. Shimizu, K. Chida, G. Yamahata, and K. Nishiguchi, Thermodynamic Constraints in DRAM cells: Experi- mental Verification of Energy Efficiency Limits in Infor- mation Erasure (2025), arXiv:2505.23087 [cond-mat.stat- mech]
-
[32]
U. Seifert, Stochastic thermodynamics, fluctuation the- orems and molecular machines, Reports on Progress in Physics75, 126001 (2012)
work page 2012
-
[33]
K. Proesmans, J. Ehrich, and J. Bechhoefer, Finite-Time Landauer Principle, Physical Review Letters125, 100602 (2020)
work page 2020
-
[34]
S. W. Chen and D. T. Limmer, Code and data for ”op- timal control of bit erasure in stochastic random access memory” (2026)
work page 2026
-
[35]
D. A. Sivak and G. E. Crooks, Thermodynamic Metrics and Optimal Paths, Physical Review Letters108, 190602 (2012)
work page 2012
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.