Learning by training: emergent return-point memory from cyclically tuning disordered sphere packings

Carl P. Goodrich; Mengjie Zu

arXiv: 2509.01296 · v2 · submitted 2025-09-01 · physics.comp-ph · cond-mat.dis-nn · cond-mat.soft


Pith reviewed 2026-05-18 20:00 UTC · model grok-4.3

classification: physics.comp-ph · cond-mat.dis-nn · cond-mat.soft

keywords: return-point memory · marginally absorbing manifold · sphere packings · inverse design · cyclic training · disordered materials · elastic properties · adaptive systems


The pith

Cyclically tuned sphere packings evolve toward a marginally absorbing manifold that remembers the training range via return-point memory.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines athermal disordered sphere packings that undergo cyclic inverse design to reach target elastic properties over a selected range. These systems move toward a marginally absorbing manifold that stores memory of the training interval, producing behavior akin to return-point memory in cyclically driven materials. The authors trace this outcome to discontinuities in the gradients of the trained elastic quantities and present this as a general mechanism. A reader would care because the result supplies a concrete physical model for how adaptation under varying conditions can generate lasting memory without separate storage mechanisms.

Core claim

Athermal disordered sphere packings subjected to cyclic inverse design evolve toward a marginally absorbing manifold. This manifold encodes memory of the training range and produces return-point memory that matches observations in other cyclically driven systems. The mechanism rests on gradient discontinuities in the trained elastic quantities, which the authors propose as a general route to such manifolds and their associated memory.
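
To make the phrase "cyclic inverse design" concrete, here is a minimal toy sketch of the protocol: a parameter vector is tuned by gradient descent so that a scalar response alternately hits the two ends of a target range. Everything in it is an assumption made for illustration (the two-parameter `response` function with a kink at `theta[1] == 0`, the learning rate, the tolerance, and the target values). It is not the authors' sphere-packing model and it is not meant to reproduce the marginally absorbing manifold or the memory results; it only shows the shape of the training loop and the per-half-cycle step count that such a loop can record.

```python
def response(theta):
    """Hypothetical scalar response with a kink (gradient discontinuity) at theta[1] == 0."""
    return theta[0] + 0.5 * abs(theta[1])


def response_grad(theta):
    """Gradient of the toy response; its second component jumps across the kink."""
    sign = 1.0 if theta[1] >= 0.0 else -1.0
    return [1.0, 0.5 * sign]


def train_to_target(theta, target, lr=0.1, tol=1e-6, max_steps=10_000):
    """Gradient descent on (response - target)**2. Returns (theta, number of steps taken)."""
    theta = list(theta)
    for step in range(max_steps):
        error = response(theta) - target
        if abs(error) < tol:
            return theta, step
        grad = response_grad(theta)
        # Chain rule: d/dtheta (response - target)**2 = 2 * error * grad(response)
        theta = [t - lr * 2.0 * error * g for t, g in zip(theta, grad)]
    raise RuntimeError("training did not converge within max_steps")


def cyclic_training(theta, target_min, target_max, n_cycles):
    """Alternate between the two ends of the target range; record steps per half cycle."""
    history = []
    for _ in range(n_cycles):
        theta, steps_up = train_to_target(theta, target_max)
        theta, steps_down = train_to_target(theta, target_min)
        history.append((steps_up, steps_down))
    return theta, history


# Illustrative values only; none of these numbers come from the paper.
theta_final, history = cyclic_training([0.0, 0.3], target_min=0.2, target_max=0.6, n_cycles=5)
```

In a study of the kind summarized here, the quantities of interest would be how the parameters and the recorded step counts change from cycle to cycle; this toy exposes both through `theta_final` and `history`.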

What carries the argument

The marginally absorbing manifold (MAM), a structure in configuration space that absorbs cyclic training trajectories and encodes the range of elastic targets through return-point memory.

If this is right

Trained packings retain information about the full range of past target properties through their configuration on the manifold.
Memory formation occurs automatically from the training process without requiring explicit encoding steps.
The same gradient-discontinuity mechanism can generate analogous memory in other adaptive physical systems.
Design of materials that adapt under repeated loading can exploit this manifold structure to store history.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The mechanism may apply to living systems that adapt to cyclic environmental stresses, such as cells or tissues under repeated mechanical loading.
Testing the model with different particle interaction potentials would check whether the gradient-discontinuity route remains dominant.
The framework could connect to machine-learning settings where models are trained on data drawn from varying distributions.

Load-bearing premise

Gradient discontinuities in the trained elastic quantities are both necessary and sufficient to produce the marginally absorbing manifold and its return-point memory, and this mechanism applies beyond the specific sphere-packing model.

What would settle it

A concrete counterexample would be a system of cyclically tuned packings that develops return-point memory while showing no gradient discontinuities in the trained elastic quantities.

Figures

Figures reproduced from arXiv: 2509.01296 by Carl P. Goodrich, Mengjie Zu.

**Figure 1.** Training and cyclic training. (a) A schematic of our gradient-based optimization routine to tune the Poisson’s ratio ...

**Figure 2.** Memory in four representative examples. Columns (a)–(c) show readout results following cyclic training targeting the ...

**Figure 3.** The evolution of the parameters during cyclic ...

**Figure 4.** Gradient Discontinuity Learning. (a) An example training function ...

**Figure 5.** Confirming predictions made by Gradient ...

**Figure 6.** Robustness of MAMs. Column (a) shows a ...

**Figure 7.** Cyclic training on Poisson’s ratio between ...

**Figure 8.** Memory in a representative example trained on ...

**Figure 9.** Cyclic training on two elements of elastic modulus ...

Original abstract

Many living and artificial systems improve their fitness or performance by adapting to changing environments or diverse training data. However, it remains unclear how such environmental variation influences adaptation, what is learned in the process, and whether memory of past conditions is retained. In this work, we investigate these questions using athermal disordered systems that are subject to cyclic inverse design, enabling them to attain target elastic properties spanning a chosen range. We demonstrate that such systems evolve toward a marginally absorbing manifold (MAM), which encodes memory of the training range that closely resembles return-point memory observed in cyclically driven systems. We further propose a general mechanism for the formation of MAMs and the corresponding memory that is based on gradient discontinuities in the trained quantities. Our model provides a simple and broadly applicable physical framework for understanding how adaptive systems learn under environmental change and how they retain memory of past experiences.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note · private letter to a colleague

Cyclic inverse design on athermal sphere packings produces a marginally absorbing manifold that stores return-point memory, with gradient discontinuities offered as the driver, though that causal link still needs a control test.


The main point is that training disordered sphere packings cyclically to hit elastic targets over a range makes them settle onto a marginally absorbing manifold that remembers the training interval in a return-point style. The authors tie this to discontinuities in the gradients of the trained quantities and suggest the pattern could be general for adaptive systems under varying conditions.

Referee Report

2 major / 2 minor

Summary. The manuscript studies athermal disordered sphere packings subjected to cyclic inverse design that targets elastic properties over a chosen range. It reports that the packings evolve toward a marginally absorbing manifold (MAM) whose geometry encodes memory of the training range in a manner that closely resembles return-point memory. The authors propose that gradient discontinuities in the trained elastic quantities provide a general mechanism for both MAM formation and the emergence of this memory.

Significance. If the central claims hold, the work supplies a concrete, minimal physical model for how adaptive systems acquire and retain memory of past conditions under environmental variation. The link between cyclic training, gradient discontinuities, and return-point memory is conceptually interesting and could inform broader studies of learning in physical and biological systems. The sphere-packing inverse-design protocol is a reasonable choice for an athermal, disordered setting, and the introduction of the MAM as an emergent structure is a useful organizing idea.

major comments (2)

§4 (proposed general mechanism): The assertion that gradient discontinuities in the trained quantities are both necessary and sufficient for MAM formation and return-point memory is not tested by any control in which the discontinuities are removed or smoothed (for example by replacing the elastic response with a differentiable surrogate) while the cyclic inverse-design protocol is held fixed. Without such a control, it remains possible that the MAM and its memory properties arise from the geometry of configuration space or the form of the inverse-design objective rather than from the non-differentiable points.
Results on evolution to the MAM: The claim that the systems evolve toward the MAM and encode memory of the training range is presented without quantitative support such as a distance-to-manifold metric tracked over training cycles, convergence statistics across independent realizations, or error bars. This absence makes it difficult to assess how robust or complete the evolution is.
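
The control requested in the first major comment, replacing a non-differentiable response with a differentiable surrogate, can be illustrated on a one-dimensional stand-in. The sketch below compares a kinked function, `max(0, x)`, with a softplus surrogate and measures the one-sided slopes at the kink by finite differences. Both functions, the `width` parameter, and the step size `h` are assumptions chosen for illustration; they are not the paper's elastic response or interaction potential.

```python
import math


def kinked(x):
    """Stand-in for a response whose gradient jumps at x == 0."""
    return max(0.0, x)


def softplus(x, width=0.05):
    """Smooth surrogate: width * log(1 + exp(x / width)), written in a numerically stable form.

    It approaches max(0, x) as width -> 0.
    """
    z = x / width
    return width * (max(z, 0.0) + math.log1p(math.exp(-abs(z))))


def one_sided_slopes(f, x0=0.0, h=1e-6):
    """Finite-difference slopes just to the left and just to the right of x0."""
    left = (f(x0) - f(x0 - h)) / h
    right = (f(x0 + h) - f(x0)) / h
    return left, right


kink_left, kink_right = one_sided_slopes(kinked)        # slopes differ by about 1
smooth_left, smooth_right = one_sided_slopes(softplus)  # slopes nearly equal, both close to 0.5
```

The slope jump of the kinked function is what a smoothed control would remove. Whether a comparable surrogate can be built for a packing without changing its physics is the open question raised in the report.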

minor comments (2)

Methods / early Results: The definition of the MAM would be clearer if accompanied by an explicit mathematical characterization (e.g., a condition on the Hessian or on the set of admissible strains) rather than a purely descriptive statement.
Figure captions: Figure captions should explicitly state the number of independent realizations used for averaging and the precise definition of any shaded regions or error bars.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thoughtful and constructive comments on our manuscript. We are encouraged by the positive assessment of the significance and the usefulness of the MAM concept. We address each of the major comments below.


Referee: §4 (proposed general mechanism): The assertion that gradient discontinuities in the trained quantities are both necessary and sufficient for MAM formation and return-point memory is not tested by any control in which the discontinuities are removed or smoothed (for example by replacing the elastic response with a differentiable surrogate) while the cyclic inverse-design protocol is held fixed. Without such a control, it remains possible that the MAM and its memory properties arise from the geometry of configuration space or the form of the inverse-design objective rather than from the non-differentiable points.

Authors: We agree that an explicit control experiment would provide stronger support for the proposed mechanism. However, in the context of athermal sphere packings, the gradient discontinuities stem directly from the discrete nature of contact formation and breaking, which is fundamental to the system's response. Implementing a fully differentiable surrogate while maintaining the inverse-design protocol and athermal conditions is challenging and would likely require significant modifications to the physical model. In the revised manuscript, we will add a dedicated paragraph in §4 discussing this point, including why such a control is difficult to implement without changing the essence of the system, and we will present additional supporting analysis from our existing data that links the memory properties specifically to the observed discontinuities. We believe this will clarify the scope of our claims.

revision: partial

Referee: Results on evolution to the MAM: The claim that the systems evolve toward the MAM and encode memory of the training range is presented without quantitative support such as a distance-to-manifold metric tracked over training cycles, convergence statistics across independent realizations, or error bars. This absence makes it difficult to assess how robust or complete the evolution is.

Authors: We appreciate this feedback. While the manuscript includes several figures illustrating the evolution and memory encoding, we acknowledge that quantitative metrics would improve the presentation. In the revised manuscript, we will include new quantitative analyses: specifically, we will track and plot the distance to the MAM over the course of training cycles for multiple realizations, include error bars representing standard deviations across independent packings, and provide statistics on convergence rates. These additions will allow readers to better evaluate the robustness of the reported behavior.

revision: yes
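
The promised analysis (a distance metric tracked per training cycle, averaged over independent realizations, with standard-deviation error bars) could be summarized along the following lines. This is a sketch under assumptions: the function name, the nested-list layout `distances[realization][cycle]`, and the placeholder numbers are hypothetical, and the distance metric itself is left undefined because the summary above does not specify one.

```python
import statistics


def summarize_over_realizations(distances):
    """Per-cycle mean and sample standard deviation across realizations.

    distances[r][c] is a (hypothetical) distance metric for realization r after cycle c.
    Returns a list of (mean, std) tuples, one per cycle.
    """
    n_cycles = len(distances[0])
    if any(len(row) != n_cycles for row in distances):
        raise ValueError("every realization must report the same number of cycles")
    summary = []
    for c in range(n_cycles):
        column = [row[c] for row in distances]
        mean = statistics.fmean(column)
        # Sample standard deviation needs at least two realizations.
        std = statistics.stdev(column) if len(column) > 1 else 0.0
        summary.append((mean, std))
    return summary


# Placeholder numbers for demonstration only; these are not data from the paper.
example = summarize_over_realizations([[1.0, 0.5], [3.0, 0.5]])
```

Each `(mean, std)` pair maps directly onto one point with an error bar in a distance-versus-cycle plot.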

Circularity Check

0 steps flagged

No circularity: MAM emerges from training dynamics without self-referential definition or fitted prediction


The paper presents the marginally absorbing manifold (MAM) as an emergent outcome of cyclic inverse design applied to athermal disordered sphere packings, with memory of the training range arising from the adaptation process rather than being presupposed in the definition. The proposed mechanism based on gradient discontinuities is offered as a general explanation derived from observed model behavior, not as a tautological fit or self-citation that reduces the central result to its inputs. No equations or derivations in the abstract or described claims show a 'prediction' that is statistically forced by construction from fitted parameters, nor does the argument rely on load-bearing self-citations or imported uniqueness theorems. The derivation remains self-contained, with the resemblance to return-point memory serving as an external analogy rather than an internal circular loop.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axiom · 1 invented entity

The central claim rests on the existence of gradient discontinuities created by the inverse-design training loop and on the assumption that athermal disordered packings can be driven to a manifold whose boundary encodes the training range.

axioms (1)

domain assumption: Athermal disordered sphere packings can be subjected to cyclic inverse design that tunes their elastic properties across a chosen range.
This is the experimental setup stated in the abstract.

invented entities (1)

marginally absorbing manifold (MAM) · no independent evidence
purpose: Encodes memory of the training range and produces return-point memory.
New state-space region introduced to explain the observed memory; no independent falsifiable prediction outside the model is given in the abstract.


Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean · washburn_uniqueness_aczel · unclear

Relation between the paper passage and the cited Recognition theorem: unclear.

Paper passage: "We further propose a general mechanism for the formation of MAMs and the corresponding memory that is based on gradient discontinuities in the trained quantities."

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

93 extracted references · 93 canonical work pages · 7 internal anchors

[1]

The rest of this section will detail and quantify these observations, and discuss the role of contact changes

return-point changes in parameters, 2) return-point 4 changes in particle positions, 3) the number of required optimization steps, and 4) a particular component of the change in parameters after training. The rest of this section will detail and quantify these observations, and discuss the role of contact changes. In Section IV, we will build a theory bas...

work page
[2]

Ease of training We begin by quantifying the apparent speedup in training observed in Fig. 1c-d. Figure 2a(i) shows the number of optimization steps,nsteps, to train the system from ν∗ max to ν∗ read. Note that since we always start at ν∗ max, readout data forν∗ read = ν∗ max is meaningless and are not shown. Importantly, for readout calculations, we trai...

work page
[3]

3(a) shows how the parameters (which again are species-level particle diameters) evolve during the 23 training cycles

Training dynamics To better understand what makes training easier after 23 cycles, Fig. 3(a) shows how the parameters (which again are species-level particle diameters) evolve during the 23 training cycles. While there are occasionally clear and dramatic parameter changes, e.g. during the 7th and 8th cycles, they eventually reach a steady state and change...

work page
[4]

We now define two return-point measurements, both of which also exhibit memory

Return-point memory nsteps and θ⊥ both show clear features atν∗ read = ν∗ min and ν∗ read = ν∗ max, indicating memory. We now define two return-point measurements, both of which also exhibit memory. First, we define ∆θRP = vuut 1 nsp nspX α=1 θ†† α − θ† α 2 , (4) where the sum runs over the nsp parameters, and θ† and θ†† are the parameters before and afte...

work page
[5]

training function

MAMs with different training ranges and target quantities Figure 2a(i-iv) demonstrates that cyclic training leads to a MAM, thus storing memory of ν∗ min and ν∗ max. Importantly, this result is not specific to this one example system or the choice of ν∗ min and ν∗ max. Figure 2b and c show similar data for systems with different training ranges for ν, fir...

work page
[6]

R. O. Duda and P. E. Hart, Pattern classification and scene analysis, A Wiley-interscience publication (1973)

work page 1973
[7]

D. H. Ackley, G. E. Hinton, and T. J. Sejnowski, A learning algorithm for boltzmann machines, Cognitive science 9, 147 (1985)

work page 1985
[8]

D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Learning internal representations by error propagation, Tech. Rep. (1985)

work page 1985
[9]

LeCun, L

Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, Gradient-based learning applied to document recognition, Proceedings of the IEEE86, 2278 (2002)

work page 2002
[10]

Krizhevsky, I

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, Advances in neural information processing systems 25 (2012)

work page 2012
[11]

LeCun, Y

Y. LeCun, Y. Bengio, and G. Hinton, Deep learning, nature 521, 436 (2015)

work page 2015
[12]

An overview of gradient descent optimization algorithms

S. Ruder, An overview of gradient descent optimization algorithms, arXiv preprint arXiv:1609.04747 (2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016
[13]

Bottou, F

L. Bottou, F. E. Curtis, and J. Nocedal, Optimization methods for large-scale machine learning, SIAM review 60, 223 (2018)

work page 2018
[14]

Scellier and Y

B. Scellier and Y. Bengio, Equilibrium propagation: Bridging the gap between energy-based models and backpropagation, Frontiers in computational neuroscience 11, 24 (2017)

work page 2017
[15]

Pashine, D

N. Pashine, D. Hexner, A. J. Liu, and S. R. Nagel, Directed aging, memory, and nature’s greed, Science advances5, eaax4215 (2019)

work page 2019
[16]

Stern, D

M. Stern, D. Hexner, J. W. Rocks, and A. J. Liu, Supervised learning in physical networks: From machine learning to learning machines, Physical Review X 11, 021045 (2021)

work page 2021
[17]

Stern and A

M. Stern and A. Murugan, Learning without neurons in physical systems, Annual Review of Condensed Matter Physics 14, 417 (2023)

work page 2023
[18]

Lopez-Pastor and F

V. Lopez-Pastor and F. Marquardt, Self-learning machines based on hamiltonian echo backpropagation, Physical Review X13, 031020 (2023)

work page 2023
[19]

V. R. Anisetti, B. Scellier, and J. M. Schwarz, Learning by non-interferingfeedbackchemical signalingin physical networks, Physical Review Research5, 023024 (2023)

work page 2023
[20]

V. R. Anisetti, A. Kandala, B. Scellier, and J. Schwarz, Frequency propagation: Multimechanism learning in nonlinear physical networks, Neural Computation 36, 596 (2024)

work page 2024
[21]

Dillavou, M

S. Dillavou, M. Stern, A. J. Liu, and D. J. Durian, Demonstration of decentralized physics-driven learning, Physical Review Applied18, 014040 (2022)

work page 2022
[22]

Stern, S

M. Stern, S. Dillavou, M. Z. Miskin, D. J. Durian, and A. J. Liu, Physical learning beyond the quasistatic limit, Physical Review Research4, L022037 (2022)

work page 2022
[23]

L. E. Altman, M. Stern, A. J. Liu, and D. J. Durian, Experimental demonstration of coupled learning in elastic networks, Physical Review Applied 22, 024053 (2024)

work page 2024
[24]

Dillavou, B

S. Dillavou, B. D. Beyer, M. Stern, A. J. Liu, M. Z. Miskin, and D. J. Durian, Machine learning without a processor: Emergent learning in a nonlinear analog network, Proceedings of the National Academy of Sciences 121, e2319718121 (2024)

work page 2024
[25]

Y. Tang, W. Ye, J. Jia, and Y. Chen, Learning stiffness tensors in self-activated solids via a local rule, Advanced Science 11, 2308584 (2024)

work page 2024
[26]

M. J. Falk, A. T. Strupp, B. Scellier, and A. Murugan, Temporal contrastive learning through implicit non- equilibrium memory, Nature Communications16, 2163 (2025)

work page 2025
[27]

Arinze, M

C. Arinze, M. Stern, S. R. Nagel, and A. Murugan, Learning to self-fold at a bifurcation, Physical Review E 107, 025001 (2023)

work page 2023
[28]

C. G. Evans, J. O’Brien, E. Winfree, and A. Murugan, Pattern recognition in the nucleation kinetics of non- equilibrium self-assembly, Nature625, 500 (2024)

work page 2024
[29]

Arzash, I

S. Arzash, I. Tah, A. J. Liu, and M. L. Manning, Rigidity of epithelial tissues as a double optimization problem, Phys. Rev. Res.7, 013157 (2025)

work page 2025
[30]

A. K. Behera, M. Rao, S. Sastry, and S. Vaikuntanathan, Enhanced associative memory, classification, and learning with active dynamics, Physical Review X13, 16 041043 (2023)

work page 2023
[31]

Souslov, V

J.Veenstra, C.Scheibner, M.Brandenbourger, J.Binysh, A. Souslov, V. Vitelli, and C. Coulais, Adaptive locomotion of active solids, Nature , 1 (2025)

work page 2025
[32]

M.ZuandC.P.Goodrich,Designingathermaldisordered solids with automatic differentiation, Communications Materials 5, 141 (2024)

work page 2024
[33]

M. Zu, A. Desai, and C. P. Goodrich, Fully independent response in disordered solids, Physical Review Letters 134, 238201 (2025)

work page 2025
[34]

Hirsch, C

M. Hirsch, C. Pugh, and M. Shub, Invariant manifolds (lecture notes in mathematics, 583) (1977)

work page 1977
[35]

J. A. Barker, D. E. Schreiber, B. G. Huth, and D. H. Everett, Magnetic hysteresis and minor loops: models and experiments, Proceedings of the Royal Society of London. A. Mathematical and Physical Sciences386, 251 (1983)

work page 1983
[36]

J. P. Sethna, K. Dahmen, S. Kartha, J. A. Krumhansl, B. W. Roberts, and J. D. Shore, Hysteresis and hierarchies: Dynamics of disorder-driven first-order phase transformations (1993)

work page 1993
[37]

Preisach, Über die magnetische nachwirkung, Zeitschrift für Physik94, 277 (1935)

F. Preisach, Über die magnetische nachwirkung, Zeitschrift für Physik94, 277 (1935)

work page 1935
[38]

N. C. Keim, J. D. Paulsen, Z. Zeravcic, S. Sastry, and S. R. Nagel, Memory formation in matter, Rev. Mod. Phys. 91, 035002 (2019)

work page 2019
[39]

J. D. Paulsen and N. C. Keim, Mechanical memories in solids, from disorder to design, Annual Review of Condensed Matter Physics16, 61 (2025), 2405.08158

work page arXiv 2025
[40]

J. D. Paulsen and N. C. Keim, Minimal descriptions of cyclic memories, Proceedings of the Royal Society A475, 20180874 (2019), 1809.09715

work page internal anchor Pith review Pith/arXiv arXiv 2019
[41]

J. D. Paulsen, N. C. Keim, and S. R. Nagel, Multiple transient memories in experiments on sheared non- brownian suspensions, Phys. Rev. Lett. 113, 068301 (2014)

work page 2014
[42]

Fiocco, G

D. Fiocco, G. Foffi, and S. Sastry, Encoding of memory in sheared amorphous solids, Phys. Rev. Lett.112, 025702 (2014)

work page 2014
[43]

N. C. Keim, J. D. Paulsen, and S. R. Nagel, Multiple transient memories in sheared suspensions: Robustness, structure, and routes to plasticity, Physical Review E88, 032306 (2013), 1307.1184

work page internal anchor Pith review Pith/arXiv arXiv 2013
[44]

Adhikari and S

M. Adhikari and S. Sastry, Memory formation in cyclically deformed amorphous solids and sphere assemblies, The European Physical Journal E 41, 105 (2018)

work page 2018
[45]

Mungan, S

M. Mungan, S. Sastry, K. Dahmen, and I. Regev, Networks and hierarchies: How amorphous materials learn to remember, Physical Review Letters123, 178002 (2019), 1905.09259

work page arXiv 2019
[46]

C. W. Lindeman and S. R. Nagel, Multiple memory formation in glassy landscapes, Science Advances 7, eabg7133 (2021)

work page 2021
[47]

R.Dinner, and M.L

D.R.Scheff, S.A.Redford, C.Lorpaiboon, S.Majumdar, A. R.Dinner, and M.L. Gardel, Actin filament alignment causes mechanical hysteresis in cross-linked networks, Soft Matter17, 5499 (2021)

work page 2021
[48]

D. J. Pine, J. P. Gollub, J. F. Brady, and A. M. Leshansky, Chaos and threshold for irreversibility in sheared suspensions, Nature438, 997 (2005)

work page 2005
[49]

Corté, P

L. Corté, P. M. Chaikin, J. P. Gollub, and D. J. Pine, Random organization in periodically driven systems, Nat. Phys.4, 420 (2008)

work page 2008
[50]

G. I. Menon and S. Ramaswamy, Universality class of the reversible-irreversible transition in sheared suspensions, Physical Review E79, 061108 (2009), 0801.3881

work page internal anchor Pith review Pith/arXiv arXiv 2009
[51]

Reversible plasticity in amorphous materials

M. Lundberg, K. Krishan, N. Xu, C. S. O’Hern, and M. Dennin, Reversible plastic events in amorphous materials, Physical Review E 77, 041505 (2008), 0707.4014

work page internal anchor Pith review Pith/arXiv arXiv 2008
[52]

N. C. Keim and P. E. Arratia, Mechanical and microscopic properties of the reversible plastic regime in a 2d jammed material, Phys. Rev. Lett.112, 028302 (2014)

work page 2014
[53]

N. C. Keim and D. Medina, Mechanical annealing and memories in a disordered solid, Science Advances 8, eabo1614 (2022), 2112.07008

work page arXiv 2022
[54]

N. C. Keim, J. Hass, B. Kroger, and D. Wieker, Global memory from local hysteresis in an amorphous solid, Phys. Rev. Res.2, 012004 (2020)

work page 2020
[55]

Bitzek, P

E. Bitzek, P. Koskinen, F. Gähler, M. Moseler, and P. Gumbsch, Structural Relaxation Made Simple, Physical Review Letters 97, 170201 (2006), publisher: American Physical Society

work page 2006
[56]

Guénolé, W

J. Guénolé, W. G. Nöhring, A. Vaid, F. Houllé, Z. Xie, A. Prakash, and E. Bitzek, Assessment and optimization of the fast inertial relaxation engine (fire) for energy minimization in atomistic simulations and its implementation in lammps, Computational Materials Science 175, 109584 (2020)

work page 2020
[57]

A. G. Baydin, B. A. Pearlmutter, A. A. Radul, and J. M. Siskind, Automatic differentiation in machine learning: a survey (2018), arXiv:1502.05767 [cs.SC]

work page internal anchor Pith review Pith/arXiv arXiv 2018
[58]

D. E. Rumelhart, G. E. Hinton, and R. J. Williams, Learning representations by back-propagating errors, Nature 323, 533 (1986)

work page 1986
[59]

R. E. Wengert, A simple automatic derivative evaluation program, Commun. ACM7, 463–464 (1964)

work page 1964
[60]

Note that these spikes are not observed in Ref. [27]. As will become clear in Section IV, this is due to the smooth attractive potential used in that paper

work page
[61]

N. C. Keim and S. R. Nagel, Generic transient memory formation in disordered systems with noise, Phys. Rev. Lett. 107, 010603 (2011)

work page 2011
[62]

Fiocco, G

D. Fiocco, G. Foffi, and S. Sastry, Memory effects in schematic models of glasses subjected to oscillatory deformation, Journal of Physics: Condensed Matter27, 194130 (2015)

work page 2015
[63]

Toiya, J

M. Toiya, J. Stambaugh, and W. Losert, Transient and oscillatory granular shear flow, Phys. Rev. Lett. 93, 088001 (2004)

work page 2004
[64]

J. Ren, J. A. Dijksman, and R. P. Behringer, Reynolds pressure and relaxation in a sheared granular system, Phys. Rev. Lett.110, 018302 (2013)

work page 2013
[65]

Laurson and M

L. Laurson and M. J. Alava, Dynamic hysteresis in cyclic deformation of crystalline solids, Phys. Rev. Lett.109, 155504 (2012)

work page 2012
[66]

M. D. Haw, W. C. K. Poon, P. N. Pusey, P. Hebraud, and F. Lequeux, Colloidal glasses under shear strain, Phys. Rev. E58, 4673 (1998)

work page 1998
[67]

M.Rubinstein, Nonmonotonic aging and memory retention in disordered mechanicalsystems,Phys.Rev.Lett

Y.Lahini, O.Gottesman, A.Amir,andS. M.Rubinstein, Nonmonotonic aging and memory retention in disordered mechanicalsystems,Phys.Rev.Lett. 118,085501(2017)

work page 2017
[68]

J. R. Royer and P. M. Chaikin, Precisely cyclic sand: Self-organization of periodically sheared frictional grains, Proceedings of the National Academy of Sciences112, 49 (2015). 17

work page 2015
[69]

M. O. Lavrentovich, A. J. Liu, and S. R. Nagel, Period proliferation in periodic states in cyclically sheared jammed solids, Phys. Rev. E96, 020101 (2017)

work page 2017
[70]

Mukherji, N

S. Mukherji, N. Kandula, A. K. Sood, and R. Ganapathy, Strength of mechanical memories is maximal at the yield point of a soft glass, Phys. Rev. Lett.122, 158001 (2019)

work page 2019
[71]

While we do not fully understand this, it is worth noting that nsteps is the weakest of our measurements since it depends on the details of the optimization algorithm, which are not controlled for

work page
[72]

In practice, the finite step size of any iterative ascent/descent algorithm will always result in some small amount of parameter drift

work page
[73]

Note that the red descent path happens to start very slightly to the left of the GD, so it flows leftwards, but it could just as easily have started to the right of the GD and flowed in the opposite direction, depending on where in the oscillation the ascent path stops

work page
[74]

They do not appear in our sphere packings, and we do not consider them further

Lower dimensional GDs are of course possible, but they are probably much less important due to the smaller chance of encountering them. They do not appear in our sphere packings, and we do not consider them further

work page
[75]

Adhikari, R

M. Adhikari, R. Sharma, and S. Karmakar, Encoding fast and fault-tolerant memories in bulk and nanoscale amorphous solids, Physical Review Letters134, 018202 (2025)

work page 2025
[76]

Alberch, From genes to phenotype: dynamical systems and evolvability., Genetica84, 5 (1991)

P. Alberch, From genes to phenotype: dynamical systems and evolvability., Genetica84, 5 (1991)

work page 1991
[77]

M. C. Cowperthwaite and L. A. Meyers, How mutational networks shape evolution: lessons from rna models, Annu. Rev. Ecol. Evol. Syst.38, 203 (2007)

work page 2007
[78]

M. Pigliucci, Genotype–phenotype mapping and the end of the ‘genes as blueprint’ metaphor, Philosophical Transactions of the Royal Society B: Biological Sciences 365, 557 (2010)

[79]

S. E. Ahnert, Structural properties of genotype–phenotype maps, Journal of The Royal Society Interface 14, 20170275 (2017)

[80]

S. M. Scheiner, Genetics and evolution of phenotypic plasticity, Annual Review of Ecology and Systematics 24, 35 (1993)


