Retroactive Interference Model of Power-Law Forgetting
Pith reviewed 2026-05-24 18:30 UTC · model grok-4.3
The pith
A retroactive interference model with one free integer parameter reproduces power-law forgetting in memory experiments.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We have devised a phenomenological model that is based on the principle of retroactive interference, driven by a multi-dimensional valence measure for acquired memories. The model has only one free integer parameter and can be solved analytically. Recognition experiments with long streams of words result in a good match to a five-dimensional version of the model.
What carries the argument
The multi-dimensional valence measure that quantifies similarity between memories and thereby sets the strength of retroactive interference.
If this is right
- Retention probability follows an explicit power-law form derived from the cumulative interference count in each dimension.
- Older memories become progressively more stable because the number of potential interfering successors saturates.
- The forgetting exponent is determined solely by the integer dimensionality parameter.
- The analytical solution gives retention at arbitrary times without numerical simulation.
Where Pith is reading between the lines
- If the valence dimensions map onto measurable semantic or perceptual features, the model predicts different forgetting rates for stimuli that differ in feature overlap.
- The single-parameter structure implies that power-law forgetting may appear generically whenever memories are represented in a space with finite dimensionality and similarity-based interference.
- Testing the model with controlled stimulus sets that vary the effective number of dimensions could directly measure the integer parameter from data.
Load-bearing premise
Retroactive interference driven by similarity in a multi-dimensional valence measure is the dominant mechanism producing the observed power-law forgetting.
What would settle it
An experiment in which new items are constructed to have zero similarity to prior items in every valence dimension, yet power-law forgetting is still observed, would falsify the model.
Figures
read the original abstract
Memory and forgetting constitute two sides of the same coin, and although the first has been rigorously investigated, the latter is often overlooked. A number of experiments under the realm of psychology and experimental neuroscience have described the properties of forgetting in humans and animals, showing that forgetting exhibits a power-law relationship with time. These results indicate a counter-intuitive property of forgetting, namely that old memories are more stable than younger ones. We have devised a phenomenological model that is based on the principle of retroactive interference, driven by a multi-dimensional valence measure for acquired memories. The model has only one free integer parameter and can be solved analytically. We performed recognition experiments with long streams of words were performed, resulting in a good match to a five-dimensional version of the model.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a phenomenological model of forgetting based on retroactive interference driven by a multi-dimensional valence measure. It asserts that the model admits an analytical solution, has only one free integer parameter (the dimension count D), and yields a good match to recognition data from long word streams when instantiated in five dimensions.
Significance. If the analytical derivation is valid and the single free-parameter claim holds with D fixed independently of the fitting data, the work would supply a parsimonious account linking power-law forgetting to interference in a valence space, with potential relevance to memory models in psychology and neuroscience.
major comments (2)
- [Abstract] Abstract: The claim that the model has 'only one free integer parameter' is load-bearing for the central parsimony argument, yet the five-dimensional version is selected to match the experimental curves; if D=5 is not fixed a priori by independent theory or stimulus properties, the model effectively incorporates an additional tunable integer parameter, directly undermining the reported analytical prediction of power-law forgetting.
- [Abstract] Abstract: The abstract asserts an analytical solution and good data match but supplies no derivation steps, error analysis, exclusion criteria for the recognition experiments, or explicit equations showing how retroactive interference in D dimensions produces the power-law form; this gap prevents verification that the math supports the stated claims.
Simulated Author's Rebuttal
We thank the referee for their detailed review. We respond point-by-point to the major comments below, clarifying the role of the parameter D and the scope of the abstract while indicating planned revisions.
read point-by-point responses
-
Referee: [Abstract] Abstract: The claim that the model has 'only one free integer parameter' is load-bearing for the central parsimony argument, yet the five-dimensional version is selected to match the experimental curves; if D=5 is not fixed a priori by independent theory or stimulus properties, the model effectively incorporates an additional tunable integer parameter, directly undermining the reported analytical prediction of power-law forgetting.
Authors: The model is formulated for arbitrary positive integer D, with the analytical derivation showing that retroactive interference in D-dimensional valence space produces power-law forgetting whose exponent depends only on D. D is the sole free integer parameter; once chosen, no additional parameters are required to generate the functional form or to compare with data. We selected D=5 as the value providing the closest match to the word-stream recognition curves, but the power-law prediction itself is independent of that specific choice. We will revise the abstract to state explicitly that D is the single free parameter, selected by fit quality, while the analytical result holds for any integer D. revision: partial
-
Referee: [Abstract] Abstract: The abstract asserts an analytical solution and good data match but supplies no derivation steps, error analysis, exclusion criteria for the recognition experiments, or explicit equations showing how retroactive interference in D dimensions produces the power-law form; this gap prevents verification that the math supports the stated claims.
Authors: The abstract is a high-level summary. The full analytical derivation, including the explicit multi-dimensional interference equations that reduce to the observed power-law form, appears in the Methods section. Error analysis of the fits and the exclusion criteria applied to the recognition trials are reported in the Results and supplementary material. To improve accessibility, we will expand the abstract by one sentence referencing the key interference equation and directing readers to the main text for the complete derivation and experimental details. revision: yes
Circularity Check
Dimensionality selection functions as fitted parameter, making 'one free integer parameter' claim and data match non-independent
specific steps
-
fitted input called prediction
[Abstract]
"The model has only one free integer parameter and can be solved analytically. We performed recognition experiments with long streams of words were performed, resulting in a good match to a five-dimensional version of the model."
The single free integer parameter is the dimension, whose value (5) is selected to optimize agreement with the recognition experiment data. The 'good match' is therefore achieved by tuning the parameter to the target curves rather than predicting an independent outcome from the analytical solution.
full rationale
The abstract asserts a model with only one free integer parameter that is solved analytically and yields a good match in its five-dimensional version to recognition data. The dimension count is the integer parameter, and its value of 5 is chosen to produce the reported agreement with the experimental curves. This reduces the claimed match and parsimony to a post-hoc fit rather than an independent prediction from the derivation. The analytical solution itself may be non-circular, but the load-bearing claim of a single free parameter plus good match is undermined by this selection process.
Axiom & Free-Parameter Ledger
free parameters (1)
- dimension count D
axioms (1)
- domain assumption New memories interfere retroactively with older ones in proportion to their similarity in valence space
invented entities (1)
-
multi-dimensional valence measure
no independent evidence
Reference graph
Works this paper leans on
-
[1]
B. A. Richards, P. W. Frankland, The Persistence and Transience of Memory (2017). doi:10.1016/j.neuron.2017.04.037
-
[2]
Ebbinghaus, Memory: A contribution to experimental psychology., Dover, New York, 1964
H. Ebbinghaus, Memory: A contribution to experimental psychology., Dover, New York, 1964
work page 1964
-
[3]
J. Wixted, Analyzing the empirical course of forgetting, Journal of Experimental Psychology: Learning, Memory, and Cognition 16 (1990) 927–935. doi:10.1037/0278-7393.16.5.927
-
[4]
J. T. Wixted, E. B. Ebbesen, On the form of forgetting, Psychological Science 2 (6) (1991) 409–415. arXiv:https://doi.org/10.1111/ j.1467-9280.1991.tb00175.x, doi:10.1111/j.1467-9280.1991.tb00175.x. URL https://doi.org/10.1111/j.1467-9280.1991.tb00175.x
-
[5]
M. J. Kahana, M. Adler, Note on the power law of forgetting, bioRxiv. arXiv:https://www.biorxiv.org/content/early/2017/08/ 09/173765.full.pdf, doi:10.1101/173765. URL https://www.biorxiv.org/content/early/2017/08/09/173765
-
[6]
J. S. Fisher, G. Radvansky, Patterns of forgetting, Journal of Memory and Language 102 (2018) 130–141. doi:10.1016/j.jml.2018.05. 008
-
[7]
J. P. Nadal, G. Toulouse, J. P. Changeux, S. Dehaene, Networks of formal neurons and memory palimpsests, Epl 1 (10) (1986) 535–542. doi:10.1209/0295-5075/1/10/008
-
[8]
J. T. Wixted, The psychology and neuroscience of forgetting, Annu. Rev. Psychol 55 (2004) 235–69. doi:10.1146/annurev.psych.55. 090902.141555. URL www.annualreviews.org
-
[9]
J. T. Wixted, On common ground: Jost’s (1897) law of forgetting and Ribot’s (1881) law of retrograde amnesia (2004). doi:10.1037/ 0033-295X.111.4.864
work page 2004
-
[10]
G. D. A. Brown, S. Lewandowsky, Forgetting in memory models: Arguments against trace decay and consolidation failure, Forgetting (2010) 49–75doi:10.4324/9780203851647
-
[11]
t. K. Landauer, How much do people remember? some estimates of the quantity of learned information in long-term memory, Cognitive Science 10 (4) (1986) 477–493. doi:10.1016/S0364-0213(86)80014-3 . URL https://www.sciencedirect.com/science/article/pii/S0364021386800143
-
[12]
L. Standing, Learning 10 000 pictures, Quarterly Journal of Experimental Psychology 25 (I 973) (1973) 207–222
work page 1973
-
[13]
Oxford Univer- sity Press (2018).https://doi.org/10.1093/oso/9780198814788.001.0001
N. Cowan, C. C. Morey, Z. Chen, M. Bunting, What do estimates of working memory capacity tell us?, in: The Cognitive Neuroscience of Working Memory, Oxford University Press, 2007, pp. 43–58. doi:10.1093/acprof:oso/9780198570394.003.0003. URL http://www.oxfordscholarship.com/view/10.1093/acprof:oso/9780198570394.001.0001/ acprof-9780198570394-chapter-3
work page doi:10.1093/acprof:oso/9780198570394.003.0003 2007
-
[14]
A. G. Huth, W. A. de Heer, T. L. Gri ffiths, F. E. Theunissen, J. L. Gallant, Natural speech reveals the semantic maps that tile human cerebral cortex, Nature 532 (7600) (2016) 453–458. doi:10.1038/nature17637. URL http://www.nature.com/doifinder/10.1038/nature17637
-
[15]
M. K. Healey, P. Crutchley, M. J. Kahana, Individual differences in memory search and their relation to intelligence, Journal of Experimental Psychology: General 143 (4) (2014) 1553–1569. doi:10.1037/a0036306
-
[16]
D. A. Medler, J. R. Binder, MCWord: An on-line orthographic database of the English language (2005). 9 Georgiou et al. / (2019) 1–11 10 Appendix A. Solution of Kahana model We analyze the version of Kahana model [5] with linear decay of memory strength: S (t) = a− bt with positive random coefficients a and b. Other types of passive decay produce similar res...
work page 2005
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.