pith. sign in

arxiv: 2411.17692 · v2 · submitted 2024-11-26 · 🧬 q-bio.NC · cs.IT· math.IT· physics.bio-ph

Quantifying information stored in synaptic connections rather than in firing activities of neural networks

Pith reviewed 2026-05-23 17:14 UTC · model grok-4.3

classification 🧬 q-bio.NC cs.ITmath.ITphysics.bio-ph
keywords synaptic informationmutual informationHebbian networksautoassociative memorylog-normal distributionssynergistic interactionsinformation storage
0
0 comments X

The pith

Synaptic connections store quantifiable information via mutual information with data patterns, with joint encoding exceeding the sum of individual parts.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper builds a framework to measure information held directly in the strengths of synaptic connections in a neural network instead of in neuron firing patterns. It models networks that learn associations through Hebbian updates and treats the patterns to be remembered as following log-normal statistics. Analytical approximations are derived for the mutual information shared between the patterns and either one synapse, pairs of synapses, or larger groups. The calculations reveal that the total information captured when all synapses are considered together is larger than what would be obtained by simply adding up the contributions from each synapse separately. This provides a concrete method to track how much of the input data ends up encoded in the connection weights themselves.

Core claim

Using densely connected Hebbian networks for autoassociative memory with log-normal data patterns, analytical approximations are obtained for Shannon mutual information between the data and single synaptic connections, pairs, and arbitrary n-tuples. These approximations demonstrate synergistic interactions in which the joint information encoded by groups of synapses exceeds the sum of their individual contributions, while also aligning with known limits on pattern storage capacity and the distributed nature of coding in neural activity.

What carries the argument

Analytical approximations for Shannon mutual information between stored data patterns and n-tuples of synaptic connection strengths

Load-bearing premise

The calculations assume that input data patterns follow log-normal distributions and that the networks are densely connected Hebbian autoassociators.

What would settle it

Direct measurements of mutual information between actual synaptic weight distributions and input patterns in a biological or simulated network with non-log-normal statistics would fail to match the derived approximations if the synergy effect does not appear.

Figures

Figures reproduced from arXiv: 2411.17692 by Shreesh P Mysore, Xinhao Fan.

Figure 1
Figure 1. Figure 1: Model setup. (A) Real-world distribution modelled as several independent patterns [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Information and the number of data patterns. For each [PITH_FULL_IMAGE:figures/full_fig_p009_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Information and the number of synaptic connections in an ensemble. For each [PITH_FULL_IMAGE:figures/full_fig_p011_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Information contributed per synaptic connection as a function of ensemble size. [PITH_FULL_IMAGE:figures/full_fig_p013_4.png] view at source ↗
read the original abstract

A cornerstone of our understanding of both biological and artificial neural networks is that they store information in the strengths of synaptic connections among the neurons. However, in contrast to the well-established theory for quantifying information encoded by the firing activity of neural networks, there does not exist a framework for quantifying information stored in the network's connection distribution itself. Here, we develop a theoretical framework for synaptic information by using densely connected Hebbian networks performing autoassociative memory tasks and by modeling data patterns to be stored as log-normal distributions. Specifically, we derive analytical approximations for Shannon mutual information between the data and singletons, pairs, and arbitrary n-tuples of synaptic connections within the network. Our framework corroborates well-established insights regarding pattern storage capacity, supports the principle of distributed coding in neural firing activities, and formalizes the heterogeneity inherent in information encoding across synapses in a network. Notably, it discovers synergistic interactions among synapses, revealing that the information encoded jointly by all the synapses exceeds the 'sum of its parts'. Taken together, this study introduces a powerful, interpretable framework for quantitatively understanding information storage in the synapses of neural networks, one that illustrates the duality of synaptic connectivity and neural population activity in learning and memory.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript develops a theoretical framework for quantifying Shannon mutual information stored in synaptic connections (rather than firing rates) of neural networks. It restricts attention to densely connected Hebbian autoassociative memory networks whose patterns are drawn from log-normal distributions, derives analytical approximations for the mutual information between the stored data and single synapses, pairs of synapses, and arbitrary n-tuples, and reports synergistic effects in which the joint information exceeds the sum of the marginal contributions. The framework is claimed to corroborate known capacity limits, support distributed coding, and formalize synaptic heterogeneity.

Significance. If the closed-form approximations prove accurate and the reported synergy is not an artifact of the two modeling choices, the work would supply a concrete, interpretable measure of information stored in connectivity that complements existing activity-based information theory. It would also supply a quantitative language for heterogeneity across synapses. No machine-checked proofs or reproducible code are supplied, but the attempt to obtain parameter-light analytic expressions for higher-order synaptic MI is a positive feature.

major comments (2)
  1. [Abstract and derivations section] Abstract and § on derivations: the analytical MI approximations are obtained only after imposing log-normal pattern statistics and a dense Hebbian outer-product rule; the manuscript contains no section that recomputes the same quantities (or the sign of the synergy) under Gaussian patterns, sparse binary patterns, or alternative plasticity rules. Because the MI quantities are functionals of the joint distribution induced by these two decisions, the synergy result is not shown to be robust, which is load-bearing for the claim that the framework reveals a general property of synaptic information storage.
  2. [Abstract] Abstract: the text states that 'analytical approximations exist' yet supplies neither the explicit expressions, their error bounds, nor any numerical validation against direct Monte-Carlo estimates of the mutual information. Without these steps it is impossible to judge the accuracy of the claimed closed forms or to rule out post-hoc fitting of the log-normal parameters.
minor comments (2)
  1. Notation for the n-tuple MI is introduced without an explicit recursive definition or pseudocode, making it difficult to verify the higher-order formulas.
  2. The manuscript would benefit from a short table comparing the analytic MI values to numerically estimated values for small networks.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major point below and indicate the revisions planned for the next version of the manuscript.

read point-by-point responses
  1. Referee: [Abstract and derivations section] Abstract and § on derivations: the analytical MI approximations are obtained only after imposing log-normal pattern statistics and a dense Hebbian outer-product rule; the manuscript contains no section that recomputes the same quantities (or the sign of the synergy) under Gaussian patterns, sparse binary patterns, or alternative plasticity rules. Because the MI quantities are functionals of the joint distribution induced by these two decisions, the synergy result is not shown to be robust, which is load-bearing for the claim that the framework reveals a general property of synaptic information storage.

    Authors: We agree that the derivations and the reported synergy are obtained specifically under log-normal pattern statistics and the dense Hebbian outer-product rule. The manuscript presents a framework for this class of networks rather than claiming universality across all pattern distributions or plasticity rules. To address the robustness concern, the revised manuscript will include an expanded discussion of the modeling choices and a new subsection with numerical checks of the mutual-information approximations and synergy sign under Gaussian patterns (while retaining the analytic focus on the log-normal case). revision: yes

  2. Referee: [Abstract] Abstract: the text states that 'analytical approximations exist' yet supplies neither the explicit expressions, their error bounds, nor any numerical validation against direct Monte-Carlo estimates of the mutual information. Without these steps it is impossible to judge the accuracy of the claimed closed forms or to rule out post-hoc fitting of the log-normal parameters.

    Authors: We acknowledge that the abstract refers to the existence of analytical approximations without displaying the explicit forms, error bounds, or Monte-Carlo validation. The revised manuscript will move the leading closed-form expressions into the abstract and main text, include the derived error bounds, and add a dedicated validation section that compares the analytic mutual-information values against direct Monte-Carlo estimates for the same parameter regimes. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivations are explicit model-based approximations

full rationale

The paper starts from Shannon MI definitions and derives closed-form approximations under two explicit modeling choices (log-normal pattern statistics and dense Hebbian outer-product storage). These choices induce a joint distribution from which the MI expressions are computed; the resulting formulas are not equivalent to the inputs by construction, nor are any quantities fitted and then relabeled as predictions. No self-citation chains, uniqueness theorems, or ansatzes smuggled via prior work are invoked as load-bearing steps. The framework is therefore self-contained against its stated assumptions, and the reported synergy is a direct consequence of dependence structure in the induced distribution rather than a tautology.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The central claim depends on two modeling choices whose justification is not supplied in the abstract: log-normal distribution of data patterns and the densely connected Hebbian autoassociative architecture. No new physical entities are introduced.

free parameters (1)
  • log-normal distribution parameters
    Shape and scale parameters of the log-normal used to model data patterns; these control the statistics from which mutual information is derived.
axioms (2)
  • domain assumption Shannon mutual information is the appropriate measure for quantifying information stored in synaptic weights.
    Invoked when the framework is defined as mutual information between data and connection strengths.
  • domain assumption Hebbian learning in a densely connected network is a sufficient model for autoassociative memory.
    Used to generate the connection distribution whose information content is then quantified.

pith-pipeline@v0.9.0 · 5754 in / 1374 out tokens · 38055 ms · 2026-05-23T17:14:36.106351+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

44 extracted references · 44 canonical work pages · 1 internal anchor

  1. [1]

    & Theunissen, F

    Borst, A. & Theunissen, F. E. Information theory and neural coding.Nature neuroscience 2, 947–957 (1999)

  2. [2]

    & Panzeri, S

    Quian Quiroga, R. & Panzeri, S. Extracting information from neuronal populations: information theory and decoding approaches. Nature Reviews Neuroscience 10, 173– 185 (2009)

  3. [3]

    G., Lazar, A

    Dimitrov, A. G., Lazar, A. A. & Victor, J. D. Information theory in neuroscience.Journal of computational neuroscience 30, 1–5 (2011)

  4. [4]

    Timme, N. M. & Lapish, C. A tutorial for information theory in neuroscience. eneuro 5 (2018)

  5. [5]

    & Rieke, F

    Bialek, W. & Rieke, F. Reliability and information transmission in spiking neurons. Trends in neurosciences15, 428–434 (1992)

  6. [6]

    E., Marre, O., Berry, M

    Palmer, S. E., Marre, O., Berry, M. J. & Bialek, W. Predictive information in a sensory population. Proceedings of the National Academy of Sciences 112, 6908–6913 (2015)

  7. [7]

    Self-organization in a perceptual network

    Linsker, R. Self-organization in a perceptual network. Computer 21, 105–117 (1988)

  8. [8]

    & Zaslavsky, N

    Tishby, N. & Zaslavsky, N. Deep learning and the information bottleneck principle, 1–5 (IEEE, 2015)

  9. [9]

    Hebb, D. O. The organization of behavior: A neuropsychological theory (Wiley, New York, 1949)

  10. [10]

    & LeDoux, J

    Lamprecht, R. & LeDoux, J. Structural plasticity and memory. Nature Reviews Neuro- science 5, 45–54 (2004)

  11. [11]

    & K ¨otter, R

    Sporns, O. & K ¨otter, R. Motifs in brain networks. PLoS biology 2, e369 (2004)

  12. [12]

    & Latora, V

    Battiston, F., Nicosia, V ., Chavez, M. & Latora, V . Multilayer motif analysis of brain networks. Chaos: An Interdisciplinary Journal of Nonlinear Science 27 (2017)

  13. [13]

    Baeg, E. H. et al. Learning-induced enduring changes in functional connectivity among prefrontal cortical neurons. Journal of Neuroscience 27, 909–918 (2007). 24

  14. [14]

    Bassett, D. S. et al. Dynamic reconfiguration of human brain networks during learning. Proceedings of the National Academy of Sciences 108, 7641–7646 (2011)

  15. [15]

    & Tsodyks, M

    Mongillo, G., Barak, O. & Tsodyks, M. Synaptic theory of working memory. Science 319, 1543–1546 (2008)

  16. [16]

    Stokes, M. G. ‘activity-silent’working memory in prefrontal cortex: a dynamic coding framework. Trends in cognitive sciences 19, 394–405 (2015)

  17. [17]

    Panichello, M. F. et al. Intermittent rate coding and cue-specific ensembles support working memory. Nature 1–8 (2024)

  18. [18]

    Hinton, G. E. & Van Camp, D. Keeping the neural networks simple by minimizing the description length of the weights, 5–13 (1993)

  19. [19]

    Emergence of Invariance and Disentanglement in Deep Representations

    Achille, A. & Soatto, S. On the emergence of invariance and disentangling in deep representations. arXiv preprint arXiv:1706.01350 125, 14 (2017)

  20. [20]

    & Sommer, F

    Knoblauch, A., Palm, G. & Sommer, F. T. Memory capacities for synaptic and structural plasticity. Neural Computation 22, 289–341 (2010)

  21. [21]

    J., Buneman, O

    Willshaw, D. J., Buneman, O. P. & Longuet-Higgins, H. C. Non-holographic associative memory. Nature 222, 960–962 (1969)

  22. [22]

    Associative memory: on the (puzzling) sparse coding limit

    Nadal, J.-P. Associative memory: on the (puzzling) sparse coding limit. Journal of Physics A: Mathematical and General 24, 1093 (1991)

  23. [23]

    & Kurfess, F

    Bosch, H. & Kurfess, F. J. Information storage capacity of incompletely connected associative memories. Neural Networks 11, 869–876 (1998)

  24. [24]

    Hopfield, J. J. Neurons with graded response have collective computational properties like those of two-state neurons. Proceedings of the national academy of sciences 81, 3088–3092 (1984)

  25. [25]

    The sum of log-normal probability distributions in scatter transmission sys- tems

    Fenton, L. The sum of log-normal probability distributions in scatter transmission sys- tems. IRE Transactions on communications systems 8, 57–67 (1960)

  26. [26]

    & Wagner, A

    Rissman, J. & Wagner, A. D. Distributed representations in memory: insights from functional brain imaging. Annual review of psychology 63, 101–128 (2012)

  27. [27]

    Stecker, G. C. & Middlebrooks, J. C. Distributed coding of sound locations in the audi- tory cortex. Biological cybernetics 89, 341–349 (2003)

  28. [28]

    A., Zatka-Haas, P., Carandini, M

    Steinmetz, N. A., Zatka-Haas, P., Carandini, M. & Harris, K. D. Distributed coding of choice, action and engagement across the mouse brain. Nature 576, 266–273 (2019)

  29. [29]

    & Jacques, J

    Abu-Mostafa, Y . & Jacques, J. S. Information capacity of the hopfield model. IEEE Transactions on Information Theory 31, 461–464 (1985)

  30. [30]

    J., Gutfreund, H

    Amit, D. J., Gutfreund, H. & Sompolinsky, H. Storing infinite numbers of patterns in a spin-glass model of neural networks. Physical Review Letters 55, 1530 (1985). 25

  31. [31]

    R., Treves, A

    Panzeri, S., Schultz, S. R., Treves, A. & Rolls, E. T. Correlations and the encoding of information in the nervous system. Proceedings of the Royal Society of London. Series B: Biological Sciences 266, 1001–1012 (1999)

  32. [32]

    & Berry, M

    Schneidman, E., Bialek, W. & Berry, M. J. Synergy, redundancy, and independence in population codes. Journal of Neuroscience 23, 11539–11553 (2003)

  33. [33]

    Luppi, A. I. et al. A synergistic core for human brain evolution and cognition. Nature Neuroscience 25, 771–782 (2022)

  34. [34]

    A., Zylberberg, J

    Cayco-Gajic, N. A., Zylberberg, J. & Shea-Brown, E. Triplet correlations among simi- larly tuned cells impact population coding. Frontiers in computational neuroscience 9, 57 (2015)

  35. [35]

    Kafashan, M. et al. Scaling of sensory information in large neural populations shows signatures of information-limiting correlations. Nature communications 12, 473 (2021)

  36. [36]

    & Fitzgerald, J

    Sun, W., Advani, M., Spruston, N., Saxe, A. & Fitzgerald, J. E. Organizing memories for generalization in complementary learning systems. Nature neuroscience 26, 1438–1448 (2023)

  37. [37]

    & Toyoizumi, T

    Kang, L. & Toyoizumi, T. Distinguishing examples while building concepts in hip- pocampal and artificial networks. Nature Communications 15, 647 (2024)

  38. [38]

    Durstewitz, D., Seamans, J. K. & Sejnowski, T. J. Neurocomputational models of work- ing memory. Nature neuroscience 3, 1184–1191 (2000)

  39. [39]

    Dong, D. W. & Hopfield, J. J. Dynamic properties of neural networks with adapting synapses. Network: Computation in Neural Systems 3, 267 (1992)

  40. [40]

    Olfactory computation and object perception

    Hopfield, J. Olfactory computation and object perception. Proceedings of the National Academy of Sciences 88, 6462–6466 (1991)

  41. [41]

    & Grassberger, P

    Kraskov, A., St ¨ogbauer, H. & Grassberger, P. Estimating mutual information. Physical Review E—Statistical, Nonlinear, and Soft Matter Physics 69, 066138 (2004)

  42. [42]

    & Mumford, D

    Huang, J. & Mumford, D. Statistics of natural images and models , V ol. 1, 541–547 (IEEE, 1999)

  43. [43]

    1/f noise’in music and speech

    V oss RFClarke, J. 1/f noise’in music and speech. Nature 258, 317318 (1975)

  44. [44]

    Piantadosi, S. T. Zipf’s word frequency law in natural language: A critical review and future directions. Psychonomic bulletin & review 21, 1112–1130 (2014). 26