Statistical Patterns in the Equations of Physics and the Emergence of a Meta-Law of Nature

Andrei Constantin; Deaglan Bartlett; Harry Desmond; Pedro G. Ferreira

arxiv: 2408.11065 · v2 · submitted 2024-08-12 · ⚛️ physics.soc-ph · cs.CL· hep-th· physics.data-an· physics.hist-ph

Statistical Patterns in the Equations of Physics and the Emergence of a Meta-Law of Nature

Andrei Constantin , Deaglan Bartlett , Harry Desmond , Pedro G. Ferreira This is my paper

Pith reviewed 2026-05-23 21:50 UTC · model grok-4.3

classification ⚛️ physics.soc-ph cs.CLhep-thphysics.data-anphysics.hist-ph

keywords physics equationsmathematical operatorsexponential decaymeta-lawsymbolic regressionstatistical patterns

0 comments

The pith

The frequency of mathematical operators in physics equations follows an exponential decay law rather than the power law of natural language.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines four collections of equations drawn from physics to identify statistical regularities in how mathematical operators appear. It reports that these frequencies decay exponentially, which the authors interpret as evidence for a meta-law that may combine efficient communication with constraints set by nature. A reader would care because the pattern promises to shrink the enormous space of candidate expressions when computers attempt to recover physical laws from data. The result is presented as distinct from the power-law distributions familiar from word counts in ordinary language.

Core claim

By analysing four corpora of physics equations and applying advanced implicit-likelihood techniques, the frequency of mathematical operators is shown to follow an exponential decay law. This stands in contrast to Zipf's power law for word frequencies in natural languages and is offered as a statistical meta-law of physics that may reflect both communication efficiency and constraints imposed by Nature itself.

What carries the argument

The exponential decay law fitted to the frequency distribution of mathematical operators parsed from the four physics equation corpora.

If this is right

The meta-law can drastically narrow the space of physically plausible expressions during symbolic regression.
It supplies a prior that may improve language models tasked with generating coherent mathematical representations.
The pattern supports further automation of physical law discovery.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same analysis could be repeated on equation collections from chemistry or biology to test whether the exponential form is physics-specific.
If the decay law holds, equation-generation algorithms could be biased toward operator sets that respect the observed frequencies.
The finding invites checks on whether the exponential shape survives when equations are filtered by subfield or by publication date.

Load-bearing premise

The four corpora are representative samples of physical equations without selection bias from source choice, equation parsing rules, or domain coverage.

What would settle it

An independent corpus of physics equations in which operator frequencies follow a power-law distribution instead of exponential decay.

Figures

Figures reproduced from arXiv: 2408.11065 by Andrei Constantin, Deaglan Bartlett, Harry Desmond, Pedro G. Ferreira.

**Figure 2.** Figure 2: FIG. 2: Distribution of expression complexity in the three corpora, which approximately corresponds to the number [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: FIG. 3: Comparison between the [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: FIG. 4: Posterior distributions of the fits to the different corpora, where we compare a Zipf (Eq. ( [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: FIG. 5: Frequency of operators in the Planck formula [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗

read the original abstract

Physics seeks to uncover the laws of Nature and express them through mathematical equations. Despite the vast diversity of natural phenomena, physical equations exhibit structural regularities that set them apart from arbitrary mathematical expressions. While principles such as dimensional analysis have long guided the formulation of physical models, the exploration of more subtle statistical patterns within the equations of physics remains an open question. Here, by analysing four corpora of physics equations and applying advanced implicit-likelihood techniques, we find that the frequency of mathematical operators follows an exponential decay law, in contrast to Zipf's power law for word frequencies in natural languages. This reveals a statistical meta-law of physics, possibly reflecting a combination of communication efficiency and constraints imposed by Nature itself. The meta-law offers practical benefits for symbolic regression by drastically narrowing down the space of physically plausible expressions. More broadly, it may inform the development of language models that can generate coherent mathematical representations, advancing the automation of physical law discovery.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper finds an exponential decay in how often operators appear in physics equations, which could help prune symbolic regression searches, but the abstract gives almost no information on how the corpora were built or operators counted so the result is hard to trust yet.

read the letter

The main new thing here is the claim that operator frequencies in physics equations decay exponentially rather than following the power-law pattern seen in natural language. They used implicit-likelihood methods on four corpora and suggest this acts as a meta-law that could shrink the space of expressions worth trying in automated discovery. That contrast with Zipf's law is the clearest point of difference from prior statistical work on equations or language. If the pattern holds up, the practical payoff for symbolic regression is real even if narrow. The abstract does not describe the corpora sources, the exact definition of an operator, whether counts are per equation or global, or how the exponential form was chosen over alternatives, so the fit could easily be sensitive to those choices. The stress-test concern about selection and parsing artifacts looks like it lands on the current text. Without those details the meta-law label feels premature. This is the kind of empirical observation that belongs in a methods-heavy journal or an AI-for-science venue rather than a general physics outlet. Readers working on equation discovery tools would get the most out of it once the robustness checks are in place. It deserves a serious referee to check the data pipeline and fitting procedure rather than a desk reject.

Referee Report

3 major / 2 minor

Summary. The paper analyzes four corpora of physics equations using implicit-likelihood techniques and reports that the frequency distribution of mathematical operators follows an exponential decay law, in contrast to Zipf's power law observed for words in natural language; this pattern is interpreted as a statistical meta-law of physics arising from communication efficiency and natural constraints, with proposed applications to narrowing the search space in symbolic regression.

Significance. If the reported exponential regularity proves robust and independent of corpus construction choices, it would constitute a novel empirical regularity distinguishing physical equations from arbitrary mathematical expressions or linguistic patterns, potentially offering a data-driven prior for symbolic regression and automated law discovery.

major comments (3)

[Abstract and Methods] Abstract and Methods (corpus construction paragraph): the central claim that the exponential form is a genuine meta-law requires explicit documentation of the four corpora (sources, subfield coverage, selection criteria) and operator tokenization rules (which symbols count as operators, per-equation vs. global counts); without these, it is impossible to rule out selection or parsing artifacts as the source of the observed decay.
[Results] Results section (implicit-likelihood fitting): the exponential model is obtained by fitting to the observed frequencies; the downstream claim that this meta-law can narrow symbolic regression therefore reduces to reusing the fitted decay constant on the same data, creating a circularity that undermines the asserted practical benefit.
[Results] Results (model comparison): no quantitative comparison (e.g., likelihood ratio or cross-validation) is shown between the exponential fit and alternatives such as power-law or log-normal forms on the same count data; given the sparsity typical of operator frequencies, this leaves open whether the exponential is preferred by the data or by the fitting procedure itself.

minor comments (2)

[Introduction] Introduction: add a brief explicit comparison of the operator tokenization used here versus standard Zipf analyses in linguistics to strengthen the claimed contrast.
[Figures] Figure captions: ensure all panels report both the fitted decay constant and its uncertainty so readers can assess the precision of the meta-law.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their constructive comments, which help clarify key aspects of the work. We address each major comment below.

read point-by-point responses

Referee: [Abstract and Methods] Abstract and Methods (corpus construction paragraph): the central claim that the exponential form is a genuine meta-law requires explicit documentation of the four corpora (sources, subfield coverage, selection criteria) and operator tokenization rules (which symbols count as operators, per-equation vs. global counts); without these, it is impossible to rule out selection or parsing artifacts as the source of the observed decay.

Authors: We agree that more explicit documentation is necessary. In the revised manuscript, we have added a new subsection in the Methods section that fully documents the four corpora, including their sources, subfield coverage, and selection criteria. We have also detailed the operator tokenization rules, specifying the symbols counted as operators and the counting approach (per-equation versus global). These additions allow for better assessment of potential artifacts. revision: yes
Referee: [Results] Results section (implicit-likelihood fitting): the exponential model is obtained by fitting to the observed frequencies; the downstream claim that this meta-law can narrow symbolic regression therefore reduces to reusing the fitted decay constant on the same data, creating a circularity that undermines the asserted practical benefit.

Authors: We respectfully disagree that this constitutes a circularity. The meta-law is established through empirical analysis of the corpora. Its application to symbolic regression involves using the observed exponential distribution as a general prior for expression generation in independent discovery tasks. We have revised the text to emphasize that the benefit applies to new searches beyond the training corpora. revision: no
Referee: [Results] Results (model comparison): no quantitative comparison (e.g., likelihood ratio or cross-validation) is shown between the exponential fit and alternatives such as power-law or log-normal forms on the same count data; given the sparsity typical of operator frequencies, this leaves open whether the exponential is preferred by the data or by the fitting procedure itself.

Authors: We accept this criticism. The revised manuscript now includes quantitative model comparisons using likelihood ratio tests and cross-validation between the exponential, power-law, and log-normal models on the operator frequency data. These comparisons, presented in the Results section, demonstrate that the exponential model provides a superior fit according to the data. revision: yes

Circularity Check

0 steps flagged

No significant circularity; empirical observation reported directly from corpus analysis

full rationale

The paper collects four corpora of physics equations, counts frequencies of mathematical operators, and applies implicit-likelihood fitting to identify an exponential decay pattern, which it names a statistical meta-law. This is an observational result from data processing rather than a derivation chain in which any claimed prediction or first-principles result reduces by construction to the inputs (no self-definitional loops, no fitted parameters renamed as independent predictions, and no load-bearing self-citations or uniqueness theorems are present in the provided text). The central claim is the existence of the fitted pattern itself and is therefore self-contained as an empirical finding.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The claim depends on the empirical fit of an exponential model whose decay rate is determined from the data and on the assumption that the chosen corpora faithfully represent the space of physical equations.

free parameters (1)

exponential decay constant
Rate parameter of the exponential model fitted to operator frequency counts across the four corpora.

axioms (1)

domain assumption The four corpora constitute an unbiased sample of physical equations.
The statistical pattern is extracted from these specific collections; any systematic bias in their selection would alter the observed frequencies.

pith-pipeline@v0.9.0 · 5707 in / 1326 out tokens · 33354 ms · 2026-05-23T21:50:30.680680+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

frequency of mathematical operators follows an exponential decay law, in contrast to Zipf's power law
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

f(r)∼exp(−βr) with β∼0.3 across Feynman, Wikipedia and Inflationaris corpora

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

38 extracted references · 38 canonical work pages

[1]

Estoup, Gammes st´ enographiques: m´ ethode et exercices pour l’acquisition de la vitesse (Institut St´ enographique, 1916)

J.-B. Estoup, Gammes st´ enographiques: m´ ethode et exercices pour l’acquisition de la vitesse (Institut St´ enographique, 1916)

work page 1916
[2]

Dewey, Relative frequency of English speech sounds (Harvard University Press, 1923)

G. Dewey, Relative frequency of English speech sounds (Harvard University Press, 1923)

work page 1923
[3]

E. U. Condon, Statistics of vocabulary, Science 67, 300 (1928)

work page 1928
[4]

Kingsley Zipf, Selected studies of the principle of rel- ative frequency in language (Harvard University Press, 1932)

G. Kingsley Zipf, Selected studies of the principle of rel- ative frequency in language (Harvard University Press, 1932)

work page 1932
[5]

Newman, Power laws, Pareto distributions and Zipf’s law, Contemporary Physics 46, 323 (2005)

M. Newman, Power laws, Pareto distributions and Zipf’s law, Contemporary Physics 46, 323 (2005)

work page 2005
[6]

G. K. Zipf, The psycho-biology of language: an introduc- tion to dynamic philology. (Houghton Mifflin, 1935)

work page 1935
[7]

G. K. Zipf, Human behavior and the principle of least effort. (Addison-Wesley Press, 1949)

work page 1949
[8]

S. T. Piantadosi, H. Tily, and E. Gibson, Word lengths are optimized for efficient communication, Proceedings of the National Academy of Sciences 108, 3526 (2011)

work page 2011
[9]

H. A. Simon, On a class of skew distribution functions, Biometrika 42, 425 (1955)

work page 1955
[10]

Zanette and M

D. Zanette and M. Montemurro, Dynamics of text gener- ation with realistic Zipf’s distribution, Journal of Quan- titative Linguistics 12, 29 (2005)

work page 2005
[11]

Corominas-Murtra, R

B. Corominas-Murtra, R. Hanel, and S. Thurner, Un- derstanding scaling through history-dependent processes with collapsing sample space, Proceedings of the National Academy of Sciences 112, 5348 (2015)

work page 2015
[12]

Thurner, R

S. Thurner, R. Hanel, B. Liu, and B. Corominas-Murtra, Understanding Zipf’s law of word frequencies through sample-space collapse in sentence formation, Journal of the Royal Society Interface 12, 20150330 (2015)

work page 2015
[13]

H. A. Makse, S. Havlin, and H. E. Stanley, Modelling urban growth patterns, Nature 377, 608 (1995)

work page 1995
[14]

Gabaix, Zipf’s law for cities: An explanation, The Quarterly Journal of Economics 114, 739 (1999)

X. Gabaix, Zipf’s law for cities: An explanation, The Quarterly Journal of Economics 114, 739 (1999)

work page 1999
[15]

Breslau, P

L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker, Web caching and Zipf-like distributions: Evidence and implications, in IEEE INFOCOM’99. Conference on Computer Communications. Proceedings. Eighteenth An- nual Joint Conference of the IEEE Computer and Com- munications Societies. The Future is Now , Vol. 1 (IEEE,

work page
[16]

Blank and S

A. Blank and S. Solomon, Power laws in cities population, financial markets and internet sites (scaling in systems with a variable number of components), Physica A: Sta- tistical Mechanics and its Applications 287, 279 (2000)

work page 2000
[17]

Cristelli, M

M. Cristelli, M. Batty, and L. Pietronero, There is more than a power law in Zipf, Scientific reports 2, 812 (2012)

work page 2012
[18]

D. H. Zanette and S. C. Manrubia, Vertical transmission of culture and the distribution of family names, Phys- ica A: Statistical Mechanics and its Applications 295, 1 (2001)

work page 2001
[19]

Mora and W

T. Mora and W. Bialek, Are biological systems poised at criticality?, Journal of Statistical Physics 144, 268 (2011). 9

work page 2011
[20]

Tyrcha, Y

J. Tyrcha, Y. Roudi, M. Marsili, and J. Hertz, The ef- fect of nonstationarity on models inferred from neural data, Journal of Statistical Mechanics: Theory and Ex- periment 2013, P03005 (2013)

work page 2013
[21]

D. J. Schwab, I. Nemenman, and P. Mehta, Zipf’s law and criticality in multivariate data without fine-tuning, Physical Review Letters 113, 068102 (2014)

work page 2014
[22]

A. N. Whitehead and B. Russell, Principia Mathematica (Cambridge University Press, 1925–1927)

work page 1925
[23]

Wikipedia contributors, Lean (proof assistant) — Wikipedia, The Free Encyclopedia (2024), [Online; ac- cessed 12-August-2024]

work page 2024
[24]

R. P. Feynman, R. B. Leighton, and M. Sands,The Feyn- man Lectures on Physics (Basic Books, New York, NY,

work page
[25]

Originally published 1963-1965

work page 1963
[26]

Guimer` a, I

R. Guimer` a, I. Reichardt, A. Aguilar-Mogas, F. A. Mas- succi, M. Miranda, J. Pallar` es, and M. Sales-Pardo, A Bayesian machine scientist to aid in the solution of chal- lenging scientific problems, Science Advances 6 (2020)

work page 2020
[27]

Martin, C

J. Martin, C. Ringeval, and V. Vennin, Encyclopædia Inflationaris, Physics of the Dark Universe 5, 75 (2014), arXiv:1303.3787 [astro-ph.CO]

work page arXiv 2014
[28]

D. J. Bartlett, H. Desmond, and P. G. Ferreira, Priors for Symbolic Regression, in Proceedings of the Compan- ion Conference on Genetic and Evolutionary Computa- tion, GECCO ’23 Companion (Association for Comput- ing Machinery, New York, NY, USA, 2023) p. 2402–2411, arXiv:2304.06333 [cs.LG]

work page arXiv 2023
[29]

Sousa, D

T. Sousa, D. J. Bartlett, H. Desmond, and P. G. Fer- reira, Optimal inflationary potentials, Phys. Rev. D 109, 083524 (2024), arXiv:2310.16786 [astro-ph.CO]

work page arXiv 2024
[30]

Lample and F

G. Lample and F. Charton, Deep learning for symbolic mathematics, arXiv:1912.01412 [cs.SC]

work page arXiv 1912
[31]

Udrescu and M

S.-M. Udrescu and M. Tegmark, AI Feynman: A physics- inspired method for symbolic regression, Science Ad- vances 6 (2020), arXiv:1905.11481 [physics.comp-ph]

work page arXiv 2020
[32]

Cranmer, J

K. Cranmer, J. Brehmer, and G. Louppe, The frontier of simulation-based inference, Proceedings of the National Academy of Science 117, 30055 (2020), arXiv:1911.01429 [stat.ML]

work page arXiv 2020
[33]

Marin, P

J.-M. Marin, P. Pudlo, C. P. Robert, and R. J. Ryder, Approximate Bayesian Computational Methods, Statis- tics and Computing 22, 1167 (2012)

work page 2012
[34]

M. Ho, D. J. Bartlett, N. Chartier, C. Cuesta-Lazaro, S. Ding, A. Lapel, P. Lemos, C. C. Lovell, T. L. Makinen, C. Modi, V. Pandya, S. Pandey, L. A. Perez, B. Wan- delt, and G. L. Bryan, LtU-ILI: An All-in-One Frame- work for Implicit Inference in Astrophysics and Cosmol- ogy, The Open Journal of Astrophysics 7, 54 (2024), arXiv:2402.05137 [astro-ph.IM]

work page arXiv 2024
[35]

Lemos, A

P. Lemos, A. Coogan, Y. Hezaveh, and L. Perreault- Levasseur, Sampling-Based Accuracy Testing of Poste- rior Estimators for General Inference, 40th International Conference on Machine Learning 202, 19256 (2023), arXiv:2302.03026 [stat.ML]

work page arXiv 2023
[36]

Jeffrey and B

N. Jeffrey and B. D. Wandelt, Evidence Networks: simple losses for fast, amortized, neural Bayesian model com- parison, Machine Learning: Science and Technology 5, 015008 (2024), arXiv:2305.11241 [cs.LG]

work page arXiv 2024
[37]

Jeffreys, The Theory of Probability (OUP Oxford,

H. Jeffreys, The Theory of Probability (OUP Oxford,

work page
[38]

Biggio, T

L. Biggio, T. Bendinelli, A. Neitz, A. Lucchi, and G. Parascandolo, Neural Symbolic Regression that Scales, in Proceedings of the 38th International Con- ference on Machine Learning , Proceedings of Machine Learning Research, Vol. 139, edited by M. Meila and T. Zhang (PMLR, 2021) pp. 936–945

work page 2021

[1] [1]

Estoup, Gammes st´ enographiques: m´ ethode et exercices pour l’acquisition de la vitesse (Institut St´ enographique, 1916)

J.-B. Estoup, Gammes st´ enographiques: m´ ethode et exercices pour l’acquisition de la vitesse (Institut St´ enographique, 1916)

work page 1916

[2] [2]

Dewey, Relative frequency of English speech sounds (Harvard University Press, 1923)

G. Dewey, Relative frequency of English speech sounds (Harvard University Press, 1923)

work page 1923

[3] [3]

E. U. Condon, Statistics of vocabulary, Science 67, 300 (1928)

work page 1928

[4] [4]

Kingsley Zipf, Selected studies of the principle of rel- ative frequency in language (Harvard University Press, 1932)

G. Kingsley Zipf, Selected studies of the principle of rel- ative frequency in language (Harvard University Press, 1932)

work page 1932

[5] [5]

Newman, Power laws, Pareto distributions and Zipf’s law, Contemporary Physics 46, 323 (2005)

M. Newman, Power laws, Pareto distributions and Zipf’s law, Contemporary Physics 46, 323 (2005)

work page 2005

[6] [6]

G. K. Zipf, The psycho-biology of language: an introduc- tion to dynamic philology. (Houghton Mifflin, 1935)

work page 1935

[7] [7]

G. K. Zipf, Human behavior and the principle of least effort. (Addison-Wesley Press, 1949)

work page 1949

[8] [8]

S. T. Piantadosi, H. Tily, and E. Gibson, Word lengths are optimized for efficient communication, Proceedings of the National Academy of Sciences 108, 3526 (2011)

work page 2011

[9] [9]

H. A. Simon, On a class of skew distribution functions, Biometrika 42, 425 (1955)

work page 1955

[10] [10]

Zanette and M

D. Zanette and M. Montemurro, Dynamics of text gener- ation with realistic Zipf’s distribution, Journal of Quan- titative Linguistics 12, 29 (2005)

work page 2005

[11] [11]

Corominas-Murtra, R

B. Corominas-Murtra, R. Hanel, and S. Thurner, Un- derstanding scaling through history-dependent processes with collapsing sample space, Proceedings of the National Academy of Sciences 112, 5348 (2015)

work page 2015

[12] [12]

Thurner, R

S. Thurner, R. Hanel, B. Liu, and B. Corominas-Murtra, Understanding Zipf’s law of word frequencies through sample-space collapse in sentence formation, Journal of the Royal Society Interface 12, 20150330 (2015)

work page 2015

[13] [13]

H. A. Makse, S. Havlin, and H. E. Stanley, Modelling urban growth patterns, Nature 377, 608 (1995)

work page 1995

[14] [14]

Gabaix, Zipf’s law for cities: An explanation, The Quarterly Journal of Economics 114, 739 (1999)

X. Gabaix, Zipf’s law for cities: An explanation, The Quarterly Journal of Economics 114, 739 (1999)

work page 1999

[15] [15]

Breslau, P

L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker, Web caching and Zipf-like distributions: Evidence and implications, in IEEE INFOCOM’99. Conference on Computer Communications. Proceedings. Eighteenth An- nual Joint Conference of the IEEE Computer and Com- munications Societies. The Future is Now , Vol. 1 (IEEE,

work page

[16] [16]

Blank and S

A. Blank and S. Solomon, Power laws in cities population, financial markets and internet sites (scaling in systems with a variable number of components), Physica A: Sta- tistical Mechanics and its Applications 287, 279 (2000)

work page 2000

[17] [17]

Cristelli, M

M. Cristelli, M. Batty, and L. Pietronero, There is more than a power law in Zipf, Scientific reports 2, 812 (2012)

work page 2012

[18] [18]

D. H. Zanette and S. C. Manrubia, Vertical transmission of culture and the distribution of family names, Phys- ica A: Statistical Mechanics and its Applications 295, 1 (2001)

work page 2001

[19] [19]

Mora and W

T. Mora and W. Bialek, Are biological systems poised at criticality?, Journal of Statistical Physics 144, 268 (2011). 9

work page 2011

[20] [20]

Tyrcha, Y

J. Tyrcha, Y. Roudi, M. Marsili, and J. Hertz, The ef- fect of nonstationarity on models inferred from neural data, Journal of Statistical Mechanics: Theory and Ex- periment 2013, P03005 (2013)

work page 2013

[21] [21]

D. J. Schwab, I. Nemenman, and P. Mehta, Zipf’s law and criticality in multivariate data without fine-tuning, Physical Review Letters 113, 068102 (2014)

work page 2014

[22] [22]

A. N. Whitehead and B. Russell, Principia Mathematica (Cambridge University Press, 1925–1927)

work page 1925

[23] [23]

Wikipedia contributors, Lean (proof assistant) — Wikipedia, The Free Encyclopedia (2024), [Online; ac- cessed 12-August-2024]

work page 2024

[24] [24]

R. P. Feynman, R. B. Leighton, and M. Sands,The Feyn- man Lectures on Physics (Basic Books, New York, NY,

work page

[25] [25]

Originally published 1963-1965

work page 1963

[26] [26]

Guimer` a, I

R. Guimer` a, I. Reichardt, A. Aguilar-Mogas, F. A. Mas- succi, M. Miranda, J. Pallar` es, and M. Sales-Pardo, A Bayesian machine scientist to aid in the solution of chal- lenging scientific problems, Science Advances 6 (2020)

work page 2020

[27] [27]

Martin, C

J. Martin, C. Ringeval, and V. Vennin, Encyclopædia Inflationaris, Physics of the Dark Universe 5, 75 (2014), arXiv:1303.3787 [astro-ph.CO]

work page arXiv 2014

[28] [28]

D. J. Bartlett, H. Desmond, and P. G. Ferreira, Priors for Symbolic Regression, in Proceedings of the Compan- ion Conference on Genetic and Evolutionary Computa- tion, GECCO ’23 Companion (Association for Comput- ing Machinery, New York, NY, USA, 2023) p. 2402–2411, arXiv:2304.06333 [cs.LG]

work page arXiv 2023

[29] [29]

Sousa, D

T. Sousa, D. J. Bartlett, H. Desmond, and P. G. Fer- reira, Optimal inflationary potentials, Phys. Rev. D 109, 083524 (2024), arXiv:2310.16786 [astro-ph.CO]

work page arXiv 2024

[30] [30]

Lample and F

G. Lample and F. Charton, Deep learning for symbolic mathematics, arXiv:1912.01412 [cs.SC]

work page arXiv 1912

[31] [31]

Udrescu and M

S.-M. Udrescu and M. Tegmark, AI Feynman: A physics- inspired method for symbolic regression, Science Ad- vances 6 (2020), arXiv:1905.11481 [physics.comp-ph]

work page arXiv 2020

[32] [32]

Cranmer, J

K. Cranmer, J. Brehmer, and G. Louppe, The frontier of simulation-based inference, Proceedings of the National Academy of Science 117, 30055 (2020), arXiv:1911.01429 [stat.ML]

work page arXiv 2020

[33] [33]

Marin, P

J.-M. Marin, P. Pudlo, C. P. Robert, and R. J. Ryder, Approximate Bayesian Computational Methods, Statis- tics and Computing 22, 1167 (2012)

work page 2012

[34] [34]

M. Ho, D. J. Bartlett, N. Chartier, C. Cuesta-Lazaro, S. Ding, A. Lapel, P. Lemos, C. C. Lovell, T. L. Makinen, C. Modi, V. Pandya, S. Pandey, L. A. Perez, B. Wan- delt, and G. L. Bryan, LtU-ILI: An All-in-One Frame- work for Implicit Inference in Astrophysics and Cosmol- ogy, The Open Journal of Astrophysics 7, 54 (2024), arXiv:2402.05137 [astro-ph.IM]

work page arXiv 2024

[35] [35]

Lemos, A

P. Lemos, A. Coogan, Y. Hezaveh, and L. Perreault- Levasseur, Sampling-Based Accuracy Testing of Poste- rior Estimators for General Inference, 40th International Conference on Machine Learning 202, 19256 (2023), arXiv:2302.03026 [stat.ML]

work page arXiv 2023

[36] [36]

Jeffrey and B

N. Jeffrey and B. D. Wandelt, Evidence Networks: simple losses for fast, amortized, neural Bayesian model com- parison, Machine Learning: Science and Technology 5, 015008 (2024), arXiv:2305.11241 [cs.LG]

work page arXiv 2024

[37] [37]

Jeffreys, The Theory of Probability (OUP Oxford,

H. Jeffreys, The Theory of Probability (OUP Oxford,

work page

[38] [38]

Biggio, T

L. Biggio, T. Bendinelli, A. Neitz, A. Lucchi, and G. Parascandolo, Neural Symbolic Regression that Scales, in Proceedings of the 38th International Con- ference on Machine Learning , Proceedings of Machine Learning Research, Vol. 139, edited by M. Meila and T. Zhang (PMLR, 2021) pp. 936–945

work page 2021