Truth, Proof, and Reproducibility: There's no counter-attack for the codeless
Pith reviewed 2026-05-24 22:55 UTC · model grok-4.3
The pith
Mathematical proofs are inherently reproducible by design, but modern computational math now requires open code to sustain that standard.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Proofs have a distinctive quality of being necessarily reproducible, and are the cornerstone of mathematical science. However, the task of the modern mathematical scientist has drifted from that of blackboard rhetorician to a scientific workflow that now more closely resembles that of an experimental scientist. The computer is an analog for Boyle's pump, another kind of scientific instrument that needs detailed descriptions of how it generates results. In place of Boyle's hand-written notes, code must be available to enable reproduction of computational experiments.
What carries the argument
The computer-as-instrument analogy to Boyle's vacuum pump, which requires explicit descriptions (now code) so that results can be reproduced.
If this is right
- Open-source code becomes a necessary component of due scientific diligence for computational mathematical inquiry.
- Reproducibility standards applied to experimental science must now be applied to computational mathematics.
- The meaning of proof itself shifts when the workflow is computational rather than purely rhetorical.
- Mathematical science requires reorientation so that its reproducibility can be readily assessed.
Where Pith is reading between the lines
- Treating code as an extension of the proof process could change how computational claims are taught and reviewed.
- Fields outside mathematics that rely on computation might adopt similar expectations for sharing the programs that generate results.
- Lakatos-style conjectural dialogue could be used more broadly to examine truth under uncertainty in any computational setting.
Load-bearing premise
The computer functions as an analog for Boyle's pump, another kind of scientific instrument that needs detailed descriptions of how it generates results.
What would settle it
A collection of modern mathematical results shown to be fully verifiable and reproducible by readers who have no access to any code or computational artifacts.
Figures
read the original abstract
Current concerns about reproducibility in many research communities can be traced back to a high value placed on empirical reproducibility of the physical details of scientific experiments and observations. For example, the detailed descriptions by 17th century scientist Robert Boyle of his vacuum pump experiments are often held to be the ideal of reproducibility as a cornerstone of scientific practice. Victoria Stodden has claimed that the computer is an analog for Boyle's pump -- another kind of scientific instrument that needs detailed descriptions of how it generates results. In the place of Boyle's hand-written notes, we now expect code in open source programming languages to be available to enable others to reproduce and extend computational experiments. In this paper we show that there is another genealogy for reproducibility, starting at least from Euclid, in the production of proofs in mathematics. Proofs have a distinctive quality of being necessarily reproducible, and are the cornerstone of mathematical science. However, the task of the modern mathematical scientist has drifted from that of blackboard rhetorician, where the craft of proof reigned, to a scientific workflow that now more closely resembles that of an experimental scientist. So, what is proof in modern mathematics? And, if proof is unattainable in other fields, what is due scientific diligence in a computational experimental environment? How do we measure truth in the context of uncertainty? Adopting a manner of Lakatosian conversant conjecture between two mathematicians, we examine how proof informs our practice of computational statistical inquiry. We propose that a reorientation of mathematical science is necessary so that its reproducibility can be readily assessed.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper traces concerns about reproducibility to empirical traditions, citing Robert Boyle's vacuum pump experiments as an ideal, and notes Victoria Stodden's analogy of the computer to such instruments. It contrasts this with a separate genealogy of reproducibility rooted in Euclidean mathematical proofs, which it describes as necessarily reproducible and foundational to mathematical science. The manuscript argues that modern mathematical practice has shifted from blackboard-based proof craft toward computational workflows resembling experimental science, and proposes a reorientation of mathematical science—via a Lakatosian style of conversant conjecture—to better assess reproducibility, truth, and due diligence in computational statistical inquiry under uncertainty.
Significance. If the interpretive genealogy and normative proposal hold, the paper could enrich discussions in the history and philosophy of mathematics by offering an alternative framing for reproducibility that emphasizes proof traditions over experimental ones, potentially informing standards for computational mathematics and statistics. As a conceptual essay without derivations, data, or formal checks, its value lies in stimulating reflection on practice rather than providing testable claims or tools.
major comments (2)
- [Abstract] Abstract: the central distinction between the 'necessarily reproducible' quality of proofs and empirical reproducibility (Boyle/Stodden) is asserted without a definition of reproducibility in the mathematical context or explicit criteria for the claimed distinction; this interpretive claim is load-bearing for the alternative genealogy and the subsequent call for reorientation.
- [Abstract] Abstract: the assertion that 'the task of the modern mathematical scientist has drifted' from blackboard rhetorician to experimental-like workflow is presented as factual grounding for the reorientation proposal, yet no specific examples, citations, or periodization of this drift are supplied in the text; without such support the normative recommendation rests on an unevaluated historical premise.
minor comments (2)
- The title phrase 'There's no counter-attack for the codeless' is not explained or connected to the argument in the abstract.
- The manuscript would benefit from explicit section headings and a clearer roadmap to distinguish the historical genealogy from the Lakatosian examination and the final proposal.
Simulated Author's Rebuttal
We thank the referee for their insightful comments and recommendation for minor revision. We address each major comment point by point below, providing clarifications where appropriate.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central distinction between the 'necessarily reproducible' quality of proofs and empirical reproducibility (Boyle/Stodden) is asserted without a definition of reproducibility in the mathematical context or explicit criteria for the claimed distinction; this interpretive claim is load-bearing for the alternative genealogy and the subsequent call for reorientation.
Authors: The manuscript does provide an implicit definition through the contrast with Boyle's experiments and the Euclidean tradition, where proofs are reproducible via logical deduction alone. However, to strengthen the abstract as suggested, we will incorporate an explicit definition of mathematical reproducibility as the independent verification of deductive steps by any reader familiar with the axioms and rules of inference. This revision will be made in the next version. revision: yes
-
Referee: [Abstract] Abstract: the assertion that 'the task of the modern mathematical scientist has drifted' from blackboard rhetorician to experimental-like workflow is presented as factual grounding for the reorientation proposal, yet no specific examples, citations, or periodization of this drift are supplied in the text; without such support the normative recommendation rests on an unevaluated historical premise.
Authors: This assertion is presented as a contemporary observation rather than a detailed historical analysis requiring periodization or specific citations. The paper's argument proceeds from this premise to discuss implications for computational inquiry using a Lakatosian approach. We maintain that the normative proposal does not rest on an unevaluated premise but on the logical consequences for assessing truth under uncertainty. No revision is planned for this point. revision: no
Circularity Check
No significant circularity identified
full rationale
The paper is a philosophical/historical essay (math.HO) with no equations, formal derivations, fitted parameters, predictions, or load-bearing self-citations. It traces ideas from Euclid and Boyle to modern practice and proposes a normative reorientation, but contains no steps that reduce a result to its own inputs by construction. All claims are interpretive and self-contained against external historical benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Proofs in mathematics are necessarily reproducible by design.
- domain assumption Modern mathematical and statistical workflows now resemble experimental science more than pure proof construction.
Reference graph
Works this paper leans on
-
[1]
Nature 567(7748), 305 (Mar 2019)
Amrhein, V., Greenland, S., McShane, B.: Scientists rise up against statistical significance. Nature 567(7748), 305 (Mar 2019). https://doi.org/10.1038/d41586- 019-00857-9, http://www.nature.com/articles/d41586-019-00857-9 , bib- tex*[rights=2019 Nature]
-
[2]
Farrar, Straus and Giroux (Mar 2001), google-Books- ID: 6AUtQVhrY90C
Auburn, D.: Proof: A Play. Farrar, Straus and Giroux (Mar 2001), google-Books- ID: 6AUtQVhrY90C
work page 2001
-
[3]
In: International Conference on Theorem Proving in Higher Order Logics
Bertot, Y.: A short presentation of Coq. In: International Conference on Theorem Proving in Higher Order Logics. pp. 12–16 (2008), bibtex*[organization=Springer]
work page 2008
-
[4]
In: Proceedings of the 16th Annual Conference on Research in Undergraduate Mathematics Education
Brown, S.: Partial unpacking and indirect proofs: A study of students productive use of the symbolic proof scheme. In: Proceedings of the 16th Annual Conference on Research in Undergraduate Mathematics Education. vol. 2, pp. 47–54 (2013)
work page 2013
-
[5]
Four transformations on the Catalan triangle
Bryan, J.: Excuse Me, Do You Have a Moment to Talk About Ver- sion Control? The American Statistician 72(1), 20–27 (2018). ht- tps://doi.org/10.1080/00031305.2017.1399928, https://doi.org/10. 1080/00031305.2017.1399928, bibtex*[publisher=Taylor & Fran- cis;eprint=https://doi.org/10.1080/00031305.2017.1399928]
work page internal anchor Pith review Pith/arXiv arXiv doi:10.1080/00031305.2017.1399928 2018
-
[6]
Science 351(6280), 1433– 1436 (Mar 2016)
Camerer, C.F., Dreber, A., Forsell, E., Ho, T.H., Huber, J., Johannesson, M., Kirchler, M., Almenberg, J., Altmejd, A., Chan, T., Heikensten, E., Holzmeister, F., Imai, T., Isaksson, S., Nave, G., Pfeiffer, T., Razen, M., Wu, H.: Evaluating replicability of laboratory experiments in economics. Science 351(6280), 1433– 1436 (Mar 2016). https://doi.org/10.11...
-
[7]
Carroll, L.: The Annotated Alice: The Definitive Edition. W. W. Norton & Company, New York, updated, subsequent edition edn. (Nov 1999), bib- tex*[editora=undefined;editoratype=collaborator] 18 Charles T. Gray and Ben Marwick
work page 1999
-
[8]
Cambridge Uni- versity Press (Apr 2002), google-Books-ID: vVVTxeuiyvQC
Davey, B.A., Priestley, H.A.: Introduction to Lattices and Order. Cambridge Uni- versity Press (Apr 2002), google-Books-ID: vVVTxeuiyvQC
work page 2002
-
[9]
Davey, B.A.: When is a Proof? La Trobe University, 2 edn. (2009)
work page 2009
-
[10]
Order 35(2), 193–214 (Jul 2018)
Davey, B.A., Gray, C.T., Pitkethly, J.G.: The Homomorphism Lat- tice Induced by a Finite Algebra. Order 35(2), 193–214 (Jul 2018). https://doi.org/10.1007/s11083-017-9426-3, https://doi.org/10.1007/ s11083-017-9426-3 , bibtex*[shortjournal=Order;rights=All rights reserved]
-
[11]
Biostatist- ics 11(3), 385–388 (Jul 2010)
Donoho, D.L.: An invitation to reproducible computational research. Biostatist- ics 11(3), 385–388 (Jul 2010). https://doi.org/10.1093/biostatistics/kxq028, https://doi.org/10.1093/biostatistics/kxq028, bibtex*[eprint=http://oup.prod.sis.lan/biostatistics/article- pdf/11/3/385/716135/kxq028.pdf]
-
[12]
Fidler, F., Wilcox, J.: Reproducibility of Scientific Results. In: Zalta, E.N. (ed.) The Stanford Encyclopedia of Philosophy. Metaphysics Research Lab, Stanford University, winter 2018 edn. (2018)
work page 2018
-
[13]
PLOS ONE 13(7), e0200303 (Jul 2018)
Fraser, H., Parker, T., Nakagawa, S., Barnett, A., Fidler, F.: Ques- tionable research practices in ecology and evolution. PLOS ONE 13(7), e0200303 (Jul 2018). https://doi.org/10.1371/journal.pone.0200303, https:// journals.plos.org/plosone/article?id=10.1371/journal.pone.0200303, bib- tex*[shortjournal=PLOS ONE]
-
[14]
Prometheus Books (Mar 2011), google-Books-ID: RhXxaPTc EYC
Haack, S.: Defending Science - within Reason: Between Scientism And Cynicism. Prometheus Books (Mar 2011), google-Books-ID: RhXxaPTc EYC
work page 2011
-
[15]
Journal of Statistical Software 59(1), 1–23 (Sep 2014)
Hadley Wickham: Tidy Data. Journal of Statistical Software 59(1), 1–23 (Sep 2014). https://doi.org/10.18637/jss.v059.i10, https://www.jstatsoft.org/ index.php/jss/article/view/v059i10, bibtex*[rights=Copyright (c) 2013 Had- ley Wickham]
-
[16]
alexpghayes.com/blog/testing-statistical-software/, bibtex*[type=Blog]
Hayes, A.: testing statistical software - aleatoric (Jul 2019), https://www. alexpghayes.com/blog/testing-statistical-software/, bibtex*[type=Blog]
work page 2019
-
[17]
PLOS Biology 13(3), e1002106 (Mar 2015)
Head, M.L., Holman, L., Lanfear, R., Kahn, A.T., Jennions, M.D.: The Ex- tent and Consequences of P-Hacking in Science. PLOS Biology 13(3), e1002106 (Mar 2015). https://doi.org/10.1371/journal.pbio.1002106, https://journals. plos.org/plosbiology/article?id=10.1371/journal.pbio.1002106
-
[18]
com/resources/webinars/covr-bringing-test-coverage-to-r/
Hester, J.: covr: Bringing test coverage to R (Jan 2016), https://www.rstudio. com/resources/webinars/covr-bringing-test-coverage-to-r/
work page 2016
-
[19]
Hester, J.: covr: Test Coverage for Packages (2018), https://CRAN.R-project. org/package=covr
work page 2018
-
[20]
Cam- bridge University Press, Cambridge, reissue edition edn
Lakatos, I.: Proofs and Refutations: The Logic of Mathematical Discovery. Cam- bridge University Press, Cambridge, reissue edition edn. (Oct 2015)
work page 2015
-
[21]
LeVeque, R.J., Mitchell, I.M., Stodden, V.: Reproducible research for scientific computing: Tools and strategies for changing the culture. Comput Sci Eng 14 (2012). https://doi.org/10.1109/mcse.2012.38
-
[22]
Marwick, B.: rrtools: Creates a reproducible research compendium (2018), https: //github.com/benmarwick/rrtools
work page 2018
-
[23]
Marwick, B., Boettiger, C., Mullen, L.: Packaging data analytical work repro- ducibly using R (and friends). Tech. Rep. e3192v2, PeerJ Inc. (Mar 2018). https://doi.org/10.7287/peerj.preprints.3192v2, https://peerj.com/preprints/ 3192
-
[24]
University of Chicago Press (Sep 1996), google-Books-ID: j94XiVDwAZEC
Merton, R.K.: On Social Structure and Science. University of Chicago Press (Sep 1996), google-Books-ID: j94XiVDwAZEC
work page 1996
-
[25]
Murray, C.: How to Accuse the Other Guy of Lying with Statistics. Statistical Science 20(3), 239–241 (2005), https://www.jstor.org/stable/20061179 Truth, Proof, and Reproducibility: There’s no counter-attack for the codeless 19
-
[26]
Nowogrodzki, A.: How to support open-source software and stay sane. Nature 571, 133 (Jul 2019). https://doi.org/10.1038/d41586-019-02046-0, http://www.nature. com/articles/d41586-019-02046-0 , bibtex*[copyright=2019 Nature]
-
[27]
Parker, H.: Opinionated analysis development. preprint (2017). ht- tps://doi.org/10.7287/peerj.preprints.3210v1
-
[28]
Science 334(6060), 1226–1227 (Dec 2011)
Peng, R.D.: Reproducible Research in Computational Science. Science 334(6060), 1226–1227 (Dec 2011). https://doi.org/10.1126/science.1213847
-
[29]
University of Chicago Press (2010)
Pickering, A.: The mangle of practice: Time, agency, and science. University of Chicago Press (2010)
work page 2010
-
[30]
Robinson, D., Hayes, A.: broom: Convert Statistical Analysis Objects into Tidy Tibbles (2019), https://CRAN.R-project.org/package=broom
work page 2019
-
[31]
Shapin, S., Schaffer, S.: Leviathan and the air-pump: Hobbes, Boyle, and the ex- perimental life (New in paper), vol. 32. Princeton University Press (2011)
work page 2011
-
[32]
edge.org/response-detail/25340.%202014
Stodden, V.: What scientific idea is ready for retirement? (2014), https://www. edge.org/response-detail/25340.%202014
work page 2014
-
[33]
Stodden, V., Borwein, J., Bailey, D.H.: ”Setting the Default to Reproducible in Computational Science Research. SIAM News 46(5) (2013)
work page 2013
-
[34]
Srensen, M.H., Urzyczyn, P.: Lectures on the Curry-Howard isomorphism, vol. 149. Elsevier (2006)
work page 2006
-
[35]
PLOS Biology 16(11), e2006930 (Nov 2018)
Wallach, J.D., Boyack, K.W., Ioannidis, J.P.A.: Reproducible research practices, transparency, and open access data in the biomedical lit- erature, 20152017. PLOS Biology 16(11), e2006930 (Nov 2018). ht- tps://doi.org/10.1371/journal.pbio.2006930, https://journals.plos.org/ plosbiology/article?id=10.1371/journal.pbio.2006930
-
[36]
Westgate, M., Barrett, M., Grames, E., Gray, C., Hamilton, W.K., Kothe, E., McGuinness, L., O’Dea, R., Sanchez-Tojar, A., Schermann, M.: metaverse: Workflows for evidence synthesis projects (2019), (link:https://github.com/ rmetaverse/metaverse)github.com/rmetaverse/met, r package version 0.0.1
work page 2019
-
[37]
O’Reilly Media (2015), https://books.google.com.au/books?id=DqSxBwAAQBAJ, bibtex*[lccn=2015472811]
Wickham, H.: R Packages: Organize, Test, Document, and Share Your Code. O’Reilly Media (2015), https://books.google.com.au/books?id=DqSxBwAAQBAJ, bibtex*[lccn=2015472811]
work page 2015
-
[38]
Wickham, H.: testthat: Get Started with Testing (2011)
work page 2011
-
[39]
Wickham, H.: tidyverse: Easily Install and Load the ’Tidyverse’ (2017), https: //CRAN.R-project.org/package=tidyverse
work page 2017
-
[40]
PLoS Biology 12(1), e1001745 (Jan 2014)
Wilson, G., Aruliah, D.A., Brown, C.T., Chue Hong, N.P., Davis, M., Guy, R.T., Haddock, S.H.D., Huff, K.D., Mitchell, I.M., Plumbley, M.D., Waugh, B., White, E.P., Wilson, P.: Best Practices for Scientific Computing. PLoS Biology 12(1), e1001745 (Jan 2014). https://doi.org/10.1371/journal.pbio.1001745, https://dx. plos.org/10.1371/journal.pbio.1001745
-
[41]
PLOS Computational Biology 13(6), e1005510 (Jun 2017)
Wilson, G., Bryan, J., Cranston, K., Kitzes, J., Nederbragt, L., Teal, T.K.: Good enough practices in scientific computing. PLOS Computational Biology 13(6), e1005510 (Jun 2017). https://doi.org/10.1371/journal.pcbi.1005510, http: //dx.plos.org/10.1371/journal.pcbi.1005510
-
[42]
R News 5(1), 39–40 (2005), https://CRAN
Zeileis, A.: CRAN Task Views. R News 5(1), 39–40 (2005), https://CRAN. R-project.org/doc/Rnews/
work page 2005
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.