pith. sign in

arxiv: 2509.10547 · v2 · submitted 2025-09-08 · 🧬 q-bio.NC · cs.AI· cs.LG

Pursuit of biomarkers of brain diseases: Beyond cohort comparisons

Pith reviewed 2026-05-18 18:21 UTC · model grok-4.3

classification 🧬 q-bio.NC cs.AIcs.LG
keywords brain biomarkersdegeneracycohort studiesmultimodal neurosciencebrain diseasesthought experiments
0
0 comments X

The pith

Single-modality cohort comparisons cannot identify useful biomarkers for brain diseases even with unlimited data and powerful algorithms.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Despite collecting enormous amounts of brain data and developing powerful AI tools, researchers have struggled to find biomarkers that doctors can use for diagnosing or predicting brain diseases. The core problem is that brain features are degenerate: the same measurement can show up in healthy people and in those with various conditions. A thought experiment involving swapping brains between groups reveals that simply gathering more data or using better algorithms on single data types like brain scans or activity recordings will not overcome this. The solution proposed is to collect multimodal data, including activity, chemistry, and imaging, over time for individuals, use that to form natural groups, and only then identify the biomarkers that distinguish those groups.

Core claim

Using the Brain Swap thought experiment, we show that because brain features are degenerate, comparing cohorts on single data modalities cannot identify biomarkers that are useful for diagnosis or prognosis of brain diseases, regardless of data volume or analysis methods. We propose shifting to multimodal and longitudinal data to guide patient grouping prior to biomarker definition.

What carries the argument

The Brain Swap thought experiment, which demonstrates that interchanging brains between cohorts would not alter the observable data patterns in a single modality, thereby showing that such data cannot uniquely link to disease states.

If this is right

  • Clinically useful biomarkers for brain diseases will need to be multidimensional, incorporating information from multiple modalities.
  • Patient grouping for biomarker studies should be driven by patterns in multimodal and longitudinal data rather than by standard diagnostic labels.
  • Shifting away from single-modality cohort comparisons could improve the chances of biomarkers being adopted in clinical practice.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • This implies that many ongoing large-scale brain data collection projects may need to expand beyond single modalities to achieve their goals.
  • Machine learning approaches focused solely on one data type are likely to face fundamental limits in producing actionable clinical insights for brain disorders.

Load-bearing premise

Brain features are degenerate, so that the same feature can appear across different conditions and individuals.

What would settle it

A prospective study showing that a biomarker identified solely from single-modality cohort comparisons accurately predicts clinical outcomes in new patients would falsify the central claim.

read the original abstract

Despite the diversity and volume of brain data acquired and advanced AI-based algorithms to analyze them, brain features are rarely used in clinics for diagnosis and prognosis. Here we argue that the field continues to rely on cohort comparisons to seek biomarkers, despite the well-established degeneracy of brain features. Using a thought experiment (Brain Swap), we show that more data and more powerful algorithms will not be sufficient to identify biomarkers of brain diseases. We argue that instead of comparing patient versus healthy controls using single data type, we should use multimodal (e.g. brain activity, neurotransmitters, neuromodulators, brain imaging) and longitudinal brain data to guide the grouping before defining multidimensional biomarkers for brain diseases.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript argues that despite extensive brain data collection and advanced AI algorithms, clinically useful biomarkers for brain diseases remain elusive because the field relies on single-modality cohort comparisons. It invokes the well-established degeneracy of brain features and introduces the 'Brain Swap' thought experiment to claim that neither additional data volume nor more powerful algorithms can overcome this structural limitation. The authors conclude that researchers should instead use multimodal (e.g., activity, neurotransmitters, imaging) and longitudinal data to guide grouping of individuals before defining multidimensional biomarkers.

Significance. If the central argument is correct, the paper identifies a fundamental methodological barrier that explains the persistent failure of biomarker translation in neuroscience and psychiatry. It offers a conceptual reframing that could redirect resources away from ever-larger single-modality cohort studies toward integrated multimodal longitudinal designs. The absence of new empirical results or formal proofs means the significance is primarily philosophical and programmatic rather than immediately actionable.

major comments (2)
  1. [Brain Swap thought experiment] Brain Swap thought experiment (described in the main argument section): the construction asserts that any observed feature distribution remains compatible with both healthy and diseased states at the individual level due to degeneracy, yet it does not demonstrate why high-dimensional nonlinear methods (contrastive learning, causal representation learning, or subspace identification) cannot isolate disease-relevant components even under many-to-one mappings. A concrete counter-example or formal characterization of the degeneracy that rules out residual recoverable structure would be required to support the strong claim that 'more data and more powerful algorithms will not be sufficient.'
  2. [Discussion and recommendations] Transition from critique to proposal (final section): the recommendation to use multimodal and longitudinal data to 'guide the grouping' is presented without specifying how this avoids the same degeneracy problem once grouping is performed; if degeneracy is as pervasive as claimed, it is unclear why the same features would become informative after an initial multimodal clustering step.
minor comments (2)
  1. [Introduction] The abstract and introduction cite 'well-established degeneracy' without referencing specific foundational papers on degeneracy in neural systems; adding 2-3 key citations would improve grounding.
  2. [Throughout] The manuscript uses 'single data type' and 'single-modality' interchangeably; consistent terminology would reduce ambiguity for readers outside the immediate subfield.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thoughtful and constructive comments, which help sharpen the conceptual argument of our manuscript. We respond to each major comment below and indicate where revisions will be made.

read point-by-point responses
  1. Referee: [Brain Swap thought experiment] Brain Swap thought experiment (described in the main argument section): the construction asserts that any observed feature distribution remains compatible with both healthy and diseased states at the individual level due to degeneracy, yet it does not demonstrate why high-dimensional nonlinear methods (contrastive learning, causal representation learning, or subspace identification) cannot isolate disease-relevant components even under many-to-one mappings. A concrete counter-example or formal characterization of the degeneracy that rules out residual recoverable structure would be required to support the strong claim that 'more data and more powerful algorithms will not be sufficient.'

    Authors: The Brain Swap thought experiment is meant to show that pervasive many-to-one mappings between latent states and observed features render any finite set of brain measurements compatible with multiple clinical interpretations at the individual level. While high-dimensional nonlinear methods can recover statistical structure or clusters, they cannot resolve the underlying ambiguity without additional constraints that break the degeneracy (e.g., temporal ordering or cross-modal consistency). We will revise the relevant section to include a brief illustrative example clarifying why residual recoverable structure remains limited under the form of degeneracy emphasized in the paper. revision: yes

  2. Referee: [Discussion and recommendations] Transition from critique to proposal (final section): the recommendation to use multimodal and longitudinal data to 'guide the grouping' is presented without specifying how this avoids the same degeneracy problem once grouping is performed; if degeneracy is as pervasive as claimed, it is unclear why the same features would become informative after an initial multimodal clustering step.

    Authors: The proposed workflow first uses rich multimodal longitudinal data to form subgroups that share coherent trajectories across modalities; within each such subgroup the effective degeneracy is reduced because the feature-to-state mapping is constrained by the shared dynamics and cross-modal relations. Biomarkers are then defined relative to these more homogeneous groups rather than across the full heterogeneous cohort. We agree that the manuscript would benefit from a clearer exposition of this mechanism and will expand the final section accordingly. revision: yes

Circularity Check

0 steps flagged

No circularity: argument rests on external degeneracy, not self-derived quantities or citations

full rationale

The paper advances a conceptual claim via the Brain Swap thought experiment that degeneracy of brain features renders single-modality cohort comparisons incapable of producing clinically useful biomarkers irrespective of data volume or algorithmic sophistication. This rests explicitly on the 'well-established' external property of degeneracy rather than any quantities, parameters, or results defined inside the paper. No equations, fitted inputs, self-citations, or uniqueness theorems appear in the provided text; the derivation chain is therefore self-contained against external benchmarks and does not reduce to its own inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim rests on the domain assumption that brain-feature degeneracy is both real and decisive for biomarker validity, plus the introduction of a new conceptual device (Brain Swap) whose independent falsifiability is not demonstrated.

axioms (1)
  • domain assumption Brain features exhibit degeneracy such that distinct underlying states can produce indistinguishable measurements.
    Invoked explicitly in the abstract as a well-established property that undermines cohort comparisons.
invented entities (1)
  • Brain Swap thought experiment no independent evidence
    purpose: To demonstrate that additional data volume or algorithmic power cannot overcome degeneracy in biomarker discovery.
    Introduced as a hypothetical scenario whose details are not supplied in the abstract.

pith-pipeline@v0.9.0 · 5637 in / 1327 out tokens · 43919 ms · 2026-05-18T18:21:48.715072+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

25 extracted references · 25 canonical work pages

  1. [1]

    Movement dis- orders: official journal of the Movement Disorder Society23(15), 2129–2170 (2008)

    Goetz, C.G., Tilley, B.C., Shaftman, S.R., Stebbins, G.T., Fahn, S., Martinez- Martin, P., Poewe, W., Sampaio, C., Stern, M.B., Dodel, R.,et al.: Movement 9 disorder society-sponsored revision of the unified parkinson’s disease rating scale (mds-updrs): scale presentation and clinimetric testing results. Movement dis- orders: official journal of the Movem...

  2. [2]

    Journal of the American Geriatrics Society53(4), 695–699 (2005)

    Nasreddine, Z.S., Phillips, N.A., B´ edirian, V., Charbonneau, S., Whitehead, V., Collin, I., Cummings, J.L., Chertkow, H.: The montreal cognitive assess- ment, moca: a brief screening tool for mild cognitive impairment. Journal of the American Geriatrics Society53(4), 695–699 (2005)

  3. [3]

    Scientific Data9(1), 212 (2022)

    Sandstr¨ om, M., Abrams, M., Bjaalie, J.G., Hicks, M., Kennedy, D.N., Kumar, A., Poline, J.-B., Roy, P.K., Tiesinga, P., Wachtler, T.,et al.: Recommendations for repositories and scientific gateways from a neuroscience perspective. Scientific Data9(1), 212 (2022)

  4. [4]

    Neuroimage124(Pt B), 1065 (2016)

    Eickhoff, S., Nichols, T.E., Van Horn, J.D., Turner, J.A.: Sharing the wealth: Neuroimaging data repositories. Neuroimage124(Pt B), 1065 (2016)

  5. [5]

    Nature Machine Intelligence6(12), 1421–1423 (2024)

    Grote, T., Freiesleben, T., Berens, P.: Foundation models in healthcare require rethinking reliability. Nature Machine Intelligence6(12), 1421–1423 (2024)

  6. [6]

    Science 242(4879), 741–745 (1988)

    Churchland, P.S., Sejnowski, T.J.: Perspectives on cognitive neuroscience. Science 242(4879), 741–745 (1988)

  7. [7]

    Philosophical Transactions of the Royal Society of London

    Bell, A.J.: Levels and loops: the future of artificial intelligence and neuroscience. Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences354(1392), 2013–2020 (1999)

  8. [8]

    Current opinion in neurobiology41, 24–30 (2016)

    Hemberger, M., Pammer, L., Laurent, G.: Comparative approaches to cortical microcircuits. Current opinion in neurobiology41, 24–30 (2016)

  9. [9]

    Journal of Consciousness Studies 1(1), 10–16 (1994)

    Crick, F., Clark, J.: The astonishing hypothesis. Journal of Consciousness Studies 1(1), 10–16 (1994)

  10. [10]

    Neuron93(3), 480–490 (2017)

    Krakauer, J.W., Ghazanfar, A.A., Gomez-Marin, A., MacIver, M.A., Poeppel, D.: Neuroscience needs behavior: correcting a reductionist bias. Neuron93(3), 480–490 (2017)

  11. [11]

    Journal of Neuroscience44(40) (2024) 10

    Albantakis, L., Bernard, C., Brenner, N., Marder, E., Narayanan, R.: The brain’s best kept secret is its degenerate structure. Journal of Neuroscience44(40) (2024) 10

  12. [12]

    Science348(6234), 499– 500 (2015)

    Insel, T.R., Cuthbert, B.N.: Brain disorders? precisely. Science348(6234), 499– 500 (2015)

  13. [13]

    Nature neuroscience20(3), 365–377 (2017)

    Woo, C.-W., Chang, L.J., Lindquist, M.A., Wager, T.D.: Building better biomark- ers: brain models in translational neuroimaging. Nature neuroscience20(3), 365–377 (2017)

  14. [14]

    Philosophy of science 12(4), 316–321 (1945)

    Rosenblueth, A., Wiener, N.: The role of models in science. Philosophy of science 12(4), 316–321 (1945)

  15. [15]

    Princeton University Press, Princeton, NJ (2025)

    Rust, N.C.: Elusive Cures: Why Neuroscience Hasn’t Solved Brain Disor- ders—and How We Can Change That. Princeton University Press, Princeton, NJ (2025)

  16. [16]

    The Lancet Neurology11(11), 1006–1012 (2012)

    Stern, Y.: Cognitive reserve in ageing and alzheimer’s disease. The Lancet Neurology11(11), 1006–1012 (2012)

  17. [17]

    Nature511(7510), 421–427 (2014)

    Pantelis, C., Papadimitriou, G.N., Papiol, S., Parkhomenko, E., Pato, M.T., Paunio, T., Pejovic-Milovancevic, M., Perkins, D.O., Pietil¨ ainen, O.,et al.: Bio- logical insights from 108 schizophrenia-associated genetic loci. Nature511(7510), 421–427 (2014)

  18. [18]

    Health Information Science and Systems11(1), 32 (2023)

    Huang, G., Li, R., Bai, Q., Alty, J.: Multimodal learning of clinically accessible tests to aid diagnosis of neurodegenerative disorders: a scoping review. Health Information Science and Systems11(1), 32 (2023)

  19. [19]

    Translational psychiatry14(1), 231 (2024)

    Vieira, S., Bolton, T., Sch¨ ottner, M., Baecker, L., Marquand, A., Mechelli, A., Hagmann, P.: Multivariate brain-behaviour associations in psychiatric disorders. Translational psychiatry14(1), 231 (2024)

  20. [20]

    MIT press, Cambridge, MA (2015)

    Sterling, P., Laughlin, S.: Principles of Neural Design. MIT press, Cambridge, MA (2015)

  21. [21]

    Journal of artificial intelligence research12, 149–198 (2000)

    Baxter, J.: A model of inductive bias learning. Journal of artificial intelligence research12, 149–198 (2000)

  22. [22]

    Neuroimage158, 70–78 (2017)

    Gao, R., Peterson, E.J., Voytek, B.: Inferring synaptic excitation/inhibition balance from field potentials. Neuroimage158, 70–78 (2017)

  23. [23]

    science338(6105), 344–348 (2012)

    Scheffer, M., Carpenter, S.R., Lenton, T.M., Bascompte, J., Brock, W., Dakos, 11 V., Koppel, J., Leemput, I.A., Levin, S.A., Van Nes, E.H.,et al.: Anticipating critical transitions. science338(6105), 344–348 (2012)

  24. [24]

    Nature machine intelligence1(5), 206–215 (2019)

    Rudin, C.: Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature machine intelligence1(5), 206–215 (2019)

  25. [25]

    Neuron107(6), 1048–1070 (2020) 12

    Yang, G.R., Wang, X.-J.: Artificial neural networks for neuroscientists: a primer. Neuron107(6), 1048–1070 (2020) 12