Pursuit of biomarkers of brain diseases: Beyond cohort comparisons
Pith reviewed 2026-05-18 18:21 UTC · model grok-4.3
The pith
Single-modality cohort comparisons cannot identify useful biomarkers for brain diseases even with unlimited data and powerful algorithms.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Using the Brain Swap thought experiment, we show that because brain features are degenerate, comparing cohorts on single data modalities cannot identify biomarkers that are useful for diagnosis or prognosis of brain diseases, regardless of data volume or analysis methods. We propose shifting to multimodal and longitudinal data to guide patient grouping prior to biomarker definition.
What carries the argument
The Brain Swap thought experiment, which demonstrates that interchanging brains between cohorts would not alter the observable data patterns in a single modality, thereby showing that such data cannot uniquely link to disease states.
If this is right
- Clinically useful biomarkers for brain diseases will need to be multidimensional, incorporating information from multiple modalities.
- Patient grouping for biomarker studies should be driven by patterns in multimodal and longitudinal data rather than by standard diagnostic labels.
- Shifting away from single-modality cohort comparisons could improve the chances of biomarkers being adopted in clinical practice.
Where Pith is reading between the lines
- This implies that many ongoing large-scale brain data collection projects may need to expand beyond single modalities to achieve their goals.
- Machine learning approaches focused solely on one data type are likely to face fundamental limits in producing actionable clinical insights for brain disorders.
Load-bearing premise
Brain features are degenerate, so that the same feature can appear across different conditions and individuals.
What would settle it
A prospective study showing that a biomarker identified solely from single-modality cohort comparisons accurately predicts clinical outcomes in new patients would falsify the central claim.
read the original abstract
Despite the diversity and volume of brain data acquired and advanced AI-based algorithms to analyze them, brain features are rarely used in clinics for diagnosis and prognosis. Here we argue that the field continues to rely on cohort comparisons to seek biomarkers, despite the well-established degeneracy of brain features. Using a thought experiment (Brain Swap), we show that more data and more powerful algorithms will not be sufficient to identify biomarkers of brain diseases. We argue that instead of comparing patient versus healthy controls using single data type, we should use multimodal (e.g. brain activity, neurotransmitters, neuromodulators, brain imaging) and longitudinal brain data to guide the grouping before defining multidimensional biomarkers for brain diseases.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript argues that despite extensive brain data collection and advanced AI algorithms, clinically useful biomarkers for brain diseases remain elusive because the field relies on single-modality cohort comparisons. It invokes the well-established degeneracy of brain features and introduces the 'Brain Swap' thought experiment to claim that neither additional data volume nor more powerful algorithms can overcome this structural limitation. The authors conclude that researchers should instead use multimodal (e.g., activity, neurotransmitters, imaging) and longitudinal data to guide grouping of individuals before defining multidimensional biomarkers.
Significance. If the central argument is correct, the paper identifies a fundamental methodological barrier that explains the persistent failure of biomarker translation in neuroscience and psychiatry. It offers a conceptual reframing that could redirect resources away from ever-larger single-modality cohort studies toward integrated multimodal longitudinal designs. The absence of new empirical results or formal proofs means the significance is primarily philosophical and programmatic rather than immediately actionable.
major comments (2)
- [Brain Swap thought experiment] Brain Swap thought experiment (described in the main argument section): the construction asserts that any observed feature distribution remains compatible with both healthy and diseased states at the individual level due to degeneracy, yet it does not demonstrate why high-dimensional nonlinear methods (contrastive learning, causal representation learning, or subspace identification) cannot isolate disease-relevant components even under many-to-one mappings. A concrete counter-example or formal characterization of the degeneracy that rules out residual recoverable structure would be required to support the strong claim that 'more data and more powerful algorithms will not be sufficient.'
- [Discussion and recommendations] Transition from critique to proposal (final section): the recommendation to use multimodal and longitudinal data to 'guide the grouping' is presented without specifying how this avoids the same degeneracy problem once grouping is performed; if degeneracy is as pervasive as claimed, it is unclear why the same features would become informative after an initial multimodal clustering step.
minor comments (2)
- [Introduction] The abstract and introduction cite 'well-established degeneracy' without referencing specific foundational papers on degeneracy in neural systems; adding 2-3 key citations would improve grounding.
- [Throughout] The manuscript uses 'single data type' and 'single-modality' interchangeably; consistent terminology would reduce ambiguity for readers outside the immediate subfield.
Simulated Author's Rebuttal
We thank the referee for their thoughtful and constructive comments, which help sharpen the conceptual argument of our manuscript. We respond to each major comment below and indicate where revisions will be made.
read point-by-point responses
-
Referee: [Brain Swap thought experiment] Brain Swap thought experiment (described in the main argument section): the construction asserts that any observed feature distribution remains compatible with both healthy and diseased states at the individual level due to degeneracy, yet it does not demonstrate why high-dimensional nonlinear methods (contrastive learning, causal representation learning, or subspace identification) cannot isolate disease-relevant components even under many-to-one mappings. A concrete counter-example or formal characterization of the degeneracy that rules out residual recoverable structure would be required to support the strong claim that 'more data and more powerful algorithms will not be sufficient.'
Authors: The Brain Swap thought experiment is meant to show that pervasive many-to-one mappings between latent states and observed features render any finite set of brain measurements compatible with multiple clinical interpretations at the individual level. While high-dimensional nonlinear methods can recover statistical structure or clusters, they cannot resolve the underlying ambiguity without additional constraints that break the degeneracy (e.g., temporal ordering or cross-modal consistency). We will revise the relevant section to include a brief illustrative example clarifying why residual recoverable structure remains limited under the form of degeneracy emphasized in the paper. revision: yes
-
Referee: [Discussion and recommendations] Transition from critique to proposal (final section): the recommendation to use multimodal and longitudinal data to 'guide the grouping' is presented without specifying how this avoids the same degeneracy problem once grouping is performed; if degeneracy is as pervasive as claimed, it is unclear why the same features would become informative after an initial multimodal clustering step.
Authors: The proposed workflow first uses rich multimodal longitudinal data to form subgroups that share coherent trajectories across modalities; within each such subgroup the effective degeneracy is reduced because the feature-to-state mapping is constrained by the shared dynamics and cross-modal relations. Biomarkers are then defined relative to these more homogeneous groups rather than across the full heterogeneous cohort. We agree that the manuscript would benefit from a clearer exposition of this mechanism and will expand the final section accordingly. revision: yes
Circularity Check
No circularity: argument rests on external degeneracy, not self-derived quantities or citations
full rationale
The paper advances a conceptual claim via the Brain Swap thought experiment that degeneracy of brain features renders single-modality cohort comparisons incapable of producing clinically useful biomarkers irrespective of data volume or algorithmic sophistication. This rests explicitly on the 'well-established' external property of degeneracy rather than any quantities, parameters, or results defined inside the paper. No equations, fitted inputs, self-citations, or uniqueness theorems appear in the provided text; the derivation chain is therefore self-contained against external benchmarks and does not reduce to its own inputs by construction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Brain features exhibit degeneracy such that distinct underlying states can produce indistinguishable measurements.
invented entities (1)
-
Brain Swap thought experiment
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Using a thought experiment (Brain Swap), we show that more data and more powerful algorithms will not be sufficient to identify biomarkers of brain diseases... degeneracy prevents from resorting to cohort comparison
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
the same behavior can be elicited by many brain activity patterns and the same neural activity pattern... could result in different behaviors... This is called degeneracy
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Movement dis- orders: official journal of the Movement Disorder Society23(15), 2129–2170 (2008)
Goetz, C.G., Tilley, B.C., Shaftman, S.R., Stebbins, G.T., Fahn, S., Martinez- Martin, P., Poewe, W., Sampaio, C., Stern, M.B., Dodel, R.,et al.: Movement 9 disorder society-sponsored revision of the unified parkinson’s disease rating scale (mds-updrs): scale presentation and clinimetric testing results. Movement dis- orders: official journal of the Movem...
work page 2008
-
[2]
Journal of the American Geriatrics Society53(4), 695–699 (2005)
Nasreddine, Z.S., Phillips, N.A., B´ edirian, V., Charbonneau, S., Whitehead, V., Collin, I., Cummings, J.L., Chertkow, H.: The montreal cognitive assess- ment, moca: a brief screening tool for mild cognitive impairment. Journal of the American Geriatrics Society53(4), 695–699 (2005)
work page 2005
-
[3]
Scientific Data9(1), 212 (2022)
Sandstr¨ om, M., Abrams, M., Bjaalie, J.G., Hicks, M., Kennedy, D.N., Kumar, A., Poline, J.-B., Roy, P.K., Tiesinga, P., Wachtler, T.,et al.: Recommendations for repositories and scientific gateways from a neuroscience perspective. Scientific Data9(1), 212 (2022)
work page 2022
-
[4]
Neuroimage124(Pt B), 1065 (2016)
Eickhoff, S., Nichols, T.E., Van Horn, J.D., Turner, J.A.: Sharing the wealth: Neuroimaging data repositories. Neuroimage124(Pt B), 1065 (2016)
work page 2016
-
[5]
Nature Machine Intelligence6(12), 1421–1423 (2024)
Grote, T., Freiesleben, T., Berens, P.: Foundation models in healthcare require rethinking reliability. Nature Machine Intelligence6(12), 1421–1423 (2024)
work page 2024
-
[6]
Science 242(4879), 741–745 (1988)
Churchland, P.S., Sejnowski, T.J.: Perspectives on cognitive neuroscience. Science 242(4879), 741–745 (1988)
work page 1988
-
[7]
Philosophical Transactions of the Royal Society of London
Bell, A.J.: Levels and loops: the future of artificial intelligence and neuroscience. Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences354(1392), 2013–2020 (1999)
work page 2013
-
[8]
Current opinion in neurobiology41, 24–30 (2016)
Hemberger, M., Pammer, L., Laurent, G.: Comparative approaches to cortical microcircuits. Current opinion in neurobiology41, 24–30 (2016)
work page 2016
-
[9]
Journal of Consciousness Studies 1(1), 10–16 (1994)
Crick, F., Clark, J.: The astonishing hypothesis. Journal of Consciousness Studies 1(1), 10–16 (1994)
work page 1994
-
[10]
Krakauer, J.W., Ghazanfar, A.A., Gomez-Marin, A., MacIver, M.A., Poeppel, D.: Neuroscience needs behavior: correcting a reductionist bias. Neuron93(3), 480–490 (2017)
work page 2017
-
[11]
Journal of Neuroscience44(40) (2024) 10
Albantakis, L., Bernard, C., Brenner, N., Marder, E., Narayanan, R.: The brain’s best kept secret is its degenerate structure. Journal of Neuroscience44(40) (2024) 10
work page 2024
-
[12]
Science348(6234), 499– 500 (2015)
Insel, T.R., Cuthbert, B.N.: Brain disorders? precisely. Science348(6234), 499– 500 (2015)
work page 2015
-
[13]
Nature neuroscience20(3), 365–377 (2017)
Woo, C.-W., Chang, L.J., Lindquist, M.A., Wager, T.D.: Building better biomark- ers: brain models in translational neuroimaging. Nature neuroscience20(3), 365–377 (2017)
work page 2017
-
[14]
Philosophy of science 12(4), 316–321 (1945)
Rosenblueth, A., Wiener, N.: The role of models in science. Philosophy of science 12(4), 316–321 (1945)
work page 1945
-
[15]
Princeton University Press, Princeton, NJ (2025)
Rust, N.C.: Elusive Cures: Why Neuroscience Hasn’t Solved Brain Disor- ders—and How We Can Change That. Princeton University Press, Princeton, NJ (2025)
work page 2025
-
[16]
The Lancet Neurology11(11), 1006–1012 (2012)
Stern, Y.: Cognitive reserve in ageing and alzheimer’s disease. The Lancet Neurology11(11), 1006–1012 (2012)
work page 2012
-
[17]
Nature511(7510), 421–427 (2014)
Pantelis, C., Papadimitriou, G.N., Papiol, S., Parkhomenko, E., Pato, M.T., Paunio, T., Pejovic-Milovancevic, M., Perkins, D.O., Pietil¨ ainen, O.,et al.: Bio- logical insights from 108 schizophrenia-associated genetic loci. Nature511(7510), 421–427 (2014)
work page 2014
-
[18]
Health Information Science and Systems11(1), 32 (2023)
Huang, G., Li, R., Bai, Q., Alty, J.: Multimodal learning of clinically accessible tests to aid diagnosis of neurodegenerative disorders: a scoping review. Health Information Science and Systems11(1), 32 (2023)
work page 2023
-
[19]
Translational psychiatry14(1), 231 (2024)
Vieira, S., Bolton, T., Sch¨ ottner, M., Baecker, L., Marquand, A., Mechelli, A., Hagmann, P.: Multivariate brain-behaviour associations in psychiatric disorders. Translational psychiatry14(1), 231 (2024)
work page 2024
-
[20]
MIT press, Cambridge, MA (2015)
Sterling, P., Laughlin, S.: Principles of Neural Design. MIT press, Cambridge, MA (2015)
work page 2015
-
[21]
Journal of artificial intelligence research12, 149–198 (2000)
Baxter, J.: A model of inductive bias learning. Journal of artificial intelligence research12, 149–198 (2000)
work page 2000
-
[22]
Gao, R., Peterson, E.J., Voytek, B.: Inferring synaptic excitation/inhibition balance from field potentials. Neuroimage158, 70–78 (2017)
work page 2017
-
[23]
science338(6105), 344–348 (2012)
Scheffer, M., Carpenter, S.R., Lenton, T.M., Bascompte, J., Brock, W., Dakos, 11 V., Koppel, J., Leemput, I.A., Levin, S.A., Van Nes, E.H.,et al.: Anticipating critical transitions. science338(6105), 344–348 (2012)
work page 2012
-
[24]
Nature machine intelligence1(5), 206–215 (2019)
Rudin, C.: Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature machine intelligence1(5), 206–215 (2019)
work page 2019
-
[25]
Neuron107(6), 1048–1070 (2020) 12
Yang, G.R., Wang, X.-J.: Artificial neural networks for neuroscientists: a primer. Neuron107(6), 1048–1070 (2020) 12
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.