Hierarchical Bayesian inference for community detection and connectivity of functional brain networks
Pith reviewed 2026-05-24 09:45 UTC · model grok-4.3
The pith
A Bayesian latent block model detects community structures in functional brain networks at both individual and group levels while preserving subject variability.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that the proposed hierarchical Bayesian method based on the latent block model can robustly detect the community structure of weighted functional networks with an unknown number of communities at both individual and group levels, retaining individual network variability, and is more accurate than commonly used modularity models as shown in synthetic and real fMRI data analyses.
What carries the argument
Bayesian latent block model (LBM) applied hierarchically for multilayer community detection in weighted networks.
If this is right
- The method handles unknown number of communities without prior specification.
- Individual subject variability is preserved in the estimates.
- Consistency with predefined communities in simulated data based on the generative model.
- Improved split-half reproducibility in task fMRI data from 100 subjects.
Where Pith is reading between the lines
- This approach could enable better tracking of cognitive and behavioral changes through more reliable network estimates across subjects.
- The generative model offers a way to benchmark other community detection algorithms for fMRI signals.
- Direct inclusion of connectivity parameters in the model might link detected communities to integration properties of brain networks.
Load-bearing premise
The community structure-based multivariate Gaussian generative model accurately represents the statistical properties of real fMRI signals.
What would settle it
If applying the method to synthetic data generated from the model fails to recover the exact predefined community memberships, or if split-half reproducibility on real data is not superior to modularity methods.
Figures
read the original abstract
Most functional magnetic resonance imaging studies rely on estimates of hierarchically organized functional brain networks whose segregation and integration reflect the cognitive and behavioral changes in humans. However, most existing methods for estimating the community structure of networks from both individual and group-level analysis methods do not account for the variability between subjects. In this paper, we develop a new multilayer community detection method based on Bayesian latent block model (LBM). The method can robustly detect the community structure of weighted functional networks with an unknown number of communities at both individual and group levels and retain the variability of the individual networks. For validation, we propose a new community structure-based multivariate Gaussian generative model to simulate synthetic signal. Our simulation study shows that the community memberships estimated by hierarchical Bayesian inference are consistent with the predefined node labels in the generative model. The method is also tested via split-half reproducibility using working memory task fMRI data of 100 unrelated healthy subjects from the Human Connectome Project. Analyses using both synthetic and real data show that our proposed method is more accurate and reliable compared with the commonly used (multilayer) modularity models.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces a hierarchical Bayesian latent block model (LBM) for multilayer community detection in weighted functional brain networks from fMRI data. The method detects community structure at individual and group levels with unknown numbers of communities while retaining inter-subject variability. Validation uses a newly proposed community structure-based multivariate Gaussian generative model showing consistency of estimated memberships with predefined labels, plus split-half reproducibility on HCP working memory task data from 100 subjects; the authors claim superior accuracy and reliability versus (multilayer) modularity models.
Significance. If validated independently, the approach could improve estimation of hierarchically organized functional networks by explicitly modeling subject variability and unknown community counts via Bayesian inference. The split-half test on real HCP data supplies external grounding for reliability, and the method's ability to operate on weighted networks is a potential strength over standard modularity.
major comments (3)
- [Abstract] Abstract: the central accuracy claim rests on synthetic experiments generated from the authors' own community structure-based multivariate Gaussian model, which explicitly encodes the block/community structure that the LBM is designed to recover; this creates circularity and does not constitute an independent stress test against modularity baselines that optimize a different objective.
- [Abstract] Abstract: the simulation reports only qualitative consistency with predefined labels and provides no quantitative metrics (accuracy, error rates, or statistical comparisons) against the modularity baselines, preventing assessment of the claimed superiority.
- [Abstract] Abstract / generative model description: the manuscript does not report validation of the proposed multivariate Gaussian generative model against empirical fMRI statistics (heavy tails, temporal autocorrelation, non-Gaussianity), which is required before using it to benchmark detection performance.
minor comments (1)
- The split-half reproducibility test on HCP data is a positive external check, but the manuscript should clarify how community labels are aligned across halves and subjects for quantitative comparison.
Simulated Author's Rebuttal
We thank the referee for the constructive comments on our manuscript. We address each major comment point by point below, indicating where revisions will be made.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central accuracy claim rests on synthetic experiments generated from the authors' own community structure-based multivariate Gaussian model, which explicitly encodes the block/community structure that the LBM is designed to recover; this creates circularity and does not constitute an independent stress test against modularity baselines that optimize a different objective.
Authors: The synthetic experiments are designed to test recovery of known structure under the model's assumptions, which is a standard validation approach for inference methods. The primary comparison to modularity relies on the independent split-half reproducibility analysis of real HCP data. We will revise the abstract and discussion to clarify the distinct roles of the two evaluations and reduce emphasis on synthetic accuracy claims relative to modularity. revision: partial
-
Referee: [Abstract] Abstract: the simulation reports only qualitative consistency with predefined labels and provides no quantitative metrics (accuracy, error rates, or statistical comparisons) against the modularity baselines, preventing assessment of the claimed superiority.
Authors: We agree that quantitative metrics are needed to support the superiority claim on synthetic data. In the revision we will add accuracy, adjusted Rand index, and statistical comparisons against the modularity baselines. revision: yes
-
Referee: [Abstract] Abstract / generative model description: the manuscript does not report validation of the proposed multivariate Gaussian generative model against empirical fMRI statistics (heavy tails, temporal autocorrelation, non-Gaussianity), which is required before using it to benchmark detection performance.
Authors: The generative model serves as a controlled simulation tool rather than a full statistical model of fMRI time series. We will add an explicit limitations paragraph discussing its simplifying assumptions, but a comprehensive match to all empirical fMRI properties lies outside the scope of the current work focused on the detection method. revision: partial
Circularity Check
Synthetic validation circular: data generated from author-proposed model whose block structure matches LBM by construction
specific steps
-
self definitional
[Abstract]
"For validation, we propose a new community structure-based multivariate Gaussian generative model to simulate synthetic signal. Our simulation study shows that the community memberships estimated by hierarchical Bayesian inference are consistent with the predefined node labels in the generative model."
The generative model is defined to contain the exact community structure that the hierarchical Bayesian LBM is designed to recover; therefore the reported consistency is true by construction of the simulation rather than an independent verification of the method's superiority.
full rationale
The paper's accuracy claim against modularity baselines rests primarily on synthetic recovery experiments. The authors introduce both the LBM inference method and a 'community structure-based multivariate Gaussian generative model' whose node labels are then recovered by the LBM. Because the generative process explicitly encodes the same community/block structure the LBM is built to infer, successful recovery is guaranteed under the model's own assumptions and does not constitute an independent test. The real-data split-half test addresses only reproducibility, not accuracy. No external validation of the generative model's match to empirical fMRI statistics is provided in the abstract or described validation chain.
Axiom & Free-Parameter Ledger
free parameters (1)
- number of communities
axioms (1)
- domain assumption Functional brain networks can be represented as weighted graphs whose community structure follows the assumptions of a latent block model.
invented entities (1)
-
community structure-based multivariate Gaussian generative model
no independent evidence
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.