High-dimensional Many-to-many-to-many Mediation Analysis

Binh T. Nguyen; Cong Khanh Truong; Duy-Cat Can; Oliver Y. Ch\'en; Tien Dat Nguyen; Trung Khang Tran

arxiv: 2604.02886 · v1 · submitted 2026-04-03 · 📊 stat.ME · q-bio.GN· q-bio.QM· stat.AP· stat.ML

High-dimensional Many-to-many-to-many Mediation Analysis

Tien Dat Nguyen , Trung Khang Tran , Cong Khanh Truong , Duy-Cat Can , Binh T. Nguyen , Oliver Y. Ch\'en This is my paper

Pith reviewed 2026-05-13 18:42 UTC · model grok-4.3

classification 📊 stat.ME q-bio.GNq-bio.QMstat.APstat.ML

keywords mediation analysishigh-dimensional dataindirect effectsvariable selectionAlzheimer's diseaseneuroimaginggenetic pathwaysasymptotic normality

0 comments

The pith

A mediation framework recovers consistent indirect-effect matrices when exposures, mediators, and outcomes are all high-dimensional and multivariate.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper formalizes many-to-many-to-many mediation analysis for settings in which hundreds of exposures act on hundreds of mediators that in turn affect multiple outcomes. It supplies estimators that perform simultaneous variable selection on the exposure and mediator sides, computes the full indirect-effect matrix, and supplies asymptotic theory showing consistency and element-wise normality together with explicit error bounds. Finite-sample behavior is checked in simulations, and the method is applied to Alzheimer's data linking 688 SNPs through 202 cortical-thickness measures to eleven cognitive and diagnostic scores. The framework identifies interpretable genetic-neural-cognitive pathways and yields better out-of-sample prediction than competing approaches.

Core claim

MMM mediation analysis estimates the indirect-effect matrix formed by the product of the high-dimensional exposure-to-mediator and mediator-to-outcome coefficient matrices, proves that the resulting estimators are consistent and element-wise asymptotically normal, and derives finite-sample error bounds while simultaneously performing variable selection on both the exposure and mediator sets.

What carries the argument

The indirect-effect matrix, obtained as the product of two sparse coefficient matrices estimated jointly under high-dimensional regularization.

If this is right

The estimated indirect-effect matrix supplies a direct ranking of which genetic variants influence which cognitive outcomes through which brain regions.
Variable selection on both exposures and mediators reduces the number of pathways that must be interpreted biologically.
The same fitted model can be used for out-of-sample prediction of the multivariate outcomes.
Error bounds on the indirect-effect matrix give a quantitative guarantee on how much estimation error is expected in finite samples.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same machinery could be applied to other layered high-dimensional systems such as multi-omics or environmental exposures to physiological and behavioral measures.
If the linearity assumption is relaxed, the current consistency guarantees would no longer hold, suggesting a natural next theoretical target.
The improvement in out-of-sample prediction implies that the selected pathways capture signal that simpler single-layer models miss.

Load-bearing premise

The structural equations relating exposures, mediators, and outcomes are linear, and the coefficient matrices are sufficiently sparse for high-dimensional inference to be valid.

What would settle it

A simulation study in which the true indirect-effect matrix is known but the estimator fails to converge in probability or to exhibit the claimed asymptotic normality as sample size grows.

Figures

Figures reproduced from arXiv: 2604.02886 by Binh T. Nguyen, Cong Khanh Truong, Duy-Cat Can, Oliver Y. Ch\'en, Tien Dat Nguyen, Trung Khang Tran.

**Figure 1.** Figure 1: An overview of the many–to–many–to–many (MMM) mediation analysis framework. (a) Schematic representation of different types of mediation analysis. An illustration of how classical univariate- and multivariate mediation analysis models extend to the many–to–many–to–many (MMM) setting, where multivariate exposures x, mediators m, and outcomes y interact through multiple indirect pathways. (b) The MMM model. … view at source ↗

**Figure 2.** Figure 2: Simulation results for the MMM mediation framework. (a) Heatmaps comparing the ground-truth coefficient matrices α0, β0, and the indirect-effect matrix α0β0 with their corresponding estimates αb, βb, and αbβb under a representative simulation setting. (b) Stability of αβ across combinations of sample size and noise level, and Type I error rates under null mediation paths. (c) Estimation error of (α, β, αβ)… view at source ↗

**Figure 3.** Figure 3: Application of many-to-many-to-many mediation analysis to Alzheimer’s disease. (a) A schematic overview of the analysis structure, showing 688 genome-wide significant SNPs (x), 202 cortical-thickness mediators (m), and 11 cognitive-behavior and diagnostic outcomes (y), together with covariates. (b) Heatmap of the estimated α effects, highlighting structured genetic–brain map. (c) Cortical surface visualiza… view at source ↗

read the original abstract

We study high-dimensional mediation analysis in which exposures, mediators, and outcomes are all multivariate, and both exposures and mediators may be high-dimensional. We formalize this as a many (exposures)-to-many (mediators)-to-many (outcomes) (MMM) mediation analysis problem. Methodologically, MMM mediation analysis simultaneously performs variable selection for high-dimensional exposures and mediators, estimates the indirect effect matrix (i.e., the coefficient matrices linking exposure-to-mediator and mediator-to-outcome pathways), and enables prediction of multivariate outcomes. Theoretically, we show that the estimated indirect effect matrices are consistent and element-wise asymptotically normal, and we derive error bounds for the estimators. To evaluate the efficacy of the MMM mediation framework, we first investigate its finite-sample performance, including convergence properties, the behavior of the asymptotic approximations, and robustness to noise, via simulation studies. We then apply MMM mediation analysis to data from the Alzheimer's Disease Neuroimaging Initiative to study how cortical thickness of 202 brain regions may mediate the effects of 688 genome-wide significant single nucleotide polymorphisms (SNPs) (selected from approximately 1.5 million SNPs) on eleven cognitive-behavioral and diagnostic outcomes. The MMM mediation framework identifies biologically interpretable, many-to-many-to-many genetic-neural-cognitive pathways and improves downstream out-of-sample classification and prediction performance. Taken together, our results demonstrate the potential of MMM mediation analysis and highlight the value of statistical methodology for investigating complex, high-dimensional multi-layer pathways in science. The MMM package is available at https://github.com/THELabTop/MMM-Mediation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper introduces a high-dimensional many-to-many-to-many (MMM) mediation analysis framework for multivariate exposures, mediators, and outcomes. It develops estimators for indirect effect matrices that perform simultaneous variable selection, derives consistency, element-wise asymptotic normality, and error bounds under linearity and sparsity assumptions, validates finite-sample behavior via simulations, and applies the method to ADNI data (688 SNPs, 202 brain regions, 11 outcomes) to identify genetic-neural-cognitive pathways and improve out-of-sample prediction.

Significance. If the theoretical results hold under the stated conditions and the sparsity levels in the ADNI data satisfy the required rates, the framework would offer a principled approach to high-dimensional multi-layer mediation, with potential to advance pathway analysis in genomics and neuroimaging while providing practical gains in multivariate prediction.

major comments (3)

[Theoretical Results] Theoretical Results section: The consistency, element-wise asymptotic normality, and error bounds for the indirect effect matrices are derived under structural-equation linearity plus row/column sparsity on the exposure-mediator and mediator-outcome coefficient matrices, yet the ADNI application (688 SNPs, 202 regions) reports selected pathways without verifying realized sparsity levels or checking whether they obey the dimension-dependent rates needed for the guarantees to transfer.
[Application] Application section: The claim of improved downstream out-of-sample classification and prediction performance lacks any baseline comparisons, effect sizes, or error bars, so it is impossible to quantify the practical advantage of the MMM estimators over existing high-dimensional mediation or direct prediction methods.
[Simulation Studies] Simulation Studies section: The finite-sample investigation of convergence properties, asymptotic approximations, and robustness is described only at a high level; without quantitative metrics (e.g., empirical coverage rates or error decay as a function of dimension) it is difficult to confirm that the reported behavior aligns with the theorem rates.

minor comments (2)

[Abstract] The abstract states that the MMM package is available at a GitHub link, but the manuscript does not include a reproducibility statement or data availability details for the ADNI analysis.
[Methods] Notation for the indirect effect matrix (product of the two coefficient matrices) should be introduced with an explicit equation early in the Methods section to avoid ambiguity when discussing element-wise normality.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed comments. We address each major point below and will revise the manuscript accordingly to strengthen the presentation of the theoretical results, simulations, and application.

read point-by-point responses

Referee: [Theoretical Results] Theoretical Results section: The consistency, element-wise asymptotic normality, and error bounds for the indirect effect matrices are derived under structural-equation linearity plus row/column sparsity on the exposure-mediator and mediator-outcome coefficient matrices, yet the ADNI application (688 SNPs, 202 regions) reports selected pathways without verifying realized sparsity levels or checking whether they obey the dimension-dependent rates needed for the guarantees to transfer.

Authors: We thank the referee for this observation. The theoretical results are asymptotic and rely on sparsity rates that are satisfied by the selected models in practice. In the revised manuscript, we will add a supplementary table reporting the number of non-zero coefficients in the estimated exposure-mediator and mediator-outcome matrices for the ADNI analysis, along with a short discussion confirming that the realized sparsity levels (effective support sizes well below the dimension-dependent thresholds) are consistent with the conditions required for the consistency and normality guarantees to hold at the given dimensions. revision: yes
Referee: [Application] Application section: The claim of improved downstream out-of-sample classification and prediction performance lacks any baseline comparisons, effect sizes, or error bars, so it is impossible to quantify the practical advantage of the MMM estimators over existing high-dimensional mediation or direct prediction methods.

Authors: We agree that quantitative comparisons are needed to substantiate the practical gains. In the revision, we will expand the Application section to include direct comparisons against existing high-dimensional mediation approaches (e.g., sparse mediation via lasso) and standard multivariate prediction methods (e.g., ridge and lasso regression). We will report effect sizes such as differences in AUC and MSE, together with standard errors obtained from repeated cross-validation folds, to allow readers to assess the magnitude of improvement. revision: yes
Referee: [Simulation Studies] Simulation Studies section: The finite-sample investigation of convergence properties, asymptotic approximations, and robustness is described only at a high level; without quantitative metrics (e.g., empirical coverage rates or error decay as a function of dimension) it is difficult to confirm that the reported behavior aligns with the theorem rates.

Authors: We acknowledge that more granular quantitative results would better demonstrate alignment with theory. We will revise the Simulation Studies section to include explicit metrics: empirical coverage probabilities for the element-wise asymptotic normality, tables and plots of estimation error decay rates as functions of dimension (p and q), and robustness statistics under varying noise levels, with direct reference to the rates established in the theorems. revision: yes

Circularity Check

0 steps flagged

No circularity: consistency and normality results rest on standard high-dimensional regularity conditions independent of the fitted estimators.

full rationale

The paper defines new estimators for the indirect-effect matrices in the MMM framework and states consistency, element-wise asymptotic normality, and error bounds. These claims are presented as following from structural-equation linearity, row/column sparsity, restricted-eigenvalue conditions, and sub-Gaussian tails—standard assumptions in high-dimensional inference that are not defined in terms of the target quantities themselves. No equation reduces a derived result to a previously fitted parameter by construction, and no load-bearing step relies on a self-citation whose content is itself unverified or tautological. The ADNI application reports selected pathways but does not alter the logical status of the general theorems. The derivation chain therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract-only review yields limited visibility into specific parameters or assumptions; the method is inferred to rely on standard high-dimensional regularization and sparsity assumptions typical of variable-selection procedures in mediation models.

axioms (1)

domain assumption Sparsity and regularity conditions on the high-dimensional coefficient matrices sufficient for consistent selection and asymptotic normality
Required for the claimed consistency and element-wise asymptotic normality of the indirect-effect estimators in high dimensions.

pith-pipeline@v0.9.0 · 5622 in / 1358 out tokens · 69583 ms · 2026-05-13T18:42:17.434756+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

Foundation.RealityFromDistinction reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

multivariate linear structural equation model (LSEM): mi = α⊤xi + ζ⊤zi + ϵi, yi = β⊤mi + γ⊤xi + η⊤zi + ξi

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

9 extracted references · 9 canonical work pages

[1]

Baron, R. M. & Kenny, D. A. (1986), ‘The moderator–mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations’,Journal of Personality and Social Psychology51(6), 1173–1182. Bellenguez, C., Küçükali, F., Jansen, I. E., Kleineidam, L., Moreno-Grau, S., Amin, N., Naj, A. C., Campos-Martin, R., Grenie...

work page 1986
[2]

& Pan, W.-C

Huang, Y.-T. & Pan, W.-C. (2015), ‘Hypothesis Test of Mediation Effect in Causal Mediation Model With High-Dimensional Continuous Mediators’,Biometrics72(2), 402–413. Imai, K., Keele, L. & Yamamoto, T. (2010), ‘Identification, Inference and Sensitivity Analysis for Causal Mediation Effects’,Statistical Science25(1), 51–71. 40 Jia, J. & Yu, B. (2010), ‘On ...

work page 2015
[3]

Katsumi, Y., Quimby, M., Hochberg, D., Jones, A., Brickhouse, M., Eldaief, M.C., Dickerson, B. C. & Touroutoglou, A. (2023), ‘Association of Regional Cortical Network Atrophy With Progression to Dementia in Patients With Primary Progressive Aphasia’,Neurology 100(3), e286–e296. Kunkle, B. W., Grenier-Boley, B., Sims, R., Bis, J. C., Damotte, V., Naj, A. C...

work page 2023
[4]

G., Dikmen, M., Hanoğlu, L

Mızrak, H. G., Dikmen, M., Hanoğlu, L. & Şakul, B. U. (2024), ‘Investigation of hemispheric asymmetry in Alzheimer’s disease patients during resting state revealed BY fNIRS’, Scientific Reports14(1), 13454. Mohan, A., Roberto, A. J., Mohan, A., Lorenzo, A., Jones, K., Carney, M. J., Liogier- Weyback, L., Hwang, S. & Lapidus, K. A. (2016), ‘The Significanc...

work page 2024
[5]

(2015a),Explanation in Causal Inference: Methods for Mediation and Interaction, Oxford University Press

VanderWeele, T. (2015a),Explanation in Causal Inference: Methods for Mediation and Interaction, Oxford University Press. VanderWeele, T. J. (2015b), ‘Mediation Analysis: A Practitioner’s Guide’,Annual Review Public Health37(1), 17–32. VanderWeele, T. & Vansteelandt, S. (2014), ‘Mediation Analysis with Multiple Mediators’, Epidemiologic Methods2(1), 95–115...

work page 2014
[6]

High-dimensional Many–to–many–to–many Mediation Analysis

45 Supplementary Materials to High-dimensional Many–to–many–to–many Mediation Analysis This document contains the Supplementary Materials to the paper “High-dimensional Many–to–many–to–many Mediation Analysis”. Appendix A.1 provides the proofs of all theorems and lemmas related to the consistency of the proposed method. Appendix A.2 presents the proofs of...

work page 2010
[7]

Proof of Theorem 3.6.We adapt the proof of Theorem 1 in Jia & Yu (2010)

We now start to prove Theorem 3.6. Proof of Theorem 3.6.We adapt the proof of Theorem 1 in Jia & Yu (2010). Analysis ofM(V) We haveVm is Gaussian random variable with mean: µVm =E(V m) = ( n∑ i=1 (mi)mm⊤ i,(1) )( n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1)+λY,2I )−1( λY,1 − →b+ 2λY,2βk,(1) ) . The EIC implies that: ⏐⏐⏐⏐ 1 n n∑ i=1 mi,(2)m⊤ i,(1) ( 1 n n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1)+ λY...

work page 2010
[8]

By applying Markov’s inequality and (A4), we obtain: P   max m∈Dc ⏐⏐⏐˜Vm ⏐⏐⏐ λY,1 >Ψ  ≤ E ( max m∈Dc |˜Vm| ) λY,1 Ψ ≤ 8 √ log(p−d) λY,1 Ψ max m∈Dc √ E (˜V 2 m ) . Now, by straightforward computation, sinceE(˜Vm) = 0, one hasE (˜V 2 m ) =Var (˜Vm ) , so: E (˜V 2 m ) = 4 n∑ i=1 (( mi ) m )2[ 1−m⊤ i,(1) ( n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1)+λY,2I )−1 mi,(1) ]2 Var (( ...

work page 2009
[9]

=E [⏐⏐⏐ ( ξ1 ) k ⏐⏐⏐ 2+ϖ]( max 1≤i≤n R2 i ) ϖ 2 . Moreover, for all1≤i≤n, sincev⊤v= 1, R2 i≤  ( n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1) )−1/2 mi,(1)  2 2 = 1 n  ( 1 n n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1) )−1/2 mi,(1)  2 2 ≤1 n ( Λ min (1 n n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1) ))−1 d m  2 ∞ . Consequently, 1 (√ Var(J3) )2+ϖ n∑ i=1 E [⏐⏐⏐ ( ξi ) k ⏐⏐⏐ 2+ϖ]⏐⏐⏐Ri ⏐⏐⏐ 2+ϖ ≤E [⏐⏐⏐ ( ξ1 ...

work page 2009

[1] [1]

Baron, R. M. & Kenny, D. A. (1986), ‘The moderator–mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations’,Journal of Personality and Social Psychology51(6), 1173–1182. Bellenguez, C., Küçükali, F., Jansen, I. E., Kleineidam, L., Moreno-Grau, S., Amin, N., Naj, A. C., Campos-Martin, R., Grenie...

work page 1986

[2] [2]

& Pan, W.-C

Huang, Y.-T. & Pan, W.-C. (2015), ‘Hypothesis Test of Mediation Effect in Causal Mediation Model With High-Dimensional Continuous Mediators’,Biometrics72(2), 402–413. Imai, K., Keele, L. & Yamamoto, T. (2010), ‘Identification, Inference and Sensitivity Analysis for Causal Mediation Effects’,Statistical Science25(1), 51–71. 40 Jia, J. & Yu, B. (2010), ‘On ...

work page 2015

[3] [3]

Katsumi, Y., Quimby, M., Hochberg, D., Jones, A., Brickhouse, M., Eldaief, M.C., Dickerson, B. C. & Touroutoglou, A. (2023), ‘Association of Regional Cortical Network Atrophy With Progression to Dementia in Patients With Primary Progressive Aphasia’,Neurology 100(3), e286–e296. Kunkle, B. W., Grenier-Boley, B., Sims, R., Bis, J. C., Damotte, V., Naj, A. C...

work page 2023

[4] [4]

G., Dikmen, M., Hanoğlu, L

Mızrak, H. G., Dikmen, M., Hanoğlu, L. & Şakul, B. U. (2024), ‘Investigation of hemispheric asymmetry in Alzheimer’s disease patients during resting state revealed BY fNIRS’, Scientific Reports14(1), 13454. Mohan, A., Roberto, A. J., Mohan, A., Lorenzo, A., Jones, K., Carney, M. J., Liogier- Weyback, L., Hwang, S. & Lapidus, K. A. (2016), ‘The Significanc...

work page 2024

[5] [5]

(2015a),Explanation in Causal Inference: Methods for Mediation and Interaction, Oxford University Press

VanderWeele, T. (2015a),Explanation in Causal Inference: Methods for Mediation and Interaction, Oxford University Press. VanderWeele, T. J. (2015b), ‘Mediation Analysis: A Practitioner’s Guide’,Annual Review Public Health37(1), 17–32. VanderWeele, T. & Vansteelandt, S. (2014), ‘Mediation Analysis with Multiple Mediators’, Epidemiologic Methods2(1), 95–115...

work page 2014

[6] [6]

High-dimensional Many–to–many–to–many Mediation Analysis

45 Supplementary Materials to High-dimensional Many–to–many–to–many Mediation Analysis This document contains the Supplementary Materials to the paper “High-dimensional Many–to–many–to–many Mediation Analysis”. Appendix A.1 provides the proofs of all theorems and lemmas related to the consistency of the proposed method. Appendix A.2 presents the proofs of...

work page 2010

[7] [7]

Proof of Theorem 3.6.We adapt the proof of Theorem 1 in Jia & Yu (2010)

We now start to prove Theorem 3.6. Proof of Theorem 3.6.We adapt the proof of Theorem 1 in Jia & Yu (2010). Analysis ofM(V) We haveVm is Gaussian random variable with mean: µVm =E(V m) = ( n∑ i=1 (mi)mm⊤ i,(1) )( n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1)+λY,2I )−1( λY,1 − →b+ 2λY,2βk,(1) ) . The EIC implies that: ⏐⏐⏐⏐ 1 n n∑ i=1 mi,(2)m⊤ i,(1) ( 1 n n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1)+ λY...

work page 2010

[8] [8]

By applying Markov’s inequality and (A4), we obtain: P   max m∈Dc ⏐⏐⏐˜Vm ⏐⏐⏐ λY,1 >Ψ  ≤ E ( max m∈Dc |˜Vm| ) λY,1 Ψ ≤ 8 √ log(p−d) λY,1 Ψ max m∈Dc √ E (˜V 2 m ) . Now, by straightforward computation, sinceE(˜Vm) = 0, one hasE (˜V 2 m ) =Var (˜Vm ) , so: E (˜V 2 m ) = 4 n∑ i=1 (( mi ) m )2[ 1−m⊤ i,(1) ( n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1)+λY,2I )−1 mi,(1) ]2 Var (( ...

work page 2009

[9] [9]

=E [⏐⏐⏐ ( ξ1 ) k ⏐⏐⏐ 2+ϖ]( max 1≤i≤n R2 i ) ϖ 2 . Moreover, for all1≤i≤n, sincev⊤v= 1, R2 i≤  ( n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1) )−1/2 mi,(1)  2 2 = 1 n  ( 1 n n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1) )−1/2 mi,(1)  2 2 ≤1 n ( Λ min (1 n n∑ ℓ=1 mℓ,(1)m⊤ ℓ,(1) ))−1 d m  2 ∞ . Consequently, 1 (√ Var(J3) )2+ϖ n∑ i=1 E [⏐⏐⏐ ( ξi ) k ⏐⏐⏐ 2+ϖ]⏐⏐⏐Ri ⏐⏐⏐ 2+ϖ ≤E [⏐⏐⏐ ( ξ1 ...

work page 2009