Multivariate Partial Information Decomposition: Constructions, Inconsistencies, and Alternative Measures
Pith reviewed 2026-05-19 00:22 UTC · model grok-4.3
The pith
No lattice-based partial information decomposition can stay consistent for three or more sources.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Lattice-based partial information decomposition cannot be made consistent for three or more sources because no assignment of the information atoms satisfies the required consistency condition on every subset simultaneously; the three-source case where the sum of atoms exceeds total mutual information is extended to this general impossibility result. For two sources, explicit closed-form formulas are given for all atoms. New measures are defined via auxiliary systems of random variables that eliminate higher-order dependencies while preserving the target atoms, and these measures obey additivity and continuity.
What carries the argument
The impossibility theorem for lattice decompositions with more than two sources, together with auxiliary random variable systems that isolate unique and synergistic information by removing higher-order dependencies.
If this is right
- Two-source partial information decomposition now has explicit formulas that satisfy the complete set of axioms.
- Any lattice-based assignment of atoms will produce inconsistencies for four or more sources.
- The new measures remain additive and continuous by construction.
- Numerical checks on the Ising model confirm that the measures capture high-order relations without lattice contradictions.
Where Pith is reading between the lines
- The auxiliary-variable construction could be tested on other models such as neural networks or gene regulatory networks to check whether it recovers expected synergistic effects.
- A follow-up direction is to extend the same auxiliary approach to also quantify redundant information in the multivariate case.
- The impossibility result implies that future multivariate decomposition work should start from non-lattice structures rather than trying to repair the existing lattice.
Load-bearing premise
The auxiliary random variable systems correctly remove higher-order dependencies without introducing new artifacts or losing the original information relations.
What would settle it
A concrete three-source probability distribution on which the proposed unique or synergistic measure violates additivity or returns a negative value.
Figures
read the original abstract
While mutual information effectively quantifies dependence between two variables, it does not by itself reveal the complex, fine-grained interactions among variables, i.e., how multiple sources contribute redundantly, uniquely, or synergistically to a target in multivariate settings. The Partial Information Decomposition (PID) framework was introduced to address this by decomposing the mutual information between a set of source variables and a target variable into fine-grained information atoms such as redundant, unique, and synergistic components. In this work, we review the axiomatic system and desired properties of the PID framework and make three main contributions. First, we resolve the two-source PID case by providing explicit closed-form formulas for all information atoms that satisfy the full set of axioms and desirable properties. Second, we prove that for three or more sources, PID suffers from fundamental inconsistencies: we review the known three-variable counterexample where the sum of atoms exceeds the total information, and extend it to a comprehensive impossibility theorem showing that no lattice-based decomposition can be consistent for all subsets when the number of sources exceeds three. Finally, we deviate from the PID lattice approach to avoid its inconsistencies, and present explicit measures of multivariate unique and synergistic information. Our proposed measures, which rely on new systems of random variables that eliminate higher-order dependencies, satisfy key axioms such as additivity and continuity, provide a robust theoretical explanation of high-order relations, and show strong numerical performance in comprehensive experiments on the Ising model. Our findings highlight the need for a new framework for studying multivariate information decomposition.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper reviews the axiomatic foundations of Partial Information Decomposition (PID) and makes three contributions: (1) explicit closed-form expressions for all atoms in the two-source case that satisfy the full set of axioms including non-negativity, additivity, and continuity; (2) an impossibility theorem showing that no lattice-based PID can be consistent across all subsets for three or more sources, obtained by extending the known three-variable sum-exceeding-total counterexample; and (3) alternative explicit measures of multivariate unique and synergistic information constructed via auxiliary systems of random variables that are designed to remove higher-order dependencies while preserving the target atoms. These measures are shown to obey additivity and continuity, with numerical validation on the Ising model.
Significance. If the auxiliary-variable constructions are shown to isolate atoms without residual cross terms for arbitrary alphabets, the impossibility result would be a substantial clarification of the limits of lattice-based PID, while the new measures would supply a concrete, axiom-compliant alternative that avoids the lattice inconsistencies. The two-source closed forms and the Ising experiments already provide immediate practical value.
major comments (2)
- [Section on proposed measures] Section on proposed measures: the claim that the auxiliary random-variable systems eliminate higher-order dependencies while preserving the information atoms lacks an explicit proof that the construction commutes with the mutual-information functional for arbitrary finite alphabets; without this, it is unclear whether non-negativity and the absence of residual cross terms hold beyond the tested regimes.
- [Ising experiments] Ising-model experiments: the numerical checks are confined to a narrow parameter regime (binary variables, nearest-neighbor couplings); this does not yet constitute a general verification that the auxiliary construction isolates unique and synergistic parts without artifacts for higher-order dependencies.
minor comments (2)
- Notation for the auxiliary variables should be introduced with an explicit diagram or table showing the original joint, the auxiliary marginals, and the induced conditional independencies.
- The impossibility theorem statement would benefit from a short corollary clarifying which specific axioms (e.g., monotonicity or consistency with marginals) are violated by any lattice-based attempt for n≥3.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive review. The comments correctly identify areas where additional rigor and breadth would strengthen the presentation of the auxiliary-variable constructions and their validation. We have revised the manuscript accordingly and address each point below.
read point-by-point responses
-
Referee: Section on proposed measures: the claim that the auxiliary random-variable systems eliminate higher-order dependencies while preserving the information atoms lacks an explicit proof that the construction commutes with the mutual-information functional for arbitrary finite alphabets; without this, it is unclear whether non-negativity and the absence of residual cross terms hold beyond the tested regimes.
Authors: We agree that an explicit general proof was not stated with sufficient clarity. The auxiliary systems are defined by adjoining independent copies chosen to match the relevant marginals while nullifying higher-order interactions; by construction this isolates the target atoms. In the revised manuscript we have added a lemma proving that the resulting mutual-information expressions commute with the original joint distribution for any finite alphabet, which directly yields non-negativity and the absence of residual cross terms. The proof relies only on the chain rule and the independence properties built into the auxiliary variables. revision: yes
-
Referee: Ising-model experiments: the numerical checks are confined to a narrow parameter regime (binary variables, nearest-neighbor couplings); this does not yet constitute a general verification that the auxiliary construction isolates unique and synergistic parts without artifacts for higher-order dependencies.
Authors: The original experiments were deliberately restricted to the binary nearest-neighbor Ising model to permit direct comparison with known analytic results on phase transitions. We acknowledge that this regime is narrow. In the revision we have added numerical results for ternary alphabets and for models with next-nearest-neighbor couplings, confirming that the auxiliary construction continues to isolate the intended atoms without visible artifacts. A fully exhaustive verification over all alphabets and dependency structures lies outside the scope of the present work; we have noted this limitation and the need for further study in the discussion section. revision: partial
Circularity Check
No significant circularity: explicit formulas, literature extension, and independent auxiliary constructions
full rationale
The paper resolves the two-source case with explicit closed-form formulas satisfying the full axiom set, extends a known three-variable counterexample (reviewed from prior literature) to a general impossibility theorem for n≥3 sources, and defines alternative unique/synergistic measures via explicit construction of auxiliary random variable systems that remove higher-order dependencies. None of these steps reduce by construction to fitted inputs, self-definitions, or load-bearing self-citations; the auxiliary systems are introduced as new objects and validated numerically on the Ising model. The derivation remains self-contained against external benchmarks with no reduction of predictions to their own inputs.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Standard PID axioms including non-negativity of atoms, additivity over independent sources, and continuity with respect to distributions
invented entities (1)
-
Auxiliary systems of random variables that eliminate higher-order dependencies
independent evidence
Forward citations
Cited by 1 Pith paper
-
Information-theoretic signatures of causality in Bayesian networks and hypergraphs
Partial information decomposition components map directly onto causal roles such as direct parents, children, and colliders in both pairwise Bayesian networks and higher-order hypergraphs.
Reference graph
Works this paper leans on
-
[1]
Consequently, they also share the same (unconditional) joint distribution
share the same conditional joint distribution. Consequently, they also share the same (unconditional) joint distribution. Then, we conclude that forS ′ 1 andS ′ 2 as above, we have thatI(S ′ 1;S 2) =I(S 1;S ′ 2). This, combined with Lemma 3, implies the commutativity of Red. Proof of Axioms 3 (Monotonicity) and 4 (Self-redundancy): According to Def. 3, Un...
-
[2]
Hence, our construction ofQvia (10) achieves the maximum entropy among allQ ′ ∈∆ P
and (T, S2) marginals, the entropy H(S ′ 1, S2, T) is maximized whenS ′ 1 andS 2 are conditionally independent givenT: Q∗(s1, s2, t) = Q∗(s1, t)Q∗(s2, t) Q∗(t) =P(s 1 |t)P(s 2 |t)P(t). Hence, our construction ofQvia (10) achieves the maximum entropy among allQ ′ ∈∆ P . Finally, since Definition 3 defines the unique information as Un(S1 →T|S 2) =I(S ′ 1;T|...
-
[3]
A. Lyu, A. Clark, and N. Raviv, Explicit formula for partial information decomposition, in 2024 IEEE International Symposium on Information Theory (ISIT)(IEEE, 2024) pp. 2329– 2334. 42
work page 2024
-
[4]
C. E. Shannon, A mathematical theory of communication, ACM SIGMOBILE mobile com- puting and communications review5, 3 (2001)
work page 2001
-
[5]
S. Watanabe, Information theoretical analysis of multivariate correlation, IBM Journal of research and development4, 66 (1960)
work page 1960
-
[6]
P. L. Williams and R. D. Beer, Nonnegative decomposition of multivariate information, arXiv preprint arXiv:1004.2515 (2010)
work page internal anchor Pith review Pith/arXiv arXiv 2010
-
[7]
J. Crampton and G. Loizou, The completion of a poset in a lattice of antichains, International Mathematical Journal1, 223 (2001)
work page 2001
-
[8]
E. Schneidman, W. Bialek, and M. J. Berry, Synergy, redundancy, and independence in pop- ulation codes, Journal of Neuroscience23, 11539 (2003)
work page 2003
-
[9]
T. F. Varley, M. Pope, M. Grazia, Joshua, and O. Sporns, Partial entropy decomposition re- veals higher-order information structures in human brain activity, Proceedings of the National Academy of Sciences120, e2300888120 (2023)
work page 2023
-
[10]
B. Rassouli, F. E. Rosas, and D. G¨ und¨ uz, Data disclosure under perfect sample privacy, IEEE Transactions on Information Forensics and Security15, 2012 (2019)
work page 2012
-
[11]
F. Hamman and S. Dutta, Demystifying local and global fairness trade-offs in federated learn- ing using partial information decomposition, arXiv preprint arXiv:2307.11333 (2023)
-
[12]
F. E. Rosas, P. A. Mediano, H. J. Jensen, A. K. Seth, A. B. Barrett, R. L. Carhart-Harris, and D. Bor, Reconciling emergences: An information-theoretic approach to identify causal emergence in multivariate data, PLoS computational biology16, e1008289 (2020)
work page 2020
-
[13]
R. A. Ince, The partial entropy decomposition: Decomposing multivariate entropy and mutual information via pointwise common surprisal, arXiv preprint arXiv:1702.01591 (2017)
work page internal anchor Pith review Pith/arXiv arXiv 2017
- [14]
- [15]
- [16]
-
[17]
V. Griffith, E. K. Chong, R. G. James, C. J. Ellison, and J. P. Crutchfield, Intersection information based on common randomness, Entropy16, 1985 (2014). 43
work page 1985
-
[18]
R. A. Ince, Measuring multivariate redundant information with pointwise common change in surprisal, Entropy19, 318 (2017)
work page 2017
-
[19]
N. Bertschinger, J. Rauh, E. Olbrich, and J. Jost, Shared information—new insights and problems in decomposing information in complex systems, inProceedings of the European conference on complex systems 2012(Springer, 2013) pp. 251–269
work page 2012
- [20]
-
[21]
N. Bertschinger, J. Rauh, E. Olbrich, J. Jost, and N. Ay, Quantifying unique information, Entropy16, 2161 (2014)
work page 2014
-
[22]
Kolchinsky, A novel approach to the partial information decomposition, Entropy24, 403 (2022)
A. Kolchinsky, A novel approach to the partial information decomposition, Entropy24, 403 (2022)
work page 2022
-
[23]
H. K. Ting, On the amount of information, Theory of Probability & Its Applications7, 439 (1962)
work page 1962
-
[24]
R. W. Yeung, A new outlook on shannon’s information measures, IEEE transactions on in- formation theory37, 466 (1991)
work page 1991
-
[25]
D. Chicharro and S. Panzeri, Synergy and redundancy in dual decompositions of mutual information gain and information loss, Entropy19, 71 (2017)
work page 2017
-
[26]
F. E. Rosas, P. A. Mediano, B. Rassouli, and A. B. Barrett, An operational information decomposition via synergistic disclosure, Journal of Physics A: Mathematical and Theoretical 53, 485001 (2020)
work page 2020
-
[27]
J. T. Lizier, B. Flecker, and P. L. Williams, Towards a synergy-based approach to measuring information modification, in2013 IEEE Symposium on Artificial Life (ALIFE)(IEEE, 2013) pp. 43–51
work page 2013
-
[28]
P. L. Williams,Information dynamics: Its theory and application to embodied cognitive sys- tems, Ph.D. thesis, Indiana University (2011)
work page 2011
-
[29]
V. Griffith and C. Koch, Quantifying synergistic mutual information, inGuided self- organization: inception(Springer, 2014) pp. 159–190
work page 2014
-
[30]
J. Rauh, P. K. Banerjee, E. Olbrich, G. Mont´ ufar, and J. Jost, Continuity and additivity properties of information decompositions, International Journal of Approximate Reasoning 161, 108979 (2023)
work page 2023
-
[31]
A. B. Barrett, Exploration of synergistic and redundant information sharing in static and 44 dynamical gaussian systems, Physical Review E91, 052802 (2015)
work page 2015
-
[32]
C. Finn and J. T. Lizier, Pointwise partial information decompositionusing the specificity and ambiguity lattices, Entropy20, 297 (2018)
work page 2018
-
[33]
R. G. James, J. Emenheiser, and J. P. Crutchfield, Unique information via dependency con- straints, Journal of Physics A: Mathematical and Theoretical52, 014002 (2018)
work page 2018
-
[34]
R. G. James, C. J. Ellison, and J. P. Crutchfield, “dit“: a python package for discrete infor- mation theory, Journal of Open Source Software3, 738 (2018)
work page 2018
-
[35]
R. J. Glauber, Time-dependent statistics of the ising model, Journal of mathematical physics 4, 294 (1963)
work page 1963
-
[36]
L. Barnett, J. T. Lizier, M. Harr´ e, A. K. Seth, and T. Bossomaier, Information flow in a kinetic ising model peaks in the disordered phase, Physical review letters111, 177203 (2013)
work page 2013
- [37]
-
[38]
E. T. Jaynes, Information theory and statistical mechanics, Physical review106, 620 (1957). 45
work page 1957
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.