pith. sign in

arxiv: 1907.02992 · v1 · pith:EVVZ562Ynew · submitted 2019-07-05 · ⚛️ physics.bio-ph · cond-mat.stat-mech· q-bio.SC

Towards a theory of assembly of protein complexes: lessons from equilibrium statistical physics

Pith reviewed 2026-05-25 01:26 UTC · model grok-4.3

classification ⚛️ physics.bio-ph cond-mat.stat-mechq-bio.SC
keywords protein complexesself-assemblyequilibrium thermodynamicsmultifarious assemblychimeric assemblyheterogeneous compositionsparse component usecellular noise
0
0 comments X

The pith

Equilibrium thermodynamics shows heterogeneous compositions and sparse component use enable reliable assembly of many distinct protein complexes.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper constructs an equilibrium thermodynamic model of self-assembly to determine how cells can produce multiple different protein complexes without forming incorrect mixed structures amid noise. The model produces four regimes, and only in the multifarious assembly regime do distinct complexes coexist stably. This regime appears when complex compositions are heterogeneous enough and when each protein component participates in only a few complexes. Database checks indicate that actual cellular complexes meet both requirements.

Core claim

Our equilibrium thermodynamic model of self-assembly exhibits four behaviors: diluted solution, liquid mixture, chimeric assembly, and multifarious assembly. In the multifarious regime different protein complexes coexist without forming erroneous chimeric structures. Two conditions must be met: complex compositions must be sufficiently heterogeneous and component usage across complexes must be sparse. Analysis of protein complex databases suggests cellular systems have evolved to satisfy both conditions.

What carries the argument

The multifarious assembly regime in the equilibrium self-assembly model, which permits stable coexistence of distinct complexes when compositions are heterogeneous and component sharing is sparse.

If this is right

  • Distinct protein complexes can form and coexist without producing chimeric errors under the two stated conditions.
  • Heterogeneous composition of complexes is required to reach the reliable regime.
  • Sparse use of each component by only a few complexes is also required.
  • Public databases of protein complexes are consistent with cells having evolved to meet both conditions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Cells may use active, energy-consuming mechanisms to reinforce the sparsity and heterogeneity that equilibrium already favors.
  • The same two conditions could serve as design rules for engineering synthetic multi-protein systems that avoid cross-talk.
  • If sparsity is strictly necessary, the total number of distinct complexes that can be maintained is bounded by the size of the component pool.
  • The approach may extend to other noisy self-assembly problems such as virus capsid formation or organelle biogenesis.

Load-bearing premise

An equilibrium thermodynamic model captures the essential physical constraints on protein complex formation despite cells operating far from equilibrium.

What would settle it

A survey of protein complexes showing either largely overlapping compositions or dense component sharing across many complexes would indicate cells do not satisfy the conditions for the multifarious regime.

Figures

Figures reproduced from arXiv: 1907.02992 by Pablo Sartori, Stanislas Leibler.

Figure 1
Figure 1. Figure 1: FIG. 1 [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: FIG. 2 [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: FIG. 3 [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: A. The histogram reveals a large abundance of high￾heterogeneity complexes, hc ∈ [0.8, 1.0], which supports the arguments presented in this work. From Fig. 4A it is also apparent that a significant number of complexes have intermediate values of heterogeneity, hc ∈ [0.4, 0.6). Given that our theory applies to structures without any geometrical symmetry (as described in Materials & Methods), we can ask whet… view at source ↗
Figure 5
Figure 5. Figure 5: FIG. 5 [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: FIG. 6 [PITH_FULL_IMAGE:figures/full_fig_p010_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: We considered each of the c = 1, . . . , K different complexes to be a square arrangement of square components representing proteins with lateral size √ M, so that Mc = M (all complexes have the same size). Each complex contains N different component species, Nc = N ≤ M (all complexes have the same number of species). The total number of component species is Ntot ≥ N, and for each complex its N different c… view at source ↗
Figure 7
Figure 7. Figure 7: FIG. 7 [PITH_FULL_IMAGE:figures/full_fig_p011_7.png] view at source ↗
Figure 2
Figure 2. Figure 2: In this case the usage of the available component species is “dense”, and complexes are fully heterogeneous. [PITH_FULL_IMAGE:figures/full_fig_p012_2.png] view at source ↗
Figure 8
Figure 8. Figure 8: FIG. 8 [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: FIG. 9 [PITH_FULL_IMAGE:figures/full_fig_p014_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: FIG. 10 [PITH_FULL_IMAGE:figures/full_fig_p015_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: FIG. 11 [PITH_FULL_IMAGE:figures/full_fig_p016_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: FIG. 12 [PITH_FULL_IMAGE:figures/full_fig_p018_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: FIG. 13 [PITH_FULL_IMAGE:figures/full_fig_p020_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: FIG. 14 [PITH_FULL_IMAGE:figures/full_fig_p021_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: FIG. 15 [PITH_FULL_IMAGE:figures/full_fig_p022_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: FIG. 16 [PITH_FULL_IMAGE:figures/full_fig_p022_16.png] view at source ↗
Figure 17
Figure 17. Figure 17: FIG. 17 [PITH_FULL_IMAGE:figures/full_fig_p023_17.png] view at source ↗
Figure 18
Figure 18. Figure 18: FIG. 18 [PITH_FULL_IMAGE:figures/full_fig_p024_18.png] view at source ↗
Figure 19
Figure 19. Figure 19: FIG. 19 [PITH_FULL_IMAGE:figures/full_fig_p026_19.png] view at source ↗
Figure 20
Figure 20. Figure 20: FIG. 20 [PITH_FULL_IMAGE:figures/full_fig_p027_20.png] view at source ↗
read the original abstract

Cellular functions are established through biological evolution, but are constrained by the laws of physics. For instance, the physics of protein folding limits the lengths of cellular polypeptide chains. Consequently, many cellular functions are carried out not by long, isolated proteins, but rather by multi-protein complexes. Protein complexes themselves do not escape physical constraints, one of the most important being the difficulty to assemble reliably in the presence of cellular noise. In order to lay the foundation for a theory of reliable protein complex assembly, we study here an equilibrium thermodynamic model of self-assembly that exhibits four distinct assembly behaviors: diluted protein solution, liquid mixture, "chimeric assembly" and "multifarious assembly". In the latter regime, different protein complexes can coexist without forming erroneous chimeric structures. We show that two conditions have to be fulfilled to attain this regime: (i) the composition of the complexes needs to be sufficiently heterogeneous, and (ii) the use of the set of components by the complexes has to be sparse. Our analysis of publicly available databases of protein complexes indicates that cellular protein systems might have indeed evolved so to satisfy both of these conditions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The paper develops an equilibrium thermodynamic model of multi-protein self-assembly exhibiting four regimes (diluted solution, liquid mixture, chimeric assembly, multifarious assembly). It derives that the multifarious regime—reliable coexistence of distinct complexes without erroneous chimeras—requires (i) sufficiently heterogeneous complex compositions and (ii) sparse component usage across complexes. Public protein-complex databases are analyzed to argue that cellular systems appear to have evolved to satisfy both conditions.

Significance. If the central derivation holds, the work supplies a concrete statistical-physics criterion for reliable multifarious assembly and supplies an independent empirical check via database statistics. This is a strength: the conditions are falsifiable against composition data and could inform evolutionary hypotheses. The equilibrium framing, however, leaves open whether the identified conditions remain load-bearing once active cellular processes are included.

major comments (1)
  1. [Abstract and biological-implications section] Abstract (final sentence) and the section on biological implications: the inference that 'cellular protein systems might have indeed evolved so to satisfy both of these conditions' treats the equilibrium-model boundaries as directly relevant to vivo assembly. Because cells operate far from equilibrium with continuous energy dissipation, chaperones, and regulated disassembly, the heterogeneity/sparsity conditions may be neither necessary nor sufficient in vivo; a concrete test would be to recompute the regime diagram after adding a minimal non-equilibrium term (e.g., ATP-driven dissociation rate) and to check whether the multifarious window shrinks or disappears.
minor comments (1)
  1. [Abstract] The abstract lists the four regimes but does not explicitly name 'diluted protein solution, liquid mixture, chimeric assembly and multifarious assembly'; adding the names would improve immediate readability.

Simulated Author's Rebuttal

1 responses · 1 unresolved

We thank the referee for the constructive report. The central point concerns the scope of our equilibrium model and the phrasing of its biological implications. We address this below and will make targeted revisions to clarify the model's limitations while preserving the core results.

read point-by-point responses
  1. Referee: [Abstract and biological-implications section] Abstract (final sentence) and the section on biological implications: the inference that 'cellular protein systems might have indeed evolved so to satisfy both of these conditions' treats the equilibrium-model boundaries as directly relevant to vivo assembly. Because cells operate far from equilibrium with continuous energy dissipation, chaperones, and regulated disassembly, the heterogeneity/sparsity conditions may be neither necessary nor sufficient in vivo; a concrete test would be to recompute the regime diagram after adding a minimal non-equilibrium term (e.g., ATP-driven dissociation rate) and to check whether the multifarious window shrinks or disappears.

    Authors: We agree that the model is strictly at equilibrium and that in vivo assembly involves active processes. Our statement is intended only as an observation that the two conditions required for the multifarious regime in equilibrium are statistically satisfied by real protein-complex data; we do not claim these conditions are necessary or sufficient once energy dissipation, chaperones, or regulated disassembly are present. We will revise both the abstract and the biological-implications section to (i) state explicitly that the analysis is equilibrium, (ii) describe the database result as an empirical consistency check rather than evidence of evolutionary optimization, and (iii) note that non-equilibrium mechanisms may relax or replace the identified requirements. The suggested numerical test with an ATP-driven term would require an entirely new non-equilibrium formulation and is therefore outside the scope of the present work. revision: partial

standing simulated objections not resolved
  • Recomputing the regime diagram after introducing a minimal non-equilibrium term (e.g., ATP-driven dissociation) would require developing a new dynamical model, which lies beyond the equilibrium framework of the manuscript.

Circularity Check

0 steps flagged

No circularity: model-derived conditions checked against independent database

full rationale

The paper defines an equilibrium thermodynamic model of self-assembly, enumerates four regimes (diluted, liquid mixture, chimeric, multifarious) from its partition function and free-energy analysis, and derives the two conditions (heterogeneous composition, sparse component usage) as the parameter regime boundaries that suppress chimeras. These conditions are outputs of the model's equations rather than inputs or self-definitions. The subsequent analysis of public protein-complex databases constitutes an external empirical test of whether evolved systems occupy that regime; it does not feed back into the derivation or rename fitted parameters as predictions. No self-citations, ansatzes smuggled via prior work, or uniqueness theorems appear in the load-bearing steps. The derivation chain is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

Abstract-only review prevents exhaustive enumeration; the model necessarily introduces effective interaction energies or concentrations whose values are not specified here.

free parameters (1)
  • interaction energies or binding affinities
    Thermodynamic models of assembly typically fit or choose component interaction strengths to produce the observed regimes.
axioms (1)
  • domain assumption Protein complex assembly can be usefully approximated by an equilibrium thermodynamic model
    The paper explicitly studies an equilibrium thermodynamic model of self-assembly.

pith-pipeline@v0.9.0 · 5732 in / 1404 out tokens · 41478 ms · 2026-05-25T01:26:31.407546+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

56 extracted references · 56 canonical work pages

  1. [1]

    The complete atomic structure of the large ribosomal subunit at 2.4a resolution,

    Nenad Ban, Poul Nissen, Jeffrey Hansen, Peter B Moore, and Thomas A Steitz, “The complete atomic structure of the large ribosomal subunit at 2.4a resolution,” Science 289, 905–920 (2000)

  2. [2]

    Infinite assembly of folded proteins in evolution, disease, and engineering,

    Hector Garcia-Seisdedos, Jos´ e A Villegas, and Em- manuel D Levy, “Infinite assembly of folded proteins in evolution, disease, and engineering,” Angewandte Chemie International Edition (2018)

  3. [3]

    Ribosomes are optimized for autocatalytic production,

    Shlomi Reuveni, M˚ ans Ehrenberg, and Johan Paulsson, “Ribosomes are optimized for autocatalytic production,” Nature 547, 293 (2017)

  4. [4]

    Dynamic interactions of pro- teins in complex networks: a more structured view,

    Amelie Stein, Roland A Pache, Pau Bernad´ o, Miquel Pons, and Patrick Aloy, “Dynamic interactions of pro- teins in complex networks: a more structured view,” The FEBS journal 276, 5390–5405 (2009)

  5. [5]

    Combinatorial signal perception in the bmp pathway,

    Yaron E Antebi, James M Linton, Heidi Klumpe, Bogdan Bintu, Mengsha Gong, Christina Su, Reed McCardell, and Michael B Elowitz, “Combinatorial signal perception in the bmp pathway,” Cell 170, 1184–1196 (2017)

  6. [6]

    Recycling the cell cycle: cyclins re- visited,

    Andrew W Murray, “Recycling the cell cycle: cyclins re- visited,” Cell 116, 221–234 (2004)

  7. [7]

    Proteins evolve on the edge of supramolecular self-assembly,

    Hector Garcia-Seisdedos, Charly Empereur-Mot, Nadav Elad, and Emmanuel D Levy, “Proteins evolve on the edge of supramolecular self-assembly,” Nature 548, 244 (2017)

  8. [8]

    Genetic code translation displays a linear trade-off be- tween efficiency and accuracy of trna selection,

    Magnus Johansson, Jingji Zhang, and M˚ ans Ehrenberg, “Genetic code translation displays a linear trade-off be- tween efficiency and accuracy of trna selection,” Proceed- ings of the National Academy of Sciences 109, 131–136 (2012)

  9. [9]

    Golden triangle for folding rates of globular proteins,

    Sergiy O Garbuzynskiy, Dmitry N Ivankov, Natalya S Bogatyreva, and Alexei V Finkelstein, “Golden triangle for folding rates of globular proteins,” Proceedings of the National Academy of Sciences 110, 147–150 (2013)

  10. [10]

    Ron Milo and Rob Phillips, Cell biology by the numbers (Garland Science, New York, NY, USA, 2015)

  11. [11]

    CO- RUM: the comprehensive resource of mammalian protein complexes–2019,

    Madalina Giurgiu, Julian Reinhard, Barbara Brauner, Irmtraud Dunger-Kaltenbach, Gisela Fobo, Goar Frish- man, Corinna Montrone, and Andreas Ruepp, “CO- RUM: the comprehensive resource of mammalian protein complexes–2019,” Nucleic acids research 47, D559–D563 (2018)

  12. [12]

    Visualizing ribo- some biogenesis: parallel assembly pathways for the 30s subunit,

    Anke M Mulder, Craig Yoshioka, Andrea H Beck, Anne E Bunner, Ronald A Milligan, Clinton S Potter, Bridget Carragher, and James R Williamson, “Visualizing ribo- some biogenesis: parallel assembly pathways for the 30s subunit,” Science 330, 673–677 (2010)

  13. [13]

    Proteome survey reveals modularity of the yeast cell machinery,

    Anne-Claude Gavin, Patrick Aloy, Paola Grandi, Roland Krause, Markus Boesche, Martina Marzioch, Christina Rau, Lars Juhl Jensen, Sonja Bastuck, Birgit D¨ umpelfeld,et al. , “Proteome survey reveals modularity of the yeast cell machinery,” Nature 440, 631 (2006)

  14. [14]

    3D complex: a structural clas- sification of protein complexes,

    Emmanuel D Levy, Jose B Pereira-Leal, Cyrus Chothia, and Sarah A Teichmann, “3D complex: a structural clas- sification of protein complexes,” PLoS computational bi- ology 2, e155 (2006)

  15. [15]

    Principles of assembly reveal a periodic table of protein complexes,

    Sebastian E Ahnert, Joseph A Marsh, Helena Hern´ andez, Carol V Robinson, and Sarah A Teichmann, “Principles of assembly reveal a periodic table of protein complexes,” Science 350, aaa2245 (2015)

  16. [16]

    Molecular dynamics simula- tions of large macromolecular complexes,

    Juan R Perilla, Boon Chong Goh, C Keith Cassidy, Bo Liu, Rafael C Bernardi, Till Rudack, Hang Yu, Zhe Wu, and Klaus Schulten, “Molecular dynamics simula- tions of large macromolecular complexes,” Current opin- ion in structural biology 31, 64–74 (2015)

  17. [17]

    Jonathon Howard, Mechanics of motor proteins and the cytoskeleton (Sinauer associates, Sunderland, MA, USA, 2001)

  18. [18]

    The rotary motor of bacterial flagella,

    Howard C Berg, “The rotary motor of bacterial flagella,” Annual review of biochemistry 72, 19–54 (2003)

  19. [19]

    Multifarious assembly mixtures: Systems allowing retrieval of diverse stored structures,

    Arvind Murugan, Zorana Zeravcic, Michael P Brenner, and Stanislas Leibler, “Multifarious assembly mixtures: Systems allowing retrieval of diverse stored structures,” Proceedings of the National Academy of Sciences 112, 54–59 (2015)

  20. [20]

    (Dover, Mineola, NY, USA, 1989)

    Terrell L Hill, Free energy transduction and biochemical cycle kinetics, 2nd ed. (Dover, Mineola, NY, USA, 1989)

  21. [21]

    Germline P granules are liquid droplets that localize by controlled dissolution/condensation,

    Clifford P Brangwynne, Christian R Eckmann, David S Courson, Agata Rybarska, Carsten Hoege, J¨ obin Gharakhani, Frank J¨ ulicher, and Anthony A Hyman, “Germline P granules are liquid droplets that localize by controlled dissolution/condensation,” Science 324, 1729– 1732 (2009)

  22. [22]

    Instabilities in com- plex mixtures with a large number of components,

    Richard P Sear and Jos´ e A Cuesta, “Instabilities in com- plex mixtures with a large number of components,” Phys- ical review letters 91, 245701 (2003)

  23. [23]

    Aggresomes, inclusion bodies and pro- tein aggregation,

    Ron R Kopito, “Aggresomes, inclusion bodies and pro- tein aggregation,” Trends in cell biology 10, 524–530 9 (2000)

  24. [24]

    Complex portal 2018: extended content and enhanced visualization tools for macromolecular complexes,

    Birgit H M Meldal, Hema Bye-A-Jee, Luk´ aˇ s Gajdoˇ s, Zuzana Hammerov´ a, Aneta Hor´ aˇ ckov´ a, Filip Melicher, Livia Perfetto, Daniel Pokorn` y, Milagros Rodriguez Lopez, Alˇ zbˇ eta T¨ urkov´ a,et al. , “Complex portal 2018: extended content and enhanced visualization tools for macromolecular complexes,” Nucleic acids research 47, D550–D558 (2018)

  25. [25]

    The protein data bank,

    Helen M Berman, John Westbrook, Zukang Feng, Gary Gilliland, Talapady N Bhat, Helge Weissig, Ilya N Shindyalov, and Philip E Bourne, “The protein data bank,” Nucleic acids research 28, 235–242 (2000)

  26. [26]

    Nucleolus: the fasci- nating nuclear body,

    Valentina Sirri, Silvio Urcuqui-Inchima, Pascal Roussel, and Dani` ele Hernandez-Verdun, “Nucleolus: the fasci- nating nuclear body,” Histochemistry and cell biology 129, 13–31 (2008)

  27. [27]

    Ordering genes in a flagella pathway by analysis of expression kinetics from living bacteria,

    S Kalir, J McClure, K Pabbaraju, C Southward, M Ro- nen, S Leibler, MG Surette, and U Alon, “Ordering genes in a flagella pathway by analysis of expression kinetics from living bacteria,” Science 292, 2080–2083 (2001)

  28. [28]

    Dancing on DNA: kinetic aspects of search pro- cesses on DNA,

    Anahita Tafvizi, Leonid A Mirny, and Antoine M van Oijen, “Dancing on DNA: kinetic aspects of search pro- cesses on DNA,” Chemphyschem 12, 1481–1489 (2011)

  29. [29]

    Energetic fun- nel facilitates facilitated diffusion,

    Massimo Cencini and Simone Pigolotti, “Energetic fun- nel facilitates facilitated diffusion,” Nucleic acids research 46, 558–567 (2017)

  30. [30]

    Kinetic proofreading: a new mech- anism for reducing errors in biosynthetic processes re- quiring high specificity,

    John J Hopfield, “Kinetic proofreading: a new mech- anism for reducing errors in biosynthetic processes re- quiring high specificity,” Proceedings of the National Academy of Sciences 71, 4135–4139 (1974)

  31. [31]

    Nonequilibrium asso- ciative retrieval of multiple stored self-assembly targets,

    Gili Bisker and Jeremy L England, “Nonequilibrium asso- ciative retrieval of multiple stored self-assembly targets,” Proceedings of the National Academy of Sciences 115, E10531–E10538 (2018)

  32. [32]

    Driv- ing ribosome assembly,

    Dieter Kressler, Ed Hurt, and Jochen Ba βler, “Driv- ing ribosome assembly,” Biochimica et Biophysica Acta (BBA)-Molecular Cell Research 1803, 673–683 (2010)

  33. [33]

    Assembly mapping of 30s ribosomal proteins from e. coli,

    Shoji Mizushima and Masayasu Nomura, “Assembly mapping of 30s ribosomal proteins from e. coli,” Nature 226, 1214 (1970)

  34. [34]

    Assembly map of the large subunit (50s) of Escherichia coli ribosomes,

    R R¨ ohl and Knud H Nierhaus, “Assembly map of the large subunit (50s) of Escherichia coli ribosomes,” Pro- ceedings of the National Academy of Sciences 79, 729– 733 (1982)

  35. [35]

    Comparing protein folding in vitro and in vivo: foldability meets the fitness challenge,

    Karan S Hingorani and Lila M Gierasch, “Comparing protein folding in vitro and in vivo: foldability meets the fitness challenge,” Current opinion in structural biology 24, 81–90 (2014)

  36. [36]

    Heat oscillations driven by the em- bryonic cell cycle reveal the energetic costs of signaling,

    Jonathan Rodenfels, Karla M Neugebauer, and Jonathon Howard, “Heat oscillations driven by the em- bryonic cell cycle reveal the energetic costs of signaling,” Developmental cell 48, 646–658 (2019)

  37. [37]

    Statistics of shared components in complex component systems,

    Andrea Mazzolini, Marco Gherardi, Michele Caselle, Marco Cosentino Lagomarsino, and Matteo Osella, “Statistics of shared components in complex component systems,” Physical Review X 8, 021023 (2018)

  38. [38]

    Programmable materials and the nature of the DNA bond,

    Matthew R Jones, Nadrian C Seeman, and Chad A Mirkin, “Programmable materials and the nature of the DNA bond,” Science 347, 1260901 (2015)

  39. [39]

    Colloquium: Toward living matter with col- loidal particles,

    Zorana Zeravcic, Vinothan N Manoharan, and Michael P Brenner, “Colloquium: Toward living matter with col- loidal particles,” Reviews of Modern Physics 89, 031001 (2017)

  40. [40]

    Low- temperature behavior of two-dimensional gaussian ising spin glasses,

    J´ erˆ ome Houdayer and Alexander K Hartmann, “Low- temperature behavior of two-dimensional gaussian ising spin glasses,” Physical Review B 70, 014418 (2004)

  41. [41]

    The complex portal- an encyclopaedia of macromolecular complexes,

    Birgit HM Meldal, Oscar Forner-Martinez, Maria C Costanzo, Jose Dana, Janos Demeter, Marine Du- mousseau, Selina S Dwight, Anna Gaulton, Luana Li- cata, Anna N Melidoni, et al. , “The complex portal- an encyclopaedia of macromolecular complexes,” Nucleic acids research 43, D479–D484 (2014)

  42. [42]

    Up-to-date catalogues of yeast protein complexes,

    Shuye Pu, Jessica Wong, Brian Turner, Emerson Cho, and Shoshana J Wodak, “Up-to-date catalogues of yeast protein complexes,” Nucleic acids research 37, 825–831 (2008)

  43. [43]

    Detecting overlapping protein complexes in protein- protein interaction networks,

    Tam´ as Nepusz, Haiyuan Yu, and Alberto Paccanaro, “Detecting overlapping protein complexes in protein- protein interaction networks,” Nature methods 9, 471 (2012)

  44. [44]

    Global landscape of protein complexes in the yeast saccha- romyces cerevisiae,

    Nevan J Krogan, Gerard Cagney, Haiyuan Yu, Gouqing Zhong, Xinghua Guo, Alexandr Ignatchenko, Joyce Li, Shuye Pu, Nira Datta, Aaron P Tikuisis, et al. , “Global landscape of protein complexes in the yeast saccha- romyces cerevisiae,” Nature 440, 637 (2006)

  45. [45]

    Panorama of ancient metazoan macromolecular complexes,

    Cuihong Wan, Blake Borgeson, Sadhna Phanse, Fan Tu, Kevin Drew, Greg Clark, Xuejian Xiong, Olga Kagan, Julian Kwan, Alexandr Bezginov, et al. , “Panorama of ancient metazoan macromolecular complexes,” Nature 525, 339 (2015)

  46. [46]

    A high-accuracy consensus map of yeast protein complexes reveals modular nature of gene essentiality,

    G Traver Hart, Insuk Lee, and Edward M Marcotte, “A high-accuracy consensus map of yeast protein complexes reveals modular nature of gene essentiality,” BMC bioin- formatics 8, 236 (2007)

  47. [47]

    Integration of over 9,000 mass spectrometry experiments builds a global map of human protein complexes,

    Kevin Drew, Chanjae Lee, Ryan L Huizar, Fan Tu, Blake Borgeson, Claire D McWhite, Yun Ma, John B Walling- ford, and Edward M Marcotte, “Integration of over 9,000 mass spectrometry experiments builds a global map of human protein complexes,” Molecular systems biology 13, 932 (2017)

  48. [48]

    An efficient algorithm for large-scale detec- tion of protein families,

    Anton J Enright, Stijn Van Dongen, and Christos A Ouzounis, “An efficient algorithm for large-scale detec- tion of protein families,” Nucleic acids research30, 1575– 1584 (2002)

  49. [49]

    A human interactome in three quantitative dimensions or- ganized by stoichiometries and abundances,

    Marco Y Hein, Nina C Hubner, Ina Poser, J¨ urgen Cox, Nagarjuna Nagaraj, Yusuke Toyoda, Igor A Gak, Ina Weisswange, J¨ org Mansfeld, Frank Buchholz,et al. , “A human interactome in three quantitative dimensions or- ganized by stoichiometries and abundances,” Cell 163, 712–723 (2015)

  50. [50]

    The bioplex network: a systematic exploration of the human interactome,

    Edward L Huttlin, Lily Ting, Raphael J Bruckner, Fana Gebreab, Melanie P Gygi, John Szpyt, Stanley Tam, Gabriela Zarraga, Greg Colby, Kurt Baltier, et al. , “The bioplex network: a systematic exploration of the human interactome,” Cell 162, 425–440 (2015). 10 FIG. 6. Schematic representation of model parameters. A. Proteins (represented as gray shapes) of...

  51. [51]

    A lattice site i and a species α (including α = 0) were randomly selected

  52. [52]

    The change in energy, ∆U, associated with changing the species currently in i byα was calculated using Eq. 6

  53. [53]

    The Glauber transition probability was calculated, W = (1 + exp(∆U))−1

  54. [54]

    A random number p∈ [0, 1] was generated

  55. [55]

    peeling-off

    The species change was performed or not, depending on whether p<W or not. Steps 1− 5 constituted one Monte Carlo step. Typical simulations were run for ∼ 106 latt, where 1 latt is a lattice sweep and corresponds to L Monte Carlo steps, with L = √ L× √ L the size of the square lattice. Unless otherwise specified, we used √ L = 40. 12 In order to quantify ou...

  56. [56]

    random model

    and assigns a p−value for the interactions. Again, we clustered the data using the ClusterOne algorithm with weights given by 1−p (parameters as before). The histograms of qα for datasets IV and V are shown in panels B and C of Fig. 20. Panel B exhibits a very similar trend to panel A, with highly participatory proteins that cannot be explained by our sim...