Weighted persistent homology for osmolyte molecular aggregation and hydrogen-bonding network analysis
Pith reviewed 2026-05-24 21:55 UTC · model grok-4.3
The pith
Localized and interactive persistent homology models distinguish TMAO's concentration-dependent local networks from urea's clusters and global circles.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
From the localized persistent homology models, TMAO shows local network structures whose circle elements increase in total number and decrease in relative size with concentration, while urea shows local clusters around 6 Å and a few global circle elements at around 12 Å; the interactive PRDF recovers the same physical properties as the traditional RDF while also characterizing local interaction information, with clear differences at the first peak near 4 Å and in the second peak region from 5 Å to 10 Å.
What carries the argument
Localized persistent homology (LPH) and interactive persistent homology (IPH) applied to weighted molecular point clouds, producing persistent radial distribution functions (PRDF) at global and local scales.
If this is right
- Circle count and size trends provide a quantitative topological signature for osmolyte aggregation that scales with concentration.
- PRDF at local filtration scales supplies interaction detail beyond what global RDF captures.
- The reported separation of local clusters versus global circles offers a structural basis for comparing TMAO stabilization and urea denaturation effects.
- The same LPH and IPH constructions can be applied to other osmolyte or solvent systems to generate comparable topological descriptors.
Where Pith is reading between the lines
- If the topological signatures prove robust across force fields, they could serve as order parameters for coarse-grained models of osmolyte-protein interactions.
- Tracking how PRDF local peaks evolve along molecular dynamics trajectories might reveal transient network rearrangements not visible in time-averaged RDF.
- The distinction between local and global circle elements suggests a route to classify aggregation in multicomponent biomolecular mixtures.
Load-bearing premise
The weighting functions chosen for the localized and interactive models correctly encode the relevant physical interactions (hydrogen-bonding and molecular size) without introducing artifacts that would alter the reported differences in circle counts or peak positions.
What would settle it
Direct comparison of the reported PRDF first-peak heights at 4 Å and second-peak behavior between 5–10 Å against independently measured experimental radial distribution functions for TMAO and urea solutions at matching concentrations.
Figures
read the original abstract
It has long been observed that trimethylamin N-oxide (TMAO) and urea demonstrate dramatically different properties in a protein folding process. Even with the enormous theoretical and experimental research work of the two osmolytes, various aspects of their underlying mechanisms still remain largely elusive. In this paper, we propose to use the weighted persistent homology to systematically study the osmolytes molecular aggregation and their hydrogen-bonding network from a local topological perspective. We consider two weighted models, i.e., localized persistent homology (LPH) and interactive persistent homology (IPH). From the localized persistent homology models, we have found that TMAO and urea have very different local topology. TMAO shows local network structures. With the concentration increase, the circle elements in these networks show a clear increase in their total numbers and a decrease in their relative sizes. In contrast, urea shows two types of local topological patterns, i.e., local clusters around 6 \AA~ and a few global circle elements at around 12 \AA. From the interactive persistent homology models, it has been found that our persistent radial distribution function (PRDF) from the global-scale IPH has same physical properties as the traditional radial distribution function (RDF). Moreover, PRDFs from the local-scale IPH can also be generated and used to characterize the local interaction information. Other than the clear difference of the first peak value of PRDFs at filtration size 4\AA, TMAO and urea also shows very different behaviors at the second peak region from filtration size 5\AA~ to 10 \AA.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper applies two weighted persistent homology models—localized persistent homology (LPH) and interactive persistent homology (IPH)—to analyze molecular aggregation and hydrogen-bonding networks in TMAO and urea solutions. Using LPH, it reports that TMAO forms local network structures whose circle elements increase in number and decrease in relative size with rising concentration, whereas urea exhibits local clusters near 6 Å and sparse global circles near 12 Å. Using IPH, it introduces a persistent radial distribution function (PRDF) that is stated to recover the same physical properties as the conventional RDF while additionally capturing local interaction details, with noted differences in first-peak values at 4 Å filtration and behavior in the 5–10 Å region.
Significance. If the weighting schemes prove robust, the work offers a topological complement to standard RDF analysis that can distinguish concentration-dependent local network motifs in osmolytes, potentially clarifying mechanisms behind their opposing effects on protein stability. The explicit construction of PRDF from the global-scale IPH filtration provides a direct consistency check with RDF, which is a methodological strength when the underlying distances are preserved.
major comments (1)
- [LPH/IPH model definitions] LPH/IPH model definitions (abstract and methods): the weighting functions that encode hydrogen-bonding and molecular size are introduced without any cross-validation against independent observables such as coordination numbers, experimental RDF peak positions, or unweighted control filtrations. Because the reported distinctions—TMAO circle-count increase versus urea 6 Å clusters and 12 Å global circles, as well as PRDF peak differences—are generated directly by these weights, the absence of such checks makes the physical interpretation load-bearing and currently unsupported.
minor comments (2)
- [Abstract] Abstract: the statements that PRDF “has same physical properties as” RDF and that local-scale PRDFs “can also be generated” are purely qualitative; no quantitative metric (e.g., integrated absolute deviation or peak-position error) is supplied.
- [Abstract] Abstract: no information is given on the number of independent trajectories, system sizes, or concentration sampling used to generate the persistence diagrams, nor are error bars or variability measures reported for circle counts or PRDF peaks.
Simulated Author's Rebuttal
We thank the referee for the constructive review and for noting the potential of our approach as a topological complement to RDF analysis. We address the single major comment below.
read point-by-point responses
-
Referee: [LPH/IPH model definitions] LPH/IPH model definitions (abstract and methods): the weighting functions that encode hydrogen-bonding and molecular size are introduced without any cross-validation against independent observables such as coordination numbers, experimental RDF peak positions, or unweighted control filtrations. Because the reported distinctions—TMAO circle-count increase versus urea 6 Å clusters and 12 Å global circles, as well as PRDF peak differences—are generated directly by these weights, the absence of such checks makes the physical interpretation load-bearing and currently unsupported.
Authors: We thank the referee for highlighting the need for explicit validation of the weighting schemes. The weights are constructed from standard physical parameters in the methods: hydrogen-bonding weights use donor-acceptor distance and angle cutoffs consistent with typical H-bond geometry (~2.8 Å), while molecular-size weights are taken from the atomic radii in the underlying simulation force field. The global-scale IPH PRDF recovering the same physical properties as the conventional RDF (reported in the results) already provides an internal consistency check that the weighting preserves overall distribution features. We nevertheless agree that additional cross-checks against coordination numbers, literature experimental RDF peak positions, and unweighted control filtrations would strengthen the physical grounding of the local distinctions. We will incorporate these comparisons in the revised manuscript (e.g., a new subsection and supplementary figure showing weighted versus unweighted filtrations together with peak-position references). revision: yes
Circularity Check
No significant circularity; empirical application of weighted PH models to simulation data.
full rationale
The paper defines LPH and IPH weighting schemes, applies them to osmolyte trajectory data, and reports observed differences in circle counts, sizes, and PRDF peaks. The statement that global-scale PRDF recovers RDF properties is a post-hoc consistency check on the same distance data rather than a first-principles derivation or fitted prediction. No equations reduce by construction to their inputs, no parameters are fit on a subset then relabeled as predictions, and no load-bearing self-citations or uniqueness theorems appear in the supplied text. The central claims remain independent empirical observations from the chosen models.
Axiom & Free-Parameter Ledger
free parameters (1)
- weighting function parameters
axioms (1)
- domain assumption The underlying molecular dynamics trajectories accurately represent the physical system at the simulated concentrations and temperatures.
Reference graph
Works this paper leans on
-
[1]
Software available at http://www.mrzv.org/software/ dionysus
Dionysus: the persistent homology software. Software available at http://www.mrzv.org/software/ dionysus
-
[2]
M. J. Abraham, T. Murtola, R. Schulz, S. P´ all, J. C. Smith, B. Hess, and E. Lindahl. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX, 1:19–25, 2015
work page 2015
- [3]
- [4]
- [5]
-
[6]
D. Bandyopadhyay, S. Mohan, S. K. Ghosh, and N. Choudhury. Molecular dynamics simulation of aqueous urea solution: is urea a structure breaker? The Journal of Physical Chemistry B, 118(40):11757– 11768, 2014
work page 2014
-
[7]
I. Baskakov and D. W. Bolen. Forcing thermodynamically unfolded proteins to fold. Journal of Biological Chemistry, 273(9):4831–4834, 1998
work page 1998
-
[8]
I. V. Baskakov, R. Kumar, G. Srinivasan, Y. S. Ji, D. W. Bolen, and E. B. Thompson. Trimethylamine N-oxide-induced cooperative folding of an intrinsically unfolded transcription-activating fragment of human glucocorticoid receptor. Journal of Biological Chemistry , 274(16):10693–10696, 1999
work page 1999
-
[9]
P. W. Bates, G. W. Wei, and Shan Zhao. Minimal molecular surfaces and their applications. Journal of Computational Chemistry , 29(3):380–91, 2008
work page 2008
- [10]
-
[11]
G. Bell, A. Lawson, J. Martin, J. Rudzinski, and C. Smyth. Weighted persistent homology. arXiv preprint arXiv:1709.00097, 2017
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[12]
P. Bendich, D. Cohen-Steiner, H. Edelsbrunner, J. Harer, and D. Morozov. Inferring local homology from sampled stratified spaces. In Foundations of Computer Science, 2007. FOCS’07. 48th Annual IEEE Symposium on, pages 536–546. IEEE, 2007. 13
work page 2007
-
[13]
P. Bendich, H. Edelsbrunner, and M. Kerber. Computing robustness and persistence for images. IEEE Transactions on Visualization and Computer Graphics , 16:1251–1260, 2010
work page 2010
-
[14]
P. Bendich, E. Gasparovic, J. Harer, R. Izmailov, and L. Ness. Multi-scale local shape analysis and feature selection in machine learning applications. In Neural Networks (IJCNN), 2015 International Joint Conference on, pages 1–8. IEEE, 2015
work page 2015
-
[15]
P. Bendich, B. Wang, and S. Mukherjee. Local homology transfer and stratification learning. In Pro- ceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms , pages 1355–1370. SIAM, 2012
work page 2012
-
[16]
H. J. C. Berendsen, D. van der Spoel, and R. van Drunen. GROMACS: a message-passing parallel molecular dynamics implementation. Computer physics communications, 91(1-3):43–56, 1995
work page 1995
- [17]
-
[18]
P. Bubenik. Statistical topological data analysis using persistence landscapes. The Journal of Machine Learning Research, 16(1):77–102, 2015
work page 2015
-
[19]
P. Bubenik and P. T. Kim. A statistical approach to persistent homology. Homology, Homotopy and Applications, 19:337–362, 2007
work page 2007
- [20]
-
[21]
Z. X. Cang, L. Mu, and G. W. Wei. Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening. PLoS computational biology, 14(1):e1005929, 2018
work page 2018
-
[22]
Z. X. Cang and G. W. Wei. Analysis and prediction of protein folding energy changes upon mutation by element specific persistent homology. Bioinformatics, 33(22):3549–3557, 2017
work page 2017
-
[23]
Z. X. Cang and G. W. Wei. Integration of element specific persistent homology and machine learning for protein-ligand binding affinity prediction. International journal for numerical methods in biomedical engineering, page 10.1002/cnm.2914, 2017
-
[24]
Z. X. Cang and G. W. Wei. TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions. PLOS Computational Biology , 13(7):e1005690, 2017
work page 2017
- [25]
-
[26]
G. Carlsson, T. Ishkhanov, V. Silva, and A. Zomorodian. On the local behavior of spaces of natural images. International Journal of Computer Vision , 76(1):1–12, 2008
work page 2008
-
[27]
G. Carlsson, A. Zomorodian, A. Collins, and L. J. Guibas. Persistence barcodes for shapes. International Journal of Shape Modeling , 11(2):149–187, 2005
work page 2005
- [28]
-
[29]
T. V. Chalikian and K. J. Breslauer. Thermodynamic analysis of biomolecules: a volumetric approach. Current opinion in structural biology , 8(5):657–664, 1998
work page 1998
- [30]
-
[31]
H. Chintakunta, T. Gentimis, R. Gonzalez-Diaz, M. J. Jimenez, and H. Krim. An entropy-based persis- tence barcode. Pattern Recognition, 48(2):391–401, 2015. 14
work page 2015
-
[32]
J. Choi and M. Cho. Ion aggregation in high salt solutions. II. spectral graph analysis of water hydrogen- bonding network and ion aggregate structures. The Journal of chemical physics , 141(15):154502, 2014
work page 2014
-
[33]
J. Choi and M. Cho. Ion aggregation in high salt solutions. IV. graph-theoretical analyses of ion aggregate structure and water hydrogen bonding network. The Journal of chemical physics , 143(10):104110, 2015
work page 2015
-
[34]
J. Choi and M. Cho. Ion aggregation in high salt solutions. VI. spectral graph analysis of chaotropic ion aggregates. The Journal of chemical physics , 145(17):174501, 2016
work page 2016
-
[35]
J. H. Choi, H. Lee, H. R. Choi, and M. Cho. Graph theory and ion and molecular aggregation in aqueous solutions. Annual review of physical chemistry , 69:125–149, 2018
work page 2018
-
[36]
J. A. B. da Silva, F. G. B. Moreira, V. M. L. dos Santos, and R. L. Longo. On the hydrogen bond networks in the water–methanol mixtures: topology, percolation and small-world. Physical Chemistry Chemical Physics, 13(14):6452–6461, 2011
work page 2011
-
[37]
R. J. M. Dawson. Homology of weighted simplicial complexes. Cahiers de Topologie et G´ eom´ etrie Diff´ erentielle Cat´ egoriques, 31(3):229–243, 1990
work page 1990
-
[38]
B. Di Fabio and C. Landi. A Mayer-Vietoris formula for persistent homology with an application to shape recognition in the presence of occlusions. Foundations of Computational Mathematics, 11:499–527, 2011
work page 2011
-
[39]
V. M. L. dos Santos, F. G. B. Moreira, and R. L. Longo. Topology of the hydrogen bond networks in liquid water at room and supercritical conditions: a small-world structure. Chemical physics letters , 390(1):157–161, 2004
work page 2004
-
[40]
H. Edelsbrunner. Weighted alpha shapes , volume 92. University of Illinois at Urbana-Champaign, Department of Computer Science, 1992
work page 1992
-
[41]
H. Edelsbrunner and P. Koehl. The geometry of biomolecular solvation. Combinatorial and computa- tional geometry, 52:243–275, 2005
work page 2005
-
[42]
H. Edelsbrunner, D. Letscher, and A. Zomorodian. Topological persistence and simplification. Discrete Comput. Geom., 28:511–533, 2002
work page 2002
-
[43]
U. Essmann, L. Perera, M. L. Berkowitz, T. Darden, H. Lee, and L. G. Pedersen. A smooth particle mesh Ewald method. The Journal of chemical physics , 103(19):8577–8593, 1995
work page 1995
-
[44]
B. T. Fasy, J. Kim, F. Lecci, and C. Maria. Introduction to the r package tda. arXiv preprint arXiv:1411.1830, 2014
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[45]
B. T. Fasy and B. Wang. Exploring persistent local homology in topological data analysis. In Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on , pages 6430–6434. IEEE, 2016
work page 2016
-
[46]
X. Feng, K. L. Xia, Y. Y. Tong, and G. W. Wei. Multiscale geometric modeling of macromolecules II: lagrangian representation. Journal of Computational Chemistry , 34:2100–2120, 2013
work page 2013
-
[47]
P. Frosini and C. Landi. Persistent Betti numbers for a noise tolerant shape-based approach to image retrieval. Pattern Recognition Letters, 34(8):863–872, 2013
work page 2013
-
[48]
M. Gameiro, Y. Hiraoka, S. Izumi, M. Kramar, K. Mischaikow, and V. Nanda. Topological measurement of protein compressibility via persistence diagrams. preprint, 2013
work page 2013
-
[49]
P. Ganguly, P. Boserman, N. F. van der Vegt, and J. E. Shea. Trimethylamine N-oxide counteracts urea denaturation by inhibiting protein–urea preferential interaction. Journal of the American Chemical Society, 140(1):483–492, 2017. 15
work page 2017
-
[50]
P. Ganguly, T. Hajari, J. E. Shea, and N. F. A. van der Vegt. Mutual exclusion of urea and trimethy- lamine N-oxide from amino acids in mixed solvent environment.The journal of physical chemistry letters, 6(4):581–585, 2015
work page 2015
-
[51]
P. Ganguly, N. F. van der Vegt, and J. E. Shea. Hydrophobic association in mixed urea–TMAO solutions. The journal of physical chemistry letters , 7(15):3052–3059, 2016
work page 2016
-
[52]
R. Ghrist. Barcodes: the persistent topology of data. Bulletin of the American Mathematical Society , 45(1):61–75, 2008
work page 2008
- [53]
- [54]
-
[55]
B. Hess, H. Bekker, H. J. C. Berendsen, and J. G. E. M. Fraaije. Lincs: a linear constraint solver for molecular simulations. Journal of computational chemistry , 18(12):1463–1472, 1997
work page 1997
-
[56]
Y. Hiraoka, T. Nakamura, A. Hirata, E. G. Escolar, K. Matsue, and Y. Nishiura. Hierarchical structures of amorphous solids characterized by persistent homology. Proceedings of the National Academy of Sciences, 113(26):7035–7040, 2016
work page 2016
-
[57]
D. Horak, S Maletic, and M. Rajkovic. Persistent homology of complex networks. Journal of Statistical Mechanics: Theory and Experiment , 2009(03):P03034, 2009
work page 2009
-
[58]
H. W. Horn, W. C. Swope, J. W. Pitera, J. D. Madura, T. J. Dick, G. L. Hura, and T. Head-Gordon. Development of an improved four-site water model for biomolecular simulations: TIP4P-Ew.The Journal of chemical physics , 120(20):9665–9678, 2004
work page 2004
- [59]
-
[60]
A. Idrissi, M. Gerard, P. Damay, M. Kiselev, Y. Puhovsky, E. Cinar, P. Lagant, and G. Vergoten. The effect of urea on the structure of water: A molecular dynamics simulation. The Journal of Physical Chemistry B, 114(13):4731–4738, 2010
work page 2010
-
[61]
P. M. Kasson, A. Zomorodian, S. Park, N. Singhal, L. J. Guibas, and V. S. Pande. Persistent voids a new structural metric for membrane fusion. Bioinformatics, 23:1753–1759, 2007
work page 2007
-
[62]
K. M. Kast, J. Brickmann, S. M. Kast, and R. S. Berry. Binary phases of aliphatic n-oxides and water: Force field development and molecular dynamics simulation. The Journal of Physical Chemistry A , 107(27):5342–5351, 2003
work page 2003
-
[63]
H Lee, H. Kang, M. K. Chung, B. Kim, and D. S. Lee. Persistent brain network homology from the perspective of dendrogram. Medical Imaging, IEEE Transactions on , 31(12):2267–2277, Dec 2012
work page 2012
-
[64]
Y. T. Liao, A. C. Manson, M. R. DeLyser, W. G. Noid, and P. S. Cremer. Trimethylamine N-oxide stabilizes proteins via a distinct mechanism compared with betaine and glycine. Proceedings of the National Academy of Sciences , 114(10):2479–2484, 2017
work page 2017
-
[65]
X. Liu, Z. Xie, and D. Y. Yi. A fast algorithm for constructing topological structure in large data. Homology, Homotopy and Applications , 14:221–238, 2012
work page 2012
-
[66]
C. Maria. Filtered complexes. In GUDHI User and Reference Manual . GUDHI Editorial Board, 2015
work page 2015
-
[67]
F. Meersman, D. Bowron, A. K. Soper, and M. H. J. Koch. An X-ray and neutron scattering study of the equilibrium between trimethylamine N-oxide and urea in aqueous solution. Physical Chemistry Chemical Physics, 13(30):13765–13771, 2011. 16
work page 2011
-
[68]
Z. Y. Meng, D. V. Anand, Y. P. Lu, J. Wu, and K. L. Xia. Weighted persistent homology for biomolecular data analysis. arXiv preprint arXiv:1903.02890 , 2019
work page internal anchor Pith review Pith/arXiv arXiv 1903
-
[69]
E. Merelli, M. Rucco, P. Sloot, and L. Tesei. Topological characterization of complex systems: Using persistent entropy. Entropy, 17(10):6872–6892, 2015
work page 2015
-
[70]
K. Mischaikow, M Mrozek, J. Reiss, and A. Szymczak. Construction of symbolic dynamics from exper- imental time series. Physical Review Letters, 82:1144–1147, 1999
work page 1999
-
[71]
K. Mischaikow and V. Nanda. Morse theory for filtrations and efficient computation of persistent homology. Discrete and Computational Geometry , 50(2):330–353, 2013
work page 2013
-
[72]
J. R. Munkres. Elements of algebraic topology . CRC Press, 2018
work page 2018
-
[73]
Perseus: the persistent homology software
Vidit Nanda. Perseus: the persistent homology software. Software available at http://www.sas.upenn. edu/~vnanda/perseus
-
[74]
D. D. Nguyen, Z. X. Cang, K. D. Wu, M. L. Wang, Y. Cao, and G. Wei. Wei. Mathematical deep learning for pose and binding affinity prediction and ranking in D3R Grand Challenges. Journal of computer-aided molecular design, 33(1):71–82, 2019
work page 2019
-
[75]
D. D. Nguyen, T. Xiao, M. L. Wang, and G. W. Wei. Rigidity strengthening: A mechanism for protein– ligand binding. Journal of chemical information and modeling , 57(7):1715–1721, 2017
work page 2017
- [76]
-
[77]
A. Oleinikova, N. Smolin, I. Brovchenko, A. Geiger, and Roland Winter. Formation of spanning wa- ter networks on protein surfaces via 2D percolation transition. The journal of physical chemistry B , 109(5):1988–1998, 2005
work page 1988
-
[78]
D. Pachauri, C. Hinrichs, M.K. Chung, S.C. Johnson, and V. Singh. Topology-based kernels with applica- tion to inference problems in alzheimer’s disease. Medical Imaging, IEEE Transactions on, 30(10):1760– 1770, Oct 2011
work page 2011
-
[79]
A. Panuszko, P. Bruzdziak, J. Zielkiewicz, D. Wyrzykowski, and J. Stangret. Effects of urea and trimethylamine-N-oxide on the properties of water and the secondary structure of hen egg white lysozyme. The Journal of Physical Chemistry B , 113(44):14797–14809, 2009
work page 2009
-
[80]
S. Paul and G. N. Patey. Structure and interaction in aqueous urea-trimethylamine-N-oxide solutions. Journal of the American Chemical Society , 129(14):4476–4482, 2007
work page 2007
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.