pith. sign in

arxiv: 2506.08618 · v5 · pith:MYQUONCPnew · submitted 2025-06-10 · 💻 cs.LG · cond-mat.mes-hall· cond-mat.other· cs.AI· cs.CV

HSG-12M: A Large-Scale Benchmark of Spatial Multigraphs from the Energy Spectra of Non-Hermitian Crystals

Pith reviewed 2026-05-21 23:47 UTC · model grok-4.3

classification 💻 cs.LG cond-mat.mes-hallcond-mat.othercs.AIcs.CV
keywords Hamiltonian spectral graphsnon-Hermitian crystalsspatial multigraphstopological fingerprintsalgebra-to-graph linkgraph neural networkscondensed matter physicscharacteristic polynomials
0
0 comments X

The pith

Spectral graphs from non-Hermitian crystal Hamiltonians serve as universal topological fingerprints for polynomials, vectors, and matrices.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents Poly2Graph, an automated pipeline that converts the energy spectra of one-dimensional non-Hermitian crystal Hamiltonians into spatial multigraphs embedded in the complex plane. It uses this tool to release HSG-12M, a dataset of roughly 16.7 million such graphs drawn from 1401 distinct characteristic-polynomial families. The central claim is that these graphs function as unique topological signatures that directly encode algebraic objects, creating a new bridge between polynomial algebra and graph structure while preserving geometric edge distinctions that standard benchmarks discard.

Core claim

Hamiltonian spectral graphs formed by the complex-plane energy spectra of non-Hermitian crystals act as universal topological fingerprints of the underlying characteristic polynomials, vectors, and matrices, thereby establishing a direct algebra-to-graph correspondence that retains multiple geometrically distinct trajectories as separate spatial edges.

What carries the argument

The Poly2Graph pipeline that losslessly maps 1-D crystal Hamiltonians to spatial multigraphs by extracting and connecting features from spectral potential data.

Load-bearing premise

The automated extraction process captures every geometric and topological feature of the spectra without introducing artifacts or selection biases that would change the resulting multigraph structure.

What would settle it

Discovery of two algebraically distinct polynomials or matrices that produce identical spatial multigraphs under the same mapping rules would falsify the universal fingerprint claim.

Figures

Figures reproduced from arXiv: 2506.08618 by Ching Hua Lee, Hakan Akg\"un, Kenji Kawaguchi, N. Duane Loh, Xianquan Yan.

Figure 1
Figure 1. Figure 1: Number of graphs v.s. number of classes in HSG-12M compared to other graph-classification datasets. HSG-12M is the only large-scale multigraph (i.e. unlike sim￾ple graph that only allows one edge between any node pair) dataset, with exceptional class diversity even exceeds all other simple graph datasets. T-HSG-5M holds temporal spatial multigraphs. Table A3 lists comprehensive comparison. In this work, we… view at source ↗
Figure 2
Figure 2. Figure 2: Poly2Graph pipeline. (a) Starting from a 1-D crystal Hamiltonian H(z) in momentum space—or, equivalently, its characteristic polynomial P(z, E) = det[H(z) − EI]. The crystal’s open-boundary spectrum solely depends on P(z, E). (b) The spectral potential Φ(E) (Ronkin function) is computed from the roots of P(z, E) = 0, following recent advances in non-Bloch band theory [81, 83, 84]. (c) The density of states… view at source ↗
read the original abstract

AI is transforming scientific research by revealing new ways to understand complex physical systems, but its impact remains constrained by the lack of large, high-quality domain-specific datasets. A rich, largely untapped resource lies in non-Hermitian quantum physics, where the energy spectra of crystals form intricate geometries on the complex plane -- termed as Hamiltonian spectral graphs. Despite their significance as fingerprints for electronic behavior, their systematic study has been intractable due to the reliance on manual extraction. To unlock this potential, we introduce Poly2Graph: a high-performance, open-source pipeline that automates the mapping of 1-D crystal Hamiltonians to spectral graphs. Using this tool, we present HSG-12M: a dataset containing 11.6 million static and 5.1 million dynamic Hamiltonian spectral graphs across 1401 characteristic-polynomial classes, distilled from 177 TB of spectral potential data. Crucially, HSG-12M is the first large-scale dataset of spatial multigraphs -- graphs embedded in a metric space where multiple geometrically distinct trajectories between two nodes are retained as separate edges. This simultaneously addresses a critical gap, as existing graph benchmarks overwhelmingly assume simple, non-spatial edges, discarding vital geometric information. Benchmarks with popular GNNs expose new challenges in learning spatial multi-edges at scale. Beyond its practical utility, we show that spectral graphs serve as universal topological fingerprints of polynomials, vectors, and matrices, forging a new algebra-to-graph link. HSG-12M lays the groundwork for data-driven scientific discovery in condensed matter physics, new opportunities in geometry-aware graph learning and beyond.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces Poly2Graph, an open-source pipeline for automating the extraction of Hamiltonian spectral graphs (spatial multigraphs) from the energy spectra of non-Hermitian 1-D crystals. It releases the HSG-12M dataset containing 11.6 million static and 5.1 million dynamic graphs across 1401 characteristic-polynomial classes, distilled from 177 TB of spectral data. The work benchmarks popular GNNs on this dataset to highlight challenges in learning spatial multi-edges and claims that spectral graphs serve as universal topological fingerprints of polynomials, vectors, and matrices.

Significance. If the extraction pipeline proves accurate and the fingerprint claim is substantiated, the work supplies a large-scale benchmark for geometry-aware graph learning that retains metric embeddings and multi-edges, addressing a clear gap in existing graph datasets. The scale, open-source release, and potential applications to condensed-matter physics constitute notable strengths for data-driven discovery.

major comments (2)
  1. [§3] §3 (Poly2Graph pipeline description): the pipeline is presented as high-performance and lossless for mapping spectra to spatial multigraphs, yet no quantitative validation, error rates for root extraction or trajectory tracking, or comparison against manual baselines is provided. This directly undermines the artifact-free extraction assumption required for the universal fingerprint claim.
  2. [§5] §5 (universal topological fingerprint claim): the assertion that spectral graphs uniquely encode topological features of characteristic polynomials lacks a formal bijectivity argument or demonstration that distinct polynomials cannot produce isomorphic multigraphs after root clustering and edge retention. Without this, the algebra-to-graph link remains unproven.
minor comments (2)
  1. [Abstract] Abstract: clarify whether the 11.6M static and 5.1M dynamic graphs are disjoint or include overlap, and specify the exact total number of unique graphs.
  2. [§4] Figure captions and §4 (benchmarks): ensure all GNN performance tables include standard deviations across seeds and explicit ablations isolating the effect of spatial multi-edges versus simple-graph baselines.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which help clarify the presentation of the Poly2Graph pipeline and the supporting evidence for our claims. We address each major point below.

read point-by-point responses
  1. Referee: [§3] §3 (Poly2Graph pipeline description): the pipeline is presented as high-performance and lossless for mapping spectra to spatial multigraphs, yet no quantitative validation, error rates for root extraction or trajectory tracking, or comparison against manual baselines is provided. This directly undermines the artifact-free extraction assumption required for the universal fingerprint claim.

    Authors: We agree that explicit quantitative validation strengthens the lossless extraction claim. The pipeline relies on standard, deterministic numerical methods (companion-matrix eigendecomposition for roots and nearest-neighbor sorting for trajectories) whose accuracy is well-established for the polynomial degrees considered. Nevertheless, we will add a dedicated validation subsection in the revised manuscript that reports (i) maximum absolute errors versus high-precision symbolic solvers on a stratified sample of 10,000 polynomials, (ii) trajectory-tracking agreement with manual inspection on representative low-degree cases, and (iii) a comparison against an independent root-clustering baseline. These additions will directly support the artifact-free assumption. revision: yes

  2. Referee: [§5] §5 (universal topological fingerprint claim): the assertion that spectral graphs uniquely encode topological features of characteristic polynomials lacks a formal bijectivity argument or demonstration that distinct polynomials cannot produce isomorphic multigraphs after root clustering and edge retention. Without this, the algebra-to-graph link remains unproven.

    Authors: The mapping is constructed so that each characteristic polynomial determines a unique multiset of roots whose continuous trajectories in the complex plane are retained as distinct multi-edges; this construction is information-preserving by design. We therefore view the resulting spatial multigraph as a faithful topological encoding. We acknowledge, however, that a formal proof of injectivity (i.e., that non-isomorphic polynomials cannot yield isomorphic multigraphs) is not supplied. In revision we will (a) articulate the supporting reasoning from the pipeline’s deterministic construction, (b) report an empirical collision check across all 1401 classes in HSG-12M, and (c) qualify the wording from “universal” to “faithful topological fingerprints” while noting the scope of the current evidence. revision: partial

Circularity Check

0 steps flagged

Dataset construction and benchmarking paper; fingerprint claim asserted via procedural pipeline without reducing to self-fit or self-citation chain

full rationale

The paper centers on introducing the Poly2Graph pipeline to generate the HSG-12M dataset of spatial multigraphs from non-Hermitian crystal spectra, with the universal fingerprint claim presented as an observed consequence of this automated mapping across 1401 polynomial classes. No load-bearing step reduces by the paper's equations or self-citation to a fitted parameter or prior author result that is itself unverified; the construction is procedural and the central result remains an empirical dataset rather than a closed derivation. This yields a minor score for the general self-referential nature of any new pipeline but no significant circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central contribution rests on the assumption that automated spectral-to-graph conversion preserves all relevant topological information and that the resulting multigraphs are faithful representations of the underlying polynomials and matrices.

axioms (1)
  • domain assumption The energy spectra of 1-D non-Hermitian crystal Hamiltonians can be reliably mapped to spatial multigraphs without loss of geometric distinction between trajectories.
    This premise underpins the entire dataset construction and the claim of universal fingerprints.

pith-pipeline@v0.9.0 · 5859 in / 1145 out tokens · 33219 ms · 2026-05-21T23:47:47.754262+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

155 extracted references · 155 canonical work pages · 7 internal anchors

  1. [1]

    Hamilton

    William L. Hamilton. Graph representation learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, 14(3):1–159, 2020

  2. [2]

    Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges

    Michael M. Bronstein, Joan Bruna, Taco Cohen, and Petar Veliˇckovi´c. Geometric deep learning: Grids, groups, graphs, geodesics, and gauges. arXiv preprint arXiv:2104.13478, 2021

  3. [3]

    Deep Learning on Graphs

    Yao Ma and Jiliang Tang. Deep Learning on Graphs. Cambridge University Press, 2021

  4. [4]

    Graph neural networks

    Lingfei Wu, Peng Cui, Jian Pei, Liang Zhao, and Le Song. Graph neural networks. In Graph Neural Networks: Foundations, Frontiers, and Applications, pages 27–37. Springer, 2022

  5. [5]

    Graph neural networks

    Gabriele Corso, Hannes Stark, Stefanie Jegelka, Tommi Jaakkola, and Regina Barzilay. Graph neural networks. Nature Reviews Methods Primers, 4:17, 2024

  6. [6]

    Comparative analysis of feature extraction methods of malware detection

    Smita Ranveer and Swapnaja Hiray. Comparative analysis of feature extraction methods of malware detection. International Journal of Computer Applications, 120:1–7, June 2015

  7. [8]

    A simple yet effective baseline for non-attributed graph classification

    Cai Chen and Yusu Wang. A simple yet effective baseline for non-attributed graph classification. In International Conference on Learning Representations (ICLR), 2019

  8. [9]

    A fair comparison of graph neural networks for graph classification

    Federico Errica, Marco Podda, Davide Bacciu, and Alessio Micheli. A fair comparison of graph neural networks for graph classification. arXiv preprint arXiv:1912.09893, 2019

  9. [10]

    On the necessity of graph kernel baselines

    Till Schulz and Pascal Welke. On the necessity of graph kernel baselines. In ECML-PKDD GEM Workshop, 2019

  10. [11]

    Pitfalls of Graph Neural Network Evaluation

    Oleksandr Shchur, Maximilian Mumme, Aleksandar Bojchevski, and Stephan Günnemann. Pitfalls of graph neural network evaluation. arXiv preprint arXiv:1811.05868, 2018

  11. [12]

    Borgwardt, Cheng Soon Ong, Stefan Schönauer, S.V .N

    Karsten M. Borgwardt, Cheng Soon Ong, Stefan Schönauer, S.V .N. Vishwanathan, Alexander J. Smola, and Hans-Peter Kriegel. Protein function prediction via graph kernels. Bioinformatics (Oxford, England), 21(Suppl 1):i47–i56, 2005

  12. [13]

    Deep graph kernels

    Pinar Yanardag and SVN Vishwanathan. Deep graph kernels. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1365–1374. ACM, 2015

  13. [14]

    Open graph benchmark: Datasets for machine learning on graphs

    Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, and Jure Leskovec. Open graph benchmark: Datasets for machine learning on graphs. In Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020

  14. [15]

    GOOD: A graph out-of-distribution benchmark

    Shurui Gui, Xiner Li, Limei Wang, and Shuiwang Ji. GOOD: A graph out-of-distribution benchmark. In Advances in Neural Information Processing Systems 35 (NeurIPS 2022) Datasets and Benchmarks Track, 2022

  15. [16]

    Long range graph benchmark

    Vijay Dwivedi, Francesco M Bianchi, Feng Yan, and et al. Long range graph benchmark. arXiv preprint arXiv:2007.02839, 2020

  16. [17]

    A large-scale database for graph representation learning, 2021

    Scott Freitas, Yuxiao Dong, Joshua Neil, and Duen Horng Chau. A large-scale database for graph representation learning, 2021

  17. [18]

    PowerGraph: A power grid benchmark dataset for graph neural networks, 2024

    Anna Varbella, Kenza Amara, Blazhe Gjorgiev, and Giovanni Sansavini. PowerGraph: A power grid benchmark dataset for graph neural networks, 2024

  18. [19]

    MBDS: A multi-body dynamics simulation dataset for graph networks simulators

    Sheng Yang, Fengge Wu, and Junsuo Zhao. MBDS: A multi-body dynamics simulation dataset for graph networks simulators. October 2024

  19. [20]

    Ruddigkeit, R

    L. Ruddigkeit, R. van Deursen, L. C. Blum, and J.-L. Reymond. Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17. Journal of Chemical Information and Modeling, 52:2864–2875, 2012

  20. [21]

    Ramakrishnan, P

    R. Ramakrishnan, P. O. Dral, M. Rupp, and O. A. V on Lilienfeld. Quantum chemistry structures and properties of 134 kilo molecules. Scientific Data, 1(1):1–7, 2014

  21. [22]

    J. J. Irwin, T. Sterling, M. M. Mysinger, E. S. Bolstad, and R. G. Coleman. ZINC: A free tool to discover chemistry for biology. Journal of Chemical Information and Modeling, 52(7):1757–1768, 2012

  22. [23]

    W. Jin, K. Yang, R. Barzilay, and T. Jaakkola. Learning multimodal graph-to-graph translation for molecular optimization, 2018

  23. [24]

    Molecular sets (MOSES): A benchmarking platform for molecular generation models

    Daniil Polykovskiy et al. Molecular sets (MOSES): A benchmarking platform for molecular generation models. Frontiers in Pharmacology, 11, 2020. 10

  24. [25]

    ChEMBL: Towards direct deposition of bioassay data

    David Mendez et al. ChEMBL: Towards direct deposition of bioassay data. Nucleic Acids Research, 47(D1):D930–D940, 2019

  25. [26]

    X. Guo, L. Zhao, C. Nowzari, S. Rafatirad, H. Homayoun, and S. M. Dinakarrao. Deep multi-attributed graph translation with node-edge co-evolution. In Proceedings of the 19th International Conference on Data Mining (ICDM), 2019

  26. [27]

    The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules

    Justin Smith, Roman Zubatyuk, Benjamin Nebgen, Nicholas Lubbers, Kipton Barros, Adrian Roitberg, Olexandr Isayev, and Sergei Tretiak. The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules. Scientific Data, 7:134, May 2020

  27. [28]

    Massively Multitask Networks for Drug Discovery

    Bharath Ramsundar, Steven Kearnes, Patrick Riley, Dale Webster, David Konerding, and Vijay Pande. Massively multitask networks for drug discovery. arXiv preprint arXiv:1502.02072, 2015

  28. [29]

    Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data.Journal of Chemical Information and Modeling, 49(2):169– 184, 2009

    Sebastian G Rohrer and Knut Baumann. Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data.Journal of Chemical Information and Modeling, 49(2):169– 184, 2009

  29. [30]

    Xifeng Yan, Hong Cheng, Jiawei Han, and Philip S. Yu. Mining significant graph patterns by leap search. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data , pages 433–444, 2008

  30. [31]

    MoleculeNet: A benchmark for molecular machine learning

    Zhenqin Wu, Bharath Ramsundar, Evan N Feinberg, Joseph Gomes, Caleb Geniesse, Aneesh S Pappu, Karl Leswing, and Vijay Pande. MoleculeNet: A benchmark for molecular machine learning. Chemical Science, 9(2):513–530, 2018

  31. [32]

    Comparison of descriptor spaces for chemical compound retrieval and classification

    Nikil Wale, Ian A Watson, and George Karypis. Comparison of descriptor spaces for chemical compound retrieval and classification. Knowledge and Information Systems, 14(3):347–375, 2008

  32. [33]

    Statisti- cal evaluation of the predictive toxicology challenge 2000–2001

    Hannu Toivonen, Ashwin Srinivasan, Ross D King, Stefan Kramer, and Christoph Helma. Statisti- cal evaluation of the predictive toxicology challenge 2000–2001. Bioinformatics (Oxford, England), 19(10):1183–1193, 2003

  33. [34]

    IAM graph database repository for graph based pattern recognition and machine learning

    Kaspar Riesen and Horst Bunke. IAM graph database repository for graph based pattern recognition and machine learning. In Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR), pages 287–297. Springer, 2008

  34. [35]

    Graph kernels for object category prediction in task-dependent robot grasping

    Marion Neumann, Plinio Moreno, Laura Antanas, Roman Garnett, and Kristian Kersting. Graph kernels for object category prediction in task-dependent robot grasping. In Online Proceedings of the Eleventh Workshop on Mining and Learning with Graphs, pages 0–6, 2013

  35. [36]

    Distinguishing enzyme structures from non-enzymes without alignments

    Paul D Dobson and Andrew J Doig. Distinguishing enzyme structures from non-enzymes without alignments. Journal of Molecular Biology, 330(4):771–783, 2003

  36. [37]

    An API oriented open-source python framework for unsupervised learning on graphs

    Benedek Rozemberczki, Oliver Kiss, and Rik Sarkar. An API oriented open-source python framework for unsupervised learning on graphs. In Proceedings of CIKM, 2020

  37. [38]

    Graphs over time: Densification laws, shrinking diameters and possible explanations

    Jure Leskovec, Jon Kleinberg, and Christos Faloutsos. Graphs over time: Densification laws, shrinking diameters and possible explanations. In Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, pages 177–187, 2005

  38. [39]

    M. W. Weiner, P. S. Aisen, Jr. Jack, C. R., W. J. Jagust, J. Q. Trojanowski, L. Shaw, et al. The Alzheimer’s disease neuroimaging initiative: Progress report and future plans. Alzheimer’s & Dementia, 6(3):202–211, 2010

  39. [40]

    Di Martino, C.-G

    A. Di Martino, C.-G. Yan, Q. Li, E. Denio, F. X. Castellanos, K. Alaerts, et al. The autism brain imaging data exchange: Towards a large-scale evaluation of the intrinsic brain architecture in autism. Molecular Psychiatry, 19(6):659–667, 2014

  40. [41]

    Bayrak, Tyler Derr, Mudassir Shabbir, Daniel Moyer, Catie Chang, and Xenofon Koutsoukos

    Anwar Said, Roza G. Bayrak, Tyler Derr, Mudassir Shabbir, Daniel Moyer, Catie Chang, and Xenofon Koutsoukos. NeuroGraph: Benchmarks for graph machine learning in brain connectomics, 2024

  41. [42]

    The ENIGMA Toolbox: Multiscale neural contextualization of multisite neuroimaging datasets

    Sara Larivière, Casey Paquola, Bo-yong Park, Jessica Royer, Yezhou Wang, Oualid Benkarim, Reinder V os de Wael, Sofie L Valk, Sophia I Thomopoulos, Matthias Kirschner, Lindsay B Lewis, Alan C Evans, Sanjay M Sisodiya, Carrie R McDonald, Paul M Thompson, and Boris C Bernhardt. The ENIGMA Toolbox: Multiscale neural contextualization of multisite neuroimagin...

  42. [43]

    A collection of public transport network data sets for 25 cities

    Rainer Kujala, Christoffer Weckström, Richard Darst, Milos Mladenovic, and Jari Saramäki. A collection of public transport network data sets for 25 cities. Scientific Data, 5:180089, May 2018

  43. [44]

    Street network models and measures for every u.s

    Geoff Boeing. Street network models and measures for every u.s. city, county, urbanized area, census tract, and zillow-defined neighborhood. Urban Science, 3(28), 2019

  44. [45]

    Navigability of intercon- nected networks under random failures

    Manlio De Domenico, Albert Solé-Ribalta, Sergio Gómez, and Alex Arenas. Navigability of intercon- nected networks under random failures. Proceedings of the National Academy of Sciences, 111(23):8351– 8356, 2014

  45. [46]

    A unified spatial multigraph analysis for public transport performance

    Yaoli Wang, Di Zhu, Ganmin Yin, Zhou Huang, and Yu Liu. A unified spatial multigraph analysis for public transport performance. Scientific Reports, 10:9573, 2020. 11

  46. [47]

    Generative modeling for protein structures

    Namrata Anand and Po-Ssu Huang. Generative modeling for protein structures. In Advances in Neural Information Processing Systems (NeurIPS), pages 7505–7516, 2018

  47. [48]

    Generating tertiary protein structures via an interpretative variational autoencoder

    Xiaojie Guo, Sivani Tadepalli, Liang Zhao, and Amarda Shehu. Generating tertiary protein structures via an interpretative variational autoencoder. arXiv preprint, arXiv:2004.07119, 2020

  48. [49]

    Functional brain connectivity is predictable from anatomic network’s laplacian eigenstructure

    Farras Abdelnour, Michael Dayan, and Orrin Devinsky. Functional brain connectivity is predictable from anatomic network’s laplacian eigenstructure. NeuroImage, 172:728–739, 2018

  49. [50]

    Generative adversarial learning of protein tertiary structures

    Taseef Rahman, Yuanqi Du, Liang Zhao, and Amarda Shehu. Generative adversarial learning of protein tertiary structures. Molecules, 26(5):1209, 2021

  50. [52]

    Solé, Sergi Valverde, Pascal Kuntz, and Guy Theraulaz

    Jérôme Buhl, Jacques Gautrais, Nicholas Reeves, Ricard V . Solé, Sergi Valverde, Pascal Kuntz, and Guy Theraulaz. Topological patterns in street networks of self-organized urban settlements. The European Physical Journal B, 49:513–522, 2006

  51. [53]

    Structural properties of planar graphs of urban street patterns

    Alessio Cardillo, Salvatore Scellato, Vito Latora, and Sergio Porta. Structural properties of planar graphs of urban street patterns. Physical Review E, 73(6):066107, 2006

  52. [54]

    A variational autoencoder based generative model of urban human mobility

    Dou Huang, Xuan Song, and Zipei Fan. A variational autoencoder based generative model of urban human mobility. In Proceedings of the IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), pages 425–430. IEEE, 2019

  53. [55]

    West and James H

    Geoffrey B. West and James H. Brown. The origin of allometric scaling laws in biology from genomes to ecosystems: Towards a quantitative unifying theory of biological structure and organization. Journal of Experimental Biology, 208:1575–1592, 2003

  54. [56]

    Fuhrer, Peter Federl, Brendan Lane, Anne-Gaëlle Rolland-Lagan, and Przemys- law Prusinkiewicz

    Adrian Runions, Anne M. Fuhrer, Peter Federl, Brendan Lane, Anne-Gaëlle Rolland-Lagan, and Przemys- law Prusinkiewicz. Modeling and visualization of leaf venation patterns. ACM Transactions on Graphics (TOG), 24(3):702–711, 2005

  55. [57]

    Complex brain networks: graph theoretical analysis of structural and functional systems

    Edward Bullmore and Olaf Sporns. Complex brain networks: graph theoretical analysis of structural and functional systems. Nature Reviews Neuroscience, 10:186–198, 2009

  56. [58]

    Scale-Free Networks

    Guido Caldarelli. Scale-Free Networks. Oxford University Press, Oxford, 2007

  57. [59]

    Fractal River Basins: Chance and Self-Organization

    Ignacio Rodriguez-Iturbe and Andrea Rinaldo. Fractal River Basins: Chance and Self-Organization . Cambridge University Press, Cambridge, 1997

  58. [60]

    Spatial networks

    Marc Barthélemy. Spatial networks. Physics Reports, 499(1-3):1–101, 2011

  59. [61]

    Deep generative models for spatial networks

    Xiaojie Guo, Yuanqi Du, and Liang Zhao. Deep generative models for spatial networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining , KDD ’21, pages 198–208, Virtual Event, Singapore, August 14–18 2021. ACM

  60. [62]

    Machine learning in materials science

    Jing Wei, Xuan Chu, Xiang-Yu Sun, Kun Xu, Hui-Xiong Deng, Jigen Chen, Zhongming Wei, and Ming Lei. Machine learning in materials science. InfoMat, 1(3):338–358, 2019

  61. [63]

    Machine learning for data-driven discovery in solid Earth geoscience

    Karianne Bergen, Paul Johnson, Maarten de Hoop, and Gregory Beroza. Machine learning for data-driven discovery in solid Earth geoscience. Science, 363:eaau0323, March 2019

  62. [64]

    Machine learning and the physical sciences.Reviews of Modern Physics, 91(4):045002, December 2019

    Giuseppe Carleo, Ignacio Cirac, Kyle Cranmer, Laurent Daudet, Maria Schuld, Naftali Tishby, Leslie V ogt-Maranto, and Lenka Zdeborová. Machine learning and the physical sciences.Reviews of Modern Physics, 91(4):045002, December 2019

  63. [65]

    From DFT to machine learning: Recent approaches to materials science–a review

    Gabriel R Schleder, Antonio C M Padilha, Carlos Mera Acosta, Marcio Costa, and Adalberto Fazzio. From DFT to machine learning: Recent approaches to materials science–a review. Journal of Physics: Materials, 2(3):032001, May 2019

  64. [66]

    Machine learning for molecular and materials science

    Keith Butler, Daniel Davies, Hugh Cartwright, Olexandr Isayev, and Aron Walsh. Machine learning for molecular and materials science. Nature, 559, July 2018

  65. [67]

    Espinosa

    Hanxun Jin, Enrui Zhang, and Horacio D. Espinosa. Recent advances and applications of machine learning in experimental solid mechanics: A review. Applied Mechanics Reviews, 75(6):061001, 2023

  66. [68]

    Machine learning in concrete science: Applications, challenges, and best practices

    Zhanzhao Li, Jinyoung Yoon, Rui Zhang, Farshad Rajabipour, Wil III, Ismaila Dabo, and Aleksandra Radli´nska. Machine learning in concrete science: Applications, challenges, and best practices. npj Computational Materials, 8:127, June 2022

  67. [69]

    Neural-network quantum state tomography

    Giacomo Torlai, Guglielmo Mazzola, Juan Carrasquilla, Matthias Troyer, Roger Melko, and Giuseppe Carleo. Neural-network quantum state tomography. Nature Physics, 14, May 2018

  68. [70]

    Dalziel, Drummond Buschman Fielding, Daniel Fortunato, Jared A

    Ruben Ohana, Michael McCabe, Lucas Thibaut Meyer, Rudy Morel, Fruzsina Julia Agocs, Miguel Beneitez, Marsha Berger, Blakesley Burkhart, Stuart B. Dalziel, Drummond Buschman Fielding, Daniel Fortunato, Jared A. Goldberg, Keiya Hirashima, Yan-Fei Jiang, Rich Kerswell, Suryanarayana Maddu, Jonah M. Miller, Payel Mukhopadhyay, Stefan S. Nixon, Jeff Shen, Roma...

  69. [71]

    Galaxy Zoo DECaLS: Detailed visual morphology measurements from volunteers and deep learning for 314 000 galaxies

    Mike Walmsley, Chris Lintott, Tobias Géron, Sandor Kruk, Coleman Krawczyk, Kyle W Willett, Steven Bamford, Lee S Kelvin, Lucy Fortson, Yarin Gal, William Keel, Karen L Masters, Vihang Mehta, Brooke D Simmons, Rebecca Smethurst, Lewis Smith, Elisabeth M Baeten, and Christine Macmillan. Galaxy Zoo DECaLS: Detailed visual morphology measurements from volunte...

  70. [72]

    ATOM3D: Tasks on molecules in three dimensions

    Raphael Townshend, Martin Vögele, Patricia Suriana, Alex Derry, Alexander Powers, Yianni Laloudakis, Sidhika Balachandar, Bowen Jing, Brandon Anderson, Stephan Eismann, Risi Kondor, Russ Altman, and Ron Dror. ATOM3D: Tasks on molecules in three dimensions. In J. Vanschoren and S. Yeung, editors, Proceedings of the Neural Information Processing Systems Tra...

  71. [73]

    PPB-affinity: Protein-protein binding affinity dataset for AI-based protein drug discovery

    H Liu, P Chen, X Zhai, KG Huo, S Zhou, L Han, and G Fan. PPB-affinity: Protein-protein binding affinity dataset for AI-based protein drug discovery. Scientific data, 11(1):1316, December 2024

  72. [74]

    Kinch, R

    Minkyung Baek, Frank DiMaio, Ivan Anishchenko, Justas Dauparas, Sergey Ovchinnikov, Gyu Rie Lee, Jue Wang, Qian Cong, Lisa N. Kinch, R. Dustin Schaeffer, Carla Millán, Heewook Park, Cole Adams, Craig R. Glassman, Andy DeGiovanni, Jose H. Pereira, Andria V . Rodrigues, Alberdina A. van Dijk, Andrea C. Ebrecht, D. J. Opperman, Thomas Sagmeister, Christoph B...

  73. [75]

    AlphaFold Protein Structure Database in 2024: Providing structure coverage for over 214 million protein sequences

    Mihaly Varadi, Damian Bertoni, Paulyna Magana, Urmila Paramval, Ivanna Pidruchna, Malarvizhi Radhakrishnan, Maxim Tsenkov, Sreenath Nair, Milot Mirdita, Jingi Yeo, Oleg Kovalevskiy, Kathryn Tunyasuvunakool, Agata Laydon, Augustin Žídek, Hamish Tomlinson, Dhavanthi Hariharan, Josh Abrahamson, Tim Green, John Jumper, Ewan Birney, Martin Steinegger, Demis Ha...

  74. [76]

    John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Koray Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Charlie Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stig Nikolov, Rishub Jain, Jonas Adler, Tom Back, Stig Petersen, David Reiman, Ellen...

  75. [77]

    Lawrence Zitnick, and Zachary Ulissi

    Lowik Chanussot, Abhishek Das, Siddharth Goyal, Thibaut Lavril, Muhammed Shuaibi, Morgane Riviere, Kevin Tran, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Aini Palizhati, Anuroop Sriram, Brandon Wood, Junwoong Yoon, Devi Parikh, C. Lawrence Zitnick, and Zachary Ulissi. Open catalyst 2020 (oc20) dataset and community challenges. ACS Catalysis, 11(10):6059–6...

  76. [78]

    Merchant, S

    A. Merchant, S. Batzner, Samuel S. Schoenholz, Muratahan Aykol, Gabin Cheon, and Ekin D. Cubuk. Scaling deep learning for materials discovery. Nature, 624(7990):80–85, 2023

  77. [79]

    Pieters, Eric A

    Huan Li, Hao Zheng, Ting Yue, Zhi Xie, Shanshan Yu, Jun Zhou, Tarun Kapri, Yiming Wang, Zhi Cao, Hong Zhao, Akerke Kemelbay, Jie He, Guojing Zhang, Paul F. Pieters, Eric A. Dailing, James R. Cappiello, Miquel Salmeron, Xiaodan Gu, Ting Xu, Peng Wu, Yuzhang Li, Karl B. Sharpless, and Yi Liu. Machine learning-accelerated discovery of heat-resistant polysulf...

  78. [80]

    Kipf, Ethan Fetaya, Kuan-Chieh Wang, Max Welling, and Richard Zemel

    Thomas N. Kipf, Ethan Fetaya, Kuan-Chieh Wang, Max Welling, and Richard Zemel. Neural relational inference for interacting systems. In International Conference on Machine Learning, pages 2688–2697, 2018

  79. [81]

    Zoology of non-Hermitian spectra and their graph topology

    Tommy Tai and Ching Hua Lee. Zoology of non-Hermitian spectra and their graph topology. Physical Review B, 107(22):L220301, June 2023

  80. [82]

    Topological Non-Hermitian skin effect

    Rijia Lin, Tommy Tai, Mengjie Yang, Linhu Li, and Ching Hua Lee. Topological Non-Hermitian skin effect. Frontiers of Physics, 18(5):53605, October 2023

Showing first 80 references.