HSG-12M: A Large-Scale Benchmark of Spatial Multigraphs from the Energy Spectra of Non-Hermitian Crystals

Ching Hua Lee; Hakan Akg\"un; Kenji Kawaguchi; N. Duane Loh; Xianquan Yan

arxiv: 2506.08618 · v5 · pith:MYQUONCPnew · submitted 2025-06-10 · 💻 cs.LG · cond-mat.mes-hall· cond-mat.other· cs.AI· cs.CV

HSG-12M: A Large-Scale Benchmark of Spatial Multigraphs from the Energy Spectra of Non-Hermitian Crystals

Xianquan Yan , Hakan Akg\"un , Kenji Kawaguchi , N. Duane Loh , Ching Hua Lee This is my paper

Pith reviewed 2026-05-21 23:47 UTC · model grok-4.3

classification 💻 cs.LG cond-mat.mes-hallcond-mat.othercs.AIcs.CV

keywords Hamiltonian spectral graphsnon-Hermitian crystalsspatial multigraphstopological fingerprintsalgebra-to-graph linkgraph neural networkscondensed matter physicscharacteristic polynomials

0 comments

The pith

Spectral graphs from non-Hermitian crystal Hamiltonians serve as universal topological fingerprints for polynomials, vectors, and matrices.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents Poly2Graph, an automated pipeline that converts the energy spectra of one-dimensional non-Hermitian crystal Hamiltonians into spatial multigraphs embedded in the complex plane. It uses this tool to release HSG-12M, a dataset of roughly 16.7 million such graphs drawn from 1401 distinct characteristic-polynomial families. The central claim is that these graphs function as unique topological signatures that directly encode algebraic objects, creating a new bridge between polynomial algebra and graph structure while preserving geometric edge distinctions that standard benchmarks discard.

Core claim

Hamiltonian spectral graphs formed by the complex-plane energy spectra of non-Hermitian crystals act as universal topological fingerprints of the underlying characteristic polynomials, vectors, and matrices, thereby establishing a direct algebra-to-graph correspondence that retains multiple geometrically distinct trajectories as separate spatial edges.

What carries the argument

The Poly2Graph pipeline that losslessly maps 1-D crystal Hamiltonians to spatial multigraphs by extracting and connecting features from spectral potential data.

Load-bearing premise

The automated extraction process captures every geometric and topological feature of the spectra without introducing artifacts or selection biases that would change the resulting multigraph structure.

What would settle it

Discovery of two algebraically distinct polynomials or matrices that produce identical spatial multigraphs under the same mapping rules would falsify the universal fingerprint claim.

Figures

Figures reproduced from arXiv: 2506.08618 by Ching Hua Lee, Hakan Akg\"un, Kenji Kawaguchi, N. Duane Loh, Xianquan Yan.

**Figure 1.** Figure 1: Number of graphs v.s. number of classes in HSG-12M compared to other graph-classification datasets. HSG-12M is the only large-scale multigraph (i.e. unlike simple graph that only allows one edge between any node pair) dataset, with exceptional class diversity even exceeds all other simple graph datasets. T-HSG-5M holds temporal spatial multigraphs. Table A3 lists comprehensive comparison. In this work, we… view at source ↗

**Figure 2.** Figure 2: Poly2Graph pipeline. (a) Starting from a 1-D crystal Hamiltonian H(z) in momentum space—or, equivalently, its characteristic polynomial P(z, E) = det[H(z) − EI]. The crystal’s open-boundary spectrum solely depends on P(z, E). (b) The spectral potential Φ(E) (Ronkin function) is computed from the roots of P(z, E) = 0, following recent advances in non-Bloch band theory [81, 83, 84]. (c) The density of states… view at source ↗

read the original abstract

AI is transforming scientific research by revealing new ways to understand complex physical systems, but its impact remains constrained by the lack of large, high-quality domain-specific datasets. A rich, largely untapped resource lies in non-Hermitian quantum physics, where the energy spectra of crystals form intricate geometries on the complex plane -- termed as Hamiltonian spectral graphs. Despite their significance as fingerprints for electronic behavior, their systematic study has been intractable due to the reliance on manual extraction. To unlock this potential, we introduce Poly2Graph: a high-performance, open-source pipeline that automates the mapping of 1-D crystal Hamiltonians to spectral graphs. Using this tool, we present HSG-12M: a dataset containing 11.6 million static and 5.1 million dynamic Hamiltonian spectral graphs across 1401 characteristic-polynomial classes, distilled from 177 TB of spectral potential data. Crucially, HSG-12M is the first large-scale dataset of spatial multigraphs -- graphs embedded in a metric space where multiple geometrically distinct trajectories between two nodes are retained as separate edges. This simultaneously addresses a critical gap, as existing graph benchmarks overwhelmingly assume simple, non-spatial edges, discarding vital geometric information. Benchmarks with popular GNNs expose new challenges in learning spatial multi-edges at scale. Beyond its practical utility, we show that spectral graphs serve as universal topological fingerprints of polynomials, vectors, and matrices, forging a new algebra-to-graph link. HSG-12M lays the groundwork for data-driven scientific discovery in condensed matter physics, new opportunities in geometry-aware graph learning and beyond.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Big dataset of spatial multigraphs from non-Hermitian spectra, but the extraction pipeline lacks the accuracy checks needed to support the universal fingerprint claim.

read the letter

The main thing here is a large new dataset of spatial multigraphs pulled from crystal energy spectra, paired with an automated pipeline, but the work does not show that the conversion from spectra to graphs is accurate or free of artifacts that could change the structure. They built Poly2Graph to process 1-D Hamiltonian characteristic polynomials and turn them into graphs that keep distinct geometric trajectories as separate edges rather than collapsing them. From 177 TB of spectral data they produced 11.6 million static and 5.1 million dynamic graphs across 1401 polynomial classes, then released the code and data. That scale and the decision to retain spatial multi-edges are the genuinely new parts, since most existing graph benchmarks discard geometric distinctions and treat edges as simple connections. The GNN benchmarks they run are straightforward and do show that standard models have trouble with this kind of multi-edge spatial data at scale. Releasing an open-source tool is also practical and lets others test or extend the graphs. The weaker sections are the validation and the broader claims. The pipeline is described in procedural terms, yet there are no quantitative accuracy numbers, no error analysis, and no comparison against manual extraction. The statement that spectral graphs act as universal topological fingerprints for polynomials, vectors, and matrices rests on the assumption that the mapping preserves all relevant invariants without collisions or bias from root clustering or edge selection. Nothing in the paper tests bijectivity or checks whether different inputs could produce isomorphic graphs after processing. That gap makes the algebra-to-graph link more of an observation than a demonstrated result. This paper is for graph ML researchers who want benchmarks that keep spatial and multi-edge structure, or for physicists interested in applying data-driven methods to condensed matter spectra. A reader looking for new challenges in geometry-aware GNNs would get concrete value from the scale and the reported difficulties. It has enough substance as a dataset release to go through peer review, though it will need added validation experiments on the extraction step to be convincing.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces Poly2Graph, an open-source pipeline for automating the extraction of Hamiltonian spectral graphs (spatial multigraphs) from the energy spectra of non-Hermitian 1-D crystals. It releases the HSG-12M dataset containing 11.6 million static and 5.1 million dynamic graphs across 1401 characteristic-polynomial classes, distilled from 177 TB of spectral data. The work benchmarks popular GNNs on this dataset to highlight challenges in learning spatial multi-edges and claims that spectral graphs serve as universal topological fingerprints of polynomials, vectors, and matrices.

Significance. If the extraction pipeline proves accurate and the fingerprint claim is substantiated, the work supplies a large-scale benchmark for geometry-aware graph learning that retains metric embeddings and multi-edges, addressing a clear gap in existing graph datasets. The scale, open-source release, and potential applications to condensed-matter physics constitute notable strengths for data-driven discovery.

major comments (2)

[§3] §3 (Poly2Graph pipeline description): the pipeline is presented as high-performance and lossless for mapping spectra to spatial multigraphs, yet no quantitative validation, error rates for root extraction or trajectory tracking, or comparison against manual baselines is provided. This directly undermines the artifact-free extraction assumption required for the universal fingerprint claim.
[§5] §5 (universal topological fingerprint claim): the assertion that spectral graphs uniquely encode topological features of characteristic polynomials lacks a formal bijectivity argument or demonstration that distinct polynomials cannot produce isomorphic multigraphs after root clustering and edge retention. Without this, the algebra-to-graph link remains unproven.

minor comments (2)

[Abstract] Abstract: clarify whether the 11.6M static and 5.1M dynamic graphs are disjoint or include overlap, and specify the exact total number of unique graphs.
[§4] Figure captions and §4 (benchmarks): ensure all GNN performance tables include standard deviations across seeds and explicit ablations isolating the effect of spatial multi-edges versus simple-graph baselines.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which help clarify the presentation of the Poly2Graph pipeline and the supporting evidence for our claims. We address each major point below.

read point-by-point responses

Referee: [§3] §3 (Poly2Graph pipeline description): the pipeline is presented as high-performance and lossless for mapping spectra to spatial multigraphs, yet no quantitative validation, error rates for root extraction or trajectory tracking, or comparison against manual baselines is provided. This directly undermines the artifact-free extraction assumption required for the universal fingerprint claim.

Authors: We agree that explicit quantitative validation strengthens the lossless extraction claim. The pipeline relies on standard, deterministic numerical methods (companion-matrix eigendecomposition for roots and nearest-neighbor sorting for trajectories) whose accuracy is well-established for the polynomial degrees considered. Nevertheless, we will add a dedicated validation subsection in the revised manuscript that reports (i) maximum absolute errors versus high-precision symbolic solvers on a stratified sample of 10,000 polynomials, (ii) trajectory-tracking agreement with manual inspection on representative low-degree cases, and (iii) a comparison against an independent root-clustering baseline. These additions will directly support the artifact-free assumption. revision: yes
Referee: [§5] §5 (universal topological fingerprint claim): the assertion that spectral graphs uniquely encode topological features of characteristic polynomials lacks a formal bijectivity argument or demonstration that distinct polynomials cannot produce isomorphic multigraphs after root clustering and edge retention. Without this, the algebra-to-graph link remains unproven.

Authors: The mapping is constructed so that each characteristic polynomial determines a unique multiset of roots whose continuous trajectories in the complex plane are retained as distinct multi-edges; this construction is information-preserving by design. We therefore view the resulting spatial multigraph as a faithful topological encoding. We acknowledge, however, that a formal proof of injectivity (i.e., that non-isomorphic polynomials cannot yield isomorphic multigraphs) is not supplied. In revision we will (a) articulate the supporting reasoning from the pipeline’s deterministic construction, (b) report an empirical collision check across all 1401 classes in HSG-12M, and (c) qualify the wording from “universal” to “faithful topological fingerprints” while noting the scope of the current evidence. revision: partial

Circularity Check

0 steps flagged

Dataset construction and benchmarking paper; fingerprint claim asserted via procedural pipeline without reducing to self-fit or self-citation chain

full rationale

The paper centers on introducing the Poly2Graph pipeline to generate the HSG-12M dataset of spatial multigraphs from non-Hermitian crystal spectra, with the universal fingerprint claim presented as an observed consequence of this automated mapping across 1401 polynomial classes. No load-bearing step reduces by the paper's equations or self-citation to a fitted parameter or prior author result that is itself unverified; the construction is procedural and the central result remains an empirical dataset rather than a closed derivation. This yields a minor score for the general self-referential nature of any new pipeline but no significant circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central contribution rests on the assumption that automated spectral-to-graph conversion preserves all relevant topological information and that the resulting multigraphs are faithful representations of the underlying polynomials and matrices.

axioms (1)

domain assumption The energy spectra of 1-D non-Hermitian crystal Hamiltonians can be reliably mapped to spatial multigraphs without loss of geometric distinction between trajectories.
This premise underpins the entire dataset construction and the claim of universal fingerprints.

pith-pipeline@v0.9.0 · 5859 in / 1145 out tokens · 33219 ms · 2026-05-21T23:47:47.754262+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Poly2Graph ... combines algebraic geometry, non-Bloch band theory, and morphological image processing to fully automate spectral graph extraction ... Φ(E) = −log|aq(E)| − Σ log|zi(E)|, ρ(E) = −1/2π ∇²Φ(E)
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

spectral graphs serve as universal topological fingerprints of polynomials, vectors, and matrices

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

155 extracted references · 155 canonical work pages · 7 internal anchors

[1]

Hamilton

William L. Hamilton. Graph representation learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, 14(3):1–159, 2020

work page 2020
[2]

Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges

Michael M. Bronstein, Joan Bruna, Taco Cohen, and Petar Veliˇckovi´c. Geometric deep learning: Grids, groups, graphs, geodesics, and gauges. arXiv preprint arXiv:2104.13478, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2021
[3]

Deep Learning on Graphs

Yao Ma and Jiliang Tang. Deep Learning on Graphs. Cambridge University Press, 2021

work page 2021
[4]

Graph neural networks

Lingfei Wu, Peng Cui, Jian Pei, Liang Zhao, and Le Song. Graph neural networks. In Graph Neural Networks: Foundations, Frontiers, and Applications, pages 27–37. Springer, 2022

work page 2022
[5]

Graph neural networks

Gabriele Corso, Hannes Stark, Stefanie Jegelka, Tommi Jaakkola, and Regina Barzilay. Graph neural networks. Nature Reviews Methods Primers, 4:17, 2024

work page 2024
[6]

Comparative analysis of feature extraction methods of malware detection

Smita Ranveer and Swapnaja Hiray. Comparative analysis of feature extraction methods of malware detection. International Journal of Computer Applications, 120:1–7, June 2015

work page 2015
[8]

A simple yet effective baseline for non-attributed graph classification

Cai Chen and Yusu Wang. A simple yet effective baseline for non-attributed graph classification. In International Conference on Learning Representations (ICLR), 2019

work page 2019
[9]

A fair comparison of graph neural networks for graph classification

Federico Errica, Marco Podda, Davide Bacciu, and Alessio Micheli. A fair comparison of graph neural networks for graph classification. arXiv preprint arXiv:1912.09893, 2019

work page arXiv 1912
[10]

On the necessity of graph kernel baselines

Till Schulz and Pascal Welke. On the necessity of graph kernel baselines. In ECML-PKDD GEM Workshop, 2019

work page 2019
[11]

Pitfalls of Graph Neural Network Evaluation

Oleksandr Shchur, Maximilian Mumme, Aleksandar Bojchevski, and Stephan Günnemann. Pitfalls of graph neural network evaluation. arXiv preprint arXiv:1811.05868, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[12]

Borgwardt, Cheng Soon Ong, Stefan Schönauer, S.V .N

Karsten M. Borgwardt, Cheng Soon Ong, Stefan Schönauer, S.V .N. Vishwanathan, Alexander J. Smola, and Hans-Peter Kriegel. Protein function prediction via graph kernels. Bioinformatics (Oxford, England), 21(Suppl 1):i47–i56, 2005

work page 2005
[13]

Deep graph kernels

Pinar Yanardag and SVN Vishwanathan. Deep graph kernels. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1365–1374. ACM, 2015

work page 2015
[14]

Open graph benchmark: Datasets for machine learning on graphs

Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, and Jure Leskovec. Open graph benchmark: Datasets for machine learning on graphs. In Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020

work page 2020
[15]

GOOD: A graph out-of-distribution benchmark

Shurui Gui, Xiner Li, Limei Wang, and Shuiwang Ji. GOOD: A graph out-of-distribution benchmark. In Advances in Neural Information Processing Systems 35 (NeurIPS 2022) Datasets and Benchmarks Track, 2022

work page 2022
[16]

Long range graph benchmark

Vijay Dwivedi, Francesco M Bianchi, Feng Yan, and et al. Long range graph benchmark. arXiv preprint arXiv:2007.02839, 2020

work page arXiv 2007
[17]

A large-scale database for graph representation learning, 2021

Scott Freitas, Yuxiao Dong, Joshua Neil, and Duen Horng Chau. A large-scale database for graph representation learning, 2021

work page 2021
[18]

PowerGraph: A power grid benchmark dataset for graph neural networks, 2024

Anna Varbella, Kenza Amara, Blazhe Gjorgiev, and Giovanni Sansavini. PowerGraph: A power grid benchmark dataset for graph neural networks, 2024

work page 2024
[19]

MBDS: A multi-body dynamics simulation dataset for graph networks simulators

Sheng Yang, Fengge Wu, and Junsuo Zhao. MBDS: A multi-body dynamics simulation dataset for graph networks simulators. October 2024

work page 2024
[20]

Ruddigkeit, R

L. Ruddigkeit, R. van Deursen, L. C. Blum, and J.-L. Reymond. Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17. Journal of Chemical Information and Modeling, 52:2864–2875, 2012

work page 2012
[21]

Ramakrishnan, P

R. Ramakrishnan, P. O. Dral, M. Rupp, and O. A. V on Lilienfeld. Quantum chemistry structures and properties of 134 kilo molecules. Scientific Data, 1(1):1–7, 2014

work page 2014
[22]

J. J. Irwin, T. Sterling, M. M. Mysinger, E. S. Bolstad, and R. G. Coleman. ZINC: A free tool to discover chemistry for biology. Journal of Chemical Information and Modeling, 52(7):1757–1768, 2012

work page 2012
[23]

W. Jin, K. Yang, R. Barzilay, and T. Jaakkola. Learning multimodal graph-to-graph translation for molecular optimization, 2018

work page 2018
[24]

Molecular sets (MOSES): A benchmarking platform for molecular generation models

Daniil Polykovskiy et al. Molecular sets (MOSES): A benchmarking platform for molecular generation models. Frontiers in Pharmacology, 11, 2020. 10

work page 2020
[25]

ChEMBL: Towards direct deposition of bioassay data

David Mendez et al. ChEMBL: Towards direct deposition of bioassay data. Nucleic Acids Research, 47(D1):D930–D940, 2019

work page 2019
[26]

X. Guo, L. Zhao, C. Nowzari, S. Rafatirad, H. Homayoun, and S. M. Dinakarrao. Deep multi-attributed graph translation with node-edge co-evolution. In Proceedings of the 19th International Conference on Data Mining (ICDM), 2019

work page 2019
[27]

The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules

Justin Smith, Roman Zubatyuk, Benjamin Nebgen, Nicholas Lubbers, Kipton Barros, Adrian Roitberg, Olexandr Isayev, and Sergei Tretiak. The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules. Scientific Data, 7:134, May 2020

work page 2020
[28]

Massively Multitask Networks for Drug Discovery

Bharath Ramsundar, Steven Kearnes, Patrick Riley, Dale Webster, David Konerding, and Vijay Pande. Massively multitask networks for drug discovery. arXiv preprint arXiv:1502.02072, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[29]

Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data.Journal of Chemical Information and Modeling, 49(2):169– 184, 2009

Sebastian G Rohrer and Knut Baumann. Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data.Journal of Chemical Information and Modeling, 49(2):169– 184, 2009

work page 2009
[30]

Xifeng Yan, Hong Cheng, Jiawei Han, and Philip S. Yu. Mining significant graph patterns by leap search. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data , pages 433–444, 2008

work page 2008
[31]

MoleculeNet: A benchmark for molecular machine learning

Zhenqin Wu, Bharath Ramsundar, Evan N Feinberg, Joseph Gomes, Caleb Geniesse, Aneesh S Pappu, Karl Leswing, and Vijay Pande. MoleculeNet: A benchmark for molecular machine learning. Chemical Science, 9(2):513–530, 2018

work page 2018
[32]

Comparison of descriptor spaces for chemical compound retrieval and classification

Nikil Wale, Ian A Watson, and George Karypis. Comparison of descriptor spaces for chemical compound retrieval and classification. Knowledge and Information Systems, 14(3):347–375, 2008

work page 2008
[33]

Statisti- cal evaluation of the predictive toxicology challenge 2000–2001

Hannu Toivonen, Ashwin Srinivasan, Ross D King, Stefan Kramer, and Christoph Helma. Statisti- cal evaluation of the predictive toxicology challenge 2000–2001. Bioinformatics (Oxford, England), 19(10):1183–1193, 2003

work page 2000
[34]

IAM graph database repository for graph based pattern recognition and machine learning

Kaspar Riesen and Horst Bunke. IAM graph database repository for graph based pattern recognition and machine learning. In Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR), pages 287–297. Springer, 2008

work page 2008
[35]

Graph kernels for object category prediction in task-dependent robot grasping

Marion Neumann, Plinio Moreno, Laura Antanas, Roman Garnett, and Kristian Kersting. Graph kernels for object category prediction in task-dependent robot grasping. In Online Proceedings of the Eleventh Workshop on Mining and Learning with Graphs, pages 0–6, 2013

work page 2013
[36]

Distinguishing enzyme structures from non-enzymes without alignments

Paul D Dobson and Andrew J Doig. Distinguishing enzyme structures from non-enzymes without alignments. Journal of Molecular Biology, 330(4):771–783, 2003

work page 2003
[37]

An API oriented open-source python framework for unsupervised learning on graphs

Benedek Rozemberczki, Oliver Kiss, and Rik Sarkar. An API oriented open-source python framework for unsupervised learning on graphs. In Proceedings of CIKM, 2020

work page 2020
[38]

Graphs over time: Densification laws, shrinking diameters and possible explanations

Jure Leskovec, Jon Kleinberg, and Christos Faloutsos. Graphs over time: Densification laws, shrinking diameters and possible explanations. In Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, pages 177–187, 2005

work page 2005
[39]

M. W. Weiner, P. S. Aisen, Jr. Jack, C. R., W. J. Jagust, J. Q. Trojanowski, L. Shaw, et al. The Alzheimer’s disease neuroimaging initiative: Progress report and future plans. Alzheimer’s & Dementia, 6(3):202–211, 2010

work page 2010
[40]

Di Martino, C.-G

A. Di Martino, C.-G. Yan, Q. Li, E. Denio, F. X. Castellanos, K. Alaerts, et al. The autism brain imaging data exchange: Towards a large-scale evaluation of the intrinsic brain architecture in autism. Molecular Psychiatry, 19(6):659–667, 2014

work page 2014
[41]

Bayrak, Tyler Derr, Mudassir Shabbir, Daniel Moyer, Catie Chang, and Xenofon Koutsoukos

Anwar Said, Roza G. Bayrak, Tyler Derr, Mudassir Shabbir, Daniel Moyer, Catie Chang, and Xenofon Koutsoukos. NeuroGraph: Benchmarks for graph machine learning in brain connectomics, 2024

work page 2024
[42]

The ENIGMA Toolbox: Multiscale neural contextualization of multisite neuroimaging datasets

Sara Larivière, Casey Paquola, Bo-yong Park, Jessica Royer, Yezhou Wang, Oualid Benkarim, Reinder V os de Wael, Sofie L Valk, Sophia I Thomopoulos, Matthias Kirschner, Lindsay B Lewis, Alan C Evans, Sanjay M Sisodiya, Carrie R McDonald, Paul M Thompson, and Boris C Bernhardt. The ENIGMA Toolbox: Multiscale neural contextualization of multisite neuroimagin...

work page 2021
[43]

A collection of public transport network data sets for 25 cities

Rainer Kujala, Christoffer Weckström, Richard Darst, Milos Mladenovic, and Jari Saramäki. A collection of public transport network data sets for 25 cities. Scientific Data, 5:180089, May 2018

work page 2018
[44]

Street network models and measures for every u.s

Geoff Boeing. Street network models and measures for every u.s. city, county, urbanized area, census tract, and zillow-defined neighborhood. Urban Science, 3(28), 2019

work page 2019
[45]

Navigability of intercon- nected networks under random failures

Manlio De Domenico, Albert Solé-Ribalta, Sergio Gómez, and Alex Arenas. Navigability of intercon- nected networks under random failures. Proceedings of the National Academy of Sciences, 111(23):8351– 8356, 2014

work page 2014
[46]

A unified spatial multigraph analysis for public transport performance

Yaoli Wang, Di Zhu, Ganmin Yin, Zhou Huang, and Yu Liu. A unified spatial multigraph analysis for public transport performance. Scientific Reports, 10:9573, 2020. 11

work page 2020
[47]

Generative modeling for protein structures

Namrata Anand and Po-Ssu Huang. Generative modeling for protein structures. In Advances in Neural Information Processing Systems (NeurIPS), pages 7505–7516, 2018

work page 2018
[48]

Generating tertiary protein structures via an interpretative variational autoencoder

Xiaojie Guo, Sivani Tadepalli, Liang Zhao, and Amarda Shehu. Generating tertiary protein structures via an interpretative variational autoencoder. arXiv preprint, arXiv:2004.07119, 2020

work page arXiv 2004
[49]

Functional brain connectivity is predictable from anatomic network’s laplacian eigenstructure

Farras Abdelnour, Michael Dayan, and Orrin Devinsky. Functional brain connectivity is predictable from anatomic network’s laplacian eigenstructure. NeuroImage, 172:728–739, 2018

work page 2018
[50]

Generative adversarial learning of protein tertiary structures

Taseef Rahman, Yuanqi Du, Liang Zhao, and Amarda Shehu. Generative adversarial learning of protein tertiary structures. Molecules, 26(5):1209, 2021

work page 2021
[52]

Solé, Sergi Valverde, Pascal Kuntz, and Guy Theraulaz

Jérôme Buhl, Jacques Gautrais, Nicholas Reeves, Ricard V . Solé, Sergi Valverde, Pascal Kuntz, and Guy Theraulaz. Topological patterns in street networks of self-organized urban settlements. The European Physical Journal B, 49:513–522, 2006

work page 2006
[53]

Structural properties of planar graphs of urban street patterns

Alessio Cardillo, Salvatore Scellato, Vito Latora, and Sergio Porta. Structural properties of planar graphs of urban street patterns. Physical Review E, 73(6):066107, 2006

work page 2006
[54]

A variational autoencoder based generative model of urban human mobility

Dou Huang, Xuan Song, and Zipei Fan. A variational autoencoder based generative model of urban human mobility. In Proceedings of the IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), pages 425–430. IEEE, 2019

work page 2019
[55]

West and James H

Geoffrey B. West and James H. Brown. The origin of allometric scaling laws in biology from genomes to ecosystems: Towards a quantitative unifying theory of biological structure and organization. Journal of Experimental Biology, 208:1575–1592, 2003

work page 2003
[56]

Fuhrer, Peter Federl, Brendan Lane, Anne-Gaëlle Rolland-Lagan, and Przemys- law Prusinkiewicz

Adrian Runions, Anne M. Fuhrer, Peter Federl, Brendan Lane, Anne-Gaëlle Rolland-Lagan, and Przemys- law Prusinkiewicz. Modeling and visualization of leaf venation patterns. ACM Transactions on Graphics (TOG), 24(3):702–711, 2005

work page 2005
[57]

Complex brain networks: graph theoretical analysis of structural and functional systems

Edward Bullmore and Olaf Sporns. Complex brain networks: graph theoretical analysis of structural and functional systems. Nature Reviews Neuroscience, 10:186–198, 2009

work page 2009
[58]

Scale-Free Networks

Guido Caldarelli. Scale-Free Networks. Oxford University Press, Oxford, 2007

work page 2007
[59]

Fractal River Basins: Chance and Self-Organization

Ignacio Rodriguez-Iturbe and Andrea Rinaldo. Fractal River Basins: Chance and Self-Organization . Cambridge University Press, Cambridge, 1997

work page 1997
[60]

Spatial networks

Marc Barthélemy. Spatial networks. Physics Reports, 499(1-3):1–101, 2011

work page 2011
[61]

Deep generative models for spatial networks

Xiaojie Guo, Yuanqi Du, and Liang Zhao. Deep generative models for spatial networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining , KDD ’21, pages 198–208, Virtual Event, Singapore, August 14–18 2021. ACM

work page 2021
[62]

Machine learning in materials science

Jing Wei, Xuan Chu, Xiang-Yu Sun, Kun Xu, Hui-Xiong Deng, Jigen Chen, Zhongming Wei, and Ming Lei. Machine learning in materials science. InfoMat, 1(3):338–358, 2019

work page 2019
[63]

Machine learning for data-driven discovery in solid Earth geoscience

Karianne Bergen, Paul Johnson, Maarten de Hoop, and Gregory Beroza. Machine learning for data-driven discovery in solid Earth geoscience. Science, 363:eaau0323, March 2019

work page 2019
[64]

Machine learning and the physical sciences.Reviews of Modern Physics, 91(4):045002, December 2019

Giuseppe Carleo, Ignacio Cirac, Kyle Cranmer, Laurent Daudet, Maria Schuld, Naftali Tishby, Leslie V ogt-Maranto, and Lenka Zdeborová. Machine learning and the physical sciences.Reviews of Modern Physics, 91(4):045002, December 2019

work page 2019
[65]

From DFT to machine learning: Recent approaches to materials science–a review

Gabriel R Schleder, Antonio C M Padilha, Carlos Mera Acosta, Marcio Costa, and Adalberto Fazzio. From DFT to machine learning: Recent approaches to materials science–a review. Journal of Physics: Materials, 2(3):032001, May 2019

work page 2019
[66]

Machine learning for molecular and materials science

Keith Butler, Daniel Davies, Hugh Cartwright, Olexandr Isayev, and Aron Walsh. Machine learning for molecular and materials science. Nature, 559, July 2018

work page 2018
[67]

Espinosa

Hanxun Jin, Enrui Zhang, and Horacio D. Espinosa. Recent advances and applications of machine learning in experimental solid mechanics: A review. Applied Mechanics Reviews, 75(6):061001, 2023

work page 2023
[68]

Machine learning in concrete science: Applications, challenges, and best practices

Zhanzhao Li, Jinyoung Yoon, Rui Zhang, Farshad Rajabipour, Wil III, Ismaila Dabo, and Aleksandra Radli´nska. Machine learning in concrete science: Applications, challenges, and best practices. npj Computational Materials, 8:127, June 2022

work page 2022
[69]

Neural-network quantum state tomography

Giacomo Torlai, Guglielmo Mazzola, Juan Carrasquilla, Matthias Troyer, Roger Melko, and Giuseppe Carleo. Neural-network quantum state tomography. Nature Physics, 14, May 2018

work page 2018
[70]

Dalziel, Drummond Buschman Fielding, Daniel Fortunato, Jared A

Ruben Ohana, Michael McCabe, Lucas Thibaut Meyer, Rudy Morel, Fruzsina Julia Agocs, Miguel Beneitez, Marsha Berger, Blakesley Burkhart, Stuart B. Dalziel, Drummond Buschman Fielding, Daniel Fortunato, Jared A. Goldberg, Keiya Hirashima, Yan-Fei Jiang, Rich Kerswell, Suryanarayana Maddu, Jonah M. Miller, Payel Mukhopadhyay, Stefan S. Nixon, Jeff Shen, Roma...

work page 2024
[71]

Galaxy Zoo DECaLS: Detailed visual morphology measurements from volunteers and deep learning for 314 000 galaxies

Mike Walmsley, Chris Lintott, Tobias Géron, Sandor Kruk, Coleman Krawczyk, Kyle W Willett, Steven Bamford, Lee S Kelvin, Lucy Fortson, Yarin Gal, William Keel, Karen L Masters, Vihang Mehta, Brooke D Simmons, Rebecca Smethurst, Lewis Smith, Elisabeth M Baeten, and Christine Macmillan. Galaxy Zoo DECaLS: Detailed visual morphology measurements from volunte...

work page 2021
[72]

ATOM3D: Tasks on molecules in three dimensions

Raphael Townshend, Martin Vögele, Patricia Suriana, Alex Derry, Alexander Powers, Yianni Laloudakis, Sidhika Balachandar, Bowen Jing, Brandon Anderson, Stephan Eismann, Risi Kondor, Russ Altman, and Ron Dror. ATOM3D: Tasks on molecules in three dimensions. In J. Vanschoren and S. Yeung, editors, Proceedings of the Neural Information Processing Systems Tra...

work page 2021
[73]

PPB-affinity: Protein-protein binding affinity dataset for AI-based protein drug discovery

H Liu, P Chen, X Zhai, KG Huo, S Zhou, L Han, and G Fan. PPB-affinity: Protein-protein binding affinity dataset for AI-based protein drug discovery. Scientific data, 11(1):1316, December 2024

work page 2024
[74]

Kinch, R

Minkyung Baek, Frank DiMaio, Ivan Anishchenko, Justas Dauparas, Sergey Ovchinnikov, Gyu Rie Lee, Jue Wang, Qian Cong, Lisa N. Kinch, R. Dustin Schaeffer, Carla Millán, Heewook Park, Cole Adams, Craig R. Glassman, Andy DeGiovanni, Jose H. Pereira, Andria V . Rodrigues, Alberdina A. van Dijk, Andrea C. Ebrecht, D. J. Opperman, Thomas Sagmeister, Christoph B...

work page 2021
[75]

AlphaFold Protein Structure Database in 2024: Providing structure coverage for over 214 million protein sequences

Mihaly Varadi, Damian Bertoni, Paulyna Magana, Urmila Paramval, Ivanna Pidruchna, Malarvizhi Radhakrishnan, Maxim Tsenkov, Sreenath Nair, Milot Mirdita, Jingi Yeo, Oleg Kovalevskiy, Kathryn Tunyasuvunakool, Agata Laydon, Augustin Žídek, Hamish Tomlinson, Dhavanthi Hariharan, Josh Abrahamson, Tim Green, John Jumper, Ewan Birney, Martin Steinegger, Demis Ha...

work page 2024
[76]

John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Koray Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Charlie Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stig Nikolov, Rishub Jain, Jonas Adler, Tom Back, Stig Petersen, David Reiman, Ellen...

work page 2021
[77]

Lawrence Zitnick, and Zachary Ulissi

Lowik Chanussot, Abhishek Das, Siddharth Goyal, Thibaut Lavril, Muhammed Shuaibi, Morgane Riviere, Kevin Tran, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Aini Palizhati, Anuroop Sriram, Brandon Wood, Junwoong Yoon, Devi Parikh, C. Lawrence Zitnick, and Zachary Ulissi. Open catalyst 2020 (oc20) dataset and community challenges. ACS Catalysis, 11(10):6059–6...

work page 2020
[78]

Merchant, S

A. Merchant, S. Batzner, Samuel S. Schoenholz, Muratahan Aykol, Gabin Cheon, and Ekin D. Cubuk. Scaling deep learning for materials discovery. Nature, 624(7990):80–85, 2023

work page 2023
[79]

Pieters, Eric A

Huan Li, Hao Zheng, Ting Yue, Zhi Xie, Shanshan Yu, Jun Zhou, Tarun Kapri, Yiming Wang, Zhi Cao, Hong Zhao, Akerke Kemelbay, Jie He, Guojing Zhang, Paul F. Pieters, Eric A. Dailing, James R. Cappiello, Miquel Salmeron, Xiaodan Gu, Ting Xu, Peng Wu, Yuzhang Li, Karl B. Sharpless, and Yi Liu. Machine learning-accelerated discovery of heat-resistant polysulf...

work page 2025
[80]

Kipf, Ethan Fetaya, Kuan-Chieh Wang, Max Welling, and Richard Zemel

Thomas N. Kipf, Ethan Fetaya, Kuan-Chieh Wang, Max Welling, and Richard Zemel. Neural relational inference for interacting systems. In International Conference on Machine Learning, pages 2688–2697, 2018

work page 2018
[81]

Zoology of non-Hermitian spectra and their graph topology

Tommy Tai and Ching Hua Lee. Zoology of non-Hermitian spectra and their graph topology. Physical Review B, 107(22):L220301, June 2023

work page 2023
[82]

Topological Non-Hermitian skin effect

Rijia Lin, Tommy Tai, Mengjie Yang, Linhu Li, and Ching Hua Lee. Topological Non-Hermitian skin effect. Frontiers of Physics, 18(5):53605, October 2023

work page 2023

Showing first 80 references.

[1] [1]

Hamilton

William L. Hamilton. Graph representation learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, 14(3):1–159, 2020

work page 2020

[2] [2]

Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges

Michael M. Bronstein, Joan Bruna, Taco Cohen, and Petar Veliˇckovi´c. Geometric deep learning: Grids, groups, graphs, geodesics, and gauges. arXiv preprint arXiv:2104.13478, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2021

[3] [3]

Deep Learning on Graphs

Yao Ma and Jiliang Tang. Deep Learning on Graphs. Cambridge University Press, 2021

work page 2021

[4] [4]

Graph neural networks

Lingfei Wu, Peng Cui, Jian Pei, Liang Zhao, and Le Song. Graph neural networks. In Graph Neural Networks: Foundations, Frontiers, and Applications, pages 27–37. Springer, 2022

work page 2022

[5] [5]

Graph neural networks

Gabriele Corso, Hannes Stark, Stefanie Jegelka, Tommi Jaakkola, and Regina Barzilay. Graph neural networks. Nature Reviews Methods Primers, 4:17, 2024

work page 2024

[6] [6]

Comparative analysis of feature extraction methods of malware detection

Smita Ranveer and Swapnaja Hiray. Comparative analysis of feature extraction methods of malware detection. International Journal of Computer Applications, 120:1–7, June 2015

work page 2015

[7] [8]

A simple yet effective baseline for non-attributed graph classification

Cai Chen and Yusu Wang. A simple yet effective baseline for non-attributed graph classification. In International Conference on Learning Representations (ICLR), 2019

work page 2019

[8] [9]

A fair comparison of graph neural networks for graph classification

Federico Errica, Marco Podda, Davide Bacciu, and Alessio Micheli. A fair comparison of graph neural networks for graph classification. arXiv preprint arXiv:1912.09893, 2019

work page arXiv 1912

[9] [10]

On the necessity of graph kernel baselines

Till Schulz and Pascal Welke. On the necessity of graph kernel baselines. In ECML-PKDD GEM Workshop, 2019

work page 2019

[10] [11]

Pitfalls of Graph Neural Network Evaluation

Oleksandr Shchur, Maximilian Mumme, Aleksandar Bojchevski, and Stephan Günnemann. Pitfalls of graph neural network evaluation. arXiv preprint arXiv:1811.05868, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[11] [12]

Borgwardt, Cheng Soon Ong, Stefan Schönauer, S.V .N

Karsten M. Borgwardt, Cheng Soon Ong, Stefan Schönauer, S.V .N. Vishwanathan, Alexander J. Smola, and Hans-Peter Kriegel. Protein function prediction via graph kernels. Bioinformatics (Oxford, England), 21(Suppl 1):i47–i56, 2005

work page 2005

[12] [13]

Deep graph kernels

Pinar Yanardag and SVN Vishwanathan. Deep graph kernels. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1365–1374. ACM, 2015

work page 2015

[13] [14]

Open graph benchmark: Datasets for machine learning on graphs

Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, and Jure Leskovec. Open graph benchmark: Datasets for machine learning on graphs. In Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020

work page 2020

[14] [15]

GOOD: A graph out-of-distribution benchmark

Shurui Gui, Xiner Li, Limei Wang, and Shuiwang Ji. GOOD: A graph out-of-distribution benchmark. In Advances in Neural Information Processing Systems 35 (NeurIPS 2022) Datasets and Benchmarks Track, 2022

work page 2022

[15] [16]

Long range graph benchmark

Vijay Dwivedi, Francesco M Bianchi, Feng Yan, and et al. Long range graph benchmark. arXiv preprint arXiv:2007.02839, 2020

work page arXiv 2007

[16] [17]

A large-scale database for graph representation learning, 2021

Scott Freitas, Yuxiao Dong, Joshua Neil, and Duen Horng Chau. A large-scale database for graph representation learning, 2021

work page 2021

[17] [18]

PowerGraph: A power grid benchmark dataset for graph neural networks, 2024

Anna Varbella, Kenza Amara, Blazhe Gjorgiev, and Giovanni Sansavini. PowerGraph: A power grid benchmark dataset for graph neural networks, 2024

work page 2024

[18] [19]

MBDS: A multi-body dynamics simulation dataset for graph networks simulators

Sheng Yang, Fengge Wu, and Junsuo Zhao. MBDS: A multi-body dynamics simulation dataset for graph networks simulators. October 2024

work page 2024

[19] [20]

Ruddigkeit, R

L. Ruddigkeit, R. van Deursen, L. C. Blum, and J.-L. Reymond. Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17. Journal of Chemical Information and Modeling, 52:2864–2875, 2012

work page 2012

[20] [21]

Ramakrishnan, P

R. Ramakrishnan, P. O. Dral, M. Rupp, and O. A. V on Lilienfeld. Quantum chemistry structures and properties of 134 kilo molecules. Scientific Data, 1(1):1–7, 2014

work page 2014

[21] [22]

J. J. Irwin, T. Sterling, M. M. Mysinger, E. S. Bolstad, and R. G. Coleman. ZINC: A free tool to discover chemistry for biology. Journal of Chemical Information and Modeling, 52(7):1757–1768, 2012

work page 2012

[22] [23]

W. Jin, K. Yang, R. Barzilay, and T. Jaakkola. Learning multimodal graph-to-graph translation for molecular optimization, 2018

work page 2018

[23] [24]

Molecular sets (MOSES): A benchmarking platform for molecular generation models

Daniil Polykovskiy et al. Molecular sets (MOSES): A benchmarking platform for molecular generation models. Frontiers in Pharmacology, 11, 2020. 10

work page 2020

[24] [25]

ChEMBL: Towards direct deposition of bioassay data

David Mendez et al. ChEMBL: Towards direct deposition of bioassay data. Nucleic Acids Research, 47(D1):D930–D940, 2019

work page 2019

[25] [26]

X. Guo, L. Zhao, C. Nowzari, S. Rafatirad, H. Homayoun, and S. M. Dinakarrao. Deep multi-attributed graph translation with node-edge co-evolution. In Proceedings of the 19th International Conference on Data Mining (ICDM), 2019

work page 2019

[26] [27]

The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules

Justin Smith, Roman Zubatyuk, Benjamin Nebgen, Nicholas Lubbers, Kipton Barros, Adrian Roitberg, Olexandr Isayev, and Sergei Tretiak. The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules. Scientific Data, 7:134, May 2020

work page 2020

[27] [28]

Massively Multitask Networks for Drug Discovery

Bharath Ramsundar, Steven Kearnes, Patrick Riley, Dale Webster, David Konerding, and Vijay Pande. Massively multitask networks for drug discovery. arXiv preprint arXiv:1502.02072, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[28] [29]

Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data.Journal of Chemical Information and Modeling, 49(2):169– 184, 2009

Sebastian G Rohrer and Knut Baumann. Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data.Journal of Chemical Information and Modeling, 49(2):169– 184, 2009

work page 2009

[29] [30]

Xifeng Yan, Hong Cheng, Jiawei Han, and Philip S. Yu. Mining significant graph patterns by leap search. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data , pages 433–444, 2008

work page 2008

[30] [31]

MoleculeNet: A benchmark for molecular machine learning

Zhenqin Wu, Bharath Ramsundar, Evan N Feinberg, Joseph Gomes, Caleb Geniesse, Aneesh S Pappu, Karl Leswing, and Vijay Pande. MoleculeNet: A benchmark for molecular machine learning. Chemical Science, 9(2):513–530, 2018

work page 2018

[31] [32]

Comparison of descriptor spaces for chemical compound retrieval and classification

Nikil Wale, Ian A Watson, and George Karypis. Comparison of descriptor spaces for chemical compound retrieval and classification. Knowledge and Information Systems, 14(3):347–375, 2008

work page 2008

[32] [33]

Statisti- cal evaluation of the predictive toxicology challenge 2000–2001

Hannu Toivonen, Ashwin Srinivasan, Ross D King, Stefan Kramer, and Christoph Helma. Statisti- cal evaluation of the predictive toxicology challenge 2000–2001. Bioinformatics (Oxford, England), 19(10):1183–1193, 2003

work page 2000

[33] [34]

IAM graph database repository for graph based pattern recognition and machine learning

Kaspar Riesen and Horst Bunke. IAM graph database repository for graph based pattern recognition and machine learning. In Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR), pages 287–297. Springer, 2008

work page 2008

[34] [35]

Graph kernels for object category prediction in task-dependent robot grasping

Marion Neumann, Plinio Moreno, Laura Antanas, Roman Garnett, and Kristian Kersting. Graph kernels for object category prediction in task-dependent robot grasping. In Online Proceedings of the Eleventh Workshop on Mining and Learning with Graphs, pages 0–6, 2013

work page 2013

[35] [36]

Distinguishing enzyme structures from non-enzymes without alignments

Paul D Dobson and Andrew J Doig. Distinguishing enzyme structures from non-enzymes without alignments. Journal of Molecular Biology, 330(4):771–783, 2003

work page 2003

[36] [37]

An API oriented open-source python framework for unsupervised learning on graphs

Benedek Rozemberczki, Oliver Kiss, and Rik Sarkar. An API oriented open-source python framework for unsupervised learning on graphs. In Proceedings of CIKM, 2020

work page 2020

[37] [38]

Graphs over time: Densification laws, shrinking diameters and possible explanations

Jure Leskovec, Jon Kleinberg, and Christos Faloutsos. Graphs over time: Densification laws, shrinking diameters and possible explanations. In Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, pages 177–187, 2005

work page 2005

[38] [39]

M. W. Weiner, P. S. Aisen, Jr. Jack, C. R., W. J. Jagust, J. Q. Trojanowski, L. Shaw, et al. The Alzheimer’s disease neuroimaging initiative: Progress report and future plans. Alzheimer’s & Dementia, 6(3):202–211, 2010

work page 2010

[39] [40]

Di Martino, C.-G

A. Di Martino, C.-G. Yan, Q. Li, E. Denio, F. X. Castellanos, K. Alaerts, et al. The autism brain imaging data exchange: Towards a large-scale evaluation of the intrinsic brain architecture in autism. Molecular Psychiatry, 19(6):659–667, 2014

work page 2014

[40] [41]

Bayrak, Tyler Derr, Mudassir Shabbir, Daniel Moyer, Catie Chang, and Xenofon Koutsoukos

Anwar Said, Roza G. Bayrak, Tyler Derr, Mudassir Shabbir, Daniel Moyer, Catie Chang, and Xenofon Koutsoukos. NeuroGraph: Benchmarks for graph machine learning in brain connectomics, 2024

work page 2024

[41] [42]

The ENIGMA Toolbox: Multiscale neural contextualization of multisite neuroimaging datasets

Sara Larivière, Casey Paquola, Bo-yong Park, Jessica Royer, Yezhou Wang, Oualid Benkarim, Reinder V os de Wael, Sofie L Valk, Sophia I Thomopoulos, Matthias Kirschner, Lindsay B Lewis, Alan C Evans, Sanjay M Sisodiya, Carrie R McDonald, Paul M Thompson, and Boris C Bernhardt. The ENIGMA Toolbox: Multiscale neural contextualization of multisite neuroimagin...

work page 2021

[42] [43]

A collection of public transport network data sets for 25 cities

Rainer Kujala, Christoffer Weckström, Richard Darst, Milos Mladenovic, and Jari Saramäki. A collection of public transport network data sets for 25 cities. Scientific Data, 5:180089, May 2018

work page 2018

[43] [44]

Street network models and measures for every u.s

Geoff Boeing. Street network models and measures for every u.s. city, county, urbanized area, census tract, and zillow-defined neighborhood. Urban Science, 3(28), 2019

work page 2019

[44] [45]

Navigability of intercon- nected networks under random failures

Manlio De Domenico, Albert Solé-Ribalta, Sergio Gómez, and Alex Arenas. Navigability of intercon- nected networks under random failures. Proceedings of the National Academy of Sciences, 111(23):8351– 8356, 2014

work page 2014

[45] [46]

A unified spatial multigraph analysis for public transport performance

Yaoli Wang, Di Zhu, Ganmin Yin, Zhou Huang, and Yu Liu. A unified spatial multigraph analysis for public transport performance. Scientific Reports, 10:9573, 2020. 11

work page 2020

[46] [47]

Generative modeling for protein structures

Namrata Anand and Po-Ssu Huang. Generative modeling for protein structures. In Advances in Neural Information Processing Systems (NeurIPS), pages 7505–7516, 2018

work page 2018

[47] [48]

Generating tertiary protein structures via an interpretative variational autoencoder

Xiaojie Guo, Sivani Tadepalli, Liang Zhao, and Amarda Shehu. Generating tertiary protein structures via an interpretative variational autoencoder. arXiv preprint, arXiv:2004.07119, 2020

work page arXiv 2004

[48] [49]

Functional brain connectivity is predictable from anatomic network’s laplacian eigenstructure

Farras Abdelnour, Michael Dayan, and Orrin Devinsky. Functional brain connectivity is predictable from anatomic network’s laplacian eigenstructure. NeuroImage, 172:728–739, 2018

work page 2018

[49] [50]

Generative adversarial learning of protein tertiary structures

Taseef Rahman, Yuanqi Du, Liang Zhao, and Amarda Shehu. Generative adversarial learning of protein tertiary structures. Molecules, 26(5):1209, 2021

work page 2021

[50] [52]

Solé, Sergi Valverde, Pascal Kuntz, and Guy Theraulaz

Jérôme Buhl, Jacques Gautrais, Nicholas Reeves, Ricard V . Solé, Sergi Valverde, Pascal Kuntz, and Guy Theraulaz. Topological patterns in street networks of self-organized urban settlements. The European Physical Journal B, 49:513–522, 2006

work page 2006

[51] [53]

Structural properties of planar graphs of urban street patterns

Alessio Cardillo, Salvatore Scellato, Vito Latora, and Sergio Porta. Structural properties of planar graphs of urban street patterns. Physical Review E, 73(6):066107, 2006

work page 2006

[52] [54]

A variational autoencoder based generative model of urban human mobility

Dou Huang, Xuan Song, and Zipei Fan. A variational autoencoder based generative model of urban human mobility. In Proceedings of the IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), pages 425–430. IEEE, 2019

work page 2019

[53] [55]

West and James H

Geoffrey B. West and James H. Brown. The origin of allometric scaling laws in biology from genomes to ecosystems: Towards a quantitative unifying theory of biological structure and organization. Journal of Experimental Biology, 208:1575–1592, 2003

work page 2003

[54] [56]

Fuhrer, Peter Federl, Brendan Lane, Anne-Gaëlle Rolland-Lagan, and Przemys- law Prusinkiewicz

Adrian Runions, Anne M. Fuhrer, Peter Federl, Brendan Lane, Anne-Gaëlle Rolland-Lagan, and Przemys- law Prusinkiewicz. Modeling and visualization of leaf venation patterns. ACM Transactions on Graphics (TOG), 24(3):702–711, 2005

work page 2005

[55] [57]

Complex brain networks: graph theoretical analysis of structural and functional systems

Edward Bullmore and Olaf Sporns. Complex brain networks: graph theoretical analysis of structural and functional systems. Nature Reviews Neuroscience, 10:186–198, 2009

work page 2009

[56] [58]

Scale-Free Networks

Guido Caldarelli. Scale-Free Networks. Oxford University Press, Oxford, 2007

work page 2007

[57] [59]

Fractal River Basins: Chance and Self-Organization

Ignacio Rodriguez-Iturbe and Andrea Rinaldo. Fractal River Basins: Chance and Self-Organization . Cambridge University Press, Cambridge, 1997

work page 1997

[58] [60]

Spatial networks

Marc Barthélemy. Spatial networks. Physics Reports, 499(1-3):1–101, 2011

work page 2011

[59] [61]

Deep generative models for spatial networks

Xiaojie Guo, Yuanqi Du, and Liang Zhao. Deep generative models for spatial networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining , KDD ’21, pages 198–208, Virtual Event, Singapore, August 14–18 2021. ACM

work page 2021

[60] [62]

Machine learning in materials science

Jing Wei, Xuan Chu, Xiang-Yu Sun, Kun Xu, Hui-Xiong Deng, Jigen Chen, Zhongming Wei, and Ming Lei. Machine learning in materials science. InfoMat, 1(3):338–358, 2019

work page 2019

[61] [63]

Machine learning for data-driven discovery in solid Earth geoscience

Karianne Bergen, Paul Johnson, Maarten de Hoop, and Gregory Beroza. Machine learning for data-driven discovery in solid Earth geoscience. Science, 363:eaau0323, March 2019

work page 2019

[62] [64]

Machine learning and the physical sciences.Reviews of Modern Physics, 91(4):045002, December 2019

Giuseppe Carleo, Ignacio Cirac, Kyle Cranmer, Laurent Daudet, Maria Schuld, Naftali Tishby, Leslie V ogt-Maranto, and Lenka Zdeborová. Machine learning and the physical sciences.Reviews of Modern Physics, 91(4):045002, December 2019

work page 2019

[63] [65]

From DFT to machine learning: Recent approaches to materials science–a review

Gabriel R Schleder, Antonio C M Padilha, Carlos Mera Acosta, Marcio Costa, and Adalberto Fazzio. From DFT to machine learning: Recent approaches to materials science–a review. Journal of Physics: Materials, 2(3):032001, May 2019

work page 2019

[64] [66]

Machine learning for molecular and materials science

Keith Butler, Daniel Davies, Hugh Cartwright, Olexandr Isayev, and Aron Walsh. Machine learning for molecular and materials science. Nature, 559, July 2018

work page 2018

[65] [67]

Espinosa

Hanxun Jin, Enrui Zhang, and Horacio D. Espinosa. Recent advances and applications of machine learning in experimental solid mechanics: A review. Applied Mechanics Reviews, 75(6):061001, 2023

work page 2023

[66] [68]

Machine learning in concrete science: Applications, challenges, and best practices

Zhanzhao Li, Jinyoung Yoon, Rui Zhang, Farshad Rajabipour, Wil III, Ismaila Dabo, and Aleksandra Radli´nska. Machine learning in concrete science: Applications, challenges, and best practices. npj Computational Materials, 8:127, June 2022

work page 2022

[67] [69]

Neural-network quantum state tomography

Giacomo Torlai, Guglielmo Mazzola, Juan Carrasquilla, Matthias Troyer, Roger Melko, and Giuseppe Carleo. Neural-network quantum state tomography. Nature Physics, 14, May 2018

work page 2018

[68] [70]

Dalziel, Drummond Buschman Fielding, Daniel Fortunato, Jared A

Ruben Ohana, Michael McCabe, Lucas Thibaut Meyer, Rudy Morel, Fruzsina Julia Agocs, Miguel Beneitez, Marsha Berger, Blakesley Burkhart, Stuart B. Dalziel, Drummond Buschman Fielding, Daniel Fortunato, Jared A. Goldberg, Keiya Hirashima, Yan-Fei Jiang, Rich Kerswell, Suryanarayana Maddu, Jonah M. Miller, Payel Mukhopadhyay, Stefan S. Nixon, Jeff Shen, Roma...

work page 2024

[69] [71]

Galaxy Zoo DECaLS: Detailed visual morphology measurements from volunteers and deep learning for 314 000 galaxies

Mike Walmsley, Chris Lintott, Tobias Géron, Sandor Kruk, Coleman Krawczyk, Kyle W Willett, Steven Bamford, Lee S Kelvin, Lucy Fortson, Yarin Gal, William Keel, Karen L Masters, Vihang Mehta, Brooke D Simmons, Rebecca Smethurst, Lewis Smith, Elisabeth M Baeten, and Christine Macmillan. Galaxy Zoo DECaLS: Detailed visual morphology measurements from volunte...

work page 2021

[70] [72]

ATOM3D: Tasks on molecules in three dimensions

Raphael Townshend, Martin Vögele, Patricia Suriana, Alex Derry, Alexander Powers, Yianni Laloudakis, Sidhika Balachandar, Bowen Jing, Brandon Anderson, Stephan Eismann, Risi Kondor, Russ Altman, and Ron Dror. ATOM3D: Tasks on molecules in three dimensions. In J. Vanschoren and S. Yeung, editors, Proceedings of the Neural Information Processing Systems Tra...

work page 2021

[71] [73]

PPB-affinity: Protein-protein binding affinity dataset for AI-based protein drug discovery

H Liu, P Chen, X Zhai, KG Huo, S Zhou, L Han, and G Fan. PPB-affinity: Protein-protein binding affinity dataset for AI-based protein drug discovery. Scientific data, 11(1):1316, December 2024

work page 2024

[72] [74]

Kinch, R

Minkyung Baek, Frank DiMaio, Ivan Anishchenko, Justas Dauparas, Sergey Ovchinnikov, Gyu Rie Lee, Jue Wang, Qian Cong, Lisa N. Kinch, R. Dustin Schaeffer, Carla Millán, Heewook Park, Cole Adams, Craig R. Glassman, Andy DeGiovanni, Jose H. Pereira, Andria V . Rodrigues, Alberdina A. van Dijk, Andrea C. Ebrecht, D. J. Opperman, Thomas Sagmeister, Christoph B...

work page 2021

[73] [75]

AlphaFold Protein Structure Database in 2024: Providing structure coverage for over 214 million protein sequences

Mihaly Varadi, Damian Bertoni, Paulyna Magana, Urmila Paramval, Ivanna Pidruchna, Malarvizhi Radhakrishnan, Maxim Tsenkov, Sreenath Nair, Milot Mirdita, Jingi Yeo, Oleg Kovalevskiy, Kathryn Tunyasuvunakool, Agata Laydon, Augustin Žídek, Hamish Tomlinson, Dhavanthi Hariharan, Josh Abrahamson, Tim Green, John Jumper, Ewan Birney, Martin Steinegger, Demis Ha...

work page 2024

[74] [76]

John Jumper, Richard Evans, Alexander Pritzel, Tim Green, Michael Figurnov, Olaf Ronneberger, Koray Tunyasuvunakool, Russ Bates, Augustin Žídek, Anna Potapenko, Charlie Bridgland, Clemens Meyer, Simon A. A. Kohl, Andrew J. Ballard, Andrew Cowie, Bernardino Romera-Paredes, Stig Nikolov, Rishub Jain, Jonas Adler, Tom Back, Stig Petersen, David Reiman, Ellen...

work page 2021

[75] [77]

Lawrence Zitnick, and Zachary Ulissi

Lowik Chanussot, Abhishek Das, Siddharth Goyal, Thibaut Lavril, Muhammed Shuaibi, Morgane Riviere, Kevin Tran, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Aini Palizhati, Anuroop Sriram, Brandon Wood, Junwoong Yoon, Devi Parikh, C. Lawrence Zitnick, and Zachary Ulissi. Open catalyst 2020 (oc20) dataset and community challenges. ACS Catalysis, 11(10):6059–6...

work page 2020

[76] [78]

Merchant, S

A. Merchant, S. Batzner, Samuel S. Schoenholz, Muratahan Aykol, Gabin Cheon, and Ekin D. Cubuk. Scaling deep learning for materials discovery. Nature, 624(7990):80–85, 2023

work page 2023

[77] [79]

Pieters, Eric A

Huan Li, Hao Zheng, Ting Yue, Zhi Xie, Shanshan Yu, Jun Zhou, Tarun Kapri, Yiming Wang, Zhi Cao, Hong Zhao, Akerke Kemelbay, Jie He, Guojing Zhang, Paul F. Pieters, Eric A. Dailing, James R. Cappiello, Miquel Salmeron, Xiaodan Gu, Ting Xu, Peng Wu, Yuzhang Li, Karl B. Sharpless, and Yi Liu. Machine learning-accelerated discovery of heat-resistant polysulf...

work page 2025

[78] [80]

Kipf, Ethan Fetaya, Kuan-Chieh Wang, Max Welling, and Richard Zemel

Thomas N. Kipf, Ethan Fetaya, Kuan-Chieh Wang, Max Welling, and Richard Zemel. Neural relational inference for interacting systems. In International Conference on Machine Learning, pages 2688–2697, 2018

work page 2018

[79] [81]

Zoology of non-Hermitian spectra and their graph topology

Tommy Tai and Ching Hua Lee. Zoology of non-Hermitian spectra and their graph topology. Physical Review B, 107(22):L220301, June 2023

work page 2023

[80] [82]

Topological Non-Hermitian skin effect

Rijia Lin, Tommy Tai, Mengjie Yang, Linhu Li, and Ching Hua Lee. Topological Non-Hermitian skin effect. Frontiers of Physics, 18(5):53605, October 2023

work page 2023