Scaling Novel Graph Generation via Lightweight Structure-Guided Autoregressive Models

Alessio Barboni; Bishal Lakha; Edoardo Serra; Massimiliano Lupo Pasini

arxiv: 2606.04287 · v1 · pith:NAFMUQNJnew · submitted 2026-06-02 · 💻 cs.LG · cs.AI

Scaling Novel Graph Generation via Lightweight Structure-Guided Autoregressive Models

Alessio Barboni , Massimiliano Lupo Pasini , Bishal Lakha , Edoardo Serra This is my paper

Pith reviewed 2026-06-28 10:20 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords graph generationautoregressive modelstopological orderingnoveltymolecular graphssequence modelsLSTMMamba

0 comments

The pith

A structure-guided topological ordering serializes graphs into edge sequences so lightweight autoregressive models can generate novel graphs at near log-linear cost.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents an autoregressive framework that first converts any graph into a regular sequence of edges via a structure-guided topological ordering. This ordering replaces the quadratic or denoising costs of prior models with near log-linear generation. A two-phase training process adds exploration-oriented augmentation followed by iterative refinement to push the model toward graphs that differ from the training set. The result is reported on both molecular and non-molecular benchmarks, where novelty rises while validity and uniqueness stay high. The same framework runs on LSTM or Mamba backbones and extends to longer sequences when large-memory accelerators are available.

Core claim

By serializing graphs through structure-guided topological ordering into regular edge sequences and training with a two-phase strategy of augmentation plus refinement, the autoregressive model produces graphs that are more novel than those from prior methods while preserving high validity and uniqueness; the same pipeline supports both LSTM and Mamba causal backbones and runs longer sequences on large-memory hardware.

What carries the argument

Structure-guided topological ordering that converts graphs into regular edge sequences for autoregressive generation.

If this is right

Novelty rises on both molecular and non-molecular graph benchmarks while validity and uniqueness remain high.
The same pipeline works with LSTM and Mamba-style causal sequence models.
Large-memory accelerators enable experiments on graph sequences longer than typical GPU limits allow.
Near log-linear generation replaces the quadratic or full-adjacency costs of earlier diffusion and autoregressive approaches.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The serialization step could be tested on non-graph structured objects such as circuit netlists or dependency trees to check whether the log-linear benefit generalizes.
The two-phase training could be combined with other sequence backbones to measure how much the novelty gain depends on the choice of augmentation schedule.
If the topological ordering can be made differentiable, end-to-end learning of the serialization itself becomes a possible extension.

Load-bearing premise

The structure-guided topological ordering successfully serializes graphs into regular edge sequences that preserve essential properties for valid generation while achieving near log-linear complexity.

What would settle it

On standard molecular benchmarks such as QM9, if novelty does not increase over prior autoregressive baselines while validity falls below the levels reported for the new method, the central claim is falsified.

Figures

Figures reproduced from arXiv: 2606.04287 by Alessio Barboni, Bishal Lakha, Edoardo Serra, Massimiliano Lupo Pasini.

**Figure 1.** Figure 1: Training dynamics of the full method. Left: QM7x. Right: Transition1x. Across datasets, novelty increases substantially while uniqueness remains high and validity stays broadly stable, illustrating the intended exploration–refinement behavior of the two-phase training scheme. 5.2 Ablation Summary We summarize the main ablation findings here and provide full curves and discussion in Appendix D. Removing ReS… view at source ↗

**Figure 2.** Figure 2: Ablation of the two-phase training scheme on Transition1x. (a) With perturbed graphs but without ReST, the model explores more broadly but fails to maintain high validity. (b) Without perturbed graphs and without ReST, the model remains more conservative but exhibits a steady decline in novelty. D Additional Ablations D.1 Ablation on Phase 1 and the Role of ReST To assess whether the components of the prop… view at source ↗

**Figure 3.** Figure 3: Ablation of the node ordering strategy on Transition1x. Fully random ordering causes a severe collapse in Validity, while plain BFS without structural guidance recovers part of the lost structure but remains weaker than the full structure-guided traversal. Taken together, these two ablations illustrate the complementary roles of the two phases. Phase 1 with perturbed graphs encourages exploration but requi… view at source ↗

read the original abstract

Generating realistic and diverse graphs is a key problem in machine learning, with applications in molecular discovery, circuit design, cybersecurity, and beyond. However, current graph generative models remain limited by scalability and novelty. Diffusion-based methods often require costly full-adjacency operations and long denoising chains, while many autoregressive and hybrid models have at least quadratic complexity. In addition, these models often imitate training graphs rather than generalize beyond them. We propose a lightweight autoregressive framework to address these issues. It uses a structure-guided topological ordering to serialize graphs into regular edge sequences, enabling near log-linear generation, and a two-phase training strategy that combines exploration-oriented augmentation with iterative refinement to reduce overfitting and promote controlled novelty. Experiments on molecular and non-molecular benchmarks show that our approach improves novelty while preserving high validity and uniqueness. The framework also supports both LSTM and Mamba-style causal sequence backbones, with large-memory accelerators enabling longer graph-sequence experiments beyond typical GPU limits.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper's structure-guided ordering plus two-phase training gives a workable path to more novel autoregressive graph generation at better scale than quadratic baselines.

read the letter

The main point is that they serialize graphs via a structure-guided topological ordering to get near log-linear generation, then use a two-phase training mix of augmentation and refinement to push novelty without tanking validity or uniqueness.

What they actually deliver is a framework that works with both LSTM and Mamba backbones and runs longer sequences on large-memory hardware. The experiments cover molecular and non-molecular benchmarks and report direct gains on novelty while holding the other metrics steady. The ordering procedure and training split are described in enough detail to see how they avoid the usual imitation problem in autoregressive models.

The soft spot is that the novelty lift still depends on the ordering preserving the right graph properties across domains, and the results are benchmark-specific. It is not obvious yet how far this extends to graphs much larger than the tested sets or to tasks where validity is harder to maintain. No circularity or hidden fitting shows up in the argument.

This is for researchers who build or apply graph generators in chemistry, networks, or design tasks and want something lighter than diffusion. Readers who care about sequence-model backbones for structured data will get the most out of the LSTM-Mamba comparison.

The work is grounded enough in its own claims and measurements to deserve referee time rather than a desk reject.

Referee Report

0 major / 3 minor

Summary. The manuscript proposes a lightweight autoregressive framework for graph generation. It employs a structure-guided topological ordering to serialize graphs into regular edge sequences, enabling near log-linear generation complexity. A two-phase training strategy combines exploration-oriented augmentation with iterative refinement to reduce overfitting and promote novelty. Experiments on molecular and non-molecular benchmarks report improved novelty while preserving high validity and uniqueness; the framework supports both LSTM and Mamba-style causal sequence backbones and uses large-memory accelerators for longer sequences.

Significance. If the central claims hold, the work is significant for scaling graph generative models beyond quadratic complexity and diffusion-based costs, with direct relevance to molecular discovery and related domains. Credit is due for the explicit experimental measurements on validity, uniqueness, and novelty across benchmarks, the compatibility with multiple sequence backbones, and the practical use of large-memory accelerators. The structure-guided ordering and two-phase training are presented as load-bearing components that appear internally consistent based on the described procedure and results.

minor comments (3)

[Abstract] Abstract: the high-level claims would be strengthened by including at least one or two concrete quantitative results (e.g., novelty scores or validity percentages) rather than qualitative statements only.
[Method] The description of the topological ordering procedure and the two-phase training could benefit from an explicit complexity analysis or pseudocode to clarify the claimed near log-linear scaling.
[Experiments] Figure or table captions for the benchmark results should explicitly state the number of runs, error bars, and baseline implementations to improve reproducibility.

Simulated Author's Rebuttal

0 responses · 0 unresolved

Thank you for the positive assessment of our work and the recommendation for minor revision. We appreciate the recognition of the significance for scaling graph generative models, the experimental measurements, and the practical aspects of the framework. No major comments were listed in the report.

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The provided abstract and description outline a structure-guided topological ordering to serialize graphs into edge sequences for near log-linear autoregressive generation, combined with a two-phase training strategy using augmentation and refinement. No equations, fitted parameters renamed as predictions, self-citations as load-bearing premises, or uniqueness theorems imported from prior author work are present in the text. The central claims rest on the described procedure and benchmark experiments measuring validity, uniqueness, and novelty, which are externally falsifiable and do not reduce to self-definition or input fitting by construction. The derivation chain is self-contained against the stated assumptions.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no information on free parameters, axioms, or invented entities.

pith-pipeline@v0.9.1-grok · 5705 in / 1022 out tokens · 28696 ms · 2026-06-28T10:20:46.947206+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

15 extracted references · 9 canonical work pages · 3 internal anchors

[2]

org/abs/2305.15562

URL https://arxiv. org/abs/2305.15562. Dexiong Chen, Markus Krimmel, and Karsten Borgwardt. Flatten graphs as sequences: Transformers are scalable graph generators. InAdvances in Neural Information Processing Systems, 2025a. URL https://arxiv.org/abs/2502.02216. To appear. Xiaohui Chen, Xu Han, Jiajing Hu, Francisco J. R. Ruiz, and Liping Liu. Order matte...

work page arXiv
[4]

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

URL https://arxiv. org/abs/2304.06767. Claire Donnat, Marinka Zitnik, David Hallac, and Jure Leskovec. Learning structural node em- beddings via diffusion wavelets. InProceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1320–1329. ACM,

work page internal anchor Pith review Pith/arXiv arXiv
[5]

URLhttps://doi.org/10.1145/3219819.3220025

doi: 10.1145/3219819.3220025. URLhttps://doi.org/10.1145/3219819.3220025. Albert Gu and Tri Dao. Mamba: Linear-time sequence modeling with selective state spaces.arXiv preprint arXiv:2312.00752,

work page doi:10.1145/3219819.3220025
[7]

Reinforced Self-Training (ReST) for Language Modeling

URLhttps://arxiv.org/abs/2308.08998. Han Huang, Leilei Sun, Bowen Du, Yanjie Fu, and Weifeng Lv. GraphGDP: Generative diffusion processes for permutation invariant graph generation. In2022 IEEE International Conference on Data Mining, pages 201–210. IEEE,

work page internal anchor Pith review Pith/arXiv arXiv
[8]

URL https://arxiv.org/abs/2212.01842

doi: 10.1109/ICDM54844.2022.00030. URL https://arxiv.org/abs/2212.01842. 14 Yunhui Jang, Seul Lee, and Sungsoo Ahn. A simple and scalable representation for graph generation. InInternational Conference on Learning Representations,

work page doi:10.1109/icdm54844.2022.00030 2022
[9]

URLhttps://doi.org/10.1145/3450315

doi: 10.1145/3450315. URLhttps://doi.org/10.1145/3450315. Lingkai Kong, Jiaming Cui, Haotian Sun, Yuchen Zhuang, B. Aditya Prakash, and Chao Zhang. Autoregressive diffusion model for graph generation. InProceedings of the 40th International Conference on Machine Learning, volume 202 ofProceedings of Machine Learning Research, pages 17391–17408. PMLR,

work page doi:10.1145/3450315
[10]

Chenhao Niu, Yang Song, Jiaming Song, Shengjia Zhao, Aditya Grover, and Stefano Ermon

URL https://proceedings.neurips.cc/paper/2019/hash/ d0921d442ee91b896ad95059d13df618-Abstract.html. Chenhao Niu, Yang Song, Jiaming Song, Shengjia Zhao, Aditya Grover, and Stefano Ermon. Permutation invariant graph generation via score-based generative modeling. InProceed- ings of the 23rd International Conference on Artificial Intelligence and Statistics...

2019
[11]

URLhttps://doi.org/10.1145/3097983.3098061

doi: 10.1145/ 3097983.3098061. URLhttps://doi.org/10.1145/3097983.3098061. Yu Rong, Wenbing Huang, Tingyang Xu, and Junzhou Huang. DropEdge: Towards deep graph convo- lutional networks on node classification. InInternational Conference on Learning Representations,

work page doi:10.1145/3097983.3098061
[12]

Martin Simonovsky and Nikos Komodakis

URLhttps://openreview.net/forum?id=Hkx1qkrKPr. Martin Simonovsky and Nikos Komodakis. GraphV AE: Towards generation of small graphs using variational autoencoders. InArtificial Neural Networks and Machine Learning – ICANN 2018, pages 412–422. Springer,

2018
[13]

GraphVAE: Towards Generation of Small Graphs Using Variational Autoencoders

URLhttps://arxiv.org/abs/1802.03480. Antoine Siraudin, Fragkiskos D. Malliaros, and Christopher Morris. Cometh: A continuous-time discrete-state graph diffusion model.arXiv preprint arXiv:2406.06449,

work page internal anchor Pith review Pith/arXiv arXiv
[14]

Clement Vignac, Igor Krawczuk, Antoine Siraudin, Bohan Wang, V olkan Cevher, and Pascal Frossard

URL https: //arxiv.org/abs/2406.06449. Clement Vignac, Igor Krawczuk, Antoine Siraudin, Bohan Wang, V olkan Cevher, and Pascal Frossard. DiGress: Discrete denoising diffusion for graph generation. InInternational Conference on Learning Representations,

work page arXiv
[15]

Jiaxuan You, Rex Ying, Xiang Ren, William L

URL https://proceedings.neurips.cc/paper_files/paper/2024/ hash/91813e5ddd9658b99be4c532e274b49c-Abstract-Conference.html. Jiaxuan You, Rex Ying, Xiang Ren, William L. Hamilton, and Jure Leskovec. GraphRNN: Generating realistic graphs with deep auto-regressive models. InProceedings of the 35th International Conference on Machine Learning, volume 80 ofProc...

2024
[16]

Eric Zelikman, Yuhuai Wu, Jesse Mu, and Noah D

URL https://proceedings.neurips.cc/paper/2020/ hash/3fe230348e9a12c13120749e3f9fa4cd-Abstract.html. Eric Zelikman, Yuhuai Wu, Jesse Mu, and Noah D. Goodman. STaR: Bootstrapping reasoning with reasoning. InAdvances in Neural Information Processing Systems, volume 35, pages 15476– 15488,

2020
[17]

Lingxiao Zhao, Xueying Ding, and Leman Akoglu

URL https://proceedings.neurips.cc/paper_files/paper/2022/hash/ 639a9a172c044fbb64175b5fad42e9a5-Abstract-Conference.html. Lingxiao Zhao, Xueying Ding, and Leman Akoglu. PARD: Permutation-invariant autoregres- sive diffusion for graph generation. InAdvances in Neural Information Processing Systems, volume 37,

2022
[18]

A Complexity Analysis We analyze the per-graph time complexity of the proposed pipeline

URL https://proceedings.neurips.cc/paper_files/paper/2024/ hash/0d89cf183391e12063cb63ff0d75ed95-Abstract-Conference.html. A Complexity Analysis We analyze the per-graph time complexity of the proposed pipeline. Let n=|V| and m=|E| . We focus on sparse graphs with m=O(n) , as in the molecular and structural benchmarks considered in this work, and treat th...

2024

[1] [2]

org/abs/2305.15562

URL https://arxiv. org/abs/2305.15562. Dexiong Chen, Markus Krimmel, and Karsten Borgwardt. Flatten graphs as sequences: Transformers are scalable graph generators. InAdvances in Neural Information Processing Systems, 2025a. URL https://arxiv.org/abs/2502.02216. To appear. Xiaohui Chen, Xu Han, Jiajing Hu, Francisco J. R. Ruiz, and Liping Liu. Order matte...

work page arXiv

[2] [4]

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

URL https://arxiv. org/abs/2304.06767. Claire Donnat, Marinka Zitnik, David Hallac, and Jure Leskovec. Learning structural node em- beddings via diffusion wavelets. InProceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1320–1329. ACM,

work page internal anchor Pith review Pith/arXiv arXiv

[3] [5]

URLhttps://doi.org/10.1145/3219819.3220025

doi: 10.1145/3219819.3220025. URLhttps://doi.org/10.1145/3219819.3220025. Albert Gu and Tri Dao. Mamba: Linear-time sequence modeling with selective state spaces.arXiv preprint arXiv:2312.00752,

work page doi:10.1145/3219819.3220025

[4] [7]

Reinforced Self-Training (ReST) for Language Modeling

URLhttps://arxiv.org/abs/2308.08998. Han Huang, Leilei Sun, Bowen Du, Yanjie Fu, and Weifeng Lv. GraphGDP: Generative diffusion processes for permutation invariant graph generation. In2022 IEEE International Conference on Data Mining, pages 201–210. IEEE,

work page internal anchor Pith review Pith/arXiv arXiv

[5] [8]

URL https://arxiv.org/abs/2212.01842

doi: 10.1109/ICDM54844.2022.00030. URL https://arxiv.org/abs/2212.01842. 14 Yunhui Jang, Seul Lee, and Sungsoo Ahn. A simple and scalable representation for graph generation. InInternational Conference on Learning Representations,

work page doi:10.1109/icdm54844.2022.00030 2022

[6] [9]

URLhttps://doi.org/10.1145/3450315

doi: 10.1145/3450315. URLhttps://doi.org/10.1145/3450315. Lingkai Kong, Jiaming Cui, Haotian Sun, Yuchen Zhuang, B. Aditya Prakash, and Chao Zhang. Autoregressive diffusion model for graph generation. InProceedings of the 40th International Conference on Machine Learning, volume 202 ofProceedings of Machine Learning Research, pages 17391–17408. PMLR,

work page doi:10.1145/3450315

[7] [10]

Chenhao Niu, Yang Song, Jiaming Song, Shengjia Zhao, Aditya Grover, and Stefano Ermon

URL https://proceedings.neurips.cc/paper/2019/hash/ d0921d442ee91b896ad95059d13df618-Abstract.html. Chenhao Niu, Yang Song, Jiaming Song, Shengjia Zhao, Aditya Grover, and Stefano Ermon. Permutation invariant graph generation via score-based generative modeling. InProceed- ings of the 23rd International Conference on Artificial Intelligence and Statistics...

2019

[8] [11]

URLhttps://doi.org/10.1145/3097983.3098061

doi: 10.1145/ 3097983.3098061. URLhttps://doi.org/10.1145/3097983.3098061. Yu Rong, Wenbing Huang, Tingyang Xu, and Junzhou Huang. DropEdge: Towards deep graph convo- lutional networks on node classification. InInternational Conference on Learning Representations,

work page doi:10.1145/3097983.3098061

[9] [12]

Martin Simonovsky and Nikos Komodakis

URLhttps://openreview.net/forum?id=Hkx1qkrKPr. Martin Simonovsky and Nikos Komodakis. GraphV AE: Towards generation of small graphs using variational autoencoders. InArtificial Neural Networks and Machine Learning – ICANN 2018, pages 412–422. Springer,

2018

[10] [13]

GraphVAE: Towards Generation of Small Graphs Using Variational Autoencoders

URLhttps://arxiv.org/abs/1802.03480. Antoine Siraudin, Fragkiskos D. Malliaros, and Christopher Morris. Cometh: A continuous-time discrete-state graph diffusion model.arXiv preprint arXiv:2406.06449,

work page internal anchor Pith review Pith/arXiv arXiv

[11] [14]

Clement Vignac, Igor Krawczuk, Antoine Siraudin, Bohan Wang, V olkan Cevher, and Pascal Frossard

URL https: //arxiv.org/abs/2406.06449. Clement Vignac, Igor Krawczuk, Antoine Siraudin, Bohan Wang, V olkan Cevher, and Pascal Frossard. DiGress: Discrete denoising diffusion for graph generation. InInternational Conference on Learning Representations,

work page arXiv

[12] [15]

Jiaxuan You, Rex Ying, Xiang Ren, William L

URL https://proceedings.neurips.cc/paper_files/paper/2024/ hash/91813e5ddd9658b99be4c532e274b49c-Abstract-Conference.html. Jiaxuan You, Rex Ying, Xiang Ren, William L. Hamilton, and Jure Leskovec. GraphRNN: Generating realistic graphs with deep auto-regressive models. InProceedings of the 35th International Conference on Machine Learning, volume 80 ofProc...

2024

[13] [16]

Eric Zelikman, Yuhuai Wu, Jesse Mu, and Noah D

URL https://proceedings.neurips.cc/paper/2020/ hash/3fe230348e9a12c13120749e3f9fa4cd-Abstract.html. Eric Zelikman, Yuhuai Wu, Jesse Mu, and Noah D. Goodman. STaR: Bootstrapping reasoning with reasoning. InAdvances in Neural Information Processing Systems, volume 35, pages 15476– 15488,

2020

[14] [17]

Lingxiao Zhao, Xueying Ding, and Leman Akoglu

URL https://proceedings.neurips.cc/paper_files/paper/2022/hash/ 639a9a172c044fbb64175b5fad42e9a5-Abstract-Conference.html. Lingxiao Zhao, Xueying Ding, and Leman Akoglu. PARD: Permutation-invariant autoregres- sive diffusion for graph generation. InAdvances in Neural Information Processing Systems, volume 37,

2022

[15] [18]

A Complexity Analysis We analyze the per-graph time complexity of the proposed pipeline

URL https://proceedings.neurips.cc/paper_files/paper/2024/ hash/0d89cf183391e12063cb63ff0d75ed95-Abstract-Conference.html. A Complexity Analysis We analyze the per-graph time complexity of the proposed pipeline. Let n=|V| and m=|E| . We focus on sparse graphs with m=O(n) , as in the molecular and structural benchmarks considered in this work, and treat th...

2024