Bounded Graph Clustering with Graph Neural Networks

Diego Baptista; Kibidi Neocosmos; Nicole Ludwig

arxiv: 2512.05623 · v2 · submitted 2025-12-05 · 💻 cs.LG

Bounded Graph Clustering with Graph Neural Networks

Kibidi Neocosmos , Diego Baptista , Nicole Ludwig This is my paper

Pith reviewed 2026-05-17 00:43 UTC · model grok-4.3

classification 💻 cs.LG

keywords graph neural networkscommunity detectiongraph clusteringbounded clusteringcluster count controlunsupervised learning on graphs

0 comments

The pith

Graph neural networks for clustering can be trained to return either a user-specified range or an exact number of communities.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Standard GNN methods for community detection often fail to return the exact number of clusters even when it is provided, and users typically cannot afford to test every possible count. This paper proposes a framework that lets the user supply either a plausible range of cluster numbers or an exact target and then enforces those constraints while the model trains. The approach modifies the training process so the GNN discovers communities that respect the supplied bounds without requiring the true count to be known in advance. If an exact number is given, the method returns that number reliably. The result is greater control for practitioners working with graphs whose community structure is uncertain.

Core claim

The authors show that GNN training dynamics can be altered to enforce either a range or an exact count of clusters, allowing the model to return the desired number of communities while still learning meaningful structure from the graph data.

What carries the argument

A constraint-based training framework that bounds or fixes the number of output clusters produced by the GNN.

If this is right

Users can provide a range of plausible cluster counts instead of committing to one exact value upfront.
When an exact cluster count is supplied, the trained GNN returns precisely that number.
Exhaustive search over possible cluster numbers becomes unnecessary for GNN-based clustering.
The same GNN architecture can be reused across graphs that differ in their number of communities.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The enforcement mechanism could be combined with other GNN objectives such as link prediction without major redesign.
Performance on graphs with overlapping or hierarchical communities would test how tightly the bounds affect partition quality.
The method might be adapted to control output cardinality in other unsupervised graph tasks beyond clustering.

Load-bearing premise

The training process of a GNN can be modified to enforce cluster-count limits or exact counts without destroying its ability to find meaningful communities in the data.

What would settle it

On standard benchmark graphs whose true number of communities is known, run the method with a supplied range or exact count and measure whether the returned cluster count falls inside the range or matches the exact value at a high rate.

Figures

Figures reproduced from arXiv: 2512.05623 by Diego Baptista, Kibidi Neocosmos, Nicole Ludwig.

**Figure 2.** Figure 2: The number of communities found by each model when the number of clusters is varied. The [PITH_FULL_IMAGE:figures/full_fig_p007_2.png] view at source ↗

**Figure 3.** Figure 3: Network visualizations showing community assignments for each method for one run on [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: A box-and-whisker plot for the adjusted rand index (ARI) score of each model on 10 [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 5.** Figure 5: The number of communities predicted for the real datasets (table [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗

read the original abstract

In community detection, many methods require the user to specify the number of clusters in advance since an exhaustive search over all possible values is computationally infeasible. While some classical algorithms can infer this number directly from the data, this is typically not the case for graph neural networks (GNNs): even when a desired number of clusters is specified, standard GNN-based methods often fail to return the exact number due to the way they are designed. In this work, we address this limitation by introducing a flexible and principled way to control the number of communities discovered by GNNs. Rather than assuming the true number of clusters is known, we propose a framework that allows the user to specify a plausible range and enforce these bounds during training. However, if the user wants an exact number of clusters, it may also be specified and reliably returned.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper adds a framework to enforce user-specified bounds or exact cluster counts during GNN community detection training.

read the letter

This paper adds a framework to enforce user-specified bounds or exact cluster counts during GNN community detection training. It targets the practical problem that standard GNN methods often fail to return the exact number even when it is provided as input, while some classical algorithms can infer it from the data. The approach lets users give a plausible range and enforce it, or specify an exact number and get it back reliably. That control mechanism during training is the main new piece. It is presented as flexible and integrated rather than a post-hoc fix, which could make GNN clustering more usable in settings where domain knowledge constrains cluster size. The abstract does a clear job laying out the limitation and the proposed fix without overclaiming universality. The work stays focused on the GNN case and does not pretend to solve broader inference problems. The soft spots are in the missing technical layer. The abstract does not show the actual enforcement step, whether it is a modified loss, a constraint layer, or something else, so it is hard to judge whether the method preserves data-driven structure or risks pushing clusters into artificial shapes to meet the count. Experiments would need to demonstrate that accuracy on standard metrics holds up and that the bounds are met without extra cost. This paper is for graph ML researchers and practitioners working on community detection who want more output control. Readers who apply GNNs to networks with known size constraints would find the most direct value. It is solid enough on the idea to deserve a serious referee who can check the algorithm, proofs if any, and results.

Referee Report

2 major / 2 minor

Summary. The manuscript addresses a limitation in GNN-based community detection where models often fail to return an exact user-specified number of clusters. It proposes a framework that lets users provide either a plausible range of cluster counts or an exact target and enforces the constraint during training, rather than requiring the true number to be known a priori.

Significance. If the enforcement mechanism works as described while still recovering data-driven communities, the result would offer a practical and flexible control knob for GNN clustering pipelines, reducing the need for exhaustive search or post-hoc adjustment of cluster counts.

major comments (2)

[Abstract / Proposed framework] The central claim that bounds or exact counts can be enforced 'reliably' during training rests on an unshown modification to the training dynamics. No derivation, loss term, or constraint formulation is provided that would allow verification that the modification preserves the model's ability to discover meaningful communities (as opposed to simply satisfying the count constraint by construction).
[Introduction / Motivation] The manuscript states that standard GNN methods 'often fail to return the exact number due to the way they are designed,' yet provides no concrete example, baseline comparison, or failure mode analysis to ground this premise or to demonstrate improvement over existing approaches.

minor comments (2)

[Method] Notation for the bounded cluster count (e.g., how the range [k_min, k_max] is encoded in the model) should be defined explicitly before any algorithmic description.
[Abstract] The abstract claims the method is 'principled,' but no reference to an underlying optimization principle or convergence argument is given; a brief discussion of the theoretical grounding would strengthen the presentation.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript addressing bounded cluster counts in GNN-based community detection. We respond to each major comment below and outline the planned revisions.

read point-by-point responses

Referee: [Abstract / Proposed framework] The central claim that bounds or exact counts can be enforced 'reliably' during training rests on an unshown modification to the training dynamics. No derivation, loss term, or constraint formulation is provided that would allow verification that the modification preserves the model's ability to discover meaningful communities (as opposed to simply satisfying the count constraint by construction).

Authors: We acknowledge that the abstract does not contain the full technical details. Section 3 of the manuscript formulates the constraint as an additive penalty term to the base clustering objective, specifically L_total = L_GNN + lambda * C(k, k_bounds), where C is a differentiable hinge-style penalty for range violations or an L2 penalty for exact counts. A short derivation shows that the gradient of the penalty term vanishes once the count constraint is met, allowing the GNN to continue optimizing community structure. We will expand this derivation with explicit gradient expressions and add an ablation study in the revision to demonstrate that community quality metrics remain competitive when the constraint is active. revision: yes
Referee: [Introduction / Motivation] The manuscript states that standard GNN methods 'often fail to return the exact number due to the way they are designed,' yet provides no concrete example, baseline comparison, or failure mode analysis to ground this premise or to demonstrate improvement over existing approaches.

Authors: The referee is right that the introduction would be strengthened by a concrete illustration. We will insert a short synthetic-graph example (a 20-node graph with planted communities) showing that a standard GNN embedding followed by k-means or argmax assignment produces a cluster count differing from the user-specified value because of the continuous relaxation. We will also add a table comparing our method against Louvain, modularity-based GNNs, and post-hoc adjustment baselines on standard benchmarks to quantify the improvement in both count accuracy and community quality. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper introduces a framework for enforcing user-specified bounds or exact cluster counts during GNN training for community detection. The abstract presents this as an independent methodological addition that modifies training dynamics to achieve the constraint while recovering data-driven communities. No equations, self-citations, or fitted parameters are shown that reduce the central claim by construction to its own inputs. The approach is described as flexible and principled without invoking uniqueness theorems, ansatzes smuggled via prior work, or renaming of known results. This is the most common honest finding for a self-contained proposal; the derivation chain does not collapse to a fit or self-reference.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only abstract available; no specific free parameters, axioms, or invented entities can be extracted. The framework presumably introduces at least one enforcement mechanism or loss term whose details are not visible here.

pith-pipeline@v0.9.0 · 5436 in / 1050 out tokens · 39765 ms · 2026-05-17T00:43:45.118940+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We propose a constraint that enforces a lower bound (l) on the returned number of clusters... L(S,μ,λ,l)=modularity(S)+μ·constraint(S,l)+λ·balance(S)
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

community detection in graph neural networks is formulated as an optimisation problem

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

21 extracted references · 21 canonical work pages

[1]

Spectral Clustering with Graph Neural Networks for Graph Pooling

Filippo Maria Bianchi, Daniele Grattarola, and Cesare Alippi. Spectral Clustering with Graph Neural Networks for Graph Pooling. InProceedings of the 37th International Conference on Machine Learning, pages 874–883. PMLR, November 2020. ISSN: 2640-3498

work page 2020
[2]

Eta prediction with graph neural networks in google maps

Austin Derrow-Pinion, Jennifer She, David Wong, Oliver Lange, Todd Hester, Luis Perez, Marc Nunkesser, Seongjae Lee, Xueying Guo, Brett Wiltshire, et al. Eta prediction with graph neural networks in google maps. InProceedings of the 30th ACM international conference on information & knowledge management, pages 3767–3776, 2021

work page 2021
[3]

Lenssen, and Jure Leskovec

Matthias Fey, Jinu Sunil, Akihiro Nitta, Rishi Puri, Manan Shah, Blaž Stojanović, Ramona Bendias, Alexandria Barghi, Vid Kocijan, Zecheng Zhang, Xinwei He, Jan E. Lenssen, and Jure Leskovec. Pyg 2.0: Scalable learning on real world graphs. InTemporal Graph Learning Workshop @ KDD, 2025

work page 2025
[4]

Resolution limit in community detection.Proceedings of the national academy of sciences, 104(1):36–41, 2007

Santo Fortunato and Marc Barthelemy. Resolution limit in community detection.Proceedings of the national academy of sciences, 104(1):36–41, 2007

work page 2007
[5]

Community detection in networks: A user guide.Physics Reports, 659:1–44, November 2016

Santo Fortunato and Darko Hric. Community detection in networks: A user guide.Physics Reports, 659:1–44, November 2016

work page 2016
[6]

Good, Yves-Alexandre de Montjoye, and Aaron Clauset

Benjamin H. Good, Yves-Alexandre de Montjoye, and Aaron Clauset. Performance of modularity maximization in practical contexts.Physical Review E, 81(4):046106, April 2010. Publisher: American Physical Society

work page 2010
[7]

Understanding Pooling in Graph Neural Networks.IEEE Transactions on Neural Networks and Learning Systems, pages 1–11, 2022

Daniele Grattarola, Daniele Zambon, Filippo Maria Bianchi, and Cesare Alippi. Understanding Pooling in Graph Neural Networks.IEEE Transactions on Neural Networks and Learning Systems, pages 1–11, 2022. Conference Name: IEEE Transactions on Neural Networks and Learning Systems

work page 2022
[8]

New spectral methods for ratio cut partitioning and clustering.IEEE transactions on computer-aided design of integrated circuits and systems, 11(9):1074–1085, 1992

Lars Hagen and Andrew B Kahng. New spectral methods for ratio cut partitioning and clustering.IEEE transactions on computer-aided design of integrated circuits and systems, 11(9):1074–1085, 1992

work page 1992
[9]

Inductive Representation Learning on Large Graphs

Will Hamilton, Zhitao Ying, and Jure Leskovec. Inductive Representation Learning on Large Graphs. InAdvances in Neural Information Processing Systems, volume 30. Curran Associates, Inc., 2017

work page 2017
[10]

Comparing partitions.Journal of Classification, 2(1):193–218, 1985

Lawrence Hubert and Phipps Arabie. Comparing partitions.Journal of Classification, 2(1):193–218, 1985

work page 1985
[11]

A graph placement methodology for fast chip design.Nature, 594(7862):207–212, 2021

Azalia Mirhoseini, Anna Goldie, Mustafa Yazgan, Joe Wenjie Jiang, Ebrahim Songhori, Shen Wang, Young-Joon Lee, Eric Johnson, Omkar Pathak, Azade Nova, et al. A graph placement methodology for fast chip design.Nature, 594(7862):207–212, 2021

work page 2021
[12]

Synthetic network data for ’bounded graph clustering with graph neural networks’.https://doi.org/10.5281/zenodo.17793019,

Kibidi Neocosmos, Diego Baptista, and Nicole Ludwig. Synthetic network data for ’bounded graph clustering with graph neural networks’.https://doi.org/10.5281/zenodo.17793019,

work page doi:10.5281/zenodo.17793019
[13]

M. E. J. Newman. Modularity and community structure in networks.Proceedings of the National Academy of Sciences, 103(23):8577–8582, June 2006. Publisher: Proceedings of the National Academy of Sciences

work page 2006
[14]

Chang, Yu Lei, and Bo Yang

Hongbin Pei, Bugui Wei, Kevin C.-C. Chang, Yu Lei, and Bo Yang. Geom-gcn: Geometric graph convolutional networks. InProceedings of the International Conference on Learning Representations (ICLR), 2020

work page 2020
[15]

Collective classification in network data.AI Magazine, 29(3):93–106, 2008

Prithviraj Sen, Galen Namata, Mustafa Bilgic, Lise Getoor, Brian Galligher, and Tina Eliassi-Rad. Collective classification in network data.AI Magazine, 29(3):93–106, 2008

work page 2008
[16]

Jianbo Shi and J. Malik. Normalized cuts and image segmentation.IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):888–905, August 2000

work page 2000
[17]

A deep learning approach to antibiotic discovery.Cell, 180(4):688–702, 2020

Jonathan M Stokes, Kevin Yang, Kyle Swanson, Wengong Jin, Andres Cubillos-Ruiz, Nina M Donghia, Craig R MacNair, Shawn French, Lindsey A Carfrae, Zohar Bloom-Ackermann, et al. A deep learning approach to antibiotic discovery.Cell, 180(4):688–702, 2020. 11

work page 2020
[18]

Pytorch geometric datasets

PyG Team. Pytorch geometric datasets. https://pytorch-geometric.readthedocs.io/en/2.7.0/modules/datasets.html. Accessed: 2025-11-10

work page 2025
[19]

Graph Clustering with Graph Neural Networks

Anton Tsitsulin, John Palowitch, Bryan Perozzi, and Emmanuel Müller. Graph Clustering with Graph Neural Networks

work page
[20]

Graph convolutional neural networks for web-scale recommender systems

Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L Hamilton, and Jure Leskovec. Graph convolutional neural networks for web-scale recommender systems. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pages 974–983, 2018

work page 2018
[21]

Hierarchical Graph Representation Learning with Differentiable Pooling

Zhitao Ying, Jiaxuan You, Christopher Morris, Xiang Ren, Will Hamilton, and Jure Leskovec. Hierarchical Graph Representation Learning with Differentiable Pooling. InAdvances in Neural Information Processing Systems, volume 31. Curran Associates, Inc., 2018. 12 Appendix A.1 Additional results on synthetic data This section presents additional experimental ...

work page 2018

[1] [1]

Spectral Clustering with Graph Neural Networks for Graph Pooling

Filippo Maria Bianchi, Daniele Grattarola, and Cesare Alippi. Spectral Clustering with Graph Neural Networks for Graph Pooling. InProceedings of the 37th International Conference on Machine Learning, pages 874–883. PMLR, November 2020. ISSN: 2640-3498

work page 2020

[2] [2]

Eta prediction with graph neural networks in google maps

Austin Derrow-Pinion, Jennifer She, David Wong, Oliver Lange, Todd Hester, Luis Perez, Marc Nunkesser, Seongjae Lee, Xueying Guo, Brett Wiltshire, et al. Eta prediction with graph neural networks in google maps. InProceedings of the 30th ACM international conference on information & knowledge management, pages 3767–3776, 2021

work page 2021

[3] [3]

Lenssen, and Jure Leskovec

Matthias Fey, Jinu Sunil, Akihiro Nitta, Rishi Puri, Manan Shah, Blaž Stojanović, Ramona Bendias, Alexandria Barghi, Vid Kocijan, Zecheng Zhang, Xinwei He, Jan E. Lenssen, and Jure Leskovec. Pyg 2.0: Scalable learning on real world graphs. InTemporal Graph Learning Workshop @ KDD, 2025

work page 2025

[4] [4]

Resolution limit in community detection.Proceedings of the national academy of sciences, 104(1):36–41, 2007

Santo Fortunato and Marc Barthelemy. Resolution limit in community detection.Proceedings of the national academy of sciences, 104(1):36–41, 2007

work page 2007

[5] [5]

Community detection in networks: A user guide.Physics Reports, 659:1–44, November 2016

Santo Fortunato and Darko Hric. Community detection in networks: A user guide.Physics Reports, 659:1–44, November 2016

work page 2016

[6] [6]

Good, Yves-Alexandre de Montjoye, and Aaron Clauset

Benjamin H. Good, Yves-Alexandre de Montjoye, and Aaron Clauset. Performance of modularity maximization in practical contexts.Physical Review E, 81(4):046106, April 2010. Publisher: American Physical Society

work page 2010

[7] [7]

Understanding Pooling in Graph Neural Networks.IEEE Transactions on Neural Networks and Learning Systems, pages 1–11, 2022

Daniele Grattarola, Daniele Zambon, Filippo Maria Bianchi, and Cesare Alippi. Understanding Pooling in Graph Neural Networks.IEEE Transactions on Neural Networks and Learning Systems, pages 1–11, 2022. Conference Name: IEEE Transactions on Neural Networks and Learning Systems

work page 2022

[8] [8]

New spectral methods for ratio cut partitioning and clustering.IEEE transactions on computer-aided design of integrated circuits and systems, 11(9):1074–1085, 1992

Lars Hagen and Andrew B Kahng. New spectral methods for ratio cut partitioning and clustering.IEEE transactions on computer-aided design of integrated circuits and systems, 11(9):1074–1085, 1992

work page 1992

[9] [9]

Inductive Representation Learning on Large Graphs

Will Hamilton, Zhitao Ying, and Jure Leskovec. Inductive Representation Learning on Large Graphs. InAdvances in Neural Information Processing Systems, volume 30. Curran Associates, Inc., 2017

work page 2017

[10] [10]

Comparing partitions.Journal of Classification, 2(1):193–218, 1985

Lawrence Hubert and Phipps Arabie. Comparing partitions.Journal of Classification, 2(1):193–218, 1985

work page 1985

[11] [11]

A graph placement methodology for fast chip design.Nature, 594(7862):207–212, 2021

Azalia Mirhoseini, Anna Goldie, Mustafa Yazgan, Joe Wenjie Jiang, Ebrahim Songhori, Shen Wang, Young-Joon Lee, Eric Johnson, Omkar Pathak, Azade Nova, et al. A graph placement methodology for fast chip design.Nature, 594(7862):207–212, 2021

work page 2021

[12] [12]

Synthetic network data for ’bounded graph clustering with graph neural networks’.https://doi.org/10.5281/zenodo.17793019,

Kibidi Neocosmos, Diego Baptista, and Nicole Ludwig. Synthetic network data for ’bounded graph clustering with graph neural networks’.https://doi.org/10.5281/zenodo.17793019,

work page doi:10.5281/zenodo.17793019

[13] [13]

M. E. J. Newman. Modularity and community structure in networks.Proceedings of the National Academy of Sciences, 103(23):8577–8582, June 2006. Publisher: Proceedings of the National Academy of Sciences

work page 2006

[14] [14]

Chang, Yu Lei, and Bo Yang

Hongbin Pei, Bugui Wei, Kevin C.-C. Chang, Yu Lei, and Bo Yang. Geom-gcn: Geometric graph convolutional networks. InProceedings of the International Conference on Learning Representations (ICLR), 2020

work page 2020

[15] [15]

Collective classification in network data.AI Magazine, 29(3):93–106, 2008

Prithviraj Sen, Galen Namata, Mustafa Bilgic, Lise Getoor, Brian Galligher, and Tina Eliassi-Rad. Collective classification in network data.AI Magazine, 29(3):93–106, 2008

work page 2008

[16] [16]

Jianbo Shi and J. Malik. Normalized cuts and image segmentation.IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):888–905, August 2000

work page 2000

[17] [17]

A deep learning approach to antibiotic discovery.Cell, 180(4):688–702, 2020

Jonathan M Stokes, Kevin Yang, Kyle Swanson, Wengong Jin, Andres Cubillos-Ruiz, Nina M Donghia, Craig R MacNair, Shawn French, Lindsey A Carfrae, Zohar Bloom-Ackermann, et al. A deep learning approach to antibiotic discovery.Cell, 180(4):688–702, 2020. 11

work page 2020

[18] [18]

Pytorch geometric datasets

PyG Team. Pytorch geometric datasets. https://pytorch-geometric.readthedocs.io/en/2.7.0/modules/datasets.html. Accessed: 2025-11-10

work page 2025

[19] [19]

Graph Clustering with Graph Neural Networks

Anton Tsitsulin, John Palowitch, Bryan Perozzi, and Emmanuel Müller. Graph Clustering with Graph Neural Networks

work page

[20] [20]

Graph convolutional neural networks for web-scale recommender systems

Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L Hamilton, and Jure Leskovec. Graph convolutional neural networks for web-scale recommender systems. In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, pages 974–983, 2018

work page 2018

[21] [21]

Hierarchical Graph Representation Learning with Differentiable Pooling

Zhitao Ying, Jiaxuan You, Christopher Morris, Xiang Ren, Will Hamilton, and Jure Leskovec. Hierarchical Graph Representation Learning with Differentiable Pooling. InAdvances in Neural Information Processing Systems, volume 31. Curran Associates, Inc., 2018. 12 Appendix A.1 Additional results on synthetic data This section presents additional experimental ...

work page 2018