GATA2Floor applies multi-head GATv2 on facade graphs to predict building floor counts and softly assign elements to latent floors, with a label-free version using self-supervised features.
GATA2Floor: Graph attention for floor counting in street-view facades
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
Automated analysis of building facades from street-level imagery has great potential for urban analytics, energy assessment, and emergency planning. However, it requires reasoning over spatially arranged elements rather than solely isolated detections. In this work, we model each facade as a graph over window/door detections with a vertical prior on edges. Additionally, we introduce GATA2Floor, a multi-head Graph Attention v2 (GATv2) based model that predicts the global floor count of a building and, via learnable cross-attention queries, softly assigns elements to latent floor slots, yielding interpretable outputs and robustness to irregular designs. To mitigate the lack of labeled datasets, we demonstrate that the proposed graph-based reasoning can be applied without annotations by leveraging a lightweight label-free proposal mechanism based on self-supervised features and vision-language scoring. Our approach demonstrates the value of graph-attention-based relational reasoning for facade understanding.
citation-role summary
citation-polarity summary
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1roles
background 1polarities
background 1representative citing papers
citing papers explorer
-
GATA2Floor: Graph attention for floor counting in street-view facades
GATA2Floor applies multi-head GATv2 on facade graphs to predict building floor counts and softly assign elements to latent floors, with a label-free version using self-supervised features.