Lost in Serialization: Invariance and Generalization of LLM Graph Reasoners

· 2025 · cs.LG · arXiv 2511.10234

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

While promising, graph reasoners based on Large Language Models (LLMs) lack built-in invariance to symmetries in graph representations. Operating on sequential graph serializations, LLMs can produce different outputs under node reindexing, edge reordering, or formatting changes, raising robustness concerns. We systematically analyze these effects, studying how fine-tuning impacts encoding sensitivity as well generalization on unseen tasks. We propose a principled decomposition of graph serializations into node labeling, edge encoding, and syntax, and evaluate LLM robustness to variations of each of these factors on a comprehensive benchmarking suite. We also contribute a novel set of spectral tasks to further assess generalization abilities of fine-tuned reasoners. Results show that larger (non-fine-tuned) models are more robust. Fine-tuning reduces sensitivity to node relabeling but may increase it to variations in structure and format, while it does not consistently improve performance on unseen tasks.

representative citing papers

Detecting Differences Is Not Understanding Structure: Large Language Models Fail at Graph Isomorphism

cs.CL · 2026-06-08 · unverdicted · novelty 5.0

LLMs succeed at graph isomorphism detection but fail to recognize isomorphic graphs under node label permutation, indicating pattern exploitation over topological understanding.

citing papers explorer

Showing 1 of 1 citing paper.

Detecting Differences Is Not Understanding Structure: Large Language Models Fail at Graph Isomorphism cs.CL · 2026-06-08 · unverdicted · none · ref 30 · internal anchor
LLMs succeed at graph isomorphism detection but fail to recognize isomorphic graphs under node label permutation, indicating pattern exploitation over topological understanding.

Lost in Serialization: Invariance and Generalization of LLM Graph Reasoners

fields

years

verdicts

representative citing papers

citing papers explorer