pith. sign in

Predictive Maps of Multi-Agent Reasoning: A Successor-Representation Spectrum for LLM Communication Topologies

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

Practitioners deploying multi-agent large language model (LLM) systems must currently choose between communication topologies such as chain, star, mesh, and richer variants without any pre-inference diagnostic for which topology will amplify drift, converge to consensus, or remain robust under perturbation. Existing evaluation answers these questions only post hoc and only for the task measured. We introduce a structural diagnostic for multi-agent LLM communication graphs based on the successor representation $M = (I - \gamma P)^{-1}$ of the row-stochastic communication operator, and we connect three of its spectral quantities, the spectral radius $\rho(M)$, the spectral gap $\Delta(M)$, and the condition number $\kappa(M)$, to three distinct failure modes. We derive closed-form spectra for the chain, star, and mesh under row-stochastic normalization, and validate the predictions on a 12-step structured state-tracking task with Qwen2.5-7B-Instruct over 100 independent trials. The condition number is a perfect rank-order predictor of empirical perturbation robustness ($r_s = 1.0$); the spectral gap partially predicts consensus dynamics ($r_s = 0.5$); and the spectral radius is perfectly \emph{inverted} with respect to cumulative error ($r_s = -1.0$). We trace this inversion to a regime in which linear spectra are blind to non-contracting bias drift, and we propose an affine-noise extension of the predictive map that recovers the empirical ordering. We read this as a first step toward representational, drift-aware structural diagnostics for multi-agent LLM systems, sitting alongside classical spectral and consensus theory.

fields

cs.CL 1

years

2026 1

verdicts

UNVERDICTED 1

clear filters

representative citing papers

ProvenAI: Provenance-Native Traces of Evidence in Generated Answers

cs.CL · 2026-06-24 · unverdicted · novelty 5.0

ProvenAI measures transparency in multi-hop QA via answer correctness, citation fidelity, and ablation-based document influence on HotpotQA, reporting 53.53% accuracy and 71.55% fidelity while identifying a citation-influence gap.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • ProvenAI: Provenance-Native Traces of Evidence in Generated Answers cs.CL · 2026-06-24 · unverdicted · none · ref 18 · internal anchor

    ProvenAI measures transparency in multi-hop QA via answer correctness, citation fidelity, and ablation-based document influence on HotpotQA, reporting 53.53% accuracy and 71.55% fidelity while identifying a citation-influence gap.