Anytime Detection of Strategic Deviations in Multi-Agent Systems

Etienne Gauthier; Francis Bach; Michael I. Jordan

arxiv: 2601.05427 · v3 · pith:5HWT3YSQnew · submitted 2026-01-08 · 💻 cs.GT

Anytime Detection of Strategic Deviations in Multi-Agent Systems

Etienne Gauthier , Francis Bach , Michael I. Jordan This is my paper

Pith reviewed 2026-05-25 07:37 UTC · model grok-4.3

classification 💻 cs.GT

keywords e-valuesanytime inferencemulti-agent systemsequilibrium detectionsupermartingalesrepeated gamesstrategic deviationsstochastic games

0 comments

The pith

A sequential e-value framework detects deviations from strategic equilibria in multi-agent interactions without requiring a fixed sample size.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a method to monitor whether agents in repeated games are sticking to equilibrium play or deviating over time. It uses the e-value approach to build a supermartingale that grows when observed payoffs violate the benchmark conditions. This allows continuous testing that remains valid at any stopping time. Such a tool matters because multi-agent systems like markets or autonomous vehicle fleets can drift from expected rational behavior, and early detection helps maintain stability. The approach covers Nash, correlated, and coarse correlated equilibria in normal-form games and extends to stochastic settings.

Core claim

By betting against a known benchmark equilibrium, the method constructs a test supermartingale that accumulates evidence of departure whenever payoffs systematically violate the equilibrium conditions, providing a statistically valid measure of deviation that can be monitored online with finite-time guarantees.

What carries the argument

The test supermartingale derived from the e-value framework by betting against the benchmark payoff conditions.

If this is right

Unified detection for Nash, correlated, and coarse correlated equilibria in repeated normal-form games.
Finite-time guarantees and detailed analysis of detection times.
Increased detection power in large games via Benjamini-Hochberg procedures while controlling false discovery rate.
Extension to stochastic games to verify adherence to target policies online.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Could apply to monitoring AI agents in multi-agent environments for misalignment.
May integrate with online learning algorithms to trigger interventions upon detected deviations.
Potential use in regulatory oversight of algorithmic trading or automated markets.

Load-bearing premise

The benchmark equilibrium is known in advance and observed play can be directly compared to its payoff conditions without extra assumptions on deviation behavior.

What would settle it

A simulation or real-world dataset where agents systematically violate equilibrium payoffs but the supermartingale fails to grow significantly, or where matching play triggers false detection.

read the original abstract

In many multi-agent systems, agents interact repeatedly and are expected to settle into stable, rational behavior over time. Yet in practice, behavior often drifts, and detecting such deviations in real time remains an open challenge. We introduce a sequential testing framework that monitors whether observed play is consistent with a benchmark of strategic behavior, without assuming a fixed sample size. Our approach builds on the e-value framework for safe anytime-valid inference: by "betting" against the benchmark, we construct a test supermartingale that accumulates evidence whenever observed payoffs systematically violate the expected conditions. For repeated normal-form games, we take equilibrium as the benchmark, yielding a statistically sound, interpretable measure of departure from equilibrium that can be monitored online; our framework unifies the treatment of Nash, correlated, and coarse correlated equilibria, offering finite-time guarantees and a detailed analysis of detection times. We also leverage Benjamini-Hochberg-type procedures to increase detection power in large games while rigorously controlling the false discovery rate. Finally, we extend our method to stochastic games, verifying online whether observed trajectories adhere to a specified target policy, such as a computed equilibrium, broadening the framework's applicability to dynamic, state-dependent settings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Applies e-value supermartingales to build anytime-valid tests for deviation from a known benchmark equilibrium, unifying Nash/CCE/CE and extending to stochastic games with target policies.

read the letter

The paper's main move is to take the e-value supermartingale construction for safe anytime inference and use it to monitor repeated play against a fixed equilibrium benchmark. By betting against the payoff inequalities that hold at the benchmark, they produce a process that grows when observed payoffs violate the equilibrium conditions, giving an interpretable online measure of departure with finite-time guarantees. The same template covers Nash, correlated, and coarse correlated equilibria by changing which inequalities serve as the null, and they carry the idea over to stochastic games by testing trajectories against a specified target policy. They also add a Benjamini-Hochberg layer for controlling false discoveries across many games or agents. That unification and the stochastic extension are the concrete new pieces; the rest follows from applying existing supermartingale arguments to game payoff matrices. The construction is clean when the benchmark mixed strategies or correlation device are known exactly, which matches the abstract's framing. The soft spot is exactly the one the stress-test flags: everything rests on the benchmark being given in advance. The conditional expectation step that delivers the supermartingale property uses the known equilibrium payoffs directly; if the benchmark instead has to be estimated from the same stream of play, that step fails and the anytime validity and detection-time bounds no longer hold. The paper does not appear to supply a workaround or robustness result for estimated benchmarks, so the practical scope is narrower than the title might suggest. This is for people working on online monitoring in multi-agent systems or sequential testing in games. A reader who already knows e-values and wants to see them applied to equilibrium conditions will get the most out of it. The work is coherent on its own terms and the claims are specific enough to check, so it deserves a serious referee to verify the supermartingale derivations and the stochastic-game extension.

Referee Report

1 major / 1 minor

Summary. The paper introduces a sequential testing framework for detecting strategic deviations from a benchmark in multi-agent systems. It uses the e-value framework to construct test supermartingales that accumulate evidence against the benchmark (equilibrium in normal-form games or target policy in stochastic games). The approach unifies Nash, correlated, and coarse correlated equilibria with finite-time guarantees, provides online monitoring, applies Benjamini-Hochberg procedures for FDR control in large games, and extends to dynamic settings.

Significance. If the supermartingale construction and finite-time bounds hold under the stated assumptions, the work offers a statistically sound, anytime-valid method for online detection of deviations that unifies multiple equilibrium concepts. The extension to stochastic games and FDR control are potential strengths for practical multi-agent monitoring.

major comments (1)

[Abstract] Abstract and strongest claim: the supermartingale is constructed by betting against payoff inequalities that hold at a known benchmark equilibrium (or target policy). The conditional expectation of the increment is non-positive only when the benchmark is known exactly in advance and the observed play is tested directly against its payoff conditions. No procedure for estimating the benchmark from the data stream is indicated, which would break the martingale property and invalidate the finite-time guarantees and anytime-valid p-values.

minor comments (1)

Ensure that all finite-time detection time analyses and unification claims are accompanied by explicit theorem statements and proof sketches in the main body, with clear statements of the known-benchmark assumption.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the careful review and for identifying the key assumption underlying our supermartingale construction. We address the major comment below.

read point-by-point responses

Referee: [Abstract] Abstract and strongest claim: the supermartingale is constructed by betting against payoff inequalities that hold at a known benchmark equilibrium (or target policy). The conditional expectation of the increment is non-positive only when the benchmark is known exactly in advance and the observed play is tested directly against its payoff conditions. No procedure for estimating the benchmark from the data stream is indicated, which would break the martingale property and invalidate the finite-time guarantees and anytime-valid p-values.

Authors: We agree that the supermartingale property holds precisely when the benchmark is known exactly in advance. This is the setting of the paper: the abstract states that we monitor consistency with 'a benchmark of strategic behavior' and verify adherence to 'a specified target policy, such as a computed equilibrium.' The full manuscript consistently treats the equilibrium (Nash, CCE, or CE) or target policy as given ex ante. No estimation procedure from the data stream is proposed or claimed, precisely because it would invalidate the martingale property. Our contribution is limited to the known-benchmark case, where the finite-time guarantees and anytime-valid p-values are valid. We have added one clarifying sentence in the introduction to make this assumption even more explicit. revision: partial

Circularity Check

0 steps flagged

No circularity; framework applies external e-value methods to a known benchmark

full rationale

The derivation constructs a test supermartingale by betting against payoff inequalities that hold at a pre-specified benchmark equilibrium (Nash, CE or CCE). This property is taken directly from the known payoff matrix and the external e-value framework; the paper does not fit parameters to the observed stream, rename a known result, or rely on self-citations for the supermartingale guarantee. The benchmark is treated as given rather than estimated from the same data, so no self-definitional or fitted-input reduction occurs. The central claims therefore remain independent of the inputs they are applied to.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract only; no free parameters, axioms, or invented entities are specified in the provided text.

pith-pipeline@v0.9.0 · 5736 in / 1026 out tokens · 29449 ms · 2026-05-25T07:37:39.245592+00:00 · methodology

Anytime Detection of Strategic Deviations in Multi-Agent Systems

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)