Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization

Bo Jin; Mingze Zhao; Wenhao Li; Wenwu Li; Yuran Song

arxiv: 2605.30227 · v1 · pith:MTB6XFE7new · submitted 2026-05-28 · 💻 cs.MA · cs.AI

Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization

Wenwu Li , Yuran Song , Mingze Zhao , Bo Jin , Wenhao Li This is my paper

classification 💻 cs.MA cs.AI

keywords creditstructuralsignalstemporalassignmentdiscreteglobalmulti-agent

0 comments

read the original abstract

While Multi-Agent Systems (MAS) empower Large Language Models to tackle complex reasoning tasks through collaborative interaction, optimizing their dynamics remains a formidable challenge due to the discrete, non-differentiable nature of the computation graph and the sparsity of global supervisory signals. Existing black-box optimizers struggle to attribute trajectory-level failure to specific local components, resulting in inefficient, high-variance exploration. We argue that tractable MAS optimization needs structural inductive biases to disentangle error signals. We propose temporal and structural credit assignment, which decomposes the objective along two axes: (i) temporal credit, using state-space bottlenecks to identify critical rounds, and (ii) structural credit, using stationary role policies to isolate agent contributions. Leveraging these decomposed signals, we introduce a discrete, verbalized block coordinate descent algorithm for iterative refinement. Rather than indiscriminate global updates, it alternates between optimizing role prompts and aggregation protocols, using LLM-generated "proxy gradients" to target only the identified weak links. Across diverse reasoning benchmarks, our approach substantially reduces query complexity while improving performance, providing a principled and interpretable path toward self-improving MAS.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MAS-PromptBench: When Does Prompt Optimization Improve Multi-Agent LLM Systems?
cs.LG 2026-06 unverdicted novelty 6.0

A new benchmark study finds that prompt optimization can deliver significant gains in multi-agent LLM systems but its effectiveness varies strongly with task, workflow, communication protocol, and team size.