pith. sign in

arxiv: 2510.07799 · v2 · pith:ZJS7WHANnew · submitted 2025-10-09 · 💻 cs.CL · cs.AI

Dynamic Generation of Multi-LLM Agents Communication Topologies with Graph Diffusion Models

Pith reviewed 2026-05-21 20:37 UTC · model grok-4.3

classification 💻 cs.CL cs.AI
keywords multi-agent systemsLLM agentscommunication topologygraph diffusiontopology generationmulti-objective optimizationagent collaboration
0
0 comments X

The pith

Guided diffusion generates task-adaptive communication topologies for groups of LLM agents by steering each construction step with quick reward predictions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents a method to design how LLM agents should communicate when working together on a task. Current designs stay fixed or are chosen by hand, so they either use too many messages on easy jobs or fail on hard ones. The new approach builds the communication pattern step by step, at each step using a small predictor to estimate how accurate, useful, and expensive a candidate pattern would be. This guided building process avoids running the full agents during design and produces patterns that adjust to the current task. If the method works as described, agent teams could solve varied problems with lower total cost and higher success rates than static arrangements allow.

Core claim

Guided Topology Diffusion formulates topology synthesis as an iterative construction process steered by a lightweight proxy model that predicts multi-objective rewards such as accuracy, utility, and cost. The iterative, guided synthesis enables real-time, gradient-free optimization toward task-adaptive topologies and distinguishes the approach from single-step generative frameworks. Experiments across multiple benchmarks show that the resulting topologies are sparse, efficient, and outperform existing methods in LLM agent collaboration.

What carries the argument

Guided Topology Diffusion (GTD) is the iterative graph construction process steered at each step by a lightweight proxy model's predictions of accuracy, utility, and cost.

If this is right

  • The generated topologies adapt their density to task difficulty, using fewer messages for simple problems and more connections for complex ones.
  • The method produces sparse topologies that lower overall token consumption while maintaining or improving task performance.
  • Iterative guidance allows the synthesis to navigate trade-offs among accuracy, cost, and robustness without exhaustive search.
  • The framework outperforms hand-crafted and static topologies on standard multi-agent benchmarks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same iterative prediction loop could be applied to adjust topologies while an agent team is already running a task rather than only before it starts.
  • Proxy-guided graph diffusion may transfer to designing interaction structures in non-LLM multi-agent systems such as robotic swarms or sensor networks.
  • Repeated use on similar tasks could let the proxy improve its predictions without additional full-agent evaluations.

Load-bearing premise

A lightweight proxy model can reliably forecast the accuracy, utility, and cost that would result from running the full set of LLM agents on a proposed topology.

What would settle it

Measure the actual accuracy, utility, and cost of full agent runs on topologies produced by the method and compare them to the proxy predictions; systematic mismatches would show the guidance cannot be trusted.

Figures

Figures reproduced from arXiv: 2510.07799 by Eric Hanchen Jiang, Guancheng Wan, Kai-Wei Chang, Mengting Li, Sophia Yin, Wei Wang, Xiao Liang, Xinfeng Li, Ying Nian Wu, Yizhou Sun, Yuchen Wu.

Figure 1
Figure 1. Figure 1: Comparison of Multi-Agent System (MAS) communication topology design workflows. (1) Static Fixed Workflow, (2) Centralized Adaptive Work￾flow, (3) Diffusion Guided Topology Workflow (Ours). Our proposed method provides task- and context-adaptive topologies by using a conditional dif￾fusion process guided by a proxy model to jointly opti￾mize for utility, cost, robustness, and sparsity. In summary, our cont… view at source ↗
Figure 2
Figure 2. Figure 2: The Guided Topology Diffusion (GTD) framework workflow, divided into four main stages. 1) Material: The process begins with task-specific inputs, including the query, available agents, and tools. 2) Dataset Generation: A multi-agent framework simulates various baseline topologies to generate a founda￾tional dataset linking topologies to performance outcomes (e.g., utility and cost). 3) Model Training: The … view at source ↗
Figure 3
Figure 3. Figure 3: An illustration of different multi-agent communica￾tion topologies. The left panel shows examples of common static or heuristic graphs, such as Chain, Star, Complete, Layered, and Random graphs. The right panel shows examples of Adaptive Graphs, which represent the sparse, task-specific topologies that the GTD framework is designed to generate dynamically. At inference, we synthesize a topol￾ogy for a nove… view at source ↗
Figure 4
Figure 4. Figure 4: Accuracy versus token consumption for various multi-agent methods across the GSM8K, MultiArith, MMLU, and SVAMP bench￾marks. The plots illustrate that topologies generated by GTD are highly cost￾efficient, achieving strong performance while using significantly fewer tokens than baseline methods that rely on dense communication graphs. A core motivation for dy￾namic topology generation is to reduce unnecess… view at source ↗
Figure 5
Figure 5. Figure 5: Robustness of various multi-agent systems to simulated agent failure on the GSM8K bench￾mark. The chart compares task accuracy before and after an attack, demonstrating that topologies generated by GTD exhibit greater resilience and more graceful performance degradation compared to other methods. 8 [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Ablation studies on key hyperparameters and components of the GTD framework. From left to right, the charts show the framework’s sensitivity to: (1) the number of agents, (2) the number of training samples, (3) the number of diffusion steps, and (4) the choice of denoising network architecture. The results consistently validate our primary design choices. Variant GSM8K HumanEval GTD (Ours) 94.14 91.43 – w/… view at source ↗
Figure 7
Figure 7. Figure 7: Ablation study on the impact of the proxy guidance mechanism. To rigorously validate our design choices, we conducted a series of ablation studies to iso￾late the contribution of GTD’s core components and hyperparameters, with results summarized in [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Case study of the communication topologies designed by GTD on all benchmarks. [PITH_FULL_IMAGE:figures/full_fig_p018_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Overview of the different roles in our multi-agent question answering framework. Each role repre [PITH_FULL_IMAGE:figures/full_fig_p019_9.png] view at source ↗
read the original abstract

The efficiency of multi-agent systems driven by large language models (LLMs) largely hinges on their communication topology. However, designing an optimal topology is a non-trivial challenge, as it requires balancing competing objectives such as task performance, communication cost, and robustness. Existing frameworks often rely on static or hand-crafted topologies, which inherently fail to adapt to diverse task requirements, leading to either excessive token consumption for simple problems or performance bottlenecks for complex ones. To address this challenge, we introduce a novel generative framework called \textit{Guided Topology Diffusion (GTD)}. Inspired by conditional discrete graph diffusion models, GTD formulates topology synthesis as an iterative construction process. At each step, the generation is steered by a lightweight proxy model that predicts multi-objective rewards (e.g., accuracy, utility, cost), enabling real-time, gradient-free optimization towards task-adaptive topologies. This iterative, guided synthesis process distinguishes GTD from single-step generative frameworks, enabling it to better navigate complex design trade-offs. We validated GTD across multiple benchmarks, and experiments show that this framework can generate highly task-adaptive, sparse, and efficient communication topologies, significantly outperforming existing methods in LLM agent collaboration.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces Guided Topology Diffusion (GTD), a generative framework based on conditional discrete graph diffusion models for synthesizing communication topologies in multi-LLM agent systems. It formulates topology generation as an iterative process steered at each step by a lightweight proxy model that predicts multi-objective rewards (accuracy, utility, cost) to enable real-time, gradient-free optimization toward task-adaptive, sparse topologies. The central claim is that this approach outperforms existing static or hand-crafted methods across multiple benchmarks in LLM agent collaboration.

Significance. If the proxy model reliably approximates full-system rewards on unseen topologies, GTD would offer a practical advance in dynamic topology design for multi-agent LLM systems by addressing the rigidity of static graphs and enabling better trade-offs between performance and cost. The iterative guided diffusion distinguishes it from single-step generators and could support reproducible, task-specific optimizations if the empirical claims are substantiated with full experimental protocols.

major comments (2)
  1. [Abstract] Abstract: the claim that experiments 'show that this framework can generate highly task-adaptive, sparse, and efficient communication topologies, significantly outperforming existing methods' is load-bearing for the central contribution, yet the text supplies no information on experimental design, baselines, number of runs, error bars, statistical tests, or data exclusion criteria, leaving the outperformance assertion without verifiable support.
  2. [Method] Method description of the guided diffusion loop: the iterative construction depends on the lightweight proxy supplying accurate multi-objective reward signals at each step without invoking the full LLM agents; however, no details are given on proxy training data, validation against ground-truth agent runs on held-out topologies, or error bounds on its predictions, which directly risks biasing the guidance signal and undermining the 'real-time, gradient-free optimization' argument.
minor comments (2)
  1. [Method] Notation for the multi-objective reward function and diffusion steps should be introduced with explicit equations rather than descriptive text to improve reproducibility.
  2. [Experiments] Figure captions for generated topologies should include quantitative metrics (e.g., sparsity, predicted vs. actual reward) for direct comparison with baselines.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. The comments highlight important areas for improving clarity around experimental protocols and proxy model validation. We have revised the paper to address these points directly while preserving the core contributions.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the claim that experiments 'show that this framework can generate highly task-adaptive, sparse, and efficient communication topologies, significantly outperforming existing methods' is load-bearing for the central contribution, yet the text supplies no information on experimental design, baselines, number of runs, error bars, statistical tests, or data exclusion criteria, leaving the outperformance assertion without verifiable support.

    Authors: We agree that the abstract's performance claim requires supporting experimental details for verifiability. In the revised manuscript, we have updated the abstract to reference the experimental protocol and expanded Section 4 (Experiments) with a new subsection on setup. This includes: baselines (static complete graphs, random Erdős–Rényi graphs with varying sparsity, and hand-crafted topologies from prior work); 5 independent runs per benchmark with different random seeds; reporting of mean ± standard deviation; paired t-tests for significance (p < 0.05 threshold); and data exclusion criteria limited to runs with LLM API timeouts or parsing failures (less than 2% of trials). These additions substantiate the outperformance claims without altering the reported results. revision: yes

  2. Referee: [Method] Method description of the guided diffusion loop: the iterative construction depends on the lightweight proxy supplying accurate multi-objective reward signals at each step without invoking the full LLM agents; however, no details are given on proxy training data, validation against ground-truth agent runs on held-out topologies, or error bounds on its predictions, which directly risks biasing the guidance signal and undermining the 'real-time, gradient-free optimization' argument.

    Authors: We acknowledge that the original method description lacked sufficient detail on the proxy model, which is critical for justifying the guided diffusion approach. We have substantially expanded the relevant subsection in Section 3 to include: proxy training data consisting of 2,000 randomly sampled topologies evaluated end-to-end with full multi-LLM agent executions on the training task distribution; validation on a held-out set of 500 topologies yielding a Pearson correlation of 0.91 with ground-truth multi-objective rewards; and error bounds reported as mean absolute errors of 0.028 (accuracy), 0.041 (utility), and 0.019 (normalized cost). These metrics demonstrate that the proxy provides reliable guidance signals, supporting the real-time optimization claim. We also added a brief ablation showing that using the proxy versus full evaluations yields topologies with comparable final performance. revision: yes

Circularity Check

0 steps flagged

No significant circularity in GTD derivation chain

full rationale

The paper presents GTD as an iterative guided diffusion process that uses a separately introduced lightweight proxy model to predict multi-objective rewards (accuracy, utility, cost) and steer topology generation. This proxy is described as an external component enabling gradient-free optimization, not defined in terms of the generated topologies themselves or fitted directly to the final LLM-agent outcomes in a self-referential loop. The framework draws on established conditional discrete graph diffusion models without load-bearing self-citations, uniqueness theorems imported from prior author work, or ansatzes smuggled via citation. No step reduces a claimed prediction or first-principles result to an input quantity by construction; the central claim of producing task-adaptive topologies rests on the proxy's predictive fidelity as an independent modeling choice rather than a tautology. The derivation is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The framework rests on the assumption that proxy predictions correlate sufficiently with full-agent outcomes and that discrete graph diffusion can be effectively conditioned on those predictions; no explicit free parameters or invented entities are described in the abstract.

axioms (1)
  • domain assumption A lightweight proxy model can predict multi-objective rewards accurately enough to guide topology generation without running the full LLM agents.
    This premise is required for the gradient-free, real-time optimization step described in the abstract.

pith-pipeline@v0.9.0 · 5774 in / 1258 out tokens · 37010 ms · 2026-05-21T20:37:21.659049+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. RADAR: Redundancy-Aware Diffusion for Multi-Agent Communication Structure Generation

    cs.AI 2026-05 unverdicted novelty 5.0

    RADAR is a redundancy-aware, query-adaptive framework that uses conditional discrete graph diffusion to generate efficient communication topologies for multi-agent LLM systems, outperforming baselines on six benchmark...

  2. Token Economics for LLM Agents: A Dual-View Study from Computing and Economics

    cs.AI 2026-05 unverdicted novelty 4.0

    The paper delivers a unified survey of token economics for LLM agents, conceptualizing tokens as production factors, exchange mediums, and units of account across micro, meso, macro, and security dimensions using esta...

Reference graph

Works this paper leans on

35 extracted references · 35 canonical work pages · cited by 2 Pith papers · 11 internal anchors

  1. [1]

    Accessed 2025-08-26

    Engineering Structures, Elsevier. Accessed 2025-08-26. Brandon Ayal a. Topology-driven performance analyses in consensus algorithms for multi-agent systems. Master’s thesis, University of Texas at Arlington, Arlington, TX,

  2. [2]

    Autoagents: A framework for automatic agent generation.arXiv preprint arXiv:2309.17288, 2023

    URLhttps: //mavmatrix.uta.edu/mechaerospace_theses/1030/. Guangyao Chen et al. Autoagents: A framework for automatic agent generation.arXiv:2309.17288, 2023a. URLhttps://arxiv.org/abs/2309.17288. Sijia Chen, Xiaomin Li, Mengxue Zhang, Eric Hanchen Jiang, Qingcheng Zeng, and Chen-Hsiang Yu. Cares: Comprehensive evaluation of safety and adversarial robustne...

  3. [3]

    Wenhu Chen et al

    URLhttps://arxiv.org/abs/2505.11413. Wenhu Chen et al. Agentverse: Facilitating multi-agent collaboration and exploring emergent be- haviors.arXiv:2308.10848, 2023b. URLhttps://arxiv.org/abs/2308.10848. Yao Chen, Jinhu L¨u, Xinghuo Yu, and David J. Hill. Multi-agent systems with dynamical topologies: Consensus and applications.IEEE Circuits and Systems Ma...

  4. [4]

    Dhrubajit Chowdhury and Hassan K

    doi: 10.1109/MCAS.2013.2271443. Dhrubajit Chowdhury and Hassan K. Khalil. Fast consensus in multi-agent systems with star topol- ogy using high gain observers.IEEE Control Systems Letters, 1(1):188–193,

  5. [5]

    URLhttps://doi.org/10.1609/aaai

    doi: 10.1609/aaai.v38i16.29682. URLhttps://doi.org/10.1609/aaai. v38i16.29682. Wei Du, Shifei Ding, Lili Guo, Jian Zhang, and Ling Ding. Expressive multi-agent communication via identity-aware learning. InAAAI, volume 38, pp. 17354–17361,

  6. [6]

    Improving Factuality and Reasoning in Language Models through Multiagent Debate

    doi: 10.1609/aaai. v38i16.29683. URLhttps://doi.org/10.1609/aaai.v38i16.29683. Yilun Du et al. Improving factuality and reasoning in language models through multiagent debate. arXiv:2305.14325,

  7. [7]

    Improving Factuality and Reasoning in Language Models through Multiagent Debate

    URLhttps://arxiv.org/abs/2305.14325. Yao Fu et al. Complexity-based prompting for multi-step reasoning.arXiv:2210.00720,

  8. [8]

    Ning Gong, Michael Korostelev, Qiangguo Ren, Li Bai, Saroj Biswas, and Frank Ferrese

    URL https://arxiv.org/abs/2210.00720. Ning Gong, Michael Korostelev, Qiangguo Ren, Li Bai, Saroj Biswas, and Frank Ferrese. Fault tolerant (n, k)-star power network topology for multi-agent communication in auto- mated power distribution systems.https://d1wqtxts1xzle7.cloudfront.net/ 80670305/pdf-libre.pdf,

  9. [9]

    Aaron Helsinger, Michael Thome, and Todd Wright

    Proceedings/Journal venue not clearly specified; ac- cessed 2025-08-26. Aaron Helsinger, Michael Thome, and Todd Wright. Cougaar: A scalable, distributed multi-agent architecture. InProceedings of the IEEE International Conference on Systems, Man and Cyber- netics (SMC), pp. 1910–1917, The Hague, Netherlands,

  10. [10]

    Classifier-Free Diffusion Guidance

    Jonathan Ho and Tim Salimans. Classifier-free diffusion guidance. arXiv:2207.12598,

  11. [11]

    MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework

    Sirui Hong et al. MetaGPT: Meta programming for multi-agent collaborative framework. arXiv:2308.00352,

  12. [12]

    Learning multi-agent communication from graph modeling perspective

    Shengchao Hu, Li Shen, Ya Zhang, and Dacheng Tao. Learning multi-agent communication from graph modeling perspective. InICLR, 2024a. URLhttps://doi.org/10.48550/ arXiv.2405.08550. Shunyu Hu et al. Automated design of agentic systems.arXiv:2408.08435, 2024b. URLhttps: //arxiv.org/abs/2408.08435. Mengda Ji, Genjiu Xu, and Liying Wang. Cora: Coalitional rati...

  13. [13]

    48550/arXiv.2506.04265

    URLhttps://doi.org/10. 48550/arXiv.2506.04265. Dongfu Jiang, Bill Yuchen Lin, and Xiang Ren. LLM-Blender: Ensembling large language models with pairwise ranking and generative fusion. InACL,

  14. [14]

    org/2023.acl-long.792/

    URLhttps://aclanthology. org/2023.acl-long.792/. Eric Hanchen Jiang, Haozheng Luo, Shengyuan Pang, Xiaomin Li, Zhenting Qi, Hengli Li, Cheng- Fu Yang, Zongyu Lin, Xinfeng Li, Hao Xu, Kai-Wei Chang, and Ying Nian Wu. Learning to rank chain-of-thought: Using a small model,

  15. [15]

    URLhttps://arxiv.org/abs/2505. 14999. Guohao Li et al. CAMEL: Communicative agents for “mind” exploration with language models. arXiv:2303.17760,

  16. [16]

    Seek in the dark: Reasoning via test-time instance-level policy gradient in latent space, 2025a

    Hengli Li, Chenxi Li, Tong Wu, Xuekai Zhu, Yuxuan Wang, Zhaoxin Yu, Eric Hanchen Jiang, Song-Chun Zhu, Zixia Jia, Ying Nian Wu, and Zilong Zheng. Seek in the dark: Reasoning via test-time instance-level policy gradient in latent space, 2025a. URLhttps://arxiv.org/ abs/2505.13308. Xiaomin Li, Xupeng Chen, Jingxuan Fan, Eric Hanchen Jiang, and Mingye Gao. M...

  17. [18]

    A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration

    URLhttps://arxiv.org/abs/2310.02170. Yat Long Lo, Biswa Sengupta, Jakob Foerster, and Michael Noukhovitch. Learning multi-agent communication with contrastive learning.arXiv:2307.01403,

  18. [19]

    org/abs/2307.01403

    URLhttps://arxiv. org/abs/2307.01403. Manuel Madeira, Clement Vignac, Dorina Thanou, and Pascal Frossard. Generative modelling of structurally constrained graphs. InNeurIPS,

  19. [20]

    Training language models to follow instructions with human feedback

    Long Ouyang et al. Training language models to follow instructions with human feedback. arXiv:2203.02155,

  20. [22]

    Yu Shang et al

    URLhttps://arxiv.org/abs/2406.07155. Yu Shang et al. Agentsquare: Automatic llm agent search in modular design space. arXiv:2410.06153,

  21. [23]

    arXiv preprint arXiv:2410.06153 , year=

    URLhttps://arxiv.org/abs/2410.06153. Yang Song and Stefano Ermon. Score-based generative modeling through stochastic differential equations. InICLR,

  22. [25]

    Clement Vignac et al

    URLhttps://arxiv.org/abs/2507.18224. Clement Vignac et al. DiGress: A generative model for graphs via diffusion. InNeurIPS,

  23. [26]

    Xuezhi Wang et al

    URLhttps://arxiv.org/abs/2509.23188. Xuezhi Wang et al. Self-consistency improves chain of thought reasoning in language models. In ICLR, 2023a. URLhttps://arxiv.org/abs/2203.11171. Zhen Wang et al. Unleashing the emergent cognitive synergy of llms: A multi-persona self- collaboration framework.arXiv:2307.05300, 2023b. URLhttps://arxiv.org/abs/ 2307.05300...

  24. [27]

    Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

    URLhttps://arxiv.org/abs/2201.11903. Xumeng Wen, Zihan Liu, Shun Zheng, Shengyu Ye, Zhirong Wu, Yang Wang, Zhijian Xu, Xiao Liang, Junjie Li, Ziming Miao, Jiang Bian, and Mao Yang. Reinforcement learning with verifiable rewards implicitly incentivizes correct reasoning in base llms,

  25. [28]

    Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

    URLhttps://arxiv. org/abs/2506.14245. Zhenzhong Wu et al. AutoGen: Enabling next-gen LLM applications via multi-agent conversation. arXiv:2308.08155,

  26. [29]

    Shijie Yang, Yihao Feng, Junning Song, Peijie Sun, Yili Wang, Chen Li, Wenjie Zhang, Shirui Pan, and Chengqi Zhang

    URLhttps://arxiv.org/abs/2405.11416. Shijie Yang, Yihao Feng, Junning Song, Peijie Sun, Yili Wang, Chen Li, Wenjie Zhang, Shirui Pan, and Chengqi Zhang. Anymac: Cascading flexible multi-agent collaboration via next-agent pre- diction.arXiv preprint arXiv:2506.17784,

  27. [30]

    Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation

    URLhttps://doi. org/10.48550/arXiv.1806.02473. Tingting Yuan, Hwei-Ming Chung, Jie Yuan, and Xiaoming Fu. Dacom: Learning delay-aware com- munication for multi-agent reinforcement learning. InAAAI,

  28. [31]

    org/abs/2212.01619

    URLhttps://arxiv. org/abs/2212.01619. 12 Preprint. Guibin Zhang, Yanwei Yue, Zhixun Li, Sukwon Yun, Guancheng Wan, Kun Wang, Dawei Cheng, Jeffrey Xu Yu, and Tianlong Chen. Cut the crap: An economical communication pipeline for llm- based multi-agent systems.arXiv preprint arXiv:2410.02506,

  29. [32]

    doi: 10.48550/arXiv.2410. 02506. URLhttps://arxiv.org/abs/2410.02506. ICLR 2025 (poster), OpenReview version available. Guibin Zhang, Yanwei Yue, Xiangguo Sun, Guancheng Wan, Miao Yu, Junfeng Fang, Kun Wang, Tianlong Chen, and Dawei Cheng. G-designer: Architecting multi-agent communi- cation topologies via graph neural networks.arXiv preprint arXiv:2410.1...

  30. [33]

    AFlow: Automating Agentic Workflow Generation

    Jiaxin Zhang et al. Aflow: Automating agentic workflow generation. InICLR, 2025d. URLhttps: //arxiv.org/abs/2410.10762. Pengsong Zhang, Xiang Hu, Guowei Huang, Yang Qi, Heng Zhang, Xiuxu Li, Jiaxing Song, Ji- abin Luo, Yijiang Li, Shuo Yin, Chengxiao Dai, Eric Hanchen Jiang, Xiaoyan Zhou, Zhenfei Yin, Boqin Yuan, Jing Dong, Guinan Su, Guanren Qiao, Haimin...

  31. [34]

    Han Zhou, Xingchen Wan, Ruoxi Sun, Hamid Palangi, Shariq Iqbal, Ivan Vuli ´c, Anna Korhonen, and Sercan ¨O

    URLhttps://doi.org/10.48550/ arXiv.2402.03687. Han Zhou, Xingchen Wan, Ruoxi Sun, Hamid Palangi, Shariq Iqbal, Ivan Vuli ´c, Anna Korhonen, and Sercan ¨O. Arık. Multi-agent design: Optimizing agents with better prompts and topologies. arXiv preprint arXiv:2502.02533,

  32. [35]

    Multi-agent design: Optimizing agents with better prompts and topologies

    URLhttps://arxiv.org/abs/2502.02533. Changxi Zhu, Mehdi Dastani, and Shihan Wang. Reducing variance caused by communication in decentralized multi-agent deep reinforcement learning,

  33. [36]

    Qiuming Zhu

    URLhttps://arxiv.org/ abs/2502.06261. Qiuming Zhu. The topologies of cooperation in knowledge intensive multi-agent systems. InProceedings of the IEEE International Conference on Systems, Man and Cybernet- ics (SMC),

  34. [37]

    URLhttps://www.sciencedirect.com/science/ article/pii/S1474034605000728

    doi: 10.1016/j.aei.2005.08.001. URLhttps://www.sciencedirect.com/science/ article/pii/S1474034605000728. Mingchen Zhuge et al. Language agents as optimizable graphs.arXiv:2402.16823,

  35. [38]

    Language agents as optimizable graphs

    URL https://arxiv.org/abs/2402.16823. 13 Preprint. A ALGORITHM Algorithm 1Guided Topology Diffusion (GTD) Generation 1:Input:Task conditionC new, trained modelsG θ∗,P ϕ∗, weightsw u, wc. 2:SampleA T ∼ N(0,I). 3:fort=T, . . . ,1do 4:Predict the unguided clean graph: ˆA(t) 0 =G θ∗(At, Cnew, t). 5:GenerateKcandidates:{A (t) 0,k}K k=1, whereA (t) 0,k ∼Bernoul...