pith. sign in

arxiv: 2605.15573 · v1 · pith:5QNVXKQ7new · submitted 2026-05-15 · 💻 cs.CL · cs.LG· cs.MA

Response-Conditioned Parallel-to-Sequential Orchestration for Multi-Agent Systems

Pith reviewed 2026-05-20 19:24 UTC · model grok-4.3

classification 💻 cs.CL cs.LGcs.MA
keywords multi-agent LLM systemsparallel sequential hybridresponse-conditioned policysparse DAG communicationpolicy gradient traininggeneralizable orchestrationacyclic graph construction
0
0 comments X

The pith

A lightweight transformer learns to predict when to insert one sequential communication step after parallel LLM agent responses, and the same policy works across different agent counts, tasks, and base models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents Nexa as a hybrid system that first lets multiple LLM agents answer in parallel, embeds those responses, and then uses a small transformer to output a sparse directed graph deciding whether any agents should exchange messages in one additional step. This setup is meant to cut unnecessary communication and latency while still gaining accuracy from collaboration when the responses indicate it would help. The policy trains via policy gradients directly on the response embeddings and needs no separate reward models or external judges. The resulting graph is always acyclic by construction, and pure parallel execution is recovered exactly when the graph is empty. Experiments show the policy trained in one configuration can be reused when the number of agents, the task, or the underlying models change.

Core claim

Nexa formalizes hybrid execution as a response-conditioned policy that, after parallel responses, predicts a sparse directed acyclic graph; an empty graph leaves the system in pure parallel mode while a non-empty graph triggers exactly one round of sequential message propagation. The framework is acyclic by construction, strictly contains pure parallel execution, and the policy is trained with policy-gradient optimization on response embeddings without external LLM judges or test-time topology search. The learned policy transfers across changes in agent count, task, or base agent models.

What carries the argument

Nexa, the response-conditioned policy realized as a lightweight transformer that maps parallel response embeddings to a sparse directed acyclic communication graph for optional one-step sequential refinement.

If this is right

  • Pure parallel execution occurs automatically whenever the predicted graph is empty.
  • The communication graph remains acyclic because of the way the policy constructs it.
  • The hybrid method includes pure parallel execution with zero added communication cost as a special case.
  • A policy trained once can be reused without retraining when the number of agents, the task, or the base models change.
  • Training requires only policy-gradient updates on response embeddings and avoids external LLM judges or hand-designed topology search.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The transfer result suggests the policy could support runtime changes in agent population without retraining the orchestrator.
  • Because the method embeds responses rather than relying on fixed topologies, it might reduce manual engineering when new multi-agent tasks are introduced.
  • The single-step sequential addition could be tested for extension to two or three conditioned steps on tasks where deeper refinement is known to help.
  • Absence of a separate reward model may allow the same policy head to attach to pipelines that already score final answer quality.

Load-bearing premise

A lightweight transformer trained via policy gradient on response embeddings can reliably predict useful sparse communication graphs that improve accuracy without external judges or reward models.

What would settle it

Train the policy on responses from one task and agent count, then apply it unchanged to a new task or different agent count and check whether accuracy rises or latency falls compared with pure parallel execution on the new setting.

Figures

Figures reproduced from arXiv: 2605.15573 by Alex Iacob, Karthik Nandakumar, Lorenzo Sani, Meghdad Kurmanji, Nicholas D. Lane, Nurbek Tastan, Samuel Horvath.

Figure 1
Figure 1. Figure 1: Accuracy-cost tradeoff for multi-agent system baselines across three tasks. Each point cor [PITH_FULL_IMAGE:figures/full_fig_p007_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Agent-count transfer for NEXA. The pol￾icy is trained with N=10 Qwen2.5-1.5B agents and evaluated without retraining at N∈{5, . . . , 20}. Number of agents. We first examine gen￾eralizability across the number of agents. NEXA is trained with N = 10 agents and evaluated without retraining for N ∈ {5, 10, 15, 20}, keeping the task and agent backbone fixed. This setting tests whether the learned graph policy … view at source ↗
Figure 3
Figure 3. Figure 3: Task-transfer comparison for NEXA on Qwen2.5-1.5B. Model scale generalizability. We then evaluate whether the learned communication policy transfers across model scales. NEXA is trained using Qwen2.5-1.5B agents and evaluated with￾out retraining on Qwen2.5-7B agents, then compared against a policy trained directly with Qwen2.5-7B agents. As shown in [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Model-scale transfer for NEXA. Model generation transfer. Finally, we eval￾uate whether the learned communication policy remains usable when the underlying model is updated to a newer generation. NEXA trained on Qwen2.5-1.5B is evaluated without retraining on Qwen3.5-2B [Qwen Team, 2026] and compared against a policy trained directly on Qwen3.5-2B. At N = 5, the transferred policy reaches 77.40, compared w… view at source ↗
Figure 5
Figure 5. Figure 5: Model-generation transfer for NEXA. We further analyze how NEXA changes answers after communica￾tion by decomposing each example according to whether the initial draft (parallel execution responses) and final answer (sequential execution responses) are correct [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Policy behavior analysis for [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Policy behavior analysis for NEXA with Qwen2.5-1.5B agents on GSM8K. Rescue, harm, and preservation rates are computed by comparing each initial draft answer with the final answer after communication. NEXA rescues 15.6%-20.6% of initially wrong answers while preserving 92.8%-94.4% of initially correct answers across tested team sizes. We also observe that, as we scale the capability of the backbone, it lea… view at source ↗
Figure 8
Figure 8. Figure 8: Communication sparsity for NEXA on GSM8K with Qwen2.5-1.5B and Qwen2.5-7B agents. We report the fraction of examples whose predicted communication graph uses at most half of the possible edges. Across both model sizes, NEXA frequently selects low-edge plans, indicating that the learned policy does not rely on dense all-to-all communication as team size increases. E Experimental Settings Compute resources. … view at source ↗
read the original abstract

Multi-agent systems can solve complex tasks through collaboration between multiple Large Language Model agents. Existing collaboration frameworks typically operate in either a parallel or a sequential mode. In the parallel mode, agents respond independently to queries followed by aggregation of responses. In contrast, sequential systems allow agents to communicate via a directed topology and refine one another step by step. However, both modes are inadequate for achieving the desired objectives of minimizing communication and latency while simultaneously maximizing the accuracy of the final response. In this work, we introduce a hybrid paradigm called Nexa, a trainable response-conditioned policy that bridges the gap between the two modes. Nexa begins with a parallel execution stage, embeds the resulting responses into a shared semantic space, and then predicts a sparse directed acyclic communication graph. If the graph is empty, the system remains purely parallel; if it is non-empty, the system performs one sequential message propagation. The policy is a lightweight transformer model, and the method avoids the need for external LLM judges or reward models, as well as hand-crafted test-time topology search. We formalize this hybrid execution problem, show that the resulting graph is acyclic by construction, and that the framework strictly subsumes pure parallel execution, and present a training procedure based on policy-gradient optimization. Results demonstrate that the response-conditioned policy learned by Nexa under one setting can be reused when the number of agents, the task, or the underlying agent changes, thus emphasizing the generalizability of the learned communication policy.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces Nexa, a hybrid orchestration framework for multi-agent LLM systems. It begins with parallel agent responses to a query, embeds those responses into a shared semantic space, and uses a lightweight transformer policy to predict a sparse directed acyclic graph (DAG) that governs a single round of sequential message propagation. The framework is claimed to be acyclic by construction, to strictly subsume pure parallel execution (via the empty-graph case), to require no external LLM judges or hand-crafted topology search, and to admit policy-gradient training; experiments are said to show that the learned response-conditioned policy generalizes when the number of agents, the task, or the base models change.

Significance. If the empirical generalization results hold, the work would provide a practical, reusable mechanism for trading off communication cost against accuracy in multi-agent LLM collaboration without per-instance search or external reward models. The explicit subsumption of parallel mode and the acyclicity guarantee are clean formal properties that could be useful for system design.

major comments (2)
  1. [Abstract] Abstract: the central claim that 'the response-conditioned policy learned by Nexa under one setting can be reused when the number of agents, the task, or the underlying agent changes' is load-bearing for the contribution, yet the abstract (and the manuscript as described) supplies no quantitative results, no evaluation protocol, and no details on how transfer across embedding distributions was measured. Without such evidence the transfer property remains an unverified assumption.
  2. [Abstract] Abstract: although the text states that the hybrid execution problem is formalized and that a training procedure based on policy-gradient optimization is presented, no equations appear for the policy, the objective, the DAG construction, or the acyclicity proof. The absence of these formal elements makes it impossible to verify the claimed 'acyclicity by construction' or the strict subsumption of parallel mode.
minor comments (2)
  1. [Abstract] The phrase 'underlying agent' is ambiguous; it should be clarified whether it refers to the base LLM, the prompt template, or another component.
  2. A diagram showing the parallel stage, embedding step, DAG prediction, and single propagation round would substantially improve readability of the hybrid execution flow.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback. We address each major comment point by point below and indicate the revisions we will make to strengthen the presentation of our claims and formal elements.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that 'the response-conditioned policy learned by Nexa under one setting can be reused when the number of agents, the task, or the underlying agent changes' is load-bearing for the contribution, yet the abstract (and the manuscript as described) supplies no quantitative results, no evaluation protocol, and no details on how transfer across embedding distributions was measured. Without such evidence the transfer property remains an unverified assumption.

    Authors: We appreciate this observation regarding the load-bearing nature of the generalization claim. The full manuscript reports quantitative transfer results in the experiments section, including accuracy retention rates and communication savings when the policy trained under one agent count or task is applied to new agent counts (e.g., 3 to 5 agents), different tasks, and alternate base models, with an evaluation protocol that measures performance on held-out configurations without retraining. To address the concern directly in the abstract, we will revise it to include concise quantitative highlights and a brief reference to the transfer evaluation protocol. revision: yes

  2. Referee: [Abstract] Abstract: although the text states that the hybrid execution problem is formalized and that a training procedure based on policy-gradient optimization is presented, no equations appear for the policy, the objective, the DAG construction, or the acyclicity proof. The absence of these formal elements makes it impossible to verify the claimed 'acyclicity by construction' or the strict subsumption of parallel mode.

    Authors: We agree that the abstract, due to length constraints, omits explicit equations. The main body formalizes the hybrid execution problem, defines the lightweight transformer policy that predicts edge probabilities for the sparse DAG, presents the policy-gradient training objective, details the DAG construction procedure, and includes the proof of acyclicity by construction together with the strict subsumption of the parallel (empty-graph) case. We will add a short formal overview subsection early in the paper that explicitly states the key equations and proof outline to improve verifiability. revision: yes

Circularity Check

0 steps flagged

No significant circularity; derivation remains self-contained

full rationale

The paper's formal claims—that the predicted graph is acyclic by construction and that the framework strictly subsumes pure parallel execution via the empty-graph case—follow directly from the one-round parallel execution followed by optional single propagation design, without reducing to fitted parameters or prior self-referential results. The training procedure is described as policy-gradient optimization without external judges or reward models, indicating an objective defined externally to the policy itself. The reported reusability of the response-conditioned policy across agent counts, tasks, or base models is presented as an empirical experimental outcome rather than a first-principles derivation or prediction forced by construction. No self-citations, uniqueness theorems, or ansatzes are invoked as load-bearing steps in the abstract or described chain.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 1 invented entities

The central claims rest on the assumption that a learned policy can produce useful communication graphs from response embeddings and that policy-gradient training succeeds without additional supervision; no free parameters or invented entities are explicitly introduced beyond the policy model itself.

axioms (2)
  • standard math The predicted communication graph is acyclic by construction
    Stated in abstract as a formal property of the framework
  • domain assumption The hybrid framework strictly subsumes pure parallel execution
    Claimed when the predicted graph is empty
invented entities (1)
  • Nexa response-conditioned policy no independent evidence
    purpose: Predicts sparse DAG for optional sequential refinement after parallel execution
    New trainable component introduced to bridge parallel and sequential modes

pith-pipeline@v0.9.0 · 5823 in / 1404 out tokens · 36313 ms · 2026-05-20T19:24:52.054200+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

184 extracted references · 184 canonical work pages · 7 internal anchors

  1. [1]

    The Fourteenth International Conference on Learning Representations , year =

    Stochastic Self-Organization in Multi-Agent Systems , author =. The Fourteenth International Conference on Learning Representations , year =

  2. [2]

    Proceedings of the 41st International Conference on Machine Learning , pages =

    Zhuge, Mingchen and Wang, Wenyi and Kirsch, Louis and Faccio, Francesco and Khizbullin, Dmitrii and Schmidhuber, J\". Proceedings of the 41st International Conference on Machine Learning , pages =. 2024 , editor =

  3. [3]

    Cut the Crap: An Economical Communication Pipeline for

    Guibin Zhang and Yanwei Yue and Zhixun Li and Sukwon Yun and Guancheng Wan and Kun Wang and Dawei Cheng and Jeffrey Xu Yu and Tianlong Chen , booktitle=. Cut the Crap: An Economical Communication Pipeline for. 2025 , url=

  4. [4]

    Forty-second International Conference on Machine Learning , year=

    G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks , author=. Forty-second International Conference on Machine Learning , year=

  5. [5]

    Gradient Driven Rewards to Guarantee Fairness in Collaborative Machine Learning , url =

    Xu, Xinyi and Lyu, Lingjuan and Ma, Xingjun and Miao, Chenglin and Foo, Chuan Sheng and Low, Bryan Kian Hsiang , booktitle =. Gradient Driven Rewards to Guarantee Fairness in Collaborative Machine Learning , url =

  6. [6]

    AutoGen: Enabling Next-Gen

    Qingyun Wu and Gagan Bansal and Jieyu Zhang and Yiran Wu and Beibin Li and Erkang Zhu and Li Jiang and Xiaoyun Zhang and Shaokun Zhang and Jiale Liu and Ahmed Hassan Awadallah and Ryen W White and Doug Burger and Chi Wang , booktitle=. AutoGen: Enabling Next-Gen. 2024 , url=

  7. [7]

    arXiv preprint arXiv:2505.24239 , year=

    An Adversary-Resistant Multi-Agent LLM System via Credibility Scoring , author=. arXiv preprint arXiv:2505.24239 , year=

  8. [8]

    arXiv preprint arXiv:2505.16997 , year=

    X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs , author=. arXiv preprint arXiv:2505.16997 , year=

  9. [9]

    2025 , url=

    Rui Ye and Shuo Tang and Rui Ge and Yaxin Du and Zhenfei Yin and Siheng Chen and Jing Shao , booktitle=. 2025 , url=

  10. [10]

    arXiv preprint arXiv:2505.16988 , year=

    MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent Systems , author=. arXiv preprint arXiv:2505.16988 , year=

  11. [11]

    The Thirteenth International Conference on Learning Representations , year=

    Scaling Large Language Model-based Multi-Agent Collaboration , author=. The Thirteenth International Conference on Learning Representations , year=

  12. [12]

    GPT-4 Technical Report

    Gpt-4 technical report , author=. arXiv preprint arXiv:2303.08774 , year=

  13. [13]

    Abhimanyu Dubey and Abhinav Jauhri and Abhinav Pandey and Abhishek Kadian and Ahmad Al-Dahle and Aiesha Letman and Akhil Mathur and Alan Schelten and Amy Yang and Angela Fan and Anirudh Goyal and Anthony Hartshorn and Aobo Yang and Archi Mitra and Archie Sravankumar and Artem Korenev and Arthur Hinsvark and Arun Rao and Aston Zhang and Aurélien Rodriguez ...

  14. [14]

    2025 , eprint=

    Qwen2.5 Technical Report , author=. 2025 , eprint=

  15. [15]

    2023 , url=

    Guohao Li and Hasan Abed Al Kader Hammoud and Hani Itani and Dmitrii Khizbullin and Bernard Ghanem , booktitle=. 2023 , url=

  16. [16]

    C hat D ev: Communicative Agents for Software Development

    Qian, Chen and Liu, Wei and Liu, Hongzhang and Chen, Nuo and Dang, Yufan and Li, Jiahao and Yang, Cheng and Chen, Weize and Su, Yusheng and Cong, Xin and Xu, Juyuan and Li, Dahai and Liu, Zhiyuan and Sun, Maosong. C hat D ev: Communicative Agents for Software Development. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguist...

  17. [17]

    Advances in neural information processing systems , volume=

    Chain-of-thought prompting elicits reasoning in large language models , author=. Advances in neural information processing systems , volume=

  18. [18]

    Advances in neural information processing systems , volume=

    Tree of thoughts: Deliberate problem solving with large language models , author=. Advances in neural information processing systems , volume=

  19. [19]

    Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

    Reimers, Nils and Gurevych, Iryna. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing. 2019

  20. [20]

    Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence,

    Redefining Contributions: Shapley-Driven Federated Learning , author =. Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence,. 2024 , month =

  21. [21]

    Transactions on Machine Learning Research , issn=

    Nurbek Tastan and Samuel Horv. Transactions on Machine Learning Research , issn=. 2025 , url=

  22. [22]

    Proceedings of the 42nd International Conference on Machine Learning , pages =

    Tastan, Nurbek and Horv\'. Proceedings of the 42nd International Conference on Machine Learning , pages =. 2025 , editor =

  23. [23]

    Contributions to the Theory of Games II , editor =

    A Value for n-Person Games , author =. Contributions to the Theory of Games II , editor =. 1953 , publisher =

  24. [24]

    Federated Learning: Privacy and Incentive , pages=

    Collaborative fairness in federated learning , author=. Federated Learning: Privacy and Incentive , pages=. 2020 , publisher=

  25. [25]

    Forty-first International Conference on Machine Learning , year=

    Improving factuality and reasoning in language models through multiagent debate , author=. Forty-first International Conference on Machine Learning , year=

  26. [26]

    The Thirteenth International Conference on Learning Representations , year=

    Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains , author=. The Thirteenth International Conference on Learning Representations , year=

  27. [27]

    Liang, Z

    Liang, Tian and He, Zhiwei and Jiao, Wenxiang and Wang, Xing and Wang, Yan and Wang, Rui and Yang, Yujiu and Shi, Shuming and Tu, Zhaopeng. Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024. doi:10.18653/v1/2024.emnlp-main.992

  28. [28]

    The Twelfth International Conference on Learning Representations , year=

    AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors , author=. The Twelfth International Conference on Learning Representations , year=

  29. [29]

    A Dynamic

    Zijun Liu and Yanzhe Zhang and Peng Li and Yang Liu and Diyi Yang , booktitle=. A Dynamic. 2024 , url=

  30. [30]

    The Thirteenth International Conference on Learning Representations , year=

    Automated Design of Agentic Systems , author=. The Thirteenth International Conference on Learning Representations , year=

  31. [31]

    2025 , url=

    Jiayi Zhang and Jinyu Xiang and Zhaoyang Yu and Fengwei Teng and Xiong-Hui Chen and Jiaqi Chen and Mingchen Zhuge and Xin Cheng and Sirui Hong and Jinlin Wang and Bingnan Zheng and Bang Liu and Yuyu Luo and Chenglin Wu , booktitle=. 2025 , url=

  32. [32]

    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , month =

    Tastan, Nurbek and Nandakumar, Karthik , title =. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , month =. 2023 , pages =

  33. [33]

    Conference on Parsimony and Learning , pages =

    Tastan, Nurbek and Horv\'. Conference on Parsimony and Learning , pages =. 2025 , editor =

  34. [34]

    Advances in Neural Information Processing Systems , volume=

    Fjord: Fair and accurate federated learning under heterogeneous targets with ordered dropout , author=. Advances in Neural Information Processing Systems , volume=

  35. [35]

    The Twelfth International Conference on Learning Representations , year=

    Incentive-Aware Federated Learning with Training-Time Model Rewards , author=. The Twelfth International Conference on Learning Representations , year=

  36. [36]

    International Conference on Learning Representations , year=

    Slimmable Neural Networks , author=. International Conference on Learning Representations , year=

  37. [37]

    , title =

    Yu, Jiahui and Huang, Thomas S. , title =. Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) , month =

  38. [38]

    Scaling Learning Algorithms Towards

    Bengio, Yoshua and LeCun, Yann , booktitle =. Scaling Learning Algorithms Towards

  39. [39]

    arXiv preprint arXiv:2106.07289 , year=

    Decentralized personalized federated min-max problems , author=. arXiv preprint arXiv:2106.07289 , year=

  40. [40]

    EURO Journal on Computational Optimization , volume=

    Decentralized personalized federated learning: Lower bounds and optimal algorithm for all personalization modes , author=. EURO Journal on Computational Optimization , volume=. 2022 , publisher=

  41. [41]

    arXiv preprint arXiv:2406.00846 , year=

    Local Methods with Adaptivity via Scaling , author=. arXiv preprint arXiv:2406.00846 , year=

  42. [42]

    and Osindero, Simon and Teh, Yee Whye , journal =

    Hinton, Geoffrey E. and Osindero, Simon and Teh, Yee Whye , journal =. A Fast Learning Algorithm for Deep Belief Nets , volume =

  43. [43]

    2016 , publisher=

    Deep learning , author=. 2016 , publisher=

  44. [44]

    IEEE Network , volume=

    Generalizing AI: challenges and opportunities for plug and play AI solutions , author=. IEEE Network , volume=. 2020 , publisher=

  45. [45]

    arXiv preprint arXiv:2312.11230 , year=

    Dirichlet-based uncertainty quantification for personalized federated learning with improved posterior networks , author=. arXiv preprint arXiv:2312.11230 , year=

  46. [46]

    arXiv preprint arXiv:2402.05050 , year=

    Federated Learning Can Find Friends That Are Advantageous , author=. arXiv preprint arXiv:2402.05050 , year=

  47. [47]

    Federated Learning: Privacy and Incentive , pages=

    A principled approach to data valuation for federated learning , author=. Federated Learning: Privacy and Incentive , pages=. 2020 , publisher=

  48. [48]

    Towards More Efficient Data Valuation in Healthcare Federated Learning Using Ensembling , author=. Distributed, Collaborative, and Federated Learning, and Affordable AI and Healthcare for Resource Diverse Global Health: Third MICCAI Workshop, DeCaF 2022, and Second MICCAI Workshop, FAIR 2022, Held in Conjunction with MICCAI 2022, Singapore, September 18 a...

  49. [49]

    Advances in neural information processing systems , volume=

    Deep leakage from gradients , author=. Advances in neural information processing systems , volume=

  50. [50]

    Artificial intelligence and statistics , pages=

    Communication-efficient learning of deep networks from decentralized data , author=. Artificial intelligence and statistics , pages=. 2017 , organization=

  51. [51]

    How the GDPR will change the world , author=. Eur. Data Prot. L. Rev. , volume=. 2016 , publisher=

  52. [52]

    2009 , journal=

    Beyond the HIPAA privacy rule: enhancing privacy, improving health through research , author=. 2009 , journal=

  53. [53]

    arXiv preprint arXiv:2001.02610 , year=

    idlg: Improved deep leakage from gradients , author=. arXiv preprint arXiv:2001.02610 , year=

  54. [54]

    2009 , journal=

    Learning multiple layers of features from tiny images , author=. 2009 , journal=

  55. [55]

    Choquette-Choo and Natalie Dullerud and Adam Dziedzic and Yunxiang Zhang and Somesh Jha and Nicolas Papernot and Xiao Wang , booktitle=

    Christopher A. Choquette-Choo and Natalie Dullerud and Adam Dziedzic and Yunxiang Zhang and Somesh Jha and Nicolas Papernot and Xiao Wang , booktitle=. 2021 , url=

  56. [56]

    Cronus: Robust and Heterogeneous Collaborative Learning with Black-Box Knowledge Transfer,

    Cronus: Robust and heterogeneous collaborative learning with black-box knowledge transfer , author=. arXiv preprint arXiv:1912.11279 , year=

  57. [57]

    Foundations and Trends

    The algorithmic foundations of differential privacy , author=. Foundations and Trends. 2014 , publisher=

  58. [58]

    International Conference on Machine Learning , pages=

    The distributed discrete gaussian mechanism for federated learning with secure aggregation , author=. International Conference on Machine Learning , pages=. 2021 , organization=

  59. [59]

    arXiv preprint arXiv:2001.03618 , year=

    Encode, shuffle, analyze privacy revisited: Formalizations and empirical evaluation , author=. arXiv preprint arXiv:2001.03618 , year=

  60. [60]

    Learning Differentially Private Recurrent Language Models

    Learning differentially private recurrent language models , author=. arXiv preprint arXiv:1710.06963 , year=

  61. [61]

    Advances in neural information processing systems , volume=

    Simple and scalable predictive uncertainty estimation using deep ensembles , author=. Advances in neural information processing systems , volume=

  62. [62]

    International conference on machine learning , pages=

    Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples , author=. International conference on machine learning , pages=. 2018 , organization=

  63. [63]

    International Conference on Artificial Intelligence and Statistics , pages=

    How to backdoor federated learning , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2020 , organization=

  64. [64]

    Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pages=

    Towards fair federated learning , author=. Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , pages=

  65. [65]

    arXiv preprint arXiv:2303.16520 , year=

    Fair Federated Medical Image Segmentation via Client Contribution Estimation , author=. arXiv preprint arXiv:2303.16520 , year=

  66. [66]

    FLamby: Datasets and Benchmarks for Cross-Silo Federated Learning in Realistic Healthcare Settings , volume =

    Ogier du Terrail, Jean and Ayed, Samy-Safwan and Cyffers, Edwige and Grimberg, Felix and He, Chaoyang and Loeb, Regis and Mangold, Paul and Marchand, Tanguy and Marfoq, Othmane and Mushtaq, Erum and Muzellec, Boris and Philippenko, Constantin and Silva, Santiago and Tele\'. FLamby: Datasets and Benchmarks for Cross-Silo Federated Learning in Realistic Hea...

  67. [67]

    Advances in Neural Information Processing Systems , volume=

    Personalized federated learning with theoretical guarantees: A model-agnostic meta-learning approach , author=. Advances in Neural Information Processing Systems , volume=

  68. [68]

    arXiv preprint arXiv:2011.10464 , year=

    A reputation mechanism is all you need: Collaborative fairness and adversarial robustness in federated learning , author=. arXiv preprint arXiv:2011.10464 , year=

  69. [69]

    IEEE Transactions on Big Data , year=

    FedFAIM: A Model Performance-based Fair Incentive Mechanism for Federated Learning , author=. IEEE Transactions on Big Data , year=

  70. [70]

    International Journal of Computer Vision , volume=

    Knowledge distillation: A survey , author=. International Journal of Computer Vision , volume=. 2021 , publisher=

  71. [71]

    IEEE Internet of Things Journal , volume=

    Incentive mechanism for reliable federated learning: A joint optimization approach to combining reputation and contract theory , author=. IEEE Internet of Things Journal , volume=. 2019 , publisher=

  72. [72]

    Proceedings of the Web Conference 2021 , pages=

    Incentive mechanism for horizontal federated learning based on reputation and reverse auction , author=. Proceedings of the Web Conference 2021 , pages=

  73. [73]

    Workshop on Federated Learning: Recent Advances and New Challenges (in Conjunction with NeurIPS 2022) , year=

    To federate or not to federate: incentivizing client participation in federated learning , author=. Workshop on Federated Learning: Recent Advances and New Challenges (in Conjunction with NeurIPS 2022) , year=

  74. [74]

    arXiv preprint arXiv:2205.14840 , year=

    Maximizing global model appeal in federated learning , author=. arXiv preprint arXiv:2205.14840 , year=

  75. [75]

    http://yann

    The MNIST database of handwritten digits , author=. http://yann. lecun. com/exdb/mnist/ , year=

  76. [76]

    Proceedings of the VLDB Endowment , volume=

    Distributed learning of fully connected neural networks using independent subnet training , author=. Proceedings of the VLDB Endowment , volume=

  77. [77]

    The Eleventh International Conference on Learning Representations , year=

    Sparse Random Networks for Communication-Efficient Federated Learning , author=. The Eleventh International Conference on Learning Representations , year=

  78. [78]

    Federated Optimization in Heterogeneous Networks , url =

    Li, Tian and Sahu, Anit Kumar and Zaheer, Manzil and Sanjabi, Maziar and Talwalkar, Ameet and Smith, Virginia , booktitle =. Federated Optimization in Heterogeneous Networks , url =

  79. [79]

    Foundations and trends

    Advances and open problems in federated learning , author=. Foundations and trends. 2021 , publisher=

  80. [80]

    Advances in neural information processing systems , volume=

    Tackling the objective inconsistency problem in heterogeneous federated optimization , author=. Advances in neural information processing systems , volume=

Showing first 80 references.