Multiple Approximate-Response Agents (MARA): Fast Near-Optimal Primal Recovery for Distributed Optimization

Garrett van Ryzin; Stephen Boyd; Tetiana Parshakova; Yicheng Bai

arxiv: 2503.12221 · v2 · submitted 2025-03-15 · 🧮 math.OC

Multiple Approximate-Response Agents (MARA): Fast Near-Optimal Primal Recovery for Distributed Optimization

Tetiana Parshakova , Yicheng Bai , Garrett van Ryzin , Stephen Boyd This is my paper

Pith reviewed 2026-05-23 00:01 UTC · model grok-4.3

classification 🧮 math.OC

keywords distributed optimizationprimal recoverydual methodsapproximate responsesconvex combinationfeasibilitysuboptimalityresidual minimization

0 comments

The pith

MARA recovers near-optimal feasible primal solutions rapidly by combining multiple approximate agent responses to dual prices.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Dual methods for distributed optimization let agents solve subproblems in parallel but struggle to recover primal feasibility, often needing many iterations or special conditions like strict convexity. The paper introduces multiple approximate-response agents (MARA) to fix this by having each agent return several primal responses with bounded suboptimality to each price query. These responses are generated in parallel, adding no wall-clock time to the underlying dual algorithm. MARA then selects a convex combination of the responses by minimizing the sum of primal residual and complementary slackness residual. Tests with price localization and dual subgradient methods show that this typically yields a feasible near-optimal solution in a few tens of iterations, with hyperparameters controlling speed versus suboptimality trade-offs.

Core claim

Rather than returning a single primal response to each price query, MARA requires agents to generate multiple primal responses, each of which has bounded suboptimality. Because these multiple responses can be computed in parallel, there is no increase in the wall-clock time of the underlying dual algorithm. MARA then constructs a convex combination of the multiple responses by minimizing the sum of the primal and complementary slackness residuals to produce a high-quality primal solution. The method is agnostic to how dual prices are computed, so MARA can be applied to enhance any dual algorithm.

What carries the argument

The MARA procedure of generating multiple bounded-suboptimal primal responses in parallel and forming their convex combination via minimization of primal plus complementary-slackness residuals.

If this is right

MARA applies to any dual algorithm because it only needs the price queries and the agent responses.
Convergence to feasible near-optimal solutions occurs in a few tens of iterations on the tested problems.
Hyperparameters of MARA can be tuned to trade speed, computational budget per iteration, and allowable suboptimality.
Individual responses may be suboptimal yet their combination still satisfies feasibility and near-optimality.
Wall-clock time remains unchanged because the extra responses are computed in parallel.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same residual-minimization step could be inserted after any iterative dual update that supplies prices, even outside distributed settings.
One could replace the convex-combination solver with a faster heuristic and measure how much feasibility quality is lost.
Testing on problems where the number of agents grows would reveal whether the per-iteration cost of solving the combination problem stays manageable.
The approach might pair with warm-starting techniques to further reduce the number of outer iterations needed.

Load-bearing premise

That a convex combination of the multiple bounded-suboptimal responses will produce a high-quality feasible solution by driving down the sum of primal and complementary slackness residuals.

What would settle it

Apply MARA to a distributed linear program with ten agents and observe that after fifty iterations the primal residual stays above 0.01 while the objective gap exceeds 5 percent.

read the original abstract

Dual methods are useful for distributed optimization because they allow agent-level subproblems to be solved in parallel. However, achieving primal feasibility with dual methods is a challenge; it can take many iterations to find prices that recover primal feasibility, and even with optimal dual prices primal feasibility is not guaranteed unless special conditions like strict convexity hold. To address this limitation, we propose a simple primal recovery method, multiple approximate-response agents (MARA), that is able to rapidly reduce primal infeasibility, tolerating some degree of suboptimality. The method is agnostic to how dual prices are computed, so MARA can be applied to enhance any dual algorithm. Rather than returning a single primal response to each price query, MARA requires agents to generate multiple primal responses, each of which has bounded suboptimality. Because these multiple responses can be computed in parallel, there is no increase in the wall-clock time of the underlying dual algorithm. MARA then constructs a convex combination of the multiple responses by minimizing the sum of the primal and complementary slackness residuals to produce a high-quality primal solution. Tests of MARA using both a price localization method and a dual subgradient method show that it typically converges to a feasible, near-optimal solution in a few tens of iterations. Moreover, hyperparameters of MARA can be flexibly tuned to control the trade-off among speed, computational budget, and degree of suboptimality of the feasible solutions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MARA gives a practical parallel trick for faster primal feasibility in dual methods, but the residual-minimization step supplies no bound on the final objective gap.

read the letter

The core idea here is to have each agent return several bounded-suboptimal primal responses to the same dual prices, then pick a convex combination that drives down the primal and complementary-slackness residuals. That combination is new enough on its own terms and sits on top of any existing dual-price routine without changing wall-clock time. The paper is right that this targets a real, recurring headache in distributed optimization: dual methods often leave you with an infeasible point even after the dual has converged. The construction is clean and the claim that it is agnostic to how the prices are generated holds up from the description. Tests on a price-localization method and a subgradient method are mentioned, and the abstract says feasible near-optimal points appear in a few tens of iterations, which is the kind of empirical signal that matters for practitioners. The soft spot is exactly the one the stress-test flags. The weights are chosen only to shrink the two residuals; the original objective never enters the minimization. Nothing in the given construction prevents the recovered point from having an arbitrarily bad cost relative to the best individual response or the true optimum, even if the residuals hit zero. The paper therefore rests the “near-optimal” label entirely on the numerical examples rather than on any relation between the per-response suboptimality tolerance, the number of responses, and the final gap. That is a genuine limitation, not a minor one, because the method is sold as producing high-quality feasible solutions. The work is still worth referee time because the practical problem is real, the mechanism is simple and parallelizable, and the experiments (once the full numbers and baselines are checked) could show whether the gap stays small in practice. Readers working on large-scale convex or nonconvex distributed problems would get value from seeing the details and the plots. I would send it to review rather than desk-reject.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes Multiple Approximate-Response Agents (MARA) for primal recovery in dual-based distributed optimization. Agents return multiple bounded-suboptimal primal responses to each dual price query; these are combined via a convex combination whose weights minimize the sum of primal infeasibility and complementary-slackness residuals. The method is presented as agnostic to the underlying dual algorithm, incurs no extra wall-clock time when responses are computed in parallel, and is claimed to produce feasible near-optimal solutions in a few tens of iterations. Numerical tests using a price-localization method and a dual subgradient method are reported to support the claims.

Significance. If the empirical behavior holds under broader conditions, MARA supplies a lightweight, parallelizable heuristic that mitigates the well-known primal-feasibility bottleneck of dual decomposition. The parallel-response design and residual-minimization step are simple to implement and could be useful in applications where rapid feasibility is more important than a provable optimality gap. The absence of any analytic relation between per-response suboptimality tolerance, number of responses, and final objective gap, however, confines the contribution to a practical heuristic whose performance must be validated case-by-case.

major comments (2)

[§3] §3 (MARA construction), convex-combination step: the weight vector w is chosen to minimize ||Ax(w) - b|| + ||complementary-slackness residual(w)|| over the simplex, yet the original objective term is absent from this minimization. Consequently, a feasible point can be recovered whose cost deviates arbitrarily from the best individual response while still driving the chosen residuals to zero; no analytic bound relating the per-response Lagrangian suboptimality tolerance to the final optimality gap is supplied.
[Numerical experiments] Numerical experiments section: while the abstract states that MARA “typically converges … in a few tens of iterations,” the reported results supply no quantitative tables, error bars, or data-exclusion rules that would allow assessment of robustness across problem instances or hyper-parameter choices. The central claim of “near-optimal” therefore rests entirely on the specific test cases shown.

minor comments (2)

[§2] Notation for the number of responses per agent and the suboptimality tolerance should be introduced once in §2 and used consistently thereafter.
[Figures] Figure captions should explicitly state the values of the MARA hyperparameters (number of responses, tolerance) used in each panel.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the thoughtful and constructive report. We address each major comment below and indicate the revisions that will be incorporated.

read point-by-point responses

Referee: [§3] §3 (MARA construction), convex-combination step: the weight vector w is chosen to minimize ||Ax(w) - b|| + ||complementary-slackness residual(w)|| over the simplex, yet the original objective term is absent from this minimization. Consequently, a feasible point can be recovered whose cost deviates arbitrarily from the best individual response while still driving the chosen residuals to zero; no analytic bound relating the per-response Lagrangian suboptimality tolerance to the final optimality gap is supplied.

Authors: We acknowledge that the convex-combination step minimizes only the sum of the primal infeasibility and complementary-slackness residuals and does not include an explicit term for the original objective. Consequently, while each individual response satisfies a bounded Lagrangian suboptimality, the recovered point’s objective value is guaranteed only to lie within the convex hull of the individual objectives; no tighter analytic relation to the final optimality gap is derived. The manuscript presents MARA as a practical heuristic whose value lies in rapid feasibility recovery rather than in provable bounds. We will revise §3 to state this limitation explicitly and to note that the method’s performance must be assessed empirically on a case-by-case basis. revision: partial
Referee: [Numerical experiments] Numerical experiments section: while the abstract states that MARA “typically converges … in a few tens of iterations,” the reported results supply no quantitative tables, error bars, or data-exclusion rules that would allow assessment of robustness across problem instances or hyper-parameter choices. The central claim of “near-optimal” therefore rests entirely on the specific test cases shown.

Authors: We agree that the numerical section would be strengthened by additional quantitative detail. In the revised manuscript we will add tables reporting iteration counts to feasibility, objective gaps, and residual values across all tested instances, together with standard deviations (error bars) and explicit statements of the hyper-parameter ranges and instance-generation rules employed. These additions will allow readers to evaluate the robustness of the “few tens of iterations” claim beyond the illustrative figures. revision: yes

Circularity Check

0 steps flagged

No circularity: MARA is an explicit algorithmic construction with performance claims supported by experiments

full rationale

The paper defines MARA directly as a procedure that takes multiple bounded-suboptimal responses, forms their convex combination by minimizing the sum of primal and complementary-slackness residuals, and applies the result to any dual method. No equation or claim reduces by construction to its own inputs, no fitted parameter is relabeled as a prediction, and no load-bearing step relies on a self-citation chain or imported uniqueness theorem. The 'near-optimal' qualifier is presented as an empirical observation from the reported tests rather than a mathematical consequence of the construction itself. The derivation is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

2 free parameters · 0 axioms · 0 invented entities

Review based on abstract only; full paper would likely list the precise suboptimality bounds and the number of responses per agent as tunable quantities.

free parameters (2)

number of responses per agent
Hyperparameter controlling computational budget versus quality; stated as flexibly tunable.
suboptimality tolerance per response
Bound on how far each individual response may deviate from optimality; required for the method to function.

pith-pipeline@v0.9.0 · 5801 in / 1208 out tokens · 44063 ms · 2026-05-23T00:01:40.560829+00:00 · methodology

Multiple Approximate-Response Agents (MARA): Fast Near-Optimal Primal Recovery for Distributed Optimization

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)