Convergence Detection of Asynchronous Iterations based on Modified Recursive Doubling
Pith reviewed 2026-05-25 10:57 UTC · model grok-4.3
The pith
A modified recursive doubling algorithm allows convergence detection in asynchronous iterations even when the number of processes is not a power of two.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that a modified recursive doubling algorithm adapts to the non-power-of-two case for distributed convergence detection in asynchronous iterations, with algorithms illustrated based on the reduction operation. A concluding discussion covers implementation and applicability.
What carries the argument
Modified recursive doubling algorithm based on the reduction operation for handling arbitrary process counts in convergence detection.
Load-bearing premise
Asynchronous iterations allow direct use of reduction operations in the modified doubling without extra synchronization that would break the detection logic.
What would settle it
A simulation or run with three processes where the algorithm either misses actual convergence or falsely detects it when iterations are still ongoing.
Figures
read the original abstract
This paper addresses the distributed convergence detection problem in asynchronous iterations. A modified recursive doubling algorithm is investigated in order to adapt to the non-power-of-two case. Some convergence detection algorithms are illustrated based on the reduction operation. Finally, a concluding discussion about the implementation and the applicability is presented.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript addresses the distributed convergence detection problem in asynchronous iterations. It investigates a modified recursive doubling algorithm adapted to the non-power-of-two case and illustrates some convergence detection algorithms based on the reduction operation, followed by a discussion of implementation and applicability.
Significance. If the illustrated modification correctly extends recursive doubling to arbitrary process counts while preserving the detection logic, the work provides a practical adaptation for distributed systems that leverages existing reduction primitives rather than introducing new synchronization mechanisms. This is a modest but useful contribution at the level of algorithmic illustration, consistent with the absence of formal theorems or quantitative claims. The stress-test concern regarding synchronization overheads does not appear load-bearing given the paper's focus on standard reduction operations.
minor comments (2)
- The abstract and manuscript would benefit from explicit pseudocode or a diagram for the modified recursive doubling scheme to make the adaptation to non-power-of-two cases concrete.
- The concluding discussion on implementation and applicability could reference specific distributed computing frameworks (e.g., MPI collectives) to ground the claims.
Simulated Author's Rebuttal
We thank the referee for their careful reading and positive evaluation of the manuscript. We appreciate the recommendation for minor revision and the recognition that the work offers a practical adaptation of recursive doubling for convergence detection in asynchronous iterations using existing reduction primitives.
Circularity Check
No significant circularity detected
full rationale
The paper proposes and illustrates a modified recursive doubling scheme for distributed convergence detection in asynchronous iterations, specifically adapting the method to the non-power-of-two case via standard reduction operations. No equations, fitted parameters, self-referential definitions, or load-bearing self-citations appear in the abstract or described contribution. The work remains at the level of algorithmic presentation and discussion, which is self-contained against external benchmarks and does not reduce any claimed result to its own inputs by construction.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Asynchronousiterativemethodsformultiprocessors
G.M.Baudet. Asynchronousiterativemethodsformultiprocessors. J. ACM,25(2):226– 244, 1978
work page 1978
-
[2]
D. P. Bertsekas and J. N. Tsitsiklis.Parallel and Distributed Computation: Numerical Methods. Prentice-Hall, Inc., Upper Saddle River, NJ, USA, 1989
work page 1989
-
[3]
K. M. Chandy and L. Lamport. Distributed snapshots: Determining global states of distributed systems. ACM Transactions on Computer Systems , 3(1):63–75, 1985
work page 1985
-
[4]
D. Chazan and W. Miranker. Chaotic relaxation.Linear Algebra and its Applications , 2(2):199–222, 1969
work page 1969
-
[5]
M. N. El Tarazi. Some convergence results for asynchronous algorithms.Numerische Mathematik, 39(3):325–340, 1982
work page 1982
-
[6]
F. Magoulès and A.-K. Cheik Ahamed. Alinea: An advanced linear algebra library for massively parallel computations on graphics processing units. The International Journal of High Performance Computing Applications , 29(3):284–310, 2015
work page 2015
-
[7]
F. Magoulès, A.-K. Cheik Ahamed, and R. Putanowicz. Auto-tuned Krylov methods on cluster of graphics processing unit.International Journal of Computer Mathematics , 92(6):1222–1250, 2015
work page 2015
-
[8]
F. Magoulès, A.-K. Cheik Ahamed, and R. Putanowicz. Optimized Schwarz method withoutoverlapforthegravitational potentialequationonclusterofgraphicsprocessing unit. International Journal of Computer Mathematics , 93(6):955–980, 2015
work page 2015
-
[9]
F. Magoulès, A.-K. Cheik Ahamed, and A. Suzuki. Green computing on graphics pro- cessing units. Concurrency and Computation: Practice and Experience , 28(16):4305– 4325, 2016
work page 2016
-
[10]
F. Magoulès and G. Gbikpi-Benissan. JACK: an asynchronous communication kernel library for iterative algorithms.The Journal of Supercomputing, 73(8):3468–3487, 2017
work page 2017
-
[11]
F.MagoulèsandG.Gbikpi-Benissan. Distributedconvergencedetectionbasedonglobal residual error under asynchronous iterations.IEEE Transactions on Parallel and Dis- tributed Systems, 29(4):819–829, 2018
work page 2018
-
[12]
F. Magoulès and G. Gbikpi-Benissan. JACK2: An MPI-based communication library withnon-blockingsynchronizationforasynchronousiterations. Advances in Engineering Software, 119:116–133, 2018
work page 2018
-
[13]
F. Magoulès, G. Gbikpi-Benissan, and Q. Zou. Asynchronous iterations of Parareal algorithm for option pricing models.Mathematics, 6(4):1–18, 2018
work page 2018
-
[14]
F. Magoulès, D. B. Szyld, and C. Venet. Asynchronous optimized Schwarz methods with and without overlap.Numerische Mathematik, 137(1):199–227, 2017
work page 2017
-
[15]
F. Magoulès and C. Venet. Asynchronous iterative sub-structuring methods.Mathe- matics and Computers in Simulation , 145:34–49, 2018. 7
work page 2018
-
[16]
Message Passing Interface Forum. MPI: A message passing interface standard.Inter- national Journal of Supercomputer Applications , 8(3/4):159–416, 1994
work page 1994
-
[17]
J.-C. Miellou. Algorithmes de relaxation chaotique à retards.ESAIM: Mathematical Modelling and Numerical Analysis , 9(R1):55–82, 1975
work page 1975
-
[18]
J.-C. Miellou, P. Spiteri, and D. El Baz. A new stopping criterion for linear per- turbed asynchronous iterations. Journal of Computational and Applied Mathematics , 219(2):471–483, 2008
work page 2008
-
[19]
S. A. Savari and D. P. Bertsekas. Finite termination of asynchronous iterative algo- rithms. Parallel Computing, 22(1):39–56, 1996
work page 1996
- [20]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.