Anytime Analysis on BinVal: Adaptive Parameters Help
Pith reviewed 2026-05-10 17:50 UTC · model grok-4.3
The pith
A self-adjusting mutation rate in the (1+1) EA yields O(k^{1+ε}) fixed-target runtimes on BinVal for every k=o(n), independent of string length n.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By replacing the fixed mutation rate in the standard (1+1) EA with a self-adjusting rate, the fixed-target run time for k ∈ o(n) and a constant ε >0 arbitrarily close to zero is in O(k^{1+ε}) for this algorithm. In particular, this run time is independent of n, holds simultaneously for all k ∈ o(n), and is close to the run time of Θ(k log k) for the (1+1) EA with the best fixed mutation rate if k is known.
What carries the argument
The self-adjusting mutation-rate rule inside the (1+1) EA that changes the mutation probability on the basis of recent success or failure.
Load-bearing premise
The specific self-adjusting mechanism for the mutation rate behaves exactly as modeled and the probabilistic analysis of fixed-target runtimes on BinVal holds without hidden dependencies on n or k.
What would settle it
A single run or a proof for some n and k=o(n) in which the self-adjusting (1+1) EA requires more than k^{1+ε} evaluations or whose runtime grows with n would falsify the claimed bound.
Figures
read the original abstract
While most theoretical run time analyses of discrete randomized search heuristics provide bounds on the expected number of evaluations to find the global optimum, we consider the anytime performance of evolutionary and estimation-of-distribution algorithms. For this purpose, we analyze the fixed-target run time of various algorithms using BinVal as fitness function and bound the run time to optimize the most significant $k \in o(n)$ bits of a bit string with length $n$. We analyze the run times such that they hold not only for a fixed $k$, but simultaneously for all $k \in o(n)$. For the standard (1+1) EA with fixed mutation rate $1/n$, we show that the fixed-target run time for all $k \in o(n)$ is in $\Theta(n \log k)$. Using an EDA instead, we get an expected number of evaluations of $\Theta(k \log n)$ for the sig-cGA. Replacing in the standard (1+1) EA the fixed mutation rate with a self-adjusting rate, we show that the fixed-target run time for $k \in o(n)$ and a constant $\varepsilon >0$ arbitrarily close to zero is in $\mathcal{O}\left(k^{1+\varepsilon}\right)$ for this algorithm. In particular, this run time is independent of $n$, holds simultaneously for all $k \in o(n)$, and is close to the run time of $\Theta(k \log k)$ for the (1+1) EA with the best fixed mutation rate if $k$ is known.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper analyzes the anytime (fixed-target) performance of evolutionary algorithms and EDAs on the BinVal fitness function, deriving runtime bounds to optimize the k most significant bits for all k = o(n) simultaneously. For the standard (1+1) EA with fixed mutation rate 1/n the bound is Θ(n log k); for the sig-cGA it is Θ(k log n); and for a self-adjusting (1+1) EA the bound is O(k^{1+ε}) for any constant ε>0 arbitrarily close to zero, independent of n and close to the Θ(k log k) optimum achievable when k is known in advance.
Significance. If the derivations hold, the work provides a strong demonstration that self-adaptive mutation rates can deliver near-optimal fixed-target runtimes on BinVal without knowledge of k and without n-dependence, while maintaining simultaneous validity across all k = o(n). The explicit comparison to both fixed-rate EAs and EDAs, together with the emphasis on anytime behavior rather than only global optimization, adds value to the literature on parameter control in evolutionary computation.
major comments (2)
- [§4] §4 (analysis of the self-adjusting (1+1) EA): the claimed O(k^{1+ε}) bound independent of n requires an explicit, n-free upper bound on the initial adaptation phase in which the mutation rate moves from its standard initialization Θ(1/n) to the scale Θ(1/k) appropriate for the current leading-bit position. Multiplicative update rules typically require Θ(log(n/k)) successful steps; if the per-step waiting time while the rate remains too large is Ω(1) on average, this phase contributes an Ω(log n) term that, for k = o(log n), exceeds any fixed power k^{1+ε} and re-introduces n-dependence, contradicting the stated independence. The drift or concentration argument must therefore absorb or eliminate this cost for every k = o(n) simultaneously.
- [Theorem 3] Theorem 3 (or the main theorem on the self-adjusting EA): the simultaneous-for-all-k requirement is load-bearing for the central claim, yet the proof sketch does not appear to contain a uniform bound showing that the adaptation cost incurred while optimizing the first few bits does not degrade the runtime for later, larger k. A concrete test would be to verify whether the total expected time up to target k remains O(k^{1+ε}) when the process begins at rate 1/n and must cross all intermediate scales.
minor comments (2)
- [Abstract] The abstract states the self-adjusting result holds 'for k ∈ o(n) and a constant ε >0 arbitrarily close to zero'; the precise dependence of the hidden constant on ε should be stated explicitly in the theorem statement to clarify how close to the Θ(k log k) optimum the bound actually is.
- [§2] Notation for the self-adjusting mechanism (update factor, success-based rule) is introduced without a dedicated preliminary subsection; a short paragraph recalling the exact update rule and initialization would improve readability for readers unfamiliar with the 1/5-rule variants.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive review. The concerns about the adaptation phase in the self-adjusting (1+1) EA analysis are valid and point to a place where the proof sketch in Section 4 can be strengthened. We will revise the manuscript to supply the missing explicit bounds while preserving the stated claims.
read point-by-point responses
-
Referee: [§4] §4 (analysis of the self-adjusting (1+1) EA): the claimed O(k^{1+ε}) bound independent of n requires an explicit, n-free upper bound on the initial adaptation phase in which the mutation rate moves from its standard initialization Θ(1/n) to the scale Θ(1/k) appropriate for the current leading-bit position. Multiplicative update rules typically require Θ(log(n/k)) successful steps; if the per-step waiting time while the rate remains too large is Ω(1) on average, this phase contributes an Ω(log n) term that, for k = o(log n), exceeds any fixed power k^{1+ε} and re-introduces n-dependence, contradicting the stated independence. The drift or concentration argument must therefore absorb or eliminate this cost for every k = o(n) simultaneously.
Authors: We agree that an explicit n-independent bound on the initial adaptation phase must be stated. In the revised manuscript we will insert a new lemma that applies multiplicative drift to the mutation-rate process. When the current rate r ≫ 1/k the success probability for improving the leading bit remains Ω(1/k) (by the same binomial-tail estimates already used for the main drift), so the expected time to obtain a successful step that triggers a rate decrease is O(k). Consequently the Θ(log(n/k)) rate-adjustment steps cost only O(k log(n/k)) in total, which is absorbed into O(k^{1+ε}) for any fixed ε>0. The constants are chosen uniformly in k=o(n) by fixing ε first and then taking n large enough; the resulting bound remains independent of n in the leading term. revision: yes
-
Referee: [Theorem 3] Theorem 3 (or the main theorem on the self-adjusting EA): the simultaneous-for-all-k requirement is load-bearing for the central claim, yet the proof sketch does not appear to contain a uniform bound showing that the adaptation cost incurred while optimizing the first few bits does not degrade the runtime for later, larger k. A concrete test would be to verify whether the total expected time up to target k remains O(k^{1+ε}) when the process begins at rate 1/n and must cross all intermediate scales.
Authors: We accept that the current sketch does not explicitly verify uniformity across scales. The revised proof of Theorem 3 will decompose the run time into successive phases, one for each bit position m=1…k. The time to reach target m is bounded by O(m^{1+ε}) by induction; the additional adaptation cost at scale m is O(m^ε) by the new lemma mentioned above. Summing over m≤k therefore yields at most O(k^{1+ε}) + O(∑_{m=1}^k m^ε) = O(k^{1+ε}), which is uniform in k. This directly answers the concrete test: the total expected time starting from rate 1/n and crossing all intermediate scales remains O(k^{1+ε}). revision: yes
Circularity Check
No significant circularity in runtime derivation
full rationale
The paper presents direct probabilistic analysis deriving fixed-target runtime bounds on BinVal for the fixed-rate (1+1) EA (Θ(n log k)), the sig-cGA EDA (Θ(k log n)), and the self-adjusting (1+1) EA (O(k^{1+ε}) independent of n, simultaneously for all k ∈ o(n)). These are obtained from modeling the algorithms' behavior, drift analysis, and waiting-time arguments rather than by fitting parameters to the target quantity, redefining the result into itself, or relying on load-bearing self-citations whose content reduces to the present claim. The adaptation-phase concern raised by the skeptic is a potential gap in the correctness of the n-independence proof, but does not constitute circularity because the bound is not presupposed or constructed from the final statement. The derivation chain remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Standard asymptotic notation (big-O, Theta) and basic probability bounds for Markov chains and randomized algorithms
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Replacing in the standard (1+1) EA the fixed mutation rate with a self-adjusting rate, we show that the fixed-target run time for k ∈ o(n) ... is in O(k^{1+ε})
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanembed_strictMono_of_one_lt unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
we design ... a combined potential function that results in handling the optimization of the bit string and adjustment of the mutation rate simultaneously
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Aishwaryaprajna and Jonathan E. Rowe. 2025. Evolutionary Anytime Algorithms. InEvolutionary Computation in Combinatorial Optimization: 25th European Con- ference, Proceedings (EvoCOP ’25). Springer-Verlag, 18–32. doi:10.1007/978-3-031- 86849-8_2
-
[2]
Maxim Buzdalov, Benjamin Doerr, Carola Doerr, and Dmitry Vinokurov. 2022. Fixed-Target Runtime Analysis.Algorithmica84, 6, 1762–1793. doi:10.1007/ S00453-021-00881-0
work page 2022
-
[3]
Stephan Cathabard, Per Kristian Lehre, and Xin Yao. 2011. Non-uniform mu- tation rates for problems with unknown solution lengths. InProceedings of the 11th Workshop Proceedings on Foundations of Genetic Algorithms (FOGA ’11). Association for Computing Machinery, 173–180. doi:10.1145/1967654.1967670
-
[4]
Benjamin Doerr, Carola Doerr, and Timo Kötzing. 2015. Solving Problems with Unknown Solution Length at (Almost) No Extra Cost. InProceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation (GECCO ’15). Asso- ciation for Computing Machinery, 831–838. doi:10.1145/2739480.2754681
-
[5]
Benjamin Doerr, Carola Doerr, and Timo Kötzing. 2018. Static and Self-Adjusting Mutation Strengths for Multi-valued Decision Variables.Algorithmica80, 5 (May 2018), 1732–1768. doi:10.1007/s00453-017-0341-1
-
[6]
Benjamin Doerr, Carola Doerr, and Johannes Lengler. 2021. Self-Adjusting Muta- tion Rates with Provably Optimal Success Rules.Algorithmica83, 10, 3108–3147. doi:10.1007/S00453-021-00854-3
-
[7]
Benjamin Doerr and Leslie Ann Goldberg. 2010. Drift analysis with tail bounds. InProceedings of the 11th International Conference on Parallel Problem Solving from Nature: Part I (PPSN’10). Springer-Verlag, 174–183. doi:10.1007/978-3-642- 15844-5_18
-
[8]
Benjamin Doerr and Leslie Ann Goldberg. 2013. Adaptive Drift Analysis.Algo- rithmica65, 1, 224–250. doi:10.1007/S00453-011-9585-3
-
[9]
Benjamin Doerr, Daniel Johannsen, and Carola Winzen. 2012. Multiplicative Drift Analysis.Algorithmica64, 4, 673–697. doi:10.1007/S00453-012-9622-X
- [10]
-
[11]
Stefan Droste, Thomas Jansen, and Ingo Wegener. 2002. On the analysis of the (1+ 1) evolutionary algorithm.Theoretical Compututer Science276, 1–2 (April 2002), 51–81. doi:10.1016/S0304-3975(01)00182-7
-
[12]
Hafsteinn Einarsson, Marcelo Matheus Gauy, Johannes Lengler, Florian Meier, Asier Mujika, Angelika Steger, and Felix Weissenberger. 2019. The linear hidden subset problem for the (1+1) EA with scheduled and adaptive mutation rates. Theoretical Computer Science785 (2019), 150–170. doi:10.1016/j.tcs.2019.05.021
-
[13]
Jun He and Xin Yao. 2001. Drift analysis and average time complexity of evolu- tionary algorithms.Artificial Intelligence127, 1 (2001), 57–85. doi:10.1016/S0004- 3702(01)00058-3
- [14]
-
[15]
Jens Jägersküpper. 2008. A Blend of Markov-Chain and Drift Analysis. InProceed- ings of the 11th International Conference on Parallel Problem Solving from Nature (PPSN’08). Springer-Verlag, 41–51. doi:10.1007/978-3-540-87700-4_5
-
[16]
Thomas Jansen and Christine Zarges. 2012. Fixed budget computations: a different perspective on run time analysis. InProceedings of the 14th Annual Conference on Genetic and Evolutionary Computation (GECCO ’12). Association for Computing Machinery, 1325–1332. doi:10.1145/2330163.2330347
-
[17]
Timo Kötzing and Martin S. Krejca. 2019. First-hitting times under drift.Theoret- ical Computer Science796 (2019), 51–69. doi:10.1016/j.tcs.2019.08.021
-
[18]
Johannes Lengler and Nicholas Spooner. 2015. Fixed Budget Performance of the (1+1) EA on Linear Functions. InProceedings of the 2015 ACM Conference on Foundations of Genetic Algorithms XIII (FOGA ’15). Association for Computing Machinery, 52–61. doi:10.1145/2725494.2725506
- [19]
-
[20]
Jonathan Rowe. 2024. A Theoretical Investigation Of Termination Criteria For Evolutionary Algorithms. InEvolutionary Computation in Combinatorial Optimization(1 ed.)(Lecture Notes in Computer Science). Springer, 162–176. doi:10.1007/978-3-031-57712-3_11
-
[21]
Dirk Sudholt. 2013. A New Method for Lower Bounds on the Running Time of Evolutionary Algorithms.Transactions on Evolutionary Computation17, 3 (June 2013), 418–435. doi:10.1109/TEVC.2012.2202241
-
[22]
Dmitry Vinokurov, Maxim Buzdalov, Arina Buzdalova, Benjamin Doerr, and Carola Doerr. 2019. Fixed-target runtime analysis of the (1 + 1) EA with resam- pling. InProceedings of the Genetic and Evolutionary Computation Conference Companion (GECCO ’19). Association for Computing Machinery, 2068–2071. doi:10.1145/3319619.3326906
-
[23]
Carsten Witt. 2013. Tight Bounds on the Optimization Time of a Randomized Search Heuristic on Linear Functions.Combinatorics, Probability and Computing 22, 2 (2013), 294–318. doi:10.1017/S0963548312000600
- [24]
-
[25]
Shlomo Zilberstein. 1996. Using Anytime Algorithms in Intelligent Systems.AI Magazine17, 3 (Mar. 1996), 73. doi:10.1609/aimag.v17i3.1232 Acknowledgements This research was (partially) funded by the HPI Research School on Foundations of AI (FAI). GECCO ’26, July 13–17, 2026, San Jose, Costa Rica xxx A Experimental Anytime Analysis ofBinV al To extend the e...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.