pith. sign in

arxiv: 2606.08128 · v1 · pith:52LAHSOXnew · submitted 2026-06-06 · 💻 cs.NE · cs.DM

Gray-Box Optimization and the Vertex Coloring Problem

Pith reviewed 2026-06-27 18:55 UTC · model grok-4.3

classification 💻 cs.NE cs.DM
keywords gray-box optimizationvertex coloringrandom local searchevolutionary algorithmsbipartite graphsruntime analysisplateaus
0
0 comments X

The pith

Gray-box optimization enables random local search to find proper 2-colorings of bipartite graphs in expected O(n log n) time from random initializations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper explores gray-box optimization, which provides limited problem-specific information to algorithms that primarily use fitness values. It shows that random local search can efficiently locate a valid 2-coloring in bipartite graphs when starting from a random 2-coloring. The (1+1) evolutionary algorithm, however, struggles to do so when beginning from an n-coloring unless it receives additional guidance on search plateaus. Gray-box operators are presented as a way to improve performance in these cases.

Core claim

The paper establishes that in the vertex coloring problem on bipartite graphs, gray-box random local search achieves an expected runtime of O(n log n) to reach a proper 2-coloring from a random 2-coloring, while the standard (1+1) EA cannot make progress from a proper n-coloring without extra plateau guidance, though gray-box methods can accelerate the process.

What carries the argument

Gray-box operators that combine fitness-based selection with some problem-specific information to guide search on plateaus.

If this is right

  • Random local search succeeds in finding 2-colorings efficiently on bipartite graphs under the specified conditions.
  • The (1+1) EA requires additional mechanisms to handle plateaus in the search space for this problem.
  • Gray-box approaches can reduce runtime compared to pure black-box methods.
  • These results apply specifically when starting from appropriate initial colorings.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Gray-box techniques may offer a practical middle ground for other graph coloring variants or combinatorial tasks.
  • Extending the analysis to graphs with odd cycles could clarify the boundaries of these runtime guarantees.
  • Similar gray-box strategies might improve performance in related evolutionary computation problems like scheduling or partitioning.

Load-bearing premise

The graphs are bipartite and the search begins from a random proper 2-coloring or n-coloring of suitable size.

What would settle it

Running RLS on large random bipartite graphs from random 2-colorings and finding that the average time to proper coloring exceeds O(n log n) substantially, or seeing the (1+1) EA succeed without any plateau guidance.

read the original abstract

Gray-box optimization is an approach for making some problem-specific information available to the algorithm while still relying on fitness information as the main guide to an optimum. This approach was shown to be beneficial in various combinatorial optimization tasks and neatly captures the continuum between fully black-box algorithms and tailored algorithms. In this work, we discuss different flavors of gray-box algorithms. We show that RLS can find a proper $2$-coloring in a bipartite graph starting from a random $2$-coloring, in an expected time of $\mathcal{O}(n \log n)$. In contrast, when starting from a proper $n$-coloring, the (1+1) EA cannot find such a coloring except when offered additional guiding on plateaus of the search space. Finally, we show the run time for this setting can be much improved by using gray-box operators.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The paper explores gray-box optimization approaches for the vertex coloring problem. It claims that randomized local search (RLS) finds a proper 2-coloring of a bipartite graph from a random proper 2-coloring in expected O(n log n) time. By contrast, the (1+1) EA fails to reach such a coloring from a proper n-coloring unless given additional plateau guidance, but gray-box operators can substantially improve performance in this setting.

Significance. If the stated runtime bounds and comparisons are correct, the work supplies concrete evidence that limited problem-specific information (gray-box) can yield polynomial-time improvements over standard black-box evolutionary algorithms on a canonical combinatorial problem, while remaining within the fitness-based paradigm.

major comments (1)
  1. [Abstract, §1] Abstract and §1: The central claims are theoretical runtime results (O(n log n) for RLS; failure of (1+1) EA without guidance), yet the manuscript provides neither the proofs, the precise definitions of the gray-box operators, nor the verification steps needed to check the probabilistic analysis. These omissions make the claims unverifiable from the given text.
minor comments (1)
  1. [Abstract] The scope conditions (bipartite input graphs and specific initializations) are stated clearly in the abstract but should be repeated at the start of the results section for emphasis.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for highlighting the need for greater verifiability of the theoretical claims. We agree that the submitted version requires expansion to include explicit proofs, operator definitions, and analysis steps, and we will revise accordingly.

read point-by-point responses
  1. Referee: [Abstract, §1] Abstract and §1: The central claims are theoretical runtime results (O(n log n) for RLS; failure of (1+1) EA without guidance), yet the manuscript provides neither the proofs, the precise definitions of the gray-box operators, nor the verification steps needed to check the probabilistic analysis. These omissions make the claims unverifiable from the given text.

    Authors: We accept this criticism. The current manuscript states the runtime results but does not supply the full proofs or sufficiently detailed operator definitions in a form that allows direct verification. In the revised version we will add: (i) precise definitions of the gray-box operators (including the limited problem-specific information they access while remaining fitness-based) in a new or expanded Section 2; (ii) the complete proof that RLS reaches a proper 2-coloring of a bipartite graph in expected O(n log n) time from a random proper 2-coloring, with all probabilistic lemmas and drift arguments; and (iii) the corresponding analysis showing that the (1+1) EA fails without plateau guidance, together with the improvement obtained by the gray-box variant. These additions will be placed in the main body so that the claims become checkable. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper's core results are runtime bounds derived via standard probabilistic analysis (coupon-collector style arguments for RLS on bipartite graphs from random proper 2-colorings, and failure modes for the (1+1) EA from n-colorings). These are explicitly scoped to bipartite inputs and stated initializations in the abstract, with no equations, parameters, or claims reducing to self-definitions, fitted inputs renamed as predictions, or load-bearing self-citations. The gray-box operator improvements are presented as direct algorithmic modifications with independent runtime gains. The derivation chain is self-contained against external benchmarks and does not collapse to its inputs by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claims rest on standard probabilistic runtime analysis for evolutionary algorithms and the assumption that the graph is bipartite; no free parameters or invented entities are introduced in the abstract.

axioms (2)
  • standard math Standard expected runtime analysis via coupon collector or similar probabilistic arguments applies to the RLS process on bipartite graphs.
    Invoked to obtain the O(n log n) bound stated in the abstract.
  • domain assumption The fitness function and neighborhood structure for vertex coloring are defined such that plateaus exist for the (1+1) EA.
    Required for the claim that the (1+1) EA cannot escape without additional guiding.

pith-pipeline@v0.9.1-grok · 5678 in / 1369 out tokens · 20822 ms · 2026-06-27T18:55:04.344269+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

19 extracted references · 17 canonical work pages · 1 internal anchor

  1. [1]

    Back and S

    T. Back and S. Khuri. 1994. An evolutionary heuristic for the maximum indepen- dent set problem. InCEC ‘94. 531–535 vol.2. https://doi.org/10.1109/ICEC.1994. 350004

  2. [2]

    Samuel Baguley, Tobias Friedrich, Timo Kötzing, Xiaoyue Li, Marcus Pappik, and Ziena Zeif. 2022. Analysis of a gray-box operator for vertex cover. InGECCO ’22. 1363–1371. https://doi.org/10.1145/3512290.3528848

  3. [4]

    Luke Branson and Andrew M. Sutton. 2021. Focused jump-and-repair constraint handling for fixed-parameter tractable graph problems. InFOGA ’21. New York, NY, USA, 1–10. https://doi.org/10.1145/3450218.3477304

  4. [5]

    Benjamin Doerr. 2020. Probabilistic Tools for the Analysis of Randomized Optimization Heuristics. 1–87. https://doi.org/10.1007/978-3-030-29414-4_1 arXiv:1801.06733 [cs]

  5. [6]

    Benjamin Doerr, Carola Doerr, and Timo Kötzing. 2018. Static and Self-Adjusting Mutation Strengths for Multi-valued Decision Variables.Algorithmica80 (May 2018), 1732–1768. https://doi.org/10.1007/s00453-017-0341-1

  6. [7]

    Benjamin Doerr, Daniel Johannsen, and Carola Winzen. 2012. Multiplicative Drift Analysis.Algorithmica64, 4 (Dec. 2012), 673–697. https://doi.org/10.1007/s00453- 012-9622-x

  7. [8]

    Simon Fischer and Ingo Wegener. 2005. The one-dimensional Ising model: Muta- tion versus recombination. 344, 2 (Nov. 2005), 208–225. https://doi.org/10.1016/ j.tcs.2005.04.002

  8. [9]

    Tobias Friedrich, Timo Kötzing, Aishwarya Radhakrishnan, Leon Schiller, Martin Schirneck, Georg Tennigkeit, and Simon Wietheger. 2023. Crossover for Cardi- nality Constrained Optimization.ACM Trans. Evol. Learn. Optim.3, 2 (June 2023), 5:1–5:32. https://doi.org/10.1145/3603629

  9. [10]

    Jun He, Xin Yao, and Jin Li. 2005. A Comparative Study of Three Evolutionary Algorithms Incorporating Different Amounts of Domain Knowledge for Node Covering Problem.Systems, Man, and Cybernetics35 (June 2005). https://doi. org/10.1109/TSMCC.2004.841903

  10. [11]

    J. L. W. V. Jensen. 1906. Sur les fonctions convexes et les inégualités entre les valeurs Moyennes. (Nov. 1906). https://doi.org/10.1007/bf02418571

  11. [12]

    Timo Kötzing. 2024. Theory of Stochastic Drift. https://doi.org/10.48550/arXiv. 2406.14589 arXiv:2406.14589

  12. [13]

    Timo Kötzing and Martin S. Krejca. 2019. First-hitting times under drift.Theoret- ical Computer Science796 (Dec. 2019), 51–69. https://doi.org/10.1016/j.tcs.2019. 08.021

  13. [14]

    Frank Neumann and Ingo Wegener. 2007. Randomized local search, evolutionary algorithms, and the minimum spanning tree problem.Theoretical Computer Science378 (June 2007), 32–40. https://doi.org/10.1016/j.tcs.2006.11.002

  14. [15]

    Oliveto and Carsten Witt

    Pietro S. Oliveto and Carsten Witt. 2011. Simplified Drift Analysis for Proving Lower Bounds in Evolutionary Computation.Algorithmica59 (March 2011), 369–386. https://doi.org/10.1007/s00453-010-9387-z

  15. [16]

    Oliveto and Carsten Witt

    Pietro S. Oliveto and Carsten Witt. 2012. Erratum: Simplified Drift Analysis for Proving Lower Bounds in Evolutionary Computation. https://doi.org/10.48550/ arXiv.1211.7184 arXiv:1211.7184

  16. [17]

    Krejca, Ralf Rothen- berger, Timo Kötzing, and Tobias Friedrich

    Jannik Peters, Daniel Stephan, Isabel Amon, Hans Gawendowicz, Julius Lischeid, Lennart Salabarria, Jonas Umland, Felix Werner, Martin S. Krejca, Ralf Rothen- berger, Timo Kötzing, and Tobias Friedrich. 2019. Mixed Integer Programming versus Evolutionary Computation for Optimizing a Hard Real-World Staff As- signment Problem.ICAPS29 (July 2019), 541–554. h...

  17. [18]

    Dirk Sudholt. 2005. Crossover is provably essential for the Ising model on trees. InGECCO ’05. 1161–1167. https://doi.org/10.1145/1068009.1068202

  18. [19]

    Dirk Sudholt and Christine Zarges. 2010. Analysis of an Iterated Local Search Algorithm for Vertex Coloring. InAlgorithms and Computation, Otfried Cheong, Kyung-Yong Chwa, and Kunsoo Park (Eds.). Vol. 6506. Springer, 340–352. https: //doi.org/10.1007/978-3-642-17517-6_31 Series Title: Lecture Notes in Computer Science

  19. [20]

    Darrell Whitley. 2015. Mk Landscapes, NK Landscapes, MAX-kSAT: A Proof that the Only Challenging Problems are Deceptive. InGECCO ’15. 927–934. https://doi.org/10.1145/2739480.2754809 Gray-Box Optimization and the Vertex Coloring Problem GECCO ’26, July 13–17, 2026, San Jose, Costa Rica A Preliminaries Here, we briefly state theorems from probability and d...