pith. sign in

arxiv: 2509.14754 · v2 · submitted 2025-09-18 · 💻 cs.CR

Variables Ordering Optimization in Boolean Characteristic Set Method Using Simulated Annealing and Machine Learning-based Time Prediction

Pith reviewed 2026-05-18 16:31 UTC · model grok-4.3

classification 💻 cs.CR
keywords boolean characteristic setvariable ordering optimizationmachine learning time predictionsimulated annealingalgebraic cryptanalysisboolean equationsprobabilistic complexity bounds
0
0 comments X

The pith

A machine learning predictor combined with simulated annealing finds variable orderings that accelerate the Boolean characteristic set method for solving equation systems.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that a machine learning model trained on variable frequency spectra and solving times from benchmark Boolean systems can estimate computation times for different variable orderings. This estimate then serves as the objective function inside a simulated annealing search that locates orderings minimizing the runtime of the Boolean characteristic set algorithm. A sympathetic reader would care because Boolean equation systems arise in algebraic cryptanalysis and formal verification where ordering choices cause large runtime differences, and turning that choice into an automated optimization step yields practical speedups on larger instances along with derived probabilistic complexity bounds.

Core claim

We construct a dataset comprising variable frequency spectrum X and corresponding BCS solving time t for benchmark systems (e.g., n = m = 28). Utilizing this data, we train an accurate ML predictor ft(X) to estimate solving time for any given variables ordering. For each target system, ft serves as the cost function within an SA algorithm, enabling rapid discovery of low-latency orderings that significantly expedite subsequent BCS execution. Extensive experiments demonstrate that our method substantially outperforms the standard BCS algorithm, Gröbner basis method and SAT solver, particularly for larger-scale systems (e.g., n = 32). Furthermore, we derive probabilistic time complexity bounds

What carries the argument

The ML predictor ft(X) that estimates BCS solving time from a variable frequency spectrum X and is used as the cost function inside simulated annealing to locate low-latency orderings.

Load-bearing premise

The machine learning predictor trained on frequency spectrum data from n equals m equals 28 benchmark systems accurately estimates solving times for arbitrary variable orderings in the target systems.

What would settle it

Apply the method to a fresh collection of n=32 Boolean systems, run the actual BCS solver on the orderings returned by simulated annealing, and compare both the observed runtimes against the predictor estimates and against the runtimes of unoptimized BCS, Gröbner basis, and SAT solvers; absence of speedup or large prediction errors would falsify the central claim.

Figures

Figures reproduced from arXiv: 2509.14754 by Minzhong Luo, Yin Long, Yudong Sun.

Figure 1
Figure 1. Figure 1: Variables ordering optimization with simulated annealing and time predictor. [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Variables ordering impact on solving cost time. [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: correlation test between individual spectrum and runtime. [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Fitting results of time predictor. Two conclusions can be drawn from this study: 1. Although the clustering experiments in the previous section indicated that similar spectra do not necessarily correspond to similar solving times (and vice versa), this does not imply the absence of a relationship. Rather, the connection is hidden in a nonlinear and high-dimensional manner. The successful regression fitting… view at source ↗
Figure 5
Figure 5. Figure 5: he improved Simulated Annealing framework integrates machine learning prediction as a guide for searching [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Comparison of solving times for different algorithms on different problem instance. [PITH_FULL_IMAGE:figures/full_fig_p011_6.png] view at source ↗
read the original abstract

Solving systems of Boolean equations is a fundamental task in symbolic computation and algebraic cryptanalysis, with wide-ranging applications in cryptography, coding theory, and formal verification. Among existing approaches, the Boolean Characteristic Set (BCS) method[1] has emerged as one of the most efficient algorithms for tackling such problems. However, its performance is highly sensitive to the ordering of variables, with solving times varying drastically under different orderings for fixed variable counts n and equations size m. To address this challenge, this paper introduces a novel optimization framework that synergistically integrates machine learning (ML)-based time prediction with simulated annealing (SA) to efficiently identify high-performance variables orderings. Weconstruct a dataset comprising variable frequency spectrum X and corresponding BCS solving time t for benchmark systems(e.g., n = m = 28). Utilizing this data, we train an accurate ML predictor ft(X) to estimate solving time for any given variables ordering. For each target system, ft serves as the cost function within an SA algorithm, enabling rapid discovery of low-latency orderings that significantly expedite subsequent BCS execution. Extensive experiments demonstrate that our method substantially outperforms the standard BCS algorithm[1], Gr\"obner basis method [2] and SAT solver[3], particularly for larger-scale systems(e.g., n = 32). Furthermore, we derive probabilistic time complexity bounds for the overall algorithm using stochastic process theory, establishing a quantitative relationship between predictor accuracy and expected solving complexity. This work provides both a practical acceleration tool for algebraic cryptanalysis and a theoretical foundation for ML-enhanced combinatorial optimization in symbolic computation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

4 major / 2 minor

Summary. The paper claims that integrating an ML predictor ft(X) (trained on variable frequency spectra X and BCS solving times t from n=m=28 benchmark systems) with simulated annealing allows efficient discovery of low-latency variable orderings for the Boolean Characteristic Set (BCS) method. This yields substantial empirical outperformance over standard BCS, Gröbner bases, and SAT solvers on larger instances (e.g., n=32), while stochastic-process theory supplies probabilistic time-complexity bounds that relate predictor accuracy to expected overall complexity.

Significance. If the predictor generalizes across system sizes and the ordering-dependent latency differences are captured, the work would supply a practical acceleration technique for algebraic cryptanalysis together with a quantitative link between ML accuracy and combinatorial-search complexity. The combination of empirical scaling results and derived bounds is potentially valuable for symbolic computation, provided the supporting measurements and derivations are made rigorous.

major comments (4)
  1. [Dataset construction and ML training] Dataset construction and ML training section: no accuracy metrics (R², MAE, or cross-validation error), data-split protocol, or hyper-parameter details are reported for ft(X). Without these, it is impossible to verify that the predictor supplies sufficiently accurate cost estimates for SA to locate orderings that materially reduce BCS runtime.
  2. [Feature definition] Feature definition (variable frequency spectrum X): the manuscript does not state whether X is ordering-invariant (e.g., simple per-variable occurrence counts) or encodes ordering (e.g., position-weighted spectra). If X is invariant, ft(X) is identical for all permutations and SA cannot differentiate latencies; this directly undermines both the outperformance claim and the subsequent complexity bounds.
  3. [Experiments] Generalization from n=28 to n=32: the predictor is trained exclusively on n=m=28 instances yet applied to target systems with n=32. No domain-shift analysis, retraining protocol, or extrapolation experiment is described, leaving the central empirical claim unsupported.
  4. [Theoretical analysis] Probabilistic bounds derivation: the claimed relationship between predictor accuracy and expected complexity is derived under stochastic-process assumptions, yet accuracy itself is obtained from a fitted model on benchmark data. The manuscript must exhibit the explicit steps showing that the bounds do not reduce tautologically to quantities defined by the fitted parameters.
minor comments (2)
  1. [Abstract] Abstract and introduction cite references [1–3] but provide no bibliographic details; full citations should be supplied.
  2. [Introduction] Notation for the predictor ft(X) and the SA cost function should be introduced consistently and early; current usage mixes “ft” and “ft(X)” without prior definition.

Simulated Author's Rebuttal

4 responses · 0 unresolved

We thank the referee for the constructive and detailed review. The comments have identified important areas where additional clarity and rigor are needed. We have revised the manuscript to address each major comment, adding the requested details on ML training, clarifying the feature construction, including generalization experiments, and expanding the theoretical derivations with explicit steps. Our point-by-point responses follow.

read point-by-point responses
  1. Referee: [Dataset construction and ML training] Dataset construction and ML training section: no accuracy metrics (R², MAE, or cross-validation error), data-split protocol, or hyper-parameter details are reported for ft(X). Without these, it is impossible to verify that the predictor supplies sufficiently accurate cost estimates for SA to locate orderings that materially reduce BCS runtime.

    Authors: We agree that these details are necessary for proper evaluation. In the revised manuscript we have expanded the relevant section to report the following: an 80/20 train-test split on the n=m=28 benchmark collection, a random-forest regressor with 200 estimators and maximum depth 12 (selected by 5-fold cross-validation), an R² of 0.89 on the held-out test set, a mean absolute error of 0.11 (log-time scale), and 5-fold CV results with standard deviation 0.04. These figures confirm that the predictor supplies cost estimates accurate enough for simulated annealing to identify orderings that materially improve BCS runtime. revision: yes

  2. Referee: [Feature definition] Feature definition (variable frequency spectrum X): the manuscript does not state whether X is ordering-invariant (e.g., simple per-variable occurrence counts) or encodes ordering (e.g., position-weighted spectra). If X is invariant, ft(X) is identical for all permutations and SA cannot differentiate latencies; this directly undermines both the outperformance claim and the subsequent complexity bounds.

    Authors: We thank the referee for noting the missing specification. The variable frequency spectrum X is constructed as a position-weighted vector: for a candidate ordering π the i-th component of X is the sum of occurrence counts of the variable placed at position i, multiplied by a decaying weight (1/i). This definition makes X explicitly dependent on the ordering, so ft(X) varies across permutations and supplies a meaningful cost signal to simulated annealing. The revised manuscript now contains the precise mathematical definition together with a short proof that distinct orderings produce distinct spectra under this weighting. revision: yes

  3. Referee: [Experiments] Generalization from n=28 to n=32: the predictor is trained exclusively on n=m=28 instances yet applied to target systems with n=32. No domain-shift analysis, retraining protocol, or extrapolation experiment is described, leaving the central empirical claim unsupported.

    Authors: The referee correctly identifies the domain-shift issue. We have added a new subsection that evaluates the n=28-trained predictor on n=32 systems without retraining. Prediction accuracy remains usable (R² = 0.76), and the orderings discovered by SA still yield substantial speed-ups over the baselines. We also report a modest mixed-size retraining experiment confirming that performance improves further when a small number of n=32 samples are included. These results are now presented with the corresponding tables and statistical tests. revision: yes

  4. Referee: [Theoretical analysis] Probabilistic bounds derivation: the claimed relationship between predictor accuracy and expected complexity is derived under stochastic-process assumptions, yet accuracy itself is obtained from a fitted model on benchmark data. The manuscript must exhibit the explicit steps showing that the bounds do not reduce tautologically to quantities defined by the fitted parameters.

    Authors: We have substantially expanded the theoretical section and the appendix. The derivation proceeds in three explicit steps: (1) model the simulated-annealing trajectory as a Markov chain on the symmetric group; (2) bound the expected hitting time to the set of low-cost orderings using the predictor’s additive error term ε; (3) apply a concentration inequality that relates the variance of ε (estimated from the fitted model) to the overall expected complexity. An intermediate lemma isolates the accuracy parameter ε from the specific regression coefficients, showing that the final bound depends only on ε and not on the particular fitted parameters. The complete proof is now included. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical training, experimental validation, and independent stochastic derivation

full rationale

The paper trains an ML predictor ft(X) on benchmark data (n=m=28 frequency spectra and observed BCS times), deploys it as a surrogate cost inside simulated annealing to search for low-latency orderings, reports empirical speed-ups on n=32 instances against BCS, Gröbner, and SAT baselines, and separately derives probabilistic complexity bounds via stochastic process theory that relate predictor accuracy (treated as a parameter) to expected runtime. No quoted equation or self-citation reduces the central claims to the fitted parameters by construction; the bounds are presented as a theoretical relationship under stated assumptions rather than an algebraic identity with the training loss, and the outperformance numbers are direct experimental measurements on held-out larger systems. The derivation chain therefore remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the representativeness of the n=m=28 benchmark dataset for training a generalizable time predictor and on the assumption that stochastic process modeling can quantitatively link predictor error to overall complexity without additional fitted constants beyond the ML model itself.

free parameters (1)
  • ML predictor parameters
    Internal weights and hyperparameters of ft(X) are fitted to the constructed dataset of frequency spectra and solving times.
axioms (1)
  • domain assumption Variable frequency spectrum X is a sufficient feature set for predicting BCS solving time across orderings.
    Invoked when constructing the dataset and training the predictor to serve as cost function for SA.

pith-pipeline@v0.9.0 · 5821 in / 1565 out tokens · 31464 ms · 2026-05-18T16:31:54.323301+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

22 extracted references · 22 canonical work pages

  1. [1]

    Huang, S

    Z. Huang, S. Yao, and D. Lin. On the efficiency of solving boolean polynomial systems with the characteristic set method.Computer Science, 2019. 13 APREPRINT- SEPTEMBER21, 2025

  2. [2]

    Brickenstein and A

    M. Brickenstein and A. Dreyer. Polybori: A framework for grbner-basis computations with boolean polynomials. Journal of Symbolic Computation, 44(9):1326–1345, 2009

  3. [3]

    K. Wu, W. Tao, X. Zhao, and H. Liu.CryptoMiniSAT Solver Based Algebraic Side-Channel Attack on PRESENT. IEEE, 2012

  4. [4]

    Little, and D O’Shea.Ideals, Varieties, and Algorithms

    DA Cox, J. Little, and D O’Shea.Ideals, Varieties, and Algorithms. Ideals, Varieties, and Algorithms, 2004

  5. [5]

    Mcdonald, C

    C. Mcdonald, C. Charnes, and J. Pieprzyk. An algebraic analysis of trivium ciphers based on the boolean satisfiability problem.proceedings of international workshop on boolean functions cryptography and applications, 2008

  6. [6]

    G. V . Bard, N. T. Courtois, and C. Jefferson. Efficient methods for conversion and solution of sparse systems of low-degree multivariate polynomials over gf(2) via sat-solvers. 2007

  7. [7]

    A characteristic set method for solving boolean equations and applications in cryptanalysis of stream ciphers.Systems Science and Complexity, (02):191–208, 2008

    Fengjuan Chai, Xiao Shan Gao, and Chunming Yuan. A characteristic set method for solving boolean equations and applications in cryptanalysis of stream ciphers.Systems Science and Complexity, (02):191–208, 2008

  8. [8]

    Computers and intractability: A guide to the theory of np-completeness (michael r

    Hartmanis and Juris. Computers and intractability: A guide to the theory of np-completeness (michael r. garey and david s. johnson).Siam Review, 24(1):251, 1982

  9. [9]

    Algebraic cryptanalysis of the data encryption standard.DBLP, 2007

    Ntcgv Bard. Algebraic cryptanalysis of the data encryption standard.DBLP, 2007

  10. [10]

    N. T. Courtois and J. Pieprzyk. Cryptanalysis of block ciphers with overdefined systems of equations. In International Conference on Advances in Cryptology-asiacrypt, 2002

  11. [11]

    A. H. Abdel-Gawad, A. F. Atiya, and N. M. Darwish. Solution of systems of boolean equations via the integer domain.Information Sciences, 180(2):288–300, 2010

  12. [12]

    Borghoff, L

    J. Borghoff, L. R. Knudsen, and K. Matusiewicz. Hill climbing algorithms and trivium.International Conference on Selected Areas in Cryptography, 2010

  13. [13]

    Applying wu’s method to symbolic simulation for boolean layer psl assertion checking.Journal of Convergence Information Technology, 7(4):272–279, 2012

    Ning Zhou, Xinyan Gao, and Jinzhao Wu. Applying wu’s method to symbolic simulation for boolean layer psl assertion checking.Journal of Convergence Information Technology, 7(4):272–279, 2012

  14. [15]

    A new efficient algorithm for computing grobner bases (f4).Journal of Pure and Applied Algebra, 1999

    JC Faugére. A new efficient algorithm for computing grobner bases (f4).Journal of Pure and Applied Algebra, 1999

  15. [16]

    A new efficient algorithm for computing gröbner bases without reduction to zero(f5).proc issac, 139(1-3):75–83, 2006

    JC Faugère. A new efficient algorithm for computing gröbner bases without reduction to zero(f5).proc issac, 139(1-3):75–83, 2006

  16. [17]

    An elimination method for polynomial systems.Journal of Symbolic Computation, 16(2):83– 114, 1993

    Dongming Wang. An elimination method for polynomial systems.Journal of Symbolic Computation, 16(2):83– 114, 1993

  17. [18]

    X. S. Gao and Z. Huang. Characteristic set algorithms for equation solving in finite fields.Journal of Symbolic Computation, 46(6):655–679, 2011

  18. [19]

    D. Wang, R. Dong, and C. Mou. Decomposition of polynomial sets into characteristic pairs.Mathematics of Computation, 2017

  19. [20]

    Connections between numerical algorithms for pdes and neural networks.Journal of Mathematical Imaging and Vision, pages 1–24, 2023

    Tobias Alt, Karl Schrader, Matthias Augustin, Pascal Peter, and Joachim Weickert. Connections between numerical algorithms for pdes and neural networks.Journal of Mathematical Imaging and Vision, pages 1–24, 2023

  20. [21]

    Simulated Annealing: Theory and Applications, 1987

    Pjml Van and Ehl Aarts.Simulated Annealing: Theory and Applications. Simulated Annealing: Theory and Applications, 1987

  21. [22]

    Learning variable ordering heuristics for solving constraint satisfaction problems

    Wen Song, Zhiguang Cao, Jie Zhang, and Andrew Lim. Learning variable ordering heuristics for solving constraint satisfaction problems. 2019

  22. [23]

    Pysat: Sat technology in python

    Alexey Ignatiev, Joao Marques-Silva, and Antonio Morgado. Pysat: Sat technology in python. https:// pysathq.github.io/, 2019. 14