On the Distribution of Unweighted Minimum Knapsack Instances with Large SOS Rank
Pith reviewed 2026-05-09 19:04 UTC · model grok-4.3
The pith
Perturbing the knapsack threshold q with small Gaussian noise yields expected SOS rank O(sqrt(n) log(n/sigma)) for unweighted minimum knapsack.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
For unweighted minimum knapsack, the SOS rank is constant when n-q is O(1). When q is O(1), linear rank is required precisely when q is exponentially close to an integer. As the main result, when q is perturbed by a Gaussian of mean zero and variance sigma squared, the expected SOS rank over the range of small q is O(sqrt(n) log(n/sigma)).
What carries the argument
Smoothed analysis of the SOS rank of the polynomial formulation of the unweighted minimum knapsack constraint, under Gaussian perturbations of the threshold q.
If this is right
- Constant-rank SOS solves unweighted MK exactly when the threshold lies within O(1) of n.
- Linear SOS rank is necessary only for q that are exponentially close to an integer when q itself is O(1).
- Sublinear rank holds in expectation for all small q after any Gaussian perturbation of variance sigma squared.
- The contrast with Lovasz-Schrijver and Sherali-Adams, which require rank n even for constant q, is preserved under perturbation.
Where Pith is reading between the lines
- High-rank instances occupy a set of q of vanishing measure, so average-case analysis over q would likely show sublinear SOS rank.
- The same perturbation technique could be applied to other combinatorial problems known to have worst-case high SOS rank, to locate the typical rank.
- The result suggests that the measure-zero character of hard instances may explain why SOS relaxations succeed on noisy or real-world data even when worst-case bounds are poor.
Load-bearing premise
The analysis uses the standard polynomial inequality encoding of the unweighted knapsack constraint together with the usual definition of SOS rank, and models perturbations as Gaussian with the given variance.
What would settle it
Compute or bound the SOS rank explicitly for a grid of q values spaced at distance sigma around several small integers, then check whether the average rank exceeds O(sqrt(n) log(n/sigma)).
Figures
read the original abstract
We analyze the sum-of-squares rank of unweighted instances of the Minimum Knapsack (MK) problem, i.e., minimization of $\sum_{i=1}^n x_i$ for 0/1 variables under the constraint $\sum_{i=1}^n x_i \geq q$, with $q \in \mathbb{R}$. Such instances have long served as a testbed for understanding the limitations of lift-and-project methods in Boolean optimization. For example, both the Lov\'asz-Schrijver and Sherali-Adams hierarchies require (maximal) rank $n$ to solve them, already when $q=1/2$ is constant. The SOS hierarchy requires only \emph{sublinear} rank $O(\sqrt{n})$ to solve unweighted MK when $q=1/2$. On the other hand, when $q$ is allowed to vary with~$n$, the SOS rank of the problem may become linear. Interestingly, this is known to happen both when $q$ is large, and when $q$ is very small ($0<q \leq 2^{-n}$). This raises the question of whether we should think of hard instances of unweighted MK as being typical for the SOS hierarchy, or as a consequence of very specific choices of the threshold parameter $q$. In this paper, we address this question by showing new upper and lower bounds on the SOS rank of unweighted MK in the whole regime of the parameter $q$. For $n-q \leq O(1)$, we show that the SOS rank is constant. In contrast, when $q \leq O(1)$, a linear rank is needed if $q$ is exponentially close to an integer. As our main positive result, we show that linear rank is very rare for $q \leq O(1)$. This can be expressed in the language of smoothed analysis: after perturbing $q$ by a Gaussian with mean $0$ and variance $\sigma^2$, the expected SOS rank of MK is $O(\sqrt{n} \log (n/\sigma))$.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper analyzes the SOS rank of unweighted minimum knapsack instances (minimize sum x_i subject to sum x_i >= q) over 0/1 variables. It establishes regime-specific bounds: constant rank when n-q = O(1); linear rank when q <= O(1) only if q is exponentially close to an integer; and an explicit (at most linear) functional dependence of rank on dist(q, Z). The main result is a smoothed-analysis bound: after adding Gaussian noise of variance sigma^2 to q, the expected SOS rank is O(sqrt(n) log(n/sigma)).
Significance. If the bounds hold, the work clarifies the typical-case behavior of the SOS hierarchy on a classic testbed for lift-and-project methods, showing that linear-rank instances are rare under natural perturbations. The explicit rank-vs-distance function and its direct integration against the Gaussian density to obtain the expectation are strengths, as is the separation of the constant-rank regime from the small-q regime.
minor comments (2)
- [§2.2] §2.2: the definition of the polynomial formulation of the MK constraint could explicitly state the degree-1 monomials used for the SOS relaxation to avoid ambiguity with the standard 0/1 encoding.
- [§4] The constant hidden in the O(1) for the n-q regime is not numerically illustrated; a short table of rank values for n-q = 1,2,3 would help readers verify the claimed constancy.
Simulated Author's Rebuttal
We thank the referee for their careful review of the manuscript and for recommending acceptance. The referee's summary correctly identifies the key regimes analyzed and the main smoothed-analysis result.
Circularity Check
No significant circularity; derivation self-contained
full rationale
The paper derives explicit upper and lower bounds on SOS rank as a function of the distance from q to the nearest integer (constant rank for n-q = O(1), linear rank only when q is exponentially close to an integer for q = O(1)). The smoothed-analysis expectation O(√n log(n/σ)) is obtained by direct integration of this rank function against the Gaussian density; the log factor arises from standard tail bounds on the distance to the nearest integer. No parameter is fitted to data and then renamed as a prediction, no self-citation is load-bearing for the central claim, and the polynomial formulation and SOS-rank definition are the standard ones stated in the problem setup. The derivation therefore reduces to the stated assumptions and the explicit rank-vs-distance relation rather than to any tautological or self-referential step.
Axiom & Free-Parameter Ledger
axioms (1)
- standard math Standard definition of sum-of-squares hierarchy rank for 0/1 polynomial optimization
Reference graph
Works this paper leans on
-
[1]
Spielman and Shang-Hua Teng , title =
John Dunagan and Daniel A. Spielman and Shang-Hua Teng , title =. Mathematical Programming , volume =
-
[2]
Spielman and Shang-Hua Teng , title =
Daniel A. Spielman and Shang-Hua Teng , title =. Journal of the ACM , volume =
-
[3]
Tengyu Ma and Jonathan Shi and David Steurer , title =. Proceedings of the 57th Annual IEEE Symposium on Foundations of Computer Science (FOCS 2016) , pages =. 2016 , publisher =
work page 2016
-
[4]
Boaz Barak and Fernando G. S. L. Brand. Hypercontractivity, sum-of-squares proofs, and their applications , booktitle =. 2012 , publisher =
work page 2012
-
[5]
Kothari and Jacob Steinhardt and David Steurer , title =
Pravesh K. Kothari and Jacob Steinhardt and David Steurer , title =. Proceedings of the 50th Annual ACM Symposium on Theory of Computing (STOC 2018) , pages =. 2018 , publisher =
work page 2018
-
[6]
Proceedings of the 49th Annual ACM Symposium on Theory of Computing (STOC 2017) , pages =
Prasad Raghavendra and Satish Rao and Tselil Schramm , title =. Proceedings of the 49th Annual ACM Symposium on Theory of Computing (STOC 2017) , pages =. 2017 , publisher =
work page 2017
-
[7]
Approximation, Randomization, and Combinatorial Optimization
Vijay Bhattiprolu and Venkatesan Guruswami and Euiwoong Lee , title =. Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2017) , series =. 2017 , publisher =
work page 2017
-
[8]
Mathematics of Operations Research , volume =
Monique Laurent , title =. Mathematics of Operations Research , volume =
-
[9]
The K -moment problem for compact semi-algebraic sets , journal =
Konrad Schm. The K -moment problem for compact semi-algebraic sets , journal =
-
[10]
Indiana University Mathematics Journal , volume =
Mihai Putinar , title =. Indiana University Mathematics Journal , volume =
-
[11]
Annals of Pure and Applied Logic , volume =
Dima Grigoriev and Nicolai Vorobjov , title =. Annals of Pure and Applied Logic , volume =
- [12]
-
[13]
Dima Grigoriev and Edward A. Hirsch and Dmitrii V. Pasechnik , title =. Proceedings of the 19th International Symposium on Theoretical Aspects of Computer Science (STACS 2002) , pages =
work page 2002
-
[14]
Journal of Pure and Applied Algebra , volume =
Mareike Dressler , title =. Journal of Pure and Applied Algebra , volume =
- [15]
-
[16]
Sum-of-squares hierarchy lower bounds for symmetric formulations , booktitle =
Adam Kurpisz and Samuli Lepp. Sum-of-squares hierarchy lower bounds for symmetric formulations , booktitle =
-
[17]
Adam Kurpisz and Aaron Potechin and Elias Samuel Wirth , title =. Proceedings of the 48th International Colloquium on Automata, Languages, and Programming (ICALP 2021) , series =. 2021 , publisher =
work page 2021
-
[18]
Adam Kurpisz , title =. Proceedings of the 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019) , series =. 2019 , publisher =
work page 2019
-
[19]
On the hardest problem formulations for the 0/1 Lasserre hierarchy , journal =
Adam Kurpisz and Samuli Lepp. On the hardest problem formulations for the 0/1 Lasserre hierarchy , journal =
-
[20]
Computational Complexity , volume =
Dima Grigoriev , title =. Computational Complexity , volume =
- [21]
-
[22]
Michel X. Goemans and David P. Williamson , title =. Journal of the ACM , volume =
-
[23]
On the Shannon capacity of a graph , journal =
L. On the Shannon capacity of a graph , journal =
-
[24]
Proceedings of the International Congress of Mathematicians (ICM 2014), Volume IV , pages =
Boaz Barak and David Steurer , title =. Proceedings of the International Congress of Mathematicians (ICM 2014), Volume IV , pages =. 2014 , publisher =
work page 2014
- [25]
-
[26]
Emerging Applications of Algebraic Geometry , pages =
Monique Laurent , title =. Emerging Applications of Algebraic Geometry , pages =. 2008 , publisher =
work page 2008
-
[27]
Mathematics of Operations Research , volume =
William Cook and Sanjeeb Dash , title =. Mathematics of Operations Research , volume =
-
[28]
Coppersmith--Rivlin type inequalities and the order of vanishing of polynomials at 1 , journal =
Tam. Coppersmith--Rivlin type inequalities and the order of vanishing of polynomials at 1 , journal =
-
[29]
Don Coppersmith and T. J. Rivlin , title =. SIAM Journal on Mathematical Analysis , volume =
-
[30]
Proceedings of the 31st Conference on Computational Complexity (CCC 2016) , pages =
Troy Lee and Anupam Prakash and Ronald de Wolf and Henry Yuen , title =. Proceedings of the 31st Conference on Computational Complexity (CCC 2016) , pages =. 2016 , publisher =
work page 2016
-
[31]
Grigoriy Blekherman and Pablo A. Parrilo and Rekha R. Thomas , title =
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.