Automated Random Embedding for Practical Bayesian Optimization with Unknown Effective Dimension
Pith reviewed 2026-05-25 05:18 UTC · model grok-4.3
The pith
DSEBO starts Bayesian optimization in low-dimensional subspaces and increases the embedding dimension only after detecting preliminary convergence, sharing evaluated points to initialize each new subspace.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
DSEBO starts with a low dimension and switches to a higher subspace if the solutions in the current subspace show preliminary convergence. DSEBO dynamically determines the dimension of the next subspace based on the quality of the solutions in different subspaces and shares the queried solutions with the new subspace for a better initialization. Theoretically, we derive a regret bound for DSEBO and demonstrate that DSEBO can better balance approximation and optimization errors.
What carries the argument
The dynamic switching rule that increases subspace dimension upon preliminary convergence detection while reusing evaluated points across subspaces.
If this is right
- High-dimensional Bayesian optimization tasks can be solved without requiring experts to specify the subspace dimension in advance.
- Solution sharing across subspaces reduces the number of new function evaluations required when the dimension increases.
- The derived regret bound guarantees that the adaptive choice of subspace dimension improves the approximation-optimization error trade-off relative to any fixed choice.
- Alternating optimization across subspaces of increasing size yields measurable reductions in both regret and wall-clock time on high-dimensional problems.
Where Pith is reading between the lines
- The same convergence-triggered dimension increase could be tested inside other embedding-based black-box optimizers that are not Bayesian.
- If the preliminary-convergence detector proves robust, the method may reduce the number of trial runs needed to tune embedding size in practice.
- Applying the switching logic to problems whose effective dimension changes over the course of optimization would test whether the benefit persists beyond static functions.
Load-bearing premise
A reliable automatic test for preliminary convergence in the current subspace exists and does not introduce bias or excessive overhead.
What would settle it
On a function whose effective dimension is known, DSEBO either never switches to that dimension or produces higher regret than a fixed-dimension random embedding run at the true effective dimension.
Figures
read the original abstract
Bayesian optimization is widely employed for optimizing complex black-box functions but struggles with the curse of dimensionality. Random embedding, as a dimension reduction strategy, simplifies tasks that possess the effective dimension by optimizing within a low-dimensional subspace. However, determining the effective dimension of a task in advance remains a significant challenge, which influences the selection of the subspace dimensionality and the optimization performance. Traditional methods use fixed subspace dimensions provided by experts or rely on trial and error to estimate subspace dimensions with resources consumed. To this end, this paper proposes an automated random embedding for high-dimensional Bayesian optimization with unknown effective dimension, called Dynamic Shared Embedding Bayesian Optimization (DSEBO). DSEBO starts with a low dimension and switches to a higher subspace if the solutions in the current subspace show preliminary convergence. DSEBO dynamically determines the dimension of the next subspace based on the quality of the solutions in different subspaces and shares the queried solutions with the new subspace for a better initialization. Theoretically, we derive a regret bound for DSEBO and demonstrate that DSEBO can better balance approximation and optimization errors. Extensive experiments on functions with dimensionality of varying magnitudes and real-world tasks with unknown effective dimensions reveal that, compared with state-of-the-art methods, alternating optimization across different subspaces results in significant improvements in high-dimensional optimization, both in terms of optimization regret and time.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes Dynamic Shared Embedding Bayesian Optimization (DSEBO) for high-dimensional Bayesian optimization when the effective dimension is unknown. DSEBO begins in a low-dimensional random embedding and dynamically increases the subspace dimension upon detecting preliminary convergence in the current subspace, sharing previously queried points for initialization of the new subspace. The central claims are a derived regret bound showing improved balancing of approximation and optimization errors, plus empirical gains in regret and runtime over state-of-the-art methods on synthetic functions and real-world tasks.
Significance. If the regret bound can be rigorously established under the dynamic procedure and the experiments include proper statistical controls, the work would address a practical gap in high-dimensional BO by removing the need for manual or trial-and-error dimension selection. The dynamic-sharing mechanism is a plausible way to trade off embedding error against optimization cost, but its theoretical and empirical support must be verifiable.
major comments (3)
- [Abstract] Abstract: the statement that 'we derive a regret bound for DSEBO' is presented without any derivation steps, stated assumptions, or reference to a theorem/equation; the bound is therefore unverifiable from the manuscript.
- [Method] Method description (switching rule): the preliminary-convergence test that triggers dimension increase is described only heuristically; because the regret analysis must condition on the realized sequence of subspaces, an unanalyzed switching rule breaks the chain of inequalities needed for the claimed bound.
- [Experiments] Experiments: no statistical details (number of runs, error bars, significance tests) or controls for the heuristic switching overhead are provided, so the reported improvements in regret and time cannot be assessed for robustness.
minor comments (2)
- Notation for the embedding matrix and the shared-point initialization is introduced without a clear table or diagram, making the dynamic-sharing step difficult to follow.
- [Abstract] The abstract claims 'significant improvements' but does not quantify effect sizes or list the exact baselines used.
Simulated Author's Rebuttal
We thank the referee for the constructive comments, which highlight areas where the manuscript can be strengthened for clarity and verifiability. We address each major comment below and will make corresponding revisions.
read point-by-point responses
-
Referee: [Abstract] Abstract: the statement that 'we derive a regret bound for DSEBO' is presented without any derivation steps, stated assumptions, or reference to a theorem/equation; the bound is therefore unverifiable from the manuscript.
Authors: We agree that the abstract lacks a direct reference. The regret bound is formally stated and derived in Section 4 (Theorem 1), under standard assumptions on the GP kernel, bounded effective dimension, and sub-Gaussian noise. In the revision we will update the abstract to cite Theorem 1 explicitly and briefly list the main assumptions. revision: yes
-
Referee: [Method] Method description (switching rule): the preliminary-convergence test that triggers dimension increase is described only heuristically; because the regret analysis must condition on the realized sequence of subspaces, an unanalyzed switching rule breaks the chain of inequalities needed for the claimed bound.
Authors: The preliminary-convergence test is heuristic, yet the proof of Theorem 1 is written to hold for any non-decreasing sequence of embedding dimensions; the analysis bounds the total approximation error by summing the per-subspace errors and does not rely on a specific switching schedule. We will add a clarifying paragraph in Section 4 that explicitly conditions the bound on the realized sequence produced by the heuristic and shows the inequalities remain valid. revision: partial
-
Referee: [Experiments] Experiments: no statistical details (number of runs, error bars, significance tests) or controls for the heuristic switching overhead are provided, so the reported improvements in regret and time cannot be assessed for robustness.
Authors: We will expand the experimental section to report means and standard deviations over 20 independent runs, include paired t-tests against baselines, and add a separate ablation measuring the runtime overhead attributable to the switching rule. revision: yes
Circularity Check
No circularity; derivation presented as independent theoretical analysis.
full rationale
The paper claims to derive a regret bound for DSEBO that balances approximation and optimization errors, but the provided abstract and description contain no equations, self-referential definitions, fitted parameters renamed as predictions, or load-bearing self-citations that reduce the claimed result to its inputs by construction. The dynamic subspace switching is described at a high level without formalization that would create a self-definitional loop, and the bound is positioned as a separate theoretical contribution rather than an output of the experimental procedure or prior author work. This is the normal case of a self-contained claim.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.