Introduces PZOS partial zeroth-order algorithm for MPECs that exploits leader's white-box cost information to achieve lower variance than full black-box zeroth-order methods, with convergence to partial Goldstein stationary points and empirical gains on routing and security games.
An optimal algorithm for bandit and zero-order convex optimization with two-point feedback.Journal of Machine Learning Research, 18(52):1–11
2 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 2representative citing papers
Introduces a novel search direction enabling sublinear stochastic bilevel regret guarantees for first- and zeroth-order online bilevel optimization algorithms without relying on window smoothing.
citing papers explorer
-
Black-Box Followers, White-Box Leaders: Partial Zeroth-Order Methods for MPECs
Introduces PZOS partial zeroth-order algorithm for MPECs that exploits leader's white-box cost information to achieve lower variance than full black-box zeroth-order methods, with convergence to partial Goldstein stationary points and empirical gains on routing and security games.
-
Stochastic Regret Guarantees for Online Zeroth- and First-Order Bilevel Optimization
Introduces a novel search direction enabling sublinear stochastic bilevel regret guarantees for first- and zeroth-order online bilevel optimization algorithms without relying on window smoothing.