pith. sign in

Sample Average Approximation for Distributionally Robust Optimization with $\phi$-divergences

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it
abstract

It is well known that estimating the expectation of any given bounded random variable with values in $[-B, B]$ has a sample complexity of $\mathrm{O}(B^2/\epsilon^2)$ that is independent of the underlying probability measure. We show that this property can no longer hold when evaluating the worst-case expectation of the random variable, where the probability measures defining the expectation belong to a $\phi$-divergence ball centered at some nominal measure $P$. Specifically, the sample complexity and its dependence on the nominal measure can be completely characterized by the growth of the divergence function. When the divergence function $\phi$ exhibits superlinear growth, a $P$-independent sample complexity can be obtained for sample average approximation, which depends only on the growth of $\phi$, the radius of the divergence ball, and the target precision. We also provide sample complexity lower bounds and demonstrate the optimality of the obtained bounds for commonly used $\phi$-divergences. On the other hand, when superlinear growth does not hold for $\phi$, we show that for any estimation method, evaluating the worst-case expectation has a $P$-dependent sample complexity lower bound that can be made arbitrarily large by changing $P$.

fields

math.OC 1

years

2026 1

verdicts

UNVERDICTED 1

clear filters

representative citing papers

Robust Markov Decision Processes on Continuous State Spaces

math.OC · 2026-05-27 · unverdicted · novelty 6.0

Develops stochastic first-order methods for robust policy evaluation and approximate policy iteration in continuous-state robust MDPs, achieving 'O(1/ε^{2}) sample complexity for both evaluation and optimization.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Robust Markov Decision Processes on Continuous State Spaces math.OC · 2026-05-27 · unverdicted · none · ref 31 · internal anchor

    Develops stochastic first-order methods for robust policy evaluation and approximate policy iteration in continuous-state robust MDPs, achieving 'O(1/ε^{2}) sample complexity for both evaluation and optimization.