Improved bounds for private and robust alignment

Weng, W · 2025 · arXiv 2512.23816

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

On the Optimal Sample Complexity of Offline Multi-Armed Bandits with KL Regularization

cs.LG · 2026-05-04 · unverdicted · novelty 6.0

Offline KL-regularized MABs require sample complexity scaling as O(η S A C^π*/ε) for large regularization and Ω(S A C^π*/ε²) for small regularization, with matching lower bounds across the full range.

citing papers explorer

On the Optimal Sample Complexity of Offline Multi-Armed Bandits with KL Regularization

cs.LG · 2026-05-04 · unverdicted · none · ref 54

Offline KL-regularized MABs require sample complexity scaling as O(η S A C^π*/ε) for large regularization and Ω(S A C^π*/ε²) for small regularization, with matching lower bounds across the full range.
