Improving on a Lottery: Efficient Estimation of Optimal Assignment Rules
read the original abstract
Scarce opportunities are often allocated by lotteries. We study how to improve such allocations by estimating optimal assignment rules that maximize welfare net of a Kullback--Leibler penalty for departing from the benchmark randomization. The framework covers discrete, continuous, and mixed treatments. Regret is asymptotically quadratic in the estimation error, so inefficient estimation raises the mean of limiting regret, not merely its dispersion. We show that inverse probability weighting with known assignment probabilities is inefficient, whereas estimated-propensity and doubly robust welfare criteria attain the efficient regret distribution. Simulations and a commitment-savings application quantify the resulting precision gains.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
Wasserstein Policy Learning for Distributional Outcomes
Establishes finite-sample regret bounds of order sqrt(N-dim(Π)/N) for IPW and DR estimators in Wasserstein policy learning with distributional outcomes, plus a matching minimax lower bound.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.