Robust Active Preference Elicitation

Duncan McElfresh; Eric Rice; John Dickerson; Phebe Vayanos; Yingxiao Ye

arxiv: 2003.01899 · v2 · pith:ZRMZMV3Wnew · submitted 2020-03-04 · 🧮 math.OC · cs.AI

Robust Active Preference Elicitation

Phebe Vayanos , Yingxiao Ye , Duncan McElfresh , John Dickerson , Eric Rice This is my paper

classification 🧮 math.OC cs.AI

keywords elicitationpreferenceproblemrobustactiveformoptimizationpreferences

0 comments

read the original abstract

We study the problem of eliciting the preferences of a decision-maker through a moderate number of pairwise comparison queries to make them a high quality recommendation for a specific problem. We are motivated by applications in high stakes domains, such as when choosing a policy for allocating scarce resources to satisfy basic needs (e.g., kidneys for transplantation or housing for those experiencing homelessness) where a consequential recommendation needs to be made from the (partially) elicited preferences. We model uncertainty in the preferences as being set based and} investigate two settings: a) an offline elicitation setting, where all queries are made at once, and b) an online elicitation setting, where queries are selected sequentially over time in an adaptive fashion. We propose robust optimization formulations of these problems which integrate the preference elicitation and recommendation phases with aim to either maximize worst-case utility or minimize worst-case regret, and study their complexity. For the offline case, where active preference elicitation takes the form of a two and half stage robust optimization problem with decision-dependent information discovery, we provide an equivalent reformulation in the form of a mixed-binary linear program which we solve via column-and-constraint generation. For the online setting, where active preference learning takes the form of a multi-stage robust optimization problem with decision-dependent information discovery, we propose a conservative solution approach. Numerical studies on synthetic data demonstrate that our methods outperform state-of-the art approaches from the literature in terms of worst-case rank, regret, and utility. We showcase how our methodology can be used to assist a homeless services agency in choosing a policy for allocating scarce housing resources of different types to people experiencing homelessness.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Coordinate-wise Polyhedral Method for Eliciting Multivariate Linear Utility and Univariate Nonlinear Utility Functions
math.OC 2026-06 unverdicted novelty 6.0

Introduces CPM for eliciting multivariate linear and univariate nonlinear utility functions via pre-specified coordinate-wise cuts and linear-system query design, with proven linear convergence rates and piecewise-lin...
Maximum Utility Split Method for Utility Preference Elicitation
math.OC 2026-06 unverdicted novelty 6.0

MUS designs preference questions using one probabilistic lottery and one deterministic outcome at the maximum utility range point to halve the ambiguity set of utility functions, converging to the true utility under m...