Two-Sided Time-Independent Regret for Matching Markets with Limited Interviews

· 2026 · cs.GT · arXiv 2602.12224

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Two-sided matching platforms rely on preferences from both sides, yet participants can evaluate only a small fraction of potential partners. In practice, they use low-cost pre-match screening, e.g., interviews, profile views, or trial tasks, to form noisy impressions before committing to applications and offers. We study bandit learning in matching markets with interviews, modeling these interactions as queried \emph{hints}~\citep{DBLP:conf/innovations/BhaskaraGIKM23} that reveal partial preference information to both sides while constraining subsequent applications. Our framework also allows firm-side uncertainty: firms, like agents, learn their preferences and may make early hiring mistakes. To address this, we introduce strategic deferral, a firm-side action that permits temporary vacancy, corrects premature commitments, and enables decentralized learning under coarse anonymous feedback. We design algorithms for centralized and decentralized markets and show that a constant number of interviews per round suffices for horizon-independent regret, improving over the $O(\log T)$ guarantees known without interviews. Our bounds are near-optimal: the centralized guarantee is within a factor $m$ of an information-theoretic lower bound, while decentralized algorithms match it up to polynomial factors in structured markets and remain horizon-independent in general markets.

representative citing papers

Learn to Match: Two-Sided Matching with Temporally Extended Feedback

cs.LG · 2026-06-04 · unverdicted · novelty 7.0

Learn2Match is a POMG-based MARL benchmark for two-sided matching with temporally extended feedback; independent PPO yields higher social welfare and lower regret than CA-ETC but higher information-friction loss.

citing papers explorer

Showing 1 of 1 citing paper.

Learn to Match: Two-Sided Matching with Temporally Extended Feedback cs.LG · 2026-06-04 · unverdicted · none · ref 38 · internal anchor
Learn2Match is a POMG-based MARL benchmark for two-sided matching with temporally extended feedback; independent PPO yields higher social welfare and lower regret than CA-ETC but higher information-friction loss.

Two-Sided Time-Independent Regret for Matching Markets with Limited Interviews

fields

years

verdicts

representative citing papers

citing papers explorer