Dynamic matching bandit for two-sided online markets. arXiv preprint arXiv:2205.03699, 2022

Yuantong Li, Chi-hua Wang, Guang Cheng, Will Wei Sun · 2022 · arXiv 2205.03699

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Learn to Match: Two-Sided Matching with Temporally Extended Feedback

cs.LG · 2026-06-04 · unverdicted · novelty 7.0

Learn2Match is a benchmark for two-sided matching with temporally extended feedback, built on a partially observable Markov game (POMG) for multi-agent reinforcement learning (MARL); independent PPO yields higher social welfare and lower regret than CA-ETC, but higher information-friction loss.

A Linear Matching Bandit Approach to Online Multi-Human Multi-Robot Teaming

cs.LG · 2026-06-28 · unverdicted · novelty 6.0

LinMatch recasts linear matching bandits as maximum-weight matching LPs solvable by the Hungarian algorithm and proves tight regret bounds of $\tilde{\Theta}(d\sqrt{MKT})$.

citing papers explorer

Learn to Match: Two-Sided Matching with Temporally Extended Feedback · cs.LG · 2026-06-04 · unverdicted · none · ref 30
A Linear Matching Bandit Approach to Online Multi-Human Multi-Robot Teaming · cs.LG · 2026-06-28 · unverdicted · none · ref 39
