← back to paper
arxiv: 2604.06738 · 2 revisions
Beyond Pessimism: Offline Learning in KL-regularized Games