BOND distills Bayesian posteriors over six opponent priority orderings from a teacher LLM into an 8B student model, achieving Brier score 0.114 on CaSiNo while enabling posterior trajectory audits and outperforming a 70B baseline in calibration.
arXiv preprint arXiv:2002.07788 , year=
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Self-play RL in a takeover auction model shows optimal due diligence is modest and finite, decreasing with cost and competition, while simple general methods outperform specialized ones in large intractable games.
citing papers explorer
-
Distilling Bayesian Belief States into Language Models for Auditable Negotiation
BOND distills Bayesian posteriors over six opponent priority orderings from a teacher LLM into an 8B student model, achieving Brier score 0.114 on CaSiNo while enabling posterior trajectory audits and outperforming a 70B baseline in calibration.
-
How Much Due Diligence Before You Bid? Learning in Intractable Takeover Auctions
Self-play RL in a takeover auction model shows optimal due diligence is modest and finite, decreasing with cost and competition, while simple general methods outperform specialized ones in large intractable games.