CRAFT benchmark shows multi-agent coordination under partial information remains unsolved for current LLMs, with smaller open-weight models often matching or beating frontier systems.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
CRAFT: Grounded Multi-Agent Coordination Under Partial Information
CRAFT benchmark shows multi-agent coordination under partial information remains unsolved for current LLMs, with smaller open-weight models often matching or beating frontier systems.