LLawCo extracts misaligned behavioral patterns from agent failures to derive laws, incorporates them via SFT into LLM reasoning, and reports 4.5% and 6.8% success rate gains on PARTNR-Dialog and TDW-MAT benchmarks.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.LG (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers:
LLawCo: Learning Laws of Cooperation for Modeling Embodied Multi-Agent Behavior