CalBench is a new benchmark for multi-agent LLM calendar scheduling that measures task success, excess cost, communication efficiency, burden fairness, and privacy leakage under private information constraints.
Slot 2 requires me to reschedule an errand (catching up with a neighbor over coffee)