pith. sign in

Must allow pets

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2025 1

verdicts

UNVERDICTED 1

representative citing papers

COMPASS: Benchmarking Constrained Optimization in LLM Agents

cs.LG · 2025-10-08 · unverdicted · novelty 7.0

COMPASS benchmark shows LLM agents reach 70-90% feasibility but only 20-60% optimality on constrained travel planning tasks, attributing the gap to insufficient search space exploration rather than tool use.

citing papers explorer

Showing 1 of 1 citing paper.

  • COMPASS: Benchmarking Constrained Optimization in LLM Agents cs.LG · 2025-10-08 · unverdicted · none · ref 5

    COMPASS benchmark shows LLM agents reach 70-90% feasibility but only 20-60% optimality on constrained travel planning tasks, attributing the gap to insufficient search space exploration rather than tool use.