OptiVerse is a new benchmark spanning neglected optimization domains that shows LLMs suffer sharp accuracy drops on hard problems due to modeling and logic errors, with a Dual-View Auditor Agent proposed to improve performance.
arXiv e-prints , pages=
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it