OptiVerse is a new benchmark spanning neglected optimization domains that shows LLMs suffer sharp accuracy drops on hard problems due to modeling and logic errors, with a Dual-View Auditor Agent proposed to improve performance.
arXiv preprint arXiv:2507.11737 (2025)
5 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 5roles
background 2polarities
background 2representative citing papers
LLM-MAS uses prompt-embedded design choices to drive multi-agent LLM simulations modeled as a controlled Markov chain, with an on-trajectory algorithm for zeroth-order gradient-based optimization of steady-state performance.
A survey compiling roles, applications, benchmarks, challenges, and future directions for large language models in operations research.
citing papers explorer
-
OptiVerse: A Comprehensive Benchmark towards Optimization Problem Solving
OptiVerse is a new benchmark spanning neglected optimization domains that shows LLMs suffer sharp accuracy drops on hard problems due to modeling and logic errors, with a Dual-View Auditor Agent proposed to improve performance.
-
Optimizing Service Operations via LLM-Powered Multi-Agent Simulation
LLM-MAS uses prompt-embedded design choices to drive multi-agent LLM simulations modeled as a controlled Markov chain, with an on-trajectory algorithm for zeroth-order gradient-based optimization of steady-state performance.
-
Large Language Models for Operations Research: A Comprehensive Survey
A survey compiling roles, applications, benchmarks, challenges, and future directions for large language models in operations research.
- Democratizing Large-Scale Re-Optimization with LLM-Guided Model Patches
- Dual-Cluster Memory Agent: Resolving Multi-Paradigm Ambiguity in Optimization Problem Solving