ATLAS introduces an LLM-orchestrated agentic framework for dynamic test-time scaling via extensible 'explore' actions, achieving higher accuracy with fewer API calls than fixed-workflow baselines on four benchmarks.
Optimal stopping vs best-of-n for inference time optimization.arXiv preprint arXiv:2510.01394, 2025
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Query-centric AQP and proxy-model strategies reduce expensive model calls by 60-90% with under 10% error on TPC-DS and LLM tasks.
citing papers explorer
-
Query-Centric Optimization of AI Workflows via Approximate Query Processing and Proxy Models
Query-centric AQP and proxy-model strategies reduce expensive model calls by 60-90% with under 10% error on TPC-DS and LLM tasks.