Budget-aware tool-use enables effective agent scaling

6 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary: background 3

citation-polarity summary

fields: cs.AI 3, cs.LG 3

years: 2026 6

roles: background 3

polarities: background 3

representative citing papers

Why Retrying Fails: Context Contamination in LLM Agent Pipelines

cs.AI · 2026-05-08 · conditional · novelty 7.0

A Context-Contaminated Restart Model yields exact success probabilities and an optimal pipeline depth T* = sqrt(B * log(1/(1-ε1)) / log(1/(1-ε0))) for a fixed budget B. Validated on SWE-bench, it fits the data far better than IID assumptions do.
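
The quoted expression for T* can be evaluated directly. The following is a minimal sketch that only transcribes the formula as it appears in the summary above. The function name `optimal_depth` and the parameter names `budget`, `eps0`, and `eps1` are hypothetical, and the summary does not define ε0 and ε1, so the sketch assumes only that both lie strictly between 0 and 1.

```python
import math


def optimal_depth(budget: float, eps0: float, eps1: float) -> float:
    """Evaluate T* = sqrt(B * log(1/(1-eps1)) / log(1/(1-eps0))).

    Assumption: eps0 and eps1 are probabilities strictly between 0 and 1;
    their exact meaning is not given in the summary this sketch is based on.
    """
    if budget <= 0:
        raise ValueError("budget must be positive")
    if not (0.0 < eps0 < 1.0 and 0.0 < eps1 < 1.0):
        raise ValueError("eps0 and eps1 must lie strictly between 0 and 1")
    # log(1/(1-x)) equals -log1p(-x); log1p stays accurate for small x.
    numerator = -math.log1p(-eps1)
    denominator = -math.log1p(-eps0)
    return math.sqrt(budget * numerator / denominator)


if __name__ == "__main__":
    # With eps0 == eps1 the log ratio is 1, so T* reduces to sqrt(B).
    print(optimal_depth(100, 0.5, 0.5))
```

When ε0 = ε1 the two logarithms cancel and the expression reduces to sqrt(B), which gives a quick sanity check on any implementation.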

Evaluation-driven Scaling for Scientific Discovery

cs.LG · 2026-04-21 · unverdicted · novelty 6.0

SimpleTES scales test-time evaluation in LLMs to discover state-of-the-art solutions on 21 scientific problems across six domains, outperforming frontier models and optimization pipelines. Examples include a 2x faster LASSO and new Erdős constructions.
