Expanding LLM agent boundaries with strategy-guided exploration. arXiv preprint arXiv:2603.02045

Andrew Szot, Michael Kirchhof, Omar Attia, Alexander Toshev · arXiv 2603.02045

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Exploration and Exploitation Errors Are Measurable for Language Model Agents

cs.AI · 2026-04-14 · unverdicted · novelty 7.0

A policy-agnostic metric and controllable 2D grid environments with task DAGs enable measurement of exploration and exploitation errors in language model agents from observed actions.

DORA Explorer: Improving the Exploration Ability of LLMs Without Training

cs.CL · 2026-04-19 · unverdicted · novelty 5.0

DORA Explorer boosts LLM agent exploration without training by ranking diverse actions using log-probabilities and a tunable parameter, yielding UCB-competitive results on multi-armed bandits and gains on text adventure environments.

citing papers explorer

Exploration and Exploitation Errors Are Measurable for Language Model Agents · cs.AI · 2026-04-14 · unverdicted · none · ref 9
DORA Explorer: Improving the Exploration Ability of LLMs Without Training · cs.CL · 2026-04-19 · unverdicted · none · ref 10
