Generalist agents reach published data-selection baselines but require scaffolds forcing method adaptation to autonomously compose a policy that outperforms baselines at one-tenth the data budget.
2603.05764 , archivePrefix=
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
GRACE-DS supplies metrics and a guarded sandbox for end-to-end evaluation of LLM AutoML agents on organization-specific tabular tasks, with flexible iterative interaction outperforming baselines on hidden-test quality and protocol validity across more than 7000 episodes.
citing papers explorer
-
Can Generalist Agents Automate Data Curation?
Generalist agents reach published data-selection baselines but require scaffolds forcing method adaptation to autonomously compose a policy that outperforms baselines at one-tenth the data budget.