(For target ambiguity this is true essentially by construction; rate fail only if the prompt somehow makes the choice trivial or moot.)

DECISION RELEVANT -resolving the ambiguity changes a task-level choice a competent solver should make: which column is fit, which column appears in the submission file, conse

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Ambig-DS: A Benchmark for Task-Framing Ambiguity in Data-Science Agents

cs.AI · 2026-05-10 · unverdicted · novelty 7.0

Ambig-DS shows data-science agents degrade on ambiguous tasks via silent wrong framings, with one clarifying question recovering much loss but agents unable to decide when to ask.

citing papers explorer

Showing 1 of 1 citing paper.

Ambig-DS: A Benchmark for Task-Framing Ambiguity in Data-Science Agents cs.AI · 2026-05-10 · unverdicted · none · ref 12
Ambig-DS shows data-science agents degrade on ambiguous tasks via silent wrong framings, with one clarifying question recovering much loss but agents unable to decide when to ask.

(For target ambiguity this is true essentially by construction; rate fail only if the prompt somehow makes the choice trivial or moot.)

fields

years

verdicts

representative citing papers

citing papers explorer