pith. sign in

Do large language models know what they don’t know? InFindings of the Association for Computational Linguistics, 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.RO 1

years

2026 1

verdicts

CONDITIONAL 1

representative citing papers

The Yes-Man Syndrome: Benchmarking Abstention in Embodied Robotic Agents

cs.RO · 2026-05-19 · conditional · novelty 8.0

The paper presents RoboAbstention, a new benchmark showing frontier VLMs and embodied planners abstain on only 16.5-39% of 6,069 instructions grounded in robotics images, with prompting interventions raising rates to 88-93% but not solving the problem.

citing papers explorer

Showing 1 of 1 citing paper.

  • The Yes-Man Syndrome: Benchmarking Abstention in Embodied Robotic Agents cs.RO · 2026-05-19 · conditional · none · ref 42

    The paper presents RoboAbstention, a new benchmark showing frontier VLMs and embodied planners abstain on only 16.5-39% of 6,069 instructions grounded in robotics images, with prompting interventions raising rates to 88-93% but not solving the problem.