pith. machine review for the scientific record. sign in

Title resolution pending

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.AI 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Automated alignment is harder than you think

cs.AI · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

Automating alignment research with AI agents risks undetected systematic errors in fuzzy tasks, producing overconfident but misleading safety evaluations that could enable deployment of misaligned AI.

citing papers explorer

Showing 1 of 1 citing paper.

  • Automated alignment is harder than you think cs.AI · 2026-05-07 · unverdicted · none · ref 7 · 2 links

    Automating alignment research with AI agents risks undetected systematic errors in fuzzy tasks, producing overconfident but misleading safety evaluations that could enable deployment of misaligned AI.