Can You Finetune Your

Elhassan, Fay, Ajroldi, Niccol · arXiv 2504.06446

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

CIAware-Bench: Benchmarking Control Intervention Awareness Across Frontier LLMs

cs.AI · 2026-06-09 · unverdicted · novelty 7.0

CIAware-Bench shows frontier LLMs exhibit low to moderate control intervention awareness, with detection accuracy reaching at most 0.87 across four task domains and eleven models.

citing papers explorer

Showing 1 of 1 citing paper after filters.

CIAware-Bench: Benchmarking Control Intervention Awareness Across Frontier LLMs

cs.AI · 2026-06-09 · unverdicted · none · ref 29

CIAware-Bench shows frontier LLMs exhibit low to moderate control intervention awareness, with detection accuracy reaching at most 0.87 across four task domains and eleven models.
