← back to paper
arxiv: 2604.23238 · 2 revisions
Hiding in Plain Sight: Detectability-Aware Antidistillation of Reasoning Models