← back to paper
arxiv: 2605.05995 · 2 revisions
Safety Anchor: Defending Harmful Fine-tuning via Geometric Bottlenecks