pith. sign in

Yanchen Yin

Identifiers

  • name variant Yanchen Yin 0.60 · backfill

Papers (1)

  1. Robust Harmful Features Under Jailbreak Attacks: Mechanistic Evidence from Attention Head Specialization in Large Language Models cs.CR · 2026 · author #1

Mentions

  • 2606.28153 #1 · arxiv_oai · confidence 0.70 Yanchen Yin

Frequent Coauthors