pith.
Research
Integrity
Review
Pre-print
sign in
Physics
Mathematics
Computer Science
Biology
Finance
Statistics
Systems
Economics
authors
/ Yanchen Yin
Yanchen Yin
Identifiers
name variant
Yanchen Yin
0.60 · backfill
Papers (1)
Robust Harmful Features Under Jailbreak Attacks: Mechanistic Evidence from Attention Head Specialization in Large Language Models
cs.CR · 2026 · author #1
Mentions
2606.28153
#1 · arxiv_oai · confidence 0.70
Yanchen Yin
Frequent Coauthors
Dongqi Han
1 shared papers
Linghui Li
1 shared papers