Hoagy Cunningham
Identifiers
- name variant Hoagy Cunningham 0.60 · backfill
Papers (4)
- Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet · cs.AI · 2026 · author #11
- Segment-Level Coherence for Robust Harmful Intent Probing in LLMs · cs.CL · 2026 · author #5
- Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming · cs.CL · 2025 · author #17
- Sparse Autoencoders Find Highly Interpretable Features in Language Models · cs.LG · 2023 · author #1
Mentions
- 2605.29358 #11 · arxiv_oai · confidence 0.70 Hoagy Cunningham
- 2501.18837 #17 · arxiv_oai · confidence 0.70 Hoagy Cunningham
Frequent Coauthors
- Francesco Mosconi 2 shared papers
- Jerry Wei 2 shared papers
- Adam Jermyn 1 shared paper
- Adam Pearce 1 shared paper
- Adly Templeton 1 shared paper
- Aidan Ewart 1 shared paper
- Alex Silverstein 1 shared paper
- Alex Tamkin 1 shared paper
- Alwin Peng 1 shared paper
- Amanda Askell 1 shared paper
- Andy Dau 1 shared paper
- Andy Jones 1 shared paper
- Anjali Gopal 1 shared paper
- Bilgehan Sel 1 shared paper
- Brian Chen 1 shared paper
- Callum McDougall 1 shared paper
- Catherine Olsson 1 shared paper
- C. Daniel Freeman 1 shared paper
- Cem Anil 1 shared paper
- Chris Olah 1 shared paper