Nathaniel Li
Identifiers
- name variant Nathaniel Li 0.60 · backfill
Papers (5)
- Code World Model Preparedness Report cs.SE · 2026 · author #6
- Humanity's Last Exam cs.LG · 2025 · author #4
- The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning cs.LG · 2024 · author #1
- HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal cs.LG · 2024 · author #8
- Representation Engineering: A Top-Down Approach to AI Transparency cs.LG · 2023 · author #12
Mentions
- 2403.03218 #1 · arxiv_oai · confidence 0.70 Nathaniel Li
Frequent Coauthors
- Andy Zou 4 shared papers
- Dan Hendrycks 4 shared papers
- Long Phan 4 shared papers
- Mantas Mazeika 4 shared papers
- Steven Basart 3 shared papers
- Summer Yue 3 shared papers
- Zifan Wang 3 shared papers
- Adam Khoja 2 shared papers
- Alexander Pan 2 shared papers
- Alexandr Wang 2 shared papers
- Alice Gatti 2 shared papers
- Ann-Kathrin Dombrowski 2 shared papers
- Dawn Song 2 shared papers
- Michael Chen 2 shared papers
- Oam Patel 2 shared papers
- Oliver Zhang 2 shared papers
- Richard Ren 2 shared papers
- Shashwat Goel 2 shared papers
- Xuwang Yin 2 shared papers
- Ziwen Han 2 shared papers