Long Phan
Identifiers
- name variant Long Phan 0.60 · backfill
Papers (6)
- Reducing Political Manipulation with Consistency Training cs.CL · 2026 · author #1
- Humanity's Last Exam cs.LG · 2025 · author #1
- The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning cs.LG · 2024 · author #10
- HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal cs.LG · 2024 · author #2
- Representation Engineering: A Top-Down Approach to AI Transparency cs.LG · 2023 · author #2
- BLOOM: A 176B-Parameter Open-Access Multilingual Language Model cs.CL · 2022 · author #78
Mentions
- 2605.22771 #1 · arxiv_oai · confidence 0.70 Long Phan
- 2403.03218 #10 · arxiv_oai · confidence 0.70 Long Phan
Frequent Coauthors
- Dan Hendrycks 5 shared papers
- Andy Zou 4 shared papers
- Mantas Mazeika 4 shared papers
- Nathaniel Li 4 shared papers
- Adam Khoja 3 shared papers
- Alexander Pan 3 shared papers
- Steven Basart 3 shared papers
- Zifan Wang 3 shared papers
- Alexandr Wang 2 shared papers
- Alham Fikri Aji 2 shared papers
- Alice Gatti 2 shared papers
- Ann-Kathrin Dombrowski 2 shared papers
- Dawn Song 2 shared papers
- Genta Indra Winata 2 shared papers
- Gunjan Chhablani 2 shared papers
- Hailey Schoelkopf 2 shared papers
- Hieu Tran 2 shared papers
- Michael Chen 2 shared papers
- M Saiful Bari 2 shared papers
- Niklas Muennighoff 2 shared papers