Anjali Gopal

Identifiers

  • name variant Anjali Gopal 0.60 · backfill

Papers (3)

  1. Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming · cs.CL · 2025 · author #19
  2. The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning · cs.LG · 2024 · author #3
  3. Will releasing the weights of future large language models grant widespread access to pandemic agents? · cs.AI · 2023 · author #1

Mentions

  • 2310.18233 #1 · arxiv_oai · confidence 0.70 · Anjali Gopal
  • 2501.18837 #19 · arxiv_oai · confidence 0.70 · Anjali Gopal
  • 2403.03218 #3 · arxiv_oai · confidence 0.70 · Anjali Gopal

Frequent Coauthors