pith. sign in

Linda Petrini

Identifiers

  • name variant Linda Petrini 0.60 · backfill

Papers (2)

  1. Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming cs.CL · 2025 · author #30
  2. Alignment faking in large language models cs.AI · 2024 · author #15

Mentions

  • 2501.18837 #30 · arxiv_oai · confidence 0.70 Linda Petrini

Frequent Coauthors