Tim Maxwell
Identifiers
No identifiers captured yet.
Papers (1)
- Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training · cs.CR · 2024 · author #9
Mentions
No mention provenance yet.
Frequent Coauthors
- Adam Jermyn 1 shared paper
- Amanda Askell 1 shared paper
- Ansh Radhakrishnan 1 shared paper
- Buck Shlegeris 1 shared paper
- Carson Denison 1 shared paper
- Cem Anil 1 shared paper
- Daniel M. Ziegler 1 shared paper
- David Duvenaud 1 shared paper
- Deep Ganguli 1 shared paper
- Ethan Perez 1 shared paper
- Evan Hubinger 1 shared paper
- Fazl Barez 1 shared paper
- Holden Karnofsky 1 shared paper
- Jack Clark 1 shared paper
- Jan Brauner 1 shared paper
- Jared Kaplan 1 shared paper
- Jesse Mu 1 shared paper
- Kamal Ndousse 1 shared paper
- Kshitij Sachan 1 shared paper
- Logan Graham 1 shared paper