Jan Brauner
Identifiers
No identifiers captured yet.
Papers (2)
- Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training · cs.CR · 2024 · author #29
- Measuring Faithfulness in Chain-of-Thought Reasoning · cs.AI · 2023 · author #28
Mentions
No mention provenance yet.
Frequent Coauthors
- Ansh Radhakrishnan 2 shared papers
- Carson Denison 2 shared papers
- Ethan Perez 2 shared papers
- Evan Hubinger 2 shared papers
- Jared Kaplan 2 shared papers
- Newton Cheng 2 shared papers
- Nicholas Schiefer 2 shared papers
- Samuel R. Bowman 2 shared papers
- Tamera Lanham 2 shared papers
- Adam Jermyn 1 shared paper
- Amanda Askell 1 shared paper
- Anna Chen 1 shared paper
- Benoit Steiner 1 shared paper
- Buck Shlegeris 1 shared paper
- Cem Anil 1 shared paper
- Daniel M. Ziegler 1 shared paper
- Danny Hernandez 1 shared paper
- David Duvenaud 1 shared paper
- Deep Ganguli 1 shared paper
- Dustin Li 1 shared paper