Sam Ringer
Identifiers
No identifiers captured yet.
Papers (4)
- Discovering Language Model Behaviors with Model-Written Evaluations cs.CL · 2022 · author #2
- Constitutional AI: Harmlessness from AI Feedback cs.CL · 2022 · author #34
- Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned cs.CL · 2022 · author #28
- Language Models (Mostly) Know What They Know cs.CL · 2022 · author #28
Mentions
No mention provenance yet.
Frequent Coauthors
- Amanda Askell 4 shared papers
- Andy Jones 4 shared papers
- Anna Chen 4 shared papers
- Ben Mann 4 shared papers
- Catherine Olsson 4 shared papers
- Danny Hernandez 4 shared papers
- Dario Amodei 4 shared papers
- Dawn Drain 4 shared papers
- Deep Ganguli 4 shared papers
- Eli Tran-Johnson 4 shared papers
- Ethan Perez 4 shared papers
- Jackson Kernion 4 shared papers
- Jared Kaplan 4 shared papers
- Kamal Ndousse 4 shared papers
- Liane Lovitt 4 shared papers
- Nelson Elhage 4 shared papers
- Nicholas Joseph 4 shared papers
- Nicholas Schiefer 4 shared papers
- Nova DasSarma 4 shared papers
- Sam McCandlish 4 shared papers