pith. sign in

Sam Ringer

Identifiers

No identifiers captured yet.

Papers (4)

  1. Discovering Language Model Behaviors with Model-Written Evaluations cs.CL · 2022 · author #2
  2. Constitutional AI: Harmlessness from AI Feedback cs.CL · 2022 · author #34
  3. Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned cs.CL · 2022 · author #28
  4. Language Models (Mostly) Know What They Know cs.CL · 2022 · author #28

Mentions

No mention provenance yet.

Frequent Coauthors