Jared Mueller

Identifiers

  • name variant Jared Mueller 0.60 · backfill

Papers (3)

  1. Discovering Language Model Behaviors with Model-Written Evaluations cs.CL · 2022 · author #28
  2. Constitutional AI: Harmlessness from AI Feedback cs.CL · 2022 · author #21
  3. Measuring Progress on Scalable Oversight for Large Language Models cs.HC · 2022 · author #22

Mentions

  • 2211.03540 #22 · arxiv_oai · confidence 0.70 Jared Mueller

Frequent Coauthors