pith. sign in

Newton Cheng

Identifiers

No identifiers captured yet.

Papers (4)

  1. Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training cs.CR · 2024 · author #10
  2. Towards Understanding Sycophancy in Language Models cs.CL · 2023 · author #7
  3. Measuring Faithfulness in Chain-of-Thought Reasoning cs.AI · 2023 · author #13
  4. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model cs.CL · 2022 · author #224

Mentions

No mention provenance yet.

Frequent Coauthors