Dario Amodei — Pith Author Registry

Identifiers

name variant · Dario Amodei · confidence 0.60 · backfill

Papers (27)

Discovering Language Model Behaviors with Model-Written Evaluations cs.CL · 2022 · author #20
Constitutional AI: Harmlessness from AI Feedback cs.CL · 2022 · author #47
Measuring Progress on Scalable Oversight for Large Language Models cs.HC · 2022 · author #16
In-context Learning and Induction Heads cs.LG · 2022 · author #21
Toy Models of Superposition cs.LG · 2022 · author #14
Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned cs.CL · 2022 · author #30
Language Models (Mostly) Know What They Know cs.CL · 2022 · author #29
Scaling Laws and Interpretability of Learning from Repeated Data cs.LG · 2022 · author #15
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback cs.CL · 2022 · author #25
A General Language Assistant as a Laboratory for Alignment cs.CL · 2021 · author #17
Evaluating Large Language Models Trained on Code cs.LG · 2021 · author #55
Scaling Laws for Autoregressive Generative Modeling cs.LG · 2020 · author #18
Learning to summarize from human feedback cs.CL · 2020 · author #8
Language Models are Few-Shot Learners cs.CL · 2020 · author #31
Scaling Laws for Neural Language Models cs.LG · 2020 · author #10
Fine-Tuning Language Models from Human Preferences cs.CL · 2019 · author #6
An Empirical Model of Large-Batch Training cs.LG · 2018 · author #3
Reward learning from human preferences and demonstrations in Atari cs.LG · 2018 · author #6
Supervising strong learners by amplifying weak experts cs.LG · 2018 · author #3
Variational Option Discovery Algorithms cs.AI · 2018 · author #3
AI safety via debate stat.ML · 2018 · author #3
Deep reinforcement learning from human preferences stat.ML · 2017 · author #6
Learning a Natural Language Interface with Neural Programmer cs.CL · 2016 · author #5
Concrete Problems in AI Safety cs.AI · 2016 · author #1
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin cs.CL · 2015 · author #1
Searching for collective behavior in a network of real neurons q-bio.NC · 2013 · author #3
The simplest maximum entropy model for collective behavior in a neural network q-bio.NC · 2012 · author #4

Mentions

1306.3061 #3 · backfill · confidence 0.70 Dario Amodei
1207.6319 #4 · backfill · confidence 0.70 Dario Amodei
2009.01325 #8 · arxiv_oai · confidence 0.70 Dario Amodei
2205.10487 #15 · arxiv_oai · confidence 0.70 Dario Amodei
2211.03540 #16 · arxiv_oai · confidence 0.70 Dario Amodei
1706.03741 #6 · arxiv_oai · confidence 0.70 Dario Amodei

Frequent Coauthors

Jared Kaplan 15 shared papers
Sam McCandlish 15 shared papers
Tom Henighan 13 shared papers
Dawn Drain 10 shared papers
Nelson Elhage 10 shared papers
Nicholas Joseph 10 shared papers
Zac Hatfield-Dodds 10 shared papers
Amanda Askell 9 shared papers
Ben Mann 9 shared papers
Catherine Olsson 9 shared papers
Nova DasSarma 9 shared papers
Tom Brown 9 shared papers
Andy Jones 8 shared papers
Anna Chen 8 shared papers
Danny Hernandez 8 shared papers
Jackson Kernion 8 shared papers
Kamal Ndousse 8 shared papers
Scott Johnston 8 shared papers
Tristan Hume 8 shared papers
Yuntao Bai 8 shared papers