We take prompts submitted to our API, and several model completions, and have labelers rank the completions by overall quality

Agreement on rankings

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Training language models to follow instructions with human feedback

cs.CL · 2022-03-04 · accept · novelty 8.0

Fine-tuning GPT-3 on human demonstrations followed by reinforcement learning from human preference rankings yields smaller models that humans judge superior to the much larger base model on instruction following, truthfulness, and reduced toxicity.

citing papers explorer

Showing 1 of 1 citing paper.

Training language models to follow instructions with human feedback cs.CL · 2022-03-04 · accept · none · ref 3
Fine-tuning GPT-3 on human demonstrations followed by reinforcement learning from human preference rankings yields smaller models that humans judge superior to the much larger base model on instruction following, truthfulness, and reduced toxicity.

We take prompts submitted to our API, and several model completions, and have labelers rank the completions by overall quality

fields

years

verdicts

representative citing papers

citing papers explorer