WebGPT fine-tunes GPT-3 with web browsing, imitation learning, and human feedback to generate answers on ELI5 that humans prefer 56% over demonstrators and 69% over top Reddit answers.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2021 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
WebGPT: Browser-assisted question-answering with human feedback
WebGPT fine-tunes GPT-3 with web browsing, imitation learning, and human feedback to generate answers on ELI5 that humans prefer 56% over demonstrators and 69% over top Reddit answers.