pith:W5AW73QH
Teaching language models to support answers with verified quotes
A 280 billion parameter model can be trained to answer questions with specific cited evidence from documents and to abstain when uncertain.
arxiv:2203.11147 v1 · 2022-03-21 · cs.CL · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{W5AW73QHXHRS4ZAC566MZ6OMRR}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Our 280 billion parameter model, GopherCite, is able to produce answers with high quality supporting evidence and abstain from answering when unsure. The model's response is found to be high-quality 80% of the time on this Natural Questions subset, and 67% of the time on the ELI5 subset. Abstaining from the third of questions for which it is most unsure improves performance to 90% and 80% respectively.
That human raters' preferences for 'high quality supporting evidence' during RLHP training generalize to produce reliable citations and that the model's internal uncertainty signal for abstention is well-calibrated without introducing new biases.
GopherCite produces answers with supporting evidence citations, rated high-quality 80% of the time on Natural Questions and 67% on ELI5, improving to 90% and 80% with abstention on uncertain questions.
References
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:14.206266Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
b7416fee07b9e32e6402efbcccf9cc8c75905c39fdbe93293cc1ef5a6e1101d5
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/W5AW73QHXHRS4ZAC566MZ6OMRR \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: b7416fee07b9e32e6402efbcccf9cc8c75905c39fdbe93293cc1ef5a6e1101d5
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "3f27c4af1e562a931e61026b868ea299ed8a7336b9e159d0dbe738400d973823",
"cross_cats_sorted": [
"cs.LG"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CL",
"submitted_at": "2022-03-21T17:26:29Z",
"title_canon_sha256": "13c59520008b99b844030d7803e45e1caa24dcb0fc66e41169eb41631e273ec5"
},
"schema_version": "1.0",
"source": {
"id": "2203.11147",
"kind": "arxiv",
"version": 1
}
}