pith:6UG7QNNZ
GPTScore: Evaluate as You Desire
GPTScore uses zero-shot prompting of generative models ranging from 80M to 175B parameters to evaluate text according to arbitrary natural language criteria, tested on 4 tasks, 22 aspects, and 37 datasets.
arxiv:2302.04166 v2 · 2023-02-08 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{6UG7QNNZDYOU574GK653U7DRKD}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Experimental results on four text generation tasks, 22 evaluation aspects, and corresponding 37 datasets demonstrate that this approach can effectively allow us to achieve what one desires to evaluate for texts simply by natural language instructions.
This rests on the assumption that the emergent zero-shot instruction-following abilities of the tested pre-trained models can produce scores that meaningfully reflect the desired evaluation criteria without task-specific fine-tuning or annotated samples.
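The claims above describe scoring text by prompting a generative model with a natural-language criterion. A minimal sketch of one way such a score could be computed is given below, assuming the score is a weighted average of the per-token log-probabilities of the candidate text, conditioned on an instruction prompt. The template in `build_prompt`, the function names, and the log-probability values are all hypothetical; this record does not confirm the exact formula or prompt wording used in the paper, and a real evaluator would obtain the log-probabilities from a language model.

```python
import math
from typing import Optional, Sequence


def build_prompt(instruction: str, source: str) -> str:
    """Hypothetical zero-shot template: an aspect instruction followed by the source text."""
    return f"{instruction}\n\nText: {source}\n\nOutput:"


def weighted_logprob_score(
    token_logprobs: Sequence[float],
    weights: Optional[Sequence[float]] = None,
) -> float:
    """Weighted average of token log-probabilities (assumed scoring rule).

    token_logprobs[t] stands for log p(token_t | earlier tokens, prompt).
    Uniform weights are used when none are given (an assumption).
    """
    if not token_logprobs:
        raise ValueError("token_logprobs must not be empty")
    if weights is None:
        weights = [1.0] * len(token_logprobs)
    if len(weights) != len(token_logprobs):
        raise ValueError("weights and token_logprobs must have the same length")
    total_weight = sum(weights)
    return sum(w * lp for w, lp in zip(weights, token_logprobs)) / total_weight


# Made-up log-probabilities for two candidate summaries under the same prompt.
prompt = build_prompt("Generate a fluent summary for the following text.", "...")
fluent_candidate = [-0.2, -0.3, -0.1]
clumsy_candidate = [-2.0, -3.0, -2.5]

# A higher (less negative) score means the model finds the candidate more likely.
print(weighted_logprob_score(fluent_candidate) > weighted_logprob_score(clumsy_candidate))
```

Changing the evaluated aspect then only requires changing the instruction string passed to `build_prompt`, which is the sense in which no fine-tuning or annotated samples are needed.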
Receipt and verification
| Field | Value |
|---|---|
| First computed | 2026-05-17T23:38:13.551525Z |
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) |
| Schema | pith-number/v1.0 |
Canonical hash
f50df835b91e1d4eff8657bbba7c7150d4fba63d1d8e0aa443e3eb3899ff1c48
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/6UG7QNNZDYOU574GK653U7DRKD \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: f50df835b91e1d4eff8657bbba7c7150d4fba63d1d8e0aa443e3eb3899ff1c48
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "5a087b62a0a246edf049a858a2bb51dfda02d2af9147f8f2f29c0ca602acb3ec",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2023-02-08T16:17:29Z",
    "title_canon_sha256": "fbe6d4804eed2dc343a3d0d23df63ede9cf07092a9013bf8f85857fc3b06ba7f"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2302.04166",
    "kind": "arxiv",
    "version": 2
  }
}
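The same check can be done offline against the record printed above. The sketch below mirrors the serialization used in the shell one-liner (sorted keys, compact separators, `ensure_ascii=False`, UTF-8 bytes) and compares the digest with the canonical hash listed on this page. The function name `canonical_sha256` is an illustrative choice, and the comparison should only be expected to succeed if the record shown here is exactly the one the service hashed.

```python
import hashlib
import json


def canonical_sha256(record: dict) -> str:
    """Serialize with sorted keys and compact separators, then SHA-256 the UTF-8 bytes."""
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    ).encode()  # str.encode() defaults to UTF-8
    return hashlib.sha256(canonical).hexdigest()


# Canonical record copied from this page.
record = {
    "metadata": {
        "abstract_canon_sha256": "5a087b62a0a246edf049a858a2bb51dfda02d2af9147f8f2f29c0ca602acb3ec",
        "cross_cats_sorted": [],
        "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
        "primary_cat": "cs.CL",
        "submitted_at": "2023-02-08T16:17:29Z",
        "title_canon_sha256": "fbe6d4804eed2dc343a3d0d23df63ede9cf07092a9013bf8f85857fc3b06ba7f",
    },
    "schema_version": "1.0",
    "source": {"id": "2302.04166", "kind": "arxiv", "version": 2},
}

# Canonical hash as listed on this page.
expected = "f50df835b91e1d4eff8657bbba7c7150d4fba63d1d8e0aa443e3eb3899ff1c48"

digest = canonical_sha256(record)
print(digest)
print("match" if digest == expected else "mismatch")
```

Because keys are sorted before hashing, the order in which the fields are typed or received does not affect the digest; only the key names and values do.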