pith. sign in
Pith Number

pith:6UG7QNNZ

pith:2023:6UG7QNNZDYOU574GK653U7DRKD
not attested not anchored not stored refs pending

GPTScore: Evaluate as You Desire

Jinlan Fu, Pengfei Liu, See-kiong Ng, Zhengbao Jiang

GPTScore uses zero-shot prompting of generative models ranging from 80M to 175B parameters to evaluate text according to arbitrary natural language criteria, tested on 4 tasks, 22 aspects, and 37 datasets.

arxiv:2302.04166 v2 · 2023-02-08 · cs.CL

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{6UG7QNNZDYOU574GK653U7DRKD}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

Experimental results on four text generation tasks, 22 evaluation aspects, and corresponding 37 datasets demonstrate that this approach can effectively allow us to achieve what one desires to evaluate for texts simply by natural language instructions.

C2weakest assumption

That the emergent zero-shot instruction-following abilities of the tested pre-trained models can produce scores that meaningfully reflect the desired evaluation criteria without task-specific fine-tuning or annotated samples.

C3one line summary

GPTScore uses zero-shot prompting of generative models ranging from 80M to 175B parameters to evaluate text according to arbitrary natural language criteria, tested on 4 tasks, 22 aspects, and 37 datasets.

Formal links

2 machine-checked theorem links

Cited by

25 papers in Pith

Receipt and verification
First computed 2026-05-17T23:38:13.551525Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

f50df835b91e1d4eff8657bbba7c7150d4fba63d1d8e0aa443e3eb3899ff1c48

Aliases

arxiv: 2302.04166 · arxiv_version: 2302.04166v2 · doi: 10.48550/arxiv.2302.04166 · pith_short_12: 6UG7QNNZDYOU · pith_short_16: 6UG7QNNZDYOU574G · pith_short_8: 6UG7QNNZ
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/6UG7QNNZDYOU574GK653U7DRKD \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: f50df835b91e1d4eff8657bbba7c7150d4fba63d1d8e0aa443e3eb3899ff1c48
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "5a087b62a0a246edf049a858a2bb51dfda02d2af9147f8f2f29c0ca602acb3ec",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2023-02-08T16:17:29Z",
    "title_canon_sha256": "fbe6d4804eed2dc343a3d0d23df63ede9cf07092a9013bf8f85857fc3b06ba7f"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2302.04166",
    "kind": "arxiv",
    "version": 2
  }
}