Pith Number

pith:IZ5WUD72

pith:2026:IZ5WUD723UJZYHVNKZH5KMMKE3
not attested · not anchored · not stored · refs pending

Evaluating the Evaluator: Problems with SemEval-2020 Task 1 for Lexical Semantic Change Detection

Bach Phan-Tat, Dirk Geeraerts, Dirk Speelman, Kris Heylen, Stefano De Pascale

SemEval-2020 Task 1 for lexical semantic change detection has narrow definitions of change, corpus preprocessing errors, and limited target sets that make it a partial rather than definitive benchmark.

arxiv:2604.13232 v2 · 2026-04-14 · cs.CL

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{IZ5WUD723UJZYHVNKZH5KMMKE3}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.

Record completeness

1. Bitcoin timestamp
2. Internet Archive
3. Author claim: open
4. Citations: open
5. Replications: open
Portable graph bundle: live
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1: strongest claim

Taken together, these limitations suggest that the benchmark should be treated as a useful but partial test bed rather than a definitive measure of progress.

C2: weakest assumption

That the listed corpus and preprocessing problems (OCR noise, malformed characters, truncated sentences, inconsistent lemmatisation, POS-tagging errors, missed targets) substantially distort model behaviour, complicate linguistic analysis, and reduce reproducibility.

C3: one-line summary

The SemEval-2020 Task 1 benchmark for lexical semantic change detection is limited by a narrow sense-based definition of change, substantial corpus and preprocessing errors, and small curated target sets that reduce realism.

Receipt and verification
First computed: 2026-05-28T01:04:40.066161Z
Builder: pith-number-builder-2026-05-17-v1
Signature: Pith Ed25519 (pith-v1-2026-05)
Schema: pith-number/v1.0

Canonical hash

467b6a0ffadd139c1ead564fd5318a26e07c2f3dbe36350eadc5ff5c4cf7e178

Aliases

arxiv: 2604.13232 · arxiv_version: 2604.13232v2 · doi: 10.48550/arxiv.2604.13232 · pith_short_12: IZ5WUD723UJZ · pith_short_16: IZ5WUD723UJZYHVN · pith_short_8: IZ5WUD72
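The short aliases listed here are prefixes of the full 26-character Pith Number. In this particular record, the full number also coincides with the first 26 characters of the RFC 4648 base32 encoding of the canonical hash. That second relation is an observation about this one record, not a documented rule of the scheme, so treat the sketch below as a consistency check under that assumption. The helper names `short_alias` and `base32_prefix` are hypothetical.

```python
import base64

# Values copied from this record.
CANONICAL_HASH = "467b6a0ffadd139c1ead564fd5318a26e07c2f3dbe36350eadc5ff5c4cf7e178"
PITH_NUMBER = "IZ5WUD723UJZYHVNKZH5KMMKE3"
ALIASES = {
    "pith_short_8": "IZ5WUD72",
    "pith_short_12": "IZ5WUD723UJZ",
    "pith_short_16": "IZ5WUD723UJZYHVN",
}


def short_alias(pith_number: str, length: int) -> str:
    """Return the first `length` characters of a Pith Number."""
    return pith_number[:length]


def base32_prefix(hex_digest: str, length: int = 26) -> str:
    """Base32-encode a hex digest and keep the first `length` characters.

    Assumption: that this matches the Pith Number is inferred from this
    record only; the actual derivation is not stated on the page.
    """
    encoded = base64.b32encode(bytes.fromhex(hex_digest)).decode("ascii")
    return encoded[:length]


# Each alias name ends in its length, e.g. "pith_short_12" -> 12.
alias_ok = all(
    short_alias(PITH_NUMBER, int(name.rsplit("_", 1)[1])) == value
    for name, value in ALIASES.items()
)
matches_hash = base32_prefix(CANONICAL_HASH) == PITH_NUMBER
print(alias_ok, matches_hash)
```

If the base32 relation holds in general, a client could derive the identifier from the hash alone, but that should be confirmed against the schema before relying on it.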
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/IZ5WUD723UJZYHVNKZH5KMMKE3 \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 467b6a0ffadd139c1ead564fd5318a26e07c2f3dbe36350eadc5ff5c4cf7e178
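The same check can be written as a small Python module, which may be easier to reuse than the shell pipeline. This is a minimal sketch that mirrors the serialization settings of the one-liner above (sorted keys, compact separators, `ensure_ascii=False`, UTF-8 bytes). The function names `canonical_bytes`, `canonical_sha256`, and `verify` are hypothetical, and fetching the record over HTTP is left out so the block runs offline.

```python
import hashlib
import json


def canonical_bytes(record: dict) -> bytes:
    """Serialize a record the way the shell one-liner does:
    sorted keys, no whitespace between tokens, non-ASCII kept as-is."""
    text = json.dumps(
        record,
        sort_keys=True,
        separators=(",", ":"),
        ensure_ascii=False,
    )
    return text.encode("utf-8")


def canonical_sha256(record: dict) -> str:
    """Hex SHA-256 digest of the canonical serialization."""
    return hashlib.sha256(canonical_bytes(record)).hexdigest()


def verify(record: dict, expected_hex: str) -> bool:
    """Compare a recomputed digest with the published canonical hash."""
    return canonical_sha256(record) == expected_hex.lower()


# Key order in the input does not affect the result.
a = {"schema_version": "1.0", "source": {"kind": "arxiv", "version": 2}}
b = {"source": {"version": 2, "kind": "arxiv"}, "schema_version": "1.0"}
print(canonical_sha256(a) == canonical_sha256(b))
```

To check this record, pass the parsed `canonical_record` object and the published canonical hash to `verify`.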
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "818417c178bc70c93a0c9c434c5071f97abd289d8b567722eaeafb8af2ab8db8",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-04-14T19:01:25Z",
    "title_canon_sha256": "cf7530a3a7ba7f8e577050b2b5d5d6f841ff0018cbed7df1657c0aed2bfc7ce9"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2604.13232",
    "kind": "arxiv",
    "version": 2
  }
}