pith. sign in
Pith Number

pith:2HMSE7KX

pith:2026:2HMSE7KXIKBAXSSDT6UAQQB4EZ
not attested not anchored not stored refs pending

SynCABEL: Synthetic Contextualized Augmentation for Biomedical Entity Linking

Adam Remaki, Christel G\'erardin, Eul\`alia Farr\'e-Maduell, Martin Krallinger, Xavier Tannier

SynCABEL generates synthetic training examples with large language models to overcome data scarcity in biomedical entity linking and reaches new state-of-the-art results on three multilingual benchmarks with up to 60 percent less human-anno

arxiv:2601.19667 v2 · 2026-01-27 · cs.CL · cs.AI · cs.IR · cs.LG

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{2HMSE7KXIKBAXSSDT6UAQQB4EZ}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

SynCABEL, when combined with decoder-only models and guided inference, establishes new state-of-the-art results across three widely used multilingual benchmarks: MedMentions for English, QUAERO for French, and SPACCC for Spanish.

C2weakest assumption

The assumption that LLM-generated synthetic examples are sufficiently representative of real biomedical text distributions and do not introduce systematic biases or hallucinations that would degrade downstream linking performance.

C3one line summary

SynCABEL generates LLM-based synthetic data for all candidate concepts in biomedical entity linking, reaching new SOTA results on MedMentions, QUAERO, and SPACCC with up to 60% less human-annotated data.

Cited by

1 paper in Pith

Receipt and verification
First computed 2026-05-18T02:44:31.836974Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

d1d9227d5742820bca439fa808403c266343c5744b6176908579f1609a30cc2f

Aliases

arxiv: 2601.19667 · arxiv_version: 2601.19667v2 · doi: 10.48550/arxiv.2601.19667 · pith_short_12: 2HMSE7KXIKBA · pith_short_16: 2HMSE7KXIKBAXSSD · pith_short_8: 2HMSE7KX
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/2HMSE7KXIKBAXSSDT6UAQQB4EZ \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: d1d9227d5742820bca439fa808403c266343c5744b6176908579f1609a30cc2f
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "4307d55b4a2750067ba882e7ba207ed054dd71bffb1b002bec46141bf31fef43",
    "cross_cats_sorted": [
      "cs.AI",
      "cs.IR",
      "cs.LG"
    ],
    "license": "http://creativecommons.org/licenses/by-sa/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-01-27T14:47:17Z",
    "title_canon_sha256": "14d5b3f61f35d292638e87b1c4ce8066733d21fb1ea186fae240d1bd6f2ebd37"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2601.19667",
    "kind": "arxiv",
    "version": 2
  }
}