Pith Number
pith:YL2LFA3J
pith:2026:YL2LFA3JTFL7ZY3I54A2L72CFP
not attested · not anchored · not stored · refs resolved
Explainable Semantic Textual Similarity via Dissimilar Span Detection
Detecting dissimilar spans between text pairs explains semantic similarity scores and boosts paraphrase detection performance.
arxiv:2603.21174 v1 · 2026-03-22 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{YL2LFA3JTFL7ZY3I54A2L72CFP}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Record completeness
1. Bitcoin timestamp
2. Internet Archive
3. Author claim
4. Citations
5. Replications
✓ Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.
Claims
C1 · strongest claim
Dissimilar Span Detection (DSD) can improve performance on the specific task of paraphrase detection.
C2 · weakest assumption
The semi-automated pipeline, which combines LLMs with human verification, produces reliable and accurate labels for the Span Similarity Dataset.
C3 · one-line summary
Introduces the Dissimilar Span Detection task and the Span Similarity Dataset to explain semantic textual similarity by identifying the spans that differ between text pairs.
Receipt and verification
| Field | Value |
|---|---|
| First computed | 2026-05-17T23:39:04.366399Z |
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
c2f4b283699957fce368ef01a5ff422bff668c55ad776dbee3a10343b3cd9cab
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/YL2LFA3JTFL7ZY3I54A2L72CFP \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: c2f4b283699957fce368ef01a5ff422bff668c55ad776dbee3a10343b3cd9cab
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "2ac48fcefc85196017c93e08d6c23f3d5f20c0bd2e24afc3efa5131a2b273d9c",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by-sa/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-03-22T11:32:31Z",
    "title_canon_sha256": "fbb00213ceeeacade36f54522cfe61669b19980dc471da101dd7a7712c4b328a"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2603.21174",
    "kind": "arxiv",
    "version": 1
  }
}
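Offline check (sketch)
The verification one-liner can also be run without network access against the record printed on this page. The sketch below mirrors the same serialization settings (sorted keys, compact separators, `ensure_ascii=False`, UTF-8 encoding) and compares the digest with the canonical hash listed above. The names `RECORD_JSON`, `EXPECTED`, `canonical_bytes`, and `canonical_hash` are illustrative and not part of any Pith tooling. It also assumes the record shown here is byte-for-byte what the API returns under `.canonical_record`, so a mismatch may only mean the page copy differs.

```python
import hashlib
import json

# Canonical record copied from this page (assumed identical to the API's
# `.canonical_record` field; that equivalence is not verified here).
RECORD_JSON = """
{
  "metadata": {
    "abstract_canon_sha256": "2ac48fcefc85196017c93e08d6c23f3d5f20c0bd2e24afc3efa5131a2b273d9c",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by-sa/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2026-03-22T11:32:31Z",
    "title_canon_sha256": "fbb00213ceeeacade36f54522cfe61669b19980dc471da101dd7a7712c4b328a"
  },
  "schema_version": "1.0",
  "source": {"id": "2603.21174", "kind": "arxiv", "version": 1}
}
"""

# Canonical hash as listed on this page.
EXPECTED = "c2f4b283699957fce368ef01a5ff422bff668c55ad776dbee3a10343b3cd9cab"


def canonical_bytes(record: dict) -> bytes:
    """Serialize with the same settings as the shell one-liner."""
    text = json.dumps(
        record,
        sort_keys=True,          # key order must not affect the hash
        separators=(",", ":"),   # no whitespace between tokens
        ensure_ascii=False,      # keep non-ASCII characters unescaped
    )
    return text.encode()         # UTF-8 by default


def canonical_hash(record: dict) -> str:
    """Return the SHA-256 hex digest of the canonical serialization."""
    return hashlib.sha256(canonical_bytes(record)).hexdigest()


if __name__ == "__main__":
    digest = canonical_hash(json.loads(RECORD_JSON))
    print(digest)
    print("match" if digest == EXPECTED else "mismatch")
```

Because keys are sorted before hashing, the whitespace and key order of the JSON as displayed do not influence the digest; only the keys and values do.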