pith. sign in
Pith Number

pith:VZP4VYJK

pith:2026:VZP4VYJKSJMZ7ZHSSTAYQQ6ZLX
not attested not anchored not stored refs pending

AllSERP: Exhaustive Per-Element Enrichment of the Versatile AdSERP Dataset

K. Andrew Edmonds

AllSERP enriches the AdSERP dataset with pixel-accurate bounding boxes and semantic types for every SERP element.

arxiv:2605.04949 v2 · 2026-05-06 · cs.IR

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{VZP4VYJKSJMZ7ZHSSTAYQQ6ZLX}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

AllSERP adds pixel-accurate organic and widget bboxes via screenshot-anchored CV, semantic types across thirteen element types via an HTML parser, an inter-result gap-fill flavor (typed_gapfill), and X+Y click attribution that reaches 91.7 % of the corpus while flagging the rest at trial level. The Phase C ad-vs-non-ad partition is internally consistent with the shipped ad rectangles (0 disagreements across 38,250 classifications).

C2weakest assumption

The computer vision pipeline for extracting bounding boxes from screenshots and the HTML parser for semantic typing produce accurate results without substantial errors or missed elements, as no independent ground-truth validation or error metrics beyond internal ad consistency are described.

C3one line summary

AllSERP enriches the AdSERP SERP corpus with per-element bounding boxes, semantic types, typed gap-fill, and 91.7% click attribution via CV and HTML parsing, with full pipeline and artifacts shipped.

Receipt and verification
First computed 2026-05-20T01:05:15.356717Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

ae5fcae12a92599fe4f294c18843d95de6543a27a9e95bbe86387d6c36dda2d3

Aliases

arxiv: 2605.04949 · arxiv_version: 2605.04949v2 · doi: 10.48550/arxiv.2605.04949 · pith_short_12: VZP4VYJKSJMZ · pith_short_16: VZP4VYJKSJMZ7ZHS · pith_short_8: VZP4VYJK
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/VZP4VYJKSJMZ7ZHSSTAYQQ6ZLX \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: ae5fcae12a92599fe4f294c18843d95de6543a27a9e95bbe86387d6c36dda2d3
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "c6be80545ba9130ae9084c603ec30bba10514d2526c86580eb2ac3fa41ab605b",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.IR",
    "submitted_at": "2026-05-06T14:14:35Z",
    "title_canon_sha256": "ee52c958a706b9e173ed214519b6a4595afbaa8eb23a245cf6ad6990f67c89bc"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.04949",
    "kind": "arxiv",
    "version": 2
  }
}