pith:PREMJTZC
Demystifying CLIP Data
MetaCLIP balances CommonCrawl image-text pairs using CLIP-derived metadata to exceed original CLIP performance on zero-shot benchmarks.
arxiv:2309.16671 v6 · 2023-09-28 · cs.CV · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{PREMJTZC4J7RNK4IHH6M6UBNEE}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
MetaCLIP applied to CommonCrawl with 400M image-text data pairs outperforms CLIP's data on multiple standard benchmarks. In zero-shot ImageNet classification, MetaCLIP achieves 70.8% accuracy, surpassing CLIP's 68.3% on ViT-B models. Scaling to 1B data attains 72.4%.
These claims rest on the assumption that metadata derived from CLIP's own concepts is sufficient to capture the key distributional properties that made CLIP data effective, and that explicit balancing over this metadata, rather than other unmeasured factors in the raw pool, is the primary driver of the observed gains.
MetaCLIP curates balanced 400M-pair subsets from CommonCrawl that outperform CLIP data, reaching 70.8% zero-shot ImageNet accuracy on ViT-B versus CLIP's 68.3%.
Receipt and verification
| First computed | 2026-05-17T23:38:48.378631Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) |
| Schema | pith-number/v1.0 |
Canonical hash
7c48c4cf22e27f16ab8839fccf502d21084bf52b5499072da4555157a99911e5
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/PREMJTZC4J7RNK4IHH6M6UBNEE \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 7c48c4cf22e27f16ab8839fccf502d21084bf52b5499072da4555157a99911e5
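The same check can be sketched in Python without `curl` or `jq`. This is a minimal sketch, assuming the endpoint returns JSON with a `canonical_record` field, as the command above implies. The function names `canonical_hash`, `fetch_canonical_record`, and `verify` are hypothetical and not part of any Pith client library.

```python
import hashlib
import json
import urllib.request

# URL and expected hash are copied from the shell command above.
PITH_URL = "https://pith.science/pith/PREMJTZC4J7RNK4IHH6M6UBNEE"
EXPECTED = "7c48c4cf22e27f16ab8839fccf502d21084bf52b5499072da4555157a99911e5"


def canonical_hash(record):
    """Return the SHA-256 hex digest of a record serialized as compact,
    key-sorted JSON (the same serialization the shell one-liner uses)."""
    payload = json.dumps(
        record,
        sort_keys=True,
        separators=(",", ":"),
        ensure_ascii=False,
    ).encode()  # UTF-8 by default
    return hashlib.sha256(payload).hexdigest()


def fetch_canonical_record(url=PITH_URL):
    """Fetch the record and return its `canonical_record` field.
    Assumes the response shape implied by the jq filter above."""
    request = urllib.request.Request(
        url, headers={"Accept": "application/ld+json"}
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)["canonical_record"]


def verify(url=PITH_URL, expected=EXPECTED):
    """Return True if the fetched record hashes to the expected value."""
    return canonical_hash(fetch_canonical_record(url)) == expected
```

Calling `verify()` needs network access. `canonical_hash` can also be applied offline to a locally saved copy of the canonical record, since sorting keys and removing whitespace make the digest independent of how the JSON was pretty-printed.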
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "ebf6224ad45b03c9907c6c2a98803e7c9116e2a50f09fa513efa5dcd022d1323",
    "cross_cats_sorted": [
      "cs.CL"
    ],
    "license": "http://creativecommons.org/licenses/by-nc-sa/4.0/",
    "primary_cat": "cs.CV",
    "submitted_at": "2023-09-28T17:59:56Z",
    "title_canon_sha256": "77371d4b9df8c37f41b4553938b8a1d5762fff9a02903ac4b560ddeca04e5b06"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2309.16671",
    "kind": "arxiv",
    "version": 6
  }
}