pith:KZGL6CQJ
FP8 Formats for Deep Learning
FP8 with E4M3 and E5M2 encodings matches 16-bit training accuracy on large language and image models without hyperparameter changes.
arxiv:2209.05433 v2 · 2022-09-12 · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{KZGL6CQJWFLXZEDBB2SHHCPXD5}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
We demonstrate the efficacy of the FP8 format on a variety of image and language tasks, effectively matching the result quality achieved by 16-bit training sessions. Our study covers the main modern neural network architectures - CNNs, RNNs, and Transformer-based models, leaving all the hyperparameters unchanged from the 16-bit baseline training sessions. Our training experiments include large, up to 175B parameter, language models.
That the chosen E4M3 and E5M2 encodings will preserve accuracy across all tasks and model scales without any hyperparameter retuning or task-specific adjustments.
FP8 formats E4M3 and E5M2 match 16-bit training accuracy on CNNs, RNNs, and Transformers up to 175B parameters without hyperparameter changes.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:52.855387Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
564cbf0a09b1577c90610ea47389f71f657f9815920159d0397f4be07f509302
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/KZGL6CQJWFLXZEDBB2SHHCPXD5 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 564cbf0a09b1577c90610ea47389f71f657f9815920159d0397f4be07f509302
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "c520978422b7a453110ed090b9d0c9b9786712d389b5a932b2e5184ded23748b",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.LG",
"submitted_at": "2022-09-12T17:39:55Z",
"title_canon_sha256": "a787abcfcac8027bbbf57bf19b5a9d2496b30a55d097c56c0d787ecb293565af"
},
"schema_version": "1.0",
"source": {
"id": "2209.05433",
"kind": "arxiv",
"version": 2
}
}