pith:W5HOEO55
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Sana-0.6B generates high-resolution images of quality competitive with 12B-parameter models while running over 100 times faster on consumer GPUs.
arxiv:2410.10629 v3 · 2024-10-14 · cs.CV
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{W5HOEO55XSCA5OT2L5X5JMLG7D}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Sana-0.6B is very competitive with modern giant diffusion models (e.g. Flux-12B), being 20 times smaller and 100+ times faster in measured throughput.
The 32x deep-compression autoencoder preserves sufficient perceptual quality and text alignment at high resolutions without introducing artifacts that linear attention cannot correct.
Sana-0.6B produces high-resolution images with strong text alignment at 20x smaller size and 100x higher throughput than Flux-12B by combining 32x image compression, linear DiT blocks, and a decoder-only LLM text encoder.
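To make the compression claim concrete, the sketch below counts how many latent tokens a transformer would see at a given resolution. It assumes that "32x" means a per-side spatial downsampling factor with patch size 1, and it compares against an 8x autoencoder with patch size 2, a common latent-diffusion setup chosen here only as a reference point. The helper name `latent_tokens` is hypothetical.

```python
def latent_tokens(image_side: int, downsample: int, patch: int = 1) -> int:
    """Number of tokens for a square image after autoencoder downsampling and patchifying.

    Assumption: `downsample` is the per-side spatial compression factor.
    """
    stride = downsample * patch
    if image_side % stride != 0:
        raise ValueError("image_side must be divisible by downsample * patch")
    side = image_side // stride
    return side * side


for image_side in (1024, 4096):
    deep = latent_tokens(image_side, downsample=32, patch=1)      # assumed 32x setup
    baseline = latent_tokens(image_side, downsample=8, patch=2)   # assumed 8x reference
    print(f"{image_side}px: 32x -> {deep} tokens, 8x/p2 -> {baseline} tokens, "
          f"ratio {baseline // deep}x")
```

Under these assumptions the deep-compression setup yields a quarter of the tokens at every resolution. Fewer tokens help any attention mechanism, and linear attention additionally scales with the token count rather than its square.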
Receipt and verification
| First computed | 2026-05-17T23:39:05.189950Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
b74ee23bbdbc840eba7a5f6fd4b166f8edfc5df31c739b840f04c5df7b535c9b
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/W5HOEO55XSCA5OT2L5X5JMLG7D \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: b74ee23bbdbc840eba7a5f6fd4b166f8edfc5df31c739b840f04c5df7b535c9b
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "15afb39d277185e4366dd1119febf88c6db3f739196fb284b0c253610968d58f",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by-nc-nd/4.0/",
    "primary_cat": "cs.CV",
    "submitted_at": "2024-10-14T15:36:42Z",
    "title_canon_sha256": "e95928638dc8b30e52d7551e409e10ea8ac631fff31b5419ab2807f3683bd070"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2410.10629",
    "kind": "arxiv",
    "version": 3
  }
}
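The same check can be run offline against the record printed above. The sketch below mirrors the canonicalization in the shell one-liner (sorted keys, compact separators, UTF-8, SHA-256). The function name `canonical_sha256` is illustrative and not part of any Pith tooling. Whether the digest matches the published hash depends on the record shown here being identical to the one the server returns, so the script reports the comparison rather than assuming it.

```python
import hashlib
import json


def canonical_sha256(record: dict) -> str:
    """Hash a record the way the shell one-liner does: sorted keys, no extra whitespace."""
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Record copied from the "Canonical record JSON" section of this page.
RECORD_JSON = """
{
  "metadata": {
    "abstract_canon_sha256": "15afb39d277185e4366dd1119febf88c6db3f739196fb284b0c253610968d58f",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by-nc-nd/4.0/",
    "primary_cat": "cs.CV",
    "submitted_at": "2024-10-14T15:36:42Z",
    "title_canon_sha256": "e95928638dc8b30e52d7551e409e10ea8ac631fff31b5419ab2807f3683bd070"
  },
  "schema_version": "1.0",
  "source": {"id": "2410.10629", "kind": "arxiv", "version": 3}
}
"""
EXPECTED = "b74ee23bbdbc840eba7a5f6fd4b166f8edfc5df31c739b840f04c5df7b535c9b"

record = json.loads(RECORD_JSON)
digest = canonical_sha256(record)
print(digest)
print("matches published hash:", digest == EXPECTED)
```

Because the keys are sorted before hashing, the indentation and key order of the pasted JSON do not affect the digest. Only the values do.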