pith:Y5HR3MRX
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
An ensemble of stage-specialized diffusion models improves text alignment in image synthesis at the same inference cost.
arxiv:2211.01324 v5 · 2022-11-02 · cs.CV · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{Y5HR3MRXMK6FSX6PMAM4S5Z6Q4}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Our ensemble of diffusion models, called eDiff-I, results in improved text alignment while maintaining the same inference computation cost and preserving high visual quality, outperforming previous large-scale text-to-image diffusion models on the standard benchmark.
The synthesis behavior qualitatively changes throughout the generation process such that early stages rely on text conditioning while later stages largely ignore it, making a single shared-parameter model suboptimal.
An ensemble of stage-specialized text-to-image diffusion models improves prompt alignment over single shared-parameter models while preserving visual quality and inference speed.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:39:05.095621Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
c74f1db23762bc595fcf6019c9773e8727f464891dc08adab64526d371aebe71
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/Y5HR3MRXMK6FSX6PMAM4S5Z6Q4 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: c74f1db23762bc595fcf6019c9773e8727f464891dc08adab64526d371aebe71
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "6ba932880558e507ebb983d38e510f4e566f11bba98dcd82cf7a99666372929f",
"cross_cats_sorted": [
"cs.LG"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CV",
"submitted_at": "2022-11-02T17:43:04Z",
"title_canon_sha256": "e9f8d1e62e63e995eafd74d40a92cab36ae1e21fca1d6f3683a0e8e5ac675eea"
},
"schema_version": "1.0",
"source": {
"id": "2211.01324",
"kind": "arxiv",
"version": 5
}
}