pith:FPDZ72QB
Directly Fine-Tuning Diffusion Models on Differentiable Rewards
Diffusion models can be fine-tuned directly on differentiable rewards by backpropagating gradients through the full sampling process.
arxiv:2309.17400 v2 · 2023-09-29 · cs.CV · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{FPDZ72QBZQE26KWERZH6FTUZ2N}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
it is possible to backpropagate the reward function gradient through the full sampling procedure, and that doing so achieves strong performance on a variety of rewards, outperforming reinforcement learning-based approaches.
The reward function must be differentiable with respect to the generated samples, and the sampling process must allow stable gradient flow without excessive variance or memory issues.
DRaFT fine-tunes diffusion models by differentiating through sampling to maximize rewards, outperforming RL baselines and improving aesthetics on Stable Diffusion 1.4.
References
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:48.404088Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
2bc79fea01cc09af2ac48e4fe2ce99d35659314a765873ddd405927a2aa38e2e
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/FPDZ72QBZQE26KWERZH6FTUZ2N \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 2bc79fea01cc09af2ac48e4fe2ce99d35659314a765873ddd405927a2aa38e2e
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "587acc25088d331ade6829017f847a4b43d8971b06be5caf41cd8c2f78dc6469",
"cross_cats_sorted": [
"cs.LG"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CV",
"submitted_at": "2023-09-29T17:01:02Z",
"title_canon_sha256": "2991c385ae42b382a118f4a82555f5882ffacc8d8a6358214db6212e6f3ac194"
},
"schema_version": "1.0",
"source": {
"id": "2309.17400",
"kind": "arxiv",
"version": 2
}
}