pith:AWCZRZQ4
Unified Reward Model for Multimodal Understanding and Generation
A single reward model trained jointly on image and video tasks improves preference alignment for both understanding and generation.
arxiv:2503.05236 v2 · 2025-03-07 · cs.CV
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{AWCZRZQ4QCF2K4O2MUR2KYFJ4P}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
jointly learning to assess diverse visual tasks yields substantial mutual benefits... achieving consistent improvements across each domain.
The large-scale human preference dataset accurately represents human judgments across tasks and the two-stage filtering strategy produces high-quality, unbiased preference pairs without introducing selection artifacts.
UnifiedReward is the first unified reward model that jointly assesses multimodal understanding and generation to provide better preference signals for aligning vision models via DPO.
Receipt and verification
| First computed | 2026-05-18T03:15:18.210251Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
058598e61c808ba571da6523a560a9e3c7a30acc1708721a83ab8864b033787e
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/AWCZRZQ4QCF2K4O2MUR2KYFJ4P \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 058598e61c808ba571da6523a560a9e3c7a30acc1708721a83ab8864b033787e
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "61deb671ab72436b4fe9934b639cef553965b827b86a5f6c7be13ed84a2aa7c3",
"cross_cats_sorted": [],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CV",
"submitted_at": "2025-03-07T08:36:05Z",
"title_canon_sha256": "2aedac95cbbedd1322221f81bf6fbba2b5fd14fda6c12942a05f73b48664f4f8"
},
"schema_version": "1.0",
"source": {
"id": "2503.05236",
"kind": "arxiv",
"version": 2
}
}
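
The verification one-liner above can also be written as a short Python script. This is a minimal sketch, assuming the canonical hash is the SHA-256 of the record serialized with sorted keys, compact separators, and UTF-8 encoding, as the shell command suggests. The names `record`, `EXPECTED`, and `canonical_hash` are illustrative and not part of any Pith API. The record is copied from this page rather than fetched, so the final comparison only shows whether this copy reproduces the published hash.

```python
import hashlib
import json

# Canonical record as shown on this page (copied by hand, not fetched).
record = {
    "metadata": {
        "abstract_canon_sha256": "61deb671ab72436b4fe9934b639cef553965b827b86a5f6c7be13ed84a2aa7c3",
        "cross_cats_sorted": [],
        "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
        "primary_cat": "cs.CV",
        "submitted_at": "2025-03-07T08:36:05Z",
        "title_canon_sha256": "2aedac95cbbedd1322221f81bf6fbba2b5fd14fda6c12942a05f73b48664f4f8",
    },
    "schema_version": "1.0",
    "source": {"id": "2503.05236", "kind": "arxiv", "version": 2},
}

# Canonical hash published on this page.
EXPECTED = "058598e61c808ba571da6523a560a9e3c7a30acc1708721a83ab8864b033787e"


def canonical_hash(rec: dict) -> str:
    """Serialize with sorted keys and compact separators, then SHA-256 the UTF-8 bytes.

    Mirrors the json.dumps arguments used in the shell one-liner; whether this
    matches the builder's exact canonicalization is an assumption.
    """
    payload = json.dumps(
        rec, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


digest = canonical_hash(record)
print("computed:", digest)
print("matches published hash:", digest == EXPECTED)
```

Because keys are sorted before hashing, the order in which fields appear in the JSON does not affect the digest. Only a change to a key or a value does.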