pith:FPJIHIJK
Ranking-Aware Calibration for Reliable Multimodal Reinforcement Learning
Ranking signals from group-based RL can supervise confidence to improve calibration in vision-language models.
arxiv:2605.16999 v1 · 2026-05-16 · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{FPJIHIJKYXM2T7FECBVCEI2NXG}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Combining the ranking-aware group loss with the clean-corrupted pairwise loss achieves the best calibration across all tested backbones while improving accuracy in the majority of settings.
Underlying assumption: the ranking signals already produced by group-based RL directly reflect reasoning quality and can supervise confidence without introducing new biases or requiring validation against external correctness measures.
RAC (Ranking-Aware Calibration) adds a ranking-aware group loss and a clean-corrupted pairwise loss to RL post-training to improve both accuracy and calibration in multimodal reasoning without extra annotations.
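To make the two loss terms named above concrete, here is a minimal sketch of what a ranking-aware group loss and a clean-corrupted pairwise loss could look like. It is an illustration under stated assumptions, not the paper's formulation: the logistic pairwise form, the hinge form, the `margin` value, and the function names `group_ranking_loss` and `clean_corrupted_loss` are all hypothetical.

```python
import math
from itertools import combinations


def group_ranking_loss(rewards, confidences):
    """Assumed form: pairwise logistic loss over one sampled group.

    A response with a higher reward should receive a higher confidence.
    """
    total, pairs = 0.0, 0
    for i, j in combinations(range(len(rewards)), 2):
        if rewards[i] == rewards[j]:
            continue  # tied rewards carry no ranking signal
        hi, lo = (i, j) if rewards[i] > rewards[j] else (j, i)
        # softplus(-(c_hi - c_lo)): small when the ordering is respected
        total += math.log1p(math.exp(-(confidences[hi] - confidences[lo])))
        pairs += 1
    return total / pairs if pairs else 0.0


def clean_corrupted_loss(conf_clean, conf_corrupted, margin=0.1):
    """Assumed form: hinge loss asking for higher confidence on the clean input."""
    return max(0.0, margin - (conf_clean - conf_corrupted))


# Hypothetical group of four responses with binary rewards.
rewards = [1.0, 0.0, 1.0, 0.0]
aligned = [0.9, 0.2, 0.8, 0.1]     # confidence follows the reward ranking
misaligned = [0.1, 0.8, 0.2, 0.9]  # confidence contradicts the reward ranking

print(group_ranking_loss(rewards, aligned) < group_ranking_loss(rewards, misaligned))
print(clean_corrupted_loss(0.9, 0.3))
```

Both terms use only signals that group-based RL already produces (per-response rewards and model confidences), which is consistent with the claim that no extra annotations are needed.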
Receipt and verification
| Field | Value |
|---|---|
| First computed | 2026-05-20T00:03:35.254348Z |
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
2bd283a12ac5d9a9fca4106a22234db98cfdb9f2c082b0574f71a469333f5381
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/FPJIHIJKYXM2T7FECBVCEI2NXG \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 2bd283a12ac5d9a9fca4106a22234db98cfdb9f2c082b0574f71a469333f5381
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "eb0e4e4bdacd9da5a7a856c499f1db0b6bd86661fdeef57d58e75cf673cff972",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-05-16T13:51:29Z",
    "title_canon_sha256": "b2feb1e35fc4fa7b0e1e2be760927bb4c072e5a54fd9b4284f5a4543a178af59"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.16999",
    "kind": "arxiv",
    "version": 1
  }
}
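The Python step of the verification one-liner can be unpacked into a small offline script. The sketch below applies the same serialization (sorted keys, compact separators, no ASCII escaping, UTF-8) to the canonical record shown above; the helper name `canonical_sha256` is hypothetical. It assumes the record as rendered on this page matches what the API returns, so compare the printed digest with the canonical hash listed above rather than taking the match for granted.

```python
import hashlib
import json


def canonical_sha256(record):
    """Serialize with sorted keys and compact separators, then SHA-256 the UTF-8 bytes."""
    payload = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    ).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


# Canonical record copied from this page.
record = {
    "metadata": {
        "abstract_canon_sha256": "eb0e4e4bdacd9da5a7a856c499f1db0b6bd86661fdeef57d58e75cf673cff972",
        "cross_cats_sorted": [],
        "license": "http://creativecommons.org/licenses/by/4.0/",
        "primary_cat": "cs.LG",
        "submitted_at": "2026-05-16T13:51:29Z",
        "title_canon_sha256": "b2feb1e35fc4fa7b0e1e2be760927bb4c072e5a54fd9b4284f5a4543a178af59",
    },
    "schema_version": "1.0",
    "source": {"id": "2605.16999", "kind": "arxiv", "version": 1},
}

digest = canonical_sha256(record)
print(digest)  # compare with the canonical hash listed on this page
```

Because `sort_keys=True` fixes the key order before hashing, the digest does not depend on the order in which the JSON keys were received.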