pith. the verified trust layer for science. sign in
Pith Number

pith:MUDDTA3E

pith:2023:MUDDTA3EGAO3LWODE3IBSEHJDO
not attested not anchored not stored refs resolved

Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels

Annan Wang, Chaofeng Chen, Chunyi Li, Erli Zhang, Guangtao Zhai, Haoning Wu, Liang Liao, Qiong Yan, Weisi Lin, Weixia Zhang, Wenxiu Sun, Xiongkuo Min, Yixuan Gao, Zicheng Zhang

LMMs achieve better visual scoring by predicting discrete text-defined rating levels instead of numerical scores.

arxiv:2312.17090 v1 · 2023-12-28 · cs.CV · cs.CL · cs.LG

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{MUDDTA3EGAO3LWODE3IBSEHJDO}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

The proposed Q-Align achieves state-of-the-art performance on image quality assessment (IQA), image aesthetic assessment (IAA), as well as video quality assessment (VQA) tasks under the original LMM structure. With the syllabus, we further unify the three tasks into one model, termed the OneAlign.

C2weakest assumption

That training LMMs with discrete text-defined levels emulates human subjective judgment processes more effectively than direct numerical score regression, leading to better performance without architectural changes or extra data.

C3one line summary

Q-Align trains LMMs on discrete text-defined levels for visual scoring, achieving SOTA on IQA, IAA, and VQA while unifying the tasks in OneAlign.

References

293 extracted · 293 resolved · 9 Pith anchors

[1] FirstName LastName , title =
[2] FirstName Alpher , title =
[3] Journal of Foo , volume = 13, number = 1, pages =
[4] Journal of Foo , volume = 14, number = 1, pages =
[5] FirstName Alpher and FirstName Gamow , title =

Formal links

3 machine-checked theorem links

Cited by

25 papers in Pith

Receipt and verification
First computed 2026-05-17T23:38:50.870033Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

6506398364301db5d9c326d01910e91b8e6f1cf78cfa923d524c2e204befdf59

Aliases

arxiv: 2312.17090 · arxiv_version: 2312.17090v1 · doi: 10.48550/arxiv.2312.17090 · pith_short_12: MUDDTA3EGAO3 · pith_short_16: MUDDTA3EGAO3LWOD · pith_short_8: MUDDTA3E
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/MUDDTA3EGAO3LWODE3IBSEHJDO \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 6506398364301db5d9c326d01910e91b8e6f1cf78cfa923d524c2e204befdf59
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "2d275bc65e5e2f8fe7a4b0e5f050b1ca381a4fb351da30dfaac05792e27d2383",
    "cross_cats_sorted": [
      "cs.CL",
      "cs.LG"
    ],
    "license": "http://creativecommons.org/licenses/by-nc-sa/4.0/",
    "primary_cat": "cs.CV",
    "submitted_at": "2023-12-28T16:10:25Z",
    "title_canon_sha256": "c390943399b25c80248b0569a0af41814140229c9ae0d4d47a6ae22dfe712147"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2312.17090",
    "kind": "arxiv",
    "version": 1
  }
}