pith:WZ34WRPF
GME: Improving Universal Multimodal Retrieval by Multimodal LLMs
Training an MLLM on synthetically balanced fused text-image data produces a single dense retriever that leads on universal multimodal search tasks.
arxiv:2412.16855 v2 · 2024-12-22 · cs.CL · cs.IR
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{WZ34WRPFD24Y2BKDMFRBPVSYWP}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Experimental results show that our method achieves state-of-the-art performance among existing UMR methods.
That the synthetic fused-modal training dataset is of high quality and sufficiently diverse to unlock the full potential of MLLMs for universal multimodal retrieval without introducing biases or artifacts.
GME achieves state-of-the-art results in universal multimodal retrieval by training on a balanced synthetic multimodal dataset.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:53.247396Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
b677cb45e51eb98d0543616217d658b3dd0c9b77e6a47833ac9b256373655b97
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/WZ34WRPFD24Y2BKDMFRBPVSYWP \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: b677cb45e51eb98d0543616217d658b3dd0c9b77e6a47833ac9b256373655b97
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "748847fa281d19d5e0a56770ec05a9bba758fa4e7ad976dc2000ca0e11f1851a",
"cross_cats_sorted": [
"cs.IR"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CL",
"submitted_at": "2024-12-22T04:40:24Z",
"title_canon_sha256": "ea6af16b54e7eb7912e00c33bd2c62ddb3a35dd38ab597b5206a3f4fbb5d0b62"
},
"schema_version": "1.0",
"source": {
"id": "2412.16855",
"kind": "arxiv",
"version": 2
}
}