pith:CTVKNTPZ
CogVLM: Visual Expert for Pretrained Language Models
A trainable visual expert module inserted into the attention and FFN layers of a frozen language model enables deep vision-language fusion.
arxiv:2311.03079 v2 · 2023-11-06 · cs.CV
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{CTVKNTPZG7G2XIRXBJVDSVT54A}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
CogVLM-17B achieves state-of-the-art performance on 10 classic cross-modal benchmarks... surpassing or matching PaLI-X 55B.
The visual expert module can be inserted into the attention and FFN layers of any frozen pretrained language model without requiring changes to the original architecture or loss functions.
CogVLM adds a trainable visual expert inside frozen language model layers for deep vision-language fusion and reports state-of-the-art results on ten cross-modal benchmarks while preserving NLP performance.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:51.021764Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
14eaa6cdf937cdaba2370a6a39567de015ee54eca0c505143d4d420dfa34f0e5
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/CTVKNTPZG7G2XIRXBJVDSVT54A \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 14eaa6cdf937cdaba2370a6a39567de015ee54eca0c505143d4d420dfa34f0e5
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "9ed531cb4a2ee62bd4512e8535ec68ef02bf4f67385e61fe8e221a00b5f126b6",
"cross_cats_sorted": [],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CV",
"submitted_at": "2023-11-06T13:04:39Z",
"title_canon_sha256": "679fe85268225460d07d2179c1c3c8b521429885cfbc6b874c9f34e37b4130b4"
},
"schema_version": "1.0",
"source": {
"id": "2311.03079",
"kind": "arxiv",
"version": 2
}
}