pith:CLXB2SP2
CogVLM2: Visual Language Models for Image and Video Understanding
The CogVLM2 family reaches state-of-the-art results on image and video benchmarks by refining visual expert architectures and training recipes.
arxiv:2408.16500 v1 · 2024-08-29 · cs.CV
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{CLXB2SP2UUUTKFNJPVGOMMGOJK}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
CogVLM2 family has achieved state-of-the-art results on benchmarks like MMBench, MM-Vet, TextVQA, MVBench and VCGBench.
That the reported benchmark improvements stem primarily from the described architecture changes and training recipes rather than undisclosed increases in model size, data volume, or compute.
CogVLM2 family achieves state-of-the-art results on image and video understanding benchmarks through improved visual expert architecture, higher resolution inputs, and automated temporal grounding for videos.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:46.728968Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
12ee1d49faa5293515a97d4ce630ce4ab4f212633893fb842a1cdacd5ad6e731
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/CLXB2SP2UUUTKFNJPVGOMMGOJK \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 12ee1d49faa5293515a97d4ce630ce4ab4f212633893fb842a1cdacd5ad6e731
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "4a0b0561fb635c897f4d823de89ce5cf6644e8c9e66a8d0e0a72614a246a4b9f",
"cross_cats_sorted": [],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CV",
"submitted_at": "2024-08-29T12:59:12Z",
"title_canon_sha256": "9b21e119188e35c13e6672981c4e7ab473790a80b12ce65aa9eb12dddf1a2839"
},
"schema_version": "1.0",
"source": {
"id": "2408.16500",
"kind": "arxiv",
"version": 1
}
}