pith:345QV4PT
MERVIN: A Unified Framework for Multimodal Event Retrieval in Vietnamese News Videos
A framework unifies visual frames, enhanced transcripts, and summaries for retrieving events in Vietnamese news videos.
arxiv:2605.16120 v1 · 2026-05-15 · cs.IR
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{345QV4PTXZL7TF33V6CPMWQTLM}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
MERVIN scored 79 out of 88 points in the qualification phase of AI Challenge HCMC 2025 and successfully retrieved all results for every query in the final round.
The approach assumes that combining keyframes, Gemini-enhanced transcripts, and video summaries via separate visual and textual embeddings produces meaningfully better semantic retrieval than simpler single-modality baselines for Vietnamese news content.
MERVIN is a multimodal retrieval system for Vietnamese news videos that integrates visual and textual features with LLM-enhanced transcripts and reports strong results on a 2025 AI challenge.
Receipt and verification
| First computed | 2026-05-20T00:01:53.715193Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
df3b0af1f3be57f9977baf84f65a135b24c9c488d058cfb96228afeed66ff06d
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/345QV4PTXZL7TF33V6CPMWQTLM \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: df3b0af1f3be57f9977baf84f65a135b24c9c488d058cfb96228afeed66ff06d
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "15b8f30bb571cdd6c6e6cc90edfa601e3ce04f54f935a5145a6de89e396b69cc",
"cross_cats_sorted": [],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.IR",
"submitted_at": "2026-05-15T16:02:48Z",
"title_canon_sha256": "cf4c6d90698f5cb344f26f51bcc7e188c0c6986665f14ab2c73a4596dbc0506c"
},
"schema_version": "1.0",
"source": {
"id": "2605.16120",
"kind": "arxiv",
"version": 1
}
}
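The verification command above canonicalizes the record (sorted keys, compact separators, UTF-8) before hashing it with SHA-256. The following is a minimal offline sketch of that same step in Python, applied to the record as listed here. The helper names `canonical_bytes` and `canonical_hash` are illustrative and not part of any Pith tooling, and it is an assumption that the listing above is byte-for-byte the record the server returns.

```python
import hashlib
import json

# Record copied from the "Canonical record JSON" listing above.
RECORD_JSON = """
{
  "metadata": {
    "abstract_canon_sha256": "15b8f30bb571cdd6c6e6cc90edfa601e3ce04f54f935a5145a6de89e396b69cc",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.IR",
    "submitted_at": "2026-05-15T16:02:48Z",
    "title_canon_sha256": "cf4c6d90698f5cb344f26f51bcc7e188c0c6986665f14ab2c73a4596dbc0506c"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.16120",
    "kind": "arxiv",
    "version": 1
  }
}
"""


def canonical_bytes(record: dict) -> bytes:
    """Serialize with sorted keys and no whitespace, as the one-liner does."""
    text = json.dumps(
        record,
        sort_keys=True,
        separators=(",", ":"),
        ensure_ascii=False,
    )
    return text.encode("utf-8")


def canonical_hash(record: dict) -> str:
    """Return the SHA-256 hex digest of the canonical serialization."""
    return hashlib.sha256(canonical_bytes(record)).hexdigest()


record = json.loads(RECORD_JSON)
digest = canonical_hash(record)
print(digest)
```

Compare the printed digest with the value under "Canonical hash". Because keys are sorted before serialization, the digest does not depend on the key order or whitespace of the input JSON.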