pith:F5SAP2TV
OmniDrop: Layer-wise Token Pruning for Omni-modal LLMs via Query-Guidance
Layer-wise token pruning inside the LLM decoder, guided by text queries, allows omni-modal models to process audiovisual inputs faster while maintaining or improving accuracy.
arxiv:2605.14458 v1 · 2026-05-14 · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{F5SAP2TVLX4NM6K2KGMP5NZQA6}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Experimental results across various audiovisual benchmarks demonstrate that OmniDrop outperforms all baselines by up to 3.58 points while reducing prefill latency by up to 40% and memory usage by up to 14.7%.
That performing initial fusion in early layers followed by aggressive pruning in deeper layers, guided by text queries, reliably preserves semantic information without task-specific degradation.
OmniDrop is a training-free layer-wise token pruning framework for omni-modal LLMs that uses query guidance and temporal diversity to reduce prefill latency by up to 40% and memory by 14.7% while improving benchmark scores by up to 3.58 points.
References
Receipt and verification
| First computed | 2026-05-17T23:39:06.813478Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
2f6407ea755df8d6795a5198feb730078f9458cac5c43164d03ff50bdcdbf789
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/F5SAP2TVLX4NM6K2KGMP5NZQA6 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 2f6407ea755df8d6795a5198feb730078f9458cac5c43164d03ff50bdcdbf789
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "a2aec49115402a41aa1f34d23c264ee1108ba0bcde0b78447ce43214e684e684",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.AI",
"submitted_at": "2026-05-14T06:54:37Z",
"title_canon_sha256": "2f0d6d89da70f9c913e7892280f53c6cf2693e18e4a8da795994f65cbc2d4b85"
},
"schema_version": "1.0",
"source": {
"id": "2605.14458",
"kind": "arxiv",
"version": 1
}
}