EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs

Yuhao Chen, Bin Shan, Xin Ye, Cheng Chen · 2026 · arXiv 2603.03681

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

VLMaxxing through FrameMogging Training-Free Anti-Recomputation for Video Vision-Language Models

cs.CV · 2026-05-05 · unverdicted · novelty 5.0

Training-free adaptive reuse of stable visual state in video VLMs reduces follow-up latency by 15-36x on Qwen2.5-VL while preserving correctness on VideoMME, with smaller first-query speedups via pruning.

citing papers explorer

VLMaxxing through FrameMogging Training-Free Anti-Recomputation for Video Vision-Language Models

cs.CV · 2026-05-05 · unverdicted · none · ref 7
