SVD-Prune selects vision tokens via SVD leverage scores to outperform attention-based pruning at extreme budgets of 32 or 16 tokens.
Visionzip: Longer is better but not necessary in vision language models,
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Beyond Attention Scores: SVD-Based Vision Token Pruning for Efficient Vision-Language Models
SVD-Prune selects vision tokens via SVD leverage scores to outperform attention-based pruning at extreme budgets of 32 or 16 tokens.