From Data to Model: A Survey of the Compression Lifecycle in MLLMs , url=

Wu, Hao, Tong, Junlong, Wang, Xudong, Tan, Yang, Zeng, Changyu, Antsiferova, Anastasia · arXiv 0375.554951

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Beyond FLOPs: Benchmarking Real Inference Acceleration of LLM Pruning under a GEMM-Centric Taxonomy

cs.LG · 2026-06-08 · conditional · novelty 7.0

A GEMM-centric taxonomy and unified benchmark show static depth pruning as the strongest Pareto-optimal baseline for LLM inference acceleration, with the frontier shifting to dynamic depth then static width pruning as quality loss rises.

miniReranker: Efficient Multimodal Reranking through Visual Cache Reuse and Interaction Sparsity

cs.IR · 2026-06-09 · unverdicted · novelty 6.0

miniReranker reduces multimodal reranking runtime to under 1% of the dense baseline under high-reuse conditions while retaining over 96% of performance via vision-first prompting, early exit, sparse cross-segment attention, and embedder-guided token pruning.

citing papers explorer

Showing 2 of 2 citing papers.

Beyond FLOPs: Benchmarking Real Inference Acceleration of LLM Pruning under a GEMM-Centric Taxonomy cs.LG · 2026-06-08 · conditional · none · ref 69
A GEMM-centric taxonomy and unified benchmark show static depth pruning as the strongest Pareto-optimal baseline for LLM inference acceleration, with the frontier shifting to dynamic depth then static width pruning as quality loss rises.
miniReranker: Efficient Multimodal Reranking through Visual Cache Reuse and Interaction Sparsity cs.IR · 2026-06-09 · unverdicted · none · ref 16
miniReranker reduces multimodal reranking runtime to under 1% of the dense baseline under high-reuse conditions while retaining over 96% of performance via vision-first prompting, early exit, sparse cross-segment attention, and embedder-guided token pruning.

From Data to Model: A Survey of the Compression Lifecycle in MLLMs , url=

fields

years

verdicts

representative citing papers

citing papers explorer