Feast your eyes: Mixture-of-resolution adaptation for multimodal large language models

10 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 3 · baseline 1

citation-polarity summary

fields

cs.CV 10

verdicts

UNVERDICTED 10

representative citing papers

Beyond Encoder Accumulation: Measuring Encoder Roles in Multi-Encoder VLMs

cs.CV · 2026-06-02 · unverdicted · novelty 6.0

Retraining all 31 non-empty subsets of five vision encoders shows that Capacity and Necessity are distinct, that pre-projector effective rank predicts residual performance at fixed parameter count, and that pairs of a high-Capacity encoder with an adaptive complement match the full five-encoder model.

Kwai Keye-VL-2.0 Technical Report

cs.CV · 2026-06-09 · unverdicted · novelty 4.0

Kwai Keye-VL-2.0-30B-A3B is a 30B-parameter MoE model with 3B active parameters that uses DSA adaptation and MOPD distillation and reports state-of-the-art results on video understanding and agent benchmarks.

citing papers explorer

Showing 10 of 10 citing papers after filters.