Tokenpacker: Efficient visual projector for multimodal LLM. International Journal of Computer Vision, pages 1–19, 2025

Wentong Li, Yuqian Yuan, Jian Liu, Dongqi Tang, Song Wang, Jie Qin, Jianke Zhu, Lei Zhang · 2025

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

GA2-CLIP: Generic Attribute Anchor for Efficient Prompt Tuning in Video-Language Models

cs.CV · 2025-11-27 · unverdicted · novelty 6.0

GA2-CLIP uses generic attribute anchors and coupled hard-soft prompts to preserve generalization in prompt-tuned video-language models on base-to-new class tasks.

citing papers explorer

Showing 1 of 1 citing paper.

GA2-CLIP: Generic Attribute Anchor for Efficient Prompt Tuning in Video-Language Models

cs.CV · 2025-11-27 · unverdicted · none · ref 20

GA2-CLIP uses generic attribute anchors and coupled hard-soft prompts to preserve generalization in prompt-tuned video-language models on base-to-new class tasks.
