pith. sign in

Words over pixels? rethinking vision in multimodal large language models

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.IR 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

QKVQA: Question-Focused Filtering for Knowledge-based VQA

cs.IR · 2026-01-20 · unverdicted · novelty 6.0

QKVQA proposes a question-focused filtering method with QFF and CDA modules that boosts accuracy by 3.2 points on Encyclopedic-VQA and 2.2 points on InfoSeek over prior state-of-the-art.

citing papers explorer

Showing 1 of 1 citing paper.

  • QKVQA: Question-Focused Filtering for Knowledge-based VQA cs.IR · 2026-01-20 · unverdicted · none · ref 12

    QKVQA proposes a question-focused filtering method with QFF and CDA modules that boosts accuracy by 3.2 points on Encyclopedic-VQA and 2.2 points on InfoSeek over prior state-of-the-art.