Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs

6 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

fields

cs.CV 6

years

2026 6

verdicts

UNVERDICTED 6

representative citing papers

PEEK: Picking Essential frames via Efficient Knowledge distillation

cs.CV · 2026-05-29 · unverdicted · novelty 6.0

PEEK distills caption-conditioned frame relevance into a lightweight visual model. It outperforms adaptive baselines on ActivityNet Captions and MSR-VTT, especially at budgets of 1-2 frames, while adding only 5.2% overhead.

Watch, Remember, Reason: Human-View Video Understanding with MLLMs

cs.CV · 2026-06-05 · unverdicted · novelty 4.0

This survey frames video MLLM research through a human-view formulation of perceptual representations, memory states, reasoning traces, and predictions, then reviews methods, datasets, benchmarks, and open problems.
