ADEPT introduces an entropy-driven dual-strategy agent that selects between ASK and REFINE actions to improve interactive video retrieval and outperforms non-interactive, heuristic, and Video-LLM baselines on two datasets.
ADEPT: An Entropy-Driven Dual-Strategy Agent for Interactive Video Retrieval
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
This research aims to solve the challenge of video retrieval from massive datasets, caused by ambiguous user queries. Prevailing single-round retrieval paradigms face a performance bottleneck, as they lack effective feedback mechanisms to handle complex search intentions. The root cause is the "Intent-Query Gap", where users' intent cannot be captured by a simple text query. To solve this, we propose the ADEPT framework: a training-free agent that pioneers an entropy-driven decision engine to efficiently guide dialogue by dynamically selecting between ASK and REFINE strategies. Experiments on two challenging datasets demonstrate that ADEPT significantly outperforms all non-interactive, heuristic, and Video-LLM baselines. The core contribution of this work is an efficient and interpretable entropy-driven interactive strategy that sets a new performance benchmark for the field of interactive video retrieval.
fields
cs.IR 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
ADEPT: An Entropy-Driven Dual-Strategy Agent for Interactive Video Retrieval
ADEPT introduces an entropy-driven dual-strategy agent that selects between ASK and REFINE actions to improve interactive video retrieval and outperforms non-interactive, heuristic, and Video-LLM baselines on two datasets.