← back to paper
arxiv: 2606.25478 · 2 revisions
TACO: Towards Task-Consistent Open-Vocabulary Adaptation in Video Recognition