Ofa: Unifying architectures, tasks, and modalities through a simple sequence-to-sequence learning framework

Peng Wang, An Yang, Rui Men, Junyang Lin, Shuai Bai, Zhikang Li, Jianxin Ma, Chang Zhou, Jingren Zhou, Hongxia Yang · 2022

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models

cs.CV · 2025-01-23 · unverdicted · novelty 6.0

FreeZAD applies vision-language models with LogOIC scoring and frequency-based actionness calibration for training-free zero-shot temporal action detection, outperforming unsupervised methods on THUMOS14 and ActivityNet-1.3 while using 1/13 the runtime.

citing papers explorer

Showing 1 of 1 citing paper.

Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models cs.CV · 2025-01-23 · unverdicted · none · ref 36
FreeZAD applies vision-language models with LogOIC scoring and frequency-based actionness calibration for training-free zero-shot temporal action detection, outperforming unsupervised methods on THUMOS14 and ActivityNet-1.3 while using 1/13 the runtime.

Ofa: Unifying architectures, tasks, and modalities through a simple sequence-to-sequence learning framework

fields

years

verdicts

representative citing papers

citing papers explorer