et al.: Cross-modal contrastive learning with asymmetric co-attention network for video moment retrieval
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CV (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
ClipTBP: Clip-Pair based Temporal Boundary Prediction with Boundary-Aware Learning for Moment Retrieval
ClipTBP adds clip-level alignment loss and dual boundary losses to existing moment retrieval models for more accurate and robust temporal boundary prediction.