StructAlign uses simplex ETF geometry and cross-modal relation-preserving losses to mitigate intra- and cross-modal feature drift in continual text-to-video retrieval.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
representative citing papers
citing papers explorer
-
StructAlign: Structured Cross-Modal Alignment for Continual Text-to-Video Retrieval
StructAlign uses simplex ETF geometry and cross-modal relation-preserving losses to mitigate intra- and cross-modal feature drift in continual text-to-video retrieval.