StructAlign uses simplex ETF geometry and cross-modal relation-preserving losses to mitigate intra- and cross-modal feature drift in continual text-to-video retrieval.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it