A decoupled two-stage training pipeline with a single vision encoder enables joint image-to-image and text-to-image person re-identification by avoiding cross-task interference, with I2I pre-training and textual supervision shown to benefit both tasks.
Hierarchical prompt learning for image-and text-based person re-identification.arXiv preprint arXiv:2511.13575, 2025
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CV 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Towards Resolving Optimization Conflicts Between Image- and Text-Based Person Re-Identification
A decoupled two-stage training pipeline with a single vision encoder enables joint image-to-image and text-to-image person re-identification by avoiding cross-task interference, with I2I pre-training and textual supervision shown to benefit both tasks.