pith. sign in

arxiv: 2407.12758 · v1 · pith:ADY7PJDBnew · submitted 2024-07-17 · 💻 cs.CV

Mutual Information Guided Optimal Transport for Unsupervised Visible-Infrared Person Re-identification

classification 💻 cs.CV
keywords cross-modalityinformationmatchinglearningtrainingunsupervisedannotationsentropy
0
0 comments X
read the original abstract

Unsupervised visible infrared person re-identification (USVI-ReID) is a challenging retrieval task that aims to retrieve cross-modality pedestrian images without using any label information. In this task, the large cross-modality variance makes it difficult to generate reliable cross-modality labels, and the lack of annotations also provides additional difficulties for learning modality-invariant features. In this paper, we first deduce an optimization objective for unsupervised VI-ReID based on the mutual information between the model's cross-modality input and output. With equivalent derivation, three learning principles, i.e., "Sharpness" (entropy minimization), "Fairness" (uniform label distribution), and "Fitness" (reliable cross-modality matching) are obtained. Under their guidance, we design a loop iterative training strategy alternating between model training and cross-modality matching. In the matching stage, a uniform prior guided optimal transport assignment ("Fitness", "Fairness") is proposed to select matched visible and infrared prototypes. In the training stage, we utilize this matching information to introduce prototype-based contrastive learning for minimizing the intra- and cross-modality entropy ("Sharpness"). Extensive experimental results on benchmarks demonstrate the effectiveness of our method, e.g., 60.6% and 90.3% of Rank-1 accuracy on SYSU-MM01 and RegDB without any annotations.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Ranking vs. Assignment: The Metric Mismatch in Multi-View Object Association

    cs.CV 2026-06 unverdicted novelty 6.0

    Ranking metrics AP and FPR-95 can be made perfect via Sinkhorn normalization even when assignment is already correct, while optimal ranking can still produce incorrect assignments.