In Defense of the Triplet Loss for Person Re-Identification

Alexander Hermans , Lucas Beyer , Bastian Leibe

Authors on Pith no claims yet

classification 💻 cs.CV cs.NE

keywords learninglosstripletdeepend-to-endlargemetricperson

read the original abstract

In the past few years, the field of computer vision has gone through a revolution fueled mainly by the advent of large datasets and the adoption of deep convolutional neural networks for end-to-end learning. The person re-identification subfield is no exception to this. Unfortunately, a prevailing belief in the community seems to be that the triplet loss is inferior to using surrogate losses (classification, verification) followed by a separate metric learning step. We show that, for models trained from scratch as well as pretrained ones, using a variant of the triplet loss to perform end-to-end deep metric learning outperforms most other published methods by a large margin.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 10 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

From Global to Local: Rethinking CLIP Feature Aggregation for Person Re-Identification
cs.CV 2026-04 conditional novelty 7.0

SAGA-ReID improves CLIP-based person ReID by using structured anchor-guided aggregation of patch tokens, delivering up to 10.6 Rank-1 gains on occluded benchmarks over global pooling.
MELD: Multi-Task Equilibrated Learning Detector for AI-Generated Text
cs.CL 2026-05 unverdicted novelty 6.0

MELD is a multi-task AI-text detector using auxiliary heads, uncertainty-weighted losses, EMA distillation, and pairwise ranking that reaches 99.9% TPR at 1% FPR on a new held-out benchmark while remaining competitive...
Prompt-Anchored Vision-Text Distillation for Lifelong Person Re-identification
cs.CV 2026-05 unverdicted novelty 6.0

PAD uses prompt distillation on the text side and domain-adaptive EMA prompts on the visual side to balance stability and plasticity in lifelong person re-identification.
ICPR 2026 Competition on Privacy-Preserving Person Re-Identification from Top-View RGB-Depth Camera (TVRID)
cs.CV 2026-05 accept novelty 6.0

A new benchmark dataset and competition for top-view RGB-Depth person re-identification is released, with competition results showing RGB easier than depth and cross-modal retrieval.
Complexity of Linear Regions in Self-supervised Deep ReLU Networks
cs.LG 2026-04 unverdicted novelty 6.0

Self-supervised ReLU networks form substantially fewer linear regions than supervised models for comparable accuracy, with contrastive methods rapidly expanding regions and self-distillation consolidating them, enabli...
Thinking Before Matching: A Reinforcement Reasoning Paradigm Towards General Person Re-Identification
cs.CV 2026-04 unverdicted novelty 6.0

ReID-R achieves competitive person re-identification performance using chain-of-thought reasoning and reinforcement learning with only 14.3K non-trivial samples, about 20.9% of typical data scales, while providing int...
CraterBench-R: Instance-Level Crater Retrieval for Planetary Scale
cs.CV 2026-04 unverdicted novelty 6.0

CraterBench-R is a new retrieval benchmark where self-supervised ViTs with a training-free instance-token aggregation method achieve high accuracy for identifying individual craters while reducing storage needs.
Beyond Pedestrians: Caption-Guided CLIP Framework for High-Difficulty Video-based Person Re-Identification
cs.CV 2026-04 unverdicted novelty 5.0

CG-CLIP adds caption-guided memory refinement and token-based spatiotemporal aggregation to CLIP for video person ReID, outperforming SOTA on MARS, iLIDS-VID, SportsVReID and DanceVReID.
On the Properties of Feature Attribution for Supervised Contrastive Learning
cs.LG 2026-04 unverdicted novelty 4.0

Neural networks trained via supervised contrastive learning yield feature attributions that are more faithful, less complex, and more continuous than those from cross-entropy trained networks.
Identity-Aware U-Net: Fine-grained Cell Segmentation via Identity-Aware Representation Learning
cs.CV 2026-04 unverdicted novelty 4.0

Identity-Aware U-Net augments a U-Net backbone with an auxiliary embedding branch and triplet metric learning to discriminate among cells with near-identical shapes and textures.