Learning by tracking: Siamese CNN for robust target association

Cristian Canton Ferrer; Konrad Schindler; Laura Leal-Taix\'e

arxiv: 1604.07866 · v3 · pith:WLHVMZAOnew · submitted 2016-04-26 · 💻 cs.LG · cs.CV

Learning by tracking: Siamese CNN for robust target association

Laura Leal-Taix\'e , Cristian Canton Ferrer , Konrad Schindler This is my paper

classification 💻 cs.LG cs.CV

keywords learningtrackingapproachassociationinputmatchingpatchessiamese

0 comments

read the original abstract

This paper introduces a novel approach to the task of data association within the context of pedestrian tracking, by introducing a two-stage learning scheme to match pairs of detections. First, a Siamese convolutional neural network (CNN) is trained to learn descriptors encoding local spatio-temporal structures between the two input image patches, aggregating pixel values and optical flow information. Second, a set of contextual features derived from the position and size of the compared input patches are combined with the CNN output by means of a gradient boosting classifier to generate the final matching probability. This learning approach is validated by using a linear programming based multi-person tracker showing that even a simple and efficient tracker may outperform much more complex models when fed with our learned matching probabilities. Results on publicly available sequences show that our method meets state-of-the-art standards in multiple people tracking.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

NOOUGAT: Towards Unified Online and Offline Multi-Object Tracking
cs.CV 2025-09 unverdicted novelty 5.0

NOOUGAT unifies online and offline multi-object tracking with a GNN that processes non-overlapping subclips fused by an Autoregressive Long-term Tracking layer, reporting SOTA gains on DanceTrack, SportsMOT, and MOT20.