pith. sign in

hub Canonical reference

MOT16: a benchmark for multi-object tracking.arXiv preprint arXiv:1603.00831

Canonical reference. 80% of citing Pith papers cite this work as background.

15 Pith papers citing it
Background 80% of classified citations
abstract

Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore important guides for reseach. Recently, a new benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal of collecting existing and new data and creating a framework for the standardized evaluation of multiple object tracking methods. The first release of the benchmark focuses on multiple people tracking, since pedestrians are by far the most studied object in the tracking community. This paper accompanies a new release of the MOTChallenge benchmark. Unlike the initial release, all videos of MOT16 have been carefully annotated following a consistent protocol. Moreover, it not only offers a significant increase in the number of labeled boxes, but also provides multiple object classes beside pedestrians and the level of visibility for every single object of interest.

hub tools

citation-role summary

background 4 method 1

citation-polarity summary

verdicts

UNVERDICTED 15

representative citing papers

CityOS: Privacy Architecture for Urban Sensing

cs.OS · 2026-05-04 · unverdicted · novelty 7.0

CityOS is an edge runtime that enforces a three-tier privacy API for urban sensors: local raw data, differentially private single-location stats, and cross-location aggregates with per-user budgets enforced on devices.

Learned Nonlocal Feature Matching and Filtering for RAW Image Denoising

eess.IV · 2026-04-19 · unverdicted · novelty 7.0

A learnable nonlocal block that mimics classical neighbor matching and collaborative filtering on multiscale features produces competitive RAW denoising with far fewer parameters than current deep models and generalizes across sensors.

Towards Unconstrained Human-Object Interaction

cs.CV · 2026-04-15 · unverdicted · novelty 7.0

Introduces the U-HOI task and shows MLLMs plus a language-to-graph pipeline can handle human-object interactions without any predefined vocabulary at training or inference time.

STORM: End-to-End Referring Multi-Object Tracking in Videos

cs.CV · 2026-04-12 · unverdicted · novelty 7.0

STORM is an end-to-end MLLM for referring multi-object tracking that uses task-composition learning to leverage sub-task data and introduces the STORM-Bench dataset, achieving SOTA results.

SAMOFT: Robust Multi-Object Tracking via Region and Flow

cs.CV · 2026-05-10 · unverdicted · novelty 5.0

SAMOFT improves multi-object tracking by using SAM segmentation and optical flow for pixel-level motion matching, flexible centroid correction, and training-free motion pattern fixes on top of standard Kalman and ReID baselines.

citing papers explorer

Showing 15 of 15 citing papers.