pith. sign in

arxiv: 1907.01717 · v1 · pith:4KQTLMPWnew · submitted 2019-07-03 · 💻 cs.CV · eess.IV

Unsupervised Anomalous Trajectory Detection for Crowded Scenes

Pith reviewed 2026-05-25 10:49 UTC · model grok-4.3

classification 💻 cs.CV eess.IV
keywords anomalous trajectory detectionunsupervised anomaly detectioncrowded scenesmean shift clusteringShannon entropytrajectory featuresvideo surveillance
0
0 comments X

The pith

Mean-shift clustering on trajectory features combined with entropy-based detection identifies anomalous paths in crowded videos without any labels.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes an unsupervised algorithm for spotting unusual trajectories in videos of crowded scenes. It extracts object paths using a multi-feature tracker, converts them into multiple feature representations, applies mean-shift clustering separately to each, and uses Shannon entropy to flag anomalies within clusters. A voting step then decides which full trajectories are anomalous based on their feature behaviors. A sympathetic reader would care because this avoids the need for labeled examples of normal or abnormal motion, which are hard to obtain in real-world crowd monitoring. If the method works as described, it could automate detection of suspicious movements in public spaces using only the video data itself.

Core claim

The algorithm extracts trajectories from crowded scene videos using a multi feature video object tracker, transforms them into feature spaces, performs independent mean-shift clustering on the feature matrices, identifies anomalies using a Shannon Entropy based detector, and applies a voting mechanism to select trajectories with anomalous characteristics. This process allows detection of expected anomalous trajectories in various crowd scenes with different motion patterns.

What carries the argument

Independent mean-shift clustering on trajectory feature matrices combined with Shannon entropy anomaly detection and a voting mechanism

If this is right

  • The method detects anomalous trajectories in crowd videos from standard datasets representing various motion patterns.
  • The unsupervised nature means no labeled data is required for training or detection.
  • The voting mechanism combines information from multiple feature spaces to improve anomaly identification.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the clustering reliably groups similar trajectories, the entropy measure could highlight outliers in feature distributions that correspond to unusual behaviors.
  • This could be tested by applying the same pipeline to non-crowd videos such as sports or traffic to see if it generalizes beyond the paper's datasets.

Load-bearing premise

Independent mean-shift clustering on trajectory features combined with an entropy-based detector will reliably separate normal from anomalous trajectories without supervision or labeled data.

What would settle it

Running the algorithm on a crowded scene video containing known anomalous trajectories and observing whether it correctly identifies them or incorrectly labels normal trajectories as anomalous.

Figures

Figures reproduced from arXiv: 1907.01717 by Deepak Mishra, Deepan Das.

Figure 1
Figure 1. Figure 1: Typical Crowded Scenes connection between video features and video labels. Therefore, developing Unsupervised anomaly detection systems prove to be more challenging than supervised ones. An anomaly in a crowded scene can be determined from the motion patterns of it’s constituent pedestrians and objects. Analyzing trajectory data enables one to predict and identify anomalies with an ex￾cellent degree of acc… view at source ↗
Figure 2
Figure 2. Figure 2: Crowded scene and extracted trajectories [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Different Clusterings and Anomalous Trajectory Classification [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
read the original abstract

We present an improved clustering based, unsupervised anomalous trajectory detection algorithm for crowded scenes. The proposed work is based on four major steps, namely, extraction of trajectories from crowded scene video, extraction of several features from these trajectories, independent mean-shift clustering and anomaly detection. First, the trajectories of all moving objects in a crowd are extracted using a multi feature video object tracker. These trajectories are then transformed into a set of feature spaces. Mean shift clustering is applied on these feature matrices to obtain distinct clusters, while a Shannon Entropy based anomaly detector identifies corresponding anomalies. In the final step, a voting mechanism identifies the trajectories that exhibit anomalous characteristics. The algorithm is tested on crowd scene videos from datasets. The videos represent various possible crowd scenes with different motion patterns and the method performs well to detect the expected anomalous trajectories from the scene.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The paper presents an unsupervised anomalous trajectory detection method for crowded scenes consisting of four steps: multi-feature video object tracking to extract trajectories, transformation of trajectories into multiple feature spaces, independent mean-shift clustering on the feature matrices, and Shannon entropy-based anomaly detection followed by a voting mechanism to identify anomalous trajectories. The algorithm is evaluated on crowd scene videos representing various motion patterns and is claimed to detect expected anomalies effectively.

Significance. If the pipeline can be shown to operate without supervision or data-dependent tuning while producing reliable separation of normal and anomalous trajectories, it would offer a practical contribution to video-based crowd monitoring. However, the absence of quantitative performance metrics, baseline comparisons, ablation studies, or details on hyperparameter selection in the abstract and method description makes it impossible to evaluate whether the claimed performance holds or advances the state of the art.

major comments (2)
  1. [Abstract / method description] Abstract and method description: the central claim that the approach is fully unsupervised and reliably separates normal from anomalous trajectories rests on mean-shift clustering and an entropy-based detector, yet no procedure is supplied for selecting the mean-shift kernel bandwidth or the entropy decision threshold. These choices are data-dependent and, if performed by inspection of the test videos, directly contradict the unsupervised guarantee.
  2. [Abstract] Abstract: the statement that 'the method performs well to detect the expected anomalous trajectories' is unsupported by any quantitative results, error bars, baseline comparisons, or ablation studies, so the performance claim cannot be verified and the soundness of the pipeline cannot be assessed.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We address the major comments point by point below, clarifying the unsupervised aspects of the method and committing to revisions that strengthen the presentation without altering the core claims.

read point-by-point responses
  1. Referee: [Abstract / method description] Abstract and method description: the central claim that the approach is fully unsupervised and reliably separates normal from anomalous trajectories rests on mean-shift clustering and an entropy-based detector, yet no procedure is supplied for selecting the mean-shift kernel bandwidth or the entropy decision threshold. These choices are data-dependent and, if performed by inspection of the test videos, directly contradict the unsupervised guarantee.

    Authors: We agree that explicit procedures for parameter selection are necessary to substantiate the unsupervised claim. The mean-shift bandwidth is computed in a data-driven manner as the median of pairwise Euclidean distances within each feature matrix, a standard automatic heuristic that requires no labeled data or anomaly-specific inspection. The entropy threshold is set to the 95th percentile of the entropy values computed over all trajectories in a given scene, again derived solely from the data distribution. We will add a dedicated subsection in the revised method description detailing these procedures to eliminate any ambiguity. revision: yes

  2. Referee: [Abstract] Abstract: the statement that 'the method performs well to detect the expected anomalous trajectories' is unsupported by any quantitative results, error bars, baseline comparisons, or ablation studies, so the performance claim cannot be verified and the soundness of the pipeline cannot be assessed.

    Authors: The abstract is necessarily brief and focuses on the overall outcome. The full manuscript presents qualitative results across multiple crowd videos with varying motion patterns, showing that the voting mechanism correctly flags trajectories deviating from the dominant clusters. We acknowledge that quantitative support would allow better verification of the claims. In the revision we will augment the experimental section with detection accuracy figures on scenes containing known anomalies, plus comparisons against at least two published trajectory anomaly baselines. revision: yes

Circularity Check

0 steps flagged

No circularity: standard off-the-shelf clustering and entropy applied to trajectories

full rationale

The paper describes a pipeline of trajectory extraction, feature transformation, mean-shift clustering, Shannon entropy anomaly scoring, and voting. No equations, fitted parameters, or self-referential definitions are present in the abstract or method outline. Mean-shift and entropy are invoked as standard algorithms without any claim that a derived quantity is obtained by fitting to the same data it is then used to predict. The unsupervised claim rests on the absence of labels rather than on any internal derivation that reduces to its own inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The method rests on domain assumptions about clustering and entropy without introducing new entities or fitted parameters visible in the abstract.

axioms (2)
  • domain assumption Mean-shift clustering applied independently to trajectory feature matrices produces distinct and meaningful groups of normal behavior.
    Invoked in the clustering step of the pipeline.
  • domain assumption Shannon entropy computed on trajectory features can serve as a reliable indicator of anomalous behavior.
    Used to identify anomalies after clustering.

pith-pipeline@v0.9.0 · 5661 in / 1258 out tokens · 39664 ms · 2026-05-25T10:49:24.959777+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

22 extracted references · 22 canonical work pages

  1. [1]

    How effective is human video surveillance performance?

    N. Sulman, T. Sanocki, D. Goldgof, and R. Kasturi, “How effective is human video surveillance performance?” in Pattern Recognition, 2008. ICPR 2008. 19th International Conference on . IEEE, 2008, pp. 1–3. (a) Clustering based on the density feature (b) Anomalous trajectories based on Density feature (c) Clustering based on the Shape feature (d) Anomalous ...

  2. [2]

    Observe locally, infer globally: a space- time mrf for detecting abnormal activities with incremental updates,

    J. Kim and K. Grauman, “Observe locally, infer globally: a space- time mrf for detecting abnormal activities with incremental updates,” in Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE, 2009, pp. 2921–2928

  3. [3]

    Abnormal event detection in crowded scenes using sparse representation,

    Y . Cong, J. Yuan, and J. Liu, “Abnormal event detection in crowded scenes using sparse representation,” Pattern Recognition, vol. 46, no. 7, pp. 1851–1864, 2013

  4. [4]

    A lagrangian particle dynamics approach for crowd flow segmentation and stability analysis,

    S. Ali and M. Shah, “A lagrangian particle dynamics approach for crowd flow segmentation and stability analysis,” in Computer Vision and Pattern Recognition, 2007. CVPR’07. IEEE Conference on . IEEE, 2007, pp. 1–6

  5. [5]

    Similarity based vehicle trajectory clustering and anomaly detection,

    Z. Fu, W. Hu, and T. Tan, “Similarity based vehicle trajectory clustering and anomaly detection,” in Image Processing, 2005. ICIP 2005. IEEE International Conference on , vol. 2. IEEE, 2005, pp. II–602

  6. [6]

    Multifeature object trajectory clustering for video analysis,

    N. Anjum and A. Cavallaro, “Multifeature object trajectory clustering for video analysis,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 18, no. 11, pp. 1555–1564, 2008

  7. [7]

    Counting pedestrians in video sequences using trajectory clustering,

    G. Antonini and J.-P. Thiran, “Counting pedestrians in video sequences using trajectory clustering,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no. 8, pp. 1008–1020, 2006

  8. [8]

    Browsing and exploration of video sequences: A new scheme for key frame extraction and 3d visualization using entropy based jensen divergence,

    Q. Xu, Y . Liu, X. Li, Z. Yang, J. Wang, M. Sbert, and R. Scopigno, “Browsing and exploration of video sequences: A new scheme for key frame extraction and 3d visualization using entropy based jensen divergence,” Information Sciences , vol. 278, pp. 736–756, 2014

  9. [9]

    On entropy in network traffic anomaly detection,

    J. Santiago-Paz and D. Torres-Roman, “On entropy in network traffic anomaly detection,” Entropy, vol. 20, p. 2, 2015

  10. [10]

    Dowitcher: Effective worm detection and containment in the internet core,

    S. Ranjan, S. Shah, A. Nucci, M. Munafo, R. Cruz, and S. Muthukr- ishnan, “Dowitcher: Effective worm detection and containment in the internet core,” in INFOCOM 2007. 26th IEEE International Conference on Computer Communications. IEEE . IEEE, 2007, pp. 2541–2545

  11. [11]

    A trajectory clustering approach to crowd flow segmentation in videos,

    R. Sharma and T. Guha, “A trajectory clustering approach to crowd flow segmentation in videos,” 2016 IEEE International Conference on Image Processing (ICIP), pp. 1200–1204, 2016

  12. [12]

    Use of cluster analysis with anthropological data,

    F. E. Clements, “Use of cluster analysis with anthropological data,” American Anthropologist, vol. 56, no. 2, pp. 180–199, 1954

  13. [13]

    The estimation of the gradient of a density function, with applications in pattern recognition,

    K. Fukunaga and L. Hostetler, “The estimation of the gradient of a density function, with applications in pattern recognition,” IEEE Transactions on information theory , vol. 21, no. 1, pp. 32–40, 1975

  14. [14]

    Mean shift, mode seeking, and clustering,

    Y . Cheng, “Mean shift, mode seeking, and clustering,”IEEE transactions on pattern analysis and machine intelligence , vol. 17, no. 8, pp. 790– 799, 1995

  15. [15]

    Approximate clustering via the mountain method,

    R. R. Yager and D. P. Filev, “Approximate clustering via the mountain method,” IEEE Transactions on Systems, Man, and Cybernetics , vol. 24, no. 8, pp. 1279–1284, 1994

  16. [16]

    Mean shift-based clustering,

    K.-L. Wu and M.-S. Yang, “Mean shift-based clustering,” Pattern Recognition, vol. 40, no. 11, pp. 3035–3052, 2007

  17. [17]

    Mean shift: A robust approach toward feature space analysis,

    D. Comaniciu and P. Meer, “Mean shift: A robust approach toward feature space analysis,” IEEE Transactions on pattern analysis and machine intelligence, vol. 24, no. 5, pp. 603–619, 2002

  18. [18]

    Detecting dominant motions in dense crowds,

    A. M. Cheriyadat and R. J. Radke, “Detecting dominant motions in dense crowds,” IEEE Journal of Selected Topics in Signal Processing , vol. 2, no. 4, pp. 568–581, 2008

  19. [19]

    Anomaly detection based on trajectory analysis using kernel density estimation and information bottleneck techniques,

    Y . Guo, Q. Xu, Y . Yang, S. Liang, Y . Liu, and M. Sbert, “Anomaly detection based on trajectory analysis using kernel density estimation and information bottleneck techniques,” in Tech. Rep., Technical Report

  20. [20]

    University of Girona, 2014

  21. [21]

    Video anomaly detection based on a hierarchical activity discovery within spatio- temporal contexts,

    D. Xu, R. Song, X. Wu, N. Li, W. Feng, and H. Qian, “Video anomaly detection based on a hierarchical activity discovery within spatio- temporal contexts,” Neurocomputing, vol. 143, pp. 144–152, 2014

  22. [22]

    Abnormality detection in crowd videos by tracking sparse components,

    S. Biswas and V . Gupta, “Abnormality detection in crowd videos by tracking sparse components,” Machine Vision and Applications , vol. 28, no. 1-2, pp. 35–48, 2017