Human-centric video anomaly detection through spatio-temporal pose tokenization and transformer

· 2024 · arXiv 2408.15185

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

From Frames to Events: Rethinking Evaluation in Human-Centric Video Anomaly Detection

cs.CV · 2026-04-10 · conditional · novelty 8.0

State-of-the-art pose-based video anomaly detection models achieve over 52% frame-level AUC-ROC but drop below 10% event-level precision and 0.11 average F1 when evaluated with temporal action localization metrics on standard benchmarks.

Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild

cs.CV · 2026-03-05 · unverdicted · novelty 4.0

Zero-shot MLLMs on ShanghaiTech and CHAD exhibit strong conservative bias with high precision but collapsed recall; class-specific prompts raise peak F1 from 0.09 to 0.64 yet recall remains the bottleneck.

citing papers explorer

Showing 2 of 2 citing papers.

From Frames to Events: Rethinking Evaluation in Human-Centric Video Anomaly Detection cs.CV · 2026-04-10 · conditional · none · ref 22
State-of-the-art pose-based video anomaly detection models achieve over 52% frame-level AUC-ROC but drop below 10% event-level precision and 0.11 average F1 when evaluated with temporal action localization metrics on standard benchmarks.
Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild cs.CV · 2026-03-05 · unverdicted · none · ref 13
Zero-shot MLLMs on ShanghaiTech and CHAD exhibit strong conservative bias with high precision but collapsed recall; class-specific prompts raise peak F1 from 0.09 to 0.64 yet recall remains the bottleneck.

Human-centric video anomaly detection through spatio-temporal pose tokenization and transformer

fields

years

verdicts

representative citing papers

citing papers explorer