A skeleton-based zero-shot VAD method distills LLM knowledge for action typicality during training and performs test-time context uniqueness analysis to derive scene-adaptive normality boundaries, claiming SOTA results on four datasets with over 100 unseen scenes.
Spatial temporal graph convolutional networks for skeleton-based action recognition
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
iPay fuses RGB and skeleton expert streams via dual-attention and a prior-driven Spatial Difference Discriminator to reach 83.45% accuracy on 500+ real-world payment clips from onboard transit cameras.
Extends Social-STGCNN with CVAE for multimodal trajectory prediction and reports moderate gains plus better diversity on ETH/UCY benchmarks and robot data.
citing papers explorer
-
Action Hints: Semantic Typicality and Context Uniqueness for Generalizable Skeleton-based Video Anomaly Detection
A skeleton-based zero-shot VAD method distills LLM knowledge for action typicality during training and performs test-time context uniqueness analysis to derive scene-adaptive normality boundaries, claiming SOTA results on four datasets with over 100 unseen scenes.
-
iPay: Integrated Payment Action Recognition via Multimodal Networks and Adaptive Spatial Prior Learning
iPay fuses RGB and skeleton expert streams via dual-attention and a prior-driven Spatial Difference Discriminator to reach 83.45% accuracy on 500+ real-world payment clips from onboard transit cameras.
-
On Improving Multimodal Pedestrian Trajectory Prediction with CVAE: A Study on Benchmark and Robot Data
Extends Social-STGCNN with CVAE for multimodal trajectory prediction and reports moderate gains plus better diversity on ETH/UCY benchmarks and robot data.