Advances in Neural Information Processing Systems

Two-stream convolutional networks for action recognition in videos

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

cs.CV · 2023-10-03 · unverdicted · novelty 6.0

LanguageBind aligns video, infrared, depth, and audio to a frozen language encoder via contrastive learning on the new VIDAL-10M dataset, extending video-language pretraining to N modalities.

Hybrid Congestion Classification Framework Using Flow-Guided Attention and Empirical Mode Decomposition

cs.CV · 2026-05-06 · unverdicted · novelty 3.0

FLO-EMD integrates flow-guided attention and EMD on aggregated motion traces to classify light, medium, and heavy congestion at 97.5% accuracy on 1,050 surveillance clips.

citing papers explorer

Showing 2 of 2 citing papers.

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

cs.CV · 2023-10-03 · unverdicted · none · ref 111

Hybrid Congestion Classification Framework Using Flow-Guided Attention and Empirical Mode Decomposition

cs.CV · 2026-05-06 · unverdicted · none · ref 56
