arXiv:2208.01897 (2022)

Leong, M · 2022 · arXiv 2208.01897

2 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

Exploring High-Order Self-Similarity for Video Understanding

cs.CV · 2026-04-22 · unverdicted · novelty 6.0

The MOSS module learns and combines multi-order space-time self-similarity features to enhance temporal dynamics modeling in videos across action recognition, VQA, and robotic tasks.

TAG-Head: Time-Aligned Graph Head for Plug-and-Play Fine-grained Action Recognition

cs.CV · 2026-04-13 · unverdicted · novelty 6.0

TAG-Head adds a lightweight transformer-plus-time-aligned-graph head to existing 3D backbones to reach new RGB-only state-of-the-art on FineGym and HAA500 while surpassing some multimodal baselines.

citing papers explorer

Showing 2 of 2 citing papers.

Exploring High-Order Self-Similarity for Video Understanding · cs.CV · 2026-04-22 · unverdicted · none · ref 36
TAG-Head: Time-Aligned Graph Head for Plug-and-Play Fine-grained Action Recognition · cs.CV · 2026-04-13 · unverdicted · none · ref 18
