UMT: Unified multi-modal transformers for joint video moment retrieval and highlight detection
1 Pith paper cites this work. Polarity classification is still indexing.
citation-role summary
baseline 1
citation-polarity summary
fields
cs.CV 1
years
2026 1
verdicts
UNVERDICTED 1
roles
baseline 1
polarities
baseline 1
representative citing papers
PIDNet: Progressive Implicit Decouple Network for Multimodal Action Quality Assessment
PIDNet uses progressive implicit decoupling with iMambaWave and Group3M blocks to fuse multimodal cues for improved action quality assessment on gymnastics datasets.