MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion

Gu, J · 2025 · arXiv 2507.08344

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

iMiGUE-3K: A Large-Scale Benchmark for Micro-Gesture Analysis with Self-Supervised Learning

cs.CV · 2026-05-16 · unverdicted · novelty 8.0

iMiGUE-3K is the largest in-the-wild micro-gesture video dataset with 3.4K clips and 37M frames from real interviews, supporting self-supervised foundation models and benchmarks that show micro-gestures improve emotion understanding.

Micro-DualNet: Dual-Path Spatio-Temporal Network for Micro-Action Recognition

cs.CV · 2026-04-22 · unverdicted · novelty 5.0

Micro-DualNet employs dual ST and TS pathways with entity-level adaptive routing and Mutual Action Consistency loss to achieve competitive results on MA-52 and state-of-the-art on iMiGUE for micro-action recognition.

Rethinking the Role of Feature Engineering and Learning Strategies in Few-Shot Hidden Emotion Recognition

cs.CV · 2026-06-30 · unverdicted · novelty 3.0

A competition-winning multi-modal model for hidden emotion recognition integrates static and dynamic pose features via cross-attention and MIL pooling while noting representation collapse in vision foundation models on micro-dynamic tasks.

citing papers explorer

iMiGUE-3K: A Large-Scale Benchmark for Micro-Gesture Analysis with Self-Supervised Learning · cs.CV · 2026-05-16 · unverdicted · none · ref 96
Micro-DualNet: Dual-Path Spatio-Temporal Network for Micro-Action Recognition · cs.CV · 2026-04-22 · unverdicted · none · ref 13
Rethinking the Role of Feature Engineering and Learning Strategies in Few-Shot Hidden Emotion Recognition · cs.CV · 2026-06-30 · unverdicted · none · ref 13
