Modeling temporal dynamics and spatial configurations of actions using two-stream recurrent neural networks

Hongsong Wang, Liang Wang · 2017

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Marrying Text-to-Motion Generation with Skeleton-Based Action Recognition

cs.CV · 2026-04-18 · unverdicted · novelty 7.0

CoAMD unifies skeleton-based action recognition and text-to-motion generation through autoregressive diffusion guided by a multi-modal recognizer, reporting SOTA results on 13 benchmarks for four tasks.

Towards Universal Skeleton-Based Action Recognition

cs.CV · 2026-04-18 · unverdicted · novelty 5.0

A Transformer model with unified skeleton representation, two-stream motion encoder, and multi-grained motion-text contrastive alignment achieves effective recognition on a new integrated heterogeneous open-vocabulary skeleton dataset.

citing papers explorer

Marrying Text-to-Motion Generation with Skeleton-Based Action Recognition · cs.CV · 2026-04-18 · unverdicted · none · ref 39
Towards Universal Skeleton-Based Action Recognition · cs.CV · 2026-04-18 · unverdicted · none · ref 52