T2m-gpt: Generating human motion from textual descriptions with discrete representations

· 2023 · arXiv 2301.06052

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 2 baseline 1

citation-polarity summary

background 2 baseline 1

representative citing papers

ScaleMoGen: Autoregressive Next-Scale Prediction for Human Motion Generation

cs.CV · 2026-05-12 · unverdicted · novelty 7.0

ScaleMoGen introduces a scale-wise autoregressive framework that quantizes motions into hierarchical discrete tokens and predicts next-scale maps to achieve SOTA FID 0.030 on HumanML3D and text-guided editing.

ExpertEdit: Learning Skill-Aware Motion Editing from Expert Videos

cs.CV · 2026-04-12 · unverdicted · novelty 7.0

ExpertEdit edits novice motions to expert skill levels by learning a motion prior from unpaired videos and infilling masked skill-critical spans.

cs.AI · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

SDFlow learns a global transport map via similarity-driven flow matching in VQ latent space, using low-rank manifold decomposition and a categorical posterior to handle discreteness, yielding SOTA long-horizon performance and inference speedups.

MSDformer: Multi-scale Discrete Transformer For Time Series Generation

cs.LG · 2025-05-20 · unverdicted · novelty 5.0

MSDformer introduces a multi-scale discrete transformer that tokenizes time series at multiple scales and models them autoregressively in discrete space, claiming superior performance over prior DTM methods with rate-distortion theoretical support.

citing papers explorer

Showing 4 of 4 citing papers.

ScaleMoGen: Autoregressive Next-Scale Prediction for Human Motion Generation cs.CV · 2026-05-12 · unverdicted · none · ref 41
ScaleMoGen introduces a scale-wise autoregressive framework that quantizes motions into hierarchical discrete tokens and predicts next-scale maps to achieve SOTA FID 0.030 on HumanML3D and text-guided editing.
ExpertEdit: Learning Skill-Aware Motion Editing from Expert Videos cs.CV · 2026-04-12 · unverdicted · none · ref 67
ExpertEdit edits novice motions to expert skill levels by learning a motion prior from unpaired videos and infilling masked skill-critical spans.
SDFlow: Similarity-Driven Flow Matching for Time Series Generation cs.AI · 2026-05-07 · unverdicted · none · ref 30 · 2 links
SDFlow learns a global transport map via similarity-driven flow matching in VQ latent space, using low-rank manifold decomposition and a categorical posterior to handle discreteness, yielding SOTA long-horizon performance and inference speedups.
MSDformer: Multi-scale Discrete Transformer For Time Series Generation cs.LG · 2025-05-20 · unverdicted · none · ref 54
MSDformer introduces a multi-scale discrete transformer that tokenizes time series at multiple scales and models them autoregressively in discrete space, claiming superior performance over prior DTM methods with rate-distortion theoretical support.

T2m-gpt: Generating human motion from textual descriptions with discrete representations

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer