Explicit Time-Frequency Dynamics for Skeleton-Based Gait Recognition

Egene Chung; Junhyug Noh; Luca Quagliato; Seoyeon Ko; Taeyong Lee; Yeojin Song

arxiv: 2604.03002 · v1 · submitted 2026-04-03 · 💻 cs.CV

Explicit Time-Frequency Dynamics for Skeleton-Based Gait Recognition

Seoyeon Ko , Yeojin Song , Egene Chung , Luca Quagliato , Taeyong Lee , Junhyug Noh This is my paper

Pith reviewed 2026-05-13 20:30 UTC · model grok-4.3

classification 💻 cs.CV

keywords skeleton gait recognitionwavelet transformtime-frequency analysisjoint velocitiesCASIA-Bcovariate shiftsplug-and-play modulemulti-scale CNN

0 comments

The pith

A plug-and-play wavelet stream that turns joint velocities into time-frequency scalograms improves skeleton gait recognition on CASIA-B and sets a new state of the art with GaitMixer.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Skeleton-based gait models capture spatial configurations well yet often leave explicit motion dynamics implicit, which hurts performance when appearance changes. This paper adds a Wavelet Feature Stream that converts per-joint velocity sequences into multi-scale scalograms via the continuous wavelet transform and feeds them through a lightweight multi-scale CNN. The resulting descriptor fuses with any existing backbone representation without architectural changes or extra labels. Experiments on CASIA-B show consistent accuracy lifts across strong backbones, with the largest gains under bag-carrying and coat-wearing conditions. If the claim holds, explicit time-frequency modeling supplies information that standard spatio-temporal encoders do not fully recover on their own.

Core claim

The paper claims that augmenting any skeleton gait backbone with a Wavelet Feature Stream—produced by applying the continuous wavelet transform to per-joint velocity sequences, extracting features from the resulting scalograms with a lightweight multi-scale CNN, and fusing the output with the backbone representation—delivers consistent accuracy gains on CASIA-B, especially under covariate shifts, and reaches new skeleton-based state-of-the-art performance when paired with GaitMixer.

What carries the argument

Wavelet Feature Stream: per-joint velocity sequences are transformed by the continuous wavelet transform into multi-scale scalograms from which a lightweight multi-scale CNN extracts dynamic cues for fusion with the backbone.

If this is right

The stream works as a plug-and-play addition requiring no backbone modifications or extra supervision.
Accuracy gains appear across multiple strong backbones including GaitMixer, GaitFormer, and GaitGraph.
Improvements are especially large under covariate shifts such as carrying bags or wearing coats.
New state-of-the-art skeleton-based gait recognition is achieved on CASIA-B when the stream is attached to GaitMixer.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same velocity-to-scalogram approach could be tested on other motion tasks where temporal frequency content matters, such as action recognition under viewpoint change.
Hybrid pipelines that inject classical signal-processing transforms before learned encoders may prove useful for any recognition setting dominated by appearance-invariant dynamics.
Varying the wavelet family or scale selection could be explored to optimize performance for specific covariates like speed changes or terrain differences.

Load-bearing premise

The time-frequency cues produced by the continuous wavelet transform on velocities supply genuinely new information that the backbone's own spatio-temporal encoders have not already captured implicitly.

What would settle it

An ablation in which the wavelet stream is added to a backbone already equipped with explicit multi-scale velocity modeling and no accuracy gain (or a drop) is observed would falsify the claim that the stream adds complementary information.

read the original abstract

Skeleton-based gait recognizers excel at modeling spatial configurations but often underuse explicit motion dynamics that are crucial under appearance changes. We introduce a plug-and-play Wavelet Feature Stream that augments any skeleton backbone with time-frequency dynamics of joint velocities. Concretely, per-joint velocity sequences are transformed by the continuous wavelet transform (CWT) into multi-scale scalograms, from which a lightweight multi-scale CNN learns discriminative dynamic cues. The resulting descriptor is fused with the backbone representation for classification, requiring no changes to the backbone architecture or additional supervision. Across CASIA-B, the proposed stream delivers consistent gains on strong skeleton backbones (e.g., GaitMixer, GaitFormer, GaitGraph) and establishes a new skeleton-based state of the art when attached to GaitMixer. The improvements are especially pronounced under covariate shifts such as carrying bags (BG) and wearing coats (CL), highlighting the complementarity of explicit time-frequency modeling and standard spatio-temporal encoders.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The wavelet stream on joint velocities is a clean plug-in but its claimed complementarity to backbone encoders still needs the full experiments to hold up.

read the letter

The paper adds a wavelet feature stream that converts per-joint velocity sequences into multi-scale scalograms via the continuous wavelet transform, then runs those through a small multi-scale CNN before fusing the result with any existing skeleton backbone. The whole thing is plug-and-play, needs no architecture changes or extra labels, and is tested on CASIA-B with models like GaitMixer, GaitFormer, and GaitGraph. The abstract reports consistent gains, especially under bag and coat conditions, and a new SOTA when attached to GaitMixer. That explicit time-frequency construction on velocities has not appeared in the skeleton-gait papers referenced here, so the core idea is new. The motivation is reasonable: standard spatio-temporal encoders may not surface frequency content as directly, and the method targets exactly the covariate shifts that hurt appearance-based cues. If the fusion really supplies non-redundant information, this could become a standard add-on for any skeleton pipeline facing real-world variation. The soft spot is verification. The abstract gives no tables, error bars, ablation breakdowns, or statistical tests, so it is impossible to tell whether the gains come from the wavelet representation itself or simply from the extra branch and parameter budget. The stress-test concern is fair: backbones like GaitMixer already process velocity-derived sequences through temporal convolutions or attention, and nothing in the provided description rules out that the scalogram features largely overlap with what those encoders already compute internally. If the full paper shows clean ablations that isolate the CWT step and confirms the improvements survive proper controls, the claim strengthens; without that, the complementarity remains unproven. This is for researchers working on skeleton-based gait who want a lightweight way to inject explicit frequency dynamics. Anyone already running CASIA-B experiments could test the stream quickly. The work is coherent on its own terms and engages the literature directly, so it deserves a serious referee to check the empirical details and decide whether the gains are real.

Referee Report

2 major / 2 minor

Summary. The paper proposes a plug-and-play Wavelet Feature Stream for skeleton-based gait recognition. Per-joint velocity sequences are converted via continuous wavelet transform (CWT) into multi-scale scalograms; a lightweight multi-scale CNN extracts time-frequency descriptors that are fused with the output of an unmodified backbone (GaitMixer, GaitFormer, GaitGraph, etc.). On CASIA-B the stream yields consistent gains, most pronounced under bag (BG) and coat (CL) covariates, and produces a new skeleton-based state of the art when attached to GaitMixer.

Significance. If the reported gains are shown to arise specifically from the explicit time-frequency representation rather than from extra parameters or ensembling, the module would supply a lightweight, architecture-agnostic way to inject multi-scale motion dynamics into existing spatio-temporal encoders, improving robustness to appearance changes without retraining or additional supervision.

major comments (2)

[Abstract] Abstract and experimental claims: the headline result (new SOTA on CASIA-B with GaitMixer, consistent gains under BG/CL) is stated without any quantitative tables, error bars, statistical tests, or ablation numbers in the supplied abstract; the central empirical claim therefore cannot be verified from the given text.
[Method] Method and fusion section: the argument that CWT velocity scalograms supply genuinely complementary cues rests on the untested assumption that standard backbones (GaitMixer et al.) do not already capture equivalent time-frequency information through their temporal convolutions or attention; no ablation (e.g., replacing CWT with raw velocity or comparing internal activations) is described to rule out redundancy or simple ensembling effects.

minor comments (2)

[Method] Notation: the precise CWT parameters (mother wavelet, scale range, discretization) and the exact fusion operator (concatenation, attention, etc.) should be stated explicitly for reproducibility.
[Figures] Figure clarity: scalogram visualizations and the architecture diagram of the multi-scale CNN would benefit from clearer labeling of frequency bands and channel dimensions.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment below and have revised the manuscript to strengthen the empirical presentation and provide additional evidence of complementarity.

read point-by-point responses

Referee: [Abstract] Abstract and experimental claims: the headline result (new SOTA on CASIA-B with GaitMixer, consistent gains under BG/CL) is stated without any quantitative tables, error bars, statistical tests, or ablation numbers in the supplied abstract; the central empirical claim therefore cannot be verified from the given text.

Authors: We agree that the abstract should include key quantitative results to support the claims. In the revised manuscript we have updated the abstract to report the specific accuracy gains on CASIA-B (normal, BG, and CL conditions), the new skeleton-based SOTA numbers achieved with GaitMixer, and explicit references to the main result tables that contain error bars, statistical comparisons, and ablation studies. revision: yes
Referee: [Method] Method and fusion section: the argument that CWT velocity scalograms supply genuinely complementary cues rests on the untested assumption that standard backbones (GaitMixer et al.) do not already capture equivalent time-frequency information through their temporal convolutions or attention; no ablation (e.g., replacing CWT with raw velocity or comparing internal activations) is described to rule out redundancy or simple ensembling effects.

Authors: The referee is correct that direct evidence is needed to establish complementarity rather than redundancy or ensembling. While the original experiments already show consistent gains when the stream is attached to several unmodified backbones, we acknowledge the absence of targeted ablations such as raw-velocity replacement or feature-correlation analysis. We have added these experiments to the revised manuscript; the new results confirm that the CWT scalograms capture multi-scale dynamics that are not fully exploited by the temporal modules of the tested backbones. revision: yes

Circularity Check

0 steps flagged

No circularity: additive wavelet stream is independent of backbone internals

full rationale

The paper introduces an explicit CWT-based velocity scalogram stream processed by a lightweight multi-scale CNN and fused at the representation level with any existing skeleton backbone (GaitMixer, GaitFormer, GaitGraph). No equation or derivation reduces the claimed gains to a quantity already fitted inside the cited backbones; the fusion is presented as plug-and-play without architectural changes or additional supervision. The central claim rests on empirical improvements under covariate shifts rather than any self-definitional mapping or self-citation chain that would force the result by construction. The method is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no explicit free parameters, axioms, or invented entities are stated. The approach relies on the standard mathematical properties of the continuous wavelet transform and the universal approximation capability of CNNs, both treated as background.

pith-pipeline@v0.9.0 · 5477 in / 1151 out tokens · 39536 ms · 2026-05-13T20:30:30.592823+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

22 extracted references · 22 canonical work pages · 1 internal anchor

[1]

Explicit Time-Frequency Dynamics for Skeleton-Based Gait Recognition

INTRODUCTION Gait recognition identifies individuals from their walking pat- terns and is attractive for its non-invasive, long-range nature. Methods are commonly grouped intoappearance-basedand skeleton-basedapproaches. Appearance-based systems learn from silhouettes [1, 2] or pixel intensities [3] and currently re- port state-of-the-art results on CASIA...

work page internal anchor Pith review Pith/arXiv arXiv 2026
[2]

Appearance models currently lead on CASIA-B, while skeleton models have rapidly narrowed the gap with improved spatio–temporal modeling

RELATED WORK Gait recognition identifies individuals from walking pat- terns and is typically approached viaappearance-basedor skeleton-basedmethods. Appearance models currently lead on CASIA-B, while skeleton models have rapidly narrowed the gap with improved spatio–temporal modeling. Appearance-based.Silhouette/pixel methods such as Gait- Net [5], GaitS...

work page
[3]

(1) Abackbone streamutilizes a state-of-the-art skeleton model (e.g., GaitMixer [11]) to encode global spatio-temporal pat- terns from joint sequences

METHOD Our framework consists of two complementary streams. (1) Abackbone streamutilizes a state-of-the-art skeleton model (e.g., GaitMixer [11]) to encode global spatio-temporal pat- terns from joint sequences. (2) The proposedwavelet feature streamexplicitly models motion dynamics by transforming per-joint velocities into time–frequency representations ...

work page
[4]

Experimental Settings Datasets.CASIA-B [15] is a widely used multi-view gait dataset comprising 124 subjects recorded from 11 viewpoints (angles0 ◦ to180 ◦ in18 ◦ steps)

EXPERIMENTS 4.1. Experimental Settings Datasets.CASIA-B [15] is a widely used multi-view gait dataset comprising 124 subjects recorded from 11 viewpoints (angles0 ◦ to180 ◦ in18 ◦ steps). Each subject provides 10 se- quences: six normal walking (NM), two with a coat (CL), and two carrying a bag (BG), totaling 13,640 sequences across all views. Following t...

work page
[5]

CONCLUSION We presented a plug-and-playWavelet Feature Streamthat injects explicit time–frequency dynamics of joint velocities into skeleton-based gait recognition. By transforming per- joint velocities with the continuous wavelet transform and learning multi-scale patterns with a lightweight CNN, our module complements spatial modeling in standard backbo...

work page 2025
[6]

Silhouette analysis-based gait recognition for human identification,

Liang Wang, Tieniu Tan, Huazhong Ning, and Weim- ing Hu, “Silhouette analysis-based gait recognition for human identification,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 12, pp. 1505–1518, 2003

work page 2003
[7]

Individual recognition using gait energy image,

J. Han and Bir Bhanu, “Individual recognition using gait energy image,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 2, pp. 316–322, 2006

work page 2006
[8]

Multimodal fea- ture fusion for CNN-based gait recognition: an empiri- cal comparison,

Francisco M Castro, Manuel J Marin-Jimenez, Nicol ´as Guil, and Nicol´as P´erez de la Blanca, “Multimodal fea- ture fusion for CNN-based gait recognition: an empiri- cal comparison,”Neural Computing and Applications, vol. 32, no. 17, pp. 14173–14193, 2020

work page 2020
[9]

3D local convolutional neural networks for gait recog- nition,

Zhen Huang, Dixiu Xue, Xu Shen, Xinmei Tian, Houqiang Li, Jianqiang Huang, and Xian-Sheng Hua, “3D local convolutional neural networks for gait recog- nition,” inProceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 14920– 14929

work page 2021
[10]

Gaitnet: An end-to-end network for gait based human identification,

Chunfeng Song, Yongzhen Huang, Yan Huang, Ning Jia, and Liang Wang, “Gaitnet: An end-to-end network for gait based human identification,”Pattern recogni- tion, vol. 96, pp. 106988, 2019

work page 2019
[11]

Gaitset: Cross-view gait recogni- tion through utilizing gait as a deep set,

Hanqing Chao, Kun Wang, Yiwei He, Junping Zhang, and Jianfeng Feng, “Gaitset: Cross-view gait recogni- tion through utilizing gait as a deep set,”IEEE transac- tions on pattern analysis and machine intelligence, vol. 44, no. 7, pp. 3467–3478, 2021

work page 2021
[12]

Gaitpart: Temporal part-based model for gait recognition,

Chao Fan, Yunjie Peng, Chunshui Cao, Xu Liu, Sai- hui Hou, Jiannan Chi, Yongzhen Huang, Qing Li, and Zhiqiang He, “Gaitpart: Temporal part-based model for gait recognition,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 14225–14233

work page 2020
[13]

A model-based gait recognition method with body pose and human prior knowledge,

Rijun Liao, Shiqi Yu, Weizhi An, and Yongzhen Huang, “A model-based gait recognition method with body pose and human prior knowledge,”Pattern Recognition, vol. 98, pp. 107069, 2020

work page 2020
[14]

Gaitgraph: Graph convolutional network for skeleton-based gait recognition,

Torben Teepe, Ali Khan, Johannes Gilg, Fabian Her- zog, Stefan H ¨ormann, and Gerhard Rigoll, “Gaitgraph: Graph convolutional network for skeleton-based gait recognition,” in2021 IEEE international conference on image processing (ICIP). IEEE, 2021, pp. 2314–2318

work page 2021
[15]

Towards a deeper un- derstanding of skeleton-based gait recognition,

Torben Teepe, Johannes Gilg, Fabian Herzog, Stefan H¨ormann, and Gerhard Rigoll, “Towards a deeper un- derstanding of skeleton-based gait recognition,” inPro- ceedings of the IEEE/CVF conference on computer vi- sion and pattern recognition, 2022, pp. 1569–1577

work page 2022
[16]

Gaitmixer: skeleton-based gait representation learning via wide-spectrum multi- axial mixer,

Ekkasit Pinyoanuntapong, Ayman Ali, Pu Wang, Min- woo Lee, and Chen Chen, “Gaitmixer: skeleton-based gait representation learning via wide-spectrum multi- axial mixer,” inICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023, pp. 1–5

work page 2023
[17]

Joint angle estimation with wavelet neural networks,

Saaveethya Sivakumar, Alpha Agape Gopalai, King Hann Lim, Darwin Gouwanda, and Sunita Chauhan, “Joint angle estimation with wavelet neural networks,”Scientific reports, vol. 11, no. 1, pp. 10306, 2021

work page 2021
[18]

Appropriate mother wavelets for continuous gait event detection based on time-frequency analysis for hemi- plegic and healthy individuals,

Ning Ji, Hui Zhou, Kaifeng Guo, Oluwarotimi Williams Samuel, Zhen Huang, Lisheng Xu, and Guanglin Li, “Appropriate mother wavelets for continuous gait event detection based on time-frequency analysis for hemi- plegic and healthy individuals,”Sensors, vol. 19, no. 16, pp. 3462, 2019

work page 2019
[19]

Facenet: A unified embedding for face recog- nition and clustering,

Florian Schroff, Dmitry Kalenichenko, and James Philbin, “Facenet: A unified embedding for face recog- nition and clustering,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 815–823

work page 2015
[20]

A framework for evaluating the effect of view angle, clothing and car- rying condition on gait recognition,

Shiqi Yu, Daoliang Tan, and Tieniu Tan, “A framework for evaluating the effect of view angle, clothing and car- rying condition on gait recognition,” in18th Interna- tional Conference on Pattern Recognition (ICPR’06), 2006, vol. 4, pp. 441–444

work page 2006
[21]

Deep high-resolution representation learning for hu- man pose estimation,

Ke Sun, Bin Xiao, Dong Liu, and Jingdong Wang, “Deep high-resolution representation learning for hu- man pose estimation,”2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5686–5696, 2019

work page 2019
[22]

Super- convergence: Very fast training of neural networks us- ing large learning rates,

Leslie N Smith and Nicholay Topin, “Super- convergence: Very fast training of neural networks us- ing large learning rates,” inArtificial intelligence and machine learning for multi-domain operations applica- tions. SPIE, 2019, vol. 11006, pp. 369–386

work page 2019

[1] [1]

Explicit Time-Frequency Dynamics for Skeleton-Based Gait Recognition

INTRODUCTION Gait recognition identifies individuals from their walking pat- terns and is attractive for its non-invasive, long-range nature. Methods are commonly grouped intoappearance-basedand skeleton-basedapproaches. Appearance-based systems learn from silhouettes [1, 2] or pixel intensities [3] and currently re- port state-of-the-art results on CASIA...

work page internal anchor Pith review Pith/arXiv arXiv 2026

[2] [2]

Appearance models currently lead on CASIA-B, while skeleton models have rapidly narrowed the gap with improved spatio–temporal modeling

RELATED WORK Gait recognition identifies individuals from walking pat- terns and is typically approached viaappearance-basedor skeleton-basedmethods. Appearance models currently lead on CASIA-B, while skeleton models have rapidly narrowed the gap with improved spatio–temporal modeling. Appearance-based.Silhouette/pixel methods such as Gait- Net [5], GaitS...

work page

[3] [3]

(1) Abackbone streamutilizes a state-of-the-art skeleton model (e.g., GaitMixer [11]) to encode global spatio-temporal pat- terns from joint sequences

METHOD Our framework consists of two complementary streams. (1) Abackbone streamutilizes a state-of-the-art skeleton model (e.g., GaitMixer [11]) to encode global spatio-temporal pat- terns from joint sequences. (2) The proposedwavelet feature streamexplicitly models motion dynamics by transforming per-joint velocities into time–frequency representations ...

work page

[4] [4]

Experimental Settings Datasets.CASIA-B [15] is a widely used multi-view gait dataset comprising 124 subjects recorded from 11 viewpoints (angles0 ◦ to180 ◦ in18 ◦ steps)

EXPERIMENTS 4.1. Experimental Settings Datasets.CASIA-B [15] is a widely used multi-view gait dataset comprising 124 subjects recorded from 11 viewpoints (angles0 ◦ to180 ◦ in18 ◦ steps). Each subject provides 10 se- quences: six normal walking (NM), two with a coat (CL), and two carrying a bag (BG), totaling 13,640 sequences across all views. Following t...

work page

[5] [5]

CONCLUSION We presented a plug-and-playWavelet Feature Streamthat injects explicit time–frequency dynamics of joint velocities into skeleton-based gait recognition. By transforming per- joint velocities with the continuous wavelet transform and learning multi-scale patterns with a lightweight CNN, our module complements spatial modeling in standard backbo...

work page 2025

[6] [6]

Silhouette analysis-based gait recognition for human identification,

Liang Wang, Tieniu Tan, Huazhong Ning, and Weim- ing Hu, “Silhouette analysis-based gait recognition for human identification,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 12, pp. 1505–1518, 2003

work page 2003

[7] [7]

Individual recognition using gait energy image,

J. Han and Bir Bhanu, “Individual recognition using gait energy image,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 2, pp. 316–322, 2006

work page 2006

[8] [8]

Multimodal fea- ture fusion for CNN-based gait recognition: an empiri- cal comparison,

Francisco M Castro, Manuel J Marin-Jimenez, Nicol ´as Guil, and Nicol´as P´erez de la Blanca, “Multimodal fea- ture fusion for CNN-based gait recognition: an empiri- cal comparison,”Neural Computing and Applications, vol. 32, no. 17, pp. 14173–14193, 2020

work page 2020

[9] [9]

3D local convolutional neural networks for gait recog- nition,

Zhen Huang, Dixiu Xue, Xu Shen, Xinmei Tian, Houqiang Li, Jianqiang Huang, and Xian-Sheng Hua, “3D local convolutional neural networks for gait recog- nition,” inProceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 14920– 14929

work page 2021

[10] [10]

Gaitnet: An end-to-end network for gait based human identification,

Chunfeng Song, Yongzhen Huang, Yan Huang, Ning Jia, and Liang Wang, “Gaitnet: An end-to-end network for gait based human identification,”Pattern recogni- tion, vol. 96, pp. 106988, 2019

work page 2019

[11] [11]

Gaitset: Cross-view gait recogni- tion through utilizing gait as a deep set,

Hanqing Chao, Kun Wang, Yiwei He, Junping Zhang, and Jianfeng Feng, “Gaitset: Cross-view gait recogni- tion through utilizing gait as a deep set,”IEEE transac- tions on pattern analysis and machine intelligence, vol. 44, no. 7, pp. 3467–3478, 2021

work page 2021

[12] [12]

Gaitpart: Temporal part-based model for gait recognition,

Chao Fan, Yunjie Peng, Chunshui Cao, Xu Liu, Sai- hui Hou, Jiannan Chi, Yongzhen Huang, Qing Li, and Zhiqiang He, “Gaitpart: Temporal part-based model for gait recognition,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 14225–14233

work page 2020

[13] [13]

A model-based gait recognition method with body pose and human prior knowledge,

Rijun Liao, Shiqi Yu, Weizhi An, and Yongzhen Huang, “A model-based gait recognition method with body pose and human prior knowledge,”Pattern Recognition, vol. 98, pp. 107069, 2020

work page 2020

[14] [14]

Gaitgraph: Graph convolutional network for skeleton-based gait recognition,

Torben Teepe, Ali Khan, Johannes Gilg, Fabian Her- zog, Stefan H ¨ormann, and Gerhard Rigoll, “Gaitgraph: Graph convolutional network for skeleton-based gait recognition,” in2021 IEEE international conference on image processing (ICIP). IEEE, 2021, pp. 2314–2318

work page 2021

[15] [15]

Towards a deeper un- derstanding of skeleton-based gait recognition,

Torben Teepe, Johannes Gilg, Fabian Herzog, Stefan H¨ormann, and Gerhard Rigoll, “Towards a deeper un- derstanding of skeleton-based gait recognition,” inPro- ceedings of the IEEE/CVF conference on computer vi- sion and pattern recognition, 2022, pp. 1569–1577

work page 2022

[16] [16]

Gaitmixer: skeleton-based gait representation learning via wide-spectrum multi- axial mixer,

Ekkasit Pinyoanuntapong, Ayman Ali, Pu Wang, Min- woo Lee, and Chen Chen, “Gaitmixer: skeleton-based gait representation learning via wide-spectrum multi- axial mixer,” inICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023, pp. 1–5

work page 2023

[17] [17]

Joint angle estimation with wavelet neural networks,

Saaveethya Sivakumar, Alpha Agape Gopalai, King Hann Lim, Darwin Gouwanda, and Sunita Chauhan, “Joint angle estimation with wavelet neural networks,”Scientific reports, vol. 11, no. 1, pp. 10306, 2021

work page 2021

[18] [18]

Appropriate mother wavelets for continuous gait event detection based on time-frequency analysis for hemi- plegic and healthy individuals,

Ning Ji, Hui Zhou, Kaifeng Guo, Oluwarotimi Williams Samuel, Zhen Huang, Lisheng Xu, and Guanglin Li, “Appropriate mother wavelets for continuous gait event detection based on time-frequency analysis for hemi- plegic and healthy individuals,”Sensors, vol. 19, no. 16, pp. 3462, 2019

work page 2019

[19] [19]

Facenet: A unified embedding for face recog- nition and clustering,

Florian Schroff, Dmitry Kalenichenko, and James Philbin, “Facenet: A unified embedding for face recog- nition and clustering,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 815–823

work page 2015

[20] [20]

A framework for evaluating the effect of view angle, clothing and car- rying condition on gait recognition,

Shiqi Yu, Daoliang Tan, and Tieniu Tan, “A framework for evaluating the effect of view angle, clothing and car- rying condition on gait recognition,” in18th Interna- tional Conference on Pattern Recognition (ICPR’06), 2006, vol. 4, pp. 441–444

work page 2006

[21] [21]

Deep high-resolution representation learning for hu- man pose estimation,

Ke Sun, Bin Xiao, Dong Liu, and Jingdong Wang, “Deep high-resolution representation learning for hu- man pose estimation,”2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5686–5696, 2019

work page 2019

[22] [22]

Super- convergence: Very fast training of neural networks us- ing large learning rates,

Leslie N Smith and Nicholay Topin, “Super- convergence: Very fast training of neural networks us- ing large learning rates,” inArtificial intelligence and machine learning for multi-domain operations applica- tions. SPIE, 2019, vol. 11006, pp. 369–386

work page 2019